TOP ACTIVATIONS
MAX = 0.115

They gave some to Sara and Tom and some to themselves
. They explain to Anna and her mom what happened .
it . They hugged Sara and Ben and took them outside
big bowl and gives Lia and Tom each a plate and
scared . They grab Anna and Ben and take them to
, I have grape ear rings . They are round and
and gave them to Sara and Tom . They said thank
about we play with your dolls or your blocks ? And
big event with their mom and dad . There were many
She grabs Sara and Tom . She takes them
. He looks at Sara and Tom . He looks angry
He gives it to Sara and says , " Here ,
was Elsa . Elsa had blonde hair , a blue dress
the cart and take Anna and Ben away . They say
the farm with their mom and dad . They saw many
sad . They told Sara and Tom that they should not
He walked over to Sara and Tom and handed them the
gives the car to Sara and Ben . She says ,
. They yell at Lily and Max and take them out
wanted to play with Sara and Tom more . He wished

MIN ACTIVATIONS
MIN = -0.132

So , the sp arrow flew around the c
you ". The bad ger was very surprised .
and farther , until the oct opus was finally blo oming
The le opard thanked the rabbit and
back . The oct opus asked El sie if
! The oct opus played all day until
So , the smart oct opus thought of a plan
. The oct opus had sn uck away
. The bad creature asked the fairy ,
comfortable . The oct opus felt so relieved and
! The oct opus knew right away that
Finally , the oct opus and his friends managed
. From then on the bad ger stopped taking Jack 's
relax !" The oct opus felt a bit nervous
â The le opard started walking around the
that day on , the oct opus realized that it 's
pushed and pulled on the seaw eed until finally , the
then on , whenever the le opard had an important task
pace ". The oct opus was encouraged by the
now !" The bad creature was so angry that

INTERVAL 0.053 - 0.115
CONTAINS 2.148%

of Tim and his dad with the passport . Then Tim
, c udd ly t eddy bear . When
She took off her hat and looked at her hair .
you can also play with Elsa . She is very nice
He was nice . They waved at him and ran to

INTERVAL -0.009 - 0.053
CONTAINS 61.019%

, wide smile whenever he played . One day
. One day , they saw a big snow man in
buy those bright fireworks ?" Tim my asked . His mom
's try it ," Ben said . He climbed down the
was so excited to look back at all of her adventures

INTERVAL -0.071 - -0.009
CONTAINS 36.629%

wise words , and for the jam . <|endoftext|> There was
" Maybe we can play with the numbers in a different
He saw something colorful and grabbed it . The colorful thing
on top of a big tree ! True didn 't know
you were just playing . But you have to be careful

INTERVAL -0.132 - -0.071
CONTAINS 0.204%

a cute face . The little girl was so happy to
promised to never forget the power of the golden thread .
It flew away but the good man felt happy . He
and the fox visited the o asis every day to play
From then on , the lazy cater pillar would show his
Token They
Token ID1119
Feature activation+2.61e-02

Pos logit contributions
 cliff+0.021
www+0.019
fire+0.019
 brew+0.019
 plunge+0.019
Neg logit contributions
 hugged-0.014
 played-0.013
 named-0.013
 loved-0.013
 both-0.012
Token gave
Token ID2921
Feature activation-2.59e-03

Pos logit contributions
 sincere+0.025
 His+0.023
 named+0.021
 hugged+0.021
 Her+0.021
Neg logit contributions
 cliff-0.063
fire-0.059
works-0.059
www-0.058
inner-0.058
Token some
Token ID617
Feature activation+1.13e-02

Pos logit contributions
 cliff+0.007
 plunge+0.006
www+0.006
works+0.006
fire+0.006
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.002
 loved-0.002
 both-0.002
Token to
Token ID284
Feature activation-1.88e-02

Pos logit contributions
 hugged+0.018
 named+0.017
 played+0.017
 loved+0.016
 both+0.016
Neg logit contributions
 cliff-0.022
 plunge-0.021
www-0.021
works-0.020
fire-0.020
Token Sara
Token ID24799
Feature activation+3.66e-02

Pos logit contributions
 cliff+0.050
www+0.048
works+0.047
fire+0.047
 plunge+0.047
Neg logit contributions
 hugged-0.017
 both-0.017
 named-0.017
 played-0.016
 loved-0.015
Token and
Token ID290
Feature activation+1.15e-01

Pos logit contributions
 sincere+0.026
 hugged+0.022
 played+0.020
 His+0.019
 named+0.017
Neg logit contributions
 cliff-0.096
fire-0.096
 brew-0.092
works-0.091
www-0.090
Token Tom
Token ID4186
Feature activation+4.33e-02

Pos logit contributions
 sincere+0.091
 loved+0.090
 hugged+0.088
 named+0.087
 played+0.082
Neg logit contributions
 cliff-0.286
www-0.274
works-0.270
fire-0.270
 plunge-0.270
Token and
Token ID290
Feature activation+1.06e-01

Pos logit contributions
 sincere+0.031
 hugged+0.021
 named+0.021
 His+0.020
 played+0.017
Neg logit contributions
 cliff-0.116
fire-0.115
inner-0.112
 brew-0.111
 plunge-0.110
Token some
Token ID617
Feature activation+2.16e-02

Pos logit contributions
 hugged+0.088
 loved+0.088
 sincere+0.088
 named+0.087
 both+0.085
Neg logit contributions
 cliff-0.262
www-0.252
works-0.248
fire-0.247
 plunge-0.245
Token to
Token ID284
Feature activation-8.76e-03

Pos logit contributions
 hugged+0.032
 named+0.032
 played+0.031
 loved+0.030
 both+0.030
Neg logit contributions
 cliff-0.043
 plunge-0.040
fire-0.040
works-0.040
www-0.040
Token themselves
Token ID2405
Feature activation+3.70e-02

Pos logit contributions
 cliff+0.020
www+0.018
 plunge+0.018
fire+0.018
works+0.018
Neg logit contributions
 hugged-0.011
 named-0.011
 played-0.011
 loved-0.011
 both-0.010
Token.
Token ID13
Feature activation+6.53e-03

Pos logit contributions
 hugged+0.006
 named+0.006
 played+0.005
 both+0.005
 loved+0.005
Neg logit contributions
 cliff-0.044
www-0.042
 plunge-0.041
works-0.041
fire-0.041
Token They
Token ID1119
Feature activation+2.81e-02

Pos logit contributions
 hugged+0.005
 named+0.005
 played+0.005
 both+0.005
 loved+0.005
Neg logit contributions
 cliff-0.018
www-0.017
fire-0.017
works-0.017
 plunge-0.017
Token explain
Token ID4727
Feature activation+1.17e-02

Pos logit contributions
 named+0.035
 hugged+0.034
 His+0.034
 played+0.033
 loved+0.033
Neg logit contributions
 cliff-0.062
 plunge-0.056
works-0.056
 brew-0.056
fire-0.056
Token to
Token ID284
Feature activation-1.33e-02

Pos logit contributions
 hugged+0.011
 played+0.011
 named+0.011
 both+0.010
 loved+0.010
Neg logit contributions
 cliff-0.029
works-0.028
fire-0.027
www-0.027
 plunge-0.027
Token Anna
Token ID11735
Feature activation+4.81e-02

Pos logit contributions
 cliff+0.034
www+0.032
works+0.032
 plunge+0.032
fire+0.032
Neg logit contributions
 hugged-0.013
 named-0.012
 played-0.012
 loved-0.012
 both-0.012
Token and
Token ID290
Feature activation+1.13e-01

Pos logit contributions
 sincere+0.040
 played+0.035
 His+0.034
 swapped+0.032
 pockets+0.032
Neg logit contributions
 outage-0.113
 cliff-0.113
fire-0.113
works-0.111
 brew-0.107
Token her
Token ID607
Feature activation+4.62e-02

Pos logit contributions
 hugged+0.085
 named+0.080
 played+0.077
 both+0.075
 loved+0.074
Neg logit contributions
 cliff-0.318
www-0.301
works-0.299
 plunge-0.299
fire-0.298
Token mom
Token ID1995
Feature activation+5.59e-02

Pos logit contributions
 hugged+0.039
 played+0.038
 named+0.038
 both+0.036
 His+0.036
Neg logit contributions
 cliff-0.121
works-0.114
fire-0.111
 outage-0.110
 plunge-0.109
Token what
Token ID644
Feature activation-9.98e-04

Pos logit contributions
 His+0.034
 played+0.033
 hugged+0.032
 named+0.032
 sincere+0.030
Neg logit contributions
works-0.148
 cliff-0.145
fire-0.142
 brew-0.140
 plunge-0.138
Token happened
Token ID3022
Feature activation-3.08e-03

Pos logit contributions
 cliff+0.002
 plunge+0.002
www+0.002
fire+0.002
works+0.002
Neg logit contributions
 hugged-0.001
 named-0.001
 played-0.001
 loved-0.001
 both-0.001
Token.
Token ID13
Feature activation+6.38e-03

Pos logit contributions
 cliff+0.009
fire+0.008
works+0.008
 plunge+0.008
 outage+0.008
Neg logit contributions
 hugged-0.002
 played-0.002
 named-0.002
 His-0.001
 both-0.001
Token it
Token ID340
Feature activation-1.55e-02

Pos logit contributions
 hugged+0.030
 played+0.029
 named+0.029
 learned+0.028
 loved+0.028
Neg logit contributions
 cliff-0.065
 plunge-0.060
 brew-0.060
fire-0.059
works-0.059
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
 cliff+0.053
www+0.050
 plunge+0.050
works+0.050
fire+0.049
Neg logit contributions
 hugged-0.003
 named-0.002
 both-0.002
 played-0.002
 loved-0.002
Token They
Token ID1119
Feature activation+2.76e-02

Pos logit contributions
 hugged+0.009
 named+0.008
 played+0.008
 both+0.008
 loved+0.008
Neg logit contributions
 cliff-0.035
www-0.033
fire-0.033
works-0.033
 plunge-0.033
Token hugged
Token ID41130
Feature activation+3.98e-02

Pos logit contributions
 His+0.028
 sincere+0.026
 named+0.024
 his+0.024
 loved+0.023
Neg logit contributions
 cliff-0.066
fire-0.062
 brew-0.060
 stairs-0.060
works-0.060
Token Sara
Token ID24799
Feature activation+5.89e-02

Pos logit contributions
 played+0.051
 named+0.050
 sincere+0.050
 hugged+0.050
 learned+0.049
Neg logit contributions
 cliff-0.084
 plunge-0.080
 brew-0.076
www-0.073
works-0.073
Token and
Token ID290
Feature activation+1.12e-01

Pos logit contributions
 sincere+0.067
 played+0.053
 His+0.052
 swapped+0.048
 traded+0.047
Neg logit contributions
fire-0.131
 cliff-0.129
works-0.124
www-0.122
 outage-0.120
Token Ben
Token ID3932
Feature activation+5.15e-02

Pos logit contributions
 loved+0.105
 named+0.103
 sincere+0.101
 His+0.097
 hugged+0.097
Neg logit contributions
 cliff-0.262
www-0.243
 plunge-0.243
 brew-0.239
fire-0.236
Token and
Token ID290
Feature activation+1.02e-01

Pos logit contributions
 sincere+0.055
 His+0.049
 played+0.045
 named+0.044
 swapped+0.043
Neg logit contributions
fire-0.110
 cliff-0.109
 brew-0.100
 plunge-0.100
 stairs-0.099
Token took
Token ID1718
Feature activation-9.24e-03

Pos logit contributions
 named+0.083
 loved+0.082
 hugged+0.082
 both+0.079
 played+0.078
Neg logit contributions
 cliff-0.260
www-0.245
 plunge-0.241
fire-0.241
 brew-0.240
Token them
Token ID606
Feature activation+3.31e-02

Pos logit contributions
 cliff+0.024
 plunge+0.022
www+0.022
works+0.022
fire+0.022
Neg logit contributions
 hugged-0.009
 named-0.009
 played-0.008
 loved-0.008
 both-0.008
Token outside
Token ID2354
Feature activation+2.41e-02

Pos logit contributions
 sincere+0.052
 loved+0.048
 named+0.048
 liked+0.048
 hugged+0.048
Neg logit contributions
 cliff-0.061
fire-0.061
 plunge-0.060
works-0.058
 stairs-0.058
Token big
Token ID1263
Feature activation+7.44e-03

Pos logit contributions
 hugged+0.019
 played+0.019
 named+0.019
 loved+0.018
 both+0.018
Neg logit contributions
 cliff-0.022
fire-0.020
www-0.020
 plunge-0.019
 brew-0.019
Token bowl
Token ID9396
Feature activation+1.32e-02

Pos logit contributions
 hugged+0.013
 loved+0.013
 played+0.013
 named+0.013
 both+0.013
Neg logit contributions
 cliff-0.013
www-0.011
 plunge-0.011
 brew-0.011
fire-0.011
Token and
Token ID290
Feature activation+5.39e-02

Pos logit contributions
 sincere+0.009
 hugged+0.008
 played+0.008
 loved+0.007
 His+0.007
Neg logit contributions
 cliff-0.037
fire-0.036
inner-0.034
 brew-0.034
 stairs-0.034
Token gives
Token ID3607
Feature activation-5.81e-03

Pos logit contributions
 hugged+0.071
 played+0.067
 both+0.066
 named+0.065
 loved+0.064
Neg logit contributions
 cliff-0.126
works-0.118
www-0.117
 plunge-0.117
fire-0.117
Token Lia
Token ID49520
Feature activation+2.52e-02

Pos logit contributions
 cliff+0.015
www+0.014
works+0.014
 plunge+0.014
fire+0.014
Neg logit contributions
 hugged-0.005
 named-0.005
 played-0.005
 both-0.005
 loved-0.005
Token and
Token ID290
Feature activation+1.10e-01

Pos logit contributions
 sincere+0.025
 hugged+0.023
 played+0.022
 His+0.022
 loved+0.021
Neg logit contributions
 cliff-0.061
fire-0.058
 brew-0.056
inner-0.056
works-0.056
Token Tom
Token ID4186
Feature activation+2.71e-02

Pos logit contributions
 named+0.094
 hugged+0.092
 loved+0.090
 sincere+0.090
 played+0.082
Neg logit contributions
 cliff-0.279
 plunge-0.263
www-0.263
works-0.260
 brew-0.259
Token each
Token ID1123
Feature activation+5.72e-02

Pos logit contributions
 sincere+0.031
 named+0.028
 hugged+0.027
 His+0.025
 loved+0.023
Neg logit contributions
 cliff-0.061
 brew-0.058
inner-0.058
fire-0.058
 plunge-0.057
Token a
Token ID257
Feature activation+4.35e-03

Pos logit contributions
 sincere+0.055
 hugged+0.052
 named+0.051
 played+0.048
 loved+0.043
Neg logit contributions
 cliff-0.139
 brew-0.136
www-0.135
inner-0.135
 plunge-0.135
Token plate
Token ID7480
Feature activation+1.83e-02

Pos logit contributions
 hugged+0.006
 named+0.006
 played+0.006
 both+0.006
 loved+0.006
Neg logit contributions
 cliff-0.009
www-0.008
 plunge-0.008
fire-0.008
works-0.008
Token and
Token ID290
Feature activation+7.94e-02

Pos logit contributions
 sincere+0.017
 played+0.015
 loved+0.015
 His+0.015
 named+0.015
Neg logit contributions
fire-0.044
 cliff-0.042
inner-0.042
 stairs-0.040
 outage-0.040
Token scared
Token ID12008
Feature activation-1.14e-03

Pos logit contributions
 hugged+0.024
 named+0.023
 played+0.022
 both+0.022
 loved+0.021
Neg logit contributions
 cliff-0.070
www-0.066
 plunge-0.066
works-0.065
fire-0.065
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
 cliff+0.003
works+0.003
fire+0.003
 brew+0.003
www+0.003
Neg logit contributions
 hugged-0.001
 named-0.001
 played-0.001
 loved-0.000
 learned-0.000
Token They
Token ID1119
Feature activation+1.82e-02

Pos logit contributions
 hugged+0.009
 named+0.009
 played+0.008
 both+0.008
 loved+0.008
Neg logit contributions
 cliff-0.034
www-0.032
works-0.032
 plunge-0.032
fire-0.032
Token grab
Token ID5552
Feature activation-1.54e-02

Pos logit contributions
 hugged+0.025
 named+0.024
 both+0.023
 played+0.023
 loved+0.023
Neg logit contributions
 cliff-0.041
www-0.039
works-0.038
fire-0.038
 plunge-0.038
Token Anna
Token ID11735
Feature activation+5.54e-02

Pos logit contributions
 cliff+0.041
www+0.039
works+0.039
 plunge+0.039
fire+0.039
Neg logit contributions
 hugged-0.014
 named-0.013
 played-0.013
 loved-0.012
 both-0.012
Token and
Token ID290
Feature activation+1.10e-01

Pos logit contributions
 sincere+0.037
 His+0.031
 hugged+0.031
 played+0.030
 loved+0.030
Neg logit contributions
 cliff-0.153
fire-0.147
 plunge-0.144
 brew-0.144
 stair-0.142
Token Ben
Token ID3932
Feature activation+4.41e-02

Pos logit contributions
 hugged+0.083
 named+0.082
 loved+0.079
 played+0.078
 learned+0.071
Neg logit contributions
 cliff-0.302
 plunge-0.284
www-0.282
 brew-0.281
works-0.280
Token and
Token ID290
Feature activation+1.00e-01

Pos logit contributions
 sincere+0.029
 hugged+0.026
 loved+0.025
 named+0.025
 His+0.025
Neg logit contributions
 cliff-0.118
fire-0.115
 plunge-0.114
 brew-0.111
 outage-0.109
Token take
Token ID1011
Feature activation-2.37e-02

Pos logit contributions
 hugged+0.083
 named+0.077
 played+0.076
 both+0.074
 loved+0.072
Neg logit contributions
 cliff-0.277
www-0.262
works-0.261
fire-0.260
 plunge-0.260
Token them
Token ID606
Feature activation+2.92e-02

Pos logit contributions
 cliff+0.063
www+0.060
works+0.059
fire+0.059
 plunge+0.058
Neg logit contributions
 hugged-0.023
 both-0.021
 named-0.021
 played-0.021
 loved-0.020
Token to
Token ID284
Feature activation-8.43e-03

Pos logit contributions
 sincere+0.044
 hugged+0.044
 loved+0.044
 named+0.043
 liked+0.042
Neg logit contributions
 cliff-0.056
 plunge-0.053
fire-0.053
 brew-0.050
works-0.050
Token,
Token ID11
Feature activation+2.16e-02

Pos logit contributions
 hugged+0.029
 sincere+0.026
 played+0.025
 both+0.024
 loved+0.023
Neg logit contributions
 cliff-0.107
www-0.105
fire-0.103
works-0.102
 brew-0.098
Token I
Token ID314
Feature activation+2.52e-02

Pos logit contributions
 hugged+0.033
 named+0.032
 played+0.032
 both+0.031
 loved+0.031
Neg logit contributions
 cliff-0.044
fire-0.041
works-0.041
www-0.041
 plunge-0.040
Token have
Token ID423
Feature activation+3.65e-02

Pos logit contributions
 hugged+0.024
 named+0.022
 played+0.021
 both+0.021
 loved+0.021
Neg logit contributions
 cliff-0.066
www-0.062
fire-0.061
 plunge-0.061
works-0.061
Token grape
Token ID30777
Feature activation+3.02e-02

Pos logit contributions
 hugged+0.035
 named+0.033
 played+0.031
 loved+0.031
 His+0.030
Neg logit contributions
 cliff-0.093
www-0.088
 plunge-0.086
works-0.085
 brew-0.084
Token ear
Token ID1027
Feature activation+4.06e-02

Pos logit contributions
 hugged+0.039
 named+0.038
 both+0.038
 played+0.038
 and+0.036
Neg logit contributions
 cliff-0.069
 plunge-0.065
works-0.064
fire-0.064
www-0.064
Tokenrings
Token ID33173
Feature activation+1.10e-01

Pos logit contributions
 hugged+0.051
 named+0.049
 played+0.048
 both+0.048
 loved+0.047
Neg logit contributions
 cliff-0.094
www-0.087
fire-0.087
 plunge-0.087
works-0.087
Token.
Token ID13
Feature activation+6.19e-03

Pos logit contributions
 hugged+0.063
 named+0.058
 played+0.055
 loved+0.053
 both+0.051
Neg logit contributions
 cliff-0.323
www-0.306
fire-0.305
works-0.305
 plunge-0.303
Token They
Token ID1119
Feature activation+4.18e-02

Pos logit contributions
 hugged+0.008
 played+0.008
 named+0.007
 loved+0.007
 both+0.007
Neg logit contributions
 cliff-0.014
www-0.013
works-0.013
fire-0.013
 plunge-0.013
Token are
Token ID389
Feature activation+2.85e-02

Pos logit contributions
 hugged+0.042
 named+0.040
 played+0.039
 both+0.039
 loved+0.038
Neg logit contributions
 cliff-0.107
www-0.101
fire-0.100
works-0.100
 plunge-0.100
Token round
Token ID2835
Feature activation+6.10e-02

Pos logit contributions
 hugged+0.035
 played+0.033
 named+0.033
 His+0.032
 loved+0.032
Neg logit contributions
 cliff-0.064
www-0.060
works-0.059
 brew-0.058
fire-0.058
Token and
Token ID290
Feature activation+5.07e-02

Pos logit contributions
 hugged+0.026
 named+0.021
 played+0.020
 loved+0.018
 both+0.017
Neg logit contributions
 cliff-0.190
www-0.180
works-0.180
fire-0.179
 plunge-0.177
Token and
Token ID290
Feature activation+5.35e-02

Pos logit contributions
 named+0.063
 hugged+0.063
 played+0.059
 loved+0.059
 learned+0.057
Neg logit contributions
 cliff-0.093
 plunge-0.084
 brew-0.084
fire-0.084
works-0.081
Token gave
Token ID2921
Feature activation-3.31e-03

Pos logit contributions
 hugged+0.052
 named+0.051
 loved+0.048
 both+0.048
 played+0.048
Neg logit contributions
 cliff-0.134
works-0.124
fire-0.123
 plunge-0.123
www-0.123
Token them
Token ID606
Feature activation+4.10e-02

Pos logit contributions
 cliff+0.008
 plunge+0.008
www+0.008
works+0.008
fire+0.008
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 loved-0.003
 learned-0.003
Token to
Token ID284
Feature activation-3.67e-03

Pos logit contributions
 named+0.047
 sincere+0.047
 hugged+0.044
 loved+0.041
 learned+0.040
Neg logit contributions
 cliff-0.093
 plunge-0.089
works-0.085
inner-0.084
fire-0.083
Token Sara
Token ID24799
Feature activation+3.38e-02

Pos logit contributions
 cliff+0.010
 plunge+0.009
 brew+0.009
fire+0.009
works+0.009
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 loved-0.003
 learned-0.003
Token and
Token ID290
Feature activation+1.09e-01

Pos logit contributions
 sincere+0.024
 hugged+0.023
 played+0.021
 loved+0.019
 learned+0.019
Neg logit contributions
 cliff-0.088
fire-0.086
 brew-0.083
works-0.083
 plunge-0.081
Token Tom
Token ID4186
Feature activation+3.47e-02

Pos logit contributions
 loved+0.080
 named+0.080
 hugged+0.080
 sincere+0.076
 played+0.072
Neg logit contributions
 cliff-0.286
www-0.267
works-0.265
fire-0.265
 plunge-0.264
Token.
Token ID13
Feature activation+6.12e-03

Pos logit contributions
 sincere+0.027
 named+0.022
 His+0.020
 hugged+0.020
 okay+0.018
Neg logit contributions
 cliff-0.081
fire-0.081
 brew-0.079
 plunge-0.079
works-0.076
Token They
Token ID1119
Feature activation+2.31e-02

Pos logit contributions
 hugged+0.006
 played+0.006
 named+0.006
 loved+0.006
 learned+0.005
Neg logit contributions
 cliff-0.015
www-0.014
 brew-0.014
fire-0.014
 stairs-0.013
Token said
Token ID531
Feature activation-1.13e-02

Pos logit contributions
 His+0.022
 okay+0.021
 sincere+0.021
 OK+0.020
 Her+0.019
Neg logit contributions
 cliff-0.054
works-0.048
 casc-0.048
fire-0.048
 plunge-0.048
Token thank
Token ID5875
Feature activation+6.83e-04

Pos logit contributions
 cliff+0.026
 plunge+0.025
www+0.024
 inconvenience+0.023
works+0.023
Neg logit contributions
 hugged-0.012
 named-0.012
 both-0.011
 played-0.011
 loved-0.011
Token about
Token ID546
Feature activation+2.01e-04

Pos logit contributions
 cliff+0.103
works+0.098
inner+0.097
www+0.096
 plunge+0.096
Neg logit contributions
 hugged-0.043
 named-0.041
 played-0.038
 both-0.036
 sincere-0.035
Token we
Token ID356
Feature activation+2.06e-02

Pos logit contributions
 named+0.000
 both+0.000
 hugged+0.000
 played+0.000
 his+0.000
Neg logit contributions
 cliff-0.000
www-0.000
inner-0.000
umm-0.000
works-0.000
Token play
Token ID711
Feature activation+1.23e-02

Pos logit contributions
 hugged+0.027
 named+0.025
 loved+0.024
 played+0.024
 His+0.024
Neg logit contributions
 cliff-0.045
 plunge-0.042
works-0.042
fire-0.042
 brew-0.041
Token with
Token ID351
Feature activation+1.18e-02

Pos logit contributions
 hugged+0.011
 played+0.010
 named+0.010
 loved+0.010
 both+0.010
Neg logit contributions
 cliff-0.032
fire-0.030
works-0.030
 plunge-0.030
 brew-0.029
Token your
Token ID534
Feature activation+5.14e-02

Pos logit contributions
 hugged+0.015
 named+0.014
 played+0.014
 both+0.014
 loved+0.014
Neg logit contributions
 cliff-0.027
www-0.026
works-0.025
 plunge-0.025
fire-0.025
Token dolls
Token ID36062
Feature activation+1.09e-01

Pos logit contributions
 hugged+0.057
 named+0.054
 His+0.051
 played+0.051
 learned+0.050
Neg logit contributions
 cliff-0.126
 plunge-0.117
works-0.115
 stair-0.115
 stairs-0.114
Token or
Token ID393
Feature activation+6.89e-02

Pos logit contributions
 hugged+0.154
 His+0.146
 loved+0.142
 named+0.141
 sincere+0.140
Neg logit contributions
 cliff-0.222
works-0.206
fire-0.206
 brew-0.198
www-0.197
Token your
Token ID534
Feature activation+5.80e-02

Pos logit contributions
 hugged+0.075
 named+0.072
 played+0.072
 both+0.070
 loved+0.068
Neg logit contributions
 cliff-0.172
www-0.163
fire-0.160
 plunge-0.160
works-0.160
Token blocks
Token ID7021
Feature activation+8.15e-02

Pos logit contributions
 hugged+0.066
 named+0.061
 loved+0.059
 His+0.059
 played+0.059
Neg logit contributions
 cliff-0.140
 plunge-0.129
works-0.128
fire-0.126
 stair-0.126
Token?
Token ID30
Feature activation+8.93e-03

Pos logit contributions
 hugged+0.126
 named+0.119
 played+0.118
 loved+0.117
 both+0.114
Neg logit contributions
 cliff-0.159
works-0.147
fire-0.145
www-0.144
inner-0.142
Token And
Token ID843
Feature activation-2.86e-02

Pos logit contributions
 hugged+0.013
 played+0.012
 named+0.012
 loved+0.012
 both+0.012
Neg logit contributions
 cliff-0.019
works-0.018
fire-0.018
www-0.017
 plunge-0.017
Token big
Token ID1263
Feature activation-1.54e-02

Pos logit contributions
 named+0.002
 played+0.002
 loved+0.002
 hugged+0.002
 both+0.002
Neg logit contributions
works-0.004
eway-0.003
 cliff-0.003
iders-0.003
 plunge-0.003
Token event
Token ID1785
Feature activation-2.94e-02

Pos logit contributions
works+0.026
eway+0.026
fire+0.025
iders+0.025
 plunge+0.025
Neg logit contributions
 named-0.029
 played-0.028
 and-0.027
 hugged-0.027
 loved-0.027
Token with
Token ID351
Feature activation+5.33e-02

Pos logit contributions
 cliff+0.091
www+0.086
works+0.086
 plunge+0.086
fire+0.085
Neg logit contributions
 hugged-0.014
 named-0.014
 played-0.013
 both-0.012
 loved-0.012
Token their
Token ID511
Feature activation+6.49e-02

Pos logit contributions
 named+0.041
 hugged+0.041
 still+0.037
 learned+0.037
 loved+0.036
Neg logit contributions
 cliff-0.135
www-0.133
 plunge-0.130
works-0.130
 stairs-0.129
Token mom
Token ID1995
Feature activation+5.66e-02

Pos logit contributions
 hugged+0.044
 named+0.041
 played+0.040
 both+0.039
 loved+0.038
Neg logit contributions
 cliff-0.187
www-0.178
works-0.177
 plunge-0.176
fire-0.176
Token and
Token ID290
Feature activation+1.08e-01

Pos logit contributions
 sincere+0.047
 hugged+0.041
 learned+0.038
 played+0.035
 forg+0.035
Neg logit contributions
works-0.137
 cliff-0.135
www-0.134
 stairs-0.133
 site-0.131
Token dad
Token ID9955
Feature activation+5.29e-02

Pos logit contributions
 hugged+0.092
 sincere+0.090
 named+0.088
 played+0.083
 both+0.083
Neg logit contributions
 cliff-0.271
www-0.261
 plunge-0.257
works-0.250
inner-0.247
Token.
Token ID13
Feature activation+2.44e-02

Pos logit contributions
 sincere+0.043
 hugged+0.036
 learned+0.033
 eyes+0.031
 forg+0.031
Neg logit contributions
 cliff-0.129
 plunge-0.129
www-0.125
works-0.125
 stairs-0.125
Token There
Token ID1318
Feature activation+1.21e-02

Pos logit contributions
 sincere+0.032
 hugged+0.031
 forg+0.031
 learned+0.030
 named+0.028
Neg logit contributions
 stairs-0.050
 stair-0.048
 cliff-0.048
inner-0.047
works-0.047
Token were
Token ID547
Feature activation-1.98e-02

Pos logit contributions
 hugged+0.011
 named+0.010
 played+0.010
 both+0.010
 loved+0.009
Neg logit contributions
 cliff-0.032
 plunge-0.030
www-0.030
works-0.030
fire-0.030
Token many
Token ID867
Feature activation+9.98e-03

Pos logit contributions
 cliff+0.054
fire+0.052
www+0.051
 brew+0.051
works+0.051
Neg logit contributions
 hugged-0.017
 both-0.017
 played-0.017
 loved-0.016
 named-0.016
Token
Token ID198
Feature activation+4.61e-03

Pos logit contributions
 hugged+0.027
 played+0.026
 named+0.024
 loved+0.023
 both+0.023
Neg logit contributions
 cliff-0.100
www-0.094
 plunge-0.092
fire-0.092
 brew-0.091
Token
Token ID198
Feature activation-9.52e-03

Pos logit contributions
 hugged+0.003
 played+0.003
 named+0.003
 both+0.003
 loved+0.003
Neg logit contributions
 cliff-0.013
www-0.012
fire-0.012
 plunge-0.012
 brew-0.012
TokenShe
Token ID3347
Feature activation+2.05e-02

Pos logit contributions
 cliff+0.019
www+0.018
fire+0.017
 plunge+0.017
works+0.017
Neg logit contributions
 hugged-0.015
 played-0.015
 named-0.014
 both-0.014
 loved-0.014
Token grabs
Token ID22378
Feature activation+4.49e-03

Pos logit contributions
 hugged+0.029
 named+0.028
 played+0.028
 both+0.027
 loved+0.027
Neg logit contributions
 cliff-0.043
 plunge-0.040
www-0.040
works-0.040
 brew-0.039
Token Sara
Token ID24799
Feature activation+4.55e-02

Pos logit contributions
 hugged+0.004
 named+0.004
 played+0.003
 both+0.003
 loved+0.003
Neg logit contributions
 cliff-0.012
www-0.012
works-0.012
 plunge-0.012
fire-0.011
Token and
Token ID290
Feature activation+1.08e-01

Pos logit contributions
 sincere+0.031
 played+0.030
 named+0.029
 hugged+0.029
 His+0.027
Neg logit contributions
 cliff-0.121
fire-0.116
 plunge-0.114
www-0.112
 brew-0.112
Token Tom
Token ID4186
Feature activation+4.99e-02

Pos logit contributions
 named+0.078
 hugged+0.078
 loved+0.077
 played+0.072
 sincere+0.069
Neg logit contributions
 cliff-0.294
 plunge-0.278
www-0.277
 brew-0.272
works-0.270
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
 sincere+0.050
 named+0.044
 hugged+0.042
 His+0.042
 played+0.041
Neg logit contributions
 cliff-0.114
 plunge-0.112
fire-0.111
 stairs-0.108
 brew-0.107
Token She
Token ID1375
Feature activation+3.60e-02

Pos logit contributions
 hugged+0.011
 played+0.010
 named+0.010
 loved+0.010
 both+0.009
Neg logit contributions
 cliff-0.030
www-0.029
fire-0.028
works-0.028
 brew-0.028
Token takes
Token ID2753
Feature activation-1.11e-02

Pos logit contributions
 hugged+0.051
 named+0.050
 played+0.050
 both+0.049
 loved+0.049
Neg logit contributions
 cliff-0.074
www-0.068
 plunge-0.068
works-0.067
 brew-0.067
Token them
Token ID606
Feature activation+3.27e-02

Pos logit contributions
 cliff+0.030
www+0.029
works+0.029
fire+0.028
 plunge+0.028
Neg logit contributions
 hugged-0.010
 both-0.009
 named-0.009
 played-0.009
 his-0.008
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
 named+0.028
 hugged+0.028
 played+0.026
 loved+0.023
 His+0.022
Neg logit contributions
 cliff-0.146
fire-0.138
 site-0.136
works-0.136
 plunge-0.135
Token He
Token ID679
Feature activation+2.02e-02

Pos logit contributions
 hugged+0.012
 played+0.011
 named+0.011
 loved+0.011
 both+0.011
Neg logit contributions
 cliff-0.043
fire-0.040
www-0.040
 plunge-0.040
works-0.039
Token looks
Token ID3073
Feature activation+1.95e-02

Pos logit contributions
 hugged+0.028
 named+0.028
 played+0.027
 both+0.026
 loved+0.026
Neg logit contributions
 cliff-0.042
works-0.039
www-0.039
 plunge-0.038
 brew-0.038
Token at
Token ID379
Feature activation+2.12e-02

Pos logit contributions
 played+0.019
ossible+0.018
 named+0.017
 Her+0.017
 hugged+0.017
Neg logit contributions
 cliff-0.041
 plunge-0.040
www-0.038
 brew-0.037
 casc-0.037
Token Sara
Token ID24799
Feature activation+3.98e-02

Pos logit contributions
 hugged+0.021
 named+0.020
 played+0.019
 loved+0.019
 both+0.019
Neg logit contributions
 cliff-0.055
www-0.051
works-0.051
 plunge-0.051
fire-0.051
Token and
Token ID290
Feature activation+1.08e-01

Pos logit contributions
 sincere+0.026
 hugged+0.025
 played+0.024
 named+0.024
 His+0.021
Neg logit contributions
 cliff-0.108
fire-0.102
 site-0.099
 brew-0.099
 stair-0.098
Token Tom
Token ID4186
Feature activation+3.96e-02

Pos logit contributions
 hugged+0.088
 named+0.086
 played+0.081
 loved+0.080
 learned+0.077
Neg logit contributions
 cliff-0.288
 plunge-0.268
www-0.267
 brew-0.264
works-0.262
Token.
Token ID13
Feature activation+2.35e-02

Pos logit contributions
 named+0.032
 hugged+0.029
 sincere+0.028
 played+0.027
 learned+0.025
Neg logit contributions
 cliff-0.098
fire-0.092
 brew-0.092
 plunge-0.091
 site-0.090
Token He
Token ID679
Feature activation+2.34e-02

Pos logit contributions
 hugged+0.018
 named+0.017
 played+0.017
 loved+0.016
 both+0.016
Neg logit contributions
 cliff-0.065
www-0.061
fire-0.060
works-0.060
 plunge-0.060
Token looks
Token ID3073
Feature activation+2.10e-02

Pos logit contributions
 hugged+0.033
 named+0.033
 played+0.032
 both+0.031
 loved+0.031
Neg logit contributions
 cliff-0.048
works-0.044
www-0.044
 plunge-0.043
 brew-0.043
Token angry
Token ID7954
Feature activation+3.68e-02

Pos logit contributions
 played+0.021
ossible+0.020
 named+0.019
 hugged+0.019
 learned+0.019
Neg logit contributions
 cliff-0.044
 plunge-0.043
www-0.041
 brew-0.040
 casc-0.040
Token He
Token ID679
Feature activation+1.66e-02

Pos logit contributions
 hugged+0.017
 played+0.016
 named+0.015
 both+0.015
 learned+0.015
Neg logit contributions
 cliff-0.047
fire-0.044
www-0.044
 site-0.044
 stairs-0.043
Token gives
Token ID3607
Feature activation+1.58e-03

Pos logit contributions
 named+0.026
 hugged+0.025
 played+0.024
 His+0.024
 both+0.024
Neg logit contributions
 cliff-0.032
 brew-0.028
works-0.028
 plunge-0.028
www-0.027
Token it
Token ID340
Feature activation+1.59e-02

Pos logit contributions
 hugged+0.002
 named+0.002
 played+0.002
 loved+0.001
 sincere+0.001
Neg logit contributions
 cliff-0.004
 plunge-0.004
www-0.004
works-0.004
 brew-0.004
Token to
Token ID284
Feature activation+4.96e-04

Pos logit contributions
 sincere+0.020
 named+0.019
 hugged+0.018
 loved+0.017
 learned+0.017
Neg logit contributions
 cliff-0.035
 plunge-0.034
fire-0.032
inner-0.032
works-0.032
Token Sara
Token ID24799
Feature activation+3.12e-02

Pos logit contributions
 hugged+0.000
 played+0.000
 named+0.000
 loved+0.000
 learned+0.000
Neg logit contributions
 cliff-0.001
www-0.001
 brew-0.001
 plunge-0.001
fire-0.001
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 sincere+0.027
 played+0.024
 hugged+0.023
 swapped+0.021
 learned+0.020
Neg logit contributions
fire-0.074
 cliff-0.073
 brew-0.071
works-0.069
 site-0.067
Token says
Token ID1139
Feature activation+4.53e-03

Pos logit contributions
 named+0.103
 loved+0.099
 hugged+0.098
 played+0.097
 both+0.093
Neg logit contributions
 cliff-0.259
 plunge-0.238
 brew-0.237
www-0.236
 stairs-0.229
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
 played+0.004
 named+0.004
 hugged+0.004
 learned+0.004
 both+0.004
Neg logit contributions
 cliff-0.011
 plunge-0.010
www-0.010
works-0.010
inner-0.010
Token "
Token ID366
Feature activation-1.70e-02

Pos logit contributions
 played+0.019
 named+0.018
 learned+0.018
 hugged+0.018
 both+0.018
Neg logit contributions
 cliff-0.033
fire-0.031
 brew-0.030
storm-0.030
 casc-0.030
TokenHere
Token ID4342
Feature activation-1.29e-02

Pos logit contributions
 cliff+0.033
www+0.030
works+0.030
 plunge+0.030
fire+0.030
Neg logit contributions
 hugged-0.028
 named-0.027
 played-0.027
 both-0.026
 loved-0.026
Token,
Token ID11
Feature activation-3.07e-02

Pos logit contributions
 cliff+0.035
www+0.033
 plunge+0.032
works+0.032
fire+0.032
Neg logit contributions
 hugged-0.011
 named-0.011
 played-0.011
 both-0.010
 loved-0.010
Token was
Token ID373
Feature activation+3.17e-03

Pos logit contributions
 cliff+0.006
works+0.006
www+0.006
fire+0.006
 plunge+0.006
Neg logit contributions
 hugged-0.003
 named-0.002
 played-0.002
 both-0.002
 loved-0.002
Token Elsa
Token ID19226
Feature activation+5.46e-02

Pos logit contributions
 hugged+0.003
 learned+0.003
 played+0.003
ossible+0.003
 loved+0.003
Neg logit contributions
 cliff-0.007
 stairs-0.006
 brew-0.006
works-0.006
 casc-0.006
Token.
Token ID13
Feature activation+8.81e-03

Pos logit contributions
 His+0.043
 hugged+0.041
 played+0.038
 sincere+0.038
ossible+0.038
Neg logit contributions
fire-0.134
 cliff-0.130
works-0.126
www-0.124
mer-0.123
Token Elsa
Token ID19226
Feature activation+1.99e-02

Pos logit contributions
 hugged+0.009
 played+0.009
 loved+0.009
 learned+0.009
 both+0.008
Neg logit contributions
 cliff-0.020
www-0.019
 stairs-0.019
fire-0.019
inner-0.018
Token had
Token ID550
Feature activation+3.83e-02

Pos logit contributions
 named+0.019
 both+0.019
 His+0.019
 his+0.018
 sincere+0.018
Neg logit contributions
 cliff-0.046
fire-0.043
www-0.043
works-0.043
 brew-0.042
Token blonde
Token ID21541
Feature activation+1.07e-01

Pos logit contributions
 named+0.036
 hugged+0.033
 sincere+0.033
 His+0.032
ossible+0.031
Neg logit contributions
 cliff-0.086
 plunge-0.084
www-0.083
 stairs-0.083
 inconvenience-0.079
Token hair
Token ID4190
Feature activation+9.90e-02

Pos logit contributions
 named+0.127
 sincere+0.126
 hugged+0.122
 forgive+0.121
ossible+0.118
Neg logit contributions
 cliff-0.191
 stairs-0.181
 plunge-0.180
 rav-0.180
 voltage-0.176
Token,
Token ID11
Feature activation+3.07e-02

Pos logit contributions
ossible+0.075
 sincere+0.074
 hugged+0.073
 named+0.071
 His+0.066
Neg logit contributions
 cliff-0.243
fire-0.239
works-0.230
storm-0.230
 stairs-0.228
Token a
Token ID257
Feature activation+2.15e-02

Pos logit contributions
ossible+0.036
 sincere+0.029
 named+0.028
 His+0.027
 traded+0.026
Neg logit contributions
 stairs-0.065
 cliff-0.063
 stair-0.060
 plunge-0.059
works-0.059
Token blue
Token ID4171
Feature activation+6.84e-02

Pos logit contributions
 hugged+0.026
 both+0.026
 played+0.025
 learned+0.024
 named+0.024
Neg logit contributions
 cliff-0.048
www-0.044
 stair-0.044
 plunge-0.043
fire-0.043
Token dress
Token ID6576
Feature activation+7.95e-02

Pos logit contributions
 learned+0.094
 loved+0.089
 both+0.088
 hugged+0.088
 played+0.087
Neg logit contributions
 cliff-0.143
 stair-0.136
 plunge-0.133
 stairs-0.128
www-0.126
Token the
Token ID262
Feature activation+1.10e-03

Pos logit contributions
 cliff+0.095
www+0.093
works+0.091
fire+0.090
umm+0.089
Neg logit contributions
 hugged-0.024
 both-0.023
 named-0.022
 played-0.021
 his-0.021
Token cart
Token ID6383
Feature activation-2.51e-02

Pos logit contributions
 hugged+0.002
 named+0.002
 loved+0.002
 played+0.002
 both+0.002
Neg logit contributions
www-0.002
 cliff-0.002
eway-0.002
 plunge-0.002
works-0.002
Token and
Token ID290
Feature activation+6.41e-02

Pos logit contributions
 cliff+0.079
fire+0.074
works+0.073
 plunge+0.073
 site+0.073
Neg logit contributions
 hugged-0.010
 named-0.008
 played-0.008
 loved-0.008
 both-0.007
Token take
Token ID1011
Feature activation-2.59e-02

Pos logit contributions
 hugged+0.081
 named+0.074
 played+0.073
 both+0.072
 his+0.068
Neg logit contributions
 cliff-0.156
www-0.150
works-0.148
fire-0.147
 plunge-0.145
Token Anna
Token ID11735
Feature activation+4.81e-02

Pos logit contributions
 cliff+0.069
www+0.064
works+0.064
fire+0.063
 plunge+0.062
Neg logit contributions
 hugged-0.026
 both-0.025
 played-0.024
 named-0.024
 his-0.023
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 sincere+0.055
 hugged+0.052
 His+0.052
 loved+0.051
 played+0.049
Neg logit contributions
 cliff-0.107
fire-0.103
 plunge-0.100
 brew-0.099
 outage-0.097
Token Ben
Token ID3932
Feature activation+4.03e-02

Pos logit contributions
 hugged+0.078
 named+0.075
 loved+0.071
 played+0.071
 both+0.068
Neg logit contributions
 cliff-0.299
 plunge-0.282
www-0.281
works-0.280
fire-0.279
Token away
Token ID1497
Feature activation+9.48e-03

Pos logit contributions
 sincere+0.034
 hugged+0.032
 loved+0.032
 His+0.032
 liked+0.030
Neg logit contributions
 cliff-0.098
fire-0.096
 plunge-0.093
 brew-0.090
 outage-0.090
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
 hugged+0.001
 named+0.001
 both+0.001
.+0.001
 his+0.001
Neg logit contributions
 cliff-0.033
www-0.032
 plunge-0.032
works-0.031
inner-0.031
Token They
Token ID1119
Feature activation+2.68e-02

Pos logit contributions
 hugged+0.010
 named+0.010
 played+0.010
 loved+0.009
 learned+0.009
Neg logit contributions
 cliff-0.028
www-0.026
fire-0.026
works-0.025
 brew-0.025
Token say
Token ID910
Feature activation+4.23e-03

Pos logit contributions
 hugged+0.034
 named+0.034
 played+0.033
 loved+0.032
 both+0.032
Neg logit contributions
 cliff-0.060
 plunge-0.055
www-0.055
works-0.054
fire-0.054
Token the
Token ID262
Feature activation+8.49e-03

Pos logit contributions
 cliff+0.005
 plunge+0.004
 brew+0.004
www+0.004
works+0.004
Neg logit contributions
 hugged-0.002
 named-0.001
 loved-0.001
 played-0.001
 learned-0.001
Token farm
Token ID5318
Feature activation+5.84e-05

Pos logit contributions
 named+0.013
 both+0.012
 played+0.012
 his+0.012
 loved+0.012
Neg logit contributions
www-0.017
works-0.017
iders-0.017
eway-0.017
rated-0.017
Token with
Token ID351
Feature activation+4.59e-02

Pos logit contributions
 hugged+0.000
 sincere+0.000
 learned+0.000
 loved+0.000
 played+0.000
Neg logit contributions
 cliff-0.000
fire-0.000
 site-0.000
 plunge-0.000
works-0.000
Token their
Token ID511
Feature activation+7.69e-02

Pos logit contributions
 hugged+0.046
 named+0.044
 loved+0.044
 learned+0.043
 played+0.043
Neg logit contributions
 cliff-0.110
 plunge-0.104
 brew-0.104
www-0.099
works-0.098
Token mom
Token ID1995
Feature activation+5.51e-02

Pos logit contributions
 hugged+0.042
 named+0.039
 played+0.037
 both+0.036
 loved+0.035
Neg logit contributions
 cliff-0.232
www-0.221
works-0.220
fire-0.220
 plunge-0.219
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 sincere+0.045
 hugged+0.043
 learned+0.040
 played+0.039
 loved+0.037
Neg logit contributions
 cliff-0.135
works-0.135
 brew-0.130
fire-0.129
 stair-0.129
Token dad
Token ID9955
Feature activation+5.20e-02

Pos logit contributions
 sincere+0.100
 hugged+0.094
 named+0.089
 both+0.088
 learned+0.087
Neg logit contributions
 cliff-0.259
 plunge-0.240
www-0.240
fire-0.236
 stair-0.233
Token.
Token ID13
Feature activation+8.68e-03

Pos logit contributions
 sincere+0.048
 hugged+0.041
 learned+0.039
 played+0.039
 forg+0.036
Neg logit contributions
 cliff-0.123
fire-0.121
 plunge-0.119
works-0.117
inner-0.116
Token They
Token ID1119
Feature activation+2.94e-02

Pos logit contributions
 hugged+0.010
 sincere+0.009
 named+0.009
 forg+0.009
 learned+0.009
Neg logit contributions
 cliff-0.021
 brew-0.019
 plunge-0.019
 stair-0.019
inner-0.019
Token saw
Token ID2497
Feature activation+7.99e-03

Pos logit contributions
 sincere+0.028
 His+0.026
 hearts+0.026
 eyes+0.025
 hair+0.025
Neg logit contributions
 cliff-0.071
www-0.065
fire-0.064
 plunge-0.064
Fre-0.063
Token many
Token ID867
Feature activation+1.52e-03

Pos logit contributions
 hugged+0.005
 named+0.005
 played+0.005
 loved+0.005
 both+0.005
Neg logit contributions
 cliff-0.023
www-0.022
works-0.022
fire-0.022
 plunge-0.022
Token sad
Token ID6507
Feature activation+2.93e-02

Pos logit contributions
 His+0.033
 both+0.033
 named+0.033
 played+0.032
 hugged+0.032
Neg logit contributions
 cliff-0.088
fire-0.086
www-0.084
 plunge-0.083
works-0.082
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
 hugged+0.015
 named+0.014
 played+0.012
 loved+0.011
 both+0.011
Neg logit contributions
 cliff-0.087
fire-0.084
works-0.084
 brew-0.082
www-0.081
Token They
Token ID1119
Feature activation+1.86e-02

Pos logit contributions
 hugged+0.019
 played+0.017
 named+0.016
 learned+0.016
 both+0.016
Neg logit contributions
 cliff-0.043
fire-0.041
www-0.041
works-0.040
Fre-0.040
Token told
Token ID1297
Feature activation-4.41e-03

Pos logit contributions
 His+0.018
 hearts+0.018
 eyes+0.017
 at+0.017
 sincere+0.017
Neg logit contributions
 cliff-0.040
www-0.039
fire-0.039
works-0.038
 brew-0.037
Token Sara
Token ID24799
Feature activation+2.91e-02

Pos logit contributions
 cliff+0.010
 brew+0.010
 plunge+0.010
works+0.010
www+0.009
Neg logit contributions
 hugged-0.005
 named-0.005
 played-0.005
 loved-0.004
 learned-0.004
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 sincere+0.032
 pockets+0.026
 swapped+0.025
 played+0.023
 both+0.023
Neg logit contributions
 outage-0.063
fire-0.061
works-0.060
 cliff-0.058
inner-0.058
Token Tom
Token ID4186
Feature activation+3.45e-02

Pos logit contributions
 named+0.102
 loved+0.102
 sincere+0.099
 hugged+0.099
 His+0.091
Neg logit contributions
 cliff-0.233
 inconvenience-0.223
www-0.220
works-0.217
 plunge-0.217
Token that
Token ID326
Feature activation+4.43e-03

Pos logit contributions
 named+0.041
 sincere+0.040
 jeans+0.039
 pockets+0.038
 hugged+0.038
Neg logit contributions
 plunge-0.066
 cliff-0.066
 inconvenience-0.065
 happen-0.064
 outage-0.062
Token they
Token ID484
Feature activation+2.48e-02

Pos logit contributions
 hugged+0.005
 named+0.004
 played+0.004
 both+0.004
 loved+0.004
Neg logit contributions
 cliff-0.011
www-0.010
works-0.010
 plunge-0.010
fire-0.010
Token should
Token ID815
Feature activation-3.49e-03

Pos logit contributions
 His+0.025
 eyes+0.025
 hearts+0.024
 goodbye+0.024
 sincere+0.023
Neg logit contributions
works-0.054
 cliff-0.052
fire-0.052
 outage-0.051
www-0.051
Token not
Token ID407
Feature activation-1.87e-02

Pos logit contributions
 cliff+0.008
www+0.008
 plunge+0.008
fire+0.008
works+0.008
Neg logit contributions
 hugged-0.004
 named-0.003
 played-0.003
 loved-0.003
 at-0.003
Token He
Token ID679
Feature activation+2.23e-02

Pos logit contributions
 hugged+0.020
 played+0.019
 named+0.018
 loved+0.018
 learned+0.017
Neg logit contributions
 cliff-0.052
fire-0.050
 stairs-0.049
inner-0.049
www-0.049
Token walked
Token ID6807
Feature activation-4.54e-03

Pos logit contributions
 sincere+0.021
 named+0.021
 both+0.021
 hugged+0.020
 His+0.020
Neg logit contributions
 cliff-0.051
fire-0.049
www-0.049
 brew-0.048
works-0.047
Token over
Token ID625
Feature activation+8.51e-03

Pos logit contributions
 cliff+0.010
fire+0.010
 plunge+0.010
www+0.010
works+0.010
Neg logit contributions
 hugged-0.006
 named-0.005
 played-0.005
 loved-0.005
 sincere-0.005
Token to
Token ID284
Feature activation-1.45e-02

Pos logit contributions
 hugged+0.006
 named+0.006
 played+0.005
 both+0.005
 loved+0.005
Neg logit contributions
 cliff-0.024
www-0.023
works-0.023
 plunge-0.023
fire-0.023
Token Sara
Token ID24799
Feature activation+3.49e-02

Pos logit contributions
 cliff+0.042
www+0.041
works+0.041
fire+0.040
iders+0.040
Neg logit contributions
 both-0.010
 his-0.010
 hugged-0.010
 named-0.010
 played-0.009
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 sincere+0.022
 hugged+0.021
 played+0.020
 loved+0.018
 named+0.018
Neg logit contributions
 cliff-0.096
fire-0.093
 brew-0.090
 site-0.089
 stair-0.088
Token Tom
Token ID4186
Feature activation+4.42e-02

Pos logit contributions
 loved+0.080
 named+0.080
 hugged+0.077
 played+0.075
 sincere+0.075
Neg logit contributions
 cliff-0.276
www-0.265
fire-0.262
 plunge-0.257
works-0.256
Token and
Token ID290
Feature activation+9.97e-02

Pos logit contributions
 sincere+0.038
 named+0.031
 hugged+0.030
 His+0.030
 loved+0.030
Neg logit contributions
fire-0.104
 cliff-0.103
 brew-0.099
 plunge-0.099
 stairs-0.098
Token handed
Token ID10158
Feature activation-1.15e-03

Pos logit contributions
 hugged+0.087
 named+0.085
 loved+0.083
 both+0.083
 played+0.082
Neg logit contributions
 cliff-0.253
www-0.243
fire-0.239
works-0.238
 plunge-0.236
Token them
Token ID606
Feature activation+3.94e-02

Pos logit contributions
 plunge+0.002
 cliff+0.002
www+0.002
works+0.002
 brew+0.002
Neg logit contributions
 played-0.001
 sincere-0.001
 hugged-0.001
 named-0.001
 loved-0.001
Token the
Token ID262
Feature activation+4.05e-03

Pos logit contributions
 named+0.045
 sincere+0.045
 hugged+0.042
 loved+0.040
 learned+0.039
Neg logit contributions
 cliff-0.090
 plunge-0.087
works-0.085
inner-0.085
fire-0.084
Token gives
Token ID3607
Feature activation-4.36e-03

Pos logit contributions
 hugged+0.036
 named+0.036
 played+0.034
 loved+0.034
 both+0.034
Neg logit contributions
 cliff-0.062
www-0.058
works-0.057
 plunge-0.056
 brew-0.056
Token the
Token ID262
Feature activation+4.19e-03

Pos logit contributions
 cliff+0.012
www+0.011
fire+0.011
works+0.011
 plunge+0.011
Neg logit contributions
 hugged-0.004
 named-0.004
 both-0.004
 played-0.004
 loved-0.004
Token car
Token ID1097
Feature activation+1.77e-02

Pos logit contributions
 hugged+0.007
 named+0.006
 sincere+0.006
 played+0.006
 learned+0.006
Neg logit contributions
 cliff-0.008
 stairs-0.008
 plunge-0.008
 stair-0.007
 site-0.007
Token to
Token ID284
Feature activation+1.99e-04

Pos logit contributions
 hugged+0.016
 named+0.015
 sincere+0.014
 played+0.014
 loved+0.014
Neg logit contributions
 cliff-0.046
works-0.043
fire-0.043
www-0.043
inner-0.043
Token Sara
Token ID24799
Feature activation+3.74e-02

Pos logit contributions
 hugged+0.000
 named+0.000
 played+0.000
 both+0.000
 loved+0.000
Neg logit contributions
 cliff-0.000
www-0.000
works-0.000
fire-0.000
 plunge-0.000
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 sincere+0.029
 hugged+0.026
 played+0.025
 His+0.021
 named+0.020
Neg logit contributions
 cliff-0.093
fire-0.090
 brew-0.087
 stair-0.086
www-0.086
Token Ben
Token ID3932
Feature activation+3.56e-02

Pos logit contributions
 hugged+0.096
 named+0.095
 loved+0.090
 played+0.089
 learned+0.086
Neg logit contributions
 cliff-0.276
www-0.260
 plunge-0.259
 brew-0.253
works-0.253
Token.
Token ID13
Feature activation+4.25e-03

Pos logit contributions
 sincere+0.023
 named+0.022
 hugged+0.021
 His+0.020
 played+0.018
Neg logit contributions
 cliff-0.091
inner-0.087
 brew-0.086
 plunge-0.086
fire-0.085
Token She
Token ID1375
Feature activation+2.88e-02

Pos logit contributions
 hugged+0.005
 named+0.004
 learned+0.004
 played+0.004
 loved+0.004
Neg logit contributions
 cliff-0.010
www-0.010
 plunge-0.010
 brew-0.010
inner-0.009
Token says
Token ID1139
Feature activation-4.60e-03

Pos logit contributions
 named+0.046
 hugged+0.045
 played+0.044
 loved+0.044
 both+0.043
Neg logit contributions
 cliff-0.051
works-0.047
www-0.047
inner-0.047
 plunge-0.047
Token,
Token ID11
Feature activation+1.35e-02

Pos logit contributions
 cliff+0.012
www+0.011
 plunge+0.011
works+0.011
 inconvenience+0.011
Neg logit contributions
 hugged-0.004
 played-0.004
 both-0.004
 loved-0.004
 named-0.004
Token.
Token ID13
Feature activation+5.10e-03

Pos logit contributions
 hugged+0.015
 named+0.013
 played+0.012
 His+0.011
 loved+0.011
Neg logit contributions
 cliff-0.094
works-0.089
fire-0.089
www-0.088
 outage-0.088
Token They
Token ID1119
Feature activation+2.47e-02

Pos logit contributions
 hugged+0.004
 named+0.003
 played+0.003
 both+0.003
 loved+0.003
Neg logit contributions
 cliff-0.015
www-0.014
 plunge-0.014
works-0.014
fire-0.014
Token yell
Token ID28946
Feature activation+1.87e-02

Pos logit contributions
 hugged+0.031
 named+0.029
 played+0.029
 both+0.028
 loved+0.028
Neg logit contributions
 cliff-0.057
www-0.053
 plunge-0.053
works-0.053
fire-0.052
Token at
Token ID379
Feature activation-3.73e-03

Pos logit contributions
 named+0.012
 hugged+0.011
 sincere+0.011
 played+0.011
 His+0.010
Neg logit contributions
works-0.047
 cliff-0.047
fire-0.047
 brew-0.046
iders-0.046
Token Lily
Token ID20037
Feature activation+2.80e-02

Pos logit contributions
 cliff+0.010
works+0.009
www+0.009
 plunge+0.009
fire+0.009
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 loved-0.003
 both-0.003
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 sincere+0.019
 hugged+0.017
 played+0.016
 loved+0.015
 named+0.015
Neg logit contributions
 cliff-0.074
fire-0.072
 outage-0.072
works-0.070
 brew-0.070
Token Max
Token ID5436
Feature activation+3.61e-02

Pos logit contributions
 hugged+0.077
 named+0.072
 played+0.070
 loved+0.070
 both+0.066
Neg logit contributions
 cliff-0.299
 plunge-0.281
www-0.281
works-0.280
fire-0.279
Token and
Token ID290
Feature activation+1.03e-01

Pos logit contributions
 hugged+0.021
 sincere+0.019
 named+0.019
 loved+0.018
 His+0.018
Neg logit contributions
 cliff-0.100
fire-0.096
 plunge-0.096
 brew-0.095
works-0.094
Token take
Token ID1011
Feature activation-2.11e-02

Pos logit contributions
 hugged+0.098
 named+0.094
 played+0.091
 loved+0.091
 both+0.090
Neg logit contributions
 cliff-0.263
www-0.247
works-0.246
 plunge-0.245
fire-0.244
Token them
Token ID606
Feature activation+2.75e-02

Pos logit contributions
 cliff+0.053
 plunge+0.050
www+0.050
works+0.050
fire+0.050
Neg logit contributions
 hugged-0.021
 named-0.020
 played-0.020
 loved-0.020
 both-0.019
Token out
Token ID503
Feature activation+1.20e-02

Pos logit contributions
 sincere+0.044
 hugged+0.043
 loved+0.043
 named+0.042
 liked+0.042
Neg logit contributions
 cliff-0.050
 plunge-0.048
fire-0.047
 outage-0.045
 stairs-0.045
Token wanted
Token ID2227
Feature activation-2.02e-02

Pos logit contributions
 hugged+0.007
 named+0.007
 both+0.007
 sincere+0.007
 His+0.007
Neg logit contributions
 cliff-0.021
www-0.019
fire-0.019
 plunge-0.019
 brew-0.019
Token to
Token ID284
Feature activation-1.66e-02

Pos logit contributions
 cliff+0.053
www+0.050
 plunge+0.049
fire+0.049
works+0.049
Neg logit contributions
 hugged-0.019
 named-0.018
 played-0.017
 loved-0.017
 both-0.017
Token play
Token ID711
Feature activation+8.32e-03

Pos logit contributions
 cliff+0.039
works+0.037
www+0.037
fire+0.036
 plunge+0.036
Neg logit contributions
 hugged-0.021
 named-0.020
 played-0.020
 both-0.019
 loved-0.019
Token with
Token ID351
Feature activation+1.12e-02

Pos logit contributions
 hugged+0.008
 played+0.007
 loved+0.007
 learned+0.006
 sincere+0.006
Neg logit contributions
 cliff-0.021
fire-0.021
 brew-0.020
 plunge-0.020
 site-0.020
Token Sara
Token ID24799
Feature activation+4.74e-02

Pos logit contributions
 hugged+0.013
 named+0.012
 played+0.012
 learned+0.012
 loved+0.011
Neg logit contributions
 cliff-0.026
 plunge-0.025
 brew-0.024
www-0.024
fire-0.024
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
 hugged+0.033
 sincere+0.032
 His+0.028
 played+0.027
 named+0.027
Neg logit contributions
 cliff-0.125
fire-0.119
 plunge-0.117
 stair-0.115
works-0.114
Token Tom
Token ID4186
Feature activation+3.90e-02

Pos logit contributions
 hugged+0.097
 named+0.092
 played+0.089
 both+0.087
 loved+0.087
Neg logit contributions
 cliff-0.280
www-0.264
 plunge-0.262
works-0.261
fire-0.261
Token more
Token ID517
Feature activation+3.30e-02

Pos logit contributions
 hugged+0.026
 named+0.025
 sincere+0.025
 His+0.023
 played+0.020
Neg logit contributions
 cliff-0.102
 plunge-0.099
fire-0.098
 brew-0.093
works-0.092
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
 hugged+0.025
 named+0.022
 played+0.022
 both+0.022
 loved+0.021
Neg logit contributions
 cliff-0.093
fire-0.087
www-0.087
 plunge-0.087
works-0.086
Token He
Token ID679
Feature activation+1.31e-02

Pos logit contributions
 hugged+0.013
 named+0.012
 played+0.012
 loved+0.011
 both+0.011
Neg logit contributions
 cliff-0.038
www-0.036
 plunge-0.035
fire-0.035
works-0.035
Token wished
Token ID16555
Feature activation-9.49e-03

Pos logit contributions
 hugged+0.012
 named+0.012
 both+0.012
 played+0.012
 sincere+0.012
Neg logit contributions
 cliff-0.032
www-0.030
fire-0.030
works-0.030
 plunge-0.030
Token
Token ID198
Feature activation-1.69e-02

Pos logit contributions
 cliff+0.086
www+0.079
fire+0.077
 plunge+0.074
works+0.073
Neg logit contributions
 hugged-0.080
 named-0.077
 his-0.076
 played-0.076
 both-0.076
Token
Token ID198
Feature activation-2.58e-02

Pos logit contributions
 cliff+0.050
works+0.048
 plunge+0.047
fire+0.047
www+0.047
Neg logit contributions
 hugged-0.010
 named-0.010
 loved-0.009
 both-0.009
 played-0.009
TokenSo
Token ID2396
Feature activation-7.83e-02

Pos logit contributions
 cliff+0.060
works+0.057
 plunge+0.056
fire+0.054
 brew+0.054
Neg logit contributions
 hugged-0.033
 named-0.032
 both-0.031
 played-0.031
 loved-0.031
Token,
Token ID11
Feature activation-5.22e-02

Pos logit contributions
 cliff+0.215
www+0.203
works+0.202
 plunge+0.202
fire+0.202
Neg logit contributions
 hugged-0.064
 named-0.060
 played-0.058
 both-0.057
 loved-0.056
Token the
Token ID262
Feature activation-4.88e-02

Pos logit contributions
 cliff+0.133
works+0.126
fire+0.124
 plunge+0.123
www+0.123
Neg logit contributions
 hugged-0.055
 named-0.053
 both-0.052
 played-0.051
 loved-0.049
Token sp
Token ID599
Feature activation-1.32e-01

Pos logit contributions
works+0.102
 cliff+0.101
eway+0.101
 outage+0.101
www+0.100
Neg logit contributions
 hugged-0.068
 named-0.068
 loved-0.066
 played-0.065
 both-0.063
Tokenarrow
Token ID6018
Feature activation-4.98e-02

Pos logit contributions
 plunge+0.181
 cliff+0.176
 inconvenience+0.167
 slope+0.164
fire+0.164
Neg logit contributions
 loved-0.270
 hugged-0.265
 both-0.264
 at-0.260
 friends-0.258
Token flew
Token ID13112
Feature activation-1.48e-02

Pos logit contributions
 cliff+0.133
www+0.126
works+0.126
 plunge+0.126
fire+0.124
Neg logit contributions
 hugged-0.047
 named-0.044
 played-0.043
 loved-0.042
 both-0.041
Token around
Token ID1088
Feature activation-8.76e-04

Pos logit contributions
 cliff+0.033
www+0.031
works+0.030
 plunge+0.030
fire+0.030
Neg logit contributions
 hugged-0.020
 named-0.019
 played-0.019
 both-0.019
 loved-0.019
Token the
Token ID262
Feature activation-3.30e-02

Pos logit contributions
 cliff+0.003
www+0.003
works+0.003
 plunge+0.003
inner+0.002
Neg logit contributions
 named-0.000
 hugged-0.000
 played-0.000
 both-0.000
 his-0.000
Token c
Token ID269
Feature activation-6.39e-02

Pos logit contributions
works+0.067
eway+0.065
 plunge+0.064
inner+0.063
www+0.063
Neg logit contributions
 named-0.052
 hugged-0.051
 loved-0.051
 played-0.050
 learned-0.050
Token you
Token ID345
Feature activation-2.76e-02

Pos logit contributions
 cliff+0.031
 outage+0.030
works+0.029
eway+0.028
fire+0.028
Neg logit contributions
 both-0.024
 hugged-0.024
 named-0.024
 played-0.023
 his-0.023
Token".
Token ID1911
Feature activation-2.58e-03

Pos logit contributions
 cliff+0.065
works+0.061
www+0.060
 plunge+0.060
 brew+0.060
Neg logit contributions
 hugged-0.033
 named-0.033
 both-0.033
 and-0.031
 at-0.031
Token
Token ID198
Feature activation-4.87e-03

Pos logit contributions
 cliff+0.007
www+0.007
fire+0.007
 plunge+0.007
works+0.007
Neg logit contributions
 hugged-0.002
 named-0.002
 played-0.002
 loved-0.002
 both-0.002
Token
Token ID198
Feature activation-2.16e-02

Pos logit contributions
 cliff+0.014
www+0.013
fire+0.013
 plunge+0.013
works+0.013
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 both-0.002
 loved-0.002
TokenThe
Token ID464
Feature activation-3.34e-02

Pos logit contributions
 cliff+0.042
www+0.038
works+0.038
 plunge+0.038
fire+0.038
Neg logit contributions
 hugged-0.035
 named-0.034
 played-0.034
 both-0.034
 loved-0.034
Token bad
Token ID2089
Feature activation-1.27e-01

Pos logit contributions
 cliff+0.075
works+0.075
www+0.072
fire+0.071
storm+0.070
Neg logit contributions
 named-0.042
 hugged-0.042
 played-0.041
 loved-0.041
 learned-0.040
Tokenger
Token ID1362
Feature activation-7.49e-02

Pos logit contributions
 cliff+0.176
works+0.174
fire+0.162
www+0.161
 plunge+0.160
Neg logit contributions
 named-0.262
 his-0.262
 and-0.259
 loved-0.258
 played-0.258
Token was
Token ID373
Feature activation-8.50e-03

Pos logit contributions
 cliff+0.199
www+0.188
 plunge+0.187
works+0.187
fire+0.186
Neg logit contributions
 hugged-0.068
 named-0.064
 played-0.063
 loved-0.061
 both-0.061
Token very
Token ID845
Feature activation-2.84e-02

Pos logit contributions
 cliff+0.020
fire+0.019
www+0.019
 brew+0.019
 plunge+0.018
Neg logit contributions
 hugged-0.009
 played-0.009
 named-0.008
 both-0.008
 His-0.008
Token surprised
Token ID6655
Feature activation+1.15e-02

Pos logit contributions
 cliff+0.076
works+0.073
 plunge+0.072
www+0.072
fire+0.071
Neg logit contributions
 hugged-0.026
 named-0.025
 played-0.024
 loved-0.024
 both-0.023
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
 hugged+0.009
 played+0.008
 named+0.008
 learned+0.008
 loved+0.007
Neg logit contributions
 cliff-0.031
www-0.030
 plunge-0.029
fire-0.029
works-0.029
Token and
Token ID290
Feature activation+1.11e-03

Pos logit contributions
 cliff+0.053
www+0.050
 plunge+0.050
works+0.050
fire+0.050
Neg logit contributions
 hugged-0.010
 named-0.009
 played-0.009
 loved-0.008
 both-0.008
Token farther
Token ID18485
Feature activation-1.98e-02

Pos logit contributions
 hugged+0.002
 both+0.002
 played+0.002
 learned+0.002
 still+0.002
Neg logit contributions
www-0.002
inner-0.002
works-0.002
iders-0.002
 cliff-0.002
Token,
Token ID11
Feature activation-2.49e-02

Pos logit contributions
 cliff+0.051
 plunge+0.047
fire+0.047
www+0.047
works+0.047
Neg logit contributions
 hugged-0.020
 named-0.019
 played-0.018
 loved-0.018
 both-0.018
Token until
Token ID1566
Feature activation-2.73e-02

Pos logit contributions
 cliff+0.068
www+0.065
works+0.065
 plunge+0.064
fire+0.064
Neg logit contributions
 hugged-0.020
 named-0.019
 played-0.019
 both-0.018
 loved-0.018
Token the
Token ID262
Feature activation-3.38e-02

Pos logit contributions
 cliff+0.074
works+0.071
www+0.071
fire+0.070
 plunge+0.070
Neg logit contributions
 hugged-0.022
 named-0.022
 played-0.021
 both-0.021
 loved-0.020
Token oct
Token ID19318
Feature activation-1.27e-01

Pos logit contributions
iders+0.067
www+0.067
 cliff+0.066
eway+0.066
 outage+0.065
Neg logit contributions
 hugged-0.049
 played-0.049
 both-0.049
 named-0.047
 loved-0.047
Tokenopus
Token ID25790
Feature activation-7.02e-02

Pos logit contributions
 plunge+0.172
works+0.172
 brew+0.168
 cliff+0.167
rated+0.163
Neg logit contributions
 named-0.258
 played-0.252
 hugged-0.251
 loved-0.249
 at-0.245
Token was
Token ID373
Feature activation-1.91e-02

Pos logit contributions
 cliff+0.178
www+0.170
works+0.170
 plunge+0.168
fire+0.165
Neg logit contributions
 hugged-0.073
 played-0.070
 named-0.069
 loved-0.067
 both-0.066
Token finally
Token ID3443
Feature activation-1.73e-03

Pos logit contributions
 cliff+0.042
 plunge+0.039
fire+0.038
 brew+0.038
www+0.037
Neg logit contributions
 hugged-0.025
 named-0.025
 played-0.024
 His-0.023
 loved-0.023
Token blo
Token ID24924
Feature activation-3.16e-02

Pos logit contributions
 cliff+0.004
 plunge+0.004
fire+0.004
www+0.004
 brew+0.004
Neg logit contributions
 hugged-0.002
 named-0.002
 played-0.002
 loved-0.002
 both-0.002
Tokenoming
Token ID3383
Feature activation+1.40e-02

Pos logit contributions
 cliff+0.049
 plunge+0.044
www+0.044
works+0.044
fire+0.044
Neg logit contributions
 hugged-0.064
 named-0.062
 played-0.061
 loved-0.061
 both-0.060
Token
Token ID26391
Feature activation-4.50e-02

Pos logit contributions
 cliff+0.074
 plunge+0.064
 stairs+0.061
 brew+0.061
 stair+0.060
Neg logit contributions
 hugged-0.121
 named-0.119
 played-0.116
 learned-0.116
 loved-0.115
Token
Token ID220
Feature activation-7.32e-03

Pos logit contributions
 cliff+0.088
www+0.083
fire+0.080
works+0.077
 plunge+0.076
Neg logit contributions
 hugged-0.070
 named-0.067
 played-0.066
 his-0.066
 sincere-0.066
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
 cliff+0.021
 plunge+0.019
fire+0.019
www+0.019
 brew+0.019
Neg logit contributions
 both-0.006
 hugged-0.006
 His-0.005
 named-0.005
 loved-0.005
Token
Token ID198
Feature activation-2.47e-02

Pos logit contributions
 cliff+0.044
www+0.042
works+0.042
 plunge+0.042
fire+0.042
Neg logit contributions
 hugged-0.008
 named-0.007
 played-0.007
 both-0.007
 loved-0.006
TokenThe
Token ID464
Feature activation-3.32e-02

Pos logit contributions
 cliff+0.057
works+0.053
 plunge+0.053
www+0.052
fire+0.052
Neg logit contributions
 hugged-0.032
 named-0.031
 played-0.030
 both-0.030
 loved-0.029
Token le
Token ID443
Feature activation-1.26e-01

Pos logit contributions
 cliff+0.070
 outage+0.070
works+0.069
eway+0.069
inner+0.068
Neg logit contributions
 named-0.047
 loved-0.044
 played-0.043
 hugged-0.043
 both-0.043
Tokenopard
Token ID15478
Feature activation-5.83e-02

Pos logit contributions
 cliff+0.173
 plunge+0.157
 brew+0.154
works+0.151
eway+0.147
Neg logit contributions
 named-0.282
 hugged-0.275
 played-0.270
 loved-0.270
 both-0.262
Token thanked
Token ID26280
Feature activation+4.23e-03

Pos logit contributions
 cliff+0.157
www+0.148
works+0.148
 plunge+0.148
fire+0.147
Neg logit contributions
 hugged-0.051
 named-0.048
 played-0.047
 loved-0.046
 both-0.045
Token the
Token ID262
Feature activation-2.94e-02

Pos logit contributions
 hugged+0.004
 named+0.003
 both+0.003
 his+0.003
 played+0.003
Neg logit contributions
 cliff-0.012
works-0.011
fire-0.011
 plunge-0.011
www-0.011
Token rabbit
Token ID22746
Feature activation-2.41e-02

Pos logit contributions
 cliff+0.062
works+0.061
 outage+0.061
iders+0.060
inner+0.060
Neg logit contributions
 named-0.043
 hugged-0.040
 played-0.040
 loved-0.039
 learned-0.039
Token and
Token ID290
Feature activation+3.54e-02

Pos logit contributions
 cliff+0.072
fire+0.067
 plunge+0.067
www+0.067
works+0.067
Neg logit contributions
 hugged-0.013
 played-0.012
 named-0.011
 loved-0.011
 sincere-0.010
Token back
Token ID736
Feature activation+1.12e-03

Pos logit contributions
 hugged+0.034
 named+0.032
 learned+0.031
 played+0.031
 sincere+0.030
Neg logit contributions
 cliff-0.091
fire-0.087
 plunge-0.087
works-0.085
eway-0.085
Token.
Token ID13
Feature activation-1.45e-03

Pos logit contributions
 hugged+0.000
 named+0.000
 played+0.000
 loved+0.000
 both+0.000
Neg logit contributions
 cliff-0.004
www-0.003
 plunge-0.003
works-0.003
fire-0.003
Token
Token ID198
Feature activation+2.90e-03

Pos logit contributions
 cliff+0.004
www+0.004
works+0.004
fire+0.004
 plunge+0.004
Neg logit contributions
 hugged-0.001
 named-0.001
 played-0.001
 loved-0.001
 both-0.001
Token
Token ID198
Feature activation-1.42e-02

Pos logit contributions
 hugged+0.001
 named+0.001
 loved+0.001
 played+0.001
 both+0.001
Neg logit contributions
 cliff-0.009
works-0.009
www-0.009
fire-0.009
 plunge-0.009
TokenThe
Token ID464
Feature activation-2.36e-02

Pos logit contributions
 cliff+0.023
works+0.023
 inconvenience+0.023
storm+0.023
umm+0.022
Neg logit contributions
 loved-0.027
 both-0.027
 hugged-0.026
 named-0.026
 played-0.026
Token oct
Token ID19318
Feature activation-1.26e-01

Pos logit contributions
iders+0.040
 cliff+0.040
works+0.039
fire+0.039
 outage+0.039
Neg logit contributions
 named-0.042
 played-0.042
 hugged-0.042
 loved-0.041
 both-0.041
Tokenopus
Token ID25790
Feature activation-5.03e-02

Pos logit contributions
 brew+0.171
 cliff+0.170
 plunge+0.168
works+0.164
rated+0.164
Neg logit contributions
 named-0.263
 hugged-0.258
 loved-0.256
 played-0.250
 and-0.245
Token asked
Token ID1965
Feature activation-2.40e-02

Pos logit contributions
 cliff+0.121
fire+0.112
www+0.111
 plunge+0.111
works+0.109
Neg logit contributions
 hugged-0.050
 sincere-0.048
 both-0.048
 named-0.047
 His-0.046
Token El
Token ID2574
Feature activation-3.71e-02

Pos logit contributions
 cliff+0.055
www+0.052
works+0.051
fire+0.051
 plunge+0.051
Neg logit contributions
 hugged-0.031
 named-0.030
 played-0.029
 loved-0.029
 both-0.029
Tokensie
Token ID44524
Feature activation-1.91e-02

Pos logit contributions
 cliff+0.054
www+0.051
storm+0.050
works+0.049
fire+0.049
Neg logit contributions
 named-0.077
 hugged-0.075
 loved-0.074
 at-0.074
 played-0.074
Token if
Token ID611
Feature activation-2.51e-02

Pos logit contributions
 cliff+0.050
 plunge+0.047
fire+0.045
works+0.045
eway+0.045
Neg logit contributions
 hugged-0.017
 played-0.016
 named-0.015
 sincere-0.015
 learned-0.014
Token!
Token ID0
Feature activation-4.94e-03

Pos logit contributions
 cliff+0.106
www+0.102
works+0.101
 plunge+0.101
 brew+0.100
Neg logit contributions
 hugged-0.012
 named-0.011
 played-0.010
 both-0.010
 loved-0.010
Token
Token ID220
Feature activation-4.66e-03

Pos logit contributions
 cliff+0.013
www+0.013
fire+0.013
 plunge+0.012
works+0.012
Neg logit contributions
 hugged-0.004
 named-0.004
 played-0.003
 sincere-0.003
 loved-0.003
Token
Token ID198
Feature activation-6.61e-03

Pos logit contributions
 cliff+0.014
fire+0.013
works+0.013
www+0.013
 plunge+0.013
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 loved-0.002
 both-0.002
Token
Token ID198
Feature activation-1.83e-02

Pos logit contributions
 cliff+0.021
works+0.020
www+0.020
 plunge+0.019
fire+0.019
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 both-0.003
 loved-0.003
TokenThe
Token ID464
Feature activation-2.26e-02

Pos logit contributions
 cliff+0.043
works+0.040
 plunge+0.040
www+0.040
fire+0.040
Neg logit contributions
 hugged-0.022
 named-0.021
 played-0.021
 both-0.021
 loved-0.021
Token oct
Token ID19318
Feature activation-1.25e-01

Pos logit contributions
 cliff+0.044
www+0.042
works+0.042
iders+0.041
fire+0.041
Neg logit contributions
 hugged-0.035
 named-0.035
 played-0.034
 both-0.034
 loved-0.034
Tokenopus
Token ID25790
Feature activation-6.02e-02

Pos logit contributions
 brew+0.167
 plunge+0.167
 cliff+0.166
works+0.159
rated+0.155
Neg logit contributions
 named-0.262
 hugged-0.259
 loved-0.259
 played-0.250
 and-0.247
Token played
Token ID2826
Feature activation-1.44e-02

Pos logit contributions
 cliff+0.155
fire+0.145
www+0.145
 plunge+0.143
works+0.143
Neg logit contributions
 hugged-0.055
 named-0.053
 both-0.050
 played-0.050
 loved-0.048
Token all
Token ID477
Feature activation-1.06e-03

Pos logit contributions
 cliff+0.042
www+0.040
works+0.040
 plunge+0.039
fire+0.039
Neg logit contributions
 hugged-0.009
 named-0.009
 played-0.009
 both-0.008
 loved-0.008
Token day
Token ID1110
Feature activation-2.66e-02

Pos logit contributions
 cliff+0.002
www+0.002
works+0.002
 plunge+0.002
fire+0.002
Neg logit contributions
 hugged-0.002
 named-0.002
 played-0.002
 loved-0.002
 both-0.002
Token until
Token ID1566
Feature activation-2.72e-02

Pos logit contributions
 cliff+0.080
www+0.076
works+0.076
 plunge+0.076
fire+0.076
Neg logit contributions
 hugged-0.014
 named-0.013
 played-0.013
 both-0.012
 loved-0.012
Token
Token ID198
Feature activation-2.01e-02

Pos logit contributions
 cliff+0.021
www+0.020
works+0.020
 plunge+0.020
fire+0.020
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 both-0.003
 loved-0.003
TokenSo
Token ID2396
Feature activation-7.81e-02

Pos logit contributions
 cliff+0.051
works+0.048
 plunge+0.048
www+0.047
 brew+0.047
Neg logit contributions
 hugged-0.021
 named-0.020
 both-0.020
 played-0.019
 loved-0.019
Token,
Token ID11
Feature activation-5.60e-02

Pos logit contributions
 cliff+0.228
www+0.215
 plunge+0.214
fire+0.213
works+0.212
Neg logit contributions
 hugged-0.050
 named-0.045
 played-0.044
 loved-0.043
 both-0.040
Token the
Token ID262
Feature activation-3.99e-02

Pos logit contributions
 cliff+0.160
works+0.153
www+0.153
 plunge+0.151
fire+0.151
Neg logit contributions
 hugged-0.039
 named-0.037
 both-0.036
 played-0.036
 loved-0.034
Token smart
Token ID4451
Feature activation-1.01e-01

Pos logit contributions
 cliff+0.065
works+0.064
www+0.064
iders+0.064
eway+0.063
Neg logit contributions
 hugged-0.073
 named-0.073
 played-0.072
 both-0.071
 loved-0.071
Token oct
Token ID19318
Feature activation-1.25e-01

Pos logit contributions
 cliff+0.164
www+0.153
 plunge+0.153
works+0.151
iders+0.149
Neg logit contributions
 hugged-0.192
 named-0.192
 loved-0.188
 played-0.186
 and-0.184
Tokenopus
Token ID25790
Feature activation-6.16e-02

Pos logit contributions
 brew+0.160
 cliff+0.159
 plunge+0.156
works+0.150
rated+0.144
Neg logit contributions
 named-0.272
 hugged-0.266
 loved-0.264
 played-0.257
 at-0.254
Token thought
Token ID1807
Feature activation-3.38e-02

Pos logit contributions
 cliff+0.159
fire+0.149
www+0.147
 plunge+0.146
works+0.145
Neg logit contributions
 hugged-0.056
 both-0.052
 named-0.052
 sincere-0.050
 played-0.050
Token of
Token ID286
Feature activation+1.76e-02

Pos logit contributions
 cliff+0.092
 plunge+0.087
fire+0.086
works+0.086
www+0.086
Neg logit contributions
 hugged-0.028
 named-0.026
 played-0.025
 both-0.025
 loved-0.024
Token a
Token ID257
Feature activation-1.04e-02

Pos logit contributions
 hugged+0.012
 named+0.012
 played+0.011
 both+0.011
 loved+0.011
Neg logit contributions
 cliff-0.050
www-0.048
works-0.047
 plunge-0.047
fire-0.047
Token plan
Token ID1410
Feature activation-3.15e-02

Pos logit contributions
 cliff+0.020
works+0.020
iders+0.020
www+0.020
inner+0.019
Neg logit contributions
 hugged-0.016
 named-0.016
 loved-0.015
 played-0.015
 learned-0.015
Token.
Token ID13
Feature activation-7.13e-03

Pos logit contributions
 cliff+0.014
 plunge+0.013
fire+0.013
www+0.013
 brew+0.013
Neg logit contributions
 hugged-0.006
 played-0.005
 named-0.005
 loved-0.005
 both-0.005
Token
Token ID220
Feature activation-6.49e-03

Pos logit contributions
 cliff+0.021
www+0.019
fire+0.019
 plunge+0.019
works+0.019
Neg logit contributions
 hugged-0.005
 named-0.004
 played-0.004
 loved-0.004
 both-0.004
Token
Token ID198
Feature activation-5.23e-03

Pos logit contributions
 cliff+0.020
www+0.019
works+0.019
 plunge+0.019
fire+0.019
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 both-0.003
 loved-0.003
Token
Token ID198
Feature activation-1.66e-02

Pos logit contributions
 cliff+0.015
www+0.015
fire+0.015
 plunge+0.015
works+0.014
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 both-0.003
 loved-0.003
TokenThe
Token ID464
Feature activation-2.65e-02

Pos logit contributions
 cliff+0.039
www+0.036
fire+0.036
 plunge+0.036
works+0.036
Neg logit contributions
 hugged-0.021
 named-0.020
 played-0.019
 both-0.019
 loved-0.019
Token oct
Token ID19318
Feature activation-1.25e-01

Pos logit contributions
works+0.051
 cliff+0.050
eway+0.050
 outage+0.049
www+0.049
Neg logit contributions
 hugged-0.042
 named-0.042
 loved-0.041
 both-0.041
 played-0.041
Tokenopus
Token ID25790
Feature activation-6.07e-02

Pos logit contributions
works+0.184
 plunge+0.180
 cliff+0.176
 brew+0.172
rated+0.164
Neg logit contributions
 named-0.251
 hugged-0.249
 loved-0.244
 at-0.241
 and-0.240
Token had
Token ID550
Feature activation+9.35e-03

Pos logit contributions
 cliff+0.160
www+0.151
fire+0.150
works+0.149
 plunge+0.149
Neg logit contributions
 hugged-0.054
 named-0.052
 played-0.050
 both-0.049
 loved-0.049
Token sn
Token ID3013
Feature activation-2.50e-02

Pos logit contributions
 named+0.010
 hugged+0.010
 played+0.009
 both+0.009
 loved+0.009
Neg logit contributions
 cliff-0.023
 plunge-0.021
www-0.021
 brew-0.021
fire-0.021
Tokenuck
Token ID1347
Feature activation-1.53e-02

Pos logit contributions
 cliff+0.046
www+0.042
 plunge+0.042
fire+0.042
works+0.042
Neg logit contributions
 hugged-0.043
 named-0.042
 played-0.041
 both-0.041
 loved-0.041
Token away
Token ID1497
Feature activation-3.54e-03

Pos logit contributions
 cliff+0.041
www+0.038
 plunge+0.038
works+0.038
fire+0.038
Neg logit contributions
 hugged-0.014
 named-0.013
 played-0.013
 both-0.012
 loved-0.012
Token.
Token ID13
Feature activation-4.73e-03

Pos logit contributions
 hugged+0.004
 named+0.004
 played+0.004
 both+0.004
 loved+0.004
Neg logit contributions
 cliff-0.023
www-0.021
works-0.021
 plunge-0.021
fire-0.021
Token
Token ID220
Feature activation-3.06e-03

Pos logit contributions
 cliff+0.014
www+0.013
 plunge+0.013
fire+0.013
works+0.013
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 loved-0.002
 both-0.002
Token
Token ID198
Feature activation-9.63e-03

Pos logit contributions
 cliff+0.009
www+0.008
works+0.008
fire+0.008
 plunge+0.008
Neg logit contributions
 hugged-0.002
 named-0.002
 played-0.002
 loved-0.002
 both-0.002
Token
Token ID198
Feature activation-1.87e-02

Pos logit contributions
 cliff+0.032
works+0.030
 plunge+0.030
www+0.030
storm+0.030
Neg logit contributions
 hugged-0.003
 named-0.003
 both-0.003
 loved-0.003
 learned-0.002
TokenThe
Token ID464
Feature activation-3.20e-02

Pos logit contributions
 cliff+0.048
works+0.046
 plunge+0.045
www+0.044
 brew+0.044
Neg logit contributions
 hugged-0.019
 named-0.019
 both-0.018
 loved-0.018
 played-0.018
Token bad
Token ID2089
Feature activation-1.24e-01

Pos logit contributions
www+0.061
 cliff+0.059
inner+0.059
works+0.058
 outage+0.057
Neg logit contributions
 named-0.052
 hugged-0.051
 loved-0.050
 learned-0.050
 sincere-0.049
Token creature
Token ID7185
Feature activation-5.91e-02

Pos logit contributions
 cliff+0.226
 plunge+0.221
www+0.218
inner+0.214
iders+0.212
Neg logit contributions
 loved-0.206
 named-0.205
 hugged-0.200
 and-0.197
 learned-0.195
Token asked
Token ID1965
Feature activation-2.21e-02

Pos logit contributions
 cliff+0.156
www+0.147
works+0.146
 plunge+0.146
fire+0.145
Neg logit contributions
 hugged-0.054
 named-0.052
 played-0.050
 both-0.049
 loved-0.049
Token the
Token ID262
Feature activation-3.90e-02

Pos logit contributions
 cliff+0.063
www+0.060
works+0.059
 plunge+0.059
fire+0.059
Neg logit contributions
 hugged-0.016
 named-0.015
 played-0.014
 both-0.014
 loved-0.014
Token fairy
Token ID25607
Feature activation-1.48e-02

Pos logit contributions
www+0.078
inner+0.078
works+0.076
 outage+0.074
 cliff+0.074
Neg logit contributions
 named-0.060
 sincere-0.058
 loved-0.058
 learned-0.057
 hugged-0.057
Token,
Token ID11
Feature activation-8.05e-03

Pos logit contributions
 cliff+0.039
 plunge+0.036
works+0.036
fire+0.036
www+0.036
Neg logit contributions
 hugged-0.013
 played-0.013
 named-0.012
 both-0.012
 loved-0.012
Token comfortable
Token ID6792
Feature activation+7.49e-03

Pos logit contributions
 cliff+0.002
fire+0.001
 plunge+0.001
www+0.001
 brew+0.001
Neg logit contributions
 hugged-0.001
 named-0.001
 played-0.001
 both-0.001
 His-0.001
Token.
Token ID13
Feature activation-6.48e-03

Pos logit contributions
 hugged+0.004
 named+0.003
 both+0.003
 played+0.003
 loved+0.003
Neg logit contributions
 cliff-0.023
www-0.022
works-0.022
 plunge-0.021
fire-0.021
Token
Token ID198
Feature activation-5.27e-03

Pos logit contributions
 cliff+0.019
www+0.018
 plunge+0.018
works+0.018
fire+0.018
Neg logit contributions
 hugged-0.004
 named-0.004
 played-0.004
 loved-0.003
 both-0.003
Token
Token ID198
Feature activation-1.95e-02

Pos logit contributions
 cliff+0.017
works+0.016
www+0.016
 plunge+0.016
fire+0.016
Neg logit contributions
 hugged-0.002
 named-0.002
 both-0.002
 loved-0.002
 played-0.002
TokenThe
Token ID464
Feature activation-2.21e-02

Pos logit contributions
 cliff+0.052
works+0.050
 plunge+0.050
 brew+0.049
 inconvenience+0.049
Neg logit contributions
 hugged-0.018
 named-0.017
 both-0.017
 learned-0.017
 loved-0.016
Token oct
Token ID19318
Feature activation-1.24e-01

Pos logit contributions
 cliff+0.041
www+0.039
works+0.039
iders+0.038
fire+0.038
Neg logit contributions
 hugged-0.037
 named-0.037
 played-0.036
 loved-0.036
 learned-0.035
Tokenopus
Token ID25790
Feature activation-6.53e-02

Pos logit contributions
 cliff+0.162
 plunge+0.158
 brew+0.157
works+0.153
 inconvenience+0.144
Neg logit contributions
 named-0.267
 hugged-0.266
 loved-0.262
 played-0.257
 and-0.253
Token felt
Token ID2936
Feature activation+1.00e-02

Pos logit contributions
 cliff+0.167
fire+0.157
www+0.156
works+0.156
 plunge+0.155
Neg logit contributions
 hugged-0.062
 named-0.060
 played-0.058
 both-0.057
 loved-0.055
Token so
Token ID523
Feature activation-1.41e-02

Pos logit contributions
 named+0.012
ossible+0.011
 played+0.011
 hugged+0.011
my+0.011
Neg logit contributions
 cliff-0.022
 plunge-0.021
fire-0.020
 brew-0.020
www-0.020
Token relieved
Token ID22598
Feature activation+2.33e-02

Pos logit contributions
 cliff+0.036
 plunge+0.034
fire+0.034
www+0.034
 brew+0.033
Neg logit contributions
 named-0.013
 hugged-0.012
 played-0.012
 both-0.011
 sincere-0.010
Token and
Token ID290
Feature activation+7.48e-03

Pos logit contributions
 hugged+0.016
 named+0.015
 played+0.014
 loved+0.014
 both+0.013
Neg logit contributions
 cliff-0.066
fire-0.063
 plunge-0.063
www-0.062
works-0.062
Token!
Token ID0
Feature activation-7.79e-03

Pos logit contributions
 hugged+0.008
 named+0.007
 played+0.007
 learned+0.006
 loved+0.006
Neg logit contributions
 cliff-0.040
fire-0.039
 plunge-0.038
works-0.038
 brew-0.037
Token
Token ID220
Feature activation-9.70e-04

Pos logit contributions
 cliff+0.025
works+0.024
www+0.024
 plunge+0.023
 brew+0.023
Neg logit contributions
 hugged-0.003
 both-0.003
 named-0.003
 His-0.003
 played-0.003
Token
Token ID198
Feature activation-5.64e-03

Pos logit contributions
 cliff+0.003
www+0.003
works+0.003
 plunge+0.003
fire+0.003
Neg logit contributions
 hugged-0.001
 named-0.001
 played-0.000
 loved-0.000
 both-0.000
Token
Token ID198
Feature activation-1.81e-02

Pos logit contributions
 cliff+0.019
works+0.018
 plunge+0.018
www+0.018
storm+0.018
Neg logit contributions
 hugged-0.001
 named-0.001
 both-0.001
 loved-0.001
 His-0.001
TokenThe
Token ID464
Feature activation-2.10e-02

Pos logit contributions
 cliff+0.047
works+0.045
 plunge+0.045
 brew+0.043
www+0.043
Neg logit contributions
 hugged-0.018
 named-0.017
 both-0.017
 loved-0.016
 learned-0.016
Token oct
Token ID19318
Feature activation-1.24e-01

Pos logit contributions
iders+0.036
www+0.035
 cliff+0.035
works+0.034
 outage+0.034
Neg logit contributions
 hugged-0.037
 named-0.037
 both-0.036
 played-0.036
 loved-0.036
Tokenopus
Token ID25790
Feature activation-5.81e-02

Pos logit contributions
 brew+0.168
 cliff+0.168
 plunge+0.165
works+0.159
 inconvenience+0.155
Neg logit contributions
 named-0.261
 hugged-0.257
 loved-0.256
 played-0.247
 and-0.244
Token knew
Token ID2993
Feature activation-2.40e-02

Pos logit contributions
 cliff+0.152
www+0.143
works+0.143
 plunge+0.142
fire+0.141
Neg logit contributions
 hugged-0.056
 named-0.053
 played-0.052
 loved-0.051
 both-0.050
Token right
Token ID826
Feature activation-8.85e-03

Pos logit contributions
 cliff+0.061
 plunge+0.057
fire+0.057
www+0.056
works+0.056
Neg logit contributions
 hugged-0.023
 played-0.023
 named-0.021
 loved-0.021
 both-0.021
Token away
Token ID1497
Feature activation-1.47e-02

Pos logit contributions
 cliff+0.023
www+0.021
works+0.021
fire+0.021
 plunge+0.021
Neg logit contributions
 hugged-0.009
 named-0.009
 played-0.009
 both-0.008
 loved-0.008
Token that
Token ID326
Feature activation-2.83e-02

Pos logit contributions
 cliff+0.041
www+0.038
fire+0.038
works+0.038
 plunge+0.038
Neg logit contributions
 hugged-0.011
 named-0.011
 played-0.010
 loved-0.010
 both-0.010
Token
Token ID198
Feature activation-3.18e-03

Pos logit contributions
 cliff+0.002
works+0.002
fire+0.002
www+0.002
 plunge+0.002
Neg logit contributions
 hugged-0.000
 named-0.000
 played-0.000
 loved-0.000
 both-0.000
Token
Token ID198
Feature activation-1.61e-02

Pos logit contributions
 cliff+0.009
www+0.009
fire+0.009
 plunge+0.009
works+0.009
Neg logit contributions
 hugged-0.002
 named-0.002
 played-0.002
 both-0.002
 loved-0.001
TokenFinally
Token ID11158
Feature activation-3.05e-02

Pos logit contributions
 cliff+0.036
www+0.034
works+0.034
 plunge+0.034
fire+0.034
Neg logit contributions
 hugged-0.021
 named-0.020
 played-0.020
 both-0.020
 loved-0.019
Token,
Token ID11
Feature activation-5.18e-02

Pos logit contributions
 cliff+0.087
 plunge+0.081
fire+0.081
www+0.080
 site+0.080
Neg logit contributions
 hugged-0.021
 loved-0.019
 played-0.018
 sincere-0.017
 named-0.017
Token the
Token ID262
Feature activation-3.05e-02

Pos logit contributions
 cliff+0.146
works+0.142
www+0.142
inner+0.139
 plunge+0.139
Neg logit contributions
 both-0.038
 hugged-0.037
 named-0.036
 played-0.035
 his-0.035
Token oct
Token ID19318
Feature activation-1.23e-01

Pos logit contributions
 cliff+0.050
iders+0.050
www+0.050
works+0.049
eway+0.048
Neg logit contributions
 hugged-0.055
 named-0.054
 played-0.054
 both-0.054
 loved-0.053
Tokenopus
Token ID25790
Feature activation-5.87e-02

Pos logit contributions
 cliff+0.160
 brew+0.155
 plunge+0.155
works+0.149
 inconvenience+0.143
Neg logit contributions
 named-0.266
 hugged-0.264
 loved-0.260
 played-0.255
 and-0.250
Token and
Token ID290
Feature activation+1.09e-02

Pos logit contributions
 cliff+0.154
www+0.145
works+0.144
 plunge+0.144
fire+0.143
Neg logit contributions
 hugged-0.055
 named-0.052
 played-0.051
 both-0.050
 loved-0.050
Token his
Token ID465
Feature activation+1.36e-02

Pos logit contributions
 hugged+0.008
 named+0.008
 played+0.007
 loved+0.007
 both+0.007
Neg logit contributions
 cliff-0.031
www-0.029
 plunge-0.029
works-0.029
fire-0.029
Token friends
Token ID2460
Feature activation-2.86e-03

Pos logit contributions
 hugged+0.004
 named+0.004
 played+0.003
 both+0.003
 loved+0.003
Neg logit contributions
 cliff-0.045
www-0.043
works-0.043
fire-0.042
 plunge-0.042
Token managed
Token ID5257
Feature activation-2.81e-02

Pos logit contributions
 cliff+0.007
fire+0.007
 plunge+0.007
www+0.007
works+0.007
Neg logit contributions
 His-0.002
 sincere-0.002
 named-0.002
 hugged-0.002
 his-0.002
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
 cliff+0.023
fire+0.022
www+0.022
 plunge+0.022
works+0.022
Neg logit contributions
 hugged-0.004
 named-0.004
 played-0.004
 loved-0.004
 both-0.003
Token From
Token ID3574
Feature activation+2.00e-02

Pos logit contributions
 cliff+0.034
www+0.033
fire+0.032
 plunge+0.032
works+0.032
Neg logit contributions
 hugged-0.010
 named-0.010
 played-0.009
 loved-0.009
 both-0.009
Token then
Token ID788
Feature activation-4.01e-02

Pos logit contributions
 hugged+0.017
 named+0.017
 played+0.016
 both+0.016
 loved+0.016
Neg logit contributions
 cliff-0.054
www-0.051
works-0.051
 plunge-0.051
fire-0.051
Token on
Token ID319
Feature activation-1.74e-02

Pos logit contributions
 cliff+0.115
works+0.109
fire+0.109
 plunge+0.108
www+0.108
Neg logit contributions
 hugged-0.028
 named-0.027
 played-0.026
 both-0.025
 loved-0.025
Token the
Token ID262
Feature activation-4.11e-02

Pos logit contributions
 cliff+0.051
www+0.048
 plunge+0.047
fire+0.047
works+0.047
Neg logit contributions
 hugged-0.011
 played-0.010
 named-0.009
 loved-0.009
 learned-0.008
Token bad
Token ID2089
Feature activation-1.23e-01

Pos logit contributions
works+0.093
 cliff+0.086
www+0.085
 outage+0.084
fire+0.084
Neg logit contributions
 his-0.056
 learned-0.054
 named-0.054
 played-0.053
 hugged-0.053
Tokenger
Token ID1362
Feature activation-7.33e-02

Pos logit contributions
works+0.183
 cliff+0.174
www+0.165
eway+0.165
 outage+0.164
Neg logit contributions
 his-0.247
 and-0.244
 named-0.242
 at-0.242
 played-0.241
Token stopped
Token ID5025
Feature activation-2.56e-02

Pos logit contributions
 cliff+0.201
www+0.190
works+0.189
 plunge+0.189
fire+0.188
Neg logit contributions
 hugged-0.059
 named-0.055
 played-0.053
 both-0.052
 loved-0.052
Token taking
Token ID2263
Feature activation-3.78e-02

Pos logit contributions
 cliff+0.063
 plunge+0.059
www+0.059
fire+0.059
works+0.058
Neg logit contributions
 hugged-0.028
 named-0.026
 played-0.026
 both-0.025
 loved-0.025
Token Jack
Token ID3619
Feature activation-2.33e-02

Pos logit contributions
 cliff+0.106
www+0.100
works+0.099
 plunge+0.099
fire+0.099
Neg logit contributions
 hugged-0.029
 named-0.027
 played-0.027
 both-0.026
 loved-0.026
Token's
Token ID338
Feature activation-3.82e-03

Pos logit contributions
 cliff+0.065
www+0.062
 plunge+0.061
fire+0.061
works+0.061
Neg logit contributions
 hugged-0.017
 named-0.016
 played-0.016
 loved-0.015
 both-0.015
Token relax
Token ID8960
Feature activation+7.15e-03

Pos logit contributions
 hugged+0.013
 named+0.012
 played+0.012
 both+0.012
 his+0.011
Neg logit contributions
 cliff-0.019
works-0.019
umm-0.019
iders-0.019
 brew-0.019
Token!"
Token ID2474
Feature activation+4.37e-03

Pos logit contributions
 and+0.009
 both+0.009
.+0.009
 at+0.008
 named+0.008
Neg logit contributions
 cliff-0.017
works-0.016
 brew-0.016
 plunge-0.016
www-0.016
Token
Token ID198
Feature activation-3.10e-03

Pos logit contributions
 hugged+0.003
 named+0.003
 both+0.003
 his+0.003
 played+0.003
Neg logit contributions
 cliff-0.013
works-0.012
inner-0.012
www-0.012
umm-0.012
Token
Token ID198
Feature activation-1.43e-02

Pos logit contributions
 cliff+0.010
works+0.010
www+0.010
 plunge+0.010
umm+0.010
Neg logit contributions
 hugged-0.001
 named-0.001
 loved-0.001
 played-0.001
 both-0.001
TokenThe
Token ID464
Feature activation-2.34e-02

Pos logit contributions
 cliff+0.040
works+0.040
umm+0.039
inner+0.038
 plunge+0.038
Neg logit contributions
 hugged-0.012
 named-0.012
 learned-0.011
 played-0.010
 loved-0.010
Token oct
Token ID19318
Feature activation-1.23e-01

Pos logit contributions
iders+0.043
www+0.043
 cliff+0.042
storm+0.041
works+0.041
Neg logit contributions
 named-0.039
 hugged-0.039
 loved-0.038
 played-0.037
 and-0.037
Tokenopus
Token ID25790
Feature activation-5.88e-02

Pos logit contributions
 cliff+0.166
 plunge+0.163
 brew+0.163
works+0.157
rated+0.153
Neg logit contributions
 named-0.263
 hugged-0.257
 loved-0.253
 played-0.245
 and-0.243
Token felt
Token ID2936
Feature activation+6.16e-03

Pos logit contributions
 cliff+0.145
fire+0.136
www+0.135
 plunge+0.135
works+0.134
Neg logit contributions
 hugged-0.061
 named-0.059
 played-0.057
 both-0.057
 loved-0.056
Token a
Token ID257
Feature activation+1.24e-02

Pos logit contributions
 played+0.006
 hugged+0.006
 named+0.006
ossible+0.006
 both+0.006
Neg logit contributions
 cliff-0.014
 plunge-0.014
fire-0.014
 brew-0.013
www-0.013
Token bit
Token ID1643
Feature activation-3.64e-02

Pos logit contributions
 hugged+0.017
 named+0.016
 played+0.016
 both+0.016
 loved+0.016
Neg logit contributions
 cliff-0.027
www-0.025
works-0.025
 plunge-0.025
fire-0.025
Token nervous
Token ID10927
Feature activation+1.14e-03

Pos logit contributions
 cliff+0.078
 plunge+0.074
www+0.073
fire+0.073
 brew+0.072
Neg logit contributions
 hugged-0.048
 played-0.047
 named-0.047
 loved-0.045
 both-0.044
Tokenâ
Token ID22940
Feature activation-5.47e-02

Pos logit contributions
 cliff+0.037
works+0.035
www+0.035
 plunge+0.035
fire+0.035
Neg logit contributions
 hugged-0.016
 named-0.015
 played-0.015
 both-0.015
 loved-0.014
Token
Token ID26391
Feature activation-4.58e-02

Pos logit contributions
 cliff+0.071
 plunge+0.060
 brew+0.059
works+0.057
fire+0.057
Neg logit contributions
 hugged-0.129
 named-0.126
 played-0.124
 loved-0.122
 learned-0.122
Token
Token ID198
Feature activation-1.45e-02

Pos logit contributions
 cliff+0.094
www+0.089
fire+0.086
 plunge+0.083
works+0.083
Neg logit contributions
 hugged-0.067
 named-0.064
 sincere-0.063
 played-0.063
 both-0.063
Token
Token ID198
Feature activation-2.37e-02

Pos logit contributions
 cliff+0.045
works+0.043
 plunge+0.043
www+0.043
fire+0.042
Neg logit contributions
 hugged-0.007
 named-0.006
 played-0.005
 both-0.005
 loved-0.005
TokenThe
Token ID464
Feature activation-3.20e-02

Pos logit contributions
 cliff+0.056
works+0.053
 plunge+0.051
 brew+0.050
fire+0.050
Neg logit contributions
 hugged-0.030
 named-0.029
 both-0.028
 played-0.028
 loved-0.027
Token le
Token ID443
Feature activation-1.23e-01

Pos logit contributions
eway+0.069
 cliff+0.069
works+0.069
 outage+0.068
iders+0.067
Neg logit contributions
 named-0.047
 played-0.044
 loved-0.043
 hugged-0.042
 learned-0.041
Tokenopard
Token ID15478
Feature activation-5.73e-02

Pos logit contributions
 cliff+0.170
 plunge+0.155
 brew+0.150
works+0.148
eway+0.145
Neg logit contributions
 named-0.272
 hugged-0.266
 played-0.263
 loved-0.259
 liked-0.254
Token started
Token ID2067
Feature activation-2.05e-02

Pos logit contributions
 cliff+0.152
works+0.144
 plunge+0.143
www+0.143
fire+0.141
Neg logit contributions
 hugged-0.056
 named-0.053
 played-0.052
 loved-0.050
 both-0.048
Token walking
Token ID6155
Feature activation-1.98e-02

Pos logit contributions
 cliff+0.051
www+0.048
 plunge+0.048
fire+0.047
works+0.047
Neg logit contributions
 hugged-0.022
 named-0.021
 played-0.020
 both-0.020
 loved-0.020
Token around
Token ID1088
Feature activation+8.47e-04

Pos logit contributions
 cliff+0.053
www+0.051
works+0.051
 brew+0.050
fire+0.050
Neg logit contributions
 hugged-0.017
 named-0.016
 both-0.016
 played-0.016
 loved-0.015
Token the
Token ID262
Feature activation-3.02e-02

Pos logit contributions
 named+0.000
 and+0.000
 both+0.000
 hugged+0.000
 his+0.000
Neg logit contributions
 cliff-0.003
www-0.003
works-0.003
inner-0.002
 plunge-0.002
Token that
Token ID326
Feature activation-4.54e-02

Pos logit contributions
 hugged+0.023
 played+0.022
 named+0.022
 both+0.021
 loved+0.021
Neg logit contributions
 cliff-0.057
www-0.053
fire-0.053
 plunge-0.052
works-0.052
Token day
Token ID1110
Feature activation-4.22e-02

Pos logit contributions
 cliff+0.084
www+0.077
works+0.077
 plunge+0.076
fire+0.076
Neg logit contributions
 hugged-0.078
 named-0.076
 played-0.075
 loved-0.074
 both-0.074
Token on
Token ID319
Feature activation-2.28e-02

Pos logit contributions
 cliff+0.102
www+0.099
 plunge+0.096
fire+0.095
works+0.093
Neg logit contributions
 hugged-0.044
 played-0.042
 named-0.041
 loved-0.041
 sincere-0.040
Token,
Token ID11
Feature activation-5.33e-02

Pos logit contributions
 cliff+0.068
www+0.063
fire+0.063
 plunge+0.063
works+0.062
Neg logit contributions
 hugged-0.014
 played-0.012
 named-0.012
 loved-0.011
 both-0.011
Token the
Token ID262
Feature activation-3.17e-02

Pos logit contributions
 cliff+0.147
www+0.137
fire+0.137
 plunge+0.137
works+0.136
Neg logit contributions
 hugged-0.044
 named-0.040
 played-0.039
 loved-0.038
 both-0.037
Token oct
Token ID19318
Feature activation-1.23e-01

Pos logit contributions
 cliff+0.049
works+0.047
www+0.046
fire+0.046
eway+0.046
Neg logit contributions
 hugged-0.062
 named-0.061
 loved-0.060
 both-0.060
 played-0.060
Tokenopus
Token ID25790
Feature activation-5.85e-02

Pos logit contributions
 cliff+0.159
 plunge+0.154
 brew+0.154
works+0.152
 inconvenience+0.146
Neg logit contributions
 named-0.267
 hugged-0.267
 loved-0.264
 played-0.257
 and-0.255
Token realized
Token ID6939
Feature activation-5.46e-03

Pos logit contributions
 cliff+0.157
fire+0.147
www+0.144
 brew+0.142
works+0.142
Neg logit contributions
 hugged-0.045
 named-0.042
 both-0.041
 sincere-0.040
 played-0.039
Token that
Token ID326
Feature activation-2.60e-02

Pos logit contributions
 cliff+0.014
www+0.013
fire+0.013
 plunge+0.012
works+0.012
Neg logit contributions
 hugged-0.006
 played-0.005
 named-0.005
 both-0.005
 loved-0.005
Token it
Token ID340
Feature activation-4.07e-02

Pos logit contributions
 cliff+0.068
www+0.064
works+0.063
 plunge+0.063
fire+0.063
Neg logit contributions
 hugged-0.025
 named-0.024
 played-0.023
 both-0.023
 loved-0.022
Token's
Token ID338
Feature activation-3.71e-03

Pos logit contributions
 cliff+0.107
www+0.101
works+0.101
fire+0.100
 plunge+0.100
Neg logit contributions
 hugged-0.036
 named-0.035
 played-0.034
 both-0.033
 loved-0.032
Token pushed
Token ID7121
Feature activation-2.03e-02

Pos logit contributions
 cliff+0.118
fire+0.110
www+0.110
 plunge+0.109
works+0.109
Neg logit contributions
 hugged-0.045
 named-0.043
 played-0.042
 both-0.041
 loved-0.040
Token and
Token ID290
Feature activation+1.65e-02

Pos logit contributions
 cliff+0.061
works+0.058
 outage+0.058
www+0.058
fire+0.057
Neg logit contributions
 both-0.013
 hugged-0.012
 his-0.012
 and-0.012
 named-0.012
Token pulled
Token ID5954
Feature activation-2.49e-02

Pos logit contributions
 hugged+0.018
 played+0.017
 named+0.016
 both+0.016
 learned+0.015
Neg logit contributions
 cliff-0.043
works-0.042
inner-0.042
iders-0.041
www-0.041
Token on
Token ID319
Feature activation-1.65e-02

Pos logit contributions
 cliff+0.073
www+0.069
works+0.069
 plunge+0.069
fire+0.069
Neg logit contributions
 hugged-0.016
 named-0.015
 played-0.014
 both-0.014
 loved-0.013
Token the
Token ID262
Feature activation-2.77e-02

Pos logit contributions
 cliff+0.044
 plunge+0.041
fire+0.040
 brew+0.040
works+0.040
Neg logit contributions
 hugged-0.016
 named-0.015
 played-0.014
 loved-0.014
 learned-0.013
Token seaw
Token ID32085
Feature activation-1.22e-01

Pos logit contributions
works+0.055
 cliff+0.055
www+0.055
fire+0.054
eway+0.054
Neg logit contributions
 hugged-0.041
 both-0.040
 loved-0.039
 played-0.038
 his-0.038
Tokeneed
Token ID2308
Feature activation-2.89e-02

Pos logit contributions
 cliff+0.204
works+0.187
 plunge+0.184
www+0.184
fire+0.180
Neg logit contributions
 hugged-0.230
 named-0.223
 played-0.220
 both-0.219
 loved-0.218
Token until
Token ID1566
Feature activation-2.28e-02

Pos logit contributions
 cliff+0.089
www+0.085
works+0.085
 plunge+0.084
fire+0.084
Neg logit contributions
 hugged-0.014
 named-0.012
 both-0.012
 played-0.012
 loved-0.011
Token finally
Token ID3443
Feature activation-3.61e-02

Pos logit contributions
 cliff+0.059
www+0.055
 plunge+0.055
works+0.055
fire+0.055
Neg logit contributions
 hugged-0.023
 named-0.022
 played-0.021
 loved-0.020
 both-0.020
Token,
Token ID11
Feature activation-3.48e-02

Pos logit contributions
 cliff+0.099
 plunge+0.092
www+0.090
fire+0.090
 brew+0.089
Neg logit contributions
 hugged-0.030
 named-0.029
 loved-0.026
 played-0.026
 sincere-0.025
Token the
Token ID262
Feature activation-2.74e-02

Pos logit contributions
 cliff+0.093
works+0.091
www+0.090
fire+0.088
 plunge+0.088
Neg logit contributions
 hugged-0.030
 both-0.028
 named-0.028
 played-0.027
 loved-0.027
Token then
Token ID788
Feature activation-3.69e-02

Pos logit contributions
 hugged+0.011
 named+0.011
 played+0.011
 both+0.010
 loved+0.010
Neg logit contributions
 cliff-0.034
www-0.032
works-0.032
 plunge-0.032
fire-0.032
Token on
Token ID319
Feature activation-1.33e-02

Pos logit contributions
 cliff+0.100
www+0.095
 plunge+0.094
works+0.093
fire+0.093
Neg logit contributions
 hugged-0.031
 named-0.029
 played-0.028
 both-0.028
 loved-0.027
Token,
Token ID11
Feature activation-5.46e-02

Pos logit contributions
 cliff+0.041
www+0.039
 plunge+0.039
works+0.039
fire+0.039
Neg logit contributions
 hugged-0.006
 named-0.005
 played-0.005
 loved-0.005
 both-0.005
Token whenever
Token ID8797
Feature activation-3.14e-02

Pos logit contributions
 cliff+0.151
www+0.142
works+0.142
 plunge+0.141
fire+0.141
Neg logit contributions
 hugged-0.044
 named-0.042
 played-0.040
 both-0.039
 loved-0.039
Token the
Token ID262
Feature activation-3.11e-02

Pos logit contributions
 cliff+0.089
works+0.089
inner+0.088
www+0.087
fire+0.087
Neg logit contributions
 named-0.021
 his-0.020
 both-0.020
 played-0.020
 hugged-0.019
Token le
Token ID443
Feature activation-1.22e-01

Pos logit contributions
 cliff+0.062
works+0.060
eway+0.059
 plunge+0.059
 outage+0.059
Neg logit contributions
 named-0.049
 played-0.048
 hugged-0.048
 loved-0.047
 both-0.046
Tokenopard
Token ID15478
Feature activation-7.56e-02

Pos logit contributions
 cliff+0.168
 plunge+0.155
works+0.148
 brew+0.147
www+0.146
Neg logit contributions
 named-0.271
 hugged-0.266
 played-0.264
 loved-0.260
 both-0.256
Token had
Token ID550
Feature activation-3.39e-03

Pos logit contributions
 cliff+0.206
 plunge+0.198
works+0.198
www+0.197
fire+0.194
Neg logit contributions
 hugged-0.066
 played-0.065
 named-0.062
 loved-0.062
 learned-0.061
Token an
Token ID281
Feature activation-2.62e-02

Pos logit contributions
 cliff+0.009
www+0.008
 plunge+0.008
fire+0.008
works+0.008
Neg logit contributions
 hugged-0.003
 named-0.003
 played-0.003
 loved-0.003
 both-0.003
Token important
Token ID1593
Feature activation-2.70e-02

Pos logit contributions
 cliff+0.048
works+0.044
www+0.044
fire+0.044
 plunge+0.044
Neg logit contributions
 hugged-0.045
 named-0.044
 played-0.043
 both-0.043
 loved-0.043
Token task
Token ID4876
Feature activation-6.11e-02

Pos logit contributions
works+0.048
 cliff+0.047
iders+0.046
fire+0.044
 plunge+0.044
Neg logit contributions
 loved-0.048
 and-0.047
 named-0.046
 learned-0.045
 played-0.045
Token pace
Token ID8761
Feature activation-1.06e-03

Pos logit contributions
 cliff+0.000
www+0.000
works+0.000
inner+0.000
 plunge+0.000
Neg logit contributions
 named-0.000
 hugged-0.000
 loved-0.000
 played-0.000
 and-0.000
Token".
Token ID1911
Feature activation+4.26e-03

Pos logit contributions
 cliff+0.003
www+0.003
 plunge+0.003
works+0.003
 brew+0.003
Neg logit contributions
 hugged-0.001
 named-0.001
 both-0.001
 loved-0.001
 played-0.001
Token
Token ID198
Feature activation-2.25e-03

Pos logit contributions
 hugged+0.002
 named+0.002
 played+0.002
 both+0.002
 loved+0.002
Neg logit contributions
 cliff-0.013
www-0.012
works-0.012
 plunge-0.012
fire-0.012
Token
Token ID198
Feature activation-1.44e-02

Pos logit contributions
 cliff+0.007
 plunge+0.007
works+0.007
www+0.007
 brew+0.007
Neg logit contributions
 hugged-0.001
 named-0.001
 loved-0.001
 both-0.001
 played-0.001
TokenThe
Token ID464
Feature activation-2.15e-02

Pos logit contributions
 cliff+0.038
 plunge+0.037
works+0.036
www+0.035
 brew+0.035
Neg logit contributions
 hugged-0.014
 named-0.013
 learned-0.013
 both-0.013
 loved-0.013
Token oct
Token ID19318
Feature activation-1.22e-01

Pos logit contributions
 cliff+0.036
www+0.036
iders+0.036
 outage+0.036
eway+0.036
Neg logit contributions
 named-0.038
 hugged-0.038
 loved-0.037
 played-0.037
 both-0.037
Tokenopus
Token ID25790
Feature activation-5.66e-02

Pos logit contributions
 plunge+0.174
 brew+0.174
 cliff+0.172
works+0.170
rated+0.165
Neg logit contributions
 named-0.246
 loved-0.237
 hugged-0.237
 and-0.231
 at-0.229
Token was
Token ID373
Feature activation-1.10e-03

Pos logit contributions
 cliff+0.150
www+0.140
fire+0.140
works+0.139
 plunge+0.139
Neg logit contributions
 hugged-0.048
 named-0.046
 played-0.044
 both-0.044
 loved-0.043
Token encouraged
Token ID10085
Feature activation-1.26e-02

Pos logit contributions
 cliff+0.003
fire+0.002
 brew+0.002
 plunge+0.002
 casc+0.002
Neg logit contributions
 played-0.001
 hugged-0.001
ossible-0.001
 named-0.001
 His-0.001
Token by
Token ID416
Feature activation-9.30e-03

Pos logit contributions
 cliff+0.038
www+0.036
fire+0.036
works+0.036
 plunge+0.036
Neg logit contributions
 hugged-0.006
 named-0.005
 played-0.005
 loved-0.005
 both-0.005
Token the
Token ID262
Feature activation-2.71e-02

Pos logit contributions
 cliff+0.024
www+0.023
works+0.022
 plunge+0.022
fire+0.022
Neg logit contributions
 hugged-0.009
 named-0.009
 played-0.009
 both-0.008
 loved-0.008
Token now
Token ID783
Feature activation+9.75e-03

Pos logit contributions
 hugged+0.004
 both+0.004
 named+0.004
 played+0.004
 loved+0.004
Neg logit contributions
 cliff-0.010
www-0.009
 plunge-0.009
works-0.009
fire-0.009
Token!"
Token ID2474
Feature activation+3.91e-03

Pos logit contributions
 hugged+0.012
 named+0.011
 both+0.011
 played+0.011
 loved+0.011
Neg logit contributions
 cliff-0.023
works-0.022
 plunge-0.022
fire-0.021
www-0.021
Token
Token ID198
Feature activation+1.48e-03

Pos logit contributions
 hugged+0.002
 named+0.002
 played+0.002
 both+0.002
 loved+0.002
Neg logit contributions
 cliff-0.012
www-0.011
 plunge-0.011
works-0.011
fire-0.011
Token
Token ID198
Feature activation-1.62e-02

Pos logit contributions
 hugged+0.001
 named+0.001
 both+0.001
 played+0.001
 loved+0.001
Neg logit contributions
 cliff-0.005
works-0.004
www-0.004
 plunge-0.004
fire-0.004
TokenThe
Token ID464
Feature activation-3.10e-02

Pos logit contributions
 cliff+0.043
works+0.042
 plunge+0.041
 inconvenience+0.041
umm+0.040
Neg logit contributions
 both-0.015
 hugged-0.015
 learned-0.014
 named-0.014
 loved-0.014
Token bad
Token ID2089
Feature activation-1.22e-01

Pos logit contributions
fire+0.059
works+0.059
inner+0.059
www+0.059
 outage+0.058
Neg logit contributions
 friends-0.049
 loved-0.049
 sincere-0.049
 both-0.047
 named-0.047
Token creature
Token ID7185
Feature activation-5.14e-02

Pos logit contributions
 plunge+0.181
fire+0.178
vation+0.175
www+0.174
inner+0.172
Neg logit contributions
.-0.243
 and-0.242
,-0.237
 loved-0.236
 liked-0.228
Token was
Token ID373
Feature activation-3.59e-03

Pos logit contributions
 cliff+0.133
www+0.124
fire+0.124
 plunge+0.123
works+0.123
Neg logit contributions
 hugged-0.048
 named-0.046
 played-0.045
 both-0.044
 loved-0.044
Token so
Token ID523
Feature activation-1.71e-02

Pos logit contributions
 cliff+0.008
fire+0.008
www+0.008
 brew+0.008
 plunge+0.008
Neg logit contributions
 played-0.004
 hugged-0.004
 named-0.003
 loved-0.003
 His-0.003
Token angry
Token ID7954
Feature activation-1.38e-02

Pos logit contributions
 cliff+0.045
works+0.042
www+0.042
 plunge+0.042
fire+0.042
Neg logit contributions
 hugged-0.017
 named-0.016
 played-0.016
 both-0.015
 loved-0.015
Token that
Token ID326
Feature activation-4.38e-02

Pos logit contributions
 cliff+0.040
www+0.038
works+0.038
 plunge+0.037
fire+0.037
Neg logit contributions
 hugged-0.009
 named-0.009
 played-0.008
 both-0.008
 loved-0.008
Token of
Token ID286
Feature activation+2.09e-02

Pos logit contributions
 hugged+0.027
 played+0.026
 named+0.026
 loved+0.026
 sincere+0.025
Neg logit contributions
 cliff-0.067
fire-0.064
www-0.063
works-0.063
inner-0.063
Token Tim
Token ID5045
Feature activation+1.81e-03

Pos logit contributions
 hugged+0.017
 named+0.015
 played+0.015
 loved+0.014
 learned+0.014
Neg logit contributions
 cliff-0.057
 plunge-0.054
 brew-0.053
fire-0.053
works-0.053
Token and
Token ID290
Feature activation+6.27e-02

Pos logit contributions
 sincere+0.001
 hugged+0.001
 named+0.001
 His+0.001
 loved+0.001
Neg logit contributions
 cliff-0.005
fire-0.004
 plunge-0.004
 brew-0.004
works-0.004
Token his
Token ID465
Feature activation+3.84e-02

Pos logit contributions
 hugged+0.053
 named+0.052
 both+0.050
 played+0.049
 loved+0.049
Neg logit contributions
 cliff-0.163
www-0.154
works-0.151
fire-0.151
 plunge-0.151
Token dad
Token ID9955
Feature activation+2.62e-02

Pos logit contributions
 hugged+0.044
 named+0.042
 played+0.041
 both+0.040
 loved+0.040
Neg logit contributions
 cliff-0.092
 plunge-0.085
fire-0.085
www-0.085
works-0.085
Token with
Token ID351
Feature activation+5.30e-02

Pos logit contributions
 sincere+0.018
ossible+0.016
 named+0.016
 His+0.015
 okay+0.014
Neg logit contributions
fire-0.066
 cliff-0.065
 brew-0.063
 plunge-0.061
works-0.061
Token the
Token ID262
Feature activation+1.61e-02

Pos logit contributions
 named+0.059
 hugged+0.053
 loved+0.052
 Her+0.050
 sincere+0.050
Neg logit contributions
 cliff-0.114
 plunge-0.108
 brew-0.107
 stairs-0.104
 casc-0.100
Token passport
Token ID17981
Feature activation+1.33e-02

Pos logit contributions
 hugged+0.025
 named+0.024
 played+0.024
 both+0.023
 loved+0.023
Neg logit contributions
 cliff-0.033
 plunge-0.030
www-0.030
works-0.030
fire-0.030
Token.
Token ID13
Feature activation-6.77e-03

Pos logit contributions
 hugged+0.006
 named+0.006
 played+0.006
 loved+0.006
 learned+0.005
Neg logit contributions
 cliff-0.040
fire-0.038
works-0.038
 plunge-0.037
 site-0.037
Token Then
Token ID3244
Feature activation-3.26e-02

Pos logit contributions
 cliff+0.017
www+0.016
works+0.016
 site+0.016
 plunge+0.016
Neg logit contributions
 hugged-0.007
 learned-0.006
 named-0.006
 played-0.006
 loved-0.006
Token Tim
Token ID5045
Feature activation-3.20e-02

Pos logit contributions
 cliff+0.074
 plunge+0.069
www+0.069
 stair+0.065
 stairs+0.064
Neg logit contributions
 hugged-0.037
 loved-0.036
 sincere-0.035
 learned-0.035
 named-0.034
Token,
Token ID11
Feature activation+2.62e-04

Pos logit contributions
 hugged+0.047
 named+0.043
 learned+0.043
 played+0.042
 both+0.042
Neg logit contributions
 cliff-0.092
 plunge-0.084
www-0.083
fire-0.082
 stair-0.082
Token c
Token ID269
Feature activation+4.87e-03

Pos logit contributions
 hugged+0.000
 named+0.000
 played+0.000
 learned+0.000
ossible+0.000
Neg logit contributions
 cliff-0.001
www-0.001
 plunge-0.001
fire-0.001
 brew-0.001
Tokenudd
Token ID4185
Feature activation+2.90e-03

Pos logit contributions
 hugged+0.009
 named+0.009
 played+0.009
 both+0.009
 learned+0.009
Neg logit contributions
 cliff-0.008
www-0.008
inner-0.008
works-0.007
fire-0.007
Tokenly
Token ID306
Feature activation+5.42e-03

Pos logit contributions
 hugged+0.005
 named+0.005
 played+0.005
 both+0.005
 loved+0.004
Neg logit contributions
 cliff-0.006
www-0.005
works-0.005
 plunge-0.005
fire-0.005
Token t
Token ID256
Feature activation-1.49e-03

Pos logit contributions
 hugged+0.006
 named+0.005
 played+0.005
 both+0.005
 loved+0.005
Neg logit contributions
 cliff-0.014
www-0.013
 plunge-0.013
works-0.013
fire-0.013
Tokeneddy
Token ID21874
Feature activation+5.33e-02

Pos logit contributions
 cliff+0.003
www+0.003
works+0.003
fire+0.003
 plunge+0.003
Neg logit contributions
 hugged-0.002
 named-0.002
 played-0.002
 learned-0.002
 loved-0.002
Token bear
Token ID6842
Feature activation+1.87e-02

Pos logit contributions
 sincere+0.075
 hugged+0.075
 His+0.073
 played+0.071
 learned+0.071
Neg logit contributions
 cliff-0.101
www-0.095
works-0.092
fire-0.092
 plunge-0.091
Token.
Token ID13
Feature activation-5.09e-03

Pos logit contributions
 hugged+0.012
 sincere+0.011
 learned+0.010
 named+0.010
 both+0.010
Neg logit contributions
 cliff-0.052
fire-0.050
www-0.049
works-0.049
 plunge-0.048
Token
Token ID198
Feature activation+1.79e-03

Pos logit contributions
 cliff+0.013
www+0.012
 stair+0.012
 plunge+0.012
 brew+0.012
Neg logit contributions
 hugged-0.005
 learned-0.004
 sincere-0.004
 played-0.004
 loved-0.004
Token
Token ID198
Feature activation-9.80e-03

Pos logit contributions
 hugged+0.001
 played+0.001
 named+0.001
 both+0.001
 loved+0.001
Neg logit contributions
 cliff-0.005
www-0.005
fire-0.005
 plunge-0.005
works-0.005
TokenWhen
Token ID2215
Feature activation-5.15e-02

Pos logit contributions
 cliff+0.022
works+0.021
 plunge+0.021
www+0.021
fire+0.021
Neg logit contributions
 hugged-0.012
 named-0.012
 played-0.012
 both-0.012
 loved-0.012
Token She
Token ID1375
Feature activation+2.17e-02

Pos logit contributions
 hugged+0.005
 named+0.004
 played+0.004
 both+0.004
 loved+0.004
Neg logit contributions
 cliff-0.011
www-0.010
 brew-0.010
fire-0.010
eway-0.010
Token took
Token ID1718
Feature activation-1.11e-02

Pos logit contributions
 sincere+0.020
 His+0.020
 both+0.019
 named+0.019
 hugged+0.018
Neg logit contributions
 cliff-0.050
fire-0.047
 brew-0.046
www-0.046
 site-0.045
Token off
Token ID572
Feature activation+5.83e-03

Pos logit contributions
 cliff+0.032
www+0.030
works+0.030
fire+0.030
 plunge+0.029
Neg logit contributions
 hugged-0.009
 named-0.008
 both-0.008
 played-0.008
 loved-0.007
Token her
Token ID607
Feature activation+3.29e-02

Pos logit contributions
 hugged+0.004
 sincere+0.004
 loved+0.004
 played+0.004
 named+0.004
Neg logit contributions
 cliff-0.016
 stairs-0.015
fire-0.015
 brew-0.015
 stair-0.015
Token hat
Token ID6877
Feature activation+6.01e-02

Pos logit contributions
 hugged+0.040
 named+0.039
 played+0.038
 both+0.038
 loved+0.038
Neg logit contributions
 cliff-0.076
 plunge-0.071
fire-0.070
works-0.069
www-0.069
Token and
Token ID290
Feature activation+7.06e-02

Pos logit contributions
 sincere+0.039
 loved+0.031
 played+0.031
 hugged+0.031
 named+0.030
Neg logit contributions
 cliff-0.167
fire-0.163
storm-0.157
 stair-0.157
 plunge-0.156
Token looked
Token ID3114
Feature activation+2.28e-02

Pos logit contributions
 sincere+0.065
 named+0.063
 loved+0.061
 both+0.060
 hugged+0.060
Neg logit contributions
 cliff-0.172
www-0.160
fire-0.160
 plunge-0.160
 brew-0.158
Token at
Token ID379
Feature activation+1.54e-02

Pos logit contributions
 His+0.031
 played+0.028
 sincere+0.028
 named+0.027
 rode+0.027
Neg logit contributions
 cliff-0.043
 plunge-0.041
www-0.039
fire-0.039
 stairs-0.038
Token her
Token ID607
Feature activation+4.21e-02

Pos logit contributions
 hugged+0.011
 named+0.010
 played+0.010
 loved+0.010
 both+0.009
Neg logit contributions
 cliff-0.044
 plunge-0.041
www-0.041
works-0.041
fire-0.041
Token hair
Token ID4190
Feature activation+8.14e-02

Pos logit contributions
 hugged+0.039
 named+0.037
 played+0.036
 learned+0.035
 His+0.035
Neg logit contributions
 cliff-0.109
fire-0.100
 brew-0.100
 plunge-0.099
works-0.099
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
 hugged+0.042
 named+0.041
 played+0.037
 loved+0.034
 His+0.033
Neg logit contributions
 cliff-0.242
fire-0.230
works-0.227
 brew-0.226
 stair-0.224
Token you
Token ID345
Feature activation-1.44e-03

Pos logit contributions
 cliff+0.084
www+0.078
works+0.077
 plunge+0.077
fire+0.077
Neg logit contributions
 hugged-0.064
 named-0.063
 played-0.062
 loved-0.060
 both-0.060
Token can
Token ID460
Feature activation-2.38e-02

Pos logit contributions
 cliff+0.004
fire+0.003
works+0.003
 plunge+0.003
www+0.003
Neg logit contributions
 hugged-0.001
 named-0.001
 played-0.001
 loved-0.001
 His-0.001
Token also
Token ID635
Feature activation+3.12e-02

Pos logit contributions
 cliff+0.055
 plunge+0.052
 brew+0.051
www+0.050
works+0.050
Neg logit contributions
 hugged-0.026
 named-0.025
 His-0.024
 played-0.024
 loved-0.024
Token play
Token ID711
Feature activation+2.57e-03

Pos logit contributions
 hugged+0.040
 named+0.038
 played+0.038
 loved+0.037
 both+0.037
Neg logit contributions
 cliff-0.070
www-0.065
fire-0.065
 plunge-0.065
works-0.064
Token with
Token ID351
Feature activation+1.93e-03

Pos logit contributions
 hugged+0.002
 named+0.002
 played+0.002
 loved+0.002
 both+0.002
Neg logit contributions
 cliff-0.007
fire-0.007
www-0.007
works-0.007
 plunge-0.007
Token Elsa
Token ID19226
Feature activation+6.08e-02

Pos logit contributions
 hugged+0.002
 named+0.002
 played+0.002
 loved+0.002
 learned+0.002
Neg logit contributions
 cliff-0.005
www-0.004
 plunge-0.004
works-0.004
 brew-0.004
Token.
Token ID13
Feature activation-7.38e-03

Pos logit contributions
 hugged+0.044
 named+0.041
 His+0.040
 played+0.038
 sincere+0.036
Neg logit contributions
 cliff-0.163
fire-0.157
works-0.156
 brew-0.152
www-0.152
Token She
Token ID1375
Feature activation+2.66e-02

Pos logit contributions
 cliff+0.016
www+0.016
works+0.016
fire+0.015
 brew+0.015
Neg logit contributions
 hugged-0.009
 played-0.009
 loved-0.008
 named-0.008
 both-0.008
Token is
Token ID318
Feature activation+1.89e-02

Pos logit contributions
 named+0.037
 hugged+0.037
 His+0.035
 played+0.035
 his+0.035
Neg logit contributions
 cliff-0.053
www-0.050
works-0.049
 plunge-0.049
 brew-0.048
Token very
Token ID845
Feature activation-5.57e-03

Pos logit contributions
 His+0.022
 hugged+0.022
 played+0.021
ossible+0.021
 named+0.020
Neg logit contributions
 cliff-0.039
 brew-0.038
www-0.037
fire-0.036
 plunge-0.035
Token nice
Token ID3621
Feature activation+2.82e-02

Pos logit contributions
 cliff+0.012
www+0.012
fire+0.011
 brew+0.011
inner+0.011
Neg logit contributions
 played-0.007
 hugged-0.007
 named-0.006
 His-0.006
 both-0.006
Token He
Token ID679
Feature activation+6.87e-03

Pos logit contributions
 hugged+0.001
 played+0.001
 learned+0.001
 forg+0.001
 sincere+0.001
Neg logit contributions
 cliff-0.002
works-0.002
fire-0.002
eway-0.002
www-0.002
Token was
Token ID373
Feature activation+2.09e-02

Pos logit contributions
 sincere+0.006
 His+0.006
 his+0.006
 at+0.005
 both+0.005
Neg logit contributions
 cliff-0.016
www-0.015
fire-0.015
works-0.015
 brew-0.015
Token nice
Token ID3621
Feature activation+3.11e-02

Pos logit contributions
 hugged+0.019
ossible+0.019
 played+0.018
 His+0.018
 eyes+0.017
Neg logit contributions
 cliff-0.047
 brew-0.044
www-0.044
fire-0.043
Fre-0.043
Token.
Token ID13
Feature activation+5.87e-03

Pos logit contributions
 hugged+0.020
 sincere+0.019
 His+0.017
 named+0.017
 played+0.016
Neg logit contributions
 cliff-0.085
fire-0.081
 stairs-0.078
 stair-0.078
 outage-0.078
Token They
Token ID1119
Feature activation+1.82e-02

Pos logit contributions
 hugged+0.007
 played+0.006
 learned+0.006
 loved+0.006
 forg+0.006
Neg logit contributions
 cliff-0.012
works-0.012
www-0.012
fire-0.012
eway-0.012
Token waved
Token ID26834
Feature activation+5.55e-02

Pos logit contributions
 His+0.016
 sincere+0.016
 eyes+0.015
 his+0.014
 okay+0.013
Neg logit contributions
 cliff-0.042
www-0.038
fire-0.038
works-0.038
storm-0.037
Token at
Token ID379
Feature activation+6.81e-03

Pos logit contributions
 named+0.067
 sincere+0.062
 wore+0.058
my+0.057
 learned+0.056
Neg logit contributions
 cliff-0.103
 plunge-0.100
guard-0.094
mer-0.094
www-0.092
Token him
Token ID683
Feature activation+2.40e-02

Pos logit contributions
 hugged+0.007
 learned+0.006
 played+0.006
 named+0.006
 loved+0.006
Neg logit contributions
 cliff-0.017
 plunge-0.016
works-0.016
 brew-0.015
fire-0.015
Token and
Token ID290
Feature activation+6.62e-02

Pos logit contributions
 hugged+0.015
 named+0.015
 played+0.014
 sincere+0.014
 learned+0.013
Neg logit contributions
 cliff-0.065
fire-0.063
works-0.062
 brew-0.061
 plunge-0.059
Token ran
Token ID4966
Feature activation-2.21e-02

Pos logit contributions
 hugged+0.056
 named+0.054
 both+0.052
 played+0.052
 loved+0.051
Neg logit contributions
 cliff-0.175
www-0.164
fire-0.164
works-0.163
 plunge-0.163
Token to
Token ID284
Feature activation-2.61e-02

Pos logit contributions
 cliff+0.051
fire+0.048
 plunge+0.047
www+0.047
works+0.046
Neg logit contributions
 hugged-0.026
 named-0.026
 loved-0.025
 sincere-0.025
 played-0.025
Token,
Token ID11
Feature activation-1.30e-02

Pos logit contributions
 hugged+0.010
 named+0.009
 both+0.009
 played+0.009
 loved+0.009
Neg logit contributions
 cliff-0.015
www-0.014
 stair-0.014
 plunge-0.013
 slope-0.013
Token wide
Token ID3094
Feature activation-1.07e-02

Pos logit contributions
 cliff+0.028
www+0.026
 plunge+0.026
 stair+0.025
fire+0.025
Neg logit contributions
 hugged-0.017
 named-0.017
 played-0.016
 both-0.016
 loved-0.015
Token smile
Token ID8212
Feature activation+5.32e-02

Pos logit contributions
 cliff+0.023
www+0.021
 stair+0.020
 plunge+0.020
 slope+0.020
Neg logit contributions
 hugged-0.016
 named-0.015
 both-0.014
 played-0.014
 loved-0.014
Token whenever
Token ID8797
Feature activation-6.06e-03

Pos logit contributions
 sincere+0.045
 hugged+0.043
 both+0.038
 learned+0.038
 named+0.036
Neg logit contributions
 cliff-0.133
storm-0.128
www-0.128
fire-0.126
works-0.124
Token he
Token ID339
Feature activation-2.46e-02

Pos logit contributions
 cliff+0.015
www+0.014
 plunge+0.014
fire+0.014
works+0.014
Neg logit contributions
 hugged-0.006
 named-0.006
 learned-0.005
 both-0.005
 sincere-0.005
Token played
Token ID2826
Feature activation+7.28e-03

Pos logit contributions
 cliff+0.066
www+0.062
 plunge+0.061
 brew+0.060
fire+0.060
Neg logit contributions
 hugged-0.018
 named-0.017
 both-0.017
 sincere-0.017
 His-0.015
Token.
Token ID13
Feature activation+2.38e-02

Pos logit contributions
 hugged+0.010
 kissed+0.009
 named+0.008
ossible+0.008
 stro+0.008
Neg logit contributions
 cliff-0.015
fire-0.014
 plunge-0.014
 site-0.014
inner-0.014
Token
Token ID198
Feature activation+2.20e-02

Pos logit contributions
 hugged+0.030
 sincere+0.027
 learned+0.026
 kissed+0.024
 both+0.024
Neg logit contributions
 cliff-0.053
 site-0.050
 stair-0.049
works-0.049
 stairs-0.049
Token
Token ID198
Feature activation+5.00e-03

Pos logit contributions
 hugged+0.019
 played+0.016
 sincere+0.016
 named+0.015
 both+0.015
Neg logit contributions
 cliff-0.059
www-0.055
fire-0.055
eway-0.055
 plunge-0.054
TokenOne
Token ID3198
Feature activation-3.98e-02

Pos logit contributions
 hugged+0.009
 sincere+0.008
 eyes+0.008
 kissed+0.008
 learned+0.008
Neg logit contributions
 cliff-0.009
www-0.009
 site-0.008
 slope-0.008
storm-0.008
Token day
Token ID1110
Feature activation-1.89e-02

Pos logit contributions
 cliff+0.064
www+0.058
fire+0.056
inner+0.056
 plunge+0.055
Neg logit contributions
 hugged-0.078
 sincere-0.074
 named-0.073
 both-0.073
 played-0.072
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
 hugged+0.005
 learned+0.004
 played+0.004
 sincere+0.004
 loved+0.003
Neg logit contributions
 cliff-0.020
fire-0.019
works-0.019
 stair-0.019
 plunge-0.018
Token One
Token ID1881
Feature activation-3.78e-02

Pos logit contributions
 hugged+0.016
 sincere+0.013
 learned+0.013
 named+0.013
 played+0.012
Neg logit contributions
 cliff-0.034
 stairs-0.032
 stair-0.032
www-0.031
works-0.031
Token day
Token ID1110
Feature activation-2.45e-02

Pos logit contributions
 cliff+0.069
www+0.063
works+0.063
 plunge+0.063
fire+0.062
Neg logit contributions
 hugged-0.066
 named-0.064
 played-0.063
 both-0.063
 loved-0.062
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
 cliff+0.067
www+0.065
 plunge+0.065
works+0.062
inner+0.062
Neg logit contributions
 hugged-0.018
 sincere-0.016
 learned-0.016
 named-0.015
 played-0.015
Token they
Token ID484
Feature activation+1.12e-02

Pos logit contributions
 cliff+0.023
www+0.022
 plunge+0.022
works+0.021
inner+0.021
Neg logit contributions
 hugged-0.013
 named-0.012
 sincere-0.012
 learned-0.012
 played-0.012
Token saw
Token ID2497
Feature activation+1.21e-02

Pos logit contributions
 His+0.011
 sincere+0.011
 hugged+0.011
 hair+0.010
 his+0.010
Neg logit contributions
 cliff-0.027
 plunge-0.024
www-0.024
works-0.024
 stair-0.023
Token a
Token ID257
Feature activation-2.17e-02

Pos logit contributions
 hugged+0.009
 named+0.008
 played+0.007
 both+0.007
 loved+0.007
Neg logit contributions
 cliff-0.034
 plunge-0.032
works-0.032
www-0.032
fire-0.032
Token big
Token ID1263
Feature activation-1.34e-02

Pos logit contributions
eway+0.054
keeping+0.053
works+0.053
iders+0.053
lo+0.052
Neg logit contributions
 named-0.024
 loved-0.022
 and-0.022
 still-0.020
,-0.020
Token snow
Token ID6729
Feature activation-3.12e-03

Pos logit contributions
works+0.028
eway+0.028
fire+0.027
iders+0.027
 plunge+0.027
Neg logit contributions
 named-0.020
 and-0.019
,-0.019
 hugged-0.018
 at-0.018
Tokenman
Token ID805
Feature activation+2.43e-02

Pos logit contributions
 cliff+0.005
www+0.005
 plunge+0.005
Fre+0.005
works+0.005
Neg logit contributions
 named-0.006
 and-0.005
 played-0.005
.-0.005
 hugged-0.005
Token in
Token ID287
Feature activation+2.17e-02

Pos logit contributions
 hugged+0.023
 learned+0.021
 loved+0.020
 His+0.020
 sincere+0.020
Neg logit contributions
 cliff-0.060
fire-0.058
 plunge-0.056
 brew-0.055
storm-0.055
Token buy
Token ID2822
Feature activation-1.20e-02

Pos logit contributions
 both+0.007
 hugged+0.007
 named+0.007
 played+0.007
 loved+0.006
Neg logit contributions
 cliff-0.013
www-0.013
eway-0.012
works-0.012
iders-0.012
Token those
Token ID883
Feature activation-9.91e-03

Pos logit contributions
 cliff+0.035
iders+0.033
www+0.033
fire+0.033
eway+0.033
Neg logit contributions
 both-0.009
 hugged-0.008
 his-0.008
 at-0.007
 named-0.007
Token bright
Token ID6016
Feature activation+3.62e-02

Pos logit contributions
 cliff+0.022
www+0.021
storm+0.020
 slope+0.020
eway+0.020
Neg logit contributions
 both-0.014
 at-0.014
 loved-0.013
 and-0.013
 hugged-0.013
Token fireworks
Token ID26056
Feature activation+3.03e-02

Pos logit contributions
 hugged+0.056
 named+0.055
 played+0.054
 both+0.054
 loved+0.054
Neg logit contributions
 cliff-0.072
www-0.067
fire-0.066
 plunge-0.066
works-0.065
Token?"
Token ID1701
Feature activation+3.13e-02

Pos logit contributions
 and+0.035
 at+0.034
.+0.033
 both+0.033
,+0.032
Neg logit contributions
 cliff-0.076
eway-0.073
 plunge-0.072
www-0.072
 brew-0.071
Token Tim
Token ID5045
Feature activation+5.52e-04

Pos logit contributions
 hugged+0.029
 learned+0.027
 both+0.026
 sincere+0.026
 played+0.026
Neg logit contributions
 cliff-0.078
www-0.075
 plunge-0.073
fire-0.071
 brew-0.071
Tokenmy
Token ID1820
Feature activation+2.30e-02

Pos logit contributions
 hugged+0.000
 named+0.000
 played+0.000
 sincere+0.000
 both+0.000
Neg logit contributions
 cliff-0.001
www-0.001
works-0.001
 stair-0.001
 plunge-0.001
Token asked
Token ID1965
Feature activation+2.10e-02

Pos logit contributions
 named+0.027
 hugged+0.026
 sincere+0.026
 played+0.025
 both+0.025
Neg logit contributions
 cliff-0.049
www-0.047
 brew-0.045
 plunge-0.044
 site-0.044
Token.
Token ID13
Feature activation+2.70e-02

Pos logit contributions
 hugged+0.017
 learned+0.017
 played+0.017
 sincere+0.016
 hearts+0.015
Neg logit contributions
fire-0.048
eway-0.047
 site-0.046
 brew-0.046
 stair-0.046
Token His
Token ID2399
Feature activation+4.57e-02

Pos logit contributions
 hugged+0.018
 named+0.017
 both+0.016
 learned+0.016
 played+0.016
Neg logit contributions
 cliff-0.075
fire-0.071
www-0.070
works-0.070
 brew-0.070
Token mom
Token ID1995
Feature activation+3.54e-02

Pos logit contributions
 hugged+0.038
 named+0.037
 learned+0.034
 played+0.034
 loved+0.031
Neg logit contributions
 cliff-0.119
 plunge-0.111
www-0.110
 stair-0.110
works-0.109
Token's
Token ID338
Feature activation+4.07e-03

Pos logit contributions
 cliff+0.057
fire+0.055
works+0.055
eway+0.054
www+0.053
Neg logit contributions
 hugged-0.009
 both-0.008
 named-0.008
 played-0.008
 loved-0.007
Token try
Token ID1949
Feature activation-9.89e-03

Pos logit contributions
 hugged+0.006
 named+0.005
 played+0.005
 both+0.005
 loved+0.005
Neg logit contributions
 cliff-0.009
www-0.008
works-0.008
 plunge-0.008
fire-0.008
Token it
Token ID340
Feature activation-1.58e-02

Pos logit contributions
 cliff+0.025
www+0.023
 plunge+0.023
works+0.023
fire+0.023
Neg logit contributions
 hugged-0.011
 named-0.010
 played-0.010
 loved-0.010
 both-0.009
Token,"
Token ID553
Feature activation+2.58e-02

Pos logit contributions
 cliff+0.045
umm+0.042
 inconvenience+0.042
 plunge+0.042
www+0.042
Neg logit contributions
 and-0.016
.-0.016
 both-0.015
 at-0.014
 his-0.013
Token Ben
Token ID3932
Feature activation-1.62e-02

Pos logit contributions
 named+0.025
 sincere+0.023
 hugged+0.023
 loved+0.022
 learned+0.022
Neg logit contributions
 cliff-0.063
 plunge-0.061
www-0.060
inner-0.058
 brew-0.058
Token said
Token ID531
Feature activation+1.51e-02

Pos logit contributions
 cliff+0.041
www+0.039
works+0.039
 plunge+0.039
fire+0.039
Neg logit contributions
 hugged-0.016
 named-0.016
 played-0.015
 both-0.015
 loved-0.015
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
 named+0.010
 played+0.010
 loved+0.010
 learned+0.009
 hugged+0.009
Neg logit contributions
fire-0.039
 plunge-0.038
 cliff-0.037
works-0.037
 brew-0.037
Token He
Token ID679
Feature activation+1.29e-02

Pos logit contributions
 hugged+0.010
 named+0.010
 played+0.009
 loved+0.009
 both+0.009
Neg logit contributions
 cliff-0.050
www-0.047
fire-0.047
works-0.047
 plunge-0.047
Token climbed
Token ID19952
Feature activation-1.65e-02

Pos logit contributions
 hugged+0.013
 named+0.012
 played+0.012
 both+0.012
 loved+0.012
Neg logit contributions
 cliff-0.032
www-0.030
fire-0.030
works-0.030
 plunge-0.030
Token down
Token ID866
Feature activation-1.30e-02

Pos logit contributions
 cliff+0.038
www+0.037
works+0.037
fire+0.036
 inconvenience+0.036
Neg logit contributions
 hugged-0.020
 both-0.020
 named-0.020
 and-0.019
 his-0.019
Token the
Token ID262
Feature activation-1.78e-02

Pos logit contributions
 cliff+0.041
www+0.041
works+0.040
 plunge+0.039
storm+0.039
Neg logit contributions
 and-0.006
 his-0.005
 hugged-0.005
 both-0.005
 named-0.005
Token was
Token ID373
Feature activation+3.74e-03

Pos logit contributions
 cliff+0.007
www+0.006
works+0.006
 brew+0.006
fire+0.006
Neg logit contributions
 named-0.002
 hugged-0.002
 His-0.002
 played-0.002
 both-0.002
Token so
Token ID523
Feature activation-1.16e-02

Pos logit contributions
 His+0.004
 hugged+0.004
 named+0.004
 played+0.004
ossible+0.004
Neg logit contributions
 cliff-0.008
 brew-0.007
 plunge-0.007
fire-0.007
 site-0.007
Token excited
Token ID6568
Feature activation-6.56e-03

Pos logit contributions
 cliff+0.031
www+0.029
fire+0.028
 plunge+0.028
 brew+0.028
Neg logit contributions
 named-0.009
 hugged-0.009
 played-0.009
 His-0.008
 both-0.008
Token to
Token ID284
Feature activation-3.94e-02

Pos logit contributions
 cliff+0.019
 plunge+0.018
works+0.018
fire+0.018
www+0.018
Neg logit contributions
 hugged-0.005
 named-0.004
 played-0.004
 loved-0.004
 both-0.004
Token look
Token ID804
Feature activation-5.86e-04

Pos logit contributions
 cliff+0.088
www+0.082
works+0.081
fire+0.081
 plunge+0.081
Neg logit contributions
 hugged-0.053
 named-0.051
 played-0.050
 both-0.050
 loved-0.049
Token back
Token ID736
Feature activation+1.16e-02

Pos logit contributions
 cliff+0.001
 plunge+0.001
www+0.001
inner+0.001
 brew+0.001
Neg logit contributions
 played-0.001
 named-0.001
 His-0.001
 hugged-0.001
 sincere-0.000
Token at
Token ID379
Feature activation+7.76e-03

Pos logit contributions
 hugged+0.008
 named+0.008
 played+0.007
 loved+0.007
 learned+0.007
Neg logit contributions
 cliff-0.033
www-0.031
 plunge-0.031
works-0.030
 brew-0.030
Token all
Token ID477
Feature activation-2.00e-03

Pos logit contributions
 hugged+0.006
 named+0.006
 played+0.005
 loved+0.005
 learned+0.005
Neg logit contributions
 cliff-0.022
 plunge-0.020
works-0.020
 brew-0.020
www-0.020
Token of
Token ID286
Feature activation+7.68e-03

Pos logit contributions
 cliff+0.005
www+0.005
 plunge+0.005
works+0.005
 brew+0.005
Neg logit contributions
 hugged-0.002
 named-0.002
 played-0.002
 loved-0.002
 both-0.002
Token her
Token ID607
Feature activation+2.48e-02

Pos logit contributions
 hugged+0.005
 named+0.005
 played+0.005
 loved+0.004
 learned+0.004
Neg logit contributions
 cliff-0.022
 plunge-0.021
 brew-0.021
www-0.021
works-0.021
Token adventures
Token ID17545
Feature activation-1.30e-02

Pos logit contributions
 hugged+0.032
 named+0.031
 played+0.030
 both+0.029
 loved+0.029
Neg logit contributions
 cliff-0.056
works-0.052
www-0.052
 plunge-0.051
fire-0.051
Token wise
Token ID10787
Feature activation-7.72e-03

Pos logit contributions
 hugged+0.022
 named+0.021
 played+0.021
 both+0.020
 loved+0.020
Neg logit contributions
 cliff-0.034
 plunge-0.032
fire-0.031
www-0.031
works-0.031
Token words
Token ID2456
Feature activation+3.93e-02

Pos logit contributions
 cliff+0.014
www+0.013
works+0.013
fire+0.012
 plunge+0.012
Neg logit contributions
 hugged-0.014
 named-0.013
 played-0.013
 loved-0.013
 both-0.013
Token,
Token ID11
Feature activation+5.62e-03

Pos logit contributions
 played+0.027
 sincere+0.027
 hugged+0.026
 named+0.025
 loved+0.022
Neg logit contributions
fire-0.103
 cliff-0.102
works-0.100
 plunge-0.098
 outage-0.097
Token and
Token ID290
Feature activation+2.60e-02

Pos logit contributions
 sincere+0.004
 named+0.004
 hugged+0.004
 played+0.004
 both+0.004
Neg logit contributions
 cliff-0.014
www-0.013
fire-0.013
 plunge-0.013
 brew-0.013
Token for
Token ID329
Feature activation-2.26e-02

Pos logit contributions
 hugged+0.023
 named+0.022
 played+0.021
 both+0.021
 loved+0.021
Neg logit contributions
 cliff-0.070
www-0.065
works-0.065
 plunge-0.065
fire-0.065
Token the
Token ID262
Feature activation-2.23e-02

Pos logit contributions
 cliff+0.059
www+0.056
works+0.056
 plunge+0.056
fire+0.055
Neg logit contributions
 hugged-0.021
 named-0.020
 played-0.020
 both-0.019
 loved-0.019
Token jam
Token ID16853
Feature activation-2.40e-02

Pos logit contributions
 cliff+0.048
works+0.045
www+0.045
fire+0.045
eway+0.044
Neg logit contributions
 hugged-0.031
 named-0.030
 loved-0.029
 played-0.029
 learned-0.029
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
 cliff+0.072
www+0.068
works+0.067
 plunge+0.067
fire+0.067
Neg logit contributions
 hugged-0.014
 named-0.013
 played-0.012
 both-0.012
 loved-0.012
Token<|endoftext|>
Token ID50256
Feature activation-3.57e-02

Pos logit contributions
 cliff+0.033
www+0.032
fire+0.030
 plunge+0.030
inner+0.030
Neg logit contributions
 hugged-0.011
 played-0.010
 named-0.010
 loved-0.009
 sincere-0.009
TokenThere
Token ID1858
Feature activation-3.29e-03

Pos logit contributions
 cliff+0.076
www+0.071
works+0.071
 plunge+0.071
fire+0.070
Neg logit contributions
 hugged-0.050
 named-0.049
 played-0.048
 loved-0.047
 both-0.047
Token was
Token ID373
Feature activation-1.08e-02

Pos logit contributions
 cliff+0.008
 plunge+0.007
www+0.007
works+0.007
 stair+0.007
Neg logit contributions
 hugged-0.004
 named-0.003
 sincere-0.003
 learned-0.003
 both-0.003
Token "
Token ID366
Feature activation-2.20e-02

Pos logit contributions
 cliff+0.044
www+0.041
works+0.041
 plunge+0.041
fire+0.041
Neg logit contributions
 hugged-0.013
 named-0.012
 played-0.012
 both-0.012
 loved-0.012
TokenMaybe
Token ID13300
Feature activation-5.20e-02

Pos logit contributions
 cliff+0.048
 plunge+0.043
 site+0.042
 outage+0.041
 inconvenience+0.041
Neg logit contributions
 both-0.033
 played-0.033
 loved-0.033
 hugged-0.032
 liked-0.032
Token we
Token ID356
Feature activation+2.20e-03

Pos logit contributions
 cliff+0.127
storm+0.121
fire+0.121
www+0.119
 brew+0.119
Neg logit contributions
 both-0.060
 and-0.060
 at-0.059
 named-0.058
.-0.058
Token can
Token ID460
Feature activation-3.56e-02

Pos logit contributions
 hugged+0.002
 both+0.002
 played+0.002
 named+0.002
 loved+0.002
Neg logit contributions
 cliff-0.006
www-0.005
fire-0.005
 plunge-0.005
works-0.005
Token play
Token ID711
Feature activation-3.10e-03

Pos logit contributions
 cliff+0.080
www+0.076
works+0.076
fire+0.074
 plunge+0.074
Neg logit contributions
 both-0.049
 hugged-0.047
 played-0.045
 named-0.045
 and-0.045
Token with
Token ID351
Feature activation-1.02e-02

Pos logit contributions
 cliff+0.009
 plunge+0.008
www+0.008
works+0.008
 outage+0.008
Neg logit contributions
 both-0.003
 and-0.002
 at-0.002
 named-0.002
 loved-0.002
Token the
Token ID262
Feature activation+2.01e-03

Pos logit contributions
 cliff+0.026
fire+0.024
www+0.024
 plunge+0.024
works+0.024
Neg logit contributions
 both-0.012
 hugged-0.010
 and-0.010
 named-0.010
 at-0.010
Token numbers
Token ID3146
Feature activation+1.03e-02

Pos logit contributions
 hugged+0.003
 both+0.003
 loved+0.003
 played+0.003
 named+0.003
Neg logit contributions
 cliff-0.004
www-0.004
fire-0.004
 plunge-0.003
works-0.003
Token in
Token ID287
Feature activation+4.86e-03

Pos logit contributions
 both+0.010
 and+0.010
 at+0.009
.+0.009
 hugged+0.009
Neg logit contributions
 cliff-0.028
 plunge-0.026
 brew-0.026
www-0.026
fire-0.026
Token a
Token ID257
Feature activation-9.77e-03

Pos logit contributions
 both+0.004
 his+0.004
 named+0.004
 hugged+0.003
 and+0.003
Neg logit contributions
 cliff-0.014
 plunge-0.013
works-0.013
www-0.013
fire-0.013
Token different
Token ID1180
Feature activation-8.45e-03

Pos logit contributions
 cliff+0.020
works+0.020
 plunge+0.020
iders+0.020
eway+0.019
Neg logit contributions
 named-0.014
 hugged-0.014
 both-0.014
 loved-0.013
 learned-0.013
Token He
Token ID679
Feature activation-7.13e-03

Pos logit contributions
 hugged+0.014
 learned+0.012
 named+0.012
 played+0.012
 both+0.012
Neg logit contributions
 cliff-0.037
 stair-0.035
 stairs-0.035
 slope-0.034
fire-0.034
Token saw
Token ID2497
Feature activation+9.45e-03

Pos logit contributions
 cliff+0.018
fire+0.017
www+0.016
 brew+0.016
 stair+0.016
Neg logit contributions
 hugged-0.006
 sincere-0.006
 both-0.005
 named-0.005
 His-0.005
Token something
Token ID1223
Feature activation-1.71e-02

Pos logit contributions
 hugged+0.007
 named+0.007
 played+0.006
 both+0.006
 learned+0.006
Neg logit contributions
 cliff-0.026
 plunge-0.025
www-0.024
works-0.024
fire-0.024
Token colorful
Token ID20239
Feature activation+3.70e-02

Pos logit contributions
 cliff+0.038
www+0.036
works+0.036
 plunge+0.035
fire+0.035
Neg logit contributions
 hugged-0.022
 named-0.022
 played-0.021
 both-0.021
 loved-0.021
Token and
Token ID290
Feature activation+2.53e-02

Pos logit contributions
 hugged+0.024
 named+0.021
 played+0.021
 both+0.020
 loved+0.020
Neg logit contributions
 cliff-0.108
works-0.101
www-0.101
 plunge-0.101
fire-0.101
Token grabbed
Token ID13176
Feature activation-1.25e-02

Pos logit contributions
 hugged+0.036
 named+0.035
 played+0.034
 both+0.033
 loved+0.033
Neg logit contributions
 cliff-0.054
www-0.050
 plunge-0.050
works-0.050
fire-0.050
Token it
Token ID340
Feature activation+6.74e-03

Pos logit contributions
 cliff+0.032
 plunge+0.031
fire+0.030
www+0.030
 brew+0.030
Neg logit contributions
 hugged-0.012
 named-0.011
 played-0.011
 loved-0.010
 learned-0.010
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
 hugged+0.005
 played+0.004
 named+0.004
 learned+0.004
 sincere+0.004
Neg logit contributions
 cliff-0.019
 plunge-0.018
fire-0.018
works-0.018
eway-0.017
Token The
Token ID383
Feature activation+8.04e-03

Pos logit contributions
 hugged+0.013
 named+0.012
 played+0.012
 learned+0.012
 sincere+0.011
Neg logit contributions
 cliff-0.032
 stairs-0.030
 stair-0.030
 site-0.030
 slope-0.030
Token colorful
Token ID20239
Feature activation-2.48e-02

Pos logit contributions
 hugged+0.012
 named+0.012
 loved+0.012
 both+0.011
 played+0.011
Neg logit contributions
 cliff-0.016
www-0.016
 brew-0.015
fire-0.015
 outage-0.015
Token thing
Token ID1517
Feature activation-4.58e-02

Pos logit contributions
 cliff+0.050
www+0.047
works+0.047
fire+0.047
 plunge+0.046
Neg logit contributions
 hugged-0.038
 named-0.037
 played-0.036
 loved-0.036
 both-0.035
Token on
Token ID319
Feature activation-6.54e-03

Pos logit contributions
 cliff+0.051
 plunge+0.047
www+0.047
fire+0.047
works+0.045
Neg logit contributions
 hugged-0.020
 played-0.019
 named-0.019
 His-0.018
 loved-0.018
Token top
Token ID1353
Feature activation-1.94e-02

Pos logit contributions
 cliff+0.017
www+0.016
works+0.016
 plunge+0.016
fire+0.016
Neg logit contributions
 hugged-0.006
 named-0.006
 both-0.006
 played-0.006
 loved-0.006
Token of
Token ID286
Feature activation-2.54e-03

Pos logit contributions
 cliff+0.058
works+0.057
 plunge+0.057
www+0.056
fire+0.056
Neg logit contributions
 and-0.011
.-0.010
 named-0.010
 both-0.010
 hugged-0.009
Token a
Token ID257
Feature activation-2.82e-02

Pos logit contributions
 cliff+0.007
www+0.007
works+0.007
fire+0.007
 plunge+0.007
Neg logit contributions
 hugged-0.002
 both-0.002
 named-0.002
 played-0.002
 loved-0.002
Token big
Token ID1263
Feature activation-1.74e-02

Pos logit contributions
works+0.066
eway+0.063
iders+0.062
 plunge+0.060
inner+0.059
Neg logit contributions
 named-0.036
.-0.036
 and-0.035
 loved-0.034
 still-0.034
Token tree
Token ID5509
Feature activation-9.62e-03

Pos logit contributions
works+0.035
iders+0.032
fire+0.032
eway+0.032
 plunge+0.032
Neg logit contributions
 named-0.029
 and-0.028
 played-0.028
 hugged-0.028
,-0.027
Token!
Token ID0
Feature activation+2.91e-03

Pos logit contributions
 cliff+0.030
fire+0.028
www+0.028
 plunge+0.028
works+0.028
Neg logit contributions
 hugged-0.004
 played-0.003
 named-0.003
 loved-0.003
 both-0.003
Token True
Token ID6407
Feature activation-1.99e-02

Pos logit contributions
 hugged+0.004
 named+0.004
 played+0.004
 both+0.004
 loved+0.004
Neg logit contributions
 cliff-0.006
works-0.006
www-0.006
 plunge-0.006
fire-0.006
Token didn
Token ID1422
Feature activation-3.65e-02

Pos logit contributions
 cliff+0.051
www+0.047
fire+0.047
 plunge+0.047
works+0.046
Neg logit contributions
 hugged-0.019
 named-0.019
 played-0.018
 both-0.017
 loved-0.017
Token't
Token ID470
Feature activation-3.31e-02

Pos logit contributions
 cliff+0.069
works+0.069
fire+0.067
www+0.065
umm+0.065
Neg logit contributions
 both-0.063
 and-0.060
 still-0.059
 his-0.058
.-0.058
Token know
Token ID760
Feature activation+1.24e-02

Pos logit contributions
 cliff+0.078
www+0.072
 plunge+0.072
fire+0.072
works+0.071
Neg logit contributions
 hugged-0.040
 named-0.039
 played-0.038
 loved-0.037
 both-0.037
Token you
Token ID345
Feature activation+4.38e-03

Pos logit contributions
 hugged+0.014
 named+0.013
 played+0.013
 learned+0.013
 His+0.013
Neg logit contributions
 cliff-0.010
works-0.009
 plunge-0.009
www-0.009
fire-0.009
Token were
Token ID547
Feature activation+1.88e-02

Pos logit contributions
 named+0.004
 His+0.004
 sincere+0.004
 hugged+0.004
 his+0.003
Neg logit contributions
works-0.010
 cliff-0.010
fire-0.010
inner-0.010
 plunge-0.010
Token just
Token ID655
Feature activation+5.25e-02

Pos logit contributions
 hugged+0.024
 His+0.023
 named+0.023
 played+0.023
ossible+0.022
Neg logit contributions
 cliff-0.040
 brew-0.038
fire-0.038
works-0.037
 plunge-0.037
Token playing
Token ID2712
Feature activation+8.81e-03

Pos logit contributions
 hugged+0.051
 named+0.049
 played+0.048
 both+0.048
 loved+0.047
Neg logit contributions
 cliff-0.137
www-0.129
works-0.128
fire-0.128
 plunge-0.127
Token.
Token ID13
Feature activation-3.55e-03

Pos logit contributions
 hugged+0.004
 named+0.004
 played+0.004
 both+0.004
 loved+0.004
Neg logit contributions
 cliff-0.027
www-0.026
 plunge-0.026
works-0.025
fire-0.025
Token But
Token ID887
Feature activation-1.61e-02

Pos logit contributions
 cliff+0.007
www+0.007
works+0.007
fire+0.007
 brew+0.007
Neg logit contributions
 hugged-0.005
 played-0.005
 both-0.005
 named-0.005
 loved-0.005
Token you
Token ID345
Feature activation+5.06e-03

Pos logit contributions
 cliff+0.034
fire+0.031
www+0.031
eway+0.031
storm+0.031
Neg logit contributions
 both-0.024
 hugged-0.024
 still-0.024
 played-0.023
 loved-0.023
Token have
Token ID423
Feature activation+2.52e-02

Pos logit contributions
 hugged+0.005
 named+0.005
 played+0.004
 His+0.004
 loved+0.004
Neg logit contributions
 cliff-0.013
fire-0.012
works-0.012
www-0.012
 plunge-0.012
Token to
Token ID284
Feature activation-1.21e-02

Pos logit contributions
 hugged+0.027
 named+0.027
 played+0.025
 loved+0.024
 both+0.024
Neg logit contributions
 cliff-0.060
www-0.057
 plunge-0.056
works-0.055
fire-0.055
Token be
Token ID307
Feature activation+1.65e-02

Pos logit contributions
 cliff+0.027
www+0.025
 plunge+0.025
fire+0.025
works+0.025
Neg logit contributions
 hugged-0.016
 named-0.015
 played-0.015
 loved-0.015
 both-0.015
Token careful
Token ID8161
Feature activation+2.19e-02

Pos logit contributions
 His+0.023
 hugged+0.021
 named+0.021
 Her+0.021
 played+0.020
Neg logit contributions
 cliff-0.030
www-0.030
fire-0.030
 brew-0.029
 plunge-0.028
Token a
Token ID257
Feature activation+3.87e-03

Pos logit contributions
 hugged+0.052
 named+0.050
 both+0.048
 played+0.048
 sincere+0.046
Neg logit contributions
 cliff-0.123
fire-0.115
 plunge-0.115
www-0.114
 brew-0.113
Token cute
Token ID13779
Feature activation+2.41e-03

Pos logit contributions
 hugged+0.005
 named+0.005
 played+0.005
 both+0.004
 loved+0.004
Neg logit contributions
 cliff-0.009
www-0.008
 plunge-0.008
works-0.008
fire-0.008
Token face
Token ID1986
Feature activation+2.63e-02

Pos logit contributions
 hugged+0.003
 named+0.003
 played+0.003
 both+0.003
 learned+0.003
Neg logit contributions
 cliff-0.006
www-0.005
 plunge-0.005
fire-0.005
 brew-0.005
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
 hugged+0.015
ossible+0.015
 learned+0.014
 both+0.014
 played+0.013
Neg logit contributions
 cliff-0.074
fire-0.071
 brew-0.069
storm-0.069
 casc-0.068
Token The
Token ID383
Feature activation-1.10e-02

Pos logit contributions
 cliff+0.031
www+0.029
fire+0.029
works+0.029
 plunge+0.029
Neg logit contributions
 hugged-0.010
 played-0.009
 loved-0.009
 named-0.009
 both-0.008
Token little
Token ID1310
Feature activation-7.37e-02

Pos logit contributions
 cliff+0.023
works+0.021
www+0.021
fire+0.021
eway+0.021
Neg logit contributions
 hugged-0.016
 named-0.016
 loved-0.016
 both-0.016
 played-0.016
Token girl
Token ID2576
Feature activation-2.63e-02

Pos logit contributions
 cliff+0.135
www+0.124
works+0.123
 plunge+0.123
fire+0.122
Neg logit contributions
 hugged-0.127
 named-0.124
 played-0.122
 both-0.121
 loved-0.120
Token was
Token ID373
Feature activation+9.77e-03

Pos logit contributions
 cliff+0.060
fire+0.057
www+0.056
 brew+0.055
 plunge+0.054
Neg logit contributions
 sincere-0.024
ossible-0.023
 both-0.023
 His-0.023
 his-0.022
Token so
Token ID523
Feature activation-1.32e-02

Pos logit contributions
 hugged+0.010
 played+0.009
 named+0.009
 loved+0.009
 His+0.008
Neg logit contributions
 cliff-0.024
 plunge-0.022
www-0.022
fire-0.022
 brew-0.022
Token happy
Token ID3772
Feature activation+1.69e-02

Pos logit contributions
 cliff+0.037
www+0.035
fire+0.035
 plunge+0.035
 brew+0.034
Neg logit contributions
 hugged-0.009
 named-0.008
 played-0.008
 both-0.007
 learned-0.007
Token to
Token ID284
Feature activation-3.48e-02

Pos logit contributions
 hugged+0.013
 named+0.012
 played+0.012
 both+0.012
 loved+0.011
Neg logit contributions
 cliff-0.047
www-0.044
fire-0.044
 plunge-0.043
works-0.043
Token promised
Token ID8072
Feature activation-1.20e-02

Pos logit contributions
 hugged+0.021
 played+0.019
 named+0.019
 both+0.018
 learned+0.018
Neg logit contributions
 cliff-0.066
works-0.063
www-0.063
 plunge-0.062
inner-0.062
Token to
Token ID284
Feature activation-4.17e-02

Pos logit contributions
 cliff+0.031
www+0.029
 plunge+0.029
fire+0.029
works+0.029
Neg logit contributions
 hugged-0.012
 named-0.011
 played-0.011
 loved-0.011
 both-0.011
Token never
Token ID1239
Feature activation-1.60e-02

Pos logit contributions
 cliff+0.100
www+0.093
 plunge+0.093
fire+0.093
works+0.092
Neg logit contributions
 hugged-0.048
 named-0.046
 played-0.045
 both-0.044
 loved-0.044
Token forget
Token ID6044
Feature activation+3.07e-02

Pos logit contributions
 cliff+0.035
www+0.034
works+0.034
fire+0.033
 plunge+0.033
Neg logit contributions
 hugged-0.022
 named-0.021
 played-0.021
 both-0.020
 loved-0.020
Token the
Token ID262
Feature activation-1.51e-02

Pos logit contributions
 hugged+0.025
 played+0.023
 named+0.023
 loved+0.021
 both+0.021
Neg logit contributions
 cliff-0.084
www-0.078
 plunge-0.078
works-0.078
fire-0.077
Token power
Token ID1176
Feature activation-7.08e-02

Pos logit contributions
 cliff+0.029
www+0.026
works+0.026
 plunge+0.026
fire+0.026
Neg logit contributions
 hugged-0.025
 named-0.024
 played-0.024
 both-0.024
 loved-0.024
Token of
Token ID286
Feature activation-4.90e-03

Pos logit contributions
 cliff+0.182
www+0.171
works+0.170
fire+0.170
 plunge+0.170
Neg logit contributions
 hugged-0.068
 named-0.066
 played-0.064
 both-0.063
 loved-0.062
Token the
Token ID262
Feature activation-2.82e-02

Pos logit contributions
 cliff+0.012
www+0.011
works+0.011
 plunge+0.011
fire+0.011
Neg logit contributions
 hugged-0.005
 named-0.005
 played-0.005
 both-0.005
 loved-0.005
Token golden
Token ID10861
Feature activation-3.56e-02

Pos logit contributions
 cliff+0.056
www+0.051
works+0.051
 plunge+0.051
fire+0.051
Neg logit contributions
 hugged-0.045
 named-0.044
 played-0.043
 loved-0.042
 both-0.042
Token thread
Token ID4704
Feature activation-2.53e-02

Pos logit contributions
 cliff+0.071
www+0.065
works+0.065
 plunge+0.065
fire+0.064
Neg logit contributions
 hugged-0.055
 named-0.053
 played-0.053
 both-0.053
 loved-0.052
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
 cliff+0.075
www+0.071
 plunge+0.071
works+0.071
fire+0.070
Neg logit contributions
 hugged-0.016
 named-0.014
 played-0.014
 both-0.013
 loved-0.013
Token It
Token ID632
Feature activation-3.22e-02

Pos logit contributions
 cliff+0.038
www+0.036
fire+0.036
works+0.036
 plunge+0.036
Neg logit contributions
 hugged-0.010
 named-0.009
 played-0.009
 loved-0.008
 both-0.008
Token flew
Token ID13112
Feature activation-1.10e-02

Pos logit contributions
 cliff+0.082
www+0.077
works+0.077
 plunge+0.077
fire+0.076
Neg logit contributions
 hugged-0.033
 named-0.031
 played-0.030
 both-0.030
 loved-0.030
Token away
Token ID1497
Feature activation-1.10e-02

Pos logit contributions
 cliff+0.023
 plunge+0.021
fire+0.021
www+0.021
works+0.020
Neg logit contributions
 hugged-0.016
 named-0.016
 played-0.015
 sincere-0.015
 loved-0.015
Token but
Token ID475
Feature activation-1.89e-02

Pos logit contributions
 cliff+0.033
www+0.031
fire+0.031
 plunge+0.031
works+0.031
Neg logit contributions
 hugged-0.006
 named-0.006
 played-0.005
 both-0.005
 loved-0.005
Token the
Token ID262
Feature activation-1.39e-02

Pos logit contributions
 cliff+0.052
www+0.050
works+0.050
 plunge+0.049
inner+0.049
Neg logit contributions
 hugged-0.015
 named-0.014
 played-0.014
 both-0.014
 loved-0.014
Token good
Token ID922
Feature activation-7.78e-02

Pos logit contributions
 cliff+0.028
www+0.026
works+0.026
 plunge+0.026
fire+0.026
Neg logit contributions
 hugged-0.021
 named-0.021
 played-0.020
 loved-0.020
 both-0.020
Token man
Token ID582
Feature activation-3.81e-02

Pos logit contributions
 cliff+0.151
www+0.138
 plunge+0.137
works+0.137
fire+0.137
Neg logit contributions
 hugged-0.128
 named-0.123
 played-0.122
 both-0.121
 loved-0.120
Token felt
Token ID2936
Feature activation+9.05e-03

Pos logit contributions
 cliff+0.092
fire+0.088
www+0.086
 brew+0.084
works+0.084
Neg logit contributions
 hugged-0.034
 both-0.034
 sincere-0.033
 His-0.033
 named-0.033
Token happy
Token ID3772
Feature activation+3.49e-02

Pos logit contributions
ossible+0.012
 played+0.011
 named+0.011
 hugged+0.011
 eyes+0.010
Neg logit contributions
 cliff-0.019
fire-0.017
 plunge-0.017
 brew-0.016
 happen-0.016
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
 hugged+0.032
 named+0.031
 sincere+0.028
ossible+0.028
 played+0.027
Neg logit contributions
 cliff-0.088
fire-0.083
 brew-0.080
 plunge-0.079
eway-0.079
Token He
Token ID679
Feature activation-2.20e-02

Pos logit contributions
 cliff+0.030
www+0.029
fire+0.028
works+0.028
umm+0.028
Neg logit contributions
 hugged-0.010
 named-0.010
 played-0.009
 sincere-0.009
 both-0.009
Token and
Token ID290
Feature activation+1.71e-02

Pos logit contributions
 cliff+0.014
fire+0.013
www+0.013
 plunge+0.013
works+0.012
Neg logit contributions
 sincere-0.003
 hugged-0.003
 His-0.002
 named-0.002
ossible-0.002
Token the
Token ID262
Feature activation-2.50e-02

Pos logit contributions
 hugged+0.017
 named+0.016
 played+0.016
 both+0.015
 loved+0.015
Neg logit contributions
 cliff-0.043
www-0.040
fire-0.040
 plunge-0.040
works-0.040
Token fox
Token ID21831
Feature activation-4.53e-02

Pos logit contributions
 cliff+0.044
works+0.041
www+0.041
inner+0.040
fire+0.040
Neg logit contributions
 hugged-0.045
 named-0.045
 loved-0.044
 played-0.044
 both-0.043
Token visited
Token ID8672
Feature activation-2.11e-02

Pos logit contributions
 cliff+0.119
fire+0.113
works+0.109
 plunge+0.108
 brew+0.107
Neg logit contributions
 hugged-0.032
 sincere-0.032
 His-0.031
 named-0.030
 both-0.029
Token the
Token ID262
Feature activation-2.96e-02

Pos logit contributions
 cliff+0.062
www+0.059
works+0.058
 plunge+0.058
fire+0.057
Neg logit contributions
 hugged-0.014
 named-0.013
 played-0.013
 both-0.013
 loved-0.012
Token o
Token ID267
Feature activation-1.02e-01

Pos logit contributions
iders+0.054
 outage+0.052
works+0.052
www+0.051
inner+0.051
Neg logit contributions
 still-0.051
 named-0.051
 loved-0.050
 and-0.050
 played-0.050
Tokenasis
Token ID17765
Feature activation-6.09e-02

Pos logit contributions
 cliff+0.088
www+0.081
 plunge+0.080
works+0.075
 outage+0.074
Neg logit contributions
 named-0.271
 hugged-0.263
 played-0.261
 loved-0.258
 both-0.254
Token every
Token ID790
Feature activation-2.24e-02

Pos logit contributions
 cliff+0.167
fire+0.158
works+0.155
 plunge+0.154
www+0.152
Neg logit contributions
 hugged-0.049
 named-0.045
 played-0.043
 both-0.041
 loved-0.040
Token day
Token ID1110
Feature activation-4.70e-02

Pos logit contributions
works+0.046
 cliff+0.046
iders+0.046
inner+0.044
 plunge+0.043
Neg logit contributions
 named-0.035
 played-0.032
 hugged-0.032
 loved-0.032
 and-0.031
Token to
Token ID284
Feature activation-5.01e-02

Pos logit contributions
 cliff+0.138
fire+0.133
 plunge+0.128
www+0.128
works+0.127
Neg logit contributions
 hugged-0.027
 named-0.022
 played-0.021
 sincere-0.020
 loved-0.020
Token play
Token ID711
Feature activation-1.17e-02

Pos logit contributions
 cliff+0.110
www+0.110
works+0.107
inner+0.105
iders+0.104
Neg logit contributions
 named-0.067
 hugged-0.066
 played-0.066
 both-0.064
 his-0.063
Token From
Token ID3574
Feature activation+1.80e-02

Pos logit contributions
 cliff+0.029
www+0.027
 plunge+0.027
fire+0.027
works+0.027
Neg logit contributions
 hugged-0.007
 named-0.006
 played-0.006
 both-0.006
 loved-0.006
Token then
Token ID788
Feature activation-3.99e-02

Pos logit contributions
 hugged+0.017
 named+0.016
 played+0.016
 both+0.015
 loved+0.015
Neg logit contributions
 cliff-0.047
www-0.044
 plunge-0.044
works-0.044
fire-0.044
Token on
Token ID319
Feature activation-1.66e-02

Pos logit contributions
 cliff+0.112
works+0.105
www+0.105
fire+0.105
 plunge+0.105
Neg logit contributions
 hugged-0.031
 named-0.029
 played-0.028
 both-0.027
 loved-0.027
Token,
Token ID11
Feature activation-5.56e-02

Pos logit contributions
 cliff+0.050
www+0.047
fire+0.047
 plunge+0.046
works+0.046
Neg logit contributions
 hugged-0.010
 played-0.008
 named-0.008
 loved-0.008
 both-0.007
Token the
Token ID262
Feature activation-2.04e-02

Pos logit contributions
 cliff+0.156
www+0.148
works+0.147
 plunge+0.147
fire+0.146
Neg logit contributions
 hugged-0.042
 named-0.039
 played-0.038
 both-0.037
 loved-0.037
Token lazy
Token ID16931
Feature activation-8.91e-02

Pos logit contributions
 cliff+0.042
www+0.040
works+0.039
 plunge+0.039
fire+0.038
Neg logit contributions
 hugged-0.030
 named-0.030
 played-0.029
 loved-0.029
 both-0.029
Token cater
Token ID20825
Feature activation-8.72e-02

Pos logit contributions
 cliff+0.169
www+0.157
works+0.157
 plunge+0.156
fire+0.154
Neg logit contributions
 hugged-0.146
 named-0.144
 played-0.142
 loved-0.140
 both-0.139
Tokenpillar
Token ID41643
Feature activation-6.04e-02

Pos logit contributions
 cliff+0.128
works+0.123
 plunge+0.121
 brew+0.117
 inconvenience+0.115
Neg logit contributions
 named-0.183
 hugged-0.181
 loved-0.177
 played-0.176
 learned-0.172
Token would
Token ID561
Feature activation-3.22e-02

Pos logit contributions
 cliff+0.160
fire+0.149
www+0.149
 plunge+0.147
works+0.146
Neg logit contributions
 hugged-0.052
 named-0.048
 both-0.048
 played-0.046
 loved-0.044
Token show
Token ID905
Feature activation-1.35e-02

Pos logit contributions
 cliff+0.081
www+0.076
works+0.075
 plunge+0.075
fire+0.075
Neg logit contributions
 hugged-0.033
 named-0.032
 played-0.031
 both-0.031
 loved-0.030
Token his
Token ID465
Feature activation+6.68e-03

Pos logit contributions
 cliff+0.036
www+0.034
 plunge+0.034
works+0.034
fire+0.033
Neg logit contributions
 hugged-0.012
 named-0.011
 played-0.011
 loved-0.011
 both-0.011