TOP ACTIVATIONS
MAX = 5.814

, so he wore his warm coat . Tim loved to
to play with me ?" asked Sammy . "
with green trees and white fluffy clouds in the sky .
big blue eyes and her curly blonde hair . She loved
What are you doing ?" asked Joe . Sam
, and Lily wore her warm coat . She saw many
giving him too many ch ocol ates ." The king learned
up and put on his cozy bath robe . His mom
reward . It was ch ocol ates and sweets !
ran and put on her warm est coat and shoes .
as he opened the ch ocol ates and started eating them
outside , Bella wore her cozy sweater to keep warm .
. He put on his warm est clothes and was about
t - shirt and her brightly coloured sneakers and then she
her thoughts while wearing her cozy vest . <|endoftext|> Once upon
! He put on his warm clothes , grabbed his swim
woods wanted to have such beautiful fur too ! They came
a large selection of ch ocol ates . Ell a picked
he ate all the ch ocol ates and sweets . Then
his friends and enjoyed his cozy hole . And even though

MIN ACTIVATIONS
MIN = -5.816

She says , " B oys , what did you do
She said , " B oys , it 's not important
. " B oys , stop yelling and fighting
and said , " B oys , you should not sail
She said , " B oys , let 's take turns
. " B oys , what are you doing
. " B oys , what are you doing
to them : " B oys , be careful next time
. " B oys , this is not a
told them , " B oys , you can 't be
She says , " B oys , boys . You have
and said , " B oys , you need to clean
. " B oys , do you want some
She said , " B oys , what have you done
. " B oys , you have to eat
calls them . " B oys , it 's time for
and said , " B oys , sharing is caring .
stern voice , " B oys and girls , if you
. " B oys , boys , boys .
They say , " T oys and books for sale !

INTERVAL 2.906 - 5.814
CONTAINS 0.287%

tub and filled it with warm water . She put her
frog , who just loved to hop around ! One day
ac orns , including the special white one . <|endoftext|> Once
homes to stay dry and cozy . <|endoftext|> Once upon a
and scared because they had never been kind to each other

INTERVAL -0.001 - 2.906
CONTAINS 50.352%

looked back . The shadow waved goodbye and disappeared in the
repair L izzy 's leg and used the string to tie
coming towards it . The sw an got scared and wanted
by the bright colors and funny shapes on the phone 's
needed . When he walked out of the

INTERVAL -2.909 - -0.001
CONTAINS 49.103%

when a s iren sounded , making both of the characters
and fell to the ground . They looked at each other
to join in their game . Sam smiled and kindly said
to move the items nearer to the flame . As the
out what 's hiding there ." Both friends

INTERVAL -5.816 - -2.909
CONTAINS 0.259%

hes were . They looked over the fence and saw some
done . Mom opened the oven . She took out the
up the blocks and started over . This time , Lily
big bag of broccoli . Mia opens the bag and sees
and came outside . She saw the wagon and the mud
Token,
Token ID11
Feature activation-1.69e+00

Pos logit contributions
ped+0.066
 elbow+0.062
orn+0.061
row+0.060
cher+0.059
Neg logit contributions
 about-0.028
 where-0.027
 Mrs-0.027
 booth-0.026
 Miss-0.025
Token so
Token ID523
Feature activation+1.32e+00

Pos logit contributions
 booth+0.026
 Mrs+0.023
 Miss+0.022
 closely+0.022
 stern+0.021
Neg logit contributions
ped-0.058
orn-0.055
row-0.054
orns-0.053
iest-0.052
Token he
Token ID339
Feature activation+7.12e-01

Pos logit contributions
ped+0.047
row+0.046
orns+0.045
orn+0.045
ates+0.044
Neg logit contributions
 they-0.020
 Mrs-0.019
 Miss-0.019
 where-0.018
 about-0.017
Token wore
Token ID12408
Feature activation+3.47e+00

Pos logit contributions
ped+0.022
orn+0.021
orns+0.021
row+0.021
iest+0.020
Neg logit contributions
 booth-0.013
 Mrs-0.012
 closely-0.012
 Miss-0.012
 they-0.011
Token his
Token ID465
Feature activation+4.01e+00

Pos logit contributions
ped+0.118
row+0.117
orn+0.116
py+0.113
orns+0.112
Neg logit contributions
 booth-0.047
 about-0.047
 where-0.046
 Mrs-0.045
 closely-0.045
Token warm
Token ID5814
Feature activation+5.81e+00

Pos logit contributions
umper+0.139
keeping+0.137
orns+0.136
row+0.136
ped+0.134
Neg logit contributions
 grandma-0.057
 stern-0.056
 mom-0.053
 where-0.052
 Anna-0.051
Token coat
Token ID13209
Feature activation+2.93e+00

Pos logit contributions
cher+0.179
ped+0.174
ayer+0.174
orns+0.168
umper+0.168
Neg logit contributions
 where-0.099
 Aunt-0.093
 grandma-0.093
 about-0.090
 mom-0.087
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
ped+0.089
orn+0.087
orns+0.085
row+0.083
py+0.082
Neg logit contributions
 booth-0.055
 Mrs-0.051
 closely-0.050
 about-0.050
 where-0.049
Token Tim
Token ID5045
Feature activation+1.36e-01

Pos logit contributions
ped+0.040
orn+0.038
row+0.037
orns+0.036
py+0.035
Neg logit contributions
 booth-0.024
 Mrs-0.022
 closely-0.022
 stern-0.021
 Miss-0.021
Token loved
Token ID6151
Feature activation+1.56e+00

Pos logit contributions
ped+0.005
orn+0.004
iest+0.004
row+0.004
orns+0.004
Neg logit contributions
 booth-0.002
 Mrs-0.002
 closely-0.002
 where-0.002
 about-0.002
Token to
Token ID284
Feature activation+2.27e+00

Pos logit contributions
ped+0.052
iest+0.050
row+0.050
orn+0.050
ayer+0.049
Neg logit contributions
 where-0.026
 Mrs-0.026
 about-0.026
 they-0.025
 Miss-0.025
Token to
Token ID284
Feature activation+2.41e+00

Pos logit contributions
ped+0.025
orn+0.024
row+0.024
iest+0.023
robat+0.023
Neg logit contributions
 Mrs-0.009
 booth-0.008
 Miss-0.008
 about-0.008
 where-0.007
Token play
Token ID711
Feature activation+1.86e+00

Pos logit contributions
ped+0.079
orn+0.072
iest+0.072
row+0.071
orns+0.071
Neg logit contributions
 Mrs-0.044
 Miss-0.041
 Aunt-0.040
 booth-0.039
 about-0.039
Token with
Token ID351
Feature activation+1.86e+00

Pos logit contributions
ped+0.065
row+0.060
orn+0.059
iest+0.057
 elbow+0.057
Neg logit contributions
 booth-0.028
 about-0.027
 Mrs-0.027
 Miss-0.027
 where-0.027
Token me
Token ID502
Feature activation+1.95e+00

Pos logit contributions
ped+0.057
row+0.053
orn+0.053
iest+0.052
 elbow+0.052
Neg logit contributions
 Mrs-0.036
 Miss-0.034
 closely-0.033
 booth-0.033
 where-0.032
Token?"
Token ID1701
Feature activation+3.67e+00

Pos logit contributions
ped+0.061
orn+0.055
row+0.054
iest+0.054
orns+0.053
Neg logit contributions
 booth-0.037
 about-0.035
 Mrs-0.034
 where-0.034
 closely-0.033
Token asked
Token ID1965
Feature activation+5.70e+00

Pos logit contributions
ped+0.099
row+0.098
ates+0.094
iest+0.094
orn+0.093
Neg logit contributions
 Mrs-0.085
 Miss-0.082
 closely-0.080
 they-0.077
 stern-0.076
Token Sammy
Token ID44067
Feature activation+1.09e+00

Pos logit contributions
ands+0.164
row+0.159
ped+0.159
robat+0.158
keeping+0.157
Neg logit contributions
 Mrs-0.132
 Miss-0.126
 Lisa-0.111
 Anna-0.111
 they-0.109
Token.
Token ID13
Feature activation+2.15e+00

Pos logit contributions
ped+0.030
orn+0.028
row+0.028
orns+0.027
iest+0.027
Neg logit contributions
 booth-0.024
 Mrs-0.023
 closely-0.022
 Miss-0.022
 stern-0.022
Token
Token ID198
Feature activation+1.80e+00

Pos logit contributions
ped+0.091
iest+0.087
row+0.087
orn+0.086
orns+0.085
Neg logit contributions
 Mrs-0.019
 Miss-0.017
 booth-0.015
 Sometimes-0.013
 closely-0.013
Token
Token ID198
Feature activation+2.03e+00

Pos logit contributions
ped+0.073
orn+0.069
row+0.069
orns+0.067
 elbow+0.067
Neg logit contributions
 Mrs-0.015
 booth-0.015
 Miss-0.014
 closely-0.014
 Sometimes-0.012
Token"
Token ID1
Feature activation+4.63e-01

Pos logit contributions
ped+0.073
orn+0.068
 elbow+0.068
orns+0.067
iest+0.067
Neg logit contributions
 booth-0.027
 Mrs-0.026
 closely-0.025
 Miss-0.025
 stern-0.023
Token with
Token ID351
Feature activation+1.60e+00

Pos logit contributions
ped+0.009
orn+0.009
row+0.009
iest+0.008
orns+0.008
Neg logit contributions
 booth-0.007
 Mrs-0.007
 Miss-0.006
 stern-0.006
 closely-0.006
Token green
Token ID4077
Feature activation+3.98e+00

Pos logit contributions
ped+0.048
orn+0.046
row+0.046
orns+0.045
py+0.043
Neg logit contributions
 booth-0.030
 Mrs-0.029
 closely-0.028
 Miss-0.027
 stern-0.026
Token trees
Token ID7150
Feature activation+1.18e+00

Pos logit contributions
cher+0.109
ped+0.106
orn+0.105
py+0.103
row+0.103
Neg logit contributions
 booth-0.081
 closely-0.079
 about-0.075
 Some-0.072
 they-0.071
Token and
Token ID290
Feature activation+2.00e+00

Pos logit contributions
ped+0.040
orn+0.038
row+0.038
orns+0.037
iest+0.036
Neg logit contributions
 booth-0.018
 Mrs-0.017
 closely-0.016
 Miss-0.016
 stern-0.015
Token white
Token ID2330
Feature activation+5.14e+00

Pos logit contributions
ped+0.056
orn+0.055
row+0.054
orns+0.052
robat+0.051
Neg logit contributions
 booth-0.041
 Mrs-0.041
 closely-0.039
 Miss-0.038
 where-0.036
Token fluffy
Token ID39145
Feature activation+5.69e+00

Pos logit contributions
cher+0.136
orn+0.132
row+0.128
robat+0.125
 elbow+0.125
Neg logit contributions
 Some-0.103
 billboards-0.102
 booth-0.101
 about-0.100
 They-0.098
Token clouds
Token ID15114
Feature activation+2.68e+00

Pos logit contributions
cher+0.144
orn+0.143
row+0.140
robat+0.135
iest+0.135
Neg logit contributions
 closely-0.118
 billboards-0.116
 booth-0.114
 They-0.113
 they-0.111
Token in
Token ID287
Feature activation+2.13e+00

Pos logit contributions
ped+0.078
orn+0.076
row+0.075
orns+0.073
py+0.072
Neg logit contributions
 booth-0.052
 closely-0.051
 about-0.048
 Mrs-0.047
 where-0.047
Token the
Token ID262
Feature activation+1.91e+00

Pos logit contributions
ped+0.070
orn+0.067
row+0.066
orns+0.065
iest+0.063
Neg logit contributions
 booth-0.035
 Mrs-0.033
 closely-0.031
 Miss-0.031
 stern-0.030
Token sky
Token ID6766
Feature activation+3.51e+00

Pos logit contributions
ped+0.054
 elbow+0.051
row+0.051
orn+0.050
orns+0.049
Neg logit contributions
 booth-0.040
 Mrs-0.038
 closely-0.036
 Miss-0.035
 where-0.034
Token.
Token ID13
Feature activation+5.81e-01

Pos logit contributions
ped+0.115
orn+0.111
 elbow+0.109
orns+0.108
row+0.107
Neg logit contributions
 booth-0.057
 about-0.056
 closely-0.055
 where-0.053
 Mrs-0.052
Token big
Token ID1263
Feature activation+1.95e+00

Pos logit contributions
ped+0.087
row+0.086
orn+0.084
orns+0.083
iest+0.079
Neg logit contributions
 Mrs-0.035
 Miss-0.034
 Aunt-0.033
 booth-0.032
 grandma-0.031
Token blue
Token ID4171
Feature activation+4.05e+00

Pos logit contributions
ped+0.062
row+0.060
orn+0.059
orns+0.058
iest+0.056
Neg logit contributions
 booth-0.034
 Mrs-0.034
 Aunt-0.032
 Miss-0.032
 stern-0.032
Token eyes
Token ID2951
Feature activation+1.61e+00

Pos logit contributions
ped+0.129
cher+0.122
orns+0.122
rest+0.119
row+0.119
Neg logit contributions
 booth-0.070
 Aunt-0.066
 grandma-0.063
 Mrs-0.062
 Miss-0.061
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
ped+0.049
orn+0.047
row+0.046
orns+0.045
py+0.044
Neg logit contributions
 booth-0.030
 Mrs-0.028
 closely-0.027
 Miss-0.027
 stern-0.026
Token her
Token ID607
Feature activation+3.16e+00

Pos logit contributions
row+0.062
orn+0.061
ped+0.061
orns+0.060
robat+0.058
Neg logit contributions
 Mrs-0.031
 Miss-0.028
 booth-0.028
 about-0.027
 Aunt-0.027
Token curly
Token ID45731
Feature activation+5.64e+00

Pos logit contributions
row+0.108
orns+0.107
orn+0.105
ped+0.104
umper+0.100
Neg logit contributions
 Mrs-0.049
 Aunt-0.049
 Miss-0.047
 grandma-0.046
 stern-0.046
Token blonde
Token ID21541
Feature activation+5.12e+00

Pos logit contributions
orns+0.165
cher+0.163
orn+0.162
robat+0.161
row+0.161
Neg logit contributions
 booth-0.095
 stern-0.093
 Mrs-0.092
 Miss-0.092
 about-0.091
Token hair
Token ID4190
Feature activation+3.84e+00

Pos logit contributions
ped+0.150
row+0.147
orn+0.147
orns+0.144
robat+0.144
Neg logit contributions
 booth-0.093
 Mrs-0.086
 closely-0.085
 stern-0.085
 Miss-0.082
Token.
Token ID13
Feature activation-2.08e-02

Pos logit contributions
orn+0.120
ped+0.119
orns+0.116
row+0.115
py+0.115
Neg logit contributions
 booth-0.063
 closely-0.060
 about-0.060
 where-0.057
 Mrs-0.056
Token She
Token ID1375
Feature activation+1.37e+00

Pos logit contributions
 booth+0.000
 stern+0.000
 closely+0.000
cases+0.000
abella+0.000
Neg logit contributions
ped-0.001
orn-0.001
orns-0.001
 himself-0.001
py-0.001
Token loved
Token ID6151
Feature activation+3.56e-01

Pos logit contributions
ped+0.040
orn+0.038
row+0.038
orns+0.037
iest+0.036
Neg logit contributions
 booth-0.027
 Mrs-0.026
 closely-0.024
 Miss-0.024
 stern-0.024
Token What
Token ID1867
Feature activation-4.68e-01

Pos logit contributions
ped+0.037
orn+0.036
row+0.035
py+0.034
orns+0.034
Neg logit contributions
 booth-0.024
 closely-0.022
 Mrs-0.021
 stern-0.021
 Miss-0.020
Token are
Token ID389
Feature activation+5.16e-02

Pos logit contributions
 booth+0.009
 Mrs+0.009
 closely+0.008
 Miss+0.008
 stern+0.008
Neg logit contributions
ped-0.014
orn-0.013
row-0.013
orns-0.013
iest-0.013
Token you
Token ID345
Feature activation+1.55e+00

Pos logit contributions
ped+0.001
orn+0.001
row+0.001
orns+0.001
robat+0.001
Neg logit contributions
 Mrs-0.001
 booth-0.001
 Miss-0.001
 where-0.001
 they-0.001
Token doing
Token ID1804
Feature activation+4.48e-01

Pos logit contributions
ped+0.033
orn+0.031
row+0.030
orns+0.030
 elbow+0.029
Neg logit contributions
 booth-0.043
 Mrs-0.042
 closely-0.041
 Miss-0.040
 about-0.040
Token?"
Token ID1701
Feature activation+3.71e+00

Pos logit contributions
ped+0.013
orn+0.012
row+0.012
orns+0.012
 elbow+0.011
Neg logit contributions
 booth-0.009
 Mrs-0.009
 closely-0.008
 Miss-0.008
 about-0.008
Token asked
Token ID1965
Feature activation+5.59e+00

Pos logit contributions
ped+0.104
orn+0.099
row+0.098
orns+0.096
iest+0.093
Neg logit contributions
 booth-0.077
 Mrs-0.076
 closely-0.072
 Miss-0.072
 stern-0.070
Token Joe
Token ID5689
Feature activation+1.04e+00

Pos logit contributions
orns+0.170
ands+0.167
robat+0.167
orn+0.166
 elbow+0.163
Neg logit contributions
 Mrs-0.108
 Miss-0.101
 Mary-0.098
 closely-0.097
 Lisa-0.097
Token.
Token ID13
Feature activation+3.04e+00

Pos logit contributions
ped+0.030
orn+0.029
orns+0.029
iest+0.028
row+0.028
Neg logit contributions
 Mrs-0.021
 closely-0.020
 booth-0.020
 stern-0.019
 Miss-0.019
Token
Token ID198
Feature activation+2.04e+00

Pos logit contributions
ped+0.117
orn+0.113
row+0.113
orns+0.112
iest+0.110
Neg logit contributions
 Mrs-0.032
 booth-0.032
 Miss-0.028
 closely-0.027
 stern-0.026
Token
Token ID198
Feature activation+2.28e+00

Pos logit contributions
ped+0.077
orn+0.074
row+0.073
orns+0.072
iest+0.071
Neg logit contributions
 booth-0.023
 Mrs-0.021
 closely-0.019
 Miss-0.019
 stern-0.018
TokenSam
Token ID16305
Feature activation+3.47e-01

Pos logit contributions
ped+0.067
orn+0.064
row+0.063
orns+0.061
iest+0.060
Neg logit contributions
 booth-0.045
 Mrs-0.043
 closely-0.041
 Miss-0.041
 stern-0.039
Token,
Token ID11
Feature activation-2.10e+00

Pos logit contributions
ped+0.018
orn+0.017
row+0.017
orns+0.017
iest+0.016
Neg logit contributions
 booth-0.009
 Mrs-0.009
 closely-0.008
 Miss-0.008
 stern-0.008
Token and
Token ID290
Feature activation-2.22e-01

Pos logit contributions
 booth+0.037
 Mrs+0.033
cases+0.032
 stern+0.031
 Miss+0.031
Neg logit contributions
ped-0.066
orn-0.061
row-0.060
ter-0.060
orns-0.059
Token Lily
Token ID20037
Feature activation+5.61e-01

Pos logit contributions
 booth+0.004
 Mrs+0.003
 closely+0.003
 stern+0.003
 Miss+0.003
Neg logit contributions
ped-0.007
orn-0.007
row-0.007
orns-0.007
 elbow-0.007
Token wore
Token ID12408
Feature activation+2.75e+00

Pos logit contributions
ped+0.016
orn+0.015
row+0.015
orns+0.015
iest+0.014
Neg logit contributions
 booth-0.012
 Mrs-0.011
 closely-0.010
 Miss-0.010
 stern-0.010
Token her
Token ID607
Feature activation+3.66e+00

Pos logit contributions
ped+0.087
orn+0.084
row+0.083
orns+0.080
iest+0.079
Neg logit contributions
 booth-0.048
 Mrs-0.046
 Miss-0.044
 closely-0.043
 stern-0.042
Token warm
Token ID5814
Feature activation+5.52e+00

Pos logit contributions
ped+0.127
orns+0.124
orn+0.124
row+0.124
umper+0.120
Neg logit contributions
 grandma-0.056
 Aunt-0.055
 where-0.051
 Miss-0.049
 stern-0.049
Token coat
Token ID13209
Feature activation+2.95e+00

Pos logit contributions
cher+0.164
ped+0.163
ayer+0.157
orn+0.157
orns+0.155
Neg logit contributions
 Aunt-0.104
 where-0.102
 grandma-0.099
 about-0.097
 booth-0.090
Token.
Token ID13
Feature activation+1.20e-01

Pos logit contributions
ped+0.087
orn+0.085
orns+0.082
row+0.081
py+0.080
Neg logit contributions
 booth-0.058
 Mrs-0.055
 about-0.053
 closely-0.053
 Miss-0.053
Token She
Token ID1375
Feature activation+1.58e+00

Pos logit contributions
ped+0.003
orn+0.003
orns+0.003
row+0.003
 himself+0.003
Neg logit contributions
 booth-0.002
 stern-0.002
 closely-0.002
cases-0.002
 horror-0.002
Token saw
Token ID2497
Feature activation-3.74e-01

Pos logit contributions
ped+0.049
orn+0.047
row+0.047
orns+0.046
iest+0.045
Neg logit contributions
 booth-0.028
 Mrs-0.027
 Miss-0.026
 closely-0.026
 about-0.025
Token many
Token ID867
Feature activation+8.47e-01

Pos logit contributions
 booth+0.007
 closely+0.007
 Mrs+0.007
 stern+0.007
 Miss+0.006
Neg logit contributions
ped-0.011
orn-0.010
orns-0.010
row-0.010
py-0.010
Token giving
Token ID3501
Feature activation+6.68e-01

Pos logit contributions
orn+0.005
ped+0.005
row+0.005
iest+0.005
orns+0.005
Neg logit contributions
 Mrs-0.004
 booth-0.004
 closely-0.004
 Miss-0.004
 where-0.004
Token him
Token ID683
Feature activation+1.04e+00

Pos logit contributions
ped+0.018
orn+0.018
row+0.018
iest+0.017
orns+0.017
Neg logit contributions
 Mrs-0.014
 booth-0.014
 closely-0.013
 Miss-0.013
 stern-0.012
Token too
Token ID1165
Feature activation+1.33e+00

Pos logit contributions
ped+0.032
orn+0.031
row+0.030
orns+0.029
iest+0.029
Neg logit contributions
 booth-0.019
 Mrs-0.018
 closely-0.017
 Miss-0.017
 stern-0.016
Token many
Token ID867
Feature activation+1.65e+00

Pos logit contributions
ped+0.048
orn+0.046
row+0.046
orns+0.044
iest+0.044
Neg logit contributions
 booth-0.018
 Mrs-0.016
 Miss-0.015
 closely-0.014
 Lisa-0.013
Token ch
Token ID442
Feature activation+2.00e+00

Pos logit contributions
ped+0.050
orn+0.048
row+0.047
robat+0.045
umper+0.045
Neg logit contributions
 about-0.030
 Mrs-0.030
 booth-0.029
 closely-0.028
 where-0.028
Tokenocol
Token ID4668
Feature activation+5.50e+00

Pos logit contributions
ped+0.040
robat+0.038
cher+0.036
 elbow+0.036
ter+0.035
Neg logit contributions
ours-0.051
 closely-0.051
 about-0.050
outs-0.049
 Sometimes-0.049
Tokenates
Token ID689
Feature activation+1.26e+00

Pos logit contributions
 mineral+0.102
 elbow+0.085
 seek+0.083
 current+0.083
 sensation+0.082
Neg logit contributions
outs-0.140
ews-0.133
aw-0.133
ours-0.131
 billboards-0.130
Token."
Token ID526
Feature activation+1.40e-01

Pos logit contributions
ped+0.038
orn+0.036
orns+0.034
py+0.034
 elbow+0.034
Neg logit contributions
 booth-0.024
 Mrs-0.023
 closely-0.022
 about-0.022
 where-0.022
Token The
Token ID383
Feature activation-1.36e+00

Pos logit contributions
ped+0.005
orn+0.004
row+0.004
orns+0.004
py+0.004
Neg logit contributions
 booth-0.002
 closely-0.002
cases-0.002
 Mrs-0.002
 stern-0.002
Token king
Token ID5822
Feature activation-3.87e-01

Pos logit contributions
 booth+0.028
 Mrs+0.026
 Miss+0.025
 closely+0.025
 stern+0.024
Neg logit contributions
ped-0.040
orn-0.038
row-0.037
orns-0.037
iest-0.036
Token learned
Token ID4499
Feature activation-1.31e+00

Pos logit contributions
 booth+0.008
 Mrs+0.008
 Miss+0.007
 closely+0.007
 stern+0.007
Neg logit contributions
ped-0.011
orn-0.011
row-0.010
orns-0.010
py-0.010
Token up
Token ID510
Feature activation+2.44e-01

Pos logit contributions
ped+0.048
orn+0.046
row+0.044
orns+0.043
py+0.042
Neg logit contributions
 booth-0.029
 Mrs-0.026
 Miss-0.025
 stern-0.025
 Lisa-0.024
Token and
Token ID290
Feature activation+2.70e+00

Pos logit contributions
ped+0.008
orn+0.008
row+0.008
orns+0.008
py+0.008
Neg logit contributions
 booth-0.003
 Mrs-0.003
 closely-0.003
 about-0.003
 Miss-0.003
Token put
Token ID1234
Feature activation+2.38e+00

Pos logit contributions
ped+0.081
row+0.078
orn+0.077
orns+0.077
py+0.074
Neg logit contributions
 Mrs-0.052
 booth-0.051
 closely-0.049
 Miss-0.048
 where-0.047
Token on
Token ID319
Feature activation+1.68e+00

Pos logit contributions
chie+0.089
ayer+0.086
py+0.085
row+0.083
 affect+0.082
Neg logit contributions
 where-0.037
 about-0.032
 they-0.031
 Anna-0.030
 what-0.030
Token his
Token ID465
Feature activation+2.55e+00

Pos logit contributions
ped+0.034
orn+0.032
row+0.031
orns+0.030
iest+0.029
Neg logit contributions
 booth-0.048
 Mrs-0.046
 closely-0.045
 Miss-0.045
 stern-0.044
Token cozy
Token ID37438
Feature activation+5.48e+00

Pos logit contributions
ped+0.090
row+0.086
orns+0.086
umper+0.084
orn+0.084
Neg logit contributions
 booth-0.038
 where-0.036
 grandma-0.035
 stern-0.035
 Aunt-0.033
Token bath
Token ID7837
Feature activation+3.14e+00

Pos logit contributions
ped+0.183
cher+0.175
umper+0.173
ayer+0.168
robat+0.166
Neg logit contributions
 where-0.090
 Aunt-0.086
 booth-0.086
 grandma-0.082
 gate-0.076
Tokenrobe
Token ID25481
Feature activation+2.19e+00

Pos logit contributions
ped+0.089
orn+0.084
ocol+0.081
umper+0.078
chie+0.076
Neg logit contributions
 where-0.072
 about-0.066
 booth-0.064
 they-0.063
cases-0.058
Token.
Token ID13
Feature activation+1.92e+00

Pos logit contributions
ped+0.070
py+0.065
orns+0.065
orn+0.065
 elbow+0.062
Neg logit contributions
 booth-0.041
 about-0.038
 where-0.037
 closely-0.037
 Mrs-0.036
Token His
Token ID2399
Feature activation+2.08e+00

Pos logit contributions
ped+0.058
orn+0.056
row+0.055
orns+0.054
iest+0.053
Neg logit contributions
 Mrs-0.036
 booth-0.036
 Miss-0.034
 about-0.033
 closely-0.033
Token mom
Token ID1995
Feature activation+4.29e-01

Pos logit contributions
orns+0.077
ped+0.075
orn+0.074
robat+0.071
row+0.070
Neg logit contributions
 grandma-0.028
 Aunt-0.025
 Miss-0.024
 booth-0.024
 stern-0.023
Token reward
Token ID6721
Feature activation+9.49e-01

Pos logit contributions
ped+0.010
row+0.009
orn+0.009
orns+0.009
robat+0.009
Neg logit contributions
 booth-0.007
 Mrs-0.007
 closely-0.006
 where-0.006
 stern-0.006
Token.
Token ID13
Feature activation+4.32e-01

Pos logit contributions
ped+0.030
orn+0.029
row+0.028
orns+0.028
 elbow+0.027
Neg logit contributions
 booth-0.017
 Mrs-0.017
 about-0.016
 where-0.015
 Miss-0.015
Token It
Token ID632
Feature activation-5.64e-01

Pos logit contributions
ped+0.014
orn+0.014
row+0.013
orns+0.013
py+0.013
Neg logit contributions
 booth-0.006
 closely-0.006
 Mrs-0.006
 stern-0.006
 Miss-0.005
Token was
Token ID373
Feature activation-1.19e-01

Pos logit contributions
 booth+0.010
 Mrs+0.010
 Miss+0.009
 closely+0.009
 stern+0.009
Neg logit contributions
ped-0.017
orn-0.017
row-0.016
orns-0.016
iest-0.016
Token ch
Token ID442
Feature activation+8.53e-01

Pos logit contributions
 booth+0.002
 Mrs+0.002
 closely+0.002
 about+0.002
 Miss+0.002
Neg logit contributions
ped-0.004
orn-0.004
row-0.004
orns-0.004
robat-0.004
Tokenocol
Token ID4668
Feature activation+5.46e+00

Pos logit contributions
ped+0.017
 elbow+0.016
robat+0.016
orn+0.016
row+0.015
Neg logit contributions
 closely-0.024
 stern-0.023
 booth-0.023
 about-0.022
 Some-0.022
Tokenates
Token ID689
Feature activation+5.55e-01

Pos logit contributions
 mineral+0.099
 elbow+0.086
boat+0.085
 affect+0.082
 weapon+0.082
Neg logit contributions
outs-0.135
 nine-0.133
ews-0.131
 billboards-0.130
aw-0.129
Token and
Token ID290
Feature activation+1.31e+00

Pos logit contributions
ped+0.017
orn+0.016
orns+0.016
row+0.015
 elbow+0.015
Neg logit contributions
 booth-0.011
 closely-0.010
 Mrs-0.010
 about-0.010
 where-0.009
Token sweets
Token ID42402
Feature activation+8.96e-01

Pos logit contributions
orn+0.038
ped+0.038
row+0.038
orns+0.037
robat+0.036
Neg logit contributions
 Mrs-0.026
 booth-0.025
 they-0.024
 where-0.024
 closely-0.023
Token!
Token ID0
Feature activation+2.55e-01

Pos logit contributions
ped+0.029
orn+0.028
orns+0.027
py+0.027
row+0.027
Neg logit contributions
 booth-0.015
 Mrs-0.014
 about-0.014
 closely-0.014
 where-0.014
Token
Token ID198
Feature activation-3.53e-01

Pos logit contributions
ped+0.008
orn+0.008
orns+0.008
row+0.008
py+0.008
Neg logit contributions
 booth-0.004
 closely-0.003
 stern-0.003
 Mrs-0.003
cases-0.003
Token ran
Token ID4966
Feature activation-3.04e-01

Pos logit contributions
ped+0.055
orn+0.053
orns+0.053
row+0.051
iest+0.051
Neg logit contributions
 booth-0.035
 Mrs-0.033
 closely-0.033
 Miss-0.032
 about-0.031
Token and
Token ID290
Feature activation+2.14e+00

Pos logit contributions
 booth+0.007
 Mrs+0.007
ours+0.007
 stern+0.007
 horror+0.007
Neg logit contributions
ped-0.007
row-0.007
 himself-0.007
orn-0.006
orns-0.006
Token put
Token ID1234
Feature activation+5.34e-01

Pos logit contributions
orn+0.062
ped+0.061
iest+0.059
orns+0.058
py+0.057
Neg logit contributions
 booth-0.044
 Mrs-0.043
 Miss-0.041
 closely-0.041
 they-0.040
Token on
Token ID319
Feature activation+2.13e-03

Pos logit contributions
row+0.015
ped+0.015
orn+0.015
py+0.015
ayer+0.014
Neg logit contributions
 Mrs-0.010
 booth-0.010
 Miss-0.010
 where-0.010
 Lisa-0.009
Token her
Token ID607
Feature activation+3.64e+00

Pos logit contributions
ped+0.000
orns+0.000
 himself+0.000
 powers+0.000
 elbow+0.000
Neg logit contributions
 booth-0.000
oke-0.000
 stern-0.000
 Mrs-0.000
 Whoever-0.000
Token warm
Token ID5814
Feature activation+5.42e+00

Pos logit contributions
ped+0.123
orns+0.115
row+0.115
orn+0.115
py+0.114
Neg logit contributions
 grandma-0.061
 where-0.059
 booth-0.059
 Aunt-0.054
cases-0.054
Tokenest
Token ID395
Feature activation+5.15e+00

Pos logit contributions
cher+0.163
ped+0.159
ayer+0.157
ter+0.155
umper+0.153
Neg logit contributions
 where-0.100
 booth-0.097
 grandma-0.096
 Aunt-0.095
 about-0.092
Token coat
Token ID13209
Feature activation+1.97e+00

Pos logit contributions
ped+0.150
py+0.142
orns+0.141
umper+0.141
orn+0.139
Neg logit contributions
 booth-0.097
 grandma-0.091
 where-0.090
 about-0.085
 Aunt-0.083
Token and
Token ID290
Feature activation+2.21e+00

Pos logit contributions
ped+0.064
orn+0.064
orns+0.061
py+0.060
row+0.059
Neg logit contributions
 booth-0.034
 about-0.031
 where-0.031
cases-0.031
 Mrs-0.030
Token shoes
Token ID10012
Feature activation+1.34e+00

Pos logit contributions
orn+0.065
orns+0.062
ped+0.062
py+0.061
row+0.061
Neg logit contributions
 Mrs-0.044
 booth-0.043
 they-0.043
 Miss-0.042
 Aunt-0.042
Token.
Token ID13
Feature activation-8.44e-01

Pos logit contributions
orn+0.046
ped+0.046
orns+0.045
py+0.044
umper+0.042
Neg logit contributions
 booth-0.021
 where-0.021
 about-0.020
 they-0.020
cases-0.019
Token as
Token ID355
Feature activation+7.22e-01

Pos logit contributions
ped+0.012
orn+0.011
orns+0.011
row+0.011
py+0.010
Neg logit contributions
 booth-0.008
 Mrs-0.007
 closely-0.007
 Miss-0.007
 about-0.007
Token he
Token ID339
Feature activation+1.27e+00

Pos logit contributions
ped+0.025
orn+0.025
row+0.024
orns+0.024
robat+0.023
Neg logit contributions
 booth-0.010
 Mrs-0.010
 Miss-0.010
 closely-0.010
 they-0.009
Token opened
Token ID4721
Feature activation+6.71e-02

Pos logit contributions
ped+0.037
orn+0.036
row+0.036
orns+0.035
iest+0.034
Neg logit contributions
 booth-0.025
 Mrs-0.023
 closely-0.023
 Miss-0.022
 stern-0.021
Token the
Token ID262
Feature activation-1.44e+00

Pos logit contributions
ped+0.002
orn+0.002
row+0.002
orns+0.002
iest+0.002
Neg logit contributions
 booth-0.001
 Mrs-0.001
 Miss-0.001
 closely-0.001
 stern-0.001
Token ch
Token ID442
Feature activation+2.27e+00

Pos logit contributions
 booth+0.024
 Mrs+0.023
 closely+0.022
 Miss+0.022
 stern+0.020
Neg logit contributions
ped-0.045
orn-0.044
row-0.043
orns-0.042
iest-0.041
Tokenocol
Token ID4668
Feature activation+5.40e+00

Pos logit contributions
ped+0.039
orn+0.036
robat+0.034
orns+0.034
thank+0.033
Neg logit contributions
 booth-0.069
 closely-0.067
 stern-0.065
ours-0.063
 about-0.062
Tokenates
Token ID689
Feature activation-4.52e-01

Pos logit contributions
 mineral+0.091
 elbow+0.080
 seek+0.075
 vest+0.070
 weapon+0.070
Neg logit contributions
ews-0.149
outs-0.147
 booth-0.145
ues-0.140
ours-0.138
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
 booth+0.009
 Mrs+0.009
 closely+0.008
 Miss+0.008
 stern+0.008
Neg logit contributions
ped-0.013
orn-0.013
row-0.012
orns-0.012
py-0.012
Token started
Token ID2067
Feature activation-2.39e-01

Pos logit contributions
ped+0.029
orn+0.028
row+0.027
orns+0.027
iest+0.026
Neg logit contributions
 booth-0.022
 Mrs-0.021
 Miss-0.020
 closely-0.020
 stern-0.020
Token eating
Token ID6600
Feature activation-1.10e+00

Pos logit contributions
 booth+0.005
 Mrs+0.005
 closely+0.004
 Miss+0.004
 stern+0.004
Neg logit contributions
ped-0.007
orn-0.007
row-0.007
orns-0.006
iest-0.006
Token them
Token ID606
Feature activation+5.04e-01

Pos logit contributions
 booth+0.020
 Mrs+0.020
 closely+0.019
 Miss+0.018
 stern+0.018
Neg logit contributions
ped-0.034
orn-0.033
row-0.032
orns-0.031
py-0.031
Token outside
Token ID2354
Feature activation+8.68e-01

Pos logit contributions
ped+0.084
cher+0.082
 elbow+0.081
ates+0.080
orn+0.078
Neg logit contributions
 about-0.036
 Mrs-0.034
 they-0.031
 where-0.031
 Miss-0.030
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
ped+0.032
orn+0.031
robat+0.031
orns+0.030
row+0.030
Neg logit contributions
 Mrs-0.010
 booth-0.010
 closely-0.009
 where-0.009
 about-0.009
Token Bella
Token ID39304
Feature activation+6.92e-01

Pos logit contributions
ped+0.044
orn+0.042
row+0.041
orns+0.040
iest+0.039
Neg logit contributions
 booth-0.037
 Mrs-0.037
 Miss-0.035
 closely-0.035
 stern-0.034
Token wore
Token ID12408
Feature activation+3.17e+00

Pos logit contributions
ped+0.022
orn+0.021
row+0.020
orns+0.020
iest+0.019
Neg logit contributions
 booth-0.012
 Mrs-0.012
 closely-0.011
 Miss-0.011
 stern-0.011
Token her
Token ID607
Feature activation+3.57e+00

Pos logit contributions
ped+0.108
row+0.105
orn+0.105
py+0.105
robat+0.103
Neg logit contributions
 Mrs-0.044
 about-0.043
 where-0.043
 Miss-0.040
 booth-0.040
Token cozy
Token ID37438
Feature activation+5.35e+00

Pos logit contributions
ped+0.120
umper+0.119
row+0.117
orns+0.117
orn+0.116
Neg logit contributions
 grandma-0.055
 Aunt-0.053
 where-0.052
 stern-0.051
 about-0.048
Token sweater
Token ID35685
Feature activation+2.07e+00

Pos logit contributions
ped+0.169
cher+0.164
robat+0.159
umper+0.158
rest+0.154
Neg logit contributions
 Aunt-0.093
 where-0.093
 grandma-0.091
 booth-0.085
 mom-0.080
Token to
Token ID284
Feature activation+2.29e+00

Pos logit contributions
ped+0.062
orn+0.060
orns+0.059
py+0.058
row+0.058
Neg logit contributions
 booth-0.039
 Mrs-0.037
 closely-0.036
 about-0.036
 where-0.035
Token keep
Token ID1394
Feature activation+2.86e+00

Pos logit contributions
ped+0.077
orns+0.074
orn+0.071
robat+0.070
row+0.070
Neg logit contributions
 where-0.043
 about-0.042
 Mrs-0.041
 Miss-0.039
 grandma-0.039
Token warm
Token ID5814
Feature activation+3.42e+00

Pos logit contributions
orn+0.105
robat+0.105
chie+0.103
iest+0.103
eman+0.103
Neg logit contributions
 about-0.033
 they-0.032
 where-0.032
 Mrs-0.030
 closely-0.030
Token.
Token ID13
Feature activation+5.60e-01

Pos logit contributions
ped+0.124
orn+0.120
robat+0.115
py+0.113
ates+0.113
Neg logit contributions
 about-0.048
 where-0.045
 Mrs-0.040
 closely-0.040
 they-0.039
Token.
Token ID13
Feature activation+2.00e+00

Pos logit contributions
ped+0.103
row+0.092
orn+0.092
iest+0.089
orns+0.088
Neg logit contributions
 where-0.039
 about-0.038
 closely-0.037
 Miss-0.034
 Anna-0.034
Token He
Token ID679
Feature activation+1.32e+00

Pos logit contributions
ped+0.064
orn+0.061
row+0.061
orns+0.059
py+0.057
Neg logit contributions
 booth-0.034
 Mrs-0.031
 closely-0.030
 Miss-0.029
 stern-0.029
Token put
Token ID1234
Feature activation+2.04e+00

Pos logit contributions
ped+0.040
orn+0.038
row+0.037
orns+0.037
iest+0.036
Neg logit contributions
 booth-0.025
 Mrs-0.024
 closely-0.023
 Miss-0.023
 stern-0.022
Token on
Token ID319
Feature activation+1.34e+00

Pos logit contributions
ayer+0.071
 affect+0.071
chie+0.069
ands+0.068
keeping+0.067
Neg logit contributions
 Anna-0.028
 where-0.027
 they-0.025
 Mrs-0.025
 what-0.023
Token his
Token ID465
Feature activation+3.36e+00

Pos logit contributions
ped+0.024
orn+0.022
row+0.021
orns+0.020
iest+0.020
Neg logit contributions
 booth-0.042
 Mrs-0.041
 closely-0.040
 Miss-0.040
 stern-0.039
Token warm
Token ID5814
Feature activation+5.34e+00

Pos logit contributions
orns+0.112
emaker+0.110
umper+0.109
ped+0.108
keeping+0.107
Neg logit contributions
 grandma-0.050
 mom-0.049
 stern-0.048
 booth-0.048
 where-0.046
Tokenest
Token ID395
Feature activation+4.14e+00

Pos logit contributions
cher+0.172
ayer+0.169
ier+0.160
ped+0.159
umper+0.159
Neg logit contributions
 where-0.087
 Aunt-0.082
 about-0.080
 booth-0.079
 grandma-0.079
Token clothes
Token ID8242
Feature activation+8.50e-01

Pos logit contributions
orns+0.121
ped+0.121
ayer+0.119
umper+0.118
keeping+0.117
Neg logit contributions
 booth-0.070
 stern-0.065
 where-0.064
 grandma-0.062
 about-0.060
Token and
Token ID290
Feature activation+2.25e+00

Pos logit contributions
ped+0.034
orn+0.034
py+0.033
orns+0.033
robat+0.031
Neg logit contributions
 booth-0.009
 where-0.008
 they-0.008
 about-0.007
cases-0.007
Token was
Token ID373
Feature activation+1.08e+00

Pos logit contributions
orn+0.070
orns+0.067
ayer+0.066
py+0.066
robat+0.065
Neg logit contributions
 they-0.046
 Mrs-0.042
 where-0.040
 booth-0.039
 Aunt-0.039
Token about
Token ID546
Feature activation+2.51e-01

Pos logit contributions
orn+0.033
ped+0.033
orns+0.032
row+0.032
robat+0.031
Neg logit contributions
 about-0.019
 Mrs-0.019
 booth-0.019
 where-0.019
 closely-0.018
Token t
Token ID256
Feature activation+2.98e+00

Pos logit contributions
ped+0.058
row+0.053
orn+0.053
py+0.051
orns+0.051
Neg logit contributions
 booth-0.052
 Miss-0.047
 Mrs-0.046
 closely-0.046
 where-0.044
Token-
Token ID12
Feature activation+2.40e+00

Pos logit contributions
ped+0.072
py+0.062
orn+0.061
orns+0.060
 elbow+0.060
Neg logit contributions
 booth-0.077
 Mrs-0.068
 closely-0.068
cases-0.068
 Miss-0.067
Tokenshirt
Token ID15600
Feature activation+8.64e-01

Pos logit contributions
ped+0.054
orn+0.050
row+0.049
orns+0.048
iest+0.046
Neg logit contributions
 booth-0.064
 Mrs-0.061
 closely-0.059
 Miss-0.059
 stern-0.058
Token and
Token ID290
Feature activation+2.74e+00

Pos logit contributions
ped+0.033
orn+0.032
orns+0.031
row+0.031
py+0.031
Neg logit contributions
 booth-0.010
 Mrs-0.009
 closely-0.008
 Miss-0.008
 about-0.008
Token her
Token ID607
Feature activation+2.99e+00

Pos logit contributions
orn+0.079
orns+0.078
ped+0.078
row+0.078
py+0.077
Neg logit contributions
 Mrs-0.055
 booth-0.055
 they-0.053
 Miss-0.053
 closely-0.051
Token brightly
Token ID35254
Feature activation+5.26e+00

Pos logit contributions
orns+0.107
row+0.104
ped+0.103
orn+0.102
py+0.099
Neg logit contributions
 booth-0.043
 Miss-0.043
 where-0.043
 grandma-0.043
 closely-0.042
Token coloured
Token ID34746
Feature activation+3.25e+00

Pos logit contributions
orns+0.138
umper+0.131
orn+0.128
cher+0.126
py+0.125
Neg logit contributions
 where-0.113
 about-0.112
 booth-0.108
 closely-0.106
 Mary-0.104
Token sneakers
Token ID42649
Feature activation+2.19e+00

Pos logit contributions
ped+0.092
py+0.088
orns+0.087
orn+0.087
row+0.086
Neg logit contributions
 booth-0.072
 Miss-0.063
 closely-0.062
 Mrs-0.060
 stern-0.059
Token and
Token ID290
Feature activation+2.09e+00

Pos logit contributions
py+0.074
orn+0.074
orns+0.073
ped+0.072
umper+0.069
Neg logit contributions
 booth-0.037
 about-0.033
 where-0.033
 they-0.031
 closely-0.031
Token then
Token ID788
Feature activation+1.51e+00

Pos logit contributions
orn+0.059
ped+0.058
row+0.058
orns+0.058
py+0.056
Neg logit contributions
 booth-0.044
 Mrs-0.043
 Miss-0.042
 they-0.041
 Lisa-0.040
Token she
Token ID673
Feature activation+2.10e+00

Pos logit contributions
ped+0.042
orn+0.040
row+0.040
orns+0.039
py+0.038
Neg logit contributions
 booth-0.033
 Mrs-0.031
 Miss-0.030
 closely-0.030
 stern-0.029
Token her
Token ID607
Feature activation+2.92e+00

Pos logit contributions
ped+0.012
orn+0.011
row+0.011
orns+0.011
robat+0.011
Neg logit contributions
 booth-0.008
 Mrs-0.008
 where-0.007
 closely-0.007
 Miss-0.007
Token thoughts
Token ID6066
Feature activation+2.68e-01

Pos logit contributions
ped+0.077
row+0.067
orns+0.066
py+0.066
orn+0.065
Neg logit contributions
 grandma-0.071
 closely-0.069
 where-0.066
 about-0.065
 Miss-0.065
Token while
Token ID981
Feature activation+1.86e+00

Pos logit contributions
ped+0.009
orn+0.009
orns+0.009
 elbow+0.008
row+0.008
Neg logit contributions
 booth-0.004
 closely-0.004
 about-0.004
 where-0.004
 Mrs-0.004
Token wearing
Token ID5762
Feature activation+1.81e+00

Pos logit contributions
ped+0.070
orn+0.070
row+0.069
robat+0.068
orns+0.067
Neg logit contributions
 Mrs-0.023
 they-0.022
 Miss-0.021
 where-0.020
 booth-0.019
Token her
Token ID607
Feature activation+3.96e+00

Pos logit contributions
ped+0.056
row+0.054
orn+0.054
orns+0.052
py+0.051
Neg logit contributions
 booth-0.032
 Mrs-0.031
 closely-0.031
 Miss-0.030
 stern-0.029
Token cozy
Token ID37438
Feature activation+5.26e+00

Pos logit contributions
ped+0.118
row+0.109
ter+0.103
py+0.103
orns+0.102
Neg logit contributions
 grandma-0.083
 where-0.080
 Mrs-0.077
 Miss-0.077
 about-0.077
Token vest
Token ID19750
Feature activation+2.55e+00

Pos logit contributions
ped+0.144
cher+0.131
ayer+0.130
rest+0.129
ter+0.127
Neg logit contributions
 grandma-0.117
 where-0.114
 Aunt-0.108
 mom-0.108
 about-0.101
Token.
Token ID13
Feature activation-9.48e-01

Pos logit contributions
ped+0.088
orn+0.082
orns+0.082
py+0.081
umper+0.078
Neg logit contributions
 booth-0.038
 about-0.037
 where-0.037
 Mrs-0.035
cases-0.034
Token<|endoftext|>
Token ID50256
Feature activation+1.14e+00

Pos logit contributions
 booth+0.019
 closely+0.017
cases+0.017
 stern+0.017
 billboards+0.015
Neg logit contributions
orn-0.027
orns-0.026
ped-0.026
row-0.024
py-0.024
TokenOnce
Token ID7454
Feature activation-1.57e+00

Pos logit contributions
ped+0.045
row+0.043
orn+0.042
py+0.041
 elbow+0.041
Neg logit contributions
 booth-0.011
 Mrs-0.011
 Miss-0.009
 closely-0.009
 about-0.008
Token upon
Token ID2402
Feature activation+3.82e-01

Pos logit contributions
 booth+0.039
 Mrs+0.035
 Miss+0.034
 stern+0.034
cases+0.033
Neg logit contributions
orn-0.037
ped-0.037
row-0.035
iest-0.034
orns-0.033
Token!
Token ID0
Feature activation+1.37e+00

Pos logit contributions
ped+0.020
orn+0.020
row+0.019
orns+0.019
iest+0.018
Neg logit contributions
 booth-0.012
 Mrs-0.011
 closely-0.010
 Miss-0.010
 stern-0.010
Token He
Token ID679
Feature activation+1.40e+00

Pos logit contributions
ped+0.036
orn+0.034
row+0.034
orns+0.033
iest+0.032
Neg logit contributions
 booth-0.031
 Mrs-0.030
 Miss-0.029
 closely-0.029
 stern-0.028
Token put
Token ID1234
Feature activation+1.76e+00

Pos logit contributions
ped+0.035
orn+0.034
orns+0.033
row+0.033
iest+0.031
Neg logit contributions
 booth-0.033
 Mrs-0.032
 closely-0.032
 Miss-0.030
 stern-0.030
Token on
Token ID319
Feature activation+1.38e+00

Pos logit contributions
 affect+0.065
ayer+0.065
ands+0.063
chie+0.063
keeping+0.062
Neg logit contributions
 where-0.022
 what-0.019
 they-0.019
 Anna-0.019
 many-0.018
Token his
Token ID465
Feature activation+3.51e+00

Pos logit contributions
ped+0.027
orn+0.026
row+0.025
orns+0.024
iest+0.024
Neg logit contributions
 booth-0.040
 Mrs-0.039
 closely-0.039
 Miss-0.038
 stern-0.037
Token warm
Token ID5814
Feature activation+5.25e+00

Pos logit contributions
orns+0.113
orn+0.108
ped+0.108
emaker+0.108
umper+0.107
Neg logit contributions
 booth-0.060
 stern-0.057
 where-0.056
 they-0.055
 grandma-0.053
Token clothes
Token ID8242
Feature activation+5.73e-01

Pos logit contributions
cher+0.163
orns+0.158
ayer+0.156
emaker+0.153
orn+0.153
Neg logit contributions
 where-0.090
 booth-0.086
 about-0.082
 Aunt-0.079
 grandma-0.079
Token,
Token ID11
Feature activation+2.26e-01

Pos logit contributions
ped+0.023
orn+0.022
orns+0.022
py+0.022
row+0.021
Neg logit contributions
 booth-0.006
 closely-0.005
 Mrs-0.005
 where-0.005
 about-0.004
Token grabbed
Token ID13176
Feature activation+1.28e+00

Pos logit contributions
orn+0.006
ped+0.006
row+0.006
orns+0.006
iest+0.006
Neg logit contributions
 booth-0.005
 Mrs-0.005
 closely-0.004
 where-0.004
 Miss-0.004
Token his
Token ID465
Feature activation+2.99e+00

Pos logit contributions
orn+0.035
row+0.035
orns+0.033
ped+0.033
robat+0.033
Neg logit contributions
 closely-0.026
 Mrs-0.025
 booth-0.025
 where-0.025
 stern-0.024
Token swim
Token ID9422
Feature activation+1.61e+00

Pos logit contributions
orns+0.093
orn+0.091
ped+0.090
py+0.087
umper+0.087
Neg logit contributions
 booth-0.054
 stern-0.052
 grandma-0.052
 where-0.051
 they-0.049
Token woods
Token ID16479
Feature activation-1.11e-01

Pos logit contributions
ped+0.053
 elbow+0.050
iest+0.049
row+0.048
cher+0.046
Neg logit contributions
 Mrs-0.037
 booth-0.035
 closely-0.035
 Miss-0.034
 Aunt-0.034
Token wanted
Token ID2227
Feature activation+1.69e-01

Pos logit contributions
 booth+0.002
 Mrs+0.002
 closely+0.002
 Miss+0.002
 about+0.002
Neg logit contributions
ped-0.003
orn-0.003
row-0.003
orns-0.003
robat-0.003
Token to
Token ID284
Feature activation+1.86e+00

Pos logit contributions
ped+0.006
orn+0.005
row+0.005
orns+0.005
iest+0.005
Neg logit contributions
 booth-0.003
 Mrs-0.003
 Miss-0.003
 closely-0.002
 stern-0.002
Token have
Token ID423
Feature activation+2.88e+00

Pos logit contributions
ped+0.058
orn+0.054
orns+0.054
ishing+0.054
row+0.053
Neg logit contributions
 Mrs-0.035
 Miss-0.034
 booth-0.033
 Their-0.032
 Aunt-0.032
Token such
Token ID884
Feature activation+1.62e+00

Pos logit contributions
ped+0.103
row+0.101
orn+0.100
iest+0.095
ble+0.095
Neg logit contributions
 Mrs-0.038
 booth-0.036
 about-0.035
 Miss-0.034
 where-0.032
Token beautiful
Token ID4950
Feature activation+5.24e+00

Pos logit contributions
ped+0.057
orn+0.057
row+0.056
orns+0.055
robat+0.054
Neg logit contributions
 Mrs-0.021
 booth-0.021
 stern-0.020
 Miss-0.019
 closely-0.019
Token fur
Token ID9230
Feature activation+1.69e+00

Pos logit contributions
cher+0.153
ped+0.150
chie+0.146
bled+0.142
boat+0.138
Neg logit contributions
 about-0.108
 Aunt-0.100
 closely-0.099
 where-0.096
 Sometimes-0.089
Token too
Token ID1165
Feature activation+1.99e+00

Pos logit contributions
ped+0.049
orn+0.047
row+0.045
orns+0.044
py+0.043
Neg logit contributions
 booth-0.034
 Mrs-0.033
 closely-0.031
 Miss-0.031
 stern-0.030
Token!
Token ID0
Feature activation+2.54e-01

Pos logit contributions
ped+0.052
orn+0.049
row+0.049
orns+0.047
iest+0.046
Neg logit contributions
 booth-0.046
 Mrs-0.044
 closely-0.042
 Miss-0.042
 stern-0.041
Token They
Token ID1119
Feature activation-2.21e-02

Pos logit contributions
orn+0.008
orns+0.008
ped+0.007
 himself+0.007
py+0.007
Neg logit contributions
 stern-0.004
 booth-0.004
 closely-0.004
cases-0.004
abella-0.004
Token came
Token ID1625
Feature activation-2.80e-01

Pos logit contributions
 Mrs+0.000
 booth+0.000
 closely+0.000
 Miss+0.000
 about+0.000
Neg logit contributions
ped-0.001
orn-0.001
row-0.001
orns-0.001
iest-0.001
Token a
Token ID257
Feature activation-1.63e+00

Pos logit contributions
 booth+0.016
 closely+0.015
 Mrs+0.015
 stern+0.014
 Miss+0.014
Neg logit contributions
ped-0.024
row-0.022
orn-0.022
orns-0.022
iest-0.021
Token large
Token ID1588
Feature activation-4.16e-01

Pos logit contributions
 closely+0.032
 Mrs+0.032
 booth+0.031
 Lisa+0.031
 Miss+0.031
Neg logit contributions
orn-0.041
 mineral-0.039
ped-0.038
row-0.038
iest-0.038
Token selection
Token ID6356
Feature activation-1.79e-01

Pos logit contributions
 booth+0.009
 Mrs+0.008
 closely+0.008
 Miss+0.008
 stern+0.008
Neg logit contributions
ped-0.011
orn-0.011
row-0.011
orns-0.010
iest-0.010
Token of
Token ID286
Feature activation+6.07e-02

Pos logit contributions
 booth+0.004
 Mrs+0.003
 closely+0.003
 Miss+0.003
 stern+0.003
Neg logit contributions
ped-0.005
orn-0.005
row-0.005
orns-0.005
iest-0.005
Token ch
Token ID442
Feature activation+1.39e+00

Pos logit contributions
ped+0.002
orn+0.001
row+0.001
orns+0.001
iest+0.001
Neg logit contributions
 booth-0.001
 Mrs-0.001
 closely-0.001
 Miss-0.001
 stern-0.001
Tokenocol
Token ID4668
Feature activation+5.23e+00

Pos logit contributions
ped+0.019
orn+0.017
row+0.015
 elbow+0.014
robat+0.014
Neg logit contributions
 booth-0.049
 closely-0.048
 Mrs-0.047
 Some-0.046
 billboards-0.046
Tokenates
Token ID689
Feature activation+4.58e-01

Pos logit contributions
 elbow+0.080
 mineral+0.078
boat+0.071
 sensation+0.067
 affect+0.067
Neg logit contributions
 billboards-0.145
ews-0.144
 booth-0.141
outs-0.139
ours-0.137
Token.
Token ID13
Feature activation-1.87e+00

Pos logit contributions
ped+0.013
orn+0.012
row+0.012
orns+0.011
py+0.011
Neg logit contributions
 booth-0.010
 Mrs-0.009
 closely-0.009
 Miss-0.009
 stern-0.009
Token Ell
Token ID7122
Feature activation-1.75e+00

Pos logit contributions
 stern+0.035
 closely+0.035
 booth+0.034
 divorce+0.031
cases+0.031
Neg logit contributions
ter-0.041
ped-0.040
 himself-0.040
keeping-0.038
ates-0.037
Tokena
Token ID64
Feature activation-1.71e+00

Pos logit contributions
 booth+0.031
 Mrs+0.030
 Miss+0.029
 closely+0.028
 Sometimes+0.027
Neg logit contributions
ped-0.055
row-0.054
orn-0.052
iest-0.050
orns-0.048
Token picked
Token ID6497
Feature activation-4.72e-01

Pos logit contributions
 booth+0.036
 Mrs+0.035
 Miss+0.034
 Sometimes+0.033
 stern+0.032
Neg logit contributions
ped-0.046
row-0.043
orn-0.042
orns-0.040
 powers-0.039
Token he
Token ID339
Feature activation+7.26e-01

Pos logit contributions
ped+0.027
orn+0.026
row+0.025
orns+0.024
iest+0.023
Neg logit contributions
 booth-0.031
 Mrs-0.030
 closely-0.028
 Miss-0.028
 stern-0.028
Token ate
Token ID15063
Feature activation-2.91e-01

Pos logit contributions
orn+0.021
ped+0.021
orns+0.020
row+0.020
iest+0.019
Neg logit contributions
 booth-0.014
 Mrs-0.014
 closely-0.014
 Miss-0.013
 stern-0.013
Token all
Token ID477
Feature activation+4.68e-01

Pos logit contributions
 booth+0.006
 Mrs+0.006
 closely+0.006
 Miss+0.006
 stern+0.005
Neg logit contributions
ped-0.008
orn-0.008
row-0.008
orns-0.007
iest-0.007
Token the
Token ID262
Feature activation+1.67e+00

Pos logit contributions
ped+0.014
orn+0.014
row+0.014
orns+0.013
py+0.013
Neg logit contributions
 booth-0.009
 Mrs-0.008
 closely-0.008
 Miss-0.008
 stern-0.007
Token ch
Token ID442
Feature activation+2.10e+00

Pos logit contributions
ped+0.050
row+0.049
py+0.047
 elbow+0.047
ayer+0.047
Neg logit contributions
 Mrs-0.030
 booth-0.029
 closely-0.028
 about-0.027
 bakery-0.027
Tokenocol
Token ID4668
Feature activation+5.23e+00

Pos logit contributions
cher+0.038
robat+0.037
 seek+0.036
ter+0.036
boat+0.035
Neg logit contributions
 Some-0.057
ours-0.057
 closely-0.054
 about-0.053
ues-0.052
Tokenates
Token ID689
Feature activation+3.71e-01

Pos logit contributions
 mineral+0.090
 weapon+0.080
 elbow+0.079
boat+0.075
 current+0.073
Neg logit contributions
ews-0.140
outs-0.136
aney-0.132
ours-0.130
abella-0.129
Token and
Token ID290
Feature activation+2.03e+00

Pos logit contributions
ped+0.012
orn+0.011
row+0.011
orns+0.011
py+0.011
Neg logit contributions
 booth-0.007
 Mrs-0.006
 closely-0.006
 Miss-0.006
 stern-0.006
Token sweets
Token ID42402
Feature activation+1.31e+00

Pos logit contributions
orn+0.062
row+0.060
orns+0.059
iest+0.058
py+0.058
Neg logit contributions
 Mrs-0.039
 booth-0.037
 they-0.036
 closely-0.035
 where-0.035
Token.
Token ID13
Feature activation+4.07e-01

Pos logit contributions
ped+0.042
orn+0.041
py+0.041
orns+0.040
row+0.039
Neg logit contributions
 booth-0.023
 closely-0.021
 Mrs-0.021
 about-0.021
 they-0.020
Token Then
Token ID3244
Feature activation+2.05e+00

Pos logit contributions
ped+0.010
orn+0.010
row+0.009
orns+0.009
py+0.009
Neg logit contributions
 booth-0.010
 Mrs-0.009
 closely-0.009
 stern-0.009
 Miss-0.009
Token his
Token ID465
Feature activation+2.69e+00

Pos logit contributions
row+0.046
ped+0.044
orn+0.043
iest+0.042
robat+0.042
Neg logit contributions
 Mrs-0.045
 Miss-0.043
 Their-0.040
 Aunt-0.039
 where-0.039
Token friends
Token ID2460
Feature activation-1.45e-01

Pos logit contributions
ped+0.076
aces+0.071
keeping+0.069
iest+0.069
py+0.069
Neg logit contributions
 grandma-0.056
 closely-0.054
 they-0.051
's-0.050
 where-0.049
Token and
Token ID290
Feature activation+9.53e-01

Pos logit contributions
 booth+0.003
 Mrs+0.003
 Miss+0.002
 stern+0.002
 closely+0.002
Neg logit contributions
ped-0.004
orn-0.004
row-0.004
orns-0.004
py-0.004
Token enjoyed
Token ID8359
Feature activation+8.68e-01

Pos logit contributions
row+0.033
ped+0.033
orn+0.033
orns+0.032
iest+0.031
Neg logit contributions
 Mrs-0.013
 they-0.013
 Miss-0.013
 booth-0.012
 about-0.012
Token his
Token ID465
Feature activation+3.81e+00

Pos logit contributions
row+0.029
iest+0.028
orn+0.027
ped+0.027
py+0.025
Neg logit contributions
 they-0.015
 Mrs-0.015
 where-0.015
 Miss-0.015
 what-0.014
Token cozy
Token ID37438
Feature activation+5.22e+00

Pos logit contributions
ped+0.103
py+0.099
keeping+0.099
cher+0.098
umper+0.097
Neg logit contributions
 they-0.070
 grandma-0.070
 closely-0.068
 where-0.068
's-0.066
Token hole
Token ID7604
Feature activation+1.66e+00

Pos logit contributions
cher+0.139
ped+0.131
umper+0.122
chie+0.122
robat+0.121
Neg logit contributions
 where-0.116
 grandma-0.114
 closely-0.106
 and-0.105
 they-0.105
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
ped+0.053
orn+0.051
robat+0.050
orns+0.050
agons+0.047
Neg logit contributions
 closely-0.030
 where-0.029
 about-0.029
 booth-0.026
 stern-0.026
Token And
Token ID843
Feature activation+1.41e-01

Pos logit contributions
ped+0.004
orn+0.004
orns+0.004
row+0.004
py+0.004
Neg logit contributions
 booth-0.003
 closely-0.002
 Mrs-0.002
 stern-0.002
 Miss-0.002
Token even
Token ID772
Feature activation+1.19e+00

Pos logit contributions
ped+0.004
orn+0.004
row+0.004
orns+0.004
iest+0.004
Neg logit contributions
 booth-0.003
 Mrs-0.003
 closely-0.003
 Miss-0.003
 stern-0.003
Token though
Token ID996
Feature activation+2.24e+00

Pos logit contributions
ped+0.042
row+0.040
orn+0.040
ates+0.040
orns+0.040
Neg logit contributions
 Mrs-0.017
 Miss-0.016
 closely-0.016
 about-0.015
 where-0.015
Token She
Token ID1375
Feature activation-2.61e+00

Pos logit contributions
 booth+0.051
 closely+0.048
abel+0.046
 batch+0.046
abella+0.046
Neg logit contributions
ped-0.049
orns-0.044
 himself-0.043
keeping-0.041
orn-0.040
Token says
Token ID1139
Feature activation-2.34e+00

Pos logit contributions
 booth+0.047
cases+0.040
 closely+0.040
 Mrs+0.040
 Eva+0.038
Neg logit contributions
ped-0.083
py-0.071
orn-0.071
row-0.071
orns-0.070
Token,
Token ID11
Feature activation-4.49e-01

Pos logit contributions
 booth+0.065
 closely+0.058
cases+0.056
 Whoever+0.055
 cinema+0.055
Neg logit contributions
ped-0.050
row-0.045
py-0.043
orn-0.043
ter-0.043
Token "
Token ID366
Feature activation-1.43e+00

Pos logit contributions
 booth+0.005
 closely+0.004
 Mrs+0.004
 Miss+0.003
cases+0.003
Neg logit contributions
ped-0.018
row-0.017
orn-0.017
orns-0.016
py-0.016
TokenB
Token ID33
Feature activation-8.48e-01

Pos logit contributions
 booth+0.032
 closely+0.032
 about+0.030
cases+0.029
 stern+0.028
Neg logit contributions
ped-0.038
orns-0.037
orn-0.036
row-0.034
ter-0.034
Tokenoys
Token ID19417
Feature activation-5.82e+00

Pos logit contributions
 booth+0.017
 Mrs+0.016
 Miss+0.016
 closely+0.015
 stern+0.015
Neg logit contributions
ped-0.024
orn-0.023
row-0.023
orns-0.022
py-0.022
Token,
Token ID11
Feature activation-2.73e+00

Pos logit contributions
 booth+0.132
 closely+0.121
 Mrs+0.115
 Whoever+0.114
 Miss+0.113
Neg logit contributions
ped-0.146
row-0.144
orn-0.138
orns-0.136
iest-0.134
Token what
Token ID644
Feature activation-1.80e+00

Pos logit contributions
 booth+0.050
 closely+0.046
cases+0.044
ews+0.041
bell+0.039
Neg logit contributions
ter-0.082
ped-0.081
row-0.078
 elbow-0.076
py-0.076
Token did
Token ID750
Feature activation-1.71e+00

Pos logit contributions
 booth+0.040
 closely+0.037
 Mrs+0.037
 Miss+0.035
 stern+0.034
Neg logit contributions
ped-0.050
orn-0.047
row-0.046
orns-0.045
iest-0.044
Token you
Token ID345
Feature activation-2.01e-01

Pos logit contributions
 booth+0.043
 Mrs+0.042
 Miss+0.040
 closely+0.040
 stern+0.039
Neg logit contributions
ped-0.041
orn-0.039
row-0.038
orns-0.037
iest-0.035
Token do
Token ID466
Feature activation-2.00e+00

Pos logit contributions
 Mrs+0.003
 booth+0.003
 about+0.003
 stern+0.003
 Miss+0.003
Neg logit contributions
ped-0.006
orns-0.006
orn-0.006
row-0.006
iest-0.006
Token She
Token ID1375
Feature activation-7.24e-01

Pos logit contributions
 booth+0.061
 France+0.059
 cinema+0.057
 closely+0.056
abella+0.056
Neg logit contributions
ped-0.053
 himself-0.043
orns-0.041
row-0.040
gie-0.040
Token said
Token ID531
Feature activation-2.50e+00

Pos logit contributions
 booth+0.018
 billboards+0.017
 Sometimes+0.017
 closely+0.016
cases+0.016
Neg logit contributions
ped-0.018
row-0.016
orn-0.015
py-0.014
ter-0.014
Token,
Token ID11
Feature activation-9.26e-01

Pos logit contributions
 booth+0.071
 cinema+0.063
cases+0.060
ews+0.060
 theatre+0.060
Neg logit contributions
ped-0.054
ter-0.047
row-0.045
 himself-0.042
py-0.042
Token "
Token ID366
Feature activation-1.14e+00

Pos logit contributions
 booth+0.012
 closely+0.010
 Mrs+0.009
cases+0.009
 Miss+0.009
Neg logit contributions
ped-0.034
row-0.031
orn-0.030
ter-0.030
py-0.030
TokenB
Token ID33
Feature activation-1.36e+00

Pos logit contributions
 booth+0.028
 closely+0.026
cases+0.025
 Mrs+0.025
 about+0.024
Neg logit contributions
ped-0.028
orns-0.027
orn-0.027
row-0.026
ter-0.025
Tokenoys
Token ID19417
Feature activation-5.73e+00

Pos logit contributions
 booth+0.030
 Mrs+0.029
 closely+0.028
 Miss+0.027
 stern+0.027
Neg logit contributions
ped-0.037
orn-0.035
row-0.035
orns-0.034
iest-0.033
Token,
Token ID11
Feature activation-3.39e+00

Pos logit contributions
 booth+0.128
 Mrs+0.111
 closely+0.110
 Miss+0.106
 horror+0.106
Neg logit contributions
ped-0.149
row-0.146
orn-0.137
iest-0.136
py-0.135
Token it
Token ID340
Feature activation-1.59e+00

Pos logit contributions
 booth+0.077
cases+0.068
 closely+0.068
ews+0.066
 Mrs+0.065
Neg logit contributions
ped-0.087
ter-0.086
row-0.084
orn-0.079
py-0.079
Token's
Token ID338
Feature activation-6.20e-01

Pos logit contributions
 booth+0.025
 Mrs+0.021
 Miss+0.019
 bakery+0.018
 Sophia+0.018
Neg logit contributions
ped-0.057
row-0.052
orn-0.052
py-0.051
ter-0.050
Token not
Token ID407
Feature activation-1.99e-02

Pos logit contributions
 booth+0.012
 Mrs+0.011
 closely+0.011
 Miss+0.010
 stern+0.010
Neg logit contributions
ped-0.019
orn-0.018
row-0.018
orns-0.017
py-0.017
Token important
Token ID1593
Feature activation-1.46e-01

Pos logit contributions
 Mrs+0.000
 about+0.000
 booth+0.000
 Miss+0.000
 where+0.000
Neg logit contributions
orn-0.001
ped-0.001
orns-0.001
row-0.001
robat-0.001
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
 booth+0.023
 Mrs+0.021
 closely+0.020
 stern+0.020
cases+0.020
Neg logit contributions
ped-0.020
row-0.018
orn-0.018
orns-0.018
py-0.018
Token
Token ID198
Feature activation-1.82e+00

Pos logit contributions
 booth+0.035
 closely+0.031
cases+0.030
abella+0.029
 salon+0.029
Neg logit contributions
ped-0.047
orns-0.045
orn-0.045
row-0.043
 himself-0.042
Token
Token ID198
Feature activation-1.13e+00

Pos logit contributions
 booth+0.030
 Mrs+0.026
 closely+0.026
cases+0.025
 stern+0.025
Neg logit contributions
ped-0.058
orn-0.057
row-0.055
orns-0.055
 himself-0.052
Token"
Token ID1
Feature activation-4.78e-01

Pos logit contributions
 booth+0.020
 Mrs+0.018
 closely+0.017
 Miss+0.017
 about+0.016
Neg logit contributions
ped-0.035
orn-0.034
row-0.034
orns-0.033
py-0.031
TokenB
Token ID33
Feature activation-3.92e-01

Pos logit contributions
 booth+0.010
 closely+0.009
 Mrs+0.009
 about+0.009
cases+0.008
Neg logit contributions
ped-0.014
orn-0.013
orns-0.013
row-0.013
ter-0.012
Tokenoys
Token ID19417
Feature activation-5.65e+00

Pos logit contributions
 booth+0.008
 Mrs+0.007
 Miss+0.007
 stern+0.007
 closely+0.007
Neg logit contributions
ped-0.012
orn-0.011
row-0.011
orns-0.011
py-0.010
Token,
Token ID11
Feature activation-2.50e+00

Pos logit contributions
 booth+0.126
 closely+0.115
 Mrs+0.112
 Miss+0.109
 Their+0.109
Neg logit contributions
ped-0.144
row-0.138
orn-0.136
orns-0.134
iest-0.132
Token stop
Token ID2245
Feature activation-1.53e+00

Pos logit contributions
 booth+0.044
 closely+0.038
 Mrs+0.036
cases+0.036
 stern+0.035
Neg logit contributions
ped-0.080
row-0.076
ter-0.076
py-0.074
orn-0.074
Token yelling
Token ID22187
Feature activation-1.32e+00

Pos logit contributions
 booth+0.035
 Mrs+0.032
 closely+0.031
 Miss+0.030
 stern+0.030
Neg logit contributions
ped-0.041
orn-0.038
row-0.038
orns-0.037
py-0.036
Token and
Token ID290
Feature activation-3.02e-01

Pos logit contributions
 booth+0.034
 Mrs+0.031
 bakery+0.031
 Sometimes+0.030
 salon+0.030
Neg logit contributions
ped-0.030
row-0.028
ter-0.026
 himself-0.025
orn-0.025
Token fighting
Token ID4330
Feature activation-9.36e-01

Pos logit contributions
 Mrs+0.006
 Miss+0.006
 booth+0.006
 where+0.005
 stern+0.005
Neg logit contributions
orn-0.010
row-0.009
ped-0.009
iest-0.009
orns-0.009
Token and
Token ID290
Feature activation-6.35e-01

Pos logit contributions
 booth+0.063
 Mrs+0.055
cases+0.055
abella+0.053
 Sometimes+0.053
Neg logit contributions
ped-0.077
row-0.072
py-0.068
orn-0.068
 himself-0.067
Token said
Token ID531
Feature activation-1.95e+00

Pos logit contributions
 booth+0.016
 billboards+0.015
 Sometimes+0.014
 bakery+0.014
case+0.013
Neg logit contributions
ped-0.015
row-0.013
b-0.013
cher-0.013
 elbow-0.012
Token,
Token ID11
Feature activation-9.39e-01

Pos logit contributions
 booth+0.052
 cinema+0.045
 closely+0.044
 bakery+0.044
 theatre+0.043
Neg logit contributions
ped-0.044
row-0.039
ter-0.039
 himself-0.038
 elbow-0.037
Token "
Token ID366
Feature activation-9.93e-01

Pos logit contributions
 booth+0.013
 Mrs+0.010
 closely+0.010
 Miss+0.010
 Sometimes+0.009
Neg logit contributions
ped-0.033
row-0.031
ter-0.030
 himself-0.030
 elbow-0.029
TokenB
Token ID33
Feature activation-8.56e-01

Pos logit contributions
 booth+0.023
 closely+0.023
cases+0.021
 about+0.021
 Mrs+0.021
Neg logit contributions
ped-0.025
orns-0.025
orn-0.024
row-0.023
ter-0.022
Tokenoys
Token ID19417
Feature activation-5.55e+00

Pos logit contributions
 booth+0.017
 Mrs+0.016
 Miss+0.016
 closely+0.015
 stern+0.015
Neg logit contributions
ped-0.025
orn-0.024
row-0.023
orns-0.023
py-0.022
Token,
Token ID11
Feature activation-3.30e+00

Pos logit contributions
 booth+0.122
 Mrs+0.108
 closely+0.107
 Miss+0.104
 bakery+0.101
Neg logit contributions
ped-0.146
row-0.143
orn-0.136
iest-0.134
orns-0.134
Token you
Token ID345
Feature activation-6.98e-01

Pos logit contributions
 booth+0.067
 bakery+0.056
cases+0.056
 closely+0.056
 Mrs+0.055
Neg logit contributions
ter-0.094
ped-0.094
row-0.092
 himself-0.088
py-0.086
Token should
Token ID815
Feature activation-2.02e-03

Pos logit contributions
 booth+0.016
 Mrs+0.015
 closely+0.015
 Miss+0.014
 stern+0.014
Neg logit contributions
ped-0.019
orn-0.018
row-0.018
orns-0.017
iest-0.017
Token not
Token ID407
Feature activation+1.33e+00

Pos logit contributions
 booth+0.000
 Mrs+0.000
 Miss+0.000
 closely+0.000
 stern+0.000
Neg logit contributions
ped-0.000
orn-0.000
row-0.000
orns-0.000
iest-0.000
Token sail
Token ID14936
Feature activation+6.78e-01

Pos logit contributions
ped+0.041
orn+0.038
orns+0.038
iest+0.037
 elbow+0.036
Neg logit contributions
 booth-0.024
 about-0.024
 where-0.024
 Miss-0.023
 closely-0.023
Token She
Token ID1375
Feature activation-8.28e-01

Pos logit contributions
abel+0.047
 booth+0.046
 divorce+0.046
 France+0.045
abella+0.045
Neg logit contributions
ped-0.041
 himself-0.037
aces-0.033
ags-0.033
orns-0.032
Token said
Token ID531
Feature activation-1.87e+00

Pos logit contributions
 booth+0.020
 billboards+0.019
 closely+0.018
 Sometimes+0.018
 horror+0.018
Neg logit contributions
ped-0.020
row-0.018
orn-0.017
b-0.016
ter-0.016
Token,
Token ID11
Feature activation-7.89e-01

Pos logit contributions
 booth+0.045
 cinema+0.038
 closely+0.038
 bakery+0.038
 theatre+0.037
Neg logit contributions
ped-0.047
ter-0.041
row-0.041
 himself-0.041
 elbow-0.041
Token "
Token ID366
Feature activation-7.74e-01

Pos logit contributions
 booth+0.010
 closely+0.008
 Mrs+0.007
cases+0.007
 Miss+0.007
Neg logit contributions
ped-0.029
row-0.027
orn-0.027
py-0.026
ter-0.026
TokenB
Token ID33
Feature activation-9.84e-01

Pos logit contributions
 booth+0.018
 closely+0.018
cases+0.017
 where+0.016
 about+0.016
Neg logit contributions
orns-0.019
ped-0.019
orn-0.019
row-0.018
robat-0.018
Tokenoys
Token ID19417
Feature activation-5.53e+00

Pos logit contributions
 booth+0.020
 Mrs+0.019
 Miss+0.018
 closely+0.018
 stern+0.018
Neg logit contributions
ped-0.028
orn-0.027
row-0.026
orns-0.026
py-0.025
Token,
Token ID11
Feature activation-3.02e+00

Pos logit contributions
 booth+0.128
 closely+0.111
 Mrs+0.111
 Miss+0.107
 bakery+0.107
Neg logit contributions
ped-0.135
row-0.135
orn-0.126
iest-0.124
orns-0.124
Token let
Token ID1309
Feature activation-1.92e+00

Pos logit contributions
 booth+0.067
 closely+0.060
cases+0.059
 Mrs+0.058
ews+0.058
Neg logit contributions
ped-0.078
ter-0.076
row-0.076
orn-0.073
py-0.072
Token's
Token ID338
Feature activation-5.46e-01

Pos logit contributions
 booth+0.036
 bakery+0.029
cases+0.029
 stern+0.028
 closely+0.028
Neg logit contributions
ped-0.060
row-0.057
orn-0.056
py-0.055
 himself-0.055
Token take
Token ID1011
Feature activation-9.72e-01

Pos logit contributions
 booth+0.013
 Mrs+0.012
 closely+0.012
 Miss+0.012
 stern+0.012
Neg logit contributions
ped-0.014
orn-0.013
row-0.013
orns-0.012
iest-0.012
Token turns
Token ID4962
Feature activation-2.07e+00

Pos logit contributions
 booth+0.014
 Mrs+0.013
 closely+0.012
 Miss+0.012
 stern+0.011
Neg logit contributions
ped-0.034
orn-0.033
row-0.032
orns-0.032
iest-0.031
Token.
Token ID13
Feature activation-6.97e-01

Pos logit contributions
 booth+0.021
 Mrs+0.020
 stern+0.019
abella+0.019
 bakery+0.019
Neg logit contributions
ped-0.015
row-0.014
orns-0.014
ter-0.014
 himself-0.014
Token
Token ID198
Feature activation-9.99e-01

Pos logit contributions
 booth+0.013
 closely+0.012
cases+0.011
abella+0.010
 salon+0.010
Neg logit contributions
ped-0.021
orn-0.020
orns-0.020
row-0.019
 himself-0.019
Token
Token ID198
Feature activation-2.30e-01

Pos logit contributions
 booth+0.015
 Mrs+0.013
 closely+0.013
 Miss+0.012
 stern+0.012
Neg logit contributions
ped-0.034
orn-0.033
row-0.032
orns-0.032
py-0.030
Token"
Token ID1
Feature activation-9.94e-02

Pos logit contributions
 booth+0.004
 Mrs+0.004
 closely+0.004
 about+0.003
 Miss+0.003
Neg logit contributions
ped-0.007
orn-0.007
row-0.007
orns-0.007
iest-0.006
TokenB
Token ID33
Feature activation-1.33e-01

Pos logit contributions
 booth+0.002
 about+0.002
 closely+0.002
 Mrs+0.002
cases+0.002
Neg logit contributions
ped-0.003
orn-0.003
orns-0.003
row-0.003
ter-0.003
Tokenoys
Token ID19417
Feature activation-5.36e+00

Pos logit contributions
 booth+0.003
 Mrs+0.003
 stern+0.003
 Miss+0.003
 closely+0.003
Neg logit contributions
ped-0.004
orn-0.003
py-0.003
orns-0.003
row-0.003
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
 booth+0.122
 closely+0.110
 Mrs+0.109
 Miss+0.106
 Some+0.106
Neg logit contributions
ped-0.138
row-0.131
orn-0.130
orns-0.125
iest-0.123
Token what
Token ID644
Feature activation-1.32e+00

Pos logit contributions
 booth+0.035
 closely+0.032
cases+0.030
ews+0.028
 bakery+0.027
Neg logit contributions
ter-0.067
ped-0.066
row-0.063
orn-0.062
py-0.062
Token are
Token ID389
Feature activation-1.75e+00

Pos logit contributions
 booth+0.030
 closely+0.028
 Mrs+0.027
 stern+0.026
 Miss+0.026
Neg logit contributions
ped-0.035
orn-0.034
row-0.032
orns-0.032
iest-0.032
Token you
Token ID345
Feature activation+9.13e-01

Pos logit contributions
 booth+0.045
 Mrs+0.043
 Miss+0.042
 closely+0.041
 stern+0.040
Neg logit contributions
ped-0.041
orn-0.039
row-0.038
orns-0.037
py-0.036
Token doing
Token ID1804
Feature activation-2.11e-01

Pos logit contributions
ped+0.020
orn+0.020
orns+0.019
row+0.019
iest+0.018
Neg logit contributions
 booth-0.024
 Mrs-0.024
 Miss-0.023
 closely-0.023
 stern-0.023
Token.
Token ID13
Feature activation-1.82e+00

Pos logit contributions
 booth+0.020
 Mrs+0.019
 Miss+0.018
 closely+0.018
 stern+0.018
Neg logit contributions
ped-0.023
row-0.022
orn-0.022
orns-0.021
py-0.021
Token
Token ID198
Feature activation-1.45e+00

Pos logit contributions
 booth+0.043
cases+0.041
 closely+0.041
abel+0.039
abella+0.038
Neg logit contributions
ped-0.039
 himself-0.033
orns-0.032
ags-0.031
orn-0.030
Token
Token ID198
Feature activation-5.54e-01

Pos logit contributions
 booth+0.030
 salon+0.028
cases+0.028
 closely+0.026
 stern+0.026
Neg logit contributions
ped-0.038
orn-0.036
 himself-0.036
row-0.035
orns-0.034
Token"
Token ID1
Feature activation-2.26e-01

Pos logit contributions
 booth+0.013
 about+0.013
 Mrs+0.012
 salon+0.012
abel+0.012
Neg logit contributions
ped-0.013
row-0.013
ble-0.011
orn-0.011
b-0.011
TokenB
Token ID33
Feature activation-5.91e-01

Pos logit contributions
 booth+0.006
 about+0.006
 closely+0.006
 where+0.006
 Mrs+0.005
Neg logit contributions
ped-0.005
orns-0.005
orn-0.005
ter-0.005
row-0.004
Tokenoys
Token ID19417
Feature activation-5.31e+00

Pos logit contributions
 booth+0.013
 Mrs+0.012
 closely+0.012
 Miss+0.012
 stern+0.011
Neg logit contributions
ped-0.016
orn-0.015
row-0.015
orns-0.015
iest-0.014
Token,
Token ID11
Feature activation-2.56e+00

Pos logit contributions
 booth+0.124
 closely+0.114
 Mrs+0.110
 Their+0.110
 Some+0.109
Neg logit contributions
ped-0.128
row-0.124
orn-0.119
orns-0.118
ter-0.116
Token what
Token ID644
Feature activation-1.62e+00

Pos logit contributions
 booth+0.055
 closely+0.052
cases+0.049
 Mrs+0.048
ews+0.047
Neg logit contributions
ped-0.070
ter-0.068
row-0.065
py-0.064
orn-0.063
Token are
Token ID389
Feature activation-2.29e+00

Pos logit contributions
 booth+0.036
 closely+0.033
 Mrs+0.033
 stern+0.031
 Miss+0.030
Neg logit contributions
ped-0.044
orn-0.042
row-0.041
iest-0.040
orns-0.040
Token you
Token ID345
Feature activation+3.68e-01

Pos logit contributions
 booth+0.063
 Mrs+0.059
 closely+0.058
 stern+0.057
 Miss+0.056
Neg logit contributions
ped-0.051
row-0.045
orn-0.045
orns-0.044
iest-0.042
Token doing
Token ID1804
Feature activation-4.05e-01

Pos logit contributions
ped+0.008
orn+0.008
row+0.008
orns+0.008
iest+0.007
Neg logit contributions
 booth-0.010
 Mrs-0.010
 Miss-0.009
 closely-0.009
 about-0.009
Token to
Token ID284
Feature activation+4.17e-01

Pos logit contributions
 booth+0.035
ews+0.030
cases+0.030
 apartment+0.030
 closely+0.029
Neg logit contributions
ped-0.047
ter-0.040
orn-0.040
row-0.039
py-0.039
Token them
Token ID606
Feature activation-3.32e+00

Pos logit contributions
ped+0.011
orn+0.011
row+0.010
orns+0.010
py+0.010
Neg logit contributions
 booth-0.009
 Mrs-0.009
 closely-0.008
 stern-0.008
 Miss-0.008
Token:
Token ID25
Feature activation-1.36e+00

Pos logit contributions
 booth+0.080
ews+0.076
 apartment+0.072
 France+0.071
 salon+0.070
Neg logit contributions
ped-0.083
ter-0.075
 himself-0.075
py-0.073
row-0.069
Token "
Token ID366
Feature activation-6.31e-01

Pos logit contributions
 booth+0.018
cases+0.016
 closely+0.015
 Mrs+0.013
abella+0.013
Neg logit contributions
ped-0.046
orn-0.044
 himself-0.043
ter-0.043
row-0.042
TokenB
Token ID33
Feature activation-3.57e-01

Pos logit contributions
 booth+0.013
 Mrs+0.012
 closely+0.012
 Miss+0.011
 stern+0.011
Neg logit contributions
ped-0.018
orn-0.018
row-0.017
orns-0.017
iest-0.016
Tokenoys
Token ID19417
Feature activation-5.30e+00

Pos logit contributions
 booth+0.007
 Mrs+0.007
 Miss+0.007
 closely+0.007
 stern+0.007
Neg logit contributions
ped-0.010
orn-0.010
row-0.009
orns-0.009
py-0.009
Token,
Token ID11
Feature activation-3.27e+00

Pos logit contributions
 booth+0.114
 closely+0.100
 Mrs+0.098
 Miss+0.095
cases+0.095
Neg logit contributions
ped-0.142
row-0.137
orn-0.135
orns-0.131
py-0.131
Token be
Token ID307
Feature activation-6.76e-01

Pos logit contributions
 booth+0.065
cases+0.058
 closely+0.057
 Mrs+0.056
ews+0.055
Neg logit contributions
ped-0.092
row-0.088
orn-0.087
ter-0.087
py-0.084
Token careful
Token ID8161
Feature activation-1.31e+00

Pos logit contributions
 booth+0.017
cases+0.016
 closely+0.016
 Mrs+0.016
 Miss+0.015
Neg logit contributions
ped-0.016
orn-0.015
row-0.015
orns-0.014
py-0.014
Token next
Token ID1306
Feature activation-1.33e+00

Pos logit contributions
 booth+0.023
 Mrs+0.021
 closely+0.020
 Miss+0.020
 stern+0.018
Neg logit contributions
ped-0.042
row-0.040
orn-0.039
py-0.038
iest-0.038
Token time
Token ID640
Feature activation-9.11e-01

Pos logit contributions
 booth+0.027
 Mrs+0.025
 closely+0.024
 Miss+0.024
 stern+0.024
Neg logit contributions
ped-0.039
orn-0.037
row-0.036
orns-0.036
iest-0.034
Token.
Token ID13
Feature activation-7.05e-01

Pos logit contributions
ped+0.029
row+0.026
orn+0.026
orns+0.024
iest+0.024
Neg logit contributions
 booth-0.031
 Mrs-0.030
 Miss-0.028
 closely-0.027
 Aunt-0.027
Token
Token ID198
Feature activation-8.02e-01

Pos logit contributions
 booth+0.013
cases+0.013
 salon+0.012
 closely+0.011
 divorce+0.011
Neg logit contributions
ped-0.019
orn-0.019
orns-0.018
 himself-0.018
ble-0.017
Token
Token ID198
Feature activation-8.54e-02

Pos logit contributions
 booth+0.013
 Mrs+0.012
 closely+0.011
cases+0.011
 stern+0.011
Neg logit contributions
ped-0.026
orn-0.025
row-0.025
orns-0.024
py-0.023
Token"
Token ID1
Feature activation+8.08e-02

Pos logit contributions
 booth+0.002
 Mrs+0.002
 about+0.002
cases+0.002
 closely+0.002
Neg logit contributions
row-0.002
orn-0.002
ped-0.002
orns-0.002
ble-0.002
TokenB
Token ID33
Feature activation+1.29e-01

Pos logit contributions
ped+0.002
orn+0.002
row+0.002
orns+0.002
iest+0.002
Neg logit contributions
 booth-0.002
 Mrs-0.002
 closely-0.002
 about-0.001
 where-0.001
Tokenoys
Token ID19417
Feature activation-5.19e+00

Pos logit contributions
ped+0.004
orn+0.004
py+0.004
orns+0.003
row+0.003
Neg logit contributions
 booth-0.002
 Mrs-0.002
 Lisa-0.002
 stern-0.002
 Miss-0.002
Token,
Token ID11
Feature activation-2.18e+00

Pos logit contributions
 booth+0.106
 Mrs+0.096
 closely+0.093
 Miss+0.092
cases+0.087
Neg logit contributions
ped-0.148
orn-0.141
row-0.139
orns-0.135
iest-0.134
Token this
Token ID428
Feature activation-3.43e-01

Pos logit contributions
 booth+0.045
 Mrs+0.041
 closely+0.040
cases+0.039
 bakery+0.038
Neg logit contributions
ped-0.061
row-0.058
orn-0.057
 elbow-0.056
py-0.056
Token is
Token ID318
Feature activation-9.58e-01

Pos logit contributions
 booth+0.005
 Mrs+0.005
 closely+0.005
 Miss+0.004
 stern+0.004
Neg logit contributions
ped-0.012
row-0.011
orn-0.011
orns-0.011
py-0.011
Token not
Token ID407
Feature activation+8.43e-01

Pos logit contributions
 booth+0.013
 Mrs+0.011
 closely+0.011
 Miss+0.010
 stern+0.010
Neg logit contributions
ped-0.034
orn-0.032
row-0.032
orns-0.031
py-0.031
Token a
Token ID257
Feature activation-8.52e-01

Pos logit contributions
ped+0.029
orn+0.029
orns+0.028
row+0.028
robat+0.027
Neg logit contributions
 Mrs-0.013
 about-0.013
 where-0.013
 booth-0.012
 Miss-0.012
Token told
Token ID1297
Feature activation-1.16e+00

Pos logit contributions
 booth+0.004
 closely+0.004
 Mrs+0.004
 Sometimes+0.004
 Miss+0.004
Neg logit contributions
ped-0.005
row-0.005
orn-0.004
py-0.004
orns-0.004
Token them
Token ID606
Feature activation-3.19e+00

Pos logit contributions
 booth+0.024
cases+0.021
 closely+0.021
ews+0.021
 bakery+0.020
Neg logit contributions
ped-0.034
py-0.031
 himself-0.031
row-0.030
orns-0.030
Token,
Token ID11
Feature activation-9.23e-01

Pos logit contributions
 booth+0.074
 bakery+0.065
 horror+0.064
ara+0.064
 cinema+0.064
Neg logit contributions
 himself-0.083
ped-0.080
 powers-0.073
row-0.072
py-0.069
Token "
Token ID366
Feature activation-5.42e-01

Pos logit contributions
 booth+0.011
 closely+0.009
 Mrs+0.008
cases+0.008
 Miss+0.008
Neg logit contributions
ped-0.034
row-0.032
orn-0.032
py-0.031
ter-0.031
TokenB
Token ID33
Feature activation-2.54e-01

Pos logit contributions
 booth+0.012
 Mrs+0.011
 Miss+0.011
 closely+0.011
 stern+0.010
Neg logit contributions
ped-0.015
orn-0.014
row-0.014
orns-0.014
iest-0.013
Tokenoys
Token ID19417
Feature activation-5.15e+00

Pos logit contributions
 booth+0.005
 Mrs+0.004
 Miss+0.004
 stern+0.004
 closely+0.004
Neg logit contributions
ped-0.008
orn-0.007
 elbow-0.007
row-0.007
orns-0.007
Token,
Token ID11
Feature activation-3.00e+00

Pos logit contributions
 booth+0.101
 Mrs+0.092
 closely+0.090
 Miss+0.086
 stern+0.086
Neg logit contributions
ped-0.148
row-0.142
orn-0.139
orns-0.136
iest-0.135
Token you
Token ID345
Feature activation-1.04e+00

Pos logit contributions
 booth+0.057
 closely+0.052
 Mrs+0.051
cases+0.049
 stern+0.048
Neg logit contributions
ped-0.089
row-0.084
orn-0.084
ter-0.082
py-0.081
Token can
Token ID460
Feature activation-5.08e-01

Pos logit contributions
 booth+0.024
 Mrs+0.023
 closely+0.022
 Miss+0.022
 stern+0.021
Neg logit contributions
ped-0.027
orn-0.025
row-0.025
orns-0.024
iest-0.024
Token't
Token ID470
Feature activation+6.49e-01

Pos logit contributions
 booth+0.011
 Mrs+0.011
 Miss+0.010
 closely+0.010
 stern+0.010
Neg logit contributions
ped-0.014
orn-0.013
row-0.013
orns-0.013
iest-0.012
Token be
Token ID307
Feature activation+8.32e-02

Pos logit contributions
ped+0.020
orn+0.019
orns+0.019
row+0.018
 elbow+0.018
Neg logit contributions
 Miss-0.012
 booth-0.012
 Mrs-0.011
 about-0.011
 closely-0.011
Token She
Token ID1375
Feature activation-1.16e+00

Pos logit contributions
 booth+0.004
 closely+0.004
cases+0.003
 salon+0.003
abella+0.003
Neg logit contributions
ped-0.006
orn-0.005
orns-0.005
row-0.005
py-0.005
Token says
Token ID1139
Feature activation-2.12e+00

Pos logit contributions
 booth+0.020
cases+0.018
 Mrs+0.017
 Whoever+0.017
 Alice+0.017
Neg logit contributions
ped-0.036
orn-0.033
orns-0.032
 elbow-0.031
py-0.030
Token,
Token ID11
Feature activation-3.01e-01

Pos logit contributions
 booth+0.052
cases+0.045
 closely+0.045
 Mrs+0.044
 Miss+0.043
Neg logit contributions
ped-0.052
row-0.048
orn-0.048
orns-0.046
ter-0.046
Token "
Token ID366
Feature activation-1.03e+00

Pos logit contributions
 booth+0.003
 Mrs+0.002
 closely+0.002
 Miss+0.002
 stern+0.002
Neg logit contributions
ped-0.012
orn-0.011
row-0.011
orns-0.011
py-0.011
TokenB
Token ID33
Feature activation-7.47e-01

Pos logit contributions
 booth+0.018
 Mrs+0.017
 Miss+0.016
 closely+0.016
 stern+0.015
Neg logit contributions
ped-0.033
orn-0.031
row-0.031
orns-0.030
iest-0.030
Tokenoys
Token ID19417
Feature activation-5.14e+00

Pos logit contributions
 booth+0.013
 Mrs+0.012
 Miss+0.011
 stern+0.011
 Sometimes+0.011
Neg logit contributions
ped-0.023
orn-0.023
orns-0.021
 elbow-0.021
row-0.021
Token,
Token ID11
Feature activation-2.56e+00

Pos logit contributions
 booth+0.103
 Mrs+0.093
 closely+0.091
 Miss+0.089
cases+0.085
Neg logit contributions
ped-0.149
row-0.142
orn-0.141
orns-0.138
iest-0.134
Token boys
Token ID6510
Feature activation-3.55e+00

Pos logit contributions
 booth+0.058
 Mrs+0.052
 closely+0.052
 stern+0.050
cases+0.050
Neg logit contributions
ped-0.068
orn-0.065
row-0.064
orns-0.062
py-0.062
Token.
Token ID13
Feature activation-7.96e-01

Pos logit contributions
 booth+0.059
 Mrs+0.057
 Miss+0.053
 closely+0.052
 stern+0.052
Neg logit contributions
ped-0.115
orn-0.110
row-0.108
orns-0.106
py-0.103
Token You
Token ID921
Feature activation-7.75e-01

Pos logit contributions
 booth+0.015
 Mrs+0.014
 Miss+0.014
 closely+0.014
 stern+0.013
Neg logit contributions
ped-0.024
orn-0.023
row-0.023
orns-0.022
iest-0.022
Token have
Token ID423
Feature activation-8.12e-01

Pos logit contributions
 booth+0.015
 Mrs+0.014
 Miss+0.013
 closely+0.013
 stern+0.013
Neg logit contributions
ped-0.023
orn-0.022
row-0.022
orns-0.021
iest-0.021
Token and
Token ID290
Feature activation-3.65e-01

Pos logit contributions
abella+0.058
 Mrs+0.058
 Whoever+0.057
 booth+0.056
 Sometimes+0.056
Neg logit contributions
row-0.055
ped-0.054
thank-0.049
iest-0.048
 himself-0.048
Token said
Token ID531
Feature activation-1.96e+00

Pos logit contributions
 booth+0.009
 closely+0.009
 billboards+0.009
 Sometimes+0.008
 horror+0.008
Neg logit contributions
ped-0.008
row-0.007
cher-0.006
 elbow-0.006
ter-0.006
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
 booth+0.049
 closely+0.045
 cinema+0.043
 apartment+0.042
 horror+0.042
Neg logit contributions
ped-0.041
iest-0.038
 powers-0.038
 himself-0.038
ter-0.037
Token "
Token ID366
Feature activation-7.17e-01

Pos logit contributions
 booth+0.017
 closely+0.015
 slams+0.014
 horror+0.014
 salon+0.014
Neg logit contributions
ped-0.034
ter-0.031
row-0.031
 himself-0.030
ble-0.029
TokenB
Token ID33
Feature activation-7.04e-01

Pos logit contributions
 closely+0.017
 booth+0.016
 about+0.015
 stern+0.015
cases+0.015
Neg logit contributions
ped-0.018
orn-0.018
orns-0.017
row-0.017
iest-0.017
Tokenoys
Token ID19417
Feature activation-5.13e+00

Pos logit contributions
 booth+0.015
 Mrs+0.014
 Miss+0.013
 closely+0.013
 stern+0.013
Neg logit contributions
ped-0.020
orn-0.019
row-0.018
orns-0.018
iest-0.018
Token,
Token ID11
Feature activation-3.05e+00

Pos logit contributions
 booth+0.115
 closely+0.103
 Mrs+0.101
 Miss+0.098
 stern+0.097
Neg logit contributions
ped-0.130
row-0.126
iest-0.121
orn-0.120
py-0.118
Token you
Token ID345
Feature activation-1.31e+00

Pos logit contributions
 booth+0.074
ews+0.068
cases+0.067
 closely+0.065
 bakery+0.065
Neg logit contributions
ter-0.072
 himself-0.067
row-0.066
ped-0.066
 mineral-0.063
Token need
Token ID761
Feature activation-7.54e-01

Pos logit contributions
 booth+0.040
 closely+0.036
 Mrs+0.036
 bakery+0.035
 Miss+0.035
Neg logit contributions
ped-0.025
row-0.024
orn-0.023
 powers-0.022
iest-0.022
Token to
Token ID284
Feature activation+2.19e-01

Pos logit contributions
 Mrs+0.009
 booth+0.009
 about+0.009
 Miss+0.009
 where+0.009
Neg logit contributions
ped-0.028
orn-0.028
robat-0.027
py-0.027
orns-0.027
Token clean
Token ID3424
Feature activation-1.20e+00

Pos logit contributions
ped+0.007
orn+0.007
orns+0.007
row+0.006
iest+0.006
Neg logit contributions
 booth-0.004
 Mrs-0.004
 closely-0.004
 Miss-0.004
 stern-0.003
Token.
Token ID13
Feature activation-5.55e-01

Pos logit contributions
 booth+0.029
 Mrs+0.027
 closely+0.026
 Miss+0.026
 stern+0.025
Neg logit contributions
ped-0.030
row-0.028
orn-0.027
orns-0.026
py-0.026
Token
Token ID198
Feature activation-9.34e-01

Pos logit contributions
 booth+0.009
 closely+0.009
cases+0.008
 stern+0.008
 Mrs+0.007
Neg logit contributions
ped-0.016
orn-0.016
orns-0.016
 himself-0.015
py-0.015
Token
Token ID198
Feature activation-3.41e-01

Pos logit contributions
 booth+0.016
cases+0.015
 Mrs+0.014
 closely+0.014
 Miss+0.014
Neg logit contributions
orn-0.029
ped-0.028
orns-0.028
row-0.027
py-0.026
Token"
Token ID1
Feature activation+7.21e-02

Pos logit contributions
 booth+0.007
cases+0.007
 Mrs+0.006
 closely+0.006
 about+0.006
Neg logit contributions
orn-0.010
orns-0.009
row-0.009
ped-0.009
robat-0.008
TokenB
Token ID33
Feature activation-2.56e-01

Pos logit contributions
ped+0.002
orn+0.002
row+0.002
orns+0.002
iest+0.002
Neg logit contributions
 booth-0.001
 Mrs-0.001
 closely-0.001
 stern-0.001
 Miss-0.001
Tokenoys
Token ID19417
Feature activation-5.02e+00

Pos logit contributions
 booth+0.006
 Mrs+0.005
 Miss+0.005
 closely+0.005
 stern+0.005
Neg logit contributions
ped-0.007
orn-0.007
row-0.006
orns-0.006
py-0.006
Token,
Token ID11
Feature activation-2.44e+00

Pos logit contributions
 booth+0.099
 Mrs+0.091
 closely+0.090
 stern+0.088
 Miss+0.087
Neg logit contributions
ped-0.142
orn-0.136
row-0.135
orns-0.132
iest-0.130
Token do
Token ID466
Feature activation-1.43e+00

Pos logit contributions
 booth+0.055
 closely+0.051
cases+0.050
ews+0.049
 stern+0.049
Neg logit contributions
ped-0.060
ter-0.058
row-0.058
py-0.057
orn-0.057
Token you
Token ID345
Feature activation+8.38e-01

Pos logit contributions
 booth+0.045
 closely+0.042
 Mrs+0.042
 horror+0.041
 Some+0.040
Neg logit contributions
ped-0.024
 powers-0.020
 himself-0.020
ble-0.020
orn-0.020
Token want
Token ID765
Feature activation-7.23e-01

Pos logit contributions
ped+0.028
orn+0.027
orns+0.027
row+0.026
 elbow+0.026
Neg logit contributions
 booth-0.013
 Mrs-0.012
 about-0.012
 closely-0.012
 Miss-0.012
Token some
Token ID617
Feature activation+8.27e-01

Pos logit contributions
 booth+0.013
 Mrs+0.012
 closely+0.011
 Miss+0.011
 stern+0.011
Neg logit contributions
ped-0.023
orn-0.022
row-0.022
orns-0.021
iest-0.021
Token She
Token ID1375
Feature activation-2.50e-01

Pos logit contributions
 booth+0.013
 closely+0.012
cases+0.011
 salon+0.010
abella+0.010
Neg logit contributions
ped-0.024
orn-0.023
orns-0.023
row-0.022
 himself-0.022
Token said
Token ID531
Feature activation-1.65e+00

Pos logit contributions
 booth+0.006
 closely+0.005
 Sometimes+0.005
 Mrs+0.005
 billboards+0.005
Neg logit contributions
ped-0.007
orn-0.006
row-0.006
py-0.006
orns-0.006
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
 booth+0.038
 closely+0.034
 Mrs+0.032
cases+0.032
 Whoever+0.031
Neg logit contributions
ped-0.044
orn-0.040
row-0.040
py-0.038
iest-0.038
Token "
Token ID366
Feature activation-4.09e-01

Pos logit contributions
 booth+0.001
 closely+0.001
 Mrs+0.001
 Miss+0.001
cases+0.001
Neg logit contributions
ped-0.005
orn-0.004
row-0.004
orns-0.004
py-0.004
TokenB
Token ID33
Feature activation-4.10e-01

Pos logit contributions
 booth+0.008
 Mrs+0.008
 closely+0.008
 Miss+0.007
 stern+0.007
Neg logit contributions
ped-0.012
orn-0.011
row-0.011
orns-0.011
iest-0.010
Tokenoys
Token ID19417
Feature activation-5.00e+00

Pos logit contributions
 booth+0.008
 Mrs+0.007
 Miss+0.007
 stern+0.007
 where+0.006
Neg logit contributions
ped-0.012
orn-0.012
row-0.011
orns-0.011
py-0.011
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
 booth+0.107
 closely+0.097
 Mrs+0.094
 Miss+0.092
 bakery+0.088
Neg logit contributions
ped-0.136
row-0.131
orn-0.129
orns-0.125
iest-0.122
Token what
Token ID644
Feature activation-1.17e+00

Pos logit contributions
 booth+0.042
 closely+0.039
cases+0.036
 Mrs+0.035
ews+0.035
Neg logit contributions
ped-0.061
orn-0.058
row-0.057
ter-0.056
py-0.055
Token have
Token ID423
Feature activation-4.85e-01

Pos logit contributions
 booth+0.026
 Mrs+0.025
 closely+0.024
 Miss+0.024
 stern+0.023
Neg logit contributions
ped-0.032
orn-0.030
row-0.029
orns-0.029
iest-0.028
Token you
Token ID345
Feature activation+1.08e+00

Pos logit contributions
 Mrs+0.011
 booth+0.011
 Miss+0.010
 they+0.010
 about+0.010
Neg logit contributions
ped-0.013
orn-0.013
row-0.012
robat-0.012
py-0.012
Token done
Token ID1760
Feature activation-9.37e-01

Pos logit contributions
orn+0.032
orns+0.031
ped+0.031
row+0.030
iest+0.030
Neg logit contributions
 Mrs-0.019
 about-0.019
 booth-0.018
 stern-0.018
 what-0.018
Token.
Token ID13
Feature activation-2.26e-02

Pos logit contributions
 booth+0.013
 where+0.012
 about+0.012
 they+0.012
 Mrs+0.011
Neg logit contributions
orn-0.024
ped-0.024
orns-0.022
py-0.022
row-0.022
Token
Token ID198
Feature activation-6.67e-01

Pos logit contributions
 booth+0.000
 closely+0.000
cases+0.000
 stern+0.000
 Mrs+0.000
Neg logit contributions
ped-0.001
orn-0.001
orns-0.001
row-0.001
py-0.001
Token
Token ID198
Feature activation+2.98e-01

Pos logit contributions
 booth+0.011
 Mrs+0.009
 closely+0.009
cases+0.009
 stern+0.009
Neg logit contributions
ped-0.022
orn-0.021
row-0.021
orns-0.021
py-0.020
Token"
Token ID1
Feature activation-2.73e-03

Pos logit contributions
orn+0.009
ped+0.008
row+0.008
orns+0.008
robat+0.008
Neg logit contributions
 booth-0.006
 Mrs-0.005
cases-0.005
 closely-0.005
 about-0.005
TokenB
Token ID33
Feature activation-5.52e-01

Pos logit contributions
 booth+0.000
 Mrs+0.000
 Miss+0.000
 closely+0.000
 stern+0.000
Neg logit contributions
ped-0.000
orn-0.000
row-0.000
orns-0.000
iest-0.000
Tokenoys
Token ID19417
Feature activation-4.95e+00

Pos logit contributions
 booth+0.012
 Mrs+0.011
 Miss+0.011
 stern+0.011
 Sometimes+0.010
Neg logit contributions
ped-0.015
orn-0.014
orns-0.013
py-0.013
 elbow-0.013
Token,
Token ID11
Feature activation-2.46e+00

Pos logit contributions
 booth+0.097
 Mrs+0.090
 closely+0.089
 Miss+0.087
 stern+0.084
Neg logit contributions
ped-0.144
orn-0.137
row-0.136
orns-0.132
iest-0.130
Token you
Token ID345
Feature activation-3.99e-01

Pos logit contributions
 booth+0.057
 closely+0.053
cases+0.051
 Mrs+0.050
ews+0.050
Neg logit contributions
ped-0.061
row-0.058
ter-0.058
orn-0.056
py-0.056
Token have
Token ID423
Feature activation-2.64e-01

Pos logit contributions
 booth+0.008
 Mrs+0.008
 closely+0.007
 Miss+0.007
 stern+0.007
Neg logit contributions
ped-0.012
orn-0.011
row-0.011
orns-0.011
iest-0.010
Token to
Token ID284
Feature activation+1.04e+00

Pos logit contributions
 booth+0.004
 Mrs+0.004
 Miss+0.004
 stern+0.004
 about+0.004
Neg logit contributions
ped-0.009
orn-0.009
row-0.008
orns-0.008
iest-0.008
Token eat
Token ID4483
Feature activation-1.35e+00

Pos logit contributions
ped+0.032
orns+0.030
orn+0.030
iest+0.029
row+0.028
Neg logit contributions
 Mrs-0.020
 booth-0.020
 Miss-0.019
 Anna-0.019
 closely-0.018
Token calls
Token ID3848
Feature activation-5.68e-01

Pos logit contributions
ped+0.007
orn+0.006
row+0.006
orns+0.006
iest+0.006
Neg logit contributions
 booth-0.005
 Mrs-0.005
 Miss-0.004
 closely-0.004
 stern-0.004
Token them
Token ID606
Feature activation-3.31e-01

Pos logit contributions
 booth+0.010
 Mrs+0.009
 closely+0.009
 Miss+0.009
 stern+0.009
Neg logit contributions
ped-0.018
row-0.017
orn-0.017
orns-0.016
py-0.016
Token.
Token ID13
Feature activation-8.30e-01

Pos logit contributions
 booth+0.008
 Mrs+0.007
 closely+0.007
 stern+0.007
 Miss+0.007
Neg logit contributions
ped-0.008
row-0.008
py-0.007
orn-0.007
orns-0.007
Token "
Token ID366
Feature activation+8.79e-01

Pos logit contributions
 booth+0.015
 closely+0.015
 salon+0.013
cases+0.013
abella+0.013
Neg logit contributions
ped-0.022
 himself-0.020
py-0.020
gie-0.020
wood-0.019
TokenB
Token ID33
Feature activation+1.06e-01

Pos logit contributions
ped+0.022
orn+0.021
orns+0.020
row+0.020
robat+0.020
Neg logit contributions
 booth-0.021
 closely-0.020
 Mrs-0.020
 about-0.019
 stern-0.019
Tokenoys
Token ID19417
Feature activation-4.93e+00

Pos logit contributions
ped+0.003
orn+0.003
orns+0.003
 elbow+0.003
row+0.003
Neg logit contributions
 booth-0.002
 Mrs-0.002
 Miss-0.002
 stern-0.002
 where-0.002
Token,
Token ID11
Feature activation-1.99e+00

Pos logit contributions
 booth+0.103
 closely+0.095
 Mrs+0.095
 Miss+0.091
 stern+0.090
Neg logit contributions
ped-0.135
orn-0.128
row-0.126
iest-0.123
orns-0.122
Token it
Token ID340
Feature activation-3.81e-01

Pos logit contributions
 booth+0.046
 closely+0.043
 Mrs+0.042
cases+0.041
 stern+0.041
Neg logit contributions
ped-0.050
orn-0.048
row-0.047
ter-0.046
py-0.045
Token's
Token ID338
Feature activation+1.20e+00

Pos logit contributions
 booth+0.004
 Mrs+0.004
 closely+0.003
 Miss+0.003
 stern+0.003
Neg logit contributions
ped-0.015
orn-0.014
row-0.014
orns-0.014
iest-0.014
Token time
Token ID640
Feature activation-1.20e+00

Pos logit contributions
ped+0.037
orn+0.036
row+0.035
orns+0.034
iest+0.034
Neg logit contributions
 booth-0.022
 Mrs-0.022
 Miss-0.021
 closely-0.020
 about-0.020
Token for
Token ID329
Feature activation+5.78e-01

Pos logit contributions
 booth+0.021
 Mrs+0.019
 about+0.018
 closely+0.018
 Miss+0.018
Neg logit contributions
ped-0.039
orn-0.039
orns-0.037
row-0.037
py-0.036
Token and
Token ID290
Feature activation+5.66e-01

Pos logit contributions
 booth+0.024
 Mrs+0.022
 Miss+0.021
abella+0.021
 stern+0.020
Neg logit contributions
ped-0.034
row-0.032
orn-0.031
orns-0.030
py-0.030
Token said
Token ID531
Feature activation-1.08e+00

Pos logit contributions
ped+0.015
orn+0.013
row+0.013
ter+0.012
orns+0.012
Neg logit contributions
 booth-0.013
 Mrs-0.012
 closely-0.012
 Sometimes-0.012
 Some-0.011
Token,
Token ID11
Feature activation-5.16e-01

Pos logit contributions
 booth+0.022
 Mrs+0.020
 closely+0.019
cases+0.019
 Miss+0.018
Neg logit contributions
ped-0.031
row-0.028
orn-0.028
ter-0.027
iest-0.027
Token "
Token ID366
Feature activation+2.21e-01

Pos logit contributions
 booth+0.007
 Mrs+0.005
 closely+0.005
cases+0.005
 Miss+0.005
Neg logit contributions
ped-0.018
row-0.017
ter-0.016
py-0.016
orn-0.016
TokenB
Token ID33
Feature activation+3.56e-01

Pos logit contributions
ped+0.006
orn+0.006
row+0.006
orns+0.006
iest+0.006
Neg logit contributions
 booth-0.005
 Mrs-0.004
 Miss-0.004
 closely-0.004
 stern-0.004
Tokenoys
Token ID19417
Feature activation-4.91e+00

Pos logit contributions
ped+0.009
orn+0.009
 elbow+0.009
py+0.009
orns+0.008
Neg logit contributions
 booth-0.008
 Mrs-0.007
 Miss-0.007
 Sophia-0.007
 stern-0.007
Token,
Token ID11
Feature activation-2.88e+00

Pos logit contributions
 booth+0.104
 Mrs+0.093
 closely+0.092
 Miss+0.090
 stern+0.088
Neg logit contributions
ped-0.135
row-0.128
orn-0.125
py-0.122
iest-0.121
Token sharing
Token ID7373
Feature activation-2.05e+00

Pos logit contributions
 booth+0.058
 Mrs+0.051
cases+0.051
 closely+0.051
ews+0.048
Neg logit contributions
ped-0.081
row-0.078
ter-0.077
orn-0.077
py-0.075
Token is
Token ID318
Feature activation-7.45e-01

Pos logit contributions
 booth+0.043
cases+0.039
 Mrs+0.038
 closely+0.038
abella+0.037
Neg logit contributions
ped-0.057
orn-0.054
row-0.052
orns-0.052
py-0.050
Token caring
Token ID18088
Feature activation-2.11e-02

Pos logit contributions
 booth+0.014
 Mrs+0.012
 closely+0.012
cases+0.012
 Miss+0.012
Neg logit contributions
ped-0.023
row-0.021
orn-0.021
py-0.021
iest-0.020
Token.
Token ID13
Feature activation-8.01e-01

Pos logit contributions
 booth+0.000
 Mrs+0.000
 closely+0.000
 Miss+0.000
 stern+0.000
Neg logit contributions
ped-0.001
orn-0.001
row-0.001
orns-0.001
iest-0.000
Token stern
Token ID26370
Feature activation-3.04e-01

Pos logit contributions
ped+0.004
row+0.004
iest+0.004
orn+0.004
py+0.004
Neg logit contributions
 booth-0.003
 Mrs-0.003
 Miss-0.002
 billboards-0.002
 where-0.002
Token voice
Token ID3809
Feature activation-1.64e+00

Pos logit contributions
 booth+0.007
 Mrs+0.007
 closely+0.006
 Miss+0.006
 stern+0.006
Neg logit contributions
ped-0.008
row-0.007
orn-0.007
iest-0.007
py-0.007
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
 booth+0.038
 Mrs+0.036
 closely+0.035
 Sometimes+0.034
 Some+0.034
Neg logit contributions
ped-0.041
row-0.038
iest-0.037
orn-0.035
py-0.035
Token "
Token ID366
Feature activation-8.20e-01

Pos logit contributions
 booth+0.018
 closely+0.015
cases+0.014
 Mrs+0.014
 bakery+0.013
Neg logit contributions
ped-0.047
ter-0.043
row-0.042
 himself-0.042
py-0.041
TokenB
Token ID33
Feature activation-1.84e-01

Pos logit contributions
 booth+0.018
 Mrs+0.017
 closely+0.016
 Miss+0.016
 stern+0.016
Neg logit contributions
ped-0.023
orn-0.022
row-0.021
orns-0.021
iest-0.020
Tokenoys
Token ID19417
Feature activation-4.91e+00

Pos logit contributions
 booth+0.004
 Mrs+0.004
 closely+0.004
 Miss+0.004
 stern+0.004
Neg logit contributions
ped-0.005
row-0.004
orn-0.004
orns-0.004
iest-0.004
Token and
Token ID290
Feature activation-1.14e+00

Pos logit contributions
 booth+0.106
 Mrs+0.097
 closely+0.096
 Miss+0.092
 stern+0.090
Neg logit contributions
ped-0.129
row-0.123
orn-0.120
iest-0.119
orns-0.116
Token girls
Token ID4813
Feature activation-2.93e+00

Pos logit contributions
 booth+0.023
 Mrs+0.022
 closely+0.021
 Miss+0.020
 stern+0.020
Neg logit contributions
ped-0.033
orn-0.031
row-0.030
orns-0.030
iest-0.029
Token,
Token ID11
Feature activation-3.67e+00

Pos logit contributions
 booth+0.062
 Mrs+0.059
 closely+0.057
 Miss+0.056
 stern+0.055
Neg logit contributions
ped-0.081
orn-0.076
row-0.076
orns-0.073
iest-0.072
Token if
Token ID611
Feature activation-7.33e-01

Pos logit contributions
 booth+0.085
 Mrs+0.078
 closely+0.077
 Miss+0.074
 stern+0.073
Neg logit contributions
ped-0.092
row-0.088
orn-0.087
iest-0.083
orns-0.082
Token you
Token ID345
Feature activation+2.61e-01

Pos logit contributions
 booth+0.017
 Mrs+0.017
 Miss+0.016
 closely+0.015
 stern+0.015
Neg logit contributions
ped-0.020
orn-0.019
row-0.018
orns-0.018
py-0.017
Token.
Token ID13
Feature activation-8.62e-01

Pos logit contributions
 booth+0.011
 Mrs+0.010
 closely+0.010
 Miss+0.010
 stern+0.010
Neg logit contributions
ped-0.013
orn-0.012
row-0.012
orns-0.011
iest-0.011
Token
Token ID198
Feature activation-1.13e+00

Pos logit contributions
 booth+0.019
cases+0.017
 closely+0.017
 salon+0.016
 batch+0.015
Neg logit contributions
ped-0.022
orns-0.021
orn-0.021
 himself-0.021
py-0.019
Token
Token ID198
Feature activation-4.25e-01

Pos logit contributions
 booth+0.020
 closely+0.017
 Mrs+0.017
cases+0.017
 stern+0.016
Neg logit contributions
ped-0.035
orn-0.034
row-0.034
orns-0.033
 himself-0.032
Token"
Token ID1
Feature activation+1.82e-01

Pos logit contributions
 booth+0.009
 Mrs+0.008
 closely+0.008
cases+0.008
 about+0.008
Neg logit contributions
row-0.011
ped-0.011
orn-0.011
orns-0.011
 himself-0.010
TokenB
Token ID33
Feature activation-1.51e-01

Pos logit contributions
ped+0.005
orn+0.005
orns+0.005
row+0.005
iest+0.005
Neg logit contributions
 booth-0.004
 Mrs-0.003
 closely-0.003
 about-0.003
 stern-0.003
Tokenoys
Token ID19417
Feature activation-4.90e+00

Pos logit contributions
 booth+0.003
 Mrs+0.003
 Miss+0.003
 closely+0.003
 stern+0.003
Neg logit contributions
ped-0.004
orn-0.004
row-0.004
orns-0.004
py-0.004
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
 booth+0.098
 Mrs+0.089
 closely+0.088
 Miss+0.086
 stern+0.082
Neg logit contributions
ped-0.142
orn-0.135
row-0.135
orns-0.130
iest-0.128
Token boys
Token ID6510
Feature activation-3.01e+00

Pos logit contributions
 booth+0.051
 closely+0.046
 Mrs+0.045
cases+0.043
 stern+0.043
Neg logit contributions
ped-0.061
row-0.057
orn-0.057
py-0.055
ter-0.055
Token,
Token ID11
Feature activation-2.11e+00

Pos logit contributions
 booth+0.052
 Mrs+0.050
 Miss+0.046
 closely+0.046
 stern+0.045
Neg logit contributions
ped-0.096
orn-0.092
row-0.090
orns-0.088
py-0.086
Token boys
Token ID6510
Feature activation-2.66e+00

Pos logit contributions
 booth+0.049
 Mrs+0.046
 closely+0.045
 Miss+0.044
 stern+0.043
Neg logit contributions
ped-0.055
orn-0.052
row-0.051
orns-0.050
iest-0.048
Token.
Token ID13
Feature activation-6.83e-01

Pos logit contributions
 booth+0.051
 Mrs+0.050
 Miss+0.047
 closely+0.046
 stern+0.045
Neg logit contributions
ped-0.080
orn-0.076
row-0.074
orns-0.073
py-0.071
Token They
Token ID1119
Feature activation-2.42e+00

Pos logit contributions
 closely+0.023
 booth+0.022
 stern+0.021
cases+0.020
 salon+0.019
Neg logit contributions
 himself-0.040
ped-0.040
ates-0.039
orns-0.039
ags-0.038
Token say
Token ID910
Feature activation-1.51e+00

Pos logit contributions
 booth+0.056
 Mrs+0.053
 closely+0.052
 Miss+0.052
 stern+0.051
Neg logit contributions
ped-0.060
orn-0.057
row-0.055
orns-0.054
 elbow-0.053
Token,
Token ID11
Feature activation-5.34e-01

Pos logit contributions
 booth+0.027
 closely+0.026
 Mrs+0.024
cases+0.022
 Miss+0.022
Neg logit contributions
ped-0.047
row-0.042
iest-0.042
orn-0.042
 elbow-0.041
Token "
Token ID366
Feature activation-2.87e-01

Pos logit contributions
 booth+0.006
 closely+0.005
 Mrs+0.005
 Miss+0.005
cases+0.005
Neg logit contributions
ped-0.019
orn-0.018
row-0.018
py-0.017
 elbow-0.017
TokenT
Token ID51
Feature activation-1.32e+00

Pos logit contributions
 booth+0.005
 Mrs+0.005
 Miss+0.005
 closely+0.005
 stern+0.004
Neg logit contributions
ped-0.009
orn-0.008
row-0.008
py-0.008
orns-0.008
Tokenoys
Token ID19417
Feature activation-4.88e+00

Pos logit contributions
 booth+0.029
 Mrs+0.028
 Miss+0.027
 closely+0.027
 stern+0.026
Neg logit contributions
ped-0.036
orn-0.034
row-0.033
orns-0.032
iest-0.031
Token and
Token ID290
Feature activation-1.58e+00

Pos logit contributions
 booth+0.105
 closely+0.099
 Mrs+0.095
 stern+0.094
 Miss+0.092
Neg logit contributions
ped-0.125
row-0.122
iest-0.118
orn-0.117
orns-0.115
Token books
Token ID3835
Feature activation-3.10e+00

Pos logit contributions
 booth+0.024
 Mrs+0.022
 closely+0.021
 Miss+0.020
 stern+0.020
Neg logit contributions
ped-0.054
orn-0.052
row-0.051
orns-0.050
iest-0.049
Token for
Token ID329
Feature activation-6.14e-01

Pos logit contributions
 booth+0.060
 Mrs+0.057
 closely+0.054
 Miss+0.054
 stern+0.052
Neg logit contributions
ped-0.093
orn-0.088
row-0.087
orns-0.085
iest-0.082
Token sale
Token ID5466
Feature activation-8.99e-01

Pos logit contributions
 Mrs+0.011
 booth+0.011
 Miss+0.010
 where+0.010
 grandma+0.009
Neg logit contributions
orn-0.020
ped-0.020
row-0.020
iest-0.019
orns-0.019
Token!
Token ID0
Feature activation-5.53e-01

Pos logit contributions
 booth+0.020
 Mrs+0.020
 closely+0.019
 Miss+0.019
 stern+0.018
Neg logit contributions
ped-0.024
orn-0.022
row-0.022
orns-0.021
iest-0.021
Tokentub
Token ID37995
Feature activation-1.95e+00

Pos logit contributions
 booth+0.022
 Mrs+0.020
 Miss+0.020
 where+0.020
 about+0.019
Neg logit contributions
ped-0.023
orn-0.022
orns-0.021
py-0.020
row-0.020
Token and
Token ID290
Feature activation+6.68e-01

Pos logit contributions
 booth+0.030
 Mrs+0.029
 Miss+0.027
 closely+0.026
 stern+0.025
Neg logit contributions
ped-0.063
row-0.061
orn-0.060
orns-0.058
iest-0.057
Token filled
Token ID5901
Feature activation-8.39e-01

Pos logit contributions
ped+0.017
row+0.016
orn+0.016
orns+0.015
 elbow+0.015
Neg logit contributions
 booth-0.016
 Mrs-0.015
 closely-0.014
 Miss-0.014
 stern-0.014
Token it
Token ID340
Feature activation+1.74e-02

Pos logit contributions
 booth+0.020
cases+0.018
 Mrs+0.017
 Miss+0.017
 stern+0.017
Neg logit contributions
ped-0.023
 himself-0.020
orn-0.020
row-0.019
orns-0.019
Token with
Token ID351
Feature activation+4.07e-01

Pos logit contributions
ped+0.001
row+0.000
orn+0.000
iest+0.000
orns+0.000
Neg logit contributions
 booth-0.000
 Mrs-0.000
 Miss-0.000
 Lisa-0.000
 Aunt-0.000
Token warm
Token ID5814
Feature activation+3.18e+00

Pos logit contributions
orn+0.012
row+0.012
robat+0.012
ped+0.012
py+0.012
Neg logit contributions
 Mrs-0.008
 they-0.007
 closely-0.007
 Miss-0.007
 where-0.007
Token water
Token ID1660
Feature activation+8.79e-01

Pos logit contributions
orn+0.098
cher+0.097
robat+0.095
py+0.093
ayer+0.092
Neg logit contributions
 they-0.052
 where-0.051
 about-0.051
 Mrs-0.050
 Anna-0.049
Token.
Token ID13
Feature activation-4.19e-01

Pos logit contributions
orn+0.027
ped+0.027
orns+0.025
py+0.025
robat+0.025
Neg logit contributions
 booth-0.016
 closely-0.015
 Mrs-0.015
 where-0.015
 about-0.015
Token She
Token ID1375
Feature activation+2.97e-01

Pos logit contributions
 booth+0.008
 closely+0.007
 stern+0.006
 billboards+0.006
cases+0.006
Neg logit contributions
ped-0.013
orn-0.012
orns-0.011
 elbow-0.011
 himself-0.011
Token put
Token ID1234
Feature activation+5.52e-02

Pos logit contributions
ped+0.007
row+0.007
orn+0.007
 elbow+0.006
orns+0.006
Neg logit contributions
 booth-0.007
 Mrs-0.007
 Miss-0.007
 billboards-0.007
 stern-0.006
Token her
Token ID607
Feature activation+1.47e+00

Pos logit contributions
row+0.002
py+0.002
ayer+0.002
orn+0.002
ped+0.002
Neg logit contributions
 Anna-0.001
 they-0.001
 Mrs-0.001
 where-0.001
 closely-0.001
Token frog
Token ID21264
Feature activation+1.16e+00

Pos logit contributions
ped+0.116
cher+0.112
ates+0.108
orns+0.108
robat+0.107
Neg logit contributions
 where-0.041
 about-0.040
 closely-0.037
 booth-0.036
 Mrs-0.035
Token,
Token ID11
Feature activation+6.82e-01

Pos logit contributions
ped+0.037
orn+0.035
row+0.035
 elbow+0.034
orns+0.034
Neg logit contributions
 booth-0.020
 Mrs-0.018
 where-0.018
 about-0.018
 closely-0.018
Token who
Token ID508
Feature activation+1.05e+00

Pos logit contributions
ped+0.021
orn+0.020
row+0.020
orns+0.019
iest+0.019
Neg logit contributions
 booth-0.013
 Mrs-0.012
 closely-0.011
 Miss-0.011
 stern-0.011
Token just
Token ID655
Feature activation+2.27e+00

Pos logit contributions
ped+0.033
row+0.032
 elbow+0.030
orn+0.030
iest+0.030
Neg logit contributions
 Mrs-0.018
 booth-0.017
 about-0.017
 Miss-0.017
 closely-0.016
Token loved
Token ID6151
Feature activation+2.17e+00

Pos logit contributions
orns+0.076
agons+0.076
orn+0.076
robat+0.074
ands+0.074
Neg logit contributions
 about-0.035
 where-0.034
 Mrs-0.032
 why-0.030
 closely-0.030
Token to
Token ID284
Feature activation+3.27e+00

Pos logit contributions
row+0.087
iest+0.086
orn+0.085
ped+0.085
 elbow+0.083
Neg logit contributions
 Mrs-0.021
 about-0.020
 where-0.019
 they-0.018
 closely-0.018
Token hop
Token ID1725
Feature activation+1.98e+00

Pos logit contributions
ped+0.101
row+0.096
ates+0.094
isher+0.094
ishing+0.094
Neg logit contributions
 where-0.060
 about-0.058
 One-0.057
 Sometimes-0.057
 Mrs-0.056
Token around
Token ID1088
Feature activation+1.64e+00

Pos logit contributions
ped+0.065
 elbow+0.063
orn+0.060
row+0.059
ates+0.059
Neg logit contributions
 closely-0.033
 booth-0.031
 about-0.030
 Mrs-0.029
 where-0.029
Token!
Token ID0
Feature activation+1.43e+00

Pos logit contributions
row+0.059
ped+0.059
 elbow+0.057
iest+0.056
orn+0.055
Neg logit contributions
 closely-0.024
 where-0.022
 Mrs-0.021
 about-0.020
 Anna-0.020
Token One
Token ID1881
Feature activation+9.53e-01

Pos logit contributions
ped+0.048
orn+0.047
orns+0.045
row+0.044
py+0.044
Neg logit contributions
 booth-0.022
 Mrs-0.020
 stern-0.020
 Miss-0.019
 closely-0.019
Token day
Token ID1110
Feature activation+2.52e+00

Pos logit contributions
ped+0.022
 elbow+0.020
orns+0.020
row+0.019
ates+0.019
Neg logit contributions
 Mrs-0.024
 booth-0.024
 closely-0.024
 about-0.023
 where-0.023
Token ac
Token ID936
Feature activation+1.81e+00

Pos logit contributions
 booth+0.010
 Mrs+0.010
 closely+0.009
 Miss+0.009
 stern+0.009
Neg logit contributions
ped-0.019
orn-0.018
row-0.018
orns-0.017
py-0.017
Tokenorns
Token ID19942
Feature activation-4.75e-03

Pos logit contributions
 elbow+0.026
ped+0.026
py+0.023
 powers+0.022
chie+0.022
Neg logit contributions
 Mrs-0.054
s-0.052
 about-0.051
 where-0.051
 Sometimes-0.051
Token,
Token ID11
Feature activation-1.93e+00

Pos logit contributions
 about+0.000
 booth+0.000
 Mrs+0.000
 where+0.000
 closely+0.000
Neg logit contributions
ped-0.000
orn-0.000
 elbow-0.000
row-0.000
py-0.000
Token including
Token ID1390
Feature activation-4.36e-01

Pos logit contributions
 booth+0.033
 Mrs+0.031
 closely+0.029
 Miss+0.029
 stern+0.028
Neg logit contributions
ped-0.062
orn-0.060
row-0.059
orns-0.058
iest-0.056
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
 Mrs+0.008
 where+0.007
 Miss+0.007
 they+0.007
 about+0.007
Neg logit contributions
ped-0.014
row-0.014
robat-0.014
orn-0.014
iest-0.013
Token special
Token ID2041
Feature activation+3.06e+00

Pos logit contributions
ped+0.039
ates+0.036
row+0.035
robat+0.034
cher+0.034
Neg logit contributions
 Mrs-0.035
 where-0.030
 closely-0.030
 Sometimes-0.030
 stern-0.029
Token white
Token ID2330
Feature activation+3.33e+00

Pos logit contributions
ped+0.073
ates+0.069
ags+0.068
cher+0.067
agged+0.065
Neg logit contributions
 Mrs-0.074
 where-0.074
 about-0.069
 what-0.068
 why-0.067
Token one
Token ID530
Feature activation+1.62e+00

Pos logit contributions
cher+0.079
ags+0.079
ands+0.076
ped+0.075
ates+0.074
Neg logit contributions
 and-0.075
 "-0.072
 where-0.072
 Mrs-0.072
 which-0.072
Token.
Token ID13
Feature activation-2.59e-01

Pos logit contributions
ped+0.057
 elbow+0.052
row+0.052
agons+0.052
ands+0.051
Neg logit contributions
 about-0.025
 where-0.025
 Mrs-0.024
 booth-0.023
 closely-0.023
Token<|endoftext|>
Token ID50256
Feature activation+1.33e+00

Pos logit contributions
 booth+0.004
 closely+0.003
 stern+0.003
 Mrs+0.003
 Miss+0.003
Neg logit contributions
ped-0.009
orn-0.009
orns-0.009
row-0.008
py-0.008
TokenOnce
Token ID7454
Feature activation-8.49e-01

Pos logit contributions
ped+0.050
orn+0.048
row+0.047
orns+0.046
iest+0.045
Neg logit contributions
 booth-0.015
 Mrs-0.014
 Miss-0.013
 closely-0.013
 stern-0.012
Token homes
Token ID5682
Feature activation-2.28e-01

Pos logit contributions
 booth+0.016
 Mrs+0.016
 stern+0.015
 Some+0.015
cases+0.015
Neg logit contributions
ped-0.015
orn-0.014
row-0.014
 powers-0.014
 elbow-0.014
Token to
Token ID284
Feature activation+6.48e-01

Pos logit contributions
 booth+0.004
 Mrs+0.003
 about+0.003
 where+0.003
 closely+0.003
Neg logit contributions
ped-0.008
orn-0.008
orns-0.007
row-0.007
py-0.007
Token stay
Token ID2652
Feature activation+2.02e+00

Pos logit contributions
ped+0.020
row+0.019
orn+0.019
orns+0.019
iest+0.018
Neg logit contributions
 Mrs-0.013
 where-0.013
 booth-0.013
 Miss-0.012
 about-0.012
Token dry
Token ID5894
Feature activation+2.00e+00

Pos logit contributions
ped+0.062
orn+0.060
orns+0.057
row+0.057
py+0.056
Neg logit contributions
 booth-0.037
 closely-0.037
 where-0.035
 Mrs-0.035
 about-0.035
Token and
Token ID290
Feature activation+2.61e+00

Pos logit contributions
orn+0.071
ped+0.070
orns+0.066
py+0.065
umper+0.065
Neg logit contributions
 about-0.028
 closely-0.028
 where-0.027
 booth-0.026
 they-0.025
Token cozy
Token ID37438
Feature activation+3.21e+00

Pos logit contributions
orn+0.085
orns+0.083
ped+0.082
row+0.079
umper+0.079
Neg logit contributions
 where-0.047
 they-0.046
 Mrs-0.046
 booth-0.044
 about-0.044
Token.
Token ID13
Feature activation-2.97e-01

Pos logit contributions
orn+0.115
ped+0.114
umper+0.112
py+0.107
orns+0.107
Neg logit contributions
 where-0.048
 about-0.044
 they-0.043
 closely-0.040
 booth-0.040
Token<|endoftext|>
Token ID50256
Feature activation+1.80e+00

Pos logit contributions
 salon+0.006
 booth+0.006
 closely+0.006
 stern+0.006
cases+0.006
Neg logit contributions
ped-0.008
orn-0.008
 himself-0.007
orns-0.007
ates-0.007
TokenOnce
Token ID7454
Feature activation-1.09e+00

Pos logit contributions
ped+0.071
row+0.067
orn+0.067
 elbow+0.065
orns+0.065
Neg logit contributions
 booth-0.018
 Mrs-0.017
 Miss-0.015
 closely-0.014
 about-0.013
Token upon
Token ID2402
Feature activation+6.53e-01

Pos logit contributions
 booth+0.026
 Mrs+0.023
 Miss+0.023
 stern+0.022
 bakery+0.022
Neg logit contributions
orn-0.027
ped-0.027
row-0.026
iest-0.025
py-0.024
Token a
Token ID257
Feature activation-7.79e-01

Pos logit contributions
ped+0.022
row+0.021
orn+0.021
orns+0.021
iest+0.020
Neg logit contributions
 booth-0.010
 Mrs-0.010
 closely-0.009
 Miss-0.009
 about-0.009
Token and
Token ID290
Feature activation-7.31e-01

Pos logit contributions
 booth+0.006
 Mrs+0.006
 closely+0.005
 Miss+0.005
 stern+0.005
Neg logit contributions
ped-0.013
orn-0.013
row-0.013
orns-0.012
iest-0.012
Token scared
Token ID12008
Feature activation-6.54e-02

Pos logit contributions
 booth+0.013
 Mrs+0.012
 closely+0.012
 Miss+0.012
 stern+0.011
Neg logit contributions
ped-0.023
row-0.021
orn-0.021
iest-0.020
py-0.020
Token because
Token ID780
Feature activation-1.89e+00

Pos logit contributions
 booth+0.001
 Mrs+0.001
 closely+0.001
 stern+0.001
 Miss+0.001
Neg logit contributions
ped-0.002
row-0.002
orn-0.002
iest-0.002
py-0.002
Token they
Token ID484
Feature activation-3.56e-01

Pos logit contributions
 booth+0.023
 closely+0.020
 Mrs+0.019
 stern+0.019
cases+0.017
Neg logit contributions
ped-0.067
iest-0.065
orn-0.062
row-0.062
 powers-0.061
Token had
Token ID550
Feature activation+9.73e-02

Pos logit contributions
 booth+0.007
 Mrs+0.007
 closely+0.007
 Miss+0.007
 stern+0.006
Neg logit contributions
ped-0.010
orn-0.010
row-0.010
orns-0.009
iest-0.009
Token never
Token ID1239
Feature activation+3.33e+00

Pos logit contributions
ped+0.003
orn+0.003
row+0.003
orns+0.003
iest+0.003
Neg logit contributions
 booth-0.002
 Mrs-0.002
 Miss-0.002
 closely-0.002
 stern-0.002
Token been
Token ID587
Feature activation+1.07e+00

Pos logit contributions
orn+0.097
orns+0.094
agons+0.094
robat+0.092
ates+0.091
Neg logit contributions
 about-0.072
 Mrs-0.068
 where-0.066
 booth-0.065
 Miss-0.063
Token kind
Token ID1611
Feature activation+9.47e-01

Pos logit contributions
ped+0.032
orn+0.031
row+0.030
orns+0.030
robat+0.029
Neg logit contributions
 booth-0.020
 Mrs-0.020
 Miss-0.019
 closely-0.018
 stern-0.018
Token to
Token ID284
Feature activation+6.27e-01

Pos logit contributions
ped+0.031
orn+0.031
row+0.030
orns+0.029
 elbow+0.028
Neg logit contributions
 booth-0.015
 Mrs-0.014
 closely-0.014
 Miss-0.014
 about-0.013
Token each
Token ID1123
Feature activation+1.32e+00

Pos logit contributions
ped+0.021
orn+0.020
row+0.020
orns+0.019
iest+0.019
Neg logit contributions
 Mrs-0.010
 booth-0.010
 Miss-0.010
 closely-0.009
 stern-0.009
Token other
Token ID584
Feature activation+2.68e-01

Pos logit contributions
ped+0.036
orn+0.035
row+0.034
orns+0.033
 elbow+0.032
Neg logit contributions
 booth-0.029
 Mrs-0.028
 Miss-0.026
 closely-0.026
 stern-0.025
Token looked
Token ID3114
Feature activation-6.35e-01

Pos logit contributions
orn+0.064
orns+0.063
ped+0.062
row+0.061
iest+0.060
Neg logit contributions
 Mrs-0.041
 booth-0.040
 Miss-0.039
 stern-0.036
 where-0.036
Token back
Token ID736
Feature activation+1.22e+00

Pos logit contributions
 booth+0.017
 Whoever+0.016
 Some+0.016
 Mrs+0.016
 salon+0.016
Neg logit contributions
ped-0.013
row-0.011
orn-0.011
 himself-0.011
ishing-0.011
Token.
Token ID13
Feature activation-2.15e-01

Pos logit contributions
ped+0.039
orn+0.037
row+0.036
orns+0.036
iest+0.035
Neg logit contributions
 booth-0.021
 Mrs-0.020
 closely-0.019
 Miss-0.019
 stern-0.018
Token The
Token ID383
Feature activation-3.49e-01

Pos logit contributions
 booth+0.003
 closely+0.003
 stern+0.003
 Mrs+0.003
cases+0.003
Neg logit contributions
ped-0.007
orn-0.007
orns-0.007
row-0.006
py-0.006
Token shadow
Token ID9082
Feature activation+5.53e-01

Pos logit contributions
 booth+0.008
 Mrs+0.007
 closely+0.007
 Miss+0.007
 stern+0.007
Neg logit contributions
ped-0.009
orn-0.009
row-0.009
orns-0.009
iest-0.008
Token waved
Token ID26834
Feature activation+5.55e-02

Pos logit contributions
ped+0.015
orn+0.014
row+0.014
orns+0.014
iest+0.013
Neg logit contributions
 booth-0.012
 Mrs-0.011
 closely-0.011
 Miss-0.011
 stern-0.011
Token goodbye
Token ID24829
Feature activation+6.34e-01

Pos logit contributions
ped+0.002
orn+0.001
row+0.001
orns+0.001
iest+0.001
Neg logit contributions
 booth-0.001
 Mrs-0.001
 closely-0.001
 Miss-0.001
 stern-0.001
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
ped+0.022
orn+0.021
row+0.021
orns+0.021
iest+0.020
Neg logit contributions
 booth-0.009
 Mrs-0.008
 closely-0.008
 Miss-0.008
 stern-0.008
Token disappeared
Token ID12120
Feature activation+2.14e+00

Pos logit contributions
ped+0.041
orn+0.040
row+0.039
orns+0.039
ates+0.037
Neg logit contributions
 booth-0.028
 Mrs-0.028
 Miss-0.026
 closely-0.025
 stern-0.025
Token in
Token ID287
Feature activation-3.88e-01

Pos logit contributions
orn+0.068
ped+0.066
orns+0.065
robat+0.063
umper+0.061
Neg logit contributions
 closely-0.038
 booth-0.038
 Mrs-0.037
 where-0.037
 about-0.036
Token the
Token ID262
Feature activation+4.00e-01

Pos logit contributions
 booth+0.006
 Mrs+0.006
 Miss+0.006
 closely+0.006
 stern+0.006
Neg logit contributions
ped-0.013
orn-0.012
row-0.012
orns-0.012
iest-0.011
Token repair
Token ID9185
Feature activation-2.59e-01

Pos logit contributions
ped+0.006
orn+0.006
orns+0.006
iest+0.006
row+0.006
Neg logit contributions
 where-0.004
 closely-0.004
 they-0.004
 Mrs-0.004
 booth-0.004
Token L
Token ID406
Feature activation-5.06e-02

Pos logit contributions
 booth+0.005
 Mrs+0.005
 closely+0.004
 Miss+0.004
 stern+0.004
Neg logit contributions
ped-0.008
orn-0.008
row-0.007
orns-0.007
py-0.007
Tokenizzy
Token ID40593
Feature activation-9.37e-01

Pos logit contributions
 booth+0.001
 closely+0.001
 Lisa+0.001
 about+0.001
abella+0.001
Neg logit contributions
ped-0.001
 elbow-0.001
orn-0.001
thank-0.001
orns-0.001
Token's
Token ID338
Feature activation+9.14e-01

Pos logit contributions
 booth+0.010
 closely+0.009
 Mrs+0.008
 stern+0.008
 Miss+0.007
Neg logit contributions
ped-0.036
orn-0.035
orns-0.034
row-0.034
iest-0.033
Token leg
Token ID1232
Feature activation+3.78e-01

Pos logit contributions
ped+0.025
iest+0.024
robat+0.024
orn+0.024
rob+0.024
Neg logit contributions
 booth-0.020
 stern-0.019
 closely-0.019
 grandma-0.019
 where-0.019
Token and
Token ID290
Feature activation+7.70e-01

Pos logit contributions
ped+0.014
orn+0.014
orns+0.013
iest+0.013
py+0.013
Neg logit contributions
 booth-0.004
 about-0.004
 where-0.004
 Mrs-0.004
 stern-0.004
Token used
Token ID973
Feature activation-6.44e-01

Pos logit contributions
orn+0.024
ped+0.024
row+0.023
orns+0.023
iest+0.022
Neg logit contributions
 Mrs-0.014
 booth-0.014
 Miss-0.013
 they-0.013
 closely-0.013
Token the
Token ID262
Feature activation+5.09e-01

Pos logit contributions
 closely+0.009
 Mrs+0.008
 booth+0.008
 where+0.008
 about+0.008
Neg logit contributions
orn-0.022
ped-0.022
row-0.022
robat-0.022
iest-0.021
Token string
Token ID4731
Feature activation+2.59e-01

Pos logit contributions
ped+0.014
row+0.013
orn+0.013
umper+0.013
robat+0.013
Neg logit contributions
 stern-0.011
 booth-0.011
 closely-0.010
 Mrs-0.010
 Sometimes-0.010
Token to
Token ID284
Feature activation-2.54e-01

Pos logit contributions
ped+0.009
orn+0.009
robat+0.009
orns+0.009
row+0.008
Neg logit contributions
 booth-0.003
 closely-0.003
 where-0.003
 about-0.003
 Mrs-0.003
Token tie
Token ID9839
Feature activation+1.51e-01

Pos logit contributions
 Mrs+0.005
 where+0.005
 closely+0.005
 they+0.005
 booth+0.005
Neg logit contributions
ped-0.007
orn-0.007
orns-0.007
iest-0.007
row-0.007
Token coming
Token ID2406
Feature activation+8.11e-01

Pos logit contributions
ped+0.009
orn+0.009
row+0.009
orns+0.008
iest+0.008
Neg logit contributions
 booth-0.008
 Mrs-0.007
 closely-0.007
 Miss-0.007
 stern-0.007
Token towards
Token ID3371
Feature activation+6.12e-01

Pos logit contributions
ped+0.023
orn+0.023
orns+0.022
row+0.022
iest+0.021
Neg logit contributions
 booth-0.016
 Mrs-0.015
 closely-0.015
 Miss-0.015
 stern-0.014
Token it
Token ID340
Feature activation+2.87e-01

Pos logit contributions
ped+0.020
orn+0.019
row+0.019
orns+0.019
iest+0.018
Neg logit contributions
 booth-0.010
 Mrs-0.009
 closely-0.009
 Miss-0.009
 stern-0.008
Token.
Token ID13
Feature activation+4.59e-01

Pos logit contributions
ped+0.009
orn+0.008
orns+0.008
row+0.008
 elbow+0.008
Neg logit contributions
 booth-0.006
 closely-0.006
 Mrs-0.005
 where-0.005
 about-0.005
Token The
Token ID383
Feature activation+1.01e+00

Pos logit contributions
ped+0.013
orn+0.013
row+0.012
 himself+0.012
orns+0.012
Neg logit contributions
 booth-0.008
 closely-0.008
 stern-0.007
cases-0.007
 horror-0.007
Token sw
Token ID1509
Feature activation+4.83e-01

Pos logit contributions
ped+0.023
orn+0.022
orns+0.021
row+0.021
iest+0.020
Neg logit contributions
 booth-0.027
 Mrs-0.025
 closely-0.025
 Miss-0.024
 stern-0.024
Tokenan
Token ID272
Feature activation+1.50e-01

Pos logit contributions
ped+0.010
orn+0.010
iest+0.010
orns+0.009
 elbow+0.009
Neg logit contributions
 booth-0.013
 closely-0.012
 stern-0.011
 where-0.011
 about-0.011
Token got
Token ID1392
Feature activation-6.09e-01

Pos logit contributions
ped+0.004
row+0.004
orn+0.004
py+0.003
orns+0.003
Neg logit contributions
 booth-0.003
 Mrs-0.003
 Miss-0.003
 Lisa-0.003
 closely-0.003
Token scared
Token ID12008
Feature activation+7.08e-01

Pos logit contributions
 booth+0.013
 Mrs+0.012
 Miss+0.011
 Lisa+0.011
 closely+0.011
Neg logit contributions
ped-0.018
row-0.016
orn-0.016
orns-0.015
ter-0.015
Token and
Token ID290
Feature activation+8.98e-01

Pos logit contributions
ped+0.025
orn+0.024
row+0.024
orns+0.024
py+0.023
Neg logit contributions
 booth-0.010
 Mrs-0.009
 closely-0.008
 Miss-0.008
 stern-0.008
Token wanted
Token ID2227
Feature activation-2.62e-01

Pos logit contributions
orn+0.025
orns+0.024
ped+0.024
iest+0.024
row+0.023
Neg logit contributions
 booth-0.019
 Mrs-0.019
 closely-0.018
 stern-0.018
 Miss-0.017
Token by
Token ID416
Feature activation+5.78e-01

Pos logit contributions
ped+0.051
orn+0.049
row+0.048
orns+0.047
 elbow+0.045
Neg logit contributions
 booth-0.038
 Mrs-0.037
 closely-0.036
 Miss-0.035
 stern-0.034
Token the
Token ID262
Feature activation-1.22e-01

Pos logit contributions
ped+0.019
orn+0.018
row+0.018
orns+0.017
iest+0.017
Neg logit contributions
 booth-0.010
 Mrs-0.009
 closely-0.009
 Miss-0.009
 stern-0.008
Token bright
Token ID6016
Feature activation+3.04e-01

Pos logit contributions
 booth+0.003
 Mrs+0.003
 closely+0.003
 Sometimes+0.003
 Their+0.003
Neg logit contributions
ped-0.003
orn-0.003
iest-0.003
row-0.003
 powers-0.003
Token colors
Token ID7577
Feature activation+3.99e-01

Pos logit contributions
ped+0.007
orn+0.007
row+0.007
iest+0.007
orns+0.006
Neg logit contributions
 booth-0.008
 Mrs-0.007
 closely-0.007
 Miss-0.007
abella-0.007
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
ped+0.013
orn+0.013
orns+0.013
row+0.013
 elbow+0.012
Neg logit contributions
 booth-0.006
 closely-0.006
 Mrs-0.006
 about-0.006
 Miss-0.005
Token funny
Token ID8258
Feature activation+1.27e+00

Pos logit contributions
ped+0.043
orn+0.042
row+0.041
orns+0.041
py+0.039
Neg logit contributions
 Mrs-0.026
 booth-0.026
 closely-0.025
 Miss-0.025
 about-0.024
Token shapes
Token ID15268
Feature activation+3.87e-01

Pos logit contributions
ped+0.036
orn+0.034
row+0.033
orns+0.032
iest+0.032
Neg logit contributions
 booth-0.027
 Mrs-0.025
 Miss-0.024
 closely-0.024
 stern-0.023
Token on
Token ID319
Feature activation+1.25e+00

Pos logit contributions
ped+0.012
orn+0.012
orns+0.011
row+0.011
 elbow+0.011
Neg logit contributions
 booth-0.007
 closely-0.007
 about-0.007
 Mrs-0.006
 where-0.006
Token the
Token ID262
Feature activation-1.32e+00

Pos logit contributions
ped+0.035
orn+0.034
row+0.033
orns+0.032
iest+0.032
Neg logit contributions
 booth-0.026
 Mrs-0.024
 Miss-0.023
 closely-0.023
 stern-0.023
Token phone
Token ID3072
Feature activation+3.76e-01

Pos logit contributions
 booth+0.025
 Mrs+0.023
 closely+0.022
 Miss+0.022
 stern+0.021
Neg logit contributions
ped-0.040
orn-0.038
row-0.038
orns-0.037
iest-0.036
Token's
Token ID338
Feature activation-9.98e-04

Pos logit contributions
ped+0.010
orn+0.010
row+0.010
orns+0.009
iest+0.009
Neg logit contributions
 booth-0.008
 Mrs-0.008
 closely-0.007
 Miss-0.007
 stern-0.007
Token needed
Token ID2622
Feature activation+4.76e-01

Pos logit contributions
orn+0.065
iest+0.064
ble+0.062
ped+0.062
robat+0.062
Neg logit contributions
 about-0.028
 booth-0.028
 Mrs-0.028
's-0.028
 closely-0.028
Token.
Token ID13
Feature activation-4.59e-02

Pos logit contributions
orn+0.017
ped+0.017
py+0.016
robat+0.016
 elbow+0.016
Neg logit contributions
 where-0.007
 about-0.007
 booth-0.006
 Mrs-0.006
 closely-0.006
Token
Token ID220
Feature activation+1.46e+00

Pos logit contributions
 booth+0.001
 stern+0.001
 closely+0.001
 salon+0.001
 divorce+0.001
Neg logit contributions
ped-0.001
 himself-0.001
orn-0.001
orns-0.001
row-0.001
Token
Token ID198
Feature activation-2.85e-01

Pos logit contributions
ped+0.055
orn+0.053
row+0.052
orns+0.051
iest+0.050
Neg logit contributions
 booth-0.017
 Mrs-0.016
 closely-0.014
 Miss-0.014
 stern-0.014
Token
Token ID198
Feature activation+6.92e-01

Pos logit contributions
 booth+0.005
 Mrs+0.004
 stern+0.004
 closely+0.004
 Miss+0.004
Neg logit contributions
ped-0.009
orn-0.009
row-0.009
orns-0.009
iest-0.008
TokenWhen
Token ID2215
Feature activation+9.42e-01

Pos logit contributions
ped+0.019
orn+0.019
row+0.018
orns+0.017
 himself+0.017
Neg logit contributions
 booth-0.015
 Mrs-0.014
 stern-0.013
 closely-0.013
 Miss-0.013
Token he
Token ID339
Feature activation-2.82e-02

Pos logit contributions
ped+0.032
orn+0.031
row+0.030
orns+0.030
iest+0.029
Neg logit contributions
 booth-0.015
 Mrs-0.013
 closely-0.013
 Miss-0.013
 stern-0.012
Token walked
Token ID6807
Feature activation-7.30e-01

Pos logit contributions
 booth+0.001
 Mrs+0.001
 closely+0.001
 Miss+0.001
 stern+0.001
Neg logit contributions
ped-0.001
orn-0.001
row-0.001
orns-0.001
iest-0.001
Token out
Token ID503
Feature activation-2.35e+00

Pos logit contributions
 booth+0.018
 Mrs+0.017
 Miss+0.016
 closely+0.016
 stern+0.016
Neg logit contributions
ped-0.018
orn-0.017
row-0.017
orns-0.016
iest-0.015
Token of
Token ID286
Feature activation-8.97e-01

Pos logit contributions
 booth+0.039
 Mrs+0.038
 closely+0.036
 Miss+0.035
 about+0.035
Neg logit contributions
ped-0.077
orn-0.074
row-0.072
orns-0.071
robat-0.069
Token the
Token ID262
Feature activation-8.09e-01

Pos logit contributions
 Mrs+0.013
 booth+0.012
 where+0.012
 Miss+0.012
 Lisa+0.012
Neg logit contributions
ped-0.032
row-0.031
orn-0.030
iest-0.030
orns-0.029
Token when
Token ID618
Feature activation+6.34e-01

Pos logit contributions
ped+0.005
row+0.005
orn+0.005
iest+0.005
orns+0.005
Neg logit contributions
 booth-0.003
 Mrs-0.003
 Miss-0.003
 closely-0.003
 stern-0.003
Token a
Token ID257
Feature activation+1.43e-02

Pos logit contributions
ped+0.017
orn+0.017
row+0.017
orns+0.016
robat+0.015
Neg logit contributions
 Mrs-0.014
 booth-0.014
 Miss-0.013
 closely-0.013
 stern-0.012
Token s
Token ID264
Feature activation+3.62e-01

Pos logit contributions
orn+0.000
ped+0.000
iest+0.000
row+0.000
 powers+0.000
Neg logit contributions
 booth-0.000
 closely-0.000
 Mrs-0.000
cases-0.000
 Miss-0.000
Tokeniren
Token ID24080
Feature activation+5.59e-01

Pos logit contributions
ped+0.008
orn+0.008
row+0.008
orns+0.008
iest+0.007
Neg logit contributions
 booth-0.009
 Mrs-0.009
 closely-0.009
 Miss-0.009
 stern-0.009
Token sounded
Token ID14846
Feature activation+3.04e-01

Pos logit contributions
ped+0.016
orn+0.016
row+0.015
orns+0.015
iest+0.014
Neg logit contributions
 booth-0.011
 Mrs-0.011
 Miss-0.010
 closely-0.010
 stern-0.010
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
ped+0.009
row+0.008
orn+0.008
iest+0.008
orns+0.008
Neg logit contributions
 booth-0.006
 Mrs-0.005
 Miss-0.005
 closely-0.005
 Lisa-0.005
Token making
Token ID1642
Feature activation-3.52e-01

Pos logit contributions
 booth+0.018
 Mrs+0.016
 closely+0.016
 Miss+0.015
 stern+0.015
Neg logit contributions
ped-0.038
orn-0.035
row-0.035
orns-0.034
py-0.033
Token both
Token ID1111
Feature activation+5.40e-01

Pos logit contributions
 booth+0.008
 Mrs+0.007
 closely+0.007
 Miss+0.007
 stern+0.007
Neg logit contributions
ped-0.010
orn-0.009
row-0.009
orns-0.009
iest-0.009
Token of
Token ID286
Feature activation-3.34e-01

Pos logit contributions
ped+0.015
row+0.013
orn+0.013
iest+0.013
py+0.013
Neg logit contributions
 booth-0.012
 closely-0.011
 Mrs-0.011
 Miss-0.010
abella-0.010
Token the
Token ID262
Feature activation-4.73e-02

Pos logit contributions
 Mrs+0.007
 booth+0.006
 Miss+0.006
 closely+0.006
 where+0.006
Neg logit contributions
ped-0.010
row-0.010
orn-0.010
iest-0.009
orns-0.009
Token characters
Token ID3435
Feature activation-1.29e+00

Pos logit contributions
 booth+0.001
 Mrs+0.001
 closely+0.001
 Miss+0.001
 Lisa+0.001
Neg logit contributions
ped-0.001
orn-0.001
row-0.001
orns-0.001
iest-0.001
Token and
Token ID290
Feature activation+5.97e-01

Pos logit contributions
 booth+0.029
 Mrs+0.027
 Miss+0.026
 closely+0.026
 stern+0.025
Neg logit contributions
ped-0.057
orn-0.055
row-0.054
orns-0.053
iest-0.052
Token fell
Token ID3214
Feature activation+1.77e-01

Pos logit contributions
ped+0.017
orn+0.016
row+0.016
orns+0.016
iest+0.015
Neg logit contributions
 booth-0.013
 Mrs-0.012
 Miss-0.012
 closely-0.011
 stern-0.011
Token to
Token ID284
Feature activation-5.96e-01

Pos logit contributions
orn+0.006
ped+0.005
atmeal+0.005
row+0.005
robat+0.005
Neg logit contributions
 where-0.003
 about-0.003
 they-0.003
 closely-0.003
 Miss-0.003
Token the
Token ID262
Feature activation-7.99e-01

Pos logit contributions
 booth+0.009
 Mrs+0.009
 closely+0.008
 Miss+0.008
 stern+0.008
Neg logit contributions
ped-0.020
orn-0.019
row-0.019
orns-0.019
iest-0.018
Token ground
Token ID2323
Feature activation-1.01e+00

Pos logit contributions
 booth+0.018
 Mrs+0.016
 Miss+0.015
 closely+0.015
 bakery+0.015
Neg logit contributions
ped-0.023
row-0.022
iest-0.021
orns-0.021
orn-0.021
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
 booth+0.024
 Mrs+0.023
 Miss+0.022
 closely+0.022
 stern+0.021
Neg logit contributions
ped-0.025
orn-0.024
row-0.024
orns-0.023
py-0.022
Token They
Token ID1119
Feature activation-9.45e-01

Pos logit contributions
 booth+0.022
 salon+0.019
 closely+0.018
cases+0.017
 Mrs+0.017
Neg logit contributions
ped-0.040
 himself-0.035
orns-0.033
row-0.033
orn-0.033
Token looked
Token ID3114
Feature activation-2.37e+00

Pos logit contributions
 booth+0.021
 Mrs+0.020
 closely+0.019
 Miss+0.019
 stern+0.018
Neg logit contributions
ped-0.026
orn-0.024
row-0.024
orns-0.023
iest-0.023
Token at
Token ID379
Feature activation-1.70e+00

Pos logit contributions
 booth+0.064
 salon+0.062
abella+0.061
 bakery+0.059
Mrs+0.059
Neg logit contributions
ped-0.053
ishing-0.043
 himself-0.043
orn-0.042
 mineral-0.042
Token each
Token ID1123
Feature activation+3.43e-01

Pos logit contributions
 booth+0.031
 Mrs+0.029
 stern+0.027
 Miss+0.027
cases+0.026
Neg logit contributions
ped-0.054
orn-0.051
row-0.049
orns-0.049
py-0.048
Token other
Token ID584
Feature activation-1.34e+00

Pos logit contributions
ped+0.009
orn+0.009
row+0.009
orns+0.008
iest+0.008
Neg logit contributions
 booth-0.008
 Mrs-0.007
 closely-0.007
 Miss-0.007
 stern-0.007
Token to
Token ID284
Feature activation+1.55e+00

Pos logit contributions
 booth+0.007
ews+0.007
cases+0.007
 divorce+0.006
 horror+0.006
Neg logit contributions
 himself-0.013
ped-0.012
row-0.012
py-0.012
ter-0.012
Token join
Token ID4654
Feature activation+8.47e-01

Pos logit contributions
ped+0.044
orn+0.043
orns+0.042
row+0.041
robat+0.039
Neg logit contributions
 booth-0.034
 Miss-0.033
 Mrs-0.033
 where-0.031
 closely-0.031
Token in
Token ID287
Feature activation-1.04e+00

Pos logit contributions
orn+0.025
ped+0.024
row+0.023
orns+0.023
 elbow+0.022
Neg logit contributions
 closely-0.017
 booth-0.017
 Miss-0.017
 Mrs-0.017
 Lisa-0.016
Token their
Token ID511
Feature activation+5.59e-01

Pos logit contributions
 booth+0.020
 Mrs+0.019
 closely+0.018
 Miss+0.018
 stern+0.017
Neg logit contributions
ped-0.031
orn-0.030
row-0.029
orns-0.029
iest-0.028
Token game
Token ID983
Feature activation-7.39e-01

Pos logit contributions
ped+0.014
orn+0.014
row+0.013
orns+0.013
iest+0.012
Neg logit contributions
 booth-0.013
 Mrs-0.013
 closely-0.012
 Miss-0.012
 stern-0.012
Token.
Token ID13
Feature activation-1.57e-01

Pos logit contributions
 booth+0.015
 Mrs+0.014
 Miss+0.014
 closely+0.014
 stern+0.013
Neg logit contributions
ped-0.021
orn-0.020
row-0.020
orns-0.020
py-0.019
Token Sam
Token ID3409
Feature activation-1.32e+00

Pos logit contributions
 booth+0.003
 closely+0.003
 salon+0.003
 stern+0.003
 divorce+0.003
Neg logit contributions
ped-0.004
orn-0.004
 himself-0.004
orns-0.004
row-0.003
Token smiled
Token ID13541
Feature activation+2.32e-01

Pos logit contributions
 booth+0.031
 Mrs+0.029
 Sometimes+0.028
 Miss+0.028
 billboards+0.027
Neg logit contributions
ped-0.034
row-0.030
orn-0.029
py-0.028
 powers-0.027
Token and
Token ID290
Feature activation+1.59e+00

Pos logit contributions
ped+0.008
orn+0.007
row+0.007
iest+0.007
py+0.007
Neg logit contributions
 booth-0.004
 Mrs-0.003
 Miss-0.003
 closely-0.003
 stern-0.003
Token kindly
Token ID26820
Feature activation+6.49e-01

Pos logit contributions
ped+0.048
orn+0.046
row+0.045
orns+0.044
iest+0.043
Neg logit contributions
 booth-0.031
 Mrs-0.029
 Miss-0.028
 closely-0.028
 stern-0.027
Token said
Token ID531
Feature activation-1.19e-01

Pos logit contributions
ped+0.019
orn+0.018
row+0.018
orns+0.018
iest+0.017
Neg logit contributions
 booth-0.013
 Mrs-0.012
 closely-0.012
 Miss-0.012
 stern-0.011
Token to
Token ID284
Feature activation+5.30e-01

Pos logit contributions
 booth+0.007
 Mrs+0.006
 closely+0.006
 stern+0.006
 Miss+0.006
Neg logit contributions
ped-0.012
orn-0.012
row-0.012
orns-0.012
iest-0.012
Token move
Token ID1445
Feature activation-5.37e-01

Pos logit contributions
orns+0.015
ped+0.015
orn+0.014
row+0.014
ates+0.014
Neg logit contributions
 closely-0.012
 Mrs-0.011
 stern-0.011
 booth-0.010
 Lisa-0.010
Token the
Token ID262
Feature activation-8.25e-01

Pos logit contributions
 booth+0.011
 Mrs+0.011
 Miss+0.010
cases+0.010
 stern+0.010
Neg logit contributions
ped-0.015
row-0.014
orn-0.014
iest-0.013
orns-0.013
Token items
Token ID3709
Feature activation-1.33e+00

Pos logit contributions
 booth+0.017
 closely+0.016
 Mrs+0.016
 stern+0.015
 bakery+0.014
Neg logit contributions
ped-0.024
orns-0.023
orn-0.023
row-0.023
umper-0.022
Token nearer
Token ID40671
Feature activation-3.31e-01

Pos logit contributions
 booth+0.028
 closely+0.027
 Mrs+0.026
 stern+0.025
 Miss+0.025
Neg logit contributions
ped-0.037
orn-0.036
orns-0.035
row-0.035
py-0.033
Token to
Token ID284
Feature activation-1.95e-01

Pos logit contributions
 booth+0.007
 Mrs+0.007
 closely+0.006
 Miss+0.006
 stern+0.006
Neg logit contributions
ped-0.009
orn-0.009
row-0.009
orns-0.008
iest-0.008
Token the
Token ID262
Feature activation-8.59e-01

Pos logit contributions
 booth+0.003
 Mrs+0.003
 closely+0.003
 Miss+0.003
 stern+0.003
Neg logit contributions
ped-0.007
orn-0.006
row-0.006
orns-0.006
iest-0.006
Token flame
Token ID16506
Feature activation-6.23e-01

Pos logit contributions
 booth+0.017
 Mrs+0.016
 closely+0.016
 Miss+0.015
 stern+0.015
Neg logit contributions
ped-0.026
orn-0.024
row-0.024
orns-0.024
py-0.023
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
 booth+0.013
 Mrs+0.012
 closely+0.012
 Miss+0.012
 stern+0.012
Neg logit contributions
ped-0.017
orn-0.017
orns-0.016
row-0.016
iest-0.015
Token As
Token ID1081
Feature activation-1.49e+00

Pos logit contributions
 booth+0.024
 closely+0.023
 stern+0.023
 salon+0.022
 divorce+0.021
Neg logit contributions
ped-0.036
 himself-0.034
ates-0.032
wood-0.032
ter-0.031
Token the
Token ID262
Feature activation+5.51e-01

Pos logit contributions
 booth+0.022
 stern+0.018
 Some+0.018
cases+0.018
 Mrs+0.018
Neg logit contributions
ped-0.048
iest-0.048
row-0.045
 himself-0.045
ter-0.045
Token out
Token ID503
Feature activation-9.47e-01

Pos logit contributions
 booth+0.006
 Mrs+0.005
 Miss+0.005
 closely+0.005
 stern+0.005
Neg logit contributions
ped-0.008
orn-0.008
row-0.007
orns-0.007
iest-0.007
Token what
Token ID644
Feature activation-4.64e-01

Pos logit contributions
 booth+0.020
 Mrs+0.019
 Miss+0.018
 stern+0.017
 Their+0.017
Neg logit contributions
ped-0.026
row-0.024
iest-0.024
orn-0.024
py-0.023
Token's
Token ID338
Feature activation-1.98e-01

Pos logit contributions
 booth+0.007
 Mrs+0.007
 closely+0.006
 Miss+0.006
 stern+0.006
Neg logit contributions
ped-0.016
orn-0.015
row-0.015
orns-0.014
iest-0.014
Token hiding
Token ID11816
Feature activation+1.49e+00

Pos logit contributions
 booth+0.004
 Mrs+0.004
 closely+0.004
 Miss+0.004
 about+0.004
Neg logit contributions
ped-0.006
orn-0.006
row-0.005
orns-0.005
iest-0.005
Token there
Token ID612
Feature activation+1.90e+00

Pos logit contributions
ped+0.043
orn+0.040
row+0.039
orns+0.039
 elbow+0.038
Neg logit contributions
 booth-0.032
 closely-0.029
 Mrs-0.029
 where-0.028
 Miss-0.028
Token."
Token ID526
Feature activation-5.35e-01

Pos logit contributions
ped+0.057
orn+0.054
 elbow+0.053
row+0.052
orns+0.051
Neg logit contributions
 booth-0.038
 Mrs-0.036
 about-0.036
 closely-0.035
 where-0.034
Token
Token ID220
Feature activation+4.44e-01

Pos logit contributions
 booth+0.008
 closely+0.007
 stern+0.007
 Mrs+0.007
cases+0.006
Neg logit contributions
ped-0.018
orn-0.017
orns-0.017
row-0.016
 himself-0.016
Token
Token ID198
Feature activation-3.24e-01

Pos logit contributions
ped+0.015
orn+0.014
row+0.014
orns+0.014
ates+0.013
Neg logit contributions
 booth-0.007
 Mrs-0.006
 closely-0.006
 stern-0.006
 Miss-0.006
Token
Token ID198
Feature activation-4.28e-01

Pos logit contributions
 booth+0.005
 Mrs+0.004
 closely+0.004
 Miss+0.004
 stern+0.004
Neg logit contributions
ped-0.011
orn-0.011
row-0.011
orns-0.010
iest-0.010
TokenBoth
Token ID10265
Feature activation-7.19e-01

Pos logit contributions
 booth+0.007
 Mrs+0.007
 closely+0.007
 about+0.007
 stern+0.007
Neg logit contributions
ped-0.013
orn-0.013
row-0.013
orns-0.012
robat-0.012
Token friends
Token ID2460
Feature activation-1.54e+00

Pos logit contributions
 booth+0.017
 Mrs+0.016
 closely+0.016
 Miss+0.015
 stern+0.015
Neg logit contributions
ped-0.019
orn-0.017
row-0.017
iest-0.016
orns-0.016
Tokenhes
Token ID956
Feature activation-1.19e-01

Pos logit contributions
ped+0.044
orn+0.042
row+0.042
orns+0.040
iest+0.039
Neg logit contributions
 booth-0.030
 Mrs-0.028
 closely-0.027
 Miss-0.027
 stern-0.026
Token were
Token ID547
Feature activation+2.40e-01

Pos logit contributions
 booth+0.002
 Mrs+0.002
 closely+0.002
 Miss+0.002
 stern+0.002
Neg logit contributions
ped-0.004
orn-0.003
row-0.003
iest-0.003
orns-0.003
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
ped+0.007
orn+0.007
row+0.007
orns+0.006
iest+0.006
Neg logit contributions
 booth-0.005
 Mrs-0.005
 closely-0.004
 Miss-0.004
 stern-0.004
Token They
Token ID1119
Feature activation-6.31e-01

Pos logit contributions
 divorce+0.020
 stern+0.020
 closely+0.019
 calendar+0.019
Kat+0.019
Neg logit contributions
ped-0.024
wood-0.022
 himself-0.022
aces-0.019
ags-0.019
Token looked
Token ID3114
Feature activation-2.71e+00

Pos logit contributions
 booth+0.016
 Mrs+0.015
 Miss+0.015
 Sometimes+0.014
 stern+0.014
Neg logit contributions
ped-0.015
row-0.013
orn-0.012
py-0.012
ter-0.012
Token over
Token ID625
Feature activation-3.20e+00

Pos logit contributions
 salon+0.086
 booth+0.082
abella+0.082
Mrs+0.082
oph+0.079
Neg logit contributions
ped-0.049
ishing-0.038
ter-0.038
 breaths-0.037
 breath-0.035
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
 booth+0.071
cases+0.067
 Mrs+0.064
 salon+0.062
ews+0.062
Neg logit contributions
ped-0.081
orns-0.073
 himself-0.072
iest-0.072
 powers-0.070
Token fence
Token ID13990
Feature activation-2.87e+00

Pos logit contributions
abella+0.022
 Rebecca+0.022
 Mrs+0.022
 Alice+0.021
cases+0.021
Neg logit contributions
ped-0.027
orns-0.026
row-0.025
orn-0.025
b-0.024
Token and
Token ID290
Feature activation-3.66e-01

Pos logit contributions
abella+0.057
 Mrs+0.056
 Miss+0.054
 booth+0.053
 divorce+0.053
Neg logit contributions
ped-0.076
row-0.071
ter-0.067
orn-0.066
iest-0.065
Token saw
Token ID2497
Feature activation-1.58e+00

Pos logit contributions
 booth+0.008
 batch+0.008
 stern+0.007
 Sometimes+0.007
cases+0.007
Neg logit contributions
ped-0.009
ter-0.008
 himself-0.008
 elbow-0.007
row-0.007
Token some
Token ID617
Feature activation+3.62e-01

Pos logit contributions
 booth+0.038
 closely+0.038
cases+0.036
 Whoever+0.036
 stern+0.035
Neg logit contributions
ped-0.038
ter-0.033
 himself-0.031
iest-0.029
ble-0.029
Token done
Token ID1760
Feature activation-3.05e-01

Pos logit contributions
ped+0.009
orn+0.009
row+0.009
orns+0.009
iest+0.009
Neg logit contributions
 booth-0.005
 Mrs-0.005
 closely-0.005
 Miss-0.005
 stern-0.005
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
 booth+0.006
 Mrs+0.006
 closely+0.006
 Miss+0.005
 stern+0.005
Neg logit contributions
ped-0.009
orn-0.009
row-0.008
orns-0.008
py-0.008
Token Mom
Token ID11254
Feature activation-1.45e+00

Pos logit contributions
ped+0.007
orn+0.006
row+0.006
orns+0.006
py+0.006
Neg logit contributions
 booth-0.003
 closely-0.003
 Mrs-0.003
 stern-0.003
cases-0.003
Token opened
Token ID4721
Feature activation-1.75e+00

Pos logit contributions
 booth+0.033
 Mrs+0.030
 Miss+0.030
 closely+0.030
 stern+0.029
Neg logit contributions
ped-0.039
orn-0.036
row-0.036
orns-0.034
py-0.034
Token the
Token ID262
Feature activation-1.79e+00

Pos logit contributions
 booth+0.041
cases+0.038
 Miss+0.035
 divorce+0.035
 salon+0.035
Neg logit contributions
ped-0.044
 himself-0.039
 powers-0.037
orn-0.037
orns-0.037
Token oven
Token ID14361
Feature activation-3.02e+00

Pos logit contributions
 booth+0.034
 Mrs+0.034
 Miss+0.032
 closely+0.032
 stern+0.030
Neg logit contributions
ped-0.051
orn-0.050
row-0.048
orns-0.047
iest-0.046
Token.
Token ID13
Feature activation-9.77e-01

Pos logit contributions
 Miss+0.050
 Mrs+0.050
 booth+0.050
abella+0.049
 Sometimes+0.049
Neg logit contributions
ped-0.086
row-0.084
orn-0.080
iest-0.078
orns-0.076
Token She
Token ID1375
Feature activation-3.12e-01

Pos logit contributions
 booth+0.018
 closely+0.018
 salon+0.017
cases+0.017
 divorce+0.016
Neg logit contributions
ped-0.026
 himself-0.024
orn-0.023
orns-0.022
row-0.022
Token took
Token ID1718
Feature activation-8.82e-01

Pos logit contributions
 booth+0.007
 Miss+0.006
 Sometimes+0.006
 closely+0.006
 billboards+0.006
Neg logit contributions
ped-0.008
row-0.007
orn-0.007
 elbow-0.007
orns-0.007
Token out
Token ID503
Feature activation-2.30e+00

Pos logit contributions
 booth+0.020
 closely+0.018
 Miss+0.018
 stern+0.017
 salon+0.017
Neg logit contributions
ped-0.024
orn-0.021
row-0.021
iest-0.020
orns-0.020
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
 booth+0.064
 Whoever+0.058
 salon+0.058
ensive+0.057
 Maybe+0.057
Neg logit contributions
ped-0.048
 himself-0.044
 powers-0.039
row-0.038
iest-0.038
Token up
Token ID510
Feature activation-3.18e+00

Pos logit contributions
 booth+0.041
 Mrs+0.037
 Miss+0.036
 stern+0.035
cases+0.033
Neg logit contributions
ped-0.059
row-0.056
orn-0.055
orns-0.055
iest-0.054
Token the
Token ID262
Feature activation-1.24e+00

Pos logit contributions
 booth+0.081
 Mrs+0.069
 Miss+0.069
 stern+0.068
abella+0.067
Neg logit contributions
ped-0.076
orns-0.070
 himself-0.067
iest-0.066
row-0.066
Token blocks
Token ID7021
Feature activation-2.46e+00

Pos logit contributions
 booth+0.033
 Mrs+0.032
 Miss+0.030
 closely+0.030
 stern+0.029
Neg logit contributions
ped-0.028
orn-0.027
row-0.026
orns-0.026
iest-0.025
Token and
Token ID290
Feature activation-5.42e-01

Pos logit contributions
 booth+0.043
 Mrs+0.041
 Miss+0.039
 closely+0.037
 stern+0.037
Neg logit contributions
ped-0.077
row-0.073
orn-0.073
orns-0.070
iest-0.070
Token started
Token ID2067
Feature activation-2.02e+00

Pos logit contributions
 booth+0.012
 Mrs+0.012
 closely+0.011
 Miss+0.011
 stern+0.011
Neg logit contributions
ped-0.015
orn-0.014
row-0.013
orns-0.013
iest-0.013
Token over
Token ID625
Feature activation-2.99e+00

Pos logit contributions
 booth+0.044
 Mrs+0.041
 Miss+0.039
 closely+0.038
 stern+0.037
Neg logit contributions
ped-0.056
row-0.052
orn-0.051
orns-0.050
 elbow-0.049
Token.
Token ID13
Feature activation-1.92e+00

Pos logit contributions
 booth+0.060
 Mrs+0.055
 Miss+0.053
 stern+0.051
 closely+0.051
Neg logit contributions
ped-0.086
orn-0.082
orns-0.080
row-0.080
iest-0.078
Token This
Token ID770
Feature activation-6.87e-01

Pos logit contributions
 booth+0.043
 stern+0.038
cases+0.037
 pastor+0.036
 closely+0.036
Neg logit contributions
ped-0.044
thank-0.039
ter-0.039
ags-0.038
 himself-0.037
Token time
Token ID640
Feature activation-8.59e-01

Pos logit contributions
 booth+0.019
 Mrs+0.018
 Miss+0.017
 stern+0.017
 closely+0.017
Neg logit contributions
orn-0.014
iest-0.014
ped-0.014
row-0.013
 powers-0.013
Token,
Token ID11
Feature activation-1.41e+00

Pos logit contributions
 booth+0.023
ensive+0.021
 closely+0.021
Mrs+0.020
 stern+0.020
Neg logit contributions
ped-0.017
iest-0.017
row-0.016
ter-0.015
 powers-0.015
Token Lily
Token ID20037
Feature activation-1.59e+00

Pos logit contributions
 booth+0.026
cases+0.023
ews+0.023
aney+0.021
 stern+0.021
Neg logit contributions
ter-0.042
ped-0.039
 elbow-0.038
 himself-0.036
ble-0.035
Token big
Token ID1263
Feature activation-1.63e+00

Pos logit contributions
 booth+0.032
 Mrs+0.030
 Miss+0.030
 closely+0.030
cases+0.028
Neg logit contributions
ped-0.031
orn-0.030
row-0.029
iest-0.028
orns-0.027
Token bag
Token ID6131
Feature activation-8.65e-01

Pos logit contributions
 booth+0.041
 Mrs+0.039
 Miss+0.038
 closely+0.038
 stern+0.036
Neg logit contributions
ped-0.038
orn-0.037
row-0.036
orns-0.034
iest-0.034
Token of
Token ID286
Feature activation+9.03e-01

Pos logit contributions
 booth+0.015
 Mrs+0.014
 closely+0.013
 Miss+0.013
 stern+0.013
Neg logit contributions
ped-0.027
orn-0.026
row-0.026
orns-0.025
iest-0.025
Token broccoli
Token ID44653
Feature activation-5.36e-01

Pos logit contributions
ped+0.025
orn+0.024
row+0.024
py+0.023
orns+0.022
Neg logit contributions
 Mrs-0.019
 booth-0.019
 Miss-0.018
 stern-0.018
 closely-0.018
Token.
Token ID13
Feature activation-7.12e-01

Pos logit contributions
 booth+0.008
 Mrs+0.007
 closely+0.007
 where+0.007
 about+0.006
Neg logit contributions
ped-0.019
orn-0.018
orns-0.018
row-0.018
py-0.018
Token Mia
Token ID32189
Feature activation-2.98e+00

Pos logit contributions
 booth+0.015
cases+0.014
 divorce+0.013
 closely+0.012
 billboards+0.012
Neg logit contributions
ped-0.019
 himself-0.018
aces-0.017
 elbow-0.017
orn-0.016
Token opens
Token ID9808
Feature activation-1.54e+00

Pos logit contributions
 booth+0.059
 Miss+0.055
 Mrs+0.053
 Aunt+0.050
cases+0.050
Neg logit contributions
ped-0.087
orn-0.083
orns-0.079
 elbow-0.079
ery-0.077
Token the
Token ID262
Feature activation-1.15e+00

Pos logit contributions
 booth+0.037
cases+0.036
oke+0.033
 Miss+0.033
 divorce+0.033
Neg logit contributions
ped-0.038
 himself-0.034
orn-0.033
orns-0.032
 elbow-0.031
Token bag
Token ID6131
Feature activation-1.66e+00

Pos logit contributions
 booth+0.023
 Mrs+0.022
 Miss+0.022
 closely+0.021
 Lisa+0.020
Neg logit contributions
ped-0.033
orn-0.032
row-0.031
orns-0.030
iest-0.030
Token and
Token ID290
Feature activation-9.56e-01

Pos logit contributions
 booth+0.025
 Mrs+0.025
 Miss+0.024
 closely+0.022
 stern+0.022
Neg logit contributions
ped-0.054
row-0.053
orn-0.052
orns-0.050
iest-0.049
Token sees
Token ID7224
Feature activation-1.52e+00

Pos logit contributions
 booth+0.018
 Miss+0.017
 Mrs+0.016
abella+0.016
 divorce+0.016
Neg logit contributions
ped-0.026
 elbow-0.025
orn-0.024
orns-0.024
 seaw-0.023
Token and
Token ID290
Feature activation-6.13e-01

Pos logit contributions
 booth+0.042
abella+0.038
 Mrs+0.037
 Sophia+0.034
 Miss+0.034
Neg logit contributions
ped-0.062
row-0.057
py-0.054
 himself-0.054
ping-0.053
Token came
Token ID1625
Feature activation-1.90e-01

Pos logit contributions
 booth+0.017
 closely+0.015
 Mrs+0.015
 bakery+0.015
 Sometimes+0.015
Neg logit contributions
ped-0.014
row-0.012
orn-0.012
py-0.012
ter-0.011
Token outside
Token ID2354
Feature activation-1.85e+00

Pos logit contributions
 booth+0.003
 Mrs+0.003
 closely+0.003
 Miss+0.003
 where+0.003
Neg logit contributions
ped-0.006
orn-0.006
row-0.006
iest-0.006
orns-0.006
Token.
Token ID13
Feature activation-2.60e+00

Pos logit contributions
 booth+0.050
 Mrs+0.045
cases+0.045
abella+0.044
 Miss+0.044
Neg logit contributions
ped-0.040
row-0.036
orn-0.034
 himself-0.034
py-0.034
Token She
Token ID1375
Feature activation-7.94e-01

Pos logit contributions
 booth+0.064
abella+0.059
 closely+0.058
cases+0.058
abel+0.056
Neg logit contributions
ped-0.056
orns-0.052
orn-0.047
 himself-0.047
row-0.047
Token saw
Token ID2497
Feature activation-2.96e+00

Pos logit contributions
 booth+0.015
 closely+0.014
 Mrs+0.014
 Miss+0.013
cases+0.013
Neg logit contributions
ped-0.024
row-0.022
orn-0.022
orns-0.021
py-0.021
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
 booth+0.067
case+0.064
cases+0.061
ews+0.060
 closely+0.059
Neg logit contributions
ped-0.072
orns-0.071
ter-0.070
 himself-0.067
py-0.063
Token wagon
Token ID31860
Feature activation-1.40e+00

Pos logit contributions
 booth+0.022
 Mrs+0.021
 Miss+0.020
 closely+0.020
 stern+0.020
Neg logit contributions
ped-0.033
row-0.031
orn-0.031
orns-0.030
iest-0.029
Token and
Token ID290
Feature activation-9.23e-01

Pos logit contributions
 booth+0.025
 Mrs+0.024
 Miss+0.023
 closely+0.022
abella+0.021
Neg logit contributions
ped-0.043
orn-0.041
row-0.041
orns-0.039
py-0.038
Token the
Token ID262
Feature activation-9.90e-01

Pos logit contributions
 booth+0.020
 closely+0.019
cases+0.018
 Sometimes+0.017
 bakery+0.017
Neg logit contributions
ped-0.026
row-0.023
orns-0.022
orn-0.022
 elbow-0.022
Token mud
Token ID17492
Feature activation-1.14e+00

Pos logit contributions
 booth+0.021
 Mrs+0.020
 stern+0.019
 Miss+0.019
 closely+0.019
Neg logit contributions
ped-0.028
row-0.027
orn-0.027
iest-0.025
orns-0.025