TOP ACTIVATIONS
MAX = 0.104

get stuck , and handed Sammy a butter knife . G
glass of juice and gave Jill a big smile .
's house , Tom showed Sam the big gem . Sam
Mom , can I invite Sam and Mia and Ben to
neck and ran to show Sam . Sam was
Then , Jack showed Sam how to look carefully before
. He offered to lend Mia his car so she could
. Then , Tim showed Sam his new seat . "
opened the bag and showed Emily the big box inside .
and asked him to give Sam his bear back . She
; he wanted to show Sam his famous toy too .
man said yes and showed Sam how to use it .
Suddenly , the bee gave Sandy a shock ! It st
the garden and then gave Jenny a big hug .
. She wants to show Luna the birds . She picks
They all cheered and gave Janet a big hug . From
Anna was happy and gave Luna a hug . Luna pur
friends . He even gave Rachel a lovely blue feather as
angry , but then gave Sandy some of their oats .
ily sniff les and shows Ben her knee . It has

MIN ACTIVATIONS
MIN = -0.105

to play outside in the sunny weather . One day he
lay it down in the deep , green grass .
flower , growing in the sunny garden . Every day ,
to play outside in the sunny weather . One day ,
was very content in the sunny me adow , where she
als sh ined in the sunny sky . One
It will glow in the dark and scare away the bad
to the poppy in the sunny field . She put the
would sit together in the sunny garden , and grand pa
wizard was walking through a sunny me adow when he spotted
it moved around in the sunny part of the ocean ,
many dangers lurking in the deep blue sea . But he
So they stayed in the sunny field until it was time
liked to relax in the sunny me adow .
play together in the big sunny garden . <|endoftext|> One day
am further out into the deep ocean . Sure enough ,
saw it shining in the dark . Pluto was not ugly
They come out in the dark and use their big eyes
, and skip in the sunny me adow .
saw a jet in the deep sky . The little boy

INTERVAL 0.052 - 0.104
CONTAINS 0.602%

Mom asks . " Why are you fighting ?"
bou quet . They give it to Tom . They say
. Lily agreed and handed Ros ie over . While her
, all because Max brought them together . <|endoftext|> One day
it . " Look ! I 'm holding a

INTERVAL -0.000 - 0.052
CONTAINS 50.232%

silence anymore . <|endoftext|> Once upon a time , there was
Suddenly , an incredible thing happened . The cloud started to
. " Here , Sue , this is corn
" Don 't worry , Lily . You can try again
have to come closer and listen . L ila

INTERVAL -0.053 - -0.000
CONTAINS 48.723%

cannot play with them . Go away !" Sam felt sad
! He was so sad that he thought he
Tom . You did not win the prize ." Tom was
out and enjoy the sunny day ." With a smile on
the people in the neighborhood stay safe . Julie and Max

INTERVAL -0.105 - -0.053
CONTAINS 0.444%

go away for a few days . So , Sarah carefully
She kept it warm and cozy , under her blanket .
many fish swimming in the night . Emma was so excited
boy kept walking for a long time , until he eventually
salad so much . From then on , Fl oppy asked
Token get
Token ID651
Feature activation+2.99e-02

Pos logit contributions
rob+0.015
akes+0.014
 slopes+0.014
 evenings+0.014
field+0.014
Neg logit contributions
 anyone-0.011
 sure-0.011
 okay-0.009
 worse-0.009
 up-0.009
Token stuck
Token ID7819
Feature activation-1.34e-02

Pos logit contributions
ust+0.045
amus+0.042
uted+0.039
 about+0.039
 awful+0.039
Neg logit contributions
 hill-0.043
thank-0.043
-0.042
 slopes-0.042
 evenings-0.041
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
 hill+0.024
 slope+0.024
 slopes+0.023
+0.023
field+0.023
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.017
 closely-0.016
 worse-0.016
Token and
Token ID290
Feature activation-7.76e-04

Pos logit contributions
 hill+0.018
 slope+0.018
 slopes+0.017
field+0.017
+0.017
Neg logit contributions
 anyone-0.016
 sure-0.016
 up-0.015
 okay-0.014
 closely-0.014
Token handed
Token ID10158
Feature activation+4.07e-02

Pos logit contributions
 hill+0.001
 slope+0.001
 slopes+0.001
+0.001
field+0.001
Neg logit contributions
 anyone-0.001
 sure-0.001
 up-0.001
 worse-0.001
 closely-0.001
Token Sammy
Token ID44067
Feature activation+1.04e-01

Pos logit contributions
ust+0.065
 awful+0.061
 unfair+0.059
 worse+0.055
 okay+0.055
Neg logit contributions
 hill-0.066
 slopes-0.061
 evenings-0.061
-0.061
field-0.060
Token a
Token ID257
Feature activation+2.05e-02

Pos logit contributions
 anyone+0.155
 unfair+0.139
 awful+0.139
iqu+0.131
 anybody+0.131
Neg logit contributions
 hill-0.171
 slopes-0.154
 slope-0.154
 distance-0.151
thank-0.149
Token butter
Token ID9215
Feature activation+1.83e-02

Pos logit contributions
 anyone+0.038
 sure+0.036
 up+0.034
 okay+0.033
 worse+0.033
Neg logit contributions
 hill-0.029
 slope-0.028
 slopes-0.028
field-0.028
-0.028
Token knife
Token ID9845
Feature activation+2.81e-02

Pos logit contributions
 anyone+0.027
 sure+0.026
 up+0.024
 okay+0.023
 worse+0.022
Neg logit contributions
 hill-0.033
 slope-0.032
 slopes-0.032
-0.031
field-0.031
Token.
Token ID13
Feature activation-9.98e-03

Pos logit contributions
 anyone+0.043
 sure+0.041
 up+0.037
 worse+0.035
 okay+0.035
Neg logit contributions
 hill-0.049
 slope-0.047
 slopes-0.046
-0.046
field-0.046
Token G
Token ID402
Feature activation+4.38e-02

Pos logit contributions
 hill+0.019
 slope+0.018
robat+0.018
asis+0.018
thank+0.018
Neg logit contributions
 sure-0.014
 anyone-0.014
 up-0.013
 okay-0.011
 closely-0.011
Token glass
Token ID5405
Feature activation+1.26e-03

Pos logit contributions
thank+0.047
akes+0.047
 slopes+0.047
ursion+0.046
Now+0.045
Neg logit contributions
 up-0.047
 anyone-0.043
 sure-0.042
 closely-0.042
 okay-0.039
Token of
Token ID286
Feature activation-1.76e-02

Pos logit contributions
 up+0.002
 anyone+0.002
 sure+0.002
 closely+0.001
 about+0.001
Neg logit contributions
 slopes-0.002
-0.002
 evenings-0.002
thank-0.002
 slope-0.002
Token juice
Token ID13135
Feature activation-3.58e-03

Pos logit contributions
 hill+0.029
 slope+0.029
field+0.028
+0.028
 slopes+0.028
Neg logit contributions
 anyone-0.028
 sure-0.027
 up-0.026
 worse-0.024
 closely-0.024
Token and
Token ID290
Feature activation-1.41e-02

Pos logit contributions
 slopes+0.007
 evenings+0.007
ursion+0.006
 slope+0.006
 hallway+0.006
Neg logit contributions
 up-0.005
 anyone-0.005
 sure-0.004
 about-0.004
 closely-0.004
Token gave
Token ID2921
Feature activation+2.21e-02

Pos logit contributions
 hill+0.026
 slope+0.025
+0.025
 slopes+0.025
field+0.025
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.018
 closely-0.016
 worse-0.016
Token Jill
Token ID22772
Feature activation+1.01e-01

Pos logit contributions
ust+0.035
 worse+0.031
iqu+0.031
ares+0.030
 unfair+0.030
Neg logit contributions
 evenings-0.033
thank-0.033
 morning-0.033
 afternoon-0.031
 days-0.031
Token a
Token ID257
Feature activation+2.40e-02

Pos logit contributions
 anyone+0.120
 worse+0.111
 awful+0.105
 ok+0.105
ares+0.105
Neg logit contributions
 hill-0.189
 slopes-0.183
 evenings-0.181
 afternoon-0.179
thank-0.179
Token big
Token ID1263
Feature activation-3.23e-03

Pos logit contributions
 anyone+0.038
 sure+0.035
 up+0.032
 worse+0.032
 closely+0.031
Neg logit contributions
 hill-0.041
-0.039
 slope-0.039
 hallway-0.038
field-0.038
Token smile
Token ID8212
Feature activation-1.13e-02

Pos logit contributions
field+0.005
 slopes+0.005
rob+0.005
akes+0.005
forest+0.005
Neg logit contributions
 up-0.005
 sure-0.005
 anyone-0.005
 okay-0.005
 closely-0.004
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
 hill+0.019
 slope+0.019
+0.018
field+0.018
 slopes+0.018
Neg logit contributions
 anyone-0.018
 sure-0.017
 up-0.015
 worse-0.015
 closely-0.015
Token
Token ID198
Feature activation-8.46e-03

Pos logit contributions
 hill+0.032
field+0.032
 slope+0.032
robat+0.032
rob+0.031
Neg logit contributions
 anyone-0.027
 sure-0.026
 up-0.025
 closely-0.023
 about-0.023
Token's
Token ID338
Feature activation+4.91e-03

Pos logit contributions
 anyone+0.029
 sure+0.026
 worse+0.023
 awful+0.023
ust+0.023
Neg logit contributions
 hill-0.071
-0.070
field-0.067
forest-0.066
 hallway-0.066
Token house
Token ID2156
Feature activation-2.59e-02

Pos logit contributions
 anyone+0.012
 sure+0.011
 okay+0.011
 worse+0.011
 ok+0.011
Neg logit contributions
 hill-0.005
-0.004
field-0.004
 hallway-0.004
 street-0.004
Token,
Token ID11
Feature activation+1.06e-02

Pos logit contributions
akes+0.036
robat+0.035
 slope+0.035
rob+0.035
ursion+0.034
Neg logit contributions
 anyone-0.053
 up-0.051
 sure-0.047
 about-0.046
 okay-0.045
Token Tom
Token ID4186
Feature activation+2.06e-02

Pos logit contributions
ust+0.019
 awful+0.019
 sure+0.019
 anyone+0.018
iqu+0.018
Neg logit contributions
 hill-0.015
 slope-0.013
 slopes-0.012
 street-0.012
 sandbox-0.012
Token showed
Token ID3751
Feature activation+4.48e-02

Pos logit contributions
 anyone+0.036
 sure+0.035
 up+0.033
 closely+0.031
 okay+0.031
Neg logit contributions
 hill-0.031
 slope-0.031
 slopes-0.030
-0.030
field-0.029
Token Sam
Token ID3409
Feature activation+9.97e-02

Pos logit contributions
ust+0.095
 ok+0.088
 okay+0.087
umes+0.084
 awful+0.084
Neg logit contributions
 recess-0.047
 hill-0.045
 slopes-0.041
 course-0.040
loo-0.038
Token the
Token ID262
Feature activation+2.78e-02

Pos logit contributions
 anyone+0.162
 ok+0.161
ust+0.159
iqu+0.157
 unfair+0.156
Neg logit contributions
 hill-0.143
,-0.130
 slopes-0.120
 street-0.119
 sandbox-0.117
Token big
Token ID1263
Feature activation-1.51e-02

Pos logit contributions
 anyone+0.049
 sure+0.048
 worse+0.043
 up+0.042
 closely+0.042
Neg logit contributions
 hill-0.043
 slope-0.038
 street-0.037
thank-0.037
-0.037
Token gem
Token ID16840
Feature activation+4.40e-02

Pos logit contributions
field+0.025
+0.025
 slope+0.025
 slopes+0.025
akes+0.025
Neg logit contributions
 anyone-0.024
 up-0.022
 sure-0.022
 closely-0.020
 okay-0.020
Token.
Token ID13
Feature activation-9.53e-03

Pos logit contributions
 anyone+0.083
 sure+0.079
 ok+0.076
 okay+0.075
 worse+0.073
Neg logit contributions
 hill-0.064
field-0.055
 slope-0.054
 slopes-0.053
 street-0.053
Token Sam
Token ID3409
Feature activation+2.30e-02

Pos logit contributions
 hill+0.017
field+0.017
 slope+0.017
 slopes+0.017
+0.017
Neg logit contributions
 anyone-0.014
 sure-0.013
 up-0.012
 closely-0.011
 okay-0.011
TokenMom
Token ID29252
Feature activation+4.78e-03

Pos logit contributions
 hill+0.029
field+0.028
 slope+0.028
 slopes+0.028
asis+0.027
Neg logit contributions
 anyone-0.023
 sure-0.021
 up-0.020
 okay-0.018
 ok-0.018
Token,
Token ID11
Feature activation+4.82e-03

Pos logit contributions
 anyone+0.010
 sure+0.009
 up+0.009
 okay+0.009
 what+0.009
Neg logit contributions
 slope-0.006
 slopes-0.006
field-0.006
 hill-0.006
robat-0.006
Token can
Token ID460
Feature activation+1.64e-02

Pos logit contributions
 anyone+0.008
 sure+0.008
 up+0.008
 closely+0.007
 worse+0.007
Neg logit contributions
 hill-0.007
 slope-0.007
-0.007
 slopes-0.007
field-0.007
Token I
Token ID314
Feature activation-2.07e-05

Pos logit contributions
 anyone+0.032
 sure+0.032
 up+0.030
 closely+0.029
 worse+0.029
Neg logit contributions
 hill-0.021
-0.020
 slope-0.019
field-0.018
 slopes-0.018
Token invite
Token ID14037
Feature activation+3.59e-02

Pos logit contributions
 slope+0.000
 slopes+0.000
 hill+0.000
field+0.000
 hallway+0.000
Neg logit contributions
 anyone-0.000
 sure-0.000
 up-0.000
 okay-0.000
 closely-0.000
Token Sam
Token ID3409
Feature activation+9.96e-02

Pos logit contributions
 sure+0.058
ust+0.055
 anyone+0.054
iqu+0.053
 worse+0.053
Neg logit contributions
 hill-0.051
 recess-0.048
-0.048
 evenings-0.048
 slopes-0.047
Token and
Token ID290
Feature activation+3.28e-02

Pos logit contributions
 anyone+0.162
 sure+0.153
 awful+0.149
iqu+0.146
 closely+0.142
Neg logit contributions
 hill-0.151
forest-0.141
-0.141
thank-0.138
field-0.138
Token Mia
Token ID32189
Feature activation+8.10e-02

Pos logit contributions
 anyone+0.042
 sure+0.039
 up+0.036
 worse+0.035
 closely+0.035
Neg logit contributions
 hill-0.065
 slope-0.061
 slopes-0.061
 hallway-0.060
field-0.060
Token and
Token ID290
Feature activation+3.24e-02

Pos logit contributions
 anyone+0.130
 sure+0.123
 up+0.112
 worse+0.109
 closely+0.109
Neg logit contributions
 hill-0.134
 slope-0.126
field-0.125
-0.124
 slopes-0.123
Token Ben
Token ID3932
Feature activation+7.13e-02

Pos logit contributions
 anyone+0.042
 sure+0.039
 up+0.036
 closely+0.034
 worse+0.034
Neg logit contributions
 hill-0.064
 slope-0.061
 slopes-0.060
field-0.060
-0.060
Token to
Token ID284
Feature activation-1.21e-02

Pos logit contributions
 anyone+0.130
 sure+0.124
 up+0.116
 closely+0.112
 worse+0.112
Neg logit contributions
 hill-0.103
 slope-0.098
field-0.096
 slopes-0.095
-0.095
Token neck
Token ID7393
Feature activation+8.44e-03

Pos logit contributions
 anyone+0.013
 sure+0.012
 up+0.012
 okay+0.011
 closely+0.011
Neg logit contributions
 hill-0.011
 slope-0.011
 slopes-0.011
-0.011
thank-0.010
Token and
Token ID290
Feature activation-2.57e-02

Pos logit contributions
 anyone+0.017
 sure+0.016
 okay+0.015
 worse+0.015
 ok+0.015
Neg logit contributions
 hill-0.011
 slope-0.010
 slopes-0.010
field-0.010
forest-0.010
Token ran
Token ID4966
Feature activation+3.27e-03

Pos logit contributions
 hill+0.043
+0.043
akes+0.042
 slope+0.042
ursion+0.041
Neg logit contributions
 anyone-0.042
 sure-0.041
 up-0.039
 closely-0.036
 worse-0.036
Token to
Token ID284
Feature activation+4.14e-03

Pos logit contributions
 anyone+0.006
 sure+0.005
ust+0.005
 ok+0.005
 okay+0.005
Neg logit contributions
 hill-0.005
 slope-0.005
 slopes-0.005
 street-0.005
thank-0.005
Token show
Token ID905
Feature activation+3.31e-02

Pos logit contributions
ust+0.008
iqu+0.007
 awful+0.007
ight+0.007
 ok+0.006
Neg logit contributions
 hill-0.006
 recess-0.006
 slopes-0.005
 street-0.005
-0.005
Token Sam
Token ID3409
Feature activation+9.92e-02

Pos logit contributions
ust+0.066
iqu+0.060
umes+0.057
ight+0.055
 actually+0.054
Neg logit contributions
 recess-0.041
 care-0.034
 days-0.031
 Now-0.031
,-0.031
Token.
Token ID13
Feature activation+2.97e-03

Pos logit contributions
 anyone+0.187
 ok+0.178
 awful+0.178
 unfair+0.175
ust+0.175
Neg logit contributions
 hill-0.113
forest-0.101
 slopes-0.096
,-0.093
ro-0.093
Token
Token ID198
Feature activation+1.46e-02

Pos logit contributions
 anyone+0.005
 sure+0.005
 up+0.005
 worse+0.005
 closely+0.004
Neg logit contributions
 hill-0.004
-0.004
 slope-0.004
 slopes-0.004
field-0.004
Token
Token ID198
Feature activation+6.39e-04

Pos logit contributions
 anyone+0.026
 sure+0.025
 up+0.023
 worse+0.023
 okay+0.022
Neg logit contributions
 hill-0.021
 slope-0.020
 slopes-0.020
-0.020
field-0.020
TokenSam
Token ID16305
Feature activation+1.43e-02

Pos logit contributions
 anyone+0.001
 sure+0.001
 closely+0.001
 up+0.001
 not+0.001
Neg logit contributions
field-0.001
-0.001
 hill-0.001
akes-0.001
 slope-0.001
Token was
Token ID373
Feature activation+1.87e-02

Pos logit contributions
 anyone+0.026
 sure+0.025
 up+0.023
 closely+0.022
 worse+0.022
Neg logit contributions
 hill-0.021
 slope-0.021
-0.020
 slopes-0.020
field-0.020
Token
Token ID198
Feature activation+9.45e-05

Pos logit contributions
 anyone+0.009
 sure+0.009
 up+0.008
 worse+0.008
 okay+0.008
Neg logit contributions
 hill-0.008
 slope-0.008
-0.008
 slopes-0.007
field-0.007
TokenThen
Token ID6423
Feature activation-2.84e-02

Pos logit contributions
 anyone+0.000
 sure+0.000
 up+0.000
 closely+0.000
 worse+0.000
Neg logit contributions
 hill-0.000
 slope-0.000
field-0.000
 slopes-0.000
-0.000
Token,
Token ID11
Feature activation-3.83e-03

Pos logit contributions
 hill+0.036
 slope+0.034
field+0.033
 slopes+0.033
+0.033
Neg logit contributions
 anyone-0.059
 sure-0.056
 up-0.053
 okay-0.051
 worse-0.051
Token Jack
Token ID3619
Feature activation+2.13e-02

Pos logit contributions
 hill+0.007
 slope+0.007
 slopes+0.007
+0.007
field+0.007
Neg logit contributions
 anyone-0.005
 sure-0.004
ust-0.004
 closely-0.004
 up-0.004
Token showed
Token ID3751
Feature activation+4.30e-02

Pos logit contributions
 anyone+0.036
 sure+0.032
 up+0.031
ust+0.030
 worse+0.030
Neg logit contributions
 hill-0.033
field-0.031
 slope-0.031
forest-0.031
-0.030
Token Sam
Token ID3409
Feature activation+9.86e-02

Pos logit contributions
ust+0.092
 okay+0.087
umes+0.085
 ok+0.083
 awful+0.082
Neg logit contributions
 recess-0.041
 hill-0.037
loo-0.035
 course-0.034
 slopes-0.034
Token how
Token ID703
Feature activation+1.16e-02

Pos logit contributions
 anyone+0.148
ust+0.144
amus+0.135
ares+0.134
iqu+0.133
Neg logit contributions
 hill-0.142
,-0.134
 distance-0.133
 slopes-0.126
thank-0.126
Token to
Token ID284
Feature activation-2.44e-02

Pos logit contributions
 anyone+0.018
 sure+0.017
 up+0.016
 closely+0.016
ust+0.015
Neg logit contributions
 hill-0.019
 slopes-0.018
 slope-0.018
thank-0.018
 evenings-0.018
Token look
Token ID804
Feature activation+4.13e-02

Pos logit contributions
 slope+0.040
 hill+0.040
field+0.040
+0.039
 slopes+0.039
Neg logit contributions
 anyone-0.040
 sure-0.038
 up-0.036
 closely-0.034
 okay-0.033
Token carefully
Token ID7773
Feature activation+4.28e-03

Pos logit contributions
 anyone+0.061
ust+0.053
amus+0.053
 sure+0.052
iqu+0.051
Neg logit contributions
 hill-0.067
-0.064
 slopes-0.062
stairs-0.061
 slope-0.061
Token before
Token ID878
Feature activation-8.72e-03

Pos logit contributions
 anyone+0.007
 sure+0.006
ust+0.005
 up+0.005
 okay+0.005
Neg logit contributions
 hill-0.007
 slope-0.007
 slopes-0.007
-0.007
field-0.007
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
 anyone+0.026
 sure+0.025
 up+0.022
 closely+0.022
 okay+0.021
Neg logit contributions
 hill-0.030
 slope-0.028
 slopes-0.028
-0.028
thank-0.028
Token He
Token ID679
Feature activation+8.06e-03

Pos logit contributions
 hill+0.028
 slope+0.028
thank+0.027
field+0.027
robat+0.027
Neg logit contributions
 sure-0.021
 anyone-0.021
 up-0.019
 closely-0.018
 okay-0.017
Token offered
Token ID4438
Feature activation+3.70e-02

Pos logit contributions
 anyone+0.016
 up+0.014
 sure+0.014
 anybody+0.014
 okay+0.014
Neg logit contributions
 hill-0.010
thank-0.009
 hallway-0.009
 slope-0.009
ursion-0.008
Token to
Token ID284
Feature activation+3.12e-03

Pos logit contributions
ust+0.067
iqu+0.062
 awful+0.058
ight+0.058
ares+0.057
Neg logit contributions
 hill-0.048
 distance-0.045
 slopes-0.044
 evenings-0.043
 season-0.043
Token lend
Token ID22096
Feature activation+3.65e-02

Pos logit contributions
 anyone+0.005
 sure+0.005
 up+0.005
 worse+0.004
 okay+0.004
Neg logit contributions
 hill-0.005
 slope-0.005
 slopes-0.005
field-0.005
-0.005
Token Mia
Token ID32189
Feature activation+9.85e-02

Pos logit contributions
ust+0.071
iqu+0.060
 awful+0.056
 worse+0.055
 unfair+0.055
Neg logit contributions
 Now-0.046
 care-0.045
 hill-0.045
 recess-0.044
 slopes-0.044
Token his
Token ID465
Feature activation+4.99e-02

Pos logit contributions
iqu+0.132
 anyone+0.128
ust+0.122
ares+0.122
 awful+0.118
Neg logit contributions
 hill-0.161
ite-0.146
 distance-0.144
 street-0.139
forest-0.139
Token car
Token ID1097
Feature activation+3.34e-02

Pos logit contributions
 anyone+0.082
 sure+0.078
ust+0.073
 closely+0.071
 worse+0.069
Neg logit contributions
 hill-0.084
 slope-0.076
 street-0.074
-0.074
 hallway-0.074
Token so
Token ID523
Feature activation-2.04e-03

Pos logit contributions
 anyone+0.051
 sure+0.050
 closely+0.045
ust+0.044
 awful+0.043
Neg logit contributions
 hill-0.056
thank-0.052
 slope-0.052
-0.052
 slopes-0.051
Token she
Token ID673
Feature activation+1.36e-02

Pos logit contributions
 hill+0.003
 slopes+0.003
+0.003
 slope+0.003
akes+0.003
Neg logit contributions
 anyone-0.003
 sure-0.003
 up-0.003
 closely-0.003
 ok-0.003
Token could
Token ID714
Feature activation-1.20e-02

Pos logit contributions
 anyone+0.025
 sure+0.022
 closely+0.021
ust+0.021
 awful+0.021
Neg logit contributions
 hill-0.019
thank-0.018
-0.018
-0.017
field-0.017
Token.
Token ID13
Feature activation-1.32e-02

Pos logit contributions
+0.008
 slope+0.007
field+0.007
 hill+0.007
 hallway+0.007
Neg logit contributions
 anyone-0.005
 sure-0.005
 up-0.005
 about-0.004
 closely-0.004
Token Then
Token ID3244
Feature activation-3.59e-02

Pos logit contributions
 hill+0.023
 slope+0.022
 slopes+0.022
+0.022
field+0.022
Neg logit contributions
 anyone-0.021
 sure-0.020
 up-0.018
 worse-0.017
 closely-0.017
Token,
Token ID11
Feature activation+9.84e-04

Pos logit contributions
field+0.042
 hill+0.041
 slope+0.041
thank+0.040
forest+0.040
Neg logit contributions
 anyone-0.080
 sure-0.075
 up-0.072
 okay-0.071
 worse-0.068
Token Tim
Token ID5045
Feature activation+1.71e-02

Pos logit contributions
 sure+0.002
ust+0.002
 anyone+0.002
 awful+0.002
iqu+0.002
Neg logit contributions
 hill-0.001
 slope-0.001
 slopes-0.001
 street-0.001
 recess-0.001
Token showed
Token ID3751
Feature activation+4.69e-02

Pos logit contributions
 anyone+0.027
 sure+0.025
 up+0.023
 worse+0.022
 closely+0.022
Neg logit contributions
 hill-0.029
 slope-0.028
 slopes-0.028
-0.028
field-0.028
Token Sam
Token ID3409
Feature activation+9.78e-02

Pos logit contributions
ust+0.092
 awful+0.086
 ok+0.086
 okay+0.086
umes+0.085
Neg logit contributions
 hill-0.049
 recess-0.049
 slopes-0.046
loo-0.044
robat-0.042
Token his
Token ID465
Feature activation+5.07e-02

Pos logit contributions
 anyone+0.168
ust+0.168
iqu+0.167
 ok+0.162
 awful+0.159
Neg logit contributions
 hill-0.126
,-0.121
 sandbox-0.106
 slopes-0.104
 afternoon-0.102
Token new
Token ID649
Feature activation+1.51e-02

Pos logit contributions
 anyone+0.080
 sure+0.075
 worse+0.067
ust+0.066
 up+0.066
Neg logit contributions
 hill-0.090
 slope-0.084
 slopes-0.083
 hallway-0.082
 street-0.080
Token seat
Token ID5852
Feature activation-2.47e-03

Pos logit contributions
 anyone+0.029
 sure+0.028
 up+0.028
 closely+0.025
 okay+0.025
Neg logit contributions
-0.020
 hill-0.019
field-0.019
 slope-0.019
akes-0.019
Token.
Token ID13
Feature activation-7.71e-03

Pos logit contributions
 hill+0.004
 slope+0.004
+0.004
 slopes+0.004
field+0.004
Neg logit contributions
 anyone-0.004
 sure-0.004
 up-0.003
 worse-0.003
 closely-0.003
Token "
Token ID366
Feature activation-3.74e-03

Pos logit contributions
 hill+0.014
 slope+0.013
field+0.013
 slopes+0.013
+0.013
Neg logit contributions
 anyone-0.011
 sure-0.011
 up-0.010
 closely-0.009
 okay-0.009
Token opened
Token ID4721
Feature activation+1.48e-02

Pos logit contributions
 hill+0.002
 slope+0.002
field+0.002
+0.002
 slopes+0.002
Neg logit contributions
 anyone-0.003
 sure-0.003
 worse-0.003
 up-0.003
 okay-0.003
Token the
Token ID262
Feature activation+2.35e-03

Pos logit contributions
 anyone+0.024
ust+0.024
 okay+0.023
 worse+0.022
 sure+0.022
Neg logit contributions
 hill-0.023
 slopes-0.022
field-0.021
 slope-0.021
-0.021
Token bag
Token ID6131
Feature activation-1.16e-02

Pos logit contributions
 anyone+0.006
 sure+0.005
 worse+0.005
 okay+0.005
ust+0.005
Neg logit contributions
 hill-0.002
 slope-0.002
 hallway-0.002
-0.002
field-0.002
Token and
Token ID290
Feature activation-2.40e-02

Pos logit contributions
 slope+0.020
 hill+0.020
akes+0.020
 slopes+0.020
robat+0.020
Neg logit contributions
 up-0.017
 sure-0.017
 anyone-0.017
 closely-0.016
 about-0.015
Token showed
Token ID3751
Feature activation+4.34e-02

Pos logit contributions
 hill+0.037
 slope+0.035
field+0.035
 slopes+0.035
+0.034
Neg logit contributions
 anyone-0.042
 sure-0.038
 worse-0.036
 up-0.035
 okay-0.035
Token Emily
Token ID17608
Feature activation+9.77e-02

Pos logit contributions
ust+0.083
 worse+0.076
 okay+0.076
 actually+0.074
 anyone+0.072
Neg logit contributions
 Now-0.051
 evenings-0.049
 recess-0.047
 days-0.046
 sorts-0.046
Token the
Token ID262
Feature activation+2.67e-02

Pos logit contributions
 anyone+0.134
 worse+0.127
 unfair+0.126
 awful+0.124
ust+0.122
Neg logit contributions
 hill-0.159
field-0.153
 hallway-0.150
 slopes-0.149
forest-0.147
Token big
Token ID1263
Feature activation-1.57e-02

Pos logit contributions
 anyone+0.048
 sure+0.043
 worse+0.043
ust+0.042
ares+0.041
Neg logit contributions
 hill-0.043
 hallway-0.038
 slope-0.038
 street-0.038
 evenings-0.036
Token box
Token ID3091
Feature activation+2.90e-02

Pos logit contributions
+0.022
 slopes+0.022
field+0.022
akes+0.022
 hill+0.022
Neg logit contributions
 anyone-0.028
 sure-0.027
 up-0.026
 closely-0.025
 okay-0.024
Token inside
Token ID2641
Feature activation-1.34e-02

Pos logit contributions
 anyone+0.050
 sure+0.046
 worse+0.042
 okay+0.042
 ok+0.041
Neg logit contributions
 hill-0.045
 slope-0.042
 hallway-0.042
field-0.042
-0.042
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
 hill+0.024
 slope+0.023
 slopes+0.023
+0.022
field+0.022
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.018
 worse-0.017
 closely-0.017
Token and
Token ID290
Feature activation-8.58e-03

Pos logit contributions
 slope+0.009
 hill+0.009
akes+0.009
robat+0.009
rob+0.009
Neg logit contributions
 anyone-0.007
 sure-0.007
 up-0.007
 about-0.006
 closely-0.006
Token asked
Token ID1965
Feature activation+2.51e-02

Pos logit contributions
 hill+0.013
 slope+0.013
field+0.013
 slopes+0.013
 hallway+0.012
Neg logit contributions
 anyone-0.015
 sure-0.014
 up-0.013
 closely-0.012
 okay-0.012
Token him
Token ID683
Feature activation+5.58e-02

Pos logit contributions
ust+0.033
 sure+0.032
 anyone+0.032
amus+0.031
 awful+0.031
Neg logit contributions
 hill-0.041
 evenings-0.040
 slopes-0.040
use-0.039
 recess-0.039
Token to
Token ID284
Feature activation-1.26e-02

Pos logit contributions
 awful+0.090
ust+0.086
 anyone+0.083
amus+0.082
iqu+0.082
Neg logit contributions
 hill-0.075
,-0.074
 seek-0.072
use-0.071
forest-0.066
Token give
Token ID1577
Feature activation+3.91e-02

Pos logit contributions
 hill+0.020
 slope+0.020
 slopes+0.019
+0.019
field+0.019
Neg logit contributions
 anyone-0.021
 sure-0.020
 up-0.019
 worse-0.018
 okay-0.018
Token Sam
Token ID3409
Feature activation+9.76e-02

Pos logit contributions
ust+0.070
iqu+0.061
amus+0.058
 awful+0.058
ight+0.057
Neg logit contributions
 recess-0.057
 evenings-0.054
 morning-0.054
 afternoon-0.052
 days-0.051
Token his
Token ID465
Feature activation+4.48e-02

Pos logit contributions
 anyone+0.102
 sure+0.096
 closely+0.095
iqu+0.092
 clearly+0.090
Neg logit contributions
 hill-0.189
 evenings-0.180
forest-0.180
 afternoon-0.180
akes-0.179
Token bear
Token ID6842
Feature activation+5.89e-02

Pos logit contributions
 anyone+0.075
 sure+0.072
ust+0.065
 closely+0.065
 up+0.064
Neg logit contributions
 hill-0.072
 slope-0.066
-0.065
 hallway-0.065
 slopes-0.065
Token back
Token ID736
Feature activation+1.74e-02

Pos logit contributions
 anyone+0.088
 sure+0.086
 closely+0.080
 clearly+0.078
iqu+0.077
Neg logit contributions
 hill-0.097
thank-0.090
robat-0.088
forest-0.088
 distance-0.087
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
 anyone+0.025
 sure+0.024
 up+0.022
 closely+0.021
 worse+0.021
Neg logit contributions
 hill-0.032
 slope-0.031
-0.031
 slopes-0.031
field-0.031
Token She
Token ID1375
Feature activation-5.84e-03

Pos logit contributions
 hill+0.002
 slope+0.002
+0.002
 slopes+0.002
field+0.002
Neg logit contributions
 anyone-0.002
 sure-0.002
 up-0.002
 worse-0.002
 closely-0.002
Token;
Token ID26
Feature activation+1.45e-02

Pos logit contributions
 hill+0.024
 slope+0.024
+0.023
 slopes+0.023
field+0.023
Neg logit contributions
 anyone-0.021
 sure-0.019
 up-0.018
 closely-0.017
 worse-0.017
Token he
Token ID339
Feature activation+7.36e-03

Pos logit contributions
 anyone+0.028
 sure+0.026
 worse+0.026
 awful+0.026
ust+0.025
Neg logit contributions
 hill-0.020
 slope-0.018
 slopes-0.018
-0.018
field-0.017
Token wanted
Token ID2227
Feature activation+1.55e-02

Pos logit contributions
 anyone+0.013
 sure+0.012
 up+0.011
 worse+0.011
 okay+0.011
Neg logit contributions
 hill-0.012
 slope-0.011
field-0.011
 slopes-0.011
-0.011
Token to
Token ID284
Feature activation-2.20e-02

Pos logit contributions
ust+0.028
 awful+0.027
 sure+0.026
 anyone+0.026
 okay+0.026
Neg logit contributions
 hill-0.020
 slopes-0.020
 slope-0.019
 street-0.018
 recess-0.018
Token show
Token ID905
Feature activation+2.88e-02

Pos logit contributions
field+0.035
 hill+0.035
 slope+0.035
+0.035
 slopes+0.034
Neg logit contributions
 anyone-0.037
 sure-0.036
 up-0.033
 okay-0.031
 closely-0.030
Token Sam
Token ID3409
Feature activation+9.76e-02

Pos logit contributions
ust+0.046
 unfair+0.044
 worse+0.044
 awful+0.043
iqu+0.042
Neg logit contributions
 slopes-0.041
 hill-0.038
 slope-0.036
 evenings-0.036
 recess-0.035
Token his
Token ID465
Feature activation+4.08e-02

Pos logit contributions
 anyone+0.131
iqu+0.131
 awful+0.128
 worse+0.124
ust+0.121
Neg logit contributions
 hill-0.157
 slopes-0.138
 afternoon-0.136
forest-0.136
 seek-0.136
Token famous
Token ID5863
Feature activation+4.77e-03

Pos logit contributions
 anyone+0.065
 sure+0.061
 worse+0.057
ust+0.057
 closely+0.055
Neg logit contributions
 hill-0.071
 slope-0.065
 street-0.063
 slopes-0.063
 hallway-0.062
Token toy
Token ID13373
Feature activation+2.67e-02

Pos logit contributions
 anyone+0.008
 sure+0.008
 up+0.007
 okay+0.007
 closely+0.007
Neg logit contributions
 hill-0.007
 slope-0.007
field-0.007
-0.007
 slopes-0.007
Token too
Token ID1165
Feature activation-2.14e-02

Pos logit contributions
 anyone+0.044
 sure+0.042
 up+0.038
 okay+0.038
 closely+0.037
Neg logit contributions
 hill-0.044
 slope-0.041
 slopes-0.041
-0.040
field-0.040
Token.
Token ID13
Feature activation-5.20e-03

Pos logit contributions
 hill+0.031
 slope+0.030
 slopes+0.029
+0.029
field+0.029
Neg logit contributions
 anyone-0.039
 sure-0.037
 up-0.035
 worse-0.033
 closely-0.033
Token man
Token ID582
Feature activation+4.62e-03

Pos logit contributions
 anyone+0.018
 sure+0.017
ust+0.016
 up+0.016
 worse+0.015
Neg logit contributions
 hill-0.020
 slope-0.019
 hallway-0.018
 street-0.018
field-0.018
Token said
Token ID531
Feature activation+1.69e-02

Pos logit contributions
 anyone+0.009
 sure+0.008
 worse+0.008
 up+0.008
ust+0.008
Neg logit contributions
 hill-0.006
field-0.005
thank-0.005
-0.005
 slope-0.005
Token yes
Token ID3763
Feature activation-1.73e-02

Pos logit contributions
 anyone+0.026
 sure+0.024
 up+0.024
ust+0.024
 closely+0.023
Neg logit contributions
 hill-0.026
-0.026
 slope-0.025
thank-0.025
 slopes-0.025
Token and
Token ID290
Feature activation-2.67e-02

Pos logit contributions
 hill+0.022
 slope+0.021
 slopes+0.020
+0.020
field+0.020
Neg logit contributions
 anyone-0.035
 sure-0.034
 up-0.032
 worse-0.031
 closely-0.031
Token showed
Token ID3751
Feature activation+4.35e-02

Pos logit contributions
 hill+0.044
 slope+0.043
field+0.042
 slopes+0.042
+0.042
Neg logit contributions
 anyone-0.043
 sure-0.040
 up-0.037
 worse-0.036
 closely-0.036
Token Sam
Token ID3409
Feature activation+9.70e-02

Pos logit contributions
ust+0.061
 okay+0.059
 worse+0.058
 anyone+0.057
 awful+0.056
Neg logit contributions
 hill-0.064
 evenings-0.063
 slopes-0.063
 Now-0.063
thank-0.063
Token how
Token ID703
Feature activation+1.70e-02

Pos logit contributions
 anyone+0.135
 awful+0.129
 unfair+0.129
ust+0.126
 worse+0.124
Neg logit contributions
 hill-0.159
 slopes-0.147
 afternoon-0.139
 slope-0.139
field-0.138
Token to
Token ID284
Feature activation-1.69e-02

Pos logit contributions
 anyone+0.028
 sure+0.026
 closely+0.025
 up+0.024
 worse+0.024
Neg logit contributions
 hill-0.025
 slopes-0.024
 evenings-0.023
thank-0.023
 slope-0.023
Token use
Token ID779
Feature activation+1.18e-02

Pos logit contributions
 hill+0.025
 slope+0.025
field+0.025
 slopes+0.025
+0.024
Neg logit contributions
 anyone-0.030
 sure-0.029
 up-0.027
 closely-0.026
 worse-0.025
Token it
Token ID340
Feature activation+3.68e-02

Pos logit contributions
 anyone+0.014
 sure+0.013
 up+0.012
 okay+0.012
 worse+0.011
Neg logit contributions
 hill-0.024
 slope-0.023
 slopes-0.023
-0.023
field-0.022
Token.
Token ID13
Feature activation-3.30e-03

Pos logit contributions
 anyone+0.057
 sure+0.054
 up+0.049
 worse+0.048
 okay+0.047
Neg logit contributions
 hill-0.062
 slope-0.060
 slopes-0.059
-0.058
field-0.058
Token Suddenly
Token ID24975
Feature activation-2.93e-02

Pos logit contributions
 hill+0.009
 slope+0.009
 slopes+0.008
field+0.008
robat+0.008
Neg logit contributions
 anyone-0.010
 sure-0.009
 up-0.009
 closely-0.008
 okay-0.008
Token,
Token ID11
Feature activation+9.05e-04

Pos logit contributions
 hill+0.025
 slope+0.023
 slopes+0.023
+0.023
field+0.022
Neg logit contributions
 anyone-0.071
 sure-0.069
 up-0.066
 worse-0.063
 closely-0.063
Token the
Token ID262
Feature activation+5.05e-03

Pos logit contributions
 anyone+0.002
 sure+0.001
ust+0.001
 worse+0.001
 closely+0.001
Neg logit contributions
 hill-0.001
 slopes-0.001
-0.001
 slope-0.001
field-0.001
Token bee
Token ID20697
Feature activation+1.05e-02

Pos logit contributions
 anyone+0.009
 sure+0.009
 up+0.008
 closely+0.008
 okay+0.008
Neg logit contributions
 hill-0.008
 slope-0.007
field-0.007
 slopes-0.007
 hallway-0.007
Token gave
Token ID2921
Feature activation+2.93e-02

Pos logit contributions
 anyone+0.020
 sure+0.019
 up+0.018
 okay+0.018
 worse+0.017
Neg logit contributions
 hill-0.014
 slope-0.013
field-0.013
 slopes-0.013
-0.013
Token Sandy
Token ID17581
Feature activation+9.70e-02

Pos logit contributions
 unfair+0.048
ust+0.048
 worse+0.046
 awful+0.046
 anyone+0.045
Neg logit contributions
cy-0.036
 distance-0.035
 morning-0.035
 afternoon-0.035
 breeze-0.035
Token a
Token ID257
Feature activation+2.62e-02

Pos logit contributions
 anyone+0.121
 unfair+0.116
iqu+0.114
 worse+0.109
ust+0.108
Neg logit contributions
 hill-0.166
 slopes-0.164
thank-0.161
ugg-0.157
ite-0.157
Token shock
Token ID6380
Feature activation-4.30e-03

Pos logit contributions
 anyone+0.042
 sure+0.038
 up+0.035
 closely+0.034
 worse+0.034
Neg logit contributions
 hill-0.045
 slope-0.043
-0.042
field-0.042
 slopes-0.041
Token!
Token ID0
Feature activation-1.32e-02

Pos logit contributions
 hill+0.008
+0.008
 slope+0.008
 slopes+0.008
field+0.008
Neg logit contributions
 anyone-0.006
 sure-0.006
 up-0.005
 worse-0.005
 closely-0.005
Token It
Token ID632
Feature activation+1.67e-02

Pos logit contributions
 hill+0.023
 slope+0.023
field+0.022
 slopes+0.022
+0.022
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.018
 closely-0.017
 okay-0.017
Token st
Token ID336
Feature activation-3.46e-02

Pos logit contributions
 anyone+0.030
 sure+0.028
ust+0.026
 okay+0.026
 up+0.026
Neg logit contributions
 hill-0.023
 slope-0.022
field-0.022
 slopes-0.021
 hallway-0.021
Token the
Token ID262
Feature activation-2.03e-02

Pos logit contributions
 hill+0.052
forest+0.052
akes+0.051
+0.051
 slope+0.051
Neg logit contributions
 anyone-0.026
 sure-0.024
 up-0.023
 closely-0.020
 worse-0.019
Token garden
Token ID11376
Feature activation-2.47e-02

Pos logit contributions
thank+0.027
+0.026
 slope+0.026
 slopes+0.025
field+0.025
Neg logit contributions
 anyone-0.038
 up-0.037
 sure-0.037
 worse-0.035
 closely-0.034
Token and
Token ID290
Feature activation-1.68e-02

Pos logit contributions
rob+0.041
robat+0.041
ursion+0.040
akes+0.040
 slope+0.040
Neg logit contributions
 anyone-0.038
 sure-0.038
 up-0.038
 closely-0.035
 okay-0.033
Token then
Token ID788
Feature activation-1.57e-02

Pos logit contributions
 hill+0.027
 slope+0.026
 slopes+0.026
field+0.026
+0.025
Neg logit contributions
 anyone-0.028
 sure-0.026
 up-0.024
 okay-0.023
 closely-0.023
Token gave
Token ID2921
Feature activation+3.33e-02

Pos logit contributions
 hill+0.025
 slope+0.024
 slopes+0.023
field+0.023
+0.023
Neg logit contributions
 anyone-0.026
 sure-0.025
 up-0.023
 closely-0.022
 worse-0.022
Token Jenny
Token ID28437
Feature activation+9.69e-02

Pos logit contributions
ust+0.054
 unfair+0.050
iqu+0.049
amus+0.047
 anyone+0.047
Neg logit contributions
 evenings-0.045
thank-0.043
 Now-0.043
 distance-0.043
 morning-0.043
Token a
Token ID257
Feature activation+1.80e-02

Pos logit contributions
 anyone+0.113
 unfair+0.096
iqu+0.094
 worse+0.094
ares+0.092
Neg logit contributions
 hill-0.185
 slopes-0.180
 distance-0.177
 evenings-0.177
thank-0.175
Token big
Token ID1263
Feature activation-8.35e-03

Pos logit contributions
 anyone+0.026
 sure+0.025
 up+0.023
 worse+0.022
 closely+0.022
Neg logit contributions
 hill-0.033
 slope-0.032
 slopes-0.031
-0.031
field-0.031
Token hug
Token ID16225
Feature activation-1.48e-02

Pos logit contributions
field+0.014
rob+0.014
forest+0.014
 slopes+0.013
akes+0.013
Neg logit contributions
 sure-0.013
 up-0.013
 okay-0.012
 anyone-0.012
 ok-0.012
Token.
Token ID13
Feature activation-7.98e-03

Pos logit contributions
 hill+0.026
 slope+0.025
 slopes+0.025
+0.025
field+0.025
Neg logit contributions
 anyone-0.023
 sure-0.021
 up-0.020
 worse-0.019
 closely-0.019
Token
Token ID198
Feature activation+4.47e-03

Pos logit contributions
 hill+0.013
 slope+0.013
 slopes+0.012
+0.012
field+0.012
Neg logit contributions
 anyone-0.013
 sure-0.013
 up-0.012
 worse-0.011
 closely-0.011
Token.
Token ID13
Feature activation+7.76e-04

Pos logit contributions
 anyone+0.012
 sure+0.011
 up+0.010
 closely+0.010
 worse+0.010
Neg logit contributions
 hill-0.014
 slope-0.013
 slopes-0.013
-0.013
field-0.013
Token She
Token ID1375
Feature activation+1.73e-03

Pos logit contributions
 anyone+0.001
 sure+0.001
 up+0.001
 worse+0.001
 okay+0.001
Neg logit contributions
 hill-0.001
 slope-0.001
 slopes-0.001
-0.001
field-0.001
Token wants
Token ID3382
Feature activation+2.44e-02

Pos logit contributions
 anyone+0.003
 sure+0.003
 worse+0.003
 okay+0.003
 up+0.003
Neg logit contributions
 hill-0.002
 slope-0.002
 slopes-0.002
field-0.002
 hallway-0.002
Token to
Token ID284
Feature activation-1.16e-02

Pos logit contributions
ust+0.054
ares+0.050
iqu+0.050
ight+0.050
 awful+0.049
Neg logit contributions
 hill-0.023
 season-0.023
 slopes-0.022
 distance-0.022
 hills-0.021
Token show
Token ID905
Feature activation+3.24e-02

Pos logit contributions
 hill+0.017
 slope+0.017
 slopes+0.016
+0.016
field+0.016
Neg logit contributions
 anyone-0.021
 sure-0.020
 up-0.019
 closely-0.018
 okay-0.018
Token Luna
Token ID23694
Feature activation+9.69e-02

Pos logit contributions
ust+0.063
iqu+0.056
 awful+0.052
ight+0.052
 unfair+0.051
Neg logit contributions
 recess-0.039
 hill-0.036
 slopes-0.036
 care-0.035
 evenings-0.034
Token the
Token ID262
Feature activation+1.50e-02

Pos logit contributions
iqu+0.158
 awful+0.157
ust+0.150
 worse+0.149
 unfair+0.149
Neg logit contributions
 hill-0.139
,-0.124
 season-0.117
 slopes-0.116
forest-0.116
Token birds
Token ID10087
Feature activation+3.64e-02

Pos logit contributions
 anyone+0.026
 sure+0.025
 up+0.023
 worse+0.022
 closely+0.022
Neg logit contributions
 hill-0.024
 slope-0.023
 slopes-0.022
field-0.022
 street-0.022
Token.
Token ID13
Feature activation+1.34e-03

Pos logit contributions
 anyone+0.066
 sure+0.063
 awful+0.058
 worse+0.058
ust+0.057
Neg logit contributions
 hill-0.051
forest-0.046
field-0.045
 slope-0.044
 slopes-0.043
Token She
Token ID1375
Feature activation+6.29e-05

Pos logit contributions
 anyone+0.002
 sure+0.002
 up+0.002
 worse+0.002
 okay+0.002
Neg logit contributions
 hill-0.002
 slope-0.002
 slopes-0.002
-0.002
field-0.002
Token picks
Token ID11103
Feature activation+5.32e-02

Pos logit contributions
 anyone+0.000
 okay+0.000
 sure+0.000
 worse+0.000
 ok+0.000
Neg logit contributions
 hill-0.000
 slope-0.000
 slopes-0.000
 hallway-0.000
field-0.000
TokenThey
Token ID2990
Feature activation-1.30e-02

Pos logit contributions
 anyone+0.030
 sure+0.029
 up+0.027
 worse+0.026
 okay+0.025
Neg logit contributions
 hill-0.030
 slope-0.029
-0.028
 slopes-0.028
field-0.028
Token all
Token ID477
Feature activation-2.67e-02

Pos logit contributions
 hill+0.022
 slope+0.022
 slopes+0.021
+0.021
field+0.021
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.018
 closely-0.017
 worse-0.017
Token cheered
Token ID34767
Feature activation+7.23e-03

Pos logit contributions
+0.043
 hill+0.042
 slope+0.041
field+0.041
akes+0.040
Neg logit contributions
 sure-0.045
 anyone-0.044
 up-0.042
 closely-0.039
 worse-0.038
Token and
Token ID290
Feature activation-3.94e-02

Pos logit contributions
 anyone+0.014
 sure+0.012
 worse+0.012
ust+0.012
 awful+0.012
Neg logit contributions
 hill-0.009
thank-0.008
 slopes-0.008
 evenings-0.008
 slope-0.008
Token gave
Token ID2921
Feature activation+2.37e-02

Pos logit contributions
 hill+0.068
akes+0.068
 slope+0.066
loo+0.066
forest+0.065
Neg logit contributions
 sure-0.065
 anyone-0.061
 up-0.060
 okay-0.053
 closely-0.053
Token Janet
Token ID28111
Feature activation+9.66e-02

Pos logit contributions
ust+0.034
 anyone+0.033
 awful+0.031
 worse+0.031
ares+0.031
Neg logit contributions
 evenings-0.038
thank-0.037
 hill-0.036
 afternoon-0.036
 slopes-0.035
Token a
Token ID257
Feature activation+1.41e-02

Pos logit contributions
 anyone+0.111
 worse+0.093
 sure+0.093
 okay+0.089
 awful+0.088
Neg logit contributions
 hill-0.194
 slopes-0.185
 evenings-0.184
ite-0.183
thank-0.182
Token big
Token ID1263
Feature activation-1.22e-02

Pos logit contributions
 anyone+0.020
 sure+0.019
 up+0.018
 worse+0.017
 okay+0.017
Neg logit contributions
 hill-0.026
 slope-0.025
 slopes-0.025
-0.024
field-0.024
Token hug
Token ID16225
Feature activation-2.05e-02

Pos logit contributions
field+0.019
forest+0.019
 slopes+0.019
rob+0.019
akes+0.019
Neg logit contributions
 sure-0.020
 up-0.020
 anyone-0.019
 okay-0.018
 ok-0.018
Token.
Token ID13
Feature activation-6.05e-03

Pos logit contributions
 hill+0.036
 slope+0.035
 slopes+0.034
+0.034
field+0.034
Neg logit contributions
 anyone-0.031
 sure-0.030
 up-0.028
 okay-0.026
 worse-0.026
Token From
Token ID3574
Feature activation-1.09e-02

Pos logit contributions
 hill+0.010
 slope+0.009
 slopes+0.009
field+0.009
+0.009
Neg logit contributions
 anyone-0.010
 sure-0.010
 up-0.009
 closely-0.009
 okay-0.009
TokenAnna
Token ID31160
Feature activation+1.81e-03

Pos logit contributions
 anyone+0.012
 sure+0.011
 closely+0.010
 up+0.010
 okay+0.010
Neg logit contributions
 hill-0.012
field-0.011
 slope-0.011
-0.011
akes-0.011
Token was
Token ID373
Feature activation+1.61e-02

Pos logit contributions
 anyone+0.003
 sure+0.003
 up+0.003
 closely+0.003
 worse+0.003
Neg logit contributions
 hill-0.003
 slope-0.003
 slopes-0.003
-0.003
field-0.003
Token happy
Token ID3772
Feature activation-2.07e-03

Pos logit contributions
 anyone+0.024
ust+0.020
 anybody+0.019
iqu+0.019
ares+0.018
Neg logit contributions
thank-0.027
 distance-0.026
 street-0.025
 hallway-0.025
 evenings-0.025
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
 slope+0.004
 hill+0.004
akes+0.004
 slopes+0.004
robat+0.004
Neg logit contributions
 anyone-0.003
 sure-0.003
 up-0.003
 closely-0.002
 about-0.002
Token gave
Token ID2921
Feature activation+3.56e-02

Pos logit contributions
 hill+0.027
 slope+0.027
 slopes+0.026
+0.026
field+0.026
Neg logit contributions
 anyone-0.021
 sure-0.020
 up-0.019
 okay-0.017
 closely-0.017
Token Luna
Token ID23694
Feature activation+9.63e-02

Pos logit contributions
ust+0.064
ares+0.057
 awful+0.056
iqu+0.056
 actually+0.055
Neg logit contributions
 morning-0.050
 evenings-0.049
 recess-0.049
 hill-0.047
 afternoon-0.047
Token a
Token ID257
Feature activation+1.72e-02

Pos logit contributions
ust+0.125
 anyone+0.123
ares+0.122
iqu+0.122
 awful+0.118
Neg logit contributions
 hill-0.161
 distance-0.155
-0.152
ossible-0.152
 evenings-0.148
Token hug
Token ID16225
Feature activation-1.55e-02

Pos logit contributions
 anyone+0.029
 sure+0.028
 up+0.026
 okay+0.025
 worse+0.025
Neg logit contributions
 hill-0.027
 slope-0.027
 slopes-0.026
field-0.026
-0.026
Token.
Token ID13
Feature activation+6.36e-03

Pos logit contributions
 hill+0.026
 slope+0.025
+0.025
field+0.025
 slopes+0.025
Neg logit contributions
 anyone-0.025
 sure-0.023
 up-0.021
 closely-0.021
 worse-0.020
Token Luna
Token ID23694
Feature activation+2.50e-02

Pos logit contributions
 anyone+0.012
 sure+0.012
 up+0.011
 worse+0.010
 closely+0.010
Neg logit contributions
 hill-0.009
 slope-0.008
-0.008
 slopes-0.008
field-0.008
Token pur
Token ID1308
Feature activation-2.42e-02

Pos logit contributions
 anyone+0.044
 sure+0.041
 up+0.038
 worse+0.037
 okay+0.037
Neg logit contributions
 hill-0.038
 slope-0.036
field-0.036
 slopes-0.036
-0.036
Token friends
Token ID2460
Feature activation+1.03e-02

Pos logit contributions
 anyone+0.023
 sure+0.021
 up+0.020
 worse+0.019
 okay+0.019
Neg logit contributions
 hill-0.028
 slopes-0.027
field-0.027
 slope-0.027
 evenings-0.027
Token.
Token ID13
Feature activation-3.82e-04

Pos logit contributions
 anyone+0.016
 sure+0.015
 up+0.014
 closely+0.013
 okay+0.013
Neg logit contributions
 hill-0.018
 slope-0.018
 slopes-0.017
-0.017
field-0.017
Token He
Token ID679
Feature activation-6.66e-03

Pos logit contributions
 hill+0.001
 slope+0.001
+0.001
 slopes+0.001
field+0.001
Neg logit contributions
 anyone-0.001
 sure-0.001
 up-0.001
 worse-0.001
 closely-0.001
Token even
Token ID772
Feature activation-1.94e-02

Pos logit contributions
 hill+0.011
 slope+0.011
 slopes+0.010
+0.010
field+0.010
Neg logit contributions
 anyone-0.011
 sure-0.010
 up-0.010
 closely-0.009
 worse-0.009
Token gave
Token ID2921
Feature activation+3.01e-02

Pos logit contributions
 hill+0.029
 slope+0.028
 slopes+0.027
+0.027
field+0.027
Neg logit contributions
 anyone-0.035
 sure-0.033
 up-0.031
 closely-0.030
 worse-0.030
Token Rachel
Token ID15984
Feature activation+9.62e-02

Pos logit contributions
 worse+0.038
ust+0.038
 anyone+0.036
 awful+0.035
 okay+0.034
Neg logit contributions
 evenings-0.049
 slopes-0.048
 hill-0.048
thank-0.048
field-0.047
Token a
Token ID257
Feature activation+1.69e-02

Pos logit contributions
 anyone+0.103
 worse+0.090
iqu+0.083
 awful+0.083
 unfair+0.082
Neg logit contributions
 hill-0.196
 evenings-0.191
field-0.191
 distance-0.190
 slopes-0.189
Token lovely
Token ID14081
Feature activation-2.83e-03

Pos logit contributions
 anyone+0.026
 sure+0.024
 up+0.022
 worse+0.021
 closely+0.021
Neg logit contributions
 hill-0.030
-0.028
 slope-0.028
field-0.028
 hallway-0.028
Token blue
Token ID4171
Feature activation-1.28e-02

Pos logit contributions
 slope+0.004
 hill+0.004
 slopes+0.004
+0.004
field+0.004
Neg logit contributions
 sure-0.005
 anyone-0.005
 up-0.004
 okay-0.004
 closely-0.004
Token feather
Token ID17972
Feature activation+2.79e-02

Pos logit contributions
 slope+0.021
ursion+0.021
 hill+0.021
 slopes+0.021
 hallway+0.021
Neg logit contributions
 anyone-0.019
 sure-0.018
 up-0.018
 just-0.016
 okay-0.016
Token as
Token ID355
Feature activation-2.02e-03

Pos logit contributions
 anyone+0.046
 sure+0.043
 up+0.040
 worse+0.039
 closely+0.038
Neg logit contributions
 hill-0.046
 slope-0.044
field-0.043
-0.043
 slopes-0.043
Token angry
Token ID7954
Feature activation-3.50e-02

Pos logit contributions
 hill+0.023
 slope+0.022
thank+0.022
ursion+0.022
 distance+0.022
Neg logit contributions
 anyone-0.013
 sure-0.010
 up-0.009
 anybody-0.009
amus-0.009
Token,
Token ID11
Feature activation-2.39e-02

Pos logit contributions
 hill+0.045
 slope+0.043
 slopes+0.042
+0.042
field+0.042
Neg logit contributions
 anyone-0.070
 sure-0.067
 up-0.064
 closely-0.061
 worse-0.061
Token but
Token ID475
Feature activation-2.74e-02

Pos logit contributions
 hill+0.041
 slope+0.040
akes+0.040
 slopes+0.040
+0.040
Neg logit contributions
 anyone-0.037
 sure-0.036
 up-0.033
 okay-0.032
 closely-0.032
Token then
Token ID788
Feature activation-1.92e-02

Pos logit contributions
 hill+0.041
field+0.040
 slope+0.039
+0.039
 slopes+0.039
Neg logit contributions
 anyone-0.045
 sure-0.042
 up-0.040
 worse-0.039
ust-0.038
Token gave
Token ID2921
Feature activation+2.63e-02

Pos logit contributions
 hill+0.027
 slopes+0.026
field+0.025
 slope+0.025
+0.025
Neg logit contributions
 anyone-0.032
 sure-0.031
ust-0.030
 up-0.030
 closely-0.029
Token Sandy
Token ID17581
Feature activation+9.62e-02

Pos logit contributions
ust+0.046
 anyone+0.045
 awful+0.045
 worse+0.044
 unfair+0.044
Neg logit contributions
cy-0.030
thank-0.030
 distance-0.029
 afternoon-0.029
 evenings-0.029
Token some
Token ID617
Feature activation+1.29e-02

Pos logit contributions
 anyone+0.126
iqu+0.116
ust+0.113
ares+0.113
 closely+0.112
Neg logit contributions
 hill-0.156
thank-0.155
ugg-0.151
 distance-0.151
 slopes-0.151
Token of
Token ID286
Feature activation-4.05e-03

Pos logit contributions
 anyone+0.020
 sure+0.018
 up+0.017
 closely+0.016
 okay+0.016
Neg logit contributions
 hill-0.022
 slope-0.021
 slopes-0.021
-0.021
thank-0.020
Token their
Token ID511
Feature activation+1.40e-02

Pos logit contributions
 hill+0.007
 evenings+0.006
field+0.006
+0.006
 slopes+0.006
Neg logit contributions
 anyone-0.006
 sure-0.005
ust-0.005
 worse-0.005
 okay-0.005
Token oats
Token ID47009
Feature activation+1.29e-02

Pos logit contributions
 anyone+0.023
 sure+0.021
 up+0.020
 worse+0.019
 closely+0.019
Neg logit contributions
 hill-0.023
 slope-0.022
 slopes-0.022
-0.022
field-0.022
Token.
Token ID13
Feature activation-6.35e-03

Pos logit contributions
 anyone+0.018
 sure+0.017
 up+0.016
 okay+0.015
 worse+0.015
Neg logit contributions
 hill-0.024
 slope-0.023
 slopes-0.023
field-0.023
 hallway-0.023
Tokenily
Token ID813
Feature activation+1.59e-02

Pos logit contributions
 anyone+0.018
 sure+0.016
 up+0.014
 closely+0.012
 okay+0.012
Neg logit contributions
 hill-0.047
 slope-0.046
-0.045
 slopes-0.045
akes-0.045
Token sniff
Token ID26300
Feature activation+1.98e-02

Pos logit contributions
 anyone+0.034
 sure+0.031
ust+0.029
 anybody+0.029
 okay+0.029
Neg logit contributions
 hill-0.020
-0.017
 slope-0.017
 slopes-0.017
-0.017
Tokenles
Token ID829
Feature activation+2.53e-02

Pos logit contributions
 anyone+0.041
 sure+0.040
 ok+0.039
 okay+0.039
ust+0.038
Neg logit contributions
akes-0.020
-0.020
 hill-0.020
ite-0.019
 slopes-0.019
Token and
Token ID290
Feature activation-9.75e-03

Pos logit contributions
 anyone+0.048
ust+0.047
 sure+0.046
iqu+0.046
umes+0.043
Neg logit contributions
 hill-0.031
 breeze-0.028
 slopes-0.026
 distance-0.025
 slope-0.025
Token shows
Token ID2523
Feature activation+4.00e-02

Pos logit contributions
 hill+0.015
 slope+0.014
 slopes+0.014
+0.014
field+0.014
Neg logit contributions
 anyone-0.017
 sure-0.016
 up-0.015
 okay-0.014
ust-0.014
Token Ben
Token ID3932
Feature activation+9.57e-02

Pos logit contributions
ust+0.070
 sure+0.063
 ok+0.061
 actually+0.061
 probably+0.059
Neg logit contributions
 recess-0.056
 evenings-0.049
 breeze-0.048
 morning-0.047
 slopes-0.046
Token her
Token ID607
Feature activation+5.91e-02

Pos logit contributions
 anyone+0.149
amus+0.148
ust+0.140
umes+0.137
iqu+0.136
Neg logit contributions
 hill-0.125
,-0.120
 care-0.119
ossible-0.117
 slopes-0.115
Token knee
Token ID10329
Feature activation+2.09e-02

Pos logit contributions
 anyone+0.099
 sure+0.094
ust+0.086
 anybody+0.084
amus+0.084
Neg logit contributions
 hill-0.092
 slope-0.082
 street-0.080
-0.080
ite-0.079
Token.
Token ID13
Feature activation+4.21e-03

Pos logit contributions
 anyone+0.036
 sure+0.035
 closely+0.030
 anybody+0.030
ust+0.030
Neg logit contributions
 hill-0.032
thank-0.030
 slope-0.030
forest-0.030
-0.029
Token It
Token ID632
Feature activation+3.05e-02

Pos logit contributions
 anyone+0.007
 sure+0.006
 up+0.006
 worse+0.006
 closely+0.006
Neg logit contributions
 hill-0.007
-0.007
 slope-0.007
 slopes-0.007
field-0.007
Token has
Token ID468
Feature activation+2.49e-02

Pos logit contributions
 anyone+0.063
 sure+0.059
ust+0.055
 ok+0.054
 anybody+0.054
Neg logit contributions
 hill-0.039
-0.034
 slope-0.034
 slopes-0.034
forest-0.034
Token to
Token ID284
Feature activation-2.11e-02

Pos logit contributions
 anyone+0.005
 sure+0.004
 up+0.004
 closely+0.004
 worse+0.004
Neg logit contributions
 hill-0.007
 slope-0.006
-0.006
field-0.006
 slopes-0.006
Token play
Token ID711
Feature activation-6.09e-03

Pos logit contributions
field+0.032
 slope+0.032
ursion+0.031
+0.031
 hill+0.031
Neg logit contributions
 anyone-0.037
 sure-0.035
 up-0.033
 closely-0.031
 okay-0.030
Token outside
Token ID2354
Feature activation-3.44e-02

Pos logit contributions
 hill+0.011
 slope+0.011
 slopes+0.011
+0.011
field+0.011
Neg logit contributions
 anyone-0.009
 sure-0.008
 up-0.007
 closely-0.007
 worse-0.007
Token in
Token ID287
Feature activation-2.65e-02

Pos logit contributions
ursion+0.070
field+0.069
+0.069
forest+0.069
robat+0.068
Neg logit contributions
 anyone-0.044
 up-0.042
 sure-0.038
 closely-0.036
 about-0.033
Token the
Token ID262
Feature activation-3.17e-02

Pos logit contributions
akes+0.068
+0.068
ugg+0.068
thank+0.068
field+0.067
Neg logit contributions
 anyone-0.022
 up-0.020
 sure-0.018
 what-0.017
 a-0.014
Token sunny
Token ID27737
Feature activation-1.05e-01

Pos logit contributions
+0.041
field+0.039
 slope+0.039
robat+0.038
thank+0.038
Neg logit contributions
 anyone-0.062
 up-0.061
 sure-0.060
 closely-0.055
 okay-0.055
Token weather
Token ID6193
Feature activation-5.31e-02

Pos logit contributions
leness+0.185
+0.183
use+0.182
ursion+0.182
opter+0.181
Neg logit contributions
 up-0.182
 a-0.148
's-0.141
 about-0.136
 over-0.134
Token.
Token ID13
Feature activation-1.42e-02

Pos logit contributions
ursion+0.114
+0.109
use+0.109
 slope+0.108
 hill+0.108
Neg logit contributions
 up-0.060
 anyone-0.059
 sure-0.054
 about-0.053
 closely-0.046
Token One
Token ID1881
Feature activation-4.76e-02

Pos logit contributions
 hill+0.023
 slope+0.022
 slopes+0.022
field+0.022
+0.022
Neg logit contributions
 anyone-0.024
 sure-0.022
 up-0.021
 closely-0.020
 okay-0.020
Token day
Token ID1110
Feature activation-3.94e-02

Pos logit contributions
+0.056
rob+0.053
robat+0.050
 slopes+0.050
 slope+0.050
Neg logit contributions
 up-0.102
 sure-0.102
 anyone-0.101
 okay-0.094
 about-0.093
Token he
Token ID339
Feature activation-4.94e-04

Pos logit contributions
 hill+0.049
field+0.047
forest+0.046
 slope+0.046
robat+0.045
Neg logit contributions
 anyone-0.085
 sure-0.081
 up-0.077
 okay-0.074
 what-0.072
Token lay
Token ID3830
Feature activation-9.66e-03

Pos logit contributions
+0.010
 hill+0.010
 slope+0.010
 slopes+0.010
akes+0.010
Neg logit contributions
 anyone-0.015
 sure-0.014
 up-0.014
 closely-0.013
 worse-0.013
Token it
Token ID340
Feature activation+7.05e-03

Pos logit contributions
 hill+0.019
 slope+0.019
+0.019
field+0.018
 slopes+0.018
Neg logit contributions
 anyone-0.013
 sure-0.012
 up-0.012
 okay-0.010
 worse-0.010
Token down
Token ID866
Feature activation-3.25e-02

Pos logit contributions
 anyone+0.008
 sure+0.007
 up+0.007
 closely+0.006
 worse+0.006
Neg logit contributions
 hill-0.015
 slope-0.015
-0.015
field-0.015
 slopes-0.014
Token in
Token ID287
Feature activation-4.62e-02

Pos logit contributions
robat+0.079
 slope+0.077
field+0.076
 hill+0.076
asis+0.076
Neg logit contributions
 up-0.035
 anyone-0.029
 closely-0.025
 sure-0.025
 about-0.024
Token the
Token ID262
Feature activation-4.25e-02

Pos logit contributions
akes+0.110
field+0.110
 slope+0.109
forest+0.109
+0.108
Neg logit contributions
 anyone-0.044
 sure-0.039
 up-0.036
 what-0.030
 closely-0.029
Token deep
Token ID2769
Feature activation-1.02e-01

Pos logit contributions
+0.065
robat+0.064
field+0.063
rob+0.062
 slope+0.061
Neg logit contributions
 up-0.074
 anyone-0.067
 sure-0.064
 awful-0.063
 worse-0.062
Token,
Token ID11
Feature activation-1.43e-02

Pos logit contributions
+0.219
robat+0.203
rob+0.198
packing+0.195
ursion+0.194
Neg logit contributions
 up-0.170
 closely-0.131
 strange-0.118
 fake-0.118
 out-0.117
Token green
Token ID4077
Feature activation-5.11e-02

Pos logit contributions
+0.027
field+0.026
akes+0.026
robat+0.026
ursion+0.026
Neg logit contributions
 up-0.020
 anyone-0.019
 sure-0.019
 closely-0.017
 okay-0.017
Token grass
Token ID8701
Feature activation-3.15e-02

Pos logit contributions
ursion+0.092
+0.089
packing+0.085
leness+0.084
mph+0.084
Neg logit contributions
 up-0.080
 anyone-0.070
 a-0.064
 out-0.063
's-0.061
Token.
Token ID13
Feature activation-5.00e-03

Pos logit contributions
 hill+0.058
 slope+0.057
robat+0.057
+0.056
ursion+0.056
Neg logit contributions
 anyone-0.043
 up-0.042
 sure-0.040
 closely-0.037
 worse-0.035
Token
Token ID198
Feature activation+8.79e-03

Pos logit contributions
 hill+0.008
 slope+0.008
 slopes+0.008
+0.007
field+0.007
Neg logit contributions
 anyone-0.009
 sure-0.008
 up-0.008
 closely-0.007
 worse-0.007
Tokenflower
Token ID25547
Feature activation-1.29e-02

Pos logit contributions
 slope+0.035
 slopes+0.034
 hill+0.032
 hallway+0.032
ursion+0.032
Neg logit contributions
 anyone-0.060
 up-0.059
 sure-0.057
 okay-0.055
 just-0.054
Token,
Token ID11
Feature activation+7.27e-03

Pos logit contributions
 slope+0.026
 slopes+0.025
 hill+0.025
+0.025
 hallway+0.024
Neg logit contributions
 anyone-0.016
 sure-0.015
 up-0.015
 about-0.014
 just-0.013
Token growing
Token ID3957
Feature activation+5.70e-04

Pos logit contributions
 anyone+0.012
 sure+0.011
 up+0.010
 worse+0.010
 closely+0.010
Neg logit contributions
 hill-0.012
 slope-0.012
 slopes-0.012
-0.012
field-0.012
Token in
Token ID287
Feature activation-2.67e-02

Pos logit contributions
 anyone+0.001
 sure+0.001
 okay+0.000
 worse+0.000
 up+0.000
Neg logit contributions
 hill-0.001
 slopes-0.001
 slope-0.001
field-0.001
-0.001
Token the
Token ID262
Feature activation-3.54e-02

Pos logit contributions
 slope+0.064
+0.064
 hill+0.063
akes+0.063
forest+0.063
Neg logit contributions
 anyone-0.025
 sure-0.022
 up-0.020
 what-0.017
 okay-0.016
Token sunny
Token ID27737
Feature activation-1.02e-01

Pos logit contributions
+0.051
 slope+0.050
robat+0.050
rob+0.050
 slopes+0.047
Neg logit contributions
 anyone-0.064
 up-0.064
 sure-0.060
 okay-0.056
 awful-0.055
Token garden
Token ID11376
Feature activation-2.80e-02

Pos logit contributions
rob+0.193
robat+0.190
leness+0.189
+0.187
manship+0.182
Neg logit contributions
 up-0.161
's-0.131
 a-0.130
 out-0.118
 about-0.117
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
robat+0.063
 slope+0.061
rob+0.061
+0.060
 hill+0.059
Neg logit contributions
 up-0.032
 anyone-0.031
 sure-0.029
 about-0.027
 not-0.026
Token Every
Token ID3887
Feature activation-4.32e-02

Pos logit contributions
 hill+0.018
 slope+0.017
 slopes+0.017
+0.017
field+0.017
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.018
 closely-0.017
 okay-0.017
Token day
Token ID1110
Feature activation-5.96e-02

Pos logit contributions
rob+0.051
+0.050
thank+0.049
 slope+0.049
 slopes+0.048
Neg logit contributions
 anyone-0.090
 sure-0.088
 up-0.085
 about-0.082
 just-0.078
Token,
Token ID11
Feature activation-6.27e-03

Pos logit contributions
field+0.082
 hill+0.081
 slope+0.081
robat+0.080
 hallway+0.078
Neg logit contributions
 anyone-0.127
 sure-0.116
 up-0.114
 okay-0.110
 about-0.105
Token to
Token ID284
Feature activation-8.67e-03

Pos logit contributions
 anyone+0.017
 sure+0.016
 up+0.015
 okay+0.014
 closely+0.014
Neg logit contributions
 hill-0.021
 slope-0.021
 slopes-0.020
field-0.020
-0.020
Token play
Token ID711
Feature activation+4.71e-03

Pos logit contributions
 slope+0.014
field+0.014
 slopes+0.013
rob+0.013
robat+0.013
Neg logit contributions
 anyone-0.015
 sure-0.014
 up-0.013
 not-0.012
 okay-0.012
Token outside
Token ID2354
Feature activation-2.97e-02

Pos logit contributions
 anyone+0.007
 sure+0.007
 up+0.006
 worse+0.006
 okay+0.006
Neg logit contributions
 hill-0.008
 slope-0.008
-0.008
 slopes-0.008
thank-0.008
Token in
Token ID287
Feature activation-2.34e-02

Pos logit contributions
ursion+0.071
forest+0.070
robat+0.070
field+0.070
thank+0.070
Neg logit contributions
 up-0.027
 anyone-0.027
 sure-0.021
 closely-0.021
 just-0.019
Token the
Token ID262
Feature activation-3.02e-02

Pos logit contributions
akes+0.057
+0.057
thank+0.056
field+0.056
robat+0.055
Neg logit contributions
 anyone-0.021
 up-0.019
 sure-0.018
 what-0.016
 closely-0.014
Token sunny
Token ID27737
Feature activation-1.01e-01

Pos logit contributions
+0.046
field+0.043
robat+0.042
 slope+0.042
thank+0.041
Neg logit contributions
 up-0.054
 anyone-0.053
 sure-0.051
 okay-0.046
 worse-0.046
Token weather
Token ID6193
Feature activation-4.75e-02

Pos logit contributions
leness+0.191
+0.183
use+0.181
opter+0.178
ursion+0.176
Neg logit contributions
 up-0.172
 a-0.140
's-0.135
 about-0.126
 over-0.124
Token.
Token ID13
Feature activation-6.12e-03

Pos logit contributions
ursion+0.108
use+0.104
+0.103
robat+0.101
 slope+0.101
Neg logit contributions
 up-0.050
 anyone-0.048
 about-0.043
 sure-0.041
 not-0.038
Token One
Token ID1881
Feature activation-4.54e-02

Pos logit contributions
 hill+0.010
field+0.010
 slope+0.010
 slopes+0.010
thank+0.009
Neg logit contributions
 anyone-0.010
 sure-0.009
 up-0.009
 okay-0.008
 closely-0.008
Token day
Token ID1110
Feature activation-3.09e-02

Pos logit contributions
+0.043
 slope+0.041
 slopes+0.041
field+0.040
rob+0.039
Neg logit contributions
 anyone-0.105
 sure-0.104
 up-0.102
 okay-0.097
 about-0.095
Token,
Token ID11
Feature activation+1.48e-03

Pos logit contributions
field+0.048
forest+0.044
 hill+0.044
 hallway+0.044
robat+0.043
Neg logit contributions
 anyone-0.061
 up-0.056
 okay-0.054
 what-0.054
 sure-0.053
Token was
Token ID373
Feature activation+2.18e-02

Pos logit contributions
 anyone+0.016
 sure+0.015
 up+0.014
 closely+0.013
 not+0.013
Neg logit contributions
 hill-0.017
 slope-0.017
-0.017
 slopes-0.017
akes-0.016
Token very
Token ID845
Feature activation-1.38e-03

Pos logit contributions
 anyone+0.031
ust+0.028
ares+0.027
 worse+0.027
 closely+0.026
Neg logit contributions
 street-0.037
 hallway-0.037
thank-0.037
 evenings-0.036
forest-0.036
Token content
Token ID2695
Feature activation-4.54e-02

Pos logit contributions
 hill+0.003
 slope+0.003
 slopes+0.003
field+0.003
+0.003
Neg logit contributions
 anyone-0.002
 sure-0.002
 up-0.001
 worse-0.001
 closely-0.001
Token in
Token ID287
Feature activation-1.78e-02

Pos logit contributions
ursion+0.105
+0.103
use+0.102
akes+0.102
 hill+0.101
Neg logit contributions
 up-0.051
 sure-0.048
 about-0.048
 not-0.046
 anyone-0.045
Token the
Token ID262
Feature activation-3.90e-02

Pos logit contributions
 hill+0.039
 slope+0.038
 slopes+0.038
+0.038
field+0.037
Neg logit contributions
 anyone-0.020
 sure-0.018
 up-0.016
 closely-0.015
 okay-0.015
Token sunny
Token ID27737
Feature activation-1.01e-01

Pos logit contributions
+0.062
 hill+0.061
 slope+0.061
 slopes+0.059
field+0.058
Neg logit contributions
 anyone-0.065
 sure-0.063
 up-0.060
 okay-0.056
 closely-0.055
Token me
Token ID502
Feature activation-2.10e-02

Pos logit contributions
+0.180
ursion+0.166
opter+0.160
packing+0.157
rob+0.156
Neg logit contributions
 up-0.181
 about-0.149
 anyone-0.148
's-0.146
 a-0.142
Tokenadow
Token ID4584
Feature activation-2.75e-02

Pos logit contributions
 slope+0.030
 hill+0.030
+0.028
 slopes+0.028
ursion+0.027
Neg logit contributions
 sure-0.039
 anyone-0.039
 up-0.037
 about-0.035
 okay-0.035
Token,
Token ID11
Feature activation+7.73e-03

Pos logit contributions
 hill+0.052
 slope+0.051
robat+0.050
+0.050
 slopes+0.050
Neg logit contributions
 anyone-0.038
 sure-0.036
 up-0.036
 closely-0.031
 about-0.031
Token where
Token ID810
Feature activation+2.80e-02

Pos logit contributions
 anyone+0.012
 sure+0.012
 up+0.011
 worse+0.010
 closely+0.010
Neg logit contributions
 hill-0.013
 slope-0.013
 slopes-0.012
-0.012
field-0.012
Token she
Token ID673
Feature activation+1.18e-02

Pos logit contributions
 anyone+0.044
ares+0.043
 sure+0.043
 closely+0.043
amus+0.042
Neg logit contributions
 hill-0.042
 evenings-0.040
 street-0.038
 season-0.038
 hallway-0.038
Tokenals
Token ID874
Feature activation-1.27e-02

Pos logit contributions
 slope+0.028
 hill+0.028
 slopes+0.027
 hallway+0.027
+0.026
Neg logit contributions
 anyone-0.032
 sure-0.029
 up-0.028
 worse-0.026
 okay-0.026
Token sh
Token ID427
Feature activation-2.64e-02

Pos logit contributions
 slope+0.020
 hill+0.020
 slopes+0.019
 hallway+0.019
+0.019
Neg logit contributions
 anyone-0.022
 sure-0.021
 up-0.020
 closely-0.019
 worse-0.019
Tokenined
Token ID1389
Feature activation-4.93e-03

Pos logit contributions
 hill+0.034
 slope+0.033
 hallway+0.031
 slopes+0.030
field+0.030
Neg logit contributions
 anyone-0.053
 sure-0.051
 up-0.050
 okay-0.047
 closely-0.047
Token in
Token ID287
Feature activation-3.30e-02

Pos logit contributions
 hill+0.008
 slopes+0.008
 slope+0.008
thank+0.008
akes+0.008
Neg logit contributions
 anyone-0.007
 sure-0.007
ust-0.006
 closely-0.006
 up-0.006
Token the
Token ID262
Feature activation-3.93e-02

Pos logit contributions
 hill+0.069
 slope+0.069
+0.068
field+0.067
 slopes+0.067
Neg logit contributions
 anyone-0.041
 sure-0.036
 up-0.034
 closely-0.031
 okay-0.031
Token sunny
Token ID27737
Feature activation-1.01e-01

Pos logit contributions
+0.048
 slope+0.048
robat+0.046
rob+0.045
 hill+0.045
Neg logit contributions
 anyone-0.080
 up-0.077
 sure-0.073
 awful-0.071
 worse-0.070
Token sky
Token ID6766
Feature activation-4.25e-02

Pos logit contributions
ursion+0.164
rob+0.163
packing+0.161
+0.158
opter+0.157
Neg logit contributions
 up-0.168
 anyone-0.137
's-0.136
 a-0.134
 about-0.133
Token.
Token ID13
Feature activation+6.77e-04

Pos logit contributions
 slope+0.087
 hill+0.086
+0.085
 slopes+0.084
robat+0.083
Neg logit contributions
 anyone-0.054
 sure-0.050
 up-0.049
 closely-0.043
 okay-0.043
Token
Token ID198
Feature activation+8.20e-03

Pos logit contributions
 anyone+0.001
 sure+0.001
 up+0.001
 worse+0.001
 closely+0.001
Neg logit contributions
 hill-0.001
-0.001
 slope-0.001
 slopes-0.001
field-0.001
Token
Token ID198
Feature activation-1.87e-04

Pos logit contributions
 anyone+0.017
 sure+0.015
 worse+0.015
 up+0.015
 awful+0.015
Neg logit contributions
 hill-0.010
-0.009
field-0.009
 slope-0.009
 slopes-0.009
TokenOne
Token ID3198
Feature activation-4.07e-02

Pos logit contributions
 hill+0.000
 slope+0.000
 slopes+0.000
 hallway+0.000
Now+0.000
Neg logit contributions
 anyone-0.000
 sure-0.000
 up-0.000
 worse-0.000
 okay-0.000
Token It
Token ID632
Feature activation+1.70e-02

Pos logit contributions
 hill+0.004
 slope+0.004
 slopes+0.004
field+0.004
+0.004
Neg logit contributions
 anyone-0.005
 sure-0.005
 up-0.004
 okay-0.004
 worse-0.004
Token will
Token ID481
Feature activation-9.64e-03

Pos logit contributions
 anyone+0.029
 sure+0.027
 up+0.025
 closely+0.025
 okay+0.024
Neg logit contributions
 hill-0.027
 slope-0.025
field-0.025
 slopes-0.025
-0.025
Token glow
Token ID19634
Feature activation+9.34e-03

Pos logit contributions
 slope+0.018
rob+0.017
robat+0.017
+0.017
 hill+0.017
Neg logit contributions
 anyone-0.015
 sure-0.014
 okay-0.013
 up-0.013
 not-0.013
Token in
Token ID287
Feature activation-2.06e-02

Pos logit contributions
 anyone+0.014
 sure+0.013
 up+0.013
 closely+0.011
 okay+0.011
Neg logit contributions
 hill-0.017
-0.017
 slope-0.017
 slopes-0.016
field-0.016
Token the
Token ID262
Feature activation-3.67e-02

Pos logit contributions
+0.045
 slope+0.045
 hill+0.045
forest+0.044
robat+0.044
Neg logit contributions
 anyone-0.023
 sure-0.021
 up-0.019
 closely-0.018
 okay-0.017
Token dark
Token ID3223
Feature activation-1.01e-01

Pos logit contributions
+0.055
robat+0.053
rob+0.053
 slope+0.052
ursion+0.050
Neg logit contributions
 up-0.066
 anyone-0.066
 sure-0.060
 worse-0.059
 okay-0.059
Token and
Token ID290
Feature activation-4.69e-04

Pos logit contributions
+0.248
ursion+0.221
rob+0.213
robat+0.211
oped+0.209
Neg logit contributions
 up-0.117
 anyone-0.109
 a-0.098
's-0.093
 about-0.091
Token scare
Token ID19437
Feature activation-1.32e-02

Pos logit contributions
 hill+0.001
 slope+0.001
+0.001
 slopes+0.001
field+0.001
Neg logit contributions
 anyone-0.001
 sure-0.001
 up-0.001
 worse-0.001
 closely-0.001
Token away
Token ID1497
Feature activation-1.16e-02

Pos logit contributions
 hill+0.025
 slopes+0.024
 slope+0.024
 evenings+0.024
field+0.023
Neg logit contributions
 anyone-0.017
 sure-0.016
 up-0.015
 closely-0.014
 okay-0.014
Token the
Token ID262
Feature activation-7.60e-03

Pos logit contributions
 hill+0.018
 slope+0.018
 slopes+0.017
field+0.017
+0.017
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.018
 worse-0.017
 closely-0.017
Token bad
Token ID2089
Feature activation-2.97e-02

Pos logit contributions
 hill+0.011
+0.011
 slope+0.011
 slopes+0.011
field+0.011
Neg logit contributions
 anyone-0.014
 sure-0.013
 up-0.012
 worse-0.012
 okay-0.012
Token to
Token ID284
Feature activation-2.10e-02

Pos logit contributions
 up+0.004
 sure+0.004
 anyone+0.004
 closely+0.003
 about+0.003
Neg logit contributions
 hill-0.007
-0.007
akes-0.007
 slope-0.007
robat-0.007
Token the
Token ID262
Feature activation-9.34e-03

Pos logit contributions
 hill+0.042
 slope+0.041
 slopes+0.040
+0.040
field+0.040
Neg logit contributions
 anyone-0.027
 sure-0.025
 up-0.023
 closely-0.022
 worse-0.022
Token poppy
Token ID47867
Feature activation-4.63e-06

Pos logit contributions
 hill+0.011
 slope+0.010
 slopes+0.010
field+0.010
+0.010
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.018
 okay-0.018
 worse-0.017
Token in
Token ID287
Feature activation-4.01e-02

Feature not active on prev token;
no predictions were affected.
Token the
Token ID262
Feature activation-4.25e-02

Pos logit contributions
 slope+0.093
akes+0.093
 hill+0.092
 slopes+0.091
asis+0.090
Neg logit contributions
 anyone-0.041
 sure-0.038
 up-0.034
 closely-0.029
 okay-0.029
Token sunny
Token ID27737
Feature activation-1.01e-01

Pos logit contributions
 slope+0.045
 hill+0.045
+0.044
 slopes+0.044
akes+0.043
Neg logit contributions
 anyone-0.092
 sure-0.088
 up-0.087
 worse-0.081
 closely-0.081
Token field
Token ID2214
Feature activation-4.53e-02

Pos logit contributions
rob+0.179
robat+0.177
akes+0.170
ursion+0.163
opter+0.160
Neg logit contributions
 up-0.163
's-0.130
 a-0.126
 something-0.121
 closely-0.119
Token.
Token ID13
Feature activation+1.67e-03

Pos logit contributions
rob+0.095
 slope+0.095
robat+0.095
 slopes+0.092
 hill+0.092
Neg logit contributions
 anyone-0.052
 up-0.051
 sure-0.049
 about-0.045
 not-0.042
Token She
Token ID1375
Feature activation-7.88e-03

Pos logit contributions
 anyone+0.003
 sure+0.003
 up+0.002
 closely+0.002
 okay+0.002
Neg logit contributions
 hill-0.003
 slope-0.003
robat-0.003
 slopes-0.003
field-0.003
Token put
Token ID1234
Feature activation-4.94e-03

Pos logit contributions
 hill+0.013
 slope+0.012
 slopes+0.012
+0.012
field+0.012
Neg logit contributions
 anyone-0.013
 sure-0.013
 up-0.012
 closely-0.011
 worse-0.011
Token the
Token ID262
Feature activation+6.90e-03

Pos logit contributions
 hill+0.010
+0.010
 slope+0.010
 slopes+0.010
field+0.010
Neg logit contributions
 anyone-0.006
 sure-0.005
ust-0.005
 worse-0.005
 closely-0.005
Token would
Token ID561
Feature activation-1.16e-02

Pos logit contributions
 sure+0.001
 anyone+0.001
 up+0.001
 closely+0.001
 just+0.001
Neg logit contributions
-0.001
 slope-0.001
 slopes-0.001
field-0.001
 hill-0.001
Token sit
Token ID1650
Feature activation-4.92e-03

Pos logit contributions
+0.018
akes+0.018
 slope+0.018
field+0.017
 hill+0.017
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.018
 closely-0.017
 not-0.017
Token together
Token ID1978
Feature activation-2.45e-03

Pos logit contributions
 hill+0.009
 slope+0.009
 slopes+0.009
+0.009
field+0.009
Neg logit contributions
 anyone-0.007
 sure-0.006
 up-0.006
 okay-0.006
 closely-0.006
Token in
Token ID287
Feature activation-2.34e-02

Pos logit contributions
+0.004
 slope+0.004
rob+0.004
 hill+0.004
 slopes+0.004
Neg logit contributions
 anyone-0.004
 sure-0.004
 up-0.004
 closely-0.003
 about-0.003
Token the
Token ID262
Feature activation-3.75e-02

Pos logit contributions
forest+0.053
asis+0.053
 slope+0.053
akes+0.053
thank+0.052
Neg logit contributions
 anyone-0.025
 sure-0.023
 up-0.021
 closely-0.019
 what-0.018
Token sunny
Token ID27737
Feature activation-1.01e-01

Pos logit contributions
+0.058
thank+0.058
akes+0.058
 slope+0.058
field+0.057
Neg logit contributions
 up-0.061
 anyone-0.061
 sure-0.054
 wrong-0.053
 awful-0.053
Token garden
Token ID11376
Feature activation-3.82e-02

Pos logit contributions
ursion+0.179
+0.172
leness+0.172
akes+0.171
oped+0.169
Neg logit contributions
 up-0.150
's-0.124
 a-0.122
 new-0.117
 about-0.113
Token,
Token ID11
Feature activation+1.66e-03

Pos logit contributions
rob+0.072
robat+0.072
 slope+0.072
ursion+0.071
+0.070
Neg logit contributions
 up-0.053
 anyone-0.051
 sure-0.050
 about-0.046
 closely-0.046
Token and
Token ID290
Feature activation-1.48e-03

Pos logit contributions
 anyone+0.003
 sure+0.003
 up+0.002
 closely+0.002
 okay+0.002
Neg logit contributions
 hill-0.003
 slope-0.003
 slopes-0.003
-0.003
field-0.003
Token grand
Token ID4490
Feature activation-2.21e-02

Pos logit contributions
 hill+0.003
 slope+0.002
+0.002
 slopes+0.002
field+0.002
Neg logit contributions
 anyone-0.002
 sure-0.002
 up-0.002
 closely-0.002
 okay-0.002
Tokenpa
Token ID8957
Feature activation+1.26e-02

Pos logit contributions
+0.030
 slope+0.029
 hill+0.029
field+0.028
 slopes+0.027
Neg logit contributions
 anyone-0.044
 sure-0.042
 up-0.040
 closely-0.038
 what-0.037
Token wizard
Token ID18731
Feature activation-4.02e-04

Pos logit contributions
 hill+0.010
 slope+0.009
 slopes+0.009
field+0.009
 hallway+0.009
Neg logit contributions
 anyone-0.010
 sure-0.010
 up-0.009
 closely-0.009
 okay-0.009
Token was
Token ID373
Feature activation+1.43e-02

Pos logit contributions
 hill+0.001
 slope+0.001
 slopes+0.001
+0.001
field+0.001
Neg logit contributions
 anyone-0.001
 sure-0.001
 up-0.001
 closely-0.001
 worse-0.001
Token walking
Token ID6155
Feature activation-1.80e-02

Pos logit contributions
 anyone+0.020
ust+0.018
 sure+0.017
 awful+0.017
iqu+0.017
Neg logit contributions
thank-0.025
field-0.024
 hill-0.023
forest-0.023
 street-0.023
Token through
Token ID832
Feature activation-2.28e-02

Pos logit contributions
 hill+0.036
 slope+0.035
 slopes+0.035
field+0.034
+0.034
Neg logit contributions
 anyone-0.023
 sure-0.022
 up-0.019
 worse-0.019
 okay-0.019
Token a
Token ID257
Feature activation-3.00e-02

Pos logit contributions
 hill+0.048
 slope+0.047
 slopes+0.047
+0.046
field+0.046
Neg logit contributions
 anyone-0.026
 sure-0.024
 up-0.022
 okay-0.020
 worse-0.020
Token sunny
Token ID27737
Feature activation-1.00e-01

Pos logit contributions
+0.041
 hill+0.041
 slope+0.041
akes+0.041
thank+0.041
Neg logit contributions
 anyone-0.056
 sure-0.053
 up-0.050
 closely-0.048
 worse-0.047
Token me
Token ID502
Feature activation-2.95e-02

Pos logit contributions
akes+0.170
use+0.168
+0.164
ursion+0.156
leness+0.151
Neg logit contributions
 up-0.164
 a-0.142
 anyone-0.135
 closely-0.134
 about-0.133
Tokenadow
Token ID4584
Feature activation-3.87e-02

Pos logit contributions
 slope+0.046
 hill+0.046
+0.042
 slopes+0.042
thank+0.041
Neg logit contributions
 anyone-0.052
 sure-0.052
 up-0.047
 about-0.047
 okay-0.044
Token when
Token ID618
Feature activation+2.16e-03

Pos logit contributions
+0.078
robat+0.078
 slope+0.077
rob+0.077
 hill+0.076
Neg logit contributions
 anyone-0.049
 up-0.047
 sure-0.044
 about-0.041
 closely-0.039
Token he
Token ID339
Feature activation-6.85e-03

Pos logit contributions
 sure+0.004
 anyone+0.004
 up+0.003
ust+0.003
 closely+0.003
Neg logit contributions
 hill-0.003
 slope-0.003
 slopes-0.003
field-0.003
-0.003
Token spotted
Token ID13489
Feature activation+4.02e-02

Pos logit contributions
 hill+0.008
field+0.007
+0.007
 distance+0.007
thank+0.007
Neg logit contributions
 anyone-0.014
 sure-0.014
 up-0.013
 closely-0.013
 awful-0.013
Token it
Token ID340
Feature activation+4.79e-03

Pos logit contributions
 hill+0.021
 slope+0.021
+0.021
 slopes+0.021
akes+0.020
Neg logit contributions
 anyone-0.017
 sure-0.017
 up-0.015
 closely-0.014
 okay-0.014
Token moved
Token ID3888
Feature activation-2.07e-03

Pos logit contributions
 anyone+0.008
 sure+0.008
 up+0.007
 closely+0.007
 worse+0.007
Neg logit contributions
 hill-0.007
 slope-0.007
-0.007
 slopes-0.007
akes-0.007
Token around
Token ID1088
Feature activation-1.21e-02

Pos logit contributions
 hill+0.004
 slope+0.004
 slopes+0.004
field+0.004
thank+0.004
Neg logit contributions
 anyone-0.003
 sure-0.003
 up-0.002
 worse-0.002
 okay-0.002
Token in
Token ID287
Feature activation-3.75e-02

Pos logit contributions
 hill+0.025
 slope+0.025
+0.024
forest+0.024
akes+0.024
Neg logit contributions
 anyone-0.016
 sure-0.015
 up-0.014
 closely-0.013
 okay-0.012
Token the
Token ID262
Feature activation-4.35e-02

Pos logit contributions
akes+0.082
forest+0.080
+0.080
ursion+0.079
 slope+0.079
Neg logit contributions
 anyone-0.045
 sure-0.043
 up-0.039
 what-0.035
 closely-0.035
Token sunny
Token ID27737
Feature activation-1.00e-01

Pos logit contributions
+0.054
 hill+0.053
 slope+0.052
akes+0.051
robat+0.050
Neg logit contributions
 anyone-0.089
 sure-0.085
 up-0.083
 closely-0.078
 worse-0.077
Token part
Token ID636
Feature activation-3.83e-02

Pos logit contributions
+0.180
ursion+0.175
rob+0.172
robat+0.170
akes+0.168
Neg logit contributions
 up-0.169
 anyone-0.134
 a-0.134
 closely-0.133
 about-0.132
Token of
Token ID286
Feature activation-2.24e-02

Pos logit contributions
+0.073
rob+0.073
 slope+0.072
robat+0.071
 slopes+0.069
Neg logit contributions
 anyone-0.053
 up-0.050
 sure-0.050
 about-0.044
 closely-0.044
Token the
Token ID262
Feature activation-3.20e-02

Pos logit contributions
 hill+0.045
 slope+0.044
 slopes+0.043
+0.043
field+0.043
Neg logit contributions
 anyone-0.029
 sure-0.027
 up-0.025
 closely-0.023
 worse-0.023
Token ocean
Token ID9151
Feature activation-2.26e-02

Pos logit contributions
+0.045
 hill+0.044
 slope+0.044
akes+0.043
 slopes+0.043
Neg logit contributions
 anyone-0.060
 sure-0.057
 up-0.054
 closely-0.051
 worse-0.051
Token,
Token ID11
Feature activation-1.54e-03

Pos logit contributions
+0.044
 slope+0.043
 hill+0.043
ursion+0.042
robat+0.042
Neg logit contributions
 anyone-0.030
 sure-0.029
 up-0.028
 closely-0.025
 okay-0.024
Token many
Token ID867
Feature activation-2.22e-02

Pos logit contributions
 anyone+0.039
 sure+0.038
 closely+0.037
 up+0.037
ares+0.035
Neg logit contributions
 hill-0.068
 slopes-0.065
 slope-0.064
 evenings-0.063
field-0.063
Token dangers
Token ID16169
Feature activation-2.69e-02

Pos logit contributions
 hill+0.041
 slope+0.039
 slopes+0.039
 evenings+0.038
 hallway+0.037
Neg logit contributions
 anyone-0.031
 sure-0.030
 up-0.028
 okay-0.026
 worse-0.026
Token lurking
Token ID36105
Feature activation-1.18e-02

Pos logit contributions
+0.056
 hill+0.055
field+0.055
 slope+0.055
 slopes+0.054
Neg logit contributions
 anyone-0.034
 sure-0.031
 up-0.029
 worse-0.026
 closely-0.026
Token in
Token ID287
Feature activation-2.55e-02

Pos logit contributions
 hill+0.024
 slope+0.024
 slopes+0.023
field+0.023
+0.023
Neg logit contributions
 anyone-0.014
 sure-0.014
 up-0.012
 okay-0.011
 worse-0.011
Token the
Token ID262
Feature activation-4.48e-02

Pos logit contributions
 hill+0.053
 slope+0.052
+0.052
field+0.051
 slopes+0.051
Neg logit contributions
 anyone-0.031
 sure-0.028
 up-0.026
 worse-0.024
 closely-0.023
Token deep
Token ID2769
Feature activation-9.98e-02

Pos logit contributions
+0.050
 hill+0.047
 slope+0.046
akes+0.046
thank+0.046
Neg logit contributions
 anyone-0.099
 sure-0.093
 up-0.092
 closely-0.087
 worse-0.087
Token blue
Token ID4171
Feature activation-7.26e-02

Pos logit contributions
+0.186
robat+0.164
ursion+0.162
thank+0.160
akes+0.159
Neg logit contributions
 up-0.170
 anyone-0.168
 closely-0.154
 strange-0.142
 fake-0.135
Token sea
Token ID5417
Feature activation-2.51e-02

Pos logit contributions
thank+0.102
+0.102
 slope+0.099
forest+0.096
ursion+0.095
Neg logit contributions
 anyone-0.135
 up-0.123
 something-0.112
 sure-0.110
 someone-0.108
Token.
Token ID13
Feature activation-2.22e-03

Pos logit contributions
+0.046
 slope+0.045
thank+0.044
 hill+0.043
ursion+0.043
Neg logit contributions
 anyone-0.038
 sure-0.035
 up-0.034
 closely-0.031
 okay-0.031
Token But
Token ID887
Feature activation-2.92e-02

Pos logit contributions
 hill+0.003
 slope+0.003
+0.003
 slopes+0.003
field+0.003
Neg logit contributions
 anyone-0.004
 sure-0.004
 up-0.004
 worse-0.004
 closely-0.004
Token he
Token ID339
Feature activation-2.67e-03

Pos logit contributions
 hill+0.048
 slope+0.046
field+0.045
 slopes+0.045
+0.044
Neg logit contributions
 anyone-0.048
 sure-0.045
 up-0.042
 worse-0.040
 closely-0.040
Token So
Token ID1406
Feature activation-5.33e-02

Pos logit contributions
 hill+0.011
 slope+0.011
 slopes+0.011
field+0.011
+0.011
Neg logit contributions
 anyone-0.017
 sure-0.016
 up-0.015
 closely-0.014
 okay-0.014
Token they
Token ID484
Feature activation-4.41e-03

Pos logit contributions
forest+0.088
robat+0.088
akes+0.087
rob+0.086
thank+0.086
Neg logit contributions
 sure-0.092
 anyone-0.090
 up-0.087
 closely-0.078
 okay-0.078
Token stayed
Token ID9658
Feature activation-1.82e-02

Pos logit contributions
 hill+0.007
 slope+0.007
+0.007
 slopes+0.007
field+0.007
Neg logit contributions
 anyone-0.007
 sure-0.007
 up-0.006
 closely-0.006
 okay-0.006
Token in
Token ID287
Feature activation-3.47e-02

Pos logit contributions
+0.039
 slope+0.039
 hill+0.039
akes+0.038
robat+0.038
Neg logit contributions
 sure-0.021
 up-0.020
 anyone-0.020
 okay-0.017
 closely-0.017
Token the
Token ID262
Feature activation-4.64e-02

Pos logit contributions
akes+0.084
forest+0.082
ursion+0.081
 slope+0.080
robat+0.080
Neg logit contributions
 anyone-0.032
 sure-0.031
 up-0.029
 what-0.025
 closely-0.024
Token sunny
Token ID27737
Feature activation-9.94e-02

Pos logit contributions
thank+0.068
+0.067
akes+0.067
robat+0.067
rob+0.065
Neg logit contributions
 up-0.083
 anyone-0.081
 sure-0.076
 closely-0.074
 awful-0.071
Token field
Token ID2214
Feature activation-4.75e-02

Pos logit contributions
rob+0.176
robat+0.174
ursion+0.171
bench+0.170
+0.170
Neg logit contributions
 up-0.158
 all-0.130
 about-0.123
 new-0.121
 closely-0.115
Token until
Token ID1566
Feature activation-3.54e-02

Pos logit contributions
robat+0.088
rob+0.088
 slope+0.087
+0.086
akes+0.086
Neg logit contributions
 up-0.069
 sure-0.067
 anyone-0.065
 about-0.062
 closely-0.059
Token it
Token ID340
Feature activation+3.44e-03

Pos logit contributions
 hill+0.072
 slope+0.071
akes+0.069
 slopes+0.069
forest+0.069
Neg logit contributions
 anyone-0.046
 sure-0.042
 up-0.039
 okay-0.035
 closely-0.035
Token was
Token ID373
Feature activation+2.55e-03

Pos logit contributions
 anyone+0.005
 sure+0.005
 up+0.005
 closely+0.004
 worse+0.004
Neg logit contributions
 hill-0.006
 slope-0.006
 slopes-0.006
-0.006
field-0.006
Token time
Token ID640
Feature activation-1.22e-02

Pos logit contributions
 anyone+0.004
 sure+0.004
 up+0.004
 okay+0.004
 closely+0.004
Neg logit contributions
 hill-0.004
 slope-0.004
 slopes-0.004
-0.004
field-0.004
Token liked
Token ID8288
Feature activation+1.60e-02

Pos logit contributions
 anyone+0.010
 sure+0.010
 up+0.009
 closely+0.008
 okay+0.008
Neg logit contributions
 hill-0.010
-0.010
 slope-0.010
 slopes-0.009
field-0.009
Token to
Token ID284
Feature activation-1.72e-02

Pos logit contributions
 anyone+0.023
 sure+0.022
 up+0.020
 okay+0.019
 worse+0.019
Neg logit contributions
 hill-0.029
 slope-0.028
 slopes-0.028
field-0.027
-0.027
Token relax
Token ID8960
Feature activation-2.99e-02

Pos logit contributions
+0.027
ursion+0.026
field+0.026
 slope+0.026
akes+0.026
Neg logit contributions
 anyone-0.030
 sure-0.028
 up-0.027
 closely-0.024
 not-0.024
Token in
Token ID287
Feature activation-2.16e-02

Pos logit contributions
ursion+0.065
+0.063
ite+0.061
akes+0.061
forest+0.061
Neg logit contributions
 anyone-0.036
 up-0.035
 sure-0.032
 closely-0.029
 just-0.028
Token the
Token ID262
Feature activation-3.36e-02

Pos logit contributions
ursion+0.050
akes+0.050
forest+0.049
+0.049
thank+0.049
Neg logit contributions
 anyone-0.023
 sure-0.020
 up-0.019
 what-0.016
 closely-0.016
Token sunny
Token ID27737
Feature activation-9.84e-02

Pos logit contributions
+0.045
field+0.042
 slope+0.042
ursion+0.041
thank+0.041
Neg logit contributions
 anyone-0.063
 up-0.062
 sure-0.058
 awful-0.055
 closely-0.055
Token me
Token ID502
Feature activation-2.60e-02

Pos logit contributions
+0.176
ursion+0.174
leness+0.165
packing+0.164
opter+0.164
Neg logit contributions
 up-0.161
's-0.133
 a-0.128
 about-0.122
 anyone-0.122
Tokenadow
Token ID4584
Feature activation-3.21e-02

Pos logit contributions
 slope+0.041
 hill+0.040
+0.040
thank+0.038
 slopes+0.038
Neg logit contributions
 sure-0.046
 anyone-0.045
 up-0.042
 about-0.041
 okay-0.041
Token.
Token ID13
Feature activation-6.46e-03

Pos logit contributions
+0.061
 slope+0.061
 hill+0.060
robat+0.060
ursion+0.059
Neg logit contributions
 anyone-0.044
 up-0.042
 sure-0.041
 about-0.036
 closely-0.036
Token
Token ID198
Feature activation+5.85e-03

Pos logit contributions
 hill+0.010
 slope+0.010
 slopes+0.009
+0.009
field+0.009
Neg logit contributions
 anyone-0.011
 sure-0.011
 up-0.010
 worse-0.010
 closely-0.010
Token
Token ID198
Feature activation+1.68e-04

Pos logit contributions
 anyone+0.011
 sure+0.010
 up+0.010
 worse+0.010
 awful+0.010
Neg logit contributions
 hill-0.008
 slope-0.008
 slopes-0.008
-0.008
field-0.008
Token play
Token ID711
Feature activation+1.07e-02

Pos logit contributions
 anyone+0.005
 sure+0.005
 up+0.004
 closely+0.004
 worse+0.004
Neg logit contributions
 hill-0.004
 slope-0.004
 slopes-0.004
field-0.004
-0.004
Token together
Token ID1978
Feature activation-1.27e-02

Pos logit contributions
 anyone+0.020
 sure+0.019
 up+0.018
 worse+0.017
 closely+0.017
Neg logit contributions
 hill-0.015
 slope-0.014
 slopes-0.014
-0.014
thank-0.013
Token in
Token ID287
Feature activation-2.41e-02

Pos logit contributions
 hill+0.024
 slope+0.023
 slopes+0.023
+0.023
field+0.022
Neg logit contributions
 anyone-0.018
 sure-0.017
 up-0.016
 closely-0.015
 okay-0.015
Token the
Token ID262
Feature activation-3.32e-02

Pos logit contributions
 hill+0.048
 slope+0.047
 slopes+0.046
+0.046
field+0.046
Neg logit contributions
 anyone-0.031
 sure-0.029
 up-0.027
 closely-0.025
 worse-0.025
Token big
Token ID1263
Feature activation-6.93e-02

Pos logit contributions
+0.034
 slope+0.034
 hill+0.034
 slopes+0.033
thank+0.033
Neg logit contributions
 anyone-0.073
 sure-0.069
 up-0.069
 closely-0.064
 worse-0.064
Token sunny
Token ID27737
Feature activation-9.83e-02

Pos logit contributions
thank+0.118
ugg+0.109
akes+0.107
+0.107
Now+0.106
Neg logit contributions
 up-0.119
 anyone-0.114
 fake-0.102
 closely-0.101
 strange-0.101
Token garden
Token ID11376
Feature activation-3.52e-02

Pos logit contributions
ursion+0.162
rob+0.160
leness+0.159
+0.157
opter+0.154
Neg logit contributions
 up-0.165
 a-0.129
 new-0.125
's-0.125
 about-0.124
Token.
Token ID13
Feature activation+9.21e-04

Pos logit contributions
 slope+0.069
+0.068
 hill+0.067
 slopes+0.067
ursion+0.067
Neg logit contributions
 anyone-0.045
 sure-0.041
 up-0.041
 closely-0.036
 about-0.035
Token<|endoftext|>
Token ID50256
Feature activation-2.14e-02

Pos logit contributions
 anyone+0.002
 sure+0.002
 worse+0.002
 up+0.002
ust+0.002
Neg logit contributions
-0.001
 hill-0.001
 slope-0.001
field-0.001
 slopes-0.001
TokenOne
Token ID3198
Feature activation-3.60e-02

Pos logit contributions
 hill+0.033
 slope+0.031
+0.031
 slopes+0.031
field+0.031
Neg logit contributions
 anyone-0.037
 sure-0.036
 up-0.033
 worse-0.032
 closely-0.032
Token day
Token ID1110
Feature activation-8.84e-03

Pos logit contributions
+0.034
 slope+0.033
 slopes+0.031
thank+0.031
rob+0.031
Neg logit contributions
 anyone-0.086
 sure-0.083
 up-0.081
 about-0.077
 okay-0.075
Tokenam
Token ID321
Feature activation-8.02e-03

Pos logit contributions
 hill+0.009
 slopes+0.008
 slope+0.008
field+0.008
akes+0.008
Neg logit contributions
 anyone-0.009
 sure-0.008
 worse-0.007
 up-0.007
 okay-0.007
Token further
Token ID2252
Feature activation-4.27e-02

Pos logit contributions
 hill+0.016
+0.015
 slope+0.015
field+0.015
 slopes+0.015
Neg logit contributions
 anyone-0.010
 sure-0.010
 up-0.009
 closely-0.008
 okay-0.008
Token out
Token ID503
Feature activation-1.88e-02

Pos logit contributions
+0.096
akes+0.090
ursion+0.087
robat+0.086
ugg+0.085
Neg logit contributions
 up-0.059
 anyone-0.051
 sure-0.051
 closely-0.050
 what-0.042
Token into
Token ID656
Feature activation-3.59e-02

Pos logit contributions
+0.040
akes+0.040
robat+0.040
rob+0.039
thank+0.039
Neg logit contributions
 anyone-0.021
 up-0.021
 sure-0.021
 closely-0.019
 worse-0.017
Token the
Token ID262
Feature activation-4.60e-02

Pos logit contributions
 hill+0.075
+0.074
akes+0.074
 slope+0.074
forest+0.073
Neg logit contributions
 anyone-0.043
 sure-0.040
 up-0.036
 closely-0.033
 worse-0.032
Token deep
Token ID2769
Feature activation-9.83e-02

Pos logit contributions
+0.054
akes+0.053
thank+0.052
robat+0.052
 slope+0.051
Neg logit contributions
 anyone-0.095
 up-0.091
 sure-0.090
 closely-0.085
 worse-0.085
Token ocean
Token ID9151
Feature activation-3.09e-02

Pos logit contributions
robat+0.166
+0.164
rob+0.161
akes+0.157
thank+0.156
Neg logit contributions
 up-0.177
 closely-0.168
 anyone-0.158
 strange-0.148
 fake-0.139
Token.
Token ID13
Feature activation-4.71e-03

Pos logit contributions
+0.065
robat+0.064
rob+0.063
ursion+0.063
akes+0.063
Neg logit contributions
 sure-0.036
 anyone-0.036
 up-0.035
 closely-0.031
 about-0.028
Token Sure
Token ID10889
Feature activation-1.89e-02

Pos logit contributions
 hill+0.008
 slope+0.007
field+0.007
 slopes+0.007
robat+0.007
Neg logit contributions
 anyone-0.008
 sure-0.008
 up-0.007
 okay-0.007
 closely-0.007
Token enough
Token ID1576
Feature activation-3.50e-02

Pos logit contributions
 hill+0.048
 slope+0.047
akes+0.047
 hallway+0.047
+0.046
Neg logit contributions
 anyone-0.016
 sure-0.015
 up-0.012
 okay-0.011
 closely-0.011
Token,
Token ID11
Feature activation-1.76e-02

Pos logit contributions
akes+0.057
thank+0.056
rob+0.054
forest+0.054
robat+0.053
Neg logit contributions
 anyone-0.068
 sure-0.063
 up-0.058
 okay-0.058
 what-0.057
Token saw
Token ID2497
Feature activation+3.37e-02

Pos logit contributions
 anyone+0.006
 sure+0.006
 up+0.005
 worse+0.005
 closely+0.005
Neg logit contributions
 hill-0.009
 slope-0.008
 slopes-0.008
field-0.008
-0.008
Token it
Token ID340
Feature activation+3.00e-02

Pos logit contributions
ust+0.048
 anyone+0.045
 ok+0.045
 okay+0.044
 sure+0.043
Neg logit contributions
 hill-0.055
 evenings-0.055
 slopes-0.054
field-0.051
 days-0.051
Token shining
Token ID22751
Feature activation+1.34e-04

Pos logit contributions
 anyone+0.043
 sure+0.041
 up+0.035
 okay+0.035
 worse+0.035
Neg logit contributions
 hill-0.054
field-0.052
 slopes-0.051
 slope-0.050
forest-0.050
Token in
Token ID287
Feature activation-3.77e-02

Pos logit contributions
 anyone+0.000
 sure+0.000
 worse+0.000
 okay+0.000
ust+0.000
Neg logit contributions
 hill-0.000
 slopes-0.000
 slope-0.000
field-0.000
thank-0.000
Token the
Token ID262
Feature activation-3.62e-02

Pos logit contributions
 hill+0.081
 slope+0.079
+0.078
 slopes+0.078
field+0.077
Neg logit contributions
 anyone-0.043
 sure-0.040
 up-0.036
 closely-0.033
 worse-0.033
Token dark
Token ID3223
Feature activation-9.79e-02

Pos logit contributions
+0.042
 slope+0.040
 hill+0.038
akes+0.038
thank+0.038
Neg logit contributions
 anyone-0.077
 up-0.074
 sure-0.072
 closely-0.068
 worse-0.068
Token.
Token ID13
Feature activation+2.32e-03

Pos logit contributions
+0.237
ursion+0.215
packing+0.210
thank+0.200
robat+0.200
Neg logit contributions
 anyone-0.114
 up-0.113
 about-0.098
 a-0.093
 closely-0.091
Token Pluto
Token ID32217
Feature activation+2.14e-02

Pos logit contributions
 anyone+0.004
 sure+0.004
 up+0.004
 worse+0.004
 closely+0.004
Neg logit contributions
 hill-0.003
 slope-0.003
-0.003
 slopes-0.003
field-0.003
Token was
Token ID373
Feature activation+1.60e-02

Pos logit contributions
 anyone+0.034
 sure+0.032
 up+0.030
 closely+0.028
 worse+0.028
Neg logit contributions
 hill-0.036
 slope-0.036
-0.035
 slopes-0.035
field-0.034
Token not
Token ID407
Feature activation+8.31e-03

Pos logit contributions
 anyone+0.021
 sure+0.018
ust+0.017
 worse+0.016
ares+0.016
Neg logit contributions
 hill-0.029
field-0.029
 hallway-0.029
 evenings-0.029
thank-0.029
Token ugly
Token ID13400
Feature activation-2.26e-02

Pos logit contributions
 anyone+0.010
 sure+0.008
 up+0.008
 closely+0.008
 worse+0.007
Neg logit contributions
 hill-0.017
thank-0.016
forest-0.016
field-0.016
 slope-0.016
Token They
Token ID1119
Feature activation+1.41e-02

Pos logit contributions
 hill+0.007
+0.007
 slope+0.007
 slopes+0.006
field+0.006
Neg logit contributions
 anyone-0.007
 sure-0.007
 up-0.006
 worse-0.006
 closely-0.006
Token come
Token ID1282
Feature activation+1.36e-02

Pos logit contributions
 anyone+0.025
 sure+0.024
 up+0.022
 closely+0.021
 worse+0.021
Neg logit contributions
 hill-0.021
 slope-0.020
 slopes-0.020
-0.020
field-0.020
Token out
Token ID503
Feature activation-1.49e-02

Pos logit contributions
 anyone+0.018
 sure+0.016
 awful+0.015
 worse+0.015
 unfair+0.014
Neg logit contributions
 hill-0.027
field-0.025
 slopes-0.025
 slope-0.025
-0.024
Token in
Token ID287
Feature activation-3.30e-02

Pos logit contributions
+0.031
 slope+0.030
robat+0.030
rob+0.030
 hill+0.029
Neg logit contributions
 sure-0.020
 up-0.019
 anyone-0.019
 closely-0.017
 about-0.017
Token the
Token ID262
Feature activation-4.16e-02

Pos logit contributions
 slope+0.073
rob+0.073
+0.072
robat+0.072
akes+0.072
Neg logit contributions
 anyone-0.037
 sure-0.035
 up-0.034
 what-0.030
 okay-0.028
Token dark
Token ID3223
Feature activation-9.78e-02

Pos logit contributions
 hill+0.040
 slope+0.040
+0.040
 slopes+0.038
field+0.038
Neg logit contributions
 anyone-0.095
 sure-0.090
 up-0.088
 worse-0.083
 closely-0.083
Token and
Token ID290
Feature activation+1.38e-04

Pos logit contributions
+0.211
ursion+0.170
thank+0.168
packing+0.166
robat+0.163
Neg logit contributions
 up-0.146
 anyone-0.144
 about-0.135
's-0.127
 closely-0.124
Token use
Token ID779
Feature activation+1.08e-02

Pos logit contributions
 anyone+0.000
 sure+0.000
 up+0.000
ust+0.000
 closely+0.000
Neg logit contributions
 hill-0.000
 slope-0.000
field-0.000
 slopes-0.000
forest-0.000
Token their
Token ID511
Feature activation+9.29e-03

Pos logit contributions
 anyone+0.013
 sure+0.012
 up+0.011
 worse+0.011
ust+0.011
Neg logit contributions
 hill-0.021
 slopes-0.020
 slope-0.020
field-0.020
 evenings-0.020
Token big
Token ID1263
Feature activation-1.94e-02

Pos logit contributions
 anyone+0.015
 sure+0.015
 up+0.013
 okay+0.013
 closely+0.013
Neg logit contributions
 hill-0.015
 slope-0.015
-0.015
 slopes-0.015
field-0.015
Token eyes
Token ID2951
Feature activation+1.34e-02

Pos logit contributions
+0.035
thank+0.034
akes+0.034
Now+0.034
 slope+0.034
Neg logit contributions
 anyone-0.030
 sure-0.028
 up-0.027
 okay-0.025
 closely-0.024
Token,
Token ID11
Feature activation-5.10e-04

Pos logit contributions
 anyone+0.018
 sure+0.017
 up+0.017
 closely+0.016
 worse+0.015
Neg logit contributions
 hill-0.014
 slope-0.014
field-0.013
-0.013
 slopes-0.013
Token and
Token ID290
Feature activation+6.79e-03

Pos logit contributions
+0.001
field+0.001
 slope+0.001
 hill+0.001
akes+0.001
Neg logit contributions
 anyone-0.001
 sure-0.001
 up-0.001
 closely-0.001
 okay-0.001
Token skip
Token ID14267
Feature activation-1.83e-02

Pos logit contributions
 anyone+0.012
 sure+0.012
 up+0.011
 closely+0.011
 worse+0.011
Neg logit contributions
 hill-0.010
 slope-0.010
 slopes-0.009
-0.009
field-0.009
Token in
Token ID287
Feature activation-2.14e-02

Pos logit contributions
field+0.038
+0.037
ursion+0.037
 hill+0.036
 hallway+0.036
Neg logit contributions
 up-0.022
 anyone-0.022
 sure-0.020
 closely-0.018
 about-0.017
Token the
Token ID262
Feature activation-3.45e-02

Pos logit contributions
 hill+0.043
 slope+0.042
+0.042
field+0.042
 slopes+0.042
Neg logit contributions
 anyone-0.027
 sure-0.025
 up-0.023
 closely-0.021
 worse-0.021
Token sunny
Token ID27737
Feature activation-9.76e-02

Pos logit contributions
thank+0.051
field+0.051
+0.050
 slope+0.050
robat+0.050
Neg logit contributions
 anyone-0.061
 up-0.058
 sure-0.057
 awful-0.053
 worse-0.053
Token me
Token ID502
Feature activation-2.18e-02

Pos logit contributions
ursion+0.184
use+0.180
packing+0.175
opter+0.174
leness+0.173
Neg logit contributions
 up-0.151
 a-0.124
 about-0.121
 all-0.120
's-0.119
Tokenadow
Token ID4584
Feature activation-2.35e-02

Pos logit contributions
 hill+0.030
 slope+0.030
+0.028
 slopes+0.028
field+0.028
Neg logit contributions
 anyone-0.042
 sure-0.042
 up-0.039
 okay-0.038
 about-0.037
Token.
Token ID13
Feature activation-5.21e-03

Pos logit contributions
 hill+0.046
 slope+0.045
+0.045
robat+0.044
ursion+0.044
Neg logit contributions
 anyone-0.031
 sure-0.029
 up-0.029
 okay-0.025
 closely-0.025
Token
Token ID198
Feature activation+7.79e-03

Pos logit contributions
 hill+0.008
 slope+0.008
 slopes+0.008
+0.008
field+0.008
Neg logit contributions
 anyone-0.009
 sure-0.008
 up-0.008
 closely-0.007
 worse-0.007
Token
Token ID198
Feature activation-3.60e-03

Pos logit contributions
 anyone+0.015
 sure+0.014
 worse+0.013
 up+0.013
 ok+0.013
Neg logit contributions
 hill-0.011
 slope-0.010
-0.010
field-0.010
 slopes-0.010
Token saw
Token ID2497
Feature activation+3.68e-02

Pos logit contributions
 anyone+0.001
 sure+0.001
 up+0.001
 worse+0.001
 okay+0.001
Neg logit contributions
 hill-0.001
 slope-0.001
field-0.001
 slopes-0.001
-0.001
Token a
Token ID257
Feature activation+1.63e-02

Pos logit contributions
ust+0.051
ares+0.047
 ok+0.042
 closely+0.041
 okay+0.041
Neg logit contributions
 hill-0.070
 slopes-0.067
 slope-0.063
 sandbox-0.062
loo-0.061
Token jet
Token ID12644
Feature activation-3.03e-03

Pos logit contributions
 anyone+0.026
 sure+0.025
 up+0.022
 worse+0.021
 closely+0.021
Neg logit contributions
 hill-0.029
 slope-0.027
field-0.026
-0.026
 hallway-0.026
Token in
Token ID287
Feature activation-3.72e-02

Pos logit contributions
 hill+0.006
 slope+0.005
 slopes+0.005
+0.005
field+0.005
Neg logit contributions
 anyone-0.004
 sure-0.004
 up-0.004
 closely-0.004
 worse-0.004
Token the
Token ID262
Feature activation-3.43e-02

Pos logit contributions
 hill+0.074
 slope+0.071
 slopes+0.070
field+0.070
+0.069
Neg logit contributions
 anyone-0.049
 sure-0.046
 up-0.042
 okay-0.040
 worse-0.040
Token deep
Token ID2769
Feature activation-9.74e-02

Pos logit contributions
+0.037
 slope+0.036
 slopes+0.035
 hill+0.035
akes+0.035
Neg logit contributions
 anyone-0.075
 up-0.071
 sure-0.070
 worse-0.066
 closely-0.065
Token sky
Token ID6766
Feature activation-4.44e-02

Pos logit contributions
+0.181
rob+0.171
robat+0.170
ugg+0.168
akes+0.166
Neg logit contributions
 up-0.163
 anyone-0.150
 closely-0.146
 strange-0.129
's-0.128
Token.
Token ID13
Feature activation-7.81e-03

Pos logit contributions
robat+0.098
ursion+0.097
rob+0.097
+0.097
 slope+0.096
Neg logit contributions
 up-0.049
 anyone-0.047
 sure-0.044
 about-0.043
 just-0.040
Token The
Token ID383
Feature activation+2.15e-03

Pos logit contributions
 hill+0.012
 slope+0.012
 slopes+0.012
+0.012
field+0.012
Neg logit contributions
 anyone-0.013
 sure-0.013
 up-0.012
 closely-0.011
 worse-0.011
Token little
Token ID1310
Feature activation-3.21e-02

Pos logit contributions
 anyone+0.004
 sure+0.003
 up+0.003
 worse+0.003
 closely+0.003
Neg logit contributions
 hill-0.003
 slope-0.003
 slopes-0.003
-0.003
field-0.003
Token boy
Token ID2933
Feature activation+1.39e-02

Pos logit contributions
+0.055
field+0.054
akes+0.052
thank+0.052
forest+0.052
Neg logit contributions
 anyone-0.051
 up-0.048
 sure-0.047
 worse-0.043
 okay-0.042
Token Mom
Token ID11254
Feature activation+1.92e-03

Pos logit contributions
 hill+0.005
 slope+0.004
+0.004
 slopes+0.004
 evenings+0.004
Neg logit contributions
 anyone-0.003
 sure-0.003
 up-0.002
 closely-0.002
 worse-0.002
Token asks
Token ID7893
Feature activation-5.03e-03

Pos logit contributions
 anyone+0.004
 sure+0.003
 okay+0.003
ust+0.003
 ok+0.003
Neg logit contributions
 hill-0.003
forest-0.003
 hallway-0.003
 slope-0.003
 slopes-0.003
Token.
Token ID13
Feature activation+4.00e-03

Pos logit contributions
 hill+0.006
+0.006
 slopes+0.006
 slope+0.006
 evenings+0.005
Neg logit contributions
 anyone-0.009
 sure-0.009
ust-0.009
 awful-0.008
amus-0.008
Token "
Token ID366
Feature activation-2.13e-03

Pos logit contributions
 anyone+0.006
 sure+0.005
 up+0.005
 okay+0.005
 worse+0.005
Neg logit contributions
 hill-0.007
 slope-0.007
 slopes-0.007
field-0.007
-0.007
TokenWhy
Token ID5195
Feature activation-1.09e-03

Pos logit contributions
 hill+0.004
 slope+0.004
 slopes+0.004
+0.004
field+0.004
Neg logit contributions
 anyone-0.003
 sure-0.003
 up-0.003
 worse-0.002
 closely-0.002
Token are
Token ID389
Feature activation+5.71e-02

Pos logit contributions
 hill+0.001
+0.001
 slope+0.001
akes+0.001
 slopes+0.001
Neg logit contributions
 anyone-0.002
 sure-0.002
 up-0.002
 closely-0.002
 worse-0.002
Token you
Token ID345
Feature activation+1.47e-02

Pos logit contributions
 closely+0.112
 sure+0.111
ust+0.110
 clearly+0.110
 ok+0.110
Neg logit contributions
 hill-0.056
 recess-0.050
 slopes-0.047
stairs-0.046
 Cookies-0.046
Token fighting
Token ID4330
Feature activation+1.78e-02

Pos logit contributions
 anyone+0.022
 closely+0.019
 sure+0.019
ust+0.018
 anybody+0.018
Neg logit contributions
 hill-0.025
-0.024
 hallway-0.023
 slope-0.023
 street-0.023
Token?"
Token ID1701
Feature activation-4.46e-03

Pos logit contributions
 anyone+0.031
 sure+0.029
 up+0.027
 worse+0.026
 okay+0.026
Neg logit contributions
 hill-0.028
 slope-0.027
-0.027
 slopes-0.027
field-0.027
Token
Token ID198
Feature activation+1.25e-02

Pos logit contributions
 hill+0.007
 slope+0.007
+0.007
 slopes+0.007
field+0.006
Neg logit contributions
 anyone-0.008
 sure-0.007
 up-0.007
 worse-0.007
 closely-0.007
Token
Token ID198
Feature activation-5.31e-04

Pos logit contributions
 anyone+0.022
 sure+0.021
 up+0.020
 worse+0.019
 closely+0.019
Neg logit contributions
 hill-0.019
 slope-0.018
-0.018
 slopes-0.018
field-0.017
Token bou
Token ID35833
Feature activation-2.51e-03

Pos logit contributions
 anyone+0.016
 sure+0.015
 up+0.014
 worse+0.013
 okay+0.013
Neg logit contributions
 hill-0.014
 slope-0.013
 slopes-0.013
field-0.013
-0.013
Tokenquet
Token ID21108
Feature activation-1.51e-02

Pos logit contributions
 hill+0.003
+0.002
 slope+0.002
 slopes+0.002
thank+0.002
Neg logit contributions
 anyone-0.006
 sure-0.005
 up-0.005
 worse-0.005
 closely-0.005
Token.
Token ID13
Feature activation-2.48e-03

Pos logit contributions
 slope+0.033
 slopes+0.032
rob+0.032
+0.032
 evenings+0.032
Neg logit contributions
 up-0.017
 sure-0.016
 anyone-0.016
 about-0.015
 closely-0.013
Token They
Token ID1119
Feature activation+1.65e-04

Pos logit contributions
 hill+0.004
 slope+0.003
 slopes+0.003
field+0.003
+0.003
Neg logit contributions
 anyone-0.005
 sure-0.004
 up-0.004
 okay-0.004
 closely-0.004
Token give
Token ID1577
Feature activation+3.06e-02

Pos logit contributions
 anyone+0.000
 sure+0.000
 up+0.000
 closely+0.000
 okay+0.000
Neg logit contributions
 slope-0.000
 slopes-0.000
 hill-0.000
-0.000
field-0.000
Token it
Token ID340
Feature activation+5.18e-02

Pos logit contributions
ust+0.051
iqu+0.041
ares+0.040
 awful+0.040
ight+0.039
Neg logit contributions
 evenings-0.048
 days-0.048
 hill-0.048
 recess-0.048
 morning-0.046
Token to
Token ID284
Feature activation+7.04e-03

Pos logit contributions
 anyone+0.079
ust+0.076
 sure+0.073
 closely+0.071
 worse+0.070
Neg logit contributions
 hill-0.081
 season-0.075
 slopes-0.075
thank-0.075
ite-0.075
Token Tom
Token ID4186
Feature activation+5.33e-02

Pos logit contributions
ust+0.009
 sure+0.009
 up+0.009
 anyone+0.009
iqu+0.009
Neg logit contributions
 hill-0.012
 recess-0.012
 slopes-0.012
-0.012
field-0.012
Token.
Token ID13
Feature activation-4.98e-03

Pos logit contributions
 anyone+0.106
 sure+0.095
 anybody+0.091
 awful+0.091
ust+0.090
Neg logit contributions
 hill-0.068
forest-0.063
field-0.063
thank-0.062
-0.061
Token They
Token ID1119
Feature activation+3.81e-03

Pos logit contributions
 hill+0.009
 slope+0.008
field+0.008
 slopes+0.008
robat+0.008
Neg logit contributions
 anyone-0.008
 sure-0.007
 up-0.007
 okay-0.006
 closely-0.006
Token say
Token ID910
Feature activation+2.38e-02

Pos logit contributions
 anyone+0.006
 sure+0.006
 up+0.006
 closely+0.005
 okay+0.005
Neg logit contributions
 slope-0.006
 hill-0.006
 slopes-0.006
-0.006
field-0.006
Token.
Token ID13
Feature activation+4.52e-04

Pos logit contributions
 anyone+0.024
 sure+0.022
 up+0.022
 closely+0.020
 okay+0.019
Neg logit contributions
 slope-0.027
 hill-0.027
 slopes-0.027
-0.026
field-0.026
Token Lily
Token ID20037
Feature activation+1.59e-02

Pos logit contributions
 anyone+0.001
 sure+0.001
 up+0.001
 okay+0.001
 closely+0.001
Neg logit contributions
 hill-0.001
 slope-0.001
 slopes-0.001
field-0.001
-0.001
Token agreed
Token ID4987
Feature activation+1.40e-02

Pos logit contributions
 sure+0.023
 anyone+0.022
 up+0.022
 closely+0.020
 about+0.019
Neg logit contributions
 slope-0.029
-0.029
 hill-0.029
 slopes-0.029
field-0.028
Token and
Token ID290
Feature activation-1.93e-02

Pos logit contributions
 anyone+0.025
 sure+0.023
 up+0.022
 worse+0.021
ust+0.021
Neg logit contributions
 hill-0.021
 slope-0.020
 slopes-0.019
-0.019
field-0.019
Token handed
Token ID10158
Feature activation+4.33e-02

Pos logit contributions
 hill+0.031
 slope+0.030
 slopes+0.030
+0.030
field+0.029
Neg logit contributions
 anyone-0.032
 sure-0.031
 up-0.029
 okay-0.027
 worse-0.027
Token Ros
Token ID10018
Feature activation+8.04e-02

Pos logit contributions
ust+0.090
iqu+0.075
 awful+0.073
 unfair+0.072
 ok+0.072
Neg logit contributions
 morning-0.053
 recess-0.052
 hill-0.052
 evenings-0.051
 afternoon-0.049
Tokenie
Token ID494
Feature activation+7.59e-02

Pos logit contributions
 anyone+0.164
 sure+0.159
 okay+0.156
 awful+0.153
 unfair+0.153
Neg logit contributions
cy-0.071
ite-0.071
 hill-0.070
akes-0.070
robat-0.069
Token over
Token ID625
Feature activation+8.82e-03

Pos logit contributions
ust+0.136
 ok+0.133
 okay+0.132
 awful+0.126
iqu+0.126
Neg logit contributions
 hill-0.085
ossible-0.082
 distance-0.082
ite-0.082
 recess-0.080
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
 anyone+0.014
 sure+0.014
ust+0.013
 awful+0.013
 worse+0.012
Neg logit contributions
-0.014
 hill-0.014
thank-0.013
 slope-0.013
 slopes-0.013
Token While
Token ID2893
Feature activation-2.54e-02

Pos logit contributions
 anyone+0.018
 sure+0.017
 up+0.016
 worse+0.016
 closely+0.015
Neg logit contributions
 hill-0.015
-0.015
 slope-0.014
 slopes-0.014
field-0.014
Token her
Token ID607
Feature activation+1.16e-02

Pos logit contributions
 hill+0.045
 slope+0.044
 slopes+0.043
field+0.043
akes+0.043
Neg logit contributions
 anyone-0.039
 sure-0.036
 up-0.034
 okay-0.032
 worse-0.031
Token,
Token ID11
Feature activation+5.68e-03

Pos logit contributions
 hill+0.017
 slope+0.016
+0.016
 slopes+0.016
field+0.016
Neg logit contributions
 anyone-0.017
 sure-0.016
 up-0.015
 worse-0.014
 closely-0.014
Token all
Token ID477
Feature activation-1.59e-02

Pos logit contributions
 anyone+0.011
 sure+0.010
 up+0.010
 closely+0.009
 worse+0.009
Neg logit contributions
 hill-0.008
 slope-0.007
 slopes-0.007
field-0.007
-0.007
Token because
Token ID780
Feature activation-5.33e-03

Pos logit contributions
 hill+0.030
 slope+0.028
 slopes+0.028
field+0.028
 hallway+0.027
Neg logit contributions
 anyone-0.023
 sure-0.021
 up-0.019
 worse-0.019
 closely-0.019
Token Max
Token ID5436
Feature activation-2.37e-04

Pos logit contributions
 hill+0.006
loo+0.005
 street+0.005
 evenings+0.005
 season+0.005
Neg logit contributions
 sure-0.011
 anyone-0.011
ust-0.010
amus-0.010
iqu-0.010
Token brought
Token ID3181
Feature activation+1.98e-02

Pos logit contributions
 slope+0.000
 hill+0.000
 slopes+0.000
+0.000
 hallway+0.000
Neg logit contributions
 anyone-0.000
 sure-0.000
 up-0.000
 okay-0.000
 closely-0.000
Token them
Token ID606
Feature activation+5.40e-02

Pos logit contributions
iqu+0.027
ust+0.027
ares+0.027
 anyone+0.025
 sure+0.025
Neg logit contributions
 evenings-0.033
 hill-0.031
 days-0.031
 afternoon-0.031
 slopes-0.030
Token together
Token ID1978
Feature activation+5.92e-03

Pos logit contributions
 anyone+0.079
 sure+0.073
 awful+0.067
 worse+0.066
 closely+0.065
Neg logit contributions
 hill-0.095
 slopes-0.089
thank-0.088
field-0.088
 distance-0.086
Token.
Token ID13
Feature activation-7.12e-03

Pos logit contributions
 anyone+0.008
 sure+0.008
 up+0.007
 closely+0.007
 okay+0.007
Neg logit contributions
 hill-0.011
 slope-0.011
-0.011
 slopes-0.011
 hallway-0.011
Token<|endoftext|>
Token ID50256
Feature activation-1.40e-02

Pos logit contributions
 hill+0.011
 slope+0.011
field+0.011
 slopes+0.011
+0.010
Neg logit contributions
 anyone-0.012
 sure-0.012
 up-0.011
 okay-0.010
 closely-0.010
TokenOne
Token ID3198
Feature activation-3.29e-02

Pos logit contributions
 hill+0.019
+0.018
 slopes+0.017
 slope+0.017
field+0.017
Neg logit contributions
 anyone-0.027
 sure-0.026
 worse-0.025
ust-0.024
 awful-0.024
Token day
Token ID1110
Feature activation-1.15e-02

Pos logit contributions
 hill+0.022
 slope+0.020
 slopes+0.019
+0.018
field+0.018
Neg logit contributions
 anyone-0.087
 sure-0.084
 up-0.080
 worse-0.078
 closely-0.078
Token it
Token ID340
Feature activation+2.89e-02

Pos logit contributions
ust+0.042
 sure+0.042
 anyone+0.039
 okay+0.039
 worse+0.038
Neg logit contributions
 hill-0.056
-0.052
 slopes-0.052
field-0.052
 slope-0.051
Token.
Token ID13
Feature activation+4.48e-03

Pos logit contributions
 anyone+0.055
 sure+0.050
 unfair+0.049
ust+0.047
 worse+0.047
Neg logit contributions
 hill-0.038
field-0.034
-0.033
 slope-0.033
 slopes-0.032
Token
Token ID198
Feature activation+1.35e-02

Pos logit contributions
 anyone+0.007
 sure+0.007
 up+0.007
 okay+0.006
 closely+0.006
Neg logit contributions
 hill-0.007
 slope-0.007
 slopes-0.007
field-0.007
-0.007
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
 anyone+0.022
 sure+0.021
 up+0.019
 closely+0.018
 okay+0.018
Neg logit contributions
 hill-0.022
 slope-0.022
 slopes-0.021
field-0.021
-0.021
Token"
Token ID1
Feature activation-1.57e-02

Pos logit contributions
 sure+0.018
 anyone+0.018
 up+0.017
 closely+0.016
 okay+0.016
Neg logit contributions
 hill-0.016
field-0.015
 slope-0.015
 slopes-0.015
akes-0.015
TokenLook
Token ID8567
Feature activation+5.24e-02

Pos logit contributions
 hill+0.021
 slope+0.020
+0.020
 slopes+0.020
field+0.020
Neg logit contributions
 anyone-0.030
 sure-0.029
 up-0.028
 worse-0.026
 closely-0.026
Token!
Token ID0
Feature activation+1.14e-02

Pos logit contributions
 worse+0.100
 actually+0.097
 unfair+0.094
 awful+0.093
 anyone+0.093
Neg logit contributions
-0.062
,-0.051
 hill-0.050
ite-0.046
oy-0.045
Token I
Token ID314
Feature activation-1.12e-02

Pos logit contributions
 anyone+0.019
 sure+0.018
 up+0.017
 worse+0.017
 closely+0.016
Neg logit contributions
 hill-0.018
-0.018
 slope-0.017
 slopes-0.017
field-0.017
Token'm
Token ID1101
Feature activation+8.72e-03

Pos logit contributions
 hill+0.016
 slope+0.016
 slopes+0.015
field+0.015
+0.014
Neg logit contributions
 anyone-0.021
 sure-0.021
 up-0.019
 okay-0.018
 closely-0.018
Token holding
Token ID4769
Feature activation+9.83e-03

Pos logit contributions
ust+0.012
 anyone+0.012
 worse+0.011
 closely+0.010
ares+0.010
Neg logit contributions
-0.014
field-0.013
forest-0.013
 summer-0.013
 hill-0.013
Token a
Token ID257
Feature activation+5.90e-03

Pos logit contributions
ust+0.015
 worse+0.014
 anyone+0.014
iqu+0.013
 unfair+0.013
Neg logit contributions
 hill-0.015
-0.015
-0.014
 slopes-0.014
 Now-0.014
Token silence
Token ID9550
Feature activation-1.28e-02

Pos logit contributions
 hill+0.005
 slope+0.005
 slopes+0.005
 street+0.005
 hallway+0.005
Neg logit contributions
 anyone-0.005
 sure-0.005
 up-0.005
 closely-0.004
 okay-0.004
Token anymore
Token ID7471
Feature activation-1.74e-02

Pos logit contributions
 hill+0.023
 slope+0.022
 slopes+0.022
+0.022
field+0.021
Neg logit contributions
 anyone-0.019
 sure-0.018
 up-0.017
 worse-0.016
 closely-0.016
Token.
Token ID13
Feature activation+2.43e-03

Pos logit contributions
 hill+0.028
 slope+0.026
+0.026
field+0.026
 slopes+0.026
Neg logit contributions
 anyone-0.028
 sure-0.027
 up-0.026
 worse-0.024
 closely-0.024
Token<|endoftext|>
Token ID50256
Feature activation-1.86e-02

Pos logit contributions
 anyone+0.004
 sure+0.004
 up+0.004
 worse+0.004
 closely+0.004
Neg logit contributions
 hill-0.004
-0.003
 slope-0.003
 slopes-0.003
field-0.003
TokenOnce
Token ID7454
Feature activation+1.14e-02

Pos logit contributions
 hill+0.027
+0.026
 slope+0.026
 slopes+0.026
field+0.025
Neg logit contributions
 anyone-0.034
 sure-0.032
 up-0.030
 worse-0.029
 closely-0.029
Token upon
Token ID2402
Feature activation+3.30e-02

Pos logit contributions
 anyone+0.019
 sure+0.018
 up+0.016
 worse+0.016
 awful+0.016
Neg logit contributions
 hill-0.018
 slope-0.017
-0.017
field-0.017
 slopes-0.017
Token a
Token ID257
Feature activation-9.57e-03

Pos logit contributions
iqu+0.042
 anyone+0.039
umes+0.039
ares+0.039
 closely+0.037
Neg logit contributions
 hill-0.059
thank-0.058
-0.057
 hallway-0.057
 evenings-0.057
Token time
Token ID640
Feature activation-6.70e-04

Pos logit contributions
 hill+0.016
 slope+0.016
 slopes+0.015
field+0.015
thank+0.015
Neg logit contributions
 anyone-0.015
 sure-0.015
 up-0.014
 okay-0.013
 closely-0.013
Token,
Token ID11
Feature activation+2.07e-02

Pos logit contributions
 hill+0.001
 slope+0.001
field+0.001
 slopes+0.001
thank+0.001
Neg logit contributions
 anyone-0.002
 sure-0.002
 up-0.001
 okay-0.001
 closely-0.001
Token there
Token ID612
Feature activation-2.26e-02

Pos logit contributions
 anyone+0.029
 sure+0.027
 worse+0.025
 awful+0.024
 closely+0.024
Neg logit contributions
 hill-0.038
 slope-0.036
-0.035
 slopes-0.035
 hallway-0.035
Token was
Token ID373
Feature activation+5.34e-02

Pos logit contributions
 hill+0.041
 slope+0.040
 slopes+0.040
+0.039
field+0.039
Neg logit contributions
 anyone-0.033
 sure-0.031
 up-0.029
 closely-0.027
 okay-0.027
TokenSuddenly
Token ID38582
Feature activation-3.17e-02

Pos logit contributions
 anyone+0.014
 sure+0.013
 up+0.012
 worse+0.012
 okay+0.012
Neg logit contributions
 hill-0.011
 slope-0.011
-0.010
field-0.010
 slopes-0.010
Token,
Token ID11
Feature activation-2.71e-03

Pos logit contributions
 hill+0.038
 slope+0.036
+0.036
field+0.035
thank+0.035
Neg logit contributions
 anyone-0.068
 sure-0.064
 up-0.062
 okay-0.059
 worse-0.059
Token an
Token ID281
Feature activation+1.88e-03

Pos logit contributions
 hill+0.005
 slope+0.005
 slopes+0.005
+0.005
field+0.005
Neg logit contributions
 anyone-0.004
 sure-0.003
 up-0.003
 closely-0.003
ust-0.003
Token incredible
Token ID8082
Feature activation-2.27e-02

Pos logit contributions
 anyone+0.003
 sure+0.002
 up+0.002
 okay+0.002
 worse+0.002
Neg logit contributions
 hill-0.004
 slope-0.003
 slopes-0.003
 hallway-0.003
 street-0.003
Token thing
Token ID1517
Feature activation+8.54e-03

Pos logit contributions
+0.035
 hill+0.035
 slope+0.035
akes+0.035
field+0.035
Neg logit contributions
 anyone-0.038
 sure-0.036
 up-0.035
 closely-0.032
 okay-0.032
Token happened
Token ID3022
Feature activation+9.49e-03

Pos logit contributions
 anyone+0.015
 sure+0.014
 up+0.013
 okay+0.013
 closely+0.013
Neg logit contributions
 hill-0.013
 slope-0.012
 slopes-0.012
field-0.012
 hallway-0.012
Token.
Token ID13
Feature activation+5.81e-03

Pos logit contributions
 anyone+0.016
 sure+0.015
 up+0.014
 worse+0.014
 closely+0.013
Neg logit contributions
 hill-0.015
 slope-0.014
 slopes-0.014
field-0.014
-0.014
Token The
Token ID383
Feature activation+8.70e-03

Pos logit contributions
 anyone+0.009
 sure+0.008
 up+0.008
 worse+0.007
 closely+0.007
Neg logit contributions
 hill-0.010
-0.010
 slope-0.010
 slopes-0.010
field-0.009
Token cloud
Token ID6279
Feature activation-5.61e-03

Pos logit contributions
 anyone+0.016
 sure+0.016
 up+0.014
ust+0.014
 okay+0.014
Neg logit contributions
 hill-0.013
 slope-0.012
 slopes-0.012
 hallway-0.012
field-0.012
Token started
Token ID2067
Feature activation-6.95e-03

Pos logit contributions
 hill+0.008
 slope+0.008
 slopes+0.008
field+0.008
+0.008
Neg logit contributions
 anyone-0.010
 sure-0.010
 up-0.009
 okay-0.009
 worse-0.009
Token to
Token ID284
Feature activation-2.42e-02

Pos logit contributions
 hill+0.012
 slope+0.012
 slopes+0.012
field+0.012
+0.012
Neg logit contributions
 anyone-0.010
 sure-0.010
 up-0.009
 okay-0.008
 worse-0.008
Token.
Token ID13
Feature activation-3.43e-03

Pos logit contributions
 anyone+0.001
 sure+0.001
 anybody+0.001
 awful+0.001
 worse+0.001
Neg logit contributions
 hill-0.001
thank-0.001
 slopes-0.001
 distance-0.001
 slope-0.001
Token
Token ID198
Feature activation+1.30e-02

Pos logit contributions
 hill+0.006
 slope+0.005
field+0.005
 slopes+0.005
+0.005
Neg logit contributions
 anyone-0.006
 sure-0.005
 up-0.005
 okay-0.005
 closely-0.005
Token
Token ID198
Feature activation-1.67e-03

Pos logit contributions
 anyone+0.023
 sure+0.022
 up+0.021
 worse+0.020
 okay+0.020
Neg logit contributions
 hill-0.019
 slope-0.018
 slopes-0.018
-0.018
field-0.018
Token"
Token ID1
Feature activation-2.27e-02

Pos logit contributions
field+0.003
akes+0.003
robat+0.003
+0.003
 spine+0.003
Neg logit contributions
 sure-0.002
 anyone-0.002
 what-0.002
 not-0.002
 up-0.002
TokenHere
Token ID4342
Feature activation+2.18e-03

Pos logit contributions
 hill+0.031
 slope+0.030
field+0.030
 slopes+0.029
+0.029
Neg logit contributions
 anyone-0.043
 sure-0.041
 up-0.039
 okay-0.038
 worse-0.038
Token,
Token ID11
Feature activation+4.21e-04

Pos logit contributions
 anyone+0.005
 sure+0.005
 up+0.004
 closely+0.004
 worse+0.004
Neg logit contributions
 hill-0.002
 slope-0.002
 slopes-0.002
-0.002
field-0.002
Token Sue
Token ID26089
Feature activation+2.79e-02

Pos logit contributions
 anyone+0.001
 sure+0.001
 closely+0.001
 up+0.001
ust+0.001
Neg logit contributions
 hill-0.001
 slope-0.001
 slopes-0.001
loo-0.001
 street-0.001
Token,
Token ID11
Feature activation+6.00e-03

Pos logit contributions
 sure+0.050
 anyone+0.048
 up+0.048
 what+0.044
 about+0.043
Neg logit contributions
 slope-0.045
-0.045
 slopes-0.043
 hallway-0.043
 hill-0.041
Token this
Token ID428
Feature activation-7.77e-04

Pos logit contributions
 anyone+0.009
 sure+0.008
 up+0.008
ust+0.008
 worse+0.008
Neg logit contributions
 hill-0.010
 slope-0.009
-0.009
 slopes-0.009
field-0.009
Token is
Token ID318
Feature activation+3.88e-02

Pos logit contributions
+0.001
 slope+0.001
thank+0.001
 slopes+0.001
 hill+0.001
Neg logit contributions
 sure-0.001
 anyone-0.001
 up-0.001
 closely-0.001
 what-0.001
Token corn
Token ID11676
Feature activation-5.59e-03

Pos logit contributions
 anyone+0.052
 sure+0.045
ust+0.042
 closely+0.041
 worse+0.041
Neg logit contributions
 hill-0.073
 slope-0.069
field-0.069
 slopes-0.068
thank-0.068
Token "
Token ID366
Feature activation-2.27e-02

Pos logit contributions
 hill+0.004
+0.004
 slope+0.004
 slopes+0.004
field+0.004
Neg logit contributions
 anyone-0.004
 sure-0.003
 up-0.003
ust-0.003
 worse-0.003
TokenDon
Token ID3987
Feature activation+2.48e-02

Pos logit contributions
 hill+0.035
 slope+0.033
field+0.033
 slopes+0.033
+0.033
Neg logit contributions
 anyone-0.040
 sure-0.038
 up-0.036
 worse-0.034
 okay-0.034
Token't
Token ID470
Feature activation-5.97e-03

Pos logit contributions
 anyone+0.042
 sure+0.040
 closely+0.038
ust+0.036
 clearly+0.036
Neg logit contributions
-0.037
 hill-0.037
-0.034
ursion-0.034
 slope-0.033
Token worry
Token ID5490
Feature activation+1.20e-02

Pos logit contributions
 hill+0.008
 slope+0.008
 slopes+0.008
forest+0.007
+0.007
Neg logit contributions
 anyone-0.011
 sure-0.011
 up-0.010
 closely-0.010
 worse-0.010
Token,
Token ID11
Feature activation-1.85e-03

Pos logit contributions
 sure+0.032
 anyone+0.032
 closely+0.030
 worse+0.030
amus+0.030
Neg logit contributions
 hill-0.005
 slope-0.004
 slopes-0.003
-0.003
-0.003
Token Lily
Token ID20037
Feature activation+1.17e-02

Pos logit contributions
 hill+0.003
 street+0.003
+0.003
 slope+0.003
forest+0.003
Neg logit contributions
 anyone-0.003
 up-0.003
 sure-0.003
 closely-0.003
ust-0.002
Token.
Token ID13
Feature activation-9.25e-03

Pos logit contributions
 anyone+0.018
 sure+0.017
 up+0.016
 okay+0.015
 worse+0.015
Neg logit contributions
 hill-0.021
 slope-0.021
 slopes-0.020
-0.020
field-0.020
Token You
Token ID921
Feature activation+3.48e-03

Pos logit contributions
 hill+0.015
 slope+0.015
 slopes+0.014
+0.014
field+0.014
Neg logit contributions
 anyone-0.015
 sure-0.014
 up-0.013
 worse-0.013
 closely-0.013
Token can
Token ID460
Feature activation-1.50e-02

Pos logit contributions
 anyone+0.007
 sure+0.006
 up+0.006
 closely+0.006
 worse+0.006
Neg logit contributions
 hill-0.005
-0.004
akes-0.004
field-0.004
forest-0.004
Token try
Token ID1949
Feature activation+1.53e-02

Pos logit contributions
 hill+0.023
+0.022
 slope+0.021
field+0.021
 slopes+0.021
Neg logit contributions
 anyone-0.026
 sure-0.025
 up-0.023
 worse-0.023
 closely-0.023
Token again
Token ID757
Feature activation-2.31e-02

Pos logit contributions
ust+0.023
 sure+0.023
 awful+0.022
 anyone+0.022
iqu+0.021
Neg logit contributions
 hill-0.024
 slopes-0.022
-0.022
 slope-0.021
forest-0.020
Token have
Token ID423
Feature activation+1.25e-02

Pos logit contributions
 anyone+0.022
 sure+0.021
 up+0.020
 closely+0.020
 worse+0.019
Neg logit contributions
 hill-0.015
 slope-0.013
field-0.013
akes-0.013
forest-0.013
Token to
Token ID284
Feature activation-2.83e-02

Pos logit contributions
 anyone+0.016
 sure+0.015
 up+0.014
 closely+0.013
 worse+0.013
Neg logit contributions
 hill-0.024
 slopes-0.023
 slope-0.023
field-0.023
-0.023
Token come
Token ID1282
Feature activation+2.13e-02

Pos logit contributions
 hill+0.046
 slope+0.046
+0.045
 slopes+0.045
akes+0.044
Neg logit contributions
 anyone-0.048
 sure-0.045
 up-0.041
 okay-0.040
 worse-0.039
Token closer
Token ID5699
Feature activation-7.68e-03

Pos logit contributions
 anyone+0.031
 sure+0.029
 awful+0.028
ust+0.026
 unfair+0.026
Neg logit contributions
 hill-0.038
 slopes-0.035
 slope-0.035
field-0.035
 street-0.035
Token and
Token ID290
Feature activation-1.03e-02

Pos logit contributions
+0.014
 hill+0.014
 slope+0.014
 slopes+0.014
field+0.014
Neg logit contributions
 anyone-0.011
 sure-0.010
 up-0.010
 closely-0.009
 okay-0.009
Token listen
Token ID6004
Feature activation+2.74e-02

Pos logit contributions
 hill+0.014
field+0.013
 slope+0.013
 slopes+0.012
 street+0.012
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.018
 closely-0.017
ust-0.017
Token.
Token ID13
Feature activation-1.90e-03

Pos logit contributions
 anyone+0.047
 sure+0.043
iqu+0.041
 anybody+0.040
ust+0.040
Neg logit contributions
 hill-0.038
 slopes-0.037
 slope-0.036
 evenings-0.036
thank-0.035
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
 hill+0.003
 slopes+0.002
 slope+0.002
+0.002
field+0.002
Neg logit contributions
 anyone-0.004
 sure-0.003
 up-0.003
 worse-0.003
 closely-0.003
Token
Token ID198
Feature activation-5.30e-03

Pos logit contributions
 anyone+0.022
 sure+0.020
 up+0.019
 ok+0.019
 okay+0.019
Neg logit contributions
 hill-0.015
 slopes-0.014
 slope-0.014
field-0.014
-0.014
TokenL
Token ID43
Feature activation+8.18e-03

Pos logit contributions
 hill+0.009
+0.009
field+0.009
 slope+0.009
forest+0.008
Neg logit contributions
 anyone-0.008
 sure-0.008
 up-0.007
 worse-0.007
 closely-0.007
Tokenila
Token ID10102
Feature activation+8.88e-03

Pos logit contributions
 anyone+0.016
 sure+0.015
 up+0.015
 worse+0.014
 okay+0.014
Neg logit contributions
 hill-0.011
 slope-0.010
 slopes-0.010
akes-0.010
-0.010
Token cannot
Token ID2314
Feature activation-1.85e-02

Pos logit contributions
 anyone+0.030
 sure+0.029
 up+0.028
 closely+0.027
 worse+0.026
Neg logit contributions
 hill-0.023
 slope-0.021
-0.021
field-0.021
 slopes-0.020
Token play
Token ID711
Feature activation+6.75e-03

Pos logit contributions
 hill+0.027
 slope+0.027
field+0.026
 slopes+0.026
+0.026
Neg logit contributions
 anyone-0.034
 sure-0.032
 up-0.030
 okay-0.029
 closely-0.028
Token with
Token ID351
Feature activation+2.64e-02

Pos logit contributions
iqu+0.013
 sure+0.012
ust+0.012
 anyone+0.012
 ok+0.011
Neg logit contributions
 hill-0.009
-0.008
 street-0.008
robat-0.008
 seek-0.008
Token them
Token ID606
Feature activation+3.67e-02

Pos logit contributions
 sure+0.041
ust+0.037
 ok+0.037
iqu+0.036
 okay+0.036
Neg logit contributions
 hill-0.044
 slopes-0.040
 slope-0.039
 street-0.039
-0.037
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
 sure+0.067
 anyone+0.066
iqu+0.064
 awful+0.060
 worse+0.059
Neg logit contributions
 hill-0.050
 slope-0.044
akes-0.043
 slopes-0.042
robat-0.041
Token Go
Token ID1514
Feature activation-1.50e-02

Pos logit contributions
 sure+0.024
 anyone+0.024
 up+0.022
 worse+0.022
 closely+0.022
Neg logit contributions
 hill-0.015
-0.015
 slope-0.014
forest-0.014
 street-0.014
Token away
Token ID1497
Feature activation-1.26e-02

Pos logit contributions
 hill+0.030
+0.030
 slope+0.030
field+0.030
 slopes+0.030
Neg logit contributions
 anyone-0.019
 up-0.019
 sure-0.017
 okay-0.015
 just-0.014
Token!"
Token ID2474
Feature activation-3.98e-03

Pos logit contributions
+0.024
ursion+0.023
 evenings+0.023
robat+0.022
akes+0.022
Neg logit contributions
 anyone-0.020
 up-0.019
 sure-0.018
 what-0.016
 about-0.016
Token Sam
Token ID3409
Feature activation+1.97e-02

Pos logit contributions
 hill+0.007
 slope+0.007
field+0.007
 slopes+0.007
+0.007
Neg logit contributions
 anyone-0.006
 sure-0.006
 up-0.005
 okay-0.005
 worse-0.005
Token felt
Token ID2936
Feature activation+2.93e-02

Pos logit contributions
 anyone+0.037
 sure+0.034
 up+0.032
 closely+0.031
ust+0.031
Neg logit contributions
 hill-0.027
 slope-0.026
-0.025
thank-0.025
 slopes-0.025
Token sad
Token ID6507
Feature activation-1.71e-02

Pos logit contributions
ust+0.040
amus+0.034
iqu+0.033
umes+0.033
ight+0.031
Neg logit contributions
 distance-0.051
 evenings-0.050
 street-0.049
-0.048
thank-0.047
Token!
Token ID0
Feature activation+7.44e-03

Pos logit contributions
 sure+0.022
 anyone+0.021
 up+0.019
ust+0.019
 worse+0.019
Neg logit contributions
 hill-0.061
 slopes-0.059
 slope-0.057
field-0.056
 evenings-0.056
Token
Token ID198
Feature activation+1.29e-02

Pos logit contributions
 anyone+0.014
 sure+0.013
 up+0.013
 worse+0.013
 closely+0.013
Neg logit contributions
 hill-0.010
-0.010
 slope-0.009
 slopes-0.009
field-0.009
Token
Token ID198
Feature activation+1.09e-02

Pos logit contributions
 anyone+0.026
 up+0.024
 sure+0.024
 worse+0.023
 awful+0.022
Neg logit contributions
 hill-0.016
-0.015
 slopes-0.015
 slope-0.014
field-0.014
TokenHe
Token ID1544
Feature activation-9.30e-03

Pos logit contributions
 anyone+0.021
 up+0.020
 sure+0.020
 closely+0.018
 worse+0.018
Neg logit contributions
 hill-0.015
 slope-0.014
 slopes-0.014
-0.014
Now-0.013
Token was
Token ID373
Feature activation+2.43e-03

Pos logit contributions
 hill+0.013
 slope+0.012
field+0.012
 slopes+0.012
 hallway+0.012
Neg logit contributions
 anyone-0.017
 sure-0.016
 up-0.015
 okay-0.015
 worse-0.015
Token so
Token ID523
Feature activation-5.24e-03

Pos logit contributions
 anyone+0.003
ust+0.002
ares+0.002
 worse+0.002
iqu+0.002
Neg logit contributions
 hallway-0.005
field-0.004
 hill-0.004
 evenings-0.004
 distance-0.004
Token sad
Token ID6507
Feature activation-2.45e-02

Pos logit contributions
 hill+0.011
 slope+0.011
 slopes+0.011
+0.011
field+0.010
Neg logit contributions
 anyone-0.006
 sure-0.005
 up-0.005
 worse-0.005
 closely-0.005
Token that
Token ID326
Feature activation+2.86e-03

Pos logit contributions
 hill+0.042
 slope+0.041
+0.041
field+0.041
robat+0.040
Neg logit contributions
 anyone-0.039
 sure-0.037
 up-0.035
 closely-0.033
 worse-0.032
Token he
Token ID339
Feature activation+8.45e-03

Pos logit contributions
 anyone+0.005
 sure+0.005
 up+0.005
ust+0.004
 worse+0.004
Neg logit contributions
 hill-0.004
 slope-0.004
field-0.004
 slopes-0.004
 hallway-0.004
Token thought
Token ID1807
Feature activation+1.07e-02

Pos logit contributions
 anyone+0.015
 sure+0.014
 up+0.013
 okay+0.013
 worse+0.013
Neg logit contributions
 hill-0.012
 slope-0.012
field-0.012
 slopes-0.012
 hallway-0.012
Token he
Token ID339
Feature activation+2.77e-03

Pos logit contributions
 anyone+0.016
 sure+0.015
 up+0.014
 worse+0.014
ust+0.014
Neg logit contributions
 hill-0.018
 slope-0.017
 slopes-0.017
 hallway-0.017
field-0.017
Token Tom
Token ID4186
Feature activation+2.88e-02

Pos logit contributions
 hill+0.002
 street+0.002
 mountain+0.002
field+0.002
 hallway+0.002
Neg logit contributions
 anyone-0.001
 sure-0.001
ust-0.001
 up-0.001
iqu-0.001
Token.
Token ID13
Feature activation-6.68e-03

Pos logit contributions
 anyone+0.047
 sure+0.045
 up+0.042
 okay+0.040
 worse+0.039
Neg logit contributions
 hill-0.048
 slope-0.048
-0.047
 slopes-0.047
field-0.046
Token You
Token ID921
Feature activation-1.05e-03

Pos logit contributions
 hill+0.009
 slope+0.009
+0.008
 slopes+0.008
field+0.008
Neg logit contributions
 anyone-0.013
 sure-0.012
 up-0.012
 closely-0.011
 worse-0.011
Token did
Token ID750
Feature activation+3.47e-02

Pos logit contributions
 hill+0.001
+0.001
field+0.001
 slope+0.001
akes+0.001
Neg logit contributions
 anyone-0.002
 sure-0.002
 up-0.002
 closely-0.002
 worse-0.002
Token not
Token ID407
Feature activation-1.17e-02

Pos logit contributions
idget+0.045
ust+0.043
ares+0.042
iqu+0.042
rily+0.038
Neg logit contributions
-0.064
 seek-0.059
 hill-0.058
 recess-0.057
 job-0.055
Token win
Token ID1592
Feature activation-1.42e-02

Pos logit contributions
 hill+0.019
 slope+0.018
 slopes+0.018
field+0.018
+0.018
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.017
 closely-0.017
 worse-0.016
Token the
Token ID262
Feature activation-2.73e-03

Pos logit contributions
 hill+0.023
 slopes+0.022
 slope+0.022
thank+0.021
field+0.021
Neg logit contributions
 anyone-0.021
 sure-0.021
 up-0.019
 closely-0.019
ust-0.019
Token prize
Token ID11596
Feature activation-7.29e-03

Pos logit contributions
 hill+0.004
 slope+0.003
thank+0.003
 street+0.003
field+0.003
Neg logit contributions
 anyone-0.006
 sure-0.005
 up-0.005
ust-0.005
 worse-0.005
Token."
Token ID526
Feature activation+6.25e-05

Pos logit contributions
 hill+0.012
 slope+0.012
 slopes+0.012
+0.012
field+0.012
Neg logit contributions
 anyone-0.011
 sure-0.011
 up-0.010
 worse-0.010
 closely-0.010
Token Tom
Token ID4186
Feature activation+8.08e-03

Pos logit contributions
 anyone+0.000
 sure+0.000
 up+0.000
 okay+0.000
 worse+0.000
Neg logit contributions
 hill-0.000
 slope-0.000
field-0.000
 slopes-0.000
-0.000
Token was
Token ID373
Feature activation+2.81e-02

Pos logit contributions
 anyone+0.016
 sure+0.015
 up+0.014
 okay+0.014
 worse+0.014
Neg logit contributions
 hill-0.010
 slope-0.009
 slopes-0.009
field-0.009
-0.009
Token out
Token ID503
Feature activation-3.64e-04

Pos logit contributions
 slope+0.038
robat+0.037
 slopes+0.037
akes+0.037
field+0.037
Neg logit contributions
 up-0.025
 sure-0.025
 anyone-0.023
 what-0.020
 closely-0.020
Token and
Token ID290
Feature activation-6.02e-03

Pos logit contributions
+0.001
 slope+0.001
robat+0.001
 hill+0.001
field+0.001
Neg logit contributions
 sure-0.001
 anyone-0.001
 up-0.001
 about-0.000
 okay-0.000
Token enjoy
Token ID2883
Feature activation+4.31e-03

Pos logit contributions
 hill+0.009
 slope+0.008
+0.008
 slopes+0.008
field+0.008
Neg logit contributions
 anyone-0.011
 sure-0.011
 up-0.010
 closely-0.010
 worse-0.009
Token the
Token ID262
Feature activation-3.13e-03

Pos logit contributions
 anyone+0.006
 sure+0.006
 worse+0.005
ust+0.005
 up+0.005
Neg logit contributions
 hill-0.008
-0.007
 slopes-0.007
 slope-0.007
field-0.007
Token sunny
Token ID27737
Feature activation-7.40e-02

Pos logit contributions
 hill+0.004
 slope+0.004
 slopes+0.004
+0.004
field+0.004
Neg logit contributions
 anyone-0.006
 sure-0.006
 up-0.005
 closely-0.005
 worse-0.005
Token day
Token ID1110
Feature activation-3.55e-02

Pos logit contributions
ursion+0.095
+0.091
rob+0.088
akes+0.084
orted+0.084
Neg logit contributions
 up-0.150
 anyone-0.132
 about-0.126
 sure-0.123
 just-0.122
Token."
Token ID526
Feature activation+9.01e-03

Pos logit contributions
 slope+0.062
 hill+0.062
 slopes+0.060
+0.060
field+0.059
Neg logit contributions
 anyone-0.056
 sure-0.054
 up-0.051
 okay-0.047
 closely-0.047
Token With
Token ID2080
Feature activation-1.09e-02

Pos logit contributions
 anyone+0.013
 sure+0.013
 up+0.011
 closely+0.011
 about+0.010
Neg logit contributions
 slope-0.017
 hill-0.017
thank-0.016
 slopes-0.016
robat-0.016
Token a
Token ID257
Feature activation-1.74e-02

Pos logit contributions
 hill+0.023
field+0.022
 evenings+0.022
+0.022
 slopes+0.022
Neg logit contributions
 anyone-0.012
 sure-0.011
 up-0.010
ust-0.010
 okay-0.009
Token smile
Token ID8212
Feature activation-2.34e-02

Pos logit contributions
akes+0.030
 slope+0.029
 slopes+0.029
forest+0.029
rob+0.028
Neg logit contributions
 sure-0.028
 anyone-0.026
 up-0.025
 okay-0.025
 worse-0.024
Token on
Token ID319
Feature activation-2.44e-02

Pos logit contributions
 hill+0.034
 slope+0.033
 slopes+0.033
+0.032
akes+0.032
Neg logit contributions
 sure-0.045
 anyone-0.043
 up-0.042
 okay-0.041
 about-0.039
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
ust+0.040
iqu+0.038
 awful+0.036
ares+0.035
 exactly+0.035
Neg logit contributions
 street-0.041
 hill-0.040
 season-0.038
 evenings-0.037
 session-0.037
Token people
Token ID661
Feature activation+2.60e-02

Pos logit contributions
 anyone+0.015
 sure+0.014
 up+0.013
 closely+0.013
 worse+0.012
Neg logit contributions
 hill-0.024
 slope-0.023
 slopes-0.023
 hallway-0.022
 street-0.022
Token in
Token ID287
Feature activation-2.07e-02

Pos logit contributions
 anyone+0.038
 sure+0.035
 up+0.033
 worse+0.031
 closely+0.031
Neg logit contributions
 hill-0.048
 slope-0.046
 slopes-0.046
-0.045
field-0.045
Token the
Token ID262
Feature activation-3.60e-02

Pos logit contributions
 hill+0.040
 slope+0.039
field+0.039
 slopes+0.039
+0.039
Neg logit contributions
 anyone-0.028
 sure-0.027
 up-0.024
 okay-0.023
 worse-0.023
Token neighborhood
Token ID6232
Feature activation-1.49e-02

Pos logit contributions
field+0.046
robat+0.044
 slope+0.044
akes+0.044
+0.042
Neg logit contributions
 anyone-0.073
 up-0.069
 sure-0.067
 worse-0.064
 wrong-0.064
Token stay
Token ID2652
Feature activation-1.77e-02

Pos logit contributions
 hill+0.027
 slope+0.026
+0.026
 slopes+0.026
field+0.026
Neg logit contributions
 anyone-0.022
 sure-0.021
 up-0.019
 closely-0.018
 worse-0.018
Token safe
Token ID3338
Feature activation-3.62e-02

Pos logit contributions
 hill+0.031
 slope+0.030
 slopes+0.029
field+0.029
thank+0.029
Neg logit contributions
 anyone-0.027
 sure-0.024
 up-0.022
 worse-0.022
 closely-0.022
Token.
Token ID13
Feature activation-7.62e-03

Pos logit contributions
 hill+0.069
+0.068
 slope+0.068
field+0.067
 slopes+0.066
Neg logit contributions
 anyone-0.050
 sure-0.047
 up-0.045
 worse-0.040
 okay-0.040
Token Julie
Token ID21946
Feature activation-1.87e-03

Pos logit contributions
 hill+0.010
+0.010
 slope+0.010
 slopes+0.010
 hallway+0.010
Neg logit contributions
 anyone-0.015
 sure-0.014
 up-0.013
 worse-0.013
 okay-0.013
Token and
Token ID290
Feature activation-1.87e-03

Pos logit contributions
 slope+0.004
+0.003
 hill+0.003
akes+0.003
 slopes+0.003
Neg logit contributions
 sure-0.003
 anyone-0.003
 up-0.002
 closely-0.002
 just-0.002
Token Max
Token ID5436
Feature activation-1.15e-02

Pos logit contributions
 hill+0.003
 slope+0.003
+0.003
thank+0.003
field+0.003
Neg logit contributions
 anyone-0.003
 sure-0.003
 up-0.002
 closely-0.002
 worse-0.002
Token go
Token ID467
Feature activation-2.40e-02

Pos logit contributions
 hill+0.083
 slope+0.082
 slopes+0.081
+0.080
field+0.080
Neg logit contributions
 anyone-0.071
 sure-0.067
 up-0.062
 okay-0.059
 closely-0.057
Token away
Token ID1497
Feature activation-2.20e-02

Pos logit contributions
 slope+0.049
akes+0.049
robat+0.048
thank+0.048
+0.048
Neg logit contributions
 anyone-0.029
 up-0.029
 sure-0.028
 about-0.023
 closely-0.023
Token for
Token ID329
Feature activation-2.89e-02

Pos logit contributions
robat+0.042
 hill+0.042
 slope+0.042
+0.042
ursion+0.041
Neg logit contributions
 anyone-0.030
 sure-0.028
 up-0.028
 closely-0.025
 okay-0.025
Token a
Token ID257
Feature activation-2.05e-02

Pos logit contributions
 hill+0.068
 slopes+0.066
+0.066
 slope+0.066
 evenings+0.065
Neg logit contributions
 anyone-0.025
 sure-0.022
 up-0.021
ust-0.020
 closely-0.019
Token few
Token ID1178
Feature activation-5.35e-02

Pos logit contributions
 hill+0.024
 slope+0.022
+0.022
field+0.021
 slopes+0.021
Neg logit contributions
 anyone-0.043
 sure-0.042
 up-0.040
 worse-0.038
 closely-0.038
Token days
Token ID1528
Feature activation-5.86e-02

Pos logit contributions
field+0.062
+0.061
 hill+0.061
akes+0.061
 slope+0.060
Neg logit contributions
 anyone-0.117
 sure-0.111
 up-0.104
 closely-0.102
 worse-0.102
Token.
Token ID13
Feature activation-5.54e-03

Pos logit contributions
 hill+0.115
 slope+0.114
robat+0.114
 slopes+0.112
akes+0.112
Neg logit contributions
 anyone-0.076
 sure-0.075
 up-0.070
 about-0.065
 closely-0.064
Token So
Token ID1406
Feature activation-5.70e-02

Pos logit contributions
 hill+0.008
 slope+0.007
 slopes+0.007
field+0.007
+0.007
Neg logit contributions
 anyone-0.011
 sure-0.010
 up-0.010
 closely-0.009
 okay-0.009
Token,
Token ID11
Feature activation-6.91e-03

Pos logit contributions
 slope+0.086
robat+0.086
 hill+0.086
forest+0.085
thank+0.085
Neg logit contributions
 anyone-0.105
 sure-0.102
 up-0.096
 closely-0.090
 okay-0.089
Token Sarah
Token ID10490
Feature activation+9.82e-05

Pos logit contributions
 hill+0.010
field+0.010
 slope+0.010
+0.010
 slopes+0.010
Neg logit contributions
 anyone-0.012
 sure-0.012
 up-0.011
 worse-0.011
 okay-0.011
Token carefully
Token ID7773
Feature activation-3.12e-02

Pos logit contributions
 sure+0.000
 anyone+0.000
 up+0.000
 closely+0.000
 okay+0.000
Neg logit contributions
 hill-0.000
 slope-0.000
-0.000
 slopes-0.000
akes-0.000
Token She
Token ID1375
Feature activation-9.39e-03

Pos logit contributions
 hill+0.004
 slope+0.004
 slopes+0.004
+0.004
field+0.004
Neg logit contributions
 anyone-0.005
 sure-0.004
 up-0.004
 closely-0.004
 okay-0.004
Token kept
Token ID4030
Feature activation-1.42e-02

Pos logit contributions
 hill+0.016
 slope+0.016
 slopes+0.015
+0.015
akes+0.015
Neg logit contributions
 anyone-0.015
 sure-0.015
 up-0.013
 closely-0.013
 okay-0.012
Token it
Token ID340
Feature activation+1.50e-02

Pos logit contributions
 hill+0.024
+0.023
 slopes+0.023
field+0.023
 slope+0.023
Neg logit contributions
 anyone-0.022
ust-0.019
 sure-0.019
 worse-0.018
 awful-0.018
Token warm
Token ID5814
Feature activation-3.90e-02

Pos logit contributions
 anyone+0.019
 sure+0.019
 up+0.018
 okay+0.016
 closely+0.016
Neg logit contributions
 hill-0.029
 slope-0.029
-0.029
 slopes-0.029
field-0.028
Token and
Token ID290
Feature activation-3.10e-02

Pos logit contributions
robat+0.076
ursion+0.076
 hill+0.076
 hallway+0.076
rob+0.076
Neg logit contributions
 up-0.057
 sure-0.050
 anyone-0.050
 okay-0.045
 closely-0.045
Token cozy
Token ID37438
Feature activation-6.16e-02

Pos logit contributions
akes+0.060
 slope+0.060
 hill+0.060
+0.059
robat+0.059
Neg logit contributions
 sure-0.045
 up-0.041
 anyone-0.040
 okay-0.039
 ok-0.036
Token,
Token ID11
Feature activation-5.29e-03

Pos logit contributions
rob+0.140
robat+0.140
akes+0.140
ursion+0.137
ite+0.132
Neg logit contributions
 up-0.071
 sure-0.064
 anyone-0.062
 okay-0.057
 just-0.057
Token under
Token ID739
Feature activation-1.46e-02

Pos logit contributions
 hill+0.010
 slope+0.010
 slopes+0.010
robat+0.009
+0.009
Neg logit contributions
 sure-0.008
 anyone-0.007
 up-0.007
 okay-0.007
 closely-0.006
Token her
Token ID607
Feature activation+8.71e-03

Pos logit contributions
 hill+0.032
 slope+0.031
field+0.031
robat+0.031
forest+0.031
Neg logit contributions
 anyone-0.017
 sure-0.015
 up-0.014
 closely-0.013
 worse-0.012
Token blanket
Token ID18447
Feature activation-3.01e-02

Pos logit contributions
 anyone+0.017
 sure+0.016
 up+0.016
 closely+0.015
 worse+0.015
Neg logit contributions
 slope-0.011
thank-0.011
 hill-0.011
akes-0.011
 slopes-0.011
Token.
Token ID13
Feature activation-4.75e-04

Pos logit contributions
 slope+0.050
 hill+0.050
rob+0.050
robat+0.049
 slopes+0.049
Neg logit contributions
 anyone-0.047
 up-0.045
 sure-0.044
 closely-0.041
 okay-0.040
Token many
Token ID867
Feature activation-2.77e-02

Pos logit contributions
ust+0.034
ares+0.031
 anyone+0.031
 sure+0.029
amus+0.029
Neg logit contributions
 evenings-0.056
 hill-0.055
 hallway-0.053
 slopes-0.052
 street-0.052
Token fish
Token ID5916
Feature activation+1.85e-02

Pos logit contributions
 hill+0.050
 slope+0.048
 slopes+0.047
+0.047
field+0.047
Neg logit contributions
 anyone-0.041
 sure-0.039
 up-0.036
 worse-0.034
 okay-0.034
Token swimming
Token ID14899
Feature activation-8.59e-03

Pos logit contributions
 anyone+0.029
 sure+0.028
 up+0.026
 closely+0.025
 worse+0.024
Neg logit contributions
 hill-0.031
 slope-0.030
-0.030
 slopes-0.030
field-0.029
Token in
Token ID287
Feature activation-3.90e-02

Pos logit contributions
+0.018
 hill+0.018
 slope+0.018
field+0.018
 hallway+0.018
Neg logit contributions
 anyone-0.010
 up-0.009
 sure-0.009
 closely-0.008
 worse-0.007
Token the
Token ID262
Feature activation-4.44e-02

Pos logit contributions
akes+0.086
+0.085
 slope+0.085
robat+0.085
 hill+0.085
Neg logit contributions
 anyone-0.042
 sure-0.038
 up-0.036
 closely-0.031
 what-0.030
Token night
Token ID1755
Feature activation-8.16e-02

Pos logit contributions
+0.063
akes+0.060
robat+0.059
thank+0.058
 slope+0.056
Neg logit contributions
 anyone-0.084
 up-0.082
 sure-0.076
 worse-0.072
 closely-0.071
Token.
Token ID13
Feature activation-8.29e-03

Pos logit contributions
ursion+0.132
robat+0.132
+0.129
thank+0.128
use+0.128
Neg logit contributions
 anyone-0.130
 up-0.123
 sure-0.119
 closely-0.111
 about-0.108
Token Emma
Token ID18966
Feature activation+2.23e-03

Pos logit contributions
 hill+0.015
 slope+0.015
 slopes+0.014
field+0.014
+0.014
Neg logit contributions
 anyone-0.012
 sure-0.012
 up-0.011
 closely-0.010
 okay-0.010
Token was
Token ID373
Feature activation+1.13e-02

Pos logit contributions
 anyone+0.004
 sure+0.003
 up+0.003
 closely+0.003
 worse+0.003
Neg logit contributions
 hill-0.004
 slope-0.004
-0.004
 slopes-0.004
akes-0.004
Token so
Token ID523
Feature activation+3.85e-03

Pos logit contributions
 anyone+0.015
ust+0.013
iqu+0.012
 closely+0.012
ares+0.012
Neg logit contributions
 distance-0.021
 hill-0.020
field-0.020
 hallway-0.020
 street-0.020
Token excited
Token ID6568
Feature activation-1.48e-02

Pos logit contributions
 anyone+0.004
 sure+0.004
 up+0.004
 okay+0.004
 worse+0.003
Neg logit contributions
 hill-0.008
 slope-0.008
 slopes-0.008
-0.008
field-0.008
Token boy
Token ID2933
Feature activation-1.27e-02

Pos logit contributions
 anyone+0.001
 sure+0.001
ust+0.001
 up+0.001
 okay+0.001
Neg logit contributions
 hill-0.001
 slope-0.001
 street-0.001
 slopes-0.001
 hallway-0.001
Token kept
Token ID4030
Feature activation-2.62e-02

Pos logit contributions
+0.022
akes+0.022
 slope+0.022
 hill+0.021
 slopes+0.021
Neg logit contributions
 sure-0.020
 anyone-0.020
 up-0.018
 closely-0.017
 okay-0.016
Token walking
Token ID6155
Feature activation-3.87e-02

Pos logit contributions
 hill+0.048
 slope+0.046
 slopes+0.045
field+0.045
+0.045
Neg logit contributions
 anyone-0.038
 sure-0.035
 up-0.033
 worse-0.032
 okay-0.031
Token for
Token ID329
Feature activation-2.62e-02

Pos logit contributions
+0.070
akes+0.069
forest+0.065
field+0.065
ursion+0.064
Neg logit contributions
 up-0.064
 anyone-0.062
 sure-0.060
 closely-0.056
 about-0.053
Token a
Token ID257
Feature activation-2.53e-02

Pos logit contributions
 hill+0.046
 slopes+0.045
 slope+0.045
 evenings+0.044
 hallway+0.043
Neg logit contributions
 anyone-0.037
 sure-0.035
 up-0.034
ust-0.033
 closely-0.032
Token long
Token ID890
Feature activation-8.04e-02

Pos logit contributions
akes+0.035
 hill+0.034
+0.034
 slope+0.034
 slopes+0.033
Neg logit contributions
 anyone-0.047
 sure-0.046
 up-0.042
 closely-0.041
 okay-0.041
Token time
Token ID640
Feature activation-4.33e-02

Pos logit contributions
akes+0.167
+0.157
loo+0.156
ursion+0.154
thank+0.153
Neg logit contributions
 anyone-0.107
 sure-0.105
 closely-0.098
 up-0.092
 just-0.087
Token,
Token ID11
Feature activation-3.21e-02

Pos logit contributions
akes+0.074
ursion+0.072
 hill+0.071
field+0.071
+0.071
Neg logit contributions
 sure-0.071
 anyone-0.070
 up-0.064
 closely-0.063
 about-0.062
Token until
Token ID1566
Feature activation-3.98e-02

Pos logit contributions
akes+0.057
+0.056
 hill+0.055
ursion+0.055
 slope+0.055
Neg logit contributions
 sure-0.050
 anyone-0.048
 up-0.045
 okay-0.043
 closely-0.042
Token he
Token ID339
Feature activation-1.15e-02

Pos logit contributions
 hill+0.070
 slope+0.069
akes+0.067
+0.067
forest+0.067
Neg logit contributions
 anyone-0.062
 sure-0.057
 up-0.054
 worse-0.050
 closely-0.050
Token eventually
Token ID4191
Feature activation-3.03e-02

Pos logit contributions
 hill+0.016
field+0.015
 slope+0.015
 slopes+0.015
+0.015
Neg logit contributions
 anyone-0.021
 sure-0.020
 up-0.019
 worse-0.018
 closely-0.018
Token salad
Token ID20698
Feature activation-6.04e-03

Pos logit contributions
 anyone+0.007
 sure+0.006
 up+0.006
 closely+0.006
 worse+0.006
Neg logit contributions
 hill-0.009
 slope-0.008
-0.008
 slopes-0.008
field-0.008
Token so
Token ID523
Feature activation-9.03e-03

Pos logit contributions
 slope+0.012
 slopes+0.012
 hill+0.012
ursion+0.012
+0.012
Neg logit contributions
 anyone-0.008
 sure-0.007
 up-0.007
 okay-0.006
 worse-0.006
Token much
Token ID881
Feature activation-2.89e-02

Pos logit contributions
 hill+0.017
 slopes+0.016
+0.016
 slope+0.016
 evenings+0.016
Neg logit contributions
 anyone-0.011
 sure-0.011
 up-0.010
 ok-0.010
 okay-0.010
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
 hill+0.059
 slope+0.059
field+0.058
+0.058
 hallway+0.058
Neg logit contributions
 anyone-0.036
 sure-0.034
 up-0.031
 closely-0.029
 worse-0.028
Token From
Token ID3574
Feature activation-1.26e-02

Pos logit contributions
 hill+0.020
 slope+0.019
 slopes+0.019
field+0.019
+0.019
Neg logit contributions
 anyone-0.020
 sure-0.019
 up-0.018
 closely-0.017
 okay-0.017
Token then
Token ID788
Feature activation-6.15e-02

Pos logit contributions
 hill+0.028
 slope+0.027
+0.027
 hallway+0.027
 slopes+0.027
Neg logit contributions
 anyone-0.014
 sure-0.013
 up-0.013
 closely-0.011
 worse-0.011
Token on
Token ID319
Feature activation-3.42e-02

Pos logit contributions
forest+0.161
field+0.157
thank+0.156
 hill+0.155
 slope+0.153
Neg logit contributions
 up-0.043
 sure-0.042
 anyone-0.042
 okay-0.035
 about-0.030
Token,
Token ID11
Feature activation+3.87e-03

Pos logit contributions
thank+0.054
ursion+0.053
forest+0.051
akes+0.051
 driveway+0.050
Neg logit contributions
 anyone-0.066
 up-0.062
 sure-0.057
 not-0.057
 okay-0.055
Token Fl
Token ID1610
Feature activation+1.86e-02

Pos logit contributions
 anyone+0.005
 sure+0.005
 up+0.004
 worse+0.004
 ok+0.004
Neg logit contributions
 hill-0.007
 slope-0.007
-0.007
 slopes-0.007
field-0.007
Tokenoppy
Token ID26696
Feature activation+7.92e-03

Pos logit contributions
 anyone+0.036
 sure+0.034
 up+0.032
 worse+0.031
 closely+0.031
Neg logit contributions
 hill-0.025
-0.024
 slope-0.024
field-0.024
 slopes-0.024
Token asked
Token ID1965
Feature activation+1.17e-02

Pos logit contributions
 sure+0.011
 anyone+0.011
 up+0.010
 just+0.009
 not+0.009
Neg logit contributions
-0.015
 slope-0.015
 hill-0.015
ursion-0.015
 slopes-0.015