TOP ACTIVATIONS
MAX = 0.120

Hello , Lily ," a voice said . Lily turned around
Hello , children ," a voice said . " I have
be happy . Even though situations can be serious , it
, little girl ," a voice said . She turned and
meet new friends ," a voice says . S
leaves and make a nice place for it to live .
, little one ," a voice said . Ben turned around
eat the mint and then breath gently , so the air
said , " Hello , world !" His voice was mild
's me ," said a voice from outside the bathroom door
name is Max ," a voice says . Lily and Anna
just me ," a small voice replied . Tim
" Hello !" a voice replied . The
out of difficult and sticky situations . <|endoftext|> The family had
, stop it !" a voice shouts . It is Mom
animals and keep a safe distance ." " Okay
, little girl ," a voice said . Lily was scared
going on here ?" a voice said . They
two hands , but no arms or legs ?" <|endoftext|> Sam
box . They make a hole in the box for the

MIN ACTIVATIONS
MIN = -0.166

on top of the ash tr ay . The
in the cold white snow fl akes and sh iver when
the trash from the ash tr ay . The people in
something moving in the wild fl owers . It was a
around town on his ash tr ay , z ipping everywhere
look at the big sun fl owers in her garden .
and accidentally dropped her ash tr ay on the ground .
Lily accidentally dropped the ash tr ay and it broke into
to play with the sun fl owers , and they soon
a me adow of wild fl owers . It was the
was running around the ash tr ay . The child said
accidentally knocked over the ash tr ay and it broke into
loved to watch the sun fl owers grow and remembered the
the top of the ash tr ay fell off . Bill
a garden full of wild fl owers . He decided to
the soil for the sun fl owers . From that day
me adow full of sun fl owers and tall grass .
to work on her ash tr ay right away . She
and look at their snow fl akes . S
to wear his flip - fl ops so he wouldn 't

INTERVAL 0.048 - 0.120
CONTAINS 1.255%

important to be kind to all animals , even if they
I am hungry . Give me your food ." Tom and
happy and gave Lily a big hug . Lily felt so
always made sure to put all the dishes in the dish
sky . The hill was no longer lonely .

INTERVAL -0.023 - 0.048
CONTAINS 85.817%

mom for her advice . He says that he loves her
to consider the consequences . It 's always good to think
â € said Bunny . The ac orn did
okay , the earthquake is over now ".
necklace with a charm that looked just like her special rock

INTERVAL -0.094 - -0.023
CONTAINS 12.909%

cat . " What 's wrong ? Why do you look
don 't wear dresses . This is a silly doll ."
lemon all day . He did not want to take a
asked him to keep the coal available for the fire .
<|endoftext|> Once upon a time , there was a little boy

INTERVAL -0.166 - -0.094
CONTAINS 0.019%

. Lily observed the snow fl akes falling from the sky
cream appeared in Jack â € ™ s hands . Jack
were off to start their next adventure together . <|endoftext|> Once
man grabbed their ice - cream away and took the cone
house and drank some hot chocolate until the storm had passed
TokenHello
Token ID15496
Feature activation-1.59e-02

Pos logit contributions
izing+0.079
iddled+0.076
opter+0.075
pole+0.073
string+0.071
Neg logit contributions
 once-0.042
 two-0.034
 next-0.031
 End-0.029
m-0.027
Token,
Token ID11
Feature activation+3.41e-02

Pos logit contributions
 once+0.026
 least+0.025
 End+0.024
 two+0.024
m+0.022
Neg logit contributions
izing-0.021
iddled-0.021
pole-0.020
ledge-0.019
opter-0.019
Token Lily
Token ID20037
Feature activation+4.10e-02

Pos logit contributions
izing+0.045
iddled+0.045
pole+0.041
opter+0.040
string+0.040
Neg logit contributions
 once-0.060
 least-0.054
 End-0.053
 two-0.052
m-0.049
Token,"
Token ID553
Feature activation-2.94e-02

Pos logit contributions
izing+0.066
iddled+0.066
pole+0.062
string+0.060
ledge+0.060
Neg logit contributions
 once-0.058
 least-0.051
 End-0.051
 two-0.051
m-0.046
Token a
Token ID257
Feature activation+6.88e-02

Pos logit contributions
 once+0.050
 least+0.044
 End+0.044
 two+0.042
m+0.041
Neg logit contributions
iddled-0.040
izing-0.040
pole-0.037
ledge-0.037
string-0.036
Token voice
Token ID3809
Feature activation+1.20e-01

Pos logit contributions
izing+0.103
iddled+0.102
pole+0.088
string+0.088
ues+0.088
Neg logit contributions
 once-0.114
 two-0.107
 End-0.101
 least-0.098
 next-0.097
Token said
Token ID531
Feature activation+2.01e-02

Pos logit contributions
izing+0.223
iddled+0.209
opter+0.201
resses+0.201
string+0.198
Neg logit contributions
 once-0.164
 two-0.142
 next-0.132
 least-0.129
 End-0.121
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
izing+0.042
iddled+0.041
opter+0.039
pole+0.038
string+0.037
Neg logit contributions
 once-0.023
 two-0.019
 least-0.017
 next-0.016
 End-0.016
Token Lily
Token ID20037
Feature activation+3.12e-02

Pos logit contributions
izing+0.059
iddled+0.057
opter+0.053
pole+0.053
string+0.053
Neg logit contributions
 once-0.027
 two-0.021
 End-0.019
 least-0.019
 next-0.018
Token turned
Token ID2900
Feature activation-1.60e-02

Pos logit contributions
iddled+0.050
izing+0.049
pole+0.045
string+0.045
opter+0.045
Neg logit contributions
 once-0.046
 End-0.041
 least-0.041
 two-0.039
 night-0.038
Token around
Token ID1088
Feature activation+2.64e-02

Pos logit contributions
 once+0.026
 two+0.023
 least+0.021
 End+0.021
 night+0.021
Neg logit contributions
izing-0.026
iddled-0.024
opter-0.023
ues-0.023
pole-0.022
TokenHello
Token ID15496
Feature activation-1.55e-02

Pos logit contributions
izing+0.051
iddled+0.051
opter+0.045
pole+0.045
ues+0.045
Neg logit contributions
 once-0.056
 two-0.050
 least-0.049
 End-0.048
 next-0.045
Token,
Token ID11
Feature activation+3.68e-02

Pos logit contributions
 once+0.023
 least+0.020
 End+0.020
 two+0.020
 night+0.018
Neg logit contributions
izing-0.025
iddled-0.025
opter-0.023
pole-0.023
string-0.022
Token children
Token ID1751
Feature activation+2.00e-02

Pos logit contributions
izing+0.053
iddled+0.052
opter+0.047
pole+0.047
ledge+0.046
Neg logit contributions
 once-0.063
 least-0.055
 End-0.054
 two-0.054
 night-0.050
Token,"
Token ID553
Feature activation-1.94e-02

Pos logit contributions
iddled+0.036
izing+0.036
string+0.032
ues+0.032
ledge+0.032
Neg logit contributions
 once-0.028
 two-0.024
 least-0.024
m-0.022
 night-0.022
Token a
Token ID257
Feature activation+5.71e-02

Pos logit contributions
 once+0.033
 least+0.030
 End+0.029
day+0.028
m+0.028
Neg logit contributions
izing-0.024
iddled-0.024
nam-0.023
taker-0.022
ledge-0.022
Token voice
Token ID3809
Feature activation+1.16e-01

Pos logit contributions
iddled+0.092
izing+0.091
 ours+0.082
string+0.080
pole+0.080
Neg logit contributions
 once-0.086
 two-0.086
 End-0.077
m-0.076
 next-0.075
Token said
Token ID531
Feature activation+2.99e-02

Pos logit contributions
izing+0.230
iddled+0.222
 ours+0.214
opter+0.212
osity+0.211
Neg logit contributions
 once-0.143
 two-0.122
 next-0.112
m-0.102
 night-0.099
Token.
Token ID13
Feature activation+3.37e-02

Pos logit contributions
izing+0.066
iddled+0.063
opter+0.061
pole+0.059
ues+0.058
Neg logit contributions
 once-0.033
 two-0.027
 next-0.023
 least-0.023
 End-0.022
Token "
Token ID366
Feature activation+3.26e-02

Pos logit contributions
izing+0.061
iddled+0.056
quet+0.054
pole+0.053
opter+0.053
Neg logit contributions
 once-0.053
 two-0.049
 next-0.043
 only-0.041
-0.040
TokenI
Token ID40
Feature activation+7.59e-03

Pos logit contributions
izing+0.059
iddled+0.058
pole+0.053
string+0.053
opter+0.053
Neg logit contributions
 once-0.044
 least-0.037
 two-0.037
 End-0.036
 night-0.033
Token have
Token ID423
Feature activation-4.20e-03

Pos logit contributions
iddled+0.011
izing+0.010
opter+0.010
string+0.010
pole+0.010
Neg logit contributions
 once-0.012
 End-0.011
 two-0.010
 least-0.010
 next-0.009
Token be
Token ID307
Feature activation+1.11e-02

Pos logit contributions
 once+0.011
 least+0.009
 two+0.009
 End+0.009
m+0.008
Neg logit contributions
izing-0.012
iddled-0.012
pole-0.010
opter-0.010
string-0.010
Token happy
Token ID3772
Feature activation-1.12e-02

Pos logit contributions
izing+0.017
iddled+0.017
opter+0.016
string+0.015
ues+0.015
Neg logit contributions
 once-0.019
 two-0.017
 least-0.016
 End-0.016
 next-0.016
Token.
Token ID13
Feature activation-4.03e-03

Pos logit contributions
 once+0.014
 least+0.012
 two+0.012
 End+0.012
m+0.011
Neg logit contributions
izing-0.021
iddled-0.020
pole-0.018
opter-0.018
ues-0.018
Token Even
Token ID3412
Feature activation+5.12e-03

Pos logit contributions
 once+0.006
 two+0.006
 least+0.006
 End+0.005
 night+0.005
Neg logit contributions
izing-0.006
iddled-0.006
pole-0.006
opter-0.006
ues-0.006
Token though
Token ID996
Feature activation+1.93e-02

Pos logit contributions
izing+0.005
iddled+0.004
nam+0.004
ledge+0.004
pole+0.004
Neg logit contributions
 once-0.011
 End-0.010
 least-0.010
 two-0.010
m-0.009
Token situations
Token ID7445
Feature activation+1.11e-01

Pos logit contributions
izing+0.026
iddled+0.022
opter+0.021
nam+0.021
 ours+0.020
Neg logit contributions
 once-0.033
 least-0.031
 End-0.030
mo-0.029
m-0.029
Token can
Token ID460
Feature activation-3.82e-02

Pos logit contributions
iddled+0.210
izing+0.204
pole+0.195
string+0.192
ues+0.192
Neg logit contributions
 once-0.143
 two-0.123
 least-0.120
 End-0.110
 best-0.109
Token be
Token ID307
Feature activation+5.33e-03

Pos logit contributions
 once+0.059
 End+0.054
 least+0.051
 two+0.051
m+0.050
Neg logit contributions
izing-0.057
iddled-0.052
opter-0.051
pole-0.051
ledge-0.049
Token serious
Token ID2726
Feature activation+1.32e-02

Pos logit contributions
izing+0.008
iddled+0.008
opter+0.008
ues+0.008
string+0.008
Neg logit contributions
 once-0.008
 two-0.007
 least-0.007
 End-0.007
 night-0.007
Token,
Token ID11
Feature activation+1.26e-02

Pos logit contributions
iddled+0.025
izing+0.024
opter+0.023
pole+0.023
string+0.022
Neg logit contributions
 once-0.018
 two-0.015
 least-0.014
 End-0.014
 next-0.013
Token it
Token ID340
Feature activation+3.25e-02

Pos logit contributions
izing+0.017
iddled+0.015
nam+0.014
ledge+0.013
opter+0.013
Neg logit contributions
 once-0.022
 least-0.020
 End-0.020
m-0.019
mo-0.019
Token,
Token ID11
Feature activation+2.75e-02

Pos logit contributions
 once+0.034
 least+0.031
 End+0.030
 two+0.029
m+0.027
Neg logit contributions
izing-0.036
iddled-0.034
opter-0.033
pole-0.032
ues-0.032
Token little
Token ID1310
Feature activation+4.78e-02

Pos logit contributions
izing+0.034
iddled+0.034
pole+0.032
opter+0.031
string+0.030
Neg logit contributions
 once-0.051
 least-0.045
 End-0.044
 two-0.043
m-0.042
Token girl
Token ID2576
Feature activation+4.57e-02

Pos logit contributions
izing+0.075
iddled+0.075
pole+0.067
opter+0.066
ues+0.066
Neg logit contributions
 once-0.076
 two-0.067
 least-0.066
 End-0.065
 night-0.061
Token,"
Token ID553
Feature activation-3.14e-02

Pos logit contributions
izing+0.066
iddled+0.063
pole+0.058
opter+0.058
ues+0.057
Neg logit contributions
 once-0.076
 least-0.068
 End-0.067
 two-0.066
 night-0.062
Token a
Token ID257
Feature activation+5.18e-02

Pos logit contributions
 once+0.057
 least+0.051
 End+0.050
m+0.047
 two+0.047
Neg logit contributions
izing-0.040
iddled-0.039
nam-0.035
ledge-0.035
pole-0.035
Token voice
Token ID3809
Feature activation+1.10e-01

Pos logit contributions
iddled+0.079
izing+0.077
 ours+0.069
pole+0.067
ues+0.067
Neg logit contributions
 once-0.084
 two-0.084
 End-0.076
 next-0.075
 end-0.072
Token said
Token ID531
Feature activation+2.08e-02

Pos logit contributions
izing+0.214
iddled+0.208
opter+0.196
string+0.194
 ours+0.193
Neg logit contributions
 once-0.140
 two-0.121
 next-0.109
m-0.103
 least-0.101
Token.
Token ID13
Feature activation+2.52e-02

Pos logit contributions
izing+0.043
iddled+0.042
opter+0.040
pole+0.039
ledge+0.039
Neg logit contributions
 once-0.024
 two-0.019
 least-0.018
 End-0.017
 next-0.017
Token She
Token ID1375
Feature activation+2.70e-02

Pos logit contributions
izing+0.041
iddled+0.038
pole+0.036
opter+0.036
ues+0.035
Neg logit contributions
 once-0.042
 two-0.038
 next-0.034
 End-0.033
 least-0.033
Token turned
Token ID2900
Feature activation-1.43e-02

Pos logit contributions
izing+0.045
iddled+0.045
pole+0.041
opter+0.040
string+0.040
Neg logit contributions
 once-0.040
 least-0.034
 two-0.034
 End-0.033
 night-0.031
Token and
Token ID290
Feature activation-1.23e-03

Pos logit contributions
 once+0.024
 least+0.021
 two+0.021
 End+0.020
 night+0.019
Neg logit contributions
izing-0.022
iddled-0.021
pole-0.019
opter-0.019
string-0.019
Token meet
Token ID1826
Feature activation-6.76e-03

Pos logit contributions
 once+0.017
 least+0.015
 two+0.015
 End+0.015
m+0.014
Neg logit contributions
izing-0.015
iddled-0.015
pole-0.013
opter-0.013
string-0.013
Token new
Token ID649
Feature activation-7.78e-03

Pos logit contributions
 once+0.012
 End+0.011
 least+0.011
 two+0.010
m+0.010
Neg logit contributions
izing-0.009
iddled-0.008
pole-0.008
 ours-0.008
opter-0.008
Token friends
Token ID2460
Feature activation-1.28e-03

Pos logit contributions
 once+0.011
 least+0.010
 two+0.010
 End+0.010
m+0.009
Neg logit contributions
izing-0.013
iddled-0.013
opter-0.012
pole-0.012
ues-0.012
Token,"
Token ID553
Feature activation-2.75e-02

Pos logit contributions
 once+0.002
 least+0.002
 End+0.002
 two+0.002
 night+0.002
Neg logit contributions
izing-0.002
iddled-0.002
pole-0.002
opter-0.002
ues-0.002
Token a
Token ID257
Feature activation+5.09e-02

Pos logit contributions
 once+0.047
 End+0.042
 least+0.041
day+0.041
 eventually+0.039
Neg logit contributions
nam-0.034
 brace-0.033
iddled-0.031
izing-0.031
ledge-0.031
Token voice
Token ID3809
Feature activation+1.10e-01

Pos logit contributions
iddled+0.093
izing+0.091
 ours+0.082
uts+0.081
string+0.081
Neg logit contributions
 two-0.069
 once-0.066
 End-0.061
 next-0.060
 end-0.059
Token says
Token ID1139
Feature activation+2.70e-02

Pos logit contributions
izing+0.207
iddled+0.200
opter+0.189
string+0.182
nam+0.179
Neg logit contributions
 once-0.148
 two-0.131
 next-0.121
 least-0.116
 End-0.114
Token.
Token ID13
Feature activation+3.12e-02

Pos logit contributions
izing+0.053
iddled+0.053
opter+0.049
pole+0.049
string+0.048
Neg logit contributions
 once-0.032
 two-0.026
 least-0.026
 End-0.025
 next-0.023
Token
Token ID198
Feature activation+2.66e-02

Pos logit contributions
iddled+0.069
izing+0.069
ledge+0.064
ues+0.064
opter+0.064
Neg logit contributions
 once-0.028
 least-0.023
 End-0.021
 two-0.021
 night-0.018
Token
Token ID198
Feature activation+2.30e-02

Pos logit contributions
izing+0.059
iddled+0.059
ues+0.055
opter+0.055
ledge+0.055
Neg logit contributions
 once-0.023
 least-0.019
 End-0.017
 two-0.017
 night-0.015
TokenS
Token ID50
Feature activation+3.03e-03

Pos logit contributions
iddled+0.039
izing+0.037
pole+0.035
imm+0.034
string+0.034
Neg logit contributions
 two-0.034
 once-0.034
 next-0.031
 End-0.031
m-0.029
Token leaves
Token ID5667
Feature activation+4.34e-02

Pos logit contributions
izing+0.014
iddled+0.014
pole+0.012
string+0.012
ues+0.012
Neg logit contributions
 once-0.019
 End-0.017
 least-0.017
m-0.016
 two-0.016
Token and
Token ID290
Feature activation+1.42e-02

Pos logit contributions
izing+0.083
iddled+0.082
opter+0.075
pole+0.075
ledge+0.074
Neg logit contributions
 once-0.054
 two-0.045
 least-0.045
 End-0.043
 next-0.040
Token make
Token ID787
Feature activation+3.10e-02

Pos logit contributions
izing+0.019
iddled+0.018
pole+0.017
string+0.017
ledge+0.016
Neg logit contributions
 once-0.025
 End-0.022
 least-0.022
 two-0.021
m-0.021
Token a
Token ID257
Feature activation+5.73e-02

Pos logit contributions
izing+0.052
iddled+0.051
pole+0.046
opter+0.046
ues+0.046
Neg logit contributions
 once-0.045
 least-0.039
 End-0.039
 two-0.039
m-0.036
Token nice
Token ID3621
Feature activation+5.33e-02

Pos logit contributions
izing+0.082
iddled+0.079
pole+0.073
opter+0.072
ues+0.072
Neg logit contributions
 once-0.098
 least-0.086
 two-0.085
 End-0.084
m-0.080
Token place
Token ID1295
Feature activation+1.10e-01

Pos logit contributions
iddled+0.093
izing+0.088
ledge+0.078
opter+0.078
string+0.078
Neg logit contributions
 once-0.083
 two-0.070
 least-0.066
 night-0.066
 end-0.063
Token for
Token ID329
Feature activation-8.46e-03

Pos logit contributions
izing+0.207
iddled+0.207
opter+0.191
string+0.185
pole+0.184
Neg logit contributions
 once-0.145
 two-0.121
 least-0.116
 next-0.110
 End-0.109
Token it
Token ID340
Feature activation+1.85e-02

Pos logit contributions
 once+0.012
 least+0.010
 End+0.010
 two+0.010
m+0.010
Neg logit contributions
izing-0.015
iddled-0.014
opter-0.013
pole-0.013
ues-0.013
Token to
Token ID284
Feature activation+2.34e-02

Pos logit contributions
izing+0.036
iddled+0.036
opter+0.033
pole+0.033
string+0.032
Neg logit contributions
 once-0.022
 two-0.018
 least-0.018
 End-0.018
 night-0.016
Token live
Token ID2107
Feature activation-9.81e-03

Pos logit contributions
izing+0.035
iddled+0.035
opter+0.031
pole+0.031
string+0.030
Neg logit contributions
 once-0.039
 two-0.034
 least-0.034
 End-0.033
 night-0.032
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
 once+0.013
 least+0.013
 End+0.012
 two+0.012
m+0.011
Neg logit contributions
izing-0.016
iddled-0.016
pole-0.015
ues-0.014
string-0.014
Token,
Token ID11
Feature activation+2.85e-02

Pos logit contributions
 once+0.031
 two+0.027
 least+0.026
 End+0.026
 night+0.024
Neg logit contributions
izing-0.037
iddled-0.037
pole-0.033
opter-0.033
string-0.033
Token little
Token ID1310
Feature activation+4.65e-02

Pos logit contributions
izing+0.045
iddled+0.044
opter+0.040
pole+0.040
string+0.039
Neg logit contributions
 once-0.045
 least-0.039
 two-0.039
 End-0.038
m-0.036
Token one
Token ID530
Feature activation-1.99e-02

Pos logit contributions
iddled+0.082
izing+0.081
pole+0.073
ues+0.073
opter+0.072
Neg logit contributions
 once-0.067
 two-0.060
 least-0.056
 End-0.055
 night-0.054
Token,"
Token ID553
Feature activation-3.25e-02

Pos logit contributions
 once+0.034
 least+0.030
 two+0.030
 End+0.029
m+0.027
Neg logit contributions
izing-0.029
iddled-0.028
pole-0.025
opter-0.025
ues-0.025
Token a
Token ID257
Feature activation+5.07e-02

Pos logit contributions
 once+0.056
 least+0.051
 End+0.050
m+0.047
 two+0.047
Neg logit contributions
izing-0.044
iddled-0.043
pole-0.039
opter-0.039
ledge-0.038
Token voice
Token ID3809
Feature activation+1.09e-01

Pos logit contributions
iddled+0.073
izing+0.072
pole+0.063
ues+0.063
string+0.062
Neg logit contributions
 once-0.087
 two-0.082
 End-0.077
 least-0.075
 next-0.074
Token said
Token ID531
Feature activation+2.14e-02

Pos logit contributions
izing+0.212
iddled+0.206
opter+0.193
string+0.192
 ours+0.192
Neg logit contributions
 once-0.138
 two-0.119
 next-0.107
m-0.100
 least-0.099
Token.
Token ID13
Feature activation+2.57e-02

Pos logit contributions
izing+0.044
iddled+0.043
opter+0.041
pole+0.040
ues+0.040
Neg logit contributions
 once-0.025
 two-0.021
 least-0.018
 End-0.018
 next-0.018
Token Ben
Token ID3932
Feature activation+3.08e-02

Pos logit contributions
izing+0.043
iddled+0.041
opter+0.038
pole+0.038
ues+0.038
Neg logit contributions
 once-0.040
 two-0.036
 least-0.033
 End-0.032
 next-0.032
Token turned
Token ID2900
Feature activation-1.37e-02

Pos logit contributions
izing+0.053
iddled+0.052
opter+0.047
pole+0.047
string+0.047
Neg logit contributions
 once-0.044
 two-0.038
 least-0.038
 End-0.037
 night-0.034
Token around
Token ID1088
Feature activation+3.34e-02

Pos logit contributions
 once+0.022
 two+0.019
 least+0.019
 End+0.019
 night+0.017
Neg logit contributions
izing-0.021
iddled-0.021
opter-0.019
pole-0.019
ues-0.019
Token eat
Token ID4483
Feature activation-9.42e-03

Pos logit contributions
 once+0.014
 End+0.014
 two+0.013
 least+0.013
fl+0.013
Neg logit contributions
iddled-0.016
izing-0.014
opter-0.014
ledge-0.014
ues-0.014
Token the
Token ID262
Feature activation+3.43e-03

Pos logit contributions
 once+0.015
 End+0.014
 least+0.014
 two+0.013
m+0.013
Neg logit contributions
izing-0.014
iddled-0.013
 ours-0.012
pole-0.012
ledge-0.011
Token mint
Token ID21043
Feature activation-3.82e-02

Pos logit contributions
izing+0.005
iddled+0.004
string+0.004
pole+0.004
igator+0.004
Neg logit contributions
 once-0.006
 least-0.005
 End-0.005
 two-0.005
 eventually-0.005
Token and
Token ID290
Feature activation+1.14e-03

Pos logit contributions
 once+0.057
 two+0.049
 least+0.048
 next+0.045
 End+0.045
Neg logit contributions
iddled-0.066
izing-0.063
ledge-0.059
opter-0.058
string-0.058
Token then
Token ID788
Feature activation-6.27e-02

Pos logit contributions
izing+0.002
iddled+0.002
pole+0.001
opter+0.001
string+0.001
Neg logit contributions
 once-0.002
 End-0.002
 least-0.002
 two-0.002
m-0.002
Token breath
Token ID8033
Feature activation+1.09e-01

Pos logit contributions
 once+0.111
 least+0.098
 End+0.097
 two+0.097
m+0.091
Neg logit contributions
izing-0.084
iddled-0.083
opter-0.074
pole-0.074
string-0.073
Token gently
Token ID15165
Feature activation-7.96e-03

Pos logit contributions
iddled+0.160
izing+0.158
opter+0.143
pole+0.142
string+0.139
Neg logit contributions
 once-0.175
 least-0.163
 End-0.159
 two-0.155
 best-0.142
Token,
Token ID11
Feature activation+1.86e-02

Pos logit contributions
 once+0.011
 two+0.009
 least+0.009
 End+0.009
 night+0.008
Neg logit contributions
izing-0.014
iddled-0.014
opter-0.013
pole-0.013
string-0.013
Token so
Token ID523
Feature activation+2.28e-02

Pos logit contributions
izing+0.028
iddled+0.028
pole+0.025
ledge+0.025
string+0.025
Neg logit contributions
 once-0.030
 least-0.026
 End-0.026
 two-0.026
m-0.024
Token the
Token ID262
Feature activation+1.55e-02

Pos logit contributions
izing+0.030
iddled+0.027
opter+0.026
ledge+0.025
string+0.025
Neg logit contributions
 once-0.040
 least-0.037
 End-0.036
m-0.035
mo-0.033
Token air
Token ID1633
Feature activation+6.42e-02

Pos logit contributions
izing+0.022
igator+0.020
nam+0.020
pole+0.019
iddled+0.019
Neg logit contributions
 once-0.025
 least-0.022
abel-0.021
mo-0.020
m-0.020
Token said
Token ID531
Feature activation-1.79e-02

Pos logit contributions
 once+0.007
 two+0.007
 least+0.006
 eventually+0.006
 next+0.006
Neg logit contributions
izing-0.009
iddled-0.009
opter-0.008
ues-0.008
pole-0.008
Token,
Token ID11
Feature activation+2.15e-02

Pos logit contributions
 once+0.021
 two+0.018
 best+0.015
 next+0.015
 End+0.014
Neg logit contributions
izing-0.038
iddled-0.037
 ours-0.034
pole-0.034
opter-0.033
Token "
Token ID366
Feature activation+2.83e-02

Pos logit contributions
izing+0.028
iddled+0.028
opter+0.025
pole+0.024
ues+0.024
Neg logit contributions
 once-0.040
 two-0.036
 least-0.034
 End-0.034
 next-0.033
TokenHello
Token ID15496
Feature activation-2.18e-02

Pos logit contributions
izing+0.047
iddled+0.045
pole+0.042
string+0.042
opter+0.041
Neg logit contributions
 once-0.042
 least-0.037
 two-0.036
 End-0.036
 best-0.034
Token,
Token ID11
Feature activation+2.70e-02

Pos logit contributions
 once+0.033
 least+0.029
 End+0.028
 two+0.028
m+0.026
Neg logit contributions
izing-0.035
iddled-0.034
opter-0.032
pole-0.032
ledge-0.031
Token world
Token ID995
Feature activation+1.08e-01

Pos logit contributions
izing+0.038
iddled+0.038
pole+0.034
opter+0.034
ledge+0.034
Neg logit contributions
 once-0.046
 least-0.041
 End-0.040
 two-0.040
m-0.038
Token!"
Token ID2474
Feature activation+1.30e-02

Pos logit contributions
iddled+0.183
izing+0.175
nam+0.163
ues+0.159
string+0.155
Neg logit contributions
 once-0.169
 two-0.143
 next-0.136
 least-0.134
 night-0.134
Token His
Token ID2399
Feature activation-1.46e-02

Pos logit contributions
izing+0.026
iddled+0.025
pole+0.024
opter+0.023
ledge+0.023
Neg logit contributions
 once-0.015
 least-0.012
 End-0.012
 two-0.012
m-0.011
Token voice
Token ID3809
Feature activation+8.16e-02

Pos logit contributions
 once+0.022
 least+0.021
m+0.021
 End+0.020
mo+0.019
Neg logit contributions
izing-0.021
iddled-0.020
ledge-0.019
string-0.019
pole-0.019
Token was
Token ID373
Feature activation+2.17e-03

Pos logit contributions
izing+0.134
iddled+0.132
opter+0.119
pole+0.119
string+0.118
Neg logit contributions
 once-0.123
 two-0.106
 least-0.106
 End-0.103
m-0.096
Token mild
Token ID11607
Feature activation+3.31e-02

Pos logit contributions
izing+0.003
iddled+0.003
pole+0.003
opter+0.003
string+0.003
Neg logit contributions
 once-0.003
 least-0.003
 two-0.003
 End-0.003
m-0.003
Token's
Token ID338
Feature activation+4.64e-03

Pos logit contributions
izing+0.086
iddled+0.086
ues+0.081
quet+0.079
string+0.078
Neg logit contributions
 once-0.038
 two-0.031
 least-0.027
 next-0.025
 night-0.025
Token me
Token ID502
Feature activation+5.17e-02

Pos logit contributions
izing+0.007
iddled+0.007
pole+0.007
 ours+0.007
ledge+0.006
Neg logit contributions
 once-0.007
 End-0.006
 least-0.006
m-0.006
 two-0.006
Token,"
Token ID553
Feature activation-3.32e-02

Pos logit contributions
izing+0.075
iddled+0.073
pole+0.069
ledge+0.068
opter+0.066
Neg logit contributions
 once-0.079
 End-0.075
 least-0.071
 two-0.071
 morning-0.064
Token said
Token ID531
Feature activation+2.55e-02

Pos logit contributions
 once+0.048
 least+0.042
 End+0.041
 two+0.041
m+0.038
Neg logit contributions
izing-0.056
iddled-0.055
opter-0.050
pole-0.050
ledge-0.049
Token a
Token ID257
Feature activation+5.15e-02

Pos logit contributions
izing+0.047
iddled+0.047
ues+0.043
opter+0.043
pole+0.042
Neg logit contributions
 once-0.034
 two-0.029
 least-0.028
 End-0.027
 night-0.025
Token voice
Token ID3809
Feature activation+1.08e-01

Pos logit contributions
iddled+0.090
izing+0.087
ues+0.080
 ours+0.079
pole+0.078
Neg logit contributions
 once-0.073
 two-0.071
 End-0.064
 next-0.062
 night-0.061
Token from
Token ID422
Feature activation+4.01e-03

Pos logit contributions
izing+0.208
iddled+0.207
opter+0.192
ues+0.189
string+0.189
Neg logit contributions
 once-0.134
 two-0.109
 least-0.103
 End-0.100
 next-0.099
Token outside
Token ID2354
Feature activation+2.16e-02

Pos logit contributions
iddled+0.005
izing+0.005
ledge+0.005
igator+0.005
nam+0.005
Neg logit contributions
m-0.007
 once-0.007
 least-0.006
 best-0.006
 End-0.006
Token the
Token ID262
Feature activation-2.12e-02

Pos logit contributions
izing+0.035
iddled+0.034
opter+0.031
pole+0.031
ledge+0.031
Neg logit contributions
 once-0.030
 least-0.029
 End-0.028
 two-0.026
 best-0.025
Token bathroom
Token ID12436
Feature activation-2.84e-03

Pos logit contributions
 once+0.036
 End+0.031
 least+0.031
mo+0.030
 eventually+0.029
Neg logit contributions
izing-0.026
iddled-0.026
 flow-0.025
rier-0.024
ledge-0.024
Token door
Token ID3420
Feature activation+3.13e-02

Pos logit contributions
 once+0.004
 least+0.003
 End+0.003
 two+0.003
 night+0.003
Neg logit contributions
izing-0.005
iddled-0.005
pole-0.005
ues-0.005
ledge-0.005
Token name
Token ID1438
Feature activation+2.05e-02

Pos logit contributions
izing+0.007
iddled+0.007
pole+0.007
string+0.007
opter+0.007
Neg logit contributions
 once-0.007
 least-0.006
 End-0.006
 two-0.006
m-0.006
Token is
Token ID318
Feature activation+6.09e-03

Pos logit contributions
iddled+0.044
izing+0.044
ues+0.041
nam+0.040
string+0.040
Neg logit contributions
 once-0.023
 two-0.019
 least-0.016
 End-0.016
 next-0.016
Token Max
Token ID5436
Feature activation+2.56e-02

Pos logit contributions
izing+0.009
iddled+0.009
pole+0.009
opter+0.009
ledge+0.008
Neg logit contributions
 once-0.009
 End-0.008
 least-0.008
 two-0.008
m-0.008
Token,"
Token ID553
Feature activation-2.76e-02

Pos logit contributions
izing+0.042
iddled+0.041
pole+0.039
opter+0.037
string+0.037
Neg logit contributions
 once-0.036
 least-0.033
 End-0.032
 two-0.031
 night-0.029
Token a
Token ID257
Feature activation+5.61e-02

Pos logit contributions
 once+0.045
 least+0.041
 End+0.039
m+0.038
day+0.037
Neg logit contributions
izing-0.039
iddled-0.039
ledge-0.037
pole-0.036
nam-0.035
Token voice
Token ID3809
Feature activation+1.07e-01

Pos logit contributions
iddled+0.092
izing+0.090
 ours+0.085
string+0.081
ues+0.081
Neg logit contributions
 two-0.085
 once-0.080
 next-0.074
 End-0.074
 end-0.070
Token says
Token ID1139
Feature activation+2.58e-02

Pos logit contributions
izing+0.205
iddled+0.193
opter+0.187
string+0.183
quet+0.182
Neg logit contributions
 once-0.141
 two-0.124
 next-0.114
 least-0.105
 End-0.104
Token.
Token ID13
Feature activation+3.61e-02

Pos logit contributions
izing+0.053
iddled+0.052
opter+0.049
pole+0.048
string+0.047
Neg logit contributions
 once-0.030
 two-0.024
 least-0.023
 End-0.022
 next-0.021
Token Lily
Token ID20037
Feature activation+3.01e-02

Pos logit contributions
izing+0.075
iddled+0.071
opter+0.068
pole+0.068
quet+0.067
Neg logit contributions
 once-0.044
 two-0.039
 next-0.032
 only-0.031
 End-0.031
Token and
Token ID290
Feature activation+1.10e-02

Pos logit contributions
iddled+0.051
izing+0.051
string+0.046
pole+0.046
ledge+0.046
Neg logit contributions
 once-0.042
 End-0.037
 least-0.037
 two-0.036
 night-0.034
Token Anna
Token ID11735
Feature activation+1.40e-02

Pos logit contributions
iddled+0.018
izing+0.017
ledge+0.016
string+0.016
opter+0.016
Neg logit contributions
 once-0.016
 End-0.014
 least-0.014
m-0.013
 two-0.013
Token just
Token ID655
Feature activation+2.60e-03

Pos logit contributions
izing+0.006
pole+0.005
iddled+0.005
 ours+0.005
ledge+0.005
Neg logit contributions
 once-0.005
 End-0.004
 least-0.004
m-0.004
 two-0.004
Token me
Token ID502
Feature activation+3.62e-02

Pos logit contributions
izing+0.005
iddled+0.004
pole+0.004
ledge+0.004
 ours+0.004
Neg logit contributions
 once-0.003
 End-0.003
 least-0.003
m-0.003
 two-0.003
Token,"
Token ID553
Feature activation-3.99e-02

Pos logit contributions
izing+0.051
iddled+0.049
pole+0.047
 ours+0.046
ledge+0.046
Neg logit contributions
 once-0.057
 End-0.055
 least-0.052
 two-0.050
 eventually-0.047
Token a
Token ID257
Feature activation+5.11e-02

Pos logit contributions
 once+0.064
 least+0.059
 End+0.058
m+0.054
 two+0.054
Neg logit contributions
izing-0.057
iddled-0.056
pole-0.051
ledge-0.051
opter-0.050
Token small
Token ID1402
Feature activation+6.18e-02

Pos logit contributions
iddled+0.085
izing+0.082
 ours+0.076
ues+0.075
pole+0.074
Neg logit contributions
 once-0.074
 two-0.072
 End-0.065
 next-0.063
m-0.063
Token voice
Token ID3809
Feature activation+1.07e-01

Pos logit contributions
izing+0.099
iddled+0.098
opter+0.087
pole+0.086
ues+0.086
Neg logit contributions
 once-0.098
 two-0.087
 End-0.083
 least-0.083
 next-0.079
Token replied
Token ID8712
Feature activation+1.92e-02

Pos logit contributions
izing+0.204
iddled+0.198
opter+0.188
ues+0.186
string+0.184
Neg logit contributions
 once-0.139
 two-0.115
 next-0.106
 least-0.103
 night-0.102
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
izing+0.038
iddled+0.037
opter+0.035
pole+0.034
ues+0.034
Neg logit contributions
 once-0.024
 two-0.020
 least-0.019
 End-0.018
 night-0.017
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
izing+0.027
iddled+0.026
string+0.025
rier+0.025
ues+0.025
Neg logit contributions
 once-0.010
 two-0.008
 next-0.006
 least-0.006
m-0.006
Token
Token ID198
Feature activation+8.22e-03

Pos logit contributions
izing+0.027
iddled+0.027
opter+0.025
pole+0.025
ues+0.025
Neg logit contributions
 once-0.009
 least-0.007
 two-0.007
 End-0.006
 night-0.005
TokenTim
Token ID14967
Feature activation+1.49e-02

Pos logit contributions
rier+0.015
iddled+0.015
izing+0.014
string+0.014
ues+0.013
Neg logit contributions
 once-0.011
 two-0.010
 next-0.010
 End-0.009
 eventually-0.009
Token
Token ID198
Feature activation+1.75e-03

Pos logit contributions
izing+0.010
iddled+0.010
pole+0.009
opter+0.009
string+0.009
Neg logit contributions
 once-0.003
 two-0.003
 least-0.002
 End-0.002
 next-0.002
Token"
Token ID1
Feature activation+2.10e-02

Pos logit contributions
iddled+0.003
izing+0.003
rier+0.003
ues+0.003
 ours+0.003
Neg logit contributions
 once-0.002
 two-0.002
 next-0.002
 End-0.002
One-0.002
TokenHello
Token ID15496
Feature activation-1.89e-02

Pos logit contributions
izing+0.036
iddled+0.035
pole+0.032
string+0.032
ledge+0.031
Neg logit contributions
 once-0.030
 least-0.027
 End-0.025
 two-0.025
 night-0.024
Token!"
Token ID2474
Feature activation-8.20e-03

Pos logit contributions
 once+0.030
 least+0.029
 End+0.029
 two+0.026
 eventually+0.025
Neg logit contributions
izing-0.027
iddled-0.025
opter-0.025
pole-0.024
ledge-0.023
Token a
Token ID257
Feature activation+5.06e-02

Pos logit contributions
 once+0.012
 two+0.010
 least+0.010
 End+0.010
 night+0.009
Neg logit contributions
izing-0.014
iddled-0.014
opter-0.013
pole-0.013
ues-0.013
Token voice
Token ID3809
Feature activation+1.07e-01

Pos logit contributions
iddled+0.080
izing+0.078
ues+0.070
string+0.070
pole+0.069
Neg logit contributions
 once-0.080
 two-0.074
 End-0.070
 least-0.068
 next-0.066
Token replied
Token ID8712
Feature activation+1.91e-02

Pos logit contributions
izing+0.192
iddled+0.188
opter+0.177
 ours+0.176
ues+0.175
Neg logit contributions
 once-0.150
 two-0.126
 next-0.117
 least-0.115
m-0.113
Token.
Token ID13
Feature activation+1.47e-02

Pos logit contributions
izing+0.037
iddled+0.036
opter+0.034
pole+0.033
ues+0.033
Neg logit contributions
 once-0.024
 two-0.020
 least-0.019
 End-0.019
 night-0.017
Token
Token ID198
Feature activation+1.43e-02

Pos logit contributions
izing+0.030
iddled+0.028
ues+0.027
string+0.026
opter+0.026
Neg logit contributions
 once-0.019
 two-0.016
 next-0.013
 only-0.013
 least-0.013
Token
Token ID198
Feature activation+1.14e-02

Pos logit contributions
izing+0.035
iddled+0.034
pole+0.032
string+0.032
ues+0.032
Neg logit contributions
 once-0.011
 two-0.008
 End-0.007
 next-0.007
 least-0.006
TokenThe
Token ID464
Feature activation-1.79e-02

Pos logit contributions
izing+0.021
rier+0.021
iddled+0.021
string+0.020
 ours+0.019
Neg logit contributions
 two-0.014
 once-0.013
 next-0.013
 End-0.012
One-0.011
Token out
Token ID503
Feature activation+5.01e-02

Pos logit contributions
iddled+0.030
izing+0.028
opter+0.026
pole+0.026
ues+0.026
Neg logit contributions
 once-0.024
 two-0.021
 least-0.020
 next-0.019
 End-0.019
Token of
Token ID286
Feature activation-8.16e-04

Pos logit contributions
izing+0.086
iddled+0.085
opter+0.077
pole+0.077
string+0.077
Neg logit contributions
 once-0.072
 two-0.062
 least-0.061
 End-0.059
 next-0.055
Token difficult
Token ID2408
Feature activation-1.76e-02

Pos logit contributions
 once+0.001
 least+0.001
 two+0.001
 End+0.001
m+0.001
Neg logit contributions
izing-0.001
iddled-0.001
opter-0.001
ledge-0.001
pole-0.001
Token and
Token ID290
Feature activation-1.94e-02

Pos logit contributions
 once+0.033
 two+0.030
 least+0.029
 End+0.029
 night+0.028
Neg logit contributions
iddled-0.022
izing-0.022
pole-0.020
opter-0.020
string-0.019
Token sticky
Token ID23408
Feature activation-7.41e-03

Pos logit contributions
 once+0.030
 least+0.026
 two+0.026
 End+0.025
 next+0.024
Neg logit contributions
iddled-0.031
izing-0.031
opter-0.028
pole-0.028
ues-0.028
Token situations
Token ID7445
Feature activation+1.06e-01

Pos logit contributions
 once+0.012
 two+0.010
 least+0.010
 End+0.010
 night+0.009
Neg logit contributions
iddled-0.012
izing-0.012
pole-0.011
opter-0.011
string-0.011
Token.
Token ID13
Feature activation+3.02e-03

Pos logit contributions
iddled+0.210
izing+0.209
pole+0.193
opter+0.192
string+0.191
Neg logit contributions
 once-0.127
 two-0.105
 least-0.104
 End-0.098
 next-0.093
Token<|endoftext|>
Token ID50256
Feature activation+7.78e-03

Pos logit contributions
izing+0.005
iddled+0.005
pole+0.005
ledge+0.005
 Watch+0.005
Neg logit contributions
 once-0.003
 End-0.003
 night-0.003
 least-0.003
 best-0.003
TokenThe
Token ID464
Feature activation-4.22e-02

Pos logit contributions
iddled+0.011
oon+0.010
it+0.009
sc+0.009
 Exc+0.009
Neg logit contributions
 least-0.012
 End-0.010
 injuries-0.010
mo-0.010
abel-0.009
Token family
Token ID1641
Feature activation-3.21e-02

Pos logit contributions
 least+0.062
 once+0.059
 eventually+0.055
mo+0.055
abel+0.053
Neg logit contributions
 violin-0.056
izing-0.055
igator-0.055
 trophy-0.052
taker-0.052
Token had
Token ID550
Feature activation-1.71e-02

Pos logit contributions
 once+0.042
 End+0.037
 least+0.036
 two+0.035
 best+0.032
Neg logit contributions
iddled-0.054
izing-0.051
opter-0.049
pole-0.049
ledge-0.048
Token,
Token ID11
Feature activation+2.22e-02

Pos logit contributions
iddled+0.037
izing+0.037
ues+0.033
opter+0.033
string+0.033
Neg logit contributions
 once-0.033
 two-0.028
 least-0.028
 End-0.027
m-0.025
Token stop
Token ID2245
Feature activation+6.21e-04

Pos logit contributions
iddled+0.036
izing+0.036
ues+0.032
ledge+0.032
string+0.032
Neg logit contributions
 once-0.034
 two-0.031
 least-0.031
 End-0.029
 next-0.027
Token it
Token ID340
Feature activation+1.48e-02

Pos logit contributions
izing+0.001
iddled+0.001
pole+0.001
opter+0.001
string+0.001
Neg logit contributions
 once-0.001
 two-0.001
 least-0.001
 End-0.001
 night-0.001
Token!"
Token ID2474
Feature activation-2.12e-03

Pos logit contributions
iddled+0.022
izing+0.021
nam+0.020
ledge+0.020
ues+0.019
Neg logit contributions
 once-0.027
 two-0.023
 least-0.023
 next-0.022
m-0.021
Token a
Token ID257
Feature activation+4.33e-02

Pos logit contributions
 once+0.003
 least+0.003
 End+0.003
 two+0.003
m+0.003
Neg logit contributions
izing-0.003
iddled-0.003
opter-0.003
ledge-0.003
ues-0.003
Token voice
Token ID3809
Feature activation+1.06e-01

Pos logit contributions
iddled+0.071
izing+0.068
string+0.063
 ours+0.063
pole+0.063
Neg logit contributions
 two-0.066
 once-0.062
 End-0.059
 next-0.059
 end-0.058
Token shouts
Token ID34757
Feature activation+5.73e-02

Pos logit contributions
izing+0.198
iddled+0.191
opter+0.181
string+0.179
osity+0.174
Neg logit contributions
 once-0.142
 two-0.126
 next-0.116
 least-0.111
 End-0.108
Token.
Token ID13
Feature activation+2.79e-02

Pos logit contributions
iddled+0.104
izing+0.104
ues+0.095
string+0.095
pole+0.094
Neg logit contributions
 once-0.073
 least-0.065
 End-0.062
 two-0.060
 night-0.056
Token It
Token ID632
Feature activation+3.21e-02

Pos logit contributions
izing+0.048
iddled+0.046
pole+0.044
ues+0.043
string+0.043
Neg logit contributions
 once-0.042
 two-0.038
 least-0.033
 next-0.032
 End-0.032
Token is
Token ID318
Feature activation+8.64e-03

Pos logit contributions
izing+0.062
iddled+0.061
opter+0.056
pole+0.056
string+0.056
Neg logit contributions
 once-0.040
 two-0.033
 least-0.033
 End-0.032
 night-0.029
Token Mom
Token ID11254
Feature activation-1.25e-02

Pos logit contributions
izing+0.016
iddled+0.016
opter+0.015
pole+0.015
ues+0.015
Neg logit contributions
 once-0.011
 two-0.009
 least-0.009
 End-0.009
 next-0.008
Token animals
Token ID4695
Feature activation-1.60e-02

Pos logit contributions
izing+0.031
 ours+0.028
 violin+0.028
iddled+0.027
string+0.027
Neg logit contributions
 once-0.046
 least-0.041
 End-0.040
abel-0.039
m-0.039
Token and
Token ID290
Feature activation+1.86e-02

Pos logit contributions
 once+0.021
 two+0.017
 least+0.017
 End+0.017
 night+0.015
Neg logit contributions
izing-0.029
iddled-0.029
pole-0.027
opter-0.026
string-0.026
Token keep
Token ID1394
Feature activation+2.66e-02

Pos logit contributions
izing+0.032
iddled+0.032
opter+0.029
pole+0.029
string+0.029
Neg logit contributions
 once-0.026
 two-0.023
 least-0.022
 End-0.022
 night-0.020
Token a
Token ID257
Feature activation+3.67e-02

Pos logit contributions
izing+0.043
iddled+0.041
 ours+0.040
ledge+0.039
pole+0.038
Neg logit contributions
 once-0.040
 End-0.035
 least-0.034
m-0.033
 two-0.032
Token safe
Token ID3338
Feature activation+5.30e-02

Pos logit contributions
iddled+0.065
izing+0.062
opter+0.056
pole+0.056
string+0.055
Neg logit contributions
 once-0.054
 two-0.048
 least-0.046
 End-0.045
 night-0.044
Token distance
Token ID5253
Feature activation+1.06e-01

Pos logit contributions
izing+0.086
iddled+0.086
opter+0.077
pole+0.077
string+0.076
Neg logit contributions
 once-0.080
 two-0.069
 least-0.069
 End-0.067
 night-0.063
Token."
Token ID526
Feature activation-3.86e-03

Pos logit contributions
izing+0.171
iddled+0.160
pole+0.156
string+0.152
 ours+0.150
Neg logit contributions
 once-0.155
 least-0.140
 End-0.140
 two-0.136
m-0.126
Token
Token ID198
Feature activation+1.68e-02

Pos logit contributions
 once+0.004
 least+0.004
 End+0.004
 best+0.003
 two+0.003
Neg logit contributions
iddled-0.007
izing-0.007
ledge-0.007
pole-0.006
 Watch-0.006
Token
Token ID198
Feature activation+1.36e-02

Pos logit contributions
iddled+0.032
izing+0.032
ledge+0.030
opter+0.030
ues+0.030
Neg logit contributions
 once-0.018
 least-0.018
 End-0.015
 best-0.014
 two-0.014
Token"
Token ID1
Feature activation+3.07e-02

Pos logit contributions
izing+0.022
iddled+0.022
pole+0.020
opter+0.020
string+0.020
Neg logit contributions
 once-0.021
 two-0.018
 least-0.018
 End-0.018
 night-0.016
TokenOkay
Token ID16454
Feature activation+1.73e-02

Pos logit contributions
izing+0.049
iddled+0.048
pole+0.043
opter+0.043
string+0.043
Neg logit contributions
 once-0.049
 least-0.042
 two-0.042
 End-0.041
 night-0.039
Token,
Token ID11
Feature activation+2.77e-02

Pos logit contributions
 once+0.025
 two+0.021
 least+0.021
 End+0.021
 night+0.019
Neg logit contributions
izing-0.031
iddled-0.030
pole-0.028
opter-0.027
string-0.027
Token little
Token ID1310
Feature activation+5.09e-02

Pos logit contributions
izing+0.042
iddled+0.042
opter+0.037
ues+0.037
pole+0.037
Neg logit contributions
 once-0.046
 two-0.041
 least-0.040
 End-0.039
 next-0.037
Token girl
Token ID2576
Feature activation+4.47e-02

Pos logit contributions
iddled+0.094
izing+0.094
pole+0.085
ues+0.085
opter+0.085
Neg logit contributions
 once-0.067
 two-0.058
 least-0.056
 End-0.054
 next-0.051
Token,"
Token ID553
Feature activation-3.13e-02

Pos logit contributions
izing+0.072
iddled+0.072
opter+0.064
pole+0.064
string+0.064
Neg logit contributions
 once-0.069
 two-0.060
 least-0.059
 End-0.058
 next-0.054
Token a
Token ID257
Feature activation+5.12e-02

Pos logit contributions
 once+0.047
 least+0.041
 End+0.040
 two+0.040
m+0.038
Neg logit contributions
izing-0.050
iddled-0.049
ledge-0.045
pole-0.045
string-0.044
Token voice
Token ID3809
Feature activation+1.06e-01

Pos logit contributions
iddled+0.080
izing+0.079
 ours+0.070
pole+0.070
ues+0.069
Neg logit contributions
 two-0.081
 once-0.080
 End-0.072
 next-0.071
m-0.068
Token said
Token ID531
Feature activation+1.80e-02

Pos logit contributions
izing+0.211
iddled+0.200
opter+0.193
osity+0.190
string+0.189
Neg logit contributions
 once-0.132
 two-0.115
 next-0.106
m-0.095
 night-0.094
Token.
Token ID13
Feature activation+2.45e-02

Pos logit contributions
izing+0.040
iddled+0.038
opter+0.037
pole+0.036
ledge+0.036
Neg logit contributions
 once-0.020
 two-0.016
 next-0.013
 least-0.013
m-0.012
Token Lily
Token ID20037
Feature activation+2.74e-02

Pos logit contributions
izing+0.041
iddled+0.038
opter+0.036
pole+0.036
ues+0.035
Neg logit contributions
 once-0.039
 two-0.036
 next-0.032
 End-0.031
 least-0.031
Token was
Token ID373
Feature activation+2.64e-03

Pos logit contributions
izing+0.047
iddled+0.047
pole+0.042
opter+0.042
string+0.042
Neg logit contributions
 once-0.039
 two-0.033
 least-0.033
 End-0.032
 night-0.030
Token scared
Token ID12008
Feature activation+2.15e-02

Pos logit contributions
izing+0.004
iddled+0.004
opter+0.004
ues+0.004
string+0.004
Neg logit contributions
 once-0.005
 two-0.004
 least-0.004
 End-0.004
 next-0.004
Token going
Token ID1016
Feature activation-2.70e-02

Pos logit contributions
 once+0.012
 least+0.010
 two+0.010
 End+0.009
 next+0.009
Neg logit contributions
iddled-0.020
izing-0.020
opter-0.019
pole-0.019
string-0.019
Token on
Token ID319
Feature activation-2.00e-03

Pos logit contributions
 once+0.031
 two+0.026
 least+0.026
 End+0.025
m+0.022
Neg logit contributions
izing-0.052
iddled-0.052
pole-0.048
string-0.048
opter-0.048
Token here
Token ID994
Feature activation-1.06e-02

Pos logit contributions
 once+0.004
 End+0.003
 least+0.003
m+0.003
 two+0.003
Neg logit contributions
izing-0.002
iddled-0.002
 ours-0.002
opter-0.002
ledge-0.002
Token?"
Token ID1701
Feature activation-3.70e-03

Pos logit contributions
 once+0.018
 two+0.016
 least+0.016
 End+0.016
 night+0.015
Neg logit contributions
izing-0.015
iddled-0.015
pole-0.013
opter-0.013
string-0.013
Token a
Token ID257
Feature activation+4.10e-02

Pos logit contributions
 once+0.006
 least+0.005
 End+0.005
m+0.005
fl+0.005
Neg logit contributions
izing-0.005
iddled-0.005
ues-0.005
ledge-0.005
opter-0.005
Token voice
Token ID3809
Feature activation+1.06e-01

Pos logit contributions
iddled+0.072
izing+0.070
pole+0.066
string+0.065
 ours+0.065
Neg logit contributions
 two-0.060
 once-0.055
 End-0.052
 end-0.052
 next-0.050
Token said
Token ID531
Feature activation+2.11e-02

Pos logit contributions
izing+0.203
iddled+0.197
string+0.189
opter+0.188
ues+0.188
Neg logit contributions
 once-0.132
 two-0.116
 next-0.107
m-0.099
 night-0.095
Token.
Token ID13
Feature activation+2.79e-02

Pos logit contributions
izing+0.045
iddled+0.043
opter+0.041
pole+0.041
ledge+0.040
Neg logit contributions
 once-0.024
 two-0.020
 least-0.017
 End-0.017
 next-0.017
Token
Token ID198
Feature activation+2.23e-02

Pos logit contributions
izing+0.049
iddled+0.047
pole+0.045
quet+0.044
opter+0.044
Neg logit contributions
 once-0.043
 two-0.040
 next-0.034
 only-0.033
 End-0.033
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
izing+0.052
iddled+0.052
opter+0.048
ues+0.048
ledge+0.048
Neg logit contributions
 once-0.018
 least-0.014
 two-0.013
 End-0.013
 night-0.011
TokenThey
Token ID2990
Feature activation-8.28e-03

Pos logit contributions
izing+0.029
iddled+0.029
pole+0.028
string+0.027
opter+0.026
Neg logit contributions
 once-0.031
 two-0.029
 End-0.028
 next-0.027
 least-0.025
Token two
Token ID734
Feature activation-3.41e-02

Pos logit contributions
izing+0.053
iddled+0.053
string+0.049
 ours+0.048
opter+0.048
Neg logit contributions
 once-0.039
 End-0.033
 least-0.032
m-0.032
day-0.029
Token hands
Token ID2832
Feature activation+7.85e-02

Pos logit contributions
 once+0.055
 least+0.048
 End+0.048
 two+0.044
m+0.043
Neg logit contributions
izing-0.050
iddled-0.049
opter-0.046
pole-0.044
ues-0.044
Token,
Token ID11
Feature activation+3.64e-02

Pos logit contributions
izing+0.128
iddled+0.127
pole+0.116
opter+0.115
ues+0.115
Neg logit contributions
 once-0.115
 least-0.099
 End-0.099
 two-0.098
 night-0.088
Token but
Token ID475
Feature activation+1.04e-02

Pos logit contributions
izing+0.060
iddled+0.059
pole+0.055
string+0.055
ledge+0.055
Neg logit contributions
 once-0.051
 End-0.045
 least-0.044
m-0.042
 two-0.042
Token no
Token ID645
Feature activation-1.78e-02

Pos logit contributions
izing+0.015
iddled+0.015
pole+0.014
ledge+0.013
ues+0.013
Neg logit contributions
 once-0.017
 End-0.015
 least-0.015
 two-0.014
m-0.014
Token arms
Token ID5101
Feature activation+1.05e-01

Pos logit contributions
 once+0.028
 least+0.024
 End+0.024
 two+0.024
m+0.022
Neg logit contributions
izing-0.027
iddled-0.027
pole-0.025
opter-0.024
string-0.024
Token or
Token ID393
Feature activation+1.74e-02

Pos logit contributions
izing+0.195
iddled+0.192
opter+0.174
ues+0.174
string+0.173
Neg logit contributions
 once-0.139
 two-0.119
 least-0.116
m-0.107
 End-0.107
Token legs
Token ID7405
Feature activation+9.91e-02

Pos logit contributions
izing+0.026
iddled+0.025
opter+0.023
pole+0.023
string+0.022
Neg logit contributions
 once-0.029
 least-0.025
 two-0.025
 End-0.025
m-0.023
Token?"
Token ID1701
Feature activation+1.41e-02

Pos logit contributions
izing+0.180
iddled+0.179
opter+0.163
pole+0.162
ues+0.161
Neg logit contributions
 once-0.133
 two-0.112
 least-0.111
 End-0.106
m-0.101
Token<|endoftext|>
Token ID50256
Feature activation+3.01e-02

Pos logit contributions
izing+0.024
ues+0.024
iddled+0.024
 Watch+0.023
uts+0.023
Neg logit contributions
 least-0.016
 once-0.016
 End-0.015
mo-0.014
fl-0.014
TokenSam
Token ID16305
Feature activation-4.63e-03

Pos logit contributions
iddled+0.060
izing+0.057
oon+0.055
opter+0.053
 Watch+0.053
Neg logit contributions
 least-0.029
 once-0.026
 End-0.025
 eventually-0.024
mo-0.022
Token box
Token ID3091
Feature activation+2.85e-02

Pos logit contributions
 once+0.016
 two+0.014
 next+0.013
 night+0.012
 end+0.012
Neg logit contributions
iddled-0.020
izing-0.020
ivated-0.018
ledge-0.018
osity-0.017
Token.
Token ID13
Feature activation+1.68e-02

Pos logit contributions
izing+0.058
iddled+0.058
opter+0.054
ledge+0.053
string+0.052
Neg logit contributions
 once-0.033
 two-0.027
 next-0.024
 least-0.024
 night-0.023
Token They
Token ID1119
Feature activation+1.28e-02

Pos logit contributions
iddled+0.023
izing+0.022
ues+0.022
 Watch+0.021
ledge+0.021
Neg logit contributions
 once-0.027
 least-0.026
 End-0.025
m-0.023
 end-0.023
Token make
Token ID787
Feature activation+2.37e-02

Pos logit contributions
izing+0.020
iddled+0.020
opter+0.018
pole+0.018
string+0.018
Neg logit contributions
 once-0.020
 two-0.017
 least-0.017
 End-0.017
 night-0.016
Token a
Token ID257
Feature activation+4.98e-02

Pos logit contributions
izing+0.039
iddled+0.038
opter+0.035
pole+0.034
string+0.034
Neg logit contributions
 once-0.036
 two-0.031
 least-0.031
 End-0.030
 night-0.028
Token hole
Token ID7604
Feature activation+1.05e-01

Pos logit contributions
izing+0.071
iddled+0.070
opter+0.063
pole+0.063
ues+0.063
Neg logit contributions
 once-0.085
 least-0.075
 two-0.074
 End-0.072
m-0.069
Token in
Token ID287
Feature activation+2.12e-02

Pos logit contributions
iddled+0.209
izing+0.207
opter+0.198
string+0.187
ledge+0.186
Neg logit contributions
 once-0.133
 two-0.112
 next-0.107
 least-0.095
 End-0.094
Token the
Token ID262
Feature activation-1.06e-02

Pos logit contributions
izing+0.037
iddled+0.036
 ours+0.033
opter+0.032
ledge+0.032
Neg logit contributions
 once-0.028
 least-0.025
 End-0.024
m-0.023
mo-0.023
Token box
Token ID3091
Feature activation+1.82e-02

Pos logit contributions
 once+0.020
 least+0.017
m+0.017
 End+0.017
 two+0.016
Neg logit contributions
izing-0.013
iddled-0.012
pole-0.012
string-0.012
hips-0.012
Token for
Token ID329
Feature activation-2.10e-02

Pos logit contributions
iddled+0.038
izing+0.037
opter+0.035
ledge+0.034
string+0.034
Neg logit contributions
 once-0.021
 two-0.017
 least-0.016
 next-0.015
 End-0.015
Token the
Token ID262
Feature activation+6.75e-03

Pos logit contributions
 once+0.029
 least+0.025
m+0.025
 End+0.024
fl+0.023
Neg logit contributions
izing-0.036
iddled-0.035
nam-0.033
ues-0.033
opter-0.032
Token on
Token ID319
Feature activation-3.74e-03

Pos logit contributions
 once+0.002
 least+0.002
 End+0.002
 two+0.002
 night+0.002
Neg logit contributions
izing-0.003
iddled-0.003
string-0.003
rier-0.003
pole-0.003
Token top
Token ID1353
Feature activation-3.86e-02

Pos logit contributions
 least+0.006
 End+0.006
m+0.006
mo+0.005
 once+0.005
Neg logit contributions
izing-0.005
iddled-0.005
string-0.005
 brace-0.005
stick-0.004
Token of
Token ID286
Feature activation-1.57e-02

Pos logit contributions
 once+0.062
 least+0.054
 two+0.054
 End+0.053
m+0.049
Neg logit contributions
izing-0.059
iddled-0.058
pole-0.052
opter-0.052
string-0.052
Token the
Token ID262
Feature activation-4.31e-02

Pos logit contributions
 once+0.022
 End+0.021
m+0.020
mo+0.020
 least+0.020
Neg logit contributions
izing-0.024
iddled-0.023
nam-0.022
 ours-0.021
 brace-0.021
Token ash
Token ID12530
Feature activation-1.12e-01

Pos logit contributions
 once+0.079
abella+0.072
 eventually+0.070
abel+0.069
 End+0.069
Neg logit contributions
izing-0.050
igator-0.049
pole-0.046
opter-0.043
 canvas-0.042
Tokentr
Token ID2213
Feature activation-1.66e-01

Pos logit contributions
 once+0.147
 End+0.134
 least+0.131
 two+0.129
m+0.117
Neg logit contributions
izing-0.195
iddled-0.187
pole-0.185
string-0.178
opter-0.174
Tokenay
Token ID323
Feature activation-8.91e-03

Pos logit contributions
 once+0.302
 two+0.267
 least+0.267
 End+0.262
 night+0.247
Neg logit contributions
izing-0.219
iddled-0.217
opter-0.191
pole-0.191
string-0.188
Token.
Token ID13
Feature activation-8.66e-03

Pos logit contributions
 End+0.013
 least+0.013
 once+0.012
 moral+0.012
 two+0.012
Neg logit contributions
izing-0.014
iddled-0.013
pole-0.013
string-0.012
uts-0.012
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
 once+0.013
 least+0.013
 End+0.012
 two+0.012
m+0.012
Neg logit contributions
iddled-0.013
izing-0.012
string-0.011
ledge-0.011
pole-0.011
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
 least+0.014
 once+0.013
 End+0.012
 best+0.011
mo+0.011
Neg logit contributions
iddled-0.022
izing-0.021
opter-0.020
uts-0.020
ues-0.020
TokenThe
Token ID464
Feature activation-3.45e-02

Pos logit contributions
 once+0.017
 two+0.014
 least+0.014
 End+0.014
m+0.013
Neg logit contributions
izing-0.022
iddled-0.022
pole-0.020
opter-0.020
string-0.020
Token in
Token ID287
Feature activation+6.50e-03

Pos logit contributions
izing+0.040
iddled+0.038
pole+0.035
string+0.035
opter+0.035
Neg logit contributions
 once-0.028
 least-0.024
 End-0.024
 two-0.023
m-0.021
Token the
Token ID262
Feature activation-2.95e-02

Pos logit contributions
izing+0.010
iddled+0.010
ledge+0.009
pole+0.009
string+0.009
Neg logit contributions
 once-0.009
 least-0.009
 End-0.009
m-0.008
mo-0.007
Token cold
Token ID4692
Feature activation-3.87e-02

Pos logit contributions
 least+0.043
 End+0.043
 once+0.043
abella+0.041
 eventually+0.040
Neg logit contributions
pole-0.034
iddled-0.033
izing-0.033
 sound-0.033
 noise-0.032
Token white
Token ID2330
Feature activation-5.18e-02

Pos logit contributions
 once+0.046
 least+0.039
 two+0.038
 End+0.037
m+0.033
Neg logit contributions
izing-0.075
iddled-0.074
opter-0.069
pole-0.068
string-0.067
Token snow
Token ID6729
Feature activation-8.49e-02

Pos logit contributions
 once+0.062
 least+0.057
 End+0.056
 two+0.052
 eventually+0.048
Neg logit contributions
izing-0.094
iddled-0.093
opter-0.087
pole-0.087
string-0.085
Tokenfl
Token ID2704
Feature activation-1.64e-01

Pos logit contributions
 once+0.108
 least+0.093
 two+0.092
 End+0.092
m+0.081
Neg logit contributions
izing-0.158
iddled-0.153
pole-0.144
opter-0.141
string-0.141
Tokenakes
Token ID1124
Feature activation-3.16e-02

Pos logit contributions
 once+0.296
 End+0.295
 least+0.281
 two+0.279
 moral+0.267
Neg logit contributions
izing-0.196
iddled-0.195
pole-0.176
opter-0.175
imm-0.169
Token and
Token ID290
Feature activation-1.44e-02

Pos logit contributions
 once+0.035
 two+0.027
 least+0.026
 next+0.024
 night+0.024
Neg logit contributions
iddled-0.069
izing-0.067
opter-0.063
ues-0.063
ledge-0.063
Token sh
Token ID427
Feature activation-2.04e-02

Pos logit contributions
 once+0.026
 End+0.023
 least+0.023
 two+0.022
m+0.022
Neg logit contributions
izing-0.018
iddled-0.018
pole-0.016
string-0.016
ledge-0.016
Tokeniver
Token ID1428
Feature activation+3.55e-02

Pos logit contributions
 once+0.041
 two+0.037
 least+0.037
 End+0.037
 night+0.035
Neg logit contributions
izing-0.023
iddled-0.023
pole-0.019
opter-0.019
ues-0.019
Token when
Token ID618
Feature activation-1.03e-02

Pos logit contributions
izing+0.063
iddled+0.060
pole+0.056
opter+0.055
string+0.055
Neg logit contributions
 once-0.048
 least-0.043
 End-0.042
 two-0.041
m-0.037
Token the
Token ID262
Feature activation-1.95e-02

Pos logit contributions
izing+0.078
iddled+0.078
opter+0.072
ues+0.072
pole+0.071
Neg logit contributions
 once-0.047
 two-0.038
 least-0.038
 End-0.037
 night-0.035
Token trash
Token ID13913
Feature activation-1.27e-02

Pos logit contributions
 once+0.036
 least+0.033
 End+0.032
 two+0.031
 eventually+0.029
Neg logit contributions
izing-0.025
iddled-0.024
pole-0.021
opter-0.021
 ours-0.021
Token from
Token ID422
Feature activation-2.89e-02

Pos logit contributions
 once+0.016
 two+0.013
 least+0.013
 End+0.012
 next+0.012
Neg logit contributions
izing-0.025
iddled-0.025
opter-0.023
ues-0.022
pole-0.022
Token the
Token ID262
Feature activation-2.92e-02

Pos logit contributions
 once+0.043
m+0.042
abel+0.039
 End+0.039
 two+0.039
Neg logit contributions
nam-0.040
izing-0.040
igator-0.038
 ours-0.037
iddled-0.036
Token ash
Token ID12530
Feature activation-1.08e-01

Pos logit contributions
 once+0.053
abel+0.049
abella+0.047
 End+0.047
 least+0.047
Neg logit contributions
izing-0.033
 ours-0.030
 flow-0.029
igator-0.028
 hed-0.028
Tokentr
Token ID2213
Feature activation-1.63e-01

Pos logit contributions
 once+0.149
 End+0.144
 least+0.136
 two+0.135
m+0.121
Neg logit contributions
izing-0.173
pole-0.166
iddled-0.165
string-0.160
stick-0.156
Tokenay
Token ID323
Feature activation-1.10e-02

Pos logit contributions
 once+0.299
 two+0.267
 least+0.266
 End+0.263
 night+0.246
Neg logit contributions
izing-0.211
iddled-0.210
opter-0.184
pole-0.184
ues-0.182
Token.
Token ID13
Feature activation-7.38e-03

Pos logit contributions
 End+0.017
 least+0.016
 moral+0.015
 once+0.015
 two+0.015
Neg logit contributions
izing-0.016
iddled-0.015
uts-0.015
pole-0.015
string-0.015
Token The
Token ID383
Feature activation-2.67e-02

Pos logit contributions
 once+0.012
 two+0.010
 End+0.009
 least+0.009
 night+0.009
Neg logit contributions
izing-0.013
iddled-0.012
opter-0.011
ues-0.011
pole-0.011
Token people
Token ID661
Feature activation+5.41e-03

Pos logit contributions
 once+0.049
abel+0.049
abella+0.048
 least+0.048
mo+0.044
Neg logit contributions
izing-0.024
 flow-0.024
igator-0.023
 sound-0.023
 noise-0.022
Token in
Token ID287
Feature activation+1.20e-02

Pos logit contributions
izing+0.010
iddled+0.010
pole+0.009
opter+0.009
string+0.009
Neg logit contributions
 once-0.007
 least-0.006
 two-0.006
 End-0.006
 night-0.005
Token something
Token ID1223
Feature activation+3.82e-03

Pos logit contributions
 once+0.009
 least+0.009
 End+0.008
m+0.007
 night+0.007
Neg logit contributions
iddled-0.013
izing-0.013
string-0.011
pole-0.011
ledge-0.011
Token moving
Token ID3867
Feature activation-3.79e-02

Pos logit contributions
iddled+0.006
izing+0.006
opter+0.005
ues+0.005
pole+0.005
Neg logit contributions
 once-0.006
 two-0.005
 least-0.005
 End-0.005
 next-0.005
Token in
Token ID287
Feature activation+7.34e-03

Pos logit contributions
 once+0.058
 two+0.049
 least+0.048
 End+0.046
 next+0.045
Neg logit contributions
izing-0.063
iddled-0.062
opter-0.058
ues-0.057
pole-0.056
Token the
Token ID262
Feature activation-2.90e-02

Pos logit contributions
izing+0.013
iddled+0.012
string+0.011
ledge+0.011
pole+0.011
Neg logit contributions
 once-0.010
 least-0.009
 End-0.009
m-0.008
mo-0.008
Token wild
Token ID4295
Feature activation-2.40e-02

Pos logit contributions
 once+0.048
 least+0.042
 End+0.041
m+0.040
 two+0.039
Neg logit contributions
izing-0.041
pole-0.037
iddled-0.037
string-0.037
ledge-0.035
Tokenfl
Token ID2704
Feature activation-1.61e-01

Pos logit contributions
 once+0.034
 least+0.029
 two+0.029
 End+0.029
m+0.026
Neg logit contributions
izing-0.042
iddled-0.040
pole-0.037
string-0.037
ledge-0.037
Tokenowers
Token ID3618
Feature activation-2.95e-02

Pos logit contributions
 once+0.312
 End+0.302
 two+0.287
 least+0.287
 moral+0.278
Neg logit contributions
izing-0.185
iddled-0.176
pole-0.165
opter-0.156
quet-0.156
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
 once+0.040
 End+0.036
 least+0.035
 two+0.034
 night+0.031
Neg logit contributions
izing-0.051
iddled-0.049
pole-0.046
string-0.045
ledge-0.044
Token It
Token ID632
Feature activation+1.36e-02

Pos logit contributions
 once+0.022
 least+0.020
 End+0.019
 two+0.019
m+0.018
Neg logit contributions
iddled-0.016
izing-0.016
pole-0.014
string-0.014
ledge-0.014
Token was
Token ID373
Feature activation-7.56e-03

Pos logit contributions
izing+0.029
iddled+0.029
pole+0.027
opter+0.027
string+0.027
Neg logit contributions
 once-0.013
 least-0.011
 two-0.010
 End-0.010
 night-0.009
Token a
Token ID257
Feature activation+1.68e-02

Pos logit contributions
 once+0.009
 least+0.008
 End+0.008
 two+0.007
m+0.007
Neg logit contributions
izing-0.014
iddled-0.014
pole-0.013
string-0.013
ledge-0.012
Token around
Token ID1088
Feature activation+1.65e-02

Pos logit contributions
izing+0.018
iddled+0.018
opter+0.016
ues+0.016
pole+0.016
Neg logit contributions
 once-0.019
 two-0.016
 least-0.016
 End-0.016
 next-0.015
Token town
Token ID3240
Feature activation+1.41e-02

Pos logit contributions
izing+0.027
iddled+0.026
ledge+0.024
string+0.024
ues+0.023
Neg logit contributions
 once-0.021
 least-0.020
 End-0.020
m-0.019
 two-0.019
Token on
Token ID319
Feature activation-2.44e-02

Pos logit contributions
izing+0.025
iddled+0.025
opter+0.022
pole+0.022
string+0.022
Neg logit contributions
 once-0.019
 two-0.016
 least-0.016
 End-0.016
 night-0.015
Token his
Token ID465
Feature activation-4.17e-02

Pos logit contributions
 once+0.034
m+0.034
 End+0.033
 least+0.032
mo+0.030
Neg logit contributions
izing-0.037
iddled-0.036
ledge-0.033
string-0.033
hips-0.032
Token ash
Token ID12530
Feature activation-1.07e-01

Pos logit contributions
 once+0.068
m+0.065
 End+0.061
 two+0.058
Sometimes+0.058
Neg logit contributions
izing-0.053
pole-0.050
 brace-0.049
 violin-0.049
ledge-0.048
Tokentr
Token ID2213
Feature activation-1.59e-01

Pos logit contributions
 once+0.146
 End+0.134
 least+0.132
 two+0.131
m+0.120
Neg logit contributions
izing-0.179
iddled-0.172
pole-0.170
string-0.164
stick-0.162
Tokenay
Token ID323
Feature activation-5.69e-03

Pos logit contributions
 once+0.288
 least+0.254
 two+0.254
 End+0.249
m+0.235
Neg logit contributions
izing-0.214
iddled-0.211
pole-0.186
opter-0.186
string-0.184
Token,
Token ID11
Feature activation-2.55e-03

Pos logit contributions
 once+0.008
 End+0.008
 least+0.008
 two+0.007
 moral+0.007
Neg logit contributions
izing-0.009
iddled-0.009
pole-0.008
string-0.008
stick-0.008
Token z
Token ID1976
Feature activation-3.48e-02

Pos logit contributions
 once+0.004
 two+0.003
 least+0.003
 End+0.003
 next+0.003
Neg logit contributions
izing-0.004
iddled-0.004
opter-0.004
ues-0.004
pole-0.004
Tokenipping
Token ID4501
Feature activation-4.31e-03

Pos logit contributions
 once+0.059
 two+0.052
 least+0.052
 End+0.051
 night+0.049
Neg logit contributions
iddled-0.051
izing-0.051
ues-0.045
opter-0.045
ledge-0.044
Token everywhere
Token ID8347
Feature activation+3.97e-02

Pos logit contributions
 once+0.007
 two+0.006
 least+0.005
 End+0.005
 night+0.005
Neg logit contributions
izing-0.007
iddled-0.007
opter-0.006
ues-0.006
pole-0.006
Token look
Token ID804
Feature activation+2.67e-03

Pos logit contributions
izing+0.028
iddled+0.027
pole+0.025
rier+0.025
ues+0.025
Neg logit contributions
 once-0.019
 two-0.017
 least-0.016
 night-0.015
 morning-0.015
Token at
Token ID379
Feature activation+5.72e-03

Pos logit contributions
iddled+0.006
izing+0.005
opter+0.005
string+0.005
pole+0.005
Neg logit contributions
 once-0.003
 two-0.003
 least-0.002
 next-0.002
 night-0.002
Token the
Token ID262
Feature activation-6.45e-03

Pos logit contributions
izing+0.009
iddled+0.009
nam+0.008
 ours+0.008
ledge+0.008
Neg logit contributions
 once-0.008
 End-0.008
m-0.007
 least-0.007
 eventually-0.007
Token big
Token ID1263
Feature activation-4.63e-04

Pos logit contributions
 once+0.010
 least+0.009
 End+0.009
 two+0.008
 eventually+0.008
Neg logit contributions
izing-0.010
iddled-0.009
pole-0.009
string-0.009
opter-0.009
Token sun
Token ID4252
Feature activation-8.53e-03

Pos logit contributions
 once+0.001
 two+0.001
 least+0.001
 End+0.001
 night+0.001
Neg logit contributions
iddled-0.001
izing-0.001
ues-0.001
pole-0.001
opter-0.001
Tokenfl
Token ID2704
Feature activation-1.59e-01

Pos logit contributions
 End+0.011
 least+0.011
 once+0.011
 two+0.010
 moral+0.009
Neg logit contributions
izing-0.015
pole-0.014
string-0.013
iddled-0.013
quet-0.012
Tokenowers
Token ID3618
Feature activation-2.41e-02

Pos logit contributions
 once+0.297
 End+0.282
 least+0.280
 two+0.273
 eventually+0.262
Neg logit contributions
izing-0.195
iddled-0.178
pole-0.168
opter-0.162
quet-0.159
Token in
Token ID287
Feature activation-5.49e-03

Pos logit contributions
 once+0.031
 least+0.027
 two+0.026
 End+0.026
 night+0.023
Neg logit contributions
izing-0.044
iddled-0.043
pole-0.040
string-0.040
opter-0.039
Token her
Token ID607
Feature activation-2.18e-02

Pos logit contributions
 once+0.008
 least+0.007
 End+0.007
mo+0.007
m+0.007
Neg logit contributions
izing-0.009
iddled-0.008
ledge-0.008
nam-0.008
 ours-0.007
Token garden
Token ID11376
Feature activation+7.09e-03

Pos logit contributions
 once+0.028
 least+0.024
 End+0.023
 two+0.022
m+0.021
Neg logit contributions
izing-0.038
iddled-0.038
ledge-0.036
pole-0.035
ues-0.035
Token.
Token ID13
Feature activation-2.14e-02

Pos logit contributions
izing+0.013
iddled+0.012
pole+0.011
ues+0.011
ledge+0.011
Neg logit contributions
 once-0.009
 least-0.008
 End-0.008
 two-0.008
m-0.007
Token and
Token ID290
Feature activation-1.88e-02

Pos logit contributions
 once+0.059
 End+0.054
 least+0.053
 two+0.050
day+0.046
Neg logit contributions
izing-0.078
iddled-0.075
pole-0.071
opter-0.069
string-0.069
Token accidentally
Token ID14716
Feature activation-5.12e-03

Pos logit contributions
 once+0.027
 two+0.024
 least+0.023
 End+0.022
 night+0.021
Neg logit contributions
izing-0.033
iddled-0.032
ues-0.029
pole-0.029
opter-0.029
Token dropped
Token ID5710
Feature activation-1.31e-02

Pos logit contributions
 once+0.007
 two+0.007
 least+0.006
 night+0.006
 only+0.006
Neg logit contributions
izing-0.010
iddled-0.009
pole-0.008
ues-0.008
ledge-0.008
Token her
Token ID607
Feature activation-5.84e-03

Pos logit contributions
 once+0.019
 least+0.016
 two+0.016
 End+0.016
m+0.015
Neg logit contributions
izing-0.022
iddled-0.022
pole-0.020
opter-0.020
string-0.020
Token ash
Token ID12530
Feature activation-1.06e-01

Pos logit contributions
 End+0.011
 once+0.010
 least+0.010
day+0.010
abel+0.010
Neg logit contributions
izing-0.006
 ours-0.006
pole-0.006
rier-0.006
adoes-0.006
Tokentr
Token ID2213
Feature activation-1.58e-01

Pos logit contributions
 once+0.135
 End+0.118
 least+0.116
 two+0.114
m+0.103
Neg logit contributions
izing-0.193
iddled-0.190
pole-0.177
string-0.176
opter-0.174
Tokenay
Token ID323
Feature activation-7.03e-03

Pos logit contributions
 once+0.281
 two+0.246
 least+0.245
 End+0.236
m+0.228
Neg logit contributions
izing-0.220
iddled-0.215
pole-0.191
ledge-0.190
opter-0.190
Token on
Token ID319
Feature activation-2.68e-02

Pos logit contributions
 End+0.009
 once+0.009
 least+0.009
 two+0.008
day+0.008
Neg logit contributions
izing-0.012
iddled-0.012
string-0.011
pole-0.011
uts-0.011
Token the
Token ID262
Feature activation-2.57e-02

Pos logit contributions
 once+0.033
 least+0.029
 End+0.028
 two+0.028
m+0.026
Neg logit contributions
izing-0.050
iddled-0.049
pole-0.045
string-0.045
opter-0.045
Token ground
Token ID2323
Feature activation+1.02e-02

Pos logit contributions
 once+0.047
 End+0.042
 least+0.040
m+0.040
 two+0.039
Neg logit contributions
izing-0.031
pole-0.030
string-0.029
igator-0.028
 ours-0.028
Token.
Token ID13
Feature activation-1.94e-02

Pos logit contributions
iddled+0.023
izing+0.022
opter+0.021
ues+0.021
ledge+0.021
Neg logit contributions
 once-0.010
 two-0.008
 next-0.007
 least-0.007
m-0.007
Token Lily
Token ID20037
Feature activation-6.63e-03

Pos logit contributions
 once+0.012
 least+0.011
 End+0.011
 two+0.011
m+0.010
Neg logit contributions
izing-0.010
iddled-0.010
pole-0.009
opter-0.009
ues-0.009
Token accidentally
Token ID14716
Feature activation-4.35e-03

Pos logit contributions
 once+0.009
 End+0.009
 least+0.008
 two+0.008
day+0.007
Neg logit contributions
iddled-0.010
izing-0.010
string-0.010
opter-0.009
pole-0.009
Token dropped
Token ID5710
Feature activation-1.64e-02

Pos logit contributions
 once+0.007
 two+0.006
 least+0.006
 night+0.005
 next+0.005
Neg logit contributions
izing-0.008
iddled-0.007
pole-0.007
opter-0.007
ues-0.007
Token the
Token ID262
Feature activation-2.88e-02

Pos logit contributions
 once+0.023
 End+0.021
 least+0.020
m+0.019
day+0.019
Neg logit contributions
izing-0.027
iddled-0.027
string-0.024
adoes-0.024
 ours-0.024
Token ash
Token ID12530
Feature activation-1.10e-01

Pos logit contributions
 once+0.051
 End+0.050
 least+0.048
day+0.047
abel+0.047
Neg logit contributions
izing-0.030
pole-0.030
igator-0.029
 ours-0.028
iddled-0.028
Tokentr
Token ID2213
Feature activation-1.58e-01

Pos logit contributions
 once+0.141
 End+0.124
 least+0.123
 two+0.121
m+0.108
Neg logit contributions
izing-0.198
iddled-0.195
pole-0.184
string-0.182
opter-0.179
Tokenay
Token ID323
Feature activation-1.17e-02

Pos logit contributions
 once+0.278
 two+0.242
 least+0.241
 End+0.233
m+0.225
Neg logit contributions
izing-0.222
iddled-0.217
pole-0.192
ledge-0.192
opter-0.191
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
 End+0.016
 once+0.015
 least+0.015
 two+0.014
 moral+0.013
Neg logit contributions
izing-0.019
iddled-0.019
string-0.018
pole-0.018
opter-0.017
Token it
Token ID340
Feature activation+1.23e-03

Pos logit contributions
 once+0.028
 two+0.025
 least+0.024
 night+0.023
 eventually+0.022
Neg logit contributions
izing-0.034
iddled-0.033
ues-0.031
pole-0.031
ledge-0.031
Token broke
Token ID6265
Feature activation-3.02e-02

Pos logit contributions
izing+0.002
iddled+0.002
ledge+0.002
ues+0.002
opter+0.002
Neg logit contributions
 once-0.002
 two-0.002
 least-0.002
 next-0.002
 night-0.002
Token into
Token ID656
Feature activation+2.89e-02

Pos logit contributions
 once+0.047
 least+0.041
 End+0.041
 two+0.040
m+0.037
Neg logit contributions
izing-0.048
iddled-0.048
opter-0.043
pole-0.042
string-0.042
Token to
Token ID284
Feature activation-1.31e-03

Pos logit contributions
izing+0.003
iddled+0.003
pole+0.003
ues+0.003
ledge+0.003
Neg logit contributions
 once-0.003
 least-0.002
 End-0.002
 two-0.002
m-0.002
Token play
Token ID711
Feature activation-1.55e-02

Pos logit contributions
 once+0.002
 two+0.002
 least+0.002
 End+0.002
 night+0.002
Neg logit contributions
iddled-0.002
izing-0.002
opter-0.002
string-0.002
ues-0.002
Token with
Token ID351
Feature activation+5.86e-03

Pos logit contributions
 once+0.021
 two+0.018
 least+0.017
 End+0.017
 next+0.016
Neg logit contributions
iddled-0.028
izing-0.028
opter-0.026
string-0.026
ledge-0.026
Token the
Token ID262
Feature activation-2.90e-03

Pos logit contributions
izing+0.010
iddled+0.009
pole+0.008
ues+0.008
ledge+0.008
Neg logit contributions
 once-0.008
 End-0.008
 least-0.008
 two-0.007
m-0.007
Token sun
Token ID4252
Feature activation+2.42e-03

Pos logit contributions
 once+0.005
 End+0.004
 least+0.004
abel+0.004
 Until+0.004
Neg logit contributions
izing-0.004
pole-0.004
 flow-0.003
igator-0.003
 brace-0.003
Tokenfl
Token ID2704
Feature activation-1.56e-01

Pos logit contributions
pole+0.004
izing+0.004
quet+0.004
string+0.004
iddled+0.004
Neg logit contributions
 End-0.003
 least-0.003
 once-0.003
 moral-0.003
 two-0.003
Tokenowers
Token ID3618
Feature activation-1.68e-02

Pos logit contributions
 once+0.303
 End+0.299
 least+0.290
 two+0.285
 moral+0.272
Neg logit contributions
izing-0.176
iddled-0.161
pole-0.154
quet-0.149
imm-0.147
Token,
Token ID11
Feature activation+8.20e-03

Pos logit contributions
 once+0.022
 least+0.019
 End+0.019
 two+0.018
 night+0.016
Neg logit contributions
izing-0.031
iddled-0.030
pole-0.028
string-0.027
opter-0.027
Token and
Token ID290
Feature activation+1.50e-03

Pos logit contributions
izing+0.014
iddled+0.014
pole+0.013
string+0.012
ledge+0.012
Neg logit contributions
 once-0.012
 End-0.010
 least-0.010
 two-0.010
m-0.009
Token they
Token ID484
Feature activation-9.18e-03

Pos logit contributions
izing+0.002
pole+0.002
iddled+0.002
nam+0.002
string+0.002
Neg logit contributions
 once-0.002
 End-0.002
 least-0.002
m-0.002
 two-0.002
Token soon
Token ID2582
Feature activation-4.92e-02

Pos logit contributions
 once+0.013
 End+0.011
 least+0.011
 two+0.011
 next+0.010
Neg logit contributions
iddled-0.015
pole-0.015
izing-0.014
string-0.014
opter-0.014
Token a
Token ID257
Feature activation-6.21e-03

Pos logit contributions
m+0.016
mo+0.015
 once+0.015
 End+0.015
abel+0.013
Neg logit contributions
ledge-0.014
 click-0.014
 ours-0.013
nam-0.013
string-0.013
Token me
Token ID502
Feature activation+1.80e-02

Pos logit contributions
 once+0.010
 least+0.009
 two+0.008
 End+0.008
m+0.008
Neg logit contributions
izing-0.010
iddled-0.009
opter-0.008
pole-0.008
ledge-0.008
Tokenadow
Token ID4584
Feature activation+3.02e-02

Pos logit contributions
iddled+0.016
ledge+0.014
ooting+0.013
pole+0.013
izing+0.013
Neg logit contributions
 once-0.038
 End-0.037
 two-0.037
 moral-0.035
 least-0.034
Token of
Token ID286
Feature activation-3.08e-02

Pos logit contributions
izing+0.054
iddled+0.053
pole+0.050
string+0.049
ledge+0.049
Neg logit contributions
 once-0.039
 least-0.035
 two-0.034
 End-0.033
m-0.030
Token wild
Token ID4295
Feature activation-1.78e-02

Pos logit contributions
 once+0.044
 least+0.042
 End+0.041
m+0.039
mo+0.037
Neg logit contributions
iddled-0.045
izing-0.043
igator-0.041
 hed-0.041
nam-0.041
Tokenfl
Token ID2704
Feature activation-1.56e-01

Pos logit contributions
 once+0.021
 least+0.021
 End+0.021
 two+0.019
m+0.017
Neg logit contributions
izing-0.033
iddled-0.030
ledge-0.029
pole-0.028
adoes-0.028
Tokenowers
Token ID3618
Feature activation-2.27e-02

Pos logit contributions
 End+0.329
 moral+0.311
 once+0.303
 least+0.300
 eventually+0.293
Neg logit contributions
izing-0.159
iddled-0.145
pole-0.139
imm-0.135
quet-0.134
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
 once+0.031
 End+0.030
 least+0.029
 two+0.028
mo+0.026
Neg logit contributions
izing-0.037
iddled-0.035
pole-0.034
string-0.033
ledge-0.032
Token It
Token ID632
Feature activation+1.54e-02

Pos logit contributions
 once+0.018
 least+0.016
 two+0.016
 End+0.015
m+0.014
Neg logit contributions
izing-0.019
iddled-0.019
pole-0.017
opter-0.017
string-0.017
Token was
Token ID373
Feature activation-4.42e-03

Pos logit contributions
izing+0.030
iddled+0.029
pole+0.028
opter+0.027
string+0.027
Neg logit contributions
 once-0.017
 least-0.016
 End-0.015
 two-0.014
day-0.013
Token the
Token ID262
Feature activation+1.84e-02

Pos logit contributions
 once+0.006
 least+0.005
 End+0.005
 two+0.005
m+0.005
Neg logit contributions
izing-0.007
iddled-0.007
pole-0.007
string-0.007
ledge-0.007
Token was
Token ID373
Feature activation-1.38e-02

Pos logit contributions
 once+0.066
 least+0.055
 two+0.055
 End+0.054
m+0.048
Neg logit contributions
izing-0.107
iddled-0.106
opter-0.097
pole-0.097
string-0.096
Token running
Token ID2491
Feature activation-1.34e-02

Pos logit contributions
 once+0.021
 End+0.021
 least+0.020
 two+0.019
day+0.018
Neg logit contributions
iddled-0.018
izing-0.018
pole-0.017
ledge-0.017
string-0.016
Token around
Token ID1088
Feature activation+2.10e-02

Pos logit contributions
 once+0.021
 two+0.018
 least+0.017
 next+0.017
 night+0.016
Neg logit contributions
izing-0.022
iddled-0.021
opter-0.020
ues-0.020
pole-0.019
Token the
Token ID262
Feature activation-2.59e-02

Pos logit contributions
izing+0.034
iddled+0.032
ledge+0.029
pole+0.029
string+0.029
Neg logit contributions
 once-0.029
 least-0.028
 End-0.027
 two-0.026
m-0.025
Token ash
Token ID12530
Feature activation-1.01e-01

Pos logit contributions
 once+0.039
add+0.038
 End+0.037
oll+0.037
abella+0.037
Neg logit contributions
izing-0.032
pole-0.030
 frame-0.028
 square-0.028
 canvas-0.028
Tokentr
Token ID2213
Feature activation-1.56e-01

Pos logit contributions
 once+0.135
 End+0.128
 two+0.123
 least+0.122
m+0.111
Neg logit contributions
izing-0.170
pole-0.165
iddled-0.162
string-0.156
stick-0.153
Tokenay
Token ID323
Feature activation-1.08e-03

Pos logit contributions
 once+0.287
 two+0.256
 least+0.255
 End+0.253
 night+0.236
Neg logit contributions
izing-0.202
iddled-0.201
opter-0.176
pole-0.176
ues-0.173
Token.
Token ID13
Feature activation-7.22e-03

Pos logit contributions
 End+0.001
 least+0.001
 once+0.001
 two+0.001
 moral+0.001
Neg logit contributions
izing-0.002
iddled-0.002
pole-0.002
string-0.002
opter-0.002
Token The
Token ID383
Feature activation-2.30e-02

Pos logit contributions
 once+0.011
 least+0.010
 End+0.010
 two+0.010
 night+0.009
Neg logit contributions
iddled-0.011
izing-0.011
pole-0.010
ledge-0.010
ues-0.010
Token child
Token ID1200
Feature activation-6.93e-03

Pos logit contributions
 once+0.037
 least+0.033
 eventually+0.033
day+0.032
 Until+0.032
Neg logit contributions
izing-0.029
 sound-0.027
opter-0.027
 click-0.026
 noise-0.026
Token said
Token ID531
Feature activation-4.17e-03

Pos logit contributions
 once+0.010
 End+0.009
 least+0.009
 two+0.008
day+0.008
Neg logit contributions
iddled-0.011
izing-0.011
pole-0.011
opter-0.010
string-0.010
Token accidentally
Token ID14716
Feature activation-1.72e-03

Pos logit contributions
 once+0.028
 End+0.026
 least+0.026
fl+0.024
day+0.024
Neg logit contributions
iddled-0.028
izing-0.025
opter-0.025
ues-0.024
string-0.024
Token knocked
Token ID13642
Feature activation-5.94e-02

Pos logit contributions
 once+0.002
 two+0.002
 least+0.002
 night+0.002
 End+0.002
Neg logit contributions
izing-0.003
iddled-0.003
pole-0.003
ues-0.003
opter-0.003
Token over
Token ID625
Feature activation-2.81e-02

Pos logit contributions
 End+0.102
 least+0.096
abel+0.095
 once+0.094
uf+0.090
Neg logit contributions
iddled-0.075
izing-0.066
taker-0.064
igator-0.061
string-0.060
Token the
Token ID262
Feature activation-2.60e-02

Pos logit contributions
 End+0.047
 least+0.045
 once+0.041
day+0.040
mo+0.040
Neg logit contributions
iddled-0.037
nam-0.035
izing-0.033
taker-0.031
adoes-0.030
Token ash
Token ID12530
Feature activation-1.02e-01

Pos logit contributions
 once+0.047
 least+0.044
 End+0.043
day+0.041
 eventually+0.040
Neg logit contributions
izing-0.030
iddled-0.029
rier-0.029
 ours-0.028
nam-0.027
Tokentr
Token ID2213
Feature activation-1.55e-01

Pos logit contributions
 once+0.128
 least+0.109
 End+0.108
 two+0.108
m+0.096
Neg logit contributions
izing-0.190
iddled-0.188
pole-0.174
string-0.173
opter-0.172
Tokenay
Token ID323
Feature activation-6.30e-03

Pos logit contributions
 once+0.272
 least+0.236
 two+0.235
 End+0.225
m+0.219
Neg logit contributions
izing-0.223
iddled-0.217
pole-0.193
ledge-0.193
opter-0.192
Token and
Token ID290
Feature activation-1.76e-02

Pos logit contributions
 once+0.008
 End+0.008
 least+0.008
 two+0.007
 best+0.007
Neg logit contributions
izing-0.011
iddled-0.011
string-0.010
pole-0.010
opter-0.010
Token it
Token ID340
Feature activation+4.40e-03

Pos logit contributions
 once+0.026
 two+0.023
 least+0.022
 night+0.020
 End+0.020
Neg logit contributions
izing-0.031
iddled-0.030
pole-0.028
ues-0.028
ledge-0.028
Token broke
Token ID6265
Feature activation-2.96e-02

Pos logit contributions
izing+0.007
iddled+0.007
ledge+0.007
ues+0.007
opter+0.006
Neg logit contributions
 once-0.007
 two-0.006
 least-0.006
 next-0.006
 End-0.006
Token into
Token ID656
Feature activation+3.09e-02

Pos logit contributions
 once+0.045
 two+0.039
 least+0.038
 End+0.037
 night+0.035
Neg logit contributions
izing-0.049
iddled-0.048
pole-0.044
opter-0.044
string-0.043
Token loved
Token ID6151
Feature activation-9.35e-03

Pos logit contributions
 once+0.027
 End+0.023
 two+0.023
 least+0.022
m+0.020
Neg logit contributions
iddled-0.043
izing-0.042
opter-0.039
pole-0.039
string-0.039
Token to
Token ID284
Feature activation+1.46e-02

Pos logit contributions
 once+0.014
 least+0.012
 End+0.012
 two+0.012
m+0.011
Neg logit contributions
izing-0.015
iddled-0.015
ledge-0.013
pole-0.013
ues-0.013
Token watch
Token ID2342
Feature activation-2.04e-02

Pos logit contributions
izing+0.023
iddled+0.023
pole+0.021
opter+0.021
string+0.021
Neg logit contributions
 once-0.023
 two-0.020
 least-0.020
 End-0.019
 night-0.018
Token the
Token ID262
Feature activation-1.19e-02

Pos logit contributions
 once+0.025
 two+0.021
 least+0.020
 End+0.020
 night+0.018
Neg logit contributions
izing-0.039
iddled-0.039
opter-0.036
pole-0.036
string-0.036
Token sun
Token ID4252
Feature activation+8.23e-03

Pos logit contributions
 once+0.020
 least+0.018
 End+0.017
mo+0.017
abel+0.017
Neg logit contributions
izing-0.017
 flow-0.014
pole-0.014
iddled-0.014
rier-0.014
Tokenfl
Token ID2704
Feature activation-1.54e-01

Pos logit contributions
izing+0.015
pole+0.014
iddled+0.014
string+0.013
quet+0.013
Neg logit contributions
 once-0.010
 least-0.010
 End-0.009
 two-0.009
abel-0.008
Tokenowers
Token ID3618
Feature activation-1.81e-02

Pos logit contributions
 once+0.292
 End+0.275
 least+0.271
 two+0.270
 eventually+0.247
Neg logit contributions
izing-0.187
iddled-0.172
pole-0.160
opter-0.154
quet-0.153
Token grow
Token ID1663
Feature activation+1.79e-02

Pos logit contributions
 once+0.025
 least+0.022
 two+0.022
 End+0.022
m+0.020
Neg logit contributions
izing-0.031
iddled-0.030
pole-0.028
opter-0.028
string-0.028
Token and
Token ID290
Feature activation-5.26e-03

Pos logit contributions
izing+0.030
iddled+0.029
pole+0.027
ues+0.026
string+0.026
Neg logit contributions
 once-0.025
 least-0.023
 End-0.023
 two-0.022
 night-0.020
Token remembered
Token ID12086
Feature activation-1.69e-02

Pos logit contributions
 once+0.008
 least+0.007
 End+0.007
 two+0.007
m+0.007
Neg logit contributions
izing-0.008
iddled-0.008
pole-0.007
ledge-0.007
string-0.007
Token the
Token ID262
Feature activation+9.66e-03

Pos logit contributions
 once+0.027
 End+0.025
 least+0.024
m+0.024
 two+0.023
Neg logit contributions
izing-0.024
iddled-0.022
ledge-0.021
pole-0.020
nam-0.020
Token the
Token ID262
Feature activation+8.58e-03

Pos logit contributions
iddled+0.004
izing+0.004
pole+0.003
string+0.003
opter+0.003
Neg logit contributions
 once-0.004
 End-0.003
 least-0.003
m-0.003
 two-0.003
Token top
Token ID1353
Feature activation-2.87e-02

Pos logit contributions
taker+0.009
 click+0.009
igator+0.009
izing+0.009
 noise+0.009
Neg logit contributions
 once-0.015
 least-0.014
day-0.014
abella-0.013
abel-0.013
Token of
Token ID286
Feature activation-2.17e-02

Pos logit contributions
 once+0.045
 least+0.040
 two+0.040
 End+0.039
m+0.036
Neg logit contributions
izing-0.045
iddled-0.044
pole-0.040
opter-0.040
string-0.039
Token the
Token ID262
Feature activation-3.60e-02

Pos logit contributions
 once+0.033
m+0.032
 End+0.031
abel+0.030
day+0.030
Neg logit contributions
izing-0.030
iddled-0.029
 ours-0.027
ledge-0.026
pole-0.026
Token ash
Token ID12530
Feature activation-1.02e-01

Pos logit contributions
 once+0.064
abella+0.058
 least+0.056
 eventually+0.055
abel+0.055
Neg logit contributions
izing-0.038
igator-0.038
taker-0.037
 click-0.036
pole-0.034
Tokentr
Token ID2213
Feature activation-1.54e-01

Pos logit contributions
 once+0.133
 End+0.119
 least+0.119
 two+0.116
m+0.104
Neg logit contributions
izing-0.178
iddled-0.176
pole-0.168
string-0.164
opter-0.163
Tokenay
Token ID323
Feature activation-4.79e-03

Pos logit contributions
 once+0.280
 two+0.249
 least+0.248
 End+0.244
 night+0.230
Neg logit contributions
izing-0.203
iddled-0.201
opter-0.177
pole-0.177
string-0.175
Token fell
Token ID3214
Feature activation-2.18e-02

Pos logit contributions
 End+0.007
 once+0.007
 least+0.007
 two+0.006
 best+0.006
Neg logit contributions
iddled-0.007
izing-0.007
opter-0.007
pole-0.007
string-0.007
Token off
Token ID572
Feature activation+2.79e-03

Pos logit contributions
 once+0.036
 two+0.031
 next+0.029
 night+0.028
 least+0.027
Neg logit contributions
izing-0.036
iddled-0.034
ues-0.033
oker-0.033
opter-0.032
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
izing+0.005
iddled+0.005
pole+0.005
opter+0.005
string+0.005
Neg logit contributions
 once-0.004
 least-0.003
 two-0.003
 End-0.003
m-0.003
Token Bill
Token ID3941
Feature activation-4.64e-04

Pos logit contributions
 once+0.014
 least+0.014
 End+0.014
mo+0.012
 end+0.012
Neg logit contributions
iddled-0.016
izing-0.016
ledge-0.015
opter-0.015
uts-0.015
Token a
Token ID257
Feature activation+9.99e-03

Pos logit contributions
iddled+0.007
izing+0.006
string+0.006
pole+0.006
 ours+0.006
Neg logit contributions
 once-0.005
 least-0.005
 End-0.005
m-0.005
mo-0.004
Token garden
Token ID11376
Feature activation+5.04e-02

Pos logit contributions
izing+0.016
iddled+0.015
pole+0.014
opter+0.014
ues+0.014
Neg logit contributions
 once-0.016
 two-0.014
 least-0.014
 End-0.013
 night-0.013
Token full
Token ID1336
Feature activation+4.67e-03

Pos logit contributions
izing+0.093
iddled+0.092
pole+0.084
opter+0.084
string+0.083
Neg logit contributions
 once-0.066
 two-0.055
 least-0.055
 End-0.053
 night-0.049
Token of
Token ID286
Feature activation-2.06e-02

Pos logit contributions
izing+0.008
iddled+0.008
opter+0.007
ledge+0.007
ues+0.007
Neg logit contributions
 once-0.007
 two-0.006
 least-0.006
 End-0.006
 night-0.006
Token wild
Token ID4295
Feature activation-1.26e-02

Pos logit contributions
 once+0.034
 least+0.031
 End+0.030
 two+0.029
m+0.028
Neg logit contributions
iddled-0.030
izing-0.029
pole-0.026
opter-0.026
ledge-0.026
Tokenfl
Token ID2704
Feature activation-1.54e-01

Pos logit contributions
 once+0.019
 least+0.017
 End+0.016
 two+0.016
m+0.015
Neg logit contributions
izing-0.020
iddled-0.020
opter-0.018
pole-0.018
ledge-0.018
Tokenowers
Token ID3618
Feature activation-1.83e-02

Pos logit contributions
 once+0.302
 End+0.300
 least+0.284
 two+0.279
 end+0.270
Neg logit contributions
izing-0.170
iddled-0.168
pole-0.151
quet-0.148
opter-0.147
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
 once+0.025
 End+0.023
 least+0.023
 two+0.021
 eventually+0.019
Neg logit contributions
izing-0.031
iddled-0.030
pole-0.029
uts-0.028
string-0.028
Token He
Token ID679
Feature activation-1.53e-02

Pos logit contributions
 once+0.015
 least+0.014
 End+0.014
 two+0.013
m+0.013
Neg logit contributions
iddled-0.018
izing-0.017
pole-0.017
ledge-0.016
string-0.016
Token decided
Token ID3066
Feature activation-3.47e-02

Pos logit contributions
 once+0.024
 End+0.022
 least+0.022
m+0.020
 two+0.020
Neg logit contributions
iddled-0.023
izing-0.021
pole-0.020
string-0.020
ues-0.019
Token to
Token ID284
Feature activation+1.08e-02

Pos logit contributions
 once+0.049
 least+0.041
 two+0.041
 End+0.040
m+0.037
Neg logit contributions
izing-0.061
iddled-0.060
pole-0.055
opter-0.055
string-0.054
Token the
Token ID262
Feature activation-1.56e-03

Pos logit contributions
izing+0.015
iddled+0.014
ues+0.013
pole+0.013
ledge+0.013
Neg logit contributions
 once-0.011
 least-0.009
 End-0.009
 two-0.009
m-0.009
Token soil
Token ID9260
Feature activation+4.05e-02

Pos logit contributions
 once+0.002
 least+0.002
abella+0.002
mo+0.002
abel+0.002
Neg logit contributions
izing-0.002
 flow-0.002
pole-0.002
 shapes-0.002
igator-0.002
Token for
Token ID329
Feature activation-2.09e-02

Pos logit contributions
izing+0.075
iddled+0.075
opter+0.069
ues+0.068
ledge+0.067
Neg logit contributions
 once-0.053
 two-0.044
 least-0.044
 End-0.041
 next-0.040
Token the
Token ID262
Feature activation-7.55e-04

Pos logit contributions
 once+0.034
m+0.032
 End+0.031
mo+0.031
 least+0.031
Neg logit contributions
izing-0.030
iddled-0.025
nam-0.024
quet-0.024
ledge-0.024
Token sun
Token ID4252
Feature activation+8.20e-03

Pos logit contributions
 once+0.001
 least+0.001
abel+0.001
m+0.001
mo+0.001
Neg logit contributions
izing-0.001
pole-0.001
igator-0.001
rier-0.001
 flow-0.001
Tokenfl
Token ID2704
Feature activation-1.54e-01

Pos logit contributions
izing+0.013
pole+0.013
string+0.012
iddled+0.012
uts+0.012
Neg logit contributions
 End-0.011
 once-0.011
 least-0.011
 two-0.010
 moral-0.009
Tokenowers
Token ID3618
Feature activation-1.33e-02

Pos logit contributions
 once+0.296
 End+0.285
 two+0.279
 least+0.276
 moral+0.258
Neg logit contributions
izing-0.174
iddled-0.157
pole-0.150
quet-0.145
opter-0.141
Token.
Token ID13
Feature activation-7.33e-03

Pos logit contributions
 once+0.017
 End+0.015
 least+0.014
 two+0.014
m+0.013
Neg logit contributions
izing-0.024
iddled-0.024
pole-0.022
string-0.022
opter-0.022
Token From
Token ID3574
Feature activation-1.12e-02

Pos logit contributions
 once+0.012
 least+0.012
 End+0.011
 two+0.011
m+0.011
Neg logit contributions
iddled-0.010
izing-0.009
pole-0.009
ledge-0.009
string-0.009
Token that
Token ID326
Feature activation-1.52e-02

Pos logit contributions
 once+0.016
m+0.015
mo+0.014
abel+0.014
 End+0.014
Neg logit contributions
izing-0.019
ledge-0.016
iddled-0.016
pole-0.015
ize-0.014
Token day
Token ID1110
Feature activation+8.83e-03

Pos logit contributions
 least+0.013
 once+0.013
add+0.012
abel+0.011
mo+0.011
Neg logit contributions
izing-0.028
pole-0.028
taker-0.027
igator-0.027
opter-0.027
Token me
Token ID502
Feature activation+2.32e-02

Pos logit contributions
izing+0.008
iddled+0.008
pole+0.007
ledge+0.007
opter+0.007
Neg logit contributions
 once-0.007
 least-0.006
 two-0.006
 End-0.006
m-0.006
Tokenadow
Token ID4584
Feature activation+4.02e-02

Pos logit contributions
izing+0.023
iddled+0.023
pole+0.022
ledge+0.022
iding+0.020
Neg logit contributions
 once-0.045
 End-0.044
 two-0.043
 least-0.041
 moral-0.039
Token full
Token ID1336
Feature activation+3.22e-03

Pos logit contributions
izing+0.071
iddled+0.070
pole+0.064
opter+0.063
string+0.063
Neg logit contributions
 once-0.054
 least-0.047
 two-0.046
 End-0.046
m-0.042
Token of
Token ID286
Feature activation-2.12e-02

Pos logit contributions
iddled+0.005
izing+0.005
opter+0.004
ues+0.004
pole+0.004
Neg logit contributions
 once-0.005
 two-0.005
 least-0.004
 End-0.004
 night-0.004
Token sun
Token ID4252
Feature activation+2.35e-03

Pos logit contributions
 once+0.036
 least+0.033
 End+0.032
m+0.031
mo+0.030
Neg logit contributions
izing-0.029
iddled-0.028
ledge-0.025
 ours-0.024
 brace-0.024
Tokenfl
Token ID2704
Feature activation-1.53e-01

Pos logit contributions
izing+0.005
pole+0.005
iddled+0.004
string+0.004
quet+0.004
Neg logit contributions
 End-0.002
 least-0.002
 once-0.002
 two-0.002
 moral-0.002
Tokenowers
Token ID3618
Feature activation-1.72e-02

Pos logit contributions
 once+0.292
 End+0.291
 least+0.279
 two+0.272
 end+0.261
Neg logit contributions
izing-0.181
iddled-0.170
pole-0.155
opter-0.149
quet-0.149
Token and
Token ID290
Feature activation-1.78e-02

Pos logit contributions
 once+0.022
 End+0.019
 least+0.019
 two+0.019
m+0.017
Neg logit contributions
izing-0.031
iddled-0.030
pole-0.028
string-0.028
ledge-0.028
Token tall
Token ID7331
Feature activation+1.18e-02

Pos logit contributions
 once+0.027
 End+0.025
 least+0.024
m+0.024
mo+0.022
Neg logit contributions
izing-0.027
iddled-0.025
pole-0.024
nam-0.024
string-0.023
Token grass
Token ID8701
Feature activation+3.80e-02

Pos logit contributions
izing+0.019
iddled+0.018
 brace+0.017
pole+0.017
ues+0.016
Neg logit contributions
 once-0.017
 least-0.016
 End-0.016
m-0.015
 two-0.014
Token.
Token ID13
Feature activation-4.48e-03

Pos logit contributions
izing+0.068
iddled+0.066
pole+0.062
string+0.060
ledge+0.059
Neg logit contributions
 once-0.050
 least-0.046
 End-0.045
 two-0.043
 night-0.039
Token to
Token ID284
Feature activation+2.56e-03

Pos logit contributions
izing+0.022
iddled+0.022
opter+0.021
pole+0.021
string+0.020
Neg logit contributions
 once-0.015
 two-0.013
 least-0.011
 next-0.011
 night-0.011
Token work
Token ID670
Feature activation+2.56e-03

Pos logit contributions
izing+0.004
iddled+0.004
rier+0.004
ues+0.004
pole+0.004
Neg logit contributions
 once-0.004
 two-0.004
 least-0.004
 night-0.004
 End-0.003
Token on
Token ID319
Feature activation-4.85e-03

Pos logit contributions
izing+0.005
iddled+0.005
pole+0.004
opter+0.004
string+0.004
Neg logit contributions
 once-0.003
 least-0.003
 two-0.003
 End-0.003
m-0.002
Token her
Token ID607
Feature activation-1.14e-02

Pos logit contributions
 once+0.008
 End+0.007
 least+0.007
m+0.007
mo+0.006
Neg logit contributions
izing-0.007
iddled-0.007
 brace-0.006
nam-0.006
ues-0.006
Token ash
Token ID12530
Feature activation-9.76e-02

Pos logit contributions
 once+0.019
m+0.018
 End+0.018
 least+0.017
 two+0.016
Neg logit contributions
izing-0.014
 brace-0.013
pole-0.013
adoes-0.013
iddled-0.013
Tokentr
Token ID2213
Feature activation-1.52e-01

Pos logit contributions
 once+0.129
 End+0.117
 least+0.114
 two+0.114
m+0.103
Neg logit contributions
izing-0.170
iddled-0.165
pole-0.159
string-0.155
ues-0.153
Tokenay
Token ID323
Feature activation+3.61e-03

Pos logit contributions
 once+0.274
 least+0.241
 two+0.241
 End+0.235
m+0.223
Neg logit contributions
izing-0.206
iddled-0.202
opter-0.178
pole-0.178
string-0.177
Token right
Token ID826
Feature activation-5.26e-02

Pos logit contributions
izing+0.006
iddled+0.006
pole+0.005
uts+0.005
string+0.005
Neg logit contributions
 End-0.005
 once-0.005
 least-0.005
 two-0.005
mo-0.004
Token away
Token ID1497
Feature activation+2.53e-02

Pos logit contributions
 once+0.093
 least+0.086
 End+0.086
 two+0.085
m+0.083
Neg logit contributions
izing-0.069
iddled-0.063
ledge-0.056
ues-0.056
pole-0.055
Token.
Token ID13
Feature activation-1.65e-02

Pos logit contributions
izing+0.046
iddled+0.044
ues+0.041
pole+0.041
string+0.041
Neg logit contributions
 once-0.033
 least-0.029
 End-0.029
 two-0.028
m-0.026
Token She
Token ID1375
Feature activation-2.25e-03

Pos logit contributions
 once+0.025
 two+0.022
 least+0.020
 End+0.020
 morning+0.019
Neg logit contributions
izing-0.029
iddled-0.028
opter-0.026
pole-0.025
quet-0.025
Token and
Token ID290
Feature activation-7.96e-03

Pos logit contributions
iddled+0.005
izing+0.005
opter+0.005
ledge+0.005
nam+0.005
Neg logit contributions
 once-0.003
 two-0.002
 least-0.002
 next-0.002
 End-0.002
Token look
Token ID804
Feature activation+6.00e-03

Pos logit contributions
 once+0.014
 two+0.013
 least+0.012
 End+0.012
 next+0.012
Neg logit contributions
izing-0.011
iddled-0.011
opter-0.010
pole-0.010
ues-0.010
Token at
Token ID379
Feature activation+1.20e-02

Pos logit contributions
iddled+0.013
izing+0.013
opter+0.013
quet+0.012
string+0.012
Neg logit contributions
 once-0.006
 two-0.005
 best-0.005
 next-0.004
 least-0.004
Token their
Token ID511
Feature activation-1.48e-02

Pos logit contributions
izing+0.019
iddled+0.019
ledge+0.016
nam+0.016
ues+0.016
Neg logit contributions
 once-0.017
 End-0.016
m-0.015
 least-0.015
day-0.015
Token snow
Token ID6729
Feature activation-6.77e-02

Pos logit contributions
 once+0.026
 least+0.024
m+0.024
 End+0.024
mo+0.023
Neg logit contributions
izing-0.018
 brace-0.016
 shapes-0.016
ues-0.016
iddled-0.015
Tokenfl
Token ID2704
Feature activation-1.51e-01

Pos logit contributions
 once+0.112
 least+0.108
 End+0.105
 two+0.102
m+0.096
Neg logit contributions
izing-0.096
iddled-0.086
pole-0.085
ues-0.083
string-0.082
Tokenakes
Token ID1124
Feature activation-1.80e-02

Pos logit contributions
 once+0.272
 End+0.260
 two+0.251
 least+0.250
 eventually+0.235
Neg logit contributions
izing-0.194
iddled-0.186
opter-0.168
pole-0.168
ues-0.162
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
 once+0.022
 least+0.018
 two+0.018
 End+0.018
 night+0.016
Neg logit contributions
izing-0.035
iddled-0.034
pole-0.032
opter-0.032
string-0.031
Token
Token ID198
Feature activation+2.91e-03

Pos logit contributions
iddled+0.017
izing+0.016
ues+0.015
uts+0.015
 Watch+0.015
Neg logit contributions
 least-0.024
 once-0.024
 End-0.023
 end-0.021
 night-0.021
Token
Token ID198
Feature activation+1.89e-03

Pos logit contributions
iddled+0.006
izing+0.005
ues+0.005
opter+0.005
ledge+0.005
Neg logit contributions
 once-0.003
 least-0.003
 End-0.003
 best-0.003
 two-0.002
TokenS
Token ID50
Feature activation-9.99e-03

Pos logit contributions
izing+0.003
iddled+0.003
pole+0.003
opter+0.003
string+0.003
Neg logit contributions
 once-0.003
 two-0.003
 least-0.003
 End-0.003
 next-0.003
Token to
Token ID284
Feature activation+2.09e-02

Pos logit contributions
izing+0.033
iddled+0.033
ues+0.030
opter+0.030
string+0.030
Neg logit contributions
 once-0.029
 least-0.026
 End-0.025
 two-0.025
m-0.023
Token wear
Token ID5806
Feature activation+1.91e-02

Pos logit contributions
izing+0.031
iddled+0.030
pole+0.028
ledge+0.028
string+0.027
Neg logit contributions
 once-0.033
 End-0.029
 least-0.028
 two-0.028
m-0.027
Token his
Token ID465
Feature activation-5.27e-03

Pos logit contributions
iddled+0.035
izing+0.035
ledge+0.031
string+0.031
pole+0.031
Neg logit contributions
 once-0.025
 least-0.021
 End-0.021
 two-0.021
m-0.019
Token flip
Token ID14283
Feature activation-1.93e-02

Pos logit contributions
 once+0.009
 least+0.008
 End+0.008
 two+0.007
m+0.007
Neg logit contributions
izing-0.007
iddled-0.007
ledge-0.007
 ours-0.007
opter-0.007
Token-
Token ID12
Feature activation-2.25e-02

Pos logit contributions
 once+0.027
 least+0.023
 End+0.023
 two+0.023
m+0.021
Neg logit contributions
izing-0.033
iddled-0.033
pole-0.030
string-0.030
ues-0.030
Tokenfl
Token ID2704
Feature activation-1.51e-01

Pos logit contributions
 once+0.041
 least+0.038
 End+0.038
 two+0.036
 best+0.034
Neg logit contributions
izing-0.029
iddled-0.028
string-0.025
ledge-0.025
pole-0.025
Tokenops
Token ID2840
Feature activation-6.77e-02

Pos logit contributions
 once+0.269
 End+0.248
 least+0.244
 two+0.243
 night+0.224
Neg logit contributions
izing-0.198
iddled-0.197
pole-0.174
opter-0.172
ues-0.171
Token so
Token ID523
Feature activation-1.12e-02

Pos logit contributions
 once+0.086
 least+0.073
 two+0.073
 End+0.072
m+0.064
Neg logit contributions
izing-0.125
iddled-0.124
pole-0.114
string-0.113
opter-0.113
Token he
Token ID339
Feature activation-1.86e-02

Pos logit contributions
 once+0.019
 least+0.018
 End+0.017
 two+0.017
m+0.017
Neg logit contributions
izing-0.016
iddled-0.015
opter-0.013
ues-0.013
taker-0.013
Token wouldn
Token ID3636
Feature activation-3.41e-02

Pos logit contributions
 once+0.031
 End+0.031
 least+0.028
 two+0.027
day+0.027
Neg logit contributions
iddled-0.025
izing-0.023
string-0.023
opter-0.022
pole-0.022
Token't
Token ID470
Feature activation-1.73e-02

Pos logit contributions
 once+0.058
 End+0.055
 least+0.053
 two+0.052
 best+0.049
Neg logit contributions
izing-0.044
iddled-0.043
opter-0.040
uts-0.040
string-0.039
Token important
Token ID1593
Feature activation-2.83e-02

Pos logit contributions
 once+0.029
abel+0.025
 least+0.025
m+0.024
mo+0.024
Neg logit contributions
izing-0.030
iddled-0.027
 ours-0.026
pole-0.026
ledge-0.026
Token to
Token ID284
Feature activation+2.05e-02

Pos logit contributions
 once+0.037
 two+0.030
 least+0.030
 End+0.028
 night+0.027
Neg logit contributions
iddled-0.054
izing-0.054
pole-0.050
opter-0.050
string-0.049
Token be
Token ID307
Feature activation+8.06e-03

Pos logit contributions
izing+0.030
iddled+0.028
ledge+0.027
pole+0.027
string+0.026
Neg logit contributions
 once-0.034
 End-0.029
 least-0.028
 two-0.028
m-0.028
Token kind
Token ID1611
Feature activation+1.55e-02

Pos logit contributions
opter+0.014
iddled+0.013
izing+0.013
string+0.012
ues+0.012
Neg logit contributions
 once-0.013
 two-0.012
 best-0.011
 next-0.011
 only-0.010
Token to
Token ID284
Feature activation+2.45e-02

Pos logit contributions
iddled+0.032
izing+0.032
opter+0.030
pole+0.030
ues+0.029
Neg logit contributions
 once-0.018
 two-0.015
 least-0.014
 next-0.013
 End-0.013
Token all
Token ID477
Feature activation+6.00e-02

Pos logit contributions
izing+0.036
iddled+0.034
pole+0.031
ledge+0.031
opter+0.031
Neg logit contributions
 once-0.042
 least-0.036
 two-0.036
 End-0.036
m-0.034
Token animals
Token ID4695
Feature activation-2.07e-02

Pos logit contributions
iddled+0.107
izing+0.099
ues+0.094
opter+0.094
string+0.092
Neg logit contributions
 once-0.091
 two-0.082
 least-0.079
 End-0.075
 night-0.074
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 once+0.027
 two+0.023
 least+0.023
 End+0.023
 night+0.021
Neg logit contributions
izing-0.038
iddled-0.037
pole-0.034
opter-0.034
string-0.034
Token even
Token ID772
Feature activation-4.28e-03

Pos logit contributions
izing+0.021
iddled+0.020
pole+0.019
string+0.019
ues+0.019
Neg logit contributions
 once-0.015
 End-0.013
 two-0.013
 least-0.013
m-0.012
Token if
Token ID611
Feature activation-3.02e-02

Pos logit contributions
 once+0.007
 least+0.007
 End+0.006
 two+0.006
m+0.006
Neg logit contributions
izing-0.006
iddled-0.006
pole-0.005
string-0.005
ledge-0.005
Token they
Token ID484
Feature activation-1.93e-02

Pos logit contributions
 once+0.057
 least+0.051
 End+0.051
 two+0.048
m+0.048
Neg logit contributions
izing-0.039
iddled-0.034
opter-0.031
 ours-0.030
ues-0.030
TokenI
Token ID40
Feature activation+3.72e-03

Pos logit contributions
izing+0.053
iddled+0.050
pole+0.047
string+0.047
opter+0.046
Neg logit contributions
 once-0.051
 least-0.046
 End-0.044
 best-0.044
 two-0.044
Token am
Token ID716
Feature activation-2.18e-02

Pos logit contributions
iddled+0.006
izing+0.006
opter+0.006
string+0.005
ues+0.005
Neg logit contributions
 once-0.005
 End-0.005
 two-0.005
 least-0.005
 next-0.004
Token hungry
Token ID14720
Feature activation+1.11e-02

Pos logit contributions
 once+0.036
 two+0.031
 least+0.031
 End+0.030
 night+0.029
Neg logit contributions
izing-0.033
iddled-0.033
opter-0.029
pole-0.029
string-0.029
Token.
Token ID13
Feature activation+2.92e-02

Pos logit contributions
iddled+0.022
izing+0.022
opter+0.020
pole+0.020
ues+0.020
Neg logit contributions
 once-0.013
 two-0.011
 least-0.011
 End-0.010
 next-0.010
Token Give
Token ID13786
Feature activation-1.71e-02

Pos logit contributions
izing+0.025
iddled+0.024
pole+0.021
string+0.021
ledge+0.021
Neg logit contributions
 once-0.065
 least-0.059
 End-0.059
 two-0.059
m-0.056
Token me
Token ID502
Feature activation+5.01e-02

Pos logit contributions
 once+0.027
 End+0.025
 least+0.024
 two+0.023
m+0.023
Neg logit contributions
izing-0.025
iddled-0.024
pole-0.022
string-0.022
 ours-0.022
Token your
Token ID534
Feature activation+1.65e-02

Pos logit contributions
izing+0.082
iddled+0.081
pole+0.075
string+0.075
ledge+0.075
Neg logit contributions
 once-0.071
 End-0.061
 least-0.061
 two-0.058
m-0.056
Token food
Token ID2057
Feature activation-1.50e-02

Pos logit contributions
izing+0.023
iddled+0.023
 ours+0.022
pole+0.022
string+0.022
Neg logit contributions
 once-0.025
 End-0.024
 least-0.023
 two-0.022
m-0.021
Token."
Token ID526
Feature activation+1.16e-02

Pos logit contributions
 once+0.020
 two+0.017
 least+0.017
 End+0.016
 next+0.015
Neg logit contributions
izing-0.027
iddled-0.027
opter-0.025
pole-0.025
ledge-0.025
Token Tom
Token ID4186
Feature activation+8.20e-03

Pos logit contributions
izing+0.023
iddled+0.023
pole+0.021
opter+0.021
ues+0.021
Neg logit contributions
 once-0.013
 least-0.011
 two-0.011
 End-0.011
 night-0.009
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
izing+0.015
iddled+0.014
opter+0.013
pole+0.013
string+0.013
Neg logit contributions
 once-0.011
 two-0.010
 least-0.010
 End-0.009
m-0.009
Token happy
Token ID3772
Feature activation-6.06e-04

Pos logit contributions
izing+0.025
iddled+0.025
string+0.024
pole+0.023
ledge+0.023
Neg logit contributions
 once-0.021
 two-0.019
 End-0.017
 next-0.017
m-0.017
Token and
Token ID290
Feature activation-1.25e-02

Pos logit contributions
 once+0.001
 two+0.001
 least+0.001
 next+0.001
 End+0.001
Neg logit contributions
iddled-0.001
izing-0.001
pole-0.001
opter-0.001
string-0.001
Token gave
Token ID2921
Feature activation+4.90e-03

Pos logit contributions
 once+0.015
 two+0.014
 next+0.013
 then+0.012
 eventually+0.012
Neg logit contributions
hips-0.026
iddled-0.026
izing-0.025
ues-0.025
ymes-0.024
Token Lily
Token ID20037
Feature activation+1.47e-02

Pos logit contributions
iddled+0.007
izing+0.007
pole+0.006
opter+0.006
ues+0.006
Neg logit contributions
 once-0.008
 least-0.007
 End-0.007
 two-0.007
m-0.007
Token a
Token ID257
Feature activation+3.12e-02

Pos logit contributions
iddled+0.029
izing+0.028
string+0.026
pole+0.026
ledge+0.026
Neg logit contributions
 once-0.016
 End-0.014
 least-0.014
 two-0.013
m-0.012
Token big
Token ID1263
Feature activation+5.51e-02

Pos logit contributions
izing+0.073
iddled+0.067
pole+0.064
adoes+0.064
ues+0.063
Neg logit contributions
 once-0.033
 two-0.030
 night-0.026
 least-0.024
m-0.024
Token hug
Token ID16225
Feature activation+4.41e-03

Pos logit contributions
izing+0.119
iddled+0.106
uts+0.104
pole+0.103
hips+0.101
Neg logit contributions
 once-0.071
 two-0.066
 night-0.059
 morning-0.056
 end-0.055
Token.
Token ID13
Feature activation-8.33e-03

Pos logit contributions
izing+0.007
iddled+0.007
string+0.006
pole+0.006
taker+0.006
Neg logit contributions
 End-0.006
 once-0.006
 least-0.006
 two-0.006
mo-0.005
Token Lily
Token ID20037
Feature activation+1.67e-02

Pos logit contributions
 once+0.011
 two+0.009
 least+0.009
 End+0.009
 night+0.008
Neg logit contributions
izing-0.015
iddled-0.015
opter-0.014
pole-0.014
string-0.014
Token felt
Token ID2936
Feature activation+2.86e-02

Pos logit contributions
izing+0.032
iddled+0.031
opter+0.029
pole+0.029
string+0.029
Neg logit contributions
 once-0.021
 least-0.017
 two-0.017
 End-0.017
 night-0.015
Token so
Token ID523
Feature activation+2.63e-02

Pos logit contributions
izing+0.058
iddled+0.057
opter+0.056
ues+0.055
pole+0.054
Neg logit contributions
 once-0.039
 two-0.032
 End-0.030
 least-0.030
 night-0.029
Token always
Token ID1464
Feature activation-8.90e-03

Pos logit contributions
 once+0.006
 End+0.005
 least+0.005
 two+0.005
mo+0.005
Neg logit contributions
iddled-0.008
izing-0.008
string-0.007
opter-0.007
pole-0.007
Token made
Token ID925
Feature activation+1.95e-02

Pos logit contributions
 once+0.012
 two+0.010
 least+0.010
 End+0.010
 night+0.009
Neg logit contributions
izing-0.017
iddled-0.016
pole-0.015
opter-0.015
ues-0.015
Token sure
Token ID1654
Feature activation+3.33e-03

Pos logit contributions
izing+0.032
iddled+0.032
pole+0.029
opter+0.029
string+0.029
Neg logit contributions
 once-0.030
 two-0.026
 least-0.025
 End-0.024
 next-0.023
Token to
Token ID284
Feature activation+1.04e-02

Pos logit contributions
izing+0.005
iddled+0.005
ledge+0.005
string+0.005
ues+0.005
Neg logit contributions
 once-0.005
 End-0.005
 least-0.004
 two-0.004
m-0.004
Token put
Token ID1234
Feature activation-1.15e-02

Pos logit contributions
izing+0.015
iddled+0.015
ledge+0.014
pole+0.013
string+0.013
Neg logit contributions
 once-0.017
 End-0.015
 two-0.014
 least-0.014
m-0.014
Token all
Token ID477
Feature activation+5.87e-02

Pos logit contributions
 once+0.016
 least+0.013
 End+0.013
 two+0.013
m+0.012
Neg logit contributions
izing-0.020
iddled-0.020
ledge-0.018
ues-0.018
opter-0.018
Token the
Token ID262
Feature activation+7.46e-03

Pos logit contributions
izing+0.109
iddled+0.106
opter+0.099
pole+0.099
ues+0.098
Neg logit contributions
 once-0.079
 two-0.068
 least-0.066
 End-0.063
 night-0.061
Token dishes
Token ID16759
Feature activation-1.04e-02

Pos logit contributions
izing+0.010
iddled+0.010
opter+0.009
pole+0.009
ledge+0.009
Neg logit contributions
 once-0.013
 least-0.012
 two-0.011
 End-0.011
 night-0.011
Token in
Token ID287
Feature activation+1.81e-02

Pos logit contributions
 once+0.014
 two+0.011
 least+0.011
 End+0.011
 next+0.010
Neg logit contributions
izing-0.019
iddled-0.019
opter-0.018
pole-0.018
ledge-0.018
Token the
Token ID262
Feature activation-1.87e-02

Pos logit contributions
izing+0.029
iddled+0.028
nam+0.025
ues+0.025
ledge+0.024
Neg logit contributions
 once-0.026
 End-0.023
 least-0.023
mo-0.022
m-0.022
Token dish
Token ID9433
Feature activation-7.95e-02

Pos logit contributions
 once+0.030
 least+0.026
 End+0.025
 two+0.025
 eventually+0.024
Neg logit contributions
izing-0.028
iddled-0.027
ledge-0.024
opter-0.024
taker-0.024
Token sky
Token ID6766
Feature activation+1.06e-02

Pos logit contributions
 once+0.016
 least+0.014
 two+0.014
 End+0.014
m+0.012
Neg logit contributions
izing-0.021
iddled-0.020
opter-0.018
pole-0.018
string-0.018
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
iddled+0.022
izing+0.021
opter+0.020
ledge+0.020
ues+0.020
Neg logit contributions
 once-0.013
 two-0.010
 next-0.009
 least-0.009
 night-0.009
Token
Token ID198
Feature activation-1.60e-02

Pos logit contributions
 once+0.015
 two+0.012
 least+0.012
 End+0.012
 night+0.011
Neg logit contributions
izing-0.027
iddled-0.027
opter-0.025
pole-0.025
ues-0.025
Token
Token ID198
Feature activation-1.69e-02

Pos logit contributions
 once+0.015
 least+0.015
 End+0.014
 two+0.013
 best+0.012
Neg logit contributions
iddled-0.032
izing-0.032
opter-0.030
ues-0.030
pole-0.030
TokenThe
Token ID464
Feature activation-3.37e-02

Pos logit contributions
 once+0.015
 two+0.014
 End+0.013
 next+0.013
One+0.012
Neg logit contributions
iddled-0.038
rier-0.037
 ours-0.036
ues-0.036
izing-0.036
Token hill
Token ID12788
Feature activation+4.86e-02

Pos logit contributions
 once+0.059
 least+0.053
 two+0.051
 End+0.050
m+0.048
Neg logit contributions
izing-0.047
iddled-0.044
pole-0.040
string-0.040
opter-0.040
Token was
Token ID373
Feature activation-5.94e-03

Pos logit contributions
iddled+0.106
izing+0.106
opter+0.099
ledge+0.098
ues+0.098
Neg logit contributions
 once-0.048
 two-0.037
 least-0.036
 End-0.035
 next-0.032
Token no
Token ID645
Feature activation-2.81e-02

Pos logit contributions
 once+0.008
 two+0.007
 least+0.007
 End+0.007
 night+0.006
Neg logit contributions
izing-0.011
iddled-0.011
opter-0.010
ues-0.010
ledge-0.010
Token longer
Token ID2392
Feature activation-2.32e-03

Pos logit contributions
 once+0.034
 two+0.028
 least+0.027
 night+0.026
 End+0.025
Neg logit contributions
iddled-0.058
izing-0.055
ledge-0.052
string-0.052
ues-0.052
Token lonely
Token ID21757
Feature activation+1.33e-02

Pos logit contributions
 once+0.004
 two+0.003
 least+0.003
 night+0.003
 End+0.003
Neg logit contributions
iddled-0.004
izing-0.004
opter-0.003
pole-0.003
ues-0.003
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
izing+0.024
iddled+0.024
pole+0.022
opter+0.022
string+0.022
Neg logit contributions
 once-0.017
 least-0.015
 two-0.015
 End-0.014
m-0.013
Token mom
Token ID1995
Feature activation-4.64e-02

Pos logit contributions
izing+0.001
iddled+0.001
ues+0.001
opter+0.001
string+0.001
Neg logit contributions
 once-0.001
 least-0.001
m-0.001
 End-0.001
abel-0.001
Token for
Token ID329
Feature activation-1.17e-02

Pos logit contributions
 least+0.070
 End+0.069
 once+0.067
add+0.065
day+0.063
Neg logit contributions
pole-0.063
string-0.059
izing-0.058
ledge-0.054
igator-0.054
Token her
Token ID607
Feature activation+2.24e-02

Pos logit contributions
 once+0.019
 least+0.017
 two+0.017
 End+0.016
m+0.015
Neg logit contributions
izing-0.018
iddled-0.017
pole-0.016
ledge-0.015
opter-0.015
Token advice
Token ID5608
Feature activation+1.50e-03

Pos logit contributions
izing+0.034
iddled+0.034
ues+0.032
string+0.031
taker+0.031
Neg logit contributions
 once-0.032
 least-0.030
 End-0.028
 two-0.027
m-0.027
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
izing+0.003
iddled+0.003
pole+0.002
string+0.002
ues+0.002
Neg logit contributions
 once-0.002
 least-0.002
 End-0.002
 two-0.002
 night-0.001
Token He
Token ID679
Feature activation-2.77e-04

Pos logit contributions
iddled+0.023
izing+0.022
pole+0.021
ledge+0.021
uts+0.021
Neg logit contributions
 once-0.019
 least-0.017
 End-0.016
 two-0.016
m-0.015
Token says
Token ID1139
Feature activation+2.37e-03

Pos logit contributions
 once+0.000
 End+0.000
 least+0.000
m+0.000
 two+0.000
Neg logit contributions
iddled-0.000
izing-0.000
string-0.000
pole-0.000
ues-0.000
Token that
Token ID326
Feature activation+7.91e-03

Pos logit contributions
izing+0.004
iddled+0.004
pole+0.004
ledge+0.004
string+0.004
Neg logit contributions
 once-0.003
 two-0.003
 End-0.003
 next-0.003
 least-0.002
Token he
Token ID339
Feature activation+1.95e-04

Pos logit contributions
izing+0.011
iddled+0.011
string+0.010
pole+0.010
opter+0.010
Neg logit contributions
 once-0.013
 least-0.012
 End-0.012
 two-0.011
m-0.011
Token loves
Token ID10408
Feature activation-3.17e-02

Pos logit contributions
iddled+0.000
izing+0.000
opter+0.000
string+0.000
ues+0.000
Neg logit contributions
 once-0.000
 End-0.000
 two-0.000
 least-0.000
 morning-0.000
Token her
Token ID607
Feature activation+2.87e-02

Pos logit contributions
 once+0.049
 End+0.044
 least+0.044
 two+0.042
m+0.041
Neg logit contributions
izing-0.048
iddled-0.047
ledge-0.043
ues-0.042
pole-0.042
Token to
Token ID284
Feature activation+2.13e-02

Pos logit contributions
 once+0.010
 least+0.009
 End+0.009
 two+0.009
m+0.008
Neg logit contributions
izing-0.012
iddled-0.012
pole-0.011
ues-0.011
opter-0.011
Token consider
Token ID2074
Feature activation+7.53e-03

Pos logit contributions
izing+0.036
iddled+0.035
ledge+0.033
pole+0.033
opter+0.032
Neg logit contributions
 once-0.030
 End-0.025
 least-0.025
 two-0.025
m-0.024
Token the
Token ID262
Feature activation+5.49e-02

Pos logit contributions
izing+0.010
iddled+0.010
opter+0.009
pole+0.009
string+0.009
Neg logit contributions
 once-0.013
 two-0.012
 least-0.012
 End-0.011
 night-0.011
Token consequences
Token ID6948
Feature activation+1.43e-03

Pos logit contributions
izing+0.074
string+0.067
iddled+0.067
pole+0.066
 violin+0.066
Neg logit contributions
 once-0.088
 least-0.079
m-0.076
 End-0.074
mo-0.073
Token.
Token ID13
Feature activation+7.40e-03

Pos logit contributions
izing+0.003
iddled+0.003
opter+0.002
pole+0.002
string+0.002
Neg logit contributions
 once-0.002
 two-0.002
 least-0.002
 End-0.002
 night-0.001
Token It
Token ID632
Feature activation+2.69e-02

Pos logit contributions
iddled+0.012
izing+0.012
pole+0.012
ledge+0.011
 Watch+0.011
Neg logit contributions
 once-0.010
 least-0.009
 End-0.008
 two-0.008
m-0.008
Token's
Token ID338
Feature activation-4.74e-03

Pos logit contributions
izing+0.050
iddled+0.050
pole+0.046
opter+0.045
string+0.045
Neg logit contributions
 once-0.034
 least-0.028
 two-0.028
 End-0.028
m-0.026
Token always
Token ID1464
Feature activation+7.17e-03

Pos logit contributions
 once+0.006
m+0.006
abel+0.006
 least+0.006
fl+0.005
Neg logit contributions
izing-0.008
pole-0.007
 ours-0.007
iddled-0.007
ledge-0.007
Token good
Token ID922
Feature activation+6.34e-03

Pos logit contributions
iddled+0.012
izing+0.012
opter+0.011
ues+0.011
string+0.011
Neg logit contributions
 once-0.011
 two-0.009
 least-0.009
 End-0.009
 next-0.008
Token to
Token ID284
Feature activation+1.90e-02

Pos logit contributions
izing+0.011
iddled+0.011
ues+0.010
opter+0.010
pole+0.010
Neg logit contributions
 once-0.009
 least-0.008
 two-0.008
 End-0.008
m-0.007
Token think
Token ID892
Feature activation-3.03e-02

Pos logit contributions
izing+0.029
iddled+0.028
ledge+0.026
pole+0.025
opter+0.025
Neg logit contributions
 once-0.031
 End-0.027
 least-0.027
 two-0.026
m-0.025
Tokenâ
Token ID22940
Feature activation+4.98e-04

Pos logit contributions
izing+0.015
iddled+0.015
pole+0.014
opter+0.014
ledge+0.013
Neg logit contributions
 once-0.018
 least-0.016
 two-0.016
 End-0.016
 night-0.015
Token€
Token ID26391
Feature activation-4.33e-02

Pos logit contributions
iddled+0.001
izing+0.001
ues+0.001
opter+0.001
string+0.001
Neg logit contributions
 once-0.001
 least-0.001
 two-0.001
 End-0.001
 night-0.001
Token said
Token ID531
Feature activation+3.04e-03

Pos logit contributions
 once+0.050
 two+0.040
 least+0.037
 End+0.037
m+0.035
Neg logit contributions
iddled-0.090
izing-0.089
rier-0.081
string-0.081
pole-0.081
Token Bunny
Token ID28683
Feature activation+3.13e-02

Pos logit contributions
iddled+0.006
izing+0.006
ues+0.005
opter+0.005
pole+0.005
Neg logit contributions
 once-0.004
 two-0.003
 least-0.003
 End-0.003
 next-0.003
Token.
Token ID13
Feature activation+1.02e-03

Pos logit contributions
izing+0.057
iddled+0.056
opter+0.051
ues+0.051
pole+0.051
Neg logit contributions
 once-0.043
 two-0.036
 least-0.035
 End-0.034
 night-0.032
Token
Token ID198
Feature activation-3.25e-03

Pos logit contributions
izing+0.002
iddled+0.002
opter+0.002
ues+0.002
string+0.002
Neg logit contributions
 once-0.001
 two-0.001
 least-0.001
 End-0.001
 next-0.001
Token
Token ID198
Feature activation-5.72e-03

Pos logit contributions
 once+0.003
 least+0.003
 End+0.003
 two+0.002
 night+0.002
Neg logit contributions
izing-0.007
iddled-0.007
opter-0.006
pole-0.006
ledge-0.006
TokenThe
Token ID464
Feature activation-2.20e-02

Pos logit contributions
 once+0.008
 two+0.007
 least+0.007
 End+0.007
 next+0.007
Neg logit contributions
iddled-0.010
izing-0.009
pole-0.009
ues-0.009
string-0.009
Token ac
Token ID936
Feature activation-3.92e-02

Pos logit contributions
 once+0.035
abel+0.034
 least+0.032
mo+0.032
bie+0.031
Neg logit contributions
izing-0.029
 brace-0.026
pole-0.026
quet-0.025
string-0.025
Tokenorn
Token ID1211
Feature activation+8.20e-03

Pos logit contributions
 once+0.061
 least+0.054
 two+0.054
 End+0.052
 night+0.048
Neg logit contributions
izing-0.062
iddled-0.060
opter-0.055
pole-0.055
string-0.054
Token did
Token ID750
Feature activation-1.63e-02

Pos logit contributions
izing+0.014
iddled+0.013
pole+0.013
string+0.012
opter+0.012
Neg logit contributions
 once-0.011
 least-0.010
 End-0.010
 two-0.010
m-0.009
Token okay
Token ID8788
Feature activation-4.59e-02

Pos logit contributions
izing+0.011
 ours+0.011
iddled+0.010
 yours+0.010
pole+0.010
Neg logit contributions
 once-0.012
 least-0.011
m-0.011
 End-0.010
mo-0.010
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
 once+0.072
 least+0.068
 End+0.066
 two+0.061
day+0.059
Neg logit contributions
izing-0.069
iddled-0.060
ues-0.058
ledge-0.058
adoes-0.057
Token the
Token ID262
Feature activation+1.98e-02

Pos logit contributions
izing+0.023
iddled+0.022
opter+0.020
pole+0.020
string+0.019
Neg logit contributions
 once-0.028
 least-0.025
 two-0.025
 End-0.025
m-0.023
Token earthquake
Token ID16295
Feature activation+1.32e-02

Pos logit contributions
izing+0.024
iddled+0.022
adoes+0.021
pole+0.020
taker+0.020
Neg logit contributions
 once-0.036
 least-0.032
abel-0.032
 End-0.031
m-0.031
Token is
Token ID318
Feature activation-1.25e-02

Pos logit contributions
izing+0.024
iddled+0.024
opter+0.022
pole+0.022
ues+0.022
Neg logit contributions
 once-0.017
 least-0.014
 two-0.014
 End-0.014
 night-0.013
Token over
Token ID625
Feature activation-1.93e-02

Pos logit contributions
 once+0.018
 least+0.016
 two+0.016
 End+0.015
m+0.014
Neg logit contributions
izing-0.021
iddled-0.021
pole-0.019
opter-0.018
string-0.018
Token now
Token ID783
Feature activation-5.68e-02

Pos logit contributions
 once+0.032
 End+0.030
 least+0.029
mo+0.028
abel+0.028
Neg logit contributions
izing-0.027
iddled-0.023
 ours-0.022
ledge-0.021
ues-0.021
Token".
Token ID1911
Feature activation+1.34e-02

Pos logit contributions
 once+0.082
 least+0.073
 End+0.072
 two+0.071
m+0.065
Neg logit contributions
izing-0.094
iddled-0.089
pole-0.083
opter-0.082
ues-0.082
Token
Token ID220
Feature activation+5.41e-03

Pos logit contributions
iddled+0.026
izing+0.026
ledge+0.024
pole+0.024
ues+0.024
Neg logit contributions
 once-0.014
 End-0.012
 least-0.012
 two-0.012
mo-0.011
Token
Token ID198
Feature activation+3.04e-03

Pos logit contributions
iddled+0.011
izing+0.011
uts+0.010
ues+0.010
ledge+0.010
Neg logit contributions
 once-0.005
 least-0.005
 End-0.005
 two-0.004
 night-0.004
Token
Token ID198
Feature activation-8.91e-04

Pos logit contributions
iddled+0.006
izing+0.005
uts+0.005
opter+0.005
pole+0.005
Neg logit contributions
 once-0.003
 least-0.003
 End-0.003
mo-0.003
 night-0.003
Token necklace
Token ID38987
Feature activation+3.32e-03

Pos logit contributions
iddled+0.007
izing+0.007
uts+0.006
pole+0.006
ues+0.006
Neg logit contributions
 once-0.008
 two-0.007
 night-0.007
 least-0.007
 End-0.007
Token with
Token ID351
Feature activation+9.15e-03

Pos logit contributions
iddled+0.006
izing+0.006
opter+0.005
ues+0.005
ledge+0.005
Neg logit contributions
 once-0.005
 two-0.004
 least-0.004
 End-0.004
 next-0.003
Token a
Token ID257
Feature activation+1.58e-02

Pos logit contributions
izing+0.015
iddled+0.015
pole+0.014
string+0.014
opter+0.014
Neg logit contributions
 once-0.013
 least-0.012
 End-0.011
m-0.010
 two-0.010
Token charm
Token ID20024
Feature activation-2.38e-04

Pos logit contributions
izing+0.021
iddled+0.021
pole+0.018
ues+0.018
opter+0.018
Neg logit contributions
 once-0.028
 two-0.026
 least-0.025
 End-0.025
 night-0.023
Token that
Token ID326
Feature activation-1.36e-03

Pos logit contributions
 once+0.000
 two+0.000
 least+0.000
 End+0.000
 next+0.000
Neg logit contributions
izing-0.000
iddled-0.000
opter-0.000
pole-0.000
ledge-0.000
Token looked
Token ID3114
Feature activation-8.15e-03

Pos logit contributions
 once+0.002
 two+0.001
 least+0.001
 next+0.001
 night+0.001
Neg logit contributions
izing-0.003
iddled-0.003
ues-0.003
opter-0.003
iding-0.003
Token just
Token ID655
Feature activation-6.48e-03

Pos logit contributions
 once+0.012
 two+0.010
 least+0.010
 End+0.009
 next+0.009
Neg logit contributions
iddled-0.014
izing-0.014
opter-0.013
string-0.013
pole-0.013
Token like
Token ID588
Feature activation-1.53e-02

Pos logit contributions
 once+0.010
 least+0.009
 End+0.009
 two+0.008
m+0.008
Neg logit contributions
izing-0.010
iddled-0.010
ledge-0.009
pole-0.009
string-0.009
Token her
Token ID607
Feature activation+2.09e-02

Pos logit contributions
 once+0.021
 least+0.019
 End+0.018
 two+0.018
m+0.017
Neg logit contributions
izing-0.026
iddled-0.026
pole-0.023
string-0.023
opter-0.023
Token special
Token ID2041
Feature activation-4.91e-03

Pos logit contributions
izing+0.035
iddled+0.032
pole+0.031
ues+0.031
string+0.031
Neg logit contributions
 once-0.028
 End-0.026
 least-0.025
m-0.024
 two-0.024
Token rock
Token ID3881
Feature activation+3.55e-03

Pos logit contributions
 once+0.009
 least+0.008
 two+0.008
 End+0.008
m+0.008
Neg logit contributions
izing-0.006
iddled-0.006
pole-0.005
opter-0.005
string-0.005
Token cat
Token ID3797
Feature activation+1.29e-02

Pos logit contributions
 violin+0.047
izing+0.047
string+0.044
 chord+0.042
opter+0.042
Neg logit contributions
 once-0.074
 least-0.072
mo-0.066
 eventually-0.064
abel-0.063
Token.
Token ID13
Feature activation+1.83e-02

Pos logit contributions
izing+0.024
iddled+0.024
pole+0.022
string+0.022
opter+0.022
Neg logit contributions
 once-0.016
 least-0.013
 two-0.013
 End-0.013
 night-0.012
Token "
Token ID366
Feature activation+2.55e-02

Pos logit contributions
izing+0.029
iddled+0.028
opter+0.026
ues+0.025
string+0.025
Neg logit contributions
 once-0.030
 two-0.027
 least-0.025
 End-0.024
 next-0.024
TokenWhat
Token ID2061
Feature activation-3.80e-02

Pos logit contributions
izing+0.042
iddled+0.041
pole+0.039
string+0.038
ledge+0.037
Neg logit contributions
 once-0.036
 least-0.032
 two-0.031
 End-0.030
 best-0.030
Token's
Token ID338
Feature activation-1.21e-02

Pos logit contributions
 once+0.042
 two+0.038
 best+0.033
 night+0.031
 least+0.031
Neg logit contributions
izing-0.084
iddled-0.082
ues-0.077
rier-0.077
quet-0.075
Token wrong
Token ID2642
Feature activation-2.91e-02

Pos logit contributions
 once+0.019
 least+0.017
 two+0.017
 End+0.016
 night+0.016
Neg logit contributions
iddled-0.020
izing-0.020
opter-0.018
ues-0.018
string-0.017
Token?
Token ID30
Feature activation+1.05e-02

Pos logit contributions
 once+0.046
 two+0.040
 least+0.039
 End+0.038
 next+0.037
Neg logit contributions
iddled-0.047
izing-0.047
pole-0.042
opter-0.042
string-0.041
Token Why
Token ID4162
Feature activation-5.74e-02

Pos logit contributions
izing+0.014
iddled+0.014
pole+0.012
opter+0.012
ledge+0.012
Neg logit contributions
 once-0.019
 least-0.017
 two-0.017
 End-0.016
 night-0.016
Token do
Token ID466
Feature activation-1.94e-02

Pos logit contributions
 once+0.069
 two+0.062
 least+0.056
 only+0.053
 best+0.052
Neg logit contributions
izing-0.117
iddled-0.116
quet-0.106
ues-0.105
ledge-0.104
Token you
Token ID345
Feature activation+1.23e-02

Pos logit contributions
 once+0.032
 two+0.030
 least+0.028
 End+0.027
 next+0.027
Neg logit contributions
iddled-0.030
izing-0.029
pole-0.026
ues-0.026
string-0.026
Token look
Token ID804
Feature activation+1.64e-02

Pos logit contributions
izing+0.019
iddled+0.019
ues+0.017
opter+0.017
pole+0.017
Neg logit contributions
 once-0.020
 least-0.018
 two-0.018
 End-0.017
 night-0.016
Token don
Token ID836
Feature activation-1.40e-02

Pos logit contributions
 once+0.005
 least+0.004
 two+0.004
 End+0.004
 night+0.004
Neg logit contributions
iddled-0.007
izing-0.007
ues-0.006
opter-0.006
pole-0.006
Token't
Token ID470
Feature activation-4.46e-03

Pos logit contributions
 once+0.019
 two+0.016
 least+0.015
 next+0.014
 End+0.014
Neg logit contributions
iddled-0.026
izing-0.026
ledge-0.024
ues-0.024
pole-0.024
Token wear
Token ID5806
Feature activation+2.64e-02

Pos logit contributions
 once+0.006
 two+0.005
 least+0.005
 night+0.005
m+0.005
Neg logit contributions
iddled-0.008
izing-0.008
ues-0.007
string-0.007
opter-0.007
Token dresses
Token ID27309
Feature activation+3.01e-02

Pos logit contributions
izing+0.042
iddled+0.042
opter+0.038
pole+0.038
string+0.037
Neg logit contributions
 once-0.041
 two-0.036
 least-0.035
 End-0.034
 night-0.032
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
izing+0.051
iddled+0.050
pole+0.046
opter+0.046
ues+0.045
Neg logit contributions
 once-0.044
 least-0.037
 two-0.037
 End-0.037
 night-0.034
Token This
Token ID770
Feature activation-2.52e-02

Pos logit contributions
izing+0.018
iddled+0.018
ues+0.016
pole+0.016
ledge+0.016
Neg logit contributions
 once-0.024
 least-0.022
 End-0.021
 two-0.021
 night-0.020
Token is
Token ID318
Feature activation-8.66e-03

Pos logit contributions
 once+0.032
 two+0.027
 least+0.026
 End+0.024
 night+0.024
Neg logit contributions
iddled-0.049
izing-0.049
ues-0.044
ledge-0.044
string-0.043
Token a
Token ID257
Feature activation+2.22e-02

Pos logit contributions
 once+0.014
 End+0.012
 least+0.012
m+0.012
 two+0.012
Neg logit contributions
izing-0.013
iddled-0.012
 ours-0.012
pole-0.011
ledge-0.011
Token silly
Token ID14397
Feature activation+5.74e-03

Pos logit contributions
izing+0.033
iddled+0.033
opter+0.030
pole+0.030
ues+0.029
Neg logit contributions
 once-0.036
 two-0.032
 least-0.032
 End-0.031
 night-0.029
Token doll
Token ID3654
Feature activation-1.91e-02

Pos logit contributions
iddled+0.008
izing+0.008
ues+0.007
string+0.007
ledge+0.007
Neg logit contributions
 once-0.010
 two-0.009
 least-0.009
 End-0.008
 night-0.008
Token."
Token ID526
Feature activation+2.32e-03

Pos logit contributions
 once+0.027
 two+0.023
 least+0.023
 End+0.023
 night+0.021
Neg logit contributions
izing-0.033
iddled-0.032
pole-0.029
opter-0.029
string-0.029
Token lemon
Token ID18873
Feature activation-6.35e-02

Pos logit contributions
 once+0.061
 two+0.051
m+0.047
 End+0.045
 next+0.044
Neg logit contributions
iddled-0.107
izing-0.101
ledge-0.097
pole-0.097
hips-0.096
Token all
Token ID477
Feature activation+3.13e-02

Pos logit contributions
 once+0.078
 two+0.061
 next+0.058
 best+0.055
 night+0.055
Neg logit contributions
iddled-0.137
izing-0.129
ledge-0.126
opter-0.126
 States-0.122
Token day
Token ID1110
Feature activation-2.24e-02

Pos logit contributions
izing+0.075
iddled+0.074
pole+0.072
string+0.069
ledge+0.069
Neg logit contributions
 once-0.021
 least-0.016
 two-0.016
 End-0.014
abel-0.012
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
 once+0.030
 least+0.027
 End+0.026
 two+0.026
m+0.024
Neg logit contributions
izing-0.040
iddled-0.038
pole-0.035
string-0.035
opter-0.035
Token He
Token ID679
Feature activation-2.39e-02

Pos logit contributions
 once+0.030
 two+0.026
 least+0.025
 End+0.024
 night+0.023
Neg logit contributions
izing-0.034
iddled-0.033
opter-0.031
ues-0.030
pole-0.030
Token did
Token ID750
Feature activation-2.81e-02

Pos logit contributions
 once+0.034
 End+0.030
 least+0.030
 two+0.029
m+0.028
Neg logit contributions
iddled-0.038
izing-0.038
pole-0.035
string-0.035
opter-0.034
Token not
Token ID407
Feature activation-1.57e-02

Pos logit contributions
 once+0.037
 two+0.031
 End+0.028
 least+0.028
 next+0.027
Neg logit contributions
iddled-0.055
izing-0.054
opter-0.052
pole-0.050
string-0.050
Token want
Token ID765
Feature activation-2.00e-02

Pos logit contributions
 once+0.023
 least+0.021
 two+0.020
 night+0.019
 end+0.019
Neg logit contributions
izing-0.028
iddled-0.028
opter-0.025
ues-0.024
ledge-0.024
Token to
Token ID284
Feature activation+1.57e-02

Pos logit contributions
 once+0.030
 End+0.027
 least+0.026
 two+0.026
m+0.025
Neg logit contributions
izing-0.031
iddled-0.030
string-0.028
pole-0.027
ues-0.027
Token take
Token ID1011
Feature activation-1.23e-02

Pos logit contributions
iddled+0.025
izing+0.025
opter+0.023
ues+0.022
pole+0.022
Neg logit contributions
 once-0.025
 least-0.022
 two-0.022
 End-0.021
 night-0.020
Token a
Token ID257
Feature activation+6.59e-03

Pos logit contributions
 once+0.020
 least+0.018
 End+0.018
 two+0.017
m+0.017
Neg logit contributions
izing-0.018
iddled-0.017
string-0.016
ledge-0.016
opter-0.016
Token asked
Token ID1965
Feature activation-7.91e-03

Pos logit contributions
 once+0.033
 two+0.028
 least+0.028
 End+0.027
m+0.026
Neg logit contributions
izing-0.038
iddled-0.037
opter-0.033
pole-0.033
ues-0.033
Token him
Token ID683
Feature activation+1.40e-02

Pos logit contributions
 once+0.010
 least+0.009
 End+0.009
 two+0.009
m+0.008
Neg logit contributions
iddled-0.014
izing-0.014
ledge-0.013
pole-0.013
ues-0.013
Token to
Token ID284
Feature activation+1.78e-02

Pos logit contributions
iddled+0.021
izing+0.021
ues+0.020
string+0.020
pole+0.019
Neg logit contributions
 once-0.022
 least-0.019
 End-0.019
 two-0.018
 night-0.017
Token keep
Token ID1394
Feature activation+1.50e-02

Pos logit contributions
izing+0.032
iddled+0.031
opter+0.029
ues+0.028
rier+0.028
Neg logit contributions
 once-0.026
 two-0.024
 least-0.023
 next-0.022
 only-0.021
Token the
Token ID262
Feature activation+3.02e-02

Pos logit contributions
iddled+0.021
izing+0.021
ledge+0.019
nam+0.019
 ours+0.019
Neg logit contributions
 once-0.025
 End-0.024
 least-0.022
m-0.022
abel-0.021
Token coal
Token ID5655
Feature activation-2.77e-02

Pos logit contributions
pole+0.037
string+0.036
izing+0.036
iddled+0.036
 ours+0.035
Neg logit contributions
 once-0.051
 End-0.048
 least-0.046
abel-0.046
 eventually-0.044
Token available
Token ID1695
Feature activation-1.75e-02

Pos logit contributions
 once+0.043
 two+0.038
 least+0.037
 End+0.035
 next+0.035
Neg logit contributions
iddled-0.045
izing-0.045
opter-0.040
ues-0.039
ledge-0.039
Token for
Token ID329
Feature activation-2.58e-02

Pos logit contributions
 once+0.022
 End+0.019
 least+0.019
 two+0.019
m+0.017
Neg logit contributions
izing-0.032
iddled-0.031
pole-0.029
string-0.029
ues-0.029
Token the
Token ID262
Feature activation+7.98e-03

Pos logit contributions
 once+0.039
m+0.037
mo+0.036
fl+0.036
 least+0.035
Neg logit contributions
izing-0.038
nam-0.035
iddled-0.034
 ours-0.034
resses-0.032
Token fire
Token ID2046
Feature activation-2.48e-02

Pos logit contributions
 violin+0.011
izing+0.011
 pawn+0.011
pole+0.011
string+0.011
Neg logit contributions
 once-0.011
abel-0.010
mo-0.010
 End-0.009
bie-0.009
Token.
Token ID13
Feature activation-9.63e-03

Pos logit contributions
 once+0.035
 End+0.033
 two+0.031
 least+0.031
abel+0.028
Neg logit contributions
izing-0.039
iddled-0.038
pole-0.038
string-0.036
opter-0.036
Token<|endoftext|>
Token ID50256
Feature activation+2.16e-03

Pos logit contributions
 once+0.003
 least+0.003
 End+0.003
 two+0.003
m+0.002
Neg logit contributions
iddled-0.005
izing-0.005
pole-0.005
ledge-0.005
taker-0.005
TokenOnce
Token ID7454
Feature activation-1.96e-02

Pos logit contributions
iddled+0.003
oon+0.003
uts+0.002
it+0.002
 cheered+0.002
Neg logit contributions
 least-0.004
mo-0.003
 End-0.003
abel-0.003
 injuries-0.003
Token upon
Token ID2402
Feature activation+2.19e-02

Pos logit contributions
 once+0.028
 least+0.028
 two+0.026
 End+0.025
fl+0.025
Neg logit contributions
iddled-0.028
izing-0.027
opter-0.026
pole-0.026
taker-0.025
Token a
Token ID257
Feature activation-2.79e-02

Pos logit contributions
iddled+0.038
izing+0.037
string+0.035
pole+0.034
 ours+0.034
Neg logit contributions
 once-0.026
 least-0.026
 End-0.024
mo-0.022
 two-0.021
Token time
Token ID640
Feature activation+1.81e-02

Pos logit contributions
 least+0.025
 once+0.024
fl+0.022
 two+0.021
 End+0.020
Neg logit contributions
izing-0.055
iding-0.052
pole-0.051
iddled-0.051
ues-0.051
Token,
Token ID11
Feature activation-2.73e-02

Pos logit contributions
izing+0.032
iddled+0.032
pole+0.029
opter+0.029
string+0.028
Neg logit contributions
 once-0.025
 two-0.022
 least-0.021
 End-0.021
m-0.019
Token there
Token ID612
Feature activation+7.43e-03

Pos logit contributions
 least+0.046
 once+0.045
 North+0.041
abel+0.041
 End+0.040
Neg logit contributions
iddled-0.034
izing-0.034
pole-0.032
igator-0.030
opter-0.030
Token was
Token ID373
Feature activation-3.77e-02

Pos logit contributions
iddled+0.013
izing+0.013
opter+0.012
pole+0.012
ledge+0.011
Neg logit contributions
 least-0.008
 once-0.008
 End-0.008
 two-0.007
mo-0.007
Token a
Token ID257
Feature activation+7.90e-03

Pos logit contributions
 least+0.057
mo+0.056
m+0.054
 End+0.053
 once+0.051
Neg logit contributions
iddled-0.047
 ours-0.046
izing-0.043
nam-0.043
string-0.043
Token little
Token ID1310
Feature activation+1.74e-02

Pos logit contributions
izing+0.013
iddled+0.012
opter+0.012
pole+0.011
string+0.011
Neg logit contributions
 once-0.012
 least-0.010
 two-0.010
 End-0.010
m-0.009
Token boy
Token ID2933
Feature activation-1.97e-02

Pos logit contributions
iddled+0.031
izing+0.031
ues+0.028
pole+0.028
ledge+0.028
Neg logit contributions
 once-0.024
 two-0.021
 least-0.020
 End-0.020
 night-0.019
Token.
Token ID13
Feature activation-6.13e-03

Pos logit contributions
 once+0.001
 least+0.001
 End+0.001
 two+0.001
m+0.001
Neg logit contributions
izing-0.002
iddled-0.002
pole-0.002
opter-0.002
string-0.002
Token Lily
Token ID20037
Feature activation+1.83e-02

Pos logit contributions
 once+0.006
 least+0.005
 two+0.005
 End+0.004
 night+0.004
Neg logit contributions
izing-0.014
iddled-0.013
pole-0.012
opter-0.012
string-0.012
Token observed
Token ID6515
Feature activation-7.36e-03

Pos logit contributions
iddled+0.033
izing+0.032
pole+0.030
opter+0.029
string+0.029
Neg logit contributions
 once-0.024
 least-0.021
 End-0.021
 two-0.020
 night-0.018
Token the
Token ID262
Feature activation+1.51e-03

Pos logit contributions
 least+0.012
 once+0.012
 End+0.011
day+0.010
 narrative+0.010
Neg logit contributions
iddled-0.010
izing-0.010
ledge-0.008
pole-0.008
string-0.008
Token snow
Token ID6729
Feature activation-5.53e-02

Pos logit contributions
 sound+0.002
 noise+0.002
izing+0.002
 shapes+0.002
 click+0.002
Neg logit contributions
 least-0.002
 End-0.002
abella-0.002
 once-0.002
mo-0.002
Tokenfl
Token ID2704
Feature activation-1.33e-01

Pos logit contributions
 once+0.072
 least+0.068
 End+0.066
 two+0.065
day+0.058
Neg logit contributions
izing-0.098
iddled-0.091
pole-0.089
string-0.087
opter-0.084
Tokenakes
Token ID1124
Feature activation+8.63e-04

Pos logit contributions
 End+0.263
 least+0.255
 moral+0.251
 two+0.250
 once+0.248
Neg logit contributions
izing-0.136
iddled-0.129
imm-0.126
opter-0.120
pole-0.114
Token falling
Token ID7463
Feature activation-4.15e-02

Pos logit contributions
izing+0.002
iddled+0.002
opter+0.001
pole+0.001
string+0.001
Neg logit contributions
 once-0.001
 two-0.001
 least-0.001
 End-0.001
 night-0.001
Token from
Token ID422
Feature activation-3.22e-02

Pos logit contributions
 once+0.072
 two+0.062
 least+0.060
 night+0.059
 next+0.058
Neg logit contributions
iddled-0.061
izing-0.060
opter-0.055
ues-0.055
ledge-0.054
Token the
Token ID262
Feature activation-3.82e-02

Pos logit contributions
 once+0.045
m+0.042
abel+0.041
 End+0.040
 two+0.040
Neg logit contributions
izing-0.050
iddled-0.045
adoes-0.043
 ours-0.043
pole-0.043
Token sky
Token ID6766
Feature activation+1.98e-03

Pos logit contributions
 once+0.056
 End+0.054
 least+0.051
abel+0.050
day+0.049
Neg logit contributions
pole-0.051
izing-0.049
string-0.049
 flow-0.048
igator-0.047
Token cream
Token ID8566
Feature activation-4.47e-02

Pos logit contributions
 once+0.128
 least+0.117
 two+0.115
 End+0.114
m+0.106
Neg logit contributions
izing-0.077
iddled-0.075
pole-0.066
opter-0.066
ues-0.065
Token appeared
Token ID4120
Feature activation-1.22e-02

Pos logit contributions
 once+0.070
 least+0.061
 two+0.061
 End+0.060
m+0.055
Neg logit contributions
izing-0.070
iddled-0.069
pole-0.062
string-0.062
opter-0.062
Token in
Token ID287
Feature activation+6.52e-03

Pos logit contributions
 once+0.016
 least+0.014
 End+0.014
 two+0.014
m+0.013
Neg logit contributions
izing-0.022
iddled-0.021
pole-0.020
string-0.020
ledge-0.019
Token Jack
Token ID3619
Feature activation-2.48e-02

Pos logit contributions
izing+0.012
iddled+0.011
ledge+0.010
string+0.010
pole+0.010
Neg logit contributions
 once-0.008
 End-0.007
 least-0.007
m-0.007
mo-0.007
Tokenâ
Token ID22940
Feature activation-5.86e-02

Pos logit contributions
 once+0.031
 End+0.029
 least+0.028
 two+0.026
m+0.024
Neg logit contributions
izing-0.043
iddled-0.042
pole-0.041
string-0.040
ledge-0.039
Token€
Token ID26391
Feature activation-9.51e-02

Pos logit contributions
 once+0.088
 two+0.077
 least+0.075
 End+0.074
m+0.069
Neg logit contributions
izing-0.096
iddled-0.095
pole-0.086
ledge-0.086
string-0.085
Tokenâ„¢
Token ID8151
Feature activation-3.30e-02

Pos logit contributions
 once+0.153
 two+0.134
 least+0.133
 End+0.130
 night+0.122
Neg logit contributions
izing-0.146
iddled-0.144
pole-0.129
opter-0.129
string-0.128
Tokens
Token ID82
Feature activation-3.52e-02

Pos logit contributions
 once+0.048
 least+0.047
 End+0.046
 two+0.042
 night+0.042
Neg logit contributions
iddled-0.048
izing-0.048
pole-0.048
uts-0.046
opter-0.045
Token hands
Token ID2832
Feature activation+4.07e-02

Pos logit contributions
 once+0.056
 least+0.049
 End+0.048
 two+0.048
m+0.046
Neg logit contributions
izing-0.052
iddled-0.052
pole-0.047
ledge-0.047
string-0.047
Token.
Token ID13
Feature activation-2.85e-03

Pos logit contributions
izing+0.071
iddled+0.070
pole+0.066
string+0.065
ledge+0.063
Neg logit contributions
 once-0.052
 least-0.049
 End-0.048
 two-0.045
 night-0.041
Token Jack
Token ID3619
Feature activation+1.89e-02

Pos logit contributions
 once+0.004
 two+0.004
 least+0.004
 End+0.004
 next+0.003
Neg logit contributions
izing-0.005
iddled-0.005
opter-0.004
ues-0.004
pole-0.004
Token were
Token ID547
Feature activation-6.08e-03

Pos logit contributions
 once+0.007
 End+0.007
fl+0.007
 least+0.006
 next+0.006
Neg logit contributions
iddled-0.007
pole-0.007
string-0.006
taker-0.006
igator-0.006
Token off
Token ID572
Feature activation+1.93e-02

Pos logit contributions
 once+0.009
 least+0.008
 End+0.008
m+0.008
fl+0.008
Neg logit contributions
iddled-0.009
pole-0.008
izing-0.008
 ours-0.008
ledge-0.008
Token to
Token ID284
Feature activation+4.44e-03

Pos logit contributions
iddled+0.026
izing+0.025
 impressed+0.023
ooting+0.023
 flow+0.023
Neg logit contributions
 End-0.027
 least-0.025
 once-0.025
mo-0.025
abel-0.024
Token start
Token ID923
Feature activation-1.38e-02

Pos logit contributions
 cheer+0.005
 click+0.005
ledge+0.004
 flow+0.004
 ours+0.004
Neg logit contributions
m-0.008
 once-0.007
 End-0.007
mo-0.007
oral-0.007
Token their
Token ID511
Feature activation-2.64e-02

Pos logit contributions
 End+0.021
 least+0.021
 once+0.020
m+0.019
mo+0.019
Neg logit contributions
iddled-0.019
izing-0.017
ledge-0.017
ues-0.016
 flow-0.015
Token next
Token ID1306
Feature activation-9.80e-02

Pos logit contributions
 once+0.041
 End+0.037
m+0.037
 least+0.036
mo+0.036
Neg logit contributions
izing-0.035
 ours-0.031
 flow-0.030
 violin-0.029
 trophy-0.029
Token adventure
Token ID8855
Feature activation+1.62e-03

Pos logit contributions
 once+0.141
 End+0.140
m+0.131
 least+0.127
 two+0.119
Neg logit contributions
izing-0.148
ledge-0.128
pole-0.125
quet-0.124
resses-0.124
Token together
Token ID1978
Feature activation-1.45e-02

Pos logit contributions
izing+0.003
iddled+0.002
pole+0.002
ues+0.002
string+0.002
Neg logit contributions
 End-0.002
 once-0.002
 least-0.002
 two-0.002
 moral-0.002
Token.
Token ID13
Feature activation-4.09e-03

Pos logit contributions
 once+0.020
 End+0.020
 least+0.020
 two+0.019
mo+0.018
Neg logit contributions
izing-0.022
iddled-0.020
pole-0.019
ues-0.019
string-0.019
Token<|endoftext|>
Token ID50256
Feature activation+1.03e-02

Pos logit contributions
 least+0.005
 once+0.005
 End+0.005
 end+0.005
mo+0.005
Neg logit contributions
iddled-0.007
 Watch-0.006
ues-0.006
ledge-0.006
izing-0.006
TokenOnce
Token ID7454
Feature activation-1.60e-02

Pos logit contributions
iddled+0.017
oon+0.016
opter+0.014
it+0.014
uts+0.014
Neg logit contributions
 least-0.015
 End-0.012
 injuries-0.012
mo-0.012
 best-0.011
Token man
Token ID582
Feature activation-1.88e-02

Pos logit contributions
 least+0.009
 once+0.009
abel+0.009
mo+0.008
bie+0.008
Neg logit contributions
rier-0.005
 trophy-0.005
 sound-0.005
 click-0.005
 noise-0.005
Token grabbed
Token ID13176
Feature activation+2.77e-03

Pos logit contributions
 End+0.027
 once+0.026
 least+0.025
day+0.025
 night+0.024
Neg logit contributions
izing-0.027
pole-0.027
iddled-0.026
string-0.025
opter-0.023
Token their
Token ID511
Feature activation-1.03e-02

Pos logit contributions
iddled+0.004
izing+0.004
 ours+0.004
string+0.004
ledge+0.004
Neg logit contributions
 once-0.004
 End-0.004
m-0.004
 least-0.004
day-0.003
Token ice
Token ID4771
Feature activation-8.42e-02

Pos logit contributions
 once+0.014
 End+0.013
 narrative+0.013
abel+0.013
Sometimes+0.013
Neg logit contributions
 ours-0.014
nam-0.014
izing-0.013
 brace-0.013
 trophy-0.013
Token-
Token ID12
Feature activation-1.01e-02

Pos logit contributions
 once+0.167
 End+0.162
 least+0.159
 two+0.154
day+0.144
Neg logit contributions
izing-0.091
iddled-0.084
opter-0.075
rier-0.073
pole-0.073
Tokencream
Token ID36277
Feature activation-1.29e-01

Pos logit contributions
 once+0.018
 End+0.017
 least+0.017
 two+0.016
 best+0.016
Neg logit contributions
izing-0.013
iddled-0.013
string-0.011
pole-0.011
opter-0.011
Token away
Token ID1497
Feature activation+3.00e-02

Pos logit contributions
 once+0.177
 End+0.157
 least+0.153
 two+0.153
 night+0.136
Neg logit contributions
izing-0.222
iddled-0.215
pole-0.199
string-0.199
opter-0.198
Token and
Token ID290
Feature activation-3.07e-03

Pos logit contributions
izing+0.053
iddled+0.051
pole+0.047
string+0.047
ues+0.046
Neg logit contributions
 once-0.039
 End-0.037
 least-0.035
 two-0.033
day-0.031
Token took
Token ID1718
Feature activation-8.94e-03

Pos logit contributions
 once+0.005
 End+0.004
 least+0.004
 two+0.004
m+0.004
Neg logit contributions
izing-0.005
iddled-0.005
opter-0.004
pole-0.004
string-0.004
Token the
Token ID262
Feature activation+6.11e-03

Pos logit contributions
 once+0.014
 End+0.014
 least+0.013
m+0.013
 next+0.012
Neg logit contributions
iddled-0.013
izing-0.012
string-0.011
opter-0.011
ledge-0.011
Token cone
Token ID27763
Feature activation-5.71e-02

Pos logit contributions
rier+0.008
 trophy+0.008
 click+0.008
izing+0.008
 ours+0.008
Neg logit contributions
 once-0.008
 least-0.008
abel-0.008
 End-0.008
day-0.007
Token house
Token ID2156
Feature activation+1.08e-02

Pos logit contributions
 once+0.042
abella+0.039
mo+0.039
bie+0.037
 End+0.037
Neg logit contributions
izing-0.026
 frame-0.026
 violin-0.025
 flow-0.024
 click-0.024
Token and
Token ID290
Feature activation-4.15e-03

Pos logit contributions
izing+0.019
iddled+0.019
pole+0.017
ues+0.017
ledge+0.017
Neg logit contributions
 once-0.014
 End-0.013
 least-0.013
 two-0.013
m-0.011
Token drank
Token ID24070
Feature activation+8.57e-03

Pos logit contributions
 once+0.006
 End+0.006
 least+0.006
 moral+0.006
mo+0.005
Neg logit contributions
iddled-0.006
izing-0.005
nam-0.005
pole-0.005
ledge-0.005
Token some
Token ID617
Feature activation-7.15e-03

Pos logit contributions
izing+0.014
iddled+0.014
taker+0.013
string+0.012
pole+0.012
Neg logit contributions
 once-0.011
 least-0.011
 End-0.011
mo-0.010
 two-0.010
Token hot
Token ID3024
Feature activation-1.13e-02

Pos logit contributions
 once+0.011
 least+0.010
 End+0.010
 two+0.009
mo+0.009
Neg logit contributions
izing-0.011
iddled-0.010
taker-0.009
quet-0.009
opter-0.009
Token chocolate
Token ID11311
Feature activation-9.56e-02

Pos logit contributions
 once+0.013
 least+0.013
 End+0.012
 two+0.011
 eventually+0.011
Neg logit contributions
izing-0.020
iddled-0.020
quet-0.018
opter-0.018
taker-0.017
Token until
Token ID1566
Feature activation-3.33e-02

Pos logit contributions
 once+0.130
 End+0.124
 least+0.123
 two+0.116
day+0.103
Neg logit contributions
izing-0.165
iddled-0.152
pole-0.144
quet-0.142
opter-0.142
Token the
Token ID262
Feature activation-2.50e-02

Pos logit contributions
 once+0.055
mo+0.053
 least+0.052
 End+0.050
m+0.050
Neg logit contributions
izing-0.043
iddled-0.040
 flow-0.035
 ours-0.035
string-0.035
Token storm
Token ID6388
Feature activation-1.39e-03

Pos logit contributions
 once+0.041
 least+0.040
mo+0.038
bie+0.037
abella+0.037
Neg logit contributions
izing-0.029
 flow-0.028
 click-0.028
taker-0.027
 noise-0.026
Token had
Token ID550
Feature activation-3.74e-02

Pos logit contributions
 once+0.002
 least+0.002
 two+0.002
 End+0.002
 best+0.002
Neg logit contributions
izing-0.002
iddled-0.002
opter-0.002
pole-0.002
string-0.002
Token passed
Token ID3804
Feature activation-2.69e-02

Pos logit contributions
 once+0.053
 two+0.045
 least+0.043
 End+0.041
 night+0.040
Neg logit contributions
izing-0.067
iddled-0.066
opter-0.060
ledge-0.060
ues-0.060