TOP ACTIVATIONS
MAX = 0.117

. Dad said it was called golf . He said he
it with me ?" Anna says . Ben looks
called Sam and one was called Alex . They were both
apples , please ?" Anna says . Mr .
to see it ?" dad says . L ily
join me ?" the voice says . Anna and
a judge is ?" Mom says . L ily
Bubble and the other was called Sl ap . Bubble was
book . The book was called a dictionary . It had
be my friends ?" Anna says . L ily
what is inside ?" she says . " Yes
play with me ?" grandma says . S ara
want to try ?" Mom says . Tom and
you hear me ?" Anna says into the microphone . Her
not know what it was called . It was a big
be cow boys and cow girls . They put on hats
picture with me ?" Ben says . M ia
big place . It is called a university . There are
They learned that this was called the tide .
useful . This one is called quartz , and it is

MIN ACTIVATIONS
MIN = -0.159

the tree to its once former beauty . He was happy
the village back to its former glory !"
back the village to its former state . With the help
to be restored to its former glory ."
, leaving it to its former state .
the pony went off on even greater adventures . He was
to stay in . So even though the sc ooter was
go ! Everyone watched in amaz ement as it took off
to climb again , reaching even higher than before . When
spread her love for teaching even more . She thought ,
her mom my watched in amaz ement as the machine flew
egg ! He watched in amaz ement as the egg began
eddy bear and watched in amaz ement as it shrunk an
ered in , blinking in amaz ement . But
big before ! She could even sit inside it .
continued to work , adding even more bubbles each day making
, the mighty bees worked even harder to make more honey
felt so sad and tried even harder . But
man drove off in his brand new car , and he
and be able to help even more people . <|endoftext|> Once

INTERVAL 0.048 - 0.117
CONTAINS 2.364%

his friends and be safe when he wanted to take something
, Sarah 's dad said to her , " Let 's
restore our loop ," he says . " You hold the
sorry to Sara ." Mom says , giving the phone to
to my party ?" Sara said . " Of

INTERVAL -0.021 - 0.048
CONTAINS 79.840%

it knew it had done a mighty job . It had
slowly walked up to it . The cat was scared and
huge wind started to blow and it scared him even more
Dad angry . They go to the kitchen and
One day , L ulu saw an exciting

INTERVAL -0.090 - -0.021
CONTAINS 17.758%

The two friends enjoyed walking down the hidden path together .
. He put on his brave face and marched to the
the forest . The fox wanted to eat the b erry
are scared . They hold Mom 's hand and hide behind
Maybe ." Tom said . " But only if we clean

INTERVAL -0.159 - -0.090
CONTAINS 0.038%

could . The moral of the story is that
. The moral of the story is :
, caught bugs , and even found a hidden treasure .
let Lily ride on its back . And whenever Lily felt
a word . After a few moments , the voice spoke
Token.
Token ID13
Feature activation+1.82e-02

Pos logit contributions
 may+0.033
 seem+0.032
 Win+0.031
 opener+0.029
 lik+0.029
Neg logit contributions
edly-0.023
ful-0.021
efully-0.020
 each-0.020
 least-0.020
Token Dad
Token ID17415
Feature activation-2.19e-02

Pos logit contributions
 seem+0.031
fters+0.030
 may+0.030
'd+0.029
 lur+0.029
Neg logit contributions
 each-0.021
 Dad-0.020
edly-0.019
efully-0.018
der-0.016
Token said
Token ID531
Feature activation+7.80e-02

Pos logit contributions
 each+0.023
efully+0.021
 now+0.020
 then+0.020
 least+0.020
Neg logit contributions
 seem-0.041
 Win-0.039
 may-0.038
fters-0.038
 trigger-0.038
Token it
Token ID340
Feature activation+2.64e-02

Pos logit contributions
ches+0.122
 opener+0.119
 ramp+0.117
 may+0.115
 Win+0.114
Neg logit contributions
ful-0.097
edly-0.092
 each-0.090
 now-0.080
efully-0.080
Token was
Token ID373
Feature activation+2.70e-02

Pos logit contributions
fters+0.049
 Win+0.046
 seem+0.043
 lik+0.042
 respond+0.042
Neg logit contributions
 each-0.026
edly-0.026
 now-0.023
efully-0.023
ful-0.022
Token called
Token ID1444
Feature activation+1.17e-01

Pos logit contributions
 may+0.059
 seem+0.058
ologists+0.056
ches+0.055
fters+0.055
Neg logit contributions
 each-0.018
 least-0.018
edly-0.015
ful-0.014
efully-0.014
Token golf
Token ID13126
Feature activation+5.50e-02

Pos logit contributions
 may+0.204
 seem+0.197
 opener+0.187
 Win+0.184
'd+0.181
Neg logit contributions
edly-0.137
 least-0.126
efully-0.126
ful-0.121
 each-0.110
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
 seem+0.084
 may+0.081
fters+0.076
 lik+0.076
 Dame+0.073
Neg logit contributions
edly-0.072
efully-0.069
 each-0.069
ful-0.067
 least-0.066
Token He
Token ID679
Feature activation+1.08e-02

Pos logit contributions
 seem+0.026
 may+0.025
 lur+0.025
fters+0.025
'd+0.025
Neg logit contributions
 each-0.013
 Dad-0.013
edly-0.012
efully-0.012
der-0.010
Token said
Token ID531
Feature activation+7.56e-02

Pos logit contributions
 seem+0.014
 may+0.014
 Win+0.013
 opener+0.013
 ramp+0.012
Neg logit contributions
edly-0.017
efully-0.016
 least-0.016
 each-0.015
ful-0.015
Token he
Token ID339
Feature activation+3.26e-02

Pos logit contributions
 may+0.098
 seem+0.095
 opener+0.095
 Win+0.090
ches+0.089
Neg logit contributions
edly-0.116
ful-0.111
efully-0.107
 least-0.106
 each-0.104
Token it
Token ID340
Feature activation+4.93e-03

Pos logit contributions
edly+0.002
efully+0.002
 least+0.002
ful+0.001
 former+0.001
Neg logit contributions
 may-0.001
 seem-0.001
 opener-0.001
 Win-0.001
 hop-0.001
Token with
Token ID351
Feature activation+3.88e-02

Pos logit contributions
 seem+0.005
 opener+0.005
 may+0.005
 hop+0.005
 shoot+0.005
Neg logit contributions
edly-0.010
efully-0.009
years-0.008
 former-0.008
 least-0.008
Token me
Token ID502
Feature activation+1.98e-02

Pos logit contributions
 may+0.049
 might+0.043
 hop+0.043
 shoot+0.043
 Win+0.043
Neg logit contributions
edly-0.074
efully-0.064
der-0.061
 former-0.061
ment-0.058
Token?"
Token ID1701
Feature activation+2.93e-02

Pos logit contributions
 may+0.017
 opener+0.016
 seem+0.015
ches+0.015
 hop+0.014
Neg logit contributions
edly-0.043
efully-0.040
 former-0.037
 least-0.036
years-0.036
Token Anna
Token ID11735
Feature activation+2.86e-02

Pos logit contributions
 may+0.039
 seem+0.037
 opener+0.036
 Win+0.035
 ramp+0.035
Neg logit contributions
edly-0.046
efully-0.043
 least-0.043
ful-0.041
 each-0.040
Token says
Token ID1139
Feature activation+1.15e-01

Pos logit contributions
 may+0.049
ves+0.046
'd+0.045
 ramp+0.045
 opener+0.045
Neg logit contributions
efully-0.034
edly-0.032
 each-0.031
 least-0.030
der-0.027
Token.
Token ID13
Feature activation+4.06e-02

Pos logit contributions
 may+0.220
 ramp+0.203
 Win+0.203
 seem+0.200
'd+0.195
Neg logit contributions
edly-0.133
ful-0.124
 each-0.107
efully-0.104
 least-0.095
Token
Token ID198
Feature activation+3.14e-02

Pos logit contributions
 may+0.055
 seem+0.053
 opener+0.050
 Win+0.050
 hop+0.047
Neg logit contributions
edly-0.063
 least-0.060
efully-0.060
ful-0.057
 former-0.054
Token
Token ID198
Feature activation+2.54e-02

Pos logit contributions
 may+0.048
 seem+0.047
 Win+0.044
 opener+0.044
'd+0.043
Neg logit contributions
edly-0.042
efully-0.040
 least-0.039
ful-0.038
 each-0.036
TokenBen
Token ID11696
Feature activation+2.21e-02

Pos logit contributions
 may+0.033
 seem+0.031
 opener+0.029
 Win+0.029
'd+0.027
Neg logit contributions
edly-0.042
efully-0.041
 least-0.040
ful-0.039
 each-0.037
Token looks
Token ID3073
Feature activation+4.19e-03

Pos logit contributions
 seem+0.030
 may+0.029
 Win+0.029
 opener+0.029
fters+0.026
Neg logit contributions
efully-0.032
edly-0.032
 least-0.031
 each-0.029
der-0.028
Token called
Token ID1444
Feature activation+1.03e-01

Pos logit contributions
 may+0.026
 seem+0.025
 opener+0.023
'd+0.023
 Win+0.023
Neg logit contributions
edly-0.014
 least-0.013
efully-0.013
ful-0.012
 each-0.012
Token Sam
Token ID3409
Feature activation+7.67e-02

Pos logit contributions
 may+0.136
 seem+0.130
 opener+0.121
 Win+0.119
'd+0.115
Neg logit contributions
edly-0.166
 least-0.157
efully-0.156
ful-0.152
 former-0.142
Token and
Token ID290
Feature activation+5.54e-02

Pos logit contributions
 may+0.140
 seem+0.138
 Win+0.131
 opener+0.129
 ramp+0.125
Neg logit contributions
edly-0.083
 least-0.077
ful-0.076
efully-0.075
 each-0.071
Token one
Token ID530
Feature activation-1.41e-02

Pos logit contributions
 may+0.075
 seem+0.072
 opener+0.068
 Win+0.067
 ramp+0.063
Neg logit contributions
edly-0.088
 least-0.083
efully-0.083
ful-0.079
 former-0.074
Token was
Token ID373
Feature activation+3.33e-02

Pos logit contributions
edly+0.022
 least+0.021
efully+0.020
 former+0.019
ful+0.018
Neg logit contributions
 may-0.020
 seem-0.018
 opener-0.018
 Win-0.018
 hop-0.018
Token called
Token ID1444
Feature activation+1.14e-01

Pos logit contributions
 may+0.063
 seem+0.059
 opener+0.056
'd+0.055
 Win+0.054
Neg logit contributions
edly-0.034
 least-0.034
ful-0.033
efully-0.033
 each-0.031
Token Alex
Token ID4422
Feature activation+6.78e-02

Pos logit contributions
 may+0.159
 seem+0.146
'd+0.138
ches+0.130
 opener+0.130
Neg logit contributions
edly-0.170
ful-0.168
 least-0.160
efully-0.160
 each-0.157
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
 may+0.110
 seem+0.109
 opener+0.103
 Win+0.103
 ramp+0.100
Neg logit contributions
edly-0.087
ful-0.081
 least-0.080
efully-0.080
 each-0.077
Token They
Token ID1119
Feature activation+3.92e-02

Pos logit contributions
fters+0.025
ves+0.024
'd+0.024
 may+0.023
 seem+0.023
Neg logit contributions
 each-0.016
efully-0.015
 Dad-0.013
 least-0.012
edly-0.011
Token were
Token ID547
Feature activation+1.63e-02

Pos logit contributions
 may+0.051
 seem+0.050
 opener+0.047
 Win+0.047
 ramp+0.044
Neg logit contributions
edly-0.062
 least-0.060
efully-0.060
ful-0.056
 each-0.054
Token both
Token ID1111
Feature activation+2.18e-02

Pos logit contributions
 may+0.034
 opener+0.032
 seem+0.032
ches+0.030
 ramp+0.030
Neg logit contributions
efully-0.015
 each-0.014
 least-0.014
edly-0.013
ful-0.012
Token apples
Token ID22514
Feature activation-1.71e-03

Pos logit contributions
edly+0.031
efully+0.026
 least+0.026
ful+0.026
years+0.026
Neg logit contributions
 may-0.020
 opener-0.019
 seem-0.018
 might-0.017
 hop-0.016
Token,
Token ID11
Feature activation+2.52e-02

Pos logit contributions
edly+0.004
efully+0.003
years+0.003
 former+0.003
 example+0.003
Neg logit contributions
 opener-0.002
 may-0.002
 seem-0.002
 Win-0.001
 shoot-0.001
Token please
Token ID3387
Feature activation+4.79e-03

Pos logit contributions
 may+0.031
 seem+0.031
 Win+0.029
 opener+0.029
 hop+0.027
Neg logit contributions
edly-0.045
efully-0.042
 least-0.040
ful-0.039
 former-0.037
Token?"
Token ID1701
Feature activation+3.04e-02

Pos logit contributions
 may+0.007
 seem+0.007
 opener+0.006
 Win+0.006
 hop+0.006
Neg logit contributions
edly-0.007
efully-0.007
 least-0.007
ful-0.006
 former-0.006
Token Anna
Token ID11735
Feature activation+2.93e-02

Pos logit contributions
 may+0.042
 seem+0.039
 opener+0.037
 Win+0.037
'd+0.036
Neg logit contributions
edly-0.047
 least-0.044
efully-0.044
ful-0.042
 each-0.040
Token says
Token ID1139
Feature activation+1.13e-01

Pos logit contributions
 may+0.045
 seem+0.043
 opener+0.040
'd+0.040
 Win+0.040
Neg logit contributions
edly-0.041
efully-0.039
 least-0.038
 each-0.035
ful-0.035
Token.
Token ID13
Feature activation+4.73e-02

Pos logit contributions
 may+0.220
 ramp+0.203
 seem+0.203
 Win+0.201
'd+0.195
Neg logit contributions
edly-0.122
ful-0.117
 each-0.103
efully-0.097
 least-0.092
Token
Token ID198
Feature activation+3.47e-02

Pos logit contributions
 may+0.062
 seem+0.060
 opener+0.057
 Win+0.056
 hop+0.053
Neg logit contributions
edly-0.076
 least-0.072
efully-0.072
ful-0.068
 former-0.066
Token
Token ID198
Feature activation+2.84e-02

Pos logit contributions
 may+0.051
 seem+0.049
 opener+0.046
 Win+0.046
'd+0.044
Neg logit contributions
edly-0.050
efully-0.047
 least-0.047
ful-0.045
 each-0.042
TokenMr
Token ID5246
Feature activation+4.98e-02

Pos logit contributions
 may+0.039
 seem+0.038
 opener+0.036
 Win+0.036
 hop+0.033
Neg logit contributions
edly-0.044
 least-0.041
efully-0.041
ful-0.039
 former-0.037
Token.
Token ID13
Feature activation+2.16e-02

Pos logit contributions
 may+0.081
 seem+0.078
 Win+0.076
fters+0.074
 hop+0.072
Neg logit contributions
edly-0.065
efully-0.059
 least-0.058
ful-0.056
 each-0.054
Token to
Token ID284
Feature activation+3.45e-02

Pos logit contributions
edly+0.062
efully+0.058
 former+0.054
 least+0.049
ths+0.048
Neg logit contributions
 opener-0.039
 may-0.039
 shoot-0.037
 seem-0.036
 Win-0.034
Token see
Token ID766
Feature activation-1.33e-03

Pos logit contributions
 seem+0.026
 may+0.026
 shoot+0.024
 Win+0.023
 hop+0.022
Neg logit contributions
edly-0.083
efully-0.078
years-0.071
ful-0.071
der-0.070
Token it
Token ID340
Feature activation+2.25e-02

Pos logit contributions
edly+0.003
 former+0.003
efully+0.003
keeping+0.003
amed+0.003
Neg logit contributions
 may-0.001
 shoot-0.001
 might-0.001
 seem-0.001
 Win-0.001
Token?"
Token ID1701
Feature activation+2.90e-02

Pos logit contributions
 may+0.025
 seem+0.025
 opener+0.025
 hop+0.023
 shoot+0.023
Neg logit contributions
edly-0.044
efully-0.041
years-0.036
ful-0.035
 least-0.035
Token dad
Token ID9955
Feature activation-1.98e-02

Pos logit contributions
 may+0.053
 seem+0.052
 Win+0.049
 opener+0.049
 hop+0.047
Neg logit contributions
edly-0.033
efully-0.030
 least-0.029
ful-0.027
 former-0.026
Token says
Token ID1139
Feature activation+1.13e-01

Pos logit contributions
efully+0.023
edly+0.021
 each+0.020
ful+0.020
 least+0.020
Neg logit contributions
 may-0.035
'd-0.033
 ramp-0.031
 seem-0.031
ves-0.030
Token.
Token ID13
Feature activation+3.83e-02

Pos logit contributions
 may+0.218
 Win+0.197
'd+0.195
 seem+0.194
 ramp+0.192
Neg logit contributions
edly-0.134
ful-0.124
efully-0.106
 each-0.105
der-0.098
Token
Token ID198
Feature activation+2.99e-02

Pos logit contributions
 may+0.051
 seem+0.049
 opener+0.047
 Win+0.046
 hop+0.043
Neg logit contributions
edly-0.061
 least-0.058
efully-0.058
ful-0.055
 former-0.053
Token
Token ID198
Feature activation+2.39e-02

Pos logit contributions
 may+0.045
 seem+0.044
 Win+0.041
 opener+0.041
'd+0.039
Neg logit contributions
edly-0.042
efully-0.040
 least-0.039
ful-0.038
 each-0.035
TokenL
Token ID43
Feature activation-1.19e-02

Pos logit contributions
 may+0.033
 seem+0.032
 opener+0.031
 Win+0.030
 hop+0.029
Neg logit contributions
edly-0.037
 least-0.035
efully-0.034
ful-0.033
 former-0.031
Tokenily
Token ID813
Feature activation+5.34e-03

Pos logit contributions
edly+0.018
 least+0.016
efully+0.016
ful+0.015
 former+0.015
Neg logit contributions
 may-0.017
 seem-0.017
 opener-0.016
 Win-0.016
 lik-0.015
Token join
Token ID4654
Feature activation+1.79e-02

Pos logit contributions
 seem+0.027
 may+0.025
 Win+0.023
 shoot+0.022
 hop+0.022
Neg logit contributions
edly-0.086
efully-0.077
ful-0.072
der-0.072
 least-0.072
Token me
Token ID502
Feature activation+1.35e-03

Pos logit contributions
 may+0.025
 Win+0.025
 seem+0.025
 hop+0.024
 shoot+0.023
Neg logit contributions
edly-0.029
 former-0.025
 least-0.024
der-0.024
efully-0.024
Token?"
Token ID1701
Feature activation+3.16e-02

Pos logit contributions
 opener+0.001
 hop+0.001
 Win+0.001
 shoot+0.001
 seem+0.001
Neg logit contributions
edly-0.003
efully-0.003
der-0.003
 least-0.003
ths-0.003
Token the
Token ID262
Feature activation+3.17e-02

Pos logit contributions
 may+0.048
 seem+0.045
'd+0.045
 opener+0.044
 ramp+0.043
Neg logit contributions
 least-0.041
edly-0.040
 each-0.040
efully-0.039
ful-0.037
Token voice
Token ID3809
Feature activation+4.71e-02

Pos logit contributions
 may+0.038
 seem+0.036
 opener+0.034
 Win+0.034
 hop+0.033
Neg logit contributions
edly-0.055
 least-0.053
efully-0.053
ful-0.050
 former-0.049
Token says
Token ID1139
Feature activation+1.13e-01

Pos logit contributions
 may+0.063
 seem+0.060
 opener+0.057
 Win+0.056
 ramp+0.054
Neg logit contributions
edly-0.074
efully-0.071
 least-0.069
ful-0.066
 each-0.063
Token.
Token ID13
Feature activation+3.99e-02

Pos logit contributions
 may+0.227
 seem+0.206
 ramp+0.205
'd+0.204
fters+0.202
Neg logit contributions
edly-0.124
ful-0.111
efully-0.097
 each-0.096
ment-0.089
Token
Token ID198
Feature activation+3.19e-02

Pos logit contributions
 may+0.067
 seem+0.064
 ramp+0.061
 Win+0.060
fters+0.060
Neg logit contributions
edly-0.049
efully-0.044
ful-0.042
 each-0.040
 least-0.040
Token
Token ID198
Feature activation+2.64e-02

Pos logit contributions
 may+0.048
 seem+0.047
 opener+0.044
 Win+0.044
'd+0.042
Neg logit contributions
edly-0.044
efully-0.042
 least-0.041
ful-0.040
 each-0.038
TokenAnna
Token ID31160
Feature activation+1.48e-02

Pos logit contributions
 may+0.037
 seem+0.036
 opener+0.033
 Win+0.033
 hop+0.031
Neg logit contributions
edly-0.041
efully-0.038
 least-0.038
ful-0.036
 former-0.034
Token and
Token ID290
Feature activation+5.34e-02

Pos logit contributions
 may+0.026
 seem+0.025
 opener+0.025
 Win+0.024
 lik+0.023
Neg logit contributions
edly-0.017
efully-0.016
 least-0.016
ful-0.014
 each-0.013
Token a
Token ID257
Feature activation+1.49e-02

Pos logit contributions
edly+0.013
efully+0.012
 former+0.012
 least+0.011
ful+0.011
Neg logit contributions
 may-0.009
 seem-0.009
 Win-0.008
 might-0.008
 hop-0.008
Token judge
Token ID5052
Feature activation-9.10e-03

Pos logit contributions
 may+0.018
 seem+0.016
 Win+0.015
 hop+0.015
 shoot+0.014
Neg logit contributions
edly-0.028
efully-0.027
ful-0.026
 least-0.025
years-0.024
Token is
Token ID318
Feature activation+2.68e-02

Pos logit contributions
edly+0.017
efully+0.016
 least+0.016
 former+0.015
ful+0.015
Neg logit contributions
 may-0.010
 seem-0.010
 Win-0.009
 opener-0.009
 hop-0.008
Token?"
Token ID1701
Feature activation+2.39e-02

Pos logit contributions
 may+0.034
 seem+0.033
 Win+0.031
 opener+0.030
 shoot+0.029
Neg logit contributions
edly-0.045
efully-0.043
 least-0.041
ful-0.040
 former-0.038
Token Mom
Token ID11254
Feature activation-1.58e-02

Pos logit contributions
 may+0.038
 seem+0.037
 opener+0.035
 Win+0.035
 hop+0.033
Neg logit contributions
edly-0.032
efully-0.030
 least-0.030
ful-0.028
 former-0.027
Token says
Token ID1139
Feature activation+1.12e-01

Pos logit contributions
efully+0.019
 least+0.017
 each+0.016
edly+0.016
ful+0.015
Neg logit contributions
 may-0.026
 seem-0.025
 Win-0.025
 ramp-0.025
ves-0.024
Token.
Token ID13
Feature activation+3.21e-02

Pos logit contributions
 may+0.214
 Win+0.201
 ramp+0.196
 seem+0.194
'd+0.191
Neg logit contributions
edly-0.134
ful-0.123
 each-0.103
efully-0.099
ment-0.096
Token
Token ID198
Feature activation+2.22e-02

Pos logit contributions
 may+0.046
 seem+0.044
 Win+0.042
 opener+0.042
 ramp+0.039
Neg logit contributions
edly-0.048
efully-0.045
 least-0.045
ful-0.043
 former-0.040
Token
Token ID198
Feature activation+1.70e-02

Pos logit contributions
 may+0.036
 seem+0.036
 Win+0.034
 opener+0.033
'd+0.033
Neg logit contributions
edly-0.026
efully-0.025
 least-0.025
ful-0.024
 each-0.023
TokenL
Token ID43
Feature activation-1.60e-02

Pos logit contributions
 may+0.027
 seem+0.026
 opener+0.025
 Win+0.025
 ramp+0.023
Neg logit contributions
edly-0.023
efully-0.021
 least-0.021
ful-0.020
 former-0.019
Tokenily
Token ID813
Feature activation+5.25e-03

Pos logit contributions
edly+0.021
efully+0.019
 least+0.019
ful+0.018
 former+0.017
Neg logit contributions
 may-0.026
 seem-0.025
 Win-0.024
 opener-0.024
 hop-0.023
Token Bubble
Token ID33691
Feature activation+7.26e-02

Pos logit contributions
 may+0.132
 seem+0.126
 opener+0.118
 Win+0.116
'd+0.111
Neg logit contributions
edly-0.165
 least-0.156
efully-0.156
ful-0.151
 former-0.141
Token and
Token ID290
Feature activation+5.29e-02

Pos logit contributions
 may+0.121
 seem+0.118
 opener+0.112
 Win+0.112
 ramp+0.106
Neg logit contributions
edly-0.091
efully-0.084
 least-0.084
ful-0.079
 former-0.073
Token the
Token ID262
Feature activation+5.85e-02

Pos logit contributions
 may+0.071
 seem+0.068
 opener+0.066
 Win+0.063
 ramp+0.061
Neg logit contributions
edly-0.082
efully-0.079
 least-0.079
ful-0.076
 each-0.073
Token other
Token ID584
Feature activation+1.70e-03

Pos logit contributions
 may+0.096
 seem+0.093
 opener+0.089
 Win+0.088
 ramp+0.083
Neg logit contributions
edly-0.075
 least-0.070
efully-0.070
ful-0.066
 former-0.062
Token was
Token ID373
Feature activation+3.28e-02

Pos logit contributions
 may+0.002
 seem+0.002
 opener+0.002
 Win+0.002
 ramp+0.002
Neg logit contributions
edly-0.003
efully-0.003
 least-0.003
ful-0.003
 former-0.003
Token called
Token ID1444
Feature activation+1.12e-01

Pos logit contributions
 may+0.065
 seem+0.061
 opener+0.058
 Win+0.056
 ramp+0.056
Neg logit contributions
 least-0.031
edly-0.031
ful-0.030
efully-0.029
 each-0.028
Token Sl
Token ID3454
Feature activation+7.70e-02

Pos logit contributions
 may+0.159
 seem+0.145
'd+0.142
 opener+0.134
fters+0.133
Neg logit contributions
ful-0.165
edly-0.165
 each-0.155
 least-0.154
efully-0.154
Tokenap
Token ID499
Feature activation+5.35e-02

Pos logit contributions
 may+0.125
 seem+0.115
 crash+0.108
 Win+0.105
 ramp+0.104
Neg logit contributions
edly-0.104
efully-0.096
ful-0.095
 least-0.095
 former-0.083
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
 may+0.077
 seem+0.074
 Win+0.070
 opener+0.070
 hop+0.066
Neg logit contributions
edly-0.080
 least-0.075
efully-0.075
ful-0.069
 former-0.068
Token Bubble
Token ID33691
Feature activation+3.26e-02

Pos logit contributions
fters+0.021
 seem+0.020
'd+0.020
ves+0.020
 may+0.020
Neg logit contributions
efully-0.016
 each-0.016
edly-0.015
 least-0.015
ful-0.014
Token was
Token ID373
Feature activation+2.46e-02

Pos logit contributions
 may+0.042
 seem+0.040
 opener+0.038
 Win+0.038
 ramp+0.035
Neg logit contributions
edly-0.053
efully-0.050
 least-0.050
ful-0.047
 former-0.045
Token book
Token ID1492
Feature activation-7.59e-03

Pos logit contributions
edly+0.013
efully+0.012
 least+0.012
ful+0.012
 former+0.011
Neg logit contributions
 may-0.007
 seem-0.007
 opener-0.006
 Win-0.006
 hop-0.006
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
efully+0.013
edly+0.013
 least+0.013
ful+0.011
 former+0.011
Neg logit contributions
 may-0.009
 opener-0.009
 seem-0.009
 Win-0.008
 ramp-0.008
Token The
Token ID383
Feature activation+2.52e-02

Pos logit contributions
 may+0.016
 seem+0.014
'd+0.014
ches+0.014
ves+0.014
Neg logit contributions
edly-0.023
efully-0.022
 least-0.022
 each-0.021
ful-0.020
Token book
Token ID1492
Feature activation-9.75e-03

Pos logit contributions
 may+0.029
 seem+0.028
 opener+0.026
 Win+0.026
 hop+0.024
Neg logit contributions
edly-0.045
 least-0.042
efully-0.042
ful-0.041
 former-0.039
Token was
Token ID373
Feature activation+2.07e-02

Pos logit contributions
edly+0.020
efully+0.019
 least+0.019
ful+0.018
 former+0.018
Neg logit contributions
 may-0.009
 seem-0.008
 opener-0.008
 Win-0.007
 hop-0.007
Token called
Token ID1444
Feature activation+1.12e-01

Pos logit contributions
 may+0.032
 seem+0.031
 Win+0.029
 opener+0.029
 ramp+0.028
Neg logit contributions
edly-0.028
 least-0.026
efully-0.026
ful-0.025
 former-0.024
Token a
Token ID257
Feature activation+3.35e-02

Pos logit contributions
 may+0.179
 seem+0.175
 opener+0.164
 Win+0.161
'd+0.160
Neg logit contributions
edly-0.144
 least-0.131
ful-0.130
efully-0.130
 each-0.118
Token dictionary
Token ID22155
Feature activation+1.76e-02

Pos logit contributions
 may+0.042
 seem+0.040
 opener+0.038
 Win+0.037
 hop+0.035
Neg logit contributions
edly-0.056
efully-0.054
 least-0.053
ful-0.051
 former-0.048
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
 may+0.026
 seem+0.026
 Win+0.024
 opener+0.024
 lik+0.023
Neg logit contributions
edly-0.025
 least-0.023
efully-0.023
ful-0.022
 each-0.020
Token It
Token ID632
Feature activation+3.37e-02

Pos logit contributions
 may+0.023
 seem+0.022
'd+0.021
ches+0.020
 ramp+0.019
Neg logit contributions
edly-0.026
efully-0.025
 least-0.025
ful-0.024
 each-0.024
Token had
Token ID550
Feature activation-1.65e-02

Pos logit contributions
 may+0.036
 seem+0.035
 Win+0.033
 opener+0.032
 ramp+0.030
Neg logit contributions
edly-0.061
 least-0.058
efully-0.058
ful-0.055
 each-0.053
Token be
Token ID307
Feature activation+5.29e-03

Pos logit contributions
 seem+0.027
 may+0.025
 Win+0.025
 shoot+0.023
 opener+0.021
Neg logit contributions
edly-0.085
efully-0.079
der-0.073
 least-0.072
ful-0.072
Token my
Token ID616
Feature activation+3.55e-02

Pos logit contributions
 Win+0.006
 shoot+0.006
 hop+0.006
 lend+0.005
 roar+0.005
Neg logit contributions
edly-0.010
der-0.009
 least-0.009
ths-0.008
efully-0.008
Token friends
Token ID2460
Feature activation+2.95e-02

Pos logit contributions
 may+0.053
 Win+0.050
 seem+0.049
 opener+0.046
 hop+0.046
Neg logit contributions
edly-0.053
efully-0.048
ful-0.045
 least-0.043
ment-0.041
Token?"
Token ID1701
Feature activation+3.09e-02

Pos logit contributions
 may+0.031
 seem+0.031
 opener+0.030
 Win+0.029
 shoot+0.028
Neg logit contributions
edly-0.057
efully-0.053
 least-0.049
 former-0.048
ful-0.047
Token Anna
Token ID11735
Feature activation+2.97e-02

Pos logit contributions
 may+0.044
 seem+0.041
 opener+0.040
'd+0.040
 ramp+0.040
Neg logit contributions
edly-0.044
 least-0.043
efully-0.042
 each-0.042
ful-0.040
Token says
Token ID1139
Feature activation+1.12e-01

Pos logit contributions
 may+0.045
 seem+0.042
 opener+0.041
 ramp+0.041
'd+0.040
Neg logit contributions
edly-0.041
efully-0.040
 least-0.038
 each-0.037
ful-0.035
Token.
Token ID13
Feature activation+4.39e-02

Pos logit contributions
 may+0.214
 ramp+0.199
 Win+0.196
 seem+0.195
'd+0.191
Neg logit contributions
edly-0.127
ful-0.117
 each-0.103
efully-0.101
 least-0.092
Token
Token ID198
Feature activation+3.27e-02

Pos logit contributions
 may+0.058
 seem+0.057
 opener+0.054
 Win+0.053
 hop+0.050
Neg logit contributions
edly-0.070
 least-0.066
efully-0.066
ful-0.062
 former-0.060
Token
Token ID198
Feature activation+2.62e-02

Pos logit contributions
 may+0.047
 seem+0.046
 opener+0.043
 Win+0.043
'd+0.041
Neg logit contributions
edly-0.048
efully-0.045
 least-0.045
ful-0.043
 each-0.040
TokenL
Token ID43
Feature activation-8.65e-03

Pos logit contributions
 may+0.033
 seem+0.031
 opener+0.029
 Win+0.029
 hop+0.027
Neg logit contributions
edly-0.044
 least-0.042
efully-0.042
ful-0.040
 former-0.038
Tokenily
Token ID813
Feature activation+7.45e-03

Pos logit contributions
edly+0.011
efully+0.010
 least+0.010
ful+0.010
 each+0.009
Neg logit contributions
 may-0.015
 seem-0.014
 Win-0.013
 opener-0.013
'd-0.013
Token what
Token ID644
Feature activation-1.44e-03

Pos logit contributions
 may+0.001
 might+0.001
 shoot+0.001
 Win+0.001
 seem+0.001
Neg logit contributions
edly-0.003
 former-0.003
efully-0.003
der-0.003
ment-0.002
Token is
Token ID318
Feature activation+2.98e-02

Pos logit contributions
edly+0.003
 least+0.002
efully+0.002
ful+0.002
 each+0.002
Neg logit contributions
 may-0.002
 seem-0.001
 opener-0.001
 Win-0.001
 ramp-0.001
Token inside
Token ID2641
Feature activation+1.64e-02

Pos logit contributions
 may+0.050
 seem+0.049
 opener+0.047
 Win+0.047
 hop+0.044
Neg logit contributions
edly-0.037
efully-0.034
 least-0.033
ful-0.031
 former-0.030
Token?"
Token ID1701
Feature activation+2.69e-02

Pos logit contributions
 may+0.021
 seem+0.020
 Win+0.019
 opener+0.019
ches+0.017
Neg logit contributions
edly-0.029
efully-0.026
 former-0.024
 least-0.024
der-0.023
Token she
Token ID673
Feature activation+3.50e-02

Pos logit contributions
 may+0.040
 seem+0.037
 opener+0.037
'd+0.036
 ramp+0.036
Neg logit contributions
edly-0.038
efully-0.035
 least-0.035
ful-0.034
 each-0.033
Token says
Token ID1139
Feature activation+1.12e-01

Pos logit contributions
 may+0.066
 ramp+0.065
'd+0.063
 opener+0.062
uddy+0.062
Neg logit contributions
efully-0.031
 each-0.029
edly-0.025
 least-0.025
 then-0.024
Token.
Token ID13
Feature activation+3.77e-02

Pos logit contributions
 may+0.213
 ramp+0.195
 Win+0.194
 seem+0.192
'd+0.188
Neg logit contributions
edly-0.132
ful-0.124
 each-0.107
efully-0.104
ment-0.097
Token
Token ID198
Feature activation+2.91e-02

Pos logit contributions
 may+0.056
 seem+0.054
 opener+0.051
 Win+0.051
 ramp+0.049
Neg logit contributions
edly-0.054
efully-0.050
 least-0.049
ful-0.048
 each-0.045
Token
Token ID198
Feature activation+2.39e-02

Pos logit contributions
 may+0.045
 seem+0.043
 Win+0.041
 opener+0.041
'd+0.039
Neg logit contributions
edly-0.039
efully-0.037
 least-0.036
ful-0.035
 each-0.034
Token"
Token ID1
Feature activation-2.74e-02

Pos logit contributions
 may+0.031
 seem+0.030
 opener+0.028
 Win+0.028
'd+0.026
Neg logit contributions
edly-0.039
efully-0.037
 least-0.037
ful-0.035
 each-0.033
TokenYes
Token ID5297
Feature activation+1.25e-02

Pos logit contributions
 least+0.040
 former+0.039
edly+0.036
efully+0.036
ful+0.035
Neg logit contributions
 opener-0.041
 Win-0.040
 seem-0.039
 may-0.038
 lik-0.036
Token play
Token ID711
Feature activation+1.36e-02

Pos logit contributions
 seem+0.027
 may+0.027
 opener+0.024
 Win+0.024
 shoot+0.022
Neg logit contributions
edly-0.070
efully-0.065
 least-0.062
ful-0.061
der-0.059
Token with
Token ID351
Feature activation+1.48e-02

Pos logit contributions
 may+0.017
 seem+0.017
 opener+0.017
 Win+0.016
 shoot+0.016
Neg logit contributions
edly-0.023
efully-0.021
 least-0.021
ful-0.020
 former-0.020
Token me
Token ID502
Feature activation+1.35e-02

Pos logit contributions
 may+0.015
ches+0.013
 seem+0.013
 Win+0.013
ves+0.013
Neg logit contributions
edly-0.031
efully-0.028
 least-0.027
 former-0.027
ment-0.026
Token?"
Token ID1701
Feature activation+2.97e-02

Pos logit contributions
 may+0.014
 opener+0.013
ches+0.013
 seem+0.012
 Win+0.011
Neg logit contributions
edly-0.027
efully-0.026
 former-0.023
 least-0.023
col-0.022
Token grandma
Token ID49890
Feature activation-1.44e-02

Pos logit contributions
 may+0.053
 seem+0.052
 opener+0.050
 Win+0.049
'd+0.047
Neg logit contributions
edly-0.033
 least-0.031
efully-0.031
ful-0.029
 each-0.027
Token says
Token ID1139
Feature activation+1.12e-01

Pos logit contributions
efully+0.017
 least+0.017
edly+0.017
 each+0.016
ful+0.016
Neg logit contributions
 may-0.024
 ramp-0.022
'd-0.022
 seem-0.022
ves-0.022
Token.
Token ID13
Feature activation+3.51e-02

Pos logit contributions
 may+0.199
 seem+0.184
 Win+0.183
 ramp+0.181
 opener+0.176
Neg logit contributions
edly-0.140
ful-0.129
efully-0.116
 each-0.112
 least-0.112
Token
Token ID198
Feature activation+2.53e-02

Pos logit contributions
 may+0.039
 seem+0.038
 opener+0.037
 Win+0.036
 hop+0.033
Neg logit contributions
 least-0.063
edly-0.062
efully-0.060
 former-0.058
ful-0.057
Token
Token ID198
Feature activation+1.86e-02

Pos logit contributions
 may+0.033
 seem+0.031
 opener+0.030
 Win+0.030
 hop+0.028
Neg logit contributions
edly-0.042
 least-0.039
efully-0.039
ful-0.037
 former-0.036
TokenS
Token ID50
Feature activation-1.45e-02

Pos logit contributions
 may+0.037
 seem+0.036
 Win+0.035
 opener+0.035
 hop+0.033
Neg logit contributions
edly-0.017
 least-0.016
efully-0.015
ful-0.014
 former-0.014
Tokenara
Token ID3301
Feature activation+1.07e-02

Pos logit contributions
edly+0.019
efully+0.018
 least+0.018
ful+0.017
 each+0.016
Neg logit contributions
 may-0.023
 seem-0.023
 opener-0.022
 Win-0.022
 shoot-0.021
Token want
Token ID765
Feature activation-3.55e-02

Pos logit contributions
 may+0.019
 seem+0.018
 opener+0.017
 Win+0.017
 ramp+0.015
Neg logit contributions
edly-0.030
 least-0.028
efully-0.028
ful-0.027
 former-0.026
Token to
Token ID284
Feature activation+3.15e-02

Pos logit contributions
edly+0.080
efully+0.072
 former+0.066
ment+0.063
amb+0.061
Neg logit contributions
 opener-0.037
 may-0.035
 shoot-0.034
 seem-0.032
 might-0.031
Token try
Token ID1949
Feature activation-6.79e-03

Pos logit contributions
 seem+0.021
 may+0.020
 shoot+0.020
 opener+0.018
 Win+0.017
Neg logit contributions
edly-0.082
efully-0.074
ment-0.069
years-0.068
ful-0.068
Token?"
Token ID1701
Feature activation+3.12e-02

Pos logit contributions
edly+0.013
efully+0.012
 least+0.010
 former+0.010
ment+0.010
Neg logit contributions
 may-0.008
 opener-0.008
 seem-0.008
 shoot-0.008
 Win-0.008
Token Mom
Token ID11254
Feature activation-1.45e-02

Pos logit contributions
 seem+0.046
 may+0.045
 Win+0.043
 opener+0.042
 lik+0.041
Neg logit contributions
edly-0.047
efully-0.043
 least-0.043
ful-0.040
 former-0.040
Token says
Token ID1139
Feature activation+1.12e-01

Pos logit contributions
efully+0.018
 least+0.017
edly+0.016
 each+0.016
ful+0.016
Neg logit contributions
 may-0.024
 seem-0.022
ves-0.022
 Win-0.022
'd-0.022
Token.
Token ID13
Feature activation+3.90e-02

Pos logit contributions
 may+0.201
 Win+0.186
 seem+0.185
 ramp+0.182
'd+0.180
Neg logit contributions
edly-0.137
ful-0.128
 each-0.112
efully-0.111
 least-0.110
Token
Token ID198
Feature activation+3.21e-02

Pos logit contributions
 may+0.041
 seem+0.041
 opener+0.039
 Win+0.039
 lik+0.034
Neg logit contributions
edly-0.073
 least-0.072
efully-0.070
 former-0.069
ful-0.065
Token
Token ID198
Feature activation+2.47e-02

Pos logit contributions
 may+0.047
 seem+0.045
 opener+0.042
 Win+0.042
'd+0.040
Neg logit contributions
edly-0.047
efully-0.044
 least-0.044
ful-0.042
 each-0.039
TokenTom
Token ID13787
Feature activation+7.23e-03

Pos logit contributions
 may+0.033
 seem+0.031
 Win+0.030
 opener+0.030
 hop+0.028
Neg logit contributions
edly-0.040
 least-0.037
efully-0.037
ful-0.035
 former-0.034
Token and
Token ID290
Feature activation+4.61e-02

Pos logit contributions
 may+0.012
 seem+0.012
 opener+0.011
 Win+0.011
 ramp+0.011
Neg logit contributions
edly-0.009
efully-0.008
 least-0.008
ful-0.008
 former-0.007
Token you
Token ID345
Feature activation+9.17e-03

Pos logit contributions
edly+0.004
 least+0.004
ful+0.004
efully+0.004
 each+0.004
Neg logit contributions
 may-0.005
 seem-0.005
 opener-0.005
 ramp-0.005
 lik-0.005
Token hear
Token ID3285
Feature activation+1.12e-02

Pos logit contributions
 may+0.010
 seem+0.010
 opener+0.009
 Win+0.009
 hop+0.009
Neg logit contributions
edly-0.017
efully-0.016
 least-0.016
ful-0.015
 former-0.015
Token me
Token ID502
Feature activation+2.21e-02

Pos logit contributions
 shoot+0.014
 Win+0.014
 may+0.014
 seem+0.013
 hop+0.012
Neg logit contributions
edly-0.021
 former-0.018
efully-0.018
 least-0.017
der-0.016
Token?"
Token ID1701
Feature activation+2.26e-02

Pos logit contributions
 seem+0.029
 may+0.028
 Win+0.027
 opener+0.026
 hop+0.026
Neg logit contributions
edly-0.038
efully-0.034
 least-0.033
 former-0.031
ful-0.031
Token Anna
Token ID11735
Feature activation+2.67e-02

Pos logit contributions
 may+0.030
 opener+0.028
 seem+0.028
 ramp+0.027
'd+0.027
Neg logit contributions
edly-0.035
efully-0.034
 least-0.032
 each-0.032
ful-0.031
Token says
Token ID1139
Feature activation+1.11e-01

Pos logit contributions
 may+0.041
 seem+0.037
 opener+0.036
 ramp+0.036
 Win+0.036
Neg logit contributions
edly-0.038
efully-0.037
 least-0.035
ful-0.034
 each-0.033
Token into
Token ID656
Feature activation+2.83e-02

Pos logit contributions
 may+0.212
 ramp+0.191
 seem+0.189
 Win+0.188
'd+0.185
Neg logit contributions
edly-0.133
ful-0.121
efully-0.109
 each-0.109
 least-0.098
Token the
Token ID262
Feature activation+1.81e-02

Pos logit contributions
 may+0.051
 seem+0.049
 opener+0.047
 Win+0.047
'd+0.046
Neg logit contributions
edly-0.032
efully-0.029
 least-0.029
ful-0.027
 each-0.025
Token microphone
Token ID21822
Feature activation+2.06e-03

Pos logit contributions
 may+0.026
 seem+0.025
 Win+0.024
 hop+0.023
 opener+0.022
Neg logit contributions
edly-0.028
efully-0.026
 least-0.026
ful-0.026
 former-0.024
Token.
Token ID13
Feature activation+2.99e-02

Pos logit contributions
 may+0.003
 seem+0.003
 opener+0.003
 Win+0.003
 hop+0.002
Neg logit contributions
edly-0.003
 least-0.003
efully-0.003
ful-0.003
 former-0.003
Token Her
Token ID2332
Feature activation+2.41e-02

Pos logit contributions
 may+0.044
 seem+0.041
 ramp+0.040
'd+0.040
fters+0.039
Neg logit contributions
edly-0.043
efully-0.039
 each-0.038
ful-0.037
 least-0.036
Token not
Token ID407
Feature activation+1.51e-03

Pos logit contributions
 may+0.049
 seem+0.046
 opener+0.045
 Win+0.045
 ramp+0.043
Neg logit contributions
edly-0.047
 least-0.044
efully-0.043
ful-0.042
 each-0.040
Token know
Token ID760
Feature activation-7.50e-03

Pos logit contributions
 seem+0.002
 may+0.002
 opener+0.002
 Win+0.002
 hop+0.001
Neg logit contributions
edly-0.003
efully-0.003
 least-0.003
ful-0.002
 former-0.002
Token what
Token ID644
Feature activation+7.32e-03

Pos logit contributions
edly+0.009
efully+0.008
 least+0.008
 former+0.007
ful+0.007
Neg logit contributions
 may-0.013
 seem-0.013
 opener-0.012
 Win-0.012
 hop-0.011
Token it
Token ID340
Feature activation+1.86e-02

Pos logit contributions
 may+0.010
 seem+0.009
 opener+0.009
 Win+0.009
 hop+0.008
Neg logit contributions
edly-0.012
efully-0.011
 least-0.011
ful-0.011
 former-0.010
Token was
Token ID373
Feature activation+1.76e-02

Pos logit contributions
 may+0.021
 ramp+0.021
 Win+0.021
 opener+0.021
 seem+0.020
Neg logit contributions
edly-0.029
 least-0.029
efully-0.027
ful-0.027
 each-0.027
Token called
Token ID1444
Feature activation+1.11e-01

Pos logit contributions
 may+0.023
 seem+0.023
 opener+0.021
 Win+0.021
'd+0.020
Neg logit contributions
edly-0.029
efully-0.028
 least-0.026
ful-0.025
 former-0.025
Token.
Token ID13
Feature activation+4.88e-03

Pos logit contributions
 may+0.164
 seem+0.159
 opener+0.152
 Win+0.149
 ramp+0.144
Neg logit contributions
edly-0.158
 least-0.147
efully-0.146
ful-0.142
 former-0.132
Token It
Token ID632
Feature activation+2.97e-02

Pos logit contributions
 may+0.007
 seem+0.007
 ramp+0.007
 Win+0.006
 opener+0.006
Neg logit contributions
edly-0.007
efully-0.007
 least-0.006
ful-0.006
 each-0.006
Token was
Token ID373
Feature activation+1.94e-02

Pos logit contributions
 may+0.035
 seem+0.034
 opener+0.033
 Win+0.033
 ramp+0.031
Neg logit contributions
edly-0.049
efully-0.047
 least-0.047
ful-0.045
 each-0.044
Token a
Token ID257
Feature activation+3.56e-02

Pos logit contributions
 may+0.035
 seem+0.034
 opener+0.033
 Win+0.033
 hop+0.032
Neg logit contributions
edly-0.022
efully-0.020
 least-0.020
ful-0.018
 former-0.017
Token big
Token ID1263
Feature activation+2.60e-02

Pos logit contributions
 may+0.048
 seem+0.044
 opener+0.043
 Win+0.042
 hop+0.042
Neg logit contributions
edly-0.059
efully-0.056
 least-0.053
ful-0.051
 former-0.048
Token be
Token ID307
Feature activation+1.82e-03

Pos logit contributions
 may+0.034
 seem+0.031
 opener+0.029
 Win+0.029
 ramp+0.027
Neg logit contributions
edly-0.059
 least-0.058
efully-0.057
ful-0.055
 each-0.054
Token cow
Token ID9875
Feature activation+5.80e-02

Pos logit contributions
 Win+0.003
 may+0.003
 seem+0.003
 opener+0.003
 hop+0.003
Neg logit contributions
edly-0.003
efully-0.002
 least-0.002
der-0.002
ful-0.002
Tokenboys
Token ID13202
Feature activation+6.10e-02

Pos logit contributions
 may+0.073
 seem+0.071
 opener+0.067
 Win+0.065
 lik+0.061
Neg logit contributions
edly-0.095
 least-0.091
efully-0.090
ful-0.087
 each-0.084
Token and
Token ID290
Feature activation+2.73e-02

Pos logit contributions
 may+0.103
 seem+0.100
 opener+0.094
 Win+0.093
 lik+0.090
Neg logit contributions
edly-0.076
 least-0.069
efully-0.069
ful-0.067
 each-0.063
Token cow
Token ID9875
Feature activation+4.65e-02

Pos logit contributions
 may+0.025
 Win+0.023
 hop+0.023
 opener+0.023
 seem+0.022
Neg logit contributions
edly-0.059
efully-0.055
 least-0.052
ful-0.050
 former-0.050
Tokengirls
Token ID36960
Feature activation+1.11e-01

Pos logit contributions
 may+0.052
 seem+0.051
 opener+0.047
 Win+0.046
 lik+0.043
Neg logit contributions
edly-0.083
 least-0.079
efully-0.078
ful-0.076
 each-0.073
Token.
Token ID13
Feature activation+1.64e-02

Pos logit contributions
 may+0.172
 seem+0.167
 opener+0.159
 Win+0.158
 lik+0.151
Neg logit contributions
edly-0.150
 least-0.140
efully-0.140
ful-0.134
 each-0.129
Token They
Token ID1119
Feature activation+3.22e-02

Pos logit contributions
 may+0.023
 seem+0.022
 opener+0.021
 Win+0.020
'd+0.020
Neg logit contributions
edly-0.025
efully-0.024
 least-0.023
ful-0.022
 each-0.021
Token put
Token ID1234
Feature activation+1.88e-02

Pos logit contributions
ologists+0.034
ves+0.034
 opener+0.032
AZ+0.032
 may+0.032
Neg logit contributions
 least-0.056
 each-0.054
efully-0.049
ful-0.046
 then-0.046
Token on
Token ID319
Feature activation+1.49e-02

Pos logit contributions
 may+0.025
 Win+0.023
 seem+0.022
 hop+0.022
 opener+0.022
Neg logit contributions
edly-0.031
 least-0.030
 former-0.029
efully-0.028
der-0.028
Token hats
Token ID23910
Feature activation+4.51e-02

Pos logit contributions
 may+0.024
 seem+0.023
 Win+0.022
 opener+0.022
 hop+0.021
Neg logit contributions
edly-0.020
 least-0.019
efully-0.018
ful-0.017
 former-0.016
Token picture
Token ID4286
Feature activation-1.21e-02

Pos logit contributions
edly+0.008
efully+0.007
 least+0.007
ful+0.007
 former+0.006
Neg logit contributions
 may-0.005
 seem-0.005
 opener-0.005
 Win-0.005
 ramp-0.005
Token with
Token ID351
Feature activation+4.10e-02

Pos logit contributions
edly+0.023
efully+0.022
 former+0.021
 least+0.020
years+0.019
Neg logit contributions
 opener-0.014
 may-0.013
 seem-0.013
 Win-0.012
 shoot-0.012
Token me
Token ID502
Feature activation+2.46e-02

Pos logit contributions
 may+0.049
 Win+0.042
 shoot+0.042
 might+0.041
 Rocky+0.039
Neg logit contributions
edly-0.085
efully-0.073
 former-0.068
ment-0.067
amb-0.065
Token?"
Token ID1701
Feature activation+3.18e-02

Pos logit contributions
 may+0.026
 opener+0.023
 seem+0.022
ches+0.021
 Win+0.021
Neg logit contributions
edly-0.050
efully-0.048
 former-0.044
 least-0.043
ths-0.041
Token Ben
Token ID3932
Feature activation+4.08e-02

Pos logit contributions
 may+0.042
 seem+0.039
 opener+0.037
 Win+0.036
 ramp+0.036
Neg logit contributions
edly-0.051
 least-0.049
efully-0.048
ful-0.046
 each-0.045
Token says
Token ID1139
Feature activation+1.11e-01

Pos logit contributions
 may+0.059
 seem+0.056
 opener+0.054
 Win+0.053
 ramp+0.052
Neg logit contributions
edly-0.059
efully-0.058
 least-0.056
 each-0.052
ful-0.051
Token.
Token ID13
Feature activation+3.87e-02

Pos logit contributions
 may+0.195
 seem+0.180
 Win+0.178
 ramp+0.175
'd+0.172
Neg logit contributions
edly-0.141
ful-0.128
efully-0.118
 each-0.113
 least-0.113
Token
Token ID198
Feature activation+3.02e-02

Pos logit contributions
 may+0.055
 seem+0.053
 opener+0.049
 Win+0.049
 ramp+0.046
Neg logit contributions
edly-0.059
efully-0.055
 least-0.055
ful-0.052
 former-0.049
Token
Token ID198
Feature activation+2.37e-02

Pos logit contributions
 may+0.049
 seem+0.048
 Win+0.045
 opener+0.044
'd+0.044
Neg logit contributions
edly-0.036
efully-0.035
 least-0.035
ful-0.034
 each-0.032
TokenM
Token ID44
Feature activation-2.22e-02

Pos logit contributions
 may+0.036
 seem+0.036
 Win+0.033
 opener+0.033
 lik+0.031
Neg logit contributions
edly-0.033
efully-0.031
 least-0.031
ful-0.030
 each-0.027
Tokenia
Token ID544
Feature activation+9.34e-03

Pos logit contributions
edly+0.038
 least+0.036
efully+0.035
ful+0.034
 former+0.032
Neg logit contributions
 may-0.027
 seem-0.026
 Win-0.025
 opener-0.024
 ramp-0.023
Token big
Token ID1263
Feature activation+1.94e-02

Pos logit contributions
 may+0.020
 seem+0.019
 opener+0.018
 Win+0.018
 hop+0.017
Neg logit contributions
edly-0.020
efully-0.018
 least-0.018
ful-0.017
 former-0.016
Token place
Token ID1295
Feature activation+2.90e-02

Pos logit contributions
 may+0.027
 seem+0.025
 Win+0.024
 opener+0.024
 ramp+0.023
Neg logit contributions
edly-0.032
efully-0.029
 least-0.028
ful-0.027
 former-0.026
Token.
Token ID13
Feature activation+7.29e-03

Pos logit contributions
 may+0.039
 seem+0.039
 opener+0.038
 Win+0.037
 ramp+0.034
Neg logit contributions
edly-0.047
efully-0.045
 least-0.041
 former-0.040
der-0.039
Token It
Token ID632
Feature activation+3.54e-02

Pos logit contributions
 may+0.009
 seem+0.008
 opener+0.008
 Win+0.008
'd+0.007
Neg logit contributions
edly-0.013
 least-0.012
efully-0.012
ful-0.011
 former-0.011
Token is
Token ID318
Feature activation+2.44e-02

Pos logit contributions
 opener+0.046
 Win+0.044
ologists+0.043
fters+0.043
 lik+0.043
Neg logit contributions
 least-0.053
efully-0.048
 each-0.047
 now-0.045
edly-0.044
Token called
Token ID1444
Feature activation+1.11e-01

Pos logit contributions
 may+0.043
 seem+0.042
 opener+0.041
 Win+0.040
 hop+0.038
Neg logit contributions
edly-0.028
 least-0.026
efully-0.026
ful-0.024
 former-0.022
Token a
Token ID257
Feature activation+2.99e-02

Pos logit contributions
 may+0.191
 seem+0.186
 Win+0.180
 opener+0.177
 hop+0.171
Neg logit contributions
edly-0.136
efully-0.127
 least-0.126
ful-0.113
 former-0.110
Token university
Token ID6403
Feature activation-3.74e-03

Pos logit contributions
 may+0.041
 seem+0.039
 opener+0.037
 Win+0.037
 hop+0.034
Neg logit contributions
edly-0.047
efully-0.044
 least-0.044
ful-0.042
 former-0.040
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
edly+0.006
efully+0.006
 least+0.006
 former+0.006
ful+0.006
Neg logit contributions
 may-0.005
 seem-0.005
 Win-0.004
 opener-0.004
'd-0.004
Token There
Token ID1318
Feature activation+2.88e-02

Pos logit contributions
 may+0.012
 seem+0.011
 opener+0.010
 Win+0.010
'd+0.010
Neg logit contributions
edly-0.018
efully-0.017
 least-0.017
ful-0.016
 former-0.016
Token are
Token ID389
Feature activation-2.34e-02

Pos logit contributions
 opener+0.033
 lik+0.030
ologists+0.030
 Win+0.030
 ramp+0.030
Neg logit contributions
 least-0.046
 each-0.044
ful-0.043
efully-0.043
 ourselves-0.041
Token They
Token ID1119
Feature activation+2.93e-02

Pos logit contributions
 may+0.007
 seem+0.007
 Win+0.007
 opener+0.007
 hop+0.006
Neg logit contributions
edly-0.010
 least-0.010
efully-0.010
ful-0.009
 former-0.009
Token learned
Token ID4499
Feature activation-4.21e-03

Pos logit contributions
 may+0.034
 seem+0.032
 opener+0.030
 Win+0.029
'd+0.028
Neg logit contributions
edly-0.053
efully-0.050
 least-0.049
ful-0.047
 former-0.046
Token that
Token ID326
Feature activation+1.48e-02

Pos logit contributions
years+0.008
efully+0.008
edly+0.007
 least+0.007
umm+0.007
Neg logit contributions
 may-0.005
 seem-0.004
ves-0.004
 shoot-0.004
 might-0.004
Token this
Token ID428
Feature activation-1.89e-03

Pos logit contributions
 may+0.020
 seem+0.020
 opener+0.018
 Win+0.018
 hop+0.017
Neg logit contributions
edly-0.024
efully-0.022
 least-0.021
ful-0.020
 former-0.020
Token was
Token ID373
Feature activation+7.87e-03

Pos logit contributions
edly+0.004
efully+0.004
ful+0.004
 least+0.004
years+0.003
Neg logit contributions
 may-0.002
 seem-0.002
 opener-0.001
'd-0.001
 might-0.001
Token called
Token ID1444
Feature activation+1.11e-01

Pos logit contributions
 may+0.011
 seem+0.011
 Win+0.010
 opener+0.010
 hop+0.010
Neg logit contributions
edly-0.012
efully-0.011
 least-0.010
ful-0.010
 former-0.010
Token the
Token ID262
Feature activation+4.81e-02

Pos logit contributions
 may+0.173
 seem+0.168
 Win+0.162
 opener+0.160
 hop+0.153
Neg logit contributions
edly-0.154
efully-0.146
 least-0.143
ful-0.129
 former-0.127
Token tide
Token ID20013
Feature activation+3.99e-02

Pos logit contributions
 may+0.072
 seem+0.068
 Win+0.065
 opener+0.064
 hop+0.063
Neg logit contributions
edly-0.072
efully-0.067
ful-0.064
 least-0.061
years-0.057
Token.
Token ID13
Feature activation+8.58e-03

Pos logit contributions
 may+0.051
 seem+0.049
 opener+0.047
 Win+0.046
 lik+0.043
Neg logit contributions
edly-0.065
 least-0.061
efully-0.060
ful-0.059
 former-0.055
Token
Token ID198
Feature activation-9.38e-03

Pos logit contributions
 may+0.010
 seem+0.010
 Win+0.009
 opener+0.009
 hop+0.008
Neg logit contributions
edly-0.015
 least-0.014
efully-0.014
ful-0.014
 former-0.013
Token
Token ID198
Feature activation-1.02e-02

Pos logit contributions
edly+0.011
ful+0.010
efully+0.010
 least+0.010
 each+0.010
Neg logit contributions
 may-0.016
 seem-0.015
 Win-0.014
 opener-0.014
 ramp-0.014
Token useful
Token ID4465
Feature activation+1.10e-02

Pos logit contributions
 may+0.050
 seem+0.045
 might+0.040
 hop+0.036
 Win+0.036
Neg logit contributions
edly-0.085
efully-0.079
view-0.074
 least-0.074
amed-0.071
Token.
Token ID13
Feature activation+4.60e-03

Pos logit contributions
 may+0.014
 seem+0.013
 opener+0.012
 Win+0.011
 might+0.011
Neg logit contributions
edly-0.019
efully-0.019
 least-0.018
 former-0.017
years-0.016
Token This
Token ID770
Feature activation-1.39e-02

Pos logit contributions
 may+0.005
 seem+0.005
 Win+0.004
 opener+0.004
 hop+0.004
Neg logit contributions
edly-0.009
 least-0.008
efully-0.008
ful-0.008
 former-0.007
Token one
Token ID530
Feature activation-3.67e-02

Pos logit contributions
edly+0.026
 least+0.025
efully+0.025
ful+0.024
 former+0.023
Neg logit contributions
 may-0.014
 seem-0.014
 opener-0.013
 Win-0.013
 lik-0.012
Token is
Token ID318
Feature activation+3.27e-02

Pos logit contributions
edly+0.084
efully+0.078
 former+0.075
ful+0.072
 least+0.072
Neg logit contributions
 may-0.035
 seem-0.028
 might-0.026
 hop-0.024
'd-0.023
Token called
Token ID1444
Feature activation+1.11e-01

Pos logit contributions
 may+0.049
 seem+0.045
 opener+0.043
 Win+0.043
 might+0.041
Neg logit contributions
edly-0.054
efully-0.049
 least-0.044
view-0.043
ful-0.041
Token quartz
Token ID47969
Feature activation+3.00e-02

Pos logit contributions
 may+0.160
 seem+0.150
 Win+0.146
 opener+0.144
 hop+0.135
Neg logit contributions
edly-0.176
efully-0.167
 least-0.161
 former-0.147
ful-0.144
Token,
Token ID11
Feature activation+3.96e-02

Pos logit contributions
 may+0.037
 seem+0.033
 opener+0.032
 might+0.029
'd+0.029
Neg logit contributions
edly-0.053
efully-0.051
 least-0.050
 former-0.048
ful-0.047
Token and
Token ID290
Feature activation+2.30e-02

Pos logit contributions
 may+0.056
 seem+0.054
 opener+0.050
 Win+0.050
 hop+0.048
Neg logit contributions
edly-0.062
efully-0.061
 least-0.059
ful-0.054
 former-0.052
Token it
Token ID340
Feature activation+2.64e-02

Pos logit contributions
 may+0.029
 seem+0.025
 might+0.022
 hop+0.022
 shoot+0.022
Neg logit contributions
edly-0.045
efully-0.042
view-0.041
 least-0.039
amed-0.039
Token is
Token ID318
Feature activation+2.75e-02

Pos logit contributions
 may+0.024
 seem+0.023
 opener+0.021
 Win+0.021
 lik+0.019
Neg logit contributions
edly-0.052
 least-0.050
efully-0.050
ful-0.048
 former-0.046
Token the
Token ID262
Feature activation-2.02e-02

Pos logit contributions
edly+0.054
efully+0.048
 least+0.046
 former+0.046
ful+0.045
Neg logit contributions
 may-0.030
 seem-0.028
 Win-0.028
 might-0.024
 ramp-0.024
Token tree
Token ID5509
Feature activation-1.90e-02

Pos logit contributions
edly+0.038
efully+0.036
ful+0.036
 least+0.034
ment+0.031
Neg logit contributions
 may-0.024
 seem-0.022
 hop-0.020
 might-0.020
 ramp-0.020
Token to
Token ID284
Feature activation+7.52e-03

Pos logit contributions
edly+0.030
efully+0.030
 least+0.028
ful+0.026
 former+0.025
Neg logit contributions
 may-0.025
 seem-0.025
 opener-0.023
 hop-0.022
 Win-0.022
Token its
Token ID663
Feature activation-1.45e-02

Pos logit contributions
 seem+0.008
 may+0.007
 shoot+0.007
 opener+0.007
 solve+0.007
Neg logit contributions
edly-0.016
efully-0.015
elt-0.014
der-0.013
aw-0.013
Token once
Token ID1752
Feature activation-1.50e-02

Pos logit contributions
edly+0.022
ful+0.021
efully+0.021
elt+0.019
years+0.019
Neg logit contributions
 may-0.020
 hop-0.020
 might-0.019
 solve-0.018
 roar-0.018
Token former
Token ID1966
Feature activation-1.59e-01

Pos logit contributions
edly+0.024
efully+0.024
 least+0.022
 yourselves+0.021
years+0.021
Neg logit contributions
 may-0.020
 seem-0.019
 hop-0.018
'd-0.018
 opener-0.017
Token beauty
Token ID8737
Feature activation-1.60e-02

Pos logit contributions
ful+0.223
 least+0.219
edly+0.213
efully+0.207
elt+0.192
Neg logit contributions
 may-0.243
 seem-0.233
 might-0.213
ches-0.202
ves-0.196
Token.
Token ID13
Feature activation-3.19e-02

Pos logit contributions
edly+0.026
efully+0.026
 least+0.026
 former+0.024
ful+0.023
Neg logit contributions
 may-0.021
 seem-0.020
 opener-0.018
 Win-0.017
'd-0.017
Token He
Token ID679
Feature activation-2.69e-02

Pos logit contributions
edly+0.068
 least+0.063
efully+0.063
ful+0.061
 former+0.059
Neg logit contributions
 may-0.028
 seem-0.026
 Win-0.026
 opener-0.023
 lik-0.020
Token was
Token ID373
Feature activation-1.63e-02

Pos logit contributions
edly+0.056
 yourselves+0.052
efully+0.051
ful+0.051
 least+0.050
Neg logit contributions
 may-0.027
'd-0.024
 hop-0.021
 might-0.020
 seem-0.020
Token happy
Token ID3772
Feature activation-2.17e-02

Pos logit contributions
edly+0.022
efully+0.019
der+0.019
 least+0.018
 former+0.017
Neg logit contributions
 seem-0.025
 may-0.025
'd-0.024
 hop-0.023
 Win-0.023
Token the
Token ID262
Feature activation-1.42e-02

Pos logit contributions
edly+0.036
efully+0.033
elt+0.031
years+0.031
view+0.030
Neg logit contributions
 may-0.014
 seem-0.012
 might-0.011
ches-0.010
 shoot-0.010
Token village
Token ID7404
Feature activation-3.85e-02

Pos logit contributions
edly+0.024
efully+0.023
ful+0.022
 least+0.021
der+0.019
Neg logit contributions
 may-0.019
 seem-0.018
 Win-0.016
 hop-0.016
 might-0.016
Token back
Token ID736
Feature activation-8.31e-02

Pos logit contributions
edly+0.065
efully+0.061
 least+0.057
years+0.053
ful+0.053
Neg logit contributions
 may-0.052
 seem-0.052
 might-0.043
 opener-0.043
 Win-0.043
Token to
Token ID284
Feature activation+1.29e-02

Pos logit contributions
edly+0.144
efully+0.132
years+0.123
 least+0.119
ths+0.119
Neg logit contributions
 may-0.106
 seem-0.098
 might-0.095
 hop-0.089
 Win-0.089
Token its
Token ID663
Feature activation-1.50e-02

Pos logit contributions
 seem+0.016
 may+0.016
 solve+0.014
 shoot+0.014
 hop+0.013
Neg logit contributions
edly-0.026
efully-0.024
elt-0.023
amb-0.021
amed-0.021
Token former
Token ID1966
Feature activation-1.52e-01

Pos logit contributions
edly+0.021
ful+0.020
efully+0.020
 least+0.018
elt+0.018
Neg logit contributions
 may-0.023
 hop-0.021
 seem-0.021
 might-0.020
 solve-0.019
Token glory
Token ID13476
Feature activation-2.60e-02

Pos logit contributions
ful+0.212
 least+0.210
edly+0.199
efully+0.196
elt+0.192
Neg logit contributions
 may-0.230
 seem-0.215
 might-0.197
ches-0.190
 opener-0.190
Token!"
Token ID2474
Feature activation-2.04e-02

Pos logit contributions
edly+0.044
efully+0.043
 least+0.042
years+0.038
 former+0.037
Neg logit contributions
 may-0.033
 seem-0.031
 opener-0.029
'd-0.027
 Win-0.026
Token
Token ID220
Feature activation-3.48e-03

Pos logit contributions
edly+0.042
efully+0.039
ful+0.039
 former+0.039
 least+0.039
Neg logit contributions
 may-0.018
 seem-0.017
 Win-0.016
 opener-0.016
 ramp-0.014
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
edly+0.005
 least+0.005
efully+0.005
ful+0.005
 former+0.004
Neg logit contributions
 may-0.005
 seem-0.005
 opener-0.004
 Win-0.004
'd-0.004
Token
Token ID198
Feature activation-1.49e-02

Pos logit contributions
edly+0.020
 least+0.019
efully+0.018
ful+0.017
 former+0.017
Neg logit contributions
 may-0.012
 opener-0.011
 seem-0.011
 Win-0.011
 hop-0.010
Token back
Token ID736
Feature activation-8.31e-02

Pos logit contributions
edly+0.012
efully+0.010
der+0.010
 least+0.009
ful+0.009
Neg logit contributions
 may-0.009
 seem-0.008
 Win-0.008
 might-0.007
 hop-0.007
Token the
Token ID262
Feature activation-8.64e-03

Pos logit contributions
edly+0.163
der+0.156
efully+0.140
 least+0.136
ths+0.132
Neg logit contributions
 hop-0.079
 may-0.079
 Win-0.078
 solve-0.075
 seem-0.073
Token village
Token ID7404
Feature activation-2.81e-02

Pos logit contributions
edly+0.013
efully+0.013
ful+0.012
 least+0.012
der+0.011
Neg logit contributions
 may-0.013
 seem-0.012
 Win-0.011
 opener-0.011
 hop-0.011
Token to
Token ID284
Feature activation+1.49e-02

Pos logit contributions
efully+0.043
edly+0.043
 least+0.041
der+0.039
 yourselves+0.037
Neg logit contributions
 seem-0.038
 may-0.037
 hop-0.033
 opener-0.033
'd-0.032
Token its
Token ID663
Feature activation-1.09e-02

Pos logit contributions
 seem+0.017
 may+0.016
 solve+0.015
 Win+0.014
 shoot+0.014
Neg logit contributions
edly-0.031
efully-0.029
elt-0.026
amed-0.025
der-0.025
Token former
Token ID1966
Feature activation-1.48e-01

Pos logit contributions
edly+0.016
ful+0.016
efully+0.016
elt+0.015
cream+0.014
Neg logit contributions
 may-0.015
 hop-0.015
 might-0.014
 solve-0.014
 seem-0.013
Token state
Token ID1181
Feature activation-6.01e-02

Pos logit contributions
ful+0.208
 least+0.204
edly+0.197
efully+0.191
elt+0.183
Neg logit contributions
 may-0.225
 seem-0.208
 might-0.190
 opener-0.186
 hop-0.182
Token.
Token ID13
Feature activation-3.26e-02

Pos logit contributions
efully+0.105
edly+0.102
 least+0.100
rous+0.091
 former+0.090
Neg logit contributions
 may-0.071
 seem-0.067
 opener-0.064
'd-0.061
 Win-0.058
Token With
Token ID2080
Feature activation-1.67e-02

Pos logit contributions
edly+0.063
efully+0.058
 least+0.056
ful+0.055
 former+0.054
Neg logit contributions
 may-0.035
 seem-0.034
 Win-0.032
 opener-0.031
 lik-0.028
Token the
Token ID262
Feature activation-9.19e-04

Pos logit contributions
edly+0.030
efully+0.026
 yourselves+0.025
der+0.025
 former+0.025
Neg logit contributions
 may-0.022
 hop-0.020
 seem-0.019
 Win-0.018
 shoot-0.018
Token help
Token ID1037
Feature activation+1.82e-02

Pos logit contributions
edly+0.002
efully+0.001
ful+0.001
 least+0.001
 former+0.001
Neg logit contributions
 may-0.001
 seem-0.001
 opener-0.001
 Win-0.001
 hop-0.001
Token to
Token ID284
Feature activation+1.02e-02

Pos logit contributions
edly+0.046
efully+0.046
 former+0.042
 least+0.040
der+0.039
Neg logit contributions
 may-0.034
 seem-0.031
 might-0.030
 opener-0.030
 Win-0.029
Token be
Token ID307
Feature activation-7.26e-03

Pos logit contributions
 seem+0.009
 may+0.008
 opener+0.008
 Win+0.007
 shoot+0.007
Neg logit contributions
edly-0.023
efully-0.021
 least-0.020
ful-0.020
der-0.019
Token restored
Token ID15032
Feature activation-7.52e-03

Pos logit contributions
edly+0.013
efully+0.012
 least+0.011
ful+0.011
der+0.011
Neg logit contributions
 may-0.009
 seem-0.009
 opener-0.008
 shoot-0.008
 might-0.008
Token to
Token ID284
Feature activation+1.90e-02

Pos logit contributions
edly+0.014
efully+0.013
 least+0.013
years+0.012
ths+0.012
Neg logit contributions
 may-0.009
 seem-0.008
 opener-0.008
 might-0.007
 Win-0.007
Token its
Token ID663
Feature activation-5.43e-03

Pos logit contributions
 seem+0.022
 may+0.019
 solve+0.018
 shoot+0.018
 Win+0.017
Neg logit contributions
edly-0.040
efully-0.038
elt-0.033
amb-0.033
amed-0.033
Token former
Token ID1966
Feature activation-1.47e-01

Pos logit contributions
edly+0.008
ful+0.008
efully+0.008
elt+0.007
cream+0.007
Neg logit contributions
 may-0.008
 might-0.007
 hop-0.007
 seem-0.007
 solve-0.007
Token glory
Token ID13476
Feature activation-1.52e-02

Pos logit contributions
ful+0.213
 least+0.203
edly+0.201
efully+0.198
elt+0.185
Neg logit contributions
 may-0.223
 seem-0.214
 might-0.192
 opener-0.189
 ramp-0.186
Token."
Token ID526
Feature activation-8.60e-03

Pos logit contributions
edly+0.027
efully+0.027
 least+0.026
 former+0.023
ths+0.023
Neg logit contributions
 may-0.018
 seem-0.017
 opener-0.017
 might-0.015
'd-0.015
Token
Token ID220
Feature activation+4.99e-05

Pos logit contributions
edly+0.017
 least+0.016
efully+0.016
ful+0.015
 former+0.015
Neg logit contributions
 may-0.008
 seem-0.008
 opener-0.007
 Win-0.007
 hop-0.007
Token
Token ID198
Feature activation-8.92e-03

Pos logit contributions
 may+0.000
 seem+0.000
 Win+0.000
 opener+0.000
'd+0.000
Neg logit contributions
edly-0.000
efully-0.000
 least-0.000
ful-0.000
 each-0.000
Token
Token ID198
Feature activation-1.28e-02

Pos logit contributions
edly+0.015
 least+0.014
efully+0.014
ful+0.013
 former+0.013
Neg logit contributions
 may-0.011
 seem-0.011
 opener-0.010
 Win-0.010
 hop-0.010
Token,
Token ID11
Feature activation+8.44e-03

Pos logit contributions
 may+0.014
 seem+0.014
 opener+0.013
 Win+0.013
 hop+0.012
Neg logit contributions
edly-0.014
 least-0.013
efully-0.013
ful-0.013
 former-0.012
Token leaving
Token ID4305
Feature activation+4.81e-03

Pos logit contributions
 may+0.012
 seem+0.011
 Win+0.010
 hop+0.010
 shoot+0.010
Neg logit contributions
edly-0.013
efully-0.013
 least-0.012
ful-0.011
 former-0.011
Token it
Token ID340
Feature activation-1.71e-03

Pos logit contributions
 may+0.007
 seem+0.006
 Win+0.006
 opener+0.006
 hop+0.006
Neg logit contributions
edly-0.008
efully-0.007
 least-0.007
ful-0.006
 former-0.006
Token to
Token ID284
Feature activation+1.34e-02

Pos logit contributions
edly+0.002
efully+0.002
 least+0.002
ful+0.002
 former+0.002
Neg logit contributions
 may-0.003
 seem-0.003
 opener-0.002
 Win-0.002
 hop-0.002
Token its
Token ID663
Feature activation-1.76e-03

Pos logit contributions
 seem+0.016
 shoot+0.016
 hop+0.016
 may+0.016
 Win+0.015
Neg logit contributions
edly-0.025
efully-0.023
years-0.021
elt-0.021
ths-0.020
Token former
Token ID1966
Feature activation-1.40e-01

Pos logit contributions
edly+0.003
efully+0.003
ful+0.003
years+0.002
elt+0.002
Neg logit contributions
 may-0.002
 hop-0.002
 shoot-0.002
 ramp-0.002
 seem-0.002
Token state
Token ID1181
Feature activation-5.53e-02

Pos logit contributions
 least+0.184
ful+0.182
edly+0.181
efully+0.173
P+0.159
Neg logit contributions
 may-0.218
 seem-0.206
 ramp-0.189
 opener-0.187
 might-0.184
Token.
Token ID13
Feature activation-2.42e-02

Pos logit contributions
edly+0.090
efully+0.090
 least+0.088
 former+0.079
ful+0.076
Neg logit contributions
 may-0.069
 seem-0.067
 opener-0.063
 Win-0.060
'd-0.059
Token
Token ID220
Feature activation-1.47e-02

Pos logit contributions
edly+0.049
 least+0.046
efully+0.046
ful+0.045
 former+0.043
Neg logit contributions
 may-0.023
 seem-0.022
 Win-0.020
 opener-0.020
 lik-0.017
Token
Token ID198
Feature activation-2.59e-02

Pos logit contributions
edly+0.021
efully+0.020
 least+0.020
ful+0.019
 former+0.018
Neg logit contributions
 may-0.021
 seem-0.021
 opener-0.019
 Win-0.019
'd-0.018
Token
Token ID198
Feature activation-2.47e-02

Pos logit contributions
edly+0.042
 least+0.040
efully+0.039
ful+0.037
 former+0.036
Neg logit contributions
 may-0.034
 seem-0.033
 opener-0.031
 Win-0.031
 hop-0.029
Token the
Token ID262
Feature activation-3.37e-03

Pos logit contributions
edly+0.056
efully+0.053
 least+0.053
ful+0.051
 former+0.049
Neg logit contributions
 may-0.038
 seem-0.036
 opener-0.033
 Win-0.033
 hop-0.031
Token pony
Token ID26902
Feature activation-3.65e-02

Pos logit contributions
edly+0.005
efully+0.005
 least+0.005
ful+0.005
 former+0.004
Neg logit contributions
 may-0.005
 seem-0.005
 Win-0.004
 opener-0.004
 hop-0.004
Token went
Token ID1816
Feature activation-4.01e-02

Pos logit contributions
edly+0.067
efully+0.064
ful+0.062
 least+0.061
 former+0.059
Neg logit contributions
 may-0.043
 seem-0.037
'd-0.036
 hop-0.034
 might-0.033
Token off
Token ID572
Feature activation-4.08e-02

Pos logit contributions
edly+0.065
 least+0.061
efully+0.061
ful+0.057
 former+0.056
Neg logit contributions
 may-0.052
 seem-0.050
 Win-0.050
 hop-0.049
 opener-0.048
Token on
Token ID319
Feature activation-3.13e-02

Pos logit contributions
edly+0.063
efully+0.063
 least+0.060
ful+0.055
 former+0.053
Neg logit contributions
 may-0.055
 seem-0.054
 hop-0.051
 opener-0.051
 Win-0.050
Token even
Token ID772
Feature activation-1.24e-01

Pos logit contributions
edly+0.057
efully+0.054
 least+0.052
ful+0.052
elt+0.049
Neg logit contributions
 may-0.037
 seem-0.035
 hop-0.034
 Win-0.031
 opener-0.030
Token greater
Token ID3744
Feature activation-7.17e-02

Pos logit contributions
edly+0.227
efully+0.211
amb+0.199
der+0.198
ful+0.195
Neg logit contributions
 seem-0.137
 hop-0.131
 may-0.128
'd-0.121
 Win-0.120
Token adventures
Token ID17545
Feature activation-2.91e-02

Pos logit contributions
ful+0.106
edly+0.106
efully+0.106
 least+0.104
elt+0.099
Neg logit contributions
 may-0.106
 seem-0.098
 hop-0.095
 might-0.091
'd-0.086
Token.
Token ID13
Feature activation-3.62e-02

Pos logit contributions
edly+0.044
efully+0.043
 least+0.043
 former+0.039
ful+0.038
Neg logit contributions
 may-0.041
 seem-0.040
 opener-0.036
 Win-0.036
'd-0.035
Token He
Token ID679
Feature activation-2.84e-02

Pos logit contributions
edly+0.069
efully+0.065
 least+0.063
ful+0.061
 former+0.059
Neg logit contributions
 may-0.040
 seem-0.038
 Win-0.036
 opener-0.035
 hop-0.031
Token was
Token ID373
Feature activation-9.46e-03

Pos logit contributions
edly+0.058
efully+0.053
ful+0.053
 yourselves+0.050
 former+0.050
Neg logit contributions
 may-0.031
'd-0.027
 seem-0.025
 hop-0.024
 might-0.023
Token to
Token ID284
Feature activation+2.12e-02

Pos logit contributions
efully+0.079
edly+0.076
 least+0.070
 former+0.068
 example+0.067
Neg logit contributions
 seem-0.048
 may-0.048
 opener-0.043
'd-0.041
 Win-0.040
Token stay
Token ID2652
Feature activation+2.54e-02

Pos logit contributions
 seem+0.018
 may+0.017
 hop+0.017
 shoot+0.016
 Win+0.016
Neg logit contributions
edly-0.048
efully-0.045
der-0.042
elt-0.041
 least-0.040
Token in
Token ID287
Feature activation-1.55e-03

Pos logit contributions
 may+0.037
 seem+0.036
 Win+0.034
 opener+0.034
 hop+0.033
Neg logit contributions
edly-0.037
 least-0.034
efully-0.034
ful-0.031
der-0.030
Token.
Token ID13
Feature activation-2.33e-02

Pos logit contributions
edly+0.003
efully+0.003
 least+0.002
ful+0.002
der+0.002
Neg logit contributions
 may-0.002
 seem-0.002
 opener-0.002
 Win-0.002
 hop-0.002
Token So
Token ID1406
Feature activation-4.79e-02

Pos logit contributions
edly+0.041
 least+0.039
efully+0.039
ful+0.037
 former+0.036
Neg logit contributions
 may-0.027
 seem-0.026
 Win-0.025
 opener-0.024
 ramp-0.022
Token even
Token ID772
Feature activation-1.22e-01

Pos logit contributions
edly+0.091
efully+0.086
 least+0.084
ful+0.080
 former+0.079
Neg logit contributions
 may-0.054
 seem-0.049
 Win-0.046
 opener-0.043
 hop-0.043
Token though
Token ID996
Feature activation+4.94e-03

Pos logit contributions
edly+0.202
efully+0.200
 least+0.191
 former+0.182
view+0.175
Neg logit contributions
 may-0.148
 seem-0.146
 hop-0.139
 opener-0.137
 Win-0.137
Token the
Token ID262
Feature activation+5.04e-03

Pos logit contributions
 may+0.006
 seem+0.006
 opener+0.006
 Win+0.006
 ramp+0.005
Neg logit contributions
edly-0.008
 least-0.008
efully-0.008
ful-0.007
 former-0.007
Token sc
Token ID629
Feature activation-5.30e-02

Pos logit contributions
 may+0.007
 Win+0.006
 seem+0.006
 hop+0.006
 ramp+0.006
Neg logit contributions
edly-0.009
efully-0.008
ful-0.008
 least-0.007
der-0.007
Tokenooter
Token ID25141
Feature activation-5.63e-02

Pos logit contributions
edly+0.094
efully+0.087
 least+0.086
ful+0.084
 each+0.079
Neg logit contributions
 may-0.064
 seem-0.062
 Win-0.061
 opener-0.055
 ramp-0.053
Token was
Token ID373
Feature activation-5.24e-03

Pos logit contributions
edly+0.110
efully+0.106
 least+0.104
ful+0.102
 former+0.097
Neg logit contributions
 may-0.057
 seem-0.053
 Win-0.047
 opener-0.047
'd-0.045
Token go
Token ID467
Feature activation-3.60e-02

Pos logit contributions
 seem+0.010
 may+0.009
 hop+0.009
 shoot+0.009
 Win+0.008
Neg logit contributions
edly-0.024
efully-0.022
 least-0.021
der-0.021
years-0.021
Token!
Token ID0
Feature activation-3.78e-02

Pos logit contributions
edly+0.057
efully+0.054
 least+0.048
 former+0.048
der+0.047
Neg logit contributions
 seem-0.052
 may-0.051
 Win-0.049
 opener-0.048
 hop-0.047
Token Everyone
Token ID11075
Feature activation-5.61e-02

Pos logit contributions
edly+0.070
 least+0.067
efully+0.065
ful+0.063
 former+0.061
Neg logit contributions
 may-0.041
 seem-0.039
 Win-0.039
 opener-0.036
 hop-0.034
Token watched
Token ID7342
Feature activation+2.72e-02

Pos logit contributions
edly+0.105
 least+0.097
efully+0.097
 former+0.096
ful+0.093
Neg logit contributions
 may-0.063
 seem-0.059
'd-0.052
 Win-0.052
 opener-0.049
Token in
Token ID287
Feature activation-2.42e-02

Pos logit contributions
 may+0.038
 seem+0.038
 Win+0.035
 opener+0.034
 hop+0.033
Neg logit contributions
edly-0.041
 least-0.040
efully-0.039
 former-0.036
ful-0.035
Token amaz
Token ID40642
Feature activation-1.21e-01

Pos logit contributions
edly+0.039
efully+0.036
 least+0.035
ful+0.033
 former+0.033
Neg logit contributions
 may-0.033
 seem-0.032
 Win-0.030
 hop-0.029
 opener-0.029
Tokenement
Token ID972
Feature activation-4.02e-02

Pos logit contributions
 least+0.265
edly+0.262
 former+0.258
ful+0.246
 example+0.237
Neg logit contributions
 may-0.075
 seem-0.056
ves-0.053
ologists-0.051
 might-0.046
Token as
Token ID355
Feature activation+3.17e-03

Pos logit contributions
efully+0.069
 least+0.069
edly+0.067
P+0.065
 former+0.064
Neg logit contributions
 seem-0.048
 may-0.045
 Win-0.039
 opener-0.038
 hop-0.037
Token it
Token ID340
Feature activation-1.39e-02

Pos logit contributions
 may+0.004
 seem+0.004
 Win+0.003
 opener+0.003
 hop+0.003
Neg logit contributions
edly-0.006
efully-0.005
 least-0.005
 former-0.005
ful-0.005
Token took
Token ID1718
Feature activation-2.89e-02

Pos logit contributions
edly+0.027
 least+0.026
efully+0.026
ful+0.025
 former+0.024
Neg logit contributions
 may-0.014
 seem-0.013
 opener-0.012
 Win-0.012
'd-0.011
Token off
Token ID572
Feature activation-3.76e-02

Pos logit contributions
edly+0.050
 least+0.046
der+0.044
 former+0.043
efully+0.042
Neg logit contributions
 may-0.039
 seem-0.037
 Win-0.036
 shoot-0.034
 hop-0.034
Token to
Token ID284
Feature activation+1.10e-02

Pos logit contributions
edly+0.031
efully+0.029
 least+0.028
ful+0.028
 former+0.027
Neg logit contributions
 may-0.036
 seem-0.035
 might-0.032
 hop-0.032
 opener-0.031
Token climb
Token ID12080
Feature activation-3.42e-02

Pos logit contributions
 seem+0.010
 may+0.009
 hop+0.009
 shoot+0.009
 Win+0.008
Neg logit contributions
edly-0.024
efully-0.022
 least-0.021
der-0.021
years-0.021
Token again
Token ID757
Feature activation-6.99e-02

Pos logit contributions
edly+0.065
efully+0.062
 least+0.060
 former+0.060
ful+0.058
Neg logit contributions
 may-0.038
 seem-0.036
 might-0.032
 ramp-0.032
 Win-0.032
Token,
Token ID11
Feature activation-9.65e-04

Pos logit contributions
edly+0.142
efully+0.137
years+0.133
 least+0.132
ful+0.126
Neg logit contributions
 seem-0.065
 may-0.060
 hop-0.058
 might-0.053
 opener-0.051
Token reaching
Token ID8978
Feature activation-7.64e-03

Pos logit contributions
edly+0.002
efully+0.002
 least+0.002
ful+0.002
 former+0.001
Neg logit contributions
 seem-0.001
 may-0.001
 Win-0.001
 hop-0.001
 shoot-0.001
Token even
Token ID772
Feature activation-1.21e-01

Pos logit contributions
edly+0.014
efully+0.012
years+0.011
 former+0.011
 least+0.011
Neg logit contributions
 may-0.010
 seem-0.010
 hop-0.009
 might-0.009
 shoot-0.008
Token higher
Token ID2440
Feature activation-1.50e-02

Pos logit contributions
edly+0.232
amb+0.205
efully+0.201
ful+0.190
 former+0.187
Neg logit contributions
 seem-0.147
 may-0.129
 hop-0.128
 might-0.118
 move-0.118
Token than
Token ID621
Feature activation-4.33e-02

Pos logit contributions
years+0.026
edly+0.026
efully+0.025
 former+0.025
 least+0.024
Neg logit contributions
 seem-0.017
 may-0.017
 might-0.015
 hop-0.014
 ramp-0.014
Token before
Token ID878
Feature activation+3.00e-02

Pos logit contributions
edly+0.078
efully+0.075
 former+0.069
amb+0.063
ful+0.063
Neg logit contributions
 seem-0.055
 may-0.054
'd-0.046
 Win-0.046
 might-0.045
Token.
Token ID13
Feature activation-2.66e-02

Pos logit contributions
 may+0.037
 seem+0.036
 opener+0.031
 Win+0.031
 hop+0.030
Neg logit contributions
edly-0.053
efully-0.051
 least-0.048
 former-0.046
ful-0.046
Token When
Token ID1649
Feature activation-8.86e-03

Pos logit contributions
edly+0.050
 least+0.048
efully+0.047
ful+0.046
 former+0.043
Neg logit contributions
 may-0.028
 seem-0.027
 Win-0.026
 opener-0.024
 hop-0.023
Token spread
Token ID4104
Feature activation+1.11e-02

Pos logit contributions
 seem+0.013
 may+0.013
 Win+0.011
 opener+0.011
 hop+0.011
Neg logit contributions
edly-0.031
efully-0.029
 least-0.028
ful-0.027
 former-0.026
Token her
Token ID607
Feature activation-2.28e-02

Pos logit contributions
 may+0.015
 hop+0.014
 seem+0.014
 bounce+0.013
 might+0.012
Neg logit contributions
edly-0.020
efully-0.018
 former-0.018
 least-0.018
ths-0.017
Token love
Token ID1842
Feature activation-5.24e-02

Pos logit contributions
edly+0.038
efully+0.037
ful+0.035
 least+0.034
der+0.034
Neg logit contributions
 may-0.033
 might-0.029
 hop-0.028
 seem-0.027
ches-0.025
Token for
Token ID329
Feature activation-5.31e-03

Pos logit contributions
efully+0.100
edly+0.096
 former+0.093
 least+0.092
amed+0.086
Neg logit contributions
 may-0.062
 might-0.054
 seem-0.050
 hop-0.046
 opener-0.046
Token teaching
Token ID7743
Feature activation+4.43e-04

Pos logit contributions
edly+0.008
efully+0.008
 least+0.007
elt+0.007
amed+0.007
Neg logit contributions
 seem-0.008
 may-0.007
 hop-0.007
 might-0.006
ves-0.006
Token even
Token ID772
Feature activation-1.20e-01

Pos logit contributions
 seem+0.001
 may+0.001
 Win+0.001
 might+0.001
 opener+0.001
Neg logit contributions
edly-0.001
 least-0.001
efully-0.001
 former-0.001
ful-0.001
Token more
Token ID517
Feature activation-4.05e-02

Pos logit contributions
edly+0.199
efully+0.183
 least+0.178
ths+0.157
amed+0.157
Neg logit contributions
 seem-0.171
 may-0.166
 opener-0.151
 hop-0.150
 Win-0.150
Token.
Token ID13
Feature activation-2.04e-02

Pos logit contributions
edly+0.069
 least+0.066
efully+0.064
ful+0.057
 former+0.056
Neg logit contributions
 may-0.055
 seem-0.053
 opener-0.048
 hop-0.047
 Win-0.046
Token She
Token ID1375
Feature activation-1.96e-02

Pos logit contributions
edly+0.039
 least+0.037
efully+0.036
ful+0.036
 former+0.033
Neg logit contributions
 may-0.021
 seem-0.020
 Win-0.020
 opener-0.018
 hop-0.017
Token thought
Token ID1807
Feature activation-1.93e-02

Pos logit contributions
edly+0.038
 least+0.037
efully+0.035
ful+0.035
 yourselves+0.034
Neg logit contributions
 may-0.020
'd-0.019
 hop-0.017
ches-0.017
 seem-0.017
Token,
Token ID11
Feature activation-1.29e-03

Pos logit contributions
edly+0.030
efully+0.029
 least+0.028
ful+0.027
 former+0.026
Neg logit contributions
 may-0.026
 seem-0.025
 opener-0.024
 Win-0.024
 hop-0.022
Token her
Token ID607
Feature activation+8.51e-03

Pos logit contributions
edly+0.145
efully+0.142
 least+0.135
 ourselves+0.126
 former+0.125
Neg logit contributions
 seem-0.094
 may-0.093
 Win-0.081
 opener-0.080
'd-0.080
Token mom
Token ID1995
Feature activation-2.91e-02

Pos logit contributions
 may+0.012
 seem+0.011
 hop+0.011
 might+0.011
'd+0.011
Neg logit contributions
edly-0.013
efully-0.013
ful-0.012
 least-0.011
ths-0.011
Tokenmy
Token ID1820
Feature activation-1.63e-02

Pos logit contributions
edly+0.053
efully+0.049
 least+0.046
 former+0.046
ful+0.045
Neg logit contributions
 may-0.035
 seem-0.032
'd-0.031
 Win-0.028
 opener-0.028
Token watched
Token ID7342
Feature activation+2.56e-02

Pos logit contributions
edly+0.028
efully+0.026
 least+0.025
ful+0.024
 former+0.024
Neg logit contributions
 may-0.021
 seem-0.020
 opener-0.018
'd-0.018
 Win-0.018
Token in
Token ID287
Feature activation-2.16e-02

Pos logit contributions
 seem+0.032
 may+0.031
 Win+0.028
 opener+0.027
 hop+0.027
Neg logit contributions
edly-0.042
efully-0.041
 least-0.041
 former-0.038
P-0.035
Token amaz
Token ID40642
Feature activation-1.20e-01

Pos logit contributions
edly+0.038
efully+0.035
 least+0.033
ful+0.033
P+0.033
Neg logit contributions
 may-0.026
 seem-0.026
 Win-0.023
 opener-0.022
 might-0.022
Tokenement
Token ID972
Feature activation-3.50e-02

Pos logit contributions
edly+0.273
 least+0.264
 former+0.257
ful+0.248
 example+0.243
Neg logit contributions
 may-0.069
 seem-0.046
ologists-0.044
ves-0.041
 might-0.037
Token as
Token ID355
Feature activation+8.85e-03

Pos logit contributions
 least+0.065
efully+0.064
edly+0.061
P+0.059
 former+0.059
Neg logit contributions
 seem-0.037
 may-0.034
 opener-0.029
 Win-0.028
ches-0.028
Token the
Token ID262
Feature activation-6.25e-03

Pos logit contributions
 may+0.011
 seem+0.011
 Win+0.010
 opener+0.009
 hop+0.009
Neg logit contributions
edly-0.016
efully-0.015
 least-0.015
 former-0.013
ful-0.013
Token machine
Token ID4572
Feature activation-2.12e-03

Pos logit contributions
edly+0.010
efully+0.009
 least+0.009
ful+0.009
der+0.008
Neg logit contributions
 may-0.009
 seem-0.009
 Win-0.008
 hop-0.008
 opener-0.008
Token flew
Token ID13112
Feature activation+1.53e-03

Pos logit contributions
edly+0.004
efully+0.004
 least+0.004
 former+0.003
ful+0.003
Neg logit contributions
 may-0.002
 seem-0.002
 Win-0.002
 opener-0.002
'd-0.002
Token egg
Token ID5935
Feature activation+1.40e-02

Pos logit contributions
edly+0.031
efully+0.030
 least+0.029
 former+0.028
ful+0.028
Neg logit contributions
 may-0.021
 seem-0.019
 hop-0.018
'd-0.017
 opener-0.017
Token!
Token ID0
Feature activation-2.79e-02

Pos logit contributions
 may+0.019
 seem+0.017
 opener+0.016
'd+0.016
 hop+0.015
Neg logit contributions
edly-0.023
efully-0.023
 least-0.021
 former-0.021
 example-0.020
Token He
Token ID679
Feature activation-1.88e-02

Pos logit contributions
edly+0.053
 least+0.050
efully+0.049
ful+0.047
 former+0.046
Neg logit contributions
 may-0.029
 seem-0.028
 Win-0.027
 opener-0.026
 hop-0.024
Token watched
Token ID7342
Feature activation+3.36e-02

Pos logit contributions
edly+0.036
 least+0.035
efully+0.034
ful+0.033
 former+0.032
Neg logit contributions
 may-0.020
 seem-0.018
'd-0.016
 hop-0.016
 opener-0.016
Token in
Token ID287
Feature activation-2.39e-02

Pos logit contributions
 may+0.046
 seem+0.046
 opener+0.042
 Win+0.042
 hop+0.041
Neg logit contributions
edly-0.052
efully-0.049
 least-0.049
 former-0.045
ful-0.045
Token amaz
Token ID40642
Feature activation-1.20e-01

Pos logit contributions
edly+0.037
efully+0.034
 least+0.033
ful+0.032
 former+0.031
Neg logit contributions
 may-0.035
 seem-0.033
 hop-0.031
 opener-0.030
 Win-0.030
Tokenement
Token ID972
Feature activation-3.64e-02

Pos logit contributions
 least+0.257
 former+0.251
edly+0.249
ful+0.228
 example+0.225
Neg logit contributions
 may-0.086
 seem-0.066
ologists-0.062
 might-0.056
ves-0.053
Token as
Token ID355
Feature activation+4.29e-03

Pos logit contributions
 least+0.061
efully+0.059
edly+0.057
 former+0.057
P+0.054
Neg logit contributions
 seem-0.044
 may-0.042
 opener-0.038
 hop-0.038
 Win-0.036
Token the
Token ID262
Feature activation-1.19e-02

Pos logit contributions
 may+0.004
 seem+0.004
 opener+0.004
 Win+0.004
 hop+0.004
Neg logit contributions
edly-0.008
efully-0.008
 least-0.008
 former-0.007
ful-0.007
Token egg
Token ID5935
Feature activation-1.57e-02

Pos logit contributions
edly+0.019
 least+0.018
efully+0.017
ful+0.017
 former+0.016
Neg logit contributions
 may-0.016
 seem-0.015
 might-0.014
'd-0.014
 roar-0.014
Token began
Token ID2540
Feature activation-3.14e-02

Pos logit contributions
edly+0.028
efully+0.027
 least+0.026
ful+0.025
 former+0.024
Neg logit contributions
 may-0.018
 seem-0.017
 opener-0.016
 Win-0.016
'd-0.015
Tokeneddy
Token ID21874
Feature activation-4.05e-02

Pos logit contributions
 least+0.156
 example+0.151
edly+0.149
 cher+0.149
 moments+0.146
Neg logit contributions
 opener-0.128
ches-0.125
ologists-0.125
 may-0.112
uddy-0.109
Token bear
Token ID6842
Feature activation-7.64e-03

Pos logit contributions
edly+0.093
efully+0.090
 least+0.086
 former+0.086
ful+0.085
Neg logit contributions
 may-0.030
 seem-0.022
ches-0.021
 hop-0.020
 opener-0.020
Token and
Token ID290
Feature activation-3.16e-03

Pos logit contributions
edly+0.014
efully+0.013
 least+0.013
ful+0.012
 former+0.012
Neg logit contributions
 may-0.009
 seem-0.008
 opener-0.008
 Win-0.008
 hop-0.007
Token watched
Token ID7342
Feature activation+2.24e-02

Pos logit contributions
edly+0.006
efully+0.006
 least+0.006
ful+0.005
rous+0.005
Neg logit contributions
 may-0.004
 hop-0.003
 seem-0.003
 shoot-0.003
'd-0.003
Token in
Token ID287
Feature activation-2.26e-02

Pos logit contributions
 seem+0.028
 may+0.028
 opener+0.026
 Win+0.026
 hop+0.025
Neg logit contributions
edly-0.036
efully-0.036
 least-0.036
 former-0.032
ful-0.031
Token amaz
Token ID40642
Feature activation-1.18e-01

Pos logit contributions
edly+0.036
efully+0.034
 least+0.032
ful+0.031
 former+0.029
Neg logit contributions
 may-0.031
 seem-0.030
 Win-0.028
 opener-0.028
 hop-0.027
Tokenement
Token ID972
Feature activation-4.00e-02

Pos logit contributions
 least+0.261
edly+0.250
 former+0.248
 example+0.234
ful+0.232
Neg logit contributions
 may-0.077
ves-0.055
 seem-0.053
ologists-0.049
 might-0.048
Token as
Token ID355
Feature activation+1.14e-02

Pos logit contributions
 least+0.072
efully+0.071
edly+0.068
 former+0.064
P+0.063
Neg logit contributions
 seem-0.044
 may-0.041
 opener-0.037
 Win-0.036
 hop-0.036
Token it
Token ID340
Feature activation-6.65e-03

Pos logit contributions
 may+0.014
 seem+0.013
 Win+0.012
 opener+0.012
 hop+0.012
Neg logit contributions
edly-0.020
 least-0.019
efully-0.019
 former-0.017
ful-0.017
Token shrunk
Token ID38900
Feature activation-5.75e-02

Pos logit contributions
edly+0.013
 least+0.012
efully+0.012
ful+0.011
 former+0.011
Neg logit contributions
 may-0.007
 seem-0.007
 opener-0.006
 Win-0.006
 hop-0.006
Token an
Token ID281
Feature activation-3.81e-02

Pos logit contributions
 least+0.101
edly+0.099
efully+0.094
years+0.091
 former+0.089
Neg logit contributions
 may-0.067
 seem-0.060
 might-0.060
ches-0.059
 hop-0.051
Tokenered
Token ID1068
Feature activation-3.87e-02

Pos logit contributions
edly+0.022
 least+0.020
efully+0.020
ful+0.019
 former+0.019
Neg logit contributions
 may-0.017
 seem-0.016
 opener-0.015
 Win-0.015
'd-0.014
Token in
Token ID287
Feature activation-1.80e-02

Pos logit contributions
edly+0.062
 least+0.060
efully+0.057
 former+0.056
ful+0.053
Neg logit contributions
 may-0.050
 seem-0.047
 hop-0.044
 opener-0.044
 Win-0.043
Token,
Token ID11
Feature activation-5.96e-03

Pos logit contributions
edly+0.027
 least+0.025
efully+0.025
ful+0.023
 former+0.022
Neg logit contributions
 may-0.026
 seem-0.026
 opener-0.024
 Win-0.024
 hop-0.023
Token blinking
Token ID43196
Feature activation-2.37e-02

Pos logit contributions
edly+0.010
efully+0.009
 least+0.009
 former+0.009
ful+0.009
Neg logit contributions
 may-0.008
 seem-0.007
 opener-0.007
 Win-0.007
 hop-0.007
Token in
Token ID287
Feature activation-2.22e-02

Pos logit contributions
 least+0.041
edly+0.036
efully+0.035
 former+0.035
 example+0.032
Neg logit contributions
 may-0.031
 seem-0.028
 hop-0.027
 Win-0.026
 shoot-0.025
Token amaz
Token ID40642
Feature activation-1.18e-01

Pos logit contributions
edly+0.040
 least+0.037
efully+0.035
ful+0.034
der+0.034
Neg logit contributions
 may-0.026
 seem-0.025
 Win-0.025
 hop-0.025
 shoot-0.023
Tokenement
Token ID972
Feature activation-3.60e-02

Pos logit contributions
 least+0.244
 former+0.233
edly+0.229
efully+0.212
ful+0.211
Neg logit contributions
 may-0.100
 seem-0.080
 might-0.070
ves-0.067
ologists-0.064
Token.
Token ID13
Feature activation-2.41e-02

Pos logit contributions
 least+0.064
efully+0.062
edly+0.059
 former+0.058
rous+0.054
Neg logit contributions
 seem-0.040
 may-0.039
 opener-0.035
 hop-0.034
 Win-0.034
Token
Token ID198
Feature activation-2.61e-02

Pos logit contributions
edly+0.044
 least+0.042
efully+0.042
ful+0.040
 former+0.038
Neg logit contributions
 may-0.027
 seem-0.025
 opener-0.024
 Win-0.024
 hop-0.022
Token
Token ID198
Feature activation-2.34e-02

Pos logit contributions
edly+0.040
efully+0.037
 least+0.037
ful+0.035
 former+0.033
Neg logit contributions
 may-0.037
 seem-0.035
 opener-0.033
 Win-0.033
 ramp-0.031
TokenBut
Token ID1537
Feature activation-2.87e-02

Pos logit contributions
edly+0.035
 least+0.034
efully+0.033
ful+0.031
 former+0.030
Neg logit contributions
 may-0.033
 seem-0.032
 opener-0.030
 Win-0.030
 hop-0.029
Token big
Token ID1263
Feature activation+8.40e-04

Pos logit contributions
 may+0.015
 seem+0.014
'd+0.013
 opener+0.012
 might+0.012
Neg logit contributions
edly-0.024
efully-0.024
 least-0.021
ful-0.021
years-0.021
Token before
Token ID878
Feature activation+2.83e-02

Pos logit contributions
 may+0.001
 seem+0.001
 Win+0.001
 ramp+0.001
 hop+0.001
Neg logit contributions
edly-0.002
efully-0.001
 least-0.001
ful-0.001
 former-0.001
Token!
Token ID0
Feature activation-3.09e-02

Pos logit contributions
 may+0.038
 seem+0.037
 opener+0.035
 Win+0.035
 hop+0.033
Neg logit contributions
edly-0.044
efully-0.043
 least-0.042
ful-0.039
 former-0.038
Token She
Token ID1375
Feature activation-9.99e-03

Pos logit contributions
edly+0.057
 least+0.055
efully+0.054
ful+0.051
 former+0.050
Neg logit contributions
 may-0.033
 seem-0.032
 Win-0.031
 opener-0.030
 hop-0.028
Token could
Token ID714
Feature activation-9.36e-03

Pos logit contributions
edly+0.019
 least+0.018
efully+0.017
ful+0.017
 former+0.017
Neg logit contributions
 may-0.011
 seem-0.010
'd-0.009
 Win-0.009
 hop-0.009
Token even
Token ID772
Feature activation-1.17e-01

Pos logit contributions
edly+0.018
efully+0.017
 least+0.017
ful+0.016
 former+0.016
Neg logit contributions
 may-0.009
 seem-0.009
 opener-0.008
 Win-0.008
 hop-0.008
Token sit
Token ID1650
Feature activation+3.13e-03

Pos logit contributions
edly+0.271
efully+0.258
der+0.244
 least+0.239
ful+0.237
Neg logit contributions
 seem-0.100
 hop-0.085
 may-0.084
 shoot-0.083
 move-0.074
Token inside
Token ID2641
Feature activation-2.05e-02

Pos logit contributions
 may+0.004
 seem+0.004
 Win+0.004
 opener+0.004
 hop+0.004
Neg logit contributions
edly-0.005
 least-0.005
efully-0.004
 former-0.004
ful-0.004
Token it
Token ID340
Feature activation-1.44e-02

Pos logit contributions
edly+0.032
efully+0.031
 least+0.030
 former+0.028
ful+0.028
Neg logit contributions
 may-0.028
 seem-0.027
 opener-0.025
 Win-0.025
 hop-0.024
Token.
Token ID13
Feature activation-1.93e-02

Pos logit contributions
efully+0.025
edly+0.024
 least+0.023
ful+0.022
years+0.022
Neg logit contributions
 may-0.018
 seem-0.017
 opener-0.016
ches-0.015
 hop-0.015
Token
Token ID220
Feature activation-7.60e-03

Pos logit contributions
edly+0.037
 least+0.034
efully+0.034
ful+0.033
 former+0.032
Neg logit contributions
 may-0.021
 seem-0.020
 Win-0.019
 opener-0.018
 hop-0.017
Token continued
Token ID3767
Feature activation-4.43e-02

Pos logit contributions
edly+0.017
 least+0.015
efully+0.015
ful+0.015
 former+0.015
Neg logit contributions
 may-0.010
 seem-0.009
'd-0.009
 hop-0.009
 opener-0.008
Token to
Token ID284
Feature activation-1.49e-02

Pos logit contributions
edly+0.060
 least+0.055
efully+0.055
ful+0.051
 former+0.049
Neg logit contributions
 may-0.070
 seem-0.068
 Win-0.065
 opener-0.064
 hop-0.063
Token work
Token ID670
Feature activation-3.97e-02

Pos logit contributions
edly+0.031
efully+0.029
 least+0.029
ful+0.028
 former+0.027
Neg logit contributions
 may-0.013
 seem-0.013
 opener-0.011
 Win-0.011
 hop-0.010
Token,
Token ID11
Feature activation-9.33e-03

Pos logit contributions
edly+0.044
efully+0.041
 least+0.041
ful+0.037
 former+0.035
Neg logit contributions
 may-0.072
 seem-0.070
 opener-0.067
 Win-0.067
 hop-0.064
Token adding
Token ID4375
Feature activation-6.25e-02

Pos logit contributions
edly+0.014
efully+0.014
 least+0.013
ful+0.013
 former+0.012
Neg logit contributions
 may-0.013
 seem-0.013
 Win-0.012
 opener-0.012
 hop-0.011
Token even
Token ID772
Feature activation-1.17e-01

Pos logit contributions
edly+0.105
 least+0.091
efully+0.089
 former+0.088
years+0.085
Neg logit contributions
 may-0.091
 hop-0.084
 seem-0.083
 might-0.079
 opener-0.076
Token more
Token ID517
Feature activation-4.80e-02

Pos logit contributions
edly+0.182
efully+0.151
amb+0.148
 least+0.144
 former+0.137
Neg logit contributions
 seem-0.175
 may-0.171
 hop-0.168
 opener-0.164
 Win-0.161
Token bubbles
Token ID25037
Feature activation+4.25e-03

Pos logit contributions
edly+0.077
 least+0.073
efully+0.071
ful+0.066
 former+0.065
Neg logit contributions
 may-0.069
 seem-0.066
 opener-0.061
 hop-0.060
 Win-0.060
Token each
Token ID1123
Feature activation-5.09e-02

Pos logit contributions
 may+0.006
 seem+0.006
 opener+0.006
 hop+0.005
 Win+0.005
Neg logit contributions
edly-0.006
efully-0.006
 least-0.006
 former-0.006
ful-0.005
Token day
Token ID1110
Feature activation-3.10e-03

Pos logit contributions
edly+0.090
efully+0.084
 least+0.082
 former+0.080
ful+0.078
Neg logit contributions
 may-0.064
 seem-0.060
 hop-0.056
 opener-0.055
 Win-0.054
Token making
Token ID1642
Feature activation+3.99e-02

Pos logit contributions
efully+0.005
edly+0.005
 least+0.005
 former+0.005
ful+0.004
Neg logit contributions
 may-0.004
 seem-0.004
 opener-0.004
 Win-0.004
'd-0.003
Token,
Token ID11
Feature activation-3.73e-02

Pos logit contributions
edly+0.097
 least+0.094
 former+0.092
efully+0.091
view+0.085
Neg logit contributions
 may-0.040
 seem-0.033
 Win-0.029
 hop-0.029
 might-0.027
Token the
Token ID262
Feature activation-7.57e-03

Pos logit contributions
edly+0.070
efully+0.066
 least+0.065
ful+0.062
 former+0.062
Neg logit contributions
 may-0.041
 seem-0.039
 Win-0.037
 hop-0.035
 opener-0.034
Token mighty
Token ID18680
Feature activation-4.03e-02

Pos logit contributions
edly+0.012
efully+0.011
 least+0.011
ful+0.011
 former+0.010
Neg logit contributions
 may-0.010
 seem-0.010
 hop-0.010
 Win-0.009
 opener-0.009
Token bees
Token ID17002
Feature activation-1.83e-03

Pos logit contributions
edly+0.059
efully+0.055
 least+0.055
ful+0.053
 former+0.049
Neg logit contributions
 may-0.059
 seem-0.058
 Win-0.054
 opener-0.053
 hop-0.052
Token worked
Token ID3111
Feature activation-4.37e-02

Pos logit contributions
edly+0.003
efully+0.003
 least+0.003
 former+0.003
ful+0.003
Neg logit contributions
 may-0.002
 seem-0.002
 hop-0.002
'd-0.002
 opener-0.002
Token even
Token ID772
Feature activation-1.17e-01

Pos logit contributions
edly+0.056
 least+0.055
efully+0.053
 former+0.048
ful+0.047
Neg logit contributions
 may-0.071
 seem-0.069
 opener-0.065
ches-0.064
 hop-0.063
Token harder
Token ID7069
Feature activation-4.50e-02

Pos logit contributions
edly+0.196
efully+0.184
 least+0.175
ful+0.171
 former+0.167
Neg logit contributions
 seem-0.154
 may-0.151
 hop-0.135
 opener-0.135
'd-0.134
Token to
Token ID284
Feature activation-6.02e-03

Pos logit contributions
efully+0.075
edly+0.073
 least+0.073
years+0.065
 former+0.064
Neg logit contributions
 seem-0.059
 may-0.058
 might-0.051
 opener-0.051
 hop-0.048
Token make
Token ID787
Feature activation+2.31e-02

Pos logit contributions
edly+0.014
efully+0.013
 least+0.012
ful+0.012
der+0.012
Neg logit contributions
 seem-0.005
 may-0.004
 hop-0.004
 opener-0.004
 shoot-0.004
Token more
Token ID517
Feature activation-1.82e-02

Pos logit contributions
 seem+0.033
 may+0.031
 hop+0.029
 Win+0.028
ches+0.027
Neg logit contributions
edly-0.037
 least-0.034
efully-0.033
years-0.030
ful-0.030
Token honey
Token ID12498
Feature activation+1.78e-02

Pos logit contributions
edly+0.029
 least+0.028
efully+0.027
ful+0.026
years+0.023
Neg logit contributions
 seem-0.026
 may-0.025
 hop-0.023
 opener-0.021
 might-0.021
Token felt
Token ID2936
Feature activation-1.60e-02

Pos logit contributions
efully+0.029
edly+0.028
elt+0.026
 cher+0.025
ment+0.025
Neg logit contributions
 seem-0.021
 may-0.018
 opener-0.017
 shoot-0.017
 hop-0.016
Token so
Token ID523
Feature activation+1.58e-02

Pos logit contributions
edly+0.027
efully+0.024
 least+0.023
 former+0.023
ful+0.022
Neg logit contributions
 seem-0.022
 may-0.021
 hop-0.019
 Win-0.018
 shoot-0.017
Token sad
Token ID6507
Feature activation-1.87e-02

Pos logit contributions
 seem+0.022
 may+0.021
 hop+0.019
 lik+0.018
 might+0.017
Neg logit contributions
edly-0.026
 least-0.025
efully-0.024
ful-0.023
 former-0.022
Token and
Token ID290
Feature activation-7.63e-04

Pos logit contributions
efully+0.039
years+0.036
 least+0.034
amed+0.034
rous+0.034
Neg logit contributions
 seem-0.019
 may-0.018
 hop-0.015
 he-0.014
 move-0.014
Token tried
Token ID3088
Feature activation-2.09e-02

Pos logit contributions
edly+0.001
efully+0.001
 least+0.001
ful+0.001
 former+0.001
Neg logit contributions
 may-0.001
 seem-0.001
 Win-0.001
 hop-0.001
 shoot-0.001
Token even
Token ID772
Feature activation-1.17e-01

Pos logit contributions
edly+0.035
efully+0.035
 least+0.032
 former+0.029
years+0.028
Neg logit contributions
 seem-0.028
 may-0.027
 hop-0.026
 opener-0.025
 shoot-0.025
Token harder
Token ID7069
Feature activation-3.65e-02

Pos logit contributions
edly+0.188
efully+0.172
 least+0.162
ful+0.162
 former+0.153
Neg logit contributions
 seem-0.165
 may-0.162
 opener-0.148
 hop-0.148
 shoot-0.147
Token.
Token ID13
Feature activation-2.74e-02

Pos logit contributions
efully+0.074
edly+0.070
 least+0.069
years+0.065
 former+0.064
Neg logit contributions
 seem-0.037
 may-0.033
 opener-0.031
 might-0.029
 hop-0.028
Token
Token ID198
Feature activation-3.13e-02

Pos logit contributions
edly+0.051
 least+0.049
efully+0.048
ful+0.047
 former+0.044
Neg logit contributions
 may-0.029
 seem-0.028
 Win-0.026
 opener-0.026
 hop-0.023
Token
Token ID198
Feature activation-3.10e-02

Pos logit contributions
edly+0.054
 least+0.051
efully+0.050
ful+0.047
 former+0.047
Neg logit contributions
 may-0.038
 seem-0.037
 opener-0.035
 Win-0.035
 hop-0.033
TokenBut
Token ID1537
Feature activation-3.45e-02

Pos logit contributions
edly+0.045
 least+0.044
efully+0.041
 former+0.039
der+0.038
Neg logit contributions
 may-0.044
 seem-0.043
 Win-0.042
 opener-0.042
 hop-0.041
Token man
Token ID582
Feature activation-2.58e-02

Pos logit contributions
edly+0.017
 least+0.016
efully+0.016
ful+0.015
 former+0.015
Neg logit contributions
 may-0.015
 seem-0.015
 opener-0.014
 Win-0.014
 hop-0.013
Token drove
Token ID10357
Feature activation-8.30e-03

Pos logit contributions
edly+0.052
efully+0.049
 least+0.048
 former+0.047
ful+0.047
Neg logit contributions
 may-0.025
 seem-0.022
 opener-0.020
'd-0.020
 Win-0.020
Token off
Token ID572
Feature activation-4.35e-02

Pos logit contributions
edly+0.012
 least+0.011
efully+0.011
ful+0.010
 former+0.010
Neg logit contributions
 may-0.013
 seem-0.012
 Win-0.012
 opener-0.012
 ramp-0.011
Token in
Token ID287
Feature activation-4.92e-02

Pos logit contributions
edly+0.066
efully+0.063
 least+0.062
ful+0.058
 former+0.057
Neg logit contributions
 may-0.061
 seem-0.059
 opener-0.057
 Win-0.057
 ramp-0.053
Token his
Token ID465
Feature activation-4.99e-02

Pos logit contributions
edly+0.086
efully+0.079
 least+0.076
ful+0.073
der+0.073
Neg logit contributions
 may-0.062
 seem-0.060
 opener-0.058
 Win-0.058
 ramp-0.055
Token brand
Token ID4508
Feature activation-1.16e-01

Pos logit contributions
edly+0.103
ful+0.098
efully+0.097
der+0.094
 least+0.089
Neg logit contributions
 ramp-0.051
 may-0.049
 shoot-0.045
 hop-0.045
 drive-0.043
Token new
Token ID649
Feature activation-6.19e-02

Pos logit contributions
edly+0.246
 least+0.243
efully+0.240
ful+0.224
years+0.221
Neg logit contributions
 may-0.105
ves-0.089
 ramp-0.088
 opener-0.088
 might-0.080
Token car
Token ID1097
Feature activation-5.43e-02

Pos logit contributions
edly+0.111
efully+0.102
 least+0.101
ful+0.101
der+0.093
Neg logit contributions
 may-0.075
 ramp-0.071
 shoot-0.068
 Win-0.067
 hop-0.067
Token,
Token ID11
Feature activation-1.57e-03

Pos logit contributions
edly+0.096
efully+0.096
 least+0.091
ful+0.088
 former+0.087
Neg logit contributions
 may-0.063
 seem-0.058
 opener-0.055
 Win-0.053
 ramp-0.053
Token and
Token ID290
Feature activation-2.15e-02

Pos logit contributions
edly+0.003
efully+0.003
 least+0.002
ful+0.002
 former+0.002
Neg logit contributions
 may-0.002
 seem-0.002
 Win-0.002
 opener-0.002
 ramp-0.002
Token he
Token ID339
Feature activation-1.95e-02

Pos logit contributions
edly+0.039
efully+0.038
 least+0.036
 former+0.034
der+0.033
Neg logit contributions
 may-0.024
 Win-0.023
 shoot-0.022
 seem-0.021
 hop-0.021
Token and
Token ID290
Feature activation+1.63e-02

Pos logit contributions
efully+0.006
edly+0.006
years+0.005
 former+0.005
 least+0.005
Neg logit contributions
 may-0.005
 seem-0.005
 opener-0.004
'd-0.004
 shoot-0.004
Token be
Token ID307
Feature activation+7.89e-03

Pos logit contributions
 may+0.017
 seem+0.016
 hop+0.014
 shoot+0.014
'd+0.012
Neg logit contributions
edly-0.035
efully-0.034
ful-0.031
years-0.030
ths-0.030
Token able
Token ID1498
Feature activation+2.75e-02

Pos logit contributions
 hop+0.009
'd+0.008
 shoot+0.008
 seem+0.008
 may+0.008
Neg logit contributions
edly-0.014
der-0.013
 least-0.012
 former-0.011
efully-0.011
Token to
Token ID284
Feature activation-3.09e-03

Pos logit contributions
 may+0.041
 seem+0.040
 Win+0.036
 opener+0.036
 might+0.036
Neg logit contributions
edly-0.041
efully-0.039
 least-0.038
 former-0.035
ful-0.034
Token help
Token ID1037
Feature activation+2.19e-02

Pos logit contributions
edly+0.008
efully+0.007
ful+0.007
 least+0.007
years+0.007
Neg logit contributions
 seem-0.002
 may-0.002
 hop-0.002
 shoot-0.001
 opener-0.001
Token even
Token ID772
Feature activation-1.16e-01

Pos logit contributions
 seem+0.028
 may+0.027
 Win+0.025
 hop+0.025
 shoot+0.024
Neg logit contributions
edly-0.039
efully-0.036
 least-0.034
 former-0.033
years-0.032
Token more
Token ID517
Feature activation-3.36e-02

Pos logit contributions
edly+0.253
efully+0.245
 least+0.220
years+0.217
amed+0.214
Neg logit contributions
 seem-0.109
 hop-0.098
 shoot-0.096
 may-0.094
 move-0.089
Token people
Token ID661
Feature activation-2.68e-02

Pos logit contributions
edly+0.054
efully+0.054
 least+0.052
ful+0.046
years+0.046
Neg logit contributions
 may-0.045
 seem-0.044
 hop-0.039
 opener-0.039
 Win-0.038
Token.
Token ID13
Feature activation-1.34e-03

Pos logit contributions
edly+0.042
efully+0.040
 least+0.038
 former+0.036
ful+0.036
Neg logit contributions
 may-0.038
 seem-0.037
 opener-0.033
 Win-0.032
'd-0.032
Token<|endoftext|>
Token ID50256
Feature activation+1.09e-02

Pos logit contributions
edly+0.002
efully+0.002
 least+0.002
ful+0.002
 former+0.002
Neg logit contributions
 may-0.002
 seem-0.002
 Win-0.002
 opener-0.002
 hop-0.001
TokenOnce
Token ID7454
Feature activation-8.00e-03

Pos logit contributions
 may+0.020
 seem+0.020
fters+0.019
'd+0.019
 Win+0.019
Neg logit contributions
efully-0.010
 least-0.009
edly-0.009
 each-0.008
ful-0.008
Token his
Token ID465
Feature activation+1.32e-03

Pos logit contributions
 seem+0.034
 may+0.033
 hop+0.032
 shoot+0.031
 Win+0.030
Neg logit contributions
edly-0.070
efully-0.065
 least-0.060
elt-0.057
ful-0.057
Token friends
Token ID2460
Feature activation-1.43e-02

Pos logit contributions
 may+0.002
 seem+0.002
 opener+0.002
 Win+0.002
 hop+0.002
Neg logit contributions
edly-0.002
efully-0.002
 least-0.002
ful-0.002
 former-0.001
Token and
Token ID290
Feature activation+9.81e-03

Pos logit contributions
edly+0.019
efully+0.018
 least+0.018
ful+0.017
 former+0.016
Neg logit contributions
 may-0.023
 seem-0.022
 opener-0.021
 Win-0.021
 hop-0.020
Token be
Token ID307
Feature activation+1.13e-02

Pos logit contributions
 may+0.010
 seem+0.009
 hop+0.009
 shoot+0.008
 opener+0.008
Neg logit contributions
edly-0.020
efully-0.019
years-0.018
 least-0.018
der-0.018
Token safe
Token ID3338
Feature activation+2.04e-02

Pos logit contributions
 hop+0.014
 Win+0.014
 lik+0.013
 seem+0.013
 shoot+0.013
Neg logit contributions
edly-0.019
der-0.017
 least-0.017
efully-0.016
 former-0.016
Token when
Token ID618
Feature activation+5.63e-02

Pos logit contributions
 may+0.024
 seem+0.023
 opener+0.021
 Win+0.021
 hop+0.021
Neg logit contributions
efully-0.034
edly-0.033
 least-0.032
 former-0.030
rous-0.029
Token he
Token ID339
Feature activation-4.19e-03

Pos logit contributions
 may+0.060
 seem+0.057
 opener+0.054
 Win+0.053
 ramp+0.049
Neg logit contributions
edly-0.104
 least-0.099
efully-0.098
ful-0.095
 former-0.090
Token wanted
Token ID2227
Feature activation-4.09e-02

Pos logit contributions
edly+0.007
 least+0.007
efully+0.007
ful+0.007
 former+0.006
Neg logit contributions
 may-0.005
 seem-0.005
 opener-0.005
 Win-0.004
 ramp-0.004
Token to
Token ID284
Feature activation+2.03e-02

Pos logit contributions
edly+0.052
efully+0.049
 least+0.048
ful+0.045
 former+0.043
Neg logit contributions
 may-0.069
 seem-0.067
 opener-0.063
 Win-0.063
 hop-0.060
Token take
Token ID1011
Feature activation+1.03e-03

Pos logit contributions
 may+0.020
 seem+0.019
 opener+0.017
 Win+0.017
 ramp+0.015
Neg logit contributions
edly-0.040
 least-0.038
efully-0.038
ful-0.036
 former-0.035
Token something
Token ID1223
Feature activation+8.93e-03

Pos logit contributions
 may+0.001
 seem+0.001
 Win+0.001
 hop+0.001
 shoot+0.001
Neg logit contributions
edly-0.002
efully-0.002
 least-0.002
der-0.002
 former-0.001
Token,
Token ID11
Feature activation+5.43e-03

Pos logit contributions
 seem+0.016
 may+0.016
 opener+0.016
 ramp+0.015
 Win+0.015
Neg logit contributions
edly-0.014
ful-0.014
 each-0.013
 least-0.013
efully-0.013
Token Sarah
Token ID10490
Feature activation-6.04e-03

Pos logit contributions
 may+0.007
 seem+0.007
 Win+0.006
 opener+0.006
 lik+0.006
Neg logit contributions
edly-0.009
efully-0.009
 least-0.009
ful-0.008
 former-0.008
Token's
Token ID338
Feature activation+2.08e-02

Pos logit contributions
edly+0.009
 least+0.009
efully+0.009
ful+0.008
 each+0.008
Neg logit contributions
 may-0.008
 seem-0.008
 opener-0.008
 Win-0.008
 ramp-0.007
Token dad
Token ID9955
Feature activation-3.19e-02

Pos logit contributions
 may+0.026
 hop+0.024
 opener+0.023
 seem+0.023
 lik+0.023
Neg logit contributions
edly-0.036
efully-0.034
 least-0.031
ful-0.030
ment-0.030
Token said
Token ID531
Feature activation+7.99e-02

Pos logit contributions
 least+0.042
 each+0.040
 now+0.039
ful+0.038
 then+0.037
Neg logit contributions
 ramp-0.050
fters-0.049
 seem-0.049
 may-0.047
 opener-0.046
Token to
Token ID284
Feature activation+5.12e-02

Pos logit contributions
 ramp+0.124
ches+0.123
 opener+0.121
 Win+0.120
 may+0.119
Neg logit contributions
ful-0.104
edly-0.096
 each-0.091
 now-0.082
der-0.081
Token her
Token ID607
Feature activation+6.11e-03

Pos logit contributions
 seem+0.079
 Win+0.072
 may+0.071
 opener+0.068
 shoot+0.068
Neg logit contributions
edly-0.080
efully-0.070
der-0.068
 least-0.066
 former-0.066
Token,
Token ID11
Feature activation+3.70e-02

Pos logit contributions
 may+0.007
 seem+0.007
 opener+0.006
 Win+0.006
 hop+0.006
Neg logit contributions
edly-0.011
efully-0.010
 least-0.010
ful-0.010
 former-0.010
Token "
Token ID366
Feature activation-4.70e-02

Pos logit contributions
 may+0.041
 seem+0.039
 opener+0.037
 Win+0.036
 ramp+0.035
Neg logit contributions
edly-0.065
ful-0.062
 least-0.060
efully-0.060
 each-0.059
TokenLet
Token ID5756
Feature activation+1.56e-02

Pos logit contributions
edly+0.077
 least+0.071
efully+0.069
ful+0.067
 former+0.065
Neg logit contributions
 may-0.060
 Win-0.059
 seem-0.059
'd-0.056
 opener-0.056
Token's
Token ID338
Feature activation-2.41e-02

Pos logit contributions
 may+0.015
 seem+0.015
 opener+0.014
 Win+0.013
 ramp+0.012
Neg logit contributions
edly-0.030
efully-0.029
 least-0.029
ful-0.027
 former-0.026
Token restore
Token ID11169
Feature activation-1.43e-02

Pos logit contributions
 seem+0.017
 may+0.016
 opener+0.016
 shoot+0.015
 solve+0.013
Neg logit contributions
edly-0.062
efully-0.059
der-0.053
elt-0.053
years-0.053
Token our
Token ID674
Feature activation-1.79e-02

Pos logit contributions
edly+0.031
years+0.027
efully+0.027
elt+0.026
 former+0.025
Neg logit contributions
 may-0.015
 might-0.014
 seem-0.013
ches-0.013
 shoot-0.012
Token loop
Token ID9052
Feature activation-1.30e-02

Pos logit contributions
edly+0.032
ful+0.031
efully+0.031
 least+0.028
years+0.027
Neg logit contributions
 may-0.024
 ramp-0.021
 might-0.020
 seem-0.020
 hop-0.019
Token,"
Token ID553
Feature activation+1.69e-02

Pos logit contributions
edly+0.025
years+0.025
efully+0.025
 least+0.022
 former+0.022
Neg logit contributions
 may-0.014
 seem-0.013
 opener-0.013
 ramp-0.013
ches-0.013
Token he
Token ID339
Feature activation+2.82e-03

Pos logit contributions
 may+0.016
 seem+0.016
 Win+0.015
 opener+0.014
 hop+0.013
Neg logit contributions
edly-0.033
 least-0.032
efully-0.032
 former-0.031
ful-0.030
Token says
Token ID1139
Feature activation+8.74e-02

Pos logit contributions
 may+0.005
 seem+0.005
fters+0.005
 Win+0.004
 haven+0.004
Neg logit contributions
efully-0.003
edly-0.003
 each-0.003
 least-0.003
der-0.003
Token.
Token ID13
Feature activation+2.91e-02

Pos logit contributions
 may+0.156
 Win+0.142
 seem+0.142
 ramp+0.138
'd+0.138
Neg logit contributions
edly-0.112
ful-0.100
efully-0.090
 each-0.089
 least-0.087
Token "
Token ID366
Feature activation-3.75e-02

Pos logit contributions
 may+0.031
 seem+0.030
 opener+0.029
 Win+0.028
 hop+0.026
Neg logit contributions
 least-0.053
edly-0.053
efully-0.052
ful-0.049
 former-0.048
TokenYou
Token ID1639
Feature activation-1.08e-02

Pos logit contributions
 least+0.055
efully+0.053
edly+0.051
 former+0.050
ful+0.049
Neg logit contributions
 opener-0.053
 seem-0.052
 Win-0.048
 may-0.047
'd-0.045
Token hold
Token ID1745
Feature activation-2.62e-03

Pos logit contributions
edly+0.025
efully+0.024
 least+0.024
ful+0.023
 former+0.022
Neg logit contributions
 may-0.007
 seem-0.006
 opener-0.006
 Win-0.005
 hop-0.005
Token the
Token ID262
Feature activation+1.53e-02

Pos logit contributions
edly+0.006
 former+0.006
view+0.005
efully+0.005
der+0.005
Neg logit contributions
 may-0.003
 Win-0.002
 might-0.002
ches-0.002
 seem-0.002
Token sorry
Token ID7926
Feature activation+2.48e-02

Pos logit contributions
 may+0.017
 seem+0.016
 opener+0.016
 Win+0.015
 ramp+0.015
Neg logit contributions
 least-0.010
edly-0.010
efully-0.010
ful-0.009
 each-0.009
Token to
Token ID284
Feature activation+4.07e-02

Pos logit contributions
 may+0.032
 seem+0.031
 opener+0.029
 Win+0.029
 hop+0.027
Neg logit contributions
edly-0.041
efully-0.039
 least-0.038
ful-0.036
 former-0.035
Token Sara
Token ID24799
Feature activation+2.28e-02

Pos logit contributions
 shoot+0.049
 seem+0.048
 Win+0.046
 tease+0.046
 hop+0.045
Neg logit contributions
edly-0.084
amed-0.074
efully-0.073
ment-0.070
elt-0.066
Token."
Token ID526
Feature activation+2.20e-02

Pos logit contributions
 may+0.033
 seem+0.032
 opener+0.031
 Win+0.030
 hop+0.029
Neg logit contributions
edly-0.033
efully-0.031
 least-0.031
ful-0.029
 former-0.028
Token Mom
Token ID11254
Feature activation-2.81e-02

Pos logit contributions
 may+0.037
 seem+0.036
 Win+0.035
 opener+0.035
 hop+0.034
Neg logit contributions
edly-0.029
efully-0.026
 least-0.025
ful-0.024
 former-0.024
Token says
Token ID1139
Feature activation+9.45e-02

Pos logit contributions
efully+0.032
 least+0.031
 each+0.029
ful+0.027
edly+0.027
Neg logit contributions
 may-0.049
 Win-0.046
 seem-0.046
ves-0.046
'd-0.045
Token,
Token ID11
Feature activation+5.49e-02

Pos logit contributions
 may+0.145
 Win+0.138
 seem+0.134
 opener+0.131
 ramp+0.131
Neg logit contributions
edly-0.138
ful-0.130
efully-0.118
 each-0.118
 least-0.115
Token giving
Token ID3501
Feature activation+1.44e-02

Pos logit contributions
 may+0.072
 seem+0.069
 opener+0.065
 Win+0.065
 hop+0.061
Neg logit contributions
edly-0.088
 least-0.083
efully-0.083
ful-0.079
 former-0.076
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
 may+0.023
 seem+0.022
 opener+0.022
 Win+0.021
 ramp+0.021
Neg logit contributions
edly-0.019
efully-0.018
 least-0.018
ful-0.017
 each-0.016
Token phone
Token ID3072
Feature activation-1.81e-02

Pos logit contributions
 may+0.022
 seem+0.021
 opener+0.020
 Win+0.020
 hop+0.019
Neg logit contributions
edly-0.020
 least-0.019
efully-0.019
ful-0.018
 each-0.017
Token to
Token ID284
Feature activation+4.92e-02

Pos logit contributions
 least+0.029
edly+0.028
efully+0.028
 former+0.026
ths+0.025
Neg logit contributions
 seem-0.024
 may-0.023
 opener-0.023
 Win-0.021
 hop-0.021
Token to
Token ID284
Feature activation+3.18e-02

Pos logit contributions
 may+0.045
 seem+0.043
 Win+0.041
 opener+0.041
 hop+0.039
Neg logit contributions
edly-0.045
efully-0.042
 least-0.041
ful-0.039
 former-0.037
Token my
Token ID616
Feature activation-5.27e-03

Pos logit contributions
 may+0.027
 seem+0.026
 Win+0.025
 opener+0.025
 hop+0.024
Neg logit contributions
edly-0.070
efully-0.064
 least-0.061
der-0.060
ful-0.059
Token party
Token ID2151
Feature activation-2.12e-02

Pos logit contributions
edly+0.009
efully+0.009
ful+0.009
ment+0.008
der+0.008
Neg logit contributions
 may-0.006
 hop-0.006
 Win-0.006
 ramp-0.006
 seem-0.006
Token?"
Token ID1701
Feature activation+1.64e-02

Pos logit contributions
edly+0.045
efully+0.040
years+0.038
der+0.038
 least+0.038
Neg logit contributions
 opener-0.018
 Win-0.018
 may-0.017
 seem-0.017
 shoot-0.017
Token Sara
Token ID24799
Feature activation+2.01e-02

Pos logit contributions
 may+0.023
 seem+0.023
 Win+0.021
 opener+0.021
 lik+0.020
Neg logit contributions
edly-0.025
 least-0.024
efully-0.023
ful-0.022
 former-0.021
Token said
Token ID531
Feature activation+9.01e-02

Pos logit contributions
 may+0.030
 seem+0.028
 Win+0.027
 opener+0.027
 ramp+0.026
Neg logit contributions
edly-0.030
efully-0.028
 least-0.027
ful-0.026
 each-0.025
Token.
Token ID13
Feature activation+2.41e-02

Pos logit contributions
 may+0.169
 Win+0.156
 ramp+0.155
 seem+0.153
fters+0.148
Neg logit contributions
edly-0.107
ful-0.099
 each-0.088
efully-0.086
 least-0.079
Token
Token ID198
Feature activation+1.42e-02

Pos logit contributions
 may+0.038
 seem+0.036
 ramp+0.035
'd+0.034
ches+0.034
Neg logit contributions
edly-0.031
efully-0.029
 each-0.028
 least-0.028
ful-0.028
Token
Token ID198
Feature activation+9.97e-03

Pos logit contributions
 may+0.024
 seem+0.023
 ramp+0.021
 opener+0.021
'd+0.021
Neg logit contributions
edly-0.016
efully-0.016
 least-0.016
ful-0.015
 each-0.015
Token"
Token ID1
Feature activation-3.66e-02

Pos logit contributions
 may+0.015
 seem+0.014
 opener+0.013
 Win+0.013
 ramp+0.013
Neg logit contributions
edly-0.014
efully-0.014
 least-0.013
ful-0.013
 each-0.013
TokenOf
Token ID5189
Feature activation-3.44e-02

Pos logit contributions
 least+0.054
edly+0.053
 former+0.052
efully+0.051
ful+0.050
Neg logit contributions
 may-0.051
 Win-0.051
 opener-0.051
 seem-0.050
'd-0.048
Token it
Token ID340
Feature activation+1.64e-02

Pos logit contributions
edly+0.049
efully+0.046
amed+0.042
elt+0.041
P+0.041
Neg logit contributions
 may-0.019
 seem-0.016
 Win-0.015
 opener-0.013
 might-0.013
Token knew
Token ID2993
Feature activation-5.93e-02

Pos logit contributions
 may+0.016
 seem+0.015
 opener+0.014
 Win+0.014
 ramp+0.012
Neg logit contributions
edly-0.032
 least-0.031
efully-0.031
ful-0.029
 former-0.028
Token it
Token ID340
Feature activation+7.78e-03

Pos logit contributions
edly+0.100
efully+0.094
 least+0.083
 former+0.082
view+0.082
Neg logit contributions
 may-0.089
 seem-0.077
 might-0.073
'd-0.072
 Win-0.069
Token had
Token ID550
Feature activation-2.73e-02

Pos logit contributions
 may+0.007
 seem+0.006
 opener+0.006
 Win+0.006
 ramp+0.005
Neg logit contributions
edly-0.016
 least-0.015
efully-0.015
ful-0.015
 former-0.014
Token done
Token ID1760
Feature activation-9.61e-04

Pos logit contributions
edly+0.044
efully+0.041
 least+0.038
ful+0.038
 former+0.036
Neg logit contributions
 may-0.039
 seem-0.037
'd-0.034
 Win-0.033
 opener-0.033
Token a
Token ID257
Feature activation-3.91e-03

Pos logit contributions
edly+0.001
efully+0.001
 least+0.001
ful+0.001
 former+0.001
Neg logit contributions
 may-0.002
 seem-0.001
 Win-0.001
'd-0.001
 opener-0.001
Token mighty
Token ID18680
Feature activation-8.75e-03

Pos logit contributions
edly+0.007
efully+0.006
ful+0.006
 least+0.005
der+0.005
Neg logit contributions
 may-0.006
 seem-0.005
 Win-0.005
 hop-0.005
'd-0.005
Token job
Token ID1693
Feature activation+1.30e-02

Pos logit contributions
edly+0.015
efully+0.014
 least+0.014
ful+0.013
 former+0.012
Neg logit contributions
 may-0.011
 seem-0.011
 Win-0.010
 opener-0.010
 hop-0.009
Token.
Token ID13
Feature activation-2.63e-02

Pos logit contributions
 may+0.017
 seem+0.017
 opener+0.015
 Win+0.015
'd+0.015
Neg logit contributions
edly-0.022
efully-0.022
 least-0.020
 former-0.018
ful-0.018
Token It
Token ID632
Feature activation+1.16e-02

Pos logit contributions
edly+0.053
 least+0.049
ful+0.049
efully+0.048
 former+0.046
Neg logit contributions
 may-0.026
 seem-0.024
 Win-0.024
 opener-0.022
 lik-0.019
Token had
Token ID550
Feature activation-3.61e-02

Pos logit contributions
 may+0.011
'd+0.010
 seem+0.010
 hop+0.009
 opener+0.008
Neg logit contributions
edly-0.024
efully-0.023
 least-0.022
ful-0.022
 former-0.021
Token slowly
Token ID6364
Feature activation-2.66e-02

Pos logit contributions
edly+0.014
 least+0.014
efully+0.014
ful+0.013
 former+0.013
Neg logit contributions
 may-0.008
 seem-0.007
'd-0.007
 hop-0.007
 opener-0.007
Token walked
Token ID6807
Feature activation-1.33e-03

Pos logit contributions
edly+0.050
 least+0.048
efully+0.048
ful+0.045
 former+0.045
Neg logit contributions
 may-0.027
 seem-0.026
 opener-0.024
 Win-0.024
 hop-0.023
Token up
Token ID510
Feature activation-3.04e-02

Pos logit contributions
edly+0.002
efully+0.002
 least+0.002
ful+0.002
 former+0.002
Neg logit contributions
 may-0.002
 seem-0.002
 opener-0.002
 Win-0.002
 ramp-0.002
Token to
Token ID284
Feature activation+6.65e-03

Pos logit contributions
edly+0.044
efully+0.043
 least+0.043
 former+0.040
ful+0.040
Neg logit contributions
 seem-0.043
 may-0.042
 opener-0.042
 Win-0.041
 hop-0.039
Token it
Token ID340
Feature activation-9.13e-03

Pos logit contributions
 may+0.009
 seem+0.008
 Win+0.008
 opener+0.008
 hop+0.007
Neg logit contributions
edly-0.012
efully-0.011
 least-0.010
 former-0.010
ful-0.010
Token.
Token ID13
Feature activation-1.67e-02

Pos logit contributions
edly+0.012
 least+0.011
efully+0.011
ful+0.010
 former+0.010
Neg logit contributions
 may-0.015
 seem-0.014
 opener-0.014
 Win-0.014
 hop-0.013
Token The
Token ID383
Feature activation+4.51e-03

Pos logit contributions
edly+0.031
 least+0.030
efully+0.030
ful+0.029
 former+0.028
Neg logit contributions
 may-0.017
 seem-0.017
 opener-0.016
 Win-0.016
 ramp-0.014
Token cat
Token ID3797
Feature activation-9.43e-03

Pos logit contributions
 may+0.005
 seem+0.005
 opener+0.005
 Win+0.005
'd+0.004
Neg logit contributions
edly-0.008
 least-0.008
efully-0.008
ful-0.007
 former-0.007
Token was
Token ID373
Feature activation+4.68e-03

Pos logit contributions
edly+0.017
 least+0.016
efully+0.016
ful+0.015
 former+0.015
Neg logit contributions
 may-0.011
 seem-0.010
 opener-0.009
 Win-0.009
 hop-0.009
Token scared
Token ID12008
Feature activation+3.35e-02

Pos logit contributions
 may+0.007
 seem+0.007
 Win+0.006
 opener+0.006
 hop+0.006
Neg logit contributions
edly-0.007
 least-0.006
efully-0.006
ful-0.006
 former-0.006
Token and
Token ID290
Feature activation+5.55e-03

Pos logit contributions
 may+0.046
 seem+0.045
 opener+0.042
 Win+0.042
 hop+0.039
Neg logit contributions
edly-0.052
efully-0.049
 least-0.049
ful-0.046
 former-0.045
Token huge
Token ID3236
Feature activation-4.58e-03

Pos logit contributions
 may+0.002
 seem+0.002
 Win+0.002
 opener+0.002
 lik+0.001
Neg logit contributions
edly-0.002
 least-0.002
efully-0.002
ful-0.002
 former-0.001
Token wind
Token ID2344
Feature activation+3.30e-02

Pos logit contributions
edly+0.007
 least+0.007
efully+0.007
ful+0.007
 former+0.006
Neg logit contributions
 may-0.006
 seem-0.006
 opener-0.005
 Win-0.005
'd-0.005
Token started
Token ID2067
Feature activation-1.21e-02

Pos logit contributions
 may+0.035
 seem+0.034
 Win+0.032
 opener+0.032
 ramp+0.029
Neg logit contributions
edly-0.061
ful-0.058
 least-0.057
efully-0.057
 each-0.054
Token to
Token ID284
Feature activation+2.18e-02

Pos logit contributions
edly+0.015
 least+0.013
efully+0.013
ful+0.012
 former+0.012
Neg logit contributions
 may-0.021
 seem-0.020
 opener-0.019
 Win-0.019
 hop-0.018
Token blow
Token ID6611
Feature activation+3.60e-02

Pos logit contributions
 may+0.024
 Win+0.023
 seem+0.022
 opener+0.021
ologists+0.020
Neg logit contributions
 least-0.038
edly-0.037
efully-0.036
ful-0.036
 each-0.035
Token and
Token ID290
Feature activation-1.11e-02

Pos logit contributions
 may+0.054
 seem+0.052
 Win+0.049
 opener+0.049
 hop+0.047
Neg logit contributions
edly-0.052
 least-0.049
efully-0.047
ful-0.044
 former-0.044
Token it
Token ID340
Feature activation+7.42e-03

Pos logit contributions
edly+0.021
efully+0.020
 least+0.019
 former+0.018
ful+0.018
Neg logit contributions
 may-0.013
 seem-0.012
 shoot-0.011
 Win-0.011
 opener-0.011
Token scared
Token ID12008
Feature activation+3.04e-02

Pos logit contributions
 may+0.008
 seem+0.008
 opener+0.008
 Win+0.008
 lik+0.007
Neg logit contributions
edly-0.013
 least-0.013
efully-0.013
ful-0.012
 former-0.011
Token him
Token ID683
Feature activation+1.68e-02

Pos logit contributions
 may+0.036
 seem+0.035
 Win+0.033
 opener+0.032
 hop+0.030
Neg logit contributions
edly-0.053
 least-0.050
efully-0.050
ful-0.047
 former-0.047
Token even
Token ID772
Feature activation-7.00e-02

Pos logit contributions
 may+0.027
 seem+0.026
 opener+0.026
 Win+0.026
 lik+0.025
Neg logit contributions
edly-0.021
 least-0.020
ful-0.020
efully-0.019
 each-0.018
Token more
Token ID517
Feature activation-8.63e-03

Pos logit contributions
edly+0.121
efully+0.111
 least+0.108
ful+0.106
 former+0.102
Neg logit contributions
 seem-0.087
 may-0.085
 hop-0.080
 shoot-0.079
 Win-0.078
Token Dad
Token ID17415
Feature activation-9.24e-03

Pos logit contributions
 seem+0.066
 may+0.063
 Win+0.061
 shoot+0.058
 hop+0.056
Neg logit contributions
edly-0.084
efully-0.072
 least-0.070
 former-0.069
ful-0.066
Token angry
Token ID7954
Feature activation+1.19e-02

Pos logit contributions
ful+0.011
edly+0.011
 least+0.011
efully+0.010
 each+0.010
Neg logit contributions
 may-0.016
 opener-0.014
'd-0.014
 Win-0.014
 ramp-0.013
Token.
Token ID13
Feature activation+3.38e-02

Pos logit contributions
 seem+0.014
 may+0.012
 Win+0.012
 opener+0.011
 hop+0.011
Neg logit contributions
edly-0.021
 former-0.021
 least-0.021
efully-0.021
rous-0.019
Token
Token ID198
Feature activation+1.67e-02

Pos logit contributions
 may+0.039
 seem+0.038
 Win+0.037
 opener+0.035
 lik+0.033
Neg logit contributions
edly-0.060
 least-0.057
efully-0.057
ful-0.054
 former-0.052
Token
Token ID198
Feature activation+1.34e-02

Pos logit contributions
 may+0.019
 seem+0.018
 opener+0.018
 Win+0.017
 hop+0.016
Neg logit contributions
edly-0.030
 least-0.029
efully-0.028
 former-0.026
ful-0.026
TokenThey
Token ID2990
Feature activation+2.36e-02

Pos logit contributions
 may+0.016
 opener+0.015
 Win+0.015
 seem+0.015
 hop+0.014
Neg logit contributions
edly-0.022
 least-0.022
efully-0.021
 former-0.021
ful-0.019
Token go
Token ID467
Feature activation+1.23e-03

Pos logit contributions
 opener+0.029
ologists+0.028
ves+0.026
 Win+0.026
 may+0.026
Neg logit contributions
 least-0.039
efully-0.036
 each-0.036
edly-0.035
ful-0.032
Token to
Token ID284
Feature activation+1.33e-02

Pos logit contributions
 may+0.002
 seem+0.002
 Win+0.002
 opener+0.002
 hop+0.002
Neg logit contributions
edly-0.002
efully-0.001
 least-0.001
 former-0.001
ful-0.001
Token the
Token ID262
Feature activation+9.67e-03

Pos logit contributions
 seem+0.020
 may+0.020
 opener+0.019
 Win+0.019
 hop+0.018
Neg logit contributions
edly-0.019
efully-0.018
 least-0.016
ful-0.016
 former-0.016
Token kitchen
Token ID9592
Feature activation+2.21e-02

Pos logit contributions
 may+0.011
 opener+0.011
 ramp+0.010
 Win+0.010
 seem+0.010
Neg logit contributions
edly-0.017
efully-0.016
 least-0.015
ful-0.015
ment-0.015
Token and
Token ID290
Feature activation+2.33e-02

Pos logit contributions
 may+0.032
 seem+0.031
 opener+0.030
 Win+0.029
 ramp+0.028
Neg logit contributions
edly-0.032
efully-0.031
 least-0.030
ful-0.028
 former-0.028
Token
Token ID220
Feature activation-6.13e-04

Pos logit contributions
edly+0.005
 least+0.005
efully+0.005
ful+0.005
 each+0.005
Neg logit contributions
 may-0.005
 seem-0.005
 opener-0.004
 Win-0.004
ologists-0.004
Token
Token ID198
Feature activation-1.38e-02

Pos logit contributions
edly+0.001
efully+0.001
 least+0.001
ful+0.001
 former+0.001
Neg logit contributions
 may-0.001
 seem-0.001
 opener-0.001
 Win-0.001
 ramp-0.001
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
edly+0.017
efully+0.016
 least+0.016
ful+0.015
 each+0.015
Neg logit contributions
 may-0.023
 seem-0.021
 opener-0.020
 Win-0.020
 ramp-0.020
TokenOne
Token ID3198
Feature activation-3.30e-02

Pos logit contributions
edly+0.013
 least+0.013
efully+0.013
ful+0.012
 each+0.012
Neg logit contributions
 may-0.022
 seem-0.021
 opener-0.020
ologists-0.020
 Win-0.020
Token day
Token ID1110
Feature activation+9.52e-03

Pos logit contributions
edly+0.043
 least+0.040
efully+0.040
ful+0.038
 former+0.035
Neg logit contributions
 may-0.054
 seem-0.052
 opener-0.050
 Win-0.049
 ramp-0.047
Token,
Token ID11
Feature activation+5.20e-03

Pos logit contributions
 may+0.014
 seem+0.013
 opener+0.013
 ramp+0.013
'd+0.013
Neg logit contributions
edly-0.013
ful-0.012
 least-0.012
 each-0.012
efully-0.012
Token L
Token ID406
Feature activation-9.33e-03

Pos logit contributions
 may+0.008
 seem+0.008
 opener+0.007
 Win+0.007
'd+0.007
Neg logit contributions
edly-0.007
 least-0.007
efully-0.007
ful-0.006
 each-0.006
Tokenulu
Token ID15712
Feature activation+3.81e-02

Pos logit contributions
edly+0.016
 least+0.015
efully+0.015
ful+0.015
 each+0.013
Neg logit contributions
 may-0.011
 seem-0.011
 opener-0.010
 Win-0.010
 ramp-0.010
Token saw
Token ID2497
Feature activation-1.13e-02

Pos logit contributions
 may+0.049
 seem+0.048
 opener+0.047
 Win+0.046
 ramp+0.044
Neg logit contributions
edly-0.060
 least-0.058
efully-0.057
ful-0.054
 each-0.051
Token an
Token ID281
Feature activation+8.65e-03

Pos logit contributions
edly+0.019
efully+0.017
 former+0.017
 least+0.016
amb+0.014
Neg logit contributions
 seem-0.015
 may-0.014
 hop-0.014
 Win-0.013
 lik-0.013
Token exciting
Token ID7895
Feature activation+3.67e-03

Pos logit contributions
 seem+0.011
 hop+0.011
 may+0.011
 shoot+0.010
 opener+0.010
Neg logit contributions
edly-0.014
efully-0.012
ment-0.011
 least-0.011
 former-0.011
TokenThe
Token ID464
Feature activation+2.23e-03

Pos logit contributions
 may+0.003
 seem+0.003
 opener+0.003
 Win+0.003
 hop+0.002
Neg logit contributions
edly-0.003
 least-0.003
efully-0.003
ful-0.003
 former-0.003
Token two
Token ID734
Feature activation-4.25e-02

Pos logit contributions
 may+0.004
 seem+0.004
 Win+0.004
 opener+0.004
 hop+0.004
Neg logit contributions
edly-0.002
 least-0.002
efully-0.002
ful-0.002
 former-0.002
Token friends
Token ID2460
Feature activation-2.61e-02

Pos logit contributions
efully+0.026
edly+0.023
 least+0.021
 each+0.021
ful+0.020
Neg logit contributions
 may-0.096
 opener-0.094
 seem-0.094
 Win-0.091
 ramp-0.090
Token enjoyed
Token ID8359
Feature activation-3.70e-02

Pos logit contributions
edly+0.046
 least+0.042
efully+0.042
 former+0.041
ful+0.040
Neg logit contributions
 may-0.031
 seem-0.029
 Win-0.027
'd-0.027
 opener-0.026
Token walking
Token ID6155
Feature activation-4.60e-03

Pos logit contributions
edly+0.057
efully+0.053
 least+0.052
ful+0.049
 former+0.048
Neg logit contributions
 may-0.052
 seem-0.051
 Win-0.048
 opener-0.047
 hop-0.046
Token down
Token ID866
Feature activation-5.87e-02

Pos logit contributions
edly+0.007
 least+0.007
efully+0.006
ful+0.006
 former+0.006
Neg logit contributions
 may-0.007
 Win-0.006
 seem-0.006
 hop-0.006
 shoot-0.006
Token the
Token ID262
Feature activation-1.91e-02

Pos logit contributions
edly+0.110
efully+0.102
 former+0.098
 least+0.097
ful+0.096
Neg logit contributions
 may-0.064
 Win-0.064
 seem-0.063
 opener-0.060
 hop-0.056
Token hidden
Token ID7104
Feature activation-3.59e-02

Pos logit contributions
edly+0.027
 least+0.027
efully+0.026
 each+0.024
ful+0.023
Neg logit contributions
 may-0.027
 seem-0.027
 opener-0.026
 Win-0.025
ologists-0.024
Token path
Token ID3108
Feature activation-1.18e-02

Pos logit contributions
edly+0.066
efully+0.062
 least+0.062
ful+0.061
 former+0.056
Neg logit contributions
 may-0.040
 seem-0.036
 Win-0.035
 ramp-0.034
 hop-0.034
Token together
Token ID1978
Feature activation-2.69e-02

Pos logit contributions
edly+0.016
efully+0.016
 least+0.015
 former+0.014
ful+0.014
Neg logit contributions
 may-0.018
 seem-0.017
 Win-0.016
 opener-0.016
'd-0.016
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
edly+0.043
efully+0.042
 least+0.041
ful+0.038
 former+0.038
Neg logit contributions
 may-0.036
 seem-0.034
 opener-0.032
 Win-0.031
 hop-0.030
Token.
Token ID13
Feature activation-1.67e-02

Pos logit contributions
edly+0.034
 least+0.032
efully+0.032
ful+0.029
 former+0.029
Neg logit contributions
 may-0.035
 seem-0.034
 opener-0.032
 Win-0.032
 hop-0.030
Token He
Token ID679
Feature activation-1.59e-02

Pos logit contributions
 least+0.035
edly+0.034
efully+0.033
ful+0.032
 former+0.032
Neg logit contributions
 seem-0.014
 may-0.013
 opener-0.012
 Win-0.011
 lik-0.010
Token put
Token ID1234
Feature activation+5.80e-03

Pos logit contributions
edly+0.029
efully+0.027
 least+0.027
ful+0.026
 former+0.025
Neg logit contributions
 may-0.018
 seem-0.017
 opener-0.016
'd-0.016
 Win-0.016
Token on
Token ID319
Feature activation+7.72e-03

Pos logit contributions
 may+0.007
 seem+0.006
 Win+0.006
 opener+0.006
 hop+0.006
Neg logit contributions
edly-0.011
 former-0.010
 least-0.010
der-0.010
efully-0.010
Token his
Token ID465
Feature activation-2.89e-02

Pos logit contributions
 seem+0.009
 may+0.009
 Win+0.009
 opener+0.008
 hop+0.008
Neg logit contributions
edly-0.013
 least-0.013
efully-0.012
der-0.012
ful-0.012
Token brave
Token ID14802
Feature activation-3.10e-02

Pos logit contributions
efully+0.051
ful+0.049
edly+0.049
 least+0.047
der+0.046
Neg logit contributions
 may-0.035
 seem-0.034
 hop-0.034
 shoot-0.032
 roar-0.031
Token face
Token ID1986
Feature activation-1.99e-02

Pos logit contributions
edly+0.049
 least+0.047
efully+0.047
ful+0.044
 former+0.042
Neg logit contributions
 may-0.041
 seem-0.041
 Win-0.039
 hop-0.038
 opener-0.037
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
efully+0.038
 least+0.036
 yourselves+0.034
 former+0.034
edly+0.033
Neg logit contributions
 may-0.021
 opener-0.019
 seem-0.019
 hop-0.017
ches-0.017
Token marched
Token ID23558
Feature activation+2.33e-03

Pos logit contributions
edly+0.021
efully+0.021
 least+0.021
 yourselves+0.019
ful+0.019
Neg logit contributions
 may-0.010
 shoot-0.009
 hop-0.009
 seem-0.008
 opener-0.008
Token to
Token ID284
Feature activation+2.12e-03

Pos logit contributions
 Win+0.003
 seem+0.003
 may+0.003
 opener+0.003
 shoot+0.003
Neg logit contributions
 least-0.003
edly-0.003
efully-0.003
 former-0.003
ful-0.003
Token the
Token ID262
Feature activation-1.97e-03

Pos logit contributions
 seem+0.002
 Win+0.002
 shoot+0.002
 solve+0.002
 opener+0.002
Neg logit contributions
edly-0.004
efully-0.004
der-0.004
elt-0.004
 least-0.004
Token the
Token ID262
Feature activation-8.25e-03

Pos logit contributions
edly+0.015
efully+0.014
 least+0.014
ful+0.013
 former+0.013
Neg logit contributions
 may-0.014
 seem-0.014
 Win-0.013
 opener-0.013
 hop-0.012
Token forest
Token ID8222
Feature activation+2.84e-03

Pos logit contributions
edly+0.017
ful+0.015
ment+0.015
der+0.015
efully+0.014
Neg logit contributions
 may-0.009
 hop-0.009
 might-0.008
 gra-0.008
 ramp-0.008
Token.
Token ID13
Feature activation-1.36e-02

Pos logit contributions
 may+0.004
 seem+0.004
 opener+0.004
 Win+0.003
 hop+0.003
Neg logit contributions
edly-0.004
efully-0.004
 least-0.004
ful-0.004
 former-0.004
Token The
Token ID383
Feature activation+1.81e-03

Pos logit contributions
edly+0.022
efully+0.021
 least+0.021
ful+0.020
 each+0.019
Neg logit contributions
 may-0.017
 seem-0.017
 opener-0.016
 Win-0.015
'd-0.015
Token fox
Token ID21831
Feature activation+4.12e-03

Pos logit contributions
 seem+0.002
 may+0.002
 Win+0.002
 opener+0.002
'd+0.002
Neg logit contributions
edly-0.003
 least-0.003
efully-0.003
ful-0.003
 former-0.003
Token wanted
Token ID2227
Feature activation-4.49e-02

Pos logit contributions
 may+0.005
 seem+0.005
 opener+0.004
 Win+0.004
'd+0.004
Neg logit contributions
edly-0.007
 least-0.007
efully-0.007
ful-0.007
 former-0.006
Token to
Token ID284
Feature activation+1.50e-02

Pos logit contributions
edly+0.069
efully+0.065
 least+0.060
 former+0.059
ful+0.056
Neg logit contributions
 may-0.067
 seem-0.066
 Win-0.062
 opener-0.061
 hop-0.058
Token eat
Token ID4483
Feature activation-7.47e-03

Pos logit contributions
 seem+0.013
 may+0.013
 opener+0.012
 Win+0.012
 hop+0.012
Neg logit contributions
edly-0.032
efully-0.030
 least-0.029
ful-0.028
der-0.027
Token the
Token ID262
Feature activation+2.08e-03

Pos logit contributions
edly+0.012
efully+0.011
 least+0.011
 former+0.010
ful+0.010
Neg logit contributions
 may-0.011
 seem-0.010
 Win-0.010
 opener-0.010
 hop-0.010
Token b
Token ID275
Feature activation-4.66e-02

Pos logit contributions
 may+0.003
 seem+0.003
 opener+0.003
 Win+0.003
'd+0.003
Neg logit contributions
edly-0.003
 least-0.002
efully-0.002
ful-0.002
 former-0.002
Tokenerry
Token ID6996
Feature activation-6.68e-03

Pos logit contributions
edly+0.067
 least+0.065
efully+0.065
 former+0.059
ful+0.058
Neg logit contributions
 may-0.067
 seem-0.067
 opener-0.064
 Win-0.061
ches-0.060
Token are
Token ID389
Feature activation+7.69e-03

Pos logit contributions
 opener+0.035
ves+0.033
ologists+0.032
 may+0.032
 ramp+0.032
Neg logit contributions
 each-0.045
 least-0.045
efully-0.045
edly-0.044
ful-0.039
Token scared
Token ID12008
Feature activation+3.68e-02

Pos logit contributions
 may+0.012
 seem+0.011
 opener+0.011
'd+0.010
 Win+0.010
Neg logit contributions
edly-0.010
efully-0.010
 least-0.010
ful-0.010
 each-0.010
Token.
Token ID13
Feature activation+2.82e-02

Pos logit contributions
 seem+0.044
 may+0.042
 hop+0.038
 Win+0.038
 opener+0.037
Neg logit contributions
edly-0.064
 former-0.063
efully-0.062
 least-0.062
rous-0.053
Token They
Token ID1119
Feature activation+2.95e-02

Pos logit contributions
 may+0.039
 seem+0.038
 opener+0.036
'd+0.035
 Win+0.034
Neg logit contributions
edly-0.043
efully-0.041
 least-0.039
 each-0.038
ful-0.037
Token hold
Token ID1745
Feature activation+3.28e-03

Pos logit contributions
 opener+0.035
ologists+0.034
ves+0.032
ver+0.030
 Win+0.030
Neg logit contributions
 each-0.048
efully-0.047
 least-0.047
edly-0.045
 then-0.041
Token Mom
Token ID11254
Feature activation-2.30e-02

Pos logit contributions
 may+0.006
 seem+0.006
 Win+0.006
 opener+0.005
 hop+0.005
Neg logit contributions
edly-0.004
 least-0.003
efully-0.003
ful-0.003
 former-0.003
Token's
Token ID338
Feature activation+1.87e-02

Pos logit contributions
ful+0.031
edly+0.031
 least+0.031
 each+0.031
efully+0.030
Neg logit contributions
 may-0.035
 seem-0.033
 opener-0.032
 Win-0.032
 ramp-0.031
Token hand
Token ID1021
Feature activation+2.94e-02

Pos logit contributions
 may+0.031
 seem+0.030
 opener+0.029
 Win+0.029
 hop+0.027
Neg logit contributions
edly-0.024
 least-0.022
efully-0.022
ful-0.021
 former-0.020
Token and
Token ID290
Feature activation+2.88e-02

Pos logit contributions
 may+0.041
 seem+0.039
 opener+0.038
 Win+0.037
 hop+0.037
Neg logit contributions
 least-0.042
edly-0.042
efully-0.042
 former-0.039
ful-0.037
Token hide
Token ID7808
Feature activation+3.26e-02

Pos logit contributions
 may+0.027
 seem+0.025
 opener+0.023
 Win+0.023
 hop+0.021
Neg logit contributions
edly-0.058
efully-0.055
 least-0.055
ful-0.053
 former-0.051
Token behind
Token ID2157
Feature activation+1.02e-02

Pos logit contributions
 may+0.041
 seem+0.040
 Win+0.038
 opener+0.037
 hop+0.036
Neg logit contributions
 least-0.052
edly-0.050
efully-0.050
 former-0.049
ful-0.044
TokenMaybe
Token ID13300
Feature activation-5.41e-02

Pos logit contributions
 least+0.052
ful+0.050
efully+0.048
 former+0.047
edly+0.047
Neg logit contributions
 may-0.036
 seem-0.035
 opener-0.034
 Win-0.033
'd-0.032
Token."
Token ID526
Feature activation+3.13e-02

Pos logit contributions
edly+0.113
efully+0.107
 former+0.099
 least+0.099
years+0.099
Neg logit contributions
 may-0.054
 seem-0.050
 Win-0.045
 opener-0.044
 might-0.044
Token Tom
Token ID4186
Feature activation+1.08e-02

Pos logit contributions
 may+0.029
 seem+0.028
 Win+0.026
 opener+0.025
 hop+0.022
Neg logit contributions
edly-0.065
efully-0.063
 least-0.060
ful-0.059
 former-0.058
Token said
Token ID531
Feature activation+7.89e-02

Pos logit contributions
 may+0.013
 seem+0.013
 opener+0.012
 Win+0.012
 hop+0.011
Neg logit contributions
edly-0.018
 least-0.018
efully-0.017
 former-0.017
ful-0.016
Token.
Token ID13
Feature activation+3.59e-02

Pos logit contributions
 may+0.123
 seem+0.116
 Win+0.112
 opener+0.110
'd+0.106
Neg logit contributions
edly-0.112
efully-0.101
 least-0.100
ful-0.100
 each-0.092
Token "
Token ID366
Feature activation-3.02e-02

Pos logit contributions
 opener+0.034
 may+0.034
 seem+0.033
 Win+0.031
 ramp+0.029
Neg logit contributions
 least-0.070
edly-0.069
efully-0.069
 former-0.065
ful-0.064
TokenBut
Token ID1537
Feature activation+1.61e-02

Pos logit contributions
 least+0.043
efully+0.041
 former+0.040
ful+0.039
edly+0.039
Neg logit contributions
 seem-0.044
 opener-0.042
 may-0.040
'd-0.039
 Win-0.038
Token only
Token ID691
Feature activation-1.15e-02

Pos logit contributions
 may+0.021
 seem+0.019
 opener+0.017
 Win+0.017
 might+0.017
Neg logit contributions
edly-0.029
efully-0.027
 former-0.025
view-0.025
 least-0.024
Token if
Token ID611
Feature activation+8.46e-03

Pos logit contributions
edly+0.022
efully+0.021
amed+0.018
view+0.018
years+0.018
Neg logit contributions
 may-0.014
 seem-0.014
 shoot-0.013
 hop-0.013
 move-0.012
Token we
Token ID356
Feature activation-8.66e-04

Pos logit contributions
 may+0.011
 seem+0.010
 Win+0.009
 might+0.009
 opener+0.009
Neg logit contributions
edly-0.015
efully-0.014
 least-0.014
 former-0.013
ful-0.013
Token clean
Token ID3424
Feature activation+7.73e-03

Pos logit contributions
edly+0.002
efully+0.002
 least+0.002
ful+0.002
 former+0.002
Neg logit contributions
 may-0.001
 seem-0.001
 opener-0.001
 Win-0.001
 hop-0.001
Token could
Token ID714
Feature activation-2.82e-05

Pos logit contributions
 may+0.006
 seem+0.006
'd+0.005
 opener+0.005
 Win+0.004
Neg logit contributions
edly-0.020
efully-0.019
 least-0.019
 former-0.018
ful-0.018
Token.
Token ID13
Feature activation-1.96e-02

Pos logit contributions
edly+0.000
 least+0.000
efully+0.000
ful+0.000
 each+0.000
Neg logit contributions
 may-0.000
 seem-0.000
 Win-0.000
 opener-0.000
 lik-0.000
Token
Token ID198
Feature activation-2.17e-02

Pos logit contributions
edly+0.035
 least+0.033
efully+0.032
ful+0.031
 former+0.030
Neg logit contributions
 may-0.023
 seem-0.022
 Win-0.021
 opener-0.020
 hop-0.019
Token
Token ID198
Feature activation-2.18e-02

Pos logit contributions
edly+0.032
efully+0.030
 least+0.030
ful+0.029
 former+0.027
Neg logit contributions
 may-0.031
 seem-0.030
 opener-0.028
 Win-0.028
 ramp-0.027
TokenThe
Token ID464
Feature activation-2.24e-03

Pos logit contributions
edly+0.029
 least+0.027
efully+0.027
ful+0.026
 former+0.024
Neg logit contributions
 may-0.034
 seem-0.033
 opener-0.032
 Win-0.032
 hop-0.030
Token moral
Token ID6573
Feature activation-9.01e-02

Pos logit contributions
edly+0.003
efully+0.003
ful+0.003
 least+0.003
 former+0.003
Neg logit contributions
 may-0.003
 seem-0.003
 opener-0.003
 Win-0.003
 hop-0.003
Token of
Token ID286
Feature activation-2.96e-02

Pos logit contributions
edly+0.170
 least+0.160
efully+0.159
 former+0.149
ful+0.146
Neg logit contributions
 may-0.100
 seem-0.095
 Win-0.093
 opener-0.091
 hop-0.080
Token the
Token ID262
Feature activation-3.67e-02

Pos logit contributions
edly+0.057
efully+0.050
 former+0.049
der+0.048
elt+0.046
Neg logit contributions
 may-0.033
 seem-0.030
 Win-0.030
 hop-0.029
 opener-0.029
Token story
Token ID1621
Feature activation-3.88e-02

Pos logit contributions
 least+0.065
edly+0.063
efully+0.062
 example+0.060
ful+0.060
Neg logit contributions
 opener-0.034
 may-0.034
 seem-0.033
 Win-0.032
 ramp-0.032
Token is
Token ID318
Feature activation-3.44e-02

Pos logit contributions
edly+0.080
efully+0.076
 least+0.076
ful+0.073
 former+0.071
Neg logit contributions
 may-0.035
 seem-0.032
 opener-0.029
 Win-0.029
'd-0.026
Token that
Token ID326
Feature activation+1.48e-02

Pos logit contributions
edly+0.038
efully+0.034
 least+0.029
 former+0.028
view+0.025
Neg logit contributions
 may-0.068
 seem-0.065
 Win-0.062
 opener-0.062
 hop-0.059
Token.
Token ID13
Feature activation-1.59e-02

Pos logit contributions
edly+0.057
efully+0.053
 least+0.049
 former+0.049
ful+0.047
Neg logit contributions
 may-0.056
 seem-0.055
 opener-0.053
 Win-0.051
 hop-0.050
Token
Token ID220
Feature activation-1.26e-02

Pos logit contributions
edly+0.029
 least+0.028
efully+0.028
ful+0.027
 former+0.026
Neg logit contributions
 may-0.018
 seem-0.017
 Win-0.016
 opener-0.015
 hop-0.014
Token
Token ID198
Feature activation-2.07e-02

Pos logit contributions
edly+0.017
efully+0.016
 least+0.016
ful+0.015
 each+0.014
Neg logit contributions
 may-0.020
 seem-0.019
 Win-0.018
 opener-0.018
'd-0.017
Token
Token ID198
Feature activation-2.05e-02

Pos logit contributions
edly+0.029
efully+0.027
 least+0.027
ful+0.026
 each+0.024
Neg logit contributions
 may-0.031
 seem-0.030
 Win-0.028
 opener-0.028
 ramp-0.027
TokenThe
Token ID464
Feature activation-6.09e-03

Pos logit contributions
edly+0.031
 least+0.029
efully+0.029
ful+0.027
 former+0.026
Neg logit contributions
 may-0.029
 seem-0.028
 opener-0.027
 Win-0.027
 hop-0.025
Token moral
Token ID6573
Feature activation-9.15e-02

Pos logit contributions
edly+0.010
efully+0.009
 least+0.009
ful+0.009
 former+0.008
Neg logit contributions
 may-0.009
 seem-0.008
 Win-0.008
 opener-0.008
 ramp-0.007
Token of
Token ID286
Feature activation-3.51e-02

Pos logit contributions
edly+0.179
 least+0.168
efully+0.168
 former+0.158
ful+0.152
Neg logit contributions
 may-0.098
 seem-0.091
 Win-0.090
 opener-0.087
 might-0.080
Token the
Token ID262
Feature activation-4.10e-02

Pos logit contributions
edly+0.060
efully+0.054
 former+0.052
 least+0.049
der+0.049
Neg logit contributions
 may-0.047
 seem-0.044
 Win-0.042
 opener-0.041
ches-0.039
Token story
Token ID1621
Feature activation-3.73e-02

Pos logit contributions
 least+0.074
edly+0.071
 example+0.069
efully+0.068
ful+0.068
Neg logit contributions
 opener-0.039
 may-0.037
 seem-0.036
 lik-0.034
ologists-0.033
Token is
Token ID318
Feature activation-3.64e-02

Pos logit contributions
edly+0.082
efully+0.078
 least+0.078
 former+0.074
ful+0.074
Neg logit contributions
 may-0.031
 seem-0.027
 opener-0.023
 Win-0.023
'd-0.021
Token:
Token ID25
Feature activation-3.93e-02

Pos logit contributions
edly+0.039
efully+0.036
 least+0.031
 former+0.029
ful+0.027
Neg logit contributions
 may-0.072
 seem-0.069
 Win-0.066
 opener-0.065
 hop-0.062
Token,
Token ID11
Feature activation+4.87e-03

Pos logit contributions
edly+0.000
efully+0.000
 least+0.000
 former+0.000
ful+0.000
Neg logit contributions
 may-0.000
 seem-0.000
 opener-0.000
 Win-0.000
 hop-0.000
Token caught
Token ID4978
Feature activation+6.69e-03

Pos logit contributions
 may+0.006
 seem+0.006
 Win+0.006
 opener+0.006
 hop+0.006
Neg logit contributions
edly-0.008
efully-0.007
 least-0.007
ful-0.007
 former-0.007
Token bugs
Token ID11316
Feature activation+1.54e-02

Pos logit contributions
 may+0.009
 Win+0.009
 seem+0.009
 hop+0.008
 shoot+0.008
Neg logit contributions
edly-0.010
 least-0.010
efully-0.009
 former-0.009
ful-0.009
Token,
Token ID11
Feature activation+1.89e-02

Pos logit contributions
 may+0.023
 seem+0.022
 opener+0.021
 Win+0.020
 hop+0.020
Neg logit contributions
edly-0.022
efully-0.021
 least-0.021
 former-0.019
ful-0.019
Token and
Token ID290
Feature activation-4.05e-03

Pos logit contributions
 may+0.025
 seem+0.025
 hop+0.023
 opener+0.022
 Win+0.022
Neg logit contributions
efully-0.030
edly-0.030
 least-0.028
ful-0.028
 former-0.027
Token even
Token ID772
Feature activation-9.11e-02

Pos logit contributions
edly+0.007
efully+0.007
 least+0.007
ful+0.006
amb+0.006
Neg logit contributions
 seem-0.005
 hop-0.005
 may-0.005
 shoot-0.005
'd-0.004
Token found
Token ID1043
Feature activation-3.31e-02

Pos logit contributions
edly+0.153
efully+0.148
 least+0.140
amb+0.129
ful+0.129
Neg logit contributions
 seem-0.119
 hop-0.114
 may-0.112
 shoot-0.110
 opener-0.108
Token a
Token ID257
Feature activation-9.12e-03

Pos logit contributions
edly+0.047
 least+0.044
efully+0.042
 former+0.040
ths+0.038
Neg logit contributions
 may-0.051
 might-0.046
 seem-0.046
 hop-0.045
 Win-0.045
Token hidden
Token ID7104
Feature activation-1.48e-02

Pos logit contributions
edly+0.015
efully+0.014
 least+0.014
ful+0.013
 former+0.012
Neg logit contributions
 may-0.012
 seem-0.012
 opener-0.011
 Win-0.011
 ramp-0.011
Token treasure
Token ID14068
Feature activation-2.60e-02

Pos logit contributions
edly+0.028
efully+0.027
 least+0.025
ful+0.025
years+0.024
Neg logit contributions
 may-0.016
 opener-0.014
 ramp-0.014
 hop-0.014
 seem-0.013
Token.
Token ID13
Feature activation-2.25e-02

Pos logit contributions
edly+0.041
efully+0.041
 least+0.038
ful+0.037
 former+0.036
Neg logit contributions
 may-0.035
 opener-0.032
 seem-0.032
 Win-0.031
 ramp-0.030
Token let
Token ID1309
Feature activation+5.67e-02

Pos logit contributions
edly+0.184
efully+0.174
 least+0.167
ful+0.162
der+0.156
Neg logit contributions
 may-0.085
 seem-0.084
 hop-0.076
 shoot-0.074
 Win-0.072
Token Lily
Token ID20037
Feature activation-2.22e-02

Pos logit contributions
 may+0.086
 seem+0.083
 Win+0.078
 opener+0.078
 hop+0.074
Neg logit contributions
edly-0.081
efully-0.075
 least-0.075
ful-0.071
 former-0.067
Token ride
Token ID6594
Feature activation-2.62e-02

Pos logit contributions
edly+0.050
efully+0.047
 least+0.045
ful+0.044
der+0.043
Neg logit contributions
 seem-0.018
 may-0.018
 hop-0.016
 shoot-0.015
 Win-0.015
Token on
Token ID319
Feature activation-1.57e-02

Pos logit contributions
 least+0.043
edly+0.043
efully+0.041
 former+0.040
elt+0.037
Neg logit contributions
 may-0.032
 seem-0.031
 Win-0.031
 hop-0.029
ves-0.029
Token its
Token ID663
Feature activation-3.60e-02

Pos logit contributions
edly+0.030
efully+0.027
elt+0.027
 least+0.026
amed+0.025
Neg logit contributions
 may-0.018
 seem-0.017
 might-0.016
 Win-0.016
 hop-0.015
Token back
Token ID736
Feature activation-9.65e-02

Pos logit contributions
efully+0.061
edly+0.061
ful+0.059
 least+0.056
years+0.055
Neg logit contributions
 may-0.045
 hop-0.044
 shoot-0.041
 roar-0.040
 ramp-0.040
Token.
Token ID13
Feature activation-2.40e-02

Pos logit contributions
edly+0.202
years+0.185
efully+0.180
 least+0.175
ths+0.170
Neg logit contributions
 hop-0.089
 may-0.088
 ramp-0.081
 might-0.079
 bounce-0.077
Token And
Token ID843
Feature activation-4.19e-02

Pos logit contributions
edly+0.042
efully+0.040
 least+0.038
ful+0.037
 former+0.035
Neg logit contributions
 may-0.030
 seem-0.028
 Win-0.027
 opener-0.026
 lik-0.025
Token whenever
Token ID8797
Feature activation-9.65e-03

Pos logit contributions
edly+0.078
efully+0.074
 least+0.068
 former+0.066
 yourselves+0.064
Neg logit contributions
 may-0.050
 seem-0.048
 Win-0.046
 hop-0.041
 opener-0.040
Token Lily
Token ID20037
Feature activation-3.59e-02

Pos logit contributions
edly+0.016
 least+0.015
efully+0.015
ful+0.014
 former+0.014
Neg logit contributions
 may-0.012
 seem-0.012
 opener-0.011
 Win-0.011
 hop-0.010
Token felt
Token ID2936
Feature activation-2.02e-02

Pos logit contributions
edly+0.058
 least+0.057
efully+0.055
ful+0.052
 former+0.050
Neg logit contributions
 may-0.044
 seem-0.043
 opener-0.041
 Win-0.041
 ramp-0.038
Token a
Token ID257
Feature activation-1.97e-03

Pos logit contributions
 may+0.071
 seem+0.070
 Win+0.062
 opener+0.061
 hop+0.059
Neg logit contributions
edly-0.095
efully-0.091
 least-0.088
 former-0.082
ful-0.082
Token word
Token ID1573
Feature activation+1.41e-03

Pos logit contributions
edly+0.004
efully+0.003
 least+0.003
ful+0.003
 former+0.003
Neg logit contributions
 may-0.002
 seem-0.002
 opener-0.002
 Win-0.002
 hop-0.002
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 may+0.002
 seem+0.002
 opener+0.002
 Win+0.002
 lik+0.002
Neg logit contributions
edly-0.002
 least-0.002
ful-0.002
efully-0.002
 each-0.002
Token After
Token ID2293
Feature activation-2.12e-02

Pos logit contributions
edly+0.025
 least+0.025
efully+0.024
ful+0.024
 former+0.023
Neg logit contributions
 may-0.007
 seem-0.006
 Win-0.006
 opener-0.006
'd-0.005
Token a
Token ID257
Feature activation-3.89e-02

Pos logit contributions
efully+0.032
edly+0.031
 least+0.029
 former+0.028
view+0.026
Neg logit contributions
 seem-0.031
 may-0.030
 hop-0.028
 opener-0.028
 Win-0.028
Token few
Token ID1178
Feature activation-9.99e-02

Pos logit contributions
edly+0.058
efully+0.055
 least+0.055
ful+0.051
 former+0.049
Neg logit contributions
 may-0.057
 seem-0.054
 opener-0.052
 Win-0.051
 hop-0.049
Token moments
Token ID7188
Feature activation-3.97e-02

Pos logit contributions
edly+0.112
 least+0.108
efully+0.105
ful+0.100
 former+0.099
Neg logit contributions
 may-0.182
 seem-0.177
 hop-0.167
 Win-0.164
 opener-0.163
Token,
Token ID11
Feature activation-1.47e-02

Pos logit contributions
edly+0.064
 least+0.061
efully+0.060
ful+0.059
 each+0.055
Neg logit contributions
 may-0.051
 seem-0.049
 Win-0.046
 opener-0.046
 ramp-0.044
Token the
Token ID262
Feature activation+7.23e-03

Pos logit contributions
edly+0.032
efully+0.031
 least+0.029
ful+0.029
 former+0.028
Neg logit contributions
 may-0.013
 seem-0.011
 Win-0.010
 opener-0.010
 hop-0.009
Token voice
Token ID3809
Feature activation+9.30e-03

Pos logit contributions
 may+0.008
 seem+0.007
 opener+0.007
 hop+0.007
 Win+0.007
Neg logit contributions
edly-0.013
efully-0.013
 least-0.012
ful-0.012
 each-0.011
Token spoke
Token ID5158
Feature activation-8.25e-04

Pos logit contributions
 may+0.011
 seem+0.010
 Win+0.009
'd+0.009
 opener+0.009
Neg logit contributions
edly-0.017
 least-0.016
efully-0.016
ful-0.015
 former-0.015