TOP ACTIVATIONS
MAX = 0.097

always be careful about what she watches before bed . <|endoftext|>
she was sorry for what she did . He wished she
She wondered what it tasted like . She asked
She felt sorry for what she did . She did not
. She wrote about what she did , what she saw
she was sorry for what she did . He told her
. She can write whatever she wants . It is her
to be thankful for what she had . <|endoftext|> Once upon
was important to decide what she wanted before spending her money
She wondered what it tasted like . She opened the jar
She was sorry for what she did . She did not
could be happy with what she had . She
up . Nobody knew what she was like or what she
I wonder what it tastes like ." " I
He wondered what it tasted like . He opened his mouth
. She wrote about what she did , what she saw
her owners never knew what she was going to do next
, he made some money transactions . At the post office
very generous and shared whatever she had with people in need
She wondered what it tasted like . She climbed on a

MIN ACTIVATIONS
MIN = -0.165

and saw two birds qu arre ling ! They were squ
We have different likes and disl ikes . We can be
two squirrel s kept qu arre ling until finally Sammy said
other . They starting qu arre ling again and their voices
the mess and the qu arre ling twins . She is
lively sound . The qu arre ling stopped and they heard
them and never threw a tant rum again . <|endoftext|> Once
the parents were also qu arre ling . The first family
they just kept on qu arre ling !
box and saw a j igsaw puzzle inside . He paused
his feet and threw a tant rum . "
listen . They kept qu arre ling with their mom .
. They had stopped qu arre ling . To celebrate ,
sharing is important and qu arre lling is not good .
on , they never qu arre led again over clothes .
but today they were qu arre ling . Jenna wanted to
and their children , qu arre ling .
together . They never qu arre led again . <|endoftext|> Once
little birds that were qu arre ling . The
good friends and never qu arre led again . <|endoftext|> Once

INTERVAL 0.032 - 0.097
CONTAINS 4.542%

. She asked Tommy if she could march with him and
day . <|endoftext|> Once upon a time there was a little
a sign that something good was going to happen ." And
it . While she was eating , Jane noticed
not too much . <|endoftext|> Anna and Ben were playing in

INTERVAL -0.034 - 0.032
CONTAINS 92.088%

. They hugged him tightly and brought him over to the
it was as good as new . The girl
" Hello , dog . You are very loud ," Tom
her , but he could not . The ring was too
smaller ball of rock that goes around the earth ."

INTERVAL -0.099 - -0.034
CONTAINS 3.369%

a secret path and saw many different kinds of animals .
was shiny and red . It had big wheels and a
." The little boy smiled , gave the pilot
little girl who went on an adventure . She excited ly
high in the sky and it was very calm , too

INTERVAL -0.165 - -0.099
CONTAINS 0.000%

box and saw a j igsaw puzzle inside . He paused
they just kept on qu arre ling !
never happy . He qu arre led and yelled and sc
ended up in such bad situations . The
the mess and the qu arre ling twins . She is
Token always
Token ID1464
Feature activation+3.49e-02

Pos logit contributions
arre+0.008
 discoveries+0.008
inker+0.008
opter+0.008
beh+0.008
Neg logit contributions
 seven-0.006
 and-0.005
 The-0.004
 only-0.004
 or-0.004
Token be
Token ID307
Feature activation+3.22e-03

Pos logit contributions
 seven+0.057
 and+0.046
 only+0.040
 The+0.040
 like+0.039
Neg logit contributions
arre-0.079
 discoveries-0.075
beh-0.075
inker-0.074
opter-0.074
Token careful
Token ID8161
Feature activation-1.57e-02

Pos logit contributions
 seven+0.005
 and+0.004
 The+0.004
 only+0.004
 or+0.004
Neg logit contributions
arre-0.007
beh-0.007
 discoveries-0.006
oving-0.006
inker-0.006
Token about
Token ID546
Feature activation+3.65e-02

Pos logit contributions
arre+0.042
beh+0.040
 discoveries+0.040
 possessions+0.040
 situations+0.040
Neg logit contributions
 seven-0.020
 The-0.011
 and-0.011
 only-0.010
 like-0.009
Token what
Token ID644
Feature activation+8.22e-03

Pos logit contributions
 seven+0.055
 and+0.044
 only+0.038
 like+0.037
 The+0.037
Neg logit contributions
arre-0.088
 discoveries-0.083
inker-0.083
opter-0.083
beh-0.083
Token she
Token ID673
Feature activation+9.75e-02

Pos logit contributions
 seven+0.012
 and+0.010
 only+0.009
 like+0.009
 The+0.008
Neg logit contributions
arre-0.020
inker-0.019
opter-0.019
beh-0.019
 discoveries-0.019
Token watches
Token ID16860
Feature activation+4.34e-02

Pos logit contributions
 seven+0.155
 and+0.129
 only+0.113
 like+0.109
 The+0.109
Neg logit contributions
arre-0.226
beh-0.213
inker-0.213
opter-0.212
 discoveries-0.211
Token before
Token ID878
Feature activation+2.86e-02

Pos logit contributions
 seven+0.058
 only+0.032
 The+0.031
 also+0.027
 He+0.027
Neg logit contributions
arre-0.107
 situations-0.105
inker-0.104
 discoveries-0.101
beh-0.100
Token bed
Token ID3996
Feature activation+3.09e-02

Pos logit contributions
 seven+0.048
 and+0.036
 only+0.032
 like+0.032
 The+0.032
Neg logit contributions
arre-0.065
 discoveries-0.062
 situations-0.062
inker-0.061
opter-0.061
Token.
Token ID13
Feature activation-9.33e-04

Pos logit contributions
 seven+0.034
 and+0.018
 only+0.016
 The+0.016
 like+0.015
Neg logit contributions
arre-0.089
inker-0.085
 situations-0.084
opter-0.084
beh-0.083
Token<|endoftext|>
Token ID50256
Feature activation-2.97e-02

Pos logit contributions
arre+0.003
 discoveries+0.002
opter+0.002
beh+0.002
inker+0.002
Neg logit contributions
 seven-0.001
 and-0.001
 only-0.001
 like-0.001
 or-0.001
Token she
Token ID673
Feature activation+4.30e-02

Pos logit contributions
 seven+0.021
 and+0.015
 only+0.013
 The+0.013
 like+0.013
Neg logit contributions
arre-0.032
inker-0.032
 discoveries-0.031
opter-0.031
beh-0.031
Token was
Token ID373
Feature activation+2.09e-02

Pos logit contributions
 seven+0.056
 and+0.049
 only+0.042
 like+0.040
 still+0.039
Neg logit contributions
arre-0.111
opter-0.106
beh-0.105
inker-0.105
 situations-0.105
Token sorry
Token ID7926
Feature activation+9.20e-03

Pos logit contributions
 seven+0.033
 and+0.024
 The+0.021
 or+0.019
 only+0.018
Neg logit contributions
arre-0.047
beh-0.046
 discoveries-0.044
inker-0.044
oving-0.043
Token for
Token ID329
Feature activation+9.63e-03

Pos logit contributions
 seven+0.013
 The+0.006
 only+0.006
 eight+0.005
 eighty+0.005
Neg logit contributions
arre-0.023
 discoveries-0.022
 situations-0.022
 possessions-0.022
inker-0.022
Token what
Token ID644
Feature activation+1.56e-02

Pos logit contributions
 seven+0.016
 and+0.013
 The+0.012
 only+0.012
 like+0.011
Neg logit contributions
arre-0.021
inker-0.020
 discoveries-0.020
beh-0.020
 situations-0.020
Token she
Token ID673
Feature activation+9.71e-02

Pos logit contributions
 seven+0.022
 and+0.018
 only+0.016
 like+0.015
 The+0.015
Neg logit contributions
arre-0.039
beh-0.037
inker-0.037
 discoveries-0.036
opter-0.036
Token did
Token ID750
Feature activation+5.39e-02

Pos logit contributions
 seven+0.131
 and+0.110
 only+0.094
 like+0.091
 The+0.087
Neg logit contributions
arre-0.248
opter-0.236
inker-0.235
beh-0.235
 discoveries-0.233
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 seven+0.047
 and+0.018
 The+0.018
 only+0.016
 like+0.011
Neg logit contributions
arre-0.166
inker-0.159
 discoveries-0.159
beh-0.155
oving-0.155
Token He
Token ID679
Feature activation+1.35e-02

Pos logit contributions
 seven+0.017
 only+0.010
 and+0.009
 like+0.009
 still+0.009
Neg logit contributions
arre-0.028
inker-0.028
 discoveries-0.027
 situations-0.027
rums-0.026
Token wished
Token ID16555
Feature activation-9.07e-03

Pos logit contributions
 seven+0.026
 and+0.020
 The+0.020
 like+0.019
 or+0.019
Neg logit contributions
arre-0.026
beh-0.023
 discoveries-0.023
inker-0.023
opter-0.023
Token she
Token ID673
Feature activation+6.83e-02

Pos logit contributions
 discoveries+0.018
arre+0.018
ert+0.018
inker+0.018
 possessions+0.017
Neg logit contributions
 seven-0.016
 and-0.011
 only-0.010
 like-0.010
 The-0.010
Token She
Token ID1375
Feature activation+1.53e-02

Pos logit contributions
 seven+0.003
 and+0.002
 only+0.001
 like+0.001
 The+0.001
Neg logit contributions
arre-0.008
inker-0.008
opter-0.008
beh-0.008
 discoveries-0.008
Token wondered
Token ID14028
Feature activation-3.99e-03

Pos logit contributions
 seven+0.026
 and+0.021
 The+0.019
 like+0.019
 only+0.019
Neg logit contributions
arre-0.033
 discoveries-0.031
opter-0.030
beh-0.030
inker-0.030
Token what
Token ID644
Feature activation+1.94e-02

Pos logit contributions
arre+0.007
 discoveries+0.007
 possessions+0.007
inker+0.007
beh+0.006
Neg logit contributions
 seven-0.008
 and-0.007
 The-0.006
 only-0.006
 He-0.006
Token it
Token ID340
Feature activation+3.47e-02

Pos logit contributions
 seven+0.031
 and+0.023
 like+0.021
 only+0.020
 The+0.020
Neg logit contributions
 discoveries-0.042
arre-0.041
 possessions-0.041
opter-0.041
 situations-0.040
Token tasted
Token ID29187
Feature activation+4.43e-02

Pos logit contributions
 seven+0.060
 and+0.047
 The+0.042
 only+0.042
 like+0.040
Neg logit contributions
arre-0.075
 discoveries-0.071
inker-0.070
beh-0.070
opter-0.070
Token like
Token ID588
Feature activation+9.66e-02

Pos logit contributions
 seven+0.056
 The+0.038
 Leo+0.031
 eight+0.029
 driver+0.029
Neg logit contributions
arre-0.108
ade-0.107
 discoveries-0.104
angle-0.102
inker-0.101
Token.
Token ID13
Feature activation+1.44e-03

Pos logit contributions
 seven+0.122
 He+0.075
 driver+0.074
 The+0.073
 only+0.072
Neg logit contributions
 possessions-0.224
 discoveries-0.215
ade-0.214
 situations-0.213
 discovery-0.210
Token
Token ID198
Feature activation-9.23e-03

Pos logit contributions
 seven+0.001
 and+0.001
 only+0.001
 like+0.001
 The+0.001
Neg logit contributions
arre-0.004
opter-0.004
beh-0.004
inker-0.004
 discoveries-0.004
Token
Token ID198
Feature activation+4.23e-03

Pos logit contributions
arre+0.029
beh+0.028
opter+0.028
 discoveries+0.028
inker+0.028
Neg logit contributions
 seven-0.007
 and-0.005
 The-0.003
 only-0.003
 like-0.003
TokenShe
Token ID3347
Feature activation+3.21e-02

Pos logit contributions
 seven+0.007
 and+0.006
 only+0.005
 still+0.005
 like+0.005
Neg logit contributions
arre-0.010
 discoveries-0.010
 situations-0.009
opter-0.009
beh-0.009
Token asked
Token ID1965
Feature activation-2.68e-02

Pos logit contributions
 seven+0.056
 and+0.044
 like+0.044
 The+0.044
 or+0.041
Neg logit contributions
arre-0.065
 discoveries-0.058
inker-0.058
beh-0.058
oving-0.057
Token She
Token ID1375
Feature activation+1.15e-02

Pos logit contributions
 seven+0.010
 and+0.006
 like+0.005
 only+0.005
 still+0.005
Neg logit contributions
arre-0.023
inker-0.022
 situations-0.022
 discoveries-0.022
opter-0.021
Token felt
Token ID2936
Feature activation+8.22e-03

Pos logit contributions
 seven+0.021
 and+0.016
 like+0.015
 The+0.015
 only+0.015
Neg logit contributions
arre-0.024
inker-0.022
opter-0.022
 discoveries-0.022
beh-0.021
Token sorry
Token ID7926
Feature activation-9.22e-04

Pos logit contributions
 seven+0.012
 and+0.010
 only+0.008
 The+0.008
 like+0.008
Neg logit contributions
arre-0.020
 discoveries-0.018
beh-0.018
 situations-0.018
inker-0.018
Token for
Token ID329
Feature activation+4.98e-03

Pos logit contributions
arre+0.003
 discoveries+0.003
inker+0.003
 situations+0.003
opter+0.002
Neg logit contributions
 seven-0.001
 and-0.000
 only-0.000
 The-0.000
 like-0.000
Token what
Token ID644
Feature activation+1.30e-02

Pos logit contributions
 seven+0.008
 and+0.006
 The+0.006
 like+0.005
 only+0.005
Neg logit contributions
arre-0.011
inker-0.011
 discoveries-0.010
beh-0.010
 situations-0.010
Token she
Token ID673
Feature activation+9.62e-02

Pos logit contributions
 seven+0.016
 and+0.014
 only+0.013
 like+0.012
 The+0.012
Neg logit contributions
arre-0.034
opter-0.033
inker-0.033
beh-0.033
 discoveries-0.032
Token did
Token ID750
Feature activation+5.52e-02

Pos logit contributions
 seven+0.132
 and+0.106
 only+0.091
 like+0.087
 The+0.087
Neg logit contributions
arre-0.245
inker-0.231
beh-0.231
opter-0.231
 discoveries-0.230
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
 seven+0.043
 and+0.019
 The+0.015
 only+0.015
 like+0.011
Neg logit contributions
arre-0.174
inker-0.167
 discoveries-0.166
 situations-0.164
opter-0.164
Token She
Token ID1375
Feature activation+2.73e-02

Pos logit contributions
 seven+0.018
 and+0.012
 like+0.010
 only+0.010
 still+0.009
Neg logit contributions
arre-0.035
inker-0.032
 discoveries-0.032
 situations-0.032
opter-0.031
Token did
Token ID750
Feature activation+1.46e-02

Pos logit contributions
 seven+0.053
 and+0.040
 The+0.040
 like+0.039
 or+0.038
Neg logit contributions
arre-0.050
 discoveries-0.045
opter-0.044
beh-0.044
oving-0.044
Token not
Token ID407
Feature activation-7.14e-04

Pos logit contributions
 seven+0.018
 and+0.013
 only+0.010
 The+0.010
 like+0.010
Neg logit contributions
arre-0.039
beh-0.037
 discoveries-0.037
inker-0.037
oving-0.037
Token.
Token ID13
Feature activation+3.32e-03

Pos logit contributions
 seven+0.015
 only+0.008
 still+0.008
 and+0.007
 The+0.007
Neg logit contributions
 situations-0.032
 discoveries-0.032
arre-0.032
making-0.031
 possessions-0.031
Token She
Token ID1375
Feature activation+1.77e-02

Pos logit contributions
 seven+0.005
 and+0.003
 still+0.003
 then+0.003
 only+0.003
Neg logit contributions
arre-0.007
 situations-0.007
igsaw-0.007
 discoveries-0.007
irs-0.007
Token wrote
Token ID2630
Feature activation+1.94e-02

Pos logit contributions
 seven+0.036
 The+0.029
 and+0.027
 Leo+0.027
 like+0.026
Neg logit contributions
opter-0.028
arre-0.027
making-0.026
 possessions-0.026
 discoveries-0.025
Token about
Token ID546
Feature activation+2.89e-02

Pos logit contributions
 seven+0.026
 and+0.018
 only+0.018
 The+0.017
 still+0.016
Neg logit contributions
arre-0.046
inker-0.045
 discoveries-0.044
opter-0.044
 possessions-0.043
Token what
Token ID644
Feature activation+1.09e-02

Pos logit contributions
 seven+0.043
 and+0.036
 only+0.031
 like+0.030
 The+0.029
Neg logit contributions
arre-0.070
beh-0.066
opter-0.066
inker-0.066
 discoveries-0.066
Token she
Token ID673
Feature activation+9.59e-02

Pos logit contributions
 seven+0.018
 and+0.014
 only+0.013
 like+0.012
 The+0.012
Neg logit contributions
arre-0.024
 discoveries-0.023
beh-0.023
opter-0.023
inker-0.023
Token did
Token ID750
Feature activation+5.75e-02

Pos logit contributions
 seven+0.157
 and+0.119
 The+0.111
 only+0.105
 or+0.102
Neg logit contributions
arre-0.204
 discoveries-0.197
inker-0.197
beh-0.194
opter-0.194
Token,
Token ID11
Feature activation+4.44e-03

Pos logit contributions
 seven+0.061
 The+0.033
 and+0.028
 only+0.027
 He+0.023
Neg logit contributions
arre-0.159
 discoveries-0.157
inker-0.157
 possessions-0.152
 situations-0.152
Token what
Token ID644
Feature activation+8.36e-03

Pos logit contributions
 seven+0.006
 The+0.003
 and+0.003
 only+0.003
 He+0.003
Neg logit contributions
arre-0.011
 discoveries-0.010
 possessions-0.010
 situations-0.010
 discovery-0.010
Token she
Token ID673
Feature activation+8.80e-02

Pos logit contributions
 seven+0.013
 and+0.010
 The+0.009
 only+0.009
 like+0.009
Neg logit contributions
arre-0.019
beh-0.019
 discoveries-0.019
inker-0.018
opter-0.018
Token saw
Token ID2497
Feature activation+2.36e-02

Pos logit contributions
 seven+0.147
 and+0.115
 The+0.108
 only+0.101
 like+0.100
Neg logit contributions
arre-0.190
inker-0.180
 discoveries-0.180
beh-0.180
opter-0.178
Token she
Token ID673
Feature activation+3.49e-02

Pos logit contributions
 seven+0.026
 and+0.017
 only+0.015
 The+0.014
 like+0.014
Neg logit contributions
arre-0.048
 discoveries-0.047
 possessions-0.046
beh-0.046
inker-0.046
Token was
Token ID373
Feature activation+2.09e-02

Pos logit contributions
 seven+0.056
 and+0.046
 only+0.041
 The+0.040
 like+0.040
Neg logit contributions
arre-0.080
beh-0.076
inker-0.075
opter-0.075
 discoveries-0.075
Token sorry
Token ID7926
Feature activation+9.75e-03

Pos logit contributions
 seven+0.033
 and+0.027
 or+0.024
 He+0.024
 The+0.023
Neg logit contributions
beh-0.041
arre-0.040
oving-0.037
lled-0.035
 possessions-0.035
Token for
Token ID329
Feature activation+5.85e-03

Pos logit contributions
 seven+0.015
 The+0.007
 eight+0.007
 eighty+0.006
 ind+0.006
Neg logit contributions
 discoveries-0.023
arre-0.022
 possessions-0.022
 situations-0.021
 accidents-0.021
Token what
Token ID644
Feature activation+1.31e-02

Pos logit contributions
 seven+0.011
 and+0.009
 The+0.008
 like+0.008
 only+0.008
Neg logit contributions
arre-0.011
 discoveries-0.011
inker-0.011
beh-0.011
 situations-0.011
Token she
Token ID673
Feature activation+9.56e-02

Pos logit contributions
 seven+0.020
 and+0.015
 like+0.013
 only+0.013
 The+0.013
Neg logit contributions
arre-0.031
 discoveries-0.030
beh-0.030
inker-0.030
opter-0.029
Token did
Token ID750
Feature activation+5.38e-02

Pos logit contributions
 seven+0.144
 and+0.114
 only+0.099
 The+0.098
 like+0.095
Neg logit contributions
arre-0.230
beh-0.217
inker-0.216
 discoveries-0.216
opter-0.214
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
 seven+0.051
 The+0.022
 and+0.020
 only+0.019
 blue+0.014
Neg logit contributions
arre-0.162
 discoveries-0.154
inker-0.154
oving-0.151
 situations-0.151
Token He
Token ID679
Feature activation+1.16e-02

Pos logit contributions
 seven+0.028
 only+0.018
 like+0.017
 in+0.017
 spe+0.016
Neg logit contributions
arre-0.033
inker-0.032
ulk-0.030
 discoveries-0.030
irs-0.030
Token told
Token ID1297
Feature activation-1.86e-02

Pos logit contributions
 seven+0.021
 The+0.016
 and+0.015
 like+0.015
 or+0.014
Neg logit contributions
arre-0.023
 discoveries-0.021
beh-0.021
opter-0.021
inker-0.021
Token her
Token ID607
Feature activation+3.00e-02

Pos logit contributions
arre+0.043
inker+0.042
 discoveries+0.041
 situations+0.040
beh+0.040
Neg logit contributions
 seven-0.029
 and-0.022
 only-0.020
 like-0.019
 The-0.019
Token.
Token ID13
Feature activation+6.18e-03

Pos logit contributions
 seven+0.046
 The+0.023
 and+0.020
 only+0.020
 blue+0.016
Neg logit contributions
arre-0.119
 situations-0.113
inker-0.113
 discoveries-0.112
 possessions-0.112
Token She
Token ID1375
Feature activation+1.91e-02

Pos logit contributions
 seven+0.009
 and+0.007
 only+0.006
 like+0.006
 still+0.006
Neg logit contributions
arre-0.015
beh-0.014
inker-0.014
 situations-0.014
opter-0.013
Token can
Token ID460
Feature activation-5.98e-03

Pos logit contributions
 seven+0.031
 and+0.023
 The+0.021
 like+0.020
 only+0.020
Neg logit contributions
arre-0.044
beh-0.042
opter-0.041
inker-0.041
 discoveries-0.041
Token write
Token ID3551
Feature activation+1.14e-02

Pos logit contributions
arre+0.013
beh+0.012
inker+0.012
opter+0.012
 discoveries+0.012
Neg logit contributions
 seven-0.010
 and-0.009
 only-0.008
 like-0.008
 The-0.007
Token whatever
Token ID4232
Feature activation+3.55e-02

Pos logit contributions
 seven+0.015
 The+0.010
 only+0.010
 and+0.010
 He+0.008
Neg logit contributions
arre-0.027
 discoveries-0.026
inker-0.026
 possessions-0.026
opter-0.025
Token she
Token ID673
Feature activation+9.48e-02

Pos logit contributions
 seven+0.051
 and+0.044
 only+0.037
 like+0.036
 The+0.035
Neg logit contributions
arre-0.088
opter-0.083
inker-0.083
beh-0.082
 discoveries-0.081
Token wants
Token ID3382
Feature activation+1.95e-02

Pos logit contributions
 seven+0.155
 and+0.124
 The+0.110
 only+0.109
 like+0.104
Neg logit contributions
arre-0.214
beh-0.202
 discoveries-0.201
inker-0.199
opter-0.197
Token.
Token ID13
Feature activation+4.58e-03

Pos logit contributions
 seven+0.025
 The+0.014
 only+0.013
 still+0.013
 low+0.011
Neg logit contributions
 situations-0.048
arre-0.048
 possessions-0.047
 discoveries-0.047
 occasions-0.046
Token It
Token ID632
Feature activation-2.15e-02

Pos logit contributions
 seven+0.007
 and+0.005
 only+0.004
 still+0.004
 like+0.004
Neg logit contributions
arre-0.011
beh-0.011
 situations-0.011
inker-0.010
 discoveries-0.010
Token is
Token ID318
Feature activation+1.22e-02

Pos logit contributions
arre+0.051
beh+0.048
opter+0.047
 discoveries+0.047
inker+0.046
Neg logit contributions
 seven-0.033
 and-0.025
 The-0.022
 like-0.022
 only-0.021
Token her
Token ID607
Feature activation+1.37e-02

Pos logit contributions
 seven+0.019
 The+0.014
 and+0.014
 He+0.013
 driver+0.011
Neg logit contributions
arre-0.024
 possessions-0.024
 discoveries-0.024
 conflicts-0.024
beh-0.024
Token to
Token ID284
Feature activation-1.39e-02

Pos logit contributions
 seven+0.025
 and+0.017
 The+0.015
 only+0.014
 like+0.014
Neg logit contributions
arre-0.047
 discoveries-0.046
beh-0.045
 possessions-0.045
 situations-0.045
Token be
Token ID307
Feature activation+1.84e-03

Pos logit contributions
arre+0.029
 discoveries+0.027
beh+0.027
 situations+0.027
inker+0.026
Neg logit contributions
 seven-0.025
 and-0.020
 The-0.018
 only-0.017
 or-0.017
Token thankful
Token ID25535
Feature activation+4.54e-02

Pos logit contributions
 seven+0.003
 and+0.003
 The+0.002
 or+0.002
 only+0.002
Neg logit contributions
arre-0.004
beh-0.004
 discoveries-0.003
oving-0.003
 possessions-0.003
Token for
Token ID329
Feature activation+2.31e-02

Pos logit contributions
 seven+0.077
 The+0.050
 Leo+0.045
 eight+0.044
 He+0.044
Neg logit contributions
 possessions-0.097
 discoveries-0.095
 situations-0.090
arre-0.089
 belongings-0.089
Token what
Token ID644
Feature activation+6.54e-03

Pos logit contributions
 seven+0.033
 and+0.026
 only+0.022
 The+0.022
 like+0.022
Neg logit contributions
arre-0.056
 discoveries-0.054
beh-0.054
inker-0.053
 situations-0.053
Token she
Token ID673
Feature activation+9.46e-02

Pos logit contributions
 seven+0.009
 and+0.008
 only+0.006
 like+0.006
 The+0.006
Neg logit contributions
arre-0.017
inker-0.016
opter-0.016
beh-0.016
 discoveries-0.015
Token had
Token ID550
Feature activation+5.56e-02

Pos logit contributions
 seven+0.152
 and+0.119
 The+0.105
 only+0.104
 like+0.099
Neg logit contributions
arre-0.216
beh-0.204
 discoveries-0.204
inker-0.201
 possessions-0.197
Token.
Token ID13
Feature activation-3.77e-03

Pos logit contributions
 seven+0.057
 and+0.037
 The+0.030
 only+0.030
 like+0.027
Neg logit contributions
arre-0.162
inker-0.153
beh-0.153
 discoveries-0.153
opter-0.150
Token<|endoftext|>
Token ID50256
Feature activation-3.12e-02

Pos logit contributions
arre+0.010
 discoveries+0.010
inker+0.010
opter+0.010
beh+0.010
Neg logit contributions
 seven-0.004
 and-0.003
 only-0.002
 like-0.002
 or-0.002
TokenOnce
Token ID7454
Feature activation+3.27e-03

Pos logit contributions
arre+0.075
inker+0.072
 discoveries+0.070
opter+0.070
ert+0.069
Neg logit contributions
 seven-0.045
 only-0.028
 and-0.027
 like-0.027
 still-0.025
Token upon
Token ID2402
Feature activation+3.98e-02

Pos logit contributions
 seven+0.006
 and+0.006
 only+0.005
 The+0.005
 like+0.005
Neg logit contributions
arre-0.006
beh-0.006
inker-0.006
opter-0.006
 discoveries-0.006
Token was
Token ID373
Feature activation+1.39e-02

Pos logit contributions
 seven+0.009
 and+0.007
 only+0.006
 like+0.006
 The+0.006
Neg logit contributions
arre-0.012
beh-0.011
opter-0.011
inker-0.011
 discoveries-0.011
Token important
Token ID1593
Feature activation-1.37e-02

Pos logit contributions
 seven+0.027
 and+0.022
 The+0.021
 He+0.021
 Leo+0.020
Neg logit contributions
arre-0.025
beh-0.024
 possessions-0.024
 discoveries-0.023
oving-0.023
Token to
Token ID284
Feature activation-8.07e-03

Pos logit contributions
arre+0.035
 discoveries+0.035
 possessions+0.035
 situations+0.034
inker+0.034
Neg logit contributions
 seven-0.018
 The-0.011
 and-0.010
 only-0.009
 still-0.009
Token decide
Token ID5409
Feature activation+3.22e-02

Pos logit contributions
arre+0.018
beh+0.017
inker+0.017
 discoveries+0.017
opter+0.017
Neg logit contributions
 seven-0.013
 and-0.011
 only-0.009
 The-0.009
 like-0.009
Token what
Token ID644
Feature activation+1.23e-02

Pos logit contributions
 seven+0.040
 and+0.025
 The+0.025
 only+0.023
 He+0.021
Neg logit contributions
arre-0.084
 discoveries-0.082
 possessions-0.081
 situations-0.080
inker-0.080
Token she
Token ID673
Feature activation+9.45e-02

Pos logit contributions
 seven+0.015
 and+0.012
 only+0.010
 like+0.010
 The+0.010
Neg logit contributions
arre-0.033
inker-0.031
opter-0.031
beh-0.031
 discoveries-0.031
Token wanted
Token ID2227
Feature activation+1.75e-02

Pos logit contributions
 seven+0.152
 and+0.125
 only+0.110
 The+0.108
 like+0.106
Neg logit contributions
arre-0.217
beh-0.205
inker-0.203
 discoveries-0.203
opter-0.202
Token before
Token ID878
Feature activation+2.50e-02

Pos logit contributions
 seven+0.019
 The+0.009
 only+0.009
 and+0.008
 still+0.008
Neg logit contributions
arre-0.048
 situations-0.048
 discoveries-0.048
 possessions-0.047
inker-0.047
Token spending
Token ID4581
Feature activation+4.02e-02

Pos logit contributions
 seven+0.029
 and+0.018
 only+0.016
 like+0.016
 The+0.015
Neg logit contributions
arre-0.069
 situations-0.067
 discoveries-0.067
inker-0.066
beh-0.065
Token her
Token ID607
Feature activation-1.25e-02

Pos logit contributions
 seven+0.054
 and+0.040
 only+0.035
 The+0.034
 like+0.032
Neg logit contributions
arre-0.102
 discoveries-0.097
 situations-0.097
inker-0.096
beh-0.095
Token money
Token ID1637
Feature activation-2.26e-03

Pos logit contributions
arre+0.022
beh+0.021
 discoveries+0.021
inker+0.021
 situations+0.021
Neg logit contributions
 seven-0.026
 and-0.023
 only-0.021
 The-0.020
 like-0.020
Token She
Token ID1375
Feature activation+6.04e-03

Pos logit contributions
 seven+0.001
 and+0.001
 like+0.001
 only+0.001
 still+0.001
Neg logit contributions
arre-0.003
inker-0.003
 discoveries-0.003
 situations-0.003
opter-0.003
Token wondered
Token ID14028
Feature activation-7.31e-03

Pos logit contributions
 seven+0.010
 and+0.008
 only+0.007
 like+0.007
 The+0.007
Neg logit contributions
arre-0.014
beh-0.013
inker-0.013
opter-0.013
 discoveries-0.013
Token what
Token ID644
Feature activation+1.87e-02

Pos logit contributions
 discoveries+0.011
arre+0.011
 possessions+0.011
 situations+0.010
 discovery+0.010
Neg logit contributions
 seven-0.017
 The-0.013
 and-0.013
 only-0.012
 He-0.012
Token it
Token ID340
Feature activation+3.31e-02

Pos logit contributions
 seven+0.032
 and+0.025
 like+0.022
 only+0.022
 The+0.022
Neg logit contributions
arre-0.037
 discoveries-0.037
opter-0.037
 possessions-0.036
 situations-0.036
Token tasted
Token ID29187
Feature activation+4.25e-02

Pos logit contributions
 seven+0.057
 and+0.045
 only+0.040
 The+0.040
 like+0.039
Neg logit contributions
arre-0.072
 discoveries-0.068
inker-0.067
opter-0.067
beh-0.067
Token like
Token ID588
Feature activation+9.44e-02

Pos logit contributions
 seven+0.049
 The+0.031
 Leo+0.024
 eight+0.022
 only+0.021
Neg logit contributions
arre-0.114
ade-0.109
 discoveries-0.108
inker-0.104
oving-0.104
Token.
Token ID13
Feature activation+6.22e-04

Pos logit contributions
 seven+0.123
 The+0.080
 only+0.076
 driver+0.075
 still+0.073
Neg logit contributions
 possessions-0.210
ade-0.210
 situations-0.202
 discoveries-0.202
 occasions-0.199
Token She
Token ID1375
Feature activation+1.44e-02

Pos logit contributions
 seven+0.001
 and+0.000
 only+0.000
 like+0.000
 or+0.000
Neg logit contributions
arre-0.002
inker-0.002
 discoveries-0.002
beh-0.002
opter-0.002
Token opened
Token ID4721
Feature activation+3.88e-02

Pos logit contributions
 seven+0.024
 and+0.020
 like+0.019
 The+0.018
 only+0.018
Neg logit contributions
arre-0.031
opter-0.028
 discoveries-0.028
inker-0.028
beh-0.028
Token the
Token ID262
Feature activation+1.82e-02

Pos logit contributions
 seven+0.049
 and+0.034
 like+0.033
 only+0.031
 or+0.030
Neg logit contributions
arre-0.091
 discoveries-0.089
irs-0.088
 discovery-0.088
ert-0.087
Token jar
Token ID17379
Feature activation+1.54e-02

Pos logit contributions
 seven+0.038
 and+0.032
 only+0.030
 The+0.030
 like+0.029
Neg logit contributions
arre-0.032
 discoveries-0.029
opter-0.029
 possessions-0.028
inker-0.028
Token She
Token ID1375
Feature activation+2.09e-02

Pos logit contributions
 seven+0.016
 and+0.010
 like+0.009
 only+0.008
 or+0.008
Neg logit contributions
arre-0.036
inker-0.034
 situations-0.033
 discoveries-0.033
oving-0.032
Token was
Token ID373
Feature activation-5.93e-04

Pos logit contributions
 seven+0.037
 and+0.029
 The+0.027
 like+0.027
 or+0.026
Neg logit contributions
arre-0.043
inker-0.040
opter-0.039
beh-0.039
oving-0.039
Token sorry
Token ID7926
Feature activation+4.12e-04

Pos logit contributions
arre+0.001
beh+0.001
oving+0.001
inker+0.001
 discoveries+0.001
Neg logit contributions
 seven-0.001
 and-0.001
 The-0.001
 or-0.001
 like-0.001
Token for
Token ID329
Feature activation+8.65e-03

Pos logit contributions
 seven+0.000
 and+0.000
 only+0.000
 The+0.000
 like+0.000
Neg logit contributions
arre-0.001
 discoveries-0.001
inker-0.001
beh-0.001
 situations-0.001
Token what
Token ID644
Feature activation+1.75e-02

Pos logit contributions
 seven+0.013
 and+0.010
 The+0.009
 only+0.009
 like+0.009
Neg logit contributions
arre-0.020
inker-0.020
 discoveries-0.019
beh-0.019
opter-0.019
Token she
Token ID673
Feature activation+9.43e-02

Pos logit contributions
 seven+0.020
 and+0.016
 only+0.013
 like+0.013
 The+0.012
Neg logit contributions
arre-0.049
opter-0.046
beh-0.046
inker-0.046
 discoveries-0.046
Token did
Token ID750
Feature activation+5.35e-02

Pos logit contributions
 seven+0.114
 and+0.098
 only+0.083
 like+0.078
 still+0.075
Neg logit contributions
arre-0.254
opter-0.244
 situations-0.242
beh-0.242
inker-0.242
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
 seven+0.042
 and+0.021
 The+0.015
 only+0.015
 like+0.012
Neg logit contributions
arre-0.168
inker-0.161
 discoveries-0.160
opter-0.158
beh-0.158
Token She
Token ID1375
Feature activation+2.95e-02

Pos logit contributions
 seven+0.017
 and+0.011
 like+0.009
 only+0.009
 or+0.008
Neg logit contributions
arre-0.032
inker-0.030
 situations-0.029
 discoveries-0.029
oving-0.029
Token did
Token ID750
Feature activation+1.63e-02

Pos logit contributions
 seven+0.054
 The+0.040
 and+0.040
 like+0.038
 or+0.038
Neg logit contributions
arre-0.058
beh-0.052
oving-0.051
opter-0.051
inker-0.050
Token not
Token ID407
Feature activation+1.56e-03

Pos logit contributions
 seven+0.015
 and+0.012
 only+0.009
 like+0.009
 The+0.008
Neg logit contributions
arre-0.049
opter-0.048
inker-0.047
beh-0.047
 discoveries-0.047
Token could
Token ID714
Feature activation+1.70e-02

Pos logit contributions
 seven+0.100
 and+0.076
 The+0.070
 like+0.064
 only+0.060
Neg logit contributions
arre-0.150
beh-0.146
opter-0.143
 possessions-0.143
inker-0.141
Token be
Token ID307
Feature activation+1.42e-02

Pos logit contributions
 seven+0.032
 and+0.024
 The+0.023
 or+0.020
 only+0.020
Neg logit contributions
arre-0.033
inker-0.031
 discoveries-0.031
beh-0.030
oving-0.030
Token happy
Token ID3772
Feature activation+1.81e-02

Pos logit contributions
 seven+0.022
 and+0.017
 The+0.016
 He+0.015
 only+0.015
Neg logit contributions
beh-0.030
 discoveries-0.030
arre-0.030
oving-0.028
 possessions-0.028
Token with
Token ID351
Feature activation+3.21e-02

Pos logit contributions
 seven+0.026
 The+0.015
 eight+0.013
 only+0.012
 He+0.011
Neg logit contributions
 discoveries-0.045
 situations-0.044
arre-0.043
inker-0.042
 discovery-0.041
Token what
Token ID644
Feature activation+6.87e-03

Pos logit contributions
 seven+0.045
 and+0.036
 only+0.031
 like+0.030
 The+0.030
Neg logit contributions
arre-0.080
inker-0.076
beh-0.076
 discoveries-0.076
opter-0.076
Token she
Token ID673
Feature activation+9.40e-02

Pos logit contributions
 seven+0.011
 and+0.009
 only+0.007
 like+0.007
 The+0.007
Neg logit contributions
arre-0.016
beh-0.015
 discoveries-0.015
inker-0.015
opter-0.015
Token had
Token ID550
Feature activation+5.40e-02

Pos logit contributions
 seven+0.157
 and+0.118
 The+0.115
 only+0.104
 or+0.100
Neg logit contributions
arre-0.201
 discoveries-0.193
beh-0.192
inker-0.190
opter-0.186
Token.
Token ID13
Feature activation+9.02e-03

Pos logit contributions
 seven+0.052
 The+0.028
 and+0.027
 only+0.023
 He+0.022
Neg logit contributions
arre-0.157
inker-0.151
beh-0.151
 discoveries-0.150
 situations-0.147
Token
Token ID198
Feature activation+2.25e-03

Pos logit contributions
 seven+0.011
 and+0.008
 only+0.006
 like+0.006
 or+0.006
Neg logit contributions
arre-0.024
inker-0.023
 discoveries-0.022
 situations-0.022
beh-0.022
Token
Token ID198
Feature activation+1.52e-02

Pos logit contributions
 seven+0.003
 and+0.002
 only+0.002
 The+0.002
 like+0.002
Neg logit contributions
arre-0.006
beh-0.006
opter-0.006
inker-0.005
 discoveries-0.005
TokenShe
Token ID3347
Feature activation+3.30e-02

Pos logit contributions
 seven+0.021
 and+0.018
 only+0.015
 like+0.014
 still+0.014
Neg logit contributions
arre-0.038
beh-0.037
opter-0.037
 discoveries-0.037
 situations-0.036
Token up
Token ID510
Feature activation+5.49e-02

Pos logit contributions
 seven+0.026
 The+0.016
 and+0.016
 only+0.015
 He+0.013
Neg logit contributions
arre-0.048
inker-0.047
 discoveries-0.047
beh-0.046
 situations-0.045
Token.
Token ID13
Feature activation+6.26e-04

Pos logit contributions
 seven+0.072
 driver+0.047
 The+0.046
 He+0.045
 Leo+0.042
Neg logit contributions
inker-0.122
 discoveries-0.122
arre-0.120
beh-0.118
 situations-0.117
Token Nobody
Token ID15658
Feature activation-1.53e-02

Pos logit contributions
 seven+0.001
 and+0.000
 only+0.000
 like+0.000
 The+0.000
Neg logit contributions
arre-0.002
beh-0.002
opter-0.002
inker-0.002
 discoveries-0.002
Token knew
Token ID2993
Feature activation+1.29e-02

Pos logit contributions
arre+0.032
beh+0.030
 discoveries+0.030
opter+0.030
inker+0.029
Neg logit contributions
 seven-0.026
 and-0.019
 The-0.019
 only-0.018
 He-0.017
Token what
Token ID644
Feature activation+2.52e-02

Pos logit contributions
 seven+0.021
 and+0.017
 only+0.015
 The+0.015
 like+0.015
Neg logit contributions
arre-0.029
 discoveries-0.027
inker-0.027
beh-0.027
opter-0.027
Token she
Token ID673
Feature activation+9.36e-02

Pos logit contributions
 seven+0.033
 and+0.025
 only+0.021
 like+0.021
 The+0.020
Neg logit contributions
arre-0.063
 discoveries-0.061
inker-0.061
 situations-0.061
opter-0.060
Token was
Token ID373
Feature activation+4.50e-02

Pos logit contributions
 seven+0.140
 and+0.119
 only+0.104
 like+0.100
 The+0.097
Neg logit contributions
arre-0.225
opter-0.213
beh-0.212
inker-0.212
 discoveries-0.210
Token like
Token ID588
Feature activation+7.99e-02

Pos logit contributions
 seven+0.080
 and+0.062
 The+0.058
 only+0.058
 He+0.055
Neg logit contributions
arre-0.096
inker-0.090
 discoveries-0.088
beh-0.088
opter-0.088
Token or
Token ID393
Feature activation+1.25e-02

Pos logit contributions
 seven+0.101
 He+0.055
 The+0.055
 still+0.053
 also+0.050
Neg logit contributions
inker-0.195
 discoveries-0.193
 situations-0.192
 possessions-0.187
arre-0.186
Token what
Token ID644
Feature activation+7.50e-03

Pos logit contributions
 seven+0.019
 and+0.016
 only+0.015
 like+0.014
 The+0.014
Neg logit contributions
arre-0.029
opter-0.028
inker-0.028
beh-0.028
 discoveries-0.027
Token she
Token ID673
Feature activation+8.07e-02

Pos logit contributions
 seven+0.011
 and+0.009
 only+0.008
 like+0.007
 The+0.007
Neg logit contributions
arre-0.019
opter-0.018
inker-0.018
beh-0.018
 discoveries-0.018
Token I
Token ID314
Feature activation-1.86e-02

Pos logit contributions
 seven+0.008
 and+0.006
 still+0.006
 only+0.006
 very+0.005
Neg logit contributions
opter-0.007
beh-0.007
arre-0.007
 discoveries-0.007
igsaw-0.007
Token wonder
Token ID4240
Feature activation-4.17e-03

Pos logit contributions
arre+0.044
beh+0.041
 discoveries+0.041
opter+0.041
inker+0.040
Neg logit contributions
 seven-0.029
 and-0.023
 The-0.020
 only-0.020
 like-0.019
Token what
Token ID644
Feature activation+8.14e-03

Pos logit contributions
 discoveries+0.007
 possessions+0.006
 discovery+0.006
beh+0.006
ert+0.006
Neg logit contributions
 seven-0.008
 The-0.007
 and-0.006
 only-0.006
 He-0.006
Token it
Token ID340
Feature activation+3.39e-02

Pos logit contributions
 seven+0.011
 and+0.009
 like+0.007
 still+0.007
 only+0.007
Neg logit contributions
 discoveries-0.018
 situations-0.018
opter-0.018
 possessions-0.018
beh-0.018
Token tastes
Token ID18221
Feature activation+2.90e-02

Pos logit contributions
 seven+0.050
 and+0.039
 only+0.034
 The+0.034
 like+0.033
Neg logit contributions
arre-0.082
beh-0.077
 discoveries-0.077
inker-0.077
opter-0.076
Token like
Token ID588
Feature activation+9.32e-02

Pos logit contributions
 seven+0.036
 The+0.027
 driver+0.021
 only+0.021
 He+0.020
Neg logit contributions
arre-0.068
ade-0.067
 discoveries-0.065
beh-0.064
oving-0.063
Token."
Token ID526
Feature activation+1.46e-03

Pos logit contributions
 seven+0.130
 only+0.112
 still+0.106
 The+0.105
 He+0.102
Neg logit contributions
 situations-0.177
 possessions-0.172
 discoveries-0.168
ade-0.167
 occasions-0.163
Token
Token ID198
Feature activation-6.47e-04

Pos logit contributions
 seven+0.002
 only+0.002
 like+0.002
 or+0.001
 still+0.001
Neg logit contributions
 discoveries-0.003
arre-0.003
ulk-0.003
inker-0.003
opter-0.003
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
arre+0.002
inker+0.002
opter+0.002
 discoveries+0.002
beh+0.002
Neg logit contributions
 seven-0.001
 and-0.000
 only-0.000
 like-0.000
 or-0.000
Token"
Token ID1
Feature activation-2.64e-03

Pos logit contributions
 seven+0.020
 and+0.016
 only+0.016
 like+0.015
 or+0.015
Neg logit contributions
arre-0.020
opter-0.019
inker-0.019
beh-0.018
oving-0.018
TokenI
Token ID40
Feature activation-2.54e-03

Pos logit contributions
arre+0.005
opter+0.005
beh+0.005
inker+0.005
 discoveries+0.005
Neg logit contributions
 seven-0.005
 and-0.005
 only-0.004
 like-0.004
 The-0.004
Token He
Token ID679
Feature activation+7.22e-03

Pos logit contributions
 seven+0.020
 only+0.012
 still+0.012
 and+0.012
 also+0.011
Neg logit contributions
arre-0.025
inker-0.025
igsaw-0.025
 discoveries-0.025
 situations-0.025
Token wondered
Token ID14028
Feature activation-8.55e-03

Pos logit contributions
 seven+0.013
 The+0.010
 or+0.010
 and+0.010
 like+0.010
Neg logit contributions
arre-0.013
 discoveries-0.012
inker-0.012
 arose-0.011
oving-0.011
Token what
Token ID644
Feature activation+1.31e-02

Pos logit contributions
 discoveries+0.014
 possessions+0.013
arre+0.013
ert+0.013
inker+0.013
Neg logit contributions
 seven-0.019
 The-0.014
 and-0.014
 only-0.014
 He-0.013
Token it
Token ID340
Feature activation+3.31e-02

Pos logit contributions
 seven+0.020
 and+0.015
 like+0.013
 only+0.013
 The+0.013
Neg logit contributions
 discoveries-0.029
arre-0.028
 situations-0.028
beh-0.028
 possessions-0.028
Token tasted
Token ID29187
Feature activation+4.21e-02

Pos logit contributions
 seven+0.057
 and+0.045
 The+0.040
 only+0.039
 like+0.038
Neg logit contributions
arre-0.072
 discoveries-0.068
inker-0.068
beh-0.068
opter-0.067
Token like
Token ID588
Feature activation+9.32e-02

Pos logit contributions
 seven+0.057
 The+0.039
 eight+0.032
 driver+0.029
 Leo+0.027
Neg logit contributions
ade-0.100
 discoveries-0.098
arre-0.098
angle-0.095
inker-0.095
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
 seven+0.127
 The+0.080
 only+0.080
 still+0.076
 also+0.072
Neg logit contributions
 possessions-0.210
 situations-0.208
 discoveries-0.203
ade-0.194
 arose-0.193
Token He
Token ID679
Feature activation+5.50e-03

Pos logit contributions
 seven+0.019
 and+0.011
 only+0.011
 like+0.010
 still+0.010
Neg logit contributions
arre-0.029
inker-0.027
 discoveries-0.027
 situations-0.026
oving-0.026
Token opened
Token ID4721
Feature activation+3.47e-02

Pos logit contributions
 seven+0.010
 The+0.008
 like+0.008
 or+0.008
 and+0.008
Neg logit contributions
arre-0.009
 discoveries-0.008
 arose-0.008
oving-0.008
beh-0.008
Token his
Token ID465
Feature activation+4.67e-04

Pos logit contributions
 seven+0.049
 like+0.035
 driver+0.034
 only+0.033
 or+0.032
Neg logit contributions
 discovery-0.071
 discoveries-0.070
ert-0.070
angle-0.068
irs-0.068
Token mouth
Token ID5422
Feature activation+1.85e-02

Pos logit contributions
 seven+0.001
 and+0.001
 like+0.001
 only+0.001
 The+0.001
Neg logit contributions
 discoveries-0.001
arre-0.001
 possessions-0.001
inker-0.001
 discovery-0.001
Token.
Token ID13
Feature activation+4.44e-03

Pos logit contributions
 seven+0.026
 only+0.013
 still+0.013
 also+0.013
 eighty+0.012
Neg logit contributions
arre-0.046
 situations-0.046
 discoveries-0.044
 possessions-0.044
inker-0.044
Token She
Token ID1375
Feature activation+1.54e-02

Pos logit contributions
 seven+0.006
 and+0.004
 only+0.003
 still+0.003
 or+0.003
Neg logit contributions
arre-0.012
 situations-0.011
 discoveries-0.011
igsaw-0.011
inker-0.011
Token wrote
Token ID2630
Feature activation+2.02e-02

Pos logit contributions
 seven+0.031
 The+0.025
 and+0.024
 like+0.023
 Leo+0.023
Neg logit contributions
opter-0.025
arre-0.024
 possessions-0.023
 discoveries-0.023
making-0.023
Token about
Token ID546
Feature activation+2.95e-02

Pos logit contributions
 seven+0.027
 and+0.020
 only+0.018
 The+0.017
 like+0.017
Neg logit contributions
arre-0.050
inker-0.048
 discoveries-0.047
opter-0.047
 possessions-0.046
Token what
Token ID644
Feature activation+1.17e-02

Pos logit contributions
 seven+0.040
 and+0.035
 only+0.029
 like+0.028
 The+0.027
Neg logit contributions
arre-0.076
beh-0.073
opter-0.072
inker-0.072
oving-0.071
Token she
Token ID673
Feature activation+9.30e-02

Pos logit contributions
 seven+0.017
 and+0.015
 only+0.013
 like+0.012
 The+0.012
Neg logit contributions
arre-0.028
inker-0.027
opter-0.027
beh-0.027
 discoveries-0.026
Token did
Token ID750
Feature activation+5.70e-02

Pos logit contributions
 seven+0.147
 and+0.112
 The+0.102
 only+0.098
 like+0.096
Neg logit contributions
arre-0.207
inker-0.200
 discoveries-0.199
beh-0.197
opter-0.196
Token,
Token ID11
Feature activation+4.70e-03

Pos logit contributions
 seven+0.060
 The+0.032
 and+0.029
 only+0.027
 He+0.023
Neg logit contributions
arre-0.159
inker-0.156
 discoveries-0.155
 possessions-0.151
 situations-0.150
Token what
Token ID644
Feature activation+8.47e-03

Pos logit contributions
 seven+0.007
 The+0.004
 and+0.004
 only+0.003
 He+0.003
Neg logit contributions
arre-0.011
 discoveries-0.011
 possessions-0.011
 situations-0.010
inker-0.010
Token she
Token ID673
Feature activation+8.53e-02

Pos logit contributions
 seven+0.011
 and+0.010
 only+0.009
 like+0.008
 still+0.008
Neg logit contributions
arre-0.022
opter-0.021
inker-0.021
beh-0.021
oving-0.020
Token saw
Token ID2497
Feature activation+2.42e-02

Pos logit contributions
 seven+0.138
 and+0.109
 The+0.099
 only+0.096
 like+0.094
Neg logit contributions
arre-0.191
inker-0.181
beh-0.180
 discoveries-0.180
opter-0.179
Token her
Token ID607
Feature activation-1.53e-03

Pos logit contributions
arre+0.027
opter+0.026
inker+0.026
 discoveries+0.025
beh+0.025
Neg logit contributions
 seven-0.016
 and-0.014
 only-0.013
 like-0.012
 still-0.012
Token owners
Token ID4393
Feature activation-1.85e-02

Pos logit contributions
arre+0.003
 discoveries+0.003
inker+0.003
 possessions+0.003
 situations+0.003
Neg logit contributions
 seven-0.003
 and-0.002
 like-0.002
 The-0.002
 only-0.002
Token never
Token ID1239
Feature activation+4.42e-03

Pos logit contributions
arre+0.037
beh+0.034
opter+0.034
oving+0.033
inker+0.033
Neg logit contributions
 seven-0.036
 and-0.028
 The-0.026
 only-0.025
 like-0.024
Token knew
Token ID2993
Feature activation+4.15e-03

Pos logit contributions
 seven+0.007
 and+0.006
 like+0.006
 only+0.005
 still+0.005
Neg logit contributions
arre-0.010
opter-0.010
inker-0.010
 discoveries-0.010
irs-0.010
Token what
Token ID644
Feature activation+2.17e-02

Pos logit contributions
 seven+0.007
 and+0.005
 The+0.005
 only+0.005
 like+0.004
Neg logit contributions
arre-0.009
 discoveries-0.009
beh-0.009
 possessions-0.009
 situations-0.009
Token she
Token ID673
Feature activation+9.30e-02

Pos logit contributions
 seven+0.027
 and+0.019
 like+0.016
 only+0.016
 The+0.015
Neg logit contributions
arre-0.057
beh-0.055
opter-0.055
 situations-0.055
 discoveries-0.054
Token was
Token ID373
Feature activation+4.57e-02

Pos logit contributions
 seven+0.145
 and+0.123
 only+0.108
 like+0.104
 The+0.103
Neg logit contributions
arre-0.218
opter-0.206
inker-0.205
beh-0.205
 discoveries-0.204
Token going
Token ID1016
Feature activation+4.03e-02

Pos logit contributions
 seven+0.087
 and+0.070
 only+0.065
 The+0.064
 like+0.061
Neg logit contributions
arre-0.093
beh-0.086
inker-0.086
opter-0.085
 discoveries-0.085
Token to
Token ID284
Feature activation+1.12e-02

Pos logit contributions
 seven+0.058
 The+0.036
 only+0.035
 Anna+0.033
 and+0.033
Neg logit contributions
arre-0.099
inker-0.093
beh-0.092
 discoveries-0.089
 situations-0.089
Token do
Token ID466
Feature activation+5.57e-02

Pos logit contributions
 seven+0.019
 and+0.017
 only+0.015
 like+0.015
 The+0.015
Neg logit contributions
arre-0.025
beh-0.023
opter-0.023
inker-0.023
 discoveries-0.023
Token next
Token ID1306
Feature activation+3.60e-02

Pos logit contributions
 seven+0.054
 and+0.029
 The+0.026
 only+0.026
 still+0.021
Neg logit contributions
arre-0.162
inker-0.156
beh-0.155
 situations-0.154
 discoveries-0.154
Token,
Token ID11
Feature activation-1.99e-02

Pos logit contributions
arre+0.103
opter+0.099
inker+0.099
 discoveries+0.098
 situations+0.097
Neg logit contributions
 seven-0.032
 and-0.019
 only-0.015
 like-0.014
 The-0.014
Token he
Token ID339
Feature activation-4.01e-03

Pos logit contributions
arre+0.049
opter+0.048
 discoveries+0.048
inker+0.047
 situations+0.047
Neg logit contributions
 seven-0.026
 and-0.019
 like-0.016
 only-0.016
 The-0.015
Token made
Token ID925
Feature activation-1.28e-02

Pos logit contributions
arre+0.008
opter+0.007
 discoveries+0.007
inker+0.007
oving+0.007
Neg logit contributions
 seven-0.007
 and-0.006
 The-0.005
 like-0.005
 or-0.005
Token some
Token ID617
Feature activation-4.16e-02

Pos logit contributions
arre+0.036
beh+0.035
opter+0.035
inker+0.034
 situations+0.034
Neg logit contributions
 seven-0.014
 and-0.012
 only-0.009
 The-0.009
 like-0.009
Token money
Token ID1637
Feature activation-2.30e-02

Pos logit contributions
arre+0.082
 discoveries+0.077
opter+0.076
beh+0.076
inker+0.076
Neg logit contributions
 seven-0.080
 and-0.068
 only-0.063
 The-0.061
 like-0.060
Token transactions
Token ID8945
Feature activation+9.30e-02

Pos logit contributions
 discoveries+0.059
 possessions+0.058
arre+0.057
 discovery+0.056
beh+0.056
Neg logit contributions
 seven-0.029
 The-0.017
 only-0.014
 Tw-0.013
 Leo-0.012
Token.
Token ID13
Feature activation-6.75e-03

Pos logit contributions
 seven+0.109
 The+0.066
 only+0.062
 still+0.060
 also+0.059
Neg logit contributions
inker-0.234
arre-0.229
 situations-0.224
beh-0.223
ert-0.221
Token At
Token ID1629
Feature activation+6.71e-03

Pos logit contributions
arre+0.015
 situations+0.015
inker+0.015
opter+0.014
 discoveries+0.014
Neg logit contributions
 seven-0.010
 and-0.007
 still-0.007
 only-0.006
 like-0.006
Token the
Token ID262
Feature activation+3.55e-03

Pos logit contributions
 seven+0.006
 and+0.005
 like+0.004
 only+0.003
 The+0.003
Neg logit contributions
arre-0.019
inker-0.018
 discoveries-0.018
 situations-0.018
opter-0.018
Token post
Token ID1281
Feature activation-1.22e-02

Pos logit contributions
 seven+0.007
 and+0.005
 The+0.005
 like+0.005
 only+0.005
Neg logit contributions
arre-0.007
 possessions-0.006
 discoveries-0.006
opter-0.006
 situations-0.006
Token office
Token ID2607
Feature activation+1.57e-02

Pos logit contributions
opter+0.025
arre+0.024
 situations+0.024
 discoveries+0.024
inker+0.024
Neg logit contributions
 seven-0.021
 like-0.016
 and-0.014
 only-0.014
 The-0.014
Token very
Token ID845
Feature activation-8.37e-03

Pos logit contributions
 seven+0.002
 and+0.002
 He+0.002
 or+0.001
 The+0.001
Neg logit contributions
beh-0.004
arre-0.004
playing-0.003
 possessions-0.003
oving-0.003
Token generous
Token ID14431
Feature activation+6.38e-03

Pos logit contributions
beh+0.017
arre+0.017
 discoveries+0.016
opter+0.016
 possessions+0.016
Neg logit contributions
 seven-0.014
 and-0.012
 or-0.011
 The-0.010
 only-0.010
Token and
Token ID290
Feature activation+1.22e-02

Pos logit contributions
 seven+0.007
 The+0.004
 still+0.004
 low+0.004
 only+0.004
Neg logit contributions
 discoveries-0.016
inker-0.016
arre-0.015
 discovery-0.015
 possessions-0.015
Token shared
Token ID4888
Feature activation+3.24e-02

Pos logit contributions
 seven+0.018
 and+0.015
 The+0.012
 or+0.012
 only+0.012
Neg logit contributions
arre-0.028
beh-0.028
 discoveries-0.027
opter-0.027
 possessions-0.027
Token whatever
Token ID4232
Feature activation+3.57e-02

Pos logit contributions
 seven+0.050
 and+0.038
 only+0.036
 The+0.034
 still+0.034
Neg logit contributions
arre-0.073
 discoveries-0.072
beh-0.070
inker-0.070
 possessions-0.069
Token she
Token ID673
Feature activation+9.29e-02

Pos logit contributions
 seven+0.056
 and+0.045
 only+0.040
 like+0.039
 The+0.038
Neg logit contributions
arre-0.082
beh-0.079
 discoveries-0.079
inker-0.078
opter-0.078
Token had
Token ID550
Feature activation+6.22e-02

Pos logit contributions
 seven+0.164
 and+0.129
 The+0.126
 only+0.115
 He+0.111
Neg logit contributions
arre-0.185
beh-0.184
 discoveries-0.179
 possessions-0.175
inker-0.173
Token with
Token ID351
Feature activation+3.93e-02

Pos logit contributions
 seven+0.077
 The+0.051
 and+0.051
 only+0.046
 He+0.043
Neg logit contributions
arre-0.162
beh-0.158
inker-0.158
 discoveries-0.155
opter-0.152
Token people
Token ID661
Feature activation-6.73e-03

Pos logit contributions
 seven+0.054
 and+0.035
 The+0.033
 only+0.033
 He+0.030
Neg logit contributions
 possessions-0.094
 discoveries-0.091
beh-0.089
irs-0.089
inker-0.089
Token in
Token ID287
Feature activation+1.67e-02

Pos logit contributions
arre+0.019
beh+0.018
 possessions+0.018
 discoveries+0.018
opter+0.017
Neg logit contributions
 seven-0.008
 The-0.004
 only-0.004
 and-0.004
 He-0.003
Token need
Token ID761
Feature activation+4.01e-03

Pos logit contributions
 seven+0.021
 and+0.014
 only+0.012
 like+0.011
 The+0.011
Neg logit contributions
arre-0.042
 discoveries-0.042
 situations-0.042
inker-0.041
 possessions-0.041
Token She
Token ID1375
Feature activation+6.31e-03

Pos logit contributions
 seven+0.001
 and+0.001
 like+0.001
 still+0.001
 only+0.001
Neg logit contributions
arre-0.002
 situations-0.002
inker-0.002
opter-0.002
 discoveries-0.002
Token wondered
Token ID14028
Feature activation-1.01e-02

Pos logit contributions
 seven+0.010
 and+0.008
 only+0.007
 The+0.007
 still+0.007
Neg logit contributions
arre-0.015
beh-0.015
inker-0.015
opter-0.015
 situations-0.015
Token what
Token ID644
Feature activation+1.82e-02

Pos logit contributions
 discoveries+0.016
arre+0.015
 possessions+0.015
opter+0.015
inker+0.015
Neg logit contributions
 seven-0.023
 The-0.017
 and-0.017
 only-0.017
 He-0.016
Token it
Token ID340
Feature activation+3.31e-02

Pos logit contributions
 seven+0.032
 and+0.025
 like+0.023
 The+0.022
 only+0.022
Neg logit contributions
 discoveries-0.035
opter-0.035
 possessions-0.035
arre-0.035
making-0.034
Token tasted
Token ID29187
Feature activation+4.33e-02

Pos logit contributions
 seven+0.053
 and+0.043
 only+0.038
 The+0.037
 like+0.037
Neg logit contributions
arre-0.076
 discoveries-0.071
inker-0.071
opter-0.071
beh-0.071
Token like
Token ID588
Feature activation+9.28e-02

Pos logit contributions
 seven+0.050
 The+0.032
 Leo+0.025
 eight+0.023
 only+0.021
Neg logit contributions
arre-0.115
ade-0.110
 discoveries-0.110
inker-0.106
 possessions-0.106
Token.
Token ID13
Feature activation+5.61e-04

Pos logit contributions
 seven+0.118
 The+0.078
 He+0.075
 only+0.072
 still+0.071
Neg logit contributions
 possessions-0.214
ade-0.208
 discoveries-0.204
 situations-0.204
 occasions-0.203
Token She
Token ID1375
Feature activation+1.44e-02

Pos logit contributions
 seven+0.001
 and+0.001
 like+0.000
 only+0.000
 still+0.000
Neg logit contributions
arre-0.001
inker-0.001
 situations-0.001
 discoveries-0.001
beh-0.001
Token climbed
Token ID19952
Feature activation+5.29e-03

Pos logit contributions
 seven+0.024
 and+0.019
 like+0.017
 The+0.017
 only+0.017
Neg logit contributions
arre-0.032
inker-0.030
beh-0.030
opter-0.030
 discoveries-0.030
Token on
Token ID319
Feature activation-4.72e-03

Pos logit contributions
 seven+0.007
 like+0.005
 only+0.005
 or+0.005
 The+0.005
Neg logit contributions
ert-0.011
arre-0.011
inker-0.011
making-0.010
 discovery-0.010
Token a
Token ID257
Feature activation-9.29e-03

Pos logit contributions
 discovery+0.011
 discoveries+0.011
arre+0.010
ert+0.010
 possessions+0.010
Neg logit contributions
 seven-0.006
 like-0.004
 and-0.004
 He-0.004
 driver-0.003
Token and
Token ID290
Feature activation-1.32e-02

Pos logit contributions
 seven+0.026
 has+0.015
 Anna+0.015
 Sunny+0.014
 Africa+0.014
Neg logit contributions
 discovery-0.054
 discoveries-0.053
arre-0.049
ge-0.049
ert-0.049
Token saw
Token ID2497
Feature activation-1.90e-02

Pos logit contributions
arre+0.028
 discoveries+0.027
inker+0.027
opter+0.027
beh+0.026
Neg logit contributions
 seven-0.022
 and-0.017
 only-0.016
 like-0.015
 or-0.015
Token two
Token ID734
Feature activation-5.03e-02

Pos logit contributions
arre+0.057
beh+0.055
opter+0.054
inker+0.054
 discoveries+0.054
Neg logit contributions
 seven-0.017
 and-0.013
 only-0.010
 like-0.009
 The-0.009
Token birds
Token ID10087
Feature activation-1.12e-02

Pos logit contributions
arre+0.112
opter+0.105
 discoveries+0.105
inker+0.105
beh+0.104
Neg logit contributions
 seven-0.083
 and-0.070
 only-0.062
 The-0.061
 like-0.060
Token qu
Token ID627
Feature activation-5.91e-02

Pos logit contributions
arre+0.024
beh+0.022
opter+0.022
 discoveries+0.022
oving+0.022
Neg logit contributions
 seven-0.020
 The-0.014
 only-0.014
 and-0.014
 like-0.013
Tokenarre
Token ID9624
Feature activation-1.24e-01

Pos logit contributions
 discoveries+0.090
 possessions+0.088
 situations+0.083
opter+0.083
 discovery+0.081
Neg logit contributions
 seven-0.132
 and-0.126
 like-0.108
 The-0.107
 still-0.106
Tokenling
Token ID1359
Feature activation-2.59e-02

Pos logit contributions
igsaw+0.199
 discoveries+0.195
arre+0.193
inker+0.189
 weaknesses+0.186
Neg logit contributions
 and-0.268
 seven-0.258
 in-0.235
 like-0.224
 or-0.213
Token!
Token ID0
Feature activation-1.40e-02

Pos logit contributions
arre+0.070
 discoveries+0.068
beh+0.068
 situations+0.068
inker+0.066
Neg logit contributions
 seven-0.028
 only-0.016
 The-0.015
 still-0.015
 and-0.014
Token They
Token ID1119
Feature activation-2.04e-02

Pos logit contributions
arre+0.035
 situations+0.031
 discoveries+0.031
opter+0.031
ert+0.031
Neg logit contributions
 seven-0.018
 like-0.013
 and-0.013
 only-0.013
 still-0.012
Token were
Token ID547
Feature activation-2.73e-02

Pos logit contributions
arre+0.044
beh+0.042
inker+0.042
 situations+0.041
opter+0.041
Neg logit contributions
 seven-0.035
 and-0.031
 only-0.028
 like-0.027
 The-0.026
Token squ
Token ID2809
Feature activation-4.00e-02

Pos logit contributions
arre+0.064
oving+0.059
beh+0.059
inker+0.057
 discoveries+0.057
Neg logit contributions
 seven-0.041
 and-0.031
 The-0.029
 or-0.027
 He-0.026
Token We
Token ID775
Feature activation-2.02e-02

Pos logit contributions
 seven+0.015
 and+0.012
 still+0.011
 low+0.011
 high+0.011
Neg logit contributions
arre-0.018
beh-0.017
 situations-0.017
inker-0.015
 weaknesses-0.015
Token have
Token ID423
Feature activation-1.88e-02

Pos logit contributions
arre+0.047
beh+0.044
opter+0.043
inker+0.043
 discoveries+0.043
Neg logit contributions
 seven-0.032
 and-0.026
 The-0.023
 only-0.022
 like-0.021
Token different
Token ID1180
Feature activation-6.03e-02

Pos logit contributions
arre+0.047
beh+0.044
 discoveries+0.044
 possessions+0.044
 situations+0.044
Neg logit contributions
 seven-0.024
 and-0.019
 The-0.016
 only-0.015
 like-0.015
Token likes
Token ID7832
Feature activation+2.79e-02

Pos logit contributions
arre+0.097
 discoveries+0.092
inker+0.091
opter+0.090
 situations+0.090
Neg logit contributions
 seven-0.135
 and-0.117
 only-0.111
 The-0.108
 still-0.106
Token and
Token ID290
Feature activation-2.46e-03

Pos logit contributions
 seven+0.047
 The+0.035
 only+0.034
 still+0.034
 He+0.033
Neg logit contributions
 discoveries-0.056
 situations-0.055
 possessions-0.055
arre-0.054
 conflicts-0.052
Token disl
Token ID19621
Feature activation-1.22e-01

Pos logit contributions
arre+0.005
inker+0.005
opter+0.005
beh+0.005
 discoveries+0.005
Neg logit contributions
 seven-0.005
 and-0.004
 only-0.003
 like-0.003
 The-0.003
Tokenikes
Token ID7938
Feature activation-5.36e-02

Pos logit contributions
 situations+0.113
opter+0.111
 possessions+0.109
arre+0.107
igsaw+0.106
Neg logit contributions
 and-0.335
 seven-0.315
 only-0.305
 like-0.299
.-0.295
Token.
Token ID13
Feature activation-1.93e-03

Pos logit contributions
arre+0.139
opter+0.129
 situations+0.128
inker+0.124
oving+0.122
Neg logit contributions
 seven-0.062
 only-0.036
 The-0.035
 still-0.031
 and-0.029
Token We
Token ID775
Feature activation-2.32e-02

Pos logit contributions
arre+0.004
beh+0.004
 situations+0.004
 weaknesses+0.003
rums+0.003
Neg logit contributions
 seven-0.003
 and-0.003
 still-0.002
 only-0.002
 high-0.002
Token can
Token ID460
Feature activation-9.28e-03

Pos logit contributions
arre+0.055
beh+0.051
opter+0.051
inker+0.051
 discoveries+0.050
Neg logit contributions
 seven-0.036
 and-0.029
 The-0.025
 only-0.025
 like-0.024
Token be
Token ID307
Feature activation-1.38e-02

Pos logit contributions
arre+0.020
opter+0.018
inker+0.018
 discoveries+0.018
oving+0.018
Neg logit contributions
 seven-0.016
 and-0.013
 The-0.012
 like-0.011
 only-0.011
Token two
Token ID734
Feature activation-3.71e-02

Pos logit contributions
 seven+0.047
 and+0.040
 like+0.034
 The+0.034
 only+0.034
Neg logit contributions
arre-0.072
inker-0.068
 discoveries-0.067
opter-0.067
beh-0.067
Token squirrel
Token ID33039
Feature activation-3.12e-02

Pos logit contributions
arre+0.071
opter+0.066
inker+0.066
beh+0.066
 discoveries+0.066
Neg logit contributions
 seven-0.074
 and-0.064
 only-0.058
 like-0.057
 The-0.057
Tokens
Token ID82
Feature activation-2.18e-02

Pos logit contributions
arre+0.062
opter+0.058
beh+0.058
inker+0.057
 discoveries+0.057
Neg logit contributions
 seven-0.060
 and-0.051
 only-0.047
 like-0.045
 The-0.045
Token kept
Token ID4030
Feature activation-1.59e-02

Pos logit contributions
arre+0.048
inker+0.045
opter+0.045
 discoveries+0.044
beh+0.044
Neg logit contributions
 seven-0.037
 and-0.029
 The-0.026
 like-0.026
 only-0.026
Token qu
Token ID627
Feature activation-4.74e-02

Pos logit contributions
arre+0.033
beh+0.031
 discoveries+0.031
inker+0.031
oving+0.030
Neg logit contributions
 seven-0.028
 and-0.022
 The-0.021
 only-0.020
 like-0.020
Tokenarre
Token ID9624
Feature activation-1.20e-01

Pos logit contributions
 possessions+0.050
 discoveries+0.049
opter+0.046
 discovery+0.044
 situations+0.044
Neg logit contributions
 seven-0.127
 and-0.125
 like-0.111
 still-0.109
 The-0.108
Tokenling
Token ID1359
Feature activation-1.75e-02

Pos logit contributions
irs+0.203
 discoveries+0.202
igsaw+0.202
 weaknesses+0.193
inker+0.193
Neg logit contributions
 and-0.242
 seven-0.233
 in-0.206
 like-0.203
 very-0.187
Token until
Token ID1566
Feature activation-9.09e-03

Pos logit contributions
arre+0.050
 discoveries+0.049
 situations+0.049
beh+0.048
inker+0.048
Neg logit contributions
 seven-0.016
 only-0.009
 The-0.007
 and-0.007
 still-0.007
Token finally
Token ID3443
Feature activation+6.03e-03

Pos logit contributions
arre+0.020
 discoveries+0.019
opter+0.019
beh+0.019
ert+0.018
Neg logit contributions
 seven-0.014
 and-0.011
 like-0.010
 only-0.009
 The-0.009
Token Sammy
Token ID44067
Feature activation-8.20e-03

Pos logit contributions
 seven+0.008
 and+0.006
 only+0.005
 like+0.005
 The+0.005
Neg logit contributions
arre-0.015
opter-0.015
 discoveries-0.014
beh-0.014
inker-0.014
Token said
Token ID531
Feature activation+5.82e-03

Pos logit contributions
arre+0.017
opter+0.015
inker+0.015
beh+0.015
ert+0.014
Neg logit contributions
 seven-0.015
 and-0.011
 only-0.011
 The-0.011
 like-0.011
Token other
Token ID584
Feature activation-2.82e-02

Pos logit contributions
arre+0.038
beh+0.035
inker+0.035
opter+0.035
 discoveries+0.035
Neg logit contributions
 seven-0.030
 and-0.027
 only-0.024
 like-0.023
 The-0.022
Token.
Token ID13
Feature activation-1.00e-02

Pos logit contributions
arre+0.085
 situations+0.081
 discoveries+0.081
inker+0.080
opter+0.080
Neg logit contributions
 seven-0.026
 only-0.010
 The-0.010
 and-0.009
 still-0.008
Token They
Token ID1119
Feature activation-1.41e-02

Pos logit contributions
arre+0.027
opter+0.025
oving+0.025
inker+0.025
 discoveries+0.025
Neg logit contributions
 seven-0.012
 and-0.009
 like-0.008
 only-0.007
 or-0.007
Token starting
Token ID3599
Feature activation-3.14e-03

Pos logit contributions
arre+0.031
inker+0.030
beh+0.030
opter+0.030
 discoveries+0.029
Neg logit contributions
 seven-0.024
 and-0.020
 only-0.018
 like-0.017
 The-0.017
Token qu
Token ID627
Feature activation-5.72e-02

Pos logit contributions
arre+0.009
opter+0.009
inker+0.009
 discoveries+0.009
 situations+0.009
Neg logit contributions
 seven-0.003
 and-0.002
 only-0.002
 The-0.002
 like-0.002
Tokenarre
Token ID9624
Feature activation-1.16e-01

Pos logit contributions
 possessions+0.045
 discoveries+0.043
 discovery+0.038
 occasions+0.037
 situations+0.036
Neg logit contributions
 and-0.171
 seven-0.167
 The-0.151
 still-0.151
 like-0.150
Tokenling
Token ID1359
Feature activation-1.48e-02

Pos logit contributions
igsaw+0.190
 discoveries+0.187
irs+0.180
 possessions+0.177
inker+0.176
Neg logit contributions
 and-0.257
 seven-0.232
 in-0.222
 like-0.207
 very-0.197
Token again
Token ID757
Feature activation-8.56e-03

Pos logit contributions
arre+0.046
 discoveries+0.044
 situations+0.043
inker+0.043
oving+0.043
Neg logit contributions
 seven-0.012
 only-0.005
 and-0.005
 The-0.004
 like-0.003
Token and
Token ID290
Feature activation-2.80e-02

Pos logit contributions
arre+0.029
opter+0.028
inker+0.028
beh+0.028
 discoveries+0.028
Neg logit contributions
 seven-0.004
 and-0.002
 only-0.001
 like-0.000
 The-0.000
Token their
Token ID511
Feature activation-1.57e-02

Pos logit contributions
arre+0.069
inker+0.067
opter+0.067
beh+0.066
 discoveries+0.065
Neg logit contributions
 seven-0.040
 and-0.035
 only-0.031
 like-0.029
 still-0.028
Token voices
Token ID10839
Feature activation-2.42e-02

Pos logit contributions
arre+0.030
opter+0.028
beh+0.028
inker+0.028
 discoveries+0.027
Neg logit contributions
 seven-0.032
 and-0.028
 only-0.025
 like-0.024
 The-0.024
Token the
Token ID262
Feature activation+6.60e-03

Pos logit contributions
arre+0.038
inker+0.036
opter+0.036
 discoveries+0.036
 situations+0.036
Neg logit contributions
 seven-0.014
 and-0.009
 only-0.008
 like-0.007
 still-0.007
Token mess
Token ID2085
Feature activation-4.95e-02

Pos logit contributions
 seven+0.017
 and+0.015
 only+0.014
 like+0.014
 The+0.013
Neg logit contributions
arre-0.009
beh-0.009
inker-0.009
opter-0.009
 discoveries-0.008
Token and
Token ID290
Feature activation-4.88e-03

Pos logit contributions
arre+0.152
inker+0.142
opter+0.141
oving+0.140
 situations+0.138
Neg logit contributions
 seven-0.046
 The-0.019
 only-0.017
 like-0.012
 and-0.010
Token the
Token ID262
Feature activation+2.80e-04

Pos logit contributions
arre+0.014
opter+0.014
beh+0.014
inker+0.014
 discoveries+0.014
Neg logit contributions
 seven-0.005
 and-0.004
 only-0.003
 like-0.003
 The-0.003
Token qu
Token ID627
Feature activation-4.73e-02

Pos logit contributions
 seven+0.001
 and+0.001
 only+0.001
 like+0.001
 The+0.001
Neg logit contributions
arre-0.000
beh-0.000
inker-0.000
opter-0.000
 discoveries-0.000
Tokenarre
Token ID9624
Feature activation-1.16e-01

Pos logit contributions
 discoveries+0.064
 possessions+0.060
beh+0.056
 situations+0.056
 discovery+0.055
Neg logit contributions
 and-0.117
 seven-0.113
.-0.100
 The-0.100
 like-0.098
Tokenling
Token ID1359
Feature activation-1.50e-02

Pos logit contributions
 discoveries+0.204
beh+0.203
inker+0.199
igsaw+0.199
ert+0.198
Neg logit contributions
 and-0.235
 seven-0.217
 in-0.198
.-0.187
 like-0.187
Token twins
Token ID20345
Feature activation-5.15e-04

Pos logit contributions
arre+0.030
beh+0.029
inker+0.029
 discoveries+0.028
opter+0.028
Neg logit contributions
 seven-0.028
 and-0.026
 only-0.022
 like-0.022
 The-0.022
Token.
Token ID13
Feature activation-2.37e-03

Pos logit contributions
arre+0.001
 discoveries+0.001
 situations+0.001
beh+0.001
inker+0.001
Neg logit contributions
 seven-0.001
 The-0.000
 only-0.000
 He-0.000
 and-0.000
Token She
Token ID1375
Feature activation+4.11e-03

Pos logit contributions
arre+0.007
inker+0.007
opter+0.007
 discoveries+0.007
beh+0.006
Neg logit contributions
 seven-0.002
 and-0.002
 only-0.001
 like-0.001
 still-0.001
Token is
Token ID318
Feature activation-1.27e-03

Pos logit contributions
 seven+0.006
 and+0.005
 only+0.004
 like+0.004
 The+0.004
Neg logit contributions
arre-0.010
beh-0.010
opter-0.010
inker-0.010
 discoveries-0.010
Token lively
Token ID29696
Feature activation-3.41e-03

Pos logit contributions
arre+0.019
 discoveries+0.018
opter+0.017
 situations+0.017
 possessions+0.017
Neg logit contributions
 seven-0.015
 and-0.012
 only-0.011
 The-0.011
 or-0.011
Token sound
Token ID2128
Feature activation-1.37e-02

Pos logit contributions
arre+0.006
beh+0.005
inker+0.005
opter+0.005
 discoveries+0.005
Neg logit contributions
 seven-0.008
 and-0.007
 only-0.006
 like-0.006
 The-0.006
Token.
Token ID13
Feature activation-2.77e-03

Pos logit contributions
arre+0.037
opter+0.035
 discoveries+0.034
inker+0.034
oving+0.034
Neg logit contributions
 seven-0.016
 The-0.009
 only-0.009
 and-0.008
 He-0.008
Token The
Token ID383
Feature activation+1.45e-02

Pos logit contributions
arre+0.007
irs+0.006
opter+0.006
igsaw+0.006
inker+0.006
Neg logit contributions
 seven-0.004
 and-0.003
 like-0.003
 only-0.003
 or-0.003
Token qu
Token ID627
Feature activation-4.75e-02

Pos logit contributions
 seven+0.032
 and+0.029
 only+0.026
 like+0.025
 still+0.025
Neg logit contributions
arre-0.025
beh-0.025
opter-0.024
inker-0.023
 discoveries-0.023
Tokenarre
Token ID9624
Feature activation-1.12e-01

Pos logit contributions
 discoveries+0.076
 possessions+0.073
 situations+0.071
beh+0.071
ge+0.069
Neg logit contributions
 seven-0.101
 and-0.100
 The-0.087
 still-0.085
 like-0.083
Tokenling
Token ID1359
Feature activation-1.96e-02

Pos logit contributions
igsaw+0.182
 discoveries+0.181
beh+0.180
inker+0.180
arre+0.177
Neg logit contributions
 and-0.235
 seven-0.232
 in-0.203
 like-0.196
 only-0.196
Token stopped
Token ID5025
Feature activation+5.79e-03

Pos logit contributions
arre+0.043
opter+0.040
inker+0.039
 situations+0.039
 discoveries+0.039
Neg logit contributions
 seven-0.034
 and-0.026
 only-0.024
 like-0.024
 The-0.024
Token and
Token ID290
Feature activation-1.37e-02

Pos logit contributions
 seven+0.009
 The+0.006
 and+0.006
 only+0.006
 He+0.005
Neg logit contributions
arre-0.014
 discoveries-0.013
oving-0.012
inker-0.012
ert-0.012
Token they
Token ID484
Feature activation-1.71e-02

Pos logit contributions
arre+0.034
opter+0.033
beh+0.033
 discoveries+0.032
inker+0.032
Neg logit contributions
 seven-0.019
 and-0.015
 only-0.013
 like-0.013
 The-0.013
Token heard
Token ID2982
Feature activation+3.66e-03

Pos logit contributions
arre+0.037
opter+0.034
beh+0.034
inker+0.034
 discoveries+0.033
Neg logit contributions
 seven-0.029
 and-0.024
 The-0.021
 only-0.021
 or-0.021
Token them
Token ID606
Feature activation+1.48e-02

Pos logit contributions
arre+0.001
 discoveries+0.001
beh+0.001
inker+0.001
 situations+0.001
Neg logit contributions
 seven-0.001
 and-0.001
 only-0.001
 The-0.001
 like-0.001
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
 seven+0.019
 The+0.011
 only+0.010
 He+0.010
 and+0.010
Neg logit contributions
arre-0.039
 discoveries-0.039
 possessions-0.038
beh-0.038
 situations-0.036
Token never
Token ID1239
Feature activation+4.48e-03

Pos logit contributions
 seven+0.015
 and+0.012
 only+0.011
 like+0.010
 The+0.010
Neg logit contributions
arre-0.028
opter-0.027
inker-0.027
beh-0.027
 discoveries-0.026
Token threw
Token ID9617
Feature activation-2.09e-02

Pos logit contributions
 seven+0.007
 and+0.007
 only+0.006
 like+0.006
 The+0.006
Neg logit contributions
arre-0.010
inker-0.010
opter-0.010
beh-0.009
 discoveries-0.009
Token a
Token ID257
Feature activation-5.95e-03

Pos logit contributions
arre+0.044
 discoveries+0.042
 situations+0.042
beh+0.042
 possessions+0.041
Neg logit contributions
 seven-0.035
 and-0.027
 The-0.025
 only-0.025
 still-0.024
Token tant
Token ID24246
Feature activation-1.12e-01

Pos logit contributions
arre+0.011
beh+0.010
inker+0.010
opter+0.010
 discoveries+0.010
Neg logit contributions
 seven-0.012
 and-0.011
 only-0.010
 like-0.010
 The-0.009
Tokenrum
Token ID6582
Feature activation-3.52e-02

Pos logit contributions
arre+0.167
 discoveries+0.149
igsaw+0.140
uddle+0.137
opter+0.133
Neg logit contributions
 seven-0.244
 and-0.238
 or-0.213
.-0.213
:-0.206
Token again
Token ID757
Feature activation+1.79e-02

Pos logit contributions
arre+0.096
beh+0.094
inker+0.094
 situations+0.092
ert+0.092
Neg logit contributions
 seven-0.038
 and-0.020
 only-0.018
 The-0.018
 like-0.017
Token.
Token ID13
Feature activation+7.00e-03

Pos logit contributions
 seven+0.018
 and+0.008
 only+0.008
 The+0.007
 like+0.006
Neg logit contributions
arre-0.052
 discoveries-0.051
 situations-0.051
beh-0.050
 possessions-0.050
Token<|endoftext|>
Token ID50256
Feature activation-3.17e-02

Pos logit contributions
 seven+0.010
 and+0.007
 like+0.007
 only+0.006
 or+0.006
Neg logit contributions
arre-0.017
beh-0.016
 discoveries-0.016
inker-0.015
opter-0.015
TokenOnce
Token ID7454
Feature activation+2.68e-03

Pos logit contributions
arre+0.084
inker+0.081
 discoveries+0.079
opter+0.078
 situations+0.077
Neg logit contributions
 seven-0.040
 and-0.025
 only-0.024
 like-0.022
 still-0.020
Token the
Token ID262
Feature activation+2.52e-03

Pos logit contributions
 seven+0.019
 and+0.013
 only+0.011
 The+0.011
 like+0.011
Neg logit contributions
arre-0.035
 discoveries-0.035
beh-0.034
inker-0.034
 possessions-0.034
Token parents
Token ID3397
Feature activation-7.59e-03

Pos logit contributions
 seven+0.005
 and+0.004
 The+0.004
 like+0.004
 or+0.004
Neg logit contributions
arre-0.005
 discoveries-0.005
 possessions-0.004
inker-0.004
opter-0.004
Token were
Token ID547
Feature activation-1.07e-02

Pos logit contributions
arre+0.016
opter+0.015
inker+0.014
beh+0.014
 discoveries+0.014
Neg logit contributions
 seven-0.014
 The-0.010
 and-0.010
 like-0.010
 only-0.009
Token also
Token ID635
Feature activation+6.28e-03

Pos logit contributions
arre+0.024
oving+0.022
beh+0.022
 discoveries+0.022
lled+0.021
Neg logit contributions
 seven-0.016
 The-0.012
 and-0.012
 or-0.011
 He-0.011
Token qu
Token ID627
Feature activation-4.70e-02

Pos logit contributions
 seven+0.009
 and+0.007
 or+0.006
 only+0.006
 The+0.006
Neg logit contributions
arre-0.015
beh-0.014
 discoveries-0.014
inker-0.014
opter-0.013
Tokenarre
Token ID9624
Feature activation-1.12e-01

Pos logit contributions
 discoveries+0.036
 possessions+0.032
 situations+0.032
 discovery+0.028
 occasions+0.027
Neg logit contributions
 seven-0.140
 and-0.135
 still-0.124
 like-0.122
 The-0.120
Tokenling
Token ID1359
Feature activation-1.91e-02

Pos logit contributions
inker+0.174
 discoveries+0.171
arre+0.167
irs+0.163
ert+0.163
Neg logit contributions
 and-0.245
 seven-0.236
 in-0.214
 like-0.212
 only-0.201
Token.
Token ID13
Feature activation-5.52e-03

Pos logit contributions
arre+0.052
 discoveries+0.049
 situations+0.048
beh+0.048
inker+0.048
Neg logit contributions
 seven-0.021
 The-0.010
 only-0.010
 also-0.008
 still-0.008
Token The
Token ID383
Feature activation+1.24e-02

Pos logit contributions
arre+0.014
irs+0.013
inker+0.013
igsaw+0.013
opter+0.012
Neg logit contributions
 seven-0.008
 and-0.006
 like-0.005
 only-0.005
 still-0.005
Token first
Token ID717
Feature activation-3.08e-03

Pos logit contributions
 seven+0.023
 and+0.020
 The+0.019
 like+0.018
 only+0.018
Neg logit contributions
arre-0.024
 discoveries-0.023
inker-0.022
 possessions-0.022
opter-0.022
Token family
Token ID1641
Feature activation-3.42e-02

Pos logit contributions
arre+0.006
 discoveries+0.006
inker+0.006
opter+0.006
 situations+0.006
Neg logit contributions
 seven-0.006
 and-0.005
 The-0.004
 like-0.004
 only-0.004
Token they
Token ID484
Feature activation-2.50e-03

Pos logit contributions
arre+0.071
 discoveries+0.067
beh+0.067
opter+0.067
inker+0.067
Neg logit contributions
 seven-0.032
 and-0.024
 like-0.019
 The-0.019
 only-0.018
Token just
Token ID655
Feature activation-1.91e-03

Pos logit contributions
arre+0.006
inker+0.005
beh+0.005
opter+0.005
 discoveries+0.005
Neg logit contributions
 seven-0.004
 and-0.004
 only-0.003
 like-0.003
 The-0.003
Token kept
Token ID4030
Feature activation-1.45e-02

Pos logit contributions
arre+0.004
opter+0.004
beh+0.004
 discoveries+0.004
inker+0.003
Neg logit contributions
 seven-0.004
 and-0.003
 The-0.003
 only-0.003
 like-0.003
Token on
Token ID319
Feature activation-2.64e-02

Pos logit contributions
arre+0.027
beh+0.025
 discoveries+0.025
oving+0.024
inker+0.024
Neg logit contributions
 seven-0.030
 and-0.025
 The-0.023
 only-0.022
 like-0.022
Token qu
Token ID627
Feature activation-4.72e-02

Pos logit contributions
arre+0.047
beh+0.043
 discoveries+0.043
inker+0.043
oving+0.042
Neg logit contributions
 seven-0.057
 and-0.048
 only-0.044
 like-0.044
 The-0.043
Tokenarre
Token ID9624
Feature activation-1.11e-01

Pos logit contributions
 possessions+0.045
 discoveries+0.044
opter+0.042
 situations+0.039
 discovery+0.038
Neg logit contributions
 seven-0.130
 and-0.126
 The-0.114
 like-0.113
 still-0.112
Tokenling
Token ID1359
Feature activation-9.97e-03

Pos logit contributions
 discoveries+0.169
igsaw+0.168
irs+0.164
inker+0.162
arre+0.157
Neg logit contributions
 and-0.247
 seven-0.234
 in-0.212
 like-0.209
.-0.202
Token!
Token ID0
Feature activation-1.31e-02

Pos logit contributions
arre+0.028
 situations+0.027
 discoveries+0.027
beh+0.026
inker+0.026
Neg logit contributions
 seven-0.010
 and-0.006
 only-0.005
 The-0.005
 still-0.005
Token
Token ID220
Feature activation-4.31e-02

Pos logit contributions
arre+0.032
 discoveries+0.029
opter+0.029
inker+0.029
beh+0.028
Neg logit contributions
 seven-0.020
 and-0.016
 like-0.014
 only-0.013
 or-0.013
Token
Token ID198
Feature activation-1.10e-02

Pos logit contributions
arre+0.128
inker+0.122
opter+0.121
beh+0.121
 discoveries+0.121
Neg logit contributions
 seven-0.041
 and-0.029
 only-0.022
 like-0.021
 The-0.019
Token
Token ID198
Feature activation+3.58e-03

Pos logit contributions
arre+0.033
inker+0.031
opter+0.031
beh+0.031
 discoveries+0.031
Neg logit contributions
 seven-0.010
 and-0.008
 only-0.006
 like-0.006
 The-0.005
Token box
Token ID3091
Feature activation-2.27e-03

Pos logit contributions
 seven+0.032
 and+0.027
 only+0.025
 like+0.025
 The+0.025
Neg logit contributions
arre-0.032
 discoveries-0.031
opter-0.030
inker-0.030
 situations-0.030
Token and
Token ID290
Feature activation-4.51e-04

Pos logit contributions
 discovery+0.006
 discoveries+0.006
ert+0.006
arre+0.006
inker+0.006
Neg logit contributions
 seven-0.002
 only-0.001
 like-0.001
 or-0.001
 The-0.001
Token saw
Token ID2497
Feature activation-1.79e-02

Pos logit contributions
arre+0.001
beh+0.001
inker+0.001
opter+0.001
 situations+0.001
Neg logit contributions
 seven-0.001
 and-0.001
 only-0.000
 like-0.000
 The-0.000
Token a
Token ID257
Feature activation-6.27e-03

Pos logit contributions
arre+0.051
beh+0.049
inker+0.049
opter+0.049
 discoveries+0.048
Neg logit contributions
 seven-0.019
 and-0.015
 only-0.012
 like-0.011
 The-0.011
Token j
Token ID474
Feature activation-1.65e-02

Pos logit contributions
arre+0.014
opter+0.013
 discoveries+0.013
inker+0.013
beh+0.013
Neg logit contributions
 seven-0.010
 and-0.009
 only-0.008
 The-0.008
 like-0.008
Tokenigsaw
Token ID45636
Feature activation-1.11e-01

Pos logit contributions
arre+0.029
beh+0.027
inker+0.027
opter+0.027
 discoveries+0.027
Neg logit contributions
 seven-0.035
 and-0.031
 only-0.028
 like-0.028
 The-0.028
Token puzzle
Token ID15027
Feature activation+4.88e-03

Pos logit contributions
arre+0.237
beh+0.228
inker+0.223
oving+0.216
 situations+0.216
Neg logit contributions
 seven-0.197
 and-0.181
 only-0.150
 like-0.150
 The-0.143
Token inside
Token ID2641
Feature activation-2.18e-02

Pos logit contributions
 seven+0.006
 The+0.004
 only+0.004
 and+0.004
 like+0.003
Neg logit contributions
 discoveries-0.012
opter-0.012
arre-0.012
 discovery-0.012
inker-0.012
Token.
Token ID13
Feature activation-2.59e-03

Pos logit contributions
 discoveries+0.056
opter+0.054
arre+0.054
inker+0.054
 discovery+0.054
Neg logit contributions
 seven-0.027
 only-0.015
 and-0.014
 The-0.013
 like-0.012
Token He
Token ID679
Feature activation-7.52e-03

Pos logit contributions
arre+0.006
opter+0.005
igsaw+0.005
 discoveries+0.005
inker+0.005
Neg logit contributions
 seven-0.004
 like-0.003
 and-0.003
 or-0.003
 only-0.003
Token paused
Token ID24487
Feature activation+1.76e-02

Pos logit contributions
arre+0.016
opter+0.015
 discoveries+0.015
inker+0.015
beh+0.015
Neg logit contributions
 seven-0.012
 and-0.010
 like-0.009
 only-0.009
 The-0.009
Token his
Token ID465
Feature activation-2.64e-02

Pos logit contributions
arre+0.036
inker+0.035
 situations+0.033
beh+0.033
 discoveries+0.033
Neg logit contributions
 seven-0.020
 and-0.014
 only-0.013
 still-0.013
 like-0.012
Token feet
Token ID3625
Feature activation+2.58e-03

Pos logit contributions
 discoveries+0.047
 situations+0.046
arre+0.046
 possessions+0.045
 discovery+0.045
Neg logit contributions
 seven-0.048
 The-0.040
 like-0.039
 only-0.037
 and-0.036
Token and
Token ID290
Feature activation-1.13e-02

Pos logit contributions
 seven+0.002
 The+0.001
 only+0.001
 Leo+0.001
 Anna+0.001
Neg logit contributions
arre-0.007
ge-0.007
oving-0.007
beh-0.007
inker-0.007
Token threw
Token ID9617
Feature activation-4.02e-02

Pos logit contributions
arre+0.022
beh+0.020
 discoveries+0.020
 situations+0.020
oving+0.020
Neg logit contributions
 seven-0.021
 and-0.017
 only-0.015
 like-0.015
 The-0.015
Token a
Token ID257
Feature activation-1.84e-02

Pos logit contributions
arre+0.086
 situations+0.081
 discoveries+0.079
beh+0.079
 tant+0.077
Neg logit contributions
 seven-0.054
 and-0.040
 still-0.039
 only-0.039
 The-0.038
Token tant
Token ID24246
Feature activation-1.10e-01

Pos logit contributions
arre+0.036
beh+0.034
inker+0.034
 discoveries+0.033
opter+0.033
Neg logit contributions
 seven-0.036
 and-0.032
 only-0.028
 like-0.028
 The-0.027
Tokenrum
Token ID6582
Feature activation-3.77e-02

Pos logit contributions
arre+0.172
 discoveries+0.154
igsaw+0.141
 situations+0.137
opter+0.135
Neg logit contributions
 seven-0.249
 and-0.238
 or-0.211
.-0.206
 like-0.202
Token.
Token ID13
Feature activation+6.31e-03

Pos logit contributions
arre+0.095
oving+0.095
inker+0.095
ert+0.095
opter+0.093
Neg logit contributions
 seven-0.046
 The-0.023
 only-0.022
 like-0.022
 blue-0.020
Token
Token ID198
Feature activation+2.11e-03

Pos logit contributions
 seven+0.009
 like+0.006
 and+0.006
 or+0.005
 only+0.005
Neg logit contributions
arre-0.015
inker-0.013
ulk-0.013
ert-0.013
oving-0.013
Token
Token ID198
Feature activation+1.53e-02

Pos logit contributions
 seven+0.003
 and+0.002
 like+0.002
 only+0.002
 or+0.001
Neg logit contributions
arre-0.006
inker-0.005
opter-0.005
oving-0.005
irs-0.005
Token"
Token ID1
Feature activation-9.22e-04

Pos logit contributions
 seven+0.025
 and+0.022
 only+0.020
 like+0.020
 or+0.019
Neg logit contributions
arre-0.034
inker-0.031
opter-0.031
oving-0.031
beh-0.031
Token listen
Token ID6004
Feature activation+1.62e-02

Pos logit contributions
arre+0.011
opter+0.010
inker+0.010
 discoveries+0.010
beh+0.010
Neg logit contributions
 seven-0.007
 and-0.007
 like-0.006
 only-0.006
 or-0.005
Token.
Token ID13
Feature activation+9.07e-03

Pos logit contributions
 seven+0.019
 The+0.009
 only+0.009
 and+0.008
 like+0.008
Neg logit contributions
arre-0.045
inker-0.042
beh-0.042
 discoveries-0.042
oving-0.042
Token They
Token ID1119
Feature activation+1.15e-03

Pos logit contributions
 seven+0.012
 and+0.009
 like+0.008
 only+0.007
 still+0.007
Neg logit contributions
arre-0.023
 situations-0.021
inker-0.021
 discoveries-0.021
oving-0.021
Token kept
Token ID4030
Feature activation-1.40e-02

Pos logit contributions
 seven+0.002
 and+0.002
 The+0.001
 or+0.001
 like+0.001
Neg logit contributions
arre-0.003
beh-0.002
opter-0.002
oving-0.002
inker-0.002
Token qu
Token ID627
Feature activation-4.79e-02

Pos logit contributions
arre+0.027
beh+0.025
oving+0.024
 discoveries+0.024
 situations+0.024
Neg logit contributions
 seven-0.028
 and-0.023
 like-0.021
 The-0.021
 only-0.021
Tokenarre
Token ID9624
Feature activation-1.10e-01

Pos logit contributions
 discoveries+0.033
opter+0.031
 possessions+0.031
ert+0.029
 discovery+0.028
Neg logit contributions
 and-0.146
 seven-0.145
 The-0.131
 like-0.130
 still-0.130
Tokenling
Token ID1359
Feature activation-9.04e-03

Pos logit contributions
 discoveries+0.201
igsaw+0.199
ert+0.194
irs+0.194
opter+0.188
Neg logit contributions
 and-0.220
 seven-0.208
 in-0.185
 like-0.182
.-0.173
Token with
Token ID351
Feature activation+1.66e-02

Pos logit contributions
arre+0.025
 situations+0.024
 discoveries+0.023
 possessions+0.023
beh+0.023
Neg logit contributions
 seven-0.009
 only-0.004
 The-0.003
 I-0.003
 also-0.003
Token their
Token ID511
Feature activation-1.14e-02

Pos logit contributions
 seven+0.019
 and+0.015
 only+0.012
 like+0.012
 The+0.012
Neg logit contributions
arre-0.046
opter-0.044
inker-0.044
beh-0.044
 discoveries-0.043
Token mom
Token ID1995
Feature activation+1.72e-02

Pos logit contributions
arre+0.028
 discoveries+0.027
 possessions+0.027
inker+0.026
 situations+0.026
Neg logit contributions
 seven-0.016
 and-0.012
 like-0.011
 The-0.010
 only-0.010
Token.
Token ID13
Feature activation+2.57e-03

Pos logit contributions
 seven+0.023
 The+0.012
 He+0.012
 only+0.011
 Tw+0.011
Neg logit contributions
arre-0.042
beh-0.040
 possessions-0.039
 situations-0.039
ge-0.038
Token.
Token ID13
Feature activation+1.58e-02

Pos logit contributions
 seven+0.005
 The+0.003
 only+0.002
 low+0.002
 still+0.002
Neg logit contributions
 discoveries-0.010
arre-0.010
 discovery-0.010
 situations-0.010
 possessions-0.010
Token They
Token ID1119
Feature activation+3.28e-03

Pos logit contributions
 seven+0.025
 and+0.019
 only+0.018
 like+0.018
 in+0.017
Neg logit contributions
arre-0.033
irs-0.030
igsaw-0.028
 situations-0.028
 discoveries-0.028
Token had
Token ID550
Feature activation-1.31e-02

Pos logit contributions
 seven+0.006
 and+0.005
 The+0.005
 or+0.004
 He+0.004
Neg logit contributions
arre-0.006
opter-0.006
 discoveries-0.005
beh-0.005
inker-0.005
Token stopped
Token ID5025
Feature activation+1.18e-02

Pos logit contributions
arre+0.030
beh+0.028
 discoveries+0.028
oving+0.027
 possessions+0.027
Neg logit contributions
 seven-0.020
 and-0.017
 The-0.014
 or-0.014
 like-0.014
Token qu
Token ID627
Feature activation-3.21e-02

Pos logit contributions
 seven+0.022
 and+0.016
 only+0.015
 The+0.015
 like+0.015
Neg logit contributions
arre-0.025
 discoveries-0.023
beh-0.023
inker-0.023
 situations-0.023
Tokenarre
Token ID9624
Feature activation-1.10e-01

Pos logit contributions
 discoveries+0.019
 possessions+0.018
opter+0.016
 situations+0.016
ert+0.015
Neg logit contributions
 seven-0.101
 and-0.101
 The-0.091
 like-0.091
 still-0.090
Tokenling
Token ID1359
Feature activation-8.53e-03

Pos logit contributions
 discoveries+0.167
igsaw+0.166
irs+0.164
inker+0.162
opter+0.162
Neg logit contributions
 and-0.243
 seven-0.231
 in-0.208
 like-0.203
 or-0.195
Token.
Token ID13
Feature activation+1.39e-03

Pos logit contributions
arre+0.024
 discoveries+0.023
 situations+0.023
inker+0.022
oving+0.022
Neg logit contributions
 seven-0.008
 only-0.004
 The-0.003
 also-0.003
 still-0.003
Token To
Token ID1675
Feature activation-1.60e-03

Pos logit contributions
 seven+0.002
 and+0.002
 only+0.001
 like+0.001
 or+0.001
Neg logit contributions
arre-0.003
irs-0.003
igsaw-0.003
 discoveries-0.003
 situations-0.003
Token celebrate
Token ID10648
Feature activation+1.26e-02

Pos logit contributions
arre+0.004
opter+0.004
inker+0.004
beh+0.004
 discoveries+0.004
Neg logit contributions
 seven-0.002
 and-0.002
 only-0.001
 like-0.001
 The-0.001
Token,
Token ID11
Feature activation-1.32e-02

Pos logit contributions
 seven+0.013
 and+0.007
 only+0.007
 The+0.007
 like+0.006
Neg logit contributions
arre-0.036
 discoveries-0.035
inker-0.034
 situations-0.034
opter-0.034
Token sharing
Token ID7373
Feature activation+2.50e-02

Pos logit contributions
arre+0.052
opter+0.050
inker+0.050
beh+0.049
 discoveries+0.049
Neg logit contributions
 seven-0.031
 and-0.027
 only-0.024
 like-0.022
 The-0.022
Token is
Token ID318
Feature activation+1.89e-02

Pos logit contributions
 seven+0.034
 and+0.028
 only+0.024
 like+0.023
 The+0.023
Neg logit contributions
arre-0.063
beh-0.060
opter-0.060
inker-0.060
 discoveries-0.060
Token important
Token ID1593
Feature activation-1.54e-02

Pos logit contributions
 seven+0.033
 and+0.027
 The+0.024
 only+0.023
 like+0.023
Neg logit contributions
arre-0.041
beh-0.039
 discoveries-0.038
inker-0.037
oving-0.037
Token and
Token ID290
Feature activation-3.95e-03

Pos logit contributions
arre+0.048
 discoveries+0.046
beh+0.045
 situations+0.045
inker+0.045
Neg logit contributions
 seven-0.013
 and-0.006
 The-0.005
 only-0.005
 like-0.004
Token qu
Token ID627
Feature activation-4.92e-02

Pos logit contributions
arre+0.010
inker+0.010
opter+0.010
oving+0.009
 situations+0.009
Neg logit contributions
 seven-0.005
 and-0.005
 only-0.004
 like-0.004
 still-0.004
Tokenarre
Token ID9624
Feature activation-1.09e-01

Pos logit contributions
 discoveries+0.032
ert+0.028
 possessions+0.028
opter+0.027
 discovery+0.027
Neg logit contributions
 seven-0.148
 and-0.148
 The-0.136
 like-0.135
 still-0.134
Tokenlling
Token ID2680
Feature activation-4.17e-02

Pos logit contributions
ert+0.187
igsaw+0.186
 discoveries+0.184
inker+0.183
irs+0.181
Neg logit contributions
 and-0.224
 seven-0.198
 in-0.197
 like-0.193
.-0.178
Token is
Token ID318
Feature activation+1.00e-02

Pos logit contributions
arre+0.114
 discoveries+0.107
opter+0.106
beh+0.106
 situations+0.106
Neg logit contributions
 seven-0.051
 and-0.032
 The-0.029
 only-0.028
 like-0.026
Token not
Token ID407
Feature activation+3.32e-03

Pos logit contributions
 seven+0.017
 and+0.014
 The+0.012
 only+0.012
 or+0.012
Neg logit contributions
arre-0.021
beh-0.021
 discoveries-0.020
 possessions-0.020
oving-0.020
Token good
Token ID922
Feature activation+1.00e-02

Pos logit contributions
 seven+0.005
 and+0.004
 only+0.004
 like+0.004
 The+0.003
Neg logit contributions
arre-0.008
inker-0.008
opter-0.007
beh-0.007
 discoveries-0.007
Token.
Token ID13
Feature activation-2.34e-03

Pos logit contributions
 seven+0.009
 and+0.006
 only+0.005
 The+0.005
 like+0.004
Neg logit contributions
arre-0.030
beh-0.029
 discoveries-0.029
inker-0.028
opter-0.028
Token on
Token ID319
Feature activation+3.33e-03

Pos logit contributions
 seven+0.000
 and+0.000
 like+0.000
 only+0.000
 The+0.000
Neg logit contributions
arre-0.000
opter-0.000
beh-0.000
oving-0.000
inker-0.000
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
 seven+0.004
 like+0.003
 and+0.002
 only+0.002
 The+0.002
Neg logit contributions
arre-0.009
 situations-0.008
opter-0.008
 discoveries-0.008
 possessions-0.008
Token they
Token ID484
Feature activation-4.85e-03

Pos logit contributions
arre+0.039
 discoveries+0.037
beh+0.036
opter+0.036
 situations+0.035
Neg logit contributions
 seven-0.024
 and-0.017
 like-0.014
 only-0.014
 still-0.014
Token never
Token ID1239
Feature activation+3.00e-03

Pos logit contributions
arre+0.011
opter+0.010
beh+0.010
inker+0.010
 discoveries+0.010
Neg logit contributions
 seven-0.008
 and-0.006
 The-0.005
 like-0.005
 only-0.005
Token qu
Token ID627
Feature activation-3.90e-02

Pos logit contributions
 seven+0.005
 and+0.005
 only+0.004
 like+0.004
 The+0.004
Neg logit contributions
arre-0.006
inker-0.006
opter-0.006
 discoveries-0.006
oving-0.006
Tokenarre
Token ID9624
Feature activation-1.09e-01

Pos logit contributions
 discoveries+0.023
 possessions+0.021
opter+0.020
 situations+0.019
 discovery+0.018
Neg logit contributions
 seven-0.122
 and-0.121
 The-0.112
 like-0.111
 still-0.110
Tokenled
Token ID992
Feature activation+3.69e-03

Pos logit contributions
igsaw+0.171
 discoveries+0.171
ert+0.167
inker+0.165
irs+0.164
Neg logit contributions
 and-0.237
 seven-0.217
 in-0.216
 like-0.202
 only-0.198
Token again
Token ID757
Feature activation+1.75e-02

Pos logit contributions
 seven+0.007
 The+0.005
 and+0.004
 Leo+0.004
 He+0.004
Neg logit contributions
arre-0.008
 discoveries-0.007
beh-0.007
 situations-0.007
inker-0.007
Token over
Token ID625
Feature activation+2.09e-02

Pos logit contributions
 seven+0.016
 and+0.007
 only+0.006
 The+0.006
 like+0.005
Neg logit contributions
arre-0.054
 discoveries-0.051
 situations-0.051
beh-0.050
inker-0.050
Token clothes
Token ID8242
Feature activation-2.62e-03

Pos logit contributions
 seven+0.025
 and+0.016
 only+0.014
 The+0.013
 like+0.013
Neg logit contributions
arre-0.057
 discoveries-0.054
 situations-0.054
inker-0.054
beh-0.053
Token.
Token ID13
Feature activation-2.26e-03

Pos logit contributions
arre+0.006
inker+0.006
 discoveries+0.006
opter+0.006
ge+0.006
Neg logit contributions
 seven-0.004
 The-0.002
 eight-0.002
 only-0.002
 He-0.002
Token but
Token ID475
Feature activation-1.45e-02

Pos logit contributions
arre+0.036
 discoveries+0.035
 discovery+0.033
 possessions+0.032
beh+0.032
Neg logit contributions
 seven-0.020
 only-0.011
 The-0.011
 eight-0.010
 still-0.010
Token today
Token ID1909
Feature activation+5.81e-03

Pos logit contributions
arre+0.034
 discoveries+0.032
beh+0.032
inker+0.031
 possessions+0.031
Neg logit contributions
 seven-0.022
 and-0.016
 The-0.014
 like-0.014
 only-0.014
Token they
Token ID484
Feature activation+5.28e-03

Pos logit contributions
 seven+0.009
 and+0.006
 only+0.006
 like+0.006
 The+0.006
Neg logit contributions
arre-0.014
 discoveries-0.014
opter-0.013
inker-0.013
beh-0.013
Token were
Token ID547
Feature activation-5.45e-03

Pos logit contributions
 seven+0.010
 and+0.008
 The+0.008
 only+0.007
 like+0.007
Neg logit contributions
arre-0.011
opter-0.010
beh-0.010
inker-0.010
 discoveries-0.010
Token qu
Token ID627
Feature activation-4.05e-02

Pos logit contributions
arre+0.012
beh+0.011
oving+0.011
 discoveries+0.010
 possessions+0.010
Neg logit contributions
 seven-0.009
 and-0.007
 The-0.006
 or-0.006
 He-0.006
Tokenarre
Token ID9624
Feature activation-1.09e-01

Pos logit contributions
 discoveries+0.028
 possessions+0.027
 situations+0.024
 discovery+0.022
opter+0.022
Neg logit contributions
 seven-0.124
 and-0.119
 still-0.111
 like-0.109
 only-0.108
Tokenling
Token ID1359
Feature activation-1.74e-02

Pos logit contributions
igsaw+0.160
 discoveries+0.159
inker+0.154
irs+0.154
opter+0.151
Neg logit contributions
 and-0.239
 seven-0.234
 in-0.219
 like-0.209
 only-0.201
Token.
Token ID13
Feature activation-5.94e-03

Pos logit contributions
arre+0.047
 discoveries+0.045
 situations+0.045
beh+0.044
 possessions+0.043
Neg logit contributions
 seven-0.019
 only-0.011
 The-0.010
 still-0.009
 and-0.009
Token Jenna
Token ID45377
Feature activation-2.41e-02

Pos logit contributions
arre+0.015
inker+0.014
 discoveries+0.014
opter+0.013
 situations+0.013
Neg logit contributions
 seven-0.009
 and-0.007
 only-0.006
 like-0.006
 or-0.006
Token wanted
Token ID2227
Feature activation-1.24e-02

Pos logit contributions
arre+0.050
opter+0.046
oving+0.045
 discoveries+0.044
beh+0.044
Neg logit contributions
 seven-0.044
 and-0.031
 The-0.031
 like-0.030
 only-0.030
Token to
Token ID284
Feature activation-1.64e-02

Pos logit contributions
arre+0.034
 discoveries+0.033
beh+0.033
inker+0.033
opter+0.032
Neg logit contributions
 seven-0.013
 and-0.010
 only-0.008
 The-0.007
 like-0.007
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
arre+0.061
beh+0.057
 discoveries+0.057
inker+0.056
 situations+0.056
Neg logit contributions
 seven-0.040
 and-0.030
 only-0.027
 The-0.027
 like-0.025
Token their
Token ID511
Feature activation-2.61e-02

Pos logit contributions
arre+0.044
inker+0.042
opter+0.042
beh+0.041
oving+0.041
Neg logit contributions
 seven-0.036
 and-0.033
 only-0.030
 like-0.029
 The-0.028
Token children
Token ID1751
Feature activation-3.07e-02

Pos logit contributions
arre+0.050
opter+0.047
beh+0.047
inker+0.047
 discoveries+0.046
Neg logit contributions
 seven-0.052
 and-0.045
 only-0.041
 like-0.040
 The-0.040
Token,
Token ID11
Feature activation-2.89e-02

Pos logit contributions
arre+0.086
beh+0.082
 discoveries+0.081
inker+0.081
opter+0.081
Neg logit contributions
 seven-0.035
 and-0.024
 only-0.020
 The-0.019
 like-0.019
Token qu
Token ID627
Feature activation-4.63e-02

Pos logit contributions
arre+0.072
 discoveries+0.067
 situations+0.065
inker+0.065
beh+0.065
Neg logit contributions
 seven-0.041
 and-0.025
 only-0.024
 The-0.023
 still-0.022
Tokenarre
Token ID9624
Feature activation-1.08e-01

Pos logit contributions
 discoveries+0.057
 possessions+0.055
 situations+0.053
opter+0.052
 discovery+0.049
Neg logit contributions
 and-0.115
 seven-0.115
 still-0.100
 The-0.099
 like-0.097
Tokenling
Token ID1359
Feature activation-2.18e-02

Pos logit contributions
igsaw+0.219
 discoveries+0.207
 weaknesses+0.203
irs+0.196
beh+0.195
Neg logit contributions
 and-0.201
 seven-0.178
 in-0.175
 like-0.154
.-0.153
Token.
Token ID13
Feature activation-5.47e-03

Pos logit contributions
arre+0.063
inker+0.060
beh+0.060
 discoveries+0.060
 situations+0.060
Neg logit contributions
 seven-0.022
 and-0.014
 only-0.012
 still-0.011
 The-0.011
Token
Token ID220
Feature activation-3.66e-02

Pos logit contributions
arre+0.015
opter+0.013
 discoveries+0.013
inker+0.013
oving+0.013
Neg logit contributions
 seven-0.007
 and-0.005
 like-0.004
 only-0.004
 or-0.004
Token
Token ID198
Feature activation-1.02e-02

Pos logit contributions
arre+0.115
beh+0.111
opter+0.111
 discoveries+0.110
inker+0.110
Neg logit contributions
 seven-0.026
 and-0.020
 The-0.013
 only-0.013
 like-0.011
Token
Token ID198
Feature activation+4.02e-03

Pos logit contributions
arre+0.030
inker+0.029
opter+0.029
beh+0.028
 discoveries+0.028
Neg logit contributions
 seven-0.010
 and-0.007
 only-0.005
 like-0.005
 The-0.005
Token together
Token ID1978
Feature activation-4.12e-03

Pos logit contributions
 seven+0.006
 and+0.005
 only+0.004
 like+0.004
 The+0.004
Neg logit contributions
arre-0.008
opter-0.008
beh-0.008
inker-0.008
 discoveries-0.008
Token.
Token ID13
Feature activation-3.21e-03

Pos logit contributions
arre+0.011
 discoveries+0.010
 situations+0.010
oving+0.010
beh+0.010
Neg logit contributions
 seven-0.005
 The-0.003
 and-0.002
 only-0.002
 He-0.002
Token They
Token ID1119
Feature activation-1.40e-02

Pos logit contributions
arre+0.008
 discoveries+0.007
 situations+0.007
inker+0.007
ert+0.007
Neg logit contributions
 seven-0.005
 and-0.004
 only-0.004
 like-0.004
 or-0.003
Token never
Token ID1239
Feature activation-1.82e-03

Pos logit contributions
arre+0.032
beh+0.030
opter+0.030
inker+0.030
 discoveries+0.030
Neg logit contributions
 seven-0.023
 and-0.019
 only-0.017
 The-0.016
 like-0.016
Token qu
Token ID627
Feature activation-4.16e-02

Pos logit contributions
arre+0.004
opter+0.004
inker+0.004
irs+0.004
 discoveries+0.004
Neg logit contributions
 seven-0.003
 and-0.003
 only-0.003
 like-0.003
 The-0.002
Tokenarre
Token ID9624
Feature activation-1.08e-01

Pos logit contributions
 discoveries+0.024
 possessions+0.024
opter+0.021
 situations+0.019
 discovery+0.019
Neg logit contributions
 seven-0.132
 and-0.130
 like-0.120
 The-0.120
 still-0.117
Tokenled
Token ID992
Feature activation+5.33e-03

Pos logit contributions
igsaw+0.156
 discoveries+0.152
inker+0.148
ert+0.148
irs+0.145
Neg logit contributions
 and-0.249
 seven-0.233
 in-0.227
 like-0.215
 only-0.206
Token again
Token ID757
Feature activation+1.39e-02

Pos logit contributions
 seven+0.008
 The+0.006
 only+0.005
 and+0.005
 He+0.005
Neg logit contributions
arre-0.013
 discoveries-0.012
 discovery-0.011
beh-0.011
 situations-0.011
Token.
Token ID13
Feature activation-3.84e-03

Pos logit contributions
 seven+0.011
 and+0.004
 only+0.004
 The+0.003
 like+0.003
Neg logit contributions
arre-0.044
 discoveries-0.042
inker-0.042
 situations-0.041
beh-0.041
Token<|endoftext|>
Token ID50256
Feature activation-3.22e-02

Pos logit contributions
arre+0.010
 discoveries+0.009
inker+0.009
 situations+0.009
opter+0.009
Neg logit contributions
 seven-0.005
 and-0.004
 only-0.003
 like-0.003
 or-0.003
TokenOnce
Token ID7454
Feature activation+2.98e-04

Pos logit contributions
arre+0.070
inker+0.067
opter+0.064
 discoveries+0.064
ert+0.063
Neg logit contributions
 seven-0.055
 only-0.039
 and-0.038
 like-0.038
 still-0.036
Token little
Token ID1310
Feature activation-4.86e-02

Pos logit contributions
arre+0.093
beh+0.088
opter+0.088
inker+0.088
 discoveries+0.087
Neg logit contributions
 seven-0.064
 and-0.054
 only-0.048
 like-0.046
 The-0.045
Token birds
Token ID10087
Feature activation+5.63e-04

Pos logit contributions
arre+0.099
inker+0.092
beh+0.092
opter+0.092
 discoveries+0.092
Neg logit contributions
 seven-0.090
 and-0.078
 only-0.070
 like-0.068
 The-0.068
Token that
Token ID326
Feature activation-3.28e-02

Pos logit contributions
 seven+0.001
 The+0.001
 and+0.001
 only+0.001
 like+0.001
Neg logit contributions
arre-0.001
beh-0.001
opter-0.001
inker-0.001
 discoveries-0.001
Token were
Token ID547
Feature activation-1.22e-02

Pos logit contributions
arre+0.070
opter+0.066
beh+0.066
inker+0.066
 discoveries+0.065
Neg logit contributions
 seven-0.057
 and-0.049
 only-0.044
 like-0.042
 The-0.042
Token qu
Token ID627
Feature activation-4.44e-02

Pos logit contributions
arre+0.021
beh+0.020
oving+0.020
inker+0.019
 discoveries+0.019
Neg logit contributions
 seven-0.025
 The-0.020
 and-0.019
 He-0.019
 like-0.018
Tokenarre
Token ID9624
Feature activation-1.07e-01

Pos logit contributions
 discoveries+0.057
 possessions+0.056
 discovery+0.050
 situations+0.050
opter+0.048
Neg logit contributions
 seven-0.110
 and-0.108
 still-0.095
 like-0.095
 only-0.092
Tokenling
Token ID1359
Feature activation-1.38e-02

Pos logit contributions
 discoveries+0.187
igsaw+0.186
 weaknesses+0.180
arre+0.178
 possessions+0.178
Neg logit contributions
 and-0.221
 seven-0.202
 in-0.198
 like-0.187
 or-0.175
Token.
Token ID13
Feature activation-5.80e-03

Pos logit contributions
arre+0.036
beh+0.035
inker+0.034
 discoveries+0.034
 situations+0.034
Neg logit contributions
 seven-0.017
 The-0.009
 only-0.009
 also-0.007
 still-0.007
Token
Token ID198
Feature activation-1.59e-02

Pos logit contributions
arre+0.015
opter+0.013
 situations+0.013
inker+0.013
 discoveries+0.013
Neg logit contributions
 seven-0.007
 like-0.005
 still-0.005
 and-0.005
 only-0.005
Token
Token ID198
Feature activation-1.07e-03

Pos logit contributions
arre+0.045
inker+0.043
opter+0.043
 discoveries+0.042
beh+0.042
Neg logit contributions
 seven-0.017
 and-0.011
 only-0.010
 like-0.009
 still-0.008
TokenThe
Token ID464
Feature activation+3.12e-02

Pos logit contributions
arre+0.003
opter+0.002
inker+0.002
beh+0.002
oving+0.002
Neg logit contributions
 seven-0.002
 and-0.001
 only-0.001
 like-0.001
 or-0.001
Token good
Token ID922
Feature activation-8.40e-03

Pos logit contributions
arre+0.025
beh+0.023
oving+0.023
 discoveries+0.022
opter+0.021
Neg logit contributions
 seven-0.019
 and-0.015
 The-0.013
 or-0.013
 like-0.013
Token friends
Token ID2460
Feature activation+5.52e-03

Pos logit contributions
arre+0.014
beh+0.013
inker+0.013
 discoveries+0.013
opter+0.012
Neg logit contributions
 seven-0.019
 and-0.017
 only-0.015
 The-0.015
 like-0.015
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
 seven+0.008
 eight+0.005
 blue+0.005
 brown+0.004
 The+0.004
Neg logit contributions
arre-0.013
 discoveries-0.012
 situations-0.012
beh-0.012
inker-0.011
Token never
Token ID1239
Feature activation-5.46e-03

Pos logit contributions
arre+0.027
beh+0.026
opter+0.026
inker+0.026
 discoveries+0.025
Neg logit contributions
 seven-0.016
 and-0.014
 only-0.012
 like-0.011
 The-0.011
Token qu
Token ID627
Feature activation-4.36e-02

Pos logit contributions
inker+0.013
opter+0.013
arre+0.013
irs+0.013
 discoveries+0.012
Neg logit contributions
 seven-0.008
 and-0.008
 only-0.007
 like-0.007
 The-0.006
Tokenarre
Token ID9624
Feature activation-1.07e-01

Pos logit contributions
 discoveries+0.030
opter+0.029
 possessions+0.028
 discovery+0.024
ert+0.023
Neg logit contributions
 seven-0.132
 and-0.132
 The-0.121
 like-0.121
 still-0.120
Tokenled
Token ID992
Feature activation+9.15e-04

Pos logit contributions
igsaw+0.194
 discoveries+0.191
ert+0.188
irs+0.187
inker+0.181
Neg logit contributions
 and-0.213
 in-0.199
 seven-0.188
 like-0.183
 only-0.174
Token again
Token ID757
Feature activation+1.52e-02

Pos logit contributions
 seven+0.002
 The+0.001
 and+0.001
 only+0.001
 also+0.001
Neg logit contributions
arre-0.002
beh-0.002
 discoveries-0.002
 situations-0.002
inker-0.002
Token.
Token ID13
Feature activation-2.07e-03

Pos logit contributions
 seven+0.014
 and+0.007
 only+0.006
 The+0.006
 like+0.005
Neg logit contributions
arre-0.045
 discoveries-0.044
beh-0.043
 situations-0.043
oving-0.043
Token<|endoftext|>
Token ID50256
Feature activation-3.09e-02

Pos logit contributions
arre+0.006
 discoveries+0.005
inker+0.005
beh+0.005
igsaw+0.005
Neg logit contributions
 seven-0.003
 and-0.002
 only-0.002
 like-0.002
 still-0.001
TokenOnce
Token ID7454
Feature activation+3.41e-03

Pos logit contributions
arre+0.067
inker+0.064
opter+0.061
 discoveries+0.061
 situations+0.060
Neg logit contributions
 seven-0.054
 and-0.039
 only-0.039
 like-0.038
 still-0.036
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
 seven+0.037
 The+0.022
 only+0.020
 still+0.018
 blue+0.018
Neg logit contributions
 discoveries-0.065
 arose-0.064
ert-0.064
inker-0.064
 discovery-0.063
Token She
Token ID1375
Feature activation+1.97e-02

Pos logit contributions
 seven+0.030
 and+0.021
 only+0.020
 like+0.020
 still+0.019
Neg logit contributions
arre-0.051
igsaw-0.048
inker-0.047
 situations-0.047
 discoveries-0.047
Token asked
Token ID1965
Feature activation-2.65e-02

Pos logit contributions
 seven+0.034
 and+0.027
 The+0.027
 like+0.026
 or+0.025
Neg logit contributions
arre-0.040
opter-0.037
beh-0.037
 discoveries-0.037
inker-0.037
Token Tommy
Token ID19919
Feature activation+3.36e-03

Pos logit contributions
arre+0.061
 discoveries+0.060
inker+0.060
 situations+0.058
ert+0.058
Neg logit contributions
 seven-0.039
 and-0.029
 only-0.026
 still-0.025
 like-0.025
Token if
Token ID611
Feature activation+3.63e-02

Pos logit contributions
 seven+0.005
 only+0.003
 The+0.003
 and+0.003
 blue+0.003
Neg logit contributions
arre-0.008
 possessions-0.008
 discoveries-0.008
ert-0.007
ge-0.007
Token she
Token ID673
Feature activation+5.00e-02

Pos logit contributions
 seven+0.056
 and+0.044
 like+0.037
 The+0.037
 only+0.036
Neg logit contributions
arre-0.083
beh-0.081
 discoveries-0.080
inker-0.080
opter-0.080
Token could
Token ID714
Feature activation+1.14e-02

Pos logit contributions
 seven+0.079
 and+0.060
 The+0.056
 like+0.052
 or+0.051
Neg logit contributions
arre-0.108
opter-0.104
 discoveries-0.104
beh-0.104
 possessions-0.102
Token march
Token ID9960
Feature activation-1.28e-02

Pos logit contributions
 seven+0.023
 and+0.017
 The+0.016
 only+0.015
 blue+0.015
Neg logit contributions
arre-0.021
 discoveries-0.020
oving-0.020
beh-0.020
inker-0.020
Token with
Token ID351
Feature activation+1.65e-02

Pos logit contributions
arre+0.034
inker+0.034
 discoveries+0.033
beh+0.033
 situations+0.033
Neg logit contributions
 seven-0.014
 and-0.009
 only-0.008
 The-0.008
 still-0.007
Token him
Token ID683
Feature activation+6.56e-03

Pos logit contributions
 seven+0.024
 and+0.019
 only+0.016
 The+0.016
 like+0.016
Neg logit contributions
arre-0.039
 discoveries-0.038
inker-0.038
beh-0.037
 situations-0.037
Token and
Token ID290
Feature activation+3.11e-03

Pos logit contributions
 seven+0.007
 The+0.004
 only+0.004
 and+0.003
 eight+0.003
Neg logit contributions
 discoveries-0.017
arre-0.017
 possessions-0.017
inker-0.017
 situations-0.017
Token day
Token ID1110
Feature activation+3.06e-02

Pos logit contributions
 seven+0.052
 and+0.044
 only+0.041
 The+0.040
 like+0.040
Neg logit contributions
arre-0.048
 discoveries-0.045
opter-0.044
inker-0.044
beh-0.044
Token.
Token ID13
Feature activation-1.85e-03

Pos logit contributions
 seven+0.047
 only+0.032
 The+0.032
 He+0.031
 still+0.029
Neg logit contributions
 discoveries-0.065
arre-0.064
ert-0.062
 discovery-0.062
opter-0.062
Token<|endoftext|>
Token ID50256
Feature activation-3.25e-02

Pos logit contributions
arre+0.005
 discoveries+0.005
beh+0.005
opter+0.005
 situations+0.004
Neg logit contributions
 seven-0.002
 and-0.002
 only-0.002
 like-0.001
 or-0.001
TokenOnce
Token ID7454
Feature activation+2.78e-04

Pos logit contributions
arre+0.063
inker+0.058
opter+0.057
ert+0.055
 situations+0.055
Neg logit contributions
 seven-0.064
 only-0.047
 and-0.047
 like-0.046
 still-0.044
Token upon
Token ID2402
Feature activation+3.58e-02

Pos logit contributions
 seven+0.001
 and+0.000
 only+0.000
 like+0.000
 The+0.000
Neg logit contributions
arre-0.001
opter-0.000
inker-0.000
 discoveries-0.000
beh-0.000
Token a
Token ID257
Feature activation+3.48e-02

Pos logit contributions
 seven+0.035
 and+0.023
 only+0.018
 The+0.016
 like+0.015
Neg logit contributions
arre-0.103
 discoveries-0.099
inker-0.099
beh-0.098
opter-0.098
Token time
Token ID640
Feature activation+1.80e-02

Pos logit contributions
 seven+0.056
 and+0.043
 The+0.041
 only+0.039
 He+0.039
Neg logit contributions
arre-0.078
 discoveries-0.069
oving-0.069
 discovery-0.066
inker-0.066
Token there
Token ID612
Feature activation-1.77e-02

Pos logit contributions
 seven+0.016
 and+0.010
 only+0.008
 like+0.007
 The+0.007
Neg logit contributions
arre-0.053
opter-0.051
inker-0.051
 discoveries-0.051
beh-0.050
Token was
Token ID373
Feature activation+2.09e-02

Pos logit contributions
arre+0.039
opter+0.037
inker+0.037
 discoveries+0.036
beh+0.036
Neg logit contributions
 seven-0.030
 and-0.025
 only-0.022
 like-0.022
 The-0.022
Token a
Token ID257
Feature activation+1.32e-02

Pos logit contributions
 seven+0.023
 and+0.016
 only+0.013
 or+0.012
 still+0.012
Neg logit contributions
arre-0.056
 discoveries-0.054
inker-0.053
beh-0.052
opter-0.052
Token little
Token ID1310
Feature activation-1.08e-02

Pos logit contributions
 seven+0.024
 and+0.019
 or+0.018
 like+0.018
 The+0.018
Neg logit contributions
arre-0.024
opter-0.024
 discoveries-0.023
 discovery-0.022
inker-0.022
Token a
Token ID257
Feature activation+1.58e-02

Pos logit contributions
 seven+0.032
 and+0.024
 The+0.023
 He+0.021
 or+0.019
Neg logit contributions
arre-0.047
beh-0.046
 discoveries-0.045
oving-0.045
making-0.044
Token sign
Token ID1051
Feature activation+3.09e-02

Pos logit contributions
 seven+0.028
 and+0.024
 The+0.022
 or+0.021
 only+0.021
Neg logit contributions
arre-0.031
 discoveries-0.030
opter-0.030
 discovery-0.029
igsaw-0.028
Token that
Token ID326
Feature activation-1.16e-02

Pos logit contributions
 seven+0.051
 The+0.039
 only+0.038
 still+0.037
 also+0.035
Neg logit contributions
opter-0.059
arre-0.055
beh-0.052
making-0.052
ade-0.051
Token something
Token ID1223
Feature activation-1.43e-02

Pos logit contributions
arre+0.027
opter+0.025
beh+0.025
 discoveries+0.025
inker+0.025
Neg logit contributions
 seven-0.018
 and-0.015
 only-0.013
 like-0.013
 The-0.013
Token good
Token ID922
Feature activation+2.35e-02

Pos logit contributions
arre+0.027
 discoveries+0.027
opter+0.027
beh+0.026
 situations+0.026
Neg logit contributions
 seven-0.027
 and-0.021
 The-0.020
 only-0.019
 He-0.018
Token was
Token ID373
Feature activation+3.48e-02

Pos logit contributions
 seven+0.038
 The+0.028
 and+0.028
 only+0.026
 still+0.025
Neg logit contributions
arre-0.052
opter-0.050
 discoveries-0.050
beh-0.048
 situations-0.048
Token going
Token ID1016
Feature activation+2.77e-02

Pos logit contributions
 seven+0.051
 The+0.039
 He+0.037
 and+0.037
 only+0.034
Neg logit contributions
arre-0.074
 discoveries-0.074
oving-0.073
 possessions-0.073
beh-0.072
Token to
Token ID284
Feature activation-4.95e-03

Pos logit contributions
 seven+0.037
 The+0.026
 also+0.026
 only+0.025
 and+0.025
Neg logit contributions
arre-0.063
beh-0.062
inker-0.059
opter-0.059
oving-0.059
Token happen
Token ID1645
Feature activation+1.47e-03

Pos logit contributions
arre+0.010
opter+0.009
inker+0.009
beh+0.009
 discoveries+0.009
Neg logit contributions
 seven-0.010
 and-0.008
 only-0.007
 like-0.007
 The-0.007
Token."
Token ID526
Feature activation-1.00e-02

Pos logit contributions
 seven+0.002
 The+0.002
 only+0.002
 and+0.002
 He+0.001
Neg logit contributions
arre-0.003
oving-0.003
 discoveries-0.003
ert-0.003
opter-0.003
Token And
Token ID843
Feature activation+1.86e-02

Pos logit contributions
arre+0.026
 discoveries+0.025
opter+0.025
beh+0.025
inker+0.025
Neg logit contributions
 seven-0.014
 and-0.010
 only-0.009
 like-0.008
 or-0.008
Token it
Token ID340
Feature activation+5.44e-03

Pos logit contributions
 discoveries+0.000
 discovery+0.000
arre+0.000
 possessions+0.000
making+0.000
Neg logit contributions
 seven-0.000
 and-0.000
 only-0.000
 or-0.000
 driver-0.000
Token.
Token ID13
Feature activation-8.20e-03

Pos logit contributions
 seven+0.006
 The+0.003
 only+0.003
 named+0.002
 Leo+0.002
Neg logit contributions
arre-0.015
 discoveries-0.014
 discovery-0.014
oving-0.014
ert-0.013
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
arre+0.019
 discoveries+0.018
inker+0.017
oving+0.017
 situations+0.017
Neg logit contributions
 seven-0.013
 and-0.010
 only-0.009
 like-0.009
 or-0.009
Token
Token ID198
Feature activation+3.36e-04

Pos logit contributions
arre+0.041
inker+0.039
opter+0.039
 discoveries+0.039
beh+0.039
Neg logit contributions
 seven-0.013
 and-0.009
 only-0.007
 like-0.007
 The-0.006
TokenWhile
Token ID3633
Feature activation+2.84e-02

Pos logit contributions
 seven+0.001
 and+0.001
 only+0.001
 like+0.001
 The+0.001
Neg logit contributions
arre-0.001
opter-0.001
inker-0.001
beh-0.001
 discoveries-0.001
Token she
Token ID673
Feature activation+3.57e-02

Pos logit contributions
 seven+0.049
 and+0.040
 like+0.036
 only+0.035
 or+0.034
Neg logit contributions
arre-0.060
 discoveries-0.058
opter-0.057
beh-0.056
inker-0.056
Token was
Token ID373
Feature activation+6.90e-03

Pos logit contributions
 seven+0.061
 and+0.051
 only+0.045
 like+0.045
 The+0.044
Neg logit contributions
arre-0.078
opter-0.073
beh-0.073
inker-0.073
 discoveries-0.073
Token eating
Token ID6600
Feature activation+4.34e-03

Pos logit contributions
 seven+0.014
 and+0.011
 The+0.010
 only+0.010
 or+0.010
Neg logit contributions
arre-0.013
beh-0.012
oving-0.012
 discoveries-0.012
 situations-0.011
Token,
Token ID11
Feature activation-2.57e-02

Pos logit contributions
 seven+0.005
 only+0.003
 and+0.003
 The+0.003
 driver+0.003
Neg logit contributions
 discoveries-0.011
arre-0.011
 discovery-0.011
 situations-0.010
igsaw-0.010
Token Jane
Token ID12091
Feature activation-8.25e-03

Pos logit contributions
arre+0.056
opter+0.053
inker+0.053
beh+0.052
 discoveries+0.052
Neg logit contributions
 seven-0.044
 and-0.037
 only-0.033
 like-0.032
 The-0.032
Token noticed
Token ID6810
Feature activation+4.12e-03

Pos logit contributions
arre+0.016
opter+0.013
ert+0.013
making+0.013
oving+0.013
Neg logit contributions
 seven-0.016
 The-0.012
 like-0.012
 only-0.011
 He-0.011
Token not
Token ID407
Feature activation+7.03e-03

Pos logit contributions
arre+0.023
opter+0.022
inker+0.022
beh+0.022
 discoveries+0.021
Neg logit contributions
 seven-0.011
 and-0.009
 only-0.008
 like-0.007
 The-0.007
Token too
Token ID1165
Feature activation+1.13e-02

Pos logit contributions
 seven+0.010
 and+0.008
 only+0.007
 The+0.006
 like+0.006
Neg logit contributions
arre-0.018
beh-0.017
 discoveries-0.017
opter-0.017
inker-0.017
Token much
Token ID881
Feature activation+1.81e-02

Pos logit contributions
 seven+0.017
 and+0.014
 only+0.012
 like+0.012
 The+0.012
Neg logit contributions
arre-0.027
beh-0.026
opter-0.026
inker-0.026
 discoveries-0.025
Token.
Token ID13
Feature activation+8.56e-03

Pos logit contributions
 seven+0.018
 and+0.013
 only+0.010
 The+0.010
 like+0.009
Neg logit contributions
arre-0.052
beh-0.050
 discoveries-0.050
opter-0.050
inker-0.050
Token<|endoftext|>
Token ID50256
Feature activation-2.63e-02

Pos logit contributions
 seven+0.014
 and+0.010
 like+0.009
 only+0.009
 still+0.008
Neg logit contributions
arre-0.020
beh-0.019
inker-0.018
irs-0.018
 situations-0.018
TokenAnna
Token ID31160
Feature activation+4.10e-02

Pos logit contributions
arre+0.046
inker+0.042
 situations+0.040
 discoveries+0.039
irs+0.038
Neg logit contributions
 seven-0.057
 only-0.043
 and-0.042
 like-0.042
 still-0.041
Token and
Token ID290
Feature activation+3.83e-02

Pos logit contributions
 seven+0.053
 low+0.027
 eight+0.027
 blue+0.024
 The+0.024
Neg logit contributions
arre-0.104
 discoveries-0.096
making-0.092
tell-0.091
 possessions-0.091
Token Ben
Token ID3932
Feature activation+1.70e-02

Pos logit contributions
 seven+0.058
 and+0.035
 blue+0.030
 or+0.030
 only+0.030
Neg logit contributions
arre-0.083
 discoveries-0.083
beh-0.080
opter-0.078
tell-0.078
Token were
Token ID547
Feature activation-7.92e-04

Pos logit contributions
 seven+0.031
 only+0.021
 The+0.021
 and+0.020
 blue+0.020
Neg logit contributions
arre-0.034
 discoveries-0.031
opter-0.031
beh-0.029
inker-0.029
Token playing
Token ID2712
Feature activation-1.15e-03

Pos logit contributions
arre+0.001
 discoveries+0.001
inker+0.001
beh+0.001
uc+0.001
Neg logit contributions
 seven-0.002
 or-0.001
 like-0.001
 and-0.001
 The-0.001
Token in
Token ID287
Feature activation+9.93e-03

Pos logit contributions
arre+0.003
 discoveries+0.003
inker+0.003
oving+0.003
 situations+0.003
Neg logit contributions
 seven-0.001
 and-0.001
 only-0.001
 The-0.001
 or-0.001
Token.
Token ID13
Feature activation-3.59e-03

Pos logit contributions
arre+0.042
 discoveries+0.041
opter+0.040
inker+0.040
 situations+0.040
Neg logit contributions
 seven-0.025
 The-0.017
 and-0.017
 only-0.016
 also-0.015
Token They
Token ID1119
Feature activation-1.53e-02

Pos logit contributions
arre+0.008
 discoveries+0.008
opter+0.008
inker+0.007
ert+0.007
Neg logit contributions
 seven-0.006
 and-0.004
 like-0.004
 or-0.004
 only-0.004
Token hugged
Token ID41130
Feature activation+1.22e-02

Pos logit contributions
arre+0.031
opter+0.030
 discoveries+0.029
beh+0.029
inker+0.029
Neg logit contributions
 seven-0.027
 and-0.022
 The-0.020
 only-0.019
 or-0.019
Token him
Token ID683
Feature activation+1.34e-02

Pos logit contributions
 seven+0.017
 only+0.010
 or+0.009
 also+0.009
 still+0.009
Neg logit contributions
arre-0.029
 discoveries-0.029
 discovery-0.027
 possessions-0.027
inker-0.027
Token tightly
Token ID17707
Feature activation+1.80e-02

Pos logit contributions
 seven+0.015
 only+0.008
 or+0.008
Can+0.007
 named+0.007
Neg logit contributions
 discoveries-0.034
 discovery-0.032
arre-0.032
ert-0.031
 possessions-0.031
Token and
Token ID290
Feature activation-4.95e-03

Pos logit contributions
 seven+0.018
 The+0.008
 Leo+0.007
 only+0.006
 Africa+0.006
Neg logit contributions
 discoveries-0.047
 discovery-0.046
arre-0.046
 possessions-0.046
ert-0.044
Token brought
Token ID3181
Feature activation-2.77e-02

Pos logit contributions
arre+0.012
beh+0.011
 discoveries+0.011
opter+0.011
inker+0.011
Neg logit contributions
 seven-0.008
 and-0.006
 only-0.006
 like-0.005
 The-0.005
Token him
Token ID683
Feature activation+8.14e-04

Pos logit contributions
arre+0.067
 discoveries+0.063
inker+0.063
beh+0.063
opter+0.063
Neg logit contributions
 seven-0.041
 and-0.034
 only-0.029
 like-0.028
 The-0.028
Token over
Token ID625
Feature activation+1.72e-02

Pos logit contributions
 seven+0.001
 and+0.001
 only+0.001
 The+0.001
 like+0.001
Neg logit contributions
arre-0.002
 discoveries-0.002
beh-0.002
inker-0.002
opter-0.002
Token to
Token ID284
Feature activation-2.05e-02

Pos logit contributions
 seven+0.023
 Sunny+0.012
 The+0.012
 only+0.012
 ind+0.012
Neg logit contributions
 discoveries-0.041
 discovery-0.040
 possessions-0.038
ert-0.037
ge-0.037
Token the
Token ID262
Feature activation+9.02e-03

Pos logit contributions
 discoveries+0.050
arre+0.050
 situations+0.048
inker+0.048
 discovery+0.048
Neg logit contributions
 seven-0.028
 and-0.020
 only-0.017
 or-0.016
 like-0.016
Token it
Token ID340
Feature activation-2.23e-02

Pos logit contributions
 seven+0.024
 and+0.019
 only+0.015
 like+0.015
 The+0.014
Neg logit contributions
arre-0.059
inker-0.056
opter-0.056
beh-0.056
 discoveries-0.056
Token was
Token ID373
Feature activation+2.98e-03

Pos logit contributions
arre+0.053
beh+0.051
inker+0.051
 discoveries+0.050
opter+0.050
Neg logit contributions
 seven-0.034
 and-0.030
 only-0.027
 still-0.025
 like-0.025
Token as
Token ID355
Feature activation+7.16e-03

Pos logit contributions
 seven+0.005
 and+0.004
 The+0.004
 only+0.003
 or+0.003
Neg logit contributions
arre-0.006
 discoveries-0.006
oving-0.006
 discovery-0.006
 possessions-0.005
Token good
Token ID922
Feature activation+3.11e-03

Pos logit contributions
 seven+0.011
 and+0.009
 only+0.008
 The+0.008
 like+0.008
Neg logit contributions
arre-0.017
opter-0.016
beh-0.016
 discoveries-0.016
inker-0.016
Token as
Token ID355
Feature activation+2.49e-02

Pos logit contributions
 seven+0.004
 and+0.003
 only+0.002
 The+0.002
 like+0.002
Neg logit contributions
arre-0.009
beh-0.008
 discoveries-0.008
opter-0.008
inker-0.008
Token new
Token ID649
Feature activation-2.88e-03

Pos logit contributions
 seven+0.037
 and+0.027
 only+0.024
 The+0.024
 like+0.023
Neg logit contributions
 discoveries-0.059
 possessions-0.056
ert-0.056
opter-0.056
arre-0.056
Token.
Token ID13
Feature activation-4.02e-03

Pos logit contributions
arre+0.009
 discoveries+0.009
beh+0.009
inker+0.009
opter+0.009
Neg logit contributions
 seven-0.002
 and-0.001
 only-0.001
 The-0.001
 like-0.001
Token
Token ID198
Feature activation-1.33e-02

Pos logit contributions
arre+0.012
 discoveries+0.011
opter+0.011
inker+0.011
beh+0.011
Neg logit contributions
 seven-0.004
 and-0.003
 only-0.002
 like-0.002
 or-0.002
Token
Token ID198
Feature activation+1.03e-03

Pos logit contributions
arre+0.042
beh+0.040
opter+0.040
inker+0.040
 discoveries+0.040
Neg logit contributions
 seven-0.010
 and-0.007
 The-0.005
 only-0.005
 like-0.004
TokenThe
Token ID464
Feature activation+3.27e-02

Pos logit contributions
 seven+0.002
 and+0.001
 only+0.001
 like+0.001
 The+0.001
Neg logit contributions
arre-0.002
opter-0.002
beh-0.002
 discoveries-0.002
inker-0.002
Token girl
Token ID2576
Feature activation-1.32e-02

Pos logit contributions
 seven+0.056
 and+0.049
 only+0.044
 like+0.042
 still+0.041
Neg logit contributions
arre-0.073
beh-0.069
opter-0.069
inker-0.069
oving-0.067
Token"
Token ID1
Feature activation-4.46e-03

Pos logit contributions
 seven+0.025
 and+0.021
 only+0.019
 like+0.019
 or+0.018
Neg logit contributions
arre-0.032
beh-0.030
inker-0.030
opter-0.030
 discoveries-0.029
TokenHello
Token ID15496
Feature activation-1.34e-02

Pos logit contributions
arre+0.008
beh+0.008
opter+0.007
inker+0.007
 discoveries+0.007
Neg logit contributions
 seven-0.009
 and-0.008
 only-0.007
 like-0.007
 The-0.007
Token,
Token ID11
Feature activation+1.41e-02

Pos logit contributions
 discoveries+0.035
opter+0.035
arre+0.035
inker+0.034
 discovery+0.034
Neg logit contributions
 seven-0.015
 and-0.009
 only-0.009
 The-0.008
 still-0.008
Token dog
Token ID3290
Feature activation+5.54e-04

Pos logit contributions
 seven+0.022
 and+0.017
 only+0.015
 The+0.015
 like+0.014
Neg logit contributions
arre-0.032
opter-0.030
 discoveries-0.030
inker-0.029
 discovery-0.029
Token.
Token ID13
Feature activation+1.64e-02

Pos logit contributions
 seven+0.001
 only+0.001
 still+0.001
 The+0.001
 also+0.001
Neg logit contributions
opter-0.001
 discoveries-0.001
arre-0.001
 situations-0.001
beh-0.001
Token You
Token ID921
Feature activation-1.44e-02

Pos logit contributions
 seven+0.030
 and+0.027
 still+0.025
 only+0.023
 also+0.023
Neg logit contributions
beh-0.026
 situations-0.026
arre-0.024
playing-0.024
opter-0.024
Token are
Token ID389
Feature activation+4.30e-03

Pos logit contributions
arre+0.029
opter+0.027
beh+0.027
inker+0.027
 discoveries+0.027
Neg logit contributions
 seven-0.027
 and-0.023
 The-0.020
 only-0.020
 like-0.019
Token very
Token ID845
Feature activation-1.31e-02

Pos logit contributions
 seven+0.006
 and+0.005
 The+0.004
 He+0.004
 or+0.004
Neg logit contributions
beh-0.009
arre-0.009
 discoveries-0.009
inker-0.008
 discovery-0.008
Token loud
Token ID7812
Feature activation+1.75e-03

Pos logit contributions
arre+0.029
beh+0.028
opter+0.028
inker+0.028
 discoveries+0.028
Neg logit contributions
 seven-0.021
 and-0.018
 only-0.016
 The-0.015
 or-0.015
Token,"
Token ID553
Feature activation+1.29e-02

Pos logit contributions
 seven+0.002
 only+0.002
 still+0.002
 The+0.001
 also+0.001
Neg logit contributions
arre-0.004
 situations-0.004
 discoveries-0.004
inker-0.004
oving-0.004
Token Tom
Token ID4186
Feature activation+1.56e-04

Pos logit contributions
 seven+0.019
 like+0.014
 and+0.013
 only+0.012
 still+0.012
Neg logit contributions
 discoveries-0.026
opter-0.025
inker-0.025
 situations-0.024
 discovery-0.024
Token her
Token ID607
Feature activation+1.78e-02

Pos logit contributions
inker+0.059
ert+0.058
 situations+0.057
 discoveries+0.056
irs+0.056
Neg logit contributions
 seven-0.044
 and-0.026
 The-0.026
 like-0.026
 blue-0.026
Token,
Token ID11
Feature activation+2.93e-03

Pos logit contributions
 seven+0.017
 and+0.011
 only+0.008
 The+0.008
 like+0.008
Neg logit contributions
arre-0.053
 discoveries-0.051
inker-0.051
opter-0.051
beh-0.051
Token but
Token ID475
Feature activation-1.13e-02

Pos logit contributions
 seven+0.005
 The+0.003
 Tw+0.003
 ind+0.003
 driver+0.003
Neg logit contributions
arre-0.005
 situations-0.005
ge-0.005
 discoveries-0.005
 discovery-0.005
Token he
Token ID339
Feature activation+4.65e-03

Pos logit contributions
arre+0.026
opter+0.025
 discoveries+0.025
inker+0.025
ert+0.025
Neg logit contributions
 seven-0.017
 and-0.013
 The-0.011
 like-0.011
 only-0.011
Token could
Token ID714
Feature activation-3.09e-03

Pos logit contributions
 seven+0.008
 and+0.006
 The+0.006
 like+0.006
 only+0.006
Neg logit contributions
arre-0.010
opter-0.010
inker-0.010
beh-0.010
 discoveries-0.009
Token not
Token ID407
Feature activation+5.08e-03

Pos logit contributions
arre+0.007
oving+0.007
beh+0.007
inker+0.007
opter+0.007
Neg logit contributions
 seven-0.005
 and-0.003
 The-0.003
 blue-0.003
 like-0.002
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
 seven+0.007
 and+0.005
 The+0.005
 only+0.005
 like+0.004
Neg logit contributions
arre-0.012
opter-0.012
inker-0.012
 discoveries-0.012
beh-0.012
Token The
Token ID383
Feature activation+2.79e-02

Pos logit contributions
 seven+0.017
 and+0.010
 like+0.010
 ind+0.010
 then+0.009
Neg logit contributions
arre-0.027
 situations-0.026
ulk-0.025
inker-0.024
igsaw-0.024
Token ring
Token ID5858
Feature activation+7.40e-03

Pos logit contributions
 seven+0.050
 and+0.043
 The+0.040
 like+0.039
 or+0.037
Neg logit contributions
opter-0.053
 discoveries-0.052
arre-0.052
inker-0.051
 possessions-0.051
Token was
Token ID373
Feature activation+5.30e-03

Pos logit contributions
 seven+0.015
 The+0.011
 and+0.011
 like+0.011
 only+0.010
Neg logit contributions
opter-0.013
arre-0.013
oving-0.012
inker-0.012
 arose-0.012
Token too
Token ID1165
Feature activation+1.62e-03

Pos logit contributions
 seven+0.008
 The+0.007
 and+0.006
 He+0.005
 But+0.005
Neg logit contributions
arre-0.010
beh-0.010
 arose-0.010
oving-0.010
ge-0.010
Token smaller
Token ID4833
Feature activation+9.27e-03

Pos logit contributions
arre+0.000
 discoveries+0.000
opter+0.000
 situations+0.000
 discovery+0.000
Neg logit contributions
 seven-0.000
 and-0.000
 The-0.000
 only-0.000
 like-0.000
Token ball
Token ID2613
Feature activation+1.19e-02

Pos logit contributions
 seven+0.015
 The+0.012
 and+0.012
 only+0.011
 still+0.011
Neg logit contributions
 discoveries-0.019
opter-0.019
 situations-0.019
arre-0.019
 possessions-0.018
Token of
Token ID286
Feature activation+1.19e-02

Pos logit contributions
 seven+0.016
 The+0.010
 only+0.008
 still+0.008
 He+0.008
Neg logit contributions
opter-0.027
arre-0.025
ade-0.025
making-0.025
 situations-0.025
Token rock
Token ID3881
Feature activation+8.94e-03

Pos logit contributions
 seven+0.020
 and+0.016
 The+0.015
 only+0.015
 still+0.014
Neg logit contributions
 discoveries-0.024
 situations-0.024
arre-0.023
 possessions-0.023
opter-0.023
Token that
Token ID326
Feature activation-1.98e-02

Pos logit contributions
 seven+0.011
 The+0.006
 only+0.005
 He+0.005
 still+0.005
Neg logit contributions
arre-0.022
opter-0.022
 situations-0.022
making-0.021
 possessions-0.021
Token goes
Token ID2925
Feature activation+1.75e-02

Pos logit contributions
arre+0.041
opter+0.039
 discoveries+0.039
 situations+0.039
beh+0.039
Neg logit contributions
 seven-0.033
 and-0.026
 The-0.023
 like-0.023
 only-0.022
Token around
Token ID1088
Feature activation-3.06e-02

Pos logit contributions
 seven+0.027
 The+0.020
 Leo+0.019
 and+0.019
 He+0.018
Neg logit contributions
beh-0.035
arre-0.034
 possessions-0.034
ge-0.033
irs-0.033
Token the
Token ID262
Feature activation-2.74e-02

Pos logit contributions
 situations+0.076
arre+0.072
 discoveries+0.072
ert+0.071
inker+0.070
Neg logit contributions
 seven-0.037
 only-0.021
 driver-0.021
 still-0.020
 The-0.020
Token earth
Token ID4534
Feature activation-4.44e-02

Pos logit contributions
arre+0.051
 situations+0.050
 discoveries+0.048
opter+0.048
inker+0.046
Neg logit contributions
 seven-0.050
 and-0.041
 only-0.040
 The-0.038
 like-0.038
Token."
Token ID526
Feature activation+1.37e-03

Pos logit contributions
arre+0.110
opter+0.110
 discoveries+0.108
 situations+0.108
ert+0.105
Neg logit contributions
 seven-0.051
 The-0.030
 only-0.028
 Tom-0.026
 like-0.023
Token
Token ID198
Feature activation-2.29e-05

Pos logit contributions
 seven+0.002
 only+0.002
 driver+0.002
 like+0.001
 and+0.001
Neg logit contributions
arre-0.003
 discoveries-0.003
inker-0.002
irs-0.002
ulk-0.002
Token a
Token ID257
Feature activation-7.64e-03

Pos logit contributions
arre+0.081
opter+0.076
beh+0.076
inker+0.075
oving+0.075
Neg logit contributions
 seven-0.039
 and-0.033
 only-0.028
 like-0.026
 The-0.025
Token secret
Token ID3200
Feature activation-2.16e-02

Pos logit contributions
arre+0.017
opter+0.016
inker+0.016
beh+0.016
 discoveries+0.016
Neg logit contributions
 seven-0.012
 and-0.010
 only-0.009
 like-0.009
 The-0.009
Token path
Token ID3108
Feature activation+9.14e-03

Pos logit contributions
 discoveries+0.041
 possessions+0.039
arre+0.039
 situations+0.039
 discovery+0.039
Neg logit contributions
 seven-0.041
 The-0.033
 and-0.032
 only-0.031
 like-0.029
Token and
Token ID290
Feature activation-4.24e-03

Pos logit contributions
 seven+0.011
 The+0.007
 only+0.006
 still+0.005
 Santa+0.005
Neg logit contributions
 possessions-0.022
 discoveries-0.022
 situations-0.022
 discovery-0.022
ert-0.022
Token saw
Token ID2497
Feature activation-1.80e-02

Pos logit contributions
arre+0.010
opter+0.009
beh+0.009
inker+0.009
oving+0.009
Neg logit contributions
 seven-0.007
 and-0.006
 only-0.005
 like-0.005
 The-0.005
Token many
Token ID867
Feature activation-4.81e-02

Pos logit contributions
arre+0.046
inker+0.043
opter+0.043
beh+0.043
 discoveries+0.043
Neg logit contributions
 seven-0.025
 and-0.020
 only-0.017
 like-0.017
 The-0.016
Token different
Token ID1180
Feature activation-6.44e-02

Pos logit contributions
arre+0.104
beh+0.096
opter+0.094
inker+0.092
oving+0.091
Neg logit contributions
 seven-0.089
 and-0.082
 like-0.071
 only-0.070
 or-0.067
Token kinds
Token ID6982
Feature activation-3.99e-02

Pos logit contributions
arre+0.122
beh+0.109
opter+0.109
oving+0.105
making+0.104
Neg logit contributions
 seven-0.136
 and-0.133
 like-0.117
 only-0.114
 or-0.110
Token of
Token ID286
Feature activation-1.61e-02

Pos logit contributions
 discoveries+0.086
arre+0.085
opter+0.083
inker+0.083
beh+0.083
Neg logit contributions
 seven-0.067
 and-0.049
 only-0.049
 The-0.048
 still-0.045
Token animals
Token ID4695
Feature activation-1.65e-02

Pos logit contributions
arre+0.032
opter+0.030
beh+0.029
inker+0.029
oving+0.029
Neg logit contributions
 seven-0.032
 and-0.029
 only-0.026
 like-0.025
 The-0.024
Token.
Token ID13
Feature activation-4.29e-03

Pos logit contributions
 discoveries+0.046
arre+0.045
beh+0.044
 situations+0.044
 possessions+0.044
Neg logit contributions
 seven-0.018
 The-0.010
 only-0.009
 and-0.009
 still-0.007
Token was
Token ID373
Feature activation-7.13e-03

Pos logit contributions
opter+0.066
arre+0.063
inker+0.060
beh+0.059
ert+0.059
Neg logit contributions
 seven-0.066
 and-0.054
 like-0.051
 The-0.049
 or-0.048
Token shiny
Token ID22441
Feature activation-1.06e-02

Pos logit contributions
 err+0.013
 viol+0.013
inker+0.013
 discovery+0.013
 possessions+0.013
Neg logit contributions
 seven-0.009
 and-0.007
 The-0.007
 Sunny-0.006
 or-0.006
Token and
Token ID290
Feature activation+1.08e-03

Pos logit contributions
arre+0.028
opter+0.028
 situations+0.028
 discoveries+0.028
 possessions+0.028
Neg logit contributions
 seven-0.010
 The-0.006
 Leo-0.005
 Sunny-0.004
 eight-0.004
Token red
Token ID2266
Feature activation-2.32e-02

Pos logit contributions
 seven+0.002
 and+0.001
 The+0.001
 or+0.001
 only+0.001
Neg logit contributions
arre-0.002
opter-0.002
 discoveries-0.002
beh-0.002
inker-0.002
Token.
Token ID13
Feature activation-1.60e-03

Pos logit contributions
arre+0.063
 situations+0.061
opter+0.060
inker+0.059
 discoveries+0.059
Neg logit contributions
 seven-0.024
 The-0.011
 still-0.010
 only-0.009
 Leo-0.009
Token It
Token ID632
Feature activation-3.38e-02

Pos logit contributions
arre+0.004
inker+0.004
 situations+0.004
 discoveries+0.004
opter+0.004
Neg logit contributions
 seven-0.002
 and-0.001
 only-0.001
 still-0.001
 like-0.001
Token had
Token ID550
Feature activation-1.43e-02

Pos logit contributions
opter+0.066
arre+0.064
beh+0.061
inker+0.061
ert+0.060
Neg logit contributions
 seven-0.058
 and-0.046
 like-0.043
 The-0.042
 or-0.041
Token big
Token ID1263
Feature activation-2.17e-02

Pos logit contributions
opter+0.032
arre+0.031
inker+0.031
 discoveries+0.030
 possessions+0.030
Neg logit contributions
 seven-0.018
 and-0.015
 still-0.013
 or-0.013
 like-0.012
Token wheels
Token ID13666
Feature activation-2.79e-02

Pos logit contributions
arre+0.047
opter+0.045
 discoveries+0.044
 situations+0.044
inker+0.043
Neg logit contributions
 seven-0.035
 and-0.027
 only-0.027
 still-0.025
 like-0.025
Token and
Token ID290
Feature activation-8.91e-04

Pos logit contributions
opter+0.078
arre+0.077
inker+0.074
ge+0.073
playing+0.073
Neg logit contributions
 seven-0.024
 Sunny-0.011
 Leo-0.010
 The-0.009
 only-0.008
Token a
Token ID257
Feature activation-1.30e-02

Pos logit contributions
arre+0.002
opter+0.002
inker+0.002
 discoveries+0.002
beh+0.002
Neg logit contributions
 seven-0.001
 and-0.001
 only-0.001
 like-0.001
 The-0.001
Token."
Token ID526
Feature activation+2.02e-03

Pos logit contributions
arre+0.029
 discoveries+0.028
 situations+0.028
opter+0.028
 possessions+0.028
Neg logit contributions
 seven-0.016
 and-0.011
 only-0.010
 The-0.010
 like-0.010
Token
Token ID198
Feature activation-3.97e-03

Pos logit contributions
 seven+0.003
 like+0.002
 only+0.002
 and+0.002
 or+0.002
Neg logit contributions
arre-0.005
 discoveries-0.005
opter-0.004
beh-0.004
inker-0.004
Token
Token ID198
Feature activation+6.96e-03

Pos logit contributions
arre+0.008
inker+0.008
 discoveries+0.008
opter+0.008
beh+0.008
Neg logit contributions
 seven-0.007
 and-0.005
 only-0.005
 like-0.005
 or-0.005
TokenThe
Token ID464
Feature activation+3.25e-02

Pos logit contributions
 seven+0.012
 and+0.010
 only+0.009
 like+0.009
 or+0.009
Neg logit contributions
arre-0.015
inker-0.014
beh-0.014
opter-0.014
 discoveries-0.014
Token little
Token ID1310
Feature activation-3.64e-02

Pos logit contributions
 seven+0.057
 and+0.049
 like+0.045
 The+0.045
 or+0.043
Neg logit contributions
arre-0.066
 discoveries-0.064
opter-0.062
inker-0.062
 possessions-0.061
Token boy
Token ID2933
Feature activation-3.53e-02

Pos logit contributions
 discoveries+0.064
arre+0.063
opter+0.061
 possessions+0.060
 situations+0.060
Neg logit contributions
 seven-0.072
 and-0.061
 The-0.059
 like-0.059
 only-0.056
Token smiled
Token ID13541
Feature activation+2.87e-02

Pos logit contributions
arre+0.071
 discoveries+0.063
beh+0.061
 discovery+0.060
opter+0.060
Neg logit contributions
 seven-0.061
 like-0.043
 blue-0.042
 low-0.042
 or-0.041
Token,
Token ID11
Feature activation-5.22e-03

Pos logit contributions
 seven+0.034
 Africa+0.019
 only+0.017
 Leo+0.017
 Anna+0.016
Neg logit contributions
arre-0.072
 discoveries-0.070
 discovery-0.068
inker-0.065
 possessions-0.063
Token gave
Token ID2921
Feature activation+3.17e-03

Pos logit contributions
arre+0.010
 discoveries+0.010
 discovery+0.010
 possessions+0.009
oving+0.009
Neg logit contributions
 seven-0.009
 or-0.006
 only-0.006
 like-0.006
 The-0.005
Token the
Token ID262
Feature activation+2.14e-03

Pos logit contributions
 seven+0.003
 and+0.002
 like+0.002
 only+0.002
 or+0.002
Neg logit contributions
arre-0.009
 discoveries-0.008
inker-0.008
beh-0.008
ert-0.008
Token pilot
Token ID8022
Feature activation+3.04e-02

Pos logit contributions
 seven+0.004
 and+0.003
 like+0.003
 only+0.003
 The+0.003
Neg logit contributions
arre-0.004
 discoveries-0.004
opter-0.004
inker-0.004
 possessions-0.004
Token little
Token ID1310
Feature activation-1.09e-02

Pos logit contributions
 seven+0.025
 and+0.019
 or+0.019
 like+0.019
 The+0.019
Neg logit contributions
opter-0.024
arre-0.024
 discoveries-0.022
 discovery-0.022
inker-0.021
Token girl
Token ID2576
Feature activation+2.39e-02

Pos logit contributions
opter+0.019
arre+0.019
 discoveries+0.018
inker+0.017
 situations+0.016
Neg logit contributions
 seven-0.022
 and-0.018
 only-0.017
 The-0.017
 like-0.017
Token who
Token ID508
Feature activation+1.22e-03

Pos logit contributions
 seven+0.038
 He+0.026
 only+0.023
 still+0.022
 eighty+0.021
Neg logit contributions
beh-0.048
 possessions-0.046
playing-0.045
 situations-0.045
oving-0.045
Token went
Token ID1816
Feature activation+9.92e-03

Pos logit contributions
 seven+0.002
 and+0.002
 or+0.001
 only+0.001
 The+0.001
Neg logit contributions
arre-0.003
beh-0.003
 discoveries-0.002
opter-0.002
inker-0.002
Token on
Token ID319
Feature activation-1.44e-02

Pos logit contributions
 seven+0.012
 and+0.009
 only+0.008
 still+0.007
 like+0.007
Neg logit contributions
arre-0.026
beh-0.025
inker-0.025
 discoveries-0.025
 situations-0.024
Token an
Token ID281
Feature activation-3.41e-02

Pos logit contributions
arre+0.043
inker+0.041
beh+0.041
opter+0.041
 discoveries+0.041
Neg logit contributions
 seven-0.014
 and-0.010
 only-0.008
 like-0.007
 The-0.007
Token adventure
Token ID8855
Feature activation+6.54e-03

Pos logit contributions
 discoveries+0.050
arre+0.048
inker+0.048
beh+0.048
 situations+0.047
Neg logit contributions
 seven-0.076
 and-0.066
 only-0.063
 The-0.062
 like-0.061
Token.
Token ID13
Feature activation+9.71e-03

Pos logit contributions
 seven+0.008
 only+0.004
 still+0.004
 The+0.004
 and+0.004
Neg logit contributions
 situations-0.016
beh-0.016
 discoveries-0.016
 possessions-0.016
inker-0.016
Token She
Token ID1375
Feature activation+9.39e-03

Pos logit contributions
 seven+0.012
 and+0.008
 only+0.007
 still+0.007
 or+0.007
Neg logit contributions
arre-0.025
 discoveries-0.023
 situations-0.023
inker-0.023
igsaw-0.023
Token excited
Token ID6568
Feature activation-1.45e-02

Pos logit contributions
 seven+0.016
 and+0.013
 like+0.012
 The+0.011
 or+0.011
Neg logit contributions
arre-0.020
inker-0.019
opter-0.019
 discoveries-0.018
beh-0.018
Tokenly
Token ID306
Feature activation-9.29e-03

Pos logit contributions
arre+0.035
inker+0.034
 discoveries+0.034
beh+0.034
opter+0.034
Neg logit contributions
 seven-0.021
 and-0.016
 only-0.015
 like-0.014
 The-0.014
Token high
Token ID1029
Feature activation+2.18e-02

Pos logit contributions
arre+0.022
 discoveries+0.020
oving+0.019
beh+0.019
 discovery+0.019
Neg logit contributions
 seven-0.020
 and-0.016
 Leo-0.015
 The-0.015
 or-0.015
Token in
Token ID287
Feature activation+6.79e-03

Pos logit contributions
 seven+0.023
 The+0.014
 eight+0.014
 Leo+0.013
 Anna+0.011
Neg logit contributions
arre-0.054
 discoveries-0.051
 situations-0.051
ing-0.051
oving-0.051
Token the
Token ID262
Feature activation-2.02e-02

Pos logit contributions
 seven+0.009
 Please+0.006
 and+0.006
 only+0.006
 like+0.006
Neg logit contributions
irs-0.015
 discoveries-0.015
 situations-0.015
lled-0.014
inker-0.014
Token sky
Token ID6766
Feature activation-2.65e-03

Pos logit contributions
arre+0.044
 situations+0.042
 discoveries+0.041
opter+0.041
igsaw+0.040
Neg logit contributions
 seven-0.030
 and-0.023
 only-0.023
 or-0.021
 like-0.021
Token and
Token ID290
Feature activation-9.62e-03

Pos logit contributions
arre+0.007
 discoveries+0.007
opter+0.006
 discovery+0.006
 situations+0.006
Neg logit contributions
 seven-0.003
 The-0.001
 Please-0.001
 only-0.001
 He-0.001
Token it
Token ID340
Feature activation-4.09e-02

Pos logit contributions
arre+0.024
 discoveries+0.022
opter+0.022
inker+0.022
 situations+0.022
Neg logit contributions
 seven-0.013
 and-0.009
 The-0.008
 only-0.008
 like-0.008
Token was
Token ID373
Feature activation-7.47e-03

Pos logit contributions
arre+0.094
inker+0.088
opter+0.088
beh+0.088
 discoveries+0.087
Neg logit contributions
 seven-0.066
 and-0.056
 only-0.049
 like-0.048
 The-0.047
Token very
Token ID845
Feature activation-1.64e-02

Pos logit contributions
arre+0.015
lled+0.015
oving+0.014
 discoveries+0.014
 discovery+0.014
Neg logit contributions
 seven-0.011
 The-0.008
 He-0.008
 and-0.007
 or-0.007
Token calm
Token ID9480
Feature activation-2.77e-03

Pos logit contributions
 discoveries+0.032
arre+0.032
opter+0.030
beh+0.030
oving+0.030
Neg logit contributions
 seven-0.029
 and-0.024
 or-0.021
 The-0.021
 only-0.021
Token,
Token ID11
Feature activation-1.98e-02

Pos logit contributions
arre+0.007
 situations+0.007
 discoveries+0.007
beh+0.007
oving+0.007
Neg logit contributions
 seven-0.004
 eight-0.002
 The-0.002
 Leo-0.002
 only-0.002
Token too
Token ID1165
Feature activation-3.99e-03

Pos logit contributions
arre+0.044
 discoveries+0.042
 discovery+0.040
oving+0.040
 possessions+0.039
Neg logit contributions
 seven-0.028
 The-0.016
 eight-0.015
:-0.015
 driver-0.013
Token box
Token ID3091
Feature activation-2.27e-03

Pos logit contributions
 seven+0.032
 and+0.027
 only+0.025
 like+0.025
 The+0.025
Neg logit contributions
arre-0.032
 discoveries-0.031
opter-0.030
inker-0.030
 situations-0.030
Token and
Token ID290
Feature activation-4.51e-04

Pos logit contributions
 discovery+0.006
 discoveries+0.006
ert+0.006
arre+0.006
inker+0.006
Neg logit contributions
 seven-0.002
 only-0.001
 like-0.001
 or-0.001
 The-0.001
Token saw
Token ID2497
Feature activation-1.79e-02

Pos logit contributions
arre+0.001
beh+0.001
inker+0.001
opter+0.001
 situations+0.001
Neg logit contributions
 seven-0.001
 and-0.001
 only-0.000
 like-0.000
 The-0.000
Token a
Token ID257
Feature activation-6.27e-03

Pos logit contributions
arre+0.051
beh+0.049
inker+0.049
opter+0.049
 discoveries+0.048
Neg logit contributions
 seven-0.019
 and-0.015
 only-0.012
 like-0.011
 The-0.011
Token j
Token ID474
Feature activation-1.65e-02

Pos logit contributions
arre+0.014
opter+0.013
 discoveries+0.013
inker+0.013
beh+0.013
Neg logit contributions
 seven-0.010
 and-0.009
 only-0.008
 The-0.008
 like-0.008
Tokenigsaw
Token ID45636
Feature activation-1.11e-01

Pos logit contributions
arre+0.029
beh+0.027
inker+0.027
opter+0.027
 discoveries+0.027
Neg logit contributions
 seven-0.035
 and-0.031
 only-0.028
 like-0.028
 The-0.028
Token puzzle
Token ID15027
Feature activation+4.88e-03

Pos logit contributions
arre+0.237
beh+0.228
inker+0.223
oving+0.216
 situations+0.216
Neg logit contributions
 seven-0.197
 and-0.181
 only-0.150
 like-0.150
 The-0.143
Token inside
Token ID2641
Feature activation-2.18e-02

Pos logit contributions
 seven+0.006
 The+0.004
 only+0.004
 and+0.004
 like+0.003
Neg logit contributions
 discoveries-0.012
opter-0.012
arre-0.012
 discovery-0.012
inker-0.012
Token.
Token ID13
Feature activation-2.59e-03

Pos logit contributions
 discoveries+0.056
opter+0.054
arre+0.054
inker+0.054
 discovery+0.054
Neg logit contributions
 seven-0.027
 only-0.015
 and-0.014
 The-0.013
 like-0.012
Token He
Token ID679
Feature activation-7.52e-03

Pos logit contributions
arre+0.006
opter+0.005
igsaw+0.005
 discoveries+0.005
inker+0.005
Neg logit contributions
 seven-0.004
 like-0.003
 and-0.003
 or-0.003
 only-0.003
Token paused
Token ID24487
Feature activation+1.76e-02

Pos logit contributions
arre+0.016
opter+0.015
 discoveries+0.015
inker+0.015
beh+0.015
Neg logit contributions
 seven-0.012
 and-0.010
 like-0.009
 only-0.009
 The-0.009
Token they
Token ID484
Feature activation-2.50e-03

Pos logit contributions
arre+0.071
 discoveries+0.067
beh+0.067
opter+0.067
inker+0.067
Neg logit contributions
 seven-0.032
 and-0.024
 like-0.019
 The-0.019
 only-0.018
Token just
Token ID655
Feature activation-1.91e-03

Pos logit contributions
arre+0.006
inker+0.005
beh+0.005
opter+0.005
 discoveries+0.005
Neg logit contributions
 seven-0.004
 and-0.004
 only-0.003
 like-0.003
 The-0.003
Token kept
Token ID4030
Feature activation-1.45e-02

Pos logit contributions
arre+0.004
opter+0.004
beh+0.004
 discoveries+0.004
inker+0.003
Neg logit contributions
 seven-0.004
 and-0.003
 The-0.003
 only-0.003
 like-0.003
Token on
Token ID319
Feature activation-2.64e-02

Pos logit contributions
arre+0.027
beh+0.025
 discoveries+0.025
oving+0.024
inker+0.024
Neg logit contributions
 seven-0.030
 and-0.025
 The-0.023
 only-0.022
 like-0.022
Token qu
Token ID627
Feature activation-4.72e-02

Pos logit contributions
arre+0.047
beh+0.043
 discoveries+0.043
inker+0.043
oving+0.042
Neg logit contributions
 seven-0.057
 and-0.048
 only-0.044
 like-0.044
 The-0.043
Tokenarre
Token ID9624
Feature activation-1.11e-01

Pos logit contributions
 possessions+0.045
 discoveries+0.044
opter+0.042
 situations+0.039
 discovery+0.038
Neg logit contributions
 seven-0.130
 and-0.126
 The-0.114
 like-0.113
 still-0.112
Tokenling
Token ID1359
Feature activation-9.97e-03

Pos logit contributions
 discoveries+0.169
igsaw+0.168
irs+0.164
inker+0.162
arre+0.157
Neg logit contributions
 and-0.247
 seven-0.234
 in-0.212
 like-0.209
.-0.202
Token!
Token ID0
Feature activation-1.31e-02

Pos logit contributions
arre+0.028
 situations+0.027
 discoveries+0.027
beh+0.026
inker+0.026
Neg logit contributions
 seven-0.010
 and-0.006
 only-0.005
 The-0.005
 still-0.005
Token
Token ID220
Feature activation-4.31e-02

Pos logit contributions
arre+0.032
 discoveries+0.029
opter+0.029
inker+0.029
beh+0.028
Neg logit contributions
 seven-0.020
 and-0.016
 like-0.014
 only-0.013
 or-0.013
Token
Token ID198
Feature activation-1.10e-02

Pos logit contributions
arre+0.128
inker+0.122
opter+0.121
beh+0.121
 discoveries+0.121
Neg logit contributions
 seven-0.041
 and-0.029
 only-0.022
 like-0.021
 The-0.019
Token
Token ID198
Feature activation+3.58e-03

Pos logit contributions
arre+0.033
inker+0.031
opter+0.031
beh+0.031
 discoveries+0.031
Neg logit contributions
 seven-0.010
 and-0.008
 only-0.006
 like-0.006
 The-0.005
Token never
Token ID1239
Feature activation+3.09e-03

Pos logit contributions
 seven+0.015
 and+0.012
 The+0.011
 He+0.011
 or+0.010
Neg logit contributions
arre-0.022
beh-0.022
oving-0.021
making-0.020
 possessions-0.020
Token happy
Token ID3772
Feature activation+8.54e-03

Pos logit contributions
 seven+0.005
 and+0.004
 only+0.004
 The+0.004
 like+0.004
Neg logit contributions
arre-0.007
beh-0.007
 discoveries-0.006
inker-0.006
opter-0.006
Token.
Token ID13
Feature activation+3.45e-03

Pos logit contributions
 seven+0.012
 The+0.007
 only+0.006
 also+0.006
 still+0.006
Neg logit contributions
arre-0.021
 situations-0.021
 discoveries-0.020
beh-0.020
 possessions-0.019
Token He
Token ID679
Feature activation+3.17e-03

Pos logit contributions
 seven+0.005
 and+0.003
 like+0.003
 only+0.003
 or+0.003
Neg logit contributions
arre-0.009
 discoveries-0.008
inker-0.008
 situations-0.008
opter-0.008
Token qu
Token ID627
Feature activation-4.43e-02

Pos logit contributions
 seven+0.006
 The+0.005
 blue+0.005
 low+0.004
 like+0.004
Neg logit contributions
arre-0.005
 discoveries-0.005
beh-0.005
 possessions-0.005
opter-0.005
Tokenarre
Token ID9624
Feature activation-1.03e-01

Pos logit contributions
 discoveries+0.068
 possessions+0.065
 situations+0.063
opter+0.063
 discovery+0.063
Neg logit contributions
 seven-0.098
 and-0.094
 The-0.085
 still-0.081
 like-0.081
Tokenled
Token ID992
Feature activation+3.00e-03

Pos logit contributions
igsaw+0.178
 discoveries+0.175
inker+0.170
arre+0.169
irs+0.168
Neg logit contributions
 and-0.211
 seven-0.205
 in-0.186
 like-0.176
 only-0.174
Token and
Token ID290
Feature activation+1.12e-03

Pos logit contributions
 seven+0.004
 also+0.002
 The+0.002
 ind+0.002
Can+0.002
Neg logit contributions
 discoveries-0.007
arre-0.007
 discovery-0.007
 possessions-0.006
 situations-0.006
Token yelled
Token ID22760
Feature activation-2.77e-02

Pos logit contributions
 seven+0.002
 and+0.002
 only+0.001
 like+0.001
 The+0.001
Neg logit contributions
arre-0.002
beh-0.002
 discoveries-0.002
opter-0.002
inker-0.002
Token and
Token ID290
Feature activation+2.86e-03

Pos logit contributions
arre+0.075
 discoveries+0.066
oving+0.066
opter+0.066
 situations+0.065
Neg logit contributions
 seven-0.032
 only-0.020
 also-0.019
 still-0.016
 and-0.015
Token sc
Token ID629
Feature activation-3.66e-02

Pos logit contributions
 seven+0.005
 and+0.004
 The+0.004
 only+0.004
 like+0.003
Neg logit contributions
arre-0.006
beh-0.006
 discoveries-0.006
opter-0.006
inker-0.006
Token ended
Token ID4444
Feature activation+1.51e-02

Pos logit contributions
 seven+0.020
 and+0.017
 only+0.015
 like+0.014
 The+0.014
Neg logit contributions
arre-0.036
inker-0.034
opter-0.034
beh-0.033
 discoveries-0.033
Token up
Token ID510
Feature activation+2.83e-02

Pos logit contributions
 seven+0.024
 and+0.019
 only+0.017
 like+0.016
 The+0.016
Neg logit contributions
arre-0.035
beh-0.033
 discoveries-0.033
inker-0.033
 situations-0.033
Token in
Token ID287
Feature activation+1.28e-02

Pos logit contributions
 seven+0.036
 and+0.024
 only+0.022
 The+0.022
 like+0.020
Neg logit contributions
arre-0.075
 discoveries-0.071
 situations-0.071
beh-0.070
inker-0.069
Token such
Token ID884
Feature activation+4.45e-03

Pos logit contributions
 seven+0.016
 and+0.011
 only+0.009
 like+0.009
 The+0.009
Neg logit contributions
arre-0.033
 situations-0.032
 discoveries-0.032
inker-0.031
opter-0.031
Token bad
Token ID2089
Feature activation-4.07e-02

Pos logit contributions
 seven+0.005
 and+0.004
 The+0.003
 only+0.003
 like+0.003
Neg logit contributions
arre-0.012
 discoveries-0.012
 situations-0.011
beh-0.011
opter-0.011
Token situations
Token ID7445
Feature activation-1.03e-01

Pos logit contributions
arre+0.064
beh+0.059
opter+0.059
inker+0.059
oving+0.056
Neg logit contributions
 seven-0.094
 and-0.088
 only-0.079
 like-0.078
 or-0.076
Token.
Token ID13
Feature activation+1.87e-03

Pos logit contributions
arre+0.301
 situations+0.289
beh+0.289
 discoveries+0.288
opter+0.286
Neg logit contributions
 seven-0.100
 and-0.058
 only-0.052
 The-0.051
 still-0.040
Token
Token ID220
Feature activation-3.92e-02

Pos logit contributions
 seven+0.003
 and+0.002
 like+0.002
 only+0.002
 or+0.002
Neg logit contributions
arre-0.005
beh-0.004
 situations-0.004
opter-0.004
inker-0.004
Token
Token ID198
Feature activation-6.72e-03

Pos logit contributions
arre+0.120
beh+0.115
opter+0.115
inker+0.115
 discoveries+0.114
Neg logit contributions
 seven-0.032
 and-0.023
 only-0.017
 The-0.015
 like-0.015
Token
Token ID198
Feature activation+3.59e-03

Pos logit contributions
arre+0.020
inker+0.019
beh+0.019
opter+0.019
 discoveries+0.018
Neg logit contributions
 seven-0.007
 and-0.005
 only-0.004
 like-0.004
 The-0.003
TokenThe
Token ID464
Feature activation+2.85e-02

Pos logit contributions
 seven+0.007
 and+0.006
 only+0.005
 like+0.005
 The+0.005
Neg logit contributions
arre-0.007
beh-0.007
inker-0.007
opter-0.007
 discoveries-0.007
Token the
Token ID262
Feature activation+6.60e-03

Pos logit contributions
arre+0.038
inker+0.036
opter+0.036
 discoveries+0.036
 situations+0.036
Neg logit contributions
 seven-0.014
 and-0.009
 only-0.008
 like-0.007
 still-0.007
Token mess
Token ID2085
Feature activation-4.95e-02

Pos logit contributions
 seven+0.017
 and+0.015
 only+0.014
 like+0.014
 The+0.013
Neg logit contributions
arre-0.009
beh-0.009
inker-0.009
opter-0.009
 discoveries-0.008
Token and
Token ID290
Feature activation-4.88e-03

Pos logit contributions
arre+0.152
inker+0.142
opter+0.141
oving+0.140
 situations+0.138
Neg logit contributions
 seven-0.046
 The-0.019
 only-0.017
 like-0.012
 and-0.010
Token the
Token ID262
Feature activation+2.80e-04

Pos logit contributions
arre+0.014
opter+0.014
beh+0.014
inker+0.014
 discoveries+0.014
Neg logit contributions
 seven-0.005
 and-0.004
 only-0.003
 like-0.003
 The-0.003
Token qu
Token ID627
Feature activation-4.73e-02

Pos logit contributions
 seven+0.001
 and+0.001
 only+0.001
 like+0.001
 The+0.001
Neg logit contributions
arre-0.000
beh-0.000
inker-0.000
opter-0.000
 discoveries-0.000
Tokenarre
Token ID9624
Feature activation-1.16e-01

Pos logit contributions
 discoveries+0.064
 possessions+0.060
beh+0.056
 situations+0.056
 discovery+0.055
Neg logit contributions
 and-0.117
 seven-0.113
.-0.100
 The-0.100
 like-0.098
Tokenling
Token ID1359
Feature activation-1.50e-02

Pos logit contributions
 discoveries+0.204
beh+0.203
inker+0.199
igsaw+0.199
ert+0.198
Neg logit contributions
 and-0.235
 seven-0.217
 in-0.198
.-0.187
 like-0.187
Token twins
Token ID20345
Feature activation-5.15e-04

Pos logit contributions
arre+0.030
beh+0.029
inker+0.029
 discoveries+0.028
opter+0.028
Neg logit contributions
 seven-0.028
 and-0.026
 only-0.022
 like-0.022
 The-0.022
Token.
Token ID13
Feature activation-2.37e-03

Pos logit contributions
arre+0.001
 discoveries+0.001
 situations+0.001
beh+0.001
inker+0.001
Neg logit contributions
 seven-0.001
 The-0.000
 only-0.000
 He-0.000
 and-0.000
Token She
Token ID1375
Feature activation+4.11e-03

Pos logit contributions
arre+0.007
inker+0.007
opter+0.007
 discoveries+0.007
beh+0.006
Neg logit contributions
 seven-0.002
 and-0.002
 only-0.001
 like-0.001
 still-0.001
Token is
Token ID318
Feature activation-1.27e-03

Pos logit contributions
 seven+0.006
 and+0.005
 only+0.004
 like+0.004
 The+0.004
Neg logit contributions
arre-0.010
beh-0.010
opter-0.010
inker-0.010
 discoveries-0.010