TOP ACTIVATIONS
MAX = 9.553

, something smooth and something out of ice . <|endoftext|> Once
, and Little Joe laughed out loud . "
. She said the words out loud and started to understand
these gifts !" she said out loud . Max
funny that the girl laughed out loud .
is this ?" she said out loud . She tried to
. The little boy laughed out loud ! He
smile , and Tom laughed out loud .
The creature laughed out loud and walked away .
book and said the word out loud . " Spell !"
. F olding the phone out is a special thing you
. They said the word out loud . "
some blood . She cried out loud and ran to her
sharp pain . She cries out loud . "
the floor . She cries out loud . The music stops
the arm . Ben cries out and lets go of Lily
the arm . He cries out in pain .
the cabinet . She cried out loud and felt a bump
his knee . He cries out loud . "
the head . Tom cries out in pain and falls to

MIN ACTIVATIONS
MIN = -9.307

arm chair . THE <|endoftext|> Once upon a time
again tomorrow . THE <|endoftext|> Once upon a time
The End . <|endoftext|> Once upon a time there
The end . <|endoftext|> Once there was a gentle
The End <|endoftext|> Once upon a time there
The end . <|endoftext|> L ily and Tom are
The End . <|endoftext|> Once upon a time ,
his friends to come . <|endoftext|> Once upon a time ,
The end . <|endoftext|> Once upon a time ,
The end . <|endoftext|> One day , Sarah wanted
The end . <|endoftext|> Once , there was a
think they should do ? <|endoftext|> Once upon a time ,
The end . <|endoftext|> Once upon a time there
The End . <|endoftext|> John was weak . He
happy , clean dog . <|endoftext|> Once there was a label
He started to cry . <|endoftext|> Once upon a time there
The End . <|endoftext|> Once upon a time there
The end . <|endoftext|> Once upon a time there
The end . <|endoftext|> Once upon a time there
They are lucky . <|endoftext|> L ily and Ben were

INTERVAL 4.838 - 9.553
CONTAINS 0.733%

put the past ries back in the box and closed it
and fun ! She laughed with joy and ran around and
mercury . He felt something sharp in his tongue . He
answer . They felt ashamed of what they did . They
ucky noticed that the water in the pond was getting dirty

INTERVAL 0.123 - 4.838
CONTAINS 50.120%

, which was parked nearby . Amy enjoyed
M illy heard a strange noise . She looked down and
wants and be persistent in finding new ways to have fun
close . Tim my asked his mom , "
playing with some balls . Amy showed her

INTERVAL -4.592 - 0.123
CONTAINS 48.019%

was excited . He wanted to send something too . He
to use it . R uffy tried to use
" You 're welcome , kids . I 'm
the fourth of July celebration and watch it together ," she
doll hair on my head and I didn 't want it

INTERVAL -9.307 - -4.592
CONTAINS 1.128%

small bucket . And we can share it ."
The End . <|endoftext|> Once there was a bird
favorite . The end . <|endoftext|> Once upon a time ,
be alert and safe . <|endoftext|> Once upon a time ,
are very smart . The moral of this story is that
Token,
Token ID11
Feature activation+1.76e+00

Pos logit contributions
lor+0.020
+0.018
ane+0.018
ival+0.018
 skies+0.018
Neg logit contributions
 type-0.017
 grun-0.017
 Liz-0.016
 shape-0.015
 sorts-0.015
Token something
Token ID1223
Feature activation+4.03e+00

Pos logit contributions
 type+0.035
 shape+0.032
 sorts+0.032
 grun+0.032
 Liz+0.032
Neg logit contributions
lor-0.039
-0.037
ival-0.036
 skies-0.035
ane-0.035
Token smooth
Token ID7209
Feature activation+1.77e+00

Pos logit contributions
uzzle+0.086
ooks+0.083
ook+0.080
rum+0.080
 Liz+0.076
Neg logit contributions
ival-0.073
lor-0.068
 skies-0.068
hel-0.065
actus-0.065
Token and
Token ID290
Feature activation-1.44e+00

Pos logit contributions
 type+0.035
 sorts+0.032
 shape+0.032
 Liz+0.032
 grun+0.032
Neg logit contributions
lor-0.041
-0.039
ival-0.038
ane-0.038
 skies-0.037
Token something
Token ID1223
Feature activation+3.87e+00

Pos logit contributions
 skies+0.031
ival+0.030
hel+0.029
+0.028
lor+0.028
Neg logit contributions
uzzle-0.026
ook-0.024
rum-0.024
ooks-0.024
 type-0.023
Token out
Token ID503
Feature activation+9.55e+00

Pos logit contributions
uzzle+0.083
ooks+0.081
rum+0.079
ook+0.078
 Liz+0.075
Neg logit contributions
ival-0.067
 skies-0.064
lor-0.063
actus-0.062
-0.061
Token of
Token ID286
Feature activation+6.54e+00

Pos logit contributions
 type+0.196
 grun+0.183
 shape+0.178
 Liz+0.175
ook+0.173
Neg logit contributions
lor-0.198
ane-0.190
ival-0.185
 skies-0.185
-0.182
Token ice
Token ID4771
Feature activation-1.69e+00

Pos logit contributions
 type+0.095
 grun+0.085
 Liz+0.085
ook+0.084
uzzle+0.082
Neg logit contributions
-0.176
lor-0.175
 skies-0.172
ival-0.163
hel-0.163
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
lor+0.042
+0.040
ival+0.039
ane+0.039
 skies+0.038
Neg logit contributions
 type-0.031
 shape-0.029
 sorts-0.028
 its-0.028
 Liz-0.028
Token<|endoftext|>
Token ID50256
Feature activation-7.39e+00

Pos logit contributions
 type+0.034
 shape+0.032
 grun+0.031
 sorts+0.031
 its+0.031
Neg logit contributions
lor-0.027
-0.025
ival-0.024
ane-0.023
 skies-0.023
TokenOnce
Token ID7454
Feature activation-3.01e+00

Pos logit contributions
lor+0.173
ival+0.164
 salon+0.160
ican+0.159
actus+0.158
Neg logit contributions
 its-0.133
 their-0.132
 type-0.131
 M-0.131
 Liz-0.130
Token,
Token ID11
Feature activation+9.92e-01

Pos logit contributions
 type+0.033
 sorts+0.030
 shape+0.029
 grun+0.029
 Liz+0.029
Neg logit contributions
lor-0.047
-0.045
ival-0.044
ane-0.043
 skies-0.042
Token and
Token ID290
Feature activation-2.04e+00

Pos logit contributions
 type+0.020
 shape+0.018
 sorts+0.018
 Liz+0.018
 grun+0.018
Neg logit contributions
lor-0.023
-0.022
ival-0.021
ane-0.021
 skies-0.020
Token Little
Token ID7703
Feature activation-1.61e+00

Pos logit contributions
 skies+0.041
hel+0.040
lor+0.039
ival+0.039
ane+0.037
Neg logit contributions
 type-0.042
uzzle-0.037
ooks-0.037
ook-0.036
 shape-0.036
Token Joe
Token ID5689
Feature activation+1.85e+00

Pos logit contributions
lor+0.049
+0.048
ival+0.046
ane+0.046
 skies+0.046
Neg logit contributions
 type-0.021
 sorts-0.018
 shape-0.018
 its-0.017
 grun-0.017
Token laughed
Token ID13818
Feature activation+1.75e-01

Pos logit contributions
 type+0.030
 shape+0.028
 its+0.027
 grun+0.027
 sorts+0.027
Neg logit contributions
lor-0.051
-0.049
ival-0.047
ane-0.046
actus-0.046
Token out
Token ID503
Feature activation+9.49e+00

Pos logit contributions
 type+0.004
 sorts+0.004
 grun+0.004
 Liz+0.004
 shape+0.004
Neg logit contributions
lor-0.003
-0.003
 skies-0.003
ane-0.003
ival-0.003
Token loud
Token ID7812
Feature activation+2.79e+00

Pos logit contributions
 type+0.203
ooks+0.190
ook+0.186
 Liz+0.182
 sorts+0.175
Neg logit contributions
 skies-0.192
-0.173
hel-0.167
 sky-0.167
 next-0.163
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
 type+0.050
 sorts+0.045
 Liz+0.044
 shape+0.044
 grun+0.044
Neg logit contributions
lor-0.070
-0.068
ival-0.065
 skies-0.065
ane-0.065
Token
Token ID198
Feature activation-8.11e-01

Pos logit contributions
 type+0.035
 shape+0.032
cest+0.031
 grun+0.031
 sorts+0.030
Neg logit contributions
-0.027
lor-0.025
 skies-0.024
ane-0.023
ival-0.022
Token
Token ID198
Feature activation-7.63e-01

Pos logit contributions
lor+0.015
+0.014
ival+0.013
ane+0.013
 skies+0.013
Neg logit contributions
 type-0.021
 shape-0.019
 sorts-0.019
 grun-0.019
 Liz-0.018
Token"
Token ID1
Feature activation-1.67e+00

Pos logit contributions
lor+0.014
+0.013
ival+0.013
There+0.013
 skies+0.012
Neg logit contributions
 type-0.019
 shape-0.018
 grun-0.017
 sorts-0.017
 Liz-0.017
Token.
Token ID13
Feature activation+1.14e+00

Pos logit contributions
 type+0.002
 sorts+0.002
 shape+0.002
 grun+0.002
 Liz+0.002
Neg logit contributions
lor-0.003
-0.003
ival-0.003
ane-0.003
 skies-0.003
Token She
Token ID1375
Feature activation+1.12e+00

Pos logit contributions
 type+0.022
 shape+0.020
 Liz+0.020
 sorts+0.020
 its+0.020
Neg logit contributions
lor-0.028
-0.026
ival-0.025
ane-0.025
actus-0.025
Token said
Token ID531
Feature activation-2.52e+00

Pos logit contributions
 type+0.020
 sorts+0.018
 their+0.018
 its+0.017
 Liz+0.017
Neg logit contributions
lor-0.028
ival-0.027
-0.027
 skies-0.026
hel-0.026
Token the
Token ID262
Feature activation-9.82e-01

Pos logit contributions
lor+0.062
+0.058
ival+0.058
ane+0.057
 skies+0.055
Neg logit contributions
 type-0.048
 sorts-0.044
 shape-0.043
 grun-0.042
 Liz-0.042
Token words
Token ID2456
Feature activation+2.00e+00

Pos logit contributions
lor+0.026
+0.024
ane+0.023
ival+0.023
hel+0.022
Neg logit contributions
 type-0.019
 shape-0.018
 grun-0.016
 sorts-0.015
 its-0.015
Token out
Token ID503
Feature activation+9.46e+00

Pos logit contributions
 type+0.041
 shape+0.038
 sorts+0.037
 grun+0.037
 Liz+0.037
Neg logit contributions
lor-0.044
-0.042
ane-0.042
ival-0.042
hel-0.040
Token loud
Token ID7812
Feature activation+2.70e+00

Pos logit contributions
 type+0.180
 sorts+0.169
ooks+0.163
uzzle+0.162
ook+0.160
Neg logit contributions
lor-0.205
 skies-0.204
hel-0.196
ival-0.193
-0.191
Token and
Token ID290
Feature activation-1.73e+00

Pos logit contributions
 type+0.056
 sorts+0.052
 shape+0.051
 grun+0.051
 Liz+0.050
Neg logit contributions
lor-0.060
-0.056
ival-0.055
ane-0.054
 skies-0.054
Token started
Token ID2067
Feature activation+1.27e+00

Pos logit contributions
 skies+0.036
ival+0.036
hel+0.035
lor+0.034
+0.033
Neg logit contributions
 type-0.033
uzzle-0.031
ooks-0.031
ook-0.031
 sorts-0.031
Token to
Token ID284
Feature activation+6.20e-01

Pos logit contributions
 type+0.024
 sorts+0.021
 shape+0.021
 grun+0.021
 Liz+0.021
Neg logit contributions
lor-0.031
-0.029
ival-0.028
ane-0.028
 skies-0.028
Token understand
Token ID1833
Feature activation+1.14e+00

Pos logit contributions
 type+0.011
 Liz+0.011
ook+0.010
 sorts+0.010
 their+0.010
Neg logit contributions
lor-0.014
-0.014
 skies-0.013
hel-0.013
ival-0.013
Token these
Token ID777
Feature activation+1.30e+00

Pos logit contributions
 type+0.076
 grun+0.073
 shape+0.068
uzzle+0.067
 Liz+0.065
Neg logit contributions
lor-0.099
ival-0.093
ane-0.093
 skies-0.092
hel-0.090
Token gifts
Token ID13201
Feature activation+5.38e-01

Pos logit contributions
 type+0.028
 grun+0.027
 shape+0.026
 sorts+0.026
 its+0.026
Neg logit contributions
lor-0.028
-0.026
ival-0.026
ane-0.025
 skies-0.025
Token!"
Token ID2474
Feature activation-1.45e+00

Pos logit contributions
 type+0.009
 grun+0.009
 sorts+0.008
 shape+0.008
 its+0.008
Neg logit contributions
lor-0.014
-0.013
ival-0.013
ane-0.013
hel-0.012
Token she
Token ID673
Feature activation+1.22e+00

Pos logit contributions
lor+0.032
+0.030
ival+0.029
ane+0.029
actus+0.028
Neg logit contributions
 type-0.031
 shape-0.029
 their-0.029
 its-0.029
 Liz-0.029
Token said
Token ID531
Feature activation-1.06e+00

Pos logit contributions
 type+0.028
 sorts+0.027
 shape+0.025
ook+0.024
 its+0.024
Neg logit contributions
lor-0.023
ival-0.021
hel-0.021
 skies-0.021
-0.020
Token out
Token ID503
Feature activation+9.46e+00

Pos logit contributions
lor+0.026
+0.025
ival+0.024
ane+0.024
hel+0.024
Neg logit contributions
 type-0.019
 sorts-0.018
 shape-0.017
 grun-0.017
 Liz-0.016
Token loud
Token ID7812
Feature activation+2.47e+00

Pos logit contributions
 sorts+0.202
 type+0.201
cest+0.189
ooks+0.189
 proper+0.182
Neg logit contributions
hel-0.174
 skies-0.174
't-0.168
-0.167
 next-0.157
Token.
Token ID13
Feature activation+3.25e+00

Pos logit contributions
 type+0.045
 shape+0.041
 sorts+0.040
 its+0.040
 their+0.040
Neg logit contributions
lor-0.064
-0.060
ival-0.059
ane-0.058
actus-0.056
Token
Token ID198
Feature activation+4.20e-01

Pos logit contributions
 type+0.058
 shape+0.053
 its+0.052
 their+0.052
 Liz+0.052
Neg logit contributions
lor-0.084
-0.079
ival-0.078
ane-0.077
actus-0.075
Token
Token ID198
Feature activation+3.78e-01

Pos logit contributions
 their+0.008
 type+0.008
 its+0.008
 Liz+0.008
 shape+0.007
Neg logit contributions
lor-0.010
-0.010
ane-0.010
ival-0.009
actus-0.009
TokenMax
Token ID11518
Feature activation+7.98e-01

Pos logit contributions
 type+0.008
 their+0.008
 shape+0.008
 its+0.008
 Liz+0.007
Neg logit contributions
lor-0.009
-0.008
ane-0.008
actus-0.008
ival-0.008
Token funny
Token ID8258
Feature activation+9.30e-01

Pos logit contributions
lor+0.051
+0.049
ival+0.049
 skies+0.048
ane+0.047
Neg logit contributions
 type-0.047
 sorts-0.043
uzzle-0.042
 shape-0.042
 grun-0.041
Token that
Token ID326
Feature activation+1.95e+00

Pos logit contributions
 type+0.017
 sorts+0.016
 grun+0.016
 shape+0.015
 Liz+0.015
Neg logit contributions
lor-0.022
-0.021
 skies-0.020
ival-0.020
ane-0.020
Token the
Token ID262
Feature activation-8.11e-01

Pos logit contributions
 type+0.038
 grun+0.035
 sorts+0.035
uzzle+0.034
 shape+0.034
Neg logit contributions
lor-0.043
-0.041
ane-0.040
 skies-0.040
ival-0.040
Token girl
Token ID2576
Feature activation+1.43e+00

Pos logit contributions
lor+0.022
ival+0.020
+0.020
ane+0.020
hel+0.020
Neg logit contributions
 type-0.015
 shape-0.014
 their-0.013
 its-0.013
 sorts-0.013
Token laughed
Token ID13818
Feature activation+3.40e-01

Pos logit contributions
 type+0.025
 sorts+0.023
 Liz+0.023
 their+0.022
 shape+0.022
Neg logit contributions
lor-0.035
-0.034
ival-0.033
ane-0.033
actus-0.032
Token out
Token ID503
Feature activation+9.39e+00

Pos logit contributions
 type+0.008
 sorts+0.007
 grun+0.007
 Liz+0.007
 shape+0.007
Neg logit contributions
lor-0.007
-0.007
ane-0.006
 skies-0.006
ival-0.006
Token loud
Token ID7812
Feature activation+2.93e+00

Pos logit contributions
 type+0.188
 sorts+0.171
ooks+0.165
ook+0.162
 Liz+0.162
Neg logit contributions
 skies-0.191
-0.190
lor-0.185
ival-0.179
hel-0.178
Token.
Token ID13
Feature activation+1.32e+00

Pos logit contributions
 type+0.051
 sorts+0.046
 shape+0.045
 Liz+0.045
 grun+0.045
Neg logit contributions
lor-0.076
-0.072
ival-0.070
ane-0.070
 skies-0.069
Token
Token ID220
Feature activation+8.43e-03

Pos logit contributions
 type+0.024
 Liz+0.023
 its+0.022
 their+0.022
 shape+0.022
Neg logit contributions
lor-0.033
ival-0.031
-0.030
ane-0.030
actus-0.030
Token
Token ID198
Feature activation-1.18e+00

Pos logit contributions
 type+0.000
 Liz+0.000
 their+0.000
 its+0.000
 sorts+0.000
Neg logit contributions
lor-0.000
-0.000
ival-0.000
actus-0.000
 skies-0.000
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
lor+0.028
+0.027
ane+0.026
actus+0.026
ival+0.026
Neg logit contributions
 type-0.022
 its-0.021
 Liz-0.021
 their-0.021
 sorts-0.021
Token is
Token ID318
Feature activation+1.69e+00

Pos logit contributions
 type+0.007
 grun+0.007
 Liz+0.007
 sorts+0.007
 shape+0.007
Neg logit contributions
lor-0.006
ane-0.006
ival-0.006
hel-0.005
-0.005
Token this
Token ID428
Feature activation+1.79e+00

Pos logit contributions
 type+0.044
 sorts+0.041
 shape+0.040
 grun+0.040
 Liz+0.040
Neg logit contributions
lor-0.029
-0.027
ival-0.027
ane-0.026
 skies-0.026
Token?"
Token ID1701
Feature activation-1.05e+00

Pos logit contributions
 type+0.037
 sorts+0.034
 Liz+0.034
 their+0.034
 grun+0.034
Neg logit contributions
lor-0.041
-0.038
ane-0.037
ival-0.037
 skies-0.036
Token she
Token ID673
Feature activation+1.41e+00

Pos logit contributions
lor+0.022
+0.021
ival+0.020
ane+0.019
 skies+0.019
Neg logit contributions
 type-0.024
 sorts-0.022
 shape-0.022
 grun-0.021
 Liz-0.021
Token said
Token ID531
Feature activation-1.91e+00

Pos logit contributions
 type+0.025
 sorts+0.023
 their+0.022
 shape+0.022
 Liz+0.022
Neg logit contributions
lor-0.035
-0.032
 skies-0.032
ival-0.032
ane-0.031
Token out
Token ID503
Feature activation+9.36e+00

Pos logit contributions
+0.036
lor+0.035
ane+0.034
hel+0.034
ival+0.032
Neg logit contributions
 type-0.043
 sorts-0.042
cest-0.038
 Liz-0.037
ooks-0.036
Token loud
Token ID7812
Feature activation+2.64e+00

Pos logit contributions
 type+0.184
 sorts+0.177
ooks+0.167
cest+0.158
 proper+0.158
Neg logit contributions
hel-0.199
-0.193
lor-0.190
 skies-0.185
ane-0.184
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
 type+0.047
 sorts+0.043
 shape+0.042
 Liz+0.041
 their+0.041
Neg logit contributions
lor-0.068
-0.064
ival-0.062
ane-0.062
 skies-0.061
Token She
Token ID1375
Feature activation+1.18e+00

Pos logit contributions
 type+0.026
 shape+0.023
 its+0.023
 sorts+0.023
 their+0.023
Neg logit contributions
lor-0.050
-0.048
ival-0.047
ane-0.046
actus-0.045
Token tried
Token ID3088
Feature activation-1.76e+00

Pos logit contributions
 type+0.021
 sorts+0.020
 their+0.019
 Liz+0.019
 shape+0.019
Neg logit contributions
lor-0.029
-0.027
ival-0.027
 skies-0.027
ane-0.026
Token to
Token ID284
Feature activation-1.71e+00

Pos logit contributions
lor+0.041
+0.040
ival+0.039
ane+0.038
 skies+0.038
Neg logit contributions
 type-0.034
 sorts-0.031
 shape-0.030
 grun-0.030
 Liz-0.029
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
lor+0.001
ane+0.001
+0.001
ival+0.001
hel+0.001
Neg logit contributions
 type-0.001
 sorts-0.001
 grun-0.001
 Liz-0.001
 shape-0.001
Token The
Token ID383
Feature activation-8.50e-01

Pos logit contributions
 type+0.020
 shape+0.018
 sorts+0.018
 its+0.018
 their+0.018
Neg logit contributions
lor-0.034
-0.032
ival-0.031
ane-0.031
actus-0.030
Token little
Token ID1310
Feature activation-4.74e-01

Pos logit contributions
lor+0.023
+0.022
ane+0.021
hel+0.021
ops+0.020
Neg logit contributions
 type-0.015
 shape-0.015
 their-0.013
 grun-0.013
 Liz-0.013
Token boy
Token ID2933
Feature activation+1.11e+00

Pos logit contributions
lor+0.011
+0.011
ival+0.010
ane+0.010
 skies+0.010
Neg logit contributions
 type-0.009
 sorts-0.009
 shape-0.008
 its-0.008
 grun-0.008
Token laughed
Token ID13818
Feature activation-2.47e-01

Pos logit contributions
 type+0.022
 sorts+0.021
uzzle+0.020
 their+0.020
ooks+0.020
Neg logit contributions
ival-0.023
lor-0.022
 skies-0.022
-0.022
ane-0.021
Token out
Token ID503
Feature activation+9.35e+00

Pos logit contributions
lor+0.004
+0.004
 skies+0.004
ival+0.004
ane+0.004
Neg logit contributions
 type-0.006
 sorts-0.006
 Liz-0.006
 grun-0.006
 shape-0.006
Token loud
Token ID7812
Feature activation+2.23e+00

Pos logit contributions
 type+0.199
ooks+0.187
ook+0.181
 sorts+0.179
 Liz+0.174
Neg logit contributions
 skies-0.194
-0.180
hel-0.174
 sky-0.170
ival-0.168
Token!
Token ID0
Feature activation+1.10e+00

Pos logit contributions
 type+0.049
 sorts+0.045
 Liz+0.045
 shape+0.045
 grun+0.044
Neg logit contributions
lor-0.047
-0.044
ival-0.043
 skies-0.042
ane-0.042
Token
Token ID198
Feature activation-1.26e+00

Pos logit contributions
 type+0.018
 shape+0.016
 sorts+0.016
 grun+0.016
 its+0.016
Neg logit contributions
lor-0.030
-0.029
ival-0.028
ane-0.027
 skies-0.027
Token
Token ID198
Feature activation-1.26e+00

Pos logit contributions
lor+0.028
+0.026
ival+0.025
ane+0.025
actus+0.025
Neg logit contributions
 type-0.027
 shape-0.025
 its-0.025
 their-0.025
 Liz-0.025
TokenHe
Token ID1544
Feature activation+5.83e-01

Pos logit contributions
lor+0.028
+0.027
ival+0.026
ane+0.025
 skies+0.025
Neg logit contributions
 type-0.027
 shape-0.025
 sorts-0.025
 its-0.025
 their-0.024
Token smile
Token ID8212
Feature activation+6.02e-01

Pos logit contributions
 type+0.057
 sorts+0.054
ook+0.054
 Liz+0.054
ooks+0.052
Neg logit contributions
-0.062
lor-0.058
ane-0.057
 skies-0.056
ival-0.056
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
 type+0.013
 sorts+0.012
 shape+0.011
 Liz+0.011
 grun+0.011
Neg logit contributions
lor-0.014
-0.013
ane-0.012
ival-0.012
 skies-0.012
Token and
Token ID290
Feature activation-2.28e+00

Pos logit contributions
 type+0.023
 shape+0.021
 its+0.021
 their+0.021
 sorts+0.021
Neg logit contributions
lor-0.024
-0.022
ival-0.022
ane-0.021
actus-0.021
Token Tom
Token ID4186
Feature activation+6.63e-01

Pos logit contributions
ival+0.040
lor+0.039
+0.038
 skies+0.037
hel+0.037
Neg logit contributions
 type-0.052
ooks-0.048
uzzle-0.047
ook-0.046
 sorts-0.046
Token laughed
Token ID13818
Feature activation+1.09e-01

Pos logit contributions
 type+0.014
 sorts+0.012
 Liz+0.012
 shape+0.012
 its+0.012
Neg logit contributions
lor-0.014
ival-0.013
-0.013
ane-0.013
hel-0.013
Token out
Token ID503
Feature activation+9.33e+00

Pos logit contributions
 type+0.003
 sorts+0.002
 grun+0.002
 Liz+0.002
 shape+0.002
Neg logit contributions
-0.002
lor-0.002
 skies-0.002
ival-0.002
hel-0.002
Token loud
Token ID7812
Feature activation+2.25e+00

Pos logit contributions
 type+0.193
 Liz+0.182
ooks+0.182
 sorts+0.173
ook+0.171
Neg logit contributions
 skies-0.177
-0.177
hel-0.170
ival-0.169
ican-0.159
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
 type+0.041
 sorts+0.037
 Liz+0.036
 grun+0.035
 shape+0.035
Neg logit contributions
lor-0.056
-0.054
 skies-0.052
ival-0.052
ane-0.051
Token
Token ID220
Feature activation+8.10e-01

Pos logit contributions
 type+0.025
 shape+0.023
 sorts+0.023
 its+0.022
 Liz+0.022
Neg logit contributions
lor-0.038
-0.036
ival-0.035
ane-0.035
actus-0.034
Token
Token ID198
Feature activation-7.23e-01

Pos logit contributions
 type+0.018
 shape+0.017
 sorts+0.017
 Liz+0.017
 their+0.016
Neg logit contributions
lor-0.017
-0.016
ival-0.016
ane-0.015
 skies-0.015
Token
Token ID198
Feature activation-6.45e-01

Pos logit contributions
lor+0.016
+0.015
ane+0.015
ival+0.015
 skies+0.014
Neg logit contributions
 type-0.015
 their-0.014
 Liz-0.014
 sorts-0.014
 its-0.014
Token
Token ID198
Feature activation-8.74e-01

Pos logit contributions
 type+0.024
 shape+0.022
 sorts+0.022
 its+0.022
 Liz+0.022
Neg logit contributions
lor-0.036
-0.034
ival-0.034
ane-0.033
actus-0.033
Token
Token ID198
Feature activation-8.33e-01

Pos logit contributions
lor+0.019
+0.017
ival+0.017
ane+0.017
 skies+0.016
Neg logit contributions
 type-0.019
 shape-0.018
 sorts-0.018
 its-0.018
 Liz-0.018
TokenThe
Token ID464
Feature activation-5.58e-01

Pos logit contributions
lor+0.018
+0.017
ival+0.017
ane+0.017
actus+0.016
Neg logit contributions
 type-0.018
 shape-0.017
 sorts-0.016
 its-0.016
 their-0.016
Token creature
Token ID7185
Feature activation+2.69e+00

Pos logit contributions
lor+0.016
+0.014
ane+0.014
ival+0.014
hel+0.014
Neg logit contributions
 type-0.010
 shape-0.009
 their-0.008
 sorts-0.008
 its-0.008
Token laughed
Token ID13818
Feature activation+2.60e-02

Pos logit contributions
 type+0.047
 sorts+0.042
 shape+0.042
 Liz+0.042
 their+0.041
Neg logit contributions
lor-0.069
-0.066
ival-0.065
ane-0.063
 skies-0.062
Token out
Token ID503
Feature activation+9.29e+00

Pos logit contributions
 type+0.001
 sorts+0.001
 Liz+0.001
 grun+0.001
 shape+0.001
Neg logit contributions
lor-0.000
-0.000
 skies-0.000
ival-0.000
ane-0.000
Token loud
Token ID7812
Feature activation+2.54e+00

Pos logit contributions
 type+0.223
 sorts+0.202
 proper+0.192
ooks+0.191
ook+0.187
Neg logit contributions
 skies-0.172
-0.155
 next-0.151
 sky-0.148
hel-0.146
Token and
Token ID290
Feature activation-1.74e+00

Pos logit contributions
 type+0.055
 sorts+0.051
 Liz+0.050
 shape+0.049
 grun+0.049
Neg logit contributions
lor-0.055
-0.052
ival-0.050
 skies-0.050
ane-0.049
Token walked
Token ID6807
Feature activation+8.75e-02

Pos logit contributions
 skies+0.030
ival+0.030
hel+0.029
+0.029
lor+0.028
Neg logit contributions
ooks-0.039
 type-0.038
uzzle-0.037
ook-0.037
rum-0.035
Token away
Token ID1497
Feature activation+2.13e+00

Pos logit contributions
 type+0.002
 grun+0.002
 Liz+0.002
 sorts+0.002
apy+0.002
Neg logit contributions
lor-0.002
ival-0.002
 skies-0.002
hel-0.001
-0.001
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
 type+0.052
 sorts+0.048
 Liz+0.047
 grun+0.045
 proper+0.043
Neg logit contributions
-0.035
 skies-0.034
lor-0.033
hel-0.032
 sky-0.030
Token book
Token ID1492
Feature activation-1.48e+00

Pos logit contributions
lor+0.054
+0.052
ival+0.049
ane+0.049
actus+0.047
Neg logit contributions
 type-0.052
 shape-0.049
 sorts-0.046
 its-0.046
 grun-0.045
Token and
Token ID290
Feature activation-2.35e+00

Pos logit contributions
lor+0.037
+0.035
actus+0.034
ival+0.033
ican+0.033
Neg logit contributions
 type-0.029
 shape-0.027
 its-0.027
 their-0.026
 sorts-0.026
Token said
Token ID531
Feature activation-2.13e+00

Pos logit contributions
hel+0.039
 skies+0.039
ival+0.038
lor+0.036
There+0.036
Neg logit contributions
 type-0.056
ooks-0.054
uzzle-0.054
 sorts-0.053
ook-0.052
Token the
Token ID262
Feature activation-1.01e+00

Pos logit contributions
lor+0.052
ival+0.049
+0.048
ane+0.048
 skies+0.045
Neg logit contributions
 type-0.041
 sorts-0.040
 shape-0.037
 Liz-0.037
 grun-0.037
Token word
Token ID1573
Feature activation+1.53e+00

Pos logit contributions
lor+0.025
+0.024
ival+0.023
ane+0.023
actus+0.022
Neg logit contributions
 type-0.020
 shape-0.018
 sorts-0.017
 its-0.017
 grun-0.017
Token out
Token ID503
Feature activation+9.29e+00

Pos logit contributions
 type+0.028
 sorts+0.026
 grun+0.025
 shape+0.025
 Liz+0.024
Neg logit contributions
lor-0.037
ane-0.036
ival-0.035
-0.035
 skies-0.034
Token loud
Token ID7812
Feature activation+2.69e+00

Pos logit contributions
 sorts+0.174
 type+0.174
ooks+0.161
cest+0.156
 Suz+0.153
Neg logit contributions
lor-0.208
 skies-0.194
ival-0.193
hel-0.188
ane-0.188
Token.
Token ID13
Feature activation+2.88e+00

Pos logit contributions
 type+0.047
 sorts+0.043
 shape+0.042
 Liz+0.042
 grun+0.042
Neg logit contributions
lor-0.069
-0.065
ival-0.064
ane-0.063
 skies-0.063
Token "
Token ID366
Feature activation+2.96e-01

Pos logit contributions
 type+0.048
 sorts+0.043
 shape+0.042
 grun+0.040
 their+0.040
Neg logit contributions
lor-0.077
-0.076
ival-0.073
ane-0.071
 skies-0.071
TokenSpell
Token ID31221
Feature activation-6.69e-01

Pos logit contributions
 type+0.006
 shape+0.006
 sorts+0.006
 its+0.006
 grun+0.006
Neg logit contributions
lor-0.007
-0.006
ival-0.006
ane-0.006
actus-0.006
Token!"
Token ID2474
Feature activation-6.27e-01

Pos logit contributions
lor+0.016
+0.015
ival+0.015
actus+0.014
 skies+0.014
Neg logit contributions
 type-0.013
 shape-0.012
 sorts-0.012
 its-0.012
 their-0.012
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
 type+0.020
 sorts+0.019
 grun+0.018
 shape+0.018
 Liz+0.018
Neg logit contributions
lor-0.028
-0.026
ival-0.026
ane-0.025
 skies-0.025
Token F
Token ID376
Feature activation+2.11e+00

Pos logit contributions
 type+0.033
 Liz+0.031
 shape+0.031
 sorts+0.030
 its+0.030
Neg logit contributions
lor-0.039
-0.036
ival-0.036
ane-0.035
actus-0.035
Tokenolding
Token ID33266
Feature activation+2.87e+00

Pos logit contributions
 type+0.037
 shape+0.032
 sorts+0.032
 Liz+0.032
 grun+0.031
Neg logit contributions
lor-0.057
-0.054
ival-0.053
 skies-0.052
ane-0.051
Token the
Token ID262
Feature activation-1.16e+00

Pos logit contributions
 type+0.056
 shape+0.051
 its+0.051
 sorts+0.051
 their+0.050
Neg logit contributions
lor-0.069
-0.067
ival-0.064
ane-0.063
hel-0.061
Token phone
Token ID3072
Feature activation+1.93e+00

Pos logit contributions
lor+0.029
+0.027
ival+0.026
ane+0.026
hel+0.026
Neg logit contributions
 type-0.024
 shape-0.022
 their-0.021
 sorts-0.021
 grun-0.020
Token out
Token ID503
Feature activation+9.29e+00

Pos logit contributions
 type+0.037
 sorts+0.034
 shape+0.034
 grun+0.034
 its+0.033
Neg logit contributions
lor-0.046
-0.044
ival-0.043
ane-0.042
 skies-0.041
Token is
Token ID318
Feature activation+2.21e+00

Pos logit contributions
 type+0.188
 grun+0.175
 sorts+0.174
 shape+0.168
 their+0.166
Neg logit contributions
lor-0.211
ival-0.196
-0.196
ane-0.194
 skies-0.187
Token a
Token ID257
Feature activation+2.84e+00

Pos logit contributions
 type+0.044
 sorts+0.043
 grun+0.041
 Liz+0.040
 shape+0.040
Neg logit contributions
lor-0.046
-0.042
 skies-0.042
ival-0.042
ane-0.040
Token special
Token ID2041
Feature activation+1.18e+00

Pos logit contributions
 Liz+0.050
 sorts+0.049
 type+0.049
 grun+0.048
 its+0.047
Neg logit contributions
lor-0.067
-0.066
ival-0.063
actus-0.060
ane-0.060
Token thing
Token ID1517
Feature activation+6.84e-01

Pos logit contributions
 type+0.023
 grun+0.023
 Liz+0.022
 its+0.022
 sorts+0.022
Neg logit contributions
lor-0.025
-0.024
ane-0.023
ival-0.023
actus-0.022
Token you
Token ID345
Feature activation+1.75e+00

Pos logit contributions
 type+0.013
 grun+0.012
 sorts+0.012
 shape+0.012
 Liz+0.012
Neg logit contributions
lor-0.016
-0.015
ane-0.015
ival-0.015
 skies-0.014
Token.
Token ID13
Feature activation+2.01e+00

Pos logit contributions
lor+0.032
+0.030
ival+0.030
ane+0.029
 skies+0.029
Neg logit contributions
 type-0.018
 shape-0.016
 sorts-0.016
 its-0.016
 their-0.016
Token They
Token ID1119
Feature activation+1.21e+00

Pos logit contributions
 type+0.030
 shape+0.026
 sorts+0.026
 grun+0.025
 its+0.025
Neg logit contributions
lor-0.058
-0.056
ival-0.054
ane-0.053
 skies-0.053
Token said
Token ID531
Feature activation-2.25e+00

Pos logit contributions
 type+0.023
 Liz+0.022
 sorts+0.021
uzzle+0.021
 its+0.021
Neg logit contributions
hel-0.027
-0.026
ival-0.026
lor-0.025
 skies-0.024
Token the
Token ID262
Feature activation-1.16e+00

Pos logit contributions
lor+0.055
+0.053
ival+0.052
ane+0.051
 skies+0.050
Neg logit contributions
 type-0.043
 sorts-0.039
 shape-0.039
 grun-0.038
 Liz-0.038
Token word
Token ID1573
Feature activation+1.28e+00

Pos logit contributions
lor+0.028
+0.027
ival+0.026
ane+0.025
actus+0.025
Neg logit contributions
 type-0.024
 shape-0.022
 its-0.021
 their-0.021
 sorts-0.020
Token out
Token ID503
Feature activation+9.29e+00

Pos logit contributions
 type+0.024
 sorts+0.023
 shape+0.022
 grun+0.022
 Liz+0.022
Neg logit contributions
lor-0.030
ane-0.028
-0.028
ival-0.028
 skies-0.027
Token loud
Token ID7812
Feature activation+2.50e+00

Pos logit contributions
 sorts+0.183
 type+0.181
 proper+0.161
cest+0.160
 grun+0.159
Neg logit contributions
lor-0.197
 skies-0.192
ival-0.188
-0.187
hel-0.184
Token.
Token ID13
Feature activation+2.42e+00

Pos logit contributions
 type+0.044
 shape+0.040
 sorts+0.040
 its+0.040
 their+0.039
Neg logit contributions
lor-0.065
-0.062
ival-0.060
ane-0.059
actus-0.058
Token
Token ID198
Feature activation+8.64e-02

Pos logit contributions
 type+0.046
 shape+0.042
 sorts+0.042
 grun+0.041
 its+0.041
Neg logit contributions
lor-0.059
-0.056
ival-0.055
ane-0.054
 skies-0.052
Token
Token ID198
Feature activation+1.11e-01

Pos logit contributions
 type+0.002
 shape+0.002
 sorts+0.002
 grun+0.002
 its+0.002
Neg logit contributions
lor-0.002
-0.002
ival-0.002
ane-0.002
 skies-0.001
Token"
Token ID1
Feature activation-1.23e+00

Pos logit contributions
 type+0.002
 shape+0.002
 sorts+0.002
 its+0.002
 their+0.002
Neg logit contributions
lor-0.003
-0.002
ival-0.002
ane-0.002
 skies-0.002
Token some
Token ID617
Feature activation+1.95e+00

Pos logit contributions
 type+0.053
 sorts+0.048
 shape+0.046
 grun+0.045
uzzle+0.044
Neg logit contributions
lor-0.061
-0.060
 skies-0.057
ival-0.056
ane-0.056
Token blood
Token ID2910
Feature activation+4.01e+00

Pos logit contributions
 type+0.050
 sorts+0.049
uzzle+0.045
 Liz+0.045
ook+0.045
Neg logit contributions
-0.032
 skies-0.031
lor-0.029
ival-0.028
ops-0.028
Token.
Token ID13
Feature activation+2.44e+00

Pos logit contributions
 type+0.080
 sorts+0.072
 shape+0.070
 Liz+0.069
 grun+0.069
Neg logit contributions
lor-0.094
-0.090
ane-0.086
hel-0.086
ival-0.085
Token She
Token ID1375
Feature activation+1.77e+00

Pos logit contributions
 type+0.043
 Liz+0.041
 its+0.040
 shape+0.040
 their+0.040
Neg logit contributions
lor-0.063
ival-0.059
ane-0.058
-0.057
hel-0.056
Token cried
Token ID16896
Feature activation+2.21e+00

Pos logit contributions
 type+0.038
 sorts+0.037
uzzle+0.034
 Liz+0.033
cest+0.032
Neg logit contributions
lor-0.037
actus-0.035
 skies-0.035
-0.034
ival-0.033
Token out
Token ID503
Feature activation+9.29e+00

Pos logit contributions
 type+0.052
 sorts+0.049
 shape+0.047
 grun+0.047
 Liz+0.046
Neg logit contributions
lor-0.043
-0.041
ival-0.039
ane-0.039
 skies-0.039
Token loud
Token ID7812
Feature activation+2.47e+00

Pos logit contributions
 type+0.217
 sorts+0.205
ooks+0.200
ook+0.193
cest+0.193
Neg logit contributions
hel-0.150
-0.147
 skies-0.145
 next-0.134
't-0.132
Token and
Token ID290
Feature activation-1.85e+00

Pos logit contributions
 type+0.049
 its+0.046
 shape+0.045
 their+0.045
 Liz+0.044
Neg logit contributions
lor-0.059
-0.057
ival-0.055
ane-0.054
actus-0.052
Token ran
Token ID4966
Feature activation+1.10e-01

Pos logit contributions
There+0.028
ival+0.025
 skies+0.024
 next+0.024
 unatt+0.024
Neg logit contributions
 sorts-0.046
 type-0.046
uzzle-0.044
ooks-0.044
ook-0.042
Token to
Token ID284
Feature activation+8.09e-01

Pos logit contributions
 type+0.002
 sorts+0.002
 shape+0.002
 grun+0.002
 Liz+0.002
Neg logit contributions
lor-0.002
-0.002
ival-0.002
ane-0.002
 skies-0.002
Token her
Token ID607
Feature activation-6.76e-01

Pos logit contributions
 type+0.016
cest+0.015
 sorts+0.015
 shape+0.014
ooks+0.013
Neg logit contributions
-0.019
lor-0.017
hel-0.017
ican-0.016
ival-0.016
Token sharp
Token ID7786
Feature activation+4.49e+00

Pos logit contributions
 Liz+0.070
 sorts+0.069
 type+0.068
 Suz+0.063
 Maria+0.061
Neg logit contributions
lor-0.078
-0.077
actus-0.072
hel-0.068
ival-0.068
Token pain
Token ID2356
Feature activation+2.36e+00

Pos logit contributions
 sorts+0.100
 type+0.099
 Liz+0.098
 their+0.090
 Maria+0.089
Neg logit contributions
lor-0.086
-0.084
ane-0.079
actus-0.079
hel-0.077
Token.
Token ID13
Feature activation+2.84e+00

Pos logit contributions
 type+0.051
 sorts+0.050
 Liz+0.047
 grun+0.045
 shape+0.043
Neg logit contributions
lor-0.049
-0.048
ane-0.046
hel-0.044
 skies-0.043
Token She
Token ID1375
Feature activation+1.60e+00

Pos logit contributions
 type+0.060
 sorts+0.055
 shape+0.055
 grun+0.054
 their+0.053
Neg logit contributions
lor-0.064
-0.061
ival-0.059
ane-0.058
 skies-0.057
Token cries
Token ID24691
Feature activation+3.02e+00

Pos logit contributions
 type+0.034
 shape+0.032
 sorts+0.031
 its+0.031
 grun+0.031
Neg logit contributions
lor-0.036
-0.034
ival-0.033
ane-0.032
 skies-0.031
Token out
Token ID503
Feature activation+9.28e+00

Pos logit contributions
 type+0.067
 sorts+0.062
 shape+0.059
 grun+0.059
 Liz+0.058
Neg logit contributions
lor-0.065
-0.061
ane-0.059
ival-0.059
 skies-0.058
Token loud
Token ID7812
Feature activation+2.43e+00

Pos logit contributions
 type+0.232
 sorts+0.226
ooks+0.216
ook+0.207
 proper+0.205
Neg logit contributions
 next-0.139
hel-0.134
't-0.133
 skies-0.133
-0.120
Token.
Token ID13
Feature activation+3.05e+00

Pos logit contributions
 type+0.041
 its+0.039
 their+0.039
 shape+0.038
 Liz+0.036
Neg logit contributions
lor-0.065
-0.063
ival-0.062
ane-0.060
actus-0.058
Token
Token ID198
Feature activation-2.29e-02

Pos logit contributions
 type+0.081
 sorts+0.078
 shape+0.073
cest+0.072
 grun+0.072
Neg logit contributions
-0.054
lor-0.053
 skies-0.047
ane-0.047
ival-0.046
Token
Token ID198
Feature activation+5.23e-02

Pos logit contributions
lor+0.001
+0.000
ival+0.000
ane+0.000
 skies+0.000
Neg logit contributions
 type-0.000
 its-0.000
 their-0.000
 Liz-0.000
 shape-0.000
Token"
Token ID1
Feature activation-1.19e+00

Pos logit contributions
 type+0.001
 shape+0.001
 sorts+0.001
 its+0.001
 their+0.001
Neg logit contributions
lor-0.001
-0.001
ival-0.001
ane-0.001
 skies-0.001
Token the
Token ID262
Feature activation-2.37e+00

Pos logit contributions
 type+0.022
 sorts+0.020
 shape+0.019
 grun+0.018
 Liz+0.018
Neg logit contributions
lor-0.043
-0.043
ival-0.041
ane-0.040
 skies-0.040
Token floor
Token ID4314
Feature activation+2.13e+00

Pos logit contributions
lor+0.041
+0.038
ival+0.037
ane+0.036
 skies+0.035
Neg logit contributions
 type-0.062
 sorts-0.058
 Liz-0.057
 shape-0.057
 their-0.057
Token.
Token ID13
Feature activation+2.88e+00

Pos logit contributions
 type+0.044
 shape+0.041
 its+0.040
 sorts+0.040
 their+0.040
Neg logit contributions
lor-0.050
-0.047
ival-0.046
ane-0.045
actus-0.043
Token She
Token ID1375
Feature activation+1.68e+00

Pos logit contributions
 type+0.061
 shape+0.056
 sorts+0.055
 its+0.054
 their+0.054
Neg logit contributions
lor-0.066
-0.062
ival-0.060
ane-0.059
 skies-0.058
Token cries
Token ID24691
Feature activation+3.15e+00

Pos logit contributions
 type+0.039
 shape+0.036
 sorts+0.036
 its+0.036
 their+0.036
Neg logit contributions
lor-0.034
-0.032
ival-0.031
ane-0.031
 skies-0.030
Token out
Token ID503
Feature activation+9.27e+00

Pos logit contributions
 type+0.074
 sorts+0.069
 grun+0.064
 shape+0.064
 Liz+0.062
Neg logit contributions
lor-0.063
-0.059
ane-0.058
 skies-0.058
ival-0.056
Token loud
Token ID7812
Feature activation+2.67e+00

Pos logit contributions
 type+0.233
 sorts+0.225
ooks+0.214
ook+0.208
 proper+0.204
Neg logit contributions
 next-0.137
 skies-0.131
't-0.130
hel-0.130
 sky-0.118
Token.
Token ID13
Feature activation+3.16e+00

Pos logit contributions
 type+0.044
 its+0.042
 their+0.042
 shape+0.042
 Liz+0.040
Neg logit contributions
lor-0.072
-0.070
ival-0.068
ane-0.066
actus-0.064
Token The
Token ID383
Feature activation-6.16e-01

Pos logit contributions
 type+0.076
 sorts+0.072
 shape+0.069
 grun+0.068
 Liz+0.067
Neg logit contributions
lor-0.063
-0.061
ane-0.056
ival-0.056
 skies-0.055
Token music
Token ID2647
Feature activation-3.61e-01

Pos logit contributions
lor+0.014
+0.013
ival+0.013
ane+0.013
hel+0.012
Neg logit contributions
 type-0.013
 shape-0.012
 sorts-0.012
 grun-0.012
 its-0.012
Token stops
Token ID9911
Feature activation-3.35e-01

Pos logit contributions
lor+0.009
+0.008
ival+0.008
ane+0.008
 skies+0.008
Neg logit contributions
 type-0.007
 sorts-0.006
 shape-0.006
 Liz-0.006
 its-0.006
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
 type+0.049
 sorts+0.045
 shape+0.040
 grun+0.039
 Liz+0.038
Neg logit contributions
lor-0.105
-0.102
 skies-0.097
ival-0.096
ane-0.096
Token arm
Token ID3211
Feature activation-5.98e-02

Pos logit contributions
lor+0.039
+0.037
ival+0.036
ane+0.035
 skies+0.034
Neg logit contributions
 type-0.042
 sorts-0.038
 shape-0.038
 its-0.038
 Liz-0.038
Token.
Token ID13
Feature activation+2.81e+00

Pos logit contributions
lor+0.002
+0.002
ival+0.002
ane+0.002
 skies+0.002
Neg logit contributions
 type-0.001
 shape-0.001
 sorts-0.001
 its-0.001
 their-0.001
Token Ben
Token ID3932
Feature activation+5.21e-01

Pos logit contributions
 type+0.051
 sorts+0.047
 shape+0.044
 grun+0.043
cest+0.042
Neg logit contributions
-0.072
lor-0.071
ival-0.067
 skies-0.066
ane-0.065
Token cries
Token ID24691
Feature activation+3.03e+00

Pos logit contributions
 type+0.011
 shape+0.010
 grun+0.010
 its+0.010
 their+0.010
Neg logit contributions
lor-0.012
-0.011
ival-0.011
ane-0.011
ari-0.011
Token out
Token ID503
Feature activation+9.27e+00

Pos logit contributions
 type+0.075
 sorts+0.071
 shape+0.066
 grun+0.066
 Liz+0.065
Neg logit contributions
lor-0.055
-0.052
ane-0.051
ival-0.050
 skies-0.050
Token and
Token ID290
Feature activation-2.27e+00

Pos logit contributions
 sorts+0.238
 type+0.225
ooks+0.213
 proper+0.212
ook+0.206
Neg logit contributions
 next-0.142
't-0.133
 skies-0.130
-0.128
hel-0.120
Token lets
Token ID8781
Feature activation+4.75e-01

Pos logit contributions
lor+0.040
+0.038
 skies+0.037
actus+0.037
ival+0.036
Neg logit contributions
 type-0.056
 sorts-0.052
ook-0.049
 shape-0.048
ooks-0.048
Token go
Token ID467
Feature activation-2.80e+00

Pos logit contributions
 type+0.010
 sorts+0.010
 shape+0.009
cest+0.009
 grun+0.009
Neg logit contributions
lor-0.010
-0.010
ival-0.009
ane-0.009
hel-0.009
Token of
Token ID286
Feature activation+3.43e+00

Pos logit contributions
lor+0.074
+0.070
ival+0.068
ane+0.067
 skies+0.066
Neg logit contributions
 type-0.049
 shape-0.044
 sorts-0.044
 their-0.044
 Liz-0.044
Token Lily
Token ID20037
Feature activation-1.02e+00

Pos logit contributions
 type+0.050
 sorts+0.044
 shape+0.042
 grun+0.042
ook+0.040
Neg logit contributions
-0.096
lor-0.094
 skies-0.090
ival-0.090
ane-0.087
Token the
Token ID262
Feature activation-1.30e+00

Pos logit contributions
 skies+0.040
hel+0.039
 salon+0.038
 rides+0.038
lor+0.038
Neg logit contributions
 type-0.045
ooks-0.043
ook-0.043
 sorts-0.042
uzzle-0.040
Token arm
Token ID3211
Feature activation+5.83e-01

Pos logit contributions
lor+0.028
+0.027
 skies+0.026
ival+0.026
ane+0.026
Neg logit contributions
 type-0.028
 sorts-0.026
 Liz-0.025
 its-0.025
 grun-0.025
Token.
Token ID13
Feature activation+2.72e+00

Pos logit contributions
 type+0.012
 sorts+0.010
 grun+0.010
 shape+0.010
 their+0.010
Neg logit contributions
lor-0.014
-0.013
ane-0.013
ival-0.013
 skies-0.013
Token He
Token ID679
Feature activation+7.40e-01

Pos logit contributions
 type+0.052
 sorts+0.048
 shape+0.046
 grun+0.044
 its+0.043
Neg logit contributions
-0.067
lor-0.066
ival-0.062
 skies-0.061
ane-0.061
Token cries
Token ID24691
Feature activation+3.02e+00

Pos logit contributions
 type+0.016
 sorts+0.014
 Liz+0.014
 its+0.014
 shape+0.014
Neg logit contributions
lor-0.016
-0.015
ival-0.015
 skies-0.015
ane-0.015
Token out
Token ID503
Feature activation+9.27e+00

Pos logit contributions
 type+0.080
 sorts+0.075
 grun+0.070
 shape+0.068
ook+0.067
Neg logit contributions
lor-0.047
 skies-0.046
ane-0.046
-0.045
hel-0.044
Token in
Token ID287
Feature activation+6.37e+00

Pos logit contributions
 type+0.220
 sorts+0.220
ooks+0.206
 proper+0.201
ook+0.198
Neg logit contributions
 next-0.150
 skies-0.144
't-0.138
hel-0.133
-0.132
Token pain
Token ID2356
Feature activation+2.87e+00

Pos logit contributions
ook+0.156
ooks+0.147
 type+0.146
 sorts+0.142
apy+0.136
Neg logit contributions
-0.104
 skies-0.100
There-0.097
stand-0.097
 course-0.095
Token.
Token ID13
Feature activation+3.13e+00

Pos logit contributions
 type+0.060
 sorts+0.056
 grun+0.052
 Liz+0.052
 shape+0.051
Neg logit contributions
lor-0.064
-0.061
 skies-0.059
ane-0.059
ival-0.057
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
 type+0.087
 sorts+0.084
 shape+0.079
cest+0.078
 grun+0.075
Neg logit contributions
-0.053
lor-0.049
ival-0.044
 skies-0.043
There-0.042
Token
Token ID198
Feature activation+1.70e-01

Pos logit contributions
 type+0.004
 sorts+0.004
 shape+0.004
 grun+0.004
cest+0.004
Neg logit contributions
-0.002
lor-0.002
ival-0.002
ane-0.002
There-0.002
Token the
Token ID262
Feature activation-1.87e+00

Pos logit contributions
 type+0.055
 sorts+0.050
 shape+0.047
 grun+0.046
 Liz+0.045
Neg logit contributions
lor-0.106
-0.103
 skies-0.099
ival-0.098
ane-0.097
Token cabinet
Token ID13447
Feature activation+1.81e+00

Pos logit contributions
lor+0.032
 skies+0.031
+0.031
actus+0.030
ival+0.028
Neg logit contributions
 type-0.047
 sorts-0.046
ook-0.044
 its-0.043
 their-0.043
Token.
Token ID13
Feature activation+2.99e+00

Pos logit contributions
 type+0.030
 sorts+0.027
 shape+0.027
 their+0.027
 Liz+0.027
Neg logit contributions
lor-0.048
-0.046
ival-0.044
ane-0.044
hel-0.043
Token She
Token ID1375
Feature activation+1.95e+00

Pos logit contributions
 type+0.058
 shape+0.054
 Liz+0.053
 its+0.053
 their+0.053
Neg logit contributions
lor-0.072
-0.067
ival-0.067
ane-0.065
actus-0.064
Token cried
Token ID16896
Feature activation+2.22e+00

Pos logit contributions
 type+0.040
 sorts+0.038
 their+0.035
 Liz+0.035
 shape+0.034
Neg logit contributions
lor-0.044
-0.041
ival-0.040
 skies-0.040
actus-0.040
Token out
Token ID503
Feature activation+9.24e+00

Pos logit contributions
 type+0.052
 sorts+0.049
 grun+0.047
 shape+0.046
 Liz+0.046
Neg logit contributions
lor-0.043
-0.042
ival-0.040
ane-0.040
hel-0.039
Token loud
Token ID7812
Feature activation+2.52e+00

Pos logit contributions
 type+0.215
 sorts+0.206
ooks+0.201
ook+0.195
cest+0.192
Neg logit contributions
hel-0.150
 skies-0.143
 next-0.140
-0.140
't-0.131
Token and
Token ID290
Feature activation-1.70e+00

Pos logit contributions
 type+0.052
 shape+0.049
 its+0.048
 their+0.048
 sorts+0.047
Neg logit contributions
lor-0.058
-0.055
ival-0.054
ane-0.052
actus-0.051
Token felt
Token ID2936
Feature activation+3.22e+00

Pos logit contributions
There+0.029
 skies+0.027
 salon+0.027
ival+0.026
hel+0.026
Neg logit contributions
 sorts-0.040
 type-0.039
ook-0.038
ooks-0.037
uzzle-0.036
Token a
Token ID257
Feature activation+3.69e+00

Pos logit contributions
 type+0.073
 sorts+0.067
ook+0.064
 Liz+0.062
 shape+0.062
Neg logit contributions
lor-0.061
-0.061
 skies-0.058
There-0.056
ival-0.056
Token bump
Token ID13852
Feature activation+3.88e+00

Pos logit contributions
 type+0.074
 sorts+0.073
 Liz+0.071
ook+0.066
 Suz+0.066
Neg logit contributions
lor-0.083
-0.083
actus-0.078
hel-0.076
 skies-0.074
Token his
Token ID465
Feature activation+2.86e-01

Pos logit contributions
 type+0.070
 sorts+0.065
 proper+0.054
 wond+0.053
 prod+0.052
Neg logit contributions
 skies-0.098
-0.092
There-0.092
lor-0.091
 sky-0.089
Token knee
Token ID10329
Feature activation-1.70e+00

Pos logit contributions
 type+0.006
 shape+0.006
 sorts+0.006
 its+0.006
 their+0.006
Neg logit contributions
lor-0.006
-0.006
ival-0.006
ane-0.006
 skies-0.005
Token.
Token ID13
Feature activation+2.91e+00

Pos logit contributions
lor+0.052
ival+0.049
+0.049
ane+0.048
 skies+0.047
Neg logit contributions
 type-0.023
 shape-0.021
 its-0.020
 their-0.020
 grun-0.020
Token He
Token ID679
Feature activation+1.01e+00

Pos logit contributions
 type+0.060
 sorts+0.057
cest+0.052
 shape+0.049
ook+0.048
Neg logit contributions
-0.069
lor-0.065
 skies-0.062
There-0.060
ane-0.060
Token cries
Token ID24691
Feature activation+3.08e+00

Pos logit contributions
 type+0.026
 sorts+0.024
 Liz+0.023
 shape+0.023
 their+0.022
Neg logit contributions
lor-0.018
-0.017
actus-0.016
 skies-0.016
ival-0.016
Token out
Token ID503
Feature activation+9.24e+00

Pos logit contributions
 type+0.080
 sorts+0.075
 grun+0.069
 shape+0.067
 Liz+0.065
Neg logit contributions
lor-0.050
ane-0.049
 skies-0.049
-0.048
hel-0.046
Token loud
Token ID7812
Feature activation+2.46e+00

Pos logit contributions
 type+0.229
 sorts+0.226
ooks+0.211
 proper+0.208
ook+0.202
Neg logit contributions
 next-0.140
't-0.131
 skies-0.123
hel-0.120
-0.114
Token.
Token ID13
Feature activation+3.11e+00

Pos logit contributions
 type+0.040
 shape+0.037
 its+0.036
 their+0.036
 sorts+0.035
Neg logit contributions
lor-0.067
-0.065
ival-0.063
ane-0.062
actus-0.060
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
 type+0.090
 sorts+0.089
cest+0.082
 shape+0.078
ook+0.078
Neg logit contributions
-0.047
lor-0.043
 skies-0.040
There-0.040
't-0.037
Token
Token ID198
Feature activation+1.09e-01

Pos logit contributions
 type+0.003
 sorts+0.003
 shape+0.003
cest+0.003
 grun+0.003
Neg logit contributions
lor-0.001
-0.001
ival-0.001
ane-0.001
There-0.001
Token"
Token ID1
Feature activation-1.20e+00

Pos logit contributions
 type+0.003
 sorts+0.003
 shape+0.003
 grun+0.003
 its+0.003
Neg logit contributions
lor-0.002
There-0.002
ival-0.002
-0.002
ari-0.001
Token the
Token ID262
Feature activation-2.02e+00

Pos logit contributions
 type+0.069
 sorts+0.062
ook+0.057
cest+0.056
 shape+0.055
Neg logit contributions
lor-0.115
-0.112
 skies-0.109
hel-0.108
ival-0.103
Token head
Token ID1182
Feature activation+3.39e+00

Pos logit contributions
lor+0.040
+0.037
ival+0.037
ane+0.035
hel+0.034
Neg logit contributions
 type-0.048
 shape-0.048
 its-0.046
 grun-0.046
 their-0.045
Token.
Token ID13
Feature activation+2.69e+00

Pos logit contributions
 type+0.052
 shape+0.046
 sorts+0.046
 its+0.045
 their+0.045
Neg logit contributions
lor-0.096
-0.091
ival-0.089
ane-0.088
 skies-0.086
Token Tom
Token ID4186
Feature activation+9.40e-01

Pos logit contributions
 type+0.043
 its+0.040
 shape+0.039
 their+0.039
 Liz+0.039
Neg logit contributions
lor-0.075
-0.070
ival-0.069
ane-0.068
actus-0.066
Token cries
Token ID24691
Feature activation+3.16e+00

Pos logit contributions
 type+0.016
 shape+0.016
 its+0.016
 grun+0.015
 their+0.015
Neg logit contributions
lor-0.025
-0.023
ival-0.023
ane-0.022
hel-0.021
Token out
Token ID503
Feature activation+9.24e+00

Pos logit contributions
 type+0.078
 sorts+0.073
 shape+0.067
 grun+0.067
 Liz+0.065
Neg logit contributions
lor-0.057
-0.056
ane-0.054
 skies-0.054
ival-0.053
Token in
Token ID287
Feature activation+6.30e+00

Pos logit contributions
 type+0.234
 sorts+0.223
 proper+0.211
ooks+0.206
ook+0.199
Neg logit contributions
 next-0.139
 skies-0.135
't-0.134
hel-0.129
-0.127
Token pain
Token ID2356
Feature activation+2.86e+00

Pos logit contributions
 type+0.129
ook+0.127
ooks+0.122
apy+0.119
 sorts+0.115
Neg logit contributions
 skies-0.119
-0.119
lor-0.118
hel-0.116
There-0.116
Token and
Token ID290
Feature activation-1.63e+00

Pos logit contributions
 type+0.053
 sorts+0.049
 shape+0.046
 Liz+0.045
 grun+0.045
Neg logit contributions
lor-0.072
-0.068
ane-0.066
 skies-0.065
ival-0.065
Token falls
Token ID8953
Feature activation+1.60e+00

Pos logit contributions
lor+0.030
 skies+0.030
actus+0.029
+0.028
ival+0.028
Neg logit contributions
 type-0.038
 sorts-0.035
ooks-0.033
ook-0.033
 shape-0.032
Token to
Token ID284
Feature activation+2.13e+00

Pos logit contributions
 type+0.036
 sorts+0.032
 grun+0.030
 shape+0.030
ook+0.029
Neg logit contributions
lor-0.033
ane-0.032
-0.032
 skies-0.031
hel-0.030
Token arm
Token ID3211
Feature activation-1.08e+00

Pos logit contributions
lor+0.012
+0.011
ival+0.011
ane+0.011
 skies+0.010
Neg logit contributions
 type-0.010
 shape-0.009
 sorts-0.009
 Liz-0.009
 grun-0.009
Tokenchair
Token ID16337
Feature activation-1.80e+00

Pos logit contributions
lor+0.028
+0.026
ival+0.026
ane+0.025
actus+0.025
Neg logit contributions
 type-0.019
 shape-0.018
 its-0.018
 Liz-0.018
 sorts-0.018
Token.
Token ID13
Feature activation-4.15e-03

Pos logit contributions
lor+0.048
+0.045
ival+0.045
ane+0.043
actus+0.043
Neg logit contributions
 type-0.031
 shape-0.028
 sorts-0.027
 its-0.027
 their-0.027
Token
Token ID198
Feature activation-1.48e+00

Pos logit contributions
lor+0.000
+0.000
ival+0.000
ane+0.000
actus+0.000
Neg logit contributions
 type-0.000
 Liz-0.000
 its-0.000
 sorts-0.000
 their-0.000
Token
Token ID198
Feature activation-1.34e+00

Pos logit contributions
lor+0.033
+0.031
ival+0.030
ane+0.030
 skies+0.029
Neg logit contributions
 type-0.032
 sorts-0.030
 its-0.029
 Liz-0.029
 their-0.029
TokenTHE
Token ID10970
Feature activation-8.31e+00

Pos logit contributions
lor+0.030
+0.029
ane+0.027
actus+0.027
ival+0.027
Neg logit contributions
 type-0.029
 sorts-0.026
 shape-0.026
 its-0.026
 their-0.026
Token<|endoftext|>
Token ID50256
Feature activation-6.88e+00

Pos logit contributions
lor+0.196
actus+0.187
ival+0.185
ane+0.179
 skies+0.176
Neg logit contributions
 type-0.158
 its-0.154
 their-0.151
 shape-0.151
 sorts-0.150
TokenOnce
Token ID7454
Feature activation-2.66e+00

Pos logit contributions
lor+0.155
ival+0.147
+0.145
ane+0.141
actus+0.140
Neg logit contributions
 type-0.132
 its-0.132
 sorts-0.131
 shape-0.129
 their-0.128
Token upon
Token ID2402
Feature activation-1.08e+00

Pos logit contributions
lor+0.067
+0.063
ival+0.061
 skies+0.059
ican+0.058
Neg logit contributions
 type-0.049
 their-0.048
 its-0.047
 Liz-0.047
 sorts-0.046
Token a
Token ID257
Feature activation+1.15e+00

Pos logit contributions
lor+0.041
+0.040
ane+0.040
ari+0.039
ican+0.039
Neg logit contributions
 its-0.009
 his-0.009
 their-0.008
 type-0.007
 shape-0.007
Token time
Token ID640
Feature activation-2.97e+00

Pos logit contributions
 type+0.026
 shape+0.024
 sorts+0.023
 their+0.023
 its+0.023
Neg logit contributions
lor-0.025
-0.023
ival-0.023
ane-0.022
 skies-0.022
Token again
Token ID757
Feature activation+6.70e-01

Pos logit contributions
lor+0.060
+0.058
ival+0.057
 skies+0.054
ane+0.053
Neg logit contributions
 type-0.051
 grun-0.048
 sorts-0.047
 Liz-0.046
 shape-0.045
Token tomorrow
Token ID9439
Feature activation-2.15e+00

Pos logit contributions
 type+0.013
 grun+0.013
 Liz+0.012
 sorts+0.012
 shape+0.012
Neg logit contributions
lor-0.015
-0.014
 skies-0.013
ival-0.013
hel-0.013
Token.
Token ID13
Feature activation+5.65e-02

Pos logit contributions
lor+0.063
+0.060
ival+0.059
ane+0.057
actus+0.056
Neg logit contributions
 type-0.031
 shape-0.028
 their-0.028
 its-0.028
 sorts-0.027
Token
Token ID198
Feature activation-1.42e+00

Pos logit contributions
 type+0.001
 Liz+0.001
 sorts+0.001
 its+0.001
 their+0.001
Neg logit contributions
lor-0.001
-0.001
ival-0.001
ane-0.001
actus-0.001
Token
Token ID198
Feature activation-1.29e+00

Pos logit contributions
lor+0.033
+0.033
ane+0.032
 skies+0.031
ival+0.031
Neg logit contributions
 type-0.028
 its-0.026
 their-0.026
 sorts-0.026
 Liz-0.026
TokenTHE
Token ID10970
Feature activation-8.08e+00

Pos logit contributions
lor+0.028
+0.028
ane+0.027
actus+0.026
 skies+0.026
Neg logit contributions
 type-0.027
 sorts-0.025
 shape-0.025
 their-0.025
 its-0.025
Token<|endoftext|>
Token ID50256
Feature activation-7.02e+00

Pos logit contributions
lor+0.180
ival+0.168
actus+0.167
ane+0.164
+0.164
Neg logit contributions
 type-0.168
 its-0.160
 their-0.159
 shape-0.159
 sorts-0.158
TokenOnce
Token ID7454
Feature activation-2.66e+00

Pos logit contributions
lor+0.156
ival+0.147
+0.145
ane+0.142
ican+0.142
Neg logit contributions
 type-0.137
 its-0.136
 sorts-0.135
 shape-0.133
 their-0.133
Token upon
Token ID2402
Feature activation-1.14e+00

Pos logit contributions
lor+0.068
+0.064
ival+0.061
 skies+0.060
ican+0.059
Neg logit contributions
 type-0.048
 their-0.047
 its-0.047
 Liz-0.046
 sorts-0.045
Token a
Token ID257
Feature activation+9.69e-01

Pos logit contributions
lor+0.042
ane+0.042
+0.042
ari+0.041
ican+0.041
Neg logit contributions
 its-0.010
 his-0.010
 their-0.009
 type-0.008
 shape-0.008
Token time
Token ID640
Feature activation-2.90e+00

Pos logit contributions
 type+0.021
 shape+0.019
 their+0.019
 sorts+0.019
 its+0.019
Neg logit contributions
lor-0.022
-0.020
ival-0.020
ane-0.020
 skies-0.019
Token
Token ID198
Feature activation-1.28e+00

Pos logit contributions
 type+0.019
 shape+0.017
 sorts+0.017
 grun+0.017
 their+0.016
Neg logit contributions
lor-0.028
-0.027
ival-0.026
ane-0.026
 skies-0.026
Token
Token ID198
Feature activation-1.24e+00

Pos logit contributions
lor+0.029
+0.028
ival+0.027
ane+0.027
 skies+0.026
Neg logit contributions
 type-0.026
 its-0.025
 sorts-0.025
 their-0.025
 shape-0.024
TokenThe
Token ID464
Feature activation-9.24e-01

Pos logit contributions
lor+0.026
+0.024
ival+0.024
ane+0.023
 skies+0.022
Neg logit contributions
 type-0.028
 shape-0.026
 sorts-0.026
 Liz-0.026
 grun-0.026
Token End
Token ID5268
Feature activation-4.25e+00

Pos logit contributions
lor+0.025
ane+0.021
hel+0.021
pill+0.020
orns+0.019
Neg logit contributions
 type-0.020
 their-0.020
 shape-0.019
 part-0.018
 his-0.018
Token.
Token ID13
Feature activation+8.49e-01

Pos logit contributions
+0.095
lor+0.093
ival+0.086
hel+0.084
ane+0.082
Neg logit contributions
 type-0.086
 grun-0.082
 shape-0.079
 its-0.075
 Liz-0.075
Token<|endoftext|>
Token ID50256
Feature activation-8.06e+00

Pos logit contributions
 type+0.018
 its+0.017
 sorts+0.017
 their+0.017
 shape+0.017
Neg logit contributions
lor-0.019
-0.017
ival-0.017
ane-0.017
actus-0.017
TokenOnce
Token ID7454
Feature activation-3.09e+00

Pos logit contributions
lor+0.207
ival+0.196
ican+0.192
 salon+0.191
ane+0.188
Neg logit contributions
 sorts-0.133
 their-0.132
 its-0.131
 type-0.131
 shape-0.128
Token upon
Token ID2402
Feature activation-1.26e+00

Pos logit contributions
lor+0.074
+0.069
ival+0.067
 skies+0.064
ane+0.063
Neg logit contributions
 type-0.062
 their-0.058
 Liz-0.058
 sorts-0.057
 its-0.057
Token a
Token ID257
Feature activation+9.36e-01

Pos logit contributions
lor+0.045
ane+0.043
ari+0.043
+0.043
ican+0.042
Neg logit contributions
 their-0.013
 its-0.012
 type-0.012
 his-0.012
 shape-0.011
Token time
Token ID640
Feature activation-2.92e+00

Pos logit contributions
 type+0.020
 shape+0.018
 their+0.017
 sorts+0.017
 its+0.017
Neg logit contributions
lor-0.023
ane-0.021
ival-0.020
-0.020
 skies-0.020
Token there
Token ID612
Feature activation-4.44e-01

Pos logit contributions
lor+0.090
actus+0.082
ival+0.081
+0.081
 skies+0.079
Neg logit contributions
 type-0.041
 their-0.040
 Liz-0.039
 its-0.038
 M-0.036
Token
Token ID220
Feature activation+1.56e-01

Pos logit contributions
 type+0.030
 grun+0.027
cest+0.027
 sorts+0.026
 shape+0.026
Neg logit contributions
lor-0.029
 skies-0.028
-0.027
ival-0.026
ane-0.025
Token
Token ID198
Feature activation-1.28e+00

Pos logit contributions
 type+0.003
 Liz+0.003
 its+0.003
 their+0.003
 shape+0.003
Neg logit contributions
lor-0.004
-0.003
ival-0.003
ane-0.003
actus-0.003
TokenThe
Token ID464
Feature activation-1.13e+00

Pos logit contributions
lor+0.028
+0.026
ival+0.026
ane+0.025
 skies+0.024
Neg logit contributions
 type-0.028
 sorts-0.026
 shape-0.026
 grun-0.025
 Liz-0.025
Token end
Token ID886
Feature activation+3.01e-01

Pos logit contributions
lor+0.033
ival+0.030
ane+0.030
hel+0.029
+0.029
Neg logit contributions
 type-0.022
 shape-0.021
 their-0.019
 Liz-0.017
 part-0.017
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
 type+0.008
 grun+0.007
ooks+0.007
 sorts+0.007
ook+0.007
Neg logit contributions
lor-0.004
-0.004
hel-0.004
ari-0.004
ane-0.004
Token<|endoftext|>
Token ID50256
Feature activation-8.05e+00

Pos logit contributions
 type+0.029
 grun+0.027
 shape+0.026
 Liz+0.026
 sorts+0.025
Neg logit contributions
lor-0.022
-0.020
 skies-0.019
ane-0.019
ival-0.019
TokenOnce
Token ID7454
Feature activation-3.31e+00

Pos logit contributions
lor+0.202
ival+0.192
+0.188
ane+0.186
ican+0.185
Neg logit contributions
 type-0.142
 its-0.139
 their-0.139
 sorts-0.137
 shape-0.135
Token there
Token ID612
Feature activation-5.74e-01

Pos logit contributions
lor+0.083
+0.078
ival+0.076
 skies+0.072
ican+0.072
Neg logit contributions
 type-0.063
 their-0.060
 Liz-0.060
 its-0.058
 sorts-0.058
Token was
Token ID373
Feature activation-5.01e-01

Pos logit contributions
+0.015
lor+0.014
ane+0.014
hel+0.014
ival+0.013
Neg logit contributions
 type-0.010
 sorts-0.009
 shape-0.009
 grun-0.009
 Liz-0.008
Token a
Token ID257
Feature activation+1.64e+00

Pos logit contributions
lor+0.018
ival+0.016
ari+0.016
+0.016
ane+0.016
Neg logit contributions
 their-0.006
 its-0.006
 type-0.005
 his-0.005
 shape-0.005
Token gentle
Token ID10296
Feature activation+2.70e-01

Pos logit contributions
 sorts+0.037
 type+0.035
 Liz+0.035
 its+0.035
 grun+0.034
Neg logit contributions
-0.031
actus-0.031
ival-0.029
lor-0.028
opus-0.028
Token
Token ID220
Feature activation+2.96e-01

Pos logit contributions
 type+0.026
 shape+0.023
 sorts+0.023
 grun+0.022
 its+0.022
Neg logit contributions
lor-0.037
-0.035
ival-0.034
ane-0.034
 skies-0.033
Token
Token ID198
Feature activation-1.09e+00

Pos logit contributions
 Liz+0.005
 type+0.005
 their+0.005
 its+0.005
 sorts+0.005
Neg logit contributions
lor-0.007
actus-0.007
-0.007
ival-0.006
 skies-0.006
Token
Token ID198
Feature activation-9.62e-01

Pos logit contributions
lor+0.025
+0.024
ane+0.023
 skies+0.023
ival+0.023
Neg logit contributions
 type-0.022
 its-0.021
 their-0.021
 sorts-0.021
 shape-0.020
TokenThe
Token ID464
Feature activation-7.98e-01

Pos logit contributions
lor+0.022
+0.021
ane+0.020
actus+0.020
 skies+0.020
Neg logit contributions
 type-0.020
 sorts-0.018
 their-0.018
 its-0.018
 shape-0.018
Token End
Token ID5268
Feature activation-4.12e+00

Pos logit contributions
lor+0.025
ane+0.019
+0.019
pill+0.019
hel+0.019
Neg logit contributions
 type-0.016
 shape-0.016
 their-0.014
 his-0.014
 part-0.013
Token<|endoftext|>
Token ID50256
Feature activation-8.05e+00

Pos logit contributions
lor+0.097
+0.094
ival+0.089
ane+0.085
hel+0.084
Neg logit contributions
 type-0.082
 shape-0.076
 grun-0.075
 Liz-0.072
 sorts-0.072
TokenOnce
Token ID7454
Feature activation-3.10e+00

Pos logit contributions
lor+0.198
ival+0.187
 salon+0.183
ican+0.182
ane+0.181
Neg logit contributions
 type-0.140
 sorts-0.139
 their-0.139
 its-0.137
 shape-0.135
Token upon
Token ID2402
Feature activation-1.20e+00

Pos logit contributions
lor+0.074
+0.069
ival+0.067
 skies+0.064
ane+0.063
Neg logit contributions
 type-0.063
 their-0.059
 Liz-0.059
 its-0.058
 sorts-0.058
Token a
Token ID257
Feature activation+8.11e-01

Pos logit contributions
lor+0.042
ane+0.040
+0.040
ican+0.039
ari+0.039
Neg logit contributions
 their-0.013
 its-0.013
 type-0.012
 his-0.012
 shape-0.011
Token time
Token ID640
Feature activation-3.02e+00

Pos logit contributions
 type+0.017
 shape+0.015
 their+0.015
 sorts+0.015
 its+0.015
Neg logit contributions
lor-0.019
ane-0.018
ival-0.018
-0.018
 skies-0.017
Token there
Token ID612
Feature activation-4.89e-01

Pos logit contributions
lor+0.090
actus+0.082
ival+0.081
+0.081
 skies+0.079
Neg logit contributions
 type-0.045
 their-0.043
 Liz-0.043
 its-0.041
 shape-0.039
Token
Token ID198
Feature activation-7.41e-01

Pos logit contributions
 type+0.056
 sorts+0.052
 shape+0.050
 grun+0.049
cest+0.048
Neg logit contributions
lor-0.051
-0.050
ival-0.045
 skies-0.045
ane-0.045
Token
Token ID198
Feature activation-6.50e-01

Pos logit contributions
lor+0.014
+0.013
ival+0.013
ane+0.012
actus+0.012
Neg logit contributions
 type-0.018
 shape-0.017
 sorts-0.017
 grun-0.017
 Liz-0.016
TokenThe
Token ID464
Feature activation-1.08e+00

Pos logit contributions
lor+0.013
+0.012
ival+0.012
actus+0.011
ane+0.011
Neg logit contributions
 type-0.015
 sorts-0.015
 Liz-0.014
 shape-0.014
 its-0.014
Token end
Token ID886
Feature activation+1.37e-01

Pos logit contributions
lor+0.034
ival+0.032
+0.032
hel+0.031
ari+0.031
Neg logit contributions
 shape-0.016
 type-0.015
 their-0.014
 its-0.013
 grun-0.013
Token.
Token ID13
Feature activation+1.84e+00

Pos logit contributions
 type+0.003
 sorts+0.003
 grun+0.003
cest+0.003
ooks+0.003
Neg logit contributions
ane-0.002
lor-0.002
ari-0.002
 skies-0.002
-0.002
Token<|endoftext|>
Token ID50256
Feature activation-8.04e+00

Pos logit contributions
 type+0.039
 sorts+0.036
 shape+0.035
 grun+0.034
 Liz+0.034
Neg logit contributions
lor-0.042
-0.041
ival-0.038
ane-0.038
 skies-0.037
TokenL
Token ID43
Feature activation+1.54e+00

Pos logit contributions
lor+0.198
ival+0.190
 salon+0.185
ican+0.179
ari+0.176
Neg logit contributions
 their-0.142
 its-0.142
 shape-0.137
 M-0.137
 type-0.136
Tokenily
Token ID813
Feature activation+9.81e-01

Pos logit contributions
 their+0.026
 its+0.025
 his+0.025
 type+0.023
 Liz+0.023
Neg logit contributions
-0.045
lor-0.041
ival-0.039
ican-0.039
 skies-0.038
Token and
Token ID290
Feature activation-5.14e+00

Pos logit contributions
 type+0.018
 shape+0.017
 sorts+0.017
 their+0.017
 its+0.017
Neg logit contributions
lor-0.025
-0.024
ival-0.023
ane-0.022
 skies-0.021
Token Tom
Token ID4186
Feature activation-9.68e-01

Pos logit contributions
lor+0.158
+0.150
ival+0.148
ane+0.143
actus+0.142
Neg logit contributions
 type-0.070
 their-0.064
 its-0.063
 shape-0.062
 sorts-0.060
Token are
Token ID389
Feature activation-8.17e-01

Pos logit contributions
lor+0.024
+0.023
ival+0.022
ane+0.022
 skies+0.021
Neg logit contributions
 type-0.018
 shape-0.017
 sorts-0.016
 its-0.016
 Liz-0.016
Token
Token ID220
Feature activation+3.43e-01

Pos logit contributions
lor+0.033
+0.032
ane+0.031
ival+0.031
 skies+0.030
Neg logit contributions
 type-0.025
 its-0.025
 their-0.025
 sorts-0.024
 shape-0.023
Token
Token ID198
Feature activation-1.31e+00

Pos logit contributions
 Liz+0.006
 type+0.006
 its+0.006
 their+0.006
 sorts+0.006
Neg logit contributions
lor-0.008
ival-0.008
ane-0.008
actus-0.008
-0.007
TokenThe
Token ID464
Feature activation-8.86e-01

Pos logit contributions
lor+0.029
ane+0.028
+0.028
ival+0.027
actus+0.026
Neg logit contributions
 type-0.028
 its-0.026
 shape-0.026
 sorts-0.025
 their-0.025
Token End
Token ID5268
Feature activation-4.23e+00

Pos logit contributions
lor+0.022
ival+0.019
pill+0.019
ane+0.019
ari+0.018
Neg logit contributions
 type-0.020
 shape-0.020
 sunny-0.018
 their-0.018
 part-0.018
Token.
Token ID13
Feature activation+6.61e-01

Pos logit contributions
lor+0.098
+0.097
ival+0.089
hel+0.087
ane+0.086
Neg logit contributions
 type-0.084
 grun-0.078
 shape-0.076
 Liz-0.074
 sorts-0.073
Token<|endoftext|>
Token ID50256
Feature activation-8.04e+00

Pos logit contributions
 type+0.013
 its+0.012
 sorts+0.012
 shape+0.012
 their+0.012
Neg logit contributions
lor-0.015
-0.014
ival-0.014
ane-0.014
actus-0.014
TokenOnce
Token ID7454
Feature activation-3.20e+00

Pos logit contributions
lor+0.208
ival+0.198
ane+0.193
 salon+0.192
ican+0.191
Neg logit contributions
 its-0.132
 their-0.132
 type-0.132
 sorts-0.130
 shape-0.127
Token upon
Token ID2402
Feature activation-1.22e+00

Pos logit contributions
lor+0.079
+0.074
ival+0.072
 skies+0.069
ane+0.067
Neg logit contributions
 type-0.062
 their-0.059
 Liz-0.059
 its-0.058
 sorts-0.057
Token a
Token ID257
Feature activation+7.98e-01

Pos logit contributions
lor+0.043
ane+0.041
+0.040
ival+0.040
ari+0.040
Neg logit contributions
 its-0.013
 their-0.013
 type-0.012
 shape-0.012
 his-0.011
Token time
Token ID640
Feature activation-3.11e+00

Pos logit contributions
 type+0.016
 shape+0.014
 their+0.014
 sorts+0.014
 its+0.013
Neg logit contributions
lor-0.020
ane-0.019
ival-0.019
 skies-0.018
-0.018
Token,
Token ID11
Feature activation-7.31e-01

Pos logit contributions
lor+0.098
actus+0.089
ival+0.089
+0.088
 skies+0.087
Neg logit contributions
 type-0.042
 their-0.041
 Liz-0.040
 its-0.039
 M-0.036
Token his
Token ID465
Feature activation-2.14e+00

Pos logit contributions
 type+0.007
 shape+0.006
 its+0.006
 their+0.006
 sorts+0.006
Neg logit contributions
lor-0.015
-0.015
ival-0.014
ane-0.014
actus-0.014
Token friends
Token ID2460
Feature activation-1.26e+00

Pos logit contributions
lor+0.066
hel+0.061
ival+0.060
+0.058
ane+0.057
Neg logit contributions
 type-0.036
 shape-0.034
 their-0.032
 its-0.029
 M-0.026
Token to
Token ID284
Feature activation-1.12e+00

Pos logit contributions
lor+0.038
+0.036
ival+0.035
actus+0.034
ane+0.034
Neg logit contributions
 type-0.018
 their-0.016
 shape-0.016
 its-0.016
 sorts-0.015
Token come
Token ID1282
Feature activation-1.56e+00

Pos logit contributions
lor+0.034
ival+0.032
ane+0.032
actus+0.031
+0.031
Neg logit contributions
 type-0.018
 shape-0.017
 its-0.016
 their-0.016
 make-0.013
Token.
Token ID13
Feature activation+3.81e-01

Pos logit contributions
lor+0.040
+0.038
ival+0.037
ane+0.036
actus+0.035
Neg logit contributions
 type-0.029
 shape-0.027
 their-0.027
 its-0.027
 Liz-0.026
Token<|endoftext|>
Token ID50256
Feature activation-8.04e+00

Pos logit contributions
 type+0.006
 their+0.006
 its+0.006
 shape+0.005
 sorts+0.005
Neg logit contributions
lor-0.011
-0.010
actus-0.010
ane-0.010
ival-0.010
TokenOnce
Token ID7454
Feature activation-3.43e+00

Pos logit contributions
lor+0.223
ican+0.213
+0.212
ival+0.211
 salon+0.211
Neg logit contributions
 its-0.123
 their-0.122
 type-0.114
 his-0.110
 sorts-0.109
Token upon
Token ID2402
Feature activation-1.34e+00

Pos logit contributions
lor+0.088
+0.084
ival+0.080
 skies+0.078
ican+0.077
Neg logit contributions
 type-0.062
 their-0.061
 its-0.060
 Liz-0.059
 sorts-0.056
Token a
Token ID257
Feature activation+6.43e-01

Pos logit contributions
lor+0.047
+0.046
ane+0.045
actus+0.044
ican+0.044
Neg logit contributions
 its-0.014
 their-0.014
 type-0.014
 his-0.013
 shape-0.012
Token time
Token ID640
Feature activation-3.30e+00

Pos logit contributions
 type+0.013
 shape+0.012
 their+0.012
 sorts+0.011
 its+0.011
Neg logit contributions
lor-0.016
ival-0.015
-0.015
ane-0.015
 skies-0.014
Token,
Token ID11
Feature activation-8.46e-01

Pos logit contributions
lor+0.101
actus+0.092
ival+0.092
+0.091
 skies+0.089
Neg logit contributions
 type-0.047
 their-0.044
 Liz-0.044
 its-0.042
 shape-0.041
Token
Token ID198
Feature activation-1.23e+00

Pos logit contributions
 type+0.007
 Liz+0.007
 their+0.007
 its+0.006
 shape+0.006
Neg logit contributions
lor-0.008
-0.007
ival-0.007
actus-0.007
 skies-0.007
Token
Token ID198
Feature activation-1.11e+00

Pos logit contributions
lor+0.028
+0.027
ane+0.026
ival+0.026
 skies+0.025
Neg logit contributions
 type-0.025
 sorts-0.023
 its-0.023
 their-0.023
 shape-0.023
TokenThe
Token ID464
Feature activation-7.21e-01

Pos logit contributions
lor+0.025
+0.024
ane+0.023
ival+0.022
actus+0.022
Neg logit contributions
 type-0.024
 sorts-0.022
 shape-0.022
 their-0.021
 its-0.021
Token end
Token ID886
Feature activation+6.90e-01

Pos logit contributions
lor+0.021
orns+0.018
+0.018
pill+0.018
ican+0.017
Neg logit contributions
 type-0.015
 shape-0.014
 their-0.013
 his-0.013
 part-0.012
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
 type+0.017
 grun+0.016
 prod+0.016
apy+0.016
ooks+0.015
Neg logit contributions
ari-0.011
hel-0.011
lor-0.011
-0.011
ane-0.011
Token<|endoftext|>
Token ID50256
Feature activation-8.03e+00

Pos logit contributions
 type+0.022
 its+0.021
 their+0.021
 Liz+0.021
 sorts+0.021
Neg logit contributions
lor-0.028
ival-0.026
actus-0.026
-0.025
ane-0.025
TokenOnce
Token ID7454
Feature activation-3.14e+00

Pos logit contributions
lor+0.219
ican+0.209
ival+0.208
 salon+0.205
ane+0.203
Neg logit contributions
 their-0.122
 its-0.120
 sorts-0.116
 type-0.116
 his-0.112
Token upon
Token ID2402
Feature activation-1.22e+00

Pos logit contributions
lor+0.077
+0.072
ival+0.070
 skies+0.067
ane+0.065
Neg logit contributions
 type-0.062
 their-0.059
 Liz-0.058
 its-0.057
 sorts-0.057
Token a
Token ID257
Feature activation+9.42e-01

Pos logit contributions
lor+0.043
ane+0.041
+0.041
ican+0.041
ari+0.041
Neg logit contributions
 their-0.013
 its-0.012
 his-0.012
 type-0.012
 shape-0.011
Token time
Token ID640
Feature activation-2.96e+00

Pos logit contributions
 type+0.019
 their+0.017
 shape+0.016
 sorts+0.016
 its+0.016
Neg logit contributions
lor-0.024
ane-0.022
ival-0.022
 skies-0.022
-0.022
Token,
Token ID11
Feature activation-4.42e-01

Pos logit contributions
lor+0.097
actus+0.089
ival+0.088
 skies+0.086
+0.086
Neg logit contributions
 their-0.037
 type-0.036
 Liz-0.036
 its-0.035
 M-0.032
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
 type+0.007
 Liz+0.007
 its+0.007
 their+0.007
 shape+0.007
Neg logit contributions
lor-0.009
-0.008
ival-0.008
actus-0.008
 skies-0.008
Token
Token ID198
Feature activation-9.46e-01

Pos logit contributions
lor+0.024
+0.023
ane+0.023
ival+0.022
 skies+0.022
Neg logit contributions
 type-0.021
 its-0.020
 their-0.019
 shape-0.019
 Liz-0.019
TokenThe
Token ID464
Feature activation-8.80e-01

Pos logit contributions
lor+0.022
+0.022
ane+0.021
actus+0.020
 skies+0.020
Neg logit contributions
 type-0.019
 shape-0.017
 its-0.017
 sorts-0.017
 their-0.017
Token end
Token ID886
Feature activation+4.94e-01

Pos logit contributions
lor+0.030
ives+0.027
pill+0.025
hel+0.025
ival+0.024
Neg logit contributions
 type-0.014
 shape-0.014
 part-0.013
 their-0.012
 grun-0.011
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
 type+0.013
apy+0.012
ooks+0.012
 prod+0.012
 grun+0.012
Neg logit contributions
lor-0.006
hel-0.006
-0.006
ari-0.006
THE-0.006
Token<|endoftext|>
Token ID50256
Feature activation-8.03e+00

Pos logit contributions
 type+0.027
 its+0.025
 sorts+0.025
 their+0.025
 shape+0.025
Neg logit contributions
lor-0.028
-0.026
ival-0.026
ane-0.025
actus-0.025
TokenOne
Token ID3198
Feature activation+9.18e-01

Pos logit contributions
lor+0.211
ival+0.201
ican+0.198
 salon+0.195
ane+0.194
Neg logit contributions
 their-0.128
 its-0.127
 type-0.127
 sorts-0.124
 shape-0.122
Token day
Token ID1110
Feature activation-4.64e+00

Pos logit contributions
 type+0.021
 shape+0.019
 sorts+0.019
 grun+0.019
 its+0.019
Neg logit contributions
lor-0.019
-0.018
ival-0.017
ane-0.017
 skies-0.017
Token,
Token ID11
Feature activation-1.05e+00

Pos logit contributions
lor+0.099
+0.098
ival+0.092
ane+0.091
 skies+0.088
Neg logit contributions
 type-0.100
 sorts-0.096
 shape-0.092
 grun-0.091
 Liz-0.090
Token Sarah
Token ID10490
Feature activation-1.32e-02

Pos logit contributions
+0.024
lor+0.022
ival+0.022
There+0.022
hel+0.021
Neg logit contributions
 type-0.020
 sorts-0.019
 shape-0.018
 grun-0.018
 Liz-0.018
Token wanted
Token ID2227
Feature activation-4.74e+00

Pos logit contributions
lor+0.000
+0.000
ival+0.000
ane+0.000
 skies+0.000
Neg logit contributions
 type-0.000
 shape-0.000
 sorts-0.000
 its-0.000
 their-0.000
Token
Token ID198
Feature activation-1.23e+00

Pos logit contributions
 type+0.016
 sorts+0.015
 Liz+0.015
 its+0.015
 their+0.015
Neg logit contributions
lor-0.030
ival-0.027
-0.027
ane-0.027
actus-0.027
Token
Token ID198
Feature activation-1.14e+00

Pos logit contributions
lor+0.028
+0.026
ival+0.025
ane+0.025
 skies+0.025
Neg logit contributions
 type-0.026
 its-0.024
 sorts-0.024
 their-0.024
 shape-0.024
TokenThe
Token ID464
Feature activation-7.82e-01

Pos logit contributions
lor+0.027
+0.025
ane+0.025
actus+0.024
ival+0.024
Neg logit contributions
 type-0.023
 sorts-0.022
 shape-0.021
 its-0.021
 their-0.021
Token end
Token ID886
Feature activation+6.15e-01

Pos logit contributions
lor+0.028
anes+0.025
ane+0.025
+0.024
ival+0.024
Neg logit contributions
 type-0.011
 shape-0.009
 part-0.008
 their-0.007
 M-0.007
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
 type+0.016
 grun+0.016
ooks+0.015
apy+0.015
 prod+0.015
Neg logit contributions
lor-0.008
-0.008
ari-0.007
THE-0.007
ane-0.007
Token<|endoftext|>
Token ID50256
Feature activation-8.03e+00

Pos logit contributions
 type+0.022
 its+0.021
 their+0.021
 sorts+0.021
 Liz+0.021
Neg logit contributions
lor-0.027
ival-0.026
-0.025
actus-0.025
ane-0.025
TokenOnce
Token ID7454
Feature activation-3.15e+00

Pos logit contributions
lor+0.209
ival+0.198
ican+0.194
 salon+0.193
+0.189
Neg logit contributions
 their-0.130
 its-0.130
 type-0.128
 sorts-0.128
 shape-0.125
Token,
Token ID11
Feature activation-1.45e+00

Pos logit contributions
lor+0.078
+0.072
ival+0.070
 skies+0.067
actus+0.066
Neg logit contributions
 type-0.061
 their-0.059
 Liz-0.058
 its-0.057
 sorts-0.057
Token there
Token ID612
Feature activation-4.53e-01

Pos logit contributions
+0.022
lor+0.022
ane+0.021
ival+0.020
hel+0.019
Neg logit contributions
 type-0.039
 sorts-0.037
 grun-0.036
 shape-0.036
 Liz-0.035
Token was
Token ID373
Feature activation-2.74e-01

Pos logit contributions
+0.011
lor+0.011
ane+0.010
ival+0.010
hel+0.010
Neg logit contributions
 type-0.008
 Liz-0.007
 sorts-0.007
 grun-0.007
 shape-0.007
Token a
Token ID257
Feature activation+1.96e+00

Pos logit contributions
lor+0.009
ival+0.008
+0.008
ane+0.008
actus+0.008
Neg logit contributions
 type-0.003
 their-0.003
 its-0.003
 shape-0.003
 sorts-0.003
Token think
Token ID892
Feature activation-2.08e+00

Pos logit contributions
lor+0.002
+0.002
hel+0.001
ival+0.001
ane+0.001
Neg logit contributions
 type-0.001
 Liz-0.001
 sorts-0.001
 its-0.001
apy-0.001
Token they
Token ID484
Feature activation+3.94e-01

Pos logit contributions
lor+0.051
+0.048
ival+0.047
ane+0.046
 skies+0.045
Neg logit contributions
 type-0.040
 shape-0.037
 sorts-0.037
 its-0.036
 their-0.036
Token should
Token ID815
Feature activation-3.05e+00

Pos logit contributions
 type+0.007
 shape+0.006
 sorts+0.006
 their+0.006
 its+0.006
Neg logit contributions
lor-0.011
-0.010
ival-0.010
ane-0.010
 skies-0.010
Token do
Token ID466
Feature activation-2.84e+00

Pos logit contributions
lor+0.090
+0.084
ival+0.084
ane+0.082
actus+0.081
Neg logit contributions
 type-0.048
 shape-0.043
 their-0.042
 its-0.041
 sorts-0.039
Token?
Token ID30
Feature activation+1.50e+00

Pos logit contributions
lor+0.066
+0.063
ival+0.061
ane+0.060
 skies+0.059
Neg logit contributions
 type-0.056
 sorts-0.052
 shape-0.052
 grun-0.052
 Liz-0.051
Token<|endoftext|>
Token ID50256
Feature activation-8.02e+00

Pos logit contributions
 type+0.027
 shape+0.025
 sorts+0.025
 its+0.024
 grun+0.024
Neg logit contributions
lor-0.038
-0.036
ival-0.035
ane-0.035
 skies-0.034
TokenOnce
Token ID7454
Feature activation-3.44e+00

Pos logit contributions
lor+0.213
ival+0.204
 salon+0.201
cases+0.193
ari+0.191
Neg logit contributions
 M-0.125
 their-0.123
 its-0.123
 his-0.121
 shape-0.118
Token upon
Token ID2402
Feature activation-9.69e-01

Pos logit contributions
lor+0.077
+0.072
ival+0.070
ane+0.068
 skies+0.067
Neg logit contributions
 type-0.073
 shape-0.068
 sorts-0.067
 their-0.067
 its-0.067
Token a
Token ID257
Feature activation+1.04e+00

Pos logit contributions
lor+0.032
+0.030
ival+0.030
ane+0.030
ari+0.030
Neg logit contributions
 type-0.012
 their-0.011
 its-0.011
 shape-0.011
 his-0.010
Token time
Token ID640
Feature activation-3.03e+00

Pos logit contributions
 type+0.023
 shape+0.022
 sorts+0.021
 their+0.021
 its+0.021
Neg logit contributions
lor-0.023
-0.021
ival-0.021
ane-0.020
 skies-0.020
Token,
Token ID11
Feature activation-5.28e-01

Pos logit contributions
lor+0.093
ival+0.085
actus+0.084
+0.082
 skies+0.082
Neg logit contributions
 type-0.043
 their-0.042
 Liz-0.041
 its-0.040
 shape-0.038
Token
Token ID198
Feature activation-1.34e+00

Pos logit contributions
 type+0.020
 Liz+0.020
 its+0.020
 sorts+0.020
 their+0.019
Neg logit contributions
lor-0.028
ival-0.026
-0.026
actus-0.026
ane-0.026
Token
Token ID198
Feature activation-1.21e+00

Pos logit contributions
lor+0.033
+0.031
ane+0.030
ival+0.030
 skies+0.030
Neg logit contributions
 type-0.025
 its-0.024
 their-0.024
 Liz-0.024
 sorts-0.023
TokenThe
Token ID464
Feature activation-8.77e-01

Pos logit contributions
lor+0.031
+0.031
ane+0.031
actus+0.029
 skies+0.029
Neg logit contributions
 type-0.021
 its-0.019
 shape-0.019
 their-0.018
 sorts-0.018
Token end
Token ID886
Feature activation+5.90e-01

Pos logit contributions
lor+0.026
+0.023
ane+0.023
ival+0.023
anes+0.022
Neg logit contributions
 type-0.016
 shape-0.016
 their-0.013
 grun-0.013
 sorts-0.013
Token.
Token ID13
Feature activation+1.14e+00

Pos logit contributions
 type+0.015
ooks+0.014
 grun+0.014
apy+0.014
 sorts+0.014
Neg logit contributions
lor-0.008
-0.008
hel-0.008
ari-0.008
ane-0.007
Token<|endoftext|>
Token ID50256
Feature activation-8.02e+00

Pos logit contributions
 type+0.023
 its+0.022
 sorts+0.022
 their+0.022
 Liz+0.022
Neg logit contributions
lor-0.026
ival-0.025
-0.025
ane-0.024
actus-0.024
TokenOnce
Token ID7454
Feature activation-3.15e+00

Pos logit contributions
lor+0.211
ival+0.202
 salon+0.196
ican+0.196
ane+0.193
Neg logit contributions
 their-0.129
 its-0.128
 type-0.126
 sorts-0.125
 shape-0.123
Token upon
Token ID2402
Feature activation-1.08e+00

Pos logit contributions
lor+0.078
+0.073
ival+0.071
 skies+0.068
ican+0.067
Neg logit contributions
 type-0.061
 their-0.058
 Liz-0.058
 its-0.057
 sorts-0.056
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
lor+0.038
ane+0.036
+0.036
ari+0.036
ival+0.035
Neg logit contributions
 their-0.012
 its-0.011
 type-0.011
 his-0.010
 shape-0.010
Token time
Token ID640
Feature activation-2.82e+00

Pos logit contributions
 type+0.022
 shape+0.020
 their+0.020
 sorts+0.020
 its+0.020
Neg logit contributions
lor-0.025
ane-0.023
ival-0.023
-0.023
 skies-0.022
Token there
Token ID612
Feature activation-3.62e-01

Pos logit contributions
lor+0.088
actus+0.080
ival+0.080
+0.079
 skies+0.078
Neg logit contributions
 type-0.038
 their-0.038
 Liz-0.037
 its-0.036
 M-0.034
Token
Token ID198
Feature activation-9.07e-01

Pos logit contributions
 Liz+0.012
 its+0.011
 type+0.011
 their+0.011
 M+0.010
Neg logit contributions
lor-0.016
actus-0.015
ican-0.015
ival-0.015
 skies-0.015
Token
Token ID198
Feature activation-7.76e-01

Pos logit contributions
lor+0.023
+0.022
ane+0.021
 skies+0.021
ican+0.021
Neg logit contributions
 type-0.016
 its-0.016
 sorts-0.015
 their-0.015
 Liz-0.015
TokenThe
Token ID464
Feature activation-3.39e-01

Pos logit contributions
lor+0.019
+0.019
ane+0.019
actus+0.019
 skies+0.018
Neg logit contributions
 type-0.014
 sorts-0.013
 its-0.013
 their-0.012
 shape-0.012
Token End
Token ID5268
Feature activation-3.77e+00

Pos logit contributions
lor+0.011
ane+0.010
anes+0.010
icans+0.010
ican+0.010
Neg logit contributions
 type-0.005
 shape-0.005
 neat-0.004
 part-0.004
 stuffed-0.004
Token.
Token ID13
Feature activation+1.35e+00

Pos logit contributions
lor+0.091
+0.088
ival+0.082
hel+0.080
ane+0.079
Neg logit contributions
 type-0.073
 shape-0.067
 grun-0.067
 Liz-0.064
 its-0.063
Token<|endoftext|>
Token ID50256
Feature activation-8.02e+00

Pos logit contributions
 its+0.023
 sorts+0.022
 M+0.022
 type+0.022
 their+0.022
Neg logit contributions
ival-0.033
lor-0.033
actus-0.033
ican-0.032
-0.031
TokenJohn
Token ID7554
Feature activation+6.81e-02

Pos logit contributions
lor+0.204
ival+0.193
 salon+0.191
ican+0.189
ane+0.185
Neg logit contributions
 sorts-0.133
 their-0.131
 its-0.131
 type-0.130
 M-0.127
Token was
Token ID373
Feature activation+1.08e-01

Pos logit contributions
 type+0.001
 shape+0.001
 sorts+0.001
 its+0.001
 their+0.001
Neg logit contributions
lor-0.002
-0.002
ival-0.002
ane-0.002
 skies-0.002
Token weak
Token ID4939
Feature activation-2.73e-01

Pos logit contributions
 type+0.002
 shape+0.002
 their+0.002
 its+0.002
 sorts+0.002
Neg logit contributions
lor-0.003
-0.003
ival-0.003
ane-0.002
actus-0.002
Token.
Token ID13
Feature activation-4.94e-01

Pos logit contributions
lor+0.007
+0.006
ival+0.006
ane+0.006
actus+0.006
Neg logit contributions
 type-0.005
 shape-0.005
 sorts-0.005
 its-0.005
 their-0.005
Token He
Token ID679
Feature activation-8.89e-02

Pos logit contributions
lor+0.013
+0.013
ane+0.012
 skies+0.012
ival+0.012
Neg logit contributions
 type-0.008
 grun-0.007
 shape-0.007
 their-0.007
 sorts-0.007
Token happy
Token ID3772
Feature activation-1.62e+00

Pos logit contributions
 type+0.041
 sorts+0.037
 Liz+0.036
 shape+0.036
 its+0.036
Neg logit contributions
lor-0.067
-0.065
ival-0.063
ane-0.061
actus-0.061
Token,
Token ID11
Feature activation+1.54e+00

Pos logit contributions
lor+0.036
+0.034
ival+0.033
ane+0.032
 skies+0.031
Neg logit contributions
 type-0.035
 shape-0.033
 sorts-0.032
 its-0.032
 their-0.032
Token clean
Token ID3424
Feature activation+8.67e-01

Pos logit contributions
 type+0.023
 shape+0.021
 its+0.021
 sorts+0.021
 their+0.020
Neg logit contributions
lor-0.045
-0.043
ival-0.042
ane-0.041
actus-0.040
Token dog
Token ID3290
Feature activation-1.22e+00

Pos logit contributions
 type+0.018
 shape+0.017
 sorts+0.017
 their+0.017
 its+0.016
Neg logit contributions
lor-0.020
-0.019
ival-0.019
ane-0.018
hel-0.018
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
lor+0.033
+0.031
ival+0.030
ane+0.030
 skies+0.029
Neg logit contributions
 type-0.021
 shape-0.018
 sorts-0.018
 grun-0.018
 Liz-0.018
Token<|endoftext|>
Token ID50256
Feature activation-8.02e+00

Pos logit contributions
 type+0.019
 shape+0.017
 sorts+0.017
 grun+0.016
 their+0.016
Neg logit contributions
lor-0.030
-0.029
ival-0.028
ane-0.028
 skies-0.027
TokenOnce
Token ID7454
Feature activation-3.39e+00

Pos logit contributions
lor+0.203
ival+0.195
ican+0.189
+0.189
 salon+0.187
Neg logit contributions
 its-0.133
 type-0.133
 their-0.132
 sorts-0.128
 shape-0.127
Token there
Token ID612
Feature activation-5.24e-01

Pos logit contributions
lor+0.084
+0.078
ival+0.076
 skies+0.073
ican+0.071
Neg logit contributions
 type-0.067
 their-0.063
 Liz-0.063
 its-0.062
 sorts-0.061
Token was
Token ID373
Feature activation-4.71e-01

Pos logit contributions
+0.012
lor+0.012
hel+0.012
ane+0.012
ival+0.012
Neg logit contributions
 type-0.009
 sorts-0.009
 shape-0.009
uzzle-0.008
 grun-0.008
Token a
Token ID257
Feature activation+1.62e+00

Pos logit contributions
lor+0.017
ival+0.015
+0.015
ari+0.015
ane+0.015
Neg logit contributions
 their-0.006
 its-0.006
 type-0.005
 sorts-0.005
 shape-0.005
Token label
Token ID6167
Feature activation+1.47e+00

Pos logit contributions
 sorts+0.036
 type+0.035
 Liz+0.034
 its+0.034
 grun+0.033
Neg logit contributions
-0.031
actus-0.031
ival-0.029
lor-0.028
opus-0.027
TokenHe
Token ID1544
Feature activation+7.81e-01

Pos logit contributions
lor+0.022
+0.021
ane+0.020
ival+0.020
actus+0.020
Neg logit contributions
 type-0.019
 its-0.017
 shape-0.017
 their-0.017
 grun-0.017
Token started
Token ID2067
Feature activation+8.48e-01

Pos logit contributions
 type+0.014
 sorts+0.013
 shape+0.013
 Liz+0.013
 their+0.012
Neg logit contributions
lor-0.020
-0.018
ival-0.018
 skies-0.018
ane-0.018
Token to
Token ID284
Feature activation-4.20e-01

Pos logit contributions
 type+0.016
 sorts+0.015
 shape+0.015
 Liz+0.014
 grun+0.014
Neg logit contributions
lor-0.020
-0.020
ival-0.019
 skies-0.018
ane-0.018
Token cry
Token ID3960
Feature activation+3.46e-01

Pos logit contributions
lor+0.008
+0.008
hel+0.007
 skies+0.007
actus+0.007
Neg logit contributions
 type-0.009
 sorts-0.009
 Liz-0.008
ook-0.008
 shape-0.008
Token.
Token ID13
Feature activation+2.08e+00

Pos logit contributions
 type+0.008
 sorts+0.008
 grun+0.007
 Liz+0.007
 shape+0.007
Neg logit contributions
lor-0.006
-0.006
 skies-0.006
ane-0.006
hel-0.006
Token<|endoftext|>
Token ID50256
Feature activation-8.02e+00

Pos logit contributions
 type+0.034
 sorts+0.031
 shape+0.030
 their+0.029
 Liz+0.029
Neg logit contributions
lor-0.056
-0.055
ival-0.053
 skies-0.053
ane-0.052
TokenOnce
Token ID7454
Feature activation-3.46e+00

Pos logit contributions
lor+0.198
ival+0.189
 salon+0.185
ican+0.182
actus+0.177
Neg logit contributions
 their-0.139
 its-0.138
 M-0.136
 type-0.135
 sorts-0.134
Token upon
Token ID2402
Feature activation-1.10e+00

Pos logit contributions
lor+0.081
+0.076
ival+0.074
 skies+0.071
ane+0.070
Neg logit contributions
 type-0.071
 their-0.066
 Liz-0.066
 its-0.065
 sorts-0.065
Token a
Token ID257
Feature activation+9.10e-01

Pos logit contributions
lor+0.039
ane+0.037
+0.037
ari+0.037
ival+0.036
Neg logit contributions
 its-0.012
 their-0.012
 his-0.011
 type-0.011
 shape-0.011
Token time
Token ID640
Feature activation-3.23e+00

Pos logit contributions
 type+0.021
 shape+0.020
 sorts+0.019
 its+0.019
 their+0.019
Neg logit contributions
lor-0.019
-0.017
ival-0.017
ane-0.016
 skies-0.016
Token there
Token ID612
Feature activation-4.31e-01

Pos logit contributions
lor+0.096
actus+0.089
ival+0.089
+0.088
 skies+0.086
Neg logit contributions
 type-0.047
 their-0.046
 Liz-0.046
 its-0.044
 shape-0.041
Token
Token ID198
Feature activation-8.55e-01

Pos logit contributions
 type+0.029
 shape+0.026
 sorts+0.026
 its+0.026
 their+0.026
Neg logit contributions
lor-0.033
-0.031
ival-0.030
ane-0.030
 skies-0.029
Token
Token ID198
Feature activation-8.02e-01

Pos logit contributions
lor+0.020
+0.020
ane+0.019
 skies+0.018
ival+0.018
Neg logit contributions
 type-0.017
 its-0.016
 their-0.016
 sorts-0.015
 Liz-0.015
TokenThe
Token ID464
Feature activation-4.90e-01

Pos logit contributions
lor+0.018
+0.018
ane+0.017
actus+0.017
 skies+0.017
Neg logit contributions
 type-0.016
 its-0.015
 their-0.015
 sorts-0.015
 shape-0.015
Token End
Token ID5268
Feature activation-3.93e+00

Pos logit contributions
lor+0.014
+0.010
owers+0.010
hel+0.010
pill+0.010
Neg logit contributions
 type-0.011
 shape-0.011
 grun-0.010
 their-0.010
 his-0.009
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
lor+0.090
+0.090
ival+0.083
hel+0.081
ane+0.079
Neg logit contributions
 type-0.078
 grun-0.072
 shape-0.072
 sorts-0.069
 Liz-0.069
Token<|endoftext|>
Token ID50256
Feature activation-8.01e+00

Pos logit contributions
 type+0.022
 its+0.022
 their+0.022
 Liz+0.021
 sorts+0.021
Neg logit contributions
lor-0.028
actus-0.026
ival-0.026
-0.026
ane-0.025
TokenOnce
Token ID7454
Feature activation-3.16e+00

Pos logit contributions
lor+0.206
ival+0.195
ican+0.192
 salon+0.191
ane+0.188
Neg logit contributions
 type-0.130
 their-0.129
 sorts-0.129
 its-0.128
 shape-0.125
Token upon
Token ID2402
Feature activation-1.20e+00

Pos logit contributions
lor+0.080
+0.074
ival+0.072
 skies+0.069
ican+0.068
Neg logit contributions
 type-0.060
 their-0.058
 Liz-0.058
 its-0.056
 sorts-0.056
Token a
Token ID257
Feature activation+1.04e+00

Pos logit contributions
lor+0.043
ane+0.041
+0.041
ican+0.041
ari+0.041
Neg logit contributions
 their-0.012
 its-0.012
 type-0.011
 his-0.011
 shape-0.010
Token time
Token ID640
Feature activation-2.94e+00

Pos logit contributions
 type+0.022
 shape+0.019
 their+0.019
 sorts+0.019
 its+0.019
Neg logit contributions
lor-0.025
ane-0.023
ival-0.023
-0.023
 skies-0.022
Token there
Token ID612
Feature activation-4.00e-01

Pos logit contributions
lor+0.092
actus+0.085
ival+0.083
+0.083
 skies+0.082
Neg logit contributions
 type-0.039
 their-0.039
 Liz-0.039
 its-0.037
 M-0.035
Token
Token ID198
Feature activation-1.06e+00

Pos logit contributions
 type+0.011
 shape+0.010
 sorts+0.010
 Liz+0.010
 their+0.010
Neg logit contributions
lor-0.011
-0.010
ival-0.010
ane-0.009
 skies-0.009
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
lor+0.023
+0.021
ival+0.021
ane+0.021
 skies+0.020
Neg logit contributions
 type-0.023
 shape-0.022
 sorts-0.022
 its-0.022
 their-0.022
TokenThe
Token ID464
Feature activation-8.33e-01

Pos logit contributions
lor+0.022
+0.021
ival+0.020
ane+0.020
actus+0.019
Neg logit contributions
 type-0.022
 shape-0.020
 sorts-0.020
 their-0.020
 its-0.020
Token end
Token ID886
Feature activation+5.67e-01

Pos logit contributions
lor+0.026
ane+0.022
anes+0.021
ival+0.021
orns+0.020
Neg logit contributions
 type-0.017
 shape-0.015
 their-0.015
 part-0.013
 neat-0.013
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
 type+0.015
 grun+0.015
 prod+0.015
apy+0.014
ooks+0.014
Neg logit contributions
-0.007
hel-0.007
lor-0.007
THE-0.007
 next-0.007
Token<|endoftext|>
Token ID50256
Feature activation-8.00e+00

Pos logit contributions
 type+0.024
 its+0.023
 their+0.023
 sorts+0.023
 Liz+0.023
Neg logit contributions
lor-0.027
ival-0.025
-0.025
ane-0.024
actus-0.024
TokenOnce
Token ID7454
Feature activation-3.13e+00

Pos logit contributions
lor+0.210
ival+0.199
ican+0.198
ane+0.195
actus+0.193
Neg logit contributions
 their-0.129
 type-0.128
 its-0.127
 sorts-0.123
 shape-0.122
Token upon
Token ID2402
Feature activation-1.25e+00

Pos logit contributions
lor+0.073
+0.068
ival+0.066
 skies+0.063
ane+0.063
Neg logit contributions
 type-0.065
 their-0.060
 Liz-0.060
 shape-0.059
 its-0.059
Token a
Token ID257
Feature activation+7.98e-01

Pos logit contributions
lor+0.043
+0.041
ane+0.041
ican+0.041
ival+0.040
Neg logit contributions
 their-0.014
 its-0.013
 type-0.013
 his-0.012
 shape-0.012
Token time
Token ID640
Feature activation-3.11e+00

Pos logit contributions
 type+0.016
 their+0.014
 shape+0.014
 sorts+0.014
 its+0.014
Neg logit contributions
lor-0.020
ane-0.018
ival-0.018
-0.018
 skies-0.018
Token there
Token ID612
Feature activation-4.93e-01

Pos logit contributions
lor+0.099
actus+0.090
ival+0.090
 skies+0.088
+0.087
Neg logit contributions
 type-0.041
 their-0.040
 Liz-0.039
 its-0.038
 M-0.036
Token
Token ID198
Feature activation-7.87e-01

Pos logit contributions
 Liz+0.011
 type+0.010
 their+0.010
 its+0.010
 sorts+0.010
Neg logit contributions
lor-0.014
actus-0.013
-0.013
ival-0.013
 skies-0.013
Token
Token ID198
Feature activation-7.07e-01

Pos logit contributions
lor+0.018
+0.017
ane+0.016
ival+0.016
 skies+0.016
Neg logit contributions
 type-0.016
 its-0.015
 their-0.015
 Liz-0.015
 sorts-0.015
TokenThe
Token ID464
Feature activation-7.80e-01

Pos logit contributions
lor+0.016
+0.015
ival+0.014
ane+0.014
 skies+0.014
Neg logit contributions
 type-0.015
 shape-0.014
 sorts-0.014
 its-0.014
 their-0.014
Token end
Token ID886
Feature activation+5.37e-01

Pos logit contributions
lor+0.027
owers+0.022
ane+0.021
+0.021
pp+0.020
Neg logit contributions
 type-0.013
 shape-0.013
 his-0.012
 their-0.011
 neat-0.010
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
 type+0.013
 grun+0.012
ooks+0.012
apy+0.012
cest+0.012
Neg logit contributions
-0.008
ari-0.007
hel-0.007
lor-0.007
ival-0.007
Token<|endoftext|>
Token ID50256
Feature activation-8.00e+00

Pos logit contributions
 type+0.038
 shape+0.036
 grun+0.035
 sorts+0.034
 Liz+0.033
Neg logit contributions
lor-0.024
-0.024
ival-0.022
hel-0.021
ane-0.021
TokenOnce
Token ID7454
Feature activation-3.17e+00

Pos logit contributions
lor+0.210
ival+0.201
 salon+0.195
ican+0.195
+0.192
Neg logit contributions
 their-0.127
 its-0.127
 type-0.125
 sorts-0.125
 shape-0.121
Token upon
Token ID2402
Feature activation-1.11e+00

Pos logit contributions
lor+0.079
+0.073
ival+0.071
 skies+0.068
ican+0.067
Neg logit contributions
 type-0.061
 their-0.059
 Liz-0.058
 its-0.057
 sorts-0.056
Token a
Token ID257
Feature activation+1.09e+00

Pos logit contributions
lor+0.040
ane+0.038
+0.037
ival+0.037
ican+0.037
Neg logit contributions
 their-0.012
 its-0.012
 type-0.011
 his-0.011
 shape-0.010
Token time
Token ID640
Feature activation-2.94e+00

Pos logit contributions
 type+0.023
 their+0.021
 shape+0.021
 sorts+0.020
 its+0.020
Neg logit contributions
lor-0.026
ane-0.024
ival-0.024
-0.024
 skies-0.023
Token there
Token ID612
Feature activation-3.49e-01

Pos logit contributions
lor+0.092
actus+0.084
ival+0.084
+0.082
 skies+0.082
Neg logit contributions
 type-0.040
 their-0.040
 Liz-0.039
 its-0.038
 M-0.035
Token
Token ID198
Feature activation-2.65e-01

Pos logit contributions
lor+0.007
+0.006
ival+0.006
ane+0.006
 skies+0.006
Neg logit contributions
 type-0.007
 shape-0.007
 sorts-0.007
 its-0.007
 their-0.007
TokenThey
Token ID2990
Feature activation+6.51e-01

Pos logit contributions
lor+0.007
+0.006
ival+0.006
ane+0.006
 skies+0.006
Neg logit contributions
 type-0.005
 shape-0.005
 their-0.005
 its-0.005
 sorts-0.005
Token are
Token ID389
Feature activation+6.20e-01

Pos logit contributions
 type+0.010
 shape+0.010
 their+0.009
 sorts+0.009
 its+0.009
Neg logit contributions
lor-0.018
-0.017
ival-0.017
ane-0.017
 skies-0.016
Token lucky
Token ID9670
Feature activation-1.56e+00

Pos logit contributions
 type+0.013
 sorts+0.012
 shape+0.012
 grun+0.012
 Liz+0.012
Neg logit contributions
lor-0.014
-0.013
ival-0.013
ane-0.012
 skies-0.012
Token.
Token ID13
Feature activation+2.06e+00

Pos logit contributions
lor+0.040
+0.039
ival+0.038
ane+0.037
 skies+0.036
Neg logit contributions
 type-0.028
 sorts-0.025
 shape-0.025
 grun-0.024
 its-0.024
Token<|endoftext|>
Token ID50256
Feature activation-8.00e+00

Pos logit contributions
 type+0.043
 grun+0.039
 sorts+0.038
 shape+0.038
 Liz+0.036
Neg logit contributions
-0.046
lor-0.046
 skies-0.042
ane-0.041
ival-0.041
TokenL
Token ID43
Feature activation+1.37e+00

Pos logit contributions
lor+0.198
ival+0.187
 salon+0.183
+0.173
ari+0.172
Neg logit contributions
 their-0.142
 its-0.139
 sorts-0.139
 type-0.138
 shape-0.138
Tokenily
Token ID813
Feature activation+9.63e-01

Pos logit contributions
 their+0.022
 Liz+0.020
 its+0.020
 his+0.019
 type+0.019
Neg logit contributions
-0.041
lor-0.039
ival-0.037
 skies-0.035
ican-0.035
Token and
Token ID290
Feature activation-4.97e+00

Pos logit contributions
 type+0.020
 Liz+0.018
 shape+0.018
 grun+0.018
 sorts+0.018
Neg logit contributions
lor-0.022
-0.020
ane-0.020
ival-0.020
actus-0.020
Token Ben
Token ID3932
Feature activation-1.25e+00

Pos logit contributions
lor+0.141
+0.135
ival+0.132
ane+0.130
 skies+0.127
Neg logit contributions
 type-0.076
 shape-0.068
 sorts-0.067
 its-0.065
 Liz-0.065
Token were
Token ID547
Feature activation-5.87e-01

Pos logit contributions
lor+0.028
+0.027
ival+0.026
ane+0.026
hel+0.026
Neg logit contributions
 type-0.024
 Liz-0.023
 grun-0.022
 sorts-0.022
 shape-0.022
Token put
Token ID1234
Feature activation+6.93e-01

Pos logit contributions
 type+0.025
 sorts+0.023
 Liz+0.022
 shape+0.022
 its+0.022
Neg logit contributions
lor-0.029
ival-0.029
hel-0.028
-0.028
 skies-0.027
Token the
Token ID262
Feature activation-3.11e+00

Pos logit contributions
 type+0.012
 sorts+0.010
 shape+0.010
 grun+0.010
 Liz+0.010
Neg logit contributions
lor-0.018
-0.018
ival-0.017
ane-0.017
 skies-0.017
Token past
Token ID1613
Feature activation+2.04e+00

Pos logit contributions
lor+0.071
+0.068
ival+0.066
ane+0.064
 skies+0.063
Neg logit contributions
 type-0.065
 shape-0.059
 sorts-0.059
 its-0.058
 Liz-0.058
Tokenries
Token ID1678
Feature activation+5.42e-01

Pos logit contributions
 type+0.056
 their+0.052
 shape+0.052
 sorts+0.052
 its+0.052
Neg logit contributions
lor-0.034
-0.031
ival-0.030
ane-0.028
actus-0.028
Token back
Token ID736
Feature activation+2.16e+00

Pos logit contributions
 their+0.008
 its+0.008
 type+0.008
 shape+0.008
 sorts+0.007
Neg logit contributions
lor-0.016
ival-0.015
-0.015
actus-0.014
 skies-0.014
Token in
Token ID287
Feature activation+5.32e+00

Pos logit contributions
 type+0.039
 shape+0.035
 sorts+0.035
 its+0.034
 their+0.034
Neg logit contributions
lor-0.055
-0.053
ival-0.051
ane-0.050
 skies-0.049
Token the
Token ID262
Feature activation-6.51e-01

Pos logit contributions
ook+0.087
 type+0.084
 grun+0.082
apy+0.082
 Liz+0.082
Neg logit contributions
-0.127
 skies-0.117
hel-0.114
lor-0.112
 sky-0.112
Token box
Token ID3091
Feature activation+2.34e+00

Pos logit contributions
lor+0.011
+0.011
 skies+0.010
ival+0.010
actus+0.010
Neg logit contributions
 type-0.017
 grun-0.016
 Liz-0.016
 sorts-0.016
 its-0.015
Token and
Token ID290
Feature activation-2.64e+00

Pos logit contributions
 type+0.039
 shape+0.036
 their+0.035
 its+0.035
 sorts+0.035
Neg logit contributions
lor-0.063
-0.060
ival-0.059
ane-0.058
actus-0.057
Token closed
Token ID4838
Feature activation+2.20e+00

Pos logit contributions
hel+0.052
ival+0.052
+0.051
 skies+0.051
lor+0.050
Neg logit contributions
 type-0.055
 sorts-0.051
uzzle-0.049
rum-0.049
ooks-0.048
Token it
Token ID340
Feature activation+1.21e+00

Pos logit contributions
 type+0.047
 sorts+0.046
cest+0.045
 shape+0.039
uzzle+0.039
Neg logit contributions
 skies-0.043
-0.042
lor-0.042
hel-0.038
icans-0.037
Token and
Token ID290
Feature activation-2.02e+00

Pos logit contributions
 type+0.011
 shape+0.011
 sorts+0.010
 its+0.010
 their+0.010
Neg logit contributions
lor-0.013
-0.012
ival-0.012
ane-0.012
actus-0.011
Token fun
Token ID1257
Feature activation-1.41e+00

Pos logit contributions
lor+0.047
 skies+0.047
+0.046
ival+0.045
hel+0.043
Neg logit contributions
 type-0.034
ook-0.032
uzzle-0.031
 sorts-0.031
ooks-0.031
Token!
Token ID0
Feature activation+9.40e-01

Pos logit contributions
lor+0.035
+0.033
ival+0.032
ane+0.032
 skies+0.032
Neg logit contributions
 type-0.025
 grun-0.023
 Liz-0.022
 shape-0.022
 sorts-0.022
Token She
Token ID1375
Feature activation+1.42e+00

Pos logit contributions
 type+0.016
 Liz+0.016
 its+0.016
 sorts+0.015
 their+0.015
Neg logit contributions
lor-0.024
ival-0.023
ane-0.022
actus-0.022
-0.022
Token laughed
Token ID13818
Feature activation+6.91e-02

Pos logit contributions
 type+0.025
 sorts+0.022
 shape+0.022
 Liz+0.022
 their+0.021
Neg logit contributions
lor-0.036
-0.034
ival-0.034
ane-0.033
 skies-0.033
Token with
Token ID351
Feature activation+5.70e+00

Pos logit contributions
 type+0.002
 sorts+0.002
 shape+0.002
 grun+0.001
 Liz+0.001
Neg logit contributions
lor-0.001
-0.001
ival-0.001
ane-0.001
 skies-0.001
Token joy
Token ID8716
Feature activation+1.29e+00

Pos logit contributions
 type+0.103
 shape+0.093
 sorts+0.092
 Liz+0.090
 grun+0.090
Neg logit contributions
lor-0.146
-0.139
ival-0.135
ane-0.133
 skies-0.131
Token and
Token ID290
Feature activation-2.12e+00

Pos logit contributions
 type+0.029
 sorts+0.027
 shape+0.027
 Liz+0.026
 grun+0.026
Neg logit contributions
lor-0.027
-0.025
ival-0.024
ane-0.024
 skies-0.024
Token ran
Token ID4966
Feature activation-2.69e-01

Pos logit contributions
lor+0.045
+0.044
ival+0.042
 skies+0.042
hel+0.041
Neg logit contributions
 type-0.044
ook-0.038
 shape-0.038
uzzle-0.038
ooks-0.038
Token around
Token ID1088
Feature activation+1.85e+00

Pos logit contributions
lor+0.006
+0.006
ival+0.006
ane+0.006
 skies+0.006
Neg logit contributions
 type-0.005
 shape-0.005
 sorts-0.005
 grun-0.005
 its-0.005
Token and
Token ID290
Feature activation-2.42e+00

Pos logit contributions
 type+0.035
 grun+0.031
 sorts+0.031
 Liz+0.030
 shape+0.030
Neg logit contributions
-0.044
lor-0.042
 skies-0.041
ival-0.040
hel-0.040
Token mercury
Token ID27394
Feature activation+3.34e-01

Pos logit contributions
lor+0.031
+0.031
 skies+0.029
actus+0.028
ival+0.028
Neg logit contributions
 type-0.030
 sorts-0.029
 Liz-0.028
ook-0.027
 its-0.027
Token.
Token ID13
Feature activation+3.04e+00

Pos logit contributions
 type+0.006
 sorts+0.005
 shape+0.005
 Liz+0.005
 grun+0.005
Neg logit contributions
lor-0.009
-0.008
ival-0.008
ane-0.008
 skies-0.008
Token He
Token ID679
Feature activation+7.60e-01

Pos logit contributions
 type+0.058
 sorts+0.053
 shape+0.051
 grun+0.049
 their+0.048
Neg logit contributions
lor-0.076
-0.074
ival-0.070
ane-0.068
 skies-0.068
Token felt
Token ID2936
Feature activation+2.94e+00

Pos logit contributions
 type+0.017
 sorts+0.017
uzzle+0.016
ook+0.016
ooks+0.016
Neg logit contributions
lor-0.014
actus-0.014
ops-0.013
-0.013
 skies-0.012
Token something
Token ID1223
Feature activation+3.68e+00

Pos logit contributions
 type+0.073
 sorts+0.069
 Liz+0.067
ook+0.065
uzzle+0.063
Neg logit contributions
-0.047
lor-0.045
There-0.044
 skies-0.044
actus-0.043
Token sharp
Token ID7786
Feature activation+4.97e+00

Pos logit contributions
 sorts+0.084
 type+0.081
ook+0.076
 Liz+0.076
ooks+0.075
Neg logit contributions
-0.068
There-0.065
actus-0.063
lor-0.062
hel-0.061
Token in
Token ID287
Feature activation+6.25e+00

Pos logit contributions
 type+0.102
 shape+0.093
 sorts+0.093
 Liz+0.091
 its+0.091
Neg logit contributions
lor-0.115
-0.109
ival-0.105
ane-0.104
 skies-0.101
Token his
Token ID465
Feature activation+1.67e+00

Pos logit contributions
 type+0.075
 sorts+0.070
ook+0.067
uzzle+0.064
ooks+0.062
Neg logit contributions
lor-0.175
-0.169
There-0.167
 skies-0.166
stand-0.165
Token tongue
Token ID11880
Feature activation+2.51e+00

Pos logit contributions
 type+0.032
 sorts+0.032
 Liz+0.028
 grun+0.028
 Suz+0.028
Neg logit contributions
lor-0.040
-0.038
 skies-0.035
actus-0.035
ane-0.035
Token.
Token ID13
Feature activation+2.66e+00

Pos logit contributions
 type+0.035
 shape+0.033
 its+0.032
 their+0.031
 grun+0.030
Neg logit contributions
lor-0.074
ival-0.071
-0.071
ane-0.069
actus-0.067
Token He
Token ID679
Feature activation+8.34e-01

Pos logit contributions
 type+0.050
 sorts+0.046
 shape+0.043
 grun+0.041
 their+0.041
Neg logit contributions
lor-0.067
-0.066
ival-0.062
ane-0.061
 skies-0.060
Token answer
Token ID3280
Feature activation+2.09e+00

Pos logit contributions
 type+0.013
 sorts+0.011
 its+0.011
 grun+0.011
 shape+0.011
Neg logit contributions
lor-0.021
ival-0.020
hel-0.020
ane-0.020
-0.020
Token.
Token ID13
Feature activation+2.69e+00

Pos logit contributions
 type+0.039
 sorts+0.036
 shape+0.035
 grun+0.034
 Liz+0.033
Neg logit contributions
lor-0.050
-0.048
ane-0.046
 skies-0.046
ival-0.046
Token They
Token ID1119
Feature activation+1.02e+00

Pos logit contributions
 type+0.035
 sorts+0.030
 shape+0.030
 grun+0.029
 its+0.028
Neg logit contributions
lor-0.082
-0.081
ival-0.078
ane-0.077
 skies-0.076
Token felt
Token ID2936
Feature activation+3.18e+00

Pos logit contributions
 type+0.024
 sorts+0.022
ook+0.021
 Liz+0.021
ooks+0.021
Neg logit contributions
hel-0.019
-0.019
lor-0.019
ival-0.019
 skies-0.017
Token ashamed
Token ID22461
Feature activation+1.10e+00

Pos logit contributions
 type+0.073
 sorts+0.067
 Liz+0.067
 shape+0.065
 grun+0.065
Neg logit contributions
lor-0.061
-0.059
ival-0.058
 skies-0.056
ane-0.055
Token of
Token ID286
Feature activation+5.37e+00

Pos logit contributions
 type+0.024
 shape+0.022
 sorts+0.022
 their+0.022
 its+0.022
Neg logit contributions
lor-0.024
-0.023
ival-0.022
ane-0.022
 skies-0.021
Token what
Token ID644
Feature activation+2.79e+00

Pos logit contributions
 type+0.083
 sorts+0.077
 shape+0.069
 grun+0.068
ook+0.064
Neg logit contributions
-0.143
lor-0.136
 skies-0.135
ival-0.132
hel-0.131
Token they
Token ID484
Feature activation+1.48e+00

Pos logit contributions
 type+0.061
 sorts+0.061
 shape+0.058
 grun+0.056
cest+0.055
Neg logit contributions
ival-0.050
ane-0.049
-0.049
lor-0.049
hel-0.047
Token did
Token ID750
Feature activation-3.46e+00

Pos logit contributions
 type+0.024
 shape+0.022
 sorts+0.022
 their+0.021
 its+0.021
Neg logit contributions
lor-0.041
-0.038
ival-0.038
ane-0.037
 skies-0.036
Token.
Token ID13
Feature activation+2.34e+00

Pos logit contributions
lor+0.103
+0.100
ival+0.097
actus+0.094
 skies+0.093
Neg logit contributions
 type-0.054
 their-0.052
 shape-0.048
 its-0.048
 sorts-0.045
Token They
Token ID1119
Feature activation+1.30e+00

Pos logit contributions
 type+0.034
 sorts+0.030
 shape+0.030
 grun+0.029
 its+0.028
Neg logit contributions
lor-0.068
-0.066
ival-0.064
ane-0.063
 skies-0.062
Tokenucky
Token ID5309
Feature activation+1.41e+00

Pos logit contributions
lor+0.051
ival+0.048
hel+0.043
ane+0.043
 skies+0.043
Neg logit contributions
 type-0.056
 shape-0.055
 M-0.052
 its-0.051
ook-0.051
Token noticed
Token ID6810
Feature activation+1.33e+00

Pos logit contributions
 type+0.022
 shape+0.020
 their+0.020
 its+0.019
 sorts+0.019
Neg logit contributions
lor-0.040
-0.039
ival-0.037
ane-0.037
hel-0.036
Token that
Token ID326
Feature activation+7.24e-01

Pos logit contributions
 type+0.025
 sorts+0.023
 shape+0.022
 Liz+0.022
 grun+0.022
Neg logit contributions
lor-0.033
-0.031
ival-0.030
ane-0.030
 skies-0.030
Token the
Token ID262
Feature activation-2.02e+00

Pos logit contributions
 type+0.007
 sorts+0.006
 grun+0.006
 Liz+0.006
 shape+0.006
Neg logit contributions
lor-0.022
 skies-0.021
actus-0.021
ane-0.021
-0.021
Token water
Token ID1660
Feature activation+2.13e-02

Pos logit contributions
lor+0.052
+0.049
ival+0.049
hel+0.047
ane+0.047
Neg logit contributions
 type-0.039
 shape-0.037
 their-0.035
 its-0.034
 grun-0.033
Token in
Token ID287
Feature activation+5.06e+00

Pos logit contributions
 type+0.000
 sorts+0.000
 Liz+0.000
 shape+0.000
 its+0.000
Neg logit contributions
lor-0.001
-0.001
 skies-0.001
ane-0.001
ival-0.001
Token the
Token ID262
Feature activation-1.03e-01

Pos logit contributions
 type+0.076
 Liz+0.069
 grun+0.068
 sorts+0.067
ook+0.067
Neg logit contributions
lor-0.134
-0.131
 skies-0.128
actus-0.122
There-0.120
Token pond
Token ID16723
Feature activation+1.43e-01

Pos logit contributions
lor+0.002
+0.002
ival+0.002
ane+0.002
 skies+0.002
Neg logit contributions
 type-0.002
 shape-0.002
 sorts-0.002
 its-0.002
 their-0.002
Token was
Token ID373
Feature activation+9.30e-01

Pos logit contributions
 type+0.002
 Liz+0.002
 sorts+0.002
 grun+0.002
 shape+0.002
Neg logit contributions
lor-0.004
-0.004
ival-0.003
ane-0.003
 skies-0.003
Token getting
Token ID1972
Feature activation+3.58e-01

Pos logit contributions
 type+0.019
 sorts+0.018
 Liz+0.018
 grun+0.017
ook+0.017
Neg logit contributions
lor-0.019
-0.019
 skies-0.019
actus-0.017
 sky-0.017
Token dirty
Token ID11841
Feature activation+5.17e-02

Pos logit contributions
 type+0.007
 sorts+0.006
 shape+0.006
 Liz+0.006
 grun+0.006
Neg logit contributions
lor-0.009
-0.009
ival-0.008
ane-0.008
 skies-0.008
Token,
Token ID11
Feature activation+9.59e-01

Pos logit contributions
lor+0.039
+0.037
ival+0.037
ane+0.036
actus+0.036
Neg logit contributions
 type-0.021
 shape-0.019
 sorts-0.019
 its-0.019
 their-0.018
Token which
Token ID543
Feature activation+8.92e-01

Pos logit contributions
 type+0.020
 shape+0.018
 its+0.018
 sorts+0.018
 their+0.018
Neg logit contributions
lor-0.022
-0.021
ival-0.020
ane-0.020
actus-0.020
Token was
Token ID373
Feature activation+9.83e-01

Pos logit contributions
 type+0.016
 shape+0.015
 grun+0.015
 sorts+0.015
 Liz+0.015
Neg logit contributions
lor-0.022
-0.021
ival-0.020
ane-0.020
 skies-0.020
Token parked
Token ID19584
Feature activation-2.80e+00

Pos logit contributions
 type+0.019
 sorts+0.019
uzzle+0.018
 grun+0.018
cest+0.018
Neg logit contributions
lor-0.018
 skies-0.018
-0.018
 salon-0.018
 sky-0.017
Token nearby
Token ID6716
Feature activation+1.83e-01

Pos logit contributions
lor+0.074
+0.069
ival+0.068
ane+0.067
actus+0.065
Neg logit contributions
 type-0.050
 shape-0.045
 its-0.045
 sorts-0.045
 their-0.045
Token.
Token ID13
Feature activation+9.57e-01

Pos logit contributions
 type+0.003
 shape+0.003
 sorts+0.003
 grun+0.003
 Liz+0.003
Neg logit contributions
lor-0.005
-0.005
ival-0.005
ane-0.004
 skies-0.004
Token
Token ID220
Feature activation+3.22e-01

Pos logit contributions
 type+0.021
 shape+0.019
 sorts+0.019
 grun+0.019
 its+0.019
Neg logit contributions
lor-0.021
-0.020
ival-0.019
ane-0.019
 skies-0.018
Token
Token ID198
Feature activation-1.37e+00

Pos logit contributions
 type+0.006
 Liz+0.006
 their+0.006
 its+0.006
 sorts+0.006
Neg logit contributions
lor-0.008
-0.007
ival-0.007
actus-0.007
 skies-0.007
Token
Token ID198
Feature activation-1.24e+00

Pos logit contributions
lor+0.031
+0.029
ane+0.028
ival+0.028
 skies+0.028
Neg logit contributions
 type-0.028
 sorts-0.026
 Liz-0.026
 their-0.026
 its-0.026
TokenAmy
Token ID40797
Feature activation+2.31e+00

Pos logit contributions
lor+0.025
+0.023
ival+0.023
ane+0.022
 skies+0.021
Neg logit contributions
 type-0.029
 shape-0.027
 sorts-0.027
 its-0.027
 grun-0.027
Token enjoyed
Token ID8359
Feature activation+6.41e-02

Pos logit contributions
 type+0.043
 shape+0.040
 sorts+0.039
 grun+0.039
 its+0.038
Neg logit contributions
lor-0.058
-0.055
ival-0.053
ane-0.052
 skies-0.051
Token M
Token ID337
Feature activation+1.23e-01

Pos logit contributions
 type+0.019
 sorts+0.017
uzzle+0.016
 shape+0.016
 grun+0.016
Neg logit contributions
-0.021
 skies-0.020
lor-0.020
There-0.019
hel-0.019
Tokenilly
Token ID6548
Feature activation-1.63e-02

Pos logit contributions
 M+0.003
 type+0.003
 Suz+0.002
 Liz+0.002
 shape+0.002
Neg logit contributions
ival-0.003
lor-0.003
ane-0.002
-0.002
hel-0.002
Token heard
Token ID2982
Feature activation+2.70e+00

Pos logit contributions
lor+0.001
+0.001
ival+0.000
ane+0.000
hel+0.000
Neg logit contributions
 type-0.000
 shape-0.000
 grun-0.000
 their-0.000
 Liz-0.000
Token a
Token ID257
Feature activation+1.83e+00

Pos logit contributions
 type+0.048
 sorts+0.043
 shape+0.042
 grun+0.042
 Liz+0.040
Neg logit contributions
lor-0.069
-0.065
ival-0.064
 skies-0.064
ane-0.063
Token strange
Token ID6283
Feature activation+1.96e+00

Pos logit contributions
 type+0.035
 sorts+0.034
ooks+0.034
ook+0.033
 Liz+0.032
Neg logit contributions
-0.038
lor-0.038
ival-0.038
actus-0.037
 skies-0.037
Token noise
Token ID7838
Feature activation+2.76e+00

Pos logit contributions
 type+0.040
 sorts+0.037
 Liz+0.037
 their+0.036
ooks+0.035
Neg logit contributions
lor-0.041
-0.039
actus-0.039
ane-0.038
 skies-0.038
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
 type+0.058
 sorts+0.053
 Liz+0.052
 shape+0.051
 grun+0.051
Neg logit contributions
lor-0.058
-0.057
hel-0.056
ane-0.055
 skies-0.054
Token She
Token ID1375
Feature activation+9.04e-01

Pos logit contributions
 type+0.027
cest+0.026
 sorts+0.025
 grun+0.024
 their+0.024
Neg logit contributions
-0.017
lor-0.017
 skies-0.017
 sky-0.015
actus-0.015
Token looked
Token ID3114
Feature activation+3.38e+00

Pos logit contributions
 type+0.020
 sorts+0.019
ooks+0.018
uzzle+0.018
ook+0.018
Neg logit contributions
 skies-0.017
lor-0.017
actus-0.016
hel-0.016
ival-0.016
Token down
Token ID866
Feature activation+2.82e+00

Pos logit contributions
 type+0.088
uzzle+0.078
 shape+0.077
ook+0.077
apy+0.076
Neg logit contributions
actus-0.053
 skies-0.052
hel-0.051
ival-0.050
icans-0.049
Token and
Token ID290
Feature activation-3.33e+00

Pos logit contributions
 type+0.062
 shape+0.058
 its+0.058
 their+0.058
 sorts+0.058
Neg logit contributions
lor-0.061
-0.057
ival-0.056
ane-0.054
actus-0.052
Token wants
Token ID3382
Feature activation-3.01e+00

Pos logit contributions
lor+0.033
+0.030
ival+0.030
hel+0.029
ane+0.029
Neg logit contributions
 type-0.024
 their-0.022
 shape-0.022
 its-0.020
 sorts-0.020
Token and
Token ID290
Feature activation-2.42e+00

Pos logit contributions
lor+0.074
+0.070
ival+0.068
ane+0.067
 skies+0.066
Neg logit contributions
 type-0.056
 shape-0.052
 sorts-0.051
 grun-0.051
 Liz-0.050
Token be
Token ID307
Feature activation+2.31e+00

Pos logit contributions
lor+0.070
+0.066
ival+0.064
ane+0.064
actus+0.064
Neg logit contributions
 type-0.039
 their-0.036
 shape-0.035
 its-0.034
 sorts-0.034
Token persistent
Token ID16218
Feature activation+7.08e-01

Pos logit contributions
 grun+0.049
 Liz+0.048
 type+0.048
 Suz+0.046
 sorts+0.046
Neg logit contributions
lor-0.044
There-0.041
-0.039
 skies-0.039
ival-0.039
Token in
Token ID287
Feature activation+3.99e+00

Pos logit contributions
 type+0.014
 grun+0.013
 sorts+0.013
 Liz+0.013
 shape+0.012
Neg logit contributions
lor-0.016
-0.016
ival-0.015
ane-0.015
 skies-0.015
Token finding
Token ID4917
Feature activation+1.79e+00

Pos logit contributions
 grun+0.071
 type+0.068
 Liz+0.067
 sorts+0.064
apy+0.063
Neg logit contributions
lor-0.093
-0.091
 skies-0.087
ival-0.083
ane-0.081
Token new
Token ID649
Feature activation+1.05e-01

Pos logit contributions
 type+0.038
 grun+0.038
 Liz+0.036
 shape+0.034
uzzle+0.034
Neg logit contributions
lor-0.036
 skies-0.034
ival-0.034
-0.032
actus-0.031
Token ways
Token ID2842
Feature activation+1.85e+00

Pos logit contributions
 type+0.002
 grun+0.002
 Liz+0.002
 shape+0.002
 sorts+0.002
Neg logit contributions
lor-0.002
ival-0.002
 skies-0.002
-0.002
ane-0.002
Token to
Token ID284
Feature activation+1.17e-01

Pos logit contributions
 type+0.031
 shape+0.028
 sorts+0.028
 grun+0.027
 its+0.027
Neg logit contributions
lor-0.049
-0.047
ival-0.046
ane-0.045
 skies-0.044
Token have
Token ID423
Feature activation-1.17e+00

Pos logit contributions
 grun+0.002
 type+0.002
 sorts+0.002
 Liz+0.002
apy+0.002
Neg logit contributions
lor-0.003
-0.003
ival-0.003
 skies-0.003
hel-0.002
Token fun
Token ID1257
Feature activation-1.52e+00

Pos logit contributions
lor+0.030
+0.029
ival+0.028
ane+0.027
 skies+0.027
Neg logit contributions
 type-0.021
 shape-0.019
 sorts-0.019
 grun-0.018
 Liz-0.018
Token close
Token ID1969
Feature activation+3.65e+00

Pos logit contributions
 type+0.035
 Liz+0.034
 grun+0.032
 sorts+0.032
 shape+0.031
Neg logit contributions
-0.034
hel-0.033
lor-0.033
ane-0.032
 skies-0.031
Token.
Token ID13
Feature activation+6.35e-01

Pos logit contributions
 type+0.071
 sorts+0.064
 grun+0.063
 shape+0.062
 Liz+0.062
Neg logit contributions
lor-0.086
-0.081
ane-0.081
 skies-0.081
hel-0.079
Token
Token ID198
Feature activation-1.72e+00

Pos logit contributions
 type+0.014
 shape+0.013
 sorts+0.013
 its+0.013
 their+0.013
Neg logit contributions
lor-0.013
-0.013
ival-0.012
ane-0.012
 skies-0.012
Token
Token ID198
Feature activation-1.63e+00

Pos logit contributions
lor+0.036
+0.034
ival+0.033
ane+0.032
 skies+0.031
Neg logit contributions
 type-0.039
 shape-0.036
 sorts-0.036
 its-0.036
 their-0.036
TokenTim
Token ID14967
Feature activation+1.15e+00

Pos logit contributions
lor+0.028
+0.027
ival+0.027
There+0.025
hel+0.024
Neg logit contributions
 type-0.040
 sorts-0.038
 its-0.038
 Liz-0.038
 shape-0.038
Tokenmy
Token ID1820
Feature activation+7.59e-01

Pos logit contributions
 type+0.026
 shape+0.025
 sorts+0.025
 Liz+0.025
 their+0.024
Neg logit contributions
lor-0.025
-0.025
ival-0.023
ane-0.021
 skies-0.020
Token asked
Token ID1965
Feature activation-1.22e+00

Pos logit contributions
 type+0.012
 shape+0.011
 their+0.011
 its+0.010
 grun+0.010
Neg logit contributions
lor-0.022
-0.022
ival-0.020
ane-0.020
hel-0.020
Token his
Token ID465
Feature activation-1.33e+00

Pos logit contributions
 next+0.025
ival+0.023
 skies+0.023
orting+0.023
ari+0.023
Neg logit contributions
 type-0.021
 sorts-0.021
cest-0.020
 shape-0.020
apy-0.020
Token mom
Token ID1995
Feature activation-1.59e+00

Pos logit contributions
lor+0.031
+0.029
ival+0.029
ane+0.029
actus+0.028
Neg logit contributions
 type-0.026
 sorts-0.024
 its-0.024
 grun-0.023
 shape-0.023
Token,
Token ID11
Feature activation-2.89e-01

Pos logit contributions
lor+0.047
+0.045
ival+0.043
actus+0.041
ane+0.041
Neg logit contributions
 type-0.026
 their-0.023
 shape-0.023
 Liz-0.022
 its-0.022
Token "
Token ID366
Feature activation-1.54e+00

Pos logit contributions
+0.007
ival+0.006
lor+0.006
 skies+0.006
There+0.006
Neg logit contributions
 sorts-0.006
 type-0.005
 shape-0.005
cest-0.005
ook-0.005
Token playing
Token ID2712
Feature activation-1.67e+00

Pos logit contributions
 type+0.002
 their+0.001
 shape+0.001
 sorts+0.001
 its+0.001
Neg logit contributions
lor-0.002
-0.002
ival-0.002
actus-0.002
ane-0.002
Token with
Token ID351
Feature activation+2.55e+00

Pos logit contributions
lor+0.040
+0.040
ival+0.039
ane+0.037
 skies+0.037
Neg logit contributions
 type-0.030
 grun-0.028
 sorts-0.028
 shape-0.027
 Liz-0.027
Token some
Token ID617
Feature activation+1.44e+00

Pos logit contributions
 type+0.038
 their+0.034
 shape+0.034
 its+0.034
 sorts+0.034
Neg logit contributions
lor-0.075
-0.071
ival-0.069
ane-0.069
actus-0.067
Token balls
Token ID11333
Feature activation-1.83e+00

Pos logit contributions
 type+0.028
 Liz+0.026
 grun+0.026
 sorts+0.025
 its+0.025
Neg logit contributions
lor-0.033
ival-0.032
-0.032
 skies-0.031
ane-0.030
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
lor+0.057
+0.053
ival+0.052
actus+0.051
 skies+0.050
Neg logit contributions
 type-0.024
 their-0.023
 shape-0.023
 its-0.022
 sorts-0.022
Token
Token ID220
Feature activation+2.38e-01

Pos logit contributions
 type+0.021
 grun+0.019
 shape+0.019
 sorts+0.018
 its+0.018
Neg logit contributions
-0.031
lor-0.031
ival-0.029
 skies-0.029
ane-0.028
Token
Token ID198
Feature activation-1.13e+00

Pos logit contributions
 Liz+0.004
 type+0.004
 their+0.004
 its+0.004
 sorts+0.004
Neg logit contributions
lor-0.006
actus-0.005
ival-0.005
-0.005
 skies-0.005
Token
Token ID198
Feature activation-9.68e-01

Pos logit contributions
lor+0.024
+0.023
ival+0.022
ane+0.022
actus+0.021
Neg logit contributions
 type-0.025
 shape-0.023
 sorts-0.023
 its-0.023
 their-0.023
TokenAmy
Token ID40797
Feature activation+2.46e+00

Pos logit contributions
lor+0.021
+0.020
ival+0.020
ane+0.019
hel+0.018
Neg logit contributions
 type-0.021
 shape-0.019
 grun-0.019
 sorts-0.019
 Liz-0.019
Token showed
Token ID3751
Feature activation+7.05e-02

Pos logit contributions
 type+0.045
 sorts+0.041
 shape+0.041
 its+0.040
 Liz+0.040
Neg logit contributions
lor-0.061
-0.058
ival-0.057
ane-0.056
 skies-0.055
Token her
Token ID607
Feature activation-6.29e-01

Pos logit contributions
 type+0.001
 shape+0.001
 sorts+0.001
 its+0.001
 their+0.001
Neg logit contributions
lor-0.002
-0.002
ival-0.002
ane-0.002
actus-0.002
Token was
Token ID373
Feature activation+5.93e-02

Pos logit contributions
 type+0.016
 shape+0.014
 sorts+0.014
 its+0.014
 grun+0.014
Neg logit contributions
lor-0.028
-0.027
ival-0.026
ane-0.025
actus-0.025
Token excited
Token ID6568
Feature activation-3.90e+00

Pos logit contributions
 type+0.001
 sorts+0.001
 shape+0.001
 Liz+0.001
 grun+0.001
Neg logit contributions
lor-0.001
-0.001
ival-0.001
ane-0.001
 skies-0.001
Token.
Token ID13
Feature activation+7.04e-01

Pos logit contributions
lor+0.108
+0.102
ival+0.100
ane+0.098
actus+0.096
Neg logit contributions
 type-0.067
 shape-0.061
 their-0.061
 sorts-0.060
 its-0.059
Token He
Token ID679
Feature activation+2.09e-01

Pos logit contributions
 type+0.014
 grun+0.012
 shape+0.012
 sorts+0.012
cest+0.012
Neg logit contributions
lor-0.016
-0.016
ane-0.015
 skies-0.015
ival-0.015
Token wanted
Token ID2227
Feature activation-3.93e+00

Pos logit contributions
 type+0.004
 sorts+0.004
 Liz+0.004
ooks+0.004
uzzle+0.004
Neg logit contributions
ival-0.004
lor-0.004
-0.004
ari-0.004
 skies-0.004
Token to
Token ID284
Feature activation-2.44e+00

Pos logit contributions
lor+0.084
ival+0.081
 skies+0.080
ane+0.077
+0.076
Neg logit contributions
 type-0.080
 grun-0.075
 sorts-0.075
 shape-0.072
 Liz-0.071
Token send
Token ID3758
Feature activation-1.07e+00

Pos logit contributions
+0.055
lor+0.052
ival+0.050
 skies+0.049
hel+0.048
Neg logit contributions
 type-0.044
 grun-0.043
 Liz-0.042
 sorts-0.042
ooks-0.042
Token something
Token ID1223
Feature activation+2.26e+00

Pos logit contributions
lor+0.027
ival+0.026
 skies+0.026
ane+0.025
hel+0.025
Neg logit contributions
 type-0.018
 grun-0.016
 sorts-0.016
 shape-0.015
cest-0.015
Token too
Token ID1165
Feature activation+1.55e+00

Pos logit contributions
 type+0.044
 grun+0.043
 Liz+0.041
 sorts+0.041
ook+0.040
Neg logit contributions
lor-0.047
ival-0.046
ane-0.044
-0.043
 skies-0.042
Token.
Token ID13
Feature activation+9.68e-01

Pos logit contributions
 type+0.028
 Liz+0.027
 grun+0.027
 sorts+0.026
uzzle+0.025
Neg logit contributions
lor-0.035
-0.035
ival-0.034
ane-0.034
hel-0.033
Token He
Token ID679
Feature activation+2.90e-01

Pos logit contributions
 type+0.019
 grun+0.017
 shape+0.017
cest+0.017
 sorts+0.017
Neg logit contributions
lor-0.022
-0.021
 skies-0.020
ival-0.020
ane-0.020
Token to
Token ID284
Feature activation-2.38e+00

Pos logit contributions
 skies+0.069
+0.066
ival+0.064
lor+0.064
There+0.063
Neg logit contributions
 type-0.095
cest-0.091
 sorts-0.091
 grun-0.090
uzzle-0.087
Token use
Token ID779
Feature activation+1.38e+00

Pos logit contributions
+0.045
't+0.041
There+0.037
n+0.037
 skies+0.035
Neg logit contributions
cest-0.054
ooks-0.050
uzzle-0.048
 grun-0.048
 type-0.047
Token it
Token ID340
Feature activation-5.78e-01

Pos logit contributions
 type+0.026
cest+0.024
 sorts+0.024
uzzle+0.023
ook+0.023
Neg logit contributions
-0.032
 skies-0.030
lor-0.029
There-0.029
ival-0.027
Token.
Token ID13
Feature activation+8.54e-01

Pos logit contributions
lor+0.015
+0.014
ival+0.014
ane+0.014
 skies+0.013
Neg logit contributions
 type-0.010
 shape-0.009
 sorts-0.009
 its-0.009
 their-0.009
Token
Token ID198
Feature activation-1.72e+00

Pos logit contributions
 type+0.018
 shape+0.017
 sorts+0.017
 its+0.016
 their+0.016
Neg logit contributions
lor-0.019
-0.018
ival-0.017
ane-0.017
 skies-0.017
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
lor+0.037
+0.035
ival+0.034
ane+0.034
 skies+0.033
Neg logit contributions
 type-0.037
 sorts-0.034
 shape-0.034
 its-0.034
 their-0.034
TokenR
Token ID49
Feature activation+3.07e-01

Pos logit contributions
lor+0.035
+0.032
ival+0.031
ane+0.030
actus+0.030
Neg logit contributions
 type-0.035
 sorts-0.033
 shape-0.033
 Liz-0.032
 its-0.032
Tokenuffy
Token ID15352
Feature activation+1.10e+00

Pos logit contributions
 type+0.006
 shape+0.006
 Liz+0.006
 grun+0.006
 its+0.006
Neg logit contributions
lor-0.007
-0.007
ival-0.007
ane-0.006
 skies-0.006
Token tried
Token ID3088
Feature activation-2.21e+00

Pos logit contributions
 type+0.017
 shape+0.015
 sorts+0.015
 its+0.015
 their+0.015
Neg logit contributions
lor-0.031
-0.030
ival-0.029
ane-0.028
 skies-0.028
Token to
Token ID284
Feature activation-1.72e+00

Pos logit contributions
+0.043
lor+0.042
 skies+0.040
ival+0.040
hel+0.039
Neg logit contributions
 type-0.048
 sorts-0.045
cest-0.043
 grun-0.043
 shape-0.042
Token use
Token ID779
Feature activation+1.51e+00

Pos logit contributions
+0.035
hel+0.034
 skies+0.032
There+0.032
't+0.031
Neg logit contributions
 sorts-0.032
ooks-0.031
apy-0.031
cest-0.031
 type-0.031
Token
Token ID198
Feature activation-7.88e-01

Pos logit contributions
 type+0.023
 sorts+0.021
 shape+0.021
 grun+0.021
 its+0.020
Neg logit contributions
lor-0.029
-0.028
ival-0.027
ane-0.027
 skies-0.026
Token
Token ID198
Feature activation-8.06e-01

Pos logit contributions
lor+0.019
+0.018
ival+0.018
ane+0.017
actus+0.017
Neg logit contributions
 type-0.015
 their-0.015
 its-0.015
 Liz-0.015
 shape-0.014
Token"
Token ID1
Feature activation-2.13e+00

Pos logit contributions
lor+0.020
+0.019
ival+0.019
ane+0.019
actus+0.019
Neg logit contributions
 type-0.015
 their-0.014
 its-0.014
 shape-0.014
 Liz-0.014
TokenYou
Token ID1639
Feature activation-5.64e-01

Pos logit contributions
lor+0.052
+0.048
ival+0.047
ane+0.046
actus+0.045
Neg logit contributions
 type-0.043
 shape-0.039
 their-0.039
 its-0.038
 grun-0.038
Token're
Token ID821
Feature activation-1.10e-01

Pos logit contributions
lor+0.015
+0.014
ival+0.014
actus+0.014
ane+0.013
Neg logit contributions
 type-0.010
 shape-0.009
 their-0.009
 sorts-0.009
 its-0.009
Token welcome
Token ID7062
Feature activation-2.22e+00

Pos logit contributions
lor+0.002
+0.002
ival+0.002
 skies+0.002
ane+0.002
Neg logit contributions
 type-0.002
 sorts-0.002
 grun-0.002
 shape-0.002
 Liz-0.002
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
lor+0.064
+0.061
ival+0.061
ane+0.060
actus+0.058
Neg logit contributions
 type-0.034
 their-0.033
 its-0.032
 Liz-0.031
 shape-0.030
Token kids
Token ID3988
Feature activation-1.24e-01

Pos logit contributions
 type+0.019
 their+0.017
 Liz+0.017
 its+0.017
 shape+0.016
Neg logit contributions
lor-0.043
-0.041
ival-0.040
ane-0.040
actus-0.039
Token.
Token ID13
Feature activation+1.74e+00

Pos logit contributions
lor+0.003
+0.003
ane+0.003
ival+0.003
ari+0.003
Neg logit contributions
 type-0.002
 grun-0.002
 shape-0.002
 sorts-0.002
 its-0.002
Token I
Token ID314
Feature activation+3.13e-02

Pos logit contributions
 type+0.042
 grun+0.039
 shape+0.039
 sorts+0.039
 their+0.037
Neg logit contributions
lor-0.033
-0.031
ane-0.029
ival-0.029
 skies-0.029
Token'm
Token ID1101
Feature activation+2.00e-01

Pos logit contributions
 type+0.001
 its+0.000
 Liz+0.000
 grun+0.000
 shape+0.000
Neg logit contributions
-0.001
lor-0.001
ival-0.001
ane-0.001
hel-0.001
Token the
Token ID262
Feature activation-3.11e+00

Pos logit contributions
lor+0.002
+0.002
 skies+0.002
ival+0.002
hel+0.002
Neg logit contributions
 grun-0.002
 type-0.002
 sorts-0.002
apy-0.001
 Liz-0.001
Token fourth
Token ID5544
Feature activation+1.98e+00

Pos logit contributions
lor+0.071
+0.068
ival+0.066
 skies+0.065
ane+0.065
Neg logit contributions
 type-0.062
 sorts-0.058
 grun-0.057
 Liz-0.057
 its-0.057
Token of
Token ID286
Feature activation+4.06e+00

Pos logit contributions
 type+0.040
 grun+0.038
 Liz+0.038
 its+0.038
 their+0.037
Neg logit contributions
lor-0.044
-0.042
ival-0.041
ane-0.041
 skies-0.040
Token July
Token ID2901
Feature activation-2.58e+00

Pos logit contributions
 type+0.069
 grun+0.064
 Liz+0.063
 sorts+0.062
 shape+0.061
Neg logit contributions
lor-0.100
-0.098
 skies-0.097
ival-0.096
ane-0.094
Token celebration
Token ID14395
Feature activation-1.32e+00

Pos logit contributions
lor+0.074
+0.072
ival+0.069
actus+0.068
ane+0.066
Neg logit contributions
 type-0.041
 shape-0.038
 their-0.037
 sorts-0.036
 its-0.034
Token and
Token ID290
Feature activation-2.97e+00

Pos logit contributions
lor+0.037
+0.036
ival+0.034
actus+0.034
ane+0.033
Neg logit contributions
 type-0.022
 shape-0.019
 sorts-0.019
 their-0.019
 its-0.018
Token watch
Token ID2342
Feature activation+1.36e+00

Pos logit contributions
lor+0.072
ival+0.070
+0.070
 skies+0.069
ane+0.067
Neg logit contributions
 type-0.050
 grun-0.047
 sorts-0.046
 Liz-0.045
 shape-0.044
Token it
Token ID340
Feature activation-4.61e-01

Pos logit contributions
 type+0.023
 grun+0.021
 sorts+0.021
 Liz+0.020
 shape+0.020
Neg logit contributions
lor-0.035
ival-0.033
-0.033
 skies-0.033
ane-0.032
Token together
Token ID1978
Feature activation+1.06e+00

Pos logit contributions
lor+0.011
+0.010
ival+0.010
ane+0.010
 skies+0.010
Neg logit contributions
 type-0.009
 grun-0.008
 sorts-0.008
 Liz-0.008
 shape-0.008
Token,"
Token ID553
Feature activation-1.99e+00

Pos logit contributions
 type+0.020
 grun+0.018
 sorts+0.018
 Liz+0.018
 shape+0.018
Neg logit contributions
lor-0.025
-0.024
ival-0.024
ane-0.023
 skies-0.023
Token she
Token ID673
Feature activation+9.86e-01

Pos logit contributions
+0.045
lor+0.045
ival+0.043
ane+0.042
 skies+0.041
Neg logit contributions
 type-0.040
 shape-0.037
 sorts-0.036
 grun-0.035
 its-0.035
Token doll
Token ID3654
Feature activation-1.66e+00

Pos logit contributions
 type+0.085
 sorts+0.080
 grun+0.073
ook+0.073
ooks+0.072
Neg logit contributions
ane-0.071
 skies-0.071
lor-0.071
hel-0.066
ival-0.065
Token hair
Token ID4190
Feature activation-7.20e-01

Pos logit contributions
lor+0.043
+0.042
ival+0.041
 skies+0.040
actus+0.038
Neg logit contributions
 type-0.029
 shape-0.029
 their-0.028
 its-0.028
 Liz-0.027
Token on
Token ID319
Feature activation+3.33e+00

Pos logit contributions
lor+0.019
ival+0.018
+0.018
ari+0.017
 skies+0.017
Neg logit contributions
 type-0.012
 their-0.012
 shape-0.012
 its-0.012
 Liz-0.011
Token my
Token ID616
Feature activation-1.88e+00

Pos logit contributions
 type+0.053
 sorts+0.049
 grun+0.047
 shape+0.045
 Liz+0.045
Neg logit contributions
lor-0.089
-0.086
 skies-0.083
ane-0.081
ival-0.081
Token head
Token ID1182
Feature activation+2.12e+00

Pos logit contributions
lor+0.041
+0.039
ane+0.037
 skies+0.036
ival+0.036
Neg logit contributions
 type-0.040
 sorts-0.039
 grun-0.036
 Liz-0.036
 its-0.036
Token and
Token ID290
Feature activation-2.79e+00

Pos logit contributions
 type+0.033
 shape+0.030
 its+0.029
 their+0.029
 Liz+0.029
Neg logit contributions
lor-0.059
-0.057
ival-0.056
ane-0.054
 skies-0.053
Token I
Token ID314
Feature activation+1.21e+00

Pos logit contributions
lor+0.057
 skies+0.056
+0.055
ane+0.053
 salon+0.053
Neg logit contributions
 type-0.055
 sorts-0.054
ook-0.054
ooks-0.051
uzzle-0.050
Token didn
Token ID1422
Feature activation-4.20e+00

Pos logit contributions
 type+0.023
 its+0.022
 sorts+0.022
 their+0.021
 shape+0.021
Neg logit contributions
-0.027
lor-0.025
uch-0.025
ops-0.025
actus-0.024
Token't
Token ID470
Feature activation+5.80e-01

Pos logit contributions
lor+0.060
ari+0.052
ival+0.051
actus+0.050
 skies+0.050
Neg logit contributions
 their-0.121
 its-0.121
 type-0.121
 Liz-0.120
 shape-0.118
Token want
Token ID765
Feature activation-3.78e+00

Pos logit contributions
apy+0.012
 grun+0.011
 type+0.011
 sorts+0.011
 Ali+0.011
Neg logit contributions
hel-0.010
There-0.010
ovi-0.010
orns-0.010
uch-0.010
Token it
Token ID340
Feature activation-1.14e+00

Pos logit contributions
lor+0.087
+0.082
ival+0.080
 skies+0.080
ane+0.080
Neg logit contributions
 type-0.075
 sorts-0.069
 grun-0.068
 shape-0.067
 Liz-0.064
Token small
Token ID1402
Feature activation+8.09e-01

Pos logit contributions
lor+0.064
+0.061
ival+0.060
ane+0.059
 skies+0.057
Neg logit contributions
 type-0.041
 shape-0.037
 sorts-0.036
 their-0.035
 its-0.035
Token bucket
Token ID19236
Feature activation+7.80e-02

Pos logit contributions
 type+0.015
 grun+0.014
 Liz+0.014
 its+0.014
 sorts+0.014
Neg logit contributions
 skies-0.018
-0.018
ane-0.017
lor-0.017
actus-0.017
Token.
Token ID13
Feature activation+2.25e+00

Pos logit contributions
 type+0.001
 shape+0.001
 their+0.001
 its+0.001
 sorts+0.001
Neg logit contributions
lor-0.002
-0.002
ival-0.002
hel-0.002
ane-0.002
Token And
Token ID843
Feature activation-1.12e+00

Pos logit contributions
 type+0.041
 shape+0.037
 sorts+0.037
 grun+0.036
 its+0.036
Neg logit contributions
lor-0.057
-0.054
ival-0.052
ane-0.052
 skies-0.051
Token we
Token ID356
Feature activation-1.99e-01

Pos logit contributions
lor+0.024
+0.022
ival+0.022
ane+0.021
actus+0.021
Neg logit contributions
 type-0.026
 shape-0.024
 their-0.024
 its-0.023
 sorts-0.023
Token can
Token ID460
Feature activation-4.87e+00

Pos logit contributions
lor+0.006
+0.005
ival+0.005
ane+0.005
 skies+0.005
Neg logit contributions
 type-0.003
 shape-0.003
 sorts-0.003
 their-0.003
 its-0.003
Token share
Token ID2648
Feature activation+1.48e-01

Pos logit contributions
lor+0.149
ival+0.139
ane+0.136
+0.135
actus+0.134
Neg logit contributions
 type-0.069
 shape-0.064
 their-0.061
 its-0.058
 Liz-0.058
Token it
Token ID340
Feature activation-6.70e-02

Pos logit contributions
 type+0.002
 sorts+0.002
 shape+0.002
 grun+0.002
 Liz+0.002
Neg logit contributions
lor-0.004
-0.004
ival-0.004
ane-0.004
 skies-0.004
Token."
Token ID526
Feature activation-6.38e-01

Pos logit contributions
lor+0.002
+0.002
ival+0.002
ane+0.002
 skies+0.002
Neg logit contributions
 type-0.001
 shape-0.001
 their-0.001
 its-0.001
 sorts-0.001
Token
Token ID198
Feature activation+4.93e-02

Pos logit contributions
lor+0.017
ival+0.015
ane+0.015
+0.015
ican+0.015
Neg logit contributions
 their-0.012
 its-0.011
 type-0.011
 Liz-0.011
 shape-0.010
Token
Token ID198
Feature activation+3.19e-02

Pos logit contributions
 type+0.001
 their+0.001
 its+0.001
 shape+0.001
 Liz+0.001
Neg logit contributions
lor-0.001
-0.001
ival-0.001
ane-0.001
hel-0.001
Token
Token ID198
Feature activation-9.22e-01

Pos logit contributions
 type+0.024
 shape+0.022
 sorts+0.022
 its+0.021
 their+0.021
Neg logit contributions
lor-0.026
-0.024
ival-0.024
ane-0.023
 skies-0.023
Token
Token ID198
Feature activation-8.16e-01

Pos logit contributions
lor+0.022
+0.021
ane+0.020
actus+0.020
 skies+0.020
Neg logit contributions
 type-0.017
 its-0.017
 their-0.017
 sorts-0.017
 shape-0.016
TokenThe
Token ID464
Feature activation-6.02e-01

Pos logit contributions
lor+0.019
+0.018
actus+0.018
ane+0.017
 skies+0.017
Neg logit contributions
 type-0.016
 its-0.015
 sorts-0.015
 their-0.015
 shape-0.015
Token End
Token ID5268
Feature activation-4.08e+00

Pos logit contributions
lor+0.016
+0.014
ives+0.013
pill+0.013
orns+0.012
Neg logit contributions
 type-0.013
 shape-0.012
 their-0.012
 grun-0.011
 part-0.011
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
lor+0.098
+0.093
ival+0.090
hel+0.086
ane+0.085
Neg logit contributions
 type-0.078
 shape-0.073
 grun-0.072
 its-0.068
 Liz-0.068
Token<|endoftext|>
Token ID50256
Feature activation-7.35e+00

Pos logit contributions
 type+0.020
 its+0.019
 sorts+0.019
 their+0.019
 Liz+0.019
Neg logit contributions
lor-0.023
actus-0.022
ival-0.022
-0.022
ane-0.021
TokenOnce
Token ID7454
Feature activation-2.64e+00

Pos logit contributions
lor+0.164
ival+0.156
 salon+0.156
ican+0.154
+0.151
Neg logit contributions
 sorts-0.138
 its-0.136
 type-0.135
 their-0.134
 M-0.132
Token there
Token ID612
Feature activation-2.21e-01

Pos logit contributions
lor+0.062
+0.058
ival+0.056
 skies+0.054
ane+0.053
Neg logit contributions
 type-0.054
 their-0.051
 its-0.050
 Liz-0.050
 sorts-0.050
Token was
Token ID373
Feature activation-2.44e-02

Pos logit contributions
+0.005
lor+0.005
ival+0.005
ane+0.005
hel+0.005
Neg logit contributions
 type-0.004
 shape-0.004
 sorts-0.003
 grun-0.003
uzzle-0.003
Token a
Token ID257
Feature activation+1.89e+00

Pos logit contributions
lor+0.001
+0.001
actus+0.001
ival+0.001
ari+0.001
Neg logit contributions
 their-0.000
 its-0.000
 type-0.000
 sorts-0.000
 shape-0.000
Token bird
Token ID6512
Feature activation-2.82e-01

Pos logit contributions
 sorts+0.042
 type+0.041
 its+0.041
 Liz+0.040
 grun+0.039
Neg logit contributions
-0.036
actus-0.034
ival-0.034
lor-0.033
opus-0.031
Token favorite
Token ID4004
Feature activation+1.06e-02

Pos logit contributions
 type+0.008
 shape+0.007
 sorts+0.007
 their+0.007
 its+0.007
Neg logit contributions
lor-0.010
-0.009
ival-0.009
ane-0.009
 skies-0.009
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
 grun+0.000
 type+0.000
 Liz+0.000
 sorts+0.000
 its+0.000
Neg logit contributions
lor-0.000
-0.000
actus-0.000
 skies-0.000
ival-0.000
Token The
Token ID383
Feature activation-5.63e-01

Pos logit contributions
 type+0.021
 Liz+0.020
 their+0.020
 sorts+0.020
 its+0.020
Neg logit contributions
lor-0.026
ival-0.024
-0.024
ane-0.024
 skies-0.023
Token end
Token ID886
Feature activation+5.97e-01

Pos logit contributions
lor+0.015
ane+0.014
ival+0.013
anes+0.013
hel+0.013
Neg logit contributions
 type-0.012
 shape-0.011
 their-0.010
 sorts-0.010
 M-0.010
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
 type+0.013
 grun+0.013
cest+0.012
ooks+0.012
apy+0.012
Neg logit contributions
lor-0.011
-0.011
hel-0.011
ari-0.011
ival-0.011
Token<|endoftext|>
Token ID50256
Feature activation-7.56e+00

Pos logit contributions
 type+0.021
 their+0.021
 its+0.021
 sorts+0.020
 Liz+0.020
Neg logit contributions
lor-0.026
ival-0.024
-0.024
ane-0.024
 skies-0.023
TokenOnce
Token ID7454
Feature activation-3.17e+00

Pos logit contributions
lor+0.206
ican+0.197
ival+0.197
ane+0.194
 salon+0.194
Neg logit contributions
 their-0.116
 its-0.113
 M-0.108
 type-0.107
 Liz-0.105
Token upon
Token ID2402
Feature activation-9.09e-01

Pos logit contributions
lor+0.079
+0.073
ival+0.071
 skies+0.069
ican+0.067
Neg logit contributions
 type-0.061
 their-0.059
 Liz-0.058
 its-0.057
 sorts-0.056
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
lor+0.031
ane+0.030
+0.030
ival+0.029
ican+0.029
Neg logit contributions
 their-0.010
 its-0.010
 type-0.010
 shape-0.009
 his-0.009
Token time
Token ID640
Feature activation-2.87e+00

Pos logit contributions
 type+0.019
 their+0.017
 shape+0.017
 sorts+0.016
 its+0.016
Neg logit contributions
lor-0.027
ane-0.025
ival-0.025
 skies-0.025
anes-0.025
Token,
Token ID11
Feature activation-7.88e-01

Pos logit contributions
lor+0.097
actus+0.090
ival+0.088
 skies+0.087
ican+0.086
Neg logit contributions
 their-0.034
 Liz-0.032
 its-0.031
 type-0.031
 M-0.030
Token be
Token ID307
Feature activation+1.84e+00

Pos logit contributions
 type+0.020
 sorts+0.018
 shape+0.018
 Liz+0.018
 grun+0.018
Neg logit contributions
lor-0.034
-0.033
ival-0.031
ane-0.031
 skies-0.031
Token alert
Token ID7995
Feature activation+2.21e+00

Pos logit contributions
 type+0.034
 grun+0.032
 Liz+0.031
 sorts+0.030
 shape+0.030
Neg logit contributions
lor-0.042
-0.040
ival-0.039
 skies-0.038
ane-0.037
Token and
Token ID290
Feature activation-1.49e+00

Pos logit contributions
 type+0.040
 shape+0.036
 sorts+0.036
 grun+0.035
 Liz+0.035
Neg logit contributions
lor-0.056
-0.053
ival-0.052
ane-0.051
 skies-0.050
Token safe
Token ID3338
Feature activation+4.71e-01

Pos logit contributions
lor+0.034
+0.034
ival+0.033
 skies+0.033
hel+0.031
Neg logit contributions
 type-0.025
 Liz-0.025
 grun-0.024
 sorts-0.024
 shape-0.023
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
 type+0.008
 grun+0.008
 sorts+0.008
 Liz+0.008
 shape+0.007
Neg logit contributions
lor-0.012
-0.011
 skies-0.011
ane-0.011
ival-0.011
Token<|endoftext|>
Token ID50256
Feature activation-6.96e+00

Pos logit contributions
 type+0.026
 sorts+0.024
 shape+0.024
 Liz+0.024
 its+0.024
Neg logit contributions
lor-0.029
-0.027
ival-0.027
ane-0.026
 skies-0.025
TokenOnce
Token ID7454
Feature activation-2.80e+00

Pos logit contributions
lor+0.171
ival+0.163
ican+0.159
 salon+0.157
+0.155
Neg logit contributions
 its-0.121
 their-0.120
 type-0.117
 sorts-0.115
 M-0.114
Token upon
Token ID2402
Feature activation-5.76e-01

Pos logit contributions
lor+0.064
+0.060
ival+0.058
ane+0.056
 skies+0.056
Neg logit contributions
 type-0.059
 their-0.054
 shape-0.054
 Liz-0.054
 its-0.054
Token a
Token ID257
Feature activation+1.26e+00

Pos logit contributions
lor+0.019
+0.018
ane+0.018
ival+0.018
ari+0.018
Neg logit contributions
 type-0.007
 its-0.007
 their-0.006
 shape-0.006
 his-0.006
Token time
Token ID640
Feature activation-2.49e+00

Pos logit contributions
 type+0.026
 shape+0.023
 their+0.023
 sorts+0.022
 grun+0.022
Neg logit contributions
lor-0.031
ane-0.028
ival-0.028
-0.028
 skies-0.027
Token,
Token ID11
Feature activation-2.94e-01

Pos logit contributions
lor+0.077
actus+0.071
ival+0.070
+0.069
 skies+0.069
Neg logit contributions
 type-0.034
 their-0.034
 Liz-0.033
 its-0.032
 M-0.030
Token are
Token ID389
Feature activation+9.91e-02

Pos logit contributions
lor+0.014
+0.013
ival+0.012
ane+0.012
actus+0.012
Neg logit contributions
 type-0.009
 shape-0.008
 sorts-0.008
 their-0.007
 its-0.007
Token very
Token ID845
Feature activation+1.54e+00

Pos logit contributions
 type+0.002
 grun+0.002
apy+0.002
uzzle+0.002
 sorts+0.002
Neg logit contributions
lor-0.002
 skies-0.002
-0.002
ival-0.002
There-0.002
Token smart
Token ID4451
Feature activation+1.43e+00

Pos logit contributions
 grun+0.028
 Liz+0.028
 type+0.027
 its+0.027
 sorts+0.027
Neg logit contributions
ival-0.033
-0.032
lor-0.031
actus-0.030
uch-0.030
Token.
Token ID13
Feature activation+2.05e+00

Pos logit contributions
 type+0.028
 shape+0.026
 sorts+0.026
 their+0.025
 its+0.025
Neg logit contributions
lor-0.035
-0.034
ival-0.032
ane-0.031
actus-0.031
Token The
Token ID383
Feature activation-9.72e-01

Pos logit contributions
 type+0.042
 shape+0.039
 sorts+0.039
 their+0.038
 its+0.038
Neg logit contributions
lor-0.047
-0.044
ival-0.043
ane-0.042
 skies-0.041
Token moral
Token ID6573
Feature activation-5.97e+00

Pos logit contributions
lor+0.024
+0.021
ival+0.021
ane+0.021
hel+0.021
Neg logit contributions
 type-0.021
 shape-0.019
 their-0.018
 sorts-0.018
 grun-0.018
Token of
Token ID286
Feature activation+4.08e+00

Pos logit contributions
lor+0.166
+0.163
ival+0.152
actus+0.151
ane+0.149
Neg logit contributions
 type-0.100
 sorts-0.092
 shape-0.088
 its-0.087
 their-0.086
Token this
Token ID428
Feature activation+4.01e-01

Pos logit contributions
 type+0.054
 its+0.048
 their+0.048
 sorts+0.048
 shape+0.047
Neg logit contributions
lor-0.125
-0.121
ival-0.118
ane-0.116
actus-0.114
Token story
Token ID1621
Feature activation-2.17e+00

Pos logit contributions
 type+0.012
 shape+0.011
 sorts+0.011
 their+0.011
 its+0.011
Neg logit contributions
lor-0.006
-0.006
ival-0.006
hel-0.005
ane-0.005
Token is
Token ID318
Feature activation+9.83e-01

Pos logit contributions
lor+0.054
+0.052
ival+0.050
ane+0.049
 skies+0.049
Neg logit contributions
 type-0.040
 shape-0.037
 sorts-0.036
 its-0.036
 their-0.036
Token that
Token ID326
Feature activation+5.99e-01

Pos logit contributions
 grun+0.022
 type+0.022
 Liz+0.021
 shape+0.021
 sorts+0.021
Neg logit contributions
lor-0.019
-0.018
ival-0.017
ari-0.017
 skies-0.017