TOP ACTIVATIONS
MAX = 5.004

things are not what they seem . And they learned that
they are not what they seem . Be kind and respectful
things are not what they seem . They learned that some
. What does your doll seem like ?" L ila asked
learned that sometimes things might seem bad at first , but
things are not what they seem . They thanked mom my
things aren 't what they seem . From that day on
things are not what they seem ," mom said .
that even though something might seem scary , it 's okay
. To her amaz ement , the clothes came
other people . Things may seem charming , but they are
things are not what they seem , just like the lion
matter how sour something might seem , sweet things might be
knew that sometimes things might seem hard , but with a
didn 't know what " he al " meant but he
L ily , your books he lf is very messy .
" simple " it may seem . Zoe y smiled ,
, Lily ? Sometimes things seem poor at first , but
They had never seen anyone un pack vegetables so quickly before
her that sometimes things can seem scary , but they 're

MIN ACTIVATIONS
MIN = -5.331

wanted to increase . By living life , it found joy
She grabbed oven m itt s and carefully pulled out
little boy getting a buzz cut and a lady getting a
to take a short - cut . She followed
ever , made from flexible wire ! <|endoftext|> Once upon a
They put on oven m itt s and took out the
He jumped around with gust o and spun in circles as
. They are for wood cut ters ." They took Lily
went back to playing with dough . <|endoftext|> Once upon a
time there was a wood cut ter . He had an
mouse looked up at k itty and said , " We
there was a kind wood cut ter who lived in the
threw it in the ash tr ay . Tim my was
same spot , full of cher ries . <|endoftext|> One day
stop and be careful around envelop es they found on the
time together drinking and eating dough . The end
was a very good k itty . <|endoftext|> Once there was
. She used a m itt to take out the pan
all hopped in a big oven . The
time there was a wood cut ter who was strong and

INTERVAL 2.420 - 5.004
CONTAINS 0.281%

mom and dad . She wished she had stayed home and
door . He hoped their parents would not notice anything .
the cows , the pigs , the chickens and the sheep
it to her friends the next day ! <|endoftext|> Kelly was
used his wings to cover himself . He stayed dry and

INTERVAL -0.164 - 2.420
CONTAINS 58.858%

day , her mom my took her to her office .
big , scary bird that lived in the park too .
liked to watch the sun rise every morning . She would
. But when Lily put it on , she noticed something
How do you hang from the roof ?" S

INTERVAL -2.747 - -0.164
CONTAINS 40.738%

, you have a lot of buttons . What are you
. So , together , the frog and the
." Tom and Mia say , " We will
back to grandma and told her what she saw . Grand
then one day , while she was still repeating the same

INTERVAL -5.331 - -2.747
CONTAINS 0.123%

. Rob was carrying a whip . Sam asked Rob ,
help . K itty 's friends , a dog
they were finished , Nem o put the necklace on and
We have apple pie , cherry pie , pumpkin pie ,
. They shouted and b anged their tails so the fox
Token things
Token ID1243
Feature activation+3.94e-01

Pos logit contributions
 dot+0.040
 pavement+0.039
 distances+0.038
 horizon+0.036
 wheels+0.035
Neg logit contributions
 exactly-0.052
itting-0.048
str-0.045
bek-0.045
o-0.045
Token are
Token ID389
Feature activation+1.32e+00

Pos logit contributions
 dot+0.014
 pavement+0.013
 distances+0.013
 horizon+0.013
 dashboard+0.013
Neg logit contributions
 exactly-0.018
itting-0.016
sm-0.015
str-0.015
 known-0.015
Token not
Token ID407
Feature activation-3.57e-01

Pos logit contributions
 dot+0.054
 distances+0.054
 wheels+0.052
 horizon+0.052
 pavement+0.051
Neg logit contributions
itting-0.045
bek-0.042
 exactly-0.041
uc-0.041
itt-0.041
Token what
Token ID644
Feature activation-9.87e-01

Pos logit contributions
 exactly+0.013
itting+0.012
str+0.011
sm+0.011
itt+0.011
Neg logit contributions
 dot-0.015
 distances-0.014
 pavement-0.014
 dashboard-0.013
 wheels-0.013
Token they
Token ID484
Feature activation+1.59e+00

Pos logit contributions
 exactly+0.035
itting+0.032
sm+0.029
str+0.029
bek+0.029
Neg logit contributions
 dot-0.043
 distances-0.042
 pavement-0.041
 horizon-0.040
 dashboard-0.040
Token seem
Token ID1283
Feature activation+5.00e+00

Pos logit contributions
 dot+0.058
 pavement+0.054
 distances+0.054
 horizon+0.053
 spoiler+0.052
Neg logit contributions
 exactly-0.072
itting-0.066
 known-0.064
str-0.062
o-0.062
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
 dot+0.218
 pavement+0.210
 distances+0.208
 horizon+0.203
 dashboard+0.200
Neg logit contributions
 exactly-0.184
itting-0.164
sm-0.152
str-0.151
 known-0.151
Token And
Token ID843
Feature activation-4.37e-01

Pos logit contributions
 dot+0.008
 pavement+0.008
 distances+0.008
 horizon+0.008
 dashboard+0.007
Neg logit contributions
 exactly-0.009
itting-0.008
 known-0.008
o-0.008
sm-0.008
Token they
Token ID484
Feature activation-2.81e-01

Pos logit contributions
 exactly+0.011
itting+0.011
str+0.010
sm+0.010
eh+0.009
Neg logit contributions
 dot-0.023
 distances-0.022
 pavement-0.021
 horizon-0.021
ro-0.020
Token learned
Token ID4499
Feature activation+5.33e-01

Pos logit contributions
 exactly+0.009
itting+0.009
sm+0.008
str+0.008
o+0.007
Neg logit contributions
 dot-0.013
 distances-0.012
 pavement-0.012
 horizon-0.012
 dashboard-0.012
Token that
Token ID326
Feature activation-1.34e+00

Pos logit contributions
 dot+0.022
 wheels+0.021
 horizon+0.021
 dashboard+0.021
 pavement+0.021
Neg logit contributions
itting-0.018
eh-0.017
bek-0.017
str-0.016
live-0.016
Token they
Token ID484
Feature activation+8.09e-01

Pos logit contributions
 dot+0.049
 distances+0.048
 pavement+0.047
 horizon+0.046
 dashboard+0.045
Neg logit contributions
 exactly-0.041
itting-0.037
sm-0.034
str-0.034
 known-0.034
Token are
Token ID389
Feature activation+1.71e+00

Pos logit contributions
 dot+0.031
 pavement+0.031
 horizon+0.029
 distances+0.029
 dashboard+0.028
Neg logit contributions
 exactly-0.034
itting-0.030
 known-0.029
sm-0.028
 wrong-0.028
Token not
Token ID407
Feature activation-3.05e-01

Pos logit contributions
 dot+0.067
 distances+0.067
 wheels+0.062
 pavement+0.060
 sides+0.059
Neg logit contributions
itting-0.065
bek-0.062
 exactly-0.061
str-0.060
uc-0.059
Token what
Token ID644
Feature activation-8.65e-01

Pos logit contributions
 exactly+0.011
itting+0.011
bek+0.010
str+0.010
itt+0.010
Neg logit contributions
 dot-0.012
 distances-0.012
 pavement-0.011
 wheels-0.011
 dashboard-0.011
Token they
Token ID484
Feature activation+1.74e+00

Pos logit contributions
 exactly+0.035
itting+0.032
bek+0.030
str+0.030
rou+0.030
Neg logit contributions
 dot-0.033
 distances-0.033
 pavement-0.031
 horizon-0.030
 Novel-0.029
Token seem
Token ID1283
Feature activation+4.95e+00

Pos logit contributions
 dot+0.062
 pavement+0.058
 distances+0.058
 horizon+0.057
 dashboard+0.055
Neg logit contributions
 exactly-0.079
itting-0.072
 known-0.069
str-0.067
sm-0.067
Token.
Token ID13
Feature activation+6.54e-01

Pos logit contributions
 dot+0.199
 distances+0.192
 pavement+0.189
 horizon+0.182
 dashboard+0.175
Neg logit contributions
 exactly-0.193
itting-0.175
sm-0.162
bek-0.162
str-0.162
Token Be
Token ID1355
Feature activation+8.54e-01

Pos logit contributions
 pavement+0.027
 dot+0.027
 horizon+0.026
 distances+0.026
 surfaces+0.025
Neg logit contributions
 exactly-0.023
o-0.022
itting-0.022
 known-0.021
itt-0.020
Token kind
Token ID1611
Feature activation+4.41e-01

Pos logit contributions
 distances+0.038
 dot+0.037
 wheels+0.036
 sides+0.035
 edges+0.035
Neg logit contributions
itting-0.026
 exactly-0.025
str-0.025
ip-0.024
live-0.023
Token and
Token ID290
Feature activation+5.01e-01

Pos logit contributions
 dot+0.018
 pavement+0.017
 distances+0.017
 horizon+0.017
 dashboard+0.016
Neg logit contributions
 exactly-0.018
itting-0.016
sm-0.015
str-0.015
 known-0.014
Token respectful
Token ID25923
Feature activation+6.36e-01

Pos logit contributions
 dot+0.022
 distances+0.022
 pavement+0.022
 horizon+0.021
 spoiler+0.020
Neg logit contributions
 exactly-0.017
itting-0.016
str-0.014
sm-0.014
o-0.014
Token things
Token ID1243
Feature activation+5.18e-01

Pos logit contributions
 dot+0.044
 pavement+0.043
 distances+0.042
 horizon+0.040
 dashboard+0.039
Neg logit contributions
 exactly-0.054
itting-0.050
str-0.046
o-0.046
sm-0.046
Token are
Token ID389
Feature activation+1.30e+00

Pos logit contributions
 dot+0.019
 pavement+0.018
 distances+0.018
 horizon+0.017
 dashboard+0.017
Neg logit contributions
 exactly-0.023
itting-0.020
sm-0.019
str-0.019
 known-0.019
Token not
Token ID407
Feature activation-4.12e-01

Pos logit contributions
 dot+0.052
 distances+0.051
 wheels+0.051
 pavement+0.050
 horizon+0.049
Neg logit contributions
itting-0.045
bek-0.042
o-0.041
uc-0.041
 exactly-0.040
Token what
Token ID644
Feature activation-9.87e-01

Pos logit contributions
 exactly+0.014
itting+0.014
str+0.013
sm+0.013
itt+0.013
Neg logit contributions
 dot-0.017
 distances-0.017
 pavement-0.017
 dashboard-0.016
 wheels-0.016
Token they
Token ID484
Feature activation+1.60e+00

Pos logit contributions
 exactly+0.032
itting+0.028
bek+0.026
rou+0.026
sm+0.026
Neg logit contributions
 dot-0.046
 distances-0.045
 pavement-0.045
 horizon-0.042
 dashboard-0.042
Token seem
Token ID1283
Feature activation+4.94e+00

Pos logit contributions
 dot+0.057
 distances+0.054
 pavement+0.053
 horizon+0.053
 dashboard+0.051
Neg logit contributions
 exactly-0.073
itting-0.066
 known-0.064
str-0.062
sm-0.062
Token.
Token ID13
Feature activation+2.58e-01

Pos logit contributions
 dot+0.207
 pavement+0.199
 distances+0.198
 horizon+0.192
 dashboard+0.187
Neg logit contributions
 exactly-0.189
itting-0.170
sm-0.158
str-0.157
 known-0.155
Token They
Token ID1119
Feature activation+8.87e-01

Pos logit contributions
 distances+0.011
 dot+0.011
 pavement+0.011
 horizon+0.010
 dashboard+0.010
Neg logit contributions
 exactly-0.009
itting-0.009
 known-0.008
o-0.008
sm-0.008
Token learned
Token ID4499
Feature activation+7.18e-01

Pos logit contributions
 dot+0.042
 distances+0.041
 pavement+0.041
 horizon+0.040
 dashboard+0.039
Neg logit contributions
 exactly-0.029
itting-0.025
sm-0.023
str-0.023
 known-0.023
Token that
Token ID326
Feature activation-1.30e+00

Pos logit contributions
 dot+0.029
 wheels+0.029
 pavement+0.029
 dashboard+0.028
 buttons+0.028
Neg logit contributions
itting-0.025
bek-0.022
eh-0.022
 exactly-0.022
str-0.021
Token some
Token ID617
Feature activation+1.28e+00

Pos logit contributions
 exactly+0.044
itting+0.041
str+0.037
sm+0.037
bek+0.037
Neg logit contributions
 dot-0.060
 distances-0.057
 pavement-0.056
 horizon-0.055
 wheels-0.053
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
 dot+0.008
 pavement+0.008
 horizon+0.008
 spoiler+0.007
 dashboard+0.007
Neg logit contributions
 exactly-0.007
itting-0.006
 wrong-0.006
o-0.006
 known-0.006
Token What
Token ID1867
Feature activation-8.43e-01

Pos logit contributions
 exactly+0.001
itting+0.001
sm+0.001
str+0.001
o+0.001
Neg logit contributions
 dot-0.000
 distances-0.000
 pavement-0.000
 horizon-0.000
 dashboard-0.000
Token does
Token ID857
Feature activation+8.25e-01

Pos logit contributions
 exactly+0.040
itting+0.038
str+0.037
sm+0.036
 known+0.036
Neg logit contributions
 dot-0.025
 distances-0.022
 pavement-0.021
 dashboard-0.021
 Novel-0.021
Token your
Token ID534
Feature activation-9.48e-02

Pos logit contributions
 dot+0.038
 pavement+0.037
 distances+0.037
 horizon+0.035
 dashboard+0.034
Neg logit contributions
 exactly-0.029
itting-0.025
str-0.024
sm-0.023
 known-0.023
Token doll
Token ID3654
Feature activation-9.18e-01

Pos logit contributions
 exactly+0.004
itting+0.004
sm+0.003
str+0.003
 known+0.003
Neg logit contributions
 dot-0.004
 pavement-0.003
 distances-0.003
 horizon-0.003
 dashboard-0.003
Token seem
Token ID1283
Feature activation+4.85e+00

Pos logit contributions
 exactly+0.037
 wrong+0.032
o+0.031
itting+0.031
str+0.030
Neg logit contributions
 dot-0.038
 distances-0.038
 pavement-0.037
 horizon-0.037
 dashboard-0.035
Token like
Token ID588
Feature activation-8.35e-01

Pos logit contributions
 dot+0.179
 pavement+0.171
 distances+0.170
 horizon+0.164
 dashboard+0.159
Neg logit contributions
 exactly-0.209
itting-0.190
sm-0.179
str-0.178
 known-0.176
Token?"
Token ID1701
Feature activation+1.15e+00

Pos logit contributions
 exactly+0.037
itting+0.033
sm+0.032
str+0.031
 known+0.031
Neg logit contributions
 dot-0.030
 pavement-0.029
 distances-0.029
 horizon-0.028
 dashboard-0.027
Token L
Token ID406
Feature activation-7.78e-01

Pos logit contributions
 dot+0.044
 distances+0.041
 pavement+0.041
 horizon+0.040
 dashboard+0.038
Neg logit contributions
 exactly-0.051
itting-0.047
rou-0.044
sm-0.042
uc-0.042
Tokenila
Token ID10102
Feature activation-2.39e-01

Pos logit contributions
 exactly+0.032
itting+0.029
sm+0.027
 known+0.027
str+0.027
Neg logit contributions
 dot-0.030
 distances-0.029
 pavement-0.028
 horizon-0.027
 dashboard-0.027
Token asked
Token ID1965
Feature activation+1.48e+00

Pos logit contributions
 exactly+0.009
o+0.007
itting+0.007
 known+0.007
 named+0.007
Neg logit contributions
 dot-0.011
 pavement-0.010
 horizon-0.010
 distances-0.010
 spoiler-0.010
Token learned
Token ID4499
Feature activation-1.09e-01

Pos logit contributions
 exactly+0.012
itting+0.011
sm+0.010
str+0.010
 known+0.010
Neg logit contributions
 dot-0.011
 pavement-0.010
 distances-0.010
 horizon-0.010
 dashboard-0.009
Token that
Token ID326
Feature activation-1.86e+00

Pos logit contributions
itting+0.004
o+0.003
str+0.003
 exactly+0.003
bek+0.003
Neg logit contributions
 pavement-0.004
 distances-0.004
 dot-0.004
 horizon-0.004
 myself-0.004
Token sometimes
Token ID3360
Feature activation+6.47e-01

Pos logit contributions
 exactly+0.072
itting+0.066
str+0.062
sm+0.062
o+0.060
Neg logit contributions
 dot-0.078
 distances-0.073
 pavement-0.073
 horizon-0.071
 dashboard-0.067
Token things
Token ID1243
Feature activation+3.26e-01

Pos logit contributions
 dot+0.029
 pavement+0.028
 distances+0.027
 horizon+0.027
 dashboard+0.026
Neg logit contributions
 exactly-0.024
itting-0.021
sm-0.019
str-0.019
 known-0.019
Token might
Token ID1244
Feature activation+8.06e-01

Pos logit contributions
 dot+0.012
 pavement+0.012
 distances+0.012
 horizon+0.011
 dashboard+0.011
Neg logit contributions
 exactly-0.014
itting-0.012
 wrong-0.012
str-0.012
sm-0.012
Token seem
Token ID1283
Feature activation+4.80e+00

Pos logit contributions
 dot+0.026
 pavement+0.025
 distances+0.025
 dashboard+0.025
 spoiler+0.024
Neg logit contributions
 exactly-0.038
 known-0.035
 wrong-0.034
str-0.034
itting-0.034
Token bad
Token ID2089
Feature activation+2.01e+00

Pos logit contributions
 dot+0.205
 pavement+0.197
 distances+0.197
 horizon+0.191
 dashboard+0.185
Neg logit contributions
 exactly-0.179
itting-0.160
sm-0.149
str-0.148
 known-0.146
Token at
Token ID379
Feature activation+3.00e-01

Pos logit contributions
 dot+0.101
 pavement+0.098
 distances+0.096
 horizon+0.095
 dashboard+0.093
Neg logit contributions
 exactly-0.061
itting-0.053
sm-0.048
 known-0.047
str-0.046
Token first
Token ID717
Feature activation+1.27e+00

Pos logit contributions
 dot+0.016
 pavement+0.015
 dashboard+0.015
 distances+0.015
 horizon+0.015
Neg logit contributions
 exactly-0.009
 wrong-0.007
itting-0.007
 known-0.007
 helping-0.007
Token,
Token ID11
Feature activation-2.14e-01

Pos logit contributions
 dot+0.064
 pavement+0.062
 horizon+0.061
 dashboard+0.061
 distances+0.060
Neg logit contributions
 exactly-0.039
itting-0.032
 wrong-0.031
 helping-0.029
 known-0.029
Token but
Token ID475
Feature activation+1.54e-01

Pos logit contributions
 exactly+0.009
itting+0.008
 known+0.008
sm+0.008
 helping+0.008
Neg logit contributions
 dot-0.008
 pavement-0.008
 distances-0.007
 horizon-0.007
 dashboard-0.007
Token things
Token ID1243
Feature activation+1.62e-01

Pos logit contributions
 dot+0.045
 pavement+0.043
 distances+0.043
 horizon+0.042
 dashboard+0.041
Neg logit contributions
 exactly-0.034
itting-0.030
sm-0.027
 known-0.027
str-0.027
Token are
Token ID389
Feature activation+9.92e-01

Pos logit contributions
 dot+0.006
 pavement+0.006
 distances+0.006
 horizon+0.006
 dashboard+0.006
Neg logit contributions
 exactly-0.007
itting-0.006
str-0.006
 wrong-0.006
sm-0.006
Token not
Token ID407
Feature activation-7.46e-01

Pos logit contributions
 dot+0.045
 distances+0.043
 horizon+0.042
 pavement+0.042
 dashboard+0.041
Neg logit contributions
 exactly-0.031
itting-0.031
sm-0.028
str-0.028
o-0.028
Token what
Token ID644
Feature activation-1.19e+00

Pos logit contributions
 exactly+0.026
itting+0.024
str+0.022
sm+0.022
o+0.022
Neg logit contributions
 dot-0.033
 distances-0.032
 pavement-0.031
 horizon-0.030
 dashboard-0.030
Token they
Token ID484
Feature activation+1.39e+00

Pos logit contributions
 exactly+0.046
itting+0.041
sm+0.039
str+0.038
 known+0.038
Neg logit contributions
 dot-0.049
 distances-0.047
 pavement-0.047
 horizon-0.045
 dashboard-0.044
Token seem
Token ID1283
Feature activation+4.78e+00

Pos logit contributions
 dot+0.051
 pavement+0.048
 distances+0.047
 horizon+0.047
 spoiler+0.046
Neg logit contributions
 exactly-0.062
itting-0.057
 known-0.056
 wrong-0.054
ll-0.053
Token.
Token ID13
Feature activation+1.38e-01

Pos logit contributions
 dot+0.203
 pavement+0.198
 distances+0.194
 horizon+0.190
 dashboard+0.189
Neg logit contributions
 exactly-0.181
itting-0.161
 known-0.150
 wrong-0.150
sm-0.148
Token They
Token ID1119
Feature activation+6.49e-01

Pos logit contributions
 pavement+0.006
 dot+0.006
 distances+0.006
 horizon+0.006
 dashboard+0.006
Neg logit contributions
 exactly-0.005
itting-0.004
 known-0.004
o-0.004
sm-0.004
Token thanked
Token ID26280
Feature activation+1.52e+00

Pos logit contributions
 dot+0.030
 pavement+0.029
 distances+0.029
 horizon+0.028
 dashboard+0.027
Neg logit contributions
 exactly-0.022
itting-0.019
 known-0.018
sm-0.018
str-0.018
Token mom
Token ID1995
Feature activation-1.57e+00

Pos logit contributions
 dot+0.061
 pavement+0.056
 distances+0.056
 horizon+0.056
 dashboard+0.053
Neg logit contributions
 exactly-0.061
itting-0.060
str-0.056
eh-0.053
sm-0.053
Tokenmy
Token ID1820
Feature activation+1.75e-02

Pos logit contributions
itting+0.084
str+0.083
live+0.079
eh+0.078
 known+0.078
Neg logit contributions
 distances-0.031
 horizon-0.031
 heads-0.030
 dot-0.030
 Novel-0.030
Token things
Token ID1243
Feature activation+3.49e-01

Pos logit contributions
 dot+0.029
 distances+0.029
 pavement+0.028
 horizon+0.028
 dashboard+0.027
Neg logit contributions
 exactly-0.026
itting-0.024
sm-0.022
str-0.022
o-0.022
Token aren
Token ID3588
Feature activation-2.04e-01

Pos logit contributions
 dot+0.012
 pavement+0.012
 distances+0.012
 horizon+0.011
 dashboard+0.011
Neg logit contributions
 exactly-0.016
itting-0.014
sm-0.013
str-0.013
 known-0.013
Token't
Token ID470
Feature activation+6.09e-01

Pos logit contributions
 exactly+0.008
itting+0.007
str+0.006
 known+0.006
sm+0.006
Neg logit contributions
 dot-0.009
 pavement-0.009
 distances-0.008
 horizon-0.008
 dashboard-0.008
Token what
Token ID644
Feature activation-1.22e+00

Pos logit contributions
 dot+0.026
 pavement+0.026
 distances+0.026
 horizon+0.024
 dashboard+0.024
Neg logit contributions
 exactly-0.021
itting-0.020
uc-0.018
itt-0.018
str-0.018
Token they
Token ID484
Feature activation+1.54e+00

Pos logit contributions
 exactly+0.047
itting+0.043
sm+0.040
str+0.040
 known+0.039
Neg logit contributions
 dot-0.048
 distances-0.048
 pavement-0.047
 horizon-0.045
ro-0.044
Token seem
Token ID1283
Feature activation+4.73e+00

Pos logit contributions
 dot+0.054
 distances+0.050
 pavement+0.050
 horizon+0.048
 dashboard+0.047
Neg logit contributions
 exactly-0.072
itting-0.066
 known-0.063
str-0.062
sm-0.061
Token.
Token ID13
Feature activation-2.20e-01

Pos logit contributions
 dot+0.193
 pavement+0.185
 distances+0.185
 horizon+0.179
 dashboard+0.173
Neg logit contributions
 exactly-0.185
itting-0.167
sm-0.156
str-0.155
 known-0.152
Token From
Token ID3574
Feature activation+6.64e-01

Pos logit contributions
 exactly+0.010
itting+0.010
sm+0.009
str+0.009
 known+0.009
Neg logit contributions
 dot-0.007
 pavement-0.007
 distances-0.007
 horizon-0.007
 dashboard-0.006
Token that
Token ID326
Feature activation-1.97e+00

Pos logit contributions
 dot+0.023
 pavement+0.022
 horizon+0.022
 distances+0.022
 dashboard+0.020
Neg logit contributions
 exactly-0.029
itting-0.027
sm-0.026
str-0.025
 known-0.025
Token day
Token ID1110
Feature activation-9.57e-01

Pos logit contributions
 exactly+0.075
itting+0.064
uc+0.062
sm+0.061
rou+0.059
Neg logit contributions
 dot-0.085
 distances-0.081
 horizon-0.081
 pavement-0.080
ro-0.076
Token on
Token ID319
Feature activation-7.39e-01

Pos logit contributions
 exactly+0.032
rou+0.032
sm+0.031
itting+0.031
str+0.030
Neg logit contributions
 distances-0.038
 wheels-0.036
 pavement-0.036
 dot-0.036
 sides-0.035
Token things
Token ID1243
Feature activation+7.28e-01

Pos logit contributions
 dot+0.022
 pavement+0.021
 distances+0.021
 horizon+0.020
 dashboard+0.020
Neg logit contributions
 exactly-0.018
itting-0.016
sm-0.015
str-0.014
 known-0.014
Token are
Token ID389
Feature activation+1.24e+00

Pos logit contributions
 dot+0.027
 pavement+0.026
 distances+0.026
 horizon+0.025
 dashboard+0.024
Neg logit contributions
 exactly-0.032
itting-0.028
str-0.027
sm-0.027
 known-0.026
Token not
Token ID407
Feature activation-4.66e-01

Pos logit contributions
 dot+0.053
 distances+0.052
 pavement+0.051
 horizon+0.050
 dashboard+0.048
Neg logit contributions
 exactly-0.042
itting-0.041
sm-0.038
str-0.037
o-0.037
Token what
Token ID644
Feature activation-9.44e-01

Pos logit contributions
 exactly+0.017
itting+0.016
sm+0.014
str+0.014
o+0.014
Neg logit contributions
 dot-0.020
 distances-0.019
 pavement-0.019
 horizon-0.018
 dashboard-0.018
Token they
Token ID484
Feature activation+1.66e+00

Pos logit contributions
 exactly+0.039
itting+0.036
sm+0.033
str+0.033
 known+0.032
Neg logit contributions
 dot-0.036
 distances-0.035
 pavement-0.035
 horizon-0.033
 dashboard-0.032
Token seem
Token ID1283
Feature activation+4.72e+00

Pos logit contributions
 dot+0.057
 pavement+0.054
 distances+0.054
 horizon+0.052
 dashboard+0.050
Neg logit contributions
 exactly-0.078
itting-0.070
 known-0.067
str-0.066
sm-0.066
Token,"
Token ID553
Feature activation+1.42e+00

Pos logit contributions
 dot+0.198
 distances+0.190
 pavement+0.189
 horizon+0.184
 dashboard+0.177
Neg logit contributions
 exactly-0.177
itting-0.160
sm-0.149
str-0.148
 known-0.145
Token mom
Token ID1995
Feature activation-1.48e+00

Pos logit contributions
 dot+0.049
 pavement+0.047
 distances+0.045
 horizon+0.044
 dashboard+0.044
Neg logit contributions
itting-0.061
 exactly-0.060
rou-0.055
sm-0.055
str-0.052
Token said
Token ID531
Feature activation+9.29e-02

Pos logit contributions
 exactly+0.062
itting+0.060
str+0.055
sm+0.055
 known+0.055
Neg logit contributions
 dot-0.050
 pavement-0.049
 distances-0.048
 horizon-0.047
 dashboard-0.045
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
 cheeks+0.002
 pavement+0.002
 dot+0.002
 widened+0.002
ly+0.002
Neg logit contributions
live-0.005
itting-0.005
eh-0.004
bek-0.004
 known-0.004
Token
Token ID198
Feature activation+1.97e-01

Pos logit contributions
 exactly+0.012
itting+0.011
sm+0.011
str+0.011
 known+0.010
Neg logit contributions
 dot-0.010
 pavement-0.010
 distances-0.010
 horizon-0.010
ro-0.009
Token that
Token ID326
Feature activation-1.76e+00

Pos logit contributions
itting+0.005
bek+0.004
str+0.004
itt+0.004
eh+0.004
Neg logit contributions
 distances-0.006
 pavement-0.006
 dot-0.006
 horizon-0.006
 myself-0.006
Token even
Token ID772
Feature activation-5.00e-01

Pos logit contributions
 exactly+0.073
itting+0.068
str+0.064
sm+0.063
uc+0.061
Neg logit contributions
 dot-0.068
 distances-0.064
 pavement-0.062
 horizon-0.061
 dashboard-0.058
Token though
Token ID996
Feature activation+1.40e-01

Pos logit contributions
 exactly+0.016
itting+0.014
str+0.013
sm+0.013
o+0.013
Neg logit contributions
 dot-0.024
 distances-0.023
 pavement-0.023
 horizon-0.022
 dashboard-0.022
Token something
Token ID1223
Feature activation+1.56e+00

Pos logit contributions
 dot+0.006
 pavement+0.006
 distances+0.006
 horizon+0.006
 dashboard+0.006
Neg logit contributions
 exactly-0.005
itting-0.004
sm-0.004
str-0.004
 known-0.004
Token might
Token ID1244
Feature activation+9.01e-01

Pos logit contributions
 dot+0.055
 pavement+0.053
 distances+0.053
 horizon+0.051
 dashboard+0.049
Neg logit contributions
 exactly-0.069
itting-0.063
sm-0.060
str-0.059
 known-0.059
Token seem
Token ID1283
Feature activation+4.63e+00

Pos logit contributions
 dot+0.027
 pavement+0.026
 distances+0.026
 horizon+0.024
 dashboard+0.024
Neg logit contributions
 exactly-0.045
itting-0.041
str-0.039
sm-0.039
 known-0.039
Token scary
Token ID14343
Feature activation+2.05e+00

Pos logit contributions
 dot+0.192
 distances+0.183
 pavement+0.183
 horizon+0.177
 dashboard+0.171
Neg logit contributions
 exactly-0.179
itting-0.161
sm-0.150
str-0.149
 known-0.146
Token,
Token ID11
Feature activation+1.89e-01

Pos logit contributions
 dot+0.102
 pavement+0.100
 horizon+0.098
 spoiler+0.098
 dashboard+0.098
Neg logit contributions
 exactly-0.060
itting-0.051
 wrong-0.051
 helping-0.049
sm-0.047
Token it
Token ID340
Feature activation-1.35e-01

Pos logit contributions
 dot+0.008
 pavement+0.008
 distances+0.008
 horizon+0.008
 dashboard+0.008
Neg logit contributions
 exactly-0.007
itting-0.006
sm-0.006
 known-0.006
str-0.006
Token's
Token ID338
Feature activation-1.12e+00

Pos logit contributions
 exactly+0.006
itting+0.005
sm+0.005
str+0.005
 known+0.005
Neg logit contributions
 dot-0.005
 distances-0.005
 pavement-0.005
 horizon-0.005
 dashboard-0.004
Token okay
Token ID8788
Feature activation-5.93e-01

Pos logit contributions
itting+0.035
uc+0.035
itt+0.034
bek+0.033
str+0.033
Neg logit contributions
 distances-0.047
 dot-0.047
 edges-0.044
 horizon-0.044
 wheels-0.044
Token.
Token ID13
Feature activation-5.49e-01

Pos logit contributions
 exactly+0.059
itting+0.053
str+0.049
sm+0.049
 known+0.049
Neg logit contributions
 dot-0.071
 pavement-0.069
 distances-0.068
 horizon-0.066
 dashboard-0.064
Token
Token ID198
Feature activation-7.33e-02

Pos logit contributions
 exactly+0.025
itting+0.023
sm+0.022
str+0.022
 known+0.021
Neg logit contributions
 dot-0.019
 pavement-0.018
 distances-0.018
 horizon-0.017
 dashboard-0.017
Token
Token ID198
Feature activation+1.59e-01

Pos logit contributions
 exactly+0.003
itting+0.003
uc+0.002
str+0.002
 wrong+0.002
Neg logit contributions
 dot-0.003
 distances-0.003
 pavement-0.003
 horizon-0.003
 dashboard-0.003
TokenTo
Token ID2514
Feature activation+4.47e-01

Pos logit contributions
 distances+0.006
 dot+0.006
 pavement+0.006
 horizon+0.006
 dashboard+0.005
Neg logit contributions
 exactly-0.007
itting-0.006
 wrong-0.005
uc-0.005
str-0.005
Token her
Token ID607
Feature activation-6.56e-01

Pos logit contributions
 dot+0.020
 distances+0.019
 pavement+0.019
 horizon+0.019
 dashboard+0.018
Neg logit contributions
 exactly-0.015
itting-0.014
str-0.013
sm-0.013
 known-0.013
Token amaz
Token ID40642
Feature activation+4.62e+00

Pos logit contributions
 exactly+0.031
itting+0.030
str+0.028
sm+0.028
o+0.027
Neg logit contributions
 dot-0.022
 horizon-0.020
 distances-0.019
 pavement-0.019
 wheels-0.018
Tokenement
Token ID972
Feature activation+9.92e-01

Pos logit contributions
 dot+0.160
 pavement+0.155
 distances+0.152
 horizon+0.142
 dashboard+0.141
Neg logit contributions
 exactly-0.219
itting-0.194
sm-0.182
str-0.180
 wrong-0.180
Token,
Token ID11
Feature activation-3.81e-01

Pos logit contributions
 dot+0.039
 horizon+0.036
 distances+0.036
 pavement+0.036
ro+0.034
Neg logit contributions
 exactly-0.038
 known-0.037
itting-0.036
sm-0.035
str-0.035
Token the
Token ID262
Feature activation-7.04e-01

Pos logit contributions
itting+0.014
 exactly+0.013
str+0.012
uc+0.012
sm+0.012
Neg logit contributions
 dot-0.016
 pavement-0.015
 distances-0.015
 horizon-0.015
 wheels-0.014
Token clothes
Token ID8242
Feature activation+7.73e-01

Pos logit contributions
 exactly+0.033
itting+0.031
o+0.029
 known+0.028
sm+0.028
Neg logit contributions
 dot-0.025
 pavement-0.024
 horizon-0.023
 dashboard-0.022
 distances-0.022
Token came
Token ID1625
Feature activation-6.73e-02

Pos logit contributions
 dot+0.029
 pavement+0.028
 distances+0.028
 horizon+0.027
 dashboard+0.026
Neg logit contributions
 exactly-0.033
itting-0.030
sm-0.028
str-0.028
 known-0.027
Token other
Token ID584
Feature activation+1.45e-02

Pos logit contributions
 exactly+0.001
itting+0.001
sm+0.001
str+0.001
 known+0.001
Neg logit contributions
 dot-0.001
 pavement-0.001
 distances-0.001
 horizon-0.001
 dashboard-0.001
Token people
Token ID661
Feature activation+1.80e+00

Pos logit contributions
 dot+0.001
 pavement+0.001
 horizon+0.001
 distances+0.001
 spoiler+0.001
Neg logit contributions
 exactly-0.001
itting-0.001
sm-0.001
 known-0.000
 wrong-0.000
Token.
Token ID13
Feature activation+2.31e-02

Pos logit contributions
 dot+0.075
 distances+0.074
 pavement+0.071
 horizon+0.067
 dashboard+0.067
Neg logit contributions
 exactly-0.068
itting-0.062
str-0.057
 known-0.057
bek-0.056
Token Things
Token ID11597
Feature activation+5.08e-01

Pos logit contributions
 dot+0.001
 pavement+0.001
 distances+0.001
 horizon+0.001
 dashboard+0.001
Neg logit contributions
 exactly-0.001
itting-0.001
sm-0.001
str-0.001
uc-0.001
Token may
Token ID743
Feature activation+1.53e+00

Pos logit contributions
 dot+0.021
 distances+0.020
 pavement+0.020
 horizon+0.020
 dashboard+0.019
Neg logit contributions
 exactly-0.019
itting-0.017
str-0.016
sm-0.016
o-0.016
Token seem
Token ID1283
Feature activation+4.55e+00

Pos logit contributions
 dot+0.059
 pavement+0.057
 distances+0.056
 dashboard+0.054
 horizon+0.054
Neg logit contributions
 exactly-0.063
itting-0.055
 known-0.055
 wrong-0.055
str-0.054
Token charming
Token ID23332
Feature activation+1.44e+00

Pos logit contributions
 dot+0.194
 distances+0.186
 pavement+0.184
 horizon+0.179
 dashboard+0.171
Neg logit contributions
 exactly-0.166
itting-0.153
sm-0.141
str-0.140
o-0.138
Token,
Token ID11
Feature activation+8.82e-01

Pos logit contributions
 dot+0.061
 pavement+0.059
 distances+0.059
 horizon+0.057
 dashboard+0.056
Neg logit contributions
 exactly-0.054
itting-0.048
sm-0.045
str-0.044
 known-0.044
Token but
Token ID475
Feature activation+6.08e-01

Pos logit contributions
 dot+0.032
 pavement+0.031
 distances+0.031
 horizon+0.030
 dashboard+0.029
Neg logit contributions
 exactly-0.038
itting-0.035
sm-0.033
str-0.032
 known-0.032
Token they
Token ID484
Feature activation+7.04e-02

Pos logit contributions
 dot+0.027
 distances+0.026
 pavement+0.026
 horizon+0.026
 dashboard+0.024
Neg logit contributions
 exactly-0.021
itting-0.019
sm-0.018
str-0.018
o-0.017
Token are
Token ID389
Feature activation+8.27e-01

Pos logit contributions
 dot+0.003
 distances+0.003
 pavement+0.003
 horizon+0.003
 dashboard+0.003
Neg logit contributions
 exactly-0.003
itting-0.002
sm-0.002
str-0.002
uc-0.002
Token things
Token ID1243
Feature activation+1.52e-01

Pos logit contributions
 dot+0.028
 pavement+0.027
 distances+0.027
 horizon+0.026
 dashboard+0.025
Neg logit contributions
 exactly-0.022
itting-0.020
sm-0.018
str-0.018
 known-0.018
Token are
Token ID389
Feature activation+8.94e-01

Pos logit contributions
 dot+0.006
 pavement+0.005
 distances+0.005
 horizon+0.005
 dashboard+0.005
Neg logit contributions
 exactly-0.007
itting-0.006
sm-0.006
str-0.006
 known-0.006
Token not
Token ID407
Feature activation-7.50e-01

Pos logit contributions
 dot+0.037
 distances+0.036
 horizon+0.035
 pavement+0.034
 wheels+0.034
Neg logit contributions
itting-0.030
 exactly-0.029
itt-0.028
bek-0.028
uc-0.028
Token what
Token ID644
Feature activation-1.31e+00

Pos logit contributions
 exactly+0.026
itting+0.025
sm+0.023
str+0.023
o+0.022
Neg logit contributions
 dot-0.033
 distances-0.031
 pavement-0.031
 horizon-0.030
 dashboard-0.029
Token they
Token ID484
Feature activation+1.29e+00

Pos logit contributions
 exactly+0.050
itting+0.045
sm+0.042
str+0.042
 known+0.041
Neg logit contributions
 dot-0.054
 distances-0.052
 pavement-0.052
 horizon-0.050
 dashboard-0.048
Token seem
Token ID1283
Feature activation+4.53e+00

Pos logit contributions
 dot+0.044
 distances+0.041
 pavement+0.041
 dashboard+0.040
 horizon+0.040
Neg logit contributions
 exactly-0.061
itting-0.056
 known-0.055
 wrong-0.053
ll-0.053
Token,
Token ID11
Feature activation+6.59e-01

Pos logit contributions
 dot+0.195
 pavement+0.189
 distances+0.187
 horizon+0.182
 dashboard+0.180
Neg logit contributions
 exactly-0.169
itting-0.150
str-0.139
sm-0.139
 known-0.138
Token just
Token ID655
Feature activation-1.51e+00

Pos logit contributions
 dot+0.024
 pavement+0.024
 distances+0.023
 horizon+0.023
 dashboard+0.022
Neg logit contributions
 exactly-0.029
itting-0.026
sm-0.024
 known-0.024
 wrong-0.023
Token like
Token ID588
Feature activation-2.25e+00

Pos logit contributions
 exactly+0.051
itting+0.046
str+0.044
sm+0.043
o+0.043
Neg logit contributions
 dot-0.068
 distances-0.065
 pavement-0.064
 horizon-0.063
 dashboard-0.060
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
itting+0.079
 exactly+0.078
str+0.077
bek+0.075
o+0.075
Neg logit contributions
 dot-0.092
 distances-0.091
 pavement-0.088
 wheels-0.084
 horizon-0.083
Token lion
Token ID18744
Feature activation-7.22e-01

Pos logit contributions
 exactly+0.005
itting+0.005
uc+0.005
sm+0.005
str+0.004
Neg logit contributions
 dot-0.004
 pavement-0.004
 horizon-0.004
 distances-0.004
 dashboard-0.004
Token matter
Token ID2300
Feature activation-1.99e-01

Pos logit contributions
 exactly+0.001
itting+0.001
str+0.001
o+0.001
ll+0.001
Neg logit contributions
 dot-0.000
 distances-0.000
 pavement-0.000
 sides-0.000
 dots-0.000
Token how
Token ID703
Feature activation-1.64e+00

Pos logit contributions
str+0.006
itting+0.006
 exactly+0.006
 known+0.006
eh+0.005
Neg logit contributions
 dot-0.009
 distances-0.009
 pavement-0.008
 myself-0.008
 sides-0.008
Token sour
Token ID11348
Feature activation+3.69e-01

Pos logit contributions
 exactly+0.058
itting+0.053
bek+0.051
 known+0.050
o+0.050
Neg logit contributions
 distances-0.063
 horizon-0.063
 dot-0.062
 sides-0.061
 pavement-0.061
Token something
Token ID1223
Feature activation+1.05e+00

Pos logit contributions
 dot+0.016
 pavement+0.015
 distances+0.015
 horizon+0.015
 surfaces+0.014
Neg logit contributions
 exactly-0.014
itting-0.012
sm-0.012
 known-0.012
str-0.011
Token might
Token ID1244
Feature activation+4.99e-01

Pos logit contributions
 dot+0.038
 pavement+0.037
 distances+0.037
 horizon+0.036
 surfaces+0.033
Neg logit contributions
 exactly-0.044
itting-0.041
 known-0.038
str-0.038
sm-0.038
Token seem
Token ID1283
Feature activation+4.51e+00

Pos logit contributions
 dot+0.016
 pavement+0.016
 distances+0.015
 dashboard+0.015
 spoiler+0.015
Neg logit contributions
 exactly-0.024
itting-0.021
str-0.021
sm-0.020
 known-0.020
Token,
Token ID11
Feature activation+5.58e-01

Pos logit contributions
 dot+0.214
 pavement+0.207
 distances+0.206
 horizon+0.200
 dashboard+0.198
Neg logit contributions
 exactly-0.147
itting-0.129
sm-0.119
str-0.118
 known-0.116
Token sweet
Token ID6029
Feature activation-5.29e-01

Pos logit contributions
 dot+0.024
 distances+0.023
 pavement+0.023
 horizon+0.022
 surfaces+0.021
Neg logit contributions
 exactly-0.020
itting-0.019
o-0.017
str-0.017
sm-0.017
Token things
Token ID1243
Feature activation-5.33e-01

Pos logit contributions
 exactly+0.022
itting+0.020
str+0.020
 known+0.020
itt+0.019
Neg logit contributions
 dot-0.019
 pavement-0.019
 horizon-0.018
 distances-0.018
 surfaces-0.017
Token might
Token ID1244
Feature activation+8.52e-02

Pos logit contributions
 exactly+0.022
itting+0.020
str+0.019
 known+0.019
o+0.019
Neg logit contributions
 dot-0.020
 distances-0.019
 pavement-0.019
 horizon-0.018
ro-0.018
Token be
Token ID307
Feature activation+8.18e-01

Pos logit contributions
 dot+0.003
 pavement+0.003
 distances+0.003
 horizon+0.003
 dashboard+0.003
Neg logit contributions
 exactly-0.004
itting-0.004
sm-0.003
 known-0.003
str-0.003
Token knew
Token ID2993
Feature activation-5.21e-01

Pos logit contributions
 exactly+0.020
itting+0.018
sm+0.017
str+0.017
 known+0.016
Neg logit contributions
 dot-0.022
 pavement-0.021
 distances-0.021
 horizon-0.021
 dashboard-0.020
Token that
Token ID326
Feature activation-2.04e+00

Pos logit contributions
itting+0.022
eh+0.020
ats+0.019
live+0.019
sm+0.019
Neg logit contributions
 dot-0.016
 wheels-0.016
 pavement-0.016
 horizon-0.015
 itself-0.015
Token sometimes
Token ID3360
Feature activation+4.53e-01

Pos logit contributions
 exactly+0.076
itting+0.076
live+0.070
str+0.070
sm+0.069
Neg logit contributions
 dot-0.083
 pavement-0.077
 distances-0.077
 horizon-0.073
 wheels-0.071
Token things
Token ID1243
Feature activation+2.09e-01

Pos logit contributions
 dot+0.021
 pavement+0.021
 distances+0.021
 horizon+0.020
 dashboard+0.020
Neg logit contributions
 exactly-0.015
itting-0.013
 helping-0.012
 wrong-0.012
 known-0.012
Token might
Token ID1244
Feature activation+5.70e-01

Pos logit contributions
 dot+0.008
 pavement+0.007
 distances+0.007
 horizon+0.007
 dashboard+0.007
Neg logit contributions
 exactly-0.009
itting-0.008
sm-0.008
str-0.008
 known-0.008
Token seem
Token ID1283
Feature activation+4.50e+00

Pos logit contributions
 dot+0.021
 pavement+0.020
 distances+0.020
 dashboard+0.020
 spoiler+0.019
Neg logit contributions
 exactly-0.025
 known-0.022
 wrong-0.022
 helping-0.022
str-0.021
Token hard
Token ID1327
Feature activation+1.78e+00

Pos logit contributions
 dot+0.201
 pavement+0.195
 distances+0.194
 horizon+0.189
 dashboard+0.185
Neg logit contributions
 exactly-0.160
itting-0.140
sm-0.130
str-0.130
 known-0.129
Token,
Token ID11
Feature activation+3.37e-01

Pos logit contributions
 dot+0.106
 pavement+0.106
 dashboard+0.103
 distances+0.102
 spoiler+0.102
Neg logit contributions
 exactly-0.037
itting-0.029
 wrong-0.027
 helping-0.025
 known-0.023
Token but
Token ID475
Feature activation+3.36e-01

Pos logit contributions
 dot+0.013
 pavement+0.013
 distances+0.013
 horizon+0.013
 dashboard+0.012
Neg logit contributions
 exactly-0.014
itting-0.012
 helping-0.012
 known-0.011
 wrong-0.011
Token with
Token ID351
Feature activation-6.75e-01

Pos logit contributions
 dot+0.016
 pavement+0.015
 distances+0.015
 horizon+0.015
 dashboard+0.015
Neg logit contributions
 exactly-0.011
itting-0.010
sm-0.009
str-0.009
 known-0.009
Token a
Token ID257
Feature activation-9.61e-01

Pos logit contributions
 exactly+0.023
itting+0.023
str+0.022
o+0.021
sm+0.020
Neg logit contributions
 dot-0.029
 distances-0.028
 pavement-0.028
 horizon-0.027
 wheels-0.026
Token didn
Token ID1422
Feature activation+2.42e-01

Pos logit contributions
 exactly+0.006
itting+0.006
sm+0.006
 known+0.005
str+0.005
Neg logit contributions
 dot-0.005
 pavement-0.005
 distances-0.005
 horizon-0.005
ro-0.005
Token't
Token ID470
Feature activation+1.16e-01

Pos logit contributions
 dot+0.012
 pavement+0.011
 distances+0.011
 horizon+0.011
 dashboard+0.011
Neg logit contributions
 exactly-0.008
itting-0.006
 wrong-0.006
 helping-0.006
sm-0.006
Token know
Token ID760
Feature activation+8.21e-01

Pos logit contributions
 horizon+0.005
 distances+0.005
 pavement+0.005
 dot+0.005
 dashboard+0.005
Neg logit contributions
 exactly-0.004
itting-0.004
 known-0.004
 wrong-0.004
o-0.004
Token what
Token ID644
Feature activation-2.28e+00

Pos logit contributions
 dot+0.034
 distances+0.034
 wheels+0.033
 dashboard+0.033
 Novel+0.032
Neg logit contributions
itting-0.027
bek-0.026
 known-0.025
str-0.025
uc-0.025
Token "
Token ID366
Feature activation+8.06e-01

Pos logit contributions
itting+0.089
 exactly+0.086
rou+0.082
bek+0.079
bi+0.076
Neg logit contributions
 Novel-0.080
 dot-0.079
 happen-0.077
 buttons-0.077
 pavement-0.075
Tokenhe
Token ID258
Feature activation+4.49e+00

Pos logit contributions
 distances+0.035
 pavement+0.035
 dot+0.035
 horizon+0.035
 dashboard+0.034
Neg logit contributions
 exactly-0.027
o-0.026
itting-0.026
sm-0.025
 known-0.024
Tokenal
Token ID282
Feature activation+1.31e+00

Pos logit contributions
 dot+0.162
 pavement+0.154
 distances+0.147
 dashboard+0.143
ro+0.142
Neg logit contributions
 exactly-0.193
itting-0.184
 known-0.170
sm-0.169
rou-0.166
Token"
Token ID1
Feature activation+2.26e+00

Pos logit contributions
 dot+0.042
ro+0.037
 distances+0.036
 pavement+0.036
 Novel+0.035
Neg logit contributions
 exactly-0.059
 known-0.056
itting-0.056
bek-0.053
sm-0.052
Token meant
Token ID4001
Feature activation+1.78e+00

Pos logit contributions
 dot+0.082
 distances+0.079
 pavement+0.077
 horizon+0.076
 dashboard+0.073
Neg logit contributions
 exactly-0.100
itting-0.091
o-0.086
 known-0.085
sm-0.085
Token but
Token ID475
Feature activation+1.32e+00

Pos logit contributions
 dot+0.057
 pavement+0.054
 Novel+0.054
 distances+0.053
 seem+0.049
Neg logit contributions
 exactly-0.079
 known-0.078
itting-0.077
eh-0.076
str-0.075
Token he
Token ID339
Feature activation+1.68e+00

Pos logit contributions
 dot+0.042
 pavement+0.037
 wheels+0.037
 pedals+0.036
 tear+0.036
Neg logit contributions
itting-0.060
str-0.055
ats-0.055
sm-0.055
uc-0.055
TokenL
Token ID43
Feature activation+1.78e+00

Pos logit contributions
 dot+0.042
 distances+0.041
 pavement+0.041
 horizon+0.039
 dashboard+0.038
Neg logit contributions
 exactly-0.037
itting-0.033
sm-0.031
str-0.031
 known-0.030
Tokenily
Token ID813
Feature activation+2.12e+00

Pos logit contributions
 dot+0.066
 pavement+0.063
 distances+0.063
 horizon+0.061
 dashboard+0.059
Neg logit contributions
 exactly-0.077
itting-0.070
sm-0.066
str-0.066
 known-0.065
Token,
Token ID11
Feature activation+1.55e+00

Pos logit contributions
 dot+0.071
 pavement+0.066
 Novel+0.064
 distances+0.061
ro+0.061
Neg logit contributions
 exactly-0.097
itting-0.089
 known-0.086
bek-0.083
sm-0.081
Token your
Token ID534
Feature activation+5.69e-02

Pos logit contributions
 dot+0.071
 pavement+0.068
 distances+0.066
 horizon+0.065
 dashboard+0.064
Neg logit contributions
 exactly-0.050
itting-0.047
 known-0.044
str-0.044
sm-0.042
Token books
Token ID3835
Feature activation+1.38e-01

Pos logit contributions
 dot+0.002
 pavement+0.002
 distances+0.002
 horizon+0.002
 dashboard+0.002
Neg logit contributions
 exactly-0.002
itting-0.002
sm-0.002
str-0.002
 known-0.002
Tokenhe
Token ID258
Feature activation+4.49e+00

Pos logit contributions
 dot+0.006
 distances+0.006
 pavement+0.005
 horizon+0.005
 dashboard+0.005
Neg logit contributions
 exactly-0.006
 wrong-0.005
itting-0.005
sm-0.005
 living-0.005
Tokenlf
Token ID1652
Feature activation+1.61e+00

Pos logit contributions
 dot+0.146
 pavement+0.136
 distances+0.135
 horizon+0.133
 dashboard+0.126
Neg logit contributions
 exactly-0.218
itting-0.197
sm-0.189
str-0.186
o-0.185
Token is
Token ID318
Feature activation+3.63e-01

Pos logit contributions
 dot+0.067
 distances+0.064
 pavement+0.064
 horizon+0.062
 dashboard+0.061
Neg logit contributions
 exactly-0.062
itting-0.055
sm-0.052
str-0.052
o-0.051
Token very
Token ID845
Feature activation-5.71e-01

Pos logit contributions
 dot+0.016
 pavement+0.015
 distances+0.015
 horizon+0.015
 dashboard+0.015
Neg logit contributions
 exactly-0.013
itting-0.011
sm-0.011
str-0.011
 wrong-0.011
Token messy
Token ID24097
Feature activation-6.42e-01

Pos logit contributions
 exactly+0.024
itting+0.021
str+0.020
sm+0.020
 wrong+0.020
Neg logit contributions
 dot-0.023
 distances-0.022
 pavement-0.022
 horizon-0.021
 dashboard-0.021
Token.
Token ID13
Feature activation+6.93e-01

Pos logit contributions
 exactly+0.018
 wrong+0.016
 helping+0.015
 living+0.015
 together+0.014
Neg logit contributions
 dot-0.033
 horizon-0.032
 spoiler-0.032
 distances-0.031
 pavement-0.031
Token "
Token ID366
Feature activation+7.69e-02

Pos logit contributions
 exactly+0.059
itting+0.053
o+0.049
str+0.049
bek+0.049
Neg logit contributions
 dot-0.075
 distances-0.074
 horizon-0.073
 pavement-0.072
 sides-0.069
Tokensimple
Token ID36439
Feature activation+3.26e+00

Pos logit contributions
 dot+0.003
 pavement+0.003
 distances+0.003
 horizon+0.003
 dashboard+0.003
Neg logit contributions
 exactly-0.003
itting-0.003
sm-0.002
str-0.002
 known-0.002
Token"
Token ID1
Feature activation+1.68e+00

Pos logit contributions
 distances+0.121
 dot+0.113
 horizon+0.110
 pavement+0.109
 sides+0.109
Neg logit contributions
 exactly-0.130
itting-0.118
Deal-0.114
rou-0.111
bek-0.110
Token it
Token ID340
Feature activation+2.25e+00

Pos logit contributions
 dot+0.067
 pavement+0.064
 distances+0.064
 horizon+0.062
 dashboard+0.060
Neg logit contributions
 exactly-0.067
itting-0.060
sm-0.057
str-0.056
 known-0.055
Token may
Token ID743
Feature activation+1.42e+00

Pos logit contributions
 dot+0.081
 pavement+0.080
 distances+0.079
 horizon+0.076
 dashboard+0.074
Neg logit contributions
itting-0.087
 exactly-0.086
sm-0.080
itt-0.079
uc-0.079
Token seem
Token ID1283
Feature activation+4.49e+00

Pos logit contributions
 dot+0.050
 pavement+0.048
 distances+0.047
 horizon+0.045
 dashboard+0.045
Neg logit contributions
 exactly-0.064
itting-0.058
 known-0.055
str-0.055
sm-0.054
Token.
Token ID13
Feature activation+5.74e-01

Pos logit contributions
 dot+0.195
 distances+0.186
 pavement+0.186
 horizon+0.182
 dashboard+0.173
Neg logit contributions
 exactly-0.162
itting-0.146
sm-0.136
str-0.134
o-0.131
Token Zoe
Token ID32280
Feature activation+1.12e+00

Pos logit contributions
 dot+0.020
 distances+0.020
 pavement+0.019
 horizon+0.019
 dashboard+0.018
Neg logit contributions
 exactly-0.025
itting-0.023
sm-0.022
str-0.022
uc-0.021
Tokeny
Token ID88
Feature activation+1.94e+00

Pos logit contributions
 dot+0.041
 distances+0.038
ro+0.037
 pavement+0.037
 horizon+0.036
Neg logit contributions
 exactly-0.047
itting-0.042
 known-0.040
sm-0.040
 feeding-0.040
Token smiled
Token ID13541
Feature activation+9.86e-02

Pos logit contributions
 dot+0.077
 distances+0.073
 pavement+0.073
 horizon+0.071
 dashboard+0.069
Neg logit contributions
 exactly-0.078
itting-0.071
sm-0.067
str-0.066
 known-0.064
Token,
Token ID11
Feature activation+8.13e-02

Pos logit contributions
 dot+0.004
 distances+0.003
 horizon+0.003
 pavement+0.003
 dashboard+0.003
Neg logit contributions
 exactly-0.004
str-0.004
itting-0.004
sm-0.004
o-0.004
Token,
Token ID11
Feature activation+5.19e-01

Pos logit contributions
itting+0.068
 known+0.067
 exactly+0.066
sm+0.065
eh+0.065
Neg logit contributions
 dot-0.052
 myself-0.052
 Novel-0.051
 pavement-0.048
 horizon-0.047
Token Lily
Token ID20037
Feature activation-2.61e-01

Pos logit contributions
 dot+0.017
 pavement+0.016
 distances+0.015
 horizon+0.015
ro+0.015
Neg logit contributions
itting-0.023
 exactly-0.022
str-0.021
sm-0.021
o-0.021
Token?
Token ID30
Feature activation+2.40e-01

Pos logit contributions
 exactly+0.011
itting+0.011
str+0.011
sm+0.011
 known+0.010
Neg logit contributions
 dot-0.008
 Novel-0.007
 myself-0.007
 pavement-0.007
ro-0.007
Token Sometimes
Token ID8975
Feature activation+7.81e-01

Pos logit contributions
 dot+0.007
 pavement+0.007
 distances+0.007
 horizon+0.007
 dashboard+0.006
Neg logit contributions
 exactly-0.012
itting-0.011
sm-0.010
str-0.010
o-0.010
Token things
Token ID1243
Feature activation+7.54e-01

Pos logit contributions
 dot+0.032
 distances+0.030
 pavement+0.030
 horizon+0.029
 dashboard+0.028
Neg logit contributions
 exactly-0.030
itting-0.028
str-0.026
sm-0.026
 known-0.026
Token seem
Token ID1283
Feature activation+4.48e+00

Pos logit contributions
 dot+0.027
 pavement+0.025
 distances+0.025
 horizon+0.024
 dashboard+0.024
Neg logit contributions
 exactly-0.034
itting-0.031
str-0.029
sm-0.029
 known-0.028
Token poor
Token ID3595
Feature activation+2.09e+00

Pos logit contributions
 dot+0.178
 distances+0.168
 pavement+0.166
 horizon+0.163
 dashboard+0.154
Neg logit contributions
 exactly-0.176
itting-0.165
sm-0.152
str-0.152
bek-0.149
Token at
Token ID379
Feature activation+3.72e-01

Pos logit contributions
 dot+0.095
 pavement+0.094
 distances+0.091
 spoiler+0.091
 dashboard+0.090
Neg logit contributions
 exactly-0.072
itting-0.061
 wrong-0.059
 helping-0.058
 feeding-0.056
Token first
Token ID717
Feature activation+1.24e+00

Pos logit contributions
 dot+0.020
 pavement+0.019
 distances+0.019
 dashboard+0.019
 spoiler+0.018
Neg logit contributions
 exactly-0.011
 wrong-0.008
itting-0.008
 known-0.008
 helping-0.008
Token,
Token ID11
Feature activation+1.85e-01

Pos logit contributions
 dot+0.057
 pavement+0.055
 distances+0.054
 horizon+0.054
 dashboard+0.053
Neg logit contributions
 exactly-0.043
itting-0.037
 wrong-0.034
sm-0.034
 helping-0.033
Token but
Token ID475
Feature activation+3.41e-01

Pos logit contributions
 dot+0.007
 pavement+0.007
 distances+0.007
 horizon+0.007
 dashboard+0.007
Neg logit contributions
 exactly-0.008
itting-0.007
sm-0.006
 known-0.006
str-0.006
Token They
Token ID1119
Feature activation-1.49e-01

Pos logit contributions
 exactly+0.014
itting+0.012
 known+0.012
o+0.011
str+0.011
Neg logit contributions
 dot-0.012
 pavement-0.011
 distances-0.011
 horizon-0.010
 surfaces-0.010
Token had
Token ID550
Feature activation-1.27e+00

Pos logit contributions
 exactly+0.006
itting+0.006
str+0.005
sm+0.005
 wrong+0.005
Neg logit contributions
 dot-0.006
 distances-0.006
 pavement-0.006
 horizon-0.005
 widened-0.005
Token never
Token ID1239
Feature activation-9.56e-01

Pos logit contributions
itting+0.046
 exactly+0.045
str+0.040
sm+0.040
itt+0.039
Neg logit contributions
 distances-0.053
 dot-0.052
 pavement-0.052
 horizon-0.049
 dashboard-0.049
Token seen
Token ID1775
Feature activation-7.90e-01

Pos logit contributions
 exactly+0.037
itting+0.033
 known+0.031
sm+0.031
str+0.031
Neg logit contributions
 dot-0.040
 pavement-0.038
 distances-0.038
 horizon-0.037
 dashboard-0.036
Token anyone
Token ID2687
Feature activation+2.20e+00

Pos logit contributions
itting+0.030
 exactly+0.030
str+0.028
bek+0.027
uc+0.026
Neg logit contributions
 dot-0.031
 distances-0.031
 pavement-0.030
 horizon-0.029
 dashboard-0.028
Token un
Token ID555
Feature activation+4.37e+00

Pos logit contributions
 dot+0.092
 distances+0.089
 pavement+0.088
 horizon+0.086
 dashboard+0.082
Neg logit contributions
 exactly-0.083
itting-0.076
str-0.070
 known-0.070
uc-0.069
Tokenpack
Token ID8002
Feature activation-1.46e-01

Pos logit contributions
 distances+0.167
 dot+0.165
 pavement+0.160
 horizon+0.155
 dashboard+0.155
Neg logit contributions
 exactly-0.176
itting-0.159
str-0.152
sm-0.151
 known-0.148
Token vegetables
Token ID13701
Feature activation-2.52e-01

Pos logit contributions
 exactly+0.005
itting+0.005
str+0.005
o+0.005
 known+0.005
Neg logit contributions
 distances-0.006
 dot-0.006
 pavement-0.006
 horizon-0.006
 dashboard-0.005
Token so
Token ID523
Feature activation+2.16e-01

Pos logit contributions
 exactly+0.010
itting+0.010
str+0.009
sm+0.009
 known+0.009
Neg logit contributions
 dot-0.010
 distances-0.009
 horizon-0.009
 pavement-0.009
ro-0.009
Token quickly
Token ID2952
Feature activation+1.19e+00

Pos logit contributions
 dot+0.009
 distances+0.009
 pavement+0.009
 horizon+0.009
ro+0.009
Neg logit contributions
 exactly-0.007
itting-0.007
str-0.006
o-0.006
 known-0.006
Token before
Token ID878
Feature activation-3.95e-01

Pos logit contributions
 dot+0.051
 pavement+0.050
 distances+0.050
 horizon+0.048
 dashboard+0.047
Neg logit contributions
 exactly-0.043
itting-0.040
str-0.037
sm-0.036
 known-0.036
Token her
Token ID607
Feature activation-9.72e-02

Pos logit contributions
 dot+0.027
 pavement+0.027
 distances+0.027
 horizon+0.025
 surfaces+0.025
Neg logit contributions
 exactly-0.020
itting-0.019
str-0.018
 known-0.017
o-0.017
Token that
Token ID326
Feature activation-8.00e-01

Pos logit contributions
itting+0.004
eh+0.004
bek+0.003
Deal+0.003
str+0.003
Neg logit contributions
 Novel-0.003
 seem-0.003
 dot-0.003
 wheels-0.003
 pavement-0.003
Token sometimes
Token ID3360
Feature activation+1.30e+00

Pos logit contributions
 exactly+0.032
itting+0.032
str+0.028
uc+0.028
bek+0.028
Neg logit contributions
 dot-0.031
 distances-0.029
 pavement-0.029
 horizon-0.028
 dashboard-0.027
Token things
Token ID1243
Feature activation+7.69e-01

Pos logit contributions
 dot+0.057
 distances+0.055
 pavement+0.055
 horizon+0.053
 dashboard+0.051
Neg logit contributions
 exactly-0.048
itting-0.043
sm-0.040
str-0.039
 known-0.039
Token can
Token ID460
Feature activation+7.36e-01

Pos logit contributions
 dot+0.029
 pavement+0.028
 distances+0.028
 horizon+0.027
 dashboard+0.026
Neg logit contributions
 exactly-0.032
itting-0.029
sm-0.027
str-0.027
 known-0.027
Token seem
Token ID1283
Feature activation+4.36e+00

Pos logit contributions
 pavement+0.029
 distances+0.029
 dot+0.028
 dashboard+0.027
 horizon+0.027
Neg logit contributions
 exactly-0.027
 wrong-0.026
 known-0.025
 helping-0.024
ll-0.024
Token scary
Token ID14343
Feature activation+1.67e+00

Pos logit contributions
 dot+0.192
 distances+0.184
 pavement+0.182
 horizon+0.178
 dashboard+0.169
Neg logit contributions
 exactly-0.153
itting-0.140
sm-0.129
str-0.128
bek-0.127
Token,
Token ID11
Feature activation+5.59e-01

Pos logit contributions
 dot+0.081
 pavement+0.079
 horizon+0.078
 spoiler+0.077
 dashboard+0.077
Neg logit contributions
 exactly-0.053
itting-0.045
 wrong-0.044
sm-0.043
 helping-0.042
Token but
Token ID475
Feature activation+6.79e-01

Pos logit contributions
 dot+0.021
 pavement+0.020
 horizon+0.020
 distances+0.020
 dashboard+0.020
Neg logit contributions
 exactly-0.024
itting-0.021
 wrong-0.020
 helping-0.020
 known-0.019
Token they
Token ID484
Feature activation+2.94e-01

Pos logit contributions
 dot+0.031
 pavement+0.030
 distances+0.030
 horizon+0.029
 dashboard+0.028
Neg logit contributions
 exactly-0.023
itting-0.021
sm-0.019
str-0.019
 known-0.019
Token're
Token ID821
Feature activation+1.05e+00

Pos logit contributions
 dot+0.012
 distances+0.011
 pavement+0.011
 horizon+0.011
 dashboard+0.010
Neg logit contributions
 exactly-0.012
itting-0.011
sm-0.010
str-0.010
 known-0.010
Token wanted
Token ID2227
Feature activation+9.32e-01

Pos logit contributions
 exactly+0.014
sm+0.012
itting+0.012
 known+0.011
o+0.011
Neg logit contributions
 distances-0.017
 dot-0.017
 pavement-0.017
 horizon-0.016
 dashboard-0.016
Token to
Token ID284
Feature activation-2.47e-01

Pos logit contributions
 dot+0.044
 distances+0.043
 horizon+0.042
 pavement+0.042
 wheels+0.040
Neg logit contributions
itting-0.030
 exactly-0.026
bek-0.026
str-0.025
live-0.025
Token increase
Token ID2620
Feature activation-2.86e-01

Pos logit contributions
 exactly+0.011
 known+0.009
itting+0.009
str+0.009
sm+0.009
Neg logit contributions
 dot-0.009
 distances-0.009
 pavement-0.009
 horizon-0.009
 dashboard-0.008
Token.
Token ID13
Feature activation-2.91e-01

Pos logit contributions
 exactly+0.010
itting+0.010
str+0.010
o+0.009
 known+0.009
Neg logit contributions
 dot-0.012
 distances-0.012
 pavement-0.011
 horizon-0.011
 surfaces-0.011
Token By
Token ID2750
Feature activation-1.22e+00

Pos logit contributions
 exactly+0.014
itting+0.013
sm+0.012
str+0.012
 known+0.012
Neg logit contributions
 dot-0.010
 pavement-0.009
 distances-0.009
 horizon-0.009
 dashboard-0.008
Token living
Token ID2877
Feature activation-5.33e+00

Pos logit contributions
 exactly+0.040
itting+0.036
str+0.033
sm+0.033
o+0.032
Neg logit contributions
 dot-0.058
 distances-0.055
 pavement-0.055
 horizon-0.054
 dashboard-0.052
Token life
Token ID1204
Feature activation+9.50e-01

Pos logit contributions
str+0.237
 exactly+0.236
itting+0.234
sm+0.217
o+0.214
Neg logit contributions
 dot-0.170
 distances-0.169
 surfaces-0.155
 pavement-0.153
 waters-0.152
Token,
Token ID11
Feature activation-6.14e-01

Pos logit contributions
 dot+0.040
 distances+0.039
 pavement+0.038
 horizon+0.038
ro+0.036
Neg logit contributions
 exactly-0.035
itting-0.032
sm-0.030
 known-0.029
o-0.029
Token it
Token ID340
Feature activation-7.56e-01

Pos logit contributions
 exactly+0.024
itting+0.023
sm+0.021
o+0.021
str+0.021
Neg logit contributions
 dot-0.025
 pavement-0.023
 horizon-0.023
 distances-0.023
 dashboard-0.021
Token found
Token ID1043
Feature activation-6.34e-01

Pos logit contributions
 exactly+0.027
itting+0.024
sm+0.022
str+0.022
 known+0.022
Neg logit contributions
 dot-0.034
 pavement-0.033
 distances-0.032
 horizon-0.031
 dashboard-0.031
Token joy
Token ID8716
Feature activation+4.14e-01

Pos logit contributions
itting+0.024
 exactly+0.021
bek+0.021
str+0.021
live+0.021
Neg logit contributions
 dot-0.026
 distances-0.024
 horizon-0.024
 pavement-0.023
 sides-0.023
Token
Token ID220
Feature activation-1.04e+00

Pos logit contributions
 exactly+0.030
itting+0.028
str+0.027
 wrong+0.027
sm+0.027
Neg logit contributions
 dot-0.015
 pavement-0.015
 distances-0.014
 horizon-0.014
ro-0.013
Token She
Token ID1375
Feature activation-5.85e-01

Pos logit contributions
 exactly+0.042
itting+0.037
sm+0.036
 known+0.035
str+0.035
Neg logit contributions
 dot-0.039
 pavement-0.039
 distances-0.038
 horizon-0.037
ro-0.036
Token grabbed
Token ID13176
Feature activation-1.26e+00

Pos logit contributions
 exactly+0.021
itting+0.019
sm+0.018
str+0.018
 known+0.017
Neg logit contributions
 dot-0.025
 pavement-0.024
 distances-0.024
 horizon-0.023
 dashboard-0.023
Token oven
Token ID14361
Feature activation-4.28e+00

Pos logit contributions
itting+0.049
 exactly+0.047
str+0.045
ll+0.045
rou+0.044
Neg logit contributions
 dot-0.048
 pavement-0.046
 distances-0.045
 wheels-0.045
 sides-0.044
Token m
Token ID285
Feature activation-1.94e+00

Pos logit contributions
 exactly+0.207
itting+0.203
 known+0.197
str+0.194
sm+0.192
Neg logit contributions
 dot-0.123
 pavement-0.120
 horizon-0.118
 distances-0.115
ro-0.111
Tokenitt
Token ID715
Feature activation-5.04e+00

Pos logit contributions
 exactly+0.042
 wrong+0.034
 known+0.034
itting+0.030
 living+0.030
Neg logit contributions
 pavement-0.109
 distances-0.108
 dot-0.107
 horizon-0.104
 dashboard-0.102
Tokens
Token ID82
Feature activation-4.16e-02

Pos logit contributions
 exactly+0.206
 known+0.195
Deal+0.194
itting+0.187
bek+0.184
Neg logit contributions
 dot-0.189
 distances-0.187
 pavement-0.170
en-0.169
 horizon-0.168
Token and
Token ID290
Feature activation-1.32e-01

Pos logit contributions
 exactly+0.002
itting+0.002
 known+0.002
str+0.002
bek+0.002
Neg logit contributions
 dot-0.001
 distances-0.001
 pavement-0.001
 wheels-0.001
 sides-0.001
Token carefully
Token ID7773
Feature activation-8.97e-02

Pos logit contributions
 exactly+0.006
itting+0.005
str+0.005
 wrong+0.005
sm+0.005
Neg logit contributions
 dot-0.005
 pavement-0.005
 distances-0.004
 horizon-0.004
 wheels-0.004
Token pulled
Token ID5954
Feature activation-1.54e+00

Pos logit contributions
 exactly+0.004
itting+0.004
str+0.004
 known+0.004
 wrong+0.004
Neg logit contributions
 dot-0.003
 distances-0.003
 pavement-0.003
 horizon-0.003
 dashboard-0.002
Token out
Token ID503
Feature activation-1.08e+00

Pos logit contributions
itting+0.056
 exactly+0.055
str+0.053
 known+0.052
ll+0.050
Neg logit contributions
 dot-0.063
 pavement-0.060
 distances-0.060
 wheels-0.058
 sides-0.057
Token little
Token ID1310
Feature activation+9.85e-02

Pos logit contributions
 exactly+0.022
itting+0.021
sm+0.020
o+0.019
 known+0.019
Neg logit contributions
 dot-0.018
 pavement-0.017
 horizon-0.016
 dashboard-0.016
 wheels-0.015
Token boy
Token ID2933
Feature activation+5.75e-02

Pos logit contributions
 dot+0.004
 pavement+0.003
 distances+0.003
 horizon+0.003
 dashboard+0.003
Neg logit contributions
 exactly-0.004
itting-0.004
sm-0.004
str-0.004
 known-0.004
Token getting
Token ID1972
Feature activation+2.62e-01

Pos logit contributions
 dot+0.003
 distances+0.003
 pavement+0.003
 dashboard+0.002
 wheels+0.002
Neg logit contributions
 exactly-0.002
itting-0.002
str-0.002
 known-0.002
bek-0.002
Token a
Token ID257
Feature activation-4.96e-01

Pos logit contributions
 dot+0.009
 wheels+0.008
 distances+0.008
 pavement+0.008
 pedals+0.008
Neg logit contributions
itting-0.011
 exactly-0.011
str-0.011
bek-0.011
live-0.010
Token buzz
Token ID14713
Feature activation-2.70e-01

Pos logit contributions
 exactly+0.022
itting+0.021
 known+0.020
o+0.020
sm+0.020
Neg logit contributions
 dot-0.018
 pavement-0.016
 horizon-0.015
 dashboard-0.015
ro-0.015
Tokencut
Token ID8968
Feature activation-4.85e+00

Pos logit contributions
 exactly+0.011
itting+0.009
 known+0.009
str+0.009
bek+0.009
Neg logit contributions
 dot-0.011
 distances-0.011
 pavement-0.011
ro-0.010
 horizon-0.010
Token and
Token ID290
Feature activation+1.69e-01

Pos logit contributions
 exactly+0.213
itting+0.201
 known+0.184
sm+0.184
str+0.181
Neg logit contributions
 dot-0.183
 pavement-0.168
 distances-0.165
 horizon-0.161
 dashboard-0.157
Token a
Token ID257
Feature activation-4.39e-01

Pos logit contributions
 dot+0.006
 distances+0.006
 pavement+0.006
 wheels+0.006
 horizon+0.006
Neg logit contributions
 exactly-0.007
itting-0.007
str-0.006
sm-0.006
bek-0.006
Token lady
Token ID10846
Feature activation-1.52e+00

Pos logit contributions
 exactly+0.021
itting+0.020
live+0.019
ll+0.019
 known+0.019
Neg logit contributions
 dot-0.016
 pavement-0.013
 dashboard-0.012
 tear-0.012
 wheels-0.012
Token getting
Token ID1972
Feature activation+2.38e-01

Pos logit contributions
 exactly+0.056
itting+0.053
live+0.048
bek+0.048
str+0.047
Neg logit contributions
 dot-0.067
 distances-0.059
 pavement-0.057
 widened-0.057
 dashboard-0.056
Token a
Token ID257
Feature activation-2.79e-01

Pos logit contributions
 dot+0.008
 wheels+0.008
 distances+0.008
 buttons+0.008
 pedals+0.008
Neg logit contributions
itting-0.010
 exactly-0.010
bek-0.010
live-0.010
str-0.009
Token to
Token ID284
Feature activation-8.40e-01

Pos logit contributions
itting+0.003
 exactly+0.003
itt+0.003
uc+0.003
sm+0.003
Neg logit contributions
 pavement-0.005
 distances-0.005
 horizon-0.005
 wheels-0.005
ro-0.005
Token take
Token ID1011
Feature activation-8.62e-01

Pos logit contributions
 exactly+0.040
itting+0.037
sm+0.035
str+0.034
 known+0.034
Neg logit contributions
 dot-0.027
 pavement-0.026
 distances-0.026
 horizon-0.025
 dashboard-0.024
Token a
Token ID257
Feature activation-8.89e-01

Pos logit contributions
 exactly+0.031
itting+0.029
str+0.026
sm+0.026
o+0.025
Neg logit contributions
 dot-0.037
 distances-0.037
 pavement-0.037
 horizon-0.035
 surfaces-0.034
Token short
Token ID1790
Feature activation+3.02e-01

Pos logit contributions
 exactly+0.043
itting+0.039
uc+0.038
sm+0.037
 known+0.037
Neg logit contributions
 dot-0.029
 pavement-0.026
 horizon-0.026
 distances-0.026
 dashboard-0.024
Token-
Token ID12
Feature activation-1.97e+00

Pos logit contributions
 dot+0.011
 pavement+0.010
 distances+0.010
 horizon+0.010
 dashboard+0.009
Neg logit contributions
 exactly-0.014
itting-0.012
sm-0.012
str-0.012
 known-0.012
Tokencut
Token ID8968
Feature activation-4.84e+00

Pos logit contributions
 exactly+0.090
itting+0.076
 feeding+0.075
uc+0.075
 helping+0.074
Neg logit contributions
 dot-0.066
 pavement-0.065
 distances-0.064
ro-0.061
 horizon-0.059
Token.
Token ID13
Feature activation-7.98e-01

Pos logit contributions
itting+0.226
 exactly+0.225
sm+0.208
str+0.204
 known+0.203
Neg logit contributions
 dot-0.158
 distances-0.143
 pavement-0.141
 horizon-0.138
 dashboard-0.135
Token
Token ID198
Feature activation-6.73e-01

Pos logit contributions
 exactly+0.038
itting+0.035
sm+0.033
str+0.033
 known+0.032
Neg logit contributions
 dot-0.026
 pavement-0.025
 distances-0.025
 horizon-0.024
ro-0.023
Token
Token ID198
Feature activation-4.56e-01

Pos logit contributions
 exactly+0.027
itting+0.024
str+0.022
sm+0.022
 known+0.021
Neg logit contributions
 dot-0.027
 distances-0.026
 pavement-0.026
 horizon-0.025
ro-0.024
TokenShe
Token ID3347
Feature activation-5.46e-01

Pos logit contributions
 exactly+0.018
itting+0.016
o+0.015
str+0.015
sm+0.015
Neg logit contributions
 dot-0.018
 distances-0.018
 pavement-0.017
 horizon-0.017
ro-0.016
Token followed
Token ID3940
Feature activation+4.10e-01

Pos logit contributions
 exactly+0.021
itting+0.019
sm+0.018
str+0.018
 known+0.018
Neg logit contributions
 dot-0.022
 distances-0.021
 pavement-0.021
 horizon-0.021
 dashboard-0.020
Token ever
Token ID1683
Feature activation-2.49e-01

Pos logit contributions
 exactly+0.068
itting+0.066
 known+0.060
str+0.060
uc+0.059
Neg logit contributions
 dot-0.063
 distances-0.058
 pavement-0.057
 dashboard-0.055
ro-0.054
Token,
Token ID11
Feature activation+2.21e-01

Pos logit contributions
 exactly+0.009
itting+0.008
str+0.008
sm+0.008
 known+0.008
Neg logit contributions
 dot-0.011
 distances-0.010
 pavement-0.010
 horizon-0.010
 dashboard-0.010
Token made
Token ID925
Feature activation-4.44e-01

Pos logit contributions
 dot+0.008
 distances+0.007
 pavement+0.007
 horizon+0.007
 dashboard+0.007
Neg logit contributions
 exactly-0.010
itting-0.009
str-0.009
sm-0.009
uc-0.009
Token from
Token ID422
Feature activation+1.70e+00

Pos logit contributions
itting+0.017
 exactly+0.017
str+0.016
 known+0.015
bek+0.015
Neg logit contributions
 dot-0.016
 pavement-0.016
 distances-0.016
 sides-0.015
 wheels-0.015
Token flexible
Token ID12846
Feature activation-1.64e+00

Pos logit contributions
 dot+0.073
 distances+0.072
 pavement+0.071
 horizon+0.069
 dashboard+0.067
Neg logit contributions
 exactly-0.060
itting-0.058
str-0.053
sm-0.051
o-0.050
Token wire
Token ID6503
Feature activation-4.68e+00

Pos logit contributions
 exactly+0.072
itting+0.062
sm+0.062
 living+0.061
 wrong+0.061
Neg logit contributions
 dot-0.056
 pavement-0.053
 horizon-0.053
 distances-0.051
 dashboard-0.051
Token!
Token ID0
Feature activation-3.97e-01

Pos logit contributions
 exactly+0.178
itting+0.170
str+0.160
 known+0.158
sm+0.150
Neg logit contributions
 dot-0.194
 distances-0.191
 pavement-0.186
 horizon-0.178
 dashboard-0.175
Token<|endoftext|>
Token ID50256
Feature activation-3.59e-01

Pos logit contributions
 exactly+0.017
itting+0.015
sm+0.014
str+0.014
 known+0.014
Neg logit contributions
 dot-0.015
 pavement-0.014
 distances-0.014
 horizon-0.014
 dashboard-0.013
TokenOnce
Token ID7454
Feature activation+8.61e-03

Pos logit contributions
 exactly+0.013
itting+0.011
sm+0.011
str+0.011
 known+0.010
Neg logit contributions
 dot-0.016
 pavement-0.015
 distances-0.015
 horizon-0.015
 dashboard-0.014
Token upon
Token ID2402
Feature activation-4.87e-01

Pos logit contributions
 dot+0.000
 horizon+0.000
 distances+0.000
 pavement+0.000
 sides+0.000
Neg logit contributions
 exactly-0.000
 feeding-0.000
sm-0.000
itting-0.000
bek-0.000
Token a
Token ID257
Feature activation-1.06e+00

Pos logit contributions
 exactly+0.013
itting+0.010
sm+0.010
 known+0.010
o+0.009
Neg logit contributions
 dot-0.026
 pavement-0.026
 horizon-0.026
 distances-0.025
 dashboard-0.025
Token They
Token ID1119
Feature activation+5.92e-01

Pos logit contributions
 dot+0.015
 distances+0.015
 pavement+0.014
 spoiler+0.014
 horizon+0.014
Neg logit contributions
 exactly-0.016
itting-0.015
sm-0.014
 living-0.014
 feeding-0.014
Token put
Token ID1234
Feature activation+7.40e-01

Pos logit contributions
 dot+0.021
 pavement+0.019
 distances+0.019
 dashboard+0.019
 horizon+0.019
Neg logit contributions
itting-0.026
 exactly-0.025
str-0.022
sm-0.022
rou-0.022
Token on
Token ID319
Feature activation+2.53e-01

Pos logit contributions
 dot+0.031
 distances+0.030
 wheels+0.030
 pavement+0.030
 horizon+0.029
Neg logit contributions
itting-0.027
 exactly-0.026
rou-0.024
str-0.024
 known-0.023
Token oven
Token ID14361
Feature activation-3.89e+00

Pos logit contributions
 distances+0.010
 pavement+0.010
 dot+0.010
 wheels+0.010
 heads+0.009
Neg logit contributions
itting-0.010
 exactly-0.010
str-0.009
rou-0.009
 known-0.009
Token m
Token ID285
Feature activation-1.50e+00

Pos logit contributions
 exactly+0.168
itting+0.159
str+0.149
 known+0.149
sm+0.146
Neg logit contributions
 dot-0.138
 pavement-0.136
 distances-0.133
 horizon-0.131
 dashboard-0.127
Tokenitt
Token ID715
Feature activation-4.63e+00

Pos logit contributions
 exactly+0.037
itting+0.030
 known+0.028
 wrong+0.027
str+0.027
Neg logit contributions
 pavement-0.081
 dot-0.081
 distances-0.081
 horizon-0.078
 dashboard-0.078
Tokens
Token ID82
Feature activation+5.48e-01

Pos logit contributions
 exactly+0.188
itting+0.171
 known+0.168
Deal+0.163
bek+0.162
Neg logit contributions
 dot-0.182
 distances-0.180
 pavement-0.168
 horizon-0.165
ro-0.163
Token and
Token ID290
Feature activation+6.23e-01

Pos logit contributions
 distances+0.021
 dot+0.020
 pavement+0.020
ro+0.019
 wheels+0.019
Neg logit contributions
 exactly-0.022
itting-0.021
 known-0.019
str-0.019
bek-0.019
Token took
Token ID1718
Feature activation-5.84e-01

Pos logit contributions
 dot+0.022
 pavement+0.021
 distances+0.021
 horizon+0.020
 dashboard+0.020
Neg logit contributions
 exactly-0.027
itting-0.026
str-0.024
sm-0.023
 known-0.023
Token out
Token ID503
Feature activation-9.58e-01

Pos logit contributions
 exactly+0.021
itting+0.020
str+0.018
rou+0.017
 known+0.017
Neg logit contributions
 dot-0.026
 distances-0.025
 pavement-0.024
 horizon-0.023
 dashboard-0.023
Token the
Token ID262
Feature activation+1.45e-01

Pos logit contributions
itting+0.042
str+0.040
live+0.039
ll+0.038
Deal+0.038
Neg logit contributions
 sides-0.032
 wheels-0.032
 buttons-0.032
 dot-0.031
 distances-0.031
Token He
Token ID679
Feature activation-6.07e-01

Pos logit contributions
 exactly+0.047
itting+0.044
sm+0.042
str+0.042
 known+0.041
Neg logit contributions
 dot-0.024
 pavement-0.022
 distances-0.022
 horizon-0.021
 dashboard-0.020
Token jumped
Token ID11687
Feature activation-5.94e-01

Pos logit contributions
 exactly+0.023
itting+0.020
sm+0.019
str+0.019
 known+0.019
Neg logit contributions
 dot-0.026
 pavement-0.025
 distances-0.025
 horizon-0.024
 dashboard-0.024
Token around
Token ID1088
Feature activation-5.79e-01

Pos logit contributions
 exactly+0.024
itting+0.021
str+0.020
sm+0.020
 known+0.020
Neg logit contributions
 dot-0.024
 distances-0.023
 pavement-0.023
 horizon-0.022
 dashboard-0.022
Token with
Token ID351
Feature activation-6.25e-01

Pos logit contributions
 exactly+0.025
itting+0.024
str+0.023
 known+0.022
o+0.022
Neg logit contributions
 dot-0.021
 distances-0.021
 pavement-0.020
 horizon-0.019
ro-0.019
Token gust
Token ID35253
Feature activation-2.39e+00

Pos logit contributions
 exactly+0.028
itting+0.025
sm+0.024
 known+0.023
str+0.023
Neg logit contributions
 dot-0.023
 pavement-0.022
 distances-0.021
 horizon-0.021
 dashboard-0.020
Tokeno
Token ID78
Feature activation-4.62e+00

Pos logit contributions
 exactly+0.086
 known+0.075
Deal+0.073
str+0.073
 wrong+0.073
Neg logit contributions
 dot-0.111
 distances-0.105
ro-0.100
 pavement-0.097
 horizon-0.093
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
 exactly+0.194
 known+0.178
str+0.170
itting+0.169
Deal+0.165
Neg logit contributions
 dot-0.185
 distances-0.172
 pavement-0.169
ro-0.164
 horizon-0.163
Token spun
Token ID26843
Feature activation-1.45e+00

Pos logit contributions
 exactly+0.044
itting+0.043
sm+0.039
str+0.038
uc+0.037
Neg logit contributions
 dot-0.038
 pavement-0.037
 distances-0.037
 horizon-0.035
ro-0.034
Token in
Token ID287
Feature activation-6.85e-01

Pos logit contributions
 exactly+0.059
itting+0.053
sm+0.050
str+0.049
 known+0.049
Neg logit contributions
 dot-0.057
 pavement-0.054
 distances-0.054
 horizon-0.052
 dashboard-0.051
Token circles
Token ID13332
Feature activation-1.98e+00

Pos logit contributions
 exactly+0.026
itting+0.022
sm+0.021
 known+0.020
 helping+0.020
Neg logit contributions
 dot-0.029
 pavement-0.028
 distances-0.028
 horizon-0.027
 dashboard-0.027
Token as
Token ID355
Feature activation-8.77e-01

Pos logit contributions
 exactly+0.082
str+0.077
bek+0.075
itting+0.075
 known+0.074
Neg logit contributions
 dot-0.076
 distances-0.074
 pavement-0.072
 horizon-0.068
ro-0.067
Token.
Token ID13
Feature activation+4.10e-01

Pos logit contributions
 dot+0.010
 pavement+0.010
 distances+0.009
 horizon+0.009
 spoiler+0.009
Neg logit contributions
 exactly-0.008
itting-0.006
sm-0.006
 wrong-0.006
o-0.006
Token They
Token ID1119
Feature activation+6.82e-01

Pos logit contributions
 pavement+0.016
 distances+0.016
 horizon+0.016
 dot+0.015
 widened+0.015
Neg logit contributions
 exactly-0.015
o-0.015
itt-0.014
itting-0.014
 Mitt-0.014
Token are
Token ID389
Feature activation+1.03e+00

Pos logit contributions
 dot+0.024
 pavement+0.023
 distances+0.023
 horizon+0.022
 dashboard+0.021
Neg logit contributions
 exactly-0.031
itting-0.028
sm-0.027
str-0.027
 known-0.026
Token for
Token ID329
Feature activation-1.01e-01

Pos logit contributions
 dot+0.043
 distances+0.042
 wheels+0.042
 pavement+0.041
 edges+0.039
Neg logit contributions
itting-0.036
str-0.033
bek-0.033
 exactly-0.032
uc-0.032
Token wood
Token ID4898
Feature activation-1.38e+00

Pos logit contributions
 exactly+0.003
itting+0.003
str+0.002
sm+0.002
bek+0.002
Neg logit contributions
 dot-0.005
 distances-0.005
 pavement-0.005
 horizon-0.005
 wheels-0.005
Tokencut
Token ID8968
Feature activation-4.61e+00

Pos logit contributions
 exactly+0.059
itting+0.053
str+0.050
sm+0.050
 known+0.049
Neg logit contributions
 dot-0.052
 distances-0.049
 pavement-0.049
 horizon-0.048
 dashboard-0.046
Tokenters
Token ID1010
Feature activation-1.56e+00

Pos logit contributions
 exactly+0.183
itting+0.164
sm+0.153
str+0.152
o+0.151
Neg logit contributions
 dot-0.187
 pavement-0.179
 distances-0.178
 horizon-0.173
 dashboard-0.168
Token."
Token ID526
Feature activation+6.61e-01

Pos logit contributions
 exactly+0.069
itting+0.064
str+0.061
 known+0.060
bek+0.060
Neg logit contributions
 dot-0.055
 distances-0.054
 pavement-0.051
ro-0.049
 dashboard-0.049
Token They
Token ID1119
Feature activation+1.69e-01

Pos logit contributions
 dot+0.028
 distances+0.027
 pavement+0.027
 horizon+0.026
 dashboard+0.025
Neg logit contributions
 exactly-0.025
itting-0.023
sm-0.021
str-0.021
o-0.021
Token took
Token ID1718
Feature activation-1.01e-01

Pos logit contributions
 widened+0.006
 dot+0.006
 pavement+0.005
 dashboard+0.005
 wheels+0.005
Neg logit contributions
itting-0.007
 exactly-0.007
str-0.006
rou-0.006
sm-0.006
Token Lily
Token ID20037
Feature activation-4.86e-01

Pos logit contributions
 exactly+0.003
itting+0.003
str+0.003
sm+0.003
 known+0.003
Neg logit contributions
 dot-0.005
 pavement-0.005
 distances-0.005
 horizon-0.004
 dashboard-0.004
Token went
Token ID1816
Feature activation-4.45e-01

Pos logit contributions
 dot+0.030
 pavement+0.028
ro+0.028
 distances+0.027
 horizon+0.026
Neg logit contributions
 exactly-0.034
itting-0.031
 wrong-0.029
sm-0.029
str-0.028
Token back
Token ID736
Feature activation-2.09e+00

Pos logit contributions
 exactly+0.019
str+0.018
itting+0.018
bek+0.017
sm+0.017
Neg logit contributions
 distances-0.016
 pavement-0.015
 dot-0.015
 horizon-0.015
 lands-0.014
Token to
Token ID284
Feature activation-1.06e+00

Pos logit contributions
 exactly+0.057
itting+0.049
str+0.044
 known+0.042
sm+0.042
Neg logit contributions
 dot-0.110
 distances-0.108
 pavement-0.107
 horizon-0.104
ro-0.101
Token playing
Token ID2712
Feature activation-1.17e+00

Pos logit contributions
 exactly+0.042
itting+0.041
str+0.038
live+0.037
sm+0.037
Neg logit contributions
 dot-0.043
 pavement-0.042
 distances-0.041
 arc-0.038
 surfaces-0.037
Token with
Token ID351
Feature activation+2.32e-01

Pos logit contributions
 exactly+0.045
itting+0.044
str+0.043
 known+0.041
o+0.040
Neg logit contributions
 dot-0.047
 pavement-0.046
 distances-0.046
 surfaces-0.043
 wheels-0.042
Token dough
Token ID15756
Feature activation-4.58e+00

Pos logit contributions
 dot+0.009
 wheels+0.008
 sides+0.008
 pavement+0.008
 distances+0.008
Neg logit contributions
itting-0.010
 exactly-0.009
str-0.009
ll-0.009
bek-0.009
Token.
Token ID13
Feature activation-4.32e-01

Pos logit contributions
 exactly+0.208
itting+0.203
str+0.199
 known+0.192
bek+0.189
Neg logit contributions
 dot-0.158
 distances-0.149
 pavement-0.145
 horizon-0.138
ro-0.135
Token<|endoftext|>
Token ID50256
Feature activation+1.67e-01

Pos logit contributions
 exactly+0.019
itting+0.018
str+0.016
 known+0.016
 wrong+0.016
Neg logit contributions
 dot-0.016
 pavement-0.015
 distances-0.014
 horizon-0.014
 spoiler-0.013
TokenOnce
Token ID7454
Feature activation+6.98e-02

Pos logit contributions
 dot+0.007
 distances+0.007
 pavement+0.007
 horizon+0.007
 dashboard+0.006
Neg logit contributions
 exactly-0.006
itting-0.006
str-0.005
sm-0.005
 known-0.005
Token upon
Token ID2402
Feature activation-3.99e-01

Pos logit contributions
 dot+0.003
 horizon+0.003
 distances+0.003
 pavement+0.003
 sides+0.003
Neg logit contributions
 exactly-0.002
itting-0.002
bek-0.002
 known-0.002
 feeding-0.002
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
 exactly+0.011
itting+0.009
sm+0.009
 known+0.008
o+0.008
Neg logit contributions
 dot-0.021
 pavement-0.021
 distances-0.020
 horizon-0.020
 dashboard-0.020
Token time
Token ID640
Feature activation-2.61e-01

Pos logit contributions
 exactly+0.059
uc+0.053
 known+0.053
itting+0.052
bek+0.051
Neg logit contributions
 dot-0.044
 distances-0.042
 pavement-0.040
 horizon-0.039
ro-0.038
Token there
Token ID612
Feature activation+5.08e-01

Pos logit contributions
 exactly+0.009
itting+0.008
 known+0.008
sm+0.008
str+0.008
Neg logit contributions
 dot-0.011
 distances-0.011
 pavement-0.011
 horizon-0.010
 dashboard-0.010
Token was
Token ID373
Feature activation-1.76e-02

Pos logit contributions
 dot+0.023
 distances+0.023
 pavement+0.022
 horizon+0.022
 dashboard+0.021
Neg logit contributions
 exactly-0.018
itting-0.015
str-0.015
o-0.015
sm-0.015
Token a
Token ID257
Feature activation-5.33e-01

Pos logit contributions
 exactly+0.001
itting+0.001
str+0.000
sm+0.000
 known+0.000
Neg logit contributions
 dot-0.001
 distances-0.001
 pavement-0.001
 horizon-0.001
 dashboard-0.001
Token wood
Token ID4898
Feature activation-1.17e+00

Pos logit contributions
 exactly+0.030
itting+0.027
 known+0.026
ied+0.025
icking+0.025
Neg logit contributions
 dot-0.016
 pavement-0.012
 arc-0.011
 horizon-0.011
ro-0.011
Tokencut
Token ID8968
Feature activation-4.49e+00

Pos logit contributions
 exactly+0.052
itting+0.049
 known+0.048
bek+0.044
 backup+0.043
Neg logit contributions
 dot-0.045
ro-0.041
 floor-0.041
 pavement-0.041
 surface-0.039
Tokenter
Token ID353
Feature activation-3.35e-01

Pos logit contributions
 exactly+0.216
itting+0.208
sm+0.190
bek+0.189
 known+0.187
Neg logit contributions
 dot-0.151
 pavement-0.133
 distances-0.127
 horizon-0.127
 dashboard-0.124
Token.
Token ID13
Feature activation+3.23e-01

Pos logit contributions
 exactly+0.014
bek+0.013
 backup+0.013
itting+0.012
 Enjoy+0.012
Neg logit contributions
 dot-0.013
ro-0.011
 surface-0.010
 distances-0.010
 wheels-0.010
Token He
Token ID679
Feature activation+6.33e-01

Pos logit contributions
 dot+0.011
 distances+0.011
 pavement+0.010
 horizon+0.010
 dashboard+0.010
Neg logit contributions
 exactly-0.014
itting-0.013
sm-0.013
o-0.013
str-0.013
Token had
Token ID550
Feature activation-7.44e-01

Pos logit contributions
 dot+0.030
 pavement+0.029
 distances+0.029
 horizon+0.028
 dashboard+0.027
Neg logit contributions
 exactly-0.021
itting-0.019
sm-0.017
str-0.017
 known-0.017
Token an
Token ID281
Feature activation+9.55e-01

Pos logit contributions
 exactly+0.030
itting+0.029
bek+0.027
str+0.025
ll+0.025
Neg logit contributions
 dot-0.029
 distances-0.029
 wheels-0.028
 pavement-0.028
 edges-0.027
Token mouse
Token ID10211
Feature activation-5.47e-01

Pos logit contributions
 exactly+0.049
itting+0.046
str+0.041
sm+0.041
o+0.041
Neg logit contributions
 dot-0.044
 pavement-0.039
 horizon-0.039
 distances-0.038
 wheels-0.037
Token looked
Token ID3114
Feature activation+8.07e-01

Pos logit contributions
 exactly+0.023
itting+0.022
sm+0.020
str+0.020
 known+0.020
Neg logit contributions
 dot-0.020
 pavement-0.019
 distances-0.019
 horizon-0.019
 dashboard-0.018
Token up
Token ID510
Feature activation-7.81e-02

Pos logit contributions
 dot+0.034
 horizon+0.033
 distances+0.033
 pavement+0.033
 widened+0.031
Neg logit contributions
itting-0.027
 exactly-0.026
str-0.026
 known-0.025
live-0.024
Token at
Token ID379
Feature activation-6.16e-01

Pos logit contributions
 exactly+0.003
itting+0.003
str+0.003
 known+0.003
live+0.003
Neg logit contributions
 dot-0.003
 horizon-0.003
 distances-0.003
 pavement-0.003
ro-0.002
Token k
Token ID479
Feature activation-4.35e+00

Pos logit contributions
 exactly+0.022
itting+0.020
sm+0.018
str+0.018
 known+0.017
Neg logit contributions
 dot-0.028
 distances-0.027
 pavement-0.027
 horizon-0.026
 dashboard-0.025
Tokenitty
Token ID9760
Feature activation-4.48e+00

Pos logit contributions
 exactly+0.164
 known+0.158
 wrong+0.140
str+0.134
 caring+0.127
Neg logit contributions
 dot-0.196
ro-0.165
 distances-0.165
 arc-0.152
 pavement-0.150
Token and
Token ID290
Feature activation+4.77e-01

Pos logit contributions
itting+0.241
 exactly+0.240
 known+0.237
str+0.231
bek+0.221
Neg logit contributions
 dot-0.118
 distances-0.108
 pavement-0.106
 horizon-0.100
ro-0.093
Token said
Token ID531
Feature activation-6.45e-01

Pos logit contributions
 dot+0.017
 pavement+0.016
 distances+0.015
 horizon+0.015
 widened+0.015
Neg logit contributions
itting-0.021
 exactly-0.020
sm-0.018
rou-0.018
str-0.018
Token,
Token ID11
Feature activation-6.55e-01

Pos logit contributions
itting+0.030
 exactly+0.028
bek+0.027
live+0.027
rou+0.027
Neg logit contributions
 distances-0.018
 dot-0.018
 Novel-0.018
 sides-0.016
 widened-0.016
Token "
Token ID366
Feature activation-6.02e-01

Pos logit contributions
 exactly+0.026
itting+0.026
ats+0.025
sm+0.024
live+0.024
Neg logit contributions
 dot-0.024
 pavement-0.023
 horizon-0.022
 distances-0.022
ro-0.021
TokenWe
Token ID1135
Feature activation-1.88e+00

Pos logit contributions
 exactly+0.026
itting+0.023
str+0.022
 wrong+0.021
sm+0.021
Neg logit contributions
 dot-0.023
 distances-0.022
 pavement-0.022
 horizon-0.021
 dashboard-0.020
Token there
Token ID612
Feature activation+6.12e-01

Pos logit contributions
 dot+0.002
 pavement+0.002
 distances+0.002
 horizon+0.002
 dashboard+0.002
Neg logit contributions
 exactly-0.001
itting-0.001
sm-0.001
str-0.001
 known-0.001
Token was
Token ID373
Feature activation+8.46e-02

Pos logit contributions
 dot+0.028
 distances+0.027
 pavement+0.027
 horizon+0.026
 dashboard+0.026
Neg logit contributions
 exactly-0.021
itting-0.018
str-0.018
o-0.017
sm-0.017
Token a
Token ID257
Feature activation-4.51e-01

Pos logit contributions
 dot+0.004
 pavement+0.004
 distances+0.004
 horizon+0.004
 dashboard+0.004
Neg logit contributions
 exactly-0.003
itting-0.002
str-0.002
sm-0.002
 known-0.002
Token kind
Token ID1611
Feature activation-7.85e-01

Pos logit contributions
 exactly+0.024
itting+0.022
 known+0.022
uc+0.020
icking+0.020
Neg logit contributions
 dot-0.014
 pavement-0.011
 horizon-0.010
 arc-0.010
ro-0.010
Token wood
Token ID4898
Feature activation-1.37e+00

Pos logit contributions
 exactly+0.034
itting+0.031
 known+0.029
sm+0.028
str+0.028
Neg logit contributions
 dot-0.030
 pavement-0.028
 horizon-0.027
 distances-0.026
ro-0.026
Tokencut
Token ID8968
Feature activation-4.47e+00

Pos logit contributions
 exactly+0.057
itting+0.055
 known+0.050
bek+0.047
ll+0.046
Neg logit contributions
 dot-0.056
 pavement-0.052
ro-0.052
 floor-0.052
 dashboard-0.048
Tokenter
Token ID353
Feature activation-4.78e-01

Pos logit contributions
 exactly+0.222
itting+0.212
bek+0.194
sm+0.191
 known+0.190
Neg logit contributions
 dot-0.148
 pavement-0.124
 distances-0.118
 horizon-0.118
 dashboard-0.117
Token who
Token ID508
Feature activation-1.14e+00

Pos logit contributions
 exactly+0.019
bek+0.017
itting+0.016
 backup+0.016
Deal+0.015
Neg logit contributions
 dot-0.021
ro-0.018
 distances-0.016
 surface-0.016
 wheels-0.016
Token lived
Token ID5615
Feature activation+1.64e+00

Pos logit contributions
 exactly+0.046
itting+0.043
str+0.037
rou+0.037
uc+0.036
Neg logit contributions
 dot-0.044
 pavement-0.042
 distances-0.042
 dashboard-0.039
 wheels-0.039
Token in
Token ID287
Feature activation+8.35e-01

Pos logit contributions
 myself+0.035
 surface+0.034
 distances+0.034
 open+0.032
ro+0.032
Neg logit contributions
str-0.085
bek-0.083
live-0.078
itting-0.078
 exactly-0.078
Token the
Token ID262
Feature activation+4.52e-01

Pos logit contributions
 dot+0.038
 distances+0.037
 pavement+0.037
 horizon+0.035
 dashboard+0.035
Neg logit contributions
 exactly-0.029
itting-0.027
str-0.024
sm-0.023
rou-0.023
Token threw
Token ID9617
Feature activation-1.62e+00

Pos logit contributions
itting+0.061
 exactly+0.057
sm+0.052
itt+0.052
live+0.051
Neg logit contributions
 pavement-0.040
 dot-0.038
 wheels-0.036
 distances-0.035
 widened-0.035
Token it
Token ID340
Feature activation-3.33e-01

Pos logit contributions
 exactly+0.057
itting+0.048
sm+0.046
str+0.045
 known+0.044
Neg logit contributions
 dot-0.074
 distances-0.071
 pavement-0.071
 horizon-0.070
 dashboard-0.068
Token in
Token ID287
Feature activation+4.19e-02

Pos logit contributions
 exactly+0.013
itting+0.012
str+0.012
 known+0.011
bek+0.011
Neg logit contributions
 dot-0.013
 distances-0.013
 pavement-0.013
 wheels-0.012
 horizon-0.012
Token the
Token ID262
Feature activation-5.87e-01

Pos logit contributions
 dot+0.002
 pavement+0.002
 distances+0.002
 horizon+0.002
 dashboard+0.002
Neg logit contributions
 exactly-0.001
itting-0.001
sm-0.001
 known-0.001
str-0.001
Token ash
Token ID12530
Feature activation-1.85e+00

Pos logit contributions
 exactly+0.027
itting+0.025
sm+0.024
str+0.024
 known+0.023
Neg logit contributions
 dot-0.020
 pavement-0.019
 distances-0.019
 horizon-0.018
 dashboard-0.018
Tokentr
Token ID2213
Feature activation-4.45e+00

Pos logit contributions
 exactly+0.069
itting+0.064
 known+0.061
sm+0.059
str+0.059
Neg logit contributions
 dot-0.079
 pavement-0.077
 distances-0.075
 dashboard-0.073
 horizon-0.073
Tokenay
Token ID323
Feature activation+1.43e+00

Pos logit contributions
 exactly+0.205
itting+0.178
sm+0.165
rou+0.165
o+0.164
Neg logit contributions
 distances-0.169
 pavement-0.167
 dot-0.165
 horizon-0.160
 dashboard-0.149
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
 dot+0.056
 pavement+0.056
 distances+0.053
ro+0.053
 horizon+0.051
Neg logit contributions
 exactly-0.057
itting-0.053
 known-0.051
str-0.049
sm-0.049
Token Tim
Token ID5045
Feature activation-8.35e-01

Pos logit contributions
 exactly+0.010
itting+0.008
str+0.008
o+0.008
sm+0.008
Neg logit contributions
 dot-0.011
 distances-0.010
 horizon-0.010
 pavement-0.010
 dashboard-0.010
Tokenmy
Token ID1820
Feature activation+4.57e-02

Pos logit contributions
 exactly+0.024
itting+0.020
sm+0.018
str+0.018
o+0.018
Neg logit contributions
 dot-0.043
 distances-0.042
 pavement-0.042
 horizon-0.041
 dashboard-0.040
Token was
Token ID373
Feature activation+8.36e-02

Pos logit contributions
 dot+0.002
 pavement+0.002
 distances+0.002
 horizon+0.002
 dashboard+0.002
Neg logit contributions
 exactly-0.002
itting-0.002
sm-0.002
str-0.002
 known-0.002
Token same
Token ID976
Feature activation-1.26e-01

Pos logit contributions
 exactly+0.036
itting+0.034
uc+0.032
o+0.031
sm+0.031
Neg logit contributions
 pavement-0.018
 dot-0.018
 horizon-0.018
 distances-0.017
 dashboard-0.015
Token spot
Token ID4136
Feature activation+7.04e-02

Pos logit contributions
 exactly+0.006
itting+0.005
str+0.005
sm+0.005
 known+0.005
Neg logit contributions
 dot-0.004
 distances-0.004
 pavement-0.004
 horizon-0.004
 dashboard-0.004
Token,
Token ID11
Feature activation+8.28e-02

Pos logit contributions
 distances+0.002
 horizon+0.002
ro+0.002
 dot+0.002
 pavement+0.002
Neg logit contributions
itting-0.003
str-0.003
 exactly-0.003
uc-0.003
 known-0.003
Token full
Token ID1336
Feature activation+3.13e-01

Pos logit contributions
 dot+0.003
 distances+0.003
 pavement+0.003
 horizon+0.003
 dashboard+0.003
Neg logit contributions
itting-0.003
 exactly-0.003
uc-0.003
str-0.003
sm-0.003
Token of
Token ID286
Feature activation-1.50e+00

Pos logit contributions
 dot+0.012
 pavement+0.011
 horizon+0.011
 distances+0.011
 dashboard+0.010
Neg logit contributions
 exactly-0.013
itting-0.013
 known-0.012
str-0.012
uc-0.012
Token cher
Token ID21382
Feature activation-4.44e+00

Pos logit contributions
 exactly+0.056
itting+0.053
str+0.048
o+0.047
 known+0.047
Neg logit contributions
 dot-0.062
 pavement-0.061
 distances-0.060
 horizon-0.058
 dashboard-0.056
Tokenries
Token ID1678
Feature activation-3.64e+00

Pos logit contributions
 exactly+0.208
itting+0.202
 known+0.198
 teaching+0.187
 Enjoy+0.186
Neg logit contributions
 distances-0.126
 dot-0.123
 horizon-0.121
 pavement-0.117
 surfaces-0.108
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
 exactly+0.135
itting+0.134
 known+0.126
str+0.123
sm+0.120
Neg logit contributions
 dot-0.145
 distances-0.140
 horizon-0.140
 pavement-0.139
 dashboard-0.130
Token<|endoftext|>
Token ID50256
Feature activation+2.04e-01

Pos logit contributions
 exactly+0.014
itting+0.013
uc+0.012
str+0.012
sm+0.012
Neg logit contributions
 dot-0.012
 pavement-0.012
 distances-0.012
 horizon-0.012
 dashboard-0.011
TokenOne
Token ID3198
Feature activation+1.26e-01

Pos logit contributions
 dot+0.008
 distances+0.008
 pavement+0.008
 horizon+0.008
ro+0.007
Neg logit contributions
 exactly-0.008
itting-0.007
str-0.007
 known-0.007
sm-0.007
Token day
Token ID1110
Feature activation-7.60e-01

Pos logit contributions
 dot+0.005
 pavement+0.005
 horizon+0.005
 distances+0.005
 sides+0.005
Neg logit contributions
 exactly-0.005
itting-0.004
uc-0.004
bek-0.004
 known-0.004
Token stop
Token ID2245
Feature activation-1.35e+00

Pos logit contributions
 exactly+0.050
itting+0.047
sm+0.044
str+0.044
o+0.042
Neg logit contributions
 dot-0.037
 pavement-0.036
 distances-0.036
 horizon-0.034
 dashboard-0.033
Token and
Token ID290
Feature activation-1.04e+00

Pos logit contributions
 exactly+0.039
str+0.036
itting+0.036
live+0.036
bek+0.034
Neg logit contributions
 dot-0.069
 distances-0.065
 pavement-0.065
 horizon-0.064
 wheels-0.062
Token be
Token ID307
Feature activation-8.30e-01

Pos logit contributions
 exactly+0.047
itting+0.046
sm+0.044
str+0.042
eh+0.041
Neg logit contributions
 dot-0.033
 pavement-0.033
 distances-0.031
 horizon-0.030
 dashboard-0.030
Token careful
Token ID8161
Feature activation-1.72e+00

Pos logit contributions
itting+0.027
 exactly+0.026
str+0.025
itt+0.025
live+0.024
Neg logit contributions
 distances-0.035
 wheels-0.034
 dot-0.033
 pavement-0.033
 shades-0.032
Token around
Token ID1088
Feature activation-3.90e-01

Pos logit contributions
 exactly+0.064
str+0.063
bek+0.061
 known+0.059
uc+0.056
Neg logit contributions
 distances-0.067
 dot-0.065
 sides-0.064
 pavement-0.064
 horizon-0.061
Token envelop
Token ID16441
Feature activation-4.43e+00

Pos logit contributions
 exactly+0.015
itting+0.014
str+0.013
o+0.012
 known+0.012
Neg logit contributions
 dot-0.017
 distances-0.016
 pavement-0.016
 horizon-0.015
 dashboard-0.015
Tokenes
Token ID274
Feature activation-9.18e-01

Pos logit contributions
str+0.193
 exactly+0.178
itting+0.175
sm+0.170
 feeding+0.168
Neg logit contributions
 dot-0.153
 distances-0.152
 dashboard-0.139
aces-0.137
 heads-0.135
Token they
Token ID484
Feature activation-7.07e-02

Pos logit contributions
 exactly+0.037
itting+0.036
str+0.035
 known+0.033
bek+0.032
Neg logit contributions
 dot-0.034
 distances-0.034
 pavement-0.032
ro-0.031
 horizon-0.030
Token found
Token ID1043
Feature activation-3.36e-03

Pos logit contributions
 exactly+0.002
itting+0.002
sm+0.002
 known+0.002
str+0.002
Neg logit contributions
 dot-0.003
 pavement-0.003
 distances-0.003
 horizon-0.003
 dashboard-0.003
Token on
Token ID319
Feature activation+5.18e-02

Pos logit contributions
itting+0.000
str+0.000
 exactly+0.000
live+0.000
itt+0.000
Neg logit contributions
 distances-0.000
 pavement-0.000
 dot-0.000
 wheels-0.000
 dashboard-0.000
Token the
Token ID262
Feature activation+5.76e-01

Pos logit contributions
 dot+0.002
 distances+0.002
 pavement+0.002
 horizon+0.002
 dashboard+0.002
Neg logit contributions
 exactly-0.002
itting-0.002
str-0.001
sm-0.001
o-0.001
Token time
Token ID640
Feature activation-1.73e+00

Pos logit contributions
 exactly+0.037
itting+0.034
str+0.032
sm+0.031
o+0.031
Neg logit contributions
 dot-0.039
 distances-0.039
 pavement-0.038
 horizon-0.037
 dashboard-0.035
Token together
Token ID1978
Feature activation-1.50e+00

Pos logit contributions
 exactly+0.066
itting+0.060
str+0.060
bek+0.058
 known+0.055
Neg logit contributions
 distances-0.071
 dot-0.066
ro-0.063
 pavement-0.062
 horizon-0.060
Token drinking
Token ID7722
Feature activation-1.90e+00

Pos logit contributions
str+0.060
 exactly+0.060
itting+0.058
bek+0.056
 known+0.054
Neg logit contributions
 distances-0.058
 dot-0.055
 horizon-0.052
ro-0.051
 pavement-0.051
Token and
Token ID290
Feature activation-5.47e-01

Pos logit contributions
 exactly+0.076
itting+0.073
 known+0.067
str+0.067
bek+0.065
Neg logit contributions
 dot-0.075
 pavement-0.072
 distances-0.071
 horizon-0.071
 surfaces-0.067
Token eating
Token ID6600
Feature activation-1.53e+00

Pos logit contributions
 exactly+0.014
itting+0.013
str+0.011
sm+0.010
bek+0.010
Neg logit contributions
 pavement-0.029
 dot-0.029
 distances-0.029
 horizon-0.028
 surfaces-0.028
Token dough
Token ID15756
Feature activation-4.42e+00

Pos logit contributions
itting+0.058
 exactly+0.055
str+0.053
 known+0.052
o+0.052
Neg logit contributions
 dot-0.060
 pavement-0.060
 distances-0.059
 horizon-0.057
 surfaces-0.057
Token.
Token ID13
Feature activation-4.42e-01

Pos logit contributions
 exactly+0.167
itting+0.160
str+0.153
 known+0.149
bek+0.146
Neg logit contributions
 dot-0.178
 distances-0.177
 pavement-0.173
 horizon-0.166
ro-0.165
Token
Token ID220
Feature activation-1.01e+00

Pos logit contributions
 exactly+0.021
itting+0.019
str+0.018
uc+0.018
sm+0.018
Neg logit contributions
 dot-0.014
 pavement-0.014
 distances-0.014
 horizon-0.014
 dashboard-0.013
Token
Token ID198
Feature activation-5.55e-01

Pos logit contributions
 exactly+0.038
itting+0.034
sm+0.032
str+0.032
 known+0.031
Neg logit contributions
 dot-0.042
 pavement-0.041
 distances-0.041
 horizon-0.039
 dashboard-0.039
TokenThe
Token ID464
Feature activation-1.24e+00

Pos logit contributions
 exactly+0.025
itting+0.022
 wrong+0.021
uc+0.021
icking+0.021
Neg logit contributions
 dot-0.019
 distances-0.019
 pavement-0.018
 horizon-0.018
ro-0.017
Token end
Token ID886
Feature activation+1.04e+00

Pos logit contributions
 exactly+0.059
itting+0.054
o+0.053
 known+0.051
rou+0.051
Neg logit contributions
 pavement-0.042
 dot-0.042
 horizon-0.040
 floor-0.040
 sides-0.039
Token was
Token ID373
Feature activation-1.08e+00

Pos logit contributions
 dot+0.011
 distances+0.010
ro+0.009
 pavement+0.009
 horizon+0.009
Neg logit contributions
 exactly-0.011
itting-0.010
str-0.009
bek-0.009
sm-0.009
Token a
Token ID257
Feature activation-1.04e+00

Pos logit contributions
itting+0.040
uc+0.040
itt+0.037
bek+0.036
ll+0.036
Neg logit contributions
 distances-0.039
 wheels-0.036
 wonders-0.035
 sill-0.034
 widen-0.034
Token very
Token ID845
Feature activation-1.40e+00

Pos logit contributions
itting+0.046
ied+0.044
uc+0.044
 exactly+0.044
ats+0.043
Neg logit contributions
 dot-0.039
 horizon-0.034
 tear-0.031
 pavement-0.030
 widened-0.030
Token good
Token ID922
Feature activation-1.34e+00

Pos logit contributions
 exactly+0.054
itting+0.053
uc+0.049
o+0.048
icking+0.045
Neg logit contributions
 pavement-0.051
 dot-0.051
 horizon-0.050
 distances-0.048
ro-0.048
Token k
Token ID479
Feature activation-3.34e+00

Pos logit contributions
 exactly+0.055
itting+0.049
str+0.047
uc+0.046
o+0.045
Neg logit contributions
 dot-0.055
 pavement-0.051
 distances-0.049
ro-0.049
 horizon-0.048
Tokenitty
Token ID9760
Feature activation-4.39e+00

Pos logit contributions
 exactly+0.124
 known+0.108
 wrong+0.100
 watching+0.096
str+0.094
Neg logit contributions
 dot-0.158
ro-0.132
 distances-0.126
 dashboard-0.126
 spoiler-0.126
Token.
Token ID13
Feature activation-2.58e-01

Pos logit contributions
itting+0.224
 exactly+0.221
uc+0.220
str+0.213
bek+0.204
Neg logit contributions
 dot-0.130
 distances-0.104
est-0.100
 pavement-0.094
ro-0.091
Token<|endoftext|>
Token ID50256
Feature activation+9.91e-02

Pos logit contributions
 exactly+0.010
itting+0.009
sm+0.008
str+0.008
 known+0.008
Neg logit contributions
 dot-0.011
 distances-0.010
 pavement-0.010
 horizon-0.010
 dashboard-0.010
TokenOnce
Token ID7454
Feature activation-2.29e-02

Pos logit contributions
 dot+0.005
 pavement+0.004
 distances+0.004
 horizon+0.004
 dashboard+0.004
Neg logit contributions
 exactly-0.003
itting-0.003
sm-0.003
o-0.003
str-0.003
Token there
Token ID612
Feature activation+2.52e-01

Pos logit contributions
 exactly+0.001
itting+0.001
bek+0.001
 known+0.001
sm+0.001
Neg logit contributions
 dot-0.001
 horizon-0.001
 distances-0.001
 pavement-0.001
 Novel-0.001
Token was
Token ID373
Feature activation-1.25e-01

Pos logit contributions
 dot+0.010
 pavement+0.009
 distances+0.009
 horizon+0.009
 dashboard+0.009
Neg logit contributions
 exactly-0.010
itting-0.009
sm-0.009
str-0.009
 known-0.009
Token.
Token ID13
Feature activation-2.77e-02

Pos logit contributions
 dot+0.030
 distances+0.029
 pavement+0.029
 horizon+0.028
 dashboard+0.027
Neg logit contributions
 exactly-0.021
itting-0.018
sm-0.017
str-0.016
 known-0.016
Token She
Token ID1375
Feature activation+3.37e-01

Pos logit contributions
 exactly+0.001
itting+0.001
sm+0.001
str+0.001
 known+0.001
Neg logit contributions
 dot-0.001
 distances-0.001
 pavement-0.001
 horizon-0.001
 dashboard-0.001
Token used
Token ID973
Feature activation-4.30e-01

Pos logit contributions
 dot+0.015
 pavement+0.015
 distances+0.015
 horizon+0.014
 spoiler+0.014
Neg logit contributions
 exactly-0.012
itting-0.010
sm-0.010
o-0.010
 known-0.010
Token a
Token ID257
Feature activation-4.00e-01

Pos logit contributions
itting+0.016
 exactly+0.016
str+0.015
bek+0.014
o+0.014
Neg logit contributions
 dot-0.018
 distances-0.017
 pavement-0.017
 wheels-0.017
 horizon-0.016
Token m
Token ID285
Feature activation-1.24e+00

Pos logit contributions
 exactly+0.018
itting+0.018
live+0.017
 known+0.017
o+0.017
Neg logit contributions
 dot-0.014
 pavement-0.012
 horizon-0.011
 dashboard-0.011
 gasp-0.011
Tokenitt
Token ID715
Feature activation-4.39e+00

Pos logit contributions
 exactly+0.026
itting+0.021
sm+0.019
str+0.018
 known+0.017
Neg logit contributions
 dot-0.073
 distances-0.071
 pavement-0.071
 horizon-0.070
 dashboard-0.068
Token to
Token ID284
Feature activation+6.73e-01

Pos logit contributions
 exactly+0.174
Deal+0.161
 known+0.158
itting+0.158
bek+0.155
Neg logit contributions
 dot-0.177
 distances-0.166
 pavement-0.154
ro-0.150
 horizon-0.149
Token take
Token ID1011
Feature activation+3.14e-03

Pos logit contributions
 dot+0.024
 distances+0.022
 pavement+0.022
 horizon+0.022
 dashboard+0.021
Neg logit contributions
 exactly-0.031
itting-0.027
sm-0.026
str-0.026
 known-0.026
Token out
Token ID503
Feature activation-1.42e-01

Pos logit contributions
 dot+0.000
 distances+0.000
 pavement+0.000
 wheels+0.000
 horizon+0.000
Neg logit contributions
 exactly-0.000
itting-0.000
str-0.000
ll-0.000
rou-0.000
Token the
Token ID262
Feature activation+3.25e-01

Pos logit contributions
itting+0.005
str+0.005
 exactly+0.005
live+0.005
Deal+0.005
Neg logit contributions
 distances-0.005
 dot-0.005
 wheels-0.005
 sides-0.005
 buttons-0.005
Token pan
Token ID3425
Feature activation-7.01e-01

Pos logit contributions
 dot+0.013
 pavement+0.012
 distances+0.012
 horizon+0.012
 dashboard+0.012
Neg logit contributions
 exactly-0.013
itting-0.013
o-0.011
 known-0.011
str-0.011
Token all
Token ID477
Feature activation+1.54e+00

Pos logit contributions
 exactly+0.012
itting+0.012
str+0.010
rou+0.010
 wrong+0.010
Neg logit contributions
 dot-0.011
 pavement-0.010
 widened-0.010
 distances-0.010
 wheels-0.009
Token hopped
Token ID45230
Feature activation+2.09e-01

Pos logit contributions
 dot+0.050
 pavement+0.047
 widened+0.045
 arc+0.043
 distances+0.043
Neg logit contributions
 exactly-0.072
itting-0.067
rou-0.061
 feeding-0.060
itt-0.059
Token in
Token ID287
Feature activation-2.39e-01

Pos logit contributions
 dot+0.008
 distances+0.008
 pavement+0.008
ro+0.008
 surface+0.007
Neg logit contributions
 exactly-0.009
itting-0.008
str-0.008
 known-0.008
live-0.007
Token a
Token ID257
Feature activation-8.71e-01

Pos logit contributions
 exactly+0.010
itting+0.009
str+0.009
live+0.008
rou+0.008
Neg logit contributions
 dot-0.009
 distances-0.009
 pavement-0.009
 horizon-0.009
 wheels-0.008
Token big
Token ID1263
Feature activation+3.32e-01

Pos logit contributions
 exactly+0.045
itting+0.044
 known+0.042
ll+0.042
icking+0.042
Neg logit contributions
 floor-0.024
 dot-0.022
 pavement-0.022
 sides-0.021
 wheels-0.021
Token oven
Token ID14361
Feature activation-4.38e+00

Pos logit contributions
 dot+0.013
 pavement+0.012
 distances+0.012
 horizon+0.012
 dashboard+0.011
Neg logit contributions
 exactly-0.014
itting-0.013
sm-0.012
 known-0.012
str-0.012
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
 exactly+0.177
itting+0.165
 known+0.160
str+0.157
rou+0.152
Neg logit contributions
 dot-0.176
 pavement-0.174
 horizon-0.168
 dashboard-0.162
 distances-0.159
Token
Token ID220
Feature activation-5.33e-01

Pos logit contributions
 exactly+0.004
itting+0.004
sm+0.004
str+0.004
 known+0.004
Neg logit contributions
 dot-0.005
 distances-0.005
 pavement-0.005
 horizon-0.004
 dashboard-0.004
Token
Token ID198
Feature activation+1.06e-03

Pos logit contributions
 exactly+0.020
itting+0.018
sm+0.017
str+0.017
 known+0.016
Neg logit contributions
 dot-0.022
 distances-0.022
 pavement-0.022
 horizon-0.021
 dashboard-0.020
Token
Token ID198
Feature activation+2.60e-01

Pos logit contributions
 dot+0.000
 pavement+0.000
 distances+0.000
 horizon+0.000
 dashboard+0.000
Neg logit contributions
 exactly-0.000
itting-0.000
sm-0.000
str-0.000
 known-0.000
TokenThe
Token ID464
Feature activation-5.53e-01

Pos logit contributions
 dot+0.011
 distances+0.010
 pavement+0.010
 horizon+0.010
 dashboard+0.010
Neg logit contributions
 exactly-0.010
itting-0.009
str-0.009
sm-0.009
uc-0.008
Token time
Token ID640
Feature activation-1.13e-01

Pos logit contributions
 exactly+0.049
 known+0.044
uc+0.044
itting+0.043
str+0.042
Neg logit contributions
 dot-0.037
 distances-0.036
 pavement-0.035
 horizon-0.034
ro-0.033
Token there
Token ID612
Feature activation+6.48e-01

Pos logit contributions
 exactly+0.004
itting+0.003
sm+0.003
 known+0.003
str+0.003
Neg logit contributions
 dot-0.005
 distances-0.005
 pavement-0.005
 horizon-0.005
 dashboard-0.005
Token was
Token ID373
Feature activation+1.98e-01

Pos logit contributions
 dot+0.031
 distances+0.030
 pavement+0.030
 horizon+0.029
 dashboard+0.029
Neg logit contributions
 exactly-0.022
itting-0.018
o-0.018
str-0.018
sm-0.017
Token a
Token ID257
Feature activation-3.66e-01

Pos logit contributions
 dot+0.010
 pavement+0.009
 distances+0.009
 horizon+0.009
 dashboard+0.009
Neg logit contributions
 exactly-0.006
itting-0.006
str-0.005
sm-0.005
 known-0.005
Token wood
Token ID4898
Feature activation-1.14e+00

Pos logit contributions
 exactly+0.020
itting+0.018
 known+0.017
uc+0.017
icking+0.017
Neg logit contributions
 dot-0.011
 pavement-0.009
 horizon-0.008
 arc-0.008
ro-0.008
Tokencut
Token ID8968
Feature activation-4.37e+00

Pos logit contributions
 exactly+0.049
itting+0.047
 known+0.045
bek+0.043
Acc+0.042
Neg logit contributions
 dot-0.044
ro-0.040
 floor-0.040
 pavement-0.040
 wheels-0.038
Tokenter
Token ID353
Feature activation-2.71e-01

Pos logit contributions
 exactly+0.216
itting+0.208
bek+0.192
sm+0.191
 known+0.188
Neg logit contributions
 dot-0.141
 pavement-0.123
 distances-0.117
 wheels-0.114
 horizon-0.114
Token who
Token ID508
Feature activation-8.84e-01

Pos logit contributions
 exactly+0.011
bek+0.011
 backup+0.010
itting+0.010
Deal+0.009
Neg logit contributions
 dot-0.011
ro-0.009
 distances-0.009
 surface-0.009
 wheels-0.008
Token was
Token ID373
Feature activation+1.04e+00

Pos logit contributions
 exactly+0.036
itting+0.033
str+0.029
rou+0.029
bek+0.028
Neg logit contributions
 dot-0.034
 distances-0.033
 pavement-0.032
 dashboard-0.030
 wheels-0.030
Token strong
Token ID1913
Feature activation+1.08e+00

Pos logit contributions
 dot+0.039
 wheels+0.039
 distances+0.039
 surface+0.036
 edges+0.036
Neg logit contributions
itting-0.041
bek-0.038
 exactly-0.038
Deal-0.035
live-0.034
Token and
Token ID290
Feature activation+7.87e-01

Pos logit contributions
 dot+0.039
 distances+0.036
 pavement+0.033
 wheels+0.033
ro+0.032
Neg logit contributions
 exactly-0.049
itting-0.047
str-0.046
bek-0.044
 known-0.044
Token mom
Token ID1995
Feature activation-1.03e+00

Pos logit contributions
 exactly+0.012
itting+0.012
str+0.011
sm+0.011
 known+0.010
Neg logit contributions
 dot-0.016
 pavement-0.015
 distances-0.015
 horizon-0.015
 dashboard-0.015
Token and
Token ID290
Feature activation+1.39e+00

Pos logit contributions
 exactly+0.049
itting+0.047
str+0.046
 known+0.044
uc+0.043
Neg logit contributions
 dot-0.031
 distances-0.031
 pavement-0.030
 horizon-0.029
 dashboard-0.028
Token dad
Token ID9955
Feature activation+9.53e-01

Pos logit contributions
 dot+0.050
 pavement+0.050
 heads+0.049
 dashboard+0.049
 distances+0.048
Neg logit contributions
itting-0.057
 exactly-0.054
str-0.050
sm-0.049
itt-0.047
Token.
Token ID13
Feature activation+4.98e-01

Pos logit contributions
 distances+0.030
 dot+0.029
 pavement+0.028
 dashboard+0.027
 horizon+0.027
Neg logit contributions
 exactly-0.044
itting-0.044
str-0.043
eh-0.041
live-0.041
Token She
Token ID1375
Feature activation+5.87e-01

Pos logit contributions
 dot+0.022
 distances+0.021
 pavement+0.021
 horizon+0.020
 spoiler+0.020
Neg logit contributions
 exactly-0.018
itting-0.016
 known-0.015
sm-0.015
 wrong-0.015
Token wished
Token ID16555
Feature activation+2.59e+00

Pos logit contributions
 dot+0.023
 pavement+0.022
 distances+0.022
 horizon+0.021
 spoiler+0.020
Neg logit contributions
 exactly-0.025
 known-0.022
 named-0.022
sm-0.022
 wrong-0.022
Token she
Token ID673
Feature activation+7.02e-01

Pos logit contributions
 dot+0.112
 horizon+0.110
 pavement+0.110
 distances+0.109
 dashboard+0.105
Neg logit contributions
 exactly-0.084
itting-0.081
bek-0.078
str-0.075
o-0.072
Token had
Token ID550
Feature activation-6.20e-01

Pos logit contributions
 dot+0.028
 distances+0.026
 pavement+0.026
 dashboard+0.025
 horizon+0.025
Neg logit contributions
 exactly-0.028
itting-0.026
str-0.024
sm-0.023
uc-0.023
Token stayed
Token ID9658
Feature activation+1.14e+00

Pos logit contributions
itting+0.025
 exactly+0.022
itt+0.022
ip+0.021
bek+0.021
Neg logit contributions
 distances-0.022
 pavement-0.022
 wheels-0.022
 myself-0.021
 heads-0.021
Token home
Token ID1363
Feature activation+1.15e+00

Pos logit contributions
 dot+0.042
 distances+0.042
 pavement+0.039
 horizon+0.039
 wheels+0.039
Neg logit contributions
str-0.043
itting-0.043
 exactly-0.041
itt-0.040
bek-0.039
Token and
Token ID290
Feature activation+3.82e-03

Pos logit contributions
 dot+0.048
 pavement+0.046
 distances+0.046
 horizon+0.045
 dashboard+0.044
Neg logit contributions
 exactly-0.044
itting-0.039
sm-0.037
str-0.036
 known-0.036
Token door
Token ID3420
Feature activation+8.16e-02

Pos logit contributions
 exactly+0.035
 wrong+0.031
sm+0.031
 living+0.030
itting+0.030
Neg logit contributions
 dot-0.026
 distances-0.025
 pavement-0.024
 horizon-0.023
 spoiler-0.022
Token.
Token ID13
Feature activation+2.49e-02

Pos logit contributions
 dot+0.004
 pavement+0.004
 distances+0.004
 horizon+0.004
 dashboard+0.004
Neg logit contributions
 exactly-0.003
itting-0.002
sm-0.002
str-0.002
 known-0.002
Token He
Token ID679
Feature activation-2.27e-02

Pos logit contributions
 dot+0.001
 distances+0.001
 pavement+0.001
 horizon+0.001
 dashboard+0.001
Neg logit contributions
 exactly-0.001
itting-0.001
sm-0.001
 known-0.001
o-0.001
Token hoped
Token ID10719
Feature activation+1.30e+00

Pos logit contributions
 exactly+0.001
itting+0.001
sm+0.001
 known+0.001
o+0.001
Neg logit contributions
 dot-0.001
 pavement-0.001
 distances-0.001
 horizon-0.001
 spoiler-0.001
Token their
Token ID511
Feature activation-6.52e-01

Pos logit contributions
 pavement+0.057
 dot+0.057
 distances+0.056
 horizon+0.056
 dashboard+0.054
Neg logit contributions
itting-0.042
 exactly-0.040
str-0.039
bek-0.037
uc-0.036
Token parents
Token ID3397
Feature activation+2.86e+00

Pos logit contributions
 exactly+0.022
itting+0.020
sm+0.018
str+0.018
 known+0.018
Neg logit contributions
 dot-0.030
 pavement-0.029
 distances-0.029
 horizon-0.028
 dashboard-0.027
Token would
Token ID561
Feature activation+7.16e-02

Pos logit contributions
 dot+0.144
 distances+0.144
 pavement+0.140
 horizon+0.139
 dashboard+0.138
Neg logit contributions
 exactly-0.076
itting-0.070
str-0.064
sm-0.061
uc-0.061
Token not
Token ID407
Feature activation+1.67e-01

Pos logit contributions
 dot+0.003
 pavement+0.003
 distances+0.003
 spoiler+0.003
 horizon+0.003
Neg logit contributions
 exactly-0.003
 known-0.002
 helping-0.002
itting-0.002
 feeding-0.002
Token notice
Token ID4003
Feature activation+2.07e-01

Pos logit contributions
 dot+0.007
 spoiler+0.007
 pavement+0.006
 distances+0.006
 horizon+0.006
Neg logit contributions
 exactly-0.007
 known-0.006
 helping-0.006
 wrong-0.006
 to-0.006
Token anything
Token ID1997
Feature activation+5.73e-01

Pos logit contributions
 dot+0.009
 distances+0.008
 pavement+0.008
 horizon+0.008
 dashboard+0.008
Neg logit contributions
 exactly-0.008
itting-0.007
str-0.007
 known-0.007
o-0.006
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
 dot+0.026
 pavement+0.025
 distances+0.025
 horizon+0.024
 dashboard+0.024
Neg logit contributions
 exactly-0.020
itting-0.018
str-0.016
sm-0.016
 known-0.016
Token the
Token ID262
Feature activation+4.06e-01

Pos logit contributions
 dot+0.009
 distances+0.009
 pavement+0.009
 wheels+0.009
 horizon+0.008
Neg logit contributions
itting-0.009
 exactly-0.009
bek-0.008
str-0.008
live-0.008
Token cows
Token ID22575
Feature activation-5.57e-02

Pos logit contributions
 dot+0.014
 horizon+0.013
 pavement+0.013
 wheels+0.012
 distances+0.012
Neg logit contributions
 exactly-0.019
itting-0.018
bek-0.017
 known-0.017
o-0.017
Token,
Token ID11
Feature activation+3.03e+00

Pos logit contributions
 exactly+0.003
itting+0.002
bek+0.002
sm+0.002
str+0.002
Neg logit contributions
 dot-0.002
 distances-0.002
 pavement-0.002
 horizon-0.002
 dashboard-0.002
Token the
Token ID262
Feature activation+1.25e+00

Pos logit contributions
 dot+0.130
 distances+0.125
 pavement+0.125
 horizon+0.120
 dashboard+0.118
Neg logit contributions
 exactly-0.113
itting-0.106
sm-0.093
bek-0.092
str-0.092
Token pigs
Token ID22333
Feature activation+5.79e-01

Pos logit contributions
 dot+0.050
 horizon+0.048
 pavement+0.047
 wheels+0.046
 distances+0.045
Neg logit contributions
 exactly-0.055
itting-0.050
bek-0.045
sm-0.045
live-0.045
Token,
Token ID11
Feature activation+3.02e+00

Pos logit contributions
 dot+0.020
 distances+0.020
 pavement+0.019
ro+0.018
 horizon+0.018
Neg logit contributions
 exactly-0.026
bek-0.024
itting-0.023
sm-0.023
rou-0.022
Token the
Token ID262
Feature activation+1.42e+00

Pos logit contributions
 dot+0.101
 distances+0.098
 pavement+0.095
 horizon+0.091
 dashboard+0.090
Neg logit contributions
 exactly-0.136
itting-0.133
bek-0.122
str-0.119
rou-0.116
Token chickens
Token ID25972
Feature activation+5.03e-01

Pos logit contributions
 dot+0.056
 horizon+0.054
 pavement+0.054
 wheels+0.052
 distances+0.052
Neg logit contributions
 exactly-0.062
itting-0.058
sm-0.052
bek-0.052
live-0.052
Token and
Token ID290
Feature activation+3.01e+00

Pos logit contributions
 dot+0.016
 distances+0.016
ro+0.015
 pavement+0.015
 horizon+0.014
Neg logit contributions
 exactly-0.024
bek-0.023
itting-0.022
sm-0.021
rou-0.021
Token the
Token ID262
Feature activation+1.95e+00

Pos logit contributions
 dot+0.127
 distances+0.123
 pavement+0.122
 horizon+0.116
 dashboard+0.115
Neg logit contributions
 exactly-0.115
itting-0.111
str-0.095
bek-0.094
rou-0.094
Token sheep
Token ID15900
Feature activation+7.91e-01

Pos logit contributions
 dot+0.079
 pavement+0.076
 horizon+0.075
 distances+0.074
 wheels+0.073
Neg logit contributions
 exactly-0.083
itting-0.077
live-0.069
sm-0.069
bek-0.067
Token it
Token ID340
Feature activation-1.34e-01

Pos logit contributions
 pavement+0.006
 dot+0.006
 distances+0.006
 horizon+0.006
 sides+0.006
Neg logit contributions
itting-0.007
str-0.006
o-0.006
 exactly-0.006
sm-0.006
Token to
Token ID284
Feature activation-1.74e-01

Pos logit contributions
itting+0.004
str+0.004
 exactly+0.004
itt+0.004
o+0.004
Neg logit contributions
 distances-0.006
 dot-0.006
 pavement-0.006
 horizon-0.006
 sides-0.006
Token her
Token ID607
Feature activation+3.77e-01

Pos logit contributions
itting+0.006
 exactly+0.006
str+0.005
sm+0.005
o+0.005
Neg logit contributions
 dot-0.007
 pavement-0.007
 distances-0.007
 horizon-0.007
 surfaces-0.007
Token friends
Token ID2460
Feature activation+8.64e-01

Pos logit contributions
 dot+0.016
 pavement+0.015
 horizon+0.014
 distances+0.014
 sides+0.014
Neg logit contributions
itting-0.014
 exactly-0.014
str-0.013
 known-0.012
sm-0.012
Token the
Token ID262
Feature activation+9.91e-01

Pos logit contributions
 dot+0.025
 distances+0.025
 pavement+0.024
 horizon+0.023
 chooses+0.023
Neg logit contributions
str-0.039
itting-0.038
 exactly-0.038
 known-0.037
itt-0.037
Token next
Token ID1306
Feature activation+2.83e+00

Pos logit contributions
 pavement+0.036
 dot+0.036
 horizon+0.035
 wheels+0.032
 floor+0.031
Neg logit contributions
 exactly-0.046
o-0.042
itting-0.042
live-0.040
sm-0.040
Token day
Token ID1110
Feature activation+4.35e-01

Pos logit contributions
 dot+0.122
 pavement+0.117
 horizon+0.114
 distances+0.112
ro+0.107
Neg logit contributions
 exactly-0.111
itting-0.096
str-0.092
 known-0.091
sm-0.086
Token!
Token ID0
Feature activation-5.05e-01

Pos logit contributions
 dot+0.017
 pavement+0.016
 distances+0.016
 horizon+0.016
ro+0.014
Neg logit contributions
 exactly-0.018
str-0.016
 known-0.016
itting-0.015
bek-0.015
Token<|endoftext|>
Token ID50256
Feature activation+2.85e-02

Pos logit contributions
 exactly+0.019
itting+0.017
str+0.016
sm+0.016
 known+0.016
Neg logit contributions
 dot-0.021
 pavement-0.020
 distances-0.020
 horizon-0.020
 dashboard-0.019
TokenKelly
Token ID34831
Feature activation-4.72e-01

Pos logit contributions
 dot+0.001
 distances+0.001
 pavement+0.001
 horizon+0.001
 dashboard+0.001
Neg logit contributions
 exactly-0.001
itting-0.001
str-0.001
 known-0.001
sm-0.001
Token was
Token ID373
Feature activation+2.16e-01

Pos logit contributions
 exactly+0.019
itting+0.017
sm+0.016
str+0.016
o+0.015
Neg logit contributions
 dot-0.019
 distances-0.018
 pavement-0.018
 horizon-0.018
 dashboard-0.017
Token used
Token ID973
Feature activation-4.02e-01

Pos logit contributions
 exactly+0.021
itting+0.018
str+0.017
sm+0.017
o+0.017
Neg logit contributions
 dot-0.024
 pavement-0.023
 distances-0.022
 horizon-0.022
 dashboard-0.021
Token his
Token ID465
Feature activation-6.18e-02

Pos logit contributions
 exactly+0.014
itting+0.014
str+0.013
sm+0.012
bek+0.012
Neg logit contributions
 dot-0.017
 distances-0.017
 pavement-0.017
 horizon-0.016
 wheels-0.016
Token wings
Token ID12098
Feature activation+9.73e-01

Pos logit contributions
 exactly+0.003
itting+0.002
sm+0.002
str+0.002
 known+0.002
Neg logit contributions
 dot-0.002
 pavement-0.002
 distances-0.002
 horizon-0.002
 dashboard-0.002
Token to
Token ID284
Feature activation+5.09e-01

Pos logit contributions
 distances+0.048
 dot+0.048
ro+0.046
 pavement+0.046
 horizon+0.045
Neg logit contributions
 exactly-0.025
bek-0.025
str-0.024
itting-0.024
sm-0.023
Token cover
Token ID3002
Feature activation+1.58e-01

Pos logit contributions
 dot+0.017
 distances+0.016
 pavement+0.016
 horizon+0.015
 dashboard+0.015
Neg logit contributions
 exactly-0.024
itting-0.022
sm-0.021
str-0.021
 known-0.020
Token himself
Token ID2241
Feature activation+2.68e+00

Pos logit contributions
 dot+0.006
 pavement+0.006
 distances+0.006
 horizon+0.006
 dashboard+0.006
Neg logit contributions
 exactly-0.006
itting-0.006
str-0.005
sm-0.005
 known-0.005
Token.
Token ID13
Feature activation+4.44e-02

Pos logit contributions
 dot+0.127
 pavement+0.122
 horizon+0.120
 distances+0.120
 spoiler+0.117
Neg logit contributions
 exactly-0.092
itting-0.075
sm-0.069
 wrong-0.068
 feeding-0.068
Token He
Token ID679
Feature activation+1.16e-01

Pos logit contributions
 dot+0.001
 distances+0.001
 horizon+0.001
 pavement+0.001
 dashboard+0.001
Neg logit contributions
 exactly-0.002
itting-0.002
o-0.002
sm-0.002
 known-0.002
Token stayed
Token ID9658
Feature activation+9.30e-01

Pos logit contributions
 dot+0.005
 pavement+0.005
 distances+0.005
 horizon+0.005
 spoiler+0.005
Neg logit contributions
 exactly-0.004
o-0.003
 known-0.003
sm-0.003
str-0.003
Token dry
Token ID5894
Feature activation-2.12e-01

Pos logit contributions
 dot+0.040
 distances+0.040
 pavement+0.038
 horizon+0.037
 wheels+0.037
Neg logit contributions
itting-0.031
 exactly-0.031
str-0.030
sm-0.028
o-0.027
Token and
Token ID290
Feature activation-5.79e-01

Pos logit contributions
 exactly+0.008
itting+0.007
sm+0.007
str+0.007
 known+0.007
Neg logit contributions
 dot-0.009
 pavement-0.008
 distances-0.008
 horizon-0.008
 dashboard-0.008
Token day
Token ID1110
Feature activation-2.68e-01

Pos logit contributions
 dot+0.033
 pavement+0.032
 horizon+0.030
 distances+0.030
ro+0.029
Neg logit contributions
 exactly-0.023
itting-0.019
 known-0.018
str-0.017
uc-0.017
Token,
Token ID11
Feature activation+1.92e-01

Pos logit contributions
 exactly+0.012
 known+0.011
bek+0.011
itting+0.011
rou+0.011
Neg logit contributions
 dot-0.009
 pavement-0.008
 distances-0.008
ro-0.008
 Novel-0.008
Token her
Token ID607
Feature activation-1.89e-01

Pos logit contributions
 dot+0.008
 pavement+0.007
ro+0.007
 wheels+0.006
 distances+0.006
Neg logit contributions
 exactly-0.008
itting-0.007
str-0.007
rou-0.007
 known-0.007
Token mom
Token ID1995
Feature activation-3.28e-01

Pos logit contributions
 exactly+0.007
itting+0.006
 known+0.006
sm+0.006
bek+0.006
Neg logit contributions
 dot-0.008
 pavement-0.008
 horizon-0.007
 wheels-0.007
 dashboard-0.007
Tokenmy
Token ID1820
Feature activation+8.37e-01

Pos logit contributions
 exactly+0.013
itting+0.012
bek+0.011
sm+0.011
str+0.011
Neg logit contributions
 dot-0.012
 pavement-0.011
 distances-0.011
 horizon-0.011
 wheels-0.011
Token took
Token ID1718
Feature activation-1.07e-02

Pos logit contributions
 dot+0.032
 pavement+0.029
 distances+0.028
ro+0.027
 horizon+0.027
Neg logit contributions
 exactly-0.034
itting-0.032
bek-0.028
sm-0.028
str-0.028
Token her
Token ID607
Feature activation+9.97e-01

Pos logit contributions
 exactly+0.000
itting+0.000
str+0.000
sm+0.000
 known+0.000
Neg logit contributions
 dot-0.001
 pavement-0.001
 distances-0.001
 horizon-0.000
 dashboard-0.000
Token to
Token ID284
Feature activation+4.92e-01

Pos logit contributions
 dot+0.052
 pavement+0.050
 distances+0.048
 wheels+0.047
 horizon+0.047
Neg logit contributions
 exactly-0.028
itting-0.025
 known-0.024
str-0.023
bek-0.022
Token her
Token ID607
Feature activation+1.13e+00

Pos logit contributions
 dot+0.026
 distances+0.025
 pavement+0.025
 horizon+0.025
 spoiler+0.024
Neg logit contributions
 exactly-0.013
 known-0.011
itting-0.011
sm-0.010
str-0.010
Token office
Token ID2607
Feature activation-5.03e-01

Pos logit contributions
 dot+0.031
 pavement+0.030
 horizon+0.028
 distances+0.028
 dashboard+0.027
Neg logit contributions
 exactly-0.056
itting-0.056
str-0.053
 known-0.052
sm-0.051
Token.
Token ID13
Feature activation+6.18e-01

Pos logit contributions
str+0.016
 exactly+0.016
itting+0.016
 known+0.015
bek+0.015
Neg logit contributions
 pavement-0.022
 floor-0.021
 distances-0.020
 dashboard-0.020
ro-0.020
Token big
Token ID1263
Feature activation+1.10e+00

Pos logit contributions
 exactly+0.018
itting+0.017
sm+0.015
uc+0.015
ll+0.015
Neg logit contributions
 dot-0.014
 pavement-0.013
 horizon-0.013
 spoiler-0.012
 wheels-0.012
Token,
Token ID11
Feature activation+6.04e-01

Pos logit contributions
 dot+0.044
 distances+0.044
 pavement+0.043
 horizon+0.042
 dashboard+0.041
Neg logit contributions
 exactly-0.043
itting-0.038
str-0.036
sm-0.036
 wrong-0.035
Token scary
Token ID14343
Feature activation+5.42e-01

Pos logit contributions
 dot+0.021
 distances+0.021
 pavement+0.021
 horizon+0.020
 dashboard+0.019
Neg logit contributions
 exactly-0.027
itting-0.024
sm-0.023
str-0.023
 known-0.023
Token bird
Token ID6512
Feature activation-1.56e+00

Pos logit contributions
 dot+0.022
 distances+0.021
 pavement+0.021
 horizon+0.020
 dashboard+0.020
Neg logit contributions
 exactly-0.022
itting-0.020
sm-0.018
str-0.018
 known-0.018
Token that
Token ID326
Feature activation-2.01e-01

Pos logit contributions
 exactly+0.058
itting+0.056
 known+0.054
bek+0.053
uc+0.052
Neg logit contributions
 dot-0.065
 distances-0.061
 pavement-0.058
ro-0.057
 lands-0.056
Token lived
Token ID5615
Feature activation+1.47e+00

Pos logit contributions
 exactly+0.008
itting+0.008
sm+0.007
uc+0.007
str+0.007
Neg logit contributions
 dot-0.008
 pavement-0.007
 distances-0.007
 horizon-0.007
 dashboard-0.007
Token in
Token ID287
Feature activation+9.37e-01

Pos logit contributions
 distances+0.042
 myself+0.041
ro+0.041
 surface+0.040
 dot+0.037
Neg logit contributions
str-0.064
ama-0.057
ip-0.057
uc-0.057
itting-0.057
Token the
Token ID262
Feature activation+1.41e-01

Pos logit contributions
 dot+0.045
 pavement+0.044
 distances+0.044
 horizon+0.043
 dashboard+0.042
Neg logit contributions
 exactly-0.030
itting-0.026
sm-0.024
str-0.024
 known-0.023
Token park
Token ID3952
Feature activation-2.29e-01

Pos logit contributions
 dot+0.005
 pavement+0.005
 horizon+0.004
 distances+0.004
 dashboard+0.004
Neg logit contributions
 exactly-0.007
itting-0.006
sm-0.006
uc-0.006
o-0.006
Token too
Token ID1165
Feature activation-5.86e-01

Pos logit contributions
 exactly+0.010
itting+0.009
str+0.009
uc+0.009
bek+0.009
Neg logit contributions
 distances-0.008
 pavement-0.008
 dot-0.008
 horizon-0.008
ro-0.007
Token.
Token ID13
Feature activation+2.45e-01

Pos logit contributions
 exactly+0.019
itting+0.017
str+0.016
 known+0.016
sm+0.015
Neg logit contributions
 dot-0.028
 distances-0.027
 pavement-0.026
 horizon-0.026
 dashboard-0.025
Token liked
Token ID8288
Feature activation-4.49e-01

Pos logit contributions
 exactly+0.016
itting+0.014
sm+0.013
o+0.013
str+0.013
Neg logit contributions
 dot-0.015
 distances-0.015
 pavement-0.015
 horizon-0.014
 dashboard-0.014
Token to
Token ID284
Feature activation-6.36e-02

Pos logit contributions
 exactly+0.015
itting+0.014
bek+0.013
 known+0.012
str+0.012
Neg logit contributions
 pavement-0.021
 dot-0.021
 wheels-0.020
 distances-0.020
 pedals-0.020
Token watch
Token ID2342
Feature activation-1.10e+00

Pos logit contributions
 exactly+0.002
 helping+0.002
 named+0.002
itting+0.002
 known+0.002
Neg logit contributions
 horizon-0.003
 dot-0.002
 distances-0.002
 pavement-0.002
 dashboard-0.002
Token the
Token ID262
Feature activation+1.61e-01

Pos logit contributions
 exactly+0.041
itting+0.035
sm+0.033
 feeding+0.033
str+0.033
Neg logit contributions
 dot-0.047
 distances-0.045
 pavement-0.045
 horizon-0.044
 dashboard-0.043
Token sun
Token ID4252
Feature activation-5.71e-01

Pos logit contributions
 dot+0.006
 pavement+0.005
 horizon+0.005
 distances+0.005
 wheels+0.005
Neg logit contributions
 exactly-0.008
itting-0.007
 known-0.007
o-0.007
live-0.007
Token rise
Token ID4485
Feature activation+6.37e-01

Pos logit contributions
 exactly+0.024
itting+0.021
sm+0.020
str+0.020
 known+0.020
Neg logit contributions
 dot-0.022
 pavement-0.021
 distances-0.021
 horizon-0.021
 dashboard-0.020
Token every
Token ID790
Feature activation+1.99e+00

Pos logit contributions
 dot+0.028
 distances+0.027
 pavement+0.027
 horizon+0.026
 dashboard+0.025
Neg logit contributions
 exactly-0.023
itting-0.020
str-0.019
sm-0.019
 known-0.019
Token morning
Token ID3329
Feature activation-3.00e-01

Pos logit contributions
 dot+0.093
 pavement+0.089
 horizon+0.088
 distances+0.087
 sides+0.081
Neg logit contributions
 exactly-0.071
itting-0.062
sm-0.056
str-0.056
 known-0.056
Token.
Token ID13
Feature activation+7.29e-03

Pos logit contributions
 exactly+0.011
itting+0.009
str+0.009
sm+0.009
 known+0.008
Neg logit contributions
 dot-0.014
 pavement-0.013
 distances-0.013
 horizon-0.013
 dashboard-0.012
Token She
Token ID1375
Feature activation+5.92e-01

Pos logit contributions
 dot+0.000
 distances+0.000
 horizon+0.000
 pavement+0.000
 dashboard+0.000
Neg logit contributions
 exactly-0.000
itting-0.000
o-0.000
sm-0.000
 helping-0.000
Token would
Token ID561
Feature activation-3.41e-01

Pos logit contributions
 dot+0.027
 pavement+0.026
 distances+0.026
 horizon+0.025
 dashboard+0.025
Neg logit contributions
 exactly-0.021
itting-0.018
sm-0.017
str-0.017
 known-0.017
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
 dot+0.061
 pavement+0.059
 distances+0.058
 horizon+0.057
 dashboard+0.055
Neg logit contributions
 exactly-0.054
itting-0.048
str-0.045
sm-0.045
 known-0.045
Token But
Token ID887
Feature activation-4.80e-02

Pos logit contributions
 exactly+0.011
itting+0.010
sm+0.009
str+0.009
 known+0.009
Neg logit contributions
 dot-0.008
 pavement-0.008
 distances-0.008
 horizon-0.007
 dashboard-0.007
Token when
Token ID618
Feature activation-4.73e-01

Pos logit contributions
 exactly+0.002
itting+0.002
sm+0.002
str+0.002
 known+0.002
Neg logit contributions
 dot-0.002
 distances-0.002
 pavement-0.002
 horizon-0.002
ro-0.002
Token Lily
Token ID20037
Feature activation-6.58e-01

Pos logit contributions
 exactly+0.018
itting+0.017
sm+0.016
str+0.015
rou+0.015
Neg logit contributions
 dot-0.019
 distances-0.019
 pavement-0.019
 horizon-0.018
 dashboard-0.017
Token put
Token ID1234
Feature activation+9.21e-01

Pos logit contributions
 exactly+0.023
itting+0.021
o+0.020
sm+0.020
str+0.019
Neg logit contributions
 dot-0.029
 distances-0.028
 pavement-0.028
 horizon-0.028
 spoiler-0.027
Token it
Token ID340
Feature activation+4.58e-01

Pos logit contributions
 dot+0.042
 pavement+0.041
 distances+0.041
 horizon+0.040
 dashboard+0.039
Neg logit contributions
 exactly-0.031
itting-0.028
sm-0.026
str-0.026
 known-0.025
Token on
Token ID319
Feature activation+8.50e-01

Pos logit contributions
 dot+0.022
 pavement+0.022
 distances+0.022
 horizon+0.021
 dashboard+0.021
Neg logit contributions
 exactly-0.014
itting-0.012
sm-0.011
str-0.011
 known-0.011
Token,
Token ID11
Feature activation+2.52e-01

Pos logit contributions
 dot+0.037
 pavement+0.036
 distances+0.036
 horizon+0.035
 dashboard+0.034
Neg logit contributions
 exactly-0.030
itting-0.028
str-0.026
sm-0.025
 known-0.025
Token she
Token ID673
Feature activation-3.84e-01

Pos logit contributions
 dot+0.011
 pavement+0.011
 distances+0.011
 horizon+0.010
 wheels+0.010
Neg logit contributions
 exactly-0.008
itting-0.008
str-0.007
sm-0.007
o-0.007
Token noticed
Token ID6810
Feature activation-2.45e-01

Pos logit contributions
 exactly+0.015
itting+0.013
str+0.012
sm+0.012
 known+0.012
Neg logit contributions
 dot-0.016
 pavement-0.016
 distances-0.016
 horizon-0.015
 dashboard-0.015
Token something
Token ID1223
Feature activation+1.80e+00

Pos logit contributions
itting+0.008
 exactly+0.008
bek+0.007
str+0.007
ll+0.007
Neg logit contributions
 dot-0.011
 pavement-0.011
 distances-0.010
 wheels-0.010
 horizon-0.010
TokenHow
Token ID2437
Feature activation-4.97e-01

Pos logit contributions
 dot+0.009
 distances+0.009
 pavement+0.009
 horizon+0.009
ro+0.008
Neg logit contributions
 exactly-0.012
itting-0.010
str-0.010
sm-0.010
 wrong-0.010
Token do
Token ID466
Feature activation+2.20e+00

Pos logit contributions
 exactly+0.022
itting+0.020
 feeding+0.020
sm+0.019
str+0.019
Neg logit contributions
 dot-0.017
 pavement-0.015
 distances-0.015
 dashboard-0.014
 Novel-0.014
Token you
Token ID345
Feature activation-3.37e-01

Pos logit contributions
 dot+0.063
 distances+0.062
 pavement+0.060
 myself+0.055
 sides+0.055
Neg logit contributions
 exactly-0.107
itting-0.102
rou-0.096
str-0.094
sm-0.094
Token hang
Token ID8181
Feature activation-7.33e-01

Pos logit contributions
 exactly+0.015
itting+0.013
 known+0.013
str+0.012
 wrong+0.012
Neg logit contributions
 dot-0.012
 distances-0.012
 pavement-0.012
 horizon-0.012
 dashboard-0.012
Token from
Token ID422
Feature activation+2.20e+00

Pos logit contributions
 exactly+0.024
itting+0.020
sm+0.019
o+0.019
 wrong+0.019
Neg logit contributions
 dot-0.035
 pavement-0.034
 distances-0.034
 horizon-0.033
 dashboard-0.033
Token the
Token ID262
Feature activation+1.99e-01

Pos logit contributions
 dot+0.098
 distances+0.094
 pavement+0.094
 horizon+0.092
 dashboard+0.089
Neg logit contributions
 exactly-0.078
itting-0.071
sm-0.065
str-0.065
 known-0.064
Token roof
Token ID9753
Feature activation-1.05e+00

Pos logit contributions
 dot+0.008
 pavement+0.008
 distances+0.008
 horizon+0.008
 dashboard+0.008
Neg logit contributions
 exactly-0.007
itting-0.007
sm-0.006
str-0.006
 known-0.006
Token?"
Token ID1701
Feature activation+1.17e+00

Pos logit contributions
 exactly+0.042
itting+0.037
o+0.035
str+0.035
sm+0.034
Neg logit contributions
 dot-0.043
 pavement-0.041
 distances-0.041
 dashboard-0.039
 horizon-0.039
Token
Token ID198
Feature activation-3.44e-01

Pos logit contributions
 dot+0.044
 pavement+0.043
ro+0.042
 distances+0.041
 horizon+0.040
Neg logit contributions
 exactly-0.049
itting-0.045
sm-0.042
str-0.041
bek-0.041
Token
Token ID198
Feature activation-1.43e-03

Pos logit contributions
 exactly+0.015
itting+0.013
sm+0.013
 feeding+0.012
bek+0.012
Neg logit contributions
 dot-0.012
 distances-0.011
 pavement-0.011
 horizon-0.011
ro-0.011
TokenS
Token ID50
Feature activation-3.18e-01

Pos logit contributions
 exactly+0.000
itting+0.000
 feeding+0.000
 wrong+0.000
 known+0.000
Neg logit contributions
 dot-0.000
 pavement-0.000
 distances-0.000
ro-0.000
 horizon-0.000
Token,
Token ID11
Feature activation+5.39e-01

Pos logit contributions
 exactly+0.005
itting+0.005
 known+0.005
str+0.005
sm+0.005
Neg logit contributions
 dot-0.004
 Novel-0.004
 pavement-0.003
 wheels-0.003
 distances-0.003
Token you
Token ID345
Feature activation-1.70e+00

Pos logit contributions
 dot+0.021
 pavement+0.020
 distances+0.019
 horizon+0.018
 dashboard+0.018
Neg logit contributions
 exactly-0.021
itting-0.021
str-0.019
 known-0.019
o-0.019
Token have
Token ID423
Feature activation-8.99e-01

Pos logit contributions
 exactly+0.060
 known+0.051
itting+0.050
 wrong+0.047
o+0.047
Neg logit contributions
 horizon-0.082
 distances-0.081
 pavement-0.079
 dot-0.076
 dashboard-0.075
Token a
Token ID257
Feature activation-3.93e-01

Pos logit contributions
 exactly+0.029
itting+0.026
sm+0.023
str+0.023
 known+0.023
Neg logit contributions
 dot-0.043
 distances-0.042
 pavement-0.042
 horizon-0.040
 dashboard-0.040
Token lot
Token ID1256
Feature activation-1.69e+00

Pos logit contributions
 exactly+0.015
itting+0.014
sm+0.013
o+0.013
str+0.013
Neg logit contributions
 dot-0.017
 pavement-0.015
 horizon-0.015
 distances-0.015
 dashboard-0.014
Token of
Token ID286
Feature activation-5.12e-01

Pos logit contributions
 exactly+0.053
itting+0.046
sm+0.044
str+0.041
o+0.041
Neg logit contributions
 dot-0.085
 distances-0.081
 pavement-0.081
 horizon-0.079
 dashboard-0.078
Token buttons
Token ID12163
Feature activation-7.98e-01

Pos logit contributions
 exactly+0.019
 wrong+0.015
sm+0.015
 helping+0.015
 living+0.015
Neg logit contributions
 dot-0.024
 horizon-0.023
 distances-0.022
 pavement-0.022
 dashboard-0.021
Token.
Token ID13
Feature activation+3.56e-01

Pos logit contributions
 exactly+0.025
itting+0.022
sm+0.020
o+0.019
 wrong+0.019
Neg logit contributions
 dot-0.039
 pavement-0.037
 distances-0.037
 horizon-0.037
 dashboard-0.035
Token What
Token ID1867
Feature activation-1.07e+00

Pos logit contributions
 dot+0.013
 horizon+0.013
 distances+0.013
 pavement+0.013
 widened+0.012
Neg logit contributions
 exactly-0.014
itting-0.013
o-0.012
 Enjoy-0.012
sm-0.012
Token are
Token ID389
Feature activation+1.29e+00

Pos logit contributions
 exactly+0.049
itting+0.044
str+0.043
sm+0.042
uc+0.041
Neg logit contributions
 dot-0.036
 distances-0.033
 pavement-0.032
 dashboard-0.030
 Novel-0.030
Token you
Token ID345
Feature activation-3.88e-01

Pos logit contributions
 distances+0.041
 pavement+0.041
 dot+0.040
 wheels+0.040
 myself+0.039
Neg logit contributions
 exactly-0.054
itting-0.053
str-0.051
sm-0.050
rou-0.050
Token.
Token ID13
Feature activation-4.10e-01

Pos logit contributions
 exactly+0.019
str+0.018
itting+0.018
live+0.017
itt+0.017
Neg logit contributions
 dot-0.017
 distances-0.016
 horizon-0.015
 widened-0.015
 pavement-0.015
Token
Token ID198
Feature activation-4.03e-01

Pos logit contributions
 exactly+0.018
itting+0.017
sm+0.016
str+0.015
uc+0.015
Neg logit contributions
 dot-0.015
 distances-0.014
 pavement-0.014
 horizon-0.014
ro-0.013
Token
Token ID198
Feature activation-1.84e-01

Pos logit contributions
 exactly+0.019
itting+0.018
uc+0.016
icking+0.016
 wrong+0.016
Neg logit contributions
 distances-0.012
 dot-0.012
 pavement-0.012
 horizon-0.011
ro-0.011
TokenSo
Token ID2396
Feature activation-1.04e+00

Pos logit contributions
 exactly+0.008
itting+0.007
 wrong+0.007
icking+0.007
uc+0.007
Neg logit contributions
 distances-0.006
 dot-0.006
 pavement-0.006
ro-0.006
 horizon-0.006
Token,
Token ID11
Feature activation-9.05e-01

Pos logit contributions
itting+0.045
 exactly+0.042
sm+0.040
rou+0.040
eh+0.039
Neg logit contributions
 dot-0.038
 distances-0.036
 horizon-0.035
 pavement-0.034
 sides-0.034
Token together
Token ID1978
Feature activation-1.28e+00

Pos logit contributions
itting+0.041
 exactly+0.037
live+0.036
str+0.036
rou+0.036
Neg logit contributions
 dot-0.032
 horizon-0.029
 distances-0.028
 pavement-0.028
ro-0.026
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
itting+0.052
rou+0.052
eh+0.051
Acc+0.050
 exactly+0.049
Neg logit contributions
 dot-0.048
 distances-0.045
 horizon-0.043
 widened-0.043
 pavement-0.041
Token the
Token ID262
Feature activation-6.19e-01

Pos logit contributions
itting+0.049
 exactly+0.046
live+0.044
str+0.044
uc+0.043
Neg logit contributions
 dot-0.047
 distances-0.045
 horizon-0.044
 pavement-0.044
 wheels-0.041
Token frog
Token ID21264
Feature activation-4.22e-01

Pos logit contributions
 exactly+0.026
itting+0.024
sm+0.022
str+0.022
 known+0.022
Neg logit contributions
 dot-0.024
 pavement-0.023
 distances-0.022
 horizon-0.022
 dashboard-0.021
Token and
Token ID290
Feature activation+3.49e-01

Pos logit contributions
 exactly+0.018
itting+0.017
 wrong+0.016
str+0.016
sm+0.016
Neg logit contributions
 dot-0.015
 distances-0.015
 horizon-0.014
 pavement-0.014
ro-0.013
Token the
Token ID262
Feature activation+6.74e-01

Pos logit contributions
 dot+0.014
 pavement+0.012
 distances+0.012
 horizon+0.012
ro+0.011
Neg logit contributions
itting-0.015
 exactly-0.015
ll-0.013
rou-0.013
live-0.013
Token."
Token ID526
Feature activation+4.65e-01

Pos logit contributions
 dot+0.032
 pavement+0.031
 distances+0.030
 horizon+0.030
 dashboard+0.029
Neg logit contributions
 exactly-0.025
itting-0.022
sm-0.020
o-0.020
str-0.020
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
 dot+0.018
 pavement+0.018
 distances+0.018
 horizon+0.017
 dashboard+0.017
Neg logit contributions
 exactly-0.019
itting-0.017
sm-0.016
str-0.016
 known-0.015
Token
Token ID198
Feature activation+3.62e-01

Pos logit contributions
 dot+0.000
 distances+0.000
 pavement+0.000
 horizon+0.000
 dashboard+0.000
Neg logit contributions
 exactly-0.000
itting-0.000
 known-0.000
uc-0.000
bek-0.000
TokenTom
Token ID13787
Feature activation-7.09e-01

Pos logit contributions
 dot+0.011
 distances+0.010
 pavement+0.010
ro+0.010
 dashboard+0.010
Neg logit contributions
 exactly-0.018
itting-0.016
icking-0.016
 wrong-0.015
uc-0.015
Token and
Token ID290
Feature activation+3.38e-01

Pos logit contributions
 exactly+0.032
itting+0.029
sm+0.028
str+0.028
o+0.028
Neg logit contributions
 dot-0.025
 pavement-0.023
 distances-0.023
 horizon-0.022
 dashboard-0.022
Token Mia
Token ID32189
Feature activation-1.73e-01

Pos logit contributions
 dot+0.012
 pavement+0.012
 distances+0.011
ro+0.011
 horizon+0.011
Neg logit contributions
 exactly-0.015
itting-0.014
str-0.013
sm-0.013
rou-0.012
Token say
Token ID910
Feature activation-8.86e-03

Pos logit contributions
 exactly+0.007
itting+0.007
sm+0.006
str+0.006
 known+0.006
Neg logit contributions
 dot-0.007
 pavement-0.006
 distances-0.006
 horizon-0.006
 dashboard-0.006
Token,
Token ID11
Feature activation-2.86e-01

Pos logit contributions
eh+0.000
live+0.000
itting+0.000
str+0.000
 exactly+0.000
Neg logit contributions
 Novel-0.000
 dot-0.000
 spoiler-0.000
 wheels-0.000
 distances-0.000
Token "
Token ID366
Feature activation-1.50e-01

Pos logit contributions
 exactly+0.012
itting+0.011
ats+0.010
live+0.010
eh+0.010
Neg logit contributions
 dot-0.011
ro-0.010
 pavement-0.010
 Novel-0.010
 spoiler-0.010
TokenWe
Token ID1135
Feature activation-1.17e+00

Pos logit contributions
 exactly+0.005
itting+0.005
sm+0.004
str+0.004
 known+0.004
Neg logit contributions
 dot-0.007
 distances-0.007
 pavement-0.007
 horizon-0.006
 dashboard-0.006
Token will
Token ID481
Feature activation+1.08e-01

Pos logit contributions
 exactly+0.057
itting+0.054
sm+0.053
str+0.051
eh+0.051
Neg logit contributions
 dot-0.035
 spoiler-0.030
 distances-0.029
 pavement-0.029
 wheels-0.029
Token back
Token ID736
Feature activation-1.11e+00

Pos logit contributions
 dot+0.006
 distances+0.006
 pavement+0.006
 horizon+0.006
 dashboard+0.005
Neg logit contributions
 exactly-0.003
itting-0.003
str-0.003
 known-0.002
sm-0.002
Token to
Token ID284
Feature activation-4.06e-01

Pos logit contributions
 exactly+0.020
itting+0.016
sm+0.013
str+0.013
 known+0.013
Neg logit contributions
 dot-0.068
 pavement-0.067
 distances-0.066
 horizon-0.065
 dashboard-0.064
Token grandma
Token ID49890
Feature activation-8.27e-01

Pos logit contributions
 exactly+0.015
itting+0.015
str+0.013
sm+0.013
uc+0.012
Neg logit contributions
 dot-0.017
 pavement-0.017
 distances-0.017
 horizon-0.016
 dashboard-0.016
Token and
Token ID290
Feature activation+4.86e-01

Pos logit contributions
 exactly+0.041
itting+0.040
 known+0.038
str+0.038
bek+0.036
Neg logit contributions
 dot-0.025
 distances-0.025
 pavement-0.023
 horizon-0.022
ro-0.022
Token told
Token ID1297
Feature activation+3.40e-01

Pos logit contributions
 dot+0.018
 pavement+0.017
 distances+0.017
 dashboard+0.017
 wheels+0.016
Neg logit contributions
itting-0.020
 exactly-0.020
str-0.017
sm-0.017
rou-0.017
Token her
Token ID607
Feature activation-4.91e-01

Pos logit contributions
 dot+0.015
 pavement+0.015
 distances+0.015
 horizon+0.014
 dashboard+0.014
Neg logit contributions
 exactly-0.012
itting-0.011
str-0.010
o-0.010
bek-0.010
Token what
Token ID644
Feature activation-1.89e+00

Pos logit contributions
itting+0.020
eh+0.019
bek+0.019
P+0.018
live+0.018
Neg logit contributions
 Novel-0.018
 opens-0.017
 seem-0.017
 buttons-0.016
bell-0.016
Token she
Token ID673
Feature activation+1.07e+00

Pos logit contributions
 exactly+0.081
itting+0.073
icking+0.068
bek+0.066
rou+0.065
Neg logit contributions
 dot-0.069
 pavement-0.066
 Novel-0.065
ro-0.063
 wheels-0.062
Token saw
Token ID2497
Feature activation+4.47e-01

Pos logit contributions
 dot+0.051
 distances+0.050
 pavement+0.049
 horizon+0.048
 spoiler+0.047
Neg logit contributions
 exactly-0.037
itting-0.031
sm-0.031
o-0.030
 known-0.030
Token.
Token ID13
Feature activation+2.66e-01

Pos logit contributions
 dot+0.018
 pavement+0.018
 distances+0.018
 horizon+0.017
 dashboard+0.017
Neg logit contributions
 exactly-0.017
itting-0.016
str-0.015
 known-0.015
o-0.014
Token Grand
Token ID5675
Feature activation+1.25e+00

Pos logit contributions
 dot+0.012
 distances+0.012
 pavement+0.011
 horizon+0.011
 spoiler+0.011
Neg logit contributions
 exactly-0.009
itting-0.008
 wrong-0.007
 known-0.007
 living-0.007
Token then
Token ID788
Feature activation+5.30e-01

Pos logit contributions
itting+0.023
sm+0.022
o+0.022
str+0.021
 exactly+0.021
Neg logit contributions
 dot-0.014
 horizon-0.013
 pavement-0.013
 distances-0.013
 wheels-0.013
Token one
Token ID530
Feature activation+3.92e-01

Pos logit contributions
 dot+0.021
 distances+0.020
 pavement+0.020
 horizon+0.020
 wheels+0.019
Neg logit contributions
itting-0.019
 exactly-0.019
o-0.018
ll-0.018
str-0.017
Token day
Token ID1110
Feature activation+6.03e-01

Pos logit contributions
 dot+0.016
 horizon+0.014
 pavement+0.014
ro+0.014
 sides+0.013
Neg logit contributions
 exactly-0.016
uc-0.014
itting-0.014
str-0.013
ll-0.013
Token,
Token ID11
Feature activation-8.91e-02

Pos logit contributions
 dot+0.024
 pavement+0.023
 distances+0.022
ro+0.022
 horizon+0.022
Neg logit contributions
 exactly-0.023
itting-0.022
o-0.021
ll-0.021
rou-0.020
Token while
Token ID981
Feature activation-9.21e-02

Pos logit contributions
itting+0.004
o+0.004
ll+0.004
str+0.003
rou+0.003
Neg logit contributions
 dot-0.003
 pavement-0.003
 wheels-0.003
 horizon-0.003
ro-0.003
Token she
Token ID673
Feature activation-6.60e-01

Pos logit contributions
 exactly+0.004
itting+0.003
str+0.003
ll+0.003
o+0.003
Neg logit contributions
 dot-0.004
 pavement-0.003
 distances-0.003
 horizon-0.003
 wheels-0.003
Token was
Token ID373
Feature activation-2.19e-01

Pos logit contributions
 exactly+0.021
o+0.017
sm+0.017
itting+0.017
 known+0.016
Neg logit contributions
 pavement-0.032
 distances-0.032
 dashboard-0.032
 dot-0.031
 horizon-0.031
Token still
Token ID991
Feature activation-6.09e-01

Pos logit contributions
bek+0.007
str+0.007
itting+0.007
Deal+0.007
 exactly+0.007
Neg logit contributions
 distances-0.009
 dot-0.009
 pavement-0.009
 horizon-0.009
 wheels-0.008
Token repeating
Token ID20394
Feature activation-1.64e+00

Pos logit contributions
 exactly+0.020
str+0.019
itting+0.019
live+0.018
bek+0.018
Neg logit contributions
 dot-0.027
 distances-0.026
 pavement-0.025
 horizon-0.025
 wheels-0.024
Token the
Token ID262
Feature activation-3.56e-01

Pos logit contributions
 exactly+0.056
itting+0.053
str+0.050
sm+0.049
o+0.049
Neg logit contributions
 dot-0.072
 distances-0.071
 pavement-0.070
 horizon-0.068
 dashboard-0.065
Token same
Token ID976
Feature activation+4.22e-01

Pos logit contributions
 exactly+0.017
itting+0.016
sm+0.015
o+0.015
str+0.015
Neg logit contributions
 dot-0.012
 pavement-0.011
 horizon-0.011
 distances-0.011
 dashboard-0.010
Token.
Token ID13
Feature activation+9.71e-01

Pos logit contributions
 dot+0.032
en+0.025
 opens+0.025
 guesses+0.025
 wonders+0.025
Neg logit contributions
live-0.069
liv-0.068
 exactly-0.067
bek-0.067
itting-0.066
Token Rob
Token ID3851
Feature activation+8.36e-01

Pos logit contributions
 dot+0.039
 distances+0.037
 pavement+0.036
 horizon+0.035
 surfaces+0.034
Neg logit contributions
 exactly-0.038
o-0.035
str-0.033
itting-0.032
 feeding-0.032
Token was
Token ID373
Feature activation+7.47e-01

Pos logit contributions
 dot+0.026
 widened+0.023
 distances+0.022
 heads+0.022
 wonders+0.022
Neg logit contributions
 exactly-0.038
itting-0.037
sm-0.034
rou-0.034
bek-0.033
Token carrying
Token ID6872
Feature activation+8.01e-01

Pos logit contributions
 dot+0.025
 distances+0.024
 wheels+0.024
 pavement+0.023
 widen+0.022
Neg logit contributions
itting-0.032
 exactly-0.031
bek-0.031
sm-0.029
uc-0.028
Token a
Token ID257
Feature activation-1.39e-01

Pos logit contributions
 dot+0.035
 distances+0.034
 pavement+0.034
 horizon+0.033
 dashboard+0.032
Neg logit contributions
 exactly-0.028
itting-0.027
str-0.024
sm-0.023
 known-0.023
Token whip
Token ID22972
Feature activation-2.85e+00

Pos logit contributions
 exactly+0.007
itting+0.007
 known+0.006
sm+0.006
rou+0.006
Neg logit contributions
 dot-0.005
 tear-0.004
 pavement-0.004
 wheels-0.004
 rip-0.004
Token.
Token ID13
Feature activation+5.73e-01

Pos logit contributions
 known+0.137
str+0.132
itting+0.129
 exactly+0.123
bek+0.120
Neg logit contributions
 dot-0.098
 wheels-0.087
 pavement-0.083
ro-0.083
 dashboard-0.082
Token Sam
Token ID3409
Feature activation-2.16e-01

Pos logit contributions
 dot+0.024
 distances+0.022
 horizon+0.022
 pavement+0.021
 surfaces+0.021
Neg logit contributions
 exactly-0.022
o-0.019
itting-0.018
str-0.018
 helping-0.018
Token asked
Token ID1965
Feature activation+1.22e+00

Pos logit contributions
 exactly+0.010
itting+0.009
sm+0.008
 known+0.008
rou+0.008
Neg logit contributions
 dot-0.007
 pavement-0.007
 distances-0.007
 horizon-0.006
ro-0.006
Token Rob
Token ID3851
Feature activation-4.68e-02

Pos logit contributions
 distances+0.038
 dot+0.037
 pavement+0.036
 dashboard+0.035
 horizon+0.035
Neg logit contributions
itting-0.055
 exactly-0.055
sm-0.050
str-0.049
rou-0.049
Token,
Token ID11
Feature activation+8.44e-01

Pos logit contributions
itting+0.002
bek+0.002
itt+0.002
live+0.002
str+0.002
Neg logit contributions
 opens-0.001
 wonders-0.001
 guesses-0.001
 Novel-0.001
 counts-0.001
Token help
Token ID1037
Feature activation-3.71e-01

Pos logit contributions
 dot+0.001
 distances+0.001
 pavement+0.001
 horizon+0.001
 dashboard+0.001
Neg logit contributions
 exactly-0.001
itting-0.000
sm-0.000
str-0.000
 known-0.000
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
 exactly+0.015
itting+0.015
rou+0.014
bek+0.014
 known+0.014
Neg logit contributions
 pavement-0.014
 dot-0.013
 distances-0.013
 surfaces-0.013
 Novel-0.012
Token
Token ID198
Feature activation-1.43e-01

Pos logit contributions
 dot+0.000
 pavement+0.000
 distances+0.000
 horizon+0.000
ro+0.000
Neg logit contributions
 exactly-0.001
itting-0.000
 known-0.000
rou-0.000
sm-0.000
Token
Token ID198
Feature activation+6.97e-02

Pos logit contributions
 exactly+0.006
itting+0.005
 known+0.005
sm+0.005
rou+0.005
Neg logit contributions
 dot-0.005
 distances-0.005
 pavement-0.005
 horizon-0.005
 dashboard-0.005
TokenK
Token ID42
Feature activation-9.69e-01

Pos logit contributions
 distances+0.003
 pavement+0.003
 dot+0.003
 dashboard+0.003
ro+0.003
Neg logit contributions
 exactly-0.003
itting-0.002
rou-0.002
 known-0.002
uc-0.002
Tokenitty
Token ID9760
Feature activation-2.92e+00

Pos logit contributions
 exactly+0.014
itting+0.011
sm+0.009
str+0.009
o+0.008
Neg logit contributions
 dot-0.065
 pavement-0.063
 distances-0.063
 horizon-0.062
 dashboard-0.061
Token's
Token ID338
Feature activation-5.25e-01

Pos logit contributions
itting+0.144
 exactly+0.143
bek+0.124
rou+0.124
 known+0.123
Neg logit contributions
 dot-0.082
 distances-0.079
 pavement-0.078
 heads-0.076
 wheels-0.075
Token friends
Token ID2460
Feature activation+8.88e-01

Pos logit contributions
 exactly+0.015
itting+0.015
bek+0.014
str+0.014
sm+0.013
Neg logit contributions
 dot-0.025
 wheels-0.025
 sides-0.025
 pavement-0.024
 distances-0.024
Token,
Token ID11
Feature activation+1.03e+00

Pos logit contributions
 widened+0.022
 dot+0.022
 heads+0.020
 pavement+0.019
 tear+0.019
Neg logit contributions
 exactly-0.043
itting-0.043
live-0.042
bek-0.042
rou-0.042
Token a
Token ID257
Feature activation-5.15e-01

Pos logit contributions
 pedals+0.026
 wheels+0.026
 buttons+0.025
 pavement+0.025
 dot+0.025
Neg logit contributions
itting-0.049
bek-0.046
 exactly-0.045
uc-0.045
itt-0.045
Token dog
Token ID3290
Feature activation-6.40e-01

Pos logit contributions
 exactly+0.028
itting+0.026
ied+0.024
sm+0.024
icking+0.023
Neg logit contributions
 dot-0.016
 tear-0.014
 pavement-0.013
ro-0.012
 wheels-0.012
Token they
Token ID484
Feature activation-2.77e-01

Pos logit contributions
 dot+0.009
 horizon+0.009
 distances+0.009
 pavement+0.009
 surface+0.008
Neg logit contributions
itting-0.007
 exactly-0.007
sm-0.007
rou-0.007
o-0.007
Token were
Token ID547
Feature activation+8.83e-01

Pos logit contributions
 exactly+0.010
itting+0.009
 known+0.008
sm+0.008
str+0.008
Neg logit contributions
 dot-0.012
 pavement-0.012
 distances-0.012
 horizon-0.012
 dashboard-0.011
Token finished
Token ID5201
Feature activation+8.93e-02

Pos logit contributions
 dot+0.040
 distances+0.040
 pavement+0.039
 horizon+0.039
 wheels+0.037
Neg logit contributions
 exactly-0.027
itting-0.026
sm-0.025
str-0.024
bek-0.024
Token,
Token ID11
Feature activation-9.96e-01

Pos logit contributions
 dot+0.003
 distances+0.003
 pavement+0.003
 horizon+0.003
 surfaces+0.003
Neg logit contributions
 exactly-0.003
itting-0.003
sm-0.003
str-0.003
live-0.003
Token Nem
Token ID22547
Feature activation-4.31e-01

Pos logit contributions
itting+0.036
 exactly+0.035
sm+0.032
o+0.031
str+0.031
Neg logit contributions
 dot-0.042
 pavement-0.040
 horizon-0.040
 distances-0.039
 spoiler-0.037
Tokeno
Token ID78
Feature activation-2.75e+00

Pos logit contributions
 exactly+0.008
itting+0.005
 wrong+0.005
sm+0.005
 known+0.004
Neg logit contributions
 dot-0.028
ro-0.026
 pavement-0.026
 distances-0.026
 horizon-0.025
Token put
Token ID1234
Feature activation+3.84e-01

Pos logit contributions
 exactly+0.111
itting+0.102
sm+0.094
str+0.094
 known+0.093
Neg logit contributions
 dot-0.109
 pavement-0.103
 distances-0.103
 horizon-0.100
 dashboard-0.097
Token the
Token ID262
Feature activation+7.18e-01

Pos logit contributions
 dot+0.016
 pavement+0.015
 distances+0.015
 horizon+0.015
 wheels+0.014
Neg logit contributions
itting-0.014
 exactly-0.014
str-0.013
sm-0.012
 known-0.012
Token necklace
Token ID38987
Feature activation+2.64e-01

Pos logit contributions
 dot+0.024
 pavement+0.022
 distances+0.022
 horizon+0.022
 dashboard+0.021
Neg logit contributions
 exactly-0.034
itting-0.031
sm-0.030
str-0.029
 known-0.029
Token on
Token ID319
Feature activation+9.01e-01

Pos logit contributions
 dot+0.011
 distances+0.011
 pavement+0.011
 horizon+0.010
ro+0.010
Neg logit contributions
 exactly-0.009
itting-0.009
 known-0.009
str-0.009
sm-0.008
Token and
Token ID290
Feature activation+5.07e-01

Pos logit contributions
 dot+0.038
 pavement+0.036
 distances+0.036
 horizon+0.036
ro+0.034
Neg logit contributions
 exactly-0.034
itting-0.033
str-0.030
sm-0.029
o-0.029
Token We
Token ID775
Feature activation-9.50e-01

Pos logit contributions
 dot+0.001
 pavement+0.001
 distances+0.001
 horizon+0.001
 surfaces+0.001
Neg logit contributions
 exactly-0.001
itting-0.001
o-0.001
 Enjoy-0.001
 known-0.001
Token have
Token ID423
Feature activation+1.59e-01

Pos logit contributions
 exactly+0.041
itting+0.038
str+0.036
sm+0.035
 known+0.034
Neg logit contributions
 dot-0.035
 distances-0.033
 pavement-0.032
 horizon-0.031
 dashboard-0.030
Token apple
Token ID17180
Feature activation-1.43e+00

Pos logit contributions
 distances+0.007
 dot+0.007
 pavement+0.007
 horizon+0.007
 wheels+0.007
Neg logit contributions
itting-0.005
 exactly-0.005
str-0.005
sm-0.004
bek-0.004
Token pie
Token ID2508
Feature activation-2.57e+00

Pos logit contributions
 exactly+0.044
itting+0.038
sm+0.035
str+0.035
o+0.034
Neg logit contributions
 dot-0.071
 pavement-0.068
 distances-0.068
 horizon-0.066
 dashboard-0.065
Token,
Token ID11
Feature activation+1.33e+00

Pos logit contributions
 exactly+0.101
itting+0.090
sm+0.085
str+0.084
 known+0.083
Neg logit contributions
 dot-0.105
 pavement-0.101
 distances-0.101
 horizon-0.098
 dashboard-0.095
Token cherry
Token ID23612
Feature activation-2.85e+00

Pos logit contributions
 dot+0.057
 pavement+0.055
 distances+0.054
 horizon+0.053
 dashboard+0.051
Neg logit contributions
 exactly-0.050
itting-0.045
sm-0.042
str-0.042
 known-0.041
Token pie
Token ID2508
Feature activation-2.92e+00

Pos logit contributions
 exactly+0.120
itting+0.110
 known+0.102
str+0.101
sm+0.100
Neg logit contributions
 dot-0.109
 distances-0.104
 pavement-0.104
 horizon-0.101
 dashboard-0.097
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
 exactly+0.133
itting+0.122
sm+0.115
str+0.115
 known+0.114
Neg logit contributions
 dot-0.101
 pavement-0.096
 distances-0.096
 horizon-0.092
 dashboard-0.089
Token pumpkin
Token ID30089
Feature activation-2.15e+00

Pos logit contributions
 dot+0.044
 pavement+0.042
 distances+0.042
 horizon+0.041
 dashboard+0.040
Neg logit contributions
 exactly-0.046
itting-0.040
sm-0.038
str-0.038
o-0.037
Token pie
Token ID2508
Feature activation-2.85e+00

Pos logit contributions
 exactly+0.095
itting+0.091
 known+0.087
str+0.082
bek+0.081
Neg logit contributions
 dot-0.075
 distances-0.070
 pavement-0.070
ro-0.067
 horizon-0.067
Token,
Token ID11
Feature activation+9.11e-01

Pos logit contributions
 exactly+0.128
itting+0.117
sm+0.110
str+0.110
 known+0.109
Neg logit contributions
 dot-0.100
 pavement-0.096
 distances-0.095
 horizon-0.092
 dashboard-0.089
Token.
Token ID13
Feature activation+4.12e-03

Pos logit contributions
 dot+0.059
 distances+0.057
 pavement+0.056
 horizon+0.055
ro+0.053
Neg logit contributions
 exactly-0.051
itting-0.048
str-0.044
 known-0.042
sm-0.042
Token They
Token ID1119
Feature activation+7.78e-02

Pos logit contributions
 dot+0.000
 pavement+0.000
 distances+0.000
 horizon+0.000
 dashboard+0.000
Neg logit contributions
 exactly-0.000
itting-0.000
str-0.000
sm-0.000
 known-0.000
Token shouted
Token ID17293
Feature activation+1.11e-01

Pos logit contributions
 dot+0.002
 widened+0.002
 distances+0.002
 pavement+0.002
 pedals+0.002
Neg logit contributions
itting-0.004
 exactly-0.003
itt-0.003
uc-0.003
rou-0.003
Token and
Token ID290
Feature activation-1.48e+00

Pos logit contributions
 dot+0.004
 distances+0.004
 pavement+0.004
 horizon+0.004
 wheels+0.004
Neg logit contributions
str-0.004
 known-0.004
 exactly-0.004
itting-0.004
live-0.004
Token b
Token ID275
Feature activation-3.96e-01

Pos logit contributions
itting+0.077
 exactly+0.069
rou+0.066
itt+0.065
live+0.063
Neg logit contributions
 dot-0.042
 pavement-0.042
 widened-0.041
 distances-0.041
 heads-0.039
Tokenanged
Token ID5102
Feature activation-2.82e+00

Pos logit contributions
 exactly+0.019
 wrong+0.018
 living+0.018
 teaching+0.018
 known+0.017
Neg logit contributions
 dot-0.011
 distances-0.011
ro-0.011
 pavement-0.010
en-0.009
Token their
Token ID511
Feature activation-8.74e-01

Pos logit contributions
itting+0.110
str+0.106
 exactly+0.102
 known+0.100
live+0.099
Neg logit contributions
 distances-0.115
 dot-0.114
 pavement-0.109
 wheels-0.107
 widened-0.106
Token tails
Token ID30514
Feature activation-6.61e-01

Pos logit contributions
 exactly+0.048
itting+0.047
 known+0.045
str+0.044
sm+0.044
Neg logit contributions
 dot-0.022
 distances-0.021
 heads-0.020
 pavement-0.020
 horizon-0.019
Token so
Token ID523
Feature activation-4.40e-02

Pos logit contributions
 exactly+0.026
str+0.025
itting+0.025
 known+0.024
bek+0.024
Neg logit contributions
 dot-0.026
 distances-0.025
 pavement-0.024
 horizon-0.024
ro-0.023
Token the
Token ID262
Feature activation-2.09e-02

Pos logit contributions
 exactly+0.002
itting+0.002
 known+0.002
o+0.002
uc+0.002
Neg logit contributions
 dot-0.002
 pavement-0.002
 distances-0.002
 horizon-0.002
 winds-0.001
Token fox
Token ID21831
Feature activation-2.59e-02

Pos logit contributions
 exactly+0.001
itting+0.001
sm+0.001
o+0.001
str+0.001
Neg logit contributions
 dot-0.001
 pavement-0.001
 distances-0.001
 horizon-0.001
ro-0.001