TOP ACTIVATIONS
MAX = 9.166

different styles like stars or spir als . Everyone was so
happily ever after . <|endoftext|> Once upon a time , there
and his friends . <|endoftext|> Once upon a time , there
making new friends . <|endoftext|> Once upon a time , there
and played together . <|endoftext|> Once upon a time , there
place for everyone . <|endoftext|> Once upon a time , there
a big difference . <|endoftext|> Once upon a time , there
and his friends . <|endoftext|> Once upon a time , there
happiness to others . <|endoftext|> Once upon a time , there
by their size . <|endoftext|> Once upon a time , in
writing the story . <|endoftext|> Once upon a time , in
and were happy . <|endoftext|> Once upon a time , there
became good friends . <|endoftext|> Once upon a time , there
happiness to everyone . <|endoftext|> Once upon a time , there
fun time together . <|endoftext|> Once upon a time , there
its garden home . <|endoftext|> Once upon a time , there
on their faces . <|endoftext|> Once upon a time , there
day matching pages . <|endoftext|> Once upon a time , there
more fun swimming . <|endoftext|> Once upon a time , there
y universe toy . <|endoftext|> Once upon a time , there

MIN ACTIVATIONS
MIN = -7.529

remembered that her parents had taught her to be polite and
Grand ma was teaching her grand child how to use a
full of love and made Daddy feel special . Every evening
Tim my remembered his dad teaching him how to whistle .
farm . She told her grand pa that she was looking
Mia remembered how her parents taught her to be gentle and
. Her tutor taught her a lot of things . One
too . Daddy was teaching Mom my how to fly the
of his house to teach Gary a lesson . He was
mom many questions . She teaches him the names of the
. Her mom my was teaching her a game that involved
hot today ! I wish a cool breeze would come ."
He even taught his little sister how to twist the clock
remembered a lesson her mother taught her - that things that
his gr anny had always taught him the importance of being
and asked if he could teach him how to rake .
. Her mum and dad taught her all the answers .
my 's mom was very strict and said he couldn 't
his house to teach Gary a lesson . He was harsh
’ s story teaches us an important lesson -

INTERVAL 4.992 - 9.166
CONTAINS 0.788%

not a dancer . <|endoftext|> S ara likes to design things
. <|endoftext|> Once upon a time , there was a big
more beautiful pictures . <|endoftext|> Once upon a time , there
<|endoftext|> Once upon a time , there was a gentle rabbit
stars tw ink ling in the night sky . " Wow

INTERVAL 0.818 - 4.992
CONTAINS 29.662%

with you . Don ’ t worry , I
saw Ben 's pistol on the ground . Ben had dropped
boy called Jimmy . One day , Jimmy went to his
ay ons hard and fast . They want to see purple
ducks in the water and sand castles on the shore .

INTERVAL -3.356 - 0.818
CONTAINS 67.537%

" Wow !" said Jake . " This looks fine !"
the river and saw something new . All of a sudden
the dirt !" M ama was very impressed .
He said , " Yes , Lily . I promise .
. Blue lived in a tiny nest in the west part

INTERVAL -7.529 - -3.356
CONTAINS 2.013%

and didn 't want to learn . So
said , " That 's a beautiful flower ."
was walking , he heard a strange noise . He looked
mom answered , " You should tell Billy how you feel
the door , he saw a big dragon inside ! The
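
NOTE ON THE INTERVALS
The four INTERVAL blocks above appear to split the range from MIN to MAX into equal-width bins. The sketch below checks that arithmetic under that assumption. The equal-width rule and the names interval_edges, MIN_ACT, MAX_ACT, N_BINS and shares are illustrative assumptions, not something the dashboard states. Only the numbers are taken from the listing.

```python
# Sketch: rebuild the interval edges from the reported MIN and MAX,
# ASSUMING the dashboard uses four equal-width bins (not stated in the source).
MIN_ACT = -7.529   # from "MIN = -7.529"
MAX_ACT = 9.166    # from "MAX = 9.166"
N_BINS = 4         # four INTERVAL blocks are listed


def interval_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins spanning [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


edges = interval_edges(MIN_ACT, MAX_ACT, N_BINS)

# Shares reported under each INTERVAL heading, highest bin first.
shares = [0.788, 29.662, 67.537, 2.013]

# Pair consecutive edges into bins, then reverse so the highest bin comes first.
bins = list(zip(edges[:-1], edges[1:]))[::-1]
for (lo_edge, hi_edge), share in zip(bins, shares):
    print(f"{lo_edge:.3f} to {hi_edge:.3f}: {share:.3f}%")

print(f"total: {sum(shares):.3f}%")
```

Under this assumption the computed edges match the listed ones (-3.356, 0.818, 4.992) to within about 0.001, which is what rounding MIN and MAX to three decimals would produce, and the four shares sum to 100%.
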
Token different
Token ID1180
Feature activation-1.23e-02

Pos logit contributions
 occasions+0.012
 instructions+0.012
So+0.012
quet+0.012
and+0.012
Neg logit contributions
 seeker-0.005
 tired-0.004
 dusk-0.004
 spl-0.004
 d-0.004
Token styles
Token ID12186
Feature activation+1.17e+00

Pos logit contributions
quet+0.000
 instructions+0.000
So+0.000
Mic+0.000
vern+0.000
Neg logit contributions
 seeker-0.000
ained-0.000
 tired-0.000
 morning-0.000
 d-0.000
Token like
Token ID588
Feature activation-3.66e-01

Pos logit contributions
 seeker+0.029
ained+0.026
 tired+0.024
 forward+0.023
 slid+0.022
Neg logit contributions
 instructions-0.021
So-0.020
 occasions-0.019
 distress-0.019
ymes-0.017
Token stars
Token ID5788
Feature activation+1.68e+00

Pos logit contributions
 instructions+0.009
So+0.008
 occasions+0.008
 slices+0.008
quet+0.008
Neg logit contributions
 seeker-0.007
ained-0.006
 tired-0.005
 forward-0.005
 slid-0.005
Token or
Token ID393
Feature activation+1.28e+00

Pos logit contributions
 seeker+0.045
ained+0.039
 tired+0.036
 forward+0.035
 slid+0.035
Neg logit contributions
 instructions-0.028
So-0.025
 occasions-0.024
 slices-0.024
quet-0.023
Token spir
Token ID9158
Feature activation+9.17e+00

Pos logit contributions
 seeker+0.027
ained+0.022
 tired+0.021
 slid+0.019
 forward+0.019
Neg logit contributions
 instructions-0.028
So-0.027
 occasions-0.026
quet-0.025
 slices-0.025
Tokenals
Token ID874
Feature activation+3.48e+00

Pos logit contributions
 seeker+0.235
ained+0.187
 tired+0.175
 surfing+0.166
ancer+0.163
Neg logit contributions
 instructions-0.166
 slices-0.154
 brace-0.154
and-0.149
quet-0.149
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
 seeker+0.089
ained+0.077
 tired+0.072
 forward+0.069
 slid+0.068
Neg logit contributions
 instructions-0.061
So-0.056
 occasions-0.055
 slices-0.052
 distress-0.052
Token Everyone
Token ID11075
Feature activation-1.09e+00

Pos logit contributions
 instructions+0.005
 occasions+0.005
So+0.005
quet+0.005
 slices+0.005
Neg logit contributions
 seeker-0.004
ained-0.003
 tired-0.003
 F-0.002
 slid-0.002
Token was
Token ID373
Feature activation+7.57e-01

Pos logit contributions
 instructions+0.025
So+0.023
 occasions+0.023
 slices+0.022
quet+0.022
Neg logit contributions
 seeker-0.022
ained-0.018
 tired-0.017
 slid-0.016
 forward-0.016
Token so
Token ID523
Feature activation+7.76e-01

Pos logit contributions
 seeker+0.018
ained+0.015
ancer+0.014
asc+0.013
 slid+0.013
Neg logit contributions
 instructions-0.015
So-0.013
 slices-0.013
 distress-0.013
 brace-0.013
Token happily
Token ID18177
Feature activation+1.78e-01

Pos logit contributions
 seeker+0.035
ained+0.030
ancer+0.029
asc+0.028
ahs+0.026
Neg logit contributions
 instructions-0.031
keeping-0.029
 words-0.028
So-0.028
 note-0.028
Token ever
Token ID1683
Feature activation+2.55e+00

Pos logit contributions
 seeker+0.005
ained+0.004
ancer+0.004
asc+0.004
 forward+0.004
Neg logit contributions
 instructions-0.003
So-0.003
 occasions-0.002
keeping-0.002
 slices-0.002
Token after
Token ID706
Feature activation+8.03e-01

Pos logit contributions
 seeker+0.046
ained+0.039
asc+0.034
 slid+0.033
ancer+0.032
Neg logit contributions
 instructions-0.062
So-0.058
 slices-0.057
 occasions-0.056
keeping-0.056
Token.
Token ID13
Feature activation+1.41e+00

Pos logit contributions
 seeker+0.022
ained+0.020
 tired+0.018
 forward+0.017
 slid+0.017
Neg logit contributions
 instructions-0.013
So-0.011
 occasions-0.011
 slices-0.011
quet-0.011
Token<|endoftext|>
Token ID50256
Feature activation+2.16e+00

Pos logit contributions
 seeker+0.037
ained+0.031
 tired+0.029
 forward+0.028
ancer+0.028
Neg logit contributions
 instructions-0.025
So-0.023
 occasions-0.022
 slices-0.021
quet-0.021
TokenOnce
Token ID7454
Feature activation+8.35e+00

Pos logit contributions
 seeker+0.041
ained+0.035
 tired+0.031
 F+0.030
 morning+0.029
Neg logit contributions
 instructions-0.055
 occasions-0.052
quet-0.052
 brace-0.051
 recipe-0.050
Token upon
Token ID2402
Feature activation+1.37e+00

Pos logit contributions
 seeker+0.112
ained+0.080
 tired+0.063
asc+0.060
 slid+0.059
Neg logit contributions
 instructions-0.247
So-0.237
 slices-0.229
 occasions-0.228
and-0.226
Token a
Token ID257
Feature activation+5.01e+00

Pos logit contributions
 seeker+0.017
ained+0.012
 tired+0.011
 morning+0.009
 spl+0.008
Neg logit contributions
 instructions-0.045
 occasions-0.042
quet-0.042
 recipe-0.041
keeping-0.040
Token time
Token ID640
Feature activation+6.36e+00

Pos logit contributions
 seeker+0.065
ained+0.047
 tired+0.042
 morning+0.038
 dusk+0.033
Neg logit contributions
 instructions-0.165
keeping-0.154
quet-0.151
 occasions-0.148
 brace-0.146
Token,
Token ID11
Feature activation+5.74e+00

Pos logit contributions
 seeker+0.123
ained+0.100
 tired+0.085
ancer+0.085
asc+0.084
Neg logit contributions
 instructions-0.144
So-0.143
and-0.134
 slices-0.134
 distress-0.132
Token there
Token ID612
Feature activation+3.25e+00

Pos logit contributions
 seeker+0.111
ained+0.087
ancer+0.077
 tired+0.074
ahs+0.074
Neg logit contributions
So-0.130
 instructions-0.127
 distress-0.122
 slices-0.121
and-0.118
Token and
Token ID290
Feature activation+1.13e+00

Pos logit contributions
 instructions+0.038
So+0.035
 occasions+0.034
quet+0.034
 distress+0.033
Neg logit contributions
 seeker-0.050
ained-0.043
 tired-0.041
 forward-0.038
 slid-0.038
Token his
Token ID465
Feature activation-8.32e-02

Pos logit contributions
 seeker+0.025
ained+0.022
ancer+0.020
asc+0.018
 forward+0.018
Neg logit contributions
 instructions-0.023
So-0.021
 occasions-0.021
keeping-0.021
 brace-0.020
Token friends
Token ID2460
Feature activation-1.43e+00

Pos logit contributions
 instructions+0.002
So+0.002
 occasions+0.002
keeping+0.002
quet+0.002
Neg logit contributions
 seeker-0.001
 tired-0.001
ained-0.001
 slid-0.001
 spl-0.001
Token.
Token ID13
Feature activation+1.48e+00

Pos logit contributions
 instructions+0.020
So+0.018
 occasions+0.017
 slices+0.016
quet+0.016
Neg logit contributions
 seeker-0.042
ained-0.037
 tired-0.034
ancer-0.033
asc-0.033
Token<|endoftext|>
Token ID50256
Feature activation+1.95e+00

Pos logit contributions
 seeker+0.042
ained+0.036
ancer+0.034
 tired+0.034
 forward+0.033
Neg logit contributions
 instructions-0.022
So-0.021
 slices-0.018
and-0.018
 occasions-0.018
TokenOnce
Token ID7454
Feature activation+8.27e+00

Pos logit contributions
 seeker+0.038
ained+0.033
 tired+0.029
 F+0.028
 morning+0.028
Neg logit contributions
 instructions-0.048
 occasions-0.045
quet-0.045
 brace-0.045
 recipe-0.043
Token upon
Token ID2402
Feature activation+1.28e+00

Pos logit contributions
 seeker+0.133
ained+0.102
 tired+0.085
asc+0.081
 forward+0.080
Neg logit contributions
 instructions-0.224
So-0.213
 slices-0.206
 occasions-0.205
and-0.202
Token a
Token ID257
Feature activation+4.98e+00

Pos logit contributions
 seeker+0.017
ained+0.012
 tired+0.012
 slid+0.009
 morning+0.009
Neg logit contributions
 instructions-0.040
 occasions-0.038
quet-0.037
 recipe-0.036
keeping-0.036
Token time
Token ID640
Feature activation+6.15e+00

Pos logit contributions
 seeker+0.062
ained+0.045
 tired+0.040
 morning+0.037
 slid+0.032
Neg logit contributions
 instructions-0.165
keeping-0.155
quet-0.152
 occasions-0.148
 brace-0.146
Token,
Token ID11
Feature activation+5.64e+00

Pos logit contributions
 seeker+0.124
ained+0.104
ancer+0.089
asc+0.088
 tired+0.088
Neg logit contributions
So-0.132
 instructions-0.130
and-0.124
 slices-0.123
 distress-0.121
Token there
Token ID612
Feature activation+3.14e+00

Pos logit contributions
 seeker+0.108
ained+0.086
ancer+0.076
 tired+0.073
 exhausted+0.072
Neg logit contributions
So-0.129
 instructions-0.123
 slices-0.119
 distress-0.119
and-0.117
Token making
Token ID1642
Feature activation-6.08e-02

Pos logit contributions
 seeker+0.007
ained+0.006
 tired+0.005
 forward+0.005
ancer+0.005
Neg logit contributions
 instructions-0.008
So-0.008
 occasions-0.008
 slices-0.008
quet-0.007
Token new
Token ID649
Feature activation-1.30e+00

Pos logit contributions
quet+0.002
 occasions+0.002
 instructions+0.002
So+0.002
keeping+0.001
Neg logit contributions
 seeker-0.001
 tired-0.001
ained-0.001
 spl-0.001
 F-0.001
Token friends
Token ID2460
Feature activation-2.51e+00

Pos logit contributions
 instructions+0.031
quet+0.031
 occasions+0.030
So+0.030
keeping+0.029
Neg logit contributions
 seeker-0.022
ained-0.018
 tired-0.018
 kay-0.015
 spl-0.015
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
 instructions+0.044
So+0.041
 occasions+0.041
 distress+0.039
 slices+0.038
Neg logit contributions
 seeker-0.062
ained-0.053
 tired-0.052
 forward-0.050
 slid-0.048
Token<|endoftext|>
Token ID50256
Feature activation+1.69e+00

Pos logit contributions
 seeker+0.030
ained+0.025
ancer+0.023
 tired+0.023
 forward+0.023
Neg logit contributions
 instructions-0.015
So-0.014
 slices-0.013
and-0.013
 occasions-0.012
TokenOnce
Token ID7454
Feature activation+8.26e+00

Pos logit contributions
 seeker+0.033
ained+0.029
 tired+0.026
 F+0.025
 morning+0.024
Neg logit contributions
 instructions-0.042
 occasions-0.039
quet-0.039
 brace-0.038
 recipe-0.038
Token upon
Token ID2402
Feature activation+1.03e+00

Pos logit contributions
 seeker+0.107
ained+0.076
 tired+0.061
 forward+0.055
asc+0.054
Neg logit contributions
 instructions-0.250
So-0.238
 occasions-0.232
 slices-0.231
quet-0.227
Token a
Token ID257
Feature activation+4.92e+00

Pos logit contributions
 seeker+0.013
ained+0.009
 tired+0.009
 slid+0.007
 morning+0.007
Neg logit contributions
 instructions-0.033
 occasions-0.031
quet-0.030
 recipe-0.030
keeping-0.029
Token time
Token ID640
Feature activation+6.36e+00

Pos logit contributions
 seeker+0.062
ained+0.044
 tired+0.040
 morning+0.036
 slid+0.031
Neg logit contributions
 instructions-0.165
keeping-0.154
quet-0.151
 occasions-0.147
 brace-0.144
Token,
Token ID11
Feature activation+5.81e+00

Pos logit contributions
 seeker+0.122
ained+0.100
 tired+0.085
ancer+0.084
asc+0.084
Neg logit contributions
 instructions-0.145
So-0.144
and-0.135
 slices-0.135
 distress-0.133
Token there
Token ID612
Feature activation+3.34e+00

Pos logit contributions
 seeker+0.112
ained+0.090
ancer+0.080
ahs+0.079
asc+0.078
Neg logit contributions
So-0.129
 instructions-0.122
 distress-0.120
 slices-0.119
and-0.116
Token and
Token ID290
Feature activation+9.32e-01

Pos logit contributions
 instructions+0.069
So+0.064
 occasions+0.062
 slices+0.060
 brace+0.059
Neg logit contributions
 seeker-0.072
ained-0.060
 tired-0.054
ancer-0.052
 slid-0.052
Token played
Token ID2826
Feature activation+4.06e-02

Pos logit contributions
 seeker+0.021
ained+0.017
 tired+0.015
ancer+0.015
 forward+0.015
Neg logit contributions
 instructions-0.020
So-0.019
 occasions-0.018
 slices-0.018
keeping-0.018
Token together
Token ID1978
Feature activation-6.53e-01

Pos logit contributions
 seeker+0.001
ained+0.001
 tired+0.001
 forward+0.001
 slid+0.001
Neg logit contributions
 instructions-0.001
So-0.001
 occasions-0.001
 slices-0.001
quet-0.001
Token.
Token ID13
Feature activation+1.67e+00

Pos logit contributions
 instructions+0.014
So+0.014
 distress+0.013
 occasions+0.013
quet+0.013
Neg logit contributions
 seeker-0.014
ained-0.011
 forward-0.011
 tired-0.010
 morning-0.010
Token<|endoftext|>
Token ID50256
Feature activation+1.86e+00

Pos logit contributions
 seeker+0.045
ained+0.039
 tired+0.036
 forward+0.034
 slid+0.034
Neg logit contributions
 instructions-0.028
So-0.025
 occasions-0.025
 slices-0.024
quet-0.024
TokenOnce
Token ID7454
Feature activation+8.24e+00

Pos logit contributions
 seeker+0.039
ained+0.034
 tired+0.030
 F+0.029
 morning+0.029
Neg logit contributions
 instructions-0.043
 occasions-0.039
quet-0.039
 brace-0.039
 recipe-0.038
Token upon
Token ID2402
Feature activation+1.31e+00

Pos logit contributions
 seeker+0.101
ained+0.068
asc+0.050
 tired+0.050
 slid+0.047
Neg logit contributions
 instructions-0.253
So-0.244
 slices-0.236
and-0.234
 occasions-0.234
Token a
Token ID257
Feature activation+5.07e+00

Pos logit contributions
 seeker+0.016
ained+0.011
 tired+0.011
 morning+0.009
 slid+0.008
Neg logit contributions
 instructions-0.042
 occasions-0.040
quet-0.039
 recipe-0.039
keeping-0.038
Token time
Token ID640
Feature activation+6.27e+00

Pos logit contributions
 seeker+0.064
ained+0.045
 tired+0.042
 morning+0.038
 forward+0.033
Neg logit contributions
 instructions-0.168
keeping-0.158
quet-0.154
 occasions-0.150
 brace-0.148
Token,
Token ID11
Feature activation+5.70e+00

Pos logit contributions
 seeker+0.126
ained+0.106
asc+0.090
ancer+0.089
 tired+0.089
Neg logit contributions
So-0.136
 instructions-0.135
and-0.129
 slices-0.127
 distress-0.125
Token there
Token ID612
Feature activation+3.23e+00

Pos logit contributions
 seeker+0.116
ained+0.094
ahs+0.086
ancer+0.085
 surfing+0.083
Neg logit contributions
So-0.120
 distress-0.110
 instructions-0.108
 slices-0.108
and-0.107
Token place
Token ID1295
Feature activation+6.95e-01

Pos logit contributions
 instructions+0.043
So+0.041
quet+0.039
 slices+0.039
 occasions+0.039
Neg logit contributions
 seeker-0.040
ained-0.032
 tired-0.030
 forward-0.028
 morning-0.028
Token for
Token ID329
Feature activation+3.95e-01

Pos logit contributions
 seeker+0.016
ained+0.014
 tired+0.013
 forward+0.012
 slid+0.012
Neg logit contributions
 instructions-0.014
So-0.013
 occasions-0.012
 slices-0.012
quet-0.012
Token everyone
Token ID2506
Feature activation-1.68e+00

Pos logit contributions
 seeker+0.009
ained+0.007
 tired+0.007
 slid+0.006
asc+0.006
Neg logit contributions
 instructions-0.008
So-0.008
 occasions-0.008
 slices-0.007
quet-0.007
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
 instructions+0.033
So+0.030
 occasions+0.030
 slices+0.029
quet+0.029
Neg logit contributions
 seeker-0.039
ained-0.033
 tired-0.031
 forward-0.030
 slid-0.029
Token<|endoftext|>
Token ID50256
Feature activation+1.60e+00

Pos logit contributions
 seeker+0.034
ained+0.030
 tired+0.027
 forward+0.026
ancer+0.026
Neg logit contributions
 instructions-0.028
 occasions-0.025
 brace-0.025
quet-0.025
So-0.024
TokenOnce
Token ID7454
Feature activation+8.23e+00

Pos logit contributions
 seeker+0.037
ained+0.031
 tired+0.028
 forward+0.027
ancer+0.027
Neg logit contributions
 instructions-0.033
So-0.030
 occasions-0.029
 slices-0.029
quet-0.028
Token upon
Token ID2402
Feature activation+1.27e+00

Pos logit contributions
 seeker+0.091
ained+0.060
 tired+0.044
asc+0.040
 forward+0.039
Neg logit contributions
 instructions-0.264
So-0.253
 occasions-0.246
 slices-0.246
and-0.242
Token a
Token ID257
Feature activation+4.81e+00

Pos logit contributions
 seeker+0.014
ained+0.010
 tired+0.010
 morning+0.007
 slid+0.007
Neg logit contributions
 instructions-0.042
 occasions-0.040
quet-0.039
 recipe-0.039
keeping-0.038
Token time
Token ID640
Feature activation+6.18e+00

Pos logit contributions
 seeker+0.054
ained+0.036
 tired+0.034
 morning+0.031
 tim+0.025
Neg logit contributions
 instructions-0.167
keeping-0.159
quet-0.155
 occasions-0.149
 brace-0.147
Token,
Token ID11
Feature activation+5.75e+00

Pos logit contributions
 seeker+0.127
ained+0.108
asc+0.092
ancer+0.090
 tired+0.089
Neg logit contributions
So-0.131
 instructions-0.130
and-0.126
 slices-0.122
 distress-0.120
Token there
Token ID612
Feature activation+3.14e+00

Pos logit contributions
 seeker+0.116
ained+0.095
ancer+0.088
ahs+0.087
 surfing+0.083
Neg logit contributions
So-0.121
 distress-0.113
and-0.110
 instructions-0.109
 slices-0.108
Token a
Token ID257
Feature activation-2.33e+00

Pos logit contributions
 instructions+0.012
quet+0.012
So+0.012
 occasions+0.012
keeping+0.012
Neg logit contributions
 seeker-0.009
ained-0.007
 tired-0.007
 forward-0.007
 F-0.007
Token big
Token ID1263
Feature activation-1.66e+00

Pos logit contributions
 instructions+0.047
 occasions+0.047
keeping+0.046
So+0.045
quet+0.044
Neg logit contributions
 seeker-0.051
 tired-0.043
ained-0.039
 forward-0.038
 kay-0.037
Token difference
Token ID3580
Feature activation+6.00e-01

Pos logit contributions
 instructions+0.025
So+0.023
 occasions+0.023
and+0.022
keeping+0.022
Neg logit contributions
 seeker-0.047
 tired-0.040
ained-0.039
 forward-0.037
 pool-0.037
Token.
Token ID13
Feature activation+7.77e-01

Pos logit contributions
 seeker+0.015
ained+0.013
 tired+0.012
 forward+0.012
ancer+0.011
Neg logit contributions
 instructions-0.011
So-0.010
 occasions-0.010
 slices-0.010
quet-0.009
Token<|endoftext|>
Token ID50256
Feature activation+2.03e+00

Pos logit contributions
 seeker+0.017
ained+0.015
 tired+0.013
 F+0.013
 Jump+0.013
Neg logit contributions
 instructions-0.017
 brace-0.015
 occasions-0.015
 slices-0.015
 recipe-0.015
TokenOnce
Token ID7454
Feature activation+8.21e+00

Pos logit contributions
 seeker+0.040
ained+0.034
 tired+0.031
 F+0.029
 morning+0.029
Neg logit contributions
 instructions-0.051
 occasions-0.047
quet-0.047
 brace-0.046
 recipe-0.045
Token upon
Token ID2402
Feature activation+1.31e+00

Pos logit contributions
 seeker+0.102
ained+0.071
 tired+0.056
 forward+0.050
 slid+0.050
Neg logit contributions
 instructions-0.254
So-0.242
 occasions-0.236
 slices-0.235
quet-0.231
Token a
Token ID257
Feature activation+4.78e+00

Pos logit contributions
 seeker+0.015
ained+0.010
 tired+0.010
 morning+0.008
 slid+0.007
Neg logit contributions
 instructions-0.043
 occasions-0.041
quet-0.040
 recipe-0.040
keeping-0.039
Token time
Token ID640
Feature activation+6.13e+00

Pos logit contributions
 seeker+0.059
ained+0.040
 tired+0.038
 morning+0.034
 slid+0.029
Neg logit contributions
 instructions-0.161
keeping-0.153
quet-0.148
 occasions-0.143
 brace-0.142
Token,
Token ID11
Feature activation+5.40e+00

Pos logit contributions
 seeker+0.123
ained+0.105
asc+0.088
 tired+0.087
ancer+0.087
Neg logit contributions
 instructions-0.133
So-0.132
and-0.126
 slices-0.124
 distress-0.121
Token there
Token ID612
Feature activation+2.96e+00

Pos logit contributions
 seeker+0.107
ained+0.088
ancer+0.079
ahs+0.078
 dusk+0.075
Neg logit contributions
So-0.118
 distress-0.108
 instructions-0.108
 slices-0.107
and-0.104
Token and
Token ID290
Feature activation+9.16e-01

Pos logit contributions
 instructions+0.035
So+0.032
quet+0.032
 occasions+0.032
 slices+0.030
Neg logit contributions
 seeker-0.046
ained-0.039
 tired-0.038
 forward-0.035
 slid-0.035
Token his
Token ID465
Feature activation-2.98e-01

Pos logit contributions
 seeker+0.019
ained+0.017
ancer+0.015
asc+0.014
 tired+0.014
Neg logit contributions
 instructions-0.020
So-0.018
 brace-0.018
 occasions-0.018
keeping-0.018
Token friends
Token ID2460
Feature activation-1.34e+00

Pos logit contributions
 instructions+0.007
So+0.007
 occasions+0.007
quet+0.007
 slices+0.007
Neg logit contributions
 seeker-0.006
ained-0.004
 tired-0.004
 slid-0.004
 forward-0.004
Token.
Token ID13
Feature activation+9.71e-01

Pos logit contributions
 instructions+0.021
So+0.019
 occasions+0.018
 slices+0.018
quet+0.018
Neg logit contributions
 seeker-0.037
ained-0.032
 tired-0.030
 forward-0.029
 slid-0.029
Token<|endoftext|>
Token ID50256
Feature activation+1.82e+00

Pos logit contributions
 seeker+0.026
ained+0.022
 tired+0.020
ancer+0.020
 forward+0.020
Neg logit contributions
 instructions-0.017
So-0.016
 slices-0.014
 occasions-0.014
and-0.014
TokenOnce
Token ID7454
Feature activation+8.21e+00

Pos logit contributions
 seeker+0.036
ained+0.031
 tired+0.027
 F+0.026
 morning+0.026
Neg logit contributions
 instructions-0.044
 occasions-0.042
quet-0.042
 brace-0.041
 recipe-0.040
Token upon
Token ID2402
Feature activation+1.17e+00

Pos logit contributions
 seeker+0.118
ained+0.087
 tired+0.070
asc+0.066
 forward+0.065
Neg logit contributions
 instructions-0.236
So-0.226
 slices-0.218
 occasions-0.218
and-0.215
Token a
Token ID257
Feature activation+4.81e+00

Pos logit contributions
 seeker+0.014
ained+0.010
 tired+0.010
 morning+0.008
 slid+0.007
Neg logit contributions
 instructions-0.038
 occasions-0.036
quet-0.035
 recipe-0.035
keeping-0.034
Token time
Token ID640
Feature activation+6.14e+00

Pos logit contributions
 seeker+0.065
ained+0.047
 tired+0.042
 morning+0.038
 slid+0.035
Neg logit contributions
 instructions-0.155
keeping-0.144
quet-0.143
 occasions-0.139
 brace-0.137
Token,
Token ID11
Feature activation+5.63e+00

Pos logit contributions
 seeker+0.121
ained+0.100
ancer+0.085
 tired+0.085
asc+0.085
Neg logit contributions
So-0.136
 instructions-0.135
and-0.127
 slices-0.126
 distress-0.125
Token there
Token ID612
Feature activation+3.23e+00

Pos logit contributions
 seeker+0.108
ained+0.085
ancer+0.076
ahs+0.073
asc+0.072
Neg logit contributions
So-0.128
 instructions-0.122
 distress-0.119
 slices-0.119
and-0.116
Token happiness
Token ID12157
Feature activation-2.13e+00

Pos logit contributions
 seeker+0.010
ained+0.009
 tired+0.008
 forward+0.008
 slid+0.007
Neg logit contributions
 instructions-0.010
So-0.009
 occasions-0.009
quet-0.009
 slices-0.009
Token to
Token ID284
Feature activation-1.74e+00

Pos logit contributions
 instructions+0.040
So+0.037
 occasions+0.036
 slices+0.035
quet+0.035
Neg logit contributions
 seeker-0.052
ained-0.044
 tired-0.041
 forward-0.040
 slid-0.039
Token others
Token ID1854
Feature activation-1.36e+00

Pos logit contributions
 instructions+0.041
So+0.040
 slices+0.039
quet+0.039
 distress+0.039
Neg logit contributions
 seeker-0.031
ained-0.023
 tired-0.023
 forward-0.022
 F-0.021
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
 instructions+0.026
So+0.024
 occasions+0.023
 slices+0.023
quet+0.022
Neg logit contributions
 seeker-0.033
ained-0.028
 tired-0.026
 forward-0.025
 slid-0.025
Token<|endoftext|>
Token ID50256
Feature activation+1.90e+00

Pos logit contributions
 seeker+0.023
ained+0.021
 tired+0.018
 F+0.017
 morning+0.017
Neg logit contributions
 instructions-0.023
 occasions-0.021
 brace-0.020
 distress-0.020
quet-0.020
TokenOnce
Token ID7454
Feature activation+8.20e+00

Pos logit contributions
 seeker+0.040
ained+0.035
 tired+0.031
 F+0.030
 morning+0.029
Neg logit contributions
 instructions-0.044
 occasions-0.041
quet-0.040
 brace-0.040
 recipe-0.039
Token upon
Token ID2402
Feature activation+1.01e+00

Pos logit contributions
 seeker+0.095
ained+0.065
 tired+0.049
asc+0.044
 forward+0.043
Neg logit contributions
 instructions-0.258
So-0.247
 occasions-0.240
 slices-0.240
and-0.236
Token a
Token ID257
Feature activation+5.07e+00

Pos logit contributions
 seeker+0.012
ained+0.009
 tired+0.008
 morning+0.006
 slid+0.006
Neg logit contributions
 instructions-0.033
 occasions-0.031
quet-0.031
 recipe-0.030
keeping-0.030
Token time
Token ID640
Feature activation+6.38e+00

Pos logit contributions
 seeker+0.063
ained+0.043
 tired+0.039
 morning+0.036
 forward+0.030
Neg logit contributions
 instructions-0.171
keeping-0.161
quet-0.158
 occasions-0.152
 brace-0.150
Token,
Token ID11
Feature activation+5.60e+00

Pos logit contributions
 seeker+0.127
ained+0.107
asc+0.090
ancer+0.090
 tired+0.089
Neg logit contributions
So-0.139
 instructions-0.139
and-0.131
 slices-0.130
 distress-0.128
Token there
Token ID612
Feature activation+3.18e+00

Pos logit contributions
 seeker+0.111
ained+0.088
ancer+0.079
ahs+0.079
 surfing+0.077
Neg logit contributions
So-0.123
 instructions-0.115
 distress-0.114
 slices-0.112
and-0.111
Token by
Token ID416
Feature activation+1.07e+00

Pos logit contributions
 instructions+0.011
 occasions+0.010
So+0.010
 slices+0.010
quet+0.010
Neg logit contributions
 seeker-0.014
ained-0.012
 tired-0.012
 forward-0.011
ancer-0.011
Token their
Token ID511
Feature activation+3.40e+00

Pos logit contributions
 seeker+0.024
ained+0.021
 tired+0.019
 forward+0.018
 F+0.018
Neg logit contributions
 instructions-0.022
So-0.020
 occasions-0.020
 slices-0.020
quet-0.019
Token size
Token ID2546
Feature activation+2.47e+00

Pos logit contributions
 seeker+0.081
 tired+0.067
ained+0.067
 forward+0.064
 morning+0.061
Neg logit contributions
quet-0.062
So-0.061
 instructions-0.059
 slices-0.059
 occasions-0.057
Token.
Token ID13
Feature activation+6.35e-01

Pos logit contributions
 seeker+0.064
ained+0.056
 tired+0.052
 forward+0.051
ancer+0.049
Neg logit contributions
 instructions-0.042
So-0.039
 occasions-0.038
 slices-0.037
quet-0.037
Token<|endoftext|>
Token ID50256
Feature activation+1.71e+00

Pos logit contributions
 seeker+0.012
ained+0.011
 Jump+0.010
 F+0.010
 tired+0.010
Neg logit contributions
 instructions-0.015
 brace-0.014
 occasions-0.013
 slices-0.013
ymes-0.013
TokenOnce
Token ID7454
Feature activation+8.20e+00

Pos logit contributions
 seeker+0.038
ained+0.033
 tired+0.030
 F+0.029
 forward+0.028
Neg logit contributions
 instructions-0.037
 occasions-0.034
quet-0.034
 brace-0.033
 recipe-0.032
Token upon
Token ID2402
Feature activation+1.24e+00

Pos logit contributions
 seeker+0.092
ained+0.062
 tired+0.047
 forward+0.041
 slid+0.040
Neg logit contributions
 instructions-0.264
So-0.251
 occasions-0.246
 slices-0.244
quet-0.241
Token a
Token ID257
Feature activation+5.02e+00

Pos logit contributions
 seeker+0.013
 tired+0.009
ained+0.009
 morning+0.007
 spl+0.006
Neg logit contributions
 instructions-0.042
 occasions-0.039
quet-0.039
 recipe-0.038
keeping-0.038
Token time
Token ID640
Feature activation+6.38e+00

Pos logit contributions
 seeker+0.062
ained+0.042
 tired+0.040
 morning+0.036
 dusk+0.031
Neg logit contributions
 instructions-0.168
keeping-0.160
quet-0.155
 occasions-0.150
 brace-0.148
Token,
Token ID11
Feature activation+5.64e+00

Pos logit contributions
 seeker+0.130
ained+0.111
asc+0.094
 slid+0.092
ancer+0.092
Neg logit contributions
So-0.136
 instructions-0.136
and-0.130
 slices-0.127
 distress-0.125
Token in
Token ID287
Feature activation+5.34e+00

Pos logit contributions
 seeker+0.112
ained+0.092
ahs+0.084
ancer+0.083
asc+0.080
Neg logit contributions
So-0.121
 instructions-0.111
 distress-0.111
 slices-0.110
and-0.108
Token writing
Token ID3597
Feature activation+2.05e-02

Pos logit contributions
 seeker+0.028
ained+0.024
ancer+0.021
asc+0.020
 tired+0.020
Neg logit contributions
 instructions-0.033
 occasions-0.030
 slices-0.030
 brace-0.030
So-0.029
Token the
Token ID262
Feature activation+9.20e-01

Pos logit contributions
 seeker+0.000
ained+0.000
 tired+0.000
ancer+0.000
 slid+0.000
Neg logit contributions
 instructions-0.000
So-0.000
 occasions-0.000
 slices-0.000
 brace-0.000
Token story
Token ID1621
Feature activation-1.05e-01

Pos logit contributions
 seeker+0.021
ained+0.018
 tired+0.016
 slid+0.016
asc+0.016
Neg logit contributions
 instructions-0.020
So-0.018
 occasions-0.017
 words-0.017
 slices-0.016
Token.
Token ID13
Feature activation+7.80e-01

Pos logit contributions
 instructions+0.002
So+0.002
 occasions+0.002
 distress+0.002
 slices+0.002
Neg logit contributions
 seeker-0.003
ained-0.002
 tired-0.002
 forward-0.002
 slid-0.002
Token<|endoftext|>
Token ID50256
Feature activation+1.69e+00

Pos logit contributions
 seeker+0.023
ained+0.020
ancer+0.019
 forward+0.019
 tired+0.018
Neg logit contributions
So-0.010
 instructions-0.010
 slices-0.009
and-0.009
 words-0.008
TokenOnce
Token ID7454
Feature activation+8.19e+00

Pos logit contributions
 seeker+0.038
ained+0.033
 tired+0.030
 F+0.028
 forward+0.028
Neg logit contributions
 instructions-0.036
 occasions-0.033
quet-0.033
 brace-0.032
 recipe-0.032
Token upon
Token ID2402
Feature activation+9.19e-01

Pos logit contributions
 seeker+0.105
ained+0.075
 tired+0.058
asc+0.053
 forward+0.053
Neg logit contributions
 instructions-0.248
So-0.237
 slices-0.230
 occasions-0.230
and-0.226
Token a
Token ID257
Feature activation+4.85e+00

Pos logit contributions
 seeker+0.011
ained+0.008
 tired+0.007
 morning+0.006
 slid+0.006
Neg logit contributions
 instructions-0.030
 occasions-0.028
quet-0.028
 recipe-0.027
keeping-0.027
Token time
Token ID640
Feature activation+6.30e+00

Pos logit contributions
 seeker+0.061
ained+0.042
 tired+0.040
 morning+0.035
 slid+0.030
Neg logit contributions
 instructions-0.161
keeping-0.152
quet-0.150
 occasions-0.144
 brace-0.142
Token,
Token ID11
Feature activation+5.60e+00

Pos logit contributions
 seeker+0.121
ained+0.101
 tired+0.085
asc+0.084
ancer+0.084
Neg logit contributions
 instructions-0.143
So-0.142
and-0.134
 slices-0.133
 distress-0.132
Token in
Token ID287
Feature activation+5.06e+00

Pos logit contributions
 seeker+0.109
ained+0.087
ancer+0.080
ahs+0.077
 surfing+0.075
Neg logit contributions
So-0.123
 instructions-0.115
 distress-0.115
 slices-0.112
and-0.112
Token and
Token ID290
Feature activation+2.38e+00

Pos logit contributions
 instructions+0.024
 occasions+0.023
So+0.023
quet+0.023
keeping+0.023
Neg logit contributions
 seeker-0.017
ained-0.013
 tired-0.013
 forward-0.013
 spl-0.011
Token were
Token ID547
Feature activation+1.67e+00

Pos logit contributions
 seeker+0.052
ained+0.044
ancer+0.043
asc+0.041
 forward+0.039
Neg logit contributions
 instructions-0.047
 brace-0.046
 words-0.044
So-0.044
keeping-0.044
Token happy
Token ID3772
Feature activation-2.43e+00

Pos logit contributions
 seeker+0.039
ained+0.032
ancer+0.032
asc+0.030
 slid+0.028
Neg logit contributions
 instructions-0.032
So-0.031
 brace-0.029
 slices-0.029
 distress-0.028
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
 instructions+0.049
So+0.045
 occasions+0.044
quet+0.043
 distress+0.043
Neg logit contributions
 seeker-0.057
ained-0.048
 tired-0.046
 slid-0.043
 spl-0.042
Token<|endoftext|>
Token ID50256
Feature activation+2.16e+00

Pos logit contributions
 seeker+0.027
ained+0.023
 tired+0.021
ancer+0.021
 forward+0.021
Neg logit contributions
 instructions-0.019
So-0.018
 slices-0.016
 occasions-0.016
and-0.016
TokenOnce
Token ID7454
Feature activation+8.18e+00

Pos logit contributions
 seeker+0.040
ained+0.034
 tired+0.031
 F+0.029
 morning+0.029
Neg logit contributions
 instructions-0.056
 occasions-0.053
quet-0.053
 brace-0.052
 recipe-0.050
Token upon
Token ID2402
Feature activation+1.16e+00

Pos logit contributions
 seeker+0.105
ained+0.075
 tired+0.058
asc+0.054
 forward+0.053
Neg logit contributions
 instructions-0.247
So-0.237
 slices-0.230
 occasions-0.229
and-0.226
Token a
Token ID257
Feature activation+5.06e+00

Pos logit contributions
 seeker+0.014
ained+0.009
 tired+0.009
 morning+0.007
 spl+0.007
Neg logit contributions
 instructions-0.039
 occasions-0.037
quet-0.036
 recipe-0.035
keeping-0.035
Token time
Token ID640
Feature activation+6.20e+00

Pos logit contributions
 seeker+0.061
ained+0.043
 tired+0.039
 morning+0.035
 spl+0.029
Neg logit contributions
 instructions-0.173
keeping-0.163
quet-0.159
 occasions-0.154
 brace-0.152
Token,
Token ID11
Feature activation+5.63e+00

Pos logit contributions
 seeker+0.118
ained+0.098
 tired+0.083
ancer+0.082
 forward+0.082
Neg logit contributions
 instructions-0.141
So-0.140
 slices-0.132
and-0.132
 distress-0.130
Token there
Token ID612
Feature activation+3.22e+00

Pos logit contributions
 seeker+0.106
ained+0.084
ancer+0.074
ahs+0.071
 forward+0.071
Neg logit contributions
So-0.129
 instructions-0.125
 distress-0.121
 slices-0.120
ymes-0.117
Token became
Token ID2627
Feature activation+2.30e+00

Pos logit contributions
 seeker+0.042
ained+0.038
ancer+0.035
asc+0.035
ahs+0.032
Neg logit contributions
So-0.032
 instructions-0.027
 distress-0.025
 note-0.025
and-0.024
Token good
Token ID922
Feature activation+1.20e+00

Pos logit contributions
 seeker+0.046
ained+0.039
ancer+0.037
asc+0.035
 slid+0.034
Neg logit contributions
 instructions-0.050
So-0.050
 brace-0.047
 slices-0.045
 distress-0.045
Token friends
Token ID2460
Feature activation-1.47e+00

Pos logit contributions
 seeker+0.025
ained+0.021
 tired+0.019
 forward+0.018
asc+0.018
Neg logit contributions
 instructions-0.027
So-0.025
 occasions-0.024
 slices-0.024
and-0.023
Token.
Token ID13
Feature activation+8.75e-01

Pos logit contributions
 instructions+0.026
 occasions+0.023
So+0.023
 slices+0.022
quet+0.022
Neg logit contributions
 seeker-0.038
ained-0.033
 tired-0.031
 forward-0.030
 slid-0.029
Token<|endoftext|>
Token ID50256
Feature activation+2.00e+00

Pos logit contributions
 seeker+0.024
ained+0.020
 tired+0.018
ancer+0.018
 forward+0.018
Neg logit contributions
 instructions-0.014
So-0.014
 slices-0.013
 occasions-0.013
and-0.012
TokenOnce
Token ID7454
Feature activation+8.17e+00

Pos logit contributions
 seeker+0.040
ained+0.035
 tired+0.031
 F+0.029
 morning+0.029
Neg logit contributions
 instructions-0.049
 occasions-0.046
quet-0.045
 brace-0.045
 recipe-0.044
Token upon
Token ID2402
Feature activation+1.26e+00

Pos logit contributions
 seeker+0.116
ained+0.085
 tired+0.068
asc+0.065
 forward+0.063
Neg logit contributions
 instructions-0.235
So-0.226
 slices-0.218
 occasions-0.216
and-0.215
Token a
Token ID257
Feature activation+5.24e+00

Pos logit contributions
 seeker+0.016
ained+0.012
 tired+0.011
 slid+0.009
 morning+0.009
Neg logit contributions
 instructions-0.040
 occasions-0.038
quet-0.037
 recipe-0.036
keeping-0.036
Token time
Token ID640
Feature activation+6.39e+00

Pos logit contributions
 seeker+0.065
ained+0.047
 tired+0.043
 morning+0.038
 slid+0.033
Neg logit contributions
 instructions-0.176
keeping-0.165
quet-0.161
 occasions-0.157
 brace-0.154
Token,
Token ID11
Feature activation+5.84e+00

Pos logit contributions
 seeker+0.126
ained+0.105
asc+0.088
 tired+0.088
 forward+0.087
Neg logit contributions
So-0.141
 instructions-0.139
and-0.133
 slices-0.131
 distress-0.128
Token there
Token ID612
Feature activation+3.24e+00

Pos logit contributions
 seeker+0.112
ained+0.088
ancer+0.077
 F+0.075
asc+0.075
Neg logit contributions
So-0.132
 instructions-0.127
 slices-0.123
 distress-0.122
and-0.120
Token happiness
Token ID12157
Feature activation-2.16e+00

Pos logit contributions
 seeker+0.024
ained+0.020
 tired+0.019
 forward+0.018
 F+0.017
Neg logit contributions
 instructions-0.024
quet-0.023
So-0.022
 occasions-0.022
 slices-0.021
Token to
Token ID284
Feature activation-1.04e+00

Pos logit contributions
 instructions+0.044
So+0.041
 occasions+0.040
quet+0.039
 slices+0.039
Neg logit contributions
 seeker-0.048
ained-0.041
 tired-0.038
 forward-0.037
 slid-0.035
Token everyone
Token ID2506
Feature activation-1.41e+00

Pos logit contributions
 instructions+0.022
So+0.020
 occasions+0.020
 slices+0.019
 brace+0.019
Neg logit contributions
 seeker-0.023
ained-0.020
 tired-0.017
 slid-0.017
 forward-0.017
Token.
Token ID13
Feature activation+8.99e-01

Pos logit contributions
 instructions+0.029
So+0.027
 occasions+0.026
 slices+0.026
quet+0.025
Neg logit contributions
 seeker-0.032
ained-0.027
 tired-0.024
 forward-0.023
 slid-0.023
Token<|endoftext|>
Token ID50256
Feature activation+1.96e+00

Pos logit contributions
 seeker+0.019
ained+0.017
 tired+0.015
 F+0.015
 Jump+0.014
Neg logit contributions
 instructions-0.020
 occasions-0.018
 brace-0.018
ymes-0.018
quet-0.018
TokenOnce
Token ID7454
Feature activation+8.17e+00

Pos logit contributions
 seeker+0.036
ained+0.031
 tired+0.027
 F+0.026
 morning+0.026
Neg logit contributions
 instructions-0.051
quet-0.048
 occasions-0.048
 brace-0.047
 recipe-0.046
Token upon
Token ID2402
Feature activation+1.06e+00

Pos logit contributions
 seeker+0.106
ained+0.077
 tired+0.062
 forward+0.056
 slid+0.055
Neg logit contributions
 instructions-0.248
So-0.236
 occasions-0.231
 slices-0.229
quet-0.226
Token a
Token ID257
Feature activation+4.99e+00

Pos logit contributions
 seeker+0.011
ained+0.007
 tired+0.007
 morning+0.006
 slid+0.005
Neg logit contributions
 instructions-0.036
 occasions-0.034
quet-0.034
 recipe-0.033
keeping-0.032
Token time
Token ID640
Feature activation+6.38e+00

Pos logit contributions
 seeker+0.057
ained+0.038
 tired+0.035
 morning+0.033
 tim+0.026
Neg logit contributions
 instructions-0.172
keeping-0.164
quet-0.160
 occasions-0.152
 brace-0.152
Token,
Token ID11
Feature activation+5.74e+00

Pos logit contributions
 seeker+0.115
ained+0.093
 tired+0.079
asc+0.076
ancer+0.076
Neg logit contributions
 instructions-0.159
So-0.152
 slices-0.146
 occasions-0.146
and-0.144
Token there
Token ID612
Feature activation+3.07e+00

Pos logit contributions
 seeker+0.113
ained+0.092
ancer+0.080
ahs+0.080
asc+0.078
Neg logit contributions
So-0.126
 instructions-0.120
 slices-0.119
 distress-0.118
 note-0.113
Token fun
Token ID1257
Feature activation-7.98e-01

Pos logit contributions
 instructions+0.048
So+0.047
 slices+0.044
 recipe+0.043
 brace+0.042
Neg logit contributions
 seeker-0.043
ained-0.040
asc-0.033
ancer-0.033
 F-0.030
Token time
Token ID640
Feature activation+6.05e-03

Pos logit contributions
 instructions+0.022
So+0.021
 slices+0.020
 occasions+0.020
quet+0.020
Neg logit contributions
 seeker-0.013
ained-0.009
 tired-0.008
 forward-0.008
 slid-0.007
Token together
Token ID1978
Feature activation+8.69e-02

Pos logit contributions
 seeker+0.000
ained+0.000
asc+0.000
ancer+0.000
 tired+0.000
Neg logit contributions
 instructions-0.000
So-0.000
 slices-0.000
 occasions-0.000
and-0.000
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
 seeker+0.002
ained+0.002
 tired+0.002
 forward+0.002
 slid+0.002
Neg logit contributions
 instructions-0.001
So-0.001
 occasions-0.001
 slices-0.001
quet-0.001
Token<|endoftext|>
Token ID50256
Feature activation+2.01e+00

Pos logit contributions
 seeker+0.031
ained+0.024
ancer+0.024
 tired+0.023
 forward+0.023
Neg logit contributions
So-0.015
 instructions-0.015
and-0.014
 occasions-0.013
 slices-0.012
TokenOnce
Token ID7454
Feature activation+8.17e+00

Pos logit contributions
 seeker+0.039
ained+0.034
 tired+0.030
 F+0.029
 morning+0.028
Neg logit contributions
 instructions-0.050
quet-0.047
 occasions-0.047
 brace-0.046
 recipe-0.045
Token upon
Token ID2402
Feature activation+1.03e+00

Pos logit contributions
 seeker+0.099
ained+0.068
 tired+0.050
asc+0.047
 forward+0.046
Neg logit contributions
 instructions-0.253
So-0.243
 slices-0.236
 occasions-0.235
and-0.232
Token a
Token ID257
Feature activation+4.98e+00

Pos logit contributions
 seeker+0.013
ained+0.010
 tired+0.009
 morning+0.008
 slid+0.007
Neg logit contributions
 instructions-0.032
 occasions-0.030
quet-0.030
 recipe-0.029
keeping-0.029
Token time
Token ID640
Feature activation+6.28e+00

Pos logit contributions
 seeker+0.066
ained+0.048
 tired+0.043
 morning+0.039
 slid+0.035
Neg logit contributions
 instructions-0.161
keeping-0.151
quet-0.148
 occasions-0.144
 brace-0.142
Token,
Token ID11
Feature activation+5.47e+00

Pos logit contributions
 seeker+0.127
ained+0.107
asc+0.091
ancer+0.090
 tired+0.090
Neg logit contributions
So-0.135
 instructions-0.134
and-0.127
 slices-0.126
 distress-0.124
Token there
Token ID612
Feature activation+3.02e+00

Pos logit contributions
 seeker+0.111
ained+0.089
ancer+0.079
 surfing+0.078
ahs+0.077
Neg logit contributions
So-0.118
 slices-0.108
 distress-0.107
 instructions-0.107
and-0.106
Token its
Token ID663
Feature activation+4.37e+00

Pos logit contributions
 seeker+0.020
ained+0.016
 tired+0.014
 forward+0.013
 slid+0.013
Neg logit contributions
 instructions-0.029
So-0.027
 occasions-0.027
 slices-0.026
quet-0.026
Token garden
Token ID11376
Feature activation+2.83e-01

Pos logit contributions
 seeker+0.099
ained+0.084
ancer+0.078
asc+0.075
 forward+0.073
Neg logit contributions
 instructions-0.091
So-0.082
 occasions-0.082
 slices-0.080
 brace-0.080
Token home
Token ID1363
Feature activation-2.35e-01

Pos logit contributions
 seeker+0.007
ained+0.006
 tired+0.006
 forward+0.006
 slid+0.006
Neg logit contributions
 instructions-0.005
So-0.005
 occasions-0.004
 distress-0.004
quet-0.004
Token.
Token ID13
Feature activation+6.75e-01

Pos logit contributions
 instructions+0.004
So+0.003
 occasions+0.003
 slices+0.003
quet+0.003
Neg logit contributions
 seeker-0.007
ained-0.006
 tired-0.005
 forward-0.005
 slid-0.005
Token<|endoftext|>
Token ID50256
Feature activation+1.58e+00

Pos logit contributions
 seeker+0.017
ained+0.014
 tired+0.013
 forward+0.012
asc+0.012
Neg logit contributions
 instructions-0.013
So-0.012
 occasions-0.012
 slices-0.011
and-0.011
TokenOnce
Token ID7454
Feature activation+8.17e+00

Pos logit contributions
 seeker+0.032
ained+0.027
 tired+0.025
 F+0.024
 morning+0.023
Neg logit contributions
 instructions-0.038
 occasions-0.036
quet-0.036
 brace-0.035
 recipe-0.034
Token upon
Token ID2402
Feature activation+1.25e+00

Pos logit contributions
 seeker+0.100
ained+0.070
 tired+0.055
 forward+0.049
 slid+0.049
Neg logit contributions
 instructions-0.254
So-0.241
 occasions-0.236
 slices-0.235
quet-0.231
Token a
Token ID257
Feature activation+4.70e+00

Pos logit contributions
 seeker+0.015
ained+0.010
 tired+0.010
 morning+0.008
 slid+0.007
Neg logit contributions
 instructions-0.041
 occasions-0.039
quet-0.038
 recipe-0.037
keeping-0.037
Token time
Token ID640
Feature activation+6.18e+00

Pos logit contributions
 seeker+0.056
ained+0.037
 tired+0.035
 morning+0.032
 tim+0.026
Neg logit contributions
 instructions-0.161
keeping-0.152
quet-0.149
 occasions-0.143
 brace-0.142
Token,
Token ID11
Feature activation+5.76e+00

Pos logit contributions
 seeker+0.119
ained+0.100
 tired+0.084
ancer+0.083
asc+0.083
Neg logit contributions
 instructions-0.139
So-0.139
and-0.131
 slices-0.130
 distress-0.128
Token there
Token ID612
Feature activation+3.20e+00

Pos logit contributions
 seeker+0.110
ained+0.089
ancer+0.079
ahs+0.077
asc+0.074
Neg logit contributions
So-0.130
 instructions-0.124
 distress-0.122
 slices-0.120
and-0.117
Token on
Token ID319
Feature activation+1.33e+00

Pos logit contributions
 seeker+0.004
ained+0.003
 kidding+0.003
asc+0.003
Whe+0.003
Neg logit contributions
 instructions-0.005
 slices-0.005
 words-0.004
So-0.004
quet-0.004
Token their
Token ID511
Feature activation+2.47e+00

Pos logit contributions
 seeker+0.024
ained+0.019
ancer+0.017
 tired+0.016
 forward+0.015
Neg logit contributions
 instructions-0.035
So-0.033
 slices-0.032
 occasions-0.032
quet-0.031
Token faces
Token ID6698
Feature activation+4.57e-01

Pos logit contributions
 seeker+0.067
ained+0.058
 tired+0.058
 slid+0.055
 forward+0.054
Neg logit contributions
 instructions-0.036
So-0.034
 occasions-0.033
quet-0.032
ymes-0.031
Token.
Token ID13
Feature activation+6.24e-01

Pos logit contributions
 seeker+0.014
ained+0.013
asc+0.012
ancer+0.011
 tired+0.011
Neg logit contributions
 instructions-0.005
So-0.005
 slices-0.005
quet-0.004
 recipe-0.004
Token<|endoftext|>
Token ID50256
Feature activation+1.94e+00

Pos logit contributions
 seeker+0.018
ancer+0.015
ained+0.015
 forward+0.014
 tired+0.014
Neg logit contributions
So-0.009
 instructions-0.008
and-0.008
 slices-0.007
 truth-0.007
TokenOnce
Token ID7454
Feature activation+8.16e+00

Pos logit contributions
 seeker+0.038
ained+0.033
 tired+0.029
 F+0.028
 morning+0.027
Neg logit contributions
 instructions-0.048
 occasions-0.045
quet-0.045
 brace-0.044
 recipe-0.044
Token upon
Token ID2402
Feature activation+1.16e+00

Pos logit contributions
 seeker+0.114
ained+0.081
 tired+0.063
asc+0.063
 forward+0.060
Neg logit contributions
 instructions-0.235
So-0.228
 slices-0.219
and-0.217
 occasions-0.216
Token a
Token ID257
Feature activation+4.85e+00

Pos logit contributions
 seeker+0.015
ained+0.011
 tired+0.010
 slid+0.008
 morning+0.008
Neg logit contributions
 instructions-0.037
 occasions-0.035
quet-0.034
 recipe-0.033
keeping-0.033
Token time
Token ID640
Feature activation+6.32e+00

Pos logit contributions
 seeker+0.067
ained+0.049
 tired+0.044
 morning+0.039
 slid+0.037
Neg logit contributions
 instructions-0.155
keeping-0.143
quet-0.141
 occasions-0.139
 brace-0.136
Token,
Token ID11
Feature activation+5.64e+00

Pos logit contributions
 seeker+0.125
ained+0.104
ancer+0.089
 tired+0.088
 forward+0.087
Neg logit contributions
So-0.138
 instructions-0.137
and-0.130
 slices-0.129
 distress-0.127
Token there
Token ID612
Feature activation+3.08e+00

Pos logit contributions
 seeker+0.106
ained+0.083
ancer+0.074
 tired+0.071
 forward+0.071
Neg logit contributions
So-0.130
 instructions-0.125
 distress-0.121
 slices-0.121
and-0.118
Token day
Token ID1110
Feature activation+3.41e-01

Pos logit contributions
 seeker+0.008
ained+0.005
 morning+0.005
 tired+0.004
 dusk+0.004
Neg logit contributions
 instructions-0.017
quet-0.016
keeping-0.016
So-0.016
ymes-0.016
Token matching
Token ID12336
Feature activation+2.01e-01

Pos logit contributions
 seeker+0.008
ained+0.007
ancer+0.006
asc+0.006
 tired+0.006
Neg logit contributions
 instructions-0.006
So-0.006
 slices-0.006
 occasions-0.006
quet-0.006
Token pages
Token ID5468
Feature activation+2.31e-01

Pos logit contributions
 seeker+0.004
ained+0.003
 tired+0.003
 forward+0.003
 morning+0.003
Neg logit contributions
 instructions-0.004
So-0.004
 distress-0.004
quet-0.004
 occasions-0.004
Token.
Token ID13
Feature activation+1.31e+00

Pos logit contributions
 seeker+0.005
ained+0.004
 tired+0.004
 forward+0.004
 slid+0.004
Neg logit contributions
 instructions-0.005
So-0.005
 occasions-0.005
 distress-0.005
ymes-0.004
Token<|endoftext|>
Token ID50256
Feature activation+1.91e+00

Pos logit contributions
 seeker+0.036
ained+0.031
 tired+0.028
ancer+0.028
 forward+0.028
Neg logit contributions
 instructions-0.021
So-0.020
 slices-0.018
 occasions-0.018
and-0.017
TokenOnce
Token ID7454
Feature activation+8.16e+00

Pos logit contributions
 seeker+0.037
ained+0.032
 tired+0.029
 F+0.027
 morning+0.027
Neg logit contributions
 instructions-0.047
 occasions-0.044
quet-0.044
 brace-0.043
 recipe-0.042
Token upon
Token ID2402
Feature activation+1.15e+00

Pos logit contributions
 seeker+0.127
ained+0.097
 tired+0.082
 forward+0.076
 slid+0.075
Neg logit contributions
 instructions-0.227
So-0.215
 occasions-0.210
 slices-0.208
quet-0.205
Token a
Token ID257
Feature activation+4.68e+00

Pos logit contributions
 seeker+0.015
ained+0.011
 tired+0.010
 morning+0.008
 slid+0.008
Neg logit contributions
 instructions-0.036
 occasions-0.034
quet-0.033
 recipe-0.033
keeping-0.032
Token time
Token ID640
Feature activation+6.21e+00

Pos logit contributions
 seeker+0.061
ained+0.044
 tired+0.039
 morning+0.037
 slid+0.031
Neg logit contributions
 instructions-0.154
keeping-0.142
quet-0.141
 occasions-0.137
 brace-0.135
Token,
Token ID11
Feature activation+5.59e+00

Pos logit contributions
 seeker+0.121
ained+0.100
ancer+0.085
 tired+0.084
asc+0.084
Neg logit contributions
 instructions-0.138
So-0.138
and-0.129
 slices-0.129
 distress-0.128
Token there
Token ID612
Feature activation+3.03e+00

Pos logit contributions
 seeker+0.108
ained+0.086
ancer+0.076
 tired+0.073
ahs+0.072
Neg logit contributions
So-0.126
 instructions-0.123
 distress-0.118
 slices-0.117
and-0.114
Token more
Token ID517
Feature activation+1.34e-01

Pos logit contributions
 seeker+0.025
ained+0.021
 tired+0.019
 slid+0.018
 forward+0.017
Neg logit contributions
 instructions-0.032
 occasions-0.030
So-0.029
quet-0.029
 slices-0.029
Token fun
Token ID1257
Feature activation-2.19e+00

Pos logit contributions
 seeker+0.003
ained+0.003
 tired+0.003
ancer+0.002
 forward+0.002
Neg logit contributions
 instructions-0.003
So-0.002
 occasions-0.002
 slices-0.002
 brace-0.002
Token swimming
Token ID14899
Feature activation+1.52e+00

Pos logit contributions
 instructions+0.041
So+0.037
 slices+0.036
quet+0.036
 distress+0.036
Neg logit contributions
 seeker-0.054
ained-0.047
 tired-0.043
 forward-0.041
 slid-0.041
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
 seeker+0.037
ained+0.031
 tired+0.028
 forward+0.027
 slid+0.027
Neg logit contributions
 instructions-0.029
So-0.027
 occasions-0.026
 slices-0.026
quet-0.025
Token<|endoftext|>
Token ID50256
Feature activation+2.25e+00

Pos logit contributions
 seeker+0.039
ained+0.032
ancer+0.031
 forward+0.030
 tired+0.030
Neg logit contributions
So-0.021
 instructions-0.020
and-0.018
 slices-0.018
 words-0.017
TokenOnce
Token ID7454
Feature activation+8.16e+00

Pos logit contributions
 seeker+0.046
ained+0.040
 tired+0.036
 F+0.035
 Jump+0.033
Neg logit contributions
 instructions-0.054
 occasions-0.050
quet-0.049
 brace-0.048
 recipe-0.048
Token upon
Token ID2402
Feature activation+1.38e+00

Pos logit contributions
 seeker+0.111
ained+0.080
 tired+0.063
asc+0.060
 forward+0.059
Neg logit contributions
 instructions-0.239
So-0.230
 slices-0.221
 occasions-0.220
and-0.219
Token a
Token ID257
Feature activation+4.79e+00

Pos logit contributions
 seeker+0.017
ained+0.012
 tired+0.012
 morning+0.009
 spl+0.009
Neg logit contributions
 instructions-0.045
 occasions-0.043
quet-0.042
 recipe-0.041
keeping-0.040
Token time
Token ID640
Feature activation+6.06e+00

Pos logit contributions
 seeker+0.060
ained+0.043
 tired+0.039
 morning+0.035
 spl+0.030
Neg logit contributions
 instructions-0.161
keeping-0.150
quet-0.147
 occasions-0.144
 brace-0.141
Token,
Token ID11
Feature activation+5.70e+00

Pos logit contributions
 seeker+0.120
ained+0.098
ancer+0.084
 tired+0.083
asc+0.083
Neg logit contributions
So-0.134
 instructions-0.133
and-0.126
 slices-0.125
 distress-0.122
Token there
Token ID612
Feature activation+3.24e+00

Pos logit contributions
 seeker+0.109
ained+0.085
ancer+0.076
 tired+0.073
 forward+0.073
Neg logit contributions
So-0.130
 instructions-0.127
 distress-0.121
 slices-0.121
and-0.119
Tokeny
Token ID88
Feature activation+1.52e-01

Pos logit contributions
 seeker+0.034
ained+0.029
 tired+0.027
 forward+0.026
 slid+0.026
Neg logit contributions
 instructions-0.023
So-0.021
 occasions-0.020
 slices-0.020
quet-0.019
Token universe
Token ID6881
Feature activation+1.36e-02

Pos logit contributions
 seeker+0.004
ained+0.003
asc+0.003
 tired+0.003
 forward+0.003
Neg logit contributions
 instructions-0.003
So-0.003
 occasions-0.003
 slices-0.003
 brace-0.003
Token toy
Token ID13373
Feature activation+1.24e+00

Pos logit contributions
 seeker+0.000
ained+0.000
 tired+0.000
 forward+0.000
 morning+0.000
Neg logit contributions
 instructions-0.000
 distress-0.000
So-0.000
 occasions-0.000
Mic-0.000
Token.
Token ID13
Feature activation+8.54e-01

Pos logit contributions
 seeker+0.030
ained+0.025
 tired+0.023
 forward+0.023
 slid+0.022
Neg logit contributions
 instructions-0.024
So-0.022
 occasions-0.022
 slices-0.021
 distress-0.021
Token<|endoftext|>
Token ID50256
Feature activation+1.98e+00

Pos logit contributions
 seeker+0.024
ained+0.020
ancer+0.018
 tired+0.018
 forward+0.018
Neg logit contributions
So-0.013
 instructions-0.013
and-0.012
 occasions-0.012
 slices-0.012
TokenOnce
Token ID7454
Feature activation+8.15e+00

Pos logit contributions
 seeker+0.038
ained+0.033
 tired+0.030
 F+0.028
 morning+0.028
Neg logit contributions
 instructions-0.050
 occasions-0.046
quet-0.046
 brace-0.045
 recipe-0.044
Token upon
Token ID2402
Feature activation+1.09e+00

Pos logit contributions
 seeker+0.101
ained+0.069
 tired+0.051
asc+0.050
 slid+0.048
Neg logit contributions
 instructions-0.248
So-0.239
 slices-0.232
and-0.230
 occasions-0.229
Token a
Token ID257
Feature activation+5.05e+00

Pos logit contributions
 seeker+0.013
ained+0.010
 tired+0.009
 morning+0.007
 slid+0.007
Neg logit contributions
 instructions-0.036
 occasions-0.034
quet-0.033
 recipe-0.033
keeping-0.032
Token time
Token ID640
Feature activation+6.52e+00

Pos logit contributions
 seeker+0.071
ained+0.052
 tired+0.047
 morning+0.042
 slid+0.038
Neg logit contributions
 instructions-0.160
keeping-0.147
quet-0.146
 occasions-0.143
 brace-0.140
Token,
Token ID11
Feature activation+5.71e+00

Pos logit contributions
 seeker+0.121
ained+0.099
 tired+0.084
ancer+0.083
asc+0.082
Neg logit contributions
 instructions-0.153
So-0.151
 slices-0.143
and-0.142
 distress-0.140
Token there
Token ID612
Feature activation+3.20e+00

Pos logit contributions
 seeker+0.109
ained+0.086
ancer+0.076
ahs+0.074
 surfing+0.073
Neg logit contributions
So-0.129
 instructions-0.123
 distress-0.120
 slices-0.119
and-0.117
Token remembered
Token ID12086
Feature activation-2.83e+00

Pos logit contributions
 instructions+0.012
So+0.012
 occasions+0.011
 slices+0.011
 distress+0.011
Neg logit contributions
 seeker-0.012
ained-0.010
 tired-0.009
ancer-0.008
 forward-0.008
Token that
Token ID326
Feature activation-1.32e+00

Pos logit contributions
 instructions+0.066
So+0.062
 occasions+0.061
 slices+0.060
quet+0.059
Neg logit contributions
 seeker-0.057
ained-0.047
 tired-0.042
 forward-0.040
ancer-0.039
Token her
Token ID607
Feature activation-7.84e-01

Pos logit contributions
 instructions+0.028
So+0.025
 words+0.025
 occasions+0.025
 distress+0.024
Neg logit contributions
 seeker-0.030
ained-0.026
asc-0.023
ancer-0.023
 slid-0.022
Token parents
Token ID3397
Feature activation-1.92e+00

Pos logit contributions
 instructions+0.016
So+0.015
 occasions+0.015
 slices+0.015
quet+0.015
Neg logit contributions
 seeker-0.018
ained-0.015
 tired-0.014
 forward-0.013
 slid-0.013
Token had
Token ID550
Feature activation+9.10e-01

Pos logit contributions
 instructions+0.043
So+0.040
 occasions+0.038
 distress+0.038
 slices+0.037
Neg logit contributions
 seeker-0.041
ained-0.034
 tired-0.030
 forward-0.029
asc-0.029
Token taught
Token ID7817
Feature activation-5.55e+00

Pos logit contributions
 seeker+0.018
ained+0.015
 tired+0.014
 slid+0.013
 F+0.013
Neg logit contributions
 instructions-0.020
So-0.019
quet-0.019
 occasions-0.019
 slices-0.019
Token her
Token ID607
Feature activation-3.92e+00

Pos logit contributions
quet+0.148
 slices+0.145
 lair+0.144
 instructions+0.143
So+0.142
Neg logit contributions
 seeker-0.066
ained-0.062
 morning-0.057
 F-0.056
 forward-0.055
Token to
Token ID284
Feature activation-4.03e+00

Pos logit contributions
 instructions+0.078
So+0.070
 occasions+0.068
 slices+0.066
 distress+0.065
Neg logit contributions
 seeker-0.095
ained-0.080
 tired-0.073
ancer-0.071
 slid-0.071
Token be
Token ID307
Feature activation+9.01e-01

Pos logit contributions
 instructions+0.081
So+0.075
 slices+0.074
 occasions+0.073
quet+0.072
Neg logit contributions
 seeker-0.093
ained-0.078
 tired-0.071
 slid-0.067
 forward-0.067
Token polite
Token ID23507
Feature activation-4.59e+00

Pos logit contributions
 seeker+0.023
ancer+0.019
ained+0.019
ahs+0.017
 dusk+0.017
Neg logit contributions
 instructions-0.017
So-0.016
 distress-0.014
 brace-0.014
 occasions-0.014
Token and
Token ID290
Feature activation-8.04e-01

Pos logit contributions
quet+0.097
 instructions+0.093
So+0.092
 brace+0.090
Mic+0.090
Neg logit contributions
 seeker-0.094
ained-0.079
 tired-0.076
 forward-0.075
 F-0.068
TokenGrand
Token ID23581
Feature activation-1.03e+00

Pos logit contributions
 seeker+0.035
ained+0.029
 tired+0.027
 forward+0.026
 slid+0.026
Neg logit contributions
 instructions-0.026
So-0.025
 slices-0.024
 occasions-0.024
and-0.023
Tokenma
Token ID2611
Feature activation+3.14e-01

Pos logit contributions
 instructions+0.020
So+0.018
 occasions+0.017
quet+0.017
 slices+0.017
Neg logit contributions
 seeker-0.024
ained-0.021
 tired-0.019
 slid-0.018
ancer-0.018
Token was
Token ID373
Feature activation+3.90e+00

Pos logit contributions
 seeker+0.008
ained+0.007
 forward+0.006
ancer+0.006
asc+0.006
Neg logit contributions
 instructions-0.005
So-0.005
 slices-0.005
 occasions-0.005
 recipe-0.005
Token teaching
Token ID7743
Feature activation-4.22e+00

Pos logit contributions
 seeker+0.076
ancer+0.059
ained+0.054
ahs+0.052
 slid+0.052
Neg logit contributions
So-0.084
 slices-0.082
 instructions-0.082
and-0.081
 distress-0.079
Token her
Token ID607
Feature activation-3.19e+00

Pos logit contributions
 instructions+0.094
So+0.087
 occasions+0.085
 slices+0.084
quet+0.082
Neg logit contributions
 seeker-0.090
ained-0.074
 tired-0.067
 forward-0.064
 slid-0.063
Token grand
Token ID4490
Feature activation-5.52e+00

Pos logit contributions
 instructions+0.073
So+0.068
 occasions+0.067
 slices+0.066
quet+0.065
Neg logit contributions
 seeker-0.065
ained-0.054
 tired-0.048
 forward-0.046
 slid-0.045
Tokenchild
Token ID9410
Feature activation-5.11e+00

Pos logit contributions
So+0.100
 instructions+0.094
quet+0.093
keeping+0.086
 slices+0.086
Neg logit contributions
 seeker-0.118
 morning-0.100
ained-0.096
ahs-0.095
 surfing-0.094
Token how
Token ID703
Feature activation-7.27e-01

Pos logit contributions
 instructions+0.113
So+0.104
 occasions+0.103
 slices+0.102
quet+0.098
Neg logit contributions
 seeker-0.112
ained-0.092
 tired-0.081
ancer-0.080
asc-0.078
Token to
Token ID284
Feature activation-3.49e+00

Pos logit contributions
 instructions+0.011
So+0.009
and+0.008
 occasions+0.008
 slices+0.008
Neg logit contributions
 seeker-0.023
ained-0.019
ancer-0.017
asc-0.017
 dusk-0.017
Token use
Token ID779
Feature activation+1.19e-01

Pos logit contributions
 instructions+0.071
So+0.066
 slices+0.065
and+0.064
 occasions+0.063
Neg logit contributions
 seeker-0.078
ained-0.064
 tired-0.059
 slid-0.057
 F-0.056
Token a
Token ID257
Feature activation-2.63e+00

Pos logit contributions
 seeker+0.003
ained+0.002
asc+0.002
 tired+0.002
ancer+0.002
Neg logit contributions
 instructions-0.003
 occasions-0.002
So-0.002
 slices-0.002
 brace-0.002
Token full
Token ID1336
Feature activation-2.36e+00

Pos logit contributions
 instructions+0.007
So+0.007
 slices+0.007
 recipe+0.007
 distress+0.006
Neg logit contributions
 seeker-0.008
ained-0.007
ancer-0.007
 slid-0.006
asc-0.006
Token of
Token ID286
Feature activation+1.11e-02

Pos logit contributions
 instructions+0.049
So+0.045
 occasions+0.045
 distress+0.044
keeping+0.044
Neg logit contributions
 seeker-0.050
ained-0.043
 tired-0.041
 forward-0.039
 morning-0.038
Token love
Token ID1842
Feature activation-2.47e+00

Pos logit contributions
 seeker+0.000
ained+0.000
ancer+0.000
 slid+0.000
asc+0.000
Neg logit contributions
 instructions-0.000
 occasions-0.000
So-0.000
 slices-0.000
 brace-0.000
Token and
Token ID290
Feature activation+1.58e-01

Pos logit contributions
 instructions+0.050
So+0.046
 occasions+0.045
 slices+0.044
quet+0.043
Neg logit contributions
 seeker-0.058
ained-0.049
 tired-0.044
 forward-0.042
 slid-0.042
Token made
Token ID925
Feature activation-1.72e+00

Pos logit contributions
 seeker+0.004
ained+0.003
asc+0.003
ancer+0.003
 forward+0.003
Neg logit contributions
 instructions-0.003
 occasions-0.003
 words-0.003
So-0.003
 slices-0.003
Token Daddy
Token ID26926
Feature activation-5.46e+00

Pos logit contributions
 instructions+0.040
So+0.037
 occasions+0.036
 slices+0.036
quet+0.035
Neg logit contributions
 seeker-0.036
ained-0.029
 tired-0.026
 slid-0.025
asc-0.025
Token feel
Token ID1254
Feature activation+1.04e+00

Pos logit contributions
 instructions+0.125
So+0.117
 occasions+0.117
quet+0.113
keeping+0.112
Neg logit contributions
 seeker-0.105
ained-0.088
 tired-0.085
 forward-0.078
 slid-0.076
Token special
Token ID2041
Feature activation-3.39e+00

Pos logit contributions
 seeker+0.022
ained+0.018
 tired+0.015
ancer+0.015
asc+0.015
Neg logit contributions
 instructions-0.024
So-0.022
 occasions-0.021
 slices-0.021
 distress-0.021
Token.
Token ID13
Feature activation-3.33e-01

Pos logit contributions
 instructions+0.063
So+0.059
 occasions+0.058
quet+0.057
 brace+0.056
Neg logit contributions
 seeker-0.081
ained-0.068
 tired-0.068
 forward-0.065
 F-0.060
Token Every
Token ID3887
Feature activation+1.82e-01

Pos logit contributions
 instructions+0.006
So+0.006
 occasions+0.005
 slices+0.005
and+0.005
Neg logit contributions
 seeker-0.009
ained-0.007
asc-0.007
 slid-0.006
 forward-0.006
Token evening
Token ID6180
Feature activation+6.54e-01

Pos logit contributions
 seeker+0.002
ained+0.002
 tired+0.001
 forward+0.001
 slid+0.001
Neg logit contributions
 instructions-0.006
So-0.005
 occasions-0.005
 slices-0.005
quet-0.005
Token Tim
Token ID5045
Feature activation-1.89e-01

Pos logit contributions
 seeker+0.097
asc+0.094
ained+0.093
ashing+0.089
ancer+0.086
Neg logit contributions
 slices-0.044
 instructions-0.040
 repair-0.038
 forgiveness-0.038
 mercy-0.037
Tokenmy
Token ID1820
Feature activation-3.03e-01

Pos logit contributions
 instructions+0.005
 occasions+0.005
quet+0.005
 slices+0.005
So+0.005
Neg logit contributions
 seeker-0.003
ained-0.002
 tired-0.002
 slid-0.002
 forward-0.002
Token remembered
Token ID12086
Feature activation-1.97e+00

Pos logit contributions
 instructions+0.006
So+0.005
 occasions+0.005
 slices+0.005
 distress+0.005
Neg logit contributions
 seeker-0.008
ained-0.006
 tired-0.006
 forward-0.006
ancer-0.006
Token his
Token ID465
Feature activation-4.05e-01

Pos logit contributions
 instructions+0.039
So+0.036
 distress+0.035
 slices+0.035
 occasions+0.034
Neg logit contributions
 seeker-0.047
ained-0.041
ancer-0.037
 forward-0.036
asc-0.035
Token dad
Token ID9955
Feature activation-3.04e+00

Pos logit contributions
 instructions+0.009
 occasions+0.008
So+0.008
quet+0.008
 slices+0.008
Neg logit contributions
 seeker-0.009
ained-0.007
 tired-0.007
 slid-0.006
 forward-0.006
Token teaching
Token ID7743
Feature activation-5.46e+00

Pos logit contributions
 instructions+0.059
 occasions+0.054
So+0.054
 slices+0.053
quet+0.053
Neg logit contributions
 seeker-0.071
ained-0.061
 tired-0.058
 forward-0.055
 slid-0.054
Token him
Token ID683
Feature activation-2.62e+00

Pos logit contributions
quet+0.132
 instructions+0.130
So+0.129
 slices+0.123
 occasions+0.123
Neg logit contributions
 seeker-0.089
ained-0.080
 tired-0.070
 F-0.068
 morning-0.065
Token how
Token ID703
Feature activation-1.06e+00

Pos logit contributions
 instructions+0.052
So+0.046
 occasions+0.044
 slices+0.044
and+0.043
Neg logit contributions
 seeker-0.066
ained-0.054
ancer-0.050
 tired-0.049
 forward-0.048
Token to
Token ID284
Feature activation-2.92e+00

Pos logit contributions
 instructions+0.017
So+0.015
 occasions+0.014
 slices+0.014
quet+0.014
Neg logit contributions
 seeker-0.030
ained-0.026
 tired-0.024
 forward-0.023
ancer-0.023
Token whistle
Token ID16121
Feature activation+4.93e-01

Pos logit contributions
 instructions+0.060
So+0.055
 slices+0.055
 occasions+0.054
and+0.054
Neg logit contributions
 seeker-0.066
ained-0.055
 tired-0.051
 slid-0.049
 F-0.048
Token.
Token ID13
Feature activation-8.37e-01

Pos logit contributions
 seeker+0.011
ained+0.009
 tired+0.009
 forward+0.008
 slid+0.008
Neg logit contributions
 instructions-0.010
So-0.009
 occasions-0.009
 slices-0.009
quet-0.009
Token farm
Token ID5318
Feature activation-1.85e+00

Pos logit contributions
 instructions+0.002
So+0.002
quet+0.002
 slices+0.002
ymes+0.002
Neg logit contributions
 seeker-0.002
ained-0.002
 tired-0.002
 pool-0.001
 morning-0.001
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
 instructions+0.031
So+0.029
 occasions+0.028
quet+0.027
 distress+0.027
Neg logit contributions
 seeker-0.047
ained-0.041
 tired-0.038
 forward-0.037
 F-0.036
Token She
Token ID1375
Feature activation-9.81e-01

Pos logit contributions
 instructions+0.026
 occasions+0.024
So+0.024
 slices+0.024
ymes+0.024
Neg logit contributions
 seeker-0.023
ained-0.019
 tired-0.018
 F-0.017
 forward-0.016
Token told
Token ID1297
Feature activation-2.99e+00

Pos logit contributions
 instructions+0.021
So+0.020
 occasions+0.019
 slices+0.019
quet+0.018
Neg logit contributions
 seeker-0.022
ained-0.018
 tired-0.016
 forward-0.016
ancer-0.015
Token her
Token ID607
Feature activation-2.92e+00

Pos logit contributions
 instructions+0.069
So+0.067
 occasions+0.064
quet+0.064
 slices+0.064
Neg logit contributions
 seeker-0.056
ained-0.045
 tired-0.043
 forward-0.041
 F-0.041
Token grand
Token ID4490
Feature activation-5.44e+00

Pos logit contributions
 instructions+0.058
So+0.053
 occasions+0.050
 recipe+0.049
 slices+0.049
Neg logit contributions
 seeker-0.072
ained-0.064
ancer-0.055
 tired-0.055
asc-0.054
Tokenpa
Token ID8957
Feature activation-4.35e+00

Pos logit contributions
So+0.112
 instructions+0.111
quet+0.109
 slices+0.107
and+0.103
Neg logit contributions
 seeker-0.110
ained-0.084
 morning-0.084
 tired-0.080
 F-0.080
Token that
Token ID326
Feature activation-1.15e+00

Pos logit contributions
 instructions+0.085
So+0.074
 words+0.073
 recipe+0.073
 slices+0.072
Neg logit contributions
 seeker-0.111
ained-0.095
asc-0.083
 slid-0.081
Whe-0.081
Token she
Token ID673
Feature activation-8.02e-01

Pos logit contributions
 instructions+0.024
So+0.021
 occasions+0.021
 words+0.021
 recipe+0.021
Neg logit contributions
 seeker-0.027
ained-0.022
asc-0.021
 forward-0.020
 kidding-0.020
Token was
Token ID373
Feature activation+4.52e-01

Pos logit contributions
 instructions+0.019
So+0.017
 occasions+0.017
 slices+0.017
quet+0.017
Neg logit contributions
 seeker-0.016
ained-0.013
 tired-0.012
 slid-0.011
 forward-0.011
Token looking
Token ID2045
Feature activation+1.08e+00

Pos logit contributions
 seeker+0.010
ained+0.008
ancer+0.008
Whe+0.008
 slid+0.008
Neg logit contributions
 instructions-0.009
 recipe-0.009
 distress-0.008
 truth-0.008
 slices-0.008
Token Mia
Token ID32189
Feature activation-6.78e-01

Pos logit contributions
 instructions+0.001
So+0.001
 slices+0.001
 occasions+0.001
and+0.001
Neg logit contributions
 seeker-0.001
ained-0.001
 tired-0.001
asc-0.001
ancer-0.001
Token remembered
Token ID12086
Feature activation-2.69e+00

Pos logit contributions
 instructions+0.015
 occasions+0.014
So+0.014
 slices+0.014
quet+0.014
Neg logit contributions
 seeker-0.014
ained-0.011
 tired-0.011
 slid-0.010
 forward-0.010
Token how
Token ID703
Feature activation+3.50e-01

Pos logit contributions
 instructions+0.059
So+0.054
 occasions+0.052
 slices+0.052
 distress+0.052
Neg logit contributions
 seeker-0.059
ained-0.050
 tired-0.043
ancer-0.043
asc-0.043
Token her
Token ID607
Feature activation-3.93e-01

Pos logit contributions
 seeker+0.007
ained+0.006
 tired+0.006
 forward+0.005
 slid+0.005
Neg logit contributions
 instructions-0.008
So-0.007
 occasions-0.007
 slices-0.007
quet-0.007
Token parents
Token ID3397
Feature activation-1.83e+00

Pos logit contributions
 instructions+0.009
 occasions+0.008
So+0.008
 slices+0.008
quet+0.008
Neg logit contributions
 seeker-0.008
ained-0.007
 tired-0.007
 F-0.006
 forward-0.006
Token taught
Token ID7817
Feature activation-5.42e+00

Pos logit contributions
 instructions+0.043
So+0.040
 occasions+0.039
 slices+0.038
quet+0.038
Neg logit contributions
 seeker-0.036
ained-0.030
 tired-0.027
 slid-0.026
 forward-0.025
Token her
Token ID607
Feature activation-3.53e+00

Pos logit contributions
quet+0.133
So+0.131
 instructions+0.131
 slices+0.129
 occasions+0.128
Neg logit contributions
 seeker-0.081
ained-0.065
 tired-0.064
 F-0.062
 forward-0.061
Token to
Token ID284
Feature activation-3.58e+00

Pos logit contributions
 instructions+0.069
So+0.064
 occasions+0.063
 slices+0.062
quet+0.061
Neg logit contributions
 seeker-0.082
ained-0.070
 tired-0.064
 forward-0.062
 slid-0.060
Token be
Token ID307
Feature activation+8.97e-01

Pos logit contributions
 instructions+0.070
 slices+0.066
So+0.065
and+0.064
 occasions+0.064
Neg logit contributions
 seeker-0.080
ained-0.067
 tired-0.064
 F-0.060
 slid-0.059
Token gentle
Token ID10296
Feature activation-4.22e+00

Pos logit contributions
 seeker+0.022
ained+0.018
ancer+0.017
 tired+0.016
 slid+0.016
Neg logit contributions
 instructions-0.018
So-0.017
 slices-0.015
 occasions-0.015
 distress-0.015
Token and
Token ID290
Feature activation-2.15e-01

Pos logit contributions
 instructions+0.095
 occasions+0.093
So+0.092
quet+0.091
 slices+0.090
Neg logit contributions
 seeker-0.078
ained-0.063
 tired-0.062
 forward-0.061
 F-0.056
Token.
Token ID13
Feature activation-6.60e-01

Pos logit contributions
 instructions+0.029
So+0.027
 distress+0.026
 slices+0.026
quet+0.026
Neg logit contributions
 seeker-0.041
ained-0.034
 tired-0.034
 forward-0.032
 F-0.031
Token Her
Token ID2332
Feature activation-7.10e-01

Pos logit contributions
 instructions+0.016
So+0.015
 occasions+0.015
quet+0.015
 slices+0.015
Neg logit contributions
 seeker-0.012
ained-0.010
 tired-0.009
 F-0.008
 forward-0.008
Token tutor
Token ID48680
Feature activation-3.72e+00

Pos logit contributions
 instructions+0.010
So+0.009
 occasions+0.009
quet+0.009
 slices+0.009
Neg logit contributions
 seeker-0.021
ained-0.018
 tired-0.017
 forward-0.017
ancer-0.016
Token taught
Token ID7817
Feature activation-4.99e+00

Pos logit contributions
 instructions+0.078
So+0.074
 occasions+0.072
 brace+0.069
 slices+0.069
Neg logit contributions
 seeker-0.083
ained-0.071
 tired-0.061
ancer-0.061
 forward-0.060
Token her
Token ID607
Feature activation-3.77e+00

Pos logit contributions
quet+0.125
 instructions+0.123
So+0.123
 slices+0.118
 occasions+0.116
Neg logit contributions
 seeker-0.075
ained-0.065
 tired-0.061
 morning-0.059
 F-0.056
Token a
Token ID257
Feature activation-5.33e+00

Pos logit contributions
 instructions+0.075
So+0.070
 occasions+0.069
 slices+0.068
quet+0.068
Neg logit contributions
 seeker-0.085
ained-0.072
 tired-0.067
 forward-0.063
 slid-0.062
Token lot
Token ID1256
Feature activation-1.52e+00

Pos logit contributions
 instructions+0.125
keeping+0.120
 slices+0.119
So+0.118
 occasions+0.117
Neg logit contributions
 seeker-0.101
ained-0.079
 tired-0.075
 forward-0.069
 morning-0.067
Token of
Token ID286
Feature activation-2.01e+00

Pos logit contributions
 instructions+0.026
So+0.024
 occasions+0.023
and+0.022
 slices+0.022
Neg logit contributions
 seeker-0.041
ained-0.035
 tired-0.032
 slid-0.031
ancer-0.031
Token things
Token ID1243
Feature activation-8.62e-01

Pos logit contributions
 instructions+0.040
So+0.040
quet+0.039
 slices+0.038
keeping+0.037
Neg logit contributions
 seeker-0.043
ained-0.036
 tired-0.035
 forward-0.033
 morning-0.032
Token.
Token ID13
Feature activation-2.59e-01

Pos logit contributions
 instructions+0.016
So+0.014
 occasions+0.014
 slices+0.014
quet+0.014
Neg logit contributions
 seeker-0.021
ained-0.019
 tired-0.017
 forward-0.017
 slid-0.016
Token One
Token ID1881
Feature activation+6.15e-01

Pos logit contributions
 instructions+0.006
 occasions+0.005
So+0.005
quet+0.005
 slices+0.005
Neg logit contributions
 seeker-0.005
ained-0.004
 tired-0.004
 F-0.004
ancer-0.004
Token too
Token ID1165
Feature activation-3.82e-01

Pos logit contributions
 instructions+0.012
So+0.011
 distress+0.011
 slices+0.011
 brace+0.010
Neg logit contributions
 seeker-0.020
ained-0.017
ancer-0.016
asc-0.016
inea-0.015
Token.
Token ID13
Feature activation-3.80e-01

Pos logit contributions
 instructions+0.005
So+0.005
 slices+0.004
 distress+0.004
quet+0.004
Neg logit contributions
 seeker-0.012
ained-0.010
asc-0.009
ancer-0.009
 slid-0.009
Token Daddy
Token ID26926
Feature activation-2.81e+00

Pos logit contributions
So+0.006
 instructions+0.006
 slices+0.005
and+0.005
 truth+0.005
Neg logit contributions
 seeker-0.011
ained-0.009
asc-0.008
 forward-0.008
ancer-0.008
Token was
Token ID373
Feature activation-7.81e-01

Pos logit contributions
 instructions+0.048
So+0.047
 slices+0.042
 brace+0.041
 distress+0.041
Neg logit contributions
 seeker-0.078
ained-0.066
 forward-0.056
asc-0.056
 dusk-0.055
Token teaching
Token ID7743
Feature activation-4.96e+00

Pos logit contributions
 instructions+0.017
 distress+0.015
So+0.015
 slices+0.015
 recipe+0.015
Neg logit contributions
 seeker-0.017
ained-0.014
ancer-0.013
ahs-0.012
 slid-0.012
Token Mom
Token ID11254
Feature activation-5.33e+00

Pos logit contributions
 instructions+0.117
So+0.108
 occasions+0.106
 slices+0.104
and+0.102
Neg logit contributions
 seeker-0.101
ained-0.082
 tired-0.073
 forward-0.070
 slid-0.069
Tokenmy
Token ID1820
Feature activation-4.26e+00

Pos logit contributions
 instructions+0.121
So+0.114
 slices+0.109
 occasions+0.109
and+0.108
Neg logit contributions
 seeker-0.114
ained-0.090
 tired-0.080
 forward-0.078
 slid-0.077
Token how
Token ID703
Feature activation-1.01e+00

Pos logit contributions
 instructions+0.081
So+0.074
 occasions+0.071
 slices+0.070
and+0.069
Neg logit contributions
 seeker-0.107
ained-0.089
 tired-0.081
ancer-0.080
 slid-0.079
Token to
Token ID284
Feature activation-3.45e+00

Pos logit contributions
 instructions+0.015
So+0.013
 occasions+0.012
 slices+0.012
and+0.012
Neg logit contributions
 seeker-0.030
ained-0.026
ancer-0.023
 tired-0.023
asc-0.023
Token fly
Token ID6129
Feature activation+2.26e+00

Pos logit contributions
 instructions+0.075
 slices+0.071
So+0.070
and+0.069
 occasions+0.069
Neg logit contributions
 seeker-0.073
ained-0.059
 tired-0.055
 slid-0.052
 F-0.051
Token the
Token ID262
Feature activation+1.03e+00

Pos logit contributions
 seeker+0.053
ained+0.044
ancer+0.040
 tired+0.039
asc+0.039
Neg logit contributions
 instructions-0.046
So-0.042
 occasions-0.041
 brace-0.040
 slices-0.040
Token of
Token ID286
Feature activation+1.74e+00

Pos logit contributions
 seeker+0.014
 tired+0.013
 forward+0.013
ained+0.012
 spl+0.011
Neg logit contributions
 instructions-0.018
 occasions-0.018
So-0.018
 brace-0.018
keeping-0.017
Token his
Token ID465
Feature activation+2.87e+00

Pos logit contributions
 seeker+0.026
ained+0.018
 tired+0.017
 morning+0.014
 dusk+0.014
Neg logit contributions
So-0.051
 instructions-0.050
quet-0.048
and-0.048
 occasions-0.047
Token house
Token ID2156
Feature activation-9.92e-02

Pos logit contributions
 seeker+0.062
ained+0.052
 tired+0.047
 forward+0.045
 slid+0.044
Neg logit contributions
 instructions-0.063
So-0.058
 occasions-0.057
 slices-0.056
quet-0.055
Token to
Token ID284
Feature activation-1.90e+00

Pos logit contributions
 instructions+0.002
So+0.002
 occasions+0.002
 brace+0.002
ymes+0.002
Neg logit contributions
 seeker-0.002
ained-0.002
 tired-0.002
 forward-0.002
 slid-0.002
Token teach
Token ID4545
Feature activation-4.67e+00

Pos logit contributions
 instructions+0.043
So+0.040
 occasions+0.039
 slices+0.038
quet+0.038
Neg logit contributions
 seeker-0.040
ained-0.033
 tired-0.029
 forward-0.028
 slid-0.028
Token Gary
Token ID10936
Feature activation-5.30e+00

Pos logit contributions
quet+0.112
So+0.107
keeping+0.105
 instructions+0.105
 slices+0.104
Neg logit contributions
 seeker-0.074
 forward-0.062
ained-0.061
 tired-0.061
 morning-0.057
Token a
Token ID257
Feature activation-5.22e+00

Pos logit contributions
 instructions+0.120
So+0.112
 occasions+0.110
quet+0.108
 slices+0.108
Neg logit contributions
 seeker-0.108
ained-0.090
 tired-0.082
 forward-0.076
 slid-0.076
Token lesson
Token ID11483
Feature activation-3.79e+00

Pos logit contributions
 instructions+0.097
So+0.089
 occasions+0.087
 slices+0.086
quet+0.084
Neg logit contributions
 seeker-0.129
ained-0.110
 tired-0.102
 forward-0.097
 slid-0.096
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
 instructions+0.061
So+0.055
 occasions+0.054
 slices+0.052
quet+0.051
Neg logit contributions
 seeker-0.104
ained-0.090
 tired-0.083
 forward-0.080
 slid-0.080
Token He
Token ID679
Feature activation-1.45e+00

Pos logit contributions
 instructions+0.005
quet+0.005
 occasions+0.005
So+0.005
 slices+0.004
Neg logit contributions
 seeker-0.004
ained-0.003
 tired-0.003
 forward-0.003
 F-0.003
Token was
Token ID373
Feature activation-5.29e-02

Pos logit contributions
 instructions+0.028
So+0.027
 occasions+0.025
 slices+0.024
 distress+0.024
Neg logit contributions
 seeker-0.035
ained-0.030
 forward-0.026
 tired-0.026
ancer-0.026
Token mom
Token ID1995
Feature activation-2.42e+00

Pos logit contributions
 instructions+0.044
So+0.041
 occasions+0.040
 slices+0.039
quet+0.039
Neg logit contributions
 seeker-0.045
ained-0.038
 tired-0.034
 forward-0.033
 slid-0.032
Token many
Token ID867
Feature activation-8.00e-01

Pos logit contributions
 instructions+0.051
So+0.046
 slices+0.045
 recipe+0.044
 occasions+0.044
Neg logit contributions
 seeker-0.056
ained-0.048
asc-0.041
 slid-0.041
 tired-0.041
Token questions
Token ID2683
Feature activation-1.05e+00

Pos logit contributions
 instructions+0.018
So+0.017
quet+0.017
keeping+0.017
 slices+0.016
Neg logit contributions
 seeker-0.015
ained-0.012
 tired-0.012
 forward-0.011
 morning-0.011
Token.
Token ID13
Feature activation-7.17e-01

Pos logit contributions
 instructions+0.020
So+0.018
 occasions+0.018
 slices+0.017
quet+0.017
Neg logit contributions
 seeker-0.025
ained-0.022
 tired-0.020
 forward-0.019
ancer-0.019
Token She
Token ID1375
Feature activation-5.42e-01

Pos logit contributions
 instructions+0.014
So+0.012
 occasions+0.012
 slices+0.012
quet+0.012
Neg logit contributions
 seeker-0.017
ained-0.015
 tired-0.014
 forward-0.013
 slid-0.013
Token teaches
Token ID17324
Feature activation-5.30e+00

Pos logit contributions
 instructions+0.010
So+0.009
 occasions+0.009
 slices+0.009
 brace+0.008
Neg logit contributions
 seeker-0.014
ained-0.012
 tired-0.011
 forward-0.010
ancer-0.010
Token him
Token ID683
Feature activation-2.68e+00

Pos logit contributions
quet+0.131
keeping+0.124
and+0.123
 lair+0.123
So+0.122
Neg logit contributions
 seeker-0.071
 F-0.065
 morning-0.064
 tired-0.062
 B-0.059
Token the
Token ID262
Feature activation-2.69e+00

Pos logit contributions
 instructions+0.052
 words+0.045
So+0.044
 occasions+0.044
 slices+0.043
Neg logit contributions
 seeker-0.070
ained-0.058
ancer-0.053
asc-0.052
 forward-0.052
Token names
Token ID3891
Feature activation-1.37e+00

Pos logit contributions
 instructions+0.047
So+0.042
 occasions+0.041
 slices+0.040
 recipe+0.039
Neg logit contributions
 seeker-0.071
ained-0.062
 tired-0.057
 forward-0.055
 slid-0.054
Token of
Token ID286
Feature activation-1.56e-01

Pos logit contributions
 instructions+0.027
So+0.024
 occasions+0.024
 slices+0.023
quet+0.023
Neg logit contributions
 seeker-0.034
ained-0.028
 tired-0.026
 slid-0.025
ancer-0.025
Token the
Token ID262
Feature activation-1.12e+00

Pos logit contributions
 instructions+0.004
 occasions+0.003
So+0.003
 slices+0.003
 brace+0.003
Neg logit contributions
 seeker-0.003
ained-0.003
 tired-0.002
 slid-0.002
 forward-0.002
Token.
Token ID13
Feature activation-7.68e-01

Pos logit contributions
 instructions+0.060
So+0.055
 slices+0.052
 occasions+0.052
quet+0.051
Neg logit contributions
 seeker-0.089
ained-0.078
 tired-0.072
 forward-0.069
 morning-0.068
Token Her
Token ID2332
Feature activation-2.99e-01

Pos logit contributions
 instructions+0.014
So+0.013
 occasions+0.013
 slices+0.012
quet+0.012
Neg logit contributions
 seeker-0.019
ained-0.016
 tired-0.015
 forward-0.015
 slid-0.014
Token mom
Token ID1995
Feature activation-2.90e+00

Pos logit contributions
 instructions+0.007
So+0.006
 occasions+0.006
 distress+0.006
 slices+0.006
Neg logit contributions
 seeker-0.006
ained-0.005
 tired-0.005
 forward-0.005
 slid-0.005
Tokenmy
Token ID1820
Feature activation-2.82e+00

Pos logit contributions
 instructions+0.061
So+0.060
 occasions+0.056
 slices+0.055
 recipe+0.055
Neg logit contributions
 seeker-0.065
ained-0.053
 forward-0.046
 tired-0.045
ancer-0.045
Token was
Token ID373
Feature activation-5.03e-01

Pos logit contributions
 instructions+0.053
So+0.052
 recipe+0.047
 occasions+0.047
 slices+0.047
Neg logit contributions
 seeker-0.069
ained-0.058
 forward-0.051
asc-0.050
ancer-0.050
Token teaching
Token ID7743
Feature activation-5.29e+00

Pos logit contributions
 instructions+0.012
So+0.011
 recipe+0.011
 slices+0.011
 distress+0.010
Neg logit contributions
 seeker-0.010
ained-0.008
 slid-0.007
ancer-0.007
ahs-0.007
Token her
Token ID607
Feature activation-3.86e+00

Pos logit contributions
quet+0.128
 instructions+0.128
 slices+0.125
So+0.125
keeping+0.121
Neg logit contributions
 seeker-0.081
ained-0.073
 tired-0.068
 F-0.067
 morning-0.067
Token a
Token ID257
Feature activation-5.10e+00

Pos logit contributions
 instructions+0.082
So+0.076
 occasions+0.074
 slices+0.074
quet+0.074
Neg logit contributions
 seeker-0.083
ained-0.071
 tired-0.064
 forward-0.060
 slid-0.059
Token game
Token ID983
Feature activation-3.60e+00

Pos logit contributions
 instructions+0.103
So+0.095
 occasions+0.093
 slices+0.093
keeping+0.092
Neg logit contributions
 seeker-0.116
ained-0.097
 tired-0.090
 forward-0.085
 slid-0.084
Token that
Token ID326
Feature activation-5.25e-01

Pos logit contributions
 instructions+0.069
 distress+0.064
So+0.064
keeping+0.062
 slices+0.061
Neg logit contributions
 seeker-0.083
ained-0.073
 tired-0.065
 forward-0.064
 slid-0.063
Token involved
Token ID2950
Feature activation-1.74e+00

Pos logit contributions
 instructions+0.012
So+0.011
 occasions+0.011
 slices+0.010
 brace+0.010
Neg logit contributions
 seeker-0.011
ained-0.009
 tired-0.008
 forward-0.008
ancer-0.008
Token hot
Token ID3024
Feature activation-3.34e-01

Pos logit contributions
 truth+0.003
So+0.003
 instructions+0.003
 rules+0.003
 words+0.003
Neg logit contributions
 seeker-0.007
ained-0.006
ancer-0.005
inea-0.005
 forward-0.005
Token today
Token ID1909
Feature activation-1.47e+00

Pos logit contributions
 instructions+0.006
 occasions+0.006
So+0.005
 slices+0.005
 recipe+0.005
Neg logit contributions
 seeker-0.008
ained-0.007
 tired-0.006
 slid-0.006
ancer-0.006
Token!
Token ID0
Feature activation-5.94e-01

Pos logit contributions
 instructions+0.024
So+0.022
 occasions+0.021
 slices+0.021
 recipe+0.020
Neg logit contributions
 seeker-0.040
ained-0.035
 tired-0.031
 slid-0.031
asc-0.031
Token I
Token ID314
Feature activation-1.23e-01

Pos logit contributions
 instructions+0.009
So+0.008
and+0.007
 occasions+0.007
 words+0.007
Neg logit contributions
 seeker-0.016
ained-0.015
 tired-0.013
 slid-0.013
ahs-0.013
Token wish
Token ID4601
Feature activation-6.72e-01

Pos logit contributions
 instructions+0.002
So+0.002
 distress+0.002
 occasions+0.002
 truth+0.002
Neg logit contributions
 seeker-0.003
ained-0.002
ancer-0.002
asc-0.002
 forward-0.002
Token a
Token ID257
Feature activation-5.28e+00

Pos logit contributions
 instructions+0.012
 words+0.010
 occasions+0.009
 recipe+0.009
 rules+0.009
Neg logit contributions
 seeker-0.018
ained-0.016
asc-0.014
 slid-0.014
 forward-0.014
Token cool
Token ID3608
Feature activation-1.17e+00

Pos logit contributions
 instructions+0.108
So+0.101
 recipe+0.093
 truth+0.092
 occasions+0.092
Neg logit contributions
 seeker-0.123
ained-0.110
asc-0.093
ancer-0.093
 slid-0.092
Token breeze
Token ID28633
Feature activation+1.34e+00

Pos logit contributions
 instructions+0.026
 recipe+0.024
 occasions+0.024
 words+0.023
 truth+0.023
Neg logit contributions
 seeker-0.024
ained-0.022
 exhausted-0.019
ancer-0.019
 tired-0.019
Token would
Token ID561
Feature activation-1.79e+00

Pos logit contributions
 seeker+0.034
ained+0.028
 tired+0.026
 dusk+0.025
 slid+0.025
Neg logit contributions
 instructions-0.024
 recipe-0.021
So-0.021
 occasions-0.021
 slices-0.021
Token come
Token ID1282
Feature activation+6.73e-01

Pos logit contributions
 instructions+0.038
So+0.037
 slices+0.036
 occasions+0.035
ymes+0.035
Neg logit contributions
 seeker-0.038
ained-0.031
 tired-0.029
 forward-0.027
 slid-0.027
Token."
Token ID526
Feature activation-1.87e+00

Pos logit contributions
 seeker+0.015
ained+0.012
 slid+0.012
asc+0.011
 kidding+0.011
Neg logit contributions
 instructions-0.014
So-0.013
and-0.012
 slices-0.012
quet-0.012
Token He
Token ID679
Feature activation-4.77e-01

Pos logit contributions
 instructions+0.002
So+0.002
 slices+0.002
and+0.002
 occasions+0.002
Neg logit contributions
 seeker-0.003
ained-0.002
asc-0.002
 tired-0.002
 slid-0.002
Token even
Token ID772
Feature activation+1.55e+00

Pos logit contributions
 instructions+0.011
So+0.010
 occasions+0.010
 slices+0.009
 brace+0.009
Neg logit contributions
 seeker-0.010
ained-0.008
 tired-0.007
ancer-0.007
 forward-0.007
Token taught
Token ID7817
Feature activation-4.44e+00

Pos logit contributions
 seeker+0.035
ained+0.029
asc+0.025
 tired+0.025
ancer+0.024
Neg logit contributions
 instructions-0.033
So-0.031
 occasions-0.030
 slices-0.030
 brace-0.030
Token his
Token ID465
Feature activation-3.02e+00

Pos logit contributions
 instructions+0.107
So+0.106
quet+0.105
 occasions+0.102
 slices+0.101
Neg logit contributions
 seeker-0.074
ained-0.061
 tired-0.056
 F-0.055
 forward-0.052
Token little
Token ID1310
Feature activation-3.58e+00

Pos logit contributions
 instructions+0.066
quet+0.064
So+0.063
 occasions+0.063
keeping+0.062
Neg logit contributions
 seeker-0.060
 tired-0.048
ained-0.047
 forward-0.043
 morning-0.043
Token sister
Token ID6621
Feature activation-5.28e+00

Pos logit contributions
 instructions+0.073
So+0.068
 occasions+0.066
 slices+0.065
quet+0.064
Neg logit contributions
 seeker-0.082
ained-0.069
 tired-0.063
 forward-0.060
 slid-0.060
Token how
Token ID703
Feature activation-2.06e-01

Pos logit contributions
 instructions+0.099
So+0.090
 occasions+0.087
 slices+0.086
and+0.084
Neg logit contributions
 seeker-0.133
ained-0.113
 tired-0.102
ancer-0.101
asc-0.099
Token to
Token ID284
Feature activation-2.87e+00

Pos logit contributions
 instructions+0.003
So+0.003
 occasions+0.003
 slices+0.003
quet+0.003
Neg logit contributions
 seeker-0.006
ained-0.005
 tired-0.005
 forward-0.004
ancer-0.004
Token twist
Token ID14528
Feature activation+1.47e+00

Pos logit contributions
 instructions+0.058
So+0.054
 occasions+0.052
 slices+0.051
quet+0.050
Neg logit contributions
 seeker-0.067
ained-0.057
 tired-0.051
 forward-0.049
 slid-0.049
Token the
Token ID262
Feature activation+9.38e-01

Pos logit contributions
 seeker+0.032
ained+0.026
 tired+0.023
asc+0.023
ancer+0.022
Neg logit contributions
 instructions-0.033
So-0.030
 slices-0.029
 occasions-0.029
 brace-0.029
Token clock
Token ID8801
Feature activation+4.54e-01

Pos logit contributions
 seeker+0.020
ained+0.017
 tired+0.016
 forward+0.016
 morning+0.016
Neg logit contributions
keeping-0.019
 occasions-0.018
So-0.018
quet-0.018
 instructions-0.018
Token remembered
Token ID12086
Feature activation-2.71e+00

Pos logit contributions
 instructions+0.012
So+0.011
 occasions+0.011
 slices+0.011
 distress+0.011
Neg logit contributions
 seeker-0.011
ained-0.009
 tired-0.008
 forward-0.008
ancer-0.007
Token a
Token ID257
Feature activation-4.00e+00

Pos logit contributions
 instructions+0.059
So+0.055
 occasions+0.053
 slices+0.052
 distress+0.051
Neg logit contributions
 seeker-0.060
ained-0.050
 tired-0.044
 forward-0.043
ancer-0.043
Token lesson
Token ID11483
Feature activation-2.90e+00

Pos logit contributions
 instructions+0.086
 occasions+0.081
So+0.080
 slices+0.080
keeping+0.079
Neg logit contributions
 seeker-0.086
ained-0.068
 tired-0.067
 forward-0.061
 slid-0.060
Token her
Token ID607
Feature activation-1.29e+00

Pos logit contributions
 instructions+0.053
So+0.048
 occasions+0.047
and+0.045
 slices+0.045
Neg logit contributions
 seeker-0.074
ained-0.063
 tired-0.058
ancer-0.056
asc-0.056
Token mother
Token ID2802
Feature activation-2.90e+00

Pos logit contributions
 instructions+0.025
So+0.024
 slices+0.023
quet+0.023
 occasions+0.023
Neg logit contributions
 seeker-0.031
 tired-0.024
ained-0.024
 morning-0.023
 F-0.022
Token taught
Token ID7817
Feature activation-5.27e+00

Pos logit contributions
 instructions+0.070
 occasions+0.064
So+0.064
 slices+0.064
quet+0.063
Neg logit contributions
 seeker-0.055
ained-0.045
 tired-0.041
 slid-0.039
 forward-0.039
Token her
Token ID607
Feature activation-3.96e+00

Pos logit contributions
 instructions+0.121
quet+0.117
 slices+0.117
So+0.116
 recipe+0.112
Neg logit contributions
 seeker-0.097
ained-0.085
 tired-0.074
 F-0.074
 forward-0.072
Token -
Token ID532
Feature activation-2.56e+00

Pos logit contributions
 instructions+0.076
So+0.071
 occasions+0.070
 slices+0.069
quet+0.068
Neg logit contributions
 seeker-0.092
ained-0.080
 tired-0.073
 forward-0.070
 F-0.068
Token that
Token ID326
Feature activation-8.47e-01

Pos logit contributions
 instructions+0.050
So+0.045
 slices+0.042
 occasions+0.042
 brace+0.042
Neg logit contributions
 seeker-0.064
ained-0.054
asc-0.049
ancer-0.048
 tired-0.047
Token things
Token ID1243
Feature activation+1.30e-02

Pos logit contributions
 instructions+0.019
So+0.017
 occasions+0.017
 slices+0.017
 brace+0.016
Neg logit contributions
 seeker-0.018
ained-0.015
 tired-0.014
 forward-0.013
asc-0.013
Token that
Token ID326
Feature activation+8.95e-01

Pos logit contributions
 seeker+0.000
ained+0.000
 tired+0.000
 forward+0.000
asc+0.000
Neg logit contributions
 instructions-0.000
So-0.000
 occasions-0.000
 slices-0.000
 brace-0.000
Token his
Token ID465
Feature activation-5.14e-01

Pos logit contributions
 instructions+0.026
 slices+0.023
So+0.023
 words+0.023
 distress+0.023
Neg logit contributions
 seeker-0.026
ained-0.023
asc-0.020
ancer-0.020
 slid-0.019
Token gr
Token ID1036
Feature activation-9.47e-01

Pos logit contributions
 instructions+0.011
So+0.010
quet+0.010
 occasions+0.010
and+0.010
Neg logit contributions
 seeker-0.012
 tired-0.009
ained-0.009
 forward-0.008
 morning-0.008
Tokenanny
Token ID7737
Feature activation-2.40e+00

Pos logit contributions
 instructions+0.022
So+0.020
 brace+0.020
 slices+0.020
 occasions+0.020
Neg logit contributions
 seeker-0.018
ained-0.018
 tired-0.014
asc-0.014
ancer-0.014
Token had
Token ID550
Feature activation+1.42e+00

Pos logit contributions
 instructions+0.057
So+0.053
 occasions+0.052
 slices+0.051
quet+0.051
Neg logit contributions
 seeker-0.048
ained-0.039
 tired-0.034
 forward-0.032
ancer-0.032
Token always
Token ID1464
Feature activation+1.58e-01

Pos logit contributions
 seeker+0.031
ained+0.025
 tired+0.023
 forward+0.022
 slid+0.022
Neg logit contributions
 instructions-0.031
So-0.029
 occasions-0.028
 slices-0.028
quet-0.027
Token taught
Token ID7817
Feature activation-5.26e+00

Pos logit contributions
 seeker+0.003
ained+0.003
 tired+0.003
 forward+0.002
 slid+0.002
Neg logit contributions
 instructions-0.003
So-0.003
 occasions-0.003
 slices-0.003
quet-0.003
Token him
Token ID683
Feature activation-3.31e+00

Pos logit contributions
 instructions+0.114
quet+0.112
So+0.112
 slices+0.109
 occasions+0.107
Neg logit contributions
 seeker-0.101
ained-0.086
 tired-0.079
 F-0.078
 forward-0.078
Token the
Token ID262
Feature activation-2.95e+00

Pos logit contributions
 instructions+0.068
So+0.060
 occasions+0.058
 slices+0.058
and+0.057
Neg logit contributions
 seeker-0.080
ained-0.067
ancer-0.060
 tired-0.059
asc-0.059
Token importance
Token ID6817
Feature activation-2.96e+00

Pos logit contributions
 instructions+0.041
So+0.039
quet+0.037
 slices+0.037
 occasions+0.036
Neg logit contributions
 seeker-0.086
ained-0.071
 tired-0.069
 F-0.066
 forward-0.066
Token of
Token ID286
Feature activation-1.09e+00

Pos logit contributions
 instructions+0.057
So+0.052
 occasions+0.051
 slices+0.050
quet+0.049
Neg logit contributions
 seeker-0.071
ained-0.061
 tired-0.056
 forward-0.054
 slid-0.053
Token being
Token ID852
Feature activation+1.65e+00

Pos logit contributions
 instructions+0.024
So+0.022
 occasions+0.022
 slices+0.021
 distress+0.021
Neg logit contributions
 seeker-0.024
ained-0.020
 tired-0.018
 forward-0.017
ancer-0.017
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
 instructions+0.076
So+0.073
 occasions+0.072
ymes+0.069
quet+0.069
Neg logit contributions
 seeker-0.066
ained-0.054
 tired-0.052
 forward-0.049
 slid-0.047
Token asked
Token ID1965
Feature activation-2.58e+00

Pos logit contributions
 instructions+0.027
So+0.025
 occasions+0.024
 slices+0.024
quet+0.024
Neg logit contributions
 seeker-0.029
ained-0.024
 tired-0.022
 slid-0.021
 forward-0.021
Token if
Token ID611
Feature activation-2.46e+00

Pos logit contributions
 instructions+0.057
So+0.053
 occasions+0.052
 slices+0.051
quet+0.051
Neg logit contributions
 seeker-0.055
ained-0.046
 tired-0.041
 forward-0.039
 slid-0.039
Token he
Token ID339
Feature activation+1.14e+00

Pos logit contributions
 instructions+0.051
So+0.046
 occasions+0.044
 slices+0.044
 brace+0.043
Neg logit contributions
 seeker-0.056
ained-0.052
asc-0.045
 forward-0.041
 tired-0.041
Token could
Token ID714
Feature activation-1.89e+00

Pos logit contributions
 seeker+0.022
ained+0.017
 tired+0.017
 slid+0.016
 F+0.015
Neg logit contributions
 instructions-0.028
 occasions-0.026
So-0.025
quet-0.025
 slices-0.025
Token teach
Token ID4545
Feature activation-5.25e+00

Pos logit contributions
 instructions+0.036
So+0.033
 occasions+0.032
 slices+0.031
quet+0.031
Neg logit contributions
 seeker-0.046
ained-0.040
 tired-0.036
 forward-0.035
ancer-0.035
Token him
Token ID683
Feature activation-3.02e+00

Pos logit contributions
quet+0.122
So+0.122
 instructions+0.122
 recipe+0.117
ymes+0.116
Neg logit contributions
 seeker-0.084
 F-0.070
ained-0.070
 tired-0.068
 Jump-0.064
Token how
Token ID703
Feature activation-1.14e+00

Pos logit contributions
 instructions+0.055
So+0.049
 slices+0.047
 occasions+0.047
and+0.045
Neg logit contributions
 seeker-0.079
ained-0.067
 tired-0.061
ancer-0.060
 forward-0.059
Token to
Token ID284
Feature activation-3.29e+00

Pos logit contributions
 instructions+0.019
So+0.017
 occasions+0.017
 slices+0.016
quet+0.016
Neg logit contributions
 seeker-0.030
ained-0.026
 tired-0.025
 forward-0.024
 slid-0.023
Token rake
Token ID47885
Feature activation+9.16e-01

Pos logit contributions
 instructions+0.074
and+0.071
keeping+0.070
 slices+0.070
So+0.069
Neg logit contributions
 seeker-0.065
ained-0.052
 tired-0.049
 slid-0.047
 spl-0.046
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
 seeker+0.018
ained+0.015
 tired+0.013
asc+0.013
ancer+0.012
Neg logit contributions
 instructions-0.022
So-0.020
 occasions-0.020
 slices-0.020
 brace-0.019
Token.
Token ID13
Feature activation-6.21e-03

Pos logit contributions
 instructions+0.026
So+0.024
 occasions+0.023
 slices+0.022
quet+0.022
Neg logit contributions
 seeker-0.033
ained-0.028
 tired-0.026
 forward-0.024
 slid-0.024
Token Her
Token ID2332
Feature activation-2.39e-01

Pos logit contributions
 instructions+0.000
So+0.000
 occasions+0.000
 slices+0.000
and+0.000
Neg logit contributions
 seeker-0.000
ained-0.000
 tired-0.000
asc-0.000
 forward-0.000
Token mum
Token ID25682
Feature activation-2.26e+00

Pos logit contributions
 instructions+0.005
So+0.005
 occasions+0.004
 slices+0.004
 distress+0.004
Neg logit contributions
 seeker-0.006
ained-0.005
asc-0.004
 forward-0.004
ancer-0.004
Token and
Token ID290
Feature activation+3.32e-01

Pos logit contributions
 instructions+0.043
So+0.041
 slices+0.038
 occasions+0.038
 distress+0.037
Neg logit contributions
 seeker-0.056
ained-0.048
 forward-0.041
ancer-0.041
asc-0.041
Token dad
Token ID9955
Feature activation-1.61e+00

Pos logit contributions
 seeker+0.008
ained+0.007
asc+0.006
 forward+0.006
ancer+0.006
Neg logit contributions
 instructions-0.006
So-0.006
 slices-0.005
 distress-0.005
 occasions-0.005
Token taught
Token ID7817
Feature activation-5.25e+00

Pos logit contributions
 instructions+0.031
So+0.031
 slices+0.027
 distress+0.027
 occasions+0.026
Neg logit contributions
 seeker-0.040
ained-0.035
asc-0.030
ancer-0.029
 dusk-0.028
Token her
Token ID607
Feature activation-3.51e+00

Pos logit contributions
 instructions+0.120
So+0.116
quet+0.114
 slices+0.113
 occasions+0.111
Neg logit contributions
 seeker-0.095
ained-0.081
 tired-0.074
 F-0.072
 forward-0.071
Token all
Token ID477
Feature activation-2.01e+00

Pos logit contributions
 instructions+0.065
So+0.059
 occasions+0.057
 slices+0.056
 recipe+0.054
Neg logit contributions
 seeker-0.091
ained-0.076
 tired-0.069
ancer-0.068
 forward-0.068
Token the
Token ID262
Feature activation-3.40e+00

Pos logit contributions
 instructions+0.039
So+0.036
 occasions+0.035
 words+0.034
 recipe+0.033
Neg logit contributions
 seeker-0.054
ained-0.041
ancer-0.040
 tired-0.039
 forward-0.038
Token answers
Token ID7429
Feature activation-4.85e-01

Pos logit contributions
 instructions+0.067
So+0.060
 occasions+0.059
 slices+0.056
 recipe+0.056
Neg logit contributions
 seeker-0.084
ained-0.073
 tired-0.065
 forward-0.065
ancer-0.064
Token.
Token ID13
Feature activation-2.21e-01

Pos logit contributions
 instructions+0.009
So+0.008
 occasions+0.008
 slices+0.008
quet+0.008
Neg logit contributions
 seeker-0.013
ained-0.011
 tired-0.009
asc-0.009
ancer-0.009
Tokenmy
Token ID1820
Feature activation-1.14e+00

Pos logit contributions
 instructions+0.029
quet+0.027
 occasions+0.027
 slices+0.027
 brace+0.026
Neg logit contributions
 seeker-0.016
ained-0.012
 tired-0.010
 slid-0.010
 morning-0.009
Token's
Token ID338
Feature activation-1.31e+00

Pos logit contributions
 instructions+0.024
So+0.022
 occasions+0.021
 slices+0.021
 distress+0.021
Neg logit contributions
 seeker-0.026
ained-0.022
 tired-0.019
ancer-0.019
 forward-0.019
Token mom
Token ID1995
Feature activation-2.35e+00

Pos logit contributions
 instructions+0.030
So+0.028
 occasions+0.028
quet+0.027
 slices+0.027
Neg logit contributions
 seeker-0.027
ained-0.022
 tired-0.020
 slid-0.019
 forward-0.019
Token was
Token ID373
Feature activation-4.48e-01

Pos logit contributions
 instructions+0.050
So+0.046
 occasions+0.045
 slices+0.044
quet+0.043
Neg logit contributions
 seeker-0.053
ained-0.044
 tired-0.040
 forward-0.038
 slid-0.038
Token very
Token ID845
Feature activation-9.70e-01

Pos logit contributions
 instructions+0.010
So+0.009
 slices+0.009
 recipe+0.009
 distress+0.009
Neg logit contributions
 seeker-0.010
ained-0.008
ancer-0.007
asc-0.007
 slid-0.007
Token strict
Token ID7646
Feature activation-5.24e+00

Pos logit contributions
 instructions+0.020
So+0.019
 slices+0.017
 recipe+0.017
 distress+0.017
Neg logit contributions
 seeker-0.022
ained-0.019
ancer-0.016
 dusk-0.016
 F-0.016
Token and
Token ID290
Feature activation-3.76e-01

Pos logit contributions
 instructions+0.121
quet+0.114
 slices+0.113
So+0.112
 brace+0.112
Neg logit contributions
 seeker-0.100
ained-0.085
 tired-0.079
 forward-0.076
 morning-0.073
Token said
Token ID531
Feature activation-4.73e-01

Pos logit contributions
 instructions+0.006
So+0.006
 recipe+0.005
 brace+0.005
 words+0.005
Neg logit contributions
 seeker-0.011
ained-0.009
asc-0.008
 dusk-0.008
Whe-0.008
Token he
Token ID339
Feature activation-9.17e-01

Pos logit contributions
 instructions+0.010
 words+0.009
So+0.009
 slices+0.009
 brace+0.008
Neg logit contributions
 seeker-0.012
ained-0.010
asc-0.009
ancer-0.008
 dusk-0.008
Token couldn
Token ID3521
Feature activation-1.31e+00

Pos logit contributions
 instructions+0.019
So+0.018
 distress+0.016
 occasions+0.016
 words+0.016
Neg logit contributions
 seeker-0.021
ained-0.018
 forward-0.016
 dusk-0.016
asc-0.016
Token't
Token ID470
Feature activation-1.74e+00

Pos logit contributions
 instructions+0.025
So+0.023
 occasions+0.023
 slices+0.022
quet+0.022
Neg logit contributions
 seeker-0.031
ained-0.027
 tired-0.025
 forward-0.023
 slid-0.023
Token his
Token ID465
Feature activation+2.87e+00

Pos logit contributions
 seeker+0.026
ained+0.018
 tired+0.017
 morning+0.014
 dusk+0.014
Neg logit contributions
So-0.051
 instructions-0.050
quet-0.048
and-0.048
 occasions-0.047
Token house
Token ID2156
Feature activation-9.92e-02

Pos logit contributions
 seeker+0.062
ained+0.052
 tired+0.047
 forward+0.045
 slid+0.044
Neg logit contributions
 instructions-0.063
So-0.058
 occasions-0.057
 slices-0.056
quet-0.055
Token to
Token ID284
Feature activation-1.90e+00

Pos logit contributions
 instructions+0.002
So+0.002
 occasions+0.002
 brace+0.002
ymes+0.002
Neg logit contributions
 seeker-0.002
ained-0.002
 tired-0.002
 forward-0.002
 slid-0.002
Token teach
Token ID4545
Feature activation-4.67e+00

Pos logit contributions
 instructions+0.043
So+0.040
 occasions+0.039
 slices+0.038
quet+0.038
Neg logit contributions
 seeker-0.040
ained-0.033
 tired-0.029
 forward-0.028
 slid-0.028
Token Gary
Token ID10936
Feature activation-5.30e+00

Pos logit contributions
quet+0.112
So+0.107
keeping+0.105
 instructions+0.105
 slices+0.104
Neg logit contributions
 seeker-0.074
 forward-0.062
ained-0.061
 tired-0.061
 morning-0.057
Token a
Token ID257
Feature activation-5.22e+00

Pos logit contributions
 instructions+0.120
So+0.112
 occasions+0.110
quet+0.108
 slices+0.108
Neg logit contributions
 seeker-0.108
ained-0.090
 tired-0.082
 forward-0.076
 slid-0.076
Token lesson
Token ID11483
Feature activation-3.79e+00

Pos logit contributions
 instructions+0.097
So+0.089
 occasions+0.087
 slices+0.086
quet+0.084
Neg logit contributions
 seeker-0.129
ained-0.110
 tired-0.102
 forward-0.097
 slid-0.096
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
 instructions+0.061
So+0.055
 occasions+0.054
 slices+0.052
quet+0.051
Neg logit contributions
 seeker-0.104
ained-0.090
 tired-0.083
 forward-0.080
 slid-0.080
Token He
Token ID679
Feature activation-1.45e+00

Pos logit contributions
 instructions+0.005
quet+0.005
 occasions+0.005
So+0.005
 slices+0.004
Neg logit contributions
 seeker-0.004
ained-0.003
 tired-0.003
 forward-0.003
 F-0.003
Token was
Token ID373
Feature activation-5.29e-02

Pos logit contributions
 instructions+0.028
So+0.027
 occasions+0.025
 slices+0.024
 distress+0.024
Neg logit contributions
 seeker-0.035
ained-0.030
 forward-0.026
 tired-0.026
ancer-0.026
Token harsh
Token ID11859
Feature activation-2.32e+00

Pos logit contributions
 instructions+0.001
 distress+0.001
 recipe+0.001
 truth+0.001
So+0.001
Neg logit contributions
 seeker-0.001
ained-0.001
ancer-0.001
 slid-0.001
Whe-0.001
Tokenâ
Token ID22940
Feature activation+1.53e+00

Pos logit contributions
 instructions+0.021
So+0.020
 occasions+0.019
 slices+0.019
 brace+0.018
Neg logit contributions
 seeker-0.025
ained-0.021
ancer-0.019
asc-0.018
 tired-0.018
Token€
Token ID26391
Feature activation+2.42e+00

Pos logit contributions
 seeker+0.037
ained+0.031
 tired+0.030
ancer+0.028
 forward+0.027
Neg logit contributions
 instructions-0.029
So-0.028
 occasions-0.027
quet-0.026
keeping-0.026
Token™
Token ID8151
Feature activation+4.26e+00

Pos logit contributions
 seeker+0.056
ained+0.048
 tired+0.043
 forward+0.042
ancer+0.041
Neg logit contributions
 instructions-0.048
So-0.046
 slices-0.043
 occasions-0.043
quet-0.042
Tokens
Token ID82
Feature activation-5.03e-01

Pos logit contributions
 seeker+0.101
ained+0.081
 tired+0.074
asc+0.072
ancer+0.071
Neg logit contributions
So-0.084
 instructions-0.082
and-0.078
 slices-0.076
quet-0.073
Token story
Token ID1621
Feature activation-5.14e-01

Pos logit contributions
 instructions+0.011
So+0.010
 occasions+0.010
 slices+0.009
 distress+0.009
Neg logit contributions
 seeker-0.011
ained-0.010
ancer-0.008
asc-0.008
 tired-0.008
Token teaches
Token ID17324
Feature activation-5.20e+00

Pos logit contributions
 instructions+0.011
So+0.010
 occasions+0.009
quet+0.009
 slices+0.009
Neg logit contributions
 seeker-0.012
ained-0.010
 tired-0.009
ancer-0.009
asc-0.009
Token us
Token ID514
Feature activation-2.52e+00

Pos logit contributions
 instructions+0.104
 slices+0.098
quet+0.096
So+0.096
 occasions+0.095
Neg logit contributions
 seeker-0.113
ained-0.099
 tired-0.090
 F-0.088
 forward-0.086
Token an
Token ID281
Feature activation-3.71e+00

Pos logit contributions
 instructions+0.049
So+0.042
 occasions+0.040
 words+0.040
and+0.038
Neg logit contributions
 seeker-0.067
ained-0.056
ancer-0.051
asc-0.050
 slid-0.050
Token important
Token ID1593
Feature activation-2.36e+00

Pos logit contributions
 instructions+0.070
So+0.064
 occasions+0.062
 slices+0.060
quet+0.060
Neg logit contributions
 seeker-0.092
ained-0.079
 tired-0.072
 forward-0.069
 slid-0.069
Token lesson
Token ID11483
Feature activation-3.38e+00

Pos logit contributions
 instructions+0.039
So+0.035
 occasions+0.034
 slices+0.034
quet+0.033
Neg logit contributions
 seeker-0.064
ained-0.055
 tired-0.051
 forward-0.049
 slid-0.049
Token -
Token ID532
Feature activation-1.73e+00

Pos logit contributions
 instructions+0.055
 occasions+0.050
So+0.048
keeping+0.046
 words+0.046
Neg logit contributions
 seeker-0.095
ained-0.080
 tired-0.077
 slid-0.074
 exhausted-0.073
Token not
Token ID407
Feature activation-5.65e-01

Pos logit contributions
 instructions+0.003
 brace+0.002
 truth+0.002
 distress+0.002
 words+0.002
Neg logit contributions
 seeker-0.003
ained-0.003
ahs-0.003
ancer-0.003
 Su-0.003
Token a
Token ID257
Feature activation-2.06e+00

Pos logit contributions
 instructions+0.012
So+0.010
 distress+0.010
 brace+0.010
 slices+0.010
Neg logit contributions
 seeker-0.013
ained-0.011
ancer-0.010
 forward-0.009
asc-0.009
Token dancer
Token ID38619
Feature activation-9.96e-02

Pos logit contributions
 instructions+0.050
So+0.047
 occasions+0.047
 slices+0.046
quet+0.046
Neg logit contributions
 seeker-0.040
ained-0.031
 tired-0.029
 forward-0.027
 slid-0.027
Token.
Token ID13
Feature activation+9.29e-01

Pos logit contributions
 instructions+0.002
So+0.002
 occasions+0.002
 slices+0.002
quet+0.002
Neg logit contributions
 seeker-0.002
ained-0.002
 tired-0.002
 forward-0.002
ancer-0.002
Token<|endoftext|>
Token ID50256
Feature activation+2.14e+00

Pos logit contributions
 seeker+0.021
ained+0.017
 tired+0.016
 F+0.015
 forward+0.015
Neg logit contributions
 instructions-0.020
 occasions-0.019
quet-0.019
 slices-0.019
 recipe-0.018
TokenS
Token ID50
Feature activation+5.58e+00

Pos logit contributions
 seeker+0.038
ained+0.031
 tired+0.027
 F+0.025
 forward+0.025
Neg logit contributions
 instructions-0.056
 occasions-0.052
So-0.051
quet-0.051
 brace-0.051
Tokenara
Token ID3301
Feature activation+2.91e+00

Pos logit contributions
 seeker+0.126
ained+0.106
 tired+0.098
 forward+0.092
 slid+0.092
Neg logit contributions
 instructions-0.117
So-0.106
 occasions-0.106
quet-0.102
 slices-0.101
Token likes
Token ID7832
Feature activation-8.79e-01

Pos logit contributions
 seeker+0.072
ained+0.063
 forward+0.057
ancer+0.057
 tired+0.056
Neg logit contributions
 instructions-0.050
So-0.048
keeping-0.046
 distress-0.046
 slices-0.045
Token to
Token ID284
Feature activation-1.25e+00

Pos logit contributions
 instructions+0.016
So+0.015
 occasions+0.014
quet+0.014
 slices+0.014
Neg logit contributions
 seeker-0.022
ained-0.019
 tired-0.018
 slid-0.017
 F-0.017
Token design
Token ID1486
Feature activation+1.69e-01

Pos logit contributions
 instructions+0.030
So+0.029
 slices+0.028
quet+0.028
ymes+0.028
Neg logit contributions
 seeker-0.024
ained-0.018
 tired-0.017
 slid-0.017
 morning-0.016
Token things
Token ID1243
Feature activation-7.93e-01

Pos logit contributions
 seeker+0.004
ained+0.003
 tired+0.003
 forward+0.003
 slid+0.003
Neg logit contributions
 instructions-0.004
So-0.003
 occasions-0.003
 slices-0.003
quet-0.003
Token.
Token ID13
Feature activation+1.57e-01

Pos logit contributions
 seeker+0.017
ained+0.015
 forward+0.014
 tired+0.014
 morning+0.013
Neg logit contributions
 instructions-0.014
So-0.013
quet-0.013
 occasions-0.013
 brace-0.013
Token<|endoftext|>
Token ID50256
Feature activation+1.53e+00

Pos logit contributions
 seeker+0.003
ained+0.003
 tired+0.003
 F+0.003
 Jump+0.003
Neg logit contributions
 instructions-0.003
 brace-0.003
 occasions-0.003
 slices-0.003
ymes-0.003
TokenOnce
Token ID7454
Feature activation+7.28e+00

Pos logit contributions
 seeker+0.036
ained+0.031
 tired+0.028
 forward+0.026
 F+0.026
Neg logit contributions
 instructions-0.031
 occasions-0.028
So-0.028
quet-0.027
 brace-0.027
Token upon
Token ID2402
Feature activation+6.09e-01

Pos logit contributions
 seeker+0.085
ained+0.056
asc+0.040
 tired+0.039
 slid+0.038
Neg logit contributions
 instructions-0.228
So-0.220
 slices-0.213
and-0.212
 occasions-0.211
Token a
Token ID257
Feature activation+4.16e+00

Pos logit contributions
 seeker+0.008
ained+0.005
 tired+0.005
 morning+0.004
 F+0.004
Neg logit contributions
 instructions-0.019
 occasions-0.018
quet-0.018
 recipe-0.018
keeping-0.017
Token time
Token ID640
Feature activation+5.91e+00

Pos logit contributions
 seeker+0.050
 tired+0.033
ained+0.032
 morning+0.030
 forward+0.026
Neg logit contributions
 instructions-0.140
keeping-0.135
quet-0.129
 occasions-0.125
 brace-0.123
Token,
Token ID11
Feature activation+5.13e+00

Pos logit contributions
 seeker+0.122
ained+0.105
asc+0.088
 tired+0.086
ancer+0.085
Neg logit contributions
 instructions-0.126
So-0.125
and-0.121
 slices-0.117
 occasions-0.114
Token there
Token ID612
Feature activation+2.60e+00

Pos logit contributions
 seeker+0.105
ained+0.085
ahs+0.077
ancer+0.076
 surfing+0.075
Neg logit contributions
So-0.109
 instructions-0.099
and-0.099
 distress-0.098
 slices-0.098
Token was
Token ID373
Feature activation+3.54e+00

Pos logit contributions
 seeker+0.057
ained+0.048
 tired+0.043
 slid+0.041
 forward+0.040
Neg logit contributions
 instructions-0.057
So-0.052
 occasions-0.051
quet-0.050
 slices-0.050
Token a
Token ID257
Feature activation-1.59e+00

Pos logit contributions
 seeker+0.071
ained+0.058
 tired+0.052
ancer+0.050
 forward+0.050
Neg logit contributions
 instructions-0.081
So-0.078
 slices-0.075
 occasions-0.074
 brace-0.072
Token big
Token ID1263
Feature activation-5.96e-01

Pos logit contributions
 instructions+0.023
So+0.023
 recipe+0.021
 slices+0.021
 brace+0.020
Neg logit contributions
ained-0.042
 seeker-0.042
asc-0.035
 forward-0.035
ancer-0.034
Token more
Token ID517
Feature activation+8.86e-01

Pos logit contributions
 seeker+0.035
ained+0.030
 tired+0.029
 forward+0.027
 slid+0.027
Neg logit contributions
 instructions-0.030
So-0.028
 occasions-0.027
quet-0.027
 slices-0.027
Token beautiful
Token ID4950
Feature activation+6.16e-02

Pos logit contributions
 seeker+0.021
ained+0.018
 tired+0.017
 forward+0.016
 slid+0.016
Neg logit contributions
 instructions-0.017
So-0.016
 occasions-0.015
quet-0.015
 slices-0.015
Token pictures
Token ID5986
Feature activation+1.88e-02

Pos logit contributions
 seeker+0.001
ained+0.001
 tired+0.001
 forward+0.001
 morning+0.001
Neg logit contributions
 instructions-0.001
So-0.001
quet-0.001
 distress-0.001
 slices-0.001
Token.
Token ID13
Feature activation+7.10e-01

Pos logit contributions
 seeker+0.000
ained+0.000
 tired+0.000
 forward+0.000
 slid+0.000
Neg logit contributions
 instructions-0.000
So-0.000
 occasions-0.000
 slices-0.000
 distress-0.000
Token<|endoftext|>
Token ID50256
Feature activation+1.58e+00

Pos logit contributions
 seeker+0.018
ained+0.015
 tired+0.014
 forward+0.013
 F+0.013
Neg logit contributions
 instructions-0.013
 occasions-0.012
So-0.012
quet-0.011
 brace-0.011
TokenOnce
Token ID7454
Feature activation+7.29e+00

Pos logit contributions
 seeker+0.034
ained+0.030
 tired+0.027
 F+0.026
 morning+0.025
Neg logit contributions
 instructions-0.035
 occasions-0.032
quet-0.031
 brace-0.031
 recipe-0.030
Token upon
Token ID2402
Feature activation+3.42e-01

Pos logit contributions
 seeker+0.082
ained+0.055
 tired+0.041
 forward+0.036
 slid+0.035
Neg logit contributions
 instructions-0.234
So-0.224
 occasions-0.219
 slices-0.218
quet-0.215
Token a
Token ID257
Feature activation+4.38e+00

Pos logit contributions
 seeker+0.004
ained+0.003
 tired+0.003
 morning+0.002
 spl+0.002
Neg logit contributions
 instructions-0.011
 occasions-0.011
quet-0.010
 recipe-0.010
keeping-0.010
Token time
Token ID640
Feature activation+5.66e+00

Pos logit contributions
 seeker+0.057
ained+0.040
 tired+0.039
 morning+0.033
 forward+0.032
Neg logit contributions
 instructions-0.141
keeping-0.134
quet-0.131
 occasions-0.128
 brace-0.125
Token,
Token ID11
Feature activation+5.14e+00

Pos logit contributions
 seeker+0.112
ained+0.095
asc+0.080
 tired+0.078
ancer+0.078
Neg logit contributions
 instructions-0.126
So-0.125
and-0.119
 slices-0.118
 distress-0.115
Token there
Token ID612
Feature activation+2.82e+00

Pos logit contributions
 seeker+0.101
ained+0.083
ahs+0.074
ancer+0.073
asc+0.072
Neg logit contributions
So-0.112
 distress-0.103
 slices-0.103
and-0.102
 instructions-0.102
Token<|endoftext|>
Token ID50256
Feature activation+1.83e+00

Pos logit contributions
 seeker+0.016
ained+0.013
ancer+0.012
 forward+0.012
 tired+0.012
Neg logit contributions
So-0.008
 instructions-0.008
and-0.007
 slices-0.007
 occasions-0.007
TokenOnce
Token ID7454
Feature activation+7.16e+00

Pos logit contributions
 seeker+0.040
ained+0.035
 tired+0.031
 F+0.030
 morning+0.029
Neg logit contributions
 instructions-0.041
 occasions-0.037
quet-0.036
 brace-0.036
 recipe-0.036
Token upon
Token ID2402
Feature activation+5.39e-01

Pos logit contributions
 seeker+0.085
ained+0.058
 tired+0.043
asc+0.040
 forward+0.039
Neg logit contributions
 instructions-0.224
So-0.215
 slices-0.208
 occasions-0.208
and-0.205
Token a
Token ID257
Feature activation+4.54e+00

Pos logit contributions
 seeker+0.006
ained+0.004
 tired+0.004
 morning+0.003
 slid+0.003
Neg logit contributions
 instructions-0.018
 occasions-0.017
quet-0.016
 recipe-0.016
keeping-0.016
Token time
Token ID640
Feature activation+5.94e+00

Pos logit contributions
 seeker+0.058
ained+0.041
 tired+0.039
 morning+0.034
 slid+0.031
Neg logit contributions
 instructions-0.149
keeping-0.140
quet-0.137
 occasions-0.134
 brace-0.131
Token,
Token ID11
Feature activation+5.23e+00

Pos logit contributions
 seeker+0.116
ained+0.097
 tired+0.081
asc+0.081
ancer+0.080
Neg logit contributions
 instructions-0.134
So-0.133
and-0.126
 slices-0.125
 occasions-0.123
Token there
Token ID612
Feature activation+2.88e+00

Pos logit contributions
 seeker+0.099
ained+0.078
ancer+0.068
asc+0.067
ahs+0.067
Neg logit contributions
So-0.120
 instructions-0.115
 slices-0.111
 distress-0.111
and-0.110
Token was
Token ID373
Feature activation+3.62e+00

Pos logit contributions
 seeker+0.063
ained+0.053
 tired+0.048
 slid+0.046
 forward+0.045
Neg logit contributions
 instructions-0.063
So-0.057
 occasions-0.056
quet-0.055
 slices-0.055
Token a
Token ID257
Feature activation-1.38e+00

Pos logit contributions
 seeker+0.071
ained+0.058
 tired+0.052
ancer+0.050
 forward+0.050
Neg logit contributions
 instructions-0.084
So-0.080
 slices-0.078
 occasions-0.077
 distress-0.075
Token gentle
Token ID10296
Feature activation-2.09e+00

Pos logit contributions
 instructions+0.020
So+0.020
 slices+0.018
 brace+0.018
 recipe+0.018
Neg logit contributions
ained-0.037
 seeker-0.036
asc-0.030
 forward-0.030
ancer-0.029
Token rabbit
Token ID22746
Feature activation-2.36e+00

Pos logit contributions
 instructions+0.029
 recipe+0.028
 note+0.027
quet+0.026
 poem+0.026
Neg logit contributions
ained-0.056
 seeker-0.056
 forward-0.048
asc-0.047
 slid-0.047
Token stars
Token ID5788
Feature activation+1.12e+00

Pos logit contributions
 instructions+0.014
 words+0.013
 brace+0.013
So+0.012
 slices+0.012
Neg logit contributions
 seeker-0.017
ained-0.015
 slid-0.014
asc-0.013
 forward-0.013
Token tw
Token ID665
Feature activation+3.91e+00

Pos logit contributions
 seeker+0.024
ained+0.019
 tired+0.017
asc+0.016
ancer+0.016
Neg logit contributions
 instructions-0.025
So-0.023
 slices-0.023
quet-0.022
 occasions-0.022
Tokenink
Token ID676
Feature activation+2.86e+00

Pos logit contributions
 seeker+0.083
ained+0.065
 tired+0.060
 forward+0.058
ancer+0.057
Neg logit contributions
 instructions-0.088
So-0.082
 occasions-0.081
 brace-0.081
quet-0.080
Tokenling
Token ID1359
Feature activation-4.05e-01

Pos logit contributions
 seeker+0.079
ained+0.063
ancer+0.058
 F+0.057
asc+0.057
Neg logit contributions
 instructions-0.051
So-0.046
 brace-0.046
 slices-0.045
quet-0.044
Token in
Token ID287
Feature activation+1.40e+00

Pos logit contributions
 instructions+0.009
So+0.009
 occasions+0.008
 slices+0.008
quet+0.008
Neg logit contributions
 seeker-0.008
ained-0.007
 tired-0.006
ancer-0.006
 slid-0.006
Token the
Token ID262
Feature activation+5.01e+00

Pos logit contributions
 seeker+0.016
 tired+0.010
ained+0.010
 dusk+0.009
 F+0.009
Neg logit contributions
 instructions-0.042
and-0.042
 occasions-0.042
quet-0.042
So-0.041
Token night
Token ID1755
Feature activation+2.83e+00

Pos logit contributions
 seeker+0.111
ained+0.094
ancer+0.088
 kidding+0.085
 surfing+0.084
Neg logit contributions
 instructions-0.103
So-0.096
 words-0.093
 slices-0.092
 brace-0.091
Token sky
Token ID6766
Feature activation+3.10e-01

Pos logit contributions
 seeker+0.056
ained+0.046
 tired+0.042
 forward+0.039
 slid+0.039
Neg logit contributions
 instructions-0.066
So-0.062
 occasions-0.061
 slices-0.059
quet-0.059
Token.
Token ID13
Feature activation-3.60e-01

Pos logit contributions
 seeker+0.008
ained+0.007
 tired+0.007
 slid+0.006
 forward+0.006
Neg logit contributions
 instructions-0.005
So-0.005
 occasions-0.005
 slices-0.005
and-0.005
Token "
Token ID366
Feature activation-3.03e+00

Pos logit contributions
 instructions+0.008
So+0.007
 occasions+0.007
 slices+0.007
 brace+0.007
Neg logit contributions
 seeker-0.008
ained-0.007
 tired-0.006
 forward-0.006
asc-0.006
TokenWow
Token ID22017
Feature activation+3.41e+00

Pos logit contributions
 instructions+0.067
 occasions+0.060
 slices+0.060
So+0.059
 distress+0.059
Neg logit contributions
 seeker-0.065
ained-0.055
 tired-0.048
 forward-0.047
 slid-0.047
Token with
Token ID351
Feature activation+6.57e-01

Pos logit contributions
 instructions+0.039
So+0.036
 occasions+0.034
 slices+0.033
and+0.033
Neg logit contributions
 seeker-0.057
ained-0.050
 tired-0.044
 slid-0.044
asc-0.043
Token you
Token ID345
Feature activation+7.86e-01

Pos logit contributions
 seeker+0.016
ained+0.014
asc+0.012
ancer+0.012
 tired+0.012
Neg logit contributions
 instructions-0.014
So-0.012
 occasions-0.012
 slices-0.011
 words-0.011
Token.
Token ID13
Feature activation+4.17e-01

Pos logit contributions
 seeker+0.022
ained+0.020
asc+0.018
 slid+0.018
ancer+0.017
Neg logit contributions
 instructions-0.012
So-0.011
 occasions-0.010
and-0.009
 slices-0.009
Token Don
Token ID2094
Feature activation+4.52e-01

Pos logit contributions
 seeker+0.010
ained+0.009
 tired+0.008
ahs+0.008
 slid+0.008
Neg logit contributions
So-0.008
 instructions-0.007
and-0.007
 occasions-0.006
keeping-0.006
Tokenâ
Token ID22940
Feature activation+1.53e+00

Pos logit contributions
 seeker+0.013
ained+0.011
 tired+0.010
asc+0.010
 kidding+0.010
Neg logit contributions
 instructions-0.007
So-0.007
quet-0.006
 occasions-0.006
 truth-0.006
Token€
Token ID26391
Feature activation+2.61e+00

Pos logit contributions
 seeker+0.035
ained+0.029
 tired+0.028
ancer+0.026
 slid+0.026
Neg logit contributions
 instructions-0.031
So-0.030
 occasions-0.028
quet-0.027
keeping-0.026
Token™
Token ID8151
Feature activation+4.46e+00

Pos logit contributions
 seeker+0.066
ained+0.059
 forward+0.053
asc+0.052
ancer+0.052
Neg logit contributions
 instructions-0.042
So-0.041
 slices-0.037
keeping-0.037
and-0.036
Tokent
Token ID83
Feature activation-1.71e+00

Pos logit contributions
 seeker+0.116
ained+0.097
 tired+0.088
asc+0.086
 dusk+0.086
Neg logit contributions
 instructions-0.075
So-0.074
and-0.069
 slices-0.067
quet-0.064
Token worry
Token ID5490
Feature activation-1.18e+00

Pos logit contributions
 instructions+0.032
So+0.030
 occasions+0.028
 distress+0.027
and+0.027
Neg logit contributions
 seeker-0.042
ained-0.037
asc-0.033
 tired-0.032
 forward-0.032
Token,
Token ID11
Feature activation-2.99e-01

Pos logit contributions
 instructions+0.023
So+0.022
 occasions+0.019
 truth+0.019
 words+0.019
Neg logit contributions
 seeker-0.029
ained-0.025
 tired-0.021
 slid-0.021
 dusk-0.021
Token I
Token ID314
Feature activation+2.09e-01

Pos logit contributions
 instructions+0.005
So+0.005
 words+0.004
 truth+0.004
and+0.004
Neg logit contributions
 seeker-0.008
ained-0.007
asc-0.006
ahs-0.006
ancer-0.006
Token saw
Token ID2497
Feature activation-2.71e-01

Pos logit contributions
 seeker+0.006
ained+0.005
ancer+0.004
 forward+0.004
 tired+0.004
Neg logit contributions
 instructions-0.006
So-0.006
 slices-0.005
 occasions-0.005
 brace-0.005
Token Ben
Token ID3932
Feature activation-1.68e+00

Pos logit contributions
 instructions+0.007
So+0.006
 slices+0.006
 occasions+0.006
 distress+0.006
Neg logit contributions
 seeker-0.005
ained-0.004
 tired-0.004
 forward-0.004
ancer-0.004
Token's
Token ID338
Feature activation+6.74e-01

Pos logit contributions
 instructions+0.032
So+0.029
 slices+0.028
 occasions+0.028
 brace+0.028
Neg logit contributions
 seeker-0.041
ained-0.035
 tired-0.031
ancer-0.031
 forward-0.031
Token pistol
Token ID16790
Feature activation+1.13e+00

Pos logit contributions
 seeker+0.012
 tired+0.010
ained+0.008
 kay+0.008
 forward+0.008
Neg logit contributions
 occasions-0.018
quet-0.018
So-0.017
and-0.017
qu-0.017
Token on
Token ID319
Feature activation+5.78e-01

Pos logit contributions
 seeker+0.025
ained+0.021
 tired+0.019
 forward+0.018
 slid+0.018
Neg logit contributions
 instructions-0.024
So-0.022
 occasions-0.022
 slices-0.021
quet-0.021
Token the
Token ID262
Feature activation+2.68e+00

Pos logit contributions
 seeker+0.009
ained+0.007
 tired+0.006
 F+0.006
 forward+0.006
Neg logit contributions
 instructions-0.016
So-0.015
 occasions-0.015
quet-0.015
and-0.014
Token ground
Token ID2323
Feature activation+1.30e+00

Pos logit contributions
 seeker+0.057
ained+0.043
 tired+0.041
 forward+0.039
 morning+0.039
Neg logit contributions
 instructions-0.058
So-0.057
 occasions-0.056
quet-0.056
ymes-0.055
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
 seeker+0.031
ained+0.027
 tired+0.025
 forward+0.024
 slid+0.024
Neg logit contributions
 instructions-0.024
 occasions-0.023
So-0.022
quet-0.022
 brace-0.021
Token Ben
Token ID3932
Feature activation-7.15e-01

Pos logit contributions
 seeker+0.024
ained+0.019
 tired+0.018
 F+0.017
 slid+0.016
Neg logit contributions
 instructions-0.029
 occasions-0.027
quet-0.026
So-0.026
keeping-0.025
Token had
Token ID550
Feature activation+2.34e+00

Pos logit contributions
 instructions+0.017
So+0.015
 occasions+0.015
quet+0.015
 slices+0.015
Neg logit contributions
 seeker-0.014
ained-0.012
 tired-0.011
 slid-0.010
 forward-0.010
Token dropped
Token ID5710
Feature activation+9.36e-01

Pos logit contributions
 seeker+0.042
ained+0.033
 tired+0.031
 slid+0.029
 F+0.028
Neg logit contributions
 instructions-0.059
quet-0.057
 occasions-0.056
So-0.056
and-0.054
Token boy
Token ID2933
Feature activation-1.21e+00

Pos logit contributions
 instructions+0.024
 recipe+0.023
 slices+0.023
 note+0.023
 distress+0.022
Neg logit contributions
ained-0.047
 seeker-0.044
asc-0.039
 forward-0.038
 slid-0.038
Token called
Token ID1444
Feature activation+1.63e+00

Pos logit contributions
 instructions+0.038
So+0.036
 occasions+0.035
 slices+0.035
quet+0.035
Neg logit contributions
 seeker-0.015
ained-0.010
 tired-0.008
 forward-0.007
 slid-0.007
Token Jimmy
Token ID12963
Feature activation-7.44e-01

Pos logit contributions
 seeker+0.032
ained+0.031
 forward+0.025
 tired+0.024
 slid+0.024
Neg logit contributions
 instructions-0.037
 slices-0.034
and-0.033
So-0.033
 occasions-0.033
Token.
Token ID13
Feature activation+1.35e+00

Pos logit contributions
 instructions+0.013
So+0.011
 occasions+0.011
quet+0.011
 slices+0.011
Neg logit contributions
 seeker-0.020
ained-0.017
 tired-0.016
 forward-0.016
 slid-0.015
Token One
Token ID1881
Feature activation+1.11e+00

Pos logit contributions
 seeker+0.029
ained+0.024
 tired+0.022
 forward+0.021
 slid+0.021
Neg logit contributions
 instructions-0.030
So-0.028
 occasions-0.027
 slices-0.026
quet-0.026
Token day
Token ID1110
Feature activation+3.15e+00

Pos logit contributions
 seeker+0.022
ained+0.018
ancer+0.017
 kidding+0.016
ahs+0.016
Neg logit contributions
So-0.023
and-0.022
 instructions-0.022
 poem-0.022
 note-0.022
Token,
Token ID11
Feature activation+2.10e+00

Pos logit contributions
 seeker+0.073
ained+0.067
asc+0.058
ancer+0.054
 dusk+0.052
Neg logit contributions
 instructions-0.059
So-0.057
 slices-0.055
 occasions-0.055
 words-0.053
Token Jimmy
Token ID12963
Feature activation-4.13e-01

Pos logit contributions
 seeker+0.048
ained+0.042
asc+0.038
ahs+0.034
 forward+0.033
Neg logit contributions
 instructions-0.043
So-0.041
 slices-0.039
 occasions-0.038
and-0.037
Token went
Token ID1816
Feature activation+2.15e+00

Pos logit contributions
 instructions+0.008
So+0.008
 slices+0.008
 distress+0.008
 occasions+0.008
Neg logit contributions
 seeker-0.009
ained-0.008
ancer-0.007
 forward-0.007
 tired-0.007
Token to
Token ID284
Feature activation-1.00e-02

Pos logit contributions
 seeker+0.065
ained+0.059
Whe+0.056
ancer+0.054
 kidding+0.054
Neg logit contributions
 instructions-0.027
 recipe-0.022
and-0.021
 occasions-0.019
 words-0.019
Token his
Token ID465
Feature activation+1.56e+00

Pos logit contributions
 instructions+0.000
So+0.000
 occasions+0.000
 slices+0.000
 distress+0.000
Neg logit contributions
 seeker-0.000
ained-0.000
 forward-0.000
 tired-0.000
asc-0.000
Tokenay
Token ID323
Feature activation+5.11e-01

Pos logit contributions
 seeker+0.043
ained+0.041
 forward+0.032
ancer+0.031
 tired+0.031
Neg logit contributions
 instructions-0.066
 distress-0.063
 occasions-0.063
So-0.061
keeping-0.061
Tokenons
Token ID684
Feature activation+1.31e+00

Pos logit contributions
 seeker+0.011
ained+0.010
 tired+0.009
 forward+0.009
 slid+0.009
Neg logit contributions
 instructions-0.010
So-0.009
 distress-0.009
 occasions-0.009
keeping-0.009
Token hard
Token ID1327
Feature activation+1.04e+00

Pos logit contributions
 seeker+0.023
ained+0.019
 tired+0.018
 forward+0.017
 slid+0.016
Neg logit contributions
 instructions-0.032
 occasions-0.031
So-0.031
 distress-0.030
keeping-0.030
Token and
Token ID290
Feature activation+2.23e+00

Pos logit contributions
 seeker+0.019
 forward+0.016
ained+0.016
 tired+0.015
 morning+0.015
Neg logit contributions
 distress-0.024
 occasions-0.023
keeping-0.023
 instructions-0.023
quet-0.023
Token fast
Token ID3049
Feature activation+8.17e-01

Pos logit contributions
 seeker+0.054
ained+0.046
 tired+0.043
 forward+0.041
 slid+0.041
Neg logit contributions
 instructions-0.043
So-0.039
 occasions-0.038
 slices-0.037
quet-0.037
Token.
Token ID13
Feature activation+1.87e+00

Pos logit contributions
 seeker+0.021
ained+0.018
 tired+0.017
 forward+0.016
 slid+0.016
Neg logit contributions
 instructions-0.014
So-0.013
 occasions-0.013
quet-0.012
 slices-0.012
Token They
Token ID1119
Feature activation+1.67e+00

Pos logit contributions
 seeker+0.048
ained+0.040
asc+0.036
 forward+0.036
 tired+0.036
Neg logit contributions
 instructions-0.035
So-0.033
 slices-0.030
and-0.030
 occasions-0.029
Token want
Token ID765
Feature activation-1.53e+00

Pos logit contributions
 seeker+0.041
ained+0.036
asc+0.031
ancer+0.031
 tired+0.031
Neg logit contributions
 instructions-0.032
So-0.029
 slices-0.028
 brace-0.028
 occasions-0.027
Token to
Token ID284
Feature activation-2.95e+00

Pos logit contributions
quet+0.033
keeping+0.032
So+0.032
 occasions+0.031
 instructions+0.031
Neg logit contributions
 seeker-0.029
 tired-0.026
 spl-0.025
 surfing-0.025
ained-0.024
Token see
Token ID766
Feature activation-1.10e+00

Pos logit contributions
 instructions+0.067
keeping+0.064
So+0.064
and+0.064
ymes+0.063
Neg logit contributions
 seeker-0.059
 tired-0.045
ained-0.044
 slid-0.044
 spl-0.043
Token purple
Token ID14032
Feature activation+4.06e-02

Pos logit contributions
 instructions+0.027
So+0.026
 occasions+0.026
quet+0.025
keeping+0.025
Neg logit contributions
 seeker-0.020
ained-0.016
 tired-0.015
 forward-0.014
 slid-0.014
Token ducks
Token ID39694
Feature activation+4.03e-03

Pos logit contributions
 instructions+0.019
So+0.018
 slices+0.017
 brace+0.017
 distress+0.017
Neg logit contributions
 seeker-0.017
ained-0.014
 tired-0.012
ancer-0.012
asc-0.012
Token in
Token ID287
Feature activation+1.17e+00

Pos logit contributions
 seeker+0.000
ained+0.000
asc+0.000
Whe+0.000
ancer+0.000
Neg logit contributions
 instructions-0.000
 slices-0.000
So-0.000
 distress-0.000
 brace-0.000
Token the
Token ID262
Feature activation+4.01e+00

Pos logit contributions
 seeker+0.020
ained+0.015
 tired+0.014
 forward+0.012
 slid+0.012
Neg logit contributions
 instructions-0.031
So-0.029
 occasions-0.029
quet-0.028
 slices-0.028
Token water
Token ID1660
Feature activation+2.30e+00

Pos logit contributions
 seeker+0.091
ained+0.082
ancer+0.078
asc+0.076
ahs+0.074
Neg logit contributions
 instructions-0.080
So-0.073
 truth-0.069
 rules-0.068
 slices-0.067
Token and
Token ID290
Feature activation+4.93e-01

Pos logit contributions
 seeker+0.058
ained+0.049
asc+0.044
ancer+0.044
 tired+0.043
Neg logit contributions
 instructions-0.042
So-0.038
 slices-0.038
 recipe-0.036
 occasions-0.036
Token sand
Token ID6450
Feature activation+1.66e+00

Pos logit contributions
 seeker+0.012
ained+0.010
 forward+0.009
asc+0.009
ancer+0.009
Neg logit contributions
 instructions-0.010
 slices-0.009
So-0.009
 brace-0.009
 words-0.009
Token castles
Token ID49203
Feature activation+1.35e+00

Pos logit contributions
 seeker+0.043
ained+0.036
 tired+0.033
 forward+0.032
ancer+0.032
Neg logit contributions
 instructions-0.030
So-0.027
 occasions-0.026
 slices-0.026
quet-0.025
Token on
Token ID319
Feature activation+1.27e+00

Pos logit contributions
 seeker+0.032
ained+0.028
 tired+0.026
 forward+0.025
 slid+0.025
Neg logit contributions
 instructions-0.026
So-0.024
 occasions-0.023
 slices-0.022
quet-0.022
Token the
Token ID262
Feature activation+3.13e+00

Pos logit contributions
 seeker+0.018
ained+0.013
 tired+0.011
 F+0.011
 dusk+0.011
Neg logit contributions
 instructions-0.037
So-0.035
 occasions-0.035
quet-0.034
and-0.034
Token shore
Token ID15191
Feature activation+1.50e+00

Pos logit contributions
 seeker+0.065
ained+0.058
ancer+0.049
asc+0.049
 tired+0.046
Neg logit contributions
 instructions-0.075
So-0.063
 slices-0.063
 brace-0.063
 occasions-0.063
Token.
Token ID13
Feature activation+3.63e-01

Pos logit contributions
 seeker+0.044
ained+0.038
 tired+0.035
asc+0.035
ancer+0.034
Neg logit contributions
 instructions-0.022
So-0.019
 slices-0.018
 occasions-0.018
quet-0.017
Token "
Token ID366
Feature activation-3.25e+00

Pos logit contributions
 instructions+0.010
So+0.009
 occasions+0.009
 slices+0.009
quet+0.009
Neg logit contributions
 seeker-0.008
ained-0.006
 tired-0.006
 forward-0.005
 slid-0.005
TokenWow
Token ID22017
Feature activation+3.18e+00

Pos logit contributions
 instructions+0.065
So+0.060
 occasions+0.059
 slices+0.058
quet+0.057
Neg logit contributions
 seeker-0.076
ained-0.064
 tired-0.058
 forward-0.056
 slid-0.056
Token!"
Token ID2474
Feature activation-2.23e-01

Pos logit contributions
 seeker+0.074
ained+0.070
asc+0.057
 tired+0.057
ancer+0.056
Neg logit contributions
 instructions-0.062
So-0.057
 slices-0.055
 brace-0.054
quet-0.054
Token said
Token ID531
Feature activation-2.12e-01

Pos logit contributions
 instructions+0.005
So+0.005
 occasions+0.005
 slices+0.005
quet+0.005
Neg logit contributions
 seeker-0.005
ained-0.004
 tired-0.003
 slid-0.003
 forward-0.003
Token Jake
Token ID14757
Feature activation-2.39e+00

Pos logit contributions
 instructions+0.005
So+0.004
 slices+0.004
 occasions+0.004
 words+0.004
Neg logit contributions
 seeker-0.004
ained-0.004
asc-0.003
 tired-0.003
ancer-0.003
Token.
Token ID13
Feature activation-8.22e-01

Pos logit contributions
 instructions+0.043
So+0.040
 occasions+0.040
 slices+0.038
quet+0.038
Neg logit contributions
 seeker-0.060
ained-0.051
 tired-0.048
 forward-0.046
 slid-0.046
Token "
Token ID366
Feature activation-3.28e+00

Pos logit contributions
 instructions+0.018
So+0.017
 occasions+0.016
 slices+0.016
quet+0.016
Neg logit contributions
 seeker-0.018
ained-0.015
 tired-0.013
 forward-0.013
 slid-0.012
TokenThis
Token ID1212
Feature activation+3.05e-01

Pos logit contributions
 instructions+0.069
 occasions+0.063
 distress+0.063
 recipe+0.063
 slices+0.062
Neg logit contributions
 seeker-0.071
ained-0.061
Whe-0.055
 forward-0.054
 slid-0.054
Token looks
Token ID3073
Feature activation+8.03e-01

Pos logit contributions
 seeker+0.007
ained+0.006
asc+0.006
 tired+0.006
 dusk+0.005
Neg logit contributions
 instructions-0.006
So-0.006
 words-0.005
 brace-0.005
 occasions-0.005
Token fine
Token ID3734
Feature activation-1.63e+00

Pos logit contributions
 seeker+0.020
ained+0.018
ancer+0.016
asc+0.016
 dusk+0.015
Neg logit contributions
 instructions-0.015
So-0.013
 brace-0.013
 words-0.013
and-0.012
Token!"
Token ID2474
Feature activation-1.56e+00

Pos logit contributions
 instructions+0.028
 occasions+0.023
So+0.023
 words+0.023
 brace+0.023
Neg logit contributions
 seeker-0.044
ained-0.040
 tired-0.035
 slid-0.035
asc-0.035
Token the
Token ID262
Feature activation-1.22e-02

Pos logit contributions
 instructions+0.028
So+0.026
 occasions+0.025
 slices+0.025
 distress+0.025
Neg logit contributions
 seeker-0.021
ained-0.017
 tired-0.015
ancer-0.014
asc-0.014
Token river
Token ID7850
Feature activation-6.25e-01

Pos logit contributions
 instructions+0.000
So+0.000
 slices+0.000
 truth+0.000
 words+0.000
Neg logit contributions
 seeker-0.000
ained-0.000
asc-0.000
 slid-0.000
ancer-0.000
Token and
Token ID290
Feature activation+9.75e-01

Pos logit contributions
 instructions+0.013
So+0.012
 occasions+0.011
 slices+0.011
quet+0.011
Neg logit contributions
 seeker-0.014
ained-0.012
 tired-0.011
 forward-0.011
 slid-0.011
Token saw
Token ID2497
Feature activation-7.07e-01

Pos logit contributions
 seeker+0.027
ained+0.023
asc+0.021
ancer+0.021
 forward+0.019
Neg logit contributions
 instructions-0.017
So-0.016
 slices-0.016
 distress-0.015
 brace-0.015
Token something
Token ID1223
Feature activation-1.65e+00

Pos logit contributions
 instructions+0.016
So+0.015
 slices+0.014
 distress+0.014
 occasions+0.014
Neg logit contributions
 seeker-0.015
ained-0.012
ancer-0.011
 tired-0.011
asc-0.011
Token new
Token ID649
Feature activation-2.12e+00

Pos logit contributions
 instructions+0.034
So+0.032
 occasions+0.031
 slices+0.030
quet+0.030
Neg logit contributions
 seeker-0.038
ained-0.032
 tired-0.029
 forward-0.028
ancer-0.028
Token.
Token ID13
Feature activation-3.91e-01

Pos logit contributions
 instructions+0.039
So+0.036
 occasions+0.035
quet+0.034
 slices+0.033
Neg logit contributions
 seeker-0.053
ained-0.046
 tired-0.042
 slid-0.040
 forward-0.040
Token All
Token ID1439
Feature activation+7.50e-01

Pos logit contributions
 instructions+0.007
So+0.007
 slices+0.007
and+0.006
 truth+0.006
Neg logit contributions
 seeker-0.010
ained-0.009
asc-0.008
 forward-0.007
 tired-0.007
Token of
Token ID286
Feature activation+1.08e+00

Pos logit contributions
 seeker+0.019
ained+0.014
ancer+0.013
asc+0.013
 kidding+0.012
Neg logit contributions
 instructions-0.015
So-0.014
 distress-0.014
 slices-0.014
and-0.013
Token a
Token ID257
Feature activation-2.25e+00

Pos logit contributions
 seeker+0.020
ained+0.016
 tired+0.014
 forward+0.013
 slid+0.013
Neg logit contributions
 instructions-0.028
So-0.026
 occasions-0.025
 slices-0.025
quet-0.025
Token sudden
Token ID4802
Feature activation+1.82e+00

Pos logit contributions
 instructions+0.040
 occasions+0.037
So+0.036
keeping+0.035
quet+0.035
Neg logit contributions
 seeker-0.056
ained-0.047
 tired-0.046
 forward-0.043
 slid-0.043
Token the
Token ID262
Feature activation+1.09e+00

Pos logit contributions
 instructions+0.001
 slices+0.001
So+0.001
 occasions+0.001
keeping+0.001
Neg logit contributions
 seeker-0.002
ained-0.001
 tired-0.001
ancer-0.001
 slid-0.001
Token dirt
Token ID13647
Feature activation+1.18e+00

Pos logit contributions
 seeker+0.026
ained+0.021
 tired+0.020
 forward+0.019
 slid+0.019
Neg logit contributions
 instructions-0.021
So-0.020
 occasions-0.019
quet-0.019
and-0.019
Token!"
Token ID2474
Feature activation-1.21e+00

Pos logit contributions
 seeker+0.028
ained+0.024
 tired+0.023
 forward+0.023
 morning+0.022
Neg logit contributions
 distress-0.020
 instructions-0.020
So-0.020
 occasions-0.019
ymes-0.019
Token\n
Token ID198
Feature activation-2.48e+00

Pos logit contributions
 instructions+0.028
quet+0.027
 occasions+0.027
keeping+0.027
ymes+0.027
Neg logit contributions
 seeker-0.020
 tired-0.018
 F-0.016
 morning-0.016
ained-0.016
Token\n
Token ID198
Feature activation-3.39e+00

Pos logit contributions
 brace+0.057
keeping+0.055
 instructions+0.053
 occasions+0.053
ymes+0.052
Neg logit contributions
 seeker-0.043
 F-0.038
 tired-0.037
 morning-0.035
 Jump-0.035
TokenM
Token ID44
Feature activation+5.02e-02

Pos logit contributions
keeping+0.066
 brace+0.065
 instructions+0.061
 occasions+0.060
 distress+0.058
Neg logit contributions
 seeker-0.072
 forward-0.059
 tired-0.058
 F-0.056
 dusk-0.056
Tokenama
Token ID1689
Feature activation-1.30e+00

Pos logit contributions
 seeker+0.001
ained+0.001
 tired+0.001
 forward+0.001
ancer+0.001
Neg logit contributions
 instructions-0.001
So-0.001
 occasions-0.001
 slices-0.001
 brace-0.001
Token was
Token ID373
Feature activation+4.42e-01

Pos logit contributions
 instructions+0.026
So+0.024
 slices+0.023
 occasions+0.023
quet+0.022
Neg logit contributions
 seeker-0.031
ained-0.027
 tired-0.023
 forward-0.023
ancer-0.023
Token very
Token ID845
Feature activation-3.03e-01

Pos logit contributions
 seeker+0.009
ained+0.008
 tired+0.007
ancer+0.007
 slid+0.006
Neg logit contributions
 instructions-0.010
So-0.009
 slices-0.009
 occasions-0.009
 distress-0.009
Token impressed
Token ID12617
Feature activation-4.06e+00

Pos logit contributions
 instructions+0.007
So+0.006
 slices+0.006
 occasions+0.006
quet+0.006
Neg logit contributions
 seeker-0.007
ained-0.006
ancer-0.005
 tired-0.005
asc-0.005
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
 instructions+0.094
 occasions+0.092
 distress+0.090
 lair+0.089
keeping+0.088
Neg logit contributions
 seeker-0.066
ained-0.057
 tired-0.057
 forward-0.056
 on-0.055
Token He
Token ID679
Feature activation+8.59e-02

Pos logit contributions
 instructions+0.002
So+0.002
and+0.002
 slices+0.002
 occasions+0.002
Neg logit contributions
 seeker-0.003
ained-0.003
asc-0.002
ancer-0.002
 tired-0.002
Token said
Token ID531
Feature activation+5.58e-01

Pos logit contributions
 seeker+0.002
ained+0.002
 tired+0.002
 forward+0.002
ancer+0.002
Neg logit contributions
 instructions-0.002
So-0.002
 occasions-0.001
 slices-0.001
 brace-0.001
Token,
Token ID11
Feature activation+4.32e-01

Pos logit contributions
 seeker+0.012
ained+0.010
asc+0.008
ancer+0.008
 tired+0.008
Neg logit contributions
 instructions-0.013
So-0.012
 slices-0.012
 occasions-0.012
 brace-0.012
Token "
Token ID366
Feature activation-3.66e+00

Pos logit contributions
 seeker+0.011
ained+0.010
asc+0.009
ancer+0.008
 dusk+0.008
Neg logit contributions
 instructions-0.008
So-0.007
 slices-0.007
 recipe-0.007
 words-0.007
TokenYes
Token ID5297
Feature activation+6.70e-01

Pos logit contributions
 instructions+0.079
 slices+0.070
 occasions+0.070
 recipe+0.070
 distress+0.069
Neg logit contributions
 seeker-0.081
ained-0.068
 tired-0.061
 forward-0.060
 slid-0.060
Token,
Token ID11
Feature activation-8.38e-01

Pos logit contributions
 seeker+0.015
ained+0.015
asc+0.012
ancer+0.011
 dusk+0.011
Neg logit contributions
 instructions-0.014
So-0.013
 slices-0.012
 occasions-0.012
 words-0.012
Token Lily
Token ID20037
Feature activation-8.76e-01

Pos logit contributions
 instructions+0.016
So+0.015
 slices+0.014
 occasions+0.014
quet+0.013
Neg logit contributions
 seeker-0.020
ained-0.019
asc-0.016
 tired-0.016
ancer-0.015
Token.
Token ID13
Feature activation+4.95e-01

Pos logit contributions
 instructions+0.016
So+0.015
 occasions+0.014
 slices+0.014
quet+0.013
Neg logit contributions
 seeker-0.022
ained-0.019
 tired-0.017
ancer-0.017
 slid-0.017
Token I
Token ID314
Feature activation+4.44e-01

Pos logit contributions
 seeker+0.013
ained+0.013
 tired+0.011
ahs+0.011
 spl+0.011
Neg logit contributions
 instructions-0.007
So-0.006
 occasions-0.006
and-0.006
 words-0.006
Token promise
Token ID6991
Feature activation-2.60e+00

Pos logit contributions
 seeker+0.010
ained+0.009
ancer+0.008
 forward+0.008
asc+0.008
Neg logit contributions
 instructions-0.008
So-0.008
 occasions-0.008
 truth-0.007
 recipe-0.007
Token.
Token ID13
Feature activation+2.33e-01

Pos logit contributions
 instructions+0.047
So+0.043
 occasions+0.042
 slices+0.042
quet+0.041
Neg logit contributions
 seeker-0.066
ained-0.056
 tired-0.053
 forward-0.050
 slid-0.050
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
 instructions+0.010
 occasions+0.009
So+0.008
 slices+0.008
quet+0.008
Neg logit contributions
 seeker-0.014
ained-0.013
 tired-0.012
 forward-0.011
asc-0.011
Token Blue
Token ID4518
Feature activation-1.05e+00

Pos logit contributions
 seeker+0.029
ained+0.024
 tired+0.023
 F+0.022
 morning+0.021
Neg logit contributions
 instructions-0.038
 occasions-0.034
 recipe-0.034
keeping-0.034
and-0.034
Token lived
Token ID5615
Feature activation+1.15e+00

Pos logit contributions
 instructions+0.028
So+0.026
 occasions+0.026
 slices+0.026
quet+0.025
Neg logit contributions
 seeker-0.017
ained-0.014
 tired-0.012
 slid-0.011
 forward-0.011
Token in
Token ID287
Feature activation+2.82e+00

Pos logit contributions
 seeker+0.021
ained+0.017
 tired+0.014
ancer+0.014
 slid+0.014
Neg logit contributions
 instructions-0.028
So-0.027
 slices-0.026
and-0.026
 brace-0.025
Token a
Token ID257
Feature activation+8.88e-01

Pos logit contributions
 seeker+0.044
ained+0.034
 tired+0.031
 F+0.027
 morning+0.027
Neg logit contributions
 instructions-0.079
 occasions-0.075
So-0.074
quet-0.072
 recipe-0.072
Token tiny
Token ID7009
Feature activation+4.83e-01

Pos logit contributions
 seeker+0.020
ained+0.016
 tired+0.015
 forward+0.014
 slid+0.014
Neg logit contributions
 instructions-0.019
So-0.018
 occasions-0.018
 slices-0.017
quet-0.017
Token nest
Token ID16343
Feature activation+1.63e+00

Pos logit contributions
 seeker+0.011
ained+0.008
 tired+0.008
 forward+0.007
 pool+0.007
Neg logit contributions
 instructions-0.011
 occasions-0.010
So-0.010
quet-0.010
 slices-0.009
Token in
Token ID287
Feature activation+1.64e+00

Pos logit contributions
 seeker+0.034
ained+0.027
 tired+0.025
ancer+0.024
 slid+0.024
Neg logit contributions
 instructions-0.036
So-0.034
 slices-0.032
 occasions-0.032
and-0.032
Token the
Token ID262
Feature activation+3.32e+00

Pos logit contributions
 seeker+0.024
ained+0.018
 tired+0.016
 F+0.015
 morning+0.014
Neg logit contributions
 instructions-0.048
So-0.045
 occasions-0.045
quet-0.044
 recipe-0.044
Token west
Token ID7421
Feature activation+9.68e-01

Pos logit contributions
 seeker+0.063
ained+0.052
 tired+0.048
 forward+0.043
 morning+0.043
Neg logit contributions
 instructions-0.082
 occasions-0.076
So-0.075
quet-0.074
 slices-0.074
Token part
Token ID636
Feature activation-2.78e-01

Pos logit contributions
 seeker+0.024
ained+0.021
 tired+0.020
 forward+0.019
 morning+0.018
Neg logit contributions
 instructions-0.018
 occasions-0.016
So-0.016
 slices-0.015
 brace-0.015
Token and
Token ID290
Feature activation+8.77e-01

Pos logit contributions
 instructions+0.063
quet+0.060
 occasions+0.060
 slices+0.058
So+0.058
Neg logit contributions
 seeker-0.057
ained-0.047
 tired-0.045
 forward-0.042
 slid-0.039
Token didn
Token ID1422
Feature activation-6.64e-01

Pos logit contributions
 seeker+0.020
ained+0.016
ancer+0.016
asc+0.015
 dusk+0.014
Neg logit contributions
 instructions-0.018
So-0.017
 brace-0.016
 occasions-0.016
 words-0.016
Token't
Token ID470
Feature activation-2.49e+00

Pos logit contributions
 instructions+0.014
 occasions+0.014
keeping+0.013
 slices+0.013
So+0.013
Neg logit contributions
 seeker-0.014
ained-0.012
 tired-0.011
 forward-0.011
 F-0.010
Token want
Token ID765
Feature activation-2.16e+00

Pos logit contributions
 instructions+0.056
So+0.052
 occasions+0.050
 slices+0.049
quet+0.049
Neg logit contributions
 seeker-0.053
ained-0.044
 tired-0.039
 forward-0.037
ancer-0.037
Token to
Token ID284
Feature activation-2.68e+00

Pos logit contributions
 instructions+0.039
So+0.039
quet+0.037
 occasions+0.036
keeping+0.036
Neg logit contributions
 seeker-0.052
ained-0.044
 tired-0.042
 forward-0.040
 F-0.040
Token learn
Token ID2193
Feature activation-3.97e+00

Pos logit contributions
 instructions+0.059
and+0.055
So+0.055
 slices+0.055
keeping+0.055
Neg logit contributions
 seeker-0.058
ained-0.045
 tired-0.044
 slid-0.040
 F-0.039
Token.
Token ID13
Feature activation-8.70e-01

Pos logit contributions
 instructions+0.077
So+0.072
 occasions+0.072
quet+0.071
 slices+0.070
Neg logit contributions
 seeker-0.091
ained-0.079
 tired-0.072
 forward-0.070
 F-0.068
Token
Token ID220
Feature activation+1.82e+00

Pos logit contributions
 instructions+0.021
quet+0.020
 occasions+0.020
ymes+0.020
 distress+0.019
Neg logit contributions
 seeker-0.016
ained-0.014
 tired-0.013
 morning-0.012
 F-0.012
Token\n
Token ID198
Feature activation-2.26e+00

Pos logit contributions
ained+0.031
 seeker+0.031
 tired+0.026
 forward+0.025
ancer+0.025
Neg logit contributions
 instructions-0.045
 occasions-0.044
quet-0.042
 recipe-0.042
 brace-0.042
Token\n
Token ID198
Feature activation-3.20e+00

Pos logit contributions
 instructions+0.060
quet+0.060
 brace+0.058
 occasions+0.058
 recipe+0.058
Neg logit contributions
 seeker-0.030
ained-0.029
 tired-0.026
 upon-0.025
 F-0.025
TokenSo
Token ID2396
Feature activation+2.10e+00

Pos logit contributions
 instructions+0.062
 slices+0.057
keeping+0.056
 distress+0.055
 occasions+0.055
Neg logit contributions
 seeker-0.077
ained-0.068
 tired-0.061
 forward-0.059
 slid-0.058
Token said
Token ID531
Feature activation-6.09e-02

Pos logit contributions
 seeker+0.055
ained+0.050
asc+0.043
ancer+0.043
 forward+0.042
Neg logit contributions
So-0.017
 instructions-0.016
 brace-0.014
 slices-0.013
keeper-0.013
Token,
Token ID11
Feature activation-9.14e-02

Pos logit contributions
 instructions+0.001
So+0.001
 slices+0.001
 brace+0.001
 occasions+0.001
Neg logit contributions
 seeker-0.001
ained-0.001
ancer-0.001
asc-0.001
 tired-0.001
Token "
Token ID366
Feature activation-4.15e+00

Pos logit contributions
 instructions+0.002
So+0.002
 slices+0.001
 recipe+0.001
quet+0.001
Neg logit contributions
 seeker-0.002
ained-0.002
asc-0.002
ancer-0.002
 dusk-0.002
TokenThat
Token ID2504
Feature activation+9.72e-01

Pos logit contributions
 instructions+0.075
 occasions+0.067
So+0.066
 slices+0.066
 distress+0.064
Neg logit contributions
 seeker-0.103
ained-0.090
 tired-0.082
 slid-0.080
 forward-0.080
Token's
Token ID338
Feature activation-1.87e+00

Pos logit contributions
 seeker+0.027
ained+0.023
 tired+0.021
 forward+0.021
ancer+0.021
Neg logit contributions
 instructions-0.016
So-0.014
 occasions-0.014
 slices-0.013
 brace-0.013
Token a
Token ID257
Feature activation-3.87e+00

Pos logit contributions
So+0.032
 instructions+0.031
 truth+0.030
 words+0.027
 itself+0.027
Neg logit contributions
 seeker-0.046
 slid-0.043
ancer-0.043
ahs-0.041
ashed-0.041
Token beautiful
Token ID4950
Feature activation-1.76e+00

Pos logit contributions
 instructions+0.077
So+0.070
 occasions+0.068
 recipe+0.067
 brace+0.066
Neg logit contributions
 seeker-0.091
ained-0.081
 tired-0.070
ancer-0.070
 slid-0.069
Token flower
Token ID15061
Feature activation-5.97e-01

Pos logit contributions
 instructions+0.029
 occasions+0.025
So+0.025
 slices+0.025
 recipe+0.024
Neg logit contributions
 seeker-0.048
ained-0.042
 tired-0.038
 forward-0.037
ancer-0.037
Token."
Token ID526
Feature activation-1.46e+00

Pos logit contributions
 instructions+0.010
So+0.009
 occasions+0.009
 slices+0.009
quet+0.009
Neg logit contributions
 seeker-0.015
ained-0.013
 tired-0.012
 forward-0.012
 slid-0.012
Token
Token ID220
Feature activation+1.57e+00

Pos logit contributions
 instructions+0.029
So+0.029
and+0.026
 occasions+0.025
 slices+0.025
Neg logit contributions
 seeker-0.037
ained-0.030
asc-0.026
 forward-0.025
ancer-0.025
Token\n
Token ID198
Feature activation-3.00e+00

Pos logit contributions
 seeker+0.036
ained+0.033
 tired+0.030
 F+0.029
 slid+0.028
Neg logit contributions
 instructions-0.030
 occasions-0.028
quet-0.027
 slices-0.027
 brace-0.026
Token was
Token ID373
Feature activation+2.46e+00

Pos logit contributions
 seeker+0.073
ained+0.060
asc+0.052
 forward+0.052
ancer+0.050
Neg logit contributions
So-0.075
 instructions-0.073
 distress-0.067
 occasions-0.067
 brace-0.066
Token walking
Token ID6155
Feature activation+1.78e+00

Pos logit contributions
 seeker+0.049
ancer+0.039
ained+0.037
asc+0.034
Whe+0.033
Neg logit contributions
So-0.051
 instructions-0.051
 distress-0.050
 brace-0.048
and-0.048
Token,
Token ID11
Feature activation+8.30e-01

Pos logit contributions
 seeker+0.042
ained+0.040
asc+0.037
ancer+0.034
ric+0.033
Neg logit contributions
 instructions-0.031
So-0.029
and-0.028
keeper-0.028
 keeper-0.027
Token he
Token ID339
Feature activation-3.96e-01

Pos logit contributions
asc+0.026
ained+0.026
 seeker+0.025
ashing+0.022
ash+0.022
Neg logit contributions
 instructions-0.009
 slices-0.008
 distress-0.006
So-0.006
and-0.006
Token heard
Token ID2982
Feature activation-1.72e+00

Pos logit contributions
So+0.006
 instructions+0.006
 distress+0.006
keeper+0.005
urious+0.005
Neg logit contributions
 seeker-0.010
ained-0.009
asc-0.009
ancer-0.008
 kidding-0.008
Token a
Token ID257
Feature activation-3.49e+00

Pos logit contributions
 instructions+0.037
So+0.034
 words+0.034
 slices+0.033
and+0.032
Neg logit contributions
 seeker-0.040
ained-0.033
ancer-0.029
asc-0.028
 tired-0.028
Token strange
Token ID6283
Feature activation-2.29e+00

Pos logit contributions
 instructions+0.061
keeping+0.061
 occasions+0.060
 slices+0.058
and+0.055
Neg logit contributions
 seeker-0.081
 tired-0.071
ained-0.066
 forward-0.065
 spl-0.065
Token noise
Token ID7838
Feature activation-1.15e+00

Pos logit contributions
 instructions+0.032
So+0.029
 occasions+0.028
 slices+0.027
quet+0.026
Neg logit contributions
 seeker-0.067
ained-0.058
 tired-0.055
 forward-0.053
 slid-0.052
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 instructions+0.021
So+0.019
 occasions+0.019
 slices+0.018
quet+0.018
Neg logit contributions
 seeker-0.029
ained-0.025
 tired-0.022
 forward-0.022
ancer-0.022
Token He
Token ID679
Feature activation-1.29e+00

Pos logit contributions
 instructions+0.017
So+0.016
and+0.015
 slices+0.014
 truth+0.014
Neg logit contributions
 seeker-0.029
ained-0.025
asc-0.022
 forward-0.022
ancer-0.021
Token looked
Token ID3114
Feature activation+3.33e+00

Pos logit contributions
So+0.026
 instructions+0.025
 distress+0.023
 brace+0.022
 words+0.022
Neg logit contributions
 seeker-0.032
ained-0.028
ancer-0.024
asc-0.023
 forward-0.023
Token mom
Token ID1995
Feature activation-1.57e+00

Pos logit contributions
 instructions+0.021
 occasions+0.021
quet+0.021
and+0.020
ymes+0.020
Neg logit contributions
 seeker-0.014
 tired-0.011
ained-0.010
 slid-0.009
 morning-0.008
Token answered
Token ID9373
Feature activation-2.32e+00

Pos logit contributions
 instructions+0.038
 occasions+0.036
keeping+0.035
and+0.034
 slices+0.034
Neg logit contributions
 seeker-0.029
 tired-0.023
 slid-0.023
ained-0.023
 spl-0.021
Token,
Token ID11
Feature activation+4.34e-02

Pos logit contributions
 instructions+0.060
So+0.057
 occasions+0.057
 slices+0.055
quet+0.054
Neg logit contributions
 seeker-0.039
ained-0.030
 tired-0.029
 forward-0.028
 slid-0.026
Token "
Token ID366
Feature activation-3.63e+00

Pos logit contributions
 seeker+0.001
ained+0.001
 tired+0.001
 forward+0.001
 slid+0.001
Neg logit contributions
 instructions-0.001
So-0.001
 occasions-0.001
 slices-0.001
quet-0.001
TokenYou
Token ID1639
Feature activation+2.20e+00

Pos logit contributions
 instructions+0.069
 occasions+0.064
 slices+0.063
 distress+0.063
 brace+0.062
Neg logit contributions
 seeker-0.082
ained-0.077
 tired-0.066
 slid-0.066
ancer-0.066
Token should
Token ID815
Feature activation-3.54e+00

Pos logit contributions
 seeker+0.052
ained+0.050
asc+0.043
 dusk+0.042
Whe+0.041
Neg logit contributions
 instructions-0.041
So-0.040
 truth-0.036
 distress-0.035
 note-0.035
Token tell
Token ID1560
Feature activation-3.12e+00

Pos logit contributions
 instructions+0.067
So+0.063
 occasions+0.060
 truth+0.056
 distress+0.055
Neg logit contributions
 seeker-0.084
ained-0.077
ancer-0.069
 tired-0.067
asc-0.067
Token Billy
Token ID15890
Feature activation-2.70e+00

Pos logit contributions
 instructions+0.061
So+0.056
 occasions+0.054
 slices+0.053
quet+0.052
Neg logit contributions
 seeker-0.076
ained-0.064
 tired-0.058
ancer-0.056
 slid-0.056
Token how
Token ID703
Feature activation-6.66e-01

Pos logit contributions
 instructions+0.058
So+0.053
 occasions+0.051
 slices+0.051
 recipe+0.050
Neg logit contributions
 seeker-0.061
ained-0.051
 tired-0.045
 slid-0.044
asc-0.044
Token you
Token ID345
Feature activation+2.59e+00

Pos logit contributions
 instructions+0.010
So+0.009
 occasions+0.008
and+0.008
 slices+0.008
Neg logit contributions
 seeker-0.020
ained-0.017
asc-0.015
ancer-0.015
 slid-0.015
Token feel
Token ID1254
Feature activation+1.06e+00

Pos logit contributions
 seeker+0.063
ained+0.057
asc+0.049
 forward+0.048
 dusk+0.047
Neg logit contributions
 instructions-0.046
So-0.044
 distress-0.041
 occasions-0.040
 truth-0.039
Token the
Token ID262
Feature activation+2.10e+00

Pos logit contributions
 seeker+0.022
ained+0.020
asc+0.018
ancer+0.018
 slid+0.016
Neg logit contributions
 instructions-0.019
 slices-0.016
 brace-0.016
 distress-0.016
So-0.015
Token door
Token ID3420
Feature activation+2.16e+00

Pos logit contributions
 seeker+0.052
ained+0.043
 tired+0.040
 forward+0.039
 slid+0.038
Neg logit contributions
 instructions-0.040
So-0.037
 occasions-0.037
 slices-0.035
quet-0.035
Token,
Token ID11
Feature activation+1.94e+00

Pos logit contributions
 seeker+0.038
ained+0.031
 tired+0.027
 forward+0.025
 slid+0.025
Neg logit contributions
 instructions-0.055
So-0.052
 occasions-0.051
 slices-0.050
quet-0.050
Token he
Token ID339
Feature activation-9.86e-02

Pos logit contributions
ained+0.049
 seeker+0.048
asc+0.044
ashing+0.041
ashed+0.039
Neg logit contributions
 instructions-0.035
 slices-0.032
keeper-0.031
 distress-0.029
So-0.029
Token saw
Token ID2497
Feature activation-7.67e-01

Pos logit contributions
 instructions+0.002
So+0.002
 slices+0.002
 occasions+0.002
 distress+0.002
Neg logit contributions
 seeker-0.002
ained-0.002
asc-0.001
 tired-0.001
ancer-0.001
Token a
Token ID257
Feature activation-3.37e+00

Pos logit contributions
 instructions+0.018
So+0.017
 slices+0.017
 occasions+0.016
 distress+0.016
Neg logit contributions
 seeker-0.016
ained-0.013
 tired-0.011
ancer-0.011
asc-0.011
Token big
Token ID1263
Feature activation-2.40e+00

Pos logit contributions
 instructions+0.065
So+0.059
 slices+0.056
 occasions+0.056
quet+0.055
Neg logit contributions
 seeker-0.082
ained-0.072
 tired-0.063
asc-0.062
 forward-0.062
Token dragon
Token ID10441
Feature activation-1.45e+00

Pos logit contributions
 instructions+0.043
 slices+0.040
 note+0.038
 distress+0.038
 brace+0.038
Neg logit contributions
ained-0.058
 seeker-0.058
ancer-0.049
asc-0.048
 slid-0.046
Token inside
Token ID2641
Feature activation+7.03e-01

Pos logit contributions
 instructions+0.030
So+0.028
 occasions+0.027
 slices+0.026
and+0.026
Neg logit contributions
 seeker-0.033
ained-0.027
 tired-0.025
 forward-0.024
 slid-0.024
Token!
Token ID0
Feature activation-8.10e-02

Pos logit contributions
 seeker+0.019
ained+0.016
 tired+0.015
ancer+0.015
 forward+0.015
Neg logit contributions
 instructions-0.012
So-0.010
 occasions-0.010
 slices-0.010
quet-0.010
Token The
Token ID383
Feature activation-9.76e-01

Pos logit contributions
 instructions+0.002
So+0.002
 slices+0.002
 occasions+0.002
 distress+0.002
Neg logit contributions
 seeker-0.002
ained-0.002
 tired-0.001
asc-0.001
ancer-0.001