TOP ACTIVATIONS
MAX = 0.140

started to pick up the seaw eed and put it in
he decided to remove the seaw eed from the water so
tentacles to reach for the seaw eed , and he wrapped
under a roof made of seaw eed . The shark couldn
home . He grabbed some seaw eed and started to build
Buzz y asked Grand p apa , " Can we go
provide you with some delicious seaw eed to eat ."
Tim didn 't like the seaw eed because it would get
She told him it was seaw eed . Despite the smell
examine it with the magn ifying glass ?" "
pushed and pulled on the seaw eed until finally , the
started to chew on some seaw eed . Mary was so
She also picked some green seaw eed to show her friends
it come out of the seaw eed and walk around the
eggs while mom prepared the pans and pre he ated the
her how to keep the stove clean . Lily thought it
happens when clouds get too full of rain ." Johnny said
found it stuck in some seaw eed . Tim was so
Joe a tasty piece of seaw eed , and Joe ate
told by her grand p apa that a modern hive was

MIN ACTIVATIONS
MIN = -0.135

available . THE <|endoftext|> Once upon a time
arm chair . THE <|endoftext|> Once upon a time
again tomorrow . THE <|endoftext|> Once upon a time
happy . THE <|endoftext|> Once upon a time
so happy . THE <|endoftext|> Once upon a time
delicious ! THE <|endoftext|> Elsa was playing chess
was . THE <|endoftext|> T ina was feeling
came back . THE <|endoftext|> Once upon a time
special talent . THE <|endoftext|> Once upon a time
. THE <|endoftext|> Once upon a time
called Zig zag . Zig zag was a very happy pup
face . THE <|endoftext|> Once upon a time
feel better . THE <|endoftext|> John and his mother
collecting . THE <|endoftext|> Once upon a time
together . The moral of the story is :
were all happy . The moral of the story is to
the big forest . The moral of the story is that
breakfast . The moral value of the story is
scary for everyone . The moral of the story is that
together . The moral of the story is :

INTERVAL 0.071 - 0.140
CONTAINS 0.247%

rubbed it to feel how soft it was . She broke
and thanked him for the warm welcome . Then he ran
did not care about the hole and the stain . He
little arms to reach the stones near the top . Jack
to because it was too deep . Ellie didn 't listen

INTERVAL 0.002 - 0.071
CONTAINS 47.038%

on , Bella , T ing and the dog were the
it ugly ?" Ben thought for a moment .
The farmer smiled . " I can help you ," he
very curious about the world . One day ,
" The dog is chasing us !" They ran

INTERVAL -0.066 - 0.002
CONTAINS 51.877%

she was disappointed . She explained to them that perfume was
said , " Let 's play shoot the ball . If
Max went to the park man and said , "
the click . He did not see the wind push it
of herself for trying something new . At the

INTERVAL -0.135 - -0.066
CONTAINS 0.838%

day on , Lily was more careful and never got her
very much . But the price was unknown .
very adventurous . One day , Lucy decided to
her ears because it was too loud . But then ,
and the teacher started the lesson . <|endoftext|> Josh and Jane
Token started
Token ID2067
Feature activation-1.42e-02

Pos logit contributions
 next+0.036
rav+0.035
 lesson+0.035
Good+0.034
S+0.033
Neg logit contributions
 shave-0.055
 anchor-0.052
 carve-0.051
isel-0.050
apa-0.050
Token to
Token ID284
Feature activation-1.75e-03

Pos logit contributions
 next+0.013
 lesson+0.012
rav+0.012
Good+0.011
S+0.011
Neg logit contributions
 shave-0.032
 anchor-0.031
apa-0.030
 carve-0.030
isel-0.030
Token pick
Token ID2298
Feature activation+5.93e-03

Pos logit contributions
 seek+0.002
 next+0.002
 lesson+0.002
 Sue+0.002
Good+0.002
Neg logit contributions
apa-0.003
on-0.003
asts-0.003
igh-0.003
isel-0.003
Token up
Token ID510
Feature activation-4.07e-03

Pos logit contributions
 shave+0.011
 anchor+0.011
 carve+0.010
apa+0.010
isel+0.010
Neg logit contributions
 next-0.007
rav-0.007
 lesson-0.007
Good-0.007
S-0.007
Token the
Token ID262
Feature activation+3.34e-02

Pos logit contributions
rav+0.006
ider+0.006
Good+0.006
 lesson+0.006
OK+0.005
Neg logit contributions
 shave-0.007
 anchor-0.007
 carve-0.006
apa-0.006
bowl-0.006
Token seaw
Token ID32085
Feature activation+1.40e-01

Pos logit contributions
 shave+0.048
apa+0.047
igh+0.045
pen+0.045
isel+0.045
Neg logit contributions
 next-0.058
 lesson-0.056
Good-0.054
rav-0.053
 seek-0.052
Tokeneed
Token ID2308
Feature activation+6.82e-02

Pos logit contributions
 shave+0.190
 anchor+0.176
igh+0.174
 carve+0.172
 harbour+0.166
Neg logit contributions
S-0.238
rav-0.237
 next-0.235
ider-0.235
 lesson-0.229
Token and
Token ID290
Feature activation+1.77e-02

Pos logit contributions
 shave+0.148
 anchor+0.141
 carve+0.138
apa+0.137
isel+0.137
Neg logit contributions
 next-0.064
 lesson-0.062
rav-0.061
Good-0.060
S-0.056
Token put
Token ID1234
Feature activation+1.88e-02

Pos logit contributions
 shave+0.028
 anchor+0.027
 carve+0.026
apa+0.025
bowl+0.025
Neg logit contributions
rav-0.028
Good-0.027
 lesson-0.027
 next-0.027
S-0.026
Token it
Token ID340
Feature activation+1.18e-02

Pos logit contributions
 shave+0.032
 anchor+0.031
 carve+0.029
bowl+0.029
apa+0.029
Neg logit contributions
rav-0.028
Good-0.026
 lesson-0.026
ider-0.025
 next-0.025
Token in
Token ID287
Feature activation+3.27e-02

Pos logit contributions
 shave+0.019
 anchor+0.019
bowl+0.018
 carve+0.018
apa+0.018
Neg logit contributions
rav-0.018
Good-0.017
 lesson-0.016
oys-0.015
S-0.015
Token he
Token ID339
Feature activation+5.24e-03

Pos logit contributions
rav+0.066
Good+0.063
 lesson+0.062
ider+0.062
bows+0.061
Neg logit contributions
 shave-0.042
 anchor-0.039
 carve-0.037
bowl-0.036
on-0.035
Token decided
Token ID3066
Feature activation-2.85e-02

Pos logit contributions
 shave+0.010
 anchor+0.010
 carve+0.009
apa+0.009
bowl+0.009
Neg logit contributions
rav-0.006
Good-0.006
 lesson-0.006
 next-0.006
S-0.006
Token to
Token ID284
Feature activation-1.75e-02

Pos logit contributions
rav+0.041
Good+0.038
bows+0.038
ider+0.037
 lesson+0.035
Neg logit contributions
 shave-0.051
 anchor-0.050
 carve-0.046
 harbour-0.043
pen-0.043
Token remove
Token ID4781
Feature activation+1.23e-02

Pos logit contributions
 next+0.023
 lesson+0.023
 seek+0.023
OK+0.021
 Sue+0.021
Neg logit contributions
apa-0.029
 frying-0.028
vice-0.028
 boiling-0.028
ann-0.028
Token the
Token ID262
Feature activation+2.71e-02

Pos logit contributions
 shave+0.024
apa+0.023
 carve+0.023
 frying+0.022
bowl+0.022
Neg logit contributions
 next-0.014
 lesson-0.014
Good-0.013
S-0.013
 Sue-0.012
Token seaw
Token ID32085
Feature activation+1.38e-01

Pos logit contributions
apa+0.042
igh+0.041
pen+0.039
 shave+0.038
isel+0.038
Neg logit contributions
 next-0.050
 lesson-0.046
rav-0.041
Good-0.041
OK-0.039
Tokeneed
Token ID2308
Feature activation+6.96e-02

Pos logit contributions
igh+0.204
neath+0.201
 shave+0.200
 Junior+0.192
 harbour+0.191
Neg logit contributions
S-0.206
 next-0.206
ider-0.204
rav-0.203
oys-0.199
Token from
Token ID422
Feature activation+4.33e-02

Pos logit contributions
 shave+0.158
apa+0.149
isel+0.149
igh+0.148
acket+0.147
Neg logit contributions
 next-0.071
 lesson-0.061
S-0.048
oys-0.047
 seek-0.045
Token the
Token ID262
Feature activation+2.05e-02

Pos logit contributions
 shave+0.063
 anchor+0.061
 carve+0.058
isel+0.055
bowl+0.055
Neg logit contributions
rav-0.074
Good-0.071
 lesson-0.070
 next-0.069
S-0.068
Token water
Token ID1660
Feature activation+5.43e-02

Pos logit contributions
igh+0.030
apa+0.030
 shave+0.029
pen+0.029
ann+0.028
Neg logit contributions
 next-0.039
 lesson-0.034
Good-0.032
OK-0.032
rav-0.031
Token so
Token ID523
Feature activation+3.42e-02

Pos logit contributions
 shave+0.126
apa+0.116
isel+0.116
 carve+0.114
pen+0.113
Neg logit contributions
 next-0.053
 lesson-0.043
oys-0.038
S-0.038
ider-0.034
Token tentacles
Token ID41192
Feature activation+4.94e-02

Pos logit contributions
rav+0.029
Good+0.029
 next+0.028
S+0.028
 lesson+0.028
Neg logit contributions
 shave-0.028
 anchor-0.027
 carve-0.026
apa-0.026
bowl-0.025
Token to
Token ID284
Feature activation-9.23e-03

Pos logit contributions
 shave+0.101
 anchor+0.099
 carve+0.095
apa+0.095
bowl+0.094
Neg logit contributions
rav-0.051
Good-0.050
 lesson-0.049
 next-0.048
S-0.046
Token reach
Token ID3151
Feature activation+3.51e-02

Pos logit contributions
rav+0.015
Good+0.015
 next+0.015
 lesson+0.015
S+0.014
Neg logit contributions
 shave-0.014
 anchor-0.013
 carve-0.012
apa-0.012
bowl-0.012
Token for
Token ID329
Feature activation+7.34e-03

Pos logit contributions
 shave+0.067
 anchor+0.064
 carve+0.062
apa+0.062
isel+0.062
Neg logit contributions
 next-0.042
 lesson-0.041
rav-0.040
Good-0.039
S-0.037
Token the
Token ID262
Feature activation+2.40e-02

Pos logit contributions
 shave+0.012
 anchor+0.012
 carve+0.011
apa+0.011
bowl+0.011
Neg logit contributions
rav-0.011
Good-0.010
 lesson-0.010
 next-0.010
S-0.010
Token seaw
Token ID32085
Feature activation+1.37e-01

Pos logit contributions
 shave+0.024
 anchor+0.023
 carve+0.021
apa+0.021
bowl+0.020
Neg logit contributions
rav-0.050
 next-0.050
 lesson-0.050
Good-0.049
S-0.048
Tokeneed
Token ID2308
Feature activation+7.02e-02

Pos logit contributions
 shave+0.188
 anchor+0.174
 carve+0.171
igh+0.170
neath+0.167
Neg logit contributions
rav-0.229
S-0.228
 next-0.225
ider-0.225
 lesson-0.224
Token,
Token ID11
Feature activation+6.41e-03

Pos logit contributions
 shave+0.135
 anchor+0.131
 carve+0.126
apa+0.125
bowl+0.125
Neg logit contributions
rav-0.081
Good-0.080
 lesson-0.079
 next-0.079
S-0.074
Token and
Token ID290
Feature activation-3.89e-03

Pos logit contributions
 shave+0.013
 anchor+0.012
 carve+0.012
bowl+0.012
apa+0.012
Neg logit contributions
rav-0.008
Good-0.007
 lesson-0.007
 next-0.007
S-0.007
Token he
Token ID339
Feature activation+2.19e-02

Pos logit contributions
rav+0.006
Good+0.006
ider+0.006
bows+0.006
S+0.006
Neg logit contributions
 anchor-0.006
 shave-0.006
 carve-0.006
bowl-0.006
apa-0.006
Token wrapped
Token ID12908
Feature activation-1.74e-02

Pos logit contributions
 shave+0.039
 anchor+0.038
 carve+0.037
apa+0.037
bowl+0.036
Neg logit contributions
rav-0.028
Good-0.028
 lesson-0.027
 next-0.027
S-0.026
Token under
Token ID739
Feature activation+1.77e-02

Pos logit contributions
 shave+0.037
 anchor+0.037
bowl+0.034
apa+0.034
 carve+0.034
Neg logit contributions
rav-0.030
Good-0.029
 lesson-0.029
 next-0.028
bows-0.027
Token a
Token ID257
Feature activation-4.86e-03

Pos logit contributions
 shave+0.037
apa+0.035
 carve+0.034
bowl+0.034
 frying+0.034
Neg logit contributions
 next-0.020
 lesson-0.018
Good-0.017
 Sue-0.017
S-0.017
Token roof
Token ID9753
Feature activation-2.55e-03

Pos logit contributions
 lesson+0.004
 next+0.004
 lady+0.004
 fast+0.004
S+0.004
Neg logit contributions
igh-0.011
 shave-0.011
apa-0.011
isel-0.010
pen-0.010
Token made
Token ID925
Feature activation+3.37e-02

Pos logit contributions
 next+0.003
 lesson+0.003
S+0.003
Good+0.003
rav+0.002
Neg logit contributions
 shave-0.005
 anchor-0.005
apa-0.005
 carve-0.005
isel-0.005
Token of
Token ID286
Feature activation+2.53e-02

Pos logit contributions
 shave+0.057
 anchor+0.053
 carve+0.052
bowl+0.051
 frying+0.051
Neg logit contributions
 next-0.051
 lesson-0.049
Good-0.047
rav-0.046
S-0.045
Token seaw
Token ID32085
Feature activation+1.33e-01

Pos logit contributions
apa+0.046
 shave+0.045
pen+0.044
acket+0.044
igh+0.044
Neg logit contributions
 next-0.031
 lesson-0.030
 Sue-0.028
 songs-0.027
Good-0.027
Tokeneed
Token ID2308
Feature activation+7.26e-02

Pos logit contributions
neath+0.228
 shave+0.216
igh+0.212
 Jason+0.204
 anchor+0.201
Neg logit contributions
S-0.189
ider-0.185
oys-0.172
rav-0.168
 next-0.166
Token.
Token ID13
Feature activation-3.54e-03

Pos logit contributions
 shave+0.150
apa+0.142
isel+0.141
 carve+0.140
 anchor+0.139
Neg logit contributions
 next-0.082
 lesson-0.072
rav-0.069
ider-0.067
S-0.066
Token The
Token ID383
Feature activation-1.73e-02

Pos logit contributions
 next+0.004
Good+0.004
 lesson+0.004
S+0.004
rav+0.004
Neg logit contributions
 shave-0.007
 anchor-0.007
igh-0.007
isel-0.007
 carve-0.007
Token shark
Token ID21027
Feature activation+7.89e-04

Pos logit contributions
 next+0.036
 lesson+0.035
Good+0.034
rav+0.034
 seek+0.033
Neg logit contributions
 shave-0.020
apa-0.018
pen-0.017
 carve-0.017
isel-0.017
Token couldn
Token ID3521
Feature activation+3.45e-03

Pos logit contributions
 shave+0.001
 anchor+0.001
bowl+0.001
 carve+0.001
apa+0.001
Neg logit contributions
rav-0.001
Good-0.001
 lesson-0.001
 next-0.001
S-0.001
Token home
Token ID1363
Feature activation-4.85e-02

Pos logit contributions
 lesson+0.017
 next+0.016
rav+0.016
Good+0.016
S+0.015
Neg logit contributions
 shave-0.020
 anchor-0.019
apa-0.018
 carve-0.018
pen-0.018
Token.
Token ID13
Feature activation+7.24e-03

Pos logit contributions
rav+0.073
Good+0.072
 lesson+0.067
S+0.066
OK+0.066
Neg logit contributions
 shave-0.080
 anchor-0.079
 carve-0.074
bowl-0.074
apa-0.074
Token He
Token ID679
Feature activation+1.63e-02

Pos logit contributions
 shave+0.011
 anchor+0.010
apa+0.010
 carve+0.010
bowl+0.010
Neg logit contributions
rav-0.012
 lesson-0.011
Good-0.011
 next-0.011
S-0.011
Token grabbed
Token ID13176
Feature activation+6.22e-03

Pos logit contributions
 shave+0.027
 anchor+0.027
apa+0.026
 carve+0.025
bowl+0.025
Neg logit contributions
rav-0.025
Good-0.023
 lesson-0.022
bows-0.022
 next-0.022
Token some
Token ID617
Feature activation-4.00e-02

Pos logit contributions
 shave+0.012
 anchor+0.011
 carve+0.011
bowl+0.011
apa+0.011
Neg logit contributions
 next-0.008
 lesson-0.008
Good-0.007
rav-0.007
S-0.007
Token seaw
Token ID32085
Feature activation+1.30e-01

Pos logit contributions
 next+0.060
 lesson+0.059
Good+0.054
S+0.053
 songs+0.052
Neg logit contributions
 shave-0.064
isel-0.061
 harbour-0.060
apa-0.060
 carve-0.060
Tokeneed
Token ID2308
Feature activation+5.98e-02

Pos logit contributions
 shave+0.177
 anchor+0.166
 harbour+0.162
 carve+0.158
 boiling+0.153
Neg logit contributions
S-0.219
rav-0.216
ider-0.214
 next-0.213
 lesson-0.211
Token and
Token ID290
Feature activation-5.59e-04

Pos logit contributions
 shave+0.132
 anchor+0.127
 carve+0.123
apa+0.122
isel+0.121
Neg logit contributions
 next-0.053
rav-0.053
 lesson-0.052
Good-0.051
S-0.047
Token started
Token ID2067
Feature activation-1.59e-02

Pos logit contributions
rav+0.001
Good+0.001
bows+0.001
ider+0.001
 lesson+0.001
Neg logit contributions
 shave-0.001
 anchor-0.001
 carve-0.001
apa-0.001
bowl-0.001
Token to
Token ID284
Feature activation-3.97e-03

Pos logit contributions
 next+0.014
 lesson+0.012
rav+0.011
Good+0.011
S+0.010
Neg logit contributions
 shave-0.036
 anchor-0.035
apa-0.034
isel-0.034
igh-0.034
Token build
Token ID1382
Feature activation+1.44e-03

Pos logit contributions
 seek+0.006
 next+0.006
 lesson+0.005
Good+0.005
 Sue+0.005
Neg logit contributions
apa-0.006
ete-0.006
ime-0.006
vice-0.006
isel-0.006
Token Buzz
Token ID15753
Feature activation+1.16e-02

Pos logit contributions
 anchor+0.000
 carve+0.000
 shave+0.000
apa+0.000
 boiling+0.000
Neg logit contributions
rav-0.000
ider-0.000
oys-0.000
bows-0.000
S-0.000
Tokeny
Token ID88
Feature activation+2.60e-02

Pos logit contributions
bowl+0.019
apa+0.019
 shave+0.019
 anchor+0.018
pen+0.018
Neg logit contributions
Good-0.017
rav-0.017
 lesson-0.017
ider-0.015
OK-0.015
Token asked
Token ID1965
Feature activation+3.65e-02

Pos logit contributions
 shave+0.041
apa+0.040
 anchor+0.039
 carve+0.039
pen+0.039
Neg logit contributions
rav-0.040
Good-0.039
 lesson-0.037
oys-0.037
S-0.036
Token Grand
Token ID5675
Feature activation+8.43e-02

Pos logit contributions
 shave+0.068
 anchor+0.066
apa+0.065
 carve+0.063
bowl+0.062
Neg logit contributions
rav-0.046
Good-0.045
 lesson-0.043
ider-0.042
bows-0.042
Tokenp
Token ID79
Feature activation-8.05e-04

Pos logit contributions
 shave+0.136
isel+0.130
 carve+0.127
 anchor+0.127
bowl+0.124
Neg logit contributions
 next-0.137
 lesson-0.123
Good-0.123
S-0.122
rav-0.121
Tokenapa
Token ID32678
Feature activation+1.30e-01

Pos logit contributions
Good+0.001
rav+0.001
 lesson+0.001
 next+0.001
OK+0.001
Neg logit contributions
 shave-0.001
apa-0.001
 anchor-0.001
bowl-0.001
 carve-0.001
Token,
Token ID11
Feature activation+2.16e-02

Pos logit contributions
 shave+0.223
 anchor+0.215
apa+0.211
bowl+0.206
 carve+0.204
Neg logit contributions
Good-0.179
rav-0.178
 lesson-0.175
S-0.167
 next-0.166
Token "
Token ID366
Feature activation-1.76e-03

Pos logit contributions
 shave+0.042
 anchor+0.042
apa+0.040
 boiling+0.040
 carve+0.039
Neg logit contributions
rav-0.025
bows-0.024
 lesson-0.024
Good-0.023
OK-0.021
TokenCan
Token ID6090
Feature activation+1.14e-02

Pos logit contributions
 lesson+0.002
rav+0.002
bows+0.002
ses+0.002
sed+0.002
Neg logit contributions
 shave-0.003
pen-0.003
 anchor-0.003
bowl-0.003
Son-0.003
Token we
Token ID356
Feature activation+3.92e-02

Pos logit contributions
 shave+0.021
 anchor+0.020
apa+0.019
 carve+0.019
ete+0.018
Neg logit contributions
rav-0.014
 lesson-0.014
Good-0.014
bows-0.014
S-0.013
Token go
Token ID467
Feature activation+4.94e-03

Pos logit contributions
 shave+0.067
 anchor+0.065
apa+0.063
isel+0.063
bowl+0.062
Neg logit contributions
 next-0.054
rav-0.051
Good-0.051
 lesson-0.051
 seek-0.050
Token provide
Token ID2148
Feature activation+1.62e-02

Pos logit contributions
 next+0.007
 lesson+0.006
Good+0.006
 seek+0.006
rav+0.006
Neg logit contributions
 shave-0.010
isel-0.010
 anchor-0.010
apa-0.010
vice-0.010
Token you
Token ID345
Feature activation-4.01e-03

Pos logit contributions
 shave+0.031
 anchor+0.030
 carve+0.029
apa+0.028
bowl+0.028
Neg logit contributions
rav-0.020
Good-0.019
 lesson-0.019
 next-0.019
S-0.018
Token with
Token ID351
Feature activation+9.96e-03

Pos logit contributions
rav+0.005
Good+0.004
 lesson+0.004
oys+0.004
OK+0.004
Neg logit contributions
 shave-0.008
 anchor-0.008
bowl-0.007
 carve-0.007
apa-0.007
Token some
Token ID617
Feature activation-3.16e-02

Pos logit contributions
 shave+0.018
 anchor+0.018
 carve+0.017
apa+0.017
bowl+0.017
Neg logit contributions
rav-0.013
Good-0.012
 lesson-0.012
 next-0.012
S-0.012
Token delicious
Token ID12625
Feature activation+4.42e-02

Pos logit contributions
 next+0.042
 lesson+0.042
Good+0.039
rav+0.039
S+0.037
Neg logit contributions
 shave-0.056
 anchor-0.053
isel-0.053
apa-0.052
 carve-0.052
Token seaw
Token ID32085
Feature activation+1.30e-01

Pos logit contributions
 shave+0.080
apa+0.074
pen+0.074
isel+0.074
 harbour+0.073
Neg logit contributions
 next-0.060
 lesson-0.058
S-0.053
rav-0.052
Good-0.052
Tokeneed
Token ID2308
Feature activation+6.90e-02

Pos logit contributions
 shave+0.168
 anchor+0.157
 carve+0.150
 harbour+0.149
 boiling+0.144
Neg logit contributions
rav-0.228
 next-0.224
S-0.224
 lesson-0.224
ider-0.220
Token to
Token ID284
Feature activation+1.56e-03

Pos logit contributions
 anchor+0.124
 shave+0.122
bowl+0.120
apa+0.117
 carve+0.116
Neg logit contributions
Good-0.092
rav-0.090
 lesson-0.082
OK-0.080
S-0.079
Token eat
Token ID4483
Feature activation+3.06e-02

Pos logit contributions
isel+0.002
 shave+0.002
apa+0.002
 anchor+0.002
pen+0.002
Neg logit contributions
 next-0.002
 seek-0.002
 lesson-0.002
Good-0.002
S-0.002
Token."
Token ID526
Feature activation-1.39e-02

Pos logit contributions
 shave+0.059
 anchor+0.057
 carve+0.055
apa+0.055
bowl+0.055
Neg logit contributions
rav-0.036
Good-0.035
 lesson-0.034
 next-0.034
S-0.032
Token
Token ID220
Feature activation+1.98e-02

Pos logit contributions
 next+0.017
Good+0.017
rav+0.017
S+0.017
 lesson+0.017
Neg logit contributions
 shave-0.026
 anchor-0.025
isel-0.024
 carve-0.023
pen-0.023
Token Tim
Token ID5045
Feature activation-6.34e-03

Pos logit contributions
 next+0.010
 lesson+0.010
Good+0.009
rav+0.009
 Sue+0.009
Neg logit contributions
 shave-0.010
 anchor-0.010
pen-0.010
 carve-0.010
apa-0.009
Token didn
Token ID1422
Feature activation-2.96e-02

Pos logit contributions
 next+0.006
 lesson+0.006
Good+0.006
rav+0.005
S+0.005
Neg logit contributions
 shave-0.014
 anchor-0.013
isel-0.013
 carve-0.013
apa-0.013
Token't
Token ID470
Feature activation-5.40e-02

Pos logit contributions
 next+0.024
 lesson+0.024
Good+0.022
rav+0.020
S+0.020
Neg logit contributions
 anchor-0.069
 shave-0.068
bowl-0.067
apa-0.066
isel-0.066
Token like
Token ID588
Feature activation+2.62e-02

Pos logit contributions
rav+0.074
 next+0.072
Good+0.072
 lesson+0.070
ider+0.068
Neg logit contributions
 shave-0.096
 anchor-0.091
 carve-0.089
bowl-0.087
isel-0.086
Token the
Token ID262
Feature activation+2.30e-02

Pos logit contributions
apa+0.053
 shave+0.052
bowl+0.050
pen+0.049
 carve+0.049
Neg logit contributions
 next-0.030
 lesson-0.028
 Sue-0.024
 songs-0.024
Good-0.024
Token seaw
Token ID32085
Feature activation+1.29e-01

Pos logit contributions
apa+0.040
igh+0.039
pen+0.037
 shave+0.035
isel+0.035
Neg logit contributions
 lesson-0.037
 next-0.037
 moment-0.031
OK-0.031
Good-0.030
Tokeneed
Token ID2308
Feature activation+6.67e-02

Pos logit contributions
igh+0.179
 shave+0.177
neath+0.175
apa+0.175
 harbour+0.175
Neg logit contributions
rav-0.206
S-0.203
ider-0.200
 next-0.198
 lesson-0.196
Token because
Token ID780
Feature activation+9.68e-03

Pos logit contributions
 shave+0.121
 anchor+0.116
apa+0.114
 carve+0.113
isel+0.112
Neg logit contributions
 next-0.084
 lesson-0.083
rav-0.082
Good-0.081
S-0.076
Token it
Token ID340
Feature activation-9.97e-03

Pos logit contributions
 shave+0.016
 carve+0.015
apa+0.015
pen+0.015
 anchor+0.015
Neg logit contributions
 next-0.015
 lesson-0.014
Good-0.014
rav-0.013
S-0.013
Token would
Token ID561
Feature activation-3.31e-02

Pos logit contributions
 next+0.006
 lesson+0.005
Good+0.004
 seek+0.004
fl+0.004
Neg logit contributions
apa-0.025
isel-0.025
 shave-0.025
 harbour-0.024
 anchor-0.024
Token get
Token ID651
Feature activation-1.33e-02

Pos logit contributions
 next+0.045
 lesson+0.045
 seek+0.042
Good+0.042
rav+0.039
Neg logit contributions
apa-0.059
pen-0.056
igh-0.056
 shave-0.056
isel-0.055
Token She
Token ID1375
Feature activation+1.28e-02

Pos logit contributions
 shave+0.017
 anchor+0.016
 carve+0.015
apa+0.015
bowl+0.015
Neg logit contributions
rav-0.022
Good-0.022
 lesson-0.022
 next-0.022
S-0.021
Token told
Token ID1297
Feature activation-1.20e-02

Pos logit contributions
 shave+0.020
 anchor+0.020
apa+0.019
 carve+0.019
bowl+0.018
Neg logit contributions
rav-0.020
Good-0.019
bows-0.019
 lesson-0.019
ider-0.018
Token him
Token ID683
Feature activation+2.19e-03

Pos logit contributions
 next+0.023
 lesson+0.022
 Sue+0.021
Good+0.021
 herself+0.020
Neg logit contributions
bowl-0.014
 shave-0.014
 anchor-0.013
 frying-0.013
 carve-0.013
Token it
Token ID340
Feature activation-1.06e-02

Pos logit contributions
 shave+0.004
 anchor+0.003
apa+0.003
 carve+0.003
bowl+0.003
Neg logit contributions
rav-0.003
Good-0.003
 next-0.003
S-0.003
 lesson-0.003
Token was
Token ID373
Feature activation+1.63e-02

Pos logit contributions
rav+0.014
Good+0.014
S+0.014
ider+0.013
OK+0.012
Neg logit contributions
 shave-0.019
 anchor-0.019
apa-0.018
bowl-0.018
 carve-0.017
Token seaw
Token ID32085
Feature activation+1.29e-01

Pos logit contributions
 shave+0.030
 anchor+0.030
apa+0.028
 carve+0.028
 boiling+0.028
Neg logit contributions
rav-0.022
Good-0.021
ider-0.019
OK-0.019
bows-0.019
Tokeneed
Token ID2308
Feature activation+5.74e-02

Pos logit contributions
 shave+0.171
 anchor+0.166
 harbour+0.163
 carve+0.162
 boiling+0.154
Neg logit contributions
ider-0.215
rav-0.211
S-0.211
oys-0.210
 next-0.207
Token.
Token ID13
Feature activation+2.77e-03

Pos logit contributions
 shave+0.101
 anchor+0.099
bowl+0.094
apa+0.094
 carve+0.093
Neg logit contributions
rav-0.077
Good-0.077
 lesson-0.073
 next-0.071
S-0.071
Token Despite
Token ID7945
Feature activation+7.39e-03

Pos logit contributions
 anchor+0.004
 shave+0.004
 carve+0.004
isel+0.004
 iceberg+0.004
Neg logit contributions
 next-0.004
Good-0.004
 lesson-0.004
rav-0.004
S-0.004
Token the
Token ID262
Feature activation+2.64e-02

Pos logit contributions
 shave+0.012
 anchor+0.012
 carve+0.011
apa+0.011
bowl+0.011
Neg logit contributions
rav-0.011
Good-0.010
 lesson-0.010
 next-0.010
S-0.010
Token smell
Token ID8508
Feature activation+3.10e-02

Pos logit contributions
 shave+0.041
 anchor+0.038
apa+0.037
 carve+0.037
pen+0.037
Neg logit contributions
 next-0.042
 lesson-0.042
rav-0.040
Good-0.040
S-0.038
Token examine
Token ID10716
Feature activation+2.94e-02

Pos logit contributions
 anchor+0.071
 shave+0.071
apa+0.071
 iceberg+0.069
bowl+0.069
Neg logit contributions
 next-0.062
 seek-0.058
Good-0.058
rav-0.058
 lesson-0.057
Token it
Token ID340
Feature activation+2.74e-02

Pos logit contributions
 shave+0.051
 anchor+0.049
 carve+0.047
apa+0.046
bowl+0.046
Neg logit contributions
rav-0.040
Good-0.039
 lesson-0.039
 next-0.039
S-0.037
Token with
Token ID351
Feature activation+3.05e-02

Pos logit contributions
 shave+0.038
 anchor+0.036
 carve+0.034
bowl+0.033
pen+0.033
Neg logit contributions
rav-0.050
Good-0.048
 lesson-0.047
oys-0.046
S-0.045
Token the
Token ID262
Feature activation+4.31e-02

Pos logit contributions
 shave+0.056
 anchor+0.053
apa+0.052
bowl+0.052
 carve+0.052
Neg logit contributions
 next-0.038
rav-0.037
 lesson-0.037
Good-0.037
S-0.035
Token magn
Token ID7842
Feature activation+9.66e-02

Pos logit contributions
 shave+0.070
 anchor+0.067
 carve+0.063
bowl+0.059
apa+0.059
Neg logit contributions
rav-0.068
Good-0.065
S-0.063
 lesson-0.062
 next-0.061
Tokenifying
Token ID4035
Feature activation+1.27e-01

Pos logit contributions
 shave+0.096
 anchor+0.086
 carve+0.081
apa+0.081
isel+0.078
Neg logit contributions
 next-0.204
rav-0.203
ider-0.197
Good-0.196
 lesson-0.195
Token glass
Token ID5405
Feature activation+4.34e-02

Pos logit contributions
 shave+0.161
 anchor+0.153
apa+0.146
isel+0.144
 carve+0.140
Neg logit contributions
 next-0.234
 lesson-0.234
Good-0.231
S-0.227
rav-0.226
Token?"
Token ID1701
Feature activation-1.82e-03

Pos logit contributions
 anchor+0.056
 shave+0.054
bowl+0.051
 carve+0.050
on+0.049
Neg logit contributions
rav-0.082
Good-0.080
 lesson-0.074
K-0.073
OK-0.072
Token
Token ID198
Feature activation+2.52e-02

Pos logit contributions
rav+0.003
bows+0.002
Good+0.002
 lesson+0.002
ider+0.002
Neg logit contributions
 shave-0.003
 anchor-0.003
bowl-0.003
 carve-0.003
apa-0.003
Token
Token ID198
Feature activation+1.94e-02

Pos logit contributions
 shave+0.043
 anchor+0.041
apa+0.040
 carve+0.040
bowl+0.040
Neg logit contributions
rav-0.034
 lesson-0.034
Good-0.033
 next-0.033
bows-0.032
Token"
Token ID1
Feature activation-4.78e-02

Pos logit contributions
bowl+0.036
 shave+0.035
apa+0.035
ete+0.033
 anchor+0.033
Neg logit contributions
bows-0.025
ses-0.024
 lesson-0.023
rav-0.023
sed-0.022
Token pushed
Token ID7121
Feature activation-9.61e-03

Pos logit contributions
 shave+0.008
 anchor+0.008
bowl+0.008
 carve+0.008
apa+0.008
Neg logit contributions
rav-0.005
Good-0.005
 lesson-0.005
 next-0.005
S-0.005
Token and
Token ID290
Feature activation-1.45e-04

Pos logit contributions
rav+0.014
Good+0.013
 lesson+0.013
 next+0.013
ider+0.013
Neg logit contributions
 shave-0.016
 anchor-0.016
apa-0.015
 carve-0.015
bowl-0.015
Token pulled
Token ID5954
Feature activation+1.76e-02

Pos logit contributions
rav+0.000
ider+0.000
bows+0.000
 lesson+0.000
Good+0.000
Neg logit contributions
 anchor-0.000
 shave-0.000
bowl-0.000
 carve-0.000
apa-0.000
Token on
Token ID319
Feature activation+1.12e-02

Pos logit contributions
 shave+0.037
 anchor+0.035
 carve+0.035
bowl+0.034
apa+0.034
Neg logit contributions
 next-0.019
 lesson-0.018
Good-0.018
rav-0.018
S-0.016
Token the
Token ID262
Feature activation+1.31e-02

Pos logit contributions
 shave+0.021
 anchor+0.020
 carve+0.020
apa+0.020
bowl+0.019
Neg logit contributions
 next-0.014
 lesson-0.014
Good-0.013
rav-0.013
S-0.012
Token seaw
Token ID32085
Feature activation+1.27e-01

Pos logit contributions
 shave+0.018
apa+0.017
isel+0.016
 carve+0.016
pen+0.016
Neg logit contributions
 next-0.023
 lesson-0.023
Good-0.022
rav-0.022
 seek-0.021
Tokeneed
Token ID2308
Feature activation+6.09e-02

Pos logit contributions
 shave+0.182
igh+0.169
 anchor+0.167
 carve+0.165
 boiling+0.163
Neg logit contributions
S-0.205
ider-0.202
rav-0.200
 lesson-0.197
 next-0.194
Token until
Token ID1566
Feature activation+2.08e-02

Pos logit contributions
 shave+0.127
 anchor+0.118
 carve+0.118
isel+0.117
apa+0.116
Neg logit contributions
 next-0.066
 lesson-0.062
rav-0.057
Good-0.054
S-0.053
Token finally
Token ID3443
Feature activation-6.30e-02

Pos logit contributions
 shave+0.039
 anchor+0.037
 carve+0.036
bowl+0.036
apa+0.036
Neg logit contributions
 next-0.026
 lesson-0.025
Good-0.025
rav-0.025
S-0.023
Token,
Token ID11
Feature activation-8.97e-03

Pos logit contributions
rav+0.080
 next+0.080
 lesson+0.079
Good+0.078
S+0.074
Neg logit contributions
 shave-0.118
 anchor-0.109
 carve-0.107
bowl-0.106
apa-0.105
Token the
Token ID262
Feature activation-1.22e-02

Pos logit contributions
rav+0.013
Good+0.013
 lesson+0.013
 next+0.012
ider+0.012
Neg logit contributions
 shave-0.015
 anchor-0.015
 carve-0.014
apa-0.014
bowl-0.013
Token started
Token ID2067
Feature activation-1.05e-02

Pos logit contributions
rav+0.000
 lesson+0.000
Good+0.000
 next+0.000
ider+0.000
Neg logit contributions
 shave-0.000
 anchor-0.000
 carve-0.000
apa-0.000
bowl-0.000
Token to
Token ID284
Feature activation+1.86e-03

Pos logit contributions
 next+0.009
 lesson+0.008
rav+0.008
Good+0.007
S+0.007
Neg logit contributions
 shave-0.024
 anchor-0.023
apa-0.023
isel-0.023
igh-0.023
Token chew
Token ID34722
Feature activation+3.67e-02

Pos logit contributions
apa+0.003
isel+0.003
ete+0.003
pen+0.003
ime+0.003
Neg logit contributions
 next-0.003
 seek-0.003
Good-0.002
 lesson-0.002
fl-0.002
Token on
Token ID319
Feature activation+1.74e-02

Pos logit contributions
 shave+0.056
 anchor+0.055
on+0.054
apa+0.053
bowl+0.053
Neg logit contributions
rav-0.059
ider-0.054
Good-0.053
oys-0.052
bows-0.051
Token some
Token ID617
Feature activation-3.49e-02

Pos logit contributions
 shave+0.031
 anchor+0.029
 carve+0.028
apa+0.028
bowl+0.028
Neg logit contributions
rav-0.023
Good-0.023
 lesson-0.023
 next-0.023
S-0.021
Token seaw
Token ID32085
Feature activation+1.27e-01

Pos logit contributions
 next+0.047
 lesson+0.043
S+0.037
 songs+0.037
Good+0.036
Neg logit contributions
 harbour-0.069
 shave-0.066
igh-0.066
apa-0.065
 workshop-0.064
Tokeneed
Token ID2308
Feature activation+6.03e-02

Pos logit contributions
 shave+0.186
neath+0.179
 anchor+0.176
 harbour+0.175
igh+0.174
Neg logit contributions
rav-0.198
S-0.198
ider-0.193
oys-0.192
 lesson-0.191
Token.
Token ID13
Feature activation-4.73e-03

Pos logit contributions
 shave+0.114
 anchor+0.109
 carve+0.106
apa+0.105
isel+0.105
Neg logit contributions
 next-0.072
 lesson-0.071
rav-0.070
Good-0.069
S-0.066
Token Mary
Token ID5335
Feature activation-1.89e-02

Pos logit contributions
rav+0.006
 next+0.006
Good+0.006
 lesson+0.006
ider+0.006
Neg logit contributions
 shave-0.008
apa-0.008
 carve-0.008
 anchor-0.008
isel-0.008
Token was
Token ID373
Feature activation-2.03e-03

Pos logit contributions
 next+0.017
 lesson+0.015
Good+0.014
rav+0.014
S+0.013
Neg logit contributions
 shave-0.042
 anchor-0.041
isel-0.040
 carve-0.039
 harbour-0.039
Token so
Token ID523
Feature activation+3.40e-02

Pos logit contributions
 next+0.003
 lesson+0.003
rav+0.002
Good+0.002
S+0.002
Neg logit contributions
 shave-0.004
 anchor-0.004
apa-0.003
 carve-0.003
isel-0.003
Token She
Token ID1375
Feature activation+3.19e-03

Pos logit contributions
 shave+0.000
apa+0.000
 carve+0.000
 anchor+0.000
bowl+0.000
Neg logit contributions
rav-0.000
 lesson-0.000
Good-0.000
 next-0.000
 seek-0.000
Token also
Token ID635
Feature activation-2.61e-02

Pos logit contributions
 shave+0.006
 anchor+0.006
 carve+0.006
apa+0.006
bowl+0.006
Neg logit contributions
rav-0.004
 next-0.004
 lesson-0.004
Good-0.004
S-0.003
Token picked
Token ID6497
Feature activation-7.67e-03

Pos logit contributions
rav+0.037
Good+0.036
 lesson+0.036
 next+0.035
ider+0.034
Neg logit contributions
 shave-0.045
 anchor-0.044
 carve-0.042
bowl-0.041
apa-0.041
Token some
Token ID617
Feature activation-2.97e-02

Pos logit contributions
ider+0.012
rav+0.011
Good+0.011
 lesson+0.011
OK+0.011
Neg logit contributions
 anchor-0.012
 shave-0.012
bowl-0.012
 carve-0.011
pen-0.011
Token green
Token ID4077
Feature activation+3.40e-02

Pos logit contributions
 next+0.040
 lesson+0.039
Good+0.036
rav+0.035
 songs+0.035
Neg logit contributions
 shave-0.052
apa-0.050
isel-0.049
 carve-0.049
 anchor-0.048
Token seaw
Token ID32085
Feature activation+1.27e-01

Pos logit contributions
 shave+0.052
apa+0.048
 carve+0.047
isel+0.047
 anchor+0.047
Neg logit contributions
 next-0.055
 lesson-0.052
rav-0.051
fl-0.051
Good-0.051
Tokeneed
Token ID2308
Feature activation+5.61e-02

Pos logit contributions
 shave+0.173
 anchor+0.159
 carve+0.156
 harbour+0.150
apa+0.150
Neg logit contributions
rav-0.214
 next-0.209
oys-0.208
ider-0.206
S-0.204
Token to
Token ID284
Feature activation+3.51e-03

Pos logit contributions
 shave+0.116
 anchor+0.113
 carve+0.109
apa+0.108
bowl+0.108
Neg logit contributions
rav-0.058
Good-0.057
 lesson-0.055
 next-0.054
S-0.052
Token show
Token ID905
Feature activation-5.04e-03

Pos logit contributions
 shave+0.005
 anchor+0.005
apa+0.005
bowl+0.005
pen+0.005
Neg logit contributions
 next-0.006
 lesson-0.005
Good-0.005
rav-0.005
 seek-0.005
Token her
Token ID607
Feature activation-1.26e-02

Pos logit contributions
ider+0.009
Good+0.008
rav+0.008
OK+0.008
 seek+0.008
Neg logit contributions
 shave-0.008
 anchor-0.007
apa-0.007
 carve-0.007
pen-0.007
Token friends
Token ID2460
Feature activation-8.85e-03

Pos logit contributions
 lesson+0.014
 next+0.014
rav+0.012
 songs+0.011
Good+0.011
Neg logit contributions
 shave-0.026
isel-0.025
pen-0.024
 carve-0.024
 anchor-0.024
Token it
Token ID340
Feature activation-5.80e-03

Pos logit contributions
 shave+0.042
apa+0.040
pen+0.039
 carve+0.039
isel+0.039
Neg logit contributions
 next-0.020
 lesson-0.020
Good-0.018
rav-0.018
 Sue-0.017
Token come
Token ID1282
Feature activation-2.15e-02

Pos logit contributions
rav+0.007
Good+0.007
 lesson+0.007
S+0.007
oys+0.006
Neg logit contributions
 shave-0.011
 anchor-0.011
 carve-0.010
bowl-0.010
pen-0.010
Token out
Token ID503
Feature activation+3.49e-02

Pos logit contributions
 lesson+0.030
Good+0.030
rav+0.030
OK+0.028
S+0.028
Neg logit contributions
 shave-0.036
 anchor-0.035
 carve-0.034
bowl-0.033
apa-0.033
Token of
Token ID286
Feature activation+1.99e-02

Pos logit contributions
 shave+0.069
 anchor+0.065
apa+0.064
 carve+0.064
bowl+0.064
Neg logit contributions
 next-0.043
 lesson-0.039
Good-0.038
rav-0.038
S-0.037
Token the
Token ID262
Feature activation+9.64e-03

Pos logit contributions
 shave+0.038
apa+0.036
pen+0.036
bowl+0.035
 carve+0.035
Neg logit contributions
 next-0.025
 lesson-0.023
Good-0.023
S-0.021
 seek-0.021
Token seaw
Token ID32085
Feature activation+1.27e-01

Pos logit contributions
 shave+0.018
igh+0.018
pen+0.018
apa+0.018
isel+0.017
Neg logit contributions
 next-0.016
 lesson-0.013
 lady-0.012
 seek-0.011
 moment-0.011
Tokeneed
Token ID2308
Feature activation+5.53e-02

Pos logit contributions
 shave+0.191
igh+0.175
neath+0.169
 carve+0.169
apa+0.167
Neg logit contributions
S-0.205
 next-0.195
ider-0.194
rav-0.191
oys-0.188
Token and
Token ID290
Feature activation-2.29e-03

Pos logit contributions
 shave+0.121
isel+0.114
apa+0.113
 carve+0.112
 anchor+0.111
Neg logit contributions
 next-0.056
 lesson-0.048
rav-0.044
S-0.044
Good-0.043
Token walk
Token ID2513
Feature activation-4.51e-02

Pos logit contributions
rav+0.004
Good+0.004
 lesson+0.003
 next+0.003
S+0.003
Neg logit contributions
 shave-0.004
 anchor-0.004
 carve-0.003
bowl-0.003
apa-0.003
Token around
Token ID1088
Feature activation-7.98e-03

Pos logit contributions
Good+0.070
rav+0.069
ider+0.065
 lesson+0.064
bows+0.064
Neg logit contributions
 anchor-0.068
 shave-0.066
bowl-0.064
pen-0.061
 carve-0.061
Token the
Token ID262
Feature activation-1.54e-03

Pos logit contributions
rav+0.011
Good+0.011
 lesson+0.011
 next+0.010
S+0.010
Neg logit contributions
 shave-0.014
 anchor-0.013
 carve-0.013
apa-0.012
bowl-0.012
Token eggs
Token ID9653
Feature activation+2.94e-02

Pos logit contributions
rav+0.004
 next+0.004
Good+0.004
 lesson+0.004
S+0.004
Neg logit contributions
 shave-0.004
 anchor-0.003
 carve-0.003
apa-0.003
bowl-0.003
Token while
Token ID981
Feature activation+1.58e-02

Pos logit contributions
 shave+0.059
 anchor+0.057
isel+0.055
apa+0.054
pen+0.054
Neg logit contributions
 next-0.033
rav-0.030
 lesson-0.030
Good-0.029
bows-0.029
Token mom
Token ID1995
Feature activation+2.57e-03

Pos logit contributions
 shave+0.028
 anchor+0.027
 carve+0.027
apa+0.026
 frying+0.026
Neg logit contributions
rav-0.021
Good-0.021
 lesson-0.020
S-0.019
OK-0.019
Token prepared
Token ID5597
Feature activation+3.23e-02

Pos logit contributions
 shave+0.004
 anchor+0.004
 carve+0.004
apa+0.004
bowl+0.004
Neg logit contributions
rav-0.004
Good-0.004
 lesson-0.004
 next-0.004
S-0.004
Token the
Token ID262
Feature activation+2.46e-02

Pos logit contributions
 shave+0.066
 anchor+0.064
bowl+0.062
pen+0.062
isel+0.062
Neg logit contributions
 next-0.035
 lesson-0.033
rav-0.031
Good-0.031
S-0.030
Token pans
Token ID36209
Feature activation+1.25e-01

Pos logit contributions
 shave+0.028
 anchor+0.027
 carve+0.026
apa+0.025
 frying+0.024
Neg logit contributions
rav-0.049
Good-0.047
 lesson-0.046
 next-0.045
S-0.045
Token and
Token ID290
Feature activation+1.75e-02

Pos logit contributions
 shave+0.270
 anchor+0.264
isel+0.263
pen+0.253
 iceberg+0.248
Neg logit contributions
 next-0.132
 lesson-0.106
bows-0.099
rav-0.096
Good-0.096
Token pre
Token ID662
Feature activation+4.14e-02

Pos logit contributions
 shave+0.027
 anchor+0.026
 carve+0.025
apa+0.024
bowl+0.024
Neg logit contributions
rav-0.028
Good-0.027
 lesson-0.027
 next-0.027
S-0.026
Tokenhe
Token ID258
Feature activation+3.71e-02

Pos logit contributions
 anchor+0.067
 shave+0.063
bowl+0.061
 harbour+0.059
 iceberg+0.059
Neg logit contributions
 next-0.063
S-0.061
Good-0.061
 lesson-0.059
bows-0.055
Tokenated
Token ID515
Feature activation-5.66e-03

Pos logit contributions
 shave+0.058
 anchor+0.055
 carve+0.055
apa+0.054
 frying+0.053
Neg logit contributions
rav-0.060
Good-0.057
 lesson-0.054
S-0.053
OK-0.052
Token the
Token ID262
Feature activation+6.79e-03

Pos logit contributions
 next+0.007
 lesson+0.007
Good+0.006
rav+0.006
S+0.006
Neg logit contributions
 shave-0.011
 anchor-0.010
isel-0.010
bowl-0.010
 carve-0.010
Token her
Token ID607
Feature activation+1.51e-04

Pos logit contributions
rav+0.006
 next+0.006
 lesson+0.006
Good+0.006
S+0.005
Neg logit contributions
 shave-0.018
 anchor-0.017
 carve-0.017
apa-0.017
bowl-0.017
Token how
Token ID703
Feature activation+2.09e-02

Pos logit contributions
 shave+0.000
 anchor+0.000
 carve+0.000
isel+0.000
bowl+0.000
Neg logit contributions
 lesson-0.000
 next-0.000
Good-0.000
rav-0.000
S-0.000
Token to
Token ID284
Feature activation+2.45e-02

Pos logit contributions
 shave+0.052
pen+0.052
igh+0.051
 anchor+0.051
 harbour+0.050
Neg logit contributions
 next-0.015
 lesson-0.014
Good-0.013
rav-0.012
S-0.011
Token keep
Token ID1394
Feature activation+5.75e-02

Pos logit contributions
pen+0.038
ann+0.038
ete+0.037
 harbour+0.036
igh+0.036
Neg logit contributions
 next-0.038
 seek-0.033
Good-0.031
 lesson-0.031
OK-0.030
Token the
Token ID262
Feature activation+5.94e-02

Pos logit contributions
 shave+0.098
isel+0.097
ete+0.095
 harbour+0.095
 anchor+0.095
Neg logit contributions
 next-0.083
 lesson-0.072
Good-0.068
fl-0.068
S-0.066
Token stove
Token ID27635
Feature activation+1.25e-01

Pos logit contributions
pen+0.074
ete+0.073
isel+0.069
igh+0.066
 shave+0.064
Neg logit contributions
 next-0.132
 lesson-0.124
 moment-0.110
fl-0.109
Good-0.107
Token clean
Token ID3424
Feature activation+7.84e-02

Pos logit contributions
 shave+0.234
isel+0.226
 anchor+0.225
pen+0.222
apa+0.217
Neg logit contributions
 next-0.164
 lesson-0.145
Good-0.131
 seek-0.131
bows-0.128
Token.
Token ID13
Feature activation+6.39e-03

Pos logit contributions
 shave+0.154
 anchor+0.149
 carve+0.144
apa+0.143
isel+0.142
Neg logit contributions
 next-0.088
 lesson-0.087
rav-0.086
Good-0.086
S-0.080
Token Lily
Token ID20037
Feature activation-3.09e-02

Pos logit contributions
 shave+0.012
 carve+0.012
 anchor+0.012
bowl+0.011
apa+0.011
Neg logit contributions
rav-0.007
 lesson-0.007
Good-0.007
 next-0.007
ider-0.007
Token thought
Token ID1807
Feature activation-1.47e-02

Pos logit contributions
 next+0.025
 lesson+0.024
Good+0.023
rav+0.022
S+0.021
Neg logit contributions
 shave-0.071
 anchor-0.069
isel-0.067
 carve-0.067
igh-0.066
Token it
Token ID340
Feature activation-3.96e-03

Pos logit contributions
 lesson+0.017
 next+0.016
rav+0.015
Good+0.015
 Sue+0.014
Neg logit contributions
 shave-0.030
bowl-0.029
 harbour-0.028
 anchor-0.028
pen-0.028
Token happens
Token ID4325
Feature activation-1.57e-02

Pos logit contributions
rav+0.023
Good+0.022
 lesson+0.022
 next+0.022
S+0.021
Neg logit contributions
 shave-0.022
 anchor-0.022
 carve-0.020
apa-0.020
bowl-0.020
Token when
Token ID618
Feature activation+4.39e-02

Pos logit contributions
rav+0.021
Good+0.020
S+0.020
 lesson+0.020
bows+0.019
Neg logit contributions
 anchor-0.028
 shave-0.026
bowl-0.026
apa-0.026
 boiling-0.025
Token clouds
Token ID15114
Feature activation+5.13e-02

Pos logit contributions
 anchor+0.074
 shave+0.071
 carve+0.067
apa+0.066
 boiling+0.065
Neg logit contributions
rav-0.066
S-0.061
Good-0.061
bows-0.060
ider-0.060
Token get
Token ID651
Feature activation+9.80e-03

Pos logit contributions
 shave+0.085
 anchor+0.081
apa+0.079
 carve+0.077
bowl+0.077
Neg logit contributions
 next-0.073
rav-0.072
Good-0.071
 lesson-0.070
S-0.066
Token too
Token ID1165
Feature activation-4.15e-02

Pos logit contributions
 shave+0.018
pen+0.017
apa+0.017
ete+0.017
 anchor+0.016
Neg logit contributions
 next-0.014
 lesson-0.013
Good-0.012
S-0.011
rav-0.011
Token full
Token ID1336
Feature activation+1.25e-01

Pos logit contributions
 next+0.061
 lesson+0.060
Good+0.059
rav+0.059
S+0.057
Neg logit contributions
 shave-0.068
 anchor-0.064
 carve-0.062
pen-0.062
apa-0.062
Token of
Token ID286
Feature activation+2.24e-02

Pos logit contributions
 shave+0.231
 anchor+0.225
 carve+0.215
apa+0.215
bowl+0.213
Neg logit contributions
rav-0.153
Good-0.149
 lesson-0.147
 next-0.144
S-0.140
Token rain
Token ID6290
Feature activation-2.30e-02

Pos logit contributions
 shave+0.036
 anchor+0.035
 carve+0.033
apa+0.033
bowl+0.032
Neg logit contributions
rav-0.034
Good-0.033
 lesson-0.033
 next-0.033
S-0.031
Token."
Token ID526
Feature activation-4.45e-03

Pos logit contributions
rav+0.033
Good+0.033
 lesson+0.031
S+0.031
OK+0.030
Neg logit contributions
 shave-0.038
 anchor-0.037
 carve-0.035
bowl-0.035
acket-0.035
Token Johnny
Token ID15470
Feature activation+3.35e-02

Pos logit contributions
 next+0.006
Good+0.006
 Sue+0.006
 lesson+0.005
rav+0.005
Neg logit contributions
 anchor-0.008
 shave-0.007
isel-0.007
 frying-0.007
igh-0.007
Token said
Token ID531
Feature activation+1.44e-02

Pos logit contributions
 shave+0.065
 anchor+0.063
 carve+0.060
apa+0.060
bowl+0.060
Neg logit contributions
rav-0.040
Good-0.038
 lesson-0.038
 next-0.038
S-0.036
Token found
Token ID1043
Feature activation-9.23e-03

Pos logit contributions
 next+0.001
rav+0.001
 lesson+0.001
 seek+0.001
S+0.001
Neg logit contributions
 shave-0.002
 frying-0.002
apa-0.002
pen-0.002
ime-0.002
Token it
Token ID340
Feature activation-1.04e-02

Pos logit contributions
 next+0.010
 lesson+0.009
Good+0.009
 Sue+0.009
rav+0.009
Neg logit contributions
 shave-0.020
 carve-0.018
 frying-0.018
pen-0.018
apa-0.018
Token stuck
Token ID7819
Feature activation-8.08e-03

Pos logit contributions
rav+0.012
Good+0.012
 lesson+0.012
 next+0.011
S+0.011
Neg logit contributions
 shave-0.020
 anchor-0.019
 carve-0.018
bowl-0.018
apa-0.018
Token in
Token ID287
Feature activation+1.11e-02

Pos logit contributions
rav+0.012
Good+0.011
 lesson+0.011
S+0.011
bows+0.010
Neg logit contributions
 anchor-0.014
 shave-0.014
bowl-0.013
pen-0.013
 carve-0.012
Token some
Token ID617
Feature activation-4.43e-02

Pos logit contributions
 shave+0.025
 anchor+0.024
apa+0.023
 carve+0.023
isel+0.023
Neg logit contributions
 next-0.010
 lesson-0.010
Good-0.010
S-0.009
rav-0.009
Token seaw
Token ID32085
Feature activation+1.23e-01

Pos logit contributions
 next+0.059
 lesson+0.055
 seek+0.047
 animals+0.047
Good+0.046
Neg logit contributions
 shave-0.088
apa-0.083
acket-0.082
igh-0.080
ete-0.079
Tokeneed
Token ID2308
Feature activation+5.45e-02

Pos logit contributions
 shave+0.197
neath+0.188
 boiling+0.181
igh+0.181
 carve+0.178
Neg logit contributions
S-0.176
oys-0.171
ider-0.169
rav-0.165
 lesson-0.161
Token.
Token ID13
Feature activation-1.34e-02

Pos logit contributions
 shave+0.115
 anchor+0.107
 carve+0.107
isel+0.107
apa+0.106
Neg logit contributions
 next-0.058
 lesson-0.053
rav-0.049
Good-0.048
S-0.047
Token Tim
Token ID5045
Feature activation-5.87e-03

Pos logit contributions
 next+0.016
Good+0.016
rav+0.016
 lesson+0.015
S+0.015
Neg logit contributions
 shave-0.026
 anchor-0.025
 carve-0.024
pen-0.024
isel-0.024
Token was
Token ID373
Feature activation-8.55e-03

Pos logit contributions
 next+0.005
 lesson+0.004
Good+0.004
rav+0.004
 seek+0.004
Neg logit contributions
 shave-0.014
 anchor-0.013
isel-0.013
apa-0.013
 carve-0.013
Token so
Token ID523
Feature activation+3.32e-02

Pos logit contributions
 next+0.010
 lesson+0.009
Good+0.009
rav+0.009
 seek+0.009
Neg logit contributions
 shave-0.017
 anchor-0.016
apa-0.016
 carve-0.016
isel-0.016
Token Joe
Token ID5689
Feature activation+6.73e-02

Pos logit contributions
 shave+0.016
 anchor+0.015
 carve+0.014
apa+0.014
pen+0.014
Neg logit contributions
rav-0.031
 next-0.031
 lesson-0.031
Good-0.031
S-0.030
Token a
Token ID257
Feature activation+4.07e-03

Pos logit contributions
 shave+0.148
 anchor+0.144
 carve+0.140
isel+0.138
apa+0.138
Neg logit contributions
 next-0.059
rav-0.058
 lesson-0.058
Good-0.058
S-0.053
Token tasty
Token ID25103
Feature activation+1.57e-02

Pos logit contributions
 shave+0.007
pen+0.007
 anchor+0.007
isel+0.007
 harbour+0.006
Neg logit contributions
 lesson-0.006
 next-0.005
Good-0.005
rav-0.005
S-0.005
Token piece
Token ID3704
Feature activation+6.58e-03

Pos logit contributions
 shave+0.026
 anchor+0.025
 carve+0.024
apa+0.024
bowl+0.024
Neg logit contributions
 lesson-0.023
 next-0.022
rav-0.022
Good-0.022
S-0.021
Token of
Token ID286
Feature activation-4.02e-03

Pos logit contributions
 shave+0.011
 anchor+0.010
 carve+0.010
apa+0.010
bowl+0.010
Neg logit contributions
rav-0.009
 next-0.009
 lesson-0.009
Good-0.009
S-0.009
Token seaw
Token ID32085
Feature activation+1.23e-01

Pos logit contributions
 lesson+0.006
 next+0.006
Good+0.006
rav+0.006
S+0.006
Neg logit contributions
 shave-0.006
 anchor-0.006
apa-0.006
 carve-0.005
isel-0.005
Tokeneed
Token ID2308
Feature activation+5.80e-02

Pos logit contributions
 shave+0.155
 anchor+0.149
 carve+0.145
 harbour+0.144
 boiling+0.139
Neg logit contributions
rav-0.215
ider-0.212
S-0.211
 lesson-0.210
 next-0.208
Token,
Token ID11
Feature activation-2.45e-05

Pos logit contributions
 shave+0.119
 anchor+0.115
 carve+0.111
apa+0.111
bowl+0.110
Neg logit contributions
rav-0.061
Good-0.059
 lesson-0.058
 next-0.058
S-0.054
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
rav+0.000
Good+0.000
ider+0.000
 lesson+0.000
bows+0.000
Neg logit contributions
 shave-0.000
 anchor-0.000
 carve-0.000
bowl-0.000
 boiling-0.000
Token Joe
Token ID5689
Feature activation+2.62e-02

Pos logit contributions
rav+0.022
ider+0.022
Good+0.021
bows+0.021
S+0.021
Neg logit contributions
 anchor-0.014
 shave-0.013
apa-0.012
 carve-0.012
bowl-0.012
Token ate
Token ID15063
Feature activation+8.56e-03

Pos logit contributions
 shave+0.051
 anchor+0.049
 carve+0.047
isel+0.047
apa+0.047
Neg logit contributions
 next-0.030
rav-0.030
 lesson-0.029
Good-0.029
S-0.027
Token told
Token ID1297
Feature activation+1.00e-03

Pos logit contributions
rav+0.062
 lesson+0.058
OK+0.058
Good+0.057
oys+0.055
Neg logit contributions
 shave-0.053
 frying-0.053
bowl-0.052
 boiling-0.052
 carve-0.052
Token by
Token ID416
Feature activation+1.90e-02

Pos logit contributions
 shave+0.002
 anchor+0.002
apa+0.002
 carve+0.002
 iceberg+0.001
Neg logit contributions
rav-0.002
Good-0.001
OK-0.001
ider-0.001
S-0.001
Token her
Token ID607
Feature activation+6.75e-03

Pos logit contributions
 shave+0.033
 anchor+0.032
 carve+0.031
apa+0.030
bowl+0.030
Neg logit contributions
rav-0.026
Good-0.025
 lesson-0.025
 next-0.025
S-0.024
Token grand
Token ID4490
Feature activation+1.04e-01

Pos logit contributions
 shave+0.013
 anchor+0.013
 carve+0.012
apa+0.012
bowl+0.012
Neg logit contributions
rav-0.008
Good-0.008
 lesson-0.007
OK-0.007
S-0.007
Tokenp
Token ID79
Feature activation-8.13e-03

Pos logit contributions
 shave+0.131
 anchor+0.120
 carve+0.117
bowl+0.115
isel+0.115
Neg logit contributions
 next-0.195
rav-0.193
 lesson-0.193
Good-0.188
S-0.182
Tokenapa
Token ID32678
Feature activation+1.22e-01

Pos logit contributions
Good+0.012
rav+0.011
OK+0.011
 lesson+0.011
 seek+0.010
Neg logit contributions
apa-0.014
 anchor-0.013
 shave-0.013
bowl-0.013
igh-0.013
Token that
Token ID326
Feature activation+5.20e-02

Pos logit contributions
 shave+0.199
 anchor+0.192
apa+0.187
bowl+0.184
pen+0.182
Neg logit contributions
Good-0.182
rav-0.181
 lesson-0.174
S-0.170
ider-0.168
Token a
Token ID257
Feature activation-3.66e-03

Pos logit contributions
 shave+0.085
 anchor+0.082
 carve+0.078
bowl+0.077
apa+0.077
Neg logit contributions
rav-0.077
Good-0.074
 lesson-0.074
 next-0.073
S-0.071
Token modern
Token ID3660
Feature activation-1.00e-02

Pos logit contributions
rav+0.005
bows+0.005
oys+0.005
Good+0.005
ider+0.005
Neg logit contributions
 shave-0.006
 anchor-0.006
 carve-0.006
apa-0.006
bowl-0.006
Token hive
Token ID35881
Feature activation+1.25e-02

Pos logit contributions
rav+0.021
Good+0.020
oys+0.020
Wh+0.019
K+0.018
Neg logit contributions
 anchor-0.011
 pot-0.010
 stove-0.010
 fork-0.010
 shave-0.010
Token was
Token ID373
Feature activation+1.01e-02

Pos logit contributions
 shave+0.019
bowl+0.019
 anchor+0.019
on+0.018
apa+0.018
Neg logit contributions
rav-0.020
Good-0.019
S-0.018
 lesson-0.017
OK-0.017
Token available
Token ID1695
Feature activation-4.79e-03

Pos logit contributions
rav+0.010
ider+0.009
Good+0.009
 lesson+0.009
Sweet+0.009
Neg logit contributions
 anchor-0.009
 shave-0.008
 boiling-0.008
bowl-0.008
 harbour-0.008
Token.
Token ID13
Feature activation-8.82e-03

Pos logit contributions
rav+0.007
Good+0.007
S+0.007
OK+0.007
 lesson+0.007
Neg logit contributions
 anchor-0.008
 shave-0.008
pen-0.007
bowl-0.007
apa-0.007
Token
Token ID220
Feature activation+8.99e-03

Pos logit contributions
rav+0.015
 lesson+0.015
 next+0.014
ider+0.014
Good+0.014
Neg logit contributions
apa-0.012
bowl-0.012
 shave-0.012
ete-0.011
pen-0.011
Token
Token ID198
Feature activation+1.00e-02

Pos logit contributions
 shave+0.015
apa+0.014
 anchor+0.014
 carve+0.014
bowl+0.014
Neg logit contributions
rav-0.013
 lesson-0.013
Good-0.013
 next-0.012
 seek-0.012
Token
Token ID198
Feature activation+8.82e-03

Pos logit contributions
 shave+0.015
apa+0.015
bowl+0.015
pen+0.014
 carve+0.014
Neg logit contributions
rav-0.015
 lesson-0.015
ider-0.014
 next-0.014
Good-0.014
TokenTHE
Token ID10970
Feature activation-1.35e-01

Pos logit contributions
bowl+0.013
apa+0.013
 shave+0.013
pen+0.013
ete+0.013
Neg logit contributions
ses-0.013
rav-0.013
bows-0.013
sed-0.013
fl-0.012
Token<|endoftext|>
Token ID50256
Feature activation-3.37e-02

Pos logit contributions
oys+0.372
Good+0.368
S+0.364
rav+0.360
 lesson+0.360
Neg logit contributions
on-0.021
 -0.012
 carve-0.011
bowl-0.007
 shave-0.000
TokenOnce
Token ID7454
Feature activation+2.83e-04

Pos logit contributions
rav+0.050
 lesson+0.049
 next+0.048
Good+0.048
ider+0.046
Neg logit contributions
 shave-0.053
 anchor-0.050
bowl-0.050
pen-0.050
 carve-0.049
Token upon
Token ID2402
Feature activation-3.05e-02

Pos logit contributions
 shave+0.001
 anchor+0.001
apa+0.000
 carve+0.000
bowl+0.000
Neg logit contributions
rav-0.000
Good-0.000
 lesson-0.000
 next-0.000
S-0.000
Token a
Token ID257
Feature activation+1.08e-04

Pos logit contributions
rav+0.052
oys+0.048
ider+0.046
fl+0.044
bows+0.044
Neg logit contributions
 shave-0.049
on-0.043
pen-0.041
bowl-0.041
 anchor-0.040
Token time
Token ID640
Feature activation-1.63e-02

Pos logit contributions
 shave+0.000
 anchor+0.000
 carve+0.000
apa+0.000
isel+0.000
Neg logit contributions
 next-0.000
rav-0.000
 lesson-0.000
Good-0.000
S-0.000
Token arm
Token ID3211
Feature activation-3.88e-03

Pos logit contributions
rav+0.071
Good+0.069
S+0.066
oys+0.064
 lesson+0.063
Neg logit contributions
 shave-0.076
 anchor-0.074
 carve-0.068
apa-0.067
bowl-0.066
Tokenchair
Token ID16337
Feature activation-1.10e-02

Pos logit contributions
rav+0.007
Good+0.006
S+0.006
oys+0.006
OK+0.006
Neg logit contributions
 shave-0.006
on-0.005
 anchor-0.005
ime-0.005
pen-0.005
Token.
Token ID13
Feature activation-2.54e-03

Pos logit contributions
rav+0.019
Good+0.018
oys+0.017
 lesson+0.017
S+0.017
Neg logit contributions
 shave-0.016
on-0.015
bowl-0.015
 anchor-0.015
pen-0.015
Token
Token ID198
Feature activation+1.06e-02

Pos logit contributions
rav+0.004
 lesson+0.004
 next+0.004
Good+0.004
bows+0.004
Neg logit contributions
 shave-0.004
apa-0.003
bowl-0.003
on-0.003
pen-0.003
Token
Token ID198
Feature activation+1.03e-02

Pos logit contributions
 shave+0.015
apa+0.014
bowl+0.014
 anchor+0.014
on+0.014
Neg logit contributions
rav-0.017
bows-0.017
Good-0.016
ider-0.016
 lesson-0.016
TokenTHE
Token ID10970
Feature activation-1.35e-01

Pos logit contributions
 shave+0.018
apa+0.017
bowl+0.017
on+0.017
pen+0.017
Neg logit contributions
rav-0.013
bows-0.013
 lesson-0.013
fl-0.013
sed-0.012
Token<|endoftext|>
Token ID50256
Feature activation-3.86e-02

Pos logit contributions
oys+0.335
S+0.320
Good+0.315
 lesson+0.311
rav+0.308
Neg logit contributions
on-0.076
bowl-0.054
 carve-0.052
pen-0.049
 shave-0.047
TokenOnce
Token ID7454
Feature activation-5.03e-03

Pos logit contributions
rav+0.058
 lesson+0.057
 next+0.055
Good+0.055
ider+0.054
Neg logit contributions
 shave-0.061
bowl-0.057
pen-0.057
 anchor-0.057
 carve-0.055
Token upon
Token ID2402
Feature activation-3.72e-02

Pos logit contributions
rav+0.007
Good+0.007
 lesson+0.007
 next+0.006
ider+0.006
Neg logit contributions
 shave-0.008
 anchor-0.008
apa-0.008
bowl-0.008
 carve-0.008
Token a
Token ID257
Feature activation-1.12e-02

Pos logit contributions
rav+0.066
oys+0.062
ider+0.059
fl+0.056
bows+0.056
Neg logit contributions
 shave-0.055
on-0.051
pen-0.047
 iceberg-0.046
bowl-0.046
Token time
Token ID640
Feature activation-2.18e-02

Pos logit contributions
rav+0.014
 next+0.014
Good+0.014
 lesson+0.014
S+0.013
Neg logit contributions
 shave-0.021
 anchor-0.020
 carve-0.019
apa-0.019
isel-0.019
Token again
Token ID757
Feature activation-7.50e-02

Pos logit contributions
rav+0.065
Good+0.061
 lesson+0.059
oys+0.056
bows+0.056
Neg logit contributions
 shave-0.085
 anchor-0.081
bowl-0.078
pen-0.077
on-0.075
Token tomorrow
Token ID9439
Feature activation-4.44e-02

Pos logit contributions
rav+0.129
Good+0.120
 lesson+0.119
bows+0.114
OK+0.113
Neg logit contributions
 shave-0.116
 anchor-0.111
 carve-0.105
bowl-0.105
pen-0.102
Token.
Token ID13
Feature activation+2.84e-03

Pos logit contributions
rav+0.063
Good+0.061
 lesson+0.061
 next+0.059
S+0.058
Neg logit contributions
 shave-0.076
 anchor-0.072
 carve-0.070
apa-0.069
pen-0.069
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
 shave+0.004
 anchor+0.004
apa+0.004
bowl+0.004
 carve+0.004
Neg logit contributions
rav-0.005
 lesson-0.004
Good-0.004
 next-0.004
S-0.004
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
 shave+0.017
 anchor+0.017
 carve+0.016
apa+0.016
bowl+0.016
Neg logit contributions
rav-0.018
Good-0.017
 next-0.017
 lesson-0.017
S-0.017
TokenTHE
Token ID10970
Feature activation-1.33e-01

Pos logit contributions
 shave+0.019
 anchor+0.018
apa+0.018
bowl+0.017
pen+0.017
Neg logit contributions
rav-0.013
 lesson-0.013
bows-0.013
 next-0.013
Good-0.012
Token<|endoftext|>
Token ID50256
Feature activation-3.71e-02

Pos logit contributions
S+0.326
Good+0.326
oys+0.323
rav+0.317
 lesson+0.314
Neg logit contributions
on-0.074
 shave-0.060
bowl-0.059
 carve-0.057
pen-0.046
TokenOnce
Token ID7454
Feature activation-4.97e-03

Pos logit contributions
rav+0.056
 lesson+0.055
 next+0.053
Good+0.053
ider+0.052
Neg logit contributions
 shave-0.058
 anchor-0.054
bowl-0.054
pen-0.054
 carve-0.053
Token upon
Token ID2402
Feature activation-3.69e-02

Pos logit contributions
rav+0.006
 next+0.006
 lesson+0.006
Good+0.006
S+0.005
Neg logit contributions
 shave-0.010
 anchor-0.009
 carve-0.009
apa-0.009
bowl-0.009
Token a
Token ID257
Feature activation-9.01e-03

Pos logit contributions
rav+0.061
oys+0.057
ider+0.055
bows+0.053
fl+0.053
Neg logit contributions
 shave-0.059
on-0.053
pen-0.050
 anchor-0.050
bowl-0.049
Token time
Token ID640
Feature activation-2.22e-02

Pos logit contributions
rav+0.012
 next+0.012
Good+0.012
 lesson+0.011
S+0.011
Neg logit contributions
 shave-0.016
 anchor-0.016
 carve-0.015
apa-0.015
bowl-0.015
Token happy
Token ID3772
Feature activation-1.75e-02

Pos logit contributions
rav+0.000
ider+0.000
S+0.000
oys+0.000
Good+0.000
Neg logit contributions
 shave-0.001
 anchor-0.001
 carve-0.000
bowl-0.000
apa-0.000
Token.
Token ID13
Feature activation-8.22e-03

Pos logit contributions
rav+0.030
Good+0.030
S+0.029
oys+0.029
K+0.028
Neg logit contributions
 anchor-0.022
 shave-0.021
bowl-0.020
pen-0.019
ime-0.019
Token
Token ID220
Feature activation-2.98e-04

Pos logit contributions
rav+0.012
 next+0.012
 lesson+0.012
Good+0.012
ider+0.012
Neg logit contributions
 shave-0.013
apa-0.012
 anchor-0.012
bowl-0.012
 carve-0.012
Token
Token ID198
Feature activation+5.42e-03

Pos logit contributions
rav+0.000
 lesson+0.000
Good+0.000
 next+0.000
S+0.000
Neg logit contributions
 shave-0.001
 anchor-0.001
apa-0.000
 carve-0.000
bowl-0.000
Token
Token ID198
Feature activation+5.51e-03

Pos logit contributions
 shave+0.009
apa+0.008
 anchor+0.008
bowl+0.008
 carve+0.008
Neg logit contributions
rav-0.008
ider-0.008
 lesson-0.008
Good-0.008
 next-0.008
TokenTHE
Token ID10970
Feature activation-1.33e-01

Pos logit contributions
 shave+0.008
pen+0.007
apa+0.007
on+0.007
bowl+0.007
Neg logit contributions
ider-0.009
rav-0.009
bows-0.009
sed-0.008
ses-0.008
Token<|endoftext|>
Token ID50256
Feature activation-2.44e-02

Pos logit contributions
oys+0.363
S+0.356
ider+0.352
Good+0.347
rav+0.341
Neg logit contributions
 carve-0.027
on-0.022
 shave-0.021
bowl-0.016
 harbour-0.014
TokenOnce
Token ID7454
Feature activation+9.57e-03

Pos logit contributions
rav+0.037
 lesson+0.036
Good+0.035
 next+0.035
ider+0.034
Neg logit contributions
 shave-0.038
 anchor-0.036
bowl-0.035
pen-0.035
 carve-0.035
Token upon
Token ID2402
Feature activation-2.04e-02

Pos logit contributions
 shave+0.020
 anchor+0.019
 carve+0.019
pen+0.018
isel+0.018
Neg logit contributions
 next-0.010
rav-0.010
 lesson-0.010
Good-0.009
S-0.009
Token a
Token ID257
Feature activation+4.75e-03

Pos logit contributions
rav+0.030
oys+0.027
ider+0.027
 lesson+0.026
bows+0.025
Neg logit contributions
 shave-0.037
on-0.032
 anchor-0.032
bowl-0.032
pen-0.032
Token time
Token ID640
Feature activation-1.30e-02

Pos logit contributions
 shave+0.009
 anchor+0.008
 carve+0.008
apa+0.008
bowl+0.008
Neg logit contributions
rav-0.006
 next-0.006
 lesson-0.006
Good-0.006
S-0.006
Token so
Token ID523
Feature activation+2.81e-02

Pos logit contributions
rav+0.013
oys+0.013
Good+0.013
 lesson+0.012
ider+0.012
Neg logit contributions
 shave-0.014
 anchor-0.014
 carve-0.014
on-0.013
apa-0.013
Token happy
Token ID3772
Feature activation-1.15e-02

Pos logit contributions
 shave+0.047
 carve+0.046
 anchor+0.046
apa+0.045
bowl+0.044
Neg logit contributions
rav-0.036
 lesson-0.036
Good-0.036
oys-0.036
ider-0.035
Token.
Token ID13
Feature activation-2.05e-03

Pos logit contributions
rav+0.020
Good+0.020
 lesson+0.019
Wh+0.019
Tweet+0.019
Neg logit contributions
 shave-0.013
 anchor-0.013
on-0.013
pen-0.013
?'-0.012
Token
Token ID198
Feature activation+9.04e-03

Pos logit contributions
 seek+0.003
 lesson+0.003
Good+0.003
rav+0.003
ider+0.003
Neg logit contributions
 shave-0.003
apa-0.003
 carve-0.003
pen-0.003
 harbour-0.003
Token
Token ID198
Feature activation+8.96e-03

Pos logit contributions
 shave+0.013
 anchor+0.013
apa+0.012
 carve+0.012
pen+0.012
Neg logit contributions
rav-0.014
Good-0.014
 lesson-0.014
 next-0.014
ider-0.014
TokenTHE
Token ID10970
Feature activation-1.29e-01

Pos logit contributions
 shave+0.015
pen+0.015
 carve+0.014
 anchor+0.014
apa+0.014
Neg logit contributions
rav-0.011
ider-0.011
bows-0.011
 lesson-0.011
oys-0.011
Token<|endoftext|>
Token ID50256
Feature activation-2.34e-02

Pos logit contributions
S+0.318
oys+0.315
Good+0.310
ider+0.303
 lesson+0.302
Neg logit contributions
 carve-0.074
on-0.068
 shave-0.066
 harbour-0.059
pen-0.057
TokenOnce
Token ID7454
Feature activation+9.86e-03

Pos logit contributions
rav+0.035
 lesson+0.034
Good+0.034
 next+0.034
ider+0.032
Neg logit contributions
 shave-0.037
 anchor-0.035
bowl-0.034
 carve-0.034
pen-0.034
Token upon
Token ID2402
Feature activation-2.00e-02

Pos logit contributions
 shave+0.021
 anchor+0.020
 carve+0.020
bowl+0.019
pen+0.019
Neg logit contributions
 next-0.010
rav-0.010
 lesson-0.010
Good-0.009
S-0.009
Token a
Token ID257
Feature activation+3.59e-03

Pos logit contributions
rav+0.028
oys+0.026
ider+0.025
 lesson+0.025
bows+0.024
Neg logit contributions
 shave-0.037
on-0.032
pen-0.032
 anchor-0.032
 carve-0.032
Token time
Token ID640
Feature activation-1.53e-02

Pos logit contributions
 shave+0.006
 anchor+0.006
 carve+0.006
apa+0.006
bowl+0.006
Neg logit contributions
rav-0.005
 next-0.005
Good-0.005
 lesson-0.005
S-0.004
Token delicious
Token ID12625
Feature activation+4.34e-02

Pos logit contributions
 shave+0.071
 anchor+0.068
 carve+0.066
apa+0.065
bowl+0.065
Neg logit contributions
rav-0.053
Good-0.051
 next-0.051
 lesson-0.051
S-0.048
Token!
Token ID0
Feature activation+2.16e-03

Pos logit contributions
 shave+0.064
 anchor+0.062
bowl+0.060
pen+0.059
apa+0.059
Neg logit contributions
rav-0.071
Good-0.066
 lesson-0.064
ider-0.064
OK-0.062
Token
Token ID220
Feature activation-1.13e-03

Pos logit contributions
apa+0.003
 shave+0.003
on+0.003
bowl+0.003
ete+0.003
Neg logit contributions
rav-0.003
ider-0.003
 next-0.003
 lesson-0.003
 seek-0.003
Token
Token ID198
Feature activation+3.03e-03

Pos logit contributions
rav+0.002
 lesson+0.002
 next+0.002
Good+0.002
 seek+0.001
Neg logit contributions
 shave-0.002
apa-0.002
 carve-0.002
pen-0.002
bowl-0.002
Token
Token ID198
Feature activation+3.08e-03

Pos logit contributions
 shave+0.005
on+0.004
apa+0.004
bowl+0.004
pen+0.004
Neg logit contributions
rav-0.005
 lesson-0.004
ider-0.004
 next-0.004
bows-0.004
TokenTHE
Token ID10970
Feature activation-1.29e-01

Pos logit contributions
on+0.005
 shave+0.005
pen+0.005
bowl+0.005
apa+0.004
Neg logit contributions
ses-0.005
sed-0.004
rav-0.004
bows-0.004
oys-0.004
Token<|endoftext|>
Token ID50256
Feature activation-2.00e-02

Pos logit contributions
ider+0.336
Good+0.335
rav+0.332
oys+0.332
S+0.332
Neg logit contributions
on-0.036
 shave-0.032
 carve-0.032
bowl-0.020
apa-0.011
TokenElsa
Token ID49050
Feature activation-2.01e-03

Pos logit contributions
rav+0.029
 lesson+0.029
Good+0.028
 next+0.028
ider+0.027
Neg logit contributions
 shave-0.032
 anchor-0.031
bowl-0.030
 carve-0.030
pen-0.030
Token was
Token ID373
Feature activation+2.87e-02

Pos logit contributions
rav+0.002
 lesson+0.002
Good+0.002
 next+0.002
S+0.002
Neg logit contributions
 shave-0.004
 anchor-0.004
 carve-0.004
apa-0.004
bowl-0.004
Token playing
Token ID2712
Feature activation-1.56e-02

Pos logit contributions
 shave+0.047
 anchor+0.045
 carve+0.044
apa+0.043
bowl+0.043
Neg logit contributions
rav-0.043
Good-0.041
 lesson-0.041
 next-0.039
S-0.038
Token chess
Token ID19780
Feature activation-1.05e-03

Pos logit contributions
rav+0.023
 lesson+0.021
Good+0.021
oys+0.020
bows+0.020
Neg logit contributions
 shave-0.026
 anchor-0.025
bowl-0.025
pen-0.025
 carve-0.024
Token was
Token ID373
Feature activation+2.70e-02

Pos logit contributions
 shave+0.112
 anchor+0.107
 carve+0.104
apa+0.102
pen+0.102
Neg logit contributions
rav-0.076
 lesson-0.073
Good-0.073
 next-0.071
S-0.070
Token.
Token ID13
Feature activation-2.09e-03

Pos logit contributions
 shave+0.049
 anchor+0.047
 carve+0.045
apa+0.045
bowl+0.045
Neg logit contributions
rav-0.034
 next-0.034
 lesson-0.034
Good-0.034
S-0.032
Token
Token ID220
Feature activation+6.03e-03

Pos logit contributions
rav+0.003
Good+0.003
 next+0.003
 lesson+0.003
S+0.002
Neg logit contributions
 shave-0.004
 anchor-0.004
 carve-0.004
apa-0.004
bowl-0.003
Token
Token ID198
Feature activation+1.01e-02

Pos logit contributions
 shave+0.011
 anchor+0.010
 carve+0.010
apa+0.010
bowl+0.010
Neg logit contributions
rav-0.008
 lesson-0.008
Good-0.008
 next-0.008
S-0.007
Token
Token ID198
Feature activation+9.14e-03

Pos logit contributions
 shave+0.015
 anchor+0.014
apa+0.014
 carve+0.014
bowl+0.014
Neg logit contributions
rav-0.016
 lesson-0.016
Good-0.016
 next-0.016
ider-0.015
TokenTHE
Token ID10970
Feature activation-1.28e-01

Pos logit contributions
 shave+0.015
 anchor+0.014
apa+0.014
on+0.013
bowl+0.013
Neg logit contributions
rav-0.013
bows-0.013
ider-0.013
 lesson-0.013
 next-0.012
Token<|endoftext|>
Token ID50256
Feature activation-2.91e-02

Pos logit contributions
oys+0.312
S+0.310
ider+0.310
rav+0.309
Good+0.309
Neg logit contributions
 shave-0.070
on-0.060
 carve-0.059
bowl-0.049
 harbour-0.042
TokenT
Token ID51
Feature activation+3.32e-02

Pos logit contributions
rav+0.042
 lesson+0.042
 next+0.041
Good+0.041
S+0.039
Neg logit contributions
 shave-0.047
 anchor-0.045
 carve-0.043
bowl-0.043
apa-0.043
Tokenina
Token ID1437
Feature activation+3.29e-02

Pos logit contributions
 shave+0.047
 anchor+0.045
 carve+0.043
apa+0.042
bowl+0.042
Neg logit contributions
rav-0.056
 next-0.055
Good-0.055
 lesson-0.055
S-0.052
Token was
Token ID373
Feature activation+1.91e-02

Pos logit contributions
 shave+0.064
 anchor+0.062
 carve+0.060
apa+0.060
bowl+0.059
Neg logit contributions
rav-0.038
 lesson-0.036
Good-0.036
 next-0.035
ider-0.034
Token feeling
Token ID4203
Feature activation+4.26e-02

Pos logit contributions
 shave+0.033
 anchor+0.031
 carve+0.030
apa+0.030
bowl+0.030
Neg logit contributions
rav-0.027
Good-0.026
 lesson-0.026
 next-0.026
S-0.025
Token came
Token ID1625
Feature activation-1.15e-02

Pos logit contributions
rav+0.071
ider+0.071
S+0.071
Good+0.070
oys+0.069
Neg logit contributions
 shave-0.084
 anchor-0.081
 carve-0.080
pen-0.076
ime-0.074
Token back
Token ID736
Feature activation-1.99e-02

Pos logit contributions
rav+0.019
Good+0.017
ider+0.017
OK+0.017
 lesson+0.017
Neg logit contributions
 anchor-0.017
 shave-0.017
 carve-0.016
 afloat-0.015
pen-0.015
Token.
Token ID13
Feature activation+7.18e-03

Pos logit contributions
rav+0.036
Good+0.035
 lesson+0.032
ider+0.032
OK+0.032
Neg logit contributions
 shave-0.026
 anchor-0.026
 carve-0.025
apa-0.023
pen-0.023
Token
Token ID198
Feature activation+1.50e-02

Pos logit contributions
 shave+0.011
apa+0.011
 anchor+0.010
bowl+0.010
 carve+0.010
Neg logit contributions
rav-0.011
 lesson-0.011
 next-0.011
Good-0.011
bows-0.010
Token
Token ID198
Feature activation+1.24e-02

Pos logit contributions
 shave+0.020
 anchor+0.019
apa+0.019
 carve+0.018
bowl+0.018
Neg logit contributions
rav-0.026
Good-0.026
 lesson-0.025
 next-0.025
bows-0.025
TokenTHE
Token ID10970
Feature activation-1.28e-01

Pos logit contributions
 shave+0.021
apa+0.021
ete+0.020
bowl+0.020
on+0.020
Neg logit contributions
bows-0.017
rav-0.016
fl-0.016
 lesson-0.016
 next-0.015
Token<|endoftext|>
Token ID50256
Feature activation-3.71e-02

Pos logit contributions
oys+0.334
Good+0.326
S+0.322
 lesson+0.318
rav+0.317
Neg logit contributions
on-0.045
 carve-0.035
bowl-0.035
 shave-0.030
pen-0.024
TokenOnce
Token ID7454
Feature activation-6.16e-03

Pos logit contributions
rav+0.058
 lesson+0.057
 next+0.055
Good+0.055
ider+0.054
Neg logit contributions
 shave-0.056
bowl-0.052
pen-0.052
 anchor-0.052
on-0.051
Token upon
Token ID2402
Feature activation-3.87e-02

Pos logit contributions
rav+0.008
Good+0.008
 lesson+0.008
 next+0.008
S+0.007
Neg logit contributions
 shave-0.011
 anchor-0.011
apa-0.010
 carve-0.010
bowl-0.010
Token a
Token ID257
Feature activation-1.28e-02

Pos logit contributions
rav+0.069
oys+0.065
ider+0.062
fl+0.059
bows+0.059
Neg logit contributions
 shave-0.057
on-0.052
pen-0.048
 iceberg-0.048
 anchor-0.047
Token time
Token ID640
Feature activation-2.31e-02

Pos logit contributions
rav+0.017
 next+0.016
Good+0.016
 lesson+0.016
S+0.015
Neg logit contributions
 shave-0.023
 anchor-0.022
 carve-0.021
apa-0.021
bowl-0.021
Token special
Token ID2041
Feature activation+9.75e-03

Pos logit contributions
rav+0.006
 next+0.006
 lesson+0.006
Good+0.005
S+0.005
Neg logit contributions
 shave-0.008
 anchor-0.007
 carve-0.007
apa-0.007
bowl-0.007
Token talent
Token ID7401
Feature activation-2.32e-02

Pos logit contributions
 shave+0.016
 anchor+0.015
 carve+0.014
pen+0.014
bowl+0.014
Neg logit contributions
Good-0.014
rav-0.014
S-0.014
ider-0.013
 next-0.013
Token.
Token ID13
Feature activation-5.41e-03

Pos logit contributions
rav+0.032
Good+0.031
S+0.031
ider+0.029
K+0.028
Neg logit contributions
 shave-0.040
 anchor-0.038
on-0.037
pen-0.036
apa-0.036
Token
Token ID198
Feature activation+5.81e-03

Pos logit contributions
rav+0.008
 lesson+0.008
 next+0.008
Good+0.008
ider+0.008
Neg logit contributions
 shave-0.008
apa-0.008
bowl-0.007
 carve-0.007
pen-0.007
Token
Token ID198
Feature activation+6.11e-03

Pos logit contributions
 shave+0.009
 anchor+0.008
 carve+0.008
apa+0.008
bowl+0.008
Neg logit contributions
rav-0.009
 next-0.009
Good-0.009
 lesson-0.009
S-0.009
TokenTHE
Token ID10970
Feature activation-1.27e-01

Pos logit contributions
 shave+0.010
on+0.010
pen+0.010
apa+0.010
ete+0.009
Neg logit contributions
bows-0.008
rav-0.008
oys-0.008
fl-0.008
ider-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.90e-02

Pos logit contributions
oys+0.326
S+0.322
ider+0.321
Good+0.315
rav+0.310
Neg logit contributions
on-0.044
 shave-0.043
 carve-0.039
bowl-0.026
pen-0.023
TokenOnce
Token ID7454
Feature activation+1.55e-02

Pos logit contributions
rav+0.028
 lesson+0.027
Good+0.027
 next+0.027
ider+0.025
Neg logit contributions
 shave-0.031
 anchor-0.029
bowl-0.028
 carve-0.028
pen-0.028
Token upon
Token ID2402
Feature activation-1.46e-02

Pos logit contributions
 shave+0.033
 anchor+0.031
 carve+0.030
pen+0.030
bowl+0.030
Neg logit contributions
rav-0.016
 next-0.016
 lesson-0.016
Good-0.015
S-0.015
Token a
Token ID257
Feature activation+1.04e-02

Pos logit contributions
rav+0.021
ider+0.019
oys+0.019
 lesson+0.018
bows+0.018
Neg logit contributions
 shave-0.027
on-0.023
bowl-0.023
 anchor-0.023
 carve-0.023
Token time
Token ID640
Feature activation-9.59e-03

Pos logit contributions
 shave+0.019
 anchor+0.019
 carve+0.018
apa+0.018
bowl+0.018
Neg logit contributions
rav-0.013
 next-0.013
 lesson-0.013
Good-0.013
S-0.012
Token.
Token ID13
Feature activation-3.14e-03

Pos logit contributions
 shave+0.054
on+0.051
pen+0.050
 anchor+0.050
apa+0.050
Neg logit contributions
rav-0.050
Good-0.050
 lesson-0.046
OK-0.046
S-0.044
Token
Token ID220
Feature activation-4.01e-04

Pos logit contributions
rav+0.004
 next+0.004
 lesson+0.004
Good+0.004
ider+0.004
Neg logit contributions
 shave-0.005
apa-0.005
 anchor-0.005
bowl-0.005
 carve-0.005
Token
Token ID198
Feature activation+4.41e-03

Pos logit contributions
rav+0.001
 lesson+0.001
 next+0.001
Good+0.001
S+0.000
Neg logit contributions
 shave-0.001
 anchor-0.001
apa-0.001
 carve-0.001
bowl-0.001
Token
Token ID220
Feature activation-2.91e-03

Pos logit contributions
 shave+0.007
 anchor+0.006
 carve+0.006
apa+0.006
bowl+0.006
Neg logit contributions
rav-0.007
Good-0.007
 next-0.007
 lesson-0.007
S-0.006
Token
Token ID198
Feature activation+4.04e-03

Pos logit contributions
rav+0.004
 next+0.004
Good+0.004
 lesson+0.004
S+0.003
Neg logit contributions
 shave-0.005
 anchor-0.005
 carve-0.005
apa-0.005
bowl-0.005
TokenTHE
Token ID10970
Feature activation-1.27e-01

Pos logit contributions
 shave+0.007
bowl+0.007
apa+0.007
on+0.007
 anchor+0.006
Neg logit contributions
rav-0.005
bows-0.005
ider-0.005
 lesson-0.005
 next-0.005
Token<|endoftext|>
Token ID50256
Feature activation-7.84e-03

Pos logit contributions
ider+0.283
oys+0.278
rav+0.275
S+0.270
Good+0.269
Neg logit contributions
 carve-0.091
on-0.090
 shave-0.089
bowl-0.087
apa-0.078
TokenOnce
Token ID7454
Feature activation+2.05e-02

Pos logit contributions
rav+0.011
 next+0.011
Good+0.011
 lesson+0.010
S+0.010
Neg logit contributions
 shave-0.014
 anchor-0.013
 carve-0.013
apa-0.012
isel-0.012
Token upon
Token ID2402
Feature activation-1.21e-02

Pos logit contributions
 shave+0.047
 anchor+0.045
 carve+0.044
pen+0.044
 frying+0.043
Neg logit contributions
 next-0.019
rav-0.018
 lesson-0.018
 Sue-0.016
S-0.016
Token a
Token ID257
Feature activation+1.13e-02

Pos logit contributions
rav+0.014
 lesson+0.013
ider+0.013
oys+0.013
Good+0.013
Neg logit contributions
 shave-0.024
 anchor-0.022
bowl-0.022
 carve-0.022
pen-0.021
Token time
Token ID640
Feature activation-5.30e-03

Pos logit contributions
 shave+0.021
 anchor+0.020
 carve+0.019
apa+0.019
bowl+0.019
Neg logit contributions
rav-0.014
 next-0.014
 lesson-0.014
Good-0.014
S-0.013
Token called
Token ID1444
Feature activation+5.32e-02

Pos logit contributions
 lesson+0.029
rav+0.029
Good+0.028
oys+0.026
OK+0.026
Neg logit contributions
 shave-0.027
pen-0.026
 anchor-0.026
bowl-0.026
apa-0.025
Token Zig
Token ID24992
Feature activation-3.96e-02

Pos logit contributions
 shave+0.079
apa+0.076
 anchor+0.072
ete+0.070
pen+0.070
Neg logit contributions
rav-0.086
Good-0.085
 lesson-0.085
ider-0.084
OK-0.081
Tokenzag
Token ID50183
Feature activation-1.06e-01

Pos logit contributions
Good+0.061
 lesson+0.060
rav+0.054
OK+0.054
Sweet+0.052
Neg logit contributions
pen-0.063
apa-0.061
 anchor-0.061
 shave-0.060
ete-0.059
Token.
Token ID13
Feature activation+5.30e-03

Pos logit contributions
rav+0.162
Good+0.162
Sweet+0.148
S+0.146
oys+0.144
Neg logit contributions
 anchor-0.169
 shave-0.162
bowl-0.159
ime-0.155
pen-0.155
Token Zig
Token ID24992
Feature activation-6.63e-02

Pos logit contributions
 shave+0.007
bowl+0.007
 anchor+0.006
apa+0.006
 iceberg+0.006
Neg logit contributions
 lesson-0.009
rav-0.009
Good-0.009
ider-0.009
 next-0.008
Tokenzag
Token ID50183
Feature activation-1.27e-01

Pos logit contributions
 lesson+0.104
Good+0.103
OK+0.090
Sweet+0.090
S+0.088
Neg logit contributions
pen-0.098
ime-0.098
apa-0.097
ete-0.095
igh-0.095
Token was
Token ID373
Feature activation-1.25e-02

Pos logit contributions
rav+0.187
Good+0.186
S+0.175
oys+0.175
Sweet+0.170
Neg logit contributions
 anchor-0.203
bowl-0.193
ime-0.190
pen-0.187
 shave-0.187
Token a
Token ID257
Feature activation-2.95e-02

Pos logit contributions
rav+0.020
 lesson+0.019
Good+0.018
oys+0.018
ider+0.018
Neg logit contributions
 shave-0.019
 anchor-0.019
bowl-0.018
on-0.017
pen-0.017
Token very
Token ID845
Feature activation-1.02e-03

Pos logit contributions
rav+0.044
oys+0.041
bows+0.041
ider+0.040
 lesson+0.040
Neg logit contributions
 anchor-0.048
 shave-0.047
bowl-0.043
 iceberg-0.042
 frying-0.042
Token happy
Token ID3772
Feature activation-2.78e-02

Pos logit contributions
rav+0.002
oys+0.002
bows+0.002
fl+0.002
Good+0.002
Neg logit contributions
 anchor-0.002
 shave-0.001
 frying-0.001
bowl-0.001
 carve-0.001
Token pup
Token ID15552
Feature activation-3.55e-02

Pos logit contributions
rav+0.041
Good+0.041
ider+0.041
oys+0.038
bows+0.036
Neg logit contributions
 anchor-0.047
 shave-0.043
 ig-0.042
 iceberg-0.042
 ocean-0.041
Token face
Token ID1986
Feature activation+3.49e-02

Pos logit contributions
 next+0.023
 lesson+0.023
rav+0.022
Good+0.022
S+0.021
Neg logit contributions
 shave-0.017
 anchor-0.016
pen-0.016
bowl-0.016
 carve-0.016
Token.
Token ID13
Feature activation-5.24e-03

Pos logit contributions
 shave+0.056
 anchor+0.053
 carve+0.051
apa+0.051
pen+0.050
Neg logit contributions
rav-0.054
Good-0.054
 lesson-0.052
OK-0.050
S-0.048
Token
Token ID220
Feature activation+1.98e-03

Pos logit contributions
rav+0.007
 lesson+0.007
Good+0.007
 next+0.007
S+0.006
Neg logit contributions
 shave-0.010
 anchor-0.009
 carve-0.009
isel-0.009
pen-0.009
Token
Token ID198
Feature activation+3.35e-03

Pos logit contributions
 shave+0.004
 anchor+0.004
 carve+0.004
isel+0.004
bowl+0.004
Neg logit contributions
rav-0.002
 next-0.002
Good-0.002
 lesson-0.002
S-0.002
Token
Token ID198
Feature activation+2.57e-03

Pos logit contributions
 shave+0.006
 anchor+0.006
 carve+0.006
isel+0.006
bowl+0.006
Neg logit contributions
 next-0.004
rav-0.004
 lesson-0.004
Good-0.004
S-0.004
TokenTHE
Token ID10970
Feature activation-1.26e-01

Pos logit contributions
 shave+0.005
 anchor+0.005
 carve+0.005
apa+0.005
bowl+0.005
Neg logit contributions
rav-0.003
 lesson-0.003
 next-0.003
Good-0.003
ider-0.003
Token<|endoftext|>
Token ID50256
Feature activation-8.66e-03

Pos logit contributions
ider+0.290
oys+0.284
rav+0.282
Good+0.279
 lesson+0.275
Neg logit contributions
 shave-0.090
 carve-0.087
 harbour-0.078
apa-0.076
bowl-0.074
TokenOnce
Token ID7454
Feature activation+2.04e-02

Pos logit contributions
rav+0.012
 lesson+0.012
Good+0.012
 next+0.012
S+0.011
Neg logit contributions
 shave-0.014
 anchor-0.014
 carve-0.013
apa-0.013
bowl-0.013
Token upon
Token ID2402
Feature activation-1.13e-02

Pos logit contributions
 shave+0.046
 anchor+0.043
 carve+0.043
pen+0.042
bowl+0.041
Neg logit contributions
 next-0.020
rav-0.019
 lesson-0.019
Good-0.017
S-0.017
Token a
Token ID257
Feature activation+1.29e-02

Pos logit contributions
rav+0.014
 lesson+0.012
ider+0.012
oys+0.012
Good+0.012
Neg logit contributions
 shave-0.023
 anchor-0.021
bowl-0.020
 carve-0.020
pen-0.020
Token time
Token ID640
Feature activation-4.04e-03

Pos logit contributions
 shave+0.024
 anchor+0.023
 carve+0.023
apa+0.022
isel+0.022
Neg logit contributions
 next-0.016
rav-0.016
Good-0.016
 lesson-0.016
S-0.015
Token feel
Token ID1254
Feature activation+3.79e-02

Pos logit contributions
 shave+0.005
 anchor+0.005
 carve+0.005
apa+0.005
bowl+0.005
Neg logit contributions
rav-0.005
Good-0.005
 lesson-0.005
 next-0.005
S-0.005
Token better
Token ID1365
Feature activation-2.27e-02

Pos logit contributions
 shave+0.069
 anchor+0.067
 carve+0.064
apa+0.064
bowl+0.063
Neg logit contributions
rav-0.048
 next-0.047
 lesson-0.047
Good-0.047
S-0.044
Token.
Token ID13
Feature activation-1.47e-03

Pos logit contributions
rav+0.041
Good+0.038
K+0.037
 lesson+0.037
Tweet+0.036
Neg logit contributions
 shave-0.032
 anchor-0.031
 carve-0.030
bowl-0.029
pen-0.029
Token
Token ID198
Feature activation+8.89e-03

Pos logit contributions
rav+0.002
 next+0.002
 lesson+0.002
Good+0.002
ider+0.002
Neg logit contributions
 shave-0.002
apa-0.002
 anchor-0.002
bowl-0.002
 carve-0.002
Token
Token ID198
Feature activation+7.93e-03

Pos logit contributions
 shave+0.015
 anchor+0.014
 carve+0.013
isel+0.013
pen+0.013
Neg logit contributions
 next-0.013
 lesson-0.013
rav-0.013
Good-0.013
S-0.013
TokenTHE
Token ID10970
Feature activation-1.26e-01

Pos logit contributions
 shave+0.014
apa+0.014
bowl+0.013
on+0.013
 anchor+0.013
Neg logit contributions
bows-0.010
rav-0.010
ider-0.010
 lesson-0.009
 next-0.009
Token<|endoftext|>
Token ID50256
Feature activation-1.22e-02

Pos logit contributions
ider+0.311
S+0.306
Good+0.304
oys+0.303
rav+0.302
Neg logit contributions
 carve-0.065
 shave-0.064
on-0.056
apa-0.051
bowl-0.048
TokenJohn
Token ID7554
Feature activation+3.00e-02

Pos logit contributions
rav+0.017
 next+0.017
Good+0.017
 lesson+0.017
S+0.016
Neg logit contributions
 shave-0.021
 anchor-0.020
 carve-0.019
apa-0.019
isel-0.019
Token and
Token ID290
Feature activation+5.55e-03

Pos logit contributions
 shave+0.065
 anchor+0.063
isel+0.061
apa+0.061
 carve+0.061
Neg logit contributions
 next-0.028
rav-0.027
Good-0.026
 lesson-0.026
S-0.024
Token his
Token ID465
Feature activation-2.93e-02

Pos logit contributions
 shave+0.007
 anchor+0.007
 carve+0.007
apa+0.006
bowl+0.006
Neg logit contributions
rav-0.010
 next-0.010
Good-0.010
 lesson-0.010
S-0.009
Token mother
Token ID2802
Feature activation+1.79e-02

Pos logit contributions
rav+0.043
Good+0.042
 lesson+0.042
 next+0.041
S+0.041
Neg logit contributions
 shave-0.048
 anchor-0.046
 carve-0.044
apa-0.044
bowl-0.042
Token collecting
Token ID13157
Feature activation+2.73e-02

Pos logit contributions
 shave+0.023
 anchor+0.023
 carve+0.021
apa+0.021
pen+0.021
Neg logit contributions
rav-0.019
Good-0.019
S-0.018
 lesson-0.017
 next-0.017
Token.
Token ID13
Feature activation-9.57e-04

Pos logit contributions
 shave+0.047
 anchor+0.045
 carve+0.043
apa+0.042
bowl+0.042
Neg logit contributions
rav-0.039
Good-0.037
 lesson-0.037
 next-0.036
S-0.035
Token
Token ID220
Feature activation+2.97e-03

Pos logit contributions
rav+0.001
 lesson+0.001
 next+0.001
Good+0.001
S+0.001
Neg logit contributions
 shave-0.002
 anchor-0.002
apa-0.002
 carve-0.002
bowl-0.001
Token
Token ID198
Feature activation+8.92e-03

Pos logit contributions
 shave+0.005
apa+0.005
 anchor+0.005
 carve+0.005
bowl+0.005
Neg logit contributions
rav-0.004
 lesson-0.004
 next-0.004
Good-0.004
S-0.004
Token
Token ID198
Feature activation+7.81e-03

Pos logit contributions
 shave+0.017
 anchor+0.017
 carve+0.016
isel+0.016
apa+0.015
Neg logit contributions
rav-0.011
 next-0.011
Good-0.011
 lesson-0.011
S-0.010
TokenTHE
Token ID10970
Feature activation-1.26e-01

Pos logit contributions
 shave+0.014
 anchor+0.014
apa+0.013
bowl+0.013
 carve+0.013
Neg logit contributions
rav-0.010
 lesson-0.009
 next-0.009
Good-0.009
bows-0.009
Token<|endoftext|>
Token ID50256
Feature activation-1.73e-02

Pos logit contributions
ider+0.327
oys+0.323
S+0.321
Good+0.319
rav+0.317
Neg logit contributions
 shave-0.046
on-0.044
 carve-0.042
bowl-0.034
apa-0.030
TokenOnce
Token ID7454
Feature activation+1.64e-02

Pos logit contributions
rav+0.024
 lesson+0.024
 next+0.024
Good+0.024
S+0.023
Neg logit contributions
 shave-0.029
 anchor-0.028
 carve-0.027
apa-0.026
bowl-0.026
Token upon
Token ID2402
Feature activation-1.51e-02

Pos logit contributions
 shave+0.037
 anchor+0.035
 carve+0.034
pen+0.034
bowl+0.033
Neg logit contributions
 next-0.016
rav-0.016
 lesson-0.015
Good-0.014
S-0.014
Token a
Token ID257
Feature activation+1.02e-02

Pos logit contributions
rav+0.020
ider+0.018
oys+0.018
 lesson+0.017
bows+0.017
Neg logit contributions
 shave-0.029
 anchor-0.026
bowl-0.026
 carve-0.026
on-0.025
Token time
Token ID640
Feature activation-7.59e-03

Pos logit contributions
 shave+0.019
 anchor+0.019
 carve+0.018
apa+0.018
isel+0.017
Neg logit contributions
 next-0.012
rav-0.012
 lesson-0.012
Good-0.012
S-0.012
Token together
Token ID1978
Feature activation-6.27e-02

Pos logit contributions
rav+0.064
K+0.060
Good+0.057
fl+0.056
Sweet+0.056
Neg logit contributions
 shave-0.078
pen-0.078
bowl-0.077
 anchor-0.073
on-0.073
Token.
Token ID13
Feature activation-2.04e-02

Pos logit contributions
rav+0.094
Good+0.091
S+0.084
oys+0.083
 lesson+0.083
Neg logit contributions
 shave-0.101
 anchor-0.098
pen-0.097
bowl-0.094
 carve-0.094
Token
Token ID198
Feature activation-6.91e-03

Pos logit contributions
rav+0.027
 next+0.026
 lesson+0.026
Good+0.026
ider+0.025
Neg logit contributions
 shave-0.036
 anchor-0.034
 carve-0.033
apa-0.033
bowl-0.033
Token
Token ID198
Feature activation-5.72e-03

Pos logit contributions
 lesson+0.009
 next+0.009
rav+0.008
Good+0.008
S+0.008
Neg logit contributions
 shave-0.013
 anchor-0.013
isel-0.012
 carve-0.012
pen-0.012
TokenThe
Token ID464
Feature activation-3.58e-02

Pos logit contributions
rav+0.007
bows+0.007
ider+0.007
 lesson+0.007
fl+0.007
Neg logit contributions
 shave-0.010
bowl-0.009
pen-0.009
apa-0.009
on-0.009
Token moral
Token ID6573
Feature activation-1.25e-01

Pos logit contributions
rav+0.041
bows+0.038
Good+0.036
ider+0.036
S+0.035
Neg logit contributions
 anchor-0.072
 shave-0.070
 iceberg-0.065
 carve-0.064
 boiling-0.064
Token of
Token ID286
Feature activation-7.50e-03

Pos logit contributions
rav+0.217
Good+0.205
fl+0.201
ider+0.201
 next+0.199
Neg logit contributions
 shave-0.171
 anchor-0.164
isel-0.152
apa-0.151
pen-0.151
Token the
Token ID262
Feature activation-1.71e-02

Pos logit contributions
rav+0.012
ider+0.012
Good+0.012
bows+0.012
 lesson+0.011
Neg logit contributions
 shave-0.011
 anchor-0.011
 carve-0.010
bowl-0.010
on-0.010
Token story
Token ID1621
Feature activation-6.77e-02

Pos logit contributions
rav+0.028
Good+0.028
 next+0.027
 lesson+0.027
S+0.026
Neg logit contributions
 shave-0.025
 anchor-0.024
 carve-0.023
apa-0.023
bowl-0.022
Token is
Token ID318
Feature activation-5.62e-03

Pos logit contributions
Good+0.083
rav+0.082
 seek+0.079
S+0.079
ider+0.077
Neg logit contributions
 shave-0.119
 anchor-0.117
on-0.112
apa-0.111
isel-0.110
Token:
Token ID25
Feature activation-1.64e-02

Pos logit contributions
 next+0.007
 lesson+0.007
rav+0.007
Good+0.007
S+0.006
Neg logit contributions
 shave-0.011
 anchor-0.010
 carve-0.010
pen-0.010
bowl-0.010
Token were
Token ID547
Feature activation+1.46e-02

Pos logit contributions
rav+0.009
Good+0.009
 lesson+0.009
 next+0.009
S+0.008
Neg logit contributions
 shave-0.012
 anchor-0.011
 carve-0.011
apa-0.011
bowl-0.011
Token all
Token ID477
Feature activation+9.75e-03

Pos logit contributions
 shave+0.027
 anchor+0.027
 carve+0.026
apa+0.026
bowl+0.025
Neg logit contributions
rav-0.018
Good-0.017
 lesson-0.017
 next-0.016
S-0.016
Token happy
Token ID3772
Feature activation-1.61e-02

Pos logit contributions
 shave+0.018
 anchor+0.017
 carve+0.017
bowl+0.017
apa+0.017
Neg logit contributions
rav-0.012
Good-0.012
 lesson-0.012
 next-0.012
S-0.011
Token.
Token ID13
Feature activation-1.71e-02

Pos logit contributions
rav+0.023
Good+0.023
K+0.022
S+0.021
 lesson+0.021
Neg logit contributions
 anchor-0.026
 shave-0.025
bowl-0.024
 carve-0.024
pen-0.024
Token The
Token ID383
Feature activation-2.96e-02

Pos logit contributions
rav+0.023
 lesson+0.022
Good+0.022
 next+0.022
ider+0.021
Neg logit contributions
 shave-0.029
 anchor-0.028
apa-0.028
 carve-0.027
bowl-0.027
Token moral
Token ID6573
Feature activation-1.25e-01

Pos logit contributions
rav+0.038
Good+0.035
bows+0.034
S+0.034
ider+0.034
Neg logit contributions
 anchor-0.056
 shave-0.053
 carve-0.051
apa-0.050
 boiling-0.048
Token of
Token ID286
Feature activation-9.54e-03

Pos logit contributions
rav+0.216
Good+0.205
ider+0.204
 next+0.199
 seek+0.198
Neg logit contributions
 shave-0.173
 anchor-0.167
apa-0.155
 carve-0.153
bowl-0.152
Token the
Token ID262
Feature activation-1.72e-02

Pos logit contributions
rav+0.014
Good+0.014
 lesson+0.013
 next+0.013
ider+0.013
Neg logit contributions
 shave-0.016
 anchor-0.015
 carve-0.014
apa-0.014
bowl-0.014
Token story
Token ID1621
Feature activation-7.05e-02

Pos logit contributions
rav+0.029
Good+0.028
 next+0.028
 lesson+0.028
S+0.027
Neg logit contributions
 shave-0.024
 anchor-0.024
 carve-0.022
apa-0.022
bowl-0.021
Token is
Token ID318
Feature activation-6.26e-03

Pos logit contributions
Good+0.097
S+0.095
 seek+0.094
rav+0.094
ider+0.093
Neg logit contributions
 anchor-0.109
 shave-0.108
apa-0.106
on-0.106
igh-0.104
Token to
Token ID284
Feature activation-4.62e-03

Pos logit contributions
 next+0.007
 lesson+0.007
rav+0.007
Good+0.007
S+0.006
Neg logit contributions
 shave-0.013
 anchor-0.012
 carve-0.012
pen-0.012
bowl-0.012
Token the
Token ID262
Feature activation+1.01e-02

Pos logit contributions
 shave+0.020
 anchor+0.019
 carve+0.018
pen+0.018
bowl+0.018
Neg logit contributions
rav-0.019
Good-0.018
 lesson-0.018
 next-0.018
ider-0.018
Token big
Token ID1263
Feature activation+6.22e-02

Pos logit contributions
 shave+0.016
 anchor+0.016
 carve+0.015
bowl+0.015
apa+0.015
Neg logit contributions
rav-0.015
Good-0.014
 lesson-0.014
 next-0.014
ider-0.014
Token forest
Token ID8222
Feature activation-1.85e-02

Pos logit contributions
 shave+0.101
apa+0.093
 anchor+0.093
 carve+0.093
bowl+0.089
Neg logit contributions
 next-0.094
 lesson-0.093
Good-0.092
rav-0.092
S-0.086
Token.
Token ID13
Feature activation-1.64e-02

Pos logit contributions
rav+0.031
Good+0.030
 lesson+0.029
S+0.028
oys+0.028
Neg logit contributions
pen-0.026
 anchor-0.025
bowl-0.025
 shave-0.025
 carve-0.024
Token The
Token ID383
Feature activation-3.11e-02

Pos logit contributions
rav+0.020
Good+0.020
 next+0.020
 lesson+0.020
S+0.019
Neg logit contributions
 shave-0.031
 anchor-0.030
 carve-0.028
apa-0.028
bowl-0.028
Token moral
Token ID6573
Feature activation-1.25e-01

Pos logit contributions
rav+0.051
ider+0.049
bows+0.049
edly+0.048
S+0.046
Neg logit contributions
 anchor-0.046
igh-0.040
 harbour-0.040
 shave-0.040
 iceberg-0.039
Token of
Token ID286
Feature activation-2.47e-03

Pos logit contributions
rav+0.229
ider+0.224
Good+0.218
fl+0.215
bows+0.214
Neg logit contributions
 shave-0.151
 anchor-0.146
ime-0.138
apa-0.135
on-0.135
Token the
Token ID262
Feature activation-1.23e-02

Pos logit contributions
rav+0.004
ider+0.004
Good+0.004
bows+0.004
 lesson+0.004
Neg logit contributions
 shave-0.004
 anchor-0.004
bowl-0.003
 carve-0.003
apa-0.003
Token story
Token ID1621
Feature activation-6.59e-02

Pos logit contributions
rav+0.022
ider+0.022
Good+0.021
bows+0.021
S+0.021
Neg logit contributions
 anchor-0.015
 shave-0.015
apa-0.014
 iceberg-0.014
 carve-0.013
Token is
Token ID318
Feature activation-8.41e-03

Pos logit contributions
Good+0.086
rav+0.085
S+0.085
ider+0.084
 seek+0.084
Neg logit contributions
 anchor-0.107
 shave-0.106
igh-0.103
pen-0.103
on-0.102
Token that
Token ID326
Feature activation+2.78e-02

Pos logit contributions
 lesson+0.010
 next+0.010
rav+0.010
Good+0.010
S+0.009
Neg logit contributions
 shave-0.016
 anchor-0.015
 carve-0.015
bowl-0.015
pen-0.014
Token breakfast
Token ID12607
Feature activation-3.42e-03

Pos logit contributions
rav+0.026
Good+0.025
ider+0.024
oys+0.024
S+0.024
Neg logit contributions
 shave-0.023
 anchor-0.021
 carve-0.021
apa-0.020
bowl-0.020
Token.
Token ID13
Feature activation-2.09e-02

Pos logit contributions
rav+0.005
Good+0.005
 lesson+0.005
S+0.004
OK+0.004
Neg logit contributions
 shave-0.006
 carve-0.005
 anchor-0.005
bowl-0.005
on-0.005
Token
Token ID198
Feature activation-5.72e-03

Pos logit contributions
rav+0.032
ider+0.032
 seek+0.031
 next+0.031
 lesson+0.030
Neg logit contributions
 shave-0.030
apa-0.030
 carve-0.029
 Jason-0.028
 Mick-0.028
Token
Token ID198
Feature activation-4.80e-03

Pos logit contributions
 next+0.009
 lesson+0.009
Good+0.009
rav+0.009
S+0.009
Neg logit contributions
 shave-0.009
 anchor-0.009
isel-0.008
 carve-0.008
pen-0.008
TokenThe
Token ID464
Feature activation-3.24e-02

Pos logit contributions
rav+0.006
bows+0.006
 lesson+0.005
ider+0.005
fl+0.005
Neg logit contributions
 shave-0.009
 carve-0.008
 anchor-0.008
bowl-0.008
apa-0.008
Token moral
Token ID6573
Feature activation-1.25e-01

Pos logit contributions
rav+0.038
bows+0.036
oys+0.034
ider+0.033
 seek+0.033
Neg logit contributions
 anchor-0.062
 shave-0.061
 fork-0.058
 boiling-0.058
 frying-0.058
Token value
Token ID1988
Feature activation-3.43e-02

Pos logit contributions
rav+0.231
ider+0.218
Good+0.214
fl+0.214
 seek+0.214
Neg logit contributions
 shave-0.152
 anchor-0.144
apa-0.134
ime-0.134
on-0.133
Token of
Token ID286
Feature activation+3.97e-03

Pos logit contributions
Good+0.051
rav+0.051
ider+0.050
 next+0.049
 lesson+0.048
Neg logit contributions
 shave-0.054
 anchor-0.052
apa-0.050
bowl-0.049
 carve-0.049
Token the
Token ID262
Feature activation+1.94e-03

Pos logit contributions
 shave+0.006
 anchor+0.005
 boiling+0.005
 carve+0.005
on+0.005
Neg logit contributions
rav-0.007
ider-0.007
bows-0.007
Good-0.006
OK-0.006
Token story
Token ID1621
Feature activation-5.72e-02

Pos logit contributions
 shave+0.002
 anchor+0.002
 carve+0.002
apa+0.002
 iceberg+0.002
Neg logit contributions
rav-0.004
ider-0.004
Good-0.004
oys-0.003
S-0.003
Token is
Token ID318
Feature activation-4.56e-04

Pos logit contributions
Tweet+0.094
Good+0.094
K+0.091
 seek+0.090
Wh+0.090
Neg logit contributions
on-0.072
washer-0.071
 shave-0.070
apa-0.067
 anchor-0.066
Token scary
Token ID14343
Feature activation+1.20e-02

Pos logit contributions
rav+0.053
Good+0.051
 lesson+0.051
 next+0.051
S+0.049
Neg logit contributions
 shave-0.069
 anchor-0.067
 carve-0.064
apa-0.063
bowl-0.063
Token for
Token ID329
Feature activation-1.85e-02

Pos logit contributions
 anchor+0.018
 shave+0.018
bowl+0.017
pen+0.017
ime+0.017
Neg logit contributions
rav-0.018
S-0.018
Good-0.018
K-0.017
 lesson-0.017
Token everyone
Token ID2506
Feature activation-1.04e-02

Pos logit contributions
rav+0.028
 lesson+0.027
Good+0.026
ider+0.026
OK+0.026
Neg logit contributions
 shave-0.030
 anchor-0.030
 carve-0.028
pen-0.027
ime-0.027
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
rav+0.014
Good+0.013
 lesson+0.013
 next+0.013
S+0.013
Neg logit contributions
 shave-0.018
 anchor-0.018
 carve-0.017
bowl-0.017
apa-0.017
Token The
Token ID383
Feature activation-3.22e-02

Pos logit contributions
ider+0.031
rav+0.030
 next+0.029
 seek+0.029
 lesson+0.029
Neg logit contributions
 shave-0.029
on-0.028
apa-0.028
bowl-0.028
 carve-0.027
Token moral
Token ID6573
Feature activation-1.25e-01

Pos logit contributions
rav+0.050
bows+0.047
ider+0.047
S+0.046
Good+0.046
Neg logit contributions
 anchor-0.051
 shave-0.047
 carve-0.044
 iceberg-0.043
apa-0.043
Token of
Token ID286
Feature activation-4.72e-03

Pos logit contributions
rav+0.243
ider+0.239
fl+0.234
 seek+0.229
Good+0.227
Neg logit contributions
 shave-0.132
 anchor-0.125
ime-0.121
on-0.120
pen-0.115
Token the
Token ID262
Feature activation-1.32e-02

Pos logit contributions
ider+0.009
rav+0.008
bows+0.008
Good+0.008
OK+0.008
Neg logit contributions
 shave-0.006
 anchor-0.006
on-0.006
 carve-0.006
bowl-0.006
Token story
Token ID1621
Feature activation-6.78e-02

Pos logit contributions
rav+0.024
ider+0.023
Good+0.023
bows+0.023
S+0.022
Neg logit contributions
 anchor-0.016
 shave-0.016
 carve-0.014
apa-0.014
 iceberg-0.014
Token is
Token ID318
Feature activation-6.81e-03

Pos logit contributions
 seek+0.106
Good+0.106
S+0.103
ider+0.102
K+0.101
Neg logit contributions
on-0.089
igh-0.085
pen-0.085
 anchor-0.084
 shave-0.083
Token that
Token ID326
Feature activation+2.71e-02

Pos logit contributions
rav+0.009
Good+0.009
 next+0.009
 lesson+0.009
S+0.009
Neg logit contributions
 shave-0.012
 anchor-0.011
 carve-0.011
apa-0.011
bowl-0.011
Token together
Token ID1978
Feature activation-5.09e-02

Pos logit contributions
Good+0.023
rav+0.023
OK+0.021
ider+0.020
 lesson+0.020
Neg logit contributions
 shave-0.026
 anchor-0.025
bowl-0.024
 carve-0.024
pen-0.024
Token.
Token ID13
Feature activation-2.05e-02

Pos logit contributions
rav+0.092
Good+0.088
 lesson+0.082
oys+0.082
OK+0.080
Neg logit contributions
 shave-0.070
pen-0.066
 anchor-0.065
 carve-0.064
bowl-0.063
Token
Token ID198
Feature activation-6.38e-03

Pos logit contributions
ider+0.033
rav+0.032
 next+0.031
 lesson+0.031
 seek+0.031
Neg logit contributions
apa-0.029
 shave-0.029
bowl-0.028
on-0.027
ete-0.027
Token
Token ID198
Feature activation-4.18e-03

Pos logit contributions
rav+0.011
Good+0.011
 next+0.011
 lesson+0.011
ider+0.010
Neg logit contributions
 shave-0.009
 anchor-0.008
apa-0.008
 carve-0.008
bowl-0.008
TokenThe
Token ID464
Feature activation-3.24e-02

Pos logit contributions
bows+0.006
rav+0.006
sed+0.006
ider+0.006
ses+0.006
Neg logit contributions
 shave-0.007
on-0.006
bowl-0.006
pen-0.006
apa-0.006
Token moral
Token ID6573
Feature activation-1.25e-01

Pos logit contributions
rav+0.045
bows+0.043
ider+0.041
oys+0.040
seek+0.040
Neg logit contributions
 anchor-0.057
 ig-0.051
 iceberg-0.050
apa-0.050
 ocean-0.049
Token of
Token ID286
Feature activation-7.09e-03

Pos logit contributions
rav+0.248
ider+0.240
fl+0.230
Good+0.229
 seek+0.229
Neg logit contributions
 shave-0.133
 anchor-0.128
ime-0.122
apa-0.117
on-0.117
Token the
Token ID262
Feature activation-1.35e-02

Pos logit contributions
rav+0.011
Good+0.011
ider+0.010
 lesson+0.010
 next+0.010
Neg logit contributions
 shave-0.011
 anchor-0.011
 carve-0.010
apa-0.010
bowl-0.010
Token story
Token ID1621
Feature activation-6.83e-02

Pos logit contributions
rav+0.025
Good+0.024
ider+0.024
S+0.023
 next+0.023
Neg logit contributions
 shave-0.017
 anchor-0.016
apa-0.015
 carve-0.015
 iceberg-0.014
Token is
Token ID318
Feature activation-4.17e-03

Pos logit contributions
Good+0.106
 seek+0.103
S+0.103
Tweet+0.102
ider+0.100
Neg logit contributions
apa-0.090
 anchor-0.090
on-0.089
 shave-0.089
igh-0.088
Token:
Token ID25
Feature activation-1.52e-02

Pos logit contributions
rav+0.005
 next+0.005
 lesson+0.005
Good+0.005
S+0.005
Neg logit contributions
 shave-0.008
 anchor-0.008
 carve-0.007
apa-0.007
bowl-0.007
Token rubbed
Token ID31862
Feature activation+5.38e-03

Pos logit contributions
 shave+0.017
 anchor+0.016
 carve+0.016
apa+0.015
bowl+0.015
Neg logit contributions
rav-0.014
Good-0.014
 lesson-0.014
 next-0.013
S-0.013
Token it
Token ID340
Feature activation+1.35e-02

Pos logit contributions
 shave+0.008
apa+0.007
 anchor+0.007
on+0.007
 carve+0.007
Neg logit contributions
rav-0.009
ider-0.009
Good-0.009
 lesson-0.009
 seek-0.008
Token to
Token ID284
Feature activation+7.10e-03

Pos logit contributions
 shave+0.024
 anchor+0.023
 carve+0.023
bowl+0.022
apa+0.022
Neg logit contributions
rav-0.019
Good-0.017
 lesson-0.016
S-0.016
oys-0.016
Token feel
Token ID1254
Feature activation+4.14e-02

Pos logit contributions
 shave+0.012
 anchor+0.011
 carve+0.011
apa+0.010
pen+0.010
Neg logit contributions
rav-0.011
Good-0.011
 lesson-0.011
 next-0.010
ider-0.010
Token how
Token ID703
Feature activation+1.32e-02

Pos logit contributions
 shave+0.085
 anchor+0.083
acket+0.080
pen+0.080
igh+0.080
Neg logit contributions
 lesson-0.045
 next-0.045
Good-0.042
S-0.039
rav-0.035
Token soft
Token ID2705
Feature activation+7.54e-02

Pos logit contributions
 shave+0.024
 anchor+0.023
 carve+0.022
bowl+0.022
apa+0.022
Neg logit contributions
rav-0.017
 next-0.017
 lesson-0.017
Good-0.017
S-0.016
Token it
Token ID340
Feature activation+5.75e-03

Pos logit contributions
 shave+0.110
 anchor+0.109
on+0.105
 boiling+0.101
pen+0.101
Neg logit contributions
rav-0.130
ider-0.115
 lesson-0.113
oys-0.112
Good-0.111
Token was
Token ID373
Feature activation+1.85e-02

Pos logit contributions
 shave+0.010
 anchor+0.009
 carve+0.009
bowl+0.009
pen+0.008
Neg logit contributions
rav-0.009
oys-0.008
Good-0.008
S-0.008
 lesson-0.008
Token.
Token ID13
Feature activation+7.53e-05

Pos logit contributions
 shave+0.032
 anchor+0.031
 carve+0.030
apa+0.029
bowl+0.029
Neg logit contributions
rav-0.026
Good-0.025
 lesson-0.024
 next-0.024
S-0.023
Token She
Token ID1375
Feature activation+6.04e-03

Pos logit contributions
apa+0.000
 shave+0.000
 carve+0.000
pen+0.000
ete+0.000
Neg logit contributions
rav-0.000
 seek-0.000
 lesson-0.000
Good-0.000
 next-0.000
Token broke
Token ID6265
Feature activation-2.90e-02

Pos logit contributions
 shave+0.011
 anchor+0.010
 carve+0.010
apa+0.010
bowl+0.009
Neg logit contributions
rav-0.009
Good-0.008
 lesson-0.008
 next-0.008
oys-0.008
Token and
Token ID290
Feature activation+4.19e-03

Pos logit contributions
rav+0.007
Good+0.007
 lesson+0.007
S+0.007
OK+0.006
Neg logit contributions
 shave-0.011
 anchor-0.011
 carve-0.011
pen-0.011
apa-0.011
Token thanked
Token ID26280
Feature activation+4.17e-02

Pos logit contributions
 anchor+0.006
 shave+0.006
 carve+0.006
apa+0.006
 harbour+0.006
Neg logit contributions
ider-0.007
rav-0.007
bows-0.007
Good-0.007
oys-0.006
Token him
Token ID683
Feature activation+2.90e-02

Pos logit contributions
 shave+0.053
 anchor+0.050
 carve+0.047
apa+0.045
pen+0.045
Neg logit contributions
rav-0.078
Good-0.076
 lesson-0.075
ider-0.075
S-0.074
Token for
Token ID329
Feature activation+7.71e-03

Pos logit contributions
 shave+0.052
apa+0.049
pen+0.049
 anchor+0.049
on+0.047
Neg logit contributions
Good-0.038
rav-0.036
OK-0.035
S-0.035
bows-0.034
Token the
Token ID262
Feature activation+1.65e-02

Pos logit contributions
 shave+0.013
 anchor+0.013
 carve+0.012
 boiling+0.011
on+0.011
Neg logit contributions
rav-0.012
bows-0.011
ider-0.011
Good-0.011
 lesson-0.011
Token warm
Token ID5814
Feature activation+7.41e-02

Pos logit contributions
 shave+0.029
 anchor+0.028
 carve+0.025
apa+0.025
bowl+0.025
Neg logit contributions
rav-0.023
S-0.023
ider-0.022
Good-0.021
bows-0.020
Token welcome
Token ID7062
Feature activation+3.97e-02

Pos logit contributions
 shave+0.136
 anchor+0.131
 carve+0.125
apa+0.124
bowl+0.123
Neg logit contributions
rav-0.095
Good-0.093
 next-0.090
 lesson-0.090
S-0.088
Token.
Token ID13
Feature activation+3.32e-03

Pos logit contributions
 shave+0.067
 anchor+0.065
apa+0.061
 carve+0.060
pen+0.060
Neg logit contributions
rav-0.057
Good-0.057
 lesson-0.056
ider-0.053
 next-0.053
Token Then
Token ID3244
Feature activation-1.52e-02

Pos logit contributions
 shave+0.006
 anchor+0.006
 carve+0.005
isel+0.005
apa+0.005
Neg logit contributions
rav-0.005
 next-0.004
Good-0.004
 lesson-0.004
S-0.004
Token he
Token ID339
Feature activation+1.87e-02

Pos logit contributions
rav+0.025
 next+0.024
Good+0.024
 lesson+0.024
S+0.023
Neg logit contributions
 shave-0.023
 anchor-0.022
 carve-0.021
apa-0.020
bowl-0.020
Token ran
Token ID4966
Feature activation-2.11e-02

Pos logit contributions
 anchor+0.030
 shave+0.030
apa+0.028
 carve+0.028
ete+0.027
Neg logit contributions
rav-0.028
bows-0.028
Good-0.027
 lesson-0.027
ider-0.027
Token did
Token ID750
Feature activation-6.31e-03

Pos logit contributions
 shave+0.029
 anchor+0.027
 carve+0.026
bowl+0.024
 fork+0.023
Neg logit contributions
rav-0.037
ider-0.035
Good-0.035
S-0.033
oys-0.033
Token not
Token ID407
Feature activation-2.87e-02

Pos logit contributions
rav+0.008
fl+0.008
Good+0.007
oys+0.007
K+0.007
Neg logit contributions
 shave-0.013
 carve-0.012
 anchor-0.011
pen-0.011
isel-0.011
Token care
Token ID1337
Feature activation+1.90e-02

Pos logit contributions
rav+0.038
Good+0.037
 next+0.037
 lesson+0.037
S+0.035
Neg logit contributions
 shave-0.052
 anchor-0.049
 carve-0.048
bowl-0.047
apa-0.047
Token about
Token ID546
Feature activation+9.26e-03

Pos logit contributions
 shave+0.035
 anchor+0.033
 carve+0.032
apa+0.032
bowl+0.032
Neg logit contributions
rav-0.024
Good-0.024
 next-0.023
 lesson-0.023
S-0.022
Token the
Token ID262
Feature activation+2.59e-02

Pos logit contributions
 shave+0.014
 anchor+0.013
 carve+0.012
pen+0.012
apa+0.011
Neg logit contributions
rav-0.016
Good-0.015
ider-0.015
 lesson-0.015
OK-0.015
Token hole
Token ID7604
Feature activation+7.61e-02

Pos logit contributions
 shave+0.044
 anchor+0.043
 carve+0.040
pen+0.039
bowl+0.039
Neg logit contributions
rav-0.037
Good-0.035
 lesson-0.034
 next-0.034
S-0.034
Token and
Token ID290
Feature activation+2.25e-02

Pos logit contributions
 anchor+0.113
 shave+0.112
bowl+0.106
pen+0.106
on+0.104
Neg logit contributions
rav-0.126
Good-0.122
 lesson-0.113
S-0.111
OK-0.111
Token the
Token ID262
Feature activation+1.88e-03

Pos logit contributions
 shave+0.034
 anchor+0.032
 carve+0.031
bowl+0.028
pen+0.028
Neg logit contributions
rav-0.039
Good-0.037
ider-0.036
S-0.036
 lesson-0.036
Token stain
Token ID18746
Feature activation-6.47e-03

Pos logit contributions
 shave+0.003
 anchor+0.003
 carve+0.003
bowl+0.003
pen+0.003
Neg logit contributions
rav-0.003
Good-0.003
 lesson-0.003
S-0.003
 next-0.003
Token.
Token ID13
Feature activation+1.60e-03

Pos logit contributions
rav+0.011
Good+0.011
S+0.010
 lesson+0.010
OK+0.010
Neg logit contributions
 shave-0.009
 anchor-0.009
on-0.009
pen-0.009
bowl-0.008
Token He
Token ID679
Feature activation+1.39e-02

Pos logit contributions
 shave+0.002
 anchor+0.002
apa+0.002
 carve+0.002
isel+0.002
Neg logit contributions
rav-0.003
 lesson-0.003
Good-0.003
 next-0.003
ider-0.003
Token little
Token ID1310
Feature activation+2.59e-02

Pos logit contributions
 next+0.011
 lesson+0.011
rav+0.010
 seek+0.009
 songs+0.009
Neg logit contributions
 shave-0.014
pen-0.013
igh-0.013
 harbour-0.013
Son-0.013
Token arms
Token ID5101
Feature activation+7.11e-02

Pos logit contributions
 shave+0.037
 anchor+0.035
apa+0.034
 carve+0.034
pen+0.033
Neg logit contributions
 next-0.043
rav-0.042
 lesson-0.042
Good-0.041
 seek-0.039
Token to
Token ID284
Feature activation+8.36e-03

Pos logit contributions
 shave+0.143
 anchor+0.139
 carve+0.137
isel+0.136
apa+0.136
Neg logit contributions
 next-0.079
 lesson-0.070
rav-0.067
Good-0.066
bows-0.064
Token reach
Token ID3151
Feature activation+4.41e-02

Pos logit contributions
 shave+0.013
 anchor+0.012
apa+0.012
bowl+0.012
 carve+0.011
Neg logit contributions
 next-0.013
 lesson-0.013
Good-0.013
rav-0.012
 seek-0.012
Token the
Token ID262
Feature activation+4.01e-02

Pos logit contributions
 shave+0.089
apa+0.084
bowl+0.084
 carve+0.084
pen+0.083
Neg logit contributions
 next-0.051
 lesson-0.047
Good-0.041
rav-0.041
 seek-0.039
Token stones
Token ID14966
Feature activation+7.22e-02

Pos logit contributions
 shave+0.061
 anchor+0.055
apa+0.054
bowl+0.054
 carve+0.054
Neg logit contributions
 next-0.067
 lesson-0.066
Good-0.064
rav-0.063
ider-0.060
Token near
Token ID1474
Feature activation+3.04e-02

Pos logit contributions
 shave+0.150
apa+0.142
isel+0.142
 anchor+0.140
 carve+0.138
Neg logit contributions
 next-0.081
 lesson-0.068
 seek-0.064
Good-0.064
rav-0.063
Token the
Token ID262
Feature activation+2.62e-02

Pos logit contributions
 shave+0.060
isel+0.057
apa+0.057
 carve+0.057
 frying+0.057
Neg logit contributions
 next-0.037
 lesson-0.032
 Sue-0.030
Good-0.029
 seek-0.028
Token top
Token ID1353
Feature activation+1.26e-02

Pos logit contributions
 shave+0.038
isel+0.034
apa+0.034
pen+0.033
 carve+0.033
Neg logit contributions
 next-0.048
 lesson-0.045
Good-0.044
rav-0.043
 seek-0.042
Token.
Token ID13
Feature activation+1.63e-03

Pos logit contributions
 shave+0.024
 anchor+0.023
apa+0.022
 carve+0.022
isel+0.022
Neg logit contributions
 next-0.015
 lesson-0.015
rav-0.015
Good-0.015
S-0.014
Token Jack
Token ID3619
Feature activation+1.14e-02

Pos logit contributions
 shave+0.003
 anchor+0.003
igh+0.003
 carve+0.003
pen+0.002
Neg logit contributions
 next-0.002
 lesson-0.002
Good-0.002
 Sue-0.002
OK-0.002
Token to
Token ID284
Feature activation+1.73e-02

Pos logit contributions
Good+0.044
rav+0.044
ider+0.044
OK+0.041
bows+0.041
Neg logit contributions
 shave-0.049
 anchor-0.049
 carve-0.046
bowl-0.044
isel-0.043
Token because
Token ID780
Feature activation+2.06e-02

Pos logit contributions
 shave+0.031
apa+0.031
 anchor+0.031
isel+0.030
on+0.030
Neg logit contributions
 next-0.022
 lesson-0.021
rav-0.020
Good-0.020
S-0.020
Token it
Token ID340
Feature activation-5.48e-04

Pos logit contributions
 anchor+0.033
 boiling+0.029
on+0.029
 shave+0.028
 ig+0.027
Neg logit contributions
rav-0.034
ider-0.033
Good-0.031
OK-0.031
bows-0.031
Token was
Token ID373
Feature activation+1.03e-02

Pos logit contributions
rav+0.001
 next+0.001
 lesson+0.001
Good+0.001
S+0.000
Neg logit contributions
 shave-0.001
 anchor-0.001
 carve-0.001
apa-0.001
bowl-0.001
Token too
Token ID1165
Feature activation-6.20e-02

Pos logit contributions
 anchor+0.018
 shave+0.017
 boiling+0.017
 iceberg+0.016
bowl+0.016
Neg logit contributions
rav-0.016
Good-0.014
OK-0.013
ider-0.013
oys-0.013
Token deep
Token ID2769
Feature activation+9.00e-02

Pos logit contributions
 next+0.084
 lesson+0.083
Good+0.080
rav+0.080
S+0.079
Neg logit contributions
 shave-0.112
 anchor-0.104
 carve-0.103
pen-0.103
apa-0.102
Token.
Token ID13
Feature activation+2.08e-03

Pos logit contributions
 anchor+0.133
 shave+0.128
 boiling+0.122
pen+0.121
bowl+0.121
Neg logit contributions
rav-0.154
Good-0.143
S-0.139
OK-0.135
 lesson-0.133
Token Ellie
Token ID44801
Feature activation-4.55e-02

Pos logit contributions
apa+0.002
 shave+0.002
bowl+0.002
 Peter+0.002
 harbour+0.002
Neg logit contributions
rav-0.004
S-0.004
Good-0.004
ider-0.004
 seek-0.003
Token didn
Token ID1422
Feature activation-2.35e-02

Pos logit contributions
rav+0.071
Good+0.067
 lesson+0.063
bows+0.063
ider+0.063
Neg logit contributions
 anchor-0.073
 shave-0.073
apa-0.071
bowl-0.071
pen-0.070
Token't
Token ID470
Feature activation-4.79e-02

Pos logit contributions
 next+0.022
 lesson+0.021
Good+0.020
rav+0.019
S+0.018
Neg logit contributions
 shave-0.052
 anchor-0.051
bowl-0.050
 carve-0.050
apa-0.049
Token listen
Token ID6004
Feature activation-2.11e-02

Pos logit contributions
rav+0.080
ider+0.077
 next+0.077
Good+0.076
fl+0.073
Neg logit contributions
 shave-0.075
 anchor-0.067
 carve-0.064
isel-0.063
bowl-0.063
Token on
Token ID319
Feature activation-8.25e-03

Pos logit contributions
rav+0.040
bows+0.040
Good+0.038
 lesson+0.038
ider+0.037
Neg logit contributions
 shave-0.058
on-0.058
igh-0.057
bowl-0.056
 anchor-0.056
Token,
Token ID11
Feature activation-1.49e-02

Pos logit contributions
rav+0.016
Good+0.016
bows+0.015
 lesson+0.015
ider+0.015
Neg logit contributions
 shave-0.009
 anchor-0.009
apa-0.009
on-0.009
pen-0.008
Token Bella
Token ID39304
Feature activation-2.99e-02

Pos logit contributions
ider+0.026
oys+0.024
rav+0.024
edly+0.023
 lesson+0.022
Neg logit contributions
 Mick-0.020
on-0.019
bowl-0.019
pen-0.018
 Toby-0.018
Token,
Token ID11
Feature activation-1.42e-02

Pos logit contributions
rav+0.043
ider+0.043
Good+0.041
oys+0.040
 lesson+0.039
Neg logit contributions
pen-0.048
 anchor-0.047
 shave-0.047
bowl-0.046
apa-0.045
Token T
Token ID309
Feature activation+2.70e-02

Pos logit contributions
ider+0.026
OK+0.023
rav+0.023
Good+0.023
oys+0.023
Neg logit contributions
 shave-0.019
pen-0.019
on-0.019
 anchor-0.019
bowl-0.018
Tokening
Token ID278
Feature activation+1.04e-02

Pos logit contributions
 shave+0.046
 anchor+0.046
pen+0.045
on+0.044
ete+0.044
Neg logit contributions
 seek-0.034
Good-0.034
 lesson-0.033
rav-0.033
OK-0.032
Token and
Token ID290
Feature activation-6.47e-03

Pos logit contributions
 anchor+0.016
bowl+0.016
 shave+0.016
ime+0.015
pen+0.015
Neg logit contributions
rav-0.017
Good-0.016
ider-0.016
 lesson-0.014
Sweet-0.014
Token the
Token ID262
Feature activation-1.27e-02

Pos logit contributions
rav+0.009
Good+0.009
 lesson+0.009
 next+0.009
S+0.009
Neg logit contributions
 shave-0.011
 anchor-0.010
 carve-0.010
apa-0.010
bowl-0.010
Token dog
Token ID3290
Feature activation-2.63e-02

Pos logit contributions
ider+0.020
rav+0.020
bows+0.018
edly+0.017
oys+0.017
Neg logit contributions
 anchor-0.021
 shave-0.020
 razor-0.019
 fork-0.019
bowl-0.019
Token were
Token ID547
Feature activation+2.84e-02

Pos logit contributions
rav+0.039
Good+0.039
ider+0.038
 lesson+0.037
 next+0.037
Neg logit contributions
 shave-0.041
 anchor-0.040
bowl-0.040
 carve-0.039
pen-0.038
Token the
Token ID262
Feature activation+1.94e-02

Pos logit contributions
 anchor+0.047
bowl+0.045
 shave+0.045
pen+0.043
 carve+0.042
Neg logit contributions
rav-0.042
ider-0.041
oys-0.038
OK-0.038
Good-0.037
Token it
Token ID340
Feature activation+7.76e-03

Pos logit contributions
 shave+0.106
 anchor+0.100
apa+0.096
 carve+0.094
igh+0.092
Neg logit contributions
rav-0.093
Good-0.092
ider-0.090
 lesson-0.088
S-0.086
Token ugly
Token ID13400
Feature activation+9.72e-03

Pos logit contributions
 shave+0.012
 carve+0.011
 anchor+0.011
bowl+0.011
apa+0.010
Neg logit contributions
rav-0.012
Good-0.012
S-0.011
OK-0.011
oys-0.011
Token?"
Token ID1701
Feature activation+5.08e-04

Pos logit contributions
 shave+0.013
 anchor+0.012
bowl+0.010
 carve+0.010
 fork+0.010
Neg logit contributions
Good-0.019
rav-0.019
K-0.018
S-0.017
ider-0.017
Token
Token ID198
Feature activation+2.28e-02

Pos logit contributions
 shave+0.001
 anchor+0.001
 carve+0.001
apa+0.001
bowl+0.001
Neg logit contributions
rav-0.001
 lesson-0.001
Good-0.001
 next-0.001
S-0.001
Token
Token ID198
Feature activation+1.68e-02

Pos logit contributions
 shave+0.049
 anchor+0.048
 carve+0.046
isel+0.046
bowl+0.046
Neg logit contributions
 next-0.023
S-0.022
Good-0.022
 lesson-0.021
rav-0.021
TokenBen
Token ID11696
Feature activation+2.55e-02

Pos logit contributions
 shave+0.036
 anchor+0.035
 carve+0.033
isel+0.033
bowl+0.033
Neg logit contributions
 next-0.018
S-0.018
Good-0.017
rav-0.017
 lesson-0.016
Token thought
Token ID1807
Feature activation-5.45e-03

Pos logit contributions
 shave+0.048
 anchor+0.046
 carve+0.045
apa+0.044
bowl+0.044
Neg logit contributions
rav-0.031
 lesson-0.030
Good-0.030
 next-0.030
S-0.028
Token for
Token ID329
Feature activation-9.69e-04

Pos logit contributions
rav+0.006
Good+0.006
S+0.005
 seek+0.005
OK+0.005
Neg logit contributions
 shave-0.011
 anchor-0.011
apa-0.011
 carve-0.010
isel-0.010
Token a
Token ID257
Feature activation-9.81e-03

Pos logit contributions
rav+0.001
Good+0.001
S+0.001
ider+0.001
 lesson+0.001
Neg logit contributions
 shave-0.002
 anchor-0.002
 boiling-0.002
 carve-0.002
pen-0.002
Token moment
Token ID2589
Feature activation-5.62e-02

Pos logit contributions
ider+0.007
 next+0.007
rav+0.007
fl+0.007
bows+0.006
Neg logit contributions
 shave-0.024
 anchor-0.022
 fork-0.021
 razor-0.021
 carve-0.021
Token.
Token ID13
Feature activation+4.71e-03

Pos logit contributions
rav+0.082
Good+0.080
 lesson+0.076
OK+0.073
bows+0.073
Neg logit contributions
 shave-0.094
 anchor-0.090
bowl-0.085
apa-0.084
pen-0.084
TokenThe
Token ID464
Feature activation-1.41e-02

Pos logit contributions
 shave+0.018
apa+0.017
pen+0.017
 anchor+0.017
bowl+0.017
Neg logit contributions
rav-0.020
bows-0.019
 lesson-0.019
fl-0.019
ses-0.018
Token farmer
Token ID18739
Feature activation+3.92e-03

Pos logit contributions
rav+0.021
bows+0.020
ider+0.019
Good+0.019
 lesson+0.018
Neg logit contributions
 anchor-0.024
 shave-0.023
 carve-0.022
apa-0.021
bowl-0.021
Token smiled
Token ID13541
Feature activation-8.99e-03

Pos logit contributions
 shave+0.007
 anchor+0.007
apa+0.006
 carve+0.006
pen+0.006
Neg logit contributions
rav-0.005
Good-0.005
 lesson-0.005
 next-0.005
S-0.005
Token.
Token ID13
Feature activation+8.55e-03

Pos logit contributions
rav+0.009
Good+0.009
 lesson+0.009
 next+0.009
S+0.008
Neg logit contributions
 shave-0.019
 anchor-0.018
 carve-0.018
apa-0.017
bowl-0.017
Token "
Token ID366
Feature activation-3.00e-02

Pos logit contributions
 shave+0.018
 anchor+0.017
apa+0.017
 carve+0.017
bowl+0.017
Neg logit contributions
rav-0.009
 lesson-0.008
Good-0.008
 next-0.008
 seek-0.007
TokenI
Token ID40
Feature activation+2.11e-02

Pos logit contributions
sed+0.042
ses+0.039
rav+0.039
oys+0.038
 angels+0.037
Neg logit contributions
 shave-0.050
Son-0.050
bowl-0.047
pen-0.046
vice-0.045
Token can
Token ID460
Feature activation-6.28e-03

Pos logit contributions
 shave+0.034
 anchor+0.031
 carve+0.031
bowl+0.031
apa+0.031
Neg logit contributions
Good-0.031
oys-0.030
rav-0.030
 next-0.030
 songs-0.030
Token help
Token ID1037
Feature activation+1.62e-02

Pos logit contributions
 next+0.007
rav+0.007
Good+0.007
 lesson+0.007
 seek+0.006
Neg logit contributions
 shave-0.012
 anchor-0.012
apa-0.012
bowl-0.011
isel-0.011
Token you
Token ID345
Feature activation-3.63e-03

Pos logit contributions
 shave+0.033
 anchor+0.032
isel+0.031
apa+0.031
 carve+0.031
Neg logit contributions
 next-0.018
rav-0.016
Good-0.016
 lesson-0.016
S-0.015
Token,"
Token ID553
Feature activation+1.95e-02

Pos logit contributions
 next+0.005
 lesson+0.005
rav+0.005
Good+0.005
 seek+0.005
Neg logit contributions
 shave-0.006
 anchor-0.006
apa-0.006
isel-0.006
bowl-0.006
Token he
Token ID339
Feature activation+5.01e-02

Pos logit contributions
 shave+0.027
 anchor+0.026
 carve+0.024
apa+0.024
bowl+0.024
Neg logit contributions
rav-0.034
Good-0.033
 lesson-0.033
 next-0.033
S-0.031
Token very
Token ID845
Feature activation+1.84e-02

Pos logit contributions
 shave+0.019
 anchor+0.018
on+0.017
apa+0.017
 frying+0.017
Neg logit contributions
rav-0.015
ider-0.014
 lesson-0.014
S-0.014
Good-0.013
Token curious
Token ID11040
Feature activation+5.50e-02

Pos logit contributions
 shave+0.031
 anchor+0.030
 carve+0.029
apa+0.029
 frying+0.028
Neg logit contributions
rav-0.025
Good-0.024
 lesson-0.024
 next-0.024
S-0.023
Token about
Token ID546
Feature activation-5.99e-03

Pos logit contributions
 shave+0.104
 anchor+0.100
 carve+0.098
apa+0.097
isel+0.096
Neg logit contributions
 next-0.066
rav-0.066
 lesson-0.066
Good-0.064
S-0.060
Token the
Token ID262
Feature activation+2.45e-02

Pos logit contributions
rav+0.008
Good+0.008
 next+0.008
 lesson+0.008
ider+0.008
Neg logit contributions
 shave-0.011
 anchor-0.010
 carve-0.010
bowl-0.010
pen-0.009
Token world
Token ID995
Feature activation+9.62e-03

Pos logit contributions
 shave+0.041
 anchor+0.040
 carve+0.038
apa+0.038
isel+0.038
Neg logit contributions
rav-0.034
 next-0.034
 lesson-0.034
Good-0.034
S-0.032
Token.
Token ID13
Feature activation+3.87e-03

Pos logit contributions
 shave+0.013
pen+0.012
on+0.012
 anchor+0.012
bowl+0.012
Neg logit contributions
rav-0.017
Good-0.016
 lesson-0.016
S-0.015
ider-0.015
Token
Token ID198
Feature activation+1.21e-02

Pos logit contributions
 shave+0.006
apa+0.006
 anchor+0.006
 carve+0.006
bowl+0.006
Neg logit contributions
rav-0.006
 lesson-0.005
Good-0.005
 next-0.005
ider-0.005
Token
Token ID198
Feature activation+1.30e-02

Pos logit contributions
 shave+0.020
 anchor+0.019
apa+0.019
 carve+0.018
bowl+0.018
Neg logit contributions
rav-0.017
 lesson-0.017
Good-0.016
 next-0.016
ider-0.016
TokenOne
Token ID3198
Feature activation-8.01e-02

Pos logit contributions
on+0.021
 shave+0.020
?'+0.020
bowl+0.020
pen+0.019
Neg logit contributions
rav-0.017
ider-0.016
fl-0.015
bows-0.015
ses-0.015
Token day
Token ID1110
Feature activation-4.56e-02

Pos logit contributions
rav+0.090
Good+0.084
ider+0.083
bows+0.081
 lesson+0.079
Neg logit contributions
 shave-0.157
 anchor-0.153
apa-0.146
 carve-0.146
bowl-0.145
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
rav+0.093
Good+0.088
 lesson+0.088
bows+0.088
ider+0.088
Neg logit contributions
on-0.043
 shave-0.039
apa-0.038
?'-0.035
 fork-0.035
Token "
Token ID366
Feature activation-2.90e-02

Pos logit contributions
apa+0.017
 shave+0.016
 anchor+0.015
 carve+0.015
bowl+0.015
Neg logit contributions
rav-0.018
Good-0.017
 lesson-0.016
 seek-0.016
bows-0.016
TokenThe
Token ID464
Feature activation-7.24e-03

Pos logit contributions
rav+0.029
 lesson+0.028
bows+0.026
 next+0.026
ider+0.026
Neg logit contributions
 shave-0.061
 anchor-0.058
 carve-0.057
bowl-0.057
apa-0.056
Token dog
Token ID3290
Feature activation-3.57e-02

Pos logit contributions
rav+0.012
Good+0.011
S+0.010
bows+0.010
oys+0.010
Neg logit contributions
 anchor-0.012
 shave-0.011
 iceberg-0.010
 carve-0.010
 boiling-0.010
Token is
Token ID318
Feature activation-1.62e-03

Pos logit contributions
rav+0.049
 lesson+0.047
Good+0.046
S+0.044
bows+0.044
Neg logit contributions
 shave-0.060
 anchor-0.060
bowl-0.057
pen-0.057
 carve-0.056
Token chasing
Token ID20023
Feature activation-7.57e-03

Pos logit contributions
rav+0.003
Good+0.002
bows+0.002
OK+0.002
 lesson+0.002
Neg logit contributions
 shave-0.003
 anchor-0.003
 boiling-0.002
 carve-0.002
bowl-0.002
Token us
Token ID514
Feature activation+6.98e-03

Pos logit contributions
rav+0.014
ider+0.014
bows+0.013
 lesson+0.013
Good+0.013
Neg logit contributions
 anchor-0.009
 shave-0.009
bowl-0.009
pen-0.009
apa-0.009
Token!"
Token ID2474
Feature activation+7.44e-03

Pos logit contributions
 shave+0.012
 anchor+0.012
bowl+0.011
pen+0.011
apa+0.011
Neg logit contributions
rav-0.009
Good-0.009
 lesson-0.009
 next-0.008
OK-0.008
Token
Token ID198
Feature activation+2.40e-02

Pos logit contributions
 shave+0.011
apa+0.011
 anchor+0.011
 carve+0.010
bowl+0.010
Neg logit contributions
rav-0.012
 lesson-0.011
Good-0.011
bows-0.011
 seek-0.011
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
 shave+0.036
apa+0.035
 anchor+0.034
 carve+0.033
bowl+0.032
Neg logit contributions
rav-0.038
bows-0.037
 lesson-0.036
Good-0.036
ider-0.035
TokenThey
Token ID2990
Feature activation+3.00e-03

Pos logit contributions
apa+0.033
 shave+0.032
 anchor+0.032
bowl+0.031
Son+0.030
Neg logit contributions
bows-0.028
rav-0.027
 lesson-0.025
fl-0.023
ider-0.021
Token ran
Token ID4966
Feature activation-2.87e-02

Pos logit contributions
 shave+0.006
 anchor+0.006
 carve+0.006
apa+0.006
pen+0.006
Neg logit contributions
 next-0.003
Good-0.003
rav-0.003
 lesson-0.003
S-0.003
Token she
Token ID673
Feature activation+2.71e-02

Pos logit contributions
 shave+0.041
on+0.039
 boiling+0.039
 anchor+0.037
washer+0.037
Neg logit contributions
rav-0.040
ider-0.039
fl-0.037
S-0.037
opard-0.037
Token was
Token ID373
Feature activation+3.66e-03

Pos logit contributions
 shave+0.051
 anchor+0.048
apa+0.046
 carve+0.045
pen+0.045
Neg logit contributions
rav-0.034
Good-0.032
 lesson-0.032
ider-0.031
bows-0.030
Token disappointed
Token ID11679
Feature activation+1.40e-02

Pos logit contributions
 shave+0.006
 anchor+0.006
 boiling+0.006
 frying+0.006
 carve+0.006
Neg logit contributions
rav-0.006
fl-0.005
 lesson-0.005
Good-0.005
bows-0.005
Token.
Token ID13
Feature activation+5.43e-03

Pos logit contributions
 shave+0.020
 anchor+0.018
apa+0.017
 boiling+0.016
pen+0.016
Neg logit contributions
rav-0.025
Good-0.023
 seek-0.022
bows-0.022
 lesson-0.022
Token She
Token ID1375
Feature activation+1.77e-02

Pos logit contributions
 shave+0.008
apa+0.008
bowl+0.008
 carve+0.008
pen+0.008
Neg logit contributions
rav-0.008
 lesson-0.008
 seek-0.008
Good-0.007
ider-0.007
Token explained
Token ID4893
Feature activation-5.81e-03

Pos logit contributions
 shave+0.026
 anchor+0.023
 carve+0.022
apa+0.022
 frying+0.021
Neg logit contributions
rav-0.031
bows-0.030
ider-0.029
fl-0.029
 lesson-0.028
Token to
Token ID284
Feature activation+3.06e-03

Pos logit contributions
rav+0.008
Good+0.008
ider+0.008
 seek+0.008
 lesson+0.008
Neg logit contributions
 shave-0.010
 anchor-0.009
apa-0.009
 carve-0.009
 frying-0.009
Token them
Token ID606
Feature activation-1.62e-02

Pos logit contributions
 shave+0.005
 carve+0.005
 anchor+0.005
apa+0.005
 Jason+0.005
Neg logit contributions
ider-0.005
rav-0.004
 seek-0.004
 lesson-0.004
bows-0.004
Token that
Token ID326
Feature activation+4.57e-02

Pos logit contributions
rav+0.025
Good+0.025
S+0.023
 seek+0.023
ider+0.023
Neg logit contributions
 shave-0.025
 anchor-0.024
apa-0.024
 carve-0.022
pen-0.022
Token perfume
Token ID40094
Feature activation+7.30e-04

Pos logit contributions
 shave+0.084
 anchor+0.080
bowl+0.076
 carve+0.076
on+0.075
Neg logit contributions
ider-0.060
rav-0.060
Good-0.056
OK-0.054
S-0.054
Token was
Token ID373
Feature activation+1.99e-02

Pos logit contributions
 shave+0.001
 anchor+0.001
apa+0.001
bowl+0.001
 carve+0.001
Neg logit contributions
rav-0.001
Good-0.001
S-0.001
ider-0.001
 seek-0.001
Token said
Token ID531
Feature activation+9.36e-04

Pos logit contributions
rav+0.011
bows+0.011
Good+0.011
ider+0.011
 lesson+0.010
Neg logit contributions
 anchor-0.008
 shave-0.007
 carve-0.007
on-0.007
apa-0.007
Token,
Token ID11
Feature activation-5.39e-03

Pos logit contributions
on+0.001
 shave+0.001
apa+0.001
 anchor+0.001
isel+0.001
Neg logit contributions
Good-0.002
bows-0.002
OK-0.002
rav-0.002
 lesson-0.002
Token "
Token ID366
Feature activation-1.90e-02

Pos logit contributions
rav+0.008
Good+0.008
 lesson+0.008
bows+0.007
 seek+0.007
Neg logit contributions
 shave-0.009
on-0.008
 anchor-0.008
 carve-0.008
apa-0.008
TokenLet
Token ID5756
Feature activation+1.46e-02

Pos logit contributions
rav+0.022
 lesson+0.021
ses+0.021
sed+0.021
 angels+0.021
Neg logit contributions
 shave-0.036
bowl-0.033
pen-0.032
Son-0.032
 carve-0.032
Token's
Token ID338
Feature activation+1.65e-02

Pos logit contributions
 shave+0.019
 anchor+0.018
 carve+0.017
isel+0.017
apa+0.017
Neg logit contributions
 next-0.026
rav-0.026
Good-0.026
 lesson-0.026
S-0.025
Token play
Token ID711
Feature activation-5.01e-02

Pos logit contributions
apa+0.031
 shave+0.030
 anchor+0.030
isel+0.029
 frying+0.029
Neg logit contributions
 next-0.020
 seek-0.019
Good-0.018
 lesson-0.018
OK-0.017
Token shoot
Token ID2686
Feature activation-1.47e-02

Pos logit contributions
rav+0.084
bows+0.080
fl+0.077
oys+0.076
Good+0.074
Neg logit contributions
 shave-0.077
bowl-0.073
on-0.072
pen-0.072
 anchor-0.069
Token the
Token ID262
Feature activation+2.17e-02

Pos logit contributions
rav+0.018
Good+0.017
 lesson+0.017
ider+0.017
 next+0.017
Neg logit contributions
 shave-0.027
 anchor-0.027
bowl-0.025
pen-0.025
 carve-0.025
Token ball
Token ID2613
Feature activation+1.31e-02

Pos logit contributions
 anchor+0.035
 shave+0.033
bowl+0.031
 magnet+0.031
 fork+0.030
Neg logit contributions
rav-0.034
S-0.031
ider-0.031
Good-0.030
oys-0.030
Token.
Token ID13
Feature activation-3.64e-03

Pos logit contributions
bowl+0.022
 anchor+0.022
on+0.021
pen+0.021
 shave+0.020
Neg logit contributions
rav-0.019
Good-0.018
Sweet-0.017
S-0.016
 lesson-0.016
Token If
Token ID1002
Feature activation+1.60e-02

Pos logit contributions
 lesson+0.004
rav+0.004
Good+0.004
S+0.004
 next+0.004
Neg logit contributions
 shave-0.007
 anchor-0.007
 carve-0.006
bowl-0.006
apa-0.006
Token
Token ID198
Feature activation+5.94e-03

Pos logit contributions
 shave+0.017
 anchor+0.016
 carve+0.016
apa+0.015
isel+0.015
Neg logit contributions
rav-0.011
 next-0.011
 lesson-0.011
Good-0.010
S-0.010
TokenMax
Token ID11518
Feature activation-1.80e-02

Pos logit contributions
 shave+0.011
pen+0.010
apa+0.010
bowl+0.009
ete+0.009
Neg logit contributions
rav-0.008
bows-0.008
fl-0.008
 lesson-0.007
ider-0.007
Token went
Token ID1816
Feature activation-1.11e-02

Pos logit contributions
rav+0.027
Good+0.027
 lesson+0.026
ider+0.025
bows+0.024
Neg logit contributions
 shave-0.028
 anchor-0.027
pen-0.027
apa-0.027
bowl-0.026
Token to
Token ID284
Feature activation-2.01e-02

Pos logit contributions
rav+0.019
Good+0.019
bows+0.018
Sweet+0.017
fl+0.017
Neg logit contributions
 shave-0.016
 anchor-0.015
 harbour-0.014
 wreck-0.014
pen-0.014
Token the
Token ID262
Feature activation+6.02e-03

Pos logit contributions
rav+0.032
Good+0.031
 lesson+0.030
 next+0.030
ider+0.029
Neg logit contributions
 shave-0.032
 anchor-0.030
 carve-0.029
apa-0.028
pen-0.028
Token park
Token ID3952
Feature activation-4.92e-02

Pos logit contributions
 anchor+0.008
 harbour+0.008
 shave+0.008
 carve+0.008
 wreck+0.007
Neg logit contributions
rav-0.011
bows-0.010
ider-0.010
Good-0.010
S-0.009
Token man
Token ID582
Feature activation-2.39e-02

Pos logit contributions
rav+0.072
Good+0.071
 lesson+0.062
Sweet+0.060
fl+0.060
Neg logit contributions
 shave-0.083
 anchor-0.082
bowl-0.079
 harbour-0.078
pen-0.078
Token and
Token ID290
Feature activation+6.17e-04

Pos logit contributions
 next+0.020
 lesson+0.015
rav+0.014
 seek+0.014
S+0.014
Neg logit contributions
 shave-0.056
isel-0.055
 anchor-0.055
apa-0.053
 carve-0.053
Token said
Token ID531
Feature activation+7.76e-03

Pos logit contributions
 shave+0.001
 anchor+0.001
 carve+0.001
bowl+0.001
apa+0.001
Neg logit contributions
rav-0.001
Good-0.001
 lesson-0.001
ider-0.001
 next-0.001
Token,
Token ID11
Feature activation-8.17e-03

Pos logit contributions
 shave+0.009
 anchor+0.009
apa+0.009
on+0.009
isel+0.008
Neg logit contributions
Good-0.014
bows-0.014
rav-0.014
 lesson-0.013
OK-0.013
Token "
Token ID366
Feature activation-2.66e-02

Pos logit contributions
rav+0.010
 lesson+0.010
Good+0.009
bows+0.009
 seek+0.009
Neg logit contributions
 shave-0.015
 anchor-0.015
apa-0.014
 carve-0.014
on-0.014
Token the
Token ID262
Feature activation+1.42e-02

Pos logit contributions
 shave+0.010
 anchor+0.010
 carve+0.009
apa+0.009
bowl+0.009
Neg logit contributions
 next-0.009
 lesson-0.009
rav-0.008
Good-0.008
S-0.008
Token click
Token ID3904
Feature activation+8.64e-03

Pos logit contributions
 anchor+0.023
 shave+0.021
 iceberg+0.020
 carve+0.020
 ig+0.020
Neg logit contributions
rav-0.023
Good-0.021
oys-0.020
S-0.020
ider-0.020
Token.
Token ID13
Feature activation-3.97e-04

Pos logit contributions
 shave+0.015
 anchor+0.015
 carve+0.014
pen+0.014
apa+0.014
Neg logit contributions
rav-0.012
Good-0.012
 lesson-0.011
S-0.011
OK-0.011
Token He
Token ID679
Feature activation+1.05e-02

Pos logit contributions
rav+0.001
Good+0.001
 next+0.001
 lesson+0.001
S+0.001
Neg logit contributions
 shave-0.001
 anchor-0.000
 carve-0.000
apa-0.000
isel-0.000
Token did
Token ID750
Feature activation-2.55e-02

Pos logit contributions
 shave+0.018
 anchor+0.018
 carve+0.017
pen+0.016
apa+0.016
Neg logit contributions
rav-0.016
Good-0.015
 lesson-0.014
oys-0.014
 next-0.014
Token not
Token ID407
Feature activation-4.38e-02

Pos logit contributions
rav+0.034
fl+0.030
Good+0.030
oys+0.030
bows+0.028
Neg logit contributions
 shave-0.051
 anchor-0.046
 carve-0.046
isel-0.043
asts-0.042
Token see
Token ID766
Feature activation+1.94e-02

Pos logit contributions
 next+0.063
rav+0.063
 lesson+0.062
Good+0.062
S+0.060
Neg logit contributions
 shave-0.070
 anchor-0.068
apa-0.067
bowl-0.065
 carve-0.065
Token the
Token ID262
Feature activation+1.56e-02

Pos logit contributions
 shave+0.035
 anchor+0.033
apa+0.032
 carve+0.032
bowl+0.032
Neg logit contributions
 next-0.025
 lesson-0.025
rav-0.024
Good-0.024
S-0.023
Token wind
Token ID2344
Feature activation+3.55e-02

Pos logit contributions
 anchor+0.026
 shave+0.024
 carve+0.023
 iceberg+0.023
 boiling+0.022
Neg logit contributions
rav-0.025
Good-0.023
oys-0.021
S-0.021
ider-0.021
Token push
Token ID4574
Feature activation-6.10e-03

Pos logit contributions
 shave+0.063
 anchor+0.062
 carve+0.059
pen+0.057
bowl+0.057
Neg logit contributions
rav-0.048
 lesson-0.047
Good-0.046
S-0.044
OK-0.043
Token it
Token ID340
Feature activation+2.02e-03

Pos logit contributions
rav+0.010
Good+0.010
 lesson+0.010
 next+0.009
ider+0.009
Neg logit contributions
 shave-0.009
 anchor-0.009
apa-0.008
pen-0.008
bowl-0.008
Token of
Token ID286
Feature activation+1.15e-02

Pos logit contributions
 shave+0.060
 anchor+0.058
apa+0.054
 carve+0.054
pen+0.054
Neg logit contributions
rav-0.065
Good-0.063
 lesson-0.062
S-0.059
OK-0.059
Token herself
Token ID5223
Feature activation-1.02e-02

Pos logit contributions
 shave+0.021
 anchor+0.020
on+0.020
ete+0.019
pen+0.019
Neg logit contributions
rav-0.015
ider-0.014
bows-0.014
oys-0.014
S-0.013
Token for
Token ID329
Feature activation-5.37e-03

Pos logit contributions
rav+0.014
Good+0.014
S+0.013
OK+0.013
bows+0.013
Neg logit contributions
 shave-0.017
 anchor-0.017
apa-0.016
on-0.016
pen-0.016
Token trying
Token ID2111
Feature activation+2.65e-03

Pos logit contributions
rav+0.008
S+0.007
bows+0.007
oys+0.007
Good+0.007
Neg logit contributions
 shave-0.010
 anchor-0.009
pen-0.008
 carve-0.008
on-0.008
Token something
Token ID1223
Feature activation+2.72e-02

Pos logit contributions
 shave+0.005
 anchor+0.005
 carve+0.005
apa+0.005
pen+0.005
Neg logit contributions
rav-0.003
Good-0.003
 next-0.003
 lesson-0.003
ider-0.003
Token new
Token ID649
Feature activation-4.06e-02

Pos logit contributions
 shave+0.050
 anchor+0.049
apa+0.047
 carve+0.046
bowl+0.046
Neg logit contributions
rav-0.034
Good-0.033
 lesson-0.032
 next-0.032
S-0.031
Token.
Token ID13
Feature activation+2.71e-03

Pos logit contributions
rav+0.067
Good+0.065
S+0.062
ider+0.061
K+0.061
Neg logit contributions
 shave-0.062
 anchor-0.057
pen-0.056
bowl-0.055
apa-0.052
Token
Token ID198
Feature activation+9.87e-03

Pos logit contributions
 shave+0.004
apa+0.004
 anchor+0.004
 carve+0.004
bowl+0.004
Neg logit contributions
rav-0.004
 lesson-0.004
Good-0.004
 next-0.004
ider-0.004
Token
Token ID198
Feature activation+9.39e-03

Pos logit contributions
 shave+0.018
 anchor+0.018
 carve+0.017
apa+0.017
bowl+0.017
Neg logit contributions
rav-0.012
 next-0.012
 lesson-0.012
Good-0.012
S-0.011
TokenAt
Token ID2953
Feature activation-2.56e-02

Pos logit contributions
 shave+0.015
apa+0.015
on+0.015
ete+0.015
pen+0.015
Neg logit contributions
rav-0.013
bows-0.012
ses-0.012
sed-0.012
fl-0.012
Token the
Token ID262
Feature activation+5.96e-03

Pos logit contributions
rav+0.036
Good+0.035
 lesson+0.034
S+0.034
ider+0.034
Neg logit contributions
 shave-0.043
 anchor-0.042
 carve-0.040
apa-0.040
pen-0.039
Token day
Token ID1110
Feature activation-4.41e-02

Pos logit contributions
 shave+0.047
 anchor+0.046
 carve+0.044
apa+0.044
isel+0.044
Neg logit contributions
 next-0.015
 lesson-0.015
rav-0.014
Good-0.014
S-0.012
Token on
Token ID319
Feature activation-2.27e-02

Pos logit contributions
rav+0.042
 lesson+0.041
Good+0.040
 next+0.039
bows+0.038
Neg logit contributions
 shave-0.092
 anchor-0.089
apa-0.087
 carve-0.087
bowl-0.086
Token,
Token ID11
Feature activation-2.18e-02

Pos logit contributions
rav+0.039
Good+0.038
 lesson+0.037
ider+0.036
OK+0.036
Neg logit contributions
 shave-0.031
 anchor-0.030
apa-0.029
 carve-0.028
pen-0.028
Token Lily
Token ID20037
Feature activation-4.50e-02

Pos logit contributions
ider+0.030
rav+0.029
oys+0.028
edly+0.027
ses+0.027
Neg logit contributions
 carve-0.036
 shave-0.036
on-0.035
pen-0.034
bowl-0.033
Token was
Token ID373
Feature activation-1.40e-02

Pos logit contributions
rav+0.045
 next+0.044
 lesson+0.044
Good+0.044
S+0.040
Neg logit contributions
 shave-0.094
 anchor-0.091
 carve-0.088
apa-0.088
bowl-0.087
Token more
Token ID517
Feature activation-6.84e-02

Pos logit contributions
rav+0.018
ider+0.016
oys+0.016
bows+0.016
Good+0.016
Neg logit contributions
 shave-0.026
 anchor-0.025
 carve-0.024
bowl-0.024
on-0.024
Token careful
Token ID8161
Feature activation+2.36e-02

Pos logit contributions
rav+0.068
Good+0.065
 lesson+0.065
 next+0.065
S+0.060
Neg logit contributions
 shave-0.145
 anchor-0.141
 carve-0.136
apa-0.135
bowl-0.134
Token and
Token ID290
Feature activation-1.15e-02

Pos logit contributions
 shave+0.041
 anchor+0.038
on+0.038
bowl+0.037
ime+0.037
Neg logit contributions
rav-0.033
Good-0.031
S-0.029
ider-0.029
 Lia-0.028
Token never
Token ID1239
Feature activation-6.89e-02

Pos logit contributions
ider+0.020
rav+0.018
oys+0.017
bows+0.017
S+0.016
Neg logit contributions
 shave-0.017
 anchor-0.016
 carve-0.015
on-0.015
bowl-0.015
Token got
Token ID1392
Feature activation-2.25e-02

Pos logit contributions
rav+0.088
 lesson+0.086
Good+0.086
 next+0.085
S+0.081
Neg logit contributions
 shave-0.127
 anchor-0.120
 carve-0.118
bowl-0.115
pen-0.114
Token her
Token ID607
Feature activation-1.19e-02

Pos logit contributions
rav+0.033
 next+0.032
 lesson+0.032
Good+0.032
S+0.031
Neg logit contributions
 shave-0.037
 anchor-0.035
 carve-0.034
apa-0.034
bowl-0.033
Token very
Token ID845
Feature activation+3.52e-02

Pos logit contributions
 shave+0.016
 anchor+0.015
bowl+0.015
on+0.015
 carve+0.014
Neg logit contributions
rav-0.018
Good-0.017
ider-0.017
Sweet-0.016
OK-0.016
Token much
Token ID881
Feature activation+1.95e-02

Pos logit contributions
 shave+0.067
 anchor+0.065
 carve+0.064
apa+0.063
bowl+0.062
Neg logit contributions
 lesson-0.042
 next-0.041
rav-0.041
Good-0.041
S-0.039
Token.
Token ID13
Feature activation+3.61e-03

Pos logit contributions
 shave+0.036
 anchor+0.035
 carve+0.034
apa+0.034
bowl+0.034
Neg logit contributions
 next-0.024
 lesson-0.024
rav-0.024
Good-0.023
S-0.022
Token But
Token ID887
Feature activation+1.10e-02

Pos logit contributions
 anchor+0.008
 shave+0.008
igh+0.008
pen+0.008
isel+0.008
Neg logit contributions
 next-0.003
 Sue-0.003
S-0.003
Good-0.003
OK-0.003
Token the
Token ID262
Feature activation-3.13e-03

Pos logit contributions
 shave+0.017
 anchor+0.017
 carve+0.016
apa+0.015
bowl+0.015
Neg logit contributions
rav-0.017
Good-0.017
 lesson-0.017
 next-0.016
S-0.016
Token price
Token ID2756
Feature activation-7.23e-02

Pos logit contributions
rav+0.004
Good+0.004
bows+0.004
ider+0.004
 lesson+0.004
Neg logit contributions
 shave-0.006
 anchor-0.005
 carve-0.005
bowl-0.005
 iceberg-0.005
Token was
Token ID373
Feature activation+1.33e-03

Pos logit contributions
rav+0.092
Good+0.088
 next+0.082
ider+0.082
S+0.080
Neg logit contributions
 shave-0.135
 anchor-0.134
on-0.126
pen-0.125
bowl-0.125
Token unknown
Token ID6439
Feature activation-1.66e-02

Pos logit contributions
 anchor+0.002
 shave+0.002
 carve+0.002
 boiling+0.002
on+0.002
Neg logit contributions
rav-0.002
Good-0.002
bows-0.002
S-0.002
OK-0.002
Token.
Token ID13
Feature activation+1.95e-03

Pos logit contributions
rav+0.023
Good+0.023
 lesson+0.021
S+0.021
 next+0.021
Neg logit contributions
 shave-0.029
 anchor-0.029
bowl-0.027
 carve-0.027
pen-0.026
Token
Token ID220
Feature activation+4.28e-03

Pos logit contributions
 shave+0.004
 anchor+0.004
apa+0.004
pen+0.004
 carve+0.004
Neg logit contributions
 next-0.002
Good-0.002
 Sue-0.002
S-0.002
 lesson-0.002
Token
Token ID198
Feature activation+9.59e-03

Pos logit contributions
 shave+0.008
 anchor+0.008
 carve+0.008
apa+0.008
isel+0.008
Neg logit contributions
rav-0.005
 next-0.005
 lesson-0.005
Good-0.005
S-0.005
Token very
Token ID845
Feature activation+1.82e-02

Pos logit contributions
 shave+0.017
 anchor+0.016
apa+0.016
bowl+0.016
on+0.016
Neg logit contributions
rav-0.012
ider-0.011
 lesson-0.011
Good-0.011
S-0.011
Token adventurous
Token ID39770
Feature activation+1.49e-02

Pos logit contributions
 shave+0.033
 anchor+0.032
 carve+0.031
apa+0.031
bowl+0.030
Neg logit contributions
rav-0.023
Good-0.023
 lesson-0.023
 next-0.022
S-0.021
Token.
Token ID13
Feature activation-3.48e-03

Pos logit contributions
 shave+0.024
 anchor+0.023
bowl+0.022
apa+0.022
pen+0.022
Neg logit contributions
rav-0.022
Good-0.021
 lesson-0.021
 next-0.021
S-0.020
Token
Token ID220
Feature activation+1.43e-03

Pos logit contributions
rav+0.005
 next+0.005
Good+0.005
 lesson+0.005
S+0.004
Neg logit contributions
 shave-0.006
 anchor-0.006
 carve-0.006
apa-0.006
bowl-0.005
Token
Token ID198
Feature activation+9.55e-03

Pos logit contributions
 shave+0.002
 anchor+0.002
apa+0.002
 carve+0.002
bowl+0.002
Neg logit contributions
rav-0.002
Good-0.002
 lesson-0.002
 next-0.002
ider-0.002
TokenOne
Token ID3198
Feature activation-8.26e-02

Pos logit contributions
 shave+0.015
 anchor+0.014
bowl+0.014
 carve+0.014
pen+0.014
Neg logit contributions
rav-0.014
 lesson-0.013
Good-0.013
 next-0.013
ider-0.013
Token day
Token ID1110
Feature activation-4.99e-02

Pos logit contributions
rav+0.087
Good+0.082
ider+0.079
 lesson+0.078
bows+0.077
Neg logit contributions
 shave-0.169
 anchor-0.164
 carve-0.156
bowl-0.156
apa-0.155
Token,
Token ID11
Feature activation-1.29e-02

Pos logit contributions
rav+0.082
 lesson+0.081
Good+0.080
 next+0.079
ider+0.076
Neg logit contributions
 shave-0.069
 anchor-0.065
apa-0.064
 carve-0.062
bowl-0.061
Token Lucy
Token ID22162
Feature activation-2.50e-02

Pos logit contributions
rav+0.020
ider+0.018
oys+0.017
bows+0.017
 lesson+0.017
Neg logit contributions
 shave-0.020
 Daddy-0.018
 frying-0.018
 boiling-0.018
 fork-0.018
Token decided
Token ID3066
Feature activation-2.45e-02

Pos logit contributions
rav+0.030
 lesson+0.030
Good+0.029
 next+0.029
S+0.027
Neg logit contributions
 shave-0.047
 anchor-0.045
 carve-0.044
apa-0.044
bowl-0.044
Token to
Token ID284
Feature activation-1.34e-02

Pos logit contributions
ider+0.042
rav+0.042
Good+0.039
bows+0.039
oys+0.038
Neg logit contributions
 shave-0.034
 anchor-0.032
 carve-0.031
 oy-0.028
 frying-0.028
Token her
Token ID607
Feature activation-8.96e-03

Pos logit contributions
 shave+0.056
 boiling+0.055
 anchor+0.055
on+0.054
apa+0.051
Neg logit contributions
rav-0.061
ider-0.055
OK-0.052
Good-0.052
 lesson-0.051
Token ears
Token ID11368
Feature activation+4.61e-02

Pos logit contributions
rav+0.014
 next+0.013
Good+0.013
 lesson+0.013
S+0.013
Neg logit contributions
 shave-0.014
 anchor-0.014
 carve-0.013
apa-0.013
bowl-0.013
Token because
Token ID780
Feature activation+1.17e-02

Pos logit contributions
 shave+0.104
 anchor+0.102
apa+0.100
 carve+0.099
isel+0.099
Neg logit contributions
 next-0.039
 lesson-0.035
Good-0.034
rav-0.034
S-0.032
Token it
Token ID340
Feature activation-9.81e-03

Pos logit contributions
 anchor+0.018
on+0.017
 shave+0.017
 boiling+0.016
apa+0.016
Neg logit contributions
rav-0.019
ider-0.018
Good-0.017
oys-0.017
OK-0.017
Token was
Token ID373
Feature activation-8.88e-04

Pos logit contributions
 next+0.009
 lesson+0.009
Good+0.009
rav+0.009
 seek+0.008
Neg logit contributions
 shave-0.021
 anchor-0.020
apa-0.020
 carve-0.020
isel-0.020
Token too
Token ID1165
Feature activation-6.89e-02

Pos logit contributions
rav+0.001
Good+0.001
 lesson+0.001
 next+0.001
S+0.001
Neg logit contributions
 shave-0.002
 anchor-0.002
 carve-0.001
apa-0.001
bowl-0.001
Token loud
Token ID7812
Feature activation+3.69e-02

Pos logit contributions
 next+0.087
 lesson+0.087
S+0.081
Good+0.080
rav+0.076
Neg logit contributions
 shave-0.131
 anchor-0.125
pen-0.124
 carve-0.124
bowl-0.123
Token.
Token ID13
Feature activation+2.03e-03

Pos logit contributions
 shave+0.059
 anchor+0.055
pen+0.054
on+0.052
 iceberg+0.052
Neg logit contributions
rav-0.056
Good-0.054
S-0.053
OK-0.052
 lesson-0.051
Token But
Token ID887
Feature activation+3.29e-03

Pos logit contributions
 shave+0.003
apa+0.003
bowl+0.003
 carve+0.002
on+0.002
Neg logit contributions
rav-0.004
 lesson-0.003
ider-0.003
Good-0.003
 next-0.003
Token then
Token ID788
Feature activation-2.72e-02

Pos logit contributions
 anchor+0.004
 shave+0.003
 rum+0.003
 Mick+0.003
apa+0.003
Neg logit contributions
rav-0.007
oys-0.006
ider-0.006
bows-0.006
 lesson-0.006
Token,
Token ID11
Feature activation-5.66e-03

Pos logit contributions
rav+0.051
ider+0.049
oys+0.049
 lesson+0.049
Good+0.048
Neg logit contributions
 shave-0.033
 anchor-0.032
apa-0.031
on-0.029
 carve-0.029
Token and
Token ID290
Feature activation+1.34e-03

Pos logit contributions
Good+0.022
rav+0.022
 lesson+0.020
Sweet+0.018
S+0.018
Neg logit contributions
 shave-0.024
 anchor-0.022
on-0.022
pen-0.021
 carve-0.021
Token the
Token ID262
Feature activation-1.12e-02

Pos logit contributions
 shave+0.002
 anchor+0.002
 carve+0.002
apa+0.002
bowl+0.002
Neg logit contributions
rav-0.002
 lesson-0.002
Good-0.002
S-0.002
ider-0.002
Token teacher
Token ID4701
Feature activation-5.28e-02

Pos logit contributions
rav+0.016
S+0.015
Good+0.015
bows+0.014
oys+0.014
Neg logit contributions
 shave-0.019
 anchor-0.018
 carve-0.017
 razor-0.017
 frying-0.017
Token started
Token ID2067
Feature activation-1.07e-02

Pos logit contributions
rav+0.074
 lesson+0.073
Good+0.073
 next+0.069
S+0.069
Neg logit contributions
 shave-0.090
 anchor-0.086
 carve-0.083
apa-0.083
pen-0.082
Token the
Token ID262
Feature activation+1.53e-02

Pos logit contributions
rav+0.011
Good+0.011
 lesson+0.011
 next+0.011
S+0.010
Neg logit contributions
 shave-0.022
 anchor-0.021
 carve-0.021
apa-0.020
bowl-0.020
Token lesson
Token ID11483
Feature activation-7.20e-02

Pos logit contributions
 shave+0.026
 anchor+0.024
 carve+0.023
pen+0.022
 frying+0.022
Neg logit contributions
rav-0.022
S-0.021
Good-0.021
oys-0.020
 lesson-0.020
Token.
Token ID13
Feature activation-6.77e-03

Pos logit contributions
Good+0.143
rav+0.141
Tweet+0.137
opard+0.135
ses+0.135
Neg logit contributions
pen-0.072
 shave-0.071
on-0.068
apa-0.065
ime-0.063
Token<|endoftext|>
Token ID50256
Feature activation-1.20e-02

Pos logit contributions
rav+0.011
 lesson+0.011
 next+0.011
Good+0.011
 seek+0.010
Neg logit contributions
 shave-0.010
apa-0.009
 carve-0.009
pen-0.009
bowl-0.009
TokenJosh
Token ID23808
Feature activation+3.86e-02

Pos logit contributions
rav+0.019
 lesson+0.019
Good+0.018
 next+0.018
ider+0.018
Neg logit contributions
 shave-0.018
bowl-0.017
pen-0.017
 anchor-0.016
on-0.016
Token and
Token ID290
Feature activation+7.75e-03

Pos logit contributions
 shave+0.077
 anchor+0.075
 carve+0.072
apa+0.072
bowl+0.071
Neg logit contributions
rav-0.042
 lesson-0.041
Good-0.041
 next-0.041
S-0.038
Token Jane
Token ID12091
Feature activation+4.89e-03

Pos logit contributions
 shave+0.012
 anchor+0.011
 carve+0.011
apa+0.011
bowl+0.011
Neg logit contributions
rav-0.012
 lesson-0.012
Good-0.012
 next-0.012
ider-0.011