TOP ACTIVATIONS
MAX = 6.265

liked to play with her ham ster , M imi .
the ghost and was <|endoftext|> Dr ina walked into the room
wanted him to take his medicine . Will was so mad
your fault and ask for mercy . I will spare you
. She had a pet ham ster named Spark y .
He found it wrapped in seaw eed and brought it back
mom took her to a yoga class . Lily didn 't
says : " Dr ink some water . It
to always ask for help rather than steal .
" Every time I ask nicely , you don 't listen
very scared and begged for mercy . He said , '
and said " hi St itch !" St
girl put on her special yoga clothes and went to her
as his mum opened the l <|endoftext|> Dave and Amy were
was a small creature called Flip . Flip was a friendly
they needed to bury the ham ster in the backyard .
where he stood on his hind legs and spun around in
borrowed her mom 's magn ifying glass to take a closer
happy with her new pet ham ster and loved him very
, she would say " Please motor , will you take

MIN ACTIVATIONS
MIN = -6.630

be happy and well - beh aved . The end .
became friendly and well - beh aved , just like Buddy
. The driver and the paramed ic jumped out . They
, but it was all fore shadow ed by the kind
and dirty . And maybe there is something in the p
the hills . Everywhere P add ington went , he found
grass and rye grass and al f alf a and cl
together happily and never qu arre lled again . <|endoftext|> Once
hands and they never qu arre led again . <|endoftext|> Once
safe again . The town spe ople thanked the bear many
the tea party . Maybe there are some cookies left for
. They have no umb rell as . They
good friends and never qu arre led again . <|endoftext|> Once
about rain bows and umb rell as . They were very
toys and puzzles and cr ay ons . There are hats
could be and never mis beh aved . <|endoftext|> Once upon
sharing is important and qu arre lling is not good .
their station . They hope there will be no more false
mom m ies and d add ies , who cl apped
Pam are friends again and there is peace between them .

INTERVAL 3.041 - 6.265
CONTAINS 0.541%

home . <|endoftext|> L ily liked to play with her doll
the kitchen to get a cookie , but her mom said
. They liked to catch bugs and leaves with the net
, she said , " Please , Lily , take the
she didn 't play with anymore . Her mom helped her

INTERVAL -0.183 - 3.041
CONTAINS 55.730%

After a while , Tim my and Max got
a cool one on her instead . Lily felt much better
you collect them ?" Asked grandma . Kat y
. Tom my was glad he had avoided the
my and Daddy gathered Bobby up in a big hug .

INTERVAL -3.406 - -0.183
CONTAINS 43.344%

and climbed up a tree . The all igator tried to
every day . They were happy . The moral of the
or package in it . Jack y wanted
to read the whole spell . Everyone in the class was
" Wow , they are so nice ," Tom said

INTERVAL -6.630 - -3.406
CONTAINS 0.384%

away . The little boy thanked the baker for
help and learn more . There is no need to be
Sam !". Sam wasn 't going to give up . She
In the end , they all had a great time playing
all over the place . Ag nes said ,
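
NOTE ON THE INTERVALS
The four INTERVAL ranges above look like equal-width bins spanning MIN = -6.630 to MAX = 6.265. This is inferred from the listed bounds and is not stated by the source. The sketch below recomputes the bin edges under that assumption and sums the CONTAINS percentages. The helper name equal_width_edges is hypothetical.

```python
# Minimal sketch, assuming the dashboard splits [MIN, MAX] into
# four equal-width bins (an inference from the listed bounds).

def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins over [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# MIN and MAX as listed in the dump.
edges = equal_width_edges(-6.630, 6.265, 4)

# Print the intervals from highest to lowest, matching the dump's order.
for left, right in reversed(list(zip(edges[:-1], edges[1:]))):
    print(f"INTERVAL {left:.3f} - {right:.3f}")

# CONTAINS percentages as listed, highest interval first.
shares = [0.541, 55.730, 43.344, 0.384]
total = sum(shares)
print(f"TOTAL {total:.3f}%")
```

The recomputed edges agree with the listed bounds to within rounding, and the shares sum to roughly 100%, so nearly all tokens fall in the two middle bins.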
Token liked
Token ID8288
Feature activation+2.12e+00

Pos logit contributions
 Flip+0.038
 shark+0.037
 Mitch+0.036
 fin+0.036
 Finn+0.034
Neg logit contributions
cases-0.038
packing-0.037
cakes-0.037
grand-0.037
days-0.035
Token to
Token ID284
Feature activation-6.58e-01

Pos logit contributions
 shark+0.077
 Flip+0.077
 Mitch+0.074
 fin+0.074
 lobster+0.071
Neg logit contributions
cases-0.056
packing-0.055
grand-0.054
cakes-0.054
days-0.052
Token play
Token ID711
Feature activation-3.33e-02

Pos logit contributions
cases+0.017
packing+0.017
cakes+0.017
grand+0.017
ics+0.016
Neg logit contributions
 shark-0.024
 Flip-0.024
 Mitch-0.023
 fin-0.023
 lobster-0.022
Token with
Token ID351
Feature activation+1.82e+00

Pos logit contributions
ics+0.001
cakes+0.001
house+0.001
pack+0.001
days+0.001
Neg logit contributions
 fin-0.001
 Mitch-0.001
 Shark-0.001
 Flip-0.001
 shark-0.001
Token her
Token ID607
Feature activation+5.91e-01

Pos logit contributions
 shark+0.073
 Flip+0.072
 fin+0.071
 Mitch+0.071
 Shark+0.068
Neg logit contributions
 brushes-0.041
cakes-0.040
cases-0.040
grand-0.039
ics-0.038
Token ham
Token ID8891
Feature activation+6.27e+00

Pos logit contributions
 Flip+0.026
 shark+0.026
 Mitch+0.025
 fin+0.025
 Rory+0.024
Neg logit contributions
cases-0.011
grand-0.011
packing-0.011
cakes-0.011
 brushes-0.010
Tokenster
Token ID1706
Feature activation+2.67e+00

Pos logit contributions
 Flip+0.198
 shark+0.189
 Rory+0.189
 Otto+0.188
 Mitch+0.187
Neg logit contributions
packs-0.189
cases-0.188
cakes-0.183
ics-0.180
bows-0.175
Token,
Token ID11
Feature activation+9.72e-01

Pos logit contributions
 Flip+0.086
 Mitch+0.086
 Rory+0.084
 shark+0.083
 Shark+0.083
Neg logit contributions
packs-0.080
house-0.076
cases-0.075
cakes-0.074
days-0.073
Token M
Token ID337
Feature activation+1.77e-01

Pos logit contributions
 shark+0.031
 Flip+0.031
 Mitch+0.031
 fin+0.030
 lobster+0.029
Neg logit contributions
 brushes-0.028
 Kim-0.028
 Sarah-0.028
grand-0.027
days-0.026
Tokenimi
Token ID25236
Feature activation+1.67e-01

Pos logit contributions
 Flip+0.005
 shark+0.005
 fin+0.005
 Mitch+0.005
 Rory+0.005
Neg logit contributions
cases-0.006
cakes-0.006
grand-0.006
packing-0.006
days-0.006
Token.
Token ID13
Feature activation+5.41e-01

Pos logit contributions
 shark+0.004
 Flip+0.004
 fin+0.004
 Mitch+0.004
 lobster+0.004
Neg logit contributions
cases-0.006
grand-0.006
packing-0.006
cakes-0.006
days-0.006
Token the
Token ID262
Feature activation-1.95e-01

Pos logit contributions
 shark+0.023
 Flip+0.023
 fin+0.023
 Mitch+0.022
 lobster+0.022
Neg logit contributions
 Sarah-0.014
days-0.014
 Kim-0.014
cakes-0.014
packing-0.013
Token ghost
Token ID10905
Feature activation+9.86e-01

Pos logit contributions
cases+0.005
grand+0.005
cakes+0.005
packing+0.005
days+0.005
Neg logit contributions
 Flip-0.007
 shark-0.007
 Mitch-0.007
 fin-0.007
 Finn-0.007
Token and
Token ID290
Feature activation+2.27e-03

Pos logit contributions
 shark+0.021
 Flip+0.021
 fin+0.019
 Mitch+0.019
 Finn+0.018
Neg logit contributions
packing-0.040
cases-0.039
cakes-0.039
 brushes-0.038
grand-0.038
Token was
Token ID373
Feature activation-5.08e-01

Pos logit contributions
 Flip+0.000
 shark+0.000
 Mitch+0.000
 Finn+0.000
 Rory+0.000
Neg logit contributions
cases-0.000
packing-0.000
cakes-0.000
ugs-0.000
days-0.000
Token<|endoftext|>
Token ID50256
Feature activation+3.87e-01

Pos logit contributions
cases+0.019
packing+0.018
cakes+0.018
grand+0.018
days+0.017
Neg logit contributions
 Flip-0.014
 shark-0.014
 Mitch-0.013
 fin-0.013
 Rory-0.013
TokenDr
Token ID6187
Feature activation+6.07e+00

Pos logit contributions
 shark+0.013
 fin+0.013
 Flip+0.013
 Mitch+0.012
 lobster+0.012
Neg logit contributions
days-0.011
cases-0.011
packs-0.011
cakes-0.011
grand-0.011
Tokenina
Token ID1437
Feature activation+2.15e+00

Pos logit contributions
 shark+0.176
 Flip+0.173
 fin+0.166
 Mitch+0.165
 lobster+0.161
Neg logit contributions
ics-0.208
grand-0.202
cases-0.197
 baked-0.195
any-0.194
Token walked
Token ID6807
Feature activation+5.51e-01

Pos logit contributions
 shark+0.043
 Flip+0.042
 Mitch+0.040
 fin+0.040
 Rory+0.037
Neg logit contributions
cases-0.091
cakes-0.091
packing-0.090
grand-0.087
ics-0.086
Token into
Token ID656
Feature activation+1.66e+00

Pos logit contributions
 shark+0.018
 Flip+0.018
 Mitch+0.017
 fin+0.017
 Finn+0.017
Neg logit contributions
cases-0.017
grand-0.017
packing-0.016
cakes-0.016
days-0.016
Token the
Token ID262
Feature activation+1.17e+00

Pos logit contributions
 Flip+0.048
 shark+0.048
 Mitch+0.046
 fin+0.046
 lobster+0.044
Neg logit contributions
cases-0.056
packing-0.054
grand-0.054
cakes-0.054
days-0.053
Token room
Token ID2119
Feature activation+9.20e-01

Pos logit contributions
 shark+0.043
 Flip+0.042
 fin+0.041
 Mitch+0.041
 Finn+0.039
Neg logit contributions
cases-0.032
packing-0.030
grand-0.030
cakes-0.030
days-0.029
Token wanted
Token ID2227
Feature activation+5.31e-01

Pos logit contributions
cakes+0.020
days+0.019
bows+0.019
cases+0.019
 brushes+0.019
Neg logit contributions
 shark-0.015
 Flip-0.015
 Shark-0.014
 Fish-0.014
 Finn-0.014
Token him
Token ID683
Feature activation+2.33e+00

Pos logit contributions
 Flip+0.021
 shark+0.020
 Mitch+0.020
 fin+0.020
 lobster+0.019
Neg logit contributions
cases-0.013
grand-0.012
packing-0.012
cakes-0.012
ics-0.012
Token to
Token ID284
Feature activation+4.15e-01

Pos logit contributions
 Flip+0.062
 shark+0.061
 Finn+0.058
 fin+0.058
 crabs+0.058
Neg logit contributions
packing-0.082
cakes-0.080
cases-0.079
grand-0.078
any-0.078
Token take
Token ID1011
Feature activation+1.55e+00

Pos logit contributions
 shark+0.014
 Flip+0.014
 fin+0.014
 Mitch+0.013
 crabs+0.013
Neg logit contributions
cases-0.013
cakes-0.012
packing-0.012
grand-0.011
any-0.011
Token his
Token ID465
Feature activation+1.76e+00

Pos logit contributions
 shark+0.031
 Flip+0.030
 fin+0.029
 Mitch+0.028
 lobster+0.028
Neg logit contributions
cases-0.065
grand-0.064
packing-0.062
cakes-0.062
 Kim-0.061
Token medicine
Token ID9007
Feature activation+5.97e+00

Pos logit contributions
 shark+0.064
 fin+0.060
 Flip+0.058
 Finn+0.055
 crabs+0.055
Neg logit contributions
cakes-0.051
packing-0.051
ics-0.048
cases-0.048
grand-0.047
Token.
Token ID13
Feature activation+1.77e-01

Pos logit contributions
 Flip+0.175
 shark+0.173
 Mitch+0.170
 fin+0.166
 lobster+0.161
Neg logit contributions
cases-0.204
grand-0.193
cakes-0.192
packing-0.190
ics-0.186
Token Will
Token ID2561
Feature activation-6.16e-01

Pos logit contributions
 Flip+0.005
 shark+0.005
 Mitch+0.005
 fin+0.005
 Finn+0.004
Neg logit contributions
cases-0.006
grand-0.006
packing-0.006
cakes-0.006
ics-0.006
Token was
Token ID373
Feature activation-3.43e-02

Pos logit contributions
cakes+0.018
cases+0.018
grand+0.017
packing+0.017
 brushes+0.016
Neg logit contributions
 shark-0.022
 Flip-0.021
 Mitch-0.021
 fin-0.021
 Finn-0.020
Token so
Token ID523
Feature activation-1.27e+00

Pos logit contributions
cases+0.001
packing+0.001
grand+0.001
cakes+0.001
days+0.001
Neg logit contributions
 shark-0.001
 Flip-0.001
 Mitch-0.001
 fin-0.001
 lobster-0.001
Token mad
Token ID8805
Feature activation-3.18e-01

Pos logit contributions
cakes+0.045
cases+0.044
packing+0.043
any+0.043
bows+0.042
Neg logit contributions
 Flip-0.036
 shark-0.036
 fin-0.034
 Mitch-0.034
 Finn-0.033
Token your
Token ID534
Feature activation+7.12e-01

Pos logit contributions
 Flip+0.048
 shark+0.048
 fin+0.046
 Mitch+0.046
 lobster+0.044
Neg logit contributions
cases-0.063
grand-0.062
packing-0.061
cakes-0.061
ics-0.059
Token fault
Token ID8046
Feature activation+1.80e-02

Pos logit contributions
 shark+0.024
 fin+0.023
 Flip+0.022
 Mitch+0.022
 lobster+0.021
Neg logit contributions
cases-0.022
cakes-0.022
packing-0.021
grand-0.020
ics-0.020
Token and
Token ID290
Feature activation-5.43e-01

Pos logit contributions
 Flip+0.000
 shark+0.000
 Mitch+0.000
 fin+0.000
 lobster+0.000
Neg logit contributions
cases-0.001
grand-0.001
packing-0.001
cakes-0.001
days-0.001
Token ask
Token ID1265
Feature activation+1.12e+00

Pos logit contributions
cases+0.019
cakes+0.019
days+0.018
any+0.018
packing+0.017
Neg logit contributions
 shark-0.016
 Flip-0.016
 Mitch-0.015
 fin-0.015
 Finn-0.015
Token for
Token ID329
Feature activation+1.59e+00

Pos logit contributions
 shark+0.035
 Flip+0.035
 fin+0.034
 Mitch+0.033
 lobster+0.033
Neg logit contributions
grand-0.034
cases-0.033
cakes-0.032
days-0.032
packing-0.032
Token mercy
Token ID17703
Feature activation+5.84e+00

Pos logit contributions
 Flip+0.052
 shark+0.052
 Mitch+0.050
 fin+0.049
 lobster+0.048
Neg logit contributions
grand-0.046
cases-0.046
packing-0.045
cakes-0.045
days-0.044
Token.
Token ID13
Feature activation-1.50e-01

Pos logit contributions
 Flip+0.184
 shark+0.176
 Mitch+0.175
 Otto+0.167
 Richie+0.166
Neg logit contributions
days-0.192
packs-0.187
grand-0.179
cases-0.174
packing-0.170
Token I
Token ID314
Feature activation+1.43e+00

Pos logit contributions
grand+0.004
 Kim+0.004
packing+0.004
cases+0.004
any+0.003
Neg logit contributions
 shark-0.006
 crabs-0.005
 lobster-0.005
 Flip-0.005
 fin-0.005
Token will
Token ID481
Feature activation+8.23e-01

Pos logit contributions
 Flip+0.051
 shark+0.051
 Mitch+0.049
 fin+0.049
 Finn+0.047
Neg logit contributions
cases-0.040
cakes-0.039
packing-0.038
grand-0.037
ics-0.037
Token spare
Token ID13952
Feature activation+1.15e-01

Pos logit contributions
 shark+0.031
 Flip+0.031
 Mitch+0.030
 fin+0.029
 lobster+0.029
Neg logit contributions
grand-0.020
cases-0.020
packing-0.020
ics-0.019
days-0.019
Token you
Token ID345
Feature activation-8.27e-01

Pos logit contributions
 Flip+0.004
 shark+0.004
 Mitch+0.004
 lobster+0.004
 fin+0.004
Neg logit contributions
grand-0.003
cases-0.002
 Kim-0.002
 brushes-0.002
cakes-0.002
Token.
Token ID13
Feature activation+7.04e-01

Pos logit contributions
 shark+0.046
 Flip+0.046
 Mitch+0.043
 fin+0.042
 lobster+0.041
Neg logit contributions
grand-0.066
cases-0.065
packing-0.064
any-0.064
days-0.063
Token She
Token ID1375
Feature activation+1.26e+00

Pos logit contributions
 shark+0.030
 fin+0.029
 Flip+0.027
 crabs+0.027
 Mitch+0.027
Neg logit contributions
 Sarah-0.015
cakes-0.015
 Kim-0.015
any-0.015
 brushes-0.015
Token had
Token ID550
Feature activation-7.18e-01

Pos logit contributions
 Flip+0.041
 shark+0.041
 Mitch+0.039
 fin+0.039
 Finn+0.038
Neg logit contributions
cases-0.039
cakes-0.037
packing-0.037
grand-0.036
ics-0.035
Token a
Token ID257
Feature activation-1.56e+00

Pos logit contributions
cases+0.033
packing+0.032
days+0.031
cakes+0.031
grand+0.031
Neg logit contributions
 Flip-0.013
 Mitch-0.013
 shark-0.012
 fin-0.012
 crabs-0.012
Token pet
Token ID4273
Feature activation+1.82e+00

Pos logit contributions
cases+0.043
packing+0.042
grand+0.042
cakes+0.041
days+0.040
Neg logit contributions
 Flip-0.055
 shark-0.055
 Mitch-0.053
 fin-0.053
 Finn-0.051
Token ham
Token ID8891
Feature activation+5.84e+00

Pos logit contributions
 fin+0.065
 Mitch+0.063
Tony+0.062
 shark+0.061
 Flip+0.060
Neg logit contributions
cakes-0.046
grand-0.045
cases-0.042
 Kim-0.041
 brushes-0.041
Tokenster
Token ID1706
Feature activation+2.54e+00

Pos logit contributions
 shark+0.182
Tony+0.180
 conclusions+0.180
Nice+0.177
 Richie+0.176
Neg logit contributions
ies-0.157
tr-0.149
 baked-0.148
cases-0.148
ics-0.147
Token named
Token ID3706
Feature activation+1.93e+00

Pos logit contributions
Tony+0.088
 shark+0.086
 Mitch+0.085
 Flip+0.085
 Richie+0.084
Neg logit contributions
grand-0.071
house-0.065
 baked-0.063
cases-0.062
cakes-0.061
Token Spark
Token ID17732
Feature activation+2.82e+00

Pos logit contributions
 shark+0.056
 fin+0.054
 crabs+0.049
 lobster+0.048
 coral+0.048
Neg logit contributions
grand-0.063
 Kim-0.062
 Sarah-0.062
 Susan-0.060
cases-0.060
Tokeny
Token ID88
Feature activation+1.63e+00

Pos logit contributions
 shark+0.106
 Flip+0.106
 fin+0.105
 Mitch+0.104
 lobster+0.099
Neg logit contributions
cases-0.072
grand-0.068
packing-0.068
any-0.065
cakes-0.065
Token.
Token ID13
Feature activation+3.62e-01

Pos logit contributions
 shark+0.054
 Flip+0.054
 fin+0.053
 Mitch+0.053
 lobster+0.051
Neg logit contributions
cases-0.047
grand-0.047
bows-0.046
any-0.044
packs-0.044
Token He
Token ID679
Feature activation+1.77e+00

Pos logit contributions
 shark+0.002
 Flip+0.002
 Mitch+0.002
 fin+0.002
 lobster+0.002
Neg logit contributions
cases-0.003
packing-0.003
cakes-0.003
grand-0.003
days-0.003
Token found
Token ID1043
Feature activation-1.02e+00

Pos logit contributions
 Flip+0.069
 shark+0.069
 Mitch+0.067
 fin+0.065
 lobster+0.065
Neg logit contributions
cases-0.041
packing-0.040
grand-0.038
cakes-0.037
days-0.037
Token it
Token ID340
Feature activation-8.36e-02

Pos logit contributions
 Kim+0.023
 Sarah+0.022
 unlocked+0.021
 Susan+0.021
 baked+0.019
Neg logit contributions
ascal-0.039
olo-0.037
 lobster-0.036
 shark-0.036
Col-0.036
Token wrapped
Token ID12908
Feature activation+1.00e+00

Pos logit contributions
cases+0.003
packing+0.002
ics+0.002
cakes+0.002
 baked+0.002
Neg logit contributions
 shark-0.003
 Flip-0.003
 Mitch-0.003
 lobster-0.003
 Rory-0.002
Token in
Token ID287
Feature activation+1.57e+00

Pos logit contributions
ascal+0.039
olo+0.038
 sincere+0.038
 lobster+0.037
 Flip+0.037
Neg logit contributions
 Sarah-0.020
 Kim-0.019
packing-0.018
ci-0.017
 Susan-0.017
Token seaw
Token ID32085
Feature activation+5.84e+00

Pos logit contributions
ascal+0.056
 Mitch+0.054
olo+0.054
 Flip+0.054
 Richie+0.053
Neg logit contributions
 brushes-0.037
 toys-0.036
 Sarah-0.036
 Kim-0.035
 baked-0.035
Tokeneed
Token ID2308
Feature activation+3.49e+00

Pos logit contributions
 Flip+0.183
 shark+0.175
 Frog+0.174
 Richie+0.173
 sincere+0.171
Neg logit contributions
packing-0.167
EEP-0.167
ages-0.162
cases-0.161
z-0.159
Token and
Token ID290
Feature activation+8.96e-01

Pos logit contributions
 Flip+0.119
 shark+0.115
 Mitch+0.115
 fin+0.112
Nice+0.111
Neg logit contributions
cases-0.100
cakes-0.088
ics-0.086
packing-0.086
packs-0.082
Token brought
Token ID3181
Feature activation-1.07e+00

Pos logit contributions
 Flip+0.031
 shark+0.031
 fin+0.031
 Mitch+0.030
 Finn+0.030
Neg logit contributions
grand-0.026
cases-0.026
packing-0.026
cakes-0.025
days-0.025
Token it
Token ID340
Feature activation-1.87e-01

Pos logit contributions
 Kim+0.026
packing+0.026
 Sarah+0.026
cases+0.026
 baked+0.025
Neg logit contributions
 Flip-0.037
 shark-0.037
 lobster-0.036
ascal-0.035
 Mitch-0.034
Token back
Token ID736
Feature activation-5.43e-01

Pos logit contributions
cases+0.005
packing+0.005
cakes+0.005
grand+0.005
days+0.005
Neg logit contributions
 Flip-0.007
 shark-0.007
 Mitch-0.007
 lobster-0.006
 fin-0.006
Token mom
Token ID1995
Feature activation-1.34e+00

Pos logit contributions
packing+0.054
grand+0.054
days+0.053
cases+0.053
cakes+0.052
Neg logit contributions
 shark-0.038
 Flip-0.035
 fin-0.034
 Finn-0.033
 Mitch-0.032
Token took
Token ID1718
Feature activation+3.26e-01

Pos logit contributions
cases+0.053
bows+0.052
packs+0.051
days+0.050
keeping+0.049
Neg logit contributions
 like-0.029
 Flip-0.028
 also-0.028
 Fish-0.027
 then-0.027
Token her
Token ID607
Feature activation+4.91e-01

Pos logit contributions
 Flip+0.009
 Mitch+0.008
 Finn+0.008
 Rory+0.008
 shark+0.008
Neg logit contributions
cases-0.013
packing-0.012
days-0.012
grand-0.012
cakes-0.012
Token to
Token ID284
Feature activation+2.28e-01

Pos logit contributions
 shark+0.012
 Flip+0.011
 fin+0.011
 Mitch+0.010
 crabs+0.010
Neg logit contributions
cakes-0.018
packing-0.018
cases-0.018
days-0.018
bows-0.018
Token a
Token ID257
Feature activation+7.39e-01

Pos logit contributions
 Flip+0.004
 shark+0.004
 Mitch+0.003
 Otto+0.003
 Rory+0.003
Neg logit contributions
cases-0.012
cakes-0.011
packing-0.011
ugs-0.011
z-0.011
Token yoga
Token ID20351
Feature activation+5.83e+00

Pos logit contributions
 Flip+0.028
 Mitch+0.028
 fin+0.027
 shark+0.027
 Finn+0.026
Neg logit contributions
packing-0.017
cases-0.017
cakes-0.017
grand-0.017
 baked-0.016
Token class
Token ID1398
Feature activation+9.57e-01

Pos logit contributions
 Flip+0.238
 Mitch+0.232
 fin+0.229
 shark+0.228
 Rory+0.225
Neg logit contributions
cakes-0.116
packing-0.116
grand-0.114
cases-0.113
ital-0.110
Token.
Token ID13
Feature activation-5.40e-02

Pos logit contributions
 Flip+0.011
 shark+0.011
 rather+0.010
 crabs+0.010
 Mitch+0.010
Neg logit contributions
packing-0.044
cakes-0.043
cases-0.043
grand-0.043
ics-0.043
Token Lily
Token ID20037
Feature activation-1.13e+00

Pos logit contributions
grand+0.002
ics+0.002
packing+0.002
cases+0.002
days+0.002
Neg logit contributions
 Flip-0.001
 Finn-0.001
 Mitch-0.001
 Rocky-0.001
 Otto-0.001
Token didn
Token ID1422
Feature activation+7.01e-02

Pos logit contributions
grand+0.039
cakes+0.038
packs+0.038
days+0.037
cases+0.037
Neg logit contributions
 Flip-0.034
 shark-0.031
 Mitch-0.031
 Finn-0.031
 Rory-0.030
Token't
Token ID470
Feature activation-4.16e-01

Pos logit contributions
 shark+0.003
 Flip+0.003
 Mitch+0.003
 fin+0.003
 lobster+0.002
Neg logit contributions
cases-0.002
grand-0.002
packing-0.002
cakes-0.002
days-0.002
Token says
Token ID1139
Feature activation+2.18e-01

Pos logit contributions
days+0.023
packing+0.023
any+0.023
grand+0.023
z+0.023
Neg logit contributions
 shark-0.016
 Flip-0.015
 Mitch-0.015
 crabs-0.015
 Finn-0.015
Token:
Token ID25
Feature activation+1.24e+00

Pos logit contributions
 Flip+0.007
 shark+0.007
 fin+0.006
 Mitch+0.006
 lobster+0.006
Neg logit contributions
cases-0.007
cakes-0.007
grand-0.007
packing-0.007
days-0.007
Token
Token ID198
Feature activation+1.78e+00

Pos logit contributions
 shark+0.042
 Flip+0.041
 fin+0.040
 Mitch+0.039
 crabs+0.038
Neg logit contributions
cases-0.036
packing-0.035
grand-0.035
cakes-0.035
days-0.034
Token
Token ID198
Feature activation+2.00e+00

Pos logit contributions
 shark+0.047
 Flip+0.046
 Mitch+0.044
 fin+0.044
 lobster+0.042
Neg logit contributions
cases-0.065
grand-0.064
packing-0.064
cakes-0.063
days-0.062
Token"
Token ID1
Feature activation-2.31e-01

Pos logit contributions
 Flip+0.060
 shark+0.058
 Mitch+0.058
 fin+0.056
 Finn+0.055
Neg logit contributions
cases-0.066
cakes-0.063
packing-0.062
grand-0.061
days-0.061
TokenDr
Token ID6187
Feature activation+5.81e+00

Pos logit contributions
cases+0.008
grand+0.008
packing+0.008
cakes+0.008
days+0.008
Neg logit contributions
 Flip-0.006
 shark-0.006
 Mitch-0.006
 fin-0.006
 Finn-0.006
Tokenink
Token ID676
Feature activation+6.42e-01

Pos logit contributions
 Flip+0.181
 shark+0.181
 Mitch+0.173
 fin+0.172
 Finn+0.165
Neg logit contributions
cases-0.185
grand-0.180
packing-0.180
cakes-0.179
days-0.173
Token some
Token ID617
Feature activation-1.04e+00

Pos logit contributions
 shark+0.018
 Mitch+0.017
 Flip+0.017
 fin+0.016
 lobster+0.015
Neg logit contributions
cases-0.023
packing-0.023
cakes-0.022
days-0.022
grand-0.022
Token water
Token ID1660
Feature activation+1.76e+00

Pos logit contributions
cakes+0.028
cases+0.028
grand+0.028
 brushes+0.027
packing+0.027
Neg logit contributions
 Flip-0.037
 Mitch-0.035
 shark-0.035
 fin-0.034
 lobster-0.034
Token.
Token ID13
Feature activation+1.14e+00

Pos logit contributions
 shark+0.050
 Flip+0.049
 Mitch+0.047
 fin+0.047
 Finn+0.045
Neg logit contributions
cases-0.061
packing-0.059
grand-0.059
cakes-0.058
days-0.057
Token It
Token ID632
Feature activation+1.95e-01

Pos logit contributions
 shark+0.038
 Flip+0.037
 lobster+0.036
 fin+0.036
 crabs+0.036
Neg logit contributions
grand-0.032
cases-0.031
packing-0.031
cakes-0.030
 Kim-0.030
Token to
Token ID284
Feature activation-5.82e-01

Pos logit contributions
 shark+0.011
 Flip+0.011
 fin+0.011
 Mitch+0.011
 lobster+0.010
Neg logit contributions
cases-0.010
grand-0.010
packing-0.010
cakes-0.010
ics-0.009
Token always
Token ID1464
Feature activation+2.35e+00

Pos logit contributions
grand+0.014
packing+0.014
cases+0.013
ics+0.013
cakes+0.013
Neg logit contributions
 shark-0.022
 Flip-0.022
 Mitch-0.021
 fin-0.021
 lobster-0.020
Token ask
Token ID1265
Feature activation+1.89e+00

Pos logit contributions
 Flip+0.086
 shark+0.086
 Mitch+0.083
 fin+0.082
 Finn+0.079
Neg logit contributions
cases-0.061
packing-0.060
grand-0.060
cakes-0.059
days-0.057
Token for
Token ID329
Feature activation+1.83e+00

Pos logit contributions
 shark+0.063
 Flip+0.063
 fin+0.061
 Mitch+0.059
 lobster+0.058
Neg logit contributions
cases-0.052
grand-0.052
days-0.049
cakes-0.049
 Sarah-0.049
Token help
Token ID1037
Feature activation+9.07e-01

Pos logit contributions
 Flip+0.061
 shark+0.060
 Mitch+0.058
 fin+0.057
 Rory+0.056
Neg logit contributions
grand-0.052
cases-0.050
packing-0.050
cakes-0.049
days-0.049
Token rather
Token ID2138
Feature activation+5.78e+00

Pos logit contributions
 shark+0.026
 Flip+0.026
 Mitch+0.025
 fin+0.025
 lobster+0.024
Neg logit contributions
cases-0.030
grand-0.029
days-0.029
cakes-0.028
ics-0.028
Token than
Token ID621
Feature activation+1.26e+00

Pos logit contributions
 Flip+0.195
 shark+0.193
 Mitch+0.186
 fin+0.180
 Finn+0.180
Neg logit contributions
grand-0.168
cases-0.165
packing-0.161
days-0.158
packs-0.157
Token steal
Token ID8711
Feature activation+7.84e-01

Pos logit contributions
 Flip+0.040
 shark+0.040
 Mitch+0.039
 fin+0.038
 Finn+0.037
Neg logit contributions
cases-0.038
grand-0.038
packing-0.037
cakes-0.037
days-0.036
Token.
Token ID13
Feature activation-9.75e-02

Pos logit contributions
 Flip+0.022
 shark+0.022
 Mitch+0.021
 fin+0.021
 lobster+0.020
Neg logit contributions
cases-0.027
grand-0.026
cakes-0.025
packs-0.025
packing-0.025
Token
Token ID198
Feature activation-9.37e-02

Pos logit contributions
cases+0.003
grand+0.003
packing+0.003
cakes+0.003
days+0.003
Neg logit contributions
 shark-0.003
 Flip-0.003
 fin-0.003
 Mitch-0.003
 lobster-0.003
Token
Token ID198
Feature activation+4.96e-01

Pos logit contributions
grand+0.003
cakes+0.003
ics+0.003
packing+0.003
cases+0.003
Neg logit contributions
 shark-0.003
 fin-0.003
 Flip-0.003
 crabs-0.003
 Mitch-0.003
Token "
Token ID366
Feature activation+1.56e+00

Pos logit contributions
 Flip+0.016
 shark+0.016
 Mitch+0.015
 fin+0.015
 lobster+0.015
Neg logit contributions
cases-0.021
cakes-0.020
packing-0.020
grand-0.020
days-0.019
TokenEvery
Token ID6109
Feature activation+4.56e-01

Pos logit contributions
 Flip+0.037
 Mitch+0.035
 shark+0.034
 lobster+0.033
 Shark+0.033
Neg logit contributions
cases-0.062
packs-0.059
packing-0.058
ics-0.058
grand-0.057
Token time
Token ID640
Feature activation-7.67e-01

Pos logit contributions
 shark+0.017
 Flip+0.016
 fin+0.016
 Mitch+0.016
 crabs+0.015
Neg logit contributions
cases-0.013
packing-0.012
cakes-0.012
grand-0.012
any-0.011
Token I
Token ID314
Feature activation+2.86e+00

Pos logit contributions
cases+0.027
packing+0.026
grand+0.026
cakes+0.026
any+0.025
Neg logit contributions
 Flip-0.021
 shark-0.020
 Mitch-0.020
 Finn-0.019
 Rory-0.019
Token ask
Token ID1265
Feature activation+2.19e+00

Pos logit contributions
 Flip+0.100
 shark+0.099
 Mitch+0.095
 fin+0.095
 Finn+0.093
Neg logit contributions
cases-0.081
cakes-0.078
packing-0.078
grand-0.077
ics-0.075
Token nicely
Token ID16576
Feature activation+5.77e+00

Pos logit contributions
 Flip+0.065
 shark+0.064
 Mitch+0.063
 fin+0.061
 Finn+0.059
Neg logit contributions
cases-0.073
packing-0.071
cakes-0.070
grand-0.070
days-0.068
Token,
Token ID11
Feature activation+1.92e+00

Pos logit contributions
 Flip+0.164
 shark+0.164
 Mitch+0.157
 fin+0.156
 lobster+0.149
Neg logit contributions
cases-0.197
grand-0.192
packing-0.192
cakes-0.191
days-0.185
Token you
Token ID345
Feature activation+7.07e-01

Pos logit contributions
 Flip+0.050
 shark+0.049
 Mitch+0.047
 fin+0.047
 Finn+0.045
Neg logit contributions
cases-0.071
packing-0.069
grand-0.069
cakes-0.068
ics-0.067
Token don
Token ID836
Feature activation+2.26e+00

Pos logit contributions
 Flip+0.021
 shark+0.021
 Mitch+0.020
 fin+0.020
 Finn+0.019
Neg logit contributions
cases-0.024
cakes-0.023
grand-0.023
packing-0.023
ics-0.022
Token't
Token ID470
Feature activation-9.68e-02

Pos logit contributions
 Flip+0.083
 shark+0.082
 Mitch+0.079
 fin+0.078
 Finn+0.076
Neg logit contributions
cases-0.060
packing-0.058
grand-0.058
cakes-0.057
days-0.056
Token listen
Token ID6004
Feature activation+7.06e-01

Pos logit contributions
cases+0.003
packing+0.003
grand+0.003
cakes+0.003
days+0.003
Neg logit contributions
 Flip-0.003
 shark-0.003
 Mitch-0.003
 fin-0.003
 lobster-0.003
Token very
Token ID845
Feature activation+6.10e-01

Pos logit contributions
cases+0.003
 Kim+0.003
 baked+0.003
ics+0.003
 unlocked+0.003
Neg logit contributions
 shark-0.004
 Flip-0.004
 Mitch-0.004
 lobster-0.004
 Richie-0.004
Token scared
Token ID12008
Feature activation+2.94e-01

Pos logit contributions
 Flip+0.020
 shark+0.019
 Mitch+0.019
 fin+0.018
 Richie+0.018
Neg logit contributions
cases-0.018
grand-0.017
packing-0.017
cakes-0.017
ics-0.017
Token and
Token ID290
Feature activation+3.52e-01

Pos logit contributions
 Flip+0.012
 Mitch+0.012
 lobster+0.012
Col+0.012
 Richie+0.012
Neg logit contributions
 Kim-0.007
 Sarah-0.006
cases-0.005
 Susan-0.005
packs-0.005
Token begged
Token ID32946
Feature activation-3.92e-01

Pos logit contributions
 Flip+0.012
 shark+0.012
 Mitch+0.011
 fin+0.011
 lobster+0.011
Neg logit contributions
cases-0.010
packing-0.010
grand-0.010
cakes-0.010
days-0.009
Token for
Token ID329
Feature activation+1.52e+00

Pos logit contributions
 Kim+0.013
 Sarah+0.013
 Susan+0.011
grand+0.011
packs+0.011
Neg logit contributions
 shark-0.011
 fin-0.011
 Flip-0.011
Nice-0.011
 lobster-0.010
Token mercy
Token ID17703
Feature activation+5.76e+00

Pos logit contributions
 Flip+0.053
 lobster+0.052
 shark+0.052
 fin+0.050
 Mitch+0.050
Neg logit contributions
 Kim-0.038
 Sarah-0.035
 Susan-0.034
grand-0.033
cakes-0.033
Token.
Token ID13
Feature activation+3.30e-01

Pos logit contributions
 Flip+0.214
 shark+0.206
 Mitch+0.206
 Richie+0.198
 Otto+0.196
Neg logit contributions
days-0.154
packs-0.146
cases-0.127
any-0.124
 Kim-0.123
Token He
Token ID679
Feature activation+1.99e+00

Pos logit contributions
 shark+0.011
 fin+0.010
 Flip+0.010
 crabs+0.010
 coral+0.010
Neg logit contributions
cases-0.010
 Kim-0.010
packing-0.009
cakes-0.009
grand-0.009
Token said
Token ID531
Feature activation-5.51e-01

Pos logit contributions
 shark+0.081
 Flip+0.080
 Mitch+0.078
 fin+0.078
 lobster+0.076
Neg logit contributions
cases-0.044
packing-0.043
grand-0.042
cakes-0.042
days-0.039
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
 Kim+0.016
cases+0.016
cakes+0.016
grand+0.016
 Sarah+0.015
Neg logit contributions
 Flip-0.016
 shark-0.016
 fin-0.016
 lobster-0.016
 coral-0.016
Token '
Token ID705
Feature activation+6.54e-01

Pos logit contributions
 shark+0.046
 crabs+0.045
 lobster+0.044
 coral+0.044
 Flip+0.044
Neg logit contributions
cases-0.019
packing-0.019
 Kim-0.018
cakes-0.018
grand-0.017
Token and
Token ID290
Feature activation+9.77e-01

Pos logit contributions
 Flip+0.014
 shark+0.014
 Mitch+0.013
 fin+0.013
 Finn+0.012
Neg logit contributions
cases-0.020
grand-0.020
packing-0.020
cakes-0.020
days-0.019
Token said
Token ID531
Feature activation+1.16e-01

Pos logit contributions
 shark+0.032
 Flip+0.031
 Mitch+0.030
 fin+0.029
 Finn+0.029
Neg logit contributions
days-0.031
cases-0.031
cakes-0.031
ics-0.030
ugs-0.030
Token "
Token ID366
Feature activation+2.12e+00

Pos logit contributions
 Flip+0.004
 fin+0.004
 shark+0.004
 Mitch+0.004
 lobster+0.003
Neg logit contributions
cases-0.003
cakes-0.003
grand-0.003
packing-0.003
ics-0.003
Tokenhi
Token ID5303
Feature activation+1.57e+00

Pos logit contributions
 shark+0.056
 Flip+0.056
 Mitch+0.054
 fin+0.053
 Finn+0.051
Neg logit contributions
cases-0.077
grand-0.075
packing-0.075
cakes-0.075
days-0.073
Token St
Token ID520
Feature activation+5.13e+00

Pos logit contributions
 shark+0.055
 fin+0.055
 Mitch+0.054
 Flip+0.053
 coral+0.052
Neg logit contributions
grand-0.040
packing-0.040
cakes-0.039
cases-0.039
days-0.038
Tokenitch
Token ID2007
Feature activation+5.74e+00

Pos logit contributions
 shark+0.096
 Flip+0.092
 fin+0.085
 crabs+0.083
 coral+0.083
Neg logit contributions
cases-0.225
packing-0.223
grand-0.223
EEP-0.218
cakes-0.215
Token!"
Token ID2474
Feature activation+1.66e+00

Pos logit contributions
 shark+0.177
 Flip+0.176
 fin+0.167
 Mitch+0.167
 Finn+0.159
Neg logit contributions
cases-0.192
grand-0.186
cakes-0.186
packing-0.182
 brushes-0.178
Token
Token ID220
Feature activation-3.64e-01

Pos logit contributions
 shark+0.053
 crabs+0.050
 fin+0.049
 coral+0.048
 lobster+0.048
Neg logit contributions
cases-0.052
grand-0.052
packing-0.049
 Kim-0.049
cakes-0.049
Token
Token ID198
Feature activation+1.40e+00

Pos logit contributions
cases+0.012
grand+0.012
packing+0.012
cakes+0.012
ics+0.012
Neg logit contributions
 shark-0.011
 Flip-0.010
 fin-0.010
 crabs-0.010
 lobster-0.010
Token
Token ID198
Feature activation+1.73e+00

Pos logit contributions
 shark+0.055
 crabs+0.053
 fin+0.051
 lobster+0.051
 seaw+0.050
Neg logit contributions
grand-0.038
z-0.035
cases-0.034
packing-0.033
eth-0.032
TokenSt
Token ID1273
Feature activation+1.31e+00

Pos logit contributions
 shark+0.064
 crabs+0.061
 fin+0.059
 lobster+0.058
 Finn+0.057
Neg logit contributions
grand-0.051
packing-0.048
cases-0.048
z-0.047
EEP-0.044
Token girl
Token ID2576
Feature activation-1.03e+00

Pos logit contributions
cases+0.041
packing+0.039
grand+0.038
ics+0.038
packs+0.038
Neg logit contributions
 shark-0.045
 Mitch-0.042
 Flip-0.042
 lobster-0.042
 crabs-0.041
Token put
Token ID1234
Feature activation+5.99e-01

Pos logit contributions
grand+0.029
cases+0.029
days+0.028
packing+0.028
ics+0.027
Neg logit contributions
 Flip-0.036
 Mitch-0.034
 shark-0.034
 Finn-0.033
 lobster-0.033
Token on
Token ID319
Feature activation+1.64e-01

Pos logit contributions
 shark+0.020
 Flip+0.019
 fin+0.019
 Mitch+0.019
 lobster+0.018
Neg logit contributions
cases-0.018
packing-0.018
cakes-0.018
grand-0.017
ics-0.017
Token her
Token ID607
Feature activation-4.63e-02

Pos logit contributions
 Flip+0.005
 shark+0.005
 Mitch+0.005
 fin+0.005
 Finn+0.005
Neg logit contributions
cases-0.005
grand-0.005
packing-0.005
cakes-0.005
days-0.005
Token special
Token ID2041
Feature activation+1.28e+00

Pos logit contributions
cases+0.002
grand+0.002
packing+0.001
cakes+0.001
days+0.001
Neg logit contributions
 Flip-0.001
 shark-0.001
 Mitch-0.001
 fin-0.001
 lobster-0.001
Token yoga
Token ID20351
Feature activation+5.72e+00

Pos logit contributions
 shark+0.039
 Flip+0.038
 Mitch+0.037
 fin+0.037
 lobster+0.035
Neg logit contributions
cases-0.042
grand-0.042
packing-0.041
cakes-0.041
days-0.040
Token clothes
Token ID8242
Feature activation+1.09e+00

Pos logit contributions
 Flip+0.207
 shark+0.207
 Mitch+0.200
 fin+0.199
 Shark+0.194
Neg logit contributions
cases-0.148
cakes-0.144
packing-0.143
grand-0.140
 brushes-0.137
Token and
Token ID290
Feature activation-5.97e-01

Pos logit contributions
 Flip+0.023
 shark+0.022
 Mitch+0.022
 fin+0.021
 Rory+0.020
Neg logit contributions
grand-0.044
packing-0.043
cases-0.043
days-0.042
cakes-0.042
Token went
Token ID1816
Feature activation-1.71e+00

Pos logit contributions
grand+0.020
packing+0.019
days+0.019
cakes+0.019
cases+0.018
Neg logit contributions
 Flip-0.018
 Mitch-0.018
 shark-0.018
 Finn-0.017
 fin-0.017
Token to
Token ID284
Feature activation-9.38e-01

Pos logit contributions
grand+0.054
cases+0.053
packing+0.053
cakes+0.051
days+0.050
Neg logit contributions
 Flip-0.055
 shark-0.054
 Mitch-0.052
 fin-0.051
 Finn-0.050
Token her
Token ID607
Feature activation-2.55e-01

Pos logit contributions
cases+0.031
grand+0.030
packing+0.030
cakes+0.030
days+0.029
Neg logit contributions
 Flip-0.029
 shark-0.029
 Mitch-0.028
 fin-0.027
 Finn-0.026
Token as
Token ID355
Feature activation+2.98e-01

Pos logit contributions
 Flip+0.120
 shark+0.120
 Mitch+0.117
 fin+0.112
 Finn+0.111
Neg logit contributions
cases-0.116
days-0.110
packing-0.110
packs-0.108
cakes-0.108
Token his
Token ID465
Feature activation-2.06e-01

Pos logit contributions
 Flip+0.007
 fin+0.007
 shark+0.007
 Mitch+0.007
 lobster+0.007
Neg logit contributions
cases-0.011
 Kim-0.011
packing-0.010
grand-0.010
ics-0.010
Token mum
Token ID25682
Feature activation-1.55e-01

Pos logit contributions
cakes+0.006
cases+0.006
 brushes+0.006
grand+0.005
days+0.005
Neg logit contributions
 Flip-0.007
 Mitch-0.007
 shark-0.007
 Richie-0.006
 coral-0.006
Token opened
Token ID4721
Feature activation+2.71e-01

Pos logit contributions
cases+0.004
grand+0.004
packing+0.004
cakes+0.004
days+0.004
Neg logit contributions
 Flip-0.006
 shark-0.005
 Mitch-0.005
 fin-0.005
 Finn-0.005
Token the
Token ID262
Feature activation+1.80e+00

Pos logit contributions
 shark+0.009
 Flip+0.009
 lobster+0.009
 fin+0.009
 Mitch+0.009
Neg logit contributions
 Sarah-0.007
 Kim-0.007
packs-0.007
 brushes-0.007
cases-0.007
Token l
Token ID300
Feature activation+5.71e+00

Pos logit contributions
 Flip+0.087
 Mitch+0.085
 shark+0.084
 Finn+0.083
 Richie+0.082
Neg logit contributions
cases-0.027
 brushes-0.025
cakes-0.025
grand-0.024
packs-0.023
Token<|endoftext|>
Token ID50256
Feature activation+1.71e+00

Pos logit contributions
 shark+0.206
 Shark+0.201
 Fish+0.199
 Flip+0.199
 Finn+0.191
Neg logit contributions
cakes-0.156
ugs-0.148
cases-0.145
 brushes-0.142
packing-0.140
TokenDave
Token ID27984
Feature activation+3.05e+00

Pos logit contributions
 shark+0.047
 Flip+0.047
 Mitch+0.044
 fin+0.044
 lobster+0.042
Neg logit contributions
cases-0.061
grand-0.059
packing-0.059
cakes-0.059
days-0.057
Token and
Token ID290
Feature activation+2.02e+00

Pos logit contributions
 shark+0.068
 Flip+0.068
 fin+0.066
 Mitch+0.066
 lobster+0.061
Neg logit contributions
cases-0.127
cakes-0.125
grand-0.120
packing-0.120
ics-0.117
Token Amy
Token ID14235
Feature activation-4.32e-01

Pos logit contributions
 shark+0.048
 Flip+0.045
 fin+0.045
 Mitch+0.044
 crabs+0.042
Neg logit contributions
cases-0.084
cakes-0.082
packing-0.079
ics-0.077
grand-0.077
Token were
Token ID547
Feature activation+7.40e-01

Pos logit contributions
cases+0.010
packing+0.010
grand+0.010
cakes+0.010
days+0.009
Neg logit contributions
 Flip-0.017
 shark-0.017
 Mitch-0.016
 fin-0.016
 Finn-0.016
Token was
Token ID373
Feature activation+1.88e+00

Pos logit contributions
cases+0.062
grand+0.061
packing+0.061
days+0.060
cakes+0.060
Neg logit contributions
 Flip-0.128
 shark-0.128
 Mitch-0.125
 fin-0.125
 lobster-0.122
Token a
Token ID257
Feature activation-7.38e-01

Pos logit contributions
 shark+0.063
 fin+0.061
 Flip+0.061
 Mitch+0.061
 lobster+0.059
Neg logit contributions
 baked-0.051
ics-0.051
 brushes-0.050
 Sarah-0.050
grand-0.050
Token small
Token ID1402
Feature activation+3.87e-01

Pos logit contributions
packing+0.014
grand+0.013
 baked+0.013
cakes+0.013
days+0.013
Neg logit contributions
 Mitch-0.030
 Flip-0.030
 fin-0.029
 Rory-0.029
 Richie-0.029
Token creature
Token ID7185
Feature activation+1.53e+00

Pos logit contributions
 Flip+0.015
 Mitch+0.015
 fin+0.014
 Richie+0.014
 Rory+0.014
Neg logit contributions
packing-0.008
grand-0.008
days-0.008
 brushes-0.008
cakes-0.008
Token called
Token ID1444
Feature activation+2.74e+00

Pos logit contributions
 Mitch+0.057
 Flip+0.057
 Richie+0.054
 Rory+0.054
Col+0.052
Neg logit contributions
grand-0.041
days-0.039
house-0.038
any-0.035
packing-0.035
Token Flip
Token ID29583
Feature activation+5.71e+00

Pos logit contributions
 shark+0.085
 Flip+0.084
 fin+0.081
 Mitch+0.080
 lobster+0.078
Neg logit contributions
cases-0.085
grand-0.084
packing-0.083
cakes-0.081
days-0.080
Token.
Token ID13
Feature activation+9.36e-01

Pos logit contributions
 Flip+0.227
 Richie+0.219
 Mitch+0.218
 shark+0.218
 Rory+0.211
Neg logit contributions
grand-0.133
days-0.120
any-0.116
packs-0.115
ies-0.111
Token Flip
Token ID29583
Feature activation+3.00e+00

Pos logit contributions
 crabs+0.027
 pearl+0.025
ascal+0.024
Tony+0.024
 conclusions+0.023
Neg logit contributions
 This-0.025
 fits-0.023
 job-0.023
 Kim-0.023
 These-0.022
Token was
Token ID373
Feature activation+4.06e-01

Pos logit contributions
 Flip+0.127
 shark+0.126
 Mitch+0.123
 fin+0.121
 Rory+0.118
Neg logit contributions
grand-0.062
packing-0.058
days-0.057
any-0.056
 baked-0.056
Token a
Token ID257
Feature activation-1.92e+00

Pos logit contributions
 shark+0.013
 seaw+0.012
 conclusions+0.012
Tony+0.012
 Flip+0.012
Neg logit contributions
 Kim-0.012
 Sarah-0.011
ics-0.011
 baked-0.011
 Susan-0.011
Token friendly
Token ID8030
Feature activation+1.15e+00

Pos logit contributions
 job+0.047
packing+0.045
grand+0.043
 baked+0.041
days+0.040
Neg logit contributions
 Mitch-0.065
 Flip-0.064
 crabs-0.064
 goals-0.062
izards-0.062
Token they
Token ID484
Feature activation+1.35e+00

Pos logit contributions
 Flip+0.057
 shark+0.057
 Mitch+0.054
 fin+0.054
 lobster+0.051
Neg logit contributions
cases-0.078
grand-0.076
packing-0.076
cakes-0.076
days-0.074
Token needed
Token ID2622
Feature activation+1.35e+00

Pos logit contributions
 shark+0.043
 Flip+0.042
 fin+0.041
 Mitch+0.041
 Finn+0.039
Neg logit contributions
cases-0.041
grand-0.041
packing-0.041
cakes-0.040
ics-0.039
Token to
Token ID284
Feature activation-4.98e-01

Pos logit contributions
 Flip+0.052
 shark+0.052
 Mitch+0.050
 fin+0.049
 lobster+0.049
Neg logit contributions
grand-0.032
cases-0.032
cakes-0.031
packing-0.030
 brushes-0.030
Token bury
Token ID30033
Feature activation+3.07e+00

Pos logit contributions
cases+0.014
packing+0.013
cakes+0.013
grand+0.013
any+0.012
Neg logit contributions
 shark-0.018
 Flip-0.018
 fin-0.017
 Mitch-0.017
 Finn-0.017
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
Tony+0.101
 Flip+0.101
 lobster+0.098
 Mitch+0.097
 Richie+0.097
Neg logit contributions
packs-0.080
 Kim-0.079
 Sarah-0.075
 brushes-0.075
grand-0.075
Token ham
Token ID8891
Feature activation+5.70e+00

Pos logit contributions
 Flip+0.037
Tony+0.036
 Richie+0.035
 Mitch+0.035
 Finn+0.035
Neg logit contributions
grand-0.025
 baked-0.025
 picture-0.023
 job-0.022
 toys-0.022
Tokenster
Token ID1706
Feature activation+2.24e+00

Pos logit contributions
 Flip+0.158
 shark+0.155
Tony+0.152
 Finn+0.150
Nice+0.149
Neg logit contributions
cases-0.188
cakes-0.182
packs-0.180
grand-0.178
 baked-0.177
Token in
Token ID287
Feature activation+1.80e+00

Pos logit contributions
 Flip+0.059
 shark+0.059
 Mitch+0.057
 fin+0.056
 lobster+0.053
Neg logit contributions
cases-0.081
grand-0.079
packing-0.079
cakes-0.079
days-0.077
Token the
Token ID262
Feature activation+1.10e+00

Pos logit contributions
 Flip+0.055
 shark+0.055
 Mitch+0.053
 lobster+0.052
 fin+0.052
Neg logit contributions
grand-0.055
packing-0.054
packs-0.054
days-0.054
cases-0.054
Token backyard
Token ID24296
Feature activation+4.50e-01

Pos logit contributions
 Flip+0.044
 shark+0.042
 Mitch+0.041
 Richie+0.041
 Finn+0.040
Neg logit contributions
grand-0.026
 baked-0.025
cakes-0.025
 brushes-0.024
cases-0.023
Token.
Token ID13
Feature activation+4.40e-02

Pos logit contributions
 shark+0.009
 Flip+0.009
 Mitch+0.008
 fin+0.008
 Finn+0.008
Neg logit contributions
cases-0.019
packing-0.019
grand-0.019
cakes-0.018
any-0.018
Token where
Token ID810
Feature activation-7.32e-01

Pos logit contributions
 Flip+0.085
 shark+0.083
 Mitch+0.082
 Otto+0.081
 fin+0.079
Neg logit contributions
cases-0.076
cakes-0.074
packing-0.072
grand-0.072
days-0.069
Token he
Token ID339
Feature activation+1.87e+00

Pos logit contributions
packing+0.029
cases+0.028
grand+0.028
days+0.028
cakes+0.028
Neg logit contributions
 shark-0.018
 Flip-0.017
 Mitch-0.016
 Finn-0.016
 crabs-0.016
Token stood
Token ID6204
Feature activation+8.15e-01

Pos logit contributions
 Flip+0.067
 shark+0.067
 Mitch+0.064
 fin+0.064
 lobster+0.062
Neg logit contributions
cases-0.051
packing-0.049
grand-0.049
cakes-0.048
days-0.047
Token on
Token ID319
Feature activation+1.06e+00

Pos logit contributions
 shark+0.032
 Flip+0.032
 Finn+0.032
 lobster+0.032
 crabs+0.031
Neg logit contributions
 job-0.014
cases-0.013
z-0.013
ics-0.013
 Sarah-0.013
Token his
Token ID465
Feature activation+1.12e+00

Pos logit contributions
 Flip+0.032
 shark+0.032
 Mitch+0.031
 fin+0.031
 lobster+0.030
Neg logit contributions
cases-0.033
cakes-0.031
 Kim-0.031
grand-0.031
packing-0.031
Token hind
Token ID16222
Feature activation+5.69e+00

Pos logit contributions
 Flip+0.040
 Mitch+0.039
 Rory+0.038
 Richie+0.038
 Otto+0.037
Neg logit contributions
cases-0.028
 job-0.026
grand-0.026
EEP-0.026
cakes-0.025
Token legs
Token ID7405
Feature activation+2.95e+00

Pos logit contributions
 Flip+0.198
 shark+0.194
 Mitch+0.186
 Otto+0.184
 Rory+0.183
Neg logit contributions
cases-0.154
grand-0.149
 brushes-0.146
ics-0.145
packing-0.145
Token and
Token ID290
Feature activation+7.01e-01

Pos logit contributions
 shark+0.087
 Flip+0.086
 Mitch+0.083
 fin+0.079
 Finn+0.079
Neg logit contributions
cases-0.102
cakes-0.094
ics-0.094
any-0.093
packing-0.093
Token spun
Token ID26843
Feature activation-8.71e-02

Pos logit contributions
 shark+0.022
 Flip+0.022
 fin+0.022
 Mitch+0.022
 Finn+0.021
Neg logit contributions
cases-0.022
grand-0.022
cakes-0.022
packing-0.022
days-0.021
Token around
Token ID1088
Feature activation-4.62e-01

Pos logit contributions
z+0.002
cases+0.002
 Sarah+0.002
bows+0.002
 Kim+0.002
Neg logit contributions
 shark-0.003
 Flip-0.003
 Mitch-0.003
ascal-0.003
 lobster-0.003
Token in
Token ID287
Feature activation+1.46e+00

Pos logit contributions
 Sarah+0.013
 Kim+0.012
days+0.012
cakes+0.012
z+0.012
Neg logit contributions
 Mitch-0.015
 Flip-0.015
 shark-0.015
 fin-0.015
olo-0.014
Token borrowed
Token ID22546
Feature activation+1.09e+00

Pos logit contributions
 shark+0.046
 Flip+0.046
 Mitch+0.045
 fin+0.044
 Rory+0.043
Neg logit contributions
packing-0.024
cases-0.024
grand-0.024
cakes-0.023
any-0.023
Token her
Token ID607
Feature activation+6.77e-01

Pos logit contributions
 shark+0.037
 fin+0.035
 Flip+0.034
 Mitch+0.034
 lobster+0.033
Neg logit contributions
cakes-0.029
 Sarah-0.029
 brushes-0.029
 Kim-0.029
 Susan-0.029
Token mom
Token ID1995
Feature activation-6.04e-01

Pos logit contributions
 Flip+0.021
 shark+0.021
 Mitch+0.020
 fin+0.020
 lobster+0.019
Neg logit contributions
cases-0.022
grand-0.021
packing-0.021
cakes-0.021
days-0.021
Token's
Token ID338
Feature activation+1.10e+00

Pos logit contributions
cases+0.015
packing+0.014
grand+0.014
cakes+0.014
days+0.014
Neg logit contributions
 Flip-0.023
 shark-0.023
 Mitch-0.022
 fin-0.022
 lobster-0.021
Token magn
Token ID7842
Feature activation+2.75e+00

Pos logit contributions
 Flip+0.037
 shark+0.037
 Mitch+0.036
 fin+0.034
 Finn+0.034
Neg logit contributions
cases-0.031
grand-0.031
packing-0.031
cakes-0.030
 brushes-0.030
Tokenifying
Token ID4035
Feature activation+5.69e+00

Pos logit contributions
 shark+0.079
 Flip+0.078
 Mitch+0.074
 Finn+0.072
 crabs+0.072
Neg logit contributions
cases-0.098
packing-0.091
z-0.090
cakes-0.088
packs-0.088
Token glass
Token ID5405
Feature activation+1.78e+00

Pos logit contributions
 Flip+0.210
 shark+0.207
 Mitch+0.202
 fin+0.198
 crabs+0.197
Neg logit contributions
cakes-0.139
grand-0.136
cases-0.134
packing-0.132
 brushes-0.130
Token to
Token ID284
Feature activation-4.04e-01

Pos logit contributions
 Flip+0.059
 shark+0.058
 Mitch+0.057
 fin+0.054
 Rory+0.054
Neg logit contributions
cakes-0.053
cases-0.052
 brushes-0.050
packing-0.049
grand-0.049
Token take
Token ID1011
Feature activation+1.10e+00

Pos logit contributions
grand+0.010
packing+0.010
cases+0.010
cakes+0.010
 brushes+0.010
Neg logit contributions
 shark-0.015
 Flip-0.014
 fin-0.014
 Mitch-0.014
 lobster-0.013
Token a
Token ID257
Feature activation+2.71e-01

Pos logit contributions
 shark+0.037
 Flip+0.037
 Mitch+0.035
 fin+0.035
 lobster+0.034
Neg logit contributions
cases-0.032
grand-0.032
packing-0.032
cakes-0.031
days-0.030
Token closer
Token ID5699
Feature activation+1.15e+00

Pos logit contributions
 Flip+0.009
 shark+0.009
 Mitch+0.008
 fin+0.008
 Finn+0.008
Neg logit contributions
cases-0.008
packing-0.008
grand-0.008
cakes-0.008
days-0.008
Token happy
Token ID3772
Feature activation-1.25e+00

Pos logit contributions
cases+0.014
grand+0.014
cakes+0.014
packing+0.013
days+0.013
Neg logit contributions
 Flip-0.010
 shark-0.010
 Mitch-0.010
 fin-0.010
 Rory-0.009
Token with
Token ID351
Feature activation+2.98e-01

Pos logit contributions
packing+0.032
grand+0.032
cakes+0.032
cases+0.031
days+0.031
Neg logit contributions
 Flip-0.044
 Mitch-0.043
 shark-0.043
 fin-0.042
 Rory-0.041
Token her
Token ID607
Feature activation-3.52e-01

Pos logit contributions
 shark+0.013
 fin+0.013
 Flip+0.013
 lobster+0.013
 Mitch+0.013
Neg logit contributions
 Kim-0.006
 Sarah-0.005
 Susan-0.005
 brushes-0.005
packing-0.005
Token new
Token ID649
Feature activation+1.70e+00

Pos logit contributions
packing+0.008
grand+0.008
cases+0.008
 brushes+0.008
cakes+0.008
Neg logit contributions
 Flip-0.013
 Mitch-0.013
 shark-0.013
 Rory-0.013
 fin-0.012
Token pet
Token ID4273
Feature activation+1.97e+00

Pos logit contributions
 shark+0.062
 Flip+0.062
 fin+0.060
 Mitch+0.060
 lobster+0.058
Neg logit contributions
cases-0.045
packing-0.044
cakes-0.043
grand-0.043
days-0.041
Token ham
Token ID8891
Feature activation+5.69e+00

Pos logit contributions
 Flip+0.059
 shark+0.059
 Mitch+0.058
 fin+0.058
 Richie+0.054
Neg logit contributions
cakes-0.062
cases-0.062
grand-0.060
packing-0.059
bows-0.058
Tokenster
Token ID1706
Feature activation+2.20e+00

Pos logit contributions
 Flip+0.173
 shark+0.170
 Mitch+0.166
 Finn+0.162
 Richie+0.160
Neg logit contributions
cases-0.175
packs-0.169
cakes-0.168
ics-0.168
bows-0.165
Token and
Token ID290
Feature activation-4.18e-01

Pos logit contributions
 Flip+0.068
 Mitch+0.067
 shark+0.066
 Finn+0.063
 Richie+0.062
Neg logit contributions
grand-0.067
cases-0.067
cakes-0.065
packing-0.064
any-0.064
Token loved
Token ID6151
Feature activation+1.61e+00

Pos logit contributions
cases+0.014
packing+0.013
cakes+0.013
days+0.013
grand+0.012
Neg logit contributions
 shark-0.013
 Flip-0.013
 Mitch-0.013
 fin-0.013
 crabs-0.012
Token him
Token ID683
Feature activation+1.48e+00

Pos logit contributions
 shark+0.058
 fin+0.057
 Flip+0.057
 Mitch+0.057
 Richie+0.053
Neg logit contributions
grand-0.039
cakes-0.037
ics-0.037
 brushes-0.037
 Kim-0.037
Token very
Token ID845
Feature activation+2.13e+00

Pos logit contributions
 shark+0.045
 Mitch+0.045
 Flip+0.044
 Shark+0.042
 fin+0.042
Neg logit contributions
days-0.048
ics-0.044
grand-0.044
packing-0.044
cases-0.044
Token,
Token ID11
Feature activation+8.12e-01

Pos logit contributions
cases+0.023
grand+0.022
packing+0.022
cakes+0.022
any+0.021
Neg logit contributions
 Flip-0.014
 shark-0.014
 Mitch-0.014
 fin-0.013
 Rory-0.013
Token she
Token ID673
Feature activation+1.20e+00

Pos logit contributions
 Flip+0.030
 shark+0.029
 Mitch+0.028
 fin+0.028
 Finn+0.027
Neg logit contributions
cases-0.022
packing-0.021
grand-0.021
cakes-0.021
days-0.020
Token would
Token ID561
Feature activation+1.35e+00

Pos logit contributions
 Flip+0.041
 shark+0.041
 Mitch+0.040
 fin+0.039
 lobster+0.038
Neg logit contributions
cases-0.034
cakes-0.034
packing-0.033
grand-0.033
ics-0.032
Token say
Token ID910
Feature activation+1.33e+00

Pos logit contributions
 Flip+0.050
 shark+0.050
 Mitch+0.048
 fin+0.048
 lobster+0.046
Neg logit contributions
cases-0.035
cakes-0.034
grand-0.034
packing-0.034
days-0.032
Token "
Token ID366
Feature activation+1.49e+00

Pos logit contributions
 shark+0.032
 Flip+0.032
 Mitch+0.030
 fin+0.029
 Rory+0.028
Neg logit contributions
cases-0.051
packing-0.051
grand-0.051
any-0.050
days-0.050
TokenPlease
Token ID5492
Feature activation+5.67e+00

Pos logit contributions
 Flip+0.045
 shark+0.041
 Mitch+0.041
 fin+0.040
 Rory+0.040
Neg logit contributions
cases-0.049
cakes-0.046
ics-0.045
grand-0.045
packing-0.045
Token motor
Token ID5584
Feature activation+1.43e+00

Pos logit contributions
 Flip+0.193
 shark+0.192
 Mitch+0.185
 fin+0.184
 Finn+0.177
Neg logit contributions
cases-0.164
grand-0.159
packing-0.159
cakes-0.158
days-0.152
Token,
Token ID11
Feature activation+1.97e+00

Pos logit contributions
 Flip+0.043
 shark+0.043
 Mitch+0.040
 fin+0.040
 Finn+0.039
Neg logit contributions
cases-0.047
grand-0.046
cakes-0.046
packing-0.046
packs-0.044
Token will
Token ID481
Feature activation+1.55e+00

Pos logit contributions
 shark+0.055
 Flip+0.055
 Mitch+0.052
 fin+0.052
 Finn+0.049
Neg logit contributions
cases-0.070
cakes-0.068
grand-0.067
packing-0.067
ics-0.065
Token you
Token ID345
Feature activation-4.74e-02

Pos logit contributions
 Flip+0.039
 shark+0.038
 Mitch+0.036
 Rocky+0.035
 Rory+0.035
Neg logit contributions
cases-0.061
packs-0.060
cakes-0.059
packing-0.058
ugs-0.058
Token take
Token ID1011
Feature activation+4.99e-01

Pos logit contributions
cases+0.001
cakes+0.001
packing+0.001
grand+0.001
ics+0.001
Neg logit contributions
 Flip-0.002
 shark-0.002
 fin-0.001
 Mitch-0.001
 lobster-0.001
Token be
Token ID307
Feature activation-1.02e+00

Pos logit contributions
 shark+0.037
 Flip+0.037
 fin+0.036
 Mitch+0.036
 lobster+0.034
Neg logit contributions
cases-0.033
cakes-0.032
grand-0.031
packing-0.031
ics-0.030
Token happy
Token ID3772
Feature activation-1.72e+00

Pos logit contributions
cases+0.038
grand+0.036
cakes+0.036
packing+0.034
EEP+0.034
Neg logit contributions
 Flip-0.029
 fin-0.028
 shark-0.028
 Mitch-0.027
 crabs-0.026
Token and
Token ID290
Feature activation-1.09e-01

Pos logit contributions
cases+0.058
cakes+0.057
days+0.056
grand+0.056
packing+0.056
Neg logit contributions
 Flip-0.049
 Mitch-0.047
 shark-0.047
 fin-0.046
 Rory-0.044
Token well
Token ID880
Feature activation+1.83e+00

Pos logit contributions
cases+0.004
ugs+0.004
cakes+0.004
packs+0.004
z+0.004
Neg logit contributions
 fin-0.003
 shark-0.003
 rather-0.003
 Mitch-0.003
 Flip-0.003
Token-
Token ID12
Feature activation-3.39e+00

Pos logit contributions
 Flip+0.042
 shark+0.042
 Mitch+0.040
 fin+0.040
 Finn+0.037
Neg logit contributions
cases-0.072
grand-0.071
packing-0.071
cakes-0.070
days-0.069
Tokenbeh
Token ID20709
Feature activation-6.63e+00

Pos logit contributions
packs+0.093
yt+0.084
 brushes+0.084
 coats+0.080
 porch+0.080
Neg logit contributions
 Flip-0.115
 Mitch-0.107
 lobster-0.105
 Sno-0.104
 Richie-0.104
Tokenaved
Token ID9586
Feature activation-2.67e+00

Pos logit contributions
cases+0.218
grand+0.215
cakes+0.210
ics+0.207
packing+0.202
Neg logit contributions
 Flip-0.191
 shark-0.189
 fin-0.187
 Mitch-0.181
 Finn-0.179
Token.
Token ID13
Feature activation-9.76e-01

Pos logit contributions
grand+0.108
cases+0.107
packing+0.107
cakes+0.105
 brushes+0.102
Neg logit contributions
 shark-0.059
 Flip-0.057
 Mitch-0.054
 fin-0.054
 Finn-0.051
Token The
Token ID383
Feature activation-1.20e+00

Pos logit contributions
grand+0.032
packing+0.031
cakes+0.030
days+0.030
cases+0.030
Neg logit contributions
 shark-0.030
 Flip-0.029
 fin-0.029
 Mitch-0.028
 crabs-0.028
Token end
Token ID886
Feature activation+1.45e-01

Pos logit contributions
cases+0.043
cakes+0.042
packing+0.042
grand+0.041
days+0.041
Neg logit contributions
 Flip-0.032
 Mitch-0.031
 shark-0.030
 fin-0.030
 Rory-0.029
Token.
Token ID13
Feature activation-9.76e-01

Pos logit contributions
 Flip+0.004
 shark+0.004
 Mitch+0.004
 fin+0.004
 crabs+0.004
Neg logit contributions
packing-0.005
grand-0.005
cases-0.005
cakes-0.005
days-0.005
Token became
Token ID2627
Feature activation-1.82e+00

Pos logit contributions
cases+0.131
cakes+0.125
grand+0.122
packing+0.119
ics+0.114
Neg logit contributions
 Flip-0.146
 shark-0.145
 Mitch-0.141
 Finn-0.139
 fin-0.138
Token friendly
Token ID8030
Feature activation+5.54e-01

Pos logit contributions
packing+0.042
ics+0.042
cakes+0.042
packs+0.042
 Kim+0.042
Neg logit contributions
 shark-0.067
 fin-0.066
 Mitch-0.065
 Flip-0.064
 lobster-0.062
Token and
Token ID290
Feature activation-8.27e-02

Pos logit contributions
 Flip+0.017
 Mitch+0.017
 fin+0.017
 shark+0.017
 lobster+0.016
Neg logit contributions
cases-0.017
cakes-0.016
packs-0.016
days-0.016
bows-0.016
Token well
Token ID880
Feature activation+2.07e+00

Pos logit contributions
cases+0.003
grand+0.003
packs+0.003
ics+0.003
ugs+0.003
Neg logit contributions
 Flip-0.003
 shark-0.003
 Finn-0.003
 Mitch-0.002
 Fish-0.002
Token-
Token ID12
Feature activation-3.53e+00

Pos logit contributions
 Flip+0.054
 shark+0.054
 Mitch+0.051
 fin+0.051
 Finn+0.048
Neg logit contributions
cases-0.076
grand-0.075
packing-0.075
cakes-0.074
days-0.072
Tokenbeh
Token ID20709
Feature activation-6.61e+00

Pos logit contributions
packs+0.089
 brushes+0.085
cases+0.083
packing+0.082
yt+0.078
Neg logit contributions
 Flip-0.130
 Mitch-0.121
 Finn-0.119
 shark-0.119
 Sno-0.118
Tokenaved
Token ID9586
Feature activation-2.61e+00

Pos logit contributions
cases+0.209
grand+0.203
cakes+0.201
packing+0.196
ics+0.195
Neg logit contributions
 Flip-0.203
 shark-0.201
 fin-0.196
 Mitch-0.193
 Finn-0.189
Token,
Token ID11
Feature activation-9.70e-01

Pos logit contributions
cases+0.098
grand+0.097
packing+0.097
cakes+0.095
days+0.092
Neg logit contributions
 shark-0.066
 Flip-0.066
 Mitch-0.062
 fin-0.062
 Finn-0.059
Token just
Token ID655
Feature activation+6.08e-01

Pos logit contributions
grand+0.037
cases+0.037
cakes+0.036
packing+0.036
days+0.035
Neg logit contributions
 Flip-0.025
 shark-0.024
 fin-0.023
 Mitch-0.022
 Finn-0.022
Token like
Token ID588
Feature activation+1.84e+00

Pos logit contributions
 Flip+0.010
 shark+0.009
 Mitch+0.009
 fin+0.009
 lobster+0.008
Neg logit contributions
cases-0.029
grand-0.028
packing-0.028
cakes-0.028
days-0.027
Token Buddy
Token ID36896
Feature activation+1.30e-01

Pos logit contributions
 Flip+0.055
 shark+0.055
 Mitch+0.053
 fin+0.053
 lobster+0.050
Neg logit contributions
cases-0.060
grand-0.059
packing-0.059
cakes-0.058
days-0.056
Token.
Token ID13
Feature activation-1.07e-01

Pos logit contributions
 Flip+0.034
 shark+0.034
 Mitch+0.032
 lobster+0.032
 fin+0.032
Neg logit contributions
cases-0.046
cakes-0.043
packs-0.042
packing-0.042
days-0.042
Token The
Token ID383
Feature activation-3.25e-01

Pos logit contributions
cases+0.003
 Kim+0.003
cakes+0.003
 Sarah+0.003
grand+0.003
Neg logit contributions
 crabs-0.004
 shark-0.004
 lobster-0.004
 fin-0.004
 Flip-0.003
Token driver
Token ID4639
Feature activation-1.73e+00

Pos logit contributions
packing+0.011
grand+0.010
cases+0.010
ics+0.010
days+0.010
Neg logit contributions
 shark-0.011
 fin-0.010
 Flip-0.009
 crabs-0.009
 sharks-0.009
Token and
Token ID290
Feature activation-4.76e-01

Pos logit contributions
grand+0.050
packing+0.048
ics+0.047
cakes+0.047
cases+0.046
Neg logit contributions
 shark-0.060
 Flip-0.058
 fin-0.057
 Mitch-0.056
 Shark-0.054
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
cases+0.020
grand+0.020
packing+0.019
cakes+0.019
days+0.019
Neg logit contributions
 shark-0.011
 Flip-0.011
 Mitch-0.010
 fin-0.010
 Finn-0.009
Token paramed
Token ID34581
Feature activation-6.58e+00

Pos logit contributions
packing+0.047
cases+0.043
ics+0.043
grand+0.043
cakes+0.042
Neg logit contributions
 shark-0.047
 fin-0.043
 crabs-0.040
 Flip-0.040
 sharks-0.039
Tokenic
Token ID291
Feature activation-3.90e+00

Pos logit contributions
 Susan+0.199
packs+0.197
 Mill+0.189
 Sarah+0.189
packing+0.187
Neg logit contributions
aw-0.173
ascal-0.170
der-0.162
ino-0.157
lish-0.153
Token jumped
Token ID11687
Feature activation-1.13e-02

Pos logit contributions
packing+0.120
cakes+0.118
 brushes+0.117
bows+0.116
 Susan+0.114
Neg logit contributions
 shark-0.123
 fin-0.116
 crabs-0.106
 Shark-0.104
 sharks-0.103
Token out
Token ID503
Feature activation-2.69e-01

Pos logit contributions
cases+0.000
cakes+0.000
packing+0.000
ics+0.000
grand+0.000
Neg logit contributions
 Flip-0.000
 shark-0.000
 Mitch-0.000
 fin-0.000
 lobster-0.000
Token.
Token ID13
Feature activation-3.54e-01

Pos logit contributions
packing+0.010
grand+0.010
cases+0.010
cakes+0.009
ics+0.009
Neg logit contributions
 shark-0.007
 Flip-0.007
 fin-0.007
 Mitch-0.007
 Finn-0.007
Token They
Token ID1119
Feature activation+4.14e-01

Pos logit contributions
cases+0.010
cakes+0.010
 Kim+0.009
grand+0.009
 Sarah+0.009
Neg logit contributions
 crabs-0.012
 lobster-0.011
 fin-0.011
 shark-0.011
 Flip-0.011
Token,
Token ID11
Feature activation-9.83e-02

Pos logit contributions
 shark+0.000
 Flip+0.000
 fin+0.000
 Mitch+0.000
 Finn+0.000
Neg logit contributions
cases-0.000
packing-0.000
cakes-0.000
grand-0.000
 brushes-0.000
Token but
Token ID475
Feature activation+1.13e+00

Pos logit contributions
cases+0.004
grand+0.004
packing+0.004
cakes+0.003
days+0.003
Neg logit contributions
 Flip-0.003
 shark-0.003
 Mitch-0.002
 fin-0.002
 lobster-0.002
Token it
Token ID340
Feature activation-1.29e+00

Pos logit contributions
 Flip+0.029
 shark+0.028
 Mitch+0.027
 fin+0.027
 lobster+0.027
Neg logit contributions
grand-0.040
cases-0.040
packing-0.040
days-0.039
 Kim-0.038
Token was
Token ID373
Feature activation-6.98e-01

Pos logit contributions
cases+0.039
cakes+0.038
grand+0.038
packing+0.038
packs+0.037
Neg logit contributions
 shark-0.044
 Flip-0.043
 Mitch-0.042
 fin-0.041
 Finn-0.040
Token all
Token ID477
Feature activation-4.14e+00

Pos logit contributions
packing+0.023
cases+0.023
grand+0.023
days+0.023
any+0.023
Neg logit contributions
 shark-0.020
 Flip-0.019
 Mitch-0.019
 fin-0.018
 lobster-0.018
Token fore
Token ID1674
Feature activation-6.46e+00

Pos logit contributions
cases+0.157
packing+0.148
cakes+0.147
grand+0.147
ugs+0.145
Neg logit contributions
 shark-0.112
 Flip-0.103
 fin-0.102
 rather-0.097
 Finn-0.096
Tokenshadow
Token ID19106
Feature activation-3.04e+00

Pos logit contributions
cakes+0.176
bows+0.173
 brushes+0.169
 baked+0.153
grand+0.150
Neg logit contributions
 fin-0.221
 rather-0.217
per-0.216
 Flip-0.208
He-0.207
Tokened
Token ID276
Feature activation-9.58e-01

Pos logit contributions
grand+0.091
cases+0.089
packing+0.088
days+0.087
cakes+0.085
Neg logit contributions
 Flip-0.101
 Mitch-0.097
 shark-0.097
 fin-0.093
 Finn-0.092
Token by
Token ID416
Feature activation+1.44e+00

Pos logit contributions
cases+0.033
packing+0.032
grand+0.032
cakes+0.032
days+0.031
Neg logit contributions
 Flip-0.027
 shark-0.027
 Mitch-0.026
 fin-0.026
 lobster-0.025
Token the
Token ID262
Feature activation+4.84e-01

Pos logit contributions
 Flip+0.040
 shark+0.040
 Mitch+0.038
 fin+0.038
 lobster+0.036
Neg logit contributions
cases-0.049
grand-0.048
packing-0.048
cakes-0.047
days-0.047
Token kind
Token ID1611
Feature activation+1.88e+00

Pos logit contributions
 Flip+0.017
 Mitch+0.016
 shark+0.016
 fin+0.015
 Finn+0.015
Neg logit contributions
grand-0.013
cases-0.013
packing-0.013
cakes-0.013
days-0.013
Token and
Token ID290
Feature activation+2.24e-01

Pos logit contributions
 Flip+0.017
 Mitch+0.017
 shark+0.016
 fin+0.016
 lobster+0.016
Neg logit contributions
cases-0.017
cakes-0.017
days-0.016
ics-0.016
packing-0.016
Token dirty
Token ID11841
Feature activation-3.41e-01

Pos logit contributions
 shark+0.007
 Flip+0.007
 fin+0.007
 Finn+0.007
 crabs+0.006
Neg logit contributions
cases-0.008
grand-0.008
packing-0.007
cakes-0.007
ics-0.007
Token.
Token ID13
Feature activation-5.92e-01

Pos logit contributions
cases+0.011
cakes+0.011
packing+0.011
grand+0.011
days+0.011
Neg logit contributions
 Flip-0.010
 shark-0.010
 Mitch-0.010
 fin-0.010
 lobster-0.009
Token And
Token ID843
Feature activation-2.48e-01

Pos logit contributions
grand+0.014
cases+0.014
packing+0.014
cakes+0.014
days+0.014
Neg logit contributions
 shark-0.022
 Flip-0.022
 Mitch-0.022
 fin-0.021
 lobster-0.021
Token maybe
Token ID3863
Feature activation-1.22e-01

Pos logit contributions
cases+0.008
grand+0.007
cakes+0.007
packing+0.007
any+0.007
Neg logit contributions
 shark-0.009
 Flip-0.008
 Mitch-0.008
 fin-0.008
 Finn-0.008
Token there
Token ID612
Feature activation-6.45e+00

Pos logit contributions
cases+0.003
packing+0.003
grand+0.003
cakes+0.003
any+0.003
Neg logit contributions
 shark-0.005
 Flip-0.005
 Mitch-0.005
 fin-0.004
 Finn-0.004
Token is
Token ID318
Feature activation-1.43e-01

Pos logit contributions
packing+0.253
any+0.244
cases+0.235
grand+0.235
cakes+0.229
Neg logit contributions
 shark-0.162
 Flip-0.150
 crabs-0.147
 sharks-0.147
 Mitch-0.144
Token something
Token ID1223
Feature activation-2.02e-01

Pos logit contributions
cases+0.005
grand+0.005
packing+0.005
cakes+0.005
days+0.005
Neg logit contributions
 Flip-0.004
 shark-0.004
 Mitch-0.003
 fin-0.003
 lobster-0.003
Token in
Token ID287
Feature activation-4.57e-01

Pos logit contributions
cases+0.007
cakes+0.007
grand+0.007
packing+0.007
days+0.006
Neg logit contributions
 Flip-0.006
 shark-0.006
 Mitch-0.006
 fin-0.006
 lobster-0.005
Token the
Token ID262
Feature activation-3.59e-01

Pos logit contributions
packing+0.014
grand+0.014
cases+0.014
packs+0.014
days+0.014
Neg logit contributions
 Flip-0.014
 shark-0.014
 Mitch-0.013
 fin-0.013
 lobster-0.013
Token p
Token ID279
Feature activation-1.71e+00

Pos logit contributions
days+0.008
cakes+0.008
 brushes+0.008
 stairs+0.007
 baked+0.007
Neg logit contributions
 Flip-0.014
 Mitch-0.014
 Richie-0.013
 shark-0.013
 Rory-0.013
Token the
Token ID262
Feature activation+1.57e-01

Pos logit contributions
packs+0.069
cases+0.067
 Kim+0.066
 brushes+0.066
 Sarah+0.065
Neg logit contributions
 Flip-0.078
 shark-0.075
 fin-0.074
 Mitch-0.074
 lobster-0.073
Token hills
Token ID18639
Feature activation-9.73e-01

Pos logit contributions
 Flip+0.005
 shark+0.005
 Mitch+0.005
 fin+0.005
 Richie+0.005
Neg logit contributions
cases-0.005
 brushes-0.005
cakes-0.005
any-0.005
bows-0.004
Token.
Token ID13
Feature activation-2.00e-01

Pos logit contributions
cases+0.037
cakes+0.035
packing+0.035
grand+0.035
ics+0.034
Neg logit contributions
 shark-0.025
 Flip-0.025
 Mitch-0.024
 fin-0.024
 lobster-0.023
Token Everywhere
Token ID48201
Feature activation-9.11e-01

Pos logit contributions
cakes+0.005
cases+0.005
 Kim+0.005
EEP+0.005
 Sarah+0.005
Neg logit contributions
 shark-0.007
 fin-0.006
 pearl-0.006
 sharks-0.006
olo-0.006
Token P
Token ID350
Feature activation-2.40e+00

Pos logit contributions
 Kim+0.026
 Sarah+0.026
grand+0.026
cases+0.026
packing+0.026
Neg logit contributions
 fin-0.027
 shark-0.026
 Flip-0.025
 lobster-0.025
ascal-0.025
Tokenadd
Token ID2860
Feature activation-6.43e+00

Pos logit contributions
cases+0.078
cakes+0.075
packing+0.075
grand+0.074
ics+0.074
Neg logit contributions
 shark-0.072
 Flip-0.071
 fin-0.069
 Mitch-0.069
 lobster-0.066
Tokenington
Token ID9557
Feature activation+1.75e-01

Pos logit contributions
grand+0.250
packing+0.249
cakes+0.235
cases+0.235
EEP+0.233
Neg logit contributions
 shark-0.167
 Finn-0.164
 crabs-0.156
 Flip-0.156
 Fish-0.151
Token went
Token ID1816
Feature activation-2.11e+00

Pos logit contributions
 shark+0.004
 Mitch+0.004
 crabs+0.004
 Flip+0.004
 fin+0.004
Neg logit contributions
cakes-0.007
bows-0.006
packing-0.006
grand-0.006
EEP-0.006
Token,
Token ID11
Feature activation+1.24e-01

Pos logit contributions
cases+0.071
packing+0.068
cakes+0.068
grand+0.067
ics+0.066
Neg logit contributions
 shark-0.061
 Flip-0.061
 fin-0.060
 Mitch-0.059
 lobster-0.056
Token he
Token ID339
Feature activation+1.38e+00

Pos logit contributions
 shark+0.003
 Flip+0.003
 fin+0.003
 Mitch+0.003
 lobster+0.003
Neg logit contributions
cases-0.005
packing-0.005
grand-0.005
cakes-0.005
days-0.004
Token found
Token ID1043
Feature activation-9.48e-01

Pos logit contributions
 Flip+0.047
 shark+0.046
 Mitch+0.045
 fin+0.045
 Finn+0.043
Neg logit contributions
cases-0.040
grand-0.040
cakes-0.040
packing-0.039
days-0.037
Token grass
Token ID8701
Feature activation-4.78e-01

Pos logit contributions
packing+0.062
grand+0.062
cases+0.058
days+0.057
ital+0.056
Neg logit contributions
 Flip-0.053
 shark-0.052
 Mitch-0.050
 fin-0.048
 Finn-0.048
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
grand+0.019
packing+0.018
cases+0.018
cakes+0.018
days+0.018
Neg logit contributions
 Flip-0.011
 shark-0.011
 fin-0.011
 Mitch-0.011
 Finn-0.010
Token rye
Token ID47553
Feature activation+7.57e-01

Pos logit contributions
cases+0.045
packing+0.044
grand+0.044
cakes+0.043
days+0.042
Neg logit contributions
 Flip-0.038
 shark-0.038
 Mitch-0.036
 fin-0.036
 crabs-0.035
Token grass
Token ID8701
Feature activation-1.57e+00

Pos logit contributions
 Flip+0.021
 shark+0.021
 Mitch+0.021
 fin+0.020
 lobster+0.019
Neg logit contributions
cases-0.026
packing-0.025
grand-0.025
cakes-0.025
days-0.024
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
cases+0.061
packing+0.060
grand+0.060
cakes+0.059
days+0.057
Neg logit contributions
 shark-0.037
 Flip-0.037
 Mitch-0.035
 fin-0.035
 Finn-0.033
Token al
Token ID435
Feature activation-6.38e+00

Pos logit contributions
cases+0.061
packing+0.059
grand+0.058
cakes+0.058
days+0.056
Neg logit contributions
 Flip-0.051
 shark-0.051
 fin-0.049
 Mitch-0.049
 crabs-0.047
Tokenf
Token ID69
Feature activation-3.68e+00

Pos logit contributions
cases+0.190
grand+0.188
cakes+0.185
packing+0.184
 Kim+0.183
Neg logit contributions
 Flip-0.180
 crabs-0.174
 Mitch-0.172
 fin-0.171
 lobster-0.170
Tokenalf
Token ID1604
Feature activation-7.00e-01

Pos logit contributions
packing+0.110
grand+0.109
cases+0.108
cakes+0.105
days+0.104
Neg logit contributions
 Flip-0.119
 shark-0.116
 Mitch-0.115
 fin-0.112
 Rory-0.110
Tokena
Token ID64
Feature activation+3.06e-01

Pos logit contributions
cases+0.024
packing+0.024
cakes+0.023
grand+0.023
days+0.022
Neg logit contributions
 Flip-0.020
 shark-0.020
 Mitch-0.019
 fin-0.019
 Rory-0.018
Token and
Token ID290
Feature activation-5.01e-01

Pos logit contributions
 shark+0.008
 Flip+0.008
 Mitch+0.008
 fin+0.008
 lobster+0.007
Neg logit contributions
cases-0.011
packing-0.011
grand-0.011
cakes-0.011
days-0.010
Token cl
Token ID537
Feature activation-1.31e+00

Pos logit contributions
cases+0.018
packing+0.017
cakes+0.017
grand+0.017
days+0.016
Neg logit contributions
 Flip-0.015
 shark-0.015
 Mitch-0.014
 fin-0.014
 crabs-0.013
Token together
Token ID1978
Feature activation-2.16e+00

Pos logit contributions
cases+0.037
grand+0.036
packing+0.036
cakes+0.035
days+0.034
Neg logit contributions
 Flip-0.060
 shark-0.060
 Mitch-0.058
 fin-0.058
 Finn-0.056
Token happily
Token ID18177
Feature activation+1.65e-01

Pos logit contributions
packing+0.062
ics+0.061
cases+0.060
cakes+0.060
days+0.059
Neg logit contributions
 shark-0.070
 Flip-0.069
 Mitch-0.069
 fin-0.068
 Otto-0.066
Token and
Token ID290
Feature activation-4.97e-01

Pos logit contributions
 Flip+0.004
 shark+0.004
 fin+0.004
 Mitch+0.004
 lobster+0.004
Neg logit contributions
cases-0.006
grand-0.006
cakes-0.006
packing-0.006
 brushes-0.006
Token never
Token ID1239
Feature activation-1.58e+00

Pos logit contributions
cases+0.015
cakes+0.014
grand+0.014
packing+0.014
ics+0.014
Neg logit contributions
 Flip-0.017
 shark-0.017
 Mitch-0.016
 fin-0.016
 Finn-0.016
Token qu
Token ID627
Feature activation-1.89e+00

Pos logit contributions
cases+0.043
grand+0.041
cakes+0.041
packing+0.040
ics+0.040
Neg logit contributions
 Flip-0.058
 shark-0.057
 fin-0.055
 Mitch-0.055
 lobster-0.054
Tokenarre
Token ID9624
Feature activation-6.35e+00

Pos logit contributions
grand+0.038
cases+0.037
packing+0.035
cakes+0.034
days+0.033
Neg logit contributions
 Flip-0.081
 shark-0.080
 fin-0.079
 Mitch-0.078
 lobster-0.077
Tokenlled
Token ID3353
Feature activation-5.01e+00

Pos logit contributions
grand+0.245
cakes+0.223
 Kim+0.222
cases+0.216
packing+0.215
Neg logit contributions
 shark-0.161
 Flip-0.156
 Richie-0.144
 fin-0.142
 crabs-0.141
Token again
Token ID757
Feature activation+3.42e-01

Pos logit contributions
cases+0.170
cakes+0.164
packs+0.159
 Sarah+0.158
bows+0.158
Neg logit contributions
 Flip-0.137
 fin-0.136
 Mitch-0.135
 shark-0.129
 lobster-0.125
Token.
Token ID13
Feature activation-6.30e-01

Pos logit contributions
 Flip+0.008
 shark+0.008
 Mitch+0.008
 fin+0.008
 lobster+0.007
Neg logit contributions
cases-0.013
grand-0.013
packing-0.013
cakes-0.013
days-0.012
Token<|endoftext|>
Token ID50256
Feature activation-1.01e+00

Pos logit contributions
cases+0.024
packing+0.023
grand+0.023
cakes+0.023
days+0.022
Neg logit contributions
 Flip-0.016
 shark-0.016
 Mitch-0.015
 fin-0.015
 Finn-0.015
TokenOnce
Token ID7454
Feature activation-6.06e-01

Pos logit contributions
cakes+0.028
ics+0.028
cases+0.028
days+0.028
grand+0.028
Neg logit contributions
 shark-0.033
 fin-0.033
 Flip-0.033
 crabs-0.032
 Mitch-0.032
Token hands
Token ID2832
Feature activation+2.51e-01

Pos logit contributions
 Flip+0.089
 shark+0.089
 Mitch+0.086
 fin+0.085
 Finn+0.082
Neg logit contributions
cases-0.066
grand-0.064
packing-0.064
cakes-0.063
days-0.061
Token and
Token ID290
Feature activation+3.19e-01

Pos logit contributions
 Flip+0.008
 shark+0.008
 Mitch+0.007
 fin+0.007
 Finn+0.007
Neg logit contributions
cases-0.008
grand-0.008
packing-0.008
cakes-0.008
days-0.008
Token they
Token ID484
Feature activation-8.76e-01

Pos logit contributions
 shark+0.010
 Flip+0.010
 fin+0.010
 Finn+0.010
 crabs+0.009
Neg logit contributions
cases-0.011
grand-0.011
cakes-0.010
ugs-0.010
packing-0.010
Token never
Token ID1239
Feature activation-1.53e+00

Pos logit contributions
cases+0.026
grand+0.026
cakes+0.026
packing+0.025
days+0.024
Neg logit contributions
 Flip-0.029
 shark-0.029
 fin-0.028
 Mitch-0.028
 Finn-0.027
Token qu
Token ID627
Feature activation-2.04e+00

Pos logit contributions
cases+0.044
cakes+0.043
grand+0.043
ics+0.041
ugs+0.040
Neg logit contributions
 shark-0.055
 Flip-0.053
 fin-0.052
 crabs-0.052
 Finn-0.052
Tokenarre
Token ID9624
Feature activation-6.34e+00

Pos logit contributions
cases+0.033
packing+0.031
grand+0.031
cakes+0.030
days+0.028
Neg logit contributions
 Flip-0.096
 shark-0.096
 Mitch-0.093
 fin-0.093
 lobster-0.090
Tokenled
Token ID992
Feature activation-2.01e+00

Pos logit contributions
grand+0.260
 Kim+0.239
cakes+0.232
 brushes+0.230
packing+0.228
Neg logit contributions
 shark-0.159
 Flip-0.141
 Richie-0.135
 fin-0.135
 crabs-0.135
Token again
Token ID757
Feature activation+3.54e-01

Pos logit contributions
cases+0.070
packing+0.068
days+0.068
cakes+0.068
packs+0.067
Neg logit contributions
 Flip-0.054
 shark-0.053
 Mitch-0.052
 fin-0.051
 lobster-0.049
Token.
Token ID13
Feature activation-9.61e-01

Pos logit contributions
 shark+0.007
 Flip+0.007
 fin+0.007
 Mitch+0.007
 Finn+0.006
Neg logit contributions
cases-0.015
packing-0.015
grand-0.015
cakes-0.015
ics-0.014
Token<|endoftext|>
Token ID50256
Feature activation-1.10e+00

Pos logit contributions
cases+0.035
packing+0.034
grand+0.034
cakes+0.034
days+0.033
Neg logit contributions
 Flip-0.026
 shark-0.026
 Mitch-0.024
 fin-0.024
 lobster-0.023
TokenOnce
Token ID7454
Feature activation-6.19e-01

Pos logit contributions
cases+0.033
cakes+0.033
days+0.032
ics+0.032
grand+0.032
Neg logit contributions
 shark-0.034
 fin-0.034
 Flip-0.034
 Mitch-0.032
 crabs-0.032
Token safe
Token ID3338
Feature activation+4.04e-01

Pos logit contributions
packing+0.074
EEP+0.074
grand+0.074
ugs+0.069
liv+0.068
Neg logit contributions
 coral-0.042
 shark-0.042
 crabs-0.041
 fin-0.040
 like-0.040
Token again
Token ID757
Feature activation+1.67e-01

Pos logit contributions
 shark+0.009
 Flip+0.008
 fin+0.008
 Mitch+0.008
 Finn+0.008
Neg logit contributions
packing-0.017
grand-0.017
cases-0.017
cakes-0.016
 brushes-0.016
Token.
Token ID13
Feature activation-9.53e-01

Pos logit contributions
 Flip+0.003
 shark+0.003
 fin+0.003
 Mitch+0.003
 Finn+0.003
Neg logit contributions
cases-0.007
cakes-0.007
packing-0.007
grand-0.007
ics-0.007
Token The
Token ID383
Feature activation-1.21e+00

Pos logit contributions
cases+0.031
grand+0.030
packing+0.030
cakes+0.030
days+0.029
Neg logit contributions
 Flip-0.030
 shark-0.029
 Mitch-0.028
 fin-0.028
 Finn-0.027
Token town
Token ID3240
Feature activation-3.73e+00

Pos logit contributions
cases+0.039
packing+0.038
cakes+0.038
days+0.038
grand+0.037
Neg logit contributions
 Flip-0.037
 Mitch-0.036
 Rory-0.034
 shark-0.034
 fin-0.034
Tokenspe
Token ID4125
Feature activation-6.31e+00

Pos logit contributions
grand+0.109
cakes+0.104
packing+0.102
cases+0.102
EEP+0.100
Neg logit contributions
 shark-0.127
 fin-0.120
 Flip-0.117
 Shark-0.113
 Fish-0.113
Tokenople
Token ID643
Feature activation-2.73e+00

Pos logit contributions
cases+0.151
cakes+0.147
grand+0.147
packing+0.139
days+0.138
Neg logit contributions
 shark-0.245
 Flip-0.243
 fin-0.236
 Mitch-0.234
 crabs-0.230
Token thanked
Token ID26280
Feature activation+3.00e-01

Pos logit contributions
cases+0.081
grand+0.079
cakes+0.078
packing+0.076
 brushes+0.073
Neg logit contributions
 shark-0.092
 Flip-0.091
 fin-0.089
 Mitch-0.088
 lobster-0.084
Token the
Token ID262
Feature activation+1.17e+00

Pos logit contributions
 fin+0.009
 Flip+0.009
 shark+0.009
 lobster+0.009
 Mitch+0.009
Neg logit contributions
packing-0.009
cases-0.009
 Sarah-0.009
 Kim-0.009
grand-0.009
Token bear
Token ID6842
Feature activation+3.00e+00

Pos logit contributions
 Flip+0.035
 Mitch+0.034
 shark+0.033
 fin+0.033
 Rory+0.032
Neg logit contributions
cases-0.038
cakes-0.037
packing-0.037
grand-0.037
days-0.036
Token many
Token ID867
Feature activation+1.21e-01

Pos logit contributions
 Flip+0.080
 shark+0.079
 Mitch+0.076
 fin+0.075
 Finn+0.071
Neg logit contributions
cases-0.108
grand-0.107
packing-0.106
cakes-0.106
 brushes-0.103
Token the
Token ID262
Feature activation-2.78e-01

Pos logit contributions
cases+0.062
cakes+0.059
packing+0.059
grand+0.059
days+0.057
Neg logit contributions
 Flip-0.057
 shark-0.057
 Mitch-0.054
 fin-0.054
 Finn-0.052
Token tea
Token ID8887
Feature activation+6.33e-01

Pos logit contributions
cases+0.007
grand+0.007
cakes+0.007
packing+0.007
days+0.007
Neg logit contributions
 Flip-0.010
 shark-0.010
 Mitch-0.010
 fin-0.010
 lobster-0.010
Token party
Token ID2151
Feature activation-2.86e+00

Pos logit contributions
 shark+0.023
 Flip+0.023
 Mitch+0.022
 Finn+0.021
 fin+0.021
Neg logit contributions
grand-0.017
packing-0.017
cases-0.016
any-0.016
cakes-0.015
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
packing+0.121
grand+0.119
any+0.116
cases+0.116
 brushes+0.114
Neg logit contributions
 Flip-0.054
 shark-0.054
 Mitch-0.049
 Finn-0.048
 crabs-0.047
Token Maybe
Token ID6674
Feature activation+4.93e-01

Pos logit contributions
cases+0.041
grand+0.040
packing+0.039
cakes+0.039
days+0.038
Neg logit contributions
 shark-0.049
 Flip-0.049
 Mitch-0.048
 fin-0.047
 lobster-0.046
Token there
Token ID612
Feature activation-6.30e+00

Pos logit contributions
 Flip+0.021
 shark+0.021
 Mitch+0.020
 fin+0.020
 Finn+0.020
Neg logit contributions
cases-0.010
packing-0.010
cakes-0.010
grand-0.009
days-0.009
Token are
Token ID389
Feature activation-7.28e-01

Pos logit contributions
packing+0.218
cases+0.214
any+0.211
cakes+0.205
grand+0.202
Neg logit contributions
 shark-0.179
 Flip-0.177
 Mitch-0.166
 crabs-0.164
 Finn-0.161
Token some
Token ID617
Feature activation-2.67e+00

Pos logit contributions
cases+0.026
packing+0.026
grand+0.026
cakes+0.025
days+0.025
Neg logit contributions
 Flip-0.020
 shark-0.020
 Mitch-0.019
 fin-0.018
 Finn-0.018
Token cookies
Token ID14746
Feature activation-4.27e-01

Pos logit contributions
cases+0.073
cakes+0.072
grand+0.070
packing+0.069
days+0.069
Neg logit contributions
 Flip-0.095
 Mitch-0.092
 shark-0.091
 fin-0.090
 lobster-0.087
Token left
Token ID1364
Feature activation-8.32e-01

Pos logit contributions
cases+0.016
grand+0.015
packing+0.015
cakes+0.015
any+0.015
Neg logit contributions
 shark-0.011
 Flip-0.011
 Mitch-0.011
 fin-0.010
 Finn-0.010
Token for
Token ID329
Feature activation+6.54e-01

Pos logit contributions
cases+0.027
grand+0.026
cakes+0.026
packing+0.026
days+0.025
Neg logit contributions
 Flip-0.025
 shark-0.025
 Mitch-0.024
 fin-0.024
 lobster-0.023
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
cases+0.012
packs+0.012
days+0.011
 brushes+0.011
 Kim+0.011
Neg logit contributions
 Flip-0.013
 Mitch-0.013
 lobster-0.013
 Rory-0.012
 Richie-0.012
Token They
Token ID1119
Feature activation-1.21e-01

Pos logit contributions
 Kim+0.024
days+0.023
packs+0.023
 Sarah+0.022
 brushes+0.022
Neg logit contributions
 shark-0.059
 crabs-0.058
 lobster-0.058
 seaw-0.057
 fin-0.057
Token have
Token ID423
Feature activation-1.91e+00

Pos logit contributions
 brushes+0.002
cases+0.002
 rang+0.002
days+0.002
packing+0.002
Neg logit contributions
 Flip-0.005
 shark-0.005
 Mitch-0.005
 lobster-0.005
 Rory-0.005
Token no
Token ID645
Feature activation-1.74e-01

Pos logit contributions
cases+0.069
grand+0.067
packing+0.067
cakes+0.067
days+0.065
Neg logit contributions
 Flip-0.051
 shark-0.051
 Mitch-0.049
 fin-0.048
 lobster-0.046
Token umb
Token ID20810
Feature activation-5.21e+00

Pos logit contributions
packing+0.006
cases+0.006
grand+0.005
cakes+0.005
ics+0.005
Neg logit contributions
 crabs-0.005
 shark-0.005
 fin-0.005
 Flip-0.005
 Finn-0.005
Tokenrell
Token ID11252
Feature activation-6.30e+00

Pos logit contributions
grand+0.115
packing+0.103
days+0.102
cases+0.097
packs+0.096
Neg logit contributions
 Flip-0.202
 Finn-0.191
 rather-0.188
 Sno-0.187
 like-0.187
Tokenas
Token ID292
Feature activation-1.79e+00

Pos logit contributions
grand+0.230
cases+0.219
packing+0.215
 Susan+0.214
ital+0.213
Neg logit contributions
 Flip-0.155
 crabs-0.154
 Finn-0.149
 shark-0.144
 Mitch-0.142
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
grand+0.073
cases+0.073
packing+0.073
cakes+0.072
 brushes+0.069
Neg logit contributions
 shark-0.039
 Flip-0.038
 fin-0.036
 Mitch-0.035
 Finn-0.034
Token\n
Token ID198
Feature activation-6.19e-01

Pos logit contributions
 Kim+0.027
days+0.026
packs+0.026
grand+0.025
 brushes+0.025
Neg logit contributions
 shark-0.064
 lobster-0.062
 crabs-0.062
 fin-0.062
 seaw-0.062
Token\n
Token ID198
Feature activation+1.25e-01

Pos logit contributions
grand+0.016
 Kim+0.015
days+0.015
cases+0.015
packs+0.014
Neg logit contributions
 lobster-0.023
 shark-0.023
 Flip-0.023
 crabs-0.022
 Mitch-0.022
TokenThey
Token ID2990
Feature activation-8.43e-02

Pos logit contributions
 shark+0.005
 Flip+0.005
 lobster+0.005
 fin+0.005
 Mitch+0.005
Neg logit contributions
packing-0.003
grand-0.003
cases-0.003
days-0.003
 brushes-0.003
Token good
Token ID922
Feature activation+3.14e-01

Pos logit contributions
cases+0.052
grand+0.051
packing+0.051
cakes+0.051
days+0.049
Neg logit contributions
 Flip-0.041
 shark-0.040
 Mitch-0.039
 fin-0.038
 Finn-0.037
Token friends
Token ID2460
Feature activation-1.28e+00

Pos logit contributions
 Flip+0.012
 shark+0.012
 Mitch+0.011
 fin+0.011
 lobster+0.011
Neg logit contributions
cases-0.008
cakes-0.008
packing-0.008
grand-0.008
days-0.008
Token and
Token ID290
Feature activation-1.68e+00

Pos logit contributions
cases+0.041
days+0.041
packing+0.040
cakes+0.040
grand+0.040
Neg logit contributions
 Flip-0.038
 shark-0.038
 Mitch-0.037
 fin-0.037
 lobster-0.036
Token never
Token ID1239
Feature activation-1.78e+00

Pos logit contributions
cases+0.060
ugs+0.055
z+0.055
grand+0.055
packing+0.054
Neg logit contributions
 shark-0.047
 Flip-0.047
 crabs-0.046
 Finn-0.045
 Mitch-0.044
Token qu
Token ID627
Feature activation-1.91e+00

Pos logit contributions
cases+0.054
cakes+0.053
ics+0.051
grand+0.051
ugs+0.050
Neg logit contributions
 shark-0.059
 Flip-0.059
 Finn-0.057
 fin-0.057
 crabs-0.056
Tokenarre
Token ID9624
Feature activation-6.28e+00

Pos logit contributions
grand+0.039
cases+0.037
packing+0.036
cakes+0.034
days+0.033
Neg logit contributions
 Flip-0.082
 shark-0.081
 fin-0.079
 Mitch-0.079
 lobster-0.078
Tokenled
Token ID992
Feature activation-1.93e+00

Pos logit contributions
grand+0.265
 Kim+0.242
cakes+0.238
 brushes+0.235
cases+0.235
Neg logit contributions
 shark-0.133
 Flip-0.126
 Richie-0.120
 crabs-0.115
 Finn-0.115
Token again
Token ID757
Feature activation+4.85e-01

Pos logit contributions
cases+0.064
cakes+0.063
days+0.063
grand+0.062
packing+0.061
Neg logit contributions
 Flip-0.055
 shark-0.054
 fin-0.053
 Mitch-0.053
 lobster-0.050
Token.
Token ID13
Feature activation-7.21e-01

Pos logit contributions
 Flip+0.009
 shark+0.009
 fin+0.008
 Mitch+0.008
 Finn+0.008
Neg logit contributions
cases-0.022
packing-0.022
cakes-0.021
grand-0.021
ics-0.021
Token<|endoftext|>
Token ID50256
Feature activation-8.93e-01

Pos logit contributions
cases+0.024
packing+0.024
grand+0.023
cakes+0.023
ics+0.023
Neg logit contributions
 Flip-0.021
 shark-0.021
 Mitch-0.020
 fin-0.019
 Finn-0.019
TokenOnce
Token ID7454
Feature activation-4.79e-01

Pos logit contributions
cases+0.028
cakes+0.028
days+0.027
grand+0.027
packing+0.027
Neg logit contributions
 shark-0.027
 Flip-0.027
 fin-0.027
 Mitch-0.026
 lobster-0.025
Token about
Token ID546
Feature activation+1.85e-01

Pos logit contributions
cases+0.003
cakes+0.003
packing+0.003
grand+0.002
days+0.002
Neg logit contributions
 Flip-0.002
 shark-0.002
 Mitch-0.002
 fin-0.002
 lobster-0.002
Token rain
Token ID6290
Feature activation-2.65e+00

Pos logit contributions
 Flip+0.006
 shark+0.006
 Mitch+0.006
 fin+0.006
 lobster+0.005
Neg logit contributions
 brushes-0.005
cases-0.005
packing-0.005
cakes-0.005
ics-0.005
Tokenbows
Token ID25435
Feature activation-2.32e+00

Pos logit contributions
grand+0.051
packing+0.047
cases+0.043
packs+0.039
 Susan+0.037
Neg logit contributions
 Flip-0.115
 shark-0.115
 Finn-0.110
 fin-0.110
 like-0.109
Token and
Token ID290
Feature activation-1.20e+00

Pos logit contributions
cases+0.084
cakes+0.082
packing+0.080
grand+0.080
ics+0.079
Neg logit contributions
 Flip-0.062
 shark-0.061
 Mitch-0.059
 fin-0.059
 lobster-0.056
Token umb
Token ID20810
Feature activation-5.27e+00

Pos logit contributions
cases+0.042
packing+0.041
grand+0.041
cakes+0.040
days+0.039
Neg logit contributions
 shark-0.035
 Flip-0.034
 fin-0.033
 Mitch-0.033
 crabs-0.032
Tokenrell
Token ID11252
Feature activation-6.25e+00

Pos logit contributions
grand+0.106
packing+0.103
days+0.096
packs+0.095
cases+0.094
Neg logit contributions
 Flip-0.211
 Finn-0.202
 Sno-0.200
 coral-0.196
 Qu-0.195
Tokenas
Token ID292
Feature activation-1.73e+00

Pos logit contributions
grand+0.234
packing+0.219
ital+0.216
 Susan+0.214
cases+0.213
Neg logit contributions
 crabs-0.148
 Flip-0.148
 Finn-0.140
ascal-0.136
 shark-0.134
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
cases+0.071
grand+0.070
packing+0.070
cakes+0.069
days+0.067
Neg logit contributions
 Flip-0.037
 shark-0.037
 fin-0.035
 Mitch-0.035
 Finn-0.033
Token They
Token ID1119
Feature activation-3.07e-01

Pos logit contributions
 Kim+0.026
cakes+0.025
 brushes+0.025
cases+0.025
 Sarah+0.024
Neg logit contributions
 shark-0.052
 fin-0.050
 lobster-0.049
 Flip-0.049
 Mitch-0.048
Token were
Token ID547
Feature activation-2.18e+00

Pos logit contributions
cases+0.008
grand+0.008
packing+0.008
cakes+0.008
days+0.008
Neg logit contributions
 Flip-0.011
 shark-0.011
 Mitch-0.010
 fin-0.010
 crabs-0.010
Token very
Token ID845
Feature activation-4.47e-01

Pos logit contributions
cases+0.077
grand+0.076
packing+0.075
cakes+0.074
days+0.071
Neg logit contributions
 Flip-0.062
 shark-0.061
 fin-0.060
 Mitch-0.059
 Finn-0.057
Token toys
Token ID14958
Feature activation-2.83e+00

Pos logit contributions
packing+0.059
cases+0.055
grand+0.054
ics+0.053
any+0.053
Neg logit contributions
 shark-0.081
 Flip-0.080
 crabs-0.078
 coral-0.077
 fin-0.077
Token and
Token ID290
Feature activation-2.53e+00

Pos logit contributions
packing+0.113
cases+0.112
grand+0.112
cakes+0.110
any+0.109
Neg logit contributions
 shark-0.062
 Flip-0.062
 Mitch-0.058
 fin-0.057
 crabs-0.055
Token puzzles
Token ID24367
Feature activation-3.20e+00

Pos logit contributions
packing+0.069
cases+0.065
ics+0.063
grand+0.062
any+0.062
Neg logit contributions
 shark-0.092
 Flip-0.091
 crabs-0.090
 fin-0.088
 coral-0.087
Token and
Token ID290
Feature activation-3.06e+00

Pos logit contributions
cases+0.130
packing+0.128
grand+0.128
cakes+0.126
days+0.123
Neg logit contributions
 shark-0.070
 Flip-0.070
 Mitch-0.066
 fin-0.065
 crabs-0.061
Token cr
Token ID1067
Feature activation-2.60e+00

Pos logit contributions
packing+0.089
cases+0.081
ics+0.080
grand+0.079
any+0.079
Neg logit contributions
 shark-0.105
 crabs-0.104
 Flip-0.103
 coral-0.101
 seaw-0.100
Tokenay
Token ID323
Feature activation-6.23e+00

Pos logit contributions
packing+0.066
days+0.064
grand+0.064
cases+0.060
any+0.059
Neg logit contributions
 Flip-0.092
 Mitch-0.090
 shark-0.088
 fin-0.088
 Rory-0.085
Tokenons
Token ID684
Feature activation-4.04e+00

Pos logit contributions
packing+0.135
cases+0.127
days+0.122
grand+0.120
any+0.117
Neg logit contributions
 Flip-0.246
 crabs-0.245
 shark-0.244
 Mitch-0.237
 Finn-0.235
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
cases+0.162
packing+0.159
grand+0.159
cakes+0.158
days+0.154
Neg logit contributions
 Flip-0.091
 shark-0.091
 Mitch-0.086
 fin-0.085
 lobster-0.080
Token There
Token ID1318
Feature activation-4.27e+00

Pos logit contributions
cases+0.040
grand+0.039
 Kim+0.039
cakes+0.038
days+0.037
Neg logit contributions
 shark-0.049
 fin-0.048
 Flip-0.047
 lobster-0.045
 Mitch-0.045
Token are
Token ID389
Feature activation-9.47e-01

Pos logit contributions
any+0.148
packing+0.146
cases+0.142
cakes+0.140
grand+0.138
Neg logit contributions
 shark-0.123
 Flip-0.120
 Mitch-0.118
 crabs-0.113
 Finn-0.111
Token hats
Token ID23910
Feature activation-1.69e+00

Pos logit contributions
cases+0.026
cakes+0.025
grand+0.025
packing+0.025
days+0.024
Neg logit contributions
 shark-0.033
 Flip-0.033
 Mitch-0.032
 fin-0.032
 lobster-0.031
Token could
Token ID714
Feature activation+5.85e-01

Pos logit contributions
cases+0.011
cakes+0.011
 brushes+0.011
ics+0.011
packs+0.011
Neg logit contributions
 shark-0.012
 Flip-0.012
 Mitch-0.011
 fin-0.011
 Finn-0.011
Token be
Token ID307
Feature activation-3.24e-01

Pos logit contributions
 Flip+0.016
 shark+0.016
 fin+0.016
 Mitch+0.016
 Finn+0.015
Neg logit contributions
cases-0.021
cakes-0.020
packing-0.020
grand-0.020
 brushes-0.019
Token and
Token ID290
Feature activation-3.59e-01

Pos logit contributions
cases+0.012
packing+0.012
cakes+0.012
grand+0.011
 brushes+0.011
Neg logit contributions
 shark-0.009
 Flip-0.008
 fin-0.008
 crabs-0.008
 Mitch-0.008
Token never
Token ID1239
Feature activation-1.49e+00

Pos logit contributions
cases+0.013
cakes+0.013
packing+0.013
grand+0.012
ics+0.012
Neg logit contributions
 shark-0.010
 Flip-0.010
 fin-0.009
 Mitch-0.009
 Finn-0.009
Token mis
Token ID2984
Feature activation-4.30e+00

Pos logit contributions
cases+0.046
cakes+0.045
packing+0.045
grand+0.043
ics+0.042
Neg logit contributions
 shark-0.049
 Flip-0.049
 fin-0.047
 Mitch-0.047
 Finn-0.046
Tokenbeh
Token ID20709
Feature activation-6.22e+00

Pos logit contributions
cases+0.102
cakes+0.102
grand+0.097
packing+0.092
 brushes+0.088
Neg logit contributions
 Flip-0.169
 shark-0.166
 fin-0.166
 Mitch-0.162
 crabs-0.158
Tokenaved
Token ID9586
Feature activation-2.96e+00

Pos logit contributions
cases+0.173
grand+0.172
cakes+0.169
packing+0.163
ics+0.160
Neg logit contributions
 Flip-0.216
 shark-0.216
 fin-0.210
 Mitch-0.206
 Finn-0.201
Token.
Token ID13
Feature activation-7.52e-01

Pos logit contributions
cases+0.107
packing+0.104
grand+0.104
cakes+0.103
days+0.101
Neg logit contributions
 Flip-0.079
 shark-0.079
 Mitch-0.075
 fin-0.075
 lobster-0.071
Token<|endoftext|>
Token ID50256
Feature activation-9.60e-01

Pos logit contributions
grand+0.025
packing+0.024
cases+0.024
any+0.024
days+0.024
Neg logit contributions
 shark-0.022
 Flip-0.022
 fin-0.021
 Mitch-0.021
 crabs-0.021
TokenOnce
Token ID7454
Feature activation-4.77e-01

Pos logit contributions
cases+0.029
cakes+0.029
days+0.028
ics+0.028
grand+0.028
Neg logit contributions
 shark-0.030
 fin-0.030
 Flip-0.029
 Mitch-0.029
 crabs-0.028
Token upon
Token ID2402
Feature activation+2.38e-01

Pos logit contributions
cases+0.013
packing+0.013
cakes+0.013
grand+0.012
any+0.012
Neg logit contributions
 shark-0.017
 Flip-0.017
 Mitch-0.016
 fin-0.016
 Finn-0.016
Token sharing
Token ID7373
Feature activation+7.27e-01

Pos logit contributions
cases+0.024
packing+0.023
cakes+0.023
grand+0.023
days+0.022
Neg logit contributions
 shark-0.024
 Flip-0.024
 Mitch-0.022
 fin-0.022
 Finn-0.022
Token is
Token ID318
Feature activation+4.52e-01

Pos logit contributions
 shark+0.024
 Flip+0.024
 Mitch+0.023
 fin+0.023
 Finn+0.022
Neg logit contributions
cases-0.022
packing-0.021
grand-0.021
cakes-0.020
days-0.020
Token important
Token ID1593
Feature activation-4.40e-01

Pos logit contributions
 shark+0.013
 Flip+0.013
 Mitch+0.013
 fin+0.013
 lobster+0.012
Neg logit contributions
cases-0.015
packing-0.015
grand-0.015
cakes-0.015
days-0.014
Token and
Token ID290
Feature activation-3.93e-01

Pos logit contributions
cases+0.015
grand+0.015
packing+0.015
cakes+0.015
days+0.015
Neg logit contributions
 shark-0.012
 Flip-0.012
 Mitch-0.012
 fin-0.012
 Finn-0.011
Token qu
Token ID627
Feature activation-1.90e+00

Pos logit contributions
cases+0.015
cakes+0.014
ugs+0.014
z+0.014
packs+0.013
Neg logit contributions
 shark-0.011
 Flip-0.011
 Finn-0.010
 Mitch-0.010
 fin-0.010
Tokenarre
Token ID9624
Feature activation-6.21e+00

Pos logit contributions
cases+0.039
grand+0.038
packing+0.036
cakes+0.035
days+0.033
Neg logit contributions
 Flip-0.081
 shark-0.080
 fin-0.078
 Mitch-0.078
 Finn-0.076
Tokenlling
Token ID2680
Feature activation-4.06e+00

Pos logit contributions
grand+0.274
 Kim+0.268
cakes+0.265
cases+0.264
EEP+0.257
Neg logit contributions
 shark-0.110
 Richie-0.096
 Flip-0.095
 fin-0.094
 Mitch-0.087
Token is
Token ID318
Feature activation+1.81e-01

Pos logit contributions
grand+0.150
packing+0.149
cases+0.147
cakes+0.145
days+0.142
Neg logit contributions
 shark-0.104
 Flip-0.100
 fin-0.098
 Mitch-0.096
 Finn-0.092
Token not
Token ID407
Feature activation-1.31e+00

Pos logit contributions
 shark+0.005
 Flip+0.005
 Mitch+0.005
 fin+0.005
 lobster+0.005
Neg logit contributions
cases-0.006
packing-0.006
grand-0.006
cakes-0.006
days-0.006
Token good
Token ID922
Feature activation-2.79e-01

Pos logit contributions
cases+0.050
cakes+0.049
grand+0.048
packing+0.047
days+0.046
Neg logit contributions
 Flip-0.035
 shark-0.035
 fin-0.033
 Mitch-0.032
 crabs-0.032
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
cases+0.011
packing+0.011
grand+0.011
cakes+0.011
days+0.010
Neg logit contributions
 shark-0.007
 Flip-0.006
 Mitch-0.006
 fin-0.006
 Finn-0.006
Token their
Token ID511
Feature activation-1.88e+00

Pos logit contributions
cases+0.090
cakes+0.086
packing+0.083
grand+0.081
bows+0.077
Neg logit contributions
 shark-0.056
 Flip-0.055
 Mitch-0.052
 Finn-0.051
 fin-0.051
Token station
Token ID4429
Feature activation-1.71e+00

Pos logit contributions
packing+0.051
cases+0.051
cakes+0.050
grand+0.048
bows+0.047
Neg logit contributions
 shark-0.072
 fin-0.067
 Flip-0.065
 coral-0.064
 crabs-0.064
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
packing+0.073
grand+0.072
cases+0.072
cakes+0.071
any+0.070
Neg logit contributions
 shark-0.033
 Flip-0.031
 fin-0.030
 Mitch-0.028
 crabs-0.028
Token They
Token ID1119
Feature activation-3.94e-01

Pos logit contributions
cases+0.030
grand+0.030
packing+0.028
cakes+0.028
days+0.028
Neg logit contributions
 shark-0.042
 Flip-0.042
 fin-0.041
 lobster-0.040
 crabs-0.040
Token hope
Token ID2911
Feature activation-1.40e+00

Pos logit contributions
cases+0.011
packing+0.011
grand+0.011
cakes+0.011
days+0.010
Neg logit contributions
 Flip-0.014
 shark-0.014
 Mitch-0.013
 fin-0.013
 Finn-0.013
Token there
Token ID612
Feature activation-6.20e+00

Pos logit contributions
packing+0.047
cases+0.046
grand+0.045
cakes+0.044
any+0.044
Neg logit contributions
 shark-0.043
 Flip-0.042
 Mitch-0.041
 fin-0.040
 Finn-0.039
Token will
Token ID481
Feature activation-5.26e-02

Pos logit contributions
cases+0.190
packing+0.188
cakes+0.183
grand+0.182
any+0.181
Neg logit contributions
 shark-0.199
 Flip-0.196
 Mitch-0.189
 fin-0.188
 crabs-0.181
Token be
Token ID307
Feature activation-1.57e+00

Pos logit contributions
cases+0.002
packing+0.002
cakes+0.002
ugs+0.002
any+0.002
Neg logit contributions
 shark-0.002
 Flip-0.002
 fin-0.002
 Mitch-0.002
 Shark-0.002
Token no
Token ID645
Feature activation-6.43e-01

Pos logit contributions
packing+0.064
cases+0.064
cakes+0.062
grand+0.061
any+0.060
Neg logit contributions
 shark-0.037
 Flip-0.035
 fin-0.034
 crabs-0.034
 Mitch-0.033
Token more
Token ID517
Feature activation-4.34e-01

Pos logit contributions
cases+0.030
packing+0.030
grand+0.029
cakes+0.029
ics+0.028
Neg logit contributions
 shark-0.013
 fin-0.011
 Flip-0.011
 crabs-0.011
 sharks-0.010
Token false
Token ID3991
Feature activation-2.40e+00

Pos logit contributions
packing+0.015
cakes+0.015
cases+0.015
days+0.014
grand+0.014
Neg logit contributions
 shark-0.013
 fin-0.013
 Flip-0.012
 crabs-0.012
 sharks-0.012
Token mom
Token ID1995
Feature activation-1.40e+00

Pos logit contributions
cases+0.025
packing+0.025
grand+0.024
cakes+0.024
days+0.023
Neg logit contributions
 shark-0.030
 Flip-0.029
 fin-0.029
 Mitch-0.028
 crabs-0.028
Tokenm
Token ID76
Feature activation+7.71e-01

Pos logit contributions
cases+0.045
grand+0.044
packing+0.044
cakes+0.044
days+0.042
Neg logit contributions
 Flip-0.043
 shark-0.043
 Mitch-0.041
 fin-0.041
 lobster-0.039
Tokenies
Token ID444
Feature activation-2.85e-01

Pos logit contributions
 shark+0.025
 fin+0.024
 Flip+0.024
 crabs+0.024
 Mitch+0.024
Neg logit contributions
packing-0.023
cases-0.023
grand-0.022
cakes-0.022
days-0.021
Token and
Token ID290
Feature activation-7.38e-01

Pos logit contributions
cases+0.009
cakes+0.009
packing+0.009
grand+0.009
ics+0.009
Neg logit contributions
 Flip-0.009
 shark-0.008
 Mitch-0.008
 fin-0.008
 Finn-0.008
Token d
Token ID288
Feature activation-2.46e+00

Pos logit contributions
cases+0.025
z+0.025
packing+0.024
ital+0.024
days+0.024
Neg logit contributions
 shark-0.022
 crabs-0.022
 Flip-0.021
 Finn-0.020
 fin-0.020
Tokenadd
Token ID2860
Feature activation-6.19e+00

Pos logit contributions
cases+0.072
grand+0.070
packing+0.070
cakes+0.069
days+0.067
Neg logit contributions
 Flip-0.083
 shark-0.083
 Mitch-0.080
 fin-0.079
 Finn-0.076
Tokenies
Token ID444
Feature activation-2.45e+00

Pos logit contributions
grand+0.208
packing+0.205
cases+0.198
cakes+0.197
days+0.191
Neg logit contributions
 crabs-0.169
 Flip-0.164
 Finn-0.162
 shark-0.161
 sharks-0.156
Token,
Token ID11
Feature activation-1.61e+00

Pos logit contributions
grand+0.088
cases+0.086
packing+0.086
cakes+0.084
days+0.082
Neg logit contributions
 shark-0.067
 Flip-0.067
 fin-0.064
 Mitch-0.062
 crabs-0.061
Token who
Token ID508
Feature activation-8.90e-01

Pos logit contributions
cases+0.062
grand+0.061
packing+0.060
cakes+0.059
ics+0.059
Neg logit contributions
 shark-0.040
 Flip-0.040
 fin-0.038
 crabs-0.038
 Mitch-0.037
Token cl
Token ID537
Feature activation-2.32e+00

Pos logit contributions
cases+0.027
cakes+0.025
EEP+0.025
grand+0.025
packing+0.024
Neg logit contributions
 shark-0.030
 Flip-0.030
 Mitch-0.028
 Finn-0.028
 crabs-0.027
Tokenapped
Token ID6320
Feature activation-1.62e+00

Pos logit contributions
cases+0.070
grand+0.069
packing+0.066
cakes+0.066
ics+0.063
Neg logit contributions
 Flip-0.076
 shark-0.076
 fin-0.074
 Mitch-0.074
 Finn-0.072
Token Pam
Token ID24497
Feature activation-2.28e+00

Pos logit contributions
cases+0.036
cakes+0.034
packing+0.033
grand+0.033
days+0.032
Neg logit contributions
 Flip-0.025
 Mitch-0.025
 shark-0.024
 crabs-0.024
 Finn-0.024
Token are
Token ID389
Feature activation-1.16e+00

Pos logit contributions
cases+0.066
cakes+0.066
packing+0.064
grand+0.064
days+0.061
Neg logit contributions
 shark-0.074
 Flip-0.074
 Mitch-0.072
 fin-0.071
 crabs-0.070
Token friends
Token ID2460
Feature activation-1.51e+00

Pos logit contributions
grand+0.047
cases+0.047
cakes+0.045
packing+0.045
EEP+0.042
Neg logit contributions
 Flip-0.029
 shark-0.027
 Mitch-0.027
 fin-0.026
 crabs-0.026
Token again
Token ID757
Feature activation-3.23e-01

Pos logit contributions
days+0.052
packing+0.051
cases+0.051
cakes+0.050
grand+0.050
Neg logit contributions
 Flip-0.043
 shark-0.042
 Mitch-0.041
 fin-0.040
 lobster-0.039
Token and
Token ID290
Feature activation-2.13e+00

Pos logit contributions
cases+0.015
cakes+0.014
grand+0.014
packing+0.014
ics+0.013
Neg logit contributions
 Flip-0.006
 shark-0.006
 fin-0.006
 Mitch-0.006
 crabs-0.006
Token there
Token ID612
Feature activation-6.15e+00

Pos logit contributions
cases+0.070
grand+0.066
cakes+0.065
packing+0.064
ugs+0.064
Neg logit contributions
 Flip-0.068
 shark-0.067
 crabs-0.066
 Finn-0.066
 Mitch-0.065
Token is
Token ID318
Feature activation-1.07e-01

Pos logit contributions
grand+0.218
cases+0.218
packing+0.212
cakes+0.210
any+0.205
Neg logit contributions
 Flip-0.169
 Mitch-0.166
 shark-0.165
 Finn-0.158
 crabs-0.157
Token peace
Token ID4167
Feature activation-3.12e-01

Pos logit contributions
grand+0.004
cases+0.003
cakes+0.003
days+0.003
 brushes+0.003
Neg logit contributions
 shark-0.003
 Flip-0.003
 fin-0.003
 Mitch-0.003
 lobster-0.003
Token between
Token ID1022
Feature activation-7.06e-01

Pos logit contributions
cases+0.012
grand+0.012
packing+0.012
cakes+0.012
days+0.011
Neg logit contributions
 shark-0.008
 Flip-0.008
 Mitch-0.007
 fin-0.007
 Finn-0.007
Token them
Token ID606
Feature activation-1.45e+00

Pos logit contributions
cases+0.021
cakes+0.021
days+0.021
grand+0.021
packing+0.021
Neg logit contributions
 Flip-0.022
 shark-0.022
 Mitch-0.021
 fin-0.021
 lobster-0.020
Token.
Token ID13
Feature activation-8.15e-01

Pos logit contributions
cases+0.060
packing+0.060
grand+0.060
cakes+0.059
days+0.057
Neg logit contributions
 shark-0.030
 Flip-0.030
 fin-0.028
 Mitch-0.028
 Finn-0.026
Token home
Token ID1363
Feature activation-1.15e+00

Pos logit contributions
cases+0.016
packing+0.016
grand+0.016
cakes+0.016
days+0.015
Neg logit contributions
 shark-0.023
 Flip-0.022
 fin-0.022
 Mitch-0.022
 Finn-0.021
Token.
Token ID13
Feature activation-7.04e-01

Pos logit contributions
packing+0.053
cakes+0.052
cases+0.052
grand+0.051
 brushes+0.050
Neg logit contributions
 shark-0.020
 Flip-0.019
 fin-0.019
 Mitch-0.018
 Rory-0.017
Token<|endoftext|>
Token ID50256
Feature activation-8.52e-01

Pos logit contributions
cases+0.020
grand+0.020
packing+0.020
cakes+0.020
 brushes+0.019
Neg logit contributions
 shark-0.023
 Flip-0.023
 fin-0.023
 lobster-0.022
 Mitch-0.022
TokenL
Token ID43
Feature activation+3.51e-01

Pos logit contributions
ics+0.020
packs+0.020
cakes+0.019
days+0.018
z+0.018
Neg logit contributions
 fin-0.033
 shark-0.032
 crabs-0.031
 Flip-0.030
 lobster-0.030
Tokenily
Token ID813
Feature activation+8.99e-01

Pos logit contributions
 shark+0.008
 Flip+0.008
 Mitch+0.008
 fin+0.008
 crabs+0.007
Neg logit contributions
cases-0.014
packing-0.014
cakes-0.014
grand-0.014
days-0.013
Token liked
Token ID8288
Feature activation+3.29e+00

Pos logit contributions
 shark+0.024
 Flip+0.023
 Mitch+0.023
 fin+0.022
 Finn+0.021
Neg logit contributions
cases-0.034
cakes-0.034
packing-0.032
days-0.032
grand-0.031
Token to
Token ID284
Feature activation-9.91e-02

Pos logit contributions
 shark+0.104
 Flip+0.101
 crabs+0.097
 fin+0.094
 Finn+0.094
Neg logit contributions
cases-0.111
packing-0.107
days-0.102
grand-0.102
any-0.102
Token play
Token ID711
Feature activation+3.18e-01

Pos logit contributions
cases+0.003
packing+0.003
cakes+0.003
grand+0.003
any+0.003
Neg logit contributions
 shark-0.003
 Flip-0.003
 fin-0.003
 Mitch-0.003
 crabs-0.003
Token with
Token ID351
Feature activation+2.07e+00

Pos logit contributions
 Mitch+0.012
 fin+0.012
 Flip+0.012
 lobster+0.011
 shark+0.011
Neg logit contributions
ics-0.007
cakes-0.007
pack-0.007
house-0.007
packing-0.007
Token her
Token ID607
Feature activation+6.48e-01

Pos logit contributions
 Flip+0.072
 shark+0.072
 Mitch+0.070
 fin+0.069
 Finn+0.067
Neg logit contributions
cases-0.059
packing-0.057
grand-0.056
cakes-0.056
days-0.054
Token doll
Token ID3654
Feature activation+2.42e+00

Pos logit contributions
 shark+0.026
 Flip+0.026
 Mitch+0.025
 fin+0.025
 Finn+0.024
Neg logit contributions
cases-0.015
packing-0.014
grand-0.014
cakes-0.014
days-0.013
Token the
Token ID262
Feature activation-3.09e+00

Pos logit contributions
 shark+0.003
 Flip+0.003
 Mitch+0.003
 fin+0.003
 coral+0.003
Neg logit contributions
packing-0.003
cases-0.003
grand-0.003
packs-0.003
any-0.003
Token kitchen
Token ID9592
Feature activation-1.03e+00

Pos logit contributions
cases+0.087
grand+0.085
cakes+0.084
packing+0.084
days+0.081
Neg logit contributions
 Flip-0.105
 shark-0.103
 Mitch-0.101
 fin-0.099
 Finn-0.097
Token to
Token ID284
Feature activation-8.90e-01

Pos logit contributions
packing+0.044
grand+0.042
cases+0.041
any+0.040
days+0.040
Neg logit contributions
 shark-0.022
 fin-0.021
 Mitch-0.020
 Flip-0.019
 coral-0.019
Token get
Token ID651
Feature activation-3.97e-02

Pos logit contributions
cases+0.027
packing+0.026
cakes+0.025
grand+0.025
any+0.024
Neg logit contributions
 shark-0.030
 Flip-0.030
 Mitch-0.029
 fin-0.029
 Finn-0.028
Token a
Token ID257
Feature activation-1.32e+00

Pos logit contributions
cases+0.001
packing+0.001
grand+0.001
cakes+0.001
days+0.001
Neg logit contributions
 shark-0.001
 Flip-0.001
 Mitch-0.001
 fin-0.001
 Finn-0.001
Token cookie
Token ID19751
Feature activation+3.04e+00

Pos logit contributions
cases+0.038
grand+0.037
packing+0.037
cakes+0.036
ics+0.035
Neg logit contributions
 shark-0.046
 Flip-0.045
 fin-0.044
 Mitch-0.043
 lobster-0.042
Token,
Token ID11
Feature activation-6.12e-02

Pos logit contributions
 shark+0.069
 Flip+0.068
 Mitch+0.064
 fin+0.064
 Finn+0.060
Neg logit contributions
cases-0.122
packing-0.121
grand-0.120
cakes-0.117
days-0.116
Token but
Token ID475
Feature activation+1.45e+00

Pos logit contributions
cases+0.002
grand+0.002
packing+0.002
cakes+0.002
ics+0.002
Neg logit contributions
 Flip-0.001
 shark-0.001
 Mitch-0.001
 fin-0.001
 Finn-0.001
Token her
Token ID607
Feature activation-1.78e+00

Pos logit contributions
 Flip+0.041
 shark+0.040
 Mitch+0.039
 fin+0.038
 Finn+0.037
Neg logit contributions
cases-0.052
grand-0.051
packing-0.051
cakes-0.050
days-0.049
Token mom
Token ID1995
Feature activation-1.35e+00

Pos logit contributions
cases+0.052
packing+0.051
grand+0.051
cakes+0.050
days+0.049
Neg logit contributions
 shark-0.060
 Flip-0.060
 Mitch-0.058
 fin-0.057
 lobster-0.055
Token said
Token ID531
Feature activation-1.26e-01

Pos logit contributions
cases+0.037
grand+0.037
packing+0.037
packs+0.036
days+0.036
Neg logit contributions
 Flip-0.045
 Mitch-0.043
 shark-0.043
 fin-0.042
 lobster-0.041
Token.
Token ID13
Feature activation+3.28e-01

Pos logit contributions
cases+0.002
grand+0.002
cakes+0.002
packing+0.002
days+0.002
Neg logit contributions
 Flip-0.001
 shark-0.001
 Mitch-0.001
 fin-0.001
 Finn-0.001
Token They
Token ID1119
Feature activation+1.05e+00

Pos logit contributions
 shark+0.011
 Flip+0.011
 fin+0.011
 Mitch+0.011
 lobster+0.011
Neg logit contributions
cases-0.009
packing-0.009
cakes-0.009
grand-0.009
days-0.009
Token liked
Token ID8288
Feature activation+1.92e+00

Pos logit contributions
 Flip+0.043
 shark+0.043
 Mitch+0.041
 fin+0.041
 lobster+0.040
Neg logit contributions
cases-0.023
packing-0.023
 brushes-0.022
grand-0.021
days-0.021
Token to
Token ID284
Feature activation-6.65e-01

Pos logit contributions
 Flip+0.073
 shark+0.072
 Mitch+0.071
 fin+0.070
 Rory+0.068
Neg logit contributions
cases-0.046
packing-0.046
cakes-0.045
grand-0.045
ics-0.043
Token catch
Token ID4929
Feature activation+2.14e+00

Pos logit contributions
packing+0.020
cases+0.020
 brushes+0.019
grand+0.019
cakes+0.019
Neg logit contributions
 Flip-0.021
 shark-0.021
 Mitch-0.020
 fin-0.020
 Rory-0.019
Token bugs
Token ID11316
Feature activation+3.08e+00

Pos logit contributions
 Mitch+0.075
 shark+0.073
 Flip+0.073
ascal+0.073
 fin+0.072
Neg logit contributions
 brushes-0.055
ics-0.052
cakes-0.050
packs-0.050
cases-0.049
Token and
Token ID290
Feature activation-2.16e-01

Pos logit contributions
 shark+0.095
 Flip+0.092
 Mitch+0.092
 fin+0.091
 lobster+0.087
Neg logit contributions
cases-0.098
cakes-0.095
ics-0.094
packing-0.092
packs-0.090
Token leaves
Token ID5667
Feature activation+2.30e-01

Pos logit contributions
cases+0.006
grand+0.006
packing+0.006
cakes+0.006
days+0.006
Neg logit contributions
 Flip-0.007
 shark-0.007
 Mitch-0.007
 fin-0.007
 lobster-0.007
Token with
Token ID351
Feature activation+4.06e-01

Pos logit contributions
 shark+0.007
 Mitch+0.007
 Flip+0.007
 fin+0.007
 lobster+0.007
Neg logit contributions
ics-0.007
cases-0.007
cakes-0.007
packs-0.007
bows-0.007
Token the
Token ID262
Feature activation+5.91e-01

Pos logit contributions
 Flip+0.015
 shark+0.015
ascal+0.015
 Mitch+0.014
 lobster+0.014
Neg logit contributions
 brushes-0.011
packs-0.010
 Kim-0.010
 Sarah-0.009
ics-0.009
Token net
Token ID2010
Feature activation+5.76e-01

Pos logit contributions
 Flip+0.018
 Mitch+0.017
 shark+0.017
 Otto+0.017
 Rory+0.016
Neg logit contributions
 brushes-0.019
cakes-0.017
cases-0.017
packs-0.017
bows-0.017
Token,
Token ID11
Feature activation+2.60e-01

Pos logit contributions
cases+0.023
grand+0.022
packing+0.022
cakes+0.022
days+0.021
Neg logit contributions
 shark-0.025
 Flip-0.025
 Mitch-0.024
 fin-0.024
 lobster-0.023
Token she
Token ID673
Feature activation+7.23e-01

Pos logit contributions
 Flip+0.008
 shark+0.008
 Mitch+0.008
 Finn+0.008
 crabs+0.008
Neg logit contributions
grand-0.008
packing-0.008
cases-0.008
any-0.008
days-0.008
Token said
Token ID531
Feature activation-7.56e-01

Pos logit contributions
 shark+0.027
 Flip+0.027
 Mitch+0.026
 fin+0.026
 Finn+0.025
Neg logit contributions
cases-0.019
grand-0.018
packing-0.018
cakes-0.018
days-0.017
Token,
Token ID11
Feature activation+5.40e-01

Pos logit contributions
cases+0.027
grand+0.026
packing+0.026
cakes+0.026
days+0.025
Neg logit contributions
 Flip-0.021
 shark-0.021
 Mitch-0.020
 fin-0.020
 lobster-0.019
Token "
Token ID366
Feature activation+3.60e-01

Pos logit contributions
 Flip+0.017
 shark+0.017
 Mitch+0.017
 fin+0.016
 Finn+0.016
Neg logit contributions
cases-0.017
grand-0.016
packing-0.016
cakes-0.016
days-0.016
TokenPlease
Token ID5492
Feature activation+4.60e+00

Pos logit contributions
 Flip+0.008
 Mitch+0.007
 shark+0.007
 Otto+0.007
 fin+0.007
Neg logit contributions
cases-0.015
ics-0.014
packs-0.014
cakes-0.014
days-0.014
Token,
Token ID11
Feature activation+1.77e+00

Pos logit contributions
 shark+0.151
 Flip+0.151
 Mitch+0.146
 fin+0.145
 lobster+0.139
Neg logit contributions
cases-0.138
packing-0.134
grand-0.134
cakes-0.133
days-0.129
Token Lily
Token ID20037
Feature activation+1.15e-01

Pos logit contributions
 shark+0.049
 Flip+0.049
 Mitch+0.047
 fin+0.046
 Finn+0.045
Neg logit contributions
cases-0.066
ics-0.062
cakes-0.061
any-0.060
packs-0.060
Token,
Token ID11
Feature activation+7.09e-01

Pos logit contributions
 Flip+0.002
 shark+0.002
 Mitch+0.002
 fin+0.002
 Finn+0.002
Neg logit contributions
cases-0.006
cakes-0.005
packing-0.005
grand-0.005
 brushes-0.005
Token take
Token ID1011
Feature activation-3.48e-01

Pos logit contributions
 shark+0.020
 Flip+0.020
 Mitch+0.019
 fin+0.019
 Finn+0.018
Neg logit contributions
cases-0.026
cakes-0.024
ics-0.024
grand-0.024
packing-0.024
Token the
Token ID262
Feature activation+4.37e-01

Pos logit contributions
cases+0.013
packing+0.012
cakes+0.012
grand+0.012
days+0.012
Neg logit contributions
 Flip-0.010
 shark-0.010
 Mitch-0.009
 fin-0.009
 Finn-0.009
Token she
Token ID673
Feature activation+1.41e+00

Pos logit contributions
 Mitch+0.012
 Flip+0.012
 Finn+0.011
 Rory+0.011
 Ho+0.010
Neg logit contributions
cases-0.016
EEP-0.015
packing-0.015
ugs-0.015
packs-0.015
Token didn
Token ID1422
Feature activation+7.44e-01

Pos logit contributions
 Flip+0.049
 shark+0.048
 Mitch+0.047
 fin+0.047
 Finn+0.046
Neg logit contributions
cases-0.038
grand-0.038
cakes-0.037
packing-0.037
ics-0.037
Token't
Token ID470
Feature activation-8.53e-02

Pos logit contributions
 Flip+0.025
 shark+0.025
 Mitch+0.024
 fin+0.023
 Finn+0.023
Neg logit contributions
packing-0.021
grand-0.021
cases-0.021
days-0.020
packs-0.020
Token play
Token ID711
Feature activation-9.22e-01

Pos logit contributions
cases+0.003
cakes+0.002
packing+0.002
ics+0.002
grand+0.002
Neg logit contributions
 Flip-0.003
 shark-0.003
 fin-0.003
 Mitch-0.003
 crabs-0.003
Token with
Token ID351
Feature activation+1.09e+00

Pos logit contributions
cases+0.035
packing+0.035
grand+0.034
cakes+0.034
any+0.033
Neg logit contributions
 Flip-0.023
 shark-0.023
 Mitch-0.021
 fin-0.021
 Finn-0.020
Token anymore
Token ID7471
Feature activation+3.63e+00

Pos logit contributions
 shark+0.032
 Flip+0.032
 Mitch+0.031
 fin+0.031
 lobster+0.030
Neg logit contributions
cases-0.037
cakes-0.035
grand-0.035
packs-0.034
days-0.034
Token.
Token ID13
Feature activation-4.95e-01

Pos logit contributions
 shark+0.093
 Flip+0.092
 Mitch+0.089
 fin+0.088
 lobster+0.084
Neg logit contributions
cases-0.136
grand-0.132
cakes-0.131
days-0.130
packing-0.129
Token Her
Token ID2332
Feature activation-1.20e+00

Pos logit contributions
ics+0.020
days+0.020
packing+0.020
cases+0.019
packs+0.019
Neg logit contributions
 Flip-0.011
 Mitch-0.010
 Finn-0.010
 Rory-0.010
 Otto-0.010
Token mom
Token ID1995
Feature activation-2.51e+00

Pos logit contributions
cases+0.043
packing+0.042
grand+0.041
cakes+0.040
ics+0.040
Neg logit contributions
 shark-0.035
 fin-0.033
 Flip-0.032
 Mitch-0.031
 Finn-0.031
Token helped
Token ID4193
Feature activation-6.75e-01

Pos logit contributions
packs+0.077
cases+0.076
grand+0.076
days+0.072
bows+0.072
Neg logit contributions
 Flip-0.080
 shark-0.077
 Mitch-0.074
 fin-0.073
 Finn-0.073
Token her
Token ID607
Feature activation-6.22e-01

Pos logit contributions
cases+0.023
packing+0.022
grand+0.022
cakes+0.022
ics+0.021
Neg logit contributions
 Flip-0.020
 Mitch-0.019
 shark-0.019
 Finn-0.018
 fin-0.018
Token\n
Token ID198
Feature activation+6.10e-01

Pos logit contributions
 Flip+0.000
 shark+0.000
 Finn+0.000
 Mitch+0.000
 fin+0.000
Neg logit contributions
grand-0.001
packing-0.001
days-0.001
cases-0.001
cakes-0.001
Token\n
Token ID198
Feature activation+1.27e+00

Pos logit contributions
 Flip+0.015
 shark+0.015
 Mitch+0.014
 fin+0.014
 Finn+0.013
Neg logit contributions
cases-0.024
packing-0.023
grand-0.023
cakes-0.023
days-0.022
TokenAfter
Token ID3260
Feature activation+9.09e-01

Pos logit contributions
 Flip+0.030
He+0.029
 shark+0.027
 Finn+0.027
 fin+0.026
Neg logit contributions
bows-0.048
cakes-0.048
days-0.048
packs-0.047
 brushes-0.047
Token a
Token ID257
Feature activation-6.74e-01

Pos logit contributions
 Flip+0.028
 shark+0.028
 Mitch+0.027
 fin+0.027
 Finn+0.026
Neg logit contributions
cases-0.029
grand-0.029
packing-0.029
cakes-0.029
days-0.028
Token while
Token ID981
Feature activation+3.71e+00

Pos logit contributions
cases+0.020
grand+0.020
cakes+0.019
packing+0.019
ics+0.019
Neg logit contributions
 shark-0.022
 Flip-0.022
 Mitch-0.021
 fin-0.021
 Finn-0.020
Token,
Token ID11
Feature activation+5.59e-01

Pos logit contributions
 shark+0.093
 Flip+0.092
 fin+0.087
 Mitch+0.086
 Finn+0.083
Neg logit contributions
cases-0.144
grand-0.142
packing-0.141
cakes-0.140
ics-0.136
Token Tim
Token ID5045
Feature activation-1.59e+00

Pos logit contributions
 Flip+0.022
 shark+0.022
 Mitch+0.021
 fin+0.021
 lobster+0.021
Neg logit contributions
cases-0.013
packing-0.012
cakes-0.012
grand-0.012
days-0.012
Tokenmy
Token ID1820
Feature activation-1.07e+00

Pos logit contributions
cakes+0.054
cases+0.051
grand+0.050
packing+0.049
ics+0.047
Neg logit contributions
 fin-0.045
 shark-0.045
 Flip-0.045
 Mitch-0.044
 Shark-0.042
Token and
Token ID290
Feature activation-3.63e-01

Pos logit contributions
cakes+0.033
packing+0.033
grand+0.032
days+0.032
bows+0.031
Neg logit contributions
 shark-0.036
 Flip-0.035
 fin-0.035
 Mitch-0.034
 Shark-0.033
Token Max
Token ID5436
Feature activation-6.07e-01

Pos logit contributions
cakes+0.013
cases+0.013
packing+0.013
days+0.013
grand+0.013
Neg logit contributions
 Flip-0.010
 shark-0.010
 Rocky-0.009
 Mitch-0.009
 Finn-0.009
Token got
Token ID1392
Feature activation-1.71e+00

Pos logit contributions
grand+0.014
cakes+0.014
cases+0.014
packing+0.014
days+0.013
Neg logit contributions
 Flip-0.024
 shark-0.024
 fin-0.023
 Mitch-0.023
 Finn-0.022
Token a
Token ID257
Feature activation-1.54e+00

Pos logit contributions
cases+0.001
cakes+0.001
packing+0.001
grand+0.001
days+0.001
Neg logit contributions
 shark-0.001
 Flip-0.001
 fin-0.001
 Mitch-0.001
 lobster-0.001
Token cool
Token ID3608
Feature activation-2.25e-01

Pos logit contributions
cases+0.043
grand+0.042
packing+0.042
cakes+0.041
ics+0.040
Neg logit contributions
 shark-0.055
 Flip-0.054
 fin-0.053
 Mitch-0.052
 Finn-0.049
Token one
Token ID530
Feature activation-1.30e-01

Pos logit contributions
cases+0.006
grand+0.006
packing+0.006
cakes+0.006
ics+0.005
Neg logit contributions
 shark-0.008
 Flip-0.008
 fin-0.008
 Mitch-0.008
 Finn-0.008
Token on
Token ID319
Feature activation+4.96e-01

Pos logit contributions
cases+0.004
grand+0.004
packing+0.004
cakes+0.004
days+0.004
Neg logit contributions
 Flip-0.004
 shark-0.004
 Mitch-0.004
 fin-0.004
 lobster-0.004
Token her
Token ID607
Feature activation+1.31e-03

Pos logit contributions
 shark+0.018
 Flip+0.018
 lobster+0.017
 fin+0.017
 Mitch+0.017
Neg logit contributions
cakes-0.013
cases-0.013
 Kim-0.012
grand-0.012
 Sarah-0.012
Token instead
Token ID2427
Feature activation+1.89e+00

Pos logit contributions
 shark+0.000
 Flip+0.000
 fin+0.000
 Mitch+0.000
 Finn+0.000
Neg logit contributions
cases-0.000
packing-0.000
grand-0.000
cakes-0.000
days-0.000
Token.
Token ID13
Feature activation-1.73e-01

Pos logit contributions
 Flip+0.039
 shark+0.039
 Mitch+0.037
 fin+0.037
 Rory+0.034
Neg logit contributions
packing-0.079
cases-0.079
grand-0.077
cakes-0.077
days-0.074
Token Lily
Token ID20037
Feature activation-1.12e+00

Pos logit contributions
packing+0.007
grand+0.007
cases+0.006
ics+0.006
days+0.006
Neg logit contributions
 Flip-0.004
 Mitch-0.004
 Finn-0.004
 Otto-0.004
 Rory-0.004
Token felt
Token ID2936
Feature activation+1.16e+00

Pos logit contributions
grand+0.034
cases+0.034
packing+0.033
cakes+0.033
days+0.032
Neg logit contributions
 Flip-0.038
 shark-0.037
 Mitch-0.036
 fin-0.036
 Finn-0.034
Token much
Token ID881
Feature activation-1.47e+00

Pos logit contributions
 shark+0.046
 lobster+0.046
Tony+0.045
 Flip+0.045
ino+0.044
Neg logit contributions
 baked-0.024
grand-0.022
 done-0.021
cakes-0.021
 unlocked-0.021
Token better
Token ID1365
Feature activation+2.14e+00

Pos logit contributions
 baked+0.045
cakes+0.044
cases+0.042
 works+0.041
 brushes+0.041
Neg logit contributions
 Flip-0.045
 shark-0.044
 Mitch-0.044
 lobster-0.043
 Richie-0.042
Token you
Token ID345
Feature activation+1.02e+00

Pos logit contributions
 Flip+0.061
 shark+0.059
 Mitch+0.057
 Finn+0.057
 Otto+0.055
Neg logit contributions
cases-0.068
packs-0.067
packing-0.064
grand-0.064
any-0.064
Token collect
Token ID2824
Feature activation+6.22e-01

Pos logit contributions
 Flip+0.039
 shark+0.039
 Mitch+0.038
 Rory+0.037
 fin+0.037
Neg logit contributions
packing-0.023
days-0.023
grand-0.022
cases-0.022
ics-0.022
Token them
Token ID606
Feature activation-6.67e-02

Pos logit contributions
 shark+0.020
 Flip+0.020
 Mitch+0.020
 fin+0.020
 lobster+0.020
Neg logit contributions
cakes-0.018
cases-0.018
 brushes-0.018
grand-0.017
packing-0.017
Token?"
Token ID1701
Feature activation+1.79e+00

Pos logit contributions
cases+0.002
packing+0.002
cakes+0.002
grand+0.002
days+0.002
Neg logit contributions
 Flip-0.002
 shark-0.002
 Mitch-0.002
 fin-0.002
 lobster-0.002
Token Asked
Token ID26220
Feature activation+1.47e+00

Pos logit contributions
 shark+0.052
 Mitch+0.052
 Flip+0.052
 fin+0.050
 Richie+0.049
Neg logit contributions
packing-0.058
cases-0.058
cakes-0.057
days-0.056
ics-0.055
Token grandma
Token ID49890
Feature activation+4.74e-01

Pos logit contributions
 Flip+0.050
 shark+0.050
 Mitch+0.050
 Rory+0.048
 fin+0.048
Neg logit contributions
cases-0.040
packing-0.040
grand-0.039
cakes-0.039
days-0.038
Token.
Token ID13
Feature activation+5.76e-01

Pos logit contributions
 shark+0.014
 Flip+0.014
 fin+0.013
 Mitch+0.013
 coral+0.013
Neg logit contributions
cases-0.016
any-0.016
days-0.016
cakes-0.015
grand-0.015
Token\n
Token ID198
Feature activation+3.89e-01

Pos logit contributions
 Flip+0.013
 Mitch+0.013
 shark+0.013
 Rory+0.012
 Richie+0.012
Neg logit contributions
cases-0.023
packing-0.023
days-0.022
grand-0.022
cakes-0.022
Token\n
Token ID198
Feature activation+8.05e-01

Pos logit contributions
 Flip+0.009
 shark+0.009
 Mitch+0.009
 fin+0.008
 lobster+0.008
Neg logit contributions
cases-0.015
packing-0.015
cakes-0.015
grand-0.015
days-0.015
TokenKat
Token ID25881
Feature activation-3.70e+00

Pos logit contributions
 Flip+0.021
 Mitch+0.021
Col+0.020
 Prince+0.020
 lobster+0.020
Neg logit contributions
cases-0.028
days-0.027
bows-0.026
packs-0.026
 Kim-0.026
Tokeny
Token ID88
Feature activation-1.33e+00

Pos logit contributions
cases+0.122
cakes+0.120
packing+0.118
grand+0.115
ics+0.113
Neg logit contributions
 shark-0.105
 Mitch-0.099
 fin-0.099
 Flip-0.096
 lobster-0.096
Token.
Token ID13
Feature activation+3.08e-02

Pos logit contributions
 Flip+0.045
 shark+0.044
 Mitch+0.042
 fin+0.041
 lobster+0.040
Neg logit contributions
cases-0.059
grand-0.056
packing-0.055
days-0.055
cakes-0.055
Token\n
Token ID198
Feature activation+2.96e-01

Pos logit contributions
 Flip+0.001
 shark+0.001
 Mitch+0.001
 fin+0.001
 Finn+0.001
Neg logit contributions
cases-0.001
packing-0.001
grand-0.001
cakes-0.001
days-0.001
Token\n
Token ID198
Feature activation+8.61e-01

Pos logit contributions
 shark+0.008
 Flip+0.008
 fin+0.008
 Mitch+0.008
 lobster+0.007
Neg logit contributions
cases-0.011
grand-0.010
packing-0.010
cakes-0.010
ics-0.010
TokenTom
Token ID13787
Feature activation-1.01e+00

Pos logit contributions
 Flip+0.022
 shark+0.021
 Mitch+0.021
 fin+0.020
 Rory+0.020
Neg logit contributions
cases-0.031
days-0.031
cakes-0.030
packing-0.030
packs-0.030
Tokenmy
Token ID1820
Feature activation-1.49e+00

Pos logit contributions
grand+0.025
cakes+0.024
cases+0.024
packing+0.023
days+0.022
Neg logit contributions
 Flip-0.039
 shark-0.038
 fin-0.038
 Mitch-0.038
 Finn-0.036
Token was
Token ID373
Feature activation-8.28e-02

Pos logit contributions
cases+0.040
grand+0.040
cakes+0.039
packing+0.039
days+0.038
Neg logit contributions
 Flip-0.054
 shark-0.054
 fin-0.053
 Mitch-0.052
 Finn-0.050
Token glad
Token ID9675
Feature activation-2.27e-01

Pos logit contributions
cases+0.003
packing+0.003
ics+0.003
cakes+0.003
grand+0.003
Neg logit contributions
 shark-0.002
 Flip-0.002
 Mitch-0.002
 fin-0.002
 lobster-0.002
Token he
Token ID339
Feature activation+1.68e+00

Pos logit contributions
 Kim+0.007
cases+0.007
 Sarah+0.007
grand+0.007
cakes+0.007
Neg logit contributions
 Flip-0.007
 Mitch-0.006
 fin-0.006
 shark-0.006
 lobster-0.006
Token had
Token ID550
Feature activation-6.27e-01

Pos logit contributions
 Flip+0.052
 shark+0.050
 Mitch+0.048
 Finn+0.048
 fin+0.047
Neg logit contributions
packs-0.056
 brushes-0.056
cakes-0.055
grand-0.054
ugs-0.054
Token avoided
Token ID13941
Feature activation+1.19e+00

Pos logit contributions
cases+0.020
cakes+0.019
grand+0.019
days+0.019
bows+0.018
Neg logit contributions
 Flip-0.021
 fin-0.021
 shark-0.020
 Mitch-0.020
 crabs-0.020
Token the
Token ID262
Feature activation-2.56e-01

Pos logit contributions
 shark+0.040
 Flip+0.039
 Mitch+0.039
 fin+0.038
 lobster+0.037
Neg logit contributions
 Kim-0.032
 Sarah-0.031
cases-0.030
 Susan-0.030
packs-0.030
Tokenmy
Token ID1820
Feature activation-1.92e+00

Pos logit contributions
cases+0.115
grand+0.113
packing+0.110
ics+0.108
ital+0.107
Neg logit contributions
 Flip-0.084
 shark-0.083
 Finn-0.080
 him-0.080
 fin-0.078
Token and
Token ID290
Feature activation-6.25e-01

Pos logit contributions
packs+0.060
cases+0.059
days+0.057
cakes+0.057
 brushes+0.056
Neg logit contributions
 Flip-0.060
 shark-0.058
 Mitch-0.056
 Finn-0.056
 fin-0.055
Token Daddy
Token ID26926
Feature activation-3.43e+00

Pos logit contributions
bows+0.027
ics+0.026
cases+0.026
grand+0.026
cakes+0.025
Neg logit contributions
 shark-0.015
 Finn-0.014
 Otto-0.014
 Flip-0.013
 Mitch-0.013
Token gathered
Token ID9272
Feature activation-1.58e+00

Pos logit contributions
grand+0.106
 brushes+0.105
bows+0.102
cakes+0.102
cases+0.101
Neg logit contributions
 Flip-0.106
 shark-0.101
 Finn-0.100
 Otto-0.099
 Mitch-0.099
Token Bobby
Token ID17155
Feature activation-3.96e-01

Pos logit contributions
cases+0.053
cakes+0.052
packing+0.052
grand+0.051
days+0.050
Neg logit contributions
 Flip-0.046
 shark-0.046
 fin-0.044
 Mitch-0.044
 lobster-0.043
Token up
Token ID510
Feature activation+7.91e-01

Pos logit contributions
grand+0.013
cases+0.013
packing+0.013
cakes+0.013
 brushes+0.013
Neg logit contributions
 Flip-0.012
 shark-0.012
 Mitch-0.011
 fin-0.011
 Finn-0.011
Token in
Token ID287
Feature activation+2.02e+00

Pos logit contributions
 shark+0.023
 Flip+0.023
 Mitch+0.022
 fin+0.022
 Finn+0.021
Neg logit contributions
cases-0.026
packing-0.026
grand-0.026
cakes-0.025
days-0.025
Token a
Token ID257
Feature activation+1.03e-01

Pos logit contributions
 shark+0.053
 Flip+0.053
 Mitch+0.051
 fin+0.050
 Finn+0.048
Neg logit contributions
cases-0.075
packing-0.073
grand-0.073
cakes-0.073
ics-0.070
Token big
Token ID1263
Feature activation+1.20e-01

Pos logit contributions
 shark+0.004
 Flip+0.003
 fin+0.003
 Mitch+0.003
 lobster+0.003
Neg logit contributions
cases-0.003
packing-0.003
grand-0.003
cakes-0.003
ics-0.003
Token hug
Token ID16225
Feature activation+2.46e+00

Pos logit contributions
 shark+0.004
 Flip+0.004
 fin+0.004
 Mitch+0.004
 lobster+0.004
Neg logit contributions
cases-0.003
grand-0.003
cakes-0.003
packing-0.003
ics-0.003
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
 shark+0.051
 Flip+0.051
 Mitch+0.047
 fin+0.047
 Finn+0.044
Neg logit contributions
cases-0.102
packing-0.102
grand-0.101
cakes-0.099
ics-0.098
Token and
Token ID290
Feature activation-5.59e-01

Pos logit contributions
grand+0.057
cases+0.056
packing+0.056
cakes+0.055
days+0.054
Neg logit contributions
 shark-0.034
 Flip-0.034
 fin-0.032
 Mitch-0.032
 Finn-0.030
Token climbed
Token ID19952
Feature activation-9.40e-01

Pos logit contributions
ugs+0.018
grand+0.017
cakes+0.017
cases+0.017
packing+0.017
Neg logit contributions
 shark-0.019
 Flip-0.019
 Finn-0.019
 Mitch-0.017
 fin-0.017
Token up
Token ID510
Feature activation+2.37e-01

Pos logit contributions
 Kim+0.018
cakes+0.017
 stairs+0.017
packs+0.016
ics+0.016
Neg logit contributions
ascal-0.041
olo-0.038
 fin-0.038
 shark-0.037
 lobster-0.037
Token a
Token ID257
Feature activation-5.99e-01

Pos logit contributions
 Flip+0.007
 shark+0.007
 Mitch+0.007
 fin+0.007
 lobster+0.007
Neg logit contributions
cases-0.008
cakes-0.008
packing-0.007
grand-0.007
days-0.007
Token tree
Token ID5509
Feature activation-3.05e-01

Pos logit contributions
cases+0.016
cakes+0.016
packing+0.016
grand+0.015
days+0.015
Neg logit contributions
 Flip-0.021
 Mitch-0.020
 shark-0.020
 fin-0.020
 Rory-0.019
Token.
Token ID13
Feature activation-5.22e-01

Pos logit contributions
cases+0.012
packing+0.011
grand+0.011
cakes+0.011
days+0.011
Neg logit contributions
 shark-0.008
 Flip-0.008
 Mitch-0.007
 fin-0.007
 lobster-0.007
Token The
Token ID383
Feature activation-6.95e-01

Pos logit contributions
packs+0.021
ics+0.021
days+0.021
packing+0.021
grand+0.020
Neg logit contributions
 Flip-0.012
 Finn-0.012
 shark-0.012
 Otto-0.011
 Mitch-0.011
Token all
Token ID477
Feature activation-3.56e+00

Pos logit contributions
cases+0.024
cakes+0.024
packing+0.023
ics+0.023
grand+0.023
Neg logit contributions
 Flip-0.019
 shark-0.019
 Mitch-0.019
 fin-0.019
 lobster-0.017
Tokenigator
Token ID23823
Feature activation-1.85e+00

Pos logit contributions
cases+0.168
packing+0.160
grand+0.159
bows+0.158
cakes+0.156
Neg logit contributions
 shark-0.072
 Flip-0.062
 Finn-0.061
 fin-0.057
 Mitch-0.056
Token tried
Token ID3088
Feature activation-5.85e-01

Pos logit contributions
cases+0.048
grand+0.047
packing+0.047
cakes+0.046
days+0.045
Neg logit contributions
 shark-0.068
 Flip-0.068
 Mitch-0.066
 fin-0.066
 Finn-0.063
Token to
Token ID284
Feature activation-6.27e-01

Pos logit contributions
cases+0.010
 Kim+0.010
 Sarah+0.009
ics+0.009
cakes+0.009
Neg logit contributions
 shark-0.025
 Mitch-0.025
 Flip-0.024
 fin-0.024
olo-0.024
Token every
Token ID790
Feature activation-4.27e-01

Pos logit contributions
cases+0.044
ics+0.044
packs+0.043
packing+0.043
 brushes+0.042
Neg logit contributions
 Flip-0.047
 shark-0.047
 Mitch-0.046
 Shark-0.044
 fin-0.044
Token day
Token ID1110
Feature activation-2.30e+00

Pos logit contributions
cases+0.009
grand+0.008
packing+0.008
cakes+0.008
days+0.008
Neg logit contributions
 shark-0.018
 Flip-0.018
 Mitch-0.018
 fin-0.018
 lobster-0.017
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
cases+0.085
packing+0.082
grand+0.082
cakes+0.081
days+0.080
Neg logit contributions
 shark-0.060
 Flip-0.060
 Mitch-0.057
 fin-0.056
 lobster-0.054
Token They
Token ID1119
Feature activation-1.01e-01

Pos logit contributions
grand+0.014
cases+0.014
cakes+0.014
days+0.014
packing+0.014
Neg logit contributions
 shark-0.018
 fin-0.018
 Flip-0.017
 Mitch-0.017
 crabs-0.017
Token were
Token ID547
Feature activation-2.08e+00

Pos logit contributions
cases+0.002
packing+0.002
grand+0.002
ics+0.002
days+0.002
Neg logit contributions
 shark-0.004
 Flip-0.004
 fin-0.004
 Mitch-0.004
 lobster-0.004
Token happy
Token ID3772
Feature activation-1.75e+00

Pos logit contributions
cases+0.078
grand+0.077
packing+0.075
cakes+0.075
days+0.072
Neg logit contributions
 Flip-0.056
 shark-0.054
 Mitch-0.053
 fin-0.051
 Finn-0.050
Token.
Token ID13
Feature activation-8.94e-01

Pos logit contributions
cases+0.058
cakes+0.058
packing+0.057
grand+0.057
days+0.056
Neg logit contributions
 Flip-0.050
 shark-0.049
 Mitch-0.049
 fin-0.048
 lobster-0.046
Token The
Token ID383
Feature activation-7.10e-01

Pos logit contributions
grand+0.023
 Kim+0.023
 Sarah+0.022
days+0.021
packs+0.021
Neg logit contributions
 shark-0.034
 fin-0.033
 crabs-0.032
 Flip-0.032
 lobster-0.032
Token moral
Token ID6573
Feature activation-1.64e+00

Pos logit contributions
cakes+0.024
grand+0.024
cases+0.024
packing+0.023
days+0.023
Neg logit contributions
 Flip-0.020
 Mitch-0.019
 shark-0.019
 fin-0.018
 Rory-0.018
Token of
Token ID286
Feature activation-1.71e+00

Pos logit contributions
grand+0.045
cakes+0.044
cases+0.044
packing+0.043
 brushes+0.042
Neg logit contributions
 Flip-0.060
 Mitch-0.058
 shark-0.058
 fin-0.056
 Finn-0.055
Token the
Token ID262
Feature activation-1.39e-01

Pos logit contributions
grand+0.053
packing+0.052
ics+0.050
days+0.050
cakes+0.049
Neg logit contributions
 Flip-0.053
 shark-0.053
 fin-0.052
 Mitch-0.052
 lobster-0.050
Token or
Token ID393
Feature activation+1.49e+00

Pos logit contributions
 shark+0.004
 Flip+0.004
 fin+0.004
 Mitch+0.004
 lobster+0.004
Neg logit contributions
cases-0.005
grand-0.005
packing-0.005
cakes-0.005
days-0.005
Token package
Token ID5301
Feature activation-1.99e+00

Pos logit contributions
 shark+0.040
 Flip+0.039
 fin+0.039
 Mitch+0.038
 crabs+0.036
Neg logit contributions
packing-0.054
cakes-0.054
cases-0.054
days-0.052
any-0.052
Token in
Token ID287
Feature activation+1.05e+00

Pos logit contributions
packing+0.073
grand+0.071
cases+0.071
cakes+0.071
days+0.070
Neg logit contributions
 shark-0.050
 Flip-0.050
 Mitch-0.047
 fin-0.047
 crabs-0.045
Token it
Token ID340
Feature activation-1.85e-02

Pos logit contributions
 shark+0.026
 Flip+0.025
 fin+0.025
 Mitch+0.025
 Rory+0.023
Neg logit contributions
cases-0.041
cakes-0.041
packing-0.040
grand-0.039
ics-0.038
Token.
Token ID13
Feature activation-2.15e-01

Pos logit contributions
packing+0.001
cases+0.001
cakes+0.001
grand+0.001
any+0.001
Neg logit contributions
 Flip-0.000
 shark-0.000
 Mitch-0.000
 fin-0.000
 Finn-0.000
Token
Token ID220
Feature activation-2.31e+00

Pos logit contributions
cases+0.008
packing+0.008
grand+0.008
ics+0.007
days+0.007
Neg logit contributions
 Flip-0.006
 Mitch-0.006
 shark-0.006
 Finn-0.006
 Richie-0.005
Token
Token ID198
Feature activation-2.29e-02

Pos logit contributions
packing+0.097
cases+0.097
grand+0.096
cakes+0.094
days+0.093
Neg logit contributions
 Flip-0.047
 Mitch-0.044
 shark-0.043
 Finn-0.042
 fin-0.040
Token
Token ID198
Feature activation+5.77e-01

Pos logit contributions
cases+0.001
grand+0.001
cakes+0.001
ics+0.001
packing+0.001
Neg logit contributions
 shark-0.001
 fin-0.001
 Flip-0.001
 Mitch-0.001
 crabs-0.001
TokenJack
Token ID14295
Feature activation-1.34e+00

Pos logit contributions
 shark+0.018
 Flip+0.017
 fin+0.017
 Mitch+0.017
 Rory+0.016
Neg logit contributions
cases-0.019
packing-0.019
cakes-0.019
grand-0.019
days-0.018
Tokeny
Token ID88
Feature activation-1.23e+00

Pos logit contributions
cases+0.041
cakes+0.040
grand+0.039
packing+0.039
ics+0.037
Neg logit contributions
 Flip-0.044
 shark-0.044
 fin-0.043
 Mitch-0.042
 lobster-0.041
Token wanted
Token ID2227
Feature activation-9.16e-02

Pos logit contributions
cakes+0.038
packing+0.037
cases+0.037
grand+0.036
ics+0.036
Neg logit contributions
 shark-0.039
 Flip-0.038
 fin-0.038
 Mitch-0.038
 lobster-0.036
Token to
Token ID284
Feature activation-1.87e+00

Pos logit contributions
cases+0.026
grand+0.026
packing+0.026
cakes+0.026
days+0.025
Neg logit contributions
 Flip-0.031
 shark-0.031
 Mitch-0.030
 fin-0.029
 Finn-0.028
Token read
Token ID1100
Feature activation+7.71e-01

Pos logit contributions
cases+0.053
cakes+0.051
grand+0.051
packing+0.051
days+0.048
Neg logit contributions
 Flip-0.065
 shark-0.065
 Mitch-0.063
 fin-0.063
 lobster-0.060
Token the
Token ID262
Feature activation+7.77e-01

Pos logit contributions
 shark+0.024
 Flip+0.024
 fin+0.023
 Mitch+0.023
 lobster+0.022
Neg logit contributions
cases-0.025
cakes-0.023
 Sarah-0.023
 Kim-0.023
 brushes-0.022
Token whole
Token ID2187
Feature activation-1.26e+00

Pos logit contributions
 Mitch+0.017
 Flip+0.017
 shark+0.017
 fin+0.016
 lobster+0.016
Neg logit contributions
cases-0.032
cakes-0.030
days-0.030
packing-0.030
ics-0.030
Token spell
Token ID4822
Feature activation+3.15e+00

Pos logit contributions
cases+0.042
packing+0.039
ics+0.039
grand+0.038
cakes+0.038
Neg logit contributions
 shark-0.040
 lobster-0.038
 fin-0.038
 Mitch-0.038
 Flip-0.037
Token.
Token ID13
Feature activation-5.54e-01

Pos logit contributions
 shark+0.070
 Flip+0.069
 Mitch+0.066
 fin+0.065
 Richie+0.061
Neg logit contributions
packing-0.126
grand-0.125
cases-0.124
cakes-0.122
days-0.120
Token Everyone
Token ID11075
Feature activation-2.25e+00

Pos logit contributions
cases+0.020
grand+0.019
packing+0.019
cakes+0.019
days+0.018
Neg logit contributions
 shark-0.015
 Flip-0.015
 fin-0.014
 Mitch-0.014
 crabs-0.014
Token in
Token ID287
Feature activation-5.87e-01

Pos logit contributions
cases+0.071
grand+0.069
cakes+0.068
packing+0.065
EEP+0.065
Neg logit contributions
 shark-0.073
 Flip-0.070
 Mitch-0.068
 fin-0.068
 lobster-0.066
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
packs+0.016
days+0.015
 Sarah+0.015
packing+0.015
 Kim+0.015
Neg logit contributions
 Flip-0.020
 shark-0.019
 Mitch-0.019
 Otto-0.018
 fin-0.018
Token class
Token ID1398
Feature activation-1.12e+00

Pos logit contributions
cases+0.029
grand+0.028
packing+0.028
cakes+0.028
days+0.027
Neg logit contributions
 shark-0.037
 Flip-0.037
 Mitch-0.036
 fin-0.036
 lobster-0.035
Token was
Token ID373
Feature activation-6.68e-01

Pos logit contributions
grand+0.038
packing+0.037
burn+0.037
cakes+0.036
any+0.035
Neg logit contributions
 shark-0.028
 Mitch-0.028
 crabs-0.028
 Flip-0.027
 Richie-0.027
Token
Token ID198
Feature activation+5.49e-01

Pos logit contributions
grand+0.004
 Kim+0.004
cases+0.004
ics+0.004
days+0.004
Neg logit contributions
 shark-0.005
 Flip-0.005
 lobster-0.005
 fin-0.005
 Mitch-0.005
Token"
Token ID1
Feature activation-1.07e+00

Pos logit contributions
 shark+0.018
 Flip+0.018
 fin+0.017
 Mitch+0.017
 lobster+0.016
Neg logit contributions
cases-0.017
packing-0.017
grand-0.017
cakes-0.016
days-0.016
TokenWow
Token ID22017
Feature activation-1.36e+00

Pos logit contributions
cases+0.038
grand+0.037
packing+0.037
cakes+0.036
days+0.035
Neg logit contributions
 Flip-0.030
 shark-0.030
 Mitch-0.029
 fin-0.028
 Finn-0.027
Token,
Token ID11
Feature activation+8.95e-01

Pos logit contributions
grand+0.063
any+0.060
cakes+0.060
cases+0.059
packing+0.059
Neg logit contributions
 shark-0.027
 Flip-0.026
 Mitch-0.025
 Finn-0.024
 fin-0.024
Token they
Token ID484
Feature activation+3.36e-01

Pos logit contributions
 shark+0.024
 Flip+0.023
 Mitch+0.022
 crabs+0.022
 Finn+0.022
Neg logit contributions
any-0.036
cases-0.035
cakes-0.034
ugs-0.034
days-0.033
Token are
Token ID389
Feature activation-3.35e-01

Pos logit contributions
 Flip+0.009
 shark+0.009
 Mitch+0.009
 fin+0.009
 crabs+0.009
Neg logit contributions
cases-0.012
any-0.011
cakes-0.011
grand-0.011
packing-0.011
Token so
Token ID523
Feature activation-1.76e+00

Pos logit contributions
cases+0.011
cakes+0.011
grand+0.011
packing+0.011
ics+0.011
Neg logit contributions
 shark-0.010
 Flip-0.010
 fin-0.009
 Mitch-0.009
 lobster-0.009
Token nice
Token ID3621
Feature activation-1.46e+00

Pos logit contributions
any+0.063
cases+0.062
packing+0.061
grand+0.060
packs+0.060
Neg logit contributions
 shark-0.050
 Flip-0.049
 Mitch-0.048
 crabs-0.048
 coral-0.047
Token,"
Token ID553
Feature activation+2.34e+00

Pos logit contributions
packing+0.048
cases+0.048
grand+0.047
days+0.045
any+0.045
Neg logit contributions
 shark-0.046
 Flip-0.045
 Mitch-0.042
 crabs-0.042
 fin-0.041
Token Tom
Token ID4186
Feature activation-1.12e-01

Pos logit contributions
 Flip+0.078
 shark+0.077
 fin+0.076
 Mitch+0.074
 lobster+0.073
Neg logit contributions
cases-0.066
grand-0.065
cakes-0.065
 brushes-0.064
 Kim-0.063
Token said
Token ID531
Feature activation-9.13e-01

Pos logit contributions
cases+0.003
packing+0.003
grand+0.003
cakes+0.003
days+0.003
Neg logit contributions
 Flip-0.004
 shark-0.004
 Mitch-0.004
 fin-0.004
 Finn-0.004
Token away
Token ID1497
Feature activation+1.57e-01

Pos logit contributions
cases+0.039
packing+0.038
grand+0.038
cakes+0.038
days+0.037
Neg logit contributions
 Flip-0.043
 shark-0.042
 Mitch-0.041
 fin-0.041
 Finn-0.039
Token.
Token ID13
Feature activation-6.35e-01

Pos logit contributions
 Flip+0.004
 shark+0.004
 Mitch+0.004
 fin+0.004
 lobster+0.003
Neg logit contributions
cases-0.006
grand-0.006
packing-0.006
cakes-0.006
days-0.006
Token
Token ID198
Feature activation-1.88e-01

Pos logit contributions
cases+0.020
grand+0.020
packing+0.020
cakes+0.020
 Kim+0.019
Neg logit contributions
 shark-0.020
 fin-0.019
 Flip-0.018
 crabs-0.018
 Mitch-0.018
Token
Token ID198
Feature activation+4.29e-01

Pos logit contributions
grand+0.005
cakes+0.005
ics+0.005
 Kim+0.005
packing+0.005
Neg logit contributions
 shark-0.007
 fin-0.007
 crabs-0.006
 lobster-0.006
 coral-0.006
TokenThe
Token ID464
Feature activation-5.84e-01

Pos logit contributions
 shark+0.014
 fin+0.013
 Mitch+0.013
 Flip+0.012
 lobster+0.012
Neg logit contributions
grand-0.014
packing-0.014
cakes-0.014
cases-0.013
ics-0.013
Token little
Token ID1310
Feature activation-3.51e+00

Pos logit contributions
cases+0.016
cakes+0.016
packing+0.015
grand+0.015
days+0.015
Neg logit contributions
 Flip-0.021
 Mitch-0.020
 fin-0.020
 shark-0.020
 Rory-0.019
Token boy
Token ID2933
Feature activation-1.05e+00

Pos logit contributions
cases+0.090
grand+0.087
packing+0.085
cakes+0.083
ics+0.081
Neg logit contributions
 shark-0.138
 Flip-0.135
 Mitch-0.130
 fin-0.130
 lobster-0.127
Token thanked
Token ID26280
Feature activation+2.88e-01

Pos logit contributions
cases+0.029
packing+0.029
grand+0.028
cakes+0.028
days+0.027
Neg logit contributions
 Flip-0.037
 shark-0.037
 Mitch-0.036
 fin-0.035
 Finn-0.034
Token the
Token ID262
Feature activation+1.10e+00

Pos logit contributions
 fin+0.011
 shark+0.010
 Flip+0.010
 lobster+0.010
 Mitch+0.010
Neg logit contributions
 Kim-0.007
 Sarah-0.007
grand-0.006
 Su-0.006
 Susan-0.006
Token baker
Token ID46412
Feature activation+2.20e+00

Pos logit contributions
 Flip+0.038
 Mitch+0.037
 fin+0.036
 Rory+0.035
 Finn+0.034
Neg logit contributions
cakes-0.032
cases-0.030
grand-0.030
packing-0.029
 baked-0.029
Token for
Token ID329
Feature activation+1.83e+00

Pos logit contributions
 shark+0.074
 Flip+0.073
 fin+0.072
 Mitch+0.071
 crabs+0.068
Neg logit contributions
cases-0.063
packing-0.061
cakes-0.060
grand-0.059
days-0.058
Token help
Token ID1037
Feature activation+5.31e-01

Pos logit contributions
 Flip+0.047
 shark+0.046
 Mitch+0.045
 fin+0.044
 lobster+0.043
Neg logit contributions
cases-0.042
grand-0.042
packing-0.041
cakes-0.040
days-0.040
Token and
Token ID290
Feature activation+3.75e-01

Pos logit contributions
 Flip+0.016
 shark+0.016
 Mitch+0.015
 fin+0.015
 lobster+0.015
Neg logit contributions
cases-0.017
grand-0.017
days-0.017
cakes-0.017
packing-0.017
Token learn
Token ID2193
Feature activation-4.78e-01

Pos logit contributions
 shark+0.012
 fin+0.012
 Flip+0.012
 Mitch+0.011
 crabs+0.011
Neg logit contributions
cases-0.012
cakes-0.012
grand-0.011
packing-0.011
ics-0.011
Token more
Token ID517
Feature activation+1.65e+00

Pos logit contributions
cases+0.015
grand+0.015
cakes+0.015
 brushes+0.015
packs+0.014
Neg logit contributions
 Flip-0.015
 Mitch-0.015
 shark-0.015
 fin-0.014
 lobster-0.014
Token.
Token ID13
Feature activation-9.17e-01

Pos logit contributions
 shark+0.047
 Flip+0.047
 Mitch+0.044
 fin+0.044
 Finn+0.042
Neg logit contributions
cases-0.057
packing-0.056
grand-0.056
cakes-0.056
days-0.054
Token There
Token ID1318
Feature activation-3.92e+00

Pos logit contributions
grand+0.024
packing+0.023
cases+0.023
days+0.022
cakes+0.022
Neg logit contributions
 shark-0.034
 Flip-0.033
 fin-0.032
 Mitch-0.032
 lobster-0.032
Token is
Token ID318
Feature activation+2.34e-02

Pos logit contributions
packing+0.143
any+0.140
cases+0.139
cakes+0.139
grand+0.137
Neg logit contributions
 shark-0.102
 Flip-0.098
 Mitch-0.098
 crabs-0.097
 fin-0.094
Token no
Token ID645
Feature activation-3.00e-02

Pos logit contributions
 shark+0.001
 Flip+0.001
 fin+0.001
 Mitch+0.001
 lobster+0.001
Neg logit contributions
cases-0.001
 brushes-0.001
grand-0.001
packs-0.001
days-0.001
Token need
Token ID761
Feature activation-1.72e+00

Pos logit contributions
cases+0.001
packing+0.001
grand+0.001
cakes+0.001
ics+0.001
Neg logit contributions
 shark-0.001
 Flip-0.001
 fin-0.001
 Mitch-0.001
 lobster-0.001
Token to
Token ID284
Feature activation-1.91e+00

Pos logit contributions
cases+0.049
grand+0.048
packing+0.047
cakes+0.047
days+0.046
Neg logit contributions
 Flip-0.059
 shark-0.058
 Mitch-0.057
 fin-0.056
 lobster-0.054
Token be
Token ID307
Feature activation-7.43e-01

Pos logit contributions
cases+0.050
packing+0.048
grand+0.048
cakes+0.048
days+0.046
Neg logit contributions
 shark-0.070
 Flip-0.070
 Mitch-0.068
 fin-0.067
 lobster-0.065
Token Sam
Token ID3409
Feature activation-1.20e-01

Pos logit contributions
 Flip+0.018
 shark+0.018
 Mitch+0.017
 fin+0.017
 Finn+0.017
Neg logit contributions
cases-0.025
packing-0.024
grand-0.024
cakes-0.024
ics-0.024
Token!".
Token ID48220
Feature activation+2.95e-01

Pos logit contributions
cases+0.005
packing+0.005
cakes+0.005
grand+0.005
 brushes+0.004
Neg logit contributions
 Flip-0.003
 shark-0.003
 Mitch-0.003
 fin-0.003
 Finn-0.003
Token Sam
Token ID3409
Feature activation-1.00e+00

Pos logit contributions
 Flip+0.008
 shark+0.008
 Mitch+0.008
 fin+0.008
 Finn+0.007
Neg logit contributions
cases-0.011
packing-0.010
grand-0.010
cakes-0.010
days-0.010
Token wasn
Token ID2492
Feature activation-5.18e-01

Pos logit contributions
cases+0.027
grand+0.027
packing+0.027
cakes+0.026
days+0.025
Neg logit contributions
 Flip-0.036
 shark-0.035
 Mitch-0.034
 fin-0.034
 Finn-0.033
Token't
Token ID470
Feature activation-1.10e+00

Pos logit contributions
cases+0.011
cakes+0.010
ics+0.010
packing+0.010
grand+0.010
Neg logit contributions
 shark-0.022
 fin-0.022
 Flip-0.021
 coral-0.021
 Mitch-0.021
Token going
Token ID1016
Feature activation-3.48e+00

Pos logit contributions
cakes+0.038
cases+0.038
grand+0.036
packing+0.036
 brushes+0.035
Neg logit contributions
 Flip-0.033
 Mitch-0.031
 shark-0.031
 fin-0.031
 Rory-0.031
Token to
Token ID284
Feature activation-1.53e+00

Pos logit contributions
cases+0.089
cakes+0.083
grand+0.082
packing+0.081
ics+0.078
Neg logit contributions
 Flip-0.132
 shark-0.131
 Mitch-0.129
 fin-0.127
 Finn-0.123
Token give
Token ID1577
Feature activation+7.83e-01

Pos logit contributions
cases+0.036
grand+0.036
packing+0.035
cakes+0.035
days+0.034
Neg logit contributions
 shark-0.059
 Flip-0.059
 Mitch-0.057
 fin-0.057
 Finn-0.055
Token up
Token ID510
Feature activation+3.46e-01

Pos logit contributions
 shark+0.029
 Flip+0.029
 fin+0.028
 Mitch+0.028
 lobster+0.027
Neg logit contributions
cases-0.020
cakes-0.019
grand-0.018
 Kim-0.018
packing-0.018
Token.
Token ID13
Feature activation+2.81e-01

Pos logit contributions
 shark+0.009
 Flip+0.009
 Mitch+0.009
 fin+0.009
 lobster+0.009
Neg logit contributions
cases-0.012
grand-0.012
cakes-0.012
packing-0.012
days-0.011
Token She
Token ID1375
Feature activation+1.23e+00

Pos logit contributions
 shark+0.007
 Flip+0.007
 fin+0.007
 Mitch+0.007
 lobster+0.007
Neg logit contributions
cases-0.010
cakes-0.010
grand-0.010
packing-0.010
days-0.010
Token In
Token ID554
Feature activation-4.01e-01

Pos logit contributions
cases+0.016
packing+0.016
grand+0.016
cakes+0.015
days+0.015
Neg logit contributions
 Flip-0.011
 shark-0.011
 Mitch-0.011
 fin-0.010
 Finn-0.010
Token the
Token ID262
Feature activation+1.10e-01

Pos logit contributions
 Kim+0.011
 Sarah+0.010
packing+0.010
packs+0.010
 brushes+0.010
Neg logit contributions
 Flip-0.013
ascal-0.013
 crabs-0.013
 lobster-0.013
 shark-0.013
Token end
Token ID886
Feature activation+2.36e+00

Pos logit contributions
 Flip+0.002
 shark+0.002
 Mitch+0.002
 fin+0.002
 Rory+0.002
Neg logit contributions
cases-0.005
cakes-0.005
packing-0.005
grand-0.005
 brushes-0.005
Token,
Token ID11
Feature activation-1.90e-02

Pos logit contributions
 fin+0.077
 Flip+0.075
 shark+0.074
 Mitch+0.074
 lobster+0.071
Neg logit contributions
cases-0.070
cakes-0.069
 Kim-0.067
days-0.067
packing-0.065
Token they
Token ID484
Feature activation-4.45e-01

Pos logit contributions
cases+0.001
grand+0.001
packing+0.001
cakes+0.001
days+0.001
Neg logit contributions
 shark-0.001
 Flip-0.001
 Mitch-0.001
 fin-0.001
 Finn-0.001
Token all
Token ID477
Feature activation-3.79e+00

Pos logit contributions
cases+0.010
packing+0.010
 baked+0.010
cakes+0.010
ics+0.010
Neg logit contributions
 Flip-0.017
 shark-0.017
 Mitch-0.017
 fin-0.017
 Rory-0.016
Token had
Token ID550
Feature activation-1.61e+00

Pos logit contributions
cases+0.113
cakes+0.106
grand+0.105
packing+0.101
ugs+0.098
Neg logit contributions
 shark-0.134
 Flip-0.129
 Finn-0.127
 fin-0.127
 Mitch-0.125
Token a
Token ID257
Feature activation-2.43e+00

Pos logit contributions
cases+0.050
grand+0.049
packing+0.049
cakes+0.048
days+0.047
Neg logit contributions
 Flip-0.051
 shark-0.051
 Mitch-0.049
 fin-0.049
 Finn-0.047
Token great
Token ID1049
Feature activation-7.97e-02

Pos logit contributions
cases+0.065
packing+0.065
cakes+0.063
grand+0.063
days+0.062
Neg logit contributions
 Flip-0.086
 Mitch-0.084
 shark-0.084
 fin-0.081
 Rory-0.079
Token time
Token ID640
Feature activation-1.73e+00

Pos logit contributions
cases+0.002
grand+0.002
packing+0.002
cakes+0.002
days+0.002
Neg logit contributions
 shark-0.003
 Flip-0.003
 Mitch-0.002
 fin-0.002
 Finn-0.002
Token playing
Token ID2712
Feature activation-9.43e-01

Pos logit contributions
cases+0.058
packing+0.056
grand+0.055
cakes+0.055
days+0.055
Neg logit contributions
 Flip-0.051
 shark-0.051
 Mitch-0.050
 fin-0.049
 lobster-0.047
Token all
Token ID477
Feature activation-2.74e+00

Pos logit contributions
 shark+0.021
 Flip+0.021
 fin+0.020
 Mitch+0.019
 crabs+0.019
Neg logit contributions
grand-0.039
packing-0.039
cases-0.038
cakes-0.038
 brushes-0.036
Token over
Token ID625
Feature activation-2.08e+00

Pos logit contributions
cases+0.110
EEP+0.105
packing+0.105
grand+0.104
cakes+0.102
Neg logit contributions
 shark-0.057
 like-0.056
 Flip-0.054
 coral-0.053
 Finn-0.052
Token the
Token ID262
Feature activation+4.82e-01

Pos logit contributions
cases+0.067
grand+0.066
packing+0.065
cakes+0.065
days+0.065
Neg logit contributions
 Flip-0.063
 shark-0.063
 Mitch-0.060
 fin-0.060
 lobster-0.057
Token place
Token ID1295
Feature activation+9.89e-01

Pos logit contributions
 shark+0.021
 Flip+0.020
 Mitch+0.020
 fin+0.020
 lobster+0.019
Neg logit contributions
cases-0.010
packing-0.009
grand-0.009
cakes-0.009
days-0.009
Token.
Token ID13
Feature activation-4.59e-01

Pos logit contributions
 shark+0.015
 Flip+0.014
 fin+0.014
 Mitch+0.014
 crabs+0.013
Neg logit contributions
packing-0.045
grand-0.044
cases-0.043
EEP-0.042
cakes-0.042
Token Ag
Token ID2449
Feature activation-5.00e+00

Pos logit contributions
packing+0.015
cases+0.015
grand+0.014
cakes+0.014
ics+0.014
Neg logit contributions
 Flip-0.014
 shark-0.013
 Mitch-0.013
 Rory-0.013
 Finn-0.013
Tokennes
Token ID2516
Feature activation-3.50e+00

Pos logit contributions
 brushes+0.120
packs+0.112
 Kim+0.112
cakes+0.110
 baked+0.109
Neg logit contributions
 Flip-0.180
 Rory-0.178
 Richie-0.175
 Mitch-0.172
 Finn-0.171
Token said
Token ID531
Feature activation-5.55e-01

Pos logit contributions
cakes+0.115
cases+0.114
packing+0.112
bows+0.112
grand+0.111
Neg logit contributions
 fin-0.094
 shark-0.093
 Mitch-0.092
 Finn-0.090
 Flip-0.090
Token,
Token ID11
Feature activation+7.99e-01

Pos logit contributions
cases+0.019
grand+0.018
packing+0.018
cakes+0.018
days+0.018
Neg logit contributions
 Flip-0.016
 shark-0.016
 Mitch-0.015
 fin-0.015
 lobster-0.015
Token
Token ID6184
Feature activation+2.38e-01

Pos logit contributions
 shark+0.027
 Flip+0.027
 Mitch+0.026
 fin+0.026
 lobster+0.025
Neg logit contributions
cases-0.023
packing-0.022
grand-0.022
cakes-0.022
days-0.021
Token
Token ID95
Feature activation+5.62e-01

Pos logit contributions
 Flip+0.008
 shark+0.008
 Mitch+0.007
 fin+0.007
 Finn+0.007
Neg logit contributions
cases-0.007
grand-0.007
packing-0.007
cakes-0.007
days-0.007