TOP ACTIVATIONS
MAX = 0.046

generous . She broke the pastry in two little pieces ,
to look at the or nam ents on the tree .
who liked to collect or nam ents . Everywhere she went
outside . Lily took her pastry and went to the park
away . They dropped their pastry on the ground . The
the dog and shared her pastry with him . The dog
now . She drank her grape juice and forgot about the
a tree with shiny or nam ents hanging from its branches
and tried to grab their pastry . Lily and Tom screamed
a knife and cut the pastry . He gave Lily a
bitter and made her face scr unch up .
smiled as he held his pastry in his hands .
be able to share her pastry with the bird . <|endoftext|>
went she searched for or nam ents , and the more
to help sort the or nam ents . She picked up
Tim my accidentally spilled his milk on the table . Lily
time playing with her or nam ents and making her special
and play with your or nam ents . Maybe you can
. It makes her face scr unch up . She sp
a box of pretty or nam ents that she wanted to

MIN ACTIVATIONS
MIN = -0.050

to buy them ". Little Cornel ius smiled and agreed .
We are still heroes and villains !" They ran
then walking away . Little Cornel ius was so proud that
wall . Far ley watched as his mom
anyone . I treated the poor man very badly . For
" The Mystery of the Bur ied Bone ". She showed
pretend they were heroes and villains . Ben had a stick
?" The voice belonged to G iz mo 's friend ,
pretend they were superheroes and villains . Sara was Wonder Woman
let 's see what it does ," Tom said . They
driver and Anna was the navig ator . They put on
was a z ebra named Stri pe y . He was
where the arrow had come from . She saw a smiling
leaves look tasty ," said Stri pe y .
keep going ." Max and Scot ty kept walking , but
wonders what dinosaur it came from and how it lived .
and helping each other ." Big gie and his friends listened
finally left the shop , Cornel ius was feeling much better
cool . Far ley was so happy to
she got rid of the intr uders . She did not

INTERVAL 0.022 - 0.046
CONTAINS 0.255%

! The moral of the story is to not
happy and they lived happily ever after . The end .
the song . From that day on , Lily and her
to play with Lily 's skirt too . He wanted to
. Gina scoop ed a handful of the pasta into her

INTERVAL -0.002 - 0.022
CONTAINS 61.141%

family was so happy that it had gone . The sky
saw it shining in the rain as if it was made
his four friends , set off for their jog . It
cooked all sorts of cold food in the big silver pot
they were bored . They looked out the window , but

INTERVAL -0.026 - -0.002
CONTAINS 38.520%

, they were all spark ly and clean again ! Mom
was so proud of his work . Then , the distant
ma said , giving L ola a big hug .
But you have to write on the right things . You
can use stickers , paint , or glitter . You can

INTERVAL -0.050 - -0.026
CONTAINS 0.084%

word they shared - ' sp oil ! '. <|endoftext|> One
uncle Tom . Sh r ink me , shrink me !"
tough enough to explore the mysterious room . She was very
What are you doing , Milo ?" asked Lucy .
a family , a spaceship crew and princess es in a
Token generous
Token ID14431
Feature activation+7.34e-03

Pos logit contributions
 gloss+0.006
ifying+0.005
 licking+0.005
 chewing+0.005
 spaghetti+0.005
Neg logit contributions
an-0.005
 island-0.005
 captain-0.004
 idea-0.004
 pilot-0.004
Token.
Token ID13
Feature activation-2.85e-03

Pos logit contributions
 gloss+0.014
 licking+0.013
ifying+0.013
 chewing+0.013
 spaghetti+0.012
Neg logit contributions
an-0.011
 island-0.011
 captain-0.010
 idea-0.010
 pilot-0.010
Token She
Token ID1375
Feature activation-1.68e-04

Pos logit contributions
an+0.006
 idea+0.006
 island+0.005
 man+0.005
 captain+0.005
Neg logit contributions
 gloss-0.004
ifying-0.003
 licking-0.003
 spaghetti-0.003
 chewing-0.003
Token broke
Token ID6265
Feature activation+1.12e-02

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 idea+0.000
 pilot+0.000
Neg logit contributions
 gloss-0.000
 licking-0.000
 chewing-0.000
 Mitt-0.000
 spaghetti-0.000
Token the
Token ID262
Feature activation+5.74e-03

Pos logit contributions
 gloss+0.021
 licking+0.020
 chewing+0.020
ifying+0.019
 spaghetti+0.019
Neg logit contributions
an-0.017
 island-0.017
 captain-0.016
 idea-0.015
 pilot-0.015
Token pastry
Token ID41738
Feature activation+4.63e-02

Pos logit contributions
 gloss+0.006
 licking+0.006
 chewing+0.006
ifying+0.006
 spaghetti+0.005
Neg logit contributions
an-0.013
 island-0.013
 captain-0.013
 pilot-0.012
 idea-0.012
Token in
Token ID287
Feature activation+4.54e-03

Pos logit contributions
 gloss+0.098
 licking+0.094
ifying+0.094
 chewing+0.093
 Midnight+0.090
Neg logit contributions
 island-0.056
an-0.055
 idea-0.051
 man-0.049
 captain-0.045
Token two
Token ID734
Feature activation+1.39e-02

Pos logit contributions
 gloss+0.007
 licking+0.007
 chewing+0.007
ifying+0.006
 spaghetti+0.006
Neg logit contributions
an-0.009
 island-0.008
 captain-0.008
 idea-0.008
 pilot-0.008
Token little
Token ID1310
Feature activation+7.10e-03

Pos logit contributions
 gloss+0.026
 licking+0.024
 Mitt+0.024
ifying+0.023
 chewing+0.023
Neg logit contributions
an-0.022
 island-0.021
 idea-0.020
 captain-0.019
 pilot-0.019
Token pieces
Token ID5207
Feature activation-1.56e-03

Pos logit contributions
 gloss+0.014
 licking+0.014
ifying+0.013
 chewing+0.013
 Mitt+0.013
Neg logit contributions
an-0.010
 island-0.010
 idea-0.009
 captain-0.009
 pilot-0.009
Token,
Token ID11
Feature activation-1.57e-03

Pos logit contributions
an+0.003
 island+0.002
 captain+0.002
 idea+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
ifying-0.003
 spaghetti-0.003
Token to
Token ID284
Feature activation-2.03e-02

Pos logit contributions
 gloss+0.024
ifying+0.023
 licking+0.021
 chewing+0.020
 Mitt+0.019
Neg logit contributions
 island-0.027
an-0.026
 captain-0.026
 pilot-0.025
 anchor-0.025
Token look
Token ID804
Feature activation+1.27e-02

Pos logit contributions
an+0.036
 island+0.035
 captain+0.033
 idea+0.033
 pilot+0.032
Neg logit contributions
 gloss-0.034
 licking-0.032
 chewing-0.031
ifying-0.031
 spaghetti-0.030
Token at
Token ID379
Feature activation-8.98e-03

Pos logit contributions
 gloss+0.020
 spaghetti+0.019
 licking+0.019
 chewing+0.019
ifying+0.019
Neg logit contributions
an-0.023
 island-0.023
 captain-0.021
 pilot-0.021
 idea-0.020
Token the
Token ID262
Feature activation+1.31e-02

Pos logit contributions
an+0.015
 island+0.014
 captain+0.014
 idea+0.013
 pilot+0.013
Neg logit contributions
 gloss-0.016
 licking-0.015
 chewing-0.015
 spaghetti-0.015
ifying-0.014
Token or
Token ID393
Feature activation+6.74e-03

Pos logit contributions
 gloss+0.026
 licking+0.025
 chewing+0.025
ifying+0.024
 spaghetti+0.024
Neg logit contributions
an-0.019
 island-0.018
 captain-0.017
 idea-0.016
 pilot-0.016
Tokennam
Token ID7402
Feature activation+4.60e-02

Pos logit contributions
 gloss+0.012
umbs+0.011
 By+0.011
ives+0.011
 Mitt+0.011
Neg logit contributions
 island-0.011
 captain-0.010
an-0.010
 pilot-0.010
 man-0.010
Tokenents
Token ID658
Feature activation-1.30e-02

Pos logit contributions
 gloss+0.063
 licking+0.059
 Mitt+0.059
 Midnight+0.057
 spaghetti+0.057
Neg logit contributions
an-0.098
 island-0.094
 captain-0.089
 pilot-0.088
 idea-0.082
Token on
Token ID319
Feature activation-3.08e-03

Pos logit contributions
an+0.023
 island+0.022
 captain+0.021
 idea+0.020
 pilot+0.020
Neg logit contributions
 gloss-0.021
 licking-0.021
 chewing-0.020
ifying-0.020
 spaghetti-0.019
Token the
Token ID262
Feature activation+8.35e-03

Pos logit contributions
an+0.005
 island+0.005
 captain+0.005
 idea+0.005
 pilot+0.005
Neg logit contributions
 gloss-0.005
 licking-0.005
 chewing-0.005
 spaghetti-0.005
ifying-0.005
Token tree
Token ID5509
Feature activation+5.26e-03

Pos logit contributions
 gloss+0.015
 licking+0.015
 chewing+0.014
ifying+0.014
 spaghetti+0.014
Neg logit contributions
an-0.014
 island-0.013
 captain-0.012
 pilot-0.012
 idea-0.012
Token.
Token ID13
Feature activation-2.21e-03

Pos logit contributions
 gloss+0.010
 licking+0.010
 Mitt+0.010
 chewing+0.010
ifying+0.009
Neg logit contributions
 island-0.008
an-0.008
 idea-0.007
 captain-0.007
 pilot-0.007
Token who
Token ID508
Feature activation+9.01e-03

Pos logit contributions
 gloss+0.026
ifying+0.025
 spaghetti+0.023
umbs+0.023
 licking+0.023
Neg logit contributions
an-0.022
 island-0.021
 pilot-0.019
 captain-0.019
 idea-0.017
Token liked
Token ID8288
Feature activation+8.07e-03

Pos logit contributions
 gloss+0.018
 licking+0.017
ifying+0.017
 chewing+0.017
 spaghetti+0.016
Neg logit contributions
an-0.013
 idea-0.012
 island-0.012
 pilot-0.011
 captain-0.011
Token to
Token ID284
Feature activation-1.24e-02

Pos logit contributions
ifying+0.012
 gloss+0.012
aling+0.011
 licking+0.010
 Plus+0.010
Neg logit contributions
 island-0.016
 pilot-0.015
an-0.014
 captain-0.014
 man-0.014
Token collect
Token ID2824
Feature activation+1.93e-03

Pos logit contributions
 island+0.022
an+0.022
 idea+0.021
 captain+0.021
 pilot+0.020
Neg logit contributions
 gloss-0.020
ifying-0.020
 licking-0.020
 chewing-0.019
 Mitt-0.018
Token or
Token ID393
Feature activation-4.61e-03

Pos logit contributions
 gloss+0.004
 spaghetti+0.004
 chewing+0.004
 licking+0.004
 Mitt+0.004
Neg logit contributions
an-0.003
 captain-0.003
 pilot-0.002
tha-0.002
 island-0.002
Tokennam
Token ID7402
Feature activation+4.55e-02

Pos logit contributions
 island+0.008
an+0.008
 captain+0.008
 idea+0.008
 pilot+0.008
Neg logit contributions
 gloss-0.008
 licking-0.007
ifying-0.007
 chewing-0.007
 Mitt-0.007
Tokenents
Token ID658
Feature activation-1.46e-02

Pos logit contributions
 gloss+0.055
 licking+0.052
 chewing+0.049
 spaghetti+0.049
 Mitt+0.049
Neg logit contributions
an-0.102
 island-0.099
 captain-0.094
 pilot-0.093
 idea-0.091
Token.
Token ID13
Feature activation-5.42e-03

Pos logit contributions
an+0.028
 island+0.026
 captain+0.026
 pilot+0.025
 idea+0.025
Neg logit contributions
 gloss-0.022
 licking-0.021
 chewing-0.021
 eating-0.020
ifying-0.020
Token Everywhere
Token ID48201
Feature activation-1.54e-02

Pos logit contributions
an+0.013
 island+0.013
 idea+0.012
 captain+0.012
 pilot+0.012
Neg logit contributions
 gloss-0.006
 licking-0.005
 chewing-0.005
ifying-0.005
 spaghetti-0.005
Token she
Token ID673
Feature activation+1.25e-02

Pos logit contributions
an+0.039
 island+0.037
 captain+0.037
 idea+0.035
 anchor+0.035
Neg logit contributions
 gloss-0.015
 licking-0.013
 chewing-0.013
 Mitt-0.013
 spaghetti-0.013
Token went
Token ID1816
Feature activation-1.83e-03

Pos logit contributions
 gloss+0.017
 chewing+0.016
 spaghetti+0.015
 licking+0.015
 sauce+0.015
Neg logit contributions
an-0.025
 captain-0.024
 island-0.023
 pilot-0.022
 route-0.022
Token outside
Token ID2354
Feature activation+4.55e-03

Pos logit contributions
an+0.006
 island+0.006
 idea+0.005
 captain+0.005
 pilot+0.005
Neg logit contributions
 gloss-0.008
 licking-0.008
 chewing-0.008
ifying-0.008
 Mitt-0.007
Token.
Token ID13
Feature activation+1.98e-03

Pos logit contributions
 gloss+0.009
 licking+0.009
 chewing+0.008
ifying+0.008
 Mitt+0.008
Neg logit contributions
 island-0.006
an-0.006
 idea-0.006
 captain-0.006
 pilot-0.006
Token Lily
Token ID20037
Feature activation+6.70e-03

Pos logit contributions
 licking+0.003
ifying+0.003
 chewing+0.003
 gloss+0.003
 exercising+0.003
Neg logit contributions
 idea-0.003
an-0.003
 island-0.003
 man-0.003
 girl-0.003
Token took
Token ID1718
Feature activation-6.86e-03

Pos logit contributions
 gloss+0.011
 licking+0.011
 chewing+0.011
 spaghetti+0.010
 Mitt+0.010
Neg logit contributions
an-0.012
 island-0.011
 captain-0.011
 idea-0.010
 pilot-0.010
Token her
Token ID607
Feature activation+7.89e-03

Pos logit contributions
an+0.013
 island+0.013
 captain+0.012
 pilot+0.012
 idea+0.012
Neg logit contributions
 gloss-0.010
 licking-0.010
 Mitt-0.010
 chewing-0.010
 spaghetti-0.010
Token pastry
Token ID41738
Feature activation+4.51e-02

Pos logit contributions
 gloss+0.011
 licking+0.010
 chewing+0.010
ifying+0.010
 spaghetti+0.010
Neg logit contributions
an-0.016
 island-0.016
 captain-0.015
 idea-0.015
 pilot-0.014
Token and
Token ID290
Feature activation-6.54e-03

Pos logit contributions
 gloss+0.083
 licking+0.081
 chewing+0.079
ifying+0.078
 exercising+0.074
Neg logit contributions
an-0.071
 island-0.071
 captain-0.064
 idea-0.063
 pilot-0.061
Token went
Token ID1816
Feature activation+5.70e-03

Pos logit contributions
an+0.012
 island+0.011
 captain+0.011
 idea+0.010
 pilot+0.010
Neg logit contributions
 gloss-0.011
 licking-0.011
 chewing-0.010
 spaghetti-0.010
 Mitt-0.010
Token to
Token ID284
Feature activation+9.26e-03

Pos logit contributions
 gloss+0.011
ifying+0.011
pill+0.010
 licking+0.010
 subt+0.010
Neg logit contributions
 island-0.008
an-0.008
 captain-0.008
 man-0.007
 idea-0.007
Token the
Token ID262
Feature activation+3.12e-03

Pos logit contributions
 gloss+0.016
ifying+0.015
 licking+0.014
 chewing+0.014
heddar+0.013
Neg logit contributions
 island-0.016
an-0.016
 idea-0.015
 captain-0.015
 pilot-0.015
Token park
Token ID3952
Feature activation+6.21e-03

Pos logit contributions
 gloss+0.007
 licking+0.006
ifying+0.006
 chewing+0.006
aling+0.006
Neg logit contributions
 island-0.004
an-0.004
 man-0.004
 pilot-0.004
 captain-0.003
Token away
Token ID1497
Feature activation-6.28e-03

Pos logit contributions
 gloss+0.012
 licking+0.011
ifying+0.011
 chewing+0.011
 spaghetti+0.010
Neg logit contributions
an-0.009
 island-0.008
 captain-0.008
 idea-0.008
 pilot-0.007
Token.
Token ID13
Feature activation-5.11e-03

Pos logit contributions
an+0.011
 island+0.011
 captain+0.010
 pilot+0.010
 idea+0.010
Neg logit contributions
 gloss-0.010
 licking-0.010
 chewing-0.010
ifying-0.009
 Mitt-0.009
Token They
Token ID1119
Feature activation-4.33e-03

Pos logit contributions
an+0.009
 island+0.009
 captain+0.009
 idea+0.009
 man+0.008
Neg logit contributions
 gloss-0.008
 chewing-0.007
 spaghetti-0.007
 licking-0.007
ifying-0.007
Token dropped
Token ID5710
Feature activation+5.49e-03

Pos logit contributions
an+0.008
 island+0.007
 captain+0.007
 idea+0.007
 pilot+0.007
Neg logit contributions
 gloss-0.007
 licking-0.007
 chewing-0.007
ifying-0.007
 Mitt-0.007
Token their
Token ID511
Feature activation+1.32e-02

Pos logit contributions
 gloss+0.011
 licking+0.010
 chewing+0.010
ifying+0.010
 spaghetti+0.010
Neg logit contributions
an-0.008
 island-0.008
 captain-0.007
 idea-0.007
 pilot-0.007
Token pastry
Token ID41738
Feature activation+4.47e-02

Pos logit contributions
 gloss+0.016
 licking+0.016
 chewing+0.015
 spaghetti+0.014
ifying+0.014
Neg logit contributions
an-0.029
 island-0.028
 captain-0.027
 pilot-0.027
 idea-0.026
Token on
Token ID319
Feature activation+4.35e-04

Pos logit contributions
 gloss+0.094
ifying+0.088
 licking+0.087
 Mitt+0.087
 chewing+0.086
Neg logit contributions
 island-0.058
an-0.057
 idea-0.051
 captain-0.049
 man-0.048
Token the
Token ID262
Feature activation+9.92e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 chewing+0.001
ifying+0.001
 spaghetti+0.001
Neg logit contributions
an-0.001
 island-0.001
 captain-0.001
 idea-0.001
 pilot-0.001
Token ground
Token ID2323
Feature activation-1.75e-03

Pos logit contributions
 gloss+0.022
 licking+0.020
 chewing+0.020
ifying+0.020
 Mitt+0.020
Neg logit contributions
an-0.013
 island-0.013
 captain-0.011
 man-0.011
 idea-0.011
Token.
Token ID13
Feature activation-2.99e-03

Pos logit contributions
an+0.002
 island+0.002
 idea+0.002
 captain+0.002
 man+0.002
Neg logit contributions
 gloss-0.004
ifying-0.004
 Mitt-0.004
 spaghetti-0.004
 Midnight-0.004
Token The
Token ID383
Feature activation+2.01e-03

Pos logit contributions
an+0.005
 island+0.005
 captain+0.005
 idea+0.005
 man+0.005
Neg logit contributions
 gloss-0.004
 chewing-0.004
 licking-0.004
 exercising-0.004
ifying-0.004
Token the
Token ID262
Feature activation+1.92e-03

Pos logit contributions
 gloss+0.005
ifying+0.005
 licking+0.005
 chewing+0.005
 exercising+0.004
Neg logit contributions
 island-0.005
an-0.005
 captain-0.005
 pilot-0.005
 idea-0.005
Token dog
Token ID3290
Feature activation+2.74e-03

Pos logit contributions
 gloss+0.003
 licking+0.003
 chewing+0.003
 spaghetti+0.003
 eating+0.003
Neg logit contributions
an-0.003
 island-0.003
 captain-0.003
 anchor-0.003
 pilot-0.003
Token and
Token ID290
Feature activation-3.44e-03

Pos logit contributions
 gloss+0.005
ifying+0.005
 spaghetti+0.004
 subt+0.004
 licking+0.004
Neg logit contributions
an-0.005
 island-0.004
 pilot-0.004
 idea-0.004
 deer-0.004
Token shared
Token ID4888
Feature activation+1.35e-02

Pos logit contributions
an+0.005
 island+0.005
 captain+0.004
 pilot+0.004
 idea+0.004
Neg logit contributions
 gloss-0.007
 licking-0.007
ifying-0.007
 chewing-0.007
 spaghetti-0.006
Token her
Token ID607
Feature activation+1.64e-03

Pos logit contributions
 gloss+0.026
 licking+0.025
 chewing+0.024
ifying+0.024
 spaghetti+0.024
Neg logit contributions
an-0.020
 island-0.020
 captain-0.018
 idea-0.018
 pilot-0.018
Token pastry
Token ID41738
Feature activation+4.44e-02

Pos logit contributions
 licking+0.002
 gloss+0.002
 eating+0.002
 chewing+0.002
 spaghetti+0.002
Neg logit contributions
an-0.004
 captain-0.004
 island-0.004
 pilot-0.003
 anchor-0.003
Token with
Token ID351
Feature activation+2.19e-03

Pos logit contributions
ifying+0.081
 gloss+0.081
 chewing+0.075
 Midnight+0.075
 licking+0.074
Neg logit contributions
an-0.071
 island-0.069
 idea-0.062
 captain-0.061
 man-0.059
Token him
Token ID683
Feature activation-6.50e-03

Pos logit contributions
 gloss+0.004
ifying+0.004
 licking+0.004
 chewing+0.004
 exercising+0.004
Neg logit contributions
an-0.003
 island-0.003
 captain-0.003
 deer-0.003
 idea-0.003
Token.
Token ID13
Feature activation-8.22e-03

Pos logit contributions
an+0.011
 island+0.010
 captain+0.010
 idea+0.009
 pilot+0.009
Neg logit contributions
 gloss-0.012
 licking-0.011
 chewing-0.011
ifying-0.011
 spaghetti-0.011
Token The
Token ID383
Feature activation-6.06e-03

Pos logit contributions
an+0.014
 man+0.014
 island+0.014
 captain+0.014
 pilot+0.014
Neg logit contributions
ifying-0.012
 gloss-0.012
 licking-0.011
 chewing-0.011
 exercising-0.010
Token dog
Token ID3290
Feature activation-2.39e-03

Pos logit contributions
an+0.008
 island+0.007
 captain+0.007
 pilot+0.006
 idea+0.006
Neg logit contributions
 gloss-0.013
 licking-0.013
 chewing-0.013
 spaghetti-0.013
 Mitt-0.013
Token now
Token ID783
Feature activation+8.50e-03

Pos logit contributions
ifying+0.037
 gloss+0.034
aling+0.031
heddar+0.031
ction+0.030
Neg logit contributions
 island-0.022
 man-0.020
 idea-0.020
an-0.020
 captain-0.019
Token.
Token ID13
Feature activation+6.42e-03

Pos logit contributions
ifying+0.019
 gloss+0.018
pill+0.017
 licking+0.017
gary+0.017
Neg logit contributions
 idea-0.010
 man-0.010
 island-0.009
an-0.009
 pilot-0.009
Token She
Token ID1375
Feature activation+4.36e-03

Pos logit contributions
ifying+0.011
 gloss+0.010
 licking+0.010
aling+0.009
ction+0.009
Neg logit contributions
 idea-0.011
 man-0.010
 girl-0.010
 landscape-0.010
 boy-0.010
Token drank
Token ID24070
Feature activation+2.34e-02

Pos logit contributions
ifying+0.008
 gloss+0.008
 licking+0.007
 chewing+0.007
 exercising+0.007
Neg logit contributions
an-0.007
 island-0.007
 captain-0.007
 idea-0.007
 pilot-0.006
Token her
Token ID607
Feature activation+1.19e-02

Pos logit contributions
ifying+0.058
seek+0.056
 gloss+0.054
aling+0.054
 subt+0.052
Neg logit contributions
 idea-0.021
 man-0.020
 people-0.017
 island-0.017
an-0.016
Token grape
Token ID30777
Feature activation+4.41e-02

Pos logit contributions
 gloss+0.013
ifying+0.012
 licking+0.012
 chewing+0.011
 exercising+0.011
Neg logit contributions
an-0.028
 island-0.026
 idea-0.026
 captain-0.025
 man-0.024
Token juice
Token ID13135
Feature activation+3.14e-02

Pos logit contributions
ifying+0.085
aling+0.074
 licking+0.073
 Plus+0.071
 By+0.070
Neg logit contributions
an-0.068
 idea-0.067
 island-0.062
 man-0.059
ies-0.054
Token and
Token ID290
Feature activation-3.89e-04

Pos logit contributions
 gloss+0.068
ifying+0.067
 licking+0.067
 chewing+0.065
 exercising+0.064
Neg logit contributions
an-0.036
 island-0.034
 idea-0.034
 man-0.031
 captain-0.030
Token forgot
Token ID16453
Feature activation+9.30e-03

Pos logit contributions
 island+0.001
 idea+0.001
 man+0.001
an+0.001
 captain+0.001
Neg logit contributions
ifying-0.001
 gloss-0.001
aling-0.001
 By-0.001
ction-0.001
Token about
Token ID546
Feature activation+1.14e-03

Pos logit contributions
ifying+0.015
 gloss+0.015
ction+0.013
�+0.013
aling+0.013
Neg logit contributions
 island-0.016
 idea-0.016
 man-0.015
 people-0.015
 captain-0.014
Token the
Token ID262
Feature activation+1.79e-03

Pos logit contributions
ifying+0.002
 gloss+0.002
ives+0.002
aling+0.002
seek+0.002
Neg logit contributions
 island-0.001
 idea-0.001
an-0.001
 man-0.001
 captain-0.001
Token a
Token ID257
Feature activation+4.38e-03

Pos logit contributions
an+0.005
 island+0.004
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
ifying-0.003
 spaghetti-0.003
Token tree
Token ID5509
Feature activation+1.12e-02

Pos logit contributions
 gloss+0.009
 licking+0.009
 chewing+0.009
 Mitt+0.008
 spaghetti+0.008
Neg logit contributions
an-0.006
 island-0.006
 captain-0.005
 idea-0.005
 pilot-0.005
Token with
Token ID351
Feature activation+8.23e-03

Pos logit contributions
 gloss+0.023
ifying+0.022
 spaghetti+0.021
 Midnight+0.021
 Mitt+0.021
Neg logit contributions
 island-0.017
an-0.016
 idea-0.014
 man-0.014
 pilot-0.013
Token shiny
Token ID22441
Feature activation-6.07e-03

Pos logit contributions
 gloss+0.013
 licking+0.012
 chewing+0.012
ifying+0.012
 exercising+0.012
Neg logit contributions
 island-0.015
an-0.015
 captain-0.014
 idea-0.014
 pilot-0.014
Token or
Token ID393
Feature activation-1.32e-03

Pos logit contributions
an+0.010
 island+0.010
 captain+0.009
 pilot+0.009
 crew+0.009
Neg logit contributions
 gloss-0.011
 licking-0.010
 chewing-0.010
 spaghetti-0.010
 eating-0.010
Tokennam
Token ID7402
Feature activation+4.35e-02

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 pilot+0.002
 idea+0.002
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
ifying-0.002
 spaghetti-0.002
Tokenents
Token ID658
Feature activation-1.16e-02

Pos logit contributions
 gloss+0.049
 licking+0.047
 chewing+0.044
 spaghetti+0.044
 Mitt+0.043
Neg logit contributions
an-0.100
 island-0.097
 captain-0.093
 pilot-0.091
 idea-0.090
Token hanging
Token ID10938
Feature activation+6.53e-04

Pos logit contributions
an+0.023
 island+0.022
 captain+0.021
 pilot+0.021
 idea+0.020
Neg logit contributions
 gloss-0.017
 licking-0.016
 chewing-0.016
 eating-0.015
ifying-0.015
Token from
Token ID422
Feature activation-2.17e-02

Pos logit contributions
 gloss+0.001
 licking+0.001
ifying+0.001
 chewing+0.001
 eating+0.001
Neg logit contributions
an-0.001
 island-0.001
 captain-0.001
 pilot-0.001
 idea-0.001
Token its
Token ID663
Feature activation+5.32e-03

Pos logit contributions
an+0.042
 captain+0.039
 island+0.037
 idea+0.036
 pilot+0.035
Neg logit contributions
 licking-0.032
 chewing-0.032
 eating-0.032
 Mitt-0.031
 spaghetti-0.031
Token branches
Token ID13737
Feature activation+1.18e-02

Pos logit contributions
 gloss+0.011
 licking+0.010
 chewing+0.010
ifying+0.010
 spaghetti+0.010
Neg logit contributions
an-0.007
 island-0.007
 captain-0.007
 idea-0.006
 pilot-0.006
Token and
Token ID290
Feature activation+1.34e-03

Pos logit contributions
an+0.003
 island+0.003
 captain+0.003
 idea+0.003
 pilot+0.003
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
ifying-0.003
 spaghetti-0.003
Token tried
Token ID3088
Feature activation+6.19e-03

Pos logit contributions
 gloss+0.002
 licking+0.002
ifying+0.002
 chewing+0.002
 spaghetti+0.002
Neg logit contributions
 island-0.002
an-0.002
 captain-0.002
 pilot-0.002
 idea-0.002
Token to
Token ID284
Feature activation-1.25e-03

Pos logit contributions
ifying+0.010
 gloss+0.010
heddar+0.009
umbs+0.009
 Midnight+0.008
Neg logit contributions
 island-0.010
an-0.010
 captain-0.010
 idea-0.010
 anchor-0.010
Token grab
Token ID5552
Feature activation-7.79e-04

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 pilot+0.002
 idea+0.002
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
 Mitt-0.002
 spaghetti-0.002
Token their
Token ID511
Feature activation+1.46e-02

Pos logit contributions
an+0.001
 island+0.001
 pilot+0.001
 captain+0.001
 idea+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 Mitt-0.001
 chewing-0.001
ifying-0.001
Token pastry
Token ID41738
Feature activation+4.31e-02

Pos logit contributions
 licking+0.027
 gloss+0.026
 chewing+0.026
 spaghetti+0.025
ifying+0.025
Neg logit contributions
an-0.023
 island-0.022
 captain-0.021
 pilot-0.021
 idea-0.019
Token.
Token ID13
Feature activation-4.84e-04

Pos logit contributions
 gloss+0.079
 licking+0.075
 chewing+0.074
ifying+0.072
 Mitt+0.071
Neg logit contributions
an-0.069
 island-0.067
 captain-0.063
 idea-0.062
 pilot-0.059
Token Lily
Token ID20037
Feature activation+8.92e-03

Pos logit contributions
an+0.001
 captain+0.001
 island+0.001
 idea+0.001
 man+0.001
Neg logit contributions
 gloss-0.001
ifying-0.001
 licking-0.001
 chewing-0.001
 spaghetti-0.001
Token and
Token ID290
Feature activation+1.01e-02

Pos logit contributions
 gloss+0.017
 licking+0.016
ifying+0.016
 chewing+0.016
 spaghetti+0.016
Neg logit contributions
an-0.013
 island-0.013
 captain-0.012
 idea-0.012
 pilot-0.012
Token Tom
Token ID4186
Feature activation+4.49e-03

Pos logit contributions
 gloss+0.011
 licking+0.011
ifying+0.010
 chewing+0.010
 spaghetti+0.009
Neg logit contributions
an-0.023
 island-0.023
 captain-0.022
 idea-0.022
 pilot-0.022
Token screamed
Token ID25421
Feature activation+5.22e-03

Pos logit contributions
 gloss+0.008
 licking+0.008
 chewing+0.008
ifying+0.007
 spaghetti+0.007
Neg logit contributions
an-0.007
 island-0.007
 captain-0.007
 idea-0.007
 pilot-0.006
Token a
Token ID257
Feature activation+6.80e-03

Pos logit contributions
an+0.006
 island+0.006
 captain+0.006
 idea+0.006
 pilot+0.006
Neg logit contributions
 gloss-0.007
 licking-0.007
ifying-0.007
 chewing-0.007
 exercising-0.006
Token knife
Token ID9845
Feature activation+1.00e-02

Pos logit contributions
 gloss+0.012
 licking+0.012
 chewing+0.011
ifying+0.011
 Mitt+0.011
Neg logit contributions
an-0.011
 island-0.011
 idea-0.010
 captain-0.010
 pilot-0.010
Token and
Token ID290
Feature activation-2.57e-03

Pos logit contributions
 gloss+0.021
 licking+0.020
ifying+0.020
 chewing+0.019
 spaghetti+0.019
Neg logit contributions
an-0.014
 island-0.013
 captain-0.012
 idea-0.012
 pilot-0.012
Token cut
Token ID2005
Feature activation+8.14e-03

Pos logit contributions
 island+0.004
an+0.004
 captain+0.004
 anchor+0.003
 idea+0.003
Neg logit contributions
 gloss-0.005
ifying-0.005
 licking-0.004
 exercising-0.004
 chewing-0.004
Token the
Token ID262
Feature activation+7.98e-03

Pos logit contributions
 gloss+0.015
 licking+0.014
 chewing+0.014
ifying+0.014
 spaghetti+0.014
Neg logit contributions
an-0.013
 island-0.013
 captain-0.012
 idea-0.012
 pilot-0.012
Token pastry
Token ID41738
Feature activation+4.27e-02

Pos logit contributions
 gloss+0.015
 licking+0.014
 chewing+0.014
ifying+0.014
 Mitt+0.014
Neg logit contributions
an-0.012
 island-0.012
 captain-0.011
 idea-0.011
 pilot-0.011
Token.
Token ID13
Feature activation+1.30e-03

Pos logit contributions
 gloss+0.083
 Mitt+0.079
 Midnight+0.079
 licking+0.078
ifying+0.078
Neg logit contributions
an-0.061
 island-0.060
 idea-0.057
 man-0.054
 captain-0.054
Token He
Token ID679
Feature activation-2.56e-03

Pos logit contributions
 gloss+0.002
ifying+0.002
 exercising+0.002
 licking+0.002
 chewing+0.002
Neg logit contributions
 captain-0.002
 idea-0.002
an-0.002
 man-0.002
 island-0.002
Token gave
Token ID2921
Feature activation-2.59e-05

Pos logit contributions
an+0.004
 island+0.004
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.005
ifying-0.004
 licking-0.004
 chewing-0.004
 exercising-0.004
Token Lily
Token ID20037
Feature activation+1.15e-02

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 idea+0.000
 pilot+0.000
Neg logit contributions
 gloss-0.000
 licking-0.000
 chewing-0.000
ifying-0.000
 spaghetti-0.000
Token a
Token ID257
Feature activation+9.77e-04

Pos logit contributions
 gloss+0.021
ifying+0.020
 licking+0.019
 exercising+0.019
 subt+0.019
Neg logit contributions
an-0.019
 island-0.017
 idea-0.016
 captain-0.016
 man-0.016
Token bitter
Token ID12922
Feature activation+9.87e-03

Pos logit contributions
 gloss+0.017
seek+0.017
ifying+0.016
ives+0.016
stand+0.015
Neg logit contributions
 idea-0.013
 island-0.012
an-0.012
 man-0.012
 anchor-0.012
Token and
Token ID290
Feature activation+1.21e-02

Pos logit contributions
ifying+0.022
 gloss+0.022
aling+0.021
 subt+0.020
 licking+0.020
Neg logit contributions
 idea-0.011
an-0.011
 island-0.011
 man-0.009
ies-0.009
Token made
Token ID925
Feature activation+6.97e-03

Pos logit contributions
 gloss+0.022
seek+0.021
ifying+0.021
stand+0.020
 By+0.020
Neg logit contributions
 idea-0.018
an-0.018
 island-0.018
 man-0.017
 anchor-0.017
Token her
Token ID607
Feature activation+1.90e-02

Pos logit contributions
 gloss+0.012
 licking+0.012
ifying+0.012
 chewing+0.012
 exercising+0.011
Neg logit contributions
an-0.011
 idea-0.011
 island-0.010
 captain-0.010
 pilot-0.010
Token face
Token ID1986
Feature activation+9.29e-03

Pos logit contributions
 gloss+0.033
ifying+0.031
 By+0.030
 Midnight+0.029
 exercising+0.029
Neg logit contributions
an-0.033
 idea-0.033
 island-0.031
 captain-0.029
 pilot-0.028
Token scr
Token ID6040
Feature activation+4.26e-02

Pos logit contributions
 gloss+0.016
 By+0.015
 subt+0.015
 Midnight+0.015
seek+0.014
Neg logit contributions
an-0.015
 idea-0.015
 island-0.014
 captain-0.013
ies-0.013
Tokenunch
Token ID3316
Feature activation-1.05e-03

Pos logit contributions
 gloss+0.065
 Midnight+0.065
ifying+0.061
 licking+0.060
 subt+0.058
Neg logit contributions
an-0.085
 idea-0.073
 island-0.071
 anchor-0.066
ble-0.065
Token up
Token ID510
Feature activation-1.94e-02

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 idea+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
ifying-0.002
 spaghetti-0.002
Token.
Token ID13
Feature activation+3.88e-03

Pos logit contributions
an+0.032
 island+0.031
 captain+0.029
 idea+0.029
 pilot+0.028
Neg logit contributions
 gloss-0.035
 licking-0.033
 chewing-0.032
ifying-0.032
 Mitt-0.031
Token
Token ID198
Feature activation-1.84e-04

Pos logit contributions
ifying+0.006
 gloss+0.006
 licking+0.006
 chewing+0.005
 exercising+0.005
Neg logit contributions
an-0.006
 idea-0.006
 man-0.006
 island-0.006
 captain-0.006
Token
Token ID198
Feature activation+2.93e-03

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 idea+0.000
 pilot+0.000
Neg logit contributions
 gloss-0.000
 licking-0.000
ifying-0.000
 chewing-0.000
 spaghetti-0.000
Token smiled
Token ID13541
Feature activation-6.08e-03

Pos logit contributions
an+0.006
 island+0.006
 captain+0.006
 idea+0.006
 pilot+0.006
Neg logit contributions
 gloss-0.008
 licking-0.008
 chewing-0.007
ifying-0.007
 spaghetti-0.007
Token as
Token ID355
Feature activation-1.99e-02

Pos logit contributions
an+0.010
 island+0.009
 captain+0.009
 idea+0.009
 pilot+0.009
Neg logit contributions
 gloss-0.011
 licking-0.011
 chewing-0.011
ifying-0.010
 spaghetti-0.010
Token he
Token ID339
Feature activation+9.68e-03

Pos logit contributions
an+0.043
 island+0.041
 captain+0.039
 idea+0.039
 pilot+0.038
Neg logit contributions
 gloss-0.026
 licking-0.025
 chewing-0.024
ifying-0.023
 spaghetti-0.023
Token held
Token ID2714
Feature activation-3.02e-03

Pos logit contributions
 gloss+0.015
 licking+0.014
 chewing+0.014
ifying+0.014
 spaghetti+0.013
Neg logit contributions
an-0.018
 island-0.018
 captain-0.017
 idea-0.016
 pilot-0.016
Token his
Token ID465
Feature activation+7.38e-03

Pos logit contributions
an+0.006
 island+0.006
 captain+0.005
 pilot+0.005
 idea+0.005
Neg logit contributions
 gloss-0.005
 licking-0.005
 chewing-0.005
 spaghetti-0.004
ifying-0.004
Token pastry
Token ID41738
Feature activation+4.26e-02

Pos logit contributions
 licking+0.013
 chewing+0.012
 gloss+0.012
 spaghetti+0.012
ifying+0.012
Neg logit contributions
an-0.013
 island-0.013
 captain-0.012
 pilot-0.012
 idea-0.011
Token in
Token ID287
Feature activation+7.39e-03

Pos logit contributions
 gloss+0.082
 licking+0.077
 Mitt+0.075
 Midnight+0.075
ifying+0.075
Neg logit contributions
an-0.063
 island-0.063
 captain-0.059
 idea-0.058
 anchor-0.055
Token his
Token ID465
Feature activation+5.04e-03

Pos logit contributions
 gloss+0.013
 licking+0.012
 chewing+0.011
ifying+0.011
 spaghetti+0.011
Neg logit contributions
an-0.013
 island-0.013
 captain-0.012
 idea-0.012
 pilot-0.012
Token hands
Token ID2832
Feature activation+9.25e-03

Pos logit contributions
 gloss+0.010
 licking+0.009
 chewing+0.009
 exercising+0.009
ifying+0.009
Neg logit contributions
an-0.008
 island-0.007
 idea-0.007
 captain-0.007
 pilot-0.007
Token.
Token ID13
Feature activation+2.97e-03

Pos logit contributions
 gloss+0.019
 licking+0.018
 Mitt+0.018
 Midnight+0.018
ifying+0.018
Neg logit contributions
an-0.012
 island-0.012
 captain-0.011
 idea-0.011
 anchor-0.010
Token
Token ID198
Feature activation+1.81e-03

Pos logit contributions
 gloss+0.006
ifying+0.005
 licking+0.005
 spaghetti+0.005
 exercising+0.005
Neg logit contributions
 man-0.004
 idea-0.004
an-0.004
 captain-0.004
 boy-0.004
Token be
Token ID307
Feature activation-9.76e-04

Pos logit contributions
 island+0.001
 idea+0.001
an+0.001
 captain+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
ifying-0.001
 spaghetti-0.001
 licking-0.001
 Mitt-0.001
Token able
Token ID1498
Feature activation+9.95e-03

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 idea+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.001
ifying-0.001
 licking-0.001
 spaghetti-0.001
 chewing-0.001
Token to
Token ID284
Feature activation+1.03e-02

Pos logit contributions
 gloss+0.016
ifying+0.016
heddar+0.014
 spaghetti+0.014
aling+0.014
Neg logit contributions
 island-0.017
an-0.017
 idea-0.016
 man-0.016
 captain-0.016
Token share
Token ID2648
Feature activation+1.84e-02

Pos logit contributions
 gloss+0.018
ifying+0.018
 spaghetti+0.016
 By+0.016
 licking+0.016
Neg logit contributions
 island-0.017
an-0.016
 idea-0.016
 captain-0.016
 ring-0.015
Token her
Token ID607
Feature activation-1.26e-03

Pos logit contributions
 gloss+0.036
ifying+0.034
 licking+0.034
 chewing+0.033
 Mitt+0.032
Neg logit contributions
an-0.027
 island-0.026
 idea-0.024
 captain-0.024
 pilot-0.023
Token pastry
Token ID41738
Feature activation+4.21e-02

Pos logit contributions
an+0.003
 island+0.003
 captain+0.003
 pilot+0.003
 anchor+0.003
Neg logit contributions
 licking-0.001
 gloss-0.001
 chewing-0.001
 spaghetti-0.001
 eating-0.001
Token with
Token ID351
Feature activation+1.69e-03

Pos logit contributions
ifying+0.079
 gloss+0.077
 licking+0.071
 chewing+0.071
 Mitt+0.069
Neg logit contributions
an-0.065
 island-0.063
 idea-0.060
 man-0.056
 captain-0.056
Token the
Token ID262
Feature activation-9.06e-03

Pos logit contributions
 gloss+0.003
ifying+0.003
 licking+0.003
 chewing+0.003
 spaghetti+0.003
Neg logit contributions
an-0.003
 island-0.003
 captain-0.002
 idea-0.002
 pilot-0.002
Token bird
Token ID6512
Feature activation+6.18e-04

Pos logit contributions
an+0.013
 island+0.012
 captain+0.011
 anchor+0.011
 route+0.011
Neg logit contributions
 chewing-0.017
 gloss-0.017
 licking-0.017
 eating-0.017
 spaghetti-0.017
Token.
Token ID13
Feature activation+9.48e-03

Pos logit contributions
 gloss+0.001
ifying+0.001
 licking+0.001
 spaghetti+0.001
 chewing+0.001
Neg logit contributions
an-0.001
 island-0.001
 idea-0.001
 captain-0.001
 pilot-0.001
Token<|endoftext|>
Token ID50256
Feature activation-3.90e-03

Pos logit contributions
ifying+0.016
 gloss+0.015
 spaghetti+0.014
 chewing+0.014
 exercising+0.014
Neg logit contributions
an-0.016
 idea-0.015
 island-0.014
 man-0.013
 landscape-0.013
Token went
Token ID1816
Feature activation-1.83e-03

Pos logit contributions
 gloss+0.017
 chewing+0.016
 spaghetti+0.015
 licking+0.015
 sauce+0.015
Neg logit contributions
an-0.025
 captain-0.024
 island-0.023
 pilot-0.022
 route-0.022
Token she
Token ID673
Feature activation+6.34e-03

Pos logit contributions
an+0.003
 island+0.003
 captain+0.003
 anchor+0.003
 pilot+0.003
Neg logit contributions
 gloss-0.003
 chewing-0.003
 licking-0.003
 eating-0.003
 Mitt-0.003
Token searched
Token ID16499
Feature activation-1.76e-02

Pos logit contributions
 gloss+0.009
 chewing+0.008
 eating+0.008
 licking+0.008
 spaghetti+0.008
Neg logit contributions
an-0.013
 captain-0.012
 island-0.011
 idea-0.011
 route-0.011
Token for
Token ID329
Feature activation-1.36e-02

Pos logit contributions
an+0.035
 captain+0.032
 island+0.032
 pilot+0.031
 anchor+0.029
Neg logit contributions
 licking-0.025
 chewing-0.025
 eating-0.025
 spaghetti-0.024
 gloss-0.024
Token or
Token ID393
Feature activation-3.25e-03

Pos logit contributions
an+0.026
 captain+0.022
 island+0.021
 pilot+0.020
tha+0.020
Neg logit contributions
 licking-0.023
 chewing-0.023
 eating-0.023
 spaghetti-0.021
 much-0.021
Tokennam
Token ID7402
Feature activation+4.18e-02

Pos logit contributions
an+0.006
 island+0.006
 captain+0.006
 idea+0.006
 pilot+0.006
Neg logit contributions
 gloss-0.005
 licking-0.004
 chewing-0.004
ifying-0.004
 spaghetti-0.004
Tokenents
Token ID658
Feature activation-1.47e-02

Pos logit contributions
 gloss+0.050
 licking+0.047
 chewing+0.045
 spaghetti+0.044
 Mitt+0.044
Neg logit contributions
an-0.095
 island-0.092
 captain-0.088
 pilot-0.087
 idea-0.084
Token,
Token ID11
Feature activation+5.51e-04

Pos logit contributions
an+0.028
 island+0.027
 captain+0.026
 pilot+0.025
 idea+0.025
Neg logit contributions
 gloss-0.022
 licking-0.021
 chewing-0.020
 eating-0.020
ifying-0.020
Token and
Token ID290
Feature activation+2.70e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 chewing+0.001
 spaghetti+0.001
 eating+0.001
Neg logit contributions
an-0.001
 island-0.001
 captain-0.001
 idea-0.001
 pilot-0.001
Token the
Token ID262
Feature activation-2.26e-03

Pos logit contributions
 gloss+0.004
 licking+0.004
ifying+0.004
 chewing+0.004
 spaghetti+0.004
Neg logit contributions
an-0.005
 island-0.005
 captain-0.005
 pilot-0.005
 idea-0.005
Token more
Token ID517
Feature activation-5.40e-03

Pos logit contributions
an+0.003
 island+0.003
 captain+0.003
 anchor+0.003
 idea+0.003
Neg logit contributions
 gloss-0.004
 licking-0.004
 chewing-0.004
 spaghetti-0.004
 eating-0.004
Token to
Token ID284
Feature activation-5.50e-03

Pos logit contributions
ifying+0.038
 gloss+0.036
heddar+0.034
 licking+0.032
aling+0.032
Neg logit contributions
 island-0.032
 idea-0.031
 captain-0.030
an-0.030
 man-0.029
Token help
Token ID1037
Feature activation+1.10e-03

Pos logit contributions
an+0.011
 island+0.010
 captain+0.009
 pilot+0.009
 idea+0.009
Neg logit contributions
 gloss-0.009
 licking-0.008
 chewing-0.008
 Mitt-0.008
ifying-0.008
Token sort
Token ID3297
Feature activation+2.31e-02

Pos logit contributions
 gloss+0.002
 licking+0.002
ifying+0.002
 chewing+0.002
 spaghetti+0.002
Neg logit contributions
an-0.002
 island-0.002
 captain-0.002
 idea-0.002
 pilot-0.002
Token the
Token ID262
Feature activation-1.10e-03

Pos logit contributions
 gloss+0.041
 licking+0.039
 chewing+0.038
ifying+0.037
 spaghetti+0.036
Neg logit contributions
an-0.039
 island-0.037
 captain-0.035
 idea-0.035
 pilot-0.034
Token or
Token ID393
Feature activation-2.20e-03

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 pilot+0.002
tha+0.001
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
 spaghetti-0.002
 eating-0.002
Tokennam
Token ID7402
Feature activation+4.17e-02

Pos logit contributions
 island+0.004
an+0.004
 captain+0.004
 idea+0.003
 pilot+0.003
Neg logit contributions
 gloss-0.004
 licking-0.004
ifying-0.003
 chewing-0.003
 spaghetti-0.003
Tokenents
Token ID658
Feature activation-7.76e-03

Pos logit contributions
 gloss+0.051
 licking+0.048
 chewing+0.046
 Mitt+0.046
 spaghetti+0.045
Neg logit contributions
an-0.093
 island-0.090
 captain-0.087
 pilot-0.085
 idea-0.083
Token.
Token ID13
Feature activation-1.17e-03

Pos logit contributions
an+0.015
 island+0.014
 captain+0.014
 pilot+0.014
 idea+0.013
Neg logit contributions
 gloss-0.011
 licking-0.011
 chewing-0.011
 eating-0.010
ifying-0.010
Token She
Token ID1375
Feature activation+4.35e-03

Pos logit contributions
an+0.002
 idea+0.002
 island+0.002
 captain+0.002
 man+0.002
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
ifying-0.001
 eating-0.001
Token picked
Token ID6497
Feature activation-1.07e-03

Pos logit contributions
 gloss+0.007
 licking+0.006
 chewing+0.006
 Mitt+0.006
 spaghetti+0.006
Neg logit contributions
an-0.008
 island-0.008
 pilot-0.007
 captain-0.007
 idea-0.007
Token up
Token ID510
Feature activation+1.74e-03

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 pilot+0.002
 idea+0.002
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
 spaghetti-0.002
 Mitt-0.002
Token Tim
Token ID5045
Feature activation+3.91e-03

Pos logit contributions
ifying+0.001
 gloss+0.001
aling+0.001
 chewing+0.001
 choke+0.001
Neg logit contributions
 idea-0.000
 man-0.000
an-0.000
 boy-0.000
 girl-0.000
Tokenmy
Token ID1820
Feature activation-1.66e-03

Pos logit contributions
 gloss+0.004
 licking+0.004
ifying+0.004
 chewing+0.003
 spaghetti+0.003
Neg logit contributions
an-0.010
 island-0.009
 captain-0.009
 idea-0.009
 pilot-0.009
Token accidentally
Token ID14716
Feature activation+7.34e-03

Pos logit contributions
an+0.003
 island+0.002
 captain+0.002
 idea+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.003
ifying-0.003
 licking-0.003
 chewing-0.003
 spaghetti-0.003
Token spilled
Token ID31896
Feature activation+2.39e-02

Pos logit contributions
 gloss+0.012
ifying+0.012
 By+0.012
 Plus+0.012
 Sharing+0.011
Neg logit contributions
 island-0.011
an-0.011
 captain-0.011
 sw-0.011
 anchor-0.010
Token his
Token ID465
Feature activation-6.72e-04

Pos logit contributions
 gloss+0.046
seek+0.044
aling+0.043
ifying+0.042
 licking+0.042
Neg logit contributions
 island-0.032
 idea-0.031
an-0.029
 anchor-0.029
 boat-0.028
Token milk
Token ID7545
Feature activation+4.17e-02

Pos logit contributions
an+0.001
 island+0.001
 captain+0.001
 pilot+0.001
 idea+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
ifying-0.001
 spaghetti-0.001
Token on
Token ID319
Feature activation-3.77e-03

Pos logit contributions
 licking+0.087
 gloss+0.086
 subt+0.086
aling+0.085
 chewing+0.083
Neg logit contributions
an-0.052
 island-0.046
 idea-0.044
 man-0.044
 boat-0.042
Token the
Token ID262
Feature activation+1.04e-02

Pos logit contributions
 island+0.005
an+0.005
 captain+0.004
 idea+0.004
 man+0.004
Neg logit contributions
 gloss-0.008
 licking-0.007
ifying-0.007
 chewing-0.007
 exercising-0.007
Token table
Token ID3084
Feature activation+2.14e-02

Pos logit contributions
 gloss+0.020
 By+0.018
 licking+0.018
aling+0.018
ifying+0.018
Neg logit contributions
an-0.017
 island-0.017
 captain-0.015
 man-0.015
 pilot-0.014
Token.
Token ID13
Feature activation+1.95e-03

Pos logit contributions
 gloss+0.047
ifying+0.045
 licking+0.044
 chewing+0.043
 exercising+0.043
Neg logit contributions
 island-0.024
an-0.024
 idea-0.023
 man-0.022
 captain-0.021
Token Lily
Token ID20037
Feature activation+1.03e-02

Pos logit contributions
ifying+0.003
 exercising+0.003
aling+0.003
 gloss+0.003
 licking+0.003
Neg logit contributions
 man-0.003
 idea-0.003
an-0.003
 boy-0.003
 captain-0.003
Token time
Token ID640
Feature activation-1.93e-02

Pos logit contributions
an+0.025
 island+0.022
 captain+0.022
 pilot+0.020
 anchor+0.019
Neg logit contributions
 gloss-0.033
 licking-0.032
 chewing-0.032
 spaghetti-0.031
 eating-0.031
Token playing
Token ID2712
Feature activation+2.69e-04

Pos logit contributions
 island+0.034
an+0.033
 captain+0.032
 idea+0.031
 pilot+0.031
Neg logit contributions
 gloss-0.032
ifying-0.029
 Mitt-0.029
 Midnight-0.029
 licking-0.029
Token with
Token ID351
Feature activation-1.29e-04

Pos logit contributions
 gloss+0.000
 licking+0.000
 chewing+0.000
 spaghetti+0.000
 Mitt+0.000
Neg logit contributions
an-0.001
 island-0.001
 captain-0.001
 pilot-0.001
 idea-0.001
Token her
Token ID607
Feature activation+2.66e-03

Pos logit contributions
 island+0.000
an+0.000
 captain+0.000
 idea+0.000
 pilot+0.000
Neg logit contributions
 gloss-0.000
 licking-0.000
ifying-0.000
 chewing-0.000
 spaghetti-0.000
Token or
Token ID393
Feature activation+4.68e-03

Pos logit contributions
 gloss+0.005
 licking+0.005
 chewing+0.005
 spaghetti+0.005
ifying+0.005
Neg logit contributions
an-0.004
 island-0.004
 captain-0.003
 pilot-0.003
 idea-0.003
Tokennam
Token ID7402
Feature activation+4.14e-02

Pos logit contributions
 gloss+0.007
ifying+0.007
umbs+0.007
heddar+0.007
ives+0.007
Neg logit contributions
 island-0.008
 captain-0.008
 idea-0.008
 pilot-0.008
 man-0.007
Tokenents
Token ID658
Feature activation-1.53e-02

Pos logit contributions
 gloss+0.053
 licking+0.052
 Mitt+0.051
 spaghetti+0.050
 chewing+0.049
Neg logit contributions
an-0.088
 island-0.084
 captain-0.081
 pilot-0.080
 idea-0.077
Token and
Token ID290
Feature activation-1.67e-04

Pos logit contributions
an+0.027
 island+0.026
 captain+0.025
 idea+0.024
 pilot+0.024
Neg logit contributions
 gloss-0.025
 licking-0.024
 chewing-0.023
ifying-0.023
 spaghetti-0.022
Token making
Token ID1642
Feature activation+5.17e-03

Pos logit contributions
 island+0.000
an+0.000
 captain+0.000
 idea+0.000
 pilot+0.000
Neg logit contributions
 gloss-0.000
ifying-0.000
 licking-0.000
 chewing-0.000
 spaghetti-0.000
Token her
Token ID607
Feature activation+1.13e-02

Pos logit contributions
 gloss+0.009
 licking+0.008
 chewing+0.008
 spaghetti+0.008
 Mitt+0.008
Neg logit contributions
an-0.009
 island-0.009
 captain-0.008
 pilot-0.008
 idea-0.008
Token special
Token ID2041
Feature activation+1.54e-03

Pos logit contributions
 gloss+0.021
 licking+0.020
 chewing+0.019
ifying+0.019
 spaghetti+0.019
Neg logit contributions
an-0.018
 island-0.018
 captain-0.017
 idea-0.016
 pilot-0.016
Token and
Token ID290
Feature activation+4.18e-03

Pos logit contributions
an+0.019
 island+0.018
 captain+0.017
 idea+0.016
 pilot+0.016
Neg logit contributions
 gloss-0.019
 licking-0.018
 chewing-0.017
ifying-0.017
 spaghetti-0.017
Token play
Token ID711
Feature activation+1.05e-03

Pos logit contributions
 gloss+0.007
 licking+0.007
 chewing+0.007
ifying+0.007
 spaghetti+0.006
Neg logit contributions
an-0.007
 island-0.007
 captain-0.007
 idea-0.007
 pilot-0.007
Token with
Token ID351
Feature activation-9.10e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 chewing+0.001
 spaghetti+0.001
 Mitt+0.001
Neg logit contributions
an-0.002
 island-0.002
 captain-0.002
 pilot-0.002
 idea-0.002
Token your
Token ID534
Feature activation-5.76e-04

Pos logit contributions
an+0.016
 island+0.016
 captain+0.015
 idea+0.014
 pilot+0.014
Neg logit contributions
 gloss-0.015
 licking-0.015
 chewing-0.014
ifying-0.014
 spaghetti-0.014
Token or
Token ID393
Feature activation+8.89e-04

Pos logit contributions
an+0.001
 island+0.001
 captain+0.001
 pilot+0.001
 idea+0.001
Neg logit contributions
 licking-0.001
 spaghetti-0.001
 gloss-0.001
 chewing-0.001
 Mitt-0.001
Tokennam
Token ID7402
Feature activation+4.13e-02

Pos logit contributions
 gloss+0.002
ifying+0.002
 licking+0.001
heddar+0.001
 By+0.001
Neg logit contributions
 island-0.001
 captain-0.001
an-0.001
 pilot-0.001
 anchor-0.001
Tokenents
Token ID658
Feature activation-1.13e-02

Pos logit contributions
 gloss+0.054
 licking+0.053
 Mitt+0.051
 spaghetti+0.050
 chewing+0.050
Neg logit contributions
an-0.088
 island-0.083
 captain-0.080
 pilot-0.079
 idea-0.076
Token.
Token ID13
Feature activation+1.73e-03

Pos logit contributions
an+0.020
 island+0.020
 captain+0.019
 idea+0.018
 pilot+0.018
Neg logit contributions
 gloss-0.018
 licking-0.018
 chewing-0.017
ifying-0.017
 spaghetti-0.017
Token Maybe
Token ID6674
Feature activation-2.49e-03

Pos logit contributions
 gloss+0.002
 licking+0.002
 chewing+0.002
ifying+0.002
 choking+0.002
Neg logit contributions
 island-0.003
an-0.003
 captain-0.003
 pilot-0.003
 idea-0.003
Token you
Token ID345
Feature activation-2.81e-03

Pos logit contributions
an+0.005
 island+0.005
 idea+0.004
 captain+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.004
 licking-0.003
 chewing-0.003
 spaghetti-0.003
 Mitt-0.003
Token can
Token ID460
Feature activation-1.79e-03

Pos logit contributions
an+0.005
 island+0.005
 captain+0.005
 idea+0.004
 anchor+0.004
Neg logit contributions
 gloss-0.004
ifying-0.004
 chewing-0.004
 licking-0.004
 spaghetti-0.004
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
ifying+0.022
 gloss+0.021
aling+0.021
 licking+0.021
 Midnight+0.020
Neg logit contributions
 idea-0.012
an-0.011
 island-0.010
 man-0.010
ies-0.009
Token It
Token ID632
Feature activation+7.26e-03

Pos logit contributions
 gloss+0.018
ifying+0.018
 exercising+0.017
 licking+0.017
seek+0.017
Neg logit contributions
 idea-0.018
an-0.017
 man-0.016
 island-0.016
 boy-0.016
Token makes
Token ID1838
Feature activation+3.06e-03

Pos logit contributions
 gloss+0.015
seek+0.014
ifying+0.014
stand+0.014
 exercising+0.014
Neg logit contributions
an-0.010
 idea-0.009
 island-0.008
 man-0.008
 sw-0.008
Token her
Token ID607
Feature activation+1.89e-02

Pos logit contributions
 gloss+0.005
ifying+0.005
 licking+0.005
 chewing+0.005
 exercising+0.005
Neg logit contributions
an-0.005
 island-0.005
 idea-0.005
 captain-0.005
 pilot-0.005
Token face
Token ID1986
Feature activation+5.57e-03

Pos logit contributions
 gloss+0.035
ifying+0.034
 By+0.033
 subt+0.032
aling+0.032
Neg logit contributions
 idea-0.030
an-0.029
 island-0.028
 captain-0.025
 pilot-0.025
Token scr
Token ID6040
Feature activation+4.12e-02

Pos logit contributions
 By+0.009
 gloss+0.009
 subt+0.009
seek+0.009
 licking+0.009
Neg logit contributions
an-0.009
 idea-0.009
 island-0.008
ies-0.008
 captain-0.008
Tokenunch
Token ID3316
Feature activation-1.64e-03

Pos logit contributions
 Midnight+0.062
 gloss+0.062
 licking+0.058
ifying+0.058
 spaghetti+0.056
Neg logit contributions
an-0.080
 idea-0.066
ble-0.065
 island-0.064
 anchor-0.061
Token up
Token ID510
Feature activation-2.09e-02

Pos logit contributions
an+0.003
 island+0.003
 captain+0.003
 idea+0.003
 pilot+0.003
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
ifying-0.002
 spaghetti-0.002
Token.
Token ID13
Feature activation+6.59e-03

Pos logit contributions
an+0.034
 island+0.033
 captain+0.031
 idea+0.030
 pilot+0.030
Neg logit contributions
 gloss-0.038
 licking-0.036
 chewing-0.035
ifying-0.035
 spaghetti-0.034
Token She
Token ID1375
Feature activation+5.72e-03

Pos logit contributions
ifying+0.010
 gloss+0.010
 exercising+0.010
 licking+0.010
 chewing+0.009
Neg logit contributions
 idea-0.011
an-0.011
 man-0.010
 island-0.010
 captain-0.009
Token sp
Token ID599
Feature activation+1.13e-02

Pos logit contributions
 gloss+0.012
ifying+0.011
seek+0.011
 licking+0.011
 Mitt+0.011
Neg logit contributions
an-0.008
 island-0.007
 idea-0.007
 anchor-0.006
 captain-0.006
Token a
Token ID257
Feature activation-3.23e-03

Pos logit contributions
 island+0.009
an+0.009
 pilot+0.009
 captain+0.009
 idea+0.009
Neg logit contributions
 gloss-0.009
 licking-0.009
ifying-0.009
 chewing-0.009
 exercising-0.008
Token box
Token ID3091
Feature activation+7.59e-03

Pos logit contributions
an+0.005
 island+0.005
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.006
 licking-0.006
 chewing-0.006
ifying-0.006
 spaghetti-0.006
Token of
Token ID286
Feature activation+3.71e-03

Pos logit contributions
 gloss+0.012
 licking+0.011
 spaghetti+0.011
 chewing+0.011
 Mitt+0.011
Neg logit contributions
 island-0.014
an-0.014
 captain-0.013
 pilot-0.013
 idea-0.013
Token pretty
Token ID2495
Feature activation-1.74e-03

Pos logit contributions
 gloss+0.006
 spaghetti+0.006
 licking+0.006
 chewing+0.006
 Mitt+0.006
Neg logit contributions
an-0.007
 island-0.006
 captain-0.006
 pilot-0.006
tha-0.005
Token or
Token ID393
Feature activation+9.34e-04

Pos logit contributions
an+0.003
 captain+0.002
 island+0.002
 pilot+0.002
tha+0.002
Neg logit contributions
 gloss-0.003
 spaghetti-0.003
 licking-0.003
 chewing-0.003
 eating-0.003
Tokennam
Token ID7402
Feature activation+4.12e-02

Pos logit contributions
 gloss+0.002
 licking+0.002
ifying+0.002
 chewing+0.001
 By+0.001
Neg logit contributions
 island-0.001
 idea-0.001
an-0.001
 captain-0.001
 pilot-0.001
Tokenents
Token ID658
Feature activation-1.09e-02

Pos logit contributions
 gloss+0.050
 licking+0.048
 Mitt+0.045
 chewing+0.045
 spaghetti+0.045
Neg logit contributions
an-0.092
 island-0.089
 captain-0.085
 pilot-0.084
 idea-0.081
Token that
Token ID326
Feature activation+1.35e-03

Pos logit contributions
an+0.021
 island+0.020
 captain+0.019
 pilot+0.019
 idea+0.019
Neg logit contributions
 gloss-0.016
 licking-0.015
 chewing-0.015
 eating-0.015
ifying-0.015
Token she
Token ID673
Feature activation+1.29e-02

Pos logit contributions
 gloss+0.002
 licking+0.001
 chewing+0.001
 Mitt+0.001
 spaghetti+0.001
Neg logit contributions
an-0.003
 island-0.003
 captain-0.003
 pilot-0.003
 idea-0.003
Token wanted
Token ID2227
Feature activation-1.24e-03

Pos logit contributions
 gloss+0.020
 chewing+0.018
 licking+0.018
 spaghetti+0.017
 eating+0.017
Neg logit contributions
an-0.025
 island-0.025
 captain-0.023
 pilot-0.022
 anchor-0.022
Token to
Token ID284
Feature activation+6.05e-04

Pos logit contributions
 island+0.003
an+0.003
 captain+0.003
 idea+0.003
 pilot+0.002
Neg logit contributions
 gloss-0.002
 licking-0.001
ifying-0.001
 chewing-0.001
 Mitt-0.001
Token to
Token ID284
Feature activation+1.12e-02

Pos logit contributions
an+0.031
 island+0.030
 captain+0.028
 pilot+0.027
 idea+0.027
Neg logit contributions
 gloss-0.026
 licking-0.025
 chewing-0.024
ifying-0.024
 spaghetti-0.023
Token buy
Token ID2822
Feature activation+3.25e-04

Pos logit contributions
 gloss+0.017
 licking+0.017
 chewing+0.017
 eating+0.016
ifying+0.016
Neg logit contributions
an-0.021
 island-0.019
 captain-0.018
 pilot-0.018
 idea-0.018
Token them
Token ID606
Feature activation-2.14e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 spaghetti+0.001
 chewing+0.001
 Mitt+0.001
Neg logit contributions
an-0.001
 island-0.001
 captain-0.001
 pilot-0.001
 idea-0.000
Token".
Token ID1911
Feature activation-9.19e-04

Pos logit contributions
an+0.004
 island+0.004
 captain+0.004
 pilot+0.004
 idea+0.004
Neg logit contributions
 gloss-0.003
 spaghetti-0.003
 licking-0.003
 chewing-0.003
ifying-0.003
Token Little
Token ID7703
Feature activation-2.16e-02

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 idea+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.001
 licking-0.001
ifying-0.001
 chewing-0.001
 spaghetti-0.001
Token Cornel
Token ID48214
Feature activation-5.03e-02

Pos logit contributions
 island+0.039
an+0.037
 captain+0.036
 pilot+0.035
 idea+0.034
Neg logit contributions
 gloss-0.034
 Mitt-0.033
 licking-0.033
 chewing-0.033
 spaghetti-0.032
Tokenius
Token ID3754
Feature activation-1.41e-02

Pos logit contributions
 pilot+0.057
 passengers+0.056
 idea+0.056
 route+0.056
 island+0.055
Neg logit contributions
o-0.089
 licking-0.087
heddar-0.087
 chewing-0.085
ccoli-0.082
Token smiled
Token ID13541
Feature activation-7.00e-03

Pos logit contributions
 island+0.024
an+0.023
 captain+0.022
 pilot+0.022
 idea+0.021
Neg logit contributions
 gloss-0.024
 licking-0.023
 chewing-0.022
 Mitt-0.021
 spaghetti-0.021
Token and
Token ID290
Feature activation+2.33e-03

Pos logit contributions
an+0.012
 island+0.012
 captain+0.011
 pilot+0.010
 idea+0.010
Neg logit contributions
 gloss-0.012
 licking-0.012
 chewing-0.012
ifying-0.011
 spaghetti-0.011
Token agreed
Token ID4987
Feature activation+1.23e-03

Pos logit contributions
 gloss+0.004
 licking+0.004
ifying+0.004
 chewing+0.004
 spaghetti+0.004
Neg logit contributions
an-0.004
 island-0.004
 captain-0.003
 idea-0.003
 pilot-0.003
Token.
Token ID13
Feature activation+5.24e-03

Pos logit contributions
 gloss+0.002
ifying+0.002
 licking+0.002
 chewing+0.002
 spaghetti+0.002
Neg logit contributions
an-0.002
 island-0.002
 captain-0.002
 idea-0.002
 pilot-0.002
TokenWe
Token ID1135
Feature activation-5.51e-03

Pos logit contributions
an+0.013
 island+0.013
 captain+0.012
 idea+0.012
 pilot+0.012
Neg logit contributions
 gloss-0.009
 licking-0.008
 chewing-0.008
ifying-0.008
 spaghetti-0.008
Token are
Token ID389
Feature activation+4.21e-03

Pos logit contributions
an+0.011
 island+0.010
tha+0.009
 pilot+0.009
 captain+0.009
Neg logit contributions
 gloss-0.008
ifying-0.008
 spaghetti-0.007
 chewing-0.007
 Mitt-0.007
Token still
Token ID991
Feature activation-1.59e-02

Pos logit contributions
 gloss+0.007
ifying+0.006
 licking+0.006
 chewing+0.006
 spaghetti+0.006
Neg logit contributions
 island-0.008
an-0.008
 captain-0.008
 pilot-0.007
 idea-0.007
Token heroes
Token ID10281
Feature activation-2.46e-02

Pos logit contributions
an+0.029
 island+0.027
 idea+0.025
 pilot+0.024
 captain+0.024
Neg logit contributions
 chewing-0.026
 licking-0.025
 eating-0.025
 gloss-0.025
 Mitt-0.025
Token and
Token ID290
Feature activation+5.99e-03

Pos logit contributions
an+0.044
 island+0.042
 idea+0.040
 anchor+0.039
 captain+0.039
Neg logit contributions
 gloss-0.038
 licking-0.037
 chewing-0.036
ifying-0.036
 spaghetti-0.035
Token villains
Token ID25239
Feature activation-4.33e-02

Pos logit contributions
 gloss+0.010
 licking+0.010
 chewing+0.009
ifying+0.009
 spaghetti+0.009
Neg logit contributions
 island-0.010
an-0.010
 captain-0.010
 pilot-0.010
 idea-0.010
Token!"
Token ID2474
Feature activation+3.53e-03

Pos logit contributions
an+0.080
 island+0.078
 anchor+0.074
 captain+0.072
 pilot+0.071
Neg logit contributions
 licking-0.064
 gloss-0.061
 chewing-0.060
 eating-0.059
ifying-0.056
Token
Token ID198
Feature activation-1.72e-03

Pos logit contributions
 gloss+0.005
 licking+0.005
ifying+0.005
 chewing+0.005
 spaghetti+0.005
Neg logit contributions
an-0.007
 island-0.006
 captain-0.006
 idea-0.006
 pilot-0.006
Token
Token ID198
Feature activation+8.87e-04

Pos logit contributions
an+0.004
 island+0.004
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
 Mitt-0.002
 spaghetti-0.002
TokenThey
Token ID2990
Feature activation-8.77e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 chewing+0.001
ifying+0.001
 spaghetti+0.001
Neg logit contributions
an-0.002
 island-0.002
 captain-0.002
 idea-0.002
 pilot-0.002
Token ran
Token ID4966
Feature activation-2.78e-03

Pos logit contributions
an+0.018
 island+0.016
 idea+0.015
 pilot+0.015
 anchor+0.014
Neg logit contributions
 gloss-0.013
 licking-0.012
 chewing-0.012
 Mitt-0.012
ifying-0.012
Token then
Token ID788
Feature activation+9.01e-04

Pos logit contributions
 gloss+0.002
ifying+0.002
 licking+0.002
heddar+0.002
 chewing+0.002
Neg logit contributions
 island-0.002
 idea-0.002
 captain-0.002
an-0.002
 pilot-0.002
Token walking
Token ID6155
Feature activation-6.40e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 chewing+0.001
 Mitt+0.001
 spaghetti+0.001
Neg logit contributions
an-0.002
 island-0.002
 captain-0.001
 idea-0.001
 pilot-0.001
Token away
Token ID1497
Feature activation-5.18e-03

Pos logit contributions
an+0.012
 island+0.012
 pilot+0.011
 idea+0.011
 anchor+0.011
Neg logit contributions
 gloss-0.009
 licking-0.008
ifying-0.008
 chewing-0.008
 Mitt-0.008
Token.
Token ID13
Feature activation+4.31e-03

Pos logit contributions
an+0.009
 island+0.009
 captain+0.008
 pilot+0.008
 idea+0.008
Neg logit contributions
 gloss-0.008
 licking-0.008
 chewing-0.008
ifying-0.008
 Mitt-0.008
Token Little
Token ID7703
Feature activation-1.59e-02

Pos logit contributions
 gloss+0.007
 licking+0.006
ifying+0.006
 spaghetti+0.006
 chewing+0.006
Neg logit contributions
an-0.008
 captain-0.007
 island-0.007
 pilot-0.007
 idea-0.007
Token Cornel
Token ID48214
Feature activation-4.20e-02

Pos logit contributions
 island+0.029
an+0.028
 captain+0.028
 pilot+0.027
 idea+0.027
Neg logit contributions
 chewing-0.025
 gloss-0.025
 licking-0.024
 Mitt-0.023
 eating-0.022
Tokenius
Token ID3754
Feature activation-1.35e-02

Pos logit contributions
 pilot+0.063
 passengers+0.059
 idea+0.059
 island+0.058
 captain+0.058
Neg logit contributions
 chewing-0.063
 licking-0.063
heddar-0.058
o-0.058
 eating-0.055
Token was
Token ID373
Feature activation-4.23e-04

Pos logit contributions
an+0.023
 island+0.023
 captain+0.022
 idea+0.021
 pilot+0.021
Neg logit contributions
 gloss-0.023
 licking-0.022
 chewing-0.021
 Mitt-0.020
 eating-0.020
Token so
Token ID523
Feature activation-5.60e-04

Pos logit contributions
an+0.001
 island+0.001
 captain+0.001
 idea+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
ifying-0.001
 licking-0.001
 spaghetti-0.001
 Mitt-0.001
Token proud
Token ID6613
Feature activation+1.71e-02

Pos logit contributions
an+0.001
 island+0.001
 captain+0.001
 idea+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
ifying-0.001
 chewing-0.001
 spaghetti-0.001
Token that
Token ID326
Feature activation-4.10e-03

Pos logit contributions
 gloss+0.028
ifying+0.027
 licking+0.026
 Midnight+0.026
 Mitt+0.025
Neg logit contributions
 island-0.030
an-0.030
 captain-0.028
 idea-0.028
 man-0.027
Token wall
Token ID3355
Feature activation-6.04e-03

Pos logit contributions
an+0.008
 island+0.008
 captain+0.008
 pilot+0.007
 idea+0.007
Neg logit contributions
 gloss-0.006
 licking-0.006
 chewing-0.006
ifying-0.006
 spaghetti-0.006
Token.
Token ID13
Feature activation+2.82e-04

Pos logit contributions
an+0.010
 island+0.010
 captain+0.009
 idea+0.009
 pilot+0.009
Neg logit contributions
 gloss-0.010
 licking-0.010
 chewing-0.010
ifying-0.010
 Mitt-0.009
Token
Token ID220
Feature activation+9.88e-03

Pos logit contributions
 gloss+0.000
 licking+0.000
 chewing+0.000
 Mitt+0.000
ifying+0.000
Neg logit contributions
an-0.001
 island-0.001
 captain-0.001
 idea-0.001
 pilot-0.001
Token
Token ID198
Feature activation-3.38e-03

Pos logit contributions
 gloss+0.013
 licking+0.012
 chewing+0.012
ifying+0.012
 spaghetti+0.011
Neg logit contributions
an-0.021
 island-0.020
 captain-0.019
 idea-0.019
 pilot-0.019
Token
Token ID198
Feature activation+1.81e-03

Pos logit contributions
an+0.009
 island+0.009
 captain+0.008
 idea+0.008
 pilot+0.008
Neg logit contributions
 gloss-0.003
 licking-0.003
 Mitt-0.003
 chewing-0.003
 By-0.003
TokenFar
Token ID21428
Feature activation-4.16e-02

Pos logit contributions
 gloss+0.002
 Mitt+0.002
ifying+0.002
 licking+0.002
 Midnight+0.002
Neg logit contributions
an-0.004
 island-0.004
 idea-0.004
 captain-0.004
 anchor-0.003
Tokenley
Token ID1636
Feature activation-2.04e-02

Pos logit contributions
 idea+0.062
 island+0.060
 captain+0.056
 anchor+0.054
an+0.051
Neg logit contributions
ifying-0.074
 licking-0.071
 Midnight-0.068
 chewing-0.066
 gloss-0.066
Token watched
Token ID7342
Feature activation-7.02e-03

Pos logit contributions
 island+0.036
an+0.036
 captain+0.033
 idea+0.033
 pilot+0.033
Neg logit contributions
 gloss-0.031
 licking-0.030
 chewing-0.029
 spaghetti-0.028
 Mitt-0.028
Token as
Token ID355
Feature activation-1.40e-02

Pos logit contributions
an+0.013
 island+0.013
 captain+0.012
 idea+0.012
 pilot+0.012
Neg logit contributions
 gloss-0.011
 licking-0.011
 chewing-0.010
 Mitt-0.010
 spaghetti-0.010
Token his
Token ID465
Feature activation+2.11e-03

Pos logit contributions
an+0.027
 island+0.026
 captain+0.023
 idea+0.023
 pilot+0.022
Neg logit contributions
 gloss-0.021
 Mitt-0.021
 licking-0.021
 chewing-0.021
 spaghetti-0.020
Token mom
Token ID1995
Feature activation+8.15e-03

Pos logit contributions
 gloss+0.005
 licking+0.004
 chewing+0.004
ifying+0.004
 spaghetti+0.004
Neg logit contributions
an-0.003
 island-0.003
 captain-0.002
 idea-0.002
 pilot-0.002
Token anyone
Token ID2687
Feature activation-9.13e-03

Pos logit contributions
 gloss+0.002
 licking+0.002
 chewing+0.002
ifying+0.002
 spaghetti+0.002
Neg logit contributions
an-0.003
 island-0.003
 idea-0.003
 captain-0.003
 pilot-0.002
Token.
Token ID13
Feature activation+9.33e-03

Pos logit contributions
an+0.015
 island+0.014
 captain+0.013
 pilot+0.013
 idea+0.013
Neg logit contributions
 gloss-0.016
 licking-0.016
 chewing-0.015
ifying-0.015
 Mitt-0.015
Token I
Token ID314
Feature activation-5.65e-03

Pos logit contributions
 gloss+0.015
 licking+0.014
 exercising+0.014
 eating+0.014
 chewing+0.014
Neg logit contributions
 captain-0.015
an-0.015
 pilot-0.015
 idea-0.014
 man-0.014
Token treated
Token ID5716
Feature activation+4.45e-03

Pos logit contributions
an+0.010
 island+0.010
 captain+0.009
 pilot+0.009
 anchor+0.009
Neg logit contributions
 gloss-0.009
 licking-0.009
 chewing-0.009
ifying-0.009
 Mitt-0.009
Token the
Token ID262
Feature activation-1.59e-02

Pos logit contributions
 gloss+0.007
 licking+0.007
 chewing+0.006
ifying+0.006
 spaghetti+0.006
Neg logit contributions
an-0.008
 island-0.008
 captain-0.008
 pilot-0.007
 idea-0.007
Token poor
Token ID3595
Feature activation-4.13e-02

Pos logit contributions
an+0.025
tha+0.022
 island+0.021
 idea+0.021
 route+0.020
Neg logit contributions
 chewing-0.029
 licking-0.027
 eating-0.027
 spaghetti-0.026
 gloss-0.026
Token man
Token ID582
Feature activation-7.35e-03

Pos logit contributions
an+0.058
tha+0.052
 idea+0.052
 route+0.052
 island+0.051
Neg logit contributions
 chewing-0.074
 eating-0.071
 licking-0.070
 much-0.063
 spaghetti-0.063
Token very
Token ID845
Feature activation-9.32e-03

Pos logit contributions
an+0.012
 island+0.012
 captain+0.011
 pilot+0.011
 idea+0.011
Neg logit contributions
 gloss-0.013
 licking-0.013
 chewing-0.012
ifying-0.012
 spaghetti-0.012
Token badly
Token ID11234
Feature activation-1.25e-02

Pos logit contributions
an+0.020
 island+0.019
 captain+0.019
 idea+0.018
 pilot+0.018
Neg logit contributions
 gloss-0.012
 licking-0.012
 chewing-0.011
ifying-0.011
 spaghetti-0.011
Token.
Token ID13
Feature activation+7.78e-03

Pos logit contributions
an+0.021
 island+0.020
 captain+0.019
 idea+0.018
 pilot+0.018
Neg logit contributions
 gloss-0.022
 licking-0.021
 chewing-0.021
ifying-0.020
 spaghetti-0.020
Token For
Token ID1114
Feature activation+9.47e-03

Pos logit contributions
 gloss+0.013
 licking+0.012
 exercising+0.011
 eating+0.011
 chewing+0.011
Neg logit contributions
 captain-0.013
an-0.012
 pilot-0.012
 idea-0.012
 man-0.012
Token "
Token ID366
Feature activation+1.39e-02

Pos logit contributions
 island+0.002
 captain+0.002
an+0.002
 idea+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.002
 licking-0.002
ifying-0.002
 chewing-0.002
 exercising-0.001
TokenThe
Token ID464
Feature activation-7.58e-03

Pos logit contributions
 gloss+0.021
ifying+0.019
 licking+0.019
 chewing+0.018
 Mitt+0.018
Neg logit contributions
an-0.028
 island-0.027
 captain-0.025
 idea-0.025
 anchor-0.025
Token Mystery
Token ID18208
Feature activation-2.23e-02

Pos logit contributions
an+0.012
 island+0.011
 captain+0.011
 pilot+0.010
 anchor+0.010
Neg logit contributions
 gloss-0.014
 licking-0.014
 chewing-0.014
 spaghetti-0.013
ifying-0.013
Token of
Token ID286
Feature activation-4.88e-04

Pos logit contributions
an+0.036
 island+0.035
 captain+0.032
 anchor+0.032
 idea+0.032
Neg logit contributions
 gloss-0.039
ifying-0.037
 licking-0.037
 spaghetti-0.036
 chewing-0.036
Token the
Token ID262
Feature activation-6.65e-03

Pos logit contributions
an+0.001
 island+0.001
 captain+0.001
 idea+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
 spaghetti-0.001
ifying-0.001
Token Bur
Token ID5481
Feature activation-4.12e-02

Pos logit contributions
an+0.012
 island+0.011
 captain+0.011
 pilot+0.010
 idea+0.010
Neg logit contributions
 gloss-0.011
 licking-0.011
 chewing-0.011
 spaghetti-0.010
ifying-0.010
Tokenied
Token ID798
Feature activation-1.39e-02

Pos logit contributions
 island+0.066
an+0.065
 captain+0.061
 idea+0.060
 anchor+0.059
Neg logit contributions
 gloss-0.068
ifying-0.066
 spaghetti-0.063
 chewing-0.062
 eating-0.061
Token Bone
Token ID21717
Feature activation-1.69e-02

Pos logit contributions
an+0.024
 island+0.023
 captain+0.022
 idea+0.021
 pilot+0.021
Neg logit contributions
ifying-0.022
 gloss-0.022
 spaghetti-0.022
 licking-0.021
 chewing-0.021
Token".
Token ID1911
Feature activation-6.37e-04

Pos logit contributions
an+0.029
 island+0.028
 captain+0.026
 idea+0.026
 anchor+0.025
Neg logit contributions
 gloss-0.029
ifying-0.027
 licking-0.027
 chewing-0.027
 spaghetti-0.027
Token She
Token ID1375
Feature activation-8.14e-03

Pos logit contributions
an+0.001
 island+0.001
 idea+0.001
 captain+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
ifying-0.001
 spaghetti-0.001
Token showed
Token ID3751
Feature activation-4.00e-03

Pos logit contributions
an+0.015
 island+0.014
 idea+0.013
 captain+0.013
 pilot+0.013
Neg logit contributions
 gloss-0.014
 licking-0.013
 chewing-0.013
 spaghetti-0.012
 Mitt-0.012
Token pretend
Token ID16614
Feature activation-5.59e-03

Pos logit contributions
an+0.028
 island+0.027
 captain+0.025
 pilot+0.025
 idea+0.024
Neg logit contributions
 gloss-0.020
 licking-0.019
 chewing-0.019
 spaghetti-0.018
 Mitt-0.018
Token they
Token ID484
Feature activation-6.71e-03

Pos logit contributions
 island+0.010
an+0.010
 captain+0.009
 idea+0.009
 pilot+0.009
Neg logit contributions
 gloss-0.009
ifying-0.009
 licking-0.009
 chewing-0.008
 Mitt-0.008
Token were
Token ID547
Feature activation-1.30e-02

Pos logit contributions
an+0.016
 island+0.014
 captain+0.014
 pilot+0.013
 anchor+0.013
Neg logit contributions
 gloss-0.008
 chewing-0.007
 eating-0.007
 licking-0.007
 Mitt-0.007
Token heroes
Token ID10281
Feature activation-2.56e-02

Pos logit contributions
an+0.019
 island+0.018
 captain+0.017
 idea+0.017
 pilot+0.016
Neg logit contributions
 gloss-0.026
 licking-0.025
 chewing-0.024
ifying-0.024
 spaghetti-0.024
Token and
Token ID290
Feature activation-1.03e-02

Pos logit contributions
an+0.047
 island+0.044
 idea+0.042
 captain+0.041
 anchor+0.041
Neg logit contributions
 gloss-0.040
 licking-0.039
 chewing-0.038
 spaghetti-0.038
ifying-0.037
Token villains
Token ID25239
Feature activation-4.10e-02

Pos logit contributions
 island+0.016
 captain+0.015
 pilot+0.015
an+0.015
 idea+0.014
Neg logit contributions
 gloss-0.020
 licking-0.019
ifying-0.019
 chewing-0.018
heddar-0.018
Token.
Token ID13
Feature activation+1.72e-03

Pos logit contributions
an+0.083
 island+0.079
 anchor+0.076
 pilot+0.075
 captain+0.075
Neg logit contributions
 gloss-0.054
 licking-0.054
 chewing-0.051
 eating-0.051
seek-0.049
Token Ben
Token ID3932
Feature activation-1.59e-02

Pos logit contributions
 gloss+0.003
 licking+0.002
 chewing+0.002
 choking+0.002
 choke+0.002
Neg logit contributions
 island-0.003
 idea-0.003
an-0.003
 captain-0.003
 landscape-0.003
Token had
Token ID550
Feature activation-1.22e-02

Pos logit contributions
an+0.032
 island+0.031
 captain+0.029
 pilot+0.029
 idea+0.029
Neg logit contributions
 chewing-0.021
 licking-0.021
 gloss-0.020
 spaghetti-0.020
 eating-0.020
Token a
Token ID257
Feature activation-1.46e-02

Pos logit contributions
 island+0.023
an+0.022
 captain+0.022
 pilot+0.021
 idea+0.020
Neg logit contributions
 gloss-0.020
 licking-0.018
ifying-0.018
 chewing-0.017
 spaghetti-0.017
Token stick
Token ID4859
Feature activation-2.62e-03

Pos logit contributions
an+0.023
 island+0.022
 captain+0.020
 pilot+0.020
 idea+0.020
Neg logit contributions
 gloss-0.027
 licking-0.026
 chewing-0.026
 spaghetti-0.025
ifying-0.025
Token?"
Token ID1701
Feature activation-2.18e-03

Pos logit contributions
an+0.039
 island+0.035
 captain+0.033
 idea+0.032
 pilot+0.032
Neg logit contributions
 chewing-0.038
 licking-0.037
 eating-0.037
 spaghetti-0.037
 Mitt-0.037
Token The
Token ID383
Feature activation-7.22e-03

Pos logit contributions
an+0.004
 island+0.004
 idea+0.004
 captain+0.004
 pilot+0.003
Neg logit contributions
 licking-0.003
 chewing-0.003
 Mitt-0.003
 gloss-0.003
 eating-0.003
Token voice
Token ID3809
Feature activation-1.94e-02

Pos logit contributions
an+0.008
 island+0.008
tha+0.007
 route+0.007
 idea+0.007
Neg logit contributions
 licking-0.015
 chewing-0.014
 Mitt-0.014
 eating-0.014
 choking-0.013
Token belonged
Token ID19611
Feature activation+4.40e-03

Pos logit contributions
an+0.038
 island+0.037
 landscape+0.035
 idea+0.033
 route+0.033
Neg logit contributions
 gloss-0.025
 licking-0.025
 chewing-0.022
 Mitt-0.021
 eating-0.021
Token to
Token ID284
Feature activation-1.04e-02

Pos logit contributions
 gloss+0.006
 licking+0.005
 chewing+0.005
ifying+0.005
 spaghetti+0.005
Neg logit contributions
an-0.010
 island-0.009
 captain-0.009
 idea-0.009
 pilot-0.009
Token G
Token ID402
Feature activation-4.09e-02

Pos logit contributions
an+0.024
 idea+0.022
 pilot+0.021
 island+0.021
ies+0.020
Neg logit contributions
 Mitt-0.012
 chewing-0.011
 eating-0.010
 licking-0.009
 much-0.008
Tokeniz
Token ID528
Feature activation-1.96e-02

Pos logit contributions
 island+0.080
 pilot+0.076
 captain+0.076
 idea+0.075
an+0.073
Neg logit contributions
 gloss-0.053
o-0.051
 chewing-0.050
 licking-0.047
onna-0.047
Tokenmo
Token ID5908
Feature activation+2.16e-03

Pos logit contributions
 island+0.034
an+0.032
 idea+0.032
 captain+0.032
 pilot+0.030
Neg logit contributions
 gloss-0.030
 licking-0.030
 chewing-0.029
ifying-0.028
o-0.028
Token's
Token ID338
Feature activation+3.67e-03

Pos logit contributions
 gloss+0.004
 licking+0.003
 chewing+0.003
ifying+0.003
 spaghetti+0.003
Neg logit contributions
an-0.004
 island-0.004
 captain-0.003
 idea-0.003
 pilot-0.003
Token friend
Token ID1545
Feature activation+9.67e-03

Pos logit contributions
 gloss+0.008
 licking+0.008
 chewing+0.008
 Mitt+0.008
 spaghetti+0.008
Neg logit contributions
an-0.004
 island-0.004
 captain-0.004
 idea-0.004
 pilot-0.004
Token,
Token ID11
Feature activation-6.67e-03

Pos logit contributions
 gloss+0.016
 licking+0.016
 chewing+0.015
ifying+0.015
 spaghetti+0.015
Neg logit contributions
an-0.017
 island-0.016
 captain-0.016
 idea-0.015
 pilot-0.015
Token pretend
Token ID16614
Feature activation-4.94e-03

Pos logit contributions
an+0.029
 island+0.027
 pilot+0.025
 captain+0.025
 idea+0.024
Neg logit contributions
 gloss-0.022
 licking-0.021
 chewing-0.021
 spaghetti-0.021
 Mitt-0.020
Token they
Token ID484
Feature activation-7.30e-03

Pos logit contributions
 island+0.009
an+0.008
 captain+0.008
 idea+0.008
 pilot+0.008
Neg logit contributions
 gloss-0.008
ifying-0.008
 licking-0.008
 chewing-0.007
 Mitt-0.007
Token were
Token ID547
Feature activation-1.37e-02

Pos logit contributions
an+0.016
 island+0.015
 captain+0.014
 pilot+0.014
 idea+0.014
Neg logit contributions
 gloss-0.009
 chewing-0.008
 licking-0.008
 eating-0.008
 Mitt-0.008
Token superheroes
Token ID40896
Feature activation-2.09e-02

Pos logit contributions
an+0.021
 island+0.020
 captain+0.019
 pilot+0.018
 idea+0.018
Neg logit contributions
 gloss-0.026
 licking-0.025
 chewing-0.025
ifying-0.024
 spaghetti-0.024
Token and
Token ID290
Feature activation-9.89e-03

Pos logit contributions
an+0.037
 island+0.035
 captain+0.033
 pilot+0.033
 idea+0.032
Neg logit contributions
 gloss-0.034
 licking-0.033
 chewing-0.032
 spaghetti-0.032
 eating-0.031
Token villains
Token ID25239
Feature activation-4.06e-02

Pos logit contributions
 captain+0.014
 island+0.014
 pilot+0.013
 idea+0.013
an+0.013
Neg logit contributions
 gloss-0.019
ifying-0.019
 licking-0.018
heddar-0.018
 chewing-0.018
Token.
Token ID13
Feature activation+6.97e-04

Pos logit contributions
an+0.079
 island+0.076
 anchor+0.071
 captain+0.071
 pilot+0.071
Neg logit contributions
 gloss-0.059
 licking-0.058
 chewing-0.055
 eating-0.054
seek-0.050
Token Sara
Token ID24799
Feature activation-5.43e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 chewing+0.001
ifying+0.001
 spaghetti+0.001
Neg logit contributions
an-0.001
 island-0.001
 idea-0.001
 captain-0.001
 pilot-0.001
Token was
Token ID373
Feature activation+4.18e-03

Pos logit contributions
an+0.010
 island+0.010
 captain+0.009
 idea+0.009
 pilot+0.009
Neg logit contributions
 gloss-0.008
 licking-0.008
 chewing-0.008
 spaghetti-0.007
 Mitt-0.007
Token Wonder
Token ID12902
Feature activation-2.71e-02

Pos logit contributions
 gloss+0.007
ifying+0.007
 spaghetti+0.007
 licking+0.007
heddar+0.007
Neg logit contributions
 captain-0.007
 pilot-0.007
an-0.007
 island-0.006
 idea-0.006
Token Woman
Token ID14381
Feature activation-2.50e-02

Pos logit contributions
an+0.050
 island+0.049
 captain+0.046
 pilot+0.045
 idea+0.044
Neg logit contributions
 gloss-0.042
 licking-0.041
 chewing-0.040
 Mitt-0.039
 spaghetti-0.039
Token let
Token ID1309
Feature activation+2.76e-03

Pos logit contributions
 spaghetti+0.009
 gloss+0.009
 Mitt+0.009
 licking+0.009
 chewing+0.009
Neg logit contributions
an-0.014
 island-0.013
 pilot-0.012
 idea-0.012
 captain-0.012
Token's
Token ID338
Feature activation-1.58e-03

Pos logit contributions
 Mitt+0.002
 gloss+0.001
 spaghetti+0.001
 Midnight+0.001
 chewing+0.001
Neg logit contributions
 island-0.008
 captain-0.008
an-0.007
 pilot-0.007
 crew-0.007
Token see
Token ID766
Feature activation+1.27e-03

Pos logit contributions
an+0.003
 island+0.003
 captain+0.003
 pilot+0.003
 idea+0.003
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
ifying-0.002
 spaghetti-0.002
Token what
Token ID644
Feature activation-7.85e-03

Pos logit contributions
 spaghetti+0.002
 chewing+0.002
 eating+0.002
 licking+0.002
 Mitt+0.002
Neg logit contributions
an-0.002
 captain-0.002
 idea-0.002
 island-0.002
 pilot-0.002
Token it
Token ID340
Feature activation-2.59e-04

Pos logit contributions
an+0.014
 captain+0.013
 island+0.013
 idea+0.012
 pilot+0.011
Neg logit contributions
 gloss-0.013
 spaghetti-0.013
 licking-0.012
 chewing-0.012
 eating-0.012
Token does
Token ID857
Feature activation-4.05e-02

Pos logit contributions
 captain+0.001
an+0.001
 island+0.001
 pilot+0.000
tha+0.000
Neg logit contributions
 gloss-0.000
 spaghetti-0.000
 chewing-0.000
 eating-0.000
 licking-0.000
Token,"
Token ID553
Feature activation+5.06e-03

Pos logit contributions
an+0.078
 pilot+0.070
 island+0.069
tha+0.066
 lamb+0.066
Neg logit contributions
 eating-0.060
 spaghetti-0.059
 chewing-0.058
 gloss-0.056
 much-0.056
Token Tom
Token ID4186
Feature activation-8.42e-04

Pos logit contributions
 gloss+0.005
 licking+0.005
 chewing+0.005
 spaghetti+0.004
 Mitt+0.004
Neg logit contributions
an-0.012
 island-0.012
 captain-0.012
 pilot-0.011
 idea-0.011
Token said
Token ID531
Feature activation-9.29e-03

Pos logit contributions
 island+0.002
an+0.001
 pilot+0.001
 captain+0.001
 anchor+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
 spaghetti-0.001
 Mitt-0.001
Token.
Token ID13
Feature activation-9.07e-03

Pos logit contributions
an+0.015
 island+0.014
 idea+0.014
 captain+0.014
 pilot+0.013
Neg logit contributions
 gloss-0.017
 licking-0.016
ifying-0.016
 chewing-0.016
 spaghetti-0.015
Token They
Token ID1119
Feature activation-1.54e-02

Pos logit contributions
 idea+0.015
 man+0.015
an+0.015
 captain+0.014
 boy+0.014
Neg logit contributions
ifying-0.015
 gloss-0.015
 licking-0.014
 chewing-0.014
 exercising-0.013
Token driver
Token ID4639
Feature activation-2.87e-03

Pos logit contributions
 gloss+0.020
 licking+0.019
 chewing+0.019
 Mitt+0.019
 spaghetti+0.019
Neg logit contributions
an-0.013
 island-0.012
 idea-0.011
 anchor-0.010
tha-0.010
Token and
Token ID290
Feature activation+4.03e-03

Pos logit contributions
an+0.005
 island+0.005
 captain+0.005
 idea+0.005
 pilot+0.005
Neg logit contributions
 gloss-0.004
 licking-0.004
 chewing-0.004
ifying-0.004
 spaghetti-0.004
Token Anna
Token ID11735
Feature activation-2.43e-03

Pos logit contributions
 gloss+0.005
 licking+0.005
 chewing+0.005
 spaghetti+0.005
 Mitt+0.005
Neg logit contributions
an-0.009
 island-0.008
 idea-0.008
 captain-0.008
 pilot-0.008
Token was
Token ID373
Feature activation+9.13e-05

Pos logit contributions
an+0.005
 island+0.005
 captain+0.004
 idea+0.004
 anchor+0.004
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
 eating-0.003
 Mitt-0.003
Token the
Token ID262
Feature activation+6.41e-03

Pos logit contributions
 gloss+0.000
 licking+0.000
 chewing+0.000
ifying+0.000
 spaghetti+0.000
Neg logit contributions
an-0.000
 island-0.000
 captain-0.000
 pilot-0.000
 idea-0.000
Token navig
Token ID20436
Feature activation-4.04e-02

Pos logit contributions
 gloss+0.013
 licking+0.013
 chewing+0.012
 Mitt+0.012
ifying+0.012
Neg logit contributions
an-0.009
 island-0.008
 idea-0.007
 anchor-0.007
 captain-0.006
Tokenator
Token ID1352
Feature activation-2.66e-03

Pos logit contributions
 island+0.044
 idea+0.041
 deer+0.040
an+0.039
 captain+0.038
Neg logit contributions
ifying-0.084
 gloss-0.083
 licking-0.082
stand-0.080
 chewing-0.080
Token.
Token ID13
Feature activation-9.55e-04

Pos logit contributions
an+0.005
 island+0.005
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.004
 licking-0.004
 chewing-0.004
ifying-0.004
 spaghetti-0.004
Token They
Token ID1119
Feature activation-1.64e-02

Pos logit contributions
 man+0.002
 idea+0.002
an+0.002
 island+0.002
 captain+0.002
Neg logit contributions
 gloss-0.001
 chewing-0.001
ifying-0.001
 licking-0.001
 exercising-0.001
Token put
Token ID1234
Feature activation-1.77e-03

Pos logit contributions
an+0.040
 island+0.033
 idea+0.033
rier+0.032
 Voltage+0.030
Neg logit contributions
 Mitt-0.016
 eating-0.016
 chewing-0.016
ifying-0.016
o-0.016
Token on
Token ID319
Feature activation-9.48e-03

Pos logit contributions
an+0.004
 island+0.004
 captain+0.003
 pilot+0.003
 idea+0.003
Neg logit contributions
 gloss-0.002
 licking-0.002
 spaghetti-0.002
 Mitt-0.002
 chewing-0.002
Token was
Token ID373
Feature activation-4.46e-03

Pos logit contributions
 gloss+0.006
 licking+0.006
 chewing+0.006
 Mitt+0.006
 eating+0.006
Neg logit contributions
an-0.007
 island-0.006
 captain-0.006
 pilot-0.006
 anchor-0.005
Token a
Token ID257
Feature activation+3.27e-03

Pos logit contributions
 island+0.007
 idea+0.007
an+0.006
 captain+0.006
 pilot+0.006
Neg logit contributions
 gloss-0.008
 licking-0.008
ifying-0.008
 chewing-0.007
 choke-0.007
Token z
Token ID1976
Feature activation-1.59e-02

Pos logit contributions
 gloss+0.007
 chewing+0.007
 licking+0.007
 Mitt+0.007
 spaghetti+0.007
Neg logit contributions
an-0.004
 island-0.004
 idea-0.003
 captain-0.003
ered-0.003
Tokenebra
Token ID37052
Feature activation-7.25e-03

Pos logit contributions
 idea+0.018
 island+0.018
an+0.016
 landscape+0.015
 captain+0.015
Neg logit contributions
 gloss-0.033
 chewing-0.032
 licking-0.032
ifying-0.032
 Mitt-0.030
Token named
Token ID3706
Feature activation-1.06e-03

Pos logit contributions
an+0.015
 island+0.014
 captain+0.014
 pilot+0.013
 idea+0.013
Neg logit contributions
 gloss-0.011
 licking-0.010
 chewing-0.010
ifying-0.010
 spaghetti-0.009
Token Stri
Token ID26137
Feature activation-4.04e-02

Pos logit contributions
an+0.002
 island+0.002
 idea+0.002
 captain+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.002
 licking-0.002
 Mitt-0.002
 chewing-0.002
 spaghetti-0.002
Tokenpe
Token ID431
Feature activation-2.10e-02

Pos logit contributions
 idea+0.049
 island+0.048
an+0.045
 captain+0.044
 pilot+0.044
Neg logit contributions
 licking-0.082
 chewing-0.082
 gloss-0.079
stand-0.079
 eating-0.078
Tokeny
Token ID88
Feature activation-3.70e-04

Pos logit contributions
an+0.036
 island+0.035
 captain+0.033
 idea+0.032
 pilot+0.032
Neg logit contributions
 gloss-0.035
 licking-0.034
 chewing-0.033
ifying-0.033
 eating-0.032
Token.
Token ID13
Feature activation+4.05e-03

Pos logit contributions
an+0.001
 island+0.001
 idea+0.001
 captain+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
 Mitt-0.001
 spaghetti-0.001
Token He
Token ID679
Feature activation+4.35e-04

Pos logit contributions
 gloss+0.007
 licking+0.007
ifying+0.006
 chewing+0.006
 spaghetti+0.006
Neg logit contributions
an-0.007
 island-0.007
 captain-0.006
 idea-0.006
 pilot-0.006
Token was
Token ID373
Feature activation+8.28e-03

Pos logit contributions
 chewing+0.001
 licking+0.001
 gloss+0.001
 eating+0.001
 Mitt+0.001
Neg logit contributions
an-0.001
 island-0.001
 idea-0.001
 captain-0.001
 anchor-0.001
Token where
Token ID810
Feature activation-9.38e-03

Pos logit contributions
 gloss+0.013
 licking+0.012
 chewing+0.012
ifying+0.011
 spaghetti+0.011
Neg logit contributions
an-0.016
 island-0.016
 captain-0.015
 idea-0.015
 pilot-0.015
Token the
Token ID262
Feature activation-3.99e-03

Pos logit contributions
an+0.018
 island+0.017
 captain+0.016
 idea+0.015
 pilot+0.015
Neg logit contributions
 gloss-0.015
 licking-0.014
 Mitt-0.014
 chewing-0.014
 spaghetti-0.014
Token arrow
Token ID15452
Feature activation-1.41e-02

Pos logit contributions
an+0.005
 island+0.005
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.008
 chewing-0.008
 licking-0.008
 spaghetti-0.008
ifying-0.008
Token had
Token ID550
Feature activation-1.15e-02

Pos logit contributions
an+0.028
 island+0.025
 captain+0.024
 idea+0.024
 crew+0.023
Neg logit contributions
 gloss-0.021
 chewing-0.020
 licking-0.020
 eating-0.020
 choking-0.017
Token come
Token ID1282
Feature activation-1.40e-02

Pos logit contributions
an+0.026
 island+0.024
 captain+0.023
 pilot+0.023
 idea+0.023
Neg logit contributions
 licking-0.014
 chewing-0.014
 gloss-0.014
 eating-0.013
 spaghetti-0.012
Token from
Token ID422
Feature activation-3.99e-02

Pos logit contributions
an+0.026
 island+0.025
 captain+0.024
 pilot+0.023
 idea+0.023
Neg logit contributions
 gloss-0.021
 licking-0.020
ifying-0.020
 chewing-0.020
 Mitt-0.019
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
an+0.087
 captain+0.082
 pilot+0.079
 island+0.078
 crew+0.077
Neg logit contributions
 chewing-0.047
 eating-0.046
 licking-0.046
 Mitt-0.045
 spaghetti-0.045
Token She
Token ID1375
Feature activation-4.73e-03

Pos logit contributions
an+0.028
 island+0.028
 idea+0.028
 captain+0.027
 pilot+0.027
Neg logit contributions
 gloss-0.015
 licking-0.015
ifying-0.014
 chewing-0.014
 spaghetti-0.013
Token saw
Token ID2497
Feature activation-1.66e-03

Pos logit contributions
an+0.009
 island+0.008
 idea+0.008
 captain+0.008
 pilot+0.008
Neg logit contributions
 gloss-0.007
 licking-0.007
 chewing-0.007
 Mitt-0.007
 eating-0.007
Token a
Token ID257
Feature activation-2.74e-03

Pos logit contributions
an+0.003
 island+0.003
 captain+0.003
 idea+0.003
 pilot+0.003
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
ifying-0.002
 spaghetti-0.002
Token smiling
Token ID16755
Feature activation-7.78e-03

Pos logit contributions
an+0.004
 island+0.004
 captain+0.003
 idea+0.003
 pilot+0.003
Neg logit contributions
 gloss-0.006
 licking-0.005
 chewing-0.005
ifying-0.005
 spaghetti-0.005
Token leaves
Token ID5667
Feature activation+8.80e-04

Pos logit contributions
an+0.021
 captain+0.021
 island+0.020
 pilot+0.020
 idea+0.019
Neg logit contributions
 licking-0.017
 gloss-0.017
 chewing-0.017
 eating-0.017
 Mitt-0.017
Token look
Token ID804
Feature activation+6.21e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 chewing+0.001
ifying+0.001
 eating+0.001
Neg logit contributions
an-0.002
 island-0.001
 captain-0.001
 pilot-0.001
 idea-0.001
Token tasty
Token ID25103
Feature activation+4.37e-03

Pos logit contributions
 gloss+0.011
 licking+0.011
 chewing+0.010
ifying+0.010
 Mitt+0.010
Neg logit contributions
an-0.010
 island-0.010
 captain-0.009
 idea-0.009
 pilot-0.009
Token,"
Token ID553
Feature activation+3.79e-03

Pos logit contributions
 gloss+0.009
 licking+0.009
 chewing+0.009
ifying+0.008
 spaghetti+0.008
Neg logit contributions
an-0.006
 island-0.006
 captain-0.005
 idea-0.005
 pilot-0.005
Token said
Token ID531
Feature activation-1.09e-02

Pos logit contributions
 Mitt+0.006
 licking+0.006
 gloss+0.006
 chewing+0.006
 spaghetti+0.006
Neg logit contributions
 island-0.007
an-0.007
 captain-0.006
 idea-0.006
 pilot-0.006
Token Stri
Token ID26137
Feature activation-3.97e-02

Pos logit contributions
an+0.022
 island+0.022
 idea+0.020
 captain+0.020
 pilot+0.019
Neg logit contributions
 gloss-0.016
 Mitt-0.015
 licking-0.015
 chewing-0.015
 spaghetti-0.014
Tokenpe
Token ID431
Feature activation-1.89e-02

Pos logit contributions
 island+0.044
 idea+0.040
 captain+0.038
 pilot+0.038
an+0.037
Neg logit contributions
 licking-0.087
 chewing-0.087
 gloss-0.086
 Mitt-0.081
 Midnight-0.081
Tokeny
Token ID88
Feature activation+2.69e-03

Pos logit contributions
 island+0.026
an+0.026
 captain+0.024
 pilot+0.023
 idea+0.023
Neg logit contributions
 gloss-0.037
 licking-0.036
 chewing-0.035
ifying-0.034
 eating-0.034
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 gloss+0.005
 licking+0.005
ifying+0.005
 chewing+0.005
 spaghetti+0.004
Neg logit contributions
an-0.004
 island-0.004
 captain-0.004
 idea-0.004
 pilot-0.004
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
an+0.023
 island+0.023
 captain+0.022
 idea+0.021
 pilot+0.021
Neg logit contributions
 gloss-0.014
 licking-0.014
 chewing-0.013
 Mitt-0.013
ifying-0.013
Token
Token ID198
Feature activation-1.19e-02

Pos logit contributions
 island+0.029
an+0.029
 captain+0.028
 pilot+0.027
 idea+0.027
Neg logit contributions
 gloss-0.011
 licking-0.011
 Mitt-0.011
 chewing-0.011
 spaghetti-0.010
Token keep
Token ID1394
Feature activation+5.70e-03

Pos logit contributions
 Mitt+0.003
 gloss+0.003
 chewing+0.003
 Midnight+0.003
 licking+0.003
Neg logit contributions
 island-0.004
an-0.004
 idea-0.004
 captain-0.004
 pilot-0.003
Token going
Token ID1016
Feature activation-4.30e-03

Pos logit contributions
 gloss+0.010
 licking+0.009
ifying+0.009
 chewing+0.009
 spaghetti+0.008
Neg logit contributions
an-0.010
 island-0.010
 captain-0.009
 pilot-0.009
 idea-0.009
Token."
Token ID526
Feature activation-6.77e-03

Pos logit contributions
an+0.008
 island+0.007
 idea+0.007
 captain+0.007
 pilot+0.007
Neg logit contributions
 gloss-0.007
 chewing-0.007
 licking-0.006
 Mitt-0.006
ifying-0.006
Token Max
Token ID5436
Feature activation-8.93e-03

Pos logit contributions
an+0.013
 island+0.012
 captain+0.012
 idea+0.012
 pilot+0.012
Neg logit contributions
 gloss-0.010
ifying-0.010
 licking-0.009
 chewing-0.009
 spaghetti-0.009
Token and
Token ID290
Feature activation-2.44e-03

Pos logit contributions
 island+0.016
an+0.015
 pilot+0.014
 captain+0.014
 anchor+0.014
Neg logit contributions
 gloss-0.015
 licking-0.014
 Mitt-0.014
 chewing-0.014
 spaghetti-0.013
Token Scot
Token ID7130
Feature activation-3.96e-02

Pos logit contributions
 island+0.005
an+0.005
 idea+0.005
 pilot+0.004
 captain+0.004
Neg logit contributions
 Mitt-0.004
 gloss-0.003
 licking-0.003
 chewing-0.003
 eating-0.003
Tokenty
Token ID774
Feature activation-1.78e-02

Pos logit contributions
 island+0.084
 landscape+0.083
an+0.080
 idea+0.079
 pilot+0.079
Neg logit contributions
 gloss-0.041
o-0.039
 Midnight-0.037
 Mitt-0.036
seek-0.036
Token kept
Token ID4030
Feature activation-3.96e-04

Pos logit contributions
an+0.039
 island+0.036
 pilot+0.035
 idea+0.034
rier+0.034
Neg logit contributions
 Mitt-0.021
 gloss-0.021
 licking-0.018
 chewing-0.017
�-0.017
Token walking
Token ID6155
Feature activation-1.08e-02

Pos logit contributions
an+0.001
 island+0.001
 captain+0.001
 idea+0.001
 pilot+0.001
Neg logit contributions
 licking-0.000
 gloss-0.000
 chewing-0.000
 Mitt-0.000
 eating-0.000
Token,
Token ID11
Feature activation-5.16e-03

Pos logit contributions
an+0.024
 island+0.022
 pilot+0.021
 anchor+0.021
 captain+0.020
Neg logit contributions
 licking-0.012
 Mitt-0.012
 chewing-0.012
 eating-0.011
 much-0.011
Token but
Token ID475
Feature activation-1.20e-02

Pos logit contributions
an+0.008
 island+0.008
 captain+0.008
 idea+0.007
 pilot+0.007
Neg logit contributions
 gloss-0.009
 licking-0.009
ifying-0.009
 chewing-0.009
 spaghetti-0.008
Token wonders
Token ID19294
Feature activation-2.03e-03

Pos logit contributions
an+0.012
 island+0.011
 captain+0.010
 idea+0.010
 pilot+0.010
Neg logit contributions
 gloss-0.016
 licking-0.015
 chewing-0.015
ifying-0.015
 spaghetti-0.014
Token what
Token ID644
Feature activation+1.37e-03

Pos logit contributions
an+0.004
 island+0.004
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
ifying-0.003
 spaghetti-0.003
Token dinosaur
Token ID28257
Feature activation-4.48e-04

Pos logit contributions
 gloss+0.003
ifying+0.002
 licking+0.002
 chewing+0.002
 Mitt+0.002
Neg logit contributions
 island-0.002
an-0.002
 captain-0.002
 idea-0.002
 pilot-0.002
Token it
Token ID340
Feature activation+5.70e-03

Pos logit contributions
an+0.001
 island+0.001
 captain+0.001
 idea+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
ifying-0.001
 spaghetti-0.001
Token came
Token ID1625
Feature activation-1.15e-02

Pos logit contributions
 gloss+0.010
 chewing+0.010
 licking+0.010
 spaghetti+0.009
 eating+0.009
Neg logit contributions
an-0.009
 island-0.009
 captain-0.009
 pilot-0.009
 anchor-0.008
Token from
Token ID422
Feature activation-3.96e-02

Pos logit contributions
an+0.021
 island+0.020
 captain+0.019
 pilot+0.019
 idea+0.019
Neg logit contributions
 gloss-0.018
 licking-0.018
 chewing-0.017
ifying-0.017
 Mitt-0.017
Token and
Token ID290
Feature activation+1.36e-02

Pos logit contributions
an+0.079
 captain+0.072
 island+0.071
 pilot+0.071
 anchor+0.066
Neg logit contributions
 chewing-0.056
 licking-0.056
 eating-0.054
 spaghetti-0.052
 gloss-0.050
Token how
Token ID703
Feature activation-8.16e-03

Pos logit contributions
 gloss+0.027
ifying+0.025
 spaghetti+0.024
 Mitt+0.024
 licking+0.024
Neg logit contributions
 island-0.020
an-0.019
 captain-0.019
 pilot-0.018
 anchor-0.017
Token it
Token ID340
Feature activation+8.39e-03

Pos logit contributions
an+0.012
 island+0.012
 captain+0.011
 idea+0.011
 pilot+0.011
Neg logit contributions
 gloss-0.016
 licking-0.015
 chewing-0.015
ifying-0.015
 spaghetti-0.014
Token lived
Token ID5615
Feature activation-2.67e-02

Pos logit contributions
 gloss+0.015
 licking+0.014
 chewing+0.014
 spaghetti+0.014
ifying+0.014
Neg logit contributions
an-0.014
 island-0.013
 captain-0.012
 pilot-0.012
 idea-0.012
Token.
Token ID13
Feature activation+4.64e-03

Pos logit contributions
an+0.052
 island+0.049
 captain+0.048
 pilot+0.048
 idea+0.047
Neg logit contributions
 chewing-0.038
 licking-0.037
 gloss-0.037
 eating-0.036
 spaghetti-0.035
Token and
Token ID290
Feature activation+4.18e-03

Pos logit contributions
an+0.012
 island+0.012
 captain+0.011
 idea+0.011
 pilot+0.011
Neg logit contributions
 gloss-0.012
 licking-0.011
 chewing-0.011
ifying-0.011
 spaghetti-0.010
Token helping
Token ID5742
Feature activation-2.78e-03

Pos logit contributions
 gloss+0.008
 licking+0.008
 chewing+0.007
ifying+0.007
 spaghetti+0.007
Neg logit contributions
an-0.007
 island-0.006
 captain-0.006
 idea-0.006
 pilot-0.006
Token each
Token ID1123
Feature activation-9.71e-04

Pos logit contributions
an+0.004
 island+0.004
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.005
 licking-0.005
 chewing-0.005
ifying-0.005
 spaghetti-0.005
Token other
Token ID584
Feature activation-2.34e-03

Pos logit contributions
an+0.002
 island+0.001
 captain+0.001
 idea+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
 Mitt-0.002
 spaghetti-0.002
Token."
Token ID526
Feature activation-2.91e-03

Pos logit contributions
an+0.004
 island+0.004
 captain+0.004
 pilot+0.004
 idea+0.004
Neg logit contributions
 gloss-0.004
 licking-0.004
 chewing-0.004
ifying-0.004
 spaghetti-0.004
Token Big
Token ID4403
Feature activation-3.95e-02

Pos logit contributions
an+0.007
 island+0.007
 idea+0.006
 captain+0.006
 pilot+0.006
Neg logit contributions
 Mitt-0.003
 gloss-0.003
 licking-0.003
 chewing-0.003
 Midnight-0.003
Tokengie
Token ID22699
Feature activation-2.52e-02

Pos logit contributions
 island+0.088
 idea+0.083
 captain+0.083
 pilot+0.081
 traveler+0.077
Neg logit contributions
 Mitt-0.050
 chewing-0.048
 licking-0.045
o-0.044
 gloss-0.041
Token and
Token ID290
Feature activation+7.64e-03

Pos logit contributions
 island+0.041
an+0.040
 captain+0.037
 pilot+0.037
 idea+0.037
Neg logit contributions
 gloss-0.044
 licking-0.043
 chewing-0.042
 Mitt-0.041
 eating-0.040
Token his
Token ID465
Feature activation-3.91e-03

Pos logit contributions
 Mitt+0.011
 licking+0.010
 chewing+0.010
 gloss+0.009
 eating+0.009
Neg logit contributions
an-0.016
 island-0.015
 idea-0.014
 route-0.013
tha-0.013
Token friends
Token ID2460
Feature activation-1.09e-02

Pos logit contributions
 island+0.006
an+0.006
mith+0.006
 captain+0.006
 pilot+0.006
Neg logit contributions
 licking-0.006
 chewing-0.006
 Mitt-0.006
 gloss-0.006
 eating-0.006
Token listened
Token ID16399
Feature activation-9.50e-03

Pos logit contributions
an+0.023
 island+0.021
 captain+0.019
 pilot+0.019
 idea+0.019
Neg logit contributions
 gloss-0.014
 licking-0.014
 chewing-0.013
 eating-0.013
 Mitt-0.012
Token finally
Token ID3443
Feature activation+2.18e-03

Pos logit contributions
 gloss+0.017
 licking+0.016
 chewing+0.016
ifying+0.016
 spaghetti+0.015
Neg logit contributions
an-0.021
 island-0.019
 captain-0.018
 idea-0.018
 pilot-0.018
Token left
Token ID1364
Feature activation-3.32e-03

Pos logit contributions
 gloss+0.003
 licking+0.003
 chewing+0.003
 eating+0.003
 Mitt+0.003
Neg logit contributions
an-0.005
 island-0.004
 captain-0.004
 pilot-0.004
 anchor-0.004
Token the
Token ID262
Feature activation-4.39e-03

Pos logit contributions
an+0.006
 island+0.006
 captain+0.006
 pilot+0.006
 idea+0.006
Neg logit contributions
 licking-0.005
 chewing-0.005
 Mitt-0.005
 eating-0.005
 gloss-0.005
Token shop
Token ID6128
Feature activation-6.71e-03

Pos logit contributions
an+0.007
 island+0.007
 captain+0.006
 idea+0.006
 pilot+0.006
Neg logit contributions
 gloss-0.008
 licking-0.008
 chewing-0.008
ifying-0.008
 spaghetti-0.007
Token,
Token ID11
Feature activation-3.03e-03

Pos logit contributions
an+0.012
 island+0.011
 captain+0.010
 idea+0.010
 pilot+0.010
Neg logit contributions
 gloss-0.011
 licking-0.011
 chewing-0.011
ifying-0.010
 Mitt-0.010
Token Cornel
Token ID48214
Feature activation-3.94e-02

Pos logit contributions
an+0.006
 island+0.006
 idea+0.005
 captain+0.005
 pilot+0.005
Neg logit contributions
 gloss-0.004
 Mitt-0.004
 licking-0.004
 chewing-0.004
ifying-0.004
Tokenius
Token ID3754
Feature activation-5.36e-03

Pos logit contributions
 pilot+0.061
 idea+0.059
 passengers+0.057
 island+0.056
 route+0.054
Neg logit contributions
 chewing-0.056
 licking-0.055
ifying-0.052
heddar-0.051
o-0.051
Token was
Token ID373
Feature activation-2.15e-03

Pos logit contributions
an+0.009
 island+0.009
 captain+0.009
 idea+0.009
 pilot+0.008
Neg logit contributions
 gloss-0.009
 licking-0.009
 chewing-0.008
ifying-0.008
 spaghetti-0.008
Token feeling
Token ID4203
Feature activation+8.47e-03

Pos logit contributions
an+0.004
 island+0.004
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.003
 licking-0.003
ifying-0.003
 spaghetti-0.003
 chewing-0.003
Token much
Token ID881
Feature activation+6.14e-03

Pos logit contributions
 gloss+0.014
 licking+0.013
ifying+0.012
 chewing+0.012
heddar+0.012
Neg logit contributions
 island-0.014
 captain-0.014
 idea-0.014
an-0.014
 pilot-0.014
Token better
Token ID1365
Feature activation-4.47e-03

Pos logit contributions
 gloss+0.010
ifying+0.009
 licking+0.009
 Mitt+0.008
 chewing+0.008
Neg logit contributions
 idea-0.011
 island-0.011
an-0.011
 captain-0.011
 pilot-0.010
Token cool
Token ID3608
Feature activation+8.56e-05

Pos logit contributions
 idea+0.004
 island+0.004
an+0.004
 captain+0.003
 anchor+0.003
Neg logit contributions
 gloss-0.006
 licking-0.006
ifying-0.006
 chewing-0.006
stand-0.006
Token.
Token ID13
Feature activation+1.11e-03

Pos logit contributions
 gloss+0.000
 licking+0.000
ifying+0.000
 chewing+0.000
 Mitt+0.000
Neg logit contributions
an-0.000
 island-0.000
 idea-0.000
 captain-0.000
 pilot-0.000
Token
Token ID220
Feature activation+8.13e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 chewing+0.001
 spaghetti+0.001
ifying+0.001
Neg logit contributions
an-0.002
 island-0.002
 captain-0.002
 pilot-0.002
 idea-0.002
Token
Token ID198
Feature activation-2.08e-03

Pos logit contributions
 gloss+0.012
 licking+0.011
 chewing+0.011
ifying+0.011
 spaghetti+0.011
Neg logit contributions
an-0.016
 island-0.015
 captain-0.015
 idea-0.014
 pilot-0.014
Token
Token ID198
Feature activation+1.94e-03

Pos logit contributions
an+0.005
 island+0.005
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.003
 licking-0.002
 chewing-0.002
 Mitt-0.002
ifying-0.002
TokenFar
Token ID21428
Feature activation-3.93e-02

Pos logit contributions
 gloss+0.002
 licking+0.002
 chewing+0.002
ifying+0.002
 Mitt+0.002
Neg logit contributions
an-0.004
 island-0.004
 idea-0.004
 captain-0.004
 anchor-0.004
Tokenley
Token ID1636
Feature activation-1.77e-02

Pos logit contributions
 island+0.055
 idea+0.055
 captain+0.050
 anchor+0.050
an+0.048
Neg logit contributions
ifying-0.073
 licking-0.069
 Midnight-0.066
 gloss-0.066
 chewing-0.065
Token was
Token ID373
Feature activation+2.09e-03

Pos logit contributions
an+0.030
 island+0.030
 idea+0.028
 captain+0.028
 pilot+0.027
Neg logit contributions
 gloss-0.030
 licking-0.029
 chewing-0.028
 Mitt-0.027
 spaghetti-0.027
Token so
Token ID523
Feature activation-2.23e-03

Pos logit contributions
 gloss+0.003
ifying+0.003
 spaghetti+0.003
 licking+0.003
heddar+0.003
Neg logit contributions
an-0.004
 island-0.004
 captain-0.004
 idea-0.004
 pilot-0.003
Token happy
Token ID3772
Feature activation+8.58e-03

Pos logit contributions
an+0.004
 island+0.003
 captain+0.003
 idea+0.003
 pilot+0.003
Neg logit contributions
 gloss-0.004
 licking-0.004
 chewing-0.004
ifying-0.004
 spaghetti-0.004
Token to
Token ID284
Feature activation-6.43e-03

Pos logit contributions
 gloss+0.014
ifying+0.014
 licking+0.013
heddar+0.013
 Midnight+0.012
Neg logit contributions
 island-0.015
an-0.015
 captain-0.014
 idea-0.014
 man-0.013
Token she
Token ID673
Feature activation+1.07e-02

Pos logit contributions
 island+0.004
an+0.004
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.003
ifying-0.003
 licking-0.003
 chewing-0.003
 spaghetti-0.003
Token got
Token ID1392
Feature activation+2.56e-03

Pos logit contributions
 gloss+0.015
 licking+0.014
 chewing+0.014
 Mitt+0.013
 eating+0.013
Neg logit contributions
an-0.022
 island-0.021
 idea-0.021
 captain-0.020
 anchor-0.020
Token rid
Token ID5755
Feature activation+1.27e-02

Pos logit contributions
 gloss+0.004
ifying+0.004
 licking+0.004
heddar+0.004
 chewing+0.004
Neg logit contributions
an-0.004
 captain-0.004
 island-0.004
 idea-0.004
 pilot-0.004
Token of
Token ID286
Feature activation+7.39e-04

Pos logit contributions
 gloss+0.018
 licking+0.018
 chewing+0.017
 spaghetti+0.017
ifying+0.017
Neg logit contributions
an-0.025
 island-0.023
 captain-0.023
 idea-0.022
 pilot-0.022
Token the
Token ID262
Feature activation-1.59e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
ifying+0.001
 chewing+0.001
 spaghetti+0.001
Neg logit contributions
an-0.001
 island-0.001
 captain-0.001
 idea-0.001
 pilot-0.001
Token intr
Token ID9913
Feature activation-3.93e-02

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 idea+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
 spaghetti-0.003
ifying-0.003
Tokenuders
Token ID48739
Feature activation-1.12e-02

Pos logit contributions
an+0.048
 island+0.047
 captain+0.045
 idea+0.043
 pilot+0.043
Neg logit contributions
 licking-0.082
 gloss-0.082
 chewing-0.081
 eating-0.077
ifying-0.076
Token.
Token ID13
Feature activation+6.15e-03

Pos logit contributions
an+0.018
 island+0.018
 captain+0.017
 pilot+0.016
 idea+0.016
Neg logit contributions
 gloss-0.020
 licking-0.019
 chewing-0.019
ifying-0.018
 spaghetti-0.018
Token She
Token ID1375
Feature activation+1.94e-05

Pos logit contributions
 gloss+0.009
ifying+0.009
 exercising+0.008
 licking+0.008
 chewing+0.008
Neg logit contributions
 man-0.010
 idea-0.010
an-0.010
 island-0.010
 boy-0.010
Token did
Token ID750
Feature activation-6.26e-03

Pos logit contributions
 gloss+0.000
 licking+0.000
 chewing+0.000
 Mitt+0.000
 spaghetti+0.000
Neg logit contributions
an-0.000
 island-0.000
 idea-0.000
 captain-0.000
 pilot-0.000
Token not
Token ID407
Feature activation+1.85e-02

Pos logit contributions
 idea+0.011
an+0.011
 island+0.011
 captain+0.010
 pilot+0.010
Neg logit contributions
 gloss-0.010
ifying-0.010
 licking-0.009
heddar-0.009
 chewing-0.009
Token!
Token ID0
Feature activation+1.26e-03

Pos logit contributions
 gloss+0.013
ifying+0.012
 licking+0.012
 chewing+0.012
 Mitt+0.012
Neg logit contributions
an-0.012
 island-0.012
 captain-0.011
 idea-0.010
 pilot-0.010
Token
Token ID198
Feature activation+2.22e-03

Pos logit contributions
 gloss+0.002
 licking+0.002
 chewing+0.002
ifying+0.002
 spaghetti+0.002
Neg logit contributions
an-0.002
 island-0.002
 captain-0.002
 idea-0.002
 pilot-0.002
Token
Token ID198
Feature activation+4.38e-03

Pos logit contributions
 gloss+0.003
 licking+0.002
 chewing+0.002
 Mitt+0.002
 spaghetti+0.002
Neg logit contributions
an-0.005
 island-0.005
 captain-0.005
 pilot-0.005
 idea-0.005
TokenThe
Token ID464
Feature activation-1.38e-03

Pos logit contributions
 gloss+0.006
 licking+0.006
 chewing+0.005
 Mitt+0.005
ifying+0.005
Neg logit contributions
an-0.009
 island-0.009
 captain-0.009
 idea-0.009
 pilot-0.008
Token moral
Token ID6573
Feature activation-6.31e-04

Pos logit contributions
ered+0.002
tha+0.002
an+0.002
yl+0.002
 inspection+0.002
Neg logit contributions
 spaghetti-0.002
 sauce-0.002
 chewing-0.002
 eating-0.002
 pizza-0.002
Token of
Token ID286
Feature activation+2.39e-02

Pos logit contributions
an+0.001
 island+0.001
 captain+0.001
 pilot+0.001
 idea+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
ifying-0.001
 spaghetti-0.001
Token the
Token ID262
Feature activation+2.86e-03

Pos logit contributions
 gloss+0.044
 licking+0.042
 chewing+0.041
ifying+0.041
 spaghetti+0.040
Neg logit contributions
an-0.038
 island-0.037
 captain-0.035
 idea-0.034
 pilot-0.033
Token story
Token ID1621
Feature activation-3.64e-03

Pos logit contributions
 spaghetti+0.004
 eating+0.004
 chewing+0.004
 sauce+0.003
 Mitt+0.003
Neg logit contributions
an-0.006
 captain-0.005
tha-0.005
 pilot-0.005
ener-0.005
Token is
Token ID318
Feature activation+1.11e-02

Pos logit contributions
an+0.006
 island+0.006
 captain+0.006
 pilot+0.006
 anchor+0.006
Neg logit contributions
 gloss-0.006
 licking-0.006
 chewing-0.005
ifying-0.005
 eating-0.005
Token to
Token ID284
Feature activation+5.18e-03

Pos logit contributions
 gloss+0.016
 licking+0.015
ifying+0.014
 spaghetti+0.014
 chewing+0.014
Neg logit contributions
an-0.023
 island-0.022
 captain-0.021
 idea-0.021
 anchor-0.020
Token not
Token ID407
Feature activation+6.30e-03

Pos logit contributions
 gloss+0.008
 licking+0.008
 chewing+0.008
ifying+0.008
 spaghetti+0.008
Neg logit contributions
an-0.009
 island-0.009
 captain-0.009
 pilot-0.008
 idea-0.008
Token happy
Token ID3772
Feature activation+5.69e-03

Pos logit contributions
an+0.019
 island+0.018
 captain+0.017
 idea+0.016
 pilot+0.016
Neg logit contributions
 gloss-0.016
 licking-0.015
 chewing-0.015
 Mitt-0.014
ifying-0.014
Token and
Token ID290
Feature activation+3.38e-03

Pos logit contributions
 gloss+0.009
ifying+0.009
 licking+0.008
 spaghetti+0.008
 chewing+0.008
Neg logit contributions
 island-0.010
an-0.010
 captain-0.010
 idea-0.009
 pilot-0.009
Token they
Token ID484
Feature activation-1.02e-03

Pos logit contributions
 gloss+0.006
 licking+0.006
 chewing+0.005
ifying+0.005
 spaghetti+0.005
Neg logit contributions
an-0.006
 island-0.006
 captain-0.005
 idea-0.005
 pilot-0.005
Token lived
Token ID5615
Feature activation-1.36e-02

Pos logit contributions
an+0.002
 island+0.002
 idea+0.002
 route+0.002
 captain+0.002
Neg logit contributions
 Mitt-0.001
 gloss-0.001
 Midnight-0.001
ifying-0.001
 licking-0.001
Token happily
Token ID18177
Feature activation+9.20e-03

Pos logit contributions
an+0.027
 island+0.025
 captain+0.023
 anchor+0.023
 idea+0.023
Neg logit contributions
 licking-0.020
 chewing-0.020
 eating-0.019
 gloss-0.019
 exercising-0.018
Token ever
Token ID1683
Feature activation+2.50e-02

Pos logit contributions
 gloss+0.014
 spaghetti+0.013
 licking+0.013
 chewing+0.013
ifying+0.013
Neg logit contributions
an-0.017
 island-0.017
 captain-0.016
 pilot-0.016
 idea-0.015
Token after
Token ID706
Feature activation-1.89e-02

Pos logit contributions
 gloss+0.053
 licking+0.048
 spaghetti+0.047
 Midnight+0.046
heddar+0.045
Neg logit contributions
 island-0.032
 captain-0.030
an-0.030
 idea-0.028
 girl-0.026
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
an+0.034
 island+0.033
 captain+0.031
 pilot+0.030
 idea+0.030
Neg logit contributions
 gloss-0.031
 licking-0.030
 chewing-0.030
ifying-0.028
 spaghetti-0.028
Token The
Token ID383
Feature activation-2.47e-03

Pos logit contributions
 gloss+0.023
 licking+0.021
 chewing+0.020
ifying+0.020
 spaghetti+0.019
Neg logit contributions
an-0.025
 island-0.024
 captain-0.022
 pilot-0.022
 idea-0.022
Token end
Token ID886
Feature activation+5.97e-03

Pos logit contributions
an+0.005
 island+0.004
tha+0.004
 captain+0.004
 anchor+0.004
Neg logit contributions
 chewing-0.004
 Mitt-0.004
 gloss-0.004
 exercising-0.004
 Midnight-0.004
Token.
Token ID13
Feature activation+1.58e-02

Pos logit contributions
 gloss+0.012
 licking+0.012
 chewing+0.011
ifying+0.011
 spaghetti+0.011
Neg logit contributions
an-0.008
 island-0.008
 idea-0.007
 captain-0.007
 pilot-0.007
Token the
Token ID262
Feature activation+6.63e-03

Pos logit contributions
 gloss+0.001
 spaghetti+0.001
 licking+0.001
ifying+0.001
 eating+0.001
Neg logit contributions
an-0.001
 island-0.001
 anchor-0.001
 captain-0.001
 pilot-0.001
Token song
Token ID3496
Feature activation+9.99e-03

Pos logit contributions
 spaghetti+0.010
 gloss+0.010
 eating+0.010
 licking+0.010
ifying+0.010
Neg logit contributions
an-0.011
 captain-0.010
tha-0.010
 island-0.010
 anchor-0.010
Token.
Token ID13
Feature activation+6.61e-03

Pos logit contributions
 gloss+0.016
 licking+0.016
 chewing+0.016
ifying+0.015
 Mitt+0.015
Neg logit contributions
an-0.018
 island-0.017
 idea-0.016
 captain-0.016
 pilot-0.016
Token From
Token ID3574
Feature activation+5.55e-03

Pos logit contributions
ifying+0.011
 licking+0.011
 gloss+0.011
 choke+0.010
 exercising+0.010
Neg logit contributions
an-0.010
 idea-0.010
 man-0.009
 girl-0.009
 landscape-0.009
Token that
Token ID326
Feature activation+5.25e-03

Pos logit contributions
 licking+0.004
 gloss+0.004
 chewing+0.004
 Mitt+0.004
 eating+0.004
Neg logit contributions
an-0.014
 captain-0.014
 idea-0.014
 island-0.014
 pilot-0.013
Token day
Token ID1110
Feature activation+2.35e-02

Pos logit contributions
 gloss+0.010
 licking+0.009
 chewing+0.009
 spaghetti+0.009
 Mitt+0.009
Neg logit contributions
an-0.008
 island-0.008
 captain-0.008
 pilot-0.007
 idea-0.007
Token on
Token ID319
Feature activation+1.53e-02

Pos logit contributions
 gloss+0.040
 licking+0.039
 spaghetti+0.037
 chewing+0.036
ifying+0.036
Neg logit contributions
an-0.038
 island-0.037
 captain-0.036
 idea-0.036
 pilot-0.035
Token,
Token ID11
Feature activation+6.13e-03

Pos logit contributions
 gloss+0.029
 licking+0.028
 chewing+0.027
ifying+0.027
 spaghetti+0.026
Neg logit contributions
an-0.024
 island-0.023
 captain-0.022
 idea-0.021
 pilot-0.021
Token Lily
Token ID20037
Feature activation+2.06e-03

Pos logit contributions
 gloss+0.009
 licking+0.009
ifying+0.009
 chewing+0.008
 spaghetti+0.008
Neg logit contributions
 island-0.011
 idea-0.011
an-0.011
 pilot-0.011
 captain-0.011
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
 gloss+0.003
 licking+0.003
 chewing+0.003
 Mitt+0.003
 eating+0.003
Neg logit contributions
an-0.004
 island-0.004
 captain-0.003
 idea-0.003
 anchor-0.003
Token her
Token ID607
Feature activation+3.90e-03

Pos logit contributions
 gloss+0.016
 licking+0.015
 chewing+0.015
 spaghetti+0.014
 Mitt+0.014
Neg logit contributions
an-0.023
 island-0.022
 captain-0.021
 pilot-0.020
 idea-0.020
Token to
Token ID284
Feature activation+3.14e-03

Pos logit contributions
ifying+0.026
 gloss+0.026
 licking+0.023
heddar+0.023
aling+0.022
Neg logit contributions
 island-0.029
an-0.028
 captain-0.027
 idea-0.027
 pilot-0.027
Token play
Token ID711
Feature activation+6.47e-03

Pos logit contributions
 gloss+0.005
 licking+0.005
ifying+0.005
 chewing+0.005
 spaghetti+0.004
Neg logit contributions
an-0.006
 island-0.006
 captain-0.005
 idea-0.005
 pilot-0.005
Token with
Token ID351
Feature activation-1.23e-02

Pos logit contributions
 gloss+0.008
 licking+0.008
 chewing+0.008
 spaghetti+0.008
 Mitt+0.008
Neg logit contributions
an-0.014
 island-0.013
 captain-0.013
 pilot-0.013
 anchor-0.012
Token Lily
Token ID20037
Feature activation+7.43e-03

Pos logit contributions
an+0.024
 island+0.023
 captain+0.022
 idea+0.021
 pilot+0.021
Neg logit contributions
 gloss-0.019
 licking-0.018
 chewing-0.017
 Mitt-0.017
 spaghetti-0.017
Token's
Token ID338
Feature activation-2.35e-04

Pos logit contributions
 gloss+0.012
 licking+0.011
ifying+0.011
 chewing+0.011
 Mitt+0.010
Neg logit contributions
an-0.014
 island-0.013
 idea-0.013
 captain-0.012
 pilot-0.012
Token skirt
Token ID23967
Feature activation+2.36e-02

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 pilot+0.000
 anchor+0.000
Neg logit contributions
 gloss-0.000
 spaghetti-0.000
 chewing-0.000
 licking-0.000
 Mitt-0.000
Token too
Token ID1165
Feature activation-1.43e-02

Pos logit contributions
 gloss+0.045
 licking+0.044
ifying+0.043
 chewing+0.042
 Mitt+0.040
Neg logit contributions
an-0.035
 island-0.035
 idea-0.033
 captain-0.032
 pilot-0.031
Token.
Token ID13
Feature activation-1.93e-03

Pos logit contributions
an+0.023
 island+0.022
 idea+0.021
 captain+0.021
 pilot+0.020
Neg logit contributions
 gloss-0.027
 licking-0.025
ifying-0.025
 chewing-0.024
 Mitt-0.024
Token He
Token ID679
Feature activation-7.70e-03

Pos logit contributions
an+0.004
 island+0.004
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.002
ifying-0.002
 licking-0.002
 chewing-0.002
 spaghetti-0.002
Token wanted
Token ID2227
Feature activation+4.44e-03

Pos logit contributions
an+0.013
 island+0.013
 captain+0.012
 idea+0.012
 pilot+0.012
Neg logit contributions
 gloss-0.013
 licking-0.013
 chewing-0.012
 Mitt-0.012
 spaghetti-0.012
Token to
Token ID284
Feature activation-7.05e-03

Pos logit contributions
ifying+0.007
 gloss+0.007
aling+0.006
 licking+0.006
heddar+0.006
Neg logit contributions
 island-0.008
 captain-0.008
 man-0.007
an-0.007
 pilot-0.007
Token.
Token ID13
Feature activation+3.33e-04

Pos logit contributions
 gloss+0.014
ifying+0.014
 licking+0.014
 Mitt+0.014
 Plus+0.013
Neg logit contributions
 island-0.012
an-0.011
 idea-0.010
 captain-0.010
 man-0.010
Token Gina
Token ID43002
Feature activation-1.15e-02

Pos logit contributions
 gloss+0.000
 licking+0.000
 Mitt+0.000
 chewing+0.000
ifying+0.000
Neg logit contributions
an-0.001
 island-0.001
 captain-0.001
 idea-0.001
 pilot-0.001
Token scoop
Token ID23529
Feature activation+2.23e-03

Pos logit contributions
an+0.019
 island+0.018
 captain+0.017
 idea+0.017
 pilot+0.017
Neg logit contributions
 gloss-0.021
 licking-0.020
 chewing-0.020
 spaghetti-0.019
 Mitt-0.019
Tokened
Token ID276
Feature activation-4.66e-03

Pos logit contributions
 gloss+0.004
 licking+0.004
 chewing+0.004
ifying+0.004
 spaghetti+0.004
Neg logit contributions
an-0.004
 island-0.003
 captain-0.003
 idea-0.003
 pilot-0.003
Token a
Token ID257
Feature activation+1.56e-02

Pos logit contributions
an+0.007
 island+0.006
 captain+0.006
 idea+0.006
 pilot+0.006
Neg logit contributions
 gloss-0.009
 licking-0.009
 chewing-0.009
ifying-0.009
 Mitt-0.008
Token handful
Token ID10089
Feature activation+2.28e-02

Pos logit contributions
 gloss+0.030
 exercising+0.028
 licking+0.028
seek+0.028
 Mitt+0.028
Neg logit contributions
an-0.023
 island-0.022
 idea-0.022
 captain-0.021
 man-0.021
Token of
Token ID286
Feature activation+1.81e-02

Pos logit contributions
 gloss+0.039
 licking+0.037
 Midnight+0.036
 Mitt+0.036
 chewing+0.036
Neg logit contributions
an-0.039
 island-0.037
 idea-0.036
 captain-0.034
 anchor-0.034
Token the
Token ID262
Feature activation+9.29e-03

Pos logit contributions
 gloss+0.024
ifying+0.022
 licking+0.020
seek+0.020
 By+0.020
Neg logit contributions
 island-0.038
an-0.038
 idea-0.037
 anchor-0.036
 captain-0.035
Token pasta
Token ID26296
Feature activation+1.97e-02

Pos logit contributions
 gloss+0.010
 licking+0.009
 Mitt+0.009
ifying+0.009
 Midnight+0.009
Neg logit contributions
an-0.022
 island-0.021
 idea-0.021
 captain-0.020
 anchor-0.020
Token into
Token ID656
Feature activation+9.91e-03

Pos logit contributions
 gloss+0.037
 licking+0.035
ifying+0.035
 Mitt+0.035
 chewing+0.034
Neg logit contributions
an-0.030
 island-0.030
 idea-0.028
 captain-0.027
 anchor-0.026
Token her
Token ID607
Feature activation+1.35e-02

Pos logit contributions
 gloss+0.018
 licking+0.017
 chewing+0.016
ifying+0.016
 spaghetti+0.016
Neg logit contributions
an-0.016
 island-0.016
 captain-0.015
 idea-0.015
 pilot-0.014
Token family
Token ID1641
Feature activation+2.36e-03

Pos logit contributions
an+0.005
 island+0.004
 captain+0.004
 idea+0.004
tha+0.004
Neg logit contributions
 gloss-0.006
 spaghetti-0.006
 licking-0.006
 chewing-0.006
 Mitt-0.006
Token was
Token ID373
Feature activation-5.52e-03

Pos logit contributions
 licking+0.003
 gloss+0.003
 eating+0.003
 chewing+0.003
 Mitt+0.003
Neg logit contributions
an-0.005
 idea-0.004
 island-0.004
 captain-0.004
 pilot-0.004
Token so
Token ID523
Feature activation-4.46e-03

Pos logit contributions
an+0.011
 island+0.011
 captain+0.010
 idea+0.010
 pilot+0.010
Neg logit contributions
 gloss-0.008
 licking-0.007
 chewing-0.007
ifying-0.007
 spaghetti-0.007
Token happy
Token ID3772
Feature activation+6.81e-03

Pos logit contributions
an+0.008
 island+0.008
 captain+0.008
 pilot+0.007
 idea+0.007
Neg logit contributions
 licking-0.007
 gloss-0.007
 chewing-0.007
 eating-0.006
ifying-0.006
Token that
Token ID326
Feature activation-4.03e-03

Pos logit contributions
 gloss+0.011
 licking+0.010
ifying+0.010
 chewing+0.010
 spaghetti+0.009
Neg logit contributions
an-0.013
 island-0.013
 captain-0.012
 idea-0.012
 pilot-0.011
Token it
Token ID340
Feature activation+6.56e-03

Pos logit contributions
an+0.009
 island+0.008
 captain+0.008
 idea+0.007
 pilot+0.007
Neg logit contributions
 licking-0.005
 gloss-0.005
 Mitt-0.005
 chewing-0.005
 eating-0.005
Token had
Token ID550
Feature activation-3.10e-03

Pos logit contributions
 gloss+0.009
 licking+0.009
 chewing+0.008
 Mitt+0.008
ifying+0.008
Neg logit contributions
an-0.013
 island-0.013
 idea-0.012
 captain-0.012
 anchor-0.012
Token gone
Token ID3750
Feature activation-4.98e-03

Pos logit contributions
an+0.006
 island+0.006
 captain+0.006
 idea+0.006
 pilot+0.006
Neg logit contributions
 gloss-0.004
 licking-0.004
 chewing-0.004
 spaghetti-0.004
ifying-0.004
Token.
Token ID13
Feature activation-2.36e-05

Pos logit contributions
an+0.009
 island+0.009
 captain+0.009
 pilot+0.008
 idea+0.008
Neg logit contributions
 gloss-0.008
 licking-0.008
 chewing-0.007
ifying-0.007
 spaghetti-0.007
Token The
Token ID383
Feature activation-2.12e-03

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 idea+0.000
 pilot+0.000
Neg logit contributions
 gloss-0.000
 licking-0.000
 chewing-0.000
ifying-0.000
 spaghetti-0.000
Token sky
Token ID6766
Feature activation+7.07e-03

Pos logit contributions
an+0.003
 island+0.003
 captain+0.003
 idea+0.003
tha+0.003
Neg logit contributions
 spaghetti-0.004
 gloss-0.004
 Mitt-0.004
 chewing-0.004
 licking-0.004
Token saw
Token ID2497
Feature activation-1.32e-03

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 idea+0.000
 pilot+0.000
Neg logit contributions
 gloss-0.000
 licking-0.000
 chewing-0.000
 Mitt-0.000
 spaghetti-0.000
Token it
Token ID340
Feature activation+7.42e-03

Pos logit contributions
an+0.003
 island+0.002
 captain+0.002
 idea+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
ifying-0.002
 spaghetti-0.002
Token shining
Token ID22751
Feature activation-6.39e-03

Pos logit contributions
 gloss+0.013
 licking+0.013
ifying+0.012
 chewing+0.012
 spaghetti+0.012
Neg logit contributions
an-0.012
 island-0.012
 captain-0.011
 idea-0.011
 pilot-0.010
Token in
Token ID287
Feature activation-3.67e-03

Pos logit contributions
an+0.011
 island+0.011
 captain+0.010
 pilot+0.010
 idea+0.010
Neg logit contributions
 gloss-0.011
 licking-0.010
ifying-0.010
 chewing-0.010
 spaghetti-0.010
Token the
Token ID262
Feature activation-2.50e-05

Pos logit contributions
an+0.006
 island+0.006
 captain+0.005
 pilot+0.005
 idea+0.005
Neg logit contributions
 gloss-0.007
 licking-0.006
 chewing-0.006
ifying-0.006
 spaghetti-0.006
Token rain
Token ID6290
Feature activation-8.60e-04

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 idea+0.000
 pilot+0.000
Neg logit contributions
 gloss-0.000
 licking-0.000
 chewing-0.000
ifying-0.000
 spaghetti-0.000
Token as
Token ID355
Feature activation-2.31e-02

Pos logit contributions
an+0.001
 island+0.001
 idea+0.001
 captain+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.002
ifying-0.002
 spaghetti-0.002
Token if
Token ID611
Feature activation-1.96e-02

Pos logit contributions
an+0.044
 island+0.043
 captain+0.041
 idea+0.040
 pilot+0.040
Neg logit contributions
 gloss-0.036
 licking-0.034
 chewing-0.033
ifying-0.032
 spaghetti-0.032
Token it
Token ID340
Feature activation+5.34e-03

Pos logit contributions
an+0.038
 island+0.035
 captain+0.033
 idea+0.032
 pilot+0.032
Neg logit contributions
 gloss-0.031
 licking-0.030
 chewing-0.029
 Mitt-0.029
 spaghetti-0.029
Token was
Token ID373
Feature activation-1.28e-02

Pos logit contributions
 gloss+0.009
 licking+0.008
 chewing+0.008
 Mitt+0.008
 spaghetti+0.008
Neg logit contributions
an-0.010
 island-0.009
 captain-0.009
 pilot-0.009
 idea-0.009
Token made
Token ID925
Feature activation-8.00e-03

Pos logit contributions
an+0.024
 island+0.023
 captain+0.022
 idea+0.022
 pilot+0.021
Neg logit contributions
 gloss-0.020
 licking-0.019
 chewing-0.018
ifying-0.018
 spaghetti-0.018
Token his
Token ID465
Feature activation-1.39e-03

Pos logit contributions
an+0.002
 island+0.002
 captain+0.001
 idea+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.002
 licking-0.001
 chewing-0.001
ifying-0.001
 spaghetti-0.001
Token four
Token ID1440
Feature activation-3.29e-03

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 pilot+0.002
 idea+0.002
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
 spaghetti-0.003
ifying-0.002
Token friends
Token ID2460
Feature activation-7.77e-03

Pos logit contributions
an+0.006
 island+0.006
 captain+0.005
 pilot+0.005
 idea+0.005
Neg logit contributions
 gloss-0.006
 licking-0.006
 chewing-0.005
ifying-0.005
 eating-0.005
Token,
Token ID11
Feature activation+2.48e-03

Pos logit contributions
an+0.014
 island+0.013
 captain+0.012
 idea+0.012
 pilot+0.012
Neg logit contributions
 gloss-0.013
 licking-0.013
 chewing-0.012
ifying-0.012
 eating-0.012
Token set
Token ID900
Feature activation-8.69e-03

Pos logit contributions
 gloss+0.004
ifying+0.004
 licking+0.004
 chewing+0.003
 spaghetti+0.003
Neg logit contributions
an-0.005
 island-0.004
 captain-0.004
 pilot-0.004
 idea-0.004
Token off
Token ID572
Feature activation+4.32e-03

Pos logit contributions
an+0.018
 island+0.017
 captain+0.016
 pilot+0.016
 idea+0.016
Neg logit contributions
 gloss-0.012
 licking-0.012
 chewing-0.011
 Mitt-0.011
 eating-0.011
Token for
Token ID329
Feature activation+2.99e-03

Pos logit contributions
 gloss+0.007
ifying+0.006
 licking+0.006
 spaghetti+0.006
 chewing+0.006
Neg logit contributions
 island-0.008
an-0.008
 idea-0.007
 captain-0.007
 pilot-0.007
Token their
Token ID511
Feature activation+4.16e-03

Pos logit contributions
 gloss+0.005
 licking+0.004
ifying+0.004
 chewing+0.004
 spaghetti+0.004
Neg logit contributions
an-0.005
 island-0.005
 captain-0.005
 idea-0.005
 pilot-0.005
Token jog
Token ID48342
Feature activation-4.16e-03

Pos logit contributions
 gloss+0.008
 licking+0.008
 chewing+0.007
 spaghetti+0.007
 eating+0.007
Neg logit contributions
an-0.007
 island-0.006
 captain-0.006
 pilot-0.006
 idea-0.006
Token.
Token ID13
Feature activation+8.43e-03

Pos logit contributions
an+0.008
 island+0.008
 captain+0.007
 anchor+0.007
 pilot+0.007
Neg logit contributions
 gloss-0.006
 licking-0.006
 chewing-0.006
ifying-0.005
 Mitt-0.005
Token It
Token ID632
Feature activation-3.77e-03

Pos logit contributions
 gloss+0.011
 licking+0.011
 chewing+0.010
 spaghetti+0.010
ifying+0.010
Neg logit contributions
an-0.017
 captain-0.016
 island-0.016
 idea-0.016
 pilot-0.016
Token cooked
Token ID15847
Feature activation+8.86e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
ifying+0.001
 chewing+0.001
 spaghetti+0.001
Neg logit contributions
an-0.001
 island-0.001
 captain-0.001
 idea-0.001
 pilot-0.001
Token all
Token ID477
Feature activation+6.94e-03

Pos logit contributions
 gloss+0.015
 licking+0.014
 chewing+0.014
ifying+0.013
 spaghetti+0.013
Neg logit contributions
an-0.016
 island-0.015
 captain-0.014
 pilot-0.014
 idea-0.014
Token sorts
Token ID10524
Feature activation+8.38e-03

Pos logit contributions
 gloss+0.015
ifying+0.014
 licking+0.014
 chewing+0.013
 Midnight+0.013
Neg logit contributions
an-0.009
 island-0.009
 idea-0.009
 anchor-0.008
 captain-0.008
Token of
Token ID286
Feature activation+1.07e-02

Pos logit contributions
 gloss+0.012
 licking+0.012
 chewing+0.011
ifying+0.011
 Mitt+0.011
Neg logit contributions
an-0.016
 island-0.016
 captain-0.015
 idea-0.015
 pilot-0.015
Token cold
Token ID4692
Feature activation-9.89e-04

Pos logit contributions
 gloss+0.018
 licking+0.017
 chewing+0.017
ifying+0.016
 spaghetti+0.016
Neg logit contributions
an-0.019
 island-0.018
 captain-0.017
 idea-0.017
 pilot-0.017
Token food
Token ID2057
Feature activation+3.71e-03

Pos logit contributions
an+0.001
 island+0.001
 idea+0.001
 captain+0.001
 man+0.001
Neg logit contributions
 gloss-0.002
ifying-0.002
 licking-0.002
 chewing-0.002
aling-0.002
Token in
Token ID287
Feature activation+4.83e-03

Pos logit contributions
 gloss+0.007
 licking+0.007
ifying+0.007
 chewing+0.006
 Mitt+0.006
Neg logit contributions
an-0.006
 island-0.006
 captain-0.005
 idea-0.005
 pilot-0.005
Token the
Token ID262
Feature activation-3.43e-03

Pos logit contributions
 gloss+0.010
 licking+0.009
 chewing+0.009
ifying+0.009
 Midnight+0.009
Neg logit contributions
 island-0.007
an-0.007
 captain-0.006
 idea-0.006
 anchor-0.006
Token big
Token ID1263
Feature activation-3.48e-03

Pos logit contributions
 island+0.006
an+0.006
 captain+0.005
 idea+0.005
 pilot+0.005
Neg logit contributions
 gloss-0.007
 licking-0.006
 chewing-0.006
ifying-0.006
seek-0.006
Token silver
Token ID8465
Feature activation-1.36e-02

Pos logit contributions
 island+0.005
an+0.005
 captain+0.005
 idea+0.005
 anchor+0.004
Neg logit contributions
 gloss-0.007
 licking-0.006
ifying-0.006
 chewing-0.006
 Mitt-0.006
Token pot
Token ID1787
Feature activation-3.60e-03

Pos logit contributions
an+0.022
 island+0.021
 captain+0.020
 pilot+0.020
 idea+0.019
Neg logit contributions
 gloss-0.024
 licking-0.023
 chewing-0.023
 spaghetti-0.023
ifying-0.023
Token they
Token ID484
Feature activation-1.72e-03

Pos logit contributions
an+0.017
 island+0.016
 captain+0.015
 idea+0.015
 pilot+0.015
Neg logit contributions
 gloss-0.015
 licking-0.015
 chewing-0.014
ifying-0.014
 spaghetti-0.014
Token were
Token ID547
Feature activation+1.66e-02

Pos logit contributions
an+0.003
 island+0.003
 captain+0.003
 pilot+0.003
 idea+0.003
Neg logit contributions
 gloss-0.003
 licking-0.002
 chewing-0.002
 Mitt-0.002
 eating-0.002
Token bored
Token ID17533
Feature activation+4.53e-03

Pos logit contributions
 gloss+0.027
ifying+0.024
 licking+0.024
 Mitt+0.023
heddar+0.023
Neg logit contributions
 island-0.030
an-0.030
 captain-0.029
 idea-0.029
 pilot-0.028
Token.
Token ID13
Feature activation+8.22e-04

Pos logit contributions
 gloss+0.010
ifying+0.010
 licking+0.010
heddar+0.009
 Midnight+0.009
Neg logit contributions
 island-0.005
 idea-0.005
an-0.005
 man-0.004
 captain-0.004
Token They
Token ID1119
Feature activation-9.15e-03

Pos logit contributions
 gloss+0.002
 licking+0.001
 chewing+0.001
aling+0.001
seek+0.001
Neg logit contributions
 idea-0.001
an-0.001
 island-0.001
 boy-0.001
 girl-0.001
Token looked
Token ID3114
Feature activation+2.66e-03

Pos logit contributions
an+0.018
 island+0.016
 captain+0.015
 pilot+0.015
 idea+0.015
Neg logit contributions
 gloss-0.014
 licking-0.014
 spaghetti-0.013
 chewing-0.013
 Mitt-0.013
Token out
Token ID503
Feature activation+1.30e-03

Pos logit contributions
 gloss+0.004
 licking+0.004
 chewing+0.004
ifying+0.004
 spaghetti+0.004
Neg logit contributions
an-0.005
 island-0.005
 captain-0.004
 pilot-0.004
 idea-0.004
Token the
Token ID262
Feature activation+9.78e-03

Pos logit contributions
 licking+0.002
 gloss+0.002
 chewing+0.002
ifying+0.002
 spaghetti+0.002
Neg logit contributions
an-0.003
 island-0.002
 captain-0.002
 pilot-0.002
 idea-0.002
Token window
Token ID4324
Feature activation+5.88e-03

Pos logit contributions
 gloss+0.017
 licking+0.016
 chewing+0.016
 Mitt+0.015
ifying+0.015
Neg logit contributions
an-0.017
 island-0.017
 captain-0.016
 idea-0.015
 pilot-0.015
Token,
Token ID11
Feature activation-7.72e-03

Pos logit contributions
 gloss+0.012
 licking+0.012
 chewing+0.011
ifying+0.011
 Mitt+0.011
Neg logit contributions
an-0.008
 island-0.008
 idea-0.007
 captain-0.007
 pilot-0.007
Token but
Token ID475
Feature activation-7.24e-03

Pos logit contributions
an+0.012
 island+0.012
 captain+0.011
 idea+0.010
 pilot+0.010
Neg logit contributions
 gloss-0.015
 licking-0.014
 chewing-0.014
ifying-0.014
 spaghetti-0.013
Token,
Token ID11
Feature activation-4.66e-04

Pos logit contributions
an+0.004
 island+0.004
 idea+0.004
 captain+0.004
 anchor+0.004
Neg logit contributions
 gloss-0.005
ifying-0.005
 licking-0.005
 chewing-0.005
 Mitt-0.005
Token they
Token ID484
Feature activation-2.19e-03

Pos logit contributions
an+0.001
 island+0.001
 captain+0.001
 idea+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
ifying-0.001
 spaghetti-0.001
Token were
Token ID547
Feature activation+4.05e-03

Pos logit contributions
an+0.004
 island+0.004
 captain+0.004
 idea+0.004
 pilot+0.004
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
ifying-0.003
 spaghetti-0.003
Token all
Token ID477
Feature activation+1.93e-03

Pos logit contributions
 gloss+0.006
ifying+0.006
 licking+0.006
 spaghetti+0.005
 chewing+0.005
Neg logit contributions
 island-0.007
an-0.007
 captain-0.007
 idea-0.007
 pilot-0.007
Token spark
Token ID9009
Feature activation-1.83e-04

Pos logit contributions
 gloss+0.003
ifying+0.003
 licking+0.003
 chewing+0.003
heddar+0.003
Neg logit contributions
 idea-0.003
 island-0.003
an-0.003
 captain-0.003
 pilot-0.003
Tokenly
Token ID306
Feature activation-4.03e-03

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 pilot+0.000
 idea+0.000
Neg logit contributions
 gloss-0.000
 licking-0.000
ifying-0.000
 spaghetti-0.000
 chewing-0.000
Token and
Token ID290
Feature activation+3.58e-04

Pos logit contributions
 island+0.006
an+0.006
 captain+0.005
 idea+0.005
 pilot+0.005
Neg logit contributions
 gloss-0.008
 licking-0.008
 Mitt-0.008
 chewing-0.007
ifying-0.007
Token clean
Token ID3424
Feature activation+9.06e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
ifying+0.001
 chewing+0.001
 spaghetti+0.001
Neg logit contributions
 island-0.001
an-0.001
 captain-0.001
 idea-0.000
 pilot-0.000
Token again
Token ID757
Feature activation-3.78e-03

Pos logit contributions
 gloss+0.016
 licking+0.015
 chewing+0.015
ifying+0.015
 spaghetti+0.014
Neg logit contributions
an-0.015
 island-0.015
 captain-0.014
 idea-0.014
 pilot-0.014
Token!
Token ID0
Feature activation-3.04e-04

Pos logit contributions
an+0.006
 island+0.006
 captain+0.006
 idea+0.006
 pilot+0.006
Neg logit contributions
 gloss-0.007
 licking-0.006
 chewing-0.006
ifying-0.006
 spaghetti-0.006
Token Mom
Token ID11254
Feature activation+7.73e-03

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 pilot+0.000
 idea+0.000
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.000
 spaghetti-0.000
ifying-0.000
Token was
Token ID373
Feature activation+4.22e-03

Pos logit contributions
an+0.009
 island+0.009
 captain+0.008
 idea+0.008
 pilot+0.008
Neg logit contributions
 gloss-0.010
 licking-0.009
 chewing-0.009
 spaghetti-0.009
 Mitt-0.009
Token so
Token ID523
Feature activation-4.54e-03

Pos logit contributions
 gloss+0.007
ifying+0.006
 spaghetti+0.006
 licking+0.006
heddar+0.006
Neg logit contributions
an-0.008
 island-0.008
 captain-0.007
 idea-0.007
 pilot-0.007
Token proud
Token ID6613
Feature activation+2.14e-02

Pos logit contributions
an+0.006
 island+0.006
 captain+0.005
 idea+0.005
 pilot+0.005
Neg logit contributions
 gloss-0.010
 licking-0.009
ifying-0.009
 spaghetti-0.009
 chewing-0.009
Token of
Token ID286
Feature activation+8.43e-03

Pos logit contributions
 gloss+0.036
ifying+0.036
 Midnight+0.034
 licking+0.033
 Mitt+0.032
Neg logit contributions
 island-0.036
an-0.035
 captain-0.034
 idea-0.033
 man-0.032
Token his
Token ID465
Feature activation-6.79e-03

Pos logit contributions
 gloss+0.014
 licking+0.014
 chewing+0.013
ifying+0.013
 spaghetti+0.013
Neg logit contributions
an-0.015
 island-0.014
 captain-0.014
 idea-0.013
 pilot-0.013
Token work
Token ID670
Feature activation-4.62e-03

Pos logit contributions
an+0.014
 island+0.014
 captain+0.013
 pilot+0.013
 idea+0.012
Neg logit contributions
 gloss-0.009
 licking-0.009
 chewing-0.009
 spaghetti-0.008
 eating-0.008
Token.
Token ID13
Feature activation+2.54e-03

Pos logit contributions
an+0.007
 island+0.007
 idea+0.007
 captain+0.007
 pilot+0.006
Neg logit contributions
 gloss-0.009
 licking-0.008
ifying-0.008
 chewing-0.008
 Mitt-0.008
Token Then
Token ID3244
Feature activation+8.52e-03

Pos logit contributions
 gloss+0.004
ifying+0.003
 licking+0.003
 spaghetti+0.003
 chewing+0.003
Neg logit contributions
an-0.005
 captain-0.005
 island-0.005
 idea-0.005
 pilot-0.005
Token,
Token ID11
Feature activation+7.08e-03

Pos logit contributions
 gloss+0.012
 licking+0.011
 chewing+0.011
 Mitt+0.011
 spaghetti+0.011
Neg logit contributions
an-0.018
 island-0.017
 captain-0.016
 idea-0.016
 pilot-0.016
Token the
Token ID262
Feature activation-5.70e-03

Pos logit contributions
 gloss+0.010
 licking+0.010
 chewing+0.009
ifying+0.009
 spaghetti+0.009
Neg logit contributions
an-0.014
 island-0.014
 captain-0.013
 idea-0.013
 pilot-0.013
Token distant
Token ID12899
Feature activation-2.13e-02

Pos logit contributions
an+0.007
 island+0.006
ela+0.005
tha+0.005
 captain+0.005
Neg logit contributions
 chewing-0.012
 licking-0.012
 gloss-0.012
 spaghetti-0.012
 eating-0.011
Tokenma
Token ID2611
Feature activation+1.50e-03

Pos logit contributions
an+0.020
 island+0.020
 captain+0.019
 pilot+0.018
 idea+0.018
Neg logit contributions
 gloss-0.020
 licking-0.018
 chewing-0.018
 spaghetti-0.018
ifying-0.017
Token said
Token ID531
Feature activation-7.05e-03

Pos logit contributions
 gloss+0.003
 licking+0.003
 chewing+0.003
 Mitt+0.003
 spaghetti+0.003
Neg logit contributions
an-0.002
 island-0.002
 captain-0.002
 idea-0.002
 pilot-0.002
Token,
Token ID11
Feature activation-1.15e-02

Pos logit contributions
an+0.012
 island+0.012
 captain+0.011
 idea+0.011
 pilot+0.011
Neg logit contributions
 gloss-0.012
 licking-0.012
 chewing-0.011
ifying-0.011
 spaghetti-0.011
Token giving
Token ID3501
Feature activation+3.50e-03

Pos logit contributions
an+0.021
 island+0.020
 captain+0.019
 pilot+0.019
 idea+0.018
Neg logit contributions
 gloss-0.019
 licking-0.017
ifying-0.017
 chewing-0.017
 spaghetti-0.017
Token L
Token ID406
Feature activation-7.87e-03

Pos logit contributions
 gloss+0.005
 licking+0.005
 chewing+0.005
ifying+0.005
 spaghetti+0.005
Neg logit contributions
an-0.007
 island-0.006
 captain-0.006
 idea-0.006
 pilot-0.006
Tokenola
Token ID5708
Feature activation-8.22e-03

Pos logit contributions
 island+0.015
 captain+0.014
an+0.014
 pilot+0.014
 anchor+0.013
Neg logit contributions
 gloss-0.013
 chewing-0.011
 spaghetti-0.011
 licking-0.011
ifying-0.011
Token a
Token ID257
Feature activation-5.77e-03

Pos logit contributions
an+0.016
 island+0.016
 captain+0.015
 idea+0.015
 pilot+0.015
Neg logit contributions
 gloss-0.012
 licking-0.011
 chewing-0.011
ifying-0.011
 spaghetti-0.011
Token big
Token ID1263
Feature activation-2.18e-03

Pos logit contributions
an+0.009
 island+0.009
 idea+0.008
 captain+0.008
 pilot+0.008
Neg logit contributions
 gloss-0.011
 licking-0.010
 chewing-0.010
ifying-0.010
 Mitt-0.010
Token hug
Token ID16225
Feature activation-7.01e-05

Pos logit contributions
an+0.005
 island+0.005
 captain+0.004
 pilot+0.004
 idea+0.004
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
 spaghetti-0.003
ifying-0.003
Token.
Token ID13
Feature activation-7.18e-04

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 idea+0.000
 pilot+0.000
Neg logit contributions
 gloss-0.000
 licking-0.000
 chewing-0.000
ifying-0.000
 spaghetti-0.000
Token\n
Token ID198
Feature activation+4.26e-04

Pos logit contributions
an+0.001
 captain+0.001
 idea+0.001
 island+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
ifying-0.001
 chewing-0.001
 spaghetti-0.001
Token But
Token ID887
Feature activation-6.61e-03

Pos logit contributions
 gloss+0.006
 licking+0.006
 chewing+0.006
 eating+0.006
 exercising+0.006
Neg logit contributions
an-0.008
 captain-0.008
 pilot-0.008
 island-0.008
 idea-0.007
Token you
Token ID345
Feature activation+5.50e-03

Pos logit contributions
an+0.012
 island+0.012
 captain+0.011
 idea+0.011
 pilot+0.011
Neg logit contributions
 gloss-0.011
 licking-0.010
 chewing-0.010
ifying-0.010
 spaghetti-0.010
Token have
Token ID423
Feature activation+3.50e-03

Pos logit contributions
 gloss+0.008
 licking+0.008
 chewing+0.007
ifying+0.007
 spaghetti+0.007
Neg logit contributions
an-0.011
 island-0.011
 captain-0.010
 idea-0.010
 pilot-0.010
Token to
Token ID284
Feature activation+6.69e-04

Pos logit contributions
 gloss+0.006
ifying+0.005
 licking+0.005
heddar+0.005
aling+0.005
Neg logit contributions
 island-0.006
 captain-0.006
an-0.006
 anchor-0.006
 pilot-0.006
Token write
Token ID3551
Feature activation+6.10e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 chewing+0.001
ifying+0.001
 spaghetti+0.001
Neg logit contributions
an-0.001
 island-0.001
 captain-0.001
 idea-0.001
 pilot-0.001
Token on
Token ID319
Feature activation-5.42e-03

Pos logit contributions
 gloss+0.010
 licking+0.009
 chewing+0.009
ifying+0.009
 spaghetti+0.009
Neg logit contributions
an-0.011
 island-0.011
 captain-0.010
 idea-0.010
 pilot-0.010
Token the
Token ID262
Feature activation+7.73e-03

Pos logit contributions
an+0.009
 island+0.009
 captain+0.008
 idea+0.008
 pilot+0.008
Neg logit contributions
 gloss-0.010
 licking-0.009
 chewing-0.009
ifying-0.009
 Mitt-0.008
Token right
Token ID826
Feature activation-1.30e-02

Pos logit contributions
 gloss+0.014
 licking+0.014
 chewing+0.013
 spaghetti+0.013
ifying+0.013
Neg logit contributions
an-0.012
 island-0.012
 captain-0.011
 idea-0.011
 pilot-0.011
Token things
Token ID1243
Feature activation+8.27e-04

Pos logit contributions
an+0.016
 island+0.015
 pilot+0.015
 captain+0.015
 deer+0.014
Neg logit contributions
 licking-0.025
 gloss-0.025
 chewing-0.024
 spaghetti-0.024
 eating-0.024
Token.
Token ID13
Feature activation+7.51e-03

Pos logit contributions
 gloss+0.001
 licking+0.001
 chewing+0.001
ifying+0.001
 spaghetti+0.001
Neg logit contributions
an-0.002
 island-0.001
 captain-0.001
 idea-0.001
 pilot-0.001
Token You
Token ID921
Feature activation-3.47e-03

Pos logit contributions
 licking+0.011
 gloss+0.011
 chewing+0.010
 exercising+0.010
 eating+0.010
Neg logit contributions
an-0.013
 captain-0.013
 island-0.012
 pilot-0.012
 idea-0.012
Token can
Token ID460
Feature activation+7.50e-03

Pos logit contributions
an+0.017
 island+0.015
tha+0.014
 captain+0.014
 landscape+0.014
Neg logit contributions
 gloss-0.010
ifying-0.010
 chewing-0.009
 Midnight-0.009
 much-0.009
Token use
Token ID779
Feature activation+1.39e-04

Pos logit contributions
 gloss+0.013
 licking+0.012
 chewing+0.012
ifying+0.012
 spaghetti+0.011
Neg logit contributions
an-0.013
 island-0.013
 captain-0.012
 idea-0.012
 pilot-0.012
Token stickers
Token ID28568
Feature activation+1.45e-02

Pos logit contributions
 gloss+0.000
 licking+0.000
 chewing+0.000
 spaghetti+0.000
ifying+0.000
Neg logit contributions
an-0.000
 island-0.000
 captain-0.000
 pilot-0.000
 idea-0.000
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
 gloss+0.025
 licking+0.024
 chewing+0.024
ifying+0.023
 spaghetti+0.023
Neg logit contributions
an-0.025
 island-0.024
 captain-0.022
 idea-0.022
 pilot-0.022
Token paint
Token ID7521
Feature activation+1.09e-02

Pos logit contributions
an+0.018
 island+0.017
 captain+0.016
 pilot+0.016
 idea+0.016
Neg logit contributions
 gloss-0.017
 licking-0.016
 chewing-0.016
 spaghetti-0.016
ifying-0.016
Token,
Token ID11
Feature activation-9.06e-03

Pos logit contributions
 gloss+0.019
 licking+0.018
 chewing+0.017
ifying+0.017
 spaghetti+0.017
Neg logit contributions
an-0.019
 island-0.018
 captain-0.017
 pilot-0.017
 idea-0.017
Token or
Token ID393
Feature activation-4.30e-03

Pos logit contributions
an+0.016
 island+0.015
 captain+0.014
 idea+0.014
 pilot+0.013
Neg logit contributions
 gloss-0.016
 licking-0.015
 chewing-0.015
 spaghetti-0.014
ifying-0.014
Token glitter
Token ID31133
Feature activation-3.80e-03

Pos logit contributions
 island+0.007
 captain+0.007
an+0.007
 pilot+0.006
 idea+0.006
Neg logit contributions
 gloss-0.008
 licking-0.007
ifying-0.007
 chewing-0.007
 Mitt-0.007
Token.
Token ID13
Feature activation+6.96e-03

Pos logit contributions
an+0.006
 island+0.006
 captain+0.006
 pilot+0.006
 idea+0.006
Neg logit contributions
 gloss-0.007
 licking-0.006
 chewing-0.006
ifying-0.006
 spaghetti-0.006
Token You
Token ID921
Feature activation-7.90e-03

Pos logit contributions
 licking+0.010
 gloss+0.010
 chewing+0.009
 choking+0.009
 eating+0.009
Neg logit contributions
an-0.013
 captain-0.012
 pilot-0.012
 island-0.012
 idea-0.012
Token can
Token ID460
Feature activation+6.20e-03

Pos logit contributions
an+0.016
 island+0.014
 captain+0.013
tha+0.013
 pilot+0.013
Neg logit contributions
 gloss-0.012
ifying-0.011
 chewing-0.011
 spaghetti-0.011
 licking-0.010
Token word
Token ID1573
Feature activation-6.90e-04

Pos logit contributions
an+0.019
 island+0.018
 captain+0.017
 pilot+0.016
 anchor+0.015
Neg logit contributions
 gloss-0.028
 licking-0.027
 chewing-0.027
 spaghetti-0.026
 Mitt-0.026
Token they
Token ID484
Feature activation+8.94e-03

Pos logit contributions
an+0.001
 island+0.001
 captain+0.001
 idea+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
ifying-0.001
 spaghetti-0.001
Token shared
Token ID4888
Feature activation+4.91e-03

Pos logit contributions
 gloss+0.011
ifying+0.011
 licking+0.011
 chewing+0.011
 eating+0.010
Neg logit contributions
an-0.019
 island-0.018
 captain-0.017
 idea-0.017
 anchor-0.017
Token -
Token ID532
Feature activation-4.35e-04

Pos logit contributions
 gloss+0.008
 licking+0.008
 chewing+0.007
ifying+0.007
 spaghetti+0.007
Neg logit contributions
an-0.009
 island-0.009
 captain-0.008
 pilot-0.008
 idea-0.008
Token '
Token ID705
Feature activation-4.39e-03

Pos logit contributions
 island+0.001
an+0.001
 pilot+0.001
 captain+0.001
 idea+0.001
Neg logit contributions
 gloss-0.001
ifying-0.001
 licking-0.001
 chewing-0.001
heddar-0.001
Tokensp
Token ID2777
Feature activation-3.46e-02

Pos logit contributions
an+0.009
 island+0.009
 captain+0.008
 idea+0.008
 pilot+0.008
Neg logit contributions
 gloss-0.006
ifying-0.006
 licking-0.006
 chewing-0.006
 Mitt-0.005
Tokenoil
Token ID9437
Feature activation-4.71e-03

Pos logit contributions
an+0.075
 island+0.072
 captain+0.069
 pilot+0.068
 idea+0.067
Neg logit contributions
 gloss-0.044
 licking-0.042
 chewing-0.041
 spaghetti-0.039
ifying-0.039
Token!
Token ID0
Feature activation+4.98e-03

Pos logit contributions
an+0.009
 island+0.009
 captain+0.008
 pilot+0.008
 idea+0.008
Neg logit contributions
ifying-0.007
 gloss-0.007
 licking-0.007
 chewing-0.007
 spaghetti-0.006
Token'.
Token ID4458
Feature activation+4.38e-03

Pos logit contributions
 gloss+0.006
 licking+0.005
 chewing+0.005
 spaghetti+0.005
 eating+0.005
Neg logit contributions
an-0.011
 island-0.011
 captain-0.011
 pilot-0.010
 idea-0.010
Token<|endoftext|>
Token ID50256
Feature activation-6.32e-03

Pos logit contributions
 gloss+0.006
 licking+0.006
 chewing+0.006
 spaghetti+0.005
 exercising+0.005
Neg logit contributions
an-0.009
 island-0.009
 captain-0.008
 pilot-0.008
 idea-0.008
TokenOne
Token ID3198
Feature activation+6.00e-03

Pos logit contributions
an+0.011
 island+0.011
 captain+0.011
 pilot+0.010
 idea+0.010
Neg logit contributions
 gloss-0.010
 licking-0.010
 chewing-0.010
ifying-0.009
 spaghetti-0.009
Token uncle
Token ID7711
Feature activation-1.34e-02

Pos logit contributions
 gloss+0.002
 licking+0.002
 chewing+0.002
 spaghetti+0.002
ifying+0.002
Neg logit contributions
an-0.002
 island-0.002
 captain-0.002
 idea-0.002
 pilot-0.002
Token Tom
Token ID4186
Feature activation-1.57e-02

Pos logit contributions
an+0.026
 island+0.025
 captain+0.024
 idea+0.024
 pilot+0.024
Neg logit contributions
 gloss-0.020
 licking-0.019
 chewing-0.018
ifying-0.018
 spaghetti-0.018
Token.
Token ID13
Feature activation+6.64e-03

Pos logit contributions
an+0.028
 island+0.027
 captain+0.026
 pilot+0.025
 idea+0.025
Neg logit contributions
 gloss-0.026
 licking-0.025
 chewing-0.024
ifying-0.024
 spaghetti-0.024
Token Sh
Token ID911
Feature activation-7.57e-03

Pos logit contributions
 gloss+0.010
 licking+0.009
 chewing+0.009
 eating+0.009
 exercising+0.009
Neg logit contributions
an-0.013
 captain-0.012
 island-0.012
 pilot-0.012
 idea-0.011
Tokenr
Token ID81
Feature activation-8.38e-03

Pos logit contributions
an+0.014
 island+0.013
 captain+0.013
 idea+0.012
 pilot+0.012
Neg logit contributions
 gloss-0.013
 licking-0.012
 chewing-0.012
ifying-0.011
 spaghetti-0.011
Tokenink
Token ID676
Feature activation-2.84e-02

Pos logit contributions
an+0.014
 island+0.013
 captain+0.013
 pilot+0.012
 idea+0.012
Neg logit contributions
 gloss-0.015
 licking-0.014
 chewing-0.014
ifying-0.014
 spaghetti-0.014
Token me
Token ID502
Feature activation-7.95e-03

Pos logit contributions
 island+0.048
 captain+0.047
 pilot+0.047
an+0.045
 anchor+0.044
Neg logit contributions
 gloss-0.042
 spaghetti-0.040
 chewing-0.039
 eating-0.039
 sauce-0.038
Token,
Token ID11
Feature activation+3.39e-03

Pos logit contributions
an+0.015
 island+0.015
 captain+0.014
 pilot+0.014
 idea+0.014
Neg logit contributions
 gloss-0.012
 licking-0.011
 chewing-0.011
ifying-0.011
 spaghetti-0.011
Token shrink
Token ID22085
Feature activation-5.15e-03

Pos logit contributions
 gloss+0.005
 licking+0.005
 chewing+0.005
 Mitt+0.005
 spaghetti+0.004
Neg logit contributions
an-0.007
 island-0.007
 captain-0.006
 idea-0.006
 pilot-0.006
Token me
Token ID502
Feature activation+2.24e-03

Pos logit contributions
an+0.009
 island+0.008
 captain+0.008
 pilot+0.008
 idea+0.008
Neg logit contributions
 gloss-0.009
 licking-0.009
 chewing-0.009
 spaghetti-0.008
ifying-0.008
Token!"
Token ID2474
Feature activation+2.33e-03

Pos logit contributions
 gloss+0.004
 licking+0.004
 chewing+0.004
ifying+0.004
 spaghetti+0.004
Neg logit contributions
an-0.004
 island-0.003
 captain-0.003
 idea-0.003
 pilot-0.003
Token tough
Token ID5802
Feature activation-2.08e-03

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 idea+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.001
 eating-0.001
 Mitt-0.001
Token enough
Token ID1576
Feature activation+1.58e-03

Pos logit contributions
an+0.004
 island+0.004
 captain+0.003
 pilot+0.003
 idea+0.003
Neg logit contributions
 gloss-0.003
 licking-0.003
 chewing-0.003
 spaghetti-0.003
 Mitt-0.003
Token to
Token ID284
Feature activation+1.54e-02

Pos logit contributions
 gloss+0.002
 licking+0.002
ifying+0.002
 chewing+0.002
 Mitt+0.002
Neg logit contributions
an-0.004
 island-0.003
 captain-0.003
 idea-0.003
 pilot-0.003
Token explore
Token ID7301
Feature activation-3.46e-03

Pos logit contributions
 gloss+0.025
 licking+0.025
 chewing+0.024
 Mitt+0.023
 eating+0.023
Neg logit contributions
an-0.028
 island-0.026
 captain-0.024
 pilot-0.024
 idea-0.024
Token the
Token ID262
Feature activation-4.41e-05

Pos logit contributions
an+0.007
 captain+0.006
 island+0.006
 pilot+0.006
 idea+0.006
Neg logit contributions
 gloss-0.005
 spaghetti-0.005
 licking-0.005
 chewing-0.005
 Mitt-0.005
Token mysterious
Token ID11428
Feature activation-2.68e-02

Pos logit contributions
an+0.000
 island+0.000
 captain+0.000
 pilot+0.000
 idea+0.000
Neg logit contributions
 gloss-0.000
 spaghetti-0.000
 licking-0.000
 chewing-0.000
ifying-0.000
Token room
Token ID2119
Feature activation-3.26e-03

Pos logit contributions
an+0.051
 captain+0.045
 island+0.045
tha+0.044
 anchor+0.043
Neg logit contributions
 spaghetti-0.039
 gloss-0.039
 chewing-0.038
 licking-0.037
 eating-0.036
Token.
Token ID13
Feature activation+1.45e-03

Pos logit contributions
an+0.006
 island+0.006
 captain+0.005
 idea+0.005
 pilot+0.005
Neg logit contributions
 gloss-0.005
 licking-0.005
 chewing-0.005
ifying-0.005
 spaghetti-0.005
Token She
Token ID1375
Feature activation-1.05e-03

Pos logit contributions
 licking+0.002
 gloss+0.002
 chewing+0.002
ifying+0.002
 spaghetti+0.002
Neg logit contributions
an-0.003
 idea-0.003
 island-0.003
 captain-0.003
 pilot-0.003
Token was
Token ID373
Feature activation+5.87e-03

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 pilot+0.002
 idea+0.002
Neg logit contributions
 gloss-0.002
 licking-0.002
 chewing-0.001
 Mitt-0.001
 spaghetti-0.001
Token very
Token ID845
Feature activation+4.18e-03

Pos logit contributions
 gloss+0.009
 licking+0.008
 chewing+0.008
ifying+0.008
 spaghetti+0.008
Neg logit contributions
an-0.011
 island-0.011
 captain-0.010
 idea-0.010
 pilot-0.010
TokenWhat
Token ID2061
Feature activation+1.01e-02

Pos logit contributions
an+0.002
 island+0.002
 captain+0.002
 idea+0.002
 pilot+0.002
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
ifying-0.001
 spaghetti-0.001
Token are
Token ID389
Feature activation-2.89e-03

Pos logit contributions
 gloss+0.019
 licking+0.019
ifying+0.018
 chewing+0.018
 Mitt+0.017
Neg logit contributions
 island-0.015
an-0.015
 idea-0.015
 pilot-0.014
 captain-0.014
Token you
Token ID345
Feature activation-3.88e-03

Pos logit contributions
an+0.006
 island+0.006
 captain+0.005
 idea+0.005
 pilot+0.005
Neg logit contributions
 gloss-0.004
 licking-0.004
 chewing-0.004
ifying-0.004
 spaghetti-0.004
Token doing
Token ID1804
Feature activation-1.88e-02

Pos logit contributions
an+0.010
 island+0.009
 pilot+0.009
 captain+0.009
 route+0.009
Neg logit contributions
 chewing-0.004
 eating-0.004
 exercising-0.003
 licking-0.003
 Mitt-0.003
Token,
Token ID11
Feature activation-1.00e-02

Pos logit contributions
an+0.035
 island+0.032
 captain+0.030
 pilot+0.029
 anchor+0.029
Neg logit contributions
 chewing-0.030
 eating-0.029
 spaghetti-0.029
 gloss-0.029
 licking-0.029
Token Milo
Token ID32662
Feature activation-2.86e-02

Pos logit contributions
an+0.021
 island+0.020
 captain+0.019
 pilot+0.019
 idea+0.018
Neg logit contributions
 Mitt-0.013
 spaghetti-0.013
 gloss-0.013
 licking-0.013
 chewing-0.013
Token?"
Token ID1701
Feature activation-8.92e-04

Pos logit contributions
an+0.048
 island+0.045
 captain+0.043
 idea+0.043
 pilot+0.041
Neg logit contributions
 chewing-0.046
 gloss-0.045
 Mitt-0.045
 eating-0.045
 licking-0.045
Token asked
Token ID1965
Feature activation-1.32e-02

Pos logit contributions
an+0.002
 island+0.002
 captain+0.001
 idea+0.001
 pilot+0.001
Neg logit contributions
 gloss-0.001
 licking-0.001
 chewing-0.001
 Mitt-0.001
 spaghetti-0.001
Token Lucy
Token ID22162
Feature activation+1.03e-02

Pos logit contributions
an+0.031
 island+0.031
 captain+0.028
 pilot+0.028
 anchor+0.028
Neg logit contributions
 gloss-0.015
 Mitt-0.014
 licking-0.014
 chewing-0.014
 Midnight-0.013
Token.
Token ID13
Feature activation-8.63e-03

Pos logit contributions
 gloss+0.019
 licking+0.018
ifying+0.017
 spaghetti+0.017
 chewing+0.017
Neg logit contributions
an-0.017
 island-0.016
 captain-0.015
 idea-0.015
 pilot-0.015
Token\n
Token ID198
Feature activation-9.83e-03

Pos logit contributions
an+0.014
 captain+0.014
 island+0.013
 pilot+0.013
 man+0.013
Neg logit contributions
 gloss-0.015
ifying-0.014
 licking-0.013
 spaghetti-0.013
 exercising-0.013
Token a
Token ID257
Feature activation-9.18e-03

Pos logit contributions
an+0.025
 island+0.024
 captain+0.023
 idea+0.022
 pilot+0.022
Neg logit contributions
 gloss-0.024
 licking-0.023
 chewing-0.023
ifying-0.022
 spaghetti-0.022
Token family
Token ID1641
Feature activation-7.62e-03

Pos logit contributions
an+0.013
 island+0.013
 captain+0.012
 idea+0.011
 pilot+0.011
Neg logit contributions
 gloss-0.018
 licking-0.018
 chewing-0.017
 spaghetti-0.017
 Mitt-0.017
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
an+0.013
 island+0.013
 captain+0.012
 pilot+0.012
 idea+0.012
Neg logit contributions
 gloss-0.013
 licking-0.012
 chewing-0.012
ifying-0.012
 Mitt-0.012
Token a
Token ID257
Feature activation-6.24e-03

Pos logit contributions
an+0.018
 island+0.018
 captain+0.017
 pilot+0.016
 idea+0.016
Neg logit contributions
 gloss-0.017
 licking-0.016
ifying-0.016
 chewing-0.016
 spaghetti-0.015
Token spaceship
Token ID40663
Feature activation-1.47e-02

Pos logit contributions
an+0.009
 island+0.008
 captain+0.008
 idea+0.007
 pilot+0.007
Neg logit contributions
 gloss-0.013
 licking-0.012
 chewing-0.012
ifying-0.012
 spaghetti-0.012
Token crew
Token ID5462
Feature activation-2.84e-02

Pos logit contributions
an+0.025
 island+0.023
 captain+0.022
 idea+0.022
 pilot+0.021
Neg logit contributions
 gloss-0.026
 licking-0.025
 chewing-0.024
ifying-0.024
 spaghetti-0.023
Token and
Token ID290
Feature activation-6.67e-03

Pos logit contributions
an+0.049
 island+0.046
 captain+0.044
 idea+0.043
 pilot+0.042
Neg logit contributions
 gloss-0.049
 licking-0.047
 chewing-0.046
ifying-0.045
 spaghetti-0.044
Token princess
Token ID21752
Feature activation-1.14e-02

Pos logit contributions
 island+0.011
an+0.011
 captain+0.011
 pilot+0.011
 idea+0.010
Neg logit contributions
 gloss-0.011
 licking-0.011
 chewing-0.011
ifying-0.011
 spaghetti-0.010
Tokenes
Token ID274
Feature activation-2.46e-02

Pos logit contributions
an+0.025
 island+0.024
 captain+0.023
 idea+0.023
 anchor+0.022
Neg logit contributions
 gloss-0.014
 licking-0.013
 chewing-0.013
ifying-0.012
 spaghetti-0.012
Token in
Token ID287
Feature activation+1.03e-03

Pos logit contributions
an+0.045
 island+0.042
 captain+0.039
 idea+0.038
 pilot+0.038
Neg logit contributions
 gloss-0.041
 licking-0.039
 chewing-0.038
ifying-0.037
 eating-0.037
Token a
Token ID257
Feature activation-8.82e-04

Pos logit contributions
 gloss+0.002
 licking+0.002
ifying+0.002
 chewing+0.002
 spaghetti+0.002
Neg logit contributions
an-0.002
 island-0.002
 captain-0.002
 pilot-0.002
 idea-0.002