TOP ACTIVATIONS
MAX = 5.110

Can I have your ear rings ? They are so shiny
and you can make your dreams come true ."
I am blo oming ! â said
word . It is ' bat '. It means a stick
and her dream would keep blo oming . <|endoftext|> Once upon
to school . At school , she saw Ben
blo oming . It is blo oming now . It is
you get those ice cre ams ? Where are your toys
and asked , " Are stars high up ?"
magical and can make our dreams come true !"
and soon her garden was blo oming with all sorts of
the park . At the park , Lily saw
little better . At school , Tom meets his
you like the b unn ies ?" the lady asks .
enough , he found a blo oming l ily . He
the flower that was almost blo oming . It is blo
party together . At the party , Lily danced
university is . At the university , they see
Can we go to the stars and planets ?" Anna asks
her bed . At Grand ma 's house ,

MIN ACTIVATIONS
MIN = -4.905

knows how to get things unst uck ." So , they
had learned an important lesson - with hard work and determination
time , not so long ago , there was a big
wanted to keep it all for himself . He was being
<|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> L ila and Ben are friends
me . I forgive myself for getting lost ."
to take all the things for yourself . Sharing is caring
stubborn about learning new things anymore . She realized that it
He thinks they are toys for him . He
they didn 't get it unst uck . Just
want us to be friends for a long time ." <|endoftext|>
food and took it all for himself .
it lots of sugar cubes for treats . He worked very
will trade you my car for some of your flowers ."
Lily that stones are not for eating and they can hurt
license . I got it for being a good cat ."
need to cover my nuts for the winter ." Tom smiled
Bob wanted to take it for himself , but Joe wanted
. The squirrel told Bun - B un to climb up
He wanted to save it for later . So , he

INTERVAL 2.606 - 5.110
CONTAINS 0.231%

. We can do it if we try ."
something fun at home . But they also said they wanted
are friends . They like to play in the park .
herself ! At the end of the day
up a big hill . At the top of the hill

INTERVAL 0.102 - 2.606
CONTAINS 43.897%

from the cups . They ate cookies and talked in funny
colors . You can trip in them ." Tom
! You have to go to bed now !"
's warning . As the day went on ,
was amazed . She felt like she was watching a movie

INTERVAL -2.401 - 0.102
CONTAINS 55.289%

baby brother all about cleaning . Mom my smiled and said
scared . They hug their t eddy bears and each other
flew down and tried to take their food . The mom
the whisper said , I am
Lily was very sad and realized that being rude and not

INTERVAL -4.905 - -2.401
CONTAINS 0.583%

did not feel it chy anymore . She was a brave
then placed it somewhere safe for the next day .
. He smiled and said , " I want this one
have to save the mouse !" Lily 's mom my said
him . It was very y ummy ! Jack took the
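
NOTE ON INTERVALS
The four INTERVAL ranges above are consistent with splitting the range from MIN = -4.905 to MAX = 5.110 into four equal-width bins, and the four CONTAINS percentages sum to 100%. The sketch below reproduces that arithmetic. It assumes equal-width binning is how the boundaries were produced, which the dump itself does not state, and the name equal_width_edges is illustrative only.

```python
# Sketch only: assumes the dashboard bins activations into equal-width
# intervals between MIN and MAX. The dump does not confirm this.

def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 boundaries of equal-width bins over [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# MIN and MAX as printed in the dump (already rounded to 3 decimals).
edges = equal_width_edges(-4.905, 5.110, 4)

# CONTAINS percentages as printed, from the highest interval to the lowest.
contains = [0.231, 43.897, 55.289, 0.583]
total = sum(contains)

print(edges)
print(total)
```

The inner boundaries come out within 0.001 of the printed values -2.401, 0.102 and 2.606. The small differences are expected, since the printed MIN and MAX are themselves rounded.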
TokenCan
Token ID6090
Feature activation+3.43e-01

Pos logit contributions
 wave+0.010
 see+0.010
 waves+0.009
seat+0.009
 paddle+0.008
Neg logit contributions
xy-0.019
oni-0.018
liv-0.018
ork-0.017
 fault-0.017
Token I
Token ID314
Feature activation+6.23e-01

Pos logit contributions
 see+0.004
 wave+0.003
 waves+0.003
 voice+0.003
Leave+0.002
Neg logit contributions
xy-0.015
oni-0.015
liv-0.014
 fault-0.014
ork-0.014
Token have
Token ID423
Feature activation-1.44e-01

Pos logit contributions
seat+0.013
 valleys+0.013
 amaz+0.013
 waves+0.013
 sle+0.013
Neg logit contributions
ork-0.017
xy-0.017
oni-0.017
ratch-0.015
her-0.015
Token your
Token ID534
Feature activation+3.27e-01

Pos logit contributions
xy+0.004
oni+0.004
liv+0.004
ork+0.004
 fault+0.004
Neg logit contributions
 wave-0.003
 see-0.003
 waves-0.003
seat-0.003
 paddle-0.003
Token ear
Token ID1027
Feature activation+3.25e+00

Pos logit contributions
 wave+0.007
 see+0.007
seat+0.007
 waves+0.007
 transported+0.007
Neg logit contributions
xy-0.010
oni-0.009
 fault-0.009
liv-0.009
ork-0.009
Tokenrings
Token ID33173
Feature activation+5.11e+00

Pos logit contributions
 wave+0.065
 waves+0.064
seat+0.060
 see+0.059
 hops+0.058
Neg logit contributions
xy-0.102
oni-0.098
liv-0.095
ork-0.095
 fault-0.093
Token?
Token ID30
Feature activation+9.06e-01

Pos logit contributions
 wave+0.097
 waves+0.094
seat+0.093
 reins+0.090
 see+0.089
Neg logit contributions
xy-0.168
oni-0.164
liv-0.156
ork-0.152
 fault-0.149
Token They
Token ID1119
Feature activation+1.73e+00

Pos logit contributions
 wave+0.019
 see+0.018
 waves+0.017
seat+0.017
 paddle+0.016
Neg logit contributions
xy-0.030
oni-0.028
liv-0.027
ork-0.026
 fault-0.026
Token are
Token ID389
Feature activation+1.14e-01

Pos logit contributions
 wave+0.031
seat+0.031
 waves+0.029
 see+0.028
Leave+0.027
Neg logit contributions
xy-0.059
oni-0.056
liv-0.053
ork-0.053
 fault-0.051
Token so
Token ID523
Feature activation+1.08e+00

Pos logit contributions
 wave+0.002
 see+0.002
 waves+0.002
seat+0.002
 sle+0.002
Neg logit contributions
xy-0.004
liv-0.004
oni-0.004
ork-0.003
 fault-0.003
Token shiny
Token ID22441
Feature activation+8.63e-02

Pos logit contributions
 wave+0.019
 see+0.018
 waves+0.017
seat+0.017
 paddle+0.015
Neg logit contributions
xy-0.039
oni-0.037
liv-0.037
ork-0.035
 fault-0.035
Token and
Token ID290
Feature activation+6.09e-01

Pos logit contributions
 wave+0.020
 see+0.020
 waves+0.018
seat+0.018
 paddle+0.016
Neg logit contributions
xy-0.036
liv-0.034
oni-0.033
ork-0.032
 fault-0.031
Token you
Token ID345
Feature activation-4.90e-01

Pos logit contributions
 wave+0.011
 see+0.010
seat+0.010
 waves+0.010
 sle+0.009
Neg logit contributions
xy-0.021
oni-0.020
liv-0.020
ork-0.019
 fault-0.019
Token can
Token ID460
Feature activation+3.65e-01

Pos logit contributions
xy+0.018
oni+0.017
liv+0.017
ork+0.016
 fault+0.016
Neg logit contributions
 wave-0.008
 see-0.008
 waves-0.007
seat-0.007
 paddle-0.006
Token make
Token ID787
Feature activation+9.16e-01

Pos logit contributions
 see+0.006
 wave+0.006
 waves+0.005
 paddle+0.004
 sail+0.004
Neg logit contributions
xy-0.014
liv-0.014
oni-0.013
ork-0.013
 fault-0.013
Token your
Token ID534
Feature activation+1.20e+00

Pos logit contributions
 see+0.017
 waves+0.016
 wave+0.016
 voice+0.014
 sailor+0.013
Neg logit contributions
xy-0.033
liv-0.032
oni-0.030
ork-0.030
live-0.030
Token dreams
Token ID10625
Feature activation+4.85e+00

Pos logit contributions
 wave+0.021
 see+0.020
 waves+0.019
seat+0.019
 paddle+0.017
Neg logit contributions
xy-0.043
oni-0.041
liv-0.040
ork-0.039
 fault-0.038
Token come
Token ID1282
Feature activation+1.54e+00

Pos logit contributions
 wave+0.079
seat+0.076
 waves+0.074
 see+0.073
 reins+0.070
Neg logit contributions
xy-0.174
oni-0.166
ork-0.158
liv-0.158
 fault-0.153
Token true
Token ID2081
Feature activation-4.43e-01

Pos logit contributions
 see+0.032
 wave+0.032
 waves+0.029
seat+0.028
 voice+0.027
Neg logit contributions
xy-0.052
liv-0.051
oni-0.049
ork-0.046
 fault-0.045
Token."
Token ID526
Feature activation-2.11e-01

Pos logit contributions
xy+0.015
oni+0.014
liv+0.014
ork+0.013
 fault+0.013
Neg logit contributions
 wave-0.009
 see-0.009
 waves-0.008
seat-0.008
 paddle-0.007
Token
Token ID198
Feature activation+6.84e-01

Pos logit contributions
xy+0.007
oni+0.007
liv+0.007
ork+0.007
 fault+0.007
Neg logit contributions
 see-0.004
 wave-0.004
seat-0.004
 waves-0.004
 voice-0.003
Token
Token ID198
Feature activation+1.04e+00

Pos logit contributions
 see+0.012
 wave+0.011
seat+0.010
 voice+0.009
 waves+0.009
Neg logit contributions
xy-0.025
oni-0.024
 fault-0.024
liv-0.024
ork-0.023
Token
Token ID26391
Feature activation-4.43e-01

Pos logit contributions
 wave+0.007
 see+0.006
 waves+0.006
 trusts+0.006
 voice+0.006
Neg logit contributions
 fault-0.006
ork-0.006
xy-0.006
oni-0.006
ula-0.005
Token
Token ID129
Feature activation-1.45e+00

Pos logit contributions
xy+0.014
oni+0.014
liv+0.013
 fault+0.012
ork+0.012
Neg logit contributions
 see-0.010
 wave-0.010
seat-0.009
 waves-0.009
 transported-0.008
Token
Token ID241
Feature activation-9.01e-01

Pos logit contributions
xy+0.038
 fault+0.037
oni+0.036
tr+0.035
ork+0.035
Neg logit contributions
 see-0.036
 wave-0.034
 waves-0.033
 voice-0.031
 paddle-0.030
TokenI
Token ID40
Feature activation+4.38e-02

Pos logit contributions
xy+0.031
oni+0.031
liv+0.029
ork+0.029
 fault+0.028
Neg logit contributions
 see-0.018
 wave-0.016
seat-0.016
Leave-0.014
 waves-0.014
Token am
Token ID716
Feature activation+5.21e-01

Pos logit contributions
 wave+0.001
 waves+0.001
seat+0.001
 see+0.001
 reins+0.001
Neg logit contributions
xy-0.002
liv-0.001
oni-0.001
ork-0.001
 fault-0.001
Token blo
Token ID24924
Feature activation+4.70e+00

Pos logit contributions
 wave+0.011
 see+0.010
 waves+0.010
seat+0.009
 voice+0.009
Neg logit contributions
xy-0.018
oni-0.017
liv-0.017
ork-0.016
 fault-0.015
Tokenoming
Token ID3383
Feature activation+1.02e+00

Pos logit contributions
 wave+0.114
 see+0.112
 waves+0.105
 paddle+0.102
 sle+0.101
Neg logit contributions
xy-0.135
oni-0.123
liv-0.120
ork-0.114
ula-0.114
Token!
Token ID0
Feature activation+1.22e+00

Pos logit contributions
 wave+0.019
 see+0.018
 waves+0.017
seat+0.016
 voice+0.015
Neg logit contributions
xy-0.036
liv-0.034
oni-0.034
ork-0.033
 fault-0.032
Tokenâ
Token ID22940
Feature activation+3.44e-01

Pos logit contributions
 wave+0.025
 see+0.024
 waves+0.023
seat+0.023
 paddle+0.021
Neg logit contributions
xy-0.040
oni-0.038
liv-0.037
ork-0.035
 fault-0.035
Token
Token ID26391
Feature activation-5.38e-01

Pos logit contributions
 wave+0.010
 see+0.010
 waves+0.009
seat+0.009
 trusts+0.009
Neg logit contributions
xy-0.008
oni-0.008
 fault-0.007
ork-0.007
liv-0.006
Token said
Token ID531
Feature activation+1.35e+00

Pos logit contributions
xy+0.017
oni+0.016
liv+0.016
ork+0.015
 fault+0.015
Neg logit contributions
 wave-0.012
 see-0.012
seat-0.011
 waves-0.011
 transported-0.010
Token word
Token ID1573
Feature activation+1.26e+00

Pos logit contributions
 wave+0.011
 see+0.011
 waves+0.010
 voice+0.010
seat+0.009
Neg logit contributions
xy-0.018
liv-0.017
oni-0.016
ork-0.015
 fault-0.015
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
 wave+0.023
 see+0.023
 voice+0.020
 waves+0.020
 hand+0.020
Neg logit contributions
xy-0.044
liv-0.043
oni-0.040
 fault-0.040
live-0.038
Token It
Token ID632
Feature activation+5.48e-02

Pos logit contributions
 see+0.030
 wave+0.029
seat+0.027
 waves+0.026
 voice+0.025
Neg logit contributions
xy-0.045
oni-0.043
liv-0.043
ork-0.040
 fault-0.040
Token is
Token ID318
Feature activation-4.71e-01

Pos logit contributions
 wave+0.001
 see+0.001
 waves+0.001
 voice+0.001
 trusts+0.001
Neg logit contributions
xy-0.002
liv-0.002
oni-0.002
 fault-0.002
ork-0.002
Token '
Token ID705
Feature activation-1.03e+00

Pos logit contributions
xy+0.018
liv+0.017
ork+0.016
oni+0.015
live+0.015
Neg logit contributions
 see-0.009
 wave-0.009
 waves-0.008
 hand-0.007
 voice-0.007
Tokenbat
Token ID8664
Feature activation+4.59e+00

Pos logit contributions
xy+0.030
liv+0.030
 fault+0.029
oni+0.029
ork+0.026
Neg logit contributions
 see-0.024
 wave-0.023
seat-0.022
Leave-0.021
 waves-0.021
Token'.
Token ID4458
Feature activation+1.96e-01

Pos logit contributions
 wave+0.097
 see+0.095
seat+0.085
 waves+0.085
 voice+0.083
Neg logit contributions
xy-0.149
liv-0.147
 fault-0.139
oni-0.139
ork-0.132
Token It
Token ID632
Feature activation+4.92e-01

Pos logit contributions
 see+0.004
 wave+0.004
seat+0.004
 waves+0.004
 voice+0.004
Neg logit contributions
xy-0.006
oni-0.006
liv-0.006
ork-0.006
 fault-0.006
Token means
Token ID1724
Feature activation+2.46e-01

Pos logit contributions
 wave+0.009
 see+0.009
 waves+0.008
seat+0.008
 voice+0.008
Neg logit contributions
xy-0.017
liv-0.017
oni-0.016
ork-0.016
 fault-0.015
Token a
Token ID257
Feature activation-4.51e-01

Pos logit contributions
 wave+0.005
 see+0.005
 waves+0.005
 voice+0.005
seat+0.004
Neg logit contributions
xy-0.008
liv-0.008
oni-0.008
ork-0.007
live-0.007
Token stick
Token ID4859
Feature activation+1.72e-01

Pos logit contributions
xy+0.016
liv+0.016
oni+0.015
 fault+0.014
live+0.014
Neg logit contributions
 wave-0.009
 see-0.009
 waves-0.008
 voice-0.008
 sle-0.008
Token and
Token ID290
Feature activation+5.03e-01

Pos logit contributions
xy+0.003
oni+0.003
ork+0.003
 fault+0.003
liv+0.003
Neg logit contributions
seat-0.002
 wave-0.002
 waves-0.002
 see-0.002
 reins-0.002
Token her
Token ID607
Feature activation+6.53e-01

Pos logit contributions
 wave+0.006
 see+0.006
 waves+0.006
seat+0.006
 reins+0.005
Neg logit contributions
xy-0.021
oni-0.020
liv-0.019
ork-0.019
 fault-0.018
Token dream
Token ID4320
Feature activation+1.23e+00

Pos logit contributions
 wave+0.011
 see+0.011
seat+0.010
 waves+0.010
 paddle+0.009
Neg logit contributions
xy-0.024
oni-0.022
liv-0.022
ork-0.021
 fault-0.021
Token would
Token ID561
Feature activation-5.08e-02

Pos logit contributions
 wave+0.023
 see+0.023
 waves+0.021
seat+0.021
 voice+0.019
Neg logit contributions
xy-0.043
oni-0.041
liv-0.040
 fault-0.038
ork-0.038
Token keep
Token ID1394
Feature activation-5.20e-01

Pos logit contributions
xy+0.002
oni+0.002
liv+0.002
ork+0.002
 fault+0.002
Neg logit contributions
 wave-0.001
 see-0.001
 waves-0.001
seat-0.001
 paddle-0.001
Token blo
Token ID24924
Feature activation+4.44e+00

Pos logit contributions
oni+0.019
liv+0.018
xy+0.018
ork+0.017
live+0.017
Neg logit contributions
 see-0.010
 wave-0.010
 waves-0.009
 voice-0.008
 shine-0.008
Tokenoming
Token ID3383
Feature activation+9.87e-01

Pos logit contributions
 wave+0.105
 see+0.101
 waves+0.098
seat+0.098
Leave+0.098
Neg logit contributions
xy-0.127
oni-0.117
ork-0.114
her-0.110
liv-0.107
Token.
Token ID13
Feature activation-2.77e-03

Pos logit contributions
 wave+0.020
 see+0.020
 waves+0.019
seat+0.018
 voice+0.017
Neg logit contributions
xy-0.034
liv-0.032
oni-0.032
ork-0.030
 fault-0.029
Token<|endoftext|>
Token ID50256
Feature activation+9.77e-01

Pos logit contributions
xy+0.000
oni+0.000
liv+0.000
ork+0.000
 fault+0.000
Neg logit contributions
 see-0.000
 wave-0.000
seat-0.000
 waves-0.000
 voice-0.000
TokenOnce
Token ID7454
Feature activation-4.77e-01

Pos logit contributions
 see+0.014
 wave+0.013
seat+0.013
 waves+0.012
 hops+0.010
Neg logit contributions
xy-0.038
oni-0.037
liv-0.036
 fault-0.035
ork-0.034
Token upon
Token ID2402
Feature activation-2.60e-01

Pos logit contributions
xy+0.017
oni+0.017
liv+0.016
ork+0.015
 fault+0.015
Neg logit contributions
 see-0.009
 wave-0.008
seat-0.007
 waves-0.007
 hand-0.007
Token to
Token ID284
Feature activation+1.26e+00

Pos logit contributions
 wave+0.047
 see+0.046
 waves+0.043
seat+0.043
 paddle+0.040
Neg logit contributions
xy-0.056
oni-0.052
liv-0.052
ork-0.048
 fault-0.047
Token school
Token ID1524
Feature activation+2.88e-01

Pos logit contributions
 see+0.025
 wave+0.022
 waves+0.018
seat+0.018
 hand+0.017
Neg logit contributions
xy-0.051
liv-0.050
oni-0.047
live-0.046
ork-0.045
Token.
Token ID13
Feature activation+1.99e-01

Pos logit contributions
 see+0.006
 wave+0.006
seat+0.005
 waves+0.005
 voice+0.005
Neg logit contributions
xy-0.010
oni-0.010
liv-0.010
 fault-0.009
ork-0.009
Token
Token ID198
Feature activation+4.50e-01

Pos logit contributions
 see+0.004
 wave+0.004
seat+0.004
 waves+0.004
 voice+0.004
Neg logit contributions
xy-0.007
oni-0.006
liv-0.006
ork-0.006
 fault-0.005
Token
Token ID198
Feature activation+1.45e+00

Pos logit contributions
 see+0.008
 wave+0.007
seat+0.007
 waves+0.006
 voice+0.006
Neg logit contributions
xy-0.017
oni-0.016
liv-0.016
 fault-0.015
ork-0.015
TokenAt
Token ID2953
Feature activation+4.43e+00

Pos logit contributions
 see+0.028
 wave+0.025
seat+0.022
Leave+0.022
 waves+0.020
Neg logit contributions
xy-0.053
 fault-0.053
oni-0.051
liv-0.047
ork-0.045
Token school
Token ID1524
Feature activation+4.58e-01

Pos logit contributions
 see+0.085
 wave+0.084
seat+0.071
 waves+0.070
 breath+0.067
Neg logit contributions
xy-0.166
oni-0.155
liv-0.153
ork-0.146
 fault-0.145
Token,
Token ID11
Feature activation+1.59e+00

Pos logit contributions
 wave+0.008
 see+0.008
 waves+0.007
seat+0.007
 paddle+0.006
Neg logit contributions
xy-0.017
oni-0.016
liv-0.016
ork-0.015
 fault-0.015
Token she
Token ID673
Feature activation+1.00e+00

Pos logit contributions
 see+0.035
 wave+0.033
 waves+0.031
seat+0.030
 hand+0.028
Neg logit contributions
xy-0.056
oni-0.052
liv-0.049
ork-0.047
 fault-0.047
Token saw
Token ID2497
Feature activation+1.07e+00

Pos logit contributions
 wave+0.018
 see+0.017
seat+0.016
 waves+0.016
 reins+0.016
Neg logit contributions
xy-0.035
oni-0.034
liv-0.033
ork-0.032
 fault-0.031
Token Ben
Token ID3932
Feature activation+1.33e+00

Pos logit contributions
 see+0.022
 wave+0.021
 waves+0.021
seat+0.019
 voice+0.018
Neg logit contributions
xy-0.039
liv-0.036
oni-0.034
ork-0.034
live-0.033
Token blo
Token ID24924
Feature activation+4.24e+00

Pos logit contributions
 wave+0.014
 see+0.014
 waves+0.013
seat+0.012
 voice+0.011
Neg logit contributions
xy-0.026
liv-0.024
oni-0.024
ork-0.023
 fault-0.022
Tokenoming
Token ID3383
Feature activation+1.34e-01

Pos logit contributions
 wave+0.093
 see+0.092
 waves+0.083
seat+0.083
 paddle+0.080
Neg logit contributions
xy-0.137
oni-0.131
liv-0.122
ork-0.121
 fault-0.114
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
 wave+0.002
 waves+0.002
 see+0.002
seat+0.002
 voice+0.002
Neg logit contributions
xy-0.005
liv-0.005
oni-0.005
abella-0.005
ork-0.005
Token It
Token ID632
Feature activation+2.82e-01

Pos logit contributions
 see+0.022
 wave+0.021
 waves+0.020
seat+0.020
 voice+0.019
Neg logit contributions
xy-0.033
oni-0.031
liv-0.031
ork-0.029
 fault-0.028
Token is
Token ID318
Feature activation-6.64e-01

Pos logit contributions
 wave+0.005
 see+0.005
 waves+0.005
seat+0.004
 hops+0.004
Neg logit contributions
xy-0.010
oni-0.010
liv-0.009
ork-0.009
 fault-0.009
Token blo
Token ID24924
Feature activation+4.40e+00

Pos logit contributions
xy+0.026
liv+0.024
oni+0.023
ork+0.023
 fault+0.022
Neg logit contributions
 see-0.012
 wave-0.012
 waves-0.010
 voice-0.009
 hand-0.009
Tokenoming
Token ID3383
Feature activation+6.83e-01

Pos logit contributions
 wave+0.109
 see+0.107
 paddle+0.098
Leave+0.097
 reins+0.097
Neg logit contributions
xy-0.132
oni-0.125
ork-0.113
liv-0.111
her-0.108
Token now
Token ID783
Feature activation-3.75e-01

Pos logit contributions
 wave+0.013
 see+0.012
 waves+0.012
seat+0.011
 voice+0.010
Neg logit contributions
xy-0.025
liv-0.024
oni-0.023
ork-0.022
 fault-0.021
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
xy+0.015
liv+0.015
oni+0.014
ork+0.013
 fault+0.013
Neg logit contributions
 wave-0.005
 see-0.005
 waves-0.005
 voice-0.004
 hand-0.004
Token It
Token ID632
Feature activation+3.44e-01

Pos logit contributions
 see+0.027
 wave+0.027
 waves+0.025
seat+0.025
 voice+0.023
Neg logit contributions
xy-0.034
oni-0.032
liv-0.031
ork-0.029
 fault-0.028
Token is
Token ID318
Feature activation-4.27e-01

Pos logit contributions
 wave+0.007
 see+0.006
 waves+0.006
seat+0.006
 paddle+0.005
Neg logit contributions
xy-0.012
oni-0.011
liv-0.011
ork-0.011
 fault-0.010
Token you
Token ID345
Feature activation-2.82e-01

Pos logit contributions
 wave+0.015
 see+0.015
 waves+0.013
seat+0.013
 voice+0.012
Neg logit contributions
xy-0.025
oni-0.024
liv-0.023
 fault-0.022
ork-0.022
Token get
Token ID651
Feature activation+5.07e-01

Pos logit contributions
xy+0.009
oni+0.009
 fault+0.009
her+0.008
ork+0.008
Neg logit contributions
seat-0.005
 reins-0.005
 sle-0.004
Leave-0.004
 amaz-0.004
Token those
Token ID883
Feature activation+9.54e-01

Pos logit contributions
 wave+0.011
 see+0.011
seat+0.011
 waves+0.010
 paddle+0.010
Neg logit contributions
xy-0.016
oni-0.015
liv-0.015
ork-0.014
 fault-0.014
Token ice
Token ID4771
Feature activation+6.35e-01

Pos logit contributions
 wave+0.019
 see+0.018
seat+0.017
 waves+0.017
 paddle+0.016
Neg logit contributions
xy-0.032
oni-0.031
liv-0.030
ork-0.028
 fault-0.028
Token cre
Token ID1126
Feature activation-4.51e-01

Pos logit contributions
 wave+0.017
 see+0.016
seat+0.015
 waves+0.015
Leave+0.015
Neg logit contributions
xy-0.017
oni-0.017
liv-0.016
ork-0.015
 fault-0.015
Tokenams
Token ID4105
Feature activation+4.37e+00

Pos logit contributions
xy+0.015
liv+0.014
oni+0.014
ork+0.014
 fault+0.013
Neg logit contributions
 wave-0.009
 see-0.008
seat-0.008
 waves-0.008
 sle-0.007
Token?
Token ID30
Feature activation+4.00e-01

Pos logit contributions
 wave+0.074
 see+0.072
seat+0.067
 waves+0.067
Leave+0.062
Neg logit contributions
xy-0.158
oni-0.153
liv-0.147
ork-0.141
 fault-0.139
Token Where
Token ID6350
Feature activation+7.23e-01

Pos logit contributions
 wave+0.008
 see+0.008
seat+0.007
 waves+0.007
 paddle+0.006
Neg logit contributions
xy-0.014
oni-0.013
liv-0.013
ork-0.012
 fault-0.012
Token are
Token ID389
Feature activation+4.46e-01

Pos logit contributions
 see+0.013
 wave+0.013
 waves+0.011
seat+0.011
 voice+0.010
Neg logit contributions
xy-0.026
oni-0.025
liv-0.025
ork-0.023
 fault-0.023
Token your
Token ID534
Feature activation-1.59e-01

Pos logit contributions
 wave+0.009
 see+0.009
 waves+0.009
seat+0.008
 paddle+0.008
Neg logit contributions
xy-0.015
oni-0.014
liv-0.014
ork-0.013
 fault-0.013
Token toys
Token ID14958
Feature activation+1.81e+00

Pos logit contributions
oni+0.005
xy+0.005
 fault+0.005
liv+0.004
ork+0.004
Neg logit contributions
 wave-0.003
 see-0.003
Leave-0.003
seat-0.003
 transported-0.003
Token and
Token ID290
Feature activation-6.53e-02

Pos logit contributions
 wave+0.001
 see+0.001
 waves+0.001
seat+0.001
 paddle+0.001
Neg logit contributions
xy-0.002
oni-0.002
liv-0.002
ork-0.002
 fault-0.002
Token asked
Token ID1965
Feature activation+2.63e-01

Pos logit contributions
xy+0.002
oni+0.002
liv+0.002
ork+0.002
 fault+0.002
Neg logit contributions
 wave-0.001
 see-0.001
 waves-0.001
seat-0.001
 paddle-0.001
Token,
Token ID11
Feature activation-1.38e+00

Pos logit contributions
 see+0.004
 wave+0.003
 waves+0.003
 voice+0.003
 hand+0.003
Neg logit contributions
xy-0.011
liv-0.011
oni-0.010
ork-0.010
 fault-0.010
Token "
Token ID366
Feature activation-3.84e-01

Pos logit contributions
xy+0.036
oni+0.034
liv+0.033
ork+0.031
 fault+0.030
Neg logit contributions
 wave-0.038
 see-0.036
seat-0.035
 waves-0.035
 sle-0.033
TokenAre
Token ID8491
Feature activation+3.88e-01

Pos logit contributions
xy+0.014
oni+0.013
liv+0.013
ork+0.013
 fault+0.012
Neg logit contributions
 wave-0.007
 see-0.006
 waves-0.006
seat-0.006
 paddle-0.005
Token stars
Token ID5788
Feature activation+4.35e+00

Pos logit contributions
 see+0.006
 wave+0.005
 waves+0.005
 hello+0.004
 sailors+0.004
Neg logit contributions
xy-0.015
liv-0.015
ork-0.015
oni-0.015
 virtue-0.014
Token high
Token ID1029
Feature activation+1.83e-01

Pos logit contributions
seat+0.089
Leave+0.086
 reins+0.084
 wave+0.083
 waves+0.080
Neg logit contributions
xy-0.136
oni-0.132
 fault-0.115
ork-0.115
liv-0.114
Token up
Token ID510
Feature activation-5.89e-01

Pos logit contributions
 wave+0.003
 see+0.003
seat+0.003
 waves+0.003
 paddle+0.003
Neg logit contributions
xy-0.007
oni-0.006
liv-0.006
ork-0.006
 fault-0.006
Token?"
Token ID1701
Feature activation-1.14e+00

Pos logit contributions
xy+0.021
oni+0.020
liv+0.019
ork+0.018
 fault+0.018
Neg logit contributions
 wave-0.011
 see-0.010
 waves-0.010
seat-0.010
 reins-0.009
Token
Token ID198
Feature activation+8.89e-01

Pos logit contributions
xy+0.035
oni+0.033
liv+0.033
ork+0.031
 fault+0.030
Neg logit contributions
 wave-0.026
 see-0.025
 waves-0.024
seat-0.024
 paddle-0.022
Token
Token ID198
Feature activation+1.33e+00

Pos logit contributions
 see+0.017
 wave+0.017
seat+0.015
 waves+0.015
 voice+0.014
Neg logit contributions
xy-0.031
oni-0.030
liv-0.029
 fault-0.028
ork-0.027
Token magical
Token ID10883
Feature activation+3.56e-01

Pos logit contributions
xy+0.028
liv+0.027
oni+0.026
ork+0.025
 fault+0.024
Neg logit contributions
 wave-0.019
 see-0.018
 waves-0.017
seat-0.016
 transported-0.015
Token and
Token ID290
Feature activation+3.26e-01

Pos logit contributions
 wave+0.008
 see+0.008
seat+0.008
Leave+0.008
 paddle+0.007
Neg logit contributions
xy-0.011
oni-0.010
ork-0.009
 fault-0.009
liv-0.009
Token can
Token ID460
Feature activation-1.53e-01

Pos logit contributions
 wave+0.007
seat+0.007
 see+0.007
 waves+0.007
 reins+0.007
Neg logit contributions
xy-0.010
oni-0.009
liv-0.009
ork-0.009
 fault-0.008
Token make
Token ID787
Feature activation+1.09e+00

Pos logit contributions
xy+0.006
oni+0.006
liv+0.005
ork+0.005
her+0.005
Neg logit contributions
 wave-0.002
seat-0.002
 waves-0.002
 see-0.002
 reins-0.002
Token our
Token ID674
Feature activation+3.56e-01

Pos logit contributions
 see+0.022
 wave+0.021
 waves+0.021
 voice+0.018
seat+0.018
Neg logit contributions
xy-0.037
liv-0.036
oni-0.036
live-0.033
ork-0.033
Token dreams
Token ID10625
Feature activation+4.35e+00

Pos logit contributions
 wave+0.007
seat+0.007
 see+0.007
 reins+0.006
 waves+0.006
Neg logit contributions
xy-0.011
oni-0.011
 fault-0.011
ork-0.011
liv-0.010
Token come
Token ID1282
Feature activation+1.21e+00

Pos logit contributions
seat+0.066
Leave+0.065
 reins+0.065
 wave+0.062
 trusts+0.061
Neg logit contributions
xy-0.154
oni-0.146
ork-0.145
her-0.138
 fault-0.134
Token true
Token ID2081
Feature activation-3.54e-01

Pos logit contributions
 wave+0.030
 see+0.028
 waves+0.028
seat+0.027
 paddle+0.026
Neg logit contributions
xy-0.035
oni-0.033
ork-0.031
liv-0.030
 fault-0.030
Token!"
Token ID2474
Feature activation-1.67e+00

Pos logit contributions
xy+0.011
oni+0.011
liv+0.010
ork+0.010
 fault+0.010
Neg logit contributions
 wave-0.008
 see-0.007
seat-0.007
 waves-0.007
 paddle-0.007
Token
Token ID220
Feature activation-5.82e-01

Pos logit contributions
xy+0.059
oni+0.055
liv+0.053
ork+0.052
 fault+0.051
Neg logit contributions
 see-0.033
 wave-0.032
seat-0.030
 waves-0.030
 hand-0.026
Token
Token ID198
Feature activation+3.75e-01

Pos logit contributions
xy+0.021
oni+0.020
liv+0.020
ork+0.019
 fault+0.018
Neg logit contributions
 see-0.010
seat-0.009
 wave-0.009
 waves-0.008
 voice-0.007
Token and
Token ID290
Feature activation+3.33e-01

Pos logit contributions
xy+0.027
liv+0.025
oni+0.025
ork+0.024
 fault+0.023
Neg logit contributions
 wave-0.013
 see-0.013
 waves-0.012
seat-0.011
 voice-0.010
Token soon
Token ID2582
Feature activation+1.47e+00

Pos logit contributions
 see+0.007
 wave+0.007
 waves+0.006
seat+0.006
 amaz+0.006
Neg logit contributions
xy-0.011
oni-0.011
liv-0.010
ork-0.010
 fault-0.010
Token her
Token ID607
Feature activation+5.90e-01

Pos logit contributions
 see+0.031
 wave+0.031
 waves+0.029
seat+0.027
 transported+0.026
Neg logit contributions
xy-0.050
oni-0.047
liv-0.046
ork-0.043
 fault-0.042
Token garden
Token ID11376
Feature activation-2.33e-01

Pos logit contributions
 wave+0.010
 see+0.009
 waves+0.009
 voice+0.008
seat+0.008
Neg logit contributions
xy-0.023
liv-0.021
oni-0.021
ork-0.020
 fault-0.019
Token was
Token ID373
Feature activation-1.05e-01

Pos logit contributions
xy+0.008
liv+0.008
oni+0.008
 fault+0.007
ork+0.007
Neg logit contributions
 wave-0.004
 see-0.004
 waves-0.004
 voice-0.004
seat-0.004
Token blo
Token ID24924
Feature activation+4.32e+00

Pos logit contributions
xy+0.004
liv+0.003
oni+0.003
ork+0.003
 fault+0.003
Neg logit contributions
 see-0.002
 wave-0.002
 waves-0.002
seat-0.002
 transported-0.002
Tokenoming
Token ID3383
Feature activation+4.12e-01

Pos logit contributions
 wave+0.095
 see+0.093
seat+0.086
 waves+0.086
 paddle+0.082
Neg logit contributions
xy-0.139
oni-0.131
liv-0.125
ork-0.123
her-0.116
Token with
Token ID351
Feature activation-6.48e-01

Pos logit contributions
 wave+0.008
 see+0.007
 waves+0.007
seat+0.006
 transported+0.006
Neg logit contributions
xy-0.015
liv-0.014
oni-0.014
ork-0.013
 fault-0.013
Token all
Token ID477
Feature activation+2.92e-01

Pos logit contributions
xy+0.022
liv+0.021
oni+0.020
ork+0.020
 fault+0.019
Neg logit contributions
 see-0.013
 wave-0.013
 waves-0.013
seat-0.011
 amaz-0.011
Token sorts
Token ID10524
Feature activation-5.46e-01

Pos logit contributions
 wave+0.005
 see+0.005
 waves+0.005
seat+0.005
 voice+0.004
Neg logit contributions
xy-0.011
oni-0.010
liv-0.010
ork-0.010
 fault-0.009
Token of
Token ID286
Feature activation-1.17e+00

Pos logit contributions
xy+0.023
oni+0.022
liv+0.022
ork+0.021
 fault+0.020
Neg logit contributions
 wave-0.006
 see-0.006
 waves-0.005
seat-0.005
 paddle-0.005
Token the
Token ID262
Feature activation+3.49e-01

Pos logit contributions
 see+0.020
 wave+0.019
 waves+0.017
seat+0.016
 paddle+0.015
Neg logit contributions
xy-0.047
oni-0.045
liv-0.045
ork-0.043
 fault-0.042
Token park
Token ID3952
Feature activation-3.94e-01

Pos logit contributions
 wave+0.007
 see+0.007
 waves+0.006
seat+0.006
 paddle+0.006
Neg logit contributions
xy-0.012
liv-0.012
oni-0.012
 fault-0.011
ork-0.011
Token.
Token ID13
Feature activation-6.89e-01

Pos logit contributions
xy+0.016
liv+0.016
oni+0.015
 fault+0.015
live+0.014
Neg logit contributions
 see-0.007
 wave-0.006
seat-0.005
 hand-0.005
 waves-0.005
Token
Token ID198
Feature activation-2.53e-01

Pos logit contributions
xy+0.024
oni+0.023
liv+0.023
ork+0.021
 fault+0.021
Neg logit contributions
 see-0.013
 wave-0.012
seat-0.012
 waves-0.011
 hand-0.010
Token
Token ID198
Feature activation+7.05e-01

Pos logit contributions
xy+0.009
oni+0.009
liv+0.009
 fault+0.008
ork+0.008
Neg logit contributions
 see-0.005
 wave-0.004
seat-0.004
 waves-0.004
 voice-0.003
TokenAt
Token ID2953
Feature activation+4.27e+00

Pos logit contributions
 see+0.013
 wave+0.012
Leave+0.011
seat+0.010
 waves+0.010
Neg logit contributions
xy-0.026
oni-0.025
 fault-0.025
liv-0.023
ork-0.022
Token the
Token ID262
Feature activation+5.65e-01

Pos logit contributions
 wave+0.067
 see+0.066
 waves+0.054
seat+0.053
 breath+0.051
Neg logit contributions
xy-0.174
oni-0.162
liv-0.160
ork-0.158
 fault-0.154
Token park
Token ID3952
Feature activation-2.07e-01

Pos logit contributions
 wave+0.012
 see+0.012
 waves+0.011
seat+0.011
 paddle+0.010
Neg logit contributions
xy-0.019
oni-0.018
liv-0.017
ork-0.017
 fault-0.016
Token,
Token ID11
Feature activation+1.21e+00

Pos logit contributions
xy+0.009
oni+0.008
liv+0.008
 fault+0.008
ork+0.008
Neg logit contributions
 see-0.003
 wave-0.003
 waves-0.002
seat-0.002
 voice-0.002
Token Lily
Token ID20037
Feature activation+2.29e-01

Pos logit contributions
 see+0.031
 wave+0.028
 waves+0.026
 hand+0.025
 girl+0.025
Neg logit contributions
xy-0.040
oni-0.037
 fault-0.034
liv-0.034
ork-0.032
Token saw
Token ID2497
Feature activation+7.86e-01

Pos logit contributions
 see+0.003
 wave+0.003
 waves+0.003
seat+0.002
 transported+0.002
Neg logit contributions
xy-0.009
liv-0.009
oni-0.009
 fault-0.009
ork-0.008
Token little
Token ID1310
Feature activation+2.84e+00

Pos logit contributions
seat+0.019
 waves+0.017
 see+0.017
 transported+0.016
 reins+0.016
Neg logit contributions
ork-0.026
oni-0.026
xy-0.025
 virtue-0.025
her-0.024
Token better
Token ID1365
Feature activation+1.08e+00

Pos logit contributions
seat+0.073
 reins+0.069
 see+0.066
 sle+0.065
 transported+0.065
Neg logit contributions
 fault-0.065
oni-0.065
xy-0.061
 wrong-0.060
 virtue-0.057
Token.
Token ID13
Feature activation+6.17e-01

Pos logit contributions
 wave+0.023
 see+0.022
 waves+0.021
seat+0.021
 reins+0.020
Neg logit contributions
xy-0.035
oni-0.033
liv-0.032
ork-0.031
 fault-0.030
Token
Token ID198
Feature activation+7.83e-01

Pos logit contributions
 wave+0.011
 see+0.010
 waves+0.010
seat+0.010
 paddle+0.008
Neg logit contributions
xy-0.023
oni-0.022
liv-0.021
ork-0.020
 fault-0.020
Token
Token ID198
Feature activation+1.60e+00

Pos logit contributions
 see+0.014
 wave+0.013
seat+0.012
 waves+0.012
 voice+0.011
Neg logit contributions
xy-0.028
oni-0.027
liv-0.027
 fault-0.026
ork-0.025
TokenAt
Token ID2953
Feature activation+4.27e+00

Pos logit contributions
 see+0.037
 wave+0.037
seat+0.034
 waves+0.033
Leave+0.032
Neg logit contributions
xy-0.049
 fault-0.048
oni-0.046
liv-0.044
ork-0.043
Token school
Token ID1524
Feature activation+4.84e-01

Pos logit contributions
 wave+0.080
 see+0.078
 waves+0.067
seat+0.065
 breath+0.063
Neg logit contributions
xy-0.162
liv-0.152
oni-0.152
ork-0.146
 fault-0.144
Token,
Token ID11
Feature activation+1.40e+00

Pos logit contributions
 see+0.007
 wave+0.007
 waves+0.006
seat+0.006
 voice+0.006
Neg logit contributions
xy-0.019
oni-0.018
liv-0.018
ork-0.017
 fault-0.017
Token Tom
Token ID4186
Feature activation-4.20e-01

Pos logit contributions
 see+0.040
 wave+0.039
 waves+0.036
seat+0.036
 sailor+0.033
Neg logit contributions
xy-0.039
oni-0.035
liv-0.034
ork-0.033
 fault-0.032
Token meets
Token ID11185
Feature activation+4.93e-01

Pos logit contributions
xy+0.015
oni+0.015
liv+0.014
 fault+0.013
ork+0.013
Neg logit contributions
 wave-0.007
 see-0.007
seat-0.007
 reins-0.006
 waves-0.006
Token his
Token ID465
Feature activation+4.10e-01

Pos logit contributions
 see+0.008
 wave+0.008
 waves+0.008
seat+0.007
 voice+0.007
Neg logit contributions
xy-0.020
liv-0.018
ork-0.017
oni-0.017
live-0.017
Token you
Token ID345
Feature activation-4.62e-01

Pos logit contributions
xy+0.014
oni+0.014
liv+0.013
 fault+0.013
ork+0.013
Neg logit contributions
 see-0.008
 wave-0.008
 waves-0.007
 voice-0.007
 hops-0.006
Token like
Token ID588
Feature activation+5.89e-01

Pos logit contributions
xy+0.015
oni+0.015
liv+0.014
 fault+0.014
ork+0.014
Neg logit contributions
seat-0.008
 sle-0.008
 waves-0.008
 reins-0.008
 wave-0.008
Token the
Token ID262
Feature activation+4.40e-01

Pos logit contributions
 wave+0.013
 see+0.013
seat+0.012
 waves+0.012
 paddle+0.011
Neg logit contributions
xy-0.019
oni-0.018
liv-0.017
ork-0.016
 fault-0.016
Token b
Token ID275
Feature activation+7.02e-02

Pos logit contributions
 wave+0.008
 see+0.007
 waves+0.007
seat+0.007
 paddle+0.006
Neg logit contributions
xy-0.016
oni-0.015
liv-0.015
ork-0.014
 fault-0.014
Tokenunn
Token ID20935
Feature activation+6.28e-01

Pos logit contributions
 wave+0.002
 see+0.002
 waves+0.001
seat+0.001
 reins+0.001
Neg logit contributions
xy-0.002
liv-0.002
oni-0.002
ork-0.002
 fault-0.002
Tokenies
Token ID444
Feature activation+4.27e+00

Pos logit contributions
 wave+0.013
 see+0.013
seat+0.012
 waves+0.012
 amaz+0.011
Neg logit contributions
xy-0.021
oni-0.020
liv-0.018
 fault-0.018
ork-0.018
Token?"
Token ID1701
Feature activation-2.37e-01

Pos logit contributions
 wave+0.068
 see+0.064
seat+0.062
 waves+0.061
 reins+0.056
Neg logit contributions
xy-0.159
oni-0.152
liv-0.148
ork-0.143
 fault-0.140
Token the
Token ID262
Feature activation+6.46e-01

Pos logit contributions
xy+0.008
liv+0.008
oni+0.007
 fault+0.007
ork+0.007
Neg logit contributions
 see-0.006
 wave-0.005
 waves-0.005
 voice-0.005
seat-0.004
Token lady
Token ID10846
Feature activation+8.11e-01

Pos logit contributions
 wave+0.010
 see+0.010
seat+0.009
 waves+0.009
 transported+0.008
Neg logit contributions
xy-0.024
oni-0.023
liv-0.022
ork-0.022
 fault-0.021
Token asks
Token ID7893
Feature activation+4.16e-01

Pos logit contributions
 wave+0.011
 see+0.010
seat+0.010
 waves+0.009
 reins+0.009
Neg logit contributions
xy-0.032
oni-0.031
liv-0.029
ork-0.028
her-0.027
Token.
Token ID13
Feature activation+3.22e-01

Pos logit contributions
 see+0.006
 wave+0.006
 voice+0.006
 waves+0.005
 hand+0.005
Neg logit contributions
liv-0.017
xy-0.017
live-0.015
 fault-0.015
oni-0.015
Token enough
Token ID1576
Feature activation-4.96e-02

Pos logit contributions
 wave+0.034
 see+0.034
 waves+0.031
seat+0.028
 sailor+0.027
Neg logit contributions
xy-0.072
oni-0.068
liv-0.067
ula-0.064
ork-0.063
Token,
Token ID11
Feature activation-4.31e-01

Pos logit contributions
xy+0.002
oni+0.002
liv+0.002
ork+0.002
 fault+0.002
Neg logit contributions
 wave-0.001
 see-0.001
seat-0.001
 waves-0.001
 reins-0.001
Token he
Token ID339
Feature activation-9.66e-01

Pos logit contributions
xy+0.015
oni+0.014
liv+0.014
ork+0.014
 fault+0.013
Neg logit contributions
 wave-0.008
 see-0.007
 waves-0.007
seat-0.007
 reins-0.006
Token found
Token ID1043
Feature activation+2.49e-01

Pos logit contributions
xy+0.037
oni+0.035
liv+0.035
 fault+0.033
ork+0.032
Neg logit contributions
 see-0.016
 wave-0.016
 waves-0.015
seat-0.013
 paddle-0.013
Token a
Token ID257
Feature activation-4.47e-01

Pos logit contributions
 wave+0.005
 see+0.005
 waves+0.004
seat+0.004
 paddle+0.004
Neg logit contributions
xy-0.009
oni-0.008
liv-0.008
ork-0.008
 fault-0.008
Token blo
Token ID24924
Feature activation+4.26e+00

Pos logit contributions
xy+0.017
oni+0.016
liv+0.016
 fault+0.015
ork+0.015
Neg logit contributions
 wave-0.008
 see-0.007
 waves-0.007
seat-0.006
 voice-0.006
Tokenoming
Token ID3383
Feature activation+2.43e-01

Pos logit contributions
 wave+0.090
 see+0.090
 steer+0.086
 paddle+0.085
seat+0.083
Neg logit contributions
xy-0.128
oni-0.124
ork-0.123
liv-0.121
ula-0.113
Token l
Token ID300
Feature activation+9.52e-01

Pos logit contributions
 wave+0.004
 see+0.004
 waves+0.004
 voice+0.003
seat+0.003
Neg logit contributions
xy-0.010
liv-0.009
oni-0.009
 fault-0.008
ork-0.008
Tokenily
Token ID813
Feature activation+2.75e-01

Pos logit contributions
 wave+0.029
 see+0.028
 waves+0.027
seat+0.027
 transported+0.026
Neg logit contributions
xy-0.022
oni-0.020
liv-0.020
ork-0.018
 fault-0.018
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
 wave+0.006
 see+0.006
 waves+0.006
 hand+0.006
 transported+0.006
Neg logit contributions
xy-0.008
liv-0.008
oni-0.008
 fault-0.007
ork-0.007
Token He
Token ID679
Feature activation+4.72e-02

Pos logit contributions
xy+0.000
oni+0.000
liv+0.000
ork+0.000
 fault+0.000
Neg logit contributions
 see-0.000
 wave-0.000
seat-0.000
 waves-0.000
 voice-0.000
Token the
Token ID262
Feature activation+1.10e-01

Pos logit contributions
 see+0.012
 wave+0.012
 waves+0.012
seat+0.010
 voice+0.010
Neg logit contributions
xy-0.025
liv-0.023
ork-0.022
oni-0.022
 virtue-0.021
Token flower
Token ID15061
Feature activation+1.17e+00

Pos logit contributions
 wave+0.003
 waves+0.002
 see+0.002
 voice+0.002
seat+0.002
Neg logit contributions
xy-0.004
liv-0.004
abella-0.003
ork-0.003
live-0.003
Token that
Token ID326
Feature activation+1.08e+00

Pos logit contributions
 wave+0.022
 see+0.021
 waves+0.020
seat+0.020
 paddle+0.018
Neg logit contributions
xy-0.041
oni-0.039
liv-0.039
ork-0.037
 fault-0.036
Token was
Token ID373
Feature activation-2.69e-01

Pos logit contributions
 see+0.019
 wave+0.019
 waves+0.017
seat+0.015
 hops+0.014
Neg logit contributions
xy-0.041
liv-0.039
oni-0.038
 fault-0.037
ork-0.036
Token almost
Token ID2048
Feature activation+7.20e-01

Pos logit contributions
xy+0.010
liv+0.010
oni+0.009
ork+0.009
live+0.008
Neg logit contributions
 see-0.005
 wave-0.005
 waves-0.005
 hand-0.004
 voice-0.004
Token blo
Token ID24924
Feature activation+4.24e+00

Pos logit contributions
 wave+0.014
 see+0.014
 waves+0.013
seat+0.012
 voice+0.011
Neg logit contributions
xy-0.026
liv-0.024
oni-0.024
ork-0.023
 fault-0.022
Tokenoming
Token ID3383
Feature activation+1.34e-01

Pos logit contributions
 wave+0.093
 see+0.092
 waves+0.083
seat+0.083
 paddle+0.080
Neg logit contributions
xy-0.137
oni-0.131
liv-0.122
ork-0.121
 fault-0.114
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
 wave+0.002
 waves+0.002
 see+0.002
seat+0.002
 voice+0.002
Neg logit contributions
xy-0.005
liv-0.005
oni-0.005
abella-0.005
ork-0.005
Token It
Token ID632
Feature activation+2.82e-01

Pos logit contributions
 see+0.022
 wave+0.021
 waves+0.020
seat+0.020
 voice+0.019
Neg logit contributions
xy-0.033
oni-0.031
liv-0.031
ork-0.029
 fault-0.028
Token is
Token ID318
Feature activation-6.64e-01

Pos logit contributions
 wave+0.005
 see+0.005
 waves+0.005
seat+0.004
 hops+0.004
Neg logit contributions
xy-0.010
oni-0.010
liv-0.009
ork-0.009
 fault-0.009
Token blo
Token ID24924
Feature activation+4.40e+00

Pos logit contributions
xy+0.026
liv+0.024
oni+0.023
ork+0.023
 fault+0.022
Neg logit contributions
 see-0.012
 wave-0.012
 waves-0.010
 voice-0.009
 hand-0.009
Token party
Token ID2151
Feature activation-3.39e-01

Pos logit contributions
 wave+0.038
 see+0.036
 waves+0.034
seat+0.033
 voice+0.031
Neg logit contributions
xy-0.055
oni-0.051
liv-0.051
 fault-0.048
ork-0.046
Token together
Token ID1978
Feature activation-6.21e-01

Pos logit contributions
xy+0.013
liv+0.012
oni+0.012
 fault+0.011
ork+0.011
Neg logit contributions
 wave-0.006
 see-0.006
 waves-0.005
seat-0.005
 voice-0.005
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
xy+0.025
liv+0.024
oni+0.023
 fault+0.022
ork+0.022
Neg logit contributions
 see-0.010
 wave-0.010
 waves-0.008
 hand-0.008
seat-0.008
Token\n
Token ID198
Feature activation-8.20e-01

Pos logit contributions
xy+0.038
oni+0.036
liv+0.035
 fault+0.033
ork+0.033
Neg logit contributions
 see-0.033
 wave-0.031
seat-0.031
 waves-0.030
 voice-0.027
Token\n
Token ID198
Feature activation-9.46e-02

Pos logit contributions
xy+0.027
oni+0.026
liv+0.025
ork+0.024
 fault+0.024
Neg logit contributions
 wave-0.017
 see-0.017
 waves-0.015
seat-0.015
 paddle-0.014
TokenAt
Token ID2953
Feature activation+4.23e+00

Pos logit contributions
xy+0.003
oni+0.003
 fault+0.003
liv+0.003
ork+0.003
Neg logit contributions
 see-0.002
 wave-0.002
 waves-0.002
seat-0.002
Leave-0.002
Token the
Token ID262
Feature activation+5.94e-01

Pos logit contributions
 see+0.068
 wave+0.068
 waves+0.055
seat+0.053
 breath+0.053
Neg logit contributions
xy-0.169
oni-0.161
liv-0.158
ork-0.155
 fault-0.152
Token party
Token ID2151
Feature activation+8.54e-02

Pos logit contributions
 wave+0.015
 see+0.014
 waves+0.014
seat+0.012
 voice+0.012
Neg logit contributions
xy-0.019
oni-0.017
liv-0.017
 fault-0.016
ork-0.016
Token,
Token ID11
Feature activation+7.14e-01

Pos logit contributions
 wave+0.001
 see+0.001
 waves+0.001
seat+0.001
 voice+0.001
Neg logit contributions
xy-0.003
oni-0.003
liv-0.003
 fault-0.003
ork-0.003
Token Lily
Token ID20037
Feature activation+4.90e-01

Pos logit contributions
 see+0.018
 wave+0.017
 waves+0.016
seat+0.015
 sail+0.015
Neg logit contributions
xy-0.023
oni-0.021
 fault-0.020
liv-0.020
ork-0.020
Token danced
Token ID39480
Feature activation+1.20e+00

Pos logit contributions
 wave+0.008
 see+0.008
 waves+0.008
seat+0.007
 paddle+0.006
Neg logit contributions
xy-0.018
oni-0.017
liv-0.017
ork-0.016
 fault-0.016
Token university
Token ID6403
Feature activation+8.16e-02

Pos logit contributions
xy+0.020
oni+0.018
liv+0.018
ork+0.017
 fault+0.016
Neg logit contributions
 wave-0.020
 see-0.020
 waves-0.019
seat-0.018
 paddle-0.017
Token is
Token ID318
Feature activation-1.09e+00

Pos logit contributions
 wave+0.002
 see+0.002
 waves+0.001
seat+0.001
 voice+0.001
Neg logit contributions
xy-0.003
oni-0.003
liv-0.003
ork-0.003
 fault-0.003
Token.
Token ID13
Feature activation+7.36e-01

Pos logit contributions
xy+0.037
oni+0.035
liv+0.034
ork+0.032
 fault+0.032
Neg logit contributions
 wave-0.022
 see-0.022
 waves-0.020
seat-0.020
 paddle-0.018
Token\n
Token ID198
Feature activation+1.44e-01

Pos logit contributions
 see+0.016
 wave+0.016
seat+0.015
 waves+0.015
 voice+0.014
Neg logit contributions
xy-0.024
oni-0.022
liv-0.022
ork-0.021
 fault-0.020
Token\n
Token ID198
Feature activation+1.27e+00

Pos logit contributions
 see+0.003
 wave+0.002
seat+0.002
 waves+0.002
 voice+0.002
Neg logit contributions
xy-0.005
oni-0.005
liv-0.005
 fault-0.005
ork-0.005
TokenAt
Token ID2953
Feature activation+4.23e+00

Pos logit contributions
 see+0.024
 wave+0.023
seat+0.021
 waves+0.020
 pack+0.020
Neg logit contributions
xy-0.046
 fault-0.044
oni-0.044
liv-0.041
ork-0.039
Token the
Token ID262
Feature activation+5.48e-01

Pos logit contributions
 wave+0.076
 see+0.075
 waves+0.066
seat+0.066
 breath+0.060
Neg logit contributions
xy-0.156
oni-0.148
liv-0.146
ork-0.140
 fault-0.138
Token university
Token ID6403
Feature activation+5.00e-01

Pos logit contributions
 see+0.013
 wave+0.013
 waves+0.012
 voice+0.011
seat+0.011
Neg logit contributions
xy-0.018
oni-0.017
liv-0.016
 fault-0.015
ork-0.015
Token,
Token ID11
Feature activation+1.43e+00

Pos logit contributions
 see+0.008
 wave+0.007
seat+0.006
 waves+0.006
 voice+0.006
Neg logit contributions
xy-0.020
oni-0.019
liv-0.019
 fault-0.018
ork-0.018
Token they
Token ID484
Feature activation+8.51e-01

Pos logit contributions
 see+0.032
 wave+0.030
 waves+0.028
seat+0.026
 hand+0.026
Neg logit contributions
xy-0.049
oni-0.046
liv-0.044
ork-0.043
 fault-0.042
Token see
Token ID766
Feature activation+1.67e+00

Pos logit contributions
 wave+0.006
 see+0.005
seat+0.005
 waves+0.005
 reins+0.003
Neg logit contributions
xy-0.039
oni-0.038
liv-0.037
ork-0.036
 fault-0.035
TokenCan
Token ID6090
Feature activation+8.00e-01

Pos logit contributions
Leave+0.015
 wave+0.015
 see+0.015
Who+0.014
 waves+0.013
Neg logit contributions
xy-0.035
oni-0.035
 fault-0.034
liv-0.034
ork-0.033
Token we
Token ID356
Feature activation+5.95e-01

Pos logit contributions
 see+0.011
 wave+0.010
 waves+0.009
 voice+0.008
 sail+0.008
Neg logit contributions
xy-0.033
oni-0.032
liv-0.032
 fault-0.030
ork-0.029
Token go
Token ID467
Feature activation+9.65e-01

Pos logit contributions
seat+0.007
 wave+0.007
 waves+0.007
 see+0.006
 reins+0.006
Neg logit contributions
xy-0.024
oni-0.023
liv-0.022
ork-0.022
 fault-0.021
Token to
Token ID284
Feature activation+1.15e+00

Pos logit contributions
 amaz+0.025
Leave+0.023
 hops+0.022
 trusts+0.022
 delight+0.022
Neg logit contributions
xy-0.025
ork-0.022
oni-0.022
her-0.020
ess-0.020
Token the
Token ID262
Feature activation+2.37e-01

Pos logit contributions
 wave+0.023
seat+0.021
 see+0.021
 waves+0.021
 reins+0.019
Neg logit contributions
xy-0.038
oni-0.036
liv-0.035
ork-0.034
 fault-0.033
Token stars
Token ID5788
Feature activation+4.23e+00

Pos logit contributions
 wave+0.004
 see+0.004
 waves+0.004
 sail+0.003
 sailor+0.003
Neg logit contributions
liv-0.010
xy-0.010
oni-0.009
live-0.009
abella-0.009
Token and
Token ID290
Feature activation+1.66e+00

Pos logit contributions
 wave+0.072
 see+0.069
 waves+0.066
seat+0.066
 reins+0.060
Neg logit contributions
xy-0.153
oni-0.146
liv-0.141
ork-0.136
 fault-0.134
Token planets
Token ID14705
Feature activation+3.63e+00

Pos logit contributions
seat+0.022
 wave+0.019
 reins+0.019
 transported+0.018
Leave+0.018
Neg logit contributions
xy-0.060
oni-0.057
ork-0.055
liv-0.053
 fault-0.053
Token?"
Token ID1701
Feature activation-8.94e-02

Pos logit contributions
 wave+0.065
seat+0.062
 waves+0.061
 see+0.061
 reins+0.057
Neg logit contributions
xy-0.125
oni-0.119
liv-0.113
ork-0.111
 fault-0.109
Token Anna
Token ID11735
Feature activation+1.15e+00

Pos logit contributions
xy+0.002
oni+0.002
liv+0.002
ork+0.001
 fault+0.001
Neg logit contributions
 wave-0.003
 see-0.003
seat-0.003
 waves-0.003
 reins-0.003
Token asks
Token ID7893
Feature activation+7.22e-01

Pos logit contributions
 wave+0.009
 see+0.009
 waves+0.007
seat+0.007
 voice+0.005
Neg logit contributions
xy-0.053
liv-0.051
oni-0.051
ork-0.049
 fault-0.048
Token her
Token ID607
Feature activation+2.35e+00

Pos logit contributions
 see+0.018
 wave+0.017
seat+0.016
 hand+0.015
 voice+0.015
Neg logit contributions
xy-0.039
oni-0.036
liv-0.035
ork-0.033
 fault-0.032
Token bed
Token ID3996
Feature activation+5.73e-01

Pos logit contributions
 wave+0.045
 see+0.043
 waves+0.041
seat+0.040
Leave+0.039
Neg logit contributions
ork-0.074
xy-0.074
liv-0.070
 fault-0.068
oni-0.068
Token.
Token ID13
Feature activation-4.79e-01

Pos logit contributions
 see+0.008
 wave+0.008
 voice+0.007
seat+0.007
 hand+0.007
Neg logit contributions
xy-0.023
liv-0.021
oni-0.021
 fault-0.020
live-0.020
Token\n
Token ID198
Feature activation-1.58e-01

Pos logit contributions
xy+0.018
oni+0.016
liv+0.016
 fault+0.015
ork+0.015
Neg logit contributions
 see-0.009
 wave-0.008
seat-0.008
 waves-0.008
 hand-0.007
Token\n
Token ID198
Feature activation+8.75e-01

Pos logit contributions
xy+0.006
oni+0.006
 fault+0.006
liv+0.006
ork+0.005
Neg logit contributions
 see-0.003
 wave-0.002
seat-0.002
 waves-0.002
 voice-0.002
TokenAt
Token ID2953
Feature activation+4.23e+00

Pos logit contributions
 see+0.015
 wave+0.014
Leave+0.013
 waves+0.012
seat+0.012
Neg logit contributions
xy-0.033
 fault-0.033
oni-0.030
liv-0.028
ork-0.027
Token Grand
Token ID5675
Feature activation+8.50e-01

Pos logit contributions
 see+0.071
 wave+0.071
 breath+0.060
seat+0.059
 waves+0.057
Neg logit contributions
xy-0.164
oni-0.154
liv-0.151
ork-0.149
 fault-0.146
Tokenma
Token ID2611
Feature activation+9.28e-01

Pos logit contributions
 wave+0.022
 see+0.022
 waves+0.021
seat+0.020
 paddle+0.020
Neg logit contributions
xy-0.024
liv-0.022
oni-0.022
ork-0.021
 fault-0.020
Token's
Token ID338
Feature activation+5.24e-01

Pos logit contributions
 wave+0.024
 see+0.023
 waves+0.023
seat+0.023
 reins+0.022
Neg logit contributions
xy-0.025
oni-0.023
liv-0.023
ork-0.022
 fault-0.021
Token house
Token ID2156
Feature activation+4.06e-01

Pos logit contributions
 wave+0.012
 see+0.012
Leave+0.012
 waves+0.011
 paddle+0.011
Neg logit contributions
xy-0.014
ork-0.014
 fault-0.014
oni-0.014
liv-0.013
Token,
Token ID11
Feature activation+9.54e-01

Pos logit contributions
 see+0.006
 wave+0.006
seat+0.005
 waves+0.005
 voice+0.005
Neg logit contributions
xy-0.016
oni-0.015
liv-0.015
 fault-0.014
ork-0.014
Token knows
Token ID4206
Feature activation+9.24e-01

Pos logit contributions
xy+0.007
oni+0.006
 fault+0.006
liv+0.006
ork+0.006
Neg logit contributions
seat-0.006
 wave-0.006
Leave-0.005
 see-0.005
 amaz-0.005
Token how
Token ID703
Feature activation-5.51e-01

Pos logit contributions
 wave+0.018
 see+0.018
 waves+0.017
seat+0.016
 paddle+0.015
Neg logit contributions
xy-0.032
oni-0.030
liv-0.029
ork-0.028
 fault-0.028
Token to
Token ID284
Feature activation+1.36e-01

Pos logit contributions
xy+0.015
oni+0.014
liv+0.014
ork+0.013
 fault+0.013
Neg logit contributions
 wave-0.015
 see-0.014
seat-0.014
 waves-0.014
 reins-0.013
Token get
Token ID651
Feature activation-1.36e-01

Pos logit contributions
 wave+0.002
seat+0.002
 waves+0.002
 see+0.002
 reins+0.002
Neg logit contributions
xy-0.005
oni-0.005
liv-0.005
ork-0.005
 fault-0.004
Token things
Token ID1243
Feature activation+1.30e+00

Pos logit contributions
xy+0.004
oni+0.004
 fault+0.004
liv+0.003
ork+0.003
Neg logit contributions
 wave-0.003
seat-0.003
 see-0.003
 amaz-0.003
 waves-0.003
Token unst
Token ID15014
Feature activation-4.50e+00

Pos logit contributions
 wave+0.029
seat+0.028
 see+0.028
 waves+0.027
 amaz+0.026
Neg logit contributions
xy-0.039
oni-0.037
liv-0.035
 fault-0.034
ork-0.034
Tokenuck
Token ID1347
Feature activation-6.55e-01

Pos logit contributions
liv+0.139
xy+0.136
abella+0.132
oni+0.128
 fault+0.126
Neg logit contributions
 see-0.094
 wave-0.091
 waves-0.088
seat-0.087
Leave-0.082
Token."
Token ID526
Feature activation-1.23e+00

Pos logit contributions
xy+0.024
liv+0.024
oni+0.023
ork+0.022
 fault+0.022
Neg logit contributions
 see-0.011
 wave-0.011
 waves-0.010
seat-0.009
 hand-0.008
Token So
Token ID1406
Feature activation+9.58e-01

Pos logit contributions
xy+0.050
liv+0.045
oni+0.044
live+0.043
 fault+0.042
Neg logit contributions
 see-0.021
seat-0.017
 voice-0.017
 wave-0.016
 hand-0.016
Token,
Token ID11
Feature activation-1.87e-01

Pos logit contributions
 see+0.021
 wave+0.021
 waves+0.019
 voice+0.018
seat+0.018
Neg logit contributions
xy-0.032
liv-0.029
oni-0.029
ork-0.028
 fault-0.027
Token they
Token ID484
Feature activation-4.47e-01

Pos logit contributions
xy+0.006
liv+0.005
 fault+0.005
ork+0.005
oni+0.005
Neg logit contributions
 see-0.005
 wave-0.005
 waves-0.004
seat-0.004
 voice-0.004
Token had
Token ID550
Feature activation-3.47e-01

Pos logit contributions
 wave+0.017
 see+0.017
 waves+0.016
seat+0.015
 paddle+0.014
Neg logit contributions
xy-0.030
oni-0.029
liv-0.028
ork-0.027
 fault-0.027
Token learned
Token ID4499
Feature activation-1.30e+00

Pos logit contributions
xy+0.012
oni+0.011
liv+0.011
ork+0.011
live+0.010
Neg logit contributions
 see-0.007
 wave-0.007
 waves-0.007
seat-0.006
 transported-0.006
Token an
Token ID281
Feature activation-2.48e+00

Pos logit contributions
xy+0.052
oni+0.049
liv+0.048
live+0.048
storm+0.046
Neg logit contributions
 see-0.020
 waves-0.020
 wave-0.019
 voice-0.016
seat-0.015
Token important
Token ID1593
Feature activation-1.98e+00

Pos logit contributions
xy+0.063
liv+0.058
oni+0.057
ork+0.053
live+0.049
Neg logit contributions
 wave-0.072
 see-0.069
 waves-0.068
seat-0.064
 adventurer-0.064
Token lesson
Token ID11483
Feature activation-1.65e+00

Pos logit contributions
oni+0.050
xy+0.049
liv+0.047
live+0.041
ratch+0.040
Neg logit contributions
 wave-0.056
 see-0.054
 voice-0.053
 offer-0.051
 paddle-0.051
Token-
Token ID12
Feature activation-4.40e+00

Pos logit contributions
liv+0.058
oni+0.057
xy+0.057
tr+0.055
live+0.055
Neg logit contributions
 wave-0.025
 see-0.024
 waves-0.022
seat-0.021
 voice-0.020
Tokenwith
Token ID4480
Feature activation-1.90e+00

Pos logit contributions
 fault+0.104
liv+0.104
oni+0.104
tr+0.098
isk+0.097
Neg logit contributions
 see-0.104
 wave-0.102
 of-0.097
iest-0.095
 voice-0.095
Token hard
Token ID1327
Feature activation-2.63e+00

Pos logit contributions
xy+0.067
liv+0.064
oni+0.064
ork+0.060
 fault+0.058
Neg logit contributions
 wave-0.037
 see-0.037
 waves-0.033
 voice-0.031
seat-0.030
Token work
Token ID670
Feature activation-1.82e+00

Pos logit contributions
xy+0.083
oni+0.078
liv+0.078
ork+0.072
 fault+0.071
Neg logit contributions
 wave-0.059
 see-0.057
 waves-0.054
seat-0.053
 paddle-0.050
Token and
Token ID290
Feature activation-1.55e-01

Pos logit contributions
xy+0.067
liv+0.067
oni+0.066
live+0.062
ork+0.062
Neg logit contributions
 wave-0.026
 see-0.025
 waves-0.023
seat-0.021
 voice-0.021
Token determination
Token ID12123
Feature activation-6.01e-01

Pos logit contributions
xy+0.005
oni+0.005
liv+0.005
ork+0.005
 fault+0.005
Neg logit contributions
 wave-0.003
 see-0.003
 waves-0.003
seat-0.003
 paddle-0.003
Token time
Token ID640
Feature activation-1.59e+00

Pos logit contributions
xy+0.066
oni+0.063
liv+0.059
ork+0.056
 fault+0.054
Neg logit contributions
 see-0.058
 wave-0.058
 waves-0.052
seat-0.052
 voice-0.050
Token,
Token ID11
Feature activation-5.62e-01

Pos logit contributions
xy+0.064
oni+0.062
liv+0.060
ork+0.057
 fault+0.057
Neg logit contributions
 see-0.022
 wave-0.021
seat-0.018
 waves-0.017
 voice-0.016
Token not
Token ID407
Feature activation-1.62e+00

Pos logit contributions
xy+0.024
oni+0.023
liv+0.023
ork+0.022
 fault+0.022
Neg logit contributions
 wave-0.006
 see-0.006
seat-0.005
 waves-0.005
 paddle-0.004
Token so
Token ID523
Feature activation+1.18e-02

Pos logit contributions
xy+0.064
liv+0.063
oni+0.060
 virtue+0.057
ork+0.056
Neg logit contributions
 see-0.026
 wave-0.026
 sail-0.021
 waves-0.020
 hop-0.020
Token long
Token ID890
Feature activation-1.63e+00

Pos logit contributions
 see+0.000
 wave+0.000
 waves+0.000
 hops+0.000
seat+0.000
Neg logit contributions
xy-0.000
oni-0.000
liv-0.000
 fault-0.000
ork-0.000
Token ago
Token ID2084
Feature activation-4.30e+00

Pos logit contributions
xy+0.061
oni+0.058
liv+0.057
ork+0.054
 fault+0.053
Neg logit contributions
 wave-0.028
 see-0.027
 waves-0.025
seat-0.024
 paddle-0.022
Token,
Token ID11
Feature activation-5.21e-01

Pos logit contributions
xy+0.151
oni+0.140
liv+0.139
ork+0.130
 fault+0.128
Neg logit contributions
 wave-0.083
 see-0.081
 waves-0.076
seat-0.073
 voice-0.068
Token there
Token ID612
Feature activation-4.01e-02

Pos logit contributions
xy+0.019
oni+0.019
liv+0.018
 fault+0.018
ork+0.017
Neg logit contributions
 see-0.009
 wave-0.009
 waves-0.008
 hops-0.008
seat-0.008
Token was
Token ID373
Feature activation-3.12e-01

Pos logit contributions
xy+0.001
oni+0.001
liv+0.001
ork+0.001
 fault+0.001
Neg logit contributions
 see-0.001
 wave-0.001
 waves-0.001
seat-0.001
 hops-0.001
Token a
Token ID257
Feature activation-2.64e-01

Pos logit contributions
xy+0.015
oni+0.014
liv+0.013
ork+0.013
ess+0.013
Neg logit contributions
 see-0.003
 waves-0.002
 wave-0.002
 hop-0.002
Leave-0.002
Token big
Token ID1263
Feature activation+7.40e-01

Pos logit contributions
xy+0.011
liv+0.011
oni+0.011
 fault+0.010
ork+0.010
Neg logit contributions
 wave-0.003
 see-0.003
 waves-0.002
seat-0.002
 sailor-0.002
Token wanted
Token ID2227
Feature activation-5.10e-01

Pos logit contributions
xy+0.003
oni+0.003
liv+0.003
ork+0.003
her+0.002
Neg logit contributions
seat-0.002
 wave-0.002
 see-0.002
 sle-0.002
 reins-0.002
Token to
Token ID284
Feature activation-3.69e-01

Pos logit contributions
liv+0.019
xy+0.019
oni+0.017
live+0.017
rom+0.016
Neg logit contributions
 see-0.010
 wave-0.009
 waves-0.009
 hand-0.008
 help-0.008
Token keep
Token ID1394
Feature activation-8.77e-01

Pos logit contributions
oni+0.012
xy+0.011
ork+0.011
liv+0.011
live+0.010
Neg logit contributions
seat-0.008
 sle-0.007
 reins-0.007
 wave-0.007
 waves-0.007
Token it
Token ID340
Feature activation+4.18e-01

Pos logit contributions
xy+0.026
oni+0.025
liv+0.025
ork+0.023
 fault+0.023
Neg logit contributions
 wave-0.021
 see-0.021
 waves-0.019
seat-0.019
 paddle-0.018
Token all
Token ID477
Feature activation-4.84e-01

Pos logit contributions
 wave+0.010
 see+0.010
seat+0.010
 waves+0.010
Leave+0.009
Neg logit contributions
xy-0.012
oni-0.011
liv-0.011
ork-0.011
 fault-0.010
Token for
Token ID329
Feature activation-4.27e+00

Pos logit contributions
xy+0.016
oni+0.015
liv+0.015
ork+0.015
abella+0.014
Neg logit contributions
 wave-0.011
 see-0.010
 waves-0.009
seat-0.009
 hops-0.008
Token himself
Token ID2241
Feature activation-1.17e+00

Pos logit contributions
xy+0.155
liv+0.154
oni+0.150
 fault+0.145
tr+0.144
Neg logit contributions
 see-0.079
 wave-0.072
 waves-0.065
 breath-0.063
 voice-0.063
Token.
Token ID13
Feature activation-5.10e-01

Pos logit contributions
xy+0.044
liv+0.044
oni+0.042
ork+0.040
abella+0.039
Neg logit contributions
 wave-0.018
 see-0.018
 waves-0.016
seat-0.016
 hand-0.015
Token He
Token ID679
Feature activation-7.09e-01

Pos logit contributions
xy+0.018
oni+0.016
liv+0.016
 fault+0.015
ork+0.015
Neg logit contributions
 wave-0.010
 see-0.010
 waves-0.010
seat-0.009
 voice-0.009
Token was
Token ID373
Feature activation-1.62e-01

Pos logit contributions
xy+0.029
 fault+0.028
liv+0.028
oni+0.027
ork+0.026
Neg logit contributions
 wave-0.010
 see-0.009
 waves-0.008
 paddle-0.007
 hops-0.007
Token being
Token ID852
Feature activation-4.06e-01

Pos logit contributions
xy+0.005
oni+0.005
liv+0.004
 fault+0.004
her+0.004
Neg logit contributions
 wave-0.004
seat-0.004
 waves-0.004
 see-0.003
 sle-0.003
Token<|endoftext|>
Token ID50256
Feature activation+2.09e+00

Pos logit contributions
 see+0.024
 wave+0.023
seat+0.022
 waves+0.019
 hand+0.019
Neg logit contributions
xy-0.052
oni-0.049
liv-0.048
ork-0.046
 fault-0.045
Token<|endoftext|>
Token ID50256
Feature activation+1.32e+00

Pos logit contributions
 wave+0.038
 see+0.035
 waves+0.033
seat+0.032
 paddle+0.031
Neg logit contributions
xy-0.076
oni-0.072
liv-0.070
ork-0.068
 fault-0.065
Token<|endoftext|>
Token ID50256
Feature activation+8.60e-01

Pos logit contributions
 wave+0.025
 see+0.023
 waves+0.022
seat+0.021
 reins+0.021
Neg logit contributions
xy-0.046
oni-0.044
liv-0.043
ork-0.041
 fault-0.039
Token<|endoftext|>
Token ID50256
Feature activation+6.05e-01

Pos logit contributions
 wave+0.016
 see+0.015
 waves+0.015
seat+0.014
 reins+0.014
Neg logit contributions
xy-0.030
oni-0.028
liv-0.028
ork-0.027
 fault-0.025
Token<|endoftext|>
Token ID50256
Feature activation+4.71e-01

Pos logit contributions
 wave+0.011
 see+0.011
 waves+0.010
seat+0.010
 reins+0.010
Neg logit contributions
xy-0.021
oni-0.020
liv-0.020
ork-0.019
 fault-0.018
TokenL
Token ID43
Feature activation-4.26e+00

Pos logit contributions
 wave+0.009
 see+0.008
 waves+0.008
seat+0.008
 reins+0.007
Neg logit contributions
xy-0.017
oni-0.016
liv-0.016
ork-0.015
 fault-0.014
Tokenila
Token ID10102
Feature activation-2.08e+00

Pos logit contributions
xy+0.124
liv+0.104
 virtue+0.104
oni+0.104
 fault+0.100
Neg logit contributions
 see-0.113
 wave-0.105
 hand-0.103
 waves-0.099
 eye-0.096
Token and
Token ID290
Feature activation-6.71e-01

Pos logit contributions
xy+0.077
oni+0.073
liv+0.072
ork+0.068
 fault+0.068
Neg logit contributions
 wave-0.036
 see-0.035
 waves-0.032
seat-0.031
 voice-0.028
Token Ben
Token ID3932
Feature activation-1.86e+00

Pos logit contributions
xy+0.021
oni+0.020
liv+0.020
ork+0.019
 fault+0.018
Neg logit contributions
 wave-0.015
 see-0.014
 waves-0.014
seat-0.013
 paddle-0.012
Token are
Token ID389
Feature activation-8.65e-01

Pos logit contributions
xy+0.064
oni+0.061
liv+0.059
ork+0.057
 fault+0.056
Neg logit contributions
 wave-0.036
 see-0.034
seat-0.033
 waves-0.033
 reins-0.030
Token friends
Token ID2460
Feature activation+8.90e-01

Pos logit contributions
xy+0.030
liv+0.028
oni+0.028
ork+0.027
 fault+0.025
Neg logit contributions
 see-0.017
 wave-0.017
 waves-0.016
seat-0.015
 hand-0.014
Token me
Token ID502
Feature activation-6.85e-01

Pos logit contributions
xy+0.051
oni+0.049
liv+0.047
ork+0.045
 fault+0.043
Neg logit contributions
 wave-0.040
seat-0.038
 see-0.038
 waves-0.037
 paddle-0.035
Token.
Token ID13
Feature activation-5.08e-01

Pos logit contributions
xy+0.017
ork+0.016
oni+0.016
ess+0.016
her+0.016
Neg logit contributions
Leave-0.018
 amaz-0.017
seat-0.016
Who-0.016
 waves-0.016
Token I
Token ID314
Feature activation-1.00e+00

Pos logit contributions
xy+0.017
liv+0.015
oni+0.015
ork+0.014
ess+0.014
Neg logit contributions
 wave-0.011
 waves-0.010
 see-0.009
seat-0.009
 breath-0.009
Token forgive
Token ID20927
Feature activation-1.73e+00

Pos logit contributions
xy+0.036
oni+0.034
liv+0.033
ork+0.032
 fault+0.031
Neg logit contributions
 wave-0.018
 see-0.018
 waves-0.017
seat-0.016
 paddle-0.015
Token myself
Token ID3589
Feature activation+9.34e-01

Pos logit contributions
xy+0.062
liv+0.061
oni+0.059
ork+0.058
 fault+0.056
Neg logit contributions
 see-0.033
 wave-0.032
 waves-0.029
 voice-0.027
 sailor-0.025
Token for
Token ID329
Feature activation-4.24e+00

Pos logit contributions
 wave+0.020
 see+0.019
 waves+0.019
seat+0.019
Leave+0.017
Neg logit contributions
xy-0.030
oni-0.028
liv-0.027
ork-0.026
 fault-0.026
Token getting
Token ID1972
Feature activation-2.54e-02

Pos logit contributions
xy+0.127
oni+0.118
liv+0.115
ork+0.109
 fault+0.107
Neg logit contributions
 wave-0.100
 see-0.096
seat-0.096
 waves-0.093
 paddle-0.087
Token lost
Token ID2626
Feature activation-1.14e+00

Pos logit contributions
xy+0.001
oni+0.001
her+0.001
ork+0.001
liv+0.001
Neg logit contributions
 wave-0.001
seat-0.001
 amaz-0.001
 see-0.001
 waves-0.001
Token."
Token ID526
Feature activation-1.78e+00

Pos logit contributions
xy+0.041
oni+0.039
liv+0.039
ork+0.037
 fault+0.036
Neg logit contributions
 wave-0.021
 see-0.020
 waves-0.018
seat-0.018
 paddle-0.017
Token\n
Token ID198
Feature activation-7.90e-01

Pos logit contributions
xy+0.069
oni+0.066
liv+0.066
 fault+0.064
 virtue+0.061
Neg logit contributions
 see-0.032
 wave-0.027
 voice-0.026
 waves-0.025
 hand-0.025
Token\n
Token ID198
Feature activation+1.11e-01

Pos logit contributions
xy+0.030
oni+0.030
 fault+0.029
liv+0.029
 virtue+0.027
Neg logit contributions
 see-0.013
 wave-0.011
seat-0.010
 voice-0.009
 waves-0.009
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
liv+0.019
xy+0.018
oni+0.016
live+0.016
ork+0.016
Neg logit contributions
 see-0.012
 wave-0.011
 waves-0.010
 sail-0.009
 help-0.009
Token take
Token ID1011
Feature activation-8.99e-01

Pos logit contributions
 see+0.016
 wave+0.015
 waves+0.012
 voice+0.011
 paddle+0.010
Neg logit contributions
xy-0.044
liv-0.043
oni-0.041
ork-0.039
 fault-0.039
Token all
Token ID477
Feature activation-2.02e-01

Pos logit contributions
xy+0.038
liv+0.037
oni+0.035
live+0.033
ork+0.033
Neg logit contributions
 see-0.013
 wave-0.012
 waves-0.011
 voice-0.010
 hops-0.009
Token the
Token ID262
Feature activation-8.74e-01

Pos logit contributions
xy+0.008
liv+0.008
oni+0.008
abella+0.007
ork+0.007
Neg logit contributions
 see-0.003
 wave-0.003
 waves-0.002
seat-0.002
 hand-0.002
Token things
Token ID1243
Feature activation+1.06e+00

Pos logit contributions
xy+0.030
oni+0.028
liv+0.027
ork+0.026
 fault+0.026
Neg logit contributions
 wave-0.017
 see-0.017
seat-0.016
 waves-0.016
 paddle-0.014
Token for
Token ID329
Feature activation-4.24e+00

Pos logit contributions
 wave+0.022
 see+0.022
 waves+0.020
seat+0.019
 paddle+0.018
Neg logit contributions
xy-0.035
liv-0.034
oni-0.033
ork-0.032
 fault-0.031
Token yourself
Token ID3511
Feature activation-3.41e-01

Pos logit contributions
liv+0.152
xy+0.143
tr+0.139
 fault+0.133
oni+0.132
Neg logit contributions
 see-0.088
 wave-0.078
 sailors-0.070
 pack-0.070
 hand-0.069
Token.
Token ID13
Feature activation-2.53e-01

Pos logit contributions
xy+0.012
oni+0.011
liv+0.011
ork+0.010
 fault+0.010
Neg logit contributions
 wave-0.007
 see-0.007
 waves-0.006
seat-0.006
 paddle-0.006
Token Sharing
Token ID36644
Feature activation-8.19e-01

Pos logit contributions
xy+0.009
liv+0.009
oni+0.009
ork+0.008
 fault+0.008
Neg logit contributions
 see-0.004
seat-0.004
 wave-0.004
 hand-0.004
 waves-0.004
Token is
Token ID318
Feature activation-5.31e-02

Pos logit contributions
xy+0.030
liv+0.030
oni+0.028
ork+0.028
abella+0.026
Neg logit contributions
 wave-0.014
 see-0.014
 waves-0.013
seat-0.012
 hops-0.011
Token caring
Token ID18088
Feature activation-2.97e-01

Pos logit contributions
xy+0.002
liv+0.002
oni+0.002
ork+0.001
 fault+0.001
Neg logit contributions
 see-0.001
 wave-0.001
 waves-0.001
seat-0.001
 voice-0.001
Token stubborn
Token ID24444
Feature activation-5.23e-01

Pos logit contributions
 wave+0.005
 see+0.005
 waves+0.004
seat+0.004
 voice+0.004
Neg logit contributions
xy-0.011
liv-0.011
oni-0.011
ork-0.010
 fault-0.010
Token about
Token ID546
Feature activation-8.47e-01

Pos logit contributions
xy+0.012
oni+0.010
liv+0.010
ork+0.009
 fault+0.009
Neg logit contributions
 wave-0.017
 see-0.017
 waves-0.016
seat-0.016
 voice-0.015
Token learning
Token ID4673
Feature activation-9.96e-01

Pos logit contributions
xy+0.032
liv+0.032
ess+0.029
oni+0.028
ork+0.028
Neg logit contributions
 see-0.015
 waves-0.013
 wave-0.013
 hops-0.012
 rub-0.012
Token new
Token ID649
Feature activation-1.46e+00

Pos logit contributions
xy+0.038
liv+0.036
oni+0.034
live+0.032
ork+0.032
Neg logit contributions
 see-0.017
 wave-0.016
 waves-0.016
seat-0.014
 voice-0.014
Token things
Token ID1243
Feature activation-1.15e-01

Pos logit contributions
xy+0.053
oni+0.051
liv+0.049
ork+0.048
 fault+0.047
Neg logit contributions
 wave-0.025
 see-0.024
seat-0.023
 waves-0.022
 paddle-0.021
Token anymore
Token ID7471
Feature activation-4.23e+00

Pos logit contributions
xy+0.004
liv+0.004
oni+0.004
ork+0.004
abella+0.004
Neg logit contributions
 see-0.002
 wave-0.002
 waves-0.001
 shine-0.001
seat-0.001
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
liv+0.165
xy+0.164
oni+0.156
 fault+0.153
ratch+0.151
Neg logit contributions
 see-0.062
 wave-0.051
 voice-0.042
 hops-0.042
 waves-0.041
Token She
Token ID1375
Feature activation+3.67e-01

Pos logit contributions
xy+0.041
oni+0.039
liv+0.039
ork+0.037
her+0.037
Neg logit contributions
 see-0.017
seat-0.013
 wave-0.013
 waves-0.012
 Are-0.011
Token realized
Token ID6939
Feature activation-2.51e-01

Pos logit contributions
 wave+0.008
 see+0.008
 waves+0.007
 hops+0.006
 trusts+0.006
Neg logit contributions
xy-0.013
liv-0.012
oni-0.011
ork-0.011
 fault-0.011
Token that
Token ID326
Feature activation+2.25e-01

Pos logit contributions
xy+0.010
oni+0.009
liv+0.009
 fault+0.009
ork+0.009
Neg logit contributions
 see-0.004
 wave-0.004
 waves-0.003
 voice-0.003
 sailor-0.003
Token it
Token ID340
Feature activation-9.91e-01

Pos logit contributions
 see+0.004
 wave+0.004
 waves+0.003
 dreams+0.003
 hop+0.003
Neg logit contributions
xy-0.008
liv-0.008
oni-0.008
ork-0.008
ratch-0.008
Token He
Token ID679
Feature activation-3.29e-01

Pos logit contributions
 wave+0.012
 see+0.011
 waves+0.011
seat+0.011
 transported+0.010
Neg logit contributions
xy-0.018
oni-0.017
liv-0.016
ork-0.016
 fault-0.015
Token thinks
Token ID6834
Feature activation+4.57e-01

Pos logit contributions
xy+0.012
oni+0.011
liv+0.011
ork+0.010
 fault+0.010
Neg logit contributions
 wave-0.005
seat-0.005
 see-0.005
 reins-0.005
 sailor-0.005
Token they
Token ID484
Feature activation-5.06e-01

Pos logit contributions
 wave+0.012
 see+0.012
 waves+0.011
seat+0.011
 paddle+0.010
Neg logit contributions
xy-0.013
oni-0.012
liv-0.012
ork-0.011
 fault-0.011
Token are
Token ID389
Feature activation+5.00e-02

Pos logit contributions
xy+0.016
oni+0.015
 fault+0.014
liv+0.014
ork+0.014
Neg logit contributions
seat-0.010
 wave-0.010
 sle-0.009
 waves-0.009
 reins-0.009
Token toys
Token ID14958
Feature activation+1.05e+00

Pos logit contributions
 wave+0.001
 see+0.001
 waves+0.001
seat+0.001
 paddle+0.001
Neg logit contributions
xy-0.002
liv-0.002
oni-0.002
ork-0.002
 fault-0.001
Token for
Token ID329
Feature activation-4.21e+00

Pos logit contributions
 wave+0.021
 see+0.020
 waves+0.019
seat+0.018
 sailor+0.017
Neg logit contributions
xy-0.036
liv-0.035
oni-0.034
ork-0.032
 fault-0.031
Token him
Token ID683
Feature activation-1.72e+00

Pos logit contributions
xy+0.134
liv+0.129
oni+0.126
ork+0.119
 fault+0.117
Neg logit contributions
 see-0.095
 wave-0.093
 waves-0.086
seat-0.083
 paddle-0.080
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
xy+0.063
liv+0.062
oni+0.058
ork+0.056
abella+0.055
Neg logit contributions
 see-0.031
 wave-0.030
 waves-0.027
seat-0.026
 hand-0.024
Token
Token ID198
Feature activation+1.04e-01

Pos logit contributions
 wave+0.005
 see+0.005
 waves+0.005
seat+0.005
 transported+0.004
Neg logit contributions
xy-0.007
oni-0.007
liv-0.007
ork-0.006
 fault-0.006
Token
Token ID198
Feature activation+9.81e-01

Pos logit contributions
 wave+0.002
 see+0.002
 waves+0.002
seat+0.002
 reins+0.002
Neg logit contributions
xy-0.003
oni-0.003
liv-0.003
ork-0.003
 fault-0.003
TokenHe
Token ID1544
Feature activation-2.77e-01

Pos logit contributions
 wave+0.025
 see+0.024
 waves+0.023
seat+0.022
 amaz+0.022
Neg logit contributions
xy-0.029
liv-0.027
oni-0.026
ork-0.025
live-0.024
Token they
Token ID484
Feature activation-1.09e+00

Pos logit contributions
 see+0.021
 wave+0.020
 waves+0.019
 sail+0.018
 paddle+0.017
Neg logit contributions
xy-0.033
liv-0.032
oni-0.032
live-0.029
ork-0.029
Token didn
Token ID1422
Feature activation-1.39e+00

Pos logit contributions
xy+0.038
oni+0.036
liv+0.036
ork+0.034
 fault+0.033
Neg logit contributions
 wave-0.021
 see-0.021
 waves-0.019
seat-0.018
 paddle-0.017
Token't
Token ID470
Feature activation-9.59e-01

Pos logit contributions
xy+0.037
liv+0.035
oni+0.035
 fault+0.033
abella+0.031
Neg logit contributions
 see-0.038
 wave-0.036
 sail-0.033
 waves-0.033
seat-0.033
Token get
Token ID651
Feature activation-1.02e+00

Pos logit contributions
xy+0.032
oni+0.031
ork+0.030
 fault+0.029
ratch+0.029
Neg logit contributions
 amaz-0.017
seat-0.017
 waves-0.017
 sle-0.016
 reins-0.016
Token it
Token ID340
Feature activation-8.39e-01

Pos logit contributions
xy+0.030
oni+0.029
 fault+0.027
ork+0.027
liv+0.026
Neg logit contributions
 wave-0.023
seat-0.022
 see-0.022
 waves-0.021
 amaz-0.021
Token unst
Token ID15014
Feature activation-4.20e+00

Pos logit contributions
xy+0.029
liv+0.028
oni+0.028
ork+0.026
 fault+0.025
Neg logit contributions
 wave-0.016
 see-0.016
seat-0.014
 waves-0.014
 paddle-0.013
Tokenuck
Token ID1347
Feature activation-6.57e-01

Pos logit contributions
xy+0.121
liv+0.118
oni+0.115
abella+0.108
 fault+0.106
Neg logit contributions
 see-0.099
 wave-0.098
seat-0.093
 waves-0.093
 paddle-0.087
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
xy+0.025
liv+0.025
oni+0.023
ork+0.022
live+0.022
Neg logit contributions
 see-0.011
 wave-0.011
 waves-0.010
seat-0.009
 hand-0.009
Token
Token ID198
Feature activation-7.55e-01

Pos logit contributions
xy+0.054
oni+0.053
liv+0.050
 fault+0.049
ork+0.048
Neg logit contributions
 see-0.014
 Curious-0.010
 Are-0.010
seat-0.010
 voice-0.009
Token
Token ID198
Feature activation+1.86e-01

Pos logit contributions
xy+0.029
oni+0.029
 fault+0.028
liv+0.028
storm+0.026
Neg logit contributions
 see-0.011
seat-0.008
 wave-0.008
 voice-0.007
 hand-0.007
TokenJust
Token ID5703
Feature activation+2.31e+00

Pos logit contributions
 see+0.003
 wave+0.003
Leave+0.003
seat+0.003
 voice+0.003
Neg logit contributions
oni-0.007
xy-0.006
 fault-0.006
liv-0.006
ork-0.006
Token want
Token ID765
Feature activation-7.01e-01

Pos logit contributions
xy+0.002
oni+0.002
liv+0.002
ork+0.002
 fault+0.002
Neg logit contributions
 wave-0.001
 see-0.001
 waves-0.001
seat-0.001
 paddle-0.001
Token us
Token ID514
Feature activation-6.22e-01

Pos logit contributions
xy+0.024
liv+0.023
oni+0.022
ork+0.021
live+0.020
Neg logit contributions
 see-0.015
 wave-0.015
 waves-0.014
 voice-0.013
 sailor-0.012
Token to
Token ID284
Feature activation+6.02e-01

Pos logit contributions
xy+0.017
oni+0.016
liv+0.015
ork+0.015
 fault+0.015
Neg logit contributions
 wave-0.016
 see-0.016
seat-0.015
 waves-0.015
 sle-0.014
Token be
Token ID307
Feature activation+3.59e-01

Pos logit contributions
 wave+0.008
 see+0.008
 waves+0.007
seat+0.007
 paddle+0.006
Neg logit contributions
xy-0.025
oni-0.023
liv-0.023
ork-0.022
 fault-0.022
Token friends
Token ID2460
Feature activation+1.10e+00

Pos logit contributions
 wave+0.007
 see+0.006
 waves+0.006
seat+0.006
 paddle+0.005
Neg logit contributions
xy-0.013
oni-0.012
liv-0.012
ork-0.011
 fault-0.011
Token for
Token ID329
Feature activation-4.18e+00

Pos logit contributions
 wave+0.019
 see+0.019
 waves+0.016
seat+0.016
 sailor+0.015
Neg logit contributions
xy-0.041
liv-0.040
oni-0.039
ork-0.037
 fault-0.036
Token a
Token ID257
Feature activation-1.27e+00

Pos logit contributions
xy+0.142
oni+0.135
liv+0.134
ork+0.127
 fault+0.124
Neg logit contributions
 see-0.087
 wave-0.086
 waves-0.079
seat-0.073
 voice-0.072
Token long
Token ID890
Feature activation-7.87e-01

Pos logit contributions
xy+0.049
liv+0.049
oni+0.048
 fault+0.043
her+0.043
Neg logit contributions
 wave-0.022
 see-0.021
 sail-0.018
 sailor-0.018
 paddle-0.017
Token time
Token ID640
Feature activation-8.56e-02

Pos logit contributions
xy+0.023
oni+0.021
liv+0.021
ork+0.019
 fault+0.019
Neg logit contributions
 wave-0.020
 see-0.020
 waves-0.019
seat-0.018
 voice-0.018
Token."
Token ID526
Feature activation-1.95e+00

Pos logit contributions
xy+0.003
oni+0.003
liv+0.003
ork+0.003
 fault+0.003
Neg logit contributions
 wave-0.002
 see-0.002
 waves-0.001
seat-0.001
 paddle-0.001
Token<|endoftext|>
Token ID50256
Feature activation+4.99e-01

Pos logit contributions
xy+0.077
liv+0.072
oni+0.071
 fault+0.070
ork+0.069
Neg logit contributions
 see-0.035
 wave-0.029
 waves-0.027
 Curious-0.026
 voice-0.025
Token food
Token ID2057
Feature activation-1.46e-01

Pos logit contributions
oni+0.029
xy+0.029
 fault+0.028
ork+0.027
ess+0.025
Neg logit contributions
 wave-0.018
 see-0.018
seat-0.018
Leave-0.017
 waves-0.017
Token and
Token ID290
Feature activation-1.90e-01

Pos logit contributions
xy+0.005
liv+0.005
oni+0.005
ork+0.005
abella+0.005
Neg logit contributions
 wave-0.003
 see-0.003
 waves-0.002
 sailor-0.002
seat-0.002
Token took
Token ID1718
Feature activation-9.46e-01

Pos logit contributions
xy+0.006
oni+0.006
liv+0.006
ork+0.006
 fault+0.006
Neg logit contributions
 wave-0.004
 see-0.004
seat-0.003
 waves-0.003
 reins-0.003
Token it
Token ID340
Feature activation-2.98e-01

Pos logit contributions
xy+0.034
liv+0.032
oni+0.031
 fault+0.029
ork+0.029
Neg logit contributions
 see-0.019
 wave-0.019
 waves-0.017
seat-0.016
 voice-0.015
Token all
Token ID477
Feature activation-9.39e-01

Pos logit contributions
liv+0.011
xy+0.011
oni+0.011
abella+0.010
live+0.010
Neg logit contributions
 see-0.005
 wave-0.005
 breath-0.004
 hand-0.004
 sail-0.004
Token for
Token ID329
Feature activation-4.17e+00

Pos logit contributions
xy+0.034
oni+0.033
liv+0.033
ork+0.032
abella+0.032
Neg logit contributions
 wave-0.017
 see-0.016
seat-0.014
 waves-0.014
 welcome-0.014
Token himself
Token ID2241
Feature activation-1.48e+00

Pos logit contributions
xy+0.176
liv+0.173
oni+0.166
 fault+0.163
tr+0.162
Neg logit contributions
 see-0.055
 wave-0.045
 waves-0.041
 breath-0.037
 voice-0.037
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
xy+0.056
liv+0.056
oni+0.053
ork+0.049
abella+0.049
Neg logit contributions
 wave-0.023
 see-0.023
 waves-0.020
seat-0.020
 hand-0.018
Token
Token ID220
Feature activation-1.26e+00

Pos logit contributions
xy+0.042
oni+0.041
liv+0.041
ork+0.039
 fault+0.038
Neg logit contributions
 see-0.015
seat-0.013
 wave-0.012
 waves-0.011
 hand-0.010
Token
Token ID198
Feature activation-7.62e-01

Pos logit contributions
xy+0.047
oni+0.045
liv+0.045
ork+0.043
 fault+0.042
Neg logit contributions
 see-0.021
seat-0.018
 wave-0.017
 waves-0.015
 hand-0.015
Token
Token ID198
Feature activation-1.45e-01

Pos logit contributions
xy+0.028
oni+0.028
liv+0.027
 fault+0.026
ork+0.025
Neg logit contributions
 see-0.013
 wave-0.011
seat-0.011
 waves-0.010
 voice-0.010
Token it
Token ID340
Feature activation+7.65e-02

Pos logit contributions
xy+0.038
liv+0.037
oni+0.035
 fault+0.034
ork+0.034
Neg logit contributions
 see-0.023
 waves-0.022
 wave-0.022
seat-0.019
 hand-0.019
Token lots
Token ID6041
Feature activation-1.15e+00

Pos logit contributions
 wave+0.002
 see+0.002
 waves+0.002
seat+0.001
 reins+0.001
Neg logit contributions
xy-0.003
liv-0.002
oni-0.002
ork-0.002
 fault-0.002
Token of
Token ID286
Feature activation-1.22e+00

Pos logit contributions
xy+0.045
oni+0.044
liv+0.042
ork+0.042
 fault+0.040
Neg logit contributions
 wave-0.016
 see-0.016
seat-0.014
 waves-0.014
 paddle-0.013
Token sugar
Token ID7543
Feature activation-3.01e-01

Pos logit contributions
xy+0.040
oni+0.038
liv+0.037
ork+0.036
 fault+0.035
Neg logit contributions
 wave-0.025
 see-0.024
seat-0.023
 waves-0.022
 transported-0.021
Token cubes
Token ID34896
Feature activation-1.05e-01

Pos logit contributions
xy+0.011
liv+0.011
ork+0.010
abella+0.010
oni+0.010
Neg logit contributions
 wave-0.005
 waves-0.005
 see-0.005
seat-0.004
 sail-0.004
Token for
Token ID329
Feature activation-4.16e+00

Pos logit contributions
xy+0.004
liv+0.004
ork+0.003
oni+0.003
live+0.003
Neg logit contributions
 wave-0.002
 see-0.002
 waves-0.002
seat-0.002
 sail-0.002
Token treats
Token ID18432
Feature activation-5.86e-01

Pos logit contributions
xy+0.154
liv+0.151
 fault+0.142
oni+0.142
ork+0.140
Neg logit contributions
 see-0.072
 wave-0.072
 waves-0.065
 sail-0.060
 breath-0.059
Token.
Token ID13
Feature activation-9.00e-01

Pos logit contributions
liv+0.023
xy+0.023
ork+0.021
live+0.020
oni+0.020
Neg logit contributions
 wave-0.008
 see-0.008
 waves-0.007
seat-0.006
 sail-0.006
Token He
Token ID679
Feature activation-1.91e-01

Pos logit contributions
xy+0.032
liv+0.031
oni+0.030
ork+0.029
 fault+0.029
Neg logit contributions
 see-0.016
 wave-0.016
seat-0.015
 waves-0.015
 paddle-0.013
Token worked
Token ID3111
Feature activation-1.22e+00

Pos logit contributions
xy+0.007
liv+0.007
oni+0.006
ork+0.006
 fault+0.006
Neg logit contributions
 wave-0.004
 see-0.003
 waves-0.003
seat-0.003
 paddle-0.003
Token very
Token ID845
Feature activation-1.60e+00

Pos logit contributions
xy+0.040
liv+0.039
storm+0.035
oni+0.035
tr+0.035
Neg logit contributions
 see-0.025
 wave-0.024
 breath-0.023
 waves-0.023
seat-0.021
Token will
Token ID481
Feature activation-1.00e+00

Pos logit contributions
xy+0.028
oni+0.026
liv+0.026
ork+0.025
 fault+0.024
Neg logit contributions
 wave-0.011
 see-0.011
 waves-0.009
seat-0.009
 paddle-0.009
Token trade
Token ID3292
Feature activation-7.70e-01

Pos logit contributions
xy+0.035
oni+0.034
ork+0.033
liv+0.032
her+0.032
Neg logit contributions
seat-0.017
 wave-0.016
 waves-0.015
 reins-0.015
 sle-0.015
Token you
Token ID345
Feature activation-3.95e-01

Pos logit contributions
xy+0.027
liv+0.026
oni+0.025
ork+0.024
 fault+0.024
Neg logit contributions
 wave-0.015
 see-0.015
 waves-0.014
seat-0.013
 voice-0.012
Token my
Token ID616
Feature activation-2.18e-01

Pos logit contributions
xy+0.014
liv+0.014
oni+0.013
ork+0.013
 fault+0.013
Neg logit contributions
 see-0.007
 wave-0.007
 waves-0.007
seat-0.006
 reins-0.006
Token car
Token ID1097
Feature activation-2.24e-01

Pos logit contributions
xy+0.006
oni+0.006
liv+0.006
ork+0.006
 fault+0.006
Neg logit contributions
 wave-0.005
 see-0.005
seat-0.005
 waves-0.005
Leave-0.005
Token for
Token ID329
Feature activation-4.16e+00

Pos logit contributions
xy+0.008
liv+0.008
oni+0.007
ork+0.007
 fault+0.007
Neg logit contributions
 wave-0.004
 see-0.004
 waves-0.004
seat-0.004
 voice-0.003
Token some
Token ID617
Feature activation-9.61e-01

Pos logit contributions
xy+0.145
oni+0.135
liv+0.135
ork+0.128
 fault+0.127
Neg logit contributions
 see-0.082
 wave-0.081
 waves-0.074
seat-0.071
 voice-0.067
Token of
Token ID286
Feature activation-2.26e-01

Pos logit contributions
xy+0.022
oni+0.021
ork+0.021
 fault+0.019
her+0.019
Neg logit contributions
 wave-0.026
 see-0.026
seat-0.026
 amaz-0.025
 transported-0.025
Token your
Token ID534
Feature activation-2.93e-01

Pos logit contributions
xy+0.009
liv+0.009
oni+0.008
ork+0.008
 fault+0.008
Neg logit contributions
 see-0.004
 wave-0.004
 waves-0.003
 hand-0.003
 voice-0.003
Token flowers
Token ID12734
Feature activation+1.72e+00

Pos logit contributions
xy+0.008
ork+0.008
oni+0.008
 fault+0.008
liv+0.007
Neg logit contributions
 wave-0.007
 see-0.007
seat-0.007
 waves-0.007
 reins-0.006
Token."
Token ID526
Feature activation-2.20e+00

Pos logit contributions
 wave+0.032
 see+0.032
 waves+0.028
seat+0.028
 paddle+0.025
Neg logit contributions
xy-0.061
liv-0.058
oni-0.058
ork-0.055
 fault-0.053
Token Lily
Token ID20037
Feature activation-2.80e-01

Pos logit contributions
xy+0.010
liv+0.009
oni+0.009
 fault+0.009
live+0.008
Neg logit contributions
 see-0.007
 wave-0.006
 voice-0.006
 waves-0.006
 breath-0.005
Token that
Token ID326
Feature activation-3.75e-01

Pos logit contributions
xy+0.010
liv+0.010
oni+0.009
rom+0.008
icy+0.008
Neg logit contributions
 see-0.006
 wave-0.005
 voice-0.004
 hand-0.004
 waves-0.004
Token stones
Token ID14966
Feature activation+5.52e-01

Pos logit contributions
liv+0.014
xy+0.014
oni+0.013
ork+0.013
ess+0.012
Neg logit contributions
 see-0.007
 wave-0.007
 waves-0.006
 hops-0.006
 rock-0.005
Token are
Token ID389
Feature activation-1.36e+00

Pos logit contributions
 wave+0.010
 see+0.010
 waves+0.009
seat+0.009
 paddle+0.008
Neg logit contributions
xy-0.019
oni-0.018
liv-0.018
ork-0.017
 fault-0.017
Token not
Token ID407
Feature activation-1.26e+00

Pos logit contributions
xy+0.040
liv+0.038
oni+0.035
ork+0.033
abella+0.032
Neg logit contributions
 see-0.035
 wave-0.035
 waves-0.032
 paddle-0.030
 hops-0.030
Token for
Token ID329
Feature activation-4.16e+00

Pos logit contributions
xy+0.040
liv+0.038
oni+0.036
ork+0.035
abella+0.033
Neg logit contributions
 wave-0.029
 see-0.029
 waves-0.026
seat-0.025
 paddle-0.025
Token eating
Token ID6600
Feature activation-9.06e-01

Pos logit contributions
xy+0.140
liv+0.133
oni+0.130
ork+0.122
 fault+0.121
Neg logit contributions
 see-0.087
 wave-0.086
 waves-0.079
seat-0.074
 voice-0.072
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
liv+0.036
xy+0.035
oni+0.032
live+0.032
abella+0.031
Neg logit contributions
 see-0.015
 wave-0.014
 waves-0.014
 sail-0.012
 rock-0.012
Token they
Token ID484
Feature activation+2.99e-01

Pos logit contributions
xy+0.037
liv+0.036
oni+0.034
ork+0.034
live+0.032
Neg logit contributions
 see-0.023
 wave-0.022
 waves-0.021
 hops-0.018
seat-0.018
Token can
Token ID460
Feature activation-5.08e-01

Pos logit contributions
 wave+0.006
 see+0.006
 waves+0.006
seat+0.006
 paddle+0.005
Neg logit contributions
xy-0.010
liv-0.009
oni-0.009
ork-0.009
 fault-0.008
Token hurt
Token ID5938
Feature activation-4.74e-01

Pos logit contributions
xy+0.019
liv+0.018
oni+0.017
 fault+0.017
abella+0.016
Neg logit contributions
 see-0.010
 wave-0.010
 paddle-0.008
 sail-0.008
 waves-0.008
Token license
Token ID5964
Feature activation-4.16e-01

Pos logit contributions
xy+0.007
liv+0.007
oni+0.007
abella+0.007
ork+0.006
Neg logit contributions
 wave-0.003
 see-0.003
 waves-0.003
 hand-0.003
 voice-0.003
Token.
Token ID13
Feature activation-4.19e-01

Pos logit contributions
xy+0.016
liv+0.015
oni+0.014
ork+0.014
abella+0.014
Neg logit contributions
 wave-0.007
 see-0.007
 waves-0.006
seat-0.006
 voice-0.006
Token I
Token ID314
Feature activation-5.70e-01

Pos logit contributions
xy+0.015
oni+0.014
liv+0.014
ork+0.014
 fault+0.013
Neg logit contributions
 wave-0.007
 see-0.007
 waves-0.007
seat-0.007
 paddle-0.006
Token got
Token ID1392
Feature activation+4.62e-01

Pos logit contributions
xy+0.021
oni+0.020
liv+0.020
ork+0.019
 fault+0.018
Neg logit contributions
 wave-0.010
 see-0.009
seat-0.009
 waves-0.009
 reins-0.008
Token it
Token ID340
Feature activation+3.38e-02

Pos logit contributions
 wave+0.010
 see+0.010
 waves+0.009
seat+0.009
 paddle+0.008
Neg logit contributions
xy-0.015
liv-0.014
oni-0.014
ork-0.014
 fault-0.013
Token for
Token ID329
Feature activation-4.16e+00

Pos logit contributions
 wave+0.001
 see+0.001
 waves+0.001
seat+0.001
 sail+0.001
Neg logit contributions
xy-0.001
liv-0.001
oni-0.001
ork-0.001
abella-0.001
Token being
Token ID852
Feature activation-1.39e+00

Pos logit contributions
xy+0.151
liv+0.143
oni+0.142
ork+0.135
 fault+0.134
Neg logit contributions
 see-0.077
 wave-0.074
 waves-0.068
seat-0.063
 voice-0.062
Token a
Token ID257
Feature activation-1.70e+00

Pos logit contributions
xy+0.053
liv+0.050
oni+0.048
ork+0.048
 fault+0.046
Neg logit contributions
 see-0.024
 wave-0.024
 waves-0.022
 voice-0.020
seat-0.019
Token good
Token ID922
Feature activation-1.08e+00

Pos logit contributions
xy+0.058
liv+0.055
oni+0.054
ork+0.050
 fault+0.049
Neg logit contributions
 wave-0.036
 see-0.035
 waves-0.032
seat-0.030
 sailor-0.030
Token cat
Token ID3797
Feature activation-8.56e-01

Pos logit contributions
xy+0.040
liv+0.038
oni+0.038
ork+0.035
 fault+0.034
Neg logit contributions
 wave-0.019
 see-0.019
 waves-0.017
seat-0.017
 paddle-0.015
Token."
Token ID526
Feature activation-2.26e+00

Pos logit contributions
xy+0.033
liv+0.032
oni+0.031
ork+0.030
 fault+0.028
Neg logit contributions
 wave-0.013
 see-0.013
 waves-0.012
seat-0.011
 sailor-0.010
Token need
Token ID761
Feature activation-6.69e-01

Pos logit contributions
xy+0.017
oni+0.015
ork+0.015
 fault+0.015
liv+0.014
Neg logit contributions
seat-0.010
 waves-0.010
 reins-0.010
 wave-0.009
 sle-0.009
Token to
Token ID284
Feature activation+2.28e-01

Pos logit contributions
xy+0.020
oni+0.019
ork+0.018
 fault+0.017
liv+0.017
Neg logit contributions
 wave-0.015
seat-0.015
 see-0.014
 waves-0.014
 amaz-0.013
Token cover
Token ID3002
Feature activation-3.37e-01

Pos logit contributions
seat+0.004
 wave+0.003
 waves+0.003
Leave+0.003
 reins+0.003
Neg logit contributions
xy-0.008
oni-0.008
ork-0.008
her-0.007
liv-0.007
Token my
Token ID616
Feature activation+4.98e-01

Pos logit contributions
xy+0.014
liv+0.014
 fault+0.014
storm+0.013
ork+0.013
Neg logit contributions
 see-0.004
 waves-0.004
 wave-0.003
 hand-0.003
 steer-0.003
Token nuts
Token ID14380
Feature activation+8.12e-01

Pos logit contributions
 wave+0.009
 see+0.008
seat+0.008
Leave+0.008
 waves+0.008
Neg logit contributions
xy-0.017
oni-0.017
 fault-0.016
ork-0.016
liv-0.015
Token for
Token ID329
Feature activation-4.16e+00

Pos logit contributions
 wave+0.015
 see+0.015
 waves+0.013
seat+0.013
 paddle+0.012
Neg logit contributions
xy-0.029
liv-0.028
oni-0.027
ork-0.026
 fault-0.026
Token the
Token ID262
Feature activation-2.07e+00

Pos logit contributions
xy+0.132
oni+0.125
liv+0.122
ork+0.116
 fault+0.113
Neg logit contributions
 wave-0.092
 see-0.089
seat-0.084
 waves-0.084
 paddle-0.078
Token winter
Token ID7374
Feature activation-1.01e+00

Pos logit contributions
xy+0.075
liv+0.071
oni+0.071
ork+0.067
 fault+0.065
Neg logit contributions
 wave-0.037
 see-0.036
 waves-0.034
seat-0.033
 paddle-0.030
Token."
Token ID526
Feature activation-1.37e+00

Pos logit contributions
xy+0.035
oni+0.033
liv+0.033
ork+0.031
 fault+0.030
Neg logit contributions
 wave-0.020
 see-0.019
 waves-0.018
seat-0.018
 paddle-0.016
Token Tom
Token ID4186
Feature activation+2.44e-02

Pos logit contributions
xy+0.034
liv+0.030
oni+0.029
 fault+0.028
ork+0.027
Neg logit contributions
 see-0.044
 wave-0.042
 waves-0.040
seat-0.038
 voice-0.037
Token smiled
Token ID13541
Feature activation+1.37e+00

Pos logit contributions
 wave+0.000
 see+0.000
 waves+0.000
 voice+0.000
 trusts+0.000
Neg logit contributions
xy-0.001
liv-0.001
 fault-0.001
oni-0.001
ork-0.001
Token Bob
Token ID5811
Feature activation-3.69e-01

Pos logit contributions
 see+0.015
 waves+0.013
 wave+0.013
seat+0.013
 Curious+0.012
Neg logit contributions
xy-0.025
liv-0.023
oni-0.022
 fault-0.022
live-0.021
Token wanted
Token ID2227
Feature activation-8.27e-01

Pos logit contributions
xy+0.012
oni+0.012
liv+0.012
ork+0.011
 fault+0.011
Neg logit contributions
 wave-0.007
 see-0.007
 waves-0.007
seat-0.007
 reins-0.006
Token to
Token ID284
Feature activation-3.38e-01

Pos logit contributions
xy+0.027
liv+0.026
oni+0.025
ork+0.023
 fault+0.023
Neg logit contributions
 see-0.019
 wave-0.018
 waves-0.017
seat-0.016
 hand-0.016
Token take
Token ID1011
Feature activation-1.30e+00

Pos logit contributions
ork+0.008
 Tim+0.007
xy+0.007
her+0.007
oni+0.007
Neg logit contributions
Leave-0.008
 reins-0.008
 sle-0.008
 trusts-0.008
 importantly-0.007
Token it
Token ID340
Feature activation-3.68e-01

Pos logit contributions
xy+0.042
oni+0.040
liv+0.039
ork+0.038
 fault+0.037
Neg logit contributions
 wave-0.028
 see-0.027
seat-0.026
 waves-0.025
 reins-0.023
Token for
Token ID329
Feature activation-4.15e+00

Pos logit contributions
xy+0.011
oni+0.010
liv+0.010
ork+0.010
 fault+0.010
Neg logit contributions
 wave-0.009
 see-0.008
 waves-0.008
seat-0.008
 paddle-0.007
Token himself
Token ID2241
Feature activation-1.23e+00

Pos logit contributions
xy+0.157
oni+0.149
liv+0.145
 fault+0.139
ork+0.138
Neg logit contributions
 see-0.073
 wave-0.070
 waves-0.063
 voice-0.058
seat-0.058
Token,
Token ID11
Feature activation-5.67e-01

Pos logit contributions
xy+0.048
liv+0.046
oni+0.045
ork+0.042
abella+0.041
Neg logit contributions
 wave-0.019
 see-0.019
seat-0.016
 waves-0.016
 hand-0.015
Token but
Token ID475
Feature activation+8.74e-01

Pos logit contributions
xy+0.015
oni+0.015
liv+0.015
 fault+0.014
her+0.013
Neg logit contributions
seat-0.013
 wave-0.013
 waves-0.012
 sle-0.012
 see-0.012
Token Joe
Token ID5689
Feature activation-1.20e-01

Pos logit contributions
 wave+0.013
 see+0.013
 waves+0.011
seat+0.010
 paddle+0.010
Neg logit contributions
xy-0.036
oni-0.034
ork-0.033
liv-0.032
 fault-0.032
Token wanted
Token ID2227
Feature activation-9.32e-01

Pos logit contributions
xy+0.004
oni+0.004
liv+0.004
 fault+0.004
ork+0.004
Neg logit contributions
 wave-0.002
 see-0.002
 waves-0.002
 trusts-0.002
 voice-0.002
Token.
Token ID13
Feature activation-8.50e-01

Pos logit contributions
xy+0.067
liv+0.064
oni+0.064
abella+0.059
ork+0.059
Neg logit contributions
 wave-0.032
 see-0.031
seat-0.028
 waves-0.026
 voice-0.025
Token The
Token ID383
Feature activation+2.19e-01

Pos logit contributions
xy+0.022
oni+0.020
ork+0.019
liv+0.018
her+0.018
Neg logit contributions
 wave-0.023
 see-0.021
 waves-0.021
seat-0.020
 sailor-0.020
Token squirrel
Token ID33039
Feature activation-1.53e+00

Pos logit contributions
 see+0.003
 wave+0.003
seat+0.003
 waves+0.003
 transported+0.003
Neg logit contributions
xy-0.008
oni-0.008
liv-0.007
ork-0.007
 fault-0.007
Token told
Token ID1297
Feature activation-1.04e+00

Pos logit contributions
xy+0.055
oni+0.053
liv+0.052
 fault+0.051
ork+0.049
Neg logit contributions
 wave-0.028
 see-0.027
 waves-0.024
seat-0.024
 hops-0.022
Token Bun
Token ID28515
Feature activation-2.80e+00

Pos logit contributions
xy+0.027
oni+0.025
liv+0.024
ork+0.023
 fault+0.022
Neg logit contributions
 wave-0.030
 see-0.029
 waves-0.028
seat-0.028
 paddle-0.026
Token-
Token ID12
Feature activation-4.15e+00

Pos logit contributions
xy+0.076
oni+0.073
liv+0.071
 fault+0.066
ork+0.064
Neg logit contributions
 wave-0.072
 see-0.072
 waves-0.067
seat-0.066
 voice-0.064
TokenB
Token ID33
Feature activation-3.92e+00

Pos logit contributions
 fault+0.120
oni+0.120
xy+0.119
liv+0.118
arre+0.110
Neg logit contributions
 see-0.088
 wave-0.080
 waves-0.079
 rock-0.077
 reins-0.076
Tokenun
Token ID403
Feature activation-1.83e+00

Pos logit contributions
xy+0.109
oni+0.105
liv+0.101
ork+0.094
 fault+0.091
Neg logit contributions
 wave-0.098
 see-0.096
seat-0.093
 waves-0.090
 reins-0.088
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
liv+0.062
xy+0.060
oni+0.059
tr+0.058
 fault+0.057
Neg logit contributions
 wave-0.033
 see-0.031
 voice-0.028
 rock-0.028
 waves-0.028
Token climb
Token ID12080
Feature activation-4.63e-01

Pos logit contributions
xy+0.045
oni+0.043
liv+0.042
ork+0.040
 fault+0.040
Neg logit contributions
 wave-0.013
 see-0.013
 waves-0.010
 paddle-0.009
 voice-0.009
Token up
Token ID510
Feature activation-1.16e+00

Pos logit contributions
liv+0.019
xy+0.019
oni+0.018
live+0.017
ork+0.017
Neg logit contributions
 see-0.006
 wave-0.006
 waves-0.006
seat-0.005
 rock-0.005
Token He
Token ID679
Feature activation-8.02e-01

Pos logit contributions
xy+0.003
oni+0.003
liv+0.002
ork+0.002
 fault+0.002
Neg logit contributions
 wave-0.002
 see-0.002
seat-0.002
 waves-0.002
 sailor-0.002
Token wanted
Token ID2227
Feature activation-7.64e-01

Pos logit contributions
xy+0.026
oni+0.025
liv+0.025
ork+0.024
 fault+0.022
Neg logit contributions
 wave-0.017
 see-0.016
seat-0.016
 waves-0.015
 reins-0.014
Token to
Token ID284
Feature activation-5.74e-01

Pos logit contributions
xy+0.027
liv+0.026
oni+0.024
ork+0.023
 fault+0.022
Neg logit contributions
 see-0.017
 wave-0.015
 waves-0.015
 hand-0.013
seat-0.013
Token save
Token ID3613
Feature activation-1.90e+00

Pos logit contributions
oni+0.018
xy+0.017
ork+0.017
liv+0.017
live+0.016
Neg logit contributions
seat-0.011
 sle-0.010
 waves-0.010
 reins-0.010
 transported-0.010
Token it
Token ID340
Feature activation-9.37e-01

Pos logit contributions
xy+0.077
liv+0.075
tr+0.068
oni+0.068
 fault+0.068
Neg logit contributions
 see-0.032
 wave-0.028
 waves-0.026
 steer-0.023
 voice-0.023
Token for
Token ID329
Feature activation-4.14e+00

Pos logit contributions
xy+0.033
liv+0.032
oni+0.030
abella+0.029
ork+0.029
Neg logit contributions
 see-0.019
 wave-0.018
 waves-0.016
seat-0.015
 sail-0.015
Token later
Token ID1568
Feature activation-2.06e+00

Pos logit contributions
xy+0.148
liv+0.140
oni+0.139
 fault+0.131
ork+0.130
Neg logit contributions
 see-0.079
 wave-0.076
 waves-0.068
 voice-0.064
 paddle-0.064
Token.
Token ID13
Feature activation-7.25e-01

Pos logit contributions
xy+0.089
liv+0.087
oni+0.081
live+0.079
ork+0.079
Neg logit contributions
 see-0.026
 wave-0.023
 waves-0.018
 voice-0.018
 hand-0.017
Token So
Token ID1406
Feature activation-1.87e-01

Pos logit contributions
xy+0.024
oni+0.023
liv+0.022
ork+0.021
 fault+0.021
Neg logit contributions
 wave-0.015
 see-0.015
 waves-0.014
seat-0.014
 voice-0.013
Token,
Token ID11
Feature activation-1.05e+00

Pos logit contributions
xy+0.006
oni+0.006
liv+0.006
ork+0.005
live+0.005
Neg logit contributions
 wave-0.004
seat-0.004
 waves-0.004
 see-0.004
 reins-0.003
Token he
Token ID339
Feature activation-1.75e+00

Pos logit contributions
xy+0.031
oni+0.030
liv+0.030
ork+0.028
live+0.027
Neg logit contributions
 wave-0.024
seat-0.023
 see-0.023
 waves-0.023
 amaz-0.022
Token.
Token ID13
Feature activation+5.28e-01

Pos logit contributions
 see+0.006
 wave+0.006
 waves+0.006
seat+0.005
 hand+0.005
Neg logit contributions
xy-0.013
liv-0.013
ork-0.012
oni-0.012
 fault-0.012
Token We
Token ID775
Feature activation-4.97e-02

Pos logit contributions
 see+0.011
 wave+0.010
seat+0.010
 waves+0.010
 voice+0.009
Neg logit contributions
xy-0.018
oni-0.017
liv-0.017
ork-0.016
 fault-0.015
Token can
Token ID460
Feature activation-1.14e-01

Pos logit contributions
xy+0.002
liv+0.002
oni+0.002
ork+0.002
 fault+0.002
Neg logit contributions
 see-0.001
 wave-0.001
 waves-0.001
 paddle-0.001
 sail-0.001
Token do
Token ID466
Feature activation-3.25e-01

Pos logit contributions
xy+0.004
oni+0.004
liv+0.004
ork+0.004
 fault+0.004
Neg logit contributions
 wave-0.002
 see-0.002
 waves-0.002
seat-0.002
 paddle-0.002
Token it
Token ID340
Feature activation-2.16e-01

Pos logit contributions
xy+0.011
liv+0.010
oni+0.010
ork+0.009
 fault+0.009
Neg logit contributions
 see-0.007
 wave-0.007
 waves-0.007
seat-0.006
 paddle-0.006
Token if
Token ID611
Feature activation+2.70e+00

Pos logit contributions
xy+0.007
liv+0.006
oni+0.006
ork+0.006
 fault+0.006
Neg logit contributions
 see-0.005
 wave-0.005
 waves-0.005
seat-0.004
 sail-0.004
Token we
Token ID356
Feature activation-7.19e-01

Pos logit contributions
 wave+0.054
 see+0.053
 waves+0.049
seat+0.047
 paddle+0.044
Neg logit contributions
xy-0.092
oni-0.087
liv-0.086
ork-0.081
 fault-0.080
Token try
Token ID1949
Feature activation-5.35e-01

Pos logit contributions
xy+0.029
liv+0.028
oni+0.028
ork+0.026
 fault+0.026
Neg logit contributions
 see-0.010
 wave-0.010
 waves-0.009
seat-0.008
 paddle-0.008
Token."
Token ID526
Feature activation-1.41e+00

Pos logit contributions
liv+0.020
xy+0.020
oni+0.018
live+0.018
ork+0.017
Neg logit contributions
 see-0.010
 wave-0.009
 waves-0.009
seat-0.008
 sail-0.008
Token
Token ID198
Feature activation+1.14e-02

Pos logit contributions
xy+0.047
oni+0.045
liv+0.043
ork+0.042
 fault+0.041
Neg logit contributions
 see-0.029
 wave-0.029
seat-0.026
 waves-0.026
 voice-0.024
Token
Token ID198
Feature activation+7.66e-01

Pos logit contributions
 wave+0.000
 see+0.000
 waves+0.000
seat+0.000
 paddle+0.000
Neg logit contributions
xy-0.000
oni-0.000
liv-0.000
ork-0.000
 fault-0.000
Token something
Token ID1223
Feature activation-6.04e-01

Pos logit contributions
xy+0.019
liv+0.018
oni+0.017
live+0.016
 fault+0.016
Neg logit contributions
 see-0.010
 wave-0.010
 waves-0.009
 voice-0.008
seat-0.008
Token fun
Token ID1257
Feature activation-6.22e-01

Pos logit contributions
xy+0.021
oni+0.020
liv+0.020
ork+0.019
 fault+0.018
Neg logit contributions
 see-0.012
 wave-0.012
 waves-0.010
 sailor-0.009
 voice-0.009
Token at
Token ID379
Feature activation+3.24e-01

Pos logit contributions
xy+0.021
liv+0.021
oni+0.020
ork+0.019
 fault+0.019
Neg logit contributions
 see-0.012
 wave-0.012
 waves-0.011
seat-0.011
 voice-0.010
Token home
Token ID1363
Feature activation-1.10e+00

Pos logit contributions
 see+0.005
 wave+0.005
 waves+0.005
seat+0.004
 breath+0.004
Neg logit contributions
xy-0.013
liv-0.012
oni-0.012
ork-0.012
live-0.011
Token.
Token ID13
Feature activation+4.94e-01

Pos logit contributions
xy+0.043
liv+0.043
oni+0.041
live+0.039
 fault+0.039
Neg logit contributions
 see-0.017
 wave-0.016
 voice-0.014
seat-0.014
 waves-0.013
Token But
Token ID887
Feature activation+3.21e+00

Pos logit contributions
 see+0.013
 wave+0.012
seat+0.011
 waves+0.011
 voice+0.010
Neg logit contributions
xy-0.015
oni-0.014
liv-0.014
ork-0.013
 fault-0.012
Token they
Token ID484
Feature activation+4.75e-02

Pos logit contributions
 see+0.062
 wave+0.056
 hand+0.050
 feet+0.047
 there+0.045
Neg logit contributions
xy-0.126
oni-0.118
ork-0.114
liv-0.111
ratch-0.108
Token also
Token ID635
Feature activation+2.49e-02

Pos logit contributions
 see+0.001
 wave+0.001
 wonder+0.001
 trusts+0.001
 hand+0.001
Neg logit contributions
xy-0.002
oni-0.002
ork-0.002
liv-0.002
 fault-0.002
Token said
Token ID531
Feature activation+8.82e-01

Pos logit contributions
 see+0.000
 wave+0.000
 waves+0.000
 trusts+0.000
 voice+0.000
Neg logit contributions
xy-0.001
liv-0.001
oni-0.001
ork-0.001
 fault-0.001
Token they
Token ID484
Feature activation+5.46e-03

Pos logit contributions
 see+0.021
 wave+0.021
 waves+0.020
 voice+0.018
 sailor+0.018
Neg logit contributions
xy-0.028
liv-0.027
oni-0.026
live-0.025
ork-0.024
Token wanted
Token ID2227
Feature activation-1.92e-01

Pos logit contributions
 see+0.000
 wave+0.000
 waves+0.000
 wonder+0.000
 hand+0.000
Neg logit contributions
xy-0.000
liv-0.000
oni-0.000
ork-0.000
ula-0.000
Token are
Token ID389
Feature activation+2.50e-01

Pos logit contributions
xy+0.008
oni+0.008
liv+0.007
ork+0.007
her+0.007
Neg logit contributions
seat-0.005
 reins-0.005
 sle-0.005
 wave-0.005
 voice-0.005
Token friends
Token ID2460
Feature activation+1.76e+00

Pos logit contributions
 wave+0.006
 see+0.005
 waves+0.005
seat+0.005
 paddle+0.005
Neg logit contributions
xy-0.008
oni-0.007
liv-0.007
 fault-0.007
ork-0.006
Token.
Token ID13
Feature activation+2.49e+00

Pos logit contributions
 wave+0.043
 see+0.041
 waves+0.040
seat+0.040
 reins+0.038
Neg logit contributions
xy-0.051
oni-0.048
liv-0.047
ork-0.044
 fault-0.042
Token They
Token ID1119
Feature activation+1.15e+00

Pos logit contributions
 see+0.067
 wave+0.067
seat+0.063
 waves+0.062
 paddle+0.058
Neg logit contributions
xy-0.069
oni-0.063
liv-0.062
ork-0.059
 fault-0.058
Token like
Token ID588
Feature activation+8.97e-01

Pos logit contributions
seat+0.024
 sle+0.023
 transported+0.022
 reins+0.022
 wave+0.022
Neg logit contributions
xy-0.035
oni-0.034
liv-0.032
 fault-0.030
her-0.030
Token to
Token ID284
Feature activation+2.63e+00

Pos logit contributions
 wave+0.025
 see+0.024
seat+0.023
 waves+0.023
 transported+0.022
Neg logit contributions
xy-0.024
oni-0.022
liv-0.021
ork-0.020
 fault-0.020
Token play
Token ID711
Feature activation+6.39e-01

Pos logit contributions
seat+0.045
 wave+0.043
 sle+0.043
 reins+0.042
 transported+0.042
Neg logit contributions
xy-0.090
oni-0.085
liv-0.081
ork-0.080
 fault-0.077
Token in
Token ID287
Feature activation+2.98e-01

Pos logit contributions
 see+0.013
 wave+0.013
 waves+0.012
seat+0.011
 sail+0.011
Neg logit contributions
xy-0.021
liv-0.021
oni-0.020
 fault-0.019
ork-0.019
Token the
Token ID262
Feature activation+9.00e-01

Pos logit contributions
 wave+0.006
 see+0.005
 waves+0.005
seat+0.005
 reins+0.005
Neg logit contributions
xy-0.011
oni-0.010
liv-0.010
ork-0.009
 fault-0.009
Token park
Token ID3952
Feature activation-1.10e+00

Pos logit contributions
 wave+0.017
 see+0.016
 waves+0.016
seat+0.015
 sail+0.013
Neg logit contributions
xy-0.033
oni-0.031
liv-0.031
ork-0.029
 fault-0.029
Token.
Token ID13
Feature activation+2.07e+00

Pos logit contributions
liv+0.049
xy+0.047
 fault+0.045
oni+0.045
ork+0.044
Neg logit contributions
 wave-0.013
 see-0.013
 waves-0.011
 hand-0.010
seat-0.010
Token herself
Token ID5223
Feature activation-6.58e-01

Pos logit contributions
xy+0.020
oni+0.019
ork+0.019
liv+0.019
tr+0.018
Neg logit contributions
 see-0.009
 wave-0.008
 amaz-0.007
 help-0.007
 cheer-0.007
Token!
Token ID0
Feature activation-5.20e-01

Pos logit contributions
xy+0.024
liv+0.022
oni+0.022
ork+0.021
 fault+0.020
Neg logit contributions
 wave-0.012
 see-0.012
 waves-0.011
seat-0.011
 voice-0.010
Token
Token ID220
Feature activation-4.90e-01

Pos logit contributions
xy+0.021
oni+0.020
liv+0.020
ork+0.019
 fault+0.019
Neg logit contributions
 see-0.008
seat-0.006
 wave-0.006
 hand-0.005
 welcome-0.005
Token
Token ID198
Feature activation+4.52e-02

Pos logit contributions
xy+0.019
oni+0.017
liv+0.017
ork+0.017
ho+0.017
Neg logit contributions
 see-0.008
seat-0.007
 wave-0.006
 waves-0.005
 transported-0.005
Token
Token ID198
Feature activation+7.36e-01

Pos logit contributions
 see+0.001
 wave+0.001
seat+0.000
 voice+0.000
 waves+0.000
Neg logit contributions
 fault-0.002
xy-0.002
oni-0.002
liv-0.002
ork-0.002
TokenAt
Token ID2953
Feature activation+3.91e+00

Pos logit contributions
 see+0.013
 wave+0.011
Leave+0.010
 voice+0.010
Who+0.010
Neg logit contributions
 fault-0.028
xy-0.026
oni-0.025
liv-0.024
ork-0.023
Token the
Token ID262
Feature activation+3.10e-01

Pos logit contributions
 wave+0.061
 see+0.060
 breath+0.048
 waves+0.045
seat+0.043
Neg logit contributions
xy-0.161
oni-0.153
ork-0.149
liv-0.149
ess-0.146
Token end
Token ID886
Feature activation-3.53e-01

Pos logit contributions
 wave+0.005
 see+0.004
 waves+0.004
 amaz+0.003
seat+0.003
Neg logit contributions
xy-0.013
oni-0.012
liv-0.012
ork-0.012
 fault-0.011
Token of
Token ID286
Feature activation-1.35e-01

Pos logit contributions
xy+0.017
oni+0.016
liv+0.016
 fault+0.015
ork+0.015
Neg logit contributions
 see-0.003
 wave-0.003
seat-0.002
 voice-0.002
 waves-0.002
Token the
Token ID262
Feature activation-1.41e-01

Pos logit contributions
xy+0.006
oni+0.006
liv+0.005
ork+0.005
 fault+0.005
Neg logit contributions
 see-0.002
 wave-0.002
 breath-0.002
 voice-0.001
 hand-0.001
Token day
Token ID1110
Feature activation+1.16e+00

Pos logit contributions
xy+0.004
oni+0.004
liv+0.004
ork+0.004
 fault+0.004
Neg logit contributions
 wave-0.004
 see-0.003
 waves-0.003
seat-0.003
 voice-0.003
Token up
Token ID510
Feature activation-3.63e-01

Pos logit contributions
 amaz+0.040
 wave+0.038
Leave+0.037
 waves+0.037
 trusts+0.037
Neg logit contributions
xy-0.030
oni-0.026
ork-0.025
liv-0.024
her-0.023
Token a
Token ID257
Feature activation+1.40e-01

Pos logit contributions
xy+0.012
liv+0.012
oni+0.012
 fault+0.011
ork+0.011
Neg logit contributions
 see-0.007
 wave-0.007
 waves-0.007
seat-0.007
 sle-0.006
Token big
Token ID1263
Feature activation+1.07e+00

Pos logit contributions
 wave+0.002
 see+0.002
 waves+0.002
seat+0.002
 sailor+0.002
Neg logit contributions
xy-0.006
oni-0.006
liv-0.005
 fault-0.005
ork-0.005
Token hill
Token ID12788
Feature activation-1.95e+00

Pos logit contributions
Leave+0.023
 see+0.023
seat+0.023
 wave+0.023
 transported+0.023
Neg logit contributions
xy-0.031
oni-0.028
ork-0.028
liv-0.027
 fault-0.025
Token.
Token ID13
Feature activation-2.01e-01

Pos logit contributions
xy+0.069
oni+0.066
liv+0.066
 fault+0.061
ork+0.061
Neg logit contributions
 wave-0.038
 see-0.038
seat-0.034
 waves-0.033
 hand-0.030
Token At
Token ID1629
Feature activation+3.70e+00

Pos logit contributions
xy+0.005
oni+0.005
liv+0.005
ork+0.005
 fault+0.004
Neg logit contributions
 wave-0.005
 see-0.005
 waves-0.005
seat-0.005
 sailor-0.005
Token the
Token ID262
Feature activation+8.02e-02

Pos logit contributions
 wave+0.070
 see+0.067
 waves+0.065
seat+0.063
 reins+0.060
Neg logit contributions
xy-0.127
oni-0.120
liv-0.119
ork-0.113
 fault-0.111
Token top
Token ID1353
Feature activation+8.37e-01

Pos logit contributions
 wave+0.002
 see+0.002
seat+0.002
 reins+0.002
Leave+0.002
Neg logit contributions
xy-0.002
oni-0.002
liv-0.002
ork-0.002
her-0.002
Token of
Token ID286
Feature activation+3.07e-01

Pos logit contributions
 reins+0.017
 wave+0.016
Leave+0.015
 see+0.015
 sle+0.015
Neg logit contributions
xy-0.027
oni-0.026
ork-0.026
liv-0.025
her-0.023
Token the
Token ID262
Feature activation-5.41e-01

Pos logit contributions
 wave+0.006
 see+0.006
seat+0.006
 waves+0.006
 reins+0.006
Neg logit contributions
xy-0.010
oni-0.009
liv-0.009
ork-0.009
 fault-0.008
Token hill
Token ID12788
Feature activation-9.65e-01

Pos logit contributions
xy+0.015
oni+0.014
ork+0.014
liv+0.013
 fault+0.013
Neg logit contributions
Leave-0.012
 wave-0.012
 reins-0.012
 see-0.012
 transported-0.011
Token from
Token ID422
Feature activation+1.92e-01

Pos logit contributions
 wave+0.005
 see+0.005
 waves+0.005
 paddle+0.004
seat+0.004
Neg logit contributions
xy-0.011
liv-0.010
oni-0.010
live-0.010
ork-0.009
Token the
Token ID262
Feature activation+9.53e-01

Pos logit contributions
 see+0.002
 waves+0.002
 wave+0.002
 sail+0.002
 hand+0.002
Neg logit contributions
xy-0.008
liv-0.008
live-0.008
arre-0.008
 fault-0.008
Token cups
Token ID14180
Feature activation+1.47e+00

Pos logit contributions
 wave+0.021
 see+0.020
 waves+0.020
seat+0.018
 voice+0.018
Neg logit contributions
xy-0.032
liv-0.029
oni-0.029
 fault-0.027
ork-0.027
Token.
Token ID13
Feature activation+4.01e-02

Pos logit contributions
 wave+0.023
 see+0.021
 waves+0.020
 hand+0.019
seat+0.019
Neg logit contributions
xy-0.058
liv-0.058
abella-0.052
live-0.051
oni-0.051
Token They
Token ID1119
Feature activation+1.31e-01

Pos logit contributions
 see+0.001
 waves+0.001
 wave+0.001
seat+0.001
 sail+0.001
Neg logit contributions
xy-0.001
liv-0.001
oni-0.001
live-0.001
 fault-0.001
Token ate
Token ID15063
Feature activation+5.09e-01

Pos logit contributions
 wave+0.002
 see+0.002
 waves+0.002
 paddle+0.002
 sail+0.002
Neg logit contributions
xy-0.005
liv-0.005
oni-0.005
 fault-0.005
ork-0.005
Token cookies
Token ID14746
Feature activation-1.97e-01

Pos logit contributions
 see+0.009
 wave+0.009
 waves+0.009
 paddle+0.008
 sail+0.008
Neg logit contributions
xy-0.019
liv-0.018
oni-0.017
live-0.016
ork-0.016
Token and
Token ID290
Feature activation-7.06e-01

Pos logit contributions
liv+0.010
xy+0.010
 fault+0.009
abella+0.009
 virtue+0.009
Neg logit contributions
 wave-0.001
 see-0.001
 waves-0.001
 cheer-0.001
 hand-0.001
Token talked
Token ID6619
Feature activation+5.29e-02

Pos logit contributions
xy+0.023
liv+0.021
oni+0.021
ork+0.020
 fault+0.019
Neg logit contributions
 wave-0.016
 see-0.016
 waves-0.015
seat-0.014
 paddle-0.014
Token in
Token ID287
Feature activation+3.13e-01

Pos logit contributions
 see+0.001
 wave+0.001
 waves+0.001
 voice+0.001
 hand+0.001
Neg logit contributions
xy-0.002
liv-0.002
oni-0.002
live-0.002
ratch-0.002
Token funny
Token ID8258
Feature activation-2.00e-01

Pos logit contributions
 see+0.006
 wave+0.006
 waves+0.005
seat+0.005
 voice+0.005
Neg logit contributions
xy-0.012
liv-0.011
oni-0.011
ork-0.010
 fault-0.010
Token colors
Token ID7577
Feature activation+1.80e+00

Pos logit contributions
xy+0.006
oni+0.006
liv+0.006
 fault+0.006
ork+0.006
Neg logit contributions
 wave-0.004
 see-0.004
seat-0.004
 waves-0.004
 paddle-0.004
Token.
Token ID13
Feature activation+9.75e-01

Pos logit contributions
 wave+0.026
 see+0.025
 waves+0.024
seat+0.022
 hand+0.019
Neg logit contributions
xy-0.072
liv-0.070
oni-0.066
ork-0.065
abella-0.064
Token You
Token ID921
Feature activation+6.55e-01

Pos logit contributions
 see+0.022
seat+0.020
 wave+0.019
 Are+0.018
 waves+0.018
Neg logit contributions
xy-0.032
oni-0.031
liv-0.030
ork-0.028
 fault-0.027
Token can
Token ID460
Feature activation+1.11e-01

Pos logit contributions
 see+0.010
 wave+0.010
 waves+0.009
seat+0.008
 paddle+0.008
Neg logit contributions
xy-0.026
oni-0.024
liv-0.024
ork-0.023
 fault-0.023
Token trip
Token ID5296
Feature activation+1.99e-01

Pos logit contributions
 see+0.002
 wave+0.002
 waves+0.001
 sail+0.001
 paddle+0.001
Neg logit contributions
xy-0.005
liv-0.004
oni-0.004
ork-0.004
 fault-0.004
Token in
Token ID287
Feature activation+1.03e+00

Pos logit contributions
 see+0.004
 wave+0.003
 waves+0.003
 voice+0.002
seat+0.002
Neg logit contributions
xy-0.008
liv-0.008
oni-0.007
 fault-0.007
live-0.007
Token them
Token ID606
Feature activation+1.06e+00

Pos logit contributions
 wave+0.021
 see+0.020
 waves+0.019
seat+0.019
 paddle+0.018
Neg logit contributions
xy-0.034
oni-0.032
liv-0.032
ork-0.030
 fault-0.030
Token."
Token ID526
Feature activation+1.39e-01

Pos logit contributions
 see+0.019
 wave+0.019
 waves+0.017
seat+0.017
 voice+0.015
Neg logit contributions
xy-0.039
liv-0.037
oni-0.037
ork-0.034
live-0.033
Token
Token ID198
Feature activation+8.31e-01

Pos logit contributions
 see+0.003
 wave+0.003
seat+0.002
 waves+0.002
 voice+0.002
Neg logit contributions
xy-0.005
oni-0.005
liv-0.005
ork-0.004
 fault-0.004
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
 see+0.016
 wave+0.016
seat+0.014
 waves+0.014
 voice+0.013
Neg logit contributions
xy-0.029
oni-0.027
liv-0.027
 fault-0.026
ork-0.025
TokenTom
Token ID13787
Feature activation-3.88e-01

Pos logit contributions
 see+0.038
 wave+0.037
seat+0.034
Leave+0.033
 waves+0.032
Neg logit contributions
xy-0.047
oni-0.046
 fault-0.045
liv-0.043
ork-0.041
Token!
Token ID0
Feature activation+2.13e-01

Pos logit contributions
xy+0.017
oni+0.016
liv+0.016
ork+0.015
 fault+0.014
Neg logit contributions
 wave-0.010
 see-0.010
 waves-0.009
seat-0.009
 voice-0.009
Token You
Token ID921
Feature activation+2.30e-01

Pos logit contributions
 wave+0.004
 see+0.004
 waves+0.004
seat+0.004
 paddle+0.004
Neg logit contributions
xy-0.007
oni-0.007
liv-0.007
ork-0.006
 fault-0.006
Token have
Token ID423
Feature activation-5.07e-01

Pos logit contributions
 wave+0.005
 see+0.004
seat+0.004
 waves+0.004
 sle+0.004
Neg logit contributions
xy-0.008
oni-0.007
liv-0.007
ork-0.007
 fault-0.007
Token to
Token ID284
Feature activation+1.15e+00

Pos logit contributions
xy+0.017
oni+0.015
liv+0.015
ork+0.014
live+0.013
Neg logit contributions
 see-0.012
 wave-0.011
 waves-0.011
seat-0.010
 hand-0.010
Token go
Token ID467
Feature activation+4.33e-01

Pos logit contributions
 wave+0.016
 see+0.016
 waves+0.013
 paddle+0.012
 voice+0.011
Neg logit contributions
xy-0.049
oni-0.046
liv-0.046
ork-0.044
 fault-0.043
Token to
Token ID284
Feature activation+1.17e+00

Pos logit contributions
 wave+0.010
 see+0.009
 waves+0.009
seat+0.009
 reins+0.008
Neg logit contributions
xy-0.013
oni-0.012
liv-0.012
ork-0.012
 fault-0.011
Token bed
Token ID3996
Feature activation-9.33e-02

Pos logit contributions
 see+0.021
 wave+0.020
 waves+0.017
seat+0.016
 voice+0.016
Neg logit contributions
xy-0.046
oni-0.043
liv-0.043
ork-0.040
live-0.039
Token now
Token ID783
Feature activation-2.34e-01

Pos logit contributions
xy+0.003
liv+0.003
oni+0.003
 fault+0.003
ratch+0.003
Neg logit contributions
 see-0.002
 wave-0.001
 voice-0.001
 waves-0.001
 pack-0.001
Token!"
Token ID2474
Feature activation-1.90e+00

Pos logit contributions
xy+0.008
oni+0.008
liv+0.008
ork+0.007
 fault+0.007
Neg logit contributions
 wave-0.004
 see-0.004
 waves-0.004
seat-0.004
 paddle-0.004
Token
Token ID220
Feature activation-7.81e-01

Pos logit contributions
xy+0.072
oni+0.068
liv+0.066
ork+0.064
 fault+0.063
Neg logit contributions
 see-0.034
 wave-0.031
seat-0.030
 waves-0.028
 hand-0.028
Token
Token ID198
Feature activation+1.22e-02

Pos logit contributions
xy+0.030
oni+0.028
liv+0.028
ork+0.026
storm+0.026
Neg logit contributions
 see-0.012
seat-0.010
 wave-0.009
 waves-0.009
 hand-0.009
Token's
Token ID338
Feature activation-6.70e-01

Pos logit contributions
xy+0.016
oni+0.015
liv+0.015
ork+0.014
abella+0.014
Neg logit contributions
 wave-0.007
 see-0.007
 waves-0.006
 voice-0.006
seat-0.006
Token warning
Token ID6509
Feature activation-1.19e+00

Pos logit contributions
xy+0.025
liv+0.024
oni+0.024
ork+0.022
abella+0.022
Neg logit contributions
 wave-0.012
 see-0.011
 waves-0.010
 voice-0.010
seat-0.009
Token.
Token ID13
Feature activation-9.74e-01

Pos logit contributions
xy+0.047
liv+0.045
oni+0.044
live+0.042
ork+0.041
Neg logit contributions
 see-0.019
 wave-0.019
 waves-0.016
 voice-0.016
 hand-0.015
Token
Token ID198
Feature activation-3.89e-01

Pos logit contributions
xy+0.040
oni+0.038
liv+0.036
 fault+0.036
ork+0.035
Neg logit contributions
 see-0.015
seat-0.012
 Curious-0.012
 Are-0.011
 voice-0.010
Token
Token ID198
Feature activation+3.86e-01

Pos logit contributions
oni+0.015
xy+0.015
 fault+0.015
liv+0.014
tr+0.014
Neg logit contributions
 see-0.006
 wave-0.004
seat-0.004
 voice-0.004
Leave-0.004
TokenAs
Token ID1722
Feature activation+1.63e+00

Pos logit contributions
 see+0.006
Leave+0.005
 wave+0.005
 voice+0.004
seat+0.004
Neg logit contributions
 fault-0.015
oni-0.015
xy-0.015
liv-0.013
ork-0.013
Token the
Token ID262
Feature activation-7.64e-01

Pos logit contributions
 see+0.024
 wave+0.021
 breath+0.016
 waves+0.015
seat+0.015
Neg logit contributions
xy-0.071
oni-0.068
 fault-0.067
liv-0.064
issors-0.062
Token day
Token ID1110
Feature activation+8.59e-01

Pos logit contributions
xy+0.028
oni+0.027
liv+0.026
ork+0.026
 fault+0.025
Neg logit contributions
 see-0.012
 wave-0.011
seat-0.011
 waves-0.010
 transported-0.010
Token went
Token ID1816
Feature activation+1.20e+00

Pos logit contributions
 wave+0.016
 see+0.015
 reins+0.015
 waves+0.015
seat+0.014
Neg logit contributions
xy-0.029
oni-0.027
liv-0.027
ork-0.027
live-0.024
Token on
Token ID319
Feature activation-4.39e-01

Pos logit contributions
 wave+0.021
 see+0.020
 waves+0.018
seat+0.018
 voice+0.017
Neg logit contributions
xy-0.046
oni-0.044
liv-0.043
 fault-0.041
ork-0.040
Token,
Token ID11
Feature activation+5.16e-01

Pos logit contributions
xy+0.019
oni+0.019
liv+0.018
 fault+0.017
abella+0.017
Neg logit contributions
 see-0.006
 wave-0.005
 voice-0.005
 breath-0.004
 waves-0.004
Token was
Token ID373
Feature activation-2.09e-01

Pos logit contributions
xy+0.005
oni+0.005
liv+0.005
ork+0.005
 fault+0.005
Neg logit contributions
 wave-0.003
 see-0.003
 waves-0.003
seat-0.003
 paddle-0.003
Token amazed
Token ID24562
Feature activation-4.07e-01

Pos logit contributions
xy+0.007
oni+0.007
liv+0.007
ork+0.007
 fault+0.006
Neg logit contributions
 wave-0.004
 see-0.004
 waves-0.004
seat-0.003
 voice-0.003
Token.
Token ID13
Feature activation-7.80e-01

Pos logit contributions
xy+0.014
oni+0.013
liv+0.013
ork+0.012
 fault+0.012
Neg logit contributions
 wave-0.008
 see-0.008
 waves-0.008
seat-0.008
 paddle-0.007
Token She
Token ID1375
Feature activation+1.81e-01

Pos logit contributions
xy+0.030
liv+0.029
oni+0.029
 fault+0.027
edy+0.027
Neg logit contributions
 see-0.013
seat-0.011
 Curious-0.011
 voice-0.011
 waves-0.010
Token felt
Token ID2936
Feature activation+3.12e-01

Pos logit contributions
 wave+0.004
 see+0.003
seat+0.003
 waves+0.003
 reins+0.003
Neg logit contributions
xy-0.006
oni-0.006
liv-0.006
ork-0.005
 fault-0.005
Token like
Token ID588
Feature activation+3.79e-01

Pos logit contributions
seat+0.008
 sle+0.007
 reins+0.007
 wave+0.007
 waves+0.007
Neg logit contributions
oni-0.008
xy-0.008
her-0.007
liv-0.007
 fault-0.007
Token she
Token ID673
Feature activation+3.83e-01

Pos logit contributions
 wave+0.008
 see+0.008
 waves+0.007
seat+0.007
 sailor+0.007
Neg logit contributions
xy-0.013
oni-0.012
liv-0.012
ork-0.011
 fault-0.011
Token was
Token ID373
Feature activation-1.54e-01

Pos logit contributions
 wave+0.009
seat+0.009
 see+0.009
 waves+0.009
 reins+0.009
Neg logit contributions
xy-0.011
oni-0.010
liv-0.010
ork-0.009
her-0.009
Token watching
Token ID4964
Feature activation+1.90e+00

Pos logit contributions
xy+0.005
oni+0.005
liv+0.005
ork+0.005
 fault+0.005
Neg logit contributions
 wave-0.003
 see-0.003
 waves-0.003
seat-0.003
 transported-0.003
Token a
Token ID257
Feature activation+3.27e-01

Pos logit contributions
 see+0.033
 wave+0.031
 voice+0.030
 waves+0.029
 cheer+0.026
Neg logit contributions
liv-0.078
xy-0.077
live-0.071
oni-0.068
 fault-0.067
Token movie
Token ID3807
Feature activation+1.63e-02

Pos logit contributions
 wave+0.006
 see+0.006
 waves+0.005
seat+0.005
 paddle+0.005
Neg logit contributions
xy-0.012
oni-0.011
liv-0.011
ork-0.010
 fault-0.010
Token baby
Token ID5156
Feature activation+2.50e-01

Pos logit contributions
xy+0.006
 fault+0.006
ork+0.005
liv+0.005
oni+0.005
Neg logit contributions
 wave-0.004
 see-0.004
 sle-0.004
 waves-0.004
seat-0.004
Token brother
Token ID3956
Feature activation+6.92e-02

Pos logit contributions
 wave+0.006
 see+0.005
seat+0.005
 waves+0.005
 sle+0.005
Neg logit contributions
xy-0.007
oni-0.007
 fault-0.007
liv-0.007
ork-0.007
Token all
Token ID477
Feature activation-1.41e-01

Pos logit contributions
 see+0.001
 wave+0.001
 waves+0.001
seat+0.001
 voice+0.001
Neg logit contributions
xy-0.003
oni-0.002
liv-0.002
ork-0.002
live-0.002
Token about
Token ID546
Feature activation-1.41e+00

Pos logit contributions
xy+0.004
oni+0.004
liv+0.003
ork+0.003
live+0.003
Neg logit contributions
 wave-0.004
 see-0.004
 waves-0.004
seat-0.004
 voice-0.003
Token cleaning
Token ID12724
Feature activation+1.59e-01

Pos logit contributions
xy+0.052
liv+0.052
oni+0.051
ess+0.049
live+0.049
Neg logit contributions
 see-0.024
 wave-0.021
 waves-0.021
 polish-0.019
 rock-0.019
Token.
Token ID13
Feature activation-8.55e-01

Pos logit contributions
 see+0.003
 waves+0.003
 wave+0.003
seat+0.003
 voice+0.002
Neg logit contributions
xy-0.006
oni-0.005
liv-0.005
ork-0.005
live-0.005
Token Mom
Token ID11254
Feature activation+5.32e-01

Pos logit contributions
xy+0.028
oni+0.026
liv+0.025
ork+0.024
 fault+0.024
Neg logit contributions
 see-0.019
 wave-0.018
seat-0.018
 waves-0.017
 hand-0.016
Tokenmy
Token ID1820
Feature activation-1.99e-01

Pos logit contributions
 wave+0.014
 see+0.013
 waves+0.013
seat+0.013
 paddle+0.012
Neg logit contributions
xy-0.015
oni-0.014
liv-0.014
ork-0.013
 fault-0.012
Token smiled
Token ID13541
Feature activation+1.58e+00

Pos logit contributions
xy+0.007
oni+0.007
liv+0.007
ork+0.006
 fault+0.006
Neg logit contributions
 wave-0.004
 see-0.004
 waves-0.003
seat-0.003
 voice-0.003
Token and
Token ID290
Feature activation-1.85e-01

Pos logit contributions
 wave+0.037
 see+0.036
seat+0.035
 waves+0.035
 paddle+0.035
Neg logit contributions
xy-0.044
oni-0.043
liv-0.039
ork-0.038
her-0.038
Token said
Token ID531
Feature activation+4.69e-01

Pos logit contributions
xy+0.006
oni+0.006
liv+0.006
ork+0.006
 fault+0.006
Neg logit contributions
 wave-0.004
 see-0.003
 waves-0.003
seat-0.003
 paddle-0.003
Token scared
Token ID12008
Feature activation+8.17e-01

Pos logit contributions
 wave+0.014
 see+0.013
seat+0.013
 adventurer+0.013
 trusts+0.012
Neg logit contributions
xy-0.018
oni-0.017
liv-0.017
 fault-0.016
her-0.016
Token.
Token ID13
Feature activation+6.14e-01

Pos logit contributions
 wave+0.015
 see+0.015
 waves+0.014
seat+0.014
 voice+0.013
Neg logit contributions
xy-0.029
oni-0.028
liv-0.028
ork-0.026
 fault-0.025
Token They
Token ID1119
Feature activation+3.40e-01

Pos logit contributions
 see+0.013
seat+0.012
 wave+0.011
 voice+0.011
 waves+0.011
Neg logit contributions
xy-0.021
oni-0.021
liv-0.020
 fault-0.018
live-0.018
Token hug
Token ID16225
Feature activation-3.43e-01

Pos logit contributions
 wave+0.005
seat+0.004
 see+0.004
 waves+0.004
 sle+0.004
Neg logit contributions
xy-0.013
oni-0.012
liv-0.012
ork-0.012
 fault-0.012
Token their
Token ID511
Feature activation-5.33e-01

Pos logit contributions
xy+0.013
liv+0.013
oni+0.012
live+0.012
ork+0.012
Neg logit contributions
 see-0.007
 wave-0.007
 waves-0.006
 voice-0.006
 hand-0.006
Token t
Token ID256
Feature activation-1.01e+00

Pos logit contributions
xy+0.021
liv+0.020
oni+0.019
ork+0.018
live+0.018
Neg logit contributions
 wave-0.009
 waves-0.009
 see-0.008
 voice-0.008
seat-0.007
Tokeneddy
Token ID21874
Feature activation+1.41e+00

Pos logit contributions
xy+0.028
oni+0.026
liv+0.026
ork+0.024
 fault+0.024
Neg logit contributions
 wave-0.026
 see-0.025
 waves-0.024
seat-0.024
 paddle-0.022
Token bears
Token ID13062
Feature activation+8.89e-01

Pos logit contributions
 wave+0.018
 see+0.017
 waves+0.015
seat+0.015
 paddle+0.013
Neg logit contributions
xy-0.059
oni-0.056
liv-0.055
ork-0.053
 fault-0.052
Token and
Token ID290
Feature activation-3.93e-01

Pos logit contributions
 wave+0.012
 see+0.010
 hand+0.010
 waves+0.009
 breath+0.008
Neg logit contributions
liv-0.039
xy-0.039
oni-0.035
ork-0.035
abella-0.035
Token each
Token ID1123
Feature activation-1.34e+00

Pos logit contributions
xy+0.013
oni+0.013
liv+0.012
 fault+0.012
ork+0.011
Neg logit contributions
seat-0.007
 transported-0.006
 wave-0.006
 see-0.006
 adventurer-0.006
Token other
Token ID584
Feature activation+4.80e-02

Pos logit contributions
xy+0.043
oni+0.041
liv+0.040
ork+0.038
 fault+0.037
Neg logit contributions
 wave-0.029
 see-0.028
 waves-0.027
seat-0.026
 paddle-0.024
Token flew
Token ID13112
Feature activation+1.50e+00

Pos logit contributions
xy+0.016
oni+0.014
ork+0.014
ho+0.014
liv+0.013
Neg logit contributions
 reins-0.012
 amaz-0.012
seat-0.012
 adventurer-0.011
 sailor-0.011
Token down
Token ID866
Feature activation-5.51e-02

Pos logit contributions
 wave+0.023
 see+0.020
 waves+0.020
seat+0.018
 voice+0.018
Neg logit contributions
xy-0.062
liv-0.060
oni-0.058
 fault-0.057
ork-0.056
Token and
Token ID290
Feature activation-4.09e-01

Pos logit contributions
xy+0.002
liv+0.002
oni+0.002
 fault+0.002
ork+0.002
Neg logit contributions
 see-0.001
 wave-0.001
seat-0.001
 waves-0.001
 voice-0.001
Token tried
Token ID3088
Feature activation-1.09e+00

Pos logit contributions
xy+0.016
liv+0.015
oni+0.015
 fault+0.015
ork+0.015
Neg logit contributions
 wave-0.007
 see-0.006
 waves-0.006
seat-0.005
 paddle-0.005
Token to
Token ID284
Feature activation-3.58e-01

Pos logit contributions
liv+0.049
xy+0.047
live+0.043
issors+0.042
oni+0.042
Neg logit contributions
 see-0.017
 wave-0.015
 waves-0.013
 hand-0.013
 breath-0.012
Token take
Token ID1011
Feature activation-1.01e+00

Pos logit contributions
xy+0.013
oni+0.012
liv+0.012
ork+0.011
her+0.011
Neg logit contributions
seat-0.006
 wave-0.005
 waves-0.005
 see-0.005
 sle-0.005
Token their
Token ID511
Feature activation+1.01e-01

Pos logit contributions
xy+0.039
liv+0.037
oni+0.036
 fault+0.034
live+0.033
Neg logit contributions
 see-0.019
 wave-0.018
 waves-0.017
 voice-0.015
 hand-0.015
Token food
Token ID2057
Feature activation+9.83e-01

Pos logit contributions
seat+0.003
 wave+0.003
Leave+0.003
 see+0.003
 Are+0.002
Neg logit contributions
xy-0.002
 fault-0.002
oni-0.002
ork-0.002
liv-0.002
Token.
Token ID13
Feature activation-6.83e-01

Pos logit contributions
 wave+0.018
 see+0.018
 waves+0.016
seat+0.015
 hand+0.015
Neg logit contributions
xy-0.037
liv-0.035
oni-0.033
abella-0.032
ork-0.031
Token The
Token ID383
Feature activation-1.80e-01

Pos logit contributions
xy+0.026
liv+0.025
oni+0.024
 fault+0.023
live+0.023
Neg logit contributions
 see-0.013
 wave-0.012
 waves-0.011
 voice-0.011
seat-0.010
Token mom
Token ID1995
Feature activation+1.54e-01

Pos logit contributions
oni+0.006
xy+0.006
ork+0.006
 fault+0.005
liv+0.005
Neg logit contributions
seat-0.003
 see-0.003
 wave-0.003
 hops-0.003
 paddle-0.003
Token the
Token ID262
Feature activation-4.55e-01

Pos logit contributions
 wave+0.002
 see+0.002
 waves+0.002
seat+0.002
 paddle+0.002
Neg logit contributions
xy-0.006
oni-0.006
liv-0.006
ork-0.006
 fault-0.006
Token whisper
Token ID31992
Feature activation+1.51e+00

Pos logit contributions
xy+0.017
oni+0.016
liv+0.016
ork+0.016
 fault+0.015
Neg logit contributions
 wave-0.007
 see-0.007
 waves-0.006
seat-0.006
 paddle-0.006
Token said
Token ID531
Feature activation+9.52e-01

Pos logit contributions
 wave+0.029
 see+0.029
 waves+0.027
seat+0.027
 paddle+0.024
Neg logit contributions
xy-0.051
oni-0.049
liv-0.048
ork-0.046
 fault-0.045
Token,
Token ID11
Feature activation-1.86e+00

Pos logit contributions
 wave+0.013
 see+0.013
 waves+0.011
 voice+0.011
 sailor+0.010
Neg logit contributions
xy-0.039
liv-0.037
oni-0.037
 fault-0.035
ork-0.034
Token
Token ID6184
Feature activation+5.35e-01

Pos logit contributions
xy+0.062
liv+0.062
oni+0.058
 fault+0.057
live+0.056
Neg logit contributions
 wave-0.039
 see-0.039
 waves-0.035
 voice-0.034
 paddle-0.032
Token
Token ID95
Feature activation-6.26e-01

Pos logit contributions
 wave+0.014
 see+0.014
 waves+0.013
 voice+0.013
 paddle+0.012
Neg logit contributions
xy-0.014
oni-0.014
liv-0.013
 fault-0.013
ork-0.012
Token
Token ID26391
Feature activation-1.21e+00

Pos logit contributions
xy+0.016
oni+0.016
 fault+0.015
liv+0.014
ork+0.014
Neg logit contributions
 see-0.017
 wave-0.017
 voice-0.015
 waves-0.015
seat-0.015
Token
Token ID129
Feature activation-1.38e+00

Pos logit contributions
xy+0.035
liv+0.033
oni+0.032
ork+0.031
 fault+0.030
Neg logit contributions
 wave-0.030
 see-0.028
 waves-0.028
seat-0.028
 reins-0.026
Token
Token ID241
Feature activation-1.17e+00

Pos logit contributions
xy+0.041
oni+0.039
liv+0.038
 fault+0.037
tr+0.036
Neg logit contributions
 see-0.034
 wave-0.032
 waves-0.030
 voice-0.030
 hops-0.027
TokenI
Token ID40
Feature activation+3.27e-01

Pos logit contributions
xy+0.038
oni+0.036
liv+0.035
ork+0.033
 fault+0.032
Neg logit contributions
 see-0.026
 wave-0.025
seat-0.023
 waves-0.023
 paddle-0.021
Token am
Token ID716
Feature activation+4.29e-01

Pos logit contributions
 wave+0.006
seat+0.005
 waves+0.005
 reins+0.005
 see+0.005
Neg logit contributions
xy-0.012
oni-0.011
liv-0.011
ork-0.011
 fault-0.010
Token Lily
Token ID20037
Feature activation-3.20e-01

Pos logit contributions
xy+0.028
oni+0.026
 fault+0.023
liv+0.023
live+0.022
Neg logit contributions
 Curious-0.014
 see-0.014
 Are-0.013
 voice-0.012
 hand-0.011
Token was
Token ID373
Feature activation-3.37e-01

Pos logit contributions
xy+0.011
oni+0.011
liv+0.010
 fault+0.010
ork+0.010
Neg logit contributions
 wave-0.006
 see-0.006
 waves-0.006
seat-0.005
 voice-0.005
Token very
Token ID845
Feature activation-3.71e-01

Pos logit contributions
xy+0.011
oni+0.011
liv+0.010
ork+0.010
 fault+0.009
Neg logit contributions
 see-0.007
 wave-0.007
 waves-0.007
seat-0.006
 voice-0.006
Token sad
Token ID6507
Feature activation-7.17e-02

Pos logit contributions
xy+0.013
oni+0.012
liv+0.012
ork+0.011
 fault+0.011
Neg logit contributions
 wave-0.008
 see-0.008
 waves-0.007
seat-0.007
 voice-0.007
Token and
Token ID290
Feature activation-1.33e+00

Pos logit contributions
xy+0.003
oni+0.002
liv+0.002
ork+0.002
 fault+0.002
Neg logit contributions
 wave-0.001
 see-0.001
 waves-0.001
seat-0.001
 voice-0.001
Token realized
Token ID6939
Feature activation-7.60e-01

Pos logit contributions
xy+0.048
oni+0.045
liv+0.044
ork+0.043
 fault+0.043
Neg logit contributions
 see-0.026
 wave-0.026
 waves-0.023
 voice-0.022
seat-0.020
Token that
Token ID326
Feature activation-6.78e-02

Pos logit contributions
xy+0.030
oni+0.028
liv+0.027
 fault+0.026
 virtue+0.026
Neg logit contributions
 see-0.012
 wave-0.011
 waves-0.010
 voice-0.010
seat-0.009
Token being
Token ID852
Feature activation-9.62e-01

Pos logit contributions
xy+0.003
oni+0.003
liv+0.003
ork+0.002
ess+0.002
Neg logit contributions
 see-0.001
 wave-0.001
 waves-0.001
 Are-0.001
Leave-0.001
Token rude
Token ID22066
Feature activation-1.79e+00

Pos logit contributions
xy+0.037
liv+0.033
oni+0.033
ork+0.032
abella+0.029
Neg logit contributions
 see-0.018
 wave-0.017
 waves-0.016
 brave-0.015
Leave-0.015
Token and
Token ID290
Feature activation-1.39e+00

Pos logit contributions
xy+0.063
liv+0.060
oni+0.059
ork+0.055
abella+0.053
Neg logit contributions
 wave-0.034
 see-0.034
 waves-0.032
seat-0.030
 voice-0.029
Token not
Token ID407
Feature activation-1.26e+00

Pos logit contributions
xy+0.048
liv+0.047
ork+0.047
oni+0.046
 fault+0.044
Neg logit contributions
 see-0.028
 wave-0.027
 waves-0.025
 helps-0.023
Leave-0.022
Token did
Token ID750
Feature activation+8.94e-02

Pos logit contributions
 wave+0.018
 see+0.018
 waves+0.016
seat+0.015
 paddle+0.014
Neg logit contributions
xy-0.036
oni-0.034
liv-0.034
ork-0.032
 fault-0.032
Token not
Token ID407
Feature activation-9.35e-01

Pos logit contributions
 see+0.003
 wave+0.003
 waves+0.003
seat+0.003
 voice+0.003
Neg logit contributions
xy-0.002
oni-0.002
liv-0.002
ork-0.002
 fault-0.001
Token feel
Token ID1254
Feature activation+7.47e-01

Pos logit contributions
oni+0.034
xy+0.034
liv+0.032
 fault+0.032
ork+0.032
Neg logit contributions
seat-0.013
 reins-0.012
 sle-0.012
 waves-0.011
 wave-0.011
Token it
Token ID340
Feature activation+4.23e-01

Pos logit contributions
 wave+0.017
seat+0.017
 sle+0.017
 reins+0.016
 waves+0.016
Neg logit contributions
oni-0.020
xy-0.020
 fault-0.020
liv-0.019
ork-0.018
Tokenchy
Token ID29658
Feature activation+7.87e-02

Pos logit contributions
 wave+0.008
 see+0.007
 waves+0.007
seat+0.006
 shine+0.006
Neg logit contributions
xy-0.015
liv-0.014
oni-0.014
ork-0.013
abella-0.013
Token anymore
Token ID7471
Feature activation-2.82e+00

Pos logit contributions
 wave+0.002
 see+0.002
 waves+0.001
seat+0.001
 voice+0.001
Neg logit contributions
xy-0.003
liv-0.003
oni-0.003
ork-0.002
 fault-0.002
Token.
Token ID13
Feature activation+5.28e-01

Pos logit contributions
xy+0.123
liv+0.122
oni+0.114
ratch+0.108
abella+0.107
Neg logit contributions
 see-0.035
 wave-0.031
 voice-0.026
 hand-0.024
 feet-0.023
Token She
Token ID1375
Feature activation+1.05e+00

Pos logit contributions
 see+0.012
 wave+0.011
seat+0.010
 waves+0.010
 voice+0.010
Neg logit contributions
xy-0.017
oni-0.016
liv-0.016
ork-0.015
 fault-0.015
Token was
Token ID373
Feature activation+5.40e-01

Pos logit contributions
 wave+0.020
 see+0.019
 waves+0.018
seat+0.016
 hops+0.015
Neg logit contributions
xy-0.039
oni-0.036
liv-0.035
ork-0.034
 fault-0.034
Token a
Token ID257
Feature activation+7.72e-01

Pos logit contributions
 see+0.011
 wave+0.010
 waves+0.009
 transported+0.008
seat+0.008
Neg logit contributions
xy-0.020
oni-0.018
liv-0.018
ork-0.018
 fault-0.017
Token brave
Token ID14802
Feature activation+1.22e+00

Pos logit contributions
 wave+0.014
 see+0.013
 waves+0.012
seat+0.012
 paddle+0.011
Neg logit contributions
xy-0.028
oni-0.027
liv-0.026
ork-0.025
 fault-0.025
Token then
Token ID788
Feature activation+7.86e-01

Pos logit contributions
xy+0.011
oni+0.011
liv+0.010
ork+0.010
 fault+0.010
Neg logit contributions
 wave-0.006
 see-0.005
 waves-0.005
seat-0.005
 paddle-0.004
Token placed
Token ID4624
Feature activation+8.81e-02

Pos logit contributions
 wave+0.014
 see+0.013
seat+0.013
 waves+0.012
 reins+0.012
Neg logit contributions
xy-0.028
oni-0.026
liv-0.026
ork-0.025
abella-0.024
Token it
Token ID340
Feature activation+2.60e-01

Pos logit contributions
 wave+0.002
 see+0.001
 waves+0.001
seat+0.001
 paddle+0.001
Neg logit contributions
xy-0.003
oni-0.003
liv-0.003
ork-0.003
 fault-0.003
Token somewhere
Token ID7382
Feature activation-5.43e-01

Pos logit contributions
 wave+0.005
 see+0.005
 waves+0.005
seat+0.005
 sle+0.005
Neg logit contributions
xy-0.008
oni-0.008
liv-0.008
ork-0.007
 fault-0.007
Token safe
Token ID3338
Feature activation-1.59e+00

Pos logit contributions
xy+0.017
oni+0.016
liv+0.016
ork+0.015
 fault+0.015
Neg logit contributions
 wave-0.011
 see-0.011
seat-0.011
 waves-0.010
 reins-0.010
Token for
Token ID329
Feature activation-2.77e+00

Pos logit contributions
xy+0.055
oni+0.053
liv+0.052
ork+0.049
 fault+0.047
Neg logit contributions
 wave-0.031
 see-0.030
 waves-0.028
seat-0.028
 paddle-0.025
Token the
Token ID262
Feature activation-1.35e+00

Pos logit contributions
xy+0.097
liv+0.091
oni+0.091
 fault+0.086
ork+0.086
Neg logit contributions
 wave-0.055
 see-0.055
 waves-0.049
seat-0.047
 voice-0.046
Token next
Token ID1306
Feature activation-9.84e-01

Pos logit contributions
xy+0.048
oni+0.045
liv+0.045
ork+0.043
 fault+0.041
Neg logit contributions
 wave-0.026
 see-0.025
 waves-0.024
seat-0.022
 voice-0.021
Token day
Token ID1110
Feature activation+2.64e-01

Pos logit contributions
xy+0.024
oni+0.022
liv+0.022
ork+0.021
 fault+0.020
Neg logit contributions
 see-0.027
 wave-0.027
seat-0.026
 waves-0.025
 reins-0.025
Token.
Token ID13
Feature activation-3.44e-01

Pos logit contributions
 wave+0.005
 see+0.005
 waves+0.005
seat+0.005
 sle+0.004
Neg logit contributions
xy-0.009
liv-0.009
oni-0.009
ork-0.008
 fault-0.008
Token
Token ID220
Feature activation-6.07e-01

Pos logit contributions
xy+0.012
oni+0.012
liv+0.011
ork+0.011
 fault+0.011
Neg logit contributions
 see-0.007
 wave-0.006
seat-0.006
 waves-0.006
 voice-0.005
Token.
Token ID13
Feature activation-8.60e-01

Pos logit contributions
xy+0.019
liv+0.019
oni+0.017
 fault+0.017
ork+0.017
Neg logit contributions
 wave-0.009
 see-0.009
 waves-0.008
seat-0.008
 hand-0.007
Token He
Token ID679
Feature activation-4.47e-01

Pos logit contributions
xy+0.030
liv+0.028
oni+0.027
ork+0.026
 fault+0.026
Neg logit contributions
 see-0.017
 wave-0.017
 waves-0.016
seat-0.015
 voice-0.014
Token smiled
Token ID13541
Feature activation+1.44e+00

Pos logit contributions
xy+0.017
oni+0.016
liv+0.016
 fault+0.016
ork+0.015
Neg logit contributions
 see-0.007
 wave-0.007
 waves-0.006
seat-0.006
 paddle-0.005
Token and
Token ID290
Feature activation-9.45e-01

Pos logit contributions
 wave+0.035
 sle+0.034
 reins+0.033
 see+0.033
seat+0.033
Neg logit contributions
oni-0.040
xy-0.039
 fault-0.034
her-0.033
liv-0.033
Token said
Token ID531
Feature activation+9.73e-02

Pos logit contributions
xy+0.033
oni+0.031
liv+0.030
ork+0.030
 fault+0.029
Neg logit contributions
 wave-0.019
 see-0.018
 waves-0.017
seat-0.016
 paddle-0.015
Token,
Token ID11
Feature activation-2.53e+00

Pos logit contributions
 wave+0.002
seat+0.002
 see+0.002
 waves+0.002
 reins+0.002
Neg logit contributions
xy-0.003
oni-0.003
liv-0.003
ork-0.003
 fault-0.003
Token "
Token ID366
Feature activation-1.05e+00

Pos logit contributions
xy+0.075
liv+0.070
oni+0.070
ork+0.066
 fault+0.065
Neg logit contributions
 wave-0.062
 see-0.061
 waves-0.057
seat-0.056
 paddle-0.053
TokenI
Token ID40
Feature activation-5.66e-01

Pos logit contributions
xy+0.033
oni+0.031
liv+0.030
ork+0.028
 fault+0.027
Neg logit contributions
 wave-0.024
 see-0.023
 waves-0.022
seat-0.022
 paddle-0.021
Token want
Token ID765
Feature activation-9.65e-01

Pos logit contributions
xy+0.021
oni+0.020
liv+0.019
ork+0.019
 fault+0.018
Neg logit contributions
 wave-0.010
 waves-0.009
seat-0.009
 see-0.009
 reins-0.008
Token this
Token ID428
Feature activation-1.37e+00

Pos logit contributions
xy+0.027
oni+0.025
liv+0.024
ork+0.023
 fault+0.023
Neg logit contributions
 wave-0.025
 see-0.024
seat-0.024
 waves-0.023
 paddle-0.022
Token one
Token ID530
Feature activation+6.07e-01

Pos logit contributions
xy+0.045
oni+0.042
liv+0.042
ork+0.039
 fault+0.038
Neg logit contributions
 wave-0.030
 see-0.029
 waves-0.027
seat-0.027
 paddle-0.025
Token have
Token ID423
Feature activation-9.00e-01

Pos logit contributions
xy+0.015
oni+0.014
liv+0.014
ork+0.013
 fault+0.013
Neg logit contributions
 wave-0.009
 see-0.009
 waves-0.008
seat-0.008
 paddle-0.007
Token to
Token ID284
Feature activation+7.43e-01

Pos logit contributions
xy+0.031
liv+0.028
oni+0.028
ork+0.025
live+0.025
Neg logit contributions
 see-0.020
 wave-0.020
 waves-0.019
 hand-0.017
 transported-0.017
Token save
Token ID3613
Feature activation-1.73e+00

Pos logit contributions
 wave+0.010
 see+0.010
 waves+0.009
seat+0.009
 paddle+0.008
Neg logit contributions
xy-0.030
oni-0.028
liv-0.028
ork-0.027
 fault-0.027
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
liv+0.072
xy+0.072
live+0.065
oni+0.064
 fault+0.062
Neg logit contributions
 see-0.028
 waves-0.022
 wave-0.022
 breath-0.020
 help-0.020
Token mouse
Token ID10211
Feature activation-9.17e-01

Pos logit contributions
xy+0.039
liv+0.038
oni+0.036
abella+0.035
ork+0.034
Neg logit contributions
 wave-0.017
 see-0.016
 waves-0.016
 voice-0.016
 girl-0.014
Token!"
Token ID2474
Feature activation-2.46e+00

Pos logit contributions
xy+0.034
liv+0.032
oni+0.032
ork+0.030
 fault+0.030
Neg logit contributions
 wave-0.016
 see-0.015
 waves-0.014
seat-0.014
 voice-0.012
Token Lily
Token ID20037
Feature activation+5.80e-02

Pos logit contributions
xy+0.064
oni+0.058
liv+0.057
 fault+0.054
ork+0.052
Neg logit contributions
 see-0.072
 wave-0.071
 waves-0.067
seat-0.065
 voice-0.065
Token's
Token ID338
Feature activation-3.36e-01

Pos logit contributions
 wave+0.001
 see+0.001
 waves+0.001
seat+0.001
 voice+0.001
Neg logit contributions
xy-0.002
oni-0.002
liv-0.002
ork-0.002
 fault-0.002
Token mom
Token ID1995
Feature activation+1.73e-02

Pos logit contributions
oni+0.009
xy+0.009
ork+0.008
liv+0.008
 fault+0.008
Neg logit contributions
 wave-0.008
seat-0.008
 see-0.008
 waves-0.008
 hops-0.008
Tokenmy
Token ID1820
Feature activation-1.94e-01

Pos logit contributions
 wave+0.001
 waves+0.001
 see+0.000
 paddle+0.000
 hops+0.000
Neg logit contributions
xy-0.000
liv-0.000
oni-0.000
her-0.000
ess-0.000
Token said
Token ID531
Feature activation+5.74e-01

Pos logit contributions
xy+0.006
oni+0.006
liv+0.006
ork+0.006
 fault+0.006
Neg logit contributions
 wave-0.004
 see-0.004
 waves-0.004
seat-0.004
 paddle-0.003
Token him
Token ID683
Feature activation-9.62e-01

Pos logit contributions
xy+0.077
liv+0.073
oni+0.070
ork+0.068
 fault+0.068
Neg logit contributions
 see-0.052
 wave-0.052
 waves-0.048
 voice-0.043
 sailor-0.043
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
xy+0.035
liv+0.034
oni+0.033
ork+0.031
abella+0.030
Neg logit contributions
 see-0.016
 wave-0.016
 hand-0.015
 waves-0.014
seat-0.014
Token It
Token ID632
Feature activation-7.55e-01

Pos logit contributions
xy+0.016
oni+0.015
liv+0.014
ork+0.014
 fault+0.013
Neg logit contributions
 wave-0.013
 see-0.013
 waves-0.012
seat-0.012
 paddle-0.011
Token was
Token ID373
Feature activation-1.30e+00

Pos logit contributions
xy+0.027
oni+0.025
liv+0.025
 fault+0.024
ork+0.024
Neg logit contributions
 wave-0.014
 see-0.014
 waves-0.013
seat-0.012
 paddle-0.011
Token very
Token ID845
Feature activation-9.89e-01

Pos logit contributions
xy+0.047
liv+0.043
oni+0.043
ork+0.041
 fault+0.040
Neg logit contributions
 wave-0.024
 see-0.024
 waves-0.022
seat-0.021
 sle-0.020
Token y
Token ID331
Feature activation-2.73e+00

Pos logit contributions
xy+0.036
liv+0.035
oni+0.034
 fault+0.033
ork+0.033
Neg logit contributions
 wave-0.018
 see-0.017
 waves-0.015
seat-0.014
 sail-0.014
Tokenummy
Token ID13513
Feature activation-2.92e-01

Pos logit contributions
xy+0.066
liv+0.061
oni+0.060
ork+0.056
 fault+0.055
Neg logit contributions
 wave-0.080
 see-0.078
 waves-0.076
seat-0.076
 sle-0.070
Token!
Token ID0
Feature activation+1.15e-01

Pos logit contributions
xy+0.011
liv+0.010
oni+0.010
ork+0.010
 fault+0.009
Neg logit contributions
 wave-0.005
 see-0.005
 waves-0.005
seat-0.004
 sailor-0.004
Token Jack
Token ID3619
Feature activation-8.02e-02

Pos logit contributions
 see+0.003
 wave+0.003
 waves+0.003
seat+0.003
 voice+0.003
Neg logit contributions
xy-0.003
oni-0.003
liv-0.003
ork-0.003
 fault-0.003
Token took
Token ID1718
Feature activation-4.50e-01

Pos logit contributions
xy+0.003
oni+0.003
liv+0.003
 fault+0.003
ork+0.003
Neg logit contributions
 wave-0.001
 see-0.001
 waves-0.001
seat-0.001
 voice-0.001
Token the
Token ID262
Feature activation+6.41e-02

Pos logit contributions
oni+0.013
xy+0.013
ork+0.012
liv+0.012
her+0.011
Neg logit contributions
 wave-0.011
seat-0.010
 see-0.010
Leave-0.009
 waves-0.009