TOP ACTIVATIONS
MAX = 3.659

a field of beautiful caul if l owers . They were
" Okay , Mom . Can we own the rain ,
replied , " My blanket flew away , and it landed
followed the crane as it flew high into the sky .
taught her about how ants lived . He was very patient
scared of that trap . Can you help me get past
said the bird , " if you promise to share your
came across a secret cave guarded by an old wizard .
Oh yes , please ! Can we go now ?" Lily
and went on many more saf aris together . <|endoftext|> Once
" Yes , please ! Can you help me cover my
" Yes , please . Can you help me carry some
," That 's smart ! Can I learn too ?" Rory
, a swarm of bees flew out of the hive and
where a group of bees live ," replied the parent .
a large carrot which was guarded by a large turtle .
Isn â t this
, Lily and Max ! Can you speak ?" Lily said
. Suddenly , something scary flew out of the wheat .
and Sam went on a saf ari with their mom and

MIN ACTIVATIONS
MIN = -3.843

with fries and a mil ksh ake . But after eating
doing ? Why are you arguing ?" Tom and
as they drank the mil ksh akes .
don 't you enter a panc ake cooking contest ?" Tim
not to stop stirring the panc ake batter .
mom my her drawing . " Wow , Lily ! I
" Why are you arguing ?" Mom asks .
Mia , why are you arguing ?" Tom 's mom asks
. You should never press it unless you are in danger
crown and playing in the freshly cut grass . It looked
day drinking y ummy mil ksh akes in the sun .
to show her mom . " Look , Mom ! I
to show his mom . " Look , Mom ," said
ksh akes . The mil ksh akes were blue and pink
She had never seen a telephone before and was very excited
squ ished it together . " What can I make this
made a funny face . " Y uck , this apple
and continued to stir the panc ake batter .
put it on top . " Hello , friend !" said
to her mom my . " Look mom my , I

INTERVAL 1.784 - 3.659
CONTAINS 0.758%

smiled and said , " Thank you Stella ! What should
it was so dark he couldn 't see anything . Then
, " Come on , I 'll show you . I
day , her mom said she had to wear a green
see animals , trees , flowers and a house .

INTERVAL -0.092 - 1.784
CONTAINS 54.200%

said , " If you keep signing the ice , it
't know ," Ben said . " Maybe it belongs to
go to the circus too . But he didn 't have
a time , there was a little girl named Lily .
a sad river . The river whispered in the wind ,

INTERVAL -1.967 - -0.092
CONTAINS 44.767%

was a good idea . So , they both ran to
likes yoga . She feels good and relaxed . She thinks
day on , Lily always looked for new things to discover
the screw came out . Tom was sad and didn 't
the door . He tried hard , but it never opened

INTERVAL -3.843 - -1.967
CONTAINS 0.275%

promise me you will not do this again . Do you
! It has a funny face on it ."
Wow , Lily ! Your painting is so pretty . Can
They wish they did not fight . They want to play
careful not to release the snowball !" They
Token a
Token ID257
Feature activation+8.79e-01

Pos logit contributions
ze+0.024
zed+0.022
 character+0.020
hop+0.019
 bottom+0.018
Neg logit contributions
arl-0.018
 upon-0.017
ll-0.017
ety-0.016
endant-0.015
Token field
Token ID2214
Feature activation+1.26e+00

Pos logit contributions
arl+0.021
 upon+0.019
ll+0.019
ety+0.018
endant+0.017
Neg logit contributions
ze-0.026
zed-0.023
 character-0.022
hop-0.021
 bottom-0.020
Token of
Token ID286
Feature activation+1.05e-01

Pos logit contributions
arl+0.027
ll+0.025
 upon+0.024
ety+0.024
endant+0.022
Neg logit contributions
ze-0.040
zed-0.036
 character-0.034
hop-0.033
 bottom-0.032
Token beautiful
Token ID4950
Feature activation+6.67e-01

Pos logit contributions
arl+0.003
 upon+0.003
ll+0.003
ety+0.002
endant+0.002
Neg logit contributions
ze-0.003
zed-0.003
 character-0.002
hop-0.002
 bottom-0.002
Token caul
Token ID46087
Feature activation+6.61e-02

Pos logit contributions
 upon+0.015
arl+0.014
ll+0.013
ety+0.013
endant+0.013
Neg logit contributions
ze-0.020
zed-0.019
 character-0.017
hop-0.016
 bottom-0.015
Tokenif
Token ID361
Feature activation+3.66e+00

Pos logit contributions
arl+0.002
ll+0.002
 upon+0.002
ety+0.002
endant+0.002
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Tokenl
Token ID75
Feature activation+6.07e-01

Pos logit contributions
arl+0.102
ety+0.090
 Bian+0.089
endant+0.088
ll+0.084
Neg logit contributions
ze-0.073
zed-0.067
 bottom-0.065
th-0.058
liest-0.057
Tokenowers
Token ID3618
Feature activation+1.16e+00

Pos logit contributions
arl+0.016
 upon+0.015
ll+0.014
ety+0.014
endant+0.014
Neg logit contributions
ze-0.017
zed-0.015
 character-0.014
hop-0.013
 bottom-0.013
Token.
Token ID13
Feature activation-1.58e-01

Pos logit contributions
arl+0.033
ety+0.028
ll+0.027
 upon+0.027
endant+0.027
Neg logit contributions
ze-0.029
zed-0.026
 character-0.024
hop-0.024
 bottom-0.023
Token They
Token ID1119
Feature activation-2.58e-02

Pos logit contributions
ze+0.005
zed+0.005
 character+0.005
hop+0.005
 bottom+0.004
Neg logit contributions
arl-0.003
ll-0.003
 upon-0.003
ety-0.002
endant-0.002
Token were
Token ID547
Feature activation-4.91e-01

Pos logit contributions
ze+0.001
zed+0.001
 character+0.001
hop+0.001
 bottom+0.001
Neg logit contributions
arl-0.001
 upon-0.001
ll-0.001
ety-0.001
endant-0.001
Token"
Token ID1
Feature activation-6.02e-02

Pos logit contributions
arl+0.051
 upon+0.048
ll+0.047
ety+0.046
endant+0.044
Neg logit contributions
ze-0.042
zed-0.037
 character-0.034
hop-0.032
 bottom-0.030
TokenOkay
Token ID16454
Feature activation+1.23e+00

Pos logit contributions
ze+0.001
zed+0.001
 character+0.001
hop+0.001
 tunes+0.001
Neg logit contributions
arl-0.002
 upon-0.002
ll-0.002
endant-0.002
ety-0.002
Token,
Token ID11
Feature activation-3.72e-01

Pos logit contributions
arl+0.027
 upon+0.025
ll+0.024
ety+0.023
endant+0.022
Neg logit contributions
ze-0.039
zed-0.035
 character-0.033
hop-0.032
 bottom-0.030
Token Mom
Token ID11254
Feature activation+9.77e-01

Pos logit contributions
ze+0.008
 delight+0.006
 character+0.006
zed+0.006
 bottom+0.006
Neg logit contributions
arl-0.012
ll-0.011
ety-0.011
 Bian-0.011
endant-0.010
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
arl+0.031
 Bian+0.029
 upon+0.028
endant+0.028
add+0.026
Neg logit contributions
ze-0.019
est-0.015
 character-0.014
hop-0.014
zed-0.014
Token Can
Token ID1680
Feature activation+3.54e+00

Pos logit contributions
arl+0.025
 upon+0.024
ll+0.024
ety+0.023
endant+0.022
Neg logit contributions
ze-0.029
zed-0.026
 character-0.025
hop-0.023
lder-0.023
Token we
Token ID356
Feature activation-1.88e-02

Pos logit contributions
arl+0.099
endant+0.082
 upon+0.080
ety+0.079
ucker+0.079
Neg logit contributions
ze-0.088
zed-0.082
hop-0.073
 bottom-0.073
 middle-0.073
Token own
Token ID898
Feature activation+1.09e-01

Pos logit contributions
ze+0.000
zed+0.000
hop+0.000
 bottom+0.000
 character+0.000
Neg logit contributions
arl-0.001
 upon-0.001
endant-0.001
ll-0.001
ety-0.001
Token the
Token ID262
Feature activation-1.12e-01

Pos logit contributions
arl+0.002
 upon+0.002
ll+0.002
 Bian+0.002
endant+0.002
Neg logit contributions
ze-0.003
zed-0.003
 character-0.003
hop-0.003
lder-0.003
Token rain
Token ID6290
Feature activation+4.18e-01

Pos logit contributions
ze+0.003
zed+0.003
 character+0.002
hop+0.002
 bottom+0.002
Neg logit contributions
arl-0.003
ll-0.003
 upon-0.003
ety-0.003
endant-0.003
Token,
Token ID11
Feature activation-3.66e-02

Pos logit contributions
arl+0.011
 Bian+0.010
endant+0.009
 upon+0.009
ll+0.009
Neg logit contributions
ze-0.011
hop-0.009
zed-0.009
 character-0.009
 movies-0.009
Token replied
Token ID8712
Feature activation+5.08e-02

Pos logit contributions
arl+0.015
ll+0.013
 upon+0.012
ety+0.012
endant+0.011
Neg logit contributions
ze-0.027
zed-0.025
 character-0.023
hop-0.023
lder-0.022
Token,
Token ID11
Feature activation+2.09e-01

Pos logit contributions
 upon+0.001
arl+0.001
ll+0.001
 ago+0.001
 Tweet+0.000
Neg logit contributions
ze-0.002
zed-0.002
hop-0.002
 tunes-0.002
 character-0.002
Token "
Token ID366
Feature activation-3.34e-01

Pos logit contributions
arl+0.005
 upon+0.005
ll+0.005
ety+0.005
endant+0.004
Neg logit contributions
ze-0.006
zed-0.005
 character-0.005
hop-0.005
 tunes-0.004
TokenMy
Token ID3666
Feature activation-8.39e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.007
hop+0.007
 bottom+0.006
Neg logit contributions
arl-0.009
 upon-0.009
ll-0.008
ety-0.008
endant-0.008
Token blanket
Token ID18447
Feature activation+4.67e-02

Pos logit contributions
ze+0.023
zed+0.021
 character+0.019
hop+0.019
 bottom+0.018
Neg logit contributions
arl-0.021
 upon-0.020
ll-0.019
ety-0.019
endant-0.018
Token flew
Token ID13112
Feature activation+3.53e+00

Pos logit contributions
arl+0.001
ety+0.001
 Bian+0.001
 upon+0.001
ll+0.001
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
 bottom-0.001
hop-0.001
Token away
Token ID1497
Feature activation+8.38e-01

Pos logit contributions
ll+0.116
arl+0.112
endant+0.109
emy+0.106
 Bian+0.105
Neg logit contributions
ze-0.068
 bottom-0.052
zed-0.046
 middle-0.046
est-0.044
Token,
Token ID11
Feature activation-6.16e-01

Pos logit contributions
arl+0.025
ety+0.023
endant+0.023
 Bian+0.022
ll+0.022
Neg logit contributions
ze-0.019
zed-0.015
 bottom-0.015
 character-0.014
hop-0.013
Token and
Token ID290
Feature activation-3.36e-01

Pos logit contributions
ze+0.019
zed+0.017
 character+0.016
hop+0.015
 bottom+0.015
Neg logit contributions
arl-0.014
ll-0.013
ety-0.012
 upon-0.012
endant-0.012
Token it
Token ID340
Feature activation-3.48e-01

Pos logit contributions
ze+0.011
zed+0.010
 character+0.009
hop+0.009
 bottom+0.009
Neg logit contributions
arl-0.007
ll-0.006
 upon-0.006
ety-0.006
endant-0.006
Token landed
Token ID11406
Feature activation+1.93e+00

Pos logit contributions
ze+0.007
zed+0.005
 character+0.004
 bottom+0.004
hop+0.004
Neg logit contributions
arl-0.013
 upon-0.011
ety-0.011
ll-0.011
 ago-0.011
Token followed
Token ID3940
Feature activation-6.46e-01

Pos logit contributions
ze+0.014
zed+0.012
hop+0.011
 character+0.010
 middle+0.010
Neg logit contributions
arl-0.019
endant-0.017
ll-0.017
 upon-0.016
 ago-0.016
Token the
Token ID262
Feature activation-1.48e-01

Pos logit contributions
ze+0.019
zed+0.017
 character+0.016
hop+0.015
 bottom+0.015
Neg logit contributions
arl-0.015
 upon-0.014
ll-0.014
ety-0.013
endant-0.013
Token crane
Token ID41175
Feature activation+2.14e-01

Pos logit contributions
ze+0.004
zed+0.003
 character+0.003
 bottom+0.003
hop+0.003
Neg logit contributions
arl-0.004
 upon-0.004
ll-0.004
ety-0.004
endant-0.004
Token as
Token ID355
Feature activation+7.23e-01

Pos logit contributions
arl+0.005
endant+0.004
 upon+0.004
ll+0.004
ety+0.004
Neg logit contributions
ze-0.006
zed-0.006
hop-0.005
 character-0.005
 bottom-0.005
Token it
Token ID340
Feature activation+1.03e+00

Pos logit contributions
arl+0.021
ll+0.017
endant+0.017
ety+0.017
aj+0.016
Neg logit contributions
ze-0.019
zed-0.017
 character-0.016
 bottom-0.016
hop-0.015
Token flew
Token ID13112
Feature activation+3.49e+00

Pos logit contributions
arl+0.030
ll+0.025
ety+0.025
 upon+0.024
endant+0.024
Neg logit contributions
ze-0.028
zed-0.025
hop-0.022
 character-0.021
 bottom-0.020
Token high
Token ID1029
Feature activation+1.10e+00

Pos logit contributions
arl+0.121
ll+0.119
endant+0.116
ucker+0.112
aj+0.111
Neg logit contributions
ze-0.058
 bottom-0.046
 across-0.044
 middle-0.043
est-0.038
Token into
Token ID656
Feature activation-1.43e-01

Pos logit contributions
arl+0.033
ll+0.030
 upon+0.029
endant+0.029
 haven+0.029
Neg logit contributions
ze-0.025
zed-0.021
hop-0.020
 bottom-0.020
 character-0.019
Token the
Token ID262
Feature activation+2.55e-01

Pos logit contributions
ze+0.003
zed+0.003
 character+0.003
hop+0.003
 bottom+0.003
Neg logit contributions
arl-0.004
ll-0.004
 upon-0.004
ety-0.004
endant-0.004
Token sky
Token ID6766
Feature activation+8.26e-01

Pos logit contributions
arl+0.007
ll+0.006
 upon+0.006
ety+0.006
 Bian+0.006
Neg logit contributions
ze-0.007
zed-0.006
 character-0.005
hop-0.005
 bottom-0.005
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
arl+0.023
ll+0.020
 upon+0.020
ety+0.020
 Bian+0.020
Neg logit contributions
ze-0.022
zed-0.018
 character-0.017
hop-0.017
 bottom-0.017
Token taught
Token ID7817
Feature activation+4.98e-01

Pos logit contributions
ze+0.003
zed+0.003
 character+0.002
hop+0.002
 bottom+0.002
Neg logit contributions
arl-0.002
 upon-0.002
ll-0.002
ety-0.002
endant-0.002
Token her
Token ID607
Feature activation+1.11e+00

Pos logit contributions
arl+0.013
ll+0.012
 upon+0.012
ety+0.012
endant+0.011
Neg logit contributions
ze-0.013
zed-0.012
 character-0.011
hop-0.011
 bottom-0.010
Token about
Token ID546
Feature activation+1.93e-01

Pos logit contributions
 upon+0.024
arl+0.023
ll+0.022
ety+0.020
endant+0.020
Neg logit contributions
ze-0.033
zed-0.030
 character-0.028
hop-0.028
lder-0.027
Token how
Token ID703
Feature activation-3.93e-01

Pos logit contributions
arl+0.006
ll+0.005
ety+0.005
 upon+0.005
endant+0.005
Neg logit contributions
ze-0.005
zed-0.004
 character-0.004
hop-0.004
 bottom-0.003
Token ants
Token ID27842
Feature activation+6.42e-01

Pos logit contributions
ze+0.012
zed+0.011
 character+0.010
hop+0.010
 bottom+0.009
Neg logit contributions
arl-0.009
ll-0.008
ety-0.008
 upon-0.007
endant-0.007
Token lived
Token ID5615
Feature activation+3.46e+00

Pos logit contributions
arl+0.018
ety+0.016
 upon+0.015
endant+0.015
aj+0.015
Neg logit contributions
ze-0.017
zed-0.015
 character-0.014
hop-0.013
 bottom-0.013
Token.
Token ID13
Feature activation+4.64e-02

Pos logit contributions
arl+0.100
ll+0.086
ety+0.084
endant+0.082
 Bian+0.082
Neg logit contributions
ze-0.093
zed-0.078
hop-0.071
 character-0.070
 bottom-0.070
Token He
Token ID679
Feature activation+7.60e-01

Pos logit contributions
arl+0.001
 upon+0.001
ll+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.002
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token was
Token ID373
Feature activation-2.84e-01

Pos logit contributions
arl+0.020
 upon+0.018
ll+0.018
ety+0.017
endant+0.016
Neg logit contributions
ze-0.021
zed-0.019
 character-0.018
hop-0.017
 bottom-0.016
Token very
Token ID845
Feature activation-2.91e-03

Pos logit contributions
ze+0.008
zed+0.007
 character+0.007
hop+0.007
 bottom+0.007
Neg logit contributions
arl-0.006
 upon-0.006
ll-0.006
ety-0.006
 Bian-0.005
Token patient
Token ID5827
Feature activation+5.79e-01

Pos logit contributions
ze+0.000
zed+0.000
 character+0.000
hop+0.000
 bottom+0.000
Neg logit contributions
arl-0.000
ll-0.000
 upon-0.000
ety-0.000
endant-0.000
Token scared
Token ID12008
Feature activation-4.21e-01

Pos logit contributions
arl+0.033
ll+0.032
ety+0.031
endant+0.029
aj+0.029
Neg logit contributions
ze-0.029
zed-0.023
hop-0.020
 character-0.019
 middle-0.019
Token of
Token ID286
Feature activation+1.02e+00

Pos logit contributions
ze+0.012
 character+0.010
zed+0.010
hop+0.010
est+0.009
Neg logit contributions
arl-0.010
endant-0.009
ll-0.009
 Bian-0.009
ety-0.009
Token that
Token ID326
Feature activation-2.65e-01

Pos logit contributions
arl+0.031
ll+0.029
 upon+0.027
ety+0.027
endant+0.026
Neg logit contributions
ze-0.025
zed-0.021
 bottom-0.019
hop-0.019
 character-0.018
Token trap
Token ID12840
Feature activation+6.45e-01

Pos logit contributions
ze+0.008
zed+0.008
 character+0.007
hop+0.007
 tunes+0.007
Neg logit contributions
arl-0.006
 upon-0.005
ll-0.005
ety-0.005
endant-0.005
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
arl+0.020
ety+0.018
endant+0.018
 Bian+0.017
ll+0.017
Neg logit contributions
ze-0.016
zed-0.013
 character-0.012
hop-0.011
 bottom-0.011
Token Can
Token ID1680
Feature activation+3.45e+00

Pos logit contributions
ze+0.005
 character+0.004
zed+0.004
hop+0.004
+0.004
Neg logit contributions
ll-0.007
arl-0.006
 upon-0.006
endant-0.006
ety-0.006
Token you
Token ID345
Feature activation+9.41e-01

Pos logit contributions
arl+0.112
endant+0.093
 ago+0.092
aj+0.091
ucker+0.091
Neg logit contributions
ze-0.079
zed-0.071
hop-0.063
 bottom-0.059
 everybody-0.057
Token help
Token ID1037
Feature activation+1.71e-01

Pos logit contributions
arl+0.032
endant+0.031
 ago+0.031
ety+0.030
aj+0.029
Neg logit contributions
ze-0.021
zed-0.014
 bottom-0.014
hop-0.012
 end-0.012
Token me
Token ID502
Feature activation+4.69e-01

Pos logit contributions
arl+0.004
 upon+0.004
ll+0.004
ety+0.004
endant+0.004
Neg logit contributions
ze-0.005
zed-0.004
 character-0.004
hop-0.004
 bottom-0.004
Token get
Token ID651
Feature activation+1.72e-01

Pos logit contributions
arl+0.013
 upon+0.012
ety+0.012
ll+0.012
endant+0.012
Neg logit contributions
ze-0.012
zed-0.010
hop-0.009
 bottom-0.009
 character-0.008
Token past
Token ID1613
Feature activation-5.83e-02

Pos logit contributions
arl+0.005
ll+0.004
 upon+0.004
ety+0.004
endant+0.004
Neg logit contributions
ze-0.005
zed-0.004
 character-0.004
hop-0.004
 bottom-0.003
Token said
Token ID531
Feature activation-5.44e-01

Pos logit contributions
ze+0.002
 character+0.002
zed+0.002
hop+0.002
 delight+0.002
Neg logit contributions
arl-0.002
ll-0.002
ety-0.002
endant-0.002
 upon-0.002
Token the
Token ID262
Feature activation+1.75e-01

Pos logit contributions
ze+0.014
zed+0.013
 character+0.012
hop+0.011
 bottom+0.011
Neg logit contributions
arl-0.015
ll-0.014
 upon-0.014
ety-0.013
endant-0.013
Token bird
Token ID6512
Feature activation+3.11e-01

Pos logit contributions
arl+0.005
ll+0.005
 upon+0.005
ety+0.005
 Bian+0.004
Neg logit contributions
ze-0.004
 character-0.004
zed-0.003
 bottom-0.003
lder-0.003
Token,
Token ID11
Feature activation+3.92e-01

Pos logit contributions
arl+0.008
ll+0.007
 upon+0.007
ety+0.007
 Bian+0.007
Neg logit contributions
ze-0.009
zed-0.008
 character-0.007
hop-0.007
 bottom-0.006
Token "
Token ID366
Feature activation+2.70e-01

Pos logit contributions
arl+0.012
ll+0.011
ety+0.011
 upon+0.010
endant+0.010
Neg logit contributions
ze-0.010
zed-0.009
 character-0.008
hop-0.007
 bottom-0.007
Tokenif
Token ID361
Feature activation+3.43e+00

Pos logit contributions
arl+0.008
ll+0.008
 upon+0.008
ety+0.007
endant+0.007
Neg logit contributions
ze-0.006
zed-0.005
hop-0.005
 character-0.005
 tunes-0.004
Token you
Token ID345
Feature activation+7.78e-01

Pos logit contributions
ety+0.093
arl+0.093
aj+0.086
ucker+0.084
endant+0.083
Neg logit contributions
ze-0.074
zed-0.072
 middle-0.061
 everybody-0.059
hop-0.059
Token promise
Token ID6991
Feature activation-3.44e-01

Pos logit contributions
arl+0.022
 Bian+0.018
ucker+0.018
emy+0.018
ety+0.018
Neg logit contributions
ze-0.021
zed-0.017
 bottom-0.015
hop-0.015
 character-0.015
Token to
Token ID284
Feature activation-2.29e-01

Pos logit contributions
ze+0.011
zed+0.010
 character+0.009
hop+0.009
 bottom+0.009
Neg logit contributions
arl-0.008
ll-0.007
 upon-0.007
ety-0.006
endant-0.006
Token share
Token ID2648
Feature activation+1.16e-01

Pos logit contributions
ze+0.006
zed+0.005
hop+0.004
 bottom+0.004
 character+0.004
Neg logit contributions
arl-0.006
ety-0.006
ll-0.006
 upon-0.006
endant-0.006
Token your
Token ID534
Feature activation+7.17e-02

Pos logit contributions
arl+0.002
 upon+0.002
ll+0.002
ety+0.002
endant+0.002
Neg logit contributions
ze-0.004
zed-0.003
 character-0.003
hop-0.003
lder-0.003
Token came
Token ID1625
Feature activation+4.36e-02

Pos logit contributions
ze+0.003
zed+0.003
hop+0.003
 character+0.003
 bottom+0.003
Neg logit contributions
arl-0.004
endant-0.003
ll-0.003
ety-0.003
 upon-0.003
Token across
Token ID1973
Feature activation-1.72e-01

Pos logit contributions
arl+0.001
endant+0.001
ll+0.001
 Bian+0.001
ety+0.001
Neg logit contributions
ze-0.001
zed-0.001
 bottom-0.001
hop-0.001
 middle-0.001
Token a
Token ID257
Feature activation+8.15e-01

Pos logit contributions
ze+0.004
zed+0.004
 character+0.003
 bottom+0.003
hop+0.003
Neg logit contributions
arl-0.006
ll-0.005
ety-0.005
endant-0.005
ucker-0.005
Token secret
Token ID3200
Feature activation+1.09e+00

Pos logit contributions
arl+0.019
 upon+0.018
ll+0.017
ety+0.016
endant+0.016
Neg logit contributions
ze-0.024
zed-0.022
 character-0.020
hop-0.020
 bottom-0.019
Token cave
Token ID11527
Feature activation+1.73e+00

Pos logit contributions
arl+0.031
ll+0.028
ety+0.028
 upon+0.026
 Bian+0.026
Neg logit contributions
ze-0.027
zed-0.024
 character-0.024
 bottom-0.022
hop-0.022
Token guarded
Token ID28178
Feature activation+3.43e+00

Pos logit contributions
arl+0.052
ety+0.047
 Bian+0.046
ll+0.046
endant+0.043
Neg logit contributions
ze-0.041
zed-0.037
 character-0.034
hop-0.033
 bottom-0.032
Token by
Token ID416
Feature activation-2.42e-01

Pos logit contributions
arl+0.107
endant+0.099
ll+0.096
ucker+0.092
aj+0.089
Neg logit contributions
ze-0.073
 bottom-0.063
 middle-0.062
zed-0.061
 character-0.061
Token an
Token ID281
Feature activation+2.20e-02

Pos logit contributions
ze+0.006
zed+0.005
 character+0.005
 bottom+0.005
hop+0.005
Neg logit contributions
arl-0.007
ll-0.007
ety-0.006
endant-0.006
ucker-0.006
Token old
Token ID1468
Feature activation+1.82e+00

Pos logit contributions
arl+0.000
 upon+0.000
ll+0.000
ety+0.000
endant+0.000
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
hop-0.001
lder-0.001
Token wizard
Token ID18731
Feature activation+1.05e+00

Pos logit contributions
arl+0.051
ll+0.047
ety+0.044
 upon+0.044
 Bian+0.042
Neg logit contributions
ze-0.047
 character-0.041
zed-0.040
hop-0.038
 bottom-0.037
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
arl+0.029
ll+0.025
ety+0.025
 upon+0.025
 Bian+0.024
Neg logit contributions
ze-0.029
zed-0.025
 character-0.024
hop-0.024
 tunes-0.021
TokenOh
Token ID5812
Feature activation+7.95e-01

Pos logit contributions
ze+0.007
zed+0.007
 character+0.006
hop+0.006
 bottom+0.006
Neg logit contributions
arl-0.006
 upon-0.006
ll-0.006
ety-0.005
 Bian-0.005
Token yes
Token ID3763
Feature activation+9.74e-01

Pos logit contributions
arl+0.018
 upon+0.016
ll+0.016
ety+0.015
endant+0.015
Neg logit contributions
ze-0.025
zed-0.022
 character-0.021
hop-0.020
 bottom-0.019
Token,
Token ID11
Feature activation+6.54e-02

Pos logit contributions
arl+0.023
 upon+0.021
ll+0.021
ety+0.020
endant+0.019
Neg logit contributions
ze-0.029
zed-0.026
 character-0.025
hop-0.024
 bottom-0.023
Token please
Token ID3387
Feature activation+1.47e+00

Pos logit contributions
ll+0.002
arl+0.002
ety+0.002
 upon+0.002
 ago+0.002
Neg logit contributions
ze-0.002
 character-0.001
zed-0.001
 delight-0.001
est-0.001
Token!
Token ID0
Feature activation+7.68e-01

Pos logit contributions
arl+0.043
 upon+0.040
ll+0.039
ety+0.039
endant+0.038
Neg logit contributions
ze-0.035
zed-0.031
 character-0.029
hop-0.027
lder-0.026
Token Can
Token ID1680
Feature activation+3.43e+00

Pos logit contributions
ll+0.021
arl+0.021
 upon+0.020
endant+0.019
ety+0.019
Neg logit contributions
ze-0.018
zed-0.016
 character-0.016
hop-0.015
five-0.013
Token we
Token ID356
Feature activation+4.15e-02

Pos logit contributions
arl+0.098
 upon+0.079
endant+0.079
ucker+0.078
ll+0.078
Neg logit contributions
ze-0.085
zed-0.081
hop-0.075
 character-0.071
 middle-0.069
Token go
Token ID467
Feature activation+7.85e-01

Pos logit contributions
arl+0.001
endant+0.001
 upon+0.001
 ago+0.001
ll+0.001
Neg logit contributions
ze-0.001
zed-0.001
hop-0.001
 bottom-0.001
 character-0.001
Token now
Token ID783
Feature activation-7.58e-01

Pos logit contributions
arl+0.024
endant+0.021
 Bian+0.020
ucker+0.020
emy+0.019
Neg logit contributions
ze-0.018
hop-0.016
zed-0.015
 middle-0.013
 bottom-0.013
Token?"
Token ID1701
Feature activation-2.62e-01

Pos logit contributions
ze+0.020
zed+0.017
hop+0.016
 bottom+0.016
 character+0.016
Neg logit contributions
arl-0.020
endant-0.018
ll-0.018
ety-0.018
 upon-0.018
Token Lily
Token ID20037
Feature activation-1.10e+00

Pos logit contributions
ze+0.010
zed+0.009
 character+0.009
hop+0.008
 tunes+0.008
Neg logit contributions
arl-0.004
ll-0.004
 upon-0.004
endant-0.003
ety-0.003
Token and
Token ID290
Feature activation+3.97e-01

Pos logit contributions
arl+0.003
 upon+0.003
ll+0.003
ety+0.003
endant+0.003
Neg logit contributions
ze-0.005
zed-0.005
 character-0.004
hop-0.004
 tunes-0.004
Token went
Token ID1816
Feature activation-2.79e-02

Pos logit contributions
arl+0.011
 upon+0.010
endant+0.009
ll+0.009
ety+0.009
Neg logit contributions
ze-0.011
zed-0.010
 character-0.009
hop-0.009
 tunes-0.008
Token on
Token ID319
Feature activation-1.13e+00

Pos logit contributions
ze+0.001
zed+0.001
hop+0.001
 character+0.001
 bottom+0.001
Neg logit contributions
arl-0.001
 upon-0.001
ll-0.001
endant-0.001
ety-0.001
Token many
Token ID867
Feature activation+6.60e-01

Pos logit contributions
ze+0.034
zed+0.031
 character+0.029
hop+0.028
 bottom+0.026
Neg logit contributions
arl-0.026
 upon-0.024
ll-0.024
ety-0.023
endant-0.022
Token more
Token ID517
Feature activation+7.17e-01

Pos logit contributions
arl+0.015
 upon+0.014
ll+0.014
ety+0.013
 Bian+0.013
Neg logit contributions
ze-0.019
zed-0.017
 character-0.016
hop-0.016
lder-0.015
Token saf
Token ID1932
Feature activation+3.42e+00

Pos logit contributions
arl+0.026
endant+0.022
 upon+0.022
ety+0.022
ll+0.021
Neg logit contributions
ze-0.015
zed-0.013
 tunes-0.012
hop-0.011
 character-0.011
Tokenaris
Token ID20066
Feature activation+1.48e+00

Pos logit contributions
arl+0.066
 upon+0.066
ety+0.064
endant+0.060
 Bian+0.060
Neg logit contributions
ze-0.102
hop-0.095
zed-0.094
 character-0.090
seek-0.089
Token together
Token ID1978
Feature activation-1.34e-01

Pos logit contributions
arl+0.041
endant+0.033
ety+0.033
 upon+0.033
 Bian+0.033
Neg logit contributions
ze-0.041
zed-0.036
 character-0.035
hop-0.035
 tunes-0.032
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
ze+0.004
zed+0.004
 character+0.003
hop+0.003
 bottom+0.003
Neg logit contributions
arl-0.003
 upon-0.003
ll-0.003
ety-0.003
endant-0.002
Token<|endoftext|>
Token ID50256
Feature activation+5.22e-02

Pos logit contributions
arl+0.002
ll+0.002
 upon+0.002
endant+0.002
ety+0.002
Neg logit contributions
ze-0.003
zed-0.003
 character-0.003
hop-0.003
 bottom-0.003
TokenOnce
Token ID7454
Feature activation+9.58e-01

Pos logit contributions
arl+0.001
ll+0.001
endant+0.001
ety+0.001
 upon+0.001
Neg logit contributions
ze-0.002
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token "
Token ID366
Feature activation+5.15e-01

Pos logit contributions
ze+0.006
zed+0.005
 character+0.005
hop+0.005
 bottom+0.004
Neg logit contributions
arl-0.005
 upon-0.005
ll-0.004
ety-0.004
 Bian-0.004
TokenYes
Token ID5297
Feature activation+6.93e-01

Pos logit contributions
arl+0.015
 upon+0.014
ll+0.013
endant+0.013
ety+0.013
Neg logit contributions
ze-0.012
zed-0.011
 character-0.010
hop-0.010
 tunes-0.010
Token,
Token ID11
Feature activation-3.69e-01

Pos logit contributions
arl+0.018
ll+0.017
ety+0.017
 upon+0.017
endant+0.016
Neg logit contributions
ze-0.019
zed-0.016
 character-0.015
 bottom-0.015
hop-0.014
Token please
Token ID3387
Feature activation+1.29e+00

Pos logit contributions
ze+0.010
zed+0.008
 character+0.008
hop+0.008
 thing+0.008
Neg logit contributions
arl-0.010
ll-0.009
ety-0.009
endant-0.008
 upon-0.008
Token!
Token ID0
Feature activation+1.23e+00

Pos logit contributions
arl+0.036
 upon+0.034
ety+0.034
endant+0.033
ll+0.033
Neg logit contributions
ze-0.033
zed-0.028
 character-0.025
lder-0.024
hop-0.024
Token Can
Token ID1680
Feature activation+3.39e+00

Pos logit contributions
ll+0.033
arl+0.032
 upon+0.032
endant+0.031
ety+0.030
Neg logit contributions
ze-0.031
zed-0.027
 character-0.026
hop-0.025
lder-0.023
Token you
Token ID345
Feature activation+6.70e-01

Pos logit contributions
arl+0.106
endant+0.088
ucker+0.087
 upon+0.084
 ago+0.084
Neg logit contributions
ze-0.081
zed-0.074
hop-0.068
 middle-0.061
 bottom-0.060
Token help
Token ID1037
Feature activation-4.00e-01

Pos logit contributions
arl+0.019
 upon+0.017
ll+0.017
ety+0.017
endant+0.017
Neg logit contributions
ze-0.018
zed-0.014
 bottom-0.013
 character-0.013
hop-0.013
Token me
Token ID502
Feature activation+2.60e-01

Pos logit contributions
ze+0.012
zed+0.012
 character+0.011
hop+0.010
 bottom+0.010
Neg logit contributions
arl-0.008
 upon-0.008
ll-0.008
ety-0.008
 Bian-0.007
Token cover
Token ID3002
Feature activation+9.00e-02

Pos logit contributions
arl+0.007
 upon+0.006
ll+0.006
ety+0.006
endant+0.006
Neg logit contributions
ze-0.007
zed-0.006
hop-0.006
 character-0.006
 bottom-0.005
Token my
Token ID616
Feature activation-1.57e-01

Pos logit contributions
arl+0.002
ll+0.002
 upon+0.002
ety+0.002
endant+0.002
Neg logit contributions
ze-0.003
zed-0.002
 character-0.002
hop-0.002
 bottom-0.002
Token "
Token ID366
Feature activation+4.41e-01

Pos logit contributions
ze+0.011
zed+0.010
 character+0.010
hop+0.009
 bottom+0.009
Neg logit contributions
arl-0.010
 upon-0.010
ll-0.009
ety-0.009
 Bian-0.008
TokenYes
Token ID5297
Feature activation+4.49e-01

Pos logit contributions
arl+0.014
 upon+0.013
ll+0.012
endant+0.012
 ago+0.012
Neg logit contributions
ze-0.010
zed-0.008
 character-0.008
hop-0.008
 tunes-0.007
Token,
Token ID11
Feature activation-4.54e-01

Pos logit contributions
arl+0.011
ll+0.010
 upon+0.010
ety+0.010
endant+0.010
Neg logit contributions
ze-0.013
zed-0.011
 character-0.011
hop-0.010
 bottom-0.010
Token please
Token ID3387
Feature activation+1.20e+00

Pos logit contributions
ze+0.012
 character+0.010
zed+0.010
 thing+0.009
 lull+0.009
Neg logit contributions
arl-0.012
ll-0.012
ety-0.011
endant-0.010
 upon-0.010
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
arl+0.034
 upon+0.032
ety+0.032
ll+0.032
endant+0.031
Neg logit contributions
ze-0.029
zed-0.025
 character-0.022
lder-0.022
hop-0.022
Token Can
Token ID1680
Feature activation+3.38e+00

Pos logit contributions
ll+0.030
 upon+0.030
arl+0.030
endant+0.028
ety+0.028
Neg logit contributions
ze-0.027
zed-0.023
 character-0.022
hop-0.022
 middle-0.019
Token you
Token ID345
Feature activation+6.26e-01

Pos logit contributions
arl+0.106
endant+0.088
 ago+0.086
 upon+0.086
ucker+0.086
Neg logit contributions
ze-0.078
zed-0.071
hop-0.065
 middle-0.058
 everybody-0.058
Token help
Token ID1037
Feature activation-5.48e-01

Pos logit contributions
arl+0.018
 upon+0.016
ety+0.016
ll+0.016
endant+0.016
Neg logit contributions
ze-0.017
zed-0.013
 bottom-0.012
hop-0.012
 character-0.012
Token me
Token ID502
Feature activation+2.08e-01

Pos logit contributions
ze+0.016
zed+0.015
 character+0.014
hop+0.013
 bottom+0.013
Neg logit contributions
arl-0.012
 upon-0.012
ll-0.012
ety-0.011
 Bian-0.010
Token carry
Token ID3283
Feature activation+7.76e-01

Pos logit contributions
arl+0.006
 upon+0.005
ll+0.005
ety+0.005
endant+0.005
Neg logit contributions
ze-0.006
zed-0.005
hop-0.004
 character-0.004
 bottom-0.004
Token some
Token ID617
Feature activation-1.23e+00

Pos logit contributions
arl+0.022
ll+0.021
 upon+0.020
ety+0.019
endant+0.019
Neg logit contributions
ze-0.019
zed-0.017
hop-0.016
 character-0.015
 bottom-0.014
Token,"
Token ID553
Feature activation-9.01e-01

Pos logit contributions
 upon+0.004
arl+0.004
ll+0.004
ety+0.003
 Tweet+0.003
Neg logit contributions
ze-0.010
zed-0.009
hop-0.009
 character-0.009
 tunes-0.008
Token That
Token ID1320
Feature activation-1.53e+00

Pos logit contributions
ze+0.025
zed+0.022
 character+0.021
hop+0.021
 tunes+0.019
Neg logit contributions
arl-0.023
ll-0.021
 upon-0.021
ety-0.020
endant-0.020
Token's
Token ID338
Feature activation-2.85e-02

Pos logit contributions
ze+0.049
zed+0.046
hop+0.041
seek+0.038
five+0.038
Neg logit contributions
 upon-0.028
ll-0.025
 might-0.025
 Bian-0.022
 planned-0.022
Token smart
Token ID4451
Feature activation-2.45e-01

Pos logit contributions
ze+0.001
zed+0.001
hop+0.001
 character+0.001
 tunes+0.001
Neg logit contributions
arl-0.001
 upon-0.001
ll-0.001
ety-0.000
endant-0.000
Token!
Token ID0
Feature activation+1.24e+00

Pos logit contributions
ze+0.007
zed+0.007
 character+0.006
hop+0.006
 bottom+0.006
Neg logit contributions
arl-0.006
 upon-0.005
ll-0.005
ety-0.005
endant-0.005
Token Can
Token ID1680
Feature activation+3.37e+00

Pos logit contributions
arl+0.027
ll+0.025
 upon+0.025
ety+0.024
endant+0.024
Neg logit contributions
ze-0.036
zed-0.034
 character-0.032
hop-0.031
 bottom-0.030
Token I
Token ID314
Feature activation+5.61e-01

Pos logit contributions
arl+0.101
endant+0.087
ety+0.082
 upon+0.081
 ago+0.080
Neg logit contributions
ze-0.082
zed-0.076
hop-0.073
 character-0.066
 everybody-0.063
Token learn
Token ID2193
Feature activation-4.40e-01

Pos logit contributions
arl+0.015
 upon+0.014
ll+0.014
ety+0.013
endant+0.013
Neg logit contributions
ze-0.015
zed-0.013
 character-0.012
hop-0.012
 bottom-0.011
Token too
Token ID1165
Feature activation-1.58e+00

Pos logit contributions
ze+0.012
zed+0.010
 character+0.010
hop+0.009
 bottom+0.009
Neg logit contributions
arl-0.012
 upon-0.011
ll-0.011
ety-0.010
endant-0.010
Token?"
Token ID1701
Feature activation+3.11e-01

Pos logit contributions
ze+0.052
zed+0.048
 character+0.044
hop+0.043
 bottom+0.041
Neg logit contributions
arl-0.032
 upon-0.029
ll-0.028
ety-0.027
endant-0.025
Token Rory
Token ID37178
Feature activation+3.25e-01

Pos logit contributions
arl+0.008
ll+0.007
 upon+0.007
ety+0.007
endant+0.007
Neg logit contributions
ze-0.009
zed-0.008
 character-0.007
hop-0.007
 bottom-0.007
Token,
Token ID11
Feature activation+4.13e-01

Pos logit contributions
arl+0.005
 upon+0.005
ll+0.004
ety+0.004
endant+0.004
Neg logit contributions
ze-0.007
zed-0.006
 character-0.006
hop-0.006
 tunes-0.006
Token a
Token ID257
Feature activation+9.57e-01

Pos logit contributions
arl+0.009
 upon+0.009
ll+0.008
ety+0.008
endant+0.008
Neg logit contributions
ze-0.013
zed-0.011
 character-0.011
hop-0.010
 bottom-0.010
Token swarm
Token ID30077
Feature activation+6.45e-01

Pos logit contributions
arl+0.024
ll+0.022
 upon+0.022
ety+0.022
endant+0.020
Neg logit contributions
ze-0.027
zed-0.023
 character-0.023
hop-0.021
 bottom-0.021
Token of
Token ID286
Feature activation-5.94e-02

Pos logit contributions
arl+0.013
ll+0.012
ety+0.012
 upon+0.012
endant+0.011
Neg logit contributions
ze-0.022
zed-0.020
 character-0.018
hop-0.018
 bottom-0.017
Token bees
Token ID17002
Feature activation+5.29e-01

Pos logit contributions
ze+0.001
zed+0.001
 character+0.001
hop+0.001
 tunes+0.001
Neg logit contributions
arl-0.002
ll-0.002
 upon-0.002
ety-0.002
endant-0.002
Token flew
Token ID13112
Feature activation+3.36e+00

Pos logit contributions
arl+0.015
ll+0.013
ety+0.013
 upon+0.013
endant+0.013
Neg logit contributions
ze-0.014
zed-0.012
hop-0.011
 character-0.011
 bottom-0.011
Token out
Token ID503
Feature activation+2.24e-01

Pos logit contributions
endant+0.118
ll+0.115
arl+0.113
aj+0.109
 ago+0.108
Neg logit contributions
ze-0.060
 across-0.037
 bottom-0.035
 middle-0.034
zed-0.034
Token of
Token ID286
Feature activation+7.42e-01

Pos logit contributions
endant+0.007
ety+0.007
arl+0.007
ll+0.006
aj+0.006
Neg logit contributions
ze-0.005
zed-0.004
 character-0.004
est-0.004
 bottom-0.003
Token the
Token ID262
Feature activation+5.70e-01

Pos logit contributions
ll+0.023
arl+0.022
endant+0.021
ety+0.021
ancer+0.020
Neg logit contributions
ze-0.016
zed-0.015
 character-0.013
 delight-0.013
 bottom-0.013
Token hive
Token ID35881
Feature activation+1.24e+00

Pos logit contributions
arl+0.017
ll+0.016
 upon+0.015
ety+0.015
 Bian+0.014
Neg logit contributions
ze-0.014
zed-0.012
 bottom-0.011
hop-0.011
 character-0.011
Token and
Token ID290
Feature activation+9.65e-01

Pos logit contributions
arl+0.032
ety+0.029
ll+0.027
endant+0.026
 Bian+0.026
Neg logit contributions
ze-0.035
zed-0.031
hop-0.028
 character-0.028
 bottom-0.027
Token where
Token ID810
Feature activation-3.76e-01

Pos logit contributions
ze+0.007
zed+0.006
 character+0.005
 bottom+0.005
 middle+0.005
Neg logit contributions
arl-0.007
ll-0.006
ety-0.006
 upon-0.006
endant-0.005
Token a
Token ID257
Feature activation+2.52e-01

Pos logit contributions
ze+0.011
zed+0.010
 character+0.010
hop+0.009
 bottom+0.009
Neg logit contributions
arl-0.009
ll-0.008
 upon-0.008
ety-0.008
endant-0.007
Token group
Token ID1448
Feature activation+4.25e-01

Pos logit contributions
arl+0.007
ll+0.006
 upon+0.006
ety+0.006
endant+0.006
Neg logit contributions
ze-0.007
zed-0.006
 character-0.006
hop-0.005
 bottom-0.005
Token of
Token ID286
Feature activation+1.54e-01

Pos logit contributions
arl+0.010
ety+0.009
ll+0.009
endant+0.008
 Bian+0.008
Neg logit contributions
ze-0.013
zed-0.012
 character-0.011
hop-0.011
 middle-0.010
Token bees
Token ID17002
Feature activation+1.12e+00

Pos logit contributions
arl+0.005
ll+0.005
ety+0.005
 upon+0.004
endant+0.004
Neg logit contributions
ze-0.003
zed-0.003
 character-0.003
hop-0.003
 bottom-0.002
Token live
Token ID2107
Feature activation+3.36e+00

Pos logit contributions
arl+0.029
ety+0.026
 upon+0.025
ll+0.025
endant+0.024
Neg logit contributions
ze-0.032
zed-0.027
hop-0.025
 character-0.025
lder-0.024
Token,"
Token ID553
Feature activation-6.38e-01

Pos logit contributions
arl+0.097
ll+0.084
ety+0.082
 Bian+0.081
endant+0.080
Neg logit contributions
ze-0.085
zed-0.065
lder-0.062
hop-0.062
est-0.062
Token replied
Token ID8712
Feature activation-4.58e-01

Pos logit contributions
ze+0.020
 character+0.019
zed+0.018
hop+0.018
 tunes+0.017
Neg logit contributions
arl-0.015
ll-0.012
 upon-0.012
endant-0.011
ety-0.011
Token the
Token ID262
Feature activation-5.92e-01

Pos logit contributions
ze+0.013
zed+0.012
 character+0.011
hop+0.011
 bottom+0.010
Neg logit contributions
arl-0.010
 upon-0.010
ll-0.010
ety-0.009
 Bian-0.009
Token parent
Token ID2560
Feature activation+3.57e-01

Pos logit contributions
ze+0.016
zed+0.014
 character+0.013
hop+0.012
 bottom+0.011
Neg logit contributions
arl-0.016
 upon-0.015
ll-0.015
ety-0.014
 Bian-0.013
Token.
Token ID13
Feature activation+7.93e-01

Pos logit contributions
arl+0.009
ll+0.008
 upon+0.008
ety+0.008
 Bian+0.007
Neg logit contributions
ze-0.010
zed-0.009
 character-0.008
hop-0.008
 bottom-0.008
Token a
Token ID257
Feature activation+8.61e-01

Pos logit contributions
arl+0.028
ll+0.025
 upon+0.024
ety+0.024
endant+0.022
Neg logit contributions
ze-0.031
zed-0.028
 character-0.026
hop-0.025
 bottom-0.024
Token large
Token ID1588
Feature activation+1.53e+00

Pos logit contributions
arl+0.020
 upon+0.019
ll+0.018
ety+0.017
endant+0.016
Neg logit contributions
ze-0.026
zed-0.023
 character-0.021
hop-0.021
 bottom-0.020
Token carrot
Token ID40562
Feature activation+2.92e-01

Pos logit contributions
arl+0.039
ll+0.036
 upon+0.035
ety+0.034
 Bian+0.032
Neg logit contributions
ze-0.043
zed-0.037
 character-0.037
hop-0.034
 bottom-0.033
Token which
Token ID543
Feature activation+7.23e-01

Pos logit contributions
arl+0.007
 upon+0.007
ll+0.007
ety+0.006
 Bian+0.006
Neg logit contributions
ze-0.008
zed-0.007
 character-0.007
hop-0.007
 bottom-0.006
Token was
Token ID373
Feature activation+1.03e+00

Pos logit contributions
arl+0.018
 upon+0.017
ll+0.017
ety+0.016
endant+0.015
Neg logit contributions
ze-0.020
zed-0.018
 character-0.017
hop-0.016
 bottom-0.015
Token guarded
Token ID28178
Feature activation+3.36e+00

Pos logit contributions
arl+0.027
ll+0.024
 upon+0.023
ety+0.023
 Bian+0.022
Neg logit contributions
ze-0.030
zed-0.026
 character-0.025
 bottom-0.023
hop-0.023
Token by
Token ID416
Feature activation-4.50e-01

Pos logit contributions
arl+0.094
endant+0.084
ll+0.081
ucker+0.074
 Bian+0.073
Neg logit contributions
ze-0.087
zed-0.072
 character-0.071
hop-0.070
 bottom-0.067
Token a
Token ID257
Feature activation+8.83e-01

Pos logit contributions
ze+0.012
zed+0.011
 character+0.010
hop+0.009
 bottom+0.009
Neg logit contributions
arl-0.013
ll-0.012
ety-0.011
endant-0.011
 upon-0.011
Token large
Token ID1588
Feature activation+1.18e+00

Pos logit contributions
arl+0.020
 upon+0.019
ll+0.018
ety+0.017
endant+0.017
Neg logit contributions
ze-0.026
zed-0.024
 character-0.022
hop-0.022
 tunes-0.020
Token turtle
Token ID28699
Feature activation+7.04e-01

Pos logit contributions
arl+0.030
ll+0.028
 upon+0.028
ety+0.026
endant+0.025
Neg logit contributions
ze-0.033
zed-0.029
 character-0.028
hop-0.027
 bottom-0.025
Token.
Token ID13
Feature activation+8.58e-01

Pos logit contributions
arl+0.019
ll+0.017
 upon+0.017
ety+0.017
 Bian+0.016
Neg logit contributions
ze-0.019
zed-0.016
 character-0.016
hop-0.015
 bottom-0.014
Token
Token ID6184
Feature activation+2.69e-02

Pos logit contributions
ze+0.006
zed+0.006
 character+0.005
hop+0.005
 bottom+0.005
Neg logit contributions
arl-0.006
 upon-0.006
ll-0.006
ety-0.006
endant-0.005
Token
Token ID95
Feature activation+1.41e-01

Pos logit contributions
arl+0.001
 Bian+0.001
ll+0.001
 ago+0.001
 upon+0.001
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
 bottom-0.001
hop-0.001
Token
Token ID26391
Feature activation-3.95e-01

Pos logit contributions
arl+0.003
 ago+0.003
ll+0.003
 Bian+0.003
 upon+0.003
Neg logit contributions
ze-0.003
 bottom-0.003
 character-0.003
zed-0.003
 hairst-0.003
Token
Token ID129
Feature activation-1.50e+00

Pos logit contributions
ze+0.010
zed+0.009
 character+0.009
 bottom+0.008
hop+0.008
Neg logit contributions
 upon-0.010
ll-0.010
arl-0.009
 ago-0.009
ety-0.009
Token
Token ID241
Feature activation-4.08e-01

Pos logit contributions
ze+0.023
 character+0.023
 eater+0.019
 bottom+0.019
zed+0.018
Neg logit contributions
arl-0.048
 ago-0.045
ll-0.044
 upon-0.043
 Bian-0.043
TokenIsn
Token ID41451
Feature activation+3.35e+00

Pos logit contributions
ze+0.010
zed+0.009
 character+0.008
hop+0.008
 bottom+0.008
Neg logit contributions
arl-0.011
 upon-0.011
ll-0.010
ety-0.010
endant-0.009
Tokenâ
Token ID22940
Feature activation+6.52e-01

Pos logit contributions
arl+0.103
endant+0.092
ety+0.091
 ago+0.083
ll+0.083
Neg logit contributions
ze-0.079
zed-0.076
hop-0.065
 character-0.065
 bottom-0.060
Token
Token ID26391
Feature activation+1.83e-01

Pos logit contributions
arl+0.020
ll+0.019
 upon+0.019
ety+0.018
endant+0.017
Neg logit contributions
ze-0.015
zed-0.013
 character-0.012
hop-0.011
 bottom-0.011
Token
Token ID8151
Feature activation+6.30e-01

Pos logit contributions
arl+0.003
 upon+0.003
ll+0.003
ety+0.003
 Bian+0.003
Neg logit contributions
ze-0.006
zed-0.006
 character-0.005
hop-0.005
 bottom-0.005
Tokent
Token ID83
Feature activation-7.07e-01

Pos logit contributions
arl+0.013
 upon+0.011
ll+0.011
ety+0.011
endant+0.011
Neg logit contributions
ze-0.021
zed-0.019
 character-0.018
hop-0.017
lder-0.016
Token this
Token ID428
Feature activation-1.98e-01

Pos logit contributions
ze+0.019
zed+0.017
 character+0.016
hop+0.016
 bottom+0.015
Neg logit contributions
arl-0.018
 upon-0.017
ll-0.017
ety-0.016
endant-0.015
Token,
Token ID11
Feature activation-1.78e-01

Pos logit contributions
arl+0.008
ll+0.007
endant+0.007
ety+0.007
 upon+0.007
Neg logit contributions
ze-0.008
 character-0.007
zed-0.007
 bottom-0.007
hop-0.007
Token Lily
Token ID20037
Feature activation+4.72e-01

Pos logit contributions
ze+0.005
 character+0.005
zed+0.005
 movies+0.004
 delight+0.004
Neg logit contributions
ety-0.004
endant-0.003
arl-0.003
ll-0.003
 ago-0.003
Token and
Token ID290
Feature activation+8.55e-02

Pos logit contributions
arl+0.012
 upon+0.011
ll+0.011
ety+0.011
endant+0.010
Neg logit contributions
ze-0.013
zed-0.011
 character-0.011
hop-0.010
 bottom-0.010
Token Max
Token ID5436
Feature activation+5.77e-02

Pos logit contributions
arl+0.002
ll+0.002
 upon+0.001
endant+0.001
ety+0.001
Neg logit contributions
ze-0.003
zed-0.003
 character-0.003
hop-0.002
 tunes-0.002
Token!
Token ID0
Feature activation-9.71e-03

Pos logit contributions
arl+0.001
 upon+0.001
ll+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.002
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token Can
Token ID1680
Feature activation+3.35e+00

Pos logit contributions
ze+0.000
zed+0.000
 character+0.000
hop+0.000
lder+0.000
Neg logit contributions
ll-0.000
arl-0.000
endant-0.000
 upon-0.000
ety-0.000
Token you
Token ID345
Feature activation+8.16e-01

Pos logit contributions
arl+0.100
endant+0.087
ucker+0.085
 ago+0.078
ll+0.078
Neg logit contributions
ze-0.081
zed-0.076
hop-0.068
 character-0.063
Give-0.061
Token speak
Token ID2740
Feature activation-8.72e-01

Pos logit contributions
arl+0.026
endant+0.024
ety+0.023
 ago+0.023
ll+0.023
Neg logit contributions
ze-0.020
zed-0.015
hop-0.014
 character-0.014
 bottom-0.013
Token?"
Token ID1701
Feature activation+5.64e-01

Pos logit contributions
ze+0.029
zed+0.027
 character+0.025
hop+0.025
 bottom+0.024
Neg logit contributions
arl-0.017
 upon-0.016
ll-0.015
ety-0.014
endant-0.013
Token Lily
Token ID20037
Feature activation-5.02e-01

Pos logit contributions
arl+0.015
ll+0.013
endant+0.013
 upon+0.012
ety+0.012
Neg logit contributions
ze-0.016
zed-0.015
 character-0.014
hop-0.013
 tunes-0.012
Token said
Token ID531
Feature activation+2.50e-03

Pos logit contributions
ze+0.017
zed+0.016
 character+0.015
hop+0.014
 bottom+0.014
Neg logit contributions
ll-0.009
 upon-0.009
ety-0.008
arl-0.008
 Bian-0.007
Token.
Token ID13
Feature activation-4.74e-01

Pos logit contributions
arl+0.016
 upon+0.015
ll+0.015
ety+0.014
endant+0.013
Neg logit contributions
ze-0.020
zed-0.018
 character-0.017
hop-0.016
 bottom-0.015
Token Suddenly
Token ID24975
Feature activation-5.34e-03

Pos logit contributions
ze+0.012
zed+0.011
 character+0.010
hop+0.010
 bottom+0.009
Neg logit contributions
arl-0.014
ll-0.013
 upon-0.012
ety-0.012
endant-0.011
Token,
Token ID11
Feature activation+3.67e-01

Pos logit contributions
ze+0.000
zed+0.000
 character+0.000
hop+0.000
 bottom+0.000
Neg logit contributions
arl-0.000
ll-0.000
 upon-0.000
ety-0.000
endant-0.000
Token something
Token ID1223
Feature activation+3.11e-01

Pos logit contributions
arl+0.011
ll+0.010
ety+0.010
 upon+0.009
endant+0.009
Neg logit contributions
ze-0.009
zed-0.008
 character-0.007
 bottom-0.007
hop-0.007
Token scary
Token ID14343
Feature activation-7.13e-01

Pos logit contributions
 upon+0.006
arl+0.005
ll+0.005
endant+0.005
ety+0.005
Neg logit contributions
ze-0.010
zed-0.009
hop-0.009
 character-0.009
lder-0.008
Token flew
Token ID13112
Feature activation+3.33e+00

Pos logit contributions
ze+0.023
zed+0.021
hop+0.020
 character+0.019
 tunes+0.018
Neg logit contributions
arl-0.014
 upon-0.013
ll-0.012
ety-0.012
endant-0.011
Token out
Token ID503
Feature activation-1.09e-01

Pos logit contributions
ll+0.094
arl+0.092
endant+0.089
aj+0.085
 Bian+0.085
Neg logit contributions
ze-0.079
 bottom-0.064
zed-0.061
 character-0.058
 middle-0.056
Token of
Token ID286
Feature activation+2.11e-01

Pos logit contributions
ze+0.003
zed+0.002
 bottom+0.002
 character+0.002
est+0.002
Neg logit contributions
arl-0.003
ety-0.003
endant-0.003
ll-0.003
 Bian-0.003
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
ll+0.007
arl+0.007
ety+0.006
 upon+0.006
endant+0.006
Neg logit contributions
ze-0.004
zed-0.004
 bottom-0.004
 character-0.003
hop-0.003
Token wheat
Token ID17135
Feature activation+7.34e-01

Pos logit contributions
arl+0.000
 upon+0.000
ety+0.000
ll+0.000
 ago+0.000
Neg logit contributions
ze-0.000
zed-0.000
 character-0.000
hop-0.000
 tunes-0.000
Token.
Token ID13
Feature activation-3.35e-01

Pos logit contributions
arl+0.019
 upon+0.018
ll+0.018
ety+0.017
endant+0.017
Neg logit contributions
ze-0.019
zed-0.017
 character-0.016
hop-0.016
 bottom-0.015
Token and
Token ID290
Feature activation+7.48e-02

Pos logit contributions
ze+0.014
zed+0.013
 character+0.013
hop+0.012
 bottom+0.012
Neg logit contributions
arl-0.005
 upon-0.005
ll-0.004
ety-0.004
endant-0.004
Token Sam
Token ID3409
Feature activation-1.46e-01

Pos logit contributions
arl+0.002
ll+0.002
 upon+0.002
endant+0.002
ety+0.002
Neg logit contributions
ze-0.002
zed-0.002
hop-0.002
 character-0.002
lder-0.001
Token went
Token ID1816
Feature activation-7.96e-01

Pos logit contributions
ze+0.005
zed+0.004
 character+0.004
hop+0.004
 bottom+0.004
Neg logit contributions
arl-0.003
 upon-0.003
ll-0.003
ety-0.003
endant-0.003
Token on
Token ID319
Feature activation-1.25e+00

Pos logit contributions
ze+0.019
zed+0.018
hop+0.017
 character+0.015
 middle+0.015
Neg logit contributions
arl-0.020
endant-0.020
ll-0.019
 haven-0.019
ety-0.019
Token a
Token ID257
Feature activation+7.18e-01

Pos logit contributions
ze+0.035
zed+0.032
 character+0.029
hop+0.028
 bottom+0.028
Neg logit contributions
arl-0.033
ll-0.030
 upon-0.028
ety-0.028
endant-0.027
Token saf
Token ID1932
Feature activation+3.33e+00

Pos logit contributions
arl+0.018
ll+0.016
 upon+0.016
ety+0.015
endant+0.015
Neg logit contributions
ze-0.020
zed-0.018
 character-0.017
hop-0.016
 bottom-0.016
Tokenari
Token ID2743
Feature activation+1.25e+00

Pos logit contributions
arl+0.058
 Bian+0.052
ety+0.052
 upon+0.050
 haven+0.048
Neg logit contributions
ze-0.101
hop-0.097
 character-0.096
 bottom-0.094
zed-0.094
Token with
Token ID351
Feature activation+3.67e-01

Pos logit contributions
arl+0.036
ll+0.031
 upon+0.030
ety+0.029
endant+0.029
Neg logit contributions
ze-0.032
zed-0.028
hop-0.027
 character-0.027
 bottom-0.025
Token their
Token ID511
Feature activation+1.13e+00

Pos logit contributions
arl+0.010
ll+0.009
 upon+0.009
ety+0.008
endant+0.008
Neg logit contributions
ze-0.010
zed-0.009
 character-0.008
hop-0.008
 bottom-0.008
Token mom
Token ID1995
Feature activation+1.71e-01

Pos logit contributions
arl+0.037
ll+0.034
ety+0.032
 upon+0.032
endant+0.031
Neg logit contributions
ze-0.023
 character-0.020
zed-0.019
 bottom-0.019
 middle-0.018
Token and
Token ID290
Feature activation-5.18e-01

Pos logit contributions
arl+0.004
endant+0.003
ll+0.003
 Bian+0.003
 upon+0.003
Neg logit contributions
ze-0.005
hop-0.004
zed-0.004
 character-0.004
 bottom-0.004
Token with
Token ID351
Feature activation-5.01e-01

Pos logit contributions
ze+0.020
zed+0.019
 character+0.017
hop+0.017
 tunes+0.017
Neg logit contributions
arl-0.009
 upon-0.008
ll-0.008
ety-0.007
 ago-0.006
Token fries
Token ID31757
Feature activation-1.07e+00

Pos logit contributions
ze+0.014
zed+0.013
 character+0.012
hop+0.012
lder+0.011
Neg logit contributions
arl-0.012
 upon-0.011
ll-0.010
ety-0.010
endant-0.010
Token and
Token ID290
Feature activation-2.79e-01

Pos logit contributions
ze+0.038
zed+0.035
 character+0.033
hop+0.032
 tunes+0.032
Neg logit contributions
arl-0.018
 upon-0.017
ll-0.016
ety-0.014
 ago-0.013
Token a
Token ID257
Feature activation+9.11e-02

Pos logit contributions
ze+0.006
zed+0.005
 character+0.004
hop+0.004
 bottom+0.004
Neg logit contributions
arl-0.009
 upon-0.009
ll-0.008
ety-0.008
endant-0.008
Token mil
Token ID1465
Feature activation-1.25e+00

Pos logit contributions
arl+0.002
 upon+0.002
ll+0.002
ety+0.002
endant+0.002
Neg logit contributions
ze-0.003
zed-0.002
hop-0.002
 character-0.002
 tunes-0.002
Tokenksh
Token ID50133
Feature activation-3.84e+00

Pos logit contributions
ze+0.031
zed+0.029
 character+0.026
hop+0.025
 bottom+0.024
Neg logit contributions
arl-0.035
ll-0.032
 upon-0.032
ety-0.031
endant-0.030
Tokenake
Token ID539
Feature activation-1.38e-01

Pos logit contributions
ze+0.094
zed+0.083
 character+0.079
hop+0.075
 tunes+0.070
Neg logit contributions
arl-0.109
ll-0.106
ety-0.095
 upon-0.092
ucker-0.089
Token.
Token ID13
Feature activation-8.02e-01

Pos logit contributions
ze+0.004
zed+0.004
 character+0.004
hop+0.003
 tunes+0.003
Neg logit contributions
arl-0.003
 upon-0.003
ll-0.003
ety-0.003
endant-0.002
Token But
Token ID887
Feature activation-4.37e-01

Pos logit contributions
ze+0.023
zed+0.020
 character+0.020
hop+0.019
 bottom+0.018
Neg logit contributions
ll-0.019
arl-0.019
 upon-0.018
endant-0.017
ety-0.017
Token after
Token ID706
Feature activation-1.72e-01

Pos logit contributions
ze+0.012
zed+0.011
 character+0.010
hop+0.010
 bottom+0.009
Neg logit contributions
arl-0.012
ll-0.011
 upon-0.010
endant-0.010
ety-0.010
Token eating
Token ID6600
Feature activation-7.87e-01

Pos logit contributions
ze+0.004
zed+0.004
 character+0.003
hop+0.003
 breakfast+0.003
Neg logit contributions
arl-0.006
ll-0.005
ety-0.005
endant-0.005
 upon-0.005
Token doing
Token ID1804
Feature activation-1.07e+00

Pos logit contributions
arl+0.002
ll+0.002
 upon+0.002
ety+0.002
endant+0.002
Neg logit contributions
ze-0.002
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token?
Token ID30
Feature activation-5.86e-01

Pos logit contributions
ze+0.038
zed+0.036
hop+0.034
 character+0.033
 tunes+0.032
Neg logit contributions
arl-0.018
 upon-0.016
ll-0.015
ety-0.014
 Bian-0.013
Token Why
Token ID4162
Feature activation-3.75e-01

Pos logit contributions
ze+0.014
zed+0.012
 character+0.012
hop+0.011
lder+0.010
Neg logit contributions
arl-0.017
ll-0.017
 upon-0.016
ety-0.015
endant-0.015
Token are
Token ID389
Feature activation+6.08e-01

Pos logit contributions
ze+0.010
zed+0.009
 character+0.009
 bottom+0.008
hop+0.008
Neg logit contributions
arl-0.010
ll-0.008
ety-0.008
endant-0.008
 upon-0.008
Token you
Token ID345
Feature activation-1.69e-01

Pos logit contributions
arl+0.014
ll+0.012
 upon+0.012
ety+0.012
endant+0.011
Neg logit contributions
ze-0.017
zed-0.016
 character-0.015
hop-0.014
 movies-0.014
Token arguing
Token ID11810
Feature activation-3.68e+00

Pos logit contributions
ze+0.005
zed+0.004
 bottom+0.004
 character+0.004
hop+0.004
Neg logit contributions
arl-0.004
endant-0.004
ll-0.004
ety-0.003
 upon-0.003
Token?"
Token ID1701
Feature activation-4.42e-01

Pos logit contributions
ze+0.135
zed+0.131
hop+0.118
 character+0.114
 tunes+0.109
Neg logit contributions
arl-0.064
 upon-0.062
ll-0.055
ety-0.052
 haven-0.043
Token
Token ID198
Feature activation+3.05e-01

Pos logit contributions
ze+0.010
 character+0.009
zed+0.009
hop+0.009
 bottom+0.009
Neg logit contributions
arl-0.012
ll-0.012
endant-0.011
ety-0.011
 upon-0.011
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
arl+0.007
ll+0.006
 upon+0.006
ety+0.006
endant+0.006
Neg logit contributions
ze-0.009
zed-0.008
 character-0.008
hop-0.008
 bottom-0.008
TokenTom
Token ID13787
Feature activation-4.71e-01

Pos logit contributions
arl+0.025
 upon+0.023
ll+0.023
ety+0.022
endant+0.021
Neg logit contributions
ze-0.030
zed-0.027
 character-0.025
hop-0.024
 bottom-0.023
Token and
Token ID290
Feature activation-1.22e-01

Pos logit contributions
ze+0.016
zed+0.014
 character+0.014
hop+0.013
lder+0.013
Neg logit contributions
arl-0.009
 upon-0.008
ll-0.008
ety-0.007
endant-0.007
Token as
Token ID355
Feature activation+2.36e-01

Pos logit contributions
arl+0.007
 upon+0.006
ll+0.006
ety+0.006
endant+0.006
Neg logit contributions
ze-0.009
zed-0.008
 character-0.008
hop-0.008
 tunes-0.007
Token they
Token ID484
Feature activation-4.64e-01

Pos logit contributions
arl+0.005
ll+0.005
 upon+0.005
ety+0.005
endant+0.005
Neg logit contributions
ze-0.007
zed-0.007
 character-0.006
hop-0.006
 bottom-0.006
Token drank
Token ID24070
Feature activation+2.02e-01

Pos logit contributions
ze+0.012
zed+0.010
 character+0.009
hop+0.009
lder+0.008
Neg logit contributions
arl-0.013
 upon-0.012
ll-0.012
ety-0.011
endant-0.011
Token the
Token ID262
Feature activation-7.56e-01

Pos logit contributions
arl+0.005
 upon+0.004
ll+0.004
ety+0.004
endant+0.004
Neg logit contributions
ze-0.006
zed-0.006
 character-0.005
hop-0.005
lder-0.005
Token mil
Token ID1465
Feature activation-1.35e+00

Pos logit contributions
ze+0.019
zed+0.017
 character+0.015
hop+0.015
 bottom+0.014
Neg logit contributions
arl-0.021
 upon-0.020
ll-0.020
ety-0.019
endant-0.018
Tokenksh
Token ID50133
Feature activation-3.67e+00

Pos logit contributions
ze+0.033
zed+0.030
 character+0.028
hop+0.027
 bottom+0.025
Neg logit contributions
arl-0.038
ll-0.035
 upon-0.035
ety-0.034
endant-0.033
Tokenakes
Token ID1124
Feature activation-9.54e-01

Pos logit contributions
ze+0.120
zed+0.111
 character+0.108
hop+0.102
 bottom+0.099
Neg logit contributions
ll-0.071
arl-0.071
ety-0.062
 upon-0.056
ucker-0.054
Token.
Token ID13
Feature activation-5.11e-01

Pos logit contributions
ze+0.029
zed+0.026
 character+0.024
hop+0.023
 bottom+0.022
Neg logit contributions
arl-0.022
 upon-0.020
ll-0.020
ety-0.019
endant-0.018
Token
Token ID220
Feature activation-2.58e-01

Pos logit contributions
ze+0.013
zed+0.012
 character+0.012
hop+0.012
 Everyone+0.011
Neg logit contributions
arl-0.013
ll-0.012
endant-0.012
 upon-0.012
ety-0.011
Token
Token ID198
Feature activation+6.10e-03

Pos logit contributions
ze+0.008
zed+0.007
hop+0.006
 character+0.006
 bottom+0.006
Neg logit contributions
arl-0.006
endant-0.005
ll-0.005
ety-0.005
 upon-0.005
Token
Token ID198
Feature activation+4.98e-01

Pos logit contributions
arl+0.000
endant+0.000
 upon+0.000
ll+0.000
ety+0.000
Neg logit contributions
ze-0.000
zed-0.000
hop-0.000
 character-0.000
 bottom-0.000
Token don
Token ID836
Feature activation+1.18e+00

Pos logit contributions
ze+0.009
zed+0.009
 character+0.008
hop+0.008
 bottom+0.008
Neg logit contributions
arl-0.005
ll-0.004
 upon-0.004
ety-0.004
endant-0.004
Token't
Token ID470
Feature activation-1.81e-01

Pos logit contributions
arl+0.016
 upon+0.014
ll+0.014
 Bian+0.013
ety+0.011
Neg logit contributions
ze-0.045
zed-0.041
hop-0.039
 character-0.039
 tunes-0.038
Token you
Token ID345
Feature activation-1.82e-01

Pos logit contributions
ze+0.008
zed+0.008
 character+0.007
hop+0.007
five+0.007
Neg logit contributions
ll-0.001
 upon-0.001
arl-0.001
ety-0.001
 Bian-0.001
Token enter
Token ID3802
Feature activation-2.66e-01

Pos logit contributions
ze+0.005
zed+0.004
 character+0.004
hop+0.004
 bottom+0.003
Neg logit contributions
arl-0.005
 upon-0.005
ll-0.005
ety-0.004
endant-0.004
Token a
Token ID257
Feature activation-7.66e-01

Pos logit contributions
ze+0.007
zed+0.006
 character+0.006
hop+0.006
 bottom+0.005
Neg logit contributions
arl-0.007
ll-0.007
ety-0.006
 upon-0.006
endant-0.006
Token panc
Token ID22154
Feature activation-3.62e+00

Pos logit contributions
ze+0.023
zed+0.021
hop+0.019
 character+0.019
 tunes+0.018
Neg logit contributions
 upon-0.017
arl-0.016
ll-0.016
ety-0.014
 Bian-0.013
Tokenake
Token ID539
Feature activation-1.45e+00

Pos logit contributions
ze+0.075
 character+0.067
zed+0.066
 bottom+0.060
hop+0.057
Neg logit contributions
arl-0.109
ll-0.109
ety-0.099
 upon-0.097
uffy-0.091
Token cooking
Token ID10801
Feature activation-1.52e+00

Pos logit contributions
ze+0.049
zed+0.046
 character+0.042
hop+0.041
 tunes+0.040
Neg logit contributions
arl-0.027
 upon-0.026
ll-0.025
ety-0.022
 Bian-0.020
Token contest
Token ID8414
Feature activation+3.51e-02

Pos logit contributions
ze+0.054
zed+0.050
 character+0.046
hop+0.046
 tunes+0.044
Neg logit contributions
arl-0.025
 upon-0.024
ll-0.022
 there-0.018
ety-0.017
Token?"
Token ID1701
Feature activation-9.72e-04

Pos logit contributions
arl+0.001
 upon+0.001
ll+0.001
ety+0.001
 Bian+0.001
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token Tim
Token ID5045
Feature activation-5.33e-01

Pos logit contributions
ze+0.000
zed+0.000
 character+0.000
hop+0.000
 bottom+0.000
Neg logit contributions
arl-0.000
ll-0.000
 upon-0.000
endant-0.000
ety-0.000
Token not
Token ID407
Feature activation-7.41e-01

Pos logit contributions
arl+0.000
 upon+0.000
ll+0.000
ety+0.000
endant+0.000
Neg logit contributions
ze-0.000
zed-0.000
 character-0.000
hop-0.000
lder-0.000
Token to
Token ID284
Feature activation-1.15e+00

Pos logit contributions
ze+0.022
zed+0.020
 character+0.019
hop+0.018
 bottom+0.017
Neg logit contributions
arl-0.017
 upon-0.016
ll-0.016
ety-0.015
endant-0.014
Token stop
Token ID2245
Feature activation-1.09e+00

Pos logit contributions
ze+0.031
zed+0.029
 character+0.027
hop+0.026
 tunes+0.024
Neg logit contributions
arl-0.028
ll-0.026
 upon-0.026
ety-0.025
 Bian-0.023
Token stirring
Token ID26547
Feature activation-1.09e-01

Pos logit contributions
ze+0.030
zed+0.028
 character+0.026
hop+0.025
lder+0.024
Neg logit contributions
arl-0.026
 upon-0.025
ll-0.024
ety-0.023
 Bian-0.022
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
ze+0.003
zed+0.003
 character+0.003
hop+0.003
lder+0.003
Neg logit contributions
arl-0.002
 upon-0.002
ll-0.002
ety-0.002
 Bian-0.002
Token panc
Token ID22154
Feature activation-3.60e+00

Pos logit contributions
ze+0.035
zed+0.034
hop+0.032
 character+0.031
seek+0.031
Neg logit contributions
 upon-0.029
arl-0.028
ll-0.027
oms-0.023
 there-0.023
Tokenake
Token ID539
Feature activation-1.54e+00

Pos logit contributions
ze+0.078
 character+0.076
zed+0.068
 bottom+0.066
hop+0.063
Neg logit contributions
ll-0.104
arl-0.100
ety-0.091
ucker-0.088
 upon-0.085
Token batter
Token ID4984
Feature activation-1.30e+00

Pos logit contributions
ze+0.048
zed+0.046
hop+0.042
 character+0.042
lder+0.041
Neg logit contributions
arl-0.028
 upon-0.026
ll-0.025
ety-0.023
 ago-0.021
Token.
Token ID13
Feature activation-5.78e-01

Pos logit contributions
ze+0.037
zed+0.033
 character+0.030
hop+0.029
 bottom+0.029
Neg logit contributions
arl-0.033
 upon-0.030
ll-0.030
ety-0.029
endant-0.028
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
ze+0.018
zed+0.016
 character+0.015
hop+0.015
 bottom+0.014
Neg logit contributions
arl-0.013
ll-0.011
 upon-0.011
ety-0.011
endant-0.011
Token
Token ID198
Feature activation-7.73e-03

Pos logit contributions
ze+0.035
zed+0.033
 character+0.031
hop+0.030
 bottom+0.028
Neg logit contributions
arl-0.025
endant-0.022
ll-0.022
ety-0.022
 upon-0.022
Token mom
Token ID1995
Feature activation-6.18e-01

Pos logit contributions
ze+0.001
zed+0.001
 character+0.001
hop+0.001
 tunes+0.001
Neg logit contributions
arl-0.001
 upon-0.001
ll-0.001
ety-0.001
endant-0.001
Tokenmy
Token ID1820
Feature activation-9.15e-01

Pos logit contributions
ze+0.025
zed+0.024
 character+0.022
hop+0.022
 bottom+0.021
Neg logit contributions
 upon-0.008
arl-0.007
ety-0.007
ll-0.007
endant-0.005
Token her
Token ID607
Feature activation+2.31e-01

Pos logit contributions
ze+0.032
zed+0.029
 character+0.028
hop+0.027
 bottom+0.026
Neg logit contributions
 upon-0.017
arl-0.015
ll-0.015
ety-0.015
endant-0.013
Token drawing
Token ID8263
Feature activation-2.00e+00

Pos logit contributions
arl+0.006
ll+0.006
 upon+0.006
ety+0.005
 Bian+0.005
Neg logit contributions
ze-0.006
zed-0.005
 character-0.005
hop-0.005
 bottom-0.005
Token.
Token ID13
Feature activation-1.82e+00

Pos logit contributions
ze+0.067
zed+0.063
hop+0.057
 character+0.057
lder+0.056
Neg logit contributions
arl-0.036
 upon-0.036
ll-0.033
ety-0.032
endant-0.031
Token "
Token ID366
Feature activation-3.56e+00

Pos logit contributions
ze+0.061
zed+0.056
 character+0.052
hop+0.051
 tunes+0.048
Neg logit contributions
arl-0.037
 upon-0.034
ety-0.031
ll-0.031
 Bian-0.029
TokenWow
Token ID22017
Feature activation-4.72e-01

Pos logit contributions
zed+0.112
ze+0.110
hop+0.092
 character+0.092
 aids+0.085
Neg logit contributions
ety-0.075
 upon-0.073
arl-0.071
 haven-0.059
ll-0.058
Token,
Token ID11
Feature activation-2.06e+00

Pos logit contributions
ze+0.017
zed+0.015
hop+0.015
 character+0.015
 tunes+0.014
Neg logit contributions
arl-0.008
 upon-0.008
ll-0.008
ety-0.007
endant-0.007
Token Lily
Token ID20037
Feature activation-9.83e-02

Pos logit contributions
ze+0.066
zed+0.062
hop+0.059
 tunes+0.054
 character+0.053
Neg logit contributions
arl-0.042
 upon-0.040
endant-0.035
 Bian-0.035
ll-0.035
Token!
Token ID0
Feature activation-8.35e-01

Pos logit contributions
ze+0.003
zed+0.003
 character+0.002
hop+0.002
 bottom+0.002
Neg logit contributions
arl-0.002
 upon-0.002
ll-0.002
ety-0.002
endant-0.002
Token I
Token ID314
Feature activation-1.02e+00

Pos logit contributions
ze+0.025
zed+0.023
 character+0.021
hop+0.020
 tunes+0.019
Neg logit contributions
arl-0.020
 upon-0.018
ll-0.018
ety-0.017
endant-0.016
Token
Token ID198
Feature activation+4.86e-01

Pos logit contributions
ze+0.017
zed+0.016
hop+0.014
 character+0.014
 bottom+0.014
Neg logit contributions
arl-0.014
endant-0.013
ety-0.013
ll-0.012
 upon-0.012
Token"
Token ID1
Feature activation-1.45e+00

Pos logit contributions
arl+0.013
 upon+0.012
ll+0.012
ety+0.011
 Bian+0.011
Neg logit contributions
ze-0.013
zed-0.012
 character-0.011
hop-0.011
 tunes-0.010
TokenWhy
Token ID5195
Feature activation-3.69e-01

Pos logit contributions
ze+0.030
zed+0.026
 character+0.023
hop+0.022
 bottom+0.020
Neg logit contributions
arl-0.048
 upon-0.045
ll-0.044
ety-0.043
 Bian-0.041
Token are
Token ID389
Feature activation+9.58e-01

Pos logit contributions
ze+0.012
zed+0.011
 character+0.010
hop+0.010
 bottom+0.009
Neg logit contributions
arl-0.008
 upon-0.007
ll-0.007
ety-0.007
endant-0.006
Token you
Token ID345
Feature activation-3.94e-02

Pos logit contributions
arl+0.022
 upon+0.019
endant+0.019
ll+0.019
ety+0.019
Neg logit contributions
ze-0.027
zed-0.024
 character-0.024
hop-0.022
 movies-0.022
Token arguing
Token ID11810
Feature activation-3.54e+00

Pos logit contributions
ze+0.001
zed+0.001
 character+0.001
 bottom+0.001
hop+0.001
Neg logit contributions
arl-0.001
endant-0.001
ety-0.001
ll-0.001
 upon-0.001
Token?"
Token ID1701
Feature activation-1.12e+00

Pos logit contributions
ze+0.120
zed+0.120
hop+0.105
 character+0.103
 tunes+0.096
Neg logit contributions
 upon-0.068
arl-0.067
ll-0.061
ety-0.058
 haven-0.049
Token Mom
Token ID11254
Feature activation-3.90e-01

Pos logit contributions
ze+0.028
zed+0.025
 character+0.024
hop+0.022
 bottom+0.021
Neg logit contributions
arl-0.031
ll-0.029
 upon-0.028
ety-0.028
endant-0.027
Token asks
Token ID7893
Feature activation+8.93e-01

Pos logit contributions
ze+0.011
zed+0.009
 character+0.009
hop+0.009
lder+0.008
Neg logit contributions
arl-0.010
 upon-0.008
endant-0.008
ll-0.008
ety-0.008
Token.
Token ID13
Feature activation+7.42e-02

Pos logit contributions
arl+0.027
ll+0.026
endant+0.025
ety+0.024
 upon+0.023
Neg logit contributions
ze-0.020
 character-0.016
zed-0.016
hop-0.016
 bottom-0.015
Token
Token ID198
Feature activation+1.83e-01

Pos logit contributions
arl+0.002
ll+0.002
endant+0.002
ety+0.002
 upon+0.002
Neg logit contributions
ze-0.002
zed-0.002
 character-0.002
hop-0.002
 bottom-0.002
Token Mia
Token ID32189
Feature activation+9.10e-02

Pos logit contributions
ze+0.013
zed+0.012
 character+0.011
hop+0.011
 bottom+0.010
Neg logit contributions
arl-0.013
ll-0.013
ety-0.012
endant-0.012
 upon-0.012
Token,
Token ID11
Feature activation-5.28e-01

Pos logit contributions
arl+0.002
 upon+0.002
ll+0.002
ety+0.002
endant+0.002
Neg logit contributions
ze-0.002
zed-0.002
 character-0.002
hop-0.002
lder-0.002
Token why
Token ID1521
Feature activation-3.66e-01

Pos logit contributions
ze+0.015
zed+0.014
 character+0.013
hop+0.012
 bottom+0.012
Neg logit contributions
arl-0.013
ll-0.012
ety-0.012
 upon-0.012
endant-0.011
Token are
Token ID389
Feature activation+8.82e-01

Pos logit contributions
ze+0.012
zed+0.011
 character+0.010
hop+0.010
 bottom+0.010
Neg logit contributions
arl-0.008
 upon-0.007
ll-0.007
ety-0.006
endant-0.006
Token you
Token ID345
Feature activation-7.06e-02

Pos logit contributions
arl+0.020
endant+0.018
 upon+0.018
ety+0.018
ll+0.017
Neg logit contributions
ze-0.026
zed-0.024
 character-0.022
hop-0.021
lder-0.021
Token arguing
Token ID11810
Feature activation-3.49e+00

Pos logit contributions
ze+0.002
zed+0.002
 character+0.002
 bottom+0.002
hop+0.002
Neg logit contributions
arl-0.002
endant-0.002
ety-0.002
ll-0.002
 upon-0.001
Token?"
Token ID1701
Feature activation-9.70e-01

Pos logit contributions
ze+0.134
zed+0.131
hop+0.119
 character+0.118
lder+0.111
Neg logit contributions
arl-0.053
 upon-0.051
ll-0.045
ety-0.043
 haven-0.032
Token Tom
Token ID4186
Feature activation-3.85e-01

Pos logit contributions
ze+0.029
zed+0.026
 character+0.025
hop+0.024
 bottom+0.024
Neg logit contributions
arl-0.023
ll-0.022
ety-0.020
endant-0.020
 upon-0.020
Token's
Token ID338
Feature activation+2.84e-01

Pos logit contributions
ze+0.011
zed+0.009
 character+0.009
hop+0.009
 bottom+0.008
Neg logit contributions
arl-0.010
 upon-0.009
ll-0.009
endant-0.009
ety-0.008
Token mom
Token ID1995
Feature activation-1.82e-01

Pos logit contributions
arl+0.007
 upon+0.006
ll+0.006
ety+0.006
endant+0.005
Neg logit contributions
ze-0.008
zed-0.008
 character-0.007
hop-0.007
lder-0.007
Token asks
Token ID7893
Feature activation+1.02e+00

Pos logit contributions
ze+0.005
zed+0.004
 character+0.004
hop+0.004
 bottom+0.004
Neg logit contributions
arl-0.005
 upon-0.004
ll-0.004
endant-0.004
ety-0.004
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
ze+0.022
zed+0.019
 character+0.018
hop+0.017
 tunes+0.016
Neg logit contributions
arl-0.020
 upon-0.019
ll-0.019
ety-0.018
 Bian-0.017
Token You
Token ID921
Feature activation-1.00e-01

Pos logit contributions
ze+0.029
zed+0.024
 character+0.024
hop+0.022
lder+0.021
Neg logit contributions
ll-0.031
 upon-0.030
arl-0.029
ety-0.028
endant-0.027
Token should
Token ID815
Feature activation-1.03e-02

Pos logit contributions
ze+0.003
 character+0.002
zed+0.002
hop+0.002
lder+0.002
Neg logit contributions
arl-0.003
 upon-0.002
ety-0.002
ll-0.002
endant-0.002
Token never
Token ID1239
Feature activation-6.51e-01

Pos logit contributions
ze+0.000
zed+0.000
 character+0.000
hop+0.000
 bottom+0.000
Neg logit contributions
 upon-0.000
arl-0.000
ll-0.000
ety-0.000
endant-0.000
Token press
Token ID1803
Feature activation-1.45e+00

Pos logit contributions
ze+0.018
zed+0.015
 character+0.014
hop+0.014
 bottom+0.013
Neg logit contributions
arl-0.017
 upon-0.016
ll-0.016
ety-0.015
endant-0.014
Token it
Token ID340
Feature activation-3.49e+00

Pos logit contributions
ze+0.039
zed+0.035
 character+0.032
hop+0.031
 bottom+0.030
Neg logit contributions
arl-0.038
 upon-0.035
ll-0.035
ety-0.034
endant-0.032
Token unless
Token ID4556
Feature activation-4.82e-01

Pos logit contributions
ze+0.108
zed+0.101
 character+0.094
hop+0.091
lder+0.087
Neg logit contributions
arl-0.076
 upon-0.068
ll-0.067
ety-0.063
 Bian-0.056
Token you
Token ID345
Feature activation-2.84e-01

Pos logit contributions
ze+0.014
zed+0.012
 character+0.012
hop+0.012
 bottom+0.011
Neg logit contributions
arl-0.012
ll-0.011
endant-0.010
ety-0.010
 upon-0.010
Token are
Token ID389
Feature activation-1.57e+00

Pos logit contributions
ze+0.008
zed+0.007
hop+0.006
 character+0.006
 bottom+0.006
Neg logit contributions
arl-0.007
 upon-0.006
endant-0.006
ll-0.006
ety-0.006
Token in
Token ID287
Feature activation+5.05e-02

Pos logit contributions
ze+0.047
zed+0.041
 character+0.039
hop+0.037
 tunes+0.034
Neg logit contributions
arl-0.038
 upon-0.035
ll-0.034
ety-0.034
endant-0.032
Token danger
Token ID3514
Feature activation-1.08e+00

Pos logit contributions
arl+0.001
ll+0.001
 upon+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.002
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token crown
Token ID12389
Feature activation-4.59e-01

Pos logit contributions
arl+0.015
 upon+0.014
ll+0.014
ety+0.013
endant+0.013
Neg logit contributions
ze-0.018
zed-0.016
 character-0.015
hop-0.015
 bottom-0.014
Token and
Token ID290
Feature activation-3.03e-01

Pos logit contributions
ze+0.013
zed+0.011
 character+0.011
hop+0.010
 bottom+0.010
Neg logit contributions
arl-0.012
 upon-0.011
ll-0.010
ety-0.010
 Bian-0.010
Token playing
Token ID2712
Feature activation-5.20e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.007
hop+0.007
 bottom+0.007
Neg logit contributions
arl-0.007
 upon-0.007
ll-0.007
ety-0.007
endant-0.006
Token in
Token ID287
Feature activation-1.65e-01

Pos logit contributions
ze+0.015
zed+0.015
 character+0.013
hop+0.013
lder+0.013
Neg logit contributions
 upon-0.010
arl-0.010
ll-0.009
 Tweet-0.008
 Bian-0.008
Token the
Token ID262
Feature activation-1.99e-01

Pos logit contributions
ze+0.004
zed+0.004
 character+0.004
hop+0.004
 bottom+0.003
Neg logit contributions
arl-0.004
ll-0.004
 upon-0.004
ety-0.004
endant-0.004
Token freshly
Token ID29026
Feature activation-3.48e+00

Pos logit contributions
ze+0.006
zed+0.006
 character+0.006
hop+0.005
lder+0.005
Neg logit contributions
 upon-0.004
arl-0.004
ety-0.004
ll-0.004
endant-0.004
Token cut
Token ID2005
Feature activation-3.96e-01

Pos logit contributions
ze+0.100
 character+0.091
zed+0.086
hop+0.085
 bottom+0.083
Neg logit contributions
ll-0.068
 upon-0.068
arl-0.066
ety-0.065
endant-0.061
Token grass
Token ID8701
Feature activation+7.68e-01

Pos logit contributions
ze+0.011
zed+0.010
 character+0.009
hop+0.009
 bottom+0.008
Neg logit contributions
arl-0.010
 upon-0.010
ll-0.010
ety-0.009
endant-0.009
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
arl+0.019
ll+0.017
 upon+0.017
 Bian+0.016
ety+0.016
Neg logit contributions
ze-0.022
zed-0.019
 character-0.018
hop-0.018
 bottom-0.017
Token It
Token ID632
Feature activation+2.95e-01

Pos logit contributions
ze+0.053
zed+0.048
 character+0.046
hop+0.045
 tunes+0.042
Neg logit contributions
arl-0.030
 upon-0.028
ll-0.027
ety-0.026
endant-0.023
Token looked
Token ID3114
Feature activation-7.53e-01

Pos logit contributions
ll+0.007
 upon+0.007
arl+0.007
ety+0.007
endant+0.006
Neg logit contributions
ze-0.008
zed-0.007
 character-0.007
hop-0.007
lder-0.006
Token day
Token ID1110
Feature activation+6.82e-02

Pos logit contributions
ze+0.001
zed+0.001
 character+0.001
hop+0.001
 bottom+0.001
Neg logit contributions
arl-0.001
 upon-0.001
ll-0.001
ety-0.001
endant-0.001
Token drinking
Token ID7722
Feature activation-3.40e-01

Pos logit contributions
arl+0.002
 upon+0.002
ety+0.002
ll+0.002
endant+0.001
Neg logit contributions
ze-0.002
zed-0.002
hop-0.002
 character-0.002
 tunes-0.001
Token y
Token ID331
Feature activation-1.31e+00

Pos logit contributions
ze+0.011
zed+0.010
 character+0.009
hop+0.009
lder+0.009
Neg logit contributions
arl-0.007
 upon-0.007
ll-0.006
ety-0.006
endant-0.005
Tokenummy
Token ID13513
Feature activation-1.32e+00

Pos logit contributions
ze+0.041
zed+0.037
 character+0.035
hop+0.034
 bottom+0.032
Neg logit contributions
arl-0.029
ll-0.027
 upon-0.026
ety-0.025
endant-0.024
Token mil
Token ID1465
Feature activation-1.11e+00

Pos logit contributions
ze+0.036
zed+0.034
 character+0.030
hop+0.030
lder+0.028
Neg logit contributions
ll-0.029
 upon-0.029
arl-0.029
ety-0.027
endant-0.026
Tokenksh
Token ID50133
Feature activation-3.48e+00

Pos logit contributions
ze+0.027
zed+0.025
 character+0.023
hop+0.022
 bottom+0.021
Neg logit contributions
arl-0.031
ll-0.029
 upon-0.028
ety-0.028
endant-0.026
Tokenakes
Token ID1124
Feature activation-6.80e-02

Pos logit contributions
ze+0.108
zed+0.097
 character+0.095
hop+0.089
 bottom+0.087
Neg logit contributions
ll-0.077
arl-0.074
ety-0.065
 upon-0.062
uffy-0.058
Token in
Token ID287
Feature activation-3.90e-01

Pos logit contributions
ze+0.002
zed+0.002
 character+0.002
hop+0.002
 bottom+0.002
Neg logit contributions
arl-0.002
 upon-0.001
ll-0.001
ety-0.001
endant-0.001
Token the
Token ID262
Feature activation-3.88e-01

Pos logit contributions
ze+0.010
zed+0.009
 character+0.009
hop+0.008
 bottom+0.008
Neg logit contributions
arl-0.011
ll-0.010
 upon-0.010
ety-0.009
endant-0.009
Token sun
Token ID4252
Feature activation+3.85e-01

Pos logit contributions
ze+0.011
zed+0.010
 character+0.009
hop+0.009
 tunes+0.008
Neg logit contributions
arl-0.010
 upon-0.009
ll-0.009
ety-0.008
endant-0.008
Token.
Token ID13
Feature activation-2.03e-01

Pos logit contributions
arl+0.009
 upon+0.008
ll+0.008
ety+0.008
endant+0.008
Neg logit contributions
ze-0.011
zed-0.010
 character-0.009
hop-0.009
 bottom-0.009
Token to
Token ID284
Feature activation-1.32e+00

Pos logit contributions
ze+0.018
zed+0.016
hop+0.015
 bottom+0.014
 character+0.014
Neg logit contributions
arl-0.021
ll-0.019
endant-0.019
 Bian-0.019
ety-0.019
Token show
Token ID905
Feature activation-8.55e-01

Pos logit contributions
ze+0.026
zed+0.023
 bottom+0.021
 character+0.020
hop+0.020
Neg logit contributions
arl-0.043
ll-0.042
endant-0.039
ety-0.038
omed-0.038
Token her
Token ID607
Feature activation-1.14e-01

Pos logit contributions
ze+0.027
zed+0.024
 character+0.023
hop+0.023
lder+0.022
Neg logit contributions
arl-0.018
 upon-0.016
ety-0.015
 Tweet-0.014
endant-0.014
Token mom
Token ID1995
Feature activation-8.35e-02

Pos logit contributions
ze+0.003
zed+0.003
 character+0.003
hop+0.003
lder+0.002
Neg logit contributions
arl-0.003
 upon-0.003
ll-0.003
ety-0.003
endant-0.002
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
ze+0.002
zed+0.002
 character+0.002
hop+0.002
 bottom+0.002
Neg logit contributions
arl-0.002
ll-0.002
 Bian-0.002
 upon-0.002
endant-0.002
Token "
Token ID366
Feature activation-3.46e+00

Pos logit contributions
ze+0.047
zed+0.043
 character+0.041
hop+0.040
 tunes+0.038
Neg logit contributions
arl-0.019
 upon-0.017
ll-0.016
ety-0.016
endant-0.014
TokenLook
Token ID8567
Feature activation-7.95e-01

Pos logit contributions
zed+0.107
ze+0.103
 character+0.087
hop+0.083
 tunes+0.078
Neg logit contributions
ety-0.078
arl-0.075
 upon-0.069
 Tweet-0.061
 Bian-0.059
Token,
Token ID11
Feature activation-2.13e+00

Pos logit contributions
ze+0.031
zed+0.030
 tunes+0.028
hop+0.028
 character+0.028
Neg logit contributions
 upon-0.012
ety-0.011
ll-0.010
arl-0.010
uffy-0.009
Token Mom
Token ID11254
Feature activation-1.28e-01

Pos logit contributions
ze+0.074
zed+0.072
hop+0.067
 tunes+0.065
 basis+0.061
Neg logit contributions
 upon-0.039
arl-0.037
 Tweet-0.035
ety-0.034
endant-0.034
Token!
Token ID0
Feature activation-1.16e+00

Pos logit contributions
ze+0.004
zed+0.003
 character+0.003
hop+0.003
 bottom+0.003
Neg logit contributions
arl-0.003
 upon-0.003
ll-0.003
ety-0.002
endant-0.002
Token I
Token ID314
Feature activation-1.07e+00

Pos logit contributions
ze+0.033
zed+0.030
 character+0.028
hop+0.027
 bottom+0.026
Neg logit contributions
arl-0.027
ll-0.026
 upon-0.026
ety-0.024
endant-0.023
Token to
Token ID284
Feature activation-1.11e+00

Pos logit contributions
arl+0.018
 upon+0.016
ll+0.016
 Bian+0.015
ety+0.015
Neg logit contributions
ze-0.022
zed-0.020
hop-0.019
 character-0.019
 bottom-0.018
Token show
Token ID905
Feature activation-1.21e+00

Pos logit contributions
ze+0.026
zed+0.023
 character+0.021
hop+0.020
 bottom+0.020
Neg logit contributions
arl-0.033
ll-0.032
 upon-0.031
ety-0.030
endant-0.029
Token his
Token ID465
Feature activation+4.53e-02

Pos logit contributions
ze+0.041
zed+0.036
lder+0.036
hop+0.035
 character+0.034
Neg logit contributions
arl-0.020
 upon-0.019
 Kim-0.018
 Tweet-0.016
ety-0.016
Token mom
Token ID1995
Feature activation-8.20e-02

Pos logit contributions
arl+0.001
 upon+0.001
ll+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
ze+0.002
zed+0.002
 character+0.002
hop+0.002
 bottom+0.002
Neg logit contributions
arl-0.002
ll-0.002
 upon-0.002
ety-0.002
endant-0.002
Token "
Token ID366
Feature activation-3.44e+00

Pos logit contributions
ze+0.056
zed+0.050
 character+0.048
hop+0.046
 tunes+0.045
Neg logit contributions
arl-0.022
 upon-0.020
ll-0.018
ety-0.017
endant-0.015
TokenLook
Token ID8567
Feature activation-7.31e-01

Pos logit contributions
zed+0.103
ze+0.100
 character+0.087
 vows+0.083
hop+0.081
Neg logit contributions
ety-0.076
arl-0.070
 upon-0.067
 late-0.060
 haven-0.056
Token,
Token ID11
Feature activation-2.32e+00

Pos logit contributions
ze+0.029
zed+0.028
 tunes+0.027
hop+0.027
 basis+0.026
Neg logit contributions
 upon-0.009
ll-0.008
ety-0.008
arl-0.008
uffy-0.007
Token Mom
Token ID11254
Feature activation-1.96e-01

Pos logit contributions
ze+0.083
zed+0.081
hop+0.075
 tunes+0.072
 basis+0.070
Neg logit contributions
 upon-0.039
arl-0.036
 Kim-0.034
 Tweet-0.033
 Tim-0.033
Token,"
Token ID553
Feature activation-1.33e+00

Pos logit contributions
ze+0.006
zed+0.005
 character+0.005
hop+0.005
 bottom+0.005
Neg logit contributions
arl-0.004
 upon-0.004
ll-0.004
ety-0.004
endant-0.004
Token said
Token ID531
Feature activation-5.14e-01

Pos logit contributions
ze+0.051
zed+0.046
 character+0.043
hop+0.043
 bottom+0.041
Neg logit contributions
arl-0.020
 upon-0.018
ll-0.017
ety-0.017
endant-0.015
Tokenksh
Token ID50133
Feature activation-3.02e+00

Pos logit contributions
ze+0.028
zed+0.025
 character+0.023
hop+0.023
 bottom+0.021
Neg logit contributions
arl-0.029
 upon-0.027
ll-0.026
ety-0.025
endant-0.024
Tokenakes
Token ID1124
Feature activation-2.00e-01

Pos logit contributions
ze+0.082
zed+0.076
 character+0.072
hop+0.067
 bottom+0.066
Neg logit contributions
arl-0.075
ll-0.075
ety-0.067
 upon-0.066
ucker-0.061
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
ze+0.006
zed+0.005
 character+0.005
hop+0.005
 bottom+0.005
Neg logit contributions
arl-0.005
 upon-0.004
ll-0.004
ety-0.004
endant-0.004
Token The
Token ID383
Feature activation-1.59e-02

Pos logit contributions
arl+0.005
ll+0.005
endant+0.005
 upon+0.004
ety+0.004
Neg logit contributions
ze-0.006
 character-0.005
zed-0.005
hop-0.005
 movies-0.004
Token mil
Token ID1465
Feature activation-9.23e-01

Pos logit contributions
ze+0.000
zed+0.000
 character+0.000
hop+0.000
 bottom+0.000
Neg logit contributions
arl-0.000
 upon-0.000
ll-0.000
ety-0.000
endant-0.000
Tokenksh
Token ID50133
Feature activation-3.44e+00

Pos logit contributions
ze+0.022
zed+0.020
 character+0.018
hop+0.017
 bottom+0.016
Neg logit contributions
arl-0.027
 upon-0.025
ll-0.025
ety-0.024
endant-0.023
Tokenakes
Token ID1124
Feature activation-5.61e-01

Pos logit contributions
ze+0.098
zed+0.092
 character+0.089
hop+0.084
 bottom+0.083
Neg logit contributions
ll-0.077
arl-0.076
ety-0.066
 upon-0.062
ucker-0.061
Token were
Token ID547
Feature activation-8.11e-02

Pos logit contributions
ze+0.016
zed+0.014
 character+0.013
hop+0.013
 bottom+0.012
Neg logit contributions
arl-0.014
 upon-0.013
ll-0.013
ety-0.012
endant-0.012
Token blue
Token ID4171
Feature activation+6.51e-01

Pos logit contributions
ze+0.002
zed+0.002
 character+0.002
hop+0.002
 bottom+0.002
Neg logit contributions
arl-0.002
ll-0.002
 upon-0.002
ety-0.002
endant-0.002
Token and
Token ID290
Feature activation+7.31e-01

Pos logit contributions
arl+0.013
 upon+0.012
ll+0.012
ety+0.011
endant+0.011
Neg logit contributions
ze-0.021
zed-0.019
 character-0.018
hop-0.018
 bottom-0.017
Token pink
Token ID11398
Feature activation+3.26e-01

Pos logit contributions
arl+0.019
ll+0.018
 upon+0.017
ety+0.017
endant+0.016
Neg logit contributions
ze-0.020
zed-0.018
 character-0.017
hop-0.016
 bottom-0.015
Token She
Token ID1375
Feature activation-1.29e-02

Pos logit contributions
ze+0.002
zed+0.001
 character+0.001
hop+0.001
 bottom+0.001
Neg logit contributions
arl-0.001
ll-0.001
 upon-0.001
ety-0.001
endant-0.001
Token had
Token ID550
Feature activation-5.93e-01

Pos logit contributions
ze+0.000
zed+0.000
 character+0.000
hop+0.000
 bottom+0.000
Neg logit contributions
arl-0.000
 upon-0.000
ll-0.000
ety-0.000
endant-0.000
Token never
Token ID1239
Feature activation-2.53e-01

Pos logit contributions
ze+0.017
zed+0.015
 character+0.014
hop+0.013
 bottom+0.013
Neg logit contributions
arl-0.016
ll-0.014
 upon-0.014
ety-0.013
 Bian-0.012
Token seen
Token ID1775
Feature activation-9.56e-01

Pos logit contributions
ze+0.008
zed+0.007
 character+0.007
hop+0.007
 tunes+0.006
Neg logit contributions
arl-0.006
ll-0.005
 upon-0.005
 Bian-0.004
endant-0.004
Token a
Token ID257
Feature activation-5.87e-01

Pos logit contributions
ze+0.026
zed+0.024
 character+0.022
hop+0.021
lder+0.020
Neg logit contributions
arl-0.024
 upon-0.023
ll-0.022
ety-0.021
endant-0.021
Token telephone
Token ID11426
Feature activation-3.40e+00

Pos logit contributions
ze+0.017
zed+0.015
 character+0.014
hop+0.014
 bottom+0.013
Neg logit contributions
arl-0.014
 upon-0.013
ll-0.013
ety-0.012
endant-0.012
Token before
Token ID878
Feature activation+1.54e-01

Pos logit contributions
zed+0.102
ze+0.101
lder+0.093
+0.087
 tunes+0.086
Neg logit contributions
 upon-0.065
arl-0.062
 but-0.058
ll-0.056
 ago-0.054
Token and
Token ID290
Feature activation-7.43e-01

Pos logit contributions
arl+0.004
ll+0.003
ety+0.003
 upon+0.003
endant+0.003
Neg logit contributions
ze-0.005
zed-0.004
 character-0.004
hop-0.004
 bottom-0.004
Token was
Token ID373
Feature activation-1.11e+00

Pos logit contributions
ze+0.021
zed+0.019
 character+0.017
hop+0.017
 bottom+0.016
Neg logit contributions
arl-0.019
ll-0.018
 upon-0.017
ety-0.017
endant-0.016
Token very
Token ID845
Feature activation-1.12e+00

Pos logit contributions
ze+0.031
zed+0.028
 character+0.026
hop+0.025
 bottom+0.023
Neg logit contributions
arl-0.029
ll-0.027
 upon-0.026
ety-0.025
endant-0.024
Token excited
Token ID6568
Feature activation-1.97e-01

Pos logit contributions
ze+0.030
zed+0.026
 character+0.024
hop+0.023
 bottom+0.022
Neg logit contributions
arl-0.030
ll-0.028
 upon-0.027
ety-0.027
endant-0.025
Token squ
Token ID2809
Feature activation-8.43e-01

Pos logit contributions
ze+0.026
zed+0.023
 character+0.022
hop+0.022
 bottom+0.020
Neg logit contributions
 upon-0.018
arl-0.018
ll-0.017
ety-0.017
endant-0.016
Tokenished
Token ID1348
Feature activation-3.26e-01

Pos logit contributions
ze+0.025
zed+0.022
 character+0.021
hop+0.020
 bottom+0.019
Neg logit contributions
arl-0.020
 upon-0.019
ll-0.018
ety-0.017
endant-0.017
Token it
Token ID340
Feature activation-1.20e+00

Pos logit contributions
ze+0.011
zed+0.010
lder+0.009
hop+0.009
 character+0.009
Neg logit contributions
 upon-0.006
arl-0.006
ety-0.005
ll-0.005
 Tweet-0.005
Token together
Token ID1978
Feature activation-6.45e-01

Pos logit contributions
ze+0.035
zed+0.031
 character+0.030
hop+0.029
lder+0.027
Neg logit contributions
arl-0.028
 upon-0.027
ll-0.026
ety-0.024
endant-0.023
Token.
Token ID13
Feature activation-9.29e-01

Pos logit contributions
ze+0.022
zed+0.020
 character+0.019
hop+0.018
lder+0.018
Neg logit contributions
arl-0.011
 upon-0.011
ll-0.011
ety-0.010
 ago-0.009
Token "
Token ID366
Feature activation-3.40e+00

Pos logit contributions
ze+0.027
zed+0.025
 character+0.023
hop+0.022
 bottom+0.022
Neg logit contributions
arl-0.023
ll-0.020
 upon-0.020
ety-0.019
endant-0.019
TokenWhat
Token ID2061
Feature activation-1.85e+00

Pos logit contributions
ze+0.090
zed+0.088
 character+0.078
hop+0.072
 bottom+0.071
Neg logit contributions
arl-0.082
ety-0.081
 upon-0.075
ll-0.072
 Bian-0.068
Token can
Token ID460
Feature activation+1.17e+00

Pos logit contributions
ze+0.059
zed+0.058
hop+0.054
 bottom+0.048
 character+0.048
Neg logit contributions
 upon-0.036
ll-0.033
arl-0.031
ety-0.031
 haven-0.030
Token I
Token ID314
Feature activation+9.73e-02

Pos logit contributions
arl+0.024
 upon+0.021
ll+0.021
ety+0.020
endant+0.018
Neg logit contributions
ze-0.040
zed-0.036
 character-0.034
hop-0.033
 bottom-0.032
Token make
Token ID787
Feature activation-7.73e-01

Pos logit contributions
arl+0.003
ll+0.003
 upon+0.003
ety+0.003
endant+0.002
Neg logit contributions
ze-0.002
zed-0.002
 character-0.002
hop-0.002
 bottom-0.002
Token this
Token ID428
Feature activation-8.43e-01

Pos logit contributions
ze+0.024
zed+0.024
hop+0.021
 character+0.021
 bottom+0.020
Neg logit contributions
arl-0.015
 upon-0.014
ll-0.013
ety-0.012
endant-0.012
Token made
Token ID925
Feature activation-7.96e-01

Pos logit contributions
ze+0.021
zed+0.019
hop+0.019
 basis+0.019
 character+0.018
Neg logit contributions
 upon-0.008
ety-0.007
ll-0.007
 Tweet-0.006
endant-0.006
Token a
Token ID257
Feature activation-4.12e-01

Pos logit contributions
ze+0.028
zed+0.027
hop+0.026
 basis+0.026
lder+0.026
Neg logit contributions
 upon-0.011
arl-0.009
 Tweet-0.008
endant-0.008
ety-0.008
Token funny
Token ID8258
Feature activation-7.47e-01

Pos logit contributions
zed+0.014
ze+0.013
hop+0.013
ation+0.012
seek+0.011
Neg logit contributions
 upon-0.008
arl-0.007
ll-0.006
endant-0.006
ety-0.006
Token face
Token ID1986
Feature activation-1.65e+00

Pos logit contributions
zed+0.020
hop+0.019
ation+0.018
 Sundays+0.017
+0.017
Neg logit contributions
 upon-0.015
 but-0.014
 too-0.013
 and-0.012
 Tweet-0.012
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
zed+0.060
ze+0.057
hop+0.054
 basis+0.054
lder+0.054
Neg logit contributions
 upon-0.019
arl-0.018
ety-0.017
 too-0.016
endant-0.015
Token "
Token ID366
Feature activation-3.38e+00

Pos logit contributions
ze+0.056
zed+0.051
 character+0.049
hop+0.048
 tunes+0.047
Neg logit contributions
arl-0.024
 upon-0.023
ety-0.020
ll-0.019
 Bian-0.017
TokenY
Token ID56
Feature activation-1.55e-02

Pos logit contributions
zed+0.101
ze+0.094
 character+0.084
hop+0.083
 vows+0.081
Neg logit contributions
ety-0.075
arl-0.064
 upon-0.058
endant-0.053
Tweet-0.052
Tokenuck
Token ID1347
Feature activation-1.65e+00

Pos logit contributions
ze+0.000
zed+0.000
 tunes+0.000
 character+0.000
hop+0.000
Neg logit contributions
arl-0.000
ll-0.000
ety-0.000
 upon-0.000
endant-0.000
Token,
Token ID11
Feature activation-2.61e+00

Pos logit contributions
zed+0.054
ze+0.054
hop+0.048
 character+0.048
 tunes+0.047
Neg logit contributions
arl-0.031
 upon-0.029
ll-0.028
ety-0.024
 Tweet-0.023
Token this
Token ID428
Feature activation-3.35e-01

Pos logit contributions
ze+0.084
zed+0.083
hop+0.077
 tunes+0.072
seek+0.072
Neg logit contributions
arl-0.046
 upon-0.046
 Tweet-0.043
endant-0.041
ety-0.039
Token apple
Token ID17180
Feature activation-6.07e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.007
hop+0.007
 tunes+0.007
Neg logit contributions
 upon-0.008
ety-0.008
arl-0.008
ll-0.008
endant-0.007
Token and
Token ID290
Feature activation+4.82e-01

Pos logit contributions
ze+0.013
zed+0.012
 character+0.011
hop+0.011
 bottom+0.011
Neg logit contributions
arl-0.008
ll-0.007
 upon-0.007
ety-0.006
endant-0.006
Token continued
Token ID3767
Feature activation-4.03e-01

Pos logit contributions
arl+0.009
 upon+0.009
ll+0.009
ety+0.008
 Bian+0.008
Neg logit contributions
ze-0.016
zed-0.014
 character-0.014
hop-0.013
 bottom-0.013
Token to
Token ID284
Feature activation-1.05e+00

Pos logit contributions
ze+0.016
 character+0.014
zed+0.014
est+0.013
hop+0.013
Neg logit contributions
arl-0.005
 upon-0.005
ll-0.004
 Bian-0.004
ety-0.004
Token stir
Token ID11240
Feature activation-7.36e-01

Pos logit contributions
ze+0.029
zed+0.026
 character+0.025
hop+0.024
 tunes+0.023
Neg logit contributions
arl-0.024
 upon-0.022
ll-0.022
 Bian-0.021
ety-0.021
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
ze+0.025
zed+0.023
lder+0.022
 character+0.021
hop+0.021
Neg logit contributions
 upon-0.012
arl-0.012
 Bian-0.010
ll-0.010
ety-0.010
Token panc
Token ID22154
Feature activation-3.37e+00

Pos logit contributions
zed+0.035
ze+0.034
hop+0.031
 bottom+0.028
+0.027
Neg logit contributions
arl-0.030
 upon-0.028
ll-0.027
endant-0.024
 ago-0.023
Tokenake
Token ID539
Feature activation-1.11e+00

Pos logit contributions
ze+0.073
 character+0.069
zed+0.064
 bottom+0.064
hop+0.060
Neg logit contributions
ll-0.097
arl-0.092
ety-0.084
ucker-0.082
 upon-0.078
Token batter
Token ID4984
Feature activation-1.02e+00

Pos logit contributions
ze+0.032
zed+0.030
hop+0.027
 character+0.026
lder+0.026
Neg logit contributions
arl-0.023
ll-0.021
 upon-0.021
ety-0.019
 Bian-0.018
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
ze+0.031
zed+0.028
 character+0.026
hop+0.025
 bottom+0.024
Neg logit contributions
arl-0.023
 upon-0.021
ll-0.021
ety-0.020
endant-0.019
Token
Token ID220
Feature activation-7.03e-01

Pos logit contributions
ze+0.037
zed+0.034
 character+0.032
hop+0.031
 tunes+0.030
Neg logit contributions
arl-0.019
 upon-0.018
ll-0.017
ety-0.017
endant-0.015
Token
Token ID198
Feature activation-1.61e+00

Pos logit contributions
ze+0.019
zed+0.018
 character+0.016
hop+0.016
 bottom+0.015
Neg logit contributions
arl-0.016
endant-0.015
ll-0.015
ety-0.014
 Bian-0.014
Token put
Token ID1234
Feature activation+3.36e-01

Pos logit contributions
ze+0.024
zed+0.022
 character+0.021
hop+0.020
 bottom+0.019
Neg logit contributions
arl-0.017
 upon-0.016
ll-0.016
ety-0.015
endant-0.015
Token it
Token ID340
Feature activation-1.16e+00

Pos logit contributions
arl+0.010
ll+0.010
 upon+0.009
ety+0.009
endant+0.009
Neg logit contributions
ze-0.008
zed-0.007
 character-0.007
hop-0.006
 middle-0.006
Token on
Token ID319
Feature activation-8.95e-01

Pos logit contributions
ze+0.028
zed+0.024
 character+0.021
hop+0.021
 bottom+0.020
Neg logit contributions
arl-0.034
ll-0.031
ety-0.030
 Bian-0.030
 haven-0.029
Token top
Token ID1353
Feature activation-6.22e-03

Pos logit contributions
ze+0.024
zed+0.021
 character+0.020
hop+0.019
 bottom+0.019
Neg logit contributions
arl-0.023
ll-0.022
 upon-0.022
ety-0.021
endant-0.020
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
ze+0.000
zed+0.000
 character+0.000
hop+0.000
 bottom+0.000
Neg logit contributions
arl-0.000
ll-0.000
 upon-0.000
ety-0.000
 Bian-0.000
Token "
Token ID366
Feature activation-3.35e+00

Pos logit contributions
ze+0.036
zed+0.033
 character+0.031
hop+0.030
 bottom+0.029
Neg logit contributions
arl-0.024
ll-0.023
 upon-0.022
ety-0.021
endant-0.020
TokenHello
Token ID15496
Feature activation-7.89e-01

Pos logit contributions
ze+0.096
zed+0.093
 character+0.082
 bottom+0.078
hop+0.074
Neg logit contributions
arl-0.075
ety-0.074
 upon-0.067
 Bian-0.062
ll-0.061
Token,
Token ID11
Feature activation-9.66e-01

Pos logit contributions
ze+0.028
zed+0.028
hop+0.025
 character+0.024
 tunes+0.023
Neg logit contributions
 upon-0.013
arl-0.013
ety-0.012
ll-0.012
endant-0.011
Token friend
Token ID1545
Feature activation-5.32e-01

Pos logit contributions
ze+0.029
 character+0.026
zed+0.025
hop+0.023
 bottom+0.023
Neg logit contributions
arl-0.021
ll-0.020
 upon-0.019
ety-0.019
endant-0.018
Token!"
Token ID2474
Feature activation-5.76e-01

Pos logit contributions
ze+0.016
zed+0.014
 character+0.013
hop+0.013
 bottom+0.012
Neg logit contributions
arl-0.012
 upon-0.012
ll-0.012
ety-0.011
endant-0.011
Token said
Token ID531
Feature activation-5.78e-01

Pos logit contributions
ze+0.018
zed+0.017
 character+0.017
hop+0.016
 middle+0.015
Neg logit contributions
arl-0.013
ll-0.012
 upon-0.011
ety-0.011
endant-0.010
Token to
Token ID284
Feature activation-1.59e+00

Pos logit contributions
ze+0.056
zed+0.052
 character+0.051
hop+0.050
lder+0.049
Neg logit contributions
 upon-0.017
arl-0.015
ety-0.014
ll-0.014
endant-0.013
Token her
Token ID607
Feature activation+1.42e-01

Pos logit contributions
ze+0.046
zed+0.041
 character+0.038
hop+0.037
 bottom+0.035
Neg logit contributions
arl-0.039
 upon-0.037
ll-0.036
ety-0.034
 Bian-0.033
Token mom
Token ID1995
Feature activation+2.45e-01

Pos logit contributions
arl+0.004
 upon+0.003
ll+0.003
ety+0.003
endant+0.003
Neg logit contributions
ze-0.004
zed-0.004
 character-0.003
hop-0.003
 bottom-0.003
Tokenmy
Token ID1820
Feature activation+2.70e-01

Pos logit contributions
arl+0.005
ll+0.004
 Bian+0.004
endant+0.004
 upon+0.004
Neg logit contributions
ze-0.008
zed-0.007
hop-0.006
 character-0.006
 bottom-0.006
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
arl+0.007
ll+0.006
 upon+0.006
ety+0.006
endant+0.006
Neg logit contributions
ze-0.007
zed-0.006
 character-0.006
hop-0.006
 bottom-0.006
Token "
Token ID366
Feature activation-3.33e+00

Pos logit contributions
ze+0.046
zed+0.042
 character+0.039
hop+0.038
 bottom+0.036
Neg logit contributions
arl-0.029
 upon-0.027
ll-0.026
ety-0.025
endant-0.023
TokenLook
Token ID8567
Feature activation-8.98e-01

Pos logit contributions
zed+0.102
ze+0.099
 character+0.085
hop+0.082
 bottom+0.079
Neg logit contributions
ety-0.073
arl-0.071
 upon-0.069
 Bian-0.058
endant-0.058
Token mom
Token ID1995
Feature activation-1.49e-01

Pos logit contributions
ze+0.034
zed+0.033
hop+0.031
 character+0.030
 tunes+0.030
Neg logit contributions
 upon-0.014
ll-0.012
ety-0.012
arl-0.012
endant-0.011
Tokenmy
Token ID1820
Feature activation-5.77e-02

Pos logit contributions
ze+0.005
zed+0.004
 character+0.004
hop+0.004
 bottom+0.004
Neg logit contributions
arl-0.003
 upon-0.002
ll-0.002
 Bian-0.002
endant-0.002
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
ze+0.002
zed+0.002
 character+0.001
hop+0.001
 bottom+0.001
Neg logit contributions
arl-0.001
 upon-0.001
ll-0.001
ety-0.001
endant-0.001
Token I
Token ID314
Feature activation-6.87e-01

Pos logit contributions
ze+0.073
zed+0.067
hop+0.062
 character+0.061
 tunes+0.058
Neg logit contributions
arl-0.047
 upon-0.045
ll-0.042
ety-0.041
endant-0.040
Token smiled
Token ID13541
Feature activation-2.22e-01

Pos logit contributions
ze+0.014
zed+0.012
 character+0.011
lder+0.011
hop+0.011
Neg logit contributions
arl-0.014
 upon-0.011
endant-0.011
ety-0.011
ll-0.011
Token and
Token ID290
Feature activation+5.90e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.007
hop+0.007
 bottom+0.007
Neg logit contributions
arl-0.003
 upon-0.003
ll-0.003
ety-0.003
endant-0.002
Token said
Token ID531
Feature activation+1.43e-01

Pos logit contributions
arl+0.013
 upon+0.011
ll+0.011
 ago+0.011
endant+0.010
Neg logit contributions
ze-0.019
zed-0.017
 character-0.016
hop-0.016
lder-0.015
Token,
Token ID11
Feature activation-4.92e-01

Pos logit contributions
arl+0.002
 upon+0.002
ll+0.002
 Bian+0.002
ety+0.002
Neg logit contributions
ze-0.005
zed-0.005
hop-0.005
 character-0.004
 tunes-0.004
Token "
Token ID366
Feature activation-1.20e-01

Pos logit contributions
ze+0.013
zed+0.011
 character+0.010
hop+0.010
 bottom+0.009
Neg logit contributions
arl-0.013
 upon-0.012
ll-0.012
ety-0.012
endant-0.012
TokenThank
Token ID10449
Feature activation+2.47e+00

Pos logit contributions
ze+0.003
zed+0.003
 character+0.003
 bottom+0.003
hop+0.003
Neg logit contributions
arl-0.003
ety-0.003
 upon-0.003
ll-0.003
 Bian-0.003
Token you
Token ID345
Feature activation+9.16e-01

Pos logit contributions
arl+0.043
 upon+0.039
ll+0.038
ety+0.037
endant+0.035
Neg logit contributions
ze-0.088
zed-0.080
 character-0.077
hop-0.075
lder-0.071
Token Stella
Token ID45856
Feature activation+1.24e+00

Pos logit contributions
arl+0.020
 upon+0.018
ll+0.018
ety+0.017
 Bian+0.016
Neg logit contributions
ze-0.029
zed-0.026
 character-0.025
hop-0.024
 tunes-0.023
Token!
Token ID0
Feature activation+6.93e-01

Pos logit contributions
arl+0.028
 upon+0.026
ll+0.026
ety+0.024
endant+0.023
Neg logit contributions
ze-0.038
zed-0.034
 character-0.032
hop-0.031
 bottom-0.030
Token What
Token ID1867
Feature activation-7.58e-01

Pos logit contributions
ll+0.018
arl+0.018
 upon+0.017
endant+0.017
ety+0.017
Neg logit contributions
ze-0.017
zed-0.015
 character-0.015
lder-0.014
hop-0.013
Token should
Token ID815
Feature activation+8.20e-01

Pos logit contributions
ze+0.023
zed+0.021
hop+0.019
 character+0.019
 bottom+0.018
Neg logit contributions
 upon-0.017
arl-0.016
ll-0.016
ety-0.015
 haven-0.014
Token it
Token ID340
Feature activation+4.59e-01

Pos logit contributions
ze+0.020
zed+0.018
 character+0.017
hop+0.016
 bottom+0.015
Neg logit contributions
arl-0.018
 upon-0.017
ll-0.017
ety-0.016
endant-0.015
Token was
Token ID373
Feature activation+9.29e-01

Pos logit contributions
arl+0.013
 upon+0.012
ll+0.012
ety+0.012
endant+0.011
Neg logit contributions
ze-0.011
zed-0.010
 character-0.009
hop-0.009
 bottom-0.008
Token so
Token ID523
Feature activation-2.24e-01

Pos logit contributions
arl+0.023
ll+0.019
endant+0.018
ety+0.017
 Bian+0.016
Neg logit contributions
ze-0.028
zed-0.026
 bottom-0.024
 character-0.024
hop-0.022
Token dark
Token ID3223
Feature activation+1.02e+00

Pos logit contributions
ze+0.005
zed+0.005
 bottom+0.005
 character+0.004
 middle+0.004
Neg logit contributions
arl-0.006
ety-0.005
ll-0.005
ist-0.005
endant-0.005
Token he
Token ID339
Feature activation+8.79e-01

Pos logit contributions
arl+0.027
ll+0.024
 upon+0.024
ety+0.024
endant+0.023
Neg logit contributions
ze-0.028
zed-0.025
 character-0.023
hop-0.022
 bottom-0.021
Token couldn
Token ID3521
Feature activation+2.17e+00

Pos logit contributions
arl+0.026
ll+0.022
ety+0.021
 upon+0.021
endant+0.021
Neg logit contributions
ze-0.024
zed-0.021
 character-0.019
hop-0.018
 bottom-0.018
Token't
Token ID470
Feature activation+9.23e-02

Pos logit contributions
arl+0.050
 upon+0.045
ety+0.044
ll+0.044
endant+0.043
Neg logit contributions
ze-0.067
zed-0.062
 character-0.056
hop-0.055
 bottom-0.055
Token see
Token ID766
Feature activation-9.96e-01

Pos logit contributions
arl+0.002
 upon+0.002
ll+0.002
ety+0.002
 Bian+0.002
Neg logit contributions
ze-0.003
zed-0.002
 character-0.002
lder-0.002
hop-0.002
Token anything
Token ID1997
Feature activation-5.36e-02

Pos logit contributions
ze+0.028
zed+0.025
 character+0.023
hop+0.022
lder+0.022
Neg logit contributions
arl-0.024
 upon-0.023
ll-0.022
ety-0.021
endant-0.021
Token.
Token ID13
Feature activation-2.42e-01

Pos logit contributions
ze+0.002
zed+0.002
 character+0.001
hop+0.001
lder+0.001
Neg logit contributions
arl-0.001
 upon-0.001
ll-0.001
ety-0.001
endant-0.001
Token Then
Token ID3244
Feature activation-2.24e-02

Pos logit contributions
ze+0.008
zed+0.008
 character+0.007
hop+0.007
 bottom+0.007
Neg logit contributions
arl-0.005
 upon-0.004
ll-0.004
ety-0.004
endant-0.004
Token,
Token ID11
Feature activation-3.52e-01

Pos logit contributions
arl+0.028
 upon+0.025
ll+0.025
ety+0.024
endant+0.023
Neg logit contributions
ze-0.038
zed-0.034
 character-0.032
hop-0.031
 bottom-0.029
Token "
Token ID366
Feature activation+3.63e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.007
hop+0.007
 bottom+0.007
Neg logit contributions
arl-0.010
 upon-0.009
ll-0.009
ety-0.009
endant-0.008
TokenCome
Token ID16773
Feature activation+1.11e+00

Pos logit contributions
arl+0.010
 upon+0.010
ll+0.010
ety+0.009
endant+0.009
Neg logit contributions
ze-0.009
zed-0.008
 character-0.007
hop-0.007
 tunes-0.007
Token on
Token ID319
Feature activation-9.17e-01

Pos logit contributions
arl+0.029
ety+0.026
ll+0.026
endant+0.026
 upon+0.026
Neg logit contributions
ze-0.030
zed-0.026
hop-0.024
 character-0.024
 bottom-0.023
Token,
Token ID11
Feature activation-5.33e-01

Pos logit contributions
ze+0.026
zed+0.023
 character+0.022
hop+0.021
 bottom+0.020
Neg logit contributions
arl-0.023
 upon-0.021
ll-0.021
ety-0.021
endant-0.020
Token I
Token ID314
Feature activation+2.08e+00

Pos logit contributions
ze+0.014
 character+0.012
zed+0.011
hop+0.011
 bottom+0.011
Neg logit contributions
arl-0.014
ll-0.014
endant-0.013
ety-0.013
 Bian-0.012
Token'll
Token ID1183
Feature activation+1.05e+00

Pos logit contributions
arl+0.056
endant+0.051
 Bian+0.048
ety+0.047
ll+0.045
Neg logit contributions
ze-0.053
hop-0.045
zed-0.044
 character-0.042
 bottom-0.042
Token show
Token ID905
Feature activation+5.99e-01

Pos logit contributions
arl+0.031
 upon+0.029
ety+0.028
ll+0.028
endant+0.027
Neg logit contributions
ze-0.026
zed-0.023
hop-0.021
 character-0.020
 bottom-0.020
Token you
Token ID345
Feature activation+3.16e-01

Pos logit contributions
arl+0.013
ll+0.012
 upon+0.012
ety+0.012
endant+0.011
Neg logit contributions
ze-0.019
zed-0.017
 character-0.016
hop-0.016
 tunes-0.015
Token.
Token ID13
Feature activation+6.70e-01

Pos logit contributions
arl+0.007
 upon+0.007
ll+0.006
ety+0.006
endant+0.006
Neg logit contributions
ze-0.010
zed-0.009
 character-0.008
hop-0.008
 bottom-0.007
Token I
Token ID314
Feature activation+1.43e+00

Pos logit contributions
ll+0.019
arl+0.019
 upon+0.018
ety+0.018
endant+0.017
Neg logit contributions
ze-0.015
zed-0.014
 character-0.013
hop-0.013
five-0.012
Token day
Token ID1110
Feature activation-1.20e+00

Pos logit contributions
ze+0.024
zed+0.021
hop+0.018
 character+0.018
 bottom+0.018
Neg logit contributions
 upon-0.026
arl-0.026
ll-0.025
ety-0.025
 ago-0.023
Token,
Token ID11
Feature activation-1.42e-02

Pos logit contributions
ze+0.041
zed+0.038
hop+0.035
 character+0.035
 tunes+0.034
Neg logit contributions
 upon-0.021
arl-0.020
ll-0.018
ety-0.018
 Bian-0.018
Token her
Token ID607
Feature activation-2.05e-01

Pos logit contributions
ze+0.000
zed+0.000
 character+0.000
hop+0.000
 tunes+0.000
Neg logit contributions
arl-0.000
ll-0.000
endant-0.000
 upon-0.000
ety-0.000
Token mom
Token ID1995
Feature activation+2.49e-01

Pos logit contributions
ze+0.005
zed+0.005
 character+0.005
hop+0.004
 bottom+0.004
Neg logit contributions
arl-0.006
ll-0.005
 upon-0.005
ety-0.005
endant-0.005
Token said
Token ID531
Feature activation+5.65e-01

Pos logit contributions
arl+0.007
endant+0.006
 Bian+0.006
 upon+0.006
utt+0.005
Neg logit contributions
ze-0.007
zed-0.005
 character-0.005
lder-0.005
hop-0.005
Token she
Token ID673
Feature activation+1.94e+00

Pos logit contributions
arl+0.011
 upon+0.010
ll+0.010
ety+0.009
endant+0.009
Neg logit contributions
ze-0.019
zed-0.017
 character-0.016
hop-0.016
 bottom-0.015
Token had
Token ID550
Feature activation-1.00e-01

Pos logit contributions
arl+0.051
 upon+0.043
ll+0.043
ety+0.040
endant+0.040
Neg logit contributions
ze-0.057
zed-0.050
 character-0.048
hop-0.044
lder-0.043
Token to
Token ID284
Feature activation-3.66e-01

Pos logit contributions
ze+0.003
zed+0.003
 character+0.003
hop+0.003
 bottom+0.003
Neg logit contributions
arl-0.002
ll-0.002
 upon-0.002
ety-0.002
endant-0.002
Token wear
Token ID5806
Feature activation+4.31e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.007
hop+0.007
 bottom+0.007
Neg logit contributions
arl-0.010
 upon-0.010
ll-0.010
ety-0.009
endant-0.009
Token a
Token ID257
Feature activation-4.06e-01

Pos logit contributions
arl+0.012
ll+0.012
ety+0.011
 upon+0.011
endant+0.010
Neg logit contributions
ze-0.011
zed-0.010
 character-0.009
hop-0.009
 tunes-0.008
Token green
Token ID4077
Feature activation+7.84e-03

Pos logit contributions
ze+0.011
zed+0.009
 character+0.009
 bottom+0.008
hop+0.008
Neg logit contributions
arl-0.011
ll-0.011
ety-0.010
 upon-0.010
 ago-0.009
Token see
Token ID766
Feature activation-8.05e-01

Pos logit contributions
arl+0.005
ll+0.004
 upon+0.004
ety+0.004
endant+0.004
Neg logit contributions
ze-0.005
zed-0.004
 character-0.004
hop-0.004
 bottom-0.004
Token animals
Token ID4695
Feature activation+1.01e+00

Pos logit contributions
ze+0.021
zed+0.018
 character+0.017
hop+0.016
 bottom+0.016
Neg logit contributions
arl-0.023
ll-0.021
 upon-0.020
ety-0.020
 Bian-0.019
Token,
Token ID11
Feature activation+1.03e+00

Pos logit contributions
arl+0.028
ety+0.025
ll+0.023
 Bian+0.023
endant+0.023
Neg logit contributions
ze-0.027
zed-0.023
hop-0.022
 character-0.022
 bottom-0.021
Token trees
Token ID7150
Feature activation+2.12e+00

Pos logit contributions
arl+0.035
ll+0.034
ucker+0.029
endant+0.028
uffy+0.028
Neg logit contributions
ze-0.022
 movies-0.019
 character-0.018
 tunes-0.018
hop-0.017
Token,
Token ID11
Feature activation+1.28e+00

Pos logit contributions
arl+0.070
ety+0.064
endant+0.063
 upon+0.062
ll+0.062
Neg logit contributions
ze-0.039
 bottom-0.032
hop-0.031
zed-0.031
 character-0.029
Token flowers
Token ID12734
Feature activation+2.03e+00

Pos logit contributions
arl+0.040
ll+0.038
ucker+0.034
utt+0.033
endant+0.033
Neg logit contributions
ze-0.029
 movies-0.025
 tunes-0.023
hop-0.023
 middle-0.022
Token and
Token ID290
Feature activation+2.69e-04

Pos logit contributions
arl+0.066
ety+0.060
 Bian+0.057
 upon+0.057
endant+0.057
Neg logit contributions
ze-0.039
hop-0.032
zed-0.032
 bottom-0.032
 character-0.030
Token a
Token ID257
Feature activation+5.84e-01

Pos logit contributions
arl+0.000
ll+0.000
 upon+0.000
ety+0.000
endant+0.000
Neg logit contributions
ze-0.000
zed-0.000
 character-0.000
hop-0.000
 tunes-0.000
Token house
Token ID2156
Feature activation+6.08e-01

Pos logit contributions
arl+0.015
ll+0.014
 upon+0.014
ety+0.013
 Bian+0.013
Neg logit contributions
ze-0.016
zed-0.014
 character-0.013
hop-0.013
 bottom-0.013
Token.
Token ID13
Feature activation+4.53e-01

Pos logit contributions
arl+0.016
ll+0.015
 upon+0.015
ety+0.015
 Bian+0.014
Neg logit contributions
ze-0.016
zed-0.014
 character-0.013
hop-0.013
 bottom-0.012
Token
Token ID198
Feature activation+7.60e-02

Pos logit contributions
ll+0.014
arl+0.012
endant+0.012
ist+0.011
ucker+0.011
Neg logit contributions
ze-0.010
 Everyone-0.010
 movies-0.009
 character-0.009
 middle-0.009
Token said
Token ID531
Feature activation+5.85e-01

Pos logit contributions
arl+0.043
 upon+0.036
endant+0.036
ll+0.035
ety+0.035
Neg logit contributions
ze-0.042
zed-0.037
 character-0.035
lder-0.032
hop-0.031
Token,
Token ID11
Feature activation-2.45e-01

Pos logit contributions
arl+0.013
 upon+0.012
ll+0.011
ety+0.011
endant+0.010
Neg logit contributions
ze-0.018
zed-0.017
 character-0.016
hop-0.015
 bottom-0.015
Token "
Token ID366
Feature activation+2.68e-01

Pos logit contributions
ze+0.006
zed+0.005
 character+0.005
hop+0.004
lder+0.004
Neg logit contributions
arl-0.007
ll-0.007
ety-0.007
 upon-0.007
endant-0.007
TokenIf
Token ID1532
Feature activation+1.21e+00

Pos logit contributions
arl+0.008
 upon+0.007
ll+0.007
endant+0.007
ety+0.007
Neg logit contributions
ze-0.006
zed-0.005
hop-0.005
 character-0.005
 tunes-0.004
Token you
Token ID345
Feature activation+5.20e-01

Pos logit contributions
arl+0.033
endant+0.027
ll+0.026
ety+0.026
 upon+0.026
Neg logit contributions
ze-0.034
zed-0.032
 character-0.029
hop-0.029
 bottom-0.027
Token keep
Token ID1394
Feature activation+3.91e-01

Pos logit contributions
arl+0.014
 upon+0.013
endant+0.012
 Bian+0.012
ll+0.012
Neg logit contributions
ze-0.013
hop-0.011
 character-0.010
zed-0.010
 bottom-0.010
Token signing
Token ID8415
Feature activation-7.92e-01

Pos logit contributions
arl+0.011
ll+0.010
 upon+0.010
ety+0.010
endant+0.009
Neg logit contributions
ze-0.010
zed-0.009
 character-0.008
hop-0.008
 bottom-0.008
Token the
Token ID262
Feature activation-7.70e-01

Pos logit contributions
ze+0.023
zed+0.020
 character+0.019
hop+0.019
 bottom+0.018
Neg logit contributions
arl-0.019
ll-0.018
 upon-0.017
ety-0.017
endant-0.016
Token ice
Token ID4771
Feature activation-8.14e-02

Pos logit contributions
ze+0.021
zed+0.019
 character+0.017
hop+0.017
 bottom+0.016
Neg logit contributions
arl-0.020
 upon-0.019
ll-0.018
ety-0.018
endant-0.017
Token,
Token ID11
Feature activation-6.86e-01

Pos logit contributions
ze+0.002
zed+0.002
 character+0.002
hop+0.002
 bottom+0.001
Neg logit contributions
arl-0.002
 upon-0.002
ll-0.002
 Bian-0.002
ety-0.002
Token it
Token ID340
Feature activation-1.93e-01

Pos logit contributions
ze+0.020
zed+0.018
 character+0.017
hop+0.016
 bottom+0.016
Neg logit contributions
arl-0.016
ll-0.015
ety-0.015
endant-0.014
 upon-0.014
Token't
Token ID470
Feature activation-7.42e-01

Pos logit contributions
arl+0.018
 upon+0.017
ll+0.017
ety+0.015
endant+0.014
Neg logit contributions
ze-0.034
zed-0.031
 character-0.029
hop-0.028
 bottom-0.027
Token know
Token ID760
Feature activation-7.84e-04

Pos logit contributions
ze+0.020
zed+0.017
hop+0.017
 character+0.016
 bottom+0.015
Neg logit contributions
arl-0.019
ll-0.018
 upon-0.018
endant-0.017
ety-0.017
Token,"
Token ID553
Feature activation-1.03e+00

Pos logit contributions
ze+0.000
zed+0.000
 character+0.000
hop+0.000
est+0.000
Neg logit contributions
arl-0.000
ll-0.000
 upon-0.000
ety-0.000
 Bian-0.000
Token Ben
Token ID3932
Feature activation-2.94e-01

Pos logit contributions
ze+0.025
 character+0.024
hop+0.022
 delight+0.021
zed+0.021
Neg logit contributions
arl-0.033
ll-0.029
 upon-0.026
endant-0.025
omed-0.025
Token said
Token ID531
Feature activation-1.22e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.008
hop+0.008
lder+0.008
Neg logit contributions
arl-0.007
 upon-0.006
endant-0.006
ll-0.006
ety-0.005
Token.
Token ID13
Feature activation+3.77e-01

Pos logit contributions
ze+0.003
zed+0.003
 character+0.003
hop+0.003
 bottom+0.002
Neg logit contributions
arl-0.003
ll-0.003
 upon-0.003
ety-0.003
endant-0.003
Token "
Token ID366
Feature activation-2.50e-01

Pos logit contributions
ll+0.014
endant+0.013
arl+0.013
ety+0.013
 upon+0.012
Neg logit contributions
ze-0.006
 character-0.005
zed-0.005
hop-0.004
 bottom-0.004
TokenMaybe
Token ID13300
Feature activation+4.62e-01

Pos logit contributions
ze+0.007
zed+0.006
 character+0.005
hop+0.005
 bottom+0.005
Neg logit contributions
arl-0.007
 upon-0.006
ll-0.006
ety-0.006
endant-0.006
Token it
Token ID340
Feature activation+2.22e-01

Pos logit contributions
arl+0.013
ll+0.012
 upon+0.011
ety+0.010
 Bian+0.010
Neg logit contributions
ze-0.012
zed-0.010
 character-0.010
 bottom-0.010
hop-0.009
Token belongs
Token ID14448
Feature activation+2.03e+00

Pos logit contributions
arl+0.008
 ago+0.007
 upon+0.007
 Bian+0.007
endant+0.007
Neg logit contributions
ze-0.005
zed-0.004
 character-0.003
est-0.003
 bottom-0.003
Token to
Token ID284
Feature activation-5.30e-01

Pos logit contributions
arl+0.060
 Bian+0.052
ll+0.049
ucker+0.048
endant+0.047
Neg logit contributions
ze-0.049
zed-0.041
 character-0.039
hop-0.038
 thing-0.037
Token go
Token ID467
Feature activation+4.09e-01

Pos logit contributions
arl+0.008
 upon+0.008
ll+0.007
ety+0.007
 Bian+0.007
Neg logit contributions
ze-0.011
zed-0.010
 character-0.009
hop-0.009
lder-0.008
Token to
Token ID284
Feature activation-5.72e-01

Pos logit contributions
arl+0.012
endant+0.011
ucker+0.010
ll+0.010
 Bian+0.010
Neg logit contributions
ze-0.009
hop-0.009
zed-0.008
 character-0.007
 bottom-0.007
Token the
Token ID262
Feature activation-1.33e-01

Pos logit contributions
ze+0.015
zed+0.014
 character+0.013
hop+0.012
 bottom+0.012
Neg logit contributions
arl-0.015
ll-0.014
 upon-0.014
ety-0.013
endant-0.013
Token circus
Token ID33125
Feature activation+3.42e-01

Pos logit contributions
ze+0.004
zed+0.004
liest+0.003
est+0.003
hop+0.003
Neg logit contributions
arl-0.003
 upon-0.003
ll-0.003
 here-0.002
 there-0.002
Token too
Token ID1165
Feature activation-2.98e-01

Pos logit contributions
 upon+0.007
arl+0.007
ll+0.006
ety+0.006
 Bian+0.005
Neg logit contributions
ze-0.011
zed-0.010
 character-0.009
hop-0.009
 bottom-0.009
Token.
Token ID13
Feature activation+6.15e-01

Pos logit contributions
ze+0.007
zed+0.006
 character+0.006
hop+0.006
 bottom+0.006
Neg logit contributions
arl-0.008
 upon-0.008
endant-0.008
ll-0.008
ety-0.007
Token But
Token ID887
Feature activation-3.34e-01

Pos logit contributions
arl+0.013
ll+0.012
 upon+0.011
endant+0.011
ety+0.011
Neg logit contributions
ze-0.020
zed-0.018
 character-0.018
hop-0.016
 tunes-0.016
Token he
Token ID339
Feature activation+6.44e-01

Pos logit contributions
ze+0.011
zed+0.010
 character+0.010
hop+0.009
 bottom+0.009
Neg logit contributions
arl-0.007
ll-0.006
endant-0.006
 upon-0.006
ety-0.006
Token didn
Token ID1422
Feature activation+6.23e-01

Pos logit contributions
arl+0.015
 upon+0.013
ll+0.013
ety+0.013
endant+0.013
Neg logit contributions
ze-0.020
zed-0.018
 character-0.017
hop-0.016
 bottom-0.015
Token't
Token ID470
Feature activation-8.48e-01

Pos logit contributions
arl+0.011
 upon+0.010
ll+0.010
ety+0.009
 Bian+0.009
Neg logit contributions
ze-0.021
zed-0.020
 character-0.019
hop-0.018
 tunes-0.017
Token have
Token ID423
Feature activation-5.43e-01

Pos logit contributions
ze+0.027
zed+0.024
 character+0.023
hop+0.022
 bottom+0.021
Neg logit contributions
arl-0.019
 upon-0.017
ll-0.017
ety-0.016
 Bian-0.015
Token a
Token ID257
Feature activation+1.16e-01

Pos logit contributions
arl+0.002
ll+0.001
 upon+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.002
zed-0.002
 character-0.002
hop-0.001
 bottom-0.001
Token time
Token ID640
Feature activation+1.10e+00

Pos logit contributions
arl+0.003
ll+0.003
 upon+0.003
endant+0.003
 Bian+0.002
Neg logit contributions
ze-0.003
zed-0.003
 character-0.003
hop-0.003
 bottom-0.003
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
arl+0.033
endant+0.029
ety+0.029
ll+0.028
 Bian+0.027
Neg logit contributions
ze-0.024
zed-0.022
 character-0.020
 bottom-0.019
hop-0.019
Token there
Token ID612
Feature activation-1.23e+00

Pos logit contributions
arl+0.025
ll+0.023
endant+0.022
 Bian+0.020
ety+0.019
Neg logit contributions
ze-0.043
zed-0.039
 bottom-0.038
 character-0.037
 middle-0.037
Token was
Token ID373
Feature activation+9.92e-01

Pos logit contributions
ze+0.039
zed+0.034
hop+0.032
 tunes+0.032
 character+0.032
Neg logit contributions
 upon-0.027
ll-0.024
arl-0.022
 haven-0.022
ety-0.022
Token a
Token ID257
Feature activation+7.34e-02

Pos logit contributions
arl+0.026
ll+0.025
 upon+0.023
ety+0.022
endant+0.022
Neg logit contributions
ze-0.027
zed-0.024
 character-0.023
hop-0.022
 bottom-0.021
Token little
Token ID1310
Feature activation+8.25e-01

Pos logit contributions
arl+0.002
 upon+0.001
ll+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.002
zed-0.002
 character-0.002
hop-0.002
 bottom-0.002
Token girl
Token ID2576
Feature activation-3.35e-01

Pos logit contributions
arl+0.023
ll+0.022
 upon+0.022
endant+0.020
ety+0.020
Neg logit contributions
ze-0.020
 character-0.017
hop-0.016
zed-0.015
lder-0.015
Token named
Token ID3706
Feature activation+8.74e-01

Pos logit contributions
ze+0.013
zed+0.013
 character+0.012
hop+0.012
 bottom+0.011
Neg logit contributions
arl-0.004
 upon-0.004
ll-0.004
ety-0.004
endant-0.003
Token Lily
Token ID20037
Feature activation-2.00e-01

Pos logit contributions
arl+0.014
ll+0.013
 upon+0.012
endant+0.012
ety+0.011
Neg logit contributions
ze-0.031
zed-0.029
 character-0.027
hop-0.026
 bottom-0.026
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
ze+0.007
zed+0.006
 character+0.006
hop+0.006
 bottom+0.006
Neg logit contributions
 upon-0.003
arl-0.003
ety-0.003
ll-0.003
 ago-0.003
Token a
Token ID257
Feature activation+3.68e-02

Pos logit contributions
arl+0.025
ll+0.024
 upon+0.022
ety+0.021
endant+0.021
Neg logit contributions
ze-0.029
zed-0.025
 character-0.024
hop-0.023
 bottom-0.022
Token sad
Token ID6507
Feature activation+5.23e-01

Pos logit contributions
arl+0.001
 upon+0.001
ll+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token river
Token ID7850
Feature activation+8.33e-01

Pos logit contributions
arl+0.014
ll+0.013
 upon+0.013
ety+0.012
endant+0.012
Neg logit contributions
ze-0.014
 character-0.012
zed-0.012
hop-0.011
 bottom-0.011
Token.
Token ID13
Feature activation-2.60e-01

Pos logit contributions
arl+0.018
 upon+0.017
ll+0.016
ety+0.016
endant+0.014
Neg logit contributions
ze-0.026
zed-0.024
 character-0.022
hop-0.021
 bottom-0.020
Token The
Token ID383
Feature activation-5.14e-01

Pos logit contributions
ze+0.008
zed+0.007
 character+0.006
hop+0.006
 bottom+0.006
Neg logit contributions
arl-0.006
 upon-0.006
ll-0.006
ety-0.005
endant-0.005
Token river
Token ID7850
Feature activation+1.88e-01

Pos logit contributions
ze+0.011
 bottom+0.009
zed+0.008
 middle+0.008
 character+0.008
Neg logit contributions
arl-0.016
endant-0.015
ll-0.015
ety-0.015
 upon-0.014
Token whispered
Token ID25029
Feature activation-7.70e-01

Pos logit contributions
arl+0.005
 upon+0.005
ll+0.005
ety+0.005
 Bian+0.004
Neg logit contributions
ze-0.005
zed-0.004
 character-0.004
hop-0.004
 bottom-0.004
Token in
Token ID287
Feature activation-1.52e+00

Pos logit contributions
ze+0.025
zed+0.024
 character+0.022
hop+0.022
 tunes+0.021
Neg logit contributions
 upon-0.015
arl-0.014
ll-0.012
ety-0.012
endant-0.011
Token the
Token ID262
Feature activation-3.59e-01

Pos logit contributions
ze+0.051
zed+0.047
hop+0.045
 character+0.044
 bottom+0.042
Neg logit contributions
arl-0.027
 upon-0.025
ety-0.022
 Bian-0.021
 Tweet-0.021
Token wind
Token ID2344
Feature activation-6.28e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.007
hop+0.007
 bottom+0.007
Neg logit contributions
arl-0.010
ll-0.010
 upon-0.009
ety-0.009
endant-0.009
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
ze+0.018
zed+0.016
 character+0.015
hop+0.015
 tunes+0.014
Neg logit contributions
arl-0.016
ll-0.014
 upon-0.014
ety-0.013
 Bian-0.013
Token was
Token ID373
Feature activation+1.35e-02

Pos logit contributions
arl+0.025
 upon+0.022
ll+0.022
ety+0.021
endant+0.021
Neg logit contributions
ze-0.020
zed-0.017
 character-0.016
hop-0.015
 bottom-0.014
Token a
Token ID257
Feature activation+4.21e-02

Pos logit contributions
arl+0.000
ll+0.000
 upon+0.000
ety+0.000
endant+0.000
Neg logit contributions
ze-0.000
zed-0.000
 character-0.000
 bottom-0.000
hop-0.000
Token good
Token ID922
Feature activation-3.50e-02

Pos logit contributions
arl+0.001
 upon+0.001
ll+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.001
zed-0.001
hop-0.001
 character-0.001
lder-0.001
Token idea
Token ID2126
Feature activation+9.96e-02

Pos logit contributions
ze+0.001
zed+0.001
 character+0.001
hop+0.001
 bottom+0.001
Neg logit contributions
arl-0.001
 upon-0.001
ll-0.001
ety-0.001
endant-0.001
Token.
Token ID13
Feature activation+4.03e-01

Pos logit contributions
arl+0.002
 upon+0.002
ll+0.002
ety+0.002
endant+0.002
Neg logit contributions
ze-0.003
zed-0.003
 character-0.003
hop-0.003
 bottom-0.002
Token So
Token ID1406
Feature activation-1.84e-01

Pos logit contributions
arl+0.008
ll+0.007
 upon+0.007
ety+0.007
endant+0.007
Neg logit contributions
ze-0.013
zed-0.012
 character-0.011
hop-0.011
 bottom-0.011
Token,
Token ID11
Feature activation-5.95e-01

Pos logit contributions
ze+0.006
zed+0.006
 character+0.005
hop+0.005
 bottom+0.005
Neg logit contributions
arl-0.004
ll-0.003
 upon-0.003
ety-0.003
endant-0.003
Token they
Token ID484
Feature activation-1.58e-01

Pos logit contributions
ze+0.021
zed+0.019
 character+0.018
hop+0.017
 tunes+0.017
Neg logit contributions
arl-0.011
 upon-0.010
ll-0.010
ety-0.009
 Bian-0.008
Token both
Token ID1111
Feature activation-3.66e-01

Pos logit contributions
ze+0.004
zed+0.004
 character+0.004
hop+0.003
 bottom+0.003
Neg logit contributions
arl-0.004
 upon-0.004
ll-0.004
ety-0.004
endant-0.004
Token ran
Token ID4966
Feature activation+4.29e-01

Pos logit contributions
ze+0.011
zed+0.010
 character+0.009
hop+0.009
 bottom+0.008
Neg logit contributions
arl-0.009
 upon-0.008
ll-0.008
ety-0.008
endant-0.007
Token to
Token ID284
Feature activation-9.76e-01

Pos logit contributions
arl+0.011
 upon+0.011
ll+0.011
ety+0.010
endant+0.010
Neg logit contributions
ze-0.011
zed-0.010
 character-0.009
hop-0.009
 bottom-0.009
Token likes
Token ID7832
Feature activation+9.10e-03

Pos logit contributions
ze+0.025
zed+0.023
 character+0.022
hop+0.021
 bottom+0.020
Neg logit contributions
 upon-0.017
ll-0.017
arl-0.017
ety-0.015
endant-0.014
Token yoga
Token ID20351
Feature activation+2.50e-01

Pos logit contributions
arl+0.000
 upon+0.000
ll+0.000
ety+0.000
 Bian+0.000
Neg logit contributions
ze-0.000
zed-0.000
 character-0.000
hop-0.000
 bottom-0.000
Token.
Token ID13
Feature activation+4.44e-02

Pos logit contributions
arl+0.004
ll+0.004
 upon+0.004
ety+0.004
 ago+0.004
Neg logit contributions
ze-0.008
zed-0.008
hop-0.007
 character-0.007
lder-0.007
Token She
Token ID1375
Feature activation+4.05e-01

Pos logit contributions
ll+0.001
arl+0.001
 upon+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token feels
Token ID5300
Feature activation-1.49e+00

Pos logit contributions
arl+0.010
 upon+0.009
ll+0.009
ety+0.009
endant+0.009
Neg logit contributions
ze-0.011
zed-0.010
 character-0.009
hop-0.009
 bottom-0.009
Token good
Token ID922
Feature activation-1.01e+00

Pos logit contributions
ze+0.050
zed+0.050
lder+0.048
hop+0.048
 basis+0.046
Neg logit contributions
arl-0.024
 upon-0.023
 Bian-0.018
 there-0.017
 too-0.017
Token and
Token ID290
Feature activation-5.84e-01

Pos logit contributions
ze+0.037
zed+0.036
lder+0.033
hop+0.032
 bottom+0.032
Neg logit contributions
arl-0.015
 upon-0.015
ll-0.015
 ago-0.012
ety-0.011
Token relaxed
Token ID18397
Feature activation+4.76e-01

Pos logit contributions
ze+0.018
zed+0.017
 character+0.016
hop+0.016
 bottom+0.015
Neg logit contributions
arl-0.012
 upon-0.011
ll-0.011
ety-0.010
endant-0.010
Token.
Token ID13
Feature activation-7.83e-01

Pos logit contributions
arl+0.009
 upon+0.009
ll+0.008
ety+0.008
 ago+0.007
Neg logit contributions
ze-0.016
zed-0.015
 character-0.014
hop-0.013
lder-0.013
Token She
Token ID1375
Feature activation+1.45e-01

Pos logit contributions
ze+0.024
zed+0.021
 character+0.021
hop+0.020
 bottom+0.019
Neg logit contributions
ll-0.017
arl-0.016
 upon-0.015
endant-0.015
ety-0.014
Token thinks
Token ID6834
Feature activation-6.56e-01

Pos logit contributions
arl+0.004
 upon+0.003
ll+0.003
ety+0.003
endant+0.003
Neg logit contributions
ze-0.004
zed-0.004
 character-0.003
hop-0.003
 bottom-0.003
Token day
Token ID1110
Feature activation-1.40e+00

Pos logit contributions
ze+0.036
zed+0.031
 character+0.028
hop+0.027
 tunes+0.025
Neg logit contributions
arl-0.044
 upon-0.043
ll-0.042
ety-0.040
 Bian-0.038
Token on
Token ID319
Feature activation+3.06e-02

Pos logit contributions
ze+0.045
zed+0.039
 character+0.039
 tunes+0.038
hop+0.037
Neg logit contributions
arl-0.030
 upon-0.027
ll-0.026
 Bian-0.024
ety-0.023
Token,
Token ID11
Feature activation+2.13e-01

Pos logit contributions
arl+0.001
ll+0.001
ety+0.001
endant+0.001
utt+0.001
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
 bottom-0.001
hop-0.000
Token Lily
Token ID20037
Feature activation-1.71e+00

Pos logit contributions
arl+0.004
ll+0.004
endant+0.004
 upon+0.003
ety+0.003
Neg logit contributions
ze-0.007
zed-0.007
 character-0.006
hop-0.006
 bottom-0.006
Token always
Token ID1464
Feature activation-4.53e-02

Pos logit contributions
ze+0.053
zed+0.047
 character+0.047
hop+0.044
 bottom+0.044
Neg logit contributions
ll-0.035
 upon-0.033
arl-0.031
ety-0.031
 Bian-0.028
Token looked
Token ID3114
Feature activation-1.59e+00

Pos logit contributions
ze+0.001
zed+0.001
 character+0.001
hop+0.001
 bottom+0.001
Neg logit contributions
arl-0.001
 upon-0.001
ll-0.001
ety-0.001
endant-0.001
Token for
Token ID329
Feature activation+3.81e-01

Pos logit contributions
ze+0.038
zed+0.033
 character+0.031
hop+0.029
 bottom+0.027
Neg logit contributions
arl-0.046
ll-0.044
ety-0.042
endant-0.040
 upon-0.040
Token new
Token ID649
Feature activation+2.90e-01

Pos logit contributions
arl+0.011
ll+0.011
ety+0.010
 upon+0.010
endant+0.009
Neg logit contributions
ze-0.009
zed-0.008
 character-0.008
hop-0.007
 tunes-0.007
Token things
Token ID1243
Feature activation+7.42e-01

Pos logit contributions
arl+0.008
 upon+0.007
ll+0.007
ety+0.007
endant+0.007
Neg logit contributions
ze-0.007
zed-0.007
 character-0.006
hop-0.006
 bottom-0.006
Token to
Token ID284
Feature activation-2.59e-01

Pos logit contributions
arl+0.022
emy+0.018
 upon+0.018
ucker+0.018
ety+0.018
Neg logit contributions
ze-0.018
zed-0.016
hop-0.015
 character-0.015
 tunes-0.013
Token discover
Token ID7073
Feature activation+2.58e-02

Pos logit contributions
ze+0.006
zed+0.006
 character+0.005
hop+0.005
 bottom+0.005
Neg logit contributions
arl-0.008
ll-0.007
 upon-0.007
ety-0.007
endant-0.007
Token the
Token ID262
Feature activation-9.80e-01

Pos logit contributions
 upon+0.001
arl+0.001
ll+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.002
zed-0.002
hop-0.002
 character-0.002
lder-0.002
Token screw
Token ID9580
Feature activation-5.32e-01

Pos logit contributions
ze+0.027
zed+0.023
 character+0.022
hop+0.021
 bottom+0.021
Neg logit contributions
arl-0.026
ll-0.024
 upon-0.024
ety-0.023
endant-0.022
Token came
Token ID1625
Feature activation+6.67e-01

Pos logit contributions
ze+0.015
zed+0.013
 character+0.012
hop+0.012
 bottom+0.011
Neg logit contributions
arl-0.014
 upon-0.013
ll-0.013
ety-0.012
 Bian-0.012
Token out
Token ID503
Feature activation-9.48e-01

Pos logit contributions
arl+0.019
ll+0.017
 Bian+0.017
endant+0.017
ety+0.016
Neg logit contributions
ze-0.017
zed-0.014
 bottom-0.013
 character-0.012
hop-0.012
Token.
Token ID13
Feature activation-5.19e-01

Pos logit contributions
ze+0.028
zed+0.025
 character+0.024
 bottom+0.022
hop+0.022
Neg logit contributions
arl-0.022
ety-0.020
ll-0.020
 Bian-0.019
endant-0.019
Token Tom
Token ID4186
Feature activation-7.60e-01

Pos logit contributions
ze+0.021
zed+0.019
 character+0.018
hop+0.018
 tunes+0.018
Neg logit contributions
arl-0.007
 upon-0.006
ll-0.005
ety-0.005
 ago-0.004
Token was
Token ID373
Feature activation-7.46e-01

Pos logit contributions
ze+0.021
zed+0.019
 character+0.018
hop+0.017
 bottom+0.017
Neg logit contributions
 upon-0.018
arl-0.018
ll-0.018
ety-0.017
endant-0.016
Token sad
Token ID6507
Feature activation-1.45e-01

Pos logit contributions
ze+0.024
zed+0.022
 character+0.021
hop+0.021
 bottom+0.020
Neg logit contributions
arl-0.015
 upon-0.014
ll-0.013
ety-0.013
endant-0.012
Token and
Token ID290
Feature activation-3.26e-01

Pos logit contributions
ze+0.003
zed+0.003
 character+0.003
 bottom+0.003
hop+0.003
Neg logit contributions
arl-0.004
endant-0.004
 Bian-0.004
ety-0.004
ll-0.003
Token didn
Token ID1422
Feature activation-1.76e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.008
hop+0.007
lder+0.007
Neg logit contributions
arl-0.008
ll-0.008
 upon-0.007
ety-0.007
endant-0.007
Token't
Token ID470
Feature activation-9.20e-01

Pos logit contributions
ze+0.006
zed+0.006
 character+0.006
hop+0.005
 tunes+0.005
Neg logit contributions
arl-0.003
 upon-0.003
ll-0.003
ety-0.002
 Bian-0.002
Token the
Token ID262
Feature activation-8.79e-01

Pos logit contributions
ze+0.010
zed+0.009
 character+0.009
hop+0.008
 bottom+0.008
Neg logit contributions
arl-0.013
ll-0.012
ety-0.011
 upon-0.011
 Bian-0.010
Token door
Token ID3420
Feature activation-1.02e+00

Pos logit contributions
ze+0.028
zed+0.026
 character+0.024
hop+0.024
 bottom+0.023
Neg logit contributions
arl-0.018
 upon-0.017
ll-0.017
ety-0.016
endant-0.015
Token.
Token ID13
Feature activation-3.15e-01

Pos logit contributions
ze+0.024
zed+0.022
hop+0.020
 bottom+0.020
 character+0.020
Neg logit contributions
arl-0.030
ll-0.027
 Bian-0.027
ety-0.026
emy-0.026
Token He
Token ID679
Feature activation+1.16e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.008
hop+0.008
 bottom+0.007
Neg logit contributions
arl-0.008
ll-0.007
 upon-0.007
ety-0.006
endant-0.006
Token tried
Token ID3088
Feature activation-5.15e-01

Pos logit contributions
arl+0.003
 upon+0.003
ll+0.003
ety+0.003
endant+0.003
Neg logit contributions
ze-0.003
zed-0.003
 character-0.003
hop-0.003
lder-0.002
Token hard
Token ID1327
Feature activation-9.53e-01

Pos logit contributions
ze+0.016
zed+0.014
 character+0.013
hop+0.013
 bottom+0.013
Neg logit contributions
arl-0.012
ll-0.011
ety-0.010
endant-0.010
 upon-0.010
Token,
Token ID11
Feature activation+9.92e-01

Pos logit contributions
ze+0.022
zed+0.020
 character+0.019
hop+0.019
 tunes+0.017
Neg logit contributions
arl-0.027
ety-0.027
endant-0.026
ulu-0.025
 upon-0.023
Token but
Token ID475
Feature activation-5.23e-01

Pos logit contributions
arl+0.025
ll+0.022
endant+0.021
uffy+0.020
ety+0.020
Neg logit contributions
ze-0.031
zed-0.028
hop-0.028
 bottom-0.025
 character-0.025
Token it
Token ID340
Feature activation+4.73e-02

Pos logit contributions
ze+0.014
zed+0.013
 character+0.012
hop+0.012
 bottom+0.012
Neg logit contributions
arl-0.015
ll-0.013
ety-0.012
endant-0.012
 upon-0.012
Token never
Token ID1239
Feature activation+4.46e-01

Pos logit contributions
arl+0.001
 upon+0.001
ll+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token opened
Token ID4721
Feature activation+4.43e-01

Pos logit contributions
arl+0.012
ll+0.010
 upon+0.010
ety+0.010
endant+0.010
Neg logit contributions
ze-0.012
zed-0.011
 character-0.010
hop-0.010
 bottom-0.009
Token promise
Token ID6991
Feature activation+3.71e-01

Pos logit contributions
 upon+0.019
arl+0.019
endant+0.019
ll+0.019
ety+0.018
Neg logit contributions
ze-0.017
zed-0.014
 bottom-0.013
hop-0.013
 character-0.012
Token me
Token ID502
Feature activation+1.74e-01

Pos logit contributions
arl+0.009
 upon+0.008
ll+0.008
ety+0.007
endant+0.007
Neg logit contributions
ze-0.011
zed-0.010
 character-0.009
hop-0.009
 bottom-0.009
Token you
Token ID345
Feature activation+1.17e+00

Pos logit contributions
arl+0.005
 upon+0.004
ll+0.004
ety+0.004
endant+0.004
Neg logit contributions
ze-0.005
zed-0.004
 character-0.004
hop-0.004
 bottom-0.004
Token will
Token ID481
Feature activation+3.71e-01

Pos logit contributions
arl+0.024
 upon+0.021
ll+0.021
ety+0.020
 Bian+0.019
Neg logit contributions
ze-0.039
zed-0.035
 character-0.033
hop-0.032
 bottom-0.031
Token not
Token ID407
Feature activation-2.13e-01

Pos logit contributions
arl+0.009
 upon+0.009
endant+0.009
ety+0.009
ll+0.009
Neg logit contributions
ze-0.010
zed-0.008
 bottom-0.008
hop-0.007
 character-0.007
Token do
Token ID466
Feature activation-2.51e+00

Pos logit contributions
ze+0.006
zed+0.005
 character+0.004
hop+0.004
 bottom+0.004
Neg logit contributions
arl-0.006
 upon-0.005
ll-0.005
endant-0.005
 Bian-0.005
Token this
Token ID428
Feature activation-1.63e+00

Pos logit contributions
ze+0.072
zed+0.069
hop+0.066
 character+0.065
lder+0.062
Neg logit contributions
arl-0.051
 upon-0.047
ll-0.045
ety-0.043
endant-0.043
Token again
Token ID757
Feature activation-3.21e-01

Pos logit contributions
ze+0.045
zed+0.041
 character+0.038
hop+0.037
 bottom+0.034
Neg logit contributions
arl-0.041
 upon-0.038
ll-0.037
ety-0.036
endant-0.034
Token.
Token ID13
Feature activation-4.23e-01

Pos logit contributions
ze+0.009
zed+0.008
 character+0.008
hop+0.007
 bottom+0.007
Neg logit contributions
arl-0.008
 upon-0.007
ll-0.007
ety-0.007
endant-0.007
Token Do
Token ID2141
Feature activation+1.11e+00

Pos logit contributions
ze+0.010
zed+0.009
 character+0.009
hop+0.008
lder+0.008
Neg logit contributions
ll-0.011
 upon-0.011
arl-0.011
ety-0.010
endant-0.010
Token you
Token ID345
Feature activation+1.80e+00

Pos logit contributions
arl+0.026
 upon+0.023
endant+0.022
ety+0.021
ll+0.021
Neg logit contributions
ze-0.033
zed-0.030
 character-0.028
hop-0.027
lder-0.026
Token!
Token ID0
Feature activation-3.07e-01

Pos logit contributions
ze+0.003
 character+0.002
zed+0.002
 bottom+0.002
hop+0.002
Neg logit contributions
arl-0.003
 Bian-0.003
 upon-0.003
ll-0.003
ety-0.003
Token It
Token ID632
Feature activation+2.06e-01

Pos logit contributions
ze+0.007
 character+0.006
zed+0.006
hop+0.006
lder+0.005
Neg logit contributions
ll-0.009
arl-0.009
 upon-0.009
ety-0.008
endant-0.008
Token has
Token ID468
Feature activation-1.09e+00

Pos logit contributions
arl+0.008
 upon+0.007
ll+0.006
 ago+0.006
endant+0.006
Neg logit contributions
ze-0.004
zed-0.003
 character-0.003
 bottom-0.003
lder-0.003
Token a
Token ID257
Feature activation-1.94e-02

Pos logit contributions
ze+0.030
zed+0.026
 character+0.025
hop+0.023
 bottom+0.022
Neg logit contributions
arl-0.030
ll-0.028
 upon-0.026
ety-0.025
 Bian-0.025
Token funny
Token ID8258
Feature activation-2.10e-01

Pos logit contributions
ze+0.001
zed+0.000
 character+0.000
hop+0.000
 bottom+0.000
Neg logit contributions
arl-0.000
 upon-0.000
ll-0.000
ety-0.000
endant-0.000
Token face
Token ID1986
Feature activation-2.38e+00

Pos logit contributions
ze+0.005
zed+0.004
 character+0.004
hop+0.004
 bottom+0.003
Neg logit contributions
arl-0.006
 upon-0.006
ll-0.006
ety-0.006
endant-0.005
Token on
Token ID319
Feature activation-1.51e+00

Pos logit contributions
ze+0.069
zed+0.065
 character+0.059
hop+0.058
 tunes+0.056
Neg logit contributions
arl-0.055
 upon-0.050
ll-0.048
ety-0.047
 ago-0.046
Token it
Token ID340
Feature activation-1.18e+00

Pos logit contributions
ze+0.043
zed+0.040
 character+0.037
hop+0.037
 tunes+0.034
Neg logit contributions
arl-0.035
 upon-0.033
ety-0.030
ll-0.030
endant-0.029
Token."
Token ID526
Feature activation+2.22e-01

Pos logit contributions
ze+0.031
zed+0.027
 character+0.025
hop+0.025
 bottom+0.023
Neg logit contributions
arl-0.031
 upon-0.029
ll-0.029
ety-0.028
 Bian-0.027
Token
Token ID198
Feature activation-4.37e-01

Pos logit contributions
arl+0.006
ll+0.005
endant+0.005
 upon+0.005
ist+0.005
Neg logit contributions
ze-0.005
zed-0.005
hop-0.005
 character-0.005
 Everyone-0.004
Token
Token ID198
Feature activation+6.09e-01

Pos logit contributions
ze+0.012
zed+0.011
hop+0.010
 character+0.010
 bottom+0.009
Neg logit contributions
arl-0.011
endant-0.010
ety-0.010
ll-0.010
 Bian-0.009
TokenWow
Token ID22017
Feature activation+4.46e-01

Pos logit contributions
ze+0.003
zed+0.003
 character+0.002
hop+0.002
 tunes+0.002
Neg logit contributions
arl-0.005
ll-0.004
 upon-0.004
endant-0.004
ety-0.004
Token,
Token ID11
Feature activation-8.65e-01

Pos logit contributions
arl+0.010
 upon+0.009
ll+0.009
ety+0.009
endant+0.009
Neg logit contributions
ze-0.013
zed-0.012
 character-0.011
hop-0.011
 bottom-0.010
Token Lily
Token ID20037
Feature activation+4.52e-01

Pos logit contributions
ze+0.026
 character+0.023
zed+0.023
hop+0.021
 bottom+0.020
Neg logit contributions
arl-0.020
ll-0.019
ety-0.018
 upon-0.017
endant-0.017
Token!
Token ID0
Feature activation-4.18e-01

Pos logit contributions
arl+0.013
 upon+0.012
ll+0.012
ety+0.012
endant+0.012
Neg logit contributions
ze-0.011
zed-0.009
 character-0.009
hop-0.008
lder-0.008
Token Your
Token ID3406
Feature activation-1.53e-01

Pos logit contributions
ze+0.011
zed+0.009
 character+0.009
hop+0.008
lder+0.008
Neg logit contributions
arl-0.011
ll-0.011
 upon-0.011
ety-0.010
endant-0.010
Token painting
Token ID12036
Feature activation-2.10e+00

Pos logit contributions
ze+0.004
zed+0.004
 character+0.004
hop+0.003
 bottom+0.003
Neg logit contributions
arl-0.004
ll-0.004
 upon-0.004
ety-0.003
endant-0.003
Token is
Token ID318
Feature activation-9.48e-01

Pos logit contributions
ze+0.063
zed+0.058
 character+0.053
hop+0.053
 bottom+0.049
Neg logit contributions
arl-0.045
 upon-0.044
ll-0.043
ety-0.040
endant-0.038
Token so
Token ID523
Feature activation-1.03e+00

Pos logit contributions
ze+0.029
zed+0.026
hop+0.025
 character+0.024
lder+0.023
Neg logit contributions
arl-0.021
 upon-0.020
ll-0.019
ety-0.018
endant-0.017
Token pretty
Token ID2495
Feature activation+2.94e-01

Pos logit contributions
ze+0.033
zed+0.030
 character+0.028
hop+0.028
 bottom+0.026
Neg logit contributions
arl-0.022
 upon-0.020
ll-0.020
ety-0.019
endant-0.018
Token.
Token ID13
Feature activation-9.77e-01

Pos logit contributions
arl+0.007
 upon+0.006
ll+0.006
ety+0.006
endant+0.006
Neg logit contributions
ze-0.009
zed-0.008
 character-0.007
hop-0.007
 bottom-0.007
Token Can
Token ID1680
Feature activation+1.67e+00

Pos logit contributions
ze+0.027
zed+0.024
 character+0.023
hop+0.022
 tunes+0.020
Neg logit contributions
arl-0.024
ll-0.023
 upon-0.023
ety-0.022
endant-0.021
Token They
Token ID1119
Feature activation+2.68e-01

Pos logit contributions
ze+0.020
zed+0.019
 character+0.018
hop+0.018
 bottom+0.017
Neg logit contributions
arl-0.007
 upon-0.006
ll-0.006
ety-0.006
endant-0.005
Token wish
Token ID4601
Feature activation+1.57e+00

Pos logit contributions
arl+0.008
ll+0.008
 upon+0.008
ety+0.008
endant+0.007
Neg logit contributions
ze-0.006
zed-0.005
 character-0.005
 bottom-0.004
hop-0.004
Token they
Token ID484
Feature activation-5.93e-01

Pos logit contributions
arl+0.026
ety+0.023
ll+0.023
 upon+0.022
endant+0.021
Neg logit contributions
ze-0.057
zed-0.054
 character-0.051
hop-0.048
 bottom-0.048
Token did
Token ID750
Feature activation-1.66e+00

Pos logit contributions
ze+0.019
zed+0.017
 character+0.016
hop+0.016
 bottom+0.015
Neg logit contributions
arl-0.013
 upon-0.011
ll-0.011
ety-0.011
endant-0.010
Token not
Token ID407
Feature activation-9.78e-01

Pos logit contributions
ze+0.060
zed+0.055
 character+0.053
hop+0.052
lder+0.050
Neg logit contributions
arl-0.026
 upon-0.025
ll-0.023
ety-0.021
endant-0.020
Token fight
Token ID1907
Feature activation-2.12e+00

Pos logit contributions
ze+0.030
zed+0.027
 character+0.025
hop+0.024
 bottom+0.023
Neg logit contributions
arl-0.022
 upon-0.021
ll-0.020
ety-0.019
endant-0.018
Token.
Token ID13
Feature activation-6.37e-01

Pos logit contributions
ze+0.071
zed+0.070
lder+0.064
hop+0.064
 character+0.060
Neg logit contributions
 upon-0.037
arl-0.036
ety-0.032
ll-0.030
 ago-0.027
Token They
Token ID1119
Feature activation+4.95e-02

Pos logit contributions
ze+0.024
zed+0.022
 character+0.021
hop+0.021
 bottom+0.020
Neg logit contributions
arl-0.010
 upon-0.009
ll-0.008
ety-0.008
endant-0.007
Token want
Token ID765
Feature activation-6.11e-01

Pos logit contributions
arl+0.002
 upon+0.001
ll+0.001
ety+0.001
endant+0.001
Neg logit contributions
ze-0.001
zed-0.001
 character-0.001
hop-0.001
 bottom-0.001
Token to
Token ID284
Feature activation+2.42e-01

Pos logit contributions
ze+0.023
zed+0.021
hop+0.020
 character+0.019
lder+0.019
Neg logit contributions
arl-0.009
 upon-0.009
ll-0.008
ety-0.008
 Bian-0.007
Token play
Token ID711
Feature activation-3.31e-01

Pos logit contributions
arl+0.007
 upon+0.006
ll+0.006
ety+0.006
endant+0.006
Neg logit contributions
ze-0.006
zed-0.006
 character-0.005
hop-0.005
 bottom-0.005
Token careful
Token ID8161
Feature activation-5.28e-01

Pos logit contributions
arl+0.010
 upon+0.010
ll+0.009
ety+0.009
endant+0.008
Neg logit contributions
ze-0.018
zed-0.017
 character-0.016
hop-0.016
 bottom-0.015
Token not
Token ID407
Feature activation-6.49e-01

Pos logit contributions
ze+0.016
zed+0.015
 character+0.014
hop+0.014
 bottom+0.013
Neg logit contributions
arl-0.011
 upon-0.011
ll-0.011
ety-0.010
endant-0.009
Token to
Token ID284
Feature activation-9.92e-01

Pos logit contributions
ze+0.022
zed+0.019
 character+0.018
hop+0.018
 bottom+0.017
Neg logit contributions
arl-0.013
 upon-0.012
ll-0.011
ety-0.011
endant-0.010
Token release
Token ID2650
Feature activation+2.14e-01

Pos logit contributions
ze+0.030
zed+0.028
 character+0.026
hop+0.025
 tunes+0.023
Neg logit contributions
arl-0.023
 upon-0.021
ll-0.021
ety-0.020
endant-0.018
Token the
Token ID262
Feature activation-4.19e-01

Pos logit contributions
arl+0.005
ll+0.005
 upon+0.005
ety+0.005
endant+0.005
Neg logit contributions
ze-0.006
zed-0.005
 character-0.005
hop-0.005
 bottom-0.005
Token snowball
Token ID46275
Feature activation-2.60e+00

Pos logit contributions
ze+0.010
 bottom+0.008
zed+0.007
 character+0.007
 middle+0.007
Neg logit contributions
arl-0.013
ll-0.012
 Bian-0.012
ety-0.011
 upon-0.011
Token!"
Token ID2474
Feature activation-5.42e-01

Pos logit contributions
ze+0.078
zed+0.075
 character+0.071
hop+0.068
 tunes+0.066
Neg logit contributions
arl-0.055
ll-0.051
 upon-0.051
ety-0.048
endant-0.042
Token
Token ID220
Feature activation-3.36e-02

Pos logit contributions
ze+0.015
zed+0.014
 character+0.013
hop+0.013
lder+0.011
Neg logit contributions
arl-0.014
endant-0.013
ll-0.013
 upon-0.012
 ago-0.012
Token
Token ID198
Feature activation-2.01e-01

Pos logit contributions
ze+0.001
zed+0.001
 character+0.001
hop+0.001
 bottom+0.001
Neg logit contributions
arl-0.001
endant-0.001
ety-0.001
ll-0.001
 upon-0.001
Token
Token ID198
Feature activation+6.34e-01

Pos logit contributions
ze+0.006
zed+0.006
hop+0.005
 character+0.005
 bottom+0.005
Neg logit contributions
arl-0.004
endant-0.004
 upon-0.004
ety-0.004
 ago-0.004
TokenThey
Token ID2990
Feature activation-1.01e-01

Pos logit contributions
arl+0.013
ll+0.013
 upon+0.013
ety+0.012
 Bian+0.011
Neg logit contributions
ze-0.019
zed-0.018
 character-0.017
hop-0.017
 bottom-0.016