TOP ACTIVATIONS
MAX = 4.310

nation . Anna and Ben feel happy . They
. L ila and Ben are sad . They
to go home . Anna and Ben say goodbye to each
. It dropped . Lily and Ben are sad . They
them . Anna and Ben smile . They feel
and flies away . Lily and Ben wave goodbye to the
. L ily and Ben scream . They drop
. L ily and Ben are happy too .
eer . Anna and Ben go back to their
horn . Anna and Ben run to the slide
. L ily and Max are sorry . They
from the shore . Lily and Tom are scared .
sound . Anna and Ben gasp . They look
wine ?" Anna and Ben shake their heads .
. L ily and Ben swim to their dad
fire . Anna and Ben scream . They jump
. S ara and Ben peek from the fence
mean . Anna and Ben are not lucky .
it . Anna and Ben are happy . They
. L ila and Ben have a fun day

MIN ACTIVATIONS
MIN = -4.056

back the village to its former state . With the help
the village back to its former glory !"
, leaving it to its former state .
to be restored to its former glory ."
back the other toys you don 't want ."
the tree to its once former beauty . He was happy
They had so much fun swimming and playing . From that
Leave this door shut and don 't let it out ."
add some spark ly shoes too !" The shop
Here , hold yours and don 't lose it , okay
your turn . If you don 't , we will go
and hold hands so we don 't get lost ." So
That 's right . Trees don 't suddenly move or fall
not get angry when they don 't fit . <|endoftext|> Once
with lots of water birds swimming in it . He looked
get rewarded and when you don 't listen , you get
need to drink some water too ." Anna and
will be sad if they don 't get the letters .
. Hold on tight and don 't let go ." Mom
make us sick if we don 't wear jackets ."

INTERVAL 2.219 - 4.310
CONTAINS 0.097%

a loud noise . Lily and Tom heard someone scream .
." L ily and Ben felt bad . They
He is proud of Lily and Tom . He knows they
Lucy 's mom was not troubled anymore , and they all
Nicola wanted to put a smile on her little face so

INTERVAL 0.127 - 2.219
CONTAINS 40.313%

." Sara said . " That 's right .
. OK ?" Tom and Mia smile . They
. One day , Anna wants to play with Ben at
, I don 't know . He was here a minute
why did you do that ? You shouldn 't disturb things

INTERVAL -1.964 - 0.127
CONTAINS 59.481%

who he was . The mouse replied , "
be part of the perfect parade . He waved goodbye to
his sc ooter and hurt his knee . A little girl
had learned an important lesson , that it 's always important
continued to play together , happy to have each other as

INTERVAL -4.056 - -1.964
CONTAINS 0.109%

a kind mail man named Max . Max had a big
man and woman shared their first kiss , the guests all
was a little bunny named Benny . Benny loved to bounce
getting more terrible by the second . Tony looked around for
in the park with her friends . One day , her
Token nation
Token ID3277
Feature activation+4.02e-01

Pos logit contributions
 decided+0.000
gl+0.000
year+0.000
 inside+0.000
Mill+0.000
Neg logit contributions
 salon-0.000
 mat-0.000
 losers-0.000
 mats-0.000
 yoga-0.000
Token.
Token ID13
Feature activation+4.36e-01

Pos logit contributions
 salon+0.012
ucker+0.011
 pose+0.011
 mat+0.011
 Fro+0.010
Neg logit contributions
 decided-0.007
 inside-0.006
 UFO-0.006
 should-0.005
 mainland-0.005
Token
Token ID198
Feature activation+8.18e-02

Pos logit contributions
 salon+0.011
ucker+0.011
 Fro+0.010
ips+0.010
 Dr+0.010
Neg logit contributions
 decided-0.009
 inside-0.009
 should-0.008
 soon-0.008
 even-0.008
Token
Token ID198
Feature activation+4.71e-01

Pos logit contributions
ucker+0.002
 Fro+0.002
 salon+0.002
 Dr+0.002
 Mara+0.002
Neg logit contributions
 decided-0.002
 inside-0.002
 UFO-0.001
 should-0.001
 even-0.001
TokenAnna
Token ID31160
Feature activation+2.02e+00

Pos logit contributions
ucker+0.010
 salon+0.010
 Fro+0.009
ips+0.009
ittens+0.009
Neg logit contributions
 decided-0.012
 inside-0.011
 should-0.010
 soon-0.010
 even-0.010
Token and
Token ID290
Feature activation+4.08e+00

Pos logit contributions
ucker+0.060
 salon+0.060
 Fro+0.056
ittens+0.056
ips+0.056
Neg logit contributions
 decided-0.033
 inside-0.031
 should-0.026
 soon-0.025
 even-0.025
Token Ben
Token ID3932
Feature activation+4.32e-01

Pos logit contributions
ucker+0.099
eh+0.097
ittens+0.090
 hind+0.082
liv+0.081
Neg logit contributions
 inside-0.102
 next-0.098
 decided-0.089
 soon-0.085
 even-0.085
Token feel
Token ID1254
Feature activation-8.81e-01

Pos logit contributions
 salon+0.009
ucker+0.008
 Fro+0.008
ips+0.008
 Dr+0.008
Neg logit contributions
 decided-0.011
 inside-0.011
 soon-0.010
 should-0.010
 even-0.009
Token happy
Token ID3772
Feature activation+3.27e-01

Pos logit contributions
 decided+0.019
 inside+0.019
 should+0.017
 even+0.017
 soon+0.016
Neg logit contributions
 salon-0.022
ucker-0.022
ips-0.020
ittens-0.020
 pose-0.020
Token.
Token ID13
Feature activation-2.16e-02

Pos logit contributions
ucker+0.006
 elbow+0.006
 salon+0.006
ips+0.006
 Dr+0.006
Neg logit contributions
 decided-0.007
 UFO-0.007
 generation-0.007
col-0.007
 mainland-0.007
Token They
Token ID1119
Feature activation+9.32e-01

Pos logit contributions
 decided+0.000
 inside+0.000
 even+0.000
 soon+0.000
 should+0.000
Neg logit contributions
ucker-0.000
 salon-0.000
 Fro-0.000
 Mara-0.000
 Dr-0.000
Token.
Token ID13
Feature activation+3.99e-01

Pos logit contributions
 Leo+0.006
 Fro+0.006
 Dr+0.006
ips+0.005
 salon+0.005
Neg logit contributions
 UFO-0.006
 decided-0.006
col-0.006
year-0.006
 mainland-0.006
Token
Token ID198
Feature activation-2.42e-01

Pos logit contributions
ucker+0.010
 salon+0.010
 Fro+0.009
 Dr+0.009
 Mara+0.009
Neg logit contributions
 decided-0.008
 inside-0.008
 even-0.007
 should-0.007
 soon-0.007
Token
Token ID198
Feature activation+1.71e-01

Pos logit contributions
 decided+0.005
 inside+0.005
 UFO+0.004
makers+0.004
 even+0.004
Neg logit contributions
ucker-0.006
 Fro-0.006
 Dr-0.006
 salon-0.006
ilo-0.006
TokenL
Token ID43
Feature activation+5.30e-01

Pos logit contributions
ucker+0.004
 salon+0.004
 Fro+0.003
ilo+0.003
ittens+0.003
Neg logit contributions
 inside-0.004
 decided-0.004
 UFO-0.004
makers-0.004
 even-0.004
Tokenila
Token ID10102
Feature activation+1.85e+00

Pos logit contributions
 salon+0.010
ucker+0.010
 hind+0.009
 Dr+0.009
 losers+0.009
Neg logit contributions
 decided-0.015
 inside-0.014
 soon-0.013
 should-0.013
 next-0.012
Token and
Token ID290
Feature activation+3.96e+00

Pos logit contributions
 salon+0.054
ucker+0.053
 Fro+0.050
ips+0.050
 Dr+0.049
Neg logit contributions
 decided-0.031
 inside-0.029
 should-0.025
 soon-0.025
 even-0.024
Token Ben
Token ID3932
Feature activation+5.69e-01

Pos logit contributions
eh+0.101
ucker+0.098
ittens+0.092
 hind+0.089
 losers+0.087
Neg logit contributions
 inside-0.091
 next-0.089
 decided-0.083
 soon-0.078
 even-0.075
Token are
Token ID389
Feature activation+1.82e-01

Pos logit contributions
 salon+0.010
ucker+0.009
 Fro+0.009
 losers+0.009
 pose+0.009
Neg logit contributions
 decided-0.015
 inside-0.015
 generations-0.014
 UFO-0.014
makers-0.014
Token sad
Token ID6507
Feature activation+5.51e-01

Pos logit contributions
 salon+0.005
ucker+0.005
ittens+0.005
ilo+0.005
 Fro+0.005
Neg logit contributions
 inside-0.004
 decided-0.004
 even-0.003
 soon-0.003
 should-0.003
Token.
Token ID13
Feature activation+2.57e-01

Pos logit contributions
ucker+0.014
 salon+0.014
ips+0.013
 pose+0.013
 mat+0.013
Neg logit contributions
 decided-0.011
 UFO-0.010
 generation-0.009
col-0.009
 mainland-0.009
Token They
Token ID1119
Feature activation+6.88e-01

Pos logit contributions
ucker+0.006
 salon+0.006
 Fro+0.006
 Dr+0.005
 Mara+0.005
Neg logit contributions
 decided-0.006
 inside-0.006
 soon-0.005
 even-0.005
 should-0.005
Token to
Token ID284
Feature activation-6.99e-01

Pos logit contributions
ucker+0.001
 salon+0.001
ips+0.001
 Fro+0.001
ittens+0.001
Neg logit contributions
 decided-0.002
 inside-0.001
 should-0.001
 UFO-0.001
 even-0.001
Token go
Token ID467
Feature activation-1.24e+00

Pos logit contributions
 decided+0.017
 inside+0.016
makers+0.016
 should+0.016
 UFO+0.015
Neg logit contributions
 salon-0.015
 pose-0.014
ags-0.013
 losers-0.013
 mat-0.013
Token home
Token ID1363
Feature activation-4.15e-01

Pos logit contributions
 decided+0.023
 inside+0.020
 should+0.019
 UFO+0.018
makers+0.018
Neg logit contributions
 salon-0.034
ucker-0.033
ips-0.032
 Fro-0.031
ags-0.031
Token.
Token ID13
Feature activation+7.00e-02

Pos logit contributions
 decided+0.007
 UFO+0.006
 inside+0.006
 should+0.006
makers+0.005
Neg logit contributions
 salon-0.012
ucker-0.012
 Dr-0.011
 pose-0.011
ips-0.011
Token Anna
Token ID11735
Feature activation+1.73e+00

Pos logit contributions
ucker+0.002
 salon+0.002
eh+0.002
ittens+0.002
 Fro+0.002
Neg logit contributions
 decided-0.001
 inside-0.001
 should-0.001
 soon-0.001
 even-0.001
Token and
Token ID290
Feature activation+3.90e+00

Pos logit contributions
ucker+0.056
 salon+0.054
ittens+0.054
 Fro+0.053
ilo+0.052
Neg logit contributions
 decided-0.025
 inside-0.024
 should-0.020
 soon-0.019
 even-0.019
Token Ben
Token ID3932
Feature activation+4.92e-01

Pos logit contributions
ucker+0.119
eh+0.116
liv+0.109
ittens+0.108
 hind+0.107
Neg logit contributions
 next-0.077
 inside-0.077
 decided-0.070
 two-0.068
 soon-0.066
Token say
Token ID910
Feature activation+3.41e-01

Pos logit contributions
 salon+0.011
ucker+0.011
 Fro+0.010
 pose+0.010
ips+0.010
Neg logit contributions
 decided-0.011
 inside-0.011
 should-0.010
 UFO-0.010
makers-0.009
Token goodbye
Token ID24829
Feature activation+3.10e-01

Pos logit contributions
 salon+0.007
ucker+0.007
 Fro+0.006
ips+0.006
 Dr+0.006
Neg logit contributions
 decided-0.009
 inside-0.008
 should-0.007
 even-0.007
 soon-0.007
Token to
Token ID284
Feature activation+4.44e-01

Pos logit contributions
 salon+0.006
ucker+0.006
 pose+0.006
 Fro+0.006
ow+0.006
Neg logit contributions
 decided-0.007
 UFO-0.007
 should-0.007
 generations-0.006
 lighthouse-0.006
Token each
Token ID1123
Feature activation+1.15e+00

Pos logit contributions
ucker+0.012
eh+0.011
ittens+0.011
 salon+0.011
pill+0.010
Neg logit contributions
 inside-0.010
 decided-0.009
 even-0.009
 soon-0.008
 next-0.008
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
 decided+0.004
 mainland+0.004
 UFO+0.004
 should+0.004
makers+0.004
Neg logit contributions
 salon-0.004
 Dr-0.004
ips-0.004
ucker-0.004
 mat-0.004
Token It
Token ID632
Feature activation+5.50e-01

Pos logit contributions
ucker+0.000
 salon+0.000
 Dr+0.000
 Fro+0.000
ips+0.000
Neg logit contributions
 decided-0.000
 inside-0.000
 soon-0.000
 even-0.000
 should-0.000
Token dropped
Token ID5710
Feature activation-1.99e-01

Pos logit contributions
ittens+0.016
ucker+0.016
 salon+0.015
ags+0.015
ilo+0.014
Neg logit contributions
 decided-0.011
 inside-0.010
 should-0.010
 even-0.009
 soon-0.009
Token.
Token ID13
Feature activation+2.09e-01

Pos logit contributions
 decided+0.006
 mainland+0.006
 lifetime+0.006
edient+0.005
 generation+0.005
Neg logit contributions
ips-0.003
 Dr-0.003
 Leo-0.003
 Luna-0.003
ags-0.003
Token Lily
Token ID20037
Feature activation+1.31e+00

Pos logit contributions
ucker+0.005
 salon+0.004
 Fro+0.004
 Dr+0.004
ips+0.004
Neg logit contributions
 decided-0.005
 inside-0.005
 soon-0.004
 even-0.004
 should-0.004
Token and
Token ID290
Feature activation+3.89e+00

Pos logit contributions
 salon+0.041
ucker+0.041
 Fro+0.039
ittens+0.038
ilo+0.038
Neg logit contributions
 decided-0.020
 inside-0.019
 should-0.016
 even-0.015
 soon-0.015
Token Ben
Token ID3932
Feature activation+5.15e-02

Pos logit contributions
eh+0.100
ucker+0.096
pill+0.094
ittens+0.089
 losers+0.088
Neg logit contributions
 inside-0.095
 decided-0.084
 next-0.084
 sunset-0.079
 even-0.078
Token are
Token ID389
Feature activation-3.28e-01

Pos logit contributions
ucker+0.001
 salon+0.001
 Fro+0.001
ips+0.001
 losers+0.001
Neg logit contributions
 decided-0.001
 inside-0.001
 UFO-0.001
 soon-0.001
makers-0.001
Token sad
Token ID6507
Feature activation+8.50e-02

Pos logit contributions
 inside+0.007
 decided+0.007
 even+0.006
 should+0.006
 soon+0.006
Neg logit contributions
 salon-0.009
ucker-0.008
 Fro-0.008
ilo-0.008
ittens-0.008
Token.
Token ID13
Feature activation+2.66e-01

Pos logit contributions
ucker+0.002
 salon+0.002
ips+0.002
inches+0.002
ittens+0.002
Neg logit contributions
 decided-0.002
 UFO-0.002
 generation-0.002
col-0.001
 mainland-0.001
Token They
Token ID1119
Feature activation+4.56e-01

Pos logit contributions
ucker+0.006
 salon+0.006
 Fro+0.006
 Dr+0.006
 Mara+0.006
Neg logit contributions
 decided-0.006
 inside-0.005
 even-0.005
 soon-0.005
 UFO-0.005
Token them
Token ID606
Feature activation-7.10e-03

Pos logit contributions
 decided+0.014
 UFO+0.013
 inside+0.012
inct+0.012
col+0.012
Neg logit contributions
 Leo-0.009
 Dr-0.009
 salon-0.009
 yoga-0.009
 Mara-0.009
Token.
Token ID13
Feature activation-5.53e-01

Pos logit contributions
 decided+0.000
 UFO+0.000
col+0.000
 mainland+0.000
makers+0.000
Neg logit contributions
 Dr-0.000
 salon-0.000
 losers-0.000
 pose-0.000
 Fro-0.000
Token
Token ID198
Feature activation-5.78e-01

Pos logit contributions
 decided+0.013
 inside+0.013
makers+0.012
 UFO+0.012
lling+0.012
Neg logit contributions
 Dr-0.012
ucker-0.011
 Leo-0.011
 Mara-0.011
 Velvet-0.011
Token
Token ID198
Feature activation+1.10e-01

Pos logit contributions
 decided+0.015
 UFO+0.013
 inside+0.013
 generation+0.012
 even+0.012
Neg logit contributions
 Dr-0.012
ucker-0.012
 Fro-0.012
 Leo-0.011
 Luna-0.011
TokenAnna
Token ID31160
Feature activation+2.20e+00

Pos logit contributions
 salon+0.003
ucker+0.003
 Mara+0.002
ips+0.002
 Dr+0.002
Neg logit contributions
 decided-0.002
 inside-0.002
 UFO-0.002
 should-0.002
 even-0.002
Token and
Token ID290
Feature activation+3.85e+00

Pos logit contributions
 salon+0.064
ucker+0.063
 Fro+0.060
ips+0.059
 Mara+0.059
Neg logit contributions
 decided-0.037
 inside-0.035
 should-0.030
 soon-0.029
 even-0.029
Token Ben
Token ID3932
Feature activation+8.68e-01

Pos logit contributions
eh+0.118
ucker+0.116
liv+0.112
pill+0.107
live+0.102
Neg logit contributions
 inside-0.073
 next-0.072
 two-0.065
 decided-0.065
 even-0.063
Token smile
Token ID8212
Feature activation+1.94e-01

Pos logit contributions
 salon+0.015
ucker+0.014
 pose+0.013
 Dr+0.013
ags+0.013
Neg logit contributions
 decided-0.024
 inside-0.023
 UFO-0.022
makers-0.022
 generations-0.022
Token.
Token ID13
Feature activation-5.11e-01

Pos logit contributions
ucker+0.005
 Dr+0.004
ips+0.004
ags+0.004
 pose+0.004
Neg logit contributions
 UFO-0.004
col-0.004
 decided-0.004
 generation-0.004
makers-0.004
Token They
Token ID1119
Feature activation+1.28e+00

Pos logit contributions
 decided+0.013
 inside+0.012
 UFO+0.012
makers+0.012
 soon+0.011
Neg logit contributions
ucker-0.010
 Mara-0.010
 Dr-0.009
 salon-0.009
 Leo-0.009
Token feel
Token ID1254
Feature activation-3.87e-01

Pos logit contributions
 salon+0.028
ucker+0.027
 Fro+0.025
 Dr+0.025
 losers+0.025
Neg logit contributions
 decided-0.030
 inside-0.030
 soon-0.026
makers-0.026
 UFO-0.026
Token and
Token ID290
Feature activation+1.43e+00

Pos logit contributions
ucker+0.000
 salon+0.000
 Fro+0.000
ips+0.000
 pose+0.000
Neg logit contributions
 decided-0.000
 inside-0.000
 should-0.000
 soon-0.000
makers-0.000
Token flies
Token ID17607
Feature activation-2.65e-01

Pos logit contributions
 salon+0.028
ucker+0.028
 Fro+0.026
 pose+0.025
ips+0.025
Neg logit contributions
 decided-0.037
 inside-0.036
 should-0.032
makers-0.032
 soon-0.032
Token away
Token ID1497
Feature activation+4.06e-01

Pos logit contributions
 UFO+0.006
 generations+0.006
 decided+0.006
makers+0.006
 mainland+0.006
Neg logit contributions
 Fro-0.005
 Leo-0.005
ags-0.005
 Luna-0.005
 Velvet-0.005
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
ucker+0.009
 pose+0.009
 salon+0.009
 Fro+0.009
ips+0.009
Neg logit contributions
 decided-0.008
 UFO-0.008
 should-0.007
 mainland-0.007
 generation-0.007
Token Lily
Token ID20037
Feature activation+2.40e+00

Pos logit contributions
ucker+0.006
 salon+0.006
 Fro+0.005
ips+0.005
 Dr+0.005
Neg logit contributions
 decided-0.005
 inside-0.005
 should-0.004
 soon-0.004
 even-0.004
Token and
Token ID290
Feature activation+3.84e+00

Pos logit contributions
 salon+0.082
ittens+0.082
ucker+0.081
eh+0.079
ilo+0.078
Neg logit contributions
 decided-0.030
 inside-0.028
 should-0.023
 even-0.022
 soon-0.021
Token Ben
Token ID3932
Feature activation+9.17e-01

Pos logit contributions
eh+0.119
 hind+0.108
ucker+0.105
ittens+0.104
imm+0.104
Neg logit contributions
 inside-0.072
 next-0.069
 decided-0.063
 two-0.062
 sunset-0.060
Token wave
Token ID6769
Feature activation+3.81e-01

Pos logit contributions
 salon+0.017
ucker+0.017
 Fro+0.016
 pose+0.016
 losers+0.016
Neg logit contributions
 decided-0.024
 inside-0.023
makers-0.021
 UFO-0.021
 soon-0.021
Token goodbye
Token ID24829
Feature activation+4.81e-01

Pos logit contributions
 salon+0.010
ucker+0.010
ittens+0.010
ilo+0.009
ips+0.009
Neg logit contributions
 decided-0.008
 inside-0.007
 even-0.007
 soon-0.006
 should-0.006
Token to
Token ID284
Feature activation+3.14e-01

Pos logit contributions
ucker+0.009
 salon+0.009
 Fro+0.008
 pose+0.008
 Leo+0.008
Neg logit contributions
 decided-0.012
 UFO-0.011
 should-0.011
 inside-0.011
 generations-0.011
Token the
Token ID262
Feature activation-7.35e-01

Pos logit contributions
ucker+0.008
 salon+0.008
ittens+0.007
eh+0.007
ips+0.007
Neg logit contributions
 inside-0.007
 decided-0.007
 even-0.006
 soon-0.006
 should-0.006
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
 decided+0.007
 inside+0.007
 UFO+0.006
 should+0.006
makers+0.006
Neg logit contributions
 salon-0.013
ucker-0.013
 Fro-0.012
 Dr-0.012
 pose-0.012
Token
Token ID198
Feature activation-1.55e-02

Pos logit contributions
 salon+0.004
ucker+0.004
 pose+0.004
ips+0.004
 elbow+0.004
Neg logit contributions
 decided-0.003
 inside-0.002
 should-0.002
 soon-0.002
 even-0.002
Token
Token ID198
Feature activation+1.72e-01

Pos logit contributions
 decided+0.000
 inside+0.000
 UFO+0.000
 even+0.000
makers+0.000
Neg logit contributions
ucker-0.000
 Fro-0.000
 Dr-0.000
 salon-0.000
 Velvet-0.000
TokenL
Token ID43
Feature activation+5.98e-01

Pos logit contributions
ucker+0.004
 salon+0.003
 Dr+0.003
ilo+0.003
 Mara+0.003
Neg logit contributions
 inside-0.004
 decided-0.004
 UFO-0.004
 even-0.004
makers-0.004
Tokenily
Token ID813
Feature activation+1.83e+00

Pos logit contributions
ucker+0.019
 salon+0.019
 Fro+0.018
 Dr+0.018
ittens+0.018
Neg logit contributions
 inside-0.009
 decided-0.009
 should-0.007
makers-0.007
 even-0.007
Token and
Token ID290
Feature activation+3.83e+00

Pos logit contributions
 salon+0.063
ucker+0.062
ittens+0.061
ags+0.059
ilo+0.058
Neg logit contributions
 decided-0.023
 inside-0.022
 should-0.019
 even-0.016
 soon-0.016
Token Ben
Token ID3932
Feature activation+4.38e-01

Pos logit contributions
eh+0.112
pill+0.110
ucker+0.108
ittens+0.101
 hind+0.097
Neg logit contributions
 inside-0.084
 next-0.076
 decided-0.071
 two-0.068
 soon-0.066
Token scream
Token ID8196
Feature activation-5.16e-01

Pos logit contributions
 salon+0.008
ucker+0.008
 Fro+0.008
ips+0.008
 pose+0.008
Neg logit contributions
 decided-0.011
 inside-0.011
 soon-0.010
 should-0.010
 even-0.010
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
 decided+0.011
 UFO+0.010
 mainland+0.010
 generation+0.010
 generations+0.010
Neg logit contributions
ucker-0.012
 Dr-0.011
 Fro-0.011
 pose-0.011
 Luna-0.011
Token They
Token ID1119
Feature activation+7.26e-01

Pos logit contributions
ucker+0.004
 salon+0.004
 Fro+0.004
 Dr+0.004
 Mara+0.004
Neg logit contributions
 decided-0.004
 inside-0.003
 even-0.003
 should-0.003
 soon-0.003
Token drop
Token ID4268
Feature activation+2.71e-01

Pos logit contributions
 salon+0.016
ucker+0.016
 Fro+0.014
ips+0.014
ittens+0.014
Neg logit contributions
 decided-0.018
 inside-0.017
 should-0.015
 soon-0.015
 even-0.015
Token.
Token ID13
Feature activation-1.20e-01

Pos logit contributions
 decided+0.005
 inside+0.004
 UFO+0.004
 should+0.004
makers+0.004
Neg logit contributions
 salon-0.009
ucker-0.009
 Fro-0.008
 pose-0.008
ips-0.008
Token
Token ID198
Feature activation-4.71e-01

Pos logit contributions
 decided+0.002
 inside+0.002
 should+0.002
 soon+0.002
 even+0.002
Neg logit contributions
 salon-0.003
ucker-0.003
ips-0.003
 pose-0.003
ittens-0.003
Token
Token ID198
Feature activation+7.47e-02

Pos logit contributions
 decided+0.010
 inside+0.009
 UFO+0.008
 even+0.008
 should+0.008
Neg logit contributions
ucker-0.012
 Fro-0.012
 salon-0.012
 Dr-0.011
 Mara-0.011
TokenL
Token ID43
Feature activation+3.26e-01

Pos logit contributions
 salon+0.002
ucker+0.002
 Fro+0.002
ips+0.002
 pose+0.002
Neg logit contributions
 decided-0.002
 inside-0.001
 should-0.001
 soon-0.001
 even-0.001
Tokenily
Token ID813
Feature activation+2.30e+00

Pos logit contributions
 salon+0.011
ucker+0.011
 hind+0.011
 pose+0.010
 Fro+0.010
Neg logit contributions
 inside-0.004
 decided-0.004
makers-0.003
 should-0.003
 next-0.003
Token and
Token ID290
Feature activation+3.83e+00

Pos logit contributions
ittens+0.077
ucker+0.077
 salon+0.077
ilo+0.074
eh+0.073
Neg logit contributions
 decided-0.030
 inside-0.029
 should-0.022
 even-0.021
 soon-0.021
Token Ben
Token ID3932
Feature activation+8.21e-01

Pos logit contributions
ucker+0.108
eh+0.107
pill+0.095
 hind+0.095
 salon+0.095
Neg logit contributions
 inside-0.080
 next-0.074
 decided-0.068
 soon-0.067
 two-0.066
Token are
Token ID389
Feature activation+2.45e-01

Pos logit contributions
 salon+0.017
ucker+0.017
 Fro+0.016
ips+0.015
 Dr+0.015
Neg logit contributions
 decided-0.021
 inside-0.020
 should-0.018
 soon-0.018
 even-0.018
Token happy
Token ID3772
Feature activation+9.53e-01

Pos logit contributions
 salon+0.007
ucker+0.007
pill+0.007
ittens+0.007
ilo+0.007
Neg logit contributions
 inside-0.004
 decided-0.004
 even-0.004
 soon-0.004
 should-0.003
Token too
Token ID1165
Feature activation-6.51e-01

Pos logit contributions
ucker+0.026
 salon+0.025
 Fro+0.025
 Dr+0.025
 Leo+0.025
Neg logit contributions
 decided-0.017
 UFO-0.015
 inside-0.014
 generation-0.014
 should-0.014
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
 decided+0.013
 UFO+0.011
col+0.011
 inside+0.011
 generation+0.010
Neg logit contributions
ucker-0.016
 Dr-0.016
 Leo-0.015
 Fro-0.015
ittens-0.015
Tokeneer
Token ID28153
Feature activation-4.75e-01

Pos logit contributions
 decided+0.028
 inside+0.026
 should+0.024
 soon+0.024
 even+0.024
Neg logit contributions
ucker-0.022
 salon-0.022
 Fro-0.020
ips-0.020
ittens-0.020
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
 decided+0.009
 UFO+0.008
 mainland+0.008
makers+0.008
 generation+0.008
Neg logit contributions
 salon-0.012
 Fro-0.012
ucker-0.012
 Leo-0.012
 pose-0.011
Token
Token ID198
Feature activation-5.48e-01

Pos logit contributions
 decided+0.003
 inside+0.003
 UFO+0.002
 should+0.002
makers+0.002
Neg logit contributions
ucker-0.002
 salon-0.002
 Dr-0.002
 Fro-0.002
 Mara-0.002
Token
Token ID198
Feature activation-2.11e-01

Pos logit contributions
 decided+0.012
 inside+0.011
 UFO+0.010
 should+0.010
makers+0.010
Neg logit contributions
ucker-0.013
 Dr-0.013
 Fro-0.013
 salon-0.012
 Mara-0.012
TokenAnna
Token ID31160
Feature activation+1.81e+00

Pos logit contributions
 decided+0.005
 inside+0.005
 should+0.004
 UFO+0.004
 even+0.004
Neg logit contributions
 salon-0.005
ucker-0.005
 Dr-0.004
 Mara-0.004
 Fro-0.004
Token and
Token ID290
Feature activation+3.82e+00

Pos logit contributions
 salon+0.054
ucker+0.054
 Fro+0.051
ips+0.051
ittens+0.051
Neg logit contributions
 decided-0.029
 inside-0.027
 should-0.023
 soon-0.023
 even-0.022
Token Ben
Token ID3932
Feature activation+3.77e-01

Pos logit contributions
ucker+0.091
eh+0.078
pill+0.075
liv+0.075
 salon+0.073
Neg logit contributions
 inside-0.100
 decided-0.096
 next-0.094
 soon-0.088
 even-0.088
Token go
Token ID467
Feature activation-6.39e-01

Pos logit contributions
 salon+0.006
ucker+0.006
 pose+0.006
ags+0.006
 Fro+0.006
Neg logit contributions
 decided-0.010
 inside-0.010
 generations-0.009
makers-0.009
 UFO-0.009
Token back
Token ID736
Feature activation+2.95e-01

Pos logit contributions
 decided+0.013
 should+0.013
 UFO+0.013
 generations+0.012
 generation+0.012
Neg logit contributions
 salon-0.015
ips-0.014
 Fro-0.014
ucker-0.013
 Dr-0.013
Token to
Token ID284
Feature activation+5.92e-01

Pos logit contributions
 salon+0.006
ucker+0.006
ips+0.006
 Fro+0.005
 Dr+0.005
Neg logit contributions
 decided-0.007
 should-0.007
 UFO-0.007
 inside-0.006
 even-0.006
Token their
Token ID511
Feature activation+9.83e-02

Pos logit contributions
ucker+0.014
 salon+0.013
ittens+0.012
eh+0.012
ips+0.012
Neg logit contributions
 inside-0.014
 decided-0.014
 soon-0.013
 even-0.012
 should-0.012
Token horn
Token ID12718
Feature activation+1.56e-01

Pos logit contributions
 salon+0.007
ucker+0.007
 pose+0.006
 Fro+0.006
ittens+0.006
Neg logit contributions
 decided-0.008
 inside-0.008
 should-0.007
makers-0.007
 soon-0.007
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
 salon+0.004
ucker+0.004
 pose+0.004
 losers+0.004
ips+0.004
Neg logit contributions
 decided-0.003
 inside-0.002
 UFO-0.002
makers-0.002
 generations-0.002
Token
Token ID198
Feature activation-3.87e-01

Pos logit contributions
 salon+0.003
ucker+0.003
 Fro+0.003
ips+0.003
 Dr+0.003
Neg logit contributions
 decided-0.003
 inside-0.003
 should-0.003
 soon-0.003
 even-0.003
Token
Token ID198
Feature activation-8.17e-03

Pos logit contributions
 decided+0.009
 inside+0.008
 UFO+0.008
 generation+0.008
 gem+0.008
Neg logit contributions
 Dr-0.008
 Fro-0.008
ucker-0.008
 Leo-0.008
 Velvet-0.008
TokenAnna
Token ID31160
Feature activation+2.24e+00

Pos logit contributions
 decided+0.000
 inside+0.000
 UFO+0.000
 should+0.000
 even+0.000
Neg logit contributions
 salon-0.000
ucker-0.000
 Dr-0.000
ilo-0.000
 losers-0.000
Token and
Token ID290
Feature activation+3.79e+00

Pos logit contributions
ucker+0.069
 salon+0.069
ittens+0.065
 Fro+0.064
ips+0.064
Neg logit contributions
 decided-0.034
 inside-0.033
 should-0.027
 soon-0.027
 even-0.027
Token Ben
Token ID3932
Feature activation+6.19e-01

Pos logit contributions
ucker+0.106
eh+0.099
ittens+0.092
liv+0.087
 hind+0.082
Neg logit contributions
 inside-0.086
 next-0.081
 decided-0.075
 two-0.073
 even-0.071
Token run
Token ID1057
Feature activation-7.29e-01

Pos logit contributions
 salon+0.011
ucker+0.010
 Fro+0.010
 losers+0.010
 pose+0.010
Neg logit contributions
 decided-0.017
 inside-0.016
makers-0.015
 UFO-0.015
 generations-0.015
Token to
Token ID284
Feature activation+3.02e-01

Pos logit contributions
 decided+0.018
 UFO+0.018
 should+0.017
 generations+0.017
edient+0.017
Neg logit contributions
 salon-0.013
 Fro-0.013
 Dr-0.013
 losers-0.012
 pose-0.012
Token the
Token ID262
Feature activation+9.71e-02

Pos logit contributions
ucker+0.007
 salon+0.006
ittens+0.006
ips+0.006
ilo+0.006
Neg logit contributions
 decided-0.007
 inside-0.007
 soon-0.007
 even-0.007
 should-0.006
Token slide
Token ID10649
Feature activation-8.64e-02

Pos logit contributions
 salon+0.002
 mat+0.002
 Dr+0.002
 losers+0.002
ags+0.002
Neg logit contributions
 decided-0.003
 inside-0.002
 should-0.002
 generations-0.002
makers-0.002
Token.
Token ID13
Feature activation-7.54e-02

Pos logit contributions
 decided+0.004
 UFO+0.004
 should+0.003
col+0.003
makers+0.003
Neg logit contributions
 salon-0.006
ags-0.005
ips-0.005
 mat-0.005
ucker-0.005
Token
Token ID198
Feature activation-3.90e-01

Pos logit contributions
 decided+0.002
 inside+0.002
 UFO+0.001
 even+0.001
makers+0.001
Neg logit contributions
ucker-0.002
 Dr-0.002
 Velvet-0.002
 salon-0.002
 Fro-0.002
Token
Token ID198
Feature activation+2.23e-01

Pos logit contributions
 decided+0.010
 UFO+0.009
 inside+0.009
 generation+0.008
 gem+0.008
Neg logit contributions
 Dr-0.008
ucker-0.008
 Fro-0.008
 Velvet-0.008
 Luna-0.008
TokenL
Token ID43
Feature activation+3.26e-01

Pos logit contributions
 salon+0.004
 Velvet+0.004
ucker+0.004
ittens+0.004
 Dr+0.004
Neg logit contributions
 inside-0.006
 decided-0.006
 UFO-0.006
 should-0.005
 even-0.005
Tokenily
Token ID813
Feature activation+2.16e+00

Pos logit contributions
ucker+0.010
 salon+0.010
 Fro+0.010
 pose+0.010
 elbow+0.009
Neg logit contributions
 inside-0.005
 decided-0.005
makers-0.004
 should-0.004
 even-0.004
Token and
Token ID290
Feature activation+3.78e+00

Pos logit contributions
ucker+0.069
 salon+0.068
ittens+0.065
 Fro+0.065
ips+0.064
Neg logit contributions
 decided-0.031
 inside-0.030
 should-0.024
 soon-0.024
 even-0.024
Token Max
Token ID5436
Feature activation+4.88e-01

Pos logit contributions
ucker+0.113
eh+0.104
pill+0.096
 salon+0.095
etchup+0.091
Neg logit contributions
 inside-0.081
 next-0.071
 soon-0.068
 decided-0.067
 sometimes-0.065
Token are
Token ID389
Feature activation-7.13e-03

Pos logit contributions
 salon+0.010
ucker+0.009
 Dr+0.009
 pose+0.009
 Fro+0.009
Neg logit contributions
 decided-0.012
 inside-0.012
 UFO-0.011
makers-0.011
 generations-0.011
Token sorry
Token ID7926
Feature activation+2.70e-02

Pos logit contributions
 decided+0.000
 inside+0.000
 should+0.000
 UFO+0.000
makers+0.000
Neg logit contributions
 salon-0.000
ucker-0.000
 Fro-0.000
 Dr-0.000
ips-0.000
Token.
Token ID13
Feature activation-2.89e-01

Pos logit contributions
ucker+0.001
 Dr+0.001
 salon+0.001
ittens+0.001
 Fro+0.001
Neg logit contributions
 decided-0.001
 UFO-0.000
 inside-0.000
 generation-0.000
col-0.000
Token They
Token ID1119
Feature activation+1.08e+00

Pos logit contributions
 decided+0.007
 inside+0.007
 soon+0.006
makers+0.006
 UFO+0.006
Neg logit contributions
ucker-0.006
 salon-0.006
 Dr-0.006
 Velvet-0.006
ags-0.006
Token from
Token ID422
Feature activation-4.92e-01

Pos logit contributions
 salon+0.014
ucker+0.014
 pose+0.013
 Fro+0.013
ips+0.013
Neg logit contributions
 decided-0.009
 UFO-0.008
 should-0.008
 inside-0.007
col-0.007
Token the
Token ID262
Feature activation-1.12e-01

Pos logit contributions
 decided+0.013
 inside+0.012
 should+0.012
col+0.012
makers+0.012
Neg logit contributions
 salon-0.009
 Dr-0.009
 Fro-0.008
 Velvet-0.008
ucker-0.008
Token shore
Token ID15191
Feature activation-1.91e-01

Pos logit contributions
year+0.003
Mill+0.003
te+0.003
 decided+0.003
col+0.003
Neg logit contributions
 salon-0.002
 losers-0.002
 mat-0.002
 fle-0.002
 elbow-0.002
Token.
Token ID13
Feature activation+2.81e-01

Pos logit contributions
 decided+0.004
 UFO+0.003
Mill+0.003
 inside+0.003
 should+0.003
Neg logit contributions
 salon-0.005
ucker-0.005
ips-0.005
 pose-0.004
 Fro-0.004
Token Lily
Token ID20037
Feature activation+1.69e+00

Pos logit contributions
ucker+0.006
 salon+0.005
 Fro+0.005
 Dr+0.005
ips+0.005
Neg logit contributions
 decided-0.007
 inside-0.007
 should-0.006
 soon-0.006
 even-0.006
Token and
Token ID290
Feature activation+3.77e+00

Pos logit contributions
ucker+0.055
 salon+0.055
ittens+0.053
ilo+0.052
ags+0.052
Neg logit contributions
 decided-0.023
 inside-0.022
 should-0.018
 soon-0.017
 even-0.017
Token Tom
Token ID4186
Feature activation+5.94e-01

Pos logit contributions
ucker+0.104
eh+0.101
pill+0.098
ilo+0.093
ittens+0.092
Neg logit contributions
 inside-0.089
 next-0.078
 decided-0.076
 soon-0.070
 even-0.067
Token are
Token ID389
Feature activation+1.30e-01

Pos logit contributions
 salon+0.011
ucker+0.011
 Fro+0.010
 pose+0.010
 losers+0.010
Neg logit contributions
 decided-0.016
 inside-0.015
 UFO-0.014
 generations-0.014
 soon-0.014
Token scared
Token ID12008
Feature activation+3.26e-01

Pos logit contributions
 salon+0.004
ucker+0.004
ittens+0.004
ilo+0.004
pill+0.004
Neg logit contributions
 inside-0.003
 decided-0.002
 even-0.002
 soon-0.002
 should-0.002
Token.
Token ID13
Feature activation+2.33e-01

Pos logit contributions
ucker+0.009
 salon+0.009
 Fro+0.009
ips+0.009
 Dr+0.009
Neg logit contributions
 decided-0.005
 inside-0.005
 UFO-0.004
 should-0.004
 generation-0.004
Token
Token ID198
Feature activation-1.79e-01

Pos logit contributions
ucker+0.006
 salon+0.005
 Fro+0.005
 Dr+0.005
ips+0.005
Neg logit contributions
 decided-0.005
 inside-0.005
 even-0.004
 soon-0.004
 should-0.004
Token sound
Token ID2128
Feature activation-5.44e-01

Pos logit contributions
 decided+0.016
 inside+0.014
 should+0.013
 UFO+0.013
makers+0.013
Neg logit contributions
 salon-0.014
ucker-0.014
 Dr-0.013
 Fro-0.013
 Mara-0.013
Token.
Token ID13
Feature activation-8.33e-02

Pos logit contributions
 decided+0.010
 UFO+0.009
 generation+0.009
 mainland+0.009
 inside+0.009
Neg logit contributions
ucker-0.014
 salon-0.014
 Dr-0.014
 pose-0.013
 losers-0.013
Token
Token ID198
Feature activation-1.12e-01

Pos logit contributions
 inside+0.001
 decided+0.001
 should+0.001
 soon+0.001
 even+0.001
Neg logit contributions
 salon-0.003
ucker-0.002
ittens-0.002
eh-0.002
ips-0.002
Token
Token ID198
Feature activation+8.16e-02

Pos logit contributions
 decided+0.002
 inside+0.002
 should+0.002
 soon+0.002
 even+0.002
Neg logit contributions
 salon-0.003
ucker-0.003
 Fro-0.003
ips-0.003
ittens-0.003
TokenAnna
Token ID31160
Feature activation+2.02e+00

Pos logit contributions
 salon+0.002
ucker+0.002
 Fro+0.002
 elbow+0.002
ittens+0.002
Neg logit contributions
 decided-0.001
 inside-0.001
 soon-0.001
 should-0.001
 next-0.001
Token and
Token ID290
Feature activation+3.76e+00

Pos logit contributions
 salon+0.064
ucker+0.063
ittens+0.061
 Fro+0.060
ags+0.059
Neg logit contributions
 decided-0.029
 inside-0.029
 should-0.024
 even-0.023
 soon-0.023
Token Ben
Token ID3932
Feature activation+4.69e-01

Pos logit contributions
eh+0.119
ucker+0.116
pill+0.111
ittens+0.105
imm+0.102
Neg logit contributions
 inside-0.074
 next-0.068
 two-0.062
 even-0.060
 soon-0.059
Token gasp
Token ID47440
Feature activation-4.35e-01

Pos logit contributions
 salon+0.008
ucker+0.008
 pose+0.007
 losers+0.007
ips+0.007
Neg logit contributions
 decided-0.013
 inside-0.012
 generations-0.012
 generation-0.012
 UFO-0.012
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
 decided+0.008
 UFO+0.008
 mainland+0.007
 generation+0.007
col+0.007
Neg logit contributions
ucker-0.011
ags-0.010
ips-0.010
 Dr-0.010
ilo-0.010
Token They
Token ID1119
Feature activation+4.49e-01

Pos logit contributions
 decided+0.006
 inside+0.006
 even+0.005
 soon+0.005
 UFO+0.005
Neg logit contributions
ucker-0.006
 salon-0.006
 Mara-0.006
 Fro-0.006
 Dr-0.005
Token look
Token ID804
Feature activation-9.63e-01

Pos logit contributions
 salon+0.008
ucker+0.008
 losers+0.007
 Fro+0.007
 Dr+0.007
Neg logit contributions
 decided-0.012
 inside-0.012
 generations-0.012
 generation-0.011
 soon-0.011
Token wine
Token ID8237
Feature activation+1.40e-01

Pos logit contributions
 decided+0.017
makers+0.016
 inside+0.016
 UFO+0.016
edient+0.015
Neg logit contributions
ucker-0.009
 salon-0.009
ags-0.009
 losers-0.009
 toast-0.009
Token?"
Token ID1701
Feature activation+9.51e-01

Pos logit contributions
 salon+0.003
ucker+0.003
 Fro+0.003
ips+0.003
 pose+0.003
Neg logit contributions
 decided-0.003
 inside-0.003
 should-0.003
 soon-0.003
 even-0.003
Token
Token ID198
Feature activation-9.06e-02

Pos logit contributions
 salon+0.031
ucker+0.031
ips+0.030
 losers+0.029
ags+0.029
Neg logit contributions
 decided-0.013
 inside-0.013
 soon-0.011
 should-0.010
 even-0.010
Token
Token ID198
Feature activation+2.02e-01

Pos logit contributions
 decided+0.001
 inside+0.001
 should+0.001
 soon+0.001
 even+0.001
Neg logit contributions
 salon-0.003
ucker-0.003
ips-0.003
 Fro-0.003
 losers-0.003
TokenAnna
Token ID31160
Feature activation+2.05e+00

Pos logit contributions
ucker+0.005
 salon+0.005
 Fro+0.004
ags+0.004
ittens+0.004
Neg logit contributions
 decided-0.005
 inside-0.004
 soon-0.004
 next-0.004
 should-0.004
Token and
Token ID290
Feature activation+3.76e+00

Pos logit contributions
ucker+0.058
 salon+0.058
 Fro+0.055
ips+0.054
 Mara+0.054
Neg logit contributions
 decided-0.035
 inside-0.033
 should-0.029
 soon-0.028
 even-0.028
Token Ben
Token ID3932
Feature activation+8.57e-01

Pos logit contributions
ucker+0.109
eh+0.108
liv+0.093
imm+0.093
ittens+0.092
Neg logit contributions
 next-0.080
 inside-0.074
 two-0.072
 sometimes-0.066
 decided-0.063
Token shake
Token ID13279
Feature activation+1.51e-01

Pos logit contributions
 salon+0.014
ucker+0.014
ags+0.013
 pose+0.013
 Fro+0.013
Neg logit contributions
 decided-0.024
 inside-0.023
 generations-0.022
makers-0.022
 UFO-0.022
Token their
Token ID511
Feature activation+5.95e-01

Pos logit contributions
 salon+0.003
ucker+0.003
 Fro+0.003
ips+0.003
ittens+0.003
Neg logit contributions
 decided-0.004
 inside-0.004
 should-0.004
 soon-0.004
 even-0.003
Token heads
Token ID6665
Feature activation-1.12e+00

Pos logit contributions
ucker+0.016
 salon+0.016
 Dr+0.015
eh+0.015
 Mara+0.015
Neg logit contributions
 inside-0.012
 decided-0.011
 next-0.009
 should-0.009
 even-0.009
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
 UFO+0.027
 mainland+0.025
 generations+0.025
 generation+0.024
 decided+0.024
Neg logit contributions
 salon-0.021
ucker-0.020
 pose-0.020
 downward-0.020
ips-0.020
Token.
Token ID13
Feature activation+4.25e-01

Pos logit contributions
 salon+0.021
ucker+0.021
 Fro+0.020
 pose+0.019
ips+0.019
Neg logit contributions
 decided-0.011
 inside-0.010
 should-0.009
 UFO-0.008
 even-0.008
Token
Token ID198
Feature activation+5.47e-02

Pos logit contributions
 salon+0.012
ucker+0.012
 pose+0.011
ips+0.011
ittens+0.011
Neg logit contributions
 decided-0.007
 inside-0.007
 should-0.006
 soon-0.006
 even-0.006
Token
Token ID198
Feature activation+4.02e-01

Pos logit contributions
ucker+0.001
 Fro+0.001
 salon+0.001
 Dr+0.001
ilo+0.001
Neg logit contributions
 decided-0.001
 inside-0.001
 UFO-0.001
 even-0.001
 should-0.001
TokenL
Token ID43
Feature activation+4.01e-01

Pos logit contributions
 salon+0.009
ucker+0.009
 Fro+0.009
ips+0.009
 pose+0.008
Neg logit contributions
 decided-0.009
 inside-0.009
 should-0.008
 soon-0.008
 even-0.008
Tokenily
Token ID813
Feature activation+1.96e+00

Pos logit contributions
ucker+0.011
 salon+0.011
 Dr+0.011
 Fro+0.010
 pose+0.010
Neg logit contributions
 inside-0.007
 decided-0.007
 should-0.006
makers-0.006
 even-0.006
Token and
Token ID290
Feature activation+3.72e+00

Pos logit contributions
ucker+0.066
 salon+0.065
ittens+0.064
ilo+0.062
ags+0.062
Neg logit contributions
 decided-0.026
 inside-0.025
 should-0.021
 even-0.019
 soon-0.019
Token Ben
Token ID3932
Feature activation+4.15e-01

Pos logit contributions
ucker+0.094
eh+0.089
pill+0.089
ittens+0.083
 hind+0.077
Neg logit contributions
 inside-0.100
 next-0.085
 decided-0.083
 even-0.078
 soon-0.077
Token swim
Token ID9422
Feature activation-1.29e+00

Pos logit contributions
 salon+0.008
ucker+0.008
 Fro+0.007
 Dr+0.007
 losers+0.007
Neg logit contributions
 decided-0.011
 inside-0.010
 soon-0.009
 UFO-0.009
makers-0.009
Token to
Token ID284
Feature activation+2.02e-01

Pos logit contributions
 decided+0.026
 inside+0.024
 should+0.022
 even+0.021
 soon+0.021
Neg logit contributions
 salon-0.033
ucker-0.033
 Fro-0.031
ips-0.031
 pose-0.030
Token their
Token ID511
Feature activation+2.23e-01

Pos logit contributions
ucker+0.005
 salon+0.005
ittens+0.004
ips+0.004
eh+0.004
Neg logit contributions
 inside-0.005
 decided-0.005
 soon-0.004
 even-0.004
 should-0.004
Token dad
Token ID9955
Feature activation-1.50e-01

Pos logit contributions
 salon+0.005
ucker+0.005
 Fro+0.005
 mat+0.005
 pose+0.005
Neg logit contributions
 decided-0.005
 inside-0.005
 should-0.005
makers-0.004
 even-0.004
Token fire
Token ID2046
Feature activation+1.79e-01

Pos logit contributions
 decided+0.003
 inside+0.002
 should+0.002
makers+0.002
 UFO+0.002
Neg logit contributions
 Dr-0.001
 Leo-0.001
 salon-0.001
 Fro-0.001
 Mara-0.001
Token.
Token ID13
Feature activation-2.55e-01

Pos logit contributions
 salon+0.005
ucker+0.005
 pose+0.005
 Leo+0.005
ips+0.005
Neg logit contributions
 decided-0.003
 UFO-0.003
 should-0.003
 inside-0.003
 even-0.003
Token
Token ID198
Feature activation-5.90e-01

Pos logit contributions
 decided+0.005
 inside+0.005
 should+0.005
 soon+0.004
 even+0.004
Neg logit contributions
 salon-0.007
ucker-0.006
ips-0.006
ittens-0.006
 pose-0.006
Token
Token ID198
Feature activation-1.28e-01

Pos logit contributions
 decided+0.011
 inside+0.010
 should+0.009
 even+0.009
 soon+0.009
Neg logit contributions
ucker-0.016
 salon-0.016
 Fro-0.015
 Dr-0.015
ips-0.015
TokenAnna
Token ID31160
Feature activation+2.03e+00

Pos logit contributions
 decided+0.003
 inside+0.003
 soon+0.002
 should+0.002
 even+0.002
Neg logit contributions
ucker-0.003
 salon-0.003
 elbow-0.003
ags-0.003
ips-0.003
Token and
Token ID290
Feature activation+3.71e+00

Pos logit contributions
 salon+0.060
ucker+0.060
 Fro+0.056
ips+0.056
ittens+0.056
Neg logit contributions
 decided-0.033
 inside-0.031
 should-0.026
 soon-0.026
 even-0.026
Token Ben
Token ID3932
Feature activation+4.42e-01

Pos logit contributions
ucker+0.115
eh+0.101
imm+0.094
awed+0.094
 salon+0.093
Neg logit contributions
 next-0.074
 inside-0.070
 soon-0.069
 two-0.068
 sometimes-0.067
Token scream
Token ID8196
Feature activation-6.61e-01

Pos logit contributions
 salon+0.007
 pose+0.007
 Leo+0.006
ucker+0.006
 losers+0.006
Neg logit contributions
 decided-0.012
 generations-0.012
 inside-0.012
 UFO-0.011
 generation-0.011
Token.
Token ID13
Feature activation+1.14e-01

Pos logit contributions
 decided+0.014
 UFO+0.014
 mainland+0.014
 generations+0.013
col+0.013
Neg logit contributions
 Dr-0.014
 pose-0.014
 Fro-0.014
 Luna-0.014
 Leo-0.014
Token They
Token ID1119
Feature activation+6.64e-01

Pos logit contributions
 salon+0.003
ucker+0.003
ips+0.003
 Fro+0.003
ittens+0.003
Neg logit contributions
 decided-0.002
 inside-0.002
 should-0.002
 soon-0.002
 even-0.002
Token jump
Token ID4391
Feature activation-5.92e-01

Pos logit contributions
 salon+0.012
 pose+0.012
 losers+0.012
ucker+0.012
 Fro+0.012
Neg logit contributions
 decided-0.017
 inside-0.016
 generations-0.016
makers-0.015
 UFO-0.015
Token.
Token ID13
Feature activation+5.09e-01

Pos logit contributions
 decided+0.005
 UFO+0.005
 should+0.004
col+0.004
 inside+0.004
Neg logit contributions
 salon-0.008
 mat-0.008
ips-0.008
ucker-0.007
 pose-0.007
Token
Token ID198
Feature activation+9.86e-02

Pos logit contributions
 salon+0.015
ucker+0.015
 pose+0.014
ittens+0.014
ips+0.014
Neg logit contributions
 decided-0.009
 inside-0.008
 should-0.007
 soon-0.007
 even-0.007
Token
Token ID198
Feature activation+3.56e-01

Pos logit contributions
ucker+0.003
 salon+0.003
 Fro+0.003
 Dr+0.002
 Mara+0.002
Neg logit contributions
 decided-0.002
 inside-0.002
 UFO-0.002
 should-0.002
 even-0.002
TokenS
Token ID50
Feature activation+1.66e+00

Pos logit contributions
ucker+0.008
 salon+0.008
 Fro+0.007
 Mara+0.007
ips+0.007
Neg logit contributions
 decided-0.008
 inside-0.008
 should-0.007
 UFO-0.007
 even-0.007
Tokenara
Token ID3301
Feature activation+2.42e+00

Pos logit contributions
 hind+0.044
 elbow+0.040
 losers+0.039
 Fro+0.038
 Scout+0.036
Neg logit contributions
 inside-0.036
 decided-0.033
 next-0.032
 soon-0.029
 should-0.029
Token and
Token ID290
Feature activation+3.70e+00

Pos logit contributions
ucker+0.077
 salon+0.077
ittens+0.073
 Fro+0.072
ips+0.071
Neg logit contributions
 decided-0.035
 inside-0.034
 should-0.028
 soon-0.027
 even-0.027
Token Ben
Token ID3932
Feature activation+5.33e-01

Pos logit contributions
eh+0.105
ucker+0.102
ittens+0.096
pill+0.095
liv+0.093
Neg logit contributions
 inside-0.080
 next-0.074
 decided-0.065
 two-0.065
 sunset-0.064
Token peek
Token ID27185
Feature activation-2.37e-01

Pos logit contributions
 salon+0.010
ucker+0.010
 Fro+0.010
 pose+0.009
 Dr+0.009
Neg logit contributions
 decided-0.014
 inside-0.013
makers-0.012
 UFO-0.012
 should-0.012
Token from
Token ID422
Feature activation-5.12e-01

Pos logit contributions
 decided+0.005
 UFO+0.005
makers+0.005
 should+0.004
 generation+0.004
Neg logit contributions
ips-0.005
 Fro-0.005
ags-0.005
 Mara-0.005
 salon-0.005
Token the
Token ID262
Feature activation+7.64e-02

Pos logit contributions
 decided+0.010
 inside+0.009
 should+0.008
makers+0.008
 even+0.008
Neg logit contributions
 salon-0.013
ucker-0.013
 Fro-0.012
 Dr-0.012
ips-0.012
Token fence
Token ID13990
Feature activation+7.64e-01

Pos logit contributions
ucker+0.002
 salon+0.002
 Fro+0.002
ips+0.002
 Mara+0.002
Neg logit contributions
 inside-0.002
 decided-0.002
 should-0.002
 soon-0.001
 even-0.001
Token mean
Token ID1612
Feature activation-3.70e-02

Pos logit contributions
 salon+0.006
ucker+0.006
ips+0.005
ittens+0.005
 Fro+0.005
Neg logit contributions
 decided-0.005
 inside-0.005
 should-0.004
 soon-0.004
 even-0.004
Token.
Token ID13
Feature activation-1.48e-03

Pos logit contributions
 decided+0.001
 UFO+0.001
 mainland+0.001
 generation+0.001
 inside+0.001
Neg logit contributions
 salon-0.001
ucker-0.001
ips-0.001
 Fro-0.001
 Dr-0.001
Token
Token ID198
Feature activation-3.79e-01

Pos logit contributions
 decided+0.000
 inside+0.000
 should+0.000
 soon+0.000
 even+0.000
Neg logit contributions
 salon-0.000
ucker-0.000
ips-0.000
 pose-0.000
ittens-0.000
Token
Token ID198
Feature activation+3.99e-02

Pos logit contributions
 decided+0.009
 inside+0.008
 UFO+0.008
 generation+0.007
col+0.007
Neg logit contributions
ucker-0.008
 Dr-0.008
 Fro-0.008
 Leo-0.008
ilo-0.008
TokenAnna
Token ID31160
Feature activation+1.96e+00

Pos logit contributions
 salon+0.001
ucker+0.001
 Fro+0.001
ips+0.001
ittens+0.001
Neg logit contributions
 decided-0.001
 inside-0.001
 should-0.001
 soon-0.001
 even-0.001
Token and
Token ID290
Feature activation+3.70e+00

Pos logit contributions
 salon+0.058
ucker+0.058
 Fro+0.054
ips+0.054
ittens+0.053
Neg logit contributions
 decided-0.032
 inside-0.031
 should-0.026
 soon-0.025
 even-0.025
Token Ben
Token ID3932
Feature activation+5.72e-01

Pos logit contributions
ucker+0.106
eh+0.101
ittens+0.094
pill+0.093
liv+0.091
Neg logit contributions
 next-0.084
 inside-0.082
 two-0.072
 decided-0.071
 even-0.070
Token are
Token ID389
Feature activation-2.37e-02

Pos logit contributions
 salon+0.010
ucker+0.009
 Fro+0.009
 losers+0.009
 pose+0.009
Neg logit contributions
 decided-0.016
 inside-0.015
 generations-0.014
 UFO-0.014
makers-0.014
Token not
Token ID407
Feature activation+1.84e-02

Pos logit contributions
 decided+0.000
 inside+0.000
 even+0.000
 should+0.000
 soon+0.000
Neg logit contributions
 salon-0.001
ucker-0.001
ittens-0.001
 Fro-0.001
ips-0.001
Token lucky
Token ID9670
Feature activation+2.26e+00

Pos logit contributions
 salon+0.001
ucker+0.001
ips+0.000
ilo+0.000
ittens+0.000
Neg logit contributions
 decided-0.000
 inside-0.000
 even-0.000
 should-0.000
 soon-0.000
Token.
Token ID13
Feature activation-1.57e-01

Pos logit contributions
 salon+0.071
ucker+0.070
 Fro+0.067
ips+0.066
 Dr+0.066
Neg logit contributions
 decided-0.033
 inside-0.030
 should-0.025
 soon-0.024
 even-0.024
Token it
Token ID340
Feature activation+7.70e-01

Pos logit contributions
 salon+0.013
ucker+0.013
 Fro+0.013
 Dr+0.012
 Leo+0.012
Neg logit contributions
 decided-0.017
 inside-0.016
makers-0.015
 UFO-0.015
 should-0.014
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
 pose+0.018
 Leo+0.018
 salon+0.017
 yoga+0.017
 mat+0.017
Neg logit contributions
 UFO-0.016
 decided-0.016
 lighthouse-0.015
 mainland-0.015
Mill-0.014
Token
Token ID198
Feature activation-3.26e-01

Pos logit contributions
ucker+0.004
 salon+0.004
 Fro+0.004
 Dr+0.004
ips+0.004
Neg logit contributions
 decided-0.004
 inside-0.004
 should-0.004
 even-0.004
 soon-0.004
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
 decided+0.006
 inside+0.006
 UFO+0.005
 even+0.005
 soon+0.005
Neg logit contributions
ucker-0.009
 salon-0.008
 Fro-0.008
 Dr-0.008
ilo-0.008
TokenAnna
Token ID31160
Feature activation+2.15e+00

Pos logit contributions
ucker+0.002
 salon+0.002
 Fro+0.002
ips+0.002
 elbow+0.002
Neg logit contributions
 decided-0.003
 inside-0.003
 soon-0.003
 should-0.003
 even-0.003
Token and
Token ID290
Feature activation+3.68e+00

Pos logit contributions
 salon+0.057
ucker+0.057
 Fro+0.053
ips+0.052
 pose+0.052
Neg logit contributions
 decided-0.041
 inside-0.039
 UFO-0.034
makers-0.034
 should-0.034
Token Ben
Token ID3932
Feature activation+7.82e-01

Pos logit contributions
eh+0.110
ucker+0.107
liv+0.099
ittens+0.098
imm+0.097
Neg logit contributions
 next-0.076
 inside-0.075
 two-0.065
 sometimes-0.061
 again-0.061
Token are
Token ID389
Feature activation+9.47e-02

Pos logit contributions
 salon+0.013
ucker+0.012
 pose+0.012
 losers+0.012
ags+0.011
Neg logit contributions
 decided-0.021
 inside-0.021
 generations-0.020
 UFO-0.020
makers-0.020
Token happy
Token ID3772
Feature activation+9.47e-01

Pos logit contributions
ucker+0.002
 salon+0.002
 Fro+0.002
ips+0.002
 Dr+0.002
Neg logit contributions
 decided-0.002
 inside-0.002
 should-0.002
 soon-0.002
 even-0.002
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
ucker+0.026
 salon+0.026
 Fro+0.025
 Dr+0.024
ips+0.024
Neg logit contributions
 decided-0.017
 UFO-0.015
 inside-0.014
 should-0.014
col-0.014
Token They
Token ID1119
Feature activation+1.16e+00

Pos logit contributions
 decided+0.000
 inside+0.000
 should+0.000
 soon+0.000
 even+0.000
Neg logit contributions
 salon-0.000
ucker-0.000
ittens-0.000
eh-0.000
ips-0.000
Token.
Token ID13
Feature activation+2.31e-01

Pos logit contributions
ucker+0.008
 Fro+0.008
 pose+0.008
 Leo+0.008
 salon+0.008
Neg logit contributions
 UFO-0.007
 decided-0.006
makers-0.006
 inside-0.006
col-0.006
Token
Token ID198
Feature activation+7.89e-02

Pos logit contributions
ucker+0.006
 salon+0.005
 Fro+0.005
ips+0.005
 Mara+0.005
Neg logit contributions
 decided-0.005
 inside-0.005
makers-0.004
 UFO-0.004
 should-0.004
Token
Token ID198
Feature activation+3.73e-01

Pos logit contributions
ucker+0.002
 Fro+0.002
 salon+0.002
 Dr+0.002
 Mara+0.002
Neg logit contributions
 decided-0.002
 inside-0.002
 UFO-0.001
makers-0.001
 even-0.001
TokenL
Token ID43
Feature activation+3.56e-01

Pos logit contributions
ucker+0.008
 salon+0.008
 Fro+0.008
ips+0.008
 Mara+0.008
Neg logit contributions
 inside-0.009
 decided-0.008
 UFO-0.008
 should-0.007
 even-0.007
Tokenila
Token ID10102
Feature activation+2.19e+00

Pos logit contributions
 salon+0.006
ucker+0.006
 Dr+0.005
 hind+0.005
 elbow+0.005
Neg logit contributions
 decided-0.011
 inside-0.010
 soon-0.009
 should-0.009
 even-0.009
Token and
Token ID290
Feature activation+3.68e+00

Pos logit contributions
 salon+0.065
ucker+0.065
 Fro+0.061
ips+0.061
ittens+0.060
Neg logit contributions
 decided-0.035
 inside-0.033
 should-0.028
 soon-0.027
 even-0.027
Token Ben
Token ID3932
Feature activation+7.94e-01

Pos logit contributions
eh+0.115
ucker+0.104
 hind+0.101
ittens+0.098
 losers+0.098
Neg logit contributions
 inside-0.059
 next-0.058
 decided-0.055
 two-0.052
 sunset-0.050
Token have
Token ID423
Feature activation+3.63e-01

Pos logit contributions
 salon+0.014
ucker+0.013
 Fro+0.013
 pose+0.013
 Leo+0.012
Neg logit contributions
 decided-0.021
 inside-0.020
 generations-0.020
 UFO-0.020
makers-0.019
Token a
Token ID257
Feature activation+4.08e-01

Pos logit contributions
 salon+0.010
ucker+0.010
 Fro+0.009
ips+0.009
ittens+0.009
Neg logit contributions
 decided-0.007
 inside-0.007
 should-0.006
 soon-0.006
 even-0.006
Token fun
Token ID1257
Feature activation-2.68e-01

Pos logit contributions
 salon+0.009
ucker+0.009
 Fro+0.008
 pose+0.008
 mat+0.008
Neg logit contributions
 decided-0.010
 inside-0.009
makers-0.009
col-0.008
 generations-0.008
Token day
Token ID1110
Feature activation+3.93e-01

Pos logit contributions
 decided+0.006
 should+0.005
 rooft+0.005
year+0.005
 shouldn+0.005
Neg logit contributions
 mat-0.006
 pose-0.006
 yoga-0.006
ittens-0.006
 Fro-0.006
Token back
Token ID736
Feature activation+2.17e-01

Pos logit contributions
 Fro+0.002
ucker+0.002
 Dr+0.002
 toast+0.002
 yoga+0.002
Neg logit contributions
 decided-0.003
 UFO-0.003
makers-0.003
edient-0.003
 inside-0.003
Token the
Token ID262
Feature activation-1.97e-01

Pos logit contributions
 Fro+0.003
 yoga+0.003
ips+0.003
 salon+0.003
ags+0.003
Neg logit contributions
 decided-0.006
 UFO-0.006
makers-0.006
 should-0.005
edient-0.005
Token village
Token ID7404
Feature activation-2.42e-01

Pos logit contributions
 violently+0.005
year+0.005
te+0.005
 decided+0.005
pt+0.005
Neg logit contributions
 salon-0.004
 fle-0.003
 pitcher-0.003
 toast-0.003
 cookie-0.003
Token to
Token ID284
Feature activation+3.24e-01

Pos logit contributions
 decided+0.005
 UFO+0.005
 should+0.004
makers+0.004
 inside+0.004
Neg logit contributions
 salon-0.006
 pose-0.005
ucker-0.005
 mat-0.005
 yoga-0.005
Token its
Token ID663
Feature activation-6.15e-01

Pos logit contributions
 Dr+0.005
 salon+0.005
 Fro+0.004
 yoga+0.004
 pose+0.004
Neg logit contributions
 decided-0.010
 inside-0.009
 UFO-0.009
makers-0.009
 should-0.009
Token former
Token ID1966
Feature activation-4.06e+00

Pos logit contributions
 decided+0.015
makers+0.014
col+0.013
 UFO+0.013
year+0.013
Neg logit contributions
 salon-0.013
 yoga-0.013
 pose-0.013
 hind-0.012
 mat-0.012
Token state
Token ID1181
Feature activation-6.40e-01

Pos logit contributions
 decided+0.111
 inside+0.105
makers+0.098
 UFO+0.098
 should+0.097
Neg logit contributions
ucker-0.077
ags-0.071
 hind-0.068
 salon-0.068
 pose-0.066
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
 UFO+0.012
 decided+0.012
 inside+0.011
makers+0.011
 mainland+0.010
Neg logit contributions
 salon-0.016
ucker-0.015
ips-0.015
 pose-0.015
 Mara-0.015
Token With
Token ID2080
Feature activation-6.73e-01

Pos logit contributions
 decided+0.004
 inside+0.004
 should+0.004
 soon+0.004
 UFO+0.004
Neg logit contributions
ucker-0.005
 salon-0.005
 Fro-0.005
 Dr-0.005
ips-0.005
Token the
Token ID262
Feature activation-9.06e-02

Pos logit contributions
 decided+0.017
 inside+0.016
makers+0.016
 UFO+0.015
 should+0.014
Neg logit contributions
 Fro-0.012
ucker-0.012
 salon-0.012
 Dr-0.012
 Leo-0.012
Token help
Token ID1037
Feature activation+2.02e-01

Pos logit contributions
 decided+0.002
 inside+0.002
makers+0.002
 UFO+0.002
 soon+0.002
Neg logit contributions
 salon-0.002
 Fro-0.002
 pose-0.002
ucker-0.002
 toast-0.002
Token the
Token ID262
Feature activation-3.25e-01

Pos logit contributions
 your+0.003
 my+0.003
 yoga+0.003
 Dr+0.003
ucker+0.003
Neg logit contributions
 decided-0.004
 UFO-0.004
year-0.004
makers-0.004
 mainland-0.003
Token village
Token ID7404
Feature activation-1.00e+00

Pos logit contributions
year+0.009
te+0.008
esh+0.008
pt+0.008
 decided+0.008
Neg logit contributions
 salon-0.007
 fle-0.006
 toast-0.006
 cookie-0.006
 yoga-0.006
Token back
Token ID736
Feature activation-2.45e-01

Pos logit contributions
 decided+0.020
 UFO+0.019
 inside+0.017
makers+0.017
 should+0.017
Neg logit contributions
 salon-0.024
 pose-0.023
ucker-0.022
 losers-0.022
 yoga-0.021
Token to
Token ID284
Feature activation-9.93e-02

Pos logit contributions
 decided+0.006
 UFO+0.005
makers+0.005
 wished+0.005
 should+0.005
Neg logit contributions
 salon-0.005
 pose-0.005
ucker-0.005
 your-0.005
ips-0.005
Token its
Token ID663
Feature activation-5.03e-01

Pos logit contributions
 decided+0.003
 UFO+0.003
col+0.003
makers+0.003
year+0.003
Neg logit contributions
 your-0.002
 my-0.001
 Dr-0.001
 yoga-0.001
 salon-0.001
Token former
Token ID1966
Feature activation-3.74e+00

Pos logit contributions
 decided+0.012
makers+0.011
year+0.011
 UFO+0.010
col+0.010
Neg logit contributions
 salon-0.011
 pose-0.011
 yoga-0.011
 losers-0.010
 mat-0.010
Token glory
Token ID13476
Feature activation-7.31e-01

Pos logit contributions
 decided+0.099
 inside+0.093
 UFO+0.086
makers+0.085
 should+0.085
Neg logit contributions
ucker-0.076
 salon-0.068
ags-0.068
 hind-0.066
 pose-0.066
Token!"
Token ID2474
Feature activation-2.77e-01

Pos logit contributions
 decided+0.014
 UFO+0.014
makers+0.012
 inside+0.012
 wished+0.012
Neg logit contributions
 salon-0.018
ucker-0.017
 pose-0.017
ips-0.017
 mat-0.016
Token
Token ID220
Feature activation-2.33e-02

Pos logit contributions
 decided+0.006
 inside+0.006
 UFO+0.005
 should+0.005
makers+0.005
Neg logit contributions
 salon-0.006
ucker-0.006
 Fro-0.006
 Dr-0.006
ips-0.006
Token
Token ID198
Feature activation+2.90e-01

Pos logit contributions
 decided+0.000
 inside+0.000
 should+0.000
 soon+0.000
 even+0.000
Neg logit contributions
 salon-0.001
ucker-0.001
 pose-0.001
 losers-0.001
 Fro-0.001
Token
Token ID198
Feature activation+1.64e-01

Pos logit contributions
ucker+0.007
 Fro+0.007
 salon+0.007
 Dr+0.007
 Mara+0.006
Neg logit contributions
 decided-0.006
 inside-0.006
 UFO-0.006
 should-0.005
makers-0.005
Token,
Token ID11
Feature activation-3.82e-01

Pos logit contributions
 UFO+0.027
 decided+0.027
Hello+0.026
year+0.025
 should+0.024
Neg logit contributions
ucker-0.028
 salon-0.027
 Fro-0.027
 pose-0.026
 Leo-0.025
Token leaving
Token ID4305
Feature activation-1.18e-03

Pos logit contributions
 decided+0.008
 UFO+0.008
makers+0.008
 should+0.008
 generations+0.008
Neg logit contributions
 salon-0.008
ucker-0.008
 Fro-0.008
 losers-0.007
 Leo-0.007
Token it
Token ID340
Feature activation+2.69e-01

Pos logit contributions
 decided+0.000
 UFO+0.000
makers+0.000
 inside+0.000
col+0.000
Neg logit contributions
 Dr-0.000
 Leo-0.000
 Fro-0.000
 Mara-0.000
 Luna-0.000
Token to
Token ID284
Feature activation-2.82e-01

Pos logit contributions
 salon+0.006
ucker+0.006
ips+0.006
 Fro+0.006
 pose+0.006
Neg logit contributions
 decided-0.005
 UFO-0.005
 should-0.005
 generation-0.005
 generations-0.005
Token its
Token ID663
Feature activation-6.58e-01

Pos logit contributions
 decided+0.007
makers+0.007
 inside+0.007
 should+0.007
 UFO+0.007
Neg logit contributions
 Fro-0.005
 Dr-0.005
 Leo-0.005
 pose-0.005
 losers-0.005
Token former
Token ID1966
Feature activation-3.71e+00

Pos logit contributions
year+0.016
 decided+0.016
makers+0.016
col+0.015
te+0.015
Neg logit contributions
 Fro-0.013
 pose-0.012
 hind-0.012
 mouth-0.012
 yoga-0.012
Token state
Token ID1181
Feature activation-4.01e-01

Pos logit contributions
 decided+0.101
 inside+0.094
makers+0.089
 should+0.089
 UFO+0.088
Neg logit contributions
ucker-0.071
ags-0.065
 hind-0.062
ittens-0.059
ips-0.058
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
 UFO+0.008
 decided+0.007
 mainland+0.007
 should+0.007
makers+0.007
Neg logit contributions
 salon-0.010
ucker-0.009
 pose-0.009
ips-0.009
 Mara-0.009
Token
Token ID220
Feature activation+5.49e-01

Pos logit contributions
 decided+0.004
 inside+0.004
 UFO+0.003
 even+0.003
makers+0.003
Neg logit contributions
 Fro-0.003
 Dr-0.003
ucker-0.003
 Leo-0.003
 Mara-0.003
Token
Token ID198
Feature activation+5.48e-01

Pos logit contributions
 salon+0.016
ucker+0.016
 Fro+0.015
ips+0.014
ittens+0.014
Neg logit contributions
 decided-0.009
 inside-0.009
 should-0.008
 soon-0.008
 even-0.008
Token
Token ID198
Feature activation+3.41e-01

Pos logit contributions
 Fro+0.012
ucker+0.012
 Dr+0.012
 Leo+0.012
 Mara+0.011
Neg logit contributions
 decided-0.012
 inside-0.012
 UFO-0.011
 should-0.010
 even-0.010
Token to
Token ID284
Feature activation+3.90e-01

Pos logit contributions
 decided+0.027
 UFO+0.027
 mainland+0.024
inct+0.024
 lighthouse+0.024
Neg logit contributions
 salon-0.025
ips-0.024
ags-0.024
 losers-0.023
ucker-0.023
Token be
Token ID307
Feature activation-5.68e-01

Pos logit contributions
 salon+0.008
 pose+0.007
 Dr+0.007
 losers+0.007
 Fro+0.006
Neg logit contributions
 decided-0.010
 inside-0.009
col-0.009
makers-0.009
 UFO-0.009
Token restored
Token ID15032
Feature activation-1.94e-01

Pos logit contributions
 UFO+0.013
makers+0.012
 decided+0.012
col+0.012
Wow+0.011
Neg logit contributions
eh-0.011
 your-0.011
ucker-0.011
ags-0.011
 Dr-0.010
Token to
Token ID284
Feature activation+2.08e-01

Pos logit contributions
 UFO+0.005
 decided+0.004
 mainland+0.004
 lighthouse+0.004
 wished+0.004
Neg logit contributions
 your-0.004
 my-0.004
 Dr-0.003
 losers-0.003
 salon-0.003
Token its
Token ID663
Feature activation-3.78e-01

Pos logit contributions
 your+0.004
 salon+0.003
 pose+0.003
 Dr+0.003
 my+0.003
Neg logit contributions
 decided-0.006
col-0.005
 UFO-0.005
year-0.005
makers-0.005
Token former
Token ID1966
Feature activation-3.39e+00

Pos logit contributions
makers+0.009
year+0.009
 decided+0.009
gl+0.008
ull+0.008
Neg logit contributions
 salon-0.008
 yoga-0.007
 pose-0.007
 hind-0.007
 mat-0.007
Token glory
Token ID13476
Feature activation-4.77e-01

Pos logit contributions
 decided+0.090
 inside+0.084
makers+0.078
 should+0.077
 UFO+0.077
Neg logit contributions
ucker-0.067
ags-0.063
 salon-0.062
 hind-0.060
ittens-0.059
Token."
Token ID526
Feature activation-8.09e-01

Pos logit contributions
 UFO+0.010
 decided+0.010
makers+0.009
 wished+0.009
 mainland+0.009
Neg logit contributions
 salon-0.011
 pose-0.010
ucker-0.010
ips-0.010
 mat-0.010
Token
Token ID220
Feature activation+2.78e-02

Pos logit contributions
 decided+0.016
 inside+0.015
makers+0.013
 should+0.013
 UFO+0.013
Neg logit contributions
ucker-0.021
 salon-0.020
 Fro-0.020
 Mara-0.019
ips-0.019
Token
Token ID198
Feature activation+9.29e-02

Pos logit contributions
ucker+0.001
 salon+0.001
 Fro+0.001
ips+0.001
 Dr+0.001
Neg logit contributions
 decided-0.000
 inside-0.000
 should-0.000
 soon-0.000
 even-0.000
Token
Token ID198
Feature activation-1.66e-01

Pos logit contributions
ucker+0.002
 Fro+0.002
 salon+0.002
 Mara+0.002
ilo+0.002
Neg logit contributions
 decided-0.002
 inside-0.002
 UFO-0.002
makers-0.002
 should-0.002
Token back
Token ID736
Feature activation+5.37e-01

Pos logit contributions
ags+0.008
 your+0.007
ips+0.007
 Velvet+0.007
 my+0.007
Neg logit contributions
 decided-0.008
 UFO-0.008
makers-0.007
edient-0.007
 generations-0.007
Token the
Token ID262
Feature activation+2.77e-01

Pos logit contributions
 salon+0.008
ips+0.007
 your+0.007
ags+0.007
 Dr+0.007
Neg logit contributions
 decided-0.016
 UFO-0.016
makers-0.015
 should-0.015
 lighthouse-0.015
Token other
Token ID584
Feature activation-5.78e-02

Pos logit contributions
 salon+0.006
 mat+0.005
 losers+0.005
ags+0.005
 elbow+0.005
Neg logit contributions
 decided-0.007
 soon-0.006
 should-0.006
 generations-0.006
year-0.006
Token toys
Token ID14958
Feature activation-1.43e-01

Pos logit contributions
 decided+0.001
 inside+0.001
 should+0.001
 soon+0.001
 even+0.001
Neg logit contributions
 salon-0.001
ucker-0.001
 Fro-0.001
ips-0.001
ittens-0.001
Token you
Token ID345
Feature activation-2.45e-01

Pos logit contributions
 decided+0.003
 UFO+0.003
 should+0.002
 generations+0.002
makers+0.002
Neg logit contributions
 salon-0.004
ips-0.003
ucker-0.003
 losers-0.003
ags-0.003
Token don
Token ID836
Feature activation-3.12e+00

Pos logit contributions
 UFO+0.006
inct+0.005
Mill+0.005
 decided+0.005
 lighthouse+0.005
Neg logit contributions
 salon-0.005
ucker-0.005
ips-0.005
etchup-0.005
inches-0.005
Token't
Token ID470
Feature activation-7.47e-01

Pos logit contributions
 decided+0.066
 inside+0.063
 should+0.055
 soon+0.055
 even+0.055
Neg logit contributions
 salon-0.078
ucker-0.077
 Fro-0.072
ips-0.070
 Mara-0.070
Token want
Token ID765
Feature activation-6.41e-01

Pos logit contributions
 decided+0.018
 UFO+0.018
inct+0.018
 rooft+0.017
 inside+0.017
Neg logit contributions
 salon-0.015
ucker-0.014
 losers-0.013
 pose-0.013
ittens-0.013
Token."
Token ID526
Feature activation+3.30e-01

Pos logit contributions
 decided+0.013
 UFO+0.013
 lighthouse+0.012
inct+0.012
 mainland+0.012
Neg logit contributions
 Dr-0.015
 salon-0.015
 Velvet-0.014
 losers-0.014
 yoga-0.014
Token
Token ID198
Feature activation-3.33e-01

Pos logit contributions
 salon+0.011
ucker+0.011
 losers+0.011
 pose+0.011
eh+0.010
Neg logit contributions
 decided-0.004
 inside-0.004
 even-0.003
 soon-0.003
 should-0.003
Token
Token ID198
Feature activation+3.89e-02

Pos logit contributions
 decided+0.006
 inside+0.006
 should+0.005
 soon+0.005
 even+0.005
Neg logit contributions
ucker-0.009
 salon-0.009
 Fro-0.009
ips-0.008
 Dr-0.008
Token the
Token ID262
Feature activation-5.15e-01

Pos logit contributions
 salon+0.010
ucker+0.010
 Dr+0.010
 Fro+0.010
 Mara+0.010
Neg logit contributions
 decided-0.020
 inside-0.018
 should-0.018
 UFO-0.018
makers-0.018
Token tree
Token ID5509
Feature activation+7.60e-01

Pos logit contributions
 decided+0.014
makers+0.012
 inside+0.012
col+0.012
year+0.012
Neg logit contributions
 salon-0.011
 mat-0.010
 pose-0.009
 toast-0.009
eh-0.009
Token to
Token ID284
Feature activation+2.49e-01

Pos logit contributions
 salon+0.019
ucker+0.018
 pose+0.017
 Fro+0.017
 mat+0.016
Neg logit contributions
 decided-0.016
 inside-0.014
 UFO-0.014
 should-0.014
makers-0.013
Token its
Token ID663
Feature activation-5.80e-01

Pos logit contributions
 Dr+0.004
 salon+0.004
 pose+0.004
 Fro+0.004
 Leo+0.004
Neg logit contributions
 decided-0.007
 inside-0.006
makers-0.006
 UFO-0.006
 should-0.006
Token once
Token ID1752
Feature activation-8.27e-01

Pos logit contributions
 decided+0.013
makers+0.012
 inside+0.012
 UFO+0.011
year+0.011
Neg logit contributions
 salon-0.013
 pose-0.013
 yoga-0.012
 hind-0.012
 Fro-0.012
Token former
Token ID1966
Feature activation-3.03e+00

Pos logit contributions
 UFO+0.016
 decided+0.016
makers+0.016
 should+0.014
 mainland+0.014
Neg logit contributions
 salon-0.019
 pose-0.019
ucker-0.019
eh-0.018
 toast-0.017
Token beauty
Token ID8737
Feature activation-2.12e-01

Pos logit contributions
 decided+0.074
 inside+0.070
 should+0.063
 UFO+0.063
makers+0.062
Neg logit contributions
ucker-0.065
 salon-0.063
ags-0.059
ittens-0.059
ips-0.058
Token.
Token ID13
Feature activation-3.95e-01

Pos logit contributions
 decided+0.004
 UFO+0.004
 mainland+0.003
 inside+0.003
makers+0.003
Neg logit contributions
 salon-0.005
ucker-0.005
 pose-0.005
 mat-0.005
 Fro-0.005
Token He
Token ID679
Feature activation-3.77e-01

Pos logit contributions
 decided+0.011
 inside+0.011
 UFO+0.009
makers+0.009
 soon+0.009
Neg logit contributions
 Dr-0.007
 Fro-0.007
ucker-0.007
 salon-0.007
 Mara-0.007
Token was
Token ID373
Feature activation-2.05e-01

Pos logit contributions
makers+0.007
 UFO+0.007
 inside+0.007
 mainland+0.006
arp+0.006
Neg logit contributions
 salon-0.009
 pose-0.009
 Fro-0.008
ucker-0.008
 Mara-0.008
Token happy
Token ID3772
Feature activation+7.23e-01

Pos logit contributions
 decided+0.004
 inside+0.004
 UFO+0.004
makers+0.004
 should+0.004
Neg logit contributions
 salon-0.005
ucker-0.004
 Fro-0.004
 Dr-0.004
 Mara-0.004
Token They
Token ID1119
Feature activation-7.16e-01

Pos logit contributions
 decided+0.004
 inside+0.004
 should+0.004
 soon+0.004
 even+0.004
Neg logit contributions
 salon-0.006
ucker-0.006
ips-0.006
 Fro-0.006
 pose-0.006
Token had
Token ID550
Feature activation-5.65e-01

Pos logit contributions
makers+0.012
 inside+0.012
 generation+0.012
 UFO+0.012
 generations+0.011
Neg logit contributions
 Fro-0.019
 Leo-0.017
 losers-0.017
 Mara-0.017
ucker-0.017
Token so
Token ID523
Feature activation-5.16e-01

Pos logit contributions
 decided+0.011
 inside+0.011
 should+0.009
 soon+0.009
 even+0.009
Neg logit contributions
 salon-0.015
ucker-0.015
 Fro-0.014
ips-0.014
ittens-0.014
Token much
Token ID881
Feature activation-8.68e-01

Pos logit contributions
 decided+0.011
 inside+0.011
 UFO+0.010
 generation+0.010
inct+0.009
Neg logit contributions
ucker-0.012
 Fro-0.011
 Leo-0.011
 losers-0.010
ips-0.010
Token fun
Token ID1257
Feature activation-1.74e+00

Pos logit contributions
 decided+0.021
 UFO+0.019
 generation+0.019
edient+0.019
 inside+0.019
Neg logit contributions
ucker-0.017
 Fro-0.017
 yoga-0.016
 Dr-0.016
 pose-0.015
Token swimming
Token ID14899
Feature activation-3.01e+00

Pos logit contributions
 decided+0.038
 UFO+0.034
 inside+0.034
 should+0.032
 generation+0.032
Neg logit contributions
ucker-0.038
 Fro-0.038
ips-0.037
 yoga-0.037
 pose-0.036
Token and
Token ID290
Feature activation-6.22e-01

Pos logit contributions
 UFO+0.062
 decided+0.061
 generation+0.057
makers+0.056
bell+0.056
Neg logit contributions
 Fro-0.067
ags-0.064
 pose-0.062
 mat-0.062
ips-0.062
Token playing
Token ID2712
Feature activation-1.76e+00

Pos logit contributions
 decided+0.016
 inside+0.016
 UFO+0.015
 should+0.015
 generation+0.015
Neg logit contributions
 Fro-0.012
ucker-0.011
 Leo-0.010
 salon-0.010
 Dr-0.010
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
 decided+0.034
 UFO+0.034
bell+0.033
makers+0.031
col+0.031
Neg logit contributions
 Fro-0.042
 mat-0.041
 yoga-0.040
 pose-0.040
ags-0.040
Token From
Token ID3574
Feature activation-6.70e-01

Pos logit contributions
 decided+0.008
 inside+0.007
 UFO+0.007
 should+0.006
makers+0.006
Neg logit contributions
ucker-0.009
 salon-0.009
 Fro-0.008
 Dr-0.008
 Mara-0.008
Token that
Token ID326
Feature activation-5.04e-01

Pos logit contributions
 decided+0.014
makers+0.012
 UFO+0.012
 generations+0.012
 generation+0.012
Neg logit contributions
ucker-0.016
 Fro-0.016
 salon-0.015
 Leo-0.015
 Dr-0.015
Token Leave
Token ID17446
Feature activation-3.67e-01

Pos logit contributions
 inside+0.004
 decided+0.004
 UFO+0.003
makers+0.003
 gem+0.003
Neg logit contributions
 Dr-0.002
 My-0.002
 Leo-0.002
 Fro-0.002
 Mara-0.002
Token this
Token ID428
Feature activation-8.62e-01

Pos logit contributions
 decided+0.009
 inside+0.009
makers+0.008
 UFO+0.008
 should+0.008
Neg logit contributions
 Dr-0.007
 salon-0.007
 Fro-0.007
 losers-0.007
 Mara-0.006
Token door
Token ID3420
Feature activation+1.06e+00

Pos logit contributions
 decided+0.021
year+0.019
gl+0.018
 should+0.018
 UFO+0.018
Neg logit contributions
 salon-0.019
 mat-0.016
 losers-0.016
 pose-0.016
eh-0.016
Token shut
Token ID4423
Feature activation-2.98e-01

Pos logit contributions
 salon+0.025
 pose+0.022
 losers+0.022
 mat+0.021
ucker+0.021
Neg logit contributions
 decided-0.022
 UFO-0.021
 mainland-0.020
 generations-0.020
makers-0.020
Token and
Token ID290
Feature activation-2.82e-01

Pos logit contributions
 decided+0.006
 UFO+0.006
gl+0.006
 mainland+0.006
makers+0.006
Neg logit contributions
 salon-0.006
 my-0.006
 your-0.006
 losers-0.006
 Dr-0.006
Token don
Token ID836
Feature activation-2.97e+00

Pos logit contributions
arp+0.006
 UFO+0.006
makers+0.006
edient+0.006
col+0.006
Neg logit contributions
 salon-0.006
 your-0.005
 Dr-0.005
 losers-0.005
 elbow-0.005
Token't
Token ID470
Feature activation-5.26e-01

Pos logit contributions
 decided+0.079
 inside+0.076
 should+0.068
 soon+0.068
 even+0.068
Neg logit contributions
 salon-0.056
ucker-0.055
ips-0.051
 Dr-0.050
 Fro-0.050
Token let
Token ID1309
Feature activation+2.10e-01

Pos logit contributions
year+0.014
 generations+0.014
col+0.013
 mainland+0.013
 decided+0.013
Neg logit contributions
 pose-0.008
 salon-0.008
 losers-0.008
ow-0.008
 Dr-0.007
Token it
Token ID340
Feature activation-7.02e-01

Pos logit contributions
 salon+0.004
 Dr+0.004
 losers+0.004
 Fro+0.004
 pose+0.004
Neg logit contributions
 decided-0.005
 inside-0.005
 UFO-0.005
makers-0.004
col-0.004
Token out
Token ID503
Feature activation-7.90e-01

Pos logit contributions
 decided+0.017
 generations+0.016
 UFO+0.016
year+0.016
makers+0.015
Neg logit contributions
 pose-0.013
 salon-0.013
ow-0.012
 losers-0.012
 Dr-0.012
Token."
Token ID526
Feature activation-3.25e-02

Pos logit contributions
 UFO+0.019
 mainland+0.018
Wow+0.017
 decided+0.017
inct+0.016
Neg logit contributions
 your-0.015
 my-0.015
 pose-0.013
 losers-0.013
 salon-0.013
Token add
Token ID751
Feature activation+2.37e-01

Pos logit contributions
makers+0.040
year+0.039
 generations+0.039
 UFO+0.038
edient+0.037
Neg logit contributions
 salon-0.022
 pose-0.020
 your-0.020
ucker-0.020
 yoga-0.020
Token some
Token ID617
Feature activation-3.05e-01

Pos logit contributions
 yoga+0.005
 toast+0.005
 your+0.005
ips+0.005
 my+0.005
Neg logit contributions
 decided-0.005
edient-0.005
 UFO-0.005
makers-0.005
 wished-0.005
Token spark
Token ID9009
Feature activation+7.98e-01

Pos logit contributions
 decided+0.008
edient+0.008
makers+0.008
 UFO+0.008
 mainland+0.007
Neg logit contributions
 yoga-0.006
 toast-0.005
 pose-0.005
 mat-0.005
 salon-0.005
Tokenly
Token ID306
Feature activation-4.88e-01

Pos logit contributions
ucker+0.018
 salon+0.018
ips+0.017
 Fro+0.017
 pose+0.017
Neg logit contributions
 decided-0.018
 inside-0.017
 should-0.015
 UFO-0.015
 soon-0.015
Token shoes
Token ID10012
Feature activation-8.16e-01

Pos logit contributions
 decided+0.011
edient+0.011
 UFO+0.011
makers+0.011
 mainland+0.010
Neg logit contributions
 mat-0.009
 salon-0.009
 pose-0.009
 mats-0.009
 yoga-0.009
Token too
Token ID1165
Feature activation-2.96e+00

Pos logit contributions
 decided+0.019
 UFO+0.018
 should+0.017
 wished+0.017
 inside+0.017
Neg logit contributions
 salon-0.017
ucker-0.017
ips-0.016
 losers-0.015
 Mara-0.015
Token!"
Token ID2474
Feature activation-1.03e-01

Pos logit contributions
 UFO+0.077
 decided+0.076
edient+0.073
 raged+0.073
 wished+0.072
Neg logit contributions
 my-0.049
 your-0.049
ucker-0.048
 Leo-0.046
 Mara-0.045
Token
Token ID198
Feature activation-5.30e-02

Pos logit contributions
 decided+0.001
 inside+0.001
 should+0.001
 soon+0.001
 even+0.001
Neg logit contributions
 salon-0.003
ucker-0.003
 losers-0.003
 pose-0.003
 Fro-0.003
Token
Token ID198
Feature activation+1.56e-01

Pos logit contributions
 decided+0.001
 inside+0.001
 should+0.001
 soon+0.001
 even+0.001
Neg logit contributions
 salon-0.002
ucker-0.002
 Fro-0.001
ips-0.001
 losers-0.001
TokenThe
Token ID464
Feature activation-1.45e-01

Pos logit contributions
ucker+0.004
 salon+0.004
 Fro+0.004
ips+0.004
 losers+0.004
Neg logit contributions
 decided-0.003
 inside-0.003
 should-0.002
 soon-0.002
 even-0.002
Token shop
Token ID6128
Feature activation-8.73e-01

Pos logit contributions
 decided+0.004
col+0.004
ull+0.004
gl+0.004
 soon+0.003
Neg logit contributions
 salon-0.003
 Dr-0.003
 cookie-0.002
 mat-0.002
 yoga-0.002
Token Here
Token ID3423
Feature activation+4.20e-01

Pos logit contributions
 decided+0.006
 inside+0.006
 UFO+0.006
makers+0.006
lling+0.006
Neg logit contributions
 Dr-0.005
 Leo-0.005
 Velvet-0.004
 Luna-0.004
 Mara-0.004
Token,
Token ID11
Feature activation-3.55e-01

Pos logit contributions
 salon+0.010
ucker+0.010
 Fro+0.009
ips+0.009
ittens+0.009
Neg logit contributions
 decided-0.009
 inside-0.009
 should-0.008
 soon-0.008
 even-0.008
Token hold
Token ID1745
Feature activation-2.21e-01

Pos logit contributions
edient+0.008
 decided+0.008
 generations+0.008
 UFO+0.008
makers+0.008
Neg logit contributions
 salon-0.008
 pose-0.007
 losers-0.007
 Dr-0.007
 Leo-0.006
Token yours
Token ID12431
Feature activation-9.36e-01

Pos logit contributions
 decided+0.006
 inside+0.005
 UFO+0.005
 should+0.005
 even+0.005
Neg logit contributions
 salon-0.004
ucker-0.004
ips-0.004
 Mara-0.004
 Fro-0.004
Token and
Token ID290
Feature activation-2.94e-01

Pos logit contributions
 decided+0.019
 UFO+0.018
 should+0.016
 generations+0.016
col+0.016
Neg logit contributions
 salon-0.023
 pose-0.022
 mat-0.021
 losers-0.021
ips-0.021
Token don
Token ID836
Feature activation-2.95e+00

Pos logit contributions
 decided+0.007
makers+0.007
 UFO+0.007
 inside+0.006
edient+0.006
Neg logit contributions
 salon-0.006
 pose-0.006
 losers-0.006
 yoga-0.005
 mat-0.005
Token't
Token ID470
Feature activation-7.12e-01

Pos logit contributions
 decided+0.074
 inside+0.072
 should+0.065
 soon+0.064
 even+0.064
Neg logit contributions
 salon-0.061
ucker-0.061
 Fro-0.056
ips-0.055
ittens-0.054
Token lose
Token ID4425
Feature activation-4.17e-01

Pos logit contributions
year+0.019
 decided+0.018
 generations+0.018
 inside+0.017
col+0.017
Neg logit contributions
 salon-0.013
 pose-0.013
 losers-0.012
ucker-0.011
 Fro-0.011
Token it
Token ID340
Feature activation-6.93e-02

Pos logit contributions
 decided+0.011
 UFO+0.010
year+0.009
gl+0.009
 raged+0.009
Neg logit contributions
 your-0.008
 losers-0.008
 Dr-0.008
 my-0.008
 Leo-0.008
Token,
Token ID11
Feature activation+3.16e-02

Pos logit contributions
 UFO+0.001
 decided+0.001
inct+0.001
 lighthouse+0.001
col+0.001
Neg logit contributions
 salon-0.002
 losers-0.002
 pose-0.001
 Dr-0.001
 your-0.001
Token okay
Token ID8788
Feature activation-1.88e+00

Pos logit contributions
 salon+0.001
 losers+0.001
 Fro+0.001
 Dr+0.001
ucker+0.001
Neg logit contributions
 decided-0.001
 inside-0.001
 UFO-0.001
 generations-0.001
makers-0.001
Token your
Token ID534
Feature activation+5.05e-03

Pos logit contributions
 salon+0.006
ucker+0.005
 yoga+0.005
 mat+0.005
 pose+0.005
Neg logit contributions
 decided-0.007
 UFO-0.006
 should-0.006
 inside-0.006
makers-0.006
Token turn
Token ID1210
Feature activation+6.08e-01

Pos logit contributions
 salon+0.000
ucker+0.000
 pose+0.000
 mat+0.000
ips+0.000
Neg logit contributions
 decided-0.000
 inside-0.000
 generation-0.000
 UFO-0.000
 should-0.000
Token.
Token ID13
Feature activation+1.30e-01

Pos logit contributions
 salon+0.016
ucker+0.016
ips+0.015
 pose+0.015
 losers+0.015
Neg logit contributions
 decided-0.012
 UFO-0.010
 inside-0.010
 mainland-0.010
 generation-0.010
Token If
Token ID1002
Feature activation-1.02e+00

Pos logit contributions
 Dr+0.002
ucker+0.002
 salon+0.002
 Fro+0.002
 Mara+0.002
Neg logit contributions
 decided-0.004
 inside-0.003
 UFO-0.003
 even-0.003
 soon-0.003
Token you
Token ID345
Feature activation-2.03e-01

Pos logit contributions
 inside+0.033
 decided+0.033
col+0.033
year+0.033
 UFO+0.032
Neg logit contributions
 salon-0.012
 Dr-0.011
 yoga-0.010
 your-0.010
 losers-0.010
Token don
Token ID836
Feature activation-2.95e+00

Pos logit contributions
 decided+0.005
 inside+0.005
 UFO+0.005
 generations+0.005
makers+0.004
Neg logit contributions
 salon-0.004
ucker-0.004
 pose-0.004
 losers-0.004
 Fro-0.004
Token't
Token ID470
Feature activation-7.93e-01

Pos logit contributions
 decided+0.070
 inside+0.068
 should+0.062
 even+0.061
 soon+0.061
Neg logit contributions
 salon-0.065
ucker-0.065
 Fro-0.060
 Mara-0.058
ilo-0.058
Token,
Token ID11
Feature activation-3.98e-01

Pos logit contributions
year+0.023
 generations+0.022
 generation+0.021
 UFO+0.021
 decided+0.021
Neg logit contributions
 salon-0.014
 pose-0.014
ucker-0.013
 losers-0.012
ow-0.012
Token we
Token ID356
Feature activation-9.05e-01

Pos logit contributions
 inside+0.012
year+0.012
 decided+0.011
makers+0.011
 UFO+0.011
Neg logit contributions
 salon-0.006
 losers-0.005
 Fro-0.005
 Dr-0.005
 your-0.005
Token will
Token ID481
Feature activation-2.75e-01

Pos logit contributions
 generations+0.022
 UFO+0.021
year+0.021
Mill+0.021
 generation+0.021
Neg logit contributions
 salon-0.019
 losers-0.019
ucker-0.018
 pose-0.018
 Fro-0.017
Token go
Token ID467
Feature activation-1.24e+00

Pos logit contributions
year+0.008
 decided+0.007
 generations+0.007
 UFO+0.007
 inside+0.007
Neg logit contributions
 salon-0.005
 pose-0.005
 losers-0.004
ucker-0.004
 Fro-0.004
Token and
Token ID290
Feature activation-6.01e-01

Pos logit contributions
 UFO+0.024
 decided+0.024
makers+0.022
col+0.022
year+0.021
Neg logit contributions
 salon-0.026
 pose-0.025
 mat-0.024
ips-0.024
 Mara-0.023
Token hold
Token ID1745
Feature activation-6.62e-01

Pos logit contributions
 generations+0.015
year+0.015
 decided+0.015
arp+0.014
makers+0.014
Neg logit contributions
 salon-0.012
 pose-0.011
 Fro-0.011
 Leo-0.011
 losers-0.010
Token hands
Token ID2832
Feature activation-1.00e+00

Pos logit contributions
 decided+0.019
 inside+0.017
 UFO+0.017
makers+0.016
 should+0.016
Neg logit contributions
ips-0.011
ags-0.011
ucker-0.011
 Fro-0.011
ittens-0.011
Token so
Token ID523
Feature activation-1.65e+00

Pos logit contributions
 decided+0.019
 UFO+0.018
 inside+0.016
 should+0.016
 mainland+0.016
Neg logit contributions
 salon-0.025
ucker-0.024
ips-0.024
 losers-0.023
ittens-0.023
Token we
Token ID356
Feature activation-1.72e+00

Pos logit contributions
 decided+0.044
 inside+0.043
 UFO+0.043
col+0.041
 mainland+0.040
Neg logit contributions
ucker-0.028
 salon-0.027
 Velvet-0.027
 losers-0.027
 yoga-0.026
Token don
Token ID836
Feature activation-2.95e+00

Pos logit contributions
 generations+0.045
Mill+0.041
keeping+0.041
col+0.039
makers+0.039
Neg logit contributions
 losers-0.032
 slips-0.030
 salon-0.029
 pose-0.028
ow-0.026
Token't
Token ID470
Feature activation-6.94e-01

Pos logit contributions
 decided+0.074
 inside+0.071
 should+0.064
 soon+0.064
 even+0.064
Neg logit contributions
 salon-0.061
ucker-0.061
 Fro-0.056
ips-0.055
 Mara-0.054
Token get
Token ID651
Feature activation-4.49e-01

Pos logit contributions
year+0.018
 decided+0.018
 generations+0.017
 inside+0.017
col+0.017
Neg logit contributions
 salon-0.012
 Fro-0.012
 pose-0.012
 losers-0.012
ucker-0.012
Token lost
Token ID2626
Feature activation-5.30e-01

Pos logit contributions
 decided+0.010
 inside+0.009
 should+0.009
 UFO+0.009
 generation+0.009
Neg logit contributions
ucker-0.010
 salon-0.009
 losers-0.009
 Fro-0.009
 Dr-0.009
Token."
Token ID526
Feature activation+9.18e-02

Pos logit contributions
 decided+0.012
year+0.012
 UFO+0.011
inct+0.011
 should+0.011
Neg logit contributions
 your-0.011
 my-0.011
 losers-0.011
 salon-0.010
 Velvet-0.009
Token So
Token ID1406
Feature activation+6.19e-01

Pos logit contributions
 salon+0.003
ucker+0.003
 losers+0.002
 pose+0.002
ips+0.002
Neg logit contributions
 decided-0.002
 inside-0.001
 soon-0.001
 should-0.001
 even-0.001
TokenThat
Token ID2504
Feature activation+2.93e-01

Pos logit contributions
 inside+0.014
 decided+0.012
 UFO+0.012
 lighthouse+0.012
 should+0.011
Neg logit contributions
 salon-0.016
ucker-0.016
 Mara-0.015
eh-0.015
 losers-0.014
Token's
Token ID338
Feature activation-7.65e-01

Pos logit contributions
 salon+0.005
ucker+0.005
 pose+0.005
 Dr+0.004
 Mara+0.004
Neg logit contributions
 decided-0.008
 inside-0.008
makers-0.007
 soon-0.007
 mainland-0.007
Token right
Token ID826
Feature activation-6.22e-01

Pos logit contributions
 decided+0.016
 inside+0.015
 UFO+0.014
makers+0.014
col+0.013
Neg logit contributions
 Dr-0.018
ucker-0.018
 Fro-0.018
ilo-0.017
 salon-0.017
Token.
Token ID13
Feature activation+2.59e-01

Pos logit contributions
 generation+0.016
 decided+0.016
 mainland+0.015
 UFO+0.015
 generations+0.015
Neg logit contributions
 my-0.010
 your-0.010
ucker-0.010
 yoga-0.010
uffy-0.010
Token Trees
Token ID34925
Feature activation-5.96e-01

Pos logit contributions
ucker+0.005
 Dr+0.005
 Fro+0.004
 My+0.004
 salon+0.004
Neg logit contributions
 decided-0.007
 inside-0.007
 even-0.006
 UFO-0.006
 soon-0.006
Token don
Token ID836
Feature activation-2.94e+00

Pos logit contributions
 decided+0.015
year+0.015
 UFO+0.014
 generations+0.014
 generation+0.014
Neg logit contributions
 salon-0.011
ucker-0.010
 pose-0.010
 slips-0.010
ips-0.010
Token't
Token ID470
Feature activation-3.15e-01

Pos logit contributions
 decided+0.085
 inside+0.080
 UFO+0.075
 generation+0.073
 mainland+0.073
Neg logit contributions
ucker-0.047
 salon-0.045
ips-0.045
ittens-0.044
 losers-0.043
Token suddenly
Token ID6451
Feature activation+1.11e+00

Pos logit contributions
 decided+0.007
 inside+0.006
 UFO+0.006
col+0.006
 generations+0.006
Neg logit contributions
 salon-0.007
ucker-0.007
 losers-0.007
 pose-0.006
 Dr-0.006
Token move
Token ID1445
Feature activation-3.28e-01

Pos logit contributions
 salon+0.024
 pose+0.023
ucker+0.022
 losers+0.022
 yoga+0.021
Neg logit contributions
year-0.025
 UFO-0.024
 decided-0.024
 mainland-0.022
 inside-0.022
Token or
Token ID393
Feature activation+7.76e-01

Pos logit contributions
 decided+0.007
 UFO+0.007
 mainland+0.007
year+0.006
 lighthouse+0.006
Neg logit contributions
 losers-0.007
 your-0.006
 salon-0.006
 my-0.006
 pose-0.006
Token fall
Token ID2121
Feature activation+2.03e-01

Pos logit contributions
 salon+0.016
 pose+0.015
 yoga+0.014
 losers+0.014
ucker+0.014
Neg logit contributions
 decided-0.019
year-0.018
te-0.018
 inside-0.017
 UFO-0.017
Token not
Token ID407
Feature activation-6.44e-01

Pos logit contributions
 UFO+0.023
year+0.022
arp+0.022
 decided+0.022
col+0.021
Neg logit contributions
 salon-0.018
 losers-0.017
 pose-0.017
 yoga-0.016
 elbow-0.016
Token get
Token ID651
Feature activation-2.55e-02

Pos logit contributions
 decided+0.015
 inside+0.014
col+0.014
 UFO+0.014
year+0.014
Neg logit contributions
ucker-0.013
 salon-0.013
 losers-0.013
 pose-0.013
 Dr-0.012
Token angry
Token ID7954
Feature activation-4.92e-01

Pos logit contributions
 decided+0.001
 UFO+0.001
 inside+0.001
te+0.001
 should+0.001
Neg logit contributions
ucker-0.001
 salon-0.000
 losers-0.000
 Dr-0.000
 Fro-0.000
Token when
Token ID618
Feature activation-8.79e-01

Pos logit contributions
 UFO+0.010
 decided+0.010
Mill+0.010
 mainland+0.009
 lighthouse+0.009
Neg logit contributions
 losers-0.011
 salon-0.011
ips-0.010
 Leo-0.010
ucker-0.010
Token they
Token ID484
Feature activation+9.74e-01

Pos logit contributions
 decided+0.024
col+0.023
 UFO+0.023
year+0.023
 inside+0.022
Neg logit contributions
 yoga-0.014
 salon-0.014
 Dr-0.013
ucker-0.013
 losers-0.013
Token don
Token ID836
Feature activation-2.94e+00

Pos logit contributions
 salon+0.021
ucker+0.020
 losers+0.019
 pose+0.019
ips+0.018
Neg logit contributions
 inside-0.022
 decided-0.022
 UFO-0.021
makers-0.020
Mill-0.020
Token't
Token ID470
Feature activation-9.99e-01

Pos logit contributions
 decided+0.077
 inside+0.074
 should+0.067
 soon+0.066
 even+0.066
Neg logit contributions
 salon-0.057
ucker-0.057
ips-0.052
 Fro-0.051
ittens-0.051
Token fit
Token ID4197
Feature activation-1.93e-01

Pos logit contributions
year+0.024
 decided+0.024
col+0.024
 UFO+0.024
 inside+0.023
Neg logit contributions
ucker-0.019
 salon-0.019
 losers-0.018
 pose-0.018
 choke-0.016
Token.
Token ID13
Feature activation-4.53e-01

Pos logit contributions
 decided+0.004
 UFO+0.004
inct+0.004
 lighthouse+0.004
col+0.004
Neg logit contributions
ips-0.004
ags-0.004
ucker-0.004
 your-0.004
 losers-0.004
Token<|endoftext|>
Token ID50256
Feature activation-4.29e-01

Pos logit contributions
 decided+0.011
 inside+0.011
 UFO+0.009
makers+0.009
 even+0.009
Neg logit contributions
ucker-0.010
 Dr-0.010
 Fro-0.010
 salon-0.010
 Mara-0.010
TokenOnce
Token ID7454
Feature activation-6.73e-02

Pos logit contributions
 decided+0.009
 inside+0.008
 UFO+0.008
 even+0.008
 soon+0.007
Neg logit contributions
ucker-0.010
 salon-0.010
 Fro-0.010
ips-0.010
 Dr-0.010
Token with
Token ID351
Feature activation-7.98e-01

Pos logit contributions
 decided+0.019
 UFO+0.018
makers+0.018
 should+0.017
 mainland+0.017
Neg logit contributions
 salon-0.024
ucker-0.024
 mat-0.022
 Fro-0.022
 pose-0.022
Token lots
Token ID6041
Feature activation-1.00e+00

Pos logit contributions
 decided+0.016
 inside+0.016
makers+0.015
 soon+0.014
 should+0.014
Neg logit contributions
 Fro-0.018
 salon-0.018
ucker-0.018
 Mara-0.017
 Dr-0.017
Token of
Token ID286
Feature activation-2.25e-02

Pos logit contributions
 decided+0.030
 generation+0.029
 UFO+0.028
makers+0.028
edient+0.028
Neg logit contributions
ucker-0.015
ittens-0.013
pill-0.013
 salon-0.012
 toast-0.012
Token water
Token ID1660
Feature activation-1.02e+00

Pos logit contributions
makers+0.001
edient+0.001
 generation+0.001
 decided+0.001
 soon+0.001
Neg logit contributions
 Fro-0.000
ucker-0.000
 toast-0.000
ags-0.000
ittens-0.000
Tokenbirds
Token ID32002
Feature activation-1.75e+00

Pos logit contributions
 generation+0.020
 UFO+0.020
 decided+0.019
year+0.019
 generations+0.019
Neg logit contributions
ucker-0.025
ips-0.023
 Fro-0.023
 salon-0.023
 mat-0.022
Token swimming
Token ID14899
Feature activation-2.93e+00

Pos logit contributions
 generation+0.040
year+0.040
 UFO+0.039
te+0.037
 former+0.037
Neg logit contributions
 swinging-0.034
ucker-0.032
 Fro-0.031
 Leo-0.031
pill-0.031
Token in
Token ID287
Feature activation-1.61e+00

Pos logit contributions
 generation+0.070
 UFO+0.070
 former+0.069
 prayed+0.065
 mainland+0.064
Neg logit contributions
eh-0.054
ags-0.053
 Fro-0.052
ips-0.050
 mats-0.049
Token it
Token ID340
Feature activation-1.36e+00

Pos logit contributions
makers+0.045
works+0.044
edient+0.044
 UFO+0.043
year+0.043
Neg logit contributions
 Leo-0.024
 Fro-0.024
 my-0.021
 your-0.020
 Mara-0.020
Token.
Token ID13
Feature activation-7.61e-01

Pos logit contributions
 UFO+0.029
makers+0.028
 generation+0.028
year+0.027
 decided+0.027
Neg logit contributions
ucker-0.028
 Leo-0.027
ips-0.027
 pose-0.026
 mat-0.026
Token He
Token ID679
Feature activation-8.82e-01

Pos logit contributions
 decided+0.020
 inside+0.020
makers+0.018
 should+0.018
 soon+0.018
Neg logit contributions
 salon-0.013
 Dr-0.013
ucker-0.013
 Mara-0.012
 Leo-0.012
Token looked
Token ID3114
Feature activation-7.66e-01

Pos logit contributions
makers+0.016
 inside+0.015
keeping+0.014
 UFO+0.014
edient+0.014
Neg logit contributions
 Fro-0.023
 salon-0.023
ucker-0.022
 Leo-0.022
 Mara-0.022
Token get
Token ID651
Feature activation-2.05e-01

Pos logit contributions
 UFO+0.005
col+0.005
 decided+0.005
 mainland+0.005
 rooft+0.005
Neg logit contributions
 losers-0.005
 pose-0.005
ucker-0.005
 Fro-0.005
 salon-0.004
Token rewarded
Token ID20945
Feature activation-3.58e-02

Pos logit contributions
 UFO+0.005
 decided+0.005
 generation+0.004
makers+0.004
 lighthouse+0.004
Neg logit contributions
ucker-0.004
 salon-0.004
 Dr-0.004
 losers-0.004
 Fro-0.004
Token and
Token ID290
Feature activation-1.45e-01

Pos logit contributions
 UFO+0.001
 decided+0.001
 lighthouse+0.001
 mainland+0.001
 rooft+0.001
Neg logit contributions
 salon-0.001
ucker-0.001
 pose-0.001
 Dr-0.001
 Mara-0.001
Token when
Token ID618
Feature activation-5.43e-01

Pos logit contributions
 UFO+0.003
arp+0.003
 decided+0.003
 mainland+0.003
makers+0.003
Neg logit contributions
 salon-0.003
ucker-0.003
 pose-0.003
 yoga-0.003
 losers-0.003
Token you
Token ID345
Feature activation+1.58e-01

Pos logit contributions
 decided+0.016
col+0.016
year+0.016
 UFO+0.016
arp+0.016
Neg logit contributions
 yoga-0.008
 salon-0.007
 your-0.007
 Dr-0.007
 pose-0.007
Token don
Token ID836
Feature activation-2.93e+00

Pos logit contributions
 salon+0.003
ucker+0.003
 losers+0.003
 pose+0.003
 Fro+0.003
Neg logit contributions
 decided-0.004
 inside-0.004
 UFO-0.004
makers-0.004
 generations-0.004
Token't
Token ID470
Feature activation-6.11e-01

Pos logit contributions
 decided+0.080
 inside+0.077
 UFO+0.069
 soon+0.069
 should+0.068
Neg logit contributions
ucker-0.053
 salon-0.052
ips-0.048
 Dr-0.047
 Fro-0.047
Token listen
Token ID6004
Feature activation-9.84e-01

Pos logit contributions
year+0.016
 UFO+0.015
 decided+0.015
col+0.015
 inside+0.015
Neg logit contributions
ucker-0.011
 salon-0.011
 pose-0.011
 losers-0.010
 Fro-0.010
Token,
Token ID11
Feature activation-8.25e-01

Pos logit contributions
 UFO+0.028
 decided+0.026
 mainland+0.025
 lighthouse+0.025
col+0.024
Neg logit contributions
ucker-0.016
 yoga-0.015
 salon-0.015
 pose-0.014
 Dr-0.014
Token you
Token ID345
Feature activation-2.95e-01

Pos logit contributions
 UFO+0.023
 decided+0.023
 inside+0.022
year+0.022
makers+0.022
Neg logit contributions
 salon-0.012
 yoga-0.012
ucker-0.012
 Fro-0.011
 pose-0.011
Token get
Token ID651
Feature activation-3.33e-01

Pos logit contributions
 UFO+0.006
col+0.006
 rooft+0.006
Mill+0.006
 mainland+0.006
Neg logit contributions
 losers-0.007
ucker-0.006
 pose-0.006
 salon-0.006
 slips-0.006
Token need
Token ID761
Feature activation-3.62e-01

Pos logit contributions
 decided+0.004
 inside+0.003
 UFO+0.003
makers+0.003
 generations+0.003
Neg logit contributions
 salon-0.005
ucker-0.005
 losers-0.004
 Fro-0.004
 pose-0.004
Token to
Token ID284
Feature activation-1.08e-01

Pos logit contributions
 decided+0.009
 inside+0.008
 should+0.008
 UFO+0.008
makers+0.008
Neg logit contributions
ucker-0.008
 salon-0.008
 Fro-0.007
ips-0.007
 Dr-0.007
Token drink
Token ID4144
Feature activation-4.00e-01

Pos logit contributions
 decided+0.003
 inside+0.003
 UFO+0.003
makers+0.002
 should+0.002
Neg logit contributions
 salon-0.002
ucker-0.002
 pose-0.002
 Fro-0.002
 losers-0.002
Token some
Token ID617
Feature activation-1.02e+00

Pos logit contributions
 UFO+0.009
 decided+0.009
makers+0.009
 generation+0.009
 generations+0.009
Neg logit contributions
ucker-0.008
 Fro-0.008
 Dr-0.008
 salon-0.008
 toast-0.008
Token water
Token ID1660
Feature activation-3.00e-01

Pos logit contributions
 UFO+0.027
 decided+0.026
 generation+0.025
makers+0.025
 inside+0.024
Neg logit contributions
ucker-0.019
 toast-0.019
ittens-0.018
 losers-0.018
etchup-0.017
Token too
Token ID1165
Feature activation-2.93e+00

Pos logit contributions
 decided+0.006
 UFO+0.006
 inside+0.005
 generation+0.005
makers+0.005
Neg logit contributions
 salon-0.008
ucker-0.008
 Fro-0.007
ips-0.007
 pose-0.007
Token."
Token ID526
Feature activation+3.81e-01

Pos logit contributions
 UFO+0.064
 decided+0.060
makers+0.055
inct+0.054
col+0.054
Neg logit contributions
ucker-0.069
 toast-0.064
ittens-0.063
 Dr-0.063
inches-0.062
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
 salon+0.013
ucker+0.013
 pose+0.012
eh+0.012
ips+0.012
Neg logit contributions
 decided-0.005
 inside-0.004
 soon-0.003
 even-0.003
 should-0.003
Token
Token ID198
Feature activation+1.32e-01

Pos logit contributions
 decided+0.004
 inside+0.004
 should+0.004
 even+0.003
 soon+0.003
Neg logit contributions
ucker-0.007
 salon-0.006
 Fro-0.006
 Dr-0.006
ips-0.006
TokenAnna
Token ID31160
Feature activation+1.68e+00

Pos logit contributions
ucker+0.003
 salon+0.003
 Fro+0.003
ips+0.003
ittens+0.003
Neg logit contributions
 decided-0.003
 inside-0.003
 soon-0.003
 should-0.003
 even-0.003
Token and
Token ID290
Feature activation+2.91e+00

Pos logit contributions
ucker+0.047
 salon+0.047
 Fro+0.044
ips+0.044
 Mara+0.043
Neg logit contributions
 decided-0.030
 inside-0.028
 should-0.024
 soon-0.024
 even-0.024
Token will
Token ID481
Feature activation-3.55e-01

Pos logit contributions
Mill+0.000
 generations+0.000
 UFO+0.000
year+0.000
makers+0.000
Neg logit contributions
 salon-0.000
 losers-0.000
 pose-0.000
ow-0.000
 slips-0.000
Token be
Token ID307
Feature activation-1.06e+00

Pos logit contributions
 decided+0.009
 inside+0.009
year+0.009
 UFO+0.008
 generations+0.008
Neg logit contributions
 salon-0.007
 losers-0.006
 pose-0.006
 Dr-0.006
ucker-0.006
Token sad
Token ID6507
Feature activation-6.06e-02

Pos logit contributions
 decided+0.024
 UFO+0.023
makers+0.022
col+0.022
 should+0.022
Neg logit contributions
ags-0.021
ucker-0.021
 losers-0.021
 Dr-0.021
eh-0.020
Token if
Token ID611
Feature activation-5.09e-01

Pos logit contributions
 UFO+0.002
year+0.002
col+0.002
 decided+0.002
 mainland+0.001
Neg logit contributions
 salon-0.001
 your-0.001
ucker-0.001
 my-0.001
 pose-0.001
Token they
Token ID484
Feature activation+8.72e-01

Pos logit contributions
 decided+0.013
 inside+0.012
col+0.012
 UFO+0.012
makers+0.011
Neg logit contributions
 salon-0.010
 Dr-0.010
 Leo-0.009
 yoga-0.009
 Mara-0.009
Token don
Token ID836
Feature activation-2.93e+00

Pos logit contributions
 salon+0.017
ucker+0.016
 losers+0.016
 pose+0.016
ow+0.015
Neg logit contributions
 decided-0.021
 UFO-0.021
 inside-0.020
 generations-0.020
Mill-0.020
Token't
Token ID470
Feature activation-4.08e-01

Pos logit contributions
 decided+0.079
 inside+0.076
 should+0.068
 UFO+0.068
 soon+0.067
Neg logit contributions
 salon-0.054
ucker-0.054
ips-0.050
ittens-0.048
 Dr-0.048
Token get
Token ID651
Feature activation-8.60e-02

Pos logit contributions
year+0.010
 generations+0.010
 rooft+0.010
 UFO+0.010
inct+0.010
Neg logit contributions
 salon-0.007
ucker-0.007
 losers-0.007
 pose-0.007
ow-0.007
Token the
Token ID262
Feature activation+9.60e-01

Pos logit contributions
 decided+0.002
 UFO+0.002
inct+0.002
col+0.002
Maybe+0.002
Neg logit contributions
 your-0.002
 losers-0.002
ucker-0.002
 Dr-0.002
 my-0.001
Token letters
Token ID7475
Feature activation+8.06e-01

Pos logit contributions
ags+0.020
 salon+0.020
 Dr+0.019
 toast+0.019
 losers+0.019
Neg logit contributions
 decided-0.022
makers-0.020
year-0.020
edient-0.019
 soon-0.019
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
 salon+0.020
ucker+0.018
 losers+0.018
 pose+0.018
 Dr+0.018
Neg logit contributions
 decided-0.015
 UFO-0.014
 generations-0.014
col-0.014
makers-0.014
Token.
Token ID13
Feature activation+7.74e-01

Pos logit contributions
ucker+0.009
 salon+0.009
ittens+0.009
 pose+0.009
 mat+0.009
Neg logit contributions
 decided-0.006
 inside-0.005
 UFO-0.005
 should-0.005
 mainland-0.005
Token Hold
Token ID9340
Feature activation+3.63e-01

Pos logit contributions
 Dr+0.014
 Fro+0.014
 salon+0.014
 Leo+0.013
ucker+0.013
Neg logit contributions
 decided-0.022
 inside-0.021
makers-0.019
lling-0.019
 UFO-0.019
Token on
Token ID319
Feature activation+2.59e-01

Pos logit contributions
ucker+0.010
 salon+0.010
 Fro+0.009
ilo+0.009
 Dr+0.009
Neg logit contributions
 inside-0.007
 decided-0.007
 soon-0.006
 should-0.006
 even-0.006
Token tight
Token ID5381
Feature activation-1.25e-02

Pos logit contributions
 salon+0.006
ucker+0.006
ittens+0.006
 Fro+0.005
 mat+0.005
Neg logit contributions
 decided-0.006
 inside-0.005
 should-0.005
 UFO-0.005
 even-0.005
Token and
Token ID290
Feature activation-1.97e-01

Pos logit contributions
 decided+0.000
 UFO+0.000
 generation+0.000
 should+0.000
 mainland+0.000
Neg logit contributions
 salon-0.000
ittens-0.000
ips-0.000
ags-0.000
ucker-0.000
Token don
Token ID836
Feature activation-2.92e+00

Pos logit contributions
 decided+0.005
 UFO+0.005
 generations+0.005
 inside+0.005
makers+0.005
Neg logit contributions
 salon-0.004
 pose-0.004
 mat-0.003
ucker-0.003
 elbow-0.003
Token't
Token ID470
Feature activation-8.04e-01

Pos logit contributions
 decided+0.069
 inside+0.066
 should+0.060
 soon+0.060
 even+0.060
Neg logit contributions
ucker-0.065
 salon-0.065
 Fro-0.061
 Mara-0.059
ilo-0.058
Token let
Token ID1309
Feature activation+2.21e-01

Pos logit contributions
 decided+0.020
 inside+0.019
 generations+0.018
 UFO+0.018
makers+0.018
Neg logit contributions
 salon-0.016
ucker-0.015
 pose-0.015
 losers-0.014
 Fro-0.014
Token go
Token ID467
Feature activation-1.13e+00

Pos logit contributions
 salon+0.005
ucker+0.005
 Fro+0.005
ips+0.005
 Dr+0.005
Neg logit contributions
 decided-0.005
 inside-0.005
 should-0.004
 soon-0.004
 even-0.004
Token."
Token ID526
Feature activation+3.84e-01

Pos logit contributions
 decided+0.022
 UFO+0.019
 generations+0.019
 should+0.019
 generation+0.018
Neg logit contributions
 salon-0.029
ips-0.028
ucker-0.027
 Fro-0.027
ittens-0.027
Token Mom
Token ID11254
Feature activation+1.23e+00

Pos logit contributions
eh+0.015
ucker+0.015
 losers+0.015
 pose+0.015
 salon+0.014
Neg logit contributions
 decided-0.004
 inside-0.003
 soon-0.003
 even-0.003
 should-0.002
Token make
Token ID787
Feature activation-3.75e-01

Pos logit contributions
 decided+0.011
 inside+0.010
 UFO+0.009
 generations+0.009
 should+0.009
Neg logit contributions
 salon-0.010
ucker-0.009
 pose-0.009
 Fro-0.009
 losers-0.009
Token us
Token ID514
Feature activation-4.71e-01

Pos logit contributions
 decided+0.010
 inside+0.009
 UFO+0.009
year+0.008
 mainland+0.008
Neg logit contributions
 Dr-0.007
 salon-0.007
 yoga-0.007
ucker-0.007
 Fro-0.007
Token sick
Token ID6639
Feature activation-6.51e-01

Pos logit contributions
 decided+0.011
 inside+0.010
 should+0.010
 UFO+0.010
 generations+0.010
Neg logit contributions
 salon-0.010
ucker-0.010
 pose-0.009
 Fro-0.009
 losers-0.009
Token if
Token ID611
Feature activation-7.98e-01

Pos logit contributions
 UFO+0.015
 decided+0.014
year+0.014
Mill+0.014
 mainland+0.013
Neg logit contributions
 pose-0.013
 losers-0.013
 salon-0.013
 mat-0.013
 toast-0.013
Token we
Token ID356
Feature activation-1.51e-02

Pos logit contributions
 decided+0.021
 inside+0.020
 UFO+0.020
col+0.020
makers+0.019
Neg logit contributions
 salon-0.014
 Dr-0.013
 yoga-0.013
 Leo-0.013
 Fro-0.013
Token don
Token ID836
Feature activation-2.91e+00

Pos logit contributions
 UFO+0.000
 generations+0.000
 decided+0.000
makers+0.000
 inside+0.000
Neg logit contributions
 pose-0.000
 salon-0.000
 losers-0.000
ucker-0.000
 Fro-0.000
Token't
Token ID470
Feature activation-6.37e-01

Pos logit contributions
 decided+0.079
 inside+0.076
 should+0.068
 UFO+0.067
 soon+0.067
Neg logit contributions
 salon-0.054
ucker-0.054
ips-0.049
ittens-0.048
 Fro-0.048
Token wear
Token ID5806
Feature activation+2.80e-01

Pos logit contributions
 decided+0.016
year+0.016
 UFO+0.015
 generations+0.015
 generation+0.014
Neg logit contributions
ucker-0.012
 pose-0.012
 salon-0.012
 losers-0.011
ow-0.011
Token jackets
Token ID35129
Feature activation-1.30e+00

Pos logit contributions
 Velvet+0.006
ags+0.006
 yoga+0.005
 losers+0.005
ucker+0.005
Neg logit contributions
 decided-0.007
 UFO-0.006
year-0.006
makers-0.006
 should-0.006
Token."
Token ID526
Feature activation+1.57e-01

Pos logit contributions
 UFO+0.027
 decided+0.026
inct+0.026
 generations+0.026
col+0.024
Neg logit contributions
 salon-0.032
ucker-0.028
 losers-0.027
 Dr-0.027
 pose-0.026
Token
Token ID198
Feature activation-8.09e-02

Pos logit contributions
 salon+0.006
ucker+0.005
 losers+0.005
 pose+0.005
eh+0.005
Neg logit contributions
 decided-0.002
 inside-0.002
 soon-0.001
 should-0.001
 even-0.001
Token a
Token ID257
Feature activation-2.84e-02

Pos logit contributions
 decided+0.017
 inside+0.016
 should+0.015
 UFO+0.015
makers+0.015
Neg logit contributions
 salon-0.014
ucker-0.014
 Dr-0.014
 Fro-0.014
 losers-0.013
Token loud
Token ID7812
Feature activation-3.36e-01

Pos logit contributions
 decided+0.001
 inside+0.001
 should+0.000
 even+0.000
 soon+0.000
Neg logit contributions
ucker-0.001
 salon-0.001
ips-0.001
 Fro-0.001
 Mara-0.001
Token noise
Token ID7838
Feature activation-3.10e-01

Pos logit contributions
 decided+0.007
 inside+0.006
 should+0.006
 soon+0.006
 even+0.006
Neg logit contributions
 salon-0.008
ucker-0.008
 pose-0.008
 Fro-0.008
 Dr-0.008
Token.
Token ID13
Feature activation-2.60e-01

Pos logit contributions
 decided+0.006
 UFO+0.005
 mainland+0.005
 generation+0.005
 generations+0.005
Neg logit contributions
 salon-0.008
 pose-0.008
 Dr-0.007
 mat-0.007
ucker-0.007
Token Lily
Token ID20037
Feature activation+1.30e+00

Pos logit contributions
 decided+0.007
 inside+0.007
 even+0.006
 UFO+0.006
 soon+0.006
Neg logit contributions
ucker-0.005
 salon-0.005
 Dr-0.005
 Fro-0.005
ips-0.004
Token and
Token ID290
Feature activation+2.28e+00

Pos logit contributions
 salon+0.036
ucker+0.036
 Fro+0.033
ips+0.033
 pose+0.033
Neg logit contributions
 decided-0.023
 inside-0.022
 should-0.019
 soon-0.019
 even-0.019
Token Tom
Token ID4186
Feature activation+6.92e-01

Pos logit contributions
ucker+0.045
ittens+0.040
ilo+0.039
 salon+0.038
eh+0.038
Neg logit contributions
 inside-0.068
 decided-0.066
 even-0.059
 soon-0.059
 next-0.057
Token heard
Token ID2982
Feature activation-9.07e-01

Pos logit contributions
 salon+0.015
ucker+0.015
 elbow+0.014
 Fro+0.014
 losers+0.013
Neg logit contributions
 generation-0.015
 generations-0.014
 decided-0.014
 inside-0.014
 mainland-0.014
Token someone
Token ID2130
Feature activation+4.85e-01

Pos logit contributions
 decided+0.029
edient+0.026
 inside+0.026
 should+0.026
makers+0.026
Neg logit contributions
 Dr-0.013
 Leo-0.012
 Luna-0.012
 salon-0.012
 Fro-0.012
Token scream
Token ID8196
Feature activation-5.01e-01

Pos logit contributions
 salon+0.011
ucker+0.011
 Fro+0.010
ips+0.010
ittens+0.010
Neg logit contributions
 decided-0.011
 inside-0.011
 should-0.010
 even-0.010
 soon-0.009
Token.
Token ID13
Feature activation-5.69e-01

Pos logit contributions
 mainland+0.013
 UFO+0.012
 decided+0.012
 generation+0.012
 generations+0.012
Neg logit contributions
 Fro-0.009
 Dr-0.009
 Luna-0.009
 pose-0.009
 Leo-0.008
Token."
Token ID526
Feature activation-9.79e-02

Pos logit contributions
 UFO+0.031
 decided+0.029
inct+0.027
 generation+0.027
 lighthouse+0.026
Neg logit contributions
 salon-0.028
 your-0.027
 losers-0.026
 toast-0.026
ags-0.026
Token
Token ID198
Feature activation-2.96e-01

Pos logit contributions
 decided+0.002
 inside+0.001
 should+0.001
 soon+0.001
 even+0.001
Neg logit contributions
 salon-0.003
ucker-0.003
ips-0.003
 pose-0.003
 losers-0.003
Token
Token ID198
Feature activation+1.47e-01

Pos logit contributions
 decided+0.006
 inside+0.006
 UFO+0.005
 should+0.005
 even+0.005
Neg logit contributions
ucker-0.007
 salon-0.007
 Fro-0.007
 Dr-0.007
 Velvet-0.007
TokenL
Token ID43
Feature activation+4.37e-02

Pos logit contributions
 salon+0.004
ucker+0.003
 Fro+0.003
ips+0.003
ittens+0.003
Neg logit contributions
 decided-0.003
 inside-0.003
 should-0.003
 soon-0.003
 even-0.003
Tokenily
Token ID813
Feature activation+9.85e-01

Pos logit contributions
ucker+0.001
 salon+0.001
 Dr+0.001
ittens+0.001
 Fro+0.001
Neg logit contributions
 decided-0.001
 inside-0.001
 should-0.001
 even-0.000
makers-0.000
Token and
Token ID290
Feature activation+2.31e+00

Pos logit contributions
 salon+0.027
ucker+0.026
 Fro+0.025
 Mara+0.025
 Dr+0.024
Neg logit contributions
 inside-0.017
 decided-0.017
makers-0.015
 UFO-0.015
 should-0.015
Token Ben
Token ID3932
Feature activation+2.20e-01

Pos logit contributions
ucker+0.046
ittens+0.042
eh+0.039
 salon+0.038
ags+0.037
Neg logit contributions
 decided-0.068
 inside-0.065
 soon-0.059
 next-0.059
 even-0.059
Token felt
Token ID2936
Feature activation-1.20e+00

Pos logit contributions
 salon+0.004
ucker+0.004
 Fro+0.004
 Dr+0.004
 Mara+0.004
Neg logit contributions
 inside-0.005
makers-0.005
 generation-0.005
 generations-0.005
 UFO-0.005
Token bad
Token ID2089
Feature activation+7.15e-01

Pos logit contributions
 decided+0.028
 inside+0.027
 soon+0.024
 should+0.024
 UFO+0.024
Neg logit contributions
ucker-0.026
 salon-0.026
 Fro-0.024
 Dr-0.024
 Mara-0.024
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
 salon+0.016
ucker+0.016
 Dr+0.016
 Fro+0.016
ips+0.015
Neg logit contributions
 decided-0.015
 UFO-0.014
 mainland-0.014
 generation-0.013
col-0.013
Token They
Token ID1119
Feature activation+6.43e-01

Pos logit contributions
 decided+0.004
 inside+0.004
 UFO+0.003
makers+0.003
 even+0.003
Neg logit contributions
ucker-0.003
 Fro-0.003
 Mara-0.003
 Dr-0.003
 Leo-0.003
Token He
Token ID679
Feature activation+4.63e-01

Pos logit contributions
 decided+0.006
 inside+0.006
 should+0.005
 soon+0.005
 even+0.005
Neg logit contributions
 salon-0.005
ucker-0.005
 Fro-0.005
ips-0.005
ittens-0.004
Token is
Token ID318
Feature activation+6.66e-02

Pos logit contributions
 salon+0.012
ucker+0.011
ittens+0.011
 Fro+0.011
ags+0.011
Neg logit contributions
 decided-0.010
 inside-0.010
 should-0.009
 even-0.009
 soon-0.008
Token proud
Token ID6613
Feature activation+1.56e+00

Pos logit contributions
ucker+0.002
 salon+0.002
pill+0.002
ittens+0.002
ilo+0.002
Neg logit contributions
 inside-0.001
 decided-0.001
 soon-0.001
 even-0.001
 next-0.001
Token of
Token ID286
Feature activation-2.75e-02

Pos logit contributions
ucker+0.030
 salon+0.030
 Dr+0.029
 Fro+0.028
ilo+0.028
Neg logit contributions
 decided-0.040
 inside-0.037
 UFO-0.036
 generation-0.036
makers-0.036
Token Lily
Token ID20037
Feature activation+1.11e+00

Pos logit contributions
 decided+0.001
 inside+0.001
 should+0.001
 even+0.001
 UFO+0.001
Neg logit contributions
 salon-0.001
ucker-0.001
 Fro-0.001
 Dr-0.001
ips-0.001
Token and
Token ID290
Feature activation+2.38e+00

Pos logit contributions
 salon+0.032
ucker+0.032
 Fro+0.030
 Dr+0.030
ips+0.030
Neg logit contributions
 decided-0.019
 inside-0.017
 should-0.015
 UFO-0.015
 soon-0.015
Token Tom
Token ID4186
Feature activation+3.70e-01

Pos logit contributions
ucker+0.043
 salon+0.042
eh+0.039
 pose+0.039
ittens+0.039
Neg logit contributions
 inside-0.072
 decided-0.070
 soon-0.062
 next-0.061
 even-0.060
Token.
Token ID13
Feature activation-6.58e-01

Pos logit contributions
 salon+0.011
ucker+0.011
 Dr+0.011
ips+0.011
 Fro+0.011
Neg logit contributions
 decided-0.006
 inside-0.005
 UFO-0.004
 should-0.004
 soon-0.004
Token He
Token ID679
Feature activation+6.47e-01

Pos logit contributions
 decided+0.018
 inside+0.017
 should+0.016
 soon+0.015
 even+0.015
Neg logit contributions
ucker-0.012
 salon-0.012
 Dr-0.012
 Fro-0.011
 Mara-0.011
Token knows
Token ID4206
Feature activation-5.35e-01

Pos logit contributions
 salon+0.016
ucker+0.015
ittens+0.015
 Fro+0.014
ips+0.014
Neg logit contributions
 decided-0.015
 inside-0.014
 should-0.013
 even-0.013
 soon-0.012
Token they
Token ID484
Feature activation-8.21e-02

Pos logit contributions
 decided+0.014
 UFO+0.013
 inside+0.012
edient+0.012
makers+0.012
Neg logit contributions
 Dr-0.011
 salon-0.010
 Leo-0.010
 Fro-0.010
 yoga-0.010
Token Lucy
Token ID22162
Feature activation+6.50e-01

Pos logit contributions
 decided+0.013
 inside+0.012
 UFO+0.012
 soon+0.011
 mainland+0.011
Neg logit contributions
 Dr-0.009
 salon-0.009
ucker-0.009
 Mara-0.008
 Velvet-0.008
Token's
Token ID338
Feature activation-1.56e-01

Pos logit contributions
 salon+0.020
ucker+0.019
 Fro+0.018
 Mara+0.018
 Dr+0.018
Neg logit contributions
 decided-0.010
 inside-0.009
 UFO-0.008
 should-0.008
makers-0.008
Token mom
Token ID1995
Feature activation-3.22e-01

Pos logit contributions
 decided+0.003
 inside+0.003
 should+0.002
 even+0.002
 soon+0.002
Neg logit contributions
ucker-0.004
 salon-0.004
 Fro-0.004
ips-0.004
 Mara-0.004
Token was
Token ID373
Feature activation+6.30e-02

Pos logit contributions
 inside+0.005
 decided+0.005
 UFO+0.005
 generation+0.005
makers+0.005
Neg logit contributions
 salon-0.009
ucker-0.009
 Dr-0.008
 Fro-0.008
 Mara-0.008
Token not
Token ID407
Feature activation+5.25e-01

Pos logit contributions
ucker+0.002
ittens+0.002
ilo+0.002
 salon+0.002
 mat+0.002
Neg logit contributions
 decided-0.001
 inside-0.001
 even-0.001
 soon-0.001
 next-0.001
Token troubled
Token ID17840
Feature activation+2.36e+00

Pos logit contributions
 salon+0.014
ucker+0.014
ips+0.013
ilo+0.013
ittens+0.013
Neg logit contributions
 decided-0.011
 inside-0.011
 even-0.009
 soon-0.009
 should-0.009
Token anymore
Token ID7471
Feature activation+2.46e-01

Pos logit contributions
 salon+0.052
ucker+0.050
 Dr+0.047
 Fro+0.046
ips+0.046
Neg logit contributions
 decided-0.055
 inside-0.051
 UFO-0.049
 mainland-0.048
 should-0.047
Token,
Token ID11
Feature activation-6.17e-01

Pos logit contributions
 salon+0.004
 yoga+0.004
 Dr+0.004
 pose+0.004
 mat+0.004
Neg logit contributions
 UFO-0.006
 mainland-0.006
 decided-0.006
 lighthouse-0.006
col-0.006
Token and
Token ID290
Feature activation-3.04e-01

Pos logit contributions
 decided+0.014
 inside+0.014
 UFO+0.012
makers+0.012
 mainland+0.012
Neg logit contributions
 salon-0.014
ucker-0.013
 Dr-0.013
 Fro-0.013
 Mara-0.013
Token they
Token ID484
Feature activation-2.11e-01

Pos logit contributions
 decided+0.008
 inside+0.008
 UFO+0.007
makers+0.007
 should+0.007
Neg logit contributions
 salon-0.006
ucker-0.005
 Dr-0.005
 Mara-0.005
 Fro-0.005
Token all
Token ID477
Feature activation+4.93e-01

Pos logit contributions
makers+0.004
 generations+0.004
keeping+0.004
 UFO+0.004
 rooft+0.004
Neg logit contributions
 salon-0.005
 Dr-0.005
 losers-0.004
 Fro-0.004
 Leo-0.004
Token Nicola
Token ID40396
Feature activation-1.14e+00

Pos logit contributions
ucker+0.009
 salon+0.009
eh+0.009
 losers+0.009
 elbow+0.009
Neg logit contributions
 decided-0.011
 inside-0.011
 should-0.010
 even-0.010
 soon-0.010
Token wanted
Token ID2227
Feature activation+9.00e-01

Pos logit contributions
 UFO+0.022
makers+0.021
 inside+0.021
 mainland+0.021
keeping+0.021
Neg logit contributions
 salon-0.029
 Mara-0.026
 pose-0.025
 Dr-0.025
ucker-0.024
Token to
Token ID284
Feature activation+4.54e-01

Pos logit contributions
 salon+0.018
ucker+0.017
 Fro+0.017
 Dr+0.017
 Mara+0.016
Neg logit contributions
 decided-0.023
 inside-0.022
 should-0.020
 UFO-0.020
makers-0.020
Token put
Token ID1234
Feature activation+1.03e+00

Pos logit contributions
 salon+0.010
 pose+0.009
ucker+0.009
 Fro+0.009
 losers+0.009
Neg logit contributions
 decided-0.011
 inside-0.010
makers-0.009
 UFO-0.009
 generations-0.009
Token a
Token ID257
Feature activation+6.68e-01

Pos logit contributions
 salon+0.015
ucker+0.014
 Dr+0.014
ips+0.014
 Fro+0.014
Neg logit contributions
 decided-0.032
 inside-0.030
 should-0.028
 UFO-0.028
makers-0.028
Token smile
Token ID8212
Feature activation+2.35e+00

Pos logit contributions
 salon+0.014
ucker+0.014
 Fro+0.013
 pose+0.013
 losers+0.012
Neg logit contributions
 decided-0.017
 inside-0.016
 should-0.014
 soon-0.014
makers-0.014
Token on
Token ID319
Feature activation+1.72e-01

Pos logit contributions
 salon+0.061
 pose+0.057
ucker+0.056
 mat+0.055
 Mara+0.055
Neg logit contributions
 decided-0.045
 inside-0.040
 UFO-0.040
 mainland-0.040
makers-0.039
Token her
Token ID607
Feature activation-1.86e-01

Pos logit contributions
ucker+0.003
 salon+0.003
ips+0.003
 Fro+0.003
eh+0.003
Neg logit contributions
 decided-0.005
 inside-0.005
 even-0.005
 should-0.004
 soon-0.004
Token little
Token ID1310
Feature activation+3.42e-01

Pos logit contributions
 decided+0.005
 inside+0.005
 should+0.004
 soon+0.004
 even+0.004
Neg logit contributions
 salon-0.004
ucker-0.004
 Fro-0.003
ittens-0.003
ips-0.003
Token face
Token ID1986
Feature activation+9.08e-01

Pos logit contributions
 salon+0.007
 pose+0.006
 mat+0.006
ucker+0.006
 elbow+0.006
Neg logit contributions
 decided-0.009
 inside-0.008
 should-0.008
makers-0.008
 UFO-0.008
Token so
Token ID523
Feature activation-9.51e-01

Pos logit contributions
 salon+0.020
 pose+0.018
 Mara+0.017
 yoga+0.017
 Dr+0.017
Neg logit contributions
 UFO-0.023
 mainland-0.022
makers-0.021
 lighthouse-0.021
inct-0.020
Token."
Token ID526
Feature activation+2.57e-01

Pos logit contributions
 salon+0.011
ucker+0.011
 pose+0.011
 mat+0.010
 losers+0.010
Neg logit contributions
 decided-0.008
 UFO-0.008
 inside-0.007
year-0.007
 generation-0.007
Token Sara
Token ID24799
Feature activation+1.68e+00

Pos logit contributions
 salon+0.005
ucker+0.005
 pose+0.004
ips+0.004
 losers+0.004
Neg logit contributions
 decided-0.007
 inside-0.007
 soon-0.006
 even-0.006
 should-0.006
Token said
Token ID531
Feature activation+8.38e-01

Pos logit contributions
 salon+0.050
ucker+0.050
 Fro+0.047
ips+0.047
ittens+0.047
Neg logit contributions
 decided-0.027
 inside-0.025
 should-0.021
 soon-0.021
 even-0.021
Token.
Token ID13
Feature activation+6.27e-01

Pos logit contributions
ucker+0.026
 salon+0.026
 Fro+0.025
ips+0.025
ittens+0.024
Neg logit contributions
 decided-0.012
 inside-0.011
 should-0.009
 UFO-0.009
 soon-0.009
Token
Token ID198
Feature activation+1.35e-01

Pos logit contributions
 salon+0.019
ucker+0.019
 losers+0.018
 Fro+0.018
 pose+0.018
Neg logit contributions
 decided-0.010
 inside-0.010
 should-0.008
 soon-0.008
 even-0.008
Token
Token ID198
Feature activation+6.15e-01

Pos logit contributions
 salon+0.004
ucker+0.004
 Fro+0.004
ips+0.004
 pose+0.004
Neg logit contributions
 decided-0.002
 inside-0.002
 should-0.002
 soon-0.002
 even-0.002
Token"
Token ID1
Feature activation-3.07e-01

Pos logit contributions
 salon+0.018
ucker+0.018
 Fro+0.017
ips+0.017
 losers+0.017
Neg logit contributions
 decided-0.010
 inside-0.010
 soon-0.008
 should-0.008
 even-0.008
TokenThat
Token ID2504
Feature activation-6.42e-02

Pos logit contributions
 decided+0.005
 inside+0.005
 should+0.004
 soon+0.004
 even+0.004
Neg logit contributions
 salon-0.009
ucker-0.009
 Fro-0.008
ips-0.008
 Mara-0.008
Token's
Token ID338
Feature activation-1.01e+00

Pos logit contributions
 decided+0.001
 inside+0.001
 soon+0.001
makers+0.001
 should+0.001
Neg logit contributions
 salon-0.001
ucker-0.001
 Fro-0.001
 pose-0.001
 Mara-0.001
Token right
Token ID826
Feature activation-4.88e-01

Pos logit contributions
 decided+0.020
 inside+0.019
 UFO+0.016
 should+0.016
makers+0.016
Neg logit contributions
ucker-0.026
 salon-0.025
 Dr-0.025
 Fro-0.025
 Mara-0.024
Token.
Token ID13
Feature activation+9.01e-01

Pos logit contributions
 generation+0.013
 decided+0.013
 mainland+0.012
 UFO+0.012
 generations+0.012
Neg logit contributions
ucker-0.008
 Dr-0.008
 my-0.008
ilo-0.008
 your-0.008
Token.
Token ID13
Feature activation+4.14e-01

Pos logit contributions
 decided+0.009
 UFO+0.008
makers+0.008
 generations+0.008
 should+0.007
Neg logit contributions
 salon-0.013
ips-0.012
ucker-0.011
 Dr-0.011
 mat-0.011
Token OK
Token ID7477
Feature activation-1.41e+00

Pos logit contributions
ucker+0.009
 salon+0.009
 Dr+0.009
 Fro+0.008
 losers+0.008
Neg logit contributions
 decided-0.010
 inside-0.010
makers-0.009
 UFO-0.009
 even-0.008
Token?"
Token ID1701
Feature activation+5.39e-01

Pos logit contributions
 decided+0.040
col+0.038
 UFO+0.038
 generations+0.038
 generation+0.037
Neg logit contributions
ucker-0.023
 salon-0.022
 Dr-0.020
ips-0.020
 pose-0.020
Token
Token ID198
Feature activation-8.54e-02

Pos logit contributions
eh+0.020
 losers+0.020
ucker+0.020
 salon+0.019
ags+0.019
Neg logit contributions
 decided-0.005
 inside-0.005
 soon-0.004
 next-0.003
 should-0.003
Token
Token ID198
Feature activation+2.33e-01

Pos logit contributions
 decided+0.001
 inside+0.001
 should+0.001
 soon+0.001
 even+0.001
Neg logit contributions
 salon-0.003
ucker-0.003
 losers-0.002
ips-0.002
 Fro-0.002
TokenTom
Token ID13787
Feature activation+1.17e+00

Pos logit contributions
ucker+0.006
 salon+0.006
 Fro+0.006
 losers+0.006
ags+0.006
Neg logit contributions
 decided-0.005
 inside-0.005
 soon-0.004
 should-0.004
 next-0.004
Token and
Token ID290
Feature activation+2.24e+00

Pos logit contributions
 salon+0.041
ittens+0.041
ucker+0.040
eh+0.040
 Fro+0.039
Neg logit contributions
 decided-0.013
 inside-0.013
 should-0.009
 even-0.009
 next-0.008
Token Mia
Token ID32189
Feature activation+1.84e+00

Pos logit contributions
ucker+0.047
 salon+0.043
ittens+0.042
ags+0.041
ips+0.040
Neg logit contributions
 decided-0.060
 inside-0.059
 soon-0.056
 next-0.053
 even-0.052
Token smile
Token ID8212
Feature activation+4.96e-02

Pos logit contributions
 salon+0.032
ucker+0.030
 pose+0.028
 Dr+0.028
ags+0.028
Neg logit contributions
 decided-0.051
 inside-0.049
 generations-0.047
 UFO-0.047
makers-0.047
Token.
Token ID13
Feature activation-4.07e-01

Pos logit contributions
ucker+0.001
 Dr+0.001
ips+0.001
ags+0.001
 pose+0.001
Neg logit contributions
 UFO-0.001
col-0.001
 generation-0.001
Mill-0.001
makers-0.001
Token They
Token ID1119
Feature activation+1.11e+00

Pos logit contributions
 decided+0.009
 inside+0.008
 even+0.007
 should+0.007
makers+0.007
Neg logit contributions
ucker-0.010
 salon-0.010
 Fro-0.009
 Dr-0.009
ips-0.009
Token.
Token ID13
Feature activation-4.43e-01

Pos logit contributions
 decided+0.002
 inside+0.002
 UFO+0.002
makers+0.002
 mainland+0.002
Neg logit contributions
 salon-0.003
ucker-0.003
 pose-0.003
ips-0.003
 mat-0.003
Token One
Token ID1881
Feature activation+6.57e-01

Pos logit contributions
 decided+0.008
 inside+0.007
 should+0.006
 even+0.006
 soon+0.006
Neg logit contributions
ucker-0.013
 salon-0.013
ittens-0.012
ips-0.012
 Fro-0.012
Token day
Token ID1110
Feature activation+8.49e-01

Pos logit contributions
 salon+0.022
ucker+0.022
 Fro+0.021
 pose+0.020
ips+0.020
Neg logit contributions
 decided-0.008
 inside-0.008
 should-0.006
makers-0.006
 soon-0.006
Token,
Token ID11
Feature activation+2.17e-01

Pos logit contributions
 salon+0.026
ucker+0.026
ags+0.024
 losers+0.024
ips+0.024
Neg logit contributions
 inside-0.014
 decided-0.014
 soon-0.012
 should-0.011
 even-0.011
Token Anna
Token ID11735
Feature activation-8.14e-02

Pos logit contributions
ucker+0.007
 salon+0.007
ags+0.007
ips+0.007
eh+0.007
Neg logit contributions
 decided-0.004
 inside-0.003
 soon-0.003
 even-0.003
 should-0.002
Token wants
Token ID3382
Feature activation+1.59e-01

Pos logit contributions
 decided+0.002
 inside+0.002
 should+0.002
 soon+0.002
 even+0.002
Neg logit contributions
 salon-0.002
ucker-0.002
 Fro-0.002
ips-0.002
 Mara-0.002
Token to
Token ID284
Feature activation+8.38e-01

Pos logit contributions
ucker+0.004
 salon+0.004
ips+0.004
ittens+0.004
 Fro+0.004
Neg logit contributions
 decided-0.004
 inside-0.003
 even-0.003
 soon-0.003
 should-0.003
Token play
Token ID711
Feature activation+4.27e-01

Pos logit contributions
ucker+0.024
 salon+0.023
ips+0.022
ilo+0.022
ittens+0.022
Neg logit contributions
 decided-0.016
 inside-0.015
 even-0.013
 should-0.013
 soon-0.013
Token with
Token ID351
Feature activation+7.56e-01

Pos logit contributions
ucker+0.015
 salon+0.014
 Mara+0.013
 losers+0.013
ittens+0.013
Neg logit contributions
 inside-0.007
 decided-0.006
 even-0.005
 soon-0.005
 next-0.005
Token Ben
Token ID3932
Feature activation-5.02e-01

Pos logit contributions
ucker+0.020
eh+0.018
IC+0.018
 losers+0.017
 salon+0.017
Neg logit contributions
 inside-0.016
 even-0.015
 soon-0.014
 decided-0.013
 next-0.013
Token at
Token ID379
Feature activation+3.91e-01

Pos logit contributions
 decided+0.008
 inside+0.007
 should+0.006
 even+0.006
 soon+0.006
Neg logit contributions
ucker-0.015
 salon-0.015
 Fro-0.014
ips-0.014
ittens-0.014
Token,
Token ID11
Feature activation+7.21e-01

Pos logit contributions
ucker+0.024
 Dr+0.021
 salon+0.021
etchup+0.020
okes+0.019
Neg logit contributions
 UFO-0.029
 generation-0.029
 mainland-0.028
 generations-0.028
 decided-0.028
Token I
Token ID314
Feature activation-1.15e-01

Pos logit contributions
 salon+0.016
ucker+0.015
 Dr+0.015
 Fro+0.014
 Mara+0.014
Neg logit contributions
 decided-0.016
 inside-0.015
 generations-0.015
makers-0.015
 UFO-0.014
Token don
Token ID836
Feature activation-1.36e+00

Pos logit contributions
 UFO+0.003
 rooft+0.003
 generations+0.003
 mainland+0.003
keeping+0.003
Neg logit contributions
ucker-0.002
 salon-0.002
etchup-0.002
 pose-0.002
 slips-0.002
Token't
Token ID470
Feature activation-2.39e-01

Pos logit contributions
 decided+0.035
 inside+0.033
 should+0.030
 soon+0.030
 even+0.030
Neg logit contributions
 salon-0.028
ucker-0.028
 Fro-0.025
ips-0.025
ittens-0.025
Token know
Token ID760
Feature activation+5.82e-01

Pos logit contributions
 decided+0.006
inct+0.006
 UFO+0.006
 inside+0.006
 mainland+0.006
Neg logit contributions
 losers-0.005
 salon-0.005
 pose-0.004
ucker-0.004
 Fro-0.004
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 salon+0.014
 Dr+0.013
 yoga+0.013
ucker+0.012
 Leo+0.012
Neg logit contributions
 decided-0.013
 UFO-0.012
 mainland-0.011
 inside-0.011
makers-0.011
Token He
Token ID679
Feature activation+1.27e-01

Pos logit contributions
 salon+0.018
 Dr+0.018
 Leo+0.017
 Mara+0.017
ucker+0.017
Neg logit contributions
 decided-0.033
 inside-0.032
lling-0.030
 even-0.030
 generations-0.030
Token was
Token ID373
Feature activation+2.65e-01

Pos logit contributions
ucker+0.003
 salon+0.003
 losers+0.003
 pose+0.003
 Dr+0.003
Neg logit contributions
 inside-0.003
 decided-0.003
 UFO-0.003
 generations-0.003
makers-0.002
Token here
Token ID994
Feature activation+7.19e-01

Pos logit contributions
 salon+0.006
ucker+0.006
 Fro+0.006
 Dr+0.006
 losers+0.006
Neg logit contributions
 decided-0.006
 inside-0.006
 should-0.005
 soon-0.005
 even-0.005
Token a
Token ID257
Feature activation-1.07e-01

Pos logit contributions
 salon+0.014
 Dr+0.013
 pose+0.013
ucker+0.013
 yoga+0.012
Neg logit contributions
 decided-0.018
 UFO-0.017
 inside-0.016
col-0.016
 mainland-0.016
Token minute
Token ID5664
Feature activation-6.75e-02

Pos logit contributions
 decided+0.003
ered+0.003
lling+0.003
lled+0.003
keeping+0.003
Neg logit contributions
 salon-0.002
 yoga-0.002
 mat-0.002
 pose-0.002
 cookie-0.002
Token why
Token ID1521
Feature activation-4.10e-01

Pos logit contributions
 salon+0.016
 Fro+0.015
 losers+0.015
ucker+0.015
 Dr+0.015
Neg logit contributions
 decided-0.014
 generations-0.013
 inside-0.013
 generation-0.012
 soon-0.012
Token did
Token ID750
Feature activation+1.76e+00

Pos logit contributions
 decided+0.008
 inside+0.007
 should+0.007
 even+0.006
 soon+0.006
Neg logit contributions
 salon-0.012
ucker-0.011
ips-0.011
ags-0.011
 Mara-0.011
Token you
Token ID345
Feature activation+5.21e-02

Pos logit contributions
ucker+0.059
ips+0.057
ags+0.056
 salon+0.055
eh+0.052
Neg logit contributions
 decided-0.028
 should-0.027
 inside-0.026
 soon-0.023
 even-0.023
Token do
Token ID466
Feature activation+5.15e-02

Pos logit contributions
 salon+0.001
ucker+0.001
 Fro+0.001
 pose+0.001
 losers+0.001
Neg logit contributions
 decided-0.001
 inside-0.001
 should-0.001
 soon-0.001
makers-0.001
Token that
Token ID326
Feature activation-1.63e-01

Pos logit contributions
ucker+0.001
 salon+0.001
 Fro+0.001
ips+0.001
ittens+0.001
Neg logit contributions
 decided-0.001
 inside-0.001
 should-0.001
 even-0.001
 soon-0.001
Token?
Token ID30
Feature activation+1.21e+00

Pos logit contributions
 decided+0.004
 inside+0.003
 should+0.003
makers+0.003
 soon+0.003
Neg logit contributions
 salon-0.004
ucker-0.004
 Fro-0.003
 Dr-0.003
ips-0.003
Token You
Token ID921
Feature activation+1.11e-01

Pos logit contributions
 Dr+0.020
ucker+0.019
 My+0.019
 salon+0.018
 Fro+0.018
Neg logit contributions
 decided-0.035
 inside-0.034
 mainland-0.032
 soon-0.032
 even-0.032
Token shouldn
Token ID6584
Feature activation-5.34e-01

Pos logit contributions
ucker+0.003
 losers+0.003
 salon+0.003
 Fro+0.002
 pose+0.002
Neg logit contributions
 inside-0.002
 decided-0.002
 generations-0.002
 generation-0.002
makers-0.002
Token't
Token ID470
Feature activation-5.71e-01

Pos logit contributions
 decided+0.012
 inside+0.012
 should+0.011
 even+0.011
 soon+0.011
Neg logit contributions
ucker-0.012
 salon-0.012
ips-0.011
ittens-0.011
ags-0.011
Token disturb
Token ID17037
Feature activation+1.54e-01

Pos logit contributions
year+0.014
 generations+0.014
 decided+0.014
 inside+0.014
 rooft+0.013
Neg logit contributions
 pose-0.010
 salon-0.010
ucker-0.010
 losers-0.010
 Fro-0.010
Token things
Token ID1243
Feature activation-4.39e-01

Pos logit contributions
 salon+0.003
 Leo+0.002
 Dr+0.002
 your+0.002
 my+0.002
Neg logit contributions
 decided-0.004
year-0.004
 UFO-0.004
makers-0.004
 inside-0.004
Token who
Token ID508
Feature activation-8.13e-01

Pos logit contributions
 decided+0.017
 UFO+0.017
 mainland+0.017
 lighthouse+0.016
col+0.016
Neg logit contributions
 Dr-0.011
 Leo-0.011
 Fro-0.010
 Luna-0.010
 Mara-0.009
Token he
Token ID339
Feature activation-3.33e-01

Pos logit contributions
 mainland+0.015
 inside+0.014
 decided+0.014
makers+0.014
 rooft+0.014
Neg logit contributions
 Dr-0.020
 Leo-0.020
 Fro-0.020
 salon-0.019
 Luna-0.019
Token was
Token ID373
Feature activation+1.38e-01

Pos logit contributions
 UFO+0.006
 generation+0.005
 rooft+0.005
keeping+0.005
 mainland+0.005
Neg logit contributions
 Fro-0.009
 Leo-0.008
ucker-0.008
 Dr-0.008
 Luna-0.008
Token.
Token ID13
Feature activation-4.91e-01

Pos logit contributions
 Fro+0.003
 Dr+0.003
 salon+0.003
 Leo+0.003
ucker+0.003
Neg logit contributions
 decided-0.003
 UFO-0.003
col-0.003
 should-0.003
 generations-0.003
Token
Token ID198
Feature activation+2.67e-01

Pos logit contributions
 decided+0.012
 inside+0.011
 should+0.011
 soon+0.010
 even+0.010
Neg logit contributions
 salon-0.011
ucker-0.011
ittens-0.010
 elbow-0.010
ags-0.010
Token
Token ID198
Feature activation+5.77e-02

Pos logit contributions
 salon+0.008
ucker+0.008
ips+0.008
 losers+0.008
 elbow+0.008
Neg logit contributions
 decided-0.004
 inside-0.004
 should-0.003
 soon-0.003
 even-0.003
TokenThe
Token ID464
Feature activation-5.56e-02

Pos logit contributions
ucker+0.002
 salon+0.002
 elbow+0.002
ips+0.002
ags+0.002
Neg logit contributions
 decided-0.001
 inside-0.001
 soon-0.001
 should-0.001
 even-0.001
Token mouse
Token ID10211
Feature activation-1.11e+00

Pos logit contributions
 decided+0.001
 inside+0.001
col+0.001
 soon+0.001
 mainland+0.001
Neg logit contributions
 Luna-0.001
ilo-0.001
 salon-0.001
 Dr-0.001
 pose-0.001
Token replied
Token ID8712
Feature activation-4.53e-01

Pos logit contributions
makers+0.024
keeping+0.023
 inside+0.023
 generation+0.023
 rooft+0.023
Neg logit contributions
 Fro-0.024
 Leo-0.023
ucker-0.023
 salon-0.023
 pose-0.023
Token,
Token ID11
Feature activation-5.79e-02

Pos logit contributions
 decided+0.010
 inside+0.010
 UFO+0.009
 should+0.009
 mainland+0.009
Neg logit contributions
 salon-0.010
 Fro-0.010
ucker-0.010
 Dr-0.009
 Mara-0.009
Token "
Token ID366
Feature activation-2.52e-01

Pos logit contributions
 decided+0.002
 UFO+0.002
keeping+0.002
 inside+0.002
 mainland+0.002
Neg logit contributions
 Dr-0.001
 Fro-0.001
 Leo-0.001
 Luna-0.001
 Mara-0.001
Token be
Token ID307
Feature activation+1.07e-01

Pos logit contributions
 salon+0.022
ucker+0.021
 Fro+0.020
 pose+0.020
ips+0.020
Neg logit contributions
 decided-0.023
 inside-0.022
 should-0.020
makers-0.019
 UFO-0.019
Token part
Token ID636
Feature activation+1.49e+00

Pos logit contributions
 salon+0.003
ucker+0.003
 Fro+0.002
ips+0.002
 pose+0.002
Neg logit contributions
 decided-0.002
 inside-0.002
 should-0.002
 soon-0.002
 even-0.002
Token of
Token ID286
Feature activation+1.55e-01

Pos logit contributions
 salon+0.035
ucker+0.035
 Fro+0.033
ips+0.032
 pose+0.032
Neg logit contributions
 decided-0.033
 inside-0.031
 should-0.028
 soon-0.027
 even-0.027
Token the
Token ID262
Feature activation-1.85e-01

Pos logit contributions
 Fro+0.002
 yoga+0.002
 Dr+0.002
 Leo+0.002
 salon+0.002
Neg logit contributions
 decided-0.005
 UFO-0.004
makers-0.004
 inside-0.004
 mainland-0.004
Token perfect
Token ID2818
Feature activation-3.49e-01

Pos logit contributions
 decided+0.005
year+0.004
makers+0.004
 inside+0.004
esh+0.004
Neg logit contributions
 salon-0.004
 yoga-0.004
 fle-0.004
 pose-0.004
 mat-0.004
Token parade
Token ID16134
Feature activation+1.17e-01

Pos logit contributions
 decided+0.008
 should+0.008
makers+0.007
year+0.007
 UFO+0.007
Neg logit contributions
 salon-0.008
 pose-0.008
 mat-0.008
 yoga-0.007
 elbow-0.007
Token.
Token ID13
Feature activation+6.64e-03

Pos logit contributions
 salon+0.003
 pose+0.003
 losers+0.003
 Dr+0.003
 Fro+0.003
Neg logit contributions
 decided-0.002
 UFO-0.002
 inside-0.002
 mainland-0.002
 should-0.002
Token He
Token ID679
Feature activation-1.81e-01

Pos logit contributions
 salon+0.000
ucker+0.000
 Fro+0.000
ips+0.000
 Dr+0.000
Neg logit contributions
 decided-0.000
 inside-0.000
 should-0.000
 soon-0.000
 even-0.000
Token waved
Token ID26834
Feature activation-5.01e-01

Pos logit contributions
makers+0.003
 inside+0.003
 UFO+0.003
 generation+0.003
 mainland+0.003
Neg logit contributions
 salon-0.005
 Fro-0.004
 losers-0.004
 Mara-0.004
ucker-0.004
Token goodbye
Token ID24829
Feature activation+3.43e-01

Pos logit contributions
 decided+0.012
 inside+0.012
 should+0.010
 soon+0.010
 even+0.010
Neg logit contributions
 salon-0.011
ucker-0.011
 Fro-0.010
ips-0.010
ittens-0.010
Token to
Token ID284
Feature activation-3.53e-01

Pos logit contributions
 salon+0.006
 Fro+0.006
ucker+0.006
 Leo+0.005
 Mara+0.005
Neg logit contributions
 decided-0.009
 should-0.009
 UFO-0.009
makers-0.008
 generations-0.008
Token his
Token ID465
Feature activation-1.58e-01

Pos logit contributions
ucker+0.023
 salon+0.022
 Fro+0.020
ittens+0.020
 Mara+0.020
Neg logit contributions
 decided-0.023
 inside-0.023
 even-0.020
 soon-0.020
 should-0.019
Token sc
Token ID629
Feature activation+7.59e-01

Pos logit contributions
 decided+0.004
 inside+0.003
 should+0.003
 soon+0.003
 even+0.003
Neg logit contributions
 salon-0.004
ucker-0.004
 Fro-0.003
ips-0.003
 Mara-0.003
Tokenooter
Token ID25141
Feature activation+9.51e-01

Pos logit contributions
ucker+0.017
 Mara+0.016
 salon+0.016
 Dr+0.016
 Velvet+0.016
Neg logit contributions
 decided-0.017
 inside-0.017
 even-0.015
 should-0.014
 soon-0.014
Token and
Token ID290
Feature activation-3.08e-01

Pos logit contributions
ucker+0.029
 salon+0.028
 Fro+0.027
 Mara+0.027
ips+0.027
Neg logit contributions
 decided-0.015
 inside-0.014
 even-0.012
 soon-0.012
 should-0.012
Token hurt
Token ID5938
Feature activation-4.26e-01

Pos logit contributions
 decided+0.006
 inside+0.006
makers+0.006
 should+0.006
 generations+0.006
Neg logit contributions
 salon-0.007
ucker-0.007
 Fro-0.007
 Dr-0.007
 elbow-0.007
Token his
Token ID465
Feature activation-3.35e-01

Pos logit contributions
 decided+0.012
 inside+0.012
 should+0.011
makers+0.011
 UFO+0.011
Neg logit contributions
 Fro-0.007
 salon-0.006
 Dr-0.006
 Mara-0.006
ucker-0.006
Token knee
Token ID10329
Feature activation-1.82e-01

Pos logit contributions
 decided+0.010
 inside+0.009
 should+0.009
 soon+0.009
makers+0.009
Neg logit contributions
 salon-0.006
ucker-0.006
 Fro-0.005
ittens-0.005
ips-0.005
Token.
Token ID13
Feature activation-2.95e-01

Pos logit contributions
 decided+0.003
 UFO+0.003
 inside+0.003
 mainland+0.003
 generations+0.003
Neg logit contributions
 salon-0.005
ucker-0.005
 pose-0.005
 elbow-0.005
ittens-0.005
Token A
Token ID317
Feature activation-4.43e-01

Pos logit contributions
 decided+0.007
 inside+0.007
 should+0.006
 even+0.006
 soon+0.006
Neg logit contributions
 salon-0.006
ucker-0.006
ips-0.006
 pose-0.006
 Fro-0.006
Token little
Token ID1310
Feature activation+8.07e-01

Pos logit contributions
 decided+0.010
 inside+0.010
makers+0.009
 soon+0.008
col+0.008
Neg logit contributions
 salon-0.011
ucker-0.010
 Fro-0.010
 pose-0.010
 Dr-0.009
Token girl
Token ID2576
Feature activation-4.52e-01

Pos logit contributions
 salon+0.020
ucker+0.020
 Fro+0.019
 pose+0.018
 Dr+0.018
Neg logit contributions
 decided-0.017
 inside-0.016
 should-0.014
 soon-0.014
 even-0.014
Token had
Token ID550
Feature activation+6.14e-01

Pos logit contributions
 inside+0.002
 UFO+0.002
 decided+0.002
makers+0.002
 mainland+0.002
Neg logit contributions
ucker-0.003
 salon-0.003
 Fro-0.003
 Mara-0.003
ilo-0.003
Token learned
Token ID4499
Feature activation+2.47e-01

Pos logit contributions
 salon+0.016
ucker+0.016
 Fro+0.015
ips+0.015
ittens+0.015
Neg logit contributions
 decided-0.012
 inside-0.012
 should-0.010
 soon-0.010
 even-0.010
Token an
Token ID281
Feature activation-1.38e+00

Pos logit contributions
 salon+0.005
ucker+0.005
 yoga+0.005
 Dr+0.005
 Fro+0.005
Neg logit contributions
 decided-0.006
 inside-0.006
makers-0.005
 UFO-0.005
 mainland-0.005
Token important
Token ID1593
Feature activation-1.16e+00

Pos logit contributions
 decided+0.025
 inside+0.023
keeping+0.023
makers+0.022
 soon+0.020
Neg logit contributions
 losers-0.034
ags-0.034
 yoga-0.034
 allergic-0.033
 salon-0.033
Token lesson
Token ID11483
Feature activation-8.06e-01

Pos logit contributions
 decided+0.021
 inside+0.018
 mainland+0.017
 rooft+0.016
 soon+0.016
Neg logit contributions
 yoga-0.032
 mat-0.032
 pose-0.032
 salon-0.031
ittens-0.029
Token,
Token ID11
Feature activation-7.25e-01

Pos logit contributions
 decided+0.016
 UFO+0.015
 mainland+0.015
 rooft+0.014
 lighthouse+0.014
Neg logit contributions
 salon-0.018
 Dr-0.018
 yoga-0.017
 Fro-0.017
 Leo-0.017
Token that
Token ID326
Feature activation-1.07e+00

Pos logit contributions
 decided+0.015
 inside+0.014
makers+0.014
 UFO+0.014
 mainland+0.013
Neg logit contributions
 salon-0.016
 losers-0.016
 Fro-0.016
 Dr-0.016
ucker-0.016
Token it
Token ID340
Feature activation+5.58e-01

Pos logit contributions
 decided+0.018
 mainland+0.018
 rooft+0.018
 UFO+0.018
 inside+0.018
Neg logit contributions
 yoga-0.026
 salon-0.025
 pose-0.025
 Dr-0.025
 Fro-0.025
Token's
Token ID338
Feature activation-7.50e-01

Pos logit contributions
ucker+0.013
 salon+0.013
ips+0.012
 Fro+0.012
 losers+0.012
Neg logit contributions
 decided-0.012
 inside-0.011
 UFO-0.010
 soon-0.010
makers-0.010
Token always
Token ID1464
Feature activation+3.12e-01

Pos logit contributions
 UFO+0.016
col+0.015
makers+0.014
 lighthouse+0.014
 decided+0.014
Neg logit contributions
 Dr-0.017
 your-0.017
 my-0.016
 yoga-0.016
 allergic-0.016
Token important
Token ID1593
Feature activation-5.24e-01

Pos logit contributions
ucker+0.007
 Dr+0.007
 pose+0.007
 losers+0.007
 yoga+0.007
Neg logit contributions
 decided-0.006
 UFO-0.006
makers-0.005
 inside-0.005
col-0.005
Token continued
Token ID3767
Feature activation+3.20e-01

Pos logit contributions
makers+0.007
 inside+0.007
 generations+0.007
 UFO+0.006
edient+0.006
Neg logit contributions
 Fro-0.007
 salon-0.007
 Leo-0.007
ucker-0.006
 Dr-0.006
Token to
Token ID284
Feature activation-9.35e-02

Pos logit contributions
 salon+0.006
ucker+0.006
 Fro+0.006
ips+0.006
 Dr+0.006
Neg logit contributions
 decided-0.008
 inside-0.008
 should-0.007
 soon-0.007
 even-0.007
Token play
Token ID711
Feature activation-6.52e-01

Pos logit contributions
makers+0.002
year+0.002
 generations+0.002
 decided+0.002
 inside+0.002
Neg logit contributions
 Fro-0.002
 pose-0.001
 Leo-0.001
 salon-0.001
 Dr-0.001
Token together
Token ID1978
Feature activation-4.27e-01

Pos logit contributions
 decided+0.011
 inside+0.009
 UFO+0.009
 should+0.009
makers+0.009
Neg logit contributions
 salon-0.018
ucker-0.018
ips-0.017
 Fro-0.017
ags-0.017
Token,
Token ID11
Feature activation+1.70e-01

Pos logit contributions
 decided+0.008
 UFO+0.008
makers+0.007
 should+0.007
 mainland+0.007
Neg logit contributions
 salon-0.011
 pose-0.010
ips-0.010
 yoga-0.010
 Fro-0.010
Token happy
Token ID3772
Feature activation-3.05e-02

Pos logit contributions
ucker+0.004
 salon+0.004
 Fro+0.004
ips+0.004
ittens+0.004
Neg logit contributions
 decided-0.004
 inside-0.003
 should-0.003
 even-0.003
 soon-0.003
Token to
Token ID284
Feature activation+1.79e-01

Pos logit contributions
 decided+0.001
 inside+0.001
 UFO+0.001
 should+0.001
gl+0.001
Neg logit contributions
ucker-0.001
 Fro-0.001
 salon-0.001
 Leo-0.001
 Dr-0.001
Token have
Token ID423
Feature activation+8.78e-01

Pos logit contributions
 Fro+0.003
 pose+0.003
 losers+0.003
 salon+0.003
 Luna+0.003
Neg logit contributions
 inside-0.005
 decided-0.005
col-0.004
year-0.004
makers-0.004
Token each
Token ID1123
Feature activation+4.98e-01

Pos logit contributions
ucker+0.017
 salon+0.017
 Fro+0.016
ips+0.016
 Dr+0.016
Neg logit contributions
 decided-0.022
 inside-0.021
 should-0.019
 UFO-0.019
makers-0.019
Token other
Token ID584
Feature activation-4.57e-01

Pos logit contributions
 salon+0.014
ucker+0.014
 Fro+0.014
ips+0.013
 Dr+0.013
Neg logit contributions
 decided-0.009
 inside-0.008
 should-0.007
 soon-0.007
 even-0.007
Token as
Token ID355
Feature activation+4.54e-01

Pos logit contributions
 decided+0.009
 UFO+0.009
 inside+0.008
makers+0.008
 should+0.008
Neg logit contributions
 salon-0.011
ucker-0.010
 Fro-0.010
 pose-0.010
 losers-0.010
Token a
Token ID257
Feature activation+4.31e-01

Pos logit contributions
 inside+0.013
 decided+0.013
 even+0.011
 soon+0.010
 should+0.010
Neg logit contributions
 salon-0.023
ucker-0.023
 pose-0.021
ips-0.021
 losers-0.020
Token kind
Token ID1611
Feature activation-6.37e-02

Pos logit contributions
 salon+0.011
ucker+0.011
 Fro+0.011
 Dr+0.010
ittens+0.010
Neg logit contributions
 decided-0.008
 inside-0.008
 should-0.007
 soon-0.007
 even-0.007
Token mail
Token ID6920
Feature activation+2.98e-01

Pos logit contributions
 decided+0.001
 inside+0.001
 even+0.001
 should+0.001
 soon+0.001
Neg logit contributions
 salon-0.002
ucker-0.002
ips-0.002
 pose-0.001
 Fro-0.001
Tokenman
Token ID805
Feature activation-3.96e-01

Pos logit contributions
 salon+0.008
ucker+0.008
 Fro+0.007
 Dr+0.007
ips+0.007
Neg logit contributions
 decided-0.006
 inside-0.006
 should-0.005
 soon-0.005
 even-0.005
Token named
Token ID3706
Feature activation-2.56e-01

Pos logit contributions
 decided+0.007
 inside+0.007
 should+0.006
 UFO+0.006
makers+0.006
Neg logit contributions
 salon-0.011
ucker-0.011
 Fro-0.010
 Dr-0.010
ips-0.010
Token Max
Token ID5436
Feature activation-2.15e+00

Pos logit contributions
 decided+0.007
 inside+0.006
 should+0.006
 soon+0.006
 even+0.006
Neg logit contributions
 salon-0.005
ucker-0.005
 Fro-0.005
ips-0.005
ittens-0.005
Token.
Token ID13
Feature activation+6.18e-01

Pos logit contributions
 decided+0.029
 inside+0.027
 should+0.022
 soon+0.021
 even+0.021
Neg logit contributions
 salon-0.069
ucker-0.069
 Fro-0.066
 Dr-0.065
ips-0.064
Token Max
Token ID5436
Feature activation-2.26e+00

Pos logit contributions
 salon+0.012
ucker+0.011
ips+0.010
 Mara+0.010
eh+0.010
Neg logit contributions
 decided-0.017
 inside-0.017
 should-0.016
 soon-0.016
 even-0.015
Token had
Token ID550
Feature activation+2.27e-02

Pos logit contributions
 decided+0.030
 inside+0.029
 UFO+0.027
makers+0.026
col+0.026
Neg logit contributions
ucker-0.065
 salon-0.065
 Dr-0.064
 Fro-0.063
 Velvet-0.062
Token a
Token ID257
Feature activation-1.54e-01

Pos logit contributions
 salon+0.001
ucker+0.001
 Fro+0.001
ips+0.001
ittens+0.001
Neg logit contributions
 decided-0.000
 inside-0.000
 should-0.000
 soon-0.000
 even-0.000
Token big
Token ID1263
Feature activation-1.42e-01

Pos logit contributions
 decided+0.004
 inside+0.003
makers+0.003
 soon+0.003
 should+0.003
Neg logit contributions
 salon-0.004
ucker-0.004
 Fro-0.003
 Dr-0.003
ittens-0.003
Token man
Token ID582
Feature activation+1.13e+00

Pos logit contributions
 decided+0.006
 should+0.006
 inside+0.006
pt+0.006
makers+0.005
Neg logit contributions
 salon-0.005
 losers-0.004
 toast-0.004
 yoga-0.004
 mat-0.004
Token and
Token ID290
Feature activation+5.80e-01

Pos logit contributions
 salon+0.028
ucker+0.025
 Dr+0.025
 pose+0.025
 Fro+0.024
Neg logit contributions
 UFO-0.022
makers-0.022
 inside-0.021
Mill-0.020
 mainland-0.020
Token woman
Token ID2415
Feature activation+3.62e-01

Pos logit contributions
 salon+0.010
ucker+0.009
 Fro+0.009
 Dr+0.009
 Mara+0.009
Neg logit contributions
 decided-0.016
 inside-0.015
makers-0.014
 should-0.014
 UFO-0.014
Token shared
Token ID4888
Feature activation-5.56e-01

Pos logit contributions
 salon+0.009
 pose+0.007
 Dr+0.007
 Leo+0.007
ucker+0.007
Neg logit contributions
 UFO-0.008
makers-0.007
arp-0.007
 mainland-0.007
 generations-0.007
Token their
Token ID511
Feature activation-3.78e-01

Pos logit contributions
 decided+0.016
 UFO+0.016
makers+0.015
edient+0.015
inct+0.014
Neg logit contributions
 Leo-0.009
 toast-0.009
 yoga-0.009
 Dr-0.009
 Luna-0.008
Token first
Token ID717
Feature activation-2.00e+00

Pos logit contributions
 decided+0.010
 inside+0.009
makers+0.009
 should+0.009
 soon+0.009
Neg logit contributions
 toast-0.007
 salon-0.007
 yoga-0.007
 mat-0.007
ags-0.007
Token kiss
Token ID9245
Feature activation-2.28e-01

Pos logit contributions
 decided+0.049
 inside+0.044
makers+0.043
 should+0.042
 UFO+0.042
Neg logit contributions
 salon-0.043
 mat-0.041
 pose-0.041
 toast-0.040
 yoga-0.039
Token,
Token ID11
Feature activation-8.68e-01

Pos logit contributions
 decided+0.004
 UFO+0.004
 mainland+0.004
 inside+0.004
makers+0.004
Neg logit contributions
 salon-0.006
 pose-0.005
ips-0.005
ucker-0.005
 Fro-0.005
Token the
Token ID262
Feature activation-5.25e-01

Pos logit contributions
 decided+0.021
 inside+0.021
 UFO+0.020
makers+0.019
 should+0.019
Neg logit contributions
 salon-0.017
 Dr-0.017
 Fro-0.016
 Mara-0.016
ucker-0.016
Token guests
Token ID10650
Feature activation-9.59e-01

Pos logit contributions
 decided+0.013
 inside+0.013
 should+0.012
pt+0.012
makers+0.012
Neg logit contributions
 salon-0.011
 toast-0.009
 Dr-0.009
 mat-0.009
 losers-0.009
Token all
Token ID477
Feature activation+2.89e-01

Pos logit contributions
 UFO+0.019
 generation+0.018
 mainland+0.018
 rooft+0.018
inct+0.018
Neg logit contributions
 salon-0.022
ucker-0.020
ips-0.019
 pose-0.019
 losers-0.019
Token was
Token ID373
Feature activation-4.55e-01

Pos logit contributions
ucker+0.001
 salon+0.001
ags+0.001
 losers+0.001
ips+0.001
Neg logit contributions
 decided-0.000
 inside-0.000
 should-0.000
 even-0.000
 soon-0.000
Token a
Token ID257
Feature activation+6.01e-01

Pos logit contributions
 decided+0.009
 inside+0.008
 even+0.007
 soon+0.007
 should+0.007
Neg logit contributions
ucker-0.013
 salon-0.013
ips-0.012
 pose-0.012
ittens-0.012
Token little
Token ID1310
Feature activation+1.18e+00

Pos logit contributions
 salon+0.015
ucker+0.015
 Fro+0.014
 Dr+0.014
ittens+0.014
Neg logit contributions
 decided-0.012
 inside-0.011
 should-0.010
makers-0.010
 soon-0.010
Token bunny
Token ID44915
Feature activation-1.10e+00

Pos logit contributions
 salon+0.029
ucker+0.029
 Fro+0.027
ips+0.026
ittens+0.026
Neg logit contributions
 decided-0.025
 inside-0.024
 should-0.022
 soon-0.021
 even-0.021
Token named
Token ID3706
Feature activation-3.12e-01

Pos logit contributions
 decided+0.022
 inside+0.021
 UFO+0.019
makers+0.019
 should+0.018
Neg logit contributions
ucker-0.027
 salon-0.027
 Fro-0.026
ittens-0.025
eh-0.025
Token Benny
Token ID44275
Feature activation-2.20e+00

Pos logit contributions
 inside+0.006
 decided+0.006
 soon+0.005
 even+0.005
 should+0.005
Neg logit contributions
 salon-0.009
ucker-0.009
ips-0.008
ags-0.008
 pose-0.008
Token.
Token ID13
Feature activation+1.51e-01

Pos logit contributions
 decided+0.031
 inside+0.029
 UFO+0.024
 should+0.024
makers+0.023
Neg logit contributions
ucker-0.068
 salon-0.067
 Fro-0.065
 Dr-0.063
ittens-0.063
Token Benny
Token ID44275
Feature activation-1.90e+00

Pos logit contributions
 salon+0.005
ucker+0.005
ips+0.005
 pose+0.004
 Mara+0.004
Neg logit contributions
 decided-0.002
 inside-0.002
 should-0.002
 even-0.002
 soon-0.002
Token loved
Token ID6151
Feature activation+1.14e+00

Pos logit contributions
 decided+0.026
 inside+0.025
makers+0.020
 UFO+0.020
 soon+0.020
Neg logit contributions
ucker-0.059
 salon-0.058
 Fro-0.056
 Dr-0.055
ittens-0.055
Token to
Token ID284
Feature activation+7.71e-01

Pos logit contributions
 salon+0.036
ucker+0.035
ips+0.033
 Mara+0.032
ags+0.031
Neg logit contributions
 inside-0.020
 decided-0.020
 even-0.017
 should-0.016
 next-0.016
Token bounce
Token ID20110
Feature activation-7.27e-02

Pos logit contributions
ucker+0.022
 salon+0.022
ips+0.021
inches+0.019
 losers+0.019
Neg logit contributions
 decided-0.015
 inside-0.014
 even-0.013
 soon-0.012
 should-0.012
Token getting
Token ID1972
Feature activation-1.45e+00

Pos logit contributions
 decided+0.019
 UFO+0.017
makers+0.017
 generation+0.017
arp+0.017
Neg logit contributions
ucker-0.015
 losers-0.015
 pose-0.015
 salon-0.015
eh-0.015
Token more
Token ID517
Feature activation-2.94e-01

Pos logit contributions
 generation+0.036
makers+0.035
keeping+0.035
 UFO+0.034
 generations+0.034
Neg logit contributions
 Dr-0.025
ucker-0.024
 losers-0.024
 Fro-0.023
 salon-0.022
Token terrible
Token ID7818
Feature activation-1.39e+00

Pos logit contributions
 UFO+0.007
 decided+0.007
 generation+0.006
year+0.006
makers+0.006
Neg logit contributions
 toast-0.005
ucker-0.005
 allergic-0.005
 slips-0.005
 pose-0.005
Token by
Token ID416
Feature activation-1.12e+00

Pos logit contributions
 decided+0.030
year+0.029
 UFO+0.028
Mom+0.026
 moral+0.026
Neg logit contributions
 pose-0.029
 yoga-0.028
 mat-0.028
 toast-0.028
 elbow-0.027
Token the
Token ID262
Feature activation-7.90e-01

Pos logit contributions
 decided+0.029
 UFO+0.027
 should+0.026
 mainland+0.026
year+0.025
Neg logit contributions
 Leo-0.021
 Fro-0.021
 pose-0.019
 yoga-0.019
ips-0.019
Token second
Token ID1218
Feature activation-2.53e+00

Pos logit contributions
year+0.023
 decided+0.022
 vowed+0.019
rel+0.019
Mill+0.019
Neg logit contributions
 mat-0.012
 salon-0.012
 losers-0.012
 fle-0.011
 pose-0.011
Token.
Token ID13
Feature activation-5.59e-01

Pos logit contributions
 decided+0.062
year+0.060
 should+0.053
 inside+0.052
Would+0.052
Neg logit contributions
 pose-0.051
 mat-0.050
 salon-0.048
 tongue-0.045
ilo-0.045
Token Tony
Token ID8832
Feature activation-7.84e-01

Pos logit contributions
 decided+0.011
 inside+0.010
 should+0.009
 soon+0.009
 even+0.009
Neg logit contributions
 salon-0.015
ucker-0.015
ittens-0.014
ips-0.014
 losers-0.014
Token looked
Token ID3114
Feature activation-7.65e-01

Pos logit contributions
 inside+0.014
 UFO+0.014
makers+0.013
 generation+0.013
 mainland+0.013
Neg logit contributions
 salon-0.020
ucker-0.019
 pose-0.019
 Mara-0.018
 Fro-0.018
Token around
Token ID1088
Feature activation+5.00e-01

Pos logit contributions
 decided+0.016
 generations+0.015
keeping+0.015
makers+0.015
 generation+0.015
Neg logit contributions
ucker-0.016
 salon-0.016
 Dr-0.016
 Fro-0.015
 losers-0.015
Token for
Token ID329
Feature activation-2.08e-01

Pos logit contributions
 Fro+0.009
 Leo+0.009
 Dr+0.009
 yoga+0.009
 salon+0.009
Neg logit contributions
 decided-0.012
 UFO-0.012
makers-0.011
 should-0.011
 mainland-0.011
Token in
Token ID287
Feature activation-5.89e-03

Pos logit contributions
 inside+0.002
 decided+0.001
 even+0.001
 soon+0.001
 next+0.001
Neg logit contributions
ucker-0.005
 salon-0.004
 Mara-0.004
 Fro-0.004
ips-0.004
Token the
Token ID262
Feature activation+2.09e-01

Pos logit contributions
 inside+0.000
 decided+0.000
 even+0.000
 next+0.000
 soon+0.000
Neg logit contributions
ucker-0.000
inches-0.000
 losers-0.000
 salon-0.000
ittens-0.000
Token park
Token ID3952
Feature activation+4.72e-01

Pos logit contributions
ucker+0.005
 salon+0.005
 Fro+0.005
ips+0.005
ittens+0.005
Neg logit contributions
 decided-0.004
 inside-0.004
 should-0.004
 even-0.004
 soon-0.004
Token with
Token ID351
Feature activation-6.79e-01

Pos logit contributions
ucker+0.014
 salon+0.014
 Fro+0.013
ips+0.013
ittens+0.013
Neg logit contributions
 decided-0.008
 inside-0.007
 should-0.006
 soon-0.006
 even-0.006
Token her
Token ID607
Feature activation-4.22e-01

Pos logit contributions
 inside+0.016
 decided+0.016
 even+0.015
 soon+0.013
 next+0.013
Neg logit contributions
ucker-0.018
 salon-0.017
 losers-0.016
 pose-0.015
pill-0.015
Token friends
Token ID2460
Feature activation-2.17e+00

Pos logit contributions
 inside+0.008
 decided+0.008
 should+0.007
 even+0.007
 soon+0.007
Neg logit contributions
 salon-0.011
ucker-0.011
 losers-0.010
ips-0.010
 pose-0.010
Token.
Token ID13
Feature activation-4.42e-01

Pos logit contributions
 decided+0.036
 UFO+0.032
 inside+0.031
 mainland+0.030
makers+0.030
Neg logit contributions
 salon-0.061
ucker-0.059
ips-0.057
 Fro-0.057
ittens-0.056
Token One
Token ID1881
Feature activation+5.73e-02

Pos logit contributions
 decided+0.006
 inside+0.006
 even+0.005
 should+0.005
 soon+0.005
Neg logit contributions
ucker-0.014
 salon-0.014
 losers-0.013
 elbow-0.013
 pose-0.013
Token day
Token ID1110
Feature activation+7.41e-01

Pos logit contributions
ucker+0.002
 salon+0.002
ips+0.002
 losers+0.002
 elbow+0.002
Neg logit contributions
 decided-0.001
 inside-0.001
 soon-0.000
 even-0.000
 should-0.000
Token,
Token ID11
Feature activation-4.12e-01

Pos logit contributions
 salon+0.026
ucker+0.025
 losers+0.025
ags+0.025
 pose+0.024
Neg logit contributions
 inside-0.011
 decided-0.010
 soon-0.008
 even-0.007
 should-0.007
Token her
Token ID607
Feature activation+1.48e-02

Pos logit contributions
 decided+0.008
 inside+0.008
 soon+0.007
 even+0.006
 next+0.006
Neg logit contributions
ucker-0.013
ags-0.013
ips-0.013
 losers-0.013
oco-0.012