TOP ACTIVATIONS
MAX = 3.735

who lived in his cozy bur row . He loved to
gas ps . " What have you done , children ?
and shouted , " What have you done , you naughty
and asked , " What have you seen , John ?"
the garden . " What have you found , Mo ira
sheep asked , " What have we got to feed it
friendly smile . " What have you come to do here
. " What have you done ?" he said
Rabbit asked , " What have you got there , Jude
him and asked " What have you got there Joe ?".
. They had stopped qu arre ling . To celebrate ,
. " What have you done ?" Mom asked
" Oh no ! What have we done ?" Sara cried
and Lily . " What have you done ? This is
What â s up there ? â
. " What have you done ?" she yelled
them asked , " Where have you been ?" The squirrel
together . They never qu arre led again . <|endoftext|> Once
spot and said " What have you done , little girl
to him , " What have you got there ?" The

MIN ACTIVATIONS
MIN = -3.378

and it was the y umm iest steak she ever tasted
and it was the y umm iest thing in the world
and it was the y umm iest dinner they ever had
as it was the y umm iest cake she had ever
adventure and found the y umm iest berries they had ever
and it was the y umm iest ice cream she ever
said it was the y umm iest ice - cream they
store and got the y umm iest ice cream they could
! It was the y umm iest stew ever !
magic apple was the y umm iest thing they ever tasted
. It was the y umm iest thing she ever tasted
This is the y umm iest surprise ever ! â
the music was even y umm ier than before . <|endoftext|>
. It was the y umm iest thing he ever tasted
little boy had the y umm iest treat . He asked
the bear found the y umm iest cookie ever . He
made him an even y umm ier snack and they both
oil , and the y umm iest spaghetti ever !
top to make it y umm ier . Sarah was still
garden , and picked the ju ic iest carrots he could

INTERVAL 1.957 - 3.735
CONTAINS 0.023%

came by . " What 's wrong ?" he asked .
and asked , " What 's an earthquake ?" Her mom
, " Hello , why are you here ?" The squirrel
and asked , " What 's wrong , Tim my ?"
" Mom my , what 's this ?" Tim my asked

INTERVAL 0.179 - 1.957
CONTAINS 32.494%

, the tree continued to grow , and it became a
, there lived a little rabbit called Clara . She was
She smiled and hugged Lily . " Don 't
. The superhero had rescued the caul iflower and
play there . One sunny day , a little girl named

INTERVAL -1.600 - 0.179
CONTAINS 67.449%

was so sad and he felt like all his work was
would play with his friends in the snow . They would
very nice . He told Jane that she needed to eat
. He was so excited to share what he had learned
! <|endoftext|> Mom my was getting ready to bake something special

INTERVAL -3.378 - -1.600
CONTAINS 0.033%

going to have a special birthday party with cakes , games
a great job measuring her idea . The little
followed her mom to the cash ier to buy a toy
this for my friend 's birthday ?" Her mom said ,
shelf and went to the checkout . They paid for it
Token who
Token ID508
Feature activation-7.07e-02

Pos logit contributions
 our+0.024
 enough+0.023
 ourselves+0.022
 another+0.022
 your+0.021
Neg logit contributions
cut-0.019
Me-0.019
 ad-0.019
year-0.018
umm-0.016
Token lived
Token ID5615
Feature activation+4.27e-01

Pos logit contributions
umm+0.003
Me+0.002
cut+0.002
bridge+0.002
 redd+0.002
Neg logit contributions
 another-0.003
 our-0.003
 enough-0.003
 your-0.003
 band-0.003
Token in
Token ID287
Feature activation+7.17e-01

Pos logit contributions
 our+0.015
 another+0.014
 enough+0.014
 your+0.013
 band+0.013
Neg logit contributions
Me-0.019
umm-0.019
cut-0.018
 ad-0.017
year-0.017
Token his
Token ID465
Feature activation+1.39e+00

Pos logit contributions
 our+0.012
 enough+0.011
ounce+0.011
ig+0.010
 band+0.010
Neg logit contributions
Me-0.045
cut-0.042
 ad-0.041
year-0.041
umm-0.040
Token cozy
Token ID37438
Feature activation+1.36e+00

Pos logit contributions
 enough+0.052
 our+0.051
ounce+0.050
 ourselves+0.048
 later+0.048
Neg logit contributions
Me-0.056
umm-0.049
 ad-0.048
cut-0.047
 capt-0.045
Token bur
Token ID4356
Feature activation+2.76e+00

Pos logit contributions
 our+0.038
 enough+0.038
ounce+0.036
 another+0.036
render+0.035
Neg logit contributions
Me-0.061
umm-0.060
cut-0.060
 ad-0.057
 distance-0.056
Tokenrow
Token ID808
Feature activation+7.24e-01

Pos logit contributions
 our+0.071
 enough+0.065
 another+0.064
 your+0.059
 band+0.058
Neg logit contributions
Me-0.150
cut-0.142
umm-0.142
 ad-0.139
year-0.139
Token.
Token ID13
Feature activation-3.10e-01

Pos logit contributions
 our+0.019
 enough+0.019
 another+0.018
 band+0.017
 your+0.016
Neg logit contributions
Me-0.039
umm-0.036
cut-0.035
year-0.035
 ad-0.035
Token He
Token ID679
Feature activation+7.20e-02

Pos logit contributions
Me+0.012
cut+0.011
umm+0.011
 ad+0.011
year+0.011
Neg logit contributions
 our-0.013
 another-0.012
 enough-0.012
 your-0.012
 band-0.011
Token loved
Token ID6151
Feature activation+6.40e-02

Pos logit contributions
 our+0.003
 another+0.003
 enough+0.003
 your+0.003
 band+0.003
Neg logit contributions
umm-0.002
Me-0.002
cut-0.002
year-0.002
 ad-0.002
Token to
Token ID284
Feature activation+4.59e-01

Pos logit contributions
 our+0.001
 enough+0.001
 another+0.001
 ourselves+0.001
ounce+0.001
Neg logit contributions
Me-0.004
cut-0.004
 ad-0.004
umm-0.003
year-0.003
Token gas
Token ID3623
Feature activation+4.24e-01

Pos logit contributions
ounce+0.009
ig+0.009
oppy+0.009
 our+0.009
 enough+0.009
Neg logit contributions
Me-0.011
 ad-0.010
cut-0.010
year-0.010
umm-0.010
Tokenps
Token ID862
Feature activation+2.77e-01

Pos logit contributions
 our+0.014
 another+0.013
oppy+0.013
 enough+0.013
 your+0.013
Neg logit contributions
cut-0.018
Me-0.018
 ad-0.017
umm-0.017
year-0.016
Token.
Token ID13
Feature activation+5.71e-01

Pos logit contributions
 our+0.008
oppy+0.007
 enough+0.007
 another+0.007
 ourselves+0.007
Neg logit contributions
Me-0.014
 ad-0.013
cut-0.013
year-0.012
umm-0.012
Token "
Token ID366
Feature activation+1.06e-01

Pos logit contributions
 another+0.018
 our+0.018
 enough+0.017
 your+0.017
 something+0.017
Neg logit contributions
cut-0.028
Me-0.027
umm-0.026
 ad-0.026
year-0.026
TokenWhat
Token ID2061
Feature activation+1.24e+00

Pos logit contributions
 our+0.005
 enough+0.005
 another+0.005
ounce+0.005
oppy+0.005
Neg logit contributions
Me-0.003
cut-0.003
year-0.003
umm-0.003
 ad-0.003
Token have
Token ID423
Feature activation+2.76e+00

Pos logit contributions
ounce+0.040
 scar+0.039
ig+0.039
 our+0.038
 enough+0.038
Neg logit contributions
Me-0.057
 ad-0.050
year-0.048
Here-0.045
vous-0.045
Token you
Token ID345
Feature activation+1.09e+00

Pos logit contributions
ounce+0.075
oppy+0.067
 wing+0.066
 loop+0.066
 scar+0.062
Neg logit contributions
Me-0.132
 ad-0.128
year-0.125
thank-0.124
Here-0.123
Token done
Token ID1760
Feature activation+1.13e+00

Pos logit contributions
 our+0.048
 enough+0.047
 another+0.047
ounce+0.046
 loop+0.045
Neg logit contributions
Me-0.035
 ad-0.033
year-0.032
thank-0.031
cut-0.031
Token,
Token ID11
Feature activation-2.26e-01

Pos logit contributions
 our+0.023
 another+0.022
 enough+0.021
 loop+0.020
 band+0.020
Neg logit contributions
Me-0.062
 ad-0.061
umm-0.061
cut-0.061
year-0.059
Token children
Token ID1751
Feature activation-3.18e-01

Pos logit contributions
Me+0.010
umm+0.010
 ad+0.009
cut+0.009
 redd+0.009
Neg logit contributions
ounce-0.007
 our-0.007
ig-0.007
 enough-0.006
 band-0.006
Token?
Token ID30
Feature activation+8.54e-01

Pos logit contributions
Me+0.019
cut+0.018
 ad+0.018
year+0.018
umm+0.018
Neg logit contributions
 our-0.006
 enough-0.005
 another-0.005
 ourselves-0.005
 scar-0.005
Token and
Token ID290
Feature activation+3.71e-01

Pos logit contributions
 our+0.027
oppy+0.026
 ourselves+0.025
 enough+0.025
 another+0.024
Neg logit contributions
cut-0.044
Me-0.043
 ad-0.042
he-0.039
 surrounding-0.037
Token shouted
Token ID17293
Feature activation-3.24e-01

Pos logit contributions
 our+0.011
 another+0.010
 enough+0.010
 your+0.009
 band+0.009
Neg logit contributions
umm-0.020
Me-0.020
cut-0.018
year-0.018
 ad-0.018
Token,
Token ID11
Feature activation-1.68e-01

Pos logit contributions
Me+0.014
 ad+0.014
year+0.013
 capt+0.013
cut+0.013
Neg logit contributions
 loop-0.010
 our-0.010
 ourselves-0.010
 scar-0.010
ig-0.010
Token "
Token ID366
Feature activation-3.11e-01

Pos logit contributions
Me+0.007
umm+0.007
year+0.007
cut+0.007
 ad+0.007
Neg logit contributions
 our-0.006
 another-0.006
 enough-0.006
 your-0.005
 band-0.005
TokenWhat
Token ID2061
Feature activation+1.07e+00

Pos logit contributions
Me+0.012
umm+0.011
cut+0.011
 ad+0.010
year+0.010
Neg logit contributions
 our-0.013
 enough-0.013
 another-0.012
 your-0.012
 band-0.012
Token have
Token ID423
Feature activation+2.70e+00

Pos logit contributions
 scar+0.034
 enough+0.033
 our+0.033
ig+0.033
 ourselves+0.033
Neg logit contributions
Me-0.048
 ad-0.042
cut-0.041
year-0.041
vous-0.040
Token you
Token ID345
Feature activation+9.13e-01

Pos logit contributions
ounce+0.076
 scar+0.072
oppy+0.069
 pose+0.069
ig+0.066
Neg logit contributions
Me-0.123
 ad-0.114
bridge-0.114
Here-0.113
year-0.112
Token done
Token ID1760
Feature activation+1.07e+00

Pos logit contributions
 our+0.040
 another+0.038
 enough+0.038
 ourselves+0.037
 loop+0.036
Neg logit contributions
Me-0.031
 ad-0.029
cut-0.028
thank-0.028
year-0.028
Token,
Token ID11
Feature activation-8.45e-02

Pos logit contributions
 our+0.021
 another+0.020
 enough+0.020
 band+0.019
 scar+0.019
Neg logit contributions
Me-0.060
cut-0.058
 ad-0.058
umm-0.058
year-0.056
Token you
Token ID345
Feature activation+2.00e-01

Pos logit contributions
Me+0.004
cut+0.004
umm+0.004
 ad+0.004
igator+0.004
Neg logit contributions
 enough-0.002
 ourselves-0.002
ig-0.002
ounce-0.002
 our-0.002
Token naughty
Token ID45128
Feature activation+5.24e-01

Pos logit contributions
 our+0.008
 ourselves+0.007
 enough+0.007
 another+0.007
 scar+0.007
Neg logit contributions
Me-0.007
cut-0.007
 ad-0.007
year-0.007
vous-0.007
Token and
Token ID290
Feature activation+2.39e-02

Pos logit contributions
Me+0.029
cut+0.028
 ad+0.028
umm+0.027
 capt+0.026
Neg logit contributions
 our-0.014
 band-0.012
 enough-0.012
 ourselves-0.012
 your-0.012
Token asked
Token ID1965
Feature activation+1.30e-01

Pos logit contributions
 our+0.001
 enough+0.001
 ourselves+0.001
ig+0.001
 another+0.001
Neg logit contributions
Me-0.001
cut-0.001
 ad-0.001
year-0.001
 capt-0.001
Token,
Token ID11
Feature activation+8.37e-02

Pos logit contributions
 our+0.002
 enough+0.002
 another+0.002
 your+0.002
 something+0.002
Neg logit contributions
Me-0.008
umm-0.008
cut-0.008
 capt-0.008
bridge-0.008
Token "
Token ID366
Feature activation-1.88e-01

Pos logit contributions
 our+0.003
 another+0.003
 enough+0.003
 something+0.003
 your+0.003
Neg logit contributions
Me-0.004
umm-0.004
cut-0.003
 ad-0.003
year-0.003
TokenWhat
Token ID2061
Feature activation+8.54e-01

Pos logit contributions
Me+0.006
umm+0.006
cut+0.006
 ad+0.006
year+0.006
Neg logit contributions
 our-0.008
 another-0.008
 enough-0.008
 band-0.008
 your-0.007
Token have
Token ID423
Feature activation+2.69e+00

Pos logit contributions
ounce+0.026
ig+0.025
 our+0.025
 scar+0.025
 enough+0.025
Neg logit contributions
Me-0.041
 ad-0.037
year-0.034
cut-0.033
 capt-0.033
Token you
Token ID345
Feature activation+8.79e-01

Pos logit contributions
ounce+0.063
oppy+0.059
ig+0.056
 loop+0.056
 scar+0.055
Neg logit contributions
Me-0.144
 ad-0.131
umm-0.130
year-0.130
igator-0.128
Token seen
Token ID1775
Feature activation+2.93e-01

Pos logit contributions
 our+0.034
 enough+0.033
 another+0.032
 band+0.032
 ourselves+0.031
Neg logit contributions
Me-0.034
 ad-0.032
year-0.031
cut-0.031
 capt-0.030
Token,
Token ID11
Feature activation+1.70e-01

Pos logit contributions
 our+0.007
 enough+0.007
 another+0.006
 band+0.006
 loop+0.006
Neg logit contributions
Me-0.015
cut-0.015
umm-0.015
 ad-0.015
year-0.014
Token John
Token ID1757
Feature activation-3.92e-02

Pos logit contributions
 our+0.007
 enough+0.007
 another+0.006
 your+0.006
 band+0.006
Neg logit contributions
Me-0.007
umm-0.006
cut-0.006
 ad-0.006
year-0.006
Token?"
Token ID1701
Feature activation+5.18e-01

Pos logit contributions
Me+0.002
cut+0.002
 ad+0.002
umm+0.002
year+0.002
Neg logit contributions
 our-0.001
 another-0.001
 enough-0.001
 band-0.001
 ourselves-0.001
Token the
Token ID262
Feature activation+4.78e-01

Pos logit contributions
Me+0.004
cut+0.004
 ad+0.004
year+0.004
umm+0.004
Neg logit contributions
 our-0.002
 enough-0.002
ounce-0.001
 ourselves-0.001
ig-0.001
Token garden
Token ID11376
Feature activation+8.21e-01

Pos logit contributions
 our+0.017
 enough+0.016
 another+0.016
 band+0.015
 your+0.015
Neg logit contributions
Me-0.021
umm-0.020
cut-0.020
 ad-0.019
year-0.019
Token.
Token ID13
Feature activation+4.22e-01

Pos logit contributions
 our+0.020
 enough+0.017
 another+0.017
 your+0.016
 ourselves+0.016
Neg logit contributions
Me-0.045
cut-0.044
umm-0.043
 ad-0.043
year-0.042
Token "
Token ID366
Feature activation+9.89e-02

Pos logit contributions
 our+0.018
 enough+0.017
 another+0.017
 band+0.017
 ourselves+0.017
Neg logit contributions
Me-0.016
 ad-0.014
cut-0.014
 capt-0.014
year-0.013
TokenWhat
Token ID2061
Feature activation+1.07e+00

Pos logit contributions
 our+0.005
 another+0.004
 enough+0.004
 band+0.004
 your+0.004
Neg logit contributions
umm-0.003
Me-0.003
cut-0.003
 ad-0.003
year-0.003
Token have
Token ID423
Feature activation+2.68e+00

Pos logit contributions
ounce+0.035
 our+0.034
 scar+0.034
ig+0.034
 ourselves+0.033
Neg logit contributions
Me-0.048
 ad-0.043
year-0.042
vous-0.040
 capt-0.040
Token you
Token ID345
Feature activation+1.06e+00

Pos logit contributions
ounce+0.076
 scar+0.067
 brother+0.064
 loop+0.064
oppy+0.064
Neg logit contributions
Me-0.131
 ad-0.118
Here-0.117
year-0.117
umm-0.115
Token found
Token ID1043
Feature activation+6.96e-01

Pos logit contributions
 our+0.039
 enough+0.037
 another+0.037
 band+0.035
ounce+0.035
Neg logit contributions
Me-0.044
cut-0.041
 ad-0.041
year-0.040
umm-0.040
Token,
Token ID11
Feature activation-6.78e-02

Pos logit contributions
 our+0.016
 enough+0.015
 another+0.015
 band+0.015
ounce+0.014
Neg logit contributions
Me-0.036
cut-0.036
umm-0.035
 ad-0.035
year-0.034
Token Mo
Token ID4270
Feature activation+7.50e-01

Pos logit contributions
Me+0.003
 ad+0.003
cut+0.003
umm+0.002
 capt+0.002
Neg logit contributions
 another-0.002
 our-0.002
 band-0.002
 enough-0.002
ounce-0.002
Tokenira
Token ID8704
Feature activation+3.96e-01

Pos logit contributions
 our+0.024
 enough+0.022
 another+0.022
 band+0.021
 ourselves+0.021
Neg logit contributions
Me-0.035
cut-0.034
 ad-0.033
umm-0.033
year-0.033
Token sheep
Token ID15900
Feature activation+4.21e-01

Pos logit contributions
 our+0.005
 enough+0.005
 another+0.005
 your+0.005
 ourselves+0.005
Neg logit contributions
Me-0.009
cut-0.008
 ad-0.008
umm-0.008
year-0.008
Token asked
Token ID1965
Feature activation+5.59e-01

Pos logit contributions
 our+0.017
 enough+0.016
 another+0.016
 your+0.016
 ourselves+0.015
Neg logit contributions
Me-0.016
cut-0.015
 ad-0.015
umm-0.014
year-0.014
Token,
Token ID11
Feature activation+8.68e-03

Pos logit contributions
 our+0.011
 enough+0.010
 loop+0.009
ounce+0.009
 ourselves+0.009
Neg logit contributions
Me-0.033
cut-0.032
 ad-0.031
year-0.031
umm-0.030
Token "
Token ID366
Feature activation-3.17e-01

Pos logit contributions
 our+0.000
 enough+0.000
 another+0.000
 your+0.000
 ourselves+0.000
Neg logit contributions
Me-0.000
cut-0.000
umm-0.000
 ad-0.000
year-0.000
TokenWhat
Token ID2061
Feature activation+1.04e+00

Pos logit contributions
Me+0.009
umm+0.008
cut+0.008
 ad+0.008
year+0.008
Neg logit contributions
 our-0.016
 another-0.016
 enough-0.016
 band-0.015
 your-0.015
Token have
Token ID423
Feature activation+2.68e+00

Pos logit contributions
ounce+0.035
ig+0.033
 scar+0.032
 our+0.032
oppy+0.032
Neg logit contributions
Me-0.047
 ad-0.041
cut-0.039
 redd-0.039
 ju-0.038
Token we
Token ID356
Feature activation+1.44e+00

Pos logit contributions
ounce+0.075
ig+0.064
 loop+0.064
oppy+0.063
 mistake+0.061
Neg logit contributions
Me-0.134
Here-0.123
 ju-0.123
igator-0.121
umm-0.121
Token got
Token ID1392
Feature activation+1.21e+00

Pos logit contributions
 our+0.050
 enough+0.047
 another+0.046
 your+0.043
 band+0.043
Neg logit contributions
Me-0.066
cut-0.062
umm-0.061
 ad-0.060
year-0.060
Token to
Token ID284
Feature activation+2.76e-01

Pos logit contributions
 our+0.035
 another+0.033
 loop+0.032
 enough+0.032
 band+0.032
Neg logit contributions
Me-0.056
cut-0.056
 ad-0.054
year-0.054
umm-0.054
Token feed
Token ID3745
Feature activation+1.53e+00

Pos logit contributions
ig+0.009
ounce+0.009
oppy+0.008
azz+0.008
 enough+0.008
Neg logit contributions
Me-0.012
cut-0.012
thank-0.011
 ad-0.011
 capt-0.011
Token it
Token ID340
Feature activation+1.70e+00

Pos logit contributions
 enough+0.040
ig+0.038
 loop+0.038
ounce+0.038
 appearances+0.038
Neg logit contributions
Me-0.075
cut-0.073
umm-0.068
igator-0.067
 ad-0.067
Token friendly
Token ID8030
Feature activation-2.77e-01

Pos logit contributions
 our+0.003
 enough+0.003
 ourselves+0.003
 another+0.003
 later+0.003
Neg logit contributions
Me-0.003
cut-0.003
 ad-0.003
umm-0.003
 capt-0.003
Token smile
Token ID8212
Feature activation-8.02e-01

Pos logit contributions
Me+0.011
 ad+0.010
cut+0.010
 sensation+0.010
 capt+0.010
Neg logit contributions
 our-0.011
 ourselves-0.011
 enough-0.010
 later-0.010
 another-0.010
Token.
Token ID13
Feature activation+7.69e-01

Pos logit contributions
cut+0.037
Me+0.036
 ad+0.035
year+0.035
umm+0.034
Neg logit contributions
 our-0.026
 another-0.023
 ourselves-0.023
 enough-0.023
 your-0.023
Token "
Token ID366
Feature activation+2.10e-01

Pos logit contributions
 our+0.031
 enough+0.030
 another+0.029
 band+0.028
 your+0.027
Neg logit contributions
Me-0.031
cut-0.028
umm-0.028
year-0.027
 ad-0.027
TokenWhat
Token ID2061
Feature activation+9.85e-01

Pos logit contributions
 another+0.009
 our+0.009
 ourselves+0.008
 band+0.008
 later+0.008
Neg logit contributions
umm-0.007
cut-0.007
 ad-0.006
Me-0.006
rated-0.006
Token have
Token ID423
Feature activation+2.67e+00

Pos logit contributions
ounce+0.032
 scar+0.030
ig+0.030
 our+0.030
 enough+0.030
Neg logit contributions
Me-0.045
 ad-0.041
year-0.039
cut-0.038
vous-0.038
Token you
Token ID345
Feature activation+8.60e-01

Pos logit contributions
ounce+0.050
 enough+0.049
 loop+0.049
 our+0.048
 scar+0.047
Neg logit contributions
Me-0.152
 ad-0.141
year-0.140
cut-0.140
umm-0.140
Token come
Token ID1282
Feature activation+6.11e-01

Pos logit contributions
 our+0.032
 enough+0.031
 another+0.030
 band+0.030
 loop+0.029
Neg logit contributions
Me-0.035
 ad-0.033
cut-0.032
year-0.032
umm-0.031
Token to
Token ID284
Feature activation+2.47e-01

Pos logit contributions
 band+0.017
 loop+0.017
 enough+0.017
ig+0.017
 another+0.016
Neg logit contributions
cut-0.028
 ad-0.027
Me-0.027
year-0.026
 capt-0.026
Token do
Token ID466
Feature activation+6.43e-01

Pos logit contributions
ig+0.008
 enough+0.008
ounce+0.008
lets+0.007
 half+0.007
Neg logit contributions
cut-0.010
Me-0.010
 ad-0.009
thank-0.009
 surrounding-0.009
Token here
Token ID994
Feature activation+6.45e-03

Pos logit contributions
 our+0.018
 enough+0.017
 loop+0.017
 appearances+0.017
ounce+0.017
Neg logit contributions
cut-0.030
Me-0.029
 ad-0.029
year-0.028
thank-0.028
Token.
Token ID13
Feature activation+2.48e-01

Pos logit contributions
 our+0.006
oppy+0.005
 enough+0.005
 band+0.005
 loop+0.005
Neg logit contributions
Me-0.007
 ad-0.007
cut-0.007
umm-0.007
year-0.007
Token
Token ID198
Feature activation+1.13e-01

Pos logit contributions
 another+0.006
 Sam+0.006
 our+0.006
 something+0.006
 your+0.005
Neg logit contributions
umm-0.014
 ad-0.014
Me-0.014
cut-0.013
rated-0.013
Token
Token ID198
Feature activation+4.61e-01

Pos logit contributions
 our+0.003
 another+0.003
 your+0.003
 later+0.002
 Sam+0.002
Neg logit contributions
umm-0.006
Me-0.006
 ad-0.006
 redd-0.006
cut-0.006
Token"
Token ID1
Feature activation+3.69e-01

Pos logit contributions
 our+0.007
 another+0.006
 your+0.006
 enough+0.005
 band+0.005
Neg logit contributions
umm-0.030
Me-0.030
 ad-0.029
cut-0.028
year-0.028
TokenWhat
Token ID2061
Feature activation+9.70e-01

Pos logit contributions
 our+0.017
 another+0.017
 band+0.016
 your+0.015
 ourselves+0.015
Neg logit contributions
umm-0.012
Me-0.011
 ad-0.011
year-0.011
cut-0.011
Token have
Token ID423
Feature activation+2.67e+00

Pos logit contributions
 scar+0.033
 our+0.033
 ourselves+0.033
 enough+0.033
ounce+0.033
Neg logit contributions
Me-0.041
 ad-0.036
year-0.035
cut-0.035
vous-0.034
Token you
Token ID345
Feature activation+8.53e-01

Pos logit contributions
ounce+0.078
 scar+0.067
 loop+0.066
oppy+0.066
 pose+0.064
Neg logit contributions
Me-0.126
Here-0.119
 ad-0.118
cut-0.117
year-0.115
Token done
Token ID1760
Feature activation+1.10e+00

Pos logit contributions
 our+0.035
 enough+0.033
 another+0.033
 ourselves+0.032
 band+0.031
Neg logit contributions
Me-0.032
 ad-0.030
cut-0.029
year-0.029
umm-0.028
Token?"
Token ID1701
Feature activation+8.05e-01

Pos logit contributions
 our+0.025
 another+0.023
 enough+0.023
 loop+0.021
 band+0.021
Neg logit contributions
Me-0.060
cut-0.059
 ad-0.058
umm-0.057
year-0.056
Token he
Token ID339
Feature activation+4.89e-01

Pos logit contributions
 our+0.026
 another+0.025
 your+0.023
 enough+0.023
 something+0.022
Neg logit contributions
umm-0.040
Me-0.040
year-0.038
 ad-0.038
cut-0.037
Token said
Token ID531
Feature activation+4.05e-01

Pos logit contributions
 our+0.024
 ourselves+0.022
 enough+0.022
ounce+0.022
 another+0.022
Neg logit contributions
Me-0.015
cut-0.013
 ad-0.013
year-0.012
 capt-0.012
Token Rabbit
Token ID25498
Feature activation+7.39e-01

Pos logit contributions
 our+0.023
 ourselves+0.022
 enough+0.021
 band+0.021
 later+0.021
Neg logit contributions
Me-0.021
umm-0.018
cut-0.018
 capt-0.017
 Everywhere-0.017
Token asked
Token ID1965
Feature activation+7.19e-01

Pos logit contributions
 our+0.030
 enough+0.028
 ourselves+0.027
 another+0.027
 your+0.027
Neg logit contributions
Me-0.029
cut-0.027
 ad-0.027
year-0.025
umm-0.025
Token,
Token ID11
Feature activation+4.25e-01

Pos logit contributions
 our+0.019
 enough+0.017
 ourselves+0.016
 another+0.016
 loop+0.016
Neg logit contributions
Me-0.038
cut-0.036
 ad-0.035
year-0.035
umm-0.034
Token "
Token ID366
Feature activation+4.40e-02

Pos logit contributions
 our+0.017
 enough+0.016
 another+0.016
 your+0.015
 band+0.015
Neg logit contributions
Me-0.017
umm-0.016
cut-0.016
 ad-0.015
year-0.015
TokenWhat
Token ID2061
Feature activation+8.80e-01

Pos logit contributions
 our+0.002
 another+0.002
 enough+0.002
 band+0.002
 your+0.002
Neg logit contributions
Me-0.002
umm-0.001
cut-0.001
 ad-0.001
year-0.001
Token have
Token ID423
Feature activation+2.66e+00

Pos logit contributions
ounce+0.027
ig+0.026
 our+0.026
 scar+0.026
 ourselves+0.025
Neg logit contributions
Me-0.042
 ad-0.038
cut-0.035
vous-0.035
 redd-0.035
Token you
Token ID345
Feature activation+9.35e-01

Pos logit contributions
ounce+0.077
 pose+0.067
 mistake+0.067
 loop+0.067
ig+0.066
Neg logit contributions
Me-0.124
Here-0.114
 ju-0.113
igator-0.112
 ad-0.112
Token got
Token ID1392
Feature activation+1.16e+00

Pos logit contributions
 our+0.036
 another+0.034
ounce+0.033
 loop+0.033
 enough+0.033
Neg logit contributions
Me-0.036
 ad-0.035
cut-0.034
year-0.034
thank-0.033
Token there
Token ID612
Feature activation+7.49e-02

Pos logit contributions
 our+0.034
 another+0.032
 loop+0.032
 band+0.031
 ourselves+0.030
Neg logit contributions
cut-0.052
Me-0.052
year-0.051
 ad-0.051
umm-0.050
Token,
Token ID11
Feature activation+3.42e-01

Pos logit contributions
 our+0.002
 loop+0.002
 band+0.002
 another+0.002
 enough+0.002
Neg logit contributions
Me-0.004
umm-0.004
cut-0.004
 ad-0.004
thank-0.004
Token Jude
Token ID30044
Feature activation+6.58e-01

Pos logit contributions
 ourselves+0.009
 band+0.009
 another+0.009
ig+0.009
 our+0.009
Neg logit contributions
Me-0.015
cut-0.015
umm-0.015
 ju-0.015
 redd-0.015
Token him
Token ID683
Feature activation+2.23e-01

Pos logit contributions
 our+0.010
 enough+0.010
 another+0.009
 your+0.008
 band+0.008
Neg logit contributions
Me-0.026
umm-0.025
cut-0.025
 ad-0.024
year-0.024
Token and
Token ID290
Feature activation+4.15e-01

Pos logit contributions
 our+0.007
 ourselves+0.007
oppy+0.007
ig+0.006
 scar+0.006
Neg logit contributions
 ad-0.010
Me-0.010
cut-0.009
year-0.009
 delighted-0.009
Token asked
Token ID1965
Feature activation+3.64e-01

Pos logit contributions
 our+0.018
 ourselves+0.017
 enough+0.016
ig+0.016
 appearances+0.016
Neg logit contributions
Me-0.016
 ad-0.015
cut-0.014
year-0.014
 capt-0.013
Token "
Token ID366
Feature activation-2.27e-01

Pos logit contributions
 our+0.007
 enough+0.007
 another+0.007
 your+0.006
 something+0.006
Neg logit contributions
Me-0.022
umm-0.022
cut-0.021
 ad-0.021
year-0.021
TokenWhat
Token ID2061
Feature activation+8.45e-01

Pos logit contributions
Me+0.008
umm+0.008
cut+0.008
 ad+0.008
year+0.007
Neg logit contributions
 our-0.010
 another-0.009
 enough-0.009
 band-0.009
 your-0.009
Token have
Token ID423
Feature activation+2.64e+00

Pos logit contributions
ounce+0.026
 scar+0.025
ig+0.025
 our+0.025
 enough+0.025
Neg logit contributions
Me-0.040
 ad-0.036
cut-0.034
year-0.034
vous-0.033
Token you
Token ID345
Feature activation+8.06e-01

Pos logit contributions
ounce+0.074
 scar+0.064
ig+0.063
 loop+0.062
oppy+0.061
Neg logit contributions
Me-0.126
 ad-0.119
year-0.117
igator-0.114
bridge-0.114
Token got
Token ID1392
Feature activation+1.07e+00

Pos logit contributions
 our+0.033
 enough+0.031
 another+0.031
 band+0.030
ounce+0.030
Neg logit contributions
Me-0.030
 ad-0.028
year-0.028
cut-0.027
thank-0.027
Token there
Token ID612
Feature activation+9.13e-03

Pos logit contributions
 our+0.030
 another+0.028
 band+0.027
 loop+0.027
 enough+0.026
Neg logit contributions
cut-0.050
Me-0.050
 ad-0.049
year-0.049
umm-0.048
Token Joe
Token ID5689
Feature activation+3.13e-01

Pos logit contributions
 our+0.000
 loop+0.000
 band+0.000
 another+0.000
 enough+0.000
Neg logit contributions
Me-0.000
umm-0.000
cut-0.000
 ad-0.000
thank-0.000
Token?".
Token ID43634
Feature activation+9.71e-01

Pos logit contributions
 our+0.008
 another+0.008
 scar+0.008
 loop+0.008
 appearances+0.008
Neg logit contributions
cut-0.015
 ad-0.014
Me-0.014
igator-0.014
thank-0.014
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
Me+0.021
 ad+0.020
umm+0.020
cut+0.020
year+0.020
Neg logit contributions
 our-0.009
 enough-0.008
 another-0.008
 loop-0.008
 band-0.008
Token They
Token ID1119
Feature activation-5.82e-01

Pos logit contributions
Me+0.056
cut+0.055
umm+0.054
 ad+0.053
year+0.052
Neg logit contributions
 our-0.039
 another-0.036
 enough-0.035
 your-0.034
 ourselves-0.032
Token had
Token ID550
Feature activation-5.16e-01

Pos logit contributions
Me+0.022
umm+0.021
cut+0.021
year+0.020
 ad+0.020
Neg logit contributions
 our-0.024
 enough-0.023
 another-0.023
 band-0.022
 your-0.021
Token stopped
Token ID5025
Feature activation+1.04e-01

Pos logit contributions
Me+0.025
 ad+0.023
year+0.022
cut+0.022
thank+0.022
Neg logit contributions
 our-0.015
ig-0.014
oppy-0.014
ounce-0.014
 enough-0.014
Token qu
Token ID627
Feature activation+7.01e-01

Pos logit contributions
ig+0.003
 enough+0.003
 loop+0.003
oppy+0.003
ounce+0.003
Neg logit contributions
Me-0.004
 ad-0.004
 capt-0.004
cut-0.004
 surrounding-0.004
Tokenarre
Token ID9624
Feature activation+2.64e+00

Pos logit contributions
 our+0.024
 enough+0.023
 another+0.022
 your+0.021
 scar+0.020
Neg logit contributions
Me-0.032
umm-0.032
year-0.031
rol-0.030
he-0.029
Tokenling
Token ID1359
Feature activation+1.02e+00

Pos logit contributions
 our+0.146
 enough+0.140
 another+0.137
 your+0.133
 ourselves+0.133
Neg logit contributions
Me-0.068
umm-0.060
cut-0.057
year-0.055
 ad-0.055
Token.
Token ID13
Feature activation-6.01e-01

Pos logit contributions
 our+0.022
 enough+0.020
oppy+0.019
ounce+0.019
 another+0.019
Neg logit contributions
Me-0.057
cut-0.054
 ad-0.054
umm-0.053
year-0.053
Token To
Token ID1675
Feature activation+1.95e-01

Pos logit contributions
Me+0.028
cut+0.028
umm+0.028
 ad+0.027
year+0.027
Neg logit contributions
 our-0.020
 another-0.018
 enough-0.018
 your-0.017
 ourselves-0.016
Token celebrate
Token ID10648
Feature activation+3.01e-01

Pos logit contributions
 enough+0.005
ounce+0.005
ig+0.004
 scar+0.004
 our+0.004
Neg logit contributions
Me-0.010
 ad-0.010
cut-0.009
thank-0.009
umm-0.009
Token,
Token ID11
Feature activation-3.96e-01

Pos logit contributions
 our+0.006
 enough+0.005
ounce+0.005
 another+0.005
 band+0.005
Neg logit contributions
Me-0.018
umm-0.016
 ad-0.016
cut-0.016
year-0.016
Token.
Token ID13
Feature activation+8.26e-02

Pos logit contributions
 our+0.026
oppy+0.024
 enough+0.023
 another+0.023
 my+0.022
Neg logit contributions
Me-0.034
 ad-0.033
cut-0.033
 surrounding-0.030
year-0.030
Token
Token ID198
Feature activation-1.16e-01

Pos logit contributions
 our+0.003
 another+0.003
 enough+0.003
 your+0.002
 band+0.002
Neg logit contributions
Me-0.004
cut-0.004
umm-0.004
 ad-0.004
year-0.004
Token
Token ID198
Feature activation+2.39e-01

Pos logit contributions
umm+0.005
Me+0.005
cut+0.005
 ad+0.005
year+0.005
Neg logit contributions
 our-0.004
 another-0.004
 enough-0.003
 your-0.003
 band-0.003
Token"
Token ID1
Feature activation+2.34e-01

Pos logit contributions
 enough+0.006
 our+0.006
 another+0.005
 band+0.005
 brother+0.005
Neg logit contributions
Me-0.014
cut-0.012
year-0.012
 ad-0.012
umm-0.012
TokenWhat
Token ID2061
Feature activation+1.02e+00

Pos logit contributions
 our+0.011
 another+0.010
 enough+0.010
 band+0.010
 your+0.010
Neg logit contributions
Me-0.008
cut-0.007
umm-0.007
 ad-0.007
year-0.007
Token have
Token ID423
Feature activation+2.62e+00

Pos logit contributions
 enough+0.031
 our+0.031
 scar+0.031
ig+0.031
ounce+0.031
Neg logit contributions
Me-0.048
 ad-0.042
year-0.041
vous-0.038
umm-0.038
Token you
Token ID345
Feature activation+1.03e+00

Pos logit contributions
ounce+0.072
 wing+0.065
oppy+0.065
 scar+0.062
 pose+0.061
Neg logit contributions
Me-0.126
 ad-0.117
year-0.115
Here-0.114
thank-0.112
Token done
Token ID1760
Feature activation+1.10e+00

Pos logit contributions
 our+0.043
 enough+0.041
 another+0.041
 band+0.039
ounce+0.039
Neg logit contributions
Me-0.037
 ad-0.035
year-0.034
umm-0.033
cut-0.033
Token?"
Token ID1701
Feature activation+6.41e-01

Pos logit contributions
 our+0.025
 another+0.024
 enough+0.024
 band+0.023
 loop+0.022
Neg logit contributions
Me-0.058
 ad-0.057
umm-0.056
cut-0.056
year-0.054
Token Mom
Token ID11254
Feature activation+5.24e-01

Pos logit contributions
 our+0.018
 another+0.017
 enough+0.016
 your+0.016
 something+0.015
Neg logit contributions
Me-0.033
umm-0.033
cut-0.033
year-0.031
 ad-0.031
Token asked
Token ID1965
Feature activation+9.82e-01

Pos logit contributions
 our+0.018
 enough+0.018
 another+0.018
 band+0.017
 your+0.016
Neg logit contributions
Me-0.024
umm-0.023
cut-0.022
year-0.021
 ad-0.021
Token"
Token ID1
Feature activation+3.59e-01

Pos logit contributions
 our+0.017
 another+0.016
 enough+0.015
 your+0.015
 band+0.014
Neg logit contributions
umm-0.029
Me-0.029
cut-0.028
 ad-0.028
year-0.027
TokenOh
Token ID5812
Feature activation+8.44e-01

Pos logit contributions
 another+0.018
 our+0.018
 half+0.017
 band+0.017
 enough+0.017
Neg logit contributions
cut-0.010
 ad-0.010
umm-0.009
year-0.009
Me-0.009
Token no
Token ID645
Feature activation+1.40e-01

Pos logit contributions
 our+0.023
 another+0.021
 enough+0.021
 band+0.021
ounce+0.021
Neg logit contributions
Me-0.043
umm-0.040
 ad-0.039
cut-0.039
year-0.038
Token!
Token ID0
Feature activation+5.30e-01

Pos logit contributions
 our+0.004
 another+0.003
 enough+0.003
 ourselves+0.003
 band+0.003
Neg logit contributions
Me-0.007
umm-0.007
 ad-0.007
cut-0.007
year-0.006
Token What
Token ID1867
Feature activation+8.19e-01

Pos logit contributions
 our+0.020
 enough+0.019
 band+0.019
 scar+0.019
 another+0.018
Neg logit contributions
Me-0.022
umm-0.020
year-0.019
cut-0.019
 ad-0.018
Token have
Token ID423
Feature activation+2.61e+00

Pos logit contributions
 our+0.032
 enough+0.031
ounce+0.031
 scar+0.030
 ourselves+0.030
Neg logit contributions
Me-0.032
 ad-0.028
year-0.027
cut-0.027
bridge-0.025
Token we
Token ID356
Feature activation+1.17e+00

Pos logit contributions
ounce+0.060
 scar+0.055
 loop+0.055
oppy+0.054
ig+0.053
Neg logit contributions
Me-0.140
umm-0.128
 ad-0.127
year-0.125
Here-0.123
Token done
Token ID1760
Feature activation+9.66e-01

Pos logit contributions
 our+0.042
 enough+0.039
 another+0.038
 your+0.037
 band+0.036
Neg logit contributions
Me-0.054
umm-0.051
cut-0.050
year-0.048
 ad-0.048
Token?"
Token ID1701
Feature activation+9.36e-01

Pos logit contributions
 our+0.022
 enough+0.020
 another+0.019
 your+0.018
 band+0.017
Neg logit contributions
Me-0.056
umm-0.053
cut-0.053
year-0.052
 ad-0.052
Token Sara
Token ID24799
Feature activation+7.45e-01

Pos logit contributions
 our+0.026
 another+0.026
 enough+0.025
 your+0.023
 something+0.023
Neg logit contributions
umm-0.050
cut-0.049
Me-0.048
year-0.048
 ad-0.046
Token cried
Token ID16896
Feature activation+1.76e-01

Pos logit contributions
 our+0.033
 enough+0.031
 ourselves+0.030
 another+0.030
 your+0.029
Neg logit contributions
Me-0.026
cut-0.024
 ad-0.024
year-0.023
umm-0.023
Token and
Token ID290
Feature activation-1.12e-01

Pos logit contributions
 our+0.019
 ourselves+0.018
 loop+0.018
oppy+0.018
 another+0.017
Neg logit contributions
cut-0.031
 ad-0.029
Me-0.028
year-0.026
he-0.025
Token Lily
Token ID20037
Feature activation-3.53e-02

Pos logit contributions
Me+0.006
umm+0.006
cut+0.006
 ad+0.006
year+0.006
Neg logit contributions
 our-0.003
 enough-0.003
 another-0.003
 your-0.002
 band-0.002
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
Me+0.001
cut+0.001
 ad+0.001
year+0.001
he+0.001
Neg logit contributions
 our-0.001
oppy-0.001
 ourselves-0.001
 loop-0.001
 another-0.001
Token "
Token ID366
Feature activation-1.47e-01

Pos logit contributions
 our+0.011
 another+0.010
 enough+0.010
 your+0.010
 band+0.009
Neg logit contributions
Me-0.012
umm-0.012
cut-0.012
 ad-0.012
year-0.012
TokenWhat
Token ID2061
Feature activation+9.35e-01

Pos logit contributions
Me+0.005
cut+0.004
umm+0.004
 ad+0.004
year+0.004
Neg logit contributions
 our-0.007
 enough-0.007
 another-0.007
 your-0.007
 band-0.006
Token have
Token ID423
Feature activation+2.60e+00

Pos logit contributions
 enough+0.031
 scar+0.030
 our+0.030
 ourselves+0.029
ig+0.029
Neg logit contributions
Me-0.042
 ad-0.037
year-0.036
cut-0.035
vous-0.035
Token you
Token ID345
Feature activation+8.42e-01

Pos logit contributions
ounce+0.073
 scar+0.068
oppy+0.066
 loop+0.064
 wing+0.064
Neg logit contributions
Me-0.122
 ad-0.112
year-0.112
Here-0.112
thank-0.106
Token done
Token ID1760
Feature activation+1.02e+00

Pos logit contributions
 our+0.034
 enough+0.033
 another+0.032
 ourselves+0.030
 band+0.030
Neg logit contributions
Me-0.033
 ad-0.030
cut-0.029
year-0.029
umm-0.029
Token?
Token ID30
Feature activation+7.09e-01

Pos logit contributions
 our+0.022
 enough+0.021
 another+0.021
 loop+0.020
 band+0.020
Neg logit contributions
Me-0.055
 ad-0.054
cut-0.054
umm-0.053
year-0.052
Token This
Token ID770
Feature activation+3.51e-01

Pos logit contributions
 our+0.030
 enough+0.030
 another+0.028
 scar+0.028
 band+0.028
Neg logit contributions
Me-0.026
year-0.023
cut-0.023
umm-0.023
 ad-0.023
Token is
Token ID318
Feature activation+4.14e-01

Pos logit contributions
ounce+0.012
 our+0.012
 enough+0.012
oppy+0.012
 ourselves+0.012
Neg logit contributions
Me-0.014
cut-0.014
bridge-0.013
 ad-0.013
umm-0.012
Token
Token ID241
Feature activation-6.26e-03

Pos logit contributions
Me+0.002
umm+0.002
cut+0.002
year+0.002
 redd+0.002
Neg logit contributions
 our-0.003
 your-0.003
 another-0.003
 enough-0.003
 my-0.003
TokenWhat
Token ID2061
Feature activation+8.65e-01

Pos logit contributions
umm+0.000
cut+0.000
atively+0.000
 ju+0.000
 redd+0.000
Neg logit contributions
 another-0.000
 our-0.000
 your-0.000
 something-0.000
 ourselves-0.000
Tokenâ
Token ID22940
Feature activation+1.32e+00

Pos logit contributions
ounce+0.026
 our+0.026
 enough+0.025
ig+0.025
 scar+0.025
Neg logit contributions
Me-0.041
 ad-0.037
year-0.035
cut-0.034
 redd-0.033
Token
Token ID26391
Feature activation+6.57e-02

Pos logit contributions
 our+0.060
 enough+0.058
 another+0.057
 your+0.056
 band+0.056
Neg logit contributions
Me-0.045
umm-0.040
year-0.040
 ad-0.039
cut-0.038
Token
Token ID8151
Feature activation+1.34e+00

Pos logit contributions
 our+0.002
 another+0.002
 enough+0.002
 your+0.002
ounce+0.002
Neg logit contributions
Me-0.003
cut-0.003
umm-0.003
year-0.003
 ad-0.003
Tokens
Token ID82
Feature activation+2.59e+00

Pos logit contributions
 our+0.050
 enough+0.048
 another+0.048
ounce+0.046
 your+0.045
Neg logit contributions
Me-0.056
umm-0.054
cut-0.052
 ad-0.051
year-0.050
Token up
Token ID510
Feature activation+1.54e+00

Pos logit contributions
 our+0.052
 enough+0.046
 another+0.045
 your+0.040
 band+0.040
Neg logit contributions
Me-0.157
umm-0.148
cut-0.148
 ad-0.146
year-0.146
Token there
Token ID612
Feature activation+4.27e-01

Pos logit contributions
 our+0.033
 enough+0.030
 another+0.029
 your+0.026
 band+0.025
Neg logit contributions
Me-0.092
umm-0.087
cut-0.087
year-0.085
 ad-0.085
Token?
Token ID30
Feature activation+6.49e-01

Pos logit contributions
 our+0.009
 enough+0.008
 another+0.008
 band+0.007
 ourselves+0.007
Neg logit contributions
Me-0.025
cut-0.023
umm-0.023
 ad-0.023
year-0.022
Tokenâ
Token ID22940
Feature activation+8.03e-01

Pos logit contributions
 our+0.025
 enough+0.024
 another+0.023
 band+0.022
 your+0.022
Neg logit contributions
Me-0.027
cut-0.025
umm-0.025
 ad-0.024
year-0.024
Token
Token ID26391
Feature activation-2.97e-01

Pos logit contributions
 our+0.041
 enough+0.039
 another+0.039
 your+0.038
 band+0.038
Neg logit contributions
Me-0.023
umm-0.020
year-0.020
cut-0.019
 ad-0.019
Token.
Token ID13
Feature activation-3.20e-01

Pos logit contributions
 our+0.002
oppy+0.002
 loop+0.002
 band+0.002
 enough+0.002
Neg logit contributions
Me-0.004
 ad-0.004
cut-0.004
umm-0.004
year-0.004
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
cut+0.019
umm+0.019
rated+0.019
bridge+0.018
year+0.018
Neg logit contributions
 Sam-0.008
 another-0.008
 our-0.007
 your-0.007
 Ben-0.007
Token
Token ID198
Feature activation+2.29e-01

Pos logit contributions
umm+0.009
Me+0.009
cut+0.008
 ad+0.008
year+0.008
Neg logit contributions
 our-0.005
 another-0.005
 your-0.005
 ourselves-0.004
 later-0.004
Token"
Token ID1
Feature activation+1.97e-01

Pos logit contributions
 our+0.004
 another+0.004
 enough+0.004
 your+0.003
 band+0.003
Neg logit contributions
Me-0.014
umm-0.014
cut-0.013
 ad-0.013
year-0.013
TokenWhat
Token ID2061
Feature activation+9.71e-01

Pos logit contributions
 our+0.009
 another+0.009
 enough+0.008
 band+0.008
 your+0.008
Neg logit contributions
Me-0.007
umm-0.007
cut-0.007
 ad-0.006
year-0.006
Token have
Token ID423
Feature activation+2.59e+00

Pos logit contributions
 our+0.032
 enough+0.032
 scar+0.030
 ourselves+0.030
 another+0.030
Neg logit contributions
Me-0.044
 ad-0.039
year-0.039
cut-0.038
umm-0.037
Token you
Token ID345
Feature activation+9.28e-01

Pos logit contributions
ounce+0.073
 loop+0.064
 scar+0.061
 wing+0.060
ig+0.059
Neg logit contributions
Me-0.126
Here-0.117
year-0.113
 ad-0.113
umm-0.113
Token done
Token ID1760
Feature activation+1.02e+00

Pos logit contributions
 our+0.037
 enough+0.035
 another+0.035
 ourselves+0.033
 band+0.033
Neg logit contributions
Me-0.036
 ad-0.033
cut-0.033
umm-0.033
year-0.032
Token?"
Token ID1701
Feature activation+6.15e-01

Pos logit contributions
 our+0.022
 another+0.020
 enough+0.020
 band+0.019
 loop+0.018
Neg logit contributions
Me-0.057
cut-0.055
 ad-0.055
umm-0.055
year-0.053
Token she
Token ID673
Feature activation+5.15e-01

Pos logit contributions
 another+0.015
 our+0.015
 your+0.014
 Sam+0.013
 something+0.013
Neg logit contributions
umm-0.037
cut-0.036
Me-0.035
rol-0.035
year-0.035
Token yelled
Token ID22760
Feature activation+2.99e-01

Pos logit contributions
 our+0.024
 enough+0.022
 another+0.022
 ourselves+0.022
ounce+0.021
Neg logit contributions
Me-0.017
cut-0.015
 ad-0.015
year-0.015
umm-0.014
Token them
Token ID606
Feature activation+1.24e-01

Pos logit contributions
 our+0.000
 enough+0.000
ounce+0.000
 ourselves+0.000
 another+0.000
Neg logit contributions
Me-0.001
cut-0.001
year-0.001
 ad-0.001
umm-0.001
Token asked
Token ID1965
Feature activation+2.29e-01

Pos logit contributions
 our+0.005
 enough+0.005
 another+0.005
 your+0.005
 band+0.005
Neg logit contributions
Me-0.005
umm-0.004
cut-0.004
year-0.004
 ad-0.004
Token,
Token ID11
Feature activation+4.49e-02

Pos logit contributions
 our+0.006
 enough+0.006
 another+0.005
 band+0.005
 ourselves+0.005
Neg logit contributions
Me-0.012
cut-0.011
 ad-0.011
year-0.011
umm-0.011
Token "
Token ID366
Feature activation-2.20e-01

Pos logit contributions
 our+0.002
 enough+0.002
 another+0.002
 band+0.002
 your+0.002
Neg logit contributions
Me-0.002
cut-0.002
umm-0.002
year-0.002
 ad-0.002
TokenWhere
Token ID8496
Feature activation+5.29e-01

Pos logit contributions
Me+0.007
umm+0.007
cut+0.006
 ad+0.006
year+0.006
Neg logit contributions
 our-0.010
 another-0.010
 enough-0.010
 band-0.010
 later-0.010
Token have
Token ID423
Feature activation+2.59e+00

Pos logit contributions
 our+0.018
 scar+0.018
ounce+0.017
 ourselves+0.017
 enough+0.017
Neg logit contributions
Me-0.023
 ad-0.020
cut-0.020
year-0.019
igator-0.019
Token you
Token ID345
Feature activation+8.11e-01

Pos logit contributions
ounce+0.071
oppy+0.065
ig+0.064
ap+0.057
 pose+0.056
Neg logit contributions
Me-0.133
bridge-0.121
umm-0.121
Here-0.121
 vendors-0.120
Token been
Token ID587
Feature activation+6.94e-01

Pos logit contributions
 our+0.031
ounce+0.029
oppy+0.029
 appearances+0.029
 scar+0.029
Neg logit contributions
Me-0.030
 ad-0.030
thank-0.029
year-0.028
cut-0.027
Token?"
Token ID1701
Feature activation+2.33e-01

Pos logit contributions
 our+0.021
 another+0.020
 enough+0.020
 loop+0.019
 ourselves+0.019
Neg logit contributions
Me-0.031
 ad-0.031
cut-0.030
umm-0.030
year-0.030
Token The
Token ID383
Feature activation+2.01e-01

Pos logit contributions
 our+0.010
 enough+0.009
 another+0.009
 ourselves+0.009
 your+0.008
Neg logit contributions
Me-0.009
cut-0.008
 ad-0.008
year-0.008
umm-0.008
Token squirrel
Token ID33039
Feature activation+5.38e-01

Pos logit contributions
 our+0.006
 another+0.005
 band+0.005
 enough+0.005
 brother+0.005
Neg logit contributions
Me-0.010
umm-0.010
cut-0.010
year-0.010
 ad-0.010
Token together
Token ID1978
Feature activation-4.63e-01

Pos logit contributions
 our+0.001
 enough+0.001
oppy+0.001
 another+0.001
ounce+0.001
Neg logit contributions
Me-0.003
 ad-0.003
cut-0.003
year-0.003
umm-0.003
Token.
Token ID13
Feature activation+2.57e-01

Pos logit contributions
Me+0.024
umm+0.023
cut+0.023
 ad+0.022
year+0.022
Neg logit contributions
 our-0.013
 enough-0.012
 another-0.012
 your-0.011
 band-0.011
Token They
Token ID1119
Feature activation+4.02e-01

Pos logit contributions
 our+0.009
 enough+0.009
 another+0.009
 your+0.008
 ourselves+0.008
Neg logit contributions
Me-0.011
cut-0.011
umm-0.011
 ad-0.010
year-0.010
Token never
Token ID1239
Feature activation+4.58e-01

Pos logit contributions
 our+0.017
 enough+0.016
 another+0.016
 your+0.015
 ourselves+0.015
Neg logit contributions
Me-0.016
cut-0.014
umm-0.014
 ad-0.014
year-0.014
Token qu
Token ID627
Feature activation+8.24e-01

Pos logit contributions
 our+0.021
 ourselves+0.020
 another+0.019
 your+0.019
 enough+0.019
Neg logit contributions
Me-0.015
 ad-0.014
umm-0.013
cut-0.013
year-0.013
Tokenarre
Token ID9624
Feature activation+2.58e+00

Pos logit contributions
 our+0.026
 enough+0.024
 another+0.023
 your+0.023
 band+0.022
Neg logit contributions
Me-0.040
umm-0.039
year-0.038
rol-0.038
vous-0.036
Tokenled
Token ID992
Feature activation+1.01e+00

Pos logit contributions
 our+0.152
 enough+0.146
 another+0.143
 your+0.140
 band+0.139
Neg logit contributions
Me-0.057
umm-0.050
cut-0.048
 ad-0.045
year-0.044
Token again
Token ID757
Feature activation+6.86e-01

Pos logit contributions
 our+0.023
 band+0.021
oppy+0.021
 loop+0.021
ig+0.020
Neg logit contributions
Me-0.053
 ad-0.052
cut-0.051
umm-0.050
year-0.050
Token.
Token ID13
Feature activation+2.25e-01

Pos logit contributions
 our+0.018
 appearances+0.016
ig+0.016
oppy+0.016
 ourselves+0.016
Neg logit contributions
 ad-0.032
Me-0.032
thank-0.032
 capt-0.031
cut-0.030
Token<|endoftext|>
Token ID50256
Feature activation+4.19e-01

Pos logit contributions
 our+0.008
 another+0.008
 enough+0.008
 your+0.007
 ourselves+0.007
Neg logit contributions
Me-0.010
cut-0.010
 ad-0.009
umm-0.009
year-0.009
TokenOnce
Token ID7454
Feature activation+3.83e-01

Pos logit contributions
 another+0.012
 our+0.011
 enough+0.011
 something+0.010
 later+0.010
Neg logit contributions
cut-0.022
umm-0.022
Me-0.021
 ad-0.021
 redd-0.020
Token spot
Token ID4136
Feature activation+2.56e-01

Pos logit contributions
 our+0.005
 another+0.005
 enough+0.005
 band+0.005
 loop+0.004
Neg logit contributions
Me-0.007
umm-0.007
year-0.007
 ad-0.007
cut-0.007
Token and
Token ID290
Feature activation-2.70e-01

Pos logit contributions
 our+0.007
 enough+0.006
 another+0.006
 your+0.005
 band+0.005
Neg logit contributions
Me-0.014
umm-0.013
cut-0.013
 ad-0.013
year-0.013
Token said
Token ID531
Feature activation-2.53e-01

Pos logit contributions
Me+0.011
cut+0.010
 ad+0.010
year+0.010
umm+0.009
Neg logit contributions
 our-0.011
 enough-0.010
 another-0.010
 ourselves-0.010
 your-0.009
Token "
Token ID366
Feature activation-1.10e-01

Pos logit contributions
Me+0.014
cut+0.013
 ad+0.013
year+0.013
umm+0.013
Neg logit contributions
 our-0.006
 enough-0.005
 another-0.005
 ourselves-0.005
 band-0.005
TokenWhat
Token ID2061
Feature activation+1.06e+00

Pos logit contributions
cut+0.003
umm+0.003
Me+0.003
 ad+0.003
year+0.003
Neg logit contributions
 another-0.005
 our-0.005
 enough-0.005
 band-0.005
 later-0.005
Token have
Token ID423
Feature activation+2.58e+00

Pos logit contributions
ounce+0.033
ig+0.032
 scar+0.031
 band+0.031
 our+0.031
Neg logit contributions
Me-0.049
 ad-0.044
cut-0.042
year-0.041
 redd-0.040
Token you
Token ID345
Feature activation+9.96e-01

Pos logit contributions
ounce+0.064
oppy+0.057
ig+0.055
 scar+0.055
 loop+0.053
Neg logit contributions
Me-0.133
 ad-0.124
umm-0.121
year-0.120
thank-0.119
Token done
Token ID1760
Feature activation+1.00e+00

Pos logit contributions
 our+0.036
 enough+0.034
 another+0.034
 band+0.032
 ourselves+0.032
Neg logit contributions
Me-0.043
 ad-0.039
cut-0.039
umm-0.039
year-0.039
Token,
Token ID11
Feature activation-3.54e-01

Pos logit contributions
 our+0.024
 another+0.022
 enough+0.022
 band+0.022
 loop+0.021
Neg logit contributions
Me-0.053
umm-0.051
 ad-0.051
cut-0.050
year-0.049
Token little
Token ID1310
Feature activation+2.84e-01

Pos logit contributions
Me+0.016
 ad+0.015
umm+0.015
cut+0.014
year+0.014
Neg logit contributions
 our-0.012
 enough-0.011
 ourselves-0.011
 another-0.011
 band-0.010
Token girl
Token ID2576
Feature activation-2.13e-01

Pos logit contributions
 our+0.009
 another+0.008
 ourselves+0.008
ounce+0.008
 enough+0.008
Neg logit contributions
Me-0.013
umm-0.012
cut-0.012
 ad-0.012
 redd-0.011
Token to
Token ID284
Feature activation+4.44e-01

Pos logit contributions
 our+0.009
 enough+0.008
 another+0.007
 band+0.007
ounce+0.007
Neg logit contributions
Me-0.019
cut-0.018
 ad-0.018
year-0.018
umm-0.017
Token him
Token ID683
Feature activation-5.17e-02

Pos logit contributions
 our+0.011
 enough+0.010
 another+0.010
 band+0.009
ounce+0.009
Neg logit contributions
Me-0.024
cut-0.023
umm-0.022
 ad-0.022
year-0.022
Token,
Token ID11
Feature activation+2.80e-01

Pos logit contributions
cut+0.003
Me+0.003
 ad+0.003
year+0.002
 capt+0.002
Neg logit contributions
 our-0.001
 ourselves-0.001
 band-0.001
 loop-0.001
 enough-0.001
Token "
Token ID366
Feature activation+9.18e-02

Pos logit contributions
 our+0.012
 enough+0.011
 another+0.011
 ourselves+0.011
 band+0.010
Neg logit contributions
Me-0.011
cut-0.010
 ad-0.010
year-0.009
umm-0.009
TokenWhat
Token ID2061
Feature activation+8.52e-01

Pos logit contributions
 our+0.004
 another+0.004
 enough+0.004
 band+0.004
 later+0.004
Neg logit contributions
Me-0.003
umm-0.002
cut-0.002
 ad-0.002
year-0.002
Token have
Token ID423
Feature activation+2.57e+00

Pos logit contributions
ounce+0.028
ig+0.026
 our+0.026
 scar+0.026
 band+0.025
Neg logit contributions
Me-0.039
 ad-0.036
year-0.034
cut-0.033
vous-0.033
Token you
Token ID345
Feature activation+8.38e-01

Pos logit contributions
ounce+0.066
 loop+0.060
 scar+0.060
ig+0.058
 band+0.058
Neg logit contributions
Me-0.129
year-0.120
 ad-0.120
Here-0.115
umm-0.115
Token got
Token ID1392
Feature activation+1.13e+00

Pos logit contributions
 our+0.031
 band+0.030
ounce+0.030
 another+0.030
 enough+0.030
Neg logit contributions
Me-0.032
 ad-0.032
year-0.031
cut-0.030
thank-0.029
Token there
Token ID612
Feature activation+5.69e-02

Pos logit contributions
 our+0.033
 another+0.030
 band+0.030
 loop+0.029
 enough+0.029
Neg logit contributions
Me-0.053
cut-0.052
year-0.051
 ad-0.051
umm-0.050
Token?"
Token ID1701
Feature activation+6.98e-01

Pos logit contributions
 our+0.001
 band+0.001
 loop+0.001
 enough+0.001
 another+0.001
Neg logit contributions
Me-0.003
cut-0.003
 ad-0.003
umm-0.003
year-0.003
Token The
Token ID383
Feature activation+4.34e-01

Pos logit contributions
 our+0.028
 enough+0.027
 another+0.025
 ourselves+0.025
 your+0.025
Neg logit contributions
Me-0.028
cut-0.026
umm-0.025
year-0.025
 ad-0.025
Token and
Token ID290
Feature activation-1.84e-01

Pos logit contributions
Me+0.032
umm+0.031
cut+0.030
 ad+0.030
year+0.030
Neg logit contributions
 our-0.017
 enough-0.015
 another-0.015
 your-0.014
 band-0.014
Token it
Token ID340
Feature activation-5.06e-01

Pos logit contributions
Me+0.008
umm+0.008
cut+0.008
year+0.008
 ad+0.007
Neg logit contributions
 our-0.006
 another-0.006
 enough-0.006
 your-0.006
 something-0.006
Token was
Token ID373
Feature activation-9.50e-01

Pos logit contributions
Me+0.019
umm+0.019
year+0.018
cut+0.018
 ad+0.017
Neg logit contributions
 another-0.021
 enough-0.020
 our-0.020
 band-0.019
 something-0.019
Token the
Token ID262
Feature activation-8.18e-01

Pos logit contributions
Me+0.045
umm+0.043
cut+0.043
year+0.041
 ad+0.040
Neg logit contributions
 enough-0.034
 our-0.034
 another-0.033
 something-0.031
 your-0.030
Token y
Token ID331
Feature activation-1.39e+00

Pos logit contributions
year+0.041
Me+0.038
rol+0.037
cut+0.037
 ad+0.036
Neg logit contributions
 enough-0.029
 another-0.028
 our-0.028
 something-0.026
 half-0.026
Tokenumm
Token ID13929
Feature activation-3.38e+00

Pos logit contributions
Me+0.049
umm+0.045
cut+0.045
 ad+0.043
year+0.043
Neg logit contributions
 our-0.062
 enough-0.060
 another-0.059
 your-0.056
 band-0.056
Tokeniest
Token ID6386
Feature activation-8.16e-01

Pos logit contributions
Me+0.102
 capt+0.093
 ad+0.082
 Everywhere+0.081
Here+0.081
Neg logit contributions
 enough-0.180
 later-0.171
 another-0.168
 our-0.166
 half-0.164
Token steak
Token ID26320
Feature activation-1.11e+00

Pos logit contributions
Me+0.045
cut+0.044
year+0.043
rol+0.042
 capt+0.042
Neg logit contributions
 enough-0.021
 another-0.021
 our-0.020
 half-0.018
 something-0.018
Token she
Token ID673
Feature activation-4.96e-01

Pos logit contributions
Me+0.058
year+0.055
umm+0.055
cut+0.055
Here+0.053
Neg logit contributions
 our-0.030
 enough-0.030
 another-0.029
 later-0.027
 something-0.026
Token ever
Token ID1683
Feature activation-2.64e-01

Pos logit contributions
cut+0.023
rol+0.023
umm+0.022
Me+0.020
bridge+0.020
Neg logit contributions
 another-0.019
 our-0.018
 too-0.018
 be-0.017
 enough-0.017
Token tasted
Token ID29187
Feature activation-6.81e-01

Pos logit contributions
Me+0.010
cut+0.009
umm+0.009
 ad+0.009
year+0.008
Neg logit contributions
 our-0.011
 enough-0.011
 another-0.011
 later-0.011
 your-0.010
Token and
Token ID290
Feature activation+2.63e-01

Pos logit contributions
 our+0.016
 appearances+0.014
oppy+0.014
 ourselves+0.014
ossible+0.014
Neg logit contributions
Me-0.028
 ad-0.027
cut-0.027
umm-0.026
 surrounding-0.026
Token it
Token ID340
Feature activation-2.20e-01

Pos logit contributions
 our+0.009
 enough+0.008
 another+0.008
 ourselves+0.008
ounce+0.008
Neg logit contributions
Me-0.012
 ad-0.011
cut-0.011
umm-0.011
year-0.011
Token was
Token ID373
Feature activation-6.49e-01

Pos logit contributions
Me+0.009
cut+0.008
umm+0.008
 ad+0.008
year+0.008
Neg logit contributions
 our-0.009
 enough-0.008
 another-0.008
 your-0.008
 ourselves-0.008
Token the
Token ID262
Feature activation-8.45e-01

Pos logit contributions
Me+0.031
umm+0.029
cut+0.029
year+0.028
 ad+0.028
Neg logit contributions
 our-0.022
 enough-0.021
 another-0.021
 your-0.019
 band-0.019
Token y
Token ID331
Feature activation-1.25e+00

Pos logit contributions
year+0.042
Me+0.038
cut+0.037
bridge+0.036
rol+0.035
Neg logit contributions
 enough-0.029
 another-0.029
 our-0.028
 band-0.027
 something-0.027
Tokenumm
Token ID13929
Feature activation-3.27e+00

Pos logit contributions
Me+0.041
umm+0.039
cut+0.036
 ad+0.035
year+0.035
Neg logit contributions
 our-0.059
 enough-0.056
 another-0.056
 your-0.054
 band-0.053
Tokeniest
Token ID6386
Feature activation-6.74e-01

Pos logit contributions
Me+0.104
 capt+0.092
 ad+0.085
Here+0.084
cut+0.084
Neg logit contributions
 enough-0.167
 later-0.157
 our-0.157
 another-0.154
 band-0.151
Token thing
Token ID1517
Feature activation-7.84e-01

Pos logit contributions
cut+0.031
year+0.030
Me+0.029
 capt+0.028
bridge+0.028
Neg logit contributions
 another-0.024
 enough-0.024
 our-0.023
 half-0.022
 too-0.022
Token in
Token ID287
Feature activation-8.16e-02

Pos logit contributions
Me+0.041
umm+0.039
cut+0.039
year+0.039
 ad+0.037
Neg logit contributions
 our-0.022
 enough-0.021
 another-0.021
 your-0.019
 something-0.018
Token the
Token ID262
Feature activation+3.29e-02

Pos logit contributions
Me+0.005
umm+0.004
 ad+0.004
cut+0.004
year+0.004
Neg logit contributions
 our-0.002
 enough-0.002
ounce-0.002
 another-0.002
 ourselves-0.002
Token world
Token ID995
Feature activation-3.22e-01

Pos logit contributions
 another+0.001
 our+0.001
 something+0.001
 enough+0.001
 your+0.001
Neg logit contributions
cut-0.002
year-0.001
 ad-0.001
Me-0.001
rol-0.001
Token and
Token ID290
Feature activation+8.49e-02

Pos logit contributions
Me+0.006
cut+0.006
umm+0.006
 ad+0.006
year+0.006
Neg logit contributions
 our-0.003
 enough-0.002
 another-0.002
 your-0.002
 band-0.002
Token it
Token ID340
Feature activation-4.54e-01

Pos logit contributions
 our+0.003
 enough+0.003
 another+0.003
 your+0.003
 band+0.003
Neg logit contributions
Me-0.003
cut-0.003
umm-0.003
year-0.003
 ad-0.003
Token was
Token ID373
Feature activation-6.93e-01

Pos logit contributions
Me+0.018
umm+0.017
cut+0.017
year+0.016
 ad+0.016
Neg logit contributions
 our-0.018
 enough-0.018
 another-0.018
 band-0.017
 your-0.016
Token the
Token ID262
Feature activation-9.72e-01

Pos logit contributions
Me+0.039
umm+0.039
cut+0.039
Here+0.036
year+0.036
Neg logit contributions
 our-0.019
 enough-0.019
 another-0.018
 your-0.016
 something-0.016
Token y
Token ID331
Feature activation-1.26e+00

Pos logit contributions
year+0.048
Me+0.045
cut+0.044
rol+0.042
 ad+0.041
Neg logit contributions
 our-0.033
 enough-0.032
 another-0.032
 band-0.030
 something-0.029
Tokenumm
Token ID13929
Feature activation-3.25e+00

Pos logit contributions
Me+0.043
umm+0.041
cut+0.039
 ad+0.038
year+0.038
Neg logit contributions
 our-0.057
 enough-0.054
 another-0.054
 your-0.052
 band-0.051
Tokeniest
Token ID6386
Feature activation-7.04e-01

Pos logit contributions
Me+0.096
 capt+0.084
Here+0.077
 Everywhere+0.076
 blonde+0.075
Neg logit contributions
 enough-0.175
 later-0.167
 our-0.164
 another-0.162
 band-0.158
Token dinner
Token ID8073
Feature activation-4.23e-01

Pos logit contributions
Me+0.037
cut+0.035
year+0.034
 ad+0.033
 capt+0.033
Neg logit contributions
 our-0.020
 enough-0.020
 another-0.020
 band-0.018
 something-0.017
Token they
Token ID484
Feature activation+5.69e-02

Pos logit contributions
Me+0.023
cut+0.022
year+0.022
umm+0.021
Here+0.021
Neg logit contributions
 our-0.011
 enough-0.010
 another-0.010
 later-0.009
 something-0.009
Token ever
Token ID1683
Feature activation-5.79e-02

Pos logit contributions
 our+0.002
 later+0.002
 another+0.002
 enough+0.002
 your+0.002
Neg logit contributions
umm-0.002
Me-0.002
cut-0.002
 redd-0.002
 capt-0.002
Token had
Token ID550
Feature activation-2.45e-01

Pos logit contributions
Me+0.002
cut+0.002
umm+0.002
 ad+0.002
year+0.002
Neg logit contributions
 our-0.002
 enough-0.002
 another-0.002
 later-0.002
 your-0.002
Token as
Token ID355
Feature activation-5.88e-01

Pos logit contributions
 our+0.005
 another+0.005
 enough+0.005
 your+0.005
 band+0.004
Neg logit contributions
Me-0.005
umm-0.005
cut-0.005
year-0.005
 ad-0.005
Token it
Token ID340
Feature activation-4.88e-02

Pos logit contributions
Me+0.033
cut+0.032
umm+0.032
year+0.030
 ad+0.030
Neg logit contributions
 our-0.015
 another-0.014
 enough-0.014
 your-0.013
 something-0.012
Token was
Token ID373
Feature activation-6.76e-01

Pos logit contributions
Me+0.002
umm+0.002
cut+0.002
year+0.002
 ad+0.002
Neg logit contributions
 our-0.002
 another-0.002
 enough-0.002
 band-0.002
 later-0.002
Token the
Token ID262
Feature activation-8.86e-01

Pos logit contributions
Me+0.035
umm+0.032
cut+0.032
year+0.031
Here+0.031
Neg logit contributions
 our-0.022
 enough-0.021
 another-0.020
 your-0.019
 something-0.018
Token y
Token ID331
Feature activation-1.15e+00

Pos logit contributions
year+0.045
rol+0.040
Me+0.040
cut+0.039
anned+0.038
Neg logit contributions
 another-0.031
 enough-0.030
 our-0.029
 half-0.028
 something-0.028
Tokenumm
Token ID13929
Feature activation-3.20e+00

Pos logit contributions
Me+0.036
umm+0.033
cut+0.032
 ad+0.031
year+0.031
Neg logit contributions
 our-0.057
 enough-0.054
 another-0.053
 your-0.051
 band-0.051
Tokeniest
Token ID6386
Feature activation-5.78e-01

Pos logit contributions
Me+0.099
 capt+0.085
 ad+0.079
Here+0.078
year+0.077
Neg logit contributions
 enough-0.169
 later-0.159
 our-0.157
 another-0.156
 half-0.152
Token cake
Token ID12187
Feature activation-9.84e-01

Pos logit contributions
Me+0.029
cut+0.028
year+0.028
 ad+0.026
 capt+0.026
Neg logit contributions
 enough-0.018
 another-0.018
 our-0.017
 band-0.016
 half-0.016
Token she
Token ID673
Feature activation-2.55e-01

Pos logit contributions
Me+0.053
year+0.050
cut+0.050
umm+0.049
Here+0.048
Neg logit contributions
 our-0.026
 enough-0.025
 another-0.024
 later-0.022
 your-0.022
Token had
Token ID550
Feature activation-2.44e-01

Pos logit contributions
cut+0.011
umm+0.011
rol+0.010
Me+0.010
 capt+0.009
Neg logit contributions
 our-0.010
 another-0.009
 your-0.009
 enough-0.009
 later-0.009
Token ever
Token ID1683
Feature activation-1.02e-01

Pos logit contributions
Me+0.010
umm+0.009
cut+0.009
year+0.009
Here+0.008
Neg logit contributions
 our-0.010
 enough-0.010
 another-0.010
 later-0.009
 your-0.009
Token adventure
Token ID8855
Feature activation-3.69e-01

Pos logit contributions
Me+0.019
 ad+0.017
cut+0.016
umm+0.016
spe+0.015
Neg logit contributions
 enough-0.018
ounce-0.017
 our-0.017
 another-0.017
ucks-0.017
Token and
Token ID290
Feature activation-1.58e-01

Pos logit contributions
Me+0.021
cut+0.020
 ad+0.020
umm+0.020
year+0.019
Neg logit contributions
 our-0.008
 enough-0.007
 another-0.007
 your-0.007
 band-0.006
Token found
Token ID1043
Feature activation-5.41e-01

Pos logit contributions
Me+0.007
cut+0.006
umm+0.006
 ad+0.006
year+0.006
Neg logit contributions
 our-0.006
 enough-0.006
 another-0.006
 your-0.005
 band-0.005
Token the
Token ID262
Feature activation-6.36e-01

Pos logit contributions
Me+0.033
umm+0.032
cut+0.031
 ad+0.031
year+0.031
Neg logit contributions
 our-0.011
 enough-0.010
 another-0.010
 your-0.008
 band-0.008
Token y
Token ID331
Feature activation-1.12e+00

Pos logit contributions
Me+0.031
year+0.030
umm+0.029
 ad+0.029
cut+0.029
Neg logit contributions
 our-0.020
 another-0.019
 enough-0.019
 band-0.018
 loop-0.017
Tokenumm
Token ID13929
Feature activation-3.13e+00

Pos logit contributions
umm+0.038
Me+0.029
 ad+0.025
cut+0.025
year+0.025
Neg logit contributions
 our-0.057
 another-0.054
 your-0.053
 enough-0.052
 something-0.052
Tokeniest
Token ID6386
Feature activation-5.75e-01

Pos logit contributions
Me+0.087
 capt+0.075
 ad+0.068
year+0.068
 blonde+0.066
Neg logit contributions
 enough-0.174
 our-0.165
 later-0.164
 another-0.163
 band-0.160
Token berries
Token ID36322
Feature activation-9.41e-01

Pos logit contributions
Me+0.030
year+0.028
cut+0.027
 ad+0.027
umm+0.027
Neg logit contributions
 our-0.017
 another-0.016
 enough-0.016
 band-0.015
 something-0.014
Token they
Token ID484
Feature activation+2.07e-01

Pos logit contributions
Me+0.050
umm+0.047
cut+0.046
year+0.045
 ad+0.045
Neg logit contributions
 our-0.026
 enough-0.024
 another-0.024
 your-0.022
 band-0.022
Token had
Token ID550
Feature activation-8.53e-02

Pos logit contributions
 our+0.009
 another+0.008
 enough+0.008
 later+0.008
 too+0.008
Neg logit contributions
umm-0.009
Me-0.008
cut-0.008
 redd-0.007
 capt-0.007
Token ever
Token ID1683
Feature activation-7.15e-02

Pos logit contributions
Me+0.003
umm+0.003
cut+0.003
year+0.003
 ad+0.003
Neg logit contributions
 our-0.004
 enough-0.003
 another-0.003
 your-0.003
 later-0.003
Token and
Token ID290
Feature activation-3.21e-01

Pos logit contributions
Me+0.063
umm+0.060
cut+0.060
year+0.059
 ad+0.059
Neg logit contributions
 our-0.020
 enough-0.018
 another-0.018
 band-0.016
 your-0.015
Token it
Token ID340
Feature activation-9.91e-02

Pos logit contributions
Me+0.014
cut+0.012
 ad+0.012
year+0.012
umm+0.012
Neg logit contributions
 our-0.012
 enough-0.011
 ourselves-0.011
ounce-0.011
 another-0.011
Token was
Token ID373
Feature activation-6.46e-01

Pos logit contributions
Me+0.004
umm+0.004
year+0.004
cut+0.004
 ad+0.004
Neg logit contributions
 our-0.004
 another-0.004
 enough-0.004
 band-0.003
 loop-0.003
Token the
Token ID262
Feature activation-9.11e-01

Pos logit contributions
Me+0.033
umm+0.032
cut+0.030
year+0.030
mer+0.029
Neg logit contributions
 our-0.022
 enough-0.022
 another-0.021
 your-0.020
 my-0.019
Token y
Token ID331
Feature activation-9.92e-01

Pos logit contributions
year+0.047
Me+0.043
 ad+0.039
rol+0.039
cut+0.039
Neg logit contributions
 enough-0.031
 our-0.031
 another-0.030
 band-0.030
 something-0.028
Tokenumm
Token ID13929
Feature activation-3.09e+00

Pos logit contributions
Me+0.023
umm+0.020
cut+0.020
 ad+0.019
year+0.019
Neg logit contributions
 our-0.057
 enough-0.055
 another-0.054
 your-0.052
 band-0.052
Tokeniest
Token ID6386
Feature activation-4.96e-01

Pos logit contributions
Me+0.109
 capt+0.102
 Everywhere+0.092
 blonde+0.091
Here+0.088
Neg logit contributions
 enough-0.155
 later-0.146
 our-0.139
 another-0.139
 band-0.137
Token ice
Token ID4771
Feature activation-9.34e-01

Pos logit contributions
Me+0.026
year+0.026
cut+0.025
 capt+0.024
 ad+0.024
Neg logit contributions
 enough-0.014
 another-0.014
 our-0.013
 band-0.013
 later-0.012
Token cream
Token ID8566
Feature activation-1.04e+00

Pos logit contributions
Me+0.041
year+0.039
 capt+0.038
cut+0.038
umm+0.038
Neg logit contributions
 our-0.034
 enough-0.032
 another-0.032
 later-0.031
 band-0.031
Token she
Token ID673
Feature activation-7.16e-02

Pos logit contributions
Me+0.056
year+0.054
cut+0.052
 capt+0.052
umm+0.051
Neg logit contributions
 enough-0.027
 our-0.027
 another-0.026
 later-0.024
 band-0.024
Token ever
Token ID1683
Feature activation+2.48e-02

Pos logit contributions
cut+0.003
umm+0.003
Me+0.003
rol+0.003
 capt+0.003
Neg logit contributions
 our-0.003
 another-0.003
 later-0.003
 your-0.003
 enough-0.003
Token said
Token ID531
Feature activation+1.08e-01

Pos logit contributions
 enough+0.017
 our+0.017
 another+0.016
 later+0.016
 band+0.016
Neg logit contributions
umm-0.020
Me-0.018
bridge-0.017
cut-0.016
Here-0.016
Token it
Token ID340
Feature activation+5.60e-01

Pos logit contributions
 our+0.002
 enough+0.002
 another+0.002
 loop+0.002
ounce+0.002
Neg logit contributions
Me-0.006
cut-0.006
 ad-0.006
umm-0.006
year-0.006
Token was
Token ID373
Feature activation-4.26e-01

Pos logit contributions
 our+0.025
 enough+0.024
 another+0.023
 your+0.023
 band+0.023
Neg logit contributions
Me-0.020
cut-0.018
umm-0.018
 ad-0.017
year-0.017
Token the
Token ID262
Feature activation-6.55e-01

Pos logit contributions
Me+0.020
cut+0.020
umm+0.019
year+0.019
Here+0.018
Neg logit contributions
 our-0.016
 enough-0.015
 another-0.015
 your-0.014
 something-0.014
Token y
Token ID331
Feature activation-9.49e-01

Pos logit contributions
year+0.033
Me+0.031
bridge+0.029
cut+0.028
rol+0.028
Neg logit contributions
 enough-0.023
 our-0.022
 another-0.022
 band-0.020
 too-0.020
Tokenumm
Token ID13929
Feature activation-3.02e+00

Pos logit contributions
Me+0.033
cut+0.030
 ad+0.029
year+0.029
umm+0.028
Neg logit contributions
 our-0.043
 enough-0.042
 another-0.041
 your-0.039
 band-0.039
Tokeniest
Token ID6386
Feature activation-2.90e-01

Pos logit contributions
Me+0.087
 capt+0.078
 blonde+0.069
Here+0.068
 Everywhere+0.067
Neg logit contributions
 enough-0.164
 later-0.157
 our-0.156
 another-0.152
 half-0.149
Token ice
Token ID4771
Feature activation-9.42e-01

Pos logit contributions
Me+0.015
year+0.015
cut+0.015
bridge+0.014
 capt+0.014
Neg logit contributions
 enough-0.008
 another-0.008
 our-0.008
 band-0.007
 later-0.007
Token-
Token ID12
Feature activation+1.89e-01

Pos logit contributions
Me+0.057
year+0.054
 capt+0.053
umm+0.053
cut+0.052
Neg logit contributions
 our-0.018
 enough-0.017
 another-0.016
 band-0.015
 later-0.015
Tokencream
Token ID36277
Feature activation-5.35e-01

Pos logit contributions
 our+0.007
 another+0.007
 enough+0.007
 later+0.007
 your+0.007
Neg logit contributions
Me-0.007
umm-0.007
 capt-0.007
 ad-0.007
 blonde-0.007
Token they
Token ID484
Feature activation+3.39e-01

Pos logit contributions
Me+0.027
year+0.025
cut+0.025
 ad+0.025
 capt+0.024
Neg logit contributions
 our-0.016
 enough-0.015
 another-0.015
 band-0.014
 later-0.014
Token store
Token ID3650
Feature activation-5.34e-01

Pos logit contributions
Me+0.012
umm+0.011
cut+0.011
 ad+0.011
year+0.011
Neg logit contributions
 our-0.010
 enough-0.010
 another-0.010
 your-0.009
 ourselves-0.009
Token and
Token ID290
Feature activation+1.55e-01

Pos logit contributions
Me+0.028
 ad+0.028
cut+0.028
year+0.027
thank+0.026
Neg logit contributions
 our-0.013
oppy-0.011
 your-0.011
 ourselves-0.011
ig-0.011
Token got
Token ID1392
Feature activation-1.92e-01

Pos logit contributions
 our+0.006
 enough+0.006
 another+0.006
 your+0.005
 band+0.005
Neg logit contributions
Me-0.006
umm-0.006
cut-0.006
 ad-0.006
year-0.006
Token the
Token ID262
Feature activation-4.32e-01

Pos logit contributions
umm+0.011
Me+0.011
cut+0.010
 ad+0.010
year+0.010
Neg logit contributions
 our-0.005
 enough-0.005
 another-0.005
 your-0.004
 something-0.004
Token y
Token ID331
Feature activation-1.01e+00

Pos logit contributions
year+0.023
Me+0.022
umm+0.022
 ad+0.022
 capt+0.022
Neg logit contributions
 another-0.013
 our-0.013
 enough-0.012
 band-0.012
 half-0.011
Tokenumm
Token ID13929
Feature activation-2.94e+00

Pos logit contributions
umm+0.035
Me+0.032
 ad+0.029
year+0.029
cut+0.028
Neg logit contributions
 our-0.047
 another-0.044
 enough-0.043
 loop-0.042
 your-0.042
Tokeniest
Token ID6386
Feature activation-3.82e-01

Pos logit contributions
Me+0.077
 capt+0.064
 ad+0.060
year+0.058
cut+0.058
Neg logit contributions
 enough-0.167
 our-0.162
 another-0.157
 later-0.157
 band-0.154
Token ice
Token ID4771
Feature activation-1.43e+00

Pos logit contributions
Me+0.021
cut+0.020
year+0.020
umm+0.019
 ad+0.019
Neg logit contributions
 another-0.010
 enough-0.010
 our-0.010
 band-0.009
 something-0.009
Token cream
Token ID8566
Feature activation-1.56e+00

Pos logit contributions
Me+0.062
year+0.056
cut+0.056
umm+0.056
 capt+0.056
Neg logit contributions
 our-0.054
 enough-0.052
 another-0.051
 later-0.049
 band-0.049
Token they
Token ID484
Feature activation-5.87e-02

Pos logit contributions
Me+0.086
year+0.081
cut+0.079
umm+0.079
 ad+0.078
Neg logit contributions
 our-0.040
 enough-0.039
 another-0.038
 band-0.034
 later-0.034
Token could
Token ID714
Feature activation-4.25e-01

Pos logit contributions
umm+0.003
Me+0.002
cut+0.002
 capt+0.002
 redd+0.002
Neg logit contributions
 our-0.002
 later-0.002
 another-0.002
 enough-0.002
 too-0.002
Token!
Token ID0
Feature activation+3.01e-01

Pos logit contributions
Me+0.032
cut+0.030
umm+0.030
 ad+0.030
year+0.029
Neg logit contributions
 our-0.017
 enough-0.015
 another-0.015
 your-0.014
 band-0.014
Token It
Token ID632
Feature activation-1.92e-01

Pos logit contributions
 our+0.012
 enough+0.011
 another+0.011
 your+0.010
 band+0.010
Neg logit contributions
Me-0.013
cut-0.012
umm-0.012
 ad-0.012
year-0.011
Token was
Token ID373
Feature activation-2.48e-02

Pos logit contributions
Me+0.007
cut+0.006
umm+0.006
 ad+0.006
year+0.006
Neg logit contributions
 our-0.009
 enough-0.008
 another-0.008
 your-0.008
 band-0.008
Token the
Token ID262
Feature activation-6.71e-01

Pos logit contributions
Me+0.001
umm+0.001
cut+0.001
year+0.001
 ad+0.001
Neg logit contributions
 our-0.001
 enough-0.001
 another-0.001
 something-0.001
 your-0.001
Token y
Token ID331
Feature activation-9.63e-01

Pos logit contributions
year+0.031
Me+0.029
rol+0.028
cut+0.027
 ad+0.027
Neg logit contributions
 enough-0.025
 our-0.025
 another-0.024
 something-0.023
 band-0.022
Tokenumm
Token ID13929
Feature activation-2.92e+00

Pos logit contributions
Me+0.025
umm+0.022
cut+0.021
 ad+0.021
year+0.020
Neg logit contributions
 our-0.052
 enough-0.050
 another-0.050
 your-0.048
 band-0.048
Tokeniest
Token ID6386
Feature activation-4.94e-01

Pos logit contributions
Me+0.082
 capt+0.072
 ad+0.067
 blonde+0.064
cut+0.064
Neg logit contributions
 enough-0.158
 our-0.151
 later-0.151
 another-0.147
 band-0.143
Token stew
Token ID20798
Feature activation-7.89e-01

Pos logit contributions
Me+0.025
cut+0.023
year+0.023
 ad+0.022
umm+0.022
Neg logit contributions
 our-0.015
 enough-0.015
 another-0.015
 something-0.013
 band-0.013
Token ever
Token ID1683
Feature activation+5.20e-01

Pos logit contributions
Me+0.040
year+0.037
umm+0.037
cut+0.037
Here+0.036
Neg logit contributions
 our-0.024
 enough-0.022
 another-0.021
 later-0.020
 your-0.020
Token!
Token ID0
Feature activation+1.38e-01

Pos logit contributions
 our+0.013
 enough+0.012
 another+0.011
 your+0.010
 later+0.010
Neg logit contributions
Me-0.030
cut-0.028
umm-0.028
year-0.027
 ad-0.027
Token
Token ID198
Feature activation+2.00e-01

Pos logit contributions
 our+0.005
 another+0.005
 enough+0.005
 your+0.004
 band+0.004
Neg logit contributions
Me-0.006
cut-0.006
umm-0.006
 ad-0.006
year-0.006
Token magic
Token ID5536
Feature activation-3.27e-01

Pos logit contributions
Me+0.028
year+0.026
umm+0.026
cut+0.026
 ad+0.026
Neg logit contributions
 our-0.020
 enough-0.020
 another-0.019
 band-0.018
 your-0.018
Token apple
Token ID17180
Feature activation-4.88e-01

Pos logit contributions
Me+0.015
cut+0.014
umm+0.014
 ad+0.013
year+0.013
Neg logit contributions
 our-0.012
 enough-0.011
 another-0.011
 your-0.010
 band-0.010
Token was
Token ID373
Feature activation-2.68e-01

Pos logit contributions
Me+0.017
cut+0.015
 ad+0.015
umm+0.015
year+0.014
Neg logit contributions
 our-0.022
 enough-0.021
 another-0.021
 your-0.020
 ourselves-0.020
Token the
Token ID262
Feature activation-7.49e-01

Pos logit contributions
Me+0.014
cut+0.013
 ad+0.013
umm+0.013
year+0.013
Neg logit contributions
 our-0.007
 enough-0.007
 another-0.007
 band-0.006
 ourselves-0.006
Token y
Token ID331
Feature activation-7.83e-01

Pos logit contributions
Me+0.034
year+0.033
cut+0.031
umm+0.031
 ad+0.030
Neg logit contributions
 our-0.027
 enough-0.026
 another-0.026
 band-0.024
 your-0.023
Tokenumm
Token ID13929
Feature activation-2.90e+00

Pos logit contributions
umm+0.015
Me+0.012
cut+0.009
year+0.009
 ad+0.009
Neg logit contributions
 our-0.049
 another-0.047
 enough-0.046
 your-0.046
 band-0.046
Tokeniest
Token ID6386
Feature activation-4.64e-01

Pos logit contributions
Me+0.081
 capt+0.068
year+0.064
 ad+0.063
Here+0.061
Neg logit contributions
 enough-0.161
 our-0.153
 later-0.151
 another-0.150
 band-0.145
Token thing
Token ID1517
Feature activation-3.21e-01

Pos logit contributions
Me+0.024
cut+0.023
year+0.023
umm+0.022
 ad+0.022
Neg logit contributions
 our-0.013
 enough-0.013
 another-0.012
 band-0.011
 your-0.011
Token they
Token ID484
Feature activation+5.21e-01

Pos logit contributions
Me+0.017
umm+0.016
cut+0.016
year+0.015
 ad+0.015
Neg logit contributions
 our-0.009
 enough-0.008
 another-0.008
 your-0.008
 band-0.008
Token ever
Token ID1683
Feature activation+2.19e-01

Pos logit contributions
 our+0.023
 another+0.021
 enough+0.021
 later+0.021
 your+0.020
Neg logit contributions
Me-0.020
umm-0.020
cut-0.018
year-0.017
 redd-0.017
Token tasted
Token ID29187
Feature activation-5.30e-04

Pos logit contributions
 our+0.010
 enough+0.009
 another+0.009
 your+0.009
 band+0.009
Neg logit contributions
Me-0.007
umm-0.006
year-0.006
 ad-0.006
thank-0.006
Token.
Token ID13
Feature activation+4.06e-01

Pos logit contributions
 our+0.012
 enough+0.010
 band+0.010
 another+0.010
 your+0.010
Neg logit contributions
Me-0.018
umm-0.017
 ad-0.017
cut-0.017
year-0.017
Token It
Token ID632
Feature activation-1.95e-01

Pos logit contributions
 our+0.014
 another+0.014
 enough+0.013
 your+0.013
 something+0.013
Neg logit contributions
cut-0.018
Me-0.017
 ad-0.017
umm-0.017
year-0.016
Token was
Token ID373
Feature activation-2.84e-02

Pos logit contributions
umm+0.008
Me+0.008
cut+0.007
year+0.007
 ad+0.007
Neg logit contributions
 another-0.007
 our-0.007
 enough-0.007
 band-0.007
 loop-0.007
Token the
Token ID262
Feature activation-5.41e-01

Pos logit contributions
Me+0.001
umm+0.001
cut+0.001
year+0.001
mer+0.001
Neg logit contributions
 enough-0.001
 our-0.001
 another-0.001
 something-0.001
 too-0.001
Token y
Token ID331
Feature activation-9.16e-01

Pos logit contributions
year+0.023
Me+0.022
rol+0.021
cut+0.020
 ad+0.020
Neg logit contributions
 another-0.022
 our-0.022
 enough-0.021
 something-0.020
 brother-0.020
Tokenumm
Token ID13929
Feature activation-2.90e+00

Pos logit contributions
Me+0.028
umm+0.027
cut+0.025
 ad+0.025
year+0.024
Neg logit contributions
 our-0.045
 enough-0.043
 another-0.043
 your-0.041
 band-0.041
Tokeniest
Token ID6386
Feature activation-3.85e-01

Pos logit contributions
Me+0.079
 capt+0.071
 ad+0.063
year+0.063
Here+0.063
Neg logit contributions
 enough-0.160
 our-0.153
 later-0.151
 another-0.150
 half-0.146
Token thing
Token ID1517
Feature activation-4.69e-01

Pos logit contributions
Me+0.019
year+0.018
cut+0.018
 ad+0.017
umm+0.017
Neg logit contributions
 our-0.012
 enough-0.012
 another-0.012
 band-0.010
 your-0.010
Token she
Token ID673
Feature activation-2.89e-02

Pos logit contributions
Me+0.023
umm+0.022
cut+0.022
year+0.021
 ad+0.020
Neg logit contributions
 our-0.015
 enough-0.014
 another-0.014
 your-0.013
 something-0.013
Token ever
Token ID1683
Feature activation+2.44e-01

Pos logit contributions
cut+0.001
umm+0.001
rol+0.001
Me+0.001
bridge+0.001
Neg logit contributions
 our-0.001
 another-0.001
 too-0.001
 be-0.001
 enough-0.001
Token tasted
Token ID29187
Feature activation-1.85e-01

Pos logit contributions
 our+0.010
 enough+0.010
 another+0.010
 later+0.009
 your+0.009
Neg logit contributions
Me-0.009
cut-0.009
umm-0.009
 ad-0.008
year-0.008
Token
Token ID241
Feature activation+7.63e-02

Pos logit contributions
 our+0.000
 enough+0.000
 another+0.000
 your+0.000
 ourselves+0.000
Neg logit contributions
Me-0.000
umm-0.000
cut-0.000
year-0.000
 ad-0.000
TokenThis
Token ID1212
Feature activation+5.02e-01

Pos logit contributions
 another+0.003
 something+0.003
 our+0.003
 enough+0.003
 your+0.003
Neg logit contributions
cut-0.003
umm-0.003
jack-0.003
igator-0.003
atively-0.003
Token is
Token ID318
Feature activation+4.08e-01

Pos logit contributions
 our+0.014
 enough+0.013
 another+0.013
 your+0.012
 band+0.011
Neg logit contributions
Me-0.026
umm-0.025
cut-0.025
year-0.025
 ad-0.025
Token the
Token ID262
Feature activation-4.73e-01

Pos logit contributions
 our+0.011
 enough+0.010
 another+0.010
 your+0.010
 something+0.009
Neg logit contributions
Me-0.022
cut-0.022
umm-0.022
year-0.021
 ad-0.021
Token y
Token ID331
Feature activation-8.81e-01

Pos logit contributions
year+0.024
Me+0.022
cut+0.022
 ad+0.021
 capt+0.021
Neg logit contributions
 our-0.016
 enough-0.016
 another-0.015
 something-0.014
 half-0.014
Tokenumm
Token ID13929
Feature activation-2.87e+00

Pos logit contributions
Me+0.030
umm+0.028
cut+0.027
 ad+0.026
year+0.026
Neg logit contributions
 our-0.041
 enough-0.039
 another-0.039
 your-0.037
 band-0.037
Tokeniest
Token ID6386
Feature activation-2.64e-01

Pos logit contributions
Me+0.073
 capt+0.061
Here+0.058
year+0.057
cut+0.057
Neg logit contributions
 enough-0.161
 our-0.158
 later-0.154
 another-0.152
 band-0.149
Token surprise
Token ID5975
Feature activation-7.15e-01

Pos logit contributions
Me+0.013
cut+0.013
year+0.013
 ad+0.012
umm+0.012
Neg logit contributions
 our-0.008
 enough-0.008
 another-0.008
 your-0.007
 band-0.007
Token ever
Token ID1683
Feature activation+4.77e-01

Pos logit contributions
Me+0.040
cut+0.038
umm+0.038
year+0.037
 ad+0.037
Neg logit contributions
 our-0.018
 enough-0.016
 another-0.015
 your-0.014
 band-0.014
Token!
Token ID0
Feature activation+4.10e-01

Pos logit contributions
 our+0.012
 enough+0.011
 another+0.010
 your+0.010
 ourselves+0.009
Neg logit contributions
Me-0.027
cut-0.025
umm-0.025
 ad-0.025
year-0.025
Tokenâ
Token ID22940
Feature activation+5.40e-01

Pos logit contributions
 our+0.015
 another+0.014
 enough+0.014
 your+0.013
 my+0.013
Neg logit contributions
Me-0.018
cut-0.017
 ad-0.017
umm-0.017
year-0.016
Token the
Token ID262
Feature activation-6.09e-01

Pos logit contributions
Me+0.033
 ad+0.030
cut+0.030
year+0.030
umm+0.030
Neg logit contributions
 our-0.023
 enough-0.021
 another-0.021
 ourselves-0.020
 band-0.020
Token music
Token ID2647
Feature activation-6.80e-01

Pos logit contributions
Me+0.034
cut+0.032
 ad+0.032
umm+0.031
thank+0.030
Neg logit contributions
 our-0.015
 enough-0.014
 another-0.013
oppy-0.012
 ourselves-0.012
Token was
Token ID373
Feature activation-2.62e-01

Pos logit contributions
Me+0.026
umm+0.024
cut+0.024
year+0.023
 ad+0.023
Neg logit contributions
 our-0.028
 enough-0.027
 another-0.027
 band-0.026
 your-0.025
Token even
Token ID772
Feature activation-3.80e-01

Pos logit contributions
Me+0.013
cut+0.012
umm+0.012
 ad+0.012
year+0.012
Neg logit contributions
 our-0.008
 enough-0.007
 another-0.007
 your-0.007
 band-0.007
Token y
Token ID331
Feature activation-6.41e-01

Pos logit contributions
Me+0.022
cut+0.021
umm+0.021
 ad+0.021
year+0.020
Neg logit contributions
 our-0.008
 enough-0.007
 another-0.007
 ourselves-0.006
 your-0.006
Tokenumm
Token ID13929
Feature activation-2.84e+00

Pos logit contributions
Me+0.024
umm+0.023
cut+0.022
year+0.021
 ad+0.021
Neg logit contributions
 our-0.027
 enough-0.025
 another-0.025
 your-0.024
 band-0.024
Tokenier
Token ID959
Feature activation-5.01e-01

Pos logit contributions
Me+0.124
 capt+0.107
cut+0.106
 ad+0.105
Here+0.104
Neg logit contributions
 enough-0.113
 our-0.105
 another-0.103
 later-0.102
 band-0.099
Token than
Token ID621
Feature activation-1.53e-01

Pos logit contributions
Me+0.026
umm+0.024
cut+0.024
 ad+0.023
year+0.023
Neg logit contributions
 our-0.015
 enough-0.014
 another-0.013
 your-0.012
 band-0.012
Token before
Token ID878
Feature activation-1.84e-01

Pos logit contributions
Me+0.007
cut+0.007
umm+0.007
 ad+0.007
year+0.007
Neg logit contributions
 our-0.005
 enough-0.005
 another-0.004
 your-0.004
 band-0.004
Token.
Token ID13
Feature activation-3.07e-01

Pos logit contributions
Me+0.008
 ad+0.007
umm+0.007
 capt+0.007
cut+0.007
Neg logit contributions
 our-0.006
 enough-0.006
oppy-0.006
 band-0.006
ounce-0.006
Token<|endoftext|>
Token ID50256
Feature activation+2.77e-02

Pos logit contributions
cut+0.015
Me+0.014
umm+0.014
 ad+0.014
year+0.014
Neg logit contributions
 our-0.010
 another-0.009
 enough-0.009
 your-0.009
 Sam-0.008
Token.
Token ID13
Feature activation+2.47e-01

Pos logit contributions
Me+0.006
cut+0.006
umm+0.006
 ad+0.006
year+0.006
Neg logit contributions
 our-0.003
 ourselves-0.003
 appearances-0.003
 your-0.003
ounce-0.003
Token It
Token ID632
Feature activation+3.89e-02

Pos logit contributions
 our+0.008
 enough+0.008
 another+0.008
 your+0.007
 band+0.007
Neg logit contributions
Me-0.012
cut-0.011
umm-0.011
 ad-0.011
year-0.011
Token was
Token ID373
Feature activation+4.91e-02

Pos logit contributions
 our+0.002
 another+0.001
 enough+0.001
 band+0.001
 your+0.001
Neg logit contributions
Me-0.002
umm-0.002
year-0.001
cut-0.001
 ad-0.001
Token the
Token ID262
Feature activation-5.09e-01

Pos logit contributions
 our+0.002
 enough+0.002
 another+0.002
 your+0.002
 too+0.002
Neg logit contributions
Me-0.002
umm-0.002
cut-0.002
year-0.002
 ad-0.002
Token y
Token ID331
Feature activation-7.87e-01

Pos logit contributions
year+0.024
Me+0.023
 ad+0.022
cut+0.021
rol+0.021
Neg logit contributions
 our-0.017
 another-0.017
 enough-0.017
 band-0.016
 brother-0.016
Tokenumm
Token ID13929
Feature activation-2.77e+00

Pos logit contributions
umm+0.025
Me+0.024
cut+0.021
year+0.021
 ad+0.021
Neg logit contributions
 our-0.038
 enough-0.036
 another-0.036
 your-0.035
 band-0.035
Tokeniest
Token ID6386
Feature activation-2.10e-01

Pos logit contributions
Me+0.079
 capt+0.073
 ad+0.063
 Everywhere+0.062
Here+0.062
Neg logit contributions
 enough-0.151
 later-0.144
 our-0.142
 another-0.141
 half-0.138
Token thing
Token ID1517
Feature activation-1.60e-01

Pos logit contributions
Me+0.011
cut+0.010
year+0.010
 ad+0.010
umm+0.010
Neg logit contributions
 our-0.006
 another-0.006
 enough-0.006
 band-0.005
 later-0.005
Token he
Token ID339
Feature activation+2.13e-01

Pos logit contributions
Me+0.008
umm+0.007
cut+0.007
year+0.007
 ad+0.007
Neg logit contributions
 our-0.005
 enough-0.005
 another-0.005
 your-0.004
 band-0.004
Token ever
Token ID1683
Feature activation+3.62e-01

Pos logit contributions
 another+0.009
 our+0.008
 enough+0.008
 too+0.008
 later+0.008
Neg logit contributions
umm-0.009
cut-0.009
Me-0.008
rol-0.008
bridge-0.007
Token tasted
Token ID29187
Feature activation+1.46e-01

Pos logit contributions
 our+0.016
 enough+0.016
 another+0.016
 later+0.015
 your+0.014
Neg logit contributions
Me-0.013
cut-0.012
umm-0.012
 ad-0.012
year-0.011
Token little
Token ID1310
Feature activation-7.97e-02

Pos logit contributions
rol+0.017
year+0.016
umm+0.016
cut+0.016
 ad+0.016
Neg logit contributions
 another-0.008
 band-0.008
 brother-0.008
 bat-0.007
 something-0.007
Token boy
Token ID2933
Feature activation-7.27e-01

Pos logit contributions
Me+0.003
umm+0.003
cut+0.003
 ad+0.002
year+0.002
Neg logit contributions
 our-0.004
 enough-0.003
 ourselves-0.003
 your-0.003
 another-0.003
Token had
Token ID550
Feature activation-7.81e-02

Pos logit contributions
Me+0.032
cut+0.029
 ad+0.029
umm+0.029
year+0.029
Neg logit contributions
 our-0.026
 enough-0.024
 another-0.024
 ourselves-0.023
 your-0.023
Token the
Token ID262
Feature activation-5.81e-01

Pos logit contributions
Me+0.005
 ad+0.004
cut+0.004
umm+0.004
year+0.004
Neg logit contributions
 our-0.001
 enough-0.001
 another-0.001
ounce-0.001
 ourselves-0.001
Token y
Token ID331
Feature activation-7.44e-01

Pos logit contributions
year+0.032
Me+0.031
cut+0.031
umm+0.030
 ad+0.030
Neg logit contributions
 another-0.016
 enough-0.016
 our-0.016
 band-0.014
 something-0.014
Tokenumm
Token ID13929
Feature activation-2.76e+00

Pos logit contributions
umm+0.027
Me+0.027
 ad+0.024
cut+0.024
year+0.024
Neg logit contributions
 our-0.032
 another-0.030
 enough-0.030
 loop-0.029
 band-0.029
Tokeniest
Token ID6386
Feature activation-2.78e-01

Pos logit contributions
Me+0.079
 capt+0.068
 ad+0.060
Here+0.059
cut+0.059
Neg logit contributions
 enough-0.155
 later-0.146
 our-0.144
 another-0.143
 band-0.140
Token treat
Token ID2190
Feature activation-2.73e-01

Pos logit contributions
Me+0.015
cut+0.015
year+0.015
umm+0.014
 capt+0.014
Neg logit contributions
 enough-0.008
 another-0.007
 our-0.007
 band-0.007
 later-0.007
Token.
Token ID13
Feature activation+3.01e-01

Pos logit contributions
Me+0.014
umm+0.013
cut+0.013
year+0.013
 ad+0.013
Neg logit contributions
 our-0.008
 enough-0.008
 another-0.007
 your-0.007
 band-0.007
Token He
Token ID679
Feature activation-5.16e-02

Pos logit contributions
 our+0.013
 another+0.013
 enough+0.013
 your+0.012
 something+0.012
Neg logit contributions
Me-0.011
cut-0.010
 ad-0.010
year-0.010
umm-0.010
Token asked
Token ID1965
Feature activation+4.26e-01

Pos logit contributions
Me+0.002
umm+0.002
cut+0.002
year+0.002
 ad+0.002
Neg logit contributions
 our-0.002
 enough-0.002
 another-0.002
 band-0.002
 your-0.002
Token the
Token ID262
Feature activation-2.36e-01

Pos logit contributions
 our+0.016
 another+0.014
 enough+0.014
 your+0.013
 something+0.012
Neg logit contributions
Me-0.042
umm-0.041
cut-0.040
 ad-0.040
year-0.040
Token bear
Token ID6842
Feature activation-1.02e-01

Pos logit contributions
year+0.014
rol+0.014
spe+0.014
 ad+0.014
Me+0.014
Neg logit contributions
 another-0.005
 band-0.005
 brother-0.004
 our-0.004
 too-0.004
Token found
Token ID1043
Feature activation-1.39e-01

Pos logit contributions
Me+0.004
cut+0.003
umm+0.003
 ad+0.003
year+0.003
Neg logit contributions
 our-0.004
 enough-0.004
 another-0.004
 ourselves-0.004
 your-0.004
Token the
Token ID262
Feature activation-1.32e-01

Pos logit contributions
Me+0.010
umm+0.009
cut+0.009
 ad+0.009
year+0.009
Neg logit contributions
 our-0.002
 enough-0.001
 another-0.001
 your-0.001
 band-0.001
Token y
Token ID331
Feature activation-7.70e-01

Pos logit contributions
year+0.006
Me+0.006
 ad+0.006
cut+0.006
umm+0.006
Neg logit contributions
 another-0.004
 our-0.004
 enough-0.004
 band-0.004
 half-0.004
Tokenumm
Token ID13929
Feature activation-2.75e+00

Pos logit contributions
umm+0.026
Me+0.021
thank+0.017
 ad+0.017
year+0.017
Neg logit contributions
 our-0.039
 another-0.037
 loop-0.036
 appearances-0.035
 enough-0.035
Tokeniest
Token ID6386
Feature activation-1.95e-01

Pos logit contributions
Me+0.071
 capt+0.066
 ad+0.058
year+0.055
 blonde+0.054
Neg logit contributions
 enough-0.154
 our-0.149
 later-0.148
 another-0.146
 band-0.143
Token cookie
Token ID19751
Feature activation-1.14e-01

Pos logit contributions
Me+0.010
cut+0.010
year+0.010
umm+0.010
 ad+0.010
Neg logit contributions
 our-0.005
 enough-0.005
 another-0.005
 band-0.005
 your-0.004
Token ever
Token ID1683
Feature activation+1.12e-01

Pos logit contributions
Me+0.005
umm+0.005
year+0.005
cut+0.005
 ad+0.005
Neg logit contributions
 our-0.004
 enough-0.004
 another-0.004
 later-0.003
 your-0.003
Token.
Token ID13
Feature activation+6.74e-01

Pos logit contributions
 our+0.003
 enough+0.003
 another+0.003
 your+0.002
 band+0.002
Neg logit contributions
Me-0.006
umm-0.006
cut-0.006
 ad-0.006
year-0.006
Token He
Token ID679
Feature activation+1.86e-02

Pos logit contributions
 our+0.031
 enough+0.029
 another+0.029
 your+0.028
 band+0.027
Neg logit contributions
Me-0.023
cut-0.021
 ad-0.021
umm-0.021
year-0.021
Token made
Token ID925
Feature activation-2.47e-01

Pos logit contributions
Me+0.019
umm+0.018
cut+0.017
year+0.017
 ad+0.016
Neg logit contributions
 our-0.021
 enough-0.020
 another-0.019
 your-0.019
 band-0.018
Token him
Token ID683
Feature activation-8.24e-02

Pos logit contributions
Me+0.014
cut+0.013
 ad+0.013
year+0.013
umm+0.013
Neg logit contributions
 our-0.005
 enough-0.005
 another-0.005
 ourselves-0.004
ig-0.004
Token an
Token ID281
Feature activation+3.73e-01

Pos logit contributions
Me+0.003
 ad+0.003
cut+0.003
 capt+0.003
thank+0.003
Neg logit contributions
 our-0.003
 enough-0.003
 ourselves-0.003
ig-0.003
 appearances-0.003
Token even
Token ID772
Feature activation-2.52e-01

Pos logit contributions
 our+0.015
 enough+0.015
 another+0.015
 your+0.014
 band+0.014
Neg logit contributions
Me-0.015
umm-0.013
cut-0.013
year-0.013
 ad-0.013
Token y
Token ID331
Feature activation-3.64e-01

Pos logit contributions
Me+0.015
umm+0.015
cut+0.014
 ad+0.014
year+0.014
Neg logit contributions
 our-0.005
 another-0.005
 enough-0.005
 band-0.004
 your-0.004
Tokenumm
Token ID13929
Feature activation-2.63e+00

Pos logit contributions
umm+0.013
Me+0.012
cut+0.011
 ad+0.011
year+0.011
Neg logit contributions
 our-0.017
 another-0.016
 enough-0.015
 band-0.015
 your-0.015
Tokenier
Token ID959
Feature activation+1.13e-01

Pos logit contributions
Me+0.093
 capt+0.076
Here+0.074
 Everywhere+0.070
 blonde+0.070
Neg logit contributions
 enough-0.136
 our-0.126
 later-0.125
ig-0.122
 another-0.121
Token snack
Token ID26906
Feature activation+8.26e-01

Pos logit contributions
 our+0.003
 enough+0.003
 another+0.003
 your+0.003
 band+0.003
Neg logit contributions
Me-0.006
year-0.006
umm-0.006
Here-0.005
cut-0.005
Token and
Token ID290
Feature activation+1.20e-01

Pos logit contributions
 our+0.021
 enough+0.019
 another+0.019
 your+0.017
 band+0.017
Neg logit contributions
Me-0.045
umm-0.042
cut-0.042
 ad-0.042
year-0.041
Token they
Token ID484
Feature activation+2.87e-02

Pos logit contributions
 our+0.004
 enough+0.004
 another+0.004
 your+0.003
 band+0.003
Neg logit contributions
Me-0.006
umm-0.006
cut-0.005
year-0.005
 ad-0.005
Token both
Token ID1111
Feature activation+1.29e-01

Pos logit contributions
 our+0.001
 enough+0.001
 another+0.001
 band+0.001
 later+0.001
Neg logit contributions
Me-0.001
umm-0.001
year-0.001
cut-0.001
Here-0.001
Token oil
Token ID3056
Feature activation-2.70e-02

Pos logit contributions
 our+0.011
 enough+0.010
 another+0.010
 band+0.009
 your+0.009
Neg logit contributions
Me-0.031
cut-0.030
umm-0.029
 ad-0.029
year-0.029
Token,
Token ID11
Feature activation+4.47e-01

Pos logit contributions
Me+0.001
cut+0.001
umm+0.001
 ad+0.001
year+0.001
Neg logit contributions
 our-0.001
 another-0.001
 enough-0.001
 band-0.001
 your-0.001
Token and
Token ID290
Feature activation+7.06e-01

Pos logit contributions
 our+0.009
 enough+0.008
 another+0.008
 band+0.007
ounce+0.007
Neg logit contributions
Me-0.026
cut-0.025
umm-0.025
 ad-0.024
year-0.024
Token the
Token ID262
Feature activation-3.39e-01

Pos logit contributions
 our+0.022
 enough+0.020
 another+0.020
ounce+0.019
 band+0.019
Neg logit contributions
Me-0.034
umm-0.032
cut-0.032
 ad-0.031
year-0.031
Token y
Token ID331
Feature activation-6.86e-01

Pos logit contributions
year+0.017
Me+0.016
cut+0.016
 ad+0.016
rol+0.016
Neg logit contributions
 our-0.010
 another-0.010
 enough-0.010
 something-0.010
 brother-0.009
Tokenumm
Token ID13929
Feature activation-2.62e+00

Pos logit contributions
umm+0.013
Me+0.011
cut+0.008
 ad+0.008
year+0.008
Neg logit contributions
 our-0.043
 another-0.041
 enough-0.041
 band-0.040
 loop-0.040
Tokeniest
Token ID6386
Feature activation-2.09e-01

Pos logit contributions
Me+0.072
 capt+0.063
 ad+0.057
year+0.057
cut+0.055
Neg logit contributions
 enough-0.145
 our-0.137
 later-0.137
 another-0.135
 half-0.131
Token spaghetti
Token ID48241
Feature activation-5.94e-01

Pos logit contributions
Me+0.010
cut+0.010
year+0.010
 ad+0.010
umm+0.009
Neg logit contributions
 our-0.006
 enough-0.006
 another-0.006
 something-0.006
 band-0.006
Token ever
Token ID1683
Feature activation+3.65e-02

Pos logit contributions
Me+0.029
year+0.027
cut+0.027
umm+0.027
 ad+0.026
Neg logit contributions
 our-0.019
 enough-0.018
 another-0.017
 your-0.016
 band-0.016
Token!
Token ID0
Feature activation-2.08e-01

Pos logit contributions
 our+0.001
 enough+0.001
 another+0.001
 your+0.001
 later+0.001
Neg logit contributions
Me-0.002
cut-0.002
umm-0.002
year-0.002
 ad-0.002
Token
Token ID198
Feature activation-2.09e-02

Pos logit contributions
Me+0.009
cut+0.009
umm+0.009
 ad+0.009
year+0.009
Neg logit contributions
 our-0.007
 another-0.007
 enough-0.007
 your-0.006
 band-0.006
Token top
Token ID1353
Feature activation-1.14e-01

Pos logit contributions
 our+0.005
 another+0.004
 enough+0.004
 your+0.004
 something+0.004
Neg logit contributions
Me-0.013
umm-0.013
cut-0.013
year-0.012
 ad-0.012
Token to
Token ID284
Feature activation-4.16e-01

Pos logit contributions
Me+0.006
cut+0.005
umm+0.005
 ad+0.005
year+0.005
Neg logit contributions
 our-0.004
 enough-0.004
 another-0.003
 your-0.003
 band-0.003
Token make
Token ID787
Feature activation-1.77e-01

Pos logit contributions
Me+0.022
cut+0.021
umm+0.021
 ad+0.020
year+0.020
Neg logit contributions
 our-0.011
 enough-0.010
 another-0.010
 your-0.009
 band-0.009
Token it
Token ID340
Feature activation+1.90e-01

Pos logit contributions
Me+0.010
umm+0.009
cut+0.009
 ad+0.009
year+0.009
Neg logit contributions
 our-0.004
 enough-0.004
ounce-0.004
ig-0.004
 another-0.003
Token y
Token ID331
Feature activation-8.43e-01

Pos logit contributions
 our+0.006
 appearances+0.006
ounce+0.006
ig+0.006
 enough+0.006
Neg logit contributions
Me-0.008
umm-0.008
 ad-0.008
cut-0.008
 ju-0.007
Tokenumm
Token ID13929
Feature activation-2.57e+00

Pos logit contributions
umm+0.030
Me+0.029
year+0.025
 ad+0.025
cut+0.025
Neg logit contributions
 our-0.038
 another-0.035
 enough-0.035
 band-0.035
 loop-0.034
Tokenier
Token ID959
Feature activation-4.04e-01

Pos logit contributions
Me+0.105
cut+0.094
 ad+0.091
umm+0.091
year+0.090
Neg logit contributions
 our-0.104
 enough-0.103
 another-0.099
 later-0.094
 your-0.093
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
Me+0.019
umm+0.019
cut+0.019
 ad+0.018
thank+0.018
Neg logit contributions
 our-0.012
 enough-0.011
 another-0.011
 band-0.011
ounce-0.010
Token Sarah
Token ID10490
Feature activation-3.23e-01

Pos logit contributions
cut+0.007
igator+0.006
Me+0.006
 ad+0.006
umm+0.006
Neg logit contributions
 another-0.005
 our-0.005
 Sam-0.005
 your-0.005
 enough-0.004
Token was
Token ID373
Feature activation+1.82e-01

Pos logit contributions
Me+0.013
umm+0.012
cut+0.012
year+0.012
 ad+0.011
Neg logit contributions
 our-0.013
 enough-0.012
 another-0.012
 band-0.012
 your-0.012
Token still
Token ID991
Feature activation+6.57e-01

Pos logit contributions
 another+0.006
 enough+0.006
 our+0.006
 too+0.005
 something+0.005
Neg logit contributions
umm-0.009
Me-0.009
cut-0.009
Here-0.008
year-0.008
Token garden
Token ID11376
Feature activation-5.60e-02

Pos logit contributions
Me+0.031
cut+0.029
year+0.029
 ad+0.029
umm+0.028
Neg logit contributions
 our-0.029
 enough-0.028
 another-0.028
 your-0.026
 something-0.026
Token,
Token ID11
Feature activation-4.49e-01

Pos logit contributions
Me+0.003
cut+0.003
umm+0.003
 ad+0.003
year+0.003
Neg logit contributions
 our-0.001
 enough-0.001
 band-0.001
 another-0.001
 ourselves-0.001
Token and
Token ID290
Feature activation+1.01e-01

Pos logit contributions
umm+0.024
Me+0.022
year+0.022
bridge+0.021
cut+0.021
Neg logit contributions
 our-0.013
 another-0.013
 enough-0.013
 something-0.012
 your-0.012
Token picked
Token ID6497
Feature activation-4.32e-01

Pos logit contributions
 our+0.003
 another+0.003
 enough+0.003
 something+0.003
 later+0.003
Neg logit contributions
umm-0.005
Me-0.005
cut-0.005
 ad-0.005
year-0.005
Token the
Token ID262
Feature activation-4.95e-01

Pos logit contributions
Me+0.028
umm+0.026
cut+0.026
 ad+0.026
year+0.026
Neg logit contributions
 our-0.007
 enough-0.006
 another-0.006
 your-0.005
 band-0.005
Token ju
Token ID7544
Feature activation-2.53e+00

Pos logit contributions
year+0.026
Me+0.025
 ad+0.024
cut+0.024
umm+0.024
Neg logit contributions
 our-0.014
 enough-0.014
 another-0.014
 your-0.012
 my-0.012
Tokenic
Token ID291
Feature activation-1.79e+00

Pos logit contributions
Me+0.082
umm+0.075
cut+0.074
 ad+0.071
year+0.070
Neg logit contributions
 our-0.121
 enough-0.115
 another-0.114
 your-0.110
 band-0.109
Tokeniest
Token ID6386
Feature activation-5.71e-01

Pos logit contributions
Me+0.042
umm+0.036
 ad+0.034
cut+0.034
year+0.034
Neg logit contributions
 our-0.102
 enough-0.099
 another-0.098
 your-0.095
 band-0.094
Token carrots
Token ID34397
Feature activation-1.12e+00

Pos logit contributions
Me+0.029
umm+0.027
cut+0.027
 ad+0.026
year+0.026
Neg logit contributions
 our-0.017
 enough-0.016
 another-0.015
 your-0.014
 band-0.014
Token he
Token ID339
Feature activation-5.07e-01

Pos logit contributions
Me+0.060
umm+0.056
cut+0.056
 ad+0.055
year+0.055
Neg logit contributions
 our-0.030
 enough-0.028
 another-0.027
 your-0.025
 band-0.025
Token could
Token ID714
Feature activation-6.43e-01

Pos logit contributions
umm+0.024
cut+0.023
rol+0.022
 sensation+0.021
Me+0.021
Neg logit contributions
 another-0.019
 enough-0.017
 too-0.017
 later-0.017
 be-0.016
Token came
Token ID1625
Feature activation-1.78e-01

Pos logit contributions
 our+0.013
 enough+0.012
 another+0.012
 ourselves+0.011
 your+0.011
Neg logit contributions
Me-0.010
cut-0.009
 ad-0.009
year-0.009
umm-0.009
Token by
Token ID416
Feature activation-7.22e-01

Pos logit contributions
Me+0.008
 ad+0.007
year+0.007
cut+0.007
 capt+0.007
Neg logit contributions
 our-0.006
 enough-0.006
 appearances-0.005
 loop-0.005
oppy-0.005
Token.
Token ID13
Feature activation+3.78e-01

Pos logit contributions
Me+0.038
cut+0.036
 ad+0.036
year+0.035
thank+0.034
Neg logit contributions
 our-0.018
 enough-0.017
oppy-0.016
ig-0.016
 band-0.015
Token "
Token ID366
Feature activation-2.85e-02

Pos logit contributions
 our+0.012
 enough+0.011
 another+0.011
 ourselves+0.010
 your+0.010
Neg logit contributions
Me-0.018
cut-0.017
umm-0.017
 ad-0.017
year-0.017
TokenWhat
Token ID2061
Feature activation+9.33e-01

Pos logit contributions
Me+0.001
umm+0.001
cut+0.001
 ad+0.001
year+0.001
Neg logit contributions
 our-0.001
 another-0.001
 enough-0.001
 band-0.001
 your-0.001
Token's
Token ID338
Feature activation+2.13e+00

Pos logit contributions
 scar+0.030
 our+0.030
 ourselves+0.030
ounce+0.029
ig+0.029
Neg logit contributions
Me-0.042
 ad-0.037
year-0.036
vous-0.035
cut-0.034
Token wrong
Token ID2642
Feature activation+5.96e-01

Pos logit contributions
ounce+0.090
oppy+0.087
 another+0.087
 our+0.086
 enough+0.086
Neg logit contributions
Me-0.068
cut-0.064
 ad-0.063
umm-0.058
Here-0.057
Token?"
Token ID1701
Feature activation+3.44e-01

Pos logit contributions
 our+0.014
 ourselves+0.013
 another+0.012
 enough+0.012
 scar+0.012
Neg logit contributions
Me-0.031
cut-0.031
umm-0.031
 ad-0.031
 capt-0.029
Token he
Token ID339
Feature activation+3.63e-01

Pos logit contributions
 our+0.007
 enough+0.007
 another+0.007
 your+0.006
 band+0.006
Neg logit contributions
Me-0.020
umm-0.019
cut-0.019
year-0.019
 ad-0.019
Token asked
Token ID1965
Feature activation+5.82e-01

Pos logit contributions
 our+0.018
 enough+0.017
 ourselves+0.016
 another+0.016
ig+0.016
Neg logit contributions
Me-0.011
cut-0.010
 ad-0.010
 capt-0.009
year-0.009
Token.
Token ID13
Feature activation+4.86e-01

Pos logit contributions
 our+0.016
 enough+0.014
 another+0.014
 your+0.013
 band+0.013
Neg logit contributions
Me-0.031
umm-0.030
cut-0.029
 ad-0.029
year-0.029
Token and
Token ID290
Feature activation+4.62e-01

Pos logit contributions
 our+0.017
 ourselves+0.015
 loop+0.014
 band+0.014
oppy+0.014
Neg logit contributions
 ad-0.021
Me-0.021
umm-0.021
year-0.020
cut-0.020
Token asked
Token ID1965
Feature activation+4.69e-01

Pos logit contributions
 our+0.019
 enough+0.018
 another+0.018
 your+0.017
 band+0.017
Neg logit contributions
Me-0.018
umm-0.018
cut-0.017
year-0.017
 ad-0.016
Token,
Token ID11
Feature activation+3.75e-01

Pos logit contributions
 our+0.012
 loop+0.011
 ourselves+0.011
 enough+0.010
ounce+0.010
Neg logit contributions
Me-0.025
 ad-0.024
year-0.023
cut-0.023
umm-0.022
Token "
Token ID366
Feature activation+2.63e-02

Pos logit contributions
 our+0.012
 another+0.012
 enough+0.012
 your+0.011
 something+0.011
Neg logit contributions
umm-0.018
Me-0.017
cut-0.017
 ad-0.016
year-0.016
TokenWhat
Token ID2061
Feature activation+9.08e-01

Pos logit contributions
 our+0.001
 another+0.001
 enough+0.001
 band+0.001
 your+0.001
Neg logit contributions
Me-0.001
umm-0.001
cut-0.001
 ad-0.001
year-0.001
Token's
Token ID338
Feature activation+2.11e+00

Pos logit contributions
ounce+0.026
 scar+0.025
ig+0.025
 enough+0.024
 our+0.024
Neg logit contributions
Me-0.045
 ad-0.040
cut-0.038
year-0.037
 redd-0.037
Token an
Token ID281
Feature activation+9.29e-01

Pos logit contributions
ounce+0.082
oppy+0.075
 enough+0.074
ucks+0.073
ig+0.073
Neg logit contributions
Me-0.078
cut-0.078
 ad-0.076
year-0.071
Here-0.068
Token earthquake
Token ID16295
Feature activation+2.14e-01

Pos logit contributions
 our+0.046
 enough+0.044
 another+0.044
 your+0.042
 band+0.042
Neg logit contributions
Me-0.028
cut-0.026
umm-0.026
 ad-0.025
year-0.024
Token?"
Token ID1701
Feature activation+5.70e-01

Pos logit contributions
 our+0.005
 another+0.004
 enough+0.004
 ourselves+0.004
 band+0.004
Neg logit contributions
Me-0.011
cut-0.011
umm-0.011
 ad-0.011
year-0.011
Token Her
Token ID2332
Feature activation+5.68e-01

Pos logit contributions
 our+0.025
 enough+0.024
 another+0.024
 your+0.023
 band+0.023
Neg logit contributions
Me-0.021
cut-0.019
umm-0.018
 ad-0.018
year-0.018
Token mom
Token ID1995
Feature activation+3.73e-01

Pos logit contributions
 our+0.013
 enough+0.012
 another+0.012
 band+0.010
 your+0.010
Neg logit contributions
Me-0.033
cut-0.032
umm-0.031
year-0.031
 ad-0.031
Token,
Token ID11
Feature activation+4.28e-01

Pos logit contributions
 our+0.007
 ourselves+0.007
 loop+0.006
ounce+0.006
 enough+0.006
Neg logit contributions
Me-0.019
cut-0.018
 ad-0.017
year-0.017
umm-0.016
Token "
Token ID366
Feature activation+6.00e-03

Pos logit contributions
 our+0.018
 enough+0.017
 another+0.017
 ourselves+0.016
 your+0.016
Neg logit contributions
Me-0.016
cut-0.015
 ad-0.015
umm-0.014
year-0.014
TokenHello
Token ID15496
Feature activation+3.26e-01

Pos logit contributions
 another+0.000
 our+0.000
 band+0.000
 half+0.000
 enough+0.000
Neg logit contributions
umm-0.000
year-0.000
 ad-0.000
cut-0.000
 capt-0.000
Token,
Token ID11
Feature activation+6.59e-01

Pos logit contributions
 our+0.008
 ourselves+0.008
 band+0.007
 appearances+0.007
 loop+0.007
Neg logit contributions
Me-0.017
umm-0.016
 ad-0.015
cut-0.015
 ju-0.015
Token why
Token ID1521
Feature activation+2.72e-01

Pos logit contributions
 our+0.023
 enough+0.022
 another+0.021
 ourselves+0.021
 band+0.021
Neg logit contributions
Me-0.030
cut-0.027
 ad-0.026
umm-0.026
year-0.025
Token are
Token ID389
Feature activation+2.04e+00

Pos logit contributions
 another+0.009
 enough+0.009
 our+0.009
ounce+0.009
 band+0.009
Neg logit contributions
Me-0.012
 ad-0.011
cut-0.011
igator-0.010
year-0.010
Token you
Token ID345
Feature activation+5.72e-01

Pos logit contributions
 enough+0.040
 loop+0.039
ounce+0.038
ig+0.038
 another+0.038
Neg logit contributions
Me-0.119
cut-0.106
 ad-0.105
year-0.104
umm-0.104
Token here
Token ID994
Feature activation-4.71e-02

Pos logit contributions
 our+0.023
 band+0.022
 ourselves+0.022
 another+0.022
 loop+0.022
Neg logit contributions
cut-0.019
 ad-0.019
Me-0.019
 capt-0.017
umm-0.017
Token?"
Token ID1701
Feature activation+8.25e-01

Pos logit contributions
Me+0.002
cut+0.002
 ad+0.002
umm+0.002
igator+0.002
Neg logit contributions
 enough-0.001
 our-0.001
 appearances-0.001
 ourselves-0.001
 band-0.001
Token The
Token ID383
Feature activation+4.59e-01

Pos logit contributions
 our+0.032
 enough+0.030
 another+0.029
 your+0.028
 band+0.028
Neg logit contributions
Me-0.035
cut-0.032
umm-0.032
 ad-0.031
year-0.031
Token squirrel
Token ID33039
Feature activation+3.72e-01

Pos logit contributions
 our+0.014
 enough+0.013
 another+0.012
 ourselves+0.011
 your+0.011
Neg logit contributions
Me-0.024
cut-0.022
 ad-0.022
umm-0.022
year-0.021
Token and
Token ID290
Feature activation+6.05e-01

Pos logit contributions
 our+0.013
oppy+0.012
 ourselves+0.011
ounce+0.011
 your+0.011
Neg logit contributions
 ad-0.020
umm-0.020
cut-0.019
Me-0.019
 sensation-0.019
Token asked
Token ID1965
Feature activation+4.23e-01

Pos logit contributions
 our+0.019
 enough+0.018
 another+0.018
 your+0.018
 band+0.018
Neg logit contributions
umm-0.031
Me-0.030
bridge-0.027
cut-0.027
year-0.026
Token,
Token ID11
Feature activation+5.46e-01

Pos logit contributions
 our+0.011
 enough+0.010
 another+0.010
 band+0.009
 your+0.009
Neg logit contributions
Me-0.023
cut-0.022
umm-0.022
 ad-0.021
year-0.021
Token "
Token ID366
Feature activation+1.32e-01

Pos logit contributions
 our+0.020
 another+0.019
 enough+0.019
 your+0.018
 band+0.017
Neg logit contributions
Me-0.024
umm-0.023
cut-0.022
 ad-0.022
year-0.021
TokenWhat
Token ID2061
Feature activation+8.99e-01

Pos logit contributions
 our+0.006
 enough+0.006
 another+0.006
 your+0.006
 band+0.006
Neg logit contributions
Me-0.004
umm-0.004
cut-0.004
 ad-0.004
year-0.004
Token's
Token ID338
Feature activation+2.22e+00

Pos logit contributions
 scar+0.027
 enough+0.026
 our+0.026
ig+0.026
ounce+0.026
Neg logit contributions
Me-0.042
 ad-0.038
cut-0.037
year-0.037
vous-0.035
Token wrong
Token ID2642
Feature activation+7.87e-01

Pos logit contributions
 our+0.095
 enough+0.092
 another+0.091
ounce+0.087
 band+0.087
Neg logit contributions
Me-0.079
cut-0.074
 ad-0.072
umm-0.071
year-0.070
Token,
Token ID11
Feature activation+6.91e-01

Pos logit contributions
 our+0.016
 enough+0.014
 another+0.014
 ourselves+0.014
 band+0.013
Neg logit contributions
Me-0.044
cut-0.044
umm-0.043
 ad-0.043
year-0.041
Token Tim
Token ID5045
Feature activation+1.45e-01

Pos logit contributions
 our+0.024
 enough+0.022
 another+0.022
 band+0.021
 your+0.021
Neg logit contributions
Me-0.032
cut-0.030
umm-0.030
 ad-0.029
year-0.029
Tokenmy
Token ID1820
Feature activation+3.48e-01

Pos logit contributions
 our+0.005
 enough+0.005
 another+0.004
 ourselves+0.004
 band+0.004
Neg logit contributions
Me-0.007
cut-0.006
umm-0.006
 ad-0.006
year-0.006
Token?"
Token ID1701
Feature activation+6.37e-01

Pos logit contributions
 our+0.007
 enough+0.007
 another+0.007
 band+0.006
 your+0.006
Neg logit contributions
Me-0.020
cut-0.019
umm-0.019
 ad-0.019
year-0.019
Token"
Token ID1
Feature activation+3.87e-01

Pos logit contributions
 our+0.014
 another+0.013
 your+0.013
 later+0.013
 bat+0.013
Neg logit contributions
umm-0.023
 ad-0.020
 capt-0.020
bridge-0.020
year-0.020
TokenMom
Token ID29252
Feature activation-8.13e-01

Pos logit contributions
 another+0.017
 half+0.017
 bat+0.016
 band+0.016
 our+0.016
Neg logit contributions
 ad-0.013
year-0.013
umm-0.012
cut-0.012
 capt-0.012
Tokenmy
Token ID1820
Feature activation-4.30e-02

Pos logit contributions
Me+0.036
cut+0.034
umm+0.034
 ad+0.033
year+0.032
Neg logit contributions
 our-0.028
 another-0.026
 enough-0.026
 ourselves-0.025
 band-0.024
Token,
Token ID11
Feature activation+5.64e-01

Pos logit contributions
Me+0.002
cut+0.002
umm+0.002
 ad+0.002
thank+0.002
Neg logit contributions
 our-0.001
 another-0.001
 ourselves-0.001
 enough-0.001
 band-0.001
Token what
Token ID644
Feature activation+2.53e-01

Pos logit contributions
ounce+0.018
 ourselves+0.017
ig+0.016
 enough+0.016
 appearances+0.016
Neg logit contributions
Me-0.024
 ju-0.022
 ad-0.022
cut-0.022
 redd-0.022
Token's
Token ID338
Feature activation+1.97e+00

Pos logit contributions
ounce+0.008
 scar+0.008
 ourselves+0.008
 enough+0.008
 later+0.008
Neg logit contributions
Me-0.011
 ad-0.010
 redd-0.009
 ju-0.009
 liked-0.009
Token this
Token ID428
Feature activation+5.31e-01

Pos logit contributions
ounce+0.047
 enough+0.042
 another+0.042
 our+0.040
ig+0.040
Neg logit contributions
Me-0.100
cut-0.098
 ad-0.096
Here-0.091
umm-0.091
Token?"
Token ID1701
Feature activation+8.52e-01

Pos logit contributions
 our+0.015
 enough+0.014
ounce+0.013
 ourselves+0.013
 another+0.013
Neg logit contributions
Me-0.026
umm-0.024
cut-0.024
 ad-0.023
 redd-0.023
Token Tim
Token ID5045
Feature activation+2.62e-01

Pos logit contributions
 Sam+0.017
 another+0.016
 our+0.016
 something+0.015
 your+0.015
Neg logit contributions
umm-0.057
year-0.054
bridge-0.052
rol-0.052
 capt-0.051
Tokenmy
Token ID1820
Feature activation+2.30e-01

Pos logit contributions
 our+0.007
 enough+0.006
 another+0.006
 band+0.006
 brother+0.006
Neg logit contributions
Me-0.014
umm-0.014
cut-0.013
year-0.013
 ad-0.013
Token asked
Token ID1965
Feature activation+7.04e-01

Pos logit contributions
 our+0.011
 enough+0.010
 another+0.010
 ourselves+0.010
 your+0.009
Neg logit contributions
Me-0.008
cut-0.007
 ad-0.007
umm-0.007
year-0.007
Token,
Token ID11
Feature activation+4.04e-02

Pos logit contributions
Me+0.010
cut+0.009
 ad+0.009
umm+0.009
year+0.009
Neg logit contributions
 our-0.004
 enough-0.004
 another-0.004
 band-0.004
ounce-0.004
Token the
Token ID262
Feature activation-3.38e-01

Pos logit contributions
 our+0.001
 enough+0.001
ounce+0.001
 band+0.001
 another+0.001
Neg logit contributions
Me-0.002
cut-0.002
 ad-0.002
umm-0.002
year-0.002
Token tree
Token ID5509
Feature activation-1.21e-01

Pos logit contributions
Me+0.016
year+0.016
umm+0.016
 ad+0.015
cut+0.015
Neg logit contributions
 our-0.010
 another-0.010
 enough-0.010
 band-0.010
 loop-0.009
Token continued
Token ID3767
Feature activation-1.03e-01

Pos logit contributions
Me+0.005
cut+0.005
umm+0.004
 ad+0.004
year+0.004
Neg logit contributions
 our-0.005
 enough-0.005
 another-0.004
 your-0.004
 ourselves-0.004
Token to
Token ID284
Feature activation-1.21e-01

Pos logit contributions
Me+0.006
cut+0.006
 ad+0.005
umm+0.005
year+0.005
Neg logit contributions
 our-0.002
 enough-0.002
 another-0.002
 ourselves-0.002
ounce-0.002
Token grow
Token ID1663
Feature activation+5.42e-01

Pos logit contributions
Me+0.006
cut+0.006
 ad+0.006
umm+0.006
thank+0.006
Neg logit contributions
 our-0.003
 enough-0.003
 another-0.003
ounce-0.003
 your-0.003
Token,
Token ID11
Feature activation-2.73e-02

Pos logit contributions
 our+0.014
 enough+0.014
 another+0.013
 your+0.012
 something+0.012
Neg logit contributions
Me-0.030
umm-0.028
cut-0.028
 ad-0.028
year-0.027
Token and
Token ID290
Feature activation+4.61e-01

Pos logit contributions
Me+0.001
umm+0.001
cut+0.001
 ad+0.001
year+0.001
Neg logit contributions
 our-0.001
 enough-0.001
 another-0.001
 your-0.001
 band-0.001
Token it
Token ID340
Feature activation-2.16e-01

Pos logit contributions
 our+0.013
 enough+0.012
 another+0.011
 your+0.011
 band+0.011
Neg logit contributions
Me-0.024
cut-0.023
umm-0.023
 ad-0.022
year-0.022
Token became
Token ID2627
Feature activation-5.08e-01

Pos logit contributions
Me+0.008
umm+0.008
cut+0.008
year+0.007
 ad+0.007
Neg logit contributions
 our-0.009
 enough-0.009
 another-0.008
 your-0.008
 band-0.008
Token a
Token ID257
Feature activation-2.67e-01

Pos logit contributions
umm+0.030
Me+0.030
cut+0.030
year+0.029
 ad+0.028
Neg logit contributions
 our-0.012
 enough-0.012
 another-0.012
 something-0.010
 your-0.010
Token,
Token ID11
Feature activation+6.65e-01

Pos logit contributions
 our+0.020
 enough+0.019
 another+0.018
 band+0.018
 ourselves+0.016
Neg logit contributions
Me-0.066
cut-0.061
 ad-0.061
umm-0.060
year-0.059
Token there
Token ID612
Feature activation+8.24e-02

Pos logit contributions
 our+0.018
 enough+0.017
 another+0.016
 your+0.015
 band+0.015
Neg logit contributions
Me-0.035
umm-0.034
cut-0.033
 ad-0.033
year-0.033
Token lived
Token ID5615
Feature activation+2.65e-01

Pos logit contributions
 another+0.003
 our+0.003
 something+0.002
 enough+0.002
 later+0.002
Neg logit contributions
umm-0.004
year-0.004
cut-0.004
 redd-0.004
thank-0.004
Token a
Token ID257
Feature activation+4.77e-01

Pos logit contributions
 enough+0.006
ounce+0.006
ig+0.006
 break+0.006
 mistake+0.006
Neg logit contributions
Me-0.015
Here-0.014
year-0.013
thank-0.013
 ad-0.012
Token little
Token ID1310
Feature activation+5.54e-01

Pos logit contributions
 our+0.016
 another+0.015
 band+0.014
 enough+0.014
 your+0.014
Neg logit contributions
Me-0.022
umm-0.021
cut-0.021
year-0.021
 ad-0.020
Token rabbit
Token ID22746
Feature activation+7.17e-01

Pos logit contributions
 our+0.023
 enough+0.023
 another+0.022
 ourselves+0.022
ounce+0.021
Neg logit contributions
Me-0.021
cut-0.019
umm-0.019
 ad-0.018
year-0.017
Token called
Token ID1444
Feature activation+2.69e-01

Pos logit contributions
 our+0.029
 enough+0.027
 another+0.027
 your+0.026
 band+0.026
Neg logit contributions
Me-0.029
umm-0.027
cut-0.026
year-0.026
 ad-0.026
Token Clara
Token ID27443
Feature activation+5.52e-01

Pos logit contributions
 band+0.008
 enough+0.008
 our+0.008
 mistake+0.008
ounce+0.008
Neg logit contributions
Me-0.012
cut-0.011
umm-0.010
 ad-0.010
igator-0.010
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
 our+0.015
 enough+0.014
 another+0.014
 your+0.013
 band+0.013
Neg logit contributions
Me-0.029
umm-0.027
cut-0.027
 ad-0.027
year-0.027
Token She
Token ID1375
Feature activation-8.47e-02

Pos logit contributions
 our+0.006
 enough+0.005
 another+0.005
 your+0.005
 band+0.005
Neg logit contributions
Me-0.006
cut-0.005
umm-0.005
 ad-0.005
year-0.005
Token was
Token ID373
Feature activation+5.05e-01

Pos logit contributions
umm+0.004
year+0.003
rol+0.003
cut+0.003
 redd+0.003
Neg logit contributions
 our-0.004
 your-0.004
 another-0.004
 be-0.004
 band-0.004
Token She
Token ID1375
Feature activation+6.34e-02

Pos logit contributions
 our+0.020
 enough+0.019
 another+0.019
 your+0.018
 band+0.017
Neg logit contributions
Me-0.022
cut-0.021
umm-0.020
 ad-0.020
year-0.020
Token smiled
Token ID13541
Feature activation-1.71e-01

Pos logit contributions
 our+0.003
 enough+0.003
 ourselves+0.002
 another+0.002
 your+0.002
Neg logit contributions
Me-0.002
cut-0.002
 ad-0.002
year-0.002
umm-0.002
Token and
Token ID290
Feature activation+1.51e-01

Pos logit contributions
Me+0.009
 ad+0.009
year+0.008
 capt+0.008
cut+0.008
Neg logit contributions
 our-0.005
 ourselves-0.004
 loop-0.004
oppy-0.004
 appearances-0.004
Token hugged
Token ID41130
Feature activation-2.24e-01

Pos logit contributions
 our+0.006
 ourselves+0.006
 enough+0.006
ig+0.006
oppy+0.006
Neg logit contributions
Me-0.006
 ad-0.005
year-0.005
cut-0.005
 capt-0.005
Token Lily
Token ID20037
Feature activation+3.43e-01

Pos logit contributions
Me+0.013
umm+0.013
cut+0.013
year+0.012
 ad+0.012
Neg logit contributions
 our-0.005
 enough-0.005
 another-0.005
 your-0.004
 band-0.004
Token.
Token ID13
Feature activation+2.47e-01

Pos logit contributions
 our+0.011
 ourselves+0.010
 enough+0.010
 band+0.010
 loop+0.010
Neg logit contributions
Me-0.016
 ad-0.015
cut-0.015
year-0.014
umm-0.013
Token
Token ID198
Feature activation+4.37e-02

Pos logit contributions
 another+0.008
 our+0.007
 enough+0.007
 your+0.007
 something+0.007
Neg logit contributions
cut-0.012
Me-0.012
umm-0.012
 ad-0.012
year-0.011
Token
Token ID198
Feature activation+3.95e-01

Pos logit contributions
 our+0.001
 another+0.001
 your+0.001
 later+0.001
 ourselves+0.001
Neg logit contributions
umm-0.002
Me-0.002
cut-0.002
bridge-0.002
 redd-0.002
Token"
Token ID1
Feature activation+2.55e-01

Pos logit contributions
 our+0.006
 another+0.005
 your+0.005
 enough+0.005
 band+0.004
Neg logit contributions
Me-0.025
umm-0.025
cut-0.024
 ad-0.024
year-0.024
TokenDon
Token ID3987
Feature activation-3.03e-01

Pos logit contributions
 our+0.013
 enough+0.012
 another+0.012
 band+0.011
 your+0.011
Neg logit contributions
Me-0.008
umm-0.007
cut-0.007
 ad-0.007
year-0.007
Token't
Token ID470
Feature activation-3.84e-03

Pos logit contributions
Me+0.013
cut+0.011
umm+0.011
 ad+0.011
year+0.011
Neg logit contributions
 our-0.012
 enough-0.011
 another-0.011
 band-0.010
 your-0.010
Token.
Token ID13
Feature activation+5.29e-01

Pos logit contributions
Me+0.019
 ad+0.019
cut+0.019
year+0.018
umm+0.018
Neg logit contributions
 our-0.012
 band-0.010
 loop-0.010
 ourselves-0.010
 your-0.010
Token
Token ID198
Feature activation+5.55e-01

Pos logit contributions
 our+0.023
 enough+0.022
 another+0.022
 band+0.021
 your+0.021
Neg logit contributions
Me-0.019
umm-0.018
cut-0.018
year-0.017
 ad-0.017
Token
Token ID198
Feature activation+9.02e-01

Pos logit contributions
 our+0.016
 another+0.015
 band+0.014
 your+0.014
 later+0.013
Neg logit contributions
umm-0.030
Me-0.027
 redd-0.027
cut-0.026
 ad-0.026
TokenThe
Token ID464
Feature activation+6.73e-02

Pos logit contributions
 our+0.037
 enough+0.035
 another+0.034
 your+0.033
 band+0.033
Neg logit contributions
Me-0.036
umm-0.033
cut-0.033
 ad-0.032
year-0.032
Token superhero
Token ID17343
Feature activation-7.24e-02

Pos logit contributions
 our+0.003
 enough+0.002
 ourselves+0.002
 another+0.002
ounce+0.002
Neg logit contributions
Me-0.003
cut-0.003
 ad-0.003
year-0.003
umm-0.003
Token had
Token ID550
Feature activation+2.48e-01

Pos logit contributions
Me+0.003
cut+0.003
 ad+0.002
year+0.002
thank+0.002
Neg logit contributions
 our-0.003
 ourselves-0.003
 enough-0.003
ounce-0.003
 another-0.003
Token rescued
Token ID19868
Feature activation+2.17e-01

Pos logit contributions
 our+0.010
oppy+0.010
 appearances+0.010
 ourselves+0.010
ig+0.010
Neg logit contributions
Me-0.009
thank-0.008
 ad-0.008
year-0.008
 capt-0.007
Token the
Token ID262
Feature activation+1.69e-01

Pos logit contributions
 our+0.005
 enough+0.005
ounce+0.005
 band+0.005
 another+0.005
Neg logit contributions
Me-0.011
cut-0.011
umm-0.011
 ad-0.010
year-0.010
Token caul
Token ID46087
Feature activation-4.07e-01

Pos logit contributions
 our+0.006
 enough+0.005
 another+0.005
 your+0.005
 ourselves+0.005
Neg logit contributions
Me-0.008
cut-0.007
umm-0.007
 ad-0.007
year-0.007
Tokeniflower
Token ID42642
Feature activation+3.87e-01

Pos logit contributions
Me+0.014
cut+0.014
umm+0.013
 ad+0.013
year+0.013
Neg logit contributions
 our-0.018
 enough-0.018
 another-0.017
 your-0.016
 loop-0.016
Token and
Token ID290
Feature activation+2.16e-01

Pos logit contributions
 our+0.010
ounce+0.009
 scar+0.009
 band+0.009
 enough+0.009
Neg logit contributions
cut-0.020
Me-0.019
 ad-0.019
umm-0.019
 surrounding-0.018
Token play
Token ID711
Feature activation+3.17e-01

Pos logit contributions
 our+0.006
 another+0.005
 enough+0.005
 your+0.005
 band+0.005
Neg logit contributions
Me-0.014
umm-0.014
cut-0.013
year-0.013
 ad-0.013
Token there
Token ID612
Feature activation-4.52e-01

Pos logit contributions
 our+0.009
 enough+0.008
 another+0.008
 your+0.008
 band+0.008
Neg logit contributions
Me-0.016
umm-0.016
cut-0.016
 ad-0.015
year-0.015
Token.
Token ID13
Feature activation-3.87e-01

Pos logit contributions
Me+0.026
umm+0.025
cut+0.024
year+0.024
 ad+0.023
Neg logit contributions
 our-0.011
 enough-0.010
 another-0.010
 your-0.009
 later-0.009
Token One
Token ID1881
Feature activation-1.72e-01

Pos logit contributions
cut+0.019
Me+0.019
umm+0.019
 ad+0.018
year+0.018
Neg logit contributions
 our-0.012
 another-0.012
 enough-0.011
 your-0.010
 Sam-0.010
Token sunny
Token ID27737
Feature activation+1.02e-01

Pos logit contributions
umm+0.005
Me+0.005
cut+0.005
year+0.004
 ad+0.004
Neg logit contributions
 our-0.009
 another-0.009
 enough-0.008
 your-0.008
 band-0.008
Token day
Token ID1110
Feature activation+3.99e-01

Pos logit contributions
 our+0.005
 another+0.004
 your+0.004
 later+0.004
 enough+0.004
Neg logit contributions
year-0.003
umm-0.003
cut-0.003
Me-0.003
 capt-0.003
Token,
Token ID11
Feature activation+7.69e-02

Pos logit contributions
 our+0.008
 enough+0.008
ounce+0.008
 band+0.008
 ourselves+0.008
Neg logit contributions
Me-0.022
 ad-0.020
cut-0.020
year-0.020
umm-0.019
Token a
Token ID257
Feature activation+1.64e-01

Pos logit contributions
 our+0.001
 enough+0.001
 another+0.001
ounce+0.001
ig+0.001
Neg logit contributions
Me-0.005
cut-0.005
 ad-0.004
umm-0.004
year-0.004
Token little
Token ID1310
Feature activation+1.16e-01

Pos logit contributions
 our+0.005
 another+0.005
 band+0.005
 enough+0.005
 loop+0.005
Neg logit contributions
umm-0.008
cut-0.008
Me-0.008
year-0.007
 ad-0.007
Token girl
Token ID2576
Feature activation-6.14e-01

Pos logit contributions
 our+0.006
 ourselves+0.005
 enough+0.005
 another+0.005
 appearances+0.005
Neg logit contributions
Me-0.004
cut-0.003
umm-0.003
 ad-0.003
igator-0.003
Token named
Token ID3706
Feature activation-8.14e-02

Pos logit contributions
umm+0.029
Me+0.028
cut+0.027
year+0.025
 redd+0.025
Neg logit contributions
 our-0.022
 another-0.020
 enough-0.020
 your-0.020
 later-0.019
Token was
Token ID373
Feature activation+2.61e-01

Pos logit contributions
Me+0.010
umm+0.010
cut+0.009
year+0.009
 ad+0.009
Neg logit contributions
 our-0.009
 enough-0.009
 another-0.009
 band-0.009
 loop-0.008
Token so
Token ID523
Feature activation-2.54e-01

Pos logit contributions
 our+0.012
 ourselves+0.011
 enough+0.011
ounce+0.011
 another+0.011
Neg logit contributions
Me-0.009
cut-0.008
year-0.008
 ad-0.008
umm-0.007
Token sad
Token ID6507
Feature activation-1.91e-01

Pos logit contributions
Me+0.010
umm+0.010
cut+0.009
 ad+0.009
year+0.009
Neg logit contributions
 our-0.010
 enough-0.010
 another-0.010
 your-0.009
 band-0.009
Token and
Token ID290
Feature activation+2.95e-01

Pos logit contributions
 ad+0.009
Me+0.008
cut+0.008
umm+0.008
thank+0.008
Neg logit contributions
 our-0.006
 scar-0.005
oppy-0.005
 ourselves-0.005
 loop-0.005
Token he
Token ID339
Feature activation-3.88e-01

Pos logit contributions
 our+0.011
 enough+0.011
 another+0.010
 your+0.010
 too+0.010
Neg logit contributions
Me-0.013
umm-0.013
bridge-0.012
rol-0.012
year-0.012
Token felt
Token ID2936
Feature activation-4.64e-01

Pos logit contributions
umm+0.016
Me+0.016
year+0.015
cut+0.015
 redd+0.014
Neg logit contributions
 our-0.015
 enough-0.015
 another-0.014
 band-0.014
 loop-0.013
Token like
Token ID588
Feature activation-2.20e-01

Pos logit contributions
Me+0.021
umm+0.021
cut+0.020
 ad+0.019
year+0.019
Neg logit contributions
 our-0.017
 another-0.017
 enough-0.016
 something-0.016
 your-0.015
Token all
Token ID477
Feature activation+4.32e-01

Pos logit contributions
Me+0.011
umm+0.011
cut+0.011
 ad+0.010
year+0.010
Neg logit contributions
 our-0.006
 another-0.006
 enough-0.006
 your-0.005
 something-0.005
Token his
Token ID465
Feature activation+3.92e-01

Pos logit contributions
 our+0.010
 enough+0.009
 another+0.009
 your+0.008
 band+0.008
Neg logit contributions
Me-0.025
umm-0.024
cut-0.023
 ad-0.023
year-0.023
Token work
Token ID670
Feature activation-8.24e-02

Pos logit contributions
 our+0.009
 another+0.009
 band+0.009
 enough+0.009
 brother+0.008
Neg logit contributions
Me-0.022
umm-0.021
year-0.021
rol-0.021
cut-0.021
Token was
Token ID373
Feature activation+3.75e-03

Pos logit contributions
Me+0.004
umm+0.003
cut+0.003
 ad+0.003
year+0.003
Neg logit contributions
 our-0.003
 enough-0.003
 another-0.003
 your-0.003
 band-0.003
Token would
Token ID561
Feature activation+3.68e-02

Pos logit contributions
 our+0.024
 enough+0.023
 ourselves+0.022
 another+0.022
 your+0.022
Neg logit contributions
Me-0.013
 ad-0.012
cut-0.012
thank-0.011
 capt-0.011
Token play
Token ID711
Feature activation+4.84e-03

Pos logit contributions
 our+0.001
 enough+0.001
 another+0.001
 ourselves+0.001
ounce+0.001
Neg logit contributions
Me-0.002
cut-0.002
 ad-0.002
umm-0.002
year-0.002
Token with
Token ID351
Feature activation-1.18e+00

Pos logit contributions
 our+0.000
 enough+0.000
 ourselves+0.000
oppy+0.000
ounce+0.000
Neg logit contributions
Me-0.000
cut-0.000
 ad-0.000
year-0.000
thank-0.000
Token his
Token ID465
Feature activation+8.63e-02

Pos logit contributions
Me+0.064
umm+0.060
cut+0.060
 ad+0.059
year+0.059
Neg logit contributions
 our-0.031
 enough-0.029
 another-0.028
 your-0.026
 band-0.026
Token friends
Token ID2460
Feature activation-5.58e-01

Pos logit contributions
 our+0.002
 enough+0.002
 another+0.002
 ourselves+0.002
 your+0.002
Neg logit contributions
Me-0.005
cut-0.004
umm-0.004
 ad-0.004
year-0.004
Token in
Token ID287
Feature activation-3.76e-01

Pos logit contributions
Me+0.028
cut+0.027
 ad+0.027
year+0.026
umm+0.026
Neg logit contributions
 our-0.016
 enough-0.014
 ourselves-0.014
 another-0.014
 band-0.013
Token the
Token ID262
Feature activation-5.92e-02

Pos logit contributions
Me+0.020
 ad+0.019
cut+0.019
year+0.018
umm+0.018
Neg logit contributions
 enough-0.009
 our-0.009
ounce-0.009
 ourselves-0.008
ig-0.008
Token snow
Token ID6729
Feature activation+2.93e-01

Pos logit contributions
Me+0.002
umm+0.002
cut+0.002
 ad+0.002
year+0.002
Neg logit contributions
 our-0.002
 enough-0.002
 another-0.002
 ourselves-0.002
 your-0.002
Token.
Token ID13
Feature activation-6.70e-01

Pos logit contributions
 our+0.009
 enough+0.008
 another+0.008
 your+0.008
 band+0.007
Neg logit contributions
Me-0.014
cut-0.014
umm-0.014
 ad-0.013
thank-0.013
Token They
Token ID1119
Feature activation-1.09e-01

Pos logit contributions
Me+0.028
cut+0.026
umm+0.026
 ad+0.026
year+0.025
Neg logit contributions
 our-0.026
 enough-0.024
 another-0.024
 your-0.023
 band-0.022
Token would
Token ID561
Feature activation+4.74e-01

Pos logit contributions
umm+0.004
Me+0.004
year+0.004
Here+0.004
cut+0.004
Neg logit contributions
 our-0.005
 enough-0.004
 another-0.004
 bat-0.004
 band-0.004
Token very
Token ID845
Feature activation+3.87e-01

Pos logit contributions
 our+0.030
 enough+0.029
 another+0.029
 your+0.027
 band+0.027
Neg logit contributions
Me-0.035
umm-0.034
cut-0.032
 ad-0.031
year-0.030
Token nice
Token ID3621
Feature activation+3.37e-01

Pos logit contributions
 our+0.014
 another+0.013
 band+0.013
 enough+0.013
 your+0.013
Neg logit contributions
umm-0.018
Me-0.018
bridge-0.016
cut-0.016
 ad-0.016
Token.
Token ID13
Feature activation+9.97e-01

Pos logit contributions
 our+0.008
 ourselves+0.006
 another+0.006
 your+0.006
 loop+0.006
Neg logit contributions
Me-0.019
 ad-0.018
umm-0.018
cut-0.018
year-0.018
Token He
Token ID679
Feature activation+4.17e-02

Pos logit contributions
 another+0.034
 my+0.031
 our+0.031
 too+0.030
 Sam+0.030
Neg logit contributions
cut-0.049
Me-0.045
bridge-0.045
rol-0.045
umm-0.045
Token told
Token ID1297
Feature activation+7.10e-01

Pos logit contributions
 our+0.001
 enough+0.001
 another+0.001
 your+0.001
 band+0.001
Neg logit contributions
Me-0.002
cut-0.002
umm-0.002
 ad-0.002
year-0.002
Token Jane
Token ID12091
Feature activation-5.31e-01

Pos logit contributions
 our+0.032
 enough+0.031
 another+0.031
 your+0.030
 later+0.029
Neg logit contributions
Me-0.026
umm-0.025
cut-0.025
 ad-0.023
year-0.023
Token that
Token ID326
Feature activation+1.81e-01

Pos logit contributions
cut+0.027
Me+0.027
 ad+0.026
year+0.025
igator+0.024
Neg logit contributions
 our-0.013
 loop-0.013
ounce-0.013
 ourselves-0.013
 band-0.012
Token she
Token ID673
Feature activation-3.26e-01

Pos logit contributions
 our+0.005
 enough+0.005
 another+0.004
 band+0.004
 your+0.004
Neg logit contributions
Me-0.010
cut-0.009
umm-0.009
 ad-0.009
year-0.009
Token needed
Token ID2622
Feature activation-9.48e-02

Pos logit contributions
Me+0.010
cut+0.009
umm+0.008
 ad+0.008
year+0.008
Neg logit contributions
 our-0.017
 enough-0.016
 another-0.016
 your-0.015
 ourselves-0.015
Token to
Token ID284
Feature activation-1.11e-01

Pos logit contributions
Me+0.005
cut+0.005
year+0.005
 ad+0.005
Here+0.005
Neg logit contributions
ounce-0.002
 our-0.002
ig-0.002
 loop-0.002
oppy-0.002
Token eat
Token ID4483
Feature activation+1.08e-01

Pos logit contributions
Me+0.005
cut+0.005
thank+0.005
 ad+0.005
 sensation+0.004
Neg logit contributions
ounce-0.004
ig-0.003
 enough-0.003
 our-0.003
azz-0.003
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
Me+0.023
cut+0.022
 ad+0.022
umm+0.021
year+0.021
Neg logit contributions
 our-0.013
 enough-0.011
 another-0.011
 your-0.011
 band-0.011
Token He
Token ID679
Feature activation+6.32e-02

Pos logit contributions
 our+0.007
 another+0.007
 enough+0.007
 Sam+0.006
 your+0.006
Neg logit contributions
Me-0.008
cut-0.007
 ad-0.007
umm-0.007
year-0.007
Token was
Token ID373
Feature activation+5.02e-01

Pos logit contributions
 our+0.003
 enough+0.003
 another+0.003
 ourselves+0.003
 your+0.003
Neg logit contributions
Me-0.002
cut-0.002
 ad-0.002
umm-0.002
year-0.002
Token so
Token ID523
Feature activation-1.69e-01

Pos logit contributions
 our+0.018
 enough+0.018
 another+0.018
 your+0.016
 something+0.016
Neg logit contributions
Me-0.022
umm-0.022
cut-0.021
year-0.020
 ad-0.020
Token excited
Token ID6568
Feature activation+1.45e-02

Pos logit contributions
umm+0.007
cut+0.007
Me+0.006
bridge+0.006
Here+0.006
Neg logit contributions
 our-0.007
 another-0.007
 enough-0.006
 your-0.006
 too-0.006
Token to
Token ID284
Feature activation+1.41e-01

Pos logit contributions
 our+0.000
 another+0.000
 enough+0.000
 band+0.000
 loop+0.000
Neg logit contributions
Me-0.001
umm-0.001
 ad-0.001
cut-0.001
year-0.001
Token share
Token ID2648
Feature activation-5.65e-02

Pos logit contributions
 our+0.004
 enough+0.004
 another+0.003
ounce+0.003
 band+0.003
Neg logit contributions
Me-0.007
 ad-0.007
cut-0.007
year-0.007
umm-0.007
Token what
Token ID644
Feature activation-4.45e-01

Pos logit contributions
Me+0.003
 ad+0.003
cut+0.003
year+0.003
umm+0.003
Neg logit contributions
 our-0.001
 enough-0.001
oppy-0.001
 another-0.001
ounce-0.001
Token he
Token ID339
Feature activation+4.17e-02

Pos logit contributions
Me+0.021
umm+0.020
cut+0.019
year+0.019
 ad+0.019
Neg logit contributions
 our-0.015
 enough-0.014
 another-0.014
 your-0.013
 band-0.013
Token had
Token ID550
Feature activation+3.35e-01

Pos logit contributions
 another+0.002
 our+0.002
 enough+0.002
 later+0.002
 band+0.002
Neg logit contributions
umm-0.002
Me-0.002
cut-0.002
year-0.001
rol-0.001
Token learned
Token ID4499
Feature activation+2.27e-01

Pos logit contributions
 our+0.012
 enough+0.012
 another+0.012
 your+0.011
 band+0.011
Neg logit contributions
Me-0.015
umm-0.014
cut-0.014
 ad-0.013
year-0.013
Token!
Token ID0
Feature activation+1.79e-01

Pos logit contributions
 our+0.012
 enough+0.011
 another+0.011
 your+0.010
 band+0.010
Neg logit contributions
Me-0.027
umm-0.026
cut-0.026
year-0.025
 ad-0.025
Token<|endoftext|>
Token ID50256
Feature activation+2.63e-01

Pos logit contributions
 our+0.005
 another+0.005
 enough+0.005
 Sam+0.005
 your+0.005
Neg logit contributions
Me-0.009
cut-0.009
 ad-0.008
umm-0.008
year-0.008
TokenMom
Token ID29252
Feature activation-1.54e-01

Pos logit contributions
 another+0.007
 our+0.007
 enough+0.007
 something+0.007
 later+0.006
Neg logit contributions
cut-0.014
umm-0.013
Me-0.013
 ad-0.013
year-0.013
Tokenmy
Token ID1820
Feature activation+4.37e-01

Pos logit contributions
Me+0.008
umm+0.007
cut+0.007
 ad+0.007
year+0.007
Neg logit contributions
 our-0.005
 enough-0.004
 another-0.004
 your-0.004
 band-0.004
Token was
Token ID373
Feature activation+4.27e-01

Pos logit contributions
 our+0.011
 enough+0.010
 another+0.010
 ourselves+0.010
 band+0.009
Neg logit contributions
Me-0.023
cut-0.022
 ad-0.022
umm-0.021
year-0.021
Token getting
Token ID1972
Feature activation-2.86e-01

Pos logit contributions
 our+0.014
 enough+0.013
 another+0.013
 your+0.012
 band+0.012
Neg logit contributions
Me-0.020
umm-0.019
cut-0.019
 ad-0.018
year-0.018
Token ready
Token ID3492
Feature activation-5.64e-02

Pos logit contributions
Me+0.012
year+0.011
cut+0.011
 ad+0.011
umm+0.011
Neg logit contributions
 our-0.011
 enough-0.010
 another-0.010
 loop-0.009
ounce-0.009
Token to
Token ID284
Feature activation-3.97e-01

Pos logit contributions
Me+0.003
 ad+0.003
cut+0.003
umm+0.003
year+0.003
Neg logit contributions
 our-0.001
 enough-0.001
 another-0.001
 loop-0.001
 ourselves-0.001
Token bake
Token ID28450
Feature activation-1.18e-01

Pos logit contributions
Me+0.017
 ad+0.016
he+0.015
umm+0.015
thank+0.014
Neg logit contributions
ounce-0.014
ig-0.013
 enough-0.013
 scar-0.012
 appearances-0.012
Token something
Token ID1223
Feature activation-5.29e-01

Pos logit contributions
Me+0.006
umm+0.006
 ad+0.006
cut+0.005
year+0.005
Neg logit contributions
 our-0.003
ounce-0.003
 appearances-0.003
oppy-0.003
 wing-0.003
Token special
Token ID2041
Feature activation+1.25e-01

Pos logit contributions
Me+0.024
cut+0.022
umm+0.022
 ad+0.021
year+0.021
Neg logit contributions
 our-0.019
 enough-0.018
 another-0.017
 your-0.016
 band-0.016
Token going
Token ID1016
Feature activation-6.60e-01

Pos logit contributions
 our+0.005
 enough+0.004
 another+0.004
 your+0.004
 my+0.004
Neg logit contributions
umm-0.007
cut-0.007
Me-0.007
Here-0.006
bridge-0.006
Token to
Token ID284
Feature activation-4.73e-01

Pos logit contributions
Me+0.031
 ad+0.030
year+0.030
thank+0.030
 surrounding+0.029
Neg logit contributions
 appearances-0.019
 our-0.018
oppy-0.018
 ourselves-0.018
 wing-0.017
Token have
Token ID423
Feature activation-2.51e-02

Pos logit contributions
Me+0.021
thank+0.018
 ad+0.018
Here+0.017
he+0.017
Neg logit contributions
ig-0.020
ounce-0.018
 enough-0.017
oppy-0.016
ppy-0.015
Token a
Token ID257
Feature activation-6.05e-01

Pos logit contributions
Me+0.001
 ad+0.001
Here+0.001
year+0.001
umm+0.001
Neg logit contributions
ounce-0.000
 our-0.000
oppy-0.000
ig-0.000
 appearances-0.000
Token special
Token ID2041
Feature activation-4.40e-01

Pos logit contributions
Me+0.027
 ad+0.024
umm+0.024
cut+0.024
year+0.023
Neg logit contributions
 our-0.021
 enough-0.020
 ourselves-0.020
 another-0.019
ounce-0.018
Token birthday
Token ID10955
Feature activation-1.64e+00

Pos logit contributions
Me+0.018
 ad+0.017
umm+0.017
cut+0.017
thank+0.016
Neg logit contributions
 our-0.016
 ourselves-0.015
 enough-0.015
ounce-0.015
 another-0.014
Token party
Token ID2151
Feature activation-5.52e-01

Pos logit contributions
Me+0.082
umm+0.078
 ad+0.076
cut+0.076
year+0.075
Neg logit contributions
 our-0.048
 enough-0.044
 another-0.043
 ourselves-0.042
 your-0.041
Token with
Token ID351
Feature activation-6.21e-01

Pos logit contributions
Me+0.028
umm+0.027
cut+0.026
 ad+0.026
year+0.026
Neg logit contributions
 our-0.016
 enough-0.014
 another-0.014
 your-0.013
 band-0.013
Token cakes
Token ID30849
Feature activation-9.44e-01

Pos logit contributions
Me+0.037
cut+0.036
umm+0.035
 ad+0.034
year+0.034
Neg logit contributions
 our-0.013
 another-0.012
 enough-0.012
 your-0.011
 band-0.010
Token,
Token ID11
Feature activation+2.97e-01

Pos logit contributions
Me+0.051
umm+0.049
cut+0.048
 ad+0.048
year+0.047
Neg logit contributions
 our-0.024
 enough-0.022
 another-0.021
 your-0.020
 band-0.020
Token games
Token ID1830
Feature activation-2.50e-02

Pos logit contributions
 our+0.009
 another+0.008
 enough+0.008
 your+0.008
 band+0.008
Neg logit contributions
Me-0.015
umm-0.014
cut-0.014
year-0.014
 ad-0.014
Token a
Token ID257
Feature activation-4.79e-01

Pos logit contributions
Me+0.021
 ad+0.020
year+0.020
cut+0.019
umm+0.019
Neg logit contributions
 our-0.005
ounce-0.004
 enough-0.004
oppy-0.004
ig-0.004
Token great
Token ID1049
Feature activation-2.20e-01

Pos logit contributions
Me+0.018
umm+0.016
cut+0.016
 ad+0.016
year+0.015
Neg logit contributions
 our-0.020
 enough-0.019
 another-0.019
 ourselves-0.019
ounce-0.018
Token job
Token ID1693
Feature activation-1.17e+00

Pos logit contributions
Me+0.009
umm+0.009
 ad+0.008
cut+0.008
year+0.008
Neg logit contributions
 our-0.009
 enough-0.008
 ourselves-0.008
 another-0.008
 your-0.008
Token measuring
Token ID15964
Feature activation-1.49e-01

Pos logit contributions
Me+0.060
cut+0.057
umm+0.057
 ad+0.057
year+0.056
Neg logit contributions
 our-0.032
 enough-0.030
 another-0.029
 band-0.027
 your-0.027
Token her
Token ID607
Feature activation+2.95e-01

Pos logit contributions
Me+0.008
cut+0.007
 ad+0.007
umm+0.007
year+0.007
Neg logit contributions
 our-0.004
 enough-0.003
ounce-0.003
 another-0.003
oppy-0.003
Token idea
Token ID2126
Feature activation-1.94e+00

Pos logit contributions
 our+0.009
 enough+0.009
 another+0.008
ounce+0.008
 your+0.008
Neg logit contributions
Me-0.014
umm-0.013
 ad-0.013
cut-0.013
 capt-0.012
Token.
Token ID13
Feature activation+3.73e-01

Pos logit contributions
Me+0.104
umm+0.098
cut+0.098
 ad+0.096
year+0.095
Neg logit contributions
 our-0.051
 enough-0.047
 another-0.046
 your-0.043
 band-0.042
Token
Token ID198
Feature activation+2.62e-01

Pos logit contributions
 our+0.014
 another+0.013
 enough+0.013
 your+0.013
 band+0.012
Neg logit contributions
Me-0.016
cut-0.015
umm-0.015
 ad-0.014
year-0.014
Token
Token ID198
Feature activation+6.67e-01

Pos logit contributions
 our+0.007
 another+0.007
 band+0.006
 your+0.006
 later+0.006
Neg logit contributions
umm-0.015
 redd-0.013
cut-0.013
Me-0.013
rated-0.013
TokenThe
Token ID464
Feature activation+2.42e-02

Pos logit contributions
 our+0.025
 another+0.023
 enough+0.023
 band+0.022
 your+0.022
Neg logit contributions
Me-0.028
umm-0.028
cut-0.027
 ad-0.026
year-0.025
Token little
Token ID1310
Feature activation+4.24e-02

Pos logit contributions
 our+0.001
 another+0.001
 enough+0.001
 band+0.001
 brother+0.001
Neg logit contributions
Me-0.001
cut-0.001
umm-0.001
year-0.001
 ad-0.001
Token followed
Token ID3940
Feature activation-3.99e-02

Pos logit contributions
 our+0.007
 ourselves+0.007
 appearances+0.007
 your+0.007
 another+0.006
Neg logit contributions
Me-0.005
 ad-0.005
thank-0.005
year-0.005
cut-0.005
Token her
Token ID607
Feature activation+1.25e-03

Pos logit contributions
Me+0.002
 ad+0.002
cut+0.002
year+0.002
umm+0.002
Neg logit contributions
 our-0.001
 enough-0.001
 ourselves-0.001
ounce-0.001
ig-0.001
Token mom
Token ID1995
Feature activation-2.08e-01

Pos logit contributions
 our+0.000
 enough+0.000
 another+0.000
 band+0.000
 your+0.000
Neg logit contributions
Me-0.000
cut-0.000
umm-0.000
year-0.000
 ad-0.000
Token to
Token ID284
Feature activation-2.28e-01

Pos logit contributions
Me+0.011
cut+0.010
umm+0.010
 ad+0.010
year+0.010
Neg logit contributions
 our-0.006
 enough-0.005
 another-0.005
 your-0.005
 ourselves-0.005
Token the
Token ID262
Feature activation-4.12e-01

Pos logit contributions
Me+0.012
 ad+0.011
cut+0.010
thank+0.010
year+0.010
Neg logit contributions
 our-0.006
ounce-0.006
 enough-0.006
ig-0.006
oppy-0.006
Token cash
Token ID5003
Feature activation-1.60e+00

Pos logit contributions
Me+0.018
umm+0.016
cut+0.016
year+0.016
 ad+0.016
Neg logit contributions
 our-0.015
 enough-0.014
 another-0.014
 band-0.014
 your-0.014
Tokenier
Token ID959
Feature activation-1.73e-01

Pos logit contributions
Me+0.047
cut+0.047
year+0.047
umm+0.044
 ad+0.044
Neg logit contributions
 our-0.081
 loop-0.072
 your-0.072
 another-0.071
 my-0.071
Token to
Token ID284
Feature activation-9.65e-02

Pos logit contributions
Me+0.009
cut+0.009
umm+0.009
 ad+0.008
year+0.008
Neg logit contributions
 our-0.005
 enough-0.004
 another-0.004
 your-0.004
 band-0.004
Token buy
Token ID2822
Feature activation-8.35e-02

Pos logit contributions
Me+0.005
cut+0.004
 ad+0.004
umm+0.004
year+0.004
Neg logit contributions
 our-0.003
 enough-0.003
 another-0.003
 ourselves-0.003
ounce-0.003
Token a
Token ID257
Feature activation+4.16e-02

Pos logit contributions
Me+0.005
umm+0.005
cut+0.005
 ad+0.005
year+0.005
Neg logit contributions
 our-0.002
 enough-0.002
 another-0.001
 your-0.001
 band-0.001
Token toy
Token ID13373
Feature activation+2.51e-01

Pos logit contributions
 our+0.001
 enough+0.001
 ourselves+0.001
 another+0.001
 your+0.001
Neg logit contributions
Me-0.002
cut-0.002
 ad-0.002
umm-0.002
year-0.002
Token this
Token ID428
Feature activation-5.19e-01

Pos logit contributions
Me+0.022
 ad+0.021
year+0.021
cut+0.021
Here+0.020
Neg logit contributions
ounce-0.008
 our-0.008
 enough-0.008
ig-0.007
 loop-0.007
Token for
Token ID329
Feature activation-3.08e-01

Pos logit contributions
Me+0.024
 ad+0.023
cut+0.022
umm+0.022
thank+0.021
Neg logit contributions
 our-0.015
ounce-0.015
 enough-0.014
oppy-0.014
 ourselves-0.014
Token my
Token ID616
Feature activation-5.73e-01

Pos logit contributions
Me+0.015
 ad+0.014
thank+0.014
Here+0.013
year+0.013
Neg logit contributions
ounce-0.009
ig-0.008
azz-0.007
oppy-0.007
 loop-0.007
Token friend
Token ID1545
Feature activation-2.38e-01

Pos logit contributions
Me+0.029
cut+0.028
year+0.027
umm+0.027
 ad+0.027
Neg logit contributions
 our-0.017
 another-0.016
 enough-0.016
 band-0.015
 your-0.015
Token's
Token ID338
Feature activation+1.82e-01

Pos logit contributions
Me+0.010
thank+0.010
 ad+0.010
cut+0.009
 ju+0.009
Neg logit contributions
 appearances-0.007
 ourselves-0.007
 our-0.007
 loop-0.007
ig-0.006
Token birthday
Token ID10955
Feature activation-1.66e+00

Pos logit contributions
 our+0.007
 enough+0.006
 another+0.006
ounce+0.006
 your+0.006
Neg logit contributions
Me-0.008
umm-0.007
 ad-0.007
cut-0.007
year-0.007
Token?"
Token ID1701
Feature activation+2.13e-01

Pos logit contributions
Me+0.083
umm+0.081
 ad+0.078
cut+0.077
thank+0.076
Neg logit contributions
 our-0.044
 ourselves-0.040
 enough-0.039
 appearances-0.039
 another-0.038
Token Her
Token ID2332
Feature activation+4.33e-01

Pos logit contributions
 our+0.008
 another+0.008
 enough+0.008
 your+0.007
 band+0.007
Neg logit contributions
Me-0.009
umm-0.008
cut-0.008
year-0.008
 ad-0.008
Token mom
Token ID1995
Feature activation+2.27e-01

Pos logit contributions
 our+0.011
 enough+0.011
 another+0.010
 ourselves+0.009
 your+0.009
Neg logit contributions
Me-0.024
umm-0.022
cut-0.022
 ad-0.022
year-0.021
Token said
Token ID531
Feature activation+3.11e-01

Pos logit contributions
 our+0.008
 another+0.008
 enough+0.008
 band+0.007
 your+0.007
Neg logit contributions
Me-0.010
umm-0.010
cut-0.009
year-0.009
 ad-0.009
Token,
Token ID11
Feature activation+2.38e-01

Pos logit contributions
 our+0.005
 enough+0.004
 another+0.004
 your+0.003
 band+0.003
Neg logit contributions
Me-0.020
umm-0.020
cut-0.020
year-0.019
 ad-0.019
Token shelf
Token ID18316
Feature activation-3.43e-01

Pos logit contributions
year+0.009
Me+0.009
umm+0.009
cut+0.009
 capt+0.009
Neg logit contributions
 our-0.009
 another-0.009
 your-0.008
 my-0.008
 band-0.008
Token and
Token ID290
Feature activation-1.13e-01

Pos logit contributions
Me+0.020
cut+0.019
umm+0.019
 ad+0.019
year+0.019
Neg logit contributions
 our-0.007
 enough-0.006
 another-0.006
 band-0.006
 your-0.006
Token went
Token ID1816
Feature activation-2.84e-01

Pos logit contributions
Me+0.005
 ad+0.004
cut+0.004
year+0.004
umm+0.004
Neg logit contributions
 our-0.004
 enough-0.004
 ourselves-0.004
 appearances-0.004
 another-0.004
Token to
Token ID284
Feature activation-6.27e-01

Pos logit contributions
Me+0.015
year+0.014
 ad+0.014
cut+0.013
umm+0.013
Neg logit contributions
 our-0.007
 enough-0.007
 appearances-0.007
 another-0.007
 ourselves-0.007
Token the
Token ID262
Feature activation-7.33e-01

Pos logit contributions
Me+0.029
 ad+0.025
he+0.023
Here+0.023
 hallway+0.023
Neg logit contributions
ig-0.020
 enough-0.019
 wing-0.019
 appearances-0.019
ounce-0.019
Token checkout
Token ID28006
Feature activation-2.06e+00

Pos logit contributions
cut+0.030
Me+0.030
year+0.029
umm+0.029
 ad+0.028
Neg logit contributions
 our-0.028
 another-0.027
 enough-0.026
 your-0.025
 band-0.025
Token.
Token ID13
Feature activation+6.74e-01

Pos logit contributions
Me+0.100
cut+0.094
umm+0.093
 ad+0.093
year+0.092
Neg logit contributions
 our-0.064
 enough-0.059
 another-0.058
 your-0.054
 band-0.054
Token They
Token ID1119
Feature activation+1.03e-02

Pos logit contributions
 our+0.027
 enough+0.025
 another+0.025
 your+0.024
 band+0.023
Neg logit contributions
Me-0.027
cut-0.026
umm-0.025
 ad-0.025
year-0.025
Token paid
Token ID3432
Feature activation-5.84e-01

Pos logit contributions
 another+0.000
 enough+0.000
 our+0.000
 later+0.000
 band+0.000
Neg logit contributions
umm-0.000
Me-0.000
cut-0.000
bridge-0.000
 capt-0.000
Token for
Token ID329
Feature activation+1.21e-02

Pos logit contributions
Me+0.030
 ad+0.029
year+0.028
cut+0.028
umm+0.027
Neg logit contributions
 appearances-0.013
 loop-0.013
 our-0.013
 wing-0.012
 band-0.012
Token it
Token ID340
Feature activation-9.83e-02

Pos logit contributions
ounce+0.000
ig+0.000
oppy+0.000
ap+0.000
 wing+0.000
Neg logit contributions
Me-0.001
 ad-0.001
year-0.000
cut-0.000
Here-0.000