TOP ACTIVATIONS
MAX = 3.431

someday and ride in the bright van again . <|endoftext|> Once
to play outside in the bright sunshine . One day ,
liked to work in his beautiful garden . Every day ,
to play outside in the sunny weather . One day ,
my stood outside in the bright sunshine . As the sun
flower , growing in the sunny garden . Every day ,
loved playing outside in the bright sun . One day ,
the garden with a smile on his face .
to play outside in the bright sun . One day ,
to play outside in the bright sunshine . One day ,
to play outside in the sunny weather . One day he
loved playing outside in the bright sunshine with his friends .
lots of fun in the beautiful nature of the park .
, " What is wrong , Mom my ?" His Mom
She gave each of her friends a beautiful painted picture .
loved to play in her beautiful garden every day . <|endoftext|>
the football around with his friends . One
outside to walk in the wet lawn . Little Johnny was
. " Sorry , Farmer Joe ," Sam says
" What do you mean , Joe ?"

MIN ACTIVATIONS
MIN = -3.338

ops inside . The l oll ip ops were red and
come inside for dinner . But St itch didn 't want
Wh isk ers saw a can of tuna on the kitchen
. Mom says they are av oc ados . Sara wants
waited and waited , but no one came to find him
and wanted to rest , but they didn 't have a
the water was really rushing and it looked filthy . Cotton
found some blue berries , but they were bitter .
, Lily saw her l oll ipop had melted completely .
decided to take her favourite caul iflower with her . She
a little green frog appeared . The frog said , "
she noticed that the l oll ipop was gone . She
the prize . Ba iley and her friends started
Nut ty !" said a chip m unk named Ch ippy
they found his lost l oll ipop . The little boy
As they were walking , Velvet noticed their friend 's
plate were two ste amed caul iflower fl ore ts .
to explore the river . But when they got there ,
They finished their l oll ip ops and threw the
. There were l oll ip ops , ch ocol ates

INTERVAL 1.738 - 3.431
CONTAINS 0.414%

, so she touched the empty ash tr ay . Suddenly
. When she was done swinging , she
he saw a man inside the van who looked very uncomfortable
They did not like harsh people . They hoped they would
She stretched it up in the air and then stretched it

INTERVAL 0.046 - 1.738
CONTAINS 46.106%

They made a lot of noise to scare the big cat
adventure . He knew every day he would keep trying new
and she accidentally knocked over a v ase . Her mom
said to Jonas , � � € � � You can
switch . That is fair ." Lily 's mom says .

INTERVAL -1.646 - 0.046
CONTAINS 53.084%

my said . L ily couldn 't wait to
the broken te ap ot . The moral of
to Jack 's rescue . He helped Jack stop the boat
But the brother said he would love to come to
house . The re ind eer thanked Jane for

INTERVAL -3.338 - -1.646
CONTAINS 0.395%

," Sue says . " Let 's try again ."
time , past many houses and trees and hills . Ben
He saw a pair of scissors on the ground and decided
L ily said , " No , Tommy . That 's
could go to the park , but only if she cleaned
Token someday
Token ID25580
Feature activation+2.91e-01

Pos logit contributions
 avail+0.004
ounced+0.004
 enough+0.004
 breed+0.004
 batch+0.004
Neg logit contributions
 stop-0.004
orry-0.003
ibr-0.003
od-0.003
 mantle-0.003
Token and
Token ID290
Feature activation-4.71e-01

Pos logit contributions
 stop+0.005
ibr+0.004
od+0.004
orry+0.004
 clean+0.004
Neg logit contributions
 avail-0.011
ounced-0.010
 batch-0.010
 enough-0.010
 breed-0.010
Token ride
Token ID6594
Feature activation-1.08e-01

Pos logit contributions
 avail+0.016
ounced+0.015
 enough+0.015
 breed+0.014
 batch+0.014
Neg logit contributions
ibr-0.009
od-0.009
orry-0.008
 stop-0.008
iz-0.008
Token in
Token ID287
Feature activation+3.79e-01

Pos logit contributions
 avail+0.004
 enough+0.004
ounced+0.004
 choice+0.003
 batch+0.003
Neg logit contributions
 stop-0.002
ibr-0.002
orry-0.002
od-0.002
 mantle-0.002
Token the
Token ID262
Feature activation+2.02e+00

Pos logit contributions
 stop+0.004
 Kate+0.003
 clean+0.003
 un+0.003
 Jos+0.003
Neg logit contributions
ounced-0.017
 avail-0.016
ucker-0.016
 breed-0.015
 batch-0.015
Token bright
Token ID6016
Feature activation+3.43e+00

Pos logit contributions
od+0.059
ibr+0.059
 Kate+0.055
 Mir+0.055
orry+0.053
Neg logit contributions
 problem-0.049
 avail-0.047
 enough-0.046
 choice-0.046
ounced-0.044
Token van
Token ID5719
Feature activation+7.72e-01

Pos logit contributions
ibr+0.087
 stop+0.085
od+0.083
orry+0.082
 mantle+0.079
Neg logit contributions
 avail-0.095
ounced-0.087
 enough-0.087
 problem-0.085
 choice-0.084
Token again
Token ID757
Feature activation+1.44e-01

Pos logit contributions
 stop+0.015
ibr+0.015
orry+0.015
od+0.014
iz+0.014
Neg logit contributions
 avail-0.025
ounced-0.024
 enough-0.024
 batch-0.023
 problem-0.022
Token.
Token ID13
Feature activation+2.74e-01

Pos logit contributions
 stop+0.002
ibr+0.002
od+0.002
orry+0.002
 clean+0.002
Neg logit contributions
 avail-0.005
ounced-0.005
 batch-0.005
 enough-0.005
 breed-0.005
Token<|endoftext|>
Token ID50256
Feature activation-4.60e-02

Pos logit contributions
 stop+0.006
od+0.005
ibr+0.005
orry+0.005
 un+0.005
Neg logit contributions
 avail-0.009
ounced-0.008
 batch-0.008
 enough-0.008
 breed-0.008
TokenOnce
Token ID7454
Feature activation+4.55e-01

Pos logit contributions
 avail+0.001
ounced+0.001
 enough+0.001
 batch+0.001
 breed+0.001
Neg logit contributions
 stop-0.001
ibr-0.001
orry-0.001
od-0.001
 clean-0.001
Token to
Token ID284
Feature activation+3.47e-01

Pos logit contributions
 avail+0.008
ounced+0.007
 batch+0.007
 enough+0.007
 breed+0.007
Neg logit contributions
 stop-0.011
orry-0.010
ibr-0.010
od-0.010
iz-0.010
Token play
Token ID711
Feature activation+5.25e-01

Pos logit contributions
orry+0.010
ibr+0.010
 mantle+0.010
vell+0.010
iz+0.009
Neg logit contributions
 avail-0.008
 saddle-0.007
 breed-0.007
 choice-0.007
 loaf-0.007
Token outside
Token ID2354
Feature activation+8.33e-01

Pos logit contributions
orry+0.012
 Jos+0.012
iz+0.011
od+0.011
amps+0.011
Neg logit contributions
 batch-0.015
 avail-0.015
 saddle-0.014
 bone-0.014
 enough-0.014
Token in
Token ID287
Feature activation+1.20e+00

Pos logit contributions
ibr+0.015
orry+0.014
 stop+0.014
od+0.013
iz+0.013
Neg logit contributions
 avail-0.028
 enough-0.028
 batch-0.027
ounced-0.027
 breed-0.027
Token the
Token ID262
Feature activation+1.77e+00

Pos logit contributions
ibr+0.027
orry+0.025
vell+0.024
od+0.023
 stop+0.023
Neg logit contributions
 avail-0.040
 enough-0.036
grown-0.035
 batch-0.035
 kinds-0.034
Token bright
Token ID6016
Feature activation+3.37e+00

Pos logit contributions
od+0.053
ibr+0.053
vell+0.050
orry+0.049
 stop+0.048
Neg logit contributions
 avail-0.040
 enough-0.038
 problem-0.038
Today-0.036
grown-0.036
Token sunshine
Token ID34488
Feature activation+1.44e+00

Pos logit contributions
ibr+0.095
od+0.088
 stop+0.088
orry+0.088
vell+0.086
Neg logit contributions
 avail-0.086
 kinds-0.082
aters-0.082
 enough-0.081
 types-0.080
Token.
Token ID13
Feature activation+4.08e-01

Pos logit contributions
ibr+0.027
 stop+0.027
orry+0.027
iz+0.025
od+0.025
Neg logit contributions
 avail-0.048
 enough-0.046
 batch-0.045
ounced-0.045
 breed-0.044
Token One
Token ID1881
Feature activation+2.60e-01

Pos logit contributions
ibr+0.015
 stop+0.015
orry+0.015
od+0.015
vell+0.015
Neg logit contributions
 avail-0.006
 batch-0.006
 breed-0.005
 enough-0.005
ounced-0.005
Token day
Token ID1110
Feature activation-3.96e-01

Pos logit contributions
ibr+0.009
od+0.008
 mantle+0.008
vell+0.008
 Jos+0.008
Neg logit contributions
 avail-0.005
 breed-0.005
 batch-0.005
 problem-0.005
ited-0.005
Token,
Token ID11
Feature activation-8.94e-01

Pos logit contributions
 avail+0.012
ounced+0.012
 enough+0.011
 problem+0.011
 batch+0.011
Neg logit contributions
 mantle-0.009
 stop-0.009
ibr-0.009
od-0.008
orry-0.008
Token liked
Token ID8288
Feature activation-2.60e-01

Pos logit contributions
 stop+0.009
ibr+0.009
orry+0.009
od+0.009
 mantle+0.009
Neg logit contributions
ounced-0.007
 avail-0.007
 breed-0.007
 batch-0.006
 enough-0.006
Token to
Token ID284
Feature activation+1.84e-01

Pos logit contributions
 avail+0.006
ounced+0.005
 enough+0.005
 batch+0.005
 breed+0.005
Neg logit contributions
 stop-0.008
ibr-0.008
orry-0.008
od-0.008
 mantle-0.007
Token work
Token ID670
Feature activation-2.05e-02

Pos logit contributions
orry+0.004
ibr+0.004
od+0.004
iz+0.004
 mantle+0.004
Neg logit contributions
 avail-0.006
 choice-0.005
 breed-0.005
 saddle-0.005
 batch-0.005
Token in
Token ID287
Feature activation+6.25e-01

Pos logit contributions
 avail+0.001
ounced+0.001
 enough+0.001
 batch+0.001
 problem+0.001
Neg logit contributions
 stop-0.000
orry-0.000
ibr-0.000
od-0.000
 mantle-0.000
Token his
Token ID465
Feature activation+2.04e+00

Pos logit contributions
 stop+0.010
ibr+0.010
orry+0.010
od+0.010
 mantle+0.009
Neg logit contributions
 avail-0.023
ounced-0.021
 enough-0.021
 batch-0.021
 breed-0.021
Token beautiful
Token ID4950
Feature activation+3.32e+00

Pos logit contributions
od+0.054
ibr+0.051
 stop+0.051
 Jos+0.051
 Kate+0.050
Neg logit contributions
 avail-0.054
 problem-0.052
ounced-0.050
 enough-0.050
 batch-0.050
Token garden
Token ID11376
Feature activation+9.85e-01

Pos logit contributions
 stop+0.075
 clean+0.069
ibr+0.069
 un+0.068
orry+0.068
Neg logit contributions
 avail-0.102
ounced-0.100
 enough-0.092
 batch-0.091
 breed-0.091
Token.
Token ID13
Feature activation+5.85e-01

Pos logit contributions
 stop+0.018
orry+0.017
ibr+0.017
od+0.017
 Jos+0.016
Neg logit contributions
 avail-0.033
ounced-0.032
 enough-0.032
 batch-0.031
 breed-0.031
Token Every
Token ID3887
Feature activation+4.22e-02

Pos logit contributions
 stop+0.014
ibr+0.013
od+0.013
orry+0.013
 clean+0.013
Neg logit contributions
 avail-0.017
ounced-0.016
 enough-0.015
 batch-0.015
 breed-0.015
Token day
Token ID1110
Feature activation+5.72e-01

Pos logit contributions
 stop+0.001
 clean+0.001
orry+0.001
 un+0.001
ibr+0.001
Neg logit contributions
 avail-0.001
ounced-0.001
 dollars-0.001
ucker-0.001
 enough-0.001
Token,
Token ID11
Feature activation+4.97e-01

Pos logit contributions
 stop+0.010
ibr+0.009
od+0.009
orry+0.009
 clean+0.009
Neg logit contributions
 avail-0.021
ounced-0.020
 batch-0.019
 enough-0.019
 breed-0.019
Token to
Token ID284
Feature activation+2.47e-01

Pos logit contributions
 avail+0.013
ounced+0.012
 enough+0.011
 batch+0.011
 breed+0.011
Neg logit contributions
 stop-0.016
ibr-0.015
orry-0.015
od-0.015
 mantle-0.014
Token play
Token ID711
Feature activation+3.95e-01

Pos logit contributions
orry+0.007
ibr+0.007
od+0.007
iz+0.007
vell+0.006
Neg logit contributions
 avail-0.006
 choice-0.006
 breed-0.006
 saddle-0.006
 batch-0.005
Token outside
Token ID2354
Feature activation+9.93e-01

Pos logit contributions
 Jos+0.010
orry+0.009
od+0.009
iz+0.009
 Mir+0.009
Neg logit contributions
 bone-0.011
 batch-0.011
 avail-0.011
 pistols-0.010
 sausage-0.010
Token in
Token ID287
Feature activation+1.23e+00

Pos logit contributions
ibr+0.017
orry+0.016
 stop+0.016
od+0.015
iz+0.015
Neg logit contributions
 avail-0.035
 enough-0.034
 batch-0.033
ounced-0.033
 problem-0.032
Token the
Token ID262
Feature activation+1.82e+00

Pos logit contributions
ibr+0.026
orry+0.025
od+0.024
 stop+0.023
vell+0.023
Neg logit contributions
 avail-0.042
 enough-0.038
 batch-0.036
grown-0.036
 kinds-0.036
Token sunny
Token ID27737
Feature activation+3.29e+00

Pos logit contributions
ibr+0.051
od+0.051
orry+0.047
vell+0.047
 stop+0.047
Neg logit contributions
 avail-0.045
 enough-0.042
 problem-0.042
ounced-0.040
grown-0.040
Token weather
Token ID6193
Feature activation+1.25e+00

Pos logit contributions
ibr+0.107
od+0.104
vell+0.103
orry+0.101
 stop+0.099
Neg logit contributions
 avail-0.069
 batch-0.063
 kinds-0.062
 problem-0.062
 breed-0.061
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
ibr+0.027
orry+0.026
 stop+0.025
iz+0.024
od+0.024
Neg logit contributions
 avail-0.038
 batch-0.037
 enough-0.036
 breed-0.036
 kinds-0.035
Token One
Token ID1881
Feature activation+3.18e-01

Pos logit contributions
ibr+0.039
 stop+0.038
od+0.037
orry+0.037
vell+0.036
Neg logit contributions
 avail-0.029
ounced-0.025
 batch-0.025
 breed-0.025
 enough-0.025
Token day
Token ID1110
Feature activation+6.98e-01

Pos logit contributions
ibr+0.010
od+0.010
 mantle+0.010
vell+0.010
 Jos+0.010
Neg logit contributions
 avail-0.006
 breed-0.006
 problem-0.006
 batch-0.006
ounced-0.006
Token,
Token ID11
Feature activation+7.07e-01

Pos logit contributions
 stop+0.013
ibr+0.012
od+0.012
orry+0.012
 mantle+0.011
Neg logit contributions
 avail-0.024
ounced-0.024
 enough-0.022
 batch-0.022
 breed-0.022
Tokenmy
Token ID1820
Feature activation-5.11e-02

Pos logit contributions
ibr+0.007
 stop+0.006
orry+0.006
od+0.006
 mantle+0.006
Neg logit contributions
 avail-0.004
ounced-0.004
 enough-0.004
 batch-0.003
 breed-0.003
Token stood
Token ID6204
Feature activation-5.99e-01

Pos logit contributions
 avail+0.001
 enough+0.001
ounced+0.001
ited+0.001
grown+0.001
Neg logit contributions
ibr-0.001
 stop-0.001
orry-0.001
 mantle-0.001
od-0.001
Token outside
Token ID2354
Feature activation+3.33e-01

Pos logit contributions
 avail+0.028
ounced+0.027
 breed+0.026
ited+0.026
 dollars+0.025
Neg logit contributions
 clean-0.005
 stop-0.005
 un-0.004
 nicely-0.004
 Jos-0.003
Token in
Token ID287
Feature activation+4.73e-01

Pos logit contributions
 stop+0.006
ibr+0.006
orry+0.006
od+0.006
iz+0.006
Neg logit contributions
 avail-0.011
ounced-0.010
 enough-0.010
 batch-0.010
 breed-0.010
Token the
Token ID262
Feature activation+1.45e+00

Pos logit contributions
 stop+0.006
ibr+0.005
od+0.005
 clean+0.005
orry+0.005
Neg logit contributions
 avail-0.019
ounced-0.019
 enough-0.018
 batch-0.018
 breed-0.017
Token bright
Token ID6016
Feature activation+3.27e+00

Pos logit contributions
od+0.043
ibr+0.043
vell+0.041
orry+0.039
 stop+0.038
Neg logit contributions
 batch-0.033
 avail-0.033
 enough-0.033
 problem-0.033
 sausage-0.031
Token sunshine
Token ID34488
Feature activation+1.52e+00

Pos logit contributions
ibr+0.092
 stop+0.088
orry+0.087
od+0.087
vell+0.083
Neg logit contributions
 avail-0.084
 enough-0.076
 batch-0.076
 problem-0.076
ounced-0.074
Token.
Token ID13
Feature activation-2.84e-01

Pos logit contributions
ibr+0.028
 stop+0.027
orry+0.027
od+0.026
iz+0.025
Neg logit contributions
 avail-0.052
ounced-0.049
 enough-0.049
 batch-0.048
 breed-0.047
Token As
Token ID1081
Feature activation-1.28e-01

Pos logit contributions
 avail+0.008
 enough+0.008
ounced+0.008
aters+0.008
 batch+0.007
Neg logit contributions
ibr-0.007
orry-0.006
od-0.006
 stop-0.006
vell-0.006
Token the
Token ID262
Feature activation+1.63e-01

Pos logit contributions
 avail+0.004
ounced+0.004
 enough+0.004
 problem+0.004
 batch+0.004
Neg logit contributions
 stop-0.002
ibr-0.002
orry-0.002
od-0.002
 mantle-0.002
Token sun
Token ID4252
Feature activation+1.44e+00

Pos logit contributions
od+0.003
ibr+0.003
orry+0.003
 stop+0.003
 nicely+0.003
Neg logit contributions
 avail-0.005
ounced-0.005
 problem-0.005
 enough-0.005
 breed-0.005
Tokenflower
Token ID25547
Feature activation-7.62e-02

Pos logit contributions
 stop+0.028
ibr+0.027
orry+0.027
od+0.027
 un+0.027
Neg logit contributions
 avail-0.012
ounced-0.012
 enough-0.010
 batch-0.010
 breed-0.010
Token,
Token ID11
Feature activation-5.81e-01

Pos logit contributions
 breed+0.002
 avail+0.002
 enough+0.002
 batch+0.002
ounced+0.002
Neg logit contributions
 stop-0.002
ibr-0.001
orry-0.001
 un-0.001
 mantle-0.001
Token growing
Token ID3957
Feature activation+1.60e-01

Pos logit contributions
ounced+0.016
 avail+0.016
 enough+0.016
 breed+0.016
 batch+0.015
Neg logit contributions
ibr-0.014
 stop-0.014
orry-0.013
 mantle-0.013
iz-0.012
Token in
Token ID287
Feature activation+3.89e-01

Pos logit contributions
 stop+0.003
ibr+0.003
orry+0.003
od+0.003
 mantle+0.003
Neg logit contributions
 avail-0.005
ounced-0.005
 enough-0.005
 batch-0.005
 breed-0.005
Token the
Token ID262
Feature activation+1.88e+00

Pos logit contributions
 stop+0.006
ibr+0.005
od+0.005
 clean+0.005
orry+0.005
Neg logit contributions
 avail-0.015
ounced-0.015
 enough-0.014
 batch-0.014
 breed-0.014
Token sunny
Token ID27737
Feature activation+3.26e+00

Pos logit contributions
ibr+0.053
od+0.051
 Kate+0.049
orry+0.047
vell+0.047
Neg logit contributions
 avail-0.045
 loaf-0.045
 batch-0.045
 problem-0.044
 enough-0.044
Token garden
Token ID11376
Feature activation+3.59e-01

Pos logit contributions
ibr+0.098
vell+0.092
od+0.091
orry+0.091
 stop+0.090
Neg logit contributions
 avail-0.075
 batch-0.072
 kinds-0.071
 enough-0.070
 loaf-0.070
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
ibr+0.007
 stop+0.007
orry+0.007
od+0.007
 Kate+0.007
Neg logit contributions
 enough-0.011
 avail-0.011
ounced-0.011
 batch-0.010
 breed-0.010
Token Every
Token ID3887
Feature activation+1.57e-01

Pos logit contributions
 stop+0.000
ibr+0.000
orry+0.000
od+0.000
 clean+0.000
Neg logit contributions
 avail-0.000
ounced-0.000
 enough-0.000
 batch-0.000
 breed-0.000
Token day
Token ID1110
Feature activation+2.68e-01

Pos logit contributions
 stop+0.003
 clean+0.003
orry+0.003
iz+0.003
 un+0.003
Neg logit contributions
 avail-0.006
ounced-0.006
ucker-0.005
 dollars-0.005
aters-0.005
Token,
Token ID11
Feature activation-1.22e-01

Pos logit contributions
 stop+0.004
 clean+0.003
 Kate+0.003
ibr+0.003
od+0.003
Neg logit contributions
 avail-0.010
ounced-0.010
 batch-0.010
 breed-0.010
ucker-0.010
Token loved
Token ID6151
Feature activation-4.98e-01

Pos logit contributions
ibr+0.007
 stop+0.007
od+0.007
 Jos+0.007
 mantle+0.007
Neg logit contributions
ounced-0.006
 breed-0.006
die-0.006
 problem-0.005
 bone-0.005
Token playing
Token ID2712
Feature activation+6.29e-01

Pos logit contributions
 avail+0.012
ounced+0.011
 enough+0.011
 batch+0.010
 breed+0.010
Neg logit contributions
 stop-0.015
ibr-0.014
orry-0.014
od-0.014
 mantle-0.013
Token outside
Token ID2354
Feature activation+6.48e-01

Pos logit contributions
 Jos+0.016
orry+0.015
iz+0.014
od+0.014
 Mir+0.014
Neg logit contributions
 bone-0.017
 batch-0.017
 sausage-0.016
 pistols-0.016
 saddle-0.015
Token in
Token ID287
Feature activation+1.03e+00

Pos logit contributions
ibr+0.011
orry+0.010
 stop+0.010
od+0.010
iz+0.009
Neg logit contributions
 avail-0.023
 enough-0.022
ounced-0.022
 batch-0.022
 problem-0.021
Token the
Token ID262
Feature activation+1.75e+00

Pos logit contributions
ibr+0.022
orry+0.021
od+0.020
 stop+0.019
vell+0.019
Neg logit contributions
 avail-0.035
 enough-0.032
grown-0.031
 batch-0.030
 kinds-0.030
Token bright
Token ID6016
Feature activation+3.24e+00

Pos logit contributions
ibr+0.056
od+0.056
vell+0.053
orry+0.052
 Kate+0.051
Neg logit contributions
 avail-0.034
grown-0.034
 enough-0.034
Today-0.034
 problem-0.033
Token sun
Token ID4252
Feature activation+1.27e+00

Pos logit contributions
ibr+0.089
od+0.082
orry+0.081
 stop+0.081
vell+0.081
Neg logit contributions
 avail-0.085
 kinds-0.082
aters-0.081
 types-0.080
 enough-0.080
Token.
Token ID13
Feature activation+7.57e-01

Pos logit contributions
ibr+0.025
orry+0.025
 stop+0.025
iz+0.023
od+0.023
Neg logit contributions
 avail-0.042
 enough-0.039
ounced-0.039
 batch-0.038
 breed-0.038
Token One
Token ID1881
Feature activation+1.91e-01

Pos logit contributions
ibr+0.026
 stop+0.026
od+0.025
orry+0.025
 mantle+0.024
Neg logit contributions
 avail-0.014
ounced-0.012
 batch-0.012
 enough-0.012
 breed-0.011
Token day
Token ID1110
Feature activation+2.46e-01

Pos logit contributions
ibr+0.006
od+0.006
vell+0.006
 mantle+0.006
 Jos+0.006
Neg logit contributions
 avail-0.004
 breed-0.004
 batch-0.003
 problem-0.003
ited-0.003
Token,
Token ID11
Feature activation+1.49e-01

Pos logit contributions
 stop+0.005
ibr+0.005
 mantle+0.005
orry+0.005
od+0.005
Neg logit contributions
 avail-0.008
ounced-0.008
 enough-0.007
 problem-0.007
 batch-0.007
Token the
Token ID262
Feature activation+1.82e+00

Pos logit contributions
 stop+0.024
ibr+0.023
orry+0.022
od+0.022
 mantle+0.021
Neg logit contributions
 avail-0.050
ounced-0.048
 enough-0.047
 batch-0.045
 problem-0.045
Token garden
Token ID11376
Feature activation+1.11e+00

Pos logit contributions
 stop+0.038
ibr+0.036
od+0.035
orry+0.035
 mantle+0.034
Neg logit contributions
 avail-0.059
ounced-0.057
 enough-0.053
 batch-0.053
 problem-0.052
Token with
Token ID351
Feature activation+2.54e+00

Pos logit contributions
ibr+0.027
orry+0.027
iz+0.025
 stop+0.025
od+0.025
Neg logit contributions
 enough-0.031
 avail-0.029
ounced-0.029
 batch-0.029
 problem-0.027
Token a
Token ID257
Feature activation+1.80e+00

Pos logit contributions
 stop+0.039
ibr+0.036
od+0.035
orry+0.035
 mantle+0.033
Neg logit contributions
 avail-0.096
ounced-0.093
 enough-0.089
 batch-0.088
 breed-0.087
Token smile
Token ID8212
Feature activation+1.89e+00

Pos logit contributions
 stop+0.039
 clean+0.035
ibr+0.035
 un+0.035
orry+0.034
Neg logit contributions
 avail-0.058
ounced-0.056
 enough-0.052
 batch-0.052
 breed-0.052
Token on
Token ID319
Feature activation+3.24e+00

Pos logit contributions
 stop+0.041
ibr+0.040
orry+0.039
 mantle+0.038
od+0.038
Neg logit contributions
 avail-0.057
ounced-0.056
 enough-0.054
 batch-0.052
 breed-0.052
Token his
Token ID465
Feature activation+1.46e+00

Pos logit contributions
ibr+0.123
nes+0.114
vell+0.113
onel+0.107
emic+0.106
Neg logit contributions
 avail-0.065
 enough-0.058
 sausage-0.056
ened-0.055
 apples-0.055
Token face
Token ID1986
Feature activation+2.24e+00

Pos logit contributions
ibr+0.031
od+0.027
iz+0.027
aund+0.026
 Kate+0.026
Neg logit contributions
 problem-0.045
 avail-0.044
 choice-0.044
 coll-0.044
 saddle-0.044
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
ibr+0.055
orry+0.053
iz+0.051
 mantle+0.051
 stop+0.049
Neg logit contributions
 enough-0.064
 avail-0.059
ounced-0.058
 batch-0.056
 problem-0.055
Token
Token ID198
Feature activation+4.53e-01

Pos logit contributions
 stop+0.038
vell+0.037
ibr+0.036
od+0.035
orry+0.035
Neg logit contributions
 types-0.025
 properties-0.025
 kinds-0.025
 breed-0.025
 bone-0.025
Token
Token ID198
Feature activation+9.50e-01

Pos logit contributions
 stop+0.010
ibr+0.009
 mantle+0.009
orry+0.009
od+0.009
Neg logit contributions
 avail-0.013
ounced-0.013
 batch-0.012
 enough-0.012
 breed-0.012
Token to
Token ID284
Feature activation+1.18e-01

Pos logit contributions
 avail+0.010
ounced+0.010
 batch+0.009
 enough+0.009
 breed+0.009
Neg logit contributions
 stop-0.014
ibr-0.014
orry-0.014
od-0.014
iz-0.013
Token play
Token ID711
Feature activation+4.11e-01

Pos logit contributions
orry+0.003
ibr+0.003
od+0.003
vell+0.003
iz+0.003
Neg logit contributions
 avail-0.003
 breed-0.003
 choice-0.003
 saddle-0.003
 batch-0.003
Token outside
Token ID2354
Feature activation+8.19e-01

Pos logit contributions
 Jos+0.009
orry+0.009
od+0.008
iz+0.008
amps+0.008
Neg logit contributions
 batch-0.012
 avail-0.012
 saddle-0.011
 bone-0.011
 breed-0.011
Token in
Token ID287
Feature activation+1.07e+00

Pos logit contributions
ibr+0.014
orry+0.014
 stop+0.013
od+0.013
iz+0.012
Neg logit contributions
 avail-0.028
 enough-0.028
 batch-0.028
ounced-0.027
 breed-0.027
Token the
Token ID262
Feature activation+1.57e+00

Pos logit contributions
ibr+0.024
orry+0.022
vell+0.021
od+0.021
 stop+0.021
Neg logit contributions
 avail-0.035
 enough-0.032
 batch-0.031
grown-0.031
 kinds-0.031
Token bright
Token ID6016
Feature activation+3.21e+00

Pos logit contributions
od+0.049
ibr+0.048
vell+0.046
orry+0.044
 Kate+0.044
Neg logit contributions
 avail-0.032
 problem-0.032
 enough-0.032
grown-0.031
Today-0.031
Token sun
Token ID4252
Feature activation+1.58e+00

Pos logit contributions
ibr+0.087
 stop+0.081
od+0.080
vell+0.079
orry+0.079
Neg logit contributions
 avail-0.085
 kinds-0.081
aters-0.081
 batch-0.080
 enough-0.080
Token.
Token ID13
Feature activation+4.31e-01

Pos logit contributions
orry+0.033
ibr+0.033
 stop+0.032
iz+0.031
od+0.030
Neg logit contributions
 avail-0.050
 enough-0.048
 batch-0.047
ounced-0.047
 breed-0.046
Token One
Token ID1881
Feature activation+2.57e-01

Pos logit contributions
 stop+0.015
ibr+0.014
orry+0.014
od+0.014
 un+0.014
Neg logit contributions
 avail-0.008
ounced-0.007
 batch-0.007
 enough-0.007
 breed-0.007
Token day
Token ID1110
Feature activation-3.25e-01

Pos logit contributions
ibr+0.009
od+0.008
vell+0.008
 mantle+0.008
 Jos+0.008
Neg logit contributions
 avail-0.005
 breed-0.005
 batch-0.005
 problem-0.005
ited-0.005
Token,
Token ID11
Feature activation-7.83e-01

Pos logit contributions
 avail+0.010
ounced+0.010
 enough+0.009
 problem+0.009
 batch+0.009
Neg logit contributions
 stop-0.007
 mantle-0.007
ibr-0.007
orry-0.007
od-0.007
Token to
Token ID284
Feature activation+2.61e-01

Pos logit contributions
 avail+0.010
ounced+0.009
 enough+0.008
 batch+0.008
 breed+0.008
Neg logit contributions
 stop-0.012
ibr-0.011
orry-0.011
od-0.011
 clean-0.011
Token play
Token ID711
Feature activation+3.79e-01

Pos logit contributions
orry+0.007
ibr+0.007
iz+0.007
od+0.007
vell+0.007
Neg logit contributions
 avail-0.007
 saddle-0.006
 breed-0.006
 batch-0.006
 choice-0.006
Token outside
Token ID2354
Feature activation+8.44e-01

Pos logit contributions
 Jos+0.009
orry+0.008
iz+0.008
od+0.008
 Mir+0.008
Neg logit contributions
 bone-0.011
 avail-0.011
 batch-0.011
 saddle-0.010
 pistols-0.010
Token in
Token ID287
Feature activation+1.14e+00

Pos logit contributions
ibr+0.013
 stop+0.013
orry+0.013
od+0.012
 mantle+0.011
Neg logit contributions
 avail-0.031
 enough-0.030
ounced-0.029
 batch-0.029
 breed-0.029
Token the
Token ID262
Feature activation+1.67e+00

Pos logit contributions
ibr+0.024
orry+0.023
od+0.022
 stop+0.022
vell+0.021
Neg logit contributions
 avail-0.039
 enough-0.035
grown-0.034
 batch-0.033
 kinds-0.033
Token bright
Token ID6016
Feature activation+3.21e+00

Pos logit contributions
od+0.052
ibr+0.051
vell+0.049
orry+0.048
 Kate+0.048
Neg logit contributions
 avail-0.034
 problem-0.034
 enough-0.034
grown-0.033
 bone-0.033
Token sunshine
Token ID34488
Feature activation+1.52e+00

Pos logit contributions
ibr+0.089
 stop+0.083
orry+0.082
od+0.082
vell+0.080
Neg logit contributions
 avail-0.083
 kinds-0.078
aters-0.077
 problem-0.077
 types-0.077
Token.
Token ID13
Feature activation+7.13e-01

Pos logit contributions
 stop+0.027
ibr+0.027
orry+0.026
od+0.025
iz+0.025
Neg logit contributions
 avail-0.052
ounced-0.049
 enough-0.049
 batch-0.048
 breed-0.048
Token One
Token ID1881
Feature activation+2.79e-01

Pos logit contributions
ibr+0.026
 stop+0.025
od+0.025
orry+0.025
vell+0.024
Neg logit contributions
 avail-0.012
 batch-0.010
 breed-0.010
ounced-0.010
 enough-0.010
Token day
Token ID1110
Feature activation+3.88e-01

Pos logit contributions
ibr+0.010
od+0.009
vell+0.009
 mantle+0.009
 Jos+0.009
Neg logit contributions
 breed-0.005
 avail-0.005
able-0.005
 problem-0.005
 batch-0.005
Token,
Token ID11
Feature activation+4.19e-01

Pos logit contributions
 stop+0.008
ibr+0.008
 mantle+0.007
od+0.007
orry+0.007
Neg logit contributions
 avail-0.013
ounced-0.012
 enough-0.012
 problem-0.011
 batch-0.011
Token to
Token ID284
Feature activation+2.11e-01

Pos logit contributions
 avail+0.005
ounced+0.005
 enough+0.005
 batch+0.005
 breed+0.005
Neg logit contributions
 stop-0.006
ibr-0.005
 clean-0.005
od-0.005
 un-0.005
Token play
Token ID711
Feature activation+6.22e-01

Pos logit contributions
ibr+0.005
orry+0.005
od+0.005
vell+0.004
 mantle+0.004
Neg logit contributions
 avail-0.006
 breed-0.006
 choice-0.006
 batch-0.006
 saddle-0.005
Token outside
Token ID2354
Feature activation+1.04e+00

Pos logit contributions
orry+0.014
 Jos+0.013
iz+0.012
od+0.012
 Mir+0.012
Neg logit contributions
 avail-0.018
 batch-0.018
 bone-0.017
 enough-0.017
 pistols-0.017
Token in
Token ID287
Feature activation+1.41e+00

Pos logit contributions
 stop+0.018
ibr+0.017
orry+0.017
od+0.016
 mantle+0.016
Neg logit contributions
 avail-0.037
ounced-0.035
 enough-0.034
 batch-0.034
 breed-0.033
Token the
Token ID262
Feature activation+1.61e+00

Pos logit contributions
ibr+0.026
orry+0.024
 stop+0.024
od+0.024
 mantle+0.022
Neg logit contributions
 avail-0.050
 enough-0.046
grown-0.044
 batch-0.044
 choice-0.044
Token sunny
Token ID27737
Feature activation+3.21e+00

Pos logit contributions
od+0.043
ibr+0.043
 stop+0.040
orry+0.040
vell+0.039
Neg logit contributions
 avail-0.042
 enough-0.039
 problem-0.039
 batch-0.038
grown-0.037
Token weather
Token ID6193
Feature activation+1.26e+00

Pos logit contributions
ibr+0.099
od+0.096
vell+0.095
orry+0.093
 stop+0.093
Neg logit contributions
 avail-0.073
 kinds-0.066
 batch-0.065
 enough-0.064
 problem-0.064
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
ibr+0.024
orry+0.024
 stop+0.024
od+0.022
iz+0.022
Neg logit contributions
 avail-0.041
 enough-0.039
 batch-0.038
ounced-0.038
 breed-0.038
Token One
Token ID1881
Feature activation+4.13e-01

Pos logit contributions
 avail+0.004
ounced+0.003
 enough+0.003
 batch+0.003
 breed+0.003
Neg logit contributions
 stop-0.004
ibr-0.004
 clean-0.004
 un-0.004
orry-0.004
Token day
Token ID1110
Feature activation-7.58e-01

Pos logit contributions
ibr+0.012
od+0.012
 stop+0.012
 mantle+0.012
vell+0.011
Neg logit contributions
 avail-0.009
ounced-0.009
 breed-0.009
 batch-0.008
 enough-0.008
Token he
Token ID339
Feature activation+3.59e-01

Pos logit contributions
 avail+0.025
ounced+0.024
 enough+0.023
 batch+0.023
 problem+0.022
Neg logit contributions
 stop-0.015
ibr-0.014
orry-0.014
od-0.014
 mantle-0.014
Token loved
Token ID6151
Feature activation-3.70e-01

Pos logit contributions
ibr+0.006
 stop+0.006
 Jos+0.006
orry+0.006
vell+0.006
Neg logit contributions
ounced-0.004
 breed-0.004
 bone-0.004
die-0.004
 enough-0.004
Token playing
Token ID2712
Feature activation+6.85e-01

Pos logit contributions
 avail+0.008
ounced+0.008
 enough+0.008
 batch+0.007
 breed+0.007
Neg logit contributions
 stop-0.011
ibr-0.011
orry-0.011
od-0.010
iz-0.010
Token outside
Token ID2354
Feature activation+7.67e-01

Pos logit contributions
 Jos+0.017
orry+0.017
od+0.016
iz+0.015
 Mir+0.015
Neg logit contributions
 bone-0.019
 batch-0.018
 sausage-0.017
 pistols-0.017
 saddle-0.017
Token in
Token ID287
Feature activation+1.09e+00

Pos logit contributions
ibr+0.013
orry+0.012
 stop+0.012
od+0.012
iz+0.011
Neg logit contributions
 avail-0.027
 enough-0.027
 batch-0.026
ounced-0.026
 breed-0.025
Token the
Token ID262
Feature activation+1.67e+00

Pos logit contributions
ibr+0.022
orry+0.021
od+0.020
 stop+0.020
vell+0.019
Neg logit contributions
 avail-0.037
 enough-0.034
grown-0.033
 batch-0.033
 kinds-0.032
Token bright
Token ID6016
Feature activation+3.16e+00

Pos logit contributions
od+0.052
ibr+0.051
vell+0.049
orry+0.047
 Kate+0.047
Neg logit contributions
 avail-0.035
 enough-0.034
grown-0.034
 problem-0.034
Today-0.032
Token sunshine
Token ID34488
Feature activation+1.30e+00

Pos logit contributions
ibr+0.083
 stop+0.078
od+0.078
orry+0.076
vell+0.076
Neg logit contributions
 avail-0.086
 enough-0.081
 kinds-0.080
aters-0.080
 types-0.079
Token with
Token ID351
Feature activation+1.91e+00

Pos logit contributions
 stop+0.025
ibr+0.025
orry+0.025
iz+0.023
od+0.023
Neg logit contributions
 avail-0.043
 enough-0.041
ounced-0.040
 batch-0.040
 breed-0.040
Token his
Token ID465
Feature activation+6.18e-01

Pos logit contributions
ibr+0.037
orry+0.036
 stop+0.036
od+0.035
 mantle+0.033
Neg logit contributions
 avail-0.065
 enough-0.060
ounced-0.059
 batch-0.058
 breed-0.058
Token friends
Token ID2460
Feature activation+2.11e+00

Pos logit contributions
 stop+0.016
ibr+0.015
od+0.015
orry+0.015
 clean+0.014
Neg logit contributions
 avail-0.017
ounced-0.017
 enough-0.015
 batch-0.015
 breed-0.015
Token.
Token ID13
Feature activation+5.52e-01

Pos logit contributions
orry+0.047
 stop+0.046
ibr+0.046
 mantle+0.045
iz+0.044
Neg logit contributions
 enough-0.062
 avail-0.059
 breed-0.058
 batch-0.057
ounced-0.057
Token lots
Token ID6041
Feature activation+1.87e-01

Pos logit contributions
 avail+0.012
ounced+0.011
ited+0.011
 breed+0.010
 batch+0.010
Neg logit contributions
 stop-0.006
 clean-0.006
 un-0.006
od-0.005
ibr-0.005
Token of
Token ID286
Feature activation-7.97e-01

Pos logit contributions
 stop+0.005
ibr+0.005
od+0.005
orry+0.005
 mantle+0.005
Neg logit contributions
 avail-0.005
ounced-0.005
 enough-0.004
 batch-0.004
 breed-0.004
Token fun
Token ID1257
Feature activation+9.93e-01

Pos logit contributions
 avail+0.021
ounced+0.020
 enough+0.019
 batch+0.018
 breed+0.018
Neg logit contributions
 stop-0.022
ibr-0.020
od-0.020
orry-0.020
 clean-0.020
Token in
Token ID287
Feature activation+1.18e-01

Pos logit contributions
 stop+0.018
ibr+0.016
orry+0.016
od+0.016
 clean+0.015
Neg logit contributions
 avail-0.036
ounced-0.034
 enough-0.032
 batch-0.032
 breed-0.032
Token the
Token ID262
Feature activation+1.85e+00

Pos logit contributions
 stop+0.002
ibr+0.001
orry+0.001
od+0.001
 mantle+0.001
Neg logit contributions
 avail-0.005
ounced-0.005
 enough-0.004
 batch-0.004
 breed-0.004
Token beautiful
Token ID4950
Feature activation+3.15e+00

Pos logit contributions
ibr+0.054
od+0.053
 stop+0.052
 Jos+0.051
 Kate+0.050
Neg logit contributions
 problem-0.043
 avail-0.043
ounced-0.040
 enough-0.039
 batch-0.039
Token nature
Token ID3450
Feature activation+1.78e+00

Pos logit contributions
 stop+0.064
 clean+0.063
 un+0.059
 nicely+0.057
orry+0.056
Neg logit contributions
ounced-0.103
 avail-0.103
 dollars-0.093
ucker-0.092
 breed-0.092
Token of
Token ID286
Feature activation+5.79e-01

Pos logit contributions
ibr+0.039
 stop+0.039
orry+0.039
iz+0.038
 Jos+0.037
Neg logit contributions
 enough-0.052
 avail-0.050
 batch-0.050
ounced-0.049
 breed-0.048
Token the
Token ID262
Feature activation+1.60e+00

Pos logit contributions
 stop+0.012
ibr+0.011
od+0.011
orry+0.011
 clean+0.010
Neg logit contributions
 avail-0.019
ounced-0.019
 enough-0.018
 batch-0.018
 breed-0.017
Token park
Token ID3952
Feature activation+9.86e-01

Pos logit contributions
ibr+0.050
od+0.049
 stop+0.049
 Jos+0.048
 Kate+0.048
Neg logit contributions
 problem-0.035
 avail-0.033
ounced-0.031
 enough-0.031
 batch-0.030
Token.
Token ID13
Feature activation+3.50e-01

Pos logit contributions
 stop+0.020
ibr+0.019
orry+0.019
 mantle+0.019
iz+0.019
Neg logit contributions
 enough-0.030
 avail-0.030
ounced-0.029
 batch-0.028
 breed-0.028
Token,
Token ID11
Feature activation+7.81e-02

Pos logit contributions
 avail+0.014
ounced+0.013
ited+0.012
 breed+0.012
 batch+0.012
Neg logit contributions
 stop-0.010
od-0.009
 clean-0.009
ibr-0.009
 un-0.009
Token "
Token ID366
Feature activation+3.20e-01

Pos logit contributions
 stop+0.001
ibr+0.001
orry+0.001
od+0.001
 mantle+0.001
Neg logit contributions
 avail-0.003
ounced-0.002
 enough-0.002
 batch-0.002
 breed-0.002
TokenWhat
Token ID2061
Feature activation+6.50e-01

Pos logit contributions
 stop+0.008
 mantle+0.007
 un+0.007
 clean+0.007
 nicely+0.007
Neg logit contributions
 avail-0.010
ounced-0.009
 choice-0.009
 problem-0.009
 enough-0.009
Token is
Token ID318
Feature activation+1.67e+00

Pos logit contributions
 stop+0.015
od+0.013
 clean+0.013
ibr+0.013
 un+0.013
Neg logit contributions
 avail-0.021
ounced-0.021
ited-0.019
 batch-0.019
 breed-0.018
Token wrong
Token ID2642
Feature activation+1.18e+00

Pos logit contributions
ibr+0.039
orry+0.038
 stop+0.038
iz+0.037
od+0.035
Neg logit contributions
 avail-0.050
 enough-0.049
 problem-0.047
ounced-0.046
 breed-0.043
Token,
Token ID11
Feature activation+3.13e+00

Pos logit contributions
 stop+0.022
 clean+0.020
 un+0.020
 nicely+0.019
od+0.019
Neg logit contributions
ounced-0.042
 avail-0.041
ited-0.040
 batch-0.037
ucker-0.037
Token Mom
Token ID11254
Feature activation+2.79e-01

Pos logit contributions
ibr+0.054
od+0.050
orry+0.049
iz+0.045
 stop+0.044
Neg logit contributions
 avail-0.112
 enough-0.108
 batch-0.105
ounced-0.104
 choice-0.103
Tokenmy
Token ID1820
Feature activation+7.52e-01

Pos logit contributions
 stop+0.008
 clean+0.007
 un+0.007
od+0.007
 Jos+0.007
Neg logit contributions
 avail-0.007
ounced-0.007
 breed-0.006
 batch-0.006
ited-0.006
Token?"
Token ID1701
Feature activation+1.36e+00

Pos logit contributions
 stop+0.021
 clean+0.019
 un+0.019
od+0.019
ibr+0.018
Neg logit contributions
 avail-0.020
ounced-0.019
 batch-0.018
ited-0.017
 breed-0.017
Token His
Token ID2399
Feature activation-4.57e-01

Pos logit contributions
ibr+0.034
orry+0.033
vell+0.032
od+0.031
 stop+0.030
Neg logit contributions
 avail-0.039
 batch-0.036
 enough-0.035
 flavors-0.034
 choice-0.034
Token Mom
Token ID11254
Feature activation-1.94e-01

Pos logit contributions
 avail+0.021
ounced+0.020
ited+0.020
 breed+0.019
 dollars+0.019
Neg logit contributions
 stop-0.005
 clean-0.004
 un-0.003
 Kate-0.003
 nicely-0.003
Token She
Token ID1375
Feature activation+1.01e+00

Pos logit contributions
 stop+0.000
vell+0.000
 knock+0.000
 clean+0.000
 arrive+0.000
Neg logit contributions
 batch-0.000
ucker-0.000
ounced-0.000
aters-0.000
 enough-0.000
Token gave
Token ID2921
Feature activation-3.15e-01

Pos logit contributions
 stop+0.023
 clean+0.021
od+0.021
orry+0.021
ibr+0.021
Neg logit contributions
 avail-0.033
ounced-0.030
 breed-0.028
 enough-0.028
 batch-0.028
Token each
Token ID1123
Feature activation-2.35e-01

Pos logit contributions
 avail+0.012
ounced+0.011
 enough+0.011
 batch+0.011
 breed+0.011
Neg logit contributions
 stop-0.005
ibr-0.005
orry-0.004
od-0.004
 clean-0.004
Token of
Token ID286
Feature activation+3.41e-01

Pos logit contributions
 avail+0.007
 batch+0.007
ounced+0.007
 breed+0.007
 enough+0.007
Neg logit contributions
ibr-0.005
 stop-0.005
 mantle-0.005
od-0.005
 un-0.005
Token her
Token ID607
Feature activation+1.05e+00

Pos logit contributions
 stop+0.007
ibr+0.007
 mantle+0.006
orry+0.006
vell+0.006
Neg logit contributions
 avail-0.011
 enough-0.010
 batch-0.010
 breed-0.010
ounced-0.010
Token friends
Token ID2460
Feature activation+3.12e+00

Pos logit contributions
 stop+0.025
ibr+0.024
od+0.023
 mantle+0.023
orry+0.023
Neg logit contributions
 avail-0.030
ounced-0.029
 batch-0.028
 enough-0.027
 problem-0.027
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
ibr+0.086
 knock+0.085
od+0.082
orry+0.081
iz+0.080
Neg logit contributions
 enough-0.080
 batch-0.072
 kinds-0.068
 tighter-0.066
able-0.065
Token beautiful
Token ID4950
Feature activation+2.07e+00

Pos logit contributions
 stop+0.022
ibr+0.020
orry+0.020
od+0.020
 clean+0.019
Neg logit contributions
 avail-0.035
ounced-0.034
 enough-0.032
 batch-0.031
 breed-0.031
Token painted
Token ID13055
Feature activation+1.33e+00

Pos logit contributions
 stop+0.047
 clean+0.043
orry+0.040
od+0.040
 un+0.040
Neg logit contributions
 avail-0.066
ounced-0.065
 enough-0.059
 breed-0.058
 dollars-0.057
Token picture
Token ID4286
Feature activation+8.93e-01

Pos logit contributions
 stop+0.029
 clean+0.026
od+0.026
ibr+0.025
orry+0.025
Neg logit contributions
 avail-0.044
ounced-0.043
 breed-0.039
ited-0.039
 enough-0.039
Token.
Token ID13
Feature activation-3.88e-01

Pos logit contributions
 stop+0.021
ibr+0.021
orry+0.021
 mantle+0.020
od+0.020
Neg logit contributions
 avail-0.024
 enough-0.024
ounced-0.023
 batch-0.023
 breed-0.022
Token loved
Token ID6151
Feature activation+7.17e-02

Pos logit contributions
ibr+0.013
 stop+0.012
iz+0.012
od+0.012
orry+0.012
Neg logit contributions
 avail-0.016
ounced-0.016
 enough-0.015
 breed-0.014
 batch-0.014
Token to
Token ID284
Feature activation+1.90e-01

Pos logit contributions
 stop+0.002
ibr+0.002
orry+0.002
iz+0.002
od+0.002
Neg logit contributions
 avail-0.002
ounced-0.002
 enough-0.002
 batch-0.002
 breed-0.002
Token play
Token ID711
Feature activation+9.67e-01

Pos logit contributions
ibr+0.004
orry+0.004
od+0.004
iz+0.004
vell+0.004
Neg logit contributions
 avail-0.006
 choice-0.005
 enough-0.005
 breed-0.005
 batch-0.005
Token in
Token ID287
Feature activation+6.72e-01

Pos logit contributions
orry+0.020
ibr+0.019
iz+0.019
 Jos+0.019
 stop+0.019
Neg logit contributions
 avail-0.030
 enough-0.028
 batch-0.028
 breed-0.026
 loaf-0.026
Token her
Token ID607
Feature activation+1.97e+00

Pos logit contributions
 stop+0.009
ibr+0.008
orry+0.008
od+0.008
 clean+0.008
Neg logit contributions
 avail-0.027
ounced-0.026
 enough-0.025
 batch-0.024
 breed-0.024
Token beautiful
Token ID4950
Feature activation+3.10e+00

Pos logit contributions
ibr+0.047
od+0.046
 stop+0.045
 Jos+0.043
iz+0.042
Neg logit contributions
 avail-0.057
 problem-0.055
ounced-0.054
 batch-0.053
 enough-0.053
Token garden
Token ID11376
Feature activation+9.17e-01

Pos logit contributions
 stop+0.053
 clean+0.048
 un+0.046
od+0.046
orry+0.046
Neg logit contributions
 avail-0.113
ounced-0.110
 enough-0.103
 breed-0.102
 batch-0.102
Token every
Token ID790
Feature activation-5.05e-01

Pos logit contributions
 stop+0.015
ibr+0.014
orry+0.014
od+0.014
 clean+0.013
Neg logit contributions
 avail-0.034
ounced-0.032
 enough-0.031
 batch-0.031
 breed-0.030
Token day
Token ID1110
Feature activation+1.99e-01

Pos logit contributions
ounced+0.018
 avail+0.017
ucker+0.016
 pistols+0.016
 dollars+0.016
Neg logit contributions
 stop-0.010
 clean-0.010
 un-0.009
 Kate-0.009
orry-0.008
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
 stop+0.004
ibr+0.004
iz+0.004
orry+0.004
ilet+0.004
Neg logit contributions
 avail-0.006
 enough-0.006
ounced-0.006
 batch-0.006
 breed-0.006
Token<|endoftext|>
Token ID50256
Feature activation-2.59e-01

Pos logit contributions
 stop+0.003
vell+0.003
ibr+0.003
od+0.003
orry+0.003
Neg logit contributions
 avail-0.004
 batch-0.004
 breed-0.003
 enough-0.003
ounced-0.003
Token the
Token ID262
Feature activation+6.63e-02

Pos logit contributions
 stop+0.000
 clean+0.000
 nicely+0.000
 un+0.000
 Kate+0.000
Neg logit contributions
ounced-0.001
 avail-0.001
ucker-0.001
ited-0.001
 breed-0.001
Token football
Token ID4346
Feature activation+3.50e-01

Pos logit contributions
 stop+0.002
 clean+0.001
 cool+0.001
 un+0.001
 mantle+0.001
Neg logit contributions
 avail-0.002
ounced-0.002
ucker-0.002
 batch-0.002
 breed-0.002
Token around
Token ID1088
Feature activation+1.38e+00

Pos logit contributions
ibr+0.007
 stop+0.007
orry+0.007
 Jos+0.007
iz+0.007
Neg logit contributions
 avail-0.010
ounced-0.010
 enough-0.010
 problem-0.010
 batch-0.010
Token with
Token ID351
Feature activation+1.89e+00

Pos logit contributions
ibr+0.038
vell+0.036
orry+0.035
aund+0.034
iz+0.033
Neg logit contributions
 avail-0.038
 enough-0.034
grown-0.031
joy-0.030
 ones-0.030
Token his
Token ID465
Feature activation+5.14e-01

Pos logit contributions
ibr+0.036
 mantle+0.034
 stop+0.034
orry+0.034
od+0.034
Neg logit contributions
 avail-0.065
 enough-0.061
ounced-0.058
 choice-0.057
 breed-0.056
Token friends
Token ID2460
Feature activation+3.09e+00

Pos logit contributions
 stop+0.012
 clean+0.011
 un+0.011
 nicely+0.010
orry+0.010
Neg logit contributions
 avail-0.017
ounced-0.016
ited-0.015
 batch-0.015
 enough-0.015
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ibr+0.068
orry+0.068
 mantle+0.065
 stop+0.065
iz+0.064
Neg logit contributions
 enough-0.090
 avail-0.087
ounced-0.086
 breed-0.083
 problem-0.081
Token
Token ID220
Feature activation-7.52e-02

Pos logit contributions
 stop+0.020
ibr+0.018
od+0.018
orry+0.017
 clean+0.017
Neg logit contributions
 avail-0.043
ounced-0.041
 enough-0.039
 batch-0.039
 breed-0.038
Token
Token ID198
Feature activation+3.44e-01

Pos logit contributions
 avail+0.002
ounced+0.002
 enough+0.002
 batch+0.002
 breed+0.002
Neg logit contributions
 stop-0.002
ibr-0.001
orry-0.001
 mantle-0.001
od-0.001
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
 stop+0.004
od+0.004
ibr+0.004
orry+0.004
iz+0.004
Neg logit contributions
 avail-0.014
ounced-0.014
 enough-0.013
 batch-0.013
ited-0.013
TokenOne
Token ID3198
Feature activation+3.19e-01

Pos logit contributions
 stop+0.046
od+0.045
orry+0.044
ibr+0.043
iz+0.043
Neg logit contributions
 avail-0.030
ounced-0.030
 batch-0.026
ucker-0.026
 dollars-0.025
Token outside
Token ID2354
Feature activation+9.16e-01

Pos logit contributions
od+0.024
ibr+0.023
orry+0.022
 mantle+0.021
vell+0.020
Neg logit contributions
 enough-0.040
ounced-0.038
 avail-0.038
 saddle-0.037
 problem-0.036
Token to
Token ID284
Feature activation+3.18e-01

Pos logit contributions
ibr+0.023
orry+0.023
 stop+0.022
od+0.022
 mantle+0.021
Neg logit contributions
 avail-0.024
 enough-0.024
 batch-0.023
ounced-0.022
grown-0.022
Token walk
Token ID2513
Feature activation+9.98e-01

Pos logit contributions
ibr+0.008
od+0.008
vell+0.007
orry+0.007
 mantle+0.007
Neg logit contributions
 avail-0.009
 enough-0.009
 batch-0.008
 choice-0.008
 breed-0.008
Token in
Token ID287
Feature activation+7.41e-01

Pos logit contributions
ibr+0.019
 stop+0.018
od+0.018
orry+0.018
 mantle+0.017
Neg logit contributions
 avail-0.034
ounced-0.032
 enough-0.032
 breed-0.030
 batch-0.030
Token the
Token ID262
Feature activation+2.50e+00

Pos logit contributions
 stop+0.009
 clean+0.008
 un+0.007
 Kate+0.007
 Jos+0.007
Neg logit contributions
ounced-0.032
 avail-0.031
 batch-0.029
ucker-0.029
 breed-0.029
Token wet
Token ID9583
Feature activation+3.09e+00

Pos logit contributions
od+0.061
ibr+0.059
orry+0.053
 stop+0.053
 Kate+0.051
Neg logit contributions
 avail-0.073
 problem-0.071
ounced-0.070
 enough-0.069
 batch-0.066
Token lawn
Token ID21133
Feature activation+1.16e+00

Pos logit contributions
ibr+0.067
od+0.065
 stop+0.063
orry+0.062
 mantle+0.059
Neg logit contributions
 avail-0.097
ounced-0.093
 enough-0.089
 batch-0.088
 problem-0.088
Token.
Token ID13
Feature activation-2.05e-02

Pos logit contributions
ibr+0.022
orry+0.022
 stop+0.021
od+0.021
iz+0.020
Neg logit contributions
 avail-0.037
ounced-0.036
 enough-0.036
 batch-0.035
 breed-0.034
Token Little
Token ID7703
Feature activation-2.00e-01

Pos logit contributions
 avail+0.001
ounced+0.001
 enough+0.001
 batch+0.001
aters+0.001
Neg logit contributions
 stop-0.000
ibr-0.000
od-0.000
vell-0.000
orry-0.000
Token Johnny
Token ID15470
Feature activation+1.05e+00

Pos logit contributions
 avail+0.006
ounced+0.006
 enough+0.006
 problem+0.006
 batch+0.006
Neg logit contributions
ibr-0.004
 stop-0.004
od-0.004
orry-0.004
 mantle-0.004
Token was
Token ID373
Feature activation-4.71e-01

Pos logit contributions
 stop+0.026
ibr+0.026
od+0.025
orry+0.025
 mantle+0.024
Neg logit contributions
ounced-0.028
 avail-0.028
 enough-0.026
 batch-0.025
ited-0.025
Token.
Token ID13
Feature activation+1.68e+00

Pos logit contributions
 avail+0.005
ounced+0.005
 batch+0.005
ited+0.005
ucker+0.005
Neg logit contributions
 stop-0.001
 nicely-0.001
 clean-0.001
ily-0.001
 un-0.001
Token
Token ID198
Feature activation+1.58e+00

Pos logit contributions
 stop+0.048
od+0.048
ibr+0.047
vell+0.047
 mantle+0.046
Neg logit contributions
 avail-0.038
 breed-0.036
ounced-0.036
 enough-0.035
 Does-0.033
Token
Token ID198
Feature activation+1.99e+00

Pos logit contributions
 stop+0.036
ibr+0.035
od+0.034
 mantle+0.034
vell+0.033
Neg logit contributions
ounced-0.045
 avail-0.044
 batch-0.042
ucker-0.041
 breed-0.041
Token"
Token ID1
Feature activation+7.54e-01

Pos logit contributions
 stop+0.051
ibr+0.049
od+0.049
 mantle+0.049
 Kate+0.046
Neg logit contributions
ounced-0.053
 avail-0.052
 batch-0.048
 enough-0.047
etchup-0.047
TokenSorry
Token ID14385
Feature activation+1.15e+00

Pos logit contributions
 stop+0.022
ibr+0.021
orry+0.021
od+0.021
 mantle+0.020
Neg logit contributions
 avail-0.019
ounced-0.017
 breed-0.016
ited-0.016
 batch-0.015
Token,
Token ID11
Feature activation+3.08e+00

Pos logit contributions
ibr+0.023
 stop+0.023
 mantle+0.022
orry+0.022
od+0.022
Neg logit contributions
 avail-0.037
ounced-0.035
 enough-0.035
 problem-0.034
 batch-0.033
Token Farmer
Token ID37755
Feature activation+1.04e+00

Pos logit contributions
ibr+0.105
od+0.101
 mantle+0.098
nes+0.097
 nicely+0.096
Neg logit contributions
 avail-0.062
 problem-0.060
 enough-0.059
 batch-0.054
 choice-0.053
Token Joe
Token ID5689
Feature activation+1.24e+00

Pos logit contributions
 stop+0.015
od+0.014
ibr+0.013
orry+0.013
 clean+0.013
Neg logit contributions
 avail-0.040
ounced-0.039
 batch-0.037
 breed-0.036
ited-0.036
Token,"
Token ID553
Feature activation+2.66e+00

Pos logit contributions
 stop+0.033
ibr+0.032
orry+0.031
od+0.031
 mantle+0.031
Neg logit contributions
 avail-0.033
ounced-0.031
 enough-0.029
 batch-0.029
 breed-0.028
Token Sam
Token ID3409
Feature activation+1.19e+00

Pos logit contributions
ibr+0.070
 mantle+0.069
 mant+0.066
vell+0.065
orry+0.065
Neg logit contributions
 avail-0.074
 enough-0.068
ounced-0.065
 problem-0.065
 batch-0.065
Token says
Token ID1139
Feature activation+3.50e-01

Pos logit contributions
 stop+0.040
ibr+0.038
orry+0.038
od+0.038
 clean+0.037
Neg logit contributions
 avail-0.024
ounced-0.022
 enough-0.020
 batch-0.020
 breed-0.019
Token "
Token ID366
Feature activation+5.06e-01

Pos logit contributions
 enough+0.001
ounced+0.001
 breed+0.001
 avail+0.001
 problem+0.001
Neg logit contributions
orry-0.002
ibr-0.002
 mantle-0.002
iz-0.001
ilet-0.001
TokenWhat
Token ID2061
Feature activation+1.92e-01

Pos logit contributions
 stop+0.013
ibr+0.012
od+0.012
orry+0.012
 clean+0.012
Neg logit contributions
 avail-0.014
ounced-0.014
 enough-0.013
 batch-0.013
 breed-0.012
Token do
Token ID466
Feature activation+1.35e+00

Pos logit contributions
 stop+0.004
ibr+0.004
od+0.004
orry+0.004
 clean+0.004
Neg logit contributions
 avail-0.006
ounced-0.006
 enough-0.005
 batch-0.005
 breed-0.005
Token you
Token ID345
Feature activation+6.09e-01

Pos logit contributions
 stop+0.023
 mantle+0.023
ibr+0.022
orry+0.022
od+0.021
Neg logit contributions
 avail-0.048
 enough-0.043
 problem-0.043
ounced-0.043
 breed-0.042
Token mean
Token ID1612
Feature activation+6.81e-01

Pos logit contributions
ibr+0.016
 Kate+0.015
 mantle+0.015
 Mir+0.015
 stop+0.015
Neg logit contributions
 avail-0.015
 problem-0.015
 enough-0.014
ounced-0.014
ited-0.014
Token,
Token ID11
Feature activation+3.08e+00

Pos logit contributions
 stop+0.015
 clean+0.013
od+0.013
ibr+0.013
orry+0.013
Neg logit contributions
 avail-0.022
ounced-0.022
 batch-0.020
ited-0.020
 breed-0.020
Token Joe
Token ID5689
Feature activation+9.84e-01

Pos logit contributions
ibr+0.073
zed+0.072
 mantle+0.068
Her+0.065
nes+0.063
Neg logit contributions
 enough-0.087
 avail-0.086
 problem-0.086
 choice-0.084
 types-0.083
Token?"
Token ID1701
Feature activation+1.22e+00

Pos logit contributions
 stop+0.026
 clean+0.025
od+0.024
 un+0.024
ibr+0.024
Neg logit contributions
 avail-0.027
ounced-0.026
 batch-0.023
 breed-0.023
 enough-0.023
Token
Token ID220
Feature activation+3.13e-01

Pos logit contributions
ibr+0.036
nes+0.035
vell+0.034
 stop+0.032
ilet+0.032
Neg logit contributions
 kinds-0.026
 avail-0.026
 types-0.026
 suggested-0.026
 enough-0.025
Token
Token ID198
Feature activation+1.29e+00

Pos logit contributions
 stop+0.008
 mantle+0.008
ilet+0.007
ibr+0.007
orry+0.007
Neg logit contributions
ounced-0.008
ucker-0.008
 avail-0.008
 enough-0.008
 batch-0.007
Token
Token ID198
Feature activation+1.65e+00

Pos logit contributions
 stop+0.040
 mantle+0.038
ibr+0.037
vell+0.037
 mant+0.036
Neg logit contributions
ounced-0.026
 suggested-0.024
 avail-0.024
Sometimes-0.022
die-0.022
Tokenops
Token ID2840
Feature activation-8.55e-01

Pos logit contributions
 avail+0.067
ounced+0.059
grown+0.055
 breed+0.055
ited+0.054
Neg logit contributions
 stop-0.056
ibr-0.051
 clean-0.050
od-0.049
 un-0.049
Token inside
Token ID2641
Feature activation-7.78e-01

Pos logit contributions
 avail+0.030
ounced+0.029
 batch+0.027
 breed+0.027
 enough+0.027
Neg logit contributions
 stop-0.016
ibr-0.015
od-0.014
 clean-0.014
 un-0.014
Token.
Token ID13
Feature activation-5.31e-01

Pos logit contributions
ounced+0.032
 avail+0.032
 breed+0.029
ited+0.029
irms+0.029
Neg logit contributions
 stop-0.010
 clean-0.009
 Kate-0.009
 un-0.009
 nicely-0.008
Token The
Token ID383
Feature activation-1.02e+00

Pos logit contributions
 avail+0.017
ounced+0.016
 enough+0.016
 batch+0.016
 breed+0.015
Neg logit contributions
 stop-0.011
orry-0.010
ibr-0.010
od-0.010
 clean-0.010
Token l
Token ID300
Feature activation-2.19e+00

Pos logit contributions
 avail+0.032
ounced+0.031
ucker+0.028
 enough+0.028
ited+0.028
Neg logit contributions
 stop-0.024
 clean-0.022
ibr-0.021
 un-0.021
orry-0.020
Tokenoll
Token ID692
Feature activation-3.26e+00

Pos logit contributions
 avail+0.081
ounced+0.079
 enough+0.075
 batch+0.074
 breed+0.074
Neg logit contributions
 stop-0.036
ibr-0.032
od-0.032
orry-0.031
 clean-0.031
Tokenip
Token ID541
Feature activation-2.87e+00

Pos logit contributions
 avail+0.085
 perspective+0.068
 breed+0.067
aters+0.067
ounced+0.066
Neg logit contributions
 stop-0.093
ibr-0.092
od-0.090
ily-0.089
orry-0.089
Tokenops
Token ID2840
Feature activation-1.02e-01

Pos logit contributions
 avail+0.071
ounced+0.066
 enough+0.060
 batch+0.060
 breed+0.060
Neg logit contributions
 stop-0.083
ibr-0.079
od-0.078
orry-0.078
 clean-0.077
Token were
Token ID547
Feature activation-3.53e-01

Pos logit contributions
ounced+0.003
 avail+0.003
 enough+0.003
 batch+0.003
 problem+0.003
Neg logit contributions
 stop-0.002
ibr-0.002
orry-0.002
od-0.002
 Jos-0.002
Token red
Token ID2266
Feature activation-6.42e-01

Pos logit contributions
 avail+0.014
ounced+0.012
 breed+0.012
ited+0.012
irms+0.012
Neg logit contributions
 clean-0.006
 stop-0.006
 un-0.006
 Kate-0.005
 nicely-0.005
Token and
Token ID290
Feature activation-1.13e+00

Pos logit contributions
 avail+0.028
ounced+0.027
ited+0.025
 breed+0.024
ucker+0.024
Neg logit contributions
 stop-0.009
 clean-0.008
 un-0.006
 nicely-0.006
od-0.006
Token come
Token ID1282
Feature activation+5.71e-01

Pos logit contributions
 avail+0.017
ounced+0.016
 batch+0.015
 enough+0.015
 breed+0.015
Neg logit contributions
 stop-0.009
 clean-0.008
 un-0.008
ibr-0.007
orry-0.007
Token inside
Token ID2641
Feature activation-3.38e-01

Pos logit contributions
 stop+0.010
ibr+0.010
od+0.009
orry+0.009
 clean+0.009
Neg logit contributions
 avail-0.020
ounced-0.019
 enough-0.018
 batch-0.018
 breed-0.018
Token for
Token ID329
Feature activation-1.27e+00

Pos logit contributions
ounced+0.014
 avail+0.013
 breed+0.012
ited+0.012
ucker+0.012
Neg logit contributions
 stop-0.005
 clean-0.004
 un-0.004
 Kate-0.004
 nicely-0.004
Token dinner
Token ID8073
Feature activation-6.00e-02

Pos logit contributions
 avail+0.044
ounced+0.042
 enough+0.040
 batch+0.039
 breed+0.039
Neg logit contributions
 stop-0.024
ibr-0.022
od-0.022
orry-0.022
 clean-0.021
Token.
Token ID13
Feature activation-2.32e+00

Pos logit contributions
 avail+0.002
ounced+0.002
 enough+0.002
 batch+0.002
 breed+0.002
Neg logit contributions
 stop-0.001
orry-0.001
ibr-0.001
od-0.001
iz-0.001
Token But
Token ID887
Feature activation-3.21e+00

Pos logit contributions
 avail+0.092
ounced+0.088
ited+0.084
 choice+0.082
 batch+0.082
Neg logit contributions
 stop-0.034
 clean-0.030
 un-0.030
ibr-0.029
od-0.029
Token St
Token ID520
Feature activation-2.23e+00

Pos logit contributions
ounced+0.119
 avail+0.117
 batch+0.113
 stash+0.111
 types+0.111
Neg logit contributions
 stop-0.056
 clean-0.048
 un-0.047
 nicely-0.047
 Mir-0.043
Tokenitch
Token ID2007
Feature activation-2.79e-01

Pos logit contributions
 avail+0.071
ounced+0.067
 batch+0.063
 breed+0.063
 enough+0.062
Neg logit contributions
 stop-0.049
ibr-0.046
orry-0.046
od-0.045
 clean-0.044
Token didn
Token ID1422
Feature activation+3.40e-01

Pos logit contributions
 avail+0.009
ounced+0.008
 breed+0.008
 batch+0.008
 enough+0.008
Neg logit contributions
 stop-0.006
orry-0.006
 clean-0.006
ibr-0.006
 un-0.006
Token't
Token ID470
Feature activation+2.30e-01

Pos logit contributions
ibr+0.007
 stop+0.007
orry+0.007
od+0.007
 Jos+0.007
Neg logit contributions
 avail-0.010
ounced-0.009
 enough-0.009
 problem-0.009
 batch-0.009
Token want
Token ID765
Feature activation-6.94e-01

Pos logit contributions
ibr+0.007
orry+0.006
 mantle+0.006
od+0.006
 mant+0.006
Neg logit contributions
 enough-0.006
 avail-0.006
 choice-0.005
 problem-0.005
 breed-0.005
Token Wh
Token ID854
Feature activation-1.61e+00

Pos logit contributions
 avail+0.049
ounced+0.047
ited+0.045
 batch+0.044
 enough+0.044
Neg logit contributions
 stop-0.023
 clean-0.020
 un-0.020
ibr-0.019
od-0.018
Tokenisk
Token ID1984
Feature activation-1.24e-01

Pos logit contributions
 avail+0.046
ounced+0.045
 enough+0.041
 batch+0.041
 breed+0.041
Neg logit contributions
 stop-0.040
ibr-0.037
od-0.037
orry-0.037
 clean-0.036
Tokeners
Token ID364
Feature activation+4.75e-01

Pos logit contributions
 enough+0.002
ounced+0.002
orns+0.002
able+0.002
 avail+0.002
Neg logit contributions
 stop-0.004
 Jos-0.004
 un-0.004
 mantle-0.004
 mant-0.004
Token saw
Token ID2497
Feature activation-2.03e-01

Pos logit contributions
 stop+0.012
ibr+0.012
orry+0.012
od+0.012
 mantle+0.012
Neg logit contributions
ounced-0.012
 avail-0.012
 enough-0.011
 batch-0.011
 problem-0.011
Token a
Token ID257
Feature activation-1.43e+00

Pos logit contributions
 avail+0.008
ounced+0.007
ited+0.007
 dollars+0.007
 batch+0.007
Neg logit contributions
 stop-0.003
 clean-0.003
ibr-0.003
od-0.003
orry-0.003
Token can
Token ID460
Feature activation-3.21e+00

Pos logit contributions
 avail+0.054
ounced+0.052
aters+0.050
ited+0.050
 enough+0.050
Neg logit contributions
 stop-0.025
 clean-0.023
 un-0.021
 knock-0.021
 cool-0.020
Token of
Token ID286
Feature activation-2.48e+00

Pos logit contributions
 avail+0.113
ounced+0.103
 enough+0.098
ited+0.095
 problem+0.095
Neg logit contributions
 stop-0.073
 un-0.068
 clean-0.067
od-0.064
orry-0.063
Token tuna
Token ID38883
Feature activation-1.44e+00

Pos logit contributions
 avail+0.073
ounced+0.072
ited+0.068
aters+0.064
eter+0.063
Neg logit contributions
 clean-0.072
 cool-0.069
 stop-0.066
 grown-0.065
 un-0.064
Token on
Token ID319
Feature activation-1.72e+00

Pos logit contributions
 avail+0.051
ounced+0.049
 breed+0.045
 batch+0.045
ited+0.045
Neg logit contributions
 stop-0.027
 clean-0.024
ibr-0.024
od-0.024
 un-0.024
Token the
Token ID262
Feature activation-4.28e-01

Pos logit contributions
ounced+0.078
 avail+0.075
ited+0.072
 dollars+0.071
ucker+0.071
Neg logit contributions
 stop-0.016
 clean-0.015
 un-0.015
 nicely-0.012
 mantle-0.012
Token kitchen
Token ID9592
Feature activation-9.24e-01

Pos logit contributions
 avail+0.011
 batch+0.011
 breed+0.010
 sausage+0.010
 bone+0.010
Neg logit contributions
od-0.011
 Jos-0.011
ibr-0.011
 Kate-0.011
iz-0.011
Token.
Token ID13
Feature activation-3.71e-01

Pos logit contributions
 avail+0.017
ounced+0.016
ited+0.015
 choice+0.015
 batch+0.015
Neg logit contributions
 stop-0.007
 clean-0.006
 un-0.006
od-0.006
ibr-0.006
Token Mom
Token ID11254
Feature activation-8.65e-01

Pos logit contributions
 avail+0.012
ounced+0.012
 enough+0.011
 batch+0.011
 breed+0.011
Neg logit contributions
 stop-0.007
ibr-0.007
orry-0.007
od-0.007
iz-0.007
Token says
Token ID1139
Feature activation-7.22e-01

Pos logit contributions
 avail+0.020
ounced+0.018
 breed+0.017
 enough+0.017
 batch+0.016
Neg logit contributions
 stop-0.027
ibr-0.025
od-0.025
 un-0.025
 clean-0.025
Token they
Token ID484
Feature activation-6.37e-01

Pos logit contributions
 avail+0.026
ounced+0.025
 batch+0.023
 enough+0.023
 breed+0.023
Neg logit contributions
 stop-0.013
ibr-0.012
od-0.011
orry-0.011
 clean-0.011
Token are
Token ID389
Feature activation-5.65e-01

Pos logit contributions
 avail+0.023
ounced+0.022
 breed+0.020
ucker+0.020
 batch+0.020
Neg logit contributions
 stop-0.014
 clean-0.013
 un-0.013
 knock-0.012
orry-0.012
Token av
Token ID1196
Feature activation-3.20e+00

Pos logit contributions
 enough+0.015
ounced+0.014
 avail+0.014
 problem+0.013
 batch+0.013
Neg logit contributions
orry-0.015
ibr-0.014
od-0.014
 stop-0.014
 mantle-0.014
Tokenoc
Token ID420
Feature activation-1.02e+00

Pos logit contributions
 avail+0.089
ounced+0.079
 problem+0.072
 choice+0.071
 breed+0.071
Neg logit contributions
od-0.085
 un-0.080
ibr-0.079
 clean-0.079
orry-0.078
Tokenados
Token ID22484
Feature activation-1.12e+00

Pos logit contributions
ounced+0.021
 avail+0.020
 enough+0.019
 batch+0.018
 dollars+0.017
Neg logit contributions
 stop-0.034
 clean-0.033
 un-0.033
 Jos-0.032
 Kate-0.032
Token.
Token ID13
Feature activation-7.74e-01

Pos logit contributions
 avail+0.041
ounced+0.039
 batch+0.037
 enough+0.037
 breed+0.037
Neg logit contributions
 stop-0.019
ibr-0.017
od-0.017
 clean-0.017
orry-0.017
Token Sara
Token ID24799
Feature activation-7.36e-01

Pos logit contributions
 avail+0.028
ounced+0.027
 enough+0.026
 batch+0.025
 breed+0.025
Neg logit contributions
 stop-0.013
ibr-0.013
orry-0.012
od-0.012
iz-0.012
Token wants
Token ID3382
Feature activation-2.52e-01

Pos logit contributions
 avail+0.015
ounced+0.015
 enough+0.014
 batch+0.013
 breed+0.013
Neg logit contributions
 stop-0.023
ibr-0.022
orry-0.022
od-0.022
 Jos-0.022
Token waited
Token ID13488
Feature activation-8.39e-01

Pos logit contributions
 avail+0.006
ounced+0.005
 enough+0.005
 batch+0.005
 breed+0.005
Neg logit contributions
 stop-0.004
ibr-0.004
orry-0.004
 clean-0.004
od-0.004
Token and
Token ID290
Feature activation-7.76e-01

Pos logit contributions
 enough+0.019
 avail+0.018
ounced+0.018
 problem+0.017
 batch+0.016
Neg logit contributions
ibr-0.024
orry-0.024
od-0.024
 mantle-0.023
 stop-0.023
Token waited
Token ID13488
Feature activation-5.86e-01

Pos logit contributions
 avail+0.020
ounced+0.019
 enough+0.019
ited+0.018
 batch+0.017
Neg logit contributions
ibr-0.021
od-0.021
orry-0.021
 stop-0.020
 mantle-0.020
Token,
Token ID11
Feature activation-1.49e+00

Pos logit contributions
 enough+0.015
 avail+0.013
ounced+0.013
 problem+0.012
 batch+0.012
Neg logit contributions
ibr-0.016
orry-0.016
iz-0.015
od-0.015
 lyn-0.015
Token but
Token ID475
Feature activation-2.16e+00

Pos logit contributions
 enough+0.040
 avail+0.036
ounced+0.035
 batch+0.034
 problem+0.034
Neg logit contributions
ibr-0.041
iz-0.038
orry-0.038
vell-0.038
ilet-0.037
Token no
Token ID645
Feature activation-3.17e+00

Pos logit contributions
 avail+0.072
ounced+0.068
 batch+0.065
 breed+0.064
ited+0.064
Neg logit contributions
 stop-0.047
 clean-0.041
orry-0.041
 un-0.040
 nicely-0.040
Token one
Token ID530
Feature activation-3.85e-01

Pos logit contributions
ounced+0.089
 batch+0.080
ucker+0.079
ited+0.078
 avail+0.078
Neg logit contributions
 stop-0.090
 clean-0.079
 Mia-0.077
 un-0.077
 Mir-0.077
Token came
Token ID1625
Feature activation+4.81e-02

Pos logit contributions
 avail+0.011
ounced+0.010
 enough+0.010
 batch+0.010
 breed+0.009
Neg logit contributions
 stop-0.010
ibr-0.009
od-0.009
orry-0.009
 mantle-0.009
Token to
Token ID284
Feature activation-1.28e+00

Pos logit contributions
 stop+0.001
ibr+0.001
orry+0.001
od+0.001
 mantle+0.001
Neg logit contributions
 avail-0.002
ounced-0.001
 enough-0.001
 problem-0.001
 breed-0.001
Token find
Token ID1064
Feature activation-3.68e-01

Pos logit contributions
 avail+0.045
ounced+0.041
 enough+0.041
 breed+0.039
 choice+0.039
Neg logit contributions
ibr-0.024
od-0.023
orry-0.023
 stop-0.022
 mantle-0.022
Token him
Token ID683
Feature activation+3.12e-01

Pos logit contributions
 avail+0.012
ounced+0.012
 enough+0.012
 breed+0.011
 choice+0.011
Neg logit contributions
 stop-0.007
ibr-0.007
 mantle-0.007
od-0.007
orry-0.006
Token and
Token ID290
Feature activation-3.85e-01

Pos logit contributions
 avail+0.022
ounced+0.022
 batch+0.020
 breed+0.020
ited+0.020
Neg logit contributions
 stop-0.012
 clean-0.011
 un-0.010
od-0.010
ibr-0.010
Token wanted
Token ID2227
Feature activation-1.70e+00

Pos logit contributions
 avail+0.011
ounced+0.010
 batch+0.010
 enough+0.009
 breed+0.009
Neg logit contributions
 stop-0.010
 clean-0.009
 un-0.009
ibr-0.009
orry-0.009
Token to
Token ID284
Feature activation-4.54e-01

Pos logit contributions
 avail+0.054
ited+0.053
ounced+0.052
 batch+0.049
ucker+0.049
Neg logit contributions
 stop-0.040
 clean-0.038
 un-0.036
 nicely-0.035
 Mia-0.034
Token rest
Token ID1334
Feature activation-7.99e-02

Pos logit contributions
 avail+0.015
 choice+0.014
 enough+0.013
 saddle+0.013
 breed+0.012
Neg logit contributions
ibr-0.010
od-0.010
orry-0.010
 Jos-0.009
 mantle-0.009
Token,
Token ID11
Feature activation-1.85e+00

Pos logit contributions
 avail+0.003
ounced+0.003
 batch+0.003
ited+0.003
 breed+0.003
Neg logit contributions
 stop-0.001
 clean-0.001
 un-0.001
 Kate-0.001
 nicely-0.001
Token but
Token ID475
Feature activation-3.16e+00

Pos logit contributions
 avail+0.068
ounced+0.066
 batch+0.062
ited+0.061
 breed+0.061
Neg logit contributions
 stop-0.032
 clean-0.028
 un-0.027
ibr-0.027
od-0.027
Token they
Token ID484
Feature activation-1.23e-01

Pos logit contributions
 loaf+0.144
aters+0.144
 batch+0.144
ited+0.142
eter+0.140
Neg logit contributions
 Lily-0.042
 stop-0.038
 everyone-0.037
 Mia-0.034
 Anna-0.032
Token didn
Token ID1422
Feature activation-3.31e-01

Pos logit contributions
 avail+0.003
ounced+0.003
 enough+0.003
 batch+0.003
 breed+0.003
Neg logit contributions
 stop-0.003
ibr-0.003
od-0.003
orry-0.003
 clean-0.003
Token't
Token ID470
Feature activation-2.57e-01

Pos logit contributions
 avail+0.010
 enough+0.009
ounced+0.009
 problem+0.009
 batch+0.008
Neg logit contributions
ibr-0.007
orry-0.007
 stop-0.007
od-0.007
 Jos-0.007
Token have
Token ID423
Feature activation-3.79e-01

Pos logit contributions
 avail+0.006
 enough+0.006
 problem+0.006
 choice+0.006
 breed+0.005
Neg logit contributions
ibr-0.008
 mantle-0.007
orry-0.007
od-0.007
 Jos-0.007
Token a
Token ID257
Feature activation+1.31e-01

Pos logit contributions
 avail+0.009
ounced+0.009
 batch+0.008
 breed+0.008
ited+0.008
Neg logit contributions
 stop-0.012
 clean-0.011
 un-0.011
 grown-0.010
od-0.010
Token the
Token ID262
Feature activation+3.06e-01

Pos logit contributions
 avail+0.088
ounced+0.087
ited+0.082
 batch+0.081
 breed+0.080
Neg logit contributions
 stop-0.052
 clean-0.046
 un-0.046
ibr-0.045
od-0.045
Token water
Token ID1660
Feature activation+8.88e-01

Pos logit contributions
 stop+0.006
 clean+0.005
 un+0.005
 cool+0.005
 grown+0.005
Neg logit contributions
 avail-0.012
ounced-0.012
 batch-0.011
ited-0.011
ucker-0.011
Token was
Token ID373
Feature activation-2.83e-01

Pos logit contributions
 stop+0.019
 clean+0.017
 un+0.017
 grown+0.016
orry+0.016
Neg logit contributions
 avail-0.032
 batch-0.027
 breed-0.027
aters-0.027
 dollars-0.027
Token really
Token ID1107
Feature activation-5.40e-03

Pos logit contributions
 avail+0.012
 dollars+0.010
rob+0.010
irms+0.010
use+0.010
Neg logit contributions
 clean-0.005
 stop-0.005
 un-0.005
 cool-0.004
 tidy-0.004
Token rushing
Token ID15795
Feature activation-7.96e-02

Pos logit contributions
 avail+0.000
aters+0.000
ounced+0.000
 batch+0.000
 breed+0.000
Neg logit contributions
 clean-0.000
 stop-0.000
 un-0.000
 cool-0.000
 tidy-0.000
Token and
Token ID290
Feature activation-3.16e+00

Pos logit contributions
 avail+0.003
ounced+0.003
 batch+0.003
 breed+0.003
ucker+0.003
Neg logit contributions
 stop-0.001
 clean-0.001
 un-0.001
 nicely-0.001
 mantle-0.001
Token it
Token ID340
Feature activation-2.01e-01

Pos logit contributions
 avail+0.115
ited+0.109
 choice+0.109
 dollars+0.106
 breed+0.105
Neg logit contributions
 stop-0.057
 clean-0.055
 un-0.054
 grown-0.050
 everyone-0.049
Token looked
Token ID3114
Feature activation-1.33e+00

Pos logit contributions
 avail+0.007
ounced+0.006
 dollars+0.006
 batch+0.006
 breed+0.006
Neg logit contributions
 stop-0.005
 clean-0.004
 un-0.004
orry-0.004
 mantle-0.004
Token filthy
Token ID35677
Feature activation-2.59e-01

Pos logit contributions
 avail+0.047
ounced+0.044
 breed+0.042
 batch+0.042
ucker+0.040
Neg logit contributions
 stop-0.027
 clean-0.025
 un-0.024
 nicely-0.023
orry-0.022
Token.
Token ID13
Feature activation-2.53e+00

Pos logit contributions
 avail+0.010
ounced+0.010
 batch+0.009
 breed+0.009
ited+0.009
Neg logit contributions
 stop-0.004
 clean-0.003
 un-0.003
od-0.003
ibr-0.003
Token Cotton
Token ID23915
Feature activation-4.63e-01

Pos logit contributions
 avail+0.080
ounced+0.076
ited+0.071
 batch+0.070
 choice+0.070
Neg logit contributions
 stop-0.056
ibr-0.052
orry-0.051
od-0.051
 clean-0.051
Token found
Token ID1043
Feature activation-7.68e-01

Pos logit contributions
 avail+0.006
ounced+0.006
 enough+0.005
 breed+0.005
 batch+0.005
Neg logit contributions
 stop-0.005
 clean-0.004
 un-0.004
od-0.004
orry-0.004
Token some
Token ID617
Feature activation-1.09e+00

Pos logit contributions
 avail+0.033
ited+0.030
ounced+0.029
ucker+0.028
 choice+0.028
Neg logit contributions
 stop-0.010
ily-0.010
 Mia-0.010
 clean-0.010
 Mir-0.009
Token blue
Token ID4171
Feature activation-2.09e-01

Pos logit contributions
 avail+0.041
ounced+0.034
joy+0.033
irms+0.033
 Amen+0.032
Neg logit contributions
 clean-0.026
 cool-0.025
 grown-0.023
 p-0.022
 stop-0.022
Tokenberries
Token ID20853
Feature activation-1.32e+00

Pos logit contributions
 avail+0.007
 faith+0.005
 Cubs+0.005
ited+0.005
 fracture+0.005
Neg logit contributions
 O-0.006
 p-0.006
 in-0.005
 herself-0.005
 clean-0.005
Token,
Token ID11
Feature activation-1.63e+00

Pos logit contributions
 avail+0.053
ounced+0.048
 breed+0.046
irms+0.045
ited+0.045
Neg logit contributions
 stop-0.022
 clean-0.020
od-0.019
 un-0.018
 nicely-0.017
Token but
Token ID475
Feature activation-3.15e+00

Pos logit contributions
 avail+0.061
ounced+0.056
 choice+0.053
 problem+0.053
 breed+0.051
Neg logit contributions
 stop-0.031
 clean-0.030
 un-0.029
od-0.027
 Mia-0.027
Token they
Token ID484
Feature activation-3.82e-01

Pos logit contributions
 avail+0.154
ounced+0.151
ited+0.150
 saddle+0.147
aters+0.145
Neg logit contributions
 Kate-0.018
 stop-0.017
 Mia-0.017
 clean-0.016
 nicely-0.016
Token were
Token ID547
Feature activation-1.08e+00

Pos logit contributions
 avail+0.012
ounced+0.011
 enough+0.011
 batch+0.011
 breed+0.011
Neg logit contributions
 stop-0.009
 clean-0.008
od-0.008
orry-0.008
ibr-0.008
Token bitter
Token ID12922
Feature activation-8.67e-01

Pos logit contributions
 avail+0.040
 choice+0.034
 breed+0.034
ounced+0.034
aters+0.033
Neg logit contributions
 clean-0.022
 stop-0.022
 un-0.021
 nicely-0.019
orry-0.019
Token.
Token ID13
Feature activation-1.94e+00

Pos logit contributions
 avail+0.037
ounced+0.035
 saddle+0.033
ited+0.033
 breed+0.033
Neg logit contributions
 stop-0.011
 clean-0.010
od-0.009
 nicely-0.008
 un-0.008
Token
Token ID198
Feature activation-1.46e+00

Pos logit contributions
 avail+0.076
ounced+0.073
 enough+0.069
 batch+0.069
 breed+0.069
Neg logit contributions
 stop-0.028
ibr-0.025
od-0.024
orry-0.024
 clean-0.024
Token,
Token ID11
Feature activation-4.12e-01

Pos logit contributions
ibr+0.021
orry+0.021
od+0.020
iz+0.020
 mantle+0.020
Neg logit contributions
 enough-0.021
 avail-0.020
ounced-0.020
 problem-0.020
 choice-0.019
Token Lily
Token ID20037
Feature activation+2.53e-01

Pos logit contributions
 avail+0.016
 enough+0.015
 batch+0.014
ounced+0.014
 choice+0.014
Neg logit contributions
ibr-0.007
orry-0.007
od-0.006
 stop-0.006
iz-0.006
Token saw
Token ID2497
Feature activation+2.61e-01

Pos logit contributions
 stop+0.006
ibr+0.006
od+0.006
orry+0.006
 clean+0.005
Neg logit contributions
 avail-0.007
ounced-0.007
 enough-0.007
 batch-0.007
 breed-0.007
Token her
Token ID607
Feature activation-1.64e+00

Pos logit contributions
 stop+0.004
 clean+0.003
ibr+0.003
od+0.003
 Kate+0.003
Neg logit contributions
 avail-0.011
ounced-0.010
ited-0.010
 dollars-0.010
 breed-0.009
Token l
Token ID300
Feature activation-2.39e+00

Pos logit contributions
 avail+0.059
ounced+0.055
 properties+0.052
use+0.052
 breed+0.052
Neg logit contributions
 stop-0.034
 clean-0.032
 un-0.029
 nicely-0.027
 face-0.027
Tokenoll
Token ID692
Feature activation-3.15e+00

Pos logit contributions
 avail+0.090
ounced+0.083
 batch+0.081
 breed+0.079
 enough+0.079
Neg logit contributions
 stop-0.040
ibr-0.039
orry-0.038
od-0.036
iz-0.035
Tokenipop
Token ID42800
Feature activation-2.11e+00

Pos logit contributions
 avail+0.081
 breed+0.066
 batch+0.065
 ditch+0.065
irms+0.064
Neg logit contributions
 stop-0.090
orry-0.085
ibr-0.085
od-0.084
 clean-0.083
Token had
Token ID550
Feature activation-1.22e+00

Pos logit contributions
 avail+0.074
ounced+0.071
 breed+0.066
 batch+0.066
ited+0.065
Neg logit contributions
 stop-0.041
 clean-0.037
 un-0.035
od-0.035
ibr-0.035
Token melted
Token ID24178
Feature activation-1.17e+00

Pos logit contributions
 avail+0.039
ounced+0.034
 breed+0.034
 batch+0.033
irms+0.033
Neg logit contributions
 stop-0.029
 clean-0.027
 un-0.026
od-0.025
 grown-0.025
Token completely
Token ID3190
Feature activation-3.33e-02

Pos logit contributions
ounced+0.048
 avail+0.047
 breed+0.045
ucker+0.044
 batch+0.043
Neg logit contributions
 stop-0.016
 clean-0.016
 nicely-0.013
 un-0.013
 in-0.013
Token.
Token ID13
Feature activation-6.75e-02

Pos logit contributions
 avail+0.001
ounced+0.001
 enough+0.001
 batch+0.001
 breed+0.001
Neg logit contributions
 stop-0.001
ibr-0.001
orry-0.001
od-0.001
 Jos-0.001
Token decided
Token ID3066
Feature activation-2.45e-01

Pos logit contributions
 clean+0.010
 stop+0.009
 un+0.009
orry+0.008
ibr+0.008
Neg logit contributions
 avail-0.013
ounced-0.012
 breed-0.011
 dollars-0.011
 batch-0.011
Token to
Token ID284
Feature activation+1.76e-01

Pos logit contributions
 avail+0.006
ounced+0.006
 batch+0.005
 breed+0.005
 enough+0.005
Neg logit contributions
 stop-0.007
 clean-0.006
od-0.006
ibr-0.006
orry-0.006
Token take
Token ID1011
Feature activation+3.63e-01

Pos logit contributions
ibr+0.004
orry+0.004
od+0.004
vell+0.003
 mantle+0.003
Neg logit contributions
 avail-0.006
 choice-0.005
 enough-0.005
 batch-0.005
 breed-0.005
Token her
Token ID607
Feature activation-4.85e-02

Pos logit contributions
 mantle+0.010
 stop+0.009
ibr+0.009
orry+0.009
 mant+0.009
Neg logit contributions
 enough-0.010
 avail-0.010
ounced-0.008
 batch-0.008
 breed-0.008
Token favourite
Token ID12507
Feature activation-2.01e-02

Pos logit contributions
 avail+0.001
ounced+0.001
 enough+0.001
 batch+0.001
 breed+0.001
Neg logit contributions
 stop-0.001
ibr-0.001
 clean-0.001
 un-0.001
orry-0.001
Token caul
Token ID46087
Feature activation-3.14e+00

Pos logit contributions
ounced+0.001
 avail+0.001
ucker+0.001
 enough+0.001
ited+0.001
Neg logit contributions
 stop-0.000
 clean-0.000
 un-0.000
ibr-0.000
 mantle-0.000
Tokeniflower
Token ID42642
Feature activation-8.39e-01

Pos logit contributions
 avail+0.118
irms+0.116
ounced+0.115
 breed+0.110
 enough+0.110
Neg logit contributions
od-0.048
 mantle-0.046
 clean-0.043
An-0.040
 nicely-0.040
Token with
Token ID351
Feature activation+7.85e-01

Pos logit contributions
 avail+0.032
ounced+0.030
ucker+0.028
ited+0.028
 batch+0.027
Neg logit contributions
 stop-0.015
 clean-0.014
 nicely-0.013
 un-0.013
ibr-0.013
Token her
Token ID607
Feature activation+3.23e-01

Pos logit contributions
 stop+0.006
 clean+0.006
 un+0.005
 nicely+0.004
od+0.004
Neg logit contributions
 avail-0.036
ounced-0.036
 problem-0.034
 dollars-0.034
ucker-0.034
Token.
Token ID13
Feature activation+6.43e-01

Pos logit contributions
 stop+0.004
 clean+0.004
 un+0.004
od+0.004
 nicely+0.004
Neg logit contributions
 avail-0.013
ounced-0.013
ited-0.013
irms-0.012
 batch-0.012
Token She
Token ID1375
Feature activation-8.67e-02

Pos logit contributions
 stop+0.012
ibr+0.011
orry+0.011
od+0.011
 mantle+0.010
Neg logit contributions
 avail-0.023
ounced-0.022
 enough-0.021
 batch-0.021
 breed-0.020
Token a
Token ID257
Feature activation-4.68e-01

Pos logit contributions
 avail+0.081
ounced+0.080
 batch+0.075
 breed+0.074
 enough+0.074
Neg logit contributions
 stop-0.040
ibr-0.036
 clean-0.035
od-0.035
 un-0.035
Token little
Token ID1310
Feature activation-3.86e-01

Pos logit contributions
 avail+0.016
ounced+0.015
 enough+0.014
 batch+0.014
 breed+0.014
Neg logit contributions
 stop-0.009
 clean-0.008
 un-0.008
ibr-0.008
orry-0.008
Token green
Token ID4077
Feature activation-4.60e-03

Pos logit contributions
 avail+0.012
ounced+0.012
aters+0.011
ucker+0.011
 allowance+0.011
Neg logit contributions
 stop-0.010
 clean-0.009
 un-0.008
 knock-0.008
 Mia-0.008
Token frog
Token ID21264
Feature activation-9.48e-01

Pos logit contributions
 avail+0.000
ounced+0.000
 enough+0.000
 batch+0.000
 dollars+0.000
Neg logit contributions
 stop-0.000
 clean-0.000
orry-0.000
od-0.000
 un-0.000
Token appeared
Token ID4120
Feature activation-1.22e-01

Pos logit contributions
 avail+0.028
ounced+0.027
 batch+0.025
 enough+0.025
 breed+0.025
Neg logit contributions
 stop-0.023
ibr-0.021
od-0.021
orry-0.021
 un-0.021
Token.
Token ID13
Feature activation-3.13e+00

Pos logit contributions
 avail+0.004
ounced+0.004
 enough+0.004
 breed+0.004
 batch+0.004
Neg logit contributions
 stop-0.002
ibr-0.002
orry-0.002
 mantle-0.002
od-0.002
Token The
Token ID383
Feature activation-5.38e-02

Pos logit contributions
ounced+0.134
 avail+0.134
 choice+0.125
ited+0.125
 flavors+0.122
Neg logit contributions
 stop-0.036
 clean-0.033
 un-0.033
 Everyone-0.029
 Mir-0.029
Token frog
Token ID21264
Feature activation-8.74e-01

Pos logit contributions
 avail+0.002
ounced+0.002
 enough+0.002
 dollars+0.002
aters+0.002
Neg logit contributions
 stop-0.001
 clean-0.001
 un-0.001
 Kate-0.001
orry-0.001
Token said
Token ID531
Feature activation-3.26e-01

Pos logit contributions
ounced+0.019
 avail+0.019
 enough+0.017
 batch+0.017
 breed+0.016
Neg logit contributions
 stop-0.027
ibr-0.026
orry-0.025
od-0.025
 mantle-0.025
Token,
Token ID11
Feature activation-9.70e-01

Pos logit contributions
 enough+0.009
ounced+0.009
 avail+0.008
 problem+0.008
 batch+0.008
Neg logit contributions
 mantle-0.009
orry-0.008
iz-0.008
ibr-0.008
 stop-0.008
Token "
Token ID366
Feature activation-3.83e-01

Pos logit contributions
ounced+0.020
 enough+0.019
 breed+0.018
 sausage+0.017
 batch+0.017
Neg logit contributions
orry-0.031
iz-0.031
ibr-0.031
 mantle-0.028
lo-0.028
Token she
Token ID673
Feature activation+2.80e-01

Pos logit contributions
 avail+0.012
 enough+0.011
ounced+0.010
 problem+0.010
grown+0.010
Neg logit contributions
orry-0.008
ibr-0.008
 stop-0.008
od-0.008
iz-0.008
Token noticed
Token ID6810
Feature activation+5.71e-01

Pos logit contributions
 stop+0.007
 un+0.007
 clean+0.007
od+0.006
 Kate+0.006
Neg logit contributions
 avail-0.008
ounced-0.008
 breed-0.007
 batch-0.007
 dollars-0.007
Token that
Token ID326
Feature activation-7.70e-01

Pos logit contributions
 stop+0.011
od+0.010
ibr+0.010
 clean+0.010
orry+0.010
Neg logit contributions
 avail-0.019
ounced-0.019
 batch-0.018
 breed-0.017
ited-0.017
Token the
Token ID262
Feature activation-7.84e-01

Pos logit contributions
ounced+0.028
 avail+0.028
ited+0.025
 breed+0.025
 batch+0.025
Neg logit contributions
 stop-0.013
 clean-0.012
 Kate-0.012
 un-0.012
od-0.012
Token l
Token ID300
Feature activation-2.14e+00

Pos logit contributions
 avail+0.024
ounced+0.024
 enough+0.022
 batch+0.021
 breed+0.021
Neg logit contributions
 stop-0.018
 clean-0.017
ibr-0.016
 mantle-0.016
 un-0.016
Tokenoll
Token ID692
Feature activation-3.12e+00

Pos logit contributions
 avail+0.088
ounced+0.086
 enough+0.082
 batch+0.082
 breed+0.081
Neg logit contributions
 stop-0.025
ibr-0.022
od-0.021
orry-0.021
 clean-0.021
Tokenipop
Token ID42800
Feature activation-2.08e+00

Pos logit contributions
 avail+0.080
irms+0.067
 breed+0.066
aters+0.064
 batch+0.063
Neg logit contributions
 stop-0.088
od-0.087
ibr-0.086
orry-0.085
ily-0.083
Token was
Token ID373
Feature activation-4.44e-01

Pos logit contributions
 avail+0.065
ounced+0.060
 breed+0.057
 batch+0.057
 enough+0.056
Neg logit contributions
 stop-0.049
 clean-0.045
 un-0.044
od-0.043
ibr-0.043
Token gone
Token ID3750
Feature activation-2.53e-01

Pos logit contributions
 avail+0.016
ounced+0.014
 breed+0.014
irms+0.013
 choice+0.013
Neg logit contributions
 stop-0.009
 clean-0.009
 un-0.008
 mantle-0.008
 nicely-0.008
Token.
Token ID13
Feature activation-5.94e-01

Pos logit contributions
 avail+0.010
ounced+0.010
 breed+0.010
 batch+0.010
ucker+0.010
Neg logit contributions
 clean-0.003
 stop-0.003
 un-0.003
 nicely-0.003
 tidy-0.002
Token She
Token ID1375
Feature activation-8.14e-02

Pos logit contributions
 avail+0.014
 batch+0.014
 Does+0.014
 breed+0.013
ounced+0.013
Neg logit contributions
 stop-0.017
vell-0.016
orry-0.015
iz-0.015
ibr-0.015
Token the
Token ID262
Feature activation-1.55e+00

Pos logit contributions
ounced+0.002
 avail+0.002
ited+0.002
ucker+0.002
 batch+0.002
Neg logit contributions
 stop-0.001
 clean-0.001
 un-0.001
 Kate-0.001
 nicely-0.001
Token prize
Token ID11596
Feature activation-8.80e-01

Pos logit contributions
ounced+0.054
ucker+0.050
 batch+0.047
etchup+0.046
ays+0.046
Neg logit contributions
 stop-0.039
 cool-0.036
 clean-0.035
 right-0.034
 grown-0.033
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
 avail+0.035
ounced+0.034
ited+0.032
ucker+0.032
 batch+0.031
Neg logit contributions
 stop-0.013
 clean-0.011
 Kate-0.011
od-0.011
 nicely-0.011
Token
Token ID198
Feature activation-5.07e-01

Pos logit contributions
 avail+0.041
ounced+0.039
 enough+0.038
 breed+0.037
 batch+0.037
Neg logit contributions
 stop-0.025
orry-0.023
ibr-0.023
od-0.023
 clean-0.022
Token
Token ID198
Feature activation-1.21e+00

Pos logit contributions
 avail+0.018
ounced+0.017
 enough+0.016
 batch+0.016
 breed+0.016
Neg logit contributions
 stop-0.009
ibr-0.008
orry-0.008
od-0.008
 clean-0.008
TokenBa
Token ID34458
Feature activation-3.12e+00

Pos logit contributions
 avail+0.038
ounced+0.037
 batch+0.034
 enough+0.034
 breed+0.034
Neg logit contributions
 stop-0.027
ibr-0.025
od-0.024
orry-0.024
 clean-0.024
Tokeniley
Token ID9618
Feature activation-8.43e-01

Pos logit contributions
ounced+0.089
 avail+0.084
 batch+0.080
 stash+0.079
 loaf+0.077
Neg logit contributions
ibr-0.078
od-0.074
 stop-0.073
 Kate-0.073
orry-0.072
Token and
Token ID290
Feature activation+7.03e-01

Pos logit contributions
ounced+0.021
 avail+0.021
 enough+0.020
 batch+0.019
 breed+0.019
Neg logit contributions
 stop-0.023
ibr-0.022
orry-0.021
 mantle-0.021
od-0.021
Token her
Token ID607
Feature activation-5.74e-01

Pos logit contributions
 stop+0.010
ibr+0.009
orry+0.008
od+0.008
 clean+0.008
Neg logit contributions
 avail-0.028
ounced-0.027
 enough-0.026
 batch-0.026
 breed-0.025
Token friends
Token ID2460
Feature activation+6.22e-01

Pos logit contributions
 avail+0.015
ounced+0.014
 enough+0.013
 problem+0.013
 batch+0.013
Neg logit contributions
 stop-0.016
ibr-0.015
od-0.015
orry-0.015
 mantle-0.014
Token started
Token ID2067
Feature activation-2.49e-01

Pos logit contributions
 stop+0.017
ibr+0.016
orry+0.016
 mantle+0.016
od+0.016
Neg logit contributions
 avail-0.016
ounced-0.015
 enough-0.015
ited-0.014
 problem-0.014
Token Nut
Token ID11959
Feature activation-8.93e-01

Pos logit contributions
ibr+0.040
od+0.039
orry+0.039
 stop+0.039
 mantle+0.038
Neg logit contributions
 avail-0.041
ounced-0.040
 enough-0.038
 problem-0.037
 breed-0.037
Tokenty
Token ID774
Feature activation-8.60e-01

Pos logit contributions
 avail+0.029
ounced+0.028
 batch+0.026
 breed+0.025
 enough+0.025
Neg logit contributions
 stop-0.019
od-0.018
 clean-0.017
orry-0.017
iz-0.017
Token!"
Token ID2474
Feature activation+4.83e-01

Pos logit contributions
 avail+0.027
ounced+0.026
 batch+0.024
 enough+0.024
 breed+0.023
Neg logit contributions
 stop-0.019
ibr-0.018
od-0.018
orry-0.018
 clean-0.018
Token said
Token ID531
Feature activation+6.20e-01

Pos logit contributions
 stop+0.011
ibr+0.010
 clean+0.010
od+0.010
 un+0.010
Neg logit contributions
 avail-0.014
ounced-0.014
 batch-0.013
 enough-0.013
 breed-0.013
Token a
Token ID257
Feature activation-4.22e-01

Pos logit contributions
ibr+0.012
orry+0.012
 mantle+0.011
 stop+0.011
od+0.011
Neg logit contributions
 avail-0.021
ounced-0.020
 enough-0.020
 breed-0.019
 choice-0.018
Token chip
Token ID11594
Feature activation-3.10e+00

Pos logit contributions
 problem+0.011
 avail+0.011
ounced+0.011
 breed+0.011
 batch+0.011
Neg logit contributions
ibr-0.011
orry-0.010
od-0.010
 mantle-0.010
iz-0.010
Tokenm
Token ID76
Feature activation-9.48e-02

Pos logit contributions
 avail+0.098
ounced+0.094
 enough+0.087
 batch+0.086
 breed+0.085
Neg logit contributions
 stop-0.070
od-0.064
orry-0.064
ibr-0.063
 clean-0.063
Tokenunk
Token ID2954
Feature activation-1.07e+00

Pos logit contributions
ounced+0.003
 avail+0.003
 enough+0.003
 breed+0.003
 problem+0.003
Neg logit contributions
 stop-0.002
 mantle-0.002
ibr-0.002
 mant-0.002
 clean-0.002
Token named
Token ID3706
Feature activation-1.36e-02

Pos logit contributions
ounced+0.033
 enough+0.031
 avail+0.031
 breed+0.030
 batch+0.029
Neg logit contributions
ibr-0.023
 mantle-0.022
 stop-0.022
orry-0.020
od-0.020
Token Ch
Token ID609
Feature activation-5.39e-01

Pos logit contributions
 avail+0.000
ounced+0.000
 breed+0.000
 enough+0.000
die+0.000
Neg logit contributions
ibr-0.000
orry-0.000
 stop-0.000
 mantle-0.000
od-0.000
Tokenippy
Token ID41214
Feature activation-1.24e+00

Pos logit contributions
 avail+0.015
ounced+0.015
 enough+0.014
 batch+0.014
 breed+0.014
Neg logit contributions
 stop-0.014
ibr-0.013
 clean-0.012
od-0.012
 un-0.012
Token they
Token ID484
Feature activation+2.63e-01

Pos logit contributions
 avail+0.013
 enough+0.012
ounced+0.012
 problem+0.011
grown+0.011
Neg logit contributions
ibr-0.012
od-0.011
orry-0.011
 stop-0.011
 mantle-0.011
Token found
Token ID1043
Feature activation-1.65e-01

Pos logit contributions
ibr+0.008
 mantle+0.008
od+0.008
lo+0.008
orry+0.008
Neg logit contributions
ounced-0.005
 enough-0.005
 avail-0.005
 seen-0.004
ited-0.004
Token his
Token ID465
Feature activation-1.83e+00

Pos logit contributions
 avail+0.006
ounced+0.006
ited+0.006
 batch+0.006
 breed+0.006
Neg logit contributions
 stop-0.003
 clean-0.002
ibr-0.002
od-0.002
orry-0.002
Token lost
Token ID2626
Feature activation-4.77e-01

Pos logit contributions
 avail+0.063
ited+0.056
ounced+0.056
grown+0.056
 properties+0.055
Neg logit contributions
 stop-0.039
 clean-0.036
 cool-0.035
 nicely-0.034
 un-0.033
Token l
Token ID300
Feature activation-2.38e+00

Pos logit contributions
ited+0.016
 avail+0.015
ounced+0.015
use+0.014
smanship+0.014
Neg logit contributions
 clean-0.010
 stop-0.010
 face-0.010
 p-0.009
 un-0.009
Tokenoll
Token ID692
Feature activation-3.10e+00

Pos logit contributions
 avail+0.094
ounced+0.086
 batch+0.084
 breed+0.083
 choice+0.081
Neg logit contributions
 stop-0.037
ibr-0.036
orry-0.035
od-0.033
ily-0.033
Tokenipop
Token ID42800
Feature activation-2.34e+00

Pos logit contributions
 avail+0.079
 faith+0.066
 breed+0.066
ounced+0.066
irms+0.066
Neg logit contributions
 stop-0.086
ibr-0.081
 clean-0.081
 nicely-0.080
orry-0.079
Token.
Token ID13
Feature activation+4.22e-02

Pos logit contributions
 avail+0.093
ounced+0.089
 batch+0.084
 breed+0.084
ited+0.083
Neg logit contributions
 stop-0.034
 clean-0.029
ibr-0.029
od-0.028
orry-0.028
Token The
Token ID383
Feature activation-1.39e+00

Pos logit contributions
vell+0.001
iz+0.001
orry+0.001
od+0.001
 stop+0.001
Neg logit contributions
 batch-0.001
 breed-0.001
 enough-0.001
 sausage-0.001
 types-0.001
Token little
Token ID1310
Feature activation-7.90e-01

Pos logit contributions
 avail+0.059
ounced+0.057
 enough+0.054
 breed+0.053
 batch+0.053
Neg logit contributions
 stop-0.018
 clean-0.015
ibr-0.014
 un-0.014
orry-0.014
Token boy
Token ID2933
Feature activation-4.49e-01

Pos logit contributions
ounced+0.033
 avail+0.031
 properties+0.028
aters+0.028
ained+0.028
Neg logit contributions
 clean-0.014
 stop-0.013
 p-0.013
 Mia-0.012
 Lily-0.012
Token
Token ID198
Feature activation-2.56e+00

Pos logit contributions
 avail+0.076
ounced+0.072
 enough+0.069
ited+0.069
 dollars+0.069
Neg logit contributions
 stop-0.016
 clean-0.014
iz-0.013
od-0.013
 Kate-0.013
TokenAs
Token ID1722
Feature activation-2.64e+00

Pos logit contributions
 avail+0.097
ounced+0.089
 choice+0.086
 types+0.086
 stash+0.085
Neg logit contributions
 stop-0.045
 clean-0.042
 un-0.041
An-0.039
Her-0.039
Token they
Token ID484
Feature activation+1.15e+00

Pos logit contributions
 avail+0.118
ounced+0.117
ited+0.116
 types+0.113
 batch+0.111
Neg logit contributions
 stop-0.024
 clean-0.021
 Mir-0.019
 Mia-0.018
 un-0.018
Token were
Token ID547
Feature activation-2.43e-01

Pos logit contributions
 stop+0.030
ibr+0.029
od+0.029
orry+0.029
 mantle+0.028
Neg logit contributions
 avail-0.030
ounced-0.029
 enough-0.028
ited-0.027
 batch-0.026
Token walking
Token ID6155
Feature activation-4.97e-01

Pos logit contributions
 avail+0.006
ounced+0.006
 enough+0.006
 problem+0.006
 batch+0.005
Neg logit contributions
 stop-0.006
ibr-0.006
od-0.006
orry-0.006
 mantle-0.006
Token,
Token ID11
Feature activation-3.07e+00

Pos logit contributions
 avail+0.017
ounced+0.017
 batch+0.016
 enough+0.016
 breed+0.016
Neg logit contributions
 stop-0.009
ibr-0.008
orry-0.008
od-0.008
 clean-0.008
Token Velvet
Token ID36271
Feature activation-3.79e-01

Pos logit contributions
 avail+0.114
ounced+0.110
ited+0.106
 batch+0.105
 choice+0.103
Neg logit contributions
 stop-0.053
 clean-0.044
 Kate-0.044
ibr-0.044
 Mir-0.043
Token noticed
Token ID6810
Feature activation-1.87e-01

Pos logit contributions
ounced+0.008
 avail+0.008
 enough+0.007
 problem+0.007
 batch+0.007
Neg logit contributions
 stop-0.012
ibr-0.012
orry-0.011
 mantle-0.011
od-0.011
Token their
Token ID511
Feature activation-5.09e-01

Pos logit contributions
 avail+0.007
ounced+0.007
ited+0.007
 batch+0.007
 dollars+0.007
Neg logit contributions
 stop-0.003
od-0.003
 clean-0.003
ibr-0.003
 Kate-0.003
Token friend
Token ID1545
Feature activation-4.19e-02

Pos logit contributions
 avail+0.016
ounced+0.015
ucker+0.014
 batch+0.014
ited+0.014
Neg logit contributions
 stop-0.012
 clean-0.012
 un-0.011
 mantle-0.010
 nicely-0.010
Token's
Token ID338
Feature activation-7.72e-01

Pos logit contributions
 avail+0.001
ounced+0.001
 enough+0.001
 batch+0.001
 breed+0.001
Neg logit contributions
 stop-0.001
ibr-0.001
orry-0.001
od-0.001
 mantle-0.001
Token plate
Token ID7480
Feature activation-6.19e-01

Pos logit contributions
 avail+0.013
ounced+0.013
 breed+0.012
 batch+0.012
ucker+0.012
Neg logit contributions
 stop-0.008
 clean-0.007
 un-0.007
ibr-0.007
orry-0.006
Token were
Token ID547
Feature activation+1.51e-01

Pos logit contributions
 avail+0.019
ounced+0.018
 enough+0.017
 batch+0.017
 breed+0.017
Neg logit contributions
 stop-0.014
ibr-0.013
od-0.013
orry-0.013
 clean-0.013
Token two
Token ID734
Feature activation-1.38e+00

Pos logit contributions
 stop+0.003
 clean+0.003
 Kate+0.003
ibr+0.003
od+0.002
Neg logit contributions
 avail-0.005
ounced-0.005
ited-0.005
 breed-0.005
ucker-0.005
Token ste
Token ID2876
Feature activation-1.66e+00

Pos logit contributions
 avail+0.047
ounced+0.043
 breed+0.041
 properties+0.039
ucker+0.039
Neg logit contributions
 stop-0.030
 clean-0.028
 un-0.027
od-0.027
ibr-0.026
Tokenamed
Token ID2434
Feature activation+4.00e-01

Pos logit contributions
 avail+0.050
ounced+0.048
 enough+0.045
 batch+0.044
 breed+0.044
Neg logit contributions
 stop-0.039
ibr-0.036
od-0.036
orry-0.036
 clean-0.035
Token caul
Token ID46087
Feature activation-3.05e+00

Pos logit contributions
 stop+0.009
orry+0.009
 mantle+0.009
ibr+0.009
iz+0.009
Neg logit contributions
 enough-0.011
 batch-0.011
 avail-0.011
ounced-0.011
 types-0.010
Tokeniflower
Token ID42642
Feature activation-1.50e+00

Pos logit contributions
 avail+0.095
ounced+0.088
 enough+0.083
 breed+0.083
 choice+0.082
Neg logit contributions
od-0.069
ibr-0.065
 stop-0.064
orry-0.064
 mantle-0.063
Token fl
Token ID781
Feature activation-1.34e+00

Pos logit contributions
 avail+0.049
ounced+0.047
 enough+0.043
 batch+0.043
 breed+0.043
Neg logit contributions
 stop-0.031
ibr-0.029
od-0.029
orry-0.028
 clean-0.028
Tokenore
Token ID382
Feature activation-5.35e-01

Pos logit contributions
 avail+0.038
ounced+0.035
 enough+0.033
 batch+0.033
 breed+0.032
Neg logit contributions
 stop-0.034
ibr-0.033
od-0.032
orry-0.032
 clean-0.031
Tokents
Token ID912
Feature activation-1.28e-01

Pos logit contributions
 avail+0.014
 enough+0.014
ounced+0.013
 batch+0.013
 breed+0.013
Neg logit contributions
 stop-0.014
orry-0.013
ibr-0.013
iz-0.013
 mantle-0.013
Token.
Token ID13
Feature activation-6.96e-01

Pos logit contributions
 avail+0.004
ounced+0.004
 enough+0.004
 batch+0.004
 breed+0.004
Neg logit contributions
 stop-0.002
ibr-0.002
orry-0.002
od-0.002
 clean-0.002
Token to
Token ID284
Feature activation-7.53e-01

Pos logit contributions
 avail+0.024
ounced+0.023
 batch+0.022
ited+0.022
 breed+0.021
Neg logit contributions
 stop-0.024
 clean-0.022
 un-0.021
od-0.021
 nicely-0.021
Token explore
Token ID7301
Feature activation-3.58e-01

Pos logit contributions
 avail+0.027
 enough+0.024
ounced+0.024
 breed+0.024
 choice+0.023
Neg logit contributions
ibr-0.013
orry-0.013
od-0.013
 stop-0.013
 Jos-0.012
Token the
Token ID262
Feature activation+2.83e-01

Pos logit contributions
ounced+0.015
 avail+0.015
ited+0.014
ucker+0.014
 batch+0.014
Neg logit contributions
 stop-0.004
 clean-0.004
 un-0.003
 nicely-0.003
od-0.003
Token river
Token ID7850
Feature activation-2.54e-01

Pos logit contributions
 stop+0.007
 clean+0.007
 un+0.006
 mantle+0.006
orry+0.006
Neg logit contributions
 avail-0.008
ounced-0.008
 batch-0.008
ucker-0.008
 dollars-0.008
Token.
Token ID13
Feature activation-2.24e+00

Pos logit contributions
 avail+0.010
ounced+0.009
 batch+0.009
 breed+0.009
 enough+0.009
Neg logit contributions
 stop-0.004
 clean-0.004
od-0.003
 un-0.003
ibr-0.003
Token But
Token ID887
Feature activation-3.05e+00

Pos logit contributions
 avail+0.087
ounced+0.084
ited+0.081
 choice+0.078
 batch+0.078
Neg logit contributions
 stop-0.034
 un-0.030
 clean-0.030
 mantle-0.029
ibr-0.029
Token when
Token ID618
Feature activation-2.42e+00

Pos logit contributions
ounced+0.109
 avail+0.108
ited+0.105
 batch+0.105
aters+0.105
Neg logit contributions
 stop-0.055
 nicely-0.049
 un-0.048
 clean-0.047
 everyone-0.045
Token they
Token ID484
Feature activation+1.72e+00

Pos logit contributions
ited+0.099
ounced+0.098
 types+0.096
 batch+0.095
 avail+0.094
Neg logit contributions
 stop-0.031
 clean-0.026
 un-0.025
 grown-0.024
 everyone-0.024
Token got
Token ID1392
Feature activation+1.14e-01

Pos logit contributions
 stop+0.042
ibr+0.040
od+0.039
orry+0.039
 mantle+0.039
Neg logit contributions
 avail-0.049
ounced-0.048
 enough-0.045
 batch-0.043
ited-0.043
Token there
Token ID612
Feature activation-3.76e-01

Pos logit contributions
 clean+0.002
 stop+0.002
 un+0.002
 outside+0.002
 nicely+0.002
Neg logit contributions
 avail-0.004
 batch-0.004
 dollars-0.004
 breed-0.004
aters-0.004
Token,
Token ID11
Feature activation-2.61e+00

Pos logit contributions
 avail+0.014
ounced+0.013
 batch+0.013
 breed+0.013
ited+0.012
Neg logit contributions
 stop-0.006
ibr-0.006
 clean-0.005
od-0.005
 un-0.005
Token
Token ID198
Feature activation+6.06e-01

Pos logit contributions
 stop+0.005
ibr+0.004
 clean+0.004
od+0.004
orry+0.004
Neg logit contributions
 avail-0.013
ounced-0.012
 enough-0.012
 batch-0.011
ited-0.011
TokenThey
Token ID2990
Feature activation+2.48e-01

Pos logit contributions
 stop+0.010
 clean+0.009
 nicely+0.009
 un+0.009
ibr+0.009
Neg logit contributions
 avail-0.023
ounced-0.022
 enough-0.021
 batch-0.021
 breed-0.021
Token finished
Token ID5201
Feature activation-2.31e-01

Pos logit contributions
ibr+0.006
orry+0.006
od+0.006
 mantle+0.006
iz+0.006
Neg logit contributions
ounced-0.007
 enough-0.006
ited-0.006
 batch-0.006
 avail-0.006
Token their
Token ID511
Feature activation-4.86e-01

Pos logit contributions
 batch+0.007
ounced+0.007
 enough+0.007
 avail+0.007
ucker+0.007
Neg logit contributions
ibr-0.005
orry-0.005
 stop-0.005
iz-0.005
vell-0.005
Token l
Token ID300
Feature activation-2.31e+00

Pos logit contributions
 avail+0.013
ounced+0.012
 enough+0.011
 breed+0.011
grown+0.011
Neg logit contributions
 stop-0.013
 clean-0.012
 un-0.012
ibr-0.012
orry-0.012
Tokenoll
Token ID692
Feature activation-3.05e+00

Pos logit contributions
 avail+0.082
ounced+0.078
 batch+0.074
 enough+0.074
 breed+0.073
Neg logit contributions
 stop-0.042
ibr-0.040
orry-0.039
od-0.038
 clean-0.037
Tokenip
Token ID541
Feature activation-3.00e+00

Pos logit contributions
 avail+0.076
ounced+0.061
 breed+0.060
 properties+0.058
 batch+0.058
Neg logit contributions
 stop-0.090
ibr-0.086
orry-0.084
od-0.084
 nicely-0.083
Tokenops
Token ID2840
Feature activation-1.11e+00

Pos logit contributions
 avail+0.063
ounced+0.060
 enough+0.055
 batch+0.055
 breed+0.053
Neg logit contributions
 stop-0.096
ibr-0.092
orry-0.092
od-0.091
 un-0.090
Token and
Token ID290
Feature activation-2.93e-01

Pos logit contributions
 enough+0.029
 batch+0.027
 dollars+0.027
ounced+0.026
 avail+0.025
Neg logit contributions
ibr-0.029
orry-0.028
 Jos-0.028
iz-0.028
 stop-0.028
Token threw
Token ID9617
Feature activation-1.08e+00

Pos logit contributions
 avail+0.010
ounced+0.009
 problem+0.009
 breed+0.009
 batch+0.009
Neg logit contributions
 stop-0.006
 clean-0.006
 un-0.006
 Kate-0.005
ibr-0.005
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 avail+0.043
ounced+0.040
 breed+0.038
ited+0.038
ucker+0.037
Neg logit contributions
 stop-0.018
 nicely-0.015
 clean-0.015
 Mir-0.014
 Mia-0.014
Token.
Token ID13
Feature activation-6.70e-01

Pos logit contributions
 avail+0.066
ounced+0.064
 breed+0.059
ited+0.058
ucker+0.058
Neg logit contributions
 stop-0.028
 clean-0.024
od-0.024
 mantle-0.023
 nicely-0.023
Token There
Token ID1318
Feature activation-1.77e-01

Pos logit contributions
 avail+0.021
ounced+0.021
 enough+0.019
 batch+0.019
 breed+0.019
Neg logit contributions
 stop-0.015
orry-0.013
ibr-0.013
od-0.013
 clean-0.013
Token were
Token ID547
Feature activation-3.98e-01

Pos logit contributions
 avail+0.006
ounced+0.006
 breed+0.005
ited+0.005
 batch+0.005
Neg logit contributions
 stop-0.004
od-0.003
 clean-0.003
 Kate-0.003
 un-0.003
Token l
Token ID300
Feature activation-2.00e+00

Pos logit contributions
 avail+0.013
ounced+0.012
 breed+0.012
ited+0.012
ucker+0.011
Neg logit contributions
 stop-0.009
 clean-0.008
od-0.008
ibr-0.008
 un-0.008
Tokenoll
Token ID692
Feature activation-2.98e+00

Pos logit contributions
ounced+0.064
 avail+0.063
 enough+0.060
 batch+0.058
 dollars+0.057
Neg logit contributions
 stop-0.042
 clean-0.038
 un-0.038
 mantle-0.037
od-0.037
Tokenip
Token ID541
Feature activation-3.03e+00

Pos logit contributions
 avail+0.074
 breed+0.063
 batch+0.060
 properties+0.058
aters+0.058
Neg logit contributions
 stop-0.087
od-0.086
orry-0.084
ibr-0.084
 clean-0.081
Tokenops
Token ID2840
Feature activation-8.60e-01

Pos logit contributions
 avail+0.076
ounced+0.070
 breed+0.065
 batch+0.064
 enough+0.063
Neg logit contributions
 stop-0.088
od-0.082
ibr-0.082
 clean-0.081
orry-0.081
Token,
Token ID11
Feature activation-1.58e+00

Pos logit contributions
 avail+0.027
ounced+0.027
 enough+0.026
 batch+0.025
 breed+0.024
Neg logit contributions
 stop-0.018
ibr-0.017
orry-0.016
od-0.016
 Jos-0.016
Token ch
Token ID442
Feature activation-8.15e-01

Pos logit contributions
 avail+0.052
ounced+0.050
 enough+0.048
 batch+0.047
 breed+0.047
Neg logit contributions
 stop-0.032
ibr-0.030
orry-0.029
od-0.029
 clean-0.028
Tokenocol
Token ID4668
Feature activation-1.18e+00

Pos logit contributions
 avail+0.020
ounced+0.018
 breed+0.017
 batch+0.017
 enough+0.016
Neg logit contributions
 stop-0.024
orry-0.023
od-0.023
ibr-0.023
 un-0.022
Tokenates
Token ID689
Feature activation-1.09e+00

Pos logit contributions
 avail+0.034
ounced+0.032
 batch+0.030
 breed+0.030
 problem+0.030
Neg logit contributions
 stop-0.028
ibr-0.028
od-0.028
orry-0.027
iz-0.026
Token,
Token ID11
Feature activation-9.69e-01

Pos logit contributions
 stop+0.013
ibr+0.013
orry+0.013
 Kate+0.012
 mantle+0.012
Neg logit contributions
 avail-0.023
ounced-0.023
 enough-0.023
 problem-0.022
 batch-0.021
Token so
Token ID523
Feature activation+2.70e-01

Pos logit contributions
 avail+0.037
ounced+0.036
 batch+0.033
 breed+0.033
ited+0.033
Neg logit contributions
 stop-0.016
 clean-0.014
 un-0.014
od-0.014
 Jos-0.014
Token she
Token ID673
Feature activation+3.75e-01

Pos logit contributions
 stop+0.003
 Kate+0.003
 clean+0.003
 Jos+0.003
 Mia+0.003
Neg logit contributions
ounced-0.011
 avail-0.011
ited-0.011
 breed-0.011
ucker-0.011
Token touched
Token ID12615
Feature activation-2.50e-02

Pos logit contributions
 stop+0.010
ibr+0.010
od+0.010
orry+0.010
 mantle+0.010
Neg logit contributions
ounced-0.009
 avail-0.009
 enough-0.009
 batch-0.008
ited-0.008
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
 avail+0.001
ounced+0.001
 enough+0.001
 batch+0.001
 breed+0.001
Neg logit contributions
 stop-0.001
ibr-0.000
orry-0.000
od-0.000
 clean-0.000
Token empty
Token ID6565
Feature activation+1.74e+00

Pos logit contributions
ibr+0.025
od+0.023
orry+0.022
 stop+0.021
 Kate+0.021
Neg logit contributions
 problem-0.040
ounced-0.038
 avail-0.038
 choice-0.037
 batch-0.037
Token ash
Token ID12530
Feature activation+9.09e-01

Pos logit contributions
ibr+0.035
 stop+0.034
orry+0.033
od+0.033
 mantle+0.031
Neg logit contributions
 avail-0.056
ounced-0.054
 problem-0.053
 batch-0.053
 enough-0.052
Tokentr
Token ID2213
Feature activation-8.01e-01

Pos logit contributions
ibr+0.023
orry+0.023
od+0.021
iz+0.021
 Kate+0.021
Neg logit contributions
 enough-0.024
ounced-0.023
 problem-0.022
 avail-0.022
 batch-0.022
Tokenay
Token ID323
Feature activation+1.73e-01

Pos logit contributions
 avail+0.016
ounced+0.016
 enough+0.014
 batch+0.014
 breed+0.013
Neg logit contributions
 stop-0.026
ibr-0.025
orry-0.025
od-0.025
 clean-0.024
Token.
Token ID13
Feature activation+2.25e-03

Pos logit contributions
 stop+0.004
ibr+0.004
orry+0.004
 mantle+0.003
iz+0.003
Neg logit contributions
 enough-0.005
 avail-0.005
ounced-0.005
 batch-0.005
 breed-0.005
Token Suddenly
Token ID24975
Feature activation-3.70e-01

Pos logit contributions
 stop+0.000
vell+0.000
orry+0.000
od+0.000
ibr+0.000
Neg logit contributions
 batch-0.000
 avail-0.000
ounced-0.000
 Does-0.000
 enough-0.000
Token.
Token ID13
Feature activation+4.89e-01

Pos logit contributions
 stop+0.001
ibr+0.000
od+0.000
orry+0.000
 clean+0.000
Neg logit contributions
 avail-0.001
ounced-0.001
 batch-0.001
 enough-0.001
 breed-0.001
Token
Token ID220
Feature activation+3.28e-01

Pos logit contributions
vell+0.014
 stop+0.014
od+0.013
il+0.013
ibr+0.013
Neg logit contributions
 avail-0.011
ounced-0.010
 breed-0.010
grown-0.010
 batch-0.010
Token
Token ID198
Feature activation+8.28e-01

Pos logit contributions
 stop+0.009
 mer+0.009
 mantle+0.009
 mant+0.008
vell+0.008
Neg logit contributions
ounced-0.008
ucker-0.007
 enough-0.007
aters-0.007
ained-0.007
Token
Token ID198
Feature activation+8.43e-01

Pos logit contributions
 stop+0.025
vell+0.023
orry+0.023
 mantle+0.023
od+0.022
Neg logit contributions
ounced-0.016
Sometimes-0.015
 avail-0.015
Today-0.015
die-0.015
TokenWhen
Token ID2215
Feature activation+1.40e-01

Pos logit contributions
 Jos+0.027
od+0.026
 stop+0.026
 O+0.025
 Mir+0.025
Neg logit contributions
Sometimes-0.016
Today-0.015
ounced-0.014
ited-0.014
Every-0.014
Token she
Token ID673
Feature activation+1.92e+00

Pos logit contributions
ibr+0.004
orry+0.003
od+0.003
 mantle+0.003
vell+0.003
Neg logit contributions
 avail-0.004
 enough-0.004
 problem-0.004
 batch-0.004
ounced-0.003
Token was
Token ID373
Feature activation+2.53e-01

Pos logit contributions
 stop+0.048
ibr+0.047
 mantle+0.046
orry+0.046
od+0.046
Neg logit contributions
 avail-0.052
ounced-0.051
 enough-0.048
ited-0.047
 batch-0.046
Token done
Token ID1760
Feature activation+1.08e+00

Pos logit contributions
ibr+0.006
 mantle+0.006
od+0.006
ilet+0.006
lo+0.006
Neg logit contributions
 enough-0.008
ounced-0.007
 avail-0.006
 problem-0.006
ited-0.006
Token swinging
Token ID25635
Feature activation+8.53e-01

Pos logit contributions
orry+0.033
 mantle+0.032
ibr+0.032
iz+0.032
amps+0.032
Neg logit contributions
 enough-0.026
 avail-0.026
 problem-0.023
joy-0.022
 batch-0.020
Token,
Token ID11
Feature activation-4.11e-01

Pos logit contributions
 stop+0.016
ibr+0.015
orry+0.015
od+0.015
 mantle+0.014
Neg logit contributions
 avail-0.029
ounced-0.028
 enough-0.027
 batch-0.026
 breed-0.026
Token she
Token ID673
Feature activation+4.69e-01

Pos logit contributions
 avail+0.014
 enough+0.012
ounced+0.012
 batch+0.012
 problem+0.011
Neg logit contributions
ibr-0.008
orry-0.008
 stop-0.008
od-0.008
 mantle-0.008
Token he
Token ID339
Feature activation+2.45e-01

Pos logit contributions
 stop+0.002
ibr+0.002
od+0.002
orry+0.002
 clean+0.002
Neg logit contributions
 avail-0.003
ounced-0.003
 enough-0.003
 batch-0.003
 breed-0.003
Token saw
Token ID2497
Feature activation+3.89e-01

Pos logit contributions
 stop+0.006
orry+0.006
 un+0.006
 clean+0.006
od+0.006
Neg logit contributions
 avail-0.007
 breed-0.006
 choice-0.006
 batch-0.006
aters-0.006
Token a
Token ID257
Feature activation-6.54e-01

Pos logit contributions
 stop+0.007
 clean+0.006
orry+0.006
ibr+0.006
od+0.006
Neg logit contributions
 avail-0.015
ounced-0.014
 batch-0.013
ited-0.013
 breed-0.013
Token man
Token ID582
Feature activation-8.28e-01

Pos logit contributions
 avail+0.024
ounced+0.023
aters+0.022
 enough+0.022
ited+0.022
Neg logit contributions
 stop-0.012
 clean-0.011
 un-0.011
 knock-0.011
 grown-0.010
Token inside
Token ID2641
Feature activation-3.56e-01

Pos logit contributions
 avail+0.025
ounced+0.024
 enough+0.023
 batch+0.023
 breed+0.022
Neg logit contributions
ibr-0.018
 stop-0.018
orry-0.017
od-0.017
 mantle-0.017
Token the
Token ID262
Feature activation+1.77e+00

Pos logit contributions
 avail+0.016
ounced+0.016
 dollars+0.014
 batch+0.014
ucker+0.014
Neg logit contributions
 stop-0.004
 clean-0.003
 un-0.003
 Kate-0.003
od-0.003
Token van
Token ID5719
Feature activation-1.71e-01

Pos logit contributions
 stop+0.038
ibr+0.036
od+0.036
orry+0.036
 clean+0.034
Neg logit contributions
 avail-0.056
ounced-0.054
 enough-0.051
 batch-0.050
 breed-0.050
Token who
Token ID508
Feature activation-1.86e-01

Pos logit contributions
 avail+0.006
ounced+0.006
 enough+0.006
 batch+0.006
 breed+0.006
Neg logit contributions
 stop-0.003
ibr-0.003
od-0.003
orry-0.002
 clean-0.002
Token looked
Token ID3114
Feature activation-9.95e-01

Pos logit contributions
 avail+0.005
ounced+0.005
 enough+0.005
 batch+0.005
 breed+0.004
Neg logit contributions
 stop-0.005
ibr-0.005
orry-0.004
od-0.004
 clean-0.004
Token very
Token ID845
Feature activation+1.99e-02

Pos logit contributions
 avail+0.036
ounced+0.033
aters+0.031
 batch+0.031
 breed+0.031
Neg logit contributions
 stop-0.021
 clean-0.019
 un-0.019
 cool-0.017
 nicely-0.016
Token uncomfortable
Token ID12916
Feature activation-1.28e-01

Pos logit contributions
ibr+0.001
 stop+0.001
orry+0.001
od+0.001
 mantle+0.000
Neg logit contributions
 avail-0.001
ounced-0.001
 problem-0.000
 enough-0.000
 choice-0.000
Token They
Token ID1119
Feature activation+4.90e-01

Pos logit contributions
 stop+0.007
ibr+0.006
od+0.006
orry+0.006
 mantle+0.006
Neg logit contributions
 avail-0.011
ounced-0.010
 enough-0.010
 breed-0.010
 batch-0.010
Token did
Token ID750
Feature activation-5.69e-02

Pos logit contributions
 stop+0.013
ibr+0.013
orry+0.012
od+0.012
 mantle+0.012
Neg logit contributions
 avail-0.012
ounced-0.012
 enough-0.011
 batch-0.011
 breed-0.011
Token not
Token ID407
Feature activation+7.42e-02

Pos logit contributions
 avail+0.002
 enough+0.002
 choice+0.002
 problem+0.002
 batch+0.002
Neg logit contributions
ibr-0.001
orry-0.001
 mantle-0.001
od-0.001
iz-0.001
Token like
Token ID588
Feature activation+1.84e+00

Pos logit contributions
ibr+0.002
 mantle+0.002
orry+0.002
vell+0.002
 mant+0.002
Neg logit contributions
 avail-0.002
 enough-0.002
 choice-0.001
 problem-0.001
 seen-0.001
Token harsh
Token ID11859
Feature activation+1.41e+00

Pos logit contributions
ibr+0.054
iz+0.053
orry+0.053
vell+0.050
 mantle+0.049
Neg logit contributions
 enough-0.045
 avail-0.041
 problem-0.041
 batch-0.040
 pistols-0.039
Token people
Token ID661
Feature activation+1.88e+00

Pos logit contributions
 stop+0.036
orry+0.036
ibr+0.036
iz+0.035
od+0.035
Neg logit contributions
 avail-0.038
 enough-0.035
ounced-0.035
 problem-0.033
 batch-0.033
Token.
Token ID13
Feature activation+5.60e-01

Pos logit contributions
iz+0.041
orry+0.040
 Jos+0.040
ibr+0.040
 mantle+0.039
Neg logit contributions
 enough-0.056
 dollars-0.053
ounced-0.053
 avail-0.053
 problem-0.052
Token They
Token ID1119
Feature activation+4.79e-01

Pos logit contributions
 stop+0.013
 mantle+0.012
ibr+0.012
orry+0.012
od+0.012
Neg logit contributions
 avail-0.016
ounced-0.015
 breed-0.015
 enough-0.015
grown-0.014
Token hoped
Token ID10719
Feature activation-4.09e-02

Pos logit contributions
 stop+0.012
ibr+0.012
od+0.012
orry+0.012
 mantle+0.012
Neg logit contributions
 avail-0.012
ounced-0.012
 enough-0.011
 breed-0.011
 batch-0.011
Token they
Token ID484
Feature activation+3.66e-01

Pos logit contributions
 avail+0.001
 enough+0.001
ounced+0.001
 problem+0.001
 breed+0.001
Neg logit contributions
ibr-0.001
orry-0.001
 mantle-0.001
od-0.001
vell-0.001
Token would
Token ID561
Feature activation-1.36e-01

Pos logit contributions
 stop+0.013
ibr+0.012
od+0.012
orry+0.012
 clean+0.012
Neg logit contributions
 avail-0.007
ounced-0.006
 enough-0.005
 batch-0.005
 breed-0.005
Token She
Token ID1375
Feature activation+4.53e-01

Pos logit contributions
 enough+0.005
 avail+0.005
 breed+0.005
 pouch+0.004
ounced+0.004
Neg logit contributions
iz-0.005
orry-0.005
vell-0.005
 stop-0.005
ibr-0.005
Token stretched
Token ID19110
Feature activation-8.24e-02

Pos logit contributions
 stop+0.009
 clean+0.009
 un+0.009
orry+0.009
od+0.009
Neg logit contributions
 avail-0.016
ounced-0.014
 choice-0.013
 breed-0.013
 enough-0.013
Token it
Token ID340
Feature activation+3.87e-01

Pos logit contributions
 avail+0.003
 enough+0.003
ounced+0.002
 breed+0.002
 batch+0.002
Neg logit contributions
ibr-0.002
orry-0.002
 stop-0.002
 mantle-0.002
od-0.001
Token up
Token ID510
Feature activation-1.23e-01

Pos logit contributions
ibr+0.010
orry+0.010
ilet+0.010
iz+0.010
 Jos+0.010
Neg logit contributions
 enough-0.010
ounced-0.010
 avail-0.009
 tighter-0.009
 batch-0.009
Token in
Token ID287
Feature activation+3.77e-01

Pos logit contributions
 avail+0.004
 enough+0.003
ounced+0.003
 batch+0.003
 breed+0.003
Neg logit contributions
ibr-0.003
 stop-0.003
orry-0.003
od-0.003
 mantle-0.003
Token the
Token ID262
Feature activation+1.77e+00

Pos logit contributions
 stop+0.005
ibr+0.004
 clean+0.004
od+0.004
orry+0.004
Neg logit contributions
 avail-0.015
ounced-0.015
 batch-0.014
 enough-0.014
 breed-0.014
Token air
Token ID1633
Feature activation+4.17e-01

Pos logit contributions
ibr+0.056
od+0.055
 Jos+0.050
iz+0.050
 Kate+0.049
Neg logit contributions
 problem-0.039
 avail-0.039
ounced-0.038
 batch-0.037
 enough-0.037
Token and
Token ID290
Feature activation-3.90e-01

Pos logit contributions
ibr+0.010
 stop+0.009
orry+0.009
od+0.009
iz+0.009
Neg logit contributions
 avail-0.012
 enough-0.012
ounced-0.012
 batch-0.011
 breed-0.011
Token then
Token ID788
Feature activation-5.58e-01

Pos logit contributions
 avail+0.010
ounced+0.010
 enough+0.009
 batch+0.009
 breed+0.008
Neg logit contributions
ibr-0.011
 stop-0.010
orry-0.010
iz-0.010
od-0.010
Token stretched
Token ID19110
Feature activation-3.03e-02

Pos logit contributions
ounced+0.015
 avail+0.015
 enough+0.015
 batch+0.013
 problem+0.013
Neg logit contributions
ibr-0.014
 mantle-0.014
 stop-0.013
orry-0.013
iz-0.013
Token it
Token ID340
Feature activation+6.57e-01

Pos logit contributions
 avail+0.001
 enough+0.001
ounced+0.001
 choice+0.001
 breed+0.001
Neg logit contributions
ibr-0.001
orry-0.001
od-0.001
 mantle-0.001
 stop-0.001
Token They
Token ID1119
Feature activation+1.60e-01

Pos logit contributions
ounced+0.010
 enough+0.010
 breed+0.009
 avail+0.009
 problem+0.009
Neg logit contributions
 stop-0.008
orry-0.008
od-0.008
ibr-0.008
 mantle-0.007
Token made
Token ID925
Feature activation-3.29e-01

Pos logit contributions
 stop+0.004
ibr+0.004
od+0.004
orry+0.004
 mantle+0.004
Neg logit contributions
 avail-0.004
ounced-0.004
 enough-0.004
 batch-0.004
 breed-0.004
Token a
Token ID257
Feature activation-2.18e-01

Pos logit contributions
 avail+0.010
 enough+0.010
ounced+0.009
 batch+0.009
 choice+0.009
Neg logit contributions
 stop-0.007
ibr-0.007
 mantle-0.007
orry-0.007
od-0.007
Token lot
Token ID1256
Feature activation+6.62e-02

Pos logit contributions
 avail+0.007
ounced+0.006
 batch+0.006
 enough+0.006
 problem+0.006
Neg logit contributions
 stop-0.005
ibr-0.005
od-0.005
orry-0.005
 Jos-0.004
Token of
Token ID286
Feature activation-1.26e+00

Pos logit contributions
 mantle+0.002
od+0.002
 Jos+0.002
 stop+0.002
orry+0.002
Neg logit contributions
 enough-0.001
ounced-0.001
 batch-0.001
 breed-0.001
 choice-0.001
Token noise
Token ID7838
Feature activation+5.21e-01

Pos logit contributions
 avail+0.048
ounced+0.046
 enough+0.043
ited+0.043
 breed+0.042
Neg logit contributions
 stop-0.021
 clean-0.020
ibr-0.018
orry-0.018
 un-0.018
Token to
Token ID284
Feature activation-8.24e-01

Pos logit contributions
ibr+0.016
 mantle+0.015
orry+0.015
 mant+0.015
ilet+0.015
Neg logit contributions
 enough-0.013
 problem-0.010
grown-0.010
ounced-0.010
 pistols-0.010
Token scare
Token ID19437
Feature activation-4.22e-01

Pos logit contributions
 avail+0.026
 choice+0.023
 breed+0.022
 enough+0.022
ounced+0.021
Neg logit contributions
ibr-0.020
od-0.019
orry-0.018
aund-0.018
 Jos-0.018
Token the
Token ID262
Feature activation+4.58e-01

Pos logit contributions
 enough+0.011
 avail+0.011
 pistols+0.010
ounced+0.010
 breed+0.010
Neg logit contributions
 mantle-0.011
 mant-0.011
ibr-0.010
vell-0.010
 balcony-0.010
Token big
Token ID1263
Feature activation+5.32e-01

Pos logit contributions
 stop+0.010
 clean+0.009
 un+0.008
 knock+0.008
 grown+0.008
Neg logit contributions
 avail-0.017
ounced-0.015
ited-0.014
 enough-0.014
 properties-0.014
Token cat
Token ID3797
Feature activation+6.49e-01

Pos logit contributions
 stop+0.010
 clean+0.009
 knock+0.009
 Mia+0.008
 un+0.008
Neg logit contributions
 avail-0.021
ounced-0.019
ited-0.019
die-0.018
 properties-0.018
Token adventure
Token ID8855
Feature activation+2.30e-01

Pos logit contributions
 avail+0.004
ounced+0.004
 enough+0.003
 batch+0.003
 breed+0.003
Neg logit contributions
 stop-0.003
ibr-0.003
od-0.003
 Kate-0.003
orry-0.003
Token.
Token ID13
Feature activation-2.84e-01

Pos logit contributions
 stop+0.004
ibr+0.004
orry+0.004
od+0.004
 clean+0.004
Neg logit contributions
 avail-0.008
ounced-0.008
 enough-0.007
 batch-0.007
 breed-0.007
Token He
Token ID679
Feature activation-1.87e-01

Pos logit contributions
 avail+0.007
 enough+0.007
ounced+0.007
 batch+0.006
 breed+0.006
Neg logit contributions
 stop-0.008
od-0.007
 clean-0.007
vell-0.007
ibr-0.007
Token knew
Token ID2993
Feature activation-7.68e-01

Pos logit contributions
 avail+0.006
ounced+0.005
 batch+0.005
 breed+0.005
 enough+0.005
Neg logit contributions
 stop-0.004
 clean-0.004
 un-0.004
ibr-0.004
orry-0.004
Token every
Token ID790
Feature activation-9.30e-01

Pos logit contributions
 avail+0.025
ounced+0.024
 enough+0.023
 batch+0.023
 breed+0.022
Neg logit contributions
 stop-0.016
ibr-0.015
od-0.014
orry-0.014
 clean-0.014
Token day
Token ID1110
Feature activation+1.95e-01

Pos logit contributions
ounced+0.037
ucker+0.036
 avail+0.036
aters+0.035
ays+0.034
Neg logit contributions
 stop-0.016
 clean-0.014
 un-0.013
 Kate-0.013
 grown-0.012
Token he
Token ID339
Feature activation+1.81e-01

Pos logit contributions
 stop+0.004
ibr+0.003
od+0.003
orry+0.003
 clean+0.003
Neg logit contributions
 avail-0.007
ounced-0.006
 batch-0.006
 breed-0.006
ited-0.006
Token would
Token ID561
Feature activation-4.62e-01

Pos logit contributions
 stop+0.005
 un+0.005
 clean+0.004
ibr+0.004
orry+0.004
Neg logit contributions
 avail-0.005
ounced-0.005
 batch-0.005
ucker-0.004
 breed-0.004
Token keep
Token ID1394
Feature activation-1.62e-01

Pos logit contributions
 avail+0.015
 enough+0.014
ounced+0.014
 choice+0.014
 breed+0.013
Neg logit contributions
ibr-0.009
orry-0.009
od-0.009
 Jos-0.009
 Kate-0.009
Token trying
Token ID2111
Feature activation+5.76e-02

Pos logit contributions
 avail+0.005
ounced+0.005
 enough+0.005
 batch+0.005
 breed+0.005
Neg logit contributions
 stop-0.003
ibr-0.003
orry-0.003
od-0.003
 mantle-0.003
Token new
Token ID649
Feature activation+4.55e-01

Pos logit contributions
 stop+0.001
 clean+0.001
 un+0.001
od+0.001
 nicely+0.001
Neg logit contributions
 avail-0.002
ounced-0.002
 batch-0.002
ited-0.002
 breed-0.002
Token and
Token ID290
Feature activation-4.92e-01

Pos logit contributions
 stop+0.011
 clean+0.010
ibr+0.009
 nicely+0.009
 un+0.009
Neg logit contributions
 avail-0.022
ounced-0.021
 batch-0.019
 breed-0.019
ucker-0.019
Token she
Token ID673
Feature activation+2.59e-01

Pos logit contributions
 avail+0.018
ounced+0.018
ited+0.017
 dollars+0.017
 breed+0.016
Neg logit contributions
 stop-0.009
 clean-0.008
 Kate-0.008
 un-0.007
 Jos-0.007
Token accidentally
Token ID14716
Feature activation-5.19e-01

Pos logit contributions
 stop+0.007
 clean+0.006
orry+0.006
 un+0.006
od+0.006
Neg logit contributions
 avail-0.008
ounced-0.007
 breed-0.007
 batch-0.007
 dollars-0.006
Token knocked
Token ID13642
Feature activation-2.04e-01

Pos logit contributions
ounced+0.014
 avail+0.014
 enough+0.013
 batch+0.012
 breed+0.012
Neg logit contributions
ibr-0.013
 stop-0.013
od-0.013
orry-0.013
 mantle-0.013
Token over
Token ID625
Feature activation-4.97e-03

Pos logit contributions
 avail+0.008
ounced+0.008
 batch+0.007
 breed+0.007
 enough+0.007
Neg logit contributions
 stop-0.003
 clean-0.003
ibr-0.003
orry-0.003
od-0.003
Token a
Token ID257
Feature activation+3.23e-01

Pos logit contributions
 avail+0.000
ounced+0.000
 enough+0.000
 batch+0.000
 breed+0.000
Neg logit contributions
 stop-0.000
ibr-0.000
od-0.000
orry-0.000
 clean-0.000
Token v
Token ID410
Feature activation+2.03e-01

Pos logit contributions
 stop+0.007
 clean+0.006
 un+0.006
 knock+0.006
 cool+0.006
Neg logit contributions
 avail-0.011
ounced-0.011
ited-0.010
aters-0.010
 enough-0.010
Tokenase
Token ID589
Feature activation+1.33e-01

Pos logit contributions
 stop+0.005
ibr+0.005
od+0.005
 un+0.005
 clean+0.005
Neg logit contributions
 avail-0.005
ounced-0.005
 batch-0.005
 breed-0.005
 enough-0.005
Token.
Token ID13
Feature activation-7.27e-02

Pos logit contributions
ibr+0.003
 stop+0.003
orry+0.003
od+0.003
 Jos+0.002
Neg logit contributions
ounced-0.004
 avail-0.004
 enough-0.004
 batch-0.004
 breed-0.004
Token Her
Token ID2332
Feature activation-7.34e-01

Pos logit contributions
 batch+0.002
 avail+0.002
ounced+0.002
 breed+0.002
 enough+0.002
Neg logit contributions
vell-0.002
iz-0.002
od-0.002
ibr-0.002
il-0.002
Token mom
Token ID1995
Feature activation-2.26e-01

Pos logit contributions
 avail+0.031
ited+0.030
use+0.028
grown+0.028
ounced+0.028
Neg logit contributions
 stop-0.010
 clean-0.010
 un-0.008
 face-0.008
 cool-0.008
Token said
Token ID531
Feature activation-7.56e-02

Pos logit contributions
 avail+0.002
ounced+0.002
 batch+0.002
 breed+0.002
 choice+0.002
Neg logit contributions
 stop-0.002
 clean-0.002
od-0.002
 un-0.002
orry-0.002
Token to
Token ID284
Feature activation+1.65e-01

Pos logit contributions
 avail+0.002
ounced+0.002
 enough+0.002
 problem+0.002
 breed+0.002
Neg logit contributions
 stop-0.002
orry-0.002
 mantle-0.002
ibr-0.002
iz-0.002
Token Jonas
Token ID40458
Feature activation-1.10e+00

Pos logit contributions
ibr+0.004
od+0.004
 stop+0.004
orry+0.004
 Kate+0.004
Neg logit contributions
 avail-0.005
 enough-0.004
ounced-0.004
 breed-0.004
 choice-0.004
Token,
Token ID11
Feature activation-5.65e-01

Pos logit contributions
 enough+0.033
 avail+0.032
ounced+0.032
 problem+0.030
 choice+0.029
Neg logit contributions
ibr-0.025
 stop-0.023
 mantle-0.023
orry-0.023
od-0.022
Token �
Token ID6184
Feature activation+1.23e-01

Pos logit contributions
ounced+0.016
 enough+0.016
 avail+0.016
 breed+0.015
 problem+0.015
Neg logit contributions
ibr-0.013
orry-0.013
 stop-0.012
od-0.012
iz-0.012
Token�
Token ID95
Feature activation+2.39e-01

Pos logit contributions
 Kate+0.004
and+0.004
ibr+0.004
Kate+0.004
 Jos+0.004
Neg logit contributions
tail-0.003
 avail-0.003
ounced-0.002
 breed-0.002
 pouch-0.002
Token€
Token ID26391
Feature activation+6.05e-02

Pos logit contributions
ibr+0.006
 Kate+0.006
 Jos+0.006
 clean+0.006
 nicely+0.006
Neg logit contributions
 avail-0.006
 breed-0.006
tail-0.006
 properties-0.006
 enough-0.006
Token�
Token ID129
Feature activation+1.93e-01

Pos logit contributions
 stop+0.002
ibr+0.002
 mantle+0.002
 clean+0.002
 Kate+0.002
Neg logit contributions
ounced-0.002
 avail-0.001
 enough-0.001
 batch-0.001
 breed-0.001
Token�
Token ID241
Feature activation+2.80e-01

Pos logit contributions
ibr+0.005
 Kate+0.005
 Jos+0.005
 stop+0.005
Kate+0.004
Neg logit contributions
 avail-0.005
ounced-0.005
 enough-0.005
 breed-0.005
 saddle-0.005
TokenYou
Token ID1639
Feature activation+3.66e-01

Pos logit contributions
 stop+0.005
od+0.005
 un+0.005
 Mir+0.005
iz+0.005
Neg logit contributions
 avail-0.010
ounced-0.009
 problem-0.009
 batch-0.009
 enough-0.009
Token can
Token ID460
Feature activation-1.53e+00

Pos logit contributions
 stop+0.011
ibr+0.011
 mantle+0.011
orry+0.011
od+0.011
Neg logit contributions
ounced-0.007
 enough-0.007
 avail-0.007
 problem-0.006
 breed-0.006
Token switch
Token ID5078
Feature activation+6.37e-01

Pos logit contributions
 avail+0.021
ounced+0.020
 enough+0.020
 choice+0.019
 problem+0.019
Neg logit contributions
ibr-0.014
orry-0.013
 Jos-0.013
od-0.012
 mantle-0.012
Token.
Token ID13
Feature activation+4.09e-01

Pos logit contributions
 stop+0.013
ibr+0.012
orry+0.012
od+0.012
 mantle+0.012
Neg logit contributions
 avail-0.021
ounced-0.020
 enough-0.019
 batch-0.018
 problem-0.018
Token That
Token ID1320
Feature activation+4.13e-02

Pos logit contributions
od+0.010
 stop+0.010
vell+0.009
orry+0.009
 clean+0.009
Neg logit contributions
 avail-0.011
 breed-0.011
 enough-0.011
aters-0.010
irms-0.010
Token is
Token ID318
Feature activation+1.12e+00

Pos logit contributions
 stop+0.001
 clean+0.001
 un+0.001
od+0.001
 nicely+0.001
Neg logit contributions
 avail-0.001
ounced-0.001
ited-0.001
grown-0.001
ucker-0.001
Token fair
Token ID3148
Feature activation+3.59e-01

Pos logit contributions
 stop+0.024
ibr+0.024
od+0.024
orry+0.023
 mantle+0.023
Neg logit contributions
 avail-0.035
 enough-0.033
ounced-0.033
 problem-0.031
 batch-0.031
Token."
Token ID526
Feature activation+7.40e-01

Pos logit contributions
 stop+0.006
 clean+0.006
 nicely+0.005
 un+0.005
od+0.005
Neg logit contributions
 avail-0.014
ounced-0.014
ited-0.013
 batch-0.013
 breed-0.012
Token Lily
Token ID20037
Feature activation+4.02e-01

Pos logit contributions
 stop+0.010
ibr+0.009
orry+0.008
od+0.008
 mantle+0.008
Neg logit contributions
 avail-0.030
ounced-0.029
 enough-0.028
 batch-0.028
 breed-0.027
Token's
Token ID338
Feature activation-1.07e+00

Pos logit contributions
 stop+0.012
 clean+0.011
 un+0.011
od+0.011
ibr+0.011
Neg logit contributions
 avail-0.010
ounced-0.009
 breed-0.008
ited-0.008
 batch-0.008
Token mom
Token ID1995
Feature activation+1.02e-02

Pos logit contributions
 avail+0.043
ounced+0.041
 batch+0.040
ited+0.040
 breed+0.039
Neg logit contributions
 stop-0.016
 clean-0.013
orry-0.012
 un-0.012
 nicely-0.012
Token says
Token ID1139
Feature activation-2.20e-01

Pos logit contributions
orry+0.000
 stop+0.000
 mantle+0.000
ibr+0.000
od+0.000
Neg logit contributions
ounced-0.000
 avail-0.000
 enough-0.000
 batch-0.000
 problem-0.000
Token.
Token ID13
Feature activation+9.35e-01

Pos logit contributions
 enough+0.006
ounced+0.006
 dollars+0.006
 avail+0.006
 tighter+0.005
Neg logit contributions
ibr-0.005
orry-0.005
iz-0.005
 Jos-0.005
 mantle-0.005
Tokenmy
Token ID1820
Feature activation+1.80e-01

Pos logit contributions
 avail+0.012
ounced+0.011
 enough+0.010
 breed+0.010
 batch+0.010
Neg logit contributions
 stop-0.014
 clean-0.014
od-0.014
ibr-0.014
 Kate-0.014
Token said
Token ID531
Feature activation-7.57e-02

Pos logit contributions
 stop+0.007
ibr+0.007
orry+0.007
 mantle+0.007
od+0.006
Neg logit contributions
ounced-0.002
 avail-0.002
 batch-0.002
 enough-0.002
 problem-0.002
Token.
Token ID13
Feature activation-5.04e-01

Pos logit contributions
 enough+0.002
 batch+0.002
 choice+0.002
ounced+0.002
 avail+0.002
Neg logit contributions
iz-0.002
orry-0.002
ibr-0.002
lo-0.002
 mantle-0.002
Token
Token ID198
Feature activation-6.53e-01

Pos logit contributions
ounced+0.013
 batch+0.013
 avail+0.012
 enough+0.012
ucker+0.012
Neg logit contributions
orry-0.013
 stop-0.013
od-0.013
ibr-0.012
iz-0.012
Token
Token ID198
Feature activation-5.79e-01

Pos logit contributions
 avail+0.026
ounced+0.025
 enough+0.024
 breed+0.023
ited+0.023
Neg logit contributions
 stop-0.009
ibr-0.009
 clean-0.008
od-0.008
orry-0.008
TokenL
Token ID43
Feature activation-5.39e-01

Pos logit contributions
 avail+0.018
ounced+0.018
 batch+0.017
 enough+0.017
 breed+0.016
Neg logit contributions
 stop-0.012
ibr-0.011
od-0.011
orry-0.011
 Jos-0.011
Tokenily
Token ID813
Feature activation-1.38e-02

Pos logit contributions
 avail+0.025
ounced+0.025
 enough+0.024
 batch+0.024
 breed+0.024
Neg logit contributions
 stop-0.003
ibr-0.002
 clean-0.002
 un-0.002
od-0.002
Token couldn
Token ID3521
Feature activation-7.40e-01

Pos logit contributions
ounced+0.000
 batch+0.000
 enough+0.000
 avail+0.000
ited+0.000
Neg logit contributions
 stop-0.000
ibr-0.000
od-0.000
orry-0.000
lo-0.000
Token't
Token ID470
Feature activation-4.46e-01

Pos logit contributions
 avail+0.018
 enough+0.016
ounced+0.016
 batch+0.016
 problem+0.016
Neg logit contributions
ibr-0.020
orry-0.020
 stop-0.019
od-0.019
vell-0.019
Token wait
Token ID4043
Feature activation-8.56e-01

Pos logit contributions
 avail+0.013
 longer+0.012
 enough+0.011
 choice+0.011
 seen+0.011
Neg logit contributions
ibr-0.011
lo-0.010
orry-0.010
od-0.010
 Mir-0.009
Token to
Token ID284
Feature activation-4.01e-01

Pos logit contributions
 avail+0.018
ounced+0.017
 enough+0.017
 batch+0.016
 problem+0.015
Neg logit contributions
ibr-0.026
 stop-0.026
orry-0.026
od-0.025
 mantle-0.025
Token the
Token ID262
Feature activation-1.85e-01

Pos logit contributions
 stop+0.014
ibr+0.013
orry+0.013
od+0.012
 mantle+0.012
Neg logit contributions
 avail-0.030
ounced-0.028
 enough-0.028
 batch-0.027
 choice-0.027
Token broken
Token ID5445
Feature activation+4.99e-01

Pos logit contributions
 avail+0.005
ounced+0.005
 breed+0.005
 batch+0.005
ucker+0.005
Neg logit contributions
 stop-0.005
 clean-0.004
ibr-0.004
 un-0.004
 mantle-0.004
Token te
Token ID573
Feature activation-5.60e-01

Pos logit contributions
 stop+0.014
 clean+0.013
 un+0.012
 wall+0.012
 nicely+0.012
Neg logit contributions
ounced-0.014
 avail-0.014
ited-0.013
ucker-0.013
use-0.012
Tokenap
Token ID499
Feature activation+5.13e-01

Pos logit contributions
ounced+0.015
 avail+0.014
 enough+0.013
 batch+0.013
 dollars+0.013
Neg logit contributions
 stop-0.015
 clean-0.014
 mantle-0.014
 un-0.014
od-0.014
Tokenot
Token ID313
Feature activation+5.06e-01

Pos logit contributions
 stop+0.009
ibr+0.009
orry+0.009
 mantle+0.009
od+0.009
Neg logit contributions
 avail-0.017
ounced-0.016
 enough-0.016
 batch-0.016
 choice-0.015
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
 Jos+0.013
iz+0.013
orry+0.012
 mantle+0.012
od+0.012
Neg logit contributions
 enough-0.014
 dollars-0.013
able-0.012
 batch-0.012
 problem-0.012
Token
Token ID198
Feature activation-2.38e-01

Pos logit contributions
ounced+0.004
 batch+0.004
ucker+0.004
 avail+0.004
 enough+0.003
Neg logit contributions
 stop-0.004
vell-0.004
ibr-0.003
od-0.003
iz-0.003
Token
Token ID198
Feature activation+3.51e-02

Pos logit contributions
ounced+0.008
 avail+0.007
 batch+0.007
 breed+0.007
 enough+0.007
Neg logit contributions
 stop-0.005
ibr-0.004
od-0.004
orry-0.004
 mantle-0.004
TokenThe
Token ID464
Feature activation-8.88e-01

Pos logit contributions
od+0.001
 stop+0.001
 Jos+0.001
ibr+0.001
vell+0.001
Neg logit contributions
ounced-0.001
 batch-0.001
ited-0.001
Sometimes-0.001
 avail-0.001
Token moral
Token ID6573
Feature activation-2.97e-01

Pos logit contributions
 avail+0.026
ounced+0.025
 enough+0.023
 batch+0.023
 breed+0.022
Neg logit contributions
 stop-0.023
ibr-0.021
 clean-0.021
orry-0.020
 un-0.020
Token of
Token ID286
Feature activation-3.11e-01

Pos logit contributions
 avail+0.012
ounced+0.011
ited+0.011
die+0.010
 batch+0.010
Neg logit contributions
ily-0.005
 clean-0.005
 stop-0.005
iz-0.005
 grown-0.005
Token to
Token ID284
Feature activation-1.26e+00

Pos logit contributions
 avail+0.005
ounced+0.005
 enough+0.005
 batch+0.005
 breed+0.005
Neg logit contributions
 stop-0.004
ibr-0.004
orry-0.003
od-0.003
 clean-0.003
Token Jack
Token ID3619
Feature activation+6.08e-01

Pos logit contributions
 avail+0.045
ounced+0.041
 enough+0.041
 breed+0.040
 batch+0.040
Neg logit contributions
ibr-0.022
orry-0.022
od-0.021
 stop-0.021
iz-0.020
Token's
Token ID338
Feature activation+8.31e-02

Pos logit contributions
 stop+0.010
 clean+0.009
od+0.009
ibr+0.009
 un+0.009
Neg logit contributions
 avail-0.022
ounced-0.022
 batch-0.020
 breed-0.020
 enough-0.020
Token rescue
Token ID9992
Feature activation-7.54e-01

Pos logit contributions
 stop+0.002
 clean+0.002
 un+0.002
 mantle+0.001
 nicely+0.001
Neg logit contributions
 avail-0.003
ounced-0.003
ited-0.003
ucker-0.003
 batch-0.003
Token.
Token ID13
Feature activation-2.54e-01

Pos logit contributions
ounced+0.030
 avail+0.030
ucker+0.027
ited+0.027
 breed+0.027
Neg logit contributions
 stop-0.011
 clean-0.010
 un-0.009
 Kate-0.009
 nicely-0.009
Token He
Token ID679
Feature activation-1.82e-01

Pos logit contributions
 avail+0.008
ounced+0.008
 batch+0.008
 enough+0.008
 breed+0.007
Neg logit contributions
 stop-0.005
ibr-0.005
orry-0.005
od-0.005
 clean-0.005
Token helped
Token ID4193
Feature activation+2.29e-01

Pos logit contributions
 avail+0.006
ounced+0.005
aters+0.005
 batch+0.005
irms+0.005
Neg logit contributions
 stop-0.004
 un-0.004
 clean-0.004
orry-0.004
 knock-0.004
Token Jack
Token ID3619
Feature activation+1.18e+00

Pos logit contributions
ibr+0.005
orry+0.005
od+0.004
vell+0.004
 stop+0.004
Neg logit contributions
 avail-0.007
 enough-0.007
 batch-0.007
 choice-0.006
 pistols-0.006
Token stop
Token ID2245
Feature activation+2.50e-03

Pos logit contributions
ibr+0.028
orry+0.028
od+0.027
 Jos+0.027
 Kate+0.027
Neg logit contributions
 enough-0.034
 saddle-0.030
ounced-0.030
 tighter-0.029
 choice-0.029
Token the
Token ID262
Feature activation+1.60e-01

Pos logit contributions
ibr+0.000
orry+0.000
iz+0.000
od+0.000
 mantle+0.000
Neg logit contributions
 avail-0.000
 enough-0.000
ounced-0.000
 problem-0.000
 batch-0.000
Token boat
Token ID8848
Feature activation+8.56e-01

Pos logit contributions
 stop+0.004
 clean+0.003
 un+0.003
 mantle+0.003
ibr+0.003
Neg logit contributions
ounced-0.005
 avail-0.005
 batch-0.005
ucker-0.005
 breed-0.005
Token
Token ID198
Feature activation+4.14e-01

Pos logit contributions
 stop+0.001
ibr+0.001
orry+0.001
od+0.001
 mantle+0.001
Neg logit contributions
 avail-0.002
ounced-0.002
 batch-0.002
 enough-0.002
 breed-0.002
TokenBut
Token ID1537
Feature activation-6.02e-01

Pos logit contributions
 stop+0.008
ibr+0.008
 Jos+0.008
od+0.008
 Kate+0.008
Neg logit contributions
 avail-0.013
ounced-0.013
 batch-0.012
ited-0.012
 breed-0.012
Token the
Token ID262
Feature activation-5.43e-01

Pos logit contributions
 avail+0.021
ounced+0.021
 batch+0.020
 breed+0.019
ucker+0.019
Neg logit contributions
 stop-0.011
 clean-0.010
 un-0.010
orry-0.010
 nicely-0.009
Token brother
Token ID3956
Feature activation-6.79e-01

Pos logit contributions
ounced+0.016
 avail+0.016
ucker+0.014
aters+0.014
 batch+0.014
Neg logit contributions
 stop-0.015
 clean-0.014
 un-0.013
ibr-0.013
 nicely-0.012
Token said
Token ID531
Feature activation-3.00e-01

Pos logit contributions
 avail+0.019
ounced+0.018
 batch+0.017
 enough+0.017
 breed+0.017
Neg logit contributions
 stop-0.017
ibr-0.016
orry-0.016
od-0.016
 clean-0.016
Token he
Token ID339
Feature activation-8.56e-01

Pos logit contributions
 avail+0.009
ounced+0.009
 enough+0.009
 problem+0.008
 batch+0.008
Neg logit contributions
ibr-0.007
 stop-0.007
 mantle-0.007
orry-0.006
od-0.006
Token would
Token ID561
Feature activation-1.24e+00

Pos logit contributions
 avail+0.034
 batch+0.029
 loaf+0.028
aters+0.028
irms+0.028
Neg logit contributions
 stop-0.017
orry-0.016
 clean-0.015
 un-0.015
od-0.014
Token love
Token ID1842
Feature activation-5.32e-01

Pos logit contributions
 avail+0.043
ounced+0.041
 enough+0.040
 breed+0.039
 batch+0.039
Neg logit contributions
ibr-0.022
 stop-0.022
orry-0.021
od-0.021
iz-0.020
Token to
Token ID284
Feature activation+2.10e-01

Pos logit contributions
 avail+0.019
ounced+0.018
 enough+0.017
 batch+0.017
 breed+0.017
Neg logit contributions
 stop-0.010
ibr-0.009
od-0.009
orry-0.008
 clean-0.008
Token come
Token ID1282
Feature activation+2.87e-01

Pos logit contributions
ibr+0.004
orry+0.004
od+0.004
iz+0.003
 mantle+0.003
Neg logit contributions
 avail-0.007
 enough-0.007
ounced-0.007
 choice-0.007
 breed-0.006
Token to
Token ID284
Feature activation-8.38e-01

Pos logit contributions
 stop+0.006
ibr+0.005
orry+0.005
od+0.005
 clean+0.005
Neg logit contributions
 avail-0.010
ounced-0.009
 enough-0.009
 batch-0.009
 breed-0.009
Token house
Token ID2156
Feature activation+2.87e-01

Pos logit contributions
 avail+0.009
ounced+0.009
 enough+0.008
 batch+0.008
 breed+0.008
Neg logit contributions
ibr-0.005
 stop-0.005
orry-0.005
od-0.005
iz-0.005
Token.
Token ID13
Feature activation+8.66e-01

Pos logit contributions
 stop+0.005
ibr+0.004
orry+0.004
od+0.004
 mantle+0.004
Neg logit contributions
 avail-0.010
ounced-0.010
 enough-0.010
 batch-0.010
 breed-0.009
Token
Token ID198
Feature activation+8.58e-01

Pos logit contributions
vell+0.020
 stop+0.020
ibr+0.019
ilet+0.019
orry+0.018
Neg logit contributions
 batch-0.024
 enough-0.023
 types-0.023
grown-0.023
 avail-0.023
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
 stop+0.022
ibr+0.021
vell+0.020
 mantle+0.020
orry+0.020
Neg logit contributions
ounced-0.021
 avail-0.021
 batch-0.021
grown-0.020
 enough-0.020
TokenThe
Token ID464
Feature activation-8.12e-01

Pos logit contributions
 stop+0.028
ibr+0.028
od+0.027
orry+0.026
vell+0.026
Neg logit contributions
ited-0.026
ounced-0.026
 avail-0.025
 batch-0.025
grown-0.025
Token re
Token ID302
Feature activation-1.55e+00

Pos logit contributions
ounced+0.028
 avail+0.027
ucker+0.026
 batch+0.025
 breed+0.024
Neg logit contributions
 stop-0.018
 clean-0.016
 un-0.016
 mantle-0.015
 cool-0.015
Tokenind
Token ID521
Feature activation-4.40e-02

Pos logit contributions
 avail+0.045
ounced+0.041
 enough+0.040
 batch+0.038
 problem+0.037
Neg logit contributions
od-0.038
orry-0.038
 stop-0.037
 mantle-0.036
ibr-0.036
Tokeneer
Token ID28153
Feature activation+1.63e-01

Pos logit contributions
 breed+0.001
ounced+0.001
 avail+0.001
 types+0.001
 enough+0.001
Neg logit contributions
 stop-0.002
 clean-0.002
ibr-0.002
 Kate-0.002
 un-0.002
Token thanked
Token ID26280
Feature activation+1.46e+00

Pos logit contributions
 stop+0.004
ibr+0.004
od+0.004
 mantle+0.004
orry+0.004
Neg logit contributions
ounced-0.004
 avail-0.004
 enough-0.004
 batch-0.004
ited-0.004
Token Jane
Token ID12091
Feature activation-1.55e-01

Pos logit contributions
ibr+0.033
orry+0.028
vell+0.028
 lyn+0.025
 Vol+0.025
Neg logit contributions
 avail-0.051
 enough-0.048
 choice-0.046
 batch-0.046
 kinds-0.043
Token for
Token ID329
Feature activation-5.49e-01

Pos logit contributions
 enough+0.003
 avail+0.003
ounced+0.003
 batch+0.003
 dollars+0.003
Neg logit contributions
ibr-0.005
 stop-0.005
orry-0.004
iz-0.004
od-0.004
Token,"
Token ID553
Feature activation+1.08e+00

Pos logit contributions
 avail+0.024
ounced+0.022
irms+0.021
ucker+0.021
aters+0.020
Neg logit contributions
 stop-0.016
 clean-0.014
 nicely-0.013
od-0.013
 Mia-0.013
Token Sue
Token ID26089
Feature activation+2.63e-01

Pos logit contributions
 stop+0.010
 Mia+0.009
 Kate+0.008
 clean+0.007
 Mir+0.007
Neg logit contributions
 avail-0.049
ounced-0.049
aters-0.046
ited-0.046
ucker-0.046
Token says
Token ID1139
Feature activation-3.25e-01

Pos logit contributions
 stop+0.009
ibr+0.009
orry+0.009
od+0.009
 clean+0.008
Neg logit contributions
 avail-0.005
ounced-0.005
 enough-0.004
 batch-0.004
 breed-0.004
Token.
Token ID13
Feature activation+6.45e-01

Pos logit contributions
 avail+0.011
ounced+0.010
 enough+0.010
 breed+0.009
 dollars+0.009
Neg logit contributions
ibr-0.006
orry-0.006
 mantle-0.006
 stop-0.006
iz-0.006
Token "
Token ID366
Feature activation+8.83e-02

Pos logit contributions
 stop+0.011
ibr+0.010
orry+0.010
od+0.010
 clean+0.009
Neg logit contributions
 avail-0.024
ounced-0.023
 enough-0.022
 batch-0.022
 breed-0.021
TokenLet
Token ID5756
Feature activation-1.83e+00

Pos logit contributions
 stop+0.003
orry+0.003
ibr+0.002
iz+0.002
 Kate+0.002
Neg logit contributions
 avail-0.002
ounced-0.002
 breed-0.002
 enough-0.002
 batch-0.002
Token's
Token ID338
Feature activation-1.97e-01

Pos logit contributions
 avail+0.066
ounced+0.064
 enough+0.061
 batch+0.060
 problem+0.059
Neg logit contributions
 stop-0.031
ibr-0.030
orry-0.029
od-0.029
 mantle-0.028
Token try
Token ID1949
Feature activation-1.97e+00

Pos logit contributions
 avail+0.006
 enough+0.005
 choice+0.005
 saddle+0.005
ounced+0.005
Neg logit contributions
ibr-0.005
orry-0.004
od-0.004
 Jos-0.004
 Kate-0.004
Token again
Token ID757
Feature activation-1.29e+00

Pos logit contributions
 avail+0.058
ounced+0.056
 batch+0.051
 enough+0.051
 breed+0.051
Neg logit contributions
 stop-0.048
od-0.044
 clean-0.044
orry-0.044
ibr-0.043
Token."
Token ID526
Feature activation+6.59e-01

Pos logit contributions
ounced+0.052
 avail+0.050
ited+0.048
 batch+0.048
ucker+0.047
Neg logit contributions
 stop-0.019
 clean-0.019
 un-0.018
 nicely-0.018
od-0.017
Token
Token ID198
Feature activation+7.43e-01

Pos logit contributions
 stop+0.013
ibr+0.013
orry+0.013
od+0.012
 mantle+0.012
Neg logit contributions
 avail-0.021
ounced-0.021
 enough-0.020
 breed-0.019
 batch-0.019
Token time
Token ID640
Feature activation+1.70e-01

Pos logit contributions
 stop+0.010
ibr+0.009
orry+0.009
od+0.009
 clean+0.009
Neg logit contributions
 avail-0.013
ounced-0.013
 enough-0.012
 batch-0.012
 breed-0.011
Token,
Token ID11
Feature activation-1.50e+00

Pos logit contributions
 mantle+0.004
 stop+0.004
ibr+0.004
orry+0.004
od+0.004
Neg logit contributions
 enough-0.005
 avail-0.005
ounced-0.004
 problem-0.004
grown-0.004
Token past
Token ID1613
Feature activation-3.91e-02

Pos logit contributions
 enough+0.041
 avail+0.039
ounced+0.038
 batch+0.037
 breed+0.037
Neg logit contributions
ibr-0.038
orry-0.036
od-0.036
 stop-0.035
 mantle-0.035
Token many
Token ID867
Feature activation-1.90e-01

Pos logit contributions
 avail+0.001
ounced+0.001
 enough+0.001
 breed+0.001
 batch+0.001
Neg logit contributions
 stop-0.001
ibr-0.001
orry-0.001
 mantle-0.001
 clean-0.001
Token houses
Token ID7777
Feature activation+1.71e-01

Pos logit contributions
 avail+0.006
ounced+0.006
 batch+0.006
 breed+0.006
ucker+0.006
Neg logit contributions
 stop-0.004
ibr-0.004
 un-0.004
 clean-0.004
od-0.004
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
orry+0.004
ibr+0.004
 stop+0.004
 mantle+0.003
od+0.003
Neg logit contributions
 avail-0.005
ounced-0.005
 enough-0.005
 breed-0.005
 problem-0.005
Token trees
Token ID7150
Feature activation+8.35e-01

Pos logit contributions
 avail+0.057
ounced+0.055
 batch+0.051
 enough+0.050
 breed+0.050
Neg logit contributions
 stop-0.038
ibr-0.035
 clean-0.034
 un-0.034
od-0.034
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
 stop+0.016
orry+0.016
ibr+0.015
 mantle+0.015
od+0.015
Neg logit contributions
 avail-0.027
ounced-0.027
 enough-0.026
 batch-0.025
 breed-0.025
Token hills
Token ID18639
Feature activation+4.98e-01

Pos logit contributions
 avail+0.040
ounced+0.039
 batch+0.036
 enough+0.036
 breed+0.035
Neg logit contributions
 stop-0.029
ibr-0.027
od-0.026
 clean-0.026
orry-0.026
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
 stop+0.008
ibr+0.008
orry+0.007
od+0.007
 clean+0.007
Neg logit contributions
 avail-0.018
ounced-0.018
 enough-0.017
 batch-0.017
 breed-0.016
Token Ben
Token ID3932
Feature activation+5.97e-02

Pos logit contributions
 avail+0.033
ounced+0.033
 batch+0.032
 enough+0.032
ucker+0.031
Neg logit contributions
 stop-0.020
vell-0.018
ibr-0.017
 mantle-0.017
 clean-0.017
Token He
Token ID679
Feature activation-2.08e-01

Pos logit contributions
 avail+0.019
ounced+0.018
 enough+0.017
 batch+0.017
 breed+0.017
Neg logit contributions
 stop-0.010
ibr-0.009
od-0.009
orry-0.009
 clean-0.009
Token saw
Token ID2497
Feature activation-5.39e-02

Pos logit contributions
 avail+0.009
aters+0.008
irms+0.008
ays+0.008
ucker+0.007
Neg logit contributions
 un-0.003
 clean-0.003
 stop-0.003
 peek-0.003
 sure-0.003
Token a
Token ID257
Feature activation-5.70e-01

Pos logit contributions
 avail+0.002
ited+0.002
ounced+0.002
 dollars+0.002
ucker+0.002
Neg logit contributions
 stop-0.001
 clean-0.001
 O-0.001
od-0.001
 Mir-0.001
Token pair
Token ID5166
Feature activation-8.96e-01

Pos logit contributions
 avail+0.022
ounced+0.021
ays+0.020
ited+0.020
aters+0.020
Neg logit contributions
 stop-0.010
 clean-0.009
 un-0.008
 knock-0.008
 grown-0.008
Token of
Token ID286
Feature activation-2.42e-01

Pos logit contributions
 avail+0.026
ounced+0.025
 batch+0.023
 breed+0.022
 enough+0.022
Neg logit contributions
 stop-0.023
 clean-0.021
orry-0.021
od-0.021
ibr-0.021
Token scissors
Token ID41786
Feature activation-1.75e+00

Pos logit contributions
 avail+0.007
ounced+0.006
ucker+0.006
 breed+0.006
ited+0.006
Neg logit contributions
 stop-0.007
 clean-0.006
 un-0.006
od-0.006
 cool-0.006
Token on
Token ID319
Feature activation+8.38e-02

Pos logit contributions
 avail+0.070
ounced+0.065
 batch+0.063
 choice+0.062
 breed+0.062
Neg logit contributions
 stop-0.029
od-0.024
 clean-0.024
 un-0.023
orry-0.021
Token the
Token ID262
Feature activation+8.48e-01

Pos logit contributions
 stop+0.001
ibr+0.001
orry+0.001
od+0.001
 clean+0.001
Neg logit contributions
 avail-0.003
ounced-0.003
 enough-0.003
 batch-0.003
 breed-0.003
Token ground
Token ID2323
Feature activation+9.30e-02

Pos logit contributions
ibr+0.021
 Jos+0.020
 Kate+0.020
od+0.020
orry+0.020
Neg logit contributions
 avail-0.024
 enough-0.023
ounced-0.022
 problem-0.022
 batch-0.021
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
 stop+0.002
 clean+0.001
od+0.001
 un+0.001
 nicely+0.001
Neg logit contributions
 avail-0.004
ounced-0.003
 batch-0.003
 breed-0.003
ited-0.003
Token decided
Token ID3066
Feature activation-8.28e-01

Pos logit contributions
 avail+0.034
ounced+0.031
aters+0.029
ucker+0.029
 breed+0.029
Neg logit contributions
 stop-0.023
 un-0.022
 clean-0.022
 Kate-0.020
 Mir-0.020
TokenL
Token ID43
Feature activation-6.67e-01

Pos logit contributions
 avail+0.016
ounced+0.015
 enough+0.014
 breed+0.014
 batch+0.014
Neg logit contributions
 stop-0.004
 clean-0.004
 un-0.004
 nicely-0.003
 tidy-0.003
Tokenily
Token ID813
Feature activation-2.02e-01

Pos logit contributions
 avail+0.030
ounced+0.029
 enough+0.029
 batch+0.028
 choice+0.028
Neg logit contributions
 stop-0.005
 mantle-0.003
od-0.003
ibr-0.003
 un-0.003
Token said
Token ID531
Feature activation-7.84e-02

Pos logit contributions
 avail+0.005
ounced+0.005
 enough+0.005
 batch+0.005
 breed+0.005
Neg logit contributions
 stop-0.005
ibr-0.005
orry-0.005
od-0.005
 mantle-0.005
Token,
Token ID11
Feature activation-2.86e-01

Pos logit contributions
ounced+0.002
 avail+0.002
 enough+0.002
 problem+0.002
 batch+0.002
Neg logit contributions
 mantle-0.002
ibr-0.002
 stop-0.002
orry-0.002
iz-0.002
Token "
Token ID366
Feature activation-2.42e-01

Pos logit contributions
ounced+0.006
 enough+0.006
 breed+0.006
 batch+0.005
 problem+0.005
Neg logit contributions
orry-0.009
iz-0.009
ibr-0.009
 mantle-0.008
amps-0.008
TokenNo
Token ID2949
Feature activation-1.95e+00

Pos logit contributions
ounced+0.008
 avail+0.008
 problem+0.008
 enough+0.008
ucker+0.007
Neg logit contributions
 stop-0.005
 clean-0.005
Her-0.004
 mantle-0.004
 un-0.004
Token,
Token ID11
Feature activation+6.25e-01

Pos logit contributions
ounced+0.074
 avail+0.073
ited+0.069
 batch+0.068
 breed+0.067
Neg logit contributions
 stop-0.033
 clean-0.027
od-0.026
 Kate-0.026
orry-0.026
Token Tommy
Token ID19919
Feature activation+4.59e-01

Pos logit contributions
ibr+0.015
od+0.015
orry+0.015
iz+0.015
 stop+0.014
Neg logit contributions
 avail-0.018
 enough-0.017
ounced-0.016
 batch-0.016
 problem-0.016
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
ibr+0.010
 stop+0.010
 mantle+0.010
orry+0.010
od+0.010
Neg logit contributions
 avail-0.014
ounced-0.013
 enough-0.013
 problem-0.013
 choice-0.013
Token That
Token ID1320
Feature activation-6.51e-01

Pos logit contributions
 stop+0.003
od+0.003
ibr+0.003
iz+0.003
vell+0.003
Neg logit contributions
 Does-0.003
ounced-0.003
 avail-0.003
 breed-0.003
 enough-0.003
Token's
Token ID338
Feature activation+9.22e-01

Pos logit contributions
ounced+0.027
ucker+0.025
etchup+0.025
ited+0.024
 avail+0.024
Neg logit contributions
 stop-0.013
 clean-0.011
 solve-0.010
 wall-0.010
 grown-0.009
Token could
Token ID714
Feature activation-7.41e-01

Pos logit contributions
 avail+0.040
ounced+0.034
 saddle+0.033
 reins+0.033
irms+0.033
Neg logit contributions
 stop-0.024
 clean-0.024
 un-0.023
od-0.022
orry-0.022
Token go
Token ID467
Feature activation-8.67e-01

Pos logit contributions
 avail+0.023
 choice+0.022
 batch+0.022
ounced+0.022
 breed+0.021
Neg logit contributions
ibr-0.015
od-0.015
orry-0.015
iz-0.013
 stop-0.013
Token to
Token ID284
Feature activation-1.21e+00

Pos logit contributions
 avail+0.027
ounced+0.026
 batch+0.023
 breed+0.023
 enough+0.023
Neg logit contributions
 stop-0.021
 clean-0.019
 un-0.019
ibr-0.018
od-0.018
Token the
Token ID262
Feature activation-2.94e-01

Pos logit contributions
 avail+0.048
ounced+0.047
 enough+0.044
 batch+0.044
 breed+0.044
Neg logit contributions
 stop-0.017
ibr-0.015
orry-0.015
od-0.014
 clean-0.014
Token park
Token ID3952
Feature activation-1.42e-01

Pos logit contributions
 avail+0.008
ounced+0.008
 enough+0.007
 batch+0.007
 breed+0.007
Neg logit contributions
 stop-0.007
ibr-0.007
od-0.007
orry-0.007
 clean-0.007
Token,
Token ID11
Feature activation-1.67e+00

Pos logit contributions
 avail+0.004
ounced+0.004
 enough+0.004
 batch+0.004
 breed+0.004
Neg logit contributions
 stop-0.003
ibr-0.003
 clean-0.003
od-0.003
 un-0.003
Token but
Token ID475
Feature activation-1.92e+00

Pos logit contributions
 avail+0.050
ounced+0.049
 enough+0.049
 batch+0.048
 breed+0.046
Neg logit contributions
ibr-0.037
orry-0.035
od-0.035
 stop-0.034
iz-0.033
Token only
Token ID691
Feature activation-1.85e+00

Pos logit contributions
ounced+0.099
 avail+0.095
aters+0.095
ited+0.094
 saddle+0.093
Neg logit contributions
 stop-0.010
 clean-0.008
 Kate-0.006
 nicely-0.005
 Mia-0.005
Token if
Token ID611
Feature activation-1.14e+00

Pos logit contributions
 avail+0.054
ounced+0.053
 batch+0.048
ited+0.047
 breed+0.047
Neg logit contributions
 stop-0.046
 clean-0.042
 un-0.042
ibr-0.041
od-0.041
Token she
Token ID673
Feature activation+1.90e-01

Pos logit contributions
ounced+0.046
 avail+0.045
ited+0.043
 batch+0.043
 breed+0.042
Neg logit contributions
 stop-0.015
 clean-0.014
 Kate-0.013
 un-0.013
 nicely-0.013
Token cleaned
Token ID20750
Feature activation-7.77e-01

Pos logit contributions
 stop+0.005
 un+0.005
 clean+0.005
ibr+0.005
orry+0.005
Neg logit contributions
 avail-0.005
ounced-0.005
 batch-0.005
 enough-0.005
 breed-0.005