TOP ACTIVATIONS
MAX = 0.080

surprise . He threw a tant rum . He kicked and
wanted to increase . By living life , it found joy
tiny little jumps . So oner or later , it was
not toys . They are living things ." Tom said "
not toys . They are living beings . Do you understand
it to me for safe keeping . He told me to
to bugs . They are living things too ."
listen . She threw a tant rum in the store .
a squirrel . It was living in the tree . It
not toys . They are living creatures . You should respect
sad . She threw a tant rum and cried loudly .
owned . He ended up living alone again , but this
stubborn . She throws a tant rum . She kicks and
grandma 's house for safe keeping . When
her feet and threw a tant rum . She grabbed the
his face . He was living his dream and was one
! The mouse had been living in the jewelry box and
not toys . They are living things . They have their
. He kept throwing a tant rum . His mom was
toy . He is a living animal . He has feelings

MIN ACTIVATIONS
MIN = -0.067

When Daisy put the magn ifying glass down , the cater
" Yes , look how high it is flying !"
said , " Let â s go and try
her , " Let â s go home now
stairs and z ig z agged around the hallway !"
said , " Let â s go fishing !"
daughter , " Let â s not waste our
stairs and z ig z agged her way through the hallway
Nicole , " Let â s go on an
friends decided to measure how high they could jump . One
suggested , " Let â s go to the
them up and see how high they could go .
, no matter how " simple " it may seem .
mouse , " Let â s go outside and
my said " Let â s go !" As
big towers and see how high he could make them .
the necklace and the magn ifying glass and started to examine
said , " Let â s go on an
I need to see how high you can raise your arms
the necklace and the magn ifying glass and told him that

INTERVAL 0.043 - 0.080
CONTAINS 0.025%

. It had a long aisle and a big door .
. He laughs and cl aps his hands .
Tim my hugged the t eddy bear and said , "
the seed asked the f auc et , " Can you
L ily took a small bite and to her surprise

INTERVAL 0.007 - 0.043
CONTAINS 25.301%

gaze and saw it too . A giant was walking towards
them !" L ily gets angry . She pushes
down the hill together again and again , until the sun
Just then , a kind voice spoke out of the
Of course , my sweet ies ," she says . "

INTERVAL -0.030 - 0.007
CONTAINS 74.136%

it was perfect ! The shop keeper smiled and
and thanked mum for the special gift . <|endoftext|> Once there
â Billy smiled .
. He looked around desperately and saw a
little girl named Lily . She had a special drawer in

INTERVAL -0.067 - -0.030
CONTAINS 0.538%

We promise !" So , Fin and his friends
that ?" The sun answered back in a bright
go outside and see the sun together , alright ?"
there was a little bunny named Fl oppy . Fl oppy
idea how . So Lucy called out to the
Token surprise
Token ID5975
Feature activation-1.14e-02

Pos logit contributions
 not+0.005
and+0.005
 and+0.004
 sure+0.004
 never+0.004
Neg logit contributions
iful-0.005
ka-0.005
rus-0.005
works-0.005
ulus-0.005
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
iful+0.023
ka+0.022
rus+0.022
ulus+0.022
ked+0.022
Neg logit contributions
 and-0.010
 not-0.010
and-0.010
 about-0.009
 sure-0.009
Token He
Token ID679
Feature activation-1.13e-03

Pos logit contributions
and+0.013
arre+0.012
 knowing+0.012
 sure+0.012
 not+0.012
Neg logit contributions
ka-0.017
ulus-0.017
ocol-0.017
ked-0.017
ifying-0.017
Token threw
Token ID9617
Feature activation+5.29e-03

Pos logit contributions
iful+0.002
ka+0.002
ulus+0.002
works+0.002
ocol+0.002
Neg logit contributions
 not-0.001
 and-0.001
 never-0.001
and-0.001
 sure-0.001
Token a
Token ID257
Feature activation+1.35e-02

Pos logit contributions
and+0.007
 not+0.006
 knowing+0.006
arre+0.006
 aware+0.006
Neg logit contributions
ka-0.008
ked-0.008
ocol-0.008
iful-0.008
rus-0.008
Token tant
Token ID24246
Feature activation+6.72e-02

Pos logit contributions
 not+0.018
and+0.018
 never+0.017
 and+0.017
 sure+0.017
Neg logit contributions
ka-0.020
iful-0.020
rus-0.019
ked-0.019
ocol-0.019
Tokenrum
Token ID6582
Feature activation+1.51e-02

Pos logit contributions
 not+0.066
 and+0.061
and+0.059
 never+0.058
 sure+0.057
Neg logit contributions
ka-0.125
iful-0.123
rus-0.121
ked-0.119
ocol-0.119
Token.
Token ID13
Feature activation+1.58e-02

Pos logit contributions
and+0.016
 think+0.015
 not+0.015
 sure+0.015
arre+0.015
Neg logit contributions
ked-0.025
iful-0.025
rus-0.025
ka-0.025
works-0.024
Token He
Token ID679
Feature activation+6.48e-03

Pos logit contributions
and+0.018
 sure+0.017
arre+0.017
 knowing+0.017
 aware+0.017
Neg logit contributions
ked-0.023
ulus-0.023
ocol-0.023
ka-0.023
icles-0.023
Token kicked
Token ID12165
Feature activation+1.25e-02

Pos logit contributions
 not+0.008
and+0.008
 and+0.007
 sure+0.007
 never+0.007
Neg logit contributions
iful-0.011
ka-0.010
rus-0.010
ulus-0.010
works-0.010
Token and
Token ID290
Feature activation+1.70e-02

Pos logit contributions
and+0.016
 not+0.015
 aware+0.014
 mindful+0.014
 knowing+0.014
Neg logit contributions
ka-0.019
ked-0.019
iful-0.018
rus-0.018
ocol-0.017
Token wanted
Token ID2227
Feature activation+3.43e-03

Pos logit contributions
iful+0.005
rus+0.005
ka+0.005
ulus+0.005
glers+0.005
Neg logit contributions
 not-0.005
 and-0.005
 never-0.004
and-0.004
 sure-0.004
Token to
Token ID284
Feature activation+1.51e-02

Pos logit contributions
 not+0.002
and+0.002
 and+0.002
 sure+0.002
 never+0.002
Neg logit contributions
iful-0.008
ka-0.008
rus-0.008
ulus-0.008
works-0.008
Token increase
Token ID2620
Feature activation+1.53e-02

Pos logit contributions
and+0.020
 not+0.018
 and+0.018
arre+0.017
 sure+0.017
Neg logit contributions
iful-0.022
rus-0.022
ka-0.022
works-0.021
glers-0.021
Token.
Token ID13
Feature activation+1.72e-03

Pos logit contributions
and+0.015
 not+0.015
 and+0.013
 sure+0.013
 think+0.013
Neg logit contributions
iful-0.028
ka-0.027
rus-0.026
works-0.026
ulus-0.026
Token By
Token ID2750
Feature activation+2.42e-02

Pos logit contributions
 not+0.002
and+0.002
 knowing+0.002
 sure+0.002
 and+0.002
Neg logit contributions
ka-0.003
iful-0.003
ulus-0.003
rus-0.003
works-0.003
Token living
Token ID2877
Feature activation+6.65e-02

Pos logit contributions
and+0.033
 not+0.029
 and+0.028
arre+0.028
 sure+0.027
Neg logit contributions
iful-0.038
ka-0.036
ocol-0.035
rus-0.034
works-0.034
Token life
Token ID1204
Feature activation+2.31e-03

Pos logit contributions
and+0.081
aring+0.075
 not+0.075
 cry+0.072
Unfortunately+0.072
Neg logit contributions
iful-0.102
ka-0.099
 avail-0.097
mer-0.095
women-0.094
Token,
Token ID11
Feature activation+7.06e-03

Pos logit contributions
and+0.002
 not+0.002
 sure+0.002
 think+0.002
 knowing+0.001
Neg logit contributions
iful-0.005
rus-0.004
works-0.004
ka-0.004
ulus-0.004
Token it
Token ID340
Feature activation+1.35e-03

Pos logit contributions
and+0.008
 not+0.007
 and+0.007
 think+0.006
 about+0.006
Neg logit contributions
iful-0.012
rus-0.012
ka-0.012
works-0.012
ulus-0.012
Token found
Token ID1043
Feature activation-5.50e-03

Pos logit contributions
and+0.002
 not+0.002
 and+0.002
 knowing+0.002
 about+0.002
Neg logit contributions
iful-0.002
rus-0.002
ka-0.002
ked-0.002
works-0.002
Token joy
Token ID8716
Feature activation-4.95e-03

Pos logit contributions
iful+0.011
ka+0.011
ulus+0.011
rus+0.011
glers+0.011
Neg logit contributions
 not-0.005
 and-0.005
and-0.004
 sure-0.004
 never-0.004
Token tiny
Token ID7009
Feature activation+3.54e-02

Pos logit contributions
 not+0.007
 and+0.006
and+0.006
 sure+0.006
 never+0.006
Neg logit contributions
iful-0.009
ka-0.009
rus-0.009
ulus-0.009
works-0.009
Token little
Token ID1310
Feature activation+3.07e-02

Pos logit contributions
 not+0.041
and+0.040
 sure+0.036
 never+0.036
 and+0.036
Neg logit contributions
iful-0.059
ka-0.058
rus-0.056
ked-0.056
ocol-0.055
Token jumps
Token ID18045
Feature activation-4.45e-03

Pos logit contributions
 not+0.036
and+0.034
 and+0.034
 sure+0.032
 never+0.031
Neg logit contributions
iful-0.052
rus-0.050
ka-0.050
works-0.049
ulus-0.049
Token.
Token ID13
Feature activation+5.16e-03

Pos logit contributions
iful+0.010
ka+0.009
rus+0.009
ked+0.009
works+0.009
Neg logit contributions
 not-0.003
and-0.003
 think-0.003
 never-0.003
 sure-0.002
Token So
Token ID1406
Feature activation-7.74e-03

Pos logit contributions
 not+0.006
and+0.005
 and+0.005
 never+0.005
 sure+0.005
Neg logit contributions
iful-0.009
ka-0.009
oured-0.008
ulus-0.008
ked-0.008
Tokenoner
Token ID14491
Feature activation+6.54e-02

Pos logit contributions
iful+0.012
 rib+0.011
 avail+0.011
ka+0.011
works+0.011
Neg logit contributions
and-0.011
arre-0.009
 not-0.008
Unfortunately-0.008
 knowing-0.008
Token or
Token ID393
Feature activation+3.84e-03

Pos logit contributions
 not+0.054
and+0.051
 Sadly+0.050
arre+0.050
Unfortunately+0.049
Neg logit contributions
iful-0.130
ka-0.128
ked-0.125
mer-0.123
ocol-0.121
Token later
Token ID1568
Feature activation+8.68e-03

Pos logit contributions
and+0.004
 not+0.004
 and+0.004
 think+0.004
 about+0.004
Neg logit contributions
iful-0.007
ka-0.006
rus-0.006
works-0.006
ked-0.006
Token,
Token ID11
Feature activation+1.36e-02

Pos logit contributions
and+0.010
 not+0.008
 think+0.008
aps+0.008
 aware+0.007
Neg logit contributions
iful-0.014
ka-0.014
glers-0.014
ked-0.014
works-0.014
Token it
Token ID340
Feature activation-3.29e-03

Pos logit contributions
and+0.018
arre+0.016
 not+0.016
 think+0.015
aps+0.015
Neg logit contributions
iful-0.020
 usual-0.019
 rib-0.019
ked-0.018
glers-0.018
Token was
Token ID373
Feature activation+6.99e-03

Pos logit contributions
iful+0.005
ka+0.004
works+0.004
glers+0.004
rus+0.004
Neg logit contributions
and-0.005
 not-0.004
 knowing-0.004
 and-0.004
 think-0.004
Token not
Token ID407
Feature activation+6.72e-03

Pos logit contributions
and+0.015
 not+0.015
 and+0.014
 sure+0.013
 think+0.013
Neg logit contributions
iful-0.036
ka-0.035
rus-0.035
ulus-0.034
works-0.034
Token toys
Token ID14958
Feature activation+3.18e-02

Pos logit contributions
 not+0.007
and+0.007
 and+0.007
 sure+0.006
 never+0.006
Neg logit contributions
iful-0.012
ka-0.011
rus-0.011
works-0.011
ulus-0.011
Token.
Token ID13
Feature activation+6.17e-03

Pos logit contributions
 not+0.033
and+0.033
 sure+0.030
 and+0.029
 think+0.029
Neg logit contributions
iful-0.056
rus-0.054
ka-0.054
works-0.053
ulus-0.053
Token They
Token ID1119
Feature activation+4.85e-04

Pos logit contributions
 not+0.005
and+0.005
 and+0.005
 sure+0.004
 never+0.004
Neg logit contributions
iful-0.013
ka-0.012
rus-0.012
ulus-0.012
works-0.012
Token are
Token ID389
Feature activation+1.84e-02

Pos logit contributions
and+0.001
 not+0.001
 knowing+0.001
 and+0.001
 about+0.000
Neg logit contributions
iful-0.001
rus-0.001
works-0.001
ka-0.001
ked-0.001
Token living
Token ID2877
Feature activation+6.47e-02

Pos logit contributions
and+0.018
 not+0.016
 and+0.016
 sure+0.015
 think+0.015
Neg logit contributions
iful-0.034
ka-0.033
rus-0.033
works-0.033
ulus-0.032
Token things
Token ID1243
Feature activation+3.07e-02

Pos logit contributions
Unfortunately+0.069
and+0.068
 Sadly+0.067
 Unfortunately+0.066
Sadly+0.065
Neg logit contributions
mer-0.107
ka-0.106
iful-0.106
ked-0.099
works-0.098
Token."
Token ID526
Feature activation+4.77e-03

Pos logit contributions
and+0.027
 not+0.025
arre+0.023
Unfortunately+0.023
 think+0.023
Neg logit contributions
iful-0.057
works-0.056
rus-0.056
ked-0.055
ka-0.055
Token Tom
Token ID4186
Feature activation+3.36e-03

Pos logit contributions
and+0.005
 not+0.005
 sure+0.005
 never+0.005
 aware+0.004
Neg logit contributions
ked-0.008
ulus-0.008
glers-0.008
ka-0.008
oured-0.007
Token said
Token ID531
Feature activation-3.54e-03

Pos logit contributions
and+0.005
 not+0.004
 knowing+0.004
 about+0.004
 many+0.004
Neg logit contributions
ked-0.005
ka-0.005
rus-0.005
iful-0.005
oured-0.005
Token "
Token ID366
Feature activation+9.91e-04

Pos logit contributions
ked+0.006
iful+0.006
ka+0.005
rus+0.005
works+0.005
Neg logit contributions
and-0.004
 not-0.004
 think-0.004
arre-0.004
 never-0.003
Token not
Token ID407
Feature activation+7.23e-03

Pos logit contributions
and+0.011
 not+0.009
 and+0.009
 think+0.008
 sure+0.008
Neg logit contributions
iful-0.034
ka-0.034
rus-0.033
works-0.033
ocol-0.033
Token toys
Token ID14958
Feature activation+3.15e-02

Pos logit contributions
and+0.007
 not+0.007
 and+0.007
 never+0.006
 think+0.006
Neg logit contributions
iful-0.013
ka-0.013
rus-0.012
ked-0.012
works-0.012
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
and+0.029
 not+0.027
 sure+0.026
Unfortunately+0.026
 think+0.025
Neg logit contributions
iful-0.057
rus-0.055
works-0.055
mer-0.054
ked-0.054
Token They
Token ID1119
Feature activation+3.67e-03

Pos logit contributions
 not+0.012
and+0.012
 sure+0.011
 and+0.011
 knowing+0.011
Neg logit contributions
works-0.023
ulus-0.023
iful-0.023
ka-0.023
ocol-0.023
Token are
Token ID389
Feature activation+1.85e-02

Pos logit contributions
and+0.005
 knowing+0.004
 not+0.004
Unfortunately+0.004
 and+0.004
Neg logit contributions
iful-0.006
rus-0.006
works-0.006
ka-0.006
glers-0.006
Token living
Token ID2877
Feature activation+6.47e-02

Pos logit contributions
and+0.016
 not+0.013
 and+0.013
Unfortunately+0.012
arre+0.012
Neg logit contributions
iful-0.036
ka-0.035
rus-0.035
works-0.035
ocol-0.035
Token beings
Token ID9791
Feature activation+1.08e-02

Pos logit contributions
Unfortunately+0.086
 Unfortunately+0.082
Sadly+0.079
 Sadly+0.079
and+0.078
Neg logit contributions
mer-0.095
ka-0.094
iful-0.090
ked-0.081
ifying-0.081
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
and+0.011
Unfortunately+0.010
 not+0.010
 Sadly+0.010
 Unfortunately+0.010
Neg logit contributions
iful-0.019
works-0.018
rus-0.017
ka-0.017
glers-0.017
Token Do
Token ID2141
Feature activation+8.88e-03

Pos logit contributions
 not+0.011
and+0.011
 sure+0.010
 and+0.010
 knowing+0.010
Neg logit contributions
works-0.020
ulus-0.020
ka-0.020
ocol-0.020
iful-0.020
Token you
Token ID345
Feature activation+7.25e-03

Pos logit contributions
and+0.009
 and+0.009
 not+0.008
 knowing+0.008
 sure+0.008
Neg logit contributions
iful-0.015
rus-0.015
works-0.015
ka-0.015
ulus-0.015
Token understand
Token ID1833
Feature activation+1.90e-03

Pos logit contributions
and+0.007
Unfortunately+0.005
 knowing+0.005
 office+0.005
 Sadly+0.005
Neg logit contributions
iful-0.013
ka-0.013
works-0.012
ocol-0.012
ked-0.012
Token it
Token ID340
Feature activation+5.53e-03

Pos logit contributions
 not+0.019
 and+0.018
and+0.017
 never+0.016
 sure+0.016
Neg logit contributions
iful-0.042
ulus-0.041
rus-0.040
glers-0.040
works-0.039
Token to
Token ID284
Feature activation+5.10e-03

Pos logit contributions
 not+0.003
 and+0.003
and+0.003
 never+0.002
 many+0.002
Neg logit contributions
iful-0.013
rus-0.012
ulus-0.012
ka-0.012
glers-0.012
Token me
Token ID502
Feature activation+1.83e-02

Pos logit contributions
 not+0.006
 and+0.006
and+0.006
 never+0.006
 sure+0.006
Neg logit contributions
iful-0.009
ka-0.008
rus-0.008
ulus-0.008
ked-0.008
Token for
Token ID329
Feature activation+8.76e-03

Pos logit contributions
 not+0.015
 and+0.014
and+0.014
 sure+0.012
 never+0.012
Neg logit contributions
iful-0.038
ka-0.037
rus-0.037
ulus-0.037
glers-0.036
Token safe
Token ID3338
Feature activation-5.99e-03

Pos logit contributions
 not+0.009
and+0.008
 and+0.008
 never+0.007
 sure+0.007
Neg logit contributions
iful-0.017
ka-0.016
rus-0.016
ulus-0.016
works-0.016
Tokenkeeping
Token ID19934
Feature activation+6.45e-02

Pos logit contributions
iful+0.009
ka+0.009
mer+0.009
works+0.009
ked+0.009
Neg logit contributions
and-0.007
Unfortunately-0.007
arre-0.007
 not-0.007
 sure-0.006
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
and+0.089
Unfortunately+0.088
arre+0.084
Sadly+0.084
 Sadly+0.082
Neg logit contributions
ked-0.083
iful-0.077
ka-0.077
mer-0.076
works-0.075
Token He
Token ID679
Feature activation+2.67e-03

Pos logit contributions
 not+0.012
and+0.012
 and+0.011
 sure+0.011
 never+0.011
Neg logit contributions
ka-0.024
iful-0.023
works-0.023
ulus-0.023
rus-0.023
Token told
Token ID1297
Feature activation+1.58e-02

Pos logit contributions
and+0.003
 not+0.003
 and+0.003
 knowing+0.003
 sure+0.003
Neg logit contributions
ka-0.004
iful-0.004
rus-0.004
works-0.004
ked-0.004
Token me
Token ID502
Feature activation+2.00e-02

Pos logit contributions
and+0.020
 not+0.020
 and+0.019
 sure+0.018
 never+0.018
Neg logit contributions
iful-0.024
ka-0.023
rus-0.023
works-0.023
ocol-0.023
Token to
Token ID284
Feature activation+1.57e-02

Pos logit contributions
and+0.015
arre+0.014
 not+0.012
Unfortunately+0.012
 sure+0.012
Neg logit contributions
iful-0.040
works-0.039
rus-0.039
ka-0.038
glers-0.038
Token to
Token ID284
Feature activation-5.75e-03

Pos logit contributions
Unfortunately+0.006
Sadly+0.006
 Sadly+0.006
 Unfortunately+0.005
 sure+0.005
Neg logit contributions
ked-0.008
iful-0.008
mer-0.008
ka-0.008
women-0.008
Token bugs
Token ID11316
Feature activation+2.22e-02

Pos logit contributions
iful+0.008
rus+0.008
ka+0.008
ked+0.008
ulus+0.008
Neg logit contributions
and-0.008
 not-0.007
 and-0.007
arre-0.007
 sure-0.007
Token.
Token ID13
Feature activation+8.13e-03

Pos logit contributions
and+0.022
Unfortunately+0.022
 Sadly+0.022
 Unfortunately+0.021
arre+0.020
Neg logit contributions
iful-0.036
mer-0.036
glers-0.035
rus-0.035
ked-0.035
Token They
Token ID1119
Feature activation+2.99e-04

Pos logit contributions
and+0.009
 not+0.008
 knowing+0.008
 sure+0.008
 and+0.008
Neg logit contributions
ulus-0.013
ocol-0.013
ka-0.013
ked-0.013
works-0.013
Token are
Token ID389
Feature activation+1.52e-02

Pos logit contributions
and+0.000
 not+0.000
 knowing+0.000
 about+0.000
 and+0.000
Neg logit contributions
iful-0.000
rus-0.000
ka-0.000
ked-0.000
works-0.000
Token living
Token ID2877
Feature activation+6.39e-02

Pos logit contributions
and+0.010
 and+0.007
 not+0.007
arre+0.007
 think+0.007
Neg logit contributions
iful-0.033
rus-0.032
ka-0.032
ked-0.032
ocol-0.032
Token things
Token ID1243
Feature activation+3.05e-02

Pos logit contributions
Unfortunately+0.077
 Unfortunately+0.076
and+0.074
 Sadly+0.073
Sadly+0.071
Neg logit contributions
mer-0.099
ka-0.093
iful-0.090
ked-0.086
ocol-0.084
Token too
Token ID1165
Feature activation+2.05e-02

Pos logit contributions
and+0.026
arre+0.024
Unfortunately+0.023
 not+0.023
 Sadly+0.022
Neg logit contributions
rus-0.056
ked-0.055
iful-0.055
works-0.055
ka-0.054
Token."
Token ID526
Feature activation+5.34e-03

Pos logit contributions
 not+0.022
and+0.021
 and+0.021
 sure+0.019
 never+0.019
Neg logit contributions
iful-0.037
ka-0.036
rus-0.036
works-0.035
ulus-0.035
Token
Token ID198
Feature activation+1.24e-02

Pos logit contributions
 sure+0.005
 aware+0.005
 mindful+0.005
arre+0.005
and+0.005
Neg logit contributions
ulus-0.008
ked-0.008
glers-0.008
redible-0.008
women-0.007
Token
Token ID198
Feature activation+6.87e-03

Pos logit contributions
and+0.007
 not+0.007
 sure+0.007
arre+0.006
 never+0.006
Neg logit contributions
rus-0.025
ulus-0.025
iful-0.025
ka-0.025
glers-0.025
Token listen
Token ID6004
Feature activation+1.06e-02

Pos logit contributions
and+0.009
Unfortunately+0.008
 Sadly+0.008
 Unfortunately+0.008
aps+0.008
Neg logit contributions
iful-0.012
ka-0.012
 matter-0.012
ked-0.011
works-0.011
Token.
Token ID13
Feature activation+9.46e-03

Pos logit contributions
and+0.010
Unfortunately+0.010
 think+0.010
arre+0.009
Sadly+0.009
Neg logit contributions
ked-0.017
iful-0.016
ka-0.016
 usual-0.016
mer-0.016
Token She
Token ID1375
Feature activation+3.42e-03

Pos logit contributions
and+0.008
 not+0.008
 never+0.008
 sure+0.008
 think+0.007
Neg logit contributions
ulus-0.017
iful-0.016
ked-0.016
ka-0.016
oured-0.016
Token threw
Token ID9617
Feature activation+4.00e-03

Pos logit contributions
and+0.004
 not+0.004
 and+0.004
 sure+0.004
 knowing+0.004
Neg logit contributions
iful-0.005
ka-0.005
rus-0.005
ked-0.005
glers-0.005
Token a
Token ID257
Feature activation+1.69e-02

Pos logit contributions
and+0.005
 not+0.005
 never+0.004
 sure+0.004
 knowing+0.004
Neg logit contributions
ka-0.006
iful-0.006
ked-0.006
rus-0.006
ocol-0.006
Token tant
Token ID24246
Feature activation+6.36e-02

Pos logit contributions
 not+0.019
and+0.018
 and+0.017
 never+0.017
 sure+0.017
Neg logit contributions
iful-0.029
ka-0.029
rus-0.028
ked-0.027
ulus-0.027
Tokenrum
Token ID6582
Feature activation+1.36e-02

Pos logit contributions
 not+0.063
 and+0.057
and+0.056
 never+0.055
 sure+0.054
Neg logit contributions
ka-0.119
iful-0.117
rus-0.114
ked-0.113
ocol-0.113
Token in
Token ID287
Feature activation+1.02e-02

Pos logit contributions
and+0.011
 think+0.011
arre+0.011
 sure+0.010
 story+0.010
Neg logit contributions
ked-0.026
iful-0.026
ka-0.025
rus-0.025
works-0.024
Token the
Token ID262
Feature activation+2.52e-02

Pos logit contributions
and+0.013
 not+0.013
arre+0.012
 knowing+0.012
 sure+0.012
Neg logit contributions
ka-0.015
ocol-0.015
iful-0.014
ked-0.014
ifying-0.014
Token store
Token ID3650
Feature activation+2.70e-02

Pos logit contributions
and+0.038
 not+0.037
 sure+0.034
 never+0.034
 and+0.034
Neg logit contributions
ka-0.034
iful-0.034
rus-0.033
glers-0.033
works-0.032
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
and+0.019
 sure+0.017
 think+0.016
Unfortunately+0.016
arre+0.016
Neg logit contributions
ked-0.054
glers-0.053
iful-0.053
mer-0.053
rus-0.053
Token a
Token ID257
Feature activation+7.88e-04

Pos logit contributions
 not+0.010
 and+0.008
 about+0.007
 with+0.007
 never+0.006
Neg logit contributions
iful-0.030
oured-0.029
rus-0.029
ka-0.028
works-0.028
Token squirrel
Token ID33039
Feature activation+1.60e-03

Pos logit contributions
 not+0.001
 and+0.001
and+0.001
 never+0.001
 think+0.001
Neg logit contributions
iful-0.001
ka-0.001
ked-0.001
rus-0.001
oured-0.001
Token.
Token ID13
Feature activation+6.55e-03

Pos logit contributions
 not+0.002
and+0.002
 and+0.002
 sure+0.001
 never+0.001
Neg logit contributions
iful-0.003
rus-0.003
ka-0.003
glers-0.003
ked-0.003
Token It
Token ID632
Feature activation-9.57e-03

Pos logit contributions
 not+0.006
and+0.006
 and+0.006
 never+0.006
 sure+0.005
Neg logit contributions
iful-0.012
ka-0.012
rus-0.012
ked-0.012
ulus-0.012
Token was
Token ID373
Feature activation+1.37e-02

Pos logit contributions
iful+0.025
ocol+0.024
works+0.023
oured+0.022
ifying+0.022
Neg logit contributions
 and-0.008
 not-0.008
 never-0.006
 with-0.006
-0.005
Token living
Token ID2877
Feature activation+6.35e-02

Pos logit contributions
 not+0.013
 and+0.011
 about+0.010
 with+0.010
 never+0.010
Neg logit contributions
iful-0.031
works-0.029
rus-0.029
oured-0.029
ka-0.029
Token in
Token ID287
Feature activation+3.97e-03

Pos logit contributions
and+0.064
Unfortunately+0.063
 not+0.061
 understand+0.061
 think+0.061
Neg logit contributions
ka-0.114
iful-0.110
mer-0.107
ocol-0.105
ked-0.105
Token the
Token ID262
Feature activation+2.03e-02

Pos logit contributions
and+0.005
 not+0.005
 and+0.004
 sure+0.004
 never+0.004
Neg logit contributions
iful-0.007
ka-0.007
rus-0.006
ocol-0.006
ulus-0.006
Token tree
Token ID5509
Feature activation+1.82e-02

Pos logit contributions
and+0.030
 not+0.029
 sure+0.029
 about+0.028
arre+0.027
Neg logit contributions
rus-0.026
iful-0.026
ulus-0.025
ka-0.025
ifying-0.025
Token.
Token ID13
Feature activation+7.22e-03

Pos logit contributions
and+0.022
 sure+0.021
 not+0.021
arre+0.020
Unfortunately+0.020
Neg logit contributions
iful-0.027
glers-0.027
rus-0.026
ulus-0.026
ked-0.026
Token It
Token ID632
Feature activation-1.03e-02

Pos logit contributions
 not+0.008
and+0.008
 never+0.007
 and+0.007
 sure+0.007
Neg logit contributions
ka-0.012
iful-0.012
ked-0.012
ulus-0.012
rus-0.012
Token not
Token ID407
Feature activation+5.28e-03

Pos logit contributions
and+0.009
 and+0.006
Unfortunately+0.006
 not+0.006
arre+0.006
Neg logit contributions
iful-0.026
rus-0.025
ka-0.025
works-0.025
ocol-0.025
Token toys
Token ID14958
Feature activation+3.00e-02

Pos logit contributions
and+0.005
 not+0.005
 and+0.005
 never+0.005
 think+0.004
Neg logit contributions
iful-0.010
ka-0.009
rus-0.009
ked-0.009
works-0.009
Token.
Token ID13
Feature activation+6.00e-03

Pos logit contributions
and+0.032
 not+0.031
 sure+0.029
 think+0.028
Unfortunately+0.028
Neg logit contributions
iful-0.051
rus-0.049
works-0.049
ked-0.049
ka-0.049
Token They
Token ID1119
Feature activation-1.04e-03

Pos logit contributions
 not+0.006
and+0.006
 sure+0.005
 knowing+0.005
 and+0.005
Neg logit contributions
works-0.011
ulus-0.011
ocol-0.010
iful-0.010
ka-0.010
Token are
Token ID389
Feature activation+1.74e-02

Pos logit contributions
iful+0.002
rus+0.002
works+0.002
glers+0.002
ka+0.002
Neg logit contributions
and-0.001
 knowing-0.001
 not-0.001
Unfortunately-0.001
 about-0.001
Token living
Token ID2877
Feature activation+6.34e-02

Pos logit contributions
and+0.016
Unfortunately+0.013
arre+0.012
 Sadly+0.012
 Unfortunately+0.012
Neg logit contributions
iful-0.033
rus-0.032
works-0.032
ocol-0.032
ka-0.032
Token creatures
Token ID8109
Feature activation+3.04e-02

Pos logit contributions
Unfortunately+0.084
 Unfortunately+0.082
 Sadly+0.080
and+0.079
Sadly+0.078
Neg logit contributions
mer-0.092
ka-0.090
iful-0.088
ocol-0.082
ked-0.081
Token.
Token ID13
Feature activation+6.41e-03

Pos logit contributions
and+0.032
Unfortunately+0.031
 not+0.029
 Unfortunately+0.029
 Sadly+0.028
Neg logit contributions
mer-0.050
iful-0.048
ked-0.047
ifying-0.047
works-0.047
Token You
Token ID921
Feature activation+6.15e-03

Pos logit contributions
 not+0.006
and+0.006
 sure+0.006
 knowing+0.006
 and+0.006
Neg logit contributions
works-0.011
ulus-0.011
iful-0.011
ocol-0.011
ka-0.011
Token should
Token ID815
Feature activation+1.45e-02

Pos logit contributions
and+0.008
 not+0.008
 knowing+0.008
 and+0.007
 about+0.007
Neg logit contributions
iful-0.009
rus-0.009
ka-0.009
works-0.009
ked-0.008
Token respect
Token ID2461
Feature activation+1.24e-02

Pos logit contributions
and+0.023
 knowing+0.016
Unfortunately+0.016
arre+0.015
 Sadly+0.015
Neg logit contributions
glers-0.020
works-0.020
rus-0.019
ocol-0.019
 avail-0.019
Token sad
Token ID6507
Feature activation+3.23e-03

Pos logit contributions
and+0.014
 not+0.013
 and+0.013
 sure+0.012
 about+0.012
Neg logit contributions
iful-0.018
ked-0.018
ka-0.018
rus-0.018
glers-0.017
Token.
Token ID13
Feature activation+4.59e-03

Pos logit contributions
and+0.005
arre+0.005
Unfortunately+0.005
Sadly+0.004
 think+0.004
Neg logit contributions
ked-0.004
iful-0.004
rus-0.003
works-0.003
ted-0.003
Token She
Token ID1375
Feature activation-2.79e-04

Pos logit contributions
and+0.004
 sure+0.004
 not+0.003
 aware+0.003
 knowing+0.003
Neg logit contributions
ulus-0.008
ked-0.008
oured-0.008
redible-0.007
 usual-0.007
Token threw
Token ID9617
Feature activation+3.76e-03

Pos logit contributions
iful+0.000
ka+0.000
rus+0.000
ked+0.000
ulus+0.000
Neg logit contributions
and-0.000
 not-0.000
 and-0.000
 sure-0.000
 about-0.000
Token a
Token ID257
Feature activation+1.37e-02

Pos logit contributions
and+0.005
 not+0.005
 knowing+0.004
 never+0.004
arre+0.004
Neg logit contributions
ka-0.006
iful-0.005
ked-0.005
rus-0.005
ocol-0.005
Token tant
Token ID24246
Feature activation+6.33e-02

Pos logit contributions
 not+0.016
and+0.015
 never+0.014
 and+0.014
 sure+0.014
Neg logit contributions
ka-0.023
iful-0.023
rus-0.022
ked-0.022
ocol-0.021
Tokenrum
Token ID6582
Feature activation+1.36e-02

Pos logit contributions
 not+0.064
 and+0.059
and+0.056
 never+0.055
 sure+0.054
Neg logit contributions
ka-0.117
iful-0.116
rus-0.112
ked-0.112
ocol-0.111
Token and
Token ID290
Feature activation+9.98e-03

Pos logit contributions
and+0.010
 think+0.009
 story+0.009
arre+0.009
 sure+0.009
Neg logit contributions
ked-0.029
iful-0.027
ka-0.027
rus-0.027
ted-0.025
Token cried
Token ID16896
Feature activation-4.48e-03

Pos logit contributions
and+0.014
 story+0.013
aps+0.013
 not+0.012
arre+0.012
Neg logit contributions
ked-0.014
rus-0.013
iful-0.013
ka-0.013
glers-0.013
Token loudly
Token ID23112
Feature activation+1.87e-02

Pos logit contributions
iful+0.009
ked+0.008
ka+0.008
rus+0.008
ulus+0.008
Neg logit contributions
 not-0.004
and-0.004
 think-0.003
 never-0.003
 and-0.003
Token.
Token ID13
Feature activation+9.39e-03

Pos logit contributions
 not+0.017
and+0.017
 and+0.016
 sure+0.015
 think+0.015
Neg logit contributions
iful-0.035
ka-0.034
rus-0.034
ked-0.034
works-0.033
Token owned
Token ID6898
Feature activation-1.73e-02

Pos logit contributions
 not+0.010
 and+0.010
 never+0.010
 sure+0.009
and+0.009
Neg logit contributions
iful-0.014
ulus-0.014
ka-0.014
ocol-0.013
rus-0.013
Token.
Token ID13
Feature activation+3.34e-03

Pos logit contributions
iful+0.034
ka+0.033
rus+0.032
ulus+0.032
works+0.032
Neg logit contributions
 not-0.017
 and-0.017
and-0.015
 never-0.014
 sure-0.014
Token He
Token ID679
Feature activation-2.18e-03

Pos logit contributions
 not+0.004
and+0.004
 think+0.004
 never+0.004
 story+0.004
Neg logit contributions
oured-0.005
rus-0.004
ifying-0.004
ulus-0.004
ked-0.004
Token ended
Token ID4444
Feature activation+1.70e-02

Pos logit contributions
rus+0.003
ka+0.003
iful+0.003
ked+0.003
oured+0.003
Neg logit contributions
and-0.003
 not-0.003
aps-0.002
arre-0.002
 about-0.002
Token up
Token ID510
Feature activation+2.22e-02

Pos logit contributions
and+0.030
irk+0.026
 story+0.025
arre+0.024
Unfortunately+0.024
Neg logit contributions
ked-0.018
ka-0.018
oured-0.018
rolled-0.018
ened-0.018
Token living
Token ID2877
Feature activation+6.23e-02

Pos logit contributions
and+0.025
Unfortunately+0.022
 think+0.020
Sadly+0.020
 sure+0.020
Neg logit contributions
mer-0.033
ocol-0.033
 usual-0.033
ked-0.032
ened-0.032
Token alone
Token ID3436
Feature activation+2.66e-02

Pos logit contributions
aring+0.080
and+0.079
age+0.077
oll+0.077
uts+0.075
Neg logit contributions
mer-0.085
ened-0.080
ka-0.077
 coloured-0.070
led-0.068
Token again
Token ID757
Feature activation+1.40e-02

Pos logit contributions
and+0.021
 think+0.018
 not+0.018
Unfortunately+0.017
 sure+0.017
Neg logit contributions
mer-0.053
iful-0.051
ked-0.049
ka-0.049
rus-0.048
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
and+0.015
Unfortunately+0.013
arre+0.013
Sadly+0.013
 not+0.012
Neg logit contributions
mer-0.022
iful-0.021
ocol-0.021
 usual-0.020
ifying-0.020
Token but
Token ID475
Feature activation+7.75e-03

Pos logit contributions
and+0.017
aps+0.016
arre+0.015
 story+0.013
ulk+0.013
Neg logit contributions
ked-0.026
 spe-0.025
 usual-0.025
oured-0.025
 lit-0.025
Token this
Token ID428
Feature activation+1.32e-02

Pos logit contributions
and+0.011
arre+0.010
aps+0.009
aring+0.009
 think+0.008
Neg logit contributions
 usual-0.011
mer-0.010
iful-0.010
rus-0.009
 avail-0.009
Token stubborn
Token ID24444
Feature activation+8.32e-03

Pos logit contributions
and+0.024
 and+0.019
Unfortunately+0.018
arre+0.018
 think+0.018
Neg logit contributions
iful-0.049
ka-0.049
ked-0.049
ulus-0.048
rus-0.048
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
Unfortunately+0.012
Sadly+0.012
 Sadly+0.012
and+0.012
 Unfortunately+0.011
Neg logit contributions
iful-0.010
ked-0.010
ka-0.009
mer-0.009
glers-0.008
Token She
Token ID1375
Feature activation+4.44e-03

Pos logit contributions
and+0.011
 not+0.011
 never+0.011
 sure+0.011
arre+0.010
Neg logit contributions
ulus-0.021
ifying-0.020
ka-0.020
works-0.019
ocol-0.019
Token throws
Token ID12542
Feature activation+9.14e-03

Pos logit contributions
and+0.005
 not+0.004
 knowing+0.004
 and+0.004
 sure+0.004
Neg logit contributions
ka-0.007
iful-0.007
rus-0.007
ked-0.007
ulus-0.007
Token a
Token ID257
Feature activation+1.28e-02

Pos logit contributions
and+0.012
 not+0.011
 knowing+0.010
 never+0.010
 sure+0.010
Neg logit contributions
ka-0.014
iful-0.014
ked-0.014
ocol-0.013
rus-0.013
Token tant
Token ID24246
Feature activation+6.22e-02

Pos logit contributions
 not+0.015
and+0.015
 and+0.014
 never+0.014
 sure+0.014
Neg logit contributions
iful-0.021
ka-0.021
rus-0.020
ocol-0.020
ked-0.020
Tokenrum
Token ID6582
Feature activation+1.26e-02

Pos logit contributions
 not+0.062
 and+0.058
and+0.056
 never+0.055
 sure+0.054
Neg logit contributions
ka-0.114
iful-0.113
rus-0.110
ocol-0.109
ked-0.108
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
and+0.015
 think+0.014
 sure+0.014
 not+0.014
arre+0.014
Neg logit contributions
iful-0.020
ked-0.019
ka-0.019
rus-0.019
works-0.019
Token She
Token ID1375
Feature activation+9.85e-03

Pos logit contributions
and+0.014
 sure+0.013
 never+0.013
 not+0.013
 knowing+0.013
Neg logit contributions
ulus-0.027
ked-0.026
ka-0.025
iful-0.025
women-0.025
Token kicks
Token ID17364
Feature activation+1.70e-02

Pos logit contributions
and+0.015
 knowing+0.012
Unfortunately+0.011
 sure+0.011
 aware+0.011
Neg logit contributions
ka-0.015
glers-0.015
rus-0.014
ked-0.014
ocol-0.014
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
 not+0.019
and+0.019
 never+0.017
 knowing+0.017
 sure+0.017
Neg logit contributions
iful-0.028
ka-0.028
ked-0.027
rus-0.027
ocol-0.026
Token grandma
Token ID49890
Feature activation+1.24e-02

Pos logit contributions
 not+0.000
 and+0.000
 never+0.000
 many+0.000
 story+0.000
Neg logit contributions
iful-0.000
ka-0.000
rus-0.000
works-0.000
ked-0.000
Token's
Token ID338
Feature activation+2.01e-02

Pos logit contributions
 and+0.009
 not+0.008
and+0.007
 about+0.007
 sure+0.006
Neg logit contributions
iful-0.028
ka-0.028
rus-0.027
works-0.027
ulus-0.027
Token house
Token ID2156
Feature activation+1.35e-02

Pos logit contributions
 not+0.021
and+0.021
 and+0.019
 sure+0.019
 about+0.019
Neg logit contributions
iful-0.034
rus-0.034
ka-0.033
ulus-0.033
works-0.033
Token for
Token ID329
Feature activation+6.53e-03

Pos logit contributions
and+0.010
 not+0.009
 sure+0.009
arre+0.009
 think+0.009
Neg logit contributions
iful-0.027
ked-0.026
rus-0.026
glers-0.026
works-0.025
Token safe
Token ID3338
Feature activation-9.43e-03

Pos logit contributions
 not+0.007
 and+0.006
and+0.006
 sure+0.005
 never+0.005
Neg logit contributions
iful-0.013
ka-0.013
rus-0.012
works-0.012
ulus-0.012
Tokenkeeping
Token ID19934
Feature activation+6.21e-02

Pos logit contributions
iful+0.016
ked+0.015
ka+0.015
glers+0.015
works+0.015
Neg logit contributions
and-0.010
 not-0.009
Unfortunately-0.009
arre-0.009
 sure-0.009
Token.
Token ID13
Feature activation+1.55e-03

Pos logit contributions
Unfortunately+0.079
and+0.078
arre+0.076
Sadly+0.075
 sure+0.075
Neg logit contributions
ked-0.083
iful-0.081
oured-0.079
rus-0.079
ka-0.076
Token
Token ID220
Feature activation-4.40e-03

Pos logit contributions
 not+0.002
and+0.002
 story+0.002
 never+0.002
 think+0.002
Neg logit contributions
oured-0.002
rus-0.002
glers-0.002
ulus-0.002
ked-0.002
Token
Token ID198
Feature activation+4.54e-03

Pos logit contributions
iful+0.011
ka+0.010
rus+0.010
ulus+0.010
works+0.010
Neg logit contributions
 not-0.002
and-0.002
 and-0.002
 sure-0.001
 never-0.001
Token
Token ID198
Feature activation-2.24e-03

Pos logit contributions
 not+0.004
 knowing+0.003
 heart+0.003
 cry+0.003
 think+0.003
Neg logit contributions
glers-0.008
rus-0.008
redible-0.008
ulus-0.008
ka-0.008
TokenWhen
Token ID2215
Feature activation-5.28e-03

Pos logit contributions
rus+0.003
redible+0.003
glers+0.003
ulus+0.003
ka+0.003
Neg logit contributions
 not-0.003
 about-0.003
 and-0.003
 heart-0.003
and-0.003
Token her
Token ID607
Feature activation+1.83e-02

Pos logit contributions
and+0.024
 not+0.024
Unfortunately+0.022
arre+0.022
 story+0.022
Neg logit contributions
ked-0.030
ka-0.029
iful-0.028
 rib-0.028
ocol-0.027
Token feet
Token ID3625
Feature activation+1.46e-02

Pos logit contributions
and+0.026
arre+0.024
 sure+0.024
Unfortunately+0.024
 not+0.023
Neg logit contributions
ked-0.025
rus-0.024
ka-0.024
 usual-0.024
iful-0.023
Token and
Token ID290
Feature activation+8.87e-03

Pos logit contributions
 story+0.010
 think+0.009
and+0.009
 wakes+0.009
aps+0.009
Neg logit contributions
ked-0.029
iful-0.027
oured-0.027
glers-0.026
rus-0.026
Token threw
Token ID9617
Feature activation+1.34e-03

Pos logit contributions
and+0.013
 story+0.011
 not+0.011
arre+0.011
 about+0.011
Neg logit contributions
ked-0.013
rus-0.012
ka-0.012
glers-0.012
iful-0.011
Token a
Token ID257
Feature activation+9.02e-03

Pos logit contributions
 not+0.002
and+0.002
 and+0.001
 sure+0.001
 never+0.001
Neg logit contributions
iful-0.002
ka-0.002
rus-0.002
ked-0.002
ocol-0.002
Token tant
Token ID24246
Feature activation+6.11e-02

Pos logit contributions
 not+0.009
and+0.009
 and+0.008
 never+0.008
 sure+0.008
Neg logit contributions
ka-0.016
iful-0.016
rus-0.016
ked-0.015
ocol-0.015
Tokenrum
Token ID6582
Feature activation+9.62e-03

Pos logit contributions
 not+0.061
 and+0.055
and+0.053
 never+0.052
 sure+0.051
Neg logit contributions
ka-0.114
iful-0.113
rus-0.109
ked-0.109
ocol-0.109
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
and+0.011
 not+0.010
 think+0.010
 sure+0.010
arre+0.009
Neg logit contributions
iful-0.016
ked-0.016
ka-0.016
rus-0.016
works-0.015
Token She
Token ID1375
Feature activation+8.28e-03

Pos logit contributions
and+0.011
 not+0.011
 sure+0.010
 think+0.010
 never+0.010
Neg logit contributions
ked-0.017
iful-0.017
ulus-0.017
ka-0.017
rus-0.017
Token grabbed
Token ID13176
Feature activation+1.64e-02

Pos logit contributions
and+0.011
 not+0.010
 and+0.010
 knowing+0.010
 sure+0.010
Neg logit contributions
rus-0.012
ka-0.012
ked-0.012
iful-0.012
glers-0.012
Token the
Token ID262
Feature activation+2.49e-02

Pos logit contributions
 not+0.015
 and+0.015
and+0.014
 sure+0.014
 many+0.013
Neg logit contributions
iful-0.032
rus-0.031
works-0.031
ulus-0.031
ka-0.030
Token his
Token ID465
Feature activation+4.96e-03

Pos logit contributions
 not+0.006
and+0.006
 and+0.006
 sure+0.006
 never+0.006
Neg logit contributions
iful-0.011
ka-0.010
rus-0.010
works-0.010
ulus-0.010
Token face
Token ID1986
Feature activation-1.77e-02

Pos logit contributions
 not+0.006
 and+0.006
and+0.006
 never+0.006
 sure+0.006
Neg logit contributions
iful-0.008
ka-0.008
rus-0.008
works-0.007
ulus-0.007
Token.
Token ID13
Feature activation+2.15e-03

Pos logit contributions
iful+0.035
ka+0.035
rus+0.034
ulus+0.034
works+0.034
Neg logit contributions
 not-0.015
and-0.014
 and-0.014
 sure-0.012
 never-0.012
Token He
Token ID679
Feature activation-3.36e-03

Pos logit contributions
 not+0.002
and+0.002
 and+0.001
 never+0.001
 sure+0.001
Neg logit contributions
iful-0.004
ka-0.004
rus-0.004
works-0.004
ocol-0.004
Token was
Token ID373
Feature activation+1.28e-02

Pos logit contributions
iful+0.005
ka+0.005
rus+0.004
mer+0.004
ked+0.004
Neg logit contributions
and-0.005
 not-0.004
 knowing-0.004
 about-0.004
 and-0.004
Token living
Token ID2877
Feature activation+6.09e-02

Pos logit contributions
and+0.012
arre+0.009
 not+0.009
 think+0.008
 and+0.008
Neg logit contributions
iful-0.025
ka-0.024
oured-0.023
rus-0.023
ked-0.023
Token his
Token ID465
Feature activation+1.62e-04

Pos logit contributions
 cry+0.066
Sadly+0.065
Unfortunately+0.064
aring+0.064
 Sadly+0.063
Neg logit contributions
mer-0.101
ka-0.097
iful-0.097
ifying-0.089
 usual-0.085
Token dream
Token ID4320
Feature activation-1.74e-03

Pos logit contributions
 not+0.000
 and+0.000
and+0.000
 sure+0.000
 never+0.000
Neg logit contributions
iful-0.000
ka-0.000
rus-0.000
works-0.000
ulus-0.000
Token and
Token ID290
Feature activation+2.41e-03

Pos logit contributions
iful+0.003
rus+0.003
ka+0.003
works+0.003
glers+0.003
Neg logit contributions
 not-0.002
and-0.002
 and-0.001
 think-0.001
 sure-0.001
Token was
Token ID373
Feature activation+8.90e-03

Pos logit contributions
and+0.003
arre+0.003
aps+0.003
Unfortunately+0.003
 not+0.002
Neg logit contributions
iful-0.004
ka-0.003
glers-0.003
 usual-0.003
mer-0.003
Token one
Token ID530
Feature activation-4.05e-03

Pos logit contributions
and+0.008
 not+0.007
 and+0.006
 think+0.006
arre+0.006
Neg logit contributions
iful-0.017
ka-0.017
rus-0.016
glers-0.016
oured-0.016
Token!
Token ID0
Feature activation-8.36e-03

Pos logit contributions
and+0.018
 not+0.018
arre+0.016
 think+0.016
 sure+0.016
Neg logit contributions
iful-0.017
rus-0.017
glers-0.017
ka-0.017
mer-0.016
Token The
Token ID383
Feature activation+2.68e-03

Pos logit contributions
iful+0.016
ka+0.016
rus+0.016
ulus+0.015
ocol+0.015
Neg logit contributions
 not-0.007
and-0.007
 and-0.006
 never-0.006
 sure-0.006
Token mouse
Token ID10211
Feature activation+7.04e-03

Pos logit contributions
 not+0.004
 and+0.004
 many+0.003
 story+0.003
 sure+0.003
Neg logit contributions
iful-0.004
ka-0.004
ked-0.004
oured-0.004
ocol-0.004
Token had
Token ID550
Feature activation+1.86e-03

Pos logit contributions
and+0.009
 not+0.009
 knowing+0.008
 think+0.008
 sure+0.008
Neg logit contributions
rus-0.010
iful-0.010
ka-0.010
ked-0.010
glers-0.010
Token been
Token ID587
Feature activation+6.89e-03

Pos logit contributions
and+0.002
 not+0.002
 and+0.002
 think+0.002
 sure+0.002
Neg logit contributions
iful-0.003
ka-0.003
rus-0.003
ked-0.003
ulus-0.003
Token living
Token ID2877
Feature activation+6.06e-02

Pos logit contributions
and+0.006
 not+0.006
 and+0.005
 think+0.005
 never+0.005
Neg logit contributions
iful-0.013
ka-0.013
rus-0.013
ked-0.013
ulus-0.013
Token in
Token ID287
Feature activation+5.94e-03

Pos logit contributions
arre+0.081
Sadly+0.078
aring+0.076
age+0.076
Unfortunately+0.074
Neg logit contributions
mer-0.084
ka-0.083
iful-0.081
ifying-0.078
 usual-0.076
Token the
Token ID262
Feature activation+1.60e-02

Pos logit contributions
and+0.009
arre+0.008
aring+0.008
Unfortunately+0.007
apping+0.007
Neg logit contributions
ocol-0.008
 avail-0.008
ka-0.008
iful-0.008
ifying-0.008
Token jewelry
Token ID22634
Feature activation+9.96e-03

Pos logit contributions
and+0.019
 not+0.019
 and+0.018
 sure+0.017
 about+0.017
Neg logit contributions
iful-0.026
rus-0.026
ka-0.025
ulus-0.025
works-0.025
Token box
Token ID3091
Feature activation+1.63e-02

Pos logit contributions
and+0.009
 not+0.009
 think+0.009
 sure+0.008
arre+0.008
Neg logit contributions
iful-0.019
rus-0.018
glers-0.018
oured-0.017
ka-0.017
Token and
Token ID290
Feature activation-1.51e-03

Pos logit contributions
and+0.014
 think+0.013
arre+0.013
 not+0.013
 sure+0.012
Neg logit contributions
iful-0.031
rus-0.030
glers-0.029
oured-0.029
ka-0.029
Token not
Token ID407
Feature activation-6.20e-04

Pos logit contributions
and+0.012
 not+0.011
 and+0.011
 sure+0.010
 think+0.010
Neg logit contributions
iful-0.021
ka-0.020
rus-0.020
works-0.020
ulus-0.019
Token toys
Token ID14958
Feature activation+2.60e-02

Pos logit contributions
iful+0.001
ka+0.001
rus+0.001
ked+0.001
works+0.001
Neg logit contributions
and-0.001
 not-0.001
 and-0.001
 never-0.001
Unfortunately-0.001
Token.
Token ID13
Feature activation-2.01e-03

Pos logit contributions
and+0.025
 not+0.024
 sure+0.023
 think+0.022
 and+0.021
Neg logit contributions
iful-0.047
rus-0.046
ka-0.045
works-0.045
glers-0.045
Token They
Token ID1119
Feature activation-8.10e-03

Pos logit contributions
iful+0.004
works+0.004
ulus+0.004
ka+0.004
rus+0.004
Neg logit contributions
 not-0.002
and-0.002
 and-0.002
 sure-0.002
 never-0.002
Token are
Token ID389
Feature activation+1.74e-02

Pos logit contributions
iful+0.013
works+0.012
rus+0.012
ka+0.012
glers+0.012
Neg logit contributions
and-0.010
 knowing-0.009
 not-0.009
Unfortunately-0.009
 about-0.009
Token living
Token ID2877
Feature activation+6.04e-02

Pos logit contributions
and+0.019
 not+0.016
Unfortunately+0.016
arre+0.016
 and+0.016
Neg logit contributions
iful-0.030
ka-0.029
rus-0.029
works-0.029
glers-0.029
Token things
Token ID1243
Feature activation+3.05e-02

Pos logit contributions
Unfortunately+0.073
 Unfortunately+0.069
Sadly+0.068
 Sadly+0.068
and+0.068
Neg logit contributions
ka-0.094
iful-0.093
mer-0.093
works-0.086
ocol-0.086
Token.
Token ID13
Feature activation+4.09e-03

Pos logit contributions
and+0.029
arre+0.027
Unfortunately+0.026
 not+0.026
 Sadly+0.025
Neg logit contributions
iful-0.053
works-0.053
rus-0.052
ked-0.052
ka-0.051
Token They
Token ID1119
Feature activation-3.74e-03

Pos logit contributions
 not+0.004
and+0.004
 and+0.004
 sure+0.004
 knowing+0.003
Neg logit contributions
works-0.007
iful-0.007
ulus-0.007
ka-0.007
ked-0.007
Token have
Token ID423
Feature activation+3.39e-03

Pos logit contributions
iful+0.006
works+0.006
rus+0.006
ka+0.005
glers+0.005
Neg logit contributions
and-0.005
 knowing-0.004
Unfortunately-0.004
 not-0.004
 about-0.004
Token their
Token ID511
Feature activation+4.09e-03

Pos logit contributions
and+0.004
 not+0.004
 and+0.004
 sure+0.003
 think+0.003
Neg logit contributions
iful-0.006
ka-0.006
rus-0.006
works-0.005
ked-0.005
Token.
Token ID13
Feature activation+8.38e-03

Pos logit contributions
arre+0.019
Sadly+0.019
Unfortunately+0.019
and+0.019
 Sadly+0.019
Neg logit contributions
ked-0.012
 usual-0.012
iful-0.011
bell-0.011
glers-0.011
Token He
Token ID679
Feature activation+1.85e-03

Pos logit contributions
and+0.013
arre+0.012
 sure+0.012
 never+0.012
 not+0.011
Neg logit contributions
ked-0.009
ka-0.009
women-0.008
ulus-0.008
icles-0.008
Token kept
Token ID4030
Feature activation+1.43e-02

Pos logit contributions
 not+0.002
and+0.002
 and+0.002
 never+0.002
 sure+0.002
Neg logit contributions
iful-0.003
ka-0.003
rus-0.003
ulus-0.003
works-0.003
Token throwing
Token ID9644
Feature activation+7.02e-03

Pos logit contributions
and+0.020
Unfortunately+0.016
Sadly+0.016
arre+0.016
 Sadly+0.015
Neg logit contributions
ked-0.022
ocol-0.021
ka-0.021
ries-0.021
rus-0.020
Token a
Token ID257
Feature activation+1.07e-02

Pos logit contributions
 not+0.008
and+0.008
 never+0.007
 sure+0.007
 and+0.007
Neg logit contributions
iful-0.011
ka-0.011
rus-0.011
ked-0.011
ocol-0.011
Token tant
Token ID24246
Feature activation+6.03e-02

Pos logit contributions
 not+0.013
and+0.013
 and+0.013
 sure+0.012
 never+0.012
Neg logit contributions
iful-0.017
ka-0.016
rus-0.016
ulus-0.016
works-0.016
Tokenrum
Token ID6582
Feature activation+1.13e-02

Pos logit contributions
 not+0.060
 and+0.055
and+0.054
 never+0.053
 sure+0.051
Neg logit contributions
ka-0.111
iful-0.110
rus-0.107
ked-0.106
ocol-0.106
Token.
Token ID13
Feature activation+1.56e-02

Pos logit contributions
and+0.014
arre+0.014
 think+0.014
 sure+0.013
 not+0.013
Neg logit contributions
iful-0.017
ked-0.016
rus-0.016
ka-0.016
works-0.015
Token His
Token ID2399
Feature activation-2.31e-03

Pos logit contributions
and+0.020
 sure+0.019
arre+0.018
 mindful+0.018
 aware+0.018
Neg logit contributions
ked-0.021
ulus-0.020
icles-0.020
 usual-0.020
women-0.020
Token mom
Token ID1995
Feature activation+1.50e-02

Pos logit contributions
rus+0.003
ka+0.003
iful+0.003
ked+0.003
glers+0.003
Neg logit contributions
and-0.003
 sure-0.003
 not-0.003
arre-0.003
 about-0.003
Token was
Token ID373
Feature activation+1.80e-02

Pos logit contributions
and+0.020
 not+0.019
 think+0.018
 knowing+0.018
 many+0.017
Neg logit contributions
ked-0.022
rus-0.022
ka-0.021
iful-0.021
oured-0.021
Token toy
Token ID13373
Feature activation+7.26e-03

Pos logit contributions
iful+0.001
rus+0.001
mer+0.001
ka+0.001
ulus+0.001
Neg logit contributions
and-0.001
 not-0.001
Unfortunately-0.001
 about-0.001
aps-0.001
Token.
Token ID13
Feature activation+7.18e-03

Pos logit contributions
 not+0.007
 and+0.006
and+0.006
 never+0.006
 sure+0.006
Neg logit contributions
iful-0.014
ka-0.014
rus-0.014
ulus-0.013
works-0.013
Token He
Token ID679
Feature activation-7.49e-04

Pos logit contributions
 not+0.007
and+0.007
 and+0.006
 knowing+0.006
 sure+0.006
Neg logit contributions
iful-0.013
ocol-0.013
ka-0.013
works-0.012
ulus-0.012
Token is
Token ID318
Feature activation+2.67e-02

Pos logit contributions
iful+0.001
ka+0.001
rus+0.001
works+0.001
glers+0.001
Neg logit contributions
and-0.001
 not-0.001
 knowing-0.001
Unfortunately-0.001
 and-0.001
Token a
Token ID257
Feature activation+1.16e-03

Pos logit contributions
and+0.029
arre+0.024
Unfortunately+0.023
 Sadly+0.022
 and+0.021
Neg logit contributions
iful-0.046
ka-0.046
ocol-0.046
rus-0.045
ked-0.043
Token living
Token ID2877
Feature activation+6.01e-02

Pos logit contributions
and+0.002
 not+0.002
 about+0.001
arre+0.001
aps+0.001
Neg logit contributions
rus-0.002
iful-0.002
mer-0.002
ka-0.002
ulus-0.002
Token animal
Token ID5044
Feature activation+2.96e-03

Pos logit contributions
Unfortunately+0.081
Sadly+0.078
 Sadly+0.077
 Unfortunately+0.076
and+0.075
Neg logit contributions
mer-0.101
ka-0.089
iful-0.086
ocol-0.080
ifying-0.079
Token.
Token ID13
Feature activation+8.41e-03

Pos logit contributions
and+0.003
 not+0.003
arre+0.003
Unfortunately+0.002
 sure+0.002
Neg logit contributions
iful-0.005
rus-0.005
mer-0.005
glers-0.005
works-0.005
Token He
Token ID679
Feature activation-3.71e-04

Pos logit contributions
 not+0.008
and+0.008
 and+0.007
 knowing+0.007
 sure+0.007
Neg logit contributions
iful-0.015
ka-0.015
ocol-0.015
works-0.015
ulus-0.014
Token has
Token ID468
Feature activation+1.37e-02

Pos logit contributions
iful+0.001
ka+0.001
rus+0.001
works+0.001
glers+0.001
Neg logit contributions
and-0.000
 not-0.000
 knowing-0.000
Unfortunately-0.000
 sure-0.000
Token feelings
Token ID7666
Feature activation+8.65e-03

Pos logit contributions
and+0.015
 not+0.013
 and+0.013
arre+0.013
 sure+0.012
Neg logit contributions
iful-0.023
ka-0.023
ked-0.022
rus-0.022
ocol-0.022
Token When
Token ID1649
Feature activation+6.89e-04

Pos logit contributions
 not+0.002
and+0.002
 sure+0.002
 and+0.002
 never+0.002
Neg logit contributions
ka-0.005
iful-0.005
rus-0.005
ulus-0.004
works-0.004
Token Daisy
Token ID40355
Feature activation-3.77e-03

Pos logit contributions
and+0.001
 not+0.001
 and+0.001
 sure+0.001
arre+0.001
Neg logit contributions
iful-0.001
glers-0.001
ka-0.001
rus-0.001
works-0.001
Token put
Token ID1234
Feature activation+6.69e-03

Pos logit contributions
iful+0.006
ka+0.006
rus+0.006
works+0.006
ulus+0.006
Neg logit contributions
 not-0.005
 and-0.005
and-0.005
 sure-0.004
 never-0.004
Token the
Token ID262
Feature activation-8.05e-03

Pos logit contributions
and+0.006
 not+0.005
 and+0.005
 never+0.005
 sure+0.004
Neg logit contributions
iful-0.013
ka-0.013
rus-0.013
ocol-0.013
works-0.013
Token magn
Token ID7842
Feature activation+1.23e-02

Pos logit contributions
iful+0.012
ka+0.012
ulus+0.012
rus+0.012
ked+0.012
Neg logit contributions
 not-0.011
 and-0.010
and-0.010
 think-0.009
 sure-0.009
Tokenifying
Token ID4035
Feature activation-6.70e-02

Pos logit contributions
 and+0.030
and+0.029
 not+0.029
 sure+0.027
 with+0.027
Neg logit contributions
iful-0.005
ulus-0.005
works-0.004
glers-0.004
ocol-0.004
Token glass
Token ID5405
Feature activation+2.80e-03

Pos logit contributions
iful+0.089
ka+0.087
rus+0.086
works+0.083
ulus+0.083
Neg logit contributions
 not-0.099
and-0.099
 and-0.094
 sure-0.092
 never-0.091
Token down
Token ID866
Feature activation-6.19e-03

Pos logit contributions
 not+0.003
and+0.003
 knowing+0.003
 sure+0.002
 never+0.002
Neg logit contributions
iful-0.005
glers-0.005
ka-0.005
rus-0.005
works-0.005
Token,
Token ID11
Feature activation-4.62e-03

Pos logit contributions
iful+0.012
ka+0.012
rus+0.012
works+0.012
glers+0.012
Neg logit contributions
and-0.005
 not-0.005
 sure-0.005
 and-0.004
 knowing-0.004
Token the
Token ID262
Feature activation-1.44e-03

Pos logit contributions
iful+0.008
 avail+0.008
glers+0.008
rus+0.008
ka+0.008
Neg logit contributions
and-0.005
arre-0.004
 not-0.004
 knowing-0.004
aps-0.004
Token cater
Token ID20825
Feature activation+3.19e-03

Pos logit contributions
iful+0.002
ka+0.002
rus+0.002
ulus+0.002
ked+0.002
Neg logit contributions
 not-0.002
 and-0.002
and-0.002
 never-0.002
 think-0.002
Token "
Token ID366
Feature activation-1.51e-02

Pos logit contributions
and+0.021
 not+0.018
 think+0.018
arre+0.017
 never+0.017
Neg logit contributions
rus-0.025
iful-0.025
ked-0.025
ulus-0.025
works-0.024
TokenYes
Token ID5297
Feature activation-2.68e-02

Pos logit contributions
iful+0.026
ocol+0.025
rus+0.025
ka+0.025
ulus+0.025
Neg logit contributions
and-0.017
 and-0.017
 not-0.016
 sure-0.015
 never-0.015
Token,
Token ID11
Feature activation-1.44e-02

Pos logit contributions
iful+0.059
ka+0.057
rus+0.057
works+0.056
ulus+0.056
Neg logit contributions
 not-0.018
and-0.017
 and-0.016
 sure-0.014
 never-0.014
Token look
Token ID804
Feature activation-1.17e-02

Pos logit contributions
iful+0.022
works+0.021
ka+0.021
rus+0.020
ulus+0.020
Neg logit contributions
and-0.018
arre-0.018
 not-0.017
 Sadly-0.016
 knowing-0.016
Token how
Token ID703
Feature activation-1.00e-02

Pos logit contributions
iful+0.022
rus+0.022
ka+0.021
glers+0.021
works+0.021
Neg logit contributions
 not-0.011
 and-0.011
and-0.011
 sure-0.010
 about-0.010
Token high
Token ID1029
Feature activation-6.62e-02

Pos logit contributions
iful+0.019
ka+0.018
rus+0.018
ked+0.018
works+0.018
Neg logit contributions
 not-0.010
and-0.010
 and-0.010
 sure-0.009
 never-0.009
Token it
Token ID340
Feature activation-2.90e-02

Pos logit contributions
iful+0.134
rus+0.131
ka+0.131
oured+0.130
ulus+0.130
Neg logit contributions
 not-0.056
 and-0.056
 sure-0.046
 never-0.044
 with-0.043
Token is
Token ID318
Feature activation+4.32e-03

Pos logit contributions
iful+0.057
ka+0.056
rus+0.055
ocol+0.055
ulus+0.055
Neg logit contributions
 not-0.026
 and-0.025
and-0.024
 sure-0.022
 never-0.022
Token flying
Token ID7348
Feature activation-1.26e-02

Pos logit contributions
 not+0.004
 and+0.004
and+0.004
 about+0.004
 sure+0.004
Neg logit contributions
iful-0.008
ka-0.008
rus-0.008
ocol-0.008
glers-0.008
Token!"
Token ID2474
Feature activation-9.75e-03

Pos logit contributions
iful+0.024
rus+0.023
ka+0.023
ocol+0.023
ulus+0.023
Neg logit contributions
 not-0.012
 and-0.012
and-0.012
 never-0.011
 about-0.010
Token
Token ID198
Feature activation-6.65e-03

Pos logit contributions
iful+0.012
ka+0.011
ked+0.011
ulus+0.011
oured+0.010
Neg logit contributions
 not-0.015
and-0.014
 mistake-0.014
 never-0.014
 grounded-0.014
Token said
Token ID531
Feature activation-1.74e-02

Pos logit contributions
iful+0.040
ka+0.039
ulus+0.038
ocol+0.038
rus+0.038
Neg logit contributions
 and-0.030
 not-0.030
and-0.029
 sure-0.028
 never-0.027
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
iful+0.035
ka+0.034
rus+0.034
works+0.033
ked+0.033
Neg logit contributions
and-0.014
 not-0.013
 and-0.012
 think-0.011
 sure-0.011
Token "
Token ID366
Feature activation-1.72e-02

Pos logit contributions
and+0.016
 not+0.015
 never+0.014
arre+0.014
 think+0.014
Neg logit contributions
rus-0.020
iful-0.020
ulus-0.020
ked-0.020
works-0.020
TokenLet
Token ID5756
Feature activation-9.00e-03

Pos logit contributions
iful+0.027
rus+0.026
ocol+0.026
ulus+0.025
ka+0.025
Neg logit contributions
and-0.022
 and-0.022
 not-0.021
 never-0.020
 sure-0.020
Tokenâ
Token ID22940
Feature activation-7.21e-03

Pos logit contributions
iful+0.021
ocol+0.021
rus+0.021
ka+0.021
ked+0.020
Neg logit contributions
 not-0.005
 and-0.005
 sure-0.005
and-0.004
 never-0.004
Token
Token ID26391
Feature activation-6.60e-02

Pos logit contributions
iful+0.011
ka+0.011
rus+0.011
works+0.011
ulus+0.011
Neg logit contributions
 not-0.009
and-0.009
 and-0.009
 sure-0.008
 about-0.008
Token
Token ID8151
Feature activation-1.45e-02

Pos logit contributions
ulus+0.100
ka+0.099
iful+0.099
works+0.098
rus+0.097
Neg logit contributions
 not-0.079
and-0.077
 and-0.074
 sure-0.072
 knowing-0.071
Tokens
Token ID82
Feature activation-2.22e-02

Pos logit contributions
iful+0.027
ka+0.026
rus+0.026
works+0.026
ocol+0.026
Neg logit contributions
 not-0.016
and-0.016
 and-0.015
 about-0.014
 never-0.013
Token go
Token ID467
Feature activation-1.78e-02

Pos logit contributions
iful+0.032
rus+0.030
works+0.030
ka+0.030
glers+0.030
Neg logit contributions
and-0.032
 knowing-0.028
 not-0.027
 and-0.026
 about-0.026
Token and
Token ID290
Feature activation-2.01e-02

Pos logit contributions
iful+0.034
ka+0.034
rus+0.033
glers+0.033
ulus+0.033
Neg logit contributions
and-0.015
 not-0.015
arre-0.013
 knowing-0.012
 sure-0.012
Token try
Token ID1949
Feature activation-3.03e-02

Pos logit contributions
iful+0.035
ka+0.034
rus+0.034
ulus+0.034
works+0.034
Neg logit contributions
 not-0.021
and-0.021
 and-0.020
 sure-0.019
 never-0.018
Token her
Token ID607
Feature activation-1.23e-02

Pos logit contributions
iful+0.023
ka+0.022
rus+0.022
works+0.022
ked+0.022
Neg logit contributions
and-0.017
 not-0.016
arre-0.015
 never-0.015
Unfortunately-0.015
Token,
Token ID11
Feature activation+1.65e-03

Pos logit contributions
iful+0.024
ka+0.024
rus+0.024
ulus+0.023
ked+0.023
Neg logit contributions
 not-0.011
and-0.010
 and-0.010
 never-0.009
 sure-0.009
Token "
Token ID366
Feature activation-2.30e-02

Pos logit contributions
and+0.002
 not+0.002
arre+0.002
 never+0.002
 and+0.002
Neg logit contributions
iful-0.003
ked-0.003
rus-0.003
ulus-0.003
ka-0.003
TokenLet
Token ID5756
Feature activation-1.08e-02

Pos logit contributions
iful+0.034
ka+0.033
rus+0.033
works+0.032
ulus+0.032
Neg logit contributions
 not-0.032
and-0.030
 and-0.030
 about-0.028
 sure-0.028
Tokenâ
Token ID22940
Feature activation-9.28e-03

Pos logit contributions
iful+0.025
ka+0.024
rus+0.024
works+0.023
ocol+0.023
Neg logit contributions
 not-0.007
 and-0.007
and-0.006
 sure-0.006
 never-0.005
Token
Token ID26391
Feature activation-6.59e-02

Pos logit contributions
iful+0.014
ka+0.013
ulus+0.013
works+0.013
rus+0.013
Neg logit contributions
and-0.012
 not-0.012
 and-0.012
 many-0.011
 knowing-0.011
Token
Token ID8151
Feature activation-1.52e-02

Pos logit contributions
ulus+0.100
ked+0.096
ka+0.094
glers+0.094
oured+0.094
Neg logit contributions
and-0.080
 not-0.077
 knowing-0.073
 and-0.073
 sure-0.072
Tokens
Token ID82
Feature activation-2.25e-02

Pos logit contributions
iful+0.026
ka+0.025
rus+0.025
works+0.024
ulus+0.024
Neg logit contributions
 not-0.018
and-0.018
 and-0.018
 never-0.016
 about-0.016
Token go
Token ID467
Feature activation-1.93e-02

Pos logit contributions
iful+0.031
glers+0.030
rus+0.030
ulus+0.030
ka+0.030
Neg logit contributions
and-0.033
 knowing-0.029
 not-0.028
 and-0.028
 about-0.027
Token home
Token ID1363
Feature activation-3.84e-02

Pos logit contributions
glers+0.033
oured+0.033
ka+0.033
iful+0.032
ulus+0.032
Neg logit contributions
and-0.019
 not-0.018
arre-0.017
Sadly-0.017
Unfortunately-0.017
Token now
Token ID783
Feature activation-1.72e-02

Pos logit contributions
iful+0.058
mer+0.056
glers+0.054
ka+0.054
 usual+0.054
Neg logit contributions
and-0.043
Sadly-0.042
arre-0.042
Unfortunately-0.040
 Sadly-0.040
Token stairs
Token ID16046
Feature activation+2.95e-02

Pos logit contributions
iful+0.028
ka+0.027
rus+0.026
works+0.026
ulus+0.026
Neg logit contributions
 not-0.022
and-0.022
 sure-0.021
 about-0.020
 and-0.020
Token and
Token ID290
Feature activation-1.71e-02

Pos logit contributions
 not+0.027
and+0.026
 and+0.024
 sure+0.023
 think+0.022
Neg logit contributions
iful-0.057
ka-0.055
rus-0.054
works-0.054
glers-0.053
Token z
Token ID1976
Feature activation+7.23e-03

Pos logit contributions
iful+0.026
ka+0.025
oured+0.024
works+0.024
ked+0.024
Neg logit contributions
and-0.021
 not-0.019
 story-0.019
arre-0.018
 knowing-0.018
Tokenig
Token ID328
Feature activation-1.60e-02

Pos logit contributions
 not+0.011
and+0.011
 and+0.011
 sure+0.010
 never+0.010
Neg logit contributions
iful-0.009
rus-0.009
ka-0.009
ulus-0.009
works-0.009
Tokenz
Token ID89
Feature activation+1.91e-02

Pos logit contributions
iful+0.022
ka+0.021
rus+0.020
ked+0.020
works+0.020
Neg logit contributions
 not-0.023
and-0.023
 and-0.022
 think-0.022
 sure-0.021
Tokenagged
Token ID14655
Feature activation-6.53e-02

Pos logit contributions
 and+0.027
 not+0.027
and+0.027
 about+0.025
 sure+0.025
Neg logit contributions
ulus-0.026
iful-0.026
rus-0.026
works-0.025
ked-0.025
Token around
Token ID1088
Feature activation-2.10e-03

Pos logit contributions
iful+0.110
ka+0.107
works+0.102
rus+0.100
ulus+0.100
Neg logit contributions
and-0.075
 not-0.073
 sure-0.067
 think-0.066
 and-0.065
Token the
Token ID262
Feature activation-7.45e-03

Pos logit contributions
iful+0.004
ka+0.003
works+0.003
rus+0.003
mer+0.003
Neg logit contributions
and-0.002
 not-0.002
 think-0.002
 sure-0.002
Unfortunately-0.002
Token hallway
Token ID23959
Feature activation+1.38e-02

Pos logit contributions
iful+0.013
ka+0.012
rus+0.012
ulus+0.012
works+0.012
Neg logit contributions
 not-0.008
and-0.008
 and-0.008
 sure-0.008
 about-0.007
Token!"
Token ID2474
Feature activation-5.89e-05

Pos logit contributions
 not+0.014
and+0.014
 sure+0.012
 think+0.012
 and+0.012
Neg logit contributions
iful-0.025
ka-0.024
works-0.024
rus-0.023
ulus-0.023
Token
Token ID198
Feature activation-3.14e-04

Pos logit contributions
oured+0.000
ifying+0.000
rolled+0.000
ka+0.000
ked+0.000
Neg logit contributions
 not-0.000
 never-0.000
 grounded-0.000
 sure-0.000
and-0.000
Token said
Token ID531
Feature activation-1.92e-02

Pos logit contributions
 not+0.005
 and+0.005
 sure+0.005
 never+0.005
 about+0.004
Neg logit contributions
iful-0.010
ocol-0.010
ka-0.009
works-0.009
rus-0.009
Token,
Token ID11
Feature activation+1.55e-02

Pos logit contributions
iful+0.039
ka+0.039
rus+0.038
ulus+0.038
ked+0.038
Neg logit contributions
and-0.015
 not-0.014
 and-0.013
 never-0.012
 think-0.012
Token "
Token ID366
Feature activation-1.65e-02

Pos logit contributions
and+0.017
 not+0.016
 and+0.015
 think+0.015
 never+0.015
Neg logit contributions
iful-0.026
rus-0.026
ked-0.026
ulus-0.025
ka-0.025
TokenLet
Token ID5756
Feature activation-1.05e-02

Pos logit contributions
iful+0.024
ka+0.023
rus+0.023
ulus+0.022
works+0.022
Neg logit contributions
 not-0.023
and-0.023
 and-0.022
 sure-0.021
 never-0.021
Tokenâ
Token ID22940
Feature activation-4.95e-03

Pos logit contributions
iful+0.024
ka+0.023
rus+0.023
works+0.023
ocol+0.022
Neg logit contributions
 not-0.007
 and-0.007
and-0.006
 sure-0.006
 never-0.006
Token
Token ID26391
Feature activation-6.33e-02

Pos logit contributions
iful+0.007
ka+0.007
works+0.007
ulus+0.007
rus+0.007
Neg logit contributions
 not-0.007
and-0.007
 and-0.006
 sure-0.006
 about-0.006
Token
Token ID8151
Feature activation-1.41e-02

Pos logit contributions
ulus+0.097
ked+0.095
glers+0.094
ka+0.093
works+0.093
Neg logit contributions
and-0.073
 not-0.073
 and-0.068
 knowing-0.068
 sure-0.067
Tokens
Token ID82
Feature activation-2.01e-02

Pos logit contributions
iful+0.024
rus+0.023
ka+0.023
works+0.023
ulus+0.022
Neg logit contributions
and-0.017
 not-0.017
 and-0.017
 about-0.015
 never-0.015
Token go
Token ID467
Feature activation-1.76e-02

Pos logit contributions
iful+0.028
glers+0.027
rus+0.027
ka+0.027
works+0.027
Neg logit contributions
and-0.030
 knowing-0.026
 not-0.025
 about-0.024
 and-0.024
Token fishing
Token ID12478
Feature activation-3.67e-02

Pos logit contributions
ka+0.032
iful+0.032
glers+0.032
oured+0.032
ulus+0.031
Neg logit contributions
and-0.016
 not-0.016
arre-0.014
 knowing-0.014
Unfortunately-0.013
Token!"
Token ID2474
Feature activation-1.09e-02

Pos logit contributions
mer+0.061
ka+0.057
works+0.057
rus+0.057
iful+0.057
Neg logit contributions
 not-0.039
and-0.039
 crying-0.038
 knowing-0.038
 cry-0.037
Token daughter
Token ID4957
Feature activation+4.96e-03

Pos logit contributions
iful+0.005
ka+0.005
rus+0.005
ulus+0.005
works+0.005
Neg logit contributions
 not-0.004
and-0.004
 and-0.004
 never-0.004
 sure-0.004
Token,
Token ID11
Feature activation+1.19e-02

Pos logit contributions
and+0.005
Unfortunately+0.004
arre+0.004
 not+0.004
Sadly+0.004
Neg logit contributions
iful-0.009
ked-0.008
rus-0.008
ka-0.008
ulus-0.008
Token "
Token ID366
Feature activation-1.79e-02

Pos logit contributions
and+0.013
 not+0.012
 and+0.011
 never+0.011
 sure+0.011
Neg logit contributions
iful-0.021
rus-0.020
ka-0.020
ulus-0.020
ked-0.020
TokenLet
Token ID5756
Feature activation-9.04e-03

Pos logit contributions
iful+0.028
rus+0.027
ka+0.027
ocol+0.026
ulus+0.026
Neg logit contributions
and-0.023
 not-0.023
 and-0.023
 never-0.021
 sure-0.021
Tokenâ
Token ID22940
Feature activation-5.57e-03

Pos logit contributions
iful+0.020
ka+0.019
rus+0.019
works+0.019
ked+0.019
Neg logit contributions
 not-0.006
 and-0.006
and-0.006
 sure-0.005
 never-0.005
Token
Token ID26391
Feature activation-6.33e-02

Pos logit contributions
iful+0.009
ka+0.008
ulus+0.008
works+0.008
rus+0.008
Neg logit contributions
 not-0.007
and-0.007
 and-0.007
 about-0.006
 knowing-0.006
Token
Token ID8151
Feature activation-1.36e-02

Pos logit contributions
ulus+0.099
iful+0.095
ked+0.095
ka+0.095
glers+0.094
Neg logit contributions
and-0.074
 not-0.073
 and-0.068
 sure-0.067
 knowing-0.067
Tokens
Token ID82
Feature activation-2.00e-02

Pos logit contributions
iful+0.025
ka+0.024
rus+0.024
works+0.023
ulus+0.023
Neg logit contributions
 not-0.015
and-0.015
 and-0.015
 about-0.013
 never-0.013
Token not
Token ID407
Feature activation-2.32e-02

Pos logit contributions
iful+0.030
ka+0.028
rus+0.028
works+0.028
ulus+0.028
Neg logit contributions
and-0.028
 not-0.024
 knowing-0.024
 and-0.023
 about-0.023
Token waste
Token ID7030
Feature activation-1.46e-02

Pos logit contributions
ulus+0.033
iful+0.033
glers+0.032
ka+0.031
redible+0.031
Neg logit contributions
and-0.032
 knowing-0.028
 not-0.027
 many-0.027
Unfortunately-0.027
Token our
Token ID674
Feature activation-1.67e-02

Pos logit contributions
iful+0.027
rus+0.027
ka+0.026
ulus+0.026
ked+0.026
Neg logit contributions
 not-0.015
 and-0.015
and-0.014
 about-0.013
 many-0.013
Token stairs
Token ID16046
Feature activation+3.12e-02

Pos logit contributions
rus+0.019
ka+0.019
iful+0.019
glers+0.019
ulus+0.019
Neg logit contributions
 not-0.019
 sure-0.018
and-0.018
 about-0.018
 think-0.017
Token and
Token ID290
Feature activation-5.74e-03

Pos logit contributions
 not+0.024
and+0.023
 and+0.022
 sure+0.020
 never+0.020
Neg logit contributions
iful-0.064
ka-0.063
rus-0.062
ulus-0.061
works-0.061
Token z
Token ID1976
Feature activation+7.70e-03

Pos logit contributions
ka+0.007
ked+0.007
iful+0.007
 usual+0.007
oured+0.007
Neg logit contributions
and-0.008
 story-0.008
aring-0.008
 not-0.007
arre-0.007
Tokenig
Token ID328
Feature activation-1.23e-02

Pos logit contributions
 not+0.011
 and+0.011
and+0.011
 sure+0.010
 about+0.010
Neg logit contributions
iful-0.011
works-0.010
rus-0.010
ulus-0.010
glers-0.010
Tokenz
Token ID89
Feature activation+2.39e-02

Pos logit contributions
iful+0.016
ka+0.015
ked+0.015
rus+0.015
glers+0.014
Neg logit contributions
and-0.019
 not-0.019
 and-0.018
 think-0.018
 sure-0.017
Tokenagged
Token ID14655
Feature activation-6.32e-02

Pos logit contributions
 and+0.034
 not+0.034
and+0.033
 about+0.032
 sure+0.031
Neg logit contributions
ulus-0.034
iful-0.033
rus-0.033
works-0.033
glers-0.032
Token her
Token ID607
Feature activation+3.92e-03

Pos logit contributions
ka+0.113
iful+0.112
ked+0.106
ulus+0.104
glers+0.103
Neg logit contributions
and-0.065
 not-0.060
 think-0.059
 sure-0.057
arre-0.056
Token way
Token ID835
Feature activation-6.68e-03

Pos logit contributions
 and+0.004
 not+0.004
and+0.004
 knowing+0.004
 with+0.004
Neg logit contributions
glers-0.007
iful-0.007
rus-0.007
works-0.007
ocol-0.007
Token through
Token ID832
Feature activation-3.80e-03

Pos logit contributions
iful+0.011
ka+0.011
rus+0.010
ked+0.010
ulus+0.010
Neg logit contributions
and-0.007
 not-0.007
 think-0.007
arre-0.007
 sure-0.007
Token the
Token ID262
Feature activation+3.07e-03

Pos logit contributions
iful+0.006
ka+0.006
rus+0.006
ulus+0.006
works+0.006
Neg logit contributions
 not-0.004
and-0.004
 and-0.004
 sure-0.004
 never-0.004
Token hallway
Token ID23959
Feature activation+1.67e-02

Pos logit contributions
 not+0.004
and+0.004
 sure+0.004
 and+0.004
 never+0.004
Neg logit contributions
iful-0.005
ka-0.005
rus-0.005
ulus-0.005
glers-0.004
Token Nicole
Token ID25176
Feature activation+1.08e-02

Pos logit contributions
 not+0.005
 and+0.005
 sure+0.004
 never+0.004
 think+0.004
Neg logit contributions
iful-0.011
rus-0.011
ka-0.011
mer-0.011
works-0.011
Token,
Token ID11
Feature activation+2.07e-02

Pos logit contributions
and+0.013
Unfortunately+0.012
arre+0.011
 Sadly+0.011
Sadly+0.011
Neg logit contributions
iful-0.017
ka-0.017
rus-0.016
ked-0.016
works-0.016
Token "
Token ID366
Feature activation-1.77e-02

Pos logit contributions
 not+0.021
and+0.020
 and+0.019
 sure+0.018
 never+0.018
Neg logit contributions
iful-0.038
ka-0.037
rus-0.037
ulus-0.036
works-0.036
TokenLet
Token ID5756
Feature activation-8.94e-03

Pos logit contributions
iful+0.025
ka+0.025
rus+0.024
works+0.023
ulus+0.023
Neg logit contributions
 not-0.026
and-0.024
 and-0.024
 about-0.023
 sure-0.023
Tokenâ
Token ID22940
Feature activation-6.34e-03

Pos logit contributions
iful+0.019
ka+0.019
rus+0.019
works+0.019
ulus+0.019
Neg logit contributions
 not-0.006
and-0.006
 and-0.006
 sure-0.005
 never-0.005
Token
Token ID26391
Feature activation-6.26e-02

Pos logit contributions
iful+0.010
ka+0.009
works+0.009
ulus+0.009
rus+0.009
Neg logit contributions
 not-0.008
and-0.008
 and-0.008
 about-0.008
 many-0.008
Token
Token ID8151
Feature activation-1.37e-02

Pos logit contributions
ulus+0.097
iful+0.095
ka+0.094
ked+0.094
glers+0.094
Neg logit contributions
and-0.073
 not-0.072
 and-0.068
 about-0.067
 knowing-0.066
Tokens
Token ID82
Feature activation-2.07e-02

Pos logit contributions
iful+0.023
ka+0.022
rus+0.022
works+0.022
ocol+0.021
Neg logit contributions
 not-0.017
and-0.017
 and-0.017
 about-0.015
 never-0.015
Token go
Token ID467
Feature activation-1.83e-02

Pos logit contributions
iful+0.030
rus+0.029
ka+0.029
glers+0.028
ulus+0.028
Neg logit contributions
and-0.029
 not-0.026
 knowing-0.026
 and-0.025
 about-0.025
Token on
Token ID319
Feature activation-1.09e-02

Pos logit contributions
ka+0.034
iful+0.034
oured+0.034
glers+0.033
ulus+0.033
Neg logit contributions
and-0.016
 not-0.015
arre-0.013
Unfortunately-0.013
 knowing-0.013
Token an
Token ID281
Feature activation-1.85e-02

Pos logit contributions
iful+0.022
rus+0.021
ka+0.021
ked+0.021
works+0.021
Neg logit contributions
 not-0.010
 and-0.009
and-0.008
 never-0.008
 sure-0.008
Token friends
Token ID2460
Feature activation+1.20e-02

Pos logit contributions
iful+0.002
works+0.002
rus+0.002
ka+0.002
ulus+0.002
Neg logit contributions
 and-0.001
 not-0.001
 many-0.001
and-0.001
 sure-0.001
Token decided
Token ID3066
Feature activation+6.78e-03

Pos logit contributions
 not+0.010
and+0.009
 and+0.008
 sure+0.008
 knowing+0.008
Neg logit contributions
iful-0.024
ka-0.024
rus-0.023
ked-0.023
ulus-0.023
Token to
Token ID284
Feature activation-6.71e-04

Pos logit contributions
and+0.007
aps+0.006
arre+0.006
irk+0.006
Unfortunately+0.006
Neg logit contributions
ked-0.011
rus-0.011
 usual-0.011
ulus-0.011
iful-0.011
Token measure
Token ID3953
Feature activation-8.45e-03

Pos logit contributions
iful+0.001
rus+0.001
ka+0.001
ked+0.001
ocol+0.001
Neg logit contributions
 not-0.001
 and-0.001
 think-0.001
 never-0.001
and-0.001
Token how
Token ID703
Feature activation-6.44e-03

Pos logit contributions
iful+0.016
ka+0.015
rus+0.015
ulus+0.015
ked+0.015
Neg logit contributions
 not-0.008
and-0.008
 and-0.008
 sure-0.007
 never-0.007
Token high
Token ID1029
Feature activation-6.23e-02

Pos logit contributions
iful+0.014
ka+0.014
rus+0.014
ulus+0.014
works+0.014
Neg logit contributions
 not-0.004
and-0.004
 and-0.003
 sure-0.003
 never-0.003
Token they
Token ID484
Feature activation-6.98e-03

Pos logit contributions
iful+0.133
ocol+0.133
oured+0.132
ka+0.129
ulus+0.129
Neg logit contributions
 and-0.053
 not-0.050
 with-0.040
 never-0.038
 many-0.036
Token could
Token ID714
Feature activation-1.11e-02

Pos logit contributions
ka+0.012
ocol+0.012
ulus+0.012
oured+0.012
iful+0.011
Neg logit contributions
 and-0.009
 not-0.009
 never-0.008
 think-0.008
 sure-0.008
Token jump
Token ID4391
Feature activation-1.67e-02

Pos logit contributions
iful+0.017
rus+0.017
ka+0.017
works+0.016
ulus+0.016
Neg logit contributions
 not-0.014
and-0.014
 and-0.013
 sure-0.012
 never-0.012
Token.
Token ID13
Feature activation-8.03e-03

Pos logit contributions
iful+0.035
ka+0.034
rus+0.034
ked+0.034
ocol+0.034
Neg logit contributions
 and-0.013
 not-0.013
and-0.011
 about-0.010
 never-0.010
Token One
Token ID1881
Feature activation-5.42e-04

Pos logit contributions
iful+0.019
ka+0.018
rus+0.018
ocol+0.018
ked+0.018
Neg logit contributions
 not-0.005
 and-0.005
and-0.004
 about-0.004
 think-0.004
Token suggested
Token ID5220
Feature activation+2.17e-02

Pos logit contributions
and+0.018
 not+0.017
 knowing+0.016
 many+0.016
 never+0.015
Neg logit contributions
rus-0.018
ka-0.018
iful-0.018
ked-0.018
oured-0.017
Token,
Token ID11
Feature activation+1.77e-02

Pos logit contributions
and+0.019
 not+0.017
 and+0.016
 think+0.016
 never+0.016
Neg logit contributions
iful-0.042
rus-0.041
ked-0.041
ka-0.041
ulus-0.041
Token "
Token ID366
Feature activation-2.29e-02

Pos logit contributions
and+0.019
 not+0.018
 and+0.017
 never+0.016
 think+0.016
Neg logit contributions
iful-0.031
ka-0.030
rus-0.030
ked-0.030
ulus-0.030
TokenLet
Token ID5756
Feature activation-1.18e-02

Pos logit contributions
works+0.027
ulus+0.026
iful+0.026
rus+0.026
ka+0.025
Neg logit contributions
 not-0.036
 about-0.036
 cry-0.035
 and-0.034
 heart-0.034
Tokenâ
Token ID22940
Feature activation-6.43e-03

Pos logit contributions
iful+0.025
ka+0.025
rus+0.025
ulus+0.024
works+0.024
Neg logit contributions
 not-0.009
and-0.008
 and-0.008
 sure-0.007
 never-0.007
Token
Token ID26391
Feature activation-6.21e-02

Pos logit contributions
iful+0.010
ka+0.009
works+0.009
ulus+0.009
rus+0.009
Neg logit contributions
 not-0.009
and-0.008
 and-0.008
 about-0.008
 many-0.008
Token
Token ID8151
Feature activation-1.73e-02

Pos logit contributions
ulus+0.092
ked+0.089
ka+0.088
iful+0.088
glers+0.087
Neg logit contributions
and-0.076
 not-0.075
 and-0.070
 about-0.070
 knowing-0.069
Tokens
Token ID82
Feature activation-1.96e-02

Pos logit contributions
iful+0.028
ka+0.026
rus+0.026
works+0.026
ocol+0.026
Neg logit contributions
 not-0.023
and-0.023
 and-0.023
 about-0.020
 never-0.020
Token go
Token ID467
Feature activation-1.97e-02

Pos logit contributions
iful+0.029
rus+0.028
ka+0.028
works+0.027
glers+0.027
Neg logit contributions
and-0.027
 not-0.025
 knowing-0.024
 and-0.023
 about-0.023
Token to
Token ID284
Feature activation-2.44e-02

Pos logit contributions
ka+0.037
iful+0.037
glers+0.036
oured+0.036
ulus+0.035
Neg logit contributions
and-0.017
 not-0.017
arre-0.015
Sadly-0.015
 Sadly-0.015
Token the
Token ID262
Feature activation-2.20e-02

Pos logit contributions
iful+0.042
ka+0.041
rus+0.041
works+0.040
ulus+0.040
Neg logit contributions
 not-0.028
and-0.026
 and-0.026
 sure-0.024
 never-0.024
Token them
Token ID606
Feature activation-8.01e-03

Pos logit contributions
 not+0.001
 and+0.001
and+0.001
 many+0.001
 about+0.001
Neg logit contributions
iful-0.001
rus-0.001
ka-0.001
ulus-0.001
ked-0.001
Token up
Token ID510
Feature activation-1.07e-02

Pos logit contributions
iful+0.017
ka+0.016
rus+0.016
works+0.016
ulus+0.016
Neg logit contributions
 not-0.006
and-0.006
 and-0.005
 sure-0.005
 never-0.005
Token and
Token ID290
Feature activation-1.54e-02

Pos logit contributions
iful+0.024
ka+0.024
rus+0.024
works+0.023
ulus+0.023
Neg logit contributions
 not-0.006
and-0.006
 and-0.005
 sure-0.005
 think-0.005
Token see
Token ID766
Feature activation-7.20e-03

Pos logit contributions
iful+0.023
glers+0.022
ka+0.022
rus+0.022
works+0.022
Neg logit contributions
and-0.021
 not-0.019
arre-0.018
 sure-0.018
 and-0.018
Token how
Token ID703
Feature activation-5.79e-03

Pos logit contributions
iful+0.014
ka+0.014
rus+0.014
ked+0.013
ulus+0.013
Neg logit contributions
 not-0.007
 and-0.006
and-0.006
 sure-0.006
 about-0.005
Token high
Token ID1029
Feature activation-6.17e-02

Pos logit contributions
iful+0.012
rus+0.011
ka+0.011
ked+0.011
oured+0.011
Neg logit contributions
 not-0.006
and-0.005
 and-0.005
 many-0.005
 sure-0.005
Token they
Token ID484
Feature activation-5.06e-03

Pos logit contributions
iful+0.131
ocol+0.130
oured+0.130
ka+0.127
rus+0.127
Neg logit contributions
 and-0.054
 not-0.051
 with-0.041
 never-0.038
 many-0.038
Token could
Token ID714
Feature activation-7.00e-03

Pos logit contributions
ocol+0.009
ulus+0.009
ka+0.009
iful+0.009
rus+0.009
Neg logit contributions
 and-0.006
 not-0.006
 never-0.006
 think-0.005
 many-0.005
Token go
Token ID467
Feature activation-6.22e-03

Pos logit contributions
iful+0.012
ka+0.012
rus+0.012
ked+0.012
ulus+0.012
Neg logit contributions
 not-0.008
 and-0.008
and-0.008
 never-0.007
 think-0.007
Token.
Token ID13
Feature activation-2.56e-03

Pos logit contributions
ocol+0.014
oured+0.013
ked+0.013
iful+0.013
rus+0.013
Neg logit contributions
 and-0.005
 not-0.005
 with-0.004
and-0.004
 many-0.004
Token
Token ID198
Feature activation+4.41e-03

Pos logit contributions
iful+0.006
ka+0.006
ulus+0.006
works+0.005
rus+0.005
Neg logit contributions
 not-0.001
and-0.001
 and-0.001
 never-0.001
 sure-0.001
Token,
Token ID11
Feature activation+1.22e-03

Pos logit contributions
 not+0.003
and+0.003
 and+0.002
 sure+0.002
 never+0.002
Neg logit contributions
iful-0.004
ka-0.004
rus-0.004
works-0.004
ked-0.004
Token no
Token ID645
Feature activation-8.93e-03

Pos logit contributions
and+0.001
aps+0.001
arre+0.001
Unfortunately+0.001
ulk+0.001
Neg logit contributions
 rib-0.002
 usual-0.002
ked-0.002
iful-0.002
 lit-0.002
Token matter
Token ID2300
Feature activation-9.68e-03

Pos logit contributions
iful+0.006
rus+0.005
ka+0.005
ulus+0.005
works+0.005
Neg logit contributions
 not-0.020
 and-0.019
and-0.019
 sure-0.019
 never-0.019
Token how
Token ID703
Feature activation-3.86e-04

Pos logit contributions
iful+0.019
ka+0.018
rus+0.018
works+0.018
ulus+0.018
Neg logit contributions
 not-0.008
and-0.008
 and-0.008
 sure-0.007
 think-0.007
Token "
Token ID366
Feature activation-2.99e-02

Pos logit contributions
iful+0.001
ka+0.001
rus+0.001
ulus+0.001
works+0.001
Neg logit contributions
 not-0.000
and-0.000
 and-0.000
 sure-0.000
 never-0.000
Tokensimple
Token ID36439
Feature activation-6.16e-02

Pos logit contributions
rus+0.042
ocol+0.041
ulus+0.041
 avail+0.041
iful+0.040
Neg logit contributions
and-0.043
 and-0.042
 not-0.040
 never-0.039
 sure-0.039
Token"
Token ID1
Feature activation-2.41e-02

Pos logit contributions
iful+0.103
ka+0.099
rus+0.098
works+0.098
ulus+0.096
Neg logit contributions
and-0.069
 not-0.069
 and-0.065
 think-0.061
 never-0.061
Token it
Token ID340
Feature activation-1.33e-02

Pos logit contributions
iful+0.041
rus+0.040
ocol+0.040
ka+0.040
ulus+0.039
Neg logit contributions
 not-0.028
 and-0.028
and-0.027
 sure-0.025
 never-0.025
Token may
Token ID743
Feature activation+2.23e-03

Pos logit contributions
iful+0.022
works+0.021
rus+0.021
ka+0.021
glers+0.021
Neg logit contributions
and-0.016
 not-0.014
 and-0.013
 knowing-0.012
Unfortunately-0.012
Token seem
Token ID1283
Feature activation+1.25e-02

Pos logit contributions
and+0.003
arre+0.002
Unfortunately+0.002
 knowing+0.002
 Sadly+0.002
Neg logit contributions
iful-0.003
works-0.003
glers-0.003
rus-0.003
 avail-0.003
Token.
Token ID13
Feature activation-6.48e-03

Pos logit contributions
 not+0.011
and+0.011
 and+0.010
 sure+0.009
 never+0.009
Neg logit contributions
iful-0.024
ka-0.024
rus-0.024
ulus-0.023
works-0.023
Token mouse
Token ID10211
Feature activation+5.46e-03

Pos logit contributions
 not+0.003
 and+0.003
 story+0.002
 crying+0.002
 heart+0.002
Neg logit contributions
ocol-0.004
iful-0.004
ka-0.004
oured-0.004
works-0.004
Token,
Token ID11
Feature activation+1.28e-02

Pos logit contributions
and+0.005
 not+0.005
Unfortunately+0.004
arre+0.004
 never+0.004
Neg logit contributions
iful-0.010
ked-0.010
rus-0.010
ka-0.010
ulus-0.010
Token "
Token ID366
Feature activation-1.60e-02

Pos logit contributions
and+0.016
 not+0.014
arre+0.013
 never+0.013
 think+0.013
Neg logit contributions
iful-0.020
ked-0.020
rus-0.020
ulus-0.019
ka-0.019
TokenLet
Token ID5756
Feature activation-7.65e-03

Pos logit contributions
iful+0.023
rus+0.023
ka+0.022
ulus+0.022
glers+0.022
Neg logit contributions
and-0.022
 not-0.022
 and-0.022
 never-0.021
 sure-0.020
Tokenâ
Token ID22940
Feature activation-4.34e-03

Pos logit contributions
iful+0.016
ka+0.015
rus+0.015
works+0.015
ocol+0.015
Neg logit contributions
 not-0.007
 and-0.007
 sure-0.006
and-0.006
 never-0.006
Token
Token ID26391
Feature activation-6.16e-02

Pos logit contributions
iful+0.007
ka+0.007
rus+0.007
ulus+0.007
works+0.007
Neg logit contributions
 not-0.005
and-0.005
 and-0.005
 sure-0.005
 about-0.005
Token
Token ID8151
Feature activation-1.40e-02

Pos logit contributions
ulus+0.091
ked+0.089
rus+0.087
works+0.087
iful+0.087
Neg logit contributions
and-0.076
 not-0.074
 and-0.070
 about-0.069
 knowing-0.068
Tokens
Token ID82
Feature activation-1.70e-02

Pos logit contributions
iful+0.024
ka+0.023
rus+0.023
works+0.023
ulus+0.023
Neg logit contributions
 not-0.017
and-0.017
 and-0.016
 never-0.015
 about-0.014
Token go
Token ID467
Feature activation-1.52e-02

Pos logit contributions
iful+0.025
rus+0.024
ka+0.023
works+0.023
ulus+0.023
Neg logit contributions
and-0.025
 knowing-0.021
 not-0.021
Unfortunately-0.020
 about-0.020
Token outside
Token ID2354
Feature activation-3.14e-02

Pos logit contributions
iful+0.027
ka+0.027
glers+0.026
oured+0.026
ulus+0.026
Neg logit contributions
and-0.014
 not-0.014
arre-0.013
 sure-0.012
 about-0.012
Token and
Token ID290
Feature activation-1.62e-02

Pos logit contributions
iful+0.060
mer+0.058
ka+0.056
oured+0.055
ulus+0.055
Neg logit contributions
and-0.025
 not-0.024
Sadly-0.023
Unfortunately-0.023
arre-0.021
Tokenmy
Token ID1820
Feature activation+1.09e-02

Pos logit contributions
 and+0.002
 not+0.002
and+0.002
 sure+0.001
 about+0.001
Neg logit contributions
ocol-0.004
rus-0.004
ulus-0.004
iful-0.003
works-0.003
Token said
Token ID531
Feature activation-1.10e-02

Pos logit contributions
 not+0.014
 and+0.014
and+0.013
 sure+0.013
 never+0.013
Neg logit contributions
iful-0.017
ka-0.017
rus-0.017
ulus-0.016
ocol-0.016
Token "
Token ID366
Feature activation-1.40e-02

Pos logit contributions
iful+0.020
ka+0.020
ked+0.019
rus+0.019
works+0.019
Neg logit contributions
and-0.011
 not-0.010
 think-0.009
 and-0.009
arre-0.009
TokenLet
Token ID5756
Feature activation-3.84e-03

Pos logit contributions
rus+0.023
iful+0.022
ulus+0.022
ocol+0.022
glers+0.022
Neg logit contributions
and-0.018
 and-0.017
 not-0.016
 sure-0.015
 never-0.015
Tokenâ
Token ID22940
Feature activation-1.86e-03

Pos logit contributions
iful+0.008
ka+0.008
rus+0.008
ulus+0.008
works+0.008
Neg logit contributions
 not-0.003
and-0.003
 and-0.003
 sure-0.002
 never-0.002
Token
Token ID26391
Feature activation-6.13e-02

Pos logit contributions
iful+0.003
ka+0.003
works+0.003
ulus+0.003
rus+0.003
Neg logit contributions
 not-0.002
and-0.002
 and-0.002
 sure-0.002
 many-0.002
Token
Token ID8151
Feature activation-9.66e-03

Pos logit contributions
ulus+0.092
ka+0.089
iful+0.089
works+0.088
oured+0.088
Neg logit contributions
and-0.072
 not-0.072
 and-0.066
 knowing-0.066
 think-0.065
Tokens
Token ID82
Feature activation-1.74e-02

Pos logit contributions
iful+0.016
rus+0.015
ka+0.015
works+0.015
ocol+0.015
Neg logit contributions
 not-0.012
and-0.012
 and-0.012
 never-0.011
 about-0.011
Token go
Token ID467
Feature activation-1.25e-02

Pos logit contributions
iful+0.025
ka+0.023
works+0.023
rus+0.023
ulus+0.022
Neg logit contributions
and-0.026
 knowing-0.022
 not-0.022
 crying-0.021
 about-0.021
Token!"
Token ID2474
Feature activation-1.12e-02

Pos logit contributions
iful+0.020
ka+0.020
oured+0.019
ulus+0.019
glers+0.019
Neg logit contributions
and-0.014
 not-0.014
arre-0.013
 knowing-0.012
Unfortunately-0.012
Token As
Token ID1081
Feature activation-7.39e-03

Pos logit contributions
oured+0.020
ked+0.020
ka+0.020
iful+0.019
ulus+0.019
Neg logit contributions
 not-0.011
and-0.010
 never-0.010
 think-0.010
 story-0.009
Token big
Token ID1263
Feature activation+5.17e-03

Pos logit contributions
iful+0.014
ka+0.013
glers+0.013
works+0.013
rus+0.012
Neg logit contributions
and-0.012
 Sadly-0.012
 not-0.012
Unfortunately-0.012
 Unfortunately-0.012
Token towers
Token ID18028
Feature activation+9.80e-03

Pos logit contributions
 not+0.009
and+0.009
 never+0.008
Unfortunately+0.008
 sure+0.008
Neg logit contributions
iful-0.006
ka-0.006
ocol-0.005
glers-0.005
mer-0.005
Token and
Token ID290
Feature activation-1.41e-02

Pos logit contributions
Sadly+0.007
 Sadly+0.007
Unfortunately+0.006
 think+0.006
 cry+0.006
Neg logit contributions
glers-0.020
iful-0.020
ka-0.019
rus-0.018
 coloured-0.018
Token see
Token ID766
Feature activation-4.39e-03

Pos logit contributions
iful+0.023
ka+0.022
glers+0.022
rus+0.022
works+0.022
Neg logit contributions
and-0.016
 not-0.016
 and-0.015
 sure-0.015
 think-0.014
Token how
Token ID703
Feature activation-1.29e-02

Pos logit contributions
iful+0.008
ka+0.007
rus+0.007
glers+0.007
ulus+0.007
Neg logit contributions
 not-0.005
and-0.004
 and-0.004
 never-0.004
 think-0.004
Token high
Token ID1029
Feature activation-6.12e-02

Pos logit contributions
iful+0.027
ka+0.027
rus+0.026
works+0.026
ulus+0.026
Neg logit contributions
 not-0.010
and-0.009
 and-0.009
 sure-0.008
 never-0.008
Token he
Token ID339
Feature activation-8.88e-03

Pos logit contributions
iful+0.124
ka+0.122
oured+0.121
ocol+0.121
rus+0.121
Neg logit contributions
 not-0.053
 and-0.053
 never-0.043
and-0.042
 with-0.042
Token could
Token ID714
Feature activation-6.64e-03

Pos logit contributions
ocol+0.018
ifying+0.017
ulus+0.017
rus+0.017
oured+0.016
Neg logit contributions
 and-0.010
 not-0.009
 never-0.009
 sure-0.009
 with-0.008
Token make
Token ID787
Feature activation-1.06e-03

Pos logit contributions
iful+0.011
ka+0.011
rus+0.011
ulus+0.011
ked+0.011
Neg logit contributions
 not-0.008
 and-0.007
and-0.007
 never-0.007
 think-0.007
Token them
Token ID606
Feature activation-1.33e-02

Pos logit contributions
iful+0.002
ka+0.002
rus+0.002
glers+0.002
works+0.002
Neg logit contributions
 not-0.001
and-0.001
 never-0.001
 and-0.001
 think-0.001
Token.
Token ID13
Feature activation+4.14e-03

Pos logit contributions
iful+0.023
glers+0.021
ka+0.021
rus+0.021
ulus+0.021
Neg logit contributions
and-0.014
 not-0.014
 never-0.013
Unfortunately-0.013
 Sadly-0.013
Token the
Token ID262
Feature activation+1.74e-02

Pos logit contributions
and+0.006
 not+0.006
arre+0.006
 think+0.006
 knowing+0.006
Neg logit contributions
ka-0.009
ked-0.008
iful-0.008
ocol-0.008
 rib-0.008
Token necklace
Token ID38987
Feature activation+1.92e-02

Pos logit contributions
 not+0.026
 and+0.025
and+0.024
 never+0.023
 think+0.023
Neg logit contributions
iful-0.025
rus-0.023
ka-0.023
ulus-0.023
works-0.023
Token and
Token ID290
Feature activation+1.36e-03

Pos logit contributions
arre+0.015
 think+0.012
 mindful+0.012
Unfortunately+0.012
ulk+0.012
Neg logit contributions
ked-0.039
glers-0.038
rus-0.038
als-0.038
iful-0.037
Token the
Token ID262
Feature activation+1.06e-03

Pos logit contributions
 not+0.002
and+0.002
 and+0.002
 sure+0.002
 never+0.002
Neg logit contributions
iful-0.002
ka-0.002
rus-0.002
ked-0.002
works-0.002
Token magn
Token ID7842
Feature activation+7.44e-03

Pos logit contributions
 not+0.002
 and+0.002
and+0.002
 never+0.001
 sure+0.001
Neg logit contributions
iful-0.001
ka-0.001
rus-0.001
ulus-0.001
works-0.001
Tokenifying
Token ID4035
Feature activation-6.11e-02

Pos logit contributions
 and+0.016
and+0.016
 not+0.015
 never+0.015
 with+0.015
Neg logit contributions
ulus-0.005
 avail-0.005
works-0.005
glers-0.004
iful-0.004
Token glass
Token ID5405
Feature activation+1.41e-03

Pos logit contributions
iful+0.087
rus+0.083
ka+0.082
works+0.081
ked+0.081
Neg logit contributions
 not-0.090
 and-0.087
and-0.084
 never-0.081
 many-0.080
Token and
Token ID290
Feature activation-6.98e-03

Pos logit contributions
 not+0.002
and+0.001
 sure+0.001
 think+0.001
 and+0.001
Neg logit contributions
iful-0.002
ka-0.002
rus-0.002
glers-0.002
ked-0.002
Token started
Token ID2067
Feature activation+1.52e-02

Pos logit contributions
iful+0.010
ka+0.010
rus+0.010
ked+0.010
works+0.010
Neg logit contributions
 not-0.009
and-0.009
 and-0.009
 sure-0.008
 never-0.008
Token to
Token ID284
Feature activation+1.03e-02

Pos logit contributions
and+0.012
arre+0.011
 not+0.011
Unfortunately+0.011
 think+0.011
Neg logit contributions
ked-0.030
ka-0.029
ulus-0.028
iful-0.028
ocol-0.028
Token examine
Token ID10716
Feature activation+9.78e-03

Pos logit contributions
and+0.014
 not+0.014
 and+0.013
 sure+0.013
 about+0.012
Neg logit contributions
iful-0.015
ka-0.015
rus-0.015
ulus-0.014
glers-0.014
Token said
Token ID531
Feature activation-1.37e-02

Pos logit contributions
ocol+0.020
iful+0.020
 avail+0.019
ulus+0.019
works+0.019
Neg logit contributions
 and-0.013
 not-0.013
 never-0.012
 sure-0.011
 think-0.011
Token,
Token ID11
Feature activation+2.42e-02

Pos logit contributions
rus+0.027
ka+0.026
iful+0.026
ked+0.026
ulus+0.025
Neg logit contributions
and-0.012
 not-0.010
 think-0.010
 and-0.010
 sure-0.009
Token "
Token ID366
Feature activation-1.12e-02

Pos logit contributions
and+0.029
 not+0.027
arre+0.025
 think+0.025
 never+0.025
Neg logit contributions
rus-0.038
iful-0.038
ked-0.038
ulus-0.037
works-0.036
TokenLet
Token ID5756
Feature activation-1.02e-02

Pos logit contributions
iful+0.016
rus+0.015
ka+0.015
ulus+0.015
works+0.015
Neg logit contributions
 not-0.016
and-0.016
 and-0.016
 never-0.015
 sure-0.015
Tokenâ
Token ID22940
Feature activation-5.64e-03

Pos logit contributions
iful+0.022
ka+0.022
ocol+0.022
rus+0.022
ked+0.021
Neg logit contributions
 not-0.008
 and-0.007
 sure-0.007
and-0.006
 never-0.006
Token
Token ID26391
Feature activation-6.08e-02

Pos logit contributions
iful+0.009
ka+0.008
works+0.008
rus+0.008
ulus+0.008
Neg logit contributions
 not-0.007
and-0.007
 and-0.007
 about-0.007
 many-0.007
Token
Token ID8151
Feature activation-1.34e-02

Pos logit contributions
ulus+0.087
ked+0.082
ifying+0.082
glers+0.082
rus+0.081
Neg logit contributions
and-0.078
 not-0.072
 knowing-0.070
 and-0.069
 about-0.069
Tokens
Token ID82
Feature activation-2.01e-02

Pos logit contributions
iful+0.023
ka+0.022
works+0.022
rus+0.022
ocol+0.022
Neg logit contributions
 not-0.016
 and-0.016
and-0.016
 about-0.014
 never-0.014
Token go
Token ID467
Feature activation-1.87e-02

Pos logit contributions
iful+0.028
rus+0.027
ka+0.026
works+0.026
ulus+0.025
Neg logit contributions
and-0.031
 knowing-0.027
 about-0.025
 not-0.025
Unfortunately-0.025
Token on
Token ID319
Feature activation-1.05e-02

Pos logit contributions
iful+0.034
ka+0.034
rus+0.033
glers+0.033
ulus+0.033
Neg logit contributions
and-0.017
 not-0.017
arre-0.015
 knowing-0.015
 sure-0.014
Token an
Token ID281
Feature activation-1.71e-02

Pos logit contributions
iful+0.020
ka+0.019
rus+0.019
ulus+0.019
works+0.019
Neg logit contributions
 not-0.010
and-0.010
 and-0.009
 sure-0.009
 never-0.009
Token I
Token ID314
Feature activation+5.04e-03

Pos logit contributions
 not+0.005
and+0.005
 and+0.005
 never+0.005
 sure+0.005
Neg logit contributions
ulus-0.007
iful-0.007
ka-0.007
ocol-0.007
ifying-0.007
Token need
Token ID761
Feature activation-5.06e-03

Pos logit contributions
and+0.007
Unfortunately+0.006
 knowing+0.006
 many+0.005
 not+0.005
Neg logit contributions
mer-0.007
iful-0.007
ka-0.007
ocol-0.007
oured-0.007
Token to
Token ID284
Feature activation+4.95e-03

Pos logit contributions
iful+0.011
rus+0.011
ka+0.011
glers+0.010
ulus+0.010
Neg logit contributions
 not-0.004
 and-0.004
and-0.003
 never-0.003
 sure-0.003
Token see
Token ID766
Feature activation-6.14e-04

Pos logit contributions
and+0.007
arre+0.006
 knowing+0.006
 about+0.006
 and+0.006
Neg logit contributions
iful-0.007
 avail-0.006
ocol-0.006
rus-0.006
ka-0.006
Token how
Token ID703
Feature activation-5.30e-03

Pos logit contributions
iful+0.001
rus+0.001
glers+0.001
ka+0.001
works+0.001
Neg logit contributions
 not-0.001
 and-0.001
and-0.000
 sure-0.000
 many-0.000
Token high
Token ID1029
Feature activation-6.03e-02

Pos logit contributions
iful+0.011
rus+0.011
ka+0.011
works+0.011
ked+0.011
Neg logit contributions
 not-0.004
and-0.004
 and-0.004
 many-0.003
 never-0.003
Token you
Token ID345
Feature activation-7.82e-03

Pos logit contributions
iful+0.120
rus+0.117
ka+0.117
ulus+0.115
works+0.114
Neg logit contributions
 not-0.053
 and-0.051
and-0.047
 never-0.044
 sure-0.043
Token can
Token ID460
Feature activation+5.69e-05

Pos logit contributions
iful+0.015
rus+0.014
ka+0.014
ulus+0.014
glers+0.014
Neg logit contributions
 not-0.008
 and-0.007
and-0.007
 think-0.007
 sure-0.007
Token raise
Token ID5298
Feature activation-1.21e-02

Pos logit contributions
 not+0.000
and+0.000
 and+0.000
 sure+0.000
 never+0.000
Neg logit contributions
iful-0.000
ka-0.000
rus-0.000
ulus-0.000
works-0.000
Token your
Token ID534
Feature activation+1.14e-03

Pos logit contributions
iful+0.023
ka+0.023
rus+0.022
works+0.022
ulus+0.022
Neg logit contributions
 not-0.011
and-0.011
 and-0.010
 sure-0.010
 never-0.009
Token arms
Token ID5101
Feature activation-1.05e-02

Pos logit contributions
 not+0.001
and+0.001
 and+0.001
 never+0.001
 sure+0.001
Neg logit contributions
iful-0.002
ka-0.002
rus-0.002
ked-0.002
ulus-0.002
Token the
Token ID262
Feature activation+1.65e-02

Pos logit contributions
 not+0.007
and+0.007
 and+0.007
 sure+0.006
 never+0.006
Neg logit contributions
iful-0.013
ka-0.012
rus-0.012
ulus-0.012
works-0.012
Token necklace
Token ID38987
Feature activation+1.38e-02

Pos logit contributions
and+0.029
 not+0.028
 and+0.027
 sure+0.027
 about+0.027
Neg logit contributions
iful-0.018
ka-0.017
rus-0.017
works-0.016
ked-0.016
Token and
Token ID290
Feature activation+5.03e-03

Pos logit contributions
and+0.007
arre+0.006
Unfortunately+0.005
 not+0.005
 think+0.005
Neg logit contributions
rus-0.031
glers-0.031
iful-0.031
ked-0.031
works-0.031
Token the
Token ID262
Feature activation+2.61e-03

Pos logit contributions
and+0.007
 not+0.006
arre+0.006
 and+0.006
 think+0.006
Neg logit contributions
ka-0.007
glers-0.007
ked-0.007
iful-0.007
rus-0.007
Token magn
Token ID7842
Feature activation+2.31e-03

Pos logit contributions
and+0.004
 not+0.004
 and+0.004
 sure+0.004
 about+0.004
Neg logit contributions
iful-0.003
ka-0.003
rus-0.003
works-0.003
ked-0.003
Tokenifying
Token ID4035
Feature activation-6.03e-02

Pos logit contributions
 and+0.005
and+0.005
 not+0.005
 never+0.005
 with+0.005
Neg logit contributions
ulus-0.001
 avail-0.001
works-0.001
glers-0.001
iful-0.001
Token glass
Token ID5405
Feature activation-1.03e-03

Pos logit contributions
iful+0.091
rus+0.086
ka+0.084
ked+0.084
works+0.084
Neg logit contributions
 not-0.085
 and-0.083
 many-0.078
and-0.078
 never-0.076
Token and
Token ID290
Feature activation-6.18e-03

Pos logit contributions
iful+0.002
ka+0.002
rus+0.002
glers+0.002
ked+0.002
Neg logit contributions
 not-0.001
and-0.001
 sure-0.001
 think-0.001
 never-0.001
Token told
Token ID1297
Feature activation+1.46e-02

Pos logit contributions
iful+0.009
ka+0.009
rus+0.009
ked+0.009
works+0.009
Neg logit contributions
 not-0.008
and-0.008
 and-0.007
 sure-0.007
 never-0.007
Token him
Token ID683
Feature activation+6.56e-04

Pos logit contributions
and+0.021
 not+0.021
 and+0.020
 never+0.019
arre+0.019
Neg logit contributions
iful-0.019
ka-0.019
rus-0.019
ocol-0.018
works-0.018
Token that
Token ID326
Feature activation+1.92e-02

Pos logit contributions
and+0.001
 not+0.000
arre+0.000
 and+0.000
Unfortunately+0.000
Neg logit contributions
iful-0.001
ka-0.001
glers-0.001
rus-0.001
ked-0.001
Token.
Token ID13
Feature activation+5.05e-03

Pos logit contributions
 not+0.008
and+0.008
 and+0.008
 sure+0.007
 never+0.007
Neg logit contributions
iful-0.026
ka-0.025
rus-0.025
ulus-0.025
works-0.025
Token It
Token ID632
Feature activation-7.87e-03

Pos logit contributions
 not+0.005
and+0.005
 and+0.005
 sure+0.005
 never+0.004
Neg logit contributions
iful-0.009
ka-0.009
rus-0.009
ulus-0.009
works-0.009
Token had
Token ID550
Feature activation+1.07e-02

Pos logit contributions
iful+0.013
ka+0.013
rus+0.013
ulus+0.012
works+0.012
Neg logit contributions
 not-0.009
 and-0.009
and-0.009
 never-0.008
 sure-0.008
Token a
Token ID257
Feature activation+6.73e-03

Pos logit contributions
and+0.011
 not+0.010
 and+0.010
 sure+0.009
 think+0.009
Neg logit contributions
iful-0.019
ka-0.019
rus-0.019
works-0.018
ked-0.018
Token long
Token ID890
Feature activation+2.59e-02

Pos logit contributions
and+0.011
 not+0.009
 sure+0.009
 about+0.009
arre+0.009
Neg logit contributions
ka-0.009
rus-0.009
iful-0.009
ulus-0.008
works-0.008
Token aisle
Token ID28960
Feature activation+5.29e-02

Pos logit contributions
and+0.047
 sure+0.038
arre+0.037
 not+0.036
Unfortunately+0.035
Neg logit contributions
ka-0.033
iful-0.031
 rib-0.030
 coloured-0.030
ocol-0.029
Token and
Token ID290
Feature activation-1.20e-03

Pos logit contributions
and+0.042
 not+0.041
 sure+0.038
 think+0.037
arre+0.036
Neg logit contributions
iful-0.103
ka-0.102
glers-0.102
rus-0.101
ked-0.101
Token a
Token ID257
Feature activation-1.15e-02

Pos logit contributions
iful+0.002
ka+0.002
rus+0.002
works+0.002
ulus+0.002
Neg logit contributions
 not-0.001
and-0.001
 and-0.001
 sure-0.001
 never-0.001
Token big
Token ID1263
Feature activation+1.48e-02

Pos logit contributions
ka+0.015
rus+0.015
iful+0.015
ifying+0.014
ulus+0.014
Neg logit contributions
and-0.018
 not-0.016
 sure-0.015
 about-0.015
arre-0.015
Token door
Token ID3420
Feature activation+1.22e-02

Pos logit contributions
and+0.025
 not+0.021
 sure+0.021
Unfortunately+0.020
arre+0.020
Neg logit contributions
ka-0.019
iful-0.019
ocol-0.019
 rib-0.018
rus-0.018
Token.
Token ID13
Feature activation+5.24e-03

Pos logit contributions
 not+0.011
and+0.010
 and+0.010
 never+0.009
 sure+0.009
Neg logit contributions
iful-0.024
ka-0.024
rus-0.024
ulus-0.023
works-0.023
Token.
Token ID13
Feature activation+5.94e-03

Pos logit contributions
arre+0.001
Sadly+0.001
 Sadly+0.001
Unfortunately+0.001
 mindful+0.001
Neg logit contributions
ked-0.001
ries-0.000
 coloured-0.000
icles-0.000
ted-0.000
Token He
Token ID679
Feature activation-3.81e-04

Pos logit contributions
and+0.008
 knowing+0.007
 sure+0.007
arre+0.007
 mindful+0.007
Neg logit contributions
women-0.008
ulus-0.008
ked-0.007
glers-0.007
rus-0.007
Token laughs
Token ID22051
Feature activation+4.56e-03

Pos logit contributions
ka+0.001
rus+0.001
ked+0.001
iful+0.001
ulus+0.001
Neg logit contributions
and-0.001
 not-0.000
 knowing-0.000
 sure-0.000
arre-0.000
Token and
Token ID290
Feature activation+8.64e-03

Pos logit contributions
and+0.003
 think+0.003
 Sadly+0.003
 obey+0.003
arre+0.002
Neg logit contributions
ked-0.010
iful-0.009
ka-0.009
ulus-0.009
rus-0.009
Token cl
Token ID537
Feature activation+4.20e-03

Pos logit contributions
and+0.013
 not+0.010
 story+0.010
 knowing+0.010
arre+0.010
Neg logit contributions
ked-0.013
ka-0.012
ulus-0.012
oured-0.012
glers-0.012
Tokenaps
Token ID1686
Feature activation+4.48e-02

Pos logit contributions
 not+0.004
and+0.004
 never+0.003
 sure+0.003
 story+0.003
Neg logit contributions
ka-0.008
ocol-0.008
ked-0.008
ulus-0.008
oured-0.008
Token his
Token ID465
Feature activation+7.28e-03

Pos logit contributions
and+0.044
 not+0.044
 and+0.039
 think+0.039
 sure+0.039
Neg logit contributions
iful-0.080
ka-0.079
ked-0.077
rus-0.077
ocol-0.076
Token hands
Token ID2832
Feature activation-5.58e-03

Pos logit contributions
 not+0.008
and+0.008
 and+0.008
 sure+0.007
 never+0.007
Neg logit contributions
iful-0.013
ka-0.012
rus-0.012
ked-0.012
ulus-0.012
Token.
Token ID13
Feature activation+8.34e-03

Pos logit contributions
iful+0.009
ked+0.008
ka+0.008
rus+0.008
ulus+0.008
Neg logit contributions
and-0.007
 not-0.006
 think-0.006
 sure-0.006
arre-0.006
Token
Token ID198
Feature activation+1.34e-02

Pos logit contributions
and+0.007
 sure+0.006
arre+0.006
 knowing+0.006
 not+0.006
Neg logit contributions
ulus-0.015
ked-0.015
rus-0.015
women-0.015
ka-0.015
Token
Token ID198
Feature activation+5.86e-03

Pos logit contributions
arre+0.009
and+0.009
 not+0.008
 knowing+0.008
 and+0.008
Neg logit contributions
rus-0.026
ulus-0.026
works-0.025
glers-0.025
iful-0.025
Token Tim
Token ID5045
Feature activation+1.33e-02

Pos logit contributions
iful+0.001
ka+0.001
rus+0.001
works+0.001
ulus+0.001
Neg logit contributions
 not-0.001
 and-0.001
and-0.001
 about-0.001
 sure-0.001
Tokenmy
Token ID1820
Feature activation+1.17e-02

Pos logit contributions
 not+0.013
 and+0.013
and+0.013
 never+0.012
 sure+0.012
Neg logit contributions
iful-0.025
rus-0.024
works-0.024
ulus-0.024
ocol-0.024
Token hugged
Token ID41130
Feature activation+6.61e-03

Pos logit contributions
and+0.014
 not+0.014
 and+0.013
 sure+0.013
 about+0.013
Neg logit contributions
iful-0.018
ka-0.018
rus-0.018
ked-0.018
ulus-0.017
Token the
Token ID262
Feature activation+1.94e-02

Pos logit contributions
 not+0.006
and+0.006
 and+0.006
 sure+0.006
 never+0.006
Neg logit contributions
iful-0.012
ka-0.012
rus-0.012
ulus-0.012
works-0.012
Token t
Token ID256
Feature activation+1.56e-02

Pos logit contributions
 not+0.029
 and+0.028
and+0.028
 sure+0.027
 never+0.027
Neg logit contributions
iful-0.026
ka-0.025
rus-0.025
ulus-0.024
works-0.024
Tokeneddy
Token ID21874
Feature activation+4.39e-02

Pos logit contributions
and+0.027
 and+0.026
 not+0.026
 sure+0.024
 never+0.024
Neg logit contributions
iful-0.018
ulus-0.017
works-0.017
rus-0.016
glers-0.016
Token bear
Token ID6842
Feature activation+2.31e-02

Pos logit contributions
 and+0.065
 not+0.063
and+0.059
 sure+0.058
 never+0.057
Neg logit contributions
iful-0.062
ulus-0.060
ocol-0.060
glers-0.059
rus-0.059
Token and
Token ID290
Feature activation+6.83e-03

Pos logit contributions
arre+0.016
and+0.015
Unfortunately+0.014
ulk+0.013
 not+0.013
Neg logit contributions
ked-0.047
oured-0.047
rus-0.047
ka-0.046
iful-0.046
Token said
Token ID531
Feature activation-2.05e-03

Pos logit contributions
and+0.010
 not+0.009
arre+0.009
Unfortunately+0.008
 story+0.008
Neg logit contributions
ked-0.009
ka-0.009
rus-0.009
glers-0.009
iful-0.009
Token,
Token ID11
Feature activation+2.35e-02

Pos logit contributions
ked+0.004
ka+0.003
rus+0.003
ocol+0.003
iful+0.003
Neg logit contributions
and-0.002
arre-0.002
Unfortunately-0.002
 think-0.002
 not-0.002
Token "
Token ID366
Feature activation-7.07e-03

Pos logit contributions
and+0.035
arre+0.033
 think+0.030
 obey+0.030
 story+0.030
Neg logit contributions
ked-0.031
rus-0.029
ulus-0.028
iful-0.026
oured-0.026
Token the
Token ID262
Feature activation-9.81e-03

Pos logit contributions
iful+0.025
oured+0.024
ocol+0.024
ples+0.024
ka+0.023
Neg logit contributions
 not-0.006
 and-0.006
 many-0.005
 with-0.005
 so-0.003
Token seed
Token ID9403
Feature activation-1.59e-02

Pos logit contributions
iful+0.015
ka+0.015
ked+0.015
rus+0.015
oured+0.014
Neg logit contributions
 not-0.013
 and-0.013
and-0.012
 never-0.012
 think-0.012
Token asked
Token ID1965
Feature activation+1.28e-03

Pos logit contributions
iful+0.023
ka+0.022
rus+0.022
ulus+0.022
works+0.022
Neg logit contributions
 not-0.022
and-0.022
 and-0.021
 sure-0.020
 never-0.020
Token the
Token ID262
Feature activation-3.39e-03

Pos logit contributions
 and+0.001
 not+0.001
 many+0.001
 about+0.001
 sure+0.001
Neg logit contributions
iful-0.003
ka-0.003
glers-0.003
ifying-0.003
oured-0.003
Token f
Token ID277
Feature activation+2.44e-04

Pos logit contributions
iful+0.005
ka+0.005
ked+0.005
oured+0.005
rus+0.005
Neg logit contributions
 not-0.005
 and-0.004
and-0.004
 think-0.004
 never-0.004
Tokenauc
Token ID14272
Feature activation+4.48e-02

Pos logit contributions
 not+0.000
 and+0.000
 never+0.000
and+0.000
 sure+0.000
Neg logit contributions
iful-0.000
ka-0.000
rus-0.000
works-0.000
glers-0.000
Tokenet
Token ID316
Feature activation-2.37e-02

Pos logit contributions
 not+0.085
and+0.083
 and+0.081
 sure+0.078
 never+0.078
Neg logit contributions
iful-0.043
ka-0.041
rus-0.040
works-0.039
ulus-0.039
Token,
Token ID11
Feature activation-7.85e-04

Pos logit contributions
iful+0.052
ka+0.052
rus+0.050
ulus+0.050
ocol+0.049
Neg logit contributions
 and-0.018
 not-0.017
and-0.015
 about-0.014
 sure-0.014
Token "
Token ID366
Feature activation-2.30e-02

Pos logit contributions
iful+0.001
rus+0.001
ked+0.001
ulus+0.001
ka+0.001
Neg logit contributions
and-0.001
 not-0.001
 never-0.001
arre-0.001
 think-0.001
TokenCan
Token ID6090
Feature activation-1.32e-02

Pos logit contributions
iful+0.038
ka+0.037
oured+0.036
rus+0.036
glers+0.036
Neg logit contributions
 and-0.030
and-0.028
 not-0.028
 never-0.027
 think-0.026
Token you
Token ID345
Feature activation-5.73e-03

Pos logit contributions
iful+0.030
ka+0.029
oured+0.028
glers+0.028
ked+0.028
Neg logit contributions
 not-0.013
 and-0.010
 never-0.010
 think-0.009
 with-0.008
Token
Token ID198
Feature activation-1.62e-03

Pos logit contributions
 not+0.002
 knowing+0.002
and+0.002
 never+0.002
 think+0.002
Neg logit contributions
ka-0.006
ulus-0.006
iful-0.006
rus-0.006
redible-0.006
TokenL
Token ID43
Feature activation+1.10e-02

Pos logit contributions
iful+0.002
ka+0.002
redible+0.002
ulus+0.002
rus+0.002
Neg logit contributions
 not-0.002
 and-0.002
and-0.002
 about-0.002
 sure-0.002
Tokenily
Token ID813
Feature activation+7.80e-03

Pos logit contributions
 not+0.012
and+0.011
 and+0.010
 never+0.010
 think+0.010
Neg logit contributions
iful-0.020
ka-0.019
rus-0.018
works-0.018
ked-0.018
Token took
Token ID1718
Feature activation+2.49e-03

Pos logit contributions
and+0.011
 not+0.010
 knowing+0.010
 many+0.010
 about+0.010
Neg logit contributions
rus-0.011
ked-0.010
ka-0.010
oured-0.010
iful-0.010
Token a
Token ID257
Feature activation+1.08e-02

Pos logit contributions
 not+0.002
 and+0.002
 sure+0.002
 many+0.002
and+0.002
Neg logit contributions
iful-0.005
rus-0.005
works-0.005
glers-0.005
ulus-0.005
Token small
Token ID1402
Feature activation+4.94e-02

Pos logit contributions
 not+0.016
 never+0.015
and+0.015
 knowing+0.015
 about+0.015
Neg logit contributions
ka-0.014
rus-0.014
iful-0.014
oured-0.013
ked-0.013
Token bite
Token ID13197
Feature activation-1.17e-02

Pos logit contributions
 not+0.091
 never+0.088
and+0.088
Unfortunately+0.082
 about+0.082
Neg logit contributions
ka-0.053
ocol-0.048
 stuffed-0.047
oured-0.047
mer-0.046
Token and
Token ID290
Feature activation+4.63e-03

Pos logit contributions
iful+0.024
ka+0.024
ked+0.024
glers+0.024
oured+0.023
Neg logit contributions
 not-0.008
and-0.007
 think-0.007
 never-0.007
 sure-0.006
Token to
Token ID284
Feature activation-5.22e-03

Pos logit contributions
 not+0.006
and+0.005
 and+0.005
 sure+0.005
 never+0.005
Neg logit contributions
iful-0.008
ka-0.007
rus-0.007
works-0.007
ulus-0.007
Token her
Token ID607
Feature activation+7.40e-04

Pos logit contributions
iful+0.009
ka+0.009
rus+0.009
ulus+0.009
ked+0.009
Neg logit contributions
and-0.006
 not-0.005
 and-0.005
 sure-0.005
 never-0.005
Token surprise
Token ID5975
Feature activation-9.56e-03

Pos logit contributions
 not+0.001
 and+0.001
 with+0.001
 sure+0.001
 about+0.001
Neg logit contributions
iful-0.002
works-0.001
rus-0.001
ka-0.001
ocol-0.001
Token gaze
Token ID17841
Feature activation+1.46e-02

Pos logit contributions
arre+0.035
Unfortunately+0.029
 late+0.029
and+0.028
Sadly+0.028
Neg logit contributions
rus-0.021
 usual-0.021
women-0.021
 avail-0.021
ked-0.021
Token and
Token ID290
Feature activation+4.20e-03

Pos logit contributions
 not+0.010
and+0.009
 and+0.008
 sure+0.008
 never+0.008
Neg logit contributions
iful-0.032
ka-0.031
rus-0.031
ked-0.031
ulus-0.030
Token saw
Token ID2497
Feature activation+1.13e-03

Pos logit contributions
and+0.007
arre+0.006
 not+0.006
 about+0.006
 story+0.006
Neg logit contributions
ked-0.005
iful-0.005
rus-0.005
ka-0.005
glers-0.005
Token it
Token ID340
Feature activation-6.79e-03

Pos logit contributions
 not+0.001
and+0.001
 and+0.001
 never+0.001
 sure+0.001
Neg logit contributions
iful-0.002
rus-0.002
ka-0.002
ked-0.002
glers-0.002
Token too
Token ID1165
Feature activation+1.16e-02

Pos logit contributions
iful+0.013
ka+0.013
rus+0.013
ocol+0.013
ulus+0.013
Neg logit contributions
 not-0.006
 and-0.006
and-0.005
 never-0.005
 about-0.005
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
and+0.013
 not+0.013
 sure+0.011
 and+0.011
 never+0.011
Neg logit contributions
iful-0.019
ka-0.019
rus-0.019
ulus-0.019
works-0.019
Token A
Token ID317
Feature activation+4.63e-04

Pos logit contributions
 sure+0.011
 mindful+0.011
and+0.010
 never+0.010
 aware+0.010
Neg logit contributions
ked-0.016
ulus-0.016
rus-0.015
ocol-0.015
icles-0.015
Token giant
Token ID6175
Feature activation+1.56e-02

Pos logit contributions
 not+0.001
 and+0.001
and+0.001
 sure+0.001
 never+0.001
Neg logit contributions
iful-0.001
ka-0.001
rus-0.001
ked-0.001
ulus-0.001
Token was
Token ID373
Feature activation+8.56e-03

Pos logit contributions
and+0.019
 not+0.019
 and+0.017
 sure+0.017
 never+0.017
Neg logit contributions
iful-0.025
ka-0.024
rus-0.024
works-0.023
glers-0.023
Token walking
Token ID6155
Feature activation-7.43e-03

Pos logit contributions
 not+0.008
and+0.008
 and+0.008
 sure+0.007
 never+0.007
Neg logit contributions
iful-0.016
ka-0.016
rus-0.015
ulus-0.015
works-0.015
Token towards
Token ID3371
Feature activation-1.93e-03

Pos logit contributions
iful+0.014
ka+0.014
rus+0.014
ulus+0.014
works+0.014
Neg logit contributions
 not-0.007
and-0.006
 and-0.006
 sure-0.006
 never-0.006
Token them
Token ID606
Feature activation-7.51e-03

Pos logit contributions
iful+0.033
rus+0.032
works+0.031
glers+0.031
ulus+0.031
Neg logit contributions
 not-0.014
 and-0.013
 many-0.012
 never-0.012
 about-0.011
Token!"
Token ID2474
Feature activation-4.88e-03

Pos logit contributions
iful+0.013
ka+0.012
rus+0.012
works+0.012
ulus+0.012
Neg logit contributions
 not-0.008
and-0.008
 and-0.008
 sure-0.007
 never-0.007
Token
Token ID198
Feature activation+2.51e-03

Pos logit contributions
ked+0.005
icles+0.005
women+0.005
ted+0.005
 widened+0.005
Neg logit contributions
arre-0.007
 aware-0.007
 mindful-0.007
vel-0.006
and-0.006
Token
Token ID198
Feature activation-1.79e-03

Pos logit contributions
 not+0.002
arre+0.002
 knowing+0.002
 never+0.002
 think+0.002
Neg logit contributions
rus-0.005
ka-0.005
ulus-0.005
glers-0.004
works-0.004
TokenL
Token ID43
Feature activation+1.45e-02

Pos logit contributions
ka+0.002
ulus+0.002
redible+0.002
rus+0.002
glers+0.002
Neg logit contributions
 about-0.002
 not-0.002
 sure-0.002
and-0.002
 think-0.002
Tokenily
Token ID813
Feature activation+9.52e-03

Pos logit contributions
 not+0.015
and+0.014
 and+0.014
 never+0.013
 sure+0.013
Neg logit contributions
iful-0.026
ka-0.026
rus-0.025
ked-0.025
works-0.025
Token gets
Token ID3011
Feature activation+1.44e-02

Pos logit contributions
and+0.015
 knowing+0.013
 aware+0.011
Unfortunately+0.011
 late+0.011
Neg logit contributions
ked-0.014
ka-0.013
oured-0.013
rus-0.013
ulus-0.013
Token angry
Token ID7954
Feature activation+5.46e-03

Pos logit contributions
and+0.018
 knowing+0.016
 not+0.015
 never+0.015
 story+0.015
Neg logit contributions
ka-0.022
ked-0.022
glers-0.022
rus-0.021
ulus-0.021
Token.
Token ID13
Feature activation+2.20e-03

Pos logit contributions
 Sadly+0.007
Sadly+0.007
Unfortunately+0.007
 Unfortunately+0.007
 never+0.006
Neg logit contributions
ked-0.008
ly-0.007
ted-0.007
ka-0.007
iful-0.007
Token She
Token ID1375
Feature activation-1.66e-03

Pos logit contributions
and+0.002
 never+0.002
 sure+0.002
 not+0.002
 story+0.002
Neg logit contributions
ulus-0.004
ked-0.004
rus-0.003
redible-0.003
oured-0.003
Token pushes
Token ID20070
Feature activation+1.89e-02

Pos logit contributions
ka+0.003
rus+0.003
ked+0.003
iful+0.003
ulus+0.003
Neg logit contributions
and-0.002
 knowing-0.002
 not-0.002
 sure-0.002
 and-0.002
Token down
Token ID866
Feature activation-5.16e-03

Pos logit contributions
ka+0.000
iful+0.000
ked+0.000
rus+0.000
glers+0.000
Neg logit contributions
 not-0.000
and-0.000
arre-0.000
aring-0.000
 knowing-0.000
Token the
Token ID262
Feature activation-2.63e-03

Pos logit contributions
iful+0.008
ka+0.008
glers+0.008
ked+0.007
rus+0.007
Neg logit contributions
 not-0.006
and-0.006
arre-0.006
 sure-0.006
 think-0.006
Token hill
Token ID12788
Feature activation+6.05e-03

Pos logit contributions
iful+0.004
ocol+0.004
ked+0.004
ka+0.004
rus+0.004
Neg logit contributions
 and-0.003
 not-0.003
and-0.003
 never-0.003
 think-0.003
Token together
Token ID1978
Feature activation-1.44e-03

Pos logit contributions
and+0.005
 not+0.004
 sure+0.004
 think+0.004
arre+0.004
Neg logit contributions
iful-0.012
glers-0.012
mer-0.011
rus-0.011
oured-0.011
Token again
Token ID757
Feature activation+6.73e-03

Pos logit contributions
iful+0.003
ked+0.003
 usual+0.003
oured+0.003
mer+0.003
Neg logit contributions
arre-0.001
and-0.001
 not-0.001
Sadly-0.001
 Sadly-0.001
Token and
Token ID290
Feature activation+7.83e-03

Pos logit contributions
 not+0.005
and+0.005
 sure+0.004
 never+0.004
 think+0.004
Neg logit contributions
iful-0.014
ka-0.013
ulus-0.013
rus-0.013
works-0.013
Token again
Token ID757
Feature activation+8.96e-03

Pos logit contributions
and+0.010
 not+0.010
 and+0.010
 think+0.009
 about+0.009
Neg logit contributions
iful-0.012
ka-0.011
ked-0.011
glers-0.011
rus-0.011
Token,
Token ID11
Feature activation+6.26e-03

Pos logit contributions
 not+0.008
and+0.007
 and+0.007
 sure+0.007
 never+0.007
Neg logit contributions
iful-0.018
ka-0.017
rus-0.017
ulus-0.017
works-0.017
Token until
Token ID1566
Feature activation+2.59e-03

Pos logit contributions
and+0.006
 story+0.006
Sadly+0.006
arre+0.006
Unfortunately+0.006
Neg logit contributions
oured-0.011
ked-0.010
iful-0.010
 rib-0.010
 usual-0.010
Token the
Token ID262
Feature activation-8.35e-03

Pos logit contributions
 not+0.002
 and+0.002
and+0.002
 never+0.002
 sure+0.002
Neg logit contributions
iful-0.005
rus-0.005
ka-0.005
ulus-0.005
works-0.005
Token sun
Token ID4252
Feature activation-3.99e-02

Pos logit contributions
iful+0.014
ocol+0.014
ulus+0.013
ked+0.013
ka+0.013
Neg logit contributions
 and-0.011
 not-0.010
 never-0.009
 story-0.009
 heart-0.009
Token
Token ID198
Feature activation+2.57e-03

Pos logit contributions
arre+0.008
 knowing+0.008
 mindful+0.008
 aware+0.008
 thinking+0.008
Neg logit contributions
rolled-0.014
rus-0.013
redible-0.013
ulus-0.013
 usual-0.013
TokenJust
Token ID5703
Feature activation-1.92e-02

Pos logit contributions
 knowing+0.003
 crying+0.003
 thinking+0.003
aps+0.003
 and+0.003
Neg logit contributions
redible-0.003
ulus-0.003
rus-0.003
ressive-0.003
Free-0.003
Token then
Token ID788
Feature activation-3.88e-03

Pos logit contributions
iful+0.029
ka+0.028
rus+0.028
ulus+0.027
ked+0.027
Neg logit contributions
 not-0.025
and-0.025
 and-0.024
 sure-0.023
 never-0.023
Token,
Token ID11
Feature activation+1.19e-02

Pos logit contributions
ka+0.008
iful+0.008
rus+0.008
ked+0.008
ulus+0.007
Neg logit contributions
and-0.003
 not-0.003
 and-0.003
 sure-0.003
 think-0.003
Token a
Token ID257
Feature activation-1.09e-02

Pos logit contributions
and+0.012
 not+0.011
 and+0.010
 sure+0.010
 think+0.010
Neg logit contributions
iful-0.022
ka-0.022
rus-0.022
ked-0.021
works-0.021
Token kind
Token ID1611
Feature activation+1.78e-02

Pos logit contributions
iful+0.021
ka+0.020
ked+0.020
rus+0.019
works+0.019
Neg logit contributions
 not-0.011
 and-0.011
and-0.010
 sure-0.010
 never-0.010
Token voice
Token ID3809
Feature activation-1.09e-02

Pos logit contributions
and+0.031
 not+0.028
Unfortunately+0.028
arre+0.027
 never+0.027
Neg logit contributions
rus-0.020
ka-0.020
iful-0.019
ocol-0.018
glers-0.018
Token spoke
Token ID5158
Feature activation+3.59e-03

Pos logit contributions
iful+0.021
ocol+0.020
ka+0.020
ifying+0.020
ksh+0.020
Neg logit contributions
 and-0.012
 about-0.011
 not-0.011
 with-0.010
 sure-0.010
Token out
Token ID503
Feature activation-6.49e-03

Pos logit contributions
 not+0.002
 and+0.002
and+0.002
 about+0.002
 never+0.002
Neg logit contributions
iful-0.008
ka-0.008
ocol-0.008
rus-0.008
ulus-0.008
Token of
Token ID286
Feature activation-1.81e-02

Pos logit contributions
iful+0.012
rus+0.011
ka+0.011
ked+0.011
ulus+0.011
Neg logit contributions
and-0.007
 not-0.006
 think-0.006
 never-0.006
 sure-0.006
Token the
Token ID262
Feature activation+1.91e-03

Pos logit contributions
iful+0.033
ka+0.032
rus+0.032
ulus+0.032
works+0.032
Neg logit contributions
 not-0.019
 and-0.019
 never-0.017
and-0.017
 about-0.017
TokenOf
Token ID5189
Feature activation-2.76e-03

Pos logit contributions
iful+0.020
ka+0.019
rus+0.019
ulus+0.019
ocol+0.018
Neg logit contributions
 not-0.019
and-0.019
 and-0.018
 never-0.017
 sure-0.017
Token course
Token ID1781
Feature activation-7.12e-03

Pos logit contributions
iful+0.005
ked+0.005
rus+0.005
oured+0.005
ulus+0.005
Neg logit contributions
 not-0.003
 and-0.003
 never-0.003
 think-0.003
 many-0.003
Token,
Token ID11
Feature activation-5.35e-03

Pos logit contributions
iful+0.019
rus+0.019
ka+0.018
ked+0.018
glers+0.018
Neg logit contributions
 and-0.003
 not-0.003
 with-0.002
 sure-0.002
 never-0.002
Token my
Token ID616
Feature activation-8.52e-03

Pos logit contributions
works+0.007
iful+0.007
ocol+0.007
 rib+0.007
ka+0.007
Neg logit contributions
and-0.008
arre-0.007
 Sadly-0.007
 knowing-0.007
 Unfortunately-0.006
Token sweet
Token ID6029
Feature activation-3.14e-03

Pos logit contributions
iful+0.016
glers+0.016
ked+0.016
rus+0.016
ulus+0.016
Neg logit contributions
 and-0.009
 not-0.009
 think-0.008
 sure-0.007
 about-0.007
Tokenies
Token ID444
Feature activation+8.21e-03

Pos logit contributions
iful+0.003
ka+0.003
glers+0.003
works+0.003
ocol+0.003
Neg logit contributions
 not-0.006
and-0.006
 never-0.005
 sure-0.005
Unfortunately-0.005
Token,"
Token ID553
Feature activation-1.05e-02

Pos logit contributions
and+0.010
 not+0.010
Unfortunately+0.009
 knowing+0.009
 sure+0.009
Neg logit contributions
iful-0.013
ka-0.012
rus-0.012
works-0.012
glers-0.012
Token she
Token ID673
Feature activation-6.68e-03

Pos logit contributions
glers+0.017
ka+0.016
 rib+0.015
works+0.015
ocol+0.015
Neg logit contributions
and-0.013
arre-0.012
 knowing-0.011
 never-0.011
 not-0.011
Token says
Token ID1139
Feature activation-2.33e-02

Pos logit contributions
ka+0.011
iful+0.011
rus+0.010
works+0.010
glers+0.010
Neg logit contributions
and-0.008
 not-0.008
 never-0.007
 knowing-0.007
 and-0.007
Token.
Token ID13
Feature activation-3.72e-03

Pos logit contributions
ked+0.037
iful+0.036
ka+0.036
rus+0.035
ulus+0.035
Neg logit contributions
 not-0.027
and-0.026
 never-0.025
arre-0.025
 think-0.025
Token "
Token ID366
Feature activation-1.08e-02

Pos logit contributions
ked+0.004
ulus+0.004
women+0.004
rus+0.004
oured+0.004
Neg logit contributions
and-0.006
arre-0.005
 think-0.005
 never-0.005
 sure-0.005
Token it
Token ID340
Feature activation-1.47e-02

Pos logit contributions
iful+0.031
ka+0.031
ked+0.029
ulus+0.029
 rib+0.029
Neg logit contributions
and-0.028
 not-0.027
arre-0.025
 sure-0.025
 think-0.025
Token was
Token ID373
Feature activation+3.40e-03

Pos logit contributions
iful+0.025
ka+0.024
rus+0.024
ocol+0.024
ulus+0.024
Neg logit contributions
 not-0.018
 and-0.017
and-0.016
 never-0.016
 sure-0.015
Token perfect
Token ID2818
Feature activation-5.85e-03

Pos logit contributions
 not+0.003
and+0.003
 and+0.003
 sure+0.003
 never+0.003
Neg logit contributions
iful-0.007
ka-0.006
rus-0.006
ulus-0.006
works-0.006
Token!
Token ID0
Feature activation-7.77e-03

Pos logit contributions
iful+0.011
ka+0.010
ked+0.010
oured+0.010
rus+0.010
Neg logit contributions
and-0.005
 not-0.005
arre-0.005
Unfortunately-0.005
 Sadly-0.005
Token
Token ID198
Feature activation+3.29e-03

Pos logit contributions
oured+0.015
ka+0.015
iful+0.015
ulus+0.015
ked+0.015
Neg logit contributions
 not-0.006
and-0.006
 never-0.006
 sure-0.005
 think-0.005
Token
Token ID198
Feature activation-3.49e-03

Pos logit contributions
 not+0.003
arre+0.003
 knowing+0.003
 mindful+0.003
 aware+0.002
Neg logit contributions
rus-0.005
redible-0.005
ulus-0.005
glers-0.005
ka-0.005
TokenThe
Token ID464
Feature activation+2.00e-03

Pos logit contributions
ulus+0.005
redible+0.005
iful+0.005
ka+0.005
rus+0.005
Neg logit contributions
 not-0.004
and-0.004
 and-0.004
 knowing-0.004
 sure-0.004
Token shop
Token ID6128
Feature activation-4.53e-03

Pos logit contributions
 not+0.002
 and+0.002
 sure+0.002
and+0.002
 think+0.002
Neg logit contributions
iful-0.004
rus-0.004
ocol-0.003
ked-0.003
ka-0.003
Tokenkeeper
Token ID13884
Feature activation+2.43e-03

Pos logit contributions
iful+0.006
ka+0.006
ocol+0.006
rus+0.006
ulus+0.006
Neg logit contributions
 not-0.007
 and-0.007
and-0.006
 never-0.006
 sure-0.006
Token smiled
Token ID13541
Feature activation-2.52e-03

Pos logit contributions
 not+0.003
 and+0.003
and+0.003
 sure+0.003
 never+0.003
Neg logit contributions
iful-0.004
ka-0.004
rus-0.004
works-0.004
ulus-0.004
Token and
Token ID290
Feature activation+1.59e-03

Pos logit contributions
iful+0.006
ka+0.006
ked+0.006
rus+0.006
ulus+0.006
Neg logit contributions
and-0.001
 not-0.001
 think-0.000
 never-0.000
 sure-0.000
Token and
Token ID290
Feature activation+5.42e-03

Pos logit contributions
iful+0.014
ked+0.013
ka+0.013
works+0.013
rus+0.013
Neg logit contributions
and-0.006
 not-0.006
Unfortunately-0.005
 think-0.005
Sadly-0.005
Token thanked
Token ID26280
Feature activation-5.96e-03

Pos logit contributions
and+0.007
arre+0.006
 not+0.006
Unfortunately+0.006
 and+0.006
Neg logit contributions
iful-0.008
ka-0.008
 usual-0.008
 avail-0.008
 rib-0.007
Token mum
Token ID25682
Feature activation+1.87e-02

Pos logit contributions
iful+0.011
rus+0.011
ka+0.011
works+0.010
ulus+0.010
Neg logit contributions
 not-0.006
 and-0.006
and-0.006
 sure-0.005
 never-0.005
Token for
Token ID329
Feature activation+5.51e-03

Pos logit contributions
 not+0.015
and+0.015
 and+0.013
 never+0.013
 think+0.013
Neg logit contributions
iful-0.038
ka-0.037
rus-0.036
glers-0.036
works-0.036
Token the
Token ID262
Feature activation+2.08e-03

Pos logit contributions
and+0.007
arre+0.006
 and+0.006
 not+0.006
 think+0.006
Neg logit contributions
iful-0.009
ocol-0.008
rus-0.008
redible-0.008
ka-0.008
Token special
Token ID2041
Feature activation+6.56e-03

Pos logit contributions
 not+0.003
and+0.003
 and+0.003
 sure+0.002
 never+0.002
Neg logit contributions
iful-0.003
ka-0.003
rus-0.003
works-0.003
ulus-0.003
Token gift
Token ID6979
Feature activation-2.77e-02

Pos logit contributions
 not+0.010
and+0.010
 sure+0.009
 and+0.009
 never+0.009
Neg logit contributions
ka-0.008
iful-0.008
works-0.008
rus-0.008
ulus-0.008
Token.
Token ID13
Feature activation+2.00e-03

Pos logit contributions
iful+0.052
ka+0.051
rus+0.051
works+0.050
ulus+0.050
Neg logit contributions
 not-0.026
and-0.025
 and-0.024
 sure-0.022
 never-0.022
Token<|endoftext|>
Token ID50256
Feature activation+1.38e-03

Pos logit contributions
 not+0.002
and+0.001
 never+0.001
 think+0.001
 cry+0.001
Neg logit contributions
oured-0.004
ka-0.004
ifying-0.004
iful-0.004
ulus-0.004
TokenOnce
Token ID7454
Feature activation+4.65e-03

Pos logit contributions
 not+0.002
 and+0.002
and+0.002
 sure+0.002
 about+0.002
Neg logit contributions
iful-0.002
rus-0.002
ka-0.002
ulus-0.002
ked-0.002
Token there
Token ID612
Feature activation+3.34e-02

Pos logit contributions
 not+0.007
and+0.006
 and+0.006
 never+0.006
 sure+0.006
Neg logit contributions
iful-0.006
ka-0.006
rus-0.006
glers-0.006
works-0.006
Tokenâ
Token ID22940
Feature activation+2.14e-02

Pos logit contributions
iful+0.014
ka+0.013
rus+0.013
ulus+0.013
works+0.013
Neg logit contributions
 not-0.008
and-0.008
 and-0.008
 sure-0.007
 never-0.007
Token
Token ID26391
Feature activation-2.26e-02

Pos logit contributions
 not+0.034
and+0.034
 and+0.033
 never+0.032
 sure+0.032
Neg logit contributions
iful-0.026
ka-0.025
rus-0.025
ked-0.025
ulus-0.024
Token
Token ID198
Feature activation+1.15e-03

Pos logit contributions
ulus+0.032
ka+0.031
rus+0.030
glers+0.030
works+0.030
Neg logit contributions
and-0.030
 not-0.027
 knowing-0.026
 about-0.026
 many-0.026
Token
Token ID198
Feature activation-5.10e-03

Pos logit contributions
 not+0.001
 knowing+0.001
arre+0.001
 think+0.001
and+0.001
Neg logit contributions
ulus-0.002
rus-0.002
redible-0.002
works-0.002
glers-0.002
TokenBilly
Token ID49640
Feature activation-3.42e-02

Pos logit contributions
iful+0.006
ka+0.006
rus+0.006
ulus+0.006
glers+0.006
Neg logit contributions
 not-0.008
and-0.008
 and-0.008
 sure-0.007
 think-0.007
Token smiled
Token ID13541
Feature activation+1.41e-03

Pos logit contributions
iful+0.052
ka+0.052
rus+0.051
ked+0.049
ulus+0.049
Neg logit contributions
and-0.045
 not-0.042
 and-0.038
 knowing-0.038
 sure-0.038
Token.
Token ID13
Feature activation-5.02e-03

Pos logit contributions
and+0.001
 not+0.001
 never+0.000
 think+0.000
Unfortunately+0.000
Neg logit contributions
iful-0.003
ka-0.003
ked-0.003
ulus-0.003
rus-0.003
Token
Token ID6184
Feature activation+1.18e-02

Pos logit contributions
iful+0.010
ka+0.009
rus+0.009
ulus+0.009
works+0.009
Neg logit contributions
 not-0.004
and-0.004
 and-0.004
 never-0.004
 sure-0.004
Token
Token ID95
Feature activation+1.18e-03

Pos logit contributions
 not+0.016
 and+0.016
and+0.016
 think+0.015
 sure+0.015
Neg logit contributions
iful-0.018
glers-0.017
rus-0.016
ka-0.016
works-0.016
Token
Token ID26391
Feature activation-1.97e-02

Pos logit contributions
 not+0.002
and+0.002
 and+0.002
 sure+0.001
 never+0.001
Neg logit contributions
iful-0.002
ka-0.002
rus-0.002
ked-0.002
works-0.002
Token
Token ID129
Feature activation-7.82e-03

Pos logit contributions
ulus+0.028
ka+0.027
rus+0.027
iful+0.027
works+0.026
Neg logit contributions
and-0.026
 not-0.025
 sure-0.023
 knowing-0.023
 about-0.023
Token.
Token ID13
Feature activation+3.80e-03

Pos logit contributions
iful+0.024
ka+0.023
rus+0.023
ulus+0.023
works+0.023
Neg logit contributions
 not-0.013
and-0.013
 and-0.013
 sure-0.012
 never-0.012
Token
Token ID220
Feature activation-1.09e-02

Pos logit contributions
 not+0.003
and+0.003
 knowing+0.003
arre+0.003
 and+0.003
Neg logit contributions
rus-0.007
ulus-0.007
ked-0.007
oured-0.007
ka-0.007
Token
Token ID198
Feature activation+6.57e-03

Pos logit contributions
rus+0.021
iful+0.021
works+0.020
glers+0.020
ulus+0.020
Neg logit contributions
 not-0.007
 knowing-0.007
 never-0.007
 think-0.007
and-0.006
Token
Token ID198
Feature activation-9.33e-04

Pos logit contributions
 knowing+0.006
arre+0.005
 not+0.005
 cry+0.005
 think+0.005
Neg logit contributions
rus-0.011
redible-0.011
ulus-0.011
ka-0.011
rolled-0.011
TokenHe
Token ID1544
Feature activation-2.06e-02

Pos logit contributions
rus+0.002
ulus+0.002
ka+0.002
iful+0.002
redible+0.001
Neg logit contributions
 not-0.001
 and-0.001
and-0.001
 knowing-0.001
 think-0.001
Token looked
Token ID3114
Feature activation-5.99e-03

Pos logit contributions
iful+0.034
ocol+0.033
ka+0.033
works+0.033
ulus+0.032
Neg logit contributions
 not-0.027
 and-0.026
 never-0.025
 sure-0.024
and-0.024
Token around
Token ID1088
Feature activation+3.84e-03

Pos logit contributions
iful+0.010
ka+0.010
rus+0.009
ked+0.009
oured+0.009
Neg logit contributions
and-0.007
 not-0.007
Unfortunately-0.006
 story-0.006
 never-0.006
Token desperately
Token ID16459
Feature activation+1.30e-02

Pos logit contributions
arre+0.003
and+0.003
Sadly+0.003
aring+0.003
Unfortunately+0.003
Neg logit contributions
 usual-0.007
iful-0.007
ked-0.007
ocol-0.007
mer-0.007
Token and
Token ID290
Feature activation+1.20e-02

Pos logit contributions
and+0.013
arre+0.012
Unfortunately+0.012
Sadly+0.012
 not+0.010
Neg logit contributions
ked-0.024
iful-0.023
 usual-0.022
ocol-0.022
ka-0.022
Token saw
Token ID2497
Feature activation+2.90e-03

Pos logit contributions
and+0.017
 not+0.016
 and+0.015
arre+0.015
 sure+0.015
Neg logit contributions
iful-0.017
rus-0.017
ked-0.016
ka-0.016
ulus-0.016
Token a
Token ID257
Feature activation-4.85e-03

Pos logit contributions
and+0.003
 not+0.003
 and+0.003
 think+0.002
 never+0.002
Neg logit contributions
iful-0.005
rus-0.005
ka-0.005
ked-0.005
glers-0.005
Token little
Token ID1310
Feature activation+9.64e-03

Pos logit contributions
iful+0.030
works+0.028
ked+0.028
ka+0.028
ocol+0.028
Neg logit contributions
 not-0.017
 and-0.017
 sure-0.014
 think-0.014
 many-0.014
Token girl
Token ID2576
Feature activation+5.87e-03

Pos logit contributions
 and+0.012
 not+0.012
and+0.011
 sure+0.011
 think+0.011
Neg logit contributions
iful-0.016
rus-0.015
ka-0.015
works-0.015
ked-0.015
Token named
Token ID3706
Feature activation-3.13e-02

Pos logit contributions
and+0.008
 not+0.007
arre+0.007
 sure+0.007
Unfortunately+0.007
Neg logit contributions
ka-0.008
ulus-0.008
iful-0.008
glers-0.008
rus-0.008
Token Lily
Token ID20037
Feature activation-4.83e-03

Pos logit contributions
iful+0.067
rus+0.066
ka+0.065
ocol+0.064
glers+0.063
Neg logit contributions
 not-0.030
 and-0.026
and-0.026
 never-0.024
 many-0.024
Token.
Token ID13
Feature activation-2.59e-03

Pos logit contributions
iful+0.007
ka+0.007
rus+0.007
ked+0.007
ulus+0.007
Neg logit contributions
 not-0.006
and-0.006
arre-0.006
Unfortunately-0.005
 sure-0.005
Token She
Token ID1375
Feature activation-1.92e-02

Pos logit contributions
iful+0.006
ka+0.006
rus+0.006
ulus+0.006
works+0.006
Neg logit contributions
 not-0.002
and-0.001
 and-0.001
 sure-0.001
 never-0.001
Token had
Token ID550
Feature activation-8.91e-03

Pos logit contributions
iful+0.034
ka+0.033
rus+0.033
works+0.032
ulus+0.032
Neg logit contributions
 not-0.022
and-0.020
 and-0.020
 never-0.019
 sure-0.019
Token a
Token ID257
Feature activation+5.28e-03

Pos logit contributions
ka+0.014
iful+0.013
ulus+0.013
ked+0.013
rus+0.013
Neg logit contributions
and-0.011
 think-0.010
arre-0.009
 not-0.009
 and-0.009
Token special
Token ID2041
Feature activation+2.68e-02

Pos logit contributions
 not+0.007
and+0.007
 and+0.007
 sure+0.006
 never+0.006
Neg logit contributions
iful-0.008
ka-0.008
rus-0.008
works-0.007
ulus-0.007
Token drawer
Token ID33451
Feature activation+3.11e-02

Pos logit contributions
and+0.038
 not+0.036
 sure+0.034
 never+0.033
Unfortunately+0.033
Neg logit contributions
ka-0.038
iful-0.037
rus-0.037
ulus-0.035
works-0.035
Token in
Token ID287
Feature activation-6.11e-03

Pos logit contributions
arre+0.028
and+0.027
 think+0.025
 sure+0.024
Unfortunately+0.024
Neg logit contributions
ka-0.058
iful-0.056
rus-0.056
glers-0.055
oured-0.055
TokenWe
Token ID1135
Feature activation+1.47e-03

Pos logit contributions
iful+0.017
ocol+0.017
rus+0.016
ka+0.016
glers+0.016
Neg logit contributions
 not-0.013
and-0.013
 and-0.013
 never-0.012
 sure-0.012
Token promise
Token ID6991
Feature activation-3.21e-02

Pos logit contributions
and+0.002
 not+0.002
 and+0.002
 knowing+0.002
 about+0.002
Neg logit contributions
iful-0.002
ka-0.002
rus-0.002
works-0.002
ked-0.002
Token!"
Token ID2474
Feature activation-1.03e-02

Pos logit contributions
iful+0.068
ka+0.067
rus+0.067
ulus+0.066
works+0.066
Neg logit contributions
and-0.022
 not-0.021
 and-0.020
 sure-0.018
 think-0.017
Token
Token ID198
Feature activation-3.69e-03

Pos logit contributions
ked+0.015
ted+0.014
rolled+0.014
glers+0.014
icles+0.014
Neg logit contributions
and-0.013
 grounded-0.012
 not-0.012
 never-0.012
 cry-0.012
Token
Token ID198
Feature activation-7.92e-03

Pos logit contributions
rus+0.007
glers+0.007
works+0.007
ulus+0.007
ka+0.007
Neg logit contributions
 not-0.002
and-0.002
 knowing-0.002
arre-0.002
 think-0.002
TokenSo
Token ID2396
Feature activation-3.10e-02

Pos logit contributions
iful+0.012
rus+0.012
ka+0.012
glers+0.012
ulus+0.012
Neg logit contributions
and-0.010
 not-0.010
 and-0.009
 sure-0.009
 think-0.009
Token,
Token ID11
Feature activation-2.99e-03

Pos logit contributions
ked+0.057
iful+0.057
ka+0.057
rus+0.056
glers+0.055
Neg logit contributions
and-0.030
 not-0.026
arre-0.024
 think-0.024
 sure-0.023
Token Fin
Token ID4463
Feature activation-1.78e-02

Pos logit contributions
iful+0.005
ked+0.004
ka+0.004
rus+0.004
glers+0.004
Neg logit contributions
and-0.004
 not-0.004
 and-0.003
 think-0.003
 sure-0.003
Token and
Token ID290
Feature activation+5.69e-03

Pos logit contributions
iful+0.045
ka+0.045
ked+0.044
rus+0.044
glers+0.044
Neg logit contributions
and-0.004
 not-0.004
 about-0.002
 sure-0.001
 never-0.001
Token his
Token ID465
Feature activation-2.04e-02

Pos logit contributions
and+0.007
 not+0.006
arre+0.006
 about+0.006
Unfortunately+0.006
Neg logit contributions
ka-0.009
 avail-0.009
iful-0.009
glers-0.009
ked-0.009
Token friends
Token ID2460
Feature activation-5.38e-03

Pos logit contributions
iful+0.054
ifying+0.053
oured+0.052
ocol+0.051
works+0.051
Neg logit contributions
 not-0.011
 and-0.009
 story-0.008
 many-0.007
 sure-0.006
Token that
Token ID326
Feature activation-9.65e-03

Pos logit contributions
iful+0.036
ka+0.036
rus+0.035
ulus+0.035
works+0.035
Neg logit contributions
 not-0.009
and-0.008
 and-0.008
 sure-0.007
 never-0.007
Token?"
Token ID1701
Feature activation-1.74e-03

Pos logit contributions
iful+0.016
rus+0.016
ka+0.016
works+0.016
ulus+0.015
Neg logit contributions
and-0.011
 not-0.011
 and-0.010
 never-0.010
 sure-0.010
Token
Token ID198
Feature activation-7.21e-03

Pos logit contributions
rus+0.003
ka+0.003
iful+0.003
ulus+0.003
works+0.003
Neg logit contributions
 not-0.002
and-0.002
 never-0.001
 knowing-0.001
arre-0.001
Token
Token ID198
Feature activation-9.87e-03

Pos logit contributions
iful+0.018
ka+0.018
rus+0.017
ked+0.017
works+0.017
Neg logit contributions
 not-0.003
and-0.003
 and-0.002
 sure-0.002
 never-0.002
TokenThe
Token ID464
Feature activation-4.61e-03

Pos logit contributions
iful+0.019
ka+0.018
ked+0.018
rus+0.018
ocol+0.018
Neg logit contributions
 not-0.010
 and-0.009
and-0.009
 never-0.009
 sure-0.009
Token sun
Token ID4252
Feature activation-4.11e-02

Pos logit contributions
ifying+0.008
oured+0.008
iful+0.008
ocol+0.008
ka+0.008
Neg logit contributions
 not-0.006
 and-0.006
 story-0.005
 never-0.005
 sure-0.005
Token answered
Token ID9373
Feature activation-1.12e-02

Pos logit contributions
iful+0.070
ocol+0.069
ulus+0.067
ka+0.066
 avail+0.065
Neg logit contributions
 and-0.053
 not-0.051
 never-0.046
 with-0.044
 sure-0.042
Token back
Token ID736
Feature activation-4.51e-03

Pos logit contributions
iful+0.026
ka+0.025
rus+0.025
works+0.025
glers+0.025
Neg logit contributions
 and-0.007
 not-0.007
and-0.006
 about-0.005
 sure-0.005
Token in
Token ID287
Feature activation-5.54e-04

Pos logit contributions
ka+0.009
iful+0.009
ked+0.009
rus+0.009
works+0.008
Neg logit contributions
and-0.004
 not-0.003
 never-0.003
Unfortunately-0.003
arre-0.003
Token a
Token ID257
Feature activation-2.00e-03

Pos logit contributions
iful+0.001
ka+0.001
rus+0.001
ocol+0.001
ulus+0.001
Neg logit contributions
and-0.001
 not-0.001
 and-0.001
 sure-0.001
 about-0.001
Token bright
Token ID6016
Feature activation+1.14e-02

Pos logit contributions
iful+0.003
ka+0.003
rus+0.003
ulus+0.003
works+0.003
Neg logit contributions
 not-0.003
and-0.002
 and-0.002
 sure-0.002
 never-0.002
Token go
Token ID467
Feature activation-3.52e-03

Pos logit contributions
and+0.002
 not+0.002
 knowing+0.002
 and+0.002
 about+0.002
Neg logit contributions
iful-0.002
rus-0.002
glers-0.002
works-0.002
ka-0.002
Token outside
Token ID2354
Feature activation-2.03e-02

Pos logit contributions
ka+0.006
oured+0.006
iful+0.006
glers+0.006
ulus+0.006
Neg logit contributions
 not-0.003
and-0.003
arre-0.003
Unfortunately-0.003
Sadly-0.003
Token and
Token ID290
Feature activation-8.17e-03

Pos logit contributions
iful+0.040
ka+0.038
mer+0.038
ulus+0.037
works+0.037
Neg logit contributions
 not-0.015
and-0.015
Unfortunately-0.013
Sadly-0.013
arre-0.013
Token see
Token ID766
Feature activation-5.21e-03

Pos logit contributions
iful+0.011
ulus+0.011
rus+0.011
works+0.010
glers+0.010
Neg logit contributions
and-0.012
arre-0.010
 not-0.010
 knowing-0.010
 and-0.010
Token the
Token ID262
Feature activation-1.24e-02

Pos logit contributions
ka+0.012
rus+0.012
ked+0.012
iful+0.012
glers+0.012
Neg logit contributions
 and-0.004
 not-0.003
 about-0.003
 sure-0.003
 many-0.003
Token sun
Token ID4252
Feature activation-4.47e-02

Pos logit contributions
iful+0.018
ka+0.017
ked+0.017
rus+0.017
oured+0.017
Neg logit contributions
 not-0.018
 and-0.017
and-0.016
 never-0.016
 sure-0.016
Token together
Token ID1978
Feature activation-8.56e-03

Pos logit contributions
rus+0.088
ocol+0.088
ka+0.087
ked+0.085
iful+0.085
Neg logit contributions
 and-0.046
 not-0.041
 with-0.037
 about-0.035
 never-0.034
Token,
Token ID11
Feature activation-8.10e-03

Pos logit contributions
rus+0.016
iful+0.016
ka+0.016
ked+0.016
ulus+0.016
Neg logit contributions
 and-0.008
 not-0.008
and-0.007
 about-0.007
 never-0.007
Token alright
Token ID23036
Feature activation-7.34e-03

Pos logit contributions
iful+0.015
works+0.015
 rib+0.015
ulus+0.014
ked+0.014
Neg logit contributions
and-0.008
arre-0.006
Unfortunately-0.006
Sadly-0.006
 Sadly-0.006
Token?"
Token ID1701
Feature activation-2.03e-03

Pos logit contributions
rus+0.013
ka+0.013
iful+0.013
glers+0.013
ked+0.013
Neg logit contributions
 not-0.008
 and-0.008
and-0.007
 about-0.007
 sure-0.007
Token
Token ID198
Feature activation-7.91e-03

Pos logit contributions
rus+0.003
ked+0.003
oured+0.003
ka+0.003
glers+0.003
Neg logit contributions
 not-0.002
and-0.002
arre-0.002
 never-0.002
 sure-0.002
Token there
Token ID612
Feature activation+2.43e-02

Pos logit contributions
 not+0.002
and+0.002
 and+0.002
 sure+0.002
 never+0.001
Neg logit contributions
iful-0.008
ka-0.007
rus-0.007
ulus-0.007
works-0.007
Token was
Token ID373
Feature activation-1.89e-02

Pos logit contributions
 not+0.025
 and+0.025
and+0.023
 never+0.022
 about+0.022
Neg logit contributions
iful-0.047
ka-0.045
rus-0.045
ked-0.044
ulus-0.044
Token a
Token ID257
Feature activation-1.49e-02

Pos logit contributions
iful+0.042
rus+0.041
works+0.040
ka+0.040
ked+0.040
Neg logit contributions
 not-0.014
 and-0.014
and-0.012
 many-0.012
 sure-0.012
Token little
Token ID1310
Feature activation+1.04e-02

Pos logit contributions
iful+0.028
works+0.027
ked+0.027
ka+0.027
ocol+0.026
Neg logit contributions
 not-0.016
 and-0.016
 sure-0.014
 think-0.014
 never-0.014
Token bunny
Token ID44915
Feature activation+1.01e-02

Pos logit contributions
 not+0.013
 and+0.013
and+0.013
 sure+0.012
 never+0.012
Neg logit contributions
iful-0.016
ka-0.016
rus-0.015
works-0.015
ulus-0.015
Token named
Token ID3706
Feature activation-3.21e-02

Pos logit contributions
and+0.013
 not+0.012
arre+0.012
 sure+0.012
 think+0.011
Neg logit contributions
rus-0.015
ka-0.015
ulus-0.014
iful-0.014
glers-0.014
Token Fl
Token ID1610
Feature activation-1.34e-02

Pos logit contributions
iful+0.055
works+0.052
ka+0.052
ocol+0.050
glers+0.050
Neg logit contributions
 not-0.045
 and-0.042
and-0.038
 sure-0.038
 about-0.038
Tokenoppy
Token ID26696
Feature activation-3.53e-03

Pos logit contributions
iful+0.024
works+0.023
ka+0.023
glers+0.022
ocol+0.022
Neg logit contributions
 and-0.016
 not-0.016
and-0.015
aps-0.015
 with-0.014
Token.
Token ID13
Feature activation-4.12e-03

Pos logit contributions
iful+0.006
ka+0.006
rus+0.006
ulus+0.006
works+0.006
Neg logit contributions
 not-0.004
and-0.004
 and-0.003
 sure-0.003
 never-0.003
Token Fl
Token ID1610
Feature activation+2.69e-03

Pos logit contributions
iful+0.006
ka+0.006
rus+0.006
works+0.006
ulus+0.006
Neg logit contributions
 not-0.006
and-0.006
 and-0.005
 never-0.005
 about-0.005
Tokenoppy
Token ID26696
Feature activation+4.95e-03

Pos logit contributions
 not+0.004
and+0.004
 and+0.004
 sure+0.003
 never+0.003
Neg logit contributions
iful-0.004
works-0.004
ka-0.004
rus-0.004
glers-0.004
Token idea
Token ID2126
Feature activation-2.11e-02

Pos logit contributions
iful+0.014
ka+0.014
ocol+0.013
rus+0.013
 avail+0.013
Neg logit contributions
and-0.010
 not-0.010
 and-0.009
 sure-0.009
 never-0.009
Token how
Token ID703
Feature activation-1.06e-02

Pos logit contributions
iful+0.042
ka+0.041
rus+0.040
ulus+0.040
works+0.040
Neg logit contributions
 not-0.017
and-0.016
 and-0.015
 never-0.014
 sure-0.014
Token.
Token ID13
Feature activation-2.99e-03

Pos logit contributions
iful+0.023
ka+0.023
rus+0.022
ulus+0.022
works+0.022
Neg logit contributions
 not-0.006
and-0.006
 and-0.006
 sure-0.005
 never-0.005
Token
Token ID198
Feature activation-4.77e-04

Pos logit contributions
iful+0.006
ka+0.006
rus+0.006
ulus+0.006
ked+0.006
Neg logit contributions
 not-0.002
and-0.002
 and-0.001
 never-0.001
 sure-0.001
Token
Token ID198
Feature activation-7.27e-03

Pos logit contributions
glers+0.001
rus+0.001
ka+0.001
ulus+0.001
iful+0.001
Neg logit contributions
 not-0.000
 knowing-0.000
arre-0.000
 think-0.000
 mindful-0.000
TokenSo
Token ID2396
Feature activation-3.32e-02

Pos logit contributions
iful+0.008
ka+0.008
rus+0.008
ulus+0.008
works+0.008
Neg logit contributions
 not-0.012
and-0.012
 and-0.012
 sure-0.011
 think-0.011
Token Lucy
Token ID22162
Feature activation-6.51e-03

Pos logit contributions
ka+0.059
iful+0.059
works+0.057
rus+0.056
ulus+0.056
Neg logit contributions
and-0.034
arre-0.029
 not-0.029
 knowing-0.027
 sure-0.027
Token called
Token ID1444
Feature activation-1.65e-02

Pos logit contributions
iful+0.008
ka+0.008
rus+0.008
ked+0.008
oured+0.008
Neg logit contributions
and-0.010
 not-0.010
 knowing-0.009
aps-0.009
 about-0.009
Token out
Token ID503
Feature activation-1.08e-02

Pos logit contributions
iful+0.027
ka+0.026
rus+0.026
ked+0.026
works+0.026
Neg logit contributions
and-0.019
 not-0.017
arre-0.017
 and-0.016
Unfortunately-0.016
Token to
Token ID284
Feature activation-4.13e-04

Pos logit contributions
ked+0.020
iful+0.019
rus+0.019
ka+0.019
ulus+0.019
Neg logit contributions
and-0.009
 not-0.008
arre-0.008
 think-0.008
 never-0.008
Token the
Token ID262
Feature activation-4.54e-03

Pos logit contributions
iful+0.001
ka+0.001
rus+0.001
works+0.001
ulus+0.001
Neg logit contributions
 not-0.000
 and-0.000
and-0.000
 sure-0.000
 never-0.000