TOP ACTIVATIONS
MAX = 2.800

grandma 's house for safe keeping . When
" No , I want to paint blue and purple !"
you think our wish will come true ?" Ang ie asked
" Where are we going ?" Tom asked .
" Where are you going , Mia ?" Ben asked
" Where are we going ?" Lily asked .
" Can I help you paint ?" Tommy asked .
the jam was for their elderly grandma , who was coming
" Let 's go explore the ship ," said Bill
they did not know how to use the scissors well .
" What is going on here ?" Mom said
She could feel the masse use 's warm hands pressing
P aint the wall ! Go on ! Be brave !"
Good idea ! What should we glue ?" Tom asked .
leaves . Lily was happy to have her parent with her
. It was the biggest and sweet est mel on the
painter . He was a brave and kind painter . He
siblings , a twin brother and sister . They were intelligent
Lisa that if she wanted a new pair of scissors ,
happy that she found the little dirty pumpkin . <|endoftext|> Once

MIN ACTIVATIONS
MIN = -2.523

dog with a book like that near the pond . It
a folder with all his favorite pictures in it . He
toy box with all his favorite toys . One day ,
sign that said " Cl own Show Today ". They wanted
" Hello there ," said the creature . "
saw a magazine with his favorite cartoon character on the cover
outside , especially with his favorite rope . He would tug
himself " I wonder what that land is like ?".
From that day on , Sp ide y made sure to
little suitcase with all her favorite toys and clothes . When
say , " I hate that bush ." One
has a picture of your favorite toy on the cover .
had a toy bunny with one ear . She loved her
out a movie with her favorite princess . They went home
said , " Yes , that 's a thoughtful idea .
Do not go near the water ." The duck
smiled , excited to share his new normal bed time routine
From that day on , Sp ide y knew that even
. " Look , there 's a puppy outside !"
not know what a lab was , but she did not

INTERVAL 1.469 - 2.800
CONTAINS 0.147%

asleep . <|endoftext|> L ila and Ben are at the zoo
After a while , the cookies were ready and they sat
loud . " One , two , three , four ,
What do you want me to do ?"
welcome . Maybe we can meet again in the park .

INTERVAL 0.138 - 1.469
CONTAINS 37.758%

. She added some yellow and some orange , too .
ummy snack all by herself . <|endoftext|> Once upon a time
with her and the leaves . He looked out the window
and walked into the hole . It was dark and cold
running around all day and wanted to rest . As he

INTERVAL -1.193 - 0.138
CONTAINS 61.456%

he said . " I tried to warn you , but
late . It 's time to come inside and get dry
a happy dog who loved to go on walks with his
. Sometimes , sharing is better ." Ben and
girl were friends . <|endoftext|> Once upon a time , there

INTERVAL -2.523 - -1.193
CONTAINS 0.639%

and asked , " What 's wrong , Lily ?"
my . It 's the stars , the planets , and
He liked to explore and one day he went to a
enjoy singing , no matter what ." L ily
It wob bles and til ts . Sara and Ben lose
Token grandma
Token ID49890
Feature activation+3.32e-01

Pos logit contributions
works+0.014
 matter+0.013
 narrative+0.013
 meanings+0.012
der+0.012
Neg logit contributions
 see-0.012
 glad-0.011
 reunited-0.010
 back-0.010
 thrilled-0.009
Token's
Token ID338
Feature activation-4.01e-01

Pos logit contributions
 see+0.011
 glad+0.010
 reunited+0.009
 back+0.009
 thrilled+0.009
Neg logit contributions
works-0.014
 matter-0.013
 narrative-0.012
der-0.012
ports-0.012
Token house
Token ID2156
Feature activation+2.05e-03

Pos logit contributions
works+0.019
 matter+0.018
 narrative+0.018
 meanings+0.017
der+0.016
Neg logit contributions
 see-0.011
 glad-0.010
 reunited-0.010
 back-0.009
 thrilled-0.009
Token for
Token ID329
Feature activation+3.91e-03

Pos logit contributions
 see+0.000
 back+0.000
 glad+0.000
 reunited+0.000
 happy+0.000
Neg logit contributions
works-0.000
 narrative-0.000
 matter-0.000
ports-0.000
 meanings-0.000
Token safe
Token ID3338
Feature activation-3.28e-02

Pos logit contributions
 see+0.000
 glad+0.000
 reunited+0.000
 back+0.000
 happy+0.000
Neg logit contributions
works-0.000
 narrative-0.000
 matter-0.000
 meanings-0.000
der-0.000
Tokenkeeping
Token ID19934
Feature activation+2.75e+00

Pos logit contributions
works+0.001
 matter+0.001
 narrative+0.001
 meanings+0.001
der+0.001
Neg logit contributions
 see-0.001
 glad-0.001
 reunited-0.001
 back-0.001
 thrilled-0.001
Token.
Token ID13
Feature activation-2.39e-01

Pos logit contributions
 see+0.119
 glad+0.113
 reunited+0.105
 back+0.101
 thrilled+0.101
Neg logit contributions
works-0.082
 matter-0.073
der-0.071
 narrative-0.071
 meanings-0.067
Token
Token ID220
Feature activation+3.38e-01

Pos logit contributions
works+0.010
 matter+0.009
 meanings+0.009
 narrative+0.008
der+0.008
Neg logit contributions
 see-0.008
 reunited-0.007
 glad-0.007
 back-0.007
 thrilled-0.006
Token
Token ID198
Feature activation+3.51e-01

Pos logit contributions
 see+0.009
 reunited+0.009
 glad+0.008
 back+0.008
 satisfied+0.008
Neg logit contributions
works-0.015
ports-0.013
 narrative-0.013
 meanings-0.013
der-0.013
Token
Token ID198
Feature activation+6.02e-02

Pos logit contributions
 reunited+0.011
 back+0.010
 see+0.010
 glad+0.010
 pad+0.010
Neg logit contributions
works-0.013
oral-0.012
ves-0.012
ports-0.012
 matter-0.012
TokenWhen
Token ID2215
Feature activation-2.37e-01

Pos logit contributions
 see+0.002
 glad+0.002
 reunited+0.002
 back+0.002
 thrilled+0.002
Neg logit contributions
works-0.002
 matter-0.002
 narrative-0.002
 meanings-0.002
ports-0.002
Token"
Token ID1
Feature activation+5.11e-01

Pos logit contributions
works+0.006
 matter+0.006
 narrative+0.005
 meanings+0.005
ports+0.005
Neg logit contributions
 see-0.003
 glad-0.003
 reunited-0.003
 back-0.002
 thrilled-0.002
TokenNo
Token ID2949
Feature activation+1.29e+00

Pos logit contributions
 see+0.017
 happy+0.013
 back+0.012
 glad+0.012
ends+0.011
Neg logit contributions
works-0.022
der-0.022
ently-0.021
lights-0.021
 meanings-0.021
Token,
Token ID11
Feature activation-3.58e-02

Pos logit contributions
 see+0.017
 glad+0.012
 back+0.011
 reunited+0.010
 happy+0.008
Neg logit contributions
works-0.084
 narrative-0.080
 matter-0.076
ports-0.075
 meanings-0.075
Token I
Token ID314
Feature activation+6.30e-01

Pos logit contributions
works+0.002
 narrative+0.002
 matter+0.002
 meanings+0.002
oral+0.002
Neg logit contributions
 see-0.001
 glad-0.001
 back-0.001
 happy-0.001
 reunited-0.001
Token want
Token ID765
Feature activation+1.85e+00

Pos logit contributions
 see+0.021
 glad+0.019
 reunited+0.018
 back+0.018
 thrilled+0.016
Neg logit contributions
works-0.027
 matter-0.025
 narrative-0.025
 meanings-0.023
ports-0.023
Token to
Token ID284
Feature activation+2.73e+00

Pos logit contributions
 see+0.043
 back+0.035
 glad+0.035
 reunited+0.032
 happy+0.029
Neg logit contributions
works-0.100
 narrative-0.095
 matter-0.092
 meanings-0.089
ports-0.088
Token paint
Token ID7521
Feature activation+1.34e+00

Pos logit contributions
 see+0.095
 glad+0.095
 reunited+0.093
 back+0.086
 thrilled+0.084
Neg logit contributions
works-0.103
 matter-0.099
 narrative-0.091
 meanings-0.088
der-0.084
Token blue
Token ID4171
Feature activation+5.78e-01

Pos logit contributions
 see+0.042
 glad+0.041
 reunited+0.039
 thrilled+0.035
 back+0.035
Neg logit contributions
works-0.058
 matter-0.054
 narrative-0.052
 meanings-0.049
ports-0.049
Token and
Token ID290
Feature activation+1.54e+00

Pos logit contributions
 see+0.019
 glad+0.017
 reunited+0.016
 back+0.016
 thrilled+0.014
Neg logit contributions
works-0.025
 matter-0.024
 narrative-0.023
 meanings-0.022
der-0.022
Token purple
Token ID14032
Feature activation+8.97e-01

Pos logit contributions
 see+0.052
 glad+0.042
 back+0.041
 happy+0.040
 thrilled+0.035
Neg logit contributions
works-0.070
 narrative-0.067
 matter-0.062
der-0.061
oral-0.061
Token!"
Token ID2474
Feature activation-1.22e-01

Pos logit contributions
 see+0.028
 back+0.024
 glad+0.024
 reunited+0.023
 happy+0.021
Neg logit contributions
works-0.040
 narrative-0.038
 matter-0.037
 meanings-0.035
ports-0.035
Token you
Token ID345
Feature activation+2.49e-01

Pos logit contributions
works+0.014
 narrative+0.013
 matter+0.013
der+0.013
 meanings+0.012
Neg logit contributions
 see-0.009
 glad-0.008
 back-0.007
 reunited-0.007
 happy-0.007
Token think
Token ID892
Feature activation+1.22e+00

Pos logit contributions
 glad+0.006
 reunited+0.006
 see+0.006
 back+0.006
 thrilled+0.005
Neg logit contributions
 matter-0.012
works-0.012
 narrative-0.011
 meanings-0.010
der-0.010
Token our
Token ID674
Feature activation+7.77e-01

Pos logit contributions
 see+0.034
 glad+0.028
 back+0.028
 happy+0.025
 reunited+0.025
Neg logit contributions
works-0.059
 narrative-0.058
 matter-0.056
 meanings-0.055
ports-0.053
Token wish
Token ID4601
Feature activation+1.23e+00

Pos logit contributions
 see+0.029
 glad+0.026
 back+0.025
 reunited+0.024
 happy+0.024
Neg logit contributions
works-0.031
 narrative-0.029
 matter-0.028
der-0.027
ports-0.026
Token will
Token ID481
Feature activation+1.30e+00

Pos logit contributions
 see+0.042
 glad+0.040
 reunited+0.039
 thrilled+0.035
 back+0.035
Neg logit contributions
works-0.050
 matter-0.046
 narrative-0.044
 meanings-0.042
der-0.041
Token come
Token ID1282
Feature activation+2.53e+00

Pos logit contributions
 glad+0.035
 see+0.033
 reunited+0.032
 back+0.030
 thrilled+0.030
Neg logit contributions
works-0.059
 matter-0.057
 narrative-0.053
 meanings-0.050
oral-0.050
Token true
Token ID2081
Feature activation+1.53e+00

Pos logit contributions
 glad+0.084
 thrilled+0.076
 pad+0.075
 reunited+0.073
ovi+0.072
Neg logit contributions
works-0.092
 matter-0.088
ports-0.076
ist-0.074
 narrative-0.074
Token?"
Token ID1701
Feature activation-3.68e-01

Pos logit contributions
 see+0.046
 glad+0.041
 reunited+0.039
 back+0.039
 thrilled+0.035
Neg logit contributions
works-0.071
 matter-0.066
 narrative-0.065
 meanings-0.062
der-0.061
Token Ang
Token ID2895
Feature activation+2.74e-01

Pos logit contributions
works+0.015
 matter+0.015
 narrative+0.015
 meanings+0.014
ports+0.013
Neg logit contributions
 see-0.011
 glad-0.010
 reunited-0.010
 back-0.010
 pad-0.009
Tokenie
Token ID494
Feature activation-2.16e-01

Pos logit contributions
 see+0.013
 glad+0.012
 reunited+0.011
 back+0.011
 thrilled+0.011
Neg logit contributions
works-0.008
 matter-0.007
 narrative-0.007
der-0.007
ports-0.007
Token asked
Token ID1965
Feature activation+6.97e-01

Pos logit contributions
 matter+0.007
works+0.007
 narrative+0.007
der+0.007
 meanings+0.007
Neg logit contributions
 see-0.008
 glad-0.007
 reunited-0.007
 back-0.007
ends-0.007
Token
Token ID198
Feature activation+5.65e-01

Pos logit contributions
 reunited+0.019
 see+0.019
 glad+0.018
 back+0.018
 pad+0.017
Neg logit contributions
works-0.035
 narrative-0.032
oral-0.032
 matter-0.031
ves-0.031
Token"
Token ID1
Feature activation+4.71e-01

Pos logit contributions
 see+0.019
 glad+0.017
 reunited+0.016
 back+0.016
 happy+0.015
Neg logit contributions
works-0.025
 matter-0.023
 narrative-0.023
 meanings-0.021
der-0.021
TokenWhere
Token ID8496
Feature activation-5.05e-01

Pos logit contributions
 see+0.017
 back+0.013
 glad+0.013
 happy+0.013
 met+0.012
Neg logit contributions
works-0.019
 meanings-0.018
der-0.018
ently-0.017
 matter-0.017
Token are
Token ID389
Feature activation-3.87e-01

Pos logit contributions
works+0.020
 narrative+0.018
 matter+0.018
der+0.017
 meanings+0.017
Neg logit contributions
 see-0.020
 glad-0.017
 back-0.017
 reunited-0.016
 happy-0.016
Token we
Token ID356
Feature activation+1.09e+00

Pos logit contributions
works+0.018
 narrative+0.018
 matter+0.017
der+0.017
 meanings+0.017
Neg logit contributions
 see-0.011
 back-0.010
 glad-0.010
 reunited-0.009
 here-0.009
Token going
Token ID1016
Feature activation+2.52e+00

Pos logit contributions
 glad+0.033
 reunited+0.032
 relieved+0.032
 thrilled+0.032
 pad+0.032
Neg logit contributions
works-0.042
 matter-0.042
 narrative-0.038
ports-0.037
ators-0.035
Token?"
Token ID1701
Feature activation-6.96e-02

Pos logit contributions
 glad+0.107
 thrilled+0.102
 relieved+0.100
 satisfied+0.100
 pad+0.098
Neg logit contributions
works-0.076
ators-0.066
 matter-0.064
ports-0.061
 narrative-0.056
Token Tom
Token ID4186
Feature activation+2.52e-01

Pos logit contributions
works+0.004
 narrative+0.003
 matter+0.003
 meanings+0.003
der+0.003
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.001
 back-0.001
 happy-0.001
Token asked
Token ID1965
Feature activation+4.03e-01

Pos logit contributions
 see+0.009
 glad+0.009
 reunited+0.008
 back+0.008
 thrilled+0.007
Neg logit contributions
works-0.009
 matter-0.009
 narrative-0.009
 meanings-0.008
ports-0.008
Token.
Token ID13
Feature activation+7.52e-01

Pos logit contributions
 see+0.012
 back+0.012
 home+0.010
 happy+0.010
 reunited+0.009
Neg logit contributions
works-0.021
 narrative-0.019
 meanings-0.019
 matter-0.019
der-0.018
Token
Token ID198
Feature activation+5.94e-01

Pos logit contributions
 see+0.014
 glad+0.011
 back+0.010
 reunited+0.010
 happy+0.008
Neg logit contributions
works-0.044
 matter-0.042
 narrative-0.042
 meanings-0.040
ports-0.039
Token
Token ID198
Feature activation+6.24e-02

Pos logit contributions
 reunited+0.005
 see+0.005
 glad+0.005
 back+0.004
 thrilled+0.004
Neg logit contributions
works-0.009
 narrative-0.008
oral-0.008
 matter-0.008
ves-0.008
Token"
Token ID1
Feature activation+3.69e-01

Pos logit contributions
 see+0.002
 reunited+0.002
 glad+0.002
 back+0.002
 thrilled+0.002
Neg logit contributions
works-0.002
 narrative-0.002
 matter-0.002
ports-0.002
oral-0.002
TokenWhere
Token ID8496
Feature activation-8.44e-01

Pos logit contributions
 see+0.015
 glad+0.013
 back+0.013
 reunited+0.013
 happy+0.012
Neg logit contributions
works-0.014
 matter-0.012
 narrative-0.012
 meanings-0.012
der-0.012
Token are
Token ID389
Feature activation-8.41e-01

Pos logit contributions
works+0.034
 narrative+0.031
 matter+0.030
der+0.029
 meanings+0.029
Neg logit contributions
 see-0.032
 glad-0.028
 back-0.027
 reunited-0.026
 happy-0.026
Token you
Token ID345
Feature activation+9.22e-01

Pos logit contributions
works+0.041
 narrative+0.039
 matter+0.038
der+0.037
 meanings+0.037
Neg logit contributions
 see-0.024
 back-0.021
 glad-0.020
 reunited-0.019
 here-0.018
Token going
Token ID1016
Feature activation+2.43e+00

Pos logit contributions
 reunited+0.028
 see+0.028
 relieved+0.027
 pad+0.027
 satisfied+0.027
Neg logit contributions
 matter-0.035
works-0.032
ports-0.029
 narrative-0.029
ators-0.028
Token,
Token ID11
Feature activation-5.72e-01

Pos logit contributions
 glad+0.099
 relieved+0.094
 satisfied+0.094
 thrilled+0.094
 reunited+0.089
Neg logit contributions
works-0.078
ators-0.067
 matter-0.065
ports-0.064
ist-0.056
Token Mia
Token ID32189
Feature activation+8.38e-01

Pos logit contributions
works+0.027
 matter+0.025
 narrative+0.024
der+0.023
ports+0.023
Neg logit contributions
 reunited-0.016
 see-0.016
 glad-0.015
 back-0.014
 pad-0.013
Token?"
Token ID1701
Feature activation-2.78e-01

Pos logit contributions
 glad+0.037
 thrilled+0.036
 reunited+0.035
 see+0.035
ends+0.035
Neg logit contributions
 matter-0.021
works-0.021
 narrative-0.017
ist-0.017
 costs-0.016
Token Ben
Token ID3932
Feature activation-5.98e-03

Pos logit contributions
works+0.014
 matter+0.013
 narrative+0.013
 meanings+0.013
der+0.013
Neg logit contributions
 see-0.007
 glad-0.006
 reunited-0.006
 back-0.005
 thrilled-0.005
Token asked
Token ID1965
Feature activation+1.89e-01

Pos logit contributions
works+0.000
 matter+0.000
 narrative+0.000
 meanings+0.000
der+0.000
Neg logit contributions
 see-0.000
 glad-0.000
 reunited-0.000
 back-0.000
 thrilled-0.000
Token
Token ID198
Feature activation+2.17e-02

Pos logit contributions
 reunited+0.012
 pad+0.011
 back+0.010
 glad+0.010
 thrilled+0.010
Neg logit contributions
works-0.014
oral-0.013
ves-0.013
ports-0.013
 narrative-0.013
Token"
Token ID1
Feature activation-6.03e-01

Pos logit contributions
 see+0.001
 reunited+0.001
 glad+0.001
 back+0.001
 thrilled+0.001
Neg logit contributions
works-0.001
 narrative-0.001
ports-0.001
oral-0.001
 matter-0.001
TokenWhere
Token ID8496
Feature activation-1.16e+00

Pos logit contributions
works+0.025
 meanings+0.024
ool+0.023
 matter+0.023
nexpected+0.023
Neg logit contributions
 see-0.020
 glad-0.017
 back-0.017
 happy-0.016
ends-0.015
Token are
Token ID389
Feature activation-7.00e-01

Pos logit contributions
works+0.043
 matter+0.038
 narrative+0.038
 meanings+0.036
der+0.036
Neg logit contributions
 see-0.047
 glad-0.043
 reunited-0.041
 back-0.041
 happy-0.038
Token we
Token ID356
Feature activation+6.09e-01

Pos logit contributions
works+0.034
 narrative+0.034
 matter+0.032
 meanings+0.032
der+0.032
Neg logit contributions
 see-0.020
 back-0.017
 glad-0.016
 here-0.015
 home-0.015
Token going
Token ID1016
Feature activation+2.39e+00

Pos logit contributions
 see+0.016
 reunited+0.016
 glad+0.016
 pad+0.015
 thrilled+0.015
Neg logit contributions
works-0.026
 matter-0.025
ports-0.023
 narrative-0.023
ators-0.022
Token?"
Token ID1701
Feature activation+1.53e-01

Pos logit contributions
 glad+0.090
 thrilled+0.086
 pad+0.085
 relieved+0.082
 satisfied+0.081
Neg logit contributions
works-0.082
 matter-0.074
ports-0.070
ators-0.070
 narrative-0.068
Token Lily
Token ID20037
Feature activation+3.06e-01

Pos logit contributions
 see+0.005
 glad+0.004
 reunited+0.004
 back+0.004
 thrilled+0.004
Neg logit contributions
works-0.007
 matter-0.006
 narrative-0.006
ports-0.006
 meanings-0.006
Token asked
Token ID1965
Feature activation+6.10e-01

Pos logit contributions
 see+0.011
 glad+0.010
 reunited+0.010
 back+0.009
 welcome+0.008
Neg logit contributions
works-0.012
 matter-0.011
 narrative-0.011
der-0.011
 meanings-0.010
Token.
Token ID13
Feature activation+8.42e-01

Pos logit contributions
 see+0.021
 back+0.019
 glad+0.019
 reunited+0.018
 happy+0.017
Neg logit contributions
works-0.026
 matter-0.025
 narrative-0.024
 meanings-0.024
nexpected-0.022
Token
Token ID198
Feature activation+6.93e-01

Pos logit contributions
 reunited+0.021
 see+0.020
 glad+0.019
 back+0.018
 pad+0.017
Neg logit contributions
works-0.040
 narrative-0.038
der-0.037
ports-0.036
 matter-0.036
Token
Token ID198
Feature activation-6.84e-01

Pos logit contributions
works+0.010
 matter+0.009
 narrative+0.009
ports+0.009
oral+0.009
Neg logit contributions
 see-0.005
 reunited-0.005
 glad-0.005
 back-0.005
 pad-0.004
Token"
Token ID1
Feature activation-1.08e-01

Pos logit contributions
works+0.030
 matter+0.028
 narrative+0.028
 meanings+0.026
ports+0.026
Neg logit contributions
 see-0.022
 glad-0.020
 reunited-0.019
 back-0.019
 thrilled-0.017
TokenCan
Token ID6090
Feature activation-6.75e-02

Pos logit contributions
works+0.004
 narrative+0.004
 matter+0.004
 meanings+0.003
der+0.003
Neg logit contributions
 see-0.004
 glad-0.004
 reunited-0.004
 back-0.004
 happy-0.003
Token I
Token ID314
Feature activation+1.77e+00

Pos logit contributions
works+0.003
 narrative+0.003
 matter+0.003
 meanings+0.003
der+0.003
Neg logit contributions
 see-0.002
 glad-0.002
 back-0.002
 reunited-0.002
 happy-0.001
Token help
Token ID1037
Feature activation+8.76e-01

Pos logit contributions
 see+0.061
 glad+0.057
 reunited+0.055
 back+0.054
 thrilled+0.050
Neg logit contributions
works-0.071
 matter-0.066
 narrative-0.065
 meanings-0.061
ports-0.060
Token you
Token ID345
Feature activation+2.39e+00

Pos logit contributions
 glad+0.027
 see+0.027
 reunited+0.025
 pad+0.025
 thrilled+0.024
Neg logit contributions
works-0.037
 matter-0.036
ports-0.031
 narrative-0.031
 meanings-0.031
Token paint
Token ID7521
Feature activation+1.33e+00

Pos logit contributions
 glad+0.079
 see+0.078
 reunited+0.075
 back+0.070
 thrilled+0.069
Neg logit contributions
works-0.094
 matter-0.091
 narrative-0.084
der-0.079
 meanings-0.078
Token?"
Token ID1701
Feature activation-1.46e+00

Pos logit contributions
 glad+0.042
 see+0.041
 reunited+0.040
 thrilled+0.036
 pad+0.036
Neg logit contributions
works-0.057
 matter-0.054
 narrative-0.050
ports-0.048
 meanings-0.047
Token Tommy
Token ID19919
Feature activation-5.93e-01

Pos logit contributions
works+0.063
 matter+0.061
 narrative+0.059
 meanings+0.056
ports+0.055
Neg logit contributions
 see-0.045
 glad-0.041
 reunited-0.041
 back-0.039
 pad-0.036
Token asked
Token ID1965
Feature activation-3.13e-01

Pos logit contributions
works+0.024
 matter+0.023
 narrative+0.022
 meanings+0.021
ports+0.021
Neg logit contributions
 see-0.020
 glad-0.018
 reunited-0.017
 back-0.017
 thrilled-0.016
Token.
Token ID13
Feature activation-2.82e-01

Pos logit contributions
works+0.016
 narrative+0.015
 matter+0.014
 meanings+0.014
ports+0.013
Neg logit contributions
 see-0.009
 back-0.008
 glad-0.008
 reunited-0.008
 happy-0.007
Token the
Token ID262
Feature activation+8.42e-01

Pos logit contributions
 see+0.003
 glad+0.003
 back+0.003
 reunited+0.002
 happy+0.002
Neg logit contributions
works-0.005
 matter-0.005
 narrative-0.005
 meanings-0.004
ports-0.004
Token jam
Token ID16853
Feature activation+2.05e-01

Pos logit contributions
 see+0.026
 glad+0.023
 back+0.022
 reunited+0.020
 happy+0.020
Neg logit contributions
works-0.039
 matter-0.035
 narrative-0.035
der-0.034
 meanings-0.033
Token was
Token ID373
Feature activation-2.47e-02

Pos logit contributions
 see+0.008
 back+0.007
 glad+0.007
 reunited+0.007
 happy+0.007
Neg logit contributions
works-0.007
 narrative-0.007
 matter-0.007
ports-0.007
 meanings-0.006
Token for
Token ID329
Feature activation+2.81e-01

Pos logit contributions
works+0.001
 narrative+0.001
ports+0.001
 matter+0.001
der+0.001
Neg logit contributions
 see-0.001
 glad-0.001
 back-0.001
 happy-0.001
 reunited-0.001
Token their
Token ID511
Feature activation+8.59e-01

Pos logit contributions
 see+0.008
 glad+0.007
 reunited+0.007
 back+0.007
 happy+0.006
Neg logit contributions
works-0.014
 narrative-0.013
 matter-0.013
der-0.012
 meanings-0.012
Token elderly
Token ID13830
Feature activation+2.38e+00

Pos logit contributions
 see+0.030
 glad+0.028
 reunited+0.027
 back+0.026
 thrilled+0.024
Neg logit contributions
works-0.034
 matter-0.033
 narrative-0.032
 meanings-0.031
ports-0.029
Token grandma
Token ID49890
Feature activation+9.22e-01

Pos logit contributions
 see+0.081
 glad+0.075
 reunited+0.072
 back+0.068
 thrilled+0.065
Neg logit contributions
works-0.096
 matter-0.091
 narrative-0.091
 meanings-0.086
der-0.084
Token,
Token ID11
Feature activation+3.63e-01

Pos logit contributions
 see+0.026
 glad+0.024
 reunited+0.022
 thrilled+0.021
 back+0.020
Neg logit contributions
works-0.042
 matter-0.040
 narrative-0.037
 meanings-0.036
ports-0.036
Token who
Token ID508
Feature activation-2.94e-01

Pos logit contributions
 see+0.010
 glad+0.008
 reunited+0.008
 back+0.008
 happy+0.007
Neg logit contributions
works-0.018
 matter-0.017
 narrative-0.017
 meanings-0.016
der-0.016
Token was
Token ID373
Feature activation-9.49e-02

Pos logit contributions
works+0.013
 narrative+0.012
der+0.012
ports+0.012
 matter+0.011
Neg logit contributions
 see-0.010
 glad-0.008
 happy-0.008
 found-0.008
 reunited-0.008
Token coming
Token ID2406
Feature activation+1.11e+00

Pos logit contributions
works+0.005
 matter+0.004
 narrative+0.004
 meanings+0.004
der+0.004
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.002
 back-0.002
 thrilled-0.002
Token
Token ID198
Feature activation-9.97e-01

Pos logit contributions
works+0.015
 matter+0.014
 narrative+0.014
ports+0.014
 meanings+0.014
Neg logit contributions
 see-0.006
 glad-0.005
 reunited-0.005
 back-0.005
 thrilled-0.004
Token"
Token ID1
Feature activation-1.94e-01

Pos logit contributions
works+0.031
 narrative+0.026
der+0.026
 matter+0.026
 meanings+0.024
Neg logit contributions
 see-0.048
 glad-0.041
 back-0.040
 happy-0.039
ends-0.038
TokenLet
Token ID5756
Feature activation+1.26e-01

Pos logit contributions
works+0.008
der+0.007
 narrative+0.007
 matter+0.007
 meanings+0.007
Neg logit contributions
 see-0.007
 glad-0.006
 back-0.006
 reunited-0.006
 happy-0.005
Token's
Token ID338
Feature activation-2.88e-01

Pos logit contributions
 glad+0.004
 see+0.004
 reunited+0.004
ends+0.004
 satisfied+0.004
Neg logit contributions
 matter-0.005
works-0.005
 narrative-0.005
ports-0.004
nexpected-0.004
Token go
Token ID467
Feature activation+1.71e+00

Pos logit contributions
works+0.012
 matter+0.012
 narrative+0.011
 meanings+0.010
ports+0.010
Neg logit contributions
 glad-0.009
 reunited-0.008
 see-0.008
 thrilled-0.008
 pad-0.008
Token explore
Token ID7301
Feature activation+2.37e+00

Pos logit contributions
 glad+0.053
 thrilled+0.049
 reunited+0.047
 pad+0.046
 instant+0.046
Neg logit contributions
works-0.066
ports-0.059
 matter-0.057
ators-0.053
fighters-0.052
Token the
Token ID262
Feature activation-1.21e-02

Pos logit contributions
 see+0.064
 glad+0.056
 reunited+0.053
 back+0.053
 happy+0.047
Neg logit contributions
works-0.117
 narrative-0.109
 matter-0.109
 meanings-0.103
der-0.102
Token ship
Token ID4074
Feature activation+9.14e-02

Pos logit contributions
works+0.001
 narrative+0.000
 matter+0.000
ports+0.000
 meanings+0.000
Neg logit contributions
 see-0.000
 glad-0.000
 reunited-0.000
 back-0.000
 thrilled-0.000
Token,"
Token ID553
Feature activation-1.15e+00

Pos logit contributions
 see+0.003
 glad+0.003
 reunited+0.003
 back+0.002
 thrilled+0.002
Neg logit contributions
works-0.004
 matter-0.004
 narrative-0.003
der-0.003
ports-0.003
Token said
Token ID531
Feature activation-1.47e-01

Pos logit contributions
works+0.062
der+0.059
 narrative+0.057
oral+0.054
ators+0.054
Neg logit contributions
 see-0.031
 happy-0.022
 glad-0.022
 back-0.021
 reunited-0.018
Token Bill
Token ID3941
Feature activation-3.81e-01

Pos logit contributions
works+0.009
 narrative+0.009
der+0.009
 matter+0.009
oral+0.008
Neg logit contributions
 see-0.002
 back-0.002
 reunited-0.002
 glad-0.001
 home-0.001
Token they
Token ID484
Feature activation+1.18e+00

Pos logit contributions
 see+0.018
 glad+0.016
 reunited+0.016
 back+0.015
 thrilled+0.014
Neg logit contributions
works-0.025
 matter-0.023
 narrative-0.023
 meanings-0.021
der-0.021
Token did
Token ID750
Feature activation+3.18e-01

Pos logit contributions
 see+0.044
 glad+0.037
 back+0.036
 reunited+0.035
 happy+0.034
Neg logit contributions
works-0.050
 narrative-0.045
 matter-0.045
ports-0.043
 meanings-0.042
Token not
Token ID407
Feature activation+1.78e+00

Pos logit contributions
 see+0.007
 back+0.006
 glad+0.006
 happy+0.005
ends+0.005
Neg logit contributions
works-0.019
 narrative-0.018
 meanings-0.016
 matter-0.016
ports-0.016
Token know
Token ID760
Feature activation+5.18e-03

Pos logit contributions
 see+0.049
 glad+0.043
 reunited+0.041
 back+0.041
 thrilled+0.036
Neg logit contributions
works-0.087
 matter-0.081
 narrative-0.081
 meanings-0.076
der-0.076
Token how
Token ID703
Feature activation+4.88e-01

Pos logit contributions
 see+0.000
 back+0.000
 glad+0.000
 outside+0.000
 playing+0.000
Neg logit contributions
 narrative-0.000
works-0.000
 matter-0.000
oral-0.000
 meanings-0.000
Token to
Token ID284
Feature activation+2.35e+00

Pos logit contributions
 see+0.012
 glad+0.011
 back+0.010
 happy+0.010
 home+0.008
Neg logit contributions
works-0.027
 narrative-0.026
 matter-0.025
ports-0.024
 meanings-0.023
Token use
Token ID779
Feature activation+7.54e-01

Pos logit contributions
 see+0.083
 glad+0.070
 back+0.069
 reunited+0.068
 happy+0.064
Neg logit contributions
works-0.101
 narrative-0.094
 matter-0.090
 meanings-0.088
der-0.088
Token the
Token ID262
Feature activation+1.19e+00

Pos logit contributions
 see+0.021
 back+0.018
 glad+0.016
 home+0.016
 happy+0.015
Neg logit contributions
works-0.039
 narrative-0.037
 matter-0.035
oral-0.034
Deal-0.034
Token scissors
Token ID41786
Feature activation+4.33e-01

Pos logit contributions
 see+0.046
 back+0.041
 glad+0.039
 pad+0.038
 happy+0.037
Neg logit contributions
works-0.046
 narrative-0.042
der-0.040
 matter-0.039
nexpected-0.039
Token well
Token ID880
Feature activation+4.57e-01

Pos logit contributions
 see+0.015
 glad+0.014
 back+0.013
 reunited+0.013
 happy+0.012
Neg logit contributions
works-0.018
 narrative-0.016
 matter-0.016
 meanings-0.015
ports-0.015
Token.
Token ID13
Feature activation+4.80e-01

Pos logit contributions
 see+0.016
 back+0.014
 glad+0.014
 reunited+0.013
 home+0.013
Neg logit contributions
works-0.019
 narrative-0.018
 matter-0.017
 meanings-0.016
ports-0.016
Token
Token ID198
Feature activation+7.23e-01

Pos logit contributions
 see+0.021
 glad+0.018
 reunited+0.017
 back+0.017
 happy+0.015
Neg logit contributions
works-0.043
 matter-0.040
 narrative-0.040
 meanings-0.038
der-0.037
Token
Token ID198
Feature activation+4.57e-02

Pos logit contributions
 reunited+0.023
 pad+0.023
 see+0.020
 thrilled+0.020
 glad+0.020
Neg logit contributions
works-0.028
oral-0.026
ves-0.025
ports-0.025
 narrative-0.025
Token"
Token ID1
Feature activation+4.85e-01

Pos logit contributions
 see+0.002
 reunited+0.001
 glad+0.001
 back+0.001
 pad+0.001
Neg logit contributions
works-0.002
 narrative-0.002
ports-0.002
 matter-0.002
oral-0.002
TokenWhat
Token ID2061
Feature activation-2.72e-01

Pos logit contributions
 see+0.014
 back+0.011
 glad+0.011
 here+0.010
 happy+0.010
Neg logit contributions
works-0.022
 meanings-0.022
der-0.021
 matter-0.020
lights-0.020
Token is
Token ID318
Feature activation-1.97e-01

Pos logit contributions
works+0.012
 narrative+0.010
 matter+0.010
der+0.010
 meanings+0.010
Neg logit contributions
 see-0.010
 glad-0.009
 back-0.008
 happy-0.008
 reunited-0.008
Token going
Token ID1016
Feature activation+2.33e+00

Pos logit contributions
works+0.009
 narrative+0.008
 matter+0.008
nexpected+0.008
ports+0.008
Neg logit contributions
 see-0.006
 reunited-0.005
 glad-0.005
 back-0.005
 thrilled-0.005
Token on
Token ID319
Feature activation+1.72e+00

Pos logit contributions
 glad+0.095
 thrilled+0.091
 pad+0.091
ends+0.085
 relieved+0.085
Neg logit contributions
 matter-0.069
works-0.068
 narrative-0.058
ports-0.053
 cost-0.053
Token here
Token ID994
Feature activation+1.46e+00

Pos logit contributions
 glad+0.042
 see+0.041
 pad+0.040
 reunited+0.038
ends+0.037
Neg logit contributions
works-0.083
 matter-0.075
 narrative-0.075
ports-0.074
nexpected-0.074
Token?"
Token ID1701
Feature activation-8.16e-02

Pos logit contributions
 glad+0.056
 see+0.052
 reunited+0.052
 pad+0.051
 thrilled+0.050
Neg logit contributions
works-0.051
 matter-0.046
 narrative-0.042
ports-0.041
nexpected-0.039
Token Mom
Token ID11254
Feature activation+7.90e-01

Pos logit contributions
works+0.004
 matter+0.004
 narrative+0.004
 meanings+0.003
der+0.003
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.002
 back-0.002
 thrilled-0.002
Token said
Token ID531
Feature activation-2.37e-01

Pos logit contributions
 see+0.027
 glad+0.024
 reunited+0.023
 back+0.023
 thrilled+0.021
Neg logit contributions
works-0.033
 matter-0.031
 narrative-0.030
 meanings-0.028
ports-0.028
Token
Token ID220
Feature activation+2.86e-02

Pos logit contributions
works+0.026
 matter+0.025
 narrative+0.024
 meanings+0.023
der+0.023
Neg logit contributions
 see-0.016
 glad-0.014
 reunited-0.013
 back-0.013
 happy-0.012
Token She
Token ID1375
Feature activation-9.61e-02

Pos logit contributions
 see+0.001
 reunited+0.001
 glad+0.001
 back+0.001
 thrilled+0.001
Neg logit contributions
works-0.001
der-0.001
 narrative-0.001
ports-0.001
 meanings-0.001
Token could
Token ID714
Feature activation-1.45e-01

Pos logit contributions
works+0.004
 narrative+0.004
 matter+0.004
ports+0.004
 meanings+0.004
Neg logit contributions
 see-0.003
 glad-0.003
 back-0.003
 reunited-0.003
 happy-0.003
Token feel
Token ID1254
Feature activation+3.06e-01

Pos logit contributions
 matter+0.006
works+0.006
 narrative+0.005
der+0.005
 meanings+0.005
Neg logit contributions
 glad-0.004
 reunited-0.004
 see-0.004
 pad-0.004
 back-0.004
Token the
Token ID262
Feature activation+3.13e-01

Pos logit contributions
 see+0.009
 reunited+0.008
 glad+0.008
 pad+0.008
 back+0.008
Neg logit contributions
 matter-0.012
works-0.012
der-0.011
 narrative-0.011
 meanings-0.011
Token masse
Token ID41869
Feature activation+2.32e+00

Pos logit contributions
 see+0.010
 back+0.009
 glad+0.009
 happy+0.008
 reunited+0.008
Neg logit contributions
works-0.015
 narrative-0.013
ports-0.013
 matter-0.013
 meanings-0.013
Tokenuse
Token ID1904
Feature activation-2.56e-01

Pos logit contributions
 see+0.087
 glad+0.079
 reunited+0.076
 back+0.075
 happy+0.070
Neg logit contributions
works-0.090
 matter-0.082
 narrative-0.082
 meanings-0.076
der-0.075
Token's
Token ID338
Feature activation-7.89e-01

Pos logit contributions
works+0.014
 narrative+0.014
 matter+0.012
der+0.012
oral+0.012
Neg logit contributions
 see-0.006
 back-0.005
 glad-0.005
 reunited-0.005
 happy-0.004
Token warm
Token ID5814
Feature activation-3.08e-01

Pos logit contributions
works+0.034
 matter+0.031
 narrative+0.031
 meanings+0.029
der+0.029
Neg logit contributions
 see-0.027
 glad-0.024
 reunited-0.023
 back-0.023
 happy-0.021
Token hands
Token ID2832
Feature activation-1.82e-01

Pos logit contributions
works+0.013
 narrative+0.012
 matter+0.011
ports+0.010
 meanings+0.010
Neg logit contributions
 see-0.011
 back-0.010
 glad-0.010
 reunited-0.010
 home-0.010
Token pressing
Token ID12273
Feature activation-5.86e-01

Pos logit contributions
works+0.009
 narrative+0.008
 matter+0.008
 meanings+0.008
ports+0.008
Neg logit contributions
 see-0.005
 back-0.005
 glad-0.005
 reunited-0.004
 happy-0.004
TokenP
Token ID47
Feature activation-2.97e-02

Pos logit contributions
 see+0.006
 glad+0.006
 back+0.005
 thrilled+0.005
 happy+0.005
Neg logit contributions
works-0.007
 narrative-0.006
 matter-0.006
der-0.006
 meanings-0.006
Tokenaint
Token ID2913
Feature activation+1.68e-05

Pos logit contributions
works+0.001
 matter+0.001
 narrative+0.001
 meanings+0.001
der+0.001
Neg logit contributions
 see-0.001
 glad-0.001
 reunited-0.001
 back-0.001
 happy-0.001
Token the
Token ID262
Feature activation+3.65e-01

Pos logit contributions
 see+0.000
 glad+0.000
 back+0.000
 reunited+0.000
 happy+0.000
Neg logit contributions
works-0.000
 narrative-0.000
 matter-0.000
 meanings-0.000
der-0.000
Token wall
Token ID3355
Feature activation+7.24e-02

Pos logit contributions
 see+0.013
 glad+0.011
 back+0.011
 reunited+0.011
 happy+0.010
Neg logit contributions
works-0.015
 narrative-0.014
 matter-0.014
der-0.013
 meanings-0.013
Token!
Token ID0
Feature activation+2.98e-01

Pos logit contributions
 see+0.002
 glad+0.002
 reunited+0.002
 back+0.002
 happy+0.002
Neg logit contributions
works-0.003
 matter-0.003
 narrative-0.003
 meanings-0.003
der-0.003
Token Go
Token ID1514
Feature activation+2.31e+00

Pos logit contributions
 see+0.009
 glad+0.008
 reunited+0.007
 back+0.007
 happy+0.006
Neg logit contributions
works-0.014
 matter-0.013
 narrative-0.013
 meanings-0.012
der-0.012
Token on
Token ID319
Feature activation+1.01e+00

Pos logit contributions
 glad+0.058
 see+0.055
 reunited+0.053
 thrilled+0.050
 relieved+0.046
Neg logit contributions
works-0.110
 matter-0.103
ports-0.098
 narrative-0.095
ist-0.094
Token!
Token ID0
Feature activation+4.02e-02

Pos logit contributions
 see+0.026
 glad+0.021
 back+0.021
 reunited+0.020
 home+0.019
Neg logit contributions
works-0.053
 narrative-0.050
 matter-0.049
 meanings-0.046
der-0.046
Token Be
Token ID1355
Feature activation+9.82e-02

Pos logit contributions
 see+0.001
 glad+0.001
 back+0.001
 reunited+0.001
 thrilled+0.001
Neg logit contributions
works-0.002
 matter-0.002
 narrative-0.002
 meanings-0.002
der-0.002
Token brave
Token ID14802
Feature activation+7.68e-01

Pos logit contributions
 see+0.003
 reunited+0.002
 glad+0.002
 back+0.002
ovi+0.002
Neg logit contributions
works-0.005
 matter-0.004
 narrative-0.004
 meanings-0.004
ports-0.004
Token!"
Token ID2474
Feature activation-4.28e-01

Pos logit contributions
 see+0.023
 glad+0.022
 reunited+0.021
 back+0.019
 thrilled+0.018
Neg logit contributions
works-0.034
 matter-0.032
der-0.030
 meanings-0.030
 narrative-0.030
TokenGood
Token ID10248
Feature activation+8.58e-01

Pos logit contributions
 see+0.029
 happy+0.024
 glad+0.023
ends+0.023
 back+0.022
Neg logit contributions
works-0.023
 narrative-0.021
der-0.021
ently-0.020
ool-0.020
Token idea
Token ID2126
Feature activation+5.83e-01

Pos logit contributions
 see+0.034
 glad+0.033
 reunited+0.031
 back+0.030
 thrilled+0.029
Neg logit contributions
works-0.031
 matter-0.029
 narrative-0.028
 meanings-0.026
der-0.025
Token!
Token ID0
Feature activation+2.50e-01

Pos logit contributions
 see+0.018
 glad+0.017
 reunited+0.016
 back+0.015
 thrilled+0.014
Neg logit contributions
works-0.026
 matter-0.024
 narrative-0.023
der-0.022
 meanings-0.022
Token What
Token ID1867
Feature activation+3.56e-01

Pos logit contributions
 see+0.009
 glad+0.008
 back+0.008
 reunited+0.008
 thrilled+0.007
Neg logit contributions
works-0.010
 narrative-0.009
 matter-0.009
 meanings-0.009
der-0.009
Token should
Token ID815
Feature activation+1.23e+00

Pos logit contributions
 see+0.010
 happy+0.008
 here+0.007
 glad+0.007
 playing+0.007
Neg logit contributions
works-0.018
 narrative-0.017
lights-0.017
der-0.017
nexpected-0.016
Token we
Token ID356
Feature activation+2.30e+00

Pos logit contributions
 see+0.040
 glad+0.032
 back+0.032
 reunited+0.030
 happy+0.029
Neg logit contributions
works-0.055
 narrative-0.053
 matter-0.051
 meanings-0.049
der-0.049
Token glue
Token ID22749
Feature activation+8.24e-02

Pos logit contributions
 glad+0.081
 reunited+0.079
 see+0.077
 back+0.074
 thrilled+0.072
Neg logit contributions
works-0.088
 matter-0.086
 narrative-0.079
 meanings-0.074
ports-0.070
Token?"
Token ID1701
Feature activation+3.95e-01

Pos logit contributions
 glad+0.003
 see+0.003
 reunited+0.003
 thrilled+0.003
 pad+0.002
Neg logit contributions
works-0.003
 matter-0.003
ports-0.003
 narrative-0.003
ist-0.003
Token Tom
Token ID4186
Feature activation-1.32e-01

Pos logit contributions
 see+0.010
 glad+0.009
 reunited+0.008
 back+0.008
 thrilled+0.007
Neg logit contributions
works-0.020
 matter-0.019
 narrative-0.019
 meanings-0.018
ports-0.018
Token asked
Token ID1965
Feature activation-1.06e-02

Pos logit contributions
works+0.006
 narrative+0.006
 matter+0.006
 meanings+0.005
der+0.005
Neg logit contributions
 see-0.004
 glad-0.004
 reunited-0.003
 back-0.003
 happy-0.003
Token.
Token ID13
Feature activation+3.68e-01

Pos logit contributions
works+0.001
 narrative+0.001
 meanings+0.000
 matter+0.000
cuts+0.000
Neg logit contributions
 back-0.000
 see-0.000
 happy-0.000
 reunited-0.000
 home-0.000
Token leaves
Token ID5667
Feature activation+3.01e-01

Pos logit contributions
 see+0.021
 glad+0.018
 back+0.017
 happy+0.017
 reunited+0.016
Neg logit contributions
works-0.021
 narrative-0.019
 matter-0.018
ports-0.018
der-0.018
Token.
Token ID13
Feature activation-1.93e-01

Pos logit contributions
 see+0.009
 glad+0.008
 reunited+0.007
 back+0.007
 thrilled+0.007
Neg logit contributions
works-0.014
 matter-0.013
 narrative-0.013
der-0.012
 meanings-0.012
Token Lily
Token ID20037
Feature activation+1.47e+00

Pos logit contributions
works+0.009
 matter+0.009
 narrative+0.009
 meanings+0.008
der+0.008
Neg logit contributions
 see-0.006
 glad-0.005
 reunited-0.005
 back-0.005
 happy-0.004
Token was
Token ID373
Feature activation+7.97e-01

Pos logit contributions
 see+0.055
 glad+0.052
 reunited+0.050
 back+0.048
 welcome+0.045
Neg logit contributions
works-0.052
 matter-0.050
 narrative-0.047
 meanings-0.046
der-0.045
Token happy
Token ID3772
Feature activation+2.99e-01

Pos logit contributions
 pad+0.023
ovi+0.022
umper+0.022
awk+0.021
 see+0.021
Neg logit contributions
 matter-0.028
 none-0.025
 importance-0.025
 concept-0.023
 exactly-0.023
Token to
Token ID284
Feature activation+2.29e+00

Pos logit contributions
 see+0.009
 glad+0.008
 reunited+0.008
 pad+0.008
ends+0.008
Neg logit contributions
der-0.012
 matter-0.011
works-0.011
 meanings-0.010
 narrative-0.010
Token have
Token ID423
Feature activation+7.00e-01

Pos logit contributions
 glad+0.056
 reunited+0.056
 see+0.056
 back+0.050
 thrilled+0.048
Neg logit contributions
works-0.108
 matter-0.105
 narrative-0.101
 meanings-0.095
der-0.093
Token her
Token ID607
Feature activation+1.46e+00

Pos logit contributions
 see+0.021
 glad+0.019
 reunited+0.018
 back+0.018
 thrilled+0.016
Neg logit contributions
works-0.030
 matter-0.030
 narrative-0.028
 meanings-0.027
ports-0.027
Token parent
Token ID2560
Feature activation+9.49e-01

Pos logit contributions
 see+0.050
 glad+0.045
 reunited+0.043
 back+0.043
 happy+0.039
Neg logit contributions
works-0.062
 narrative-0.056
 matter-0.056
 meanings-0.053
der-0.052
Token with
Token ID351
Feature activation+6.40e-01

Pos logit contributions
 see+0.027
 glad+0.025
 reunited+0.023
ends+0.022
ovi+0.020
Neg logit contributions
works-0.040
 matter-0.039
 narrative-0.036
 meanings-0.034
ist-0.034
Token her
Token ID607
Feature activation+9.94e-01

Pos logit contributions
 see+0.018
 glad+0.015
 reunited+0.015
 back+0.015
 happy+0.013
Neg logit contributions
works-0.032
 narrative-0.030
 matter-0.029
der-0.028
ports-0.028
Token.
Token ID13
Feature activation-2.46e-01

Pos logit contributions
works+0.031
 narrative+0.030
 matter+0.029
ports+0.028
 meanings+0.028
Neg logit contributions
 see-0.019
 outside-0.016
 glad-0.016
 happy-0.015
 back-0.015
Token It
Token ID632
Feature activation-2.40e-02

Pos logit contributions
works+0.013
 narrative+0.013
 matter+0.012
der+0.012
 meanings+0.012
Neg logit contributions
 see-0.006
 glad-0.005
 reunited-0.004
 back-0.004
 happy-0.004
Token was
Token ID373
Feature activation+1.76e-01

Pos logit contributions
works+0.001
 narrative+0.001
 matter+0.001
ports+0.001
 meanings+0.001
Neg logit contributions
 see-0.001
 glad-0.001
 back-0.001
 happy-0.001
 reunited-0.001
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
 see+0.005
 reunited+0.005
 glad+0.004
 back+0.004
ovi+0.004
Neg logit contributions
 matter-0.008
works-0.008
 narrative-0.007
 meanings-0.007
der-0.007
Token biggest
Token ID4094
Feature activation+8.53e-01

Pos logit contributions
 see+0.038
 reunited+0.035
 glad+0.034
 back+0.031
 thrilled+0.031
Neg logit contributions
works-0.049
 matter-0.048
 narrative-0.046
 meanings-0.043
ports-0.041
Token and
Token ID290
Feature activation+2.28e+00

Pos logit contributions
 see+0.030
 reunited+0.028
 glad+0.028
ends+0.026
 back+0.025
Neg logit contributions
works-0.032
 matter-0.032
 narrative-0.031
der-0.029
 meanings-0.028
Token sweet
Token ID6029
Feature activation+1.47e+00

Pos logit contributions
 see+0.083
 glad+0.075
 reunited+0.074
 back+0.071
 thrilled+0.066
Neg logit contributions
works-0.090
 matter-0.084
 narrative-0.082
 meanings-0.077
ports-0.076
Tokenest
Token ID395
Feature activation+6.69e-01

Pos logit contributions
 see+0.052
 glad+0.049
 reunited+0.049
ends+0.046
 back+0.045
Neg logit contributions
works-0.057
 matter-0.056
 narrative-0.051
 meanings-0.049
der-0.049
Token mel
Token ID7758
Feature activation+1.21e+00

Pos logit contributions
 see+0.016
 back+0.015
 home+0.013
 glad+0.013
 happy+0.013
Neg logit contributions
works-0.036
 narrative-0.034
fighters-0.032
ports-0.032
 meanings-0.031
Tokenon
Token ID261
Feature activation-7.78e-01

Pos logit contributions
 see+0.029
 glad+0.025
 reunited+0.024
 back+0.023
 thrilled+0.020
Neg logit contributions
works-0.065
 narrative-0.059
 matter-0.057
ports-0.055
 meanings-0.055
Token the
Token ID262
Feature activation+2.80e-01

Pos logit contributions
works+0.032
 narrative+0.030
 matter+0.030
 meanings+0.028
ports+0.027
Neg logit contributions
 see-0.027
 glad-0.025
 back-0.024
 reunited-0.024
 happy-0.022
Token painter
Token ID34537
Feature activation-9.63e-01

Pos logit contributions
works+0.001
 matter+0.001
 narrative+0.001
ports+0.001
 meanings+0.001
Neg logit contributions
 see-0.001
 reunited-0.001
 glad-0.001
 back-0.001
 thrilled-0.001
Token.
Token ID13
Feature activation+6.59e-01

Pos logit contributions
works+0.043
 narrative+0.040
 matter+0.039
 meanings+0.037
ports+0.037
Neg logit contributions
 see-0.031
 glad-0.028
 reunited-0.027
 back-0.027
 happy-0.025
Token He
Token ID679
Feature activation+2.77e-01

Pos logit contributions
 see+0.016
 glad+0.013
 back+0.011
 reunited+0.011
 happy+0.011
Neg logit contributions
works-0.036
 narrative-0.034
 matter-0.033
ports-0.032
der-0.031
Token was
Token ID373
Feature activation+8.19e-01

Pos logit contributions
 see+0.011
 glad+0.010
 reunited+0.009
 back+0.009
 happy+0.009
Neg logit contributions
works-0.011
 matter-0.010
 narrative-0.010
 meanings-0.009
der-0.009
Token a
Token ID257
Feature activation+1.58e+00

Pos logit contributions
 reunited+0.025
 pad+0.025
 see+0.024
ovi+0.021
 back+0.021
Neg logit contributions
 matter-0.032
works-0.028
 meanings-0.027
 cost-0.027
 narrative-0.027
Token brave
Token ID14802
Feature activation+2.27e+00

Pos logit contributions
 reunited+0.046
 see+0.045
ovi+0.038
ends+0.038
 pad+0.038
Neg logit contributions
 matter-0.065
works-0.064
 narrative-0.061
Deal-0.056
 concept-0.056
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
 see+0.063
 reunited+0.063
 glad+0.059
ends+0.056
 back+0.052
Neg logit contributions
 matter-0.100
works-0.097
 narrative-0.094
der-0.092
 concept-0.085
Token kind
Token ID1611
Feature activation+1.46e+00

Pos logit contributions
 see+0.028
 glad+0.025
 back+0.023
 reunited+0.022
 happy+0.022
Neg logit contributions
works-0.053
 narrative-0.050
 matter-0.049
der-0.047
 meanings-0.046
Token painter
Token ID34537
Feature activation-5.71e-01

Pos logit contributions
 see+0.036
 glad+0.031
 reunited+0.030
 back+0.029
 thrilled+0.025
Neg logit contributions
works-0.075
 matter-0.070
 narrative-0.070
 meanings-0.066
der-0.066
Token.
Token ID13
Feature activation-2.73e-02

Pos logit contributions
works+0.026
 narrative+0.025
 matter+0.023
oral+0.023
ports+0.022
Neg logit contributions
 see-0.018
 back-0.016
 glad-0.016
 reunited-0.015
 happy-0.015
Token He
Token ID679
Feature activation-1.41e-02

Pos logit contributions
works+0.001
 narrative+0.001
 matter+0.001
nexpected+0.001
ports+0.001
Neg logit contributions
 see-0.001
 glad-0.001
 back-0.001
 reunited-0.001
 happy-0.001
Token siblings
Token ID20569
Feature activation+1.97e-01

Pos logit contributions
 see+0.012
 glad+0.012
 back+0.011
 reunited+0.011
 happy+0.011
Neg logit contributions
works-0.024
 matter-0.021
 narrative-0.021
ports-0.020
 meanings-0.020
Token,
Token ID11
Feature activation+5.72e-01

Pos logit contributions
 see+0.006
 glad+0.005
 reunited+0.005
 back+0.005
 thrilled+0.004
Neg logit contributions
works-0.009
 matter-0.009
 narrative-0.008
der-0.008
 meanings-0.008
Token a
Token ID257
Feature activation+1.19e+00

Pos logit contributions
 see+0.014
 glad+0.012
 reunited+0.011
 back+0.011
 happy+0.010
Neg logit contributions
works-0.030
 narrative-0.029
 matter-0.028
 meanings-0.027
der-0.027
Token twin
Token ID15203
Feature activation+7.91e-01

Pos logit contributions
 see+0.039
 reunited+0.037
 back+0.034
 glad+0.034
 seen+0.033
Neg logit contributions
works-0.044
 matter-0.042
 narrative-0.040
oral-0.037
ports-0.037
Token brother
Token ID3956
Feature activation+2.19e-01

Pos logit contributions
 see+0.020
 glad+0.017
 reunited+0.016
 back+0.016
 thrilled+0.015
Neg logit contributions
works-0.040
 matter-0.037
 narrative-0.037
der-0.035
 meanings-0.035
Token and
Token ID290
Feature activation+2.27e+00

Pos logit contributions
 see+0.006
 glad+0.005
 reunited+0.005
 thrilled+0.005
 back+0.005
Neg logit contributions
works-0.010
 matter-0.009
der-0.009
 narrative-0.009
 meanings-0.009
Token sister
Token ID6621
Feature activation-1.30e-01

Pos logit contributions
 see+0.068
 glad+0.061
 reunited+0.060
 back+0.060
 pad+0.059
Neg logit contributions
works-0.095
 matter-0.094
 narrative-0.087
 meanings-0.084
ports-0.083
Token.
Token ID13
Feature activation+6.46e-01

Pos logit contributions
works+0.006
 narrative+0.006
 matter+0.006
 meanings+0.005
der+0.005
Neg logit contributions
 see-0.004
 glad-0.003
 back-0.003
 reunited-0.003
 happy-0.003
Token They
Token ID1119
Feature activation+9.74e-01

Pos logit contributions
 see+0.023
 glad+0.019
 back+0.019
 reunited+0.018
 happy+0.017
Neg logit contributions
works-0.028
 narrative-0.026
 matter-0.026
der-0.024
ports-0.024
Token were
Token ID547
Feature activation+7.08e-01

Pos logit contributions
 see+0.034
 glad+0.029
 back+0.028
 reunited+0.028
 happy+0.027
Neg logit contributions
works-0.042
 narrative-0.039
 matter-0.038
der-0.037
oral-0.036
Token intelligent
Token ID12661
Feature activation+1.18e+00

Pos logit contributions
 pad+0.021
 see+0.020
 reunited+0.018
ovi+0.018
line+0.017
Neg logit contributions
 matter-0.026
works-0.026
 narrative-0.025
 meanings-0.024
ports-0.022
Token Lisa
Token ID15378
Feature activation+6.06e-01

Pos logit contributions
 see+0.001
 glad+0.001
 reunited+0.001
 back+0.001
 thrilled+0.001
Neg logit contributions
works-0.002
 matter-0.002
 narrative-0.002
 meanings-0.002
der-0.002
Token that
Token ID326
Feature activation+2.53e-01

Pos logit contributions
 see+0.017
 glad+0.015
 reunited+0.015
 back+0.014
 thrilled+0.013
Neg logit contributions
works-0.028
 matter-0.026
 narrative-0.026
 meanings-0.025
der-0.025
Token if
Token ID611
Feature activation+1.82e-01

Pos logit contributions
 see+0.007
 glad+0.006
 back+0.006
 happy+0.005
 reunited+0.005
Neg logit contributions
works-0.013
 narrative-0.012
 matter-0.011
ports-0.011
 meanings-0.011
Token she
Token ID673
Feature activation+5.81e-01

Pos logit contributions
 see+0.004
 glad+0.003
 back+0.003
 happy+0.003
 reunited+0.003
Neg logit contributions
works-0.010
 narrative-0.009
 matter-0.009
der-0.009
 meanings-0.009
Token wanted
Token ID2227
Feature activation+1.15e+00

Pos logit contributions
 see+0.021
 back+0.017
 glad+0.017
 reunited+0.017
 happy+0.016
Neg logit contributions
works-0.025
 narrative-0.022
ports-0.021
 matter-0.021
lights-0.021
Token a
Token ID257
Feature activation+2.26e+00

Pos logit contributions
 see+0.027
 glad+0.023
 back+0.022
 reunited+0.021
 happy+0.019
Neg logit contributions
works-0.062
 narrative-0.058
 matter-0.057
 meanings-0.055
der-0.054
Token new
Token ID649
Feature activation+1.70e+00

Pos logit contributions
 see+0.085
 glad+0.079
 reunited+0.077
 back+0.073
 thrilled+0.070
Neg logit contributions
works-0.085
 matter-0.080
 narrative-0.077
 meanings-0.072
der-0.071
Token pair
Token ID5166
Feature activation+1.23e+00

Pos logit contributions
 see+0.065
 glad+0.058
 back+0.057
 reunited+0.056
 happy+0.053
Neg logit contributions
works-0.067
 narrative-0.061
 matter-0.059
 meanings-0.057
ports-0.056
Token of
Token ID286
Feature activation+1.15e+00

Pos logit contributions
 see+0.042
 glad+0.040
 reunited+0.038
 pad+0.035
ends+0.035
Neg logit contributions
works-0.046
 matter-0.045
 narrative-0.043
der-0.039
 meanings-0.038
Token scissors
Token ID41786
Feature activation+4.56e-01

Pos logit contributions
 see+0.049
 glad+0.045
 back+0.043
 reunited+0.042
 happy+0.041
Neg logit contributions
works-0.040
 narrative-0.036
 matter-0.035
der-0.033
ports-0.032
Token,
Token ID11
Feature activation+5.92e-01

Pos logit contributions
 see+0.011
 glad+0.010
 reunited+0.009
 thrilled+0.008
 back+0.008
Neg logit contributions
works-0.023
 matter-0.022
 narrative-0.021
der-0.020
 meanings-0.020
Token happy
Token ID3772
Feature activation+8.78e-02

Pos logit contributions
 pad+0.015
ovi+0.013
 reunited+0.012
 see+0.012
 back+0.011
Neg logit contributions
 matter-0.024
 narrative-0.024
works-0.024
Deal-0.024
 lesson-0.023
Token that
Token ID326
Feature activation+6.79e-01

Pos logit contributions
 see+0.003
 glad+0.002
 reunited+0.002
 back+0.002
 thrilled+0.002
Neg logit contributions
works-0.004
der-0.004
 matter-0.004
 narrative-0.003
 meanings-0.003
Token she
Token ID673
Feature activation+1.16e+00

Pos logit contributions
 see+0.018
 glad+0.016
 reunited+0.015
 back+0.015
 thrilled+0.013
Neg logit contributions
works-0.034
 matter-0.032
 narrative-0.032
 meanings-0.030
der-0.030
Token found
Token ID1043
Feature activation+3.84e-01

Pos logit contributions
 see+0.045
 glad+0.042
 reunited+0.040
 back+0.039
 home+0.039
Neg logit contributions
works-0.038
 matter-0.036
 narrative-0.033
nexpected-0.032
ports-0.032
Token the
Token ID262
Feature activation+1.34e+00

Pos logit contributions
 see+0.013
 glad+0.011
 reunited+0.011
 pad+0.010
ends+0.010
Neg logit contributions
works-0.016
 matter-0.015
 narrative-0.014
der-0.014
ports-0.014
Token little
Token ID1310
Feature activation+2.26e+00

Pos logit contributions
 see+0.044
 reunited+0.039
 glad+0.039
 back+0.035
ends+0.035
Neg logit contributions
works-0.057
 matter-0.054
 narrative-0.053
 meanings-0.049
der-0.049
Token dirty
Token ID11841
Feature activation+1.13e+00

Pos logit contributions
 see+0.076
 glad+0.069
 reunited+0.068
ends+0.063
 back+0.062
Neg logit contributions
works-0.093
 matter-0.090
 narrative-0.086
der-0.080
 meanings-0.079
Token pumpkin
Token ID30089
Feature activation+5.16e-01

Pos logit contributions
 see+0.033
 glad+0.030
 reunited+0.028
 back+0.027
 thrilled+0.025
Neg logit contributions
works-0.053
 matter-0.049
 narrative-0.049
 meanings-0.046
der-0.046
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 see+0.018
 glad+0.016
 reunited+0.015
ends+0.014
 thrilled+0.014
Neg logit contributions
works-0.021
 matter-0.019
 narrative-0.019
der-0.018
 meanings-0.017
Token<|endoftext|>
Token ID50256
Feature activation+1.10e-01

Pos logit contributions
 see+0.003
 reunited+0.003
 back+0.003
 glad+0.003
 happy+0.003
Neg logit contributions
works-0.005
 narrative-0.004
 meanings-0.004
der-0.004
 matter-0.004
TokenOnce
Token ID7454
Feature activation-7.98e-02

Pos logit contributions
 see+0.003
 glad+0.002
 reunited+0.002
 back+0.002
 thrilled+0.002
Neg logit contributions
works-0.006
 narrative-0.005
 matter-0.005
ports-0.005
 meanings-0.005
Token dog
Token ID3290
Feature activation-3.96e-01

Pos logit contributions
works+0.015
 matter+0.015
 narrative+0.014
 meanings+0.013
der+0.013
Neg logit contributions
 see-0.013
 glad-0.012
 reunited-0.012
 back-0.011
 thrilled-0.011
Token with
Token ID351
Feature activation-1.05e+00

Pos logit contributions
works+0.016
 matter+0.015
 narrative+0.013
der+0.013
ports+0.013
Neg logit contributions
 see-0.012
 glad-0.012
 thrilled-0.011
 reunited-0.011
 satisfied-0.010
Token a
Token ID257
Feature activation-8.96e-01

Pos logit contributions
works+0.050
 narrative+0.046
 matter+0.046
 meanings+0.043
der+0.043
Neg logit contributions
 see-0.030
 glad-0.027
 reunited-0.026
 back-0.025
 happy-0.023
Token book
Token ID1492
Feature activation-3.93e-01

Pos logit contributions
works+0.034
 matter+0.033
 narrative+0.032
 meanings+0.031
der+0.030
Neg logit contributions
 see-0.031
 reunited-0.028
 glad-0.028
 back-0.026
 thrilled-0.025
Token like
Token ID588
Feature activation-9.11e-01

Pos logit contributions
works+0.018
 narrative+0.016
 matter+0.016
der+0.016
 meanings+0.015
Neg logit contributions
 see-0.012
 glad-0.011
 reunited-0.010
 back-0.010
 thrilled-0.009
Token that
Token ID326
Feature activation-2.52e+00

Pos logit contributions
works+0.043
 matter+0.040
 narrative+0.040
 meanings+0.038
der+0.038
Neg logit contributions
 see-0.026
 glad-0.023
 reunited-0.022
 back-0.021
 thrilled-0.019
Token near
Token ID1474
Feature activation-7.75e-01

Pos logit contributions
works+0.106
 narrative+0.099
 matter+0.098
 meanings+0.090
ports+0.090
Neg logit contributions
 see-0.087
 glad-0.077
 back-0.076
 reunited-0.074
 happy-0.070
Token the
Token ID262
Feature activation-5.58e-01

Pos logit contributions
works+0.038
 narrative+0.035
 matter+0.035
der+0.033
 meanings+0.033
Neg logit contributions
 see-0.022
 glad-0.019
 back-0.018
 reunited-0.018
 happy-0.016
Token pond
Token ID16723
Feature activation-1.55e+00

Pos logit contributions
works+0.024
 narrative+0.022
 matter+0.022
 meanings+0.021
der+0.020
Neg logit contributions
 see-0.018
 glad-0.017
 reunited-0.016
 back-0.015
 thrilled-0.015
Token.
Token ID13
Feature activation-9.21e-01

Pos logit contributions
works+0.063
 narrative+0.059
 matter+0.059
 meanings+0.055
ports+0.053
Neg logit contributions
 see-0.055
 glad-0.049
 back-0.049
 reunited-0.047
 happy-0.045
Token It
Token ID632
Feature activation-1.41e+00

Pos logit contributions
works+0.041
 narrative+0.037
 matter+0.037
 meanings+0.036
ports+0.035
Neg logit contributions
 see-0.028
 glad-0.026
 reunited-0.025
 back-0.024
 thrilled-0.022
Token a
Token ID257
Feature activation+2.13e-01

Pos logit contributions
 reunited+0.004
 see+0.004
 glad+0.004
 pad+0.004
 satisfied+0.004
Neg logit contributions
 matter-0.003
 meanings-0.003
 narrative-0.003
ports-0.003
works-0.003
Token folder
Token ID9483
Feature activation+5.78e-01

Pos logit contributions
 see+0.007
 reunited+0.007
 glad+0.007
 back+0.006
 thrilled+0.006
Neg logit contributions
works-0.008
 matter-0.008
 narrative-0.008
 meanings-0.007
der-0.007
Token with
Token ID351
Feature activation-9.55e-01

Pos logit contributions
 see+0.017
 glad+0.015
 reunited+0.015
 back+0.014
 thrilled+0.013
Neg logit contributions
works-0.026
 matter-0.025
 narrative-0.024
 meanings-0.023
der-0.023
Token all
Token ID477
Feature activation-1.17e+00

Pos logit contributions
works+0.036
 matter+0.033
 narrative+0.033
 meanings+0.031
der+0.030
Neg logit contributions
 see-0.037
 glad-0.034
 reunited-0.032
 back-0.032
 happy-0.030
Token his
Token ID465
Feature activation-1.47e+00

Pos logit contributions
works+0.052
 matter+0.048
 narrative+0.048
 meanings+0.045
ports+0.045
Neg logit contributions
 see-0.037
 glad-0.033
 reunited-0.032
 back-0.031
 thrilled-0.029
Token favorite
Token ID4004
Feature activation-2.50e+00

Pos logit contributions
works+0.067
 narrative+0.061
 matter+0.060
der+0.056
nexpected+0.056
Neg logit contributions
 see-0.048
 glad-0.045
 happy-0.042
 back-0.041
 reunited-0.039
Token pictures
Token ID5986
Feature activation-2.62e-01

Pos logit contributions
works+0.104
 narrative+0.099
Deal+0.086
fighters+0.086
der+0.085
Neg logit contributions
 see-0.091
 back-0.081
 happy-0.078
 home-0.078
 drawing-0.076
Token in
Token ID287
Feature activation-8.14e-02

Pos logit contributions
works+0.010
 matter+0.009
 narrative+0.009
 meanings+0.009
ports+0.009
Neg logit contributions
 see-0.010
 glad-0.009
 back-0.009
 reunited-0.009
 happy-0.008
Token it
Token ID340
Feature activation-5.55e-01

Pos logit contributions
works+0.005
 narrative+0.005
Deal+0.005
ist+0.004
der+0.004
Neg logit contributions
 see-0.002
 back-0.001
 home-0.001
 happy-0.001
 glad-0.001
Token.
Token ID13
Feature activation-1.83e-01

Pos logit contributions
works+0.028
 narrative+0.027
 matter+0.025
oral+0.025
 meanings+0.024
Neg logit contributions
 see-0.015
 outside-0.014
 back-0.014
 playing-0.013
 happy-0.013
Token He
Token ID679
Feature activation-6.14e-01

Pos logit contributions
works+0.009
 narrative+0.008
 matter+0.008
 meanings+0.007
der+0.007
Neg logit contributions
 see-0.006
 glad-0.005
 back-0.005
 reunited-0.004
 happy-0.004
Token toy
Token ID13373
Feature activation-1.02e+00

Pos logit contributions
 see+0.007
 reunited+0.006
 glad+0.006
 back+0.006
 thrilled+0.005
Neg logit contributions
works-0.009
 matter-0.009
 narrative-0.009
 meanings-0.008
der-0.008
Token box
Token ID3091
Feature activation-5.13e-01

Pos logit contributions
works+0.052
 matter+0.049
 narrative+0.048
nexpected+0.045
 meanings+0.045
Neg logit contributions
 see-0.028
 back-0.026
 glad-0.024
 reunited-0.022
 home-0.022
Token with
Token ID351
Feature activation-3.51e-01

Pos logit contributions
works+0.026
 narrative+0.024
 matter+0.024
 meanings+0.023
ports+0.023
Neg logit contributions
 see-0.014
 glad-0.012
 back-0.012
 reunited-0.011
 happy-0.011
Token all
Token ID477
Feature activation-9.59e-01

Pos logit contributions
works+0.015
 narrative+0.014
 matter+0.013
der+0.012
oral+0.012
Neg logit contributions
 see-0.013
 glad-0.011
 back-0.011
 reunited-0.011
 happy-0.010
Token his
Token ID465
Feature activation-1.41e+00

Pos logit contributions
works+0.043
 matter+0.040
 narrative+0.039
 meanings+0.037
ports+0.037
Neg logit contributions
 see-0.030
 glad-0.027
 reunited-0.026
 back-0.025
 thrilled-0.023
Token favorite
Token ID4004
Feature activation-2.45e+00

Pos logit contributions
works+0.066
 narrative+0.061
 matter+0.058
oral+0.055
der+0.054
Neg logit contributions
 see-0.046
 glad-0.043
 back-0.040
 happy-0.040
 reunited-0.037
Token toys
Token ID14958
Feature activation+7.19e-03

Pos logit contributions
works+0.119
 narrative+0.118
oral+0.104
Deal+0.103
der+0.102
Neg logit contributions
 see-0.072
 back-0.067
 home-0.062
 happy-0.061
 glad-0.057
Token.
Token ID13
Feature activation-2.62e-02

Pos logit contributions
 see+0.000
 glad+0.000
 back+0.000
 reunited+0.000
 happy+0.000
Neg logit contributions
works-0.000
 narrative-0.000
 matter-0.000
 meanings-0.000
ports-0.000
Token One
Token ID1881
Feature activation-4.86e-01

Pos logit contributions
works+0.001
 narrative+0.001
 matter+0.001
 meanings+0.001
der+0.001
Neg logit contributions
 see-0.001
 glad-0.001
 back-0.001
 reunited-0.001
 happy-0.001
Token day
Token ID1110
Feature activation+1.49e-01

Pos logit contributions
works+0.019
 matter+0.019
 narrative+0.017
oral+0.016
 meanings+0.016
Neg logit contributions
 see-0.017
 reunited-0.016
 glad-0.015
 back-0.014
ends-0.014
Token,
Token ID11
Feature activation+3.94e-02

Pos logit contributions
 see+0.003
 glad+0.002
 reunited+0.002
 back+0.002
 thrilled+0.002
Neg logit contributions
works-0.008
 matter-0.008
 narrative-0.008
der-0.007
 meanings-0.007
Token sign
Token ID1051
Feature activation-2.50e-01

Pos logit contributions
 see+0.005
 glad+0.005
 reunited+0.005
ends+0.005
 thrilled+0.004
Neg logit contributions
works-0.006
 matter-0.006
 narrative-0.006
ports-0.005
der-0.005
Token that
Token ID326
Feature activation-1.08e+00

Pos logit contributions
works+0.013
 matter+0.013
 narrative+0.013
 meanings+0.012
nexpected+0.012
Neg logit contributions
 see-0.006
 back-0.005
 glad-0.005
 reunited-0.004
 happy-0.004
Token said
Token ID531
Feature activation-6.86e-01

Pos logit contributions
 narrative+0.058
oral+0.056
 matter+0.055
works+0.055
der+0.055
Neg logit contributions
 see-0.028
 happy-0.019
 glad-0.019
 back-0.018
 found-0.018
Token "
Token ID366
Feature activation-9.69e-01

Pos logit contributions
works+0.033
 narrative+0.033
 matter+0.033
 meanings+0.030
der+0.030
Neg logit contributions
 see-0.019
 glad-0.017
 back-0.016
 reunited-0.015
 happy-0.015
TokenCl
Token ID2601
Feature activation+6.18e-02

Pos logit contributions
works+0.036
 narrative+0.036
der+0.034
 matter+0.032
nexpected+0.032
Neg logit contributions
 see-0.038
 back-0.032
 reunited-0.031
 glad-0.031
 home-0.030
Tokenown
Token ID593
Feature activation-2.43e+00

Pos logit contributions
 see+0.002
 glad+0.002
 back+0.002
 reunited+0.002
 home+0.002
Neg logit contributions
works-0.003
 narrative-0.003
 matter-0.003
 meanings-0.002
der-0.002
Token Show
Token ID5438
Feature activation-1.42e+00

Pos logit contributions
 narrative+0.096
 matter+0.093
works+0.090
der+0.083
nexpected+0.082
Neg logit contributions
 see-0.090
 glad-0.078
 back-0.077
 reunited-0.073
 home-0.071
Token Today
Token ID6288
Feature activation-1.96e+00

Pos logit contributions
 narrative+0.055
works+0.053
 matter+0.053
der+0.048
 meanings+0.047
Neg logit contributions
 see-0.053
 back-0.048
 glad-0.045
 reunited-0.045
 home-0.044
Token".
Token ID1911
Feature activation-3.09e-01

Pos logit contributions
 narrative+0.101
works+0.093
 matter+0.093
 meanings+0.089
der+0.087
Neg logit contributions
 see-0.053
 back-0.045
 here-0.041
 glad-0.040
 welcome-0.039
Token They
Token ID1119
Feature activation-7.94e-01

Pos logit contributions
works+0.013
 matter+0.012
 narrative+0.012
ports+0.012
 meanings+0.012
Neg logit contributions
 see-0.009
 reunited-0.009
 glad-0.009
 back-0.008
 pad-0.008
Token wanted
Token ID2227
Feature activation-4.68e-02

Pos logit contributions
works+0.038
 narrative+0.037
 matter+0.034
der+0.034
oral+0.033
Neg logit contributions
 see-0.028
 glad-0.022
 happy-0.021
 back-0.021
 reunited-0.020
Token
Token ID198
Feature activation-9.70e-01

Pos logit contributions
works+0.007
 narrative+0.006
ports+0.006
 matter+0.006
der+0.006
Neg logit contributions
 reunited-0.003
 see-0.003
 glad-0.003
 back-0.003
 thrilled-0.003
Token
Token ID198
Feature activation-1.35e+00

Pos logit contributions
works+0.058
 matter+0.055
 narrative+0.055
 meanings+0.052
der+0.052
Neg logit contributions
 see-0.018
 glad-0.013
 back-0.012
 reunited-0.011
 happy-0.010
Token"
Token ID1
Feature activation-3.00e-01

Pos logit contributions
works+0.061
 narrative+0.057
 matter+0.057
 meanings+0.053
der+0.053
Neg logit contributions
 see-0.045
 glad-0.037
 back-0.035
 reunited-0.034
 happy-0.033
TokenHello
Token ID15496
Feature activation+1.90e-01

Pos logit contributions
works+0.013
 narrative+0.013
der+0.013
 meanings+0.012
 matter+0.012
Neg logit contributions
 see-0.009
 glad-0.007
 back-0.007
 reunited-0.007
 thrilled-0.007
Token there
Token ID612
Feature activation+5.21e-01

Pos logit contributions
 see+0.005
 glad+0.005
 reunited+0.005
 back+0.004
 thrilled+0.004
Neg logit contributions
works-0.009
 matter-0.008
ports-0.008
 narrative-0.008
 meanings-0.008
Token,"
Token ID553
Feature activation-2.43e+00

Pos logit contributions
 see+0.015
 glad+0.014
 reunited+0.013
ends+0.013
 back+0.012
Neg logit contributions
works-0.024
 matter-0.022
 narrative-0.021
 meanings-0.020
der-0.020
Token said
Token ID531
Feature activation-1.19e+00

Pos logit contributions
works+0.123
 narrative+0.119
der+0.116
 matter+0.113
oral+0.112
Neg logit contributions
 see-0.073
 glad-0.053
 happy-0.053
 back-0.048
 thrilled-0.046
Token the
Token ID262
Feature activation-6.10e-01

Pos logit contributions
 narrative+0.060
works+0.060
 matter+0.056
der+0.056
 meanings+0.055
Neg logit contributions
 see-0.035
 back-0.028
 glad-0.025
 happy-0.024
 home-0.023
Token creature
Token ID7185
Feature activation-2.98e-01

Pos logit contributions
works+0.030
 narrative+0.028
 matter+0.028
der+0.027
 meanings+0.026
Neg logit contributions
 see-0.017
 glad-0.015
 back-0.015
 reunited-0.014
 happy-0.013
Token.
Token ID13
Feature activation-8.04e-01

Pos logit contributions
works+0.014
 narrative+0.014
 matter+0.013
 meanings+0.013
ports+0.013
Neg logit contributions
 see-0.010
 back-0.009
 glad-0.008
 reunited-0.008
 happy-0.008
Token "
Token ID366
Feature activation-1.12e+00

Pos logit contributions
works+0.038
 narrative+0.036
 matter+0.036
 meanings+0.034
der+0.033
Neg logit contributions
 see-0.024
 glad-0.021
 back-0.019
 reunited-0.019
 happy-0.018
Token saw
Token ID2497
Feature activation-5.80e-01

Pos logit contributions
ports+0.011
 matter+0.011
 cost+0.011
works+0.010
 sounded+0.010
Neg logit contributions
 glad-0.009
 see-0.008
 welcome-0.008
 reunited-0.007
 satisfied-0.007
Token a
Token ID257
Feature activation-1.45e-01

Pos logit contributions
works+0.025
 matter+0.024
 narrative+0.023
 meanings+0.022
der+0.022
Neg logit contributions
 see-0.018
 glad-0.016
 reunited-0.016
 back-0.015
 thrilled-0.014
Token magazine
Token ID7093
Feature activation+5.54e-01

Pos logit contributions
works+0.006
 matter+0.006
 narrative+0.006
 meanings+0.005
ports+0.005
Neg logit contributions
 see-0.005
 glad-0.004
 reunited-0.004
 back-0.004
 thrilled-0.004
Token with
Token ID351
Feature activation-9.52e-01

Pos logit contributions
 see+0.015
 back+0.013
 glad+0.013
 reunited+0.013
 happy+0.011
Neg logit contributions
works-0.028
 matter-0.026
 narrative-0.026
 meanings-0.025
ports-0.025
Token his
Token ID465
Feature activation-1.22e+00

Pos logit contributions
works+0.041
 narrative+0.038
 matter+0.037
oral+0.034
der+0.034
Neg logit contributions
 see-0.033
 glad-0.029
 back-0.028
 reunited-0.028
 happy-0.026
Token favorite
Token ID4004
Feature activation-2.41e+00

Pos logit contributions
works+0.048
 matter+0.044
 narrative+0.044
 meanings+0.042
der+0.041
Neg logit contributions
 see-0.044
 glad-0.040
 reunited-0.039
 back-0.038
 thrilled-0.036
Token cartoon
Token ID16251
Feature activation-3.82e-01

Pos logit contributions
works+0.106
 narrative+0.102
oral+0.086
Deal+0.086
nexpected+0.086
Neg logit contributions
 see-0.086
 back-0.076
 happy-0.071
 glad-0.070
 reunited-0.070
Token character
Token ID2095
Feature activation-5.73e-01

Pos logit contributions
works+0.017
 matter+0.016
 narrative+0.016
nexpected+0.015
oral+0.014
Neg logit contributions
 see-0.012
 back-0.011
 reunited-0.011
 glad-0.011
 happy-0.010
Token on
Token ID319
Feature activation-2.92e-01

Pos logit contributions
works+0.027
 matter+0.026
 narrative+0.026
oral+0.023
 meanings+0.023
Neg logit contributions
 see-0.017
 back-0.015
 reunited-0.015
 glad-0.015
 playing-0.014
Token the
Token ID262
Feature activation-3.80e-01

Pos logit contributions
works+0.017
 narrative+0.017
oral+0.015
Deal+0.015
ist+0.015
Neg logit contributions
 see-0.007
 back-0.005
 home-0.004
 outside-0.004
 happy-0.004
Token cover
Token ID3002
Feature activation-7.78e-01

Pos logit contributions
works+0.015
 narrative+0.014
 matter+0.013
nexpected+0.013
der+0.013
Neg logit contributions
 see-0.014
 back-0.013
 glad-0.013
 reunited-0.012
 home-0.012
Token outside
Token ID2354
Feature activation+5.04e-02

Pos logit contributions
 see+0.001
 back+0.001
 glad+0.001
 reunited+0.001
 happy+0.000
Neg logit contributions
works-0.002
 matter-0.002
 narrative-0.002
 meanings-0.002
ports-0.002
Token,
Token ID11
Feature activation-4.95e-01

Pos logit contributions
 see+0.001
 back+0.001
 glad+0.001
 outside+0.001
 happy+0.001
Neg logit contributions
works-0.003
 matter-0.003
 narrative-0.002
 meanings-0.002
ports-0.002
Token especially
Token ID2592
Feature activation-8.32e-01

Pos logit contributions
works+0.020
 matter+0.019
 narrative+0.018
der+0.017
 meanings+0.017
Neg logit contributions
 see-0.017
 glad-0.016
 reunited-0.015
 back-0.015
 thrilled-0.014
Token with
Token ID351
Feature activation-8.06e-01

Pos logit contributions
 matter+0.032
works+0.032
 narrative+0.029
ports+0.029
 meanings+0.029
Neg logit contributions
 see-0.028
 reunited-0.026
 glad-0.026
 thrilled-0.024
 satisfied-0.022
Token his
Token ID465
Feature activation-1.09e+00

Pos logit contributions
works+0.034
 matter+0.032
 narrative+0.031
 meanings+0.030
der+0.029
Neg logit contributions
 see-0.026
 glad-0.024
 reunited-0.023
 back-0.022
 thrilled-0.021
Token favorite
Token ID4004
Feature activation-2.40e+00

Pos logit contributions
works+0.045
 matter+0.043
 narrative+0.042
 meanings+0.041
ports+0.039
Neg logit contributions
 see-0.036
 glad-0.032
 reunited-0.032
 back-0.030
 thrilled-0.028
Token rope
Token ID17182
Feature activation-1.13e-01

Pos logit contributions
works+0.109
 narrative+0.100
 matter+0.096
der+0.093
 meanings+0.091
Neg logit contributions
 see-0.078
 back-0.070
 glad-0.067
 reunited-0.065
 happy-0.063
Token.
Token ID13
Feature activation-7.67e-03

Pos logit contributions
works+0.005
 narrative+0.005
 matter+0.005
ports+0.005
 meanings+0.005
Neg logit contributions
 see-0.003
 back-0.003
 outside-0.003
 reunited-0.003
 playing-0.003
Token He
Token ID679
Feature activation-1.49e-01

Pos logit contributions
works+0.000
 matter+0.000
 narrative+0.000
 meanings+0.000
der+0.000
Neg logit contributions
 see-0.000
 glad-0.000
 reunited-0.000
 back-0.000
 thrilled-0.000
Token would
Token ID561
Feature activation-4.49e-01

Pos logit contributions
works+0.005
 matter+0.004
 narrative+0.004
 meanings+0.004
ports+0.004
Neg logit contributions
 see-0.007
 glad-0.006
 reunited-0.006
 back-0.006
 thrilled-0.006
Token tug
Token ID27762
Feature activation-4.19e-01

Pos logit contributions
 matter+0.017
works+0.016
 narrative+0.016
 meanings+0.015
 concept+0.014
Neg logit contributions
 reunited-0.014
 glad-0.014
 see-0.014
 back-0.013
 thrilled-0.012
Token himself
Token ID2241
Feature activation-5.16e-01

Pos logit contributions
 reunited+0.005
 glad+0.005
 see+0.005
ends+0.005
 back+0.004
Neg logit contributions
 matter-0.007
works-0.007
 narrative-0.007
der-0.006
 meanings-0.006
Token "
Token ID366
Feature activation-8.13e-01

Pos logit contributions
works+0.027
 matter+0.026
 narrative+0.026
der+0.025
 meanings+0.025
Neg logit contributions
 see-0.011
 glad-0.009
 reunited-0.009
 back-0.008
 thrilled-0.007
TokenI
Token ID40
Feature activation-7.49e-01

Pos logit contributions
works+0.032
 narrative+0.030
 matter+0.028
der+0.028
 meanings+0.028
Neg logit contributions
 see-0.031
 back-0.026
 glad-0.026
 reunited-0.025
 happy-0.024
Token wonder
Token ID4240
Feature activation-5.49e-01

Pos logit contributions
works+0.029
 matter+0.028
 narrative+0.027
 meanings+0.025
nexpected+0.024
Neg logit contributions
 glad-0.025
 see-0.025
 reunited-0.024
 back-0.023
 thrilled-0.022
Token what
Token ID644
Feature activation-4.40e-01

Pos logit contributions
works+0.026
 narrative+0.026
 matter+0.024
oral+0.023
 meanings+0.023
Neg logit contributions
 see-0.017
 glad-0.014
 back-0.014
 reunited-0.013
 happy-0.012
Token that
Token ID326
Feature activation-2.40e+00

Pos logit contributions
works+0.022
 narrative+0.021
 matter+0.020
der+0.020
 meanings+0.019
Neg logit contributions
 see-0.013
 glad-0.010
 back-0.010
 happy-0.009
 reunited-0.009
Token land
Token ID1956
Feature activation-1.52e+00

Pos logit contributions
works+0.094
 narrative+0.088
 matter+0.086
 meanings+0.082
der+0.081
Neg logit contributions
 see-0.091
 glad-0.079
 back-0.078
 reunited-0.075
 happy-0.072
Token is
Token ID318
Feature activation-1.80e+00

Pos logit contributions
works+0.063
 narrative+0.062
 matter+0.061
 meanings+0.057
der+0.056
Neg logit contributions
 see-0.051
 back-0.045
 glad-0.044
 reunited-0.042
 here-0.040
Token like
Token ID588
Feature activation-1.39e+00

Pos logit contributions
works+0.078
 matter+0.072
 narrative+0.071
 meanings+0.067
der+0.067
Neg logit contributions
 see-0.058
 glad-0.052
 reunited-0.050
 back-0.049
 thrilled-0.045
Token?".
Token ID43634
Feature activation+1.18e-01

Pos logit contributions
works+0.066
 narrative+0.063
 matter+0.062
 meanings+0.058
der+0.057
Neg logit contributions
 see-0.040
 glad-0.035
 back-0.034
 reunited-0.033
 happy-0.030
Token
Token ID198
Feature activation+6.59e-01

Pos logit contributions
 see+0.003
 glad+0.003
 reunited+0.002
 back+0.002
 thrilled+0.002
Neg logit contributions
works-0.006
 matter-0.006
 narrative-0.006
 meanings-0.005
ports-0.005
Token From
Token ID3574
Feature activation+5.53e-01

Pos logit contributions
works+0.036
 matter+0.035
 narrative+0.034
 meanings+0.032
ports+0.031
Neg logit contributions
 see-0.022
 glad-0.019
 back-0.018
 reunited-0.018
 happy-0.016
Token that
Token ID326
Feature activation-6.33e-01

Pos logit contributions
 see+0.018
 pad+0.017
 reunited+0.016
 thrilled+0.016
 glad+0.016
Neg logit contributions
works-0.022
 narrative-0.021
 matter-0.020
nexpected-0.020
der-0.019
Token day
Token ID1110
Feature activation-1.66e-01

Pos logit contributions
works+0.025
 matter+0.024
 narrative+0.024
oral+0.022
nexpected+0.021
Neg logit contributions
 see-0.022
 reunited-0.020
 glad-0.019
 back-0.018
ends-0.018
Token on
Token ID319
Feature activation+1.21e+00

Pos logit contributions
works+0.008
 matter+0.008
 narrative+0.007
 meanings+0.007
der+0.007
Neg logit contributions
 see-0.005
 glad-0.004
 back-0.004
 reunited-0.004
 happy-0.004
Token,
Token ID11
Feature activation-8.01e-01

Pos logit contributions
 see+0.023
 glad+0.020
 reunited+0.018
 back+0.017
 thrilled+0.015
Neg logit contributions
works-0.068
 narrative-0.064
 matter-0.064
der-0.061
 meanings-0.061
Token Sp
Token ID1338
Feature activation-2.39e+00

Pos logit contributions
works+0.030
 narrative+0.027
 matter+0.027
 meanings+0.025
ports+0.025
Neg logit contributions
 see-0.032
 glad-0.029
 reunited-0.028
 back-0.028
 happy-0.026
Tokenide
Token ID485
Feature activation-2.60e-01

Pos logit contributions
works+0.063
 matter+0.063
 narrative+0.059
ports+0.056
 meanings+0.056
Neg logit contributions
 see-0.100
 glad-0.098
 reunited-0.094
 happy-0.091
 pad-0.090
Tokeny
Token ID88
Feature activation-1.34e-01

Pos logit contributions
works+0.010
 matter+0.009
 narrative+0.009
der+0.008
 meanings+0.008
Neg logit contributions
 see-0.010
 glad-0.009
 reunited-0.009
 back-0.009
 thrilled-0.008
Token made
Token ID925
Feature activation-4.12e-01

Pos logit contributions
works+0.005
 narrative+0.004
 matter+0.004
der+0.004
 importance+0.004
Neg logit contributions
 see-0.005
 glad-0.005
 reunited-0.004
 back-0.004
 thrilled-0.004
Token sure
Token ID1654
Feature activation-2.16e-01

Pos logit contributions
 matter+0.017
works+0.016
ves+0.016
der+0.016
 importance+0.016
Neg logit contributions
 pad-0.012
 see-0.011
 glad-0.010
 reunited-0.010
 thrilled-0.009
Token to
Token ID284
Feature activation-2.54e-01

Pos logit contributions
works+0.010
 matter+0.010
 narrative+0.010
der+0.009
 meanings+0.009
Neg logit contributions
 see-0.006
 glad-0.005
 reunited-0.005
 pad-0.005
 back-0.004
Token little
Token ID1310
Feature activation-7.38e-01

Pos logit contributions
works+0.034
 matter+0.031
 narrative+0.031
der+0.029
 meanings+0.029
Neg logit contributions
 see-0.030
 glad-0.027
 back-0.027
 reunited-0.026
 happy-0.024
Token suitcase
Token ID45391
Feature activation-3.13e-01

Pos logit contributions
works+0.026
 matter+0.024
 narrative+0.024
ports+0.022
 meanings+0.022
Neg logit contributions
 see-0.029
 glad-0.027
 reunited-0.026
 back-0.025
 thrilled-0.024
Token with
Token ID351
Feature activation-6.71e-01

Pos logit contributions
works+0.017
 matter+0.016
 narrative+0.016
 meanings+0.015
der+0.015
Neg logit contributions
 see-0.007
 glad-0.006
 back-0.006
 reunited-0.006
 happy-0.005
Token all
Token ID477
Feature activation-1.11e+00

Pos logit contributions
works+0.033
 narrative+0.031
 matter+0.031
der+0.029
 meanings+0.029
Neg logit contributions
 see-0.019
 glad-0.016
 reunited-0.016
 back-0.016
 happy-0.014
Token her
Token ID607
Feature activation-9.27e-01

Pos logit contributions
works+0.047
 matter+0.045
 narrative+0.043
ports+0.041
 meanings+0.040
Neg logit contributions
 see-0.035
 glad-0.031
 reunited-0.031
 back-0.029
 pad-0.028
Token favorite
Token ID4004
Feature activation-2.38e+00

Pos logit contributions
works+0.038
 narrative+0.036
der+0.034
 matter+0.033
oral+0.032
Neg logit contributions
 see-0.035
 back-0.033
 glad-0.032
 happy-0.031
 home-0.030
Token toys
Token ID14958
Feature activation-1.38e-01

Pos logit contributions
works+0.108
 narrative+0.104
der+0.098
oral+0.091
 matter+0.091
Neg logit contributions
 see-0.080
 back-0.074
 home-0.069
 happy-0.067
 reunited-0.066
Token and
Token ID290
Feature activation-6.11e-01

Pos logit contributions
works+0.007
 matter+0.007
 narrative+0.007
 meanings+0.006
ports+0.006
Neg logit contributions
 see-0.003
 glad-0.003
 reunited-0.003
 back-0.003
 happy-0.002
Token clothes
Token ID8242
Feature activation-2.81e-01

Pos logit contributions
works+0.021
 matter+0.020
 narrative+0.019
 meanings+0.018
ports+0.018
Neg logit contributions
 see-0.024
 glad-0.023
 reunited-0.022
 back-0.022
 thrilled-0.020
Token.
Token ID13
Feature activation-3.77e-01

Pos logit contributions
works+0.011
 narrative+0.011
 matter+0.011
 meanings+0.010
ports+0.010
Neg logit contributions
 see-0.010
 glad-0.009
 back-0.009
 reunited-0.009
 happy-0.008
Token When
Token ID1649
Feature activation-4.39e-01

Pos logit contributions
works+0.013
 meanings+0.013
ports+0.012
der+0.012
 matter+0.012
Neg logit contributions
 see-0.013
 reunited-0.013
 glad-0.013
 seen-0.012
 satisfied-0.011
Token say
Token ID910
Feature activation-1.06e-01

Pos logit contributions
works+0.051
 narrative+0.047
 matter+0.045
ports+0.045
der+0.044
Neg logit contributions
 see-0.034
 glad-0.029
 back-0.027
 reunited-0.027
 happy-0.025
Token,
Token ID11
Feature activation-4.14e-01

Pos logit contributions
works+0.005
 matter+0.005
 narrative+0.005
 meanings+0.004
der+0.004
Neg logit contributions
 see-0.003
 glad-0.003
 reunited-0.003
 back-0.002
 thrilled-0.002
Token "
Token ID366
Feature activation-8.58e-02

Pos logit contributions
works+0.020
 narrative+0.019
 matter+0.018
 meanings+0.017
ports+0.017
Neg logit contributions
 see-0.012
 glad-0.011
 back-0.010
 reunited-0.010
 happy-0.010
TokenI
Token ID40
Feature activation-4.11e-01

Pos logit contributions
works+0.003
 narrative+0.003
der+0.003
 matter+0.003
ports+0.003
Neg logit contributions
 see-0.003
 glad-0.003
 back-0.003
 reunited-0.003
 happy-0.003
Token hate
Token ID5465
Feature activation-7.34e-01

Pos logit contributions
works+0.019
 narrative+0.017
 matter+0.017
der+0.016
 meanings+0.016
Neg logit contributions
 see-0.013
 glad-0.011
 reunited-0.011
 back-0.011
 happy-0.010
Token that
Token ID326
Feature activation-2.36e+00

Pos logit contributions
works+0.030
 narrative+0.028
 matter+0.028
 meanings+0.026
der+0.025
Neg logit contributions
 see-0.026
 glad-0.024
 reunited-0.023
 back-0.023
 happy-0.021
Token bush
Token ID24484
Feature activation-6.53e-01

Pos logit contributions
works+0.114
 narrative+0.107
 matter+0.105
 meanings+0.102
der+0.099
Neg logit contributions
 see-0.069
 back-0.059
 glad-0.059
 reunited-0.057
 home-0.053
Token."
Token ID526
Feature activation-4.97e-01

Pos logit contributions
works+0.032
 narrative+0.030
 matter+0.030
 meanings+0.028
ports+0.028
Neg logit contributions
 see-0.018
 back-0.016
 glad-0.016
 reunited-0.015
 happy-0.014
Token
Token ID198
Feature activation-1.28e-02

Pos logit contributions
works+0.028
 matter+0.026
 narrative+0.026
 meanings+0.025
der+0.025
Neg logit contributions
 see-0.010
 glad-0.009
 reunited-0.008
 back-0.008
 happy-0.007
Token
Token ID198
Feature activation-6.04e-01

Pos logit contributions
works+0.001
 narrative+0.001
 matter+0.001
 meanings+0.001
ports+0.001
Neg logit contributions
 see-0.000
 glad-0.000
 reunited-0.000
 back-0.000
 thrilled-0.000
TokenOne
Token ID3198
Feature activation-8.02e-01

Pos logit contributions
works+0.020
 narrative+0.017
der+0.017
 matter+0.017
ports+0.016
Neg logit contributions
 see-0.028
 glad-0.024
 back-0.024
 happy-0.024
 reunited-0.023
Token has
Token ID468
Feature activation-1.46e+00

Pos logit contributions
works+0.006
 narrative+0.006
 matter+0.006
der+0.005
 meanings+0.005
Neg logit contributions
 see-0.007
 glad-0.006
 back-0.006
 reunited-0.006
 happy-0.006
Token a
Token ID257
Feature activation-1.26e+00

Pos logit contributions
works+0.066
 narrative+0.062
 matter+0.060
der+0.056
ports+0.056
Neg logit contributions
 see-0.046
 glad-0.040
 back-0.040
 reunited-0.039
 happy-0.036
Token picture
Token ID4286
Feature activation-1.01e+00

Pos logit contributions
works+0.053
 matter+0.049
 narrative+0.049
 meanings+0.046
der+0.045
Neg logit contributions
 see-0.043
 glad-0.039
 reunited-0.038
 back-0.037
 thrilled-0.034
Token of
Token ID286
Feature activation-5.42e-01

Pos logit contributions
works+0.041
 matter+0.037
 narrative+0.036
der+0.034
 meanings+0.034
Neg logit contributions
 see-0.036
 glad-0.034
 reunited-0.032
 back-0.030
 thrilled-0.029
Token your
Token ID534
Feature activation-9.78e-01

Pos logit contributions
works+0.025
 narrative+0.024
 matter+0.023
 meanings+0.022
der+0.022
Neg logit contributions
 see-0.016
 glad-0.014
 reunited-0.014
 back-0.014
 happy-0.012
Token favorite
Token ID4004
Feature activation-2.35e+00

Pos logit contributions
works+0.039
 matter+0.036
 narrative+0.035
 meanings+0.034
der+0.033
Neg logit contributions
 see-0.035
 glad-0.032
 reunited-0.031
 back-0.030
 thrilled-0.028
Token toy
Token ID13373
Feature activation-1.24e+00

Pos logit contributions
works+0.099
 narrative+0.094
 matter+0.086
nexpected+0.084
der+0.083
Neg logit contributions
 see-0.082
 back-0.073
 glad-0.070
 reunited-0.070
 home-0.068
Token on
Token ID319
Feature activation-3.78e-01

Pos logit contributions
works+0.054
 narrative+0.050
 matter+0.050
 meanings+0.046
ports+0.046
Neg logit contributions
 see-0.041
 glad-0.036
 back-0.036
 reunited-0.035
 happy-0.032
Token the
Token ID262
Feature activation+5.40e-02

Pos logit contributions
works+0.019
 narrative+0.018
 matter+0.017
 meanings+0.016
der+0.016
Neg logit contributions
 see-0.011
 back-0.009
 glad-0.009
 reunited-0.008
 home-0.008
Token cover
Token ID3002
Feature activation-1.05e+00

Pos logit contributions
 see+0.002
 glad+0.002
 reunited+0.002
 thrilled+0.002
ends+0.002
Neg logit contributions
works-0.002
 matter-0.002
 narrative-0.002
 meanings-0.002
ports-0.002
Token.
Token ID13
Feature activation-4.33e-01

Pos logit contributions
works+0.044
 narrative+0.042
 matter+0.041
 meanings+0.039
ports+0.038
Neg logit contributions
 see-0.035
 back-0.031
 glad-0.031
 reunited-0.031
 home-0.028
Token had
Token ID550
Feature activation-6.12e-02

Pos logit contributions
 see+0.000
 reunited+0.000
 glad+0.000
 back+0.000
 pad+0.000
Neg logit contributions
works-0.000
 narrative-0.000
 matter-0.000
 meanings-0.000
ports-0.000
Token a
Token ID257
Feature activation-4.09e-02

Pos logit contributions
 matter+0.002
works+0.002
 narrative+0.002
 meanings+0.002
ports+0.002
Neg logit contributions
 reunited-0.002
 see-0.002
 glad-0.002
 back-0.002
 pad-0.002
Token toy
Token ID13373
Feature activation-1.04e+00

Pos logit contributions
works+0.002
 matter+0.002
 narrative+0.002
 meanings+0.002
der+0.002
Neg logit contributions
 see-0.001
 reunited-0.001
 glad-0.001
 back-0.001
 thrilled-0.001
Token bunny
Token ID44915
Feature activation+4.92e-02

Pos logit contributions
works+0.052
 matter+0.050
 narrative+0.048
nexpected+0.046
ports+0.045
Neg logit contributions
 see-0.028
 back-0.026
 glad-0.024
 reunited-0.023
 home-0.023
Token with
Token ID351
Feature activation-6.06e-01

Pos logit contributions
 see+0.001
 glad+0.001
 reunited+0.001
 back+0.001
 thrilled+0.001
Neg logit contributions
works-0.002
 matter-0.002
 narrative-0.002
 meanings-0.002
der-0.002
Token one
Token ID530
Feature activation-2.35e+00

Pos logit contributions
works+0.028
 matter+0.026
 narrative+0.026
 meanings+0.025
der+0.025
Neg logit contributions
 see-0.018
 glad-0.016
 reunited-0.015
 back-0.015
 thrilled-0.013
Token ear
Token ID1027
Feature activation-6.17e-01

Pos logit contributions
works+0.098
 matter+0.091
 narrative+0.090
 meanings+0.085
der+0.085
Neg logit contributions
 see-0.080
 glad-0.072
 reunited-0.070
 back-0.068
 thrilled-0.063
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
works+0.024
der+0.023
 matter+0.022
 meanings+0.022
ports+0.021
Neg logit contributions
 see-0.021
 glad-0.020
 reunited-0.019
 thrilled-0.017
 happy-0.017
Token She
Token ID1375
Feature activation-2.86e-01

Pos logit contributions
works+0.008
 matter+0.007
 narrative+0.007
 meanings+0.007
der+0.007
Neg logit contributions
 see-0.005
 glad-0.005
 reunited-0.005
 back-0.005
 thrilled-0.004
Token loved
Token ID6151
Feature activation-5.68e-01

Pos logit contributions
works+0.011
 matter+0.010
 narrative+0.010
 meanings+0.009
der+0.009
Neg logit contributions
 see-0.011
 glad-0.010
 reunited-0.009
 back-0.009
 thrilled-0.009
Token her
Token ID607
Feature activation-1.24e+00

Pos logit contributions
works+0.024
 matter+0.024
 meanings+0.024
 narrative+0.022
der+0.022
Neg logit contributions
 see-0.017
 reunited-0.015
 glad-0.015
 thrilled-0.014
 back-0.013
Token out
Token ID503
Feature activation-9.31e-01

Pos logit contributions
works+0.027
ports+0.025
 matter+0.025
 narrative+0.024
 meanings+0.024
Neg logit contributions
 see-0.016
 glad-0.015
 reunited-0.015
 back-0.012
 thrilled-0.012
Token a
Token ID257
Feature activation-9.09e-01

Pos logit contributions
works+0.051
 narrative+0.048
 matter+0.047
der+0.045
oral+0.044
Neg logit contributions
 see-0.022
 back-0.018
 glad-0.018
 reunited-0.016
 happy-0.016
Token movie
Token ID3807
Feature activation-2.51e-01

Pos logit contributions
works+0.037
 matter+0.034
 narrative+0.033
ports+0.031
 meanings+0.031
Neg logit contributions
 see-0.032
 glad-0.029
 reunited-0.028
 back-0.027
 thrilled-0.025
Token with
Token ID351
Feature activation-1.64e-01

Pos logit contributions
works+0.012
 matter+0.011
 narrative+0.011
 meanings+0.010
der+0.010
Neg logit contributions
 see-0.007
 glad-0.006
 reunited-0.006
 back-0.006
 thrilled-0.005
Token her
Token ID607
Feature activation-1.02e+00

Pos logit contributions
works+0.007
 matter+0.007
 narrative+0.007
 meanings+0.006
ports+0.006
Neg logit contributions
 see-0.005
 glad-0.005
 reunited-0.005
 back-0.005
 thrilled-0.004
Token favorite
Token ID4004
Feature activation-2.32e+00

Pos logit contributions
works+0.047
 narrative+0.044
 matter+0.043
der+0.040
 meanings+0.040
Neg logit contributions
 see-0.033
 glad-0.030
 back-0.028
 reunited-0.027
 happy-0.026
Token princess
Token ID21752
Feature activation-9.27e-03

Pos logit contributions
 narrative+0.104
works+0.098
der+0.089
Deal+0.083
 matter+0.082
Neg logit contributions
 see-0.083
 back-0.072
 happy-0.072
 glad-0.070
 home-0.066
Token.
Token ID13
Feature activation-5.42e-02

Pos logit contributions
works+0.000
 matter+0.000
 meanings+0.000
 narrative+0.000
ports+0.000
Neg logit contributions
 see-0.000
 glad-0.000
 reunited-0.000
 back-0.000
 thrilled-0.000
Token They
Token ID1119
Feature activation-2.94e-01

Pos logit contributions
works+0.003
 matter+0.002
 meanings+0.002
der+0.002
 narrative+0.002
Neg logit contributions
 see-0.002
 glad-0.001
 reunited-0.001
 back-0.001
 thrilled-0.001
Token went
Token ID1816
Feature activation-5.03e-02

Pos logit contributions
works+0.014
 narrative+0.014
 matter+0.013
oral+0.013
der+0.012
Neg logit contributions
 see-0.010
 glad-0.008
 back-0.008
 happy-0.008
 reunited-0.008
Token home
Token ID1363
Feature activation+9.72e-02

Pos logit contributions
works+0.002
ports+0.002
 matter+0.002
fighters+0.002
 concept+0.002
Neg logit contributions
 see-0.001
 glad-0.001
 reunited-0.001
 thrilled-0.001
 pad-0.001
Token said
Token ID531
Feature activation-4.66e-01

Pos logit contributions
works+0.013
 narrative+0.012
der+0.011
 matter+0.011
ports+0.011
Neg logit contributions
 see-0.007
 glad-0.006
 back-0.006
 happy-0.006
 reunited-0.006
Token,
Token ID11
Feature activation-4.75e-01

Pos logit contributions
works+0.031
 narrative+0.030
 matter+0.029
 meanings+0.028
ports+0.027
Neg logit contributions
 see-0.006
 glad-0.005
 back-0.005
 happy-0.004
 reunited-0.004
Token "
Token ID366
Feature activation-5.41e-02

Pos logit contributions
works+0.024
 narrative+0.023
 matter+0.022
 meanings+0.021
ports+0.020
Neg logit contributions
 see-0.013
 glad-0.012
 back-0.012
 happy-0.011
 reunited-0.011
TokenYes
Token ID5297
Feature activation+3.59e-01

Pos logit contributions
works+0.002
 narrative+0.002
der+0.002
ports+0.002
 matter+0.002
Neg logit contributions
 see-0.002
 glad-0.002
 back-0.002
 happy-0.002
 thrilled-0.002
Token,
Token ID11
Feature activation-3.04e-01

Pos logit contributions
 see+0.004
 back+0.003
 glad+0.002
 happy+0.001
 reunited+0.001
Neg logit contributions
 narrative-0.024
works-0.024
 meanings-0.023
ports-0.022
 matter-0.022
Token that
Token ID326
Feature activation-2.32e+00

Pos logit contributions
works+0.014
 narrative+0.014
 meanings+0.013
ports+0.013
 matter+0.013
Neg logit contributions
 see-0.009
 glad-0.008
 back-0.008
 happy-0.007
 reunited-0.007
Token's
Token ID338
Feature activation-1.17e+00

Pos logit contributions
works+0.107
 narrative+0.096
 meanings+0.092
der+0.091
fighters+0.090
Neg logit contributions
 see-0.074
 back-0.066
 happy-0.063
 glad-0.063
 welcome-0.057
Token a
Token ID257
Feature activation-8.21e-01

Pos logit contributions
works+0.064
 narrative+0.062
der+0.060
oral+0.059
 meanings+0.057
Neg logit contributions
 see-0.026
 back-0.024
 glad-0.023
 happy-0.020
 home-0.020
Token thoughtful
Token ID22677
Feature activation-4.90e-01

Pos logit contributions
works+0.035
 matter+0.032
 narrative+0.032
 meanings+0.030
der+0.029
Neg logit contributions
 see-0.028
 glad-0.025
 reunited-0.024
 back-0.024
 happy-0.022
Token idea
Token ID2126
Feature activation-9.67e-01

Pos logit contributions
works+0.020
 narrative+0.018
 matter+0.018
 meanings+0.017
ports+0.017
Neg logit contributions
 see-0.018
 glad-0.016
 back-0.015
 reunited-0.015
 happy-0.014
Token.
Token ID13
Feature activation-4.49e-01

Pos logit contributions
works+0.043
 narrative+0.042
 matter+0.040
 meanings+0.038
ports+0.038
Neg logit contributions
 see-0.031
 glad-0.027
 back-0.027
 reunited-0.026
 home-0.024
Token Do
Token ID2141
Feature activation-8.13e-01

Pos logit contributions
works+0.030
 matter+0.029
 narrative+0.029
 meanings+0.027
der+0.027
Neg logit contributions
 see-0.018
 glad-0.015
 back-0.015
 reunited-0.014
 happy-0.013
Token not
Token ID407
Feature activation-1.27e-01

Pos logit contributions
works+0.034
 matter+0.031
 narrative+0.031
 meanings+0.029
ports+0.029
Neg logit contributions
 see-0.028
 glad-0.026
 reunited-0.026
 back-0.024
 thrilled-0.023
Token go
Token ID467
Feature activation+1.10e-02

Pos logit contributions
works+0.006
 narrative+0.005
 matter+0.005
 meanings+0.005
der+0.005
Neg logit contributions
 see-0.004
 glad-0.004
 reunited-0.004
 back-0.004
 happy-0.003
Token near
Token ID1474
Feature activation-5.87e-01

Pos logit contributions
 see+0.000
 glad+0.000
 reunited+0.000
 thrilled+0.000
 pad+0.000
Neg logit contributions
works-0.000
 matter-0.000
 narrative-0.000
ports-0.000
oral-0.000
Token the
Token ID262
Feature activation-1.98e-01

Pos logit contributions
works+0.027
 narrative+0.026
 matter+0.025
 meanings+0.024
der+0.024
Neg logit contributions
 see-0.018
 glad-0.016
 back-0.015
 reunited-0.015
 happy-0.013
Token water
Token ID1660
Feature activation-2.31e+00

Pos logit contributions
works+0.010
 matter+0.009
 narrative+0.009
ports+0.009
 meanings+0.008
Neg logit contributions
 see-0.005
 glad-0.005
 reunited-0.004
 back-0.004
 thrilled-0.004
Token."
Token ID526
Feature activation+2.96e-01

Pos logit contributions
works+0.096
 matter+0.091
 narrative+0.091
 meanings+0.089
nexpected+0.083
Neg logit contributions
 see-0.081
 back-0.072
 glad-0.069
 reunited-0.069
 home-0.064
Token
Token ID198
Feature activation+8.41e-02

Pos logit contributions
 see+0.006
 glad+0.005
 reunited+0.004
 back+0.004
 happy+0.004
Neg logit contributions
works-0.017
 matter-0.016
 narrative-0.016
 meanings-0.015
der-0.015
Token
Token ID198
Feature activation-5.99e-02

Pos logit contributions
 see+0.002
 reunited+0.002
 glad+0.002
 back+0.002
 pad+0.002
Neg logit contributions
works-0.004
 narrative-0.004
 matter-0.004
oral-0.004
ports-0.004
TokenThe
Token ID464
Feature activation-2.36e-01

Pos logit contributions
works+0.002
 matter+0.002
 narrative+0.002
 meanings+0.002
der+0.002
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.002
 back-0.002
 thrilled-0.002
Token duck
Token ID22045
Feature activation+2.65e-01

Pos logit contributions
works+0.011
 matter+0.010
 narrative+0.010
 meanings+0.009
der+0.009
Neg logit contributions
 see-0.008
 glad-0.007
 back-0.007
 reunited-0.006
 happy-0.006
Token smiled
Token ID13541
Feature activation+2.39e-01

Pos logit contributions
 see+0.003
 glad+0.003
 reunited+0.003
 back+0.003
 happy+0.002
Neg logit contributions
works-0.004
 matter-0.003
 narrative-0.003
 meanings-0.003
ports-0.003
Token,
Token ID11
Feature activation+1.24e-01

Pos logit contributions
 see+0.009
 glad+0.008
 reunited+0.008
 pad+0.007
ovi+0.007
Neg logit contributions
works-0.008
der-0.008
 matter-0.008
 narrative-0.008
 meanings-0.007
Token excited
Token ID6568
Feature activation-3.63e-01

Pos logit contributions
 see+0.003
 reunited+0.003
 back+0.002
ovi+0.002
 pad+0.002
Neg logit contributions
der-0.006
 matter-0.005
works-0.005
 narrative-0.005
 meanings-0.005
Token to
Token ID284
Feature activation+7.50e-01

Pos logit contributions
works+0.016
 matter+0.015
 narrative+0.015
 meanings+0.014
der+0.014
Neg logit contributions
 see-0.011
 glad-0.010
 reunited-0.010
 back-0.009
 thrilled-0.009
Token share
Token ID2648
Feature activation-5.54e-01

Pos logit contributions
 reunited+0.020
 glad+0.020
 pad+0.019
 see+0.019
 thrilled+0.018
Neg logit contributions
 matter-0.031
works-0.030
 narrative-0.029
 meanings-0.028
oral-0.027
Token his
Token ID465
Feature activation-2.30e+00

Pos logit contributions
works+0.028
 narrative+0.026
 matter+0.026
 meanings+0.025
der+0.024
Neg logit contributions
 see-0.015
 glad-0.013
 back-0.013
 reunited-0.012
 happy-0.012
Token new
Token ID649
Feature activation-1.98e+00

Pos logit contributions
works+0.102
 narrative+0.093
 matter+0.090
 meanings+0.088
der+0.087
Neg logit contributions
 see-0.077
 glad-0.071
 back-0.070
 happy-0.067
 reunited-0.064
Token normal
Token ID3487
Feature activation-1.84e+00

Pos logit contributions
works+0.084
 narrative+0.077
 matter+0.075
 meanings+0.072
ports+0.070
Neg logit contributions
 see-0.069
 glad-0.062
 back-0.062
 reunited-0.060
 happy-0.057
Token bed
Token ID3996
Feature activation-4.98e-01

Pos logit contributions
works+0.083
 narrative+0.076
 meanings+0.072
 matter+0.072
ports+0.070
Neg logit contributions
 see-0.059
 back-0.055
 glad-0.053
 reunited-0.050
 home-0.050
Tokentime
Token ID2435
Feature activation-5.07e-01

Pos logit contributions
works+0.023
 matter+0.021
 narrative+0.021
 meanings+0.020
ports+0.020
Neg logit contributions
 see-0.015
 glad-0.014
 back-0.014
 reunited-0.013
 happy-0.012
Token routine
Token ID8027
Feature activation-3.20e-01

Pos logit contributions
works+0.020
 narrative+0.018
 matter+0.018
 meanings+0.017
ports+0.017
Neg logit contributions
 see-0.019
 glad-0.017
 back-0.017
 reunited-0.017
 happy-0.015
Token From
Token ID3574
Feature activation+7.46e-01

Pos logit contributions
works+0.003
 matter+0.003
 narrative+0.003
 meanings+0.003
ports+0.003
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.002
 back-0.002
 happy-0.002
Token that
Token ID326
Feature activation-2.59e-01

Pos logit contributions
 see+0.024
 reunited+0.022
 pad+0.021
 glad+0.021
 thrilled+0.020
Neg logit contributions
works-0.030
 matter-0.028
nexpected-0.028
 narrative-0.027
der-0.026
Token day
Token ID1110
Feature activation+1.57e-01

Pos logit contributions
works+0.011
 matter+0.010
 narrative+0.010
nexpected+0.009
 meanings+0.009
Neg logit contributions
 see-0.009
 reunited-0.008
 glad-0.008
 back-0.007
ends-0.007
Token on
Token ID319
Feature activation+1.08e+00

Pos logit contributions
 see+0.004
 glad+0.004
 back+0.004
 reunited+0.004
 happy+0.003
Neg logit contributions
works-0.008
 matter-0.007
 narrative-0.007
 meanings-0.007
der-0.007
Token,
Token ID11
Feature activation-5.33e-01

Pos logit contributions
 see+0.020
 glad+0.016
 reunited+0.015
 back+0.014
 thrilled+0.012
Neg logit contributions
works-0.063
 matter-0.059
 narrative-0.059
 meanings-0.057
der-0.056
Token Sp
Token ID1338
Feature activation-2.30e+00

Pos logit contributions
works+0.020
 narrative+0.018
 matter+0.018
 meanings+0.017
ports+0.017
Neg logit contributions
 see-0.021
 glad-0.019
 reunited-0.018
 back-0.018
 happy-0.017
Tokenide
Token ID485
Feature activation-2.74e-01

Pos logit contributions
 matter+0.061
works+0.061
 narrative+0.058
ports+0.054
 meanings+0.054
Neg logit contributions
 see-0.096
 glad-0.094
 reunited-0.091
 pad-0.088
 happy-0.087
Tokeny
Token ID88
Feature activation-6.38e-03

Pos logit contributions
works+0.010
 matter+0.010
 narrative+0.009
der+0.009
 meanings+0.009
Neg logit contributions
 see-0.010
 glad-0.010
 reunited-0.009
 back-0.009
 thrilled-0.008
Token knew
Token ID2993
Feature activation+5.47e-01

Pos logit contributions
works+0.000
 matter+0.000
 narrative+0.000
der+0.000
 meanings+0.000
Neg logit contributions
 see-0.000
 glad-0.000
 reunited-0.000
 back-0.000
 thrilled-0.000
Token that
Token ID326
Feature activation-1.74e-01

Pos logit contributions
 see+0.012
 glad+0.010
 back+0.010
 reunited+0.009
 home+0.008
Neg logit contributions
works-0.031
 narrative-0.029
 matter-0.028
ports-0.027
 meanings-0.027
Token even
Token ID772
Feature activation-1.36e-01

Pos logit contributions
works+0.008
 narrative+0.008
 matter+0.008
ports+0.007
 meanings+0.007
Neg logit contributions
 see-0.005
 glad-0.005
 back-0.004
 reunited-0.004
 happy-0.004
Token.
Token ID13
Feature activation-5.19e-01

Pos logit contributions
works+0.025
 matter+0.023
der+0.022
 narrative+0.022
 meanings+0.021
Neg logit contributions
 see-0.019
 glad-0.017
 reunited-0.017
 pad-0.015
 back-0.015
Token "
Token ID366
Feature activation-1.02e+00

Pos logit contributions
works+0.017
der+0.015
 matter+0.015
 meanings+0.014
ports+0.014
Neg logit contributions
 reunited-0.019
 see-0.019
 glad-0.018
 pad-0.018
 back-0.017
TokenLook
Token ID8567
Feature activation-4.68e-01

Pos logit contributions
works+0.028
 narrative+0.025
 matter+0.024
ports+0.023
Okay+0.022
Neg logit contributions
 reunited-0.046
 see-0.045
 glad-0.044
 back-0.043
 home-0.043
Token,
Token ID11
Feature activation-1.11e+00

Pos logit contributions
works+0.018
 matter+0.017
nexpected+0.016
ports+0.016
 meanings+0.015
Neg logit contributions
 see-0.015
 glad-0.014
 reunited-0.014
 thrilled-0.013
 pad-0.013
Token there
Token ID612
Feature activation-1.38e+00

Pos logit contributions
works+0.044
 matter+0.040
ports+0.037
 narrative+0.037
 meanings+0.037
Neg logit contributions
 see-0.036
 reunited-0.035
 glad-0.034
 pad-0.033
ends-0.032
Token's
Token ID338
Feature activation-2.30e+00

Pos logit contributions
works+0.068
 narrative+0.064
 matter+0.064
 meanings+0.061
der+0.060
Neg logit contributions
 see-0.037
 glad-0.032
 reunited-0.030
 back-0.030
 happy-0.027
Token a
Token ID257
Feature activation-8.41e-01

Pos logit contributions
works+0.102
 matter+0.098
 narrative+0.093
 meanings+0.091
ports+0.089
Neg logit contributions
 see-0.068
 glad-0.062
 reunited-0.060
 thrilled-0.055
 back-0.055
Token puppy
Token ID26188
Feature activation-4.10e-01

Pos logit contributions
works+0.035
 matter+0.034
 narrative+0.033
 meanings+0.031
ports+0.030
Neg logit contributions
 see-0.027
 glad-0.024
 reunited-0.023
 back-0.022
 thrilled-0.021
Token outside
Token ID2354
Feature activation-1.30e+00

Pos logit contributions
works+0.017
 matter+0.016
 narrative+0.016
ports+0.015
der+0.015
Neg logit contributions
 see-0.013
 glad-0.012
 reunited-0.011
 thrilled-0.011
 back-0.011
Token!"
Token ID2474
Feature activation-6.51e-01

Pos logit contributions
works+0.059
 matter+0.055
 narrative+0.055
 meanings+0.052
der+0.051
Neg logit contributions
 see-0.040
 glad-0.035
 reunited-0.034
 back-0.034
 happy-0.030
Token
Token ID198
Feature activation-2.21e-01

Pos logit contributions
works+0.027
 matter+0.025
 narrative+0.025
der+0.024
 meanings+0.024
Neg logit contributions
 see-0.020
 reunited-0.018
 glad-0.018
 back-0.017
 pad-0.017
Token not
Token ID407
Feature activation-7.21e-02

Pos logit contributions
works+0.072
 narrative+0.068
der+0.063
 matter+0.063
 meanings+0.062
Neg logit contributions
 see-0.038
 glad-0.031
 happy-0.029
 back-0.029
ends-0.026
Token know
Token ID760
Feature activation-1.18e+00

Pos logit contributions
 matter+0.003
works+0.003
 narrative+0.003
 meanings+0.003
 concept+0.003
Neg logit contributions
 glad-0.002
 see-0.002
 reunited-0.002
 back-0.002
 thrilled-0.002
Token what
Token ID644
Feature activation-4.15e-01

Pos logit contributions
works+0.062
 narrative+0.062
oral+0.058
 matter+0.057
 meanings+0.056
Neg logit contributions
 see-0.028
 glad-0.026
 back-0.024
 playing-0.023
 happy-0.022
Token a
Token ID257
Feature activation-1.10e+00

Pos logit contributions
works+0.025
 narrative+0.024
 matter+0.023
der+0.023
ist+0.022
Neg logit contributions
 see-0.008
 back-0.007
 glad-0.006
 happy-0.006
 outside-0.006
Token lab
Token ID2248
Feature activation-4.68e-01

Pos logit contributions
works+0.054
 narrative+0.051
der+0.049
 matter+0.047
 meanings+0.047
Neg logit contributions
 see-0.035
 back-0.028
 glad-0.028
 happy-0.027
 home-0.026
Token was
Token ID373
Feature activation-2.28e+00

Pos logit contributions
works+0.021
 narrative+0.021
 matter+0.020
ports+0.020
nexpected+0.020
Neg logit contributions
 see-0.014
 back-0.013
 reunited-0.011
 glad-0.011
 happy-0.011
Token,
Token ID11
Feature activation-8.37e-01

Pos logit contributions
works+0.107
 narrative+0.101
 matter+0.098
 meanings+0.094
der+0.093
Neg logit contributions
 see-0.069
 glad-0.060
 back-0.059
 reunited-0.057
 happy-0.053
Token but
Token ID475
Feature activation-4.80e-01

Pos logit contributions
works+0.036
 matter+0.034
 narrative+0.033
 meanings+0.031
der+0.031
Neg logit contributions
 see-0.027
 glad-0.024
 reunited-0.024
 back-0.023
 thrilled-0.021
Token she
Token ID673
Feature activation-4.99e-01

Pos logit contributions
works+0.025
 narrative+0.023
 matter+0.023
 meanings+0.021
ports+0.021
Neg logit contributions
 see-0.013
 glad-0.011
 back-0.010
 reunited-0.010
 happy-0.009
Token did
Token ID750
Feature activation-1.05e+00

Pos logit contributions
works+0.022
 narrative+0.020
 matter+0.019
ports+0.019
der+0.018
Neg logit contributions
 see-0.018
 glad-0.015
 back-0.015
 reunited-0.014
 happy-0.014
Token not
Token ID407
Feature activation+2.55e-01

Pos logit contributions
works+0.064
 narrative+0.059
 matter+0.054
oral+0.054
der+0.054
Neg logit contributions
 see-0.027
 back-0.021
 glad-0.020
 happy-0.020
ends-0.018
Token asleep
Token ID16039
Feature activation+3.57e-01

Pos logit contributions
works+0.007
 matter+0.006
 narrative+0.006
 meanings+0.006
der+0.006
Neg logit contributions
 see-0.003
 glad-0.003
 reunited-0.003
 back-0.003
 thrilled-0.002
Token.
Token ID13
Feature activation+4.62e-01

Pos logit contributions
 see+0.013
 back+0.011
 glad+0.011
 reunited+0.011
 happy+0.010
Neg logit contributions
works-0.016
 matter-0.015
 narrative-0.014
 meanings-0.014
nexpected-0.013
Token<|endoftext|>
Token ID50256
Feature activation+4.25e-01

Pos logit contributions
 see+0.015
 reunited+0.014
 back+0.013
 glad+0.013
 pad+0.012
Neg logit contributions
works-0.019
 narrative-0.018
 matter-0.018
 meanings-0.017
ports-0.017
TokenL
Token ID43
Feature activation-2.14e-01

Pos logit contributions
 see+0.013
 glad+0.012
 reunited+0.012
 back+0.011
 thrilled+0.011
Neg logit contributions
works-0.019
 narrative-0.017
 matter-0.017
ports-0.016
der-0.016
Tokenila
Token ID10102
Feature activation+3.12e-01

Pos logit contributions
works+0.008
 narrative+0.007
ports+0.007
 matter+0.007
oral+0.007
Neg logit contributions
 see-0.008
 glad-0.008
 reunited-0.007
 thrilled-0.007
 relieved-0.007
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
 glad+0.010
 see+0.010
 reunited+0.010
ends+0.009
 back+0.009
Neg logit contributions
works-0.012
 narrative-0.011
 matter-0.011
der-0.010
ports-0.010
Token Ben
Token ID3932
Feature activation+3.73e-01

Pos logit contributions
 reunited+0.051
 pad+0.048
ends+0.048
 glad+0.048
 thrilled+0.047
Neg logit contributions
 narrative-0.058
 matter-0.057
works-0.056
 meanings-0.055
ports-0.053
Token are
Token ID389
Feature activation-3.63e-01

Pos logit contributions
 glad+0.014
 reunited+0.013
 see+0.013
 thrilled+0.013
ends+0.013
Neg logit contributions
works-0.012
 narrative-0.012
 matter-0.011
ports-0.010
 meanings-0.010
Token at
Token ID379
Feature activation-1.16e-02

Pos logit contributions
 narrative+0.015
oral+0.013
ports+0.012
 none+0.012
works+0.012
Neg logit contributions
 see-0.010
 pad-0.009
ovi-0.008
 reunited-0.008
orse-0.008
Token the
Token ID262
Feature activation+2.47e-01

Pos logit contributions
works+0.000
 matter+0.000
 narrative+0.000
ports+0.000
ators+0.000
Neg logit contributions
 see-0.000
 reunited-0.000
 glad-0.000
 pad-0.000
 thrilled-0.000
Token zoo
Token ID26626
Feature activation-6.52e-02

Pos logit contributions
 see+0.008
 glad+0.008
 reunited+0.008
 thrilled+0.007
ends+0.007
Neg logit contributions
works-0.010
 matter-0.009
 narrative-0.009
ports-0.008
 meanings-0.008
Token After
Token ID2293
Feature activation+5.62e-01

Pos logit contributions
 see+0.011
 reunited+0.010
 glad+0.010
 back+0.009
 thrilled+0.009
Neg logit contributions
works-0.017
 narrative-0.016
der-0.015
ports-0.015
 meanings-0.015
Token a
Token ID257
Feature activation-3.71e-01

Pos logit contributions
 see+0.019
 glad+0.018
 reunited+0.017
 back+0.017
 happy+0.015
Neg logit contributions
works-0.024
 matter-0.022
 narrative-0.022
 meanings-0.020
der-0.020
Token while
Token ID981
Feature activation+1.13e-01

Pos logit contributions
works+0.013
 narrative+0.012
 matter+0.012
 meanings+0.011
ports+0.011
Neg logit contributions
 see-0.015
 glad-0.014
 back-0.013
 reunited-0.013
 happy-0.013
Token,
Token ID11
Feature activation+2.44e-02

Pos logit contributions
 see+0.002
 glad+0.002
 reunited+0.002
 back+0.002
 thrilled+0.002
Neg logit contributions
works-0.006
 matter-0.006
 narrative-0.006
der-0.005
ports-0.005
Token the
Token ID262
Feature activation+3.18e-03

Pos logit contributions
 see+0.001
 glad+0.001
 reunited+0.001
 back+0.001
 thrilled+0.001
Neg logit contributions
works-0.001
 matter-0.001
 narrative-0.001
 meanings-0.001
der-0.001
Token cookies
Token ID14746
Feature activation+1.55e+00

Pos logit contributions
 see+0.000
 glad+0.000
 back+0.000
 reunited+0.000
 happy+0.000
Neg logit contributions
works-0.000
 matter-0.000
 narrative-0.000
 meanings-0.000
ports-0.000
Token were
Token ID547
Feature activation+5.28e-01

Pos logit contributions
 see+0.054
 back+0.047
 glad+0.047
 reunited+0.046
 happy+0.043
Neg logit contributions
works-0.065
 matter-0.061
 narrative-0.060
 meanings-0.058
ports-0.057
Token ready
Token ID3492
Feature activation+5.42e-03

Pos logit contributions
 see+0.014
 reunited+0.012
 glad+0.011
 pad+0.011
ovi+0.011
Neg logit contributions
works-0.025
 matter-0.024
 narrative-0.023
 meanings-0.022
der-0.021
Token and
Token ID290
Feature activation+2.87e-01

Pos logit contributions
 see+0.000
 back+0.000
 glad+0.000
 reunited+0.000
 happy+0.000
Neg logit contributions
works-0.000
 narrative-0.000
 matter-0.000
 meanings-0.000
ports-0.000
Token they
Token ID484
Feature activation+1.06e+00

Pos logit contributions
 see+0.009
 glad+0.008
 reunited+0.008
 back+0.008
 pad+0.008
Neg logit contributions
works-0.012
 matter-0.011
 narrative-0.011
 meanings-0.010
ports-0.010
Token sat
Token ID3332
Feature activation-1.40e-01

Pos logit contributions
 see+0.032
 back+0.028
 glad+0.027
 reunited+0.027
 happy+0.025
Neg logit contributions
works-0.051
 matter-0.048
 narrative-0.048
 meanings-0.045
ports-0.045
Token loud
Token ID7812
Feature activation+1.22e-01

Pos logit contributions
works+0.018
 narrative+0.017
ports+0.016
Deal+0.016
cuts+0.015
Neg logit contributions
 see-0.009
 back-0.008
 happy-0.008
 glad-0.007
 outside-0.006
Token.
Token ID13
Feature activation+6.86e-01

Pos logit contributions
 see+0.004
 glad+0.004
 reunited+0.003
 back+0.003
 thrilled+0.003
Neg logit contributions
works-0.005
 matter-0.005
der-0.005
 narrative-0.005
 meanings-0.005
Token "
Token ID366
Feature activation+5.03e-01

Pos logit contributions
 see+0.019
 glad+0.017
 back+0.016
 reunited+0.016
 thrilled+0.014
Neg logit contributions
works-0.033
 narrative-0.031
 matter-0.031
 meanings-0.029
ports-0.029
TokenOne
Token ID3198
Feature activation-3.62e-01

Pos logit contributions
 see+0.020
 glad+0.017
 back+0.016
 pad+0.016
 reunited+0.015
Neg logit contributions
works-0.019
 narrative-0.017
der-0.017
 matter-0.017
 meanings-0.016
Token,
Token ID11
Feature activation+9.18e-01

Pos logit contributions
works+0.020
 narrative+0.019
ports+0.018
 meanings+0.018
der+0.018
Neg logit contributions
 see-0.008
 happy-0.006
 back-0.006
 glad-0.006
 welcome-0.006
Token two
Token ID734
Feature activation+1.51e+00

Pos logit contributions
 see+0.030
 glad+0.025
 back+0.025
 happy+0.023
 reunited+0.022
Neg logit contributions
works-0.043
 narrative-0.040
 matter-0.038
 meanings-0.038
nexpected-0.037
Token,
Token ID11
Feature activation+6.94e-01

Pos logit contributions
 see+0.032
 glad+0.026
 reunited+0.024
 back+0.024
 happy+0.021
Neg logit contributions
works-0.085
 narrative-0.080
 matter-0.078
 meanings-0.076
der-0.075
Token three
Token ID1115
Feature activation+5.50e-01

Pos logit contributions
 see+0.024
 back+0.020
 glad+0.020
 reunited+0.019
 happy+0.018
Neg logit contributions
works-0.030
 narrative-0.028
 matter-0.027
 meanings-0.026
nexpected-0.025
Token,
Token ID11
Feature activation+3.65e-01

Pos logit contributions
 see+0.012
 glad+0.010
 reunited+0.009
 happy+0.009
 back+0.009
Neg logit contributions
works-0.031
 narrative-0.029
 matter-0.028
 meanings-0.027
 concept-0.026
Token four
Token ID1440
Feature activation+1.07e+00

Pos logit contributions
 see+0.013
 glad+0.011
 back+0.011
 reunited+0.011
 happy+0.010
Neg logit contributions
works-0.015
 narrative-0.014
 matter-0.014
 meanings-0.013
der-0.013
Token,
Token ID11
Feature activation+4.31e-01

Pos logit contributions
 see+0.021
 glad+0.017
 happy+0.015
 reunited+0.015
 back+0.015
Neg logit contributions
works-0.063
 narrative-0.060
 matter-0.057
 meanings-0.055
 concept-0.054
Token What
Token ID1867
Feature activation-3.47e-01

Pos logit contributions
works+0.027
 narrative+0.026
 matter+0.026
 meanings+0.024
der+0.024
Neg logit contributions
 see-0.019
 glad-0.017
 back-0.016
 reunited-0.016
 happy-0.014
Token do
Token ID466
Feature activation-6.48e-01

Pos logit contributions
works+0.017
 narrative+0.016
lights+0.015
der+0.015
ports+0.015
Neg logit contributions
 see-0.010
 glad-0.008
 happy-0.008
 back-0.008
 welcome-0.008
Token you
Token ID345
Feature activation+3.81e-01

Pos logit contributions
der+0.037
works+0.037
 narrative+0.035
 concept+0.034
 matter+0.034
Neg logit contributions
 see-0.013
 glad-0.009
 back-0.008
 welcome-0.008
 happy-0.008
Token want
Token ID765
Feature activation+2.15e-01

Pos logit contributions
 glad+0.010
 reunited+0.010
 see+0.010
 back+0.009
 thrilled+0.009
Neg logit contributions
 matter-0.017
works-0.017
 narrative-0.016
 meanings-0.015
 concept-0.014
Token me
Token ID502
Feature activation+1.69e+00

Pos logit contributions
 see+0.005
 back+0.003
 glad+0.003
 reunited+0.003
 home+0.003
Neg logit contributions
works-0.012
 narrative-0.012
 matter-0.011
der-0.011
 meanings-0.011
Token to
Token ID284
Feature activation+1.82e+00

Pos logit contributions
 reunited+0.055
 glad+0.054
 pad+0.052
 relieved+0.050
 thrilled+0.049
Neg logit contributions
 matter-0.061
works-0.055
 cost-0.050
 meanings-0.050
ports-0.048
Token do
Token ID466
Feature activation+4.39e-01

Pos logit contributions
 glad+0.061
 reunited+0.059
 pad+0.057
 thrilled+0.055
 relieved+0.054
Neg logit contributions
 matter-0.073
works-0.068
 meanings-0.061
 narrative-0.060
oral-0.057
Token?"
Token ID1701
Feature activation-6.15e-01

Pos logit contributions
 see+0.013
 glad+0.012
 reunited+0.011
 back+0.011
 happy+0.010
Neg logit contributions
works-0.020
 narrative-0.019
 matter-0.019
 meanings-0.018
der-0.018
Token
Token ID220
Feature activation-1.66e-01

Pos logit contributions
works+0.029
 matter+0.027
 narrative+0.027
ports+0.025
 meanings+0.025
Neg logit contributions
 see-0.017
 glad-0.015
 reunited-0.015
 back-0.014
 thrilled-0.013
Token
Token ID198
Feature activation-2.51e-02

Pos logit contributions
works+0.007
 narrative+0.007
oral+0.006
der+0.006
ves+0.006
Neg logit contributions
 see-0.005
 reunited-0.005
 glad-0.004
 back-0.004
 pad-0.004
Token
Token ID198
Feature activation-1.49e-01

Pos logit contributions
works+0.001
 matter+0.001
 narrative+0.001
ports+0.001
oral+0.001
Neg logit contributions
 glad-0.001
 reunited-0.001
 see-0.001
 thrilled-0.001
 pad-0.001
Token welcome
Token ID7062
Feature activation+1.11e+00

Pos logit contributions
works+0.008
 matter+0.008
 narrative+0.008
 meanings+0.007
ports+0.007
Neg logit contributions
 see-0.005
 reunited-0.005
 glad-0.005
 back-0.005
 pad-0.004
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
 see+0.032
 glad+0.029
 reunited+0.027
 back+0.025
 thrilled+0.025
Neg logit contributions
works-0.052
 matter-0.048
 narrative-0.048
ports-0.045
 meanings-0.045
Token Maybe
Token ID6674
Feature activation+5.06e-01

Pos logit contributions
works+0.008
 matter+0.008
 narrative+0.007
 meanings+0.007
der+0.007
Neg logit contributions
 see-0.006
 glad-0.005
 reunited-0.005
 back-0.005
 happy-0.004
Token we
Token ID356
Feature activation-1.24e-01

Pos logit contributions
 see+0.015
 glad+0.013
 back+0.012
 reunited+0.012
 happy+0.011
Neg logit contributions
works-0.024
 narrative-0.023
 matter-0.022
 meanings-0.021
der-0.021
Token can
Token ID460
Feature activation+2.47e-01

Pos logit contributions
works+0.005
 matter+0.005
 narrative+0.005
 meanings+0.004
der+0.004
Neg logit contributions
 see-0.005
 glad-0.004
 back-0.004
 reunited-0.004
 happy-0.004
Token meet
Token ID1826
Feature activation+1.52e+00

Pos logit contributions
 see+0.006
 glad+0.006
 reunited+0.006
 back+0.005
 thrilled+0.005
Neg logit contributions
works-0.012
 matter-0.011
 narrative-0.011
 meanings-0.010
der-0.010
Token again
Token ID757
Feature activation+7.30e-01

Pos logit contributions
 see+0.043
 back+0.037
 glad+0.037
 reunited+0.035
 happy+0.032
Neg logit contributions
works-0.074
 narrative-0.070
 matter-0.069
 meanings-0.065
der-0.065
Token in
Token ID287
Feature activation+2.92e-01

Pos logit contributions
 see+0.022
 glad+0.020
 reunited+0.020
 back+0.019
 thrilled+0.018
Neg logit contributions
works-0.032
 matter-0.030
 narrative-0.030
der-0.028
 meanings-0.028
Token the
Token ID262
Feature activation-8.10e-01

Pos logit contributions
 see+0.009
 glad+0.009
 reunited+0.008
 pad+0.008
 thrilled+0.008
Neg logit contributions
works-0.011
 matter-0.011
 narrative-0.011
ports-0.010
 meanings-0.010
Token park
Token ID3952
Feature activation+5.94e-01

Pos logit contributions
works+0.037
 narrative+0.036
 matter+0.036
 meanings+0.033
ports+0.033
Neg logit contributions
 see-0.023
 glad-0.021
 reunited-0.021
 thrilled-0.018
 back-0.018
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
 see+0.020
 glad+0.018
 reunited+0.018
 back+0.017
 thrilled+0.016
Neg logit contributions
works-0.025
 matter-0.023
 narrative-0.023
 meanings-0.021
der-0.021
Token.
Token ID13
Feature activation+4.08e-01

Pos logit contributions
works+0.018
 narrative+0.017
 matter+0.016
 meanings+0.016
ports+0.015
Neg logit contributions
 see-0.014
 back-0.012
 glad-0.012
 reunited-0.012
 happy-0.011
Token She
Token ID1375
Feature activation+5.83e-01

Pos logit contributions
 see+0.012
 glad+0.011
 reunited+0.011
 back+0.010
 happy+0.009
Neg logit contributions
works-0.019
 narrative-0.017
 matter-0.017
 meanings-0.016
der-0.016
Token added
Token ID2087
Feature activation-4.09e-01

Pos logit contributions
 see+0.021
 glad+0.018
 reunited+0.017
 back+0.017
 happy+0.016
Neg logit contributions
works-0.025
 narrative-0.023
 matter-0.023
 meanings-0.021
ports-0.021
Token some
Token ID617
Feature activation+9.53e-02

Pos logit contributions
works+0.017
 matter+0.015
 narrative+0.015
 meanings+0.014
der+0.014
Neg logit contributions
 see-0.014
 glad-0.013
 reunited-0.013
 back-0.012
 thrilled-0.011
Token yellow
Token ID7872
Feature activation+7.87e-02

Pos logit contributions
 see+0.003
 glad+0.003
 back+0.003
 happy+0.003
 reunited+0.003
Neg logit contributions
works-0.004
 narrative-0.004
 matter-0.004
 meanings-0.004
ports-0.004
Token and
Token ID290
Feature activation+8.71e-01

Pos logit contributions
 see+0.003
 back+0.002
 glad+0.002
 reunited+0.002
 home+0.002
Neg logit contributions
works-0.003
 narrative-0.003
 matter-0.003
 meanings-0.003
ports-0.003
Token some
Token ID617
Feature activation+9.02e-02

Pos logit contributions
 see+0.030
 glad+0.026
 back+0.025
 reunited+0.025
 happy+0.024
Neg logit contributions
works-0.037
 narrative-0.035
 matter-0.034
 meanings-0.032
der-0.032
Token orange
Token ID10912
Feature activation+4.10e-01

Pos logit contributions
 see+0.003
 back+0.003
 glad+0.003
 happy+0.003
 reunited+0.002
Neg logit contributions
works-0.004
 narrative-0.004
 matter-0.004
ports-0.003
 meanings-0.003
Token,
Token ID11
Feature activation+6.72e-01

Pos logit contributions
 see+0.013
 back+0.012
 glad+0.012
 reunited+0.011
 happy+0.010
Neg logit contributions
works-0.018
 narrative-0.017
 matter-0.017
 meanings-0.016
ports-0.015
Token too
Token ID1165
Feature activation+9.63e-01

Pos logit contributions
 see+0.020
 glad+0.018
 back+0.017
 reunited+0.016
 happy+0.016
Neg logit contributions
works-0.032
 narrative-0.030
 matter-0.029
 meanings-0.028
ports-0.027
Token.
Token ID13
Feature activation+8.76e-01

Pos logit contributions
 see+0.032
 glad+0.027
 happy+0.027
 back+0.026
 thrilled+0.023
Neg logit contributions
works-0.044
 narrative-0.044
 matter-0.040
ports-0.040
oral-0.039
Tokenummy
Token ID13513
Feature activation+7.34e-01

Pos logit contributions
 see+0.038
 glad+0.035
 reunited+0.034
 back+0.033
 thrilled+0.031
Neg logit contributions
works-0.032
 matter-0.030
 narrative-0.029
der-0.028
 meanings-0.027
Token snack
Token ID26906
Feature activation+1.81e-01

Pos logit contributions
 see+0.028
 glad+0.026
 reunited+0.026
ends+0.024
 back+0.024
Neg logit contributions
works-0.026
 matter-0.025
 narrative-0.023
 meanings-0.022
der-0.022
Token all
Token ID477
Feature activation-1.94e-01

Pos logit contributions
 see+0.006
 glad+0.006
 reunited+0.006
 back+0.005
 happy+0.005
Neg logit contributions
works-0.007
 matter-0.007
 narrative-0.007
 meanings-0.006
ports-0.006
Token by
Token ID416
Feature activation-9.78e-02

Pos logit contributions
works+0.009
 narrative+0.009
 matter+0.008
 meanings+0.008
ports+0.008
Neg logit contributions
 see-0.006
 glad-0.005
 back-0.005
 reunited-0.005
 happy-0.005
Token herself
Token ID5223
Feature activation-3.80e-01

Pos logit contributions
works+0.006
 narrative+0.005
 matter+0.005
 meanings+0.005
ports+0.005
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.002
 back-0.002
 happy-0.001
Token.
Token ID13
Feature activation+2.74e-01

Pos logit contributions
works+0.015
 narrative+0.014
 matter+0.014
 meanings+0.013
ports+0.013
Neg logit contributions
 see-0.014
 glad-0.013
 back-0.013
 reunited-0.012
 happy-0.012
Token<|endoftext|>
Token ID50256
Feature activation+1.74e-01

Pos logit contributions
 see+0.008
 reunited+0.007
 back+0.007
 glad+0.007
 happy+0.006
Neg logit contributions
works-0.012
 narrative-0.011
 meanings-0.011
der-0.011
 matter-0.011
TokenOnce
Token ID7454
Feature activation-6.29e-03

Pos logit contributions
 see+0.005
 glad+0.005
 reunited+0.004
 back+0.004
 thrilled+0.004
Neg logit contributions
works-0.008
 narrative-0.007
 matter-0.007
ports-0.007
 meanings-0.007
Token upon
Token ID2402
Feature activation+1.32e+00

Pos logit contributions
works+0.000
 matter+0.000
 narrative+0.000
oral+0.000
der+0.000
Neg logit contributions
 see-0.000
 reunited-0.000
 glad-0.000
 pad-0.000
 back-0.000
Token a
Token ID257
Feature activation-1.02e+00

Pos logit contributions
 see+0.042
 reunited+0.041
 glad+0.040
 back+0.038
 pad+0.036
Neg logit contributions
 matter-0.050
works-0.049
 narrative-0.046
der-0.044
 meanings-0.044
Token time
Token ID640
Feature activation+5.17e-02

Pos logit contributions
 matter+0.030
 narrative+0.027
works+0.027
nexpected+0.024
 meanings+0.023
Neg logit contributions
 see-0.045
 reunited-0.044
 glad-0.041
 back-0.041
 thrilled-0.038
Token with
Token ID351
Feature activation-9.93e-02

Pos logit contributions
works+0.014
 narrative+0.013
 matter+0.013
der+0.012
 meanings+0.012
Neg logit contributions
 see-0.009
 glad-0.008
 reunited-0.008
 back-0.007
 thrilled-0.007
Token her
Token ID607
Feature activation-6.82e-01

Pos logit contributions
works+0.006
 narrative+0.005
 matter+0.005
 meanings+0.005
oral+0.005
Neg logit contributions
 see-0.002
 back-0.002
 glad-0.002
 reunited-0.002
 happy-0.002
Token and
Token ID290
Feature activation+2.67e-01

Pos logit contributions
works+0.034
 narrative+0.030
 matter+0.030
nexpected+0.030
ports+0.030
Neg logit contributions
 see-0.019
 back-0.019
 glad-0.018
 happy-0.017
 home-0.016
Token the
Token ID262
Feature activation-3.64e-02

Pos logit contributions
 see+0.008
 reunited+0.007
 glad+0.007
 back+0.007
 thrilled+0.007
Neg logit contributions
works-0.011
 matter-0.011
 narrative-0.011
 meanings-0.010
der-0.010
Token leaves
Token ID5667
Feature activation-6.85e-01

Pos logit contributions
works+0.002
 matter+0.002
 narrative+0.001
ports+0.001
 meanings+0.001
Neg logit contributions
 see-0.001
 glad-0.001
 back-0.001
 reunited-0.001
 happy-0.001
Token.
Token ID13
Feature activation+6.19e-01

Pos logit contributions
works+0.030
 matter+0.028
 narrative+0.028
ports+0.027
 meanings+0.027
Neg logit contributions
 see-0.023
 back-0.022
 glad-0.020
 reunited-0.020
 happy-0.019
Token He
Token ID679
Feature activation-3.54e-03

Pos logit contributions
 see+0.020
 reunited+0.020
 glad+0.018
 back+0.018
 thrilled+0.016
Neg logit contributions
works-0.025
 narrative-0.024
der-0.024
 matter-0.023
 meanings-0.023
Token looked
Token ID3114
Feature activation-1.15e-01

Pos logit contributions
works+0.000
 matter+0.000
 narrative+0.000
ators+0.000
ports+0.000
Neg logit contributions
 see-0.000
 glad-0.000
 back-0.000
 happy-0.000
 reunited-0.000
Token out
Token ID503
Feature activation-7.09e-01

Pos logit contributions
 matter+0.004
works+0.004
der+0.004
 narrative+0.004
 meanings+0.004
Neg logit contributions
 see-0.004
 reunited-0.004
orse-0.003
ovi-0.003
 glad-0.003
Token the
Token ID262
Feature activation+8.53e-02

Pos logit contributions
works+0.038
 narrative+0.035
 matter+0.034
ports+0.033
 meanings+0.033
Neg logit contributions
 see-0.019
 back-0.016
 glad-0.016
 reunited-0.015
 happy-0.014
Token window
Token ID4324
Feature activation-1.41e-01

Pos logit contributions
 see+0.002
 glad+0.002
 back+0.002
 reunited+0.002
 happy+0.002
Neg logit contributions
works-0.004
 matter-0.004
 narrative-0.004
 meanings-0.004
der-0.004
Token and
Token ID290
Feature activation-2.55e-01

Pos logit contributions
works+0.014
 narrative+0.013
 matter+0.013
 meanings+0.013
oral+0.012
Neg logit contributions
 see-0.004
 back-0.004
 glad-0.003
 reunited-0.003
 happy-0.003
Token walked
Token ID6807
Feature activation+1.60e-01

Pos logit contributions
works+0.012
 narrative+0.011
 matter+0.011
oral+0.010
der+0.010
Neg logit contributions
 see-0.009
 back-0.007
 reunited-0.007
 glad-0.007
 happy-0.007
Token into
Token ID656
Feature activation-6.84e-01

Pos logit contributions
 see+0.004
 glad+0.004
 reunited+0.004
 back+0.003
 thrilled+0.003
Neg logit contributions
works-0.008
 matter-0.007
 narrative-0.007
ports-0.007
 meanings-0.007
Token the
Token ID262
Feature activation-6.84e-01

Pos logit contributions
works+0.039
 narrative+0.037
 matter+0.036
 meanings+0.035
der+0.035
Neg logit contributions
 see-0.017
 back-0.013
 glad-0.012
 reunited-0.012
 happy-0.011
Token hole
Token ID7604
Feature activation-1.43e+00

Pos logit contributions
works+0.027
 narrative+0.025
 matter+0.025
 meanings+0.023
ports+0.023
Neg logit contributions
 see-0.024
 glad-0.022
 reunited-0.022
 thrilled-0.021
 back-0.020
Token.
Token ID13
Feature activation+3.91e-01

Pos logit contributions
works+0.066
 matter+0.065
 narrative+0.064
 meanings+0.061
ports+0.060
Neg logit contributions
 see-0.044
 back-0.043
 glad-0.039
 happy-0.038
 reunited-0.038
Token It
Token ID632
Feature activation+9.56e-02

Pos logit contributions
 see+0.014
 glad+0.013
 reunited+0.013
 back+0.012
 happy+0.012
Neg logit contributions
works-0.015
 narrative-0.013
 matter-0.013
der-0.013
ports-0.013
Token was
Token ID373
Feature activation-9.10e-02

Pos logit contributions
 see+0.003
 back+0.003
 reunited+0.003
 glad+0.003
 happy+0.003
Neg logit contributions
works-0.004
 narrative-0.004
 meanings-0.004
 matter-0.004
der-0.004
Token dark
Token ID3223
Feature activation-2.62e-01

Pos logit contributions
works+0.004
 matter+0.004
 narrative+0.004
 meanings+0.004
der+0.004
Neg logit contributions
 see-0.003
 glad-0.002
 reunited-0.002
 back-0.002
 thrilled-0.002
Token and
Token ID290
Feature activation+3.77e-03

Pos logit contributions
works+0.012
 matter+0.011
der+0.011
 narrative+0.010
lights+0.010
Neg logit contributions
 see-0.008
 glad-0.008
 reunited-0.007
 thrilled-0.007
ends-0.006
Token cold
Token ID4692
Feature activation-6.29e-01

Pos logit contributions
 see+0.000
 glad+0.000
 back+0.000
 reunited+0.000
 happy+0.000
Neg logit contributions
works-0.000
 narrative-0.000
 matter-0.000
 meanings-0.000
der-0.000
Token running
Token ID2491
Feature activation+4.41e-01

Pos logit contributions
 see+0.006
 glad+0.005
 pad+0.005
 reunited+0.005
ends+0.005
Neg logit contributions
works-0.011
 matter-0.010
 narrative-0.010
ports-0.010
 concept-0.009
Token around
Token ID1088
Feature activation-4.73e-02

Pos logit contributions
 see+0.011
 glad+0.010
 reunited+0.009
 back+0.009
 happy+0.008
Neg logit contributions
works-0.022
 narrative-0.021
 matter-0.021
 meanings-0.020
ports-0.020
Token all
Token ID477
Feature activation-3.72e-01

Pos logit contributions
works+0.002
 narrative+0.002
 matter+0.002
 meanings+0.002
ports+0.002
Neg logit contributions
 see-0.001
 glad-0.001
 back-0.001
 reunited-0.001
 happy-0.001
Token day
Token ID1110
Feature activation-6.84e-02

Pos logit contributions
works+0.016
 matter+0.015
 narrative+0.015
 meanings+0.013
ports+0.013
Neg logit contributions
 see-0.012
 reunited-0.011
 glad-0.011
 back-0.010
ends-0.010
Token and
Token ID290
Feature activation-3.22e-01

Pos logit contributions
works+0.003
 matter+0.002
 narrative+0.002
der+0.002
 meanings+0.002
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.002
 back-0.002
 thrilled-0.002
Token wanted
Token ID2227
Feature activation+4.72e-01

Pos logit contributions
works+0.015
 matter+0.015
 narrative+0.014
 meanings+0.014
der+0.014
Neg logit contributions
 see-0.008
 reunited-0.007
 glad-0.007
 back-0.006
ends-0.006
Token to
Token ID284
Feature activation-4.95e-01

Pos logit contributions
 see+0.009
 back+0.008
 glad+0.007
 home+0.006
 happy+0.006
Neg logit contributions
works-0.028
 narrative-0.027
 matter-0.025
 meanings-0.025
oral-0.025
Token rest
Token ID1334
Feature activation-2.38e-01

Pos logit contributions
works+0.022
 matter+0.021
 narrative+0.020
 meanings+0.019
ports+0.019
Neg logit contributions
 glad-0.014
 see-0.013
 reunited-0.013
 back-0.012
 pad-0.012
Token.
Token ID13
Feature activation-2.92e-01

Pos logit contributions
works+0.012
 meanings+0.011
 matter+0.011
 narrative+0.011
oral+0.010
Neg logit contributions
 see-0.007
 back-0.007
 glad-0.006
 happy-0.006
 home-0.006
Token As
Token ID1081
Feature activation-3.07e-01

Pos logit contributions
works+0.013
 matter+0.012
 narrative+0.012
 meanings+0.012
der+0.012
Neg logit contributions
 see-0.009
 glad-0.008
 reunited-0.008
 back-0.007
 happy-0.007
Token he
Token ID339
Feature activation-1.04e+00

Pos logit contributions
works+0.015
 narrative+0.013
 matter+0.013
 meanings+0.013
der+0.013
Neg logit contributions
 see-0.009
 glad-0.008
 reunited-0.008
 back-0.008
 happy-0.007
Token he
Token ID339
Feature activation+9.62e-02

Pos logit contributions
works+0.014
 narrative+0.013
 matter+0.013
 meanings+0.013
der+0.013
Neg logit contributions
 see-0.008
 glad-0.007
 back-0.006
 reunited-0.006
 happy-0.006
Token said
Token ID531
Feature activation-1.43e-01

Pos logit contributions
 see+0.003
 back+0.002
 glad+0.002
 reunited+0.002
 happy+0.002
Neg logit contributions
works-0.005
 narrative-0.005
 matter-0.004
 meanings-0.004
der-0.004
Token.
Token ID13
Feature activation+1.71e-01

Pos logit contributions
works+0.006
 narrative+0.005
 matter+0.005
 meanings+0.005
ports+0.005
Neg logit contributions
 see-0.005
 glad-0.005
 back-0.005
 reunited-0.005
 happy-0.004
Token "
Token ID366
Feature activation-3.60e-01

Pos logit contributions
 see+0.006
 glad+0.005
 reunited+0.005
 back+0.005
 thrilled+0.004
Neg logit contributions
works-0.007
 narrative-0.007
 matter-0.007
der-0.006
 meanings-0.006
TokenI
Token ID40
Feature activation-3.00e-01

Pos logit contributions
works+0.012
 narrative+0.011
 matter+0.011
der+0.010
ports+0.010
Neg logit contributions
 see-0.015
 reunited-0.014
 glad-0.014
 back-0.014
 home-0.013
Token tried
Token ID3088
Feature activation+4.93e-02

Pos logit contributions
works+0.013
 narrative+0.011
 matter+0.011
ports+0.011
 meanings+0.011
Neg logit contributions
 see-0.011
 glad-0.010
 reunited-0.009
 back-0.009
 happy-0.009
Token to
Token ID284
Feature activation+1.04e-01

Pos logit contributions
 see+0.001
 glad+0.001
 reunited+0.001
 back+0.001
 thrilled+0.001
Neg logit contributions
works-0.002
 matter-0.002
 narrative-0.002
der-0.002
 meanings-0.002
Token warn
Token ID9828
Feature activation+3.80e-01

Pos logit contributions
 glad+0.003
 see+0.003
 reunited+0.003
 thrilled+0.003
 pad+0.003
Neg logit contributions
works-0.004
 matter-0.004
 narrative-0.004
 meanings-0.004
der-0.004
Token you
Token ID345
Feature activation-1.50e-01

Pos logit contributions
 see+0.010
 back+0.009
 glad+0.009
 reunited+0.008
 home+0.007
Neg logit contributions
works-0.019
 narrative-0.018
 matter-0.018
 meanings-0.017
der-0.017
Token,
Token ID11
Feature activation-2.73e-01

Pos logit contributions
works+0.007
 narrative+0.006
 matter+0.006
 meanings+0.006
ports+0.006
Neg logit contributions
 see-0.005
 glad-0.004
 back-0.004
 reunited-0.004
 happy-0.004
Token but
Token ID475
Feature activation+2.01e-01

Pos logit contributions
works+0.012
 matter+0.012
 narrative+0.011
der+0.011
 meanings+0.011
Neg logit contributions
 see-0.008
 glad-0.007
 reunited-0.007
 back-0.007
 thrilled-0.006
Token late
Token ID2739
Feature activation-4.82e-02

Pos logit contributions
works+0.003
 narrative+0.003
der+0.003
 meanings+0.003
oral+0.003
Neg logit contributions
 see-0.001
 back-0.001
 glad-0.001
 happy-0.001
 reunited-0.001
Token.
Token ID13
Feature activation-3.71e-01

Pos logit contributions
works+0.002
 matter+0.002
 narrative+0.002
der+0.002
 meanings+0.002
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.002
 back-0.002
 thrilled-0.001
Token It
Token ID632
Feature activation-8.55e-01

Pos logit contributions
works+0.016
 matter+0.015
 narrative+0.015
 meanings+0.014
ports+0.014
Neg logit contributions
 see-0.012
 glad-0.011
 reunited-0.011
 back-0.011
 happy-0.010
Token's
Token ID338
Feature activation-2.96e-01

Pos logit contributions
works+0.046
 narrative+0.045
 meanings+0.043
ports+0.042
der+0.042
Neg logit contributions
 see-0.019
 back-0.016
 glad-0.014
 reunited-0.013
 happy-0.013
Token time
Token ID640
Feature activation+1.69e-01

Pos logit contributions
works+0.014
 narrative+0.013
der+0.012
 matter+0.012
ports+0.012
Neg logit contributions
 see-0.009
 glad-0.008
 back-0.008
 reunited-0.007
 happy-0.007
Token to
Token ID284
Feature activation-4.55e-01

Pos logit contributions
 see+0.005
 glad+0.004
 reunited+0.004
 back+0.004
 thrilled+0.003
Neg logit contributions
works-0.008
 matter-0.008
 narrative-0.007
 meanings-0.007
ports-0.007
Token come
Token ID1282
Feature activation+7.60e-01

Pos logit contributions
works+0.021
 matter+0.020
 narrative+0.020
 meanings+0.018
der+0.018
Neg logit contributions
 see-0.013
 glad-0.012
 reunited-0.011
 back-0.011
 thrilled-0.010
Token inside
Token ID2641
Feature activation-1.60e-01

Pos logit contributions
 glad+0.019
 thrilled+0.018
 satisfied+0.016
 reunited+0.016
 delighted+0.016
Neg logit contributions
works-0.035
 matter-0.031
ist-0.029
ports-0.029
ators-0.029
Token and
Token ID290
Feature activation-9.14e-01

Pos logit contributions
works+0.007
 matter+0.007
 narrative+0.007
 meanings+0.006
der+0.006
Neg logit contributions
 see-0.005
 glad-0.004
 reunited-0.004
 back-0.004
 thrilled-0.003
Token get
Token ID651
Feature activation-2.16e-01

Pos logit contributions
works+0.038
 matter+0.037
 narrative+0.034
 meanings+0.032
ports+0.032
Neg logit contributions
 glad-0.027
 see-0.027
 reunited-0.026
 pad-0.025
 thrilled-0.025
Token dry
Token ID5894
Feature activation-4.14e-01

Pos logit contributions
works+0.011
 matter+0.010
 narrative+0.010
 meanings+0.010
ports+0.010
Neg logit contributions
 see-0.005
 glad-0.005
 reunited-0.004
 back-0.004
 thrilled-0.004
Token a
Token ID257
Feature activation+3.49e-01

Pos logit contributions
 matter+0.020
works+0.017
 narrative+0.017
 sounded+0.017
 cost+0.017
Neg logit contributions
 pad-0.019
 reunited-0.018
 see-0.018
ovi-0.016
 back-0.016
Token happy
Token ID3772
Feature activation-2.22e-01

Pos logit contributions
 see+0.010
 reunited+0.010
 pad+0.009
ovi+0.009
 glad+0.009
Neg logit contributions
 matter-0.014
works-0.014
 narrative-0.013
Deal-0.012
 concept-0.012
Token dog
Token ID3290
Feature activation+2.29e-01

Pos logit contributions
works+0.009
 matter+0.009
 narrative+0.008
der+0.008
 meanings+0.008
Neg logit contributions
 see-0.007
 glad-0.007
 reunited-0.007
ends-0.006
 thrilled-0.006
Token who
Token ID508
Feature activation-7.51e-01

Pos logit contributions
 see+0.006
 back+0.005
 glad+0.005
 reunited+0.004
 outside+0.004
Neg logit contributions
works-0.012
 narrative-0.012
 matter-0.012
 meanings-0.011
oral-0.011
Token loved
Token ID6151
Feature activation-5.34e-01

Pos logit contributions
works+0.034
 narrative+0.032
 matter+0.031
der+0.030
ports+0.029
Neg logit contributions
 see-0.025
 glad-0.022
 back-0.020
 reunited-0.020
 happy-0.020
Token to
Token ID284
Feature activation-7.48e-01

Pos logit contributions
works+0.029
 narrative+0.028
 matter+0.027
der+0.026
 meanings+0.026
Neg logit contributions
 see-0.012
 glad-0.010
 back-0.009
 reunited-0.009
 happy-0.008
Token go
Token ID467
Feature activation+3.94e-02

Pos logit contributions
works+0.040
 matter+0.038
 narrative+0.037
 meanings+0.036
der+0.035
Neg logit contributions
 see-0.017
 glad-0.015
 reunited-0.014
 back-0.013
 thrilled-0.011
Token on
Token ID319
Feature activation+3.45e-01

Pos logit contributions
 see+0.001
 glad+0.001
 reunited+0.001
 back+0.001
 thrilled+0.001
Neg logit contributions
works-0.002
 matter-0.002
 narrative-0.002
ports-0.002
 meanings-0.002
Token walks
Token ID11114
Feature activation+5.17e-01

Pos logit contributions
 see+0.010
 back+0.009
 glad+0.008
 home+0.008
 happy+0.008
Neg logit contributions
works-0.018
 narrative-0.015
ist-0.015
 matter-0.015
der-0.015
Token with
Token ID351
Feature activation-1.06e+00

Pos logit contributions
 see+0.014
 back+0.013
 outside+0.012
 home+0.012
 happy+0.011
Neg logit contributions
works-0.028
 narrative-0.027
 matter-0.026
oral-0.025
 meanings-0.025
Token his
Token ID465
Feature activation-1.04e+00

Pos logit contributions
works+0.055
 narrative+0.051
 matter+0.049
der+0.048
 meanings+0.047
Neg logit contributions
 see-0.029
 glad-0.025
 back-0.024
 reunited-0.023
 happy-0.022
Token.
Token ID13
Feature activation+3.80e-01

Pos logit contributions
works+0.017
 matter+0.015
 narrative+0.015
 meanings+0.014
der+0.014
Neg logit contributions
 see-0.016
 glad-0.015
 reunited-0.014
 back-0.014
 thrilled-0.013
Token Sometimes
Token ID8975
Feature activation+3.67e-01

Pos logit contributions
 see+0.012
 glad+0.011
 back+0.010
 reunited+0.010
 happy+0.009
Neg logit contributions
works-0.017
 matter-0.016
 narrative-0.015
 meanings-0.015
der-0.014
Token,
Token ID11
Feature activation-2.87e-01

Pos logit contributions
 see+0.010
 glad+0.008
 back+0.008
 happy+0.007
 reunited+0.007
Neg logit contributions
works-0.019
 narrative-0.017
 matter-0.016
ports-0.016
oral-0.016
Token sharing
Token ID7373
Feature activation-5.87e-02

Pos logit contributions
works+0.013
 narrative+0.012
 matter+0.012
 meanings+0.012
ports+0.012
Neg logit contributions
 see-0.009
 glad-0.008
 back-0.007
 reunited-0.007
 happy-0.007
Token is
Token ID318
Feature activation+1.35e-01

Pos logit contributions
works+0.002
 matter+0.002
 narrative+0.002
 meanings+0.002
der+0.002
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.002
 back-0.002
 happy-0.002
Token better
Token ID1365
Feature activation-1.27e-01

Pos logit contributions
 see+0.004
 glad+0.004
 back+0.003
 happy+0.003
 reunited+0.003
Neg logit contributions
works-0.007
 narrative-0.006
ports-0.006
nexpected-0.006
 matter-0.006
Token."
Token ID526
Feature activation-1.14e-01

Pos logit contributions
works+0.005
 matter+0.005
 narrative+0.005
der+0.004
 meanings+0.004
Neg logit contributions
 see-0.004
 glad-0.004
 reunited-0.004
 back-0.004
 thrilled-0.004
Token
Token ID198
Feature activation-3.01e-01

Pos logit contributions
 narrative+0.005
der+0.004
oral+0.004
works+0.004
ports+0.004
Neg logit contributions
 reunited-0.004
 see-0.003
 pad-0.003
 back-0.003
 glad-0.003
Token
Token ID198
Feature activation-5.86e-01

Pos logit contributions
oral+0.011
works+0.010
 narrative+0.010
ves+0.010
der+0.010
Neg logit contributions
 reunited-0.010
 pad-0.009
 thrilled-0.009
 back-0.009
 see-0.009
TokenBen
Token ID11696
Feature activation-2.30e-01

Pos logit contributions
 narrative+0.022
works+0.021
 matter+0.020
oral+0.019
ports+0.019
Neg logit contributions
 see-0.022
 reunited-0.021
 glad-0.020
 back-0.020
 thrilled-0.019
Token and
Token ID290
Feature activation-5.32e-01

Pos logit contributions
works+0.010
der+0.010
 narrative+0.010
 matter+0.009
ports+0.009
Neg logit contributions
 see-0.006
 glad-0.006
 reunited-0.006
 back-0.005
 home-0.005
Token girl
Token ID2576
Feature activation+1.10e+00

Pos logit contributions
 see+0.021
 back+0.018
 glad+0.017
 happy+0.016
 reunited+0.016
Neg logit contributions
works-0.039
 narrative-0.036
ports-0.035
 meanings-0.035
 matter-0.034
Token were
Token ID547
Feature activation+1.36e-01

Pos logit contributions
 see+0.042
 glad+0.039
 reunited+0.038
 back+0.037
 thrilled+0.035
Neg logit contributions
works-0.040
 narrative-0.036
 matter-0.036
 meanings-0.034
der-0.033
Token friends
Token ID2460
Feature activation+1.28e-01

Pos logit contributions
 pad+0.005
 see+0.005
ovi+0.004
 reunited+0.004
 downstairs+0.004
Neg logit contributions
 narrative-0.004
 matter-0.004
works-0.004
Deal-0.004
nexpected-0.004
Token.
Token ID13
Feature activation+5.17e-02

Pos logit contributions
 see+0.004
 glad+0.004
 back+0.004
 reunited+0.004
 happy+0.004
Neg logit contributions
works-0.006
 matter-0.005
 narrative-0.005
 meanings-0.005
ports-0.005
Token<|endoftext|>
Token ID50256
Feature activation+3.04e-01

Pos logit contributions
 see+0.001
 glad+0.001
 reunited+0.001
 back+0.001
 thrilled+0.001
Neg logit contributions
works-0.003
 narrative-0.002
 matter-0.002
 meanings-0.002
der-0.002
TokenOnce
Token ID7454
Feature activation-8.89e-02

Pos logit contributions
 see+0.007
 glad+0.006
 reunited+0.006
 back+0.005
 thrilled+0.005
Neg logit contributions
works-0.016
 narrative-0.015
 matter-0.015
ports-0.014
 meanings-0.014
Token upon
Token ID2402
Feature activation+1.59e+00

Pos logit contributions
works+0.003
 narrative+0.003
 matter+0.003
oral+0.002
der+0.002
Neg logit contributions
 see-0.004
 reunited-0.003
 glad-0.003
 back-0.003
 pad-0.003
Token a
Token ID257
Feature activation-8.55e-01

Pos logit contributions
 see+0.050
 reunited+0.049
 glad+0.049
 back+0.045
 pad+0.045
Neg logit contributions
works-0.059
 matter-0.059
 narrative-0.055
der-0.052
ports-0.052
Token time
Token ID640
Feature activation-1.93e-01

Pos logit contributions
works+0.026
 matter+0.026
 narrative+0.025
 meanings+0.022
nexpected+0.022
Neg logit contributions
 see-0.036
 reunited-0.034
 glad-0.033
 back-0.033
 thrilled-0.030
Token,
Token ID11
Feature activation+1.24e-01

Pos logit contributions
works+0.008
 narrative+0.008
 matter+0.007
nexpected+0.007
ves+0.007
Neg logit contributions
 reunited-0.005
 see-0.005
 glad-0.005
 pad-0.005
 back-0.004
Token there
Token ID612
Feature activation+6.15e-01

Pos logit contributions
 see+0.004
 reunited+0.004
 glad+0.003
ovi+0.003
 pad+0.003
Neg logit contributions
works-0.005
 matter-0.005
 narrative-0.005
nexpected-0.004
ports-0.004
Token and
Token ID290
Feature activation+2.18e-01

Pos logit contributions
works+0.017
 matter+0.017
 narrative+0.016
der+0.016
 meanings+0.015
Neg logit contributions
 see-0.009
 glad-0.008
 reunited-0.008
 back-0.007
 thrilled-0.007
Token asked
Token ID1965
Feature activation-7.68e-02

Pos logit contributions
 see+0.006
 glad+0.005
 reunited+0.005
 back+0.005
 pad+0.005
Neg logit contributions
works-0.010
 matter-0.009
 narrative-0.009
ports-0.009
 meanings-0.009
Token,
Token ID11
Feature activation-6.31e-01

Pos logit contributions
works+0.004
 matter+0.004
 narrative+0.004
 meanings+0.004
der+0.003
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.002
 back-0.001
 thrilled-0.001
Token "
Token ID366
Feature activation-4.93e-01

Pos logit contributions
works+0.027
 matter+0.025
 narrative+0.024
der+0.023
 meanings+0.023
Neg logit contributions
 see-0.021
 glad-0.019
 reunited-0.018
 back-0.018
 thrilled-0.016
TokenWhat
Token ID2061
Feature activation-7.36e-01

Pos logit contributions
works+0.020
 narrative+0.018
 matter+0.018
 meanings+0.018
der+0.017
Neg logit contributions
 see-0.018
 glad-0.016
 back-0.016
 reunited-0.015
 happy-0.014
Token's
Token ID338
Feature activation-1.35e+00

Pos logit contributions
works+0.033
 matter+0.030
 narrative+0.030
der+0.028
 meanings+0.028
Neg logit contributions
 see-0.024
 glad-0.021
 reunited-0.020
 back-0.020
 happy-0.019
Token wrong
Token ID2642
Feature activation+4.51e-02

Pos logit contributions
works+0.043
 matter+0.039
 narrative+0.039
 meanings+0.035
der+0.035
Neg logit contributions
 see-0.060
 glad-0.055
 reunited-0.054
 back-0.053
 thrilled-0.050
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
 see+0.001
 glad+0.001
 reunited+0.001
 pad+0.001
 back+0.001
Neg logit contributions
works-0.002
 matter-0.002
 narrative-0.002
der-0.002
ports-0.002
Token Lily
Token ID20037
Feature activation-6.74e-02

Pos logit contributions
works+0.049
 narrative+0.046
 matter+0.045
 meanings+0.043
ports+0.043
Neg logit contributions
 see-0.029
 glad-0.026
 back-0.024
 reunited-0.024
 happy-0.022
Token?"
Token ID1701
Feature activation+6.88e-02

Pos logit contributions
works+0.002
 matter+0.002
 narrative+0.002
ports+0.002
der+0.002
Neg logit contributions
 see-0.003
 glad-0.002
 reunited-0.002
 thrilled-0.002
 back-0.002
Token
Token ID198
Feature activation+3.32e-01

Pos logit contributions
 see+0.002
 glad+0.002
 reunited+0.002
 back+0.001
 thrilled+0.001
Neg logit contributions
works-0.003
 narrative-0.003
 matter-0.003
der-0.003
 meanings-0.003
Tokenmy
Token ID1820
Feature activation-6.45e-01

Pos logit contributions
 see+0.000
 glad+0.000
 reunited+0.000
 thrilled+0.000
 back+0.000
Neg logit contributions
works-0.001
 matter-0.001
 narrative-0.001
ports-0.001
 meanings-0.001
Token.
Token ID13
Feature activation-6.16e-01

Pos logit contributions
works+0.024
 narrative+0.022
 matter+0.022
 meanings+0.020
der+0.020
Neg logit contributions
 see-0.025
 glad-0.023
 reunited-0.022
 back-0.022
 happy-0.020
Token It
Token ID632
Feature activation-1.42e+00

Pos logit contributions
works+0.029
 matter+0.027
 narrative+0.027
 meanings+0.025
der+0.025
Neg logit contributions
 see-0.018
 glad-0.016
 back-0.015
 reunited-0.015
 happy-0.014
Token's
Token ID338
Feature activation-4.96e-01

Pos logit contributions
works+0.059
 narrative+0.056
 matter+0.055
 meanings+0.053
der+0.052
Neg logit contributions
 see-0.050
 glad-0.043
 back-0.043
 reunited-0.040
 happy-0.039
Token the
Token ID262
Feature activation-2.34e-01

Pos logit contributions
works+0.025
 matter+0.023
 narrative+0.023
 meanings+0.022
der+0.021
Neg logit contributions
 see-0.013
 glad-0.011
 reunited-0.011
 back-0.011
 thrilled-0.010
Token stars
Token ID5788
Feature activation-1.26e+00

Pos logit contributions
works+0.010
nexpected+0.009
ves+0.009
der+0.009
ports+0.009
Neg logit contributions
 see-0.008
 back-0.007
 glad-0.007
 happy-0.006
 home-0.006
Token,
Token ID11
Feature activation-3.69e-01

Pos logit contributions
works+0.056
 matter+0.051
 narrative+0.050
der+0.048
ports+0.047
Neg logit contributions
 see-0.039
 glad-0.036
 reunited-0.035
 back-0.033
 thrilled-0.031
Token the
Token ID262
Feature activation+2.56e-02

Pos logit contributions
works+0.017
 narrative+0.016
 matter+0.016
 meanings+0.015
der+0.015
Neg logit contributions
 see-0.011
 glad-0.010
 back-0.009
 reunited-0.009
 happy-0.009
Token planets
Token ID14705
Feature activation-1.13e+00

Pos logit contributions
 see+0.001
 back+0.001
 glad+0.001
 happy+0.001
 reunited+0.001
Neg logit contributions
works-0.001
nexpected-0.001
 narrative-0.001
 matter-0.001
der-0.001
Token,
Token ID11
Feature activation-3.90e-01

Pos logit contributions
works+0.053
 matter+0.048
 narrative+0.046
der+0.045
 meanings+0.044
Neg logit contributions
 see-0.032
 glad-0.030
 reunited-0.029
 back-0.027
 thrilled-0.026
Token and
Token ID290
Feature activation-1.89e-01

Pos logit contributions
works+0.019
 matter+0.017
 narrative+0.017
 meanings+0.016
der+0.016
Neg logit contributions
 see-0.011
 glad-0.010
 reunited-0.010
 back-0.009
 thrilled-0.008
Token He
Token ID679
Feature activation-1.94e-01

Pos logit contributions
works+0.005
 matter+0.005
 narrative+0.005
 meanings+0.004
der+0.004
Neg logit contributions
 see-0.003
 glad-0.002
 reunited-0.002
 back-0.002
 thrilled-0.002
Token liked
Token ID8288
Feature activation+4.64e-01

Pos logit contributions
works+0.007
 matter+0.006
 narrative+0.006
 meanings+0.006
der+0.006
Neg logit contributions
 see-0.007
 glad-0.007
 reunited-0.007
 back-0.007
 thrilled-0.006
Token to
Token ID284
Feature activation-1.00e-01

Pos logit contributions
 see+0.014
 reunited+0.013
 glad+0.013
ends+0.012
 relieved+0.012
Neg logit contributions
works-0.019
 matter-0.019
 meanings-0.018
der-0.017
 narrative-0.017
Token explore
Token ID7301
Feature activation+1.18e+00

Pos logit contributions
works+0.005
 matter+0.005
 narrative+0.005
 meanings+0.005
der+0.005
Neg logit contributions
 see-0.002
 glad-0.002
 reunited-0.002
 back-0.002
 thrilled-0.002
Token and
Token ID290
Feature activation-3.96e-01

Pos logit contributions
 see+0.029
 back+0.024
 glad+0.023
 happy+0.022
 outside+0.021
Neg logit contributions
works-0.066
 narrative-0.062
 matter-0.060
 meanings-0.059
der-0.057
Token one
Token ID530
Feature activation-1.26e+00

Pos logit contributions
works+0.018
 matter+0.018
 narrative+0.018
 meanings+0.017
der+0.016
Neg logit contributions
 see-0.010
 reunited-0.010
 glad-0.010
 back-0.009
 thrilled-0.008
Token day
Token ID1110
Feature activation+1.60e-01

Pos logit contributions
works+0.055
 matter+0.052
 narrative+0.051
 meanings+0.048
der+0.047
Neg logit contributions
 see-0.040
 glad-0.036
 reunited-0.035
 back-0.033
 thrilled-0.031
Token he
Token ID339
Feature activation-2.51e-01

Pos logit contributions
 see+0.005
 glad+0.004
 back+0.004
 reunited+0.004
 happy+0.003
Neg logit contributions
works-0.008
 narrative-0.007
 matter-0.007
 meanings-0.007
ports-0.007
Token went
Token ID1816
Feature activation+3.70e-01

Pos logit contributions
works+0.014
 matter+0.013
 narrative+0.013
 meanings+0.012
der+0.012
Neg logit contributions
 see-0.005
 glad-0.004
 reunited-0.004
 back-0.004
 happy-0.003
Token to
Token ID284
Feature activation+1.96e-01

Pos logit contributions
 see+0.010
 glad+0.009
 reunited+0.008
 thrilled+0.008
 back+0.007
Neg logit contributions
works-0.018
 narrative-0.016
 matter-0.016
ports-0.015
der-0.015
Token a
Token ID257
Feature activation-2.15e-01

Pos logit contributions
 glad+0.006
 reunited+0.006
 thrilled+0.005
 pad+0.005
ends+0.005
Neg logit contributions
 matter-0.008
works-0.007
 narrative-0.007
ports-0.007
 concept-0.007
Token enjoy
Token ID2883
Feature activation-4.56e-03

Pos logit contributions
works+0.020
 matter+0.019
 narrative+0.019
 meanings+0.018
der+0.017
Neg logit contributions
 see-0.014
 glad-0.013
 reunited-0.012
 back-0.012
 thrilled-0.011
Token singing
Token ID13777
Feature activation+4.27e-01

Pos logit contributions
works+0.000
 narrative+0.000
 matter+0.000
 meanings+0.000
ports+0.000
Neg logit contributions
 see-0.000
 glad-0.000
 back-0.000
 reunited-0.000
 happy-0.000
Token,
Token ID11
Feature activation-6.53e-01

Pos logit contributions
 see+0.015
 glad+0.013
 back+0.013
 reunited+0.012
 happy+0.012
Neg logit contributions
works-0.018
 matter-0.017
 narrative-0.017
 meanings-0.016
ports-0.015
Token no
Token ID645
Feature activation-6.51e-01

Pos logit contributions
works+0.027
 matter+0.027
 narrative+0.026
der+0.025
 meanings+0.024
Neg logit contributions
 see-0.019
 reunited-0.018
 glad-0.017
 back-0.016
 pad-0.015
Token matter
Token ID2300
Feature activation-4.31e-01

Pos logit contributions
works+0.015
 narrative+0.012
ports+0.011
fighters+0.010
ves+0.010
Neg logit contributions
 see-0.038
 back-0.035
 home-0.034
 happy-0.034
 glad-0.034
Token what
Token ID644
Feature activation-1.38e+00

Pos logit contributions
 narrative+0.023
works+0.023
Okay+0.022
oral+0.021
fighters+0.020
Neg logit contributions
 see-0.009
 here-0.009
 playing-0.008
 back-0.008
 glad-0.008
Token."
Token ID526
Feature activation-5.62e-01

Pos logit contributions
works+0.067
 narrative+0.062
 matter+0.059
ports+0.056
ators+0.056
Neg logit contributions
 see-0.041
 glad-0.037
 back-0.035
 happy-0.034
 here-0.033
Token
Token ID198
Feature activation-2.69e-01

Pos logit contributions
 narrative+0.024
works+0.023
oral+0.022
 matter+0.022
der+0.021
Neg logit contributions
 see-0.015
 reunited-0.014
 pad-0.014
 glad-0.013
 back-0.013
Token
Token ID198
Feature activation-2.51e-01

Pos logit contributions
works+0.011
 narrative+0.011
oral+0.011
ves+0.010
ports+0.010
Neg logit contributions
 see-0.008
 reunited-0.007
 glad-0.007
 back-0.007
 pad-0.007
TokenL
Token ID43
Feature activation-3.90e-01

Pos logit contributions
works+0.009
 matter+0.008
 narrative+0.008
 meanings+0.007
der+0.007
Neg logit contributions
 see-0.010
 glad-0.010
 reunited-0.009
 back-0.009
 happy-0.009
Tokenily
Token ID813
Feature activation-6.61e-01

Pos logit contributions
works+0.018
 matter+0.017
 narrative+0.016
 meanings+0.016
ports+0.016
Neg logit contributions
 see-0.012
 glad-0.010
 reunited-0.010
 back-0.010
 happy-0.009
Token It
Token ID632
Feature activation+2.84e-01

Pos logit contributions
 see+0.011
 glad+0.009
 back+0.008
 reunited+0.007
 happy+0.007
Neg logit contributions
works-0.025
 matter-0.024
 narrative-0.023
 meanings-0.022
nexpected-0.022
Token wob
Token ID46867
Feature activation+4.38e-01

Pos logit contributions
 see+0.010
 back+0.009
 glad+0.009
 reunited+0.008
 happy+0.008
Neg logit contributions
works-0.012
 matter-0.010
 narrative-0.010
ports-0.010
 meanings-0.010
Tokenbles
Token ID7689
Feature activation-2.46e-01

Pos logit contributions
 see+0.018
 glad+0.016
 reunited+0.016
 back+0.015
 thrilled+0.014
Neg logit contributions
works-0.016
 narrative-0.015
 matter-0.014
der-0.014
ports-0.013
Token and
Token ID290
Feature activation+4.97e-01

Pos logit contributions
works+0.011
 matter+0.010
 narrative+0.010
der+0.010
 meanings+0.009
Neg logit contributions
 see-0.008
 glad-0.007
 reunited-0.007
 back-0.006
 thrilled-0.006
Token til
Token ID21502
Feature activation+2.03e-01

Pos logit contributions
 see+0.017
 glad+0.014
 back+0.014
 reunited+0.013
 happy+0.013
Neg logit contributions
works-0.023
 narrative-0.020
 matter-0.020
ports-0.019
 meanings-0.019
Tokents
Token ID912
Feature activation-1.40e+00

Pos logit contributions
 see+0.008
 back+0.008
 glad+0.008
 happy+0.007
 reunited+0.007
Neg logit contributions
works-0.006
 matter-0.006
 narrative-0.006
 concept-0.006
oral-0.006
Token.
Token ID13
Feature activation+2.88e-01

Pos logit contributions
works+0.060
 narrative+0.055
 matter+0.055
 meanings+0.052
ports+0.051
Neg logit contributions
 see-0.048
 glad-0.043
 back-0.042
 reunited-0.041
 happy-0.038
Token Sara
Token ID24799
Feature activation+7.00e-01

Pos logit contributions
 see+0.007
 glad+0.006
 back+0.005
 reunited+0.005
 happy+0.005
Neg logit contributions
works-0.016
 matter-0.015
 narrative-0.015
 meanings-0.014
ports-0.014
Token and
Token ID290
Feature activation+8.46e-01

Pos logit contributions
 see+0.022
 glad+0.021
 reunited+0.021
 back+0.019
 thrilled+0.018
Neg logit contributions
works-0.029
 narrative-0.027
 matter-0.027
der-0.027
 meanings-0.026
Token Ben
Token ID3932
Feature activation+5.08e-01

Pos logit contributions
 see+0.019
 glad+0.017
 reunited+0.016
 back+0.015
 thrilled+0.014
Neg logit contributions
works-0.045
 matter-0.042
 narrative-0.042
 meanings-0.041
der-0.040
Token lose
Token ID4425
Feature activation-2.43e-01

Pos logit contributions
 see+0.016
 glad+0.014
 reunited+0.014
 back+0.013
 thrilled+0.012
Neg logit contributions
works-0.023
 matter-0.021
 narrative-0.021
 meanings-0.020
der-0.020