TOP ACTIVATIONS
MAX = 3.055

the price - it was $ 5 . Sally was so
my learned that it 's not okay to take things without
story is that it 's not always about winning , but
told her that it 's not nice to hate people and
story is that it 's not nice to push people and
story is that it 's not nice to be naughty and
Jack learned that it 's not always good to dare and
Ben slides down , his rh ino falls out of his
The le opard toy costs $ 10 ."
Tim learned that it 's not always about winning . <|endoftext|>
learned that sometimes it 's not about winning , but about
story is that it 's not nice to play tricks on
is important and it 's not nice to be selfish .
She learned that it 's not nice to hit people and
Lily learned that it 's not always about winning , but
story is that it 's not good to take things that
Emma learned that it 's not nice to be en vious
He learned that it 's not always good to keep things
Lily realized that it 's not good to be en vious
told her that it 's not good to keep wild animals

MIN ACTIVATIONS
MIN = -2.976

a jacket . The p anda asked , "
present ?" B ree said , " Well ,
the night ?" The rabbit said , " I
too ?" The other child was a bit
said , " It feels thick ." " Yes
march ." Ped ro said , " Okay ,
luggage . The elephant asked , " What
said , " Wow , an elephant is amazing . Can
play !" F ro ggy and D ucky played
I found ! It 's an insect !" he said .
Hello little boy ," said the giant . " Why are
't be scared ," said the giant . " I 'm
the jungle ?" The child said , " I
a family who really enjoyed pic n ics . They loved
" What 's wrong ?" The little boy said , "
go see some birds !" The little orange bear was excited
eddy bear . Every day the child would visit the bear
the story . The stranger chuckled and said ,
me ?" The crab replied , " I
" Wow , an oct opus . It is

INTERVAL 1.547 - 3.055
CONTAINS 0.370%

Ben were poor . They lived in a small hut with
Maybe I can fly and help people after school . Maybe
She tells him about her fl ute . Ben wants to
" Hello , Mr . Snow man !" Lily
one straight side and one w avy side ." Tom agrees

INTERVAL 0.039 - 1.547
CONTAINS 46.838%

Once there was a boy . He was three years old
of trees and flowers . Tim my 's yard was very
and together they went inside . Max was careful as he
. " Help us !" The teacher hears them
goodbye to her friends and went home with her mom ,

INTERVAL -1.468 - 0.039
CONTAINS 52.453%

it ." L ily and Ben shake their heads
door . Anna 's mom opens the door . She sees
heart . But you can play with your football outside when
but his friend Sally couldn 't spell very well and accidentally
™ s okay , don â € ™ t worry .

INTERVAL -2.976 - -1.468
CONTAINS 0.339%

never seen so many beautiful things before . She watched the
L ila shook her head . She did not know
to safety . The baby was so grateful to
, the bear went on his way , knowing that he
." L ily felt better and smiled back .
Token the
Token ID262
Feature activation-7.12e-01

Pos logit contributions
 disl+0.020
 gn+0.017
col+0.016
handed+0.016
growth+0.016
Neg logit contributions
ap-0.014
ooks-0.012
apping-0.012
 upon-0.011
aps-0.011
Token price
Token ID2756
Feature activation+1.63e+00

Pos logit contributions
 disl+0.032
 gn+0.026
col+0.026
growth+0.025
handed+0.024
Neg logit contributions
ap-0.026
 upon-0.022
apping-0.021
ooks-0.021
aps-0.020
Token -
Token ID532
Feature activation+7.12e-01

Pos logit contributions
ap+0.062
ooks+0.052
apping+0.051
angles+0.047
aps+0.047
Neg logit contributions
 disl-0.068
 gn-0.063
col-0.059
handed-0.058
themed-0.052
Token it
Token ID340
Feature activation+4.27e-01

Pos logit contributions
ap+0.029
apping+0.024
ooks+0.024
aps+0.023
angles+0.022
Neg logit contributions
 disl-0.030
 gn-0.027
col-0.024
handed-0.024
growth-0.022
Token was
Token ID373
Feature activation+4.42e-01

Pos logit contributions
ap+0.016
ooks+0.014
apping+0.014
aps+0.013
angles+0.012
Neg logit contributions
 disl-0.019
 gn-0.017
col-0.016
handed-0.015
growth-0.014
Token $
Token ID720
Feature activation+3.05e+00

Pos logit contributions
ap+0.018
apping+0.015
ooks+0.015
aps+0.014
angles+0.014
Neg logit contributions
 disl-0.019
 gn-0.017
col-0.015
handed-0.015
growth-0.014
Token5
Token ID20
Feature activation+6.24e-01

Pos logit contributions
ap+0.111
ies+0.103
ooks+0.101
apping+0.089
angles+0.088
Neg logit contributions
handed-0.106
igsaw-0.100
col-0.100
riger-0.097
 disl-0.096
Token.
Token ID13
Feature activation-4.36e-01

Pos logit contributions
ap+0.023
ooks+0.020
apping+0.019
aps+0.018
angles+0.018
Neg logit contributions
 disl-0.027
 gn-0.024
col-0.023
handed-0.022
growth-0.021
Token Sally
Token ID25737
Feature activation-4.80e-01

Pos logit contributions
 disl+0.019
 gn+0.016
col+0.015
handed+0.015
growth+0.015
Neg logit contributions
ap-0.016
apping-0.014
ooks-0.014
 upon-0.013
aps-0.013
Token was
Token ID373
Feature activation-2.31e-01

Pos logit contributions
 disl+0.024
 gn+0.020
col+0.019
handed+0.019
growth+0.018
Neg logit contributions
ap-0.016
apping-0.013
ooks-0.013
 upon-0.013
aps-0.012
Token so
Token ID523
Feature activation-4.31e-01

Pos logit contributions
 disl+0.010
 gn+0.008
col+0.008
handed+0.007
growth+0.007
Neg logit contributions
ap-0.010
apping-0.008
ooks-0.008
 upon-0.008
aps-0.008
Tokenmy
Token ID1820
Feature activation-1.51e+00

Pos logit contributions
 disl+0.011
 gn+0.010
col+0.009
handed+0.009
growth+0.009
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
angles-0.003
Token learned
Token ID4499
Feature activation-3.05e-01

Pos logit contributions
 disl+0.080
igsaw+0.069
 gn+0.068
col+0.067
handed+0.066
Neg logit contributions
ap-0.045
ooks-0.033
apping-0.032
 upon-0.030
ies-0.030
Token that
Token ID326
Feature activation+2.32e-02

Pos logit contributions
 disl+0.013
 gn+0.011
col+0.010
handed+0.010
growth+0.010
Neg logit contributions
ap-0.012
apping-0.010
ooks-0.010
 upon-0.010
aps-0.010
Token it
Token ID340
Feature activation+2.14e-01

Pos logit contributions
ap+0.001
apping+0.001
ooks+0.001
 upon+0.001
aps+0.001
Neg logit contributions
 disl-0.001
 gn-0.001
col-0.001
handed-0.001
growth-0.001
Token's
Token ID338
Feature activation+8.17e-01

Pos logit contributions
ap+0.008
ooks+0.006
apping+0.006
aps+0.006
 upon+0.006
Neg logit contributions
 disl-0.010
 gn-0.009
col-0.008
handed-0.008
growth-0.008
Token not
Token ID407
Feature activation+3.01e+00

Pos logit contributions
ap+0.033
apping+0.029
aps+0.028
ooks+0.028
angles+0.027
Neg logit contributions
 disl-0.035
 gn-0.029
col-0.026
 dis-0.025
handed-0.025
Token okay
Token ID8788
Feature activation+1.20e+00

Pos logit contributions
ap+0.152
aps+0.147
apping+0.144
ucks+0.140
ooks+0.138
Neg logit contributions
 disl-0.097
 dis-0.069
 gn-0.069
 bad-0.055
col-0.053
Token to
Token ID284
Feature activation+6.55e-01

Pos logit contributions
ap+0.051
angles+0.044
ags+0.044
apping+0.043
ooks+0.043
Neg logit contributions
 disl-0.048
 gn-0.037
col-0.034
 dis-0.031
handed-0.031
Token take
Token ID1011
Feature activation+6.61e-01

Pos logit contributions
ap+0.023
apping+0.020
ooks+0.020
aps+0.019
angles+0.019
Neg logit contributions
 disl-0.031
 gn-0.026
col-0.024
handed-0.024
 dis-0.022
Token things
Token ID1243
Feature activation-5.27e-01

Pos logit contributions
ap+0.022
apping+0.019
ooks+0.019
 upon+0.018
aps+0.017
Neg logit contributions
 disl-0.032
 gn-0.027
col-0.026
handed-0.026
growth-0.025
Token without
Token ID1231
Feature activation+8.89e-01

Pos logit contributions
 disl+0.022
 gn+0.018
col+0.017
growth+0.016
handed+0.016
Neg logit contributions
ap-0.022
apping-0.019
ooks-0.018
 upon-0.018
angles-0.017
Token story
Token ID1621
Feature activation+1.24e+00

Pos logit contributions
 disl+0.072
handed+0.064
growth+0.062
col+0.061
 gn+0.061
Neg logit contributions
ap-0.029
 upon-0.020
apping-0.019
ooks-0.018
 friends-0.018
Token is
Token ID318
Feature activation+1.00e+00

Pos logit contributions
ap+0.043
ooks+0.040
apping+0.037
ags+0.035
 upon+0.034
Neg logit contributions
 disl-0.053
 gn-0.044
col-0.043
igsaw-0.040
handed-0.039
Token that
Token ID326
Feature activation+8.16e-01

Pos logit contributions
ap+0.053
angles+0.050
ooks+0.048
ags+0.047
 upon+0.047
Neg logit contributions
 disl-0.027
 gn-0.019
col-0.017
handed-0.014
igsaw-0.013
Token it
Token ID340
Feature activation+7.75e-01

Pos logit contributions
ap+0.030
ooks+0.026
apping+0.026
ags+0.025
angles+0.025
Neg logit contributions
 disl-0.037
 gn-0.031
col-0.028
 dis-0.026
handed-0.026
Token's
Token ID338
Feature activation+8.42e-01

Pos logit contributions
ap+0.029
ooks+0.026
apping+0.025
aps+0.023
angles+0.023
Neg logit contributions
 disl-0.034
 gn-0.028
col-0.027
handed-0.025
themed-0.024
Token not
Token ID407
Feature activation+2.95e+00

Pos logit contributions
ap+0.034
apping+0.030
aps+0.030
ooks+0.029
angles+0.029
Neg logit contributions
 disl-0.034
 gn-0.029
col-0.026
handed-0.025
 dis-0.024
Token always
Token ID1464
Feature activation+1.72e+00

Pos logit contributions
ap+0.136
aps+0.136
ucks+0.132
apping+0.130
ooks+0.129
Neg logit contributions
 disl-0.099
 gn-0.069
 dis-0.068
working-0.063
col-0.059
Token about
Token ID546
Feature activation+7.62e-01

Pos logit contributions
ap+0.072
ooks+0.069
apping+0.067
aps+0.064
angles+0.064
Neg logit contributions
 disl-0.072
 gn-0.052
col-0.045
 dis-0.045
handed-0.043
Token winning
Token ID5442
Feature activation+7.54e-01

Pos logit contributions
ap+0.025
apping+0.020
ooks+0.020
angles+0.019
aps+0.019
Neg logit contributions
 disl-0.039
 gn-0.033
col-0.031
handed-0.030
growth-0.029
Token,
Token ID11
Feature activation+6.75e-01

Pos logit contributions
ap+0.028
ooks+0.024
apping+0.024
angles+0.022
aps+0.022
Neg logit contributions
 disl-0.034
 gn-0.028
col-0.027
handed-0.026
growth-0.025
Token but
Token ID475
Feature activation+5.34e-01

Pos logit contributions
ap+0.026
apping+0.022
ooks+0.022
aps+0.021
angles+0.020
Neg logit contributions
 disl-0.030
 gn-0.025
col-0.024
handed-0.022
growth-0.022
Token told
Token ID1297
Feature activation+3.50e-02

Pos logit contributions
 disl+0.002
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token her
Token ID607
Feature activation-8.26e-01

Pos logit contributions
ap+0.001
apping+0.001
ooks+0.001
aps+0.001
 upon+0.001
Neg logit contributions
 disl-0.002
 gn-0.002
col-0.002
handed-0.001
growth-0.001
Token that
Token ID326
Feature activation-3.75e-01

Pos logit contributions
 disl+0.036
 gn+0.031
col+0.031
handed+0.030
growth+0.030
Neg logit contributions
ap-0.031
ooks-0.026
apping-0.026
 upon-0.025
ies-0.024
Token it
Token ID340
Feature activation-1.38e-02

Pos logit contributions
 disl+0.017
 gn+0.014
col+0.013
handed+0.013
growth+0.013
Neg logit contributions
ap-0.014
apping-0.012
ooks-0.012
 upon-0.011
aps-0.011
Token's
Token ID338
Feature activation+6.83e-01

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.000
apping-0.000
ooks-0.000
 upon-0.000
aps-0.000
Token not
Token ID407
Feature activation+2.91e+00

Pos logit contributions
ap+0.025
apping+0.021
ooks+0.021
aps+0.020
angles+0.020
Neg logit contributions
 disl-0.031
 gn-0.027
col-0.024
handed-0.023
 dis-0.023
Token nice
Token ID3621
Feature activation+6.37e-01

Pos logit contributions
aps+0.147
ap+0.145
ucks+0.142
ooks+0.142
apping+0.141
Neg logit contributions
 disl-0.082
 gn-0.054
 dis-0.052
working-0.047
 bad-0.046
Token to
Token ID284
Feature activation+1.13e+00

Pos logit contributions
ap+0.024
ooks+0.020
apping+0.020
angles+0.019
aps+0.019
Neg logit contributions
 disl-0.029
 gn-0.024
col-0.023
handed-0.022
themed-0.020
Token hate
Token ID5465
Feature activation+3.17e-01

Pos logit contributions
ap+0.040
apping+0.038
ooks+0.038
aps+0.036
angles+0.035
Neg logit contributions
 disl-0.054
 gn-0.043
col-0.039
handed-0.038
 dis-0.037
Token people
Token ID661
Feature activation-3.32e-01

Pos logit contributions
ap+0.011
apping+0.009
ooks+0.009
aps+0.008
angles+0.008
Neg logit contributions
 disl-0.016
 gn-0.014
col-0.013
handed-0.013
growth-0.012
Token and
Token ID290
Feature activation+3.30e-01

Pos logit contributions
 disl+0.017
 gn+0.015
col+0.014
growth+0.014
handed+0.014
Neg logit contributions
ap-0.011
apping-0.008
ooks-0.008
 upon-0.008
aps-0.007
Token story
Token ID1621
Feature activation+7.45e-01

Pos logit contributions
 disl+0.065
growth+0.054
handed+0.054
col+0.054
 gn+0.053
Neg logit contributions
ap-0.027
 upon-0.020
ooks-0.019
apping-0.019
aps-0.017
Token is
Token ID318
Feature activation+6.50e-01

Pos logit contributions
ap+0.025
ooks+0.021
apping+0.021
angles+0.019
aps+0.018
Neg logit contributions
 disl-0.035
 gn-0.030
col-0.029
handed-0.027
themed-0.026
Token that
Token ID326
Feature activation+3.36e-01

Pos logit contributions
ap+0.034
angles+0.031
ooks+0.030
apping+0.030
aps+0.029
Neg logit contributions
 disl-0.019
 gn-0.014
col-0.013
handed-0.012
 pel-0.011
Token it
Token ID340
Feature activation+4.44e-01

Pos logit contributions
ap+0.013
ooks+0.012
apping+0.012
angles+0.011
aps+0.011
Neg logit contributions
 disl-0.015
 gn-0.012
col-0.011
handed-0.011
growth-0.010
Token's
Token ID338
Feature activation+7.79e-01

Pos logit contributions
ap+0.017
ooks+0.014
apping+0.014
aps+0.013
angles+0.013
Neg logit contributions
 disl-0.020
 gn-0.017
col-0.016
handed-0.015
growth-0.015
Token not
Token ID407
Feature activation+2.91e+00

Pos logit contributions
ap+0.032
apping+0.028
ooks+0.027
aps+0.027
angles+0.027
Neg logit contributions
 disl-0.032
 gn-0.027
col-0.024
handed-0.023
 dis-0.023
Token nice
Token ID3621
Feature activation+7.25e-01

Pos logit contributions
ap+0.146
aps+0.140
apping+0.137
ooks+0.136
angles+0.133
Neg logit contributions
 disl-0.089
 dis-0.059
 gn-0.058
working-0.054
large-0.050
Token to
Token ID284
Feature activation+8.97e-01

Pos logit contributions
ap+0.028
ooks+0.024
apping+0.023
angles+0.022
aps+0.022
Neg logit contributions
 disl-0.032
 gn-0.026
col-0.025
handed-0.024
growth-0.022
Token push
Token ID4574
Feature activation+4.79e-01

Pos logit contributions
ap+0.031
apping+0.028
ooks+0.028
angles+0.026
aps+0.026
Neg logit contributions
 disl-0.044
 gn-0.037
handed-0.034
col-0.033
 dis-0.031
Token people
Token ID661
Feature activation-5.76e-01

Pos logit contributions
ap+0.015
ooks+0.012
apping+0.012
aps+0.011
 upon+0.011
Neg logit contributions
 disl-0.025
 gn-0.021
col-0.020
handed-0.020
growth-0.019
Token and
Token ID290
Feature activation+6.01e-01

Pos logit contributions
 disl+0.030
 gn+0.027
col+0.025
growth+0.025
handed+0.024
Neg logit contributions
ap-0.018
apping-0.014
ooks-0.013
 upon-0.013
aps-0.012
Token story
Token ID1621
Feature activation+9.73e-01

Pos logit contributions
 disl+0.072
handed+0.063
 gn+0.062
growth+0.062
col+0.062
Neg logit contributions
ap-0.029
 upon-0.020
apping-0.019
ooks-0.019
 for-0.017
Token is
Token ID318
Feature activation+7.26e-01

Pos logit contributions
ap+0.034
ooks+0.030
apping+0.029
ags+0.027
 upon+0.026
Neg logit contributions
 disl-0.044
col-0.036
 gn-0.036
handed-0.033
igsaw-0.031
Token that
Token ID326
Feature activation+5.46e-01

Pos logit contributions
ap+0.038
angles+0.034
ooks+0.034
apping+0.034
 upon+0.033
Neg logit contributions
 disl-0.021
 gn-0.015
col-0.014
handed-0.012
igsaw-0.011
Token it
Token ID340
Feature activation+7.23e-01

Pos logit contributions
ap+0.020
ooks+0.017
apping+0.017
aps+0.017
angles+0.016
Neg logit contributions
 disl-0.025
 gn-0.021
col-0.019
handed-0.018
 dis-0.018
Token's
Token ID338
Feature activation+7.58e-01

Pos logit contributions
ap+0.027
ooks+0.024
apping+0.024
aps+0.022
 upon+0.022
Neg logit contributions
 disl-0.031
 gn-0.026
col-0.025
handed-0.024
themed-0.022
Token not
Token ID407
Feature activation+2.90e+00

Pos logit contributions
ap+0.031
apping+0.027
aps+0.027
ooks+0.026
angles+0.026
Neg logit contributions
 disl-0.031
 gn-0.026
col-0.023
handed-0.023
 dis-0.022
Token nice
Token ID3621
Feature activation+7.28e-01

Pos logit contributions
aps+0.142
ap+0.140
ucks+0.135
apping+0.134
ooks+0.131
Neg logit contributions
 disl-0.093
 dis-0.068
 gn-0.059
working-0.054
 bad-0.050
Token to
Token ID284
Feature activation+8.17e-01

Pos logit contributions
ap+0.027
ooks+0.023
apping+0.023
angles+0.022
aps+0.022
Neg logit contributions
 disl-0.032
 gn-0.026
col-0.025
handed-0.024
growth-0.023
Token be
Token ID307
Feature activation+3.82e-01

Pos logit contributions
ap+0.024
apping+0.022
ooks+0.021
aps+0.021
 upon+0.020
Neg logit contributions
 disl-0.044
 gn-0.036
handed-0.034
col-0.033
 dis-0.032
Token naughty
Token ID45128
Feature activation-3.81e-01

Pos logit contributions
ap+0.016
apping+0.014
ooks+0.014
angles+0.013
aps+0.013
Neg logit contributions
 disl-0.016
 gn-0.013
col-0.012
handed-0.012
themed-0.011
Token and
Token ID290
Feature activation+1.74e-01

Pos logit contributions
 disl+0.020
 gn+0.017
col+0.016
handed+0.016
growth+0.016
Neg logit contributions
ap-0.012
apping-0.010
ooks-0.009
 upon-0.009
aps-0.009
Token Jack
Token ID3619
Feature activation-9.29e-01

Pos logit contributions
 disl+0.010
 gn+0.008
col+0.008
handed+0.007
growth+0.007
Neg logit contributions
ap-0.010
apping-0.009
ooks-0.009
 upon-0.008
aps-0.008
Token learned
Token ID4499
Feature activation-2.48e-01

Pos logit contributions
 disl+0.050
 gn+0.042
col+0.040
handed+0.040
igsaw+0.040
Neg logit contributions
ap-0.028
apping-0.021
ooks-0.021
 upon-0.021
angles-0.019
Token that
Token ID326
Feature activation+3.73e-01

Pos logit contributions
 disl+0.011
 gn+0.009
col+0.008
handed+0.008
growth+0.008
Neg logit contributions
ap-0.010
apping-0.008
ooks-0.008
 upon-0.008
aps-0.008
Token it
Token ID340
Feature activation+3.91e-01

Pos logit contributions
ap+0.015
apping+0.013
ooks+0.013
aps+0.012
angles+0.012
Neg logit contributions
 disl-0.016
 gn-0.014
col-0.012
handed-0.012
growth-0.012
Token's
Token ID338
Feature activation+7.16e-01

Pos logit contributions
ap+0.015
ooks+0.012
apping+0.012
aps+0.011
angles+0.011
Neg logit contributions
 disl-0.018
 gn-0.015
col-0.014
handed-0.014
growth-0.013
Token not
Token ID407
Feature activation+2.90e+00

Pos logit contributions
ap+0.029
apping+0.025
ooks+0.025
aps+0.025
angles+0.023
Neg logit contributions
 disl-0.030
 gn-0.025
col-0.023
handed-0.022
 dis-0.022
Token always
Token ID1464
Feature activation+1.71e+00

Pos logit contributions
ap+0.138
aps+0.130
apping+0.128
ucks+0.125
ooks+0.125
Neg logit contributions
 disl-0.099
 dis-0.078
 gn-0.077
working-0.063
�-0.062
Token good
Token ID922
Feature activation+5.52e-01

Pos logit contributions
ap+0.071
ooks+0.070
apping+0.067
aps+0.064
ucks+0.062
Neg logit contributions
 disl-0.071
 gn-0.055
 dis-0.049
col-0.044
�-0.043
Token to
Token ID284
Feature activation+5.40e-01

Pos logit contributions
ap+0.020
ooks+0.017
apping+0.017
angles+0.016
aps+0.015
Neg logit contributions
 disl-0.025
 gn-0.021
col-0.020
handed-0.019
growth-0.019
Token dare
Token ID16498
Feature activation+1.75e+00

Pos logit contributions
ap+0.018
apping+0.016
ooks+0.015
aps+0.015
angles+0.014
Neg logit contributions
 disl-0.027
 gn-0.023
col-0.021
handed-0.021
growth-0.020
Token and
Token ID290
Feature activation+4.31e-01

Pos logit contributions
ap+0.069
ooks+0.063
apping+0.063
aps+0.060
angles+0.059
Neg logit contributions
 disl-0.072
 gn-0.058
col-0.057
 dis-0.050
handed-0.049
Token Ben
Token ID3932
Feature activation-6.85e-02

Pos logit contributions
ap+0.015
apping+0.013
ooks+0.012
aps+0.012
angles+0.012
Neg logit contributions
 disl-0.011
 gn-0.010
col-0.009
handed-0.008
growth-0.008
Token slides
Token ID19392
Feature activation+5.81e-01

Pos logit contributions
 disl+0.003
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.003
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token down
Token ID866
Feature activation+2.04e-01

Pos logit contributions
ap+0.024
ooks+0.021
apping+0.021
aps+0.019
angles+0.019
Neg logit contributions
 disl-0.023
 gn-0.020
col-0.019
handed-0.018
growth-0.017
Token,
Token ID11
Feature activation-8.82e-02

Pos logit contributions
ap+0.008
ooks+0.007
apping+0.007
angles+0.007
aps+0.007
Neg logit contributions
 disl-0.008
 gn-0.007
col-0.007
handed-0.006
growth-0.006
Token his
Token ID465
Feature activation+6.42e-01

Pos logit contributions
 disl+0.004
 gn+0.003
col+0.003
handed+0.003
growth+0.003
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token rh
Token ID9529
Feature activation+2.88e+00

Pos logit contributions
ap+0.023
ooks+0.020
apping+0.019
angles+0.018
 upon+0.017
Neg logit contributions
 disl-0.029
 gn-0.026
handed-0.023
col-0.023
growth-0.023
Tokenino
Token ID2879
Feature activation+4.55e-01

Pos logit contributions
ap+0.122
ooks+0.120
 upon+0.105
apping+0.100
othes+0.100
Neg logit contributions
handed-0.081
robat-0.075
cut-0.072
rob-0.068
l-0.068
Token falls
Token ID8953
Feature activation-1.71e-02

Pos logit contributions
ap+0.018
ooks+0.016
apping+0.016
angles+0.015
aps+0.015
Neg logit contributions
 disl-0.019
 gn-0.016
col-0.015
handed-0.014
growth-0.014
Token out
Token ID503
Feature activation-2.20e-01

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.000
ooks-0.000
 upon-0.000
aps-0.000
Token of
Token ID286
Feature activation-5.28e-01

Pos logit contributions
 disl+0.013
 gn+0.011
col+0.010
handed+0.010
growth+0.010
Neg logit contributions
ap-0.006
apping-0.005
ooks-0.004
 upon-0.004
aps-0.004
Token his
Token ID465
Feature activation+2.73e-01

Pos logit contributions
 disl+0.028
 gn+0.024
col+0.023
handed+0.022
themed+0.022
Neg logit contributions
ap-0.015
ooks-0.012
 upon-0.012
apping-0.012
aps-0.012
TokenThe
Token ID464
Feature activation-5.17e-01

Pos logit contributions
ap+0.037
ooks+0.032
apping+0.031
aps+0.030
ies+0.029
Neg logit contributions
 disl-0.033
col-0.028
 gn-0.027
handed-0.026
themed-0.025
Token le
Token ID443
Feature activation+5.36e-01

Pos logit contributions
 disl+0.019
col+0.014
 gn+0.014
growth+0.014
handed+0.014
Neg logit contributions
ap-0.024
ooks-0.020
apping-0.020
 upon-0.020
aps-0.019
Tokenopard
Token ID15478
Feature activation+9.18e-01

Pos logit contributions
ap+0.026
ooks+0.023
apping+0.023
 upon+0.022
angles+0.022
Neg logit contributions
 disl-0.018
 gn-0.015
col-0.013
handed-0.013
growth-0.012
Token toy
Token ID13373
Feature activation+4.65e-01

Pos logit contributions
ap+0.032
ooks+0.028
apping+0.028
aps+0.025
 upon+0.024
Neg logit contributions
 disl-0.044
 gn-0.038
col-0.035
handed-0.035
themed-0.032
Token costs
Token ID3484
Feature activation+9.12e-01

Pos logit contributions
ap+0.016
apping+0.014
ooks+0.014
aps+0.013
 upon+0.013
Neg logit contributions
 disl-0.022
 gn-0.019
col-0.018
handed-0.018
growth-0.017
Token $
Token ID720
Feature activation+2.87e+00

Pos logit contributions
ap+0.035
apping+0.030
ooks+0.029
angles+0.027
aps+0.027
Neg logit contributions
 disl-0.039
 gn-0.035
col-0.032
handed-0.031
growth-0.029
Token10
Token ID940
Feature activation-1.51e-01

Pos logit contributions
ap+0.103
ooks+0.091
ies+0.088
apping+0.088
 upon+0.085
Neg logit contributions
 disl-0.105
handed-0.102
col-0.099
 gn-0.095
riger-0.095
Token."
Token ID526
Feature activation-2.09e-01

Pos logit contributions
 disl+0.007
 gn+0.006
col+0.006
handed+0.005
growth+0.005
Neg logit contributions
ap-0.006
apping-0.005
ooks-0.005
 upon-0.004
aps-0.004
Token
Token ID220
Feature activation+9.72e-01

Pos logit contributions
 disl+0.010
 gn+0.008
col+0.008
handed+0.007
growth+0.007
Neg logit contributions
ap-0.008
ooks-0.006
apping-0.006
 upon-0.006
aps-0.006
Token
Token ID198
Feature activation+1.64e-01

Pos logit contributions
ap+0.043
apping+0.037
ooks+0.036
angles+0.035
ies+0.034
Neg logit contributions
 disl-0.035
 gn-0.028
col-0.027
handed-0.027
growth-0.024
Token
Token ID198
Feature activation+5.70e-01

Pos logit contributions
ap+0.007
apping+0.006
ooks+0.005
aps+0.005
angles+0.005
Neg logit contributions
 disl-0.007
 gn-0.006
col-0.006
handed-0.005
growth-0.005
Token Tim
Token ID5045
Feature activation-2.08e-01

Pos logit contributions
 disl+0.022
 gn+0.019
col+0.018
handed+0.018
growth+0.018
Neg logit contributions
ap-0.012
ooks-0.010
apping-0.010
 upon-0.009
aps-0.009
Token learned
Token ID4499
Feature activation-5.67e-01

Pos logit contributions
 disl+0.010
 gn+0.008
col+0.008
handed+0.008
growth+0.007
Neg logit contributions
ap-0.008
apping-0.006
ooks-0.006
 upon-0.006
aps-0.006
Token that
Token ID326
Feature activation+2.68e-01

Pos logit contributions
 disl+0.023
 gn+0.020
col+0.019
growth+0.019
handed+0.018
Neg logit contributions
ap-0.022
ooks-0.019
apping-0.019
 upon-0.018
aps-0.018
Token it
Token ID340
Feature activation+5.41e-01

Pos logit contributions
ap+0.011
ooks+0.009
apping+0.009
aps+0.008
angles+0.008
Neg logit contributions
 disl-0.012
 gn-0.010
col-0.009
handed-0.009
growth-0.008
Token's
Token ID338
Feature activation+9.57e-01

Pos logit contributions
ap+0.019
ooks+0.017
apping+0.016
aps+0.015
 upon+0.015
Neg logit contributions
 disl-0.025
 gn-0.022
col-0.020
handed-0.020
themed-0.019
Token not
Token ID407
Feature activation+2.87e+00

Pos logit contributions
ap+0.040
apping+0.035
ooks+0.035
aps+0.034
angles+0.033
Neg logit contributions
 disl-0.039
 gn-0.032
 dis-0.028
col-0.028
handed-0.027
Token always
Token ID1464
Feature activation+1.34e+00

Pos logit contributions
ap+0.138
ooks+0.136
aps+0.133
ucks+0.132
apping+0.129
Neg logit contributions
 disl-0.096
 dis-0.065
 gn-0.063
working-0.053
igator-0.053
Token about
Token ID546
Feature activation+8.96e-01

Pos logit contributions
ap+0.054
ooks+0.053
apping+0.050
angles+0.048
aps+0.047
Neg logit contributions
 disl-0.058
 gn-0.044
col-0.038
 dis-0.038
handed-0.038
Token winning
Token ID5442
Feature activation+7.90e-01

Pos logit contributions
ap+0.029
ooks+0.025
apping+0.025
angles+0.023
aps+0.023
Neg logit contributions
 disl-0.046
 gn-0.039
col-0.035
handed-0.035
growth-0.033
Token.
Token ID13
Feature activation-5.90e-02

Pos logit contributions
ap+0.030
ooks+0.026
apping+0.025
angles+0.024
aps+0.023
Neg logit contributions
 disl-0.035
 gn-0.029
col-0.027
handed-0.026
growth-0.025
Token<|endoftext|>
Token ID50256
Feature activation-3.49e-01

Pos logit contributions
 disl+0.003
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token learned
Token ID4499
Feature activation-4.40e-01

Pos logit contributions
 disl+0.009
 gn+0.008
col+0.007
handed+0.007
growth+0.007
Neg logit contributions
ap-0.007
apping-0.005
ooks-0.005
 upon-0.005
aps-0.005
Token that
Token ID326
Feature activation+5.30e-01

Pos logit contributions
 disl+0.016
 gn+0.013
col+0.012
handed+0.012
growth+0.012
Neg logit contributions
ap-0.020
apping-0.018
ooks-0.017
 upon-0.017
aps-0.017
Token sometimes
Token ID3360
Feature activation+9.69e-01

Pos logit contributions
ap+0.019
ooks+0.017
apping+0.016
angles+0.015
aps+0.015
Neg logit contributions
 disl-0.024
 gn-0.021
col-0.019
handed-0.018
growth-0.018
Token it
Token ID340
Feature activation+8.14e-01

Pos logit contributions
ap+0.038
ooks+0.036
apping+0.034
ags+0.033
aps+0.033
Neg logit contributions
 disl-0.040
 gn-0.033
 dis-0.029
handed-0.028
col-0.028
Token's
Token ID338
Feature activation+1.08e+00

Pos logit contributions
ap+0.031
ooks+0.029
apping+0.028
aps+0.026
 upon+0.025
Neg logit contributions
 disl-0.035
 gn-0.028
col-0.027
handed-0.025
themed-0.024
Token not
Token ID407
Feature activation+2.87e+00

Pos logit contributions
ap+0.046
apping+0.041
aps+0.041
ooks+0.040
angles+0.038
Neg logit contributions
 disl-0.041
 gn-0.034
 dis-0.030
col-0.029
handed-0.028
Token about
Token ID546
Feature activation+1.04e+00

Pos logit contributions
ap+0.133
aps+0.129
ucks+0.128
ooks+0.125
apping+0.124
Neg logit contributions
 disl-0.094
 dis-0.073
 gn-0.073
 ju-0.059
 pel-0.059
Token winning
Token ID5442
Feature activation+9.30e-01

Pos logit contributions
ap+0.036
ooks+0.031
apping+0.030
angles+0.028
aps+0.028
Neg logit contributions
 disl-0.051
 gn-0.043
col-0.038
handed-0.037
 dis-0.036
Token,
Token ID11
Feature activation+3.03e-01

Pos logit contributions
ap+0.036
ooks+0.032
apping+0.031
aps+0.029
angles+0.029
Neg logit contributions
 disl-0.040
 gn-0.033
col-0.030
handed-0.029
growth-0.029
Token but
Token ID475
Feature activation+1.37e-01

Pos logit contributions
ap+0.011
apping+0.009
ooks+0.009
aps+0.009
angles+0.008
Neg logit contributions
 disl-0.014
 gn-0.012
col-0.011
handed-0.011
growth-0.011
Token about
Token ID546
Feature activation+7.72e-01

Pos logit contributions
ap+0.005
apping+0.004
ooks+0.004
aps+0.004
 upon+0.004
Neg logit contributions
 disl-0.006
 gn-0.005
col-0.005
handed-0.005
growth-0.005
Token story
Token ID1621
Feature activation+1.07e+00

Pos logit contributions
 disl+0.061
handed+0.053
col+0.051
 gn+0.051
growth+0.050
Neg logit contributions
ap-0.029
 upon-0.022
apping-0.021
ooks-0.020
 eyes-0.019
Token is
Token ID318
Feature activation+1.17e+00

Pos logit contributions
ap+0.036
ooks+0.032
apping+0.030
angles+0.028
ags+0.028
Neg logit contributions
 disl-0.050
 gn-0.041
col-0.041
handed-0.037
igsaw-0.036
Token that
Token ID326
Feature activation+1.04e+00

Pos logit contributions
ap+0.057
angles+0.055
ooks+0.051
 upon+0.051
apping+0.051
Neg logit contributions
 disl-0.035
 gn-0.026
col-0.023
 pel-0.020
 dis-0.020
Token it
Token ID340
Feature activation+8.20e-01

Pos logit contributions
ap+0.039
ooks+0.036
apping+0.035
angles+0.034
ags+0.034
Neg logit contributions
 disl-0.044
 gn-0.037
col-0.033
 dis-0.032
handed-0.031
Token's
Token ID338
Feature activation+8.43e-01

Pos logit contributions
ap+0.030
ooks+0.027
apping+0.027
angles+0.025
aps+0.025
Neg logit contributions
 disl-0.036
 gn-0.030
col-0.029
handed-0.027
growth-0.026
Token not
Token ID407
Feature activation+2.87e+00

Pos logit contributions
ap+0.034
apping+0.031
aps+0.030
ooks+0.029
angles+0.029
Neg logit contributions
 disl-0.035
 gn-0.029
col-0.026
handed-0.025
 dis-0.025
Token nice
Token ID3621
Feature activation+7.47e-01

Pos logit contributions
ap+0.124
aps+0.121
apping+0.119
ooks+0.118
ucks+0.118
Neg logit contributions
 disl-0.105
 gn-0.073
 dis-0.071
working-0.066
igator-0.064
Token to
Token ID284
Feature activation+7.28e-01

Pos logit contributions
ap+0.028
ooks+0.024
apping+0.023
angles+0.022
aps+0.022
Neg logit contributions
 disl-0.033
 gn-0.027
col-0.027
handed-0.025
growth-0.024
Token play
Token ID711
Feature activation+1.68e-01

Pos logit contributions
ap+0.024
ooks+0.021
apping+0.021
aps+0.020
angles+0.020
Neg logit contributions
 disl-0.037
 gn-0.031
handed-0.029
col-0.028
growth-0.027
Token tricks
Token ID15910
Feature activation-2.23e-02

Pos logit contributions
ap+0.005
ooks+0.004
apping+0.004
 upon+0.004
angles+0.004
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.007
handed-0.007
growth-0.007
Token on
Token ID319
Feature activation+4.11e-01

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token is
Token ID318
Feature activation+1.12e+00

Pos logit contributions
ap+0.040
apping+0.035
ooks+0.035
aps+0.031
 upon+0.030
Neg logit contributions
 disl-0.051
 gn-0.041
col-0.040
handed-0.037
 dis-0.036
Token important
Token ID1593
Feature activation+3.17e-01

Pos logit contributions
ap+0.052
apping+0.046
ooks+0.043
aps+0.043
angles+0.041
Neg logit contributions
 disl-0.043
 gn-0.034
col-0.030
 dis-0.029
handed-0.028
Token and
Token ID290
Feature activation+6.98e-01

Pos logit contributions
ap+0.011
ooks+0.009
apping+0.009
 upon+0.008
angles+0.008
Neg logit contributions
 disl-0.016
 gn-0.013
col-0.013
handed-0.012
growth-0.012
Token it
Token ID340
Feature activation+8.62e-01

Pos logit contributions
ap+0.026
apping+0.021
ooks+0.021
aps+0.020
 upon+0.020
Neg logit contributions
 disl-0.033
 gn-0.027
col-0.025
handed-0.024
 dis-0.023
Token's
Token ID338
Feature activation+1.20e+00

Pos logit contributions
ap+0.033
ooks+0.029
apping+0.029
aps+0.026
 upon+0.026
Neg logit contributions
 disl-0.038
 gn-0.032
col-0.030
handed-0.028
themed-0.027
Token not
Token ID407
Feature activation+2.87e+00

Pos logit contributions
ap+0.052
apping+0.048
aps+0.046
ooks+0.045
 upon+0.044
Neg logit contributions
 disl-0.046
 gn-0.035
 dis-0.033
handed-0.031
col-0.030
Token nice
Token ID3621
Feature activation+3.67e-01

Pos logit contributions
ap+0.138
aps+0.132
apping+0.130
ooks+0.129
ucks+0.129
Neg logit contributions
 disl-0.094
 dis-0.072
 gn-0.068
 ju-0.060
 bad-0.053
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
ap+0.013
ooks+0.011
apping+0.011
angles+0.010
aps+0.010
Neg logit contributions
 disl-0.017
 gn-0.015
col-0.014
handed-0.014
growth-0.013
Token be
Token ID307
Feature activation+6.00e-01

Pos logit contributions
ap+0.039
apping+0.036
ooks+0.035
aps+0.033
 upon+0.032
Neg logit contributions
 disl-0.050
 gn-0.040
handed-0.037
col-0.036
 dis-0.035
Token selfish
Token ID20363
Feature activation-2.87e-01

Pos logit contributions
ap+0.026
apping+0.022
ooks+0.022
angles+0.021
aps+0.021
Neg logit contributions
 disl-0.025
 gn-0.019
col-0.018
handed-0.018
 dis-0.016
Token.
Token ID13
Feature activation-9.29e-02

Pos logit contributions
 disl+0.015
 gn+0.013
col+0.012
handed+0.012
growth+0.012
Neg logit contributions
ap-0.009
apping-0.007
ooks-0.007
 upon-0.007
aps-0.007
Token She
Token ID1375
Feature activation-2.74e-02

Pos logit contributions
 disl+0.028
 gn+0.025
col+0.023
handed+0.023
growth+0.022
Neg logit contributions
ap-0.016
ooks-0.013
apping-0.013
 upon-0.012
aps-0.012
Token learned
Token ID4499
Feature activation-9.12e-01

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token that
Token ID326
Feature activation+1.40e-01

Pos logit contributions
 disl+0.038
 gn+0.033
col+0.030
growth+0.030
riger+0.030
Neg logit contributions
ap-0.034
 upon-0.029
ooks-0.028
apping-0.028
 exactly-0.028
Token it
Token ID340
Feature activation+1.02e-01

Pos logit contributions
ap+0.005
apping+0.004
ooks+0.004
aps+0.004
 upon+0.004
Neg logit contributions
 disl-0.006
 gn-0.005
col-0.005
handed-0.005
growth-0.005
Token's
Token ID338
Feature activation+1.02e+00

Pos logit contributions
ap+0.004
apping+0.003
ooks+0.003
aps+0.003
 upon+0.003
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.004
growth-0.004
Token not
Token ID407
Feature activation+2.86e+00

Pos logit contributions
ap+0.042
apping+0.038
aps+0.037
ooks+0.036
angles+0.034
Neg logit contributions
 disl-0.042
 gn-0.033
col-0.030
 dis-0.030
handed-0.029
Token nice
Token ID3621
Feature activation+6.65e-01

Pos logit contributions
ap+0.145
aps+0.138
apping+0.136
ooks+0.134
ucks+0.134
Neg logit contributions
 disl-0.095
 dis-0.064
 gn-0.059
large-0.051
 bad-0.050
Token to
Token ID284
Feature activation+8.01e-01

Pos logit contributions
ap+0.025
ooks+0.021
apping+0.021
angles+0.020
aps+0.019
Neg logit contributions
 disl-0.030
 gn-0.025
col-0.024
handed-0.022
growth-0.021
Token hit
Token ID2277
Feature activation+8.48e-01

Pos logit contributions
ap+0.025
apping+0.022
ooks+0.022
aps+0.020
angles+0.020
Neg logit contributions
 disl-0.043
 gn-0.035
col-0.033
handed-0.033
 dis-0.031
Token people
Token ID661
Feature activation-6.14e-01

Pos logit contributions
ap+0.029
apping+0.025
ooks+0.025
aps+0.023
angles+0.022
Neg logit contributions
 disl-0.041
 gn-0.035
col-0.034
handed-0.032
growth-0.031
Token and
Token ID290
Feature activation+2.23e-01

Pos logit contributions
 disl+0.033
 gn+0.030
col+0.028
growth+0.028
handed+0.027
Neg logit contributions
ap-0.018
apping-0.014
 upon-0.013
ooks-0.012
aps-0.012
Token Lily
Token ID20037
Feature activation-1.15e+00

Pos logit contributions
 disl+0.006
 gn+0.006
col+0.005
handed+0.005
growth+0.005
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token learned
Token ID4499
Feature activation-2.05e-02

Pos logit contributions
 disl+0.065
 gn+0.056
col+0.054
handed+0.054
igsaw+0.054
Neg logit contributions
ap-0.031
apping-0.022
ooks-0.022
 upon-0.021
angles-0.020
Token that
Token ID326
Feature activation+4.86e-01

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token it
Token ID340
Feature activation+6.08e-01

Pos logit contributions
ap+0.021
ooks+0.018
apping+0.018
angles+0.017
aps+0.017
Neg logit contributions
 disl-0.020
 gn-0.017
col-0.015
handed-0.014
growth-0.014
Token's
Token ID338
Feature activation+9.16e-01

Pos logit contributions
ap+0.023
ooks+0.020
apping+0.020
aps+0.018
angles+0.018
Neg logit contributions
 disl-0.027
 gn-0.022
col-0.021
handed-0.020
growth-0.019
Token not
Token ID407
Feature activation+2.86e+00

Pos logit contributions
ap+0.037
apping+0.033
aps+0.032
ooks+0.032
angles+0.030
Neg logit contributions
 disl-0.038
 gn-0.031
col-0.027
 dis-0.027
handed-0.027
Token always
Token ID1464
Feature activation+1.58e+00

Pos logit contributions
ap+0.134
aps+0.128
apping+0.126
ooks+0.126
ucks+0.123
Neg logit contributions
 disl-0.099
 dis-0.075
 gn-0.075
working-0.061
�-0.060
Token about
Token ID546
Feature activation+9.13e-01

Pos logit contributions
ap+0.066
ooks+0.065
apping+0.062
angles+0.058
aps+0.058
Neg logit contributions
 disl-0.066
 gn-0.049
 dis-0.044
col-0.041
handed-0.040
Token winning
Token ID5442
Feature activation+7.97e-01

Pos logit contributions
ap+0.030
apping+0.025
ooks+0.025
angles+0.024
aps+0.023
Neg logit contributions
 disl-0.047
 gn-0.040
col-0.036
handed-0.035
 dis-0.034
Token,
Token ID11
Feature activation+5.36e-01

Pos logit contributions
ap+0.030
ooks+0.026
apping+0.026
angles+0.024
aps+0.024
Neg logit contributions
 disl-0.036
 gn-0.030
col-0.028
handed-0.027
growth-0.026
Token but
Token ID475
Feature activation+1.98e-01

Pos logit contributions
ap+0.020
apping+0.017
ooks+0.017
aps+0.016
angles+0.016
Neg logit contributions
 disl-0.025
 gn-0.021
col-0.020
handed-0.018
growth-0.018
Token story
Token ID1621
Feature activation+6.47e-01

Pos logit contributions
 disl+0.056
handed+0.047
col+0.046
 gn+0.046
growth+0.045
Neg logit contributions
ap-0.025
 upon-0.019
ooks-0.018
apping-0.018
angles-0.017
Token is
Token ID318
Feature activation+7.93e-01

Pos logit contributions
ap+0.021
ooks+0.018
apping+0.018
angles+0.016
 upon+0.016
Neg logit contributions
 disl-0.031
 gn-0.026
col-0.026
handed-0.024
growth-0.024
Token that
Token ID326
Feature activation+7.49e-01

Pos logit contributions
ap+0.043
angles+0.039
apping+0.038
ooks+0.038
aps+0.037
Neg logit contributions
 disl-0.021
 gn-0.015
col-0.014
handed-0.012
 dis-0.011
Token it
Token ID340
Feature activation+4.18e-01

Pos logit contributions
ap+0.028
ooks+0.024
apping+0.024
angles+0.024
aps+0.023
Neg logit contributions
 disl-0.034
 gn-0.028
col-0.025
handed-0.024
 dis-0.024
Token's
Token ID338
Feature activation+8.41e-01

Pos logit contributions
ap+0.015
ooks+0.013
apping+0.013
aps+0.012
angles+0.012
Neg logit contributions
 disl-0.019
 gn-0.016
col-0.015
handed-0.015
growth-0.014
Token not
Token ID407
Feature activation+2.85e+00

Pos logit contributions
ap+0.034
apping+0.030
aps+0.029
ooks+0.028
angles+0.028
Neg logit contributions
 disl-0.036
 gn-0.030
col-0.027
 dis-0.026
handed-0.026
Token good
Token ID922
Feature activation+1.02e+00

Pos logit contributions
ap+0.139
aps+0.137
apping+0.134
ucks+0.132
ooks+0.128
Neg logit contributions
 disl-0.093
 dis-0.066
 gn-0.064
working-0.054
 bad-0.052
Token to
Token ID284
Feature activation+5.22e-01

Pos logit contributions
ap+0.040
apping+0.034
ooks+0.034
angles+0.034
aps+0.032
Neg logit contributions
 disl-0.043
 gn-0.035
col-0.033
handed-0.031
 dis-0.030
Token take
Token ID1011
Feature activation+9.41e-01

Pos logit contributions
ap+0.018
apping+0.016
ooks+0.016
aps+0.015
 upon+0.014
Neg logit contributions
 disl-0.025
 gn-0.022
handed-0.020
col-0.020
growth-0.018
Token things
Token ID1243
Feature activation-3.65e-01

Pos logit contributions
ap+0.034
apping+0.031
ooks+0.029
 upon+0.028
aps+0.028
Neg logit contributions
 disl-0.043
 gn-0.035
col-0.034
handed-0.034
growth-0.032
Token that
Token ID326
Feature activation+3.54e-02

Pos logit contributions
 disl+0.015
 gn+0.012
col+0.011
handed+0.011
growth+0.011
Neg logit contributions
ap-0.016
apping-0.013
ooks-0.013
 upon-0.013
aps-0.012
Token Emma
Token ID18966
Feature activation-3.94e-01

Pos logit contributions
ap+0.003
apping+0.003
ooks+0.003
 upon+0.003
aps+0.003
Neg logit contributions
 disl-0.004
 gn-0.003
col-0.003
handed-0.003
growth-0.003
Token learned
Token ID4499
Feature activation-4.32e-01

Pos logit contributions
 disl+0.018
 gn+0.015
col+0.014
handed+0.014
growth+0.014
Neg logit contributions
ap-0.015
apping-0.012
ooks-0.012
 upon-0.011
aps-0.011
Token that
Token ID326
Feature activation+4.19e-01

Pos logit contributions
 disl+0.015
 gn+0.012
col+0.011
handed+0.011
growth+0.011
Neg logit contributions
ap-0.020
apping-0.018
ooks-0.018
 upon-0.017
aps-0.017
Token it
Token ID340
Feature activation+4.68e-01

Pos logit contributions
ap+0.016
ooks+0.013
apping+0.013
angles+0.013
aps+0.012
Neg logit contributions
 disl-0.019
 gn-0.016
col-0.015
handed-0.014
growth-0.014
Token's
Token ID338
Feature activation+8.45e-01

Pos logit contributions
ap+0.017
ooks+0.014
apping+0.014
aps+0.013
 upon+0.013
Neg logit contributions
 disl-0.022
 gn-0.019
col-0.018
handed-0.017
growth-0.016
Token not
Token ID407
Feature activation+2.85e+00

Pos logit contributions
ap+0.034
apping+0.030
ooks+0.029
aps+0.029
angles+0.028
Neg logit contributions
 disl-0.035
 gn-0.029
col-0.026
handed-0.025
 dis-0.025
Token nice
Token ID3621
Feature activation+6.25e-01

Pos logit contributions
ap+0.137
ucks+0.130
ooks+0.130
aps+0.129
apping+0.127
Neg logit contributions
 disl-0.098
 dis-0.065
 gn-0.064
 baths-0.054
col-0.053
Token to
Token ID284
Feature activation+8.94e-01

Pos logit contributions
ap+0.023
ooks+0.019
apping+0.019
angles+0.018
aps+0.017
Neg logit contributions
 disl-0.029
 gn-0.024
col-0.023
handed-0.021
growth-0.021
Token be
Token ID307
Feature activation+4.37e-01

Pos logit contributions
ap+0.027
apping+0.024
ooks+0.024
angles+0.022
aps+0.022
Neg logit contributions
 disl-0.048
 gn-0.040
col-0.036
handed-0.036
 dis-0.034
Token en
Token ID551
Feature activation+1.68e+00

Pos logit contributions
ap+0.018
apping+0.015
ooks+0.015
angles+0.015
aps+0.015
Neg logit contributions
 disl-0.019
 gn-0.015
col-0.014
handed-0.014
themed-0.012
Tokenvious
Token ID1442
Feature activation+1.29e-01

Pos logit contributions
ooks+0.122
ap+0.120
angles+0.113
ook+0.113
 exactly+0.111
Neg logit contributions
 disl-0.007
vious-0.001
col+0.006
handed+0.007
igsaw+0.009
Token He
Token ID679
Feature activation+1.01e-01

Pos logit contributions
 disl+0.006
 gn+0.006
col+0.005
handed+0.005
growth+0.005
Neg logit contributions
ap-0.005
ooks-0.004
apping-0.004
 upon-0.004
aps-0.004
Token learned
Token ID4499
Feature activation-7.95e-01

Pos logit contributions
ap+0.004
apping+0.004
ooks+0.004
aps+0.004
 upon+0.003
Neg logit contributions
 disl-0.004
 gn-0.003
col-0.003
handed-0.003
growth-0.003
Token that
Token ID326
Feature activation+5.20e-02

Pos logit contributions
 disl+0.034
 gn+0.030
col+0.028
growth+0.028
handed+0.028
Neg logit contributions
ap-0.028
ooks-0.024
 upon-0.024
apping-0.024
aps-0.023
Token it
Token ID340
Feature activation+1.37e-01

Pos logit contributions
ap+0.002
apping+0.002
ooks+0.002
aps+0.002
 upon+0.002
Neg logit contributions
 disl-0.002
 gn-0.002
col-0.002
handed-0.002
growth-0.002
Token's
Token ID338
Feature activation+8.19e-01

Pos logit contributions
ap+0.005
ooks+0.004
apping+0.004
aps+0.004
 upon+0.004
Neg logit contributions
 disl-0.007
 gn-0.006
col-0.005
handed-0.005
growth-0.005
Token not
Token ID407
Feature activation+2.83e+00

Pos logit contributions
ap+0.034
apping+0.030
aps+0.029
ooks+0.029
angles+0.027
Neg logit contributions
 disl-0.034
 gn-0.028
col-0.025
 dis-0.025
handed-0.024
Token always
Token ID1464
Feature activation+1.47e+00

Pos logit contributions
ap+0.136
aps+0.131
apping+0.126
ooks+0.125
ucks+0.125
Neg logit contributions
 disl-0.096
 dis-0.072
 gn-0.061
col-0.054
working-0.053
Token good
Token ID922
Feature activation+5.79e-01

Pos logit contributions
ap+0.063
ooks+0.061
apping+0.059
angles+0.056
aps+0.056
Neg logit contributions
 disl-0.062
 gn-0.044
 dis-0.041
col-0.039
handed-0.037
Token to
Token ID284
Feature activation+6.86e-01

Pos logit contributions
ap+0.021
ooks+0.018
apping+0.018
angles+0.017
aps+0.016
Neg logit contributions
 disl-0.027
 gn-0.022
col-0.021
handed-0.020
growth-0.019
Token keep
Token ID1394
Feature activation+5.37e-01

Pos logit contributions
ap+0.022
apping+0.020
ooks+0.019
aps+0.018
angles+0.018
Neg logit contributions
 disl-0.035
 gn-0.029
handed-0.027
col-0.027
 dis-0.025
Token things
Token ID1243
Feature activation-3.16e-01

Pos logit contributions
ap+0.017
apping+0.014
ooks+0.014
aps+0.013
 upon+0.013
Neg logit contributions
 disl-0.028
 gn-0.023
handed-0.022
col-0.022
growth-0.021
Token Lily
Token ID20037
Feature activation+2.59e-03

Pos logit contributions
 disl+0.008
 gn+0.007
col+0.007
handed+0.007
growth+0.006
Neg logit contributions
ap-0.006
ooks-0.005
apping-0.005
 upon-0.004
aps-0.004
Token realized
Token ID6939
Feature activation-5.46e-01

Pos logit contributions
ap+0.000
apping+0.000
ooks+0.000
 upon+0.000
aps+0.000
Neg logit contributions
 disl-0.000
 gn-0.000
col-0.000
handed-0.000
growth-0.000
Token that
Token ID326
Feature activation-6.58e-02

Pos logit contributions
 disl+0.020
 gn+0.017
handed+0.016
col+0.015
growth+0.015
Neg logit contributions
ap-0.024
apping-0.021
 upon-0.021
ooks-0.021
angles-0.020
Token it
Token ID340
Feature activation-1.01e-01

Pos logit contributions
 disl+0.003
 gn+0.003
col+0.003
handed+0.003
growth+0.003
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token's
Token ID338
Feature activation+5.60e-01

Pos logit contributions
 disl+0.005
 gn+0.004
col+0.004
handed+0.004
growth+0.004
Neg logit contributions
ap-0.003
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token not
Token ID407
Feature activation+2.82e+00

Pos logit contributions
ap+0.020
apping+0.017
ooks+0.017
aps+0.017
angles+0.016
Neg logit contributions
 disl-0.026
 gn-0.022
col-0.020
handed-0.020
 dis-0.019
Token good
Token ID922
Feature activation+5.31e-01

Pos logit contributions
ap+0.141
ucks+0.135
aps+0.133
apping+0.129
ooks+0.129
Neg logit contributions
 disl-0.088
 gn-0.064
 dis-0.059
col-0.052
 ju-0.052
Token to
Token ID284
Feature activation+1.10e+00

Pos logit contributions
ap+0.019
ooks+0.016
apping+0.016
angles+0.015
aps+0.014
Neg logit contributions
 disl-0.025
 gn-0.021
col-0.020
handed-0.019
themed-0.018
Token be
Token ID307
Feature activation+6.43e-01

Pos logit contributions
ap+0.037
apping+0.035
ooks+0.033
aps+0.032
angles+0.030
Neg logit contributions
 disl-0.054
 gn-0.045
col-0.041
handed-0.040
 dis-0.038
Token en
Token ID551
Feature activation+1.40e+00

Pos logit contributions
ap+0.028
apping+0.023
ooks+0.023
angles+0.023
aps+0.022
Neg logit contributions
 disl-0.026
 gn-0.021
col-0.020
handed-0.019
 dis-0.017
Tokenvious
Token ID1442
Feature activation-3.21e-01

Pos logit contributions
ap+0.100
ooks+0.098
angles+0.092
ook+0.090
apping+0.090
Neg logit contributions
 disl-0.009
vious-0.000
col+0.002
handed+0.002
 gn+0.003
Token told
Token ID1297
Feature activation+8.13e-02

Pos logit contributions
ap+0.008
ooks+0.007
apping+0.007
aps+0.007
angles+0.006
Neg logit contributions
 disl-0.010
 gn-0.008
col-0.008
handed-0.008
growth-0.007
Token her
Token ID607
Feature activation-8.75e-01

Pos logit contributions
ap+0.003
apping+0.002
ooks+0.002
aps+0.002
 upon+0.002
Neg logit contributions
 disl-0.004
 gn-0.004
col-0.004
handed-0.003
growth-0.003
Token that
Token ID326
Feature activation-3.32e-01

Pos logit contributions
 disl+0.040
 gn+0.034
handed+0.034
col+0.033
growth+0.033
Neg logit contributions
ap-0.032
ooks-0.027
apping-0.027
 upon-0.026
ies-0.025
Token it
Token ID340
Feature activation+1.82e-02

Pos logit contributions
 disl+0.016
 gn+0.013
handed+0.013
col+0.013
growth+0.012
Neg logit contributions
ap-0.012
apping-0.010
ooks-0.009
 upon-0.009
aps-0.009
Token's
Token ID338
Feature activation+5.16e-01

Pos logit contributions
ap+0.001
apping+0.001
ooks+0.001
 upon+0.000
aps+0.000
Neg logit contributions
 disl-0.001
 gn-0.001
col-0.001
handed-0.001
growth-0.001
Token not
Token ID407
Feature activation+2.82e+00

Pos logit contributions
ap+0.019
apping+0.016
ooks+0.015
angles+0.015
aps+0.015
Neg logit contributions
 disl-0.024
 gn-0.021
col-0.019
handed-0.018
 pel-0.018
Token good
Token ID922
Feature activation+8.46e-01

Pos logit contributions
ap+0.133
aps+0.127
apping+0.126
ooks+0.123
angles+0.122
Neg logit contributions
 disl-0.091
 gn-0.066
col-0.056
 dis-0.055
 pel-0.054
Token to
Token ID284
Feature activation+1.06e+00

Pos logit contributions
ap+0.028
apping+0.023
ooks+0.023
angles+0.022
aps+0.021
Neg logit contributions
 disl-0.042
 gn-0.035
col-0.033
handed-0.031
themed-0.030
Token keep
Token ID1394
Feature activation-1.21e-01

Pos logit contributions
ap+0.041
apping+0.039
ooks+0.039
aps+0.037
 upon+0.037
Neg logit contributions
 disl-0.046
 gn-0.037
col-0.033
handed-0.033
 pel-0.031
Token wild
Token ID4295
Feature activation-9.10e-01

Pos logit contributions
 disl+0.006
 gn+0.005
col+0.005
handed+0.005
growth+0.005
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token animals
Token ID4695
Feature activation-7.60e-01

Pos logit contributions
 disl+0.052
 gn+0.048
handed+0.046
col+0.045
growth+0.045
Neg logit contributions
ap-0.023
apping-0.016
ies-0.015
ooks-0.015
aps-0.014
Token a
Token ID257
Feature activation-1.32e-01

Pos logit contributions
 disl+0.012
 gn+0.010
col+0.010
handed+0.010
growth+0.009
Neg logit contributions
ap-0.008
apping-0.006
ooks-0.006
 upon-0.006
aps-0.006
Token jacket
Token ID15224
Feature activation-3.20e-01

Pos logit contributions
 disl+0.006
 gn+0.005
col+0.004
handed+0.004
growth+0.004
Neg logit contributions
ap-0.005
apping-0.004
ooks-0.004
 upon-0.004
aps-0.004
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
 disl+0.017
 gn+0.014
col+0.014
handed+0.013
growth+0.013
Neg logit contributions
ap-0.010
apping-0.008
ooks-0.008
 upon-0.008
aps-0.007
Token
Token ID198
Feature activation-4.41e-01

Pos logit contributions
 disl+0.012
 gn+0.010
col+0.009
handed+0.009
growth+0.009
Neg logit contributions
ap-0.010
apping-0.009
ooks-0.009
 upon-0.008
aps-0.008
Token
Token ID198
Feature activation-5.68e-01

Pos logit contributions
 disl+0.021
 gn+0.018
col+0.017
handed+0.017
growth+0.016
Neg logit contributions
ap-0.015
ooks-0.013
apping-0.013
 upon-0.012
aps-0.012
TokenThe
Token ID464
Feature activation-2.85e+00

Pos logit contributions
 disl+0.030
 gn+0.025
col+0.024
handed+0.024
growth+0.023
Neg logit contributions
ap-0.017
apping-0.014
ooks-0.014
 upon-0.013
aps-0.012
Token p
Token ID279
Feature activation+1.41e-01

Pos logit contributions
 disl+0.169
side+0.150
growth+0.149
 CRE+0.149
flight+0.143
Neg logit contributions
 friend-0.073
 friends-0.071
 me-0.056
 upon-0.055
 be-0.051
Tokenanda
Token ID5282
Feature activation+6.61e-02

Pos logit contributions
ap+0.006
apping+0.005
ooks+0.005
angles+0.004
 upon+0.004
Neg logit contributions
 disl-0.006
 gn-0.005
col-0.005
handed-0.005
growth-0.004
Token asked
Token ID1965
Feature activation+2.25e-01

Pos logit contributions
ap+0.003
apping+0.002
ooks+0.002
aps+0.002
 upon+0.002
Neg logit contributions
 disl-0.003
 gn-0.002
col-0.002
handed-0.002
growth-0.002
Token,
Token ID11
Feature activation-1.61e-01

Pos logit contributions
ap+0.009
apping+0.008
ooks+0.008
aps+0.007
angles+0.007
Neg logit contributions
 disl-0.010
 gn-0.008
col-0.007
handed-0.007
growth-0.007
Token "
Token ID366
Feature activation-1.22e+00

Pos logit contributions
 disl+0.008
 gn+0.007
col+0.007
handed+0.007
growth+0.006
Neg logit contributions
ap-0.005
apping-0.004
ooks-0.004
 upon-0.004
aps-0.004
Token present
Token ID1944
Feature activation+2.79e-01

Pos logit contributions
ap+0.009
ooks+0.008
apping+0.008
angles+0.007
 upon+0.007
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.008
handed-0.007
growth-0.007
Token?"
Token ID1701
Feature activation-7.44e-02

Pos logit contributions
ap+0.010
ooks+0.009
apping+0.009
aps+0.008
angles+0.008
Neg logit contributions
 disl-0.013
 gn-0.011
col-0.010
handed-0.010
growth-0.010
Token
Token ID198
Feature activation+1.21e-01

Pos logit contributions
 disl+0.003
 gn+0.003
col+0.003
handed+0.002
growth+0.002
Neg logit contributions
ap-0.003
apping-0.003
ooks-0.003
 upon-0.002
aps-0.002
Token
Token ID198
Feature activation-1.78e-01

Pos logit contributions
ap+0.005
apping+0.004
ooks+0.004
 upon+0.004
aps+0.004
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.004
growth-0.004
TokenB
Token ID33
Feature activation+7.09e-01

Pos logit contributions
 disl+0.008
 gn+0.007
col+0.007
handed+0.007
growth+0.006
Neg logit contributions
ap-0.006
apping-0.005
ooks-0.005
 upon-0.005
aps-0.005
Tokenree
Token ID631
Feature activation-2.78e+00

Pos logit contributions
ap+0.028
ooks+0.024
apping+0.023
angles+0.023
 upon+0.023
Neg logit contributions
 disl-0.029
 gn-0.024
col-0.024
handed-0.022
growth-0.022
Token said
Token ID531
Feature activation-1.17e+00

Pos logit contributions
 disl+0.123
handed+0.111
igsaw+0.108
 piping+0.106
themed+0.106
Neg logit contributions
ap-0.086
apping-0.070
 upon-0.063
ies-0.063
ook-0.062
Token,
Token ID11
Feature activation-2.65e-01

Pos logit contributions
 disl+0.060
handed+0.051
growth+0.049
col+0.048
 gn+0.047
Neg logit contributions
ap-0.041
apping-0.033
aps-0.030
ooks-0.030
 upon-0.029
Token "
Token ID366
Feature activation-5.29e-01

Pos logit contributions
 disl+0.014
 gn+0.012
col+0.011
handed+0.011
growth+0.011
Neg logit contributions
ap-0.008
apping-0.007
ooks-0.007
 upon-0.006
aps-0.006
TokenWell
Token ID5779
Feature activation-5.50e-01

Pos logit contributions
 disl+0.028
 gn+0.024
handed+0.023
col+0.022
growth+0.022
Neg logit contributions
ap-0.015
apping-0.012
ooks-0.012
 upon-0.011
aps-0.011
Token,
Token ID11
Feature activation+2.64e-01

Pos logit contributions
 disl+0.026
 gn+0.022
handed+0.022
col+0.022
growth+0.021
Neg logit contributions
ap-0.021
apping-0.017
 upon-0.017
ooks-0.016
angles-0.015
Token the
Token ID262
Feature activation-1.27e+00

Pos logit contributions
 disl+0.059
 gn+0.050
col+0.049
handed+0.049
themed+0.048
Neg logit contributions
ap-0.029
 upon-0.022
ooks-0.020
aps-0.020
apping-0.020
Token night
Token ID1755
Feature activation-9.81e-01

Pos logit contributions
 disl+0.073
col+0.058
 gn+0.056
handed+0.056
growth+0.055
Neg logit contributions
ap-0.031
 upon-0.026
 eyes-0.025
 me-0.025
 friends-0.023
Token?"
Token ID1701
Feature activation+2.92e-01

Pos logit contributions
 disl+0.049
 gn+0.041
col+0.040
handed+0.039
growth+0.038
Neg logit contributions
ap-0.034
apping-0.027
 upon-0.027
ooks-0.026
aps-0.025
Token
Token ID198
Feature activation+2.15e-02

Pos logit contributions
ap+0.013
ooks+0.012
apping+0.012
angles+0.011
aps+0.011
Neg logit contributions
 disl-0.011
 gn-0.009
col-0.008
handed-0.008
growth-0.008
Token
Token ID198
Feature activation-6.63e-02

Pos logit contributions
ap+0.001
apping+0.001
ooks+0.001
 upon+0.001
aps+0.001
Neg logit contributions
 disl-0.001
 gn-0.001
col-0.001
handed-0.001
growth-0.001
TokenThe
Token ID464
Feature activation-2.75e+00

Pos logit contributions
 disl+0.003
 gn+0.003
col+0.003
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token rabbit
Token ID22746
Feature activation+4.29e-01

Pos logit contributions
 disl+0.157
side+0.147
growth+0.144
 CRE+0.142
handed+0.137
Neg logit contributions
 friends-0.073
 friend-0.071
 me-0.062
 animals-0.051
 be-0.050
Token said
Token ID531
Feature activation-4.99e-01

Pos logit contributions
ap+0.019
ooks+0.017
apping+0.016
aps+0.016
angles+0.016
Neg logit contributions
 disl-0.016
 gn-0.014
col-0.013
handed-0.012
growth-0.011
Token,
Token ID11
Feature activation-2.84e-01

Pos logit contributions
 disl+0.023
 gn+0.019
col+0.019
handed+0.018
growth+0.018
Neg logit contributions
ap-0.019
apping-0.016
ooks-0.015
 upon-0.015
aps-0.015
Token "
Token ID366
Feature activation-7.18e-01

Pos logit contributions
 disl+0.015
 gn+0.013
col+0.012
handed+0.012
growth+0.012
Neg logit contributions
ap-0.009
apping-0.007
ooks-0.007
 upon-0.007
aps-0.007
TokenI
Token ID40
Feature activation-7.14e-01

Pos logit contributions
 disl+0.040
 gn+0.036
col+0.034
igsaw+0.034
handed+0.033
Neg logit contributions
ap-0.017
apping-0.013
 upon-0.013
ooks-0.012
aps-0.012
Token too
Token ID1165
Feature activation-1.48e-01

Pos logit contributions
 disl+0.020
 gn+0.017
col+0.016
growth+0.016
handed+0.016
Neg logit contributions
ap-0.011
apping-0.009
ooks-0.009
 upon-0.008
aps-0.008
Token?"
Token ID1701
Feature activation-2.59e-01

Pos logit contributions
 disl+0.007
 gn+0.006
col+0.006
handed+0.006
growth+0.005
Neg logit contributions
ap-0.005
apping-0.004
ooks-0.004
 upon-0.004
aps-0.004
Token
Token ID220
Feature activation-7.06e-02

Pos logit contributions
 disl+0.011
 gn+0.009
col+0.008
handed+0.008
growth+0.008
Neg logit contributions
ap-0.011
apping-0.009
ooks-0.009
aps-0.009
 upon-0.009
Token
Token ID198
Feature activation-4.04e-01

Pos logit contributions
 disl+0.003
 gn+0.003
col+0.003
handed+0.002
growth+0.002
Neg logit contributions
ap-0.003
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token
Token ID198
Feature activation-1.71e-01

Pos logit contributions
 disl+0.019
 gn+0.016
col+0.015
handed+0.015
growth+0.015
Neg logit contributions
ap-0.014
apping-0.012
ooks-0.012
aps-0.011
 upon-0.011
TokenThe
Token ID464
Feature activation-2.75e+00

Pos logit contributions
 disl+0.008
 gn+0.007
col+0.007
handed+0.006
growth+0.006
Neg logit contributions
ap-0.006
apping-0.005
ooks-0.005
aps-0.005
 upon-0.005
Token other
Token ID584
Feature activation-3.68e-01

Pos logit contributions
 disl+0.154
growth+0.139
keeping+0.131
themed+0.130
handed+0.130
Neg logit contributions
 friend-0.060
 friends-0.055
ap-0.054
 others-0.046
 twins-0.043
Token child
Token ID1200
Feature activation+4.47e-01

Pos logit contributions
 disl+0.017
 gn+0.015
col+0.014
handed+0.014
growth+0.013
Neg logit contributions
ap-0.014
apping-0.011
ooks-0.011
 upon-0.011
angles-0.010
Token was
Token ID373
Feature activation-3.59e-01

Pos logit contributions
ap+0.018
ooks+0.016
apping+0.016
aps+0.015
angles+0.015
Neg logit contributions
 disl-0.018
 gn-0.016
col-0.015
handed-0.014
growth-0.013
Token a
Token ID257
Feature activation-1.34e+00

Pos logit contributions
 disl+0.016
 gn+0.014
col+0.013
handed+0.013
growth+0.013
Neg logit contributions
ap-0.014
apping-0.012
ooks-0.011
aps-0.011
 upon-0.011
Token bit
Token ID1643
Feature activation-1.04e+00

Pos logit contributions
 disl+0.064
handed+0.054
col+0.053
growth+0.053
 gn+0.051
Neg logit contributions
ap-0.047
apping-0.037
 upon-0.036
 friend-0.033
ooks-0.033
Token said
Token ID531
Feature activation-4.26e-01

Pos logit contributions
ap+0.013
apping+0.011
ooks+0.011
aps+0.010
angles+0.010
Neg logit contributions
 disl-0.012
 gn-0.011
col-0.010
handed-0.009
growth-0.009
Token,
Token ID11
Feature activation-2.50e-01

Pos logit contributions
 disl+0.019
 gn+0.016
handed+0.015
col+0.015
growth+0.015
Neg logit contributions
ap-0.017
apping-0.014
ooks-0.014
aps-0.013
 upon-0.013
Token "
Token ID366
Feature activation-2.35e-01

Pos logit contributions
 disl+0.013
 gn+0.011
col+0.010
handed+0.010
growth+0.010
Neg logit contributions
ap-0.008
apping-0.007
ooks-0.006
 upon-0.006
aps-0.006
TokenIt
Token ID1026
Feature activation-3.62e-02

Pos logit contributions
 disl+0.011
 gn+0.009
col+0.009
handed+0.009
growth+0.008
Neg logit contributions
ap-0.008
apping-0.007
ooks-0.007
 upon-0.007
aps-0.007
Token feels
Token ID5300
Feature activation-8.36e-01

Pos logit contributions
 disl+0.002
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token thick
Token ID6546
Feature activation-2.71e+00

Pos logit contributions
 disl+0.038
 gn+0.033
handed+0.031
growth+0.031
col+0.031
Neg logit contributions
ap-0.030
ooks-0.025
 upon-0.024
apping-0.024
aps-0.022
Token."
Token ID526
Feature activation+7.68e-01

Pos logit contributions
 disl+0.166
growth+0.143
handed+0.142
 gn+0.141
igsaw+0.136
Neg logit contributions
ap-0.050
 for-0.047
ooks-0.035
 by-0.035
ies-0.034
Token
Token ID220
Feature activation+6.33e-01

Pos logit contributions
ap+0.033
ooks+0.029
angles+0.029
apping+0.028
ies+0.027
Neg logit contributions
 disl-0.028
 gn-0.024
col-0.022
handed-0.021
growth-0.020
Token
Token ID198
Feature activation+6.06e-01

Pos logit contributions
ap+0.027
apping+0.023
ooks+0.022
angles+0.022
 upon+0.021
Neg logit contributions
 disl-0.024
 gn-0.020
col-0.019
handed-0.019
growth-0.018
Token"
Token ID1
Feature activation-4.70e-01

Pos logit contributions
ap+0.024
ooks+0.021
angles+0.020
apping+0.020
 upon+0.019
Neg logit contributions
 disl-0.024
 gn-0.021
col-0.020
handed-0.019
themed-0.018
TokenYes
Token ID5297
Feature activation+1.55e-01

Pos logit contributions
 disl+0.023
 gn+0.020
handed+0.019
col+0.019
growth+0.018
Neg logit contributions
ap-0.016
apping-0.013
ooks-0.012
 upon-0.012
aps-0.012
Token march
Token ID9960
Feature activation+4.90e-01

Pos logit contributions
 disl+0.071
 gn+0.057
col+0.056
handed+0.056
growth+0.054
Neg logit contributions
ap-0.049
 upon-0.038
apping-0.037
ooks-0.034
angles-0.034
Token."
Token ID526
Feature activation+2.49e-01

Pos logit contributions
ap+0.019
ooks+0.017
apping+0.017
aps+0.016
angles+0.016
Neg logit contributions
 disl-0.020
 gn-0.017
col-0.016
handed-0.015
themed-0.014
Token
Token ID198
Feature activation+4.68e-01

Pos logit contributions
ap+0.010
apping+0.009
ooks+0.009
aps+0.008
angles+0.008
Neg logit contributions
 disl-0.010
 gn-0.009
col-0.008
handed-0.008
growth-0.008
Token
Token ID198
Feature activation+2.81e-02

Pos logit contributions
ap+0.020
apping+0.017
ooks+0.017
angles+0.016
aps+0.016
Neg logit contributions
 disl-0.018
 gn-0.015
col-0.015
handed-0.014
growth-0.014
TokenPed
Token ID43468
Feature activation-2.18e-01

Pos logit contributions
ap+0.001
apping+0.001
ooks+0.001
aps+0.001
 upon+0.001
Neg logit contributions
 disl-0.001
 gn-0.001
col-0.001
handed-0.001
growth-0.001
Tokenro
Token ID305
Feature activation-2.66e+00

Pos logit contributions
 disl+0.009
 gn+0.007
col+0.007
handed+0.007
growth+0.006
Neg logit contributions
ap-0.009
ooks-0.008
apping-0.008
aps-0.008
angles-0.008
Token said
Token ID531
Feature activation-1.15e+00

Pos logit contributions
 disl+0.122
 gn+0.106
col+0.106
 piping+0.105
growth+0.104
Neg logit contributions
ap-0.091
apping-0.074
ies-0.068
 upon-0.066
ooks-0.064
Token,
Token ID11
Feature activation-6.03e-01

Pos logit contributions
 disl+0.059
handed+0.051
growth+0.049
 gn+0.049
col+0.048
Neg logit contributions
ap-0.040
apping-0.032
ooks-0.029
aps-0.028
 upon-0.028
Token "
Token ID366
Feature activation-2.35e-01

Pos logit contributions
 disl+0.033
handed+0.028
 gn+0.028
col+0.027
growth+0.027
Neg logit contributions
ap-0.017
apping-0.014
ooks-0.013
 upon-0.013
aps-0.012
TokenOkay
Token ID16454
Feature activation-3.44e-01

Pos logit contributions
 disl+0.011
 gn+0.009
col+0.009
handed+0.008
growth+0.008
Neg logit contributions
ap-0.009
apping-0.007
ooks-0.007
 upon-0.007
aps-0.007
Token,
Token ID11
Feature activation-5.43e-01

Pos logit contributions
 disl+0.015
 gn+0.013
col+0.012
handed+0.012
growth+0.012
Neg logit contributions
ap-0.014
apping-0.011
ooks-0.011
 upon-0.011
aps-0.010
Token luggage
Token ID31488
Feature activation+7.37e-01

Pos logit contributions
 disl+0.008
 gn+0.007
col+0.007
handed+0.006
growth+0.006
Neg logit contributions
ap-0.006
apping-0.005
ooks-0.005
 upon-0.005
aps-0.005
Token.
Token ID13
Feature activation-9.23e-02

Pos logit contributions
ap+0.028
ooks+0.023
apping+0.023
angles+0.022
aps+0.022
Neg logit contributions
 disl-0.032
 gn-0.028
col-0.027
handed-0.026
growth-0.024
Token
Token ID220
Feature activation+8.17e-01

Pos logit contributions
 disl+0.004
 gn+0.003
col+0.003
handed+0.003
growth+0.003
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token
Token ID198
Feature activation-5.28e-01

Pos logit contributions
ap+0.036
apping+0.030
ooks+0.030
angles+0.029
ies+0.028
Neg logit contributions
 disl-0.031
 gn-0.025
col-0.024
handed-0.024
growth-0.022
Token
Token ID198
Feature activation-4.43e-01

Pos logit contributions
 disl+0.026
 gn+0.022
col+0.021
handed+0.020
growth+0.020
Neg logit contributions
ap-0.017
ooks-0.015
apping-0.015
 upon-0.014
aps-0.014
TokenThe
Token ID464
Feature activation-2.65e+00

Pos logit contributions
 disl+0.022
 gn+0.019
col+0.018
handed+0.018
growth+0.018
Neg logit contributions
ap-0.014
apping-0.011
ooks-0.011
 upon-0.011
aps-0.010
Token elephant
Token ID20950
Feature activation+1.13e-01

Pos logit contributions
 disl+0.156
riger+0.135
growth+0.133
 CRE+0.131
paste+0.131
Neg logit contributions
 friend-0.056
 friends-0.048
 eyes-0.045
 upon-0.043
 animals-0.042
Token asked
Token ID1965
Feature activation-4.87e-02

Pos logit contributions
ap+0.005
ooks+0.004
apping+0.004
aps+0.004
 upon+0.004
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.004
growth-0.004
Token,
Token ID11
Feature activation-1.33e-01

Pos logit contributions
 disl+0.002
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token "
Token ID366
Feature activation-9.13e-01

Pos logit contributions
 disl+0.007
 gn+0.006
col+0.005
handed+0.005
growth+0.005
Neg logit contributions
ap-0.004
apping-0.004
ooks-0.004
 upon-0.003
aps-0.003
TokenWhat
Token ID2061
Feature activation+8.56e-01

Pos logit contributions
 disl+0.047
 gn+0.037
col+0.036
igsaw+0.036
handed+0.036
Neg logit contributions
ap-0.027
 upon-0.024
apping-0.022
ooks-0.020
angles-0.020
Token said
Token ID531
Feature activation-2.24e-01

Pos logit contributions
ap+0.005
ooks+0.005
apping+0.005
aps+0.004
 upon+0.004
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.004
growth-0.004
Token,
Token ID11
Feature activation-6.02e-01

Pos logit contributions
 disl+0.010
 gn+0.008
col+0.008
handed+0.008
growth+0.007
Neg logit contributions
ap-0.009
apping-0.008
ooks-0.007
aps-0.007
 upon-0.007
Token "
Token ID366
Feature activation-5.43e-01

Pos logit contributions
 disl+0.033
handed+0.029
 gn+0.029
col+0.028
growth+0.027
Neg logit contributions
ap-0.017
apping-0.013
 upon-0.012
ooks-0.012
aps-0.012
TokenWow
Token ID22017
Feature activation+7.78e-01

Pos logit contributions
 disl+0.027
 gn+0.023
handed+0.022
col+0.022
igsaw+0.021
Neg logit contributions
ap-0.018
apping-0.014
ooks-0.014
aps-0.013
 upon-0.013
Token,
Token ID11
Feature activation-6.17e-01

Pos logit contributions
ap+0.035
ooks+0.033
apping+0.032
angles+0.030
aps+0.030
Neg logit contributions
 disl-0.025
 gn-0.021
col-0.019
riger-0.018
handed-0.017
Token an
Token ID281
Feature activation-2.63e+00

Pos logit contributions
 disl+0.024
handed+0.020
 gn+0.020
col+0.019
growth+0.018
Neg logit contributions
ap-0.027
apping-0.022
 upon-0.022
angles-0.021
ooks-0.021
Token elephant
Token ID20950
Feature activation+1.68e-01

Pos logit contributions
 disl+0.121
side+0.109
 gn+0.101
riger+0.100
handed+0.098
Neg logit contributions
ap-0.082
 for-0.070
ook-0.069
ooks-0.067
angles-0.066
Token is
Token ID318
Feature activation-9.20e-02

Pos logit contributions
ap+0.006
ooks+0.005
apping+0.005
 upon+0.005
angles+0.005
Neg logit contributions
 disl-0.007
 gn-0.006
col-0.006
handed-0.006
growth-0.006
Token amazing
Token ID4998
Feature activation-6.62e-01

Pos logit contributions
 disl+0.004
 gn+0.004
col+0.004
handed+0.003
growth+0.003
Neg logit contributions
ap-0.003
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token.
Token ID13
Feature activation-3.99e-01

Pos logit contributions
 disl+0.033
 gn+0.028
handed+0.028
col+0.027
growth+0.027
Neg logit contributions
ap-0.022
apping-0.018
ooks-0.018
aps-0.017
 upon-0.017
Token Can
Token ID1680
Feature activation+3.98e-01

Pos logit contributions
 disl+0.018
 gn+0.016
handed+0.015
col+0.015
growth+0.014
Neg logit contributions
ap-0.015
apping-0.012
ooks-0.012
aps-0.011
 upon-0.011
Token play
Token ID711
Feature activation-3.04e-01

Pos logit contributions
 disl+0.009
 gn+0.008
col+0.008
handed+0.008
growth+0.007
Neg logit contributions
ap-0.007
apping-0.005
ooks-0.005
 upon-0.005
aps-0.005
Token!"
Token ID2474
Feature activation+5.06e-01

Pos logit contributions
 disl+0.018
 gn+0.016
col+0.015
handed+0.015
growth+0.015
Neg logit contributions
ap-0.008
apping-0.006
ooks-0.005
aps-0.005
 upon-0.005
Token
Token ID198
Feature activation+5.40e-01

Pos logit contributions
ap+0.022
ooks+0.019
apping+0.019
angles+0.018
 upon+0.018
Neg logit contributions
 disl-0.020
 gn-0.017
col-0.016
handed-0.015
growth-0.014
Token
Token ID198
Feature activation+7.15e-01

Pos logit contributions
ap+0.023
apping+0.019
ooks+0.019
angles+0.019
 upon+0.018
Neg logit contributions
 disl-0.022
 gn-0.018
col-0.017
handed-0.017
growth-0.016
TokenF
Token ID37
Feature activation-2.99e-02

Pos logit contributions
ap+0.026
ooks+0.022
apping+0.022
angles+0.021
 upon+0.020
Neg logit contributions
 disl-0.032
 gn-0.028
col-0.027
handed-0.026
growth-0.024
Tokenro
Token ID305
Feature activation-2.62e+00

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Tokenggy
Token ID19970
Feature activation-4.88e-01

Pos logit contributions
 disl+0.118
col+0.104
 piping+0.102
growth+0.101
 gn+0.100
Neg logit contributions
ap-0.090
apping-0.072
ies-0.065
 upon-0.064
ooks-0.063
Token and
Token ID290
Feature activation-2.16e-02

Pos logit contributions
 disl+0.022
 gn+0.019
handed+0.018
col+0.018
growth+0.017
Neg logit contributions
ap-0.018
apping-0.015
ooks-0.014
 upon-0.014
aps-0.014
Token D
Token ID360
Feature activation+1.76e+00

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Tokenucky
Token ID5309
Feature activation+1.51e-02

Pos logit contributions
ooks+0.074
ap+0.072
angles+0.071
 exactly+0.067
ook+0.064
Neg logit contributions
 disl-0.059
 gn-0.047
handed-0.047
col-0.044
igsaw-0.043
Token played
Token ID2826
Feature activation+4.12e-01

Pos logit contributions
ap+0.001
apping+0.000
ooks+0.000
 upon+0.000
aps+0.000
Neg logit contributions
 disl-0.001
 gn-0.001
col-0.001
handed-0.001
growth-0.001
Token I
Token ID314
Feature activation-9.31e-01

Pos logit contributions
ap+0.006
ooks+0.005
apping+0.005
angles+0.005
aps+0.005
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.008
handed-0.007
growth-0.007
Token found
Token ID1043
Feature activation+1.58e-01

Pos logit contributions
 disl+0.049
 gn+0.042
handed+0.041
col+0.041
growth+0.041
Neg logit contributions
ap-0.027
 upon-0.024
apping-0.020
ooks-0.020
aps-0.018
Token!
Token ID0
Feature activation-8.13e-01

Pos logit contributions
ap+0.007
apping+0.006
ooks+0.006
aps+0.005
 upon+0.005
Neg logit contributions
 disl-0.006
 gn-0.005
col-0.005
handed-0.005
growth-0.005
Token It
Token ID632
Feature activation-7.64e-01

Pos logit contributions
 disl+0.040
 gn+0.034
handed+0.032
col+0.032
themed+0.030
Neg logit contributions
ap-0.028
apping-0.023
ooks-0.022
 upon-0.021
aps-0.021
Token's
Token ID338
Feature activation+1.90e-01

Pos logit contributions
 disl+0.041
 gn+0.035
handed+0.034
col+0.033
growth+0.033
Neg logit contributions
ap-0.021
 upon-0.016
apping-0.016
ooks-0.015
angles-0.015
Token an
Token ID281
Feature activation-2.61e+00

Pos logit contributions
ap+0.007
apping+0.006
ooks+0.006
aps+0.006
angles+0.006
Neg logit contributions
 disl-0.008
 gn-0.007
col-0.007
handed-0.007
growth-0.006
Token insect
Token ID12833
Feature activation-3.23e-01

Pos logit contributions
 disl+0.139
side+0.121
handed+0.119
 gn+0.118
riger+0.117
Neg logit contributions
ap-0.068
 for-0.057
apping-0.054
ook-0.052
ooks-0.050
Token!"
Token ID2474
Feature activation-4.69e-01

Pos logit contributions
 disl+0.014
 gn+0.012
col+0.011
handed+0.011
growth+0.010
Neg logit contributions
ap-0.013
apping-0.011
ooks-0.011
 upon-0.010
aps-0.010
Token he
Token ID339
Feature activation+5.37e-01

Pos logit contributions
 disl+0.020
 gn+0.017
col+0.016
handed+0.016
growth+0.015
Neg logit contributions
ap-0.019
apping-0.016
ooks-0.015
 upon-0.015
aps-0.015
Token said
Token ID531
Feature activation-4.58e-01

Pos logit contributions
ap+0.026
ooks+0.023
apping+0.023
angles+0.022
aps+0.022
Neg logit contributions
 disl-0.018
 gn-0.015
col-0.014
handed-0.012
growth-0.012
Token.
Token ID13
Feature activation-1.67e-01

Pos logit contributions
 disl+0.024
 gn+0.021
col+0.020
growth+0.020
handed+0.020
Neg logit contributions
ap-0.014
apping-0.011
ooks-0.011
aps-0.010
 upon-0.010
TokenHello
Token ID15496
Feature activation-7.19e-01

Pos logit contributions
 disl+0.026
 gn+0.021
col+0.020
handed+0.020
igsaw+0.020
Neg logit contributions
ap-0.018
 upon-0.016
ooks-0.015
apping-0.015
angles-0.014
Token little
Token ID1310
Feature activation-3.15e-01

Pos logit contributions
 disl+0.034
col+0.028
 gn+0.028
handed+0.027
growth+0.027
Neg logit contributions
ap-0.026
 upon-0.022
apping-0.021
ooks-0.021
aps-0.020
Token boy
Token ID2933
Feature activation-2.87e-01

Pos logit contributions
 disl+0.014
 gn+0.012
col+0.011
handed+0.011
growth+0.011
Neg logit contributions
ap-0.012
apping-0.010
ooks-0.010
 upon-0.010
aps-0.009
Token,"
Token ID553
Feature activation+5.34e-01

Pos logit contributions
 disl+0.012
 gn+0.010
col+0.009
handed+0.009
growth+0.009
Neg logit contributions
ap-0.012
apping-0.010
ooks-0.010
aps-0.010
 upon-0.010
Token said
Token ID531
Feature activation+2.44e-01

Pos logit contributions
ap+0.025
apping+0.023
ooks+0.023
aps+0.021
angles+0.021
Neg logit contributions
 disl-0.020
 gn-0.016
col-0.014
handed-0.014
themed-0.013
Token the
Token ID262
Feature activation-2.60e+00

Pos logit contributions
ap+0.009
ooks+0.008
apping+0.008
angles+0.007
aps+0.007
Neg logit contributions
 disl-0.011
 gn-0.009
col-0.009
handed-0.008
growth-0.008
Token giant
Token ID6175
Feature activation-2.37e-01

Pos logit contributions
 disl+0.119
riger+0.098
side+0.095
handed+0.092
themed+0.090
Neg logit contributions
 upon-0.087
 friend-0.086
 eyes-0.082
ap-0.081
 friends-0.079
Token.
Token ID13
Feature activation-7.69e-01

Pos logit contributions
 disl+0.012
 gn+0.010
col+0.010
handed+0.010
growth+0.009
Neg logit contributions
ap-0.008
apping-0.006
ooks-0.006
 upon-0.006
aps-0.006
Token "
Token ID366
Feature activation-3.49e-01

Pos logit contributions
 disl+0.042
col+0.036
handed+0.036
 gn+0.035
growth+0.035
Neg logit contributions
ap-0.020
apping-0.016
ooks-0.016
 upon-0.016
aps-0.015
TokenWhy
Token ID5195
Feature activation+8.27e-01

Pos logit contributions
 disl+0.016
 gn+0.014
col+0.013
handed+0.013
growth+0.012
Neg logit contributions
ap-0.012
apping-0.010
ooks-0.010
 upon-0.010
aps-0.010
Token are
Token ID389
Feature activation+3.44e-01

Pos logit contributions
ap+0.032
ooks+0.029
apping+0.028
aps+0.026
ags+0.025
Neg logit contributions
 disl-0.037
 gn-0.031
col-0.027
handed-0.026
themed-0.025
Token't
Token ID470
Feature activation+3.32e-01

Pos logit contributions
 disl+0.002
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token be
Token ID307
Feature activation-5.93e-01

Pos logit contributions
ap+0.014
ooks+0.013
apping+0.013
angles+0.012
aps+0.012
Neg logit contributions
 disl-0.014
 gn-0.011
col-0.010
handed-0.010
growth-0.009
Token scared
Token ID12008
Feature activation-2.94e-02

Pos logit contributions
 disl+0.023
 gn+0.019
growth+0.019
handed+0.019
col+0.019
Neg logit contributions
ap-0.025
apping-0.022
ooks-0.021
 upon-0.020
aps-0.020
Token,"
Token ID553
Feature activation+2.52e-01

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token said
Token ID531
Feature activation+2.79e-01

Pos logit contributions
ap+0.012
ooks+0.010
apping+0.010
aps+0.010
 upon+0.010
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.007
handed-0.007
growth-0.006
Token the
Token ID262
Feature activation-2.60e+00

Pos logit contributions
ap+0.011
apping+0.009
ooks+0.009
aps+0.009
angles+0.009
Neg logit contributions
 disl-0.012
 gn-0.011
col-0.010
handed-0.010
growth-0.009
Token giant
Token ID6175
Feature activation-5.44e-01

Pos logit contributions
 disl+0.113
side+0.093
riger+0.092
handed+0.090
themed+0.089
Neg logit contributions
 upon-0.093
ap-0.087
 friend-0.086
 eyes-0.083
 friends-0.082
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
 disl+0.028
 gn+0.024
col+0.023
handed+0.023
growth+0.023
Neg logit contributions
ap-0.017
apping-0.014
ooks-0.013
 upon-0.013
aps-0.012
Token "
Token ID366
Feature activation-6.39e-02

Pos logit contributions
 disl+0.028
 gn+0.024
col+0.023
handed+0.023
growth+0.023
Neg logit contributions
ap-0.015
apping-0.012
ooks-0.012
 upon-0.012
aps-0.011
TokenI
Token ID40
Feature activation-5.20e-01

Pos logit contributions
 disl+0.003
 gn+0.003
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token'm
Token ID1101
Feature activation-9.32e-02

Pos logit contributions
 disl+0.025
 gn+0.022
col+0.021
handed+0.021
growth+0.021
Neg logit contributions
ap-0.018
 upon-0.014
apping-0.014
ooks-0.014
aps-0.013
Token the
Token ID262
Feature activation-1.16e+00

Pos logit contributions
 disl+0.066
 gn+0.055
col+0.055
themed+0.054
handed+0.054
Neg logit contributions
ap-0.036
 upon-0.029
 your-0.027
ooks-0.026
 for-0.026
Token jungle
Token ID20712
Feature activation+1.08e-01

Pos logit contributions
 disl+0.059
 gn+0.046
col+0.045
igsaw+0.043
growth+0.042
Neg logit contributions
ap-0.036
 upon-0.029
 us-0.028
 eyes-0.028
 me-0.026
Token?"
Token ID1701
Feature activation+5.05e-03

Pos logit contributions
ap+0.004
ooks+0.003
apping+0.003
aps+0.003
angles+0.003
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.004
growth-0.004
Token
Token ID198
Feature activation-2.81e-01

Pos logit contributions
ap+0.000
apping+0.000
ooks+0.000
 upon+0.000
aps+0.000
Neg logit contributions
 disl-0.000
 gn-0.000
col-0.000
handed-0.000
growth-0.000
Token
Token ID198
Feature activation-2.79e-01

Pos logit contributions
 disl+0.013
 gn+0.011
col+0.011
handed+0.010
growth+0.010
Neg logit contributions
ap-0.010
apping-0.008
ooks-0.008
 upon-0.008
aps-0.008
TokenThe
Token ID464
Feature activation-2.59e+00

Pos logit contributions
 disl+0.014
 gn+0.012
col+0.011
handed+0.011
growth+0.011
Neg logit contributions
ap-0.009
apping-0.008
ooks-0.008
 upon-0.007
aps-0.007
Token child
Token ID1200
Feature activation+5.96e-01

Pos logit contributions
 disl+0.151
riger+0.133
growth+0.130
side+0.127
 CRE+0.127
Neg logit contributions
 friend-0.057
 friends-0.052
 animals-0.044
 eyes-0.044
 upon-0.042
Token said
Token ID531
Feature activation-1.05e+00

Pos logit contributions
ap+0.027
ooks+0.024
apping+0.023
aps+0.022
angles+0.022
Neg logit contributions
 disl-0.022
 gn-0.019
col-0.017
handed-0.016
growth-0.015
Token,
Token ID11
Feature activation-2.37e-01

Pos logit contributions
 disl+0.053
handed+0.044
growth+0.044
 gn+0.044
col+0.043
Neg logit contributions
ap-0.037
apping-0.030
 upon-0.027
ooks-0.026
aps-0.026
Token "
Token ID366
Feature activation-5.22e-01

Pos logit contributions
 disl+0.012
 gn+0.010
col+0.010
handed+0.010
growth+0.010
Neg logit contributions
ap-0.008
apping-0.006
ooks-0.006
 upon-0.006
aps-0.006
TokenI
Token ID40
Feature activation-1.07e+00

Pos logit contributions
 disl+0.029
 gn+0.025
col+0.024
handed+0.024
growth+0.023
Neg logit contributions
ap-0.013
apping-0.011
 upon-0.010
ooks-0.010
aps-0.010
Token a
Token ID257
Feature activation+2.32e-01

Pos logit contributions
 disl+0.034
 gn+0.029
col+0.028
handed+0.028
riger+0.026
Neg logit contributions
ap-0.019
ooks-0.015
apping-0.015
 upon-0.014
aps-0.013
Token family
Token ID1641
Feature activation+4.73e-01

Pos logit contributions
ap+0.010
ooks+0.008
apping+0.008
angles+0.008
aps+0.008
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.007
handed-0.007
growth-0.007
Token who
Token ID508
Feature activation+3.40e-01

Pos logit contributions
ap+0.015
ooks+0.012
angles+0.012
apping+0.012
 upon+0.011
Neg logit contributions
 disl-0.023
 gn-0.021
col-0.020
handed-0.019
themed-0.018
Token really
Token ID1107
Feature activation-4.58e-01

Pos logit contributions
ap+0.012
apping+0.010
ooks+0.010
angles+0.010
aps+0.010
Neg logit contributions
 disl-0.016
 gn-0.014
col-0.013
handed-0.012
growth-0.012
Token enjoyed
Token ID8359
Feature activation+2.64e-01

Pos logit contributions
 disl+0.022
 gn+0.020
col+0.018
handed+0.018
growth+0.018
Neg logit contributions
ap-0.016
apping-0.012
ooks-0.012
aps-0.011
 upon-0.011
Token pic
Token ID8301
Feature activation-2.58e+00

Pos logit contributions
ap+0.009
apping+0.008
ooks+0.008
angles+0.007
aps+0.007
Neg logit contributions
 disl-0.012
 gn-0.011
col-0.010
handed-0.010
growth-0.009
Tokenn
Token ID77
Feature activation+8.04e-01

Pos logit contributions
 disl+0.080
 gn+0.079
col+0.071
growth+0.067
handed+0.066
Neg logit contributions
ap-0.117
ooks-0.097
ag-0.092
apping-0.091
angles-0.090
Tokenics
Token ID873
Feature activation+1.16e+00

Pos logit contributions
ap+0.036
ooks+0.031
apping+0.030
aps+0.029
 upon+0.029
Neg logit contributions
 disl-0.029
 gn-0.023
col-0.023
handed-0.023
themed-0.020
Token.
Token ID13
Feature activation+6.03e-01

Pos logit contributions
ap+0.042
ooks+0.038
angles+0.038
apping+0.036
 upon+0.036
Neg logit contributions
 disl-0.047
 gn-0.042
col-0.041
handed-0.039
themed-0.035
Token They
Token ID1119
Feature activation+1.81e-01

Pos logit contributions
ap+0.021
apping+0.018
ooks+0.017
angles+0.017
aps+0.017
Neg logit contributions
 disl-0.028
 gn-0.024
col-0.023
growth-0.022
handed-0.022
Token loved
Token ID6151
Feature activation+1.60e-01

Pos logit contributions
ap+0.007
apping+0.005
ooks+0.005
aps+0.005
angles+0.005
Neg logit contributions
 disl-0.008
 gn-0.007
col-0.007
handed-0.007
growth-0.006
Token "
Token ID366
Feature activation-8.12e-02

Pos logit contributions
ap+0.006
ooks+0.005
apping+0.005
angles+0.005
aps+0.005
Neg logit contributions
 disl-0.008
 gn-0.007
col-0.006
handed-0.006
growth-0.006
TokenWhat
Token ID2061
Feature activation+3.61e-01

Pos logit contributions
 disl+0.003
 gn+0.003
col+0.003
handed+0.003
growth+0.003
Neg logit contributions
ap-0.003
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token's
Token ID338
Feature activation-5.22e-02

Pos logit contributions
ap+0.013
ooks+0.011
apping+0.011
angles+0.010
aps+0.010
Neg logit contributions
 disl-0.017
 gn-0.014
col-0.013
handed-0.013
growth-0.012
Token wrong
Token ID2642
Feature activation-7.79e-01

Pos logit contributions
 disl+0.002
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.001
aps-0.001
Token?"
Token ID1701
Feature activation+2.07e-01

Pos logit contributions
 disl+0.040
col+0.034
handed+0.033
 gn+0.033
growth+0.033
Neg logit contributions
ap-0.026
apping-0.021
ooks-0.020
 upon-0.019
aps-0.019
Token The
Token ID383
Feature activation-2.57e+00

Pos logit contributions
ap+0.009
ooks+0.007
apping+0.007
angles+0.007
aps+0.007
Neg logit contributions
 disl-0.009
 gn-0.007
col-0.007
handed-0.006
growth-0.006
Token little
Token ID1310
Feature activation-9.77e-01

Pos logit contributions
 disl+0.124
riger+0.111
col+0.111
pan+0.108
 CRE+0.107
Neg logit contributions
 upon-0.076
ap-0.068
 friend-0.065
 eyes-0.064
 friends-0.059
Token boy
Token ID2933
Feature activation+1.19e+00

Pos logit contributions
 disl+0.058
col+0.050
growth+0.050
handed+0.049
 gn+0.049
Neg logit contributions
ap-0.023
 upon-0.018
apping-0.016
ies-0.016
aps-0.015
Token said
Token ID531
Feature activation-2.92e-01

Pos logit contributions
ap+0.058
ooks+0.055
apping+0.052
angles+0.052
aps+0.052
Neg logit contributions
 disl-0.038
 gn-0.032
col-0.026
handed-0.024
 dis-0.024
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
 disl+0.013
 gn+0.011
col+0.010
handed+0.010
growth+0.010
Neg logit contributions
ap-0.012
apping-0.010
ooks-0.010
 upon-0.009
aps-0.009
Token "
Token ID366
Feature activation-6.08e-01

Pos logit contributions
 disl+0.007
 gn+0.006
col+0.005
handed+0.005
growth+0.005
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token go
Token ID467
Feature activation+2.92e-01

Pos logit contributions
 disl+0.036
 gn+0.031
col+0.030
handed+0.030
growth+0.029
Neg logit contributions
ap-0.020
apping-0.014
ooks-0.014
 upon-0.013
angles-0.013
Token see
Token ID766
Feature activation+6.11e-02

Pos logit contributions
ap+0.010
ooks+0.008
apping+0.008
angles+0.007
aps+0.007
Neg logit contributions
 disl-0.014
 gn-0.013
col-0.012
handed-0.011
growth-0.011
Token some
Token ID617
Feature activation+3.35e-01

Pos logit contributions
ap+0.002
apping+0.002
ooks+0.002
aps+0.002
 upon+0.002
Neg logit contributions
 disl-0.003
 gn-0.003
col-0.002
handed-0.002
growth-0.002
Token birds
Token ID10087
Feature activation-1.00e-01

Pos logit contributions
ap+0.014
apping+0.012
ooks+0.012
aps+0.011
angles+0.011
Neg logit contributions
 disl-0.014
 gn-0.012
col-0.011
handed-0.011
growth-0.010
Token!"
Token ID2474
Feature activation+2.89e-01

Pos logit contributions
 disl+0.005
 gn+0.004
col+0.004
handed+0.004
growth+0.004
Neg logit contributions
ap-0.003
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token The
Token ID383
Feature activation-2.56e+00

Pos logit contributions
ap+0.013
ooks+0.011
apping+0.011
angles+0.011
 upon+0.011
Neg logit contributions
 disl-0.011
 gn-0.009
col-0.009
handed-0.008
growth-0.008
Token little
Token ID1310
Feature activation-9.02e-01

Pos logit contributions
 disl+0.165
handed+0.141
col+0.141
 CRE+0.137
igsaw+0.137
Neg logit contributions
 friend-0.051
 eyes-0.048
 friends-0.045
 animals-0.040
 upon-0.038
Token orange
Token ID10912
Feature activation+4.28e-01

Pos logit contributions
 disl+0.054
col+0.045
handed+0.045
growth+0.044
 gn+0.044
Neg logit contributions
ap-0.022
 upon-0.016
ies-0.015
apping-0.015
 eyes-0.015
Token bear
Token ID6842
Feature activation-6.75e-02

Pos logit contributions
ap+0.018
ooks+0.016
apping+0.015
aps+0.014
angles+0.014
Neg logit contributions
 disl-0.017
 gn-0.015
col-0.014
handed-0.013
growth-0.013
Token was
Token ID373
Feature activation-1.45e-01

Pos logit contributions
 disl+0.003
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.003
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token excited
Token ID6568
Feature activation+5.17e-01

Pos logit contributions
 disl+0.006
 gn+0.005
col+0.005
handed+0.004
growth+0.004
Neg logit contributions
ap-0.006
apping-0.005
ooks-0.005
 upon-0.005
aps-0.005
Tokeneddy
Token ID21874
Feature activation+2.14e-02

Pos logit contributions
ap+0.038
ooks+0.031
apping+0.029
angles+0.028
 upon+0.027
Neg logit contributions
 disl-0.046
col-0.042
 gn-0.042
handed-0.041
themed-0.038
Token bear
Token ID6842
Feature activation-8.00e-01

Pos logit contributions
ap+0.001
apping+0.001
ooks+0.001
 upon+0.001
aps+0.001
Neg logit contributions
 disl-0.001
 gn-0.001
col-0.001
handed-0.001
growth-0.001
Token.
Token ID13
Feature activation+7.98e-01

Pos logit contributions
 disl+0.044
 gn+0.037
col+0.035
handed+0.035
growth+0.035
Neg logit contributions
ap-0.024
apping-0.019
ooks-0.018
 upon-0.017
aps-0.017
Token Every
Token ID3887
Feature activation-3.83e-01

Pos logit contributions
ap+0.038
apping+0.033
ooks+0.032
ies+0.032
angles+0.031
Neg logit contributions
 disl-0.029
 gn-0.024
col-0.021
growth-0.020
handed-0.020
Token day
Token ID1110
Feature activation+3.78e-01

Pos logit contributions
 disl+0.015
 gn+0.012
col+0.012
handed+0.011
growth+0.011
Neg logit contributions
ap-0.017
apping-0.014
ooks-0.014
 upon-0.014
aps-0.013
Token the
Token ID262
Feature activation-2.56e+00

Pos logit contributions
ap+0.017
apping+0.015
ooks+0.015
aps+0.015
angles+0.015
Neg logit contributions
 disl-0.014
 gn-0.011
col-0.010
growth-0.009
themed-0.009
Token child
Token ID1200
Feature activation-9.38e-01

Pos logit contributions
 disl+0.159
col+0.125
igsaw+0.125
handed+0.124
riger+0.124
Neg logit contributions
 eyes-0.055
 friend-0.051
 upon-0.051
 friends-0.047
 me-0.044
Token would
Token ID561
Feature activation+1.12e-01

Pos logit contributions
 disl+0.052
 gn+0.045
handed+0.044
col+0.043
igsaw+0.042
Neg logit contributions
ap-0.026
ooks-0.018
 upon-0.018
apping-0.018
angles-0.017
Token visit
Token ID3187
Feature activation+6.53e-01

Pos logit contributions
ap+0.004
apping+0.003
ooks+0.003
aps+0.003
 upon+0.003
Neg logit contributions
 disl-0.005
 gn-0.005
col-0.004
handed-0.004
growth-0.004
Token the
Token ID262
Feature activation-9.65e-01

Pos logit contributions
ap+0.027
apping+0.024
ooks+0.024
aps+0.023
 upon+0.021
Neg logit contributions
 disl-0.027
 gn-0.022
col-0.021
handed-0.020
growth-0.020
Token bear
Token ID6842
Feature activation+4.13e-02

Pos logit contributions
 disl+0.046
 gn+0.036
col+0.035
growth+0.035
handed+0.034
Neg logit contributions
ap-0.034
 upon-0.029
ooks-0.027
apping-0.027
aps-0.026
Token the
Token ID262
Feature activation-1.66e+00

Pos logit contributions
ap+0.014
apping+0.012
ooks+0.012
aps+0.011
angles+0.011
Neg logit contributions
 disl-0.014
 gn-0.012
col-0.011
handed-0.010
growth-0.010
Token story
Token ID1621
Feature activation+3.81e-01

Pos logit contributions
 disl+0.089
 gn+0.074
col+0.073
growth+0.071
riger+0.070
Neg logit contributions
ap-0.041
 friend-0.040
 friends-0.037
 eyes-0.035
 upon-0.034
Token.
Token ID13
Feature activation-4.93e-02

Pos logit contributions
ap+0.014
ooks+0.012
apping+0.012
aps+0.011
 upon+0.011
Neg logit contributions
 disl-0.017
 gn-0.015
col-0.014
handed-0.014
growth-0.013
Token
Token ID198
Feature activation-8.12e-01

Pos logit contributions
 disl+0.002
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token
Token ID198
Feature activation-4.44e-01

Pos logit contributions
 disl+0.042
 gn+0.037
col+0.034
handed+0.033
growth+0.033
Neg logit contributions
ap-0.026
apping-0.021
ooks-0.021
 upon-0.020
aps-0.020
TokenThe
Token ID464
Feature activation-2.56e+00

Pos logit contributions
 disl+0.022
 gn+0.019
col+0.018
growth+0.017
handed+0.017
Neg logit contributions
ap-0.015
apping-0.012
ooks-0.012
 upon-0.011
aps-0.011
Token stranger
Token ID16195
Feature activation+5.93e-01

Pos logit contributions
 disl+0.135
riger+0.114
 gn+0.114
col+0.111
growth+0.109
Neg logit contributions
 friend-0.072
 friends-0.071
ap-0.059
 by-0.058
 eyes-0.054
Token chuckled
Token ID39889
Feature activation+2.73e-01

Pos logit contributions
ap+0.026
ooks+0.023
apping+0.023
aps+0.022
angles+0.022
Neg logit contributions
 disl-0.022
 gn-0.020
col-0.017
handed-0.016
growth-0.016
Token and
Token ID290
Feature activation-5.07e-01

Pos logit contributions
ap+0.009
ooks+0.008
apping+0.007
angles+0.007
aps+0.007
Neg logit contributions
 disl-0.013
 gn-0.012
col-0.011
handed-0.011
growth-0.010
Token said
Token ID531
Feature activation-1.00e+00

Pos logit contributions
 disl+0.022
 gn+0.018
handed+0.018
col+0.018
growth+0.017
Neg logit contributions
ap-0.019
apping-0.016
ooks-0.016
 upon-0.015
angles-0.015
Token,
Token ID11
Feature activation-5.72e-01

Pos logit contributions
 disl+0.048
handed+0.042
growth+0.041
col+0.041
 gn+0.040
Neg logit contributions
ap-0.035
apping-0.028
ooks-0.026
 upon-0.025
aps-0.025
Token me
Token ID502
Feature activation-1.58e-01

Pos logit contributions
 disl+0.027
 gn+0.025
col+0.023
handed+0.023
growth+0.023
Neg logit contributions
ap-0.009
apping-0.006
ooks-0.006
 upon-0.006
aps-0.006
Token?"
Token ID1701
Feature activation-1.31e-01

Pos logit contributions
 disl+0.008
 gn+0.007
col+0.006
handed+0.006
growth+0.006
Neg logit contributions
ap-0.005
apping-0.004
ooks-0.004
 upon-0.004
aps-0.004
Token
Token ID220
Feature activation-1.85e-01

Pos logit contributions
 disl+0.006
 gn+0.005
col+0.004
handed+0.004
growth+0.004
Neg logit contributions
ap-0.005
apping-0.004
ooks-0.004
 upon-0.004
aps-0.004
Token
Token ID198
Feature activation-6.59e-01

Pos logit contributions
 disl+0.008
 gn+0.007
col+0.007
handed+0.007
growth+0.006
Neg logit contributions
ap-0.007
apping-0.006
ooks-0.006
 upon-0.005
aps-0.005
Token
Token ID198
Feature activation-4.76e-01

Pos logit contributions
 disl+0.033
 gn+0.029
col+0.027
handed+0.026
themed+0.025
Neg logit contributions
ap-0.022
apping-0.017
ooks-0.017
aps-0.016
 upon-0.016
TokenThe
Token ID464
Feature activation-2.55e+00

Pos logit contributions
 disl+0.024
 gn+0.021
col+0.020
handed+0.019
growth+0.018
Neg logit contributions
ap-0.015
apping-0.013
ooks-0.012
 upon-0.011
aps-0.011
Token crab
Token ID32202
Feature activation+4.30e-01

Pos logit contributions
 disl+0.134
riger+0.130
 CRE+0.121
side+0.114
storm+0.113
Neg logit contributions
 friends-0.060
 friend-0.059
 upon-0.057
ap-0.056
 be-0.052
Token replied
Token ID8712
Feature activation-6.12e-01

Pos logit contributions
ap+0.019
ooks+0.017
apping+0.016
angles+0.015
aps+0.015
Neg logit contributions
 disl-0.017
 gn-0.014
col-0.013
handed-0.012
growth-0.012
Token,
Token ID11
Feature activation-3.61e-01

Pos logit contributions
 disl+0.029
 gn+0.023
handed+0.023
col+0.023
growth+0.022
Neg logit contributions
ap-0.024
apping-0.020
ooks-0.018
aps-0.018
 upon-0.018
Token "
Token ID366
Feature activation-7.08e-01

Pos logit contributions
 disl+0.019
 gn+0.016
col+0.016
handed+0.016
growth+0.015
Neg logit contributions
ap-0.011
apping-0.009
ooks-0.009
 upon-0.008
aps-0.008
TokenI
Token ID40
Feature activation-1.13e+00

Pos logit contributions
 disl+0.038
 gn+0.033
handed+0.031
col+0.031
growth+0.031
Neg logit contributions
ap-0.019
apping-0.015
 upon-0.015
aps-0.014
ooks-0.014
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
ap+0.000
apping+0.000
ooks+0.000
 upon+0.000
aps+0.000
Neg logit contributions
 disl-0.000
 gn-0.000
col-0.000
handed-0.000
growth-0.000
Token
Token ID198
Feature activation+8.67e-01

Pos logit contributions
ap+0.052
apping+0.044
angles+0.043
ooks+0.042
ies+0.040
Neg logit contributions
 disl-0.034
 gn-0.028
col-0.026
handed-0.025
growth-0.023
Token"
Token ID1
Feature activation-4.96e-01

Pos logit contributions
ap+0.034
apping+0.027
angles+0.027
ooks+0.026
ags+0.025
Neg logit contributions
 disl-0.037
 gn-0.034
col-0.030
handed-0.030
themed-0.027
TokenWow
Token ID22017
Feature activation+2.67e-02

Pos logit contributions
 disl+0.024
 gn+0.021
handed+0.020
col+0.019
growth+0.019
Neg logit contributions
ap-0.017
apping-0.014
ooks-0.013
aps-0.013
 upon-0.013
Token,
Token ID11
Feature activation-9.19e-01

Pos logit contributions
ap+0.001
apping+0.001
ooks+0.001
aps+0.001
 upon+0.001
Neg logit contributions
 disl-0.001
 gn-0.001
col-0.001
handed-0.001
growth-0.001
Token an
Token ID281
Feature activation-2.55e+00

Pos logit contributions
 disl+0.045
 gn+0.040
col+0.040
handed+0.039
growth+0.038
Neg logit contributions
ap-0.028
 upon-0.022
angles-0.021
 exactly-0.020
apping-0.020
Token oct
Token ID19318
Feature activation-2.68e-01

Pos logit contributions
 disl+0.082
side+0.074
riger+0.065
�+0.063
handed+0.063
Neg logit contributions
ap-0.112
ook-0.102
ooks-0.101
ies-0.100
 for-0.099
Tokenopus
Token ID25790
Feature activation+4.25e-01

Pos logit contributions
 disl+0.008
 gn+0.006
col+0.006
handed+0.006
growth+0.005
Neg logit contributions
ap-0.014
apping-0.012
ooks-0.012
aps-0.012
 upon-0.012
Token.
Token ID13
Feature activation-5.23e-01

Pos logit contributions
ap+0.016
ooks+0.014
apping+0.014
 upon+0.013
angles+0.013
Neg logit contributions
 disl-0.019
 gn-0.016
col-0.015
handed-0.015
growth-0.014
Token It
Token ID632
Feature activation-2.92e-01

Pos logit contributions
 disl+0.025
 gn+0.022
handed+0.021
col+0.021
growth+0.020
Neg logit contributions
ap-0.018
apping-0.015
ooks-0.014
aps-0.014
 upon-0.013
Token is
Token ID318
Feature activation+2.22e-01

Pos logit contributions
 disl+0.016
 gn+0.014
handed+0.013
col+0.013
growth+0.013
Neg logit contributions
ap-0.008
apping-0.006
ooks-0.006
 upon-0.006
aps-0.006
Token Ben
Token ID3932
Feature activation-2.83e-02

Pos logit contributions
ap+0.003
ooks+0.002
apping+0.002
aps+0.002
 upon+0.002
Neg logit contributions
 disl-0.003
 gn-0.003
col-0.003
handed-0.003
growth-0.003
Token were
Token ID547
Feature activation+6.66e-01

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token poor
Token ID3595
Feature activation+1.29e+00

Pos logit contributions
ap+0.020
ooks+0.018
angles+0.017
apping+0.016
aps+0.015
Neg logit contributions
 disl-0.030
 gn-0.027
col-0.026
handed-0.025
themed-0.024
Token.
Token ID13
Feature activation+5.52e-01

Pos logit contributions
ap+0.049
ooks+0.044
angles+0.043
apping+0.040
ook+0.039
Neg logit contributions
 disl-0.052
 gn-0.048
col-0.043
handed-0.041
growth-0.038
Token They
Token ID1119
Feature activation+4.03e-01

Pos logit contributions
ap+0.018
apping+0.014
angles+0.014
ooks+0.014
ies+0.013
Neg logit contributions
 disl-0.027
 gn-0.024
col-0.022
growth-0.022
handed-0.021
Token lived
Token ID5615
Feature activation+1.81e+00

Pos logit contributions
ap+0.016
apping+0.014
ooks+0.014
aps+0.013
angles+0.013
Neg logit contributions
 disl-0.017
 gn-0.015
col-0.014
handed-0.013
growth-0.013
Token in
Token ID287
Feature activation-5.57e-01

Pos logit contributions
ap+0.066
angles+0.064
ooks+0.060
apping+0.053
ucks+0.051
Neg logit contributions
 gn-0.066
 disl-0.061
handed-0.058
col-0.057
side-0.051
Token a
Token ID257
Feature activation-4.83e-01

Pos logit contributions
 disl+0.029
 gn+0.025
col+0.025
handed+0.024
themed+0.023
Neg logit contributions
ap-0.017
ooks-0.013
apping-0.013
 upon-0.013
aps-0.012
Token small
Token ID1402
Feature activation+1.81e-01

Pos logit contributions
 disl+0.022
 gn+0.019
col+0.018
growth+0.017
handed+0.017
Neg logit contributions
ap-0.018
ooks-0.015
apping-0.015
 upon-0.014
aps-0.014
Token hut
Token ID40812
Feature activation+1.00e+00

Pos logit contributions
ap+0.006
apping+0.005
ooks+0.005
angles+0.004
aps+0.004
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.007
handed-0.007
growth-0.007
Token with
Token ID351
Feature activation-4.67e-01

Pos logit contributions
ap+0.035
ooks+0.030
angles+0.030
apping+0.029
aps+0.027
Neg logit contributions
 disl-0.042
 gn-0.039
handed-0.036
col-0.036
growth-0.033
TokenMaybe
Token ID13300
Feature activation+1.97e-01

Pos logit contributions
ap+0.001
apping+0.001
ooks+0.001
 upon+0.001
aps+0.001
Neg logit contributions
 disl-0.001
 gn-0.001
col-0.001
handed-0.001
growth-0.001
Token I
Token ID314
Feature activation-2.07e-01

Pos logit contributions
ap+0.007
apping+0.006
ooks+0.006
aps+0.006
angles+0.006
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.007
handed-0.007
growth-0.007
Token can
Token ID460
Feature activation+5.69e-01

Pos logit contributions
 disl+0.010
 gn+0.009
col+0.008
handed+0.008
growth+0.008
Neg logit contributions
ap-0.007
apping-0.006
ooks-0.006
 upon-0.006
aps-0.006
Token fly
Token ID6129
Feature activation+3.78e-01

Pos logit contributions
ap+0.020
ooks+0.018
apping+0.017
 upon+0.017
angles+0.017
Neg logit contributions
 disl-0.025
 gn-0.022
col-0.020
growth-0.019
handed-0.019
Token and
Token ID290
Feature activation-9.70e-02

Pos logit contributions
ap+0.013
ooks+0.011
apping+0.011
angles+0.010
aps+0.010
Neg logit contributions
 disl-0.018
 gn-0.016
col-0.015
handed-0.014
growth-0.014
Token help
Token ID1037
Feature activation+1.64e+00

Pos logit contributions
 disl+0.005
 gn+0.004
col+0.004
handed+0.004
growth+0.004
Neg logit contributions
ap-0.003
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token people
Token ID661
Feature activation+4.58e-02

Pos logit contributions
ooks+0.056
ap+0.055
apping+0.053
itting+0.053
apped+0.050
Neg logit contributions
 disl-0.068
 gn-0.059
handed-0.054
col-0.054
growth-0.053
Token after
Token ID706
Feature activation+1.53e-01

Pos logit contributions
ap+0.002
ooks+0.001
apping+0.001
 upon+0.001
aps+0.001
Neg logit contributions
 disl-0.002
 gn-0.002
col-0.002
handed-0.002
growth-0.002
Token school
Token ID1524
Feature activation-1.91e-01

Pos logit contributions
ap+0.005
ooks+0.004
apping+0.004
 upon+0.004
aps+0.003
Neg logit contributions
 disl-0.008
 gn-0.007
col-0.006
handed-0.006
growth-0.006
Token.
Token ID13
Feature activation-2.82e-01

Pos logit contributions
 disl+0.009
 gn+0.008
col+0.008
handed+0.008
growth+0.007
Neg logit contributions
ap-0.006
apping-0.005
ooks-0.005
aps-0.005
 upon-0.005
Token Maybe
Token ID6674
Feature activation+1.28e-01

Pos logit contributions
 disl+0.013
 gn+0.012
col+0.011
handed+0.011
growth+0.010
Neg logit contributions
ap-0.010
apping-0.008
ooks-0.008
aps-0.008
 upon-0.008
Token She
Token ID1375
Feature activation+5.16e-01

Pos logit contributions
 disl+0.010
 gn+0.008
col+0.008
handed+0.008
growth+0.008
Neg logit contributions
ap-0.006
apping-0.005
ooks-0.005
 upon-0.004
aps-0.004
Token tells
Token ID4952
Feature activation-3.29e-01

Pos logit contributions
ap+0.021
ooks+0.019
apping+0.019
aps+0.017
angles+0.016
Neg logit contributions
 disl-0.021
 gn-0.019
col-0.017
handed-0.015
growth-0.015
Token him
Token ID683
Feature activation-5.49e-01

Pos logit contributions
 disl+0.018
 gn+0.016
col+0.015
handed+0.015
growth+0.015
Neg logit contributions
ap-0.009
apping-0.007
ooks-0.007
 upon-0.007
aps-0.006
Token about
Token ID546
Feature activation+3.86e-01

Pos logit contributions
 disl+0.028
 gn+0.024
col+0.023
handed+0.023
growth+0.022
Neg logit contributions
ap-0.018
apping-0.014
ooks-0.014
 upon-0.014
aps-0.013
Token her
Token ID607
Feature activation-3.51e-01

Pos logit contributions
ap+0.015
ooks+0.013
apping+0.013
aps+0.011
 upon+0.011
Neg logit contributions
 disl-0.018
 gn-0.015
col-0.014
handed-0.013
growth-0.013
Token fl
Token ID781
Feature activation+2.05e+00

Pos logit contributions
 disl+0.017
 gn+0.014
col+0.014
handed+0.014
growth+0.013
Neg logit contributions
ap-0.013
apping-0.010
 upon-0.010
ooks-0.010
aps-0.010
Tokenute
Token ID1133
Feature activation-6.08e-03

Pos logit contributions
ap+0.059
ooks+0.058
angles+0.052
 upon+0.049
 exactly+0.045
Neg logit contributions
 disl-0.085
col-0.073
 gn-0.073
 pel-0.071
igsaw-0.067
Token.
Token ID13
Feature activation-1.77e-01

Pos logit contributions
 disl+0.000
 gn+0.000
col+0.000
handed+0.000
growth+0.000
Neg logit contributions
ap-0.000
apping-0.000
ooks-0.000
 upon-0.000
aps-0.000
Token Ben
Token ID3932
Feature activation+4.78e-01

Pos logit contributions
 disl+0.009
 gn+0.008
col+0.008
handed+0.008
growth+0.007
Neg logit contributions
ap-0.006
apping-0.004
ooks-0.004
aps-0.004
 upon-0.004
Token wants
Token ID3382
Feature activation+3.25e-01

Pos logit contributions
ap+0.019
ooks+0.017
apping+0.017
aps+0.015
angles+0.015
Neg logit contributions
 disl-0.020
 gn-0.017
col-0.016
handed-0.015
growth-0.014
Token to
Token ID284
Feature activation+3.52e-01

Pos logit contributions
ap+0.012
ooks+0.010
apping+0.010
angles+0.009
 upon+0.009
Neg logit contributions
 disl-0.015
 gn-0.013
col-0.012
handed-0.012
growth-0.012
Token
Token ID198
Feature activation+6.21e-02

Pos logit contributions
 disl+0.003
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token
Token ID198
Feature activation+3.21e-01

Pos logit contributions
ap+0.002
apping+0.002
ooks+0.002
 upon+0.002
aps+0.002
Neg logit contributions
 disl-0.003
 gn-0.002
col-0.002
handed-0.002
growth-0.002
Token"
Token ID1
Feature activation-8.56e-01

Pos logit contributions
ap+0.011
apping+0.009
ooks+0.009
angles+0.009
 upon+0.008
Neg logit contributions
 disl-0.015
 gn-0.013
col-0.013
handed-0.012
growth-0.012
TokenHello
Token ID15496
Feature activation+1.19e-01

Pos logit contributions
 disl+0.040
 gn+0.033
handed+0.032
igsaw+0.031
col+0.031
Neg logit contributions
ap-0.029
apping-0.023
angles-0.022
aps-0.022
ooks-0.022
Token,
Token ID11
Feature activation+3.59e-01

Pos logit contributions
ap+0.005
ooks+0.004
apping+0.004
angles+0.004
aps+0.004
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.004
growth-0.003
Token Mr
Token ID1770
Feature activation+1.74e+00

Pos logit contributions
ap+0.014
ooks+0.012
apping+0.012
aps+0.011
angles+0.011
Neg logit contributions
 disl-0.016
 gn-0.014
col-0.012
handed-0.012
growth-0.011
Token.
Token ID13
Feature activation+5.12e-02

Pos logit contributions
ap+0.073
apping+0.067
ooks+0.067
angles+0.066
 upon+0.064
Neg logit contributions
 disl-0.061
 gn-0.055
handed-0.046
growth-0.044
col-0.042
Token Snow
Token ID7967
Feature activation+2.04e-01

Pos logit contributions
ap+0.002
apping+0.002
ooks+0.002
 upon+0.002
angles+0.002
Neg logit contributions
 disl-0.002
 gn-0.002
col-0.002
handed-0.002
growth-0.001
Tokenman
Token ID805
Feature activation-3.06e-02

Pos logit contributions
ap+0.008
ooks+0.007
apping+0.006
aps+0.006
 upon+0.006
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.007
handed-0.007
growth-0.007
Token!"
Token ID2474
Feature activation+1.50e-01

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token Lily
Token ID20037
Feature activation-1.31e-01

Pos logit contributions
ap+0.006
apping+0.005
ooks+0.005
angles+0.005
aps+0.005
Neg logit contributions
 disl-0.007
 gn-0.006
col-0.005
handed-0.005
growth-0.005
Token one
Token ID530
Feature activation+4.07e-01

Pos logit contributions
ap+0.029
apping+0.025
ooks+0.024
aps+0.023
angles+0.022
Neg logit contributions
 disl-0.032
 gn-0.027
col-0.025
growth-0.024
themed-0.023
Token straight
Token ID3892
Feature activation+1.38e+00

Pos logit contributions
ap+0.014
ooks+0.012
apping+0.012
 upon+0.011
angles+0.011
Neg logit contributions
 disl-0.019
 gn-0.016
col-0.015
handed-0.015
growth-0.014
Token side
Token ID1735
Feature activation-2.74e-01

Pos logit contributions
ap+0.051
apping+0.049
ooks+0.044
aps+0.041
 upon+0.041
Neg logit contributions
 disl-0.057
 gn-0.049
growth-0.047
col-0.045
igsaw-0.042
Token and
Token ID290
Feature activation+2.03e-01

Pos logit contributions
 disl+0.013
 gn+0.012
col+0.011
handed+0.011
growth+0.010
Neg logit contributions
ap-0.009
apping-0.008
ooks-0.008
 upon-0.007
aps-0.007
Token one
Token ID530
Feature activation+9.49e-01

Pos logit contributions
ap+0.007
ooks+0.006
apping+0.006
 upon+0.006
aps+0.006
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.007
handed-0.007
growth-0.007
Token w
Token ID266
Feature activation+2.03e+00

Pos logit contributions
ap+0.034
ooks+0.032
apping+0.030
 upon+0.028
angles+0.028
Neg logit contributions
 disl-0.039
 gn-0.034
col-0.033
growth-0.030
handed-0.030
Tokenavy
Token ID2830
Feature activation+4.99e-01

Pos logit contributions
 upon+0.078
ap+0.071
ooks+0.065
apping+0.064
ipt+0.062
Neg logit contributions
 disl-0.078
igsaw-0.065
col-0.059
growth-0.059
obb-0.058
Token side
Token ID1735
Feature activation+4.77e-01

Pos logit contributions
ap+0.019
apping+0.016
ooks+0.016
 upon+0.015
aps+0.015
Neg logit contributions
 disl-0.022
 gn-0.019
col-0.017
growth-0.017
handed-0.017
Token."
Token ID526
Feature activation+4.08e-01

Pos logit contributions
ap+0.018
ooks+0.015
apping+0.015
 upon+0.014
aps+0.014
Neg logit contributions
 disl-0.021
 gn-0.018
col-0.017
handed-0.016
growth-0.016
Token Tom
Token ID4186
Feature activation+7.10e-01

Pos logit contributions
ap+0.015
ooks+0.013
apping+0.013
angles+0.012
 upon+0.012
Neg logit contributions
 disl-0.019
 gn-0.016
col-0.015
handed-0.015
growth-0.014
Token agrees
Token ID14386
Feature activation-5.42e-01

Pos logit contributions
ap+0.030
ooks+0.028
apping+0.027
 upon+0.024
aps+0.024
Neg logit contributions
 disl-0.028
 gn-0.024
col-0.022
handed-0.019
growth-0.019
TokenOnce
Token ID7454
Feature activation+2.68e-01

Pos logit contributions
ap+0.001
apping+0.001
ooks+0.001
 upon+0.001
aps+0.001
Neg logit contributions
 disl-0.001
 gn-0.001
col-0.001
handed-0.001
growth-0.001
Token there
Token ID612
Feature activation-3.47e-01

Pos logit contributions
ap+0.003
ooks+0.002
apping+0.002
aps+0.001
angles+0.001
Neg logit contributions
 disl-0.019
 gn-0.017
col-0.016
handed-0.016
growth-0.016
Token was
Token ID373
Feature activation-7.90e-01

Pos logit contributions
 disl+0.018
 gn+0.015
col+0.015
handed+0.014
growth+0.014
Neg logit contributions
ap-0.011
apping-0.009
ooks-0.009
 upon-0.009
aps-0.008
Token a
Token ID257
Feature activation+3.27e-01

Pos logit contributions
 disl+0.043
 gn+0.037
col+0.035
handed+0.035
riger+0.034
Neg logit contributions
ap-0.023
apping-0.018
ooks-0.018
 upon-0.017
aps-0.017
Token boy
Token ID2933
Feature activation+6.81e-03

Pos logit contributions
ap+0.014
ooks+0.012
apping+0.012
angles+0.011
aps+0.011
Neg logit contributions
 disl-0.013
 gn-0.011
col-0.010
handed-0.010
growth-0.009
Token.
Token ID13
Feature activation+4.96e-02

Pos logit contributions
ap+0.000
apping+0.000
ooks+0.000
 upon+0.000
aps+0.000
Neg logit contributions
 disl-0.000
 gn-0.000
col-0.000
handed-0.000
growth-0.000
Token He
Token ID679
Feature activation+9.85e-02

Pos logit contributions
ap+0.002
apping+0.001
ooks+0.001
 upon+0.001
angles+0.001
Neg logit contributions
 disl-0.002
 gn-0.002
col-0.002
handed-0.002
growth-0.002
Token was
Token ID373
Feature activation+6.13e-01

Pos logit contributions
ap+0.003
apping+0.003
ooks+0.003
angles+0.003
aps+0.003
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.004
growth-0.004
Token three
Token ID1115
Feature activation-4.57e-01

Pos logit contributions
ap+0.024
ooks+0.021
angles+0.021
apping+0.020
aps+0.019
Neg logit contributions
 disl-0.025
 gn-0.022
col-0.020
handed-0.019
themed-0.018
Token years
Token ID812
Feature activation-6.66e-01

Pos logit contributions
 disl+0.024
 gn+0.021
col+0.020
handed+0.020
growth+0.019
Neg logit contributions
ap-0.014
apping-0.011
ooks-0.011
aps-0.010
 upon-0.010
Token old
Token ID1468
Feature activation+6.68e-01

Pos logit contributions
 disl+0.031
 gn+0.026
col+0.025
growth+0.024
handed+0.024
Neg logit contributions
ap-0.024
apping-0.021
ooks-0.021
 upon-0.019
aps-0.019
Token of
Token ID286
Feature activation+2.01e-01

Pos logit contributions
 disl+0.024
 gn+0.021
col+0.020
handed+0.020
growth+0.020
Neg logit contributions
ap-0.008
ooks-0.006
apping-0.006
aps-0.005
 upon-0.005
Token trees
Token ID7150
Feature activation-1.42e+00

Pos logit contributions
ap+0.008
apping+0.007
ooks+0.007
angles+0.006
aps+0.006
Neg logit contributions
 disl-0.009
 gn-0.007
col-0.007
handed-0.007
growth-0.006
Token and
Token ID290
Feature activation-9.02e-01

Pos logit contributions
 disl+0.093
handed+0.077
growth+0.076
igsaw+0.075
 gn+0.074
Neg logit contributions
ap-0.025
apping-0.019
 upon-0.019
 for-0.019
aps-0.018
Token flowers
Token ID12734
Feature activation-1.26e+00

Pos logit contributions
 disl+0.045
 gn+0.037
handed+0.037
col+0.036
igsaw+0.036
Neg logit contributions
ap-0.027
ooks-0.024
 upon-0.023
angles-0.021
 eyes-0.021
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
 disl+0.080
igsaw+0.066
handed+0.065
growth+0.065
 gn+0.064
Neg logit contributions
ap-0.025
 upon-0.019
apping-0.019
aps-0.019
 for-0.018
Token Tim
Token ID5045
Feature activation+4.30e-01

Pos logit contributions
ap+0.005
apping+0.004
ooks+0.004
aps+0.004
angles+0.004
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.004
growth-0.004
Tokenmy
Token ID1820
Feature activation-6.54e-02

Pos logit contributions
ap+0.013
apping+0.011
ooks+0.011
aps+0.010
angles+0.009
Neg logit contributions
 disl-0.022
 gn-0.019
col-0.018
handed-0.018
themed-0.017
Token's
Token ID338
Feature activation-1.31e-01

Pos logit contributions
 disl+0.003
 gn+0.003
col+0.003
handed+0.003
growth+0.003
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token yard
Token ID12699
Feature activation+2.72e-01

Pos logit contributions
 disl+0.007
 gn+0.006
col+0.006
handed+0.005
growth+0.005
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token was
Token ID373
Feature activation+1.54e-01

Pos logit contributions
ap+0.010
apping+0.008
ooks+0.008
aps+0.008
angles+0.008
Neg logit contributions
 disl-0.012
 gn-0.011
col-0.010
handed-0.010
growth-0.009
Token very
Token ID845
Feature activation+8.98e-02

Pos logit contributions
ap+0.006
apping+0.005
ooks+0.005
aps+0.005
angles+0.005
Neg logit contributions
 disl-0.007
 gn-0.006
col-0.005
handed-0.005
growth-0.005
Token and
Token ID290
Feature activation-4.11e-01

Pos logit contributions
ap+0.006
ooks+0.005
apping+0.005
angles+0.004
aps+0.004
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.008
handed-0.007
growth-0.007
Token together
Token ID1978
Feature activation-4.79e-02

Pos logit contributions
 disl+0.018
 gn+0.015
handed+0.014
col+0.014
growth+0.014
Neg logit contributions
ap-0.016
apping-0.013
ooks-0.013
 upon-0.013
aps-0.012
Token they
Token ID484
Feature activation+9.97e-02

Pos logit contributions
 disl+0.002
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.001
aps-0.001
Token went
Token ID1816
Feature activation-6.06e-02

Pos logit contributions
ap+0.004
apping+0.003
ooks+0.003
aps+0.003
 upon+0.003
Neg logit contributions
 disl-0.004
 gn-0.004
col-0.004
handed-0.003
growth-0.003
Token inside
Token ID2641
Feature activation+1.01e+00

Pos logit contributions
 disl+0.003
 gn+0.003
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token.
Token ID13
Feature activation+3.23e-01

Pos logit contributions
ap+0.040
angles+0.035
ooks+0.034
apping+0.034
ags+0.032
Neg logit contributions
 disl-0.040
 gn-0.035
col-0.033
handed-0.032
 dis-0.030
Token Max
Token ID5436
Feature activation-4.16e-01

Pos logit contributions
ap+0.012
apping+0.010
ooks+0.010
 upon+0.009
aps+0.009
Neg logit contributions
 disl-0.014
 gn-0.012
col-0.011
handed-0.011
growth-0.011
Token was
Token ID373
Feature activation-2.40e-01

Pos logit contributions
 disl+0.020
 gn+0.017
col+0.016
handed+0.016
growth+0.016
Neg logit contributions
ap-0.014
ooks-0.011
apping-0.011
 upon-0.011
angles-0.011
Token careful
Token ID8161
Feature activation+1.52e-01

Pos logit contributions
 disl+0.011
 gn+0.009
col+0.009
handed+0.009
growth+0.008
Neg logit contributions
ap-0.009
apping-0.008
ooks-0.008
 upon-0.007
aps-0.007
Token as
Token ID355
Feature activation-2.80e-02

Pos logit contributions
ap+0.005
ooks+0.004
apping+0.004
angles+0.004
aps+0.004
Neg logit contributions
 disl-0.007
 gn-0.006
col-0.006
handed-0.006
growth-0.006
Token he
Token ID339
Feature activation-2.61e-01

Pos logit contributions
 disl+0.001
 gn+0.001
col+0.001
handed+0.001
growth+0.001
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token.
Token ID13
Feature activation+2.61e-01

Pos logit contributions
ap+0.015
ooks+0.013
angles+0.012
apping+0.012
 upon+0.011
Neg logit contributions
 disl-0.019
 gn-0.016
col-0.015
handed-0.015
growth-0.014
Token "
Token ID366
Feature activation-3.60e-01

Pos logit contributions
ap+0.009
ooks+0.008
apping+0.008
angles+0.008
 upon+0.007
Neg logit contributions
 disl-0.012
 gn-0.010
col-0.010
growth-0.009
handed-0.009
TokenHelp
Token ID22087
Feature activation+4.01e-02

Pos logit contributions
 disl+0.018
 gn+0.015
col+0.014
handed+0.014
growth+0.013
Neg logit contributions
ap-0.012
apping-0.010
ooks-0.009
 upon-0.009
aps-0.009
Token us
Token ID514
Feature activation+9.80e-02

Pos logit contributions
ap+0.002
apping+0.001
ooks+0.001
aps+0.001
 upon+0.001
Neg logit contributions
 disl-0.002
 gn-0.001
col-0.001
handed-0.001
growth-0.001
Token!"
Token ID2474
Feature activation+2.61e-01

Pos logit contributions
ap+0.004
apping+0.003
ooks+0.003
angles+0.003
aps+0.003
Neg logit contributions
 disl-0.004
 gn-0.003
col-0.003
handed-0.003
growth-0.003
Token
Token ID198
Feature activation+2.17e-01

Pos logit contributions
ap+0.011
ooks+0.009
apping+0.009
angles+0.009
 upon+0.008
Neg logit contributions
 disl-0.011
 gn-0.009
col-0.009
handed-0.008
growth-0.008
Token
Token ID198
Feature activation+1.76e-01

Pos logit contributions
ap+0.009
apping+0.007
ooks+0.007
angles+0.007
 upon+0.007
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.007
handed-0.007
growth-0.007
TokenThe
Token ID464
Feature activation-1.39e-01

Pos logit contributions
ap+0.007
apping+0.006
ooks+0.006
 upon+0.005
angles+0.005
Neg logit contributions
 disl-0.008
 gn-0.007
col-0.006
handed-0.006
growth-0.006
Token teacher
Token ID4701
Feature activation+9.03e-01

Pos logit contributions
 disl+0.007
 gn+0.006
col+0.005
handed+0.005
growth+0.005
Neg logit contributions
ap-0.005
apping-0.004
ooks-0.004
 upon-0.004
aps-0.004
Token hears
Token ID23180
Feature activation-1.86e-01

Pos logit contributions
ap+0.036
ooks+0.034
apping+0.032
angles+0.030
 upon+0.030
Neg logit contributions
 disl-0.035
 gn-0.032
col-0.029
growth-0.027
handed-0.026
Token them
Token ID606
Feature activation+9.60e-02

Pos logit contributions
 disl+0.010
 gn+0.009
col+0.009
handed+0.008
growth+0.008
Neg logit contributions
ap-0.005
apping-0.004
ooks-0.004
 upon-0.004
aps-0.004
Token goodbye
Token ID24829
Feature activation-9.50e-02

Pos logit contributions
 disl+0.016
 gn+0.013
col+0.013
handed+0.013
growth+0.012
Neg logit contributions
ap-0.013
apping-0.011
ooks-0.011
 upon-0.010
aps-0.010
Token to
Token ID284
Feature activation+5.90e-01

Pos logit contributions
 disl+0.005
 gn+0.004
col+0.004
handed+0.004
growth+0.004
Neg logit contributions
ap-0.003
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token her
Token ID607
Feature activation-3.72e-01

Pos logit contributions
ap+0.022
apping+0.019
ooks+0.019
aps+0.018
 upon+0.017
Neg logit contributions
 disl-0.027
 gn-0.023
col-0.021
handed-0.021
growth-0.021
Token friends
Token ID2460
Feature activation-3.89e-02

Pos logit contributions
 disl+0.019
 gn+0.016
col+0.016
handed+0.015
growth+0.015
Neg logit contributions
ap-0.012
apping-0.009
ooks-0.009
 upon-0.009
aps-0.009
Token and
Token ID290
Feature activation+1.55e-01

Pos logit contributions
 disl+0.002
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token went
Token ID1816
Feature activation+1.05e-01

Pos logit contributions
ap+0.006
apping+0.005
ooks+0.005
aps+0.005
 upon+0.005
Neg logit contributions
 disl-0.007
 gn-0.006
col-0.005
handed-0.005
growth-0.005
Token home
Token ID1363
Feature activation-1.12e-02

Pos logit contributions
ap+0.004
ooks+0.003
apping+0.003
angles+0.003
 upon+0.003
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.004
growth-0.004
Token with
Token ID351
Feature activation-7.72e-01

Pos logit contributions
 disl+0.001
 gn+0.000
col+0.000
handed+0.000
growth+0.000
Neg logit contributions
ap-0.000
apping-0.000
ooks-0.000
 upon-0.000
aps-0.000
Token her
Token ID607
Feature activation-7.13e-01

Pos logit contributions
 disl+0.045
 gn+0.039
col+0.037
handed+0.036
growth+0.035
Neg logit contributions
ap-0.019
ooks-0.014
 upon-0.013
apping-0.013
aps-0.013
Token mom
Token ID1995
Feature activation-5.01e-02

Pos logit contributions
 disl+0.038
 gn+0.032
col+0.031
handed+0.030
growth+0.030
Neg logit contributions
ap-0.021
 upon-0.017
apping-0.017
ooks-0.016
aps-0.016
Token,
Token ID11
Feature activation+1.08e+00

Pos logit contributions
 disl+0.003
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token it
Token ID340
Feature activation-4.45e-01

Pos logit contributions
ap+0.000
apping+0.000
ooks+0.000
 upon+0.000
aps+0.000
Neg logit contributions
 disl-0.000
 gn-0.000
col-0.000
handed-0.000
growth-0.000
Token."
Token ID526
Feature activation-3.26e-01

Pos logit contributions
 disl+0.022
 gn+0.020
col+0.019
handed+0.018
growth+0.018
Neg logit contributions
ap-0.015
apping-0.012
ooks-0.011
 upon-0.011
aps-0.011
Token
Token ID198
Feature activation-3.04e-01

Pos logit contributions
 disl+0.016
 gn+0.014
col+0.013
handed+0.013
growth+0.012
Neg logit contributions
ap-0.011
apping-0.009
ooks-0.009
aps-0.009
 upon-0.009
Token
Token ID198
Feature activation+4.27e-01

Pos logit contributions
 disl+0.014
 gn+0.012
col+0.012
handed+0.011
growth+0.011
Neg logit contributions
ap-0.011
apping-0.009
ooks-0.009
aps-0.009
 upon-0.009
TokenL
Token ID43
Feature activation+7.93e-01

Pos logit contributions
ap+0.013
apping+0.011
ooks+0.011
angles+0.010
aps+0.010
Neg logit contributions
 disl-0.022
 gn-0.019
col-0.018
handed-0.018
themed-0.017
Tokenily
Token ID813
Feature activation-7.33e-02

Pos logit contributions
ap+0.039
apping+0.033
ooks+0.032
angles+0.031
 upon+0.031
Neg logit contributions
 disl-0.026
 gn-0.021
handed-0.020
col-0.019
growth-0.018
Token and
Token ID290
Feature activation+4.94e-01

Pos logit contributions
 disl+0.004
 gn+0.003
col+0.003
handed+0.003
growth+0.003
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token Ben
Token ID3932
Feature activation+4.08e-01

Pos logit contributions
ap+0.018
ooks+0.017
apping+0.016
aps+0.014
ook+0.014
Neg logit contributions
 disl-0.022
 gn-0.018
col-0.017
growth-0.016
handed-0.016
Token shake
Token ID13279
Feature activation+1.26e-01

Pos logit contributions
ap+0.015
ooks+0.013
apping+0.013
aps+0.012
angles+0.012
Neg logit contributions
 disl-0.018
 gn-0.016
col-0.015
handed-0.014
growth-0.014
Token their
Token ID511
Feature activation+6.18e-01

Pos logit contributions
ap+0.004
ooks+0.003
apping+0.003
 upon+0.003
angles+0.003
Neg logit contributions
 disl-0.007
 gn-0.006
col-0.005
handed-0.005
growth-0.005
Token heads
Token ID6665
Feature activation-9.06e-01

Pos logit contributions
ap+0.022
ooks+0.019
apping+0.019
angles+0.017
 upon+0.017
Neg logit contributions
 disl-0.028
 gn-0.025
col-0.023
handed-0.022
growth-0.022
Token door
Token ID3420
Feature activation-1.25e-01

Pos logit contributions
 disl+0.007
 gn+0.006
col+0.006
handed+0.006
growth+0.006
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.002
aps-0.002
Token.
Token ID13
Feature activation+7.19e-02

Pos logit contributions
 disl+0.006
 gn+0.005
col+0.005
handed+0.005
growth+0.005
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token Anna
Token ID11735
Feature activation+6.93e-01

Pos logit contributions
ap+0.003
apping+0.002
ooks+0.002
 upon+0.002
angles+0.002
Neg logit contributions
 disl-0.003
 gn-0.003
col-0.003
handed-0.003
growth-0.003
Token's
Token ID338
Feature activation+2.87e-01

Pos logit contributions
ap+0.029
ooks+0.027
apping+0.026
angles+0.023
 upon+0.023
Neg logit contributions
 disl-0.027
 gn-0.024
col-0.022
growth-0.020
handed-0.020
Token mom
Token ID1995
Feature activation+2.74e-01

Pos logit contributions
ap+0.010
ooks+0.009
apping+0.008
angles+0.008
 upon+0.008
Neg logit contributions
 disl-0.013
 gn-0.012
col-0.011
handed-0.011
growth-0.010
Token opens
Token ID9808
Feature activation-3.21e-01

Pos logit contributions
ap+0.011
ooks+0.009
apping+0.009
 upon+0.008
angles+0.008
Neg logit contributions
 disl-0.012
 gn-0.010
col-0.010
handed-0.009
growth-0.009
Token the
Token ID262
Feature activation+9.75e-02

Pos logit contributions
 disl+0.017
 gn+0.014
col+0.014
handed+0.013
growth+0.013
Neg logit contributions
ap-0.010
ooks-0.008
apping-0.008
 upon-0.008
aps-0.008
Token door
Token ID3420
Feature activation-2.13e-01

Pos logit contributions
ap+0.003
apping+0.002
ooks+0.002
angles+0.002
 upon+0.002
Neg logit contributions
 disl-0.005
 gn-0.005
col-0.004
handed-0.004
growth-0.004
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
 disl+0.011
 gn+0.010
col+0.009
handed+0.009
growth+0.009
Neg logit contributions
ap-0.007
apping-0.005
ooks-0.005
aps-0.005
 upon-0.005
Token She
Token ID1375
Feature activation+3.11e-01

Pos logit contributions
 disl+0.005
 gn+0.005
col+0.004
handed+0.004
growth+0.004
Neg logit contributions
ap-0.003
apping-0.003
ooks-0.003
aps-0.003
 upon-0.003
Token sees
Token ID7224
Feature activation-9.12e-01

Pos logit contributions
ap+0.013
apping+0.011
ooks+0.011
aps+0.010
 upon+0.010
Neg logit contributions
 disl-0.013
 gn-0.011
col-0.010
handed-0.010
growth-0.010
Tokenheart
Token ID11499
Feature activation+1.67e-01

Pos logit contributions
 disl+0.003
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token.
Token ID13
Feature activation+6.37e-01

Pos logit contributions
ap+0.006
apping+0.005
ooks+0.005
 upon+0.005
aps+0.005
Neg logit contributions
 disl-0.008
 gn-0.006
col-0.006
handed-0.006
growth-0.006
Token But
Token ID887
Feature activation+9.82e-01

Pos logit contributions
ap+0.023
ooks+0.020
apping+0.019
 upon+0.019
angles+0.019
Neg logit contributions
 disl-0.029
 gn-0.024
col-0.023
handed-0.022
growth-0.022
Token you
Token ID345
Feature activation-4.21e-03

Pos logit contributions
ap+0.033
ooks+0.031
apping+0.030
aps+0.029
ags+0.026
Neg logit contributions
 disl-0.045
 gn-0.040
col-0.035
 dis-0.033
handed-0.033
Token can
Token ID460
Feature activation+3.55e-01

Pos logit contributions
 disl+0.000
 gn+0.000
col+0.000
handed+0.000
growth+0.000
Neg logit contributions
ap-0.000
apping-0.000
ooks-0.000
 upon-0.000
aps-0.000
Token play
Token ID711
Feature activation-8.22e-02

Pos logit contributions
ap+0.015
ooks+0.013
apping+0.013
 upon+0.012
aps+0.012
Neg logit contributions
 disl-0.014
 gn-0.012
col-0.011
handed-0.011
growth-0.010
Token with
Token ID351
Feature activation-2.02e-01

Pos logit contributions
 disl+0.005
 gn+0.004
col+0.004
handed+0.004
growth+0.004
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token your
Token ID534
Feature activation-1.66e-01

Pos logit contributions
 disl+0.010
 gn+0.009
col+0.008
handed+0.008
growth+0.008
Neg logit contributions
ap-0.007
apping-0.005
ooks-0.005
 upon-0.005
aps-0.005
Token football
Token ID4346
Feature activation-7.88e-01

Pos logit contributions
 disl+0.008
 gn+0.007
col+0.007
handed+0.007
growth+0.007
Neg logit contributions
ap-0.005
apping-0.004
ooks-0.004
 upon-0.004
aps-0.004
Token outside
Token ID2354
Feature activation-1.81e-01

Pos logit contributions
 disl+0.041
 gn+0.035
col+0.033
handed+0.032
growth+0.032
Neg logit contributions
ap-0.026
apping-0.020
 upon-0.019
ooks-0.019
aps-0.019
Token when
Token ID618
Feature activation+6.60e-01

Pos logit contributions
 disl+0.008
 gn+0.007
col+0.007
handed+0.007
growth+0.006
Neg logit contributions
ap-0.007
apping-0.005
ooks-0.005
 upon-0.005
aps-0.005
Token but
Token ID475
Feature activation-3.34e-01

Pos logit contributions
ap+0.025
ooks+0.021
apping+0.020
aps+0.020
angles+0.020
Neg logit contributions
 disl-0.027
 gn-0.024
col-0.022
handed-0.021
growth-0.020
Token his
Token ID465
Feature activation-1.81e-01

Pos logit contributions
 disl+0.014
 gn+0.012
col+0.012
handed+0.011
growth+0.011
Neg logit contributions
ap-0.013
ooks-0.011
apping-0.011
 upon-0.011
aps-0.010
Token friend
Token ID1545
Feature activation+6.28e-01

Pos logit contributions
 disl+0.012
 gn+0.011
col+0.011
handed+0.011
growth+0.010
Neg logit contributions
ap-0.003
apping-0.002
ooks-0.002
aps-0.001
 upon-0.001
Token Sally
Token ID25737
Feature activation-9.79e-02

Pos logit contributions
ap+0.027
ooks+0.023
apping+0.023
aps+0.023
angles+0.022
Neg logit contributions
 disl-0.024
 gn-0.021
col-0.019
handed-0.018
themed-0.017
Token couldn
Token ID3521
Feature activation-4.31e-01

Pos logit contributions
 disl+0.004
 gn+0.004
col+0.003
handed+0.003
growth+0.003
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token't
Token ID470
Feature activation-9.24e-02

Pos logit contributions
 disl+0.017
 gn+0.014
col+0.013
handed+0.012
growth+0.012
Neg logit contributions
ap-0.019
apping-0.016
ooks-0.016
aps-0.015
 upon-0.015
Token spell
Token ID4822
Feature activation-8.47e-02

Pos logit contributions
 disl+0.004
 gn+0.004
col+0.003
handed+0.003
growth+0.003
Neg logit contributions
ap-0.003
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token very
Token ID845
Feature activation+4.36e-01

Pos logit contributions
 disl+0.004
 gn+0.004
col+0.003
handed+0.003
growth+0.003
Neg logit contributions
ap-0.003
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token well
Token ID880
Feature activation-1.29e+00

Pos logit contributions
ap+0.015
ooks+0.012
apping+0.012
 upon+0.011
aps+0.011
Neg logit contributions
 disl-0.021
 gn-0.018
handed-0.017
col-0.017
growth-0.016
Token and
Token ID290
Feature activation-1.99e-01

Pos logit contributions
 disl+0.075
 gn+0.065
growth+0.063
col+0.063
handed+0.061
Neg logit contributions
ap-0.033
apping-0.025
 upon-0.024
 for-0.022
aps-0.021
Token accidentally
Token ID14716
Feature activation-1.27e-01

Pos logit contributions
 disl+0.009
 gn+0.007
col+0.007
handed+0.007
growth+0.007
Neg logit contributions
ap-0.008
apping-0.006
ooks-0.006
 upon-0.006
aps-0.006
Tokenâ„¢
Token ID8151
Feature activation-4.35e-01

Pos logit contributions
 disl+0.034
 gn+0.029
col+0.027
handed+0.026
growth+0.026
Neg logit contributions
ap-0.033
apping-0.028
ooks-0.027
aps-0.026
 upon-0.026
Tokens
Token ID82
Feature activation+6.61e-01

Pos logit contributions
 disl+0.019
 gn+0.016
col+0.015
handed+0.014
growth+0.014
Neg logit contributions
ap-0.017
apping-0.015
ooks-0.014
 upon-0.014
aps-0.014
Token okay
Token ID8788
Feature activation+3.92e-01

Pos logit contributions
ap+0.024
apping+0.021
ooks+0.021
angles+0.020
aps+0.019
Neg logit contributions
 disl-0.030
 gn-0.026
col-0.024
handed-0.023
growth-0.022
Token,
Token ID11
Feature activation-6.03e-02

Pos logit contributions
ap+0.016
ooks+0.014
apping+0.013
angles+0.013
aps+0.013
Neg logit contributions
 disl-0.016
 gn-0.014
col-0.013
handed-0.012
themed-0.012
Token don
Token ID836
Feature activation-7.92e-01

Pos logit contributions
 disl+0.003
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.002
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Tokenâ
Token ID22940
Feature activation-1.01e+00

Pos logit contributions
 disl+0.033
 gn+0.027
col+0.025
handed+0.025
growth+0.024
Neg logit contributions
ap-0.033
apping-0.028
 upon-0.027
ooks-0.027
aps-0.027
Token€
Token ID26391
Feature activation-8.85e-01

Pos logit contributions
 disl+0.034
 gn+0.029
growth+0.026
col+0.026
themed+0.024
Neg logit contributions
ap-0.050
apping-0.042
ags-0.038
ooks-0.038
aps-0.038
Tokenâ„¢
Token ID8151
Feature activation-8.21e-01

Pos logit contributions
 disl+0.035
 gn+0.029
col+0.028
growth+0.026
handed+0.026
Neg logit contributions
ap-0.039
apping-0.033
ooks-0.032
 upon-0.032
aps-0.032
Tokent
Token ID83
Feature activation+8.40e-01

Pos logit contributions
 disl+0.035
 gn+0.029
col+0.027
growth+0.027
handed+0.026
Neg logit contributions
ap-0.034
apping-0.030
 upon-0.029
ooks-0.028
aps-0.027
Token worry
Token ID5490
Feature activation+4.30e-01

Pos logit contributions
ap+0.029
ooks+0.026
angles+0.025
apping+0.025
aps+0.024
Neg logit contributions
 disl-0.039
 gn-0.033
col-0.031
handed-0.030
themed-0.028
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
ap+0.017
ooks+0.015
apping+0.015
angles+0.014
aps+0.014
Neg logit contributions
 disl-0.018
 gn-0.015
col-0.014
handed-0.014
themed-0.013
Token never
Token ID1239
Feature activation-6.74e-01

Pos logit contributions
 disl+0.016
 gn+0.013
col+0.013
handed+0.013
growth+0.012
Neg logit contributions
ap-0.012
ooks-0.010
apping-0.010
 upon-0.010
aps-0.010
Token seen
Token ID1775
Feature activation-1.28e+00

Pos logit contributions
 disl+0.033
 gn+0.028
col+0.028
handed+0.027
growth+0.027
Neg logit contributions
ap-0.021
 upon-0.017
apping-0.017
ooks-0.017
aps-0.015
Token so
Token ID523
Feature activation-1.22e+00

Pos logit contributions
 disl+0.070
 gn+0.058
handed+0.057
riger+0.055
col+0.055
Neg logit contributions
ap-0.035
 upon-0.030
ooks-0.028
apping-0.027
 exactly-0.025
Token many
Token ID867
Feature activation-6.00e-01

Pos logit contributions
 disl+0.063
 gn+0.055
riger+0.053
robat+0.052
col+0.051
Neg logit contributions
ap-0.036
 upon-0.031
apping-0.030
ooks-0.030
aps-0.029
Token beautiful
Token ID4950
Feature activation-6.55e-01

Pos logit contributions
 disl+0.029
 gn+0.024
handed+0.023
col+0.023
growth+0.022
Neg logit contributions
ap-0.020
ooks-0.017
 upon-0.017
apping-0.017
aps-0.016
Token things
Token ID1243
Feature activation-1.51e+00

Pos logit contributions
 disl+0.033
 gn+0.028
handed+0.027
col+0.026
growth+0.026
Neg logit contributions
ap-0.021
ooks-0.017
apping-0.017
 upon-0.016
ies-0.016
Token before
Token ID878
Feature activation+9.56e-02

Pos logit contributions
 disl+0.084
 gn+0.068
growth+0.068
handed+0.067
col+0.065
Neg logit contributions
ap-0.042
 upon-0.034
apping-0.034
 for-0.032
ooks-0.029
Token.
Token ID13
Feature activation-8.75e-02

Pos logit contributions
ap+0.003
ooks+0.003
apping+0.003
angles+0.003
aps+0.003
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.004
growth-0.004
Token She
Token ID1375
Feature activation+4.65e-02

Pos logit contributions
 disl+0.003
 gn+0.003
col+0.003
handed+0.003
growth+0.002
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token watched
Token ID7342
Feature activation-3.99e-02

Pos logit contributions
ap+0.002
apping+0.002
ooks+0.002
aps+0.001
 upon+0.001
Neg logit contributions
 disl-0.002
 gn-0.002
col-0.002
handed-0.002
growth-0.002
Token the
Token ID262
Feature activation-1.09e-01

Pos logit contributions
 disl+0.002
 gn+0.002
col+0.002
handed+0.002
growth+0.002
Neg logit contributions
ap-0.001
apping-0.001
ooks-0.001
 upon-0.001
aps-0.001
Token
Token ID198
Feature activation+3.44e-01

Pos logit contributions
ap+0.025
apping+0.021
ooks+0.021
angles+0.020
 upon+0.019
Neg logit contributions
 disl-0.022
 gn-0.019
col-0.017
handed-0.017
growth-0.016
TokenL
Token ID43
Feature activation-1.62e-01

Pos logit contributions
ap+0.012
apping+0.010
ooks+0.010
angles+0.009
 upon+0.009
Neg logit contributions
 disl-0.016
 gn-0.014
col-0.013
handed-0.013
growth-0.012
Tokenila
Token ID10102
Feature activation-2.39e-01

Pos logit contributions
 disl+0.008
 gn+0.007
col+0.006
handed+0.006
growth+0.006
Neg logit contributions
ap-0.006
apping-0.005
ooks-0.005
 upon-0.004
aps-0.004
Token shook
Token ID14682
Feature activation-4.55e-01

Pos logit contributions
 disl+0.011
 gn+0.010
col+0.009
handed+0.009
growth+0.009
Neg logit contributions
ap-0.008
apping-0.007
ooks-0.007
 upon-0.006
aps-0.006
Token her
Token ID607
Feature activation-1.08e-01

Pos logit contributions
 disl+0.027
 gn+0.023
handed+0.023
col+0.023
growth+0.022
Neg logit contributions
ap-0.012
apping-0.008
 upon-0.008
ooks-0.008
aps-0.007
Token head
Token ID1182
Feature activation-1.73e+00

Pos logit contributions
 disl+0.006
 gn+0.005
col+0.005
handed+0.005
growth+0.005
Neg logit contributions
ap-0.003
apping-0.002
ooks-0.002
 upon-0.002
aps-0.002
Token.
Token ID13
Feature activation+4.99e-02

Pos logit contributions
 disl+0.113
igsaw+0.092
handed+0.092
 gn+0.091
col+0.091
Neg logit contributions
ap-0.040
apping-0.032
 upon-0.026
aps-0.024
 for-0.023
Token She
Token ID1375
Feature activation+4.35e-01

Pos logit contributions
ap+0.002
apping+0.001
ooks+0.001
 upon+0.001
angles+0.001
Neg logit contributions
 disl-0.002
 gn-0.002
col-0.002
handed-0.002
growth-0.002
Token did
Token ID750
Feature activation-1.33e-01

Pos logit contributions
ap+0.019
ooks+0.016
apping+0.016
aps+0.015
angles+0.015
Neg logit contributions
 disl-0.018
 gn-0.015
col-0.014
growth-0.013
handed-0.013
Token not
Token ID407
Feature activation-5.15e-01

Pos logit contributions
 disl+0.007
 gn+0.006
col+0.006
handed+0.006
growth+0.006
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
 upon-0.003
aps-0.003
Token know
Token ID760
Feature activation-8.05e-01

Pos logit contributions
 disl+0.023
 gn+0.019
handed+0.019
col+0.018
growth+0.018
Neg logit contributions
ap-0.020
apping-0.016
ooks-0.016
 upon-0.016
aps-0.015
Token to
Token ID284
Feature activation+3.74e-03

Pos logit contributions
 disl+0.010
 gn+0.008
col+0.008
handed+0.008
growth+0.007
Neg logit contributions
ap-0.007
apping-0.005
ooks-0.005
 upon-0.005
aps-0.005
Token safety
Token ID3747
Feature activation-2.29e-01

Pos logit contributions
ap+0.000
apping+0.000
ooks+0.000
 upon+0.000
aps+0.000
Neg logit contributions
 disl-0.000
 gn-0.000
col-0.000
handed-0.000
growth-0.000
Token.
Token ID13
Feature activation+9.05e-02

Pos logit contributions
 disl+0.011
 gn+0.010
col+0.009
handed+0.009
growth+0.009
Neg logit contributions
ap-0.008
apping-0.006
ooks-0.006
 upon-0.006
aps-0.006
Token
Token ID198
Feature activation-6.44e-01

Pos logit contributions
ap+0.004
apping+0.003
ooks+0.003
angles+0.003
 upon+0.003
Neg logit contributions
 disl-0.004
 gn-0.003
col-0.003
handed-0.003
growth-0.003
Token
Token ID198
Feature activation-1.79e-01

Pos logit contributions
 disl+0.033
 gn+0.028
col+0.026
handed+0.025
growth+0.025
Neg logit contributions
ap-0.021
apping-0.018
ooks-0.018
aps-0.017
 upon-0.017
TokenThe
Token ID464
Feature activation-1.56e+00

Pos logit contributions
 disl+0.009
 gn+0.007
col+0.007
handed+0.007
growth+0.007
Neg logit contributions
ap-0.006
apping-0.005
ooks-0.005
 upon-0.005
aps-0.005
Token baby
Token ID5156
Feature activation+8.03e-01

Pos logit contributions
 disl+0.082
col+0.067
 gn+0.063
 CRE+0.062
growth+0.062
Neg logit contributions
ap-0.048
 friends-0.041
 eyes-0.041
 upon-0.040
 friend-0.040
Token was
Token ID373
Feature activation-1.84e-01

Pos logit contributions
ap+0.034
ooks+0.031
apping+0.029
angles+0.029
aps+0.028
Neg logit contributions
 disl-0.030
 gn-0.028
col-0.025
handed-0.025
growth-0.023
Token so
Token ID523
Feature activation-6.54e-01

Pos logit contributions
 disl+0.008
 gn+0.006
col+0.006
handed+0.006
growth+0.006
Neg logit contributions
ap-0.008
apping-0.006
ooks-0.006
 upon-0.006
aps-0.006
Token grateful
Token ID14066
Feature activation+6.13e-01

Pos logit contributions
 disl+0.028
 gn+0.023
growth+0.022
col+0.022
handed+0.021
Neg logit contributions
ap-0.027
apping-0.024
aps-0.022
ooks-0.022
 upon-0.022
Token to
Token ID284
Feature activation+1.03e-01

Pos logit contributions
ap+0.025
ooks+0.022
apping+0.021
angles+0.021
aps+0.020
Neg logit contributions
 disl-0.025
 gn-0.022
col-0.020
handed-0.019
growth-0.018
Token,
Token ID11
Feature activation+2.80e-01

Pos logit contributions
 disl+0.027
 gn+0.023
handed+0.022
col+0.022
growth+0.021
Neg logit contributions
ap-0.023
apping-0.018
 upon-0.017
ooks-0.017
aps-0.017
Token the
Token ID262
Feature activation-8.80e-01

Pos logit contributions
ap+0.011
apping+0.009
ooks+0.009
aps+0.009
angles+0.009
Neg logit contributions
 disl-0.012
 gn-0.010
col-0.010
handed-0.009
growth-0.009
Token bear
Token ID6842
Feature activation-5.43e-01

Pos logit contributions
 disl+0.041
col+0.033
handed+0.032
growth+0.032
themed+0.031
Neg logit contributions
ap-0.033
 upon-0.027
apping-0.026
aps-0.025
ooks-0.025
Token went
Token ID1816
Feature activation+1.68e-02

Pos logit contributions
 disl+0.024
 gn+0.020
col+0.019
handed+0.019
growth+0.018
Neg logit contributions
ap-0.021
apping-0.017
ooks-0.017
 upon-0.016
aps-0.016
Token on
Token ID319
Feature activation+3.91e-01

Pos logit contributions
ap+0.001
apping+0.000
ooks+0.000
 upon+0.000
aps+0.000
Neg logit contributions
 disl-0.001
 gn-0.001
col-0.001
handed-0.001
growth-0.001
Token his
Token ID465
Feature activation-1.53e+00

Pos logit contributions
ap+0.014
apping+0.012
ooks+0.012
angles+0.011
 upon+0.011
Neg logit contributions
 disl-0.018
 gn-0.016
col-0.015
handed-0.015
growth-0.014
Token way
Token ID835
Feature activation+9.12e-01

Pos logit contributions
 disl+0.074
col+0.064
 gn+0.060
handed+0.058
growth+0.057
Neg logit contributions
ap-0.054
 eyes-0.041
 friends-0.039
 for-0.038
 upon-0.038
Token,
Token ID11
Feature activation+6.32e-01

Pos logit contributions
ap+0.036
ooks+0.031
angles+0.031
apping+0.030
ymes+0.029
Neg logit contributions
 disl-0.035
 gn-0.032
handed-0.030
col-0.030
growth-0.027
Token knowing
Token ID6970
Feature activation+1.58e-01

Pos logit contributions
ap+0.026
ooks+0.022
apping+0.021
angles+0.021
aps+0.021
Neg logit contributions
 disl-0.026
 gn-0.024
col-0.021
handed-0.021
growth-0.019
Token that
Token ID326
Feature activation+2.15e-01

Pos logit contributions
ap+0.008
apping+0.007
ooks+0.007
angles+0.007
aps+0.007
Neg logit contributions
 disl-0.005
 gn-0.004
col-0.004
handed-0.003
growth-0.003
Token he
Token ID339
Feature activation+1.91e-01

Pos logit contributions
ap+0.009
apping+0.008
ooks+0.008
aps+0.007
angles+0.007
Neg logit contributions
 disl-0.009
 gn-0.008
col-0.007
handed-0.007
growth-0.007
Token."
Token ID526
Feature activation+1.70e-01

Pos logit contributions
 disl+0.011
 gn+0.010
col+0.009
handed+0.009
growth+0.009
Neg logit contributions
ap-0.008
apping-0.007
ooks-0.007
 upon-0.006
aps-0.006
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
ap+0.007
apping+0.006
ooks+0.006
angles+0.005
 upon+0.005
Neg logit contributions
 disl-0.007
 gn-0.006
col-0.006
handed-0.006
growth-0.005
Token
Token ID198
Feature activation+7.03e-01

Pos logit contributions
ap+0.005
apping+0.004
ooks+0.004
angles+0.004
 upon+0.004
Neg logit contributions
 disl-0.005
 gn-0.005
col-0.004
handed-0.004
growth-0.004
TokenL
Token ID43
Feature activation+9.73e-01

Pos logit contributions
ap+0.023
apping+0.018
ooks+0.018
angles+0.018
 eyes+0.016
Neg logit contributions
 disl-0.035
 gn-0.031
handed-0.029
col-0.029
growth-0.026
Tokenily
Token ID813
Feature activation-6.86e-01

Pos logit contributions
ap+0.048
ooks+0.041
apping+0.039
angles+0.039
 upon+0.038
Neg logit contributions
 disl-0.031
 gn-0.025
handed-0.024
col-0.023
igsaw-0.021
Token felt
Token ID2936
Feature activation-1.59e+00

Pos logit contributions
 disl+0.035
 gn+0.030
col+0.029
handed+0.029
growth+0.029
Neg logit contributions
ap-0.021
apping-0.017
 upon-0.016
ooks-0.015
aps-0.015
Token better
Token ID1365
Feature activation-2.57e-01

Pos logit contributions
 disl+0.079
storm+0.073
col+0.072
 gn+0.072
growth+0.070
Neg logit contributions
ap-0.048
apping-0.037
aps-0.037
 for-0.034
ags-0.032
Token and
Token ID290
Feature activation-4.29e-01

Pos logit contributions
 disl+0.014
 gn+0.012
col+0.012
handed+0.011
growth+0.011
Neg logit contributions
ap-0.008
apping-0.006
ooks-0.006
aps-0.005
 upon-0.005
Token smiled
Token ID13541
Feature activation+2.79e-01

Pos logit contributions
 disl+0.019
 gn+0.016
col+0.015
growth+0.015
handed+0.015
Neg logit contributions
ap-0.017
apping-0.014
ooks-0.013
aps-0.013
 upon-0.013
Token back
Token ID736
Feature activation-1.31e-01

Pos logit contributions
ap+0.010
ooks+0.008
apping+0.008
angles+0.008
 upon+0.007
Neg logit contributions
 disl-0.013
 gn-0.011
col-0.010
handed-0.010
growth-0.010
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
 disl+0.007
 gn+0.006
col+0.006
handed+0.005
growth+0.005
Neg logit contributions
ap-0.004
apping-0.003
ooks-0.003
aps-0.003
 upon-0.003