TOP ACTIVATIONS
MAX = 7.734

. They were called ost ric hes . Lily and Ben
grass and rye grass and al f alf a and cl
that said " O st ric hes ". Sara and Tom
ie . Unlike other ost ric hes , his feathers gl
Yes , they are ost ric hes ," mom said .
ostr ich pen . Ost ric hes are big birds with
did not know what ost ric hes were . They looked
ostr ich pen . Ost ric hes are big birds with
sweet , but salsa is also spicy . Do you remember
went outside and showed Daddy Sil via 's beautiful new coat
out for a walk with Sil via wearing her new coat
should not tease the ost ric hes . They are not
you count how many ost ric hes there are ?"
but be careful . Ost ric hes can be gr umpy
something new - there were tad p oles swimming around in
his eyes and smiled . Pret ending was so much fun
Look , mom , ost ric hes !" "
One day , a bird al ight ed on his windows
too close . The ost ric hes might pe ck you
with your new coat ." Sil via smiled and said ,

MIN ACTIVATIONS
MIN = -5.517

, Lily was getting ready to sleep when she heard a
little girl appeared on the other side of the valley .
tired . He was ready to go to sleep but he
, something happens . The angry man starts to cough .
the hill and got ready to go down .
were hungry . They wanted to eat some soup . But
are hungry . They want to eat meat for lunch .
quickly ran over to the other side of the room and
saw a rabbit on the other side ! The children were
were hungry . They wanted to feed the ducks at the
!" Ben sits on the other chair and says , "
see the sack on the other side of the fence .
are hungry . They want to eat something y ummy .
he couldn 't decide what to eat for lunch . He
small girl went to the other side of the playground and
were hungry . They wanted to eat something y ummy .
saw a cat on the other side of a wire fence
a girl standing on the other side of the hedge .
saw something shiny on the other side of the room .
a boy standing on the other side of the pond .

INTERVAL 4.421 - 7.734
CONTAINS 0.148%

€ � � she was so proud of her brave and
to try . Our barn is getting wet and we need
It looks like you are learning a lot ."
" I am so proud of myself ," he
whistle , Sue . It is easy and fun !"

INTERVAL 1.108 - 4.421
CONTAINS 21.261%

to be a mystery . One day while
sad and miserable . He didn 't have many friends to
. But he kept taking care of it every day .
away together , happy to have found a new friend .
The angel smiled . " That 's a good idea

INTERVAL -2.204 - 1.108
CONTAINS 71.801%

his mom bought the toy car and went home . Tim
. As they were eating , Lily saw a
his passport and runs away . The man is
them and was scared . He kept them quiet with candy
. They reached the middle of the log and

INTERVAL -5.517 - -2.204
CONTAINS 6.790%

they arrived , Joe ran outside . He saw lots of
Her mom said , " L ily , cartoons
fun . After playing , the boy thanked Lily
day , they see a big cage in the park .
the day of Tim 's birthday came . Tim and his
Token.
Token ID13
Feature activation-5.05e-01

Pos logit contributions
 valuable+0.050
b+0.050
ang+0.049
cks+0.048
irling+0.047
Neg logit contributions
 department-0.033
 nod-0.032
 walk-0.031
 moment-0.029
 gaze-0.029
Token They
Token ID1119
Feature activation+3.56e-01

Pos logit contributions
 department+0.009
 nod+0.009
 walk+0.009
 Lee+0.008
 gaze+0.008
Neg logit contributions
b-0.015
 valuable-0.015
ang-0.015
irling-0.015
cks-0.015
Token were
Token ID547
Feature activation+2.82e+00

Pos logit contributions
 valuable+0.011
ang+0.011
b+0.011
cks+0.010
irling+0.010
Neg logit contributions
 department-0.006
 nod-0.006
 walk-0.006
 moment-0.006
 Lee-0.005
Token called
Token ID1444
Feature activation+3.34e-01

Pos logit contributions
hest+0.065
b+0.063
pill+0.063
 valuable+0.062
ang+0.062
Neg logit contributions
 nod-0.072
 walk-0.070
 smile-0.069
 department-0.069
 hug-0.067
Token ost
Token ID23619
Feature activation+2.63e+00

Pos logit contributions
 valuable+0.009
ang+0.009
b+0.009
cks+0.008
 awesome+0.008
Neg logit contributions
 department-0.007
 nod-0.007
 walk-0.007
 moment-0.006
 gaze-0.006
Tokenric
Token ID1173
Feature activation+7.71e+00

Pos logit contributions
 valuable+0.054
ang+0.051
b+0.050
hest+0.048
 awesome+0.048
Neg logit contributions
 department-0.075
 nod-0.073
 walk-0.072
 moment-0.068
 gaze-0.068
Tokenhes
Token ID956
Feature activation+1.55e+00

Pos logit contributions
 valuable+0.100
 elegant+0.091
ang+0.090
ind+0.082
irling+0.081
Neg logit contributions
 nod-0.256
 department-0.255
 moment-0.251
 walk-0.244
 came-0.239
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
 valuable+0.045
b+0.045
ang+0.045
cks+0.043
hest+0.042
Neg logit contributions
 department-0.029
 nod-0.029
 walk-0.028
 moment-0.026
 gaze-0.026
Token Lily
Token ID20037
Feature activation+1.52e+00

Pos logit contributions
b+0.003
 valuable+0.003
irling+0.003
ang+0.003
cks+0.003
Neg logit contributions
 nod-0.002
 Lee-0.002
 department-0.002
 walk-0.002
 gaze-0.002
Token and
Token ID290
Feature activation+6.22e-02

Pos logit contributions
 valuable+0.045
ang+0.045
b+0.045
irling+0.043
cks+0.043
Neg logit contributions
 department-0.028
 nod-0.027
 walk-0.026
 moment-0.025
 gaze-0.024
Token Ben
Token ID3932
Feature activation+1.66e-01

Pos logit contributions
 valuable+0.002
ang+0.002
b+0.002
cks+0.002
pill+0.002
Neg logit contributions
 department-0.001
 nod-0.001
 walk-0.001
 moment-0.001
 hug-0.001
Token grass
Token ID8701
Feature activation+2.22e+00

Pos logit contributions
b+0.050
 valuable+0.049
ang+0.049
cks+0.047
hest+0.047
Neg logit contributions
 nod-0.040
 department-0.040
 walk-0.039
 moment-0.037
 gaze-0.037
Token and
Token ID290
Feature activation+2.33e+00

Pos logit contributions
b+0.060
ang+0.060
 valuable+0.059
cks+0.057
irling+0.056
Neg logit contributions
 nod-0.046
 department-0.045
 walk-0.043
 gaze-0.041
 sister-0.041
Token rye
Token ID47553
Feature activation+2.33e+00

Pos logit contributions
 valuable+0.055
b+0.054
ang+0.052
cks+0.051
 awesome+0.050
Neg logit contributions
 department-0.058
 nod-0.056
 walk-0.054
 Lee-0.052
 moment-0.052
Token grass
Token ID8701
Feature activation+2.82e+00

Pos logit contributions
ang+0.056
 valuable+0.055
b+0.054
hest+0.053
irling+0.052
Neg logit contributions
 walk-0.056
 nod-0.056
 hug-0.053
 moment-0.053
 department-0.053
Token and
Token ID290
Feature activation+2.47e+00

Pos logit contributions
 valuable+0.080
b+0.080
ang+0.080
cks+0.077
irling+0.075
Neg logit contributions
 nod-0.055
 department-0.054
 walk-0.052
 gaze-0.049
 moment-0.049
Token al
Token ID435
Feature activation+7.54e+00

Pos logit contributions
 valuable+0.056
b+0.055
ang+0.053
cks+0.052
 awesome+0.051
Neg logit contributions
 department-0.063
 nod-0.061
 walk-0.059
 Lee-0.058
 moment-0.057
Tokenf
Token ID69
Feature activation+3.92e+00

Pos logit contributions
 valuable+0.153
 awesome+0.145
 elegant+0.143
 amazing+0.142
 annoying+0.136
Neg logit contributions
 nod-0.194
 walk-0.191
 department-0.188
 moment-0.185
pit-0.178
Tokenalf
Token ID1604
Feature activation+2.70e+00

Pos logit contributions
 valuable+0.079
b+0.076
ang+0.075
cks+0.074
 awesome+0.073
Neg logit contributions
 department-0.109
 nod-0.107
 walk-0.107
 hug-0.102
 gaze-0.099
Tokena
Token ID64
Feature activation+3.19e+00

Pos logit contributions
 valuable+0.071
ang+0.070
b+0.070
cks+0.067
irling+0.067
Neg logit contributions
 department-0.058
 nod-0.057
 walk-0.057
 moment-0.053
 gaze-0.052
Token and
Token ID290
Feature activation+2.59e+00

Pos logit contributions
 valuable+0.090
ang+0.088
b+0.088
cks+0.085
irling+0.084
Neg logit contributions
 department-0.063
 nod-0.062
 walk-0.062
 moment-0.057
 gaze-0.056
Token cl
Token ID537
Feature activation+3.49e+00

Pos logit contributions
 valuable+0.065
b+0.063
ang+0.062
cks+0.060
 awesome+0.059
Neg logit contributions
 department-0.060
 nod-0.058
 walk-0.057
 moment-0.054
 gaze-0.053
Token that
Token ID326
Feature activation-2.51e-01

Pos logit contributions
 valuable+0.032
ang+0.031
b+0.031
irling+0.030
cks+0.030
Neg logit contributions
 department-0.021
 nod-0.021
 walk-0.020
 hug-0.019
 moment-0.019
Token said
Token ID531
Feature activation+7.70e-01

Pos logit contributions
 walk+0.004
 department+0.004
 nod+0.004
 hug+0.004
 Lee+0.004
Neg logit contributions
irling-0.008
ang-0.008
ind-0.008
 valuable-0.008
hest-0.008
Token "
Token ID366
Feature activation+9.28e-01

Pos logit contributions
 valuable+0.023
b+0.022
ang+0.022
cks+0.022
 awesome+0.022
Neg logit contributions
 walk-0.014
 department-0.014
 nod-0.014
 moment-0.012
 gaze-0.012
TokenO
Token ID46
Feature activation+1.59e+00

Pos logit contributions
 valuable+0.022
irling+0.022
cks+0.021
 awesome+0.021
 amazing+0.020
Neg logit contributions
 walk-0.024
 moment-0.023
 department-0.022
 visit-0.022
 nod-0.022
Tokenst
Token ID301
Feature activation+2.72e+00

Pos logit contributions
 valuable+0.038
 elegant+0.035
 awesome+0.034
years+0.034
irling+0.034
Neg logit contributions
 walk-0.039
 department-0.039
 Lee-0.037
 nod-0.036
 moment-0.036
Tokenric
Token ID1173
Feature activation+7.39e+00

Pos logit contributions
 valuable+0.065
irling+0.064
years+0.063
 awesome+0.062
 elegant+0.062
Neg logit contributions
 walk-0.064
 department-0.064
 nod-0.064
 gaze-0.061
 Lee-0.060
Tokenhes
Token ID956
Feature activation+1.84e+00

Pos logit contributions
ang+0.096
 valuable+0.095
irling+0.094
 elegant+0.093
b+0.090
Neg logit contributions
 department-0.247
 nod-0.244
 moment-0.233
 walk-0.230
 Lee-0.230
Token".
Token ID1911
Feature activation-1.40e+00

Pos logit contributions
 valuable+0.051
ang+0.051
b+0.050
irling+0.049
cks+0.049
Neg logit contributions
 department-0.037
 nod-0.037
 walk-0.036
 moment-0.033
 hug-0.033
Token Sara
Token ID24799
Feature activation+1.49e+00

Pos logit contributions
 department+0.029
 nod+0.029
 walk+0.028
 moment+0.027
 hug+0.026
Neg logit contributions
 valuable-0.038
ang-0.037
b-0.037
cks-0.036
irling-0.035
Token and
Token ID290
Feature activation-8.01e-01

Pos logit contributions
 valuable+0.046
ang+0.045
b+0.045
cks+0.044
irling+0.043
Neg logit contributions
 department-0.026
 nod-0.026
 walk-0.025
 moment-0.023
 gaze-0.023
Token Tom
Token ID4186
Feature activation-4.86e-01

Pos logit contributions
 department+0.020
 moment+0.017
 ending+0.015
ously+0.015
pit+0.015
Neg logit contributions
cks-0.019
 awesome-0.018
ang-0.018
 amazing-0.018
 impressive-0.018
Tokenie
Token ID494
Feature activation-5.45e-01

Pos logit contributions
 valuable+0.085
b+0.084
ang+0.084
cks+0.081
irling+0.080
Neg logit contributions
 department-0.058
 nod-0.057
 walk-0.056
 moment-0.052
 gaze-0.052
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
 nod+0.007
 walk+0.007
 department+0.007
 hug+0.006
 Lee+0.006
Neg logit contributions
 valuable-0.018
b-0.018
cks-0.018
ang-0.018
irling-0.018
Token Unlike
Token ID12101
Feature activation+2.14e-01

Pos logit contributions
irling+0.032
 valuable+0.032
cks+0.032
b+0.031
isin+0.030
Neg logit contributions
 nod-0.018
 walk-0.017
 gaze-0.017
 Lee-0.017
 department-0.017
Token other
Token ID584
Feature activation-2.06e+00

Pos logit contributions
 valuable+0.006
pill+0.006
hest+0.006
irling+0.006
ang+0.006
Neg logit contributions
 moment-0.004
 walk-0.004
 sister-0.004
 gaze-0.004
 nod-0.004
Token ost
Token ID23619
Feature activation+3.13e+00

Pos logit contributions
 walk+0.050
 department+0.049
 nod+0.049
 gaze+0.046
 moment+0.046
Neg logit contributions
 valuable-0.050
ang-0.049
b-0.048
irling-0.048
hest-0.047
Tokenric
Token ID1173
Feature activation+7.17e+00

Pos logit contributions
 valuable+0.066
ang+0.060
b+0.059
hest+0.058
 awesome+0.058
Neg logit contributions
 department-0.086
 nod-0.085
 walk-0.084
 gaze-0.081
 Lee-0.080
Tokenhes
Token ID956
Feature activation+1.49e+00

Pos logit contributions
 valuable+0.089
ang+0.086
 elegant+0.086
ind+0.080
irling+0.076
Neg logit contributions
 nod-0.235
 moment-0.231
 department-0.231
 walk-0.225
 woke-0.223
Token,
Token ID11
Feature activation+1.23e+00

Pos logit contributions
 valuable+0.044
ang+0.043
b+0.043
irling+0.042
cks+0.042
Neg logit contributions
 department-0.027
 nod-0.027
 walk-0.027
 moment-0.025
 hug-0.025
Token his
Token ID465
Feature activation+5.64e-01

Pos logit contributions
irling+0.037
hest+0.037
cks+0.037
ang+0.035
 valuable+0.035
Neg logit contributions
 walk-0.022
 nod-0.021
 department-0.021
 moment-0.020
 boy-0.020
Token feathers
Token ID27400
Feature activation+1.02e+00

Pos logit contributions
ang+0.016
 valuable+0.015
hest+0.015
b+0.015
cks+0.015
Neg logit contributions
 walk-0.012
 nod-0.011
 department-0.011
 gaze-0.010
 hug-0.010
Token gl
Token ID1278
Feature activation+4.34e+00

Pos logit contributions
b+0.031
 valuable+0.030
ang+0.029
cks+0.029
hest+0.028
Neg logit contributions
 department-0.020
 nod-0.018
 walk-0.018
 Lee-0.018
 moment-0.016
TokenYes
Token ID5297
Feature activation-2.89e+00

Pos logit contributions
 valuable+0.031
b+0.030
cks+0.030
irling+0.030
ang+0.029
Neg logit contributions
 nod-0.023
 moment-0.023
 walk-0.023
 department-0.021
 gaze-0.021
Token,
Token ID11
Feature activation-2.58e+00

Pos logit contributions
 department+0.042
 walk+0.040
 nod+0.039
 moment+0.037
 garage+0.036
Neg logit contributions
 awesome-0.093
ang-0.092
 amazing-0.092
 valuable-0.092
cks-0.090
Token they
Token ID484
Feature activation-1.32e+00

Pos logit contributions
 department+0.054
 nod+0.053
 walk+0.052
 moment+0.049
 hug+0.048
Neg logit contributions
 valuable-0.071
ang-0.070
b-0.070
cks-0.067
irling-0.067
Token are
Token ID389
Feature activation+3.22e+00

Pos logit contributions
 department+0.020
 nod+0.019
 walk+0.018
 moment+0.017
 Lee+0.017
Neg logit contributions
 valuable-0.044
b-0.043
ang-0.043
cks-0.042
 awesome-0.042
Token ost
Token ID23619
Feature activation+2.85e+00

Pos logit contributions
ang+0.104
irling+0.100
years+0.100
ind+0.099
b+0.099
Neg logit contributions
,-0.052
 nod-0.052
 walk-0.051
 department-0.049
 hug-0.049
Tokenric
Token ID1173
Feature activation+7.08e+00

Pos logit contributions
 valuable+0.063
b+0.056
ang+0.056
 awesome+0.056
hest+0.056
Neg logit contributions
 department-0.077
 nod-0.075
 walk-0.074
 gaze-0.071
 Lee-0.070
Tokenhes
Token ID956
Feature activation+2.28e+00

Pos logit contributions
 valuable+0.097
 elegant+0.096
ang+0.093
ind+0.092
irling+0.091
Neg logit contributions
 nod-0.224
 department-0.213
 moment-0.208
 listen-0.208
 came-0.208
Token,"
Token ID553
Feature activation-2.36e-01

Pos logit contributions
 valuable+0.063
ang+0.063
b+0.062
irling+0.060
cks+0.060
Neg logit contributions
 nod-0.046
 department-0.045
 walk-0.045
 hug-0.042
 moment-0.042
Token mom
Token ID1995
Feature activation+9.88e-02

Pos logit contributions
 department+0.005
 walk+0.005
 nod+0.005
 moment+0.004
 ending+0.004
Neg logit contributions
ang-0.006
 valuable-0.006
b-0.006
cks-0.006
 amazing-0.006
Token said
Token ID531
Feature activation-2.03e+00

Pos logit contributions
ang+0.002
b+0.002
 valuable+0.002
cks+0.002
hest+0.002
Neg logit contributions
 department-0.002
 walk-0.002
 nod-0.002
 moment-0.002
 gaze-0.002
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
 garage+0.053
pit+0.052
 department+0.051
 ending+0.051
 Driver+0.050
Neg logit contributions
 valuable-0.039
ang-0.039
ula-0.038
b-0.038
 much-0.038
Token ostr
Token ID42387
Feature activation+1.63e+00

Pos logit contributions
 nod+0.013
 department+0.012
 walk+0.012
ously+0.012
 moment+0.011
Neg logit contributions
 valuable-0.018
ang-0.017
b-0.017
 awesome-0.017
ind-0.017
Tokenich
Token ID488
Feature activation-1.72e+00

Pos logit contributions
ang+0.029
b+0.027
cks+0.026
ind+0.025
irling+0.025
Neg logit contributions
 nod-0.052
 walk-0.051
 department-0.050
 moment-0.050
 hug-0.049
Token pen
Token ID3112
Feature activation-1.33e-01

Pos logit contributions
 department+0.042
 ending+0.042
 moment+0.039
 guy+0.039
 Lee+0.038
Neg logit contributions
ang-0.038
b-0.035
cks-0.035
 valuable-0.034
ind-0.034
Token.
Token ID13
Feature activation-6.81e-01

Pos logit contributions
 department+0.003
 nod+0.003
 gaze+0.003
 moment+0.003
 ending+0.003
Neg logit contributions
ang-0.003
b-0.003
 valuable-0.003
hest-0.003
cks-0.003
Token Ost
Token ID38919
Feature activation+1.83e+00

Pos logit contributions
 nod+0.013
 department+0.013
 walk+0.012
 Lee+0.012
 hug+0.012
Neg logit contributions
b-0.020
 valuable-0.020
ang-0.020
irling-0.020
hest-0.019
Tokenric
Token ID1173
Feature activation+7.04e+00

Pos logit contributions
 valuable+0.038
ang+0.035
b+0.035
irling+0.033
 awesome+0.033
Neg logit contributions
 department-0.051
 nod-0.051
 walk-0.051
 moment-0.049
 gaze-0.049
Tokenhes
Token ID956
Feature activation+8.74e-01

Pos logit contributions
 valuable+0.094
 elegant+0.090
ind+0.080
ang+0.078
ender+0.077
Neg logit contributions
 nod-0.238
 moment-0.225
 department-0.224
 woke-0.220
 walk-0.218
Token are
Token ID389
Feature activation+3.90e+00

Pos logit contributions
 valuable+0.025
b+0.025
ang+0.025
cks+0.024
hest+0.024
Neg logit contributions
 department-0.017
 nod-0.016
 walk-0.016
 moment-0.015
 gaze-0.015
Token big
Token ID1263
Feature activation-6.62e-01

Pos logit contributions
ang+0.087
hest+0.085
pill+0.084
irling+0.082
dd+0.079
Neg logit contributions
 walk-0.101
 nod-0.100
 department-0.100
 hug-0.098
 smile-0.097
Token birds
Token ID10087
Feature activation+7.28e-01

Pos logit contributions
 department+0.013
 nod+0.013
 walk+0.013
 moment+0.012
 gaze+0.012
Neg logit contributions
 valuable-0.018
ang-0.018
b-0.018
cks-0.017
pill-0.017
Token with
Token ID351
Feature activation+1.96e+00

Pos logit contributions
 valuable+0.024
b+0.023
ang+0.023
irling+0.023
 awesome+0.023
Neg logit contributions
 nod-0.011
 walk-0.011
 department-0.010
 hug-0.010
 moment-0.010
Token did
Token ID750
Feature activation+8.13e-01

Pos logit contributions
 department+0.010
 moment+0.009
 Lee+0.009
 ending+0.009
 nod+0.008
Neg logit contributions
 valuable-0.014
b-0.013
ang-0.013
 amazing-0.013
cks-0.013
Token not
Token ID407
Feature activation-1.32e+00

Pos logit contributions
 valuable+0.020
ang+0.020
b+0.020
cks+0.019
hest+0.019
Neg logit contributions
 nod-0.019
 walk-0.019
 department-0.019
 moment-0.017
 gaze-0.017
Token know
Token ID760
Feature activation-3.08e-01

Pos logit contributions
 department+0.028
 nod+0.027
 walk+0.027
 moment+0.025
 Lee+0.025
Neg logit contributions
 valuable-0.036
b-0.035
ang-0.035
cks-0.034
hest-0.033
Token what
Token ID644
Feature activation+5.94e-01

Pos logit contributions
 department+0.006
 nod+0.006
 walk+0.006
 Lee+0.005
 moment+0.005
Neg logit contributions
 valuable-0.009
ang-0.009
b-0.009
irling-0.009
hest-0.008
Token ost
Token ID23619
Feature activation+2.50e+00

Pos logit contributions
ang+0.016
irling+0.016
b+0.016
hest+0.016
 valuable+0.016
Neg logit contributions
 department-0.012
 walk-0.012
 nod-0.012
 moment-0.011
 Lee-0.011
Tokenric
Token ID1173
Feature activation+7.03e+00

Pos logit contributions
 valuable+0.051
b+0.047
ang+0.047
irling+0.045
 awesome+0.045
Neg logit contributions
 department-0.071
 nod-0.070
 walk-0.069
 gaze-0.066
 Lee-0.065
Tokenhes
Token ID956
Feature activation+1.21e-01

Pos logit contributions
 valuable+0.091
 elegant+0.086
ang+0.083
irling+0.083
ind+0.080
Neg logit contributions
 department-0.235
 nod-0.231
 moment-0.226
 walk-0.223
 woke-0.218
Token were
Token ID547
Feature activation+1.13e+00

Pos logit contributions
 valuable+0.003
ang+0.003
b+0.003
cks+0.003
irling+0.003
Neg logit contributions
 department-0.002
 nod-0.002
 walk-0.002
 moment-0.002
 hug-0.002
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
 valuable+0.035
ang+0.035
b+0.035
cks+0.033
irling+0.033
Neg logit contributions
 department-0.019
 nod-0.019
 walk-0.019
 moment-0.017
 hug-0.017
Token They
Token ID1119
Feature activation-1.25e+00

Pos logit contributions
 nod+0.023
 department+0.023
 walk+0.023
 Lee+0.022
 gaze+0.021
Neg logit contributions
b-0.040
 valuable-0.039
ang-0.038
irling-0.038
cks-0.037
Token looked
Token ID3114
Feature activation+2.38e-01

Pos logit contributions
 department+0.023
 nod+0.022
 walk+0.022
 moment+0.021
 Lee+0.020
Neg logit contributions
 valuable-0.037
ang-0.037
b-0.037
cks-0.035
irling-0.035
Token ostr
Token ID42387
Feature activation+1.74e+00

Pos logit contributions
 nod+0.010
 department+0.010
 walk+0.009
ously+0.009
 moment+0.009
Neg logit contributions
 valuable-0.016
 awesome-0.015
b-0.015
ang-0.015
ind-0.015
Tokenich
Token ID488
Feature activation-1.34e+00

Pos logit contributions
ang+0.030
b+0.029
cks+0.027
ind+0.027
irling+0.026
Neg logit contributions
 nod-0.056
 walk-0.055
 department-0.054
 moment-0.054
 hug-0.053
Token pen
Token ID3112
Feature activation+5.46e-02

Pos logit contributions
 ending+0.035
 department+0.035
 guy+0.033
 moment+0.033
 nod+0.032
Neg logit contributions
ang-0.027
b-0.025
 valuable-0.025
pill-0.025
cks-0.025
Token.
Token ID13
Feature activation-7.27e-01

Pos logit contributions
 valuable+0.001
hest+0.001
b+0.001
ang+0.001
ink+0.001
Neg logit contributions
 department-0.001
 nod-0.001
 gaze-0.001
 ending-0.001
 sister-0.001
Token Ost
Token ID38919
Feature activation+1.90e+00

Pos logit contributions
 nod+0.015
 department+0.015
 walk+0.015
 Lee+0.014
 hug+0.014
Neg logit contributions
b-0.020
 valuable-0.020
ang-0.019
irling-0.019
cks-0.019
Tokenric
Token ID1173
Feature activation+7.00e+00

Pos logit contributions
 valuable+0.038
ang+0.036
b+0.036
irling+0.034
 awesome+0.034
Neg logit contributions
 department-0.054
 nod-0.054
 walk-0.053
 moment-0.051
 gaze-0.051
Tokenhes
Token ID956
Feature activation+6.68e-01

Pos logit contributions
 valuable+0.090
 elegant+0.085
ang+0.079
ind+0.078
ender+0.074
Neg logit contributions
 nod-0.238
 moment-0.229
 department-0.228
 woke-0.222
 walk-0.220
Token are
Token ID389
Feature activation+3.67e+00

Pos logit contributions
b+0.019
 valuable+0.019
ang+0.018
cks+0.018
hest+0.017
Neg logit contributions
 department-0.014
 nod-0.013
 walk-0.013
 gaze-0.012
 Lee-0.012
Token big
Token ID1263
Feature activation-9.02e-01

Pos logit contributions
ang+0.085
hest+0.084
pill+0.081
irling+0.081
dd+0.077
Neg logit contributions
 department-0.091
 walk-0.090
 nod-0.089
 smile-0.088
 hug-0.088
Token birds
Token ID10087
Feature activation+5.81e-01

Pos logit contributions
 department+0.017
 nod+0.017
 walk+0.017
 moment+0.016
 gaze+0.016
Neg logit contributions
ang-0.026
 valuable-0.026
b-0.025
cks-0.025
pill-0.025
Token with
Token ID351
Feature activation+1.84e+00

Pos logit contributions
 valuable+0.018
ang+0.018
b+0.018
cks+0.018
irling+0.017
Neg logit contributions
 department-0.010
 nod-0.009
 walk-0.009
 moment-0.008
 gaze-0.008
Token sweet
Token ID6029
Feature activation-9.14e-02

Pos logit contributions
ang+0.093
irling+0.092
b+0.087
hest+0.085
ind+0.084
Neg logit contributions
 nod-0.109
 hug-0.108
 sister-0.103
 smile-0.102
 walk-0.100
Token,
Token ID11
Feature activation+1.91e+00

Pos logit contributions
 department+0.002
 walk+0.002
 gaze+0.002
 nod+0.002
 moment+0.002
Neg logit contributions
 valuable-0.002
ang-0.002
b-0.002
ind-0.002
pill-0.002
Token but
Token ID475
Feature activation+1.13e+00

Pos logit contributions
ang+0.057
 valuable+0.056
b+0.056
irling+0.055
cks+0.055
Neg logit contributions
 nod-0.035
 department-0.034
 walk-0.034
 moment-0.032
 hug-0.031
Token salsa
Token ID49529
Feature activation+3.30e+00

Pos logit contributions
ang+0.031
irling+0.030
hest+0.030
b+0.030
 valuable+0.030
Neg logit contributions
 nod-0.023
 department-0.022
 walk-0.021
 moment-0.021
 sister-0.021
Token is
Token ID318
Feature activation+6.13e+00

Pos logit contributions
ang+0.098
 valuable+0.097
b+0.097
irling+0.095
ind+0.093
Neg logit contributions
 nod-0.059
 department-0.058
 walk-0.057
 hug-0.054
 moment-0.053
Token also
Token ID635
Feature activation+6.98e+00

Pos logit contributions
ang+0.173
irling+0.165
hest+0.161
pill+0.159
years+0.159
Neg logit contributions
 nod-0.125
 listen-0.122
 hug-0.122
,-0.119
 moment-0.119
Token spicy
Token ID26880
Feature activation+1.29e+00

Pos logit contributions
irling+0.166
ang+0.164
hest+0.159
years+0.157
ind+0.150
Neg logit contributions
 nod-0.168
,-0.164
 hug-0.163
 sister-0.162
 listen-0.161
Token.
Token ID13
Feature activation+1.31e+00

Pos logit contributions
 valuable+0.038
b+0.038
ang+0.037
cks+0.036
hest+0.036
Neg logit contributions
 department-0.024
 nod-0.023
 walk-0.023
 gaze-0.021
 moment-0.021
Token Do
Token ID2141
Feature activation+1.34e+00

Pos logit contributions
b+0.039
irling+0.039
 valuable+0.037
ang+0.037
cks+0.036
Neg logit contributions
 nod-0.027
 Lee-0.025
 department-0.025
 hug-0.023
 sister-0.023
Token you
Token ID345
Feature activation-6.43e-01

Pos logit contributions
ang+0.030
 valuable+0.029
b+0.028
irling+0.028
hest+0.027
Neg logit contributions
 walk-0.035
 nod-0.035
 department-0.033
 Lee-0.033
 hug-0.032
Token remember
Token ID3505
Feature activation+3.54e-01

Pos logit contributions
 nod+0.012
 department+0.012
 walk+0.012
 moment+0.011
 hug+0.011
Neg logit contributions
 valuable-0.019
ang-0.019
b-0.018
cks-0.018
irling-0.018
Token went
Token ID1816
Feature activation-1.06e+00

Pos logit contributions
 department+0.002
 nod+0.002
 ending+0.002
 moment+0.002
 Lee+0.002
Neg logit contributions
ang-0.003
b-0.003
 valuable-0.003
irling-0.003
ink-0.003
Token outside
Token ID2354
Feature activation-2.81e+00

Pos logit contributions
 department+0.021
 nod+0.021
 walk+0.019
 ending+0.019
 moment+0.018
Neg logit contributions
 valuable-0.030
ang-0.030
b-0.030
irling-0.028
cks-0.028
Token and
Token ID290
Feature activation-4.91e-01

Pos logit contributions
 department+0.067
 ending+0.061
 guy+0.061
 nod+0.061
pit+0.059
Neg logit contributions
ang-0.067
 valuable-0.062
b-0.061
 amazing-0.059
irling-0.059
Token showed
Token ID3751
Feature activation+1.08e+00

Pos logit contributions
 department+0.009
 nod+0.009
 ending+0.009
 moment+0.008
 walk+0.008
Neg logit contributions
b-0.014
ang-0.014
 valuable-0.013
irling-0.013
 amazing-0.013
Token Daddy
Token ID26926
Feature activation+7.99e-01

Pos logit contributions
irling+0.033
hest+0.032
ang+0.032
 valuable+0.032
cks+0.032
Neg logit contributions
 nod-0.019
 hug-0.019
 walk-0.019
 department-0.019
 Lee-0.018
Token Sil
Token ID4243
Feature activation+6.92e+00

Pos logit contributions
 valuable+0.021
ang+0.021
b+0.020
cks+0.020
irling+0.020
Neg logit contributions
 nod-0.017
 department-0.017
 walk-0.017
 moment-0.016
 hug-0.016
Tokenvia
Token ID8869
Feature activation+1.67e+00

Pos logit contributions
 valuable+0.129
 awesome+0.129
pill+0.127
irling+0.122
 IT+0.121
Neg logit contributions
 department-0.174
 moment-0.170
 nod-0.163
pit-0.162
 walk-0.159
Token's
Token ID338
Feature activation+2.68e-01

Pos logit contributions
 valuable+0.041
b+0.040
ang+0.039
cks+0.038
 awesome+0.037
Neg logit contributions
 department-0.040
 nod-0.039
 walk-0.039
 moment-0.037
 Lee-0.036
Token beautiful
Token ID4950
Feature activation+4.01e-01

Pos logit contributions
 valuable+0.007
b+0.007
ang+0.007
 awesome+0.007
irling+0.007
Neg logit contributions
 department-0.006
 nod-0.005
 walk-0.005
 Lee-0.005
 moment-0.005
Token new
Token ID649
Feature activation+5.23e-01

Pos logit contributions
 valuable+0.011
b+0.011
ang+0.011
irling+0.010
 awesome+0.010
Neg logit contributions
 department-0.008
 nod-0.008
 walk-0.008
 Lee-0.007
 gaze-0.007
Token coat
Token ID13209
Feature activation+3.48e-01

Pos logit contributions
 valuable+0.016
b+0.015
ang+0.015
 awesome+0.014
 amazing+0.014
Neg logit contributions
 department-0.010
 nod-0.009
 walk-0.009
 Lee-0.009
 gaze-0.008
Token out
Token ID503
Feature activation-1.16e+00

Pos logit contributions
 department+0.025
 nod+0.025
 walk+0.024
 moment+0.023
 gaze+0.023
Neg logit contributions
 valuable-0.029
ang-0.029
b-0.028
irling-0.027
cks-0.027
Token for
Token ID329
Feature activation-1.17e+00

Pos logit contributions
 nod+0.022
 department+0.022
 walk+0.021
 gaze+0.020
 moment+0.019
Neg logit contributions
 valuable-0.034
ang-0.033
b-0.033
cks-0.031
irling-0.031
Token a
Token ID257
Feature activation-1.29e+00

Pos logit contributions
 walk+0.025
 hug+0.023
 nod+0.023
 visit+0.022
 party+0.022
Neg logit contributions
ang-0.035
b-0.034
hest-0.034
iky-0.033
 valuable-0.033
Token walk
Token ID2513
Feature activation-6.55e-01

Pos logit contributions
 department+0.021
 nod+0.020
 Lee+0.019
pit+0.019
 sister+0.017
Neg logit contributions
 valuable-0.043
b-0.041
 awesome-0.039
 amazing-0.039
ang-0.039
Token with
Token ID351
Feature activation+2.49e-01

Pos logit contributions
 department+0.014
 nod+0.014
 gaze+0.013
 walk+0.013
 hug+0.013
Neg logit contributions
ang-0.018
b-0.017
 valuable-0.017
hest-0.016
 amazing-0.016
Token Sil
Token ID4243
Feature activation+6.88e+00

Pos logit contributions
b+0.007
hest+0.007
 valuable+0.007
ang+0.007
cks+0.007
Neg logit contributions
 walk-0.005
 nod-0.005
 hug-0.005
 department-0.005
 moment-0.005
Tokenvia
Token ID8869
Feature activation+5.52e-01

Pos logit contributions
 valuable+0.142
 awesome+0.141
 impressive+0.131
 amazing+0.131
pill+0.129
Neg logit contributions
 department-0.155
 moment-0.155
 walk-0.148
pit-0.148
 nod-0.146
Token wearing
Token ID5762
Feature activation+1.86e+00

Pos logit contributions
 valuable+0.015
b+0.015
ang+0.015
pill+0.014
 amazing+0.014
Neg logit contributions
 department-0.011
 nod-0.011
 walk-0.010
 Lee-0.010
 gaze-0.010
Token her
Token ID607
Feature activation+6.83e-01

Pos logit contributions
ang+0.050
 valuable+0.049
b+0.049
cks+0.048
hest+0.047
Neg logit contributions
 nod-0.040
 walk-0.040
 department-0.039
 hug-0.037
 moment-0.037
Token new
Token ID649
Feature activation+1.90e-02

Pos logit contributions
 valuable+0.016
ang+0.016
b+0.015
cks+0.015
hest+0.015
Neg logit contributions
 nod-0.017
 walk-0.017
 department-0.017
 moment-0.016
 hug-0.016
Token coat
Token ID13209
Feature activation-1.31e-01

Pos logit contributions
 valuable+0.001
b+0.001
ang+0.001
cks+0.001
irling+0.001
Neg logit contributions
 department-0.000
 nod-0.000
 walk-0.000
 moment-0.000
 Lee-0.000
Token should
Token ID815
Feature activation-1.10e+00

Pos logit contributions
ang+0.018
 valuable+0.018
b+0.017
irling+0.017
cks+0.017
Neg logit contributions
 department-0.012
 nod-0.012
 walk-0.012
 hug-0.011
 gaze-0.011
Token not
Token ID407
Feature activation-2.70e+00

Pos logit contributions
 department+0.026
pit+0.023
 moment+0.023
 Lee+0.022
atively+0.022
Neg logit contributions
 valuable-0.028
b-0.026
 awesome-0.026
irling-0.026
cks-0.026
Token tease
Token ID34691
Feature activation+3.41e-01

Pos logit contributions
 department+0.043
 nod+0.038
 moment+0.038
 Lee+0.037
pit+0.037
Neg logit contributions
 valuable-0.087
b-0.086
ang-0.085
cks-0.084
hest-0.082
Token the
Token ID262
Feature activation+2.55e-01

Pos logit contributions
 valuable+0.009
irling+0.009
ang+0.009
b+0.009
hest+0.008
Neg logit contributions
 walk-0.007
 nod-0.007
 department-0.007
 Lee-0.007
 hug-0.007
Token ost
Token ID23619
Feature activation+2.52e+00

Pos logit contributions
 valuable+0.006
ang+0.005
irling+0.005
hest+0.005
b+0.005
Neg logit contributions
 department-0.007
 walk-0.007
 nod-0.007
 moment-0.006
 hug-0.006
Tokenric
Token ID1173
Feature activation+6.81e+00

Pos logit contributions
 valuable+0.053
b+0.047
 awesome+0.047
ang+0.047
hest+0.046
Neg logit contributions
 department-0.071
 nod-0.069
 walk-0.068
 Lee-0.065
 moment-0.065
Tokenhes
Token ID956
Feature activation+1.26e-01

Pos logit contributions
 elegant+0.084
 valuable+0.083
irling+0.074
ind+0.072
ang+0.071
Neg logit contributions
 nod-0.231
 department-0.226
 moment-0.223
 walk-0.219
 listen-0.218
Token.
Token ID13
Feature activation-9.07e-04

Pos logit contributions
 valuable+0.004
b+0.003
ang+0.003
cks+0.003
hest+0.003
Neg logit contributions
 department-0.002
 nod-0.002
 walk-0.002
 gaze-0.002
 moment-0.002
Token They
Token ID1119
Feature activation-3.83e-01

Pos logit contributions
 Lee+0.000
 nod+0.000
 department+0.000
 sister+0.000
 gaze+0.000
Neg logit contributions
b-0.000
irling-0.000
 valuable-0.000
hest-0.000
 awesome-0.000
Token are
Token ID389
Feature activation+4.47e+00

Pos logit contributions
 department+0.006
 nod+0.006
 walk+0.006
 moment+0.005
 hug+0.005
Neg logit contributions
 valuable-0.012
ang-0.012
b-0.012
cks-0.012
irling-0.012
Token not
Token ID407
Feature activation+2.96e+00

Pos logit contributions
hest+0.125
ang+0.125
pill+0.120
ink+0.119
irling+0.117
Neg logit contributions
 hug-0.091
 department-0.090
 sister-0.089
 nod-0.088
 walk-0.086
Token you
Token ID345
Feature activation-1.11e+00

Pos logit contributions
 department+0.022
 nod+0.022
 walk+0.021
 moment+0.020
 gaze+0.019
Neg logit contributions
 valuable-0.035
ang-0.034
b-0.034
cks-0.033
irling-0.033
Token count
Token ID954
Feature activation+1.94e+00

Pos logit contributions
 department+0.016
 nod+0.015
 walk+0.014
 moment+0.014
 Lee+0.013
Neg logit contributions
 valuable-0.038
b-0.037
ang-0.037
cks-0.036
hest-0.036
Token how
Token ID703
Feature activation+2.84e+00

Pos logit contributions
 valuable+0.061
irling+0.058
dd+0.058
pill+0.056
hest+0.056
Neg logit contributions
 walk-0.032
 department-0.032
,-0.032
 Lee-0.032
 nod-0.031
Token many
Token ID867
Feature activation+1.14e+00

Pos logit contributions
irling+0.087
cks+0.086
hest+0.085
ang+0.084
b+0.082
Neg logit contributions
 department-0.050
 walk-0.048
 nod-0.046
 listen-0.045
 visit-0.044
Token ost
Token ID23619
Feature activation+2.82e+00

Pos logit contributions
irling+0.030
 valuable+0.029
hest+0.029
 IT+0.028
ula+0.028
Neg logit contributions
 walk-0.025
,-0.023
 nod-0.023
 moment-0.022
 watch-0.022
Tokenric
Token ID1173
Feature activation+6.72e+00

Pos logit contributions
 valuable+0.068
 awesome+0.061
hest+0.060
 elegant+0.059
irling+0.059
Neg logit contributions
 department-0.071
 nod-0.068
 walk-0.068
 gaze-0.066
 Lee-0.064
Tokenhes
Token ID956
Feature activation+1.76e+00

Pos logit contributions
 valuable+0.104
 elegant+0.100
ind+0.098
irling+0.097
ang+0.086
Neg logit contributions
 nod-0.205
 department-0.202
 walk-0.198
 moment-0.197
 listen-0.197
Token there
Token ID612
Feature activation-5.89e-01

Pos logit contributions
 valuable+0.049
ang+0.048
b+0.048
irling+0.047
cks+0.046
Neg logit contributions
 nod-0.035
 department-0.035
 walk-0.034
 moment-0.032
 hug-0.032
Token are
Token ID389
Feature activation+3.70e+00

Pos logit contributions
 nod+0.011
 walk+0.011
 department+0.011
 moment+0.010
 hug+0.010
Neg logit contributions
 valuable-0.017
ang-0.017
b-0.017
irling-0.017
cks-0.017
Token?"
Token ID1701
Feature activation-5.59e-01

Pos logit contributions
ang+0.100
irling+0.097
dd+0.096
 valuable+0.095
hest+0.094
Neg logit contributions
 walk-0.077
 department-0.076
 nod-0.073
 visit-0.071
,-0.070
Token
Token ID198
Feature activation+2.21e+00

Pos logit contributions
 department+0.012
 walk+0.011
 nod+0.011
 ending+0.010
 moment+0.010
Neg logit contributions
ang-0.015
 valuable-0.015
b-0.015
cks-0.014
pill-0.014
Token but
Token ID475
Feature activation-2.40e+00

Pos logit contributions
 department+0.074
 nod+0.074
 walk+0.073
 moment+0.069
 gaze+0.068
Neg logit contributions
 valuable-0.062
ang-0.061
b-0.061
cks-0.058
irling-0.057
Token be
Token ID307
Feature activation+3.80e+00

Pos logit contributions
 department+0.055
 nod+0.051
 walk+0.049
ously+0.048
pit+0.048
Neg logit contributions
 valuable-0.061
b-0.061
 awesome-0.056
ang-0.056
 amazing-0.056
Token careful
Token ID8161
Feature activation-3.22e-01

Pos logit contributions
pill+0.102
hest+0.095
iky+0.094
ivating+0.092
ula+0.087
Neg logit contributions
 nod-0.093
,-0.083
 hug-0.082
 smile-0.080
 department-0.080
Token.
Token ID13
Feature activation+4.45e-01

Pos logit contributions
 department+0.006
 nod+0.006
 walk+0.006
 gaze+0.006
 moment+0.005
Neg logit contributions
 valuable-0.009
b-0.009
ang-0.009
cks-0.009
irling-0.009
Token Ost
Token ID38919
Feature activation+2.01e+00

Pos logit contributions
b+0.013
 valuable+0.012
irling+0.012
ang+0.012
hest+0.012
Neg logit contributions
 nod-0.009
 department-0.009
 Lee-0.009
 walk-0.009
 gaze-0.008
Tokenric
Token ID1173
Feature activation+6.71e+00

Pos logit contributions
 valuable+0.047
ang+0.042
 elegant+0.041
b+0.041
irling+0.041
Neg logit contributions
 nod-0.052
 walk-0.051
 department-0.051
 moment-0.050
 gaze-0.050
Tokenhes
Token ID956
Feature activation+1.34e+00

Pos logit contributions
 elegant+0.092
 valuable+0.089
ind+0.083
irling+0.081
ang+0.080
Neg logit contributions
 nod-0.224
 moment-0.211
 listen-0.210
 department-0.209
 watch-0.206
Token can
Token ID460
Feature activation+1.60e-01

Pos logit contributions
 valuable+0.040
ang+0.040
b+0.039
irling+0.038
cks+0.038
Neg logit contributions
 nod-0.024
 department-0.024
 walk-0.023
 moment-0.022
 hug-0.022
Token be
Token ID307
Feature activation+4.52e+00

Pos logit contributions
 valuable+0.006
irling+0.005
ang+0.005
years+0.005
pill+0.005
Neg logit contributions
 walk-0.003
 nod-0.002
 hug-0.002
 visit-0.002
 listen-0.002
Token gr
Token ID1036
Feature activation+3.26e+00

Pos logit contributions
hest+0.105
pill+0.101
irling+0.099
ula+0.096
iky+0.095
Neg logit contributions
 walk-0.113
 hug-0.110
 nod-0.108
 department-0.105
 smile-0.104
Tokenumpy
Token ID32152
Feature activation+4.60e-01

Pos logit contributions
 valuable+0.073
 awesome+0.071
 elegant+0.070
 amazing+0.067
irling+0.066
Neg logit contributions
 nod-0.078
 department-0.075
 walk-0.073
 team-0.072
 behavior-0.070
Token something
Token ID1223
Feature activation+1.95e+00

Pos logit contributions
 valuable+0.030
b+0.030
ang+0.030
cks+0.028
irling+0.028
Neg logit contributions
 department-0.029
 nod-0.029
 walk-0.029
 moment-0.027
 gaze-0.027
Token new
Token ID649
Feature activation+1.06e+00

Pos logit contributions
ang+0.040
cks+0.038
ind+0.037
irling+0.037
b+0.036
Neg logit contributions
 walk-0.054
 nod-0.054
 department-0.053
 moment-0.052
 hug-0.051
Token-
Token ID12
Feature activation+4.23e+00

Pos logit contributions
 valuable+0.029
b+0.029
ang+0.028
 amazing+0.027
cks+0.026
Neg logit contributions
 department-0.023
 nod-0.023
 walk-0.021
 Lee-0.020
 gaze-0.020
Token there
Token ID612
Feature activation-5.70e-01

Pos logit contributions
ang+0.074
 valuable+0.073
irling+0.072
cks+0.072
 awesome+0.069
Neg logit contributions
 walk-0.122
 moment-0.122
 nod-0.121
 department-0.121
 hug-0.114
Token were
Token ID547
Feature activation+3.19e+00

Pos logit contributions
 department+0.011
 nod+0.010
 walk+0.010
 Lee+0.010
 sister+0.009
Neg logit contributions
 valuable-0.017
b-0.017
ang-0.016
cks-0.016
hest-0.016
Token tad
Token ID36264
Feature activation+6.69e+00

Pos logit contributions
ang+0.075
 valuable+0.073
b+0.072
cks+0.070
irling+0.069
Neg logit contributions
 department-0.080
 nod-0.077
 walk-0.077
 moment-0.074
 hug-0.072
Tokenp
Token ID79
Feature activation+3.96e+00

Pos logit contributions
irling+0.126
ang+0.121
irl+0.116
 valuable+0.115
ink+0.114
Neg logit contributions
 nod-0.175
 department-0.173
 walk-0.167
ously-0.164
 stop-0.161
Tokenoles
Token ID4316
Feature activation+1.39e+00

Pos logit contributions
 valuable+0.090
ang+0.087
b+0.085
 awesome+0.082
cks+0.082
Neg logit contributions
 department-0.100
 walk-0.099
 nod-0.099
 moment-0.093
 hug-0.092
Token swimming
Token ID14899
Feature activation+2.40e+00

Pos logit contributions
 valuable+0.036
b+0.036
ang+0.036
cks+0.034
irling+0.034
Neg logit contributions
 department-0.031
 nod-0.030
 walk-0.029
 moment-0.028
 Lee-0.028
Token around
Token ID1088
Feature activation+2.50e-01

Pos logit contributions
 valuable+0.061
ang+0.060
b+0.059
cks+0.058
irling+0.058
Neg logit contributions
 department-0.055
 walk-0.053
 nod-0.053
 moment-0.051
 hug-0.049
Token in
Token ID287
Feature activation-1.44e-01

Pos logit contributions
 valuable+0.007
b+0.007
ang+0.007
cks+0.007
irling+0.006
Neg logit contributions
 department-0.005
 nod-0.005
 walk-0.005
 gaze-0.005
 Lee-0.005
Token his
Token ID465
Feature activation-6.04e-01

Pos logit contributions
ang+0.040
 valuable+0.039
b+0.038
pill+0.038
hest+0.037
Neg logit contributions
 nod-0.028
 walk-0.027
 department-0.026
 moment-0.025
 hug-0.025
Token eyes
Token ID2951
Feature activation+5.56e-01

Pos logit contributions
 walk+0.012
 nod+0.012
 department+0.012
 hug+0.011
 gaze+0.011
Neg logit contributions
ang-0.017
 valuable-0.017
hest-0.017
b-0.016
cks-0.016
Token and
Token ID290
Feature activation+4.96e-01

Pos logit contributions
 valuable+0.017
b+0.017
ang+0.017
cks+0.016
hest+0.016
Neg logit contributions
 department-0.009
 nod-0.009
 walk-0.009
 moment-0.008
 Lee-0.008
Token smiled
Token ID13541
Feature activation+3.87e-01

Pos logit contributions
b+0.014
 valuable+0.014
ang+0.014
cks+0.013
irling+0.013
Neg logit contributions
 department-0.010
 nod-0.010
 walk-0.009
 Lee-0.009
 moment-0.009
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
 valuable+0.011
b+0.011
ang+0.011
 awesome+0.010
irling+0.010
Neg logit contributions
 department-0.008
 walk-0.007
 nod-0.007
 moment-0.007
 Lee-0.007
Token Pret
Token ID37123
Feature activation+6.52e+00

Pos logit contributions
b+0.007
ang+0.007
 valuable+0.007
cks+0.007
irling+0.006
Neg logit contributions
 nod-0.004
 department-0.004
 walk-0.004
 Lee-0.003
 moment-0.003
Tokenending
Token ID1571
Feature activation+3.59e-02

Pos logit contributions
 valuable+0.160
 amazing+0.154
 impressive+0.153
 prett+0.153
 awesome+0.150
Neg logit contributions
ously-0.141
 department-0.133
eman-0.130
ue-0.126
gy-0.125
Token was
Token ID373
Feature activation+3.58e+00

Pos logit contributions
 valuable+0.001
b+0.001
ang+0.001
cks+0.001
irling+0.001
Neg logit contributions
 department-0.001
 nod-0.001
 walk-0.001
 gaze-0.001
 Lee-0.001
Token so
Token ID523
Feature activation+3.58e+00

Pos logit contributions
ang+0.082
 valuable+0.076
b+0.075
cks+0.075
pill+0.074
Neg logit contributions
 department-0.093
 nod-0.093
 walk-0.091
 moment-0.088
 hug-0.087
Token much
Token ID881
Feature activation+1.40e+00

Pos logit contributions
ang+0.065
cks+0.063
hest+0.060
ind+0.059
ender+0.059
Neg logit contributions
 nod-0.111
 smile-0.106
 walk-0.106
 department-0.105
 hug-0.104
Token fun
Token ID1257
Feature activation+4.28e-01

Pos logit contributions
 valuable+0.024
b+0.023
ang+0.023
cks+0.022
irling+0.021
Neg logit contributions
 department-0.044
 nod-0.043
 walk-0.043
 Lee-0.041
 moment-0.041
TokenLook
Token ID8567
Feature activation-1.93e-01

Pos logit contributions
 valuable+0.030
irling+0.029
ang+0.028
b+0.028
cks+0.028
Neg logit contributions
 walk-0.023
 nod-0.023
 department-0.022
 moment-0.022
 gaze-0.021
Token,
Token ID11
Feature activation-2.00e+00

Pos logit contributions
 department+0.002
 nod+0.002
 walk+0.002
 moment+0.002
 gaze+0.002
Neg logit contributions
ang-0.007
 valuable-0.007
b-0.007
cks-0.007
 awesome-0.007
Token mom
Token ID1995
Feature activation-1.78e+00

Pos logit contributions
 department+0.042
 moment+0.041
ously+0.041
 nod+0.041
 walk+0.040
Neg logit contributions
 awesome-0.051
 amazing-0.050
 valuable-0.050
 impressive-0.048
ang-0.047
Token,
Token ID11
Feature activation-1.99e+00

Pos logit contributions
 nod+0.028
 walk+0.027
 gaze+0.026
 department+0.025
atively+0.024
Neg logit contributions
ang-0.058
b-0.058
ink-0.055
 valuable-0.055
 awesome-0.054
Token ost
Token ID23619
Feature activation+2.55e+00

Pos logit contributions
 department+0.048
 nod+0.047
 moment+0.045
 walk+0.045
ously+0.044
Neg logit contributions
 valuable-0.047
 awesome-0.045
 amazing-0.044
b-0.043
 impressive-0.043
Tokenric
Token ID1173
Feature activation+6.50e+00

Pos logit contributions
 valuable+0.054
ang+0.048
b+0.048
hest+0.048
 awesome+0.047
Neg logit contributions
 department-0.071
 nod-0.070
 walk-0.069
 gaze-0.066
 Lee-0.066
Tokenhes
Token ID956
Feature activation+1.63e+00

Pos logit contributions
 valuable+0.086
ang+0.082
 elegant+0.079
irling+0.076
ind+0.074
Neg logit contributions
 nod-0.217
 department-0.210
 woke-0.205
 moment-0.204
 came-0.202
Token!"
Token ID2474
Feature activation-1.36e+00

Pos logit contributions
 valuable+0.050
ang+0.049
b+0.048
irling+0.047
pill+0.046
Neg logit contributions
 nod-0.029
 department-0.028
 walk-0.028
 hug-0.027
 moment-0.026
Token
Token ID198
Feature activation+2.49e+00

Pos logit contributions
 department+0.034
 walk+0.031
 guy+0.031
 moment+0.031
pit+0.030
Neg logit contributions
hes-0.029
cks-0.029
ang-0.028
 valuable-0.028
ink-0.027
Token
Token ID198
Feature activation+1.88e+00

Pos logit contributions
 valuable+0.077
b+0.076
ang+0.075
pill+0.075
 awesome+0.075
Neg logit contributions
 nod-0.043
 walk-0.041
 moment-0.039
 department-0.039
 hug-0.038
Token"
Token ID1
Feature activation+1.11e+00

Pos logit contributions
 valuable+0.043
b+0.042
ang+0.042
cks+0.040
hest+0.040
Neg logit contributions
 nod-0.048
 department-0.048
 walk-0.048
 moment-0.044
 gaze-0.044
TokenOne
Token ID3198
Feature activation-1.54e+00

Pos logit contributions
cks+0.068
 valuable+0.064
ang+0.064
b+0.063
irling+0.062
Neg logit contributions
 walk-0.057
 nod-0.057
 moment-0.056
 department-0.055
 visit-0.052
Token day
Token ID1110
Feature activation-1.94e+00

Pos logit contributions
 department+0.030
 nod+0.028
 walk+0.027
 Lee+0.027
 hug+0.026
Neg logit contributions
 valuable-0.046
ang-0.045
b-0.044
 awesome-0.042
 amazing-0.041
Token,
Token ID11
Feature activation-1.46e+00

Pos logit contributions
 department+0.028
 nod+0.026
 walk+0.025
pit+0.024
 gaze+0.024
Neg logit contributions
b-0.064
 valuable-0.063
ang-0.062
dd-0.061
 amazing-0.061
Token a
Token ID257
Feature activation-3.50e-01

Pos logit contributions
 department+0.033
 nod+0.033
 walk+0.031
 gaze+0.030
 hug+0.030
Neg logit contributions
b-0.037
 valuable-0.037
ang-0.037
 awesome-0.034
 amazing-0.034
Token bird
Token ID6512
Feature activation+1.16e-01

Pos logit contributions
 department+0.009
 nod+0.009
 walk+0.009
 Lee+0.008
 gaze+0.008
Neg logit contributions
 valuable-0.008
ang-0.008
b-0.008
 awesome-0.007
 amazing-0.007
Token al
Token ID435
Feature activation+6.49e+00

Pos logit contributions
b+0.004
ang+0.003
cks+0.003
pill+0.003
irling+0.003
Neg logit contributions
 department-0.002
 Lee-0.002
 nod-0.002
 ending-0.002
 sister-0.002
Tokenight
Token ID432
Feature activation+6.93e-01

Pos logit contributions
 valuable+0.138
 awesome+0.134
 amazing+0.127
 elegant+0.127
cks+0.123
Neg logit contributions
 nod-0.163
 department-0.162
 walk-0.162
 moment-0.159
pit-0.153
Tokened
Token ID276
Feature activation+6.67e-01

Pos logit contributions
b+0.015
ang+0.015
 valuable+0.015
cks+0.015
irling+0.014
Neg logit contributions
 department-0.018
 nod-0.018
 walk-0.017
 moment-0.016
 Lee-0.016
Token on
Token ID319
Feature activation-4.88e-01

Pos logit contributions
b+0.017
ang+0.016
 valuable+0.016
cks+0.016
ind+0.015
Neg logit contributions
 department-0.016
 nod-0.015
 walk-0.015
 gaze-0.014
 moment-0.014
Token his
Token ID465
Feature activation-2.62e+00

Pos logit contributions
 walk+0.011
 nod+0.010
 department+0.010
 moment+0.010
 Lee+0.010
Neg logit contributions
 valuable-0.013
ang-0.013
b-0.013
irling-0.012
hest-0.012
Token windows
Token ID9168
Feature activation-1.45e-01

Pos logit contributions
 department+0.051
 nod+0.050
 Lee+0.047
 moment+0.044
 walk+0.042
Neg logit contributions
 valuable-0.077
 amazing-0.072
b-0.072
 awesome-0.072
 impressive-0.071
Token too
Token ID1165
Feature activation-8.33e-01

Pos logit contributions
 nod+0.026
 department+0.026
 walk+0.025
 moment+0.024
 gaze+0.024
Neg logit contributions
ang-0.033
b-0.033
 valuable-0.033
cks-0.032
 awesome-0.031
Token close
Token ID1969
Feature activation-2.89e-01

Pos logit contributions
 department+0.023
 walk+0.023
 nod+0.023
 moment+0.022
 hug+0.021
Neg logit contributions
 valuable-0.017
b-0.017
ang-0.017
cks-0.016
irling-0.016
Token.
Token ID13
Feature activation-9.23e-01

Pos logit contributions
 nod+0.005
 department+0.005
 walk+0.005
 gaze+0.005
 moment+0.004
Neg logit contributions
ang-0.009
b-0.009
 valuable-0.009
cks-0.008
 awesome-0.008
Token The
Token ID383
Feature activation-2.62e-01

Pos logit contributions
 department+0.021
 nod+0.021
 walk+0.021
 moment+0.019
 gaze+0.019
Neg logit contributions
 valuable-0.023
b-0.023
ang-0.023
cks-0.022
irling-0.022
Token ost
Token ID23619
Feature activation+2.16e+00

Pos logit contributions
 walk+0.005
 department+0.005
 nod+0.005
 moment+0.005
 hug+0.005
Neg logit contributions
b-0.008
 valuable-0.008
ang-0.008
irling-0.007
cks-0.007
Tokenric
Token ID1173
Feature activation+6.47e+00

Pos logit contributions
 valuable+0.044
b+0.040
ang+0.040
 awesome+0.039
hest+0.038
Neg logit contributions
 department-0.062
 nod-0.061
 walk-0.060
 gaze-0.057
 moment-0.057
Tokenhes
Token ID956
Feature activation+2.62e-01

Pos logit contributions
 valuable+0.081
 elegant+0.077
ind+0.071
irling+0.069
ang+0.066
Neg logit contributions
 nod-0.223
 department-0.220
 moment-0.215
 walk-0.211
 woke-0.210
Token might
Token ID1244
Feature activation-1.22e+00

Pos logit contributions
 valuable+0.007
ang+0.007
b+0.007
cks+0.007
hest+0.007
Neg logit contributions
 department-0.005
 nod-0.005
 walk-0.005
 gaze-0.005
 moment-0.005
Token pe
Token ID613
Feature activation+4.23e+00

Pos logit contributions
 walk+0.016
 nod+0.016
 department+0.015
 hug+0.014
 moment+0.014
Neg logit contributions
 valuable-0.043
ang-0.043
b-0.042
irling-0.041
pill-0.041
Tokenck
Token ID694
Feature activation+2.24e+00

Pos logit contributions
 valuable+0.073
hest+0.058
pill+0.058
 awesome+0.057
 elegant+0.057
Neg logit contributions
 walk-0.135
 nod-0.133
 department-0.129
 gaze-0.128
 listen-0.124
Token you
Token ID345
Feature activation+9.92e-01

Pos logit contributions
 valuable+0.060
ang+0.059
b+0.058
irling+0.057
cks+0.056
Neg logit contributions
 department-0.047
 nod-0.047
 walk-0.046
 moment-0.044
 hug-0.043
Token with
Token ID351
Feature activation+2.14e+00

Pos logit contributions
 nod+0.001
 walk+0.001
 department+0.001
 gaze+0.001
 moment+0.001
Neg logit contributions
 valuable-0.001
 awesome-0.001
ang-0.001
b-0.001
 amazing-0.001
Token your
Token ID534
Feature activation+6.75e-01

Pos logit contributions
ang+0.063
hest+0.061
iky+0.060
b+0.060
pill+0.060
Neg logit contributions
 hug-0.042
,-0.038
 Lee-0.038
 smile-0.037
 walk-0.037
Token new
Token ID649
Feature activation+2.24e-01

Pos logit contributions
 valuable+0.015
ang+0.015
b+0.015
cks+0.015
hest+0.014
Neg logit contributions
 department-0.017
 nod-0.017
 walk-0.017
 moment-0.016
 hug-0.016
Token coat
Token ID13209
Feature activation+7.01e-01

Pos logit contributions
 valuable+0.007
b+0.007
ang+0.007
irling+0.007
 awesome+0.007
Neg logit contributions
 department-0.004
 nod-0.004
 walk-0.004
 moment-0.003
 Lee-0.003
Token."
Token ID526
Feature activation+6.94e-01

Pos logit contributions
 valuable+0.017
b+0.016
ang+0.016
hest+0.016
 awesome+0.015
Neg logit contributions
 nod-0.017
 department-0.017
 walk-0.016
 gaze-0.016
 moment-0.016
Token Sil
Token ID4243
Feature activation+6.45e+00

Pos logit contributions
 valuable+0.019
ang+0.019
b+0.018
cks+0.018
irling+0.018
Neg logit contributions
 department-0.015
 nod-0.014
 walk-0.014
 moment-0.013
 gaze-0.013
Tokenvia
Token ID8869
Feature activation+1.79e+00

Pos logit contributions
 awesome+0.144
 valuable+0.141
pill+0.137
irling+0.129
 amazing+0.129
Neg logit contributions
 department-0.145
 nod-0.141
 moment-0.140
ously-0.139
 sister-0.137
Token smiled
Token ID13541
Feature activation-6.05e-01

Pos logit contributions
 valuable+0.059
ang+0.059
b+0.058
cks+0.057
irling+0.056
Neg logit contributions
 department-0.027
 nod-0.026
 walk-0.026
 moment-0.023
 hug-0.023
Token and
Token ID290
Feature activation+8.17e-01

Pos logit contributions
 department+0.013
 walk+0.012
 Lee+0.011
 nod+0.011
 moment+0.011
Neg logit contributions
 valuable-0.017
b-0.015
ang-0.015
 amazing-0.015
 impressive-0.015
Token said
Token ID531
Feature activation+4.87e-01

Pos logit contributions
 valuable+0.024
ang+0.024
b+0.024
cks+0.023
hest+0.023
Neg logit contributions
 nod-0.015
 walk-0.015
 department-0.015
 hug-0.014
 gaze-0.014
Token,
Token ID11
Feature activation-2.56e-01

Pos logit contributions
 valuable+0.018
b+0.017
ang+0.016
irling+0.016
 amazing+0.016
Neg logit contributions
 department-0.006
 walk-0.006
 garage-0.005
 nod-0.005
pit-0.005
Token,
Token ID11
Feature activation-3.33e+00

Pos logit contributions
 department+0.039
 nod+0.038
pit+0.034
 walk+0.033
 gaze+0.031
Neg logit contributions
 valuable-0.082
b-0.079
ang-0.079
 much-0.078
dd-0.076
Token Lily
Token ID20037
Feature activation-7.52e-01

Pos logit contributions
 department+0.084
 nod+0.082
 walk+0.077
ously+0.076
pit+0.076
Neg logit contributions
 valuable-0.074
 amazing-0.071
 impressive-0.070
 awesome-0.070
 colourful-0.069
Token was
Token ID373
Feature activation-1.86e-01

Pos logit contributions
 department+0.013
 nod+0.013
 Lee+0.012
pit+0.011
 moment+0.011
Neg logit contributions
 valuable-0.023
b-0.023
ang-0.022
ind-0.022
ink-0.021
Token getting
Token ID1972
Feature activation+1.11e+00

Pos logit contributions
 department+0.005
 nod+0.005
 walk+0.005
 moment+0.004
 Lee+0.004
Neg logit contributions
 valuable-0.004
ang-0.004
b-0.004
cks-0.004
irling-0.004
Token ready
Token ID3492
Feature activation-2.41e+00

Pos logit contributions
ang+0.026
cks+0.026
pill+0.025
hest+0.025
 valuable+0.025
Neg logit contributions
 walk-0.029
 department-0.028
 nod-0.028
 hug-0.027
 moment-0.026
Token to
Token ID284
Feature activation-5.52e+00

Pos logit contributions
 department+0.056
 nod+0.054
pit+0.050
 Lee+0.050
 walk+0.049
Neg logit contributions
 valuable-0.062
b-0.057
 amazing-0.056
 awesome-0.055
 impressive-0.054
Token sleep
Token ID3993
Feature activation-1.36e+00

Pos logit contributions
 department+0.107
pit+0.098
 Lee+0.095
 moment+0.094
ously+0.093
Neg logit contributions
 valuable-0.156
 amazing-0.154
cks-0.150
 awesome-0.149
 impressive-0.149
Token when
Token ID618
Feature activation-2.54e+00

Pos logit contributions
 department+0.032
 nod+0.031
 walk+0.028
 gaze+0.028
 hug+0.028
Neg logit contributions
 valuable-0.034
irling-0.032
b-0.032
ang-0.032
ind-0.032
Token she
Token ID673
Feature activation-1.06e+00

Pos logit contributions
 department+0.061
 nod+0.058
pit+0.053
ously+0.050
 hug+0.049
Neg logit contributions
 valuable-0.062
 amazing-0.061
 much-0.059
irling-0.059
 awesome-0.059
Token heard
Token ID2982
Feature activation-5.78e-01

Pos logit contributions
 department+0.014
 nod+0.013
 Lee+0.012
pit+0.010
 Johnson+0.010
Neg logit contributions
b-0.037
 valuable-0.036
ang-0.035
ink-0.035
 amazing-0.034
Token a
Token ID257
Feature activation-1.73e+00

Pos logit contributions
 department+0.015
 nod+0.015
 walk+0.015
 moment+0.014
 hug+0.014
Neg logit contributions
 valuable-0.013
ang-0.013
b-0.013
cks-0.012
irling-0.012
Token little
Token ID1310
Feature activation-1.59e+00

Pos logit contributions
 department+0.043
 nod+0.040
 walk+0.038
 Lee+0.038
 gaze+0.036
Neg logit contributions
 valuable-0.088
ang-0.087
b-0.085
 amazing-0.083
 awesome-0.083
Token girl
Token ID2576
Feature activation-5.10e-01

Pos logit contributions
 department+0.023
 walk+0.021
 nod+0.020
 gaze+0.020
 ending+0.020
Neg logit contributions
ang-0.051
 valuable-0.050
b-0.050
 colourful-0.049
 elegant-0.048
Token appeared
Token ID4120
Feature activation-1.82e-02

Pos logit contributions
 department+0.009
 nod+0.009
 moment+0.009
 ending+0.008
 walk+0.008
Neg logit contributions
b-0.015
ang-0.015
 valuable-0.014
pill-0.014
irling-0.014
Token on
Token ID319
Feature activation-9.26e-01

Pos logit contributions
 department+0.000
 nod+0.000
 walk+0.000
 gaze+0.000
 sister+0.000
Neg logit contributions
 valuable-0.000
ang-0.000
b-0.000
irling-0.000
 amazing-0.000
Token the
Token ID262
Feature activation-2.83e+00

Pos logit contributions
 department+0.021
 nod+0.021
 walk+0.021
 moment+0.020
 Lee+0.019
Neg logit contributions
 valuable-0.023
ang-0.023
b-0.023
cks-0.022
irling-0.022
Token other
Token ID584
Feature activation-5.46e+00

Pos logit contributions
 department+0.067
 nod+0.066
 Lee+0.062
 walk+0.059
 sister+0.059
Neg logit contributions
 valuable-0.073
hest-0.069
 amazing-0.068
b-0.068
 awesome-0.068
Token side
Token ID1735
Feature activation-1.30e+00

Pos logit contributions
 department+0.086
 nod+0.078
 Lee+0.068
 sister+0.065
 kiss+0.063
Neg logit contributions
 amazing-0.176
hest-0.174
ang-0.173
 valuable-0.172
 awesome-0.169
Token of
Token ID286
Feature activation-4.06e-01

Pos logit contributions
 department+0.041
 nod+0.040
 walk+0.038
 gaze+0.037
 sister+0.037
Neg logit contributions
ang-0.023
b-0.021
 valuable-0.021
hest-0.021
 amazing-0.019
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
 department+0.010
 nod+0.010
 walk+0.009
 gaze+0.009
 moment+0.009
Neg logit contributions
 valuable-0.010
ang-0.010
b-0.010
cks-0.009
irling-0.009
Token valley
Token ID19272
Feature activation-1.03e+00

Pos logit contributions
 department+0.027
 nod+0.027
 walk+0.025
 Lee+0.025
 gaze+0.024
Neg logit contributions
 valuable-0.038
ang-0.037
b-0.037
hest-0.035
 awesome-0.035
Token.
Token ID13
Feature activation+1.79e-01

Pos logit contributions
 department+0.023
 nod+0.023
 sister+0.022
 Lee+0.021
 walk+0.021
Neg logit contributions
b-0.026
 valuable-0.026
ang-0.026
hest-0.024
 amazing-0.024
Token tired
Token ID10032
Feature activation-3.22e+00

Pos logit contributions
cks+0.045
ang+0.045
pill+0.042
hest+0.042
b+0.042
Neg logit contributions
 walk-0.047
 nod-0.047
 sister-0.045
 hug-0.044
 gaze-0.044
Token.
Token ID13
Feature activation-1.83e+00

Pos logit contributions
 department+0.079
 nod+0.075
 Lee+0.071
 gaze+0.070
pit+0.069
Neg logit contributions
 valuable-0.077
hest-0.073
 amazing-0.072
b-0.071
 much-0.069
Token He
Token ID679
Feature activation-1.09e+00

Pos logit contributions
 department+0.047
 walk+0.046
 nod+0.044
 party+0.043
 moment+0.042
Neg logit contributions
 valuable-0.041
 impressive-0.036
b-0.035
 awesome-0.035
 amazing-0.035
Token was
Token ID373
Feature activation+1.33e-01

Pos logit contributions
 nod+0.016
 walk+0.016
 department+0.016
 moment+0.014
 hug+0.014
Neg logit contributions
ang-0.036
 valuable-0.036
b-0.036
cks-0.035
irling-0.035
Token ready
Token ID3492
Feature activation-2.07e+00

Pos logit contributions
 valuable+0.003
b+0.003
ang+0.003
irling+0.003
cks+0.003
Neg logit contributions
 department-0.003
 nod-0.003
 walk-0.003
 moment-0.003
 Lee-0.003
Token to
Token ID284
Feature activation-5.45e+00

Pos logit contributions
 department+0.047
 nod+0.045
pit+0.043
 Lee+0.042
 Driver+0.040
Neg logit contributions
 valuable-0.056
 amazing-0.052
b-0.051
 awesome-0.050
 important-0.050
Token go
Token ID467
Feature activation-3.17e+00

Pos logit contributions
 department+0.122
pit+0.111
 Lee+0.108
ously+0.104
 Johnson+0.097
Neg logit contributions
 amazing-0.142
 valuable-0.140
hest-0.136
 awesome-0.134
 impressive-0.133
Token to
Token ID284
Feature activation-4.25e+00

Pos logit contributions
 department+0.063
 nod+0.060
 Lee+0.054
 moment+0.054
 walk+0.053
Neg logit contributions
 valuable-0.091
b-0.090
ang-0.088
 awesome-0.085
hest-0.085
Token sleep
Token ID3993
Feature activation-1.41e+00

Pos logit contributions
 department+0.091
 nod+0.086
 walk+0.079
 moment+0.078
 Lee+0.077
Neg logit contributions
 valuable-0.116
b-0.112
ang-0.111
 awesome-0.109
cks-0.109
Token but
Token ID475
Feature activation-1.45e+00

Pos logit contributions
 department+0.028
 nod+0.027
 walk+0.026
 moment+0.024
 gaze+0.024
Neg logit contributions
 valuable-0.040
b-0.039
ang-0.039
irling-0.038
cks-0.038
Token he
Token ID339
Feature activation-1.52e-01

Pos logit contributions
 department+0.031
 nod+0.028
 Lee+0.027
 walk+0.026
pit+0.025
Neg logit contributions
 valuable-0.042
b-0.040
ang-0.038
 amazing-0.038
 awesome-0.037
Token,
Token ID11
Feature activation-2.12e+00

Pos logit contributions
 department+0.047
 spine+0.046
pit+0.044
 doorstep+0.042
 reaction+0.042
Neg logit contributions
 much-0.074
 amazing-0.072
 awesome-0.072
b-0.071
 valuable-0.071
Token something
Token ID1223
Feature activation+5.29e-01

Pos logit contributions
ously+0.060
 nod+0.060
 department+0.060
 spine+0.059
pit+0.059
Neg logit contributions
 awesome-0.038
 amazing-0.037
 much-0.035
 valuable-0.034
 impressive-0.034
Token happens
Token ID4325
Feature activation-8.96e-01

Pos logit contributions
 valuable+0.010
ang+0.010
b+0.010
cks+0.009
irling+0.009
Neg logit contributions
 department-0.015
 nod-0.015
 walk-0.015
 moment-0.014
 hug-0.014
Token.
Token ID13
Feature activation-1.76e+00

Pos logit contributions
 department+0.018
 walk+0.018
 nod+0.018
 gaze+0.016
 Lee+0.016
Neg logit contributions
b-0.025
 valuable-0.025
cks-0.024
ang-0.024
irling-0.023
Token The
Token ID383
Feature activation-1.47e+00

Pos logit contributions
 department+0.035
 nod+0.035
 walk+0.034
 moment+0.032
 gaze+0.032
Neg logit contributions
 valuable-0.049
ang-0.049
b-0.049
cks-0.047
irling-0.046
Token angry
Token ID7954
Feature activation-5.35e+00

Pos logit contributions
 department+0.029
 gaze+0.025
 Lee+0.024
ously+0.024
gy+0.024
Neg logit contributions
 valuable-0.044
 awesome-0.041
 impressive-0.041
 amazing-0.040
 annoying-0.040
Token man
Token ID582
Feature activation-7.79e-01

Pos logit contributions
 department+0.125
days+0.103
pit+0.101
comb+0.100
 walk+0.099
Neg logit contributions
 impressive-0.126
 annoying-0.124
aw-0.122
b-0.122
ang-0.121
Token starts
Token ID4940
Feature activation-7.84e-01

Pos logit contributions
 department+0.015
 nod+0.015
 walk+0.015
 moment+0.014
 Lee+0.014
Neg logit contributions
b-0.022
cks-0.022
ang-0.022
 valuable-0.022
irling-0.020
Token to
Token ID284
Feature activation-3.25e+00

Pos logit contributions
 department+0.014
 nod+0.013
 walk+0.012
pit+0.012
 gaze+0.012
Neg logit contributions
 valuable-0.024
b-0.024
ang-0.023
cks-0.023
irling-0.022
Token cough
Token ID22094
Feature activation-2.18e+00

Pos logit contributions
 department+0.067
pit+0.061
ously+0.059
 ending+0.055
atively+0.055
Neg logit contributions
cks-0.094
 annoying-0.089
b-0.088
 amazing-0.086
 awesome-0.084
Token.
Token ID13
Feature activation-1.95e+00

Pos logit contributions
 department+0.038
 nod+0.038
 walk+0.037
 moment+0.034
 gaze+0.034
Neg logit contributions
 valuable-0.066
ang-0.065
b-0.065
cks-0.063
hest-0.062
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
 department+0.028
 nod+0.028
 walk+0.026
 gaze+0.025
 hug+0.025
Neg logit contributions
ang-0.026
 valuable-0.026
b-0.025
cks-0.024
 awesome-0.024
Token hill
Token ID12788
Feature activation-1.40e+00

Pos logit contributions
 department+0.011
 nod+0.010
 walk+0.010
 moment+0.009
 gaze+0.008
Neg logit contributions
 valuable-0.043
ang-0.042
b-0.042
cks-0.041
irling-0.041
Token and
Token ID290
Feature activation-2.04e+00

Pos logit contributions
 nod+0.032
 department+0.032
 gaze+0.031
 walk+0.030
 sister+0.029
Neg logit contributions
ang-0.035
 valuable-0.034
b-0.033
hest-0.031
 amazing-0.030
Token got
Token ID1392
Feature activation+1.15e+00

Pos logit contributions
 department+0.044
 nod+0.039
 ending+0.039
 behavior+0.038
ously+0.037
Neg logit contributions
b-0.055
ang-0.053
 valuable-0.050
cks-0.049
 amazing-0.048
Token ready
Token ID3492
Feature activation-1.52e+00

Pos logit contributions
 valuable+0.030
ang+0.029
b+0.029
cks+0.028
hest+0.028
Neg logit contributions
 department-0.026
 nod-0.026
 walk-0.026
 moment-0.024
 hug-0.024
Token to
Token ID284
Feature activation-5.30e+00

Pos logit contributions
 department+0.024
 nod+0.024
 walk+0.023
 moment+0.021
 gaze+0.021
Neg logit contributions
 valuable-0.049
b-0.048
ang-0.048
cks-0.046
irling-0.046
Token go
Token ID467
Feature activation-2.66e+00

Pos logit contributions
 department+0.082
 nod+0.069
 moment+0.067
 Lee+0.064
 ending+0.064
Neg logit contributions
 valuable-0.173
b-0.172
ang-0.172
cks-0.171
 awesome-0.170
Token down
Token ID866
Feature activation-2.15e+00

Pos logit contributions
 department+0.052
 nod+0.051
 walk+0.048
 moment+0.045
 gaze+0.045
Neg logit contributions
ang-0.077
 valuable-0.077
b-0.077
cks-0.073
 awesome-0.072
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
 department+0.044
 nod+0.043
 walk+0.042
 gaze+0.039
 moment+0.039
Neg logit contributions
 valuable-0.060
ang-0.059
b-0.058
cks-0.056
 awesome-0.056
Token
Token ID198
Feature activation+2.58e+00

Pos logit contributions
 nod+0.020
 walk+0.019
 department+0.019
 Lee+0.018
 hug+0.018
Neg logit contributions
b-0.031
 valuable-0.031
ang-0.030
irling-0.029
cks-0.029
Token
Token ID198
Feature activation+2.01e+00

Pos logit contributions
 elegant+0.090
b+0.090
years+0.090
ang+0.090
 valuable+0.089
Neg logit contributions
 walk-0.034
 nod-0.034
 moment-0.033
 visit-0.030
 hug-0.030
Token were
Token ID547
Feature activation+9.95e-01

Pos logit contributions
 department+0.026
 nod+0.022
 moment+0.022
 spine+0.021
 Lee+0.021
Neg logit contributions
b-0.029
cks-0.029
ang-0.029
ink-0.028
 valuable-0.028
Token hungry
Token ID14720
Feature activation-3.96e+00

Pos logit contributions
 valuable+0.023
b+0.023
ang+0.023
cks+0.022
hest+0.021
Neg logit contributions
 department-0.025
 nod-0.025
 walk-0.024
 moment-0.023
 hug-0.023
Token.
Token ID13
Feature activation-1.74e+00

Pos logit contributions
 department+0.108
 nod+0.099
 Driver+0.098
 moment+0.094
 gaze+0.094
Neg logit contributions
 valuable-0.079
 amazing-0.076
 impressive-0.074
 much-0.073
 immensely-0.072
Token They
Token ID1119
Feature activation-2.26e+00

Pos logit contributions
 department+0.049
 walk+0.044
 sister+0.044
 party+0.043
eman+0.042
Neg logit contributions
 valuable-0.035
cks-0.033
ang-0.032
 awesome-0.032
 impressive-0.032
Token wanted
Token ID2227
Feature activation-2.20e+00

Pos logit contributions
 department+0.060
 moment+0.054
 Lee+0.053
pit+0.053
 ending+0.051
Neg logit contributions
 valuable-0.049
 amazing-0.045
 impressive-0.044
b-0.044
cks-0.043
Token to
Token ID284
Feature activation-5.30e+00

Pos logit contributions
 nod+0.058
 department+0.058
pit+0.057
 spine+0.055
 Driver+0.055
Neg logit contributions
 valuable-0.049
 amazing-0.045
 much-0.043
 impressive-0.043
 incredible-0.042
Token eat
Token ID4483
Feature activation-9.64e-01

Pos logit contributions
 department+0.119
ously+0.118
 spine+0.104
pit+0.104
 Lee+0.103
Neg logit contributions
 amazing-0.133
 impressive-0.129
 awesome-0.128
 valuable-0.127
hest-0.127
Token some
Token ID617
Feature activation-1.22e+00

Pos logit contributions
 department+0.022
 nod+0.022
 walk+0.022
 moment+0.020
 gaze+0.020
Neg logit contributions
 valuable-0.024
ang-0.024
b-0.024
cks-0.023
irling-0.023
Token soup
Token ID17141
Feature activation-1.13e+00

Pos logit contributions
 department+0.027
 nod+0.027
 walk+0.026
 moment+0.025
 gaze+0.024
Neg logit contributions
 valuable-0.032
ang-0.031
b-0.031
cks-0.030
irling-0.030
Token.
Token ID13
Feature activation-1.90e+00

Pos logit contributions
 nod+0.025
 gaze+0.024
 department+0.024
 walk+0.024
 hug+0.023
Neg logit contributions
 valuable-0.029
b-0.028
ang-0.027
 amazing-0.027
hest-0.027
Token But
Token ID887
Feature activation-2.83e+00

Pos logit contributions
 department+0.049
 walk+0.045
 nod+0.044
 sister+0.044
 ending+0.044
Neg logit contributions
 valuable-0.042
ang-0.042
cks-0.040
b-0.040
 awesome-0.039
Token are
Token ID389
Feature activation+1.37e+00

Pos logit contributions
 department+0.015
 nod+0.014
 moment+0.013
 Lee+0.013
 spine+0.013
Neg logit contributions
b-0.017
cks-0.016
ang-0.016
 valuable-0.016
ink-0.016
Token hungry
Token ID14720
Feature activation-3.81e+00

Pos logit contributions
 valuable+0.034
b+0.034
ang+0.033
hest+0.032
cks+0.032
Neg logit contributions
 nod-0.032
 walk-0.032
 department-0.032
 hug-0.030
 sister-0.030
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
 department+0.104
 nod+0.096
 Driver+0.092
 gaze+0.091
 moment+0.091
Neg logit contributions
 valuable-0.075
 amazing-0.072
 impressive-0.071
 much-0.070
ang-0.069
Token They
Token ID1119
Feature activation-1.89e+00

Pos logit contributions
 department+0.036
 walk+0.033
 sister+0.031
 party+0.030
 ending+0.030
Neg logit contributions
 valuable-0.030
cks-0.030
ang-0.030
 awesome-0.028
 impressive-0.027
Token want
Token ID765
Feature activation-2.88e+00

Pos logit contributions
 department+0.046
 Lee+0.040
pit+0.040
 moment+0.039
ously+0.037
Neg logit contributions
 valuable-0.048
 amazing-0.045
 awesome-0.044
cks-0.044
 impressive-0.044
Token to
Token ID284
Feature activation-5.29e+00

Pos logit contributions
 nod+0.064
 department+0.063
pit+0.061
 spine+0.058
 Lee+0.058
Neg logit contributions
 valuable-0.076
 amazing-0.070
 impressive-0.068
hes-0.067
 awesome-0.067
Token eat
Token ID4483
Feature activation-9.95e-01

Pos logit contributions
 department+0.121
ously+0.117
pit+0.105
 moment+0.103
 Lee+0.102
Neg logit contributions
 amazing-0.131
 awesome-0.127
 impressive-0.126
hest-0.125
cks-0.125
Token meat
Token ID6174
Feature activation-4.99e-01

Pos logit contributions
 department+0.023
 nod+0.023
 walk+0.023
 moment+0.021
 gaze+0.021
Neg logit contributions
 valuable-0.025
ang-0.024
b-0.024
cks-0.023
irling-0.023
Token for
Token ID329
Feature activation-1.26e+00

Pos logit contributions
 nod+0.011
 department+0.010
 walk+0.010
 gaze+0.010
 moment+0.010
Neg logit contributions
 valuable-0.013
b-0.013
ang-0.013
cks-0.013
hest-0.012
Token lunch
Token ID9965
Feature activation-1.11e+00

Pos logit contributions
 walk+0.021
 nod+0.018
 Lee+0.018
 snack+0.017
 department+0.017
Neg logit contributions
ang-0.043
b-0.041
 valuable-0.041
hest-0.041
pill-0.040
Token.
Token ID13
Feature activation-1.85e+00

Pos logit contributions
 department+0.022
 nod+0.022
 walk+0.021
 gaze+0.021
 moment+0.020
Neg logit contributions
 valuable-0.031
b-0.030
ang-0.030
hest-0.030
 awesome-0.029
Token quickly
Token ID2952
Feature activation-2.09e+00

Pos logit contributions
 department+0.006
 nod+0.006
 moment+0.005
 ending+0.005
 walk+0.005
Neg logit contributions
 valuable-0.008
b-0.008
ang-0.007
irling-0.007
ind-0.007
Token ran
Token ID4966
Feature activation-1.21e+00

Pos logit contributions
 department+0.042
 nod+0.041
 moment+0.041
 Lee+0.039
 walk+0.038
Neg logit contributions
 valuable-0.058
b-0.056
ang-0.054
irling-0.052
 amazing-0.052
Token over
Token ID625
Feature activation-1.69e+00

Pos logit contributions
 nod+0.026
 department+0.025
 gaze+0.023
 ending+0.023
 walk+0.022
Neg logit contributions
 valuable-0.033
b-0.031
 awesome-0.030
 amazing-0.030
ang-0.030
Token to
Token ID284
Feature activation-3.32e+00

Pos logit contributions
 department+0.035
 nod+0.034
 ending+0.034
 sister+0.031
 gaze+0.031
Neg logit contributions
 valuable-0.044
ang-0.043
b-0.043
 amazing-0.042
 colourful-0.041
Token the
Token ID262
Feature activation-2.10e+00

Pos logit contributions
 department+0.098
pit+0.089
 ending+0.083
 spine+0.081
 guy+0.080
Neg logit contributions
 amazing-0.064
 valuable-0.062
cks-0.061
 awesome-0.060
b-0.060
Token other
Token ID584
Feature activation-5.28e+00

Pos logit contributions
 nod+0.042
 department+0.040
ously+0.039
 walk+0.038
 gaze+0.037
Neg logit contributions
 valuable-0.060
 amazing-0.056
 impressive-0.056
 incredible-0.054
 awesome-0.054
Token side
Token ID1735
Feature activation-2.08e+00

Pos logit contributions
 nod+0.100
 department+0.098
ously+0.098
 gaze+0.093
atively+0.092
Neg logit contributions
 amazing-0.141
 valuable-0.139
 impressive-0.136
 incredible-0.135
 awesome-0.132
Token of
Token ID286
Feature activation-1.24e+00

Pos logit contributions
 department+0.065
 nod+0.064
 gaze+0.062
 walk+0.062
 ending+0.060
Neg logit contributions
ang-0.037
b-0.030
 valuable-0.029
hest-0.028
 amazing-0.027
Token the
Token ID262
Feature activation-1.32e+00

Pos logit contributions
 department+0.031
 nod+0.031
 walk+0.029
 gaze+0.028
 moment+0.027
Neg logit contributions
 valuable-0.029
ang-0.028
b-0.028
cks-0.027
 awesome-0.027
Token room
Token ID2119
Feature activation-2.90e+00

Pos logit contributions
 nod+0.027
 department+0.025
 gaze+0.023
ously+0.023
 walk+0.023
Neg logit contributions
 valuable-0.039
 amazing-0.036
 impressive-0.036
b-0.035
ang-0.035
Token and
Token ID290
Feature activation-8.14e-01

Pos logit contributions
 nod+0.071
 department+0.070
 guy+0.068
 Driver+0.066
 walk+0.066
Neg logit contributions
 valuable-0.058
ang-0.056
 amazing-0.056
 much-0.055
her-0.054
Token saw
Token ID2497
Feature activation+9.06e-01

Pos logit contributions
 department+0.012
 nod+0.012
 moment+0.011
 walk+0.011
 Lee+0.011
Neg logit contributions
 valuable-0.019
b-0.019
ang-0.019
irling-0.018
cks-0.018
Token a
Token ID257
Feature activation-4.83e-02

Pos logit contributions
 valuable+0.019
ang+0.018
b+0.018
cks+0.018
irling+0.017
Neg logit contributions
 department-0.025
 nod-0.025
 walk-0.024
 moment-0.023
 gaze-0.023
Token rabbit
Token ID22746
Feature activation-7.37e-01

Pos logit contributions
 department+0.001
 nod+0.001
 walk+0.001
 gaze+0.001
 Lee+0.001
Neg logit contributions
 valuable-0.001
ang-0.001
b-0.001
 awesome-0.001
irling-0.001
Token on
Token ID319
Feature activation-5.55e-01

Pos logit contributions
 department+0.018
 nod+0.017
 moment+0.016
 Lee+0.016
 ending+0.016
Neg logit contributions
ang-0.018
b-0.018
 valuable-0.017
irling-0.017
cks-0.017
Token the
Token ID262
Feature activation-3.27e+00

Pos logit contributions
 walk+0.011
 nod+0.010
 department+0.010
 Lee+0.009
 hug+0.009
Neg logit contributions
b-0.017
ang-0.017
 valuable-0.017
irling-0.016
hest-0.016
Token other
Token ID584
Feature activation-5.28e+00

Pos logit contributions
 department+0.060
 nod+0.060
 walk+0.059
 moment+0.054
 gaze+0.053
Neg logit contributions
 valuable-0.097
ang-0.096
b-0.095
cks-0.092
irling-0.091
Token side
Token ID1735
Feature activation-6.32e-01

Pos logit contributions
 department+0.063
 nod+0.059
 walk+0.051
 moment+0.047
 gaze+0.047
Neg logit contributions
ang-0.190
 valuable-0.189
b-0.185
hest-0.183
 amazing-0.181
Token!
Token ID0
Feature activation-1.15e+00

Pos logit contributions
 department+0.018
 nod+0.018
 walk+0.017
 gaze+0.017
 hug+0.016
Neg logit contributions
ang-0.013
b-0.013
 valuable-0.012
cks-0.012
hest-0.012
Token The
Token ID383
Feature activation-1.71e+00

Pos logit contributions
 department+0.027
 walk+0.026
 nod+0.026
 moment+0.025
 gaze+0.024
Neg logit contributions
 valuable-0.029
ang-0.028
b-0.027
pill-0.026
cks-0.026
Token children
Token ID1751
Feature activation-2.04e-01

Pos logit contributions
 department+0.034
 nod+0.033
 walk+0.031
 gaze+0.031
 Lee+0.030
Neg logit contributions
 valuable-0.049
 amazing-0.046
 impressive-0.046
ang-0.046
 awesome-0.046
Token were
Token ID547
Feature activation+1.30e+00

Pos logit contributions
 department+0.004
 nod+0.004
 moment+0.004
 walk+0.004
 ending+0.004
Neg logit contributions
ang-0.006
b-0.006
 valuable-0.005
irling-0.005
cks-0.005
Token were
Token ID547
Feature activation+1.29e+00

Pos logit contributions
 department+0.042
 nod+0.035
 moment+0.035
 spine+0.035
pit+0.034
Neg logit contributions
 valuable-0.048
b-0.047
 amazing-0.046
ang-0.046
cks-0.046
Token hungry
Token ID14720
Feature activation-3.81e+00

Pos logit contributions
b+0.035
 valuable+0.035
ang+0.033
hest+0.033
cks+0.033
Neg logit contributions
 walk-0.027
 nod-0.027
 department-0.027
 sister-0.026
 hug-0.025
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
 department+0.097
 nod+0.090
 Driver+0.088
 moment+0.086
 gaze+0.086
Neg logit contributions
 valuable-0.083
 amazing-0.081
 impressive-0.078
 much-0.076
 incredible-0.076
Token They
Token ID1119
Feature activation-1.99e+00

Pos logit contributions
 department+0.040
 walk+0.038
 nod+0.037
 moment+0.035
 hug+0.035
Neg logit contributions
 valuable-0.040
ang-0.038
b-0.036
cks-0.036
 awesome-0.036
Token wanted
Token ID2227
Feature activation-2.14e+00

Pos logit contributions
 department+0.053
 Lee+0.048
 moment+0.047
pit+0.046
 nod+0.046
Neg logit contributions
 valuable-0.043
 amazing-0.040
b-0.039
 impressive-0.038
 incredibly-0.038
Token to
Token ID284
Feature activation-5.28e+00

Pos logit contributions
 nod+0.054
 department+0.053
pit+0.052
 Lee+0.050
 Driver+0.050
Neg logit contributions
 valuable-0.051
 amazing-0.046
 impressive-0.044
 much-0.044
 awesome-0.044
Token feed
Token ID3745
Feature activation-1.02e+00

Pos logit contributions
 department+0.114
ously+0.110
pit+0.098
 spine+0.098
 Lee+0.098
Neg logit contributions
 amazing-0.138
 impressive-0.134
 valuable-0.133
 awesome-0.133
hest-0.133
Token the
Token ID262
Feature activation-8.43e-02

Pos logit contributions
 nod+0.020
 walk+0.020
 department+0.020
 moment+0.019
 hug+0.018
Neg logit contributions
ang-0.029
 valuable-0.029
irling-0.029
b-0.028
pill-0.027
Token ducks
Token ID39694
Feature activation-1.00e+00

Pos logit contributions
 department+0.002
 nod+0.002
 walk+0.002
 moment+0.002
 gaze+0.002
Neg logit contributions
 valuable-0.002
ang-0.002
b-0.002
cks-0.002
irling-0.002
Token at
Token ID379
Feature activation-1.30e+00

Pos logit contributions
 department+0.021
 nod+0.020
 walk+0.019
 Lee+0.019
 ending+0.019
Neg logit contributions
 valuable-0.027
b-0.027
ang-0.027
cks-0.026
hest-0.025
Token the
Token ID262
Feature activation-1.14e+00

Pos logit contributions
 department+0.034
 nod+0.034
 walk+0.032
ously+0.031
 moment+0.030
Neg logit contributions
 valuable-0.028
ang-0.027
cks-0.027
b-0.027
ind-0.026
Token!"
Token ID2474
Feature activation-8.17e-01

Pos logit contributions
 valuable+0.057
b+0.057
ang+0.056
cks+0.054
hest+0.053
Neg logit contributions
 department-0.043
 nod-0.043
 walk-0.042
 moment-0.039
 gaze-0.039
Token Ben
Token ID3932
Feature activation+4.46e-01

Pos logit contributions
 department+0.022
 walk+0.020
 nod+0.020
 moment+0.019
 ending+0.019
Neg logit contributions
 valuable-0.018
ang-0.018
b-0.017
cks-0.017
pill-0.016
Token sits
Token ID10718
Feature activation+7.34e-01

Pos logit contributions
 valuable+0.011
ang+0.011
b+0.011
cks+0.011
 awesome+0.010
Neg logit contributions
 department-0.010
 walk-0.010
 nod-0.010
 moment-0.009
 gaze-0.009
Token on
Token ID319
Feature activation-1.54e+00

Pos logit contributions
 valuable+0.018
b+0.017
ang+0.017
hest+0.016
cks+0.016
Neg logit contributions
 nod-0.018
 department-0.018
 walk-0.018
 moment-0.017
 Lee-0.017
Token the
Token ID262
Feature activation-2.54e+00

Pos logit contributions
 department+0.036
 nod+0.036
 walk+0.035
 moment+0.033
 gaze+0.033
Neg logit contributions
 valuable-0.038
ang-0.037
b-0.037
cks-0.035
irling-0.035
Token other
Token ID584
Feature activation-5.27e+00

Pos logit contributions
 department+0.045
 nod+0.044
 walk+0.040
 Lee+0.038
 moment+0.038
Neg logit contributions
 valuable-0.077
b-0.075
hest-0.074
 awesome-0.074
ang-0.074
Token chair
Token ID5118
Feature activation-1.69e+00

Pos logit contributions
 department+0.095
 nod+0.085
ously+0.076
 walk+0.071
 gaze+0.069
Neg logit contributions
hest-0.154
ang-0.153
b-0.152
 awesome-0.149
 valuable-0.149
Token and
Token ID290
Feature activation-9.63e-01

Pos logit contributions
 department+0.038
 nod+0.036
 walk+0.034
 moment+0.034
 gaze+0.033
Neg logit contributions
ang-0.043
b-0.042
 valuable-0.042
cks-0.040
hest-0.039
Token says
Token ID1139
Feature activation-3.60e-01

Pos logit contributions
 department+0.022
 nod+0.020
 moment+0.020
 ending+0.020
 walk+0.020
Neg logit contributions
b-0.024
cks-0.024
ang-0.024
 valuable-0.023
irling-0.022
Token,
Token ID11
Feature activation-1.40e+00

Pos logit contributions
 department+0.005
 walk+0.005
 garage+0.005
 doorstep+0.004
pit+0.004
Neg logit contributions
 valuable-0.012
b-0.012
 awesome-0.011
 much-0.011
 amazing-0.011
Token "
Token ID366
Feature activation+6.76e-01

Pos logit contributions
 department+0.029
 walk+0.028
 nod+0.027
 party+0.025
 ending+0.025
Neg logit contributions
 valuable-0.037
ang-0.037
b-0.037
irling-0.035
cks-0.035
Token see
Token ID766
Feature activation+6.50e-02

Pos logit contributions
 department+0.036
pit+0.034
 garage+0.030
 Lee+0.030
atively+0.030
Neg logit contributions
b-0.053
 valuable-0.051
 amazing-0.050
cks-0.050
 awesome-0.050
Token the
Token ID262
Feature activation-1.39e+00

Pos logit contributions
 valuable+0.002
ang+0.002
b+0.002
hest+0.002
irling+0.002
Neg logit contributions
 walk-0.001
 department-0.001
 nod-0.001
 Lee-0.001
 hug-0.001
Token sack
Token ID23704
Feature activation-1.16e+00

Pos logit contributions
 nod+0.018
 department+0.017
 walk+0.016
 gaze+0.016
 moment+0.015
Neg logit contributions
 valuable-0.050
ang-0.048
b-0.047
 awesome-0.046
 amazing-0.046
Token on
Token ID319
Feature activation-1.09e+00

Pos logit contributions
 department+0.030
 nod+0.029
 Lee+0.028
 sister+0.027
 gaze+0.027
Neg logit contributions
hest-0.026
 valuable-0.026
b-0.025
cks-0.025
ang-0.025
Token the
Token ID262
Feature activation-3.14e+00

Pos logit contributions
 walk+0.021
 department+0.020
 nod+0.020
 Lee+0.020
 moment+0.019
Neg logit contributions
b-0.032
 valuable-0.032
ang-0.032
irling-0.030
hest-0.030
Token other
Token ID584
Feature activation-5.27e+00

Pos logit contributions
 nod+0.076
 department+0.074
 gaze+0.068
 moment+0.068
 walk+0.067
Neg logit contributions
 valuable-0.078
hest-0.071
 awesome-0.070
 amazing-0.070
b-0.069
Token side
Token ID1735
Feature activation-1.15e+00

Pos logit contributions
 nod+0.071
 department+0.068
 gaze+0.059
 walk+0.058
 moment+0.057
Neg logit contributions
 valuable-0.179
ang-0.178
hest-0.175
 amazing-0.169
 awesome-0.168
Token of
Token ID286
Feature activation-1.02e+00

Pos logit contributions
 department+0.030
 nod+0.030
 walk+0.029
 gaze+0.028
 moment+0.027
Neg logit contributions
ang-0.026
b-0.025
 valuable-0.024
hest-0.024
cks-0.023
Token the
Token ID262
Feature activation-1.75e+00

Pos logit contributions
 department+0.023
 nod+0.023
 walk+0.023
 moment+0.021
 gaze+0.021
Neg logit contributions
 valuable-0.026
ang-0.026
b-0.026
cks-0.025
irling-0.024
Token fence
Token ID13990
Feature activation-1.43e+00

Pos logit contributions
 nod+0.035
 department+0.034
 gaze+0.031
 Lee+0.030
 walk+0.030
Neg logit contributions
 valuable-0.052
ang-0.049
b-0.047
ind-0.047
 amazing-0.046
Token.
Token ID13
Feature activation-5.58e-01

Pos logit contributions
 department+0.033
 nod+0.033
 moment+0.030
 sister+0.030
 gaze+0.030
Neg logit contributions
ang-0.035
b-0.035
cks-0.035
hest-0.034
 valuable-0.034
Token are
Token ID389
Feature activation+1.23e+00

Pos logit contributions
 department+0.048
 spine+0.041
 moment+0.040
pit+0.040
 Lee+0.039
Neg logit contributions
 valuable-0.047
 amazing-0.046
b-0.046
ang-0.045
dd-0.044
Token hungry
Token ID14720
Feature activation-4.02e+00

Pos logit contributions
b+0.033
 valuable+0.032
ang+0.031
hest+0.031
cks+0.031
Neg logit contributions
 nod-0.028
 walk-0.027
 department-0.027
 sister-0.026
 hug-0.026
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
 department+0.111
 Driver+0.101
 nod+0.100
 gaze+0.096
 moment+0.096
Neg logit contributions
 valuable-0.081
 amazing-0.076
 impressive-0.074
 much-0.074
 immensely-0.072
Token They
Token ID1119
Feature activation-1.86e+00

Pos logit contributions
 department+0.031
 walk+0.029
 nod+0.028
 moment+0.027
 hug+0.026
Neg logit contributions
 valuable-0.035
ang-0.034
cks-0.033
b-0.033
 awesome-0.032
Token want
Token ID765
Feature activation-2.91e+00

Pos logit contributions
 department+0.047
pit+0.041
 Lee+0.041
 moment+0.040
ously+0.038
Neg logit contributions
 valuable-0.044
 amazing-0.042
 awesome-0.041
 impressive-0.040
cks-0.040
Token to
Token ID284
Feature activation-5.27e+00

Pos logit contributions
 nod+0.066
 department+0.065
pit+0.064
 moment+0.060
 spine+0.060
Neg logit contributions
 valuable-0.075
 amazing-0.069
 impressive-0.067
 awesome-0.066
 incredible-0.066
Token eat
Token ID4483
Feature activation-8.61e-01

Pos logit contributions
 department+0.121
ously+0.119
pit+0.105
 spine+0.105
 Lee+0.103
Neg logit contributions
 amazing-0.130
 awesome-0.126
 impressive-0.125
hest-0.124
 valuable-0.123
Token something
Token ID1223
Feature activation+8.38e-01

Pos logit contributions
 nod+0.018
 department+0.018
 walk+0.018
 moment+0.017
 hug+0.017
Neg logit contributions
 valuable-0.023
ang-0.023
b-0.023
cks-0.022
irling-0.022
Token y
Token ID331
Feature activation-1.26e+00

Pos logit contributions
ang+0.019
b+0.018
cks+0.018
irling+0.018
pill+0.017
Neg logit contributions
 walk-0.021
 department-0.021
 nod-0.021
 moment-0.020
 hug-0.020
Tokenummy
Token ID13513
Feature activation-1.56e+00

Pos logit contributions
 department+0.033
 nod+0.032
 walk+0.031
 moment+0.030
 gaze+0.030
Neg logit contributions
 valuable-0.028
ang-0.028
b-0.027
cks-0.027
hest-0.026
Token.
Token ID13
Feature activation-2.10e+00

Pos logit contributions
 department+0.032
 nod+0.031
 walk+0.030
 gaze+0.029
 hug+0.028
Neg logit contributions
 valuable-0.044
b-0.042
ang-0.041
 awesome-0.040
cks-0.040
Token he
Token ID339
Feature activation+1.12e+00

Pos logit contributions
 department+0.012
 nod+0.012
 walk+0.012
 moment+0.011
 gaze+0.010
Neg logit contributions
 valuable-0.030
ang-0.029
b-0.029
cks-0.029
irling-0.028
Token couldn
Token ID3521
Feature activation-6.56e-01

Pos logit contributions
ang+0.034
cks+0.033
hest+0.033
 valuable+0.033
b+0.032
Neg logit contributions
 walk-0.021
 department-0.020
 woke-0.019
 nod-0.019
 sister-0.018
Token't
Token ID470
Feature activation-3.80e+00

Pos logit contributions
 walk+0.014
 department+0.014
 nod+0.013
 moment+0.013
 hug+0.012
Neg logit contributions
 valuable-0.018
ang-0.018
b-0.018
cks-0.017
irling-0.017
Token decide
Token ID5409
Feature activation-1.85e+00

Pos logit contributions
 department+0.059
 Lee+0.051
 nod+0.048
 moment+0.047
pit+0.047
Neg logit contributions
b-0.124
 valuable-0.123
hest-0.123
ang-0.121
 amazing-0.120
Token what
Token ID644
Feature activation-6.02e-01

Pos logit contributions
 department+0.040
 nod+0.039
 Lee+0.036
 moment+0.035
 walk+0.035
Neg logit contributions
b-0.050
 valuable-0.050
cks-0.047
 awesome-0.047
 amazing-0.046
Token to
Token ID284
Feature activation-5.25e+00

Pos logit contributions
 department+0.010
 nod+0.009
 walk+0.009
 Lee+0.008
pit+0.008
Neg logit contributions
 valuable-0.020
b-0.019
ang-0.018
 awesome-0.018
 amazing-0.018
Token eat
Token ID4483
Feature activation-6.94e-01

Pos logit contributions
 department+0.144
 Lee+0.128
ously+0.126
pit+0.120
atively+0.116
Neg logit contributions
 amazing-0.116
hest-0.111
 awesome-0.108
cks-0.107
b-0.107
Token for
Token ID329
Feature activation-1.75e+00

Pos logit contributions
 department+0.012
 nod+0.012
 walk+0.012
 moment+0.011
 hug+0.011
Neg logit contributions
 valuable-0.021
ang-0.021
b-0.021
cks-0.020
irling-0.020
Token lunch
Token ID9965
Feature activation-7.45e-01

Pos logit contributions
 walk+0.028
 nod+0.026
 department+0.024
 moment+0.024
 hug+0.024
Neg logit contributions
ang-0.058
 valuable-0.057
b-0.056
irling-0.054
hest-0.054
Token.
Token ID13
Feature activation-2.72e-01

Pos logit contributions
 nod+0.015
 department+0.015
 walk+0.014
 gaze+0.014
 hug+0.014
Neg logit contributions
 valuable-0.021
b-0.020
ang-0.020
hest-0.020
 awesome-0.019
Token He
Token ID679
Feature activation-5.12e-02

Pos logit contributions
 department+0.006
 walk+0.006
 nod+0.006
 moment+0.006
 hug+0.006
Neg logit contributions
 valuable-0.007
ang-0.006
b-0.006
cks-0.006
hest-0.006
Token small
Token ID1402
Feature activation-4.46e+00

Pos logit contributions
ously+0.039
gy+0.038
 Lee+0.038
 department+0.038
 Johnson+0.035
Neg logit contributions
 amazing-0.024
 impressive-0.024
 valuable-0.023
 incredible-0.023
 awesome-0.022
Token girl
Token ID2576
Feature activation-7.32e-02

Pos logit contributions
stack+0.135
comb+0.132
isk+0.119
 spine+0.116
 hospitality+0.115
Neg logit contributions
 attractive-0.086
 impressive-0.085
 colorful-0.082
 pretty-0.082
 expensive-0.081
Token went
Token ID1816
Feature activation-1.20e+00

Pos logit contributions
 department+0.002
 nod+0.001
 ending+0.001
 moment+0.001
 Lee+0.001
Neg logit contributions
b-0.002
ang-0.002
 valuable-0.002
irling-0.002
 amazing-0.002
Token to
Token ID284
Feature activation-2.66e+00

Pos logit contributions
 department+0.026
 nod+0.025
 ending+0.024
 gaze+0.024
 walk+0.023
Neg logit contributions
b-0.032
ang-0.032
 valuable-0.031
 awesome-0.029
irling-0.029
Token the
Token ID262
Feature activation-2.01e+00

Pos logit contributions
 department+0.085
pit+0.078
ously+0.075
 ending+0.075
atively+0.072
Neg logit contributions
 amazing-0.044
 awesome-0.043
 impressive-0.041
 incredible-0.040
 valuable-0.040
Token other
Token ID584
Feature activation-5.24e+00

Pos logit contributions
 department+0.050
ously+0.048
gy+0.045
 Lee+0.044
 nod+0.044
Neg logit contributions
 amazing-0.050
 impressive-0.050
 awesome-0.048
 valuable-0.048
 incredible-0.048
Token side
Token ID1735
Feature activation-9.83e-01

Pos logit contributions
 department+0.108
ously+0.097
gy+0.093
atively+0.092
 nod+0.090
Neg logit contributions
 amazing-0.140
ang-0.140
 impressive-0.138
 awesome-0.135
 incredible-0.134
Token of
Token ID286
Feature activation-1.37e+00

Pos logit contributions
 department+0.034
 nod+0.031
 walk+0.030
 gaze+0.030
 guy+0.029
Neg logit contributions
ang-0.017
b-0.014
 valuable-0.013
 much-0.012
hest-0.012
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 department+0.039
 nod+0.037
 walk+0.034
 gaze+0.034
 ending+0.034
Neg logit contributions
ang-0.028
 valuable-0.027
b-0.027
cks-0.027
 awesome-0.026
Token playground
Token ID24817
Feature activation-1.79e+00

Pos logit contributions
 department+0.021
 nod+0.020
 Lee+0.018
 walk+0.018
ously+0.017
Neg logit contributions
 valuable-0.039
ang-0.038
b-0.038
 awesome-0.037
 amazing-0.037
Token and
Token ID290
Feature activation-7.11e-01

Pos logit contributions
 department+0.046
 nod+0.042
 guy+0.041
 walk+0.039
 moment+0.039
Neg logit contributions
ang-0.040
 valuable-0.037
 much-0.037
b-0.037
 immensely-0.036
Token were
Token ID547
Feature activation+1.28e+00

Pos logit contributions
 department+0.050
 moment+0.042
pit+0.042
 nod+0.041
 spine+0.041
Neg logit contributions
 valuable-0.057
cks-0.056
 amazing-0.056
b-0.056
 awesome-0.054
Token hungry
Token ID14720
Feature activation-3.86e+00

Pos logit contributions
 valuable+0.030
b+0.030
ang+0.030
cks+0.029
hest+0.028
Neg logit contributions
 department-0.031
 nod-0.031
 walk-0.031
 hug-0.029
 moment-0.029
Token.
Token ID13
Feature activation-1.94e+00

Pos logit contributions
 department+0.102
 nod+0.095
 Driver+0.093
 moment+0.090
 gaze+0.089
Neg logit contributions
 valuable-0.080
 amazing-0.078
 impressive-0.076
 much-0.075
 incredible-0.074
Token They
Token ID1119
Feature activation-2.23e+00

Pos logit contributions
 department+0.050
 walk+0.047
 nod+0.044
 moment+0.042
 ending+0.042
Neg logit contributions
 valuable-0.044
ang-0.043
b-0.040
cks-0.040
pill-0.039
Token wanted
Token ID2227
Feature activation-2.26e+00

Pos logit contributions
 department+0.061
 moment+0.054
 Lee+0.054
pit+0.053
 ending+0.053
Neg logit contributions
 valuable-0.046
 amazing-0.043
b-0.042
cks-0.042
 impressive-0.042
Token to
Token ID284
Feature activation-5.24e+00

Pos logit contributions
 nod+0.057
 department+0.056
pit+0.054
 Driver+0.053
 spine+0.053
Neg logit contributions
 valuable-0.054
 amazing-0.050
 much-0.048
 impressive-0.048
 incredible-0.047
Token eat
Token ID4483
Feature activation-1.03e+00

Pos logit contributions
 department+0.117
ously+0.114
 spine+0.101
pit+0.099
 Lee+0.099
Neg logit contributions
 amazing-0.133
 impressive-0.129
 awesome-0.128
hest-0.126
 valuable-0.126
Token something
Token ID1223
Feature activation+7.02e-01

Pos logit contributions
 department+0.023
 nod+0.023
 walk+0.022
 moment+0.021
 hug+0.021
Neg logit contributions
 valuable-0.027
ang-0.026
b-0.026
cks-0.025
irling-0.025
Token y
Token ID331
Feature activation-1.44e+00

Pos logit contributions
ang+0.015
b+0.015
cks+0.015
irling+0.014
pill+0.014
Neg logit contributions
 walk-0.019
 nod-0.018
 department-0.018
 moment-0.018
 hug-0.017
Tokenummy
Token ID13513
Feature activation-1.52e+00

Pos logit contributions
 department+0.037
 nod+0.036
 walk+0.035
 moment+0.034
 gaze+0.033
Neg logit contributions
 valuable-0.033
ang-0.033
b-0.032
cks-0.031
hest-0.031
Token.
Token ID13
Feature activation-2.06e+00

Pos logit contributions
 department+0.032
 nod+0.032
 walk+0.030
 gaze+0.029
 hug+0.028
Neg logit contributions
 valuable-0.042
b-0.041
ang-0.040
 awesome-0.039
cks-0.039
Token saw
Token ID2497
Feature activation+3.24e-01

Pos logit contributions
 department+0.001
 nod+0.001
 walk+0.001
 moment+0.001
 Lee+0.001
Neg logit contributions
 valuable-0.002
b-0.002
ang-0.002
cks-0.002
irling-0.002
Token a
Token ID257
Feature activation-1.64e-01

Pos logit contributions
ang+0.007
 valuable+0.007
b+0.007
irling+0.007
cks+0.007
Neg logit contributions
 nod-0.008
 department-0.008
 walk-0.008
 hug-0.008
 moment-0.008
Token cat
Token ID3797
Feature activation-1.69e-01

Pos logit contributions
 department+0.005
 nod+0.005
 walk+0.005
 Lee+0.005
 gaze+0.005
Neg logit contributions
 valuable-0.003
ang-0.003
b-0.003
 awesome-0.002
irling-0.002
Token on
Token ID319
Feature activation-6.90e-01

Pos logit contributions
 department+0.004
 nod+0.004
 moment+0.004
 walk+0.004
 ending+0.004
Neg logit contributions
ang-0.004
b-0.004
 valuable-0.004
cks-0.004
irling-0.004
Token the
Token ID262
Feature activation-2.92e+00

Pos logit contributions
 walk+0.014
 Lee+0.012
 nod+0.012
 department+0.012
 hug+0.012
Neg logit contributions
ang-0.021
b-0.021
irling-0.021
 valuable-0.020
hest-0.020
Token other
Token ID584
Feature activation-5.23e+00

Pos logit contributions
 nod+0.060
 department+0.060
 walk+0.055
 moment+0.055
 Lee+0.053
Neg logit contributions
 valuable-0.081
ang-0.078
b-0.077
hest-0.076
 awesome-0.075
Token side
Token ID1735
Feature activation-1.08e+00

Pos logit contributions
 department+0.069
 nod+0.065
 walk+0.053
 moment+0.052
 gaze+0.051
Neg logit contributions
ang-0.185
 valuable-0.180
hest-0.176
b-0.176
ind-0.174
Token of
Token ID286
Feature activation-8.91e-01

Pos logit contributions
 department+0.029
 nod+0.029
 walk+0.028
 gaze+0.026
 moment+0.026
Neg logit contributions
ang-0.023
 valuable-0.023
b-0.023
cks-0.022
hest-0.021
Token a
Token ID257
Feature activation-1.35e+00

Pos logit contributions
 walk+0.018
 department+0.018
 nod+0.018
 Lee+0.017
 moment+0.017
Neg logit contributions
 valuable-0.025
ang-0.025
b-0.025
irling-0.024
hest-0.023
Token wire
Token ID6503
Feature activation-1.29e-01

Pos logit contributions
 department+0.027
 nod+0.027
 walk+0.026
 moment+0.024
 gaze+0.024
Neg logit contributions
 valuable-0.038
ang-0.037
b-0.037
irling-0.035
ind-0.035
Token fence
Token ID13990
Feature activation-6.43e-01

Pos logit contributions
 department+0.003
 nod+0.003
 walk+0.003
 moment+0.003
 gaze+0.003
Neg logit contributions
b-0.003
ang-0.003
cks-0.003
 valuable-0.003
irling-0.003
Token a
Token ID257
Feature activation-3.27e-01

Pos logit contributions
ang+0.016
 valuable+0.016
b+0.016
irling+0.015
cks+0.015
Neg logit contributions
 nod-0.015
 department-0.015
 walk-0.015
 hug-0.014
 moment-0.014
Token girl
Token ID2576
Feature activation-5.73e-01

Pos logit contributions
 department+0.008
 nod+0.008
 walk+0.007
 Lee+0.007
 gaze+0.007
Neg logit contributions
 valuable-0.008
b-0.008
ang-0.008
 awesome-0.007
irling-0.007
Token standing
Token ID5055
Feature activation+2.05e+00

Pos logit contributions
 department+0.014
 nod+0.013
 walk+0.012
 moment+0.012
 ending+0.012
Neg logit contributions
ang-0.014
b-0.014
 valuable-0.014
cks-0.013
irling-0.013
Token on
Token ID319
Feature activation-9.74e-01

Pos logit contributions
 valuable+0.061
hest+0.058
pill+0.057
 awesome+0.055
isin+0.054
Neg logit contributions
 nod-0.040
 walk-0.040
 department-0.039
 Lee-0.038
pit-0.037
Token the
Token ID262
Feature activation-3.11e+00

Pos logit contributions
 department+0.023
 nod+0.023
 walk+0.023
 moment+0.022
 gaze+0.022
Neg logit contributions
 valuable-0.023
ang-0.023
b-0.023
cks-0.022
irling-0.022
Token other
Token ID584
Feature activation-5.22e+00

Pos logit contributions
 department+0.066
 nod+0.066
 walk+0.063
 Lee+0.059
 moment+0.059
Neg logit contributions
 valuable-0.085
b-0.082
ang-0.082
hest-0.078
cks-0.078
Token side
Token ID1735
Feature activation-9.21e-01

Pos logit contributions
 department+0.056
 nod+0.053
 walk+0.046
 Lee+0.042
 moment+0.041
Neg logit contributions
 valuable-0.197
ang-0.194
b-0.191
hest-0.186
cks-0.184
Token of
Token ID286
Feature activation-5.55e-01

Pos logit contributions
 department+0.027
 nod+0.027
 walk+0.026
 gaze+0.025
 Lee+0.024
Neg logit contributions
ang-0.018
b-0.017
 valuable-0.017
hest-0.016
cks-0.016
Token the
Token ID262
Feature activation-1.80e+00

Pos logit contributions
 department+0.012
 walk+0.012
 nod+0.012
 moment+0.011
 hug+0.011
Neg logit contributions
 valuable-0.015
ang-0.015
b-0.015
irling-0.014
cks-0.014
Token hedge
Token ID19859
Feature activation-3.75e-01

Pos logit contributions
 department+0.033
 nod+0.032
 walk+0.032
 moment+0.030
 hug+0.029
Neg logit contributions
 valuable-0.054
ang-0.054
b-0.054
cks-0.052
irling-0.052
Token.
Token ID13
Feature activation-4.01e-01

Pos logit contributions
 nod+0.008
 department+0.008
 walk+0.008
 gaze+0.007
 ending+0.007
Neg logit contributions
ang-0.010
b-0.010
 valuable-0.010
pill-0.009
hest-0.009
Token saw
Token ID2497
Feature activation+1.62e-01

Pos logit contributions
 nod+0.001
 department+0.001
 moment+0.001
 walk+0.001
 hug+0.001
Neg logit contributions
b-0.002
 valuable-0.002
ang-0.002
irling-0.002
 amazing-0.002
Token something
Token ID1223
Feature activation+1.56e+00

Pos logit contributions
 valuable+0.003
b+0.003
ang+0.003
cks+0.003
irling+0.003
Neg logit contributions
 department-0.004
 nod-0.004
 walk-0.004
 moment-0.004
 gaze-0.004
Token shiny
Token ID22441
Feature activation+9.68e-01

Pos logit contributions
ang+0.033
cks+0.031
ind+0.031
pill+0.031
ink+0.031
Neg logit contributions
 walk-0.044
 nod-0.042
 moment-0.040
 hug-0.040
 department-0.040
Token on
Token ID319
Feature activation-1.69e+00

Pos logit contributions
 valuable+0.023
b+0.023
ang+0.023
cks+0.022
irling+0.022
Neg logit contributions
 department-0.023
 nod-0.023
 walk-0.023
 moment-0.021
 gaze-0.021
Token the
Token ID262
Feature activation-2.59e+00

Pos logit contributions
 walk+0.030
 department+0.028
 nod+0.028
 hug+0.027
 Lee+0.027
Neg logit contributions
ang-0.053
b-0.053
irling-0.052
 valuable-0.052
pill-0.050
Token other
Token ID584
Feature activation-5.21e+00

Pos logit contributions
 nod+0.052
 department+0.052
 walk+0.050
 moment+0.047
 gaze+0.046
Neg logit contributions
 valuable-0.073
b-0.070
ang-0.070
hest-0.068
 awesome-0.068
Token side
Token ID1735
Feature activation-1.60e+00

Pos logit contributions
 nod+0.074
 department+0.072
 walk+0.065
 gaze+0.060
ously+0.058
Neg logit contributions
hest-0.172
 valuable-0.169
ang-0.163
 amazing-0.163
 awesome-0.160
Token of
Token ID286
Feature activation-8.17e-01

Pos logit contributions
 nod+0.044
 department+0.044
 walk+0.043
 gaze+0.041
 moment+0.040
Neg logit contributions
ang-0.033
b-0.032
 valuable-0.032
hest-0.031
cks-0.030
Token the
Token ID262
Feature activation-1.99e+00

Pos logit contributions
 department+0.018
 walk+0.017
 nod+0.017
 moment+0.016
 hug+0.016
Neg logit contributions
 valuable-0.022
ang-0.022
b-0.021
irling-0.021
cks-0.021
Token room
Token ID2119
Feature activation-1.76e+00

Pos logit contributions
 nod+0.038
 department+0.037
 walk+0.035
 gaze+0.033
 moment+0.033
Neg logit contributions
 valuable-0.059
ang-0.056
b-0.055
hest-0.055
 awesome-0.055
Token.
Token ID13
Feature activation-1.87e+00

Pos logit contributions
 nod+0.044
 department+0.043
 walk+0.041
 moment+0.039
 gaze+0.039
Neg logit contributions
hest-0.039
b-0.039
 valuable-0.039
ang-0.039
 amazing-0.037
Token a
Token ID257
Feature activation-7.22e-01

Pos logit contributions
b+0.009
irling+0.009
 valuable+0.009
ang+0.009
hest+0.008
Neg logit contributions
 nod-0.007
 walk-0.006
 Lee-0.006
 department-0.006
 hug-0.006
Token boy
Token ID2933
Feature activation-1.60e+00

Pos logit contributions
 department+0.017
 nod+0.017
 walk+0.017
 gaze+0.016
 Lee+0.015
Neg logit contributions
 valuable-0.018
ang-0.017
b-0.017
 awesome-0.016
irling-0.016
Token standing
Token ID5055
Feature activation+1.63e+00

Pos logit contributions
 department+0.041
 nod+0.036
 ending+0.036
 moment+0.035
 walk+0.034
Neg logit contributions
ang-0.040
b-0.039
cks-0.037
irling-0.035
 valuable-0.035
Token on
Token ID319
Feature activation-9.95e-01

Pos logit contributions
 valuable+0.049
pill+0.047
hest+0.046
 awesome+0.044
isin+0.043
Neg logit contributions
 nod-0.031
 walk-0.031
 department-0.030
 Lee-0.030
pit-0.029
Token the
Token ID262
Feature activation-3.09e+00

Pos logit contributions
 walk+0.023
 nod+0.023
 department+0.023
 Lee+0.021
 moment+0.021
Neg logit contributions
 valuable-0.025
b-0.025
ang-0.025
irling-0.024
cks-0.024
Token other
Token ID584
Feature activation-5.21e+00

Pos logit contributions
 department+0.068
 nod+0.067
 walk+0.065
 Lee+0.062
 moment+0.061
Neg logit contributions
 valuable-0.082
b-0.079
ang-0.079
cks-0.076
 awesome-0.076
Token side
Token ID1735
Feature activation-1.30e+00

Pos logit contributions
 department+0.062
 nod+0.056
 walk+0.049
 Lee+0.047
 gaze+0.045
Neg logit contributions
ang-0.190
 valuable-0.188
b-0.188
hest-0.183
cks-0.182
Token of
Token ID286
Feature activation-7.89e-01

Pos logit contributions
 department+0.037
 nod+0.037
 walk+0.036
 gaze+0.035
 Lee+0.034
Neg logit contributions
ang-0.026
b-0.025
 valuable-0.024
hest-0.023
cks-0.023
Token the
Token ID262
Feature activation-1.85e+00

Pos logit contributions
 department+0.018
 nod+0.018
 walk+0.018
 moment+0.017
 gaze+0.017
Neg logit contributions
 valuable-0.020
ang-0.019
b-0.019
cks-0.019
irling-0.018
Token pond
Token ID16723
Feature activation-6.72e-01

Pos logit contributions
 department+0.036
 walk+0.036
 nod+0.035
 moment+0.033
 hug+0.032
Neg logit contributions
 valuable-0.054
ang-0.053
b-0.053
cks-0.051
irling-0.051
Token.
Token ID13
Feature activation-1.69e+00

Pos logit contributions
 department+0.015
 nod+0.014
 walk+0.014
 Lee+0.013
 gaze+0.013
Neg logit contributions
 valuable-0.018
ang-0.018
b-0.018
cks-0.017
hest-0.016
Token€
Token ID26391
Feature activation+2.44e+00

Pos logit contributions
 valuable+0.045
b+0.044
ang+0.043
cks+0.042
irling+0.042
Neg logit contributions
 nod-0.048
 department-0.047
 walk-0.047
 gaze-0.045
 moment-0.044
Token�
Token ID447
Feature activation+2.43e+00

Pos logit contributions
 valuable+0.057
ang+0.057
b+0.057
cks+0.055
irling+0.054
Neg logit contributions
 department-0.060
 nod-0.059
 walk-0.058
 moment-0.055
 hug-0.054
Token�
Token ID250
Feature activation+2.95e+00

Pos logit contributions
 valuable+0.086
b+0.085
ang+0.083
cks+0.083
 awesome+0.081
Neg logit contributions
 nod-0.032
 walk-0.031
 department-0.030
 moment-0.027
 hug-0.027
Token she
Token ID673
Feature activation+1.70e+00

Pos logit contributions
cks+0.083
 valuable+0.079
irling+0.075
ang+0.072
ind+0.072
Neg logit contributions
 moment-0.064
 nod-0.060
 department-0.056
 walk-0.056
 visit-0.054
Token was
Token ID373
Feature activation+3.99e+00

Pos logit contributions
ang+0.046
 valuable+0.046
b+0.045
cks+0.045
irling+0.044
Neg logit contributions
 nod-0.036
 walk-0.035
 department-0.035
 hug-0.033
 moment-0.033
Token so
Token ID523
Feature activation+4.58e+00

Pos logit contributions
ang+0.086
cks+0.082
b+0.080
pill+0.080
iky+0.079
Neg logit contributions
 nod-0.107
 hug-0.103
 smile-0.102
 walk-0.102
 moment-0.101
Token proud
Token ID6613
Feature activation+1.27e+00

Pos logit contributions
cks+0.102
pill+0.092
hest+0.091
ang+0.091
b+0.090
Neg logit contributions
 nod-0.123
 smile-0.121
 hug-0.121
 walk-0.116
 sister-0.115
Token of
Token ID286
Feature activation+9.62e-01

Pos logit contributions
 valuable+0.030
ang+0.029
b+0.029
cks+0.028
irling+0.027
Neg logit contributions
 department-0.031
 nod-0.031
 walk-0.031
 moment-0.029
 hug-0.029
Token her
Token ID607
Feature activation-7.55e-02

Pos logit contributions
ang+0.022
 valuable+0.022
b+0.021
hest+0.021
pill+0.020
Neg logit contributions
 nod-0.024
 walk-0.024
 hug-0.023
 moment-0.023
 gaze-0.022
Token brave
Token ID14802
Feature activation-8.45e-01

Pos logit contributions
 department+0.001
 nod+0.001
 walk+0.001
 Lee+0.001
 moment+0.001
Neg logit contributions
 valuable-0.002
ang-0.002
b-0.002
 awesome-0.002
irling-0.002
Token and
Token ID290
Feature activation+2.05e+00

Pos logit contributions
 department+0.016
 nod+0.014
atively+0.013
 Lee+0.013
 walk+0.013
Neg logit contributions
 valuable-0.025
 amazing-0.023
 awesome-0.023
ang-0.023
b-0.023
Token to
Token ID284
Feature activation-1.54e+00

Pos logit contributions
ang+0.093
b+0.091
 valuable+0.091
cks+0.090
irling+0.088
Neg logit contributions
 department-0.042
 walk-0.041
 nod-0.041
 hug-0.037
 moment-0.036
Token try
Token ID1949
Feature activation+5.77e-01

Pos logit contributions
 department+0.026
 moment+0.022
 Lee+0.022
 nod+0.022
pit+0.022
Neg logit contributions
 valuable-0.049
b-0.047
 awesome-0.047
cks-0.046
ang-0.046
Token.
Token ID13
Feature activation+6.95e-01

Pos logit contributions
ang+0.018
 valuable+0.017
b+0.017
cks+0.017
hest+0.017
Neg logit contributions
 department-0.010
 walk-0.010
 nod-0.010
 moment-0.009
 hug-0.009
Token Our
Token ID3954
Feature activation+1.43e-01

Pos logit contributions
b+0.020
 valuable+0.019
irling+0.019
cks+0.019
ang+0.018
Neg logit contributions
 nod-0.015
 department-0.014
 Lee-0.014
 walk-0.013
 hug-0.013
Token barn
Token ID25203
Feature activation+9.11e-01

Pos logit contributions
 valuable+0.003
ang+0.003
b+0.003
 awesome+0.003
ind+0.003
Neg logit contributions
 nod-0.003
 department-0.003
 walk-0.003
 moment-0.003
 gaze-0.003
Token is
Token ID318
Feature activation+4.49e+00

Pos logit contributions
 valuable+0.022
b+0.021
ang+0.020
cks+0.020
 awesome+0.020
Neg logit contributions
 nod-0.022
 walk-0.021
 department-0.021
 gaze-0.020
 Lee-0.020
Token getting
Token ID1972
Feature activation+3.44e+00

Pos logit contributions
ang+0.105
pill+0.100
b+0.093
cks+0.092
irling+0.091
Neg logit contributions
 department-0.122
 walk-0.117
 nod-0.114
 moment-0.114
 hug-0.112
Token wet
Token ID9583
Feature activation-1.18e+00

Pos logit contributions
ang+0.093
pill+0.089
irling+0.087
cks+0.086
iky+0.079
Neg logit contributions
 department-0.086
 walk-0.080
 hug-0.078
 moment-0.076
 again-0.076
Token and
Token ID290
Feature activation+2.21e+00

Pos logit contributions
 nod+0.028
 walk+0.027
 department+0.027
 gaze+0.027
 hug+0.025
Neg logit contributions
 valuable-0.028
ang-0.027
b-0.027
hest-0.027
 awesome-0.027
Token we
Token ID356
Feature activation-1.44e-01

Pos logit contributions
ang+0.045
 valuable+0.043
b+0.042
cks+0.041
hest+0.041
Neg logit contributions
 department-0.062
 walk-0.061
 nod-0.061
 moment-0.059
 hug-0.058
Token need
Token ID761
Feature activation+4.03e-01

Pos logit contributions
 department+0.003
 nod+0.003
 walk+0.002
 moment+0.002
 hug+0.002
Neg logit contributions
 valuable-0.004
ang-0.004
b-0.004
cks-0.004
irling-0.004
Token It
Token ID632
Feature activation+1.68e+00

Pos logit contributions
b+0.050
 valuable+0.049
cks+0.048
irling+0.048
ang+0.047
Neg logit contributions
 nod-0.043
 department-0.042
 Lee-0.040
 hug-0.039
 walk-0.039
Token looks
Token ID3073
Feature activation+4.37e+00

Pos logit contributions
ang+0.049
 valuable+0.048
b+0.048
cks+0.047
irling+0.047
Neg logit contributions
 department-0.032
 nod-0.032
 walk-0.031
 hug-0.029
 moment-0.029
Token like
Token ID588
Feature activation+2.29e+00

Pos logit contributions
ang+0.122
cks+0.111
irling+0.105
dd+0.104
pill+0.104
Neg logit contributions
 hug-0.103
 smile-0.102
 nod-0.101
,-0.099
 party-0.098
Token you
Token ID345
Feature activation+1.69e+00

Pos logit contributions
ang+0.053
b+0.050
cks+0.049
 valuable+0.048
irling+0.048
Neg logit contributions
 walk-0.059
 department-0.058
 nod-0.058
 moment-0.056
 hug-0.056
Token are
Token ID389
Feature activation+4.18e+00

Pos logit contributions
ang+0.051
 valuable+0.051
b+0.050
cks+0.049
irling+0.049
Neg logit contributions
 nod-0.030
 walk-0.029
 department-0.029
 hug-0.027
 moment-0.027
Token learning
Token ID4673
Feature activation+4.56e+00

Pos logit contributions
ang+0.098
b+0.089
pill+0.088
cks+0.086
 valuable+0.084
Neg logit contributions
 department-0.106
 nod-0.105
 walk-0.104
 hug-0.102
 smile-0.101
Token a
Token ID257
Feature activation+2.04e+00

Pos logit contributions
ang+0.137
b+0.128
cks+0.127
irling+0.125
ind+0.122
Neg logit contributions
 nod-0.088
 department-0.087
 walk-0.084
 moment-0.082
 hug-0.082
Token lot
Token ID1256
Feature activation+3.09e+00

Pos logit contributions
ang+0.040
cks+0.038
b+0.037
irling+0.036
pill+0.035
Neg logit contributions
 department-0.060
 nod-0.060
 walk-0.059
 moment-0.058
 hug-0.056
Token."
Token ID526
Feature activation+5.70e-01

Pos logit contributions
ang+0.094
irling+0.093
cks+0.092
b+0.089
years+0.088
Neg logit contributions
 department-0.056
 moment-0.053
 nod-0.053
 hug-0.051
,-0.050
Token
Token ID220
Feature activation+2.60e+00

Pos logit contributions
 valuable+0.017
pill+0.016
ang+0.016
hest+0.016
 awesome+0.015
Neg logit contributions
 department-0.011
 walk-0.011
 nod-0.010
 ending-0.009
 sister-0.009
Token
Token ID198
Feature activation+3.08e+00

Pos logit contributions
 valuable+0.074
b+0.074
ang+0.074
irling+0.071
ind+0.071
Neg logit contributions
 nod-0.050
 department-0.049
 walk-0.047
 moment-0.045
 Lee-0.045
Token
Token ID198
Feature activation+3.65e+00

Pos logit contributions
b+0.023
 valuable+0.022
cks+0.022
irling+0.022
ang+0.022
Neg logit contributions
 nod-0.015
 department-0.014
 Lee-0.014
 walk-0.013
 moment-0.013
Token
Token ID198
Feature activation+3.33e+00

Pos logit contributions
ender+0.120
 elegant+0.120
ind+0.118
 valuable+0.118
 prett+0.117
Neg logit contributions
 nod-0.056
 moment-0.053
 walk-0.052
 hug-0.051
 visit-0.048
Token"
Token ID1
Feature activation+1.18e+00

Pos logit contributions
cks+0.079
 valuable+0.075
ang+0.073
ender+0.072
ind+0.072
Neg logit contributions
 nod-0.082
 walk-0.081
 moment-0.079
 department-0.079
 hug-0.073
TokenI
Token ID40
Feature activation+3.31e-01

Pos logit contributions
 valuable+0.031
cks+0.030
ang+0.030
irling+0.029
b+0.028
Neg logit contributions
 nod-0.027
 moment-0.027
 department-0.026
 walk-0.026
 gaze-0.024
Token am
Token ID716
Feature activation+3.22e+00

Pos logit contributions
 valuable+0.010
b+0.010
ang+0.010
cks+0.009
irling+0.009
Neg logit contributions
 department-0.006
 nod-0.006
 walk-0.006
 Lee-0.005
 moment-0.005
Token so
Token ID523
Feature activation+4.43e+00

Pos logit contributions
ang+0.069
pill+0.069
b+0.068
iky+0.067
hest+0.066
Neg logit contributions
 nod-0.086
 hug-0.084
 department-0.084
 smile-0.083
 sister-0.083
Token proud
Token ID6613
Feature activation+1.52e+00

Pos logit contributions
pill+0.098
cks+0.097
hest+0.093
iky+0.090
ender+0.089
Neg logit contributions
 nod-0.118
 smile-0.115
 hug-0.112
 walk-0.111
 sister-0.109
Token of
Token ID286
Feature activation+7.27e-01

Pos logit contributions
 valuable+0.037
ang+0.036
b+0.036
cks+0.035
irling+0.035
Neg logit contributions
 department-0.036
 nod-0.036
 walk-0.035
 moment-0.033
 hug-0.033
Token myself
Token ID3589
Feature activation+1.59e+00

Pos logit contributions
 valuable+0.015
ang+0.015
b+0.015
pill+0.014
hest+0.014
Neg logit contributions
 nod-0.019
 walk-0.019
 department-0.018
 hug-0.018
 moment-0.018
Token,"
Token ID553
Feature activation+5.52e-01

Pos logit contributions
 valuable+0.040
b+0.039
ang+0.039
hest+0.038
 awesome+0.037
Neg logit contributions
 department-0.037
 nod-0.036
 walk-0.036
 gaze-0.034
 moment-0.033
Token he
Token ID339
Feature activation+1.54e+00

Pos logit contributions
 valuable+0.018
ang+0.017
b+0.017
cks+0.017
irling+0.017
Neg logit contributions
 nod-0.009
 department-0.009
 walk-0.009
 moment-0.008
 hug-0.008
Token whistle
Token ID16121
Feature activation+2.53e+00

Pos logit contributions
 department+0.048
 nod+0.045
 walk+0.044
 moment+0.041
 Lee+0.041
Neg logit contributions
 valuable-0.092
b-0.091
ang-0.090
cks-0.088
irling-0.087
Token,
Token ID11
Feature activation-1.52e+00

Pos logit contributions
ang+0.076
cks+0.073
irling+0.073
pill+0.072
 valuable+0.072
Neg logit contributions
 nod-0.047
 walk-0.044
 department-0.043
 moment-0.043
 hug-0.042
Token Sue
Token ID26089
Feature activation-6.23e-01

Pos logit contributions
 department+0.031
 nod+0.030
 walk+0.030
 moment+0.029
 gaze+0.028
Neg logit contributions
 valuable-0.042
ang-0.041
b-0.041
 awesome-0.039
hest-0.039
Token.
Token ID13
Feature activation-3.98e-02

Pos logit contributions
 department+0.014
 nod+0.014
 walk+0.013
 gaze+0.013
 ending+0.012
Neg logit contributions
b-0.016
 valuable-0.016
ang-0.016
cks-0.015
 awesome-0.015
Token It
Token ID632
Feature activation+8.44e-01

Pos logit contributions
 department+0.001
 nod+0.001
 walk+0.001
 moment+0.001
 gaze+0.001
Neg logit contributions
 valuable-0.001
ang-0.001
b-0.001
cks-0.001
irling-0.001
Token is
Token ID318
Feature activation+4.88e+00

Pos logit contributions
ang+0.025
 valuable+0.025
irling+0.024
b+0.024
cks+0.024
Neg logit contributions
 nod-0.016
 department-0.016
 walk-0.015
 hug-0.014
 moment-0.014
Token easy
Token ID2562
Feature activation+7.83e-01

Pos logit contributions
ang+0.141
pill+0.132
years+0.129
dd+0.123
iky+0.122
Neg logit contributions
 nod-0.106
,-0.105
 smile-0.103
 hug-0.103
 moment-0.103
Token and
Token ID290
Feature activation+2.78e+00

Pos logit contributions
 valuable+0.025
b+0.024
ang+0.024
cks+0.023
irling+0.023
Neg logit contributions
 department-0.013
 nod-0.013
 walk-0.013
 gaze-0.012
 moment-0.012
Token fun
Token ID1257
Feature activation+3.29e-01

Pos logit contributions
ang+0.059
ind+0.055
pill+0.055
cks+0.054
irling+0.054
Neg logit contributions
 nod-0.081
 walk-0.079
 moment-0.075
 listen-0.075
 hug-0.075
Token!"
Token ID2474
Feature activation-6.62e-01

Pos logit contributions
 valuable+0.009
b+0.008
ang+0.008
cks+0.008
 awesome+0.008
Neg logit contributions
 department-0.007
 nod-0.007
 walk-0.007
 gaze-0.007
 moment-0.007
Token
Token ID198
Feature activation+2.63e+00

Pos logit contributions
 department+0.017
 walk+0.017
 nod+0.016
 gaze+0.015
 ending+0.015
Neg logit contributions
ang-0.014
 valuable-0.014
b-0.014
cks-0.013
pill-0.013
Token to
Token ID284
Feature activation-3.32e+00

Pos logit contributions
ang+0.029
cks+0.028
irling+0.028
ind+0.027
pill+0.027
Neg logit contributions
 walk-0.037
 nod-0.037
 hug-0.035
 moment-0.035
 department-0.034
Token be
Token ID307
Feature activation+3.05e+00

Pos logit contributions
 department+0.060
 nod+0.052
 walk+0.050
 Lee+0.050
 moment+0.050
Neg logit contributions
 valuable-0.105
b-0.102
ang-0.100
hest-0.099
cks-0.099
Token a
Token ID257
Feature activation+6.11e-01

Pos logit contributions
ang+0.069
pill+0.065
hest+0.064
irling+0.064
cks+0.063
Neg logit contributions
 walk-0.079
 nod-0.079
 department-0.078
 hug-0.074
 moment-0.074
Token mystery
Token ID10715
Feature activation-4.75e-01

Pos logit contributions
 valuable+0.017
ang+0.015
b+0.015
 awesome+0.015
 amazing+0.015
Neg logit contributions
 nod-0.013
 department-0.013
 walk-0.012
 Lee-0.012
ously-0.012
Token.
Token ID13
Feature activation-6.58e-01

Pos logit contributions
 nod+0.010
 department+0.010
 walk+0.009
 gaze+0.009
 Lee+0.009
Neg logit contributions
 valuable-0.013
b-0.012
 amazing-0.012
ang-0.012
 awesome-0.012
Token
Token ID220
Feature activation+2.32e+00

Pos logit contributions
 department+0.015
 walk+0.014
 nod+0.014
 ending+0.013
 moment+0.013
Neg logit contributions
 valuable-0.017
ang-0.016
b-0.016
pill-0.015
hest-0.015
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
 valuable+0.067
b+0.066
ang+0.066
irling+0.065
cks+0.064
Neg logit contributions
 nod-0.044
 department-0.042
 walk-0.042
 moment-0.041
 Lee-0.040
Token
Token ID198
Feature activation+1.26e+00

Pos logit contributions
 valuable+0.036
ang+0.035
b+0.035
cks+0.035
irling+0.035
Neg logit contributions
 nod-0.024
 walk-0.024
 moment-0.023
 department-0.023
 Lee-0.022
TokenOne
Token ID3198
Feature activation-2.73e+00

Pos logit contributions
 valuable+0.026
cks+0.026
ang+0.026
b+0.026
irling+0.025
Neg logit contributions
 walk-0.035
 nod-0.035
 department-0.034
 moment-0.033
 hug-0.032
Token day
Token ID1110
Feature activation-3.14e+00

Pos logit contributions
 department+0.059
 nod+0.056
 Lee+0.055
pit+0.052
 walk+0.051
Neg logit contributions
 valuable-0.077
ang-0.074
b-0.073
 amazing-0.071
 impressive-0.070
Token while
Token ID981
Feature activation-1.68e+00

Pos logit contributions
 department+0.064
pit+0.054
 nod+0.053
 Driver+0.050
 hospitality+0.050
Neg logit contributions
dd-0.087
b-0.087
ang-0.087
 much-0.086
 valuable-0.085
Token sad
Token ID6507
Feature activation-1.28e+00

Pos logit contributions
hest+0.057
 valuable+0.055
pill+0.054
b+0.054
cks+0.053
Neg logit contributions
 nod-0.051
 walk-0.051
 sister-0.050
 Lee-0.049
 department-0.049
Token and
Token ID290
Feature activation+9.15e-01

Pos logit contributions
 department+0.035
 nod+0.033
 Lee+0.032
 walk+0.032
 gaze+0.032
Neg logit contributions
 valuable-0.029
 important-0.026
 amazing-0.026
b-0.025
 impressive-0.025
Token miserable
Token ID22444
Feature activation-1.57e+00

Pos logit contributions
 valuable+0.024
b+0.023
ang+0.022
 awesome+0.022
cks+0.022
Neg logit contributions
 department-0.021
 nod-0.020
 moment-0.019
 walk-0.019
 Lee-0.018
Token.
Token ID13
Feature activation-5.78e-01

Pos logit contributions
 nod+0.042
 department+0.041
 walk+0.038
 gaze+0.038
 hug+0.036
Neg logit contributions
 valuable-0.035
 amazing-0.033
 important-0.032
b-0.031
 much-0.031
Token He
Token ID679
Feature activation+4.43e-01

Pos logit contributions
 nod+0.010
 department+0.010
 walk+0.010
 Lee+0.009
 moment+0.009
Neg logit contributions
b-0.018
 valuable-0.018
ang-0.018
irling-0.017
cks-0.017
Token didn
Token ID1422
Feature activation+1.40e+00

Pos logit contributions
 valuable+0.013
ang+0.013
b+0.013
cks+0.013
irling+0.013
Neg logit contributions
 department-0.008
 nod-0.008
 walk-0.008
 hug-0.007
 moment-0.007
Token't
Token ID470
Feature activation-1.23e+00

Pos logit contributions
ang+0.047
b+0.046
cks+0.046
irling+0.045
 valuable+0.044
Neg logit contributions
 walk-0.022
 department-0.021
 hug-0.020
 nod-0.020
 visit-0.020
Token have
Token ID423
Feature activation+1.58e+00

Pos logit contributions
 department+0.029
 nod+0.028
 walk+0.027
 Lee+0.027
 moment+0.026
Neg logit contributions
 valuable-0.031
b-0.030
ang-0.029
hest-0.028
 awesome-0.028
Token many
Token ID867
Feature activation+3.64e-01

Pos logit contributions
 valuable+0.039
ang+0.039
b+0.039
cks+0.037
irling+0.037
Neg logit contributions
 department-0.037
 nod-0.037
 walk-0.036
 moment-0.034
 hug-0.034
Token friends
Token ID2460
Feature activation+1.42e-01

Pos logit contributions
ang+0.009
 valuable+0.009
b+0.009
cks+0.009
irling+0.009
Neg logit contributions
 department-0.008
 nod-0.008
 walk-0.008
 hug-0.007
 moment-0.007
Token to
Token ID284
Feature activation-3.13e+00

Pos logit contributions
 valuable+0.004
b+0.003
 amazing+0.003
 awesome+0.003
ang+0.003
Neg logit contributions
 department-0.003
 nod-0.003
 gaze-0.003
atively-0.003
 moment-0.003
Token.
Token ID13
Feature activation-2.69e-01

Pos logit contributions
 department+0.007
 nod+0.007
 walk+0.007
 Lee+0.007
 gaze+0.007
Neg logit contributions
 valuable-0.008
b-0.008
cks-0.007
ang-0.007
 amazing-0.007
Token But
Token ID887
Feature activation-1.34e+00

Pos logit contributions
 department+0.006
 walk+0.006
 nod+0.006
 moment+0.005
 hug+0.005
Neg logit contributions
 valuable-0.007
ang-0.007
b-0.007
pill-0.007
hest-0.007
Token he
Token ID339
Feature activation+8.42e-01

Pos logit contributions
 department+0.033
 nod+0.031
 walk+0.030
 Lee+0.029
pit+0.029
Neg logit contributions
 valuable-0.033
b-0.033
ang-0.031
 amazing-0.030
 awesome-0.030
Token kept
Token ID4030
Feature activation+6.17e-02

Pos logit contributions
 valuable+0.023
b+0.023
ang+0.022
irling+0.021
cks+0.021
Neg logit contributions
 department-0.018
 nod-0.017
 walk-0.017
 Lee-0.016
 moment-0.016
Token taking
Token ID2263
Feature activation+4.78e-01

Pos logit contributions
hest+0.002
ang+0.002
 valuable+0.002
pill+0.002
iky+0.002
Neg logit contributions
 walk-0.001
 nod-0.001
 department-0.001
 moment-0.001
 gaze-0.001
Token care
Token ID1337
Feature activation+2.33e+00

Pos logit contributions
ang+0.014
irling+0.013
 valuable+0.013
pill+0.013
b+0.013
Neg logit contributions
 nod-0.010
 walk-0.010
 department-0.009
 moment-0.009
 hug-0.009
Token of
Token ID286
Feature activation+3.55e-01

Pos logit contributions
 valuable+0.053
ang+0.051
b+0.051
cks+0.049
irling+0.048
Neg logit contributions
 department-0.060
 nod-0.059
 walk-0.059
 moment-0.055
 gaze-0.055
Token it
Token ID340
Feature activation-6.38e-01

Pos logit contributions
 valuable+0.008
ang+0.008
b+0.008
cks+0.008
irling+0.007
Neg logit contributions
 department-0.009
 nod-0.009
 walk-0.009
 moment-0.008
 gaze-0.008
Token every
Token ID790
Feature activation-1.09e+00

Pos logit contributions
 department+0.012
 nod+0.011
 walk+0.011
 gaze+0.010
 Lee+0.010
Neg logit contributions
 valuable-0.019
b-0.019
ang-0.018
hest-0.018
 awesome-0.018
Token day
Token ID1110
Feature activation-1.86e+00

Pos logit contributions
 Lee+0.028
 department+0.027
 nod+0.026
pit+0.025
 Driver+0.024
Neg logit contributions
 valuable-0.028
 amazing-0.025
b-0.025
 important-0.024
 impressive-0.024
Token.
Token ID13
Feature activation+6.45e-01

Pos logit contributions
 department+0.033
 nod+0.032
 walk+0.031
 moment+0.029
 gaze+0.029
Neg logit contributions
 valuable-0.057
ang-0.055
b-0.055
cks-0.053
hest-0.053
Token away
Token ID1497
Feature activation-7.17e-01

Pos logit contributions
 department+0.005
 nod+0.005
 gaze+0.005
 ending+0.005
 walk+0.005
Neg logit contributions
 valuable-0.007
b-0.007
ang-0.007
 awesome-0.006
 amazing-0.006
Token together
Token ID1978
Feature activation-1.90e+00

Pos logit contributions
 department+0.016
 nod+0.016
pit+0.015
 gaze+0.015
 walk+0.015
Neg logit contributions
 valuable-0.019
b-0.017
ang-0.017
 amazing-0.016
 impressive-0.016
Token,
Token ID11
Feature activation-2.57e-01

Pos logit contributions
 department+0.045
 Driver+0.041
 guy+0.041
 Lee+0.040
pit+0.039
Neg logit contributions
 valuable-0.049
 amazing-0.044
b-0.043
 immensely-0.043
 much-0.043
Token happy
Token ID3772
Feature activation-1.33e+00

Pos logit contributions
 department+0.007
 nod+0.006
 guy+0.006
 Lee+0.006
 walk+0.006
Neg logit contributions
 valuable-0.005
ang-0.005
b-0.005
irling-0.005
cks-0.005
Token to
Token ID284
Feature activation-1.52e+00

Pos logit contributions
pit+0.034
 department+0.033
 nod+0.032
 Lee+0.032
 garage+0.032
Neg logit contributions
 valuable-0.031
 amazing-0.028
 beautifully-0.027
 impressive-0.027
 incredible-0.026
Token have
Token ID423
Feature activation+1.21e+00

Pos logit contributions
 department+0.033
pit+0.031
 Lee+0.029
 nod+0.027
 walk+0.027
Neg logit contributions
 valuable-0.043
 awesome-0.041
 amazing-0.040
b-0.039
cks-0.039
Token found
Token ID1043
Feature activation+8.02e-01

Pos logit contributions
 valuable+0.032
b+0.031
ang+0.031
cks+0.030
irling+0.030
Neg logit contributions
 department-0.026
 nod-0.026
 walk-0.026
 moment-0.024
 gaze-0.024
Token a
Token ID257
Feature activation+7.55e-01

Pos logit contributions
 valuable+0.018
ang+0.017
b+0.017
cks+0.017
 awesome+0.017
Neg logit contributions
 department-0.021
 walk-0.020
 nod-0.020
 gaze-0.019
 moment-0.019
Token new
Token ID649
Feature activation-8.92e-01

Pos logit contributions
 valuable+0.015
 amazing+0.012
 awesome+0.011
 important+0.011
 impressive+0.011
Neg logit contributions
 Lee-0.024
 nod-0.023
pit-0.023
 department-0.022
atively-0.022
Token friend
Token ID1545
Feature activation-9.12e-01

Pos logit contributions
 department+0.016
 Lee+0.015
 nod+0.015
atively+0.015
 gaze+0.014
Neg logit contributions
 valuable-0.028
 amazing-0.026
 important-0.025
 impressive-0.024
 awesome-0.024
Token.
Token ID13
Feature activation+9.77e-01

Pos logit contributions
 department+0.021
 nod+0.021
 gaze+0.021
 walk+0.020
 ending+0.020
Neg logit contributions
 valuable-0.021
ang-0.021
b-0.020
 amazing-0.020
 awesome-0.019
Token
Token ID198
Feature activation+3.18e+00

Pos logit contributions
 valuable+0.098
b+0.095
cks+0.095
ang+0.095
 awesome+0.093
Neg logit contributions
 nod-0.068
 walk-0.066
 department-0.065
 moment-0.063
 hug-0.062
TokenThe
Token ID464
Feature activation-3.07e-01

Pos logit contributions
 valuable+0.061
cks+0.060
b+0.059
ang+0.059
 awesome+0.057
Neg logit contributions
 nod-0.092
 walk-0.091
 department-0.091
 moment-0.087
 hug-0.085
Token angel
Token ID18304
Feature activation+6.43e-01

Pos logit contributions
 department+0.006
 nod+0.006
 walk+0.006
 Lee+0.006
 gaze+0.006
Neg logit contributions
 valuable-0.009
 amazing-0.008
 awesome-0.008
 impressive-0.008
b-0.008
Token smiled
Token ID13541
Feature activation+3.38e-02

Pos logit contributions
b+0.018
 valuable+0.018
ang+0.017
irling+0.016
 amazing+0.016
Neg logit contributions
 department-0.013
 Lee-0.012
 walk-0.012
 nod-0.012
 moment-0.011
Token.
Token ID13
Feature activation+6.59e-01

Pos logit contributions
 valuable+0.001
b+0.001
ang+0.001
 amazing+0.001
 impressive+0.001
Neg logit contributions
 department-0.001
 walk-0.001
 Lee-0.001
 nod-0.001
 gaze-0.001
Token "
Token ID366
Feature activation+2.58e+00

Pos logit contributions
 valuable+0.020
ang+0.019
b+0.019
cks+0.019
irling+0.019
Neg logit contributions
 department-0.012
 nod-0.012
 walk-0.012
 moment-0.011
 gaze-0.010
TokenThat
Token ID2504
Feature activation+1.31e+00

Pos logit contributions
cks+0.069
irling+0.062
 valuable+0.062
b+0.061
 awesome+0.061
Neg logit contributions
 nod-0.061
 moment-0.060
 walk-0.055
 department-0.053
 gaze-0.053
Token's
Token ID338
Feature activation+3.46e+00

Pos logit contributions
 valuable+0.033
b+0.030
 amazing+0.030
 awesome+0.029
 impressive+0.028
Neg logit contributions
 department-0.032
 walk-0.030
 nod-0.030
 Lee-0.030
pit-0.028
Token a
Token ID257
Feature activation+2.01e+00

Pos logit contributions
ang+0.097
pill+0.096
irling+0.090
iky+0.088
cks+0.088
Neg logit contributions
 nod-0.079
 moment-0.077
 listen-0.076
,-0.074
 ending-0.073
Token good
Token ID922
Feature activation-9.55e-01

Pos logit contributions
 valuable+0.029
ang+0.028
b+0.028
cks+0.026
irling+0.026
Neg logit contributions
 department-0.068
 nod-0.067
 walk-0.067
 moment-0.064
 gaze-0.063
Token idea
Token ID2126
Feature activation-1.18e+00

Pos logit contributions
 department+0.023
 Lee+0.020
pit+0.020
 walk+0.020
oup+0.019
Neg logit contributions
 valuable-0.025
 important-0.024
 amazing-0.023
 impressive-0.022
 incredible-0.021
Token his
Token ID465
Feature activation-6.92e-01

Pos logit contributions
 department+0.003
 nod+0.003
 walk+0.003
 spine+0.003
pit+0.003
Neg logit contributions
b-0.003
 valuable-0.003
ang-0.003
 amazing-0.003
cks-0.003
Token mom
Token ID1995
Feature activation-1.38e+00

Pos logit contributions
 department+0.012
 nod+0.012
 Lee+0.011
 walk+0.011
 gaze+0.011
Neg logit contributions
 valuable-0.021
 amazing-0.020
 awesome-0.020
ang-0.020
b-0.020
Token bought
Token ID5839
Feature activation+5.98e-01

Pos logit contributions
 moment+0.032
 spine+0.031
 department+0.031
 guy+0.029
 Lee+0.029
Neg logit contributions
 amazing-0.030
aw-0.028
years-0.028
eps-0.028
ink-0.027
Token the
Token ID262
Feature activation-8.32e-02

Pos logit contributions
ang+0.016
 valuable+0.016
b+0.016
irling+0.016
cks+0.016
Neg logit contributions
 nod-0.012
 department-0.012
 walk-0.012
 hug-0.012
 Lee-0.011
Token toy
Token ID13373
Feature activation-7.56e-01

Pos logit contributions
 department+0.002
 nod+0.002
 walk+0.002
ously+0.002
 Lee+0.002
Neg logit contributions
 valuable-0.002
 awesome-0.002
 amazing-0.002
 impressive-0.002
b-0.002
Token car
Token ID1097
Feature activation+4.98e-01

Pos logit contributions
 nod+0.016
 walk+0.015
 department+0.015
 gaze+0.015
 moment+0.014
Neg logit contributions
 valuable-0.021
ang-0.020
b-0.020
 amazing-0.019
 awesome-0.019
Token and
Token ID290
Feature activation-6.08e-01

Pos logit contributions
 valuable+0.013
b+0.013
ang+0.013
hest+0.012
 awesome+0.012
Neg logit contributions
 nod-0.011
 department-0.010
 walk-0.010
 gaze-0.010
 moment-0.010
Token went
Token ID1816
Feature activation-1.05e+00

Pos logit contributions
 department+0.013
 nod+0.013
 moment+0.013
 spine+0.012
 gaze+0.012
Neg logit contributions
 valuable-0.016
b-0.015
 amazing-0.015
 awesome-0.015
ang-0.014
Token home
Token ID1363
Feature activation-7.93e-01

Pos logit contributions
 department+0.020
 nod+0.020
 ending+0.018
 moment+0.018
 walk+0.018
Neg logit contributions
 valuable-0.030
b-0.030
ang-0.030
 awesome-0.028
irling-0.028
Token.
Token ID13
Feature activation+4.81e-01

Pos logit contributions
 nod+0.018
 department+0.018
 gaze+0.017
 ending+0.017
 walk+0.017
Neg logit contributions
 valuable-0.020
b-0.019
ang-0.019
irling-0.018
 amazing-0.018
Token Tim
Token ID5045
Feature activation+8.39e-01

Pos logit contributions
b+0.015
 valuable+0.015
ang+0.015
cks+0.015
irling+0.015
Neg logit contributions
 department-0.008
 nod-0.008
 walk-0.008
 Lee-0.007
 hug-0.007
Token.
Token ID13
Feature activation-6.61e-01

Pos logit contributions
 nod+0.018
 department+0.018
 moment+0.016
 sister+0.016
 ending+0.016
Neg logit contributions
ang-0.019
b-0.018
 valuable-0.018
hest-0.018
cks-0.017
Token
Token ID198
Feature activation+1.65e+00

Pos logit contributions
 nod+0.011
 department+0.010
 walk+0.010
 moment+0.010
 hug+0.010
Neg logit contributions
b-0.022
ang-0.021
 valuable-0.021
cks-0.021
irling-0.021
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
b+0.050
 valuable+0.050
ang+0.049
cks+0.049
irling+0.049
Neg logit contributions
 nod-0.028
 walk-0.028
 moment-0.028
 department-0.027
 hug-0.026
TokenAs
Token ID1722
Feature activation-1.34e+00

Pos logit contributions
cks+0.040
b+0.039
ang+0.038
pill+0.037
 awesome+0.037
Neg logit contributions
 walk-0.024
 moment-0.022
 nod-0.021
 department-0.021
 visit-0.020
Token they
Token ID484
Feature activation-2.30e+00

Pos logit contributions
 department+0.020
 nod+0.020
 walk+0.019
 moment+0.017
 gaze+0.017
Neg logit contributions
 valuable-0.045
ang-0.044
b-0.044
cks-0.043
irling-0.043
Token were
Token ID547
Feature activation-3.65e-01

Pos logit contributions
 nod+0.040
 department+0.040
 moment+0.036
 Lee+0.036
 walk+0.036
Neg logit contributions
ang-0.068
b-0.067
 valuable-0.066
cks-0.064
hest-0.063
Token eating
Token ID6600
Feature activation-1.27e+00

Pos logit contributions
 nod+0.008
 department+0.008
 walk+0.008
 moment+0.008
 gaze+0.007
Neg logit contributions
 valuable-0.009
ang-0.009
b-0.009
cks-0.009
irling-0.009
Token,
Token ID11
Feature activation-2.30e+00

Pos logit contributions
 department+0.019
 nod+0.017
 walk+0.016
 gaze+0.015
ously+0.014
Neg logit contributions
 valuable-0.043
 awesome-0.042
 amazing-0.042
ang-0.041
b-0.041
Token Lily
Token ID20037
Feature activation+1.70e-01

Pos logit contributions
 department+0.044
 nod+0.044
 walk+0.042
 gaze+0.039
 hug+0.038
Neg logit contributions
 valuable-0.066
ang-0.064
b-0.063
 awesome-0.062
irling-0.061
Token saw
Token ID2497
Feature activation+1.43e-01

Pos logit contributions
b+0.005
 valuable+0.005
ang+0.005
irling+0.005
ind+0.005
Neg logit contributions
 nod-0.003
 department-0.003
 moment-0.003
 walk-0.003
 hug-0.003
Token a
Token ID257
Feature activation+5.24e-02

Pos logit contributions
 valuable+0.003
b+0.003
ang+0.003
cks+0.003
 awesome+0.003
Neg logit contributions
 department-0.004
 nod-0.004
 walk-0.004
 moment-0.004
 gaze-0.004
Token his
Token ID465
Feature activation-4.53e-01

Pos logit contributions
irling+0.018
 valuable+0.018
ang+0.017
hest+0.017
b+0.016
Neg logit contributions
 Lee-0.009
 walk-0.009
 nod-0.009
 hug-0.009
 gaze-0.008
Token passport
Token ID17981
Feature activation+7.58e-01

Pos logit contributions
 department+0.006
 nod+0.006
 walk+0.006
 moment+0.005
 gaze+0.005
Neg logit contributions
 valuable-0.016
b-0.015
ang-0.015
cks-0.015
irling-0.015
Token and
Token ID290
Feature activation+2.40e-02

Pos logit contributions
 valuable+0.019
b+0.019
ang+0.019
hest+0.018
cks+0.018
Neg logit contributions
 department-0.017
 nod-0.017
 gaze-0.016
 moment-0.016
 ending-0.016
Token runs
Token ID4539
Feature activation-5.75e-03

Pos logit contributions
 valuable+0.001
b+0.001
ang+0.001
cks+0.001
 awesome+0.001
Neg logit contributions
 department-0.001
 nod-0.001
 moment-0.001
 walk-0.001
 ending-0.000
Token away
Token ID1497
Feature activation-2.03e-01

Pos logit contributions
 nod+0.000
 department+0.000
 walk+0.000
 gaze+0.000
 moment+0.000
Neg logit contributions
 valuable-0.000
b-0.000
ang-0.000
cks-0.000
ind-0.000
Token.
Token ID13
Feature activation-5.87e-01

Pos logit contributions
 department+0.004
 nod+0.004
 walk+0.004
 gaze+0.004
 moment+0.004
Neg logit contributions
 valuable-0.006
b-0.006
ang-0.006
cks-0.005
irling-0.005
Token
Token ID198
Feature activation+2.73e+00

Pos logit contributions
 nod+0.011
 department+0.010
 walk+0.010
 Lee+0.010
 gaze+0.010
Neg logit contributions
 valuable-0.018
b-0.018
ang-0.018
irling-0.017
hest-0.017
Token
Token ID198
Feature activation+2.27e+00

Pos logit contributions
 elegant+0.090
 valuable+0.090
b+0.090
ang+0.089
irling+0.089
Neg logit contributions
 nod-0.042
 walk-0.040
 hug-0.040
 Lee-0.036
 visit-0.036
TokenThe
Token ID464
Feature activation-1.04e+00

Pos logit contributions
 valuable+0.060
b+0.057
pill+0.056
 awesome+0.056
cks+0.056
Neg logit contributions
 nod-0.051
 walk-0.051
 department-0.048
 Lee-0.047
 hug-0.047
Token man
Token ID582
Feature activation-8.62e-02

Pos logit contributions
 department+0.012
 nod+0.011
 walk+0.010
 moment+0.009
 Lee+0.009
Neg logit contributions
 valuable-0.039
b-0.038
ang-0.038
 awesome-0.037
cks-0.037
Token is
Token ID318
Feature activation+2.39e+00

Pos logit contributions
 department+0.002
 moment+0.002
 nod+0.002
 ending+0.002
 walk+0.002
Neg logit contributions
cks-0.002
b-0.002
ang-0.002
 valuable-0.002
pill-0.002
Token them
Token ID606
Feature activation-2.20e+00

Pos logit contributions
 valuable+0.007
ang+0.007
b+0.007
cks+0.007
irling+0.007
Neg logit contributions
 department-0.008
 nod-0.008
 walk-0.008
 moment-0.007
 gaze-0.007
Token and
Token ID290
Feature activation-3.91e-01

Pos logit contributions
 department+0.049
 Lee+0.043
pit+0.043
 moment+0.042
 guy+0.042
Neg logit contributions
b-0.055
 valuable-0.055
ang-0.053
 amazing-0.053
cks-0.051
Token was
Token ID373
Feature activation+2.56e+00

Pos logit contributions
 nod+0.007
 department+0.007
 spine+0.007
 walk+0.007
 ending+0.007
Neg logit contributions
b-0.011
 valuable-0.010
ang-0.010
 amazing-0.010
cks-0.010
Token scared
Token ID12008
Feature activation-1.01e+00

Pos logit contributions
 valuable+0.050
ang+0.050
b+0.048
pill+0.048
hest+0.046
Neg logit contributions
 nod-0.074
 department-0.074
 walk-0.072
 hug-0.070
 Lee-0.069
Token.
Token ID13
Feature activation-6.18e-01

Pos logit contributions
 department+0.021
 nod+0.021
 walk+0.021
 gaze+0.020
 moment+0.019
Neg logit contributions
 valuable-0.027
ang-0.026
b-0.026
 amazing-0.025
 awesome-0.025
Token He
Token ID679
Feature activation-9.19e-02

Pos logit contributions
 department+0.011
 nod+0.011
 walk+0.011
 moment+0.010
 Lee+0.010
Neg logit contributions
 valuable-0.018
ang-0.018
b-0.018
cks-0.018
irling-0.018
Token kept
Token ID4030
Feature activation-6.29e-02

Pos logit contributions
 department+0.002
 nod+0.002
 walk+0.002
 moment+0.001
 hug+0.001
Neg logit contributions
 valuable-0.003
ang-0.003
b-0.003
cks-0.003
irling-0.003
Token them
Token ID606
Feature activation-2.86e-01

Pos logit contributions
 walk+0.001
 nod+0.001
 department+0.001
 hug+0.001
 moment+0.001
Neg logit contributions
 valuable-0.002
ang-0.002
b-0.002
hest-0.002
pill-0.002
Token quiet
Token ID5897
Feature activation+1.64e-01

Pos logit contributions
 department+0.008
 nod+0.008
 Lee+0.007
 gaze+0.007
 moment+0.007
Neg logit contributions
 valuable-0.006
b-0.006
ang-0.006
 awesome-0.005
 amazing-0.005
Token with
Token ID351
Feature activation+8.79e-01

Pos logit contributions
 valuable+0.004
b+0.004
 amazing+0.004
ang+0.004
 awesome+0.004
Neg logit contributions
 department-0.004
 nod-0.004
 walk-0.003
 gaze-0.003
 Lee-0.003
Token candy
Token ID18550
Feature activation+8.58e-01

Pos logit contributions
 valuable+0.023
ang+0.022
b+0.022
cks+0.021
irling+0.021
Neg logit contributions
 department-0.020
 nod-0.019
 walk-0.019
 moment-0.018
 hug-0.018
Token.
Token ID13
Feature activation-1.80e-01

Pos logit contributions
 department+0.040
 nod+0.037
 sister+0.035
 ending+0.034
 walk+0.034
Neg logit contributions
b-0.053
ang-0.052
 valuable-0.050
cks-0.049
irling-0.047
Token
Token ID198
Feature activation+3.34e+00

Pos logit contributions
 nod+0.003
 department+0.003
 walk+0.003
 Lee+0.003
 moment+0.003
Neg logit contributions
b-0.005
 valuable-0.005
ang-0.005
cks-0.005
irling-0.005
Token
Token ID198
Feature activation+2.81e+00

Pos logit contributions
 elegant+0.117
b+0.116
ang+0.116
years+0.116
ind+0.115
Neg logit contributions
 nod-0.044
 walk-0.040
 hug-0.039
 moment-0.039
 visit-0.037
TokenThey
Token ID2990
Feature activation-1.40e+00

Pos logit contributions
cks+0.070
 valuable+0.067
ang+0.067
b+0.066
pill+0.064
Neg logit contributions
 nod-0.065
 walk-0.064
 moment-0.061
 department-0.060
 hug-0.057
Token reached
Token ID4251
Feature activation-7.22e-01

Pos logit contributions
 department+0.025
 nod+0.024
 walk+0.022
 moment+0.022
 Lee+0.021
Neg logit contributions
b-0.042
ang-0.042
 valuable-0.042
cks-0.040
irling-0.040
Token the
Token ID262
Feature activation-1.65e+00

Pos logit contributions
 department+0.018
 nod+0.018
 walk+0.017
 moment+0.017
 gaze+0.016
Neg logit contributions
 valuable-0.016
ang-0.016
b-0.016
cks-0.016
 awesome-0.015
Token middle
Token ID3504
Feature activation-7.12e-01

Pos logit contributions
 department+0.029
 walk+0.029
 nod+0.028
 moment+0.026
 hug+0.026
Neg logit contributions
 valuable-0.050
ang-0.050
b-0.049
cks-0.049
irling-0.049
Token of
Token ID286
Feature activation-9.03e-01

Pos logit contributions
 department+0.021
 nod+0.021
 walk+0.020
 Lee+0.020
 gaze+0.020
Neg logit contributions
ang-0.013
b-0.013
 valuable-0.012
cks-0.012
hest-0.012
Token the
Token ID262
Feature activation-9.20e-01

Pos logit contributions
 walk+0.019
 department+0.018
 nod+0.018
 moment+0.017
 Lee+0.017
Neg logit contributions
 valuable-0.025
b-0.025
ang-0.025
hest-0.024
irling-0.024
Token log
Token ID2604
Feature activation-1.82e+00

Pos logit contributions
 walk+0.016
 department+0.015
 moment+0.015
 nod+0.015
 hug+0.014
Neg logit contributions
 valuable-0.029
cks-0.029
ang-0.029
irling-0.028
b-0.028
Token and
Token ID290
Feature activation-9.16e-01

Pos logit contributions
 department+0.038
 sister+0.037
 nod+0.036
 ending+0.034
 guy+0.034
Neg logit contributions
ang-0.047
hest-0.046
b-0.046
 valuable-0.045
cks-0.043
Token they
Token ID484
Feature activation-2.29e+00

Pos logit contributions
 department+0.048
 nod+0.046
 Lee+0.043
 walk+0.042
 hug+0.042
Neg logit contributions
 valuable-0.067
b-0.066
 awesome-0.065
ang-0.065
 amazing-0.065
Token arrived
Token ID5284
Feature activation-8.56e-01

Pos logit contributions
 department+0.030
 nod+0.028
 Lee+0.024
 sister+0.023
 moment+0.023
Neg logit contributions
b-0.079
 valuable-0.079
ang-0.078
 amazing-0.075
hest-0.075
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
 department+0.021
 nod+0.018
 kiss+0.018
 moment+0.018
 gaze+0.017
Neg logit contributions
ang-0.022
 amazing-0.021
 impressive-0.020
b-0.020
 valuable-0.020
Token Joe
Token ID5689
Feature activation+9.55e-01

Pos logit contributions
 department+0.024
 nod+0.022
 walk+0.022
 hug+0.021
 Lee+0.021
Neg logit contributions
b-0.028
 valuable-0.028
 amazing-0.028
ang-0.027
 awesome-0.027
Token ran
Token ID4966
Feature activation-7.79e-01

Pos logit contributions
b+0.030
 valuable+0.030
ang+0.029
cks+0.028
irling+0.028
Neg logit contributions
 department-0.016
 nod-0.016
 walk-0.014
 moment-0.014
 Lee-0.014
Token outside
Token ID2354
Feature activation-2.26e+00

Pos logit contributions
 department+0.016
 nod+0.015
 walk+0.015
 gaze+0.014
 moment+0.014
Neg logit contributions
 valuable-0.022
b-0.021
ang-0.021
 awesome-0.021
hest-0.020
Token.
Token ID13
Feature activation-2.89e-01

Pos logit contributions
 department+0.053
 nod+0.049
 ending+0.046
oup+0.045
pit+0.045
Neg logit contributions
ang-0.054
 valuable-0.053
b-0.051
 amazing-0.050
 impressive-0.049
Token He
Token ID679
Feature activation+6.51e-01

Pos logit contributions
 department+0.006
 walk+0.006
 nod+0.006
 moment+0.006
 party+0.006
Neg logit contributions
 valuable-0.007
ang-0.007
b-0.007
 awesome-0.007
cks-0.007
Token saw
Token ID2497
Feature activation+9.52e-01

Pos logit contributions
 valuable+0.024
ang+0.023
b+0.023
cks+0.023
hest+0.023
Neg logit contributions
 walk-0.008
 department-0.008
 nod-0.008
 moment-0.006
 hug-0.006
Token lots
Token ID6041
Feature activation+2.62e-01

Pos logit contributions
ang+0.020
 valuable+0.020
b+0.020
irling+0.019
cks+0.019
Neg logit contributions
 department-0.025
 nod-0.025
 walk-0.025
 hug-0.024
 moment-0.024
Token of
Token ID286
Feature activation+6.71e-01

Pos logit contributions
cks+0.008
irling+0.008
 valuable+0.008
b+0.007
ind+0.007
Neg logit contributions
 nod-0.004
 walk-0.004
 hug-0.004
 department-0.004
,-0.004
Token
Token ID198
Feature activation+2.01e+00

Pos logit contributions
 department+0.039
 walk+0.036
pit+0.035
 nod+0.034
 ending+0.034
Neg logit contributions
 valuable-0.021
ang-0.021
b-0.019
pill-0.019
cks-0.018
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
 valuable+0.060
b+0.059
ang+0.059
pill+0.057
cks+0.057
Neg logit contributions
 nod-0.037
 walk-0.037
 moment-0.034
 hug-0.034
 department-0.034
TokenHer
Token ID9360
Feature activation-2.15e+00

Pos logit contributions
 valuable+0.034
ang+0.034
b+0.033
cks+0.032
hest+0.032
Neg logit contributions
 nod-0.034
 walk-0.034
 department-0.034
 moment-0.032
 hug-0.032
Token mom
Token ID1995
Feature activation-1.28e+00

Pos logit contributions
 department+0.033
 nod+0.033
 walk+0.032
 gaze+0.032
 moment+0.031
Neg logit contributions
 valuable-0.069
 amazing-0.065
 impressive-0.064
 awesome-0.064
 important-0.063
Token said
Token ID531
Feature activation-1.75e+00

Pos logit contributions
 ending+0.031
 gaze+0.031
 moment+0.030
 spine+0.030
 walk+0.030
Neg logit contributions
b-0.028
ang-0.027
aw-0.026
her-0.026
hes-0.025
Token,
Token ID11
Feature activation-2.32e+00

Pos logit contributions
 department+0.036
pit+0.035
 garage+0.035
 walk+0.034
 driveway+0.034
Neg logit contributions
 valuable-0.047
 much-0.047
ula-0.046
 amazing-0.043
 important-0.043
Token "
Token ID366
Feature activation+1.03e+00

Pos logit contributions
 walk+0.057
 department+0.055
 gaze+0.054
 nod+0.053
pit+0.053
Neg logit contributions
 valuable-0.051
b-0.048
ang-0.048
hes-0.048
ula-0.047
TokenL
Token ID43
Feature activation+1.03e+00

Pos logit contributions
cks+0.030
ink+0.028
 valuable+0.028
 awesome+0.027
irling+0.027
Neg logit contributions
 moment-0.021
 nod-0.020
 walk-0.020
 department-0.020
 girl-0.019
Tokenily
Token ID813
Feature activation-2.29e+00

Pos logit contributions
ang+0.019
 valuable+0.019
b+0.018
hest+0.018
cks+0.017
Neg logit contributions
 nod-0.031
 department-0.031
 walk-0.030
 gaze-0.029
 moment-0.029
Token,
Token ID11
Feature activation-3.26e+00

Pos logit contributions
 gaze+0.038
 department+0.038
atively+0.037
 walk+0.036
 nod+0.036
Neg logit contributions
 valuable-0.069
 important-0.065
ink-0.065
ang-0.065
b-0.064
Token cartoons
Token ID30070
Feature activation-1.95e+00

Pos logit contributions
 department+0.097
 nod+0.092
 walk+0.090
pit+0.089
 moment+0.088
Neg logit contributions
 valuable-0.060
 awesome-0.057
b-0.055
 amazing-0.055
ang-0.054
Token fun
Token ID1257
Feature activation-1.39e+00

Pos logit contributions
ang+0.065
b+0.059
cks+0.057
ind+0.054
irling+0.053
Neg logit contributions
 walk-0.085
 nod-0.083
 department-0.081
 party-0.080
 hug-0.080
Token.
Token ID13
Feature activation-1.91e-01

Pos logit contributions
 department+0.031
 nod+0.030
 walk+0.029
 Lee+0.028
 gaze+0.028
Neg logit contributions
 valuable-0.037
ang-0.036
b-0.036
irling-0.035
cks-0.035
Token
Token ID198
Feature activation+2.86e+00

Pos logit contributions
 department+0.004
 nod+0.004
 walk+0.004
 moment+0.004
 hug+0.003
Neg logit contributions
ang-0.005
b-0.005
 valuable-0.005
cks-0.005
irling-0.005
Token
Token ID198
Feature activation+2.36e+00

Pos logit contributions
pill+0.097
ang+0.096
 elegant+0.095
b+0.095
ender+0.095
Neg logit contributions
 moment-0.040
 walk-0.037
 nod-0.037
 visit-0.035
 hug-0.033
TokenAfter
Token ID3260
Feature activation-5.99e-01

Pos logit contributions
cks+0.060
ang+0.060
pill+0.058
ocol+0.056
b+0.054
Neg logit contributions
 walk-0.056
 moment-0.053
 visit-0.048
 nod-0.048
 listen-0.048
Token playing
Token ID2712
Feature activation-2.64e+00

Pos logit contributions
 department+0.015
 nod+0.015
 walk+0.014
 moment+0.013
 gaze+0.013
Neg logit contributions
 valuable-0.014
ang-0.013
b-0.013
irling-0.013
 awesome-0.013
Token,
Token ID11
Feature activation-2.31e+00

Pos logit contributions
 department+0.065
 gaze+0.062
 nod+0.062
 walk+0.060
 spine+0.056
Neg logit contributions
 amazing-0.060
ang-0.060
 valuable-0.059
 awesome-0.058
 impressive-0.057
Token the
Token ID262
Feature activation-1.42e+00

Pos logit contributions
 nod+0.054
 department+0.052
 gaze+0.049
 walk+0.049
ously+0.047
Neg logit contributions
 valuable-0.057
 amazing-0.053
 awesome-0.053
ang-0.053
b-0.053
Token boy
Token ID2933
Feature activation-5.97e-01

Pos logit contributions
ously+0.022
 Lee+0.021
 department+0.021
gy+0.020
 gaze+0.019
Neg logit contributions
 valuable-0.048
 amazing-0.047
 impressive-0.047
 incredible-0.046
 awesome-0.046
Token thanked
Token ID26280
Feature activation-1.69e+00

Pos logit contributions
 moment+0.013
 nod+0.013
 department+0.013
 gaze+0.013
 Lee+0.012
Neg logit contributions
 valuable-0.015
b-0.014
 amazing-0.014
ang-0.013
 impressive-0.013
Token Lily
Token ID20037
Feature activation-2.32e+00

Pos logit contributions
 department+0.039
 nod+0.038
pit+0.038
 walk+0.037
 gaze+0.036
Neg logit contributions
 valuable-0.041
 awesome-0.038
 amazing-0.038
ind-0.038
ang-0.037
Token day
Token ID1110
Feature activation-2.77e+00

Pos logit contributions
 department+0.048
 nod+0.043
 Lee+0.041
 walk+0.041
 gaze+0.039
Neg logit contributions
 valuable-0.066
ang-0.066
b-0.064
 awesome-0.061
 amazing-0.060
Token,
Token ID11
Feature activation-3.22e+00

Pos logit contributions
 department+0.046
 nod+0.039
 garage+0.038
 walk+0.037
pit+0.036
Neg logit contributions
 valuable-0.085
 much-0.083
b-0.083
ang-0.083
dd-0.082
Token they
Token ID484
Feature activation-1.48e+00

Pos logit contributions
 department+0.078
 nod+0.070
 Driver+0.069
eman+0.068
ously+0.067
Neg logit contributions
 valuable-0.073
 awesome-0.071
 amazing-0.071
hes-0.069
 impressive-0.069
Token see
Token ID766
Feature activation-5.99e-01

Pos logit contributions
 department+0.029
 moment+0.026
ously+0.025
 nod+0.024
 Lee+0.024
Neg logit contributions
 valuable-0.041
ang-0.040
 amazing-0.040
hest-0.040
 awesome-0.040
Token a
Token ID257
Feature activation-1.71e-01

Pos logit contributions
 department+0.016
 nod+0.016
 walk+0.016
 moment+0.015
 gaze+0.015
Neg logit contributions
 valuable-0.013
ang-0.012
b-0.012
cks-0.012
irling-0.012
Token big
Token ID1263
Feature activation-3.00e+00

Pos logit contributions
 department+0.005
 nod+0.005
 walk+0.005
 moment+0.005
 gaze+0.005
Neg logit contributions
 valuable-0.003
ang-0.003
b-0.003
cks-0.003
irling-0.003
Token cage
Token ID17186
Feature activation+6.07e-01

Pos logit contributions
 department+0.061
ously+0.056
 nod+0.054
atively+0.051
 walk+0.049
Neg logit contributions
ang-0.086
 valuable-0.084
 amazing-0.082
 awesome-0.082
 impressive-0.082
Token in
Token ID287
Feature activation-1.68e+00

Pos logit contributions
 valuable+0.014
ang+0.014
b+0.014
cks+0.014
hest+0.013
Neg logit contributions
 department-0.015
 nod-0.015
 walk-0.014
 sister-0.014
 Lee-0.014
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
 walk+0.034
 department+0.033
 nod+0.033
 moment+0.032
 hug+0.031
Neg logit contributions
 valuable-0.048
ang-0.048
b-0.047
irling-0.046
pill-0.045
Token park
Token ID3952
Feature activation-1.56e+00

Pos logit contributions
 department+0.023
 nod+0.023
 walk+0.022
 moment+0.021
 gaze+0.020
Neg logit contributions
 valuable-0.034
ang-0.034
b-0.034
cks-0.033
hest-0.032
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
 department+0.036
 nod+0.033
 walk+0.031
 sister+0.031
 hug+0.030
Neg logit contributions
ang-0.040
 valuable-0.040
b-0.040
cks-0.039
hest-0.037
Token the
Token ID262
Feature activation+2.44e-01

Pos logit contributions
 nod+0.033
 department+0.033
 hug+0.030
 gaze+0.030
 walk+0.030
Neg logit contributions
 amazing-0.032
b-0.032
 valuable-0.032
 awesome-0.031
 impressive-0.031
Token day
Token ID1110
Feature activation-1.93e+00

Pos logit contributions
 valuable+0.006
ang+0.006
b+0.006
cks+0.006
irling+0.006
Neg logit contributions
 department-0.005
 nod-0.005
 walk-0.005
 moment-0.005
 hug-0.005
Token of
Token ID286
Feature activation-2.48e-02

Pos logit contributions
 nod+0.044
 department+0.043
 walk+0.042
 Lee+0.041
 gaze+0.040
Neg logit contributions
 valuable-0.048
b-0.048
ang-0.046
 amazing-0.045
 impressive-0.044
Token Tim
Token ID5045
Feature activation-6.52e-01

Pos logit contributions
 department+0.001
 nod+0.001
 walk+0.001
 gaze+0.001
 moment+0.001
Neg logit contributions
ang-0.001
b-0.001
 valuable-0.001
cks-0.001
 awesome-0.001
Token's
Token ID338
Feature activation-1.65e+00

Pos logit contributions
 nod+0.014
 department+0.014
 moment+0.013
 Lee+0.012
 ending+0.012
Neg logit contributions
b-0.017
 valuable-0.016
ang-0.016
ink-0.016
irling-0.015
Token birthday
Token ID10955
Feature activation-2.68e+00

Pos logit contributions
 department+0.025
 nod+0.024
 Lee+0.023
 walk+0.023
 moment+0.022
Neg logit contributions
 valuable-0.055
b-0.052
ang-0.051
 amazing-0.051
 impressive-0.050
Token came
Token ID1625
Feature activation-2.24e-01

Pos logit contributions
 nod+0.049
 gaze+0.044
 department+0.042
 walk+0.041
ously+0.040
Neg logit contributions
 valuable-0.078
 amazing-0.077
 impressive-0.075
 incredible-0.073
 important-0.073
Token.
Token ID13
Feature activation-6.28e-01

Pos logit contributions
 nod+0.004
 department+0.004
 walk+0.004
 Lee+0.004
 gaze+0.004
Neg logit contributions
 valuable-0.006
b-0.006
ang-0.006
 awesome-0.006
hest-0.006
Token Tim
Token ID5045
Feature activation+6.64e-01

Pos logit contributions
 department+0.012
 walk+0.012
 nod+0.012
 moment+0.011
 gaze+0.011
Neg logit contributions
 valuable-0.018
ang-0.017
b-0.017
 awesome-0.017
hest-0.017
Token and
Token ID290
Feature activation-8.70e-01

Pos logit contributions
b+0.017
 valuable+0.016
ang+0.016
ink+0.015
 amazing+0.015
Neg logit contributions
 nod-0.014
 department-0.014
 moment-0.014
 ending-0.013
 walk-0.013
Token his
Token ID465
Feature activation-1.27e+00

Pos logit contributions
 department+0.026
 nod+0.025
pit+0.025
atively+0.024
 walk+0.024
Neg logit contributions
 valuable-0.016
b-0.015
 impressive-0.014
 amazing-0.014
 incredible-0.013