TOP ACTIVATIONS
MAX = 0.056

room . Lily thought it smelled disgusting ! She wrink led
kin ! The Royal Fix er said that the
sip . He thought it tasted sweet , so he kept
When the fisherman landed , he knew he
too ?" Her pap a smiled and said ,
. When the farmer was done , the cart
pastry . She thought it tasted so good . She was
away . M acy looked around her clean
special snap . When the farmer snapped the fire fly 's
long . He loved the deep sounds it made .
red flower . When his wife came home , he put
When the farmer finally found her , it
look like . When the witch was done stirring , the
amazed ! Even though the hive was hidden , she was
Whenever the fox prints a picture , the
. After the quiz , they continued their walk
. When their mum saw what the two had
stand . He thought it smelled delicious , so he went
a lesson . As the farmer approached , the goat panicked
! When the fisherman came back to the boat

MIN ACTIVATIONS
MIN = -0.075

mom m ies and d add ies when they say things
squ aw ked and w add led around , enjoying the
mom m ies and d add ies . Lily and Ben
had solved her problem and salv aged her l imes .
the little girl and w add led over to her .
pond with many ducks and sw ans . Sara liked the
in the water and w add led on the land .
was so excited and w add led over to see what
he qu acked and w add led all around the yard
pond with many ducks and sw ans . Lily liked the
mom m ies and d add ies , who cl apped
a small yellow duck w add led around the pond looking
to the billboard and w add led away , feeling content
of m ummies , d add ies and children . The
girl 's shoulder and w add led over to see the
mom m ies and d add ies . They were all
some bread to feed the sw an ?" Her
mom m ies and d add ies said , " Today
curious , so he w add led over to take a
and Pupp y and w add led home . Mama Duck

INTERVAL 0.023 to 0.056
CONTAINS 1.366%

. Then his face lit up with excitement . " Ice
to admire it . It was loved by everyone who saw
his eyes tightly . When he opened them , he saw
big rail and thought it would be funny to hide behind
mouse . But you have to be careful , because you

INTERVAL -0.009 to 0.023
CONTAINS 81.889%

to her home . He wanted to help the p anda
Sara played with her makeup . She said it was not
kid named Sam came to play . Sam was not very
Days went by and Lily spent a lot of time with
at a time ." The cater pillar thanked the

INTERVAL -0.042 to -0.009
CONTAINS 16.734%

bright . It can hurt our eyes ," Ben says .
It looks very dirty . We don 't want to get
Conf used , they started arguing about what to
. They smile too . They are happy .
strong and could send the red ball to the mattress .

INTERVAL -0.075 to -0.042
CONTAINS 0.011%

sad and asked " Where are you going ?" Mom my
O w , ow , ow !" she cries . "
sorry ," the wild er be ast said . " But
played in the sand and sw am in the sea .
dolphin saw what happened and sw am over to help .
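
The four INTERVAL headers above appear to split the range from MIN to MAX into equal-width buckets, and the CONTAINS percentages appear to cover all activations between them. The sketch below checks both readings against the numbers listed above. It assumes four equal-width bins; the dashboard's actual binning code is not shown in this dump, and `interval_edges` is a hypothetical helper written only for this check.

```python
# Sketch: compare the listed INTERVAL edges with equal-width bins over [MIN, MAX].
# Assumption: four equal-width bins. The dashboard's real binning logic is not
# part of this dump, so this is only a consistency check on the printed numbers.

def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi (hypothetical helper)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# Values copied from the MIN/MAX and INTERVAL/CONTAINS headers above.
MIN_ACT, MAX_ACT = -0.075, 0.056
listed_edges = [-0.075, -0.042, -0.009, 0.023, 0.056]
shares = [0.011, 16.734, 81.889, 1.366]  # percentages, lowest interval first

edges = interval_edges(MIN_ACT, MAX_ACT)

# Largest difference between a computed edge and the edge printed in the dump.
max_edge_gap = max(abs(a - b) for a, b in zip(edges, listed_edges))

# The four buckets should account for all sampled activations.
total_share = sum(shares)

print("computed edges:", edges)
print("largest gap to listed edges:", max_edge_gap)
print("sum of shares:", total_share)
```

Because MIN and MAX are themselves rounded to three decimals, a gap of a fraction of a thousandth between computed and listed edges is expected and does not contradict the equal-width reading.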
Token room
Token ID2119
Feature activation-1.06e-02

Pos logit contributions
 spine+0.021
hy+0.019
ses+0.018
ro+0.018
sed+0.016
Neg logit contributions
ull-0.015
Mom-0.014
 them-0.014
in-0.013
ins-0.013
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
 them+0.009
ull+0.008
Mom+0.008
ups+0.008
Joe+0.008
Neg logit contributions
 spine-0.028
ses-0.026
ro-0.024
hy-0.024
sed-0.023
Token Lily
Token ID20037
Feature activation+6.70e-03

Pos logit contributions
 spine+0.028
ro+0.024
hy+0.024
ses+0.023
 grade+0.021
Neg logit contributions
ull-0.022
Joe-0.022
Mom-0.021
 them-0.020
add-0.019
Token thought
Token ID1807
Feature activation+1.80e-03

Pos logit contributions
 spine+0.014
ro+0.011
hy+0.011
ses+0.011
 grade+0.010
Neg logit contributions
ull-0.010
Joe-0.010
Mom-0.009
 them-0.009
ins-0.009
Token it
Token ID340
Feature activation+3.21e-02

Pos logit contributions
 spine+0.005
ro+0.004
hy+0.004
ses+0.004
 grade+0.004
Neg logit contributions
ull-0.002
Joe-0.002
 them-0.001
Mom-0.001
Jess-0.001
Token smelled
Token ID37609
Feature activation+5.64e-02

Pos logit contributions
 spine+0.069
ro+0.059
hy+0.058
ses+0.056
 grade+0.052
Neg logit contributions
ull-0.045
Joe-0.043
Mom-0.041
 them-0.040
in-0.039
Token disgusting
Token ID23374
Feature activation+1.90e-02

Pos logit contributions
 spine+0.144
ro+0.126
ses+0.124
hy+0.120
 grade+0.112
Neg logit contributions
ull-0.059
Joe-0.056
 them-0.055
Mom-0.052
in-0.048
Token!
Token ID0
Feature activation+1.97e-03

Pos logit contributions
 spine+0.039
ro+0.033
hy+0.033
ses+0.032
 grade+0.029
Neg logit contributions
ull-0.028
Joe-0.027
 them-0.025
Mom-0.025
ins-0.025
Token She
Token ID1375
Feature activation+3.42e-03

Pos logit contributions
 spine+0.003
ro+0.003
hy+0.002
ses+0.002
 Whe+0.002
Neg logit contributions
ull-0.004
agles-0.004
Joe-0.004
in-0.004
eper-0.004
Token wrink
Token ID34866
Feature activation+1.39e-02

Pos logit contributions
 spine+0.006
ro+0.005
hy+0.005
ses+0.005
 mun+0.004
Neg logit contributions
ull-0.006
Joe-0.006
agles-0.005
ins-0.005
Jess-0.005
Tokenled
Token ID992
Feature activation+1.43e-02

Pos logit contributions
 spine+0.028
ro+0.024
hy+0.023
ses+0.022
 grade+0.021
Neg logit contributions
ull-0.021
Joe-0.021
Mom-0.019
 them-0.019
ins-0.019
Tokenkin
Token ID5116
Feature activation-1.21e-03

Pos logit contributions
 spine+0.030
ro+0.027
hy+0.026
ses+0.025
 grade+0.024
Neg logit contributions
ull-0.015
Joe-0.014
Mom-0.013
 them-0.012
ins-0.012
Token!
Token ID0
Feature activation+5.34e-03

Pos logit contributions
ull+0.002
Joe+0.002
 them+0.002
Mom+0.002
in+0.002
Neg logit contributions
 spine-0.002
ro-0.002
hy-0.002
ses-0.002
 grade-0.002
Token
Token ID198
Feature activation+6.74e-03

Pos logit contributions
 spine+0.010
ro+0.009
hy+0.008
ses+0.008
 grade+0.008
Neg logit contributions
ull-0.009
Joe-0.008
Mom-0.008
 them-0.008
Jess-0.008
Token
Token ID198
Feature activation+5.65e-03

Pos logit contributions
 spine+0.012
ro+0.011
hy+0.011
ses+0.010
 grade+0.009
Neg logit contributions
ull-0.010
Joe-0.010
ins-0.010
agles-0.009
Mom-0.009
TokenThe
Token ID464
Feature activation+1.54e-02

Pos logit contributions
 spine+0.010
ro+0.009
hy+0.009
 grade+0.008
WR+0.008
Neg logit contributions
ull-0.008
add-0.008
ins-0.008
Joe-0.007
idy-0.007
Token Royal
Token ID8111
Feature activation+5.52e-02

Pos logit contributions
 spine+0.035
ro+0.032
ses+0.030
hy+0.030
 grade+0.027
Neg logit contributions
ull-0.019
Joe-0.017
Mom-0.017
 them-0.016
agles-0.016
Token Fix
Token ID13268
Feature activation+1.39e-02

Pos logit contributions
 spine+0.106
ro+0.089
hy+0.087
ses+0.085
 grade+0.078
Neg logit contributions
ull-0.089
Joe-0.087
Mom-0.081
Jess-0.079
 them-0.079
Tokener
Token ID263
Feature activation+1.07e-02

Pos logit contributions
 spine+0.029
ro+0.025
hy+0.024
ses+0.023
 grade+0.022
Neg logit contributions
ull-0.020
Joe-0.020
Mom-0.018
 them-0.018
Jess-0.018
Token said
Token ID531
Feature activation+4.11e-03

Pos logit contributions
 spine+0.019
ro+0.017
hy+0.016
ses+0.015
 grade+0.014
Neg logit contributions
ull-0.018
Joe-0.017
Mom-0.016
add-0.016
Jess-0.016
Token that
Token ID326
Feature activation+4.20e-04

Pos logit contributions
 spine+0.007
ro+0.006
hy+0.006
ses+0.006
 grade+0.006
Neg logit contributions
ull-0.007
Joe-0.007
Mom-0.006
ins-0.006
add-0.006
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
 spine+0.001
ro+0.001
hy+0.001
ses+0.001
 grade+0.001
Neg logit contributions
ull-0.001
Joe-0.001
 them-0.000
Mom-0.000
Jess-0.000
Token sip
Token ID31145
Feature activation-1.25e-02

Pos logit contributions
 spine+0.008
ses+0.007
hy+0.007
ro+0.007
sed+0.006
Neg logit contributions
ull-0.006
in-0.006
 them-0.006
Joe-0.005
 rad-0.005
Token.
Token ID13
Feature activation+3.38e-03

Pos logit contributions
ull+0.018
Joe+0.017
Mom+0.016
Jess+0.015
 them+0.015
Neg logit contributions
 spine-0.026
ro-0.023
hy-0.022
ses-0.021
 grade-0.020
Token He
Token ID679
Feature activation-4.59e-04

Pos logit contributions
 spine+0.006
ro+0.005
hy+0.005
 Sure+0.004
ses+0.004
Neg logit contributions
ull-0.006
agles-0.006
in-0.006
ins-0.006
eper-0.006
Token thought
Token ID1807
Feature activation-2.84e-03

Pos logit contributions
ull+0.001
Joe+0.001
Mom+0.001
 them+0.001
in+0.001
Neg logit contributions
 spine-0.001
hy-0.001
ro-0.001
ses-0.001
 grade-0.001
Token it
Token ID340
Feature activation+1.97e-02

Pos logit contributions
ull+0.004
Joe+0.004
 them+0.003
in+0.003
Mom+0.003
Neg logit contributions
 spine-0.006
ro-0.006
hy-0.006
ses-0.005
 grade-0.005
Token tasted
Token ID29187
Feature activation+5.48e-02

Pos logit contributions
 spine+0.045
ro+0.038
hy+0.038
ses+0.036
 grade+0.034
Neg logit contributions
ull-0.026
Joe-0.025
 them-0.023
Mom-0.023
in-0.022
Token sweet
Token ID6029
Feature activation+2.20e-02

Pos logit contributions
 spine+0.116
ro+0.101
hy+0.099
ses+0.095
 grade+0.090
Neg logit contributions
ull-0.076
Joe-0.073
Mom-0.067
Jess-0.065
add-0.065
Token,
Token ID11
Feature activation+1.21e-03

Pos logit contributions
 spine+0.045
hy+0.038
ses+0.037
ro+0.036
sed+0.033
Neg logit contributions
ull-0.034
Joe-0.033
 them-0.032
in-0.031
ins-0.031
Token so
Token ID523
Feature activation-4.99e-03

Pos logit contributions
 spine+0.002
hy+0.002
ses+0.002
ro+0.002
 grade+0.002
Neg logit contributions
 them-0.002
ull-0.002
Joe-0.002
Mom-0.002
 decided-0.002
Token he
Token ID339
Feature activation-1.37e-02

Pos logit contributions
ull+0.007
Joe+0.007
ins+0.007
Mom+0.007
agles+0.007
Neg logit contributions
 spine-0.010
hy-0.009
ro-0.009
ses-0.008
 grade-0.008
Token kept
Token ID4030
Feature activation-3.85e-03

Pos logit contributions
ull+0.022
Joe+0.021
Mom+0.020
Jess+0.020
agles+0.019
Neg logit contributions
 spine-0.025
ro-0.022
hy-0.021
ses-0.020
 grade-0.019
Token
Token ID220
Feature activation-3.80e-03

Pos logit contributions
ull+0.001
Joe+0.001
Mom+0.001
 them+0.001
add+0.001
Neg logit contributions
 spine-0.001
hy-0.001
ro-0.001
ses-0.001
 grade-0.001
Token
Token ID198
Feature activation+1.28e-02

Pos logit contributions
ull+0.007
Joe+0.006
agles+0.006
Jess+0.006
ins+0.006
Neg logit contributions
 spine-0.006
ro-0.005
hy-0.005
ses-0.005
 grade-0.004
Token
Token ID198
Feature activation+4.14e-03

Pos logit contributions
 spine+0.023
ro+0.020
hy+0.020
ses+0.018
 grade+0.017
Neg logit contributions
ull-0.021
Joe-0.020
agles-0.018
ins-0.018
Mom-0.018
TokenWhen
Token ID2215
Feature activation-1.10e-02

Pos logit contributions
 spine+0.009
ro+0.007
hy+0.007
ses+0.007
 grade+0.007
Neg logit contributions
ull-0.006
Joe-0.006
Mom-0.005
 them-0.005
add-0.005
Token the
Token ID262
Feature activation-4.31e-03

Pos logit contributions
ull+0.017
Joe+0.016
ins+0.015
Mom+0.015
add+0.015
Neg logit contributions
 spine-0.022
ro-0.018
hy-0.018
ses-0.017
 grade-0.016
Token fisherman
Token ID48750
Feature activation+5.45e-02

Pos logit contributions
ull+0.005
Joe+0.005
ins+0.005
Jess+0.005
Mom+0.005
Neg logit contributions
 spine-0.010
ro-0.008
hy-0.008
ses-0.008
 grade-0.007
Token landed
Token ID11406
Feature activation+1.20e-02

Pos logit contributions
 spine+0.114
ro+0.097
hy+0.096
ses+0.095
 grade+0.088
Neg logit contributions
ull-0.082
Joe-0.077
 them-0.071
Mom-0.069
in-0.068
Token,
Token ID11
Feature activation-1.16e-02

Pos logit contributions
 spine+0.024
ro+0.021
hy+0.020
ses+0.019
 grade+0.018
Neg logit contributions
ull-0.018
Joe-0.018
Mom-0.016
Jess-0.016
 them-0.016
Token he
Token ID339
Feature activation-5.12e-04

Pos logit contributions
ull+0.017
Joe+0.016
Mom+0.015
ins+0.015
add+0.015
Neg logit contributions
 spine-0.024
ro-0.020
hy-0.020
ses-0.019
 grade-0.018
Token knew
Token ID2993
Feature activation-6.28e-03

Pos logit contributions
ull+0.001
Joe+0.001
 them+0.001
Mom+0.001
in+0.001
Neg logit contributions
 spine-0.001
hy-0.001
ro-0.001
ses-0.001
 grade-0.001
Token he
Token ID339
Feature activation+9.46e-03

Pos logit contributions
ull+0.010
Mom+0.010
Joe+0.010
ins+0.010
add+0.009
Neg logit contributions
 spine-0.011
ro-0.009
hy-0.009
ses-0.009
 grade-0.008
Token too
Token ID1165
Feature activation-1.97e-03

Pos logit contributions
 spine+0.003
ses+0.002
ro+0.002
hy+0.002
 grade+0.002
Neg logit contributions
ull-0.001
Joe-0.001
 them-0.001
in-0.001
Mom-0.001
Token?"
Token ID1701
Feature activation+8.41e-03

Pos logit contributions
ull+0.002
Joe+0.002
Mom+0.002
 them+0.002
Jess+0.002
Neg logit contributions
 spine-0.005
ro-0.004
hy-0.004
ses-0.004
 grade-0.004
Token
Token ID198
Feature activation+2.17e-03

Pos logit contributions
 spine+0.019
hy+0.017
ro+0.017
ses+0.016
sing+0.014
Neg logit contributions
ull-0.011
Joe-0.011
 them-0.011
Mom-0.010
Jess-0.010
Token
Token ID198
Feature activation+8.44e-04

Pos logit contributions
 spine+0.004
ro+0.003
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.004
Joe-0.004
ins-0.003
add-0.003
agles-0.003
TokenHer
Token ID9360
Feature activation+6.60e-03

Pos logit contributions
 spine+0.002
ro+0.002
hy+0.002
 grade+0.001
 lick+0.001
Neg logit contributions
ull-0.001
add-0.001
Joe-0.001
ins-0.001
 them-0.001
Token pap
Token ID20461
Feature activation+5.41e-02

Pos logit contributions
 spine+0.017
ro+0.015
hy+0.015
ses+0.014
 grade+0.014
Neg logit contributions
ull-0.006
Joe-0.006
Mom-0.005
 them-0.005
add-0.005
Tokena
Token ID64
Feature activation+1.88e-02

Pos logit contributions
 spine+0.144
ro+0.128
hy+0.125
ses+0.123
 grade+0.117
Neg logit contributions
ull-0.050
Joe-0.046
Mom-0.040
in-0.040
ins-0.038
Token smiled
Token ID13541
Feature activation+2.68e-02

Pos logit contributions
 spine+0.035
ro+0.030
hy+0.029
ses+0.028
 grade+0.026
Neg logit contributions
ull-0.031
Joe-0.030
Mom-0.028
 them-0.028
Jess-0.027
Token and
Token ID290
Feature activation+9.50e-03

Pos logit contributions
 spine+0.042
ro+0.035
hy+0.034
ses+0.032
 grade+0.029
Neg logit contributions
ull-0.052
Joe-0.050
Mom-0.048
 them-0.047
in-0.046
Token said
Token ID531
Feature activation+1.38e-03

Pos logit contributions
 spine+0.019
ro+0.016
hy+0.015
ses+0.015
 grade+0.014
Neg logit contributions
ull-0.015
Joe-0.015
Mom-0.013
 them-0.013
Jess-0.013
Token,
Token ID11
Feature activation+8.33e-03

Pos logit contributions
 spine+0.002
hy+0.002
ro+0.002
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.002
Joe-0.002
Mom-0.002
add-0.002
agles-0.002
Token.
Token ID13
Feature activation+4.89e-03

Pos logit contributions
ull+0.011
Joe+0.011
Mom+0.010
 them+0.010
ins+0.010
Neg logit contributions
 spine-0.016
ro-0.014
hy-0.013
ses-0.013
 grade-0.012
Token
Token ID198
Feature activation+1.15e-02

Pos logit contributions
 spine+0.009
ro+0.008
hy+0.007
ses+0.007
 grade+0.007
Neg logit contributions
ull-0.008
Joe-0.008
Mom-0.008
 them-0.008
add-0.007
Token
Token ID198
Feature activation+6.92e-03

Pos logit contributions
 spine+0.026
ro+0.023
hy+0.022
ses+0.022
 grade+0.021
Neg logit contributions
ull-0.015
Joe-0.015
Mom-0.014
 them-0.013
Jess-0.013
TokenWhen
Token ID2215
Feature activation-7.12e-03

Pos logit contributions
 spine+0.015
ro+0.013
hy+0.013
ses+0.013
 grade+0.012
Neg logit contributions
ull-0.009
Joe-0.009
Mom-0.008
 them-0.008
Jess-0.008
Token the
Token ID262
Feature activation-1.32e-03

Pos logit contributions
ull+0.009
Joe+0.008
 them+0.008
Mom+0.008
Jess+0.007
Neg logit contributions
 spine-0.017
ro-0.015
hy-0.015
ses-0.014
 grade-0.013
Token farmer
Token ID18739
Feature activation+5.38e-02

Pos logit contributions
ull+0.002
Joe+0.002
in+0.001
ins+0.001
Jess+0.001
Neg logit contributions
 spine-0.003
ro-0.003
hy-0.003
ses-0.003
 lick-0.002
Token was
Token ID373
Feature activation+5.98e-03

Pos logit contributions
 spine+0.110
ro+0.092
hy+0.091
ses+0.090
 grade+0.082
Neg logit contributions
ull-0.082
Joe-0.080
Mom-0.074
 them-0.073
in-0.072
Token done
Token ID1760
Feature activation+1.46e-02

Pos logit contributions
 spine+0.012
ro+0.011
ses+0.010
hy+0.010
 grade+0.009
Neg logit contributions
ull-0.009
Joe-0.008
Mom-0.008
 them-0.008
add-0.008
Token,
Token ID11
Feature activation-8.63e-03

Pos logit contributions
 spine+0.029
ro+0.025
hy+0.024
ses+0.023
 grade+0.022
Neg logit contributions
ull-0.022
Joe-0.022
Mom-0.020
Jess-0.019
ins-0.019
Token the
Token ID262
Feature activation+3.80e-03

Pos logit contributions
ull+0.012
Joe+0.012
Mom+0.011
 them+0.011
Jess+0.010
Neg logit contributions
 spine-0.019
ro-0.016
hy-0.016
ses-0.015
 grade-0.014
Token cart
Token ID6383
Feature activation+1.31e-02

Pos logit contributions
 spine+0.008
ro+0.007
hy+0.007
ses+0.007
 lick+0.006
Neg logit contributions
ull-0.005
Joe-0.005
irs-0.004
ades-0.004
agles-0.004
Token pastry
Token ID41738
Feature activation+1.12e-02

Pos logit contributions
 spine+0.002
ro+0.002
hy+0.002
ses+0.001
 grade+0.001
Neg logit contributions
ull-0.002
Joe-0.002
Mom-0.002
 them-0.002
in-0.002
Token.
Token ID13
Feature activation+2.24e-03

Pos logit contributions
 spine+0.023
hy+0.021
ses+0.020
ro+0.020
 grade+0.018
Neg logit contributions
ull-0.015
Joe-0.014
in-0.014
 them-0.014
Mom-0.013
Token She
Token ID1375
Feature activation+1.67e-06

Pos logit contributions
 spine+0.004
hy+0.003
ro+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.004
Joe-0.004
Mom-0.003
 them-0.003
Jess-0.003
Token thought
Token ID1807
Feature activation-2.08e-03

Feature not active on previous token; no predictions were affected.
Token it
Token ID340
Feature activation+2.31e-02

Pos logit contributions
ull+0.002
 them+0.002
Joe+0.002
in+0.002
Jess+0.002
Neg logit contributions
 spine-0.005
ro-0.005
hy-0.005
ses-0.005
 grade-0.004
Token tasted
Token ID29187
Feature activation+5.30e-02

Pos logit contributions
 spine+0.055
ro+0.046
hy+0.046
ses+0.045
 grade+0.042
Neg logit contributions
ull-0.029
Joe-0.029
 them-0.026
in-0.026
Mom-0.025
Token so
Token ID523
Feature activation+2.50e-02

Pos logit contributions
 spine+0.107
ro+0.093
hy+0.091
ses+0.087
 grade+0.082
Neg logit contributions
ull-0.078
Joe-0.075
Mom-0.069
add-0.067
Jess-0.067
Token good
Token ID922
Feature activation-7.53e-03

Pos logit contributions
 spine+0.054
ro+0.048
hy+0.047
ses+0.045
 grade+0.043
Neg logit contributions
ull-0.033
Joe-0.031
Mom-0.029
agles-0.028
eper-0.027
Token.
Token ID13
Feature activation-3.82e-03

Pos logit contributions
Joe+0.010
ull+0.010
 them+0.009
in+0.009
ins+0.009
Neg logit contributions
 spine-0.017
hy-0.015
ro-0.014
ses-0.014
 grade-0.013
Token She
Token ID1375
Feature activation-8.00e-03

Pos logit contributions
Joe+0.007
ull+0.007
Mom+0.006
Jess+0.006
 them+0.006
Neg logit contributions
 spine-0.007
hy-0.005
ro-0.005
ses-0.005
 grade-0.005
Token was
Token ID373
Feature activation-6.80e-03

Pos logit contributions
ull+0.012
Joe+0.012
Mom+0.011
 them+0.011
Jess+0.011
Neg logit contributions
 spine-0.016
ro-0.014
hy-0.014
ses-0.013
 grade-0.012
Token away
Token ID1497
Feature activation+1.59e-03

Pos logit contributions
ull+0.002
Joe+0.002
Mom+0.001
 them+0.001
add+0.001
Neg logit contributions
 spine-0.002
ro-0.002
hy-0.002
ses-0.002
 grade-0.002
Token.
Token ID13
Feature activation+2.33e-02

Pos logit contributions
 spine+0.004
ses+0.003
ro+0.003
hy+0.003
 grade+0.003
Neg logit contributions
ull-0.002
Joe-0.002
 them-0.002
 friends-0.002
Jess-0.002
Token
Token ID220
Feature activation-2.61e-04

Pos logit contributions
 spine+0.050
hy+0.044
ro+0.042
ses+0.042
 grade+0.038
Neg logit contributions
ull-0.032
Joe-0.031
Mom-0.030
Jess-0.030
 them-0.030
Token
Token ID198
Feature activation+1.70e-02

Pos logit contributions
ull+0.000
Joe+0.000
Mom+0.000
agles+0.000
add+0.000
Neg logit contributions
 spine-0.000
ro-0.000
hy-0.000
ses-0.000
 grade-0.000
Token
Token ID198
Feature activation+1.94e-02

Pos logit contributions
 spine+0.034
ro+0.030
hy+0.029
ses+0.028
 grade+0.026
Neg logit contributions
ull-0.025
Joe-0.025
Mom-0.023
 them-0.022
add-0.022
TokenM
Token ID44
Feature activation+5.30e-02

Pos logit contributions
 spine+0.050
ro+0.045
hy+0.044
ses+0.043
 grade+0.041
Neg logit contributions
ull-0.018
Joe-0.017
Mom-0.015
 them-0.014
add-0.014
Tokenacy
Token ID1590
Feature activation+2.08e-02

Pos logit contributions
 spine+0.140
hy+0.129
ses+0.124
ro+0.119
sed+0.115
Neg logit contributions
in-0.051
ull-0.051
ins-0.049
add-0.046
Joe-0.045
Token looked
Token ID3114
Feature activation+1.25e-03

Pos logit contributions
 spine+0.042
ro+0.034
hy+0.034
ses+0.033
 grade+0.031
Neg logit contributions
ull-0.033
Joe-0.032
Mom-0.030
in-0.029
 them-0.029
Token around
Token ID1088
Feature activation+9.20e-03

Pos logit contributions
 spine+0.003
ro+0.002
hy+0.002
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.002
Joe-0.001
 them-0.001
in-0.001
Mom-0.001
Token her
Token ID607
Feature activation+4.22e-03

Pos logit contributions
 spine+0.020
ses+0.019
hy+0.019
ro+0.018
 grade+0.017
Neg logit contributions
 them-0.013
Joe-0.011
Jess-0.010
ull-0.010
 friends-0.010
Token clean
Token ID3424
Feature activation-1.91e-02

Pos logit contributions
 spine+0.009
hy+0.008
ro+0.008
ses+0.008
 grade+0.008
Neg logit contributions
 friends-0.005
ull-0.005
Joe-0.005
Mom-0.005
 them-0.005
Token special
Token ID2041
Feature activation-4.16e-03

Pos logit contributions
 spine+0.026
ro+0.023
hy+0.022
ses+0.021
 grade+0.019
Neg logit contributions
ull-0.018
Joe-0.016
Mom-0.016
Jess-0.015
in-0.015
Token snap
Token ID11495
Feature activation-1.92e-02

Pos logit contributions
ull+0.007
Joe+0.006
 them+0.006
Mom+0.006
in+0.005
Neg logit contributions
 spine-0.008
ro-0.007
hy-0.007
ses-0.006
 grade-0.006
Token.
Token ID13
Feature activation+8.94e-03

Pos logit contributions
ull+0.028
Joe+0.028
Mom+0.025
 them+0.025
Jess+0.025
Neg logit contributions
 spine-0.039
ro-0.034
hy-0.033
ses-0.032
 grade-0.030
Token When
Token ID1649
Feature activation+3.38e-03

Pos logit contributions
 spine+0.015
ro+0.012
hy+0.012
ses+0.012
 grade+0.011
Neg logit contributions
ull-0.017
Joe-0.016
Mom-0.015
 them-0.015
Jess-0.015
Token the
Token ID262
Feature activation+1.58e-02

Pos logit contributions
 spine+0.007
ro+0.006
hy+0.006
ses+0.006
 grade+0.006
Neg logit contributions
ull-0.005
Joe-0.005
Mom-0.004
 them-0.004
Jess-0.004
Token farmer
Token ID18739
Feature activation+5.22e-02

Pos logit contributions
 spine+0.037
ro+0.031
ses+0.031
hy+0.031
 grade+0.029
Neg logit contributions
Joe-0.019
ull-0.018
ins-0.017
ups-0.016
add-0.016
Token snapped
Token ID20821
Feature activation+7.82e-03

Pos logit contributions
 spine+0.107
ro+0.090
hy+0.088
ses+0.088
 grade+0.080
Neg logit contributions
ull-0.079
Joe-0.076
Mom-0.070
 them-0.069
in-0.069
Token the
Token ID262
Feature activation+1.77e-02

Pos logit contributions
 spine+0.016
ro+0.014
hy+0.013
ses+0.013
 grade+0.012
Neg logit contributions
ull-0.012
Joe-0.011
Mom-0.010
ins-0.010
add-0.010
Token fire
Token ID2046
Feature activation+2.62e-02

Pos logit contributions
 spine+0.042
ro+0.037
hy+0.037
ses+0.035
 grade+0.034
Neg logit contributions
ull-0.020
Joe-0.019
Mom-0.017
 them-0.017
Jess-0.016
Tokenfly
Token ID12254
Feature activation+2.30e-02

Pos logit contributions
 spine+0.048
ro+0.041
hy+0.040
ses+0.038
 grade+0.036
Neg logit contributions
ull-0.044
Joe-0.042
Mom-0.039
 them-0.039
agles-0.038
Token's
Token ID338
Feature activation+6.54e-03

Pos logit contributions
 spine+0.046
ro+0.040
hy+0.039
ses+0.038
 grade+0.035
Neg logit contributions
ull-0.034
Joe-0.033
Mom-0.030
 them-0.030
Jess-0.029
Token long
Token ID890
Feature activation-1.12e-02

Pos logit contributions
Joe+0.011
ull+0.011
 them+0.011
Mom+0.011
in+0.010
Neg logit contributions
 spine-0.019
ses-0.018
hy-0.017
ro-0.017
 grade-0.016
Token.
Token ID13
Feature activation+1.86e-03

Pos logit contributions
 them+0.012
ull+0.011
Joe+0.011
Mom+0.010
in+0.010
Neg logit contributions
 spine-0.027
ses-0.024
hy-0.023
ro-0.023
 grade-0.022
Token He
Token ID679
Feature activation+7.86e-03

Pos logit contributions
 spine+0.004
ro+0.003
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.003
Joe-0.003
Mom-0.003
 them-0.003
add-0.003
Token loved
Token ID6151
Feature activation+1.71e-02

Pos logit contributions
 spine+0.016
hy+0.013
ro+0.013
ses+0.013
 grade+0.012
Neg logit contributions
ull-0.012
Joe-0.012
 them-0.011
Mom-0.011
in-0.010
Token the
Token ID262
Feature activation+2.30e-02

Pos logit contributions
 spine+0.036
ro+0.030
hy+0.029
ses+0.028
 grade+0.025
Neg logit contributions
ull-0.026
 them-0.026
Joe-0.025
Mom-0.023
in-0.023
Token deep
Token ID2769
Feature activation+5.21e-02

Pos logit contributions
 spine+0.051
ro+0.044
hy+0.042
ses+0.042
 mun+0.038
Neg logit contributions
ull-0.029
Mom-0.029
 them-0.029
Everyone-0.028
Joe-0.028
Token sounds
Token ID5238
Feature activation-3.02e-04

Pos logit contributions
 spine+0.104
hy+0.092
ro+0.089
ses+0.088
 mun+0.083
Neg logit contributions
ull-0.069
 them-0.069
Mom-0.067
Joe-0.064
in-0.063
Token it
Token ID340
Feature activation+7.78e-03

Pos logit contributions
ull+0.000
Joe+0.000
Mom+0.000
add+0.000
Jess+0.000
Neg logit contributions
 spine-0.001
ro-0.001
hy-0.000
ses-0.000
 grade-0.000
Token made
Token ID925
Feature activation+7.48e-03

Pos logit contributions
 spine+0.018
ses+0.016
ro+0.015
hy+0.015
sed+0.014
Neg logit contributions
ull-0.010
 them-0.010
Joe-0.009
in-0.009
 decided-0.008
Token.
Token ID13
Feature activation+2.45e-03

Pos logit contributions
 spine+0.015
ses+0.013
ro+0.012
hy+0.012
sed+0.011
Neg logit contributions
ull-0.012
 them-0.011
Joe-0.011
ins-0.010
in-0.010
Token
Token ID198
Feature activation+1.24e-02

Pos logit contributions
 spine+0.005
ro+0.004
hy+0.004
ses+0.004
 grade+0.004
Neg logit contributions
ull-0.004
Joe-0.004
Mom-0.003
 them-0.003
add-0.003
Token red
Token ID2266
Feature activation-8.36e-03

Pos logit contributions
ull+0.000
Joe+0.000
Mom+0.000
 them+0.000
in+0.000
Neg logit contributions
 spine-0.001
ro-0.001
hy-0.001
ses-0.001
 grade-0.000
Token flower
Token ID15061
Feature activation+1.43e-02

Pos logit contributions
ull+0.012
Joe+0.012
Mom+0.011
 them+0.011
in+0.011
Neg logit contributions
 spine-0.017
ro-0.014
hy-0.014
ses-0.014
 grade-0.012
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
 spine+0.027
ro+0.023
hy+0.023
ses+0.022
 grade+0.021
Neg logit contributions
ull-0.023
Joe-0.021
 them-0.020
Mom-0.020
in-0.020
Token When
Token ID1649
Feature activation-4.63e-03

Pos logit contributions
 spine+0.033
ses+0.028
hy+0.027
ro+0.025
sed+0.024
Neg logit contributions
Joe-0.025
ull-0.025
Mom-0.025
 them-0.024
um-0.023
Token his
Token ID465
Feature activation-1.13e-02

Pos logit contributions
ull+0.007
Joe+0.007
ins+0.006
Mom+0.006
add+0.006
Neg logit contributions
 spine-0.009
ro-0.008
hy-0.008
ses-0.007
 grade-0.007
Token wife
Token ID3656
Feature activation+5.19e-02

Pos logit contributions
ull+0.015
Joe+0.014
Mom+0.013
add+0.013
ins+0.013
Neg logit contributions
 spine-0.025
ro-0.022
hy-0.021
ses-0.021
 grade-0.020
Token came
Token ID1625
Feature activation+6.63e-03

Pos logit contributions
 spine+0.098
ro+0.082
hy+0.080
ses+0.078
 grade+0.071
Neg logit contributions
ull-0.084
Joe-0.082
Mom-0.077
 them-0.076
Jess-0.074
Token home
Token ID1363
Feature activation+3.04e-02

Pos logit contributions
 spine+0.009
ro+0.007
hy+0.007
 grade+0.006
 Sure+0.006
Neg logit contributions
add-0.013
ull-0.013
ins-0.013
agles-0.012
ades-0.012
Token,
Token ID11
Feature activation-4.86e-03

Pos logit contributions
 spine+0.070
ro+0.061
ses+0.061
hy+0.060
 grade+0.054
Neg logit contributions
ull-0.036
Joe-0.036
 them-0.033
Mom-0.032
in-0.031
Token he
Token ID339
Feature activation+3.39e-03

Pos logit contributions
ull+0.006
Joe+0.006
Mom+0.006
 them+0.005
Jess+0.005
Neg logit contributions
 spine-0.011
ro-0.009
hy-0.009
ses-0.009
 grade-0.008
Token put
Token ID1234
Feature activation+1.31e-02

Pos logit contributions
 spine+0.007
ro+0.006
hy+0.006
ses+0.006
 grade+0.005
Neg logit contributions
ull-0.005
Joe-0.005
Mom-0.005
 them-0.005
in-0.004
Token
Token ID220
Feature activation-7.65e-03

Pos logit contributions
Joe+0.002
ull+0.001
Mom+0.001
 them+0.001
Jess+0.001
Neg logit contributions
 spine-0.003
ro-0.002
hy-0.002
ses-0.002
 grade-0.002
Token
Token ID198
Feature activation+4.17e-03

Pos logit contributions
ull+0.013
Joe+0.013
agles+0.012
add+0.012
Mom+0.012
Neg logit contributions
 spine-0.013
ro-0.011
hy-0.011
ses-0.010
 grade-0.010
Token
Token ID198
Feature activation+5.37e-04

Pos logit contributions
 spine+0.009
ro+0.008
hy+0.007
ses+0.007
 grade+0.007
Neg logit contributions
ull-0.006
Joe-0.006
Mom-0.005
 them-0.005
Jess-0.005
TokenWhen
Token ID2215
Feature activation-6.34e-03

Pos logit contributions
 spine+0.001
ro+0.001
hy+0.001
ses+0.001
 grade+0.001
Neg logit contributions
ull-0.001
Joe-0.001
ins-0.001
agles-0.001
add-0.001
Token the
Token ID262
Feature activation-8.67e-03

Pos logit contributions
 them+0.006
Joe+0.006
ull+0.006
Mom+0.006
Jess+0.006
Neg logit contributions
 spine-0.016
ro-0.015
hy-0.014
ses-0.014
 grade-0.013
Token farmer
Token ID18739
Feature activation+5.13e-02

Pos logit contributions
ull+0.013
Joe+0.012
Mom+0.011
Jess+0.011
agles+0.011
Neg logit contributions
 spine-0.018
ro-0.015
hy-0.015
ses-0.015
 grade-0.014
Token finally
Token ID3443
Feature activation+1.26e-02

Pos logit contributions
 spine+0.106
ro+0.090
hy+0.088
ses+0.086
 grade+0.080
Neg logit contributions
ull-0.077
Joe-0.074
Mom-0.068
 them-0.067
add-0.066
Token found
Token ID1043
Feature activation+8.33e-03

Pos logit contributions
 spine+0.024
ro+0.020
hy+0.020
ses+0.019
 grade+0.018
Neg logit contributions
ull-0.021
Joe-0.020
Mom-0.019
 them-0.018
Jess-0.018
Token her
Token ID607
Feature activation+4.56e-03

Pos logit contributions
 spine+0.015
ro+0.013
hy+0.013
ses+0.012
 grade+0.011
Neg logit contributions
ull-0.014
Joe-0.013
Mom-0.012
ins-0.012
agles-0.012
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
 spine+0.009
ro+0.009
hy+0.008
WR+0.008
ses+0.008
Neg logit contributions
ull-0.006
 friends-0.006
 them-0.006
in-0.005
Joe-0.005
Token it
Token ID340
Feature activation-4.19e-03

Pos logit contributions
ull+0.012
Joe+0.012
 them+0.011
Mom+0.011
Jess+0.011
Neg logit contributions
 spine-0.024
ro-0.022
hy-0.021
ses-0.020
 grade-0.019
Token look
Token ID804
Feature activation+3.95e-03

Pos logit contributions
 spine+0.035
ro+0.029
ses+0.029
hy+0.028
sed+0.025
Neg logit contributions
ull-0.030
Joe-0.029
 them-0.029
Mom-0.027
Jess-0.026
Token like
Token ID588
Feature activation+8.12e-03

Pos logit contributions
 spine+0.006
ro+0.004
hy+0.004
ses+0.004
 grade+0.004
Neg logit contributions
ull-0.008
Joe-0.008
Mom-0.008
Jess-0.008
add-0.008
Token.
Token ID13
Feature activation+8.64e-03

Pos logit contributions
 spine+0.014
ro+0.013
hy+0.012
ses+0.011
 grade+0.011
Neg logit contributions
ull-0.013
Joe-0.013
add-0.012
Mom-0.012
agles-0.012
Token When
Token ID1649
Feature activation-4.25e-03

Pos logit contributions
 spine+0.015
ro+0.013
hy+0.012
ses+0.012
 grade+0.011
Neg logit contributions
ull-0.014
Joe-0.014
 them-0.014
Mom-0.014
add-0.013
Token the
Token ID262
Feature activation+1.19e-02

Pos logit contributions
ull+0.006
Joe+0.006
ins+0.005
Mom+0.005
add+0.005
Neg logit contributions
 spine-0.009
ro-0.008
hy-0.008
ses-0.007
 grade-0.007
Token witch
Token ID16365
Feature activation+5.13e-02

Pos logit contributions
 spine+0.026
ro+0.023
hy+0.022
ses+0.022
 grade+0.020
Neg logit contributions
ull-0.016
Joe-0.014
Mom-0.014
 them-0.014
add-0.014
Token was
Token ID373
Feature activation+7.64e-03

Pos logit contributions
 spine+0.105
ro+0.088
hy+0.087
ses+0.085
 grade+0.078
Neg logit contributions
ull-0.078
Joe-0.075
 them-0.069
Mom-0.069
in-0.068
Token done
Token ID1760
Feature activation+1.08e-02

Pos logit contributions
 spine+0.015
ro+0.013
hy+0.013
ses+0.012
 grade+0.011
Neg logit contributions
ull-0.012
Joe-0.012
 them-0.011
Mom-0.011
add-0.011
Token stirring
Token ID26547
Feature activation+2.52e-02

Pos logit contributions
 spine+0.022
ro+0.019
hy+0.018
ses+0.018
 grade+0.017
Neg logit contributions
ull-0.016
Joe-0.015
Mom-0.014
add-0.014
ins-0.014
Token,
Token ID11
Feature activation-8.13e-03

Pos logit contributions
 spine+0.055
ro+0.048
hy+0.047
ses+0.046
 grade+0.042
Neg logit contributions
ull-0.034
Joe-0.033
 them-0.031
Mom-0.030
Jess-0.029
Token the
Token ID262
Feature activation+1.28e-02

Pos logit contributions
ull+0.009
Joe+0.009
 them+0.008
Mom+0.008
Jess+0.008
Neg logit contributions
 spine-0.020
ro-0.018
hy-0.017
ses-0.017
 grade-0.016
Token amazed
Token ID24562
Feature activation+6.53e-03

Pos logit contributions
 spine+0.001
ro+0.001
hy+0.001
ses+0.001
 grade+0.001
Neg logit contributions
ull-0.001
Joe-0.001
agles-0.001
Mom-0.001
add-0.001
Token!
Token ID0
Feature activation-1.37e-03

Pos logit contributions
 spine+0.013
ro+0.011
ses+0.011
hy+0.011
 grade+0.010
Neg logit contributions
ull-0.010
Joe-0.010
 them-0.009
Mom-0.009
in-0.009
Token Even
Token ID3412
Feature activation-1.94e-02

Pos logit contributions
ull+0.002
Joe+0.002
Mom+0.002
 them+0.002
agles+0.002
Neg logit contributions
 spine-0.002
ro-0.002
hy-0.002
ses-0.002
 grade-0.002
Token though
Token ID996
Feature activation-3.09e-03

Pos logit contributions
ull+0.033
Joe+0.032
Mom+0.030
 them+0.029
Jess+0.029
Neg logit contributions
 spine-0.036
ro-0.030
hy-0.029
ses-0.028
 grade-0.026
Token the
Token ID262
Feature activation-8.72e-03

Pos logit contributions
ull+0.004
Joe+0.004
Mom+0.004
add+0.004
ins+0.004
Neg logit contributions
 spine-0.006
ro-0.005
hy-0.005
ses-0.005
 grade-0.005
Token hive
Token ID35881
Feature activation+5.12e-02

Pos logit contributions
ull+0.010
Joe+0.010
Mom+0.009
 them+0.009
Jess+0.009
Neg logit contributions
 spine-0.020
ro-0.018
hy-0.018
ses-0.017
 grade-0.016
Token was
Token ID373
Feature activation+9.48e-03

Pos logit contributions
 spine+0.108
ro+0.092
hy+0.091
ses+0.087
 grade+0.082
Neg logit contributions
ull-0.074
Joe-0.071
 them-0.066
Mom-0.066
in-0.064
Token hidden
Token ID7104
Feature activation+9.12e-03

Pos logit contributions
 spine+0.019
ro+0.017
hy+0.016
ses+0.016
 grade+0.015
Neg logit contributions
ull-0.014
Joe-0.013
Mom-0.012
add-0.012
ins-0.012
Token,
Token ID11
Feature activation-5.21e-03

Pos logit contributions
 spine+0.018
ro+0.016
hy+0.015
ses+0.015
 grade+0.014
Neg logit contributions
ull-0.014
Joe-0.013
 them-0.012
Mom-0.012
in-0.012
Token she
Token ID673
Feature activation-9.79e-03

Pos logit contributions
ull+0.007
Joe+0.007
Mom+0.006
 them+0.006
Jess+0.006
Neg logit contributions
 spine-0.012
ro-0.010
hy-0.010
ses-0.010
 grade-0.009
Token was
Token ID373
Feature activation-8.51e-03

Pos logit contributions
ull+0.016
Joe+0.015
Mom+0.014
add+0.014
 them+0.014
Neg logit contributions
 spine-0.018
ro-0.016
hy-0.015
ses-0.015
 grade-0.014
Token (space)
Token ID220
Feature activation-1.21e-03

Pos logit contributions
 spine+0.001
ro+0.001
hy+0.001
ses+0.001
 grade+0.001
Neg logit contributions
ull-0.001
Joe-0.001
Mom-0.001
 them-0.001
Jess-0.001
Token (newline)
Token ID198
Feature activation+1.09e-02

Pos logit contributions
ull+0.002
Joe+0.002
agles+0.002
 tides+0.002
add+0.002
Neg logit contributions
 spine-0.002
ro-0.002
hy-0.001
ses-0.001
 Whe-0.001
Token (newline)
Token ID198
Feature activation+2.14e-03

Pos logit contributions
 spine+0.021
ro+0.018
hy+0.017
ses+0.017
 grade+0.016
Neg logit contributions
ull-0.017
Joe-0.016
Mom-0.015
ins-0.015
agles-0.015
TokenWhenever
Token ID28877
Feature activation+1.53e-02

Pos logit contributions
 spine+0.005
ro+0.005
hy+0.005
ses+0.005
 grade+0.004
Neg logit contributions
ull-0.002
Joe-0.002
Mom-0.002
ins-0.002
add-0.002
Token the
Token ID262
Feature activation+3.91e-03

Pos logit contributions
 spine+0.032
ro+0.028
hy+0.027
ses+0.026
 grade+0.024
Neg logit contributions
ull-0.022
Joe-0.021
Mom-0.020
 them-0.019
Jess-0.019
Token fox
Token ID21831
Feature activation+5.09e-02

Pos logit contributions
 spine+0.009
hy+0.009
ro+0.008
ses+0.008
 grade+0.008
Neg logit contributions
Mom-0.004
 friends-0.004
in-0.004
ull-0.004
eper-0.003
Token prints
Token ID20842
Feature activation+2.43e-02

Pos logit contributions
 spine+0.106
hy+0.088
ro+0.087
ses+0.085
 grade+0.078
Neg logit contributions
ull-0.077
Joe-0.072
 them-0.068
in-0.067
Mom-0.067
Token a
Token ID257
Feature activation+1.32e-02

Pos logit contributions
 spine+0.050
ro+0.043
hy+0.042
ses+0.040
 grade+0.037
Neg logit contributions
ull-0.036
Joe-0.034
 them-0.032
Mom-0.032
agles-0.031
Token picture
Token ID4286
Feature activation+5.51e-03

Pos logit contributions
 spine+0.029
ro+0.025
hy+0.025
ses+0.024
 grade+0.022
Neg logit contributions
ull-0.018
Joe-0.016
Mom-0.015
 them-0.015
in-0.015
Token,
Token ID11
Feature activation-8.27e-03

Pos logit contributions
 spine+0.012
ro+0.010
hy+0.010
ses+0.010
 grade+0.009
Neg logit contributions
ull-0.008
Joe-0.007
 them-0.007
Mom-0.007
in-0.007
Token the
Token ID262
Feature activation+9.42e-03

Pos logit contributions
ull+0.010
Joe+0.009
 them+0.009
Mom+0.008
Jess+0.008
Neg logit contributions
 spine-0.020
ro-0.018
hy-0.017
ses-0.017
 grade-0.016
Token.
Token ID13
Feature activation+9.18e-03

Pos logit contributions
 spine+0.013
ro+0.011
hy+0.011
ses+0.010
 grade+0.009
Neg logit contributions
ull-0.009
Joe-0.009
 them-0.008
Mom-0.008
Jess-0.008
Token (newline)
Token ID198
Feature activation+1.30e-02

Pos logit contributions
 spine+0.017
ro+0.014
hy+0.014
ses+0.013
 grade+0.013
Neg logit contributions
ull-0.015
Joe-0.015
Mom-0.014
 them-0.014
add-0.014
Token (newline)
Token ID198
Feature activation+9.30e-03

Pos logit contributions
 spine+0.026
ro+0.023
hy+0.022
ses+0.021
 grade+0.020
Neg logit contributions
ull-0.019
Joe-0.019
Mom-0.017
 them-0.017
Jess-0.017
TokenAfter
Token ID3260
Feature activation+9.19e-03

Pos logit contributions
 spine+0.018
ro+0.016
hy+0.016
ses+0.015
 grade+0.014
Neg logit contributions
ull-0.014
Joe-0.013
agles-0.012
add-0.012
ins-0.012
Token the
Token ID262
Feature activation+2.88e-03

Pos logit contributions
 spine+0.022
ro+0.019
ses+0.018
hy+0.017
sed+0.015
Neg logit contributions
Joe-0.012
 them-0.012
ull-0.011
Mom-0.010
 friends-0.010
Token quiz
Token ID38964
Feature activation+5.09e-02

Pos logit contributions
 spine+0.005
ro+0.005
ses+0.005
hy+0.004
 bru+0.004
Neg logit contributions
 friends-0.004
ups-0.004
Everyone-0.004
Joe-0.004
 other-0.004
Token,
Token ID11
Feature activation-5.01e-04

Pos logit contributions
 spine+0.134
ses+0.117
ro+0.111
sing+0.107
hy+0.106
Neg logit contributions
 them-0.047
ull-0.044
Joe-0.044
in-0.040
 friends-0.038
Token they
Token ID484
Feature activation-1.15e-02

Pos logit contributions
 them+0.001
Mom+0.001
ull+0.001
Joe+0.001
 friends+0.000
Neg logit contributions
 spine-0.001
hy-0.001
ro-0.001
ses-0.001
 grade-0.001
Token continued
Token ID3767
Feature activation-4.24e-03

Pos logit contributions
ull+0.017
Joe+0.016
Mom+0.015
ins+0.015
Jess+0.015
Neg logit contributions
 spine-0.023
ro-0.020
hy-0.020
ses-0.019
 grade-0.018
Token their
Token ID511
Feature activation-4.63e-03

Pos logit contributions
 them+0.006
 friends+0.005
ull+0.005
 boats+0.005
Joe+0.005
Neg logit contributions
 spine-0.010
ro-0.008
hy-0.008
ses-0.008
WR-0.007
Token walk
Token ID2513
Feature activation+7.30e-03

Pos logit contributions
 friends+0.007
ups+0.006
 families+0.006
 them+0.006
ths+0.006
Neg logit contributions
 spine-0.009
ro-0.008
ses-0.008
hy-0.008
 bru-0.007
Token.
Token ID13
Feature activation-7.71e-04

Pos logit contributions
ull+0.038
Joe+0.036
 them+0.034
Mom+0.033
ins+0.032
Neg logit contributions
 spine-0.054
ro-0.046
hy-0.045
ses-0.044
 grade-0.041
Token (newline)
Token ID198
Feature activation+4.60e-03

Pos logit contributions
ull+0.001
Joe+0.001
agles+0.001
in+0.001
ins+0.001
Neg logit contributions
 spine-0.001
ro-0.001
hy-0.001
ses-0.001
 grade-0.001
Token (newline)
Token ID198
Feature activation-1.08e-03

Pos logit contributions
 spine+0.005
ro+0.005
hy+0.005
 Whe+0.004
 lap+0.004
Neg logit contributions
agles-0.009
ull-0.009
 tides-0.008
 anchors-0.008
eper-0.008
TokenWhen
Token ID2215
Feature activation-8.18e-03

Pos logit contributions
ull+0.002
agles+0.002
els+0.001
eper+0.001
 anchors+0.001
Neg logit contributions
ro-0.002
 spine-0.002
hy-0.002
 Whe-0.001
 lick-0.001
Token their
Token ID511
Feature activation-1.00e-02

Pos logit contributions
ull+0.010
Joe+0.009
ins+0.008
Mom+0.008
add+0.008
Neg logit contributions
 spine-0.019
ro-0.017
hy-0.016
ses-0.016
 grade-0.015
Token mum
Token ID25682
Feature activation+5.08e-02

Pos logit contributions
ull+0.015
Joe+0.014
ins+0.013
Mom+0.012
Jess+0.012
Neg logit contributions
 spine-0.021
ro-0.018
hy-0.018
ses-0.017
 grade-0.016
Token saw
Token ID2497
Feature activation+1.31e-03

Pos logit contributions
 spine+0.099
ro+0.083
hy+0.081
ses+0.079
 grade+0.073
Neg logit contributions
ull-0.081
Joe-0.079
Mom-0.074
 them-0.072
Jess-0.071
Token what
Token ID644
Feature activation+7.08e-03

Pos logit contributions
 spine+0.003
ro+0.003
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.001
Joe-0.001
Mom-0.001
add-0.001
Jess-0.001
Token the
Token ID262
Feature activation+2.26e-02

Pos logit contributions
 spine+0.013
hy+0.011
ro+0.011
ses+0.011
 grade+0.010
Neg logit contributions
ull-0.011
Joe-0.010
Mom-0.010
add-0.010
agles-0.010
Token two
Token ID734
Feature activation+5.87e-03

Pos logit contributions
 spine+0.046
ro+0.040
hy+0.038
ses+0.037
 grade+0.034
Neg logit contributions
ull-0.033
Joe-0.032
Mom-0.031
 them-0.030
Jess-0.029
Token had
Token ID550
Feature activation+1.02e-02

Pos logit contributions
 spine+0.012
hy+0.010
ro+0.010
 grade+0.009
ses+0.009
Neg logit contributions
ull-0.009
Joe-0.008
Mom-0.007
add-0.007
Jon-0.007
Token stand
Token ID1302
Feature activation+1.06e-02

Pos logit contributions
 spine+0.028
ro+0.025
hy+0.024
ses+0.024
 grade+0.022
Neg logit contributions
ull-0.016
Joe-0.015
 them-0.014
Mom-0.014
in-0.014
Token.
Token ID13
Feature activation+2.61e-03

Pos logit contributions
 spine+0.022
ro+0.019
hy+0.018
ses+0.018
 grade+0.017
Neg logit contributions
ull-0.016
Joe-0.015
Mom-0.014
 them-0.014
ins-0.013
Token He
Token ID679
Feature activation-2.32e-03

Pos logit contributions
 spine+0.005
ses+0.004
hy+0.004
ro+0.004
 grade+0.004
Neg logit contributions
Joe-0.004
ull-0.004
Mom-0.004
 them-0.004
Jess-0.004
Token thought
Token ID1807
Feature activation-7.28e-03

Pos logit contributions
ull+0.004
Joe+0.004
Mom+0.003
ins+0.003
agles+0.003
Neg logit contributions
 spine-0.004
ro-0.004
hy-0.004
ses-0.003
 grade-0.003
Token it
Token ID340
Feature activation+1.93e-02

Pos logit contributions
ull+0.007
Joe+0.006
 them+0.006
in+0.005
add+0.005
Neg logit contributions
 spine-0.019
ro-0.017
hy-0.017
ses-0.017
 grade-0.015
Token smelled
Token ID37609
Feature activation+5.06e-02

Pos logit contributions
 spine+0.037
ro+0.032
hy+0.031
ses+0.030
 grade+0.028
Neg logit contributions
ull-0.031
Joe-0.030
Mom-0.028
 them-0.027
Jess-0.027
Token delicious
Token ID12625
Feature activation+1.82e-02

Pos logit contributions
 spine+0.126
ro+0.111
ses+0.107
hy+0.105
 grade+0.098
Neg logit contributions
ull-0.057
Joe-0.055
 them-0.054
Mom-0.049
in-0.046
Token,
Token ID11
Feature activation+3.47e-04

Pos logit contributions
 spine+0.038
ro+0.033
hy+0.032
ses+0.031
 grade+0.029
Neg logit contributions
ull-0.026
Joe-0.026
Mom-0.023
 them-0.023
ins-0.023
Token so
Token ID523
Feature activation+1.56e-04

Pos logit contributions
 spine+0.001
ro+0.001
hy+0.001
ses+0.001
 grade+0.001
Neg logit contributions
Joe-0.000
 them-0.000
ull-0.000
Mom-0.000
ins-0.000
Token he
Token ID339
Feature activation-1.57e-02

Pos logit contributions
 spine+0.000
ro+0.000
hy+0.000
ses+0.000
 grade+0.000
Neg logit contributions
ull-0.000
Joe-0.000
Mom-0.000
ins-0.000
add-0.000
Token went
Token ID1816
Feature activation-1.38e-02

Pos logit contributions
ull+0.023
Joe+0.022
agles+0.021
ins+0.021
irs+0.021
Neg logit contributions
 spine-0.029
ro-0.027
hy-0.024
 mun-0.024
ses-0.024
Token a
Token ID257
Feature activation+1.88e-02

Pos logit contributions
 spine+0.008
ro+0.007
hy+0.007
ses+0.007
 grade+0.006
Neg logit contributions
ull-0.005
Joe-0.005
Mom-0.005
Jess-0.005
add-0.005
Token lesson
Token ID11483
Feature activation-1.36e-02

Pos logit contributions
 spine+0.037
ro+0.032
hy+0.031
ses+0.030
 grade+0.028
Neg logit contributions
ull-0.029
Joe-0.028
Mom-0.025
 them-0.025
Jess-0.025
Token.
Token ID13
Feature activation+4.77e-03

Pos logit contributions
ull+0.018
Joe+0.018
 them+0.017
Mom+0.016
in+0.016
Neg logit contributions
 spine-0.029
ro-0.025
hy-0.025
ses-0.025
 grade-0.023
Token As
Token ID1081
Feature activation-2.85e-03

Pos logit contributions
 spine+0.009
ses+0.008
hy+0.008
ro+0.007
 grade+0.007
Neg logit contributions
Joe-0.008
ull-0.007
Mom-0.007
 them-0.007
ups-0.007
Token the
Token ID262
Feature activation+3.17e-03

Pos logit contributions
ull+0.004
Joe+0.004
ins+0.004
Mom+0.004
add+0.004
Neg logit contributions
 spine-0.006
ro-0.005
hy-0.005
ses-0.005
 grade-0.004
Token farmer
Token ID18739
Feature activation+5.05e-02

Pos logit contributions
 spine+0.007
hy+0.006
ro+0.006
ses+0.006
 lick+0.005
Neg logit contributions
ull-0.004
Joe-0.004
ins-0.004
Jess-0.004
in-0.004
Token approached
Token ID10448
Feature activation+1.00e-02

Pos logit contributions
 spine+0.109
hy+0.090
ses+0.089
ro+0.089
 grade+0.082
Neg logit contributions
ull-0.072
Joe-0.071
Mom-0.066
in-0.065
 them-0.063
Token,
Token ID11
Feature activation-8.35e-03

Pos logit contributions
 spine+0.022
ro+0.019
hy+0.018
ses+0.018
 grade+0.017
Neg logit contributions
ull-0.014
Joe-0.014
 them-0.013
Mom-0.012
agles-0.012
Token the
Token ID262
Feature activation+2.80e-04

Pos logit contributions
ull+0.011
Joe+0.011
Mom+0.010
 them+0.010
Jess+0.010
Neg logit contributions
 spine-0.018
ro-0.016
hy-0.015
ses-0.015
 grade-0.014
Token goat
Token ID26917
Feature activation+1.68e-02

Pos logit contributions
 spine+0.001
ro+0.001
hy+0.001
ses+0.001
 grade+0.000
Neg logit contributions
ull-0.000
Joe-0.000
ins-0.000
Jess-0.000
add-0.000
Token panicked
Token ID38225
Feature activation-2.71e-04

Pos logit contributions
 spine+0.033
ro+0.028
hy+0.028
ses+0.027
 grade+0.025
Neg logit contributions
ull-0.026
Joe-0.025
Mom-0.023
 them-0.023
Jess-0.022
Token!
Token ID0
Feature activation+3.77e-03

Pos logit contributions
ull+0.000
Joe+0.000
Mom+0.000
Jess+0.000
add+0.000
Neg logit contributions
 spine-0.000
ro-0.000
hy-0.000
ses-0.000
 grade-0.000
Token (newline)
Token ID198
Feature activation+8.00e-03

Pos logit contributions
 spine+0.008
ro+0.006
hy+0.006
ses+0.006
 grade+0.006
Neg logit contributions
Joe-0.006
ull-0.006
Mom-0.005
add-0.005
 them-0.005
Token (newline)
Token ID198
Feature activation+3.19e-03

Pos logit contributions
 spine+0.017
ro+0.014
hy+0.014
ses+0.014
 grade+0.013
Neg logit contributions
ull-0.012
Joe-0.011
Mom-0.010
 them-0.010
Jess-0.010
TokenWhen
Token ID2215
Feature activation-9.43e-03

Pos logit contributions
 spine+0.007
ro+0.006
hy+0.006
ses+0.006
 grade+0.005
Neg logit contributions
ull-0.004
Joe-0.004
Mom-0.004
 them-0.004
ins-0.004
Token the
Token ID262
Feature activation-5.36e-03

Pos logit contributions
Joe+0.012
ull+0.012
 them+0.010
Mom+0.010
Jess+0.010
Neg logit contributions
 spine-0.022
ro-0.020
hy-0.019
ses-0.019
 grade-0.018
Token fisherman
Token ID48750
Feature activation+4.99e-02

Pos logit contributions
Joe+0.007
ull+0.007
ins+0.007
Jess+0.007
irs+0.007
Neg logit contributions
 spine-0.011
ro-0.009
hy-0.009
ses-0.009
 lick-0.008
Token came
Token ID1625
Feature activation+4.24e-03

Pos logit contributions
 spine+0.106
hy+0.090
ro+0.089
ses+0.089
 grade+0.082
Neg logit contributions
ull-0.074
Joe-0.071
 them-0.065
Mom-0.063
in-0.063
Token back
Token ID736
Feature activation+8.54e-03

Pos logit contributions
 spine+0.006
ro+0.004
 for+0.004
 Sure+0.004
ses+0.004
Neg logit contributions
ins-0.009
agles-0.008
idy-0.008
ades-0.008
irs-0.008
Token to
Token ID284
Feature activation-5.08e-03

Pos logit contributions
 spine+0.017
ro+0.014
hy+0.014
ses+0.013
 grade+0.013
Neg logit contributions
ull-0.013
Joe-0.013
Mom-0.012
 them-0.012
Jess-0.012
Token the
Token ID262
Feature activation-6.78e-03

Pos logit contributions
 them+0.006
ull+0.006
Joe+0.006
 us+0.005
Jess+0.005
Neg logit contributions
 spine-0.012
hy-0.011
ro-0.011
ses-0.010
ical-0.010
Token boat
Token ID8848
Feature activation+1.39e-02

Pos logit contributions
ins+0.009
Joe+0.009
ull+0.009
agles+0.009
irs+0.009
Neg logit contributions
 spine-0.014
ro-0.012
hy-0.011
ses-0.011
 shelf-0.010
Token mom
Token ID1995
Feature activation-5.03e-03

Pos logit contributions
ull+0.016
Joe+0.015
Mom+0.013
Jess+0.013
 them+0.012
Neg logit contributions
 spine-0.043
ro-0.038
hy-0.037
ses-0.037
 grade-0.035
Tokenm
Token ID76
Feature activation-3.42e-02

Pos logit contributions
ull+0.005
Joe+0.005
 them+0.004
Mom+0.004
ins+0.004
Neg logit contributions
 spine-0.013
ro-0.011
hy-0.011
ses-0.011
 grade-0.010
Tokenies
Token ID444
Feature activation-2.46e-03

Pos logit contributions
Joe+0.040
Mom+0.039
ull+0.037
Jess+0.036
agles+0.034
Neg logit contributions
 spine-0.079
ro-0.069
ses-0.065
hy-0.064
 grade-0.061
Token and
Token ID290
Feature activation-1.17e-02

Pos logit contributions
ull+0.004
Joe+0.004
 them+0.004
Mom+0.004
in+0.004
Neg logit contributions
 spine-0.004
ro-0.004
hy-0.004
ses-0.004
 grade-0.003
Token d
Token ID288
Feature activation-3.47e-02

Pos logit contributions
ull+0.018
Joe+0.018
Mom+0.016
add+0.016
Jess+0.016
Neg logit contributions
 spine-0.023
ro-0.019
hy-0.019
ses-0.018
 grade-0.017
Tokenadd
Token ID2860
Feature activation-7.53e-02

Pos logit contributions
ull+0.037
Joe+0.035
Jess+0.033
Mom+0.032
Dad+0.029
Neg logit contributions
 spine-0.083
ro-0.077
hy-0.072
 grade-0.067
ses-0.067
Tokenies
Token ID444
Feature activation-1.18e-02

Pos logit contributions
ull+0.057
Joe+0.055
Mom+0.048
Jess+0.044
add+0.041
Neg logit contributions
 spine-0.206
ro-0.186
hy-0.181
ses-0.177
 grade-0.169
Token when
Token ID618
Feature activation-2.31e-02

Pos logit contributions
ull+0.019
Joe+0.018
 them+0.018
in+0.017
Mom+0.017
Neg logit contributions
 spine-0.022
ro-0.019
hy-0.019
ses-0.018
 grade-0.017
Token they
Token ID484
Feature activation-1.49e-02

Pos logit contributions
ull+0.038
Joe+0.037
eper+0.034
Mom+0.033
add+0.033
Neg logit contributions
 spine-0.038
ro-0.031
ses-0.030
 grade-0.030
hy-0.029
Token say
Token ID910
Feature activation-1.44e-02

Pos logit contributions
ull+0.020
 them+0.020
Joe+0.020
add+0.019
in+0.019
Neg logit contributions
 spine-0.031
ro-0.027
hy-0.026
ses-0.025
 grade-0.023
Token things
Token ID1243
Feature activation-1.61e-02

Pos logit contributions
Joe+0.023
ull+0.023
Mom+0.021
Jess+0.020
add+0.020
Neg logit contributions
 spine-0.026
hy-0.022
ro-0.022
ses-0.021
 grade-0.020
Token squ
Token ID2809
Feature activation-2.80e-03

Pos logit contributions
 spine+0.008
ro+0.006
 grade+0.006
hy+0.006
ses+0.006
Neg logit contributions
ull-0.005
Joe-0.005
 them-0.005
Mom-0.005
in-0.004
Tokenaw
Token ID707
Feature activation-8.75e-06

Pos logit contributions
ull+0.004
Joe+0.004
Mom+0.004
Jess+0.004
 them+0.004
Neg logit contributions
 spine-0.005
ro-0.005
hy-0.005
ses-0.004
 grade-0.004
Tokenked
Token ID9091
Feature activation-1.50e-02

Feature not active on prev token; no predictions were affected.
Token and
Token ID290
Feature activation+3.29e-03

Pos logit contributions
ull+0.025
Joe+0.024
 them+0.022
Mom+0.022
ins+0.021
Neg logit contributions
 spine-0.029
hy-0.024
ro-0.024
ses-0.023
 grade-0.022
Token w
Token ID266
Feature activation-7.53e-03

Pos logit contributions
 spine+0.006
ro+0.005
hy+0.005
ses+0.005
 grade+0.005
Neg logit contributions
ull-0.006
Joe-0.005
 them-0.005
Mom-0.005
Jess-0.005
Tokenadd
Token ID2860
Feature activation-7.11e-02

Pos logit contributions
ull+0.010
Joe+0.010
Mom+0.010
Jess+0.009
ups+0.009
Neg logit contributions
 spine-0.015
ro-0.014
hy-0.013
ses-0.013
 grade-0.012
Tokenled
Token ID992
Feature activation-2.64e-02

Pos logit contributions
ull+0.098
Joe+0.097
Mom+0.093
Jess+0.090
add+0.085
Neg logit contributions
 spine-0.146
ro-0.131
hy-0.126
ses-0.120
 grade-0.110
Token around
Token ID1088
Feature activation-1.83e-02

Pos logit contributions
ull+0.040
 them+0.038
Joe+0.038
ins+0.034
in+0.034
Neg logit contributions
 spine-0.053
hy-0.046
ro-0.046
ses-0.044
 grade-0.042
Token,
Token ID11
Feature activation-1.54e-02

Pos logit contributions
 them+0.023
ull+0.020
Joe+0.019
 friends+0.017
 boats+0.016
Neg logit contributions
 spine-0.043
hy-0.042
ro-0.039
ses-0.038
 grade-0.037
Token enjoying
Token ID13226
Feature activation-7.34e-03

Pos logit contributions
ull+0.024
Joe+0.024
 them+0.024
Mom+0.022
in+0.021
Neg logit contributions
 spine-0.028
ro-0.025
hy-0.024
 grade-0.023
ses-0.023
Token the
Token ID262
Feature activation-4.42e-04

Pos logit contributions
ull+0.011
Joe+0.010
Mom+0.009
add+0.009
Jess+0.009
Neg logit contributions
 spine-0.015
ro-0.013
hy-0.013
ses-0.012
 grade-0.012
Token mom
Token ID1995
Feature activation+1.01e-02

Pos logit contributions
 spine+0.014
ro+0.012
hy+0.012
ses+0.012
 grade+0.011
Neg logit contributions
ull-0.010
Joe-0.009
Mom-0.009
Jess-0.008
add-0.008
Tokenm
Token ID76
Feature activation-2.30e-02

Pos logit contributions
 spine+0.023
ses+0.020
ro+0.020
hy+0.018
 grade+0.018
Neg logit contributions
ull-0.012
 them-0.012
Joe-0.012
ins-0.012
ups-0.012
Tokenies
Token ID444
Feature activation+3.33e-03

Pos logit contributions
Joe+0.023
ull+0.022
Mom+0.022
Jess+0.022
agles+0.019
Neg logit contributions
 spine-0.056
ro-0.051
hy-0.050
ses-0.047
 grade-0.046
Token and
Token ID290
Feature activation-1.91e-03

Pos logit contributions
 spine+0.007
ro+0.006
hy+0.005
ses+0.005
 grade+0.005
Neg logit contributions
ull-0.005
Joe-0.005
Mom-0.005
 them-0.005
ins-0.004
Token d
Token ID288
Feature activation-3.04e-02

Pos logit contributions
ull+0.002
Joe+0.002
Mom+0.002
 them+0.002
Jess+0.002
Neg logit contributions
 spine-0.004
ro-0.004
hy-0.004
ses-0.004
 grade-0.003
Tokenadd
Token ID2860
Feature activation-7.10e-02

Pos logit contributions
ull+0.046
Jess+0.044
Joe+0.044
Mom+0.041
Dad+0.038
Neg logit contributions
 spine-0.059
ro-0.057
hy-0.052
ses-0.046
 grade-0.045
Tokenies
Token ID444
Feature activation-1.07e-02

Pos logit contributions
ull+0.076
Joe+0.074
Mom+0.067
Jess+0.064
add+0.062
Neg logit contributions
 spine-0.172
ro-0.154
hy-0.150
ses-0.144
 grade-0.137
Token.
Token ID13
Feature activation-9.19e-03

Pos logit contributions
ull+0.016
Joe+0.015
 them+0.014
Mom+0.014
ins+0.014
Neg logit contributions
 spine-0.021
ro-0.018
hy-0.018
ses-0.017
 grade-0.016
Token Lily
Token ID20037
Feature activation+6.85e-03

Pos logit contributions
ull+0.017
Joe+0.016
agles+0.015
in+0.015
ins+0.015
Neg logit contributions
 spine-0.016
ro-0.013
hy-0.013
ses-0.012
 grade-0.011
Token and
Token ID290
Feature activation+1.71e-02

Pos logit contributions
 spine+0.016
hy+0.013
ses+0.013
 grade+0.012
ro+0.012
Neg logit contributions
Joe-0.008
ull-0.008
Mom-0.008
ups-0.008
 decided-0.008
Token Ben
Token ID3932
Feature activation+1.45e-02

Pos logit contributions
 spine+0.037
ro+0.032
hy+0.031
ses+0.030
 grade+0.028
Neg logit contributions
ull-0.023
Joe-0.023
 them-0.021
Mom-0.021
add-0.020
Token had
Token ID550
Feature activation+2.42e-03

Pos logit contributions
ull+0.007
Joe+0.007
Mom+0.006
 them+0.006
Jess+0.006
Neg logit contributions
 spine-0.009
ro-0.008
ses-0.008
hy-0.007
 grade-0.007
Token solved
Token ID16019
Feature activation+7.98e-04

Pos logit contributions
 spine+0.004
ro+0.003
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.004
Joe-0.004
add-0.004
agles-0.004
Mom-0.004
Token her
Token ID607
Feature activation+7.03e-03

Pos logit contributions
 spine+0.002
ro+0.001
hy+0.001
ses+0.001
 grade+0.001
Neg logit contributions
ull-0.001
Joe-0.001
Mom-0.001
 them-0.001
Jess-0.001
Token problem
Token ID1917
Feature activation-6.70e-03

Pos logit contributions
 spine+0.015
ro+0.013
hy+0.013
ses+0.012
 grade+0.011
Neg logit contributions
ull-0.009
Joe-0.009
Mom-0.008
 them-0.008
add-0.008
Token and
Token ID290
Feature activation-2.19e-03

Pos logit contributions
ull+0.010
Joe+0.010
Mom+0.009
 them+0.009
ins+0.009
Neg logit contributions
 spine-0.013
ro-0.011
hy-0.011
ses-0.011
 grade-0.010
Token salv
Token ID24143
Feature activation-7.05e-02

Pos logit contributions
ull+0.003
Joe+0.003
 them+0.003
Mom+0.003
ups+0.003
Neg logit contributions
 spine-0.005
ro-0.004
hy-0.004
ses-0.004
 grade-0.004
Tokenaged
Token ID1886
Feature activation-2.35e-02

Pos logit contributions
ull+0.123
add+0.111
Joe+0.110
in+0.108
leys+0.107
Neg logit contributions
 spine-0.104
ro-0.097
ses-0.091
hy-0.089
tail-0.085
Token her
Token ID607
Feature activation-8.27e-03

Pos logit contributions
ull+0.033
Joe+0.032
Mom+0.029
add+0.028
Jess+0.028
Neg logit contributions
 spine-0.049
ro-0.043
hy-0.042
ses-0.040
 grade-0.038
Token l
Token ID300
Feature activation+3.56e-03

Pos logit contributions
 friends+0.012
Joe+0.011
 them+0.011
ull+0.011
Mom+0.011
Neg logit contributions
 spine-0.017
hy-0.014
ro-0.014
ses-0.013
 grade-0.013
Tokenimes
Token ID999
Feature activation+7.07e-03

Pos logit contributions
 spine+0.007
ro+0.006
hy+0.006
ses+0.006
 grade+0.005
Neg logit contributions
ull-0.005
Joe-0.005
Mom-0.005
 them-0.005
Jess-0.005
Token.
Token ID13
Feature activation-5.15e-04

Pos logit contributions
 spine+0.014
ro+0.012
hy+0.012
ses+0.011
 grade+0.010
Neg logit contributions
ull-0.011
Joe-0.010
Mom-0.010
Jess-0.009
add-0.009
Token the
Token ID262
Feature activation-5.06e-03

Pos logit contributions
ull+0.011
Joe+0.010
Mom+0.010
add+0.009
ins+0.009
Neg logit contributions
 spine-0.012
ro-0.010
hy-0.010
ses-0.010
 grade-0.009
Token little
Token ID1310
Feature activation+1.42e-03

Pos logit contributions
ull+0.006
Joe+0.006
Mom+0.005
add+0.005
 them+0.005
Neg logit contributions
 spine-0.012
ro-0.010
hy-0.010
ses-0.010
 grade-0.009
Token girl
Token ID2576
Feature activation-1.84e-03

Pos logit contributions
 spine+0.003
ro+0.003
hy+0.003
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.002
Joe-0.002
Mom-0.002
 them-0.002
add-0.002
Token and
Token ID290
Feature activation-2.49e-03

Pos logit contributions
ull+0.003
Joe+0.003
 them+0.003
in+0.002
Mom+0.002
Neg logit contributions
 spine-0.004
ro-0.003
hy-0.003
ses-0.003
 grade-0.003
Token w
Token ID266
Feature activation-9.18e-03

Pos logit contributions
ull+0.004
Joe+0.004
Mom+0.003
add+0.003
Jess+0.003
Neg logit contributions
 spine-0.005
ro-0.004
hy-0.004
ses-0.004
 grade-0.004
Tokenadd
Token ID2860
Feature activation-6.79e-02

Pos logit contributions
Mom+0.012
ull+0.012
Joe+0.011
Jess+0.011
Dad+0.011
Neg logit contributions
 spine-0.019
ro-0.017
ses-0.017
hy-0.016
sed-0.015
Tokenled
Token ID992
Feature activation-2.18e-02

Pos logit contributions
Mom+0.109
Joe+0.109
ull+0.107
Jess+0.105
Dad+0.100
Neg logit contributions
 spine-0.122
ro-0.108
hy-0.102
ses-0.097
 mun-0.088
Token over
Token ID625
Feature activation-1.21e-02

Pos logit contributions
ull+0.034
Joe+0.032
Mom+0.030
add+0.029
Jess+0.029
Neg logit contributions
 spine-0.043
ro-0.037
hy-0.036
ses-0.035
 grade-0.032
Token to
Token ID284
Feature activation-1.49e-02

Pos logit contributions
ull+0.021
Joe+0.020
 them+0.019
Mom+0.019
Jess+0.018
Neg logit contributions
 spine-0.022
ro-0.018
hy-0.018
ses-0.017
 grade-0.016
Token her
Token ID607
Feature activation-5.24e-03

Pos logit contributions
ull+0.021
Joe+0.021
 them+0.021
Mom+0.019
Jess+0.019
Neg logit contributions
 spine-0.032
ro-0.028
hy-0.027
ses-0.026
 grade-0.024
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
ull+0.006
Joe+0.006
 them+0.006
Mom+0.005
in+0.005
Neg logit contributions
 spine-0.012
ro-0.010
hy-0.010
ses-0.010
 grade-0.010
Token pond
Token ID16723
Feature activation+1.17e-02

Pos logit contributions
ull+0.021
Joe+0.021
Mom+0.020
 them+0.019
in+0.019
Neg logit contributions
 spine-0.037
ro-0.032
hy-0.031
ses-0.029
 grade-0.028
Token with
Token ID351
Feature activation-2.66e-02

Pos logit contributions
 spine+0.021
ro+0.017
hy+0.016
ses+0.016
 grade+0.016
Neg logit contributions
ull-0.021
Joe-0.020
Mom-0.019
 them-0.019
in-0.018
Token many
Token ID867
Feature activation-3.73e-02

Pos logit contributions
ull+0.038
agles+0.037
Joe+0.036
ins+0.036
leys+0.036
Neg logit contributions
 spine-0.045
hy-0.045
ro-0.044
ses-0.042
 l-0.038
Token ducks
Token ID39694
Feature activation-1.78e-02

Pos logit contributions
ull+0.055
Jess+0.053
agles+0.052
ator+0.052
Mill+0.051
Neg logit contributions
ro-0.061
 l-0.059
 spine-0.057
hy-0.054
ses-0.052
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
Jess+0.033
Joe+0.033
Mom+0.032
ull+0.032
agles+0.032
Neg logit contributions
 spine-0.026
ro-0.023
hy-0.021
ses-0.021
 lick-0.018
Token sw
Token ID1509
Feature activation-6.67e-02

Pos logit contributions
ull+0.015
Joe+0.014
in+0.013
agles+0.013
Jess+0.013
Neg logit contributions
 spine-0.031
ro-0.028
hy-0.028
ses-0.027
 mun-0.025
Tokenans
Token ID504
Feature activation-1.62e-02

Pos logit contributions
Mom+0.072
Jess+0.071
ull+0.070
Joe+0.070
Daddy+0.066
Neg logit contributions
 spine-0.155
ro-0.145
ses-0.140
hy-0.133
sed-0.129
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
Jess+0.030
Joe+0.030
ull+0.029
Mom+0.029
Dad+0.027
Neg logit contributions
 spine-0.026
hy-0.022
ro-0.022
ses-0.020
 mun-0.018
Token Sara
Token ID24799
Feature activation+8.88e-03

Pos logit contributions
ull+0.023
Joe+0.022
agles+0.021
in+0.021
ins+0.020
Neg logit contributions
 spine-0.023
ro-0.020
hy-0.019
ses-0.018
 grade-0.016
Token liked
Token ID8288
Feature activation-1.07e-02

Pos logit contributions
 spine+0.014
ro+0.012
hy+0.011
ses+0.011
 grade+0.010
Neg logit contributions
ull-0.017
Joe-0.016
Mom-0.015
add-0.015
agles-0.015
Token the
Token ID262
Feature activation-8.07e-03

Pos logit contributions
ull+0.019
Joe+0.019
Mom+0.019
Jess+0.018
ins+0.018
Neg logit contributions
 spine-0.016
ro-0.014
hy-0.014
ses-0.013
 lick-0.012
Token in
Token ID287
Feature activation+5.24e-03

Pos logit contributions
ull+0.017
Joe+0.017
Mom+0.016
Jess+0.016
ins+0.016
Neg logit contributions
 spine-0.018
ro-0.015
hy-0.015
ses-0.014
 grade-0.012
Token the
Token ID262
Feature activation+6.05e-04

Pos logit contributions
 spine+0.012
hy+0.010
ro+0.010
ses+0.010
 grade+0.009
Neg logit contributions
Joe-0.007
 them-0.007
ull-0.007
Mom-0.006
Jess-0.006
Token water
Token ID1660
Feature activation-5.62e-04

Pos logit contributions
 spine+0.001
ro+0.001
hy+0.001
ses+0.001
 lap+0.001
Neg logit contributions
ull-0.001
Joe-0.001
ins-0.001
Jess-0.001
add-0.001
Token and
Token ID290
Feature activation+2.18e-04

Pos logit contributions
ull+0.001
Joe+0.001
Mom+0.001
Jess+0.001
add+0.001
Neg logit contributions
 spine-0.001
ro-0.001
hy-0.001
ses-0.001
 grade-0.001
Token w
Token ID266
Feature activation-6.85e-03

Pos logit contributions
 spine+0.000
hy+0.000
ro+0.000
ses+0.000
 grade+0.000
Neg logit contributions
ull-0.000
Joe-0.000
 them-0.000
Mom-0.000
add-0.000
Tokenadd
Token ID2860
Feature activation-6.64e-02

Pos logit contributions
Mom+0.007
ull+0.007
Joe+0.007
Jess+0.006
Dad+0.006
Neg logit contributions
 spine-0.016
ro-0.015
hy-0.014
ses-0.014
 mun-0.013
Tokenled
Token ID992
Feature activation-2.16e-02

Pos logit contributions
ull+0.099
Joe+0.099
Mom+0.097
Jess+0.094
Dad+0.089
Neg logit contributions
 spine-0.127
ro-0.115
hy-0.107
ses-0.102
 mun-0.092
Token on
Token ID319
Feature activation-1.69e-02

Pos logit contributions
ull+0.035
Joe+0.034
Mom+0.032
Jess+0.031
add+0.031
Neg logit contributions
 spine-0.041
ro-0.035
hy-0.034
ses-0.033
 grade-0.030
Token the
Token ID262
Feature activation+7.73e-04

Pos logit contributions
ull+0.029
Joe+0.027
ins+0.026
add+0.026
agles+0.025
Neg logit contributions
 spine-0.030
ro-0.027
hy-0.025
ses-0.024
 grade-0.022
Token land
Token ID1956
Feature activation+9.62e-03

Pos logit contributions
 spine+0.002
ro+0.002
hy+0.002
ses+0.001
 grade+0.001
Neg logit contributions
ull-0.001
Joe-0.001
ins-0.001
Jess-0.001
Mom-0.001
Token.
Token ID13
Feature activation-3.03e-03

Pos logit contributions
 spine+0.019
ro+0.016
hy+0.016
ses+0.015
 grade+0.014
Neg logit contributions
ull-0.015
Joe-0.015
Mom-0.014
add-0.013
Jess-0.013
Token was
Token ID373
Feature activation-6.30e-03

Pos logit contributions
ull+0.013
Joe+0.012
Mom+0.011
 them+0.011
Jess+0.011
Neg logit contributions
 spine-0.016
ro-0.013
hy-0.013
ses-0.013
 grade-0.012
Token so
Token ID523
Feature activation-3.39e-03

Pos logit contributions
ull+0.010
Joe+0.010
Mom+0.009
agles+0.009
ins+0.009
Neg logit contributions
 spine-0.011
hy-0.010
ro-0.010
ses-0.009
 grade-0.008
Token excited
Token ID6568
Feature activation-1.13e-02

Pos logit contributions
agles+0.006
ull+0.006
Joe+0.006
ups+0.006
Mom+0.005
Neg logit contributions
 spine-0.005
hy-0.005
ro-0.005
ses-0.004
 grade-0.004
Token and
Token ID290
Feature activation-1.84e-02

Pos logit contributions
Joe+0.017
ull+0.017
 them+0.017
ups+0.015
ins+0.015
Neg logit contributions
 spine-0.023
ses-0.020
ro-0.019
hy-0.019
 grade-0.017
Token w
Token ID266
Feature activation-1.82e-03

Pos logit contributions
ull+0.024
Joe+0.023
Mom+0.021
 them+0.021
Jess+0.020
Neg logit contributions
 spine-0.041
ro-0.036
hy-0.035
ses-0.034
 grade-0.032
Tokenadd
Token ID2860
Feature activation-6.64e-02

Pos logit contributions
ull+0.002
Joe+0.001
Mom+0.001
Jess+0.001
 them+0.001
Neg logit contributions
 spine-0.005
ro-0.004
hy-0.004
ses-0.004
 grade-0.004
Tokenled
Token ID992
Feature activation-2.17e-02

Pos logit contributions
ull+0.108
Joe+0.106
Mom+0.104
Jess+0.099
Dad+0.095
Neg logit contributions
 spine-0.119
ro-0.106
hy-0.100
ses-0.094
 grade-0.085
Token over
Token ID625
Feature activation-1.09e-02

Pos logit contributions
Joe+0.033
ull+0.033
 them+0.033
Mom+0.029
Jess+0.029
Neg logit contributions
 spine-0.043
ro-0.037
hy-0.036
ses-0.036
 grade-0.033
Token to
Token ID284
Feature activation-1.68e-02

Pos logit contributions
Joe+0.018
 them+0.018
ull+0.017
Mom+0.016
in+0.015
Neg logit contributions
 spine-0.021
ses-0.018
hy-0.018
ro-0.017
 laptop-0.015
Token see
Token ID766
Feature activation-1.29e-02

Pos logit contributions
ull+0.026
Joe+0.026
 them+0.026
Mom+0.024
add+0.023
Neg logit contributions
 spine-0.034
ro-0.028
hy-0.028
ses-0.027
 grade-0.024
Token what
Token ID644
Feature activation-5.02e-03

Pos logit contributions
ull+0.021
Joe+0.019
Mom+0.019
agles+0.019
add+0.018
Neg logit contributions
 spine-0.022
hy-0.020
ro-0.020
ses-0.018
 lick-0.017
Token he
Token ID339
Feature activation-1.19e-02

Pos logit contributions
ull+0.012
Joe+0.011
 them+0.011
Mom+0.009
 friends+0.009
Neg logit contributions
 spine-0.027
ro-0.023
hy-0.023
ses-0.023
 grade-0.022
Token qu
Token ID627
Feature activation-1.54e-02

Pos logit contributions
ull+0.020
Joe+0.019
agles+0.019
Jess+0.019
ins+0.019
Neg logit contributions
 spine-0.020
ro-0.018
hy-0.016
ses-0.016
 mun-0.016
Tokenacked
Token ID6021
Feature activation-2.58e-03

Pos logit contributions
ull+0.024
Joe+0.024
Jess+0.024
 them+0.023
Mom+0.023
Neg logit contributions
 spine-0.026
ro-0.024
hy-0.022
ses-0.020
 mun-0.019
Token and
Token ID290
Feature activation+5.70e-03

Pos logit contributions
ull+0.004
Joe+0.004
Mom+0.004
 them+0.004
add+0.004
Neg logit contributions
 spine-0.005
ro-0.004
hy-0.004
ses-0.004
 grade-0.004
Token w
Token ID266
Feature activation-1.04e-03

Pos logit contributions
 spine+0.011
ro+0.010
hy+0.010
ses+0.009
 grade+0.009
Neg logit contributions
ull-0.009
Joe-0.009
 them-0.008
Mom-0.008
Jess-0.007
Tokenadd
Token ID2860
Feature activation-6.61e-02

Pos logit contributions
ull+0.001
Joe+0.001
Mom+0.001
Jess+0.001
 them+0.001
Neg logit contributions
 spine-0.002
ro-0.002
hy-0.002
ses-0.002
 grade-0.002
Tokenled
Token ID992
Feature activation-1.81e-02

Pos logit contributions
ull+0.100
Joe+0.100
Mom+0.096
Jess+0.092
 them+0.088
Neg logit contributions
 spine-0.128
ro-0.112
hy-0.108
ses-0.103
 grade-0.094
Token all
Token ID477
Feature activation-1.79e-02

Pos logit contributions
ull+0.029
Joe+0.029
 them+0.027
Mom+0.026
Jess+0.025
Neg logit contributions
 spine-0.034
ro-0.029
hy-0.029
ses-0.028
 grade-0.026
Token around
Token ID1088
Feature activation-1.36e-02

Pos logit contributions
ull+0.029
Joe+0.029
Mom+0.027
Jess+0.027
agles+0.027
Neg logit contributions
 spine-0.033
ro-0.028
hy-0.027
ses-0.025
 grade-0.024
Token the
Token ID262
Feature activation-1.10e-02

Pos logit contributions
 them+0.017
ull+0.016
Joe+0.016
in+0.013
Mom+0.013
Neg logit contributions
 spine-0.032
hy-0.029
ses-0.028
ro-0.028
 grade-0.026
Token yard
Token ID12699
Feature activation-1.08e-02

Pos logit contributions
ull+0.017
Joe+0.016
add+0.015
ins+0.015
agles+0.015
Neg logit contributions
 spine-0.022
ro-0.019
hy-0.018
ses-0.017
 grade-0.016
Token pond
Token ID16723
Feature activation+1.48e-02

Pos logit contributions
ull+0.022
Joe+0.022
Mom+0.020
 them+0.020
in+0.019
Neg logit contributions
 spine-0.036
ro-0.032
hy-0.030
ses-0.030
 grade-0.028
Token with
Token ID351
Feature activation-2.47e-02

Pos logit contributions
 spine+0.026
ro+0.021
hy+0.021
ses+0.021
 grade+0.019
Neg logit contributions
ull-0.027
Joe-0.026
Mom-0.024
 them-0.024
in-0.023
Token many
Token ID867
Feature activation-3.56e-02

Pos logit contributions
ull+0.032
agles+0.032
leys+0.030
ins+0.030
Joe+0.030
Neg logit contributions
 spine-0.044
hy-0.044
ro-0.043
ses-0.041
 l-0.038
Token ducks
Token ID39694
Feature activation-1.53e-02

Pos logit contributions
ull+0.051
agles+0.049
Jess+0.048
ator+0.047
Mom+0.047
Neg logit contributions
ro-0.059
 l-0.057
 spine-0.055
hy-0.052
 mun-0.049
Token and
Token ID290
Feature activation-1.37e-02

Pos logit contributions
Jess+0.028
Mom+0.028
Joe+0.027
ull+0.027
agles+0.026
Neg logit contributions
 spine-0.023
ro-0.020
hy-0.018
ses-0.018
 mun-0.016
Token sw
Token ID1509
Feature activation-6.60e-02

Pos logit contributions
ull+0.015
Joe+0.014
agles+0.013
in+0.013
Jess+0.013
Neg logit contributions
 spine-0.031
ro-0.028
hy-0.028
ses-0.027
 mun-0.025
Tokenans
Token ID504
Feature activation-1.49e-02

Pos logit contributions
Mom+0.075
Jess+0.074
ull+0.070
Joe+0.070
Daddy+0.068
Neg logit contributions
 spine-0.150
ro-0.142
ses-0.135
hy-0.127
sed-0.126
Token.
Token ID13
Feature activation-5.60e-03

Pos logit contributions
Jess+0.027
Joe+0.027
Mom+0.027
ull+0.026
Dad+0.025
Neg logit contributions
 spine-0.023
hy-0.021
ro-0.020
ses-0.018
 mun-0.016
Token Lily
Token ID20037
Feature activation-1.41e-04

Pos logit contributions
ull+0.011
agles+0.010
Joe+0.010
Jess+0.010
eper+0.010
Neg logit contributions
 spine-0.009
ro-0.008
 Whe-0.007
hy-0.007
ses-0.007
Token liked
Token ID8288
Feature activation-9.59e-03

Pos logit contributions
ull+0.000
Joe+0.000
agles+0.000
add+0.000
Mom+0.000
Neg logit contributions
 spine-0.000
ro-0.000
hy-0.000
ses-0.000
 grade-0.000
Token the
Token ID262
Feature activation-7.70e-03

Pos logit contributions
ull+0.017
Joe+0.017
Mom+0.017
Jess+0.016
ins+0.016
Neg logit contributions
 spine-0.015
ro-0.013
hy-0.012
ses-0.012
 lick-0.010
Token mom
Token ID1995
Feature activation+6.70e-03

Pos logit contributions
ull+0.004
Joe+0.004
ins+0.004
Jess+0.004
Mom+0.004
Neg logit contributions
 spine-0.010
ro-0.009
hy-0.009
ses-0.009
 grade-0.008
Tokenm
Token ID76
Feature activation-2.43e-02

Pos logit contributions
 spine+0.011
ro+0.010
hy+0.010
ses+0.009
 grade+0.008
Neg logit contributions
ull-0.012
Joe-0.011
Jess-0.011
add-0.011
Mom-0.010
Tokenies
Token ID444
Feature activation+1.64e-03

Pos logit contributions
Joe+0.030
Jess+0.029
Mom+0.028
ull+0.027
agles+0.025
Neg logit contributions
 spine-0.053
hy-0.047
ro-0.046
ses-0.045
 grade-0.042
Token and
Token ID290
Feature activation-2.40e-03

Pos logit contributions
 spine+0.003
ro+0.002
hy+0.002
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.003
Joe-0.003
Mom-0.003
Jess-0.003
add-0.003
Token d
Token ID288
Feature activation-2.65e-02

Pos logit contributions
ull+0.003
Joe+0.003
add+0.003
agles+0.003
Mom+0.003
Neg logit contributions
 spine-0.005
hy-0.004
ro-0.004
ses-0.004
 grade-0.004
Tokenadd
Token ID2860
Feature activation-6.57e-02

Pos logit contributions
ull+0.034
Jess+0.033
Joe+0.033
Mom+0.031
Dad+0.029
Neg logit contributions
 spine-0.056
ro-0.053
hy-0.050
ses-0.044
 grade-0.043
Tokenies
Token ID444
Feature activation-5.29e-03

Pos logit contributions
ull+0.073
Joe+0.068
 them+0.062
ins+0.062
add+0.060
Neg logit contributions
 spine-0.159
ro-0.139
hy-0.137
ses-0.135
 grade-0.128
Token,
Token ID11
Feature activation+1.92e-03

Pos logit contributions
ull+0.008
Joe+0.008
 them+0.008
in+0.007
Mom+0.007
Neg logit contributions
 spine-0.010
ro-0.009
hy-0.008
ses-0.008
 grade-0.008
Token who
Token ID508
Feature activation-1.02e-02

Pos logit contributions
 spine+0.004
ro+0.003
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.003
 them-0.003
Joe-0.003
 friends-0.003
Mom-0.003
Token cl
Token ID537
Feature activation-5.76e-03

Pos logit contributions
ull+0.016
Joe+0.015
 them+0.014
in+0.014
Mom+0.014
Neg logit contributions
 spine-0.021
ro-0.018
ses-0.017
hy-0.017
 grade-0.016
Tokenapped
Token ID6320
Feature activation-6.14e-03

Pos logit contributions
ull+0.010
Joe+0.009
Jess+0.009
Mom+0.009
 them+0.009
Neg logit contributions
 spine-0.010
ro-0.009
hy-0.009
ses-0.008
 grade-0.007
Token a
Token ID257
Feature activation-1.20e-03

Pos logit contributions
 spine+0.010
ro+0.009
hy+0.009
ses+0.008
 grade+0.007
Neg logit contributions
ull-0.010
Joe-0.010
ins-0.009
agles-0.009
add-0.009
Token small
Token ID1402
Feature activation-3.72e-03

Pos logit contributions
ull+0.002
Joe+0.002
Mom+0.001
 them+0.001
Jess+0.001
Neg logit contributions
 spine-0.003
ro-0.002
hy-0.002
ses-0.002
 grade-0.002
Token yellow
Token ID7872
Feature activation-9.67e-03

Pos logit contributions
ull+0.005
Joe+0.005
Mom+0.005
in+0.005
 them+0.004
Neg logit contributions
 spine-0.008
ro-0.007
hy-0.007
ses-0.006
 grade-0.006
Token duck
Token ID22045
Feature activation-3.05e-03

Pos logit contributions
ull+0.012
Joe+0.011
Mom+0.010
 them+0.010
in+0.010
Neg logit contributions
 spine-0.022
ro-0.019
hy-0.019
ses-0.018
 grade-0.017
Token w
Token ID266
Feature activation-7.99e-03

Pos logit contributions
Joe+0.006
ull+0.005
Jess+0.005
ups+0.005
Mom+0.005
Neg logit contributions
 spine-0.004
ro-0.004
hy-0.004
ses-0.003
 mun-0.003
Tokenadd
Token ID2860
Feature activation-6.34e-02

Pos logit contributions
Mom+0.006
Joe+0.006
Jess+0.005
ull+0.005
ups+0.005
Neg logit contributions
 spine-0.021
ro-0.020
ses-0.019
hy-0.019
sed-0.018
Tokenled
Token ID992
Feature activation-9.75e-03

Pos logit contributions
Mom+0.078
Joe+0.077
Jess+0.074
Dad+0.069
ull+0.068
Neg logit contributions
 spine-0.136
ro-0.127
hy-0.120
ses-0.114
 mun-0.107
Token around
Token ID1088
Feature activation-4.66e-03

Pos logit contributions
ull+0.016
Joe+0.016
Mom+0.015
Jess+0.014
add+0.014
Neg logit contributions
 spine-0.018
ro-0.015
hy-0.015
ses-0.014
 grade-0.013
Token the
Token ID262
Feature activation-6.91e-03

Pos logit contributions
 them+0.006
ull+0.006
Joe+0.006
 us+0.005
Mom+0.005
Neg logit contributions
 spine-0.011
hy-0.010
ro-0.009
ses-0.009
 grade-0.009
Token pond
Token ID16723
Feature activation+3.13e-03

Pos logit contributions
ull+0.011
Joe+0.011
ins+0.011
agles+0.011
Jess+0.011
Neg logit contributions
 spine-0.012
ro-0.011
hy-0.010
ses-0.009
 shelf-0.009
Token looking
Token ID2045
Feature activation-1.49e-02

Pos logit contributions
 spine+0.006
ro+0.006
hy+0.005
ses+0.005
 grade+0.005
Neg logit contributions
ull-0.005
Joe-0.004
Mom-0.004
 them-0.004
Jess-0.004
Token to
Token ID284
Feature activation+5.61e-03

Pos logit contributions
 spine+0.040
ses+0.038
 grade+0.038
 laptop+0.038
hy+0.037
Neg logit contributions
 them-0.025
Joe-0.025
ull-0.025
"-0.024
 friends-0.023
Token the
Token ID262
Feature activation+3.54e-03

Pos logit contributions
 spine+0.014
ro+0.013
hy+0.013
ses+0.013
sed+0.012
Neg logit contributions
 them-0.006
ull-0.006
Joe-0.005
 friends-0.005
 us-0.005
Token billboard
Token ID41448
Feature activation+9.17e-03

Pos logit contributions
 spine+0.008
ro+0.007
hy+0.007
ses+0.007
 grade+0.007
Neg logit contributions
ull-0.004
Joe-0.004
Mom-0.004
 them-0.004
Jess-0.004
Token and
Token ID290
Feature activation+4.12e-03

Pos logit contributions
 spine+0.019
ses+0.017
ro+0.017
hy+0.016
 grade+0.016
Neg logit contributions
ull-0.012
 them-0.011
Joe-0.011
 friends-0.010
in-0.010
Token w
Token ID266
Feature activation+1.25e-03

Pos logit contributions
 spine+0.009
ro+0.008
hy+0.008
ses+0.008
 grade+0.007
Neg logit contributions
ull-0.005
Joe-0.005
 them-0.005
Mom-0.005
Jess-0.005
Tokenadd
Token ID2860
Feature activation-6.33e-02

Pos logit contributions
 spine+0.003
ro+0.003
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.001
Joe-0.001
add-0.001
Mom-0.001
 them-0.001
Tokenled
Token ID992
Feature activation-2.32e-02

Pos logit contributions
ull+0.096
Joe+0.094
Mom+0.089
Jess+0.086
 them+0.084
Neg logit contributions
 spine-0.123
ro-0.108
hy-0.104
ses-0.099
 grade-0.092
Token away
Token ID1497
Feature activation-3.47e-03

Pos logit contributions
ull+0.031
Joe+0.031
 them+0.029
Mom+0.028
Jess+0.027
Neg logit contributions
 spine-0.050
ro-0.044
hy-0.043
ses-0.042
 grade-0.040
Token,
Token ID11
Feature activation+4.10e-03

Pos logit contributions
ull+0.004
Joe+0.004
 them+0.004
in+0.004
Jess+0.004
Neg logit contributions
 spine-0.008
ses-0.007
ro-0.007
hy-0.007
 grade-0.006
Token feeling
Token ID4203
Feature activation+6.31e-03

Pos logit contributions
 spine+0.008
ro+0.007
ses+0.007
hy+0.007
 grade+0.006
Neg logit contributions
Joe-0.006
ull-0.006
 them-0.006
Mom-0.006
Jon-0.005
Token content
Token ID2695
Feature activation+8.91e-03

Pos logit contributions
 spine+0.014
ro+0.012
ses+0.011
hy+0.011
 grade+0.010
Neg logit contributions
ull-0.009
Joe-0.009
 them-0.008
Mom-0.008
in-0.008
Token of
Token ID286
Feature activation+2.57e-03

Pos logit contributions
 spine+0.012
ro+0.010
hy+0.010
ses+0.009
 grade+0.009
Neg logit contributions
ull-0.011
Joe-0.010
add-0.010
agles-0.010
Jess-0.010
Token m
Token ID285
Feature activation-1.95e-02

Pos logit contributions
 spine+0.006
ro+0.005
ses+0.005
hy+0.005
 grade+0.005
Neg logit contributions
ull-0.003
Joe-0.003
ups-0.003
Mom-0.003
Jess-0.003
Tokenummies
Token ID39578
Feature activation-7.13e-03

Pos logit contributions
ull+0.025
 them+0.024
Joe+0.024
Mom+0.022
Jess+0.022
Neg logit contributions
 spine-0.041
hy-0.037
ro-0.032
ses-0.031
sing-0.030
Token,
Token ID11
Feature activation+2.52e-03

Pos logit contributions
ull+0.011
Joe+0.011
Mom+0.010
add+0.010
Jess+0.010
Neg logit contributions
 spine-0.013
ro-0.012
hy-0.011
ses-0.011
 grade-0.010
Token d
Token ID288
Feature activation-1.85e-02

Pos logit contributions
 spine+0.005
hy+0.004
ro+0.004
ses+0.004
 grade+0.003
Neg logit contributions
ull-0.004
Joe-0.004
ups-0.004
agles-0.004
ins-0.004
Tokenadd
Token ID2860
Feature activation-6.30e-02

Pos logit contributions
ull+0.029
Joe+0.028
Mom+0.027
Jess+0.026
 them+0.025
Neg logit contributions
 spine-0.036
ro-0.032
hy-0.030
ses-0.028
 grade-0.026
Tokenies
Token ID444
Feature activation+7.28e-03

Pos logit contributions
Joe+0.053
ull+0.053
Mom+0.050
Jess+0.046
Dad+0.043
Neg logit contributions
 spine-0.162
ro-0.148
hy-0.143
ses-0.137
 grade-0.129
Token and
Token ID290
Feature activation-4.58e-03

Pos logit contributions
 spine+0.014
ro+0.012
hy+0.012
ses+0.011
 grade+0.011
Neg logit contributions
ull-0.011
Joe-0.011
Mom-0.010
 them-0.010
Jess-0.010
Token children
Token ID1751
Feature activation-1.14e-02

Pos logit contributions
ull+0.006
Joe+0.006
add+0.006
Mom+0.006
in+0.006
Neg logit contributions
 spine-0.009
ro-0.008
hy-0.008
ses-0.008
 grade-0.007
Token.
Token ID13
Feature activation+6.12e-03

Pos logit contributions
ull+0.019
Joe+0.018
Mom+0.017
agles+0.017
Jess+0.017
Neg logit contributions
 spine-0.020
hy-0.018
ro-0.018
ses-0.016
 grade-0.015
Token The
Token ID383
Feature activation+2.43e-04

Pos logit contributions
 spine+0.012
ro+0.010
hy+0.010
ses+0.009
 grade+0.009
Neg logit contributions
ull-0.010
Joe-0.010
Mom-0.009
 them-0.009
add-0.009
Token girl
Token ID2576
Feature activation+5.21e-03

Pos logit contributions
 spine+0.003
ro+0.003
hy+0.002
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.002
Joe-0.002
add-0.002
ins-0.002
Mom-0.002
Token's
Token ID338
Feature activation-2.69e-03

Pos logit contributions
 spine+0.010
ro+0.009
ses+0.009
hy+0.009
 grade+0.009
Neg logit contributions
ull-0.007
Joe-0.007
 them-0.006
Mom-0.006
 friends-0.006
Token shoulder
Token ID8163
Feature activation+1.14e-02

Pos logit contributions
ull+0.004
Joe+0.004
Mom+0.004
 them+0.004
 friends+0.004
Neg logit contributions
 spine-0.005
ro-0.004
hy-0.004
ses-0.004
 grade-0.004
Token and
Token ID290
Feature activation+2.35e-03

Pos logit contributions
 spine+0.022
ses+0.019
ro+0.018
hy+0.018
 grade+0.017
Neg logit contributions
ull-0.016
Joe-0.016
 them-0.016
Mom-0.015
ups-0.015
Token w
Token ID266
Feature activation+4.57e-03

Pos logit contributions
 spine+0.005
ro+0.004
hy+0.004
ses+0.004
 grade+0.004
Neg logit contributions
ull-0.004
Joe-0.004
Mom-0.003
 them-0.003
Jess-0.003
Tokenadd
Token ID2860
Feature activation-6.27e-02

Pos logit contributions
 spine+0.010
ro+0.008
hy+0.008
ses+0.008
 grade+0.007
Neg logit contributions
ull-0.007
Joe-0.006
Mom-0.006
add-0.006
 them-0.006
Tokenled
Token ID992
Feature activation-1.59e-02

Pos logit contributions
ull+0.088
Joe+0.085
Mom+0.079
Jess+0.076
 them+0.076
Neg logit contributions
 spine-0.131
ro-0.115
hy-0.112
ses-0.107
 grade-0.101
Token over
Token ID625
Feature activation-4.72e-03

Pos logit contributions
ull+0.023
Joe+0.023
 them+0.022
Mom+0.021
Jess+0.020
Neg logit contributions
 spine-0.033
ro-0.028
hy-0.028
ses-0.027
 grade-0.025
Token to
Token ID284
Feature activation-1.14e-02

Pos logit contributions
Joe+0.007
ull+0.007
 them+0.007
Mom+0.007
in+0.007
Neg logit contributions
 spine-0.009
ro-0.008
hy-0.008
ses-0.008
 grade-0.007
Token see
Token ID766
Feature activation-3.84e-03

Pos logit contributions
ull+0.016
Joe+0.016
 them+0.015
Mom+0.014
Jess+0.014
Neg logit contributions
 spine-0.025
ro-0.021
hy-0.021
ses-0.020
 grade-0.018
Token the
Token ID262
Feature activation+1.74e-04

Pos logit contributions
ull+0.006
Joe+0.006
add+0.005
Mom+0.005
agles+0.005
Neg logit contributions
 spine-0.007
hy-0.006
ro-0.006
ses-0.006
 grade-0.006
Token mom
Token ID1995
Feature activation+4.49e-03

Pos logit contributions
ull+0.009
Joe+0.008
 them+0.008
Mom+0.008
agles+0.008
Neg logit contributions
 spine-0.013
ro-0.012
hy-0.011
ses-0.011
 grade-0.010
Tokenm
Token ID76
Feature activation-1.99e-02

Pos logit contributions
 spine+0.010
ro+0.009
hy+0.008
ses+0.008
 grade+0.008
Neg logit contributions
ull-0.006
Joe-0.006
Mom-0.005
 them-0.005
ins-0.005
Tokenies
Token ID444
Feature activation+7.96e-03

Pos logit contributions
Joe+0.028
ull+0.028
Mom+0.026
Jess+0.025
agles+0.024
Neg logit contributions
 spine-0.041
hy-0.036
ro-0.036
ses-0.034
 grade-0.031
Token and
Token ID290
Feature activation-2.53e-03

Pos logit contributions
 spine+0.015
ro+0.013
hy+0.012
ses+0.012
 grade+0.011
Neg logit contributions
ull-0.013
Joe-0.013
Mom-0.012
add-0.011
Jess-0.011
Token d
Token ID288
Feature activation-2.50e-02

Pos logit contributions
ull+0.004
Joe+0.003
Mom+0.003
add+0.003
Jess+0.003
Neg logit contributions
 spine-0.005
ro-0.005
hy-0.005
ses-0.004
 grade-0.004
Tokenadd
Token ID2860
Feature activation-6.27e-02

Pos logit contributions
ull+0.035
Joe+0.033
Jess+0.031
Mom+0.030
Dad+0.028
Neg logit contributions
 spine-0.053
ro-0.049
hy-0.048
ses-0.042
 grade-0.041
Tokenies
Token ID444
Feature activation-3.62e-03

Pos logit contributions
ull+0.075
Joe+0.072
Mom+0.065
Jess+0.063
 them+0.063
Neg logit contributions
 spine-0.144
ro-0.128
hy-0.125
ses-0.121
 grade-0.114
Token.
Token ID13
Feature activation-4.04e-03

Pos logit contributions
ull+0.006
Joe+0.006
Mom+0.005
 them+0.005
Jess+0.005
Neg logit contributions
 spine-0.007
ro-0.006
hy-0.006
ses-0.005
 grade-0.005
Token They
Token ID1119
Feature activation-3.16e-03

Pos logit contributions
ull+0.007
Joe+0.007
agles+0.006
Mom+0.006
ins+0.006
Neg logit contributions
 spine-0.007
ro-0.006
hy-0.006
ses-0.005
 grade-0.005
Token were
Token ID547
Feature activation-1.43e-03

Pos logit contributions
ull+0.004
Joe+0.004
 them+0.004
Mom+0.004
Jess+0.004
Neg logit contributions
 spine-0.007
ro-0.006
ses-0.006
hy-0.006
 grade-0.005
Token all
Token ID477
Feature activation-4.80e-03

Pos logit contributions
ull+0.002
Joe+0.002
Mom+0.002
agles+0.002
in+0.002
Neg logit contributions
 spine-0.003
hy-0.002
ro-0.002
ses-0.002
 grade-0.002
Token some
Token ID617
Feature activation-1.60e-02

Pos logit contributions
ull+0.007
Joe+0.007
ins+0.007
Jess+0.007
leys+0.006
Neg logit contributions
 spine-0.008
ro-0.007
hy-0.007
ses-0.007
 lick-0.007
Token bread
Token ID8509
Feature activation+2.51e-03

Pos logit contributions
Joe+0.022
ull+0.022
Jess+0.020
Mom+0.019
leys+0.019
Neg logit contributions
 spine-0.031
ro-0.029
hy-0.028
ses-0.026
 lick-0.025
Token to
Token ID284
Feature activation-1.32e-02

Pos logit contributions
 spine+0.004
ro+0.003
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.005
Joe-0.005
Jess-0.004
add-0.004
Mom-0.004
Token feed
Token ID3745
Feature activation-1.72e-03

Pos logit contributions
ull+0.019
Joe+0.018
Mom+0.017
Jess+0.016
add+0.016
Neg logit contributions
 spine-0.027
ro-0.024
hy-0.023
ses-0.022
 grade-0.021
Token the
Token ID262
Feature activation-2.67e-02

Pos logit contributions
ull+0.002
Joe+0.002
add+0.002
Jess+0.002
ins+0.002
Neg logit contributions
 spine-0.003
ro-0.003
hy-0.003
ses-0.003
 grade-0.003
Token sw
Token ID1509
Feature activation-6.13e-02

Pos logit contributions
irs+0.033
Jess+0.033
Joe+0.032
leys+0.032
ull+0.032
Neg logit contributions
 spine-0.054
ro-0.051
ses-0.049
hy-0.048
sed-0.045
Tokenan
Token ID272
Feature activation-1.73e-02

Pos logit contributions
Joe+0.065
ull+0.063
Jess+0.061
Mom+0.060
 them+0.053
Neg logit contributions
 spine-0.148
ro-0.136
ses-0.127
hy-0.126
sed-0.115
Token?"
Token ID1701
Feature activation+3.15e-04

Pos logit contributions
ull+0.021
Joe+0.020
Mom+0.018
Jess+0.018
 them+0.018
Neg logit contributions
 spine-0.040
ro-0.035
hy-0.034
ses-0.033
 grade-0.031
Token\n
Token ID198
Feature activation-1.33e-02

Pos logit contributions
 spine+0.001
ro+0.000
hy+0.000
ses+0.000
 grade+0.000
Neg logit contributions
ull-0.001
Joe-0.001
ins-0.000
agles-0.000
add-0.000
Token\n
Token ID198
Feature activation-1.72e-02

Pos logit contributions
ull+0.028
 tides+0.028
agles+0.028
ins+0.028
add+0.027
Neg logit contributions
ro-0.013
 spine-0.013
hy-0.011
 But-0.010
 Sure-0.010
TokenHer
Token ID9360
Feature activation-2.76e-03

Pos logit contributions
add+0.030
ull+0.030
ins+0.027
idy+0.027
agles+0.026
Neg logit contributions
ro-0.027
 spine-0.025
hy-0.024
tail-0.022
 lick-0.021
Token mom
Token ID1995
Feature activation+4.96e-03

Pos logit contributions
ull+0.015
Joe+0.014
Jess+0.013
ins+0.012
agles+0.012
Neg logit contributions
 spine-0.036
ro-0.032
hy-0.032
ses-0.031
 grade-0.029
Tokenm
Token ID76
Feature activation-2.50e-02

Pos logit contributions
 spine+0.008
ro+0.007
hy+0.007
ses+0.006
 grade+0.006
Neg logit contributions
ull-0.009
Joe-0.008
agles-0.008
add-0.008
Jess-0.008
Tokenies
Token ID444
Feature activation+4.29e-03

Pos logit contributions
Joe+0.032
ull+0.031
Jess+0.030
Mom+0.029
agles+0.028
Neg logit contributions
 spine-0.054
ro-0.047
hy-0.046
ses-0.045
 grade-0.042
Token and
Token ID290
Feature activation-4.58e-03

Pos logit contributions
 spine+0.008
ro+0.007
hy+0.007
ses+0.007
 grade+0.006
Neg logit contributions
ull-0.007
Joe-0.006
Mom-0.006
Jess-0.006
add-0.006
Token d
Token ID288
Feature activation-2.26e-02

Pos logit contributions
ull+0.005
Joe+0.005
add+0.005
Mom+0.004
Jess+0.004
Neg logit contributions
 spine-0.010
hy-0.009
ro-0.009
ses-0.009
 grade-0.009
Tokenadd
Token ID2860
Feature activation-6.10e-02

Pos logit contributions
ull+0.031
Joe+0.029
Jess+0.029
Mom+0.027
 anchors+0.025
Neg logit contributions
 spine-0.047
ro-0.044
hy-0.041
ses-0.036
 grade-0.036
Tokenies
Token ID444
Feature activation-1.08e-03

Pos logit contributions
ull+0.079
Joe+0.078
Mom+0.073
Jess+0.070
add+0.066
Neg logit contributions
 spine-0.132
ro-0.117
hy-0.114
ses-0.108
 grade-0.102
Token said
Token ID531
Feature activation-1.06e-02

Pos logit contributions
ull+0.002
Joe+0.002
 them+0.002
Mom+0.002
ins+0.001
Neg logit contributions
 spine-0.002
ro-0.002
hy-0.002
ses-0.002
 grade-0.002
Token,
Token ID11
Feature activation+6.88e-03

Pos logit contributions
ull+0.018
Joe+0.018
Mom+0.017
ins+0.017
agles+0.017
Neg logit contributions
 spine-0.017
hy-0.016
ro-0.015
ses-0.014
 grade-0.013
Token "
Token ID366
Feature activation-1.70e-03

Pos logit contributions
 spine+0.013
ro+0.011
hy+0.010
ses+0.010
 grade+0.009
Neg logit contributions
ull-0.011
Joe-0.011
Mom-0.010
 them-0.010
Jess-0.010
TokenToday
Token ID8888
Feature activation+1.26e-02

Pos logit contributions
ull+0.002
Joe+0.002
Mom+0.002
add+0.002
 them+0.002
Neg logit contributions
 spine-0.004
ro-0.003
hy-0.003
ses-0.003
 grade-0.003
Token curious
Token ID11040
Feature activation+8.79e-03

Pos logit contributions
 spine+0.009
ro+0.008
hy+0.008
ses+0.008
 grade+0.007
Neg logit contributions
ull-0.009
agles-0.009
Joe-0.009
Mom-0.008
add-0.008
Token,
Token ID11
Feature activation+2.23e-03

Pos logit contributions
 spine+0.016
ro+0.013
ses+0.013
hy+0.013
 grade+0.012
Neg logit contributions
ull-0.015
Joe-0.015
 them-0.014
Mom-0.013
in-0.013
Token so
Token ID523
Feature activation-2.67e-03

Pos logit contributions
 spine+0.004
ro+0.004
ses+0.004
hy+0.004
 grade+0.003
Neg logit contributions
Joe-0.003
ull-0.003
 them-0.003
Mom-0.003
Jess-0.003
Token he
Token ID339
Feature activation-1.45e-02

Pos logit contributions
ull+0.005
ins+0.004
add+0.004
agles+0.004
Joe+0.004
Neg logit contributions
 spine-0.005
hy-0.004
ro-0.004
ses-0.003
 grade-0.003
Token w
Token ID266
Feature activation+3.42e-04

Pos logit contributions
ull+0.022
ins+0.021
agles+0.020
Joe+0.020
irs+0.020
Neg logit contributions
 spine-0.027
ro-0.025
hy-0.022
 mun-0.021
ses-0.021
Tokenadd
Token ID2860
Feature activation-6.09e-02

Pos logit contributions
 spine+0.001
ro+0.001
hy+0.001
ses+0.001
 grade+0.001
Neg logit contributions
Mom-0.000
ull-0.000
Joe-0.000
Jess-0.000
 them-0.000
Tokenled
Token ID992
Feature activation-1.54e-02

Pos logit contributions
Mom+0.098
ull+0.098
Joe+0.097
Jess+0.093
Dad+0.090
Neg logit contributions
 spine-0.108
ro-0.097
hy-0.090
ses-0.084
 mun-0.077
Token over
Token ID625
Feature activation-2.27e-03

Pos logit contributions
ull+0.026
Joe+0.025
 them+0.024
Mom+0.023
Jess+0.023
Neg logit contributions
 spine-0.028
ro-0.024
hy-0.024
ses-0.023
 grade-0.021
Token to
Token ID284
Feature activation-1.29e-02

Pos logit contributions
Joe+0.004
ull+0.004
 them+0.004
Mom+0.004
in+0.004
Neg logit contributions
 spine-0.004
hy-0.003
ses-0.003
ro-0.003
 grade-0.003
Token take
Token ID1011
Feature activation+1.80e-03

Pos logit contributions
ull+0.021
Joe+0.021
 them+0.019
Mom+0.019
Jess+0.018
Neg logit contributions
 spine-0.025
ro-0.021
hy-0.021
ses-0.020
 grade-0.018
Token a
Token ID257
Feature activation+2.26e-03

Pos logit contributions
 spine+0.003
ro+0.003
hy+0.003
ses+0.003
 grade+0.002
Neg logit contributions
ull-0.003
Joe-0.003
Mom-0.003
ins-0.003
Jess-0.003
Token and
Token ID290
Feature activation+4.03e-03

Pos logit contributions
 spine+0.016
ses+0.014
hy+0.014
ro+0.014
 grade+0.014
Neg logit contributions
ull-0.012
 them-0.012
Joe-0.011
in-0.010
 friends-0.010
Token Pupp
Token ID20926
Feature activation-2.37e-02

Pos logit contributions
 spine+0.009
ro+0.008
hy+0.008
ses+0.007
 grade+0.007
Neg logit contributions
ull-0.005
Joe-0.005
 them-0.005
Mom-0.005
Jess-0.005
Tokeny
Token ID88
Feature activation-7.97e-03

Pos logit contributions
ull+0.037
Joe+0.037
Jess+0.036
Mom+0.035
Dad+0.034
Neg logit contributions
 spine-0.043
ro-0.039
hy-0.036
ses-0.032
 mun-0.031
Token and
Token ID290
Feature activation+7.30e-03

Pos logit contributions
ull+0.013
Joe+0.012
Mom+0.011
Jess+0.011
ins+0.011
Neg logit contributions
 spine-0.015
ro-0.013
hy-0.013
ses-0.012
 grade-0.011
Token w
Token ID266
Feature activation+5.51e-03

Pos logit contributions
 spine+0.015
ro+0.013
hy+0.013
ses+0.012
 grade+0.012
Neg logit contributions
ull-0.010
Joe-0.010
Mom-0.009
 them-0.009
Jess-0.009
Tokenadd
Token ID2860
Feature activation-6.06e-02

Pos logit contributions
 spine+0.011
ro+0.010
hy+0.010
ses+0.010
 mun+0.009
Neg logit contributions
ull-0.007
Joe-0.007
Mom-0.007
Jess-0.007
Dad-0.006
Tokenled
Token ID992
Feature activation-1.98e-02

Pos logit contributions
ull+0.083
Joe+0.082
Mom+0.079
Jess+0.076
Dad+0.072
Neg logit contributions
 spine-0.125
ro-0.112
hy-0.107
ses-0.102
 mun-0.093
Token home
Token ID1363
Feature activation-1.47e-04

Pos logit contributions
ull+0.028
Joe+0.027
 them+0.025
Mom+0.024
add+0.024
Neg logit contributions
 spine-0.042
ro-0.036
hy-0.036
ses-0.034
 grade-0.033
Token.
Token ID13
Feature activation-4.52e-04

Pos logit contributions
ull+0.000
Joe+0.000
 them+0.000
ins+0.000
add+0.000
Neg logit contributions
 spine-0.000
ro-0.000
hy-0.000
ses-0.000
 grade-0.000
Token Mama
Token ID40084
Feature activation+2.71e-02

Pos logit contributions
ull+0.001
agles+0.001
Joe+0.001
ins+0.001
in+0.001
Neg logit contributions
 spine-0.001
ro-0.001
hy-0.001
ses-0.001
 Sure-0.001
Token Duck
Token ID21867
Feature activation+9.54e-03

Pos logit contributions
 spine+0.044
ro+0.040
hy+0.038
ses+0.035
 mun+0.033
Neg logit contributions
ull-0.047
Joe-0.045
Jess-0.042
agles-0.042
Mom-0.042
Token.
Token ID13
Feature activation-8.43e-03

Pos logit contributions
Joe+0.004
ull+0.004
 them+0.004
Mom+0.004
in+0.003
Neg logit contributions
 spine-0.009
ses-0.009
ro-0.008
hy-0.008
sed-0.008
Token Then
Token ID3244
Feature activation-6.10e-03

Pos logit contributions
ull+0.018
agles+0.017
Joe+0.016
ins+0.016
add+0.016
Neg logit contributions
 spine-0.012
ro-0.010
hy-0.009
ses-0.009
 Sure-0.009
Token his
Token ID465
Feature activation-1.26e-02

Pos logit contributions
ull+0.011
add+0.010
ins+0.010
Joe+0.010
agles+0.010
Neg logit contributions
 spine-0.010
hy-0.008
ro-0.008
 grade-0.008
ses-0.007
Token face
Token ID1986
Feature activation+5.06e-03

Pos logit contributions
ull+0.008
Joe+0.008
Mom+0.008
 them+0.006
add+0.006
Neg logit contributions
 spine-0.035
ro-0.032
hy-0.031
ses-0.030
 grade-0.029
Token lit
Token ID6578
Feature activation+1.12e-02

Pos logit contributions
 spine+0.009
ro+0.008
hy+0.008
ses+0.007
 grade+0.007
Neg logit contributions
ull-0.008
Joe-0.008
Mom-0.008
 them-0.008
in-0.007
Token up
Token ID510
Feature activation+2.38e-02

Pos logit contributions
 spine+0.026
ro+0.023
hy+0.022
ses+0.022
 grade+0.020
Neg logit contributions
ull-0.014
Joe-0.013
Mom-0.012
 them-0.012
Jess-0.011
Token with
Token ID351
Feature activation-1.05e-02

Pos logit contributions
 spine+0.049
ses+0.043
ro+0.042
hy+0.041
 grade+0.038
Neg logit contributions
ull-0.036
Joe-0.035
 them-0.032
Mom-0.032
in-0.030
Token excitement
Token ID14067
Feature activation-2.19e-02

Pos logit contributions
ull+0.015
Joe+0.015
Mom+0.014
 them+0.014
Jess+0.013
Neg logit contributions
 spine-0.022
ro-0.019
hy-0.018
ses-0.018
 grade-0.016
Token.
Token ID13
Feature activation-6.37e-03

Pos logit contributions
ull+0.031
Joe+0.030
Mom+0.027
 them+0.027
ins+0.026
Neg logit contributions
 spine-0.046
ro-0.040
hy-0.039
ses-0.038
 grade-0.035
Token "
Token ID366
Feature activation-6.63e-03

Pos logit contributions
ull+0.013
agles+0.013
ins+0.012
idy+0.012
in+0.012
Neg logit contributions
 spine-0.009
ro-0.007
 Sure-0.007
hy-0.007
ses-0.006
TokenIce
Token ID23709
Feature activation-2.18e-02

Pos logit contributions
ull+0.009
Joe+0.008
agles+0.008
add+0.008
 them+0.008
Neg logit contributions
 spine-0.014
ro-0.012
hy-0.012
ses-0.011
 grade-0.011
Token to
Token ID284
Feature activation+1.40e-03

Pos logit contributions
ull+0.020
Joe+0.019
Mom+0.018
Jess+0.018
add+0.018
Neg logit contributions
 spine-0.020
ro-0.016
hy-0.016
ses-0.015
 grade-0.014
Token admire
Token ID21099
Feature activation+5.65e-03

Pos logit contributions
 spine+0.003
ro+0.002
hy+0.002
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.002
 them-0.002
Joe-0.002
Jess-0.002
Mom-0.002
Token it
Token ID340
Feature activation-2.00e-03

Pos logit contributions
 spine+0.011
ro+0.009
hy+0.009
ses+0.009
 grade+0.008
Neg logit contributions
ull-0.008
Joe-0.008
Mom-0.008
add-0.008
ins-0.008
Token.
Token ID13
Feature activation+8.12e-03

Pos logit contributions
ull+0.003
 them+0.003
Joe+0.002
in+0.002
Jess+0.002
Neg logit contributions
 spine-0.004
hy-0.004
ro-0.004
ses-0.004
 grade-0.003
Token It
Token ID632
Feature activation-5.97e-03

Pos logit contributions
 spine+0.014
ro+0.011
hy+0.011
ses+0.011
 grade+0.010
Neg logit contributions
ull-0.015
Joe-0.014
 them-0.014
Mom-0.013
Jess-0.013
Token was
Token ID373
Feature activation+2.59e-02

Pos logit contributions
ull+0.010
Joe+0.009
Mom+0.009
 them+0.008
Jess+0.008
Neg logit contributions
 spine-0.011
ro-0.010
hy-0.009
ses-0.009
 grade-0.008
Token loved
Token ID6151
Feature activation+1.99e-02

Pos logit contributions
 spine+0.050
ro+0.043
hy+0.041
ses+0.040
 grade+0.037
Neg logit contributions
ull-0.041
Joe-0.040
Mom-0.037
 them-0.036
add-0.036
Token by
Token ID416
Feature activation+1.30e-02

Pos logit contributions
 spine+0.037
ro+0.030
hy+0.030
ses+0.028
 grade+0.027
Neg logit contributions
ull-0.035
Joe-0.034
 them-0.033
Jess-0.031
in-0.030
Token everyone
Token ID2506
Feature activation-3.33e-04

Pos logit contributions
 spine+0.027
ro+0.024
hy+0.023
ses+0.022
 grade+0.021
Neg logit contributions
ull-0.018
Joe-0.018
Mom-0.016
 them-0.016
add-0.016
Token who
Token ID508
Feature activation-1.57e-02

Pos logit contributions
ull+0.001
Joe+0.001
Mom+0.000
 them+0.000
Jess+0.000
Neg logit contributions
 spine-0.001
ro-0.001
hy-0.001
ses-0.001
 grade-0.000
Token saw
Token ID2497
Feature activation-8.30e-03

Pos logit contributions
ull+0.024
Joe+0.023
 them+0.021
in+0.021
Jess+0.021
Neg logit contributions
 spine-0.033
ro-0.028
hy-0.027
ses-0.027
 grade-0.025
Token his
Token ID465
Feature activation-2.50e-03

Pos logit contributions
ull+0.008
Joe+0.008
Mom+0.007
 them+0.007
Jess+0.007
Neg logit contributions
 spine-0.013
ro-0.012
hy-0.011
ses-0.011
 grade-0.010
Token eyes
Token ID2951
Feature activation+2.67e-02

Pos logit contributions
ull+0.004
Joe+0.004
Mom+0.004
Jess+0.004
ins+0.004
Neg logit contributions
 spine-0.004
ro-0.004
hy-0.004
ses-0.003
 grade-0.003
Token tightly
Token ID17707
Feature activation+1.07e-02

Pos logit contributions
 spine+0.050
ses+0.044
ro+0.043
hy+0.042
 grade+0.040
Neg logit contributions
ull-0.041
Joe-0.040
 them-0.039
Mom-0.038
in-0.036
Token.
Token ID13
Feature activation-3.13e-04

Pos logit contributions
 spine+0.024
hy+0.022
ses+0.022
 grade+0.022
ro+0.022
Neg logit contributions
 them-0.011
ull-0.010
Joe-0.010
 friends-0.009
in-0.009
Token When
Token ID1649
Feature activation-1.51e-02

Pos logit contributions
ull+0.001
Joe+0.001
agles+0.001
Jess+0.001
Mom+0.001
Neg logit contributions
 spine-0.001
ro-0.000
hy-0.000
ses-0.000
 grade-0.000
Token he
Token ID339
Feature activation+2.44e-02

Pos logit contributions
ins+0.024
ull+0.023
Joe+0.022
ups+0.022
agles+0.022
Neg logit contributions
 spine-0.028
hy-0.024
ro-0.023
 grade-0.023
ses-0.022
Token opened
Token ID4721
Feature activation+4.32e-03

Pos logit contributions
 spine+0.057
ses+0.047
hy+0.047
ro+0.046
sing+0.042
Neg logit contributions
ull-0.034
Joe-0.033
in-0.030
Mom-0.030
 them-0.030
Token them
Token ID606
Feature activation+2.77e-02

Pos logit contributions
 spine+0.007
ro+0.006
hy+0.006
ses+0.005
 grade+0.005
Neg logit contributions
ull-0.008
ins-0.008
Joe-0.007
ups-0.007
add-0.007
Token,
Token ID11
Feature activation-1.23e-02

Pos logit contributions
 spine+0.063
ro+0.056
hy+0.055
ses+0.054
 grade+0.049
Neg logit contributions
ull-0.033
Joe-0.033
 them-0.032
in-0.029
Mom-0.029
Token he
Token ID339
Feature activation+1.76e-03

Pos logit contributions
ull+0.019
ins+0.018
Joe+0.017
ups+0.017
Jess+0.017
Neg logit contributions
 spine-0.023
hy-0.020
ro-0.020
ses-0.018
 grade-0.018
Token saw
Token ID2497
Feature activation-7.89e-03

Pos logit contributions
 spine+0.003
ro+0.003
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.003
Joe-0.003
Mom-0.003
 them-0.003
in-0.003
Token big
Token ID1263
Feature activation+2.83e-03

Pos logit contributions
 spine+0.014
ro+0.012
hy+0.012
ses+0.011
 grade+0.011
Neg logit contributions
ull-0.009
Joe-0.008
add-0.007
Mom-0.007
ins-0.007
Token rail
Token ID6787
Feature activation-4.17e-03

Pos logit contributions
 spine+0.006
ro+0.006
hy+0.005
ses+0.005
 grade+0.005
Neg logit contributions
ull-0.003
Joe-0.003
 them-0.003
Mom-0.003
 raft-0.003
Token and
Token ID290
Feature activation-1.94e-03

Pos logit contributions
ull+0.006
Joe+0.006
 them+0.006
Mom+0.005
in+0.005
Neg logit contributions
 spine-0.008
ses-0.007
ro-0.007
 grade-0.007
hy-0.007
Token thought
Token ID1807
Feature activation-1.10e-02

Pos logit contributions
ull+0.003
Joe+0.003
agles+0.003
ins+0.003
add+0.003
Neg logit contributions
 spine-0.004
ro-0.003
hy-0.003
ses-0.003
 grade-0.003
Token it
Token ID340
Feature activation+1.33e-02

Pos logit contributions
Joe+0.013
ull+0.013
Mom+0.013
ins+0.013
Jess+0.012
Neg logit contributions
 spine-0.024
ro-0.021
hy-0.021
ses-0.020
 grade-0.019
Token would
Token ID561
Feature activation+3.06e-02

Pos logit contributions
 spine+0.024
ro+0.020
hy+0.020
ses+0.019
 grade+0.017
Neg logit contributions
ull-0.023
Joe-0.022
Mom-0.020
ins-0.020
Jess-0.020
Token be
Token ID307
Feature activation+3.17e-02

Pos logit contributions
 spine+0.063
ro+0.053
hy+0.051
ses+0.051
 grade+0.048
Neg logit contributions
ull-0.044
Joe-0.043
 them-0.043
Mom-0.040
add-0.037
Token funny
Token ID8258
Feature activation+1.12e-02

Pos logit contributions
 spine+0.055
hy+0.049
ro+0.048
ses+0.046
 grade+0.041
Neg logit contributions
ull-0.051
Joe-0.050
in-0.049
Mom-0.048
agles-0.048
Token to
Token ID284
Feature activation+5.35e-03

Pos logit contributions
 spine+0.016
ro+0.013
hy+0.012
ses+0.011
 grade+0.011
Neg logit contributions
ull-0.023
Joe-0.023
Mom-0.022
agles-0.022
Jess-0.022
Token hide
Token ID7808
Feature activation+3.78e-03

Pos logit contributions
 spine+0.008
ro+0.007
hy+0.006
ses+0.006
 grade+0.005
Neg logit contributions
ull-0.011
Joe-0.010
Mom-0.010
ins-0.010
Jess-0.010
Token behind
Token ID2157
Feature activation-3.25e-03

Pos logit contributions
 spine+0.006
ro+0.006
hy+0.005
ses+0.005
 grade+0.004
Neg logit contributions
ull-0.007
Joe-0.006
Jess-0.006
add-0.006
Mom-0.006
Token mouse
Token ID10211
Feature activation+9.71e-04

Pos logit contributions
 spine+0.032
ro+0.027
hy+0.026
ses+0.026
 grade+0.025
Neg logit contributions
Joe-0.029
ull-0.028
add-0.027
ins-0.027
ups-0.027
Token.
Token ID13
Feature activation+3.92e-03

Pos logit contributions
 spine+0.002
ro+0.002
hy+0.002
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.001
Joe-0.001
Mom-0.001
 them-0.001
Jess-0.001
Token But
Token ID887
Feature activation+1.72e-02

Pos logit contributions
 spine+0.007
ro+0.005
hy+0.005
ses+0.005
 grade+0.005
Neg logit contributions
ull-0.007
Joe-0.007
Mom-0.007
 them-0.006
Jess-0.006
Token you
Token ID345
Feature activation+1.76e-02

Pos logit contributions
 spine+0.040
ro+0.035
hy+0.034
ses+0.033
 grade+0.031
Neg logit contributions
ull-0.020
Joe-0.020
Mom-0.019
 them-0.018
Jess-0.017
Token have
Token ID423
Feature activation+1.23e-02

Pos logit contributions
 spine+0.034
ro+0.028
hy+0.028
ses+0.027
 grade+0.025
Neg logit contributions
ull-0.029
Joe-0.027
 them-0.026
Mom-0.025
add-0.025
Token to
Token ID284
Feature activation+3.06e-02

Pos logit contributions
 spine+0.021
hy+0.017
ro+0.017
ses+0.016
 grade+0.015
Neg logit contributions
ull-0.022
Joe-0.022
 them-0.021
Mom-0.020
Jess-0.020
Token be
Token ID307
Feature activation+1.85e-02

Pos logit contributions
 spine+0.052
ro+0.043
hy+0.042
ses+0.040
 grade+0.037
Neg logit contributions
ull-0.055
Joe-0.054
Mom-0.051
 them-0.050
Jess-0.049
Token careful
Token ID8161
Feature activation+6.89e-03

Pos logit contributions
 spine+0.039
ro+0.034
hy+0.033
ses+0.032
 grade+0.031
Neg logit contributions
ull-0.024
Joe-0.023
add-0.022
agles-0.022
Mom-0.021
Token,
Token ID11
Feature activation+7.82e-03

Pos logit contributions
 spine+0.010
ro+0.008
hy+0.008
ses+0.007
 grade+0.006
Neg logit contributions
ull-0.014
Joe-0.014
add-0.013
Jess-0.013
agles-0.013
Token because
Token ID780
Feature activation-4.42e-03

Pos logit contributions
 spine+0.016
ro+0.013
hy+0.013
ses+0.013
 grade+0.012
Neg logit contributions
ull-0.012
Joe-0.011
 them-0.011
Mom-0.011
Jess-0.010
Token you
Token ID345
Feature activation+1.64e-02

Pos logit contributions
ull+0.006
Joe+0.006
add+0.005
ins+0.005
Mom+0.005
Neg logit contributions
 spine-0.009
ro-0.008
hy-0.008
ses-0.008
 grade-0.007
Token to
Token ID284
Feature activation-4.29e-03

Pos logit contributions
ull+0.029
Joe+0.028
Mom+0.027
Jess+0.026
add+0.026
Neg logit contributions
 spine-0.024
ro-0.020
hy-0.019
ses-0.018
 grade-0.016
Token her
Token ID607
Feature activation-2.92e-03

Pos logit contributions
 them+0.005
ull+0.005
Joe+0.005
 us+0.004
Mom+0.004
Neg logit contributions
 spine-0.010
ro-0.009
hy-0.009
ses-0.009
ical-0.008
Token home
Token ID1363
Feature activation+1.42e-02

Pos logit contributions
 friends+0.003
ull+0.002
 families+0.002
 them+0.002
Mom+0.002
Neg logit contributions
 spine-0.007
hy-0.007
ro-0.007
ses-0.007
WR-0.006
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
 spine+0.038
ses+0.036
ro+0.034
hy+0.034
WR+0.033
Neg logit contributions
ull-0.010
 them-0.010
 friends-0.009
Joe-0.008
in-0.008
Token He
Token ID679
Feature activation+3.91e-03

Pos logit contributions
 spine+0.031
ses+0.027
sing+0.026
 grade+0.025
ro+0.024
Neg logit contributions
Joe-0.011
"-0.011
 rad-0.010
Mom-0.010
 them-0.010
Token wanted
Token ID2227
Feature activation+4.38e-03

Pos logit contributions
 spine+0.010
ro+0.008
hy+0.008
ses+0.008
 grade+0.008
Neg logit contributions
ull-0.005
Joe-0.004
 them-0.004
 decided-0.004
Mom-0.004
Token to
Token ID284
Feature activation+1.75e-02

Pos logit contributions
 spine+0.008
ro+0.007
ses+0.007
hy+0.007
sing+0.006
Neg logit contributions
 them-0.007
ull-0.007
Joe-0.007
Mom-0.007
ups-0.007
Token help
Token ID1037
Feature activation+7.12e-04

Pos logit contributions
 spine+0.030
ro+0.024
hy+0.023
ses+0.023
 grade+0.020
Neg logit contributions
ull-0.032
Joe-0.031
Mom-0.030
 them-0.030
Jess-0.028
Token the
Token ID262
Feature activation-1.04e-02

Pos logit contributions
 spine+0.002
ses+0.002
ro+0.001
hy+0.001
sed+0.001
Neg logit contributions
 them-0.001
ull-0.001
Joe-0.001
Mom-0.001
Jess-0.001
Token p
Token ID279
Feature activation-2.10e-02

Pos logit contributions
ull+0.017
Joe+0.016
Jess+0.015
agles+0.015
leys+0.015
Neg logit contributions
 spine-0.020
ro-0.017
hy-0.017
ses-0.015
 grade-0.015
Tokenanda
Token ID5282
Feature activation-8.71e-04

Pos logit contributions
ull+0.028
Joe+0.027
Jess+0.025
Mom+0.024
 them+0.023
Neg logit contributions
 spine-0.048
ro-0.043
hy-0.042
ses-0.038
 grade-0.036
Token Sara
Token ID24799
Feature activation+5.81e-03

Pos logit contributions
ull+0.027
agles+0.026
leys+0.025
Joe+0.025
add+0.024
Neg logit contributions
 spine-0.020
ro-0.018
hy-0.017
ses-0.016
 Whe-0.016
Token played
Token ID2826
Feature activation+1.92e-03

Pos logit contributions
 spine+0.013
hy+0.010
ses+0.010
ro+0.010
 grade+0.009
Neg logit contributions
Joe-0.007
 them-0.007
ull-0.007
in-0.007
Mom-0.007
Token with
Token ID351
Feature activation-1.32e-03

Pos logit contributions
 spine+0.004
hy+0.003
ses+0.003
ro+0.003
sed+0.003
Neg logit contributions
ull-0.003
 them-0.003
Joe-0.003
Mom-0.003
in-0.003
Token her
Token ID607
Feature activation+6.11e-03

Pos logit contributions
ull+0.002
Joe+0.002
add+0.002
Jess+0.002
Mom+0.002
Neg logit contributions
 spine-0.003
ro-0.002
hy-0.002
ses-0.002
 grade-0.002
Token makeup
Token ID16029
Feature activation+1.02e-02

Pos logit contributions
 spine+0.014
hy+0.013
ro+0.012
ses+0.012
WR+0.012
Neg logit contributions
 friends-0.007
 them-0.006
in-0.006
ull-0.006
Mom-0.006
Token.
Token ID13
Feature activation-6.44e-03

Pos logit contributions
 spine+0.024
ses+0.021
hy+0.021
ro+0.021
WR+0.019
Neg logit contributions
 them-0.010
ull-0.010
ups-0.010
in-0.010
Mom-0.010
Token She
Token ID1375
Feature activation+5.47e-03

Pos logit contributions
ull+0.011
Joe+0.010
Mom+0.009
 them+0.009
Jess+0.009
Neg logit contributions
 spine-0.012
ro-0.010
hy-0.010
ses-0.010
 grade-0.009
Token said
Token ID531
Feature activation-2.26e-03

Pos logit contributions
 spine+0.012
ro+0.010
hy+0.010
ses+0.009
 grade+0.009
Neg logit contributions
ull-0.008
Joe-0.007
Mom-0.007
 them-0.007
in-0.007
Token it
Token ID340
Feature activation+1.23e-02

Pos logit contributions
ull+0.004
Joe+0.004
agles+0.003
Jess+0.003
add+0.003
Neg logit contributions
 spine-0.004
ro-0.004
hy-0.004
ses-0.003
 grade-0.003
Token was
Token ID373
Feature activation+2.39e-02

Pos logit contributions
 spine+0.028
ro+0.023
hy+0.023
ses+0.022
 grade+0.021
Neg logit contributions
ull-0.016
Joe-0.015
Mom-0.014
 them-0.014
ups-0.014
Token not
Token ID407
Feature activation+1.83e-02

Pos logit contributions
 spine+0.053
ro+0.046
hy+0.045
ses+0.043
 grade+0.041
Neg logit contributions
ull-0.030
 them-0.029
Joe-0.029
Mom-0.028
ins-0.026
Token kid
Token ID5141
Feature activation-8.75e-03

Pos logit contributions
ull+0.010
Joe+0.009
Mom+0.009
in+0.008
 them+0.008
Neg logit contributions
 spine-0.014
ro-0.012
hy-0.011
ses-0.011
 grade-0.010
Token named
Token ID3706
Feature activation+1.81e-03

Pos logit contributions
ull+0.014
Joe+0.014
Mom+0.013
add+0.013
Jess+0.013
Neg logit contributions
 spine-0.016
ro-0.014
hy-0.014
ses-0.013
 grade-0.012
Token Sam
Token ID3409
Feature activation-5.52e-03

Pos logit contributions
 spine+0.003
hy+0.003
ro+0.003
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.003
Joe-0.003
Mom-0.003
ins-0.003
add-0.003
Token came
Token ID1625
Feature activation-1.45e-02

Pos logit contributions
ull+0.009
Joe+0.009
Mom+0.008
add+0.008
Jess+0.008
Neg logit contributions
 spine-0.010
ro-0.009
hy-0.008
ses-0.008
 grade-0.007
Token to
Token ID284
Feature activation-6.16e-03

Pos logit contributions
ull+0.030
Joe+0.029
Mom+0.028
Jess+0.028
agles+0.028
Neg logit contributions
 spine-0.020
ro-0.017
hy-0.015
ses-0.014
 grade-0.013
Token play
Token ID711
Feature activation+3.32e-03

Pos logit contributions
Joe+0.008
 them+0.008
ull+0.008
Mom+0.007
Dad+0.007
Neg logit contributions
 spine-0.014
ro-0.012
hy-0.012
ses-0.012
sed-0.010
Token.
Token ID13
Feature activation-7.18e-04

Pos logit contributions
 spine+0.008
ses+0.006
hy+0.006
ro+0.006
sed+0.006
Neg logit contributions
 them-0.005
ull-0.004
Joe-0.004
in-0.004
 friends-0.004
Token Sam
Token ID3409
Feature activation+1.12e-02

Pos logit contributions
Joe+0.001
ull+0.001
Mom+0.001
 them+0.001
Dad+0.001
Neg logit contributions
 spine-0.002
ro-0.001
hy-0.001
 grade-0.001
ses-0.001
Token was
Token ID373
Feature activation+3.41e-03

Pos logit contributions
 spine+0.025
ro+0.020
hy+0.020
ses+0.020
 grade+0.019
Neg logit contributions
Joe-0.015
ull-0.015
 them-0.014
Mom-0.014
in-0.013
Token not
Token ID407
Feature activation+1.85e-03

Pos logit contributions
 spine+0.008
ro+0.007
hy+0.006
ses+0.006
 grade+0.006
Neg logit contributions
ull-0.004
Joe-0.004
 them-0.004
Mom-0.004
 friends-0.004
Token very
Token ID845
Feature activation+1.63e-02

Pos logit contributions
 spine+0.004
ro+0.004
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.003
Joe-0.002
 them-0.002
Mom-0.002
ins-0.002
TokenDays
Token ID38770
Feature activation-8.98e-03

Pos logit contributions
 spine+0.015
ro+0.013
hy+0.012
ses+0.012
 grade+0.011
Neg logit contributions
ull-0.009
Joe-0.008
Mom-0.008
 them-0.008
Jess-0.007
Token went
Token ID1816
Feature activation+4.90e-03

Pos logit contributions
ull+0.012
Joe+0.012
 them+0.011
Mom+0.011
add+0.011
Neg logit contributions
 spine-0.020
ro-0.017
hy-0.016
ses-0.016
 grade-0.015
Token by
Token ID416
Feature activation+2.47e-02

Pos logit contributions
 spine+0.010
ro+0.009
hy+0.008
ses+0.008
 grade+0.008
Neg logit contributions
ull-0.007
Joe-0.007
 them-0.007
Mom-0.007
agles-0.006
Token and
Token ID290
Feature activation+3.46e-03

Pos logit contributions
 spine+0.056
hy+0.048
 grade+0.045
WR+0.044
ro+0.044
Neg logit contributions
 them-0.033
ull-0.033
 friends-0.030
Joe-0.028
oles-0.025
Token Lily
Token ID20037
Feature activation-1.24e-02

Pos logit contributions
 spine+0.008
hy+0.007
ro+0.007
ses+0.007
 grade+0.006
Neg logit contributions
 them-0.004
ull-0.004
Joe-0.004
Mom-0.004
add-0.003
Token spent
Token ID3377
Feature activation+7.39e-03

Pos logit contributions
ull+0.017
Joe+0.017
Mom+0.016
 them+0.016
ups+0.015
Neg logit contributions
 spine-0.026
ro-0.022
hy-0.022
ses-0.022
 grade-0.021
Token a
Token ID257
Feature activation+9.83e-03

Pos logit contributions
 spine+0.021
hy+0.018
ses+0.017
ro+0.017
 grade+0.016
Neg logit contributions
 them-0.008
 friends-0.007
Joe-0.007
ull-0.006
 other-0.006
Token lot
Token ID1256
Feature activation+1.06e-03

Pos logit contributions
 spine+0.022
ses+0.020
ro+0.019
hy+0.019
 grade+0.018
Neg logit contributions
Joe-0.011
ull-0.011
Mom-0.011
Everyone-0.010
Jess-0.010
Token of
Token ID286
Feature activation-1.05e-02

Pos logit contributions
 spine+0.003
ses+0.002
hy+0.002
sed+0.002
ro+0.002
Neg logit contributions
ull-0.001
Joe-0.001
in-0.001
 them-0.001
ups-0.001
Token time
Token ID640
Feature activation-7.17e-03

Pos logit contributions
ull+0.013
Joe+0.013
 them+0.012
add+0.012
 friends+0.012
Neg logit contributions
 spine-0.025
ro-0.021
hy-0.021
ses-0.020
 grade-0.019
Token with
Token ID351
Feature activation-1.90e-02

Pos logit contributions
ull+0.011
Joe+0.011
 them+0.010
in+0.010
ins+0.010
Neg logit contributions
 spine-0.014
ro-0.012
ses-0.011
hy-0.011
 grade-0.010
Token at
Token ID379
Feature activation-1.07e-02

Pos logit contributions
ull+0.036
Joe+0.035
Mom+0.033
 them+0.032
Jess+0.032
Neg logit contributions
 spine-0.048
ro-0.041
hy-0.040
ses-0.039
 grade-0.036
Token a
Token ID257
Feature activation-7.04e-03

Pos logit contributions
ull+0.015
Joe+0.015
Mom+0.014
 them+0.014
Jess+0.013
Neg logit contributions
 spine-0.022
ro-0.019
hy-0.019
ses-0.018
 grade-0.017
Token time
Token ID640
Feature activation-1.36e-02

Pos logit contributions
ull+0.011
Joe+0.010
ins+0.010
Mom+0.010
add+0.010
Neg logit contributions
 spine-0.014
ro-0.012
hy-0.012
ses-0.011
 grade-0.010
Token."
Token ID526
Feature activation+1.01e-02

Pos logit contributions
ull+0.021
Joe+0.020
Mom+0.019
add+0.018
agles+0.018
Neg logit contributions
 spine-0.027
ro-0.024
hy-0.022
ses-0.021
 grade-0.020
Token
Token ID198
Feature activation+8.13e-03

Pos logit contributions
 spine+0.024
ses+0.021
hy+0.021
ro+0.020
 grade+0.019
Neg logit contributions
Joe-0.012
ull-0.011
 them-0.010
Mom-0.010
Everyone-0.010
Token
Token ID198
Feature activation+1.89e-03

Pos logit contributions
 spine+0.015
ro+0.013
hy+0.013
ses+0.012
 grade+0.011
Neg logit contributions
ull-0.013
Joe-0.012
Mom-0.012
ins-0.012
agles-0.012
TokenThe
Token ID464
Feature activation+3.81e-03

Pos logit contributions
 spine+0.004
ro+0.003
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.003
Joe-0.002
add-0.002
 them-0.002
ins-0.002
Token cater
Token ID20825
Feature activation+2.83e-02

Pos logit contributions
 spine+0.008
ro+0.007
hy+0.006
ses+0.006
 grade+0.006
Neg logit contributions
ull-0.006
Joe-0.005
 them-0.005
Mom-0.005
ins-0.005
Tokenpillar
Token ID41643
Feature activation+2.06e-02

Pos logit contributions
 spine+0.065
hy+0.055
ses+0.053
ro+0.051
sing+0.050
Neg logit contributions
ull-0.039
Joe-0.038
in-0.038
ins-0.034
Jess-0.034
Token thanked
Token ID26280
Feature activation+1.83e-02

Pos logit contributions
 spine+0.044
hy+0.037
ro+0.036
ses+0.036
 grade+0.035
Neg logit contributions
ull-0.030
Joe-0.028
Mom-0.026
in-0.026
Jess-0.026
Token the
Token ID262
Feature activation-3.98e-03

Pos logit contributions
 spine+0.047
ses+0.040
hy+0.040
ro+0.040
sed+0.036
Neg logit contributions
 them-0.021
ull-0.021
Joe-0.020
Mom-0.016
Jess-0.016
Token bright
Token ID6016
Feature activation+6.69e-03

Pos logit contributions
 spine+0.006
ro+0.005
hy+0.005
ses+0.005
 shelf+0.004
Neg logit contributions
Joe-0.009
agles-0.008
eper-0.008
add-0.008
ins-0.008
Token.
Token ID13
Feature activation+1.26e-03

Pos logit contributions
 spine+0.010
ro+0.008
hy+0.008
ses+0.008
 grade+0.007
Neg logit contributions
ull-0.013
Joe-0.013
Mom-0.012
 them-0.012
ins-0.012
Token It
Token ID632
Feature activation-8.10e-03

Pos logit contributions
 spine+0.002
ro+0.002
hy+0.002
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.002
Joe-0.002
Mom-0.002
ins-0.002
 them-0.002
Token can
Token ID460
Feature activation-6.74e-03

Pos logit contributions
ull+0.012
 them+0.011
Joe+0.011
Mom+0.011
in+0.010
Neg logit contributions
 spine-0.017
ro-0.014
hy-0.014
ses-0.013
 grade-0.013
Token hurt
Token ID5938
Feature activation+1.02e-02

Pos logit contributions
ull+0.012
Joe+0.012
Mom+0.011
 them+0.011
Jess+0.011
Neg logit contributions
 spine-0.011
ro-0.010
hy-0.009
ses-0.009
 grade-0.008
Token our
Token ID674
Feature activation-1.36e-02

Pos logit contributions
 spine+0.022
ro+0.021
hy+0.020
ses+0.019
 grade+0.017
Neg logit contributions
ull-0.012
Joe-0.011
add-0.011
ins-0.010
ades-0.010
Token eyes
Token ID2951
Feature activation+1.79e-02

Pos logit contributions
ull+0.019
Joe+0.019
Mom+0.017
add+0.017
Jess+0.017
Neg logit contributions
 spine-0.029
ro-0.025
hy-0.024
ses-0.024
 grade-0.022
Token,"
Token ID553
Feature activation+1.32e-02

Pos logit contributions
 spine+0.032
ro+0.028
hy+0.027
ses+0.025
 grade+0.023
Neg logit contributions
ull-0.030
Joe-0.030
add-0.027
Jess-0.027
agles-0.027
Token Ben
Token ID3932
Feature activation+1.19e-02

Pos logit contributions
 spine+0.027
ro+0.023
hy+0.023
ses+0.022
 grade+0.020
Neg logit contributions
ull-0.020
Joe-0.019
Mom-0.018
 them-0.017
add-0.017
Token says
Token ID1139
Feature activation+4.88e-03

Pos logit contributions
 spine+0.025
ro+0.021
hy+0.021
ses+0.019
 grade+0.018
Neg logit contributions
ull-0.017
Joe-0.016
 them-0.016
Mom-0.015
ins-0.015
Token.
Token ID13
Feature activation-5.77e-03

Pos logit contributions
 spine+0.011
ro+0.009
hy+0.009
ses+0.009
 grade+0.008
Neg logit contributions
ull-0.006
Joe-0.006
 them-0.006
Mom-0.006
in-0.005
Token It
Token ID632
Feature activation-1.44e-02

Pos logit contributions
 spine+0.030
ro+0.026
hy+0.026
ses+0.025
 grade+0.024
Neg logit contributions
ull-0.016
Joe-0.015
 them-0.014
Mom-0.014
in-0.013
Token looks
Token ID3073
Feature activation+9.32e-03

Pos logit contributions
ull+0.024
Joe+0.023
Mom+0.021
Jess+0.021
Dad+0.021
Neg logit contributions
 spine-0.026
ro-0.022
hy-0.022
ses-0.022
 grade-0.019
Token very
Token ID845
Feature activation+1.02e-02

Pos logit contributions
 spine+0.019
ro+0.016
hy+0.016
ses+0.015
 grade+0.014
Neg logit contributions
ull-0.014
Joe-0.013
 them-0.013
Mom-0.012
in-0.012
Token dirty
Token ID11841
Feature activation-3.85e-03

Pos logit contributions
 spine+0.019
ro+0.016
hy+0.016
ses+0.015
 grade+0.014
Neg logit contributions
ull-0.017
Joe-0.016
agles-0.016
Jess-0.015
eper-0.015
Token.
Token ID13
Feature activation-6.32e-04

Pos logit contributions
ull+0.007
Joe+0.006
Jess+0.006
add+0.006
agles+0.006
Neg logit contributions
 spine-0.007
ro-0.006
hy-0.006
ses-0.005
 grade-0.005
Token We
Token ID775
Feature activation-1.61e-02

Pos logit contributions
ull+0.001
Joe+0.001
ins+0.001
in+0.001
agles+0.001
Neg logit contributions
 spine-0.001
ro-0.001
hy-0.001
ses-0.001
 grade-0.001
Token don
Token ID836
Feature activation-1.36e-02

Pos logit contributions
ull+0.028
Joe+0.027
Mom+0.026
 them+0.025
add+0.025
Neg logit contributions
 spine-0.028
ro-0.024
hy-0.023
ses-0.022
 grade-0.020
Token't
Token ID470
Feature activation-8.65e-04

Pos logit contributions
ull+0.028
Joe+0.027
Jess+0.026
ins+0.026
 anchors+0.025
Neg logit contributions
 spine-0.016
hy-0.014
ro-0.014
 grade-0.012
ses-0.012
Token want
Token ID765
Feature activation+4.31e-03

Pos logit contributions
ull+0.002
Joe+0.002
Mom+0.002
 them+0.002
Jess+0.002
Neg logit contributions
 spine-0.001
ro-0.001
hy-0.001
ses-0.001
 grade-0.001
Token to
Token ID284
Feature activation+6.47e-03

Pos logit contributions
 spine+0.004
ro+0.003
hy+0.003
 grade+0.003
ses+0.003
Neg logit contributions
ull-0.010
Joe-0.010
ins-0.010
Jess-0.010
eper-0.010
Token get
Token ID651
Feature activation+5.15e-03

Pos logit contributions
 spine+0.008
ro+0.007
hy+0.006
 lick+0.006
 grade+0.006
Neg logit contributions
ull-0.015
ins-0.015
Joe-0.014
Jess-0.014
agles-0.014
Token
Token ID198
Feature activation-5.18e-03

Pos logit contributions
 spine+0.006
ro+0.005
hy+0.005
ses+0.004
 grade+0.004
Neg logit contributions
ull-0.006
Joe-0.006
agles-0.006
Mom-0.006
Jess-0.006
Token
Token ID198
Feature activation-1.40e-02

Pos logit contributions
ull+0.010
ins+0.009
agles+0.009
 tides+0.009
add+0.009
Neg logit contributions
 spine-0.007
ro-0.007
hy-0.006
 Sure-0.005
 But-0.005
TokenConf
Token ID18546
Feature activation-1.05e-02

Pos logit contributions
ull+0.020
ins+0.018
add+0.018
agles+0.017
 tides+0.017
Neg logit contributions
 spine-0.026
hy-0.025
ro-0.024
ses-0.021
 lick-0.021
Tokenused
Token ID1484
Feature activation+1.26e-02

Pos logit contributions
ull+0.017
Joe+0.016
ins+0.015
Mom+0.015
 them+0.015
Neg logit contributions
 spine-0.020
ro-0.017
hy-0.017
ses-0.017
 grade-0.015
Token,
Token ID11
Feature activation-1.58e-02

Pos logit contributions
 spine+0.027
ro+0.023
hy+0.023
ses+0.022
 grade+0.020
Neg logit contributions
ull-0.018
Joe-0.017
Mom-0.016
 them-0.015
Jess-0.015
Token they
Token ID484
Feature activation-1.27e-02

Pos logit contributions
ull+0.023
Joe+0.022
ins+0.021
add+0.020
in+0.020
Neg logit contributions
 spine-0.032
ro-0.028
hy-0.027
ses-0.026
 grade-0.024
Token started
Token ID2067
Feature activation-1.35e-02

Pos logit contributions
ull+0.021
ins+0.019
Joe+0.019
Mom+0.018
agles+0.018
Neg logit contributions
 spine-0.022
hy-0.019
ro-0.019
ses-0.017
 mun-0.017
Token arguing
Token ID11810
Feature activation-1.64e-02

Pos logit contributions
ull+0.026
Joe+0.025
Mom+0.024
 them+0.023
Jess+0.023
Neg logit contributions
 spine-0.021
ro-0.017
hy-0.017
ses-0.016
 grade-0.015
Token about
Token ID546
Feature activation+9.42e-03

Pos logit contributions
ull+0.027
Joe+0.026
 them+0.025
Mom+0.024
ups+0.023
Neg logit contributions
 spine-0.031
ro-0.026
hy-0.026
ses-0.025
 grade-0.023
Token what
Token ID644
Feature activation-6.76e-03

Pos logit contributions
 spine+0.020
ro+0.018
hy+0.017
ses+0.017
 grade+0.016
Neg logit contributions
ull-0.013
Joe-0.012
Mom-0.011
 them-0.011
add-0.011
Token to
Token ID284
Feature activation-1.66e-03

Pos logit contributions
add+0.010
ull+0.010
ins+0.010
eper+0.010
Joe+0.010
Neg logit contributions
 spine-0.011
hy-0.010
ro-0.010
ses-0.010
 grade-0.009
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
ull+0.010
Joe+0.010
 them+0.009
Mom+0.009
ins+0.009
Neg logit contributions
 spine-0.017
ro-0.015
hy-0.014
ses-0.014
 grade-0.013
Token
Token ID198
Feature activation-1.34e-02

Pos logit contributions
ull+0.026
Joe+0.025
agles+0.025
add+0.024
ins+0.024
Neg logit contributions
 spine-0.022
ro-0.019
ses-0.018
hy-0.018
 Sure-0.016
Token
Token ID198
Feature activation-1.88e-02

Pos logit contributions
ull+0.027
agles+0.027
 tides+0.026
add+0.026
Joe+0.026
Neg logit contributions
 spine-0.016
ro-0.015
ses-0.013
hy-0.012
 Sure-0.011
TokenThey
Token ID2990
Feature activation-4.73e-03

Pos logit contributions
add+0.028
ull+0.026
agles+0.024
idy+0.023
 tides+0.022
Neg logit contributions
ro-0.036
 spine-0.033
hy-0.029
 Sure-0.028
tail-0.028
Token smile
Token ID8212
Feature activation+3.26e-03

Pos logit contributions
ull+0.008
Joe+0.008
 them+0.008
Mom+0.008
add+0.008
Neg logit contributions
 spine-0.008
ro-0.007
hy-0.006
ses-0.006
 grade-0.006
Token too
Token ID1165
Feature activation-1.01e-02

Pos logit contributions
 spine+0.007
ro+0.006
hy+0.006
ses+0.005
 grade+0.005
Neg logit contributions
ull-0.005
 them-0.004
Joe-0.004
Mom-0.004
ins-0.004
Token.
Token ID13
Feature activation-1.48e-03

Pos logit contributions
ull+0.016
Joe+0.016
agles+0.015
Mom+0.015
Jess+0.015
Neg logit contributions
 spine-0.019
ro-0.016
hy-0.016
ses-0.015
 grade-0.014
Token They
Token ID1119
Feature activation-4.78e-03

Pos logit contributions
ull+0.002
Joe+0.002
Mom+0.002
 them+0.002
Jess+0.002
Neg logit contributions
 spine-0.003
ro-0.002
hy-0.002
ses-0.002
 grade-0.002
Token are
Token ID389
Feature activation-2.05e-02

Pos logit contributions
Joe+0.007
 them+0.007
ull+0.007
Mom+0.007
add+0.007
Neg logit contributions
 spine-0.009
ro-0.007
ses-0.007
hy-0.007
 grade-0.006
Token happy
Token ID3772
Feature activation-9.02e-03

Pos logit contributions
Joe+0.036
ull+0.035
agles+0.035
Jess+0.032
irs+0.032
Neg logit contributions
 spine-0.034
ses-0.029
hy-0.029
ro-0.026
sing-0.026
Token.
Token ID13
Feature activation-2.14e-03

Pos logit contributions
ull+0.014
Joe+0.013
Mom+0.012
add+0.012
Jess+0.012
Neg logit contributions
 spine-0.018
ro-0.015
hy-0.015
ses-0.014
 grade-0.014
Token strong
Token ID1913
Feature activation+1.69e-02

Pos logit contributions
 spine+0.028
ro+0.024
hy+0.023
ses+0.023
 grade+0.021
Neg logit contributions
ull-0.021
Joe-0.020
Jess-0.019
Mom-0.018
ins-0.018
Token and
Token ID290
Feature activation-7.95e-03

Pos logit contributions
 spine+0.033
ro+0.027
hy+0.027
ses+0.026
 grade+0.025
Neg logit contributions
ull-0.026
 them-0.025
Joe-0.025
in-0.024
Mom-0.023
Token could
Token ID714
Feature activation-1.29e-02

Pos logit contributions
ull+0.012
Joe+0.012
Mom+0.011
Jess+0.011
 them+0.011
Neg logit contributions
 spine-0.015
ro-0.013
hy-0.013
ses-0.013
 grade-0.012
Token send
Token ID3758
Feature activation-5.03e-03

Pos logit contributions
ull+0.019
Mom+0.018
Joe+0.018
 them+0.017
agles+0.015
Neg logit contributions
 spine-0.027
ro-0.022
hy-0.021
ses-0.021
 grade-0.020
Token the
Token ID262
Feature activation+5.30e-04

Pos logit contributions
Joe+0.008
ull+0.008
Jess+0.007
add+0.007
Mom+0.007
Neg logit contributions
 spine-0.009
ro-0.008
hy-0.008
ses-0.008
 grade-0.007
Token red
Token ID2266
Feature activation-1.49e-02

Pos logit contributions
 spine+0.001
ro+0.001
hy+0.001
ses+0.001
 grade+0.001
Neg logit contributions
ull-0.001
Joe-0.001
Jess-0.001
Mom-0.001
add-0.001
Token ball
Token ID2613
Feature activation-1.48e-02

Pos logit contributions
ull+0.027
Joe+0.027
Jess+0.025
add+0.024
Mom+0.024
Neg logit contributions
 spine-0.026
ro-0.022
hy-0.021
ses-0.020
 grade-0.019
Token to
Token ID284
Feature activation-3.06e-03

Pos logit contributions
ull+0.028
Joe+0.028
Mom+0.026
Jess+0.026
add+0.025
Neg logit contributions
 spine-0.024
ro-0.020
hy-0.019
ses-0.018
 grade-0.016
Token the
Token ID262
Feature activation-6.58e-03

Pos logit contributions
ull+0.004
Joe+0.004
 them+0.004
Mom+0.004
add+0.004
Neg logit contributions
 spine-0.007
ro-0.006
hy-0.006
ses-0.005
 grade-0.005
Token mattress
Token ID33388
Feature activation-4.52e-03

Pos logit contributions
ull+0.011
Joe+0.010
Jess+0.009
ins+0.009
add+0.009
Neg logit contributions
 spine-0.013
ro-0.011
hy-0.011
ses-0.011
 grade-0.010
Token.
Token ID13
Feature activation+4.81e-03

Pos logit contributions
ull+0.005
 them+0.005
Joe+0.004
in+0.004
Mom+0.004
Neg logit contributions
 spine-0.011
ro-0.009
hy-0.009
ses-0.009
 grade-0.009
Token sad
Token ID6507
Feature activation+6.54e-03

Pos logit contributions
ull+0.000
Joe+0.000
agles+0.000
Mom+0.000
Jess+0.000
Neg logit contributions
 spine-0.000
ro-0.000
hy-0.000
ses-0.000
 grade-0.000
Token and
Token ID290
Feature activation-1.08e-02

Pos logit contributions
 spine+0.014
hy+0.013
ses+0.012
ro+0.012
sed+0.011
Neg logit contributions
 them-0.009
Joe-0.008
ull-0.008
in-0.008
idy-0.007
Token asked
Token ID1965
Feature activation+2.26e-03

Pos logit contributions
ull+0.016
Joe+0.016
Mom+0.014
 them+0.014
Jess+0.014
Neg logit contributions
 spine-0.022
ro-0.019
hy-0.018
ses-0.018
 grade-0.017
Token "
Token ID366
Feature activation-9.37e-03

Pos logit contributions
 spine+0.004
ro+0.004
hy+0.003
 grade+0.003
 Whe+0.003
Neg logit contributions
ull-0.004
agles-0.003
Joe-0.003
ins-0.003
add-0.003
TokenWhere
Token ID8496
Feature activation+4.19e-03

Pos logit contributions
ull+0.014
Joe+0.013
add+0.012
agles+0.012
ins+0.012
Neg logit contributions
 spine-0.019
ro-0.016
hy-0.016
ses-0.015
 grade-0.015
Token are
Token ID389
Feature activation-4.35e-02

Pos logit contributions
 spine+0.007
hy+0.006
ro+0.006
ses+0.006
 grade+0.005
Neg logit contributions
ull-0.007
Joe-0.007
Mom-0.007
add-0.007
Jess-0.007
Token you
Token ID345
Feature activation+2.55e-03

Pos logit contributions
ull+0.086
ins+0.074
Joe+0.073
leys+0.073
eled+0.072
Neg logit contributions
hy-0.054
 spine-0.052
 l-0.047
 cost-0.046
 This-0.046
Token going
Token ID1016
Feature activation-2.09e-03

Pos logit contributions
 spine+0.005
ro+0.004
hy+0.004
ses+0.004
 grade+0.004
Neg logit contributions
Joe-0.004
ull-0.004
Jess-0.004
Mom-0.004
ups-0.004
Token?"
Token ID1701
Feature activation+8.97e-03

Pos logit contributions
ull+0.003
Joe+0.003
 them+0.002
Mom+0.002
in+0.002
Neg logit contributions
 spine-0.005
ro-0.004
hy-0.004
ses-0.004
 grade-0.004
Token Mom
Token ID11254
Feature activation+1.91e-03

Pos logit contributions
 spine+0.018
ro+0.016
hy+0.015
ses+0.015
 grade+0.014
Neg logit contributions
ull-0.013
Joe-0.013
Mom-0.012
 them-0.012
add-0.012
Tokenmy
Token ID1820
Feature activation-8.30e-03

Pos logit contributions
 spine+0.004
ro+0.003
hy+0.003
ses+0.003
 grade+0.003
Neg logit contributions
ull-0.003
Joe-0.003
add-0.002
Jess-0.002
agles-0.002
TokenO
Token ID46
Feature activation-2.43e-02

Pos logit contributions
ull+0.010
Joe+0.009
ins+0.009
add+0.009
ups+0.009
Neg logit contributions
 spine-0.014
ro-0.013
ses-0.012
hy-0.012
 lick-0.011
Tokenw
Token ID86
Feature activation-1.50e-02

Pos logit contributions
ull+0.031
Joe+0.030
in+0.027
Mom+0.027
ins+0.026
Neg logit contributions
 spine-0.055
ro-0.048
hy-0.048
ses-0.045
 grade-0.044
Token,
Token ID11
Feature activation+1.17e-03

Pos logit contributions
ull+0.026
Joe+0.025
Jess+0.025
add+0.024
 anchors+0.024
Neg logit contributions
 spine-0.025
ro-0.021
hy-0.021
ses-0.020
sed-0.018
Token ow
Token ID12334
Feature activation-4.07e-02

Pos logit contributions
 spine+0.003
ro+0.002
hy+0.002
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.001
Joe-0.001
Mom-0.001
ins-0.001
Jess-0.001
Token,
Token ID11
Feature activation+4.39e-03

Pos logit contributions
ull+0.078
Joe+0.075
Jess+0.072
 them+0.072
add+0.071
Neg logit contributions
 spine-0.063
ro-0.049
ses-0.048
hy-0.045
 lick-0.045
Token ow
Token ID12334
Feature activation-4.62e-02

Pos logit contributions
 spine+0.010
ro+0.009
hy+0.009
ses+0.008
 grade+0.008
Neg logit contributions
ull-0.005
Joe-0.005
Mom-0.005
 them-0.005
Jess-0.004
Token!"
Token ID2474
Feature activation-5.92e-03

Pos logit contributions
ull+0.083
Joe+0.079
add+0.078
irs+0.077
 them+0.077
Neg logit contributions
 spine-0.070
ro-0.057
ses-0.055
 lick-0.055
 mun-0.053
Token she
Token ID673
Feature activation-7.74e-03

Pos logit contributions
ull+0.009
Joe+0.009
Mom+0.009
 them+0.008
Jess+0.008
Neg logit contributions
 spine-0.011
ro-0.010
hy-0.009
ses-0.009
 grade-0.008
Token cries
Token ID24691
Feature activation-6.77e-03

Pos logit contributions
ull+0.014
Joe+0.014
Mom+0.013
 them+0.013
ins+0.013
Neg logit contributions
 spine-0.013
ro-0.011
hy-0.010
ses-0.010
 grade-0.009
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
Joe+0.012
ull+0.012
agles+0.011
Jess+0.011
ins+0.011
Neg logit contributions
 spine-0.012
ro-0.010
hy-0.009
ses-0.009
 grade-0.008
Token "
Token ID366
Feature activation-1.74e-02

Pos logit contributions
ull+0.029
Joe+0.029
 them+0.028
Mom+0.028
Jess+0.026
Neg logit contributions
 spine-0.044
hy-0.038
ro-0.038
 grade-0.035
ses-0.034
Token sorry
Token ID7926
Feature activation-1.09e-02

Pos logit contributions
ull+0.016
Joe+0.016
Mom+0.015
ins+0.015
agles+0.015
Neg logit contributions
 spine-0.021
hy-0.018
ro-0.018
ses-0.017
 grade-0.016
Token,"
Token ID553
Feature activation+1.01e-02

Pos logit contributions
ull+0.019
Joe+0.018
ins+0.018
leys+0.018
agles+0.017
Neg logit contributions
 spine-0.019
hy-0.016
ro-0.015
ses-0.014
 grade-0.014
Token the
Token ID262
Feature activation-2.56e-03

Pos logit contributions
 spine+0.021
ro+0.018
hy+0.018
ses+0.017
 grade+0.016
Neg logit contributions
ull-0.014
Joe-0.014
Mom-0.013
 them-0.013
Jess-0.012
Token wild
Token ID4295
Feature activation-3.58e-03

Pos logit contributions
Joe+0.004
ull+0.004
Jess+0.004
leys+0.004
add+0.004
Neg logit contributions
 spine-0.005
ro-0.004
hy-0.004
ses-0.004
 grade-0.004
Tokener
Token ID263
Feature activation-1.56e-02

Pos logit contributions
Joe+0.005
ull+0.005
Jess+0.005
Mom+0.005
ins+0.004
Neg logit contributions
 spine-0.007
ro-0.007
hy-0.006
ses-0.006
 grade-0.006
Tokenbe
Token ID1350
Feature activation-4.56e-02

Pos logit contributions
Joe+0.027
ull+0.027
Jess+0.026
Mom+0.026
add+0.026
Neg logit contributions
 spine-0.025
ro-0.022
hy-0.022
ses-0.019
sed-0.017
Tokenast
Token ID459
Feature activation+1.54e-02

Pos logit contributions
Joe+0.065
ull+0.064
Mom+0.062
Jess+0.061
add+0.058
Neg logit contributions
 spine-0.089
ro-0.080
hy-0.078
ses-0.072
 grade-0.068
Token said
Token ID531
Feature activation+7.49e-04

Pos logit contributions
 spine+0.031
ro+0.028
hy+0.027
ses+0.025
 grade+0.024
Neg logit contributions
ull-0.022
Joe-0.022
Mom-0.020
Jess-0.020
ins-0.019
Token.
Token ID13
Feature activation-9.92e-03

Pos logit contributions
 spine+0.002
ro+0.001
ses+0.001
hy+0.001
 grade+0.001
Neg logit contributions
ull-0.001
Joe-0.001
 them-0.001
Mom-0.001
in-0.001
Token "
Token ID366
Feature activation-1.15e-02

Pos logit contributions
ull+0.016
Joe+0.015
Mom+0.014
 them+0.014
Jess+0.014
Neg logit contributions
 spine-0.019
ro-0.016
hy-0.016
ses-0.015
 grade-0.014
TokenBut
Token ID1537
Feature activation-5.25e-03

Pos logit contributions
ull+0.013
Joe+0.013
Mom+0.012
Jess+0.011
in+0.011
Neg logit contributions
 spine-0.028
hy-0.024
ro-0.024
ses-0.024
 grade-0.022
Token played
Token ID2826
Feature activation+8.41e-03

Pos logit contributions
ull+0.003
Joe+0.003
Mom+0.003
 them+0.003
add+0.002
Neg logit contributions
 spine-0.003
ro-0.003
hy-0.003
ses-0.003
 grade-0.002
Token in
Token ID287
Feature activation+1.13e-02

Pos logit contributions
 spine+0.022
hy+0.020
ro+0.018
ses+0.018
 choke+0.016
Neg logit contributions
 them-0.011
 friends-0.009
ull-0.009
Joe-0.008
 castles-0.008
Token the
Token ID262
Feature activation+7.74e-03

Pos logit contributions
 spine+0.026
ro+0.022
hy+0.021
ses+0.021
 grade+0.020
Neg logit contributions
ull-0.015
Joe-0.015
 them-0.014
Mom-0.014
Jess-0.012
Token sand
Token ID6450
Feature activation+1.40e-02

Pos logit contributions
 spine+0.011
hy+0.010
ro+0.009
 sleep+0.009
 shelf+0.009
Neg logit contributions
oles-0.015
Joe-0.015
ull-0.015
ants-0.015
agles-0.015
Token and
Token ID290
Feature activation+5.62e-03

Pos logit contributions
 spine+0.028
hy+0.024
ro+0.023
ses+0.022
 grade+0.021
Neg logit contributions
ull-0.022
Joe-0.020
 them-0.020
ins-0.019
in-0.019
Token sw
Token ID1509
Feature activation-4.28e-02

Pos logit contributions
 spine+0.011
hy+0.010
ro+0.010
ses+0.009
 grade+0.009
Neg logit contributions
ull-0.008
Joe-0.008
 them-0.008
Mom-0.007
 boats-0.007
Tokenam
Token ID321
Feature activation-1.16e-02

Pos logit contributions
Joe+0.036
ull+0.036
Jess+0.033
Mom+0.032
Dad+0.027
Neg logit contributions
 spine-0.112
ro-0.103
ses-0.098
hy-0.098
sed-0.090
Token in
Token ID287
Feature activation-3.06e-04

Pos logit contributions
ull+0.017
Joe+0.017
 them+0.017
Mom+0.016
in+0.015
Neg logit contributions
 spine-0.024
hy-0.020
ro-0.020
ses-0.020
 grade-0.019
Token the
Token ID262
Feature activation-3.90e-03

Pos logit contributions
ull+0.000
Joe+0.000
 them+0.000
Mom+0.000
Jess+0.000
Neg logit contributions
 spine-0.001
ro-0.001
hy-0.001
ses-0.001
 grade-0.001
Token sea
Token ID5417
Feature activation-4.27e-04

Pos logit contributions
Joe+0.006
ull+0.006
oles+0.006
Jess+0.006
ins+0.006
Neg logit contributions
 spine-0.007
ro-0.006
hy-0.006
 lick-0.005
 lap-0.005
Token.
Token ID13
Feature activation+4.05e-03

Pos logit contributions
ull+0.001
Joe+0.001
Mom+0.001
 them+0.001
ins+0.001
Neg logit contributions
 spine-0.001
ro-0.001
hy-0.001
ses-0.001
 grade-0.001
Token dolphin
Token ID44136
Feature activation+1.49e-03

Pos logit contributions
ull+0.021
Joe+0.021
Mom+0.019
 them+0.019
Jess+0.018
Neg logit contributions
 spine-0.029
ro-0.025
hy-0.024
ses-0.024
 grade-0.022
Token saw
Token ID2497
Feature activation-8.72e-03

Pos logit contributions
 spine+0.003
ro+0.003
hy+0.002
ses+0.002
 grade+0.002
Neg logit contributions
ull-0.002
Joe-0.002
Mom-0.002
Jess-0.002
ins-0.002
Token what
Token ID644
Feature activation-4.91e-03

Pos logit contributions
ull+0.012
Joe+0.011
Mom+0.011
Jess+0.010
add+0.010
Neg logit contributions
 spine-0.018
ro-0.016
hy-0.015
ses-0.015
 grade-0.014
Token happened
Token ID3022
Feature activation-7.10e-03

Pos logit contributions
ull+0.007
Joe+0.007
Mom+0.006
 them+0.006
Jess+0.006
Neg logit contributions
 spine-0.010
ro-0.009
hy-0.009
ses-0.008
 grade-0.008
Token and
Token ID290
Feature activation-2.00e-03

Pos logit contributions
ull+0.012
Joe+0.011
 them+0.011
Mom+0.010
in+0.010
Neg logit contributions
 spine-0.013
ro-0.011
hy-0.011
ses-0.011
 grade-0.010
Token sw
Token ID1509
Feature activation-4.43e-02

Pos logit contributions
ull+0.003
Joe+0.003
Mom+0.003
agles+0.003
ins+0.003
Neg logit contributions
 spine-0.004
ro-0.003
hy-0.003
ses-0.003
 grade-0.003
Tokenam
Token ID321
Feature activation-1.28e-02

Pos logit contributions
Jess+0.042
Mom+0.041
ull+0.040
Joe+0.040
Dad+0.036
Neg logit contributions
 spine-0.108
ro-0.100
ses-0.096
hy-0.093
sed-0.090
Token over
Token ID625
Feature activation-1.22e-02

Pos logit contributions
ull+0.022
Joe+0.022
Mom+0.021
Jess+0.020
ins+0.020
Neg logit contributions
 spine-0.022
ro-0.019
hy-0.019
ses-0.017
 grade-0.016
Token to
Token ID284
Feature activation-1.39e-02

Pos logit contributions
Joe+0.020
ull+0.020
 them+0.020
in+0.017
Mom+0.017
Neg logit contributions
 spine-0.025
ses-0.020
hy-0.020
ro-0.019
 grade-0.018
Token help
Token ID1037
Feature activation-1.08e-02

Pos logit contributions
ull+0.017
Joe+0.016
 them+0.016
add+0.014
Jess+0.014
Neg logit contributions
 spine-0.033
hy-0.029
ro-0.029
ses-0.028
 grade-0.026
Token.
Token ID13
Feature activation+4.63e-03

Pos logit contributions
ull+0.014
Joe+0.013
Mom+0.012
 them+0.012
Jess+0.012
Neg logit contributions
 spine-0.024
ro-0.021
hy-0.021
ses-0.020
 grade-0.019