TOP ACTIVATIONS
MAX = 3.063

so excited to watch the new video . But when she
was so excited to watch the new video . But when
One day , he came up with an idea to scare
Benny 's friend Sally came up to him and said ,
" Don 't worry , little mouse , I will rescue
. One day he came up with a new game .
. " Don 't worry little friend , I will help
One day , she came up with a great idea .
park when a boy came up to her and asked if
searched until she finally found the video . She was so
mom . A stranger came up to them and asked for
, named Linda , came up to him and asked if
took good care of his new tablet so it wouldn 't
up , then come back down to her tree .
One day , she came up with an idea . She
his dad for replacing his old tablet with a new one
he had chosen to watch the fun and easy video .
then a big cat comes and sees the toy birds .
, a little girl came up to her , looked up
about it until she came up with a great plan .

MIN ACTIVATIONS
MIN = -3.431

his pet mouse , S que aky . One
park when she stumbled upon an old cob web . Mill
a little mouse named S que aky . One day ,
the grass when he heard an odd sound . It sounded
. P ips que ak started to regret it
a loud noise coming from an ig loo .
Lily . " It 's an underground tunnel ," said her
G iz mo and S que eker , were both small
one day he stumbled across an amazing pattern ! It was
big step and go on an adventure . They walked far
by himself when he heard an enormous noise . He looked
walking around when he heard an odd sound . It was
. One day P ips que ak was playing with its
her shoe was stuck in an icy crack in the sidewalk
too fast and bumped into an old man .
, Liz found him in an old dust bin . Joe
play ?" S que aky replied , " Yes
and the car twisted in an uncomfortable way . The little
. The bird lived in an ordinary tree with its family
was the only stitch in an old , worn jumper .

INTERVAL 1.440 - 3.063
CONTAINS 0.693%

nap . The cat 's friend wanted to
" Hello , dog ! Can I take a picture of
cool . The End <|endoftext|> One day ,
One day , T illy found a little
, Lily . One day , they went to the beach

INTERVAL -0.184 - 1.440
CONTAINS 62.772%

� You never know what you â € ™ ll find
Jenny ," said Mr . Fisher . " We have to
John found himself in a park . He continued to explore
my own ? â € The crystal said ,
shiny . One day , John was feeling

INTERVAL -1.807 - -0.184
CONTAINS 36.381%

the light and smiled . It was a wonderful
happened and ran to help Lily . He was frightened that
a lot of cars . Lily said , " That house
wants pasta and salad . Let 's tell the lady what
exactly what to do . He smiled at the

INTERVAL -3.431 - -1.807
CONTAINS 0.154%

They see the jam on their clothes . They laugh .
and worried , he didn 't know what to do .
and she sadly died . Everyone was very sad and wished
, and puts it in an envelope . She writes her
man watched as she skipped mer rily away into the sunset
Token so
Token ID523
Feature activation+3.55e-02

Pos logit contributions
nes+0.008
vel+0.007
oll+0.007
ode+0.007
aces+0.007
Neg logit contributions
 Ms-0.006
 while-0.006
 Mrs-0.006
 Mr-0.005
 last-0.005
Token excited
Token ID6568
Feature activation+4.75e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 teach+0.001
 last+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ust-0.002
aces-0.001
Token to
Token ID284
Feature activation-8.80e-02

Pos logit contributions
 Ms+0.013
 teach+0.011
 Mr+0.011
 Mitt+0.011
 Mrs+0.011
Neg logit contributions
nes-0.022
vel-0.021
oll-0.021
ust-0.020
aces-0.020
Token watch
Token ID2342
Feature activation+3.71e-01

Pos logit contributions
nes+0.005
vel+0.005
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 Mrs-0.002
 Mr-0.002
 last-0.002
Token the
Token ID262
Feature activation+3.06e+00

Pos logit contributions
 Ms+0.007
 while+0.006
 last+0.006
 Mrs+0.006
 Mr+0.005
Neg logit contributions
vel-0.020
nes-0.020
oll-0.019
aces-0.019
ode-0.018
Token new
Token ID649
Feature activation+3.06e+00

Pos logit contributions
thank+0.117
 Mitt+0.111
Thank+0.111
 hire+0.110
 Ms+0.109
Neg logit contributions
vel-0.072
nes-0.058
glass-0.057
oll-0.057
aky-0.053
Token video
Token ID2008
Feature activation-2.25e-01

Pos logit contributions
Thank+0.120
 usual+0.119
Dr+0.116
 Mitt+0.115
thank+0.114
Neg logit contributions
vel-0.071
nes-0.060
ont-0.058
glass-0.056
od-0.054
Token.
Token ID13
Feature activation+6.17e-01

Pos logit contributions
nes+0.012
vel+0.012
oll+0.012
ode+0.011
aces+0.011
Neg logit contributions
 Ms-0.005
 while-0.004
 Mrs-0.004
 Mr-0.004
 last-0.004
Token But
Token ID887
Feature activation+1.55e-01

Pos logit contributions
 Ms+0.021
 last+0.020
 while+0.020
 teach+0.020
glers+0.019
Neg logit contributions
nes-0.022
oll-0.022
aces-0.021
vel-0.021
ode-0.020
Token when
Token ID618
Feature activation+8.42e-01

Pos logit contributions
 Ms+0.003
 teach+0.002
 while+0.002
 praise+0.002
 Mrs+0.002
Neg logit contributions
nes-0.009
vel-0.008
oll-0.008
ode-0.008
aces-0.008
Token she
Token ID673
Feature activation-3.79e-01

Pos logit contributions
 Ms+0.025
 praise+0.024
 teach+0.024
 last+0.022
glers+0.021
Neg logit contributions
nes-0.033
vel-0.031
oll-0.030
ust-0.028
ode-0.027
Token was
Token ID373
Feature activation-1.86e-01

Pos logit contributions
nes+0.008
vel+0.008
oll+0.008
ode+0.008
aces+0.008
Neg logit contributions
 Ms-0.004
 while-0.004
 Mrs-0.004
 Mr-0.004
 last-0.003
Token so
Token ID523
Feature activation+3.55e-02

Pos logit contributions
nes+0.008
vel+0.007
oll+0.007
ode+0.007
aces+0.007
Neg logit contributions
 Ms-0.006
 while-0.006
 Mrs-0.006
 Mr-0.005
 last-0.005
Token excited
Token ID6568
Feature activation+4.75e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 teach+0.001
 last+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ust-0.002
aces-0.001
Token to
Token ID284
Feature activation-8.80e-02

Pos logit contributions
 Ms+0.013
 teach+0.011
 Mr+0.011
 Mitt+0.011
 Mrs+0.011
Neg logit contributions
nes-0.022
vel-0.021
oll-0.021
ust-0.020
aces-0.020
Token watch
Token ID2342
Feature activation+3.71e-01

Pos logit contributions
nes+0.005
vel+0.005
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 Mrs-0.002
 Mr-0.002
 last-0.002
Token the
Token ID262
Feature activation+3.06e+00

Pos logit contributions
 Ms+0.007
 while+0.006
 last+0.006
 Mrs+0.006
 Mr+0.005
Neg logit contributions
vel-0.020
nes-0.020
oll-0.019
aces-0.019
ode-0.018
Token new
Token ID649
Feature activation+3.06e+00

Pos logit contributions
thank+0.117
 Mitt+0.111
Thank+0.111
 hire+0.110
 Ms+0.109
Neg logit contributions
vel-0.072
nes-0.058
glass-0.057
oll-0.057
aky-0.053
Token video
Token ID2008
Feature activation-2.25e-01

Pos logit contributions
Thank+0.120
 usual+0.119
Dr+0.116
 Mitt+0.115
thank+0.114
Neg logit contributions
vel-0.071
nes-0.060
ont-0.058
glass-0.056
od-0.054
Token.
Token ID13
Feature activation+6.17e-01

Pos logit contributions
nes+0.012
vel+0.012
oll+0.012
ode+0.011
aces+0.011
Neg logit contributions
 Ms-0.005
 while-0.004
 Mrs-0.004
 Mr-0.004
 last-0.004
Token But
Token ID887
Feature activation+1.55e-01

Pos logit contributions
 Ms+0.021
 last+0.020
 while+0.020
 teach+0.020
glers+0.019
Neg logit contributions
nes-0.022
oll-0.022
aces-0.021
vel-0.021
ode-0.020
Token when
Token ID618
Feature activation+8.42e-01

Pos logit contributions
 Ms+0.003
 teach+0.002
 while+0.002
 praise+0.002
 Mrs+0.002
Neg logit contributions
nes-0.009
vel-0.008
oll-0.008
ode-0.008
aces-0.008
Token One
Token ID1881
Feature activation-1.62e-02

Pos logit contributions
nes+0.016
vel+0.016
oll+0.016
ode+0.016
aces+0.015
Neg logit contributions
 Ms-0.006
 while-0.005
 Mrs-0.005
 Mr-0.005
 teach-0.005
Token day
Token ID1110
Feature activation+9.03e-01

Pos logit contributions
nes+0.001
oll+0.001
vel+0.001
ode+0.001
ust+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 Mrs-0.000
 teach-0.000
 Mr-0.000
Token,
Token ID11
Feature activation+1.34e+00

Pos logit contributions
 Ms+0.025
 praise+0.023
 teach+0.022
 last+0.022
glers+0.022
Neg logit contributions
vel-0.039
oll-0.037
nes-0.036
aces-0.035
aund-0.034
Token he
Token ID339
Feature activation+4.23e-01

Pos logit contributions
 praise+0.035
 Ms+0.035
glers+0.035
 thanks+0.033
 teach+0.032
Neg logit contributions
nes-0.048
vel-0.047
aces-0.045
oll-0.043
glass-0.043
Token came
Token ID1625
Feature activation+1.17e+00

Pos logit contributions
 Ms+0.016
 Mrs+0.014
 while+0.014
 teach+0.014
 Mr+0.014
Neg logit contributions
nes-0.016
vel-0.015
oll-0.015
ode-0.015
aky-0.014
Token up
Token ID510
Feature activation+3.05e+00

Pos logit contributions
 Ms+0.046
glers+0.042
 Mrs+0.040
 Mitt+0.040
 Mr+0.040
Neg logit contributions
vel-0.040
ust-0.039
aky-0.039
nes-0.038
aces-0.036
Token with
Token ID351
Feature activation+6.52e-02

Pos logit contributions
glers+0.106
 Ms+0.104
 Mitt+0.100
 praise+0.096
 choice+0.095
Neg logit contributions
vel-0.103
ust-0.093
aky-0.092
nes-0.089
per-0.085
Token an
Token ID281
Feature activation-2.39e+00

Pos logit contributions
 Ms+0.002
 while+0.002
 Mrs+0.001
 teach+0.001
 Mr+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
aces-0.003
Token idea
Token ID2126
Feature activation+2.35e-01

Pos logit contributions
nes+0.136
lor+0.120
vel+0.120
ode+0.117
emic+0.116
Neg logit contributions
 last-0.049
 teach-0.049
 Ms-0.048
 Mrs-0.046
 Mr-0.045
Token to
Token ID284
Feature activation+3.90e-01

Pos logit contributions
 Ms+0.006
 Mrs+0.005
 Mr+0.005
 while+0.005
 last+0.004
Neg logit contributions
nes-0.012
vel-0.012
oll-0.012
ode-0.011
aces-0.011
Token scare
Token ID19437
Feature activation-1.02e+00

Pos logit contributions
 Ms+0.010
 while+0.009
 Mrs+0.009
 Mitt+0.009
 Mr+0.008
Neg logit contributions
nes-0.019
vel-0.018
ode-0.018
oll-0.017
aces-0.017
Token Benny
Token ID44275
Feature activation+5.93e-01

Pos logit contributions
 praise+0.061
 Ms+0.059
 hire+0.057
 choice+0.055
 praises+0.054
Neg logit contributions
vel-0.049
nes-0.048
bowl-0.046
aky-0.044
 anchor-0.044
Token's
Token ID338
Feature activation-5.58e-01

Pos logit contributions
 Ms+0.021
 teach+0.019
 Mrs+0.019
 praise+0.018
 last+0.018
Neg logit contributions
nes-0.023
vel-0.022
oll-0.022
ode-0.022
aky-0.022
Token friend
Token ID1545
Feature activation+4.45e-01

Pos logit contributions
nes+0.029
vel+0.029
oll+0.027
ode+0.027
aces+0.027
Neg logit contributions
 Ms-0.013
 while-0.012
 last-0.011
 Mr-0.011
 teach-0.011
Token Sally
Token ID25737
Feature activation+9.53e-01

Pos logit contributions
 Ms+0.010
 while+0.009
 last+0.009
 teach+0.009
 praise+0.009
Neg logit contributions
nes-0.022
oll-0.021
vel-0.021
ode-0.021
aky-0.020
Token came
Token ID1625
Feature activation+6.37e-01

Pos logit contributions
 Ms+0.030
 praise+0.027
 last+0.027
 teach+0.027
 while+0.027
Neg logit contributions
nes-0.037
vel-0.037
oll-0.035
ode-0.035
aky-0.035
Token up
Token ID510
Feature activation+2.95e+00

Pos logit contributions
 Ms+0.020
 Mrs+0.018
 while+0.018
 Mr+0.017
 teach+0.017
Neg logit contributions
nes-0.026
vel-0.026
aky-0.025
ust-0.025
oll-0.025
Token to
Token ID284
Feature activation+1.03e+00

Pos logit contributions
glers+0.108
 Ms+0.105
 choice+0.102
 praise+0.102
 allows+0.099
Neg logit contributions
vel-0.086
aky-0.077
nes-0.075
ust-0.073
per-0.071
Token him
Token ID683
Feature activation-1.04e-01

Pos logit contributions
 Ms+0.023
 while+0.019
glers+0.019
 last+0.018
 Mrs+0.018
Neg logit contributions
nes-0.049
vel-0.047
ode-0.046
aky-0.043
anna-0.043
Token and
Token ID290
Feature activation+1.39e+00

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.005
aces+0.005
Neg logit contributions
 Ms-0.002
 while-0.002
 Mrs-0.002
 Mr-0.002
 last-0.002
Token said
Token ID531
Feature activation-5.06e-01

Pos logit contributions
 last+0.040
 Ms+0.039
 while+0.036
 before+0.036
 praise+0.036
Neg logit contributions
nes-0.054
anna-0.049
vel-0.048
aky-0.048
ust-0.047
Token,
Token ID11
Feature activation-1.03e-01

Pos logit contributions
nes+0.031
vel+0.030
oll+0.030
ode+0.029
aces+0.029
Neg logit contributions
 Ms-0.008
 while-0.007
 Mr-0.006
 Mrs-0.006
 teach-0.006
Token "
Token ID366
Feature activation+5.06e-01

Pos logit contributions
 last+0.029
 Ms+0.028
 praise+0.028
 teach+0.027
 visit+0.026
Neg logit contributions
nes-0.042
oll-0.041
vel-0.041
aces-0.039
ode-0.039
TokenDon
Token ID3987
Feature activation+1.03e+00

Pos logit contributions
 Ms+0.016
 last+0.014
 teach+0.014
 while+0.014
 Mrs+0.014
Neg logit contributions
nes-0.022
vel-0.021
ode-0.020
aky-0.020
ust-0.020
Token't
Token ID470
Feature activation+3.07e-01

Pos logit contributions
 Ms+0.019
 while+0.017
 praise+0.015
 Mrs+0.015
 teach+0.014
Neg logit contributions
nes-0.056
vel-0.054
oll-0.053
ode-0.052
ust-0.051
Token worry
Token ID5490
Feature activation+4.55e-01

Pos logit contributions
 Ms+0.011
 while+0.010
 Mrs+0.009
 last+0.009
 Mr+0.009
Neg logit contributions
nes-0.012
vel-0.012
ode-0.012
ust-0.011
oll-0.011
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
 Ms+0.010
 teach+0.009
 while+0.008
 praise+0.008
 last+0.008
Neg logit contributions
oll-0.023
vel-0.023
nes-0.023
ust-0.022
ode-0.022
Token little
Token ID1310
Feature activation+2.94e+00

Pos logit contributions
 Ms+0.029
glers+0.026
 praise+0.025
 while+0.025
 grace+0.025
Neg logit contributions
ode-0.041
oll-0.041
nes-0.041
vel-0.041
ust-0.040
Token mouse
Token ID10211
Feature activation+1.14e+00

Pos logit contributions
glers+0.112
 Ms+0.107
 early+0.102
 usual+0.102
bel+0.101
Neg logit contributions
 wing-0.087
ig-0.085
 egg-0.084
vel-0.082
ode-0.082
Token,
Token ID11
Feature activation+2.86e-01

Pos logit contributions
 Ms+0.034
 Mrs+0.028
 particular+0.027
 praise+0.027
 while+0.027
Neg logit contributions
nes-0.051
vel-0.050
oll-0.048
ode-0.047
onel-0.047
Token I
Token ID314
Feature activation+9.04e-01

Pos logit contributions
 Ms+0.008
 while+0.007
 Mrs+0.007
 last+0.007
 teach+0.007
Neg logit contributions
nes-0.013
vel-0.012
ode-0.012
oll-0.012
ust-0.012
Token will
Token ID481
Feature activation+1.09e+00

Pos logit contributions
 Ms+0.037
 Mrs+0.033
 while+0.031
 Mr+0.031
 before+0.030
Neg logit contributions
vel-0.033
nes-0.032
oll-0.031
ode-0.031
onel-0.031
Token rescue
Token ID9992
Feature activation+6.98e-01

Pos logit contributions
 Ms+0.027
glers+0.023
 while+0.022
 Mrs+0.021
 Mr+0.019
Neg logit contributions
nes-0.054
vel-0.054
ode-0.052
onel-0.050
oll-0.049
Token.
Token ID13
Feature activation-1.65e-01

Pos logit contributions
 Ms+0.012
 Mrs+0.010
 Mr+0.010
 Mitt+0.010
 last+0.010
Neg logit contributions
nes-0.019
vel-0.019
oll-0.018
aces-0.018
ust-0.017
Token One
Token ID1881
Feature activation+3.69e-01

Pos logit contributions
nes+0.009
vel+0.008
oll+0.008
ode+0.008
aces+0.008
Neg logit contributions
 Ms-0.004
 while-0.003
 Mrs-0.003
 last-0.003
 teach-0.003
Token day
Token ID1110
Feature activation+7.86e-01

Pos logit contributions
 Ms+0.011
 Mrs+0.010
 Mr+0.009
 teach+0.009
 praise+0.009
Neg logit contributions
nes-0.016
oll-0.016
vel-0.016
aky-0.015
ust-0.015
Token he
Token ID339
Feature activation+5.09e-01

Pos logit contributions
 Ms+0.022
 teach+0.018
 praise+0.018
 last+0.018
 Mrs+0.018
Neg logit contributions
vel-0.035
oll-0.034
nes-0.034
aces-0.033
ode-0.032
Token came
Token ID1625
Feature activation+1.03e+00

Pos logit contributions
 Ms+0.018
 Mrs+0.016
 while+0.016
 teach+0.015
 Mr+0.015
Neg logit contributions
nes-0.021
vel-0.019
oll-0.019
aky-0.018
ust-0.018
Token up
Token ID510
Feature activation+2.94e+00

Pos logit contributions
 Ms+0.036
 Mrs+0.031
glers+0.031
 Mitt+0.030
 praise+0.030
Neg logit contributions
nes-0.040
vel-0.039
ust-0.039
aky-0.038
oll-0.037
Token with
Token ID351
Feature activation+2.13e-01

Pos logit contributions
 Ms+0.098
glers+0.098
 praise+0.095
 Mitt+0.093
 choice+0.092
Neg logit contributions
vel-0.097
aky-0.089
nes-0.088
ust-0.088
per-0.083
Token a
Token ID257
Feature activation+7.74e-02

Pos logit contributions
 Ms+0.006
 while+0.005
 Mrs+0.005
 teach+0.005
 last+0.005
Neg logit contributions
nes-0.010
vel-0.010
oll-0.010
ode-0.009
aces-0.009
Token new
Token ID649
Feature activation+2.50e-01

Pos logit contributions
 Ms+0.002
 while+0.002
 Mrs+0.002
 Mr+0.002
 last+0.002
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
ust-0.003
Token game
Token ID983
Feature activation+4.81e-01

Pos logit contributions
 Ms+0.009
 Mrs+0.008
 Mr+0.008
 while+0.008
 Mitt+0.007
Neg logit contributions
nes-0.010
vel-0.010
oll-0.009
ode-0.009
aces-0.009
Token.
Token ID13
Feature activation-3.43e-02

Pos logit contributions
 Ms+0.012
 Mrs+0.010
 Mr+0.010
 Mitt+0.010
 while+0.010
Neg logit contributions
nes-0.024
oll-0.023
vel-0.023
aces-0.023
ode-0.022
Token.
Token ID13
Feature activation+8.38e-01

Pos logit contributions
 Ms+0.003
 while+0.003
 last+0.003
 teach+0.003
 Mrs+0.003
Neg logit contributions
nes-0.008
vel-0.008
oll-0.008
ode-0.007
aces-0.007
Token "
Token ID366
Feature activation+2.74e-01

Pos logit contributions
 last+0.022
 teach+0.021
 Ms+0.021
 praise+0.020
 while+0.020
Neg logit contributions
nes-0.036
oll-0.035
vel-0.035
anna-0.033
ode-0.032
TokenDon
Token ID3987
Feature activation+8.44e-01

Pos logit contributions
 Ms+0.008
 while+0.007
 last+0.007
 teach+0.007
 Mrs+0.007
Neg logit contributions
nes-0.012
vel-0.012
ode-0.012
oll-0.011
ust-0.011
Token't
Token ID470
Feature activation+3.00e-01

Pos logit contributions
 Ms+0.014
 while+0.012
 Mrs+0.010
 praise+0.010
 teach+0.010
Neg logit contributions
nes-0.048
vel-0.046
oll-0.046
ode-0.045
ust-0.044
Token worry
Token ID5490
Feature activation+1.98e-01

Pos logit contributions
 Ms+0.011
 while+0.009
 Mrs+0.009
 last+0.009
 Mr+0.009
Neg logit contributions
nes-0.012
vel-0.011
ode-0.011
ust-0.011
oll-0.010
Token little
Token ID1310
Feature activation+2.89e+00

Pos logit contributions
 Ms+0.004
 while+0.004
 teach+0.004
 last+0.003
 Mrs+0.003
Neg logit contributions
nes-0.010
vel-0.010
oll-0.010
ode-0.010
ust-0.010
Token friend
Token ID1545
Feature activation+9.62e-01

Pos logit contributions
 early+0.109
glers+0.105
 Ms+0.104
 usual+0.101
bel+0.097
Neg logit contributions
 wing-0.078
 egg-0.076
ode-0.076
vel-0.075
ig-0.073
Token,
Token ID11
Feature activation+6.24e-01

Pos logit contributions
 Ms+0.026
 grace+0.022
 teach+0.022
 Mitt+0.021
 praise+0.021
Neg logit contributions
nes-0.043
vel-0.043
ode-0.041
oll-0.040
ust-0.040
Token I
Token ID314
Feature activation+7.61e-01

Pos logit contributions
 Ms+0.017
 while+0.016
 last+0.015
glers+0.014
 teach+0.014
Neg logit contributions
nes-0.027
ode-0.026
vel-0.026
oll-0.026
ust-0.025
Token will
Token ID481
Feature activation+7.46e-01

Pos logit contributions
 Ms+0.032
 Mrs+0.028
 while+0.028
 Mr+0.027
glers+0.026
Neg logit contributions
nes-0.026
vel-0.026
ode-0.024
oll-0.024
ust-0.024
Token help
Token ID1037
Feature activation+1.21e+00

Pos logit contributions
 Ms+0.017
 while+0.013
 Mrs+0.013
glers+0.012
 Mr+0.012
Neg logit contributions
nes-0.040
vel-0.039
ode-0.037
oll-0.036
onel-0.036
Token One
Token ID1881
Feature activation+2.89e-01

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
aces+0.001
Neg logit contributions
 Ms-0.001
 while-0.000
 last-0.000
 Mrs-0.000
 teach-0.000
Token day
Token ID1110
Feature activation+5.58e-01

Pos logit contributions
 Ms+0.009
 Mrs+0.007
 Mr+0.007
 teach+0.007
 praise+0.007
Neg logit contributions
nes-0.013
oll-0.012
vel-0.012
ust-0.012
aky-0.012
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
 Ms+0.013
 last+0.012
 teach+0.012
 praise+0.011
glers+0.011
Neg logit contributions
vel-0.027
nes-0.026
oll-0.026
aces-0.024
ode-0.024
Token she
Token ID673
Feature activation-1.50e-02

Pos logit contributions
 Ms+0.027
 praise+0.027
 thanks+0.027
 teach+0.026
 last+0.025
Neg logit contributions
nes-0.043
vel-0.040
oll-0.039
aces-0.038
ode-0.036
Token came
Token ID1625
Feature activation+8.10e-01

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
aces+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 Mrs-0.000
 Mr-0.000
 last-0.000
Token up
Token ID510
Feature activation+2.87e+00

Pos logit contributions
 Ms+0.028
 Mr+0.025
 Mrs+0.025
 teach+0.024
glers+0.024
Neg logit contributions
nes-0.032
vel-0.032
ust-0.030
aky-0.030
ode-0.029
Token with
Token ID351
Feature activation+4.30e-01

Pos logit contributions
glers+0.094
 Ms+0.092
 praise+0.088
 Mitt+0.088
 choice+0.087
Neg logit contributions
vel-0.101
nes-0.087
aky-0.085
ust-0.083
per-0.083
Token a
Token ID257
Feature activation-1.27e-01

Pos logit contributions
 Ms+0.012
 while+0.011
 Mr+0.009
 teach+0.009
 Mrs+0.009
Neg logit contributions
nes-0.019
vel-0.019
oll-0.019
aces-0.018
ode-0.018
Token great
Token ID1049
Feature activation-2.41e-01

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.006
ust+0.005
Neg logit contributions
 Ms-0.004
 while-0.003
 Mrs-0.003
 Mr-0.003
 last-0.003
Token idea
Token ID2126
Feature activation+4.59e-02

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.010
aces+0.010
Neg logit contributions
 Ms-0.007
 while-0.007
 Mrs-0.006
 Mr-0.006
 last-0.006
Token.
Token ID13
Feature activation-9.27e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 last+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
aces-0.002
Token park
Token ID3952
Feature activation+1.76e+00

Pos logit contributions
nes+0.083
vel+0.082
aces+0.081
oll+0.080
ust+0.077
Neg logit contributions
 Ms-0.036
 visit-0.035
 Mitt-0.034
 last-0.033
 while-0.033
Token when
Token ID618
Feature activation+5.20e-01

Pos logit contributions
 Ms+0.048
glers+0.042
 praise+0.041
 Mr+0.040
 Mrs+0.040
Neg logit contributions
vel-0.078
nes-0.077
adow-0.075
oll-0.073
aces-0.073
Token a
Token ID257
Feature activation+3.68e-02

Pos logit contributions
 Ms+0.014
 teach+0.013
 praise+0.012
 while+0.012
 last+0.012
Neg logit contributions
nes-0.023
oll-0.023
vel-0.022
aces-0.021
ode-0.021
Token boy
Token ID2933
Feature activation+1.05e+00

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 teach+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
ust-0.002
Token came
Token ID1625
Feature activation+9.39e-01

Pos logit contributions
 Ms+0.037
 last+0.032
 Mrs+0.032
 praise+0.031
 Mr+0.031
Neg logit contributions
nes-0.039
vel-0.038
ode-0.037
oll-0.037
aky-0.036
Token up
Token ID510
Feature activation+2.87e+00

Pos logit contributions
 Ms+0.031
 Mrs+0.027
 while+0.026
 praise+0.025
 teach+0.025
Neg logit contributions
nes-0.037
vel-0.036
aky-0.036
ode-0.035
ust-0.035
Token to
Token ID284
Feature activation+7.96e-01

Pos logit contributions
 Ms+0.102
glers+0.097
 Mitt+0.097
 allows+0.095
 praise+0.094
Neg logit contributions
vel-0.081
nes-0.080
aky-0.075
ust-0.073
per-0.069
Token her
Token ID607
Feature activation+2.08e-01

Pos logit contributions
 Ms+0.022
 while+0.020
glers+0.019
 last+0.019
 Mitt+0.018
Neg logit contributions
nes-0.034
vel-0.032
ode-0.030
oll-0.030
aky-0.029
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
 Ms+0.006
 while+0.005
 Mrs+0.005
 Mr+0.004
 Mitt+0.004
Neg logit contributions
nes-0.010
oll-0.010
vel-0.010
ode-0.010
aces-0.009
Token asked
Token ID1965
Feature activation+2.65e-01

Pos logit contributions
 Ms+0.042
 last+0.039
 while+0.037
glers+0.036
 hire+0.036
Neg logit contributions
nes-0.048
anna-0.044
vel-0.043
aky-0.042
ode-0.042
Token if
Token ID611
Feature activation+1.05e+00

Pos logit contributions
 Ms+0.007
 last+0.006
 teach+0.006
 while+0.006
 praise+0.006
Neg logit contributions
vel-0.012
nes-0.012
oll-0.012
aces-0.011
ust-0.011
Token searched
Token ID16499
Feature activation+1.01e+00

Pos logit contributions
oll+0.061
nes+0.059
vel+0.059
aces+0.059
ode+0.056
Neg logit contributions
 while-0.046
 Ms-0.043
 last-0.041
 every-0.040
 visit-0.040
Token until
Token ID1566
Feature activation+6.29e-02

Pos logit contributions
 Ms+0.032
 teach+0.028
 Mrs+0.027
 Mr+0.027
 Mitt+0.026
Neg logit contributions
vel-0.041
nes-0.040
aces-0.039
aky-0.039
ode-0.038
Token she
Token ID673
Feature activation-6.49e-01

Pos logit contributions
 Ms+0.001
 teach+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
aces-0.003
Token finally
Token ID3443
Feature activation-4.25e-01

Pos logit contributions
nes+0.028
vel+0.027
oll+0.027
ode+0.026
aces+0.026
Neg logit contributions
 Ms-0.021
 while-0.020
 last-0.019
 Mrs-0.018
 Mr-0.018
Token found
Token ID1043
Feature activation-8.46e-02

Pos logit contributions
nes+0.016
vel+0.016
oll+0.016
ode+0.015
aces+0.015
Neg logit contributions
 Ms-0.016
 while-0.015
 last-0.014
 Mrs-0.014
 Mr-0.014
Token the
Token ID262
Feature activation+2.86e+00

Pos logit contributions
nes+0.005
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 Mrs-0.001
 teach-0.001
 Mr-0.001
Token video
Token ID2008
Feature activation-8.52e-01

Pos logit contributions
 Ms+0.101
Dr+0.100
thank+0.100
mun+0.099
Thank+0.098
Neg logit contributions
vel-0.066
aky-0.061
nes-0.059
 pad-0.057
oll-0.056
Token.
Token ID13
Feature activation+4.01e-01

Pos logit contributions
nes+0.048
vel+0.047
ode+0.046
oll+0.046
ust+0.045
Neg logit contributions
 Ms-0.016
 while-0.015
 last-0.013
 Mrs-0.013
 Mr-0.013
Token She
Token ID1375
Feature activation-2.69e-01

Pos logit contributions
 Ms+0.013
 while+0.013
 last+0.012
 teach+0.012
glers+0.012
Neg logit contributions
nes-0.016
oll-0.015
vel-0.015
aces-0.014
ode-0.014
Token was
Token ID373
Feature activation-7.63e-01

Pos logit contributions
nes+0.014
vel+0.014
oll+0.013
ode+0.013
aces+0.013
Neg logit contributions
 Ms-0.006
 while-0.006
 Mrs-0.005
 Mr-0.005
 last-0.005
Token so
Token ID523
Feature activation-1.75e-01

Pos logit contributions
nes+0.031
vel+0.030
ode+0.030
oll+0.030
aces+0.029
Neg logit contributions
 Ms-0.027
 while-0.025
 Mrs-0.024
 Mr-0.024
 last-0.024
Token mom
Token ID1995
Feature activation+3.49e-01

Pos logit contributions
 Ms+0.008
 while+0.007
 Mrs+0.007
 Mr+0.007
 teach+0.007
Neg logit contributions
nes-0.010
oll-0.010
vel-0.010
aces-0.009
ode-0.009
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
 Ms+0.009
 while+0.007
 praise+0.007
 Mrs+0.007
 Mr+0.007
Neg logit contributions
nes-0.017
vel-0.017
ode-0.016
oll-0.016
aky-0.016
Token A
Token ID317
Feature activation-2.08e-01

Pos logit contributions
 Ms+0.010
 while+0.009
 last+0.009
 teach+0.009
 Mrs+0.008
Neg logit contributions
nes-0.012
oll-0.012
vel-0.012
aces-0.011
ode-0.011
Token stranger
Token ID16195
Feature activation+7.91e-01

Pos logit contributions
nes+0.010
vel+0.010
oll+0.010
ode+0.010
aces+0.010
Neg logit contributions
 Ms-0.005
 while-0.005
 Mrs-0.004
 Mr-0.004
 last-0.004
Token came
Token ID1625
Feature activation+6.87e-01

Pos logit contributions
 Ms+0.025
 while+0.022
 last+0.021
 Mrs+0.021
 Mitt+0.021
Neg logit contributions
nes-0.032
vel-0.032
oll-0.031
ode-0.030
aces-0.029
Token up
Token ID510
Feature activation+2.86e+00

Pos logit contributions
 Ms+0.022
 teach+0.019
 Mrs+0.019
 Mitt+0.018
 Mr+0.018
Neg logit contributions
nes-0.028
vel-0.028
aky-0.027
ust-0.027
ode-0.026
Token to
Token ID284
Feature activation+9.37e-01

Pos logit contributions
glers+0.110
 expiration+0.105
mun+0.104
 praise+0.104
 Ms+0.103
Neg logit contributions
nes-0.070
vel-0.068
per-0.062
aky-0.062
ful-0.061
Token them
Token ID606
Feature activation+2.65e-01

Pos logit contributions
 Ms+0.027
 while+0.026
glers+0.025
 downward+0.024
 last+0.023
Neg logit contributions
nes-0.038
vel-0.035
ode-0.032
oll-0.031
aky-0.031
Token and
Token ID290
Feature activation+1.07e+00

Pos logit contributions
 Ms+0.007
 Mrs+0.006
 while+0.006
 Mr+0.006
 Mitt+0.006
Neg logit contributions
nes-0.013
vel-0.012
oll-0.012
ode-0.012
aces-0.012
Token asked
Token ID1965
Feature activation+6.41e-01

Pos logit contributions
 Ms+0.029
 last+0.027
 while+0.026
 praise+0.025
 Mitt+0.025
Neg logit contributions
nes-0.046
vel-0.041
oll-0.040
ode-0.039
anna-0.039
Token for
Token ID329
Feature activation+6.51e-01

Pos logit contributions
 Ms+0.018
 last+0.017
 teach+0.016
 praise+0.016
glers+0.016
Neg logit contributions
vel-0.027
nes-0.027
oll-0.026
aces-0.024
ust-0.024
Token,
Token ID11
Feature activation+2.40e-02

Pos logit contributions
 Ms+0.006
 while+0.005
 Mrs+0.005
 last+0.005
 teach+0.005
Neg logit contributions
nes-0.011
vel-0.010
oll-0.010
ode-0.010
aces-0.010
Token named
Token ID3706
Feature activation+1.65e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 teach+0.000
 Mrs+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token Linda
Token ID21776
Feature activation+5.25e-01

Pos logit contributions
 Ms+0.004
 while+0.004
 last+0.004
 teach+0.004
 Mrs+0.004
Neg logit contributions
nes-0.008
vel-0.008
oll-0.007
ode-0.007
aces-0.007
Token,
Token ID11
Feature activation+4.37e-03

Pos logit contributions
 Ms+0.012
 while+0.010
 last+0.010
 Mrs+0.010
 teach+0.010
Neg logit contributions
nes-0.027
vel-0.026
oll-0.025
ode-0.025
aces-0.025
Token came
Token ID1625
Feature activation+7.39e-01

Pos logit contributions
 Ms+0.000
 while+0.000
 last+0.000
 teach+0.000
 Mrs+0.000
Neg logit contributions
nes-0.000
vel-0.000
oll-0.000
ode-0.000
ust-0.000
Token up
Token ID510
Feature activation+2.84e+00

Pos logit contributions
 Ms+0.025
 Mrs+0.022
 Mr+0.021
 Mitt+0.021
 teach+0.021
Neg logit contributions
nes-0.030
vel-0.030
ust-0.028
ode-0.028
oll-0.027
Token to
Token ID284
Feature activation+4.50e-01

Pos logit contributions
 Mitt+0.095
glers+0.095
 Ms+0.094
 praise+0.088
mun+0.085
Neg logit contributions
vel-0.095
nes-0.084
ust-0.083
per-0.081
aky-0.079
Token him
Token ID683
Feature activation+1.40e-01

Pos logit contributions
 Ms+0.012
 while+0.010
 Mrs+0.010
 last+0.009
 Mr+0.009
Neg logit contributions
nes-0.021
vel-0.020
ode-0.019
oll-0.019
aces-0.018
Token and
Token ID290
Feature activation+3.71e-01

Pos logit contributions
 Ms+0.003
 while+0.003
 Mrs+0.003
 Mr+0.003
 Mitt+0.003
Neg logit contributions
nes-0.007
vel-0.007
oll-0.007
ode-0.007
aces-0.007
Token asked
Token ID1965
Feature activation-1.92e-01

Pos logit contributions
 Ms+0.009
 while+0.008
 last+0.008
 teach+0.008
 Mitt+0.008
Neg logit contributions
nes-0.018
vel-0.017
oll-0.017
ode-0.016
ust-0.016
Token if
Token ID611
Feature activation+6.56e-01

Pos logit contributions
nes+0.009
vel+0.009
oll+0.009
ode+0.009
aces+0.009
Neg logit contributions
 Ms-0.005
 while-0.005
 Mrs-0.004
 last-0.004
 Mr-0.004
Token took
Token ID1718
Feature activation-1.00e+00

Pos logit contributions
nes+0.019
vel+0.018
oll+0.018
ode+0.018
ust+0.017
Neg logit contributions
 Ms-0.011
 while-0.010
 Mrs-0.009
 Mr-0.009
 teach-0.009
Token good
Token ID922
Feature activation+4.09e-01

Pos logit contributions
nes+0.057
ode+0.056
ust+0.054
vel+0.054
oll+0.053
Neg logit contributions
 Ms-0.020
 while-0.020
 last-0.018
 Mr-0.017
 teach-0.017
Token care
Token ID1337
Feature activation+1.60e-01

Pos logit contributions
 Ms+0.012
 Mrs+0.010
 Mr+0.010
 while+0.010
 Mitt+0.009
Neg logit contributions
vel-0.019
nes-0.019
oll-0.019
ode-0.018
aces-0.018
Token of
Token ID286
Feature activation-8.54e-02

Pos logit contributions
 Ms+0.004
 Mrs+0.003
 Mr+0.003
 while+0.003
 teach+0.003
Neg logit contributions
nes-0.008
vel-0.008
oll-0.008
aces-0.008
ode-0.008
Token his
Token ID465
Feature activation+2.64e+00

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 Mrs-0.002
 last-0.002
 Mr-0.002
Token new
Token ID649
Feature activation+2.83e+00

Pos logit contributions
Thank+0.095
glers+0.094
 Ms+0.093
thank+0.092
Dr+0.092
Neg logit contributions
vel-0.071
aky-0.068
aces-0.064
oll-0.064
ocket-0.063
Token tablet
Token ID14147
Feature activation-1.13e-01

Pos logit contributions
Thank+0.125
Dr+0.125
 Ms+0.119
glers+0.118
Grab+0.116
Neg logit contributions
vel-0.058
aky-0.049
 pad-0.049
ont-0.048
aces-0.045
Token so
Token ID523
Feature activation-3.76e-02

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.006
aces+0.005
Neg logit contributions
 Ms-0.003
 while-0.002
 Mrs-0.002
 Mr-0.002
 teach-0.002
Token it
Token ID340
Feature activation+4.55e-02

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ode+0.002
aces+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 teach-0.001
 Mr-0.001
Token wouldn
Token ID3636
Feature activation+1.63e+00

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 teach+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
aces-0.002
Token't
Token ID470
Feature activation-6.55e-02

Pos logit contributions
 Ms+0.038
 while+0.032
 Mrs+0.030
 praise+0.030
Thank+0.028
Neg logit contributions
nes-0.087
vel-0.080
oll-0.077
onel-0.076
ust-0.074
Token up
Token ID510
Feature activation+1.17e+00

Pos logit contributions
 Ms+0.003
 while+0.003
 Mrs+0.003
 Mr+0.003
 Mitt+0.003
Neg logit contributions
nes-0.005
vel-0.005
ode-0.005
oll-0.005
aces-0.005
Token,
Token ID11
Feature activation+9.74e-01

Pos logit contributions
 Ms+0.036
 teach+0.032
 Mitt+0.032
glers+0.030
 Mrs+0.030
Neg logit contributions
vel-0.048
nes-0.048
aces-0.047
oll-0.044
ust-0.044
Token then
Token ID788
Feature activation+1.42e-01

Pos logit contributions
 Ms+0.029
 Mitt+0.025
 Mrs+0.025
 last+0.024
glers+0.024
Neg logit contributions
vel-0.040
nes-0.040
aces-0.039
ode-0.039
oll-0.037
Token come
Token ID1282
Feature activation+9.81e-01

Pos logit contributions
 Ms+0.004
 while+0.003
 Mrs+0.003
 Mr+0.003
 teach+0.003
Neg logit contributions
nes-0.007
vel-0.006
oll-0.006
ode-0.006
aces-0.006
Token back
Token ID736
Feature activation+9.78e-01

Pos logit contributions
 Ms+0.042
 Mrs+0.037
 Mitt+0.036
 Mr+0.036
 praise+0.034
Neg logit contributions
nes-0.033
vel-0.032
ode-0.032
aces-0.030
ust-0.030
Token down
Token ID866
Feature activation+2.83e+00

Pos logit contributions
 Ms+0.031
 Mitt+0.026
 teach+0.026
 Mrs+0.026
 praise+0.026
Neg logit contributions
nes-0.040
vel-0.039
oll-0.038
aces-0.038
adow-0.037
Token to
Token ID284
Feature activation+6.12e-02

Pos logit contributions
 Mitt+0.100
glers+0.099
mun+0.092
 hire+0.091
 Ms+0.091
Neg logit contributions
vel-0.080
 itself-0.074
per-0.070
nes-0.070
aund-0.069
Token her
Token ID607
Feature activation-5.24e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 last+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
aces-0.003
Token tree
Token ID5509
Feature activation+2.04e-01

Pos logit contributions
nes+0.016
vel+0.015
oll+0.015
ode+0.014
ust+0.014
Neg logit contributions
 Ms-0.024
 while-0.023
 last-0.022
 Mr-0.022
 teach-0.022
Token.
Token ID13
Feature activation+4.10e-01

Pos logit contributions
 Ms+0.005
 Mrs+0.004
 while+0.004
 Mr+0.004
 Mitt+0.004
Neg logit contributions
nes-0.010
vel-0.010
oll-0.010
ode-0.010
aces-0.010
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
 Ms+0.010
 while+0.009
 last+0.009
 teach+0.009
 Mrs+0.008
Neg logit contributions
nes-0.020
oll-0.019
vel-0.019
aces-0.019
ode-0.018
TokenOne
Token ID3198
Feature activation+3.66e-01

Pos logit contributions
nes+0.004
vel+0.004
oll+0.003
ode+0.003
ust+0.003
Neg logit contributions
 Ms-0.002
 while-0.002
 Mr-0.002
 Mrs-0.002
 last-0.002
Token day
Token ID1110
Feature activation+8.20e-01

Pos logit contributions
 Ms+0.011
 Mrs+0.009
 teach+0.009
 Mr+0.009
 praise+0.009
Neg logit contributions
oll-0.016
nes-0.016
vel-0.015
aky-0.015
ust-0.015
Token,
Token ID11
Feature activation+1.25e+00

Pos logit contributions
 Ms+0.022
 praise+0.021
 teach+0.020
glers+0.020
 last+0.020
Neg logit contributions
vel-0.035
oll-0.035
nes-0.034
aces-0.032
aund-0.031
Token she
Token ID673
Feature activation+2.02e-01

Pos logit contributions
glers+0.041
 praise+0.040
 hire+0.038
 teach+0.038
 Ms+0.038
Neg logit contributions
nes-0.040
oll-0.038
aces-0.037
vel-0.037
alo-0.034
Token came
Token ID1625
Feature activation+1.18e+00

Pos logit contributions
 Ms+0.007
 while+0.006
 Mrs+0.006
 teach+0.006
 Mr+0.006
Neg logit contributions
nes-0.009
vel-0.008
oll-0.008
ode-0.008
aky-0.008
Token up
Token ID510
Feature activation+2.83e+00

Pos logit contributions
 Ms+0.044
glers+0.041
 Mrs+0.039
 Mr+0.038
 praise+0.038
Neg logit contributions
nes-0.043
vel-0.043
aky-0.041
ust-0.041
oll-0.038
Token with
Token ID351
Feature activation+5.37e-01

Pos logit contributions
glers+0.106
 Ms+0.096
 Mitt+0.095
 choice+0.093
 praise+0.093
Neg logit contributions
vel-0.092
nes-0.083
aky-0.078
ust-0.076
per-0.075
Token an
Token ID281
Feature activation-2.29e+00

Pos logit contributions
 Ms+0.015
 while+0.014
glers+0.012
 teach+0.012
 before+0.012
Neg logit contributions
oll-0.023
vel-0.023
nes-0.023
aces-0.022
ode-0.021
Token idea
Token ID2126
Feature activation+2.40e-01

Pos logit contributions
nes+0.128
lor+0.117
vel+0.116
ode+0.113
emic+0.112
Neg logit contributions
 teach-0.047
 last-0.047
 Ms-0.045
 while-0.043
 work-0.042
Token.
Token ID13
Feature activation-2.58e-01

Pos logit contributions
 Ms+0.006
 Mrs+0.005
 Mr+0.005
 while+0.005
 last+0.004
Neg logit contributions
nes-0.012
vel-0.012
oll-0.012
aces-0.012
ode-0.012
Token She
Token ID1375
Feature activation+1.91e-01

Pos logit contributions
nes+0.012
vel+0.012
oll+0.011
ode+0.011
aces+0.011
Neg logit contributions
 Ms-0.008
 while-0.007
 Mrs-0.007
 Mr-0.007
 teach-0.006
Token his
Token ID465
Feature activation+1.07e+00

Pos logit contributions
 Ms+0.005
 while+0.005
 last+0.004
 Mrs+0.004
 teach+0.004
Neg logit contributions
nes-0.007
vel-0.006
oll-0.006
ode-0.006
aces-0.006
Token dad
Token ID9955
Feature activation+5.42e-01

Pos logit contributions
 Ms+0.052
 while+0.048
 Mitt+0.047
glers+0.047
 Mrs+0.046
Neg logit contributions
oll-0.026
vel-0.022
nes-0.022
aky-0.022
ode-0.021
Token for
Token ID329
Feature activation-5.99e-01

Pos logit contributions
 Ms+0.014
 teach+0.012
 Mitt+0.012
 while+0.012
 Mrs+0.012
Neg logit contributions
vel-0.025
nes-0.025
oll-0.025
ode-0.024
ust-0.023
Token replacing
Token ID13586
Feature activation+1.25e+00

Pos logit contributions
nes+0.033
oll+0.032
vel+0.032
ode+0.031
ust+0.031
Neg logit contributions
 Ms-0.013
 while-0.012
 Mrs-0.011
 teach-0.011
 Mr-0.011
Token his
Token ID465
Feature activation+2.17e+00

Pos logit contributions
 Ms+0.033
Thank+0.031
 while+0.031
 last+0.029
 visit+0.029
Neg logit contributions
vel-0.053
nes-0.051
aces-0.049
oll-0.049
aky-0.049
Token old
Token ID1468
Feature activation+2.82e+00

Pos logit contributions
Thank+0.082
 Ms+0.082
glers+0.081
 Mrs+0.077
thank+0.077
Neg logit contributions
vel-0.061
aky-0.059
aces-0.057
ocket-0.053
oll-0.053
Token tablet
Token ID14147
Feature activation+2.88e-01

Pos logit contributions
Thank+0.121
 Ms+0.120
glers+0.118
 Mrs+0.113
Grab+0.112
Neg logit contributions
vel-0.059
aces-0.056
 pad-0.055
aky-0.053
glass-0.052
Token with
Token ID351
Feature activation+1.11e+00

Pos logit contributions
 Ms+0.007
 Mrs+0.006
 while+0.005
 Mr+0.005
 Mitt+0.005
Neg logit contributions
nes-0.015
vel-0.015
oll-0.014
aces-0.014
ode-0.014
Token a
Token ID257
Feature activation+1.33e+00

Pos logit contributions
 Ms+0.036
 Mrs+0.029
 while+0.029
glers+0.029
 downward+0.028
Neg logit contributions
vel-0.043
aces-0.042
oll-0.041
nes-0.041
aky-0.040
Token new
Token ID649
Feature activation+2.40e+00

Pos logit contributions
 Ms+0.042
glers+0.037
 Mrs+0.035
 Mr+0.034
 Mitt+0.034
Neg logit contributions
vel-0.052
nes-0.049
aky-0.047
ode-0.045
 pad-0.045
Token one
Token ID530
Feature activation-1.34e-01

Pos logit contributions
 Ms+0.091
glers+0.089
Thank+0.086
Dr+0.084
 Mrs+0.084
Neg logit contributions
 pad-0.065
vel-0.063
nes-0.060
aky-0.059
 mate-0.058
Token he
Token ID339
Feature activation+1.00e-01

Pos logit contributions
 Ms+0.007
 while+0.006
 Mrs+0.006
 teach+0.006
 last+0.006
Neg logit contributions
nes-0.012
oll-0.012
vel-0.012
aces-0.011
ode-0.011
Token had
Token ID550
Feature activation-6.10e-01

Pos logit contributions
 Ms+0.003
 while+0.003
 Mrs+0.003
 Mr+0.003
 last+0.003
Neg logit contributions
nes-0.004
vel-0.004
oll-0.004
ode-0.004
aces-0.004
Token chosen
Token ID7147
Feature activation-3.55e-02

Pos logit contributions
nes+0.031
oll+0.031
vel+0.031
ode+0.030
aces+0.029
Neg logit contributions
 Ms-0.015
 while-0.014
 teach-0.013
 Mrs-0.012
 last-0.012
Token to
Token ID284
Feature activation+2.74e-02

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ode+0.002
aces+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 Mr-0.001
 last-0.001
Token watch
Token ID2342
Feature activation+7.33e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 Mitt+0.001
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token the
Token ID262
Feature activation+2.81e+00

Pos logit contributions
 Ms+0.016
 while+0.014
 last+0.013
 Mrs+0.013
 Mitt+0.013
Neg logit contributions
vel-0.036
nes-0.036
oll-0.035
aces-0.034
ode-0.034
Token fun
Token ID1257
Feature activation+7.16e-01

Pos logit contributions
 Ms+0.110
Dr+0.107
 while+0.105
 Mrs+0.105
bel+0.104
Neg logit contributions
vel-0.058
nes-0.049
od-0.048
orry-0.047
ils-0.043
Token and
Token ID290
Feature activation+1.80e-01

Pos logit contributions
 Ms+0.020
 Mrs+0.018
 Mr+0.017
 Mitt+0.017
 last+0.017
Neg logit contributions
nes-0.032
vel-0.031
oll-0.029
aces-0.028
ust-0.028
Token easy
Token ID2562
Feature activation+1.32e+00

Pos logit contributions
 Ms+0.005
 while+0.004
 Mrs+0.004
 Mr+0.004
 Mitt+0.004
Neg logit contributions
nes-0.009
vel-0.009
oll-0.008
ode-0.008
aces-0.008
Token video
Token ID2008
Feature activation-2.10e-01

Pos logit contributions
 Ms+0.041
 Mrs+0.037
 Mr+0.036
 Mitt+0.035
Thank+0.035
Neg logit contributions
vel-0.053
nes-0.051
oll-0.046
aces-0.046
onel-0.043
Token.
Token ID13
Feature activation+2.42e-01

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.010
aces+0.010
Neg logit contributions
 Ms-0.005
 while-0.004
 Mrs-0.004
 Mr-0.004
 last-0.004
Token then
Token ID788
Feature activation+5.44e-01

Pos logit contributions
 Ms+0.005
 teach+0.004
 while+0.004
 Mrs+0.004
 praise+0.004
Neg logit contributions
nes-0.010
vel-0.010
oll-0.010
ode-0.009
ust-0.009
Token a
Token ID257
Feature activation+4.91e-01

Pos logit contributions
 Ms+0.013
 teach+0.011
 praise+0.011
 last+0.011
 while+0.010
Neg logit contributions
oll-0.026
nes-0.026
vel-0.026
ust-0.024
aces-0.024
Token big
Token ID1263
Feature activation-9.35e-01

Pos logit contributions
 Ms+0.015
 while+0.013
 Mrs+0.013
 Mr+0.013
 teach+0.013
Neg logit contributions
nes-0.020
vel-0.020
oll-0.020
ode-0.019
aky-0.019
Token cat
Token ID3797
Feature activation+1.65e-01

Pos logit contributions
nes+0.049
vel+0.046
oll+0.045
ust+0.045
ode+0.044
Neg logit contributions
 Ms-0.025
 while-0.024
 Mr-0.023
 last-0.022
 teach-0.022
Token comes
Token ID2058
Feature activation+1.19e+00

Pos logit contributions
 Ms+0.005
 while+0.005
 Mrs+0.005
 Mr+0.005
 last+0.005
Neg logit contributions
nes-0.007
vel-0.007
oll-0.007
ode-0.006
aces-0.006
Token and
Token ID290
Feature activation+2.78e+00

Pos logit contributions
 Ms+0.041
 Mrs+0.036
Thank+0.036
 praise+0.035
 teach+0.035
Neg logit contributions
vel-0.047
nes-0.045
ust-0.045
aky-0.045
aces-0.042
Token sees
Token ID7224
Feature activation-2.39e-01

Pos logit contributions
 hire+0.104
thank+0.102
 whenever+0.101
 choice+0.097
 praise+0.096
Neg logit contributions
vel-0.069
oll-0.067
ox-0.065
aund-0.064
aces-0.063
Token the
Token ID262
Feature activation+9.53e-01

Pos logit contributions
nes+0.012
vel+0.012
oll+0.012
ode+0.012
aces+0.011
Neg logit contributions
 Ms-0.006
 while-0.005
 Mrs-0.005
 Mr-0.005
 last-0.005
Token toy
Token ID13373
Feature activation+1.10e-01

Pos logit contributions
 Ms+0.039
 Mrs+0.033
 while+0.033
 whenever+0.032
 Mr+0.032
Neg logit contributions
ode-0.030
oll-0.029
aces-0.029
aky-0.029
vel-0.028
Token birds
Token ID10087
Feature activation-1.07e-01

Pos logit contributions
 Ms+0.003
 while+0.003
 Mrs+0.002
 Mr+0.002
 teach+0.002
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.005
aces-0.005
Token.
Token ID13
Feature activation-4.46e-01

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.005
aces+0.005
Neg logit contributions
 Ms-0.003
 while-0.002
 Mrs-0.002
 Mr-0.002
 teach-0.002
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
 Ms+0.014
 teach+0.013
 last+0.013
 praise+0.013
glers+0.013
Neg logit contributions
vel-0.030
nes-0.029
oll-0.029
aces-0.027
anna-0.026
Token a
Token ID257
Feature activation-1.54e-01

Pos logit contributions
 Ms+0.027
 praise+0.027
 teach+0.026
glers+0.026
 thanks+0.025
Neg logit contributions
nes-0.040
vel-0.038
oll-0.037
aces-0.036
alo-0.034
Token little
Token ID1310
Feature activation-2.07e-01

Pos logit contributions
nes+0.008
vel+0.007
oll+0.007
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.004
 while-0.003
 Mrs-0.003
 Mr-0.003
 last-0.003
Token girl
Token ID2576
Feature activation+7.51e-01

Pos logit contributions
nes+0.009
vel+0.009
oll+0.009
ode+0.009
aces+0.008
Neg logit contributions
 Ms-0.006
 while-0.006
 Mrs-0.005
 Mr-0.005
 teach-0.005
Token came
Token ID1625
Feature activation+1.03e+00

Pos logit contributions
 Ms+0.025
 last+0.022
 teach+0.022
 Mr+0.022
 Mrs+0.021
Neg logit contributions
nes-0.030
vel-0.029
oll-0.029
ode-0.028
aky-0.027
Token up
Token ID510
Feature activation+2.76e+00

Pos logit contributions
 Ms+0.034
 Mrs+0.030
 Mr+0.029
 praise+0.029
 teach+0.028
Neg logit contributions
vel-0.041
nes-0.041
ust-0.039
aky-0.039
ode-0.038
Token to
Token ID284
Feature activation+9.08e-01

Pos logit contributions
glers+0.104
 Ms+0.103
 praise+0.102
 choice+0.101
 teach+0.098
Neg logit contributions
vel-0.069
nes-0.064
ust-0.063
aky-0.060
adow-0.060
Token her
Token ID607
Feature activation+6.62e-01

Pos logit contributions
 Ms+0.022
glers+0.022
 while+0.021
s+0.019
 last+0.019
Neg logit contributions
nes-0.039
vel-0.037
ode-0.036
ocket-0.034
oll-0.034
Token,
Token ID11
Feature activation+1.53e-02

Pos logit contributions
 Ms+0.022
 while+0.018
glers+0.018
 Mrs+0.018
 Mr+0.018
Neg logit contributions
nes-0.026
oll-0.026
vel-0.025
ode-0.025
aces-0.024
Token looked
Token ID3114
Feature activation+3.33e-01

Pos logit contributions
 Ms+0.000
 while+0.000
 teach+0.000
 last+0.000
 Mrs+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
ust-0.001
Token up
Token ID510
Feature activation+1.94e+00

Pos logit contributions
 Ms+0.011
 while+0.010
 teach+0.009
 last+0.009
 visit+0.009
Neg logit contributions
vel-0.014
oll-0.013
nes-0.013
ust-0.013
aky-0.013
Token about
Token ID546
Feature activation-9.05e-01

Pos logit contributions
nes+0.032
ode+0.031
vel+0.031
oll+0.030
ust+0.030
Neg logit contributions
 Ms-0.013
 while-0.012
 Mrs-0.011
 Mr-0.011
 last-0.011
Token it
Token ID340
Feature activation+3.04e-01

Pos logit contributions
nes+0.042
oll+0.039
ode+0.038
vel+0.038
aces+0.037
Neg logit contributions
 Ms-0.028
 while-0.027
 teach-0.026
 Mrs-0.026
 Mr-0.025
Token until
Token ID1566
Feature activation-1.85e-01

Pos logit contributions
 Ms+0.008
 Mrs+0.006
 Mr+0.006
 while+0.006
 praise+0.006
Neg logit contributions
vel-0.015
nes-0.015
oll-0.015
ust-0.014
aces-0.014
Token she
Token ID673
Feature activation-3.37e-01

Pos logit contributions
nes+0.010
vel+0.009
oll+0.009
ode+0.009
aces+0.009
Neg logit contributions
 Ms-0.004
 while-0.004
 Mrs-0.004
 Mr-0.004
 teach-0.004
Token came
Token ID1625
Feature activation+6.87e-01

Pos logit contributions
nes+0.015
vel+0.014
oll+0.014
ode+0.014
aces+0.013
Neg logit contributions
 Ms-0.011
 while-0.010
 Mrs-0.010
 Mr-0.010
 last-0.010
Token up
Token ID510
Feature activation+2.76e+00

Pos logit contributions
 Ms+0.023
 Mrs+0.020
 Mr+0.020
glers+0.019
 while+0.019
Neg logit contributions
vel-0.028
nes-0.028
ust-0.026
ode-0.026
oll-0.026
Token with
Token ID351
Feature activation+6.13e-01

Pos logit contributions
glers+0.102
 Mitt+0.093
 Ms+0.091
mun+0.088
 allows+0.086
Neg logit contributions
vel-0.098
nes-0.091
ust-0.084
aky-0.082
per-0.078
Token a
Token ID257
Feature activation-1.29e-01

Pos logit contributions
 Ms+0.017
 while+0.016
glers+0.014
 Mrs+0.014
 Mr+0.014
Neg logit contributions
vel-0.027
nes-0.027
oll-0.026
aces-0.025
ode-0.025
Token great
Token ID1049
Feature activation-2.05e-01

Pos logit contributions
nes+0.006
vel+0.005
oll+0.005
ode+0.005
aces+0.005
Neg logit contributions
 Ms-0.004
 while-0.004
 Mrs-0.004
 Mr-0.003
 last-0.003
Token plan
Token ID1410
Feature activation+4.55e-02

Pos logit contributions
nes+0.009
vel+0.009
oll+0.008
ode+0.008
aces+0.008
Neg logit contributions
 Ms-0.007
 while-0.006
 Mrs-0.006
 Mr-0.006
 last-0.006
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 teach+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
aces-0.002
ode-0.002
Token his
Token ID465
Feature activation+1.58e-01

Pos logit contributions
nes+0.025
vel+0.024
oll+0.024
ode+0.024
ust+0.023
Neg logit contributions
 Ms-0.014
 while-0.013
 Mr-0.012
 Mrs-0.012
 Mitt-0.012
Token pet
Token ID4273
Feature activation+2.68e-01

Pos logit contributions
 Ms+0.005
 Mrs+0.005
 while+0.005
 teach+0.005
 Mr+0.005
Neg logit contributions
nes-0.006
oll-0.006
vel-0.006
ode-0.006
aces-0.006
Token mouse
Token ID10211
Feature activation+2.71e-02

Pos logit contributions
 Ms+0.008
 while+0.007
 Mrs+0.007
 last+0.007
 teach+0.007
Neg logit contributions
nes-0.012
oll-0.012
vel-0.011
ode-0.011
aces-0.011
Token,
Token ID11
Feature activation-3.12e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 last+0.001
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token S
Token ID311
Feature activation-6.17e-01

Pos logit contributions
nes+0.016
vel+0.015
oll+0.015
ode+0.015
aces+0.015
Neg logit contributions
 Ms-0.008
 while-0.007
 Mrs-0.007
 Mr-0.007
 teach-0.006
Tokenque
Token ID4188
Feature activation-3.43e+00

Pos logit contributions
nes+0.026
vel+0.025
oll+0.024
ode+0.024
aces+0.023
Neg logit contributions
 Ms-0.021
 while-0.019
 last-0.019
 Mrs-0.018
 Mr-0.018
Tokenaky
Token ID15492
Feature activation+8.40e-01

Pos logit contributions
nes+0.090
lor+0.078
aund+0.078
oll+0.078
ode+0.072
Neg logit contributions
 while-0.155
 Mitt-0.143
 tired-0.142
 when-0.141
 with-0.139
Token.
Token ID13
Feature activation+7.22e-02

Pos logit contributions
 Ms+0.022
 Mrs+0.018
 Mr+0.018
 while+0.017
 teach+0.017
Neg logit contributions
nes-0.041
vel-0.040
aces-0.039
oll-0.039
aky-0.038
Token
Token ID220
Feature activation-6.15e-01

Pos logit contributions
 Ms+0.002
 while+0.001
 last+0.001
 teach+0.001
 Mrs+0.001
Neg logit contributions
nes-0.004
oll-0.004
vel-0.004
ode-0.004
aces-0.004
Token
Token ID198
Feature activation+6.23e-02

Pos logit contributions
nes+0.035
vel+0.034
oll+0.034
ode+0.034
aces+0.033
Neg logit contributions
 Ms-0.011
 while-0.010
 Mrs-0.009
 Mr-0.009
 last-0.008
TokenOne
Token ID3198
Feature activation+2.94e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 teach+0.001
 Mr+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
ust-0.003
Token park
Token ID3952
Feature activation+1.74e+00

Pos logit contributions
nes+0.067
vel+0.066
aces+0.066
oll+0.064
onel+0.061
Neg logit contributions
 Ms-0.034
 Mr-0.030
 Mrs-0.030
 visit-0.030
 Mitt-0.030
Token when
Token ID618
Feature activation+6.93e-01

Pos logit contributions
 Ms+0.047
glers+0.042
 Mr+0.040
 Mitt+0.039
 praise+0.039
Neg logit contributions
adow-0.078
nes-0.077
vel-0.075
per-0.068
iece-0.068
Token she
Token ID673
Feature activation+4.44e-01

Pos logit contributions
 Ms+0.019
 teach+0.017
 praise+0.016
 while+0.016
 last+0.015
Neg logit contributions
nes-0.031
vel-0.030
oll-0.029
ode-0.028
ust-0.028
Token stumbled
Token ID24241
Feature activation+9.56e-01

Pos logit contributions
 Ms+0.013
 teach+0.012
 while+0.011
 Mrs+0.011
 praise+0.011
Neg logit contributions
nes-0.020
vel-0.019
oll-0.019
ode-0.019
aky-0.018
Token upon
Token ID2402
Feature activation-6.79e-02

Pos logit contributions
 Ms+0.032
 teach+0.028
 praise+0.028
 Mrs+0.028
 Mr+0.027
Neg logit contributions
nes-0.040
vel-0.039
oll-0.039
aky-0.038
aces-0.038
Token an
Token ID281
Feature activation-3.38e+00

Pos logit contributions
nes+0.004
vel+0.004
oll+0.003
ode+0.003
aces+0.003
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 teach-0.001
 Mr-0.001
Token old
Token ID1468
Feature activation-9.09e-01

Pos logit contributions
nes+0.196
lor+0.191
vel+0.190
ode+0.175
emic+0.175
Neg logit contributions
 last-0.065
 Mrs-0.063
 Mr-0.063
 Ms-0.062
 elderly-0.060
Token cob
Token ID22843
Feature activation-1.14e+00

Pos logit contributions
nes+0.044
vel+0.043
oll+0.042
ust+0.041
ode+0.041
Neg logit contributions
 Ms-0.025
 while-0.024
 Mrs-0.022
 Mr-0.022
 teach-0.022
Tokenweb
Token ID12384
Feature activation+1.76e-01

Pos logit contributions
nes+0.032
vel+0.031
ust+0.029
ode+0.029
oll+0.028
Neg logit contributions
 Ms-0.050
 while-0.050
 last-0.049
 teach-0.047
 before-0.047
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
 Ms+0.004
 Mrs+0.004
 while+0.004
 Mr+0.004
 teach+0.004
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
ode-0.008
aces-0.008
Token Mill
Token ID9212
Feature activation-1.19e+00

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.005
aces+0.005
Neg logit contributions
 Ms-0.003
 while-0.003
 Mrs-0.003
 last-0.003
 teach-0.003
Token a
Token ID257
Feature activation-5.09e-02

Pos logit contributions
nes+0.000
vel+0.000
oll+0.000
ode+0.000
ust+0.000
Neg logit contributions
 Ms-0.000
 while-0.000
 last-0.000
 teach-0.000
 Mrs-0.000
Token little
Token ID1310
Feature activation-2.31e-01

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ode+0.002
ust+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 last-0.001
 teach-0.001
Token mouse
Token ID10211
Feature activation+3.66e-02

Pos logit contributions
nes+0.010
vel+0.010
oll+0.010
ode+0.010
aces+0.010
Neg logit contributions
 Ms-0.007
 while-0.006
 Mrs-0.006
 Mr-0.006
 last-0.006
Token named
Token ID3706
Feature activation+3.30e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 last+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
aces-0.002
Token S
Token ID311
Feature activation-7.79e-01

Pos logit contributions
 Ms+0.008
 while+0.008
 last+0.007
 praise+0.007
 teach+0.007
Neg logit contributions
nes-0.016
vel-0.015
oll-0.015
ode-0.015
aces-0.015
Tokenque
Token ID4188
Feature activation-3.28e+00

Pos logit contributions
nes+0.034
vel+0.032
oll+0.031
ode+0.031
ust+0.030
Neg logit contributions
 Ms-0.025
 last-0.023
 while-0.023
 Mrs-0.023
 Mr-0.022
Tokenaky
Token ID15492
Feature activation+1.54e+00

Pos logit contributions
nes+0.087
lor+0.076
oll+0.074
ode+0.071
aund+0.066
Neg logit contributions
 while-0.144
 last-0.133
 Mitt-0.133
 Ms-0.133
 when-0.132
Token.
Token ID13
Feature activation+4.11e-01

Pos logit contributions
 Ms+0.047
 Mrs+0.040
 praise+0.039
Thank+0.038
 Mr+0.038
Neg logit contributions
aces-0.064
ode-0.062
nes-0.062
vel-0.062
aky-0.061
Token One
Token ID1881
Feature activation-6.73e-03

Pos logit contributions
 Ms+0.010
 while+0.009
 last+0.009
 teach+0.009
 praise+0.009
Neg logit contributions
nes-0.020
oll-0.019
vel-0.019
ode-0.019
aces-0.018
Token day
Token ID1110
Feature activation+1.28e+00

Pos logit contributions
nes+0.000
oll+0.000
vel+0.000
ode+0.000
ust+0.000
Neg logit contributions
 Ms-0.000
 while-0.000
 Mrs-0.000
 teach-0.000
 Mr-0.000
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
 Ms+0.036
 praise+0.034
 teach+0.034
 hire+0.031
 last+0.031
Neg logit contributions
oll-0.054
vel-0.053
nes-0.052
aces-0.050
aund-0.048
Token the
Token ID262
Feature activation-7.54e-01

Pos logit contributions
 Ms+0.008
 while+0.007
 teach+0.007
 Mrs+0.006
 Mitt+0.006
Neg logit contributions
nes-0.022
vel-0.021
oll-0.020
ode-0.020
aces-0.020
Token grass
Token ID8701
Feature activation+1.34e+00

Pos logit contributions
vel+0.036
nes+0.035
oll+0.035
aces+0.034
ode+0.033
Neg logit contributions
 Ms-0.022
 while-0.020
 last-0.020
 Mrs-0.019
 Mr-0.019
Token when
Token ID618
Feature activation+7.79e-01

Pos logit contributions
 Ms+0.038
 Mrs+0.032
 praise+0.032
 teach+0.032
 Mitt+0.031
Neg logit contributions
oll-0.060
vel-0.059
aces-0.059
adow-0.059
nes-0.059
Token he
Token ID339
Feature activation+8.28e-01

Pos logit contributions
 Ms+0.023
 teach+0.021
 praise+0.020
 Mitt+0.019
 while+0.019
Neg logit contributions
nes-0.033
oll-0.032
vel-0.032
ust-0.030
ode-0.030
Token heard
Token ID2982
Feature activation-3.33e-01

Pos logit contributions
 Ms+0.027
 teach+0.025
 praise+0.024
 Mrs+0.024
 Mr+0.023
Neg logit contributions
nes-0.034
vel-0.033
oll-0.032
ode-0.032
aky-0.031
Token an
Token ID281
Feature activation-3.25e+00

Pos logit contributions
nes+0.018
vel+0.018
oll+0.017
ode+0.017
aces+0.017
Neg logit contributions
 Ms-0.007
 while-0.006
 Mrs-0.006
 Mr-0.006
 last-0.006
Token odd
Token ID5629
Feature activation-1.08e+00

Pos logit contributions
nes+0.181
lor+0.177
vel+0.170
emic+0.160
ode+0.159
Neg logit contributions
 Ms-0.070
 Mrs-0.069
 teach-0.068
 Mr-0.067
 last-0.065
Token sound
Token ID2128
Feature activation-4.73e-02

Pos logit contributions
nes+0.062
oll+0.059
vel+0.058
ode+0.057
aces+0.057
Neg logit contributions
 while-0.021
 Ms-0.021
 teach-0.019
 last-0.018
 Mr-0.018
Token.
Token ID13
Feature activation-5.02e-01

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ode+0.002
aces+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 Mr-0.001
 teach-0.001
Token It
Token ID632
Feature activation+4.45e-01

Pos logit contributions
nes+0.022
vel+0.022
oll+0.021
ode+0.021
ust+0.020
Neg logit contributions
 Ms-0.016
 while-0.015
 Mrs-0.014
 Mr-0.014
 teach-0.014
Token sounded
Token ID14846
Feature activation+1.11e+00

Pos logit contributions
 Ms+0.013
 while+0.012
 Mrs+0.011
 last+0.011
 teach+0.011
Neg logit contributions
nes-0.019
vel-0.019
oll-0.018
ust-0.018
ode-0.018
Token.
Token ID13
Feature activation+2.40e-01

Pos logit contributions
nes+0.012
vel+0.012
oll+0.011
ode+0.011
aces+0.011
Neg logit contributions
 Ms-0.010
 while-0.009
 Mrs-0.009
 Mr-0.009
 last-0.009
Token
Token ID198
Feature activation+3.95e-01

Pos logit contributions
 Ms+0.006
 while+0.006
 last+0.005
 teach+0.005
 Mrs+0.005
Neg logit contributions
nes-0.012
oll-0.011
vel-0.011
ode-0.011
aces-0.011
Token
Token ID198
Feature activation+6.80e-01

Pos logit contributions
 Ms+0.010
 while+0.009
 teach+0.008
 last+0.008
 Mrs+0.008
Neg logit contributions
nes-0.020
vel-0.019
oll-0.019
ust-0.018
ode-0.018
TokenP
Token ID47
Feature activation-6.62e-01

Pos logit contributions
 Ms+0.024
 Mrs+0.022
 last+0.022
 praise+0.021
 Mr+0.021
Neg logit contributions
vel-0.027
nes-0.027
ust-0.025
oll-0.025
aky-0.025
Tokenips
Token ID2419
Feature activation-1.72e+00

Pos logit contributions
nes+0.022
vel+0.021
ode+0.020
oll+0.020
aces+0.019
Neg logit contributions
 Ms-0.027
 while-0.026
 Mr-0.025
 last-0.025
 Mitt-0.025
Tokenque
Token ID4188
Feature activation-3.25e+00

Pos logit contributions
vel+0.084
ode+0.084
nes+0.082
oll+0.081
ust+0.078
Neg logit contributions
 while-0.044
 Ms-0.043
 last-0.041
 Mr-0.039
 before-0.039
Tokenak
Token ID461
Feature activation+8.45e-01

Pos logit contributions
oll+0.073
nes+0.072
alo+0.067
aund+0.062
ode+0.060
Neg logit contributions
 while-0.144
 Ms-0.133
 last-0.133
 with-0.132
 when-0.131
Token started
Token ID2067
Feature activation+5.48e-01

Pos logit contributions
 Ms+0.027
 Mrs+0.023
 Mr+0.022
 teach+0.022
 while+0.022
Neg logit contributions
nes-0.037
vel-0.036
aces-0.035
oll-0.034
ode-0.034
Token to
Token ID284
Feature activation+3.24e-01

Pos logit contributions
 Ms+0.015
 last+0.013
 Mrs+0.013
 while+0.012
 teach+0.012
Neg logit contributions
vel-0.024
nes-0.024
ode-0.023
oll-0.023
aces-0.022
Token regret
Token ID13721
Feature activation-7.56e-01

Pos logit contributions
 Ms+0.010
 while+0.009
 Mrs+0.008
 last+0.008
 Mr+0.008
Neg logit contributions
nes-0.014
vel-0.014
ode-0.013
oll-0.013
ust-0.013
Token it
Token ID340
Feature activation+2.27e-01

Pos logit contributions
nes+0.038
oll+0.036
ode+0.036
vel+0.036
ust+0.035
Neg logit contributions
 Ms-0.020
 while-0.019
 Mr-0.017
 Mrs-0.017
 Mitt-0.017
Token a
Token ID257
Feature activation-6.82e-01

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.006
aces+0.006
Neg logit contributions
 Ms-0.003
 while-0.002
 Mrs-0.002
 Mr-0.002
 teach-0.002
Token loud
Token ID7812
Feature activation-1.22e+00

Pos logit contributions
nes+0.037
oll+0.036
vel+0.035
aces+0.035
ode+0.034
Neg logit contributions
 Ms-0.016
 while-0.015
 Mrs-0.013
 Mr-0.013
 teach-0.013
Token noise
Token ID7838
Feature activation+3.04e-01

Pos logit contributions
nes+0.068
oll+0.066
ode+0.064
vel+0.064
ust+0.064
Neg logit contributions
 Ms-0.027
 while-0.025
 Mr-0.024
 teach-0.024
 Mrs-0.023
Token coming
Token ID2406
Feature activation+3.56e-01

Pos logit contributions
 Ms+0.009
 Mrs+0.007
 Mr+0.007
 Mitt+0.007
 while+0.007
Neg logit contributions
nes-0.014
vel-0.014
oll-0.013
ode-0.013
aces-0.013
Token from
Token ID422
Feature activation+4.37e-01

Pos logit contributions
 Ms+0.010
 while+0.008
 teach+0.008
 Mrs+0.008
 Mr+0.008
Neg logit contributions
nes-0.017
vel-0.016
oll-0.016
ust-0.016
ode-0.015
Token an
Token ID281
Feature activation-3.22e+00

Pos logit contributions
 Ms+0.011
 while+0.009
 teach+0.009
 Mrs+0.009
 Mr+0.009
Neg logit contributions
nes-0.021
vel-0.020
oll-0.020
ode-0.020
aces-0.020
Token ig
Token ID45329
Feature activation-7.52e-01

Pos logit contributions
vel+0.188
nes+0.184
lor+0.175
ust+0.173
oll+0.170
Neg logit contributions
 last-0.074
 Mrs-0.068
 Ms-0.068
 Mr-0.066
 the-0.065
Tokenloo
Token ID29680
Feature activation+7.59e-01

Pos logit contributions
nes+0.023
vel+0.022
oll+0.021
aces+0.020
ust+0.020
Neg logit contributions
 Ms-0.033
 while-0.032
 last-0.032
 Mr-0.030
 Mrs-0.030
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
 Ms+0.022
 Mrs+0.018
 Mitt+0.018
 Mr+0.017
 while+0.017
Neg logit contributions
nes-0.035
vel-0.034
oll-0.034
ode-0.033
aces-0.032
Token
Token ID198
Feature activation-2.41e-01

Pos logit contributions
nes+0.065
vel+0.063
ode+0.063
oll+0.063
ust+0.060
Neg logit contributions
 Ms-0.027
 while-0.025
 Mr-0.023
 Mrs-0.023
 visit-0.021
Token
Token ID198
Feature activation-6.73e-01

Pos logit contributions
nes+0.013
vel+0.013
oll+0.013
ode+0.013
aces+0.012
Neg logit contributions
 Ms-0.005
 while-0.004
 Mrs-0.004
 Mr-0.004
 last-0.004
Token Lily
Token ID20037
Feature activation+3.67e-03

Pos logit contributions
 Ms+0.021
 while+0.019
 last+0.019
 teach+0.019
 visit+0.018
Neg logit contributions
nes-0.027
oll-0.025
vel-0.025
aces-0.024
aky-0.023
Token.
Token ID13
Feature activation+2.13e-01

Pos logit contributions
 Ms+0.000
 while+0.000
 Mrs+0.000
 last+0.000
 Mr+0.000
Neg logit contributions
nes-0.000
vel-0.000
oll-0.000
ode-0.000
aces-0.000
Token "
Token ID366
Feature activation-1.33e+00

Pos logit contributions
 Ms+0.005
 last+0.005
 while+0.005
 teach+0.005
 Mrs+0.005
Neg logit contributions
nes-0.010
vel-0.010
oll-0.009
aces-0.009
ode-0.009
TokenIt
Token ID1026
Feature activation-3.61e-01

Pos logit contributions
oll+0.063
nes+0.062
aces+0.061
vel+0.061
ode+0.060
Neg logit contributions
 while-0.035
 Ms-0.035
 Mrs-0.030
Thank-0.030
 Mitt-0.030
Token's
Token ID338
Feature activation-5.28e-01

Pos logit contributions
nes+0.015
vel+0.014
oll+0.014
ode+0.014
aces+0.014
Neg logit contributions
 Ms-0.012
 while-0.011
 Mrs-0.011
 Mr-0.011
 last-0.011
Token an
Token ID281
Feature activation-3.20e+00

Pos logit contributions
nes+0.028
vel+0.027
oll+0.027
ode+0.026
aces+0.026
Neg logit contributions
 Ms-0.012
 while-0.011
 Mr-0.011
 Mrs-0.011
 last-0.010
Token underground
Token ID11447
Feature activation-7.51e-01

Pos logit contributions
nes+0.164
vel+0.147
emic+0.146
lor+0.143
ode+0.142
Neg logit contributions
 last-0.088
 Mr-0.079
 work-0.078
 while-0.078
 when-0.075
Token tunnel
Token ID13275
Feature activation+4.15e-01

Pos logit contributions
nes+0.030
vel+0.029
oll+0.029
aces+0.027
ode+0.027
Neg logit contributions
 while-0.027
 Ms-0.027
 last-0.025
 teach-0.025
 Mr-0.024
Token,"
Token ID553
Feature activation-2.00e-01

Pos logit contributions
 Ms+0.012
 Mrs+0.010
 Mitt+0.010
 Mr+0.010
 while+0.010
Neg logit contributions
nes-0.020
vel-0.019
oll-0.018
aces-0.018
ode-0.018
Token said
Token ID531
Feature activation+1.22e-02

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.010
aces+0.010
Neg logit contributions
 Ms-0.004
 while-0.004
 Mrs-0.003
 Mr-0.003
 last-0.003
Token her
Token ID607
Feature activation-6.46e-01

Pos logit contributions
 Ms+0.000
 while+0.000
 Mrs+0.000
 last+0.000
 Mr+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token G
Token ID402
Feature activation-9.90e-01

Pos logit contributions
nes+0.032
vel+0.032
oll+0.031
ode+0.031
aces+0.030
Neg logit contributions
 Ms-0.015
 while-0.014
 Mrs-0.013
 Mr-0.013
 teach-0.013
Tokeniz
Token ID528
Feature activation-6.32e-01

Pos logit contributions
nes+0.035
vel+0.034
oll+0.033
ode+0.032
ust+0.031
Neg logit contributions
 Ms-0.037
 last-0.035
 while-0.034
 teach-0.033
 Mrs-0.033
Tokenmo
Token ID5908
Feature activation-1.86e-01

Pos logit contributions
nes+0.031
vel+0.030
oll+0.030
ode+0.029
ust+0.029
Neg logit contributions
 Ms-0.016
 while-0.015
 last-0.014
 teach-0.014
 Mrs-0.014
Token and
Token ID290
Feature activation-4.44e-01

Pos logit contributions
nes+0.009
vel+0.009
oll+0.009
ode+0.009
aces+0.009
Neg logit contributions
 Ms-0.005
 while-0.004
 Mrs-0.004
 Mr-0.004
 teach-0.004
Token S
Token ID311
Feature activation-1.09e+00

Pos logit contributions
nes+0.024
vel+0.023
oll+0.023
ode+0.023
ust+0.022
Neg logit contributions
 Ms-0.010
 while-0.009
 Mrs-0.009
 Mr-0.008
 teach-0.008
Tokenque
Token ID4188
Feature activation-3.19e+00

Pos logit contributions
nes+0.043
vel+0.041
oll+0.039
ode+0.037
ust+0.036
Neg logit contributions
 Ms-0.039
 last-0.038
 teach-0.036
 while-0.036
 Mrs-0.035
Tokeneker
Token ID28233
Feature activation+7.43e-01

Pos logit contributions
lor+0.079
nes+0.078
oll+0.075
alo+0.070
aund+0.066
Neg logit contributions
 while-0.142
 last-0.133
 work-0.132
 Ms-0.129
 every-0.128
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
 Ms+0.019
 Mrs+0.016
 Mr+0.015
 Mitt+0.015
 while+0.014
Neg logit contributions
nes-0.037
aces-0.036
vel-0.036
oll-0.035
ode-0.035
Token were
Token ID547
Feature activation+6.03e-01

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.005
aces+0.005
Neg logit contributions
 Ms-0.004
 while-0.003
 Mrs-0.003
 teach-0.003
 last-0.003
Token both
Token ID1111
Feature activation+2.55e-01

Pos logit contributions
 Ms+0.018
 while+0.017
 last+0.016
 Mrs+0.016
 praise+0.016
Neg logit contributions
nes-0.026
vel-0.025
oll-0.024
aces-0.024
aky-0.023
Token small
Token ID1402
Feature activation+2.95e-01

Pos logit contributions
 Ms+0.009
 while+0.008
 Mrs+0.008
 Mr+0.007
 teach+0.007
Neg logit contributions
nes-0.011
vel-0.010
oll-0.010
aces-0.010
ode-0.010
Token one
Token ID530
Feature activation-1.30e-01

Pos logit contributions
 Ms+0.002
 Mrs+0.002
 while+0.002
 teach+0.002
 Mr+0.002
Neg logit contributions
nes-0.007
vel-0.007
oll-0.007
ode-0.007
aces-0.007
Token day
Token ID1110
Feature activation+5.90e-01

Pos logit contributions
nes+0.007
vel+0.006
oll+0.006
ode+0.006
ust+0.006
Neg logit contributions
 Ms-0.003
 while-0.003
 Mrs-0.003
 Mr-0.003
 teach-0.003
Token he
Token ID339
Feature activation-4.23e-01

Pos logit contributions
 Ms+0.015
 teach+0.013
 praise+0.012
 Mrs+0.012
 Mitt+0.011
Neg logit contributions
oll-0.028
nes-0.028
vel-0.028
aces-0.026
ust-0.026
Token stumbled
Token ID24241
Feature activation+6.10e-01

Pos logit contributions
nes+0.019
vel+0.018
oll+0.018
ode+0.018
aces+0.017
Neg logit contributions
 Ms-0.013
 while-0.012
 last-0.011
 Mrs-0.011
 Mr-0.011
Token across
Token ID1973
Feature activation+1.92e-01

Pos logit contributions
 Ms+0.019
 teach+0.017
 Mrs+0.016
 praise+0.016
 Mr+0.016
Neg logit contributions
nes-0.027
vel-0.027
oll-0.026
aces-0.025
aky-0.025
Token an
Token ID281
Feature activation-3.17e+00

Pos logit contributions
 Ms+0.005
 while+0.004
 teach+0.004
 Mrs+0.004
 Mr+0.004
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
ode-0.009
aces-0.009
Token amazing
Token ID4998
Feature activation-8.88e-01

Pos logit contributions
nes+0.192
vel+0.188
lor+0.187
emic+0.179
ode+0.173
Neg logit contributions
 last-0.057
 Ms-0.045
 Mr-0.044
 Mrs-0.044
 while-0.043
Token pattern
Token ID3912
Feature activation-2.00e-01

Pos logit contributions
nes+0.043
oll+0.042
vel+0.040
aces+0.039
ust+0.039
Neg logit contributions
 while-0.025
 Ms-0.025
 last-0.023
 teach-0.022
 visit-0.022
Token!
Token ID0
Feature activation-7.39e-01

Pos logit contributions
nes+0.010
vel+0.010
oll+0.010
ode+0.010
aces+0.010
Neg logit contributions
 Ms-0.005
 while-0.004
 Mrs-0.004
 Mr-0.004
 teach-0.004
Token It
Token ID632
Feature activation-1.12e-01

Pos logit contributions
nes+0.038
vel+0.037
ode+0.036
oll+0.035
ust+0.034
Neg logit contributions
 Ms-0.020
 Mr-0.018
 while-0.017
 Mrs-0.017
 Mitt-0.016
Token was
Token ID373
Feature activation+9.16e-01

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.006
aces+0.006
Neg logit contributions
 Ms-0.002
 while-0.002
 Mrs-0.002
 Mr-0.002
 teach-0.002
Token big
Token ID1263
Feature activation-5.71e-01

Pos logit contributions
nes+0.037
vel+0.035
oll+0.035
aces+0.035
ode+0.034
Neg logit contributions
 Ms-0.018
 while-0.017
 Mr-0.016
 teach-0.016
 Mrs-0.016
Token step
Token ID2239
Feature activation-1.78e-01

Pos logit contributions
nes+0.024
vel+0.022
oll+0.022
aces+0.021
ode+0.021
Neg logit contributions
 Ms-0.020
 while-0.019
 Mr-0.018
 teach-0.018
 Mrs-0.018
Token and
Token ID290
Feature activation+4.43e-01

Pos logit contributions
nes+0.009
vel+0.009
oll+0.009
ode+0.008
aces+0.008
Neg logit contributions
 Ms-0.004
 while-0.004
 Mrs-0.004
 Mr-0.004
 teach-0.004
Token go
Token ID467
Feature activation+7.45e-01

Pos logit contributions
 Ms+0.011
 while+0.009
 Mrs+0.009
 Mr+0.009
 last+0.009
Neg logit contributions
nes-0.022
vel-0.020
ode-0.020
oll-0.020
aces-0.020
Token on
Token ID319
Feature activation+2.94e-02

Pos logit contributions
 Ms+0.027
 Mrs+0.024
 Mr+0.023
 praise+0.022
 Mitt+0.021
Neg logit contributions
nes-0.029
ode-0.026
vel-0.026
aky-0.025
aces-0.025
Token an
Token ID281
Feature activation-3.17e+00

Pos logit contributions
 Ms+0.001
 while+0.001
 teach+0.001
 Mrs+0.001
 Mr+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.001
ode-0.001
aces-0.001
Token adventure
Token ID8855
Feature activation+3.79e-01

Pos logit contributions
nes+0.164
vel+0.162
emic+0.153
onel+0.151
lor+0.151
Neg logit contributions
 last-0.079
 teach-0.072
 visit-0.069
 every-0.068
 while-0.067
Token.
Token ID13
Feature activation+7.50e-01

Pos logit contributions
 Ms+0.010
 Mrs+0.008
 Mr+0.008
 Mitt+0.008
 praise+0.008
Neg logit contributions
nes-0.019
vel-0.018
oll-0.017
aces-0.017
ode-0.017
Token They
Token ID1119
Feature activation-8.53e-01

Pos logit contributions
 Ms+0.020
 last+0.019
 teach+0.019
 praise+0.018
 while+0.018
Neg logit contributions
nes-0.033
vel-0.030
oll-0.030
aces-0.029
ode-0.029
Token walked
Token ID6807
Feature activation+6.73e-01

Pos logit contributions
vel+0.043
nes+0.042
oll+0.042
ode+0.041
aces+0.040
Neg logit contributions
 Ms-0.022
 while-0.021
 Mr-0.019
 last-0.019
 Mrs-0.019
Token far
Token ID1290
Feature activation+1.86e-01

Pos logit contributions
 Ms+0.021
 teach+0.019
 Mrs+0.019
 Mr+0.018
 Mitt+0.018
Neg logit contributions
nes-0.029
aces-0.025
vel-0.025
aky-0.025
oll-0.025
Token by
Token ID416
Feature activation-5.75e-01

Pos logit contributions
 Ms+0.023
glers+0.020
 Mrs+0.019
 Mitt+0.019
 praise+0.019
Neg logit contributions
vel-0.036
nes-0.036
aky-0.033
ode-0.033
aces-0.032
Token himself
Token ID2241
Feature activation+5.19e-01

Pos logit contributions
nes+0.034
oll+0.033
vel+0.033
ode+0.032
ust+0.031
Neg logit contributions
 Ms-0.010
 while-0.009
 Mr-0.008
 last-0.008
 Mrs-0.008
Token when
Token ID618
Feature activation+6.93e-01

Pos logit contributions
 Ms+0.014
 Mrs+0.012
 praise+0.011
 Mitt+0.011
 teach+0.011
Neg logit contributions
vel-0.025
nes-0.025
oll-0.024
ode-0.024
ust-0.024
Token he
Token ID339
Feature activation+9.89e-01

Pos logit contributions
 Ms+0.020
 teach+0.018
 praise+0.017
 while+0.016
 Mrs+0.016
Neg logit contributions
nes-0.030
vel-0.029
oll-0.029
ode-0.028
aces-0.027
Token heard
Token ID2982
Feature activation-5.78e-01

Pos logit contributions
 Ms+0.034
 praise+0.031
 teach+0.031
 Mrs+0.030
 while+0.029
Neg logit contributions
nes-0.039
oll-0.037
vel-0.036
aky-0.036
ode-0.035
Token an
Token ID281
Feature activation-3.15e+00

Pos logit contributions
nes+0.033
vel+0.031
oll+0.031
ode+0.031
aces+0.030
Neg logit contributions
 Ms-0.012
 while-0.010
 Mrs-0.010
 Mr-0.010
 teach-0.009
Token enormous
Token ID9812
Feature activation-8.54e-01

Pos logit contributions
nes+0.171
lor+0.168
vel+0.163
ode+0.156
emic+0.153
Neg logit contributions
 teach-0.069
 Ms-0.068
 Mr-0.066
 Mrs-0.066
 last-0.063
Token noise
Token ID7838
Feature activation+3.45e-01

Pos logit contributions
nes+0.043
oll+0.040
vel+0.040
aces+0.039
ust+0.039
Neg logit contributions
 Ms-0.024
 while-0.022
 teach-0.021
 Mr-0.021
 Mrs-0.021
Token.
Token ID13
Feature activation-5.68e-01

Pos logit contributions
 Ms+0.009
 Mrs+0.007
 Mitt+0.007
 while+0.007
 Mr+0.007
Neg logit contributions
nes-0.017
vel-0.017
oll-0.016
ode-0.016
ust-0.015
Token He
Token ID679
Feature activation-1.18e+00

Pos logit contributions
nes+0.025
vel+0.024
oll+0.024
ode+0.024
ust+0.023
Neg logit contributions
 Ms-0.019
 while-0.017
 Mr-0.017
 Mrs-0.017
 Mitt-0.016
Token looked
Token ID3114
Feature activation+2.03e-01

Pos logit contributions
nes+0.065
vel+0.064
oll+0.063
aces+0.062
ode+0.061
Neg logit contributions
 Ms-0.025
 while-0.025
 Mr-0.022
 Mrs-0.021
 last-0.021
Token walking
Token ID6155
Feature activation+5.89e-01

Pos logit contributions
nes+0.018
vel+0.017
oll+0.017
ode+0.017
ust+0.016
Neg logit contributions
 Ms-0.010
 while-0.009
 Mrs-0.009
 Mr-0.009
 last-0.009
Token around
Token ID1088
Feature activation+9.60e-01

Pos logit contributions
 Ms+0.023
 Mrs+0.020
 teach+0.020
 Mr+0.020
 praise+0.020
Neg logit contributions
nes-0.021
vel-0.020
aky-0.019
oll-0.019
aces-0.019
Token when
Token ID618
Feature activation+9.86e-01

Pos logit contributions
 Ms+0.022
glers+0.020
 teach+0.019
 praise+0.018
 kiss+0.018
Neg logit contributions
nes-0.045
vel-0.044
aky-0.041
adow-0.040
ode-0.040
Token he
Token ID339
Feature activation+9.39e-01

Pos logit contributions
 Ms+0.029
 teach+0.029
 praise+0.027
 kiss+0.025
 Mrs+0.024
Neg logit contributions
nes-0.041
oll-0.038
vel-0.038
aces-0.036
ode-0.036
Token heard
Token ID2982
Feature activation-6.36e-01

Pos logit contributions
 Ms+0.032
 teach+0.029
 praise+0.029
 Mrs+0.028
 Mr+0.027
Neg logit contributions
nes-0.037
vel-0.036
oll-0.035
ode-0.035
aky-0.035
Token an
Token ID281
Feature activation-3.13e+00

Pos logit contributions
nes+0.037
vel+0.035
ode+0.035
oll+0.035
ust+0.034
Neg logit contributions
 Ms-0.012
 while-0.011
 Mrs-0.010
 Mr-0.010
 last-0.009
Token odd
Token ID5629
Feature activation-1.02e+00

Pos logit contributions
nes+0.165
lor+0.161
vel+0.157
ode+0.149
emic+0.148
Neg logit contributions
 Ms-0.074
 teach-0.072
 Mrs-0.071
 Mr-0.069
 last-0.069
Token sound
Token ID2128
Feature activation+7.72e-02

Pos logit contributions
nes+0.058
oll+0.054
vel+0.054
ode+0.053
aces+0.053
Neg logit contributions
 Ms-0.021
 while-0.020
 teach-0.018
 Mrs-0.018
 Mr-0.018
Token.
Token ID13
Feature activation-8.81e-01

Pos logit contributions
 Ms+0.002
 while+0.002
 Mrs+0.002
 Mr+0.002
 Mitt+0.001
Neg logit contributions
nes-0.004
vel-0.004
oll-0.004
ode-0.004
aces-0.004
Token It
Token ID632
Feature activation+5.19e-01

Pos logit contributions
nes+0.043
vel+0.041
ode+0.040
oll+0.040
ust+0.039
Neg logit contributions
 Ms-0.026
 while-0.024
 Mrs-0.024
 Mr-0.024
 Mitt-0.022
Token was
Token ID373
Feature activation+5.22e-01

Pos logit contributions
 Ms+0.014
 while+0.013
 last+0.012
 teach+0.012
 Mrs+0.012
Neg logit contributions
nes-0.023
vel-0.023
oll-0.022
ust-0.022
ode-0.021
Token.
Token ID13
Feature activation+6.51e-01

Pos logit contributions
 Ms+0.025
 Mrs+0.021
 praise+0.020
 teach+0.020
 last+0.020
Neg logit contributions
nes-0.043
vel-0.042
oll-0.042
aces-0.042
ode-0.041
Token One
Token ID1881
Feature activation-1.09e-01

Pos logit contributions
 Ms+0.019
 while+0.017
 last+0.016
 Mrs+0.016
 teach+0.016
Neg logit contributions
nes-0.028
oll-0.028
vel-0.027
aces-0.027
ode-0.027
Token day
Token ID1110
Feature activation+1.32e+00

Pos logit contributions
nes+0.006
vel+0.005
oll+0.005
ode+0.005
ust+0.005
Neg logit contributions
 Ms-0.003
 while-0.002
 Mrs-0.002
 Mr-0.002
 teach-0.002
Token P
Token ID350
Feature activation-5.90e-01

Pos logit contributions
 Ms+0.040
 praise+0.034
 Mrs+0.034
 last+0.033
 teach+0.032
Neg logit contributions
vel-0.057
oll-0.056
aces-0.052
nes-0.052
aund-0.050
Tokenips
Token ID2419
Feature activation-8.40e-01

Pos logit contributions
nes+0.023
vel+0.022
oll+0.022
ode+0.022
aces+0.021
Neg logit contributions
 Ms-0.021
 while-0.019
 last-0.019
 teach-0.018
 Mr-0.018
Tokenque
Token ID4188
Feature activation-3.13e+00

Pos logit contributions
nes+0.042
vel+0.042
ode+0.041
oll+0.041
ust+0.039
Neg logit contributions
 Ms-0.020
 while-0.019
 Mr-0.018
 last-0.018
 Mrs-0.017
Tokenak
Token ID461
Feature activation+8.28e-01

Pos logit contributions
oll+0.075
nes+0.073
alo+0.069
ode+0.066
aund+0.066
Neg logit contributions
 while-0.133
s-0.133
 Ms-0.126
 with-0.125
 upon-0.125
Token was
Token ID373
Feature activation+5.93e-02

Pos logit contributions
 Ms+0.027
 Mrs+0.023
 while+0.023
 teach+0.023
 praise+0.023
Neg logit contributions
nes-0.035
vel-0.033
oll-0.033
aces-0.032
ode-0.032
Token playing
Token ID2712
Feature activation+9.30e-02

Pos logit contributions
 Ms+0.002
 while+0.002
 Mrs+0.002
 teach+0.002
 last+0.002
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.002
aces-0.002
Token with
Token ID351
Feature activation-1.24e-01

Pos logit contributions
 Ms+0.002
 Mrs+0.002
 Mr+0.002
 while+0.002
 last+0.002
Neg logit contributions
nes-0.005
vel-0.005
oll-0.004
ode-0.004
aces-0.004
Token its
Token ID663
Feature activation+7.82e-01

Pos logit contributions
nes+0.006
vel+0.006
oll+0.005
ode+0.005
aces+0.005
Neg logit contributions
 Ms-0.004
 while-0.003
 Mrs-0.003
 last-0.003
 Mr-0.003
Token her
Token ID607
Feature activation-2.60e-01

Pos logit contributions
 Ms+0.002
 while+0.002
 teach+0.002
 Mrs+0.002
 last+0.002
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.005
aces-0.005
Token shoe
Token ID17292
Feature activation-4.34e-01

Pos logit contributions
nes+0.009
vel+0.008
oll+0.008
ode+0.008
aces+0.008
Neg logit contributions
 Ms-0.011
 while-0.010
 Mrs-0.010
 Mr-0.010
 last-0.010
Token was
Token ID373
Feature activation-2.64e-01

Pos logit contributions
nes+0.022
vel+0.022
oll+0.022
ode+0.021
ust+0.021
Neg logit contributions
 Ms-0.010
 while-0.009
 Mrs-0.008
 Mr-0.008
 last-0.008
Token stuck
Token ID7819
Feature activation+1.44e-02

Pos logit contributions
nes+0.011
vel+0.010
oll+0.010
ode+0.010
aces+0.010
Neg logit contributions
 Ms-0.009
 while-0.008
 Mrs-0.008
 Mr-0.008
 last-0.008
Token in
Token ID287
Feature activation+4.22e-01

Pos logit contributions
 Ms+0.000
 while+0.000
 Mrs+0.000
 teach+0.000
 Mr+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token an
Token ID281
Feature activation-3.13e+00

Pos logit contributions
 Ms+0.009
 while+0.008
 teach+0.008
 Mrs+0.007
 Mr+0.007
Neg logit contributions
vel-0.022
nes-0.021
oll-0.021
ode-0.020
aces-0.020
Token icy
Token ID30284
Feature activation-1.74e+00

Pos logit contributions
nes+0.169
ust+0.154
emic+0.153
vel+0.151
lor+0.149
Neg logit contributions
 last-0.080
 front-0.071
 Ms-0.068
 Every-0.068
 while-0.067
Token crack
Token ID8469
Feature activation-6.65e-01

Pos logit contributions
nes+0.078
vel+0.076
ust+0.073
oll+0.073
onel+0.071
Neg logit contributions
 while-0.058
 last-0.054
 Ms-0.054
 Mitt-0.053
 Mr-0.051
Token in
Token ID287
Feature activation+1.20e-02

Pos logit contributions
nes+0.037
vel+0.036
oll+0.035
ode+0.035
ust+0.034
Neg logit contributions
 Ms-0.012
 while-0.012
 last-0.011
 Mrs-0.010
 Mr-0.010
Token the
Token ID262
Feature activation-1.13e+00

Pos logit contributions
 Ms+0.000
 while+0.000
 teach+0.000
 Mrs+0.000
 Mr+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token sidewalk
Token ID24058
Feature activation+1.58e-01

Pos logit contributions
nes+0.049
vel+0.047
ust+0.045
oll+0.044
ode+0.042
Neg logit contributions
 Ms-0.038
 last-0.037
 while-0.036
 Mr-0.035
 Mitt-0.035
Token too
Token ID1165
Feature activation+7.86e-01

Pos logit contributions
 Ms+0.035
 praise+0.031
 teach+0.031
 Mrs+0.030
 kiss+0.030
Neg logit contributions
nes-0.044
vel-0.039
aces-0.037
ust-0.036
oll-0.036
Token fast
Token ID3049
Feature activation-1.42e-01

Pos logit contributions
 Ms+0.022
 teach+0.018
 Mrs+0.018
 Mr+0.017
 praise+0.017
Neg logit contributions
nes-0.038
vel-0.037
oll-0.036
ust-0.035
aces-0.035
Token and
Token ID290
Feature activation+1.82e+00

Pos logit contributions
nes+0.008
vel+0.007
oll+0.007
ode+0.007
aces+0.007
Neg logit contributions
 Ms-0.003
 while-0.003
 Mrs-0.003
 Mr-0.003
 teach-0.002
Token bumped
Token ID38034
Feature activation-1.44e-01

Pos logit contributions
 Ms+0.061
 particular+0.057
 praise+0.053
 Mrs+0.052
 hire+0.052
Neg logit contributions
nes-0.071
aces-0.058
onel-0.053
ode-0.053
vel-0.053
Token into
Token ID656
Feature activation-1.76e-02

Pos logit contributions
nes+0.007
vel+0.006
oll+0.006
ode+0.006
aces+0.006
Neg logit contributions
 Ms-0.004
 while-0.004
 Mrs-0.004
 teach-0.004
 last-0.004
Token an
Token ID281
Feature activation-3.12e+00

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
aces+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 last-0.000
 teach-0.000
 Mrs-0.000
Token old
Token ID1468
Feature activation-5.32e-01

Pos logit contributions
nes+0.176
vel+0.170
lor+0.166
emic+0.162
ode+0.161
Neg logit contributions
 last-0.065
 Ms-0.062
 Mr-0.061
 Mrs-0.060
 elderly-0.056
Token man
Token ID582
Feature activation-2.85e-01

Pos logit contributions
nes+0.023
vel+0.022
oll+0.022
ode+0.021
ust+0.021
Neg logit contributions
 Ms-0.017
 while-0.017
 Mrs-0.016
 Mr-0.016
 teach-0.015
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
nes+0.015
vel+0.014
oll+0.014
ode+0.014
aces+0.014
Neg logit contributions
 Ms-0.007
 while-0.006
 Mrs-0.006
 Mr-0.006
 teach-0.006
Token
Token ID198
Feature activation+4.65e-01

Pos logit contributions
 last+0.029
 teach+0.027
 Ms+0.027
 while+0.026
 praise+0.026
Neg logit contributions
nes-0.043
oll-0.038
vel-0.037
anna-0.037
aces-0.036
Token
Token ID198
Feature activation+6.06e-01

Pos logit contributions
 Ms+0.011
 while+0.010
 teach+0.010
 last+0.010
 Mrs+0.009
Neg logit contributions
nes-0.023
vel-0.022
oll-0.022
ust-0.021
aky-0.021
Token,
Token ID11
Feature activation+9.93e-01

Pos logit contributions
nes+0.027
vel+0.026
oll+0.026
ode+0.026
ust+0.025
Neg logit contributions
 Ms-0.007
 while-0.006
 Mrs-0.005
 Mr-0.005
 last-0.005
Token Liz
Token ID28313
Feature activation-6.35e-01

Pos logit contributions
 Ms+0.029
 teach+0.024
 praise+0.024
 Mrs+0.023
 Mr+0.023
Neg logit contributions
nes-0.042
vel-0.041
oll-0.040
ust-0.040
ode-0.038
Token found
Token ID1043
Feature activation-7.68e-02

Pos logit contributions
nes+0.029
vel+0.029
oll+0.029
ode+0.028
aces+0.027
Neg logit contributions
 Ms-0.018
 while-0.017
 Mrs-0.016
 last-0.016
 Mr-0.016
Token him
Token ID683
Feature activation-1.75e-01

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 Mrs-0.001
 teach-0.001
 Mr-0.001
Token in
Token ID287
Feature activation+6.22e-01

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.007
aces+0.007
Neg logit contributions
 Ms-0.006
 while-0.005
 Mrs-0.005
 Mr-0.005
 teach-0.005
Token an
Token ID281
Feature activation-3.11e+00

Pos logit contributions
 Ms+0.014
 while+0.013
 teach+0.012
glers+0.011
 last+0.011
Neg logit contributions
nes-0.030
vel-0.029
oll-0.029
aces-0.028
ode-0.028
Token old
Token ID1468
Feature activation-1.54e+00

Pos logit contributions
nes+0.183
vel+0.173
emic+0.168
ust+0.167
oll+0.162
Neg logit contributions
 last-0.065
 front-0.057
 visit-0.056
 Ms-0.056
 Mrs-0.055
Token dust
Token ID8977
Feature activation-6.46e-01

Pos logit contributions
oll+0.068
nes+0.066
vel+0.065
ust+0.064
aces+0.062
Neg logit contributions
 while-0.052
 Ms-0.051
 visit-0.047
 Mrs-0.046
 Mr-0.046
Tokenbin
Token ID8800
Feature activation-7.75e-01

Pos logit contributions
nes+0.031
vel+0.030
oll+0.030
ode+0.029
ust+0.029
Neg logit contributions
 Ms-0.017
 while-0.016
 last-0.015
 Mrs-0.015
 teach-0.015
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
nes+0.044
oll+0.043
vel+0.043
ode+0.042
ust+0.042
Neg logit contributions
 Ms-0.013
 while-0.013
 last-0.011
 teach-0.011
 Mrs-0.011
Token Joe
Token ID5689
Feature activation+1.21e-01

Pos logit contributions
 Ms+0.005
 while+0.004
 last+0.004
 teach+0.004
 Mrs+0.004
Neg logit contributions
nes-0.006
vel-0.006
oll-0.006
aces-0.006
ode-0.006
Token play
Token ID711
Feature activation-2.48e-01

Pos logit contributions
 Ms+0.007
 while+0.006
 Mrs+0.005
 last+0.005
 Mr+0.005
Neg logit contributions
nes-0.015
ode-0.014
vel-0.014
oll-0.014
ust-0.014
Token?"
Token ID1701
Feature activation+9.58e-02

Pos logit contributions
nes+0.014
vel+0.014
oll+0.014
ode+0.013
aces+0.013
Neg logit contributions
 Ms-0.005
 while-0.004
 Mrs-0.004
 Mr-0.004
 teach-0.004
Token
Token ID198
Feature activation+8.55e-01

Pos logit contributions
 Ms+0.002
 while+0.002
 last+0.002
 teach+0.002
 Mrs+0.002
Neg logit contributions
nes-0.005
oll-0.005
vel-0.005
ode-0.005
aces-0.005
Token
Token ID198
Feature activation+5.62e-01

Pos logit contributions
 Ms+0.024
 last+0.022
 while+0.022
 praise+0.021
 teach+0.021
Neg logit contributions
nes-0.039
oll-0.036
vel-0.036
aky-0.034
ust-0.033
TokenS
Token ID50
Feature activation+3.51e-01

Pos logit contributions
 Ms+0.019
 last+0.018
 praise+0.017
 Mrs+0.017
 while+0.017
Neg logit contributions
nes-0.023
vel-0.022
oll-0.021
ode-0.021
aky-0.020
Tokenque
Token ID4188
Feature activation-3.11e+00

Pos logit contributions
 Ms+0.013
 while+0.012
 Mrs+0.012
 teach+0.012
 Mr+0.011
Neg logit contributions
nes-0.013
oll-0.013
vel-0.013
ode-0.013
ust-0.012
Tokenaky
Token ID15492
Feature activation-4.50e-01

Pos logit contributions
nes+0.078
ode+0.068
oll+0.060
alo+0.057
lor+0.056
Neg logit contributions
 while-0.135
 Ms-0.134
 every-0.128
 last-0.127
Every-0.126
Token replied
Token ID8712
Feature activation+2.76e-01

Pos logit contributions
nes+0.024
vel+0.023
oll+0.023
ode+0.023
aces+0.022
Neg logit contributions
 Ms-0.010
 while-0.009
 Mrs-0.008
 Mr-0.008
 last-0.008
Token,
Token ID11
Feature activation+5.71e-01

Pos logit contributions
 Ms+0.006
 last+0.005
 Mrs+0.005
 teach+0.005
 praise+0.005
Neg logit contributions
vel-0.014
nes-0.014
oll-0.014
aces-0.013
ode-0.013
Token "
Token ID366
Feature activation+1.05e+00

Pos logit contributions
 Ms+0.016
 last+0.015
 praise+0.015
 teach+0.015
 visit+0.014
Neg logit contributions
vel-0.024
nes-0.024
oll-0.023
aces-0.023
ode-0.022
TokenYes
Token ID5297
Feature activation+6.11e-01

Pos logit contributions
 Ms+0.026
 Mrs+0.024
 praise+0.023
 thanks+0.023
 Mr+0.023
Neg logit contributions
nes-0.053
ust-0.048
ode-0.047
vel-0.046
oll-0.046
Token and
Token ID290
Feature activation+4.76e-03

Pos logit contributions
 Ms+0.005
 while+0.004
 Mrs+0.004
 teach+0.004
 Mr+0.004
Neg logit contributions
nes-0.008
vel-0.008
oll-0.008
ode-0.008
aces-0.007
Token the
Token ID262
Feature activation+2.94e-02

Pos logit contributions
 Ms+0.000
 while+0.000
 teach+0.000
 Mrs+0.000
 last+0.000
Neg logit contributions
nes-0.000
vel-0.000
oll-0.000
ode-0.000
aces-0.000
Token car
Token ID1097
Feature activation+5.78e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mitt+0.001
 Mr+0.001
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token twisted
Token ID19074
Feature activation-5.21e-01

Pos logit contributions
 Ms+0.020
 Mrs+0.017
 while+0.016
 Mr+0.016
 teach+0.016
Neg logit contributions
nes-0.024
vel-0.023
oll-0.023
aces-0.023
ode-0.023
Token in
Token ID287
Feature activation-1.37e-01

Pos logit contributions
nes+0.028
vel+0.026
oll+0.026
ode+0.026
ust+0.025
Neg logit contributions
 Ms-0.012
 while-0.012
 Mr-0.010
 last-0.010
 Mrs-0.010
Token an
Token ID281
Feature activation-3.11e+00

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.007
aces+0.007
Neg logit contributions
 Ms-0.003
 while-0.003
 Mrs-0.003
 Mr-0.003
 teach-0.003
Token uncomfortable
Token ID12916
Feature activation-1.54e+00

Pos logit contributions
nes+0.167
emic+0.146
lor+0.145
ust+0.145
vel+0.144
Neg logit contributions
 last-0.083
 Every-0.077
 front-0.076
 teach-0.071
 the-0.070
Token way
Token ID835
Feature activation-8.41e-01

Pos logit contributions
nes+0.071
oll+0.071
vel+0.070
ust+0.069
onel+0.065
Neg logit contributions
 while-0.047
 last-0.044
 Ms-0.042
 visit-0.040
 before-0.040
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
nes+0.050
oll+0.050
vel+0.049
ode+0.049
aces+0.047
Neg logit contributions
 while-0.013
 Ms-0.012
 last-0.011
 Mr-0.010
 before-0.010
Token The
Token ID383
Feature activation+2.17e-01

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.006
aces+0.006
Neg logit contributions
 Ms-0.003
 while-0.003
 teach-0.002
 last-0.002
 Mrs-0.002
Token little
Token ID1310
Feature activation-2.74e-01

Pos logit contributions
 Ms+0.007
 Mrs+0.006
 Mitt+0.006
 while+0.006
 Mr+0.005
Neg logit contributions
nes-0.010
vel-0.009
oll-0.009
ode-0.009
aces-0.009
Token.
Token ID13
Feature activation+4.79e-01

Pos logit contributions
 Ms+0.022
 Mrs+0.019
 Mr+0.018
 teach+0.018
 while+0.018
Neg logit contributions
nes-0.026
vel-0.026
oll-0.025
ode-0.025
aky-0.025
Token The
Token ID383
Feature activation-1.01e+00

Pos logit contributions
 Ms+0.012
 while+0.010
 last+0.010
 teach+0.010
 praise+0.009
Neg logit contributions
nes-0.022
aces-0.022
vel-0.022
oll-0.022
ode-0.022
Token bird
Token ID6512
Feature activation-3.39e-01

Pos logit contributions
nes+0.052
vel+0.051
oll+0.048
aces+0.047
ode+0.046
Neg logit contributions
 last-0.026
 while-0.025
 Ms-0.024
 Mr-0.024
 visit-0.023
Token lived
Token ID5615
Feature activation+2.79e-01

Pos logit contributions
nes+0.016
vel+0.015
oll+0.015
ode+0.015
aces+0.015
Neg logit contributions
 Ms-0.010
 while-0.009
 Mrs-0.008
 Mr-0.008
 last-0.008
Token in
Token ID287
Feature activation-3.54e-01

Pos logit contributions
 Ms+0.012
 Mrs+0.010
 while+0.010
 teach+0.010
 Mr+0.010
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
ode-0.009
aky-0.009
Token an
Token ID281
Feature activation-3.09e+00

Pos logit contributions
nes+0.020
vel+0.020
oll+0.019
ode+0.019
ust+0.019
Neg logit contributions
 Ms-0.007
 while-0.006
 Mrs-0.005
 Mr-0.005
 last-0.005
Token ordinary
Token ID8850
Feature activation-1.94e+00

Pos logit contributions
nes+0.170
vel+0.163
oll+0.153
ust+0.147
lor+0.146
Neg logit contributions
 last-0.073
 Every-0.069
 Mr-0.067
 the-0.067
 Mrs-0.066
Token tree
Token ID5509
Feature activation+6.46e-01

Pos logit contributions
oll+0.095
aces+0.091
nes+0.090
vel+0.088
onel+0.083
Neg logit contributions
 while-0.063
 visit-0.060
 last-0.057
 Mr-0.054
 Ms-0.053
Token with
Token ID351
Feature activation-2.13e-01

Pos logit contributions
 Ms+0.019
 Mrs+0.016
 Mr+0.015
 Mitt+0.015
 teach+0.015
Neg logit contributions
nes-0.030
vel-0.030
oll-0.029
aces-0.029
aky-0.029
Token its
Token ID663
Feature activation+3.03e-02

Pos logit contributions
nes+0.010
vel+0.009
oll+0.009
ode+0.009
aces+0.009
Neg logit contributions
 Ms-0.006
 while-0.006
 Mrs-0.005
 Mr-0.005
 teach-0.005
Token family
Token ID1641
Feature activation-6.21e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 teach+0.001
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token was
Token ID373
Feature activation-9.71e-02

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ode+0.002
aces+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 Mr-0.001
 teach-0.001
Token the
Token ID262
Feature activation+1.20e-02

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.003
 while-0.003
 Mrs-0.003
 teach-0.003
 Mr-0.003
Token only
Token ID691
Feature activation-5.36e-02

Pos logit contributions
 Ms+0.000
 while+0.000
 Mrs+0.000
 Mr+0.000
 teach+0.000
Neg logit contributions
nes-0.001
oll-0.000
vel-0.000
ode-0.000
aces-0.000
Token stitch
Token ID24695
Feature activation-5.54e-01

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ode+0.002
aces+0.002
Neg logit contributions
 Ms-0.002
 while-0.002
 Mrs-0.002
 Mr-0.001
 teach-0.001
Token in
Token ID287
Feature activation-3.84e-01

Pos logit contributions
nes+0.028
vel+0.027
oll+0.027
ode+0.027
ust+0.026
Neg logit contributions
 Ms-0.013
 while-0.013
 last-0.011
 Mrs-0.011
 teach-0.011
Token an
Token ID281
Feature activation-3.06e+00

Pos logit contributions
nes+0.020
vel+0.019
oll+0.019
ode+0.019
aces+0.019
Neg logit contributions
 Ms-0.009
 while-0.008
 Mrs-0.008
 Mr-0.008
 last-0.008
Token old
Token ID1468
Feature activation-9.05e-01

Pos logit contributions
nes+0.167
emic+0.150
ust+0.148
vel+0.147
lor+0.143
Neg logit contributions
 last-0.076
 front-0.064
 Every-0.064
 visit-0.061
 fat-0.061
Token,
Token ID11
Feature activation-6.74e-01

Pos logit contributions
nes+0.037
oll+0.036
vel+0.036
ust+0.034
ode+0.034
Neg logit contributions
 Ms-0.032
 while-0.031
 Mr-0.029
 teach-0.028
 Mrs-0.028
Token worn
Token ID12666
Feature activation+2.19e-01

Pos logit contributions
nes+0.035
vel+0.034
oll+0.033
ode+0.033
aces+0.032
Neg logit contributions
 Ms-0.016
 while-0.016
 Mr-0.014
 Mrs-0.014
 last-0.014
Token jumper
Token ID31118
Feature activation-6.51e-02

Pos logit contributions
 Ms+0.008
 while+0.007
 Mrs+0.007
 teach+0.007
 Mr+0.007
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
aces-0.008
ode-0.008
Token.
Token ID13
Feature activation-4.89e-01

Pos logit contributions
nes+0.003
vel+0.003
oll+0.003
ode+0.003
aces+0.003
Neg logit contributions
 Ms-0.002
 while-0.001
 Mrs-0.001
 Mr-0.001
 teach-0.001
Token nap
Token ID25422
Feature activation-4.07e-01

Pos logit contributions
 Ms+0.042
 Mrs+0.037
 teach+0.035
 Win+0.035
 Mr+0.035
Neg logit contributions
ode-0.039
oll-0.038
nes-0.037
aces-0.036
vel-0.036
Token.
Token ID13
Feature activation-2.19e-01

Pos logit contributions
nes+0.016
vel+0.016
oll+0.015
ode+0.015
ust+0.015
Neg logit contributions
 Ms-0.014
 while-0.014
 Mrs-0.013
 last-0.013
 Mr-0.013
Token
Token ID220
Feature activation-3.43e-01

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.010
aces+0.010
Neg logit contributions
 Ms-0.005
 while-0.005
 Mrs-0.005
 Mr-0.005
 last-0.004
Token
Token ID198
Feature activation+3.26e-01

Pos logit contributions
nes+0.019
vel+0.019
oll+0.018
ode+0.018
aces+0.018
Neg logit contributions
 Ms-0.007
 while-0.006
 Mrs-0.005
 Mr-0.005
 last-0.005
Token
Token ID198
Feature activation+2.11e-01

Pos logit contributions
 Ms+0.008
 while+0.007
 teach+0.007
 last+0.006
 Mrs+0.006
Neg logit contributions
nes-0.017
vel-0.016
oll-0.015
ode-0.015
ust-0.015
TokenThe
Token ID464
Feature activation+1.87e+00

Pos logit contributions
 Ms+0.006
 last+0.005
 Mr+0.005
 Mrs+0.005
 teach+0.005
Neg logit contributions
nes-0.010
vel-0.010
ust-0.009
oll-0.009
aky-0.009
Token cat
Token ID3797
Feature activation-4.35e-01

Pos logit contributions
 Ms+0.075
glers+0.071
 Both+0.068
 common+0.066
 whenever+0.064
Neg logit contributions
oll-0.056
aky-0.053
ode-0.049
vel-0.048
 pad-0.043
Token's
Token ID338
Feature activation+6.59e-01

Pos logit contributions
nes+0.022
vel+0.021
oll+0.021
ode+0.020
aces+0.020
Neg logit contributions
 Ms-0.011
 while-0.010
 Mrs-0.010
 Mr-0.010
 teach-0.009
Token friend
Token ID1545
Feature activation+1.76e-01

Pos logit contributions
 Ms+0.023
 while+0.020
 Mrs+0.019
 Mr+0.019
 teach+0.018
Neg logit contributions
oll-0.027
ode-0.026
vel-0.025
aces-0.025
nes-0.025
Token wanted
Token ID2227
Feature activation+6.90e-02

Pos logit contributions
 Ms+0.004
 while+0.004
 last+0.004
 teach+0.004
 Mrs+0.004
Neg logit contributions
nes-0.009
vel-0.008
oll-0.008
ode-0.008
aky-0.008
Token to
Token ID284
Feature activation+1.55e+00

Pos logit contributions
 Ms+0.002
 while+0.001
 teach+0.001
 Mrs+0.001
 last+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
aces-0.003
Token "
Token ID366
Feature activation+1.07e+00

Pos logit contributions
 Ms+0.010
 last+0.009
 while+0.009
 teach+0.009
 praise+0.009
Neg logit contributions
vel-0.017
nes-0.017
oll-0.016
ode-0.016
aces-0.016
TokenHello
Token ID15496
Feature activation+9.19e-01

Pos logit contributions
 Ms+0.032
 last+0.029
 teach+0.029
 choice+0.029
 praise+0.029
Neg logit contributions
nes-0.043
ode-0.042
vel-0.041
aky-0.040
oll-0.038
Token,
Token ID11
Feature activation+1.43e-01

Pos logit contributions
 Ms+0.022
 praise+0.021
 teach+0.019
 while+0.019
 last+0.019
Neg logit contributions
vel-0.042
oll-0.042
nes-0.041
ust-0.040
ode-0.040
Token dog
Token ID3290
Feature activation+1.75e+00

Pos logit contributions
 Ms+0.004
 while+0.003
 teach+0.003
 last+0.003
 Mrs+0.003
Neg logit contributions
nes-0.007
oll-0.007
vel-0.006
ode-0.006
ust-0.006
Token!
Token ID0
Feature activation+9.63e-01

Pos logit contributions
 Ms+0.051
 praise+0.051
 usual+0.050
 last+0.050
 particular+0.048
Neg logit contributions
nes-0.070
vel-0.068
aky-0.064
oll-0.063
ode-0.062
Token Can
Token ID1680
Feature activation+1.54e+00

Pos logit contributions
 last+0.032
 Ms+0.031
 while+0.031
 choice+0.030
 praise+0.029
Neg logit contributions
nes-0.037
oll-0.035
vel-0.035
ode-0.033
aces-0.033
Token I
Token ID314
Feature activation+1.60e+00

Pos logit contributions
 Ms+0.057
 early+0.054
 praise+0.054
glers+0.054
 while+0.053
Neg logit contributions
nes-0.054
ode-0.046
vel-0.045
oll-0.044
aces-0.042
Token take
Token ID1011
Feature activation+4.58e-01

Pos logit contributions
 Ms+0.045
 downward+0.038
 Mrs+0.037
 early+0.037
 while+0.037
Neg logit contributions
nes-0.072
ode-0.067
vel-0.062
oll-0.061
aky-0.061
Token a
Token ID257
Feature activation-8.03e-02

Pos logit contributions
 Ms+0.013
 while+0.012
 Mrs+0.011
 last+0.011
glers+0.011
Neg logit contributions
nes-0.020
vel-0.020
oll-0.020
aces-0.019
ode-0.019
Token picture
Token ID4286
Feature activation+2.53e-01

Pos logit contributions
nes+0.003
vel+0.003
oll+0.003
ode+0.003
aces+0.003
Neg logit contributions
 Ms-0.003
 while-0.002
 Mrs-0.002
 Mr-0.002
 last-0.002
Token of
Token ID286
Feature activation+4.71e-01

Pos logit contributions
 Ms+0.006
 while+0.005
 Mrs+0.005
 Mr+0.005
 Mitt+0.004
Neg logit contributions
nes-0.013
vel-0.013
oll-0.013
ode-0.012
aces-0.012
Token cool
Token ID3608
Feature activation+1.03e+00

Pos logit contributions
 Ms+0.009
 Mrs+0.008
 while+0.008
 Mr+0.008
 teach+0.008
Neg logit contributions
nes-0.011
vel-0.010
oll-0.010
ode-0.010
aces-0.010
Token.
Token ID13
Feature activation+2.02e-01

Pos logit contributions
 Ms+0.033
 Mrs+0.029
glers+0.027
 teach+0.026
 Mr+0.026
Neg logit contributions
oll-0.043
vel-0.042
nes-0.042
ode-0.040
aces-0.039
Token
Token ID220
Feature activation-3.10e-02

Pos logit contributions
 Ms+0.005
 while+0.005
 teach+0.004
 last+0.004
 Mrs+0.004
Neg logit contributions
nes-0.010
oll-0.010
vel-0.010
aces-0.009
ode-0.009
Token
Token ID198
Feature activation-2.41e-01

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ode+0.002
aces+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 last-0.001
 teach-0.001
 Mrs-0.001
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
nes+0.012
vel+0.012
oll+0.012
ode+0.012
aces+0.012
Neg logit contributions
 Ms-0.006
 while-0.005
 Mrs-0.005
 Mr-0.005
 last-0.005
TokenThe
Token ID464
Feature activation+1.67e+00

Pos logit contributions
 Ms+0.004
 while+0.004
 Mrs+0.004
 Mr+0.004
 last+0.004
Neg logit contributions
nes-0.007
vel-0.007
oll-0.007
ode-0.007
ust-0.007
Token End
Token ID5268
Feature activation+2.71e-01

Pos logit contributions
glers+0.057
 Ms+0.057
 Mrs+0.052
bat+0.051
s+0.049
Neg logit contributions
oll-0.063
aky-0.056
vel-0.056
nes-0.051
onel-0.048
Token<|endoftext|>
Token ID50256
Feature activation+1.55e-01

Pos logit contributions
 Ms+0.006
 while+0.005
 Mrs+0.005
 teach+0.005
 Mr+0.005
Neg logit contributions
nes-0.014
vel-0.014
oll-0.014
ust-0.013
aces-0.013
TokenOne
Token ID3198
Feature activation+4.16e-01

Pos logit contributions
 Ms+0.004
 while+0.003
 teach+0.003
 Mrs+0.003
 last+0.003
Neg logit contributions
nes-0.008
vel-0.008
oll-0.008
ode-0.007
aces-0.007
Token day
Token ID1110
Feature activation+7.43e-01

Pos logit contributions
 Ms+0.013
 Mrs+0.011
 while+0.011
 Mr+0.010
 teach+0.010
Neg logit contributions
nes-0.018
oll-0.018
vel-0.018
ust-0.017
aky-0.017
Token,
Token ID11
Feature activation+6.73e-01

Pos logit contributions
 Ms+0.019
 teach+0.017
 praise+0.017
 last+0.016
 Mrs+0.016
Neg logit contributions
vel-0.036
nes-0.035
oll-0.034
ust-0.032
aund-0.031
Token
Token ID220
Feature activation-4.19e-01

Pos logit contributions
nes+0.003
vel+0.003
oll+0.003
ode+0.002
aces+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 last-0.001
 teach-0.001
Token
Token ID198
Feature activation+9.39e-02

Pos logit contributions
nes+0.023
vel+0.023
oll+0.023
ode+0.022
aces+0.022
Neg logit contributions
 Ms-0.008
 while-0.007
 Mrs-0.006
 Mr-0.006
 last-0.006
Token
Token ID198
Feature activation+5.71e-01

Pos logit contributions
 Ms+0.002
 while+0.002
 teach+0.002
 Mrs+0.002
 last+0.002
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.005
ust-0.005
TokenOne
Token ID3198
Feature activation+3.77e-01

Pos logit contributions
 Ms+0.016
 while+0.014
 Mrs+0.014
 praise+0.013
 last+0.013
Neg logit contributions
vel-0.027
nes-0.027
oll-0.025
ode-0.025
ust-0.025
Token day
Token ID1110
Feature activation+1.19e+00

Pos logit contributions
 Ms+0.011
 Mrs+0.010
 teach+0.010
 Mr+0.009
 praise+0.009
Neg logit contributions
nes-0.016
oll-0.016
vel-0.016
aky-0.016
ust-0.016
Token,
Token ID11
Feature activation+1.57e+00

Pos logit contributions
 Ms+0.031
 praise+0.029
 last+0.028
glers+0.027
 teach+0.027
Neg logit contributions
vel-0.056
oll-0.054
nes-0.051
ig-0.049
aces-0.048
Token T
Token ID309
Feature activation-3.15e-01

Pos logit contributions
 praise+0.035
 Ms+0.033
glers+0.032
 teach+0.031
 last+0.031
Neg logit contributions
oll-0.072
vel-0.072
nes-0.069
aces-0.064
ocket-0.064
Tokenilly
Token ID6548
Feature activation+4.27e-01

Pos logit contributions
nes+0.011
vel+0.011
oll+0.010
ode+0.010
aces+0.010
Neg logit contributions
 Ms-0.013
 while-0.012
 Mrs-0.011
 Mr-0.011
 last-0.011
Token found
Token ID1043
Feature activation+1.46e-01

Pos logit contributions
 Ms+0.015
 Mrs+0.013
 while+0.013
 Mr+0.013
 last+0.013
Neg logit contributions
nes-0.018
vel-0.017
ode-0.016
oll-0.016
aces-0.016
Token a
Token ID257
Feature activation-4.30e-01

Pos logit contributions
 Ms+0.004
 while+0.003
 teach+0.003
 Mrs+0.003
 visit+0.003
Neg logit contributions
nes-0.007
oll-0.007
vel-0.007
aces-0.007
ode-0.007
Token little
Token ID1310
Feature activation-5.38e-01

Pos logit contributions
nes+0.021
vel+0.021
oll+0.020
ode+0.020
aces+0.020
Neg logit contributions
 Ms-0.011
 while-0.010
 Mrs-0.010
 Mr-0.010
 last-0.010
Token,
Token ID11
Feature activation+1.62e-01

Pos logit contributions
nes+0.030
vel+0.029
oll+0.029
ode+0.028
ust+0.028
Neg logit contributions
 Ms-0.009
 while-0.008
 Mr-0.007
 last-0.007
 Mrs-0.007
Token Lily
Token ID20037
Feature activation+1.42e-01

Pos logit contributions
 Ms+0.004
 while+0.003
 last+0.003
 teach+0.003
 Mrs+0.003
Neg logit contributions
nes-0.008
vel-0.008
oll-0.008
ode-0.008
ust-0.007
Token.
Token ID13
Feature activation+4.83e-01

Pos logit contributions
 Ms+0.003
 while+0.003
 Mrs+0.003
 teach+0.003
 Mr+0.003
Neg logit contributions
nes-0.007
vel-0.007
oll-0.007
ode-0.007
aces-0.007
Token One
Token ID1881
Feature activation-6.03e-02

Pos logit contributions
 Ms+0.012
 while+0.011
 last+0.011
 teach+0.011
 praise+0.011
Neg logit contributions
nes-0.022
oll-0.021
vel-0.021
aces-0.021
ode-0.020
Token day
Token ID1110
Feature activation+1.63e+00

Pos logit contributions
nes+0.003
vel+0.003
oll+0.003
ode+0.003
ust+0.003
Neg logit contributions
 Ms-0.002
 while-0.001
 Mrs-0.001
 Mr-0.001
 teach-0.001
Token,
Token ID11
Feature activation+1.75e+00

Pos logit contributions
 Ms+0.054
 praise+0.048
glers+0.048
 teach+0.047
 hire+0.046
Neg logit contributions
vel-0.062
oll-0.061
nes-0.057
aces-0.056
aund-0.054
Token they
Token ID484
Feature activation-3.93e-01

Pos logit contributions
 praise+0.056
 hire+0.055
 types+0.053
glers+0.053
 Ms+0.053
Neg logit contributions
nes-0.054
vel-0.049
oll-0.048
aces-0.046
glass-0.045
Token went
Token ID1816
Feature activation+4.81e-02

Pos logit contributions
nes+0.018
vel+0.018
oll+0.018
ode+0.017
aces+0.017
Neg logit contributions
 Ms-0.011
 while-0.011
 Mrs-0.010
 Mr-0.010
 last-0.010
Token to
Token ID284
Feature activation+3.49e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 teach+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
aces-0.002
Token the
Token ID262
Feature activation-9.02e-01

Pos logit contributions
 Ms+0.006
 while+0.005
 teach+0.004
 last+0.004
 Mrs+0.004
Neg logit contributions
nes-0.019
vel-0.018
oll-0.018
ode-0.018
ust-0.017
Token beach
Token ID10481
Feature activation+1.10e+00

Pos logit contributions
nes+0.046
vel+0.046
aces+0.044
oll+0.044
ode+0.042
Neg logit contributions
 Ms-0.025
 Mr-0.022
 last-0.022
 while-0.022
 Mrs-0.022
Token�
Token ID241
Feature activation+2.09e-01

Pos logit contributions
nes+0.013
vel+0.012
oll+0.012
ode+0.012
aces+0.011
Neg logit contributions
 Ms-0.013
 while-0.012
 Mrs-0.012
 Mr-0.012
 teach-0.012
TokenYou
Token ID1639
Feature activation-5.83e-01

Pos logit contributions
 Ms+0.006
 while+0.006
 Mrs+0.006
 teach+0.005
 Mr+0.005
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
ust-0.009
ode-0.009
Token never
Token ID1239
Feature activation+5.65e-01

Pos logit contributions
nes+0.025
vel+0.025
oll+0.025
ode+0.024
aces+0.023
Neg logit contributions
 Ms-0.018
 while-0.017
 teach-0.016
 last-0.016
 Mrs-0.016
Token know
Token ID760
Feature activation-5.78e-01

Pos logit contributions
 Ms+0.018
 while+0.017
 Mrs+0.016
 Mitt+0.016
 Mr+0.016
Neg logit contributions
nes-0.024
vel-0.023
oll-0.022
ode-0.022
ust-0.022
Token what
Token ID644
Feature activation-6.19e-01

Pos logit contributions
nes+0.026
vel+0.025
oll+0.025
ode+0.024
ust+0.023
Neg logit contributions
 Ms-0.018
 while-0.016
 Mrs-0.015
 Mr-0.015
 last-0.015
Token you
Token ID345
Feature activation+5.68e-01

Pos logit contributions
nes+0.032
vel+0.031
oll+0.031
ode+0.030
ust+0.029
Neg logit contributions
 Ms-0.015
 while-0.014
 Mrs-0.013
 Mr-0.013
 teach-0.013
Tokenâ
Token ID22940
Feature activation+1.55e-02

Pos logit contributions
 Ms+0.019
 while+0.017
 Mitt+0.017
 Mrs+0.017
 Mr+0.017
Neg logit contributions
nes-0.023
vel-0.023
oll-0.022
ode-0.022
ust-0.021
Token€
Token ID26391
Feature activation-4.56e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 last+0.001
Neg logit contributions
nes-0.000
vel-0.000
oll-0.000
ode-0.000
aces-0.000
Tokenâ„¢
Token ID8151
Feature activation+9.18e-01

Pos logit contributions
nes+0.002
vel+0.001
oll+0.001
ode+0.001
aces+0.001
Neg logit contributions
 Ms-0.002
 while-0.002
 Mrs-0.002
 last-0.002
 Mr-0.002
Tokenll
Token ID297
Feature activation-3.53e-01

Pos logit contributions
 Ms+0.026
 teach+0.024
 while+0.024
 praise+0.024
 visit+0.023
Neg logit contributions
nes-0.042
vel-0.041
oll-0.041
ust-0.039
aces-0.038
Token find
Token ID1064
Feature activation-2.62e-02

Pos logit contributions
nes+0.017
vel+0.016
oll+0.016
ode+0.016
aces+0.016
Neg logit contributions
 Ms-0.010
 while-0.009
 Mrs-0.008
 last-0.008
 teach-0.008
Token Jenny
Token ID28437
Feature activation+1.76e-01

Pos logit contributions
 Ms+0.017
 last+0.015
 while+0.015
 praise+0.015
 teach+0.015
Neg logit contributions
oll-0.030
vel-0.030
nes-0.030
ode-0.029
ust-0.027
Token,"
Token ID553
Feature activation+6.39e-01

Pos logit contributions
 Ms+0.004
 while+0.004
 teach+0.004
 Mrs+0.004
 last+0.004
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
ode-0.008
aces-0.008
Token said
Token ID531
Feature activation+7.95e-01

Pos logit contributions
 Ms+0.016
 while+0.015
 last+0.015
 teach+0.014
glers+0.014
Neg logit contributions
nes-0.028
oll-0.027
vel-0.027
ode-0.025
aky-0.025
Token Mr
Token ID1770
Feature activation-1.24e+00

Pos logit contributions
 Ms+0.018
glers+0.017
 while+0.017
 last+0.017
 praise+0.016
Neg logit contributions
nes-0.037
oll-0.035
vel-0.034
aces-0.032
anna-0.031
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
nes+0.078
vel+0.078
oll+0.074
ode+0.074
onel+0.073
Neg logit contributions
 Ms-0.016
 while-0.013
 Mr-0.013
 Mrs-0.013
 last-0.012
Token Fisher
Token ID14388
Feature activation+8.16e-01

Pos logit contributions
 Ms+0.029
glers+0.027
 praise+0.027
 teach+0.027
 last+0.027
Neg logit contributions
nes-0.051
vel-0.050
oll-0.049
ode-0.048
aky-0.047
Token.
Token ID13
Feature activation+7.70e-01

Pos logit contributions
 Ms+0.029
 Mrs+0.025
 while+0.024
 Mitt+0.024
 Mr+0.024
Neg logit contributions
nes-0.033
ode-0.031
oll-0.031
vel-0.031
aces-0.030
Token "
Token ID366
Feature activation-3.91e-02

Pos logit contributions
 Ms+0.021
 last+0.020
 teach+0.020
 while+0.019
glers+0.019
Neg logit contributions
nes-0.033
oll-0.032
vel-0.032
aces-0.030
ode-0.029
TokenWe
Token ID1135
Feature activation-9.76e-01

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ode+0.002
aces+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 Mr-0.001
 last-0.001
Token have
Token ID423
Feature activation-1.45e+00

Pos logit contributions
nes+0.047
vel+0.046
oll+0.045
aces+0.043
ode+0.043
Neg logit contributions
 Ms-0.025
 while-0.024
 last-0.024
 teach-0.023
 Mr-0.023
Token to
Token ID284
Feature activation+1.26e-01

Pos logit contributions
oll+0.087
nes+0.087
vel+0.086
onel+0.083
ust+0.083
Neg logit contributions
 while-0.023
 Ms-0.022
 last-0.021
 Mr-0.020
 Mrs-0.020
Token John
Token ID1757
Feature activation-4.83e-01

Pos logit contributions
 Ms+0.005
 teach+0.004
 while+0.004
 last+0.004
 Mrs+0.004
Neg logit contributions
nes-0.012
oll-0.012
vel-0.012
ode-0.012
ust-0.011
Token found
Token ID1043
Feature activation-1.92e-01

Pos logit contributions
nes+0.024
vel+0.023
oll+0.023
ode+0.022
aces+0.022
Neg logit contributions
 Ms-0.013
 while-0.012
 last-0.011
 Mrs-0.011
 Mr-0.011
Token himself
Token ID2241
Feature activation-8.70e-01

Pos logit contributions
nes+0.010
vel+0.009
oll+0.009
ode+0.009
aces+0.009
Neg logit contributions
 Ms-0.005
 while-0.004
 Mrs-0.004
 Mr-0.004
 teach-0.004
Token in
Token ID287
Feature activation+8.49e-01

Pos logit contributions
nes+0.041
vel+0.039
oll+0.039
ode+0.038
aces+0.037
Neg logit contributions
 Ms-0.024
 while-0.024
 last-0.022
 before-0.021
 teach-0.021
Token a
Token ID257
Feature activation-9.63e-01

Pos logit contributions
 Ms+0.022
 while+0.019
glers+0.018
 teach+0.018
 Mitt+0.017
Neg logit contributions
nes-0.037
oll-0.037
vel-0.037
aces-0.036
ode-0.035
Token park
Token ID3952
Feature activation+3.69e-01

Pos logit contributions
nes+0.047
oll+0.046
aces+0.046
vel+0.045
ode+0.043
Neg logit contributions
 Ms-0.027
 while-0.026
 last-0.024
 Mrs-0.024
 Mr-0.024
Token.
Token ID13
Feature activation-3.39e-01

Pos logit contributions
 Ms+0.009
 Mrs+0.007
 Mr+0.007
 Mitt+0.007
 while+0.007
Neg logit contributions
nes-0.019
vel-0.018
oll-0.018
ode-0.018
aces-0.017
Token He
Token ID679
Feature activation+3.94e-02

Pos logit contributions
nes+0.015
vel+0.015
oll+0.014
ode+0.014
ust+0.014
Neg logit contributions
 Ms-0.010
 while-0.010
 Mrs-0.009
 Mr-0.009
 last-0.009
Token continued
Token ID3767
Feature activation-1.22e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 teach+0.001
 Mr+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
aces-0.002
Token to
Token ID284
Feature activation-6.50e-01

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.006
aces+0.005
Neg logit contributions
 Ms-0.003
 while-0.003
 Mrs-0.003
 Mr-0.003
 teach-0.003
Token explore
Token ID7301
Feature activation+3.92e-01

Pos logit contributions
nes+0.036
oll+0.036
vel+0.036
aces+0.034
ust+0.034
Neg logit contributions
 Ms-0.012
 while-0.012
 teach-0.011
 last-0.011
 visit-0.010
Token my
Token ID616
Feature activation+6.89e-01

Pos logit contributions
 Ms+0.031
 Mitt+0.027
 praise+0.027
Thank+0.027
 Mrs+0.027
Neg logit contributions
vel-0.042
nes-0.042
oll-0.039
ode-0.038
ust-0.037
Token own
Token ID898
Feature activation+9.30e-01

Pos logit contributions
 Ms+0.022
 while+0.019
 Mitt+0.019
 Mrs+0.019
glers+0.018
Neg logit contributions
oll-0.029
vel-0.028
ode-0.028
nes-0.028
aces-0.027
Token?
Token ID30
Feature activation+4.16e-01

Pos logit contributions
 Ms+0.031
 Mitt+0.027
 while+0.026
 Mrs+0.026
 Mr+0.025
Neg logit contributions
ode-0.038
vel-0.038
oll-0.037
nes-0.037
aces-0.036
Tokenâ
Token ID22940
Feature activation-5.49e-01

Pos logit contributions
 Ms+0.013
 while+0.012
 last+0.012
 Mrs+0.011
 teach+0.011
Neg logit contributions
nes-0.018
vel-0.017
oll-0.017
ode-0.016
aces-0.016
Token€
Token ID26391
Feature activation+1.08e-01

Pos logit contributions
nes+0.018
vel+0.017
oll+0.017
ode+0.016
ust+0.016
Neg logit contributions
 Ms-0.023
 while-0.022
 teach-0.021
 last-0.021
 Mrs-0.021
Token
Token ID198
Feature activation-2.94e-02

Pos logit contributions
 Ms+0.003
 while+0.002
 last+0.002
 Mrs+0.002
 teach+0.002
Neg logit contributions
nes-0.006
vel-0.005
oll-0.005
ode-0.005
aces-0.005
Token
Token ID198
Feature activation-1.23e-01

Pos logit contributions
nes+0.002
vel+0.002
oll+0.001
ode+0.001
ust+0.001
Neg logit contributions
 Ms-0.001
 while-0.001
 last-0.001
 teach-0.001
 Mrs-0.001
TokenThe
Token ID464
Feature activation-7.92e-01

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.005
ust+0.005
Neg logit contributions
 Ms-0.004
 while-0.004
 Mrs-0.004
 last-0.004
 Mr-0.003
Token crystal
Token ID15121
Feature activation-1.76e-01

Pos logit contributions
nes+0.029
vel+0.026
oll+0.025
ust+0.024
aces+0.024
Neg logit contributions
 Ms-0.032
 while-0.031
 last-0.030
 Mr-0.030
 Mrs-0.029
Token said
Token ID531
Feature activation-3.92e-01

Pos logit contributions
nes+0.009
vel+0.009
oll+0.009
ode+0.008
aces+0.008
Neg logit contributions
 Ms-0.004
 while-0.004
 Mrs-0.004
 Mr-0.004
 teach-0.003
Token,
Token ID11
Feature activation-5.48e-01

Pos logit contributions
nes+0.020
vel+0.020
oll+0.020
ode+0.019
aces+0.019
Neg logit contributions
 Ms-0.009
 while-0.008
 Mrs-0.008
 Mr-0.008
 teach-0.008
Token shiny
Token ID22441
Feature activation+9.09e-02

Pos logit contributions
 Ms+0.037
 Mrs+0.032
 Mr+0.030
glers+0.030
 Mitt+0.029
Neg logit contributions
nes-0.045
vel-0.044
oll-0.042
ode-0.041
ust-0.040
Token.
Token ID13
Feature activation-3.46e-01

Pos logit contributions
 Ms+0.002
 Mrs+0.002
 while+0.002
 Mr+0.002
 teach+0.002
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.005
aces-0.004
Token
Token ID220
Feature activation-4.12e-01

Pos logit contributions
nes+0.019
vel+0.018
oll+0.018
ode+0.018
aces+0.017
Neg logit contributions
 Ms-0.008
 while-0.007
 Mrs-0.006
 Mr-0.006
 last-0.006
Token
Token ID198
Feature activation+1.20e-01

Pos logit contributions
nes+0.023
vel+0.023
oll+0.022
ode+0.022
aces+0.021
Neg logit contributions
 Ms-0.008
 while-0.007
 Mrs-0.006
 Mr-0.006
 last-0.006
Token
Token ID198
Feature activation-3.10e-02

Pos logit contributions
 Ms+0.003
 while+0.002
 teach+0.002
 Mrs+0.002
 last+0.002
Neg logit contributions
nes-0.006
vel-0.006
oll-0.006
ode-0.006
ust-0.006
TokenOne
Token ID3198
Feature activation+6.04e-01

Pos logit contributions
nes+0.002
vel+0.002
oll+0.001
ode+0.001
ust+0.001
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 Mr-0.001
 last-0.001
Token day
Token ID1110
Feature activation+6.22e-01

Pos logit contributions
 Ms+0.019
 Mrs+0.017
 teach+0.016
 Mitt+0.016
 Mr+0.016
Neg logit contributions
oll-0.025
nes-0.025
vel-0.025
aky-0.024
ust-0.024
Token,
Token ID11
Feature activation+8.45e-01

Pos logit contributions
 Ms+0.014
 teach+0.013
 last+0.012
 praise+0.012
 Mrs+0.012
Neg logit contributions
nes-0.030
vel-0.030
oll-0.030
aces-0.028
ode-0.028
Token John
Token ID1757
Feature activation-3.28e-01

Pos logit contributions
 Ms+0.019
 teach+0.017
 praise+0.017
 last+0.016
 kiss+0.015
Neg logit contributions
nes-0.041
vel-0.039
oll-0.038
ode-0.037
ust-0.037
Token was
Token ID373
Feature activation-6.11e-01

Pos logit contributions
nes+0.015
vel+0.015
oll+0.015
ode+0.014
aces+0.014
Neg logit contributions
 Ms-0.009
 while-0.009
 Mrs-0.008
 Mr-0.008
 last-0.008
Token feeling
Token ID4203
Feature activation+3.63e-01

Pos logit contributions
nes+0.030
vel+0.029
oll+0.029
ode+0.029
ust+0.028
Neg logit contributions
 Ms-0.015
 while-0.014
 Mrs-0.013
 Mr-0.013
 teach-0.013
Token the
Token ID262
Feature activation-3.90e-01

Pos logit contributions
 Ms+0.003
 while+0.002
 Mrs+0.002
 teach+0.002
 Mr+0.002
Neg logit contributions
nes-0.008
vel-0.008
oll-0.007
ode-0.007
aces-0.007
Token light
Token ID1657
Feature activation-6.50e-01

Pos logit contributions
nes+0.019
vel+0.018
oll+0.018
ode+0.017
ust+0.017
Neg logit contributions
 Ms-0.011
 while-0.010
 last-0.009
 Mrs-0.009
 Mr-0.009
Token and
Token ID290
Feature activation+1.74e-02

Pos logit contributions
nes+0.036
vel+0.035
oll+0.035
ode+0.034
ust+0.034
Neg logit contributions
 Ms-0.012
 while-0.012
 last-0.010
 Mrs-0.010
 Mr-0.010
Token smiled
Token ID13541
Feature activation+6.45e-02

Pos logit contributions
 Ms+0.000
 while+0.000
 Mrs+0.000
 teach+0.000
 Mr+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token.
Token ID13
Feature activation-3.13e-01

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 teach+0.001
 Mr+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
ust-0.003
Token
Token ID198
Feature activation-3.04e-01

Pos logit contributions
nes+0.015
vel+0.015
oll+0.015
ode+0.014
aces+0.014
Neg logit contributions
 Ms-0.008
 while-0.008
 Mrs-0.007
 Mr-0.007
 last-0.007
Token
Token ID198
Feature activation-9.04e-02

Pos logit contributions
nes+0.017
vel+0.016
oll+0.016
ode+0.016
aces+0.016
Neg logit contributions
 Ms-0.006
 while-0.005
 Mrs-0.005
 Mr-0.005
 last-0.005
TokenIt
Token ID1026
Feature activation-6.20e-03

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
ust+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 Mrs-0.002
 Mr-0.002
 last-0.002
Token was
Token ID373
Feature activation+1.08e+00

Pos logit contributions
nes+0.000
vel+0.000
oll+0.000
ode+0.000
ust+0.000
Neg logit contributions
 Ms-0.000
 while-0.000
 Mrs-0.000
 Mr-0.000
 teach-0.000
Token a
Token ID257
Feature activation+5.13e-01

Pos logit contributions
 Ms+0.033
 teach+0.032
 while+0.030
 praise+0.030
glers+0.028
Neg logit contributions
vel-0.042
nes-0.042
oll-0.042
ust-0.040
aky-0.040
Token wonderful
Token ID7932
Feature activation-3.99e-02

Pos logit contributions
 Ms+0.017
 Mrs+0.015
 while+0.015
 Mitt+0.014
 Mr+0.014
Neg logit contributions
nes-0.021
vel-0.020
ode-0.019
oll-0.019
ust-0.019
Token happened
Token ID3022
Feature activation+1.91e-01

Pos logit contributions
nes+0.029
vel+0.028
ode+0.028
oll+0.027
ust+0.026
Neg logit contributions
 Ms-0.016
 while-0.015
 Mrs-0.014
 Mr-0.014
 Mitt-0.014
Token and
Token ID290
Feature activation-4.62e-01

Pos logit contributions
 Ms+0.005
 Mrs+0.004
 Mr+0.004
 teach+0.004
 while+0.004
Neg logit contributions
vel-0.009
nes-0.009
oll-0.009
ode-0.008
aces-0.008
Token ran
Token ID4966
Feature activation+6.64e-01

Pos logit contributions
nes+0.024
vel+0.023
oll+0.023
ode+0.023
ust+0.022
Neg logit contributions
 Ms-0.012
 while-0.011
 Mrs-0.010
 Mr-0.010
 last-0.010
Token to
Token ID284
Feature activation+6.80e-01

Pos logit contributions
 Ms+0.025
 teach+0.022
 Mrs+0.021
 praise+0.021
 Mr+0.021
Neg logit contributions
nes-0.025
vel-0.022
oll-0.021
ust-0.021
aces-0.021
Token help
Token ID1037
Feature activation+9.75e-03

Pos logit contributions
 Ms+0.018
glers+0.017
 while+0.017
s+0.015
Thank+0.014
Neg logit contributions
nes-0.031
vel-0.028
oll-0.027
onel-0.026
ode-0.025
Token Lily
Token ID20037
Feature activation-6.25e-01

Pos logit contributions
 Ms+0.000
 while+0.000
 Mrs+0.000
 last+0.000
 teach+0.000
Neg logit contributions
nes-0.000
vel-0.000
oll-0.000
ode-0.000
aces-0.000
Token.
Token ID13
Feature activation-6.89e-01

Pos logit contributions
nes+0.035
oll+0.034
vel+0.034
ode+0.034
aces+0.033
Neg logit contributions
 Ms-0.012
 while-0.011
 teach-0.010
 last-0.010
 Mr-0.010
Token He
Token ID679
Feature activation+2.11e-01

Pos logit contributions
nes+0.036
vel+0.035
ode+0.034
oll+0.033
ust+0.032
Neg logit contributions
 Ms-0.018
 while-0.017
 Mr-0.017
 Mrs-0.016
 Mitt-0.016
Token was
Token ID373
Feature activation-1.06e+00

Pos logit contributions
 Ms+0.006
 while+0.006
 Mrs+0.006
 teach+0.006
 last+0.006
Neg logit contributions
nes-0.009
oll-0.009
vel-0.009
ode-0.009
aky-0.008
Token frightened
Token ID24776
Feature activation-7.21e-01

Pos logit contributions
nes+0.056
ode+0.054
vel+0.052
oll+0.052
ust+0.052
Neg logit contributions
 Ms-0.026
 Mr-0.023
 while-0.023
 Mrs-0.023
 Mitt-0.022
Token that
Token ID326
Feature activation+1.15e-01

Pos logit contributions
nes+0.044
vel+0.041
ode+0.041
oll+0.040
aces+0.040
Neg logit contributions
 while-0.013
 Ms-0.012
 Mr-0.011
 Mrs-0.010
 last-0.010
Token a
Token ID257
Feature activation-3.48e-01

Pos logit contributions
 Ms+0.013
 while+0.012
 teach+0.012
 last+0.011
 Mrs+0.011
Neg logit contributions
nes-0.021
oll-0.020
vel-0.020
aces-0.019
ode-0.019
Token lot
Token ID1256
Feature activation+4.12e-01

Pos logit contributions
nes+0.016
vel+0.016
oll+0.016
ode+0.015
aces+0.015
Neg logit contributions
 Ms-0.010
 while-0.009
 Mrs-0.009
 Mr-0.009
 teach-0.008
Token of
Token ID286
Feature activation+7.05e-01

Pos logit contributions
 Ms+0.010
 while+0.009
 Mitt+0.009
 Mr+0.009
 teach+0.009
Neg logit contributions
nes-0.020
oll-0.020
vel-0.020
aces-0.020
ode-0.019
Token cars
Token ID5006
Feature activation-6.43e-01

Pos logit contributions
 Ms+0.023
 while+0.021
 Mrs+0.020
 teach+0.020
 Mr+0.020
Neg logit contributions
nes-0.028
vel-0.027
oll-0.027
aces-0.027
ode-0.026
Token.
Token ID13
Feature activation-2.80e-01

Pos logit contributions
nes+0.034
vel+0.033
oll+0.032
ode+0.032
ust+0.031
Neg logit contributions
 Ms-0.014
 while-0.014
 last-0.013
 Mrs-0.012
 teach-0.012
Token Lily
Token ID20037
Feature activation-5.52e-01

Pos logit contributions
nes+0.014
vel+0.013
oll+0.013
ode+0.013
aces+0.013
Neg logit contributions
 Ms-0.008
 while-0.007
 Mrs-0.007
 Mr-0.006
 teach-0.006
Token said
Token ID531
Feature activation-1.13e+00

Pos logit contributions
nes+0.028
vel+0.027
oll+0.027
ode+0.026
aces+0.026
Neg logit contributions
 Ms-0.014
 while-0.013
 Mrs-0.012
 Mr-0.012
 teach-0.011
Token,
Token ID11
Feature activation-3.34e-01

Pos logit contributions
nes+0.076
ode+0.076
vel+0.074
oll+0.074
ust+0.073
Neg logit contributions
 while-0.010
 Ms-0.009
 Mr-0.008
 Mrs-0.007
 every-0.006
Token "
Token ID366
Feature activation+8.87e-03

Pos logit contributions
nes+0.018
vel+0.017
oll+0.017
ode+0.017
aces+0.016
Neg logit contributions
 Ms-0.008
 while-0.007
 Mrs-0.006
 Mr-0.006
 teach-0.006
TokenThat
Token ID2504
Feature activation+1.37e-01

Pos logit contributions
 Ms+0.000
 while+0.000
 last+0.000
 Mrs+0.000
 Mr+0.000
Neg logit contributions
nes-0.000
vel-0.000
oll-0.000
ode-0.000
ust-0.000
Token house
Token ID2156
Feature activation+8.27e-01

Pos logit contributions
 Ms+0.005
 while+0.005
 Mrs+0.005
 Mr+0.004
 teach+0.004
Neg logit contributions
nes-0.005
vel-0.005
ode-0.005
oll-0.005
aces-0.005
Token wants
Token ID3382
Feature activation+9.28e-02

Pos logit contributions
 Ms+0.003
 while+0.003
 Mrs+0.003
 Mr+0.003
 Mitt+0.003
Neg logit contributions
nes-0.006
vel-0.006
oll-0.006
ode-0.006
aces-0.006
Token pasta
Token ID26296
Feature activation-1.08e+00

Pos logit contributions
 Ms+0.003
 while+0.002
 teach+0.002
 Mrs+0.002
 last+0.002
Neg logit contributions
nes-0.004
vel-0.004
oll-0.004
ode-0.004
aces-0.004
Token and
Token ID290
Feature activation+4.62e-01

Pos logit contributions
nes+0.059
vel+0.058
ode+0.058
oll+0.057
onel+0.056
Neg logit contributions
 while-0.021
 Ms-0.021
 last-0.018
 Mr-0.017
 teach-0.017
Token salad
Token ID20698
Feature activation-1.01e+00

Pos logit contributions
 Ms+0.017
 while+0.015
 Mrs+0.015
 last+0.015
 teach+0.014
Neg logit contributions
nes-0.017
oll-0.016
vel-0.016
ode-0.016
aces-0.015
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
nes+0.056
vel+0.055
ode+0.054
oll+0.054
ust+0.052
Neg logit contributions
 Ms-0.019
 while-0.019
 last-0.017
 Mr-0.016
 teach-0.016
Token Let
Token ID3914
Feature activation-4.94e-01

Pos logit contributions
nes+0.006
vel+0.005
oll+0.005
ode+0.005
aces+0.005
Neg logit contributions
 Ms-0.003
 while-0.003
 Mrs-0.003
 last-0.003
 Mr-0.003
Token's
Token ID338
Feature activation+6.61e-02

Pos logit contributions
nes+0.020
vel+0.020
oll+0.020
ode+0.019
aces+0.019
Neg logit contributions
 Ms-0.017
 while-0.016
 Mrs-0.016
 Mr-0.016
 last-0.015
Token tell
Token ID1560
Feature activation-6.18e-01

Pos logit contributions
 Ms+0.002
 while+0.002
 Mrs+0.001
 Mr+0.001
 Mitt+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
ust-0.003
Token the
Token ID262
Feature activation+6.99e-01

Pos logit contributions
nes+0.031
vel+0.031
oll+0.030
ode+0.030
aces+0.029
Neg logit contributions
 Ms-0.016
 while-0.014
 Mrs-0.014
 Mr-0.014
 last-0.013
Token lady
Token ID10846
Feature activation-5.62e-01

Pos logit contributions
 Ms+0.029
 while+0.028
 Mitt+0.028
 Mrs+0.026
 grace+0.026
Neg logit contributions
oll-0.020
nes-0.020
vel-0.020
ode-0.020
ust-0.018
Token what
Token ID644
Feature activation-5.09e-01

Pos logit contributions
nes+0.029
vel+0.029
oll+0.028
ode+0.028
aces+0.027
Neg logit contributions
 Ms-0.013
 while-0.012
 Mrs-0.011
 Mr-0.011
 last-0.011
Token exactly
Token ID3446
Feature activation-1.28e+00

Pos logit contributions
nes+0.042
vel+0.041
oll+0.040
ode+0.039
aces+0.039
Neg logit contributions
 Ms-0.019
 while-0.018
 Mrs-0.017
 Mr-0.017
 last-0.016
Token what
Token ID644
Feature activation-2.88e-01

Pos logit contributions
nes+0.065
oll+0.059
ode+0.057
aces+0.057
vel+0.057
Neg logit contributions
 while-0.039
 Ms-0.036
 Mr-0.034
 Mrs-0.034
 before-0.034
Token to
Token ID284
Feature activation+1.25e-01

Pos logit contributions
nes+0.015
vel+0.015
oll+0.015
ode+0.014
aces+0.014
Neg logit contributions
 Ms-0.006
 while-0.006
 Mrs-0.005
 Mr-0.005
 last-0.005
Token do
Token ID466
Feature activation-2.62e-01

Pos logit contributions
 Ms+0.004
 while+0.004
 Mrs+0.004
 Mitt+0.004
 Mr+0.004
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.005
ust-0.005
Token.
Token ID13
Feature activation+7.51e-02

Pos logit contributions
nes+0.014
vel+0.014
oll+0.014
ode+0.014
aces+0.013
Neg logit contributions
 Ms-0.005
 while-0.005
 Mrs-0.004
 Mr-0.004
 last-0.004
Token
Token ID198
Feature activation-3.34e-01

Pos logit contributions
 Ms+0.002
 while+0.002
 last+0.002
 teach+0.002
 Mrs+0.002
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
ust-0.003
Token
Token ID198
Feature activation+1.32e-01

Pos logit contributions
nes+0.018
vel+0.017
oll+0.017
ode+0.017
aces+0.017
Neg logit contributions
 Ms-0.007
 while-0.006
 Mrs-0.006
 Mr-0.006
 last-0.006
TokenHe
Token ID1544
Feature activation-7.98e-01

Pos logit contributions
 Ms+0.004
 while+0.004
 Mr+0.004
 Mrs+0.004
 last+0.004
Neg logit contributions
nes-0.006
vel-0.006
oll-0.005
ode-0.005
ust-0.005
Token smiled
Token ID13541
Feature activation+1.03e-01

Pos logit contributions
nes+0.041
vel+0.040
oll+0.039
ode+0.039
aces+0.039
Neg logit contributions
 Ms-0.020
 while-0.018
 Mrs-0.017
 last-0.017
 Mr-0.017
Token at
Token ID379
Feature activation+2.39e-01

Pos logit contributions
 Ms+0.003
 while+0.002
 Mrs+0.002
 teach+0.002
 last+0.002
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.005
ust-0.005
Token the
Token ID262
Feature activation+4.18e-01

Pos logit contributions
 Ms+0.006
 while+0.005
 teach+0.004
 Mrs+0.004
 last+0.004
Neg logit contributions
nes-0.012
vel-0.012
oll-0.011
ode-0.011
aces-0.011
Token They
Token ID1119
Feature activation+3.75e-01

Pos logit contributions
 Ms+0.007
 while+0.007
 last+0.006
 teach+0.006
 Mrs+0.006
Neg logit contributions
nes-0.009
oll-0.008
vel-0.008
aces-0.008
ode-0.008
Token see
Token ID766
Feature activation-7.71e-03

Pos logit contributions
 Ms+0.012
 while+0.011
 Mrs+0.011
 Mitt+0.010
 Mr+0.010
Neg logit contributions
nes-0.016
vel-0.015
ode-0.015
oll-0.015
aces-0.015
Token the
Token ID262
Feature activation+3.01e-01

Pos logit contributions
nes+0.000
vel+0.000
oll+0.000
ode+0.000
aces+0.000
Neg logit contributions
 Ms-0.000
 while-0.000
 Mrs-0.000
 teach-0.000
 last-0.000
Token jam
Token ID16853
Feature activation-9.56e-01

Pos logit contributions
 Ms+0.011
 while+0.010
 Mrs+0.010
 Mr+0.009
 teach+0.009
Neg logit contributions
ode-0.011
oll-0.011
vel-0.011
nes-0.011
aces-0.011
Token on
Token ID319
Feature activation+4.90e-01

Pos logit contributions
nes+0.047
vel+0.045
ust+0.044
ode+0.044
oll+0.043
Neg logit contributions
 Ms-0.025
 while-0.025
 last-0.023
 Mr-0.022
 teach-0.022
Token their
Token ID511
Feature activation-2.20e+00

Pos logit contributions
 Ms+0.009
 while+0.008
 teach+0.008
 praise+0.007
 last+0.007
Neg logit contributions
nes-0.025
vel-0.025
oll-0.024
aces-0.024
ode-0.024
Token clothes
Token ID8242
Feature activation-3.95e-01

Pos logit contributions
vel+0.112
ust+0.108
oll+0.107
nes+0.106
onel+0.100
Neg logit contributions
 last-0.060
 while-0.057
 visit-0.055
 Mr-0.055
 Ms-0.055
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
nes+0.022
vel+0.021
oll+0.021
ode+0.021
ust+0.020
Neg logit contributions
 Ms-0.008
 while-0.007
 Mrs-0.006
 Mr-0.006
 last-0.006
Token They
Token ID1119
Feature activation+2.16e-01

Pos logit contributions
 Ms+0.000
 while+0.000
 last+0.000
 teach+0.000
 Mrs+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token laugh
Token ID6487
Feature activation-1.79e-01

Pos logit contributions
 Ms+0.007
 while+0.006
 Mrs+0.006
 Mitt+0.006
 Mr+0.006
Neg logit contributions
nes-0.009
vel-0.009
ode-0.009
oll-0.009
aces-0.009
Token.
Token ID13
Feature activation+4.07e-03

Pos logit contributions
nes+0.009
vel+0.009
oll+0.009
ode+0.009
aces+0.009
Neg logit contributions
 Ms-0.004
 while-0.004
 Mrs-0.003
 Mr-0.003
 last-0.003
Token and
Token ID290
Feature activation-3.55e-01

Pos logit contributions
nes+0.018
vel+0.018
oll+0.017
ode+0.017
aces+0.017
Neg logit contributions
 Ms-0.008
 while-0.007
 Mrs-0.007
 Mr-0.007
 last-0.007
Token worried
Token ID7960
Feature activation-7.84e-01

Pos logit contributions
nes+0.017
vel+0.017
oll+0.017
ode+0.016
aces+0.016
Neg logit contributions
 Ms-0.009
 while-0.009
 Mrs-0.008
 Mr-0.008
 last-0.008
Token,
Token ID11
Feature activation-9.83e-02

Pos logit contributions
nes+0.041
ode+0.039
vel+0.039
oll+0.039
ust+0.037
Neg logit contributions
 Ms-0.019
 while-0.019
 Mr-0.017
 Mrs-0.017
 before-0.016
Token he
Token ID339
Feature activation-7.73e-01

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.005
aces+0.004
Neg logit contributions
 Ms-0.003
 while-0.002
 Mrs-0.002
 teach-0.002
 last-0.002
Token didn
Token ID1422
Feature activation+1.74e-01

Pos logit contributions
nes+0.038
vel+0.038
oll+0.037
ode+0.037
aces+0.036
Neg logit contributions
 Ms-0.019
 while-0.018
 Mr-0.016
 teach-0.016
 last-0.016
Token't
Token ID470
Feature activation-1.84e+00

Pos logit contributions
 Ms+0.002
 while+0.002
 Mrs+0.002
 Mr+0.002
 teach+0.002
Neg logit contributions
nes-0.011
vel-0.010
oll-0.010
ode-0.010
ust-0.010
Token know
Token ID760
Feature activation-6.91e-01

Pos logit contributions
oll+0.100
nes+0.097
lor+0.096
aces+0.095
ode+0.094
Neg logit contributions
 while-0.042
 teach-0.042
 last-0.036
 visit-0.035
 work-0.033
Token what
Token ID644
Feature activation-1.42e+00

Pos logit contributions
nes+0.030
vel+0.028
oll+0.027
ode+0.027
aces+0.026
Neg logit contributions
 Ms-0.023
 while-0.022
 Mrs-0.021
 Mr-0.021
 last-0.021
Token to
Token ID284
Feature activation-4.67e-01

Pos logit contributions
nes+0.089
oll+0.087
ode+0.086
vel+0.086
ust+0.084
Neg logit contributions
 while-0.022
 Ms-0.021
 teach-0.018
 Mrs-0.017
 Mr-0.017
Token do
Token ID466
Feature activation-2.01e-01

Pos logit contributions
nes+0.022
oll+0.022
vel+0.022
ode+0.021
aces+0.021
Neg logit contributions
 Ms-0.013
 while-0.012
 teach-0.012
 last-0.011
 Mrs-0.011
Token.
Token ID13
Feature activation-6.00e-01

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.010
aces+0.010
Neg logit contributions
 Ms-0.004
 while-0.004
 Mrs-0.003
 Mr-0.003
 last-0.003
Token and
Token ID290
Feature activation-5.21e-01

Pos logit contributions
nes+0.016
vel+0.015
oll+0.015
ode+0.015
aces+0.014
Neg logit contributions
 Ms-0.008
 while-0.008
 Mrs-0.007
 Mr-0.007
 last-0.007
Token she
Token ID673
Feature activation-7.70e-01

Pos logit contributions
nes+0.027
vel+0.027
oll+0.026
ode+0.026
aces+0.025
Neg logit contributions
 Ms-0.012
 while-0.011
 Mrs-0.011
 Mr-0.010
 last-0.010
Token sadly
Token ID21098
Feature activation-6.39e-01

Pos logit contributions
nes+0.039
vel+0.038
oll+0.037
ode+0.036
aces+0.036
Neg logit contributions
 Ms-0.019
 while-0.018
 last-0.017
 Mr-0.016
 Mrs-0.016
Token died
Token ID3724
Feature activation-7.23e-02

Pos logit contributions
nes+0.030
vel+0.029
oll+0.028
ode+0.028
ust+0.027
Neg logit contributions
 Ms-0.018
 while-0.018
 Mr-0.016
 Mrs-0.016
 last-0.016
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.001
 Mrs-0.001
 teach-0.001
 Mr-0.001
Token Everyone
Token ID11075
Feature activation-2.52e+00

Pos logit contributions
nes+0.061
vel+0.061
ode+0.060
oll+0.059
aky+0.057
Neg logit contributions
 Ms-0.018
 Mrs-0.015
 Mr-0.015
 Mitt-0.014
 while-0.013
Token was
Token ID373
Feature activation-2.09e+00

Pos logit contributions
aces+0.161
ode+0.161
vel+0.159
nes+0.158
oll+0.158
Neg logit contributions
 while-0.036
 last-0.032
 with-0.031
s-0.030
 the-0.030
Token very
Token ID845
Feature activation-6.63e-01

Pos logit contributions
aces+0.114
ode+0.114
nes+0.111
vel+0.108
oll+0.108
Neg logit contributions
 last-0.047
 Mr-0.046
 while-0.045
 Ms-0.045
 Mrs-0.044
Token sad
Token ID6507
Feature activation-7.45e-01

Pos logit contributions
nes+0.032
vel+0.031
oll+0.031
ode+0.031
aces+0.031
Neg logit contributions
 Ms-0.018
 while-0.017
 Mr-0.016
 Mrs-0.016
 last-0.016
Token and
Token ID290
Feature activation-6.73e-01

Pos logit contributions
nes+0.043
vel+0.041
ode+0.041
oll+0.041
aces+0.039
Neg logit contributions
 while-0.014
 Ms-0.014
 last-0.012
 Mr-0.012
 Mrs-0.011
Token wished
Token ID16555
Feature activation+8.51e-02

Pos logit contributions
nes+0.031
vel+0.030
oll+0.030
ode+0.029
aces+0.028
Neg logit contributions
 Ms-0.021
 while-0.019
 Mr-0.019
 Mrs-0.019
 last-0.018
Token,
Token ID11
Feature activation-1.61e-02

Pos logit contributions
nes+0.027
vel+0.026
oll+0.026
ode+0.026
ust+0.025
Neg logit contributions
 Ms-0.011
 while-0.010
 last-0.009
 Mrs-0.009
 teach-0.009
Token and
Token ID290
Feature activation+2.48e-01

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
aces+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 Mrs-0.000
 Mr-0.000
 teach-0.000
Token puts
Token ID7584
Feature activation-3.33e-01

Pos logit contributions
 Ms+0.007
 while+0.006
 Mrs+0.006
 Mr+0.006
 Mitt+0.006
Neg logit contributions
nes-0.012
vel-0.011
oll-0.011
ode-0.011
aces-0.011
Token it
Token ID340
Feature activation-9.09e-01

Pos logit contributions
nes+0.016
vel+0.015
oll+0.015
ode+0.015
ust+0.014
Neg logit contributions
 Ms-0.010
 while-0.009
 Mrs-0.008
 Mr-0.008
 last-0.008
Token in
Token ID287
Feature activation+6.95e-01

Pos logit contributions
nes+0.039
vel+0.038
oll+0.037
ode+0.037
ust+0.035
Neg logit contributions
 while-0.028
 Ms-0.028
 last-0.027
 before-0.025
 every-0.025
Token an
Token ID281
Feature activation-2.14e+00

Pos logit contributions
 Ms+0.020
 while+0.018
 teach+0.018
glers+0.018
 praise+0.017
Neg logit contributions
vel-0.028
nes-0.028
oll-0.027
aces-0.026
aky-0.025
Token envelope
Token ID22878
Feature activation+3.09e-01

Pos logit contributions
nes+0.079
ust+0.070
vel+0.067
emic+0.066
ode+0.065
Neg logit contributions
 last-0.088
 Ms-0.084
 teach-0.083
 praise-0.081
 Mrs-0.081
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
 Ms+0.008
 Mrs+0.006
 Mr+0.006
 while+0.006
 Mitt+0.006
Neg logit contributions
nes-0.016
vel-0.015
oll-0.015
aces-0.015
ode-0.015
Token She
Token ID1375
Feature activation+8.57e-01

Pos logit contributions
 Ms+0.006
 while+0.006
 last+0.006
 teach+0.005
 Mrs+0.005
Neg logit contributions
nes-0.007
oll-0.007
vel-0.007
aces-0.006
ode-0.006
Token writes
Token ID6797
Feature activation+9.72e-02

Pos logit contributions
 Ms+0.027
 while+0.026
 Mitt+0.024
 Mrs+0.023
 Mr+0.023
Neg logit contributions
oll-0.036
nes-0.036
vel-0.035
aces-0.035
ode-0.034
Token her
Token ID607
Feature activation+1.72e-01

Pos logit contributions
 Ms+0.002
 while+0.002
 last+0.002
 teach+0.002
 Mrs+0.002
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
aces-0.005
ode-0.004
Token man
Token ID582
Feature activation+1.73e-01

Pos logit contributions
 Ms+0.022
 while+0.020
 Mrs+0.019
 Mr+0.019
 Mitt+0.019
Neg logit contributions
nes-0.026
oll-0.026
vel-0.026
ode-0.025
aces-0.024
Token watched
Token ID7342
Feature activation+4.33e-01

Pos logit contributions
 Ms+0.004
 while+0.004
 Mrs+0.004
 Mitt+0.004
 Mr+0.004
Neg logit contributions
nes-0.009
vel-0.008
oll-0.008
ode-0.008
aces-0.008
Token as
Token ID355
Feature activation+7.32e-01

Pos logit contributions
 Ms+0.012
 while+0.011
 teach+0.011
 last+0.011
 Mrs+0.010
Neg logit contributions
nes-0.019
vel-0.019
oll-0.018
aces-0.017
ode-0.017
Token she
Token ID673
Feature activation-1.80e-01

Pos logit contributions
 Ms+0.016
 teach+0.014
 while+0.014
 visit+0.013
 last+0.012
Neg logit contributions
nes-0.037
oll-0.036
vel-0.035
aces-0.034
ust-0.033
Token skipped
Token ID26684
Feature activation+3.07e-02

Pos logit contributions
nes+0.008
vel+0.008
oll+0.008
ode+0.008
aces+0.008
Neg logit contributions
 Ms-0.005
 while-0.005
 Mrs-0.005
 Mr-0.004
 teach-0.004
Token mer
Token ID4017
Feature activation-2.02e+00

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 teach+0.001
 Mr+0.001
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
aces-0.001
ode-0.001
Tokenrily
Token ID28904
Feature activation-1.95e-01

Pos logit contributions
oll+0.066
ode+0.064
vel+0.063
nes+0.062
aces+0.059
Neg logit contributions
 while-0.080
 Ms-0.074
 grace-0.073
 every-0.073
 teach-0.071
Token away
Token ID1497
Feature activation+3.18e-01

Pos logit contributions
nes+0.010
vel+0.009
oll+0.009
ode+0.009
aces+0.009
Neg logit contributions
 Ms-0.005
 while-0.004
 Mrs-0.004
 Mr-0.004
 teach-0.004
Token into
Token ID656
Feature activation-2.02e-01

Pos logit contributions
 Ms+0.008
 Mrs+0.006
 teach+0.006
 Mr+0.006
 Mitt+0.006
Neg logit contributions
nes-0.016
vel-0.016
oll-0.015
aces-0.015
ode-0.015
Token the
Token ID262
Feature activation-1.76e-01

Pos logit contributions
nes+0.012
vel+0.011
oll+0.011
ode+0.011
aces+0.011
Neg logit contributions
 Ms-0.004
 while-0.003
 Mrs-0.003
 Mr-0.003
 teach-0.003
Token sunset
Token ID26428
Feature activation-4.75e-01

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.007
aces+0.007
Neg logit contributions
 Ms-0.006
 while-0.005
 Mrs-0.005
 Mr-0.005
 teach-0.005