TOP ACTIVATIONS
MAX = 0.085

his pet mouse , S que aky . One
park when she stumbled upon an old cob web . Mill
a little mouse named S que aky . One day ,
the grass when he heard an odd sound . It sounded
. P ips que ak started to regret it
Lily . " It 's an underground tunnel ," said her
a loud noise coming from an ig loo .
G iz mo and S que eker , were both small
one day he stumbled across an amazing pattern ! It was
big step and go on an adventure . They walked far
by himself when he heard an enormous noise . He looked
walking around when he heard an odd sound . It was
too fast and bumped into an old man .
her shoe was stuck in an icy crack in the sidewalk
. One day P ips que ak was playing with its
, Liz found him in an old dust bin . Joe
play ?" S que aky replied , " Yes
and the car twisted in an uncomfortable way . The little
. The bird lived in an ordinary tree with its family
and you love it , Sil via ! Come now ,

MIN ACTIVATIONS
MIN = -0.076

was so excited to watch the new video . But when
so excited to watch the new video . But when she
One day , he came up with an idea to scare
. One day he came up with a new game .
Benny 's friend Sally came up to him and said ,
" Don 't worry , little mouse , I will rescue
. " Don 't worry little friend , I will help
searched until she finally found the video . She was so
One day , she came up with a great idea .
park when a boy came up to her and asked if
mom . A stranger came up to them and asked for
up , then come back down to her tree .
, named Linda , came up to him and asked if
took good care of his new tablet so it wouldn 't
One day , she came up with an idea . She
he had chosen to watch the fun and easy video .
his dad for replacing his old tablet with a new one
then a big cat comes and sees the toy birds .
, a little girl came up to her , looked up
high and then come back down . It was so much

INTERVAL 0.045 - 0.085
CONTAINS 0.149%

. " A br ac ad ab ra !"
. The people around the tank were worried
he had to take a closer look . With a big
hair and the sun on his face . He liked the
, and then he had an idea too . He said

INTERVAL 0.005 - 0.045
CONTAINS 36.233%

. So Jack and Lucy had a special surprise
car penter 's garage . They saw the car penter operating
took his book and a pen and wrote , " To
dog wasn 't so bad after all ! He stayed up
be something special . They decided to wait until bed time

INTERVAL -0.035 - 0.005
CONTAINS 62.914%

It was just pretend . Let 's make some more food
there were two friends , Jack and Jill . They lived
my 's teacher told him to write his name on a
, and a hat . They make a face and a
Betsy nodded her head and Lily gently stro ked her soft

INTERVAL -0.076 - -0.035
CONTAINS 0.704%

and a ball in the backyard . Lily was very good
T eddy took the medicine and felt much better
and towers fall down . The tubes are broken and bent
One day , Lily 's mom told her they were going
smiled and gave him a big hug . From that day
Token his
Token ID465
Feature activation-3.91e-03

Pos logit contributions
nes+0.026
vel+0.025
oll+0.024
ode+0.024
ust+0.023
Neg logit contributions
 Ms-0.014
 while-0.013
 Mr-0.012
 last-0.012
 Mrs-0.012
Token pet
Token ID4273
Feature activation-6.21e-03

Pos logit contributions
 Ms+0.005
 while+0.005
 Mrs+0.005
 teach+0.005
 last+0.005
Neg logit contributions
nes-0.006
oll-0.006
vel-0.006
ode-0.006
aces-0.006
Token mouse
Token ID10211
Feature activation-4.29e-04

Pos logit contributions
 Ms+0.007
 while+0.007
 last+0.007
 Mrs+0.006
 teach+0.006
Neg logit contributions
nes-0.011
oll-0.011
vel-0.011
ode-0.010
aces-0.010
Token,
Token ID11
Feature activation+7.95e-03

Pos logit contributions
 Ms+0.000
 while+0.000
 last+0.000
 Mrs+0.000
 teach+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token S
Token ID311
Feature activation+1.49e-02

Pos logit contributions
nes+0.017
vel+0.016
oll+0.016
ode+0.015
ust+0.015
Neg logit contributions
 Ms-0.008
 while-0.007
 last-0.007
 Mrs-0.007
 teach-0.007
Tokenque
Token ID4188
Feature activation+8.47e-02

Pos logit contributions
nes+0.025
vel+0.025
oll+0.023
ode+0.023
ust+0.022
Neg logit contributions
 Ms-0.020
 while-0.019
 last-0.019
 teach-0.018
 Mrs-0.018
Tokenaky
Token ID15492
Feature activation-2.09e-02

Pos logit contributions
nes+0.091
aund+0.080
oll+0.080
lor+0.080
ode+0.072
Neg logit contributions
 while-0.154
 tired-0.141
 Mitt-0.141
 when-0.141
 with-0.138
Token.
Token ID13
Feature activation-1.52e-03

Pos logit contributions
 Ms+0.022
 Mrs+0.018
 Mr+0.018
 last+0.018
 teach+0.017
Neg logit contributions
nes-0.042
vel-0.041
aces-0.040
oll-0.040
aky-0.039
Token
Token ID220
Feature activation+1.54e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 teach+0.001
 Mrs+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
aces-0.003
Token
Token ID198
Feature activation-1.37e-03

Pos logit contributions
nes+0.036
vel+0.036
oll+0.035
ode+0.035
ust+0.033
Neg logit contributions
 Ms-0.011
 while-0.010
 last-0.009
 Mrs-0.009
 Mr-0.009
TokenOne
Token ID3198
Feature activation-7.51e-03

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 teach+0.001
 Mrs+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
ust-0.003
Token park
Token ID3952
Feature activation-4.24e-02

Pos logit contributions
nes+0.067
vel+0.067
aces+0.065
oll+0.064
onel+0.061
Neg logit contributions
 Ms-0.033
 last-0.030
 visit-0.030
 Mr-0.030
 Mrs-0.029
Token when
Token ID618
Feature activation-1.69e-02

Pos logit contributions
 Ms+0.045
glers+0.041
 Mr+0.038
 praise+0.038
 Mitt+0.037
Neg logit contributions
adow-0.078
nes-0.077
vel-0.076
per-0.068
ust-0.068
Token she
Token ID673
Feature activation-1.09e-02

Pos logit contributions
 Ms+0.018
 teach+0.017
 praise+0.016
 while+0.015
 last+0.015
Neg logit contributions
nes-0.031
vel-0.030
oll-0.029
ode-0.027
ust-0.027
Token stumbled
Token ID24241
Feature activation-2.37e-02

Pos logit contributions
 Ms+0.013
 teach+0.012
 while+0.011
 last+0.011
 Mrs+0.011
Neg logit contributions
nes-0.020
vel-0.019
oll-0.019
ode-0.018
aky-0.018
Token upon
Token ID2402
Feature activation+1.73e-03

Pos logit contributions
 Ms+0.032
 teach+0.029
 praise+0.028
 Mrs+0.028
 Mr+0.027
Neg logit contributions
nes-0.041
vel-0.040
oll-0.040
aky-0.038
aces-0.038
Token an
Token ID281
Feature activation+8.38e-02

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
ust+0.003
Neg logit contributions
 Ms-0.002
 while-0.001
 teach-0.001
 last-0.001
 Mrs-0.001
Token old
Token ID1468
Feature activation+2.31e-02

Pos logit contributions
nes+0.198
vel+0.194
lor+0.194
emic+0.176
ode+0.176
Neg logit contributions
 last-0.067
 Mr-0.062
 Mrs-0.062
 Ms-0.061
 elderly-0.061
Token cob
Token ID22843
Feature activation+2.82e-02

Pos logit contributions
nes+0.046
vel+0.045
oll+0.044
ust+0.043
ode+0.042
Neg logit contributions
 Ms-0.026
 while-0.025
 last-0.023
 teach-0.023
 Mr-0.023
Tokenweb
Token ID12384
Feature activation-4.12e-03

Pos logit contributions
nes+0.033
vel+0.032
ust+0.030
ode+0.029
oll+0.028
Neg logit contributions
 while-0.050
 Ms-0.050
 last-0.050
 teach-0.048
 before-0.047
Token.
Token ID13
Feature activation+2.91e-03

Pos logit contributions
 Ms+0.004
 while+0.004
 Mrs+0.003
 Mr+0.003
 teach+0.003
Neg logit contributions
nes-0.008
vel-0.008
oll-0.008
ode-0.008
aces-0.008
Token Mill
Token ID9212
Feature activation+2.90e-02

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.005
ust+0.005
Neg logit contributions
 Ms-0.003
 while-0.003
 last-0.003
 teach-0.003
 Mrs-0.003
Token a
Token ID257
Feature activation+9.83e-04

Pos logit contributions
 Ms+0.000
 while+0.000
 last+0.000
 teach+0.000
 Mrs+0.000
Neg logit contributions
nes-0.001
vel-0.000
oll-0.000
ode-0.000
ust-0.000
Token little
Token ID1310
Feature activation+6.23e-03

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ode+0.002
ust+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 last-0.001
 teach-0.001
 Mrs-0.001
Token mouse
Token ID10211
Feature activation-8.79e-04

Pos logit contributions
nes+0.012
vel+0.011
oll+0.011
ode+0.011
ust+0.011
Neg logit contributions
 Ms-0.007
 while-0.007
 last-0.007
 teach-0.006
 Mrs-0.006
Token named
Token ID3706
Feature activation-8.00e-03

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 Mrs+0.001
 teach+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
ust-0.002
Token S
Token ID311
Feature activation+1.89e-02

Pos logit contributions
 Ms+0.008
 while+0.008
 last+0.007
 teach+0.007
 praise+0.007
Neg logit contributions
nes-0.016
vel-0.015
oll-0.015
ode-0.015
ust-0.014
Tokenque
Token ID4188
Feature activation+8.12e-02

Pos logit contributions
nes+0.033
vel+0.032
oll+0.031
ode+0.030
ust+0.030
Neg logit contributions
 Ms-0.025
 last-0.023
 while-0.023
 teach-0.022
 Mrs-0.022
Tokenaky
Token ID15492
Feature activation-3.82e-02

Pos logit contributions
nes+0.089
lor+0.077
oll+0.076
ode+0.071
aund+0.068
Neg logit contributions
 while-0.144
 last-0.135
 when-0.133
 Ms-0.132
 tired-0.131
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
 Ms+0.047
 Mrs+0.039
 praise+0.039
Thank+0.038
 Mr+0.037
Neg logit contributions
aces-0.064
vel-0.063
nes-0.063
oll-0.062
ode-0.062
Token One
Token ID1881
Feature activation+9.57e-05

Pos logit contributions
 Ms+0.010
 last+0.009
 while+0.009
 teach+0.009
 praise+0.009
Neg logit contributions
nes-0.020
oll-0.019
vel-0.019
ode-0.018
aces-0.018
Token day
Token ID1110
Feature activation-3.16e-02

Pos logit contributions
nes+0.000
vel+0.000
oll+0.000
ust+0.000
ode+0.000
Neg logit contributions
 Ms-0.000
 while-0.000
 teach-0.000
 Mrs-0.000
 last-0.000
Token,
Token ID11
Feature activation-3.50e-02

Pos logit contributions
 Ms+0.036
 praise+0.034
 teach+0.034
 last+0.032
 hire+0.032
Neg logit contributions
oll-0.055
vel-0.054
nes-0.052
aces-0.050
aund-0.048
Token the
Token ID262
Feature activation+1.83e-02

Pos logit contributions
 Ms+0.009
 while+0.007
 teach+0.007
 last+0.006
 Mrs+0.006
Neg logit contributions
nes-0.023
vel-0.022
oll-0.021
ode-0.021
aces-0.021
Token grass
Token ID8701
Feature activation-3.32e-02

Pos logit contributions
vel+0.035
nes+0.035
oll+0.035
aces+0.033
onel+0.033
Neg logit contributions
 Ms-0.021
 while-0.020
 last-0.020
 Mr-0.019
 Mrs-0.019
Token when
Token ID618
Feature activation-1.92e-02

Pos logit contributions
 Ms+0.037
 praise+0.032
 teach+0.032
 Mrs+0.032
 Mitt+0.030
Neg logit contributions
oll-0.061
vel-0.061
adow-0.060
nes-0.060
aces-0.060
Token he
Token ID339
Feature activation-2.04e-02

Pos logit contributions
 Ms+0.022
 teach+0.021
 praise+0.020
 while+0.019
 Mitt+0.019
Neg logit contributions
nes-0.033
vel-0.033
oll-0.032
ust-0.031
ode-0.030
Token heard
Token ID2982
Feature activation+8.50e-03

Pos logit contributions
 Ms+0.027
 teach+0.024
 praise+0.024
 Mrs+0.023
 last+0.023
Neg logit contributions
nes-0.034
vel-0.033
oll-0.032
ode-0.031
aky-0.031
Token an
Token ID281
Feature activation+8.06e-02

Pos logit contributions
nes+0.019
vel+0.019
oll+0.018
ode+0.018
ust+0.018
Neg logit contributions
 Ms-0.007
 while-0.006
 last-0.006
 Mrs-0.006
 teach-0.006
Token odd
Token ID5629
Feature activation+2.70e-02

Pos logit contributions
nes+0.183
lor+0.179
vel+0.173
emic+0.162
ode+0.160
Neg logit contributions
 Ms-0.070
 teach-0.069
 Mrs-0.068
 last-0.068
 Mr-0.066
Token sound
Token ID2128
Feature activation+1.49e-03

Pos logit contributions
nes+0.064
oll+0.060
vel+0.059
ode+0.058
aces+0.058
Neg logit contributions
 while-0.022
 Ms-0.021
 teach-0.019
 last-0.019
 Mr-0.018
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
nes+0.003
vel+0.003
oll+0.003
ode+0.003
ust+0.003
Neg logit contributions
 Ms-0.001
 while-0.001
 last-0.001
 Mrs-0.001
 teach-0.001
Token It
Token ID632
Feature activation-1.09e-02

Pos logit contributions
nes+0.022
vel+0.022
oll+0.021
ode+0.020
ust+0.020
Neg logit contributions
 Ms-0.015
 while-0.014
 Mrs-0.014
 last-0.013
 Mr-0.013
Token sounded
Token ID14846
Feature activation-2.75e-02

Pos logit contributions
 Ms+0.013
 while+0.012
 last+0.011
 teach+0.011
 praise+0.011
Neg logit contributions
nes-0.019
vel-0.019
oll-0.018
ust-0.018
ode-0.017
Token.
Token ID13
Feature activation-5.73e-03

Pos logit contributions
nes+0.012
vel+0.012
oll+0.012
ode+0.011
ust+0.011
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.009
 teach-0.009
 Mrs-0.009
Token
Token ID198
Feature activation-9.69e-03

Pos logit contributions
 Ms+0.006
 while+0.005
 last+0.005
 teach+0.005
 Mrs+0.005
Neg logit contributions
nes-0.011
vel-0.011
oll-0.011
aces-0.010
ode-0.010
Token
Token ID198
Feature activation-1.70e-02

Pos logit contributions
 Ms+0.010
 while+0.009
 last+0.009
 teach+0.008
 Mrs+0.008
Neg logit contributions
nes-0.020
vel-0.019
oll-0.019
ust-0.018
ode-0.017
TokenP
Token ID47
Feature activation+1.66e-02

Pos logit contributions
 Ms+0.024
 last+0.023
 Mrs+0.022
 praise+0.021
 while+0.021
Neg logit contributions
vel-0.028
nes-0.028
ust-0.026
oll-0.025
aky-0.025
Tokenips
Token ID2419
Feature activation+4.22e-02

Pos logit contributions
nes+0.023
vel+0.021
oll+0.020
ode+0.020
aces+0.019
Neg logit contributions
 Ms-0.028
 while-0.026
 last-0.026
 teach-0.025
 Mr-0.025
Tokenque
Token ID4188
Feature activation+8.02e-02

Pos logit contributions
vel+0.085
ode+0.084
nes+0.082
oll+0.081
ust+0.079
Neg logit contributions
 while-0.043
 Ms-0.042
 last-0.042
 before-0.038
 Mr-0.038
Tokenak
Token ID461
Feature activation-2.07e-02

Pos logit contributions
oll+0.075
nes+0.074
alo+0.069
aund+0.064
lor+0.060
Neg logit contributions
 while-0.144
 last-0.134
 Ms-0.132
 first-0.132
 with-0.131
Token started
Token ID2067
Feature activation-1.32e-02

Pos logit contributions
 Ms+0.026
 Mrs+0.022
 teach+0.022
 Mr+0.022
 last+0.021
Neg logit contributions
nes-0.037
vel-0.036
aces-0.035
oll-0.034
onel-0.034
Token to
Token ID284
Feature activation-7.37e-03

Pos logit contributions
 Ms+0.014
 last+0.012
 teach+0.012
 while+0.012
 Mrs+0.012
Neg logit contributions
vel-0.024
nes-0.024
oll-0.023
ode-0.023
aces-0.022
Token regret
Token ID13721
Feature activation+1.91e-02

Pos logit contributions
 Ms+0.009
 while+0.008
 last+0.008
 Mrs+0.008
 Mr+0.007
Neg logit contributions
nes-0.013
vel-0.013
ode-0.012
oll-0.012
ust-0.012
Token it
Token ID340
Feature activation-5.99e-03

Pos logit contributions
nes+0.039
oll+0.037
vel+0.037
ode+0.037
ust+0.036
Neg logit contributions
 Ms-0.020
 while-0.019
 Mr-0.017
 last-0.017
 Mrs-0.017
Token Lily
Token ID20037
Feature activation+5.60e-04

Pos logit contributions
 Ms+0.021
 last+0.019
 while+0.019
 teach+0.019
 visit+0.018
Neg logit contributions
nes-0.027
oll-0.025
vel-0.025
aces-0.024
aky-0.023
Token.
Token ID13
Feature activation-5.44e-03

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
ust+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 last-0.000
 teach-0.000
 Mrs-0.000
Token "
Token ID366
Feature activation+3.31e-02

Pos logit contributions
 Ms+0.006
 last+0.005
 while+0.005
 teach+0.005
 praise+0.005
Neg logit contributions
nes-0.011
vel-0.010
oll-0.010
aces-0.010
ode-0.010
TokenIt
Token ID1026
Feature activation+9.05e-03

Pos logit contributions
oll+0.065
nes+0.064
vel+0.063
aces+0.063
ode+0.060
Neg logit contributions
 while-0.035
 Ms-0.035
Thank-0.030
 Mrs-0.030
 visit-0.030
Token's
Token ID338
Feature activation+1.32e-02

Pos logit contributions
nes+0.015
vel+0.015
oll+0.015
ode+0.014
ust+0.014
Neg logit contributions
 Ms-0.012
 while-0.012
 last-0.011
 teach-0.011
 Mr-0.011
Token an
Token ID281
Feature activation+7.99e-02

Pos logit contributions
nes+0.029
vel+0.028
oll+0.028
ode+0.027
ust+0.026
Neg logit contributions
 Ms-0.012
 while-0.011
 last-0.011
 Mr-0.011
 Mrs-0.010
Token underground
Token ID11447
Feature activation+1.90e-02

Pos logit contributions
nes+0.168
vel+0.151
emic+0.148
lor+0.146
ode+0.143
Neg logit contributions
 last-0.090
 work-0.079
 Mr-0.078
 while-0.078
 when-0.076
Token tunnel
Token ID13275
Feature activation-1.00e-02

Pos logit contributions
nes+0.031
vel+0.030
oll+0.030
aces+0.027
ust+0.027
Neg logit contributions
 while-0.027
 Ms-0.027
 last-0.026
 teach-0.025
 visit-0.025
Token,"
Token ID553
Feature activation+5.04e-03

Pos logit contributions
 Ms+0.012
 Mrs+0.010
 while+0.010
 Mr+0.010
 teach+0.009
Neg logit contributions
nes-0.019
vel-0.019
oll-0.018
aces-0.018
ode-0.018
Token said
Token ID531
Feature activation+2.89e-04

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.011
ust+0.010
Neg logit contributions
 Ms-0.004
 while-0.004
 last-0.003
 teach-0.003
 Mrs-0.003
Token her
Token ID607
Feature activation+1.62e-02

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
aces+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 last-0.000
 teach-0.000
 Mrs-0.000
Token a
Token ID257
Feature activation+1.65e-02

Pos logit contributions
nes+0.007
vel+0.007
oll+0.006
ode+0.006
ust+0.006
Neg logit contributions
 Ms-0.003
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token loud
Token ID7812
Feature activation+3.04e-02

Pos logit contributions
nes+0.036
oll+0.035
vel+0.035
aces+0.034
ode+0.034
Neg logit contributions
 Ms-0.015
 while-0.014
 teach-0.013
 last-0.013
 Mr-0.013
Token noise
Token ID7838
Feature activation-7.25e-03

Pos logit contributions
nes+0.069
oll+0.068
vel+0.065
ust+0.065
ode+0.065
Neg logit contributions
 Ms-0.027
 while-0.026
 teach-0.024
 Mr-0.023
 Mrs-0.023
Token coming
Token ID2406
Feature activation-8.14e-03

Pos logit contributions
 Ms+0.008
 Mrs+0.007
 Mr+0.007
 while+0.007
 last+0.007
Neg logit contributions
nes-0.013
vel-0.013
oll-0.013
ode-0.012
ust-0.012
Token from
Token ID422
Feature activation-1.08e-02

Pos logit contributions
 Ms+0.009
 while+0.008
 teach+0.007
 Mrs+0.007
 last+0.007
Neg logit contributions
nes-0.016
vel-0.015
oll-0.015
ust-0.015
ode-0.014
Token an
Token ID281
Feature activation+7.94e-02

Pos logit contributions
 Ms+0.011
 while+0.009
 teach+0.009
 Mrs+0.008
 Mr+0.008
Neg logit contributions
nes-0.021
vel-0.021
oll-0.020
ode-0.020
aces-0.020
Token ig
Token ID45329
Feature activation+1.86e-02

Pos logit contributions
vel+0.190
nes+0.185
lor+0.176
ust+0.175
oll+0.171
Neg logit contributions
 last-0.076
 Ms-0.067
 Mrs-0.067
 the-0.066
 Mr-0.065
Tokenloo
Token ID29680
Feature activation-1.88e-02

Pos logit contributions
nes+0.023
vel+0.023
oll+0.021
ust+0.020
aces+0.020
Neg logit contributions
 Ms-0.033
 last-0.032
 while-0.032
 teach-0.031
 Mr-0.030
Token.
Token ID13
Feature activation+2.94e-02

Pos logit contributions
 Ms+0.021
 Mrs+0.018
 Mitt+0.017
 Mr+0.017
 while+0.017
Neg logit contributions
nes-0.036
vel-0.035
oll-0.034
ode-0.033
ust-0.033
Token
Token ID198
Feature activation+5.63e-03

Pos logit contributions
nes+0.065
vel+0.064
oll+0.063
ode+0.063
ust+0.061
Neg logit contributions
 Ms-0.027
 while-0.025
 Mr-0.023
 Mrs-0.022
 last-0.021
Token
Token ID198
Feature activation+1.63e-02

Pos logit contributions
nes+0.013
vel+0.012
oll+0.012
ode+0.012
ust+0.012
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token G
Token ID402
Feature activation+2.46e-02

Pos logit contributions
vel+0.032
nes+0.032
oll+0.031
ode+0.030
onel+0.029
Neg logit contributions
 Ms-0.015
 while-0.014
 Mr-0.013
 Mrs-0.013
 last-0.013
Tokeniz
Token ID528
Feature activation+1.50e-02

Pos logit contributions
nes+0.036
vel+0.035
oll+0.033
ode+0.032
ust+0.031
Neg logit contributions
 Ms-0.037
 last-0.036
 while-0.034
 teach-0.034
 before-0.033
Tokenmo
Token ID5908
Feature activation+3.59e-03

Pos logit contributions
nes+0.030
vel+0.030
oll+0.029
ode+0.028
ust+0.028
Neg logit contributions
 Ms-0.015
 while-0.014
 last-0.014
 teach-0.013
 Mrs-0.013
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.004
 while-0.003
 last-0.003
 Mrs-0.003
 teach-0.003
Token S
Token ID311
Feature activation+2.63e-02

Pos logit contributions
nes+0.024
vel+0.024
oll+0.024
ode+0.023
ust+0.023
Neg logit contributions
 Ms-0.010
 while-0.009
 Mrs-0.008
 last-0.008
 Mr-0.008
Tokenque
Token ID4188
Feature activation+7.85e-02

Pos logit contributions
nes+0.042
vel+0.041
oll+0.038
ode+0.036
ust+0.036
Neg logit contributions
 Ms-0.038
 last-0.037
 teach-0.036
 while-0.035
 Mrs-0.034
Tokeneker
Token ID28233
Feature activation-1.87e-02

Pos logit contributions
lor+0.080
nes+0.080
oll+0.076
alo+0.071
aund+0.068
Neg logit contributions
 while-0.141
 last-0.134
 work-0.132
 every-0.128
 past-0.128
Token,
Token ID11
Feature activation+3.42e-03

Pos logit contributions
 Ms+0.019
 Mrs+0.016
 Mr+0.015
 Mitt+0.014
 while+0.014
Neg logit contributions
nes-0.038
vel-0.037
aces-0.037
oll-0.036
ode-0.035
Token were
Token ID547
Feature activation-1.52e-02

Pos logit contributions
nes+0.007
vel+0.006
oll+0.006
ode+0.006
ust+0.006
Neg logit contributions
 Ms-0.004
 while-0.004
 last-0.003
 teach-0.003
 Mrs-0.003
Token both
Token ID1111
Feature activation-6.48e-03

Pos logit contributions
 Ms+0.019
 while+0.017
 last+0.017
 praise+0.016
 teach+0.016
Neg logit contributions
nes-0.027
vel-0.026
oll-0.025
aces-0.024
ust-0.024
Token small
Token ID1402
Feature activation-7.34e-03

Pos logit contributions
 Ms+0.009
 while+0.008
 last+0.008
 Mrs+0.008
 teach+0.007
Neg logit contributions
nes-0.011
vel-0.011
oll-0.010
aces-0.010
ust-0.010
Token one
Token ID530
Feature activation+3.26e-03

Pos logit contributions
 Ms+0.003
 teach+0.002
 while+0.002
 Mrs+0.002
 last+0.002
Neg logit contributions
nes-0.008
vel-0.008
oll-0.008
ode-0.008
aces-0.008
Token day
Token ID1110
Feature activation-1.48e-02

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.006
ust+0.006
Neg logit contributions
 Ms-0.003
 while-0.003
 last-0.003
 teach-0.003
 Mrs-0.003
Token he
Token ID339
Feature activation+1.05e-02

Pos logit contributions
 Ms+0.015
 teach+0.013
 praise+0.012
 last+0.012
 Mrs+0.012
Neg logit contributions
oll-0.029
vel-0.029
nes-0.028
aces-0.027
ust-0.027
Token stumbled
Token ID24241
Feature activation-1.52e-02

Pos logit contributions
nes+0.019
vel+0.019
oll+0.018
ode+0.018
aces+0.018
Neg logit contributions
 Ms-0.013
 while-0.012
 last-0.012
 teach-0.011
 Mr-0.011
Token across
Token ID1973
Feature activation-5.02e-03

Pos logit contributions
 Ms+0.019
 teach+0.017
 praise+0.016
 Mrs+0.016
 Mr+0.016
Neg logit contributions
nes-0.028
vel-0.027
oll-0.027
aces-0.026
aky-0.026
Token an
Token ID281
Feature activation+7.84e-02

Pos logit contributions
 Ms+0.005
 while+0.004
 teach+0.004
 Mrs+0.004
 Mr+0.004
Neg logit contributions
nes-0.010
vel-0.010
oll-0.010
ode-0.010
aces-0.010
Token amazing
Token ID4998
Feature activation+2.22e-02

Pos logit contributions
nes+0.194
vel+0.191
lor+0.188
emic+0.181
ust+0.174
Neg logit contributions
 last-0.058
 Ms-0.044
 Mr-0.043
 while-0.042
 Mrs-0.042
Token pattern
Token ID3912
Feature activation+5.27e-03

Pos logit contributions
nes+0.044
oll+0.043
vel+0.042
ust+0.040
aces+0.040
Neg logit contributions
 while-0.025
 Ms-0.025
 last-0.023
 teach-0.023
 visit-0.023
Token!
Token ID0
Feature activation+1.84e-02

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.010
ust+0.010
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token It
Token ID632
Feature activation+3.00e-03

Pos logit contributions
nes+0.038
vel+0.038
oll+0.036
ode+0.036
ust+0.035
Neg logit contributions
 Ms-0.020
 while-0.017
 Mr-0.017
 Mrs-0.017
 last-0.016
Token was
Token ID373
Feature activation-2.27e-02

Pos logit contributions
nes+0.007
vel+0.006
oll+0.006
ode+0.006
ust+0.006
Neg logit contributions
 Ms-0.003
 while-0.002
 last-0.002
 Mrs-0.002
 teach-0.002
Token big
Token ID1263
Feature activation+1.50e-02

Pos logit contributions
nes+0.039
vel+0.038
oll+0.037
aces+0.037
ode+0.036
Neg logit contributions
 Ms-0.019
 while-0.017
 teach-0.017
 last-0.016
 Mr-0.016
Token step
Token ID2239
Feature activation+4.66e-03

Pos logit contributions
nes+0.025
vel+0.024
oll+0.024
aces+0.023
ust+0.023
Neg logit contributions
 Ms-0.021
 while-0.021
 last-0.020
 teach-0.020
 Mr-0.019
Token and
Token ID290
Feature activation-1.11e-02

Pos logit contributions
nes+0.010
vel+0.009
oll+0.009
ode+0.009
ust+0.009
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token go
Token ID467
Feature activation-1.91e-02

Pos logit contributions
 Ms+0.011
 while+0.009
 last+0.009
 Mrs+0.009
 Mr+0.009
Neg logit contributions
nes-0.022
vel-0.021
oll-0.021
ode-0.020
aces-0.020
Token on
Token ID319
Feature activation-3.92e-04

Pos logit contributions
 Ms+0.028
 Mrs+0.025
 Mr+0.024
 praise+0.023
 teach+0.022
Neg logit contributions
nes-0.030
ode-0.027
vel-0.027
aky-0.026
aces-0.026
Token an
Token ID281
Feature activation+7.83e-02

Pos logit contributions
 Ms+0.000
 while+0.000
 teach+0.000
 last+0.000
 Mrs+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
ust-0.001
Token adventure
Token ID8855
Feature activation-9.13e-03

Pos logit contributions
nes+0.165
vel+0.164
onel+0.153
emic+0.153
lor+0.151
Neg logit contributions
 last-0.082
 teach-0.073
 visit-0.070
 every-0.069
 while-0.068
Token.
Token ID13
Feature activation-1.91e-02

Pos logit contributions
 Ms+0.010
 Mrs+0.008
 Mr+0.008
 teach+0.008
 praise+0.008
Neg logit contributions
nes-0.019
vel-0.018
oll-0.017
ust-0.017
aces-0.017
Token They
Token ID1119
Feature activation+2.12e-02

Pos logit contributions
 Ms+0.021
 last+0.020
 teach+0.020
 praise+0.019
 while+0.019
Neg logit contributions
nes-0.034
vel-0.031
oll-0.031
aces-0.030
ust-0.030
Token walked
Token ID6807
Feature activation-1.75e-02

Pos logit contributions
vel+0.044
nes+0.043
oll+0.043
ode+0.041
ust+0.041
Neg logit contributions
 Ms-0.022
 while-0.021
 last-0.020
 Mr-0.019
 Mrs-0.019
Token far
Token ID1290
Feature activation-5.53e-03

Pos logit contributions
 Ms+0.023
 teach+0.020
 Mrs+0.020
 Mr+0.019
 praise+0.019
Neg logit contributions
nes-0.030
vel-0.027
aces-0.027
oll-0.026
aky-0.026
Token by
Token ID416
Feature activation+1.42e-02

Pos logit contributions
 Ms+0.022
glers+0.020
 teach+0.019
 praise+0.019
 Mrs+0.019
Neg logit contributions
vel-0.037
nes-0.036
aky-0.033
ode-0.033
oll-0.033
Token himself
Token ID2241
Feature activation-1.32e-02

Pos logit contributions
nes+0.034
oll+0.033
vel+0.033
ode+0.032
ust+0.032
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.008
 Mr-0.008
 Mrs-0.007
Token when
Token ID618
Feature activation-1.70e-02

Pos logit contributions
 Ms+0.014
 Mrs+0.012
 praise+0.011
 teach+0.011
 Mitt+0.011
Neg logit contributions
vel-0.026
nes-0.026
oll-0.025
ode-0.025
ust-0.025
Token he
Token ID339
Feature activation-2.43e-02

Pos logit contributions
 Ms+0.020
 teach+0.018
 praise+0.017
 while+0.016
 last+0.016
Neg logit contributions
nes-0.030
vel-0.029
oll-0.029
ode-0.027
aces-0.027
Token heard
Token ID2982
Feature activation+1.47e-02

Pos logit contributions
 Ms+0.033
 praise+0.031
 teach+0.031
 Mrs+0.029
 while+0.029
Neg logit contributions
nes-0.039
oll-0.038
vel-0.037
aky-0.036
ode-0.035
Token an
Token ID281
Feature activation+7.81e-02

Pos logit contributions
nes+0.034
vel+0.033
oll+0.033
ode+0.032
ust+0.031
Neg logit contributions
 Ms-0.012
 while-0.010
 Mrs-0.010
 Mr-0.010
 last-0.010
Token enormous
Token ID9812
Feature activation+2.17e-02

Pos logit contributions
nes+0.173
lor+0.169
vel+0.166
ode+0.157
emic+0.154
Neg logit contributions
 teach-0.070
 Ms-0.068
 Mr-0.066
 last-0.065
 Mrs-0.065
Token noise
Token ID7838
Feature activation-8.32e-03

Pos logit contributions
nes+0.044
oll+0.042
vel+0.042
ust+0.041
aces+0.040
Neg logit contributions
 Ms-0.024
 while-0.023
 teach-0.022
 last-0.021
 Mr-0.021
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
 Ms+0.008
 while+0.007
 Mrs+0.007
 last+0.007
 Mitt+0.007
Neg logit contributions
vel-0.017
nes-0.017
oll-0.016
ode-0.015
ust-0.015
Token He
Token ID679
Feature activation+2.89e-02

Pos logit contributions
nes+0.025
vel+0.025
oll+0.024
ode+0.024
ust+0.023
Neg logit contributions
 Ms-0.018
 while-0.017
 Mr-0.016
 Mrs-0.016
 last-0.016
Token looked
Token ID3114
Feature activation-4.76e-03

Pos logit contributions
nes+0.065
vel+0.064
oll+0.063
aces+0.061
ode+0.060
Neg logit contributions
 Ms-0.025
 while-0.024
 Mr-0.022
 last-0.022
 visit-0.021
Token walking
Token ID6155
Feature activation-1.48e-02

Pos logit contributions
nes+0.018
vel+0.017
oll+0.017
ode+0.017
ust+0.017
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.009
 teach-0.009
 Mrs-0.009
Token around
Token ID1088
Feature activation-2.38e-02

Pos logit contributions
 Ms+0.023
 Mrs+0.021
 teach+0.020
 praise+0.020
 Mr+0.020
Neg logit contributions
nes-0.021
vel-0.020
oll-0.020
aky-0.020
aces-0.019
Token when
Token ID618
Feature activation-2.47e-02

Pos logit contributions
 Ms+0.021
glers+0.020
 teach+0.019
 praise+0.018
 kiss+0.018
Neg logit contributions
vel-0.046
nes-0.046
aky-0.042
adow-0.042
onel-0.041
Token he
Token ID339
Feature activation-2.34e-02

Pos logit contributions
 Ms+0.030
 teach+0.029
 praise+0.027
 kiss+0.025
 last+0.025
Neg logit contributions
nes-0.042
oll-0.039
vel-0.039
ust-0.037
onel-0.037
Token heard
Token ID2982
Feature activation+1.58e-02

Pos logit contributions
 Ms+0.032
 teach+0.030
 praise+0.030
 Mrs+0.028
 last+0.027
Neg logit contributions
nes-0.037
vel-0.037
oll-0.036
ode-0.035
aky-0.035
Token an
Token ID281
Feature activation+7.77e-02

Pos logit contributions
nes+0.037
vel+0.036
oll+0.035
ode+0.035
ust+0.034
Neg logit contributions
 Ms-0.012
 while-0.011
 Mrs-0.010
 Mr-0.010
 last-0.010
Token odd
Token ID5629
Feature activation+2.55e-02

Pos logit contributions
nes+0.167
lor+0.163
vel+0.160
ode+0.149
emic+0.149
Neg logit contributions
 Ms-0.074
 teach-0.073
 last-0.071
 Mrs-0.070
 Mr-0.069
Token sound
Token ID2128
Feature activation-1.59e-03

Pos logit contributions
nes+0.059
oll+0.056
vel+0.056
ode+0.054
aces+0.054
Neg logit contributions
 Ms-0.021
 while-0.021
 teach-0.019
 last-0.018
 Mr-0.017
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
 Ms+0.002
 while+0.001
 last+0.001
 Mrs+0.001
 teach+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
ust-0.003
Token It
Token ID632
Feature activation-1.27e-02

Pos logit contributions
nes+0.043
vel+0.042
oll+0.040
ode+0.040
ust+0.039
Neg logit contributions
 Ms-0.026
 while-0.023
 Mr-0.023
 Mrs-0.023
 Mitt-0.021
Token was
Token ID373
Feature activation-1.27e-02

Pos logit contributions
 Ms+0.014
 while+0.013
 last+0.012
 teach+0.012
 praise+0.012
Neg logit contributions
nes-0.023
vel-0.023
oll-0.022
ust-0.022
onel-0.021
Token too
Token ID1165
Feature activation-1.92e-02

Pos logit contributions
 Ms+0.035
 praise+0.031
 teach+0.031
 Mrs+0.030
 kiss+0.030
Neg logit contributions
nes-0.044
vel-0.040
aces-0.037
ust-0.036
oll-0.036
Token fast
Token ID3049
Feature activation+3.67e-03

Pos logit contributions
 Ms+0.022
 teach+0.018
 Mrs+0.018
 praise+0.017
 Mr+0.017
Neg logit contributions
nes-0.038
vel-0.037
oll-0.036
ust-0.035
aces-0.034
Token and
Token ID290
Feature activation-4.48e-02

Pos logit contributions
nes+0.008
vel+0.008
oll+0.008
ode+0.007
aces+0.007
Neg logit contributions
 Ms-0.003
 while-0.003
 last-0.003
 Mrs-0.003
 teach-0.003
Token bumped
Token ID38034
Feature activation+3.79e-03

Pos logit contributions
 Ms+0.060
 particular+0.056
 praise+0.052
 hire+0.052
 Mrs+0.051
Neg logit contributions
nes-0.071
aces-0.058
onel-0.054
vel-0.054
oll-0.054
Token into
Token ID656
Feature activation+4.34e-04

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.004
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token an
Token ID281
Feature activation+7.74e-02

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
aces+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 last-0.000
 teach-0.000
 Mrs-0.000
Token old
Token ID1468
Feature activation+1.39e-02

Pos logit contributions
nes+0.179
vel+0.174
lor+0.169
emic+0.164
ode+0.162
Neg logit contributions
 last-0.067
 Ms-0.061
 Mr-0.060
 Mrs-0.059
 elderly-0.057
Token man
Token ID582
Feature activation+7.46e-03

Pos logit contributions
nes+0.025
vel+0.024
oll+0.024
ust+0.023
ode+0.023
Neg logit contributions
 Ms-0.018
 while-0.018
 last-0.016
 teach-0.016
 Mrs-0.016
Token.
Token ID13
Feature activation-2.57e-02

Pos logit contributions
nes+0.016
vel+0.015
oll+0.015
ode+0.015
ust+0.015
Neg logit contributions
 Ms-0.007
 while-0.006
 last-0.006
 teach-0.006
 Mrs-0.006
Token
Token ID198
Feature activation-1.16e-02

Pos logit contributions
 last+0.030
 teach+0.028
 Ms+0.027
glers+0.027
 praise+0.027
Neg logit contributions
nes-0.044
vel-0.039
oll-0.038
anna-0.038
aces-0.036
Token
Token ID198
Feature activation-1.53e-02

Pos logit contributions
 Ms+0.011
 teach+0.010
 while+0.010
 last+0.010
 Mrs+0.009
Neg logit contributions
nes-0.024
vel-0.023
oll-0.022
ust-0.021
aky-0.021
Token her
Token ID607
Feature activation+5.76e-03

Pos logit contributions
 Ms+0.002
 while+0.002
 last+0.002
 teach+0.002
 Mrs+0.001
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.005
aces-0.005
Token shoe
Token ID17292
Feature activation+1.07e-02

Pos logit contributions
nes+0.008
vel+0.008
oll+0.007
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.009
 teach-0.009
 Mrs-0.009
Token was
Token ID373
Feature activation+6.62e-03

Pos logit contributions
nes+0.023
vel+0.022
oll+0.022
ode+0.021
ust+0.021
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.009
 teach-0.008
 Mrs-0.008
Token stuck
Token ID7819
Feature activation-1.59e-04

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.010
ust+0.010
Neg logit contributions
 Ms-0.009
 while-0.009
 last-0.008
 teach-0.008
 Mrs-0.008
Token in
Token ID287
Feature activation-1.03e-02

Pos logit contributions
 Ms+0.000
 while+0.000
 teach+0.000
 Mrs+0.000
 last+0.000
Neg logit contributions
nes-0.000
vel-0.000
oll-0.000
ode-0.000
ust-0.000
Token an
Token ID281
Feature activation+7.73e-02

Pos logit contributions
 Ms+0.009
 teach+0.008
 while+0.008
 last+0.007
 Mrs+0.007
Neg logit contributions
vel-0.022
nes-0.022
oll-0.021
ode-0.020
aces-0.020
Token icy
Token ID30284
Feature activation+4.30e-02

Pos logit contributions
nes+0.171
ust+0.155
emic+0.154
vel+0.153
lor+0.151
Neg logit contributions
 last-0.082
 front-0.073
 Every-0.068
 the-0.068
 Ms-0.068
Token crack
Token ID8469
Feature activation+1.62e-02

Pos logit contributions
nes+0.079
vel+0.077
ust+0.074
oll+0.074
onel+0.073
Neg logit contributions
 while-0.058
 last-0.055
 Ms-0.053
 Mitt-0.052
 Mr-0.050
Token in
Token ID287
Feature activation-1.04e-04

Pos logit contributions
nes+0.037
vel+0.036
oll+0.035
ode+0.035
ust+0.034
Neg logit contributions
 Ms-0.012
 while-0.012
 last-0.011
 teach-0.010
 Mrs-0.009
Token the
Token ID262
Feature activation+2.73e-02

Pos logit contributions
 Ms+0.000
 while+0.000
 teach+0.000
 last+0.000
 Mrs+0.000
Neg logit contributions
nes-0.000
vel-0.000
oll-0.000
ode-0.000
ust-0.000
Token sidewalk
Token ID24058
Feature activation-3.39e-03

Pos logit contributions
nes+0.049
vel+0.047
ust+0.045
oll+0.044
onel+0.042
Neg logit contributions
 Ms-0.037
 last-0.036
 while-0.035
 Mr-0.034
 Mrs-0.033
Token.
Token ID13
Feature activation-1.63e-02

Pos logit contributions
 Ms+0.026
 Mrs+0.021
 praise+0.021
 last+0.021
 teach+0.021
Neg logit contributions
nes-0.044
vel-0.044
oll-0.043
aces-0.043
ode-0.042
Token One
Token ID1881
Feature activation+2.60e-03

Pos logit contributions
 Ms+0.019
 while+0.018
 last+0.017
 teach+0.017
 Mrs+0.017
Neg logit contributions
nes-0.028
oll-0.028
vel-0.028
aces-0.027
ode-0.026
Token day
Token ID1110
Feature activation-3.29e-02

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.005
ust+0.005
Neg logit contributions
 Ms-0.003
 while-0.002
 teach-0.002
 last-0.002
 Mrs-0.002
Token P
Token ID350
Feature activation+1.45e-02

Pos logit contributions
 Ms+0.040
 praise+0.035
 last+0.035
 Mrs+0.034
 teach+0.033
Neg logit contributions
vel-0.058
oll-0.057
aces-0.053
nes-0.052
aund-0.051
Tokenips
Token ID2419
Feature activation+2.08e-02

Pos logit contributions
nes+0.023
vel+0.023
oll+0.022
ode+0.021
aces+0.021
Neg logit contributions
 Ms-0.020
 while-0.019
 last-0.019
 teach-0.018
 Mr-0.018
Tokenque
Token ID4188
Feature activation+7.70e-02

Pos logit contributions
vel+0.043
nes+0.043
oll+0.041
ode+0.041
ust+0.040
Neg logit contributions
 Ms-0.020
 while-0.019
 last-0.018
 Mr-0.017
 Mrs-0.017
Tokenak
Token ID461
Feature activation-2.03e-02

Pos logit contributions
oll+0.076
nes+0.074
alo+0.071
aund+0.068
ode+0.066
Neg logit contributions
s-0.132
 while-0.132
 last-0.125
 with-0.125
 Ms-0.125
Token was
Token ID373
Feature activation-1.53e-03

Pos logit contributions
 Ms+0.027
 teach+0.023
 Mrs+0.023
 while+0.023
 last+0.023
Neg logit contributions
nes-0.035
vel-0.034
oll-0.033
aces-0.032
ode-0.032
Token playing
Token ID2712
Feature activation-2.31e-03

Pos logit contributions
 Ms+0.002
 while+0.002
 last+0.002
 teach+0.002
 Mrs+0.002
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
aces-0.003
Token with
Token ID351
Feature activation+3.15e-03

Pos logit contributions
 Ms+0.002
 last+0.002
 Mrs+0.002
 while+0.002
 Mr+0.002
Neg logit contributions
nes-0.005
vel-0.005
oll-0.004
ode-0.004
aces-0.004
Token its
Token ID663
Feature activation-2.00e-02

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.005
aces+0.005
Neg logit contributions
 Ms-0.004
 while-0.003
 last-0.003
 teach-0.003
 Mrs-0.003
Token,
Token ID11
Feature activation-2.47e-02

Pos logit contributions
nes+0.026
vel+0.026
oll+0.025
ode+0.025
ust+0.025
Neg logit contributions
 Ms-0.006
 while-0.006
 last-0.005
 Mrs-0.005
 Mr-0.005
Token Liz
Token ID28313
Feature activation+1.56e-02

Pos logit contributions
 Ms+0.029
 teach+0.025
 praise+0.025
 Mrs+0.023
 Mr+0.023
Neg logit contributions
nes-0.043
vel-0.042
oll-0.041
ust-0.040
onel-0.038
Token found
Token ID1043
Feature activation+1.92e-03

Pos logit contributions
nes+0.030
vel+0.029
oll+0.029
ode+0.028
ust+0.028
Neg logit contributions
 Ms-0.018
 while-0.017
 last-0.016
 teach-0.016
 Mrs-0.016
Token him
Token ID683
Feature activation+4.25e-03

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 teach-0.001
 last-0.001
 Mrs-0.001
Token in
Token ID287
Feature activation-1.54e-02

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.006
 while-0.005
 last-0.005
 teach-0.005
 Mrs-0.005
Token an
Token ID281
Feature activation+7.70e-02

Pos logit contributions
 Ms+0.014
 while+0.012
 teach+0.012
glers+0.011
 last+0.011
Neg logit contributions
nes-0.030
vel-0.030
oll-0.030
aces-0.028
ode-0.028
Token old
Token ID1468
Feature activation+3.89e-02

Pos logit contributions
nes+0.185
vel+0.176
ust+0.169
emic+0.169
oll+0.164
Neg logit contributions
 last-0.067
 front-0.058
 visit-0.056
 Ms-0.055
 the-0.055
Token dust
Token ID8977
Feature activation+1.62e-02

Pos logit contributions
oll+0.071
nes+0.069
vel+0.069
ust+0.067
onel+0.065
Neg logit contributions
 while-0.053
 Ms-0.051
 visit-0.048
 last-0.047
 teach-0.046
Tokenbin
Token ID8800
Feature activation+1.99e-02

Pos logit contributions
nes+0.032
vel+0.031
oll+0.031
ust+0.030
ode+0.029
Neg logit contributions
 Ms-0.017
 while-0.017
 last-0.016
 teach-0.015
 Mrs-0.015
Token.
Token ID13
Feature activation-3.62e-03

Pos logit contributions
nes+0.047
vel+0.045
oll+0.045
ust+0.044
ode+0.044
Neg logit contributions
 while-0.014
 Ms-0.013
 last-0.012
 teach-0.011
 before-0.011
Token Joe
Token ID5689
Feature activation-3.25e-03

Pos logit contributions
 Ms+0.005
 while+0.004
 last+0.004
 teach+0.004
 Mrs+0.004
Neg logit contributions
nes-0.006
vel-0.006
oll-0.006
aces-0.006
ode-0.006
Token play
Token ID711
Feature activation+6.50e-03

Pos logit contributions
 Ms+0.007
 while+0.006
 last+0.006
 Mrs+0.005
 Mr+0.005
Neg logit contributions
nes-0.016
vel-0.015
oll-0.015
ode-0.015
ust-0.014
Token?"
Token ID1701
Feature activation-2.27e-03

Pos logit contributions
nes+0.015
vel+0.015
oll+0.015
ode+0.014
ust+0.014
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token
Token ID198
Feature activation-2.10e-02

Pos logit contributions
 Ms+0.002
 while+0.002
 last+0.002
 teach+0.002
 Mrs+0.002
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.004
aces-0.004
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
 Ms+0.023
 last+0.022
 while+0.021
 praise+0.021
 teach+0.021
Neg logit contributions
nes-0.039
vel-0.037
oll-0.037
aky-0.034
ust-0.034
TokenS
Token ID50
Feature activation-8.68e-03

Pos logit contributions
 Ms+0.019
 last+0.019
 praise+0.018
 while+0.017
 Mrs+0.017
Neg logit contributions
nes-0.023
vel-0.022
oll-0.021
ode-0.021
aky-0.020
Tokenque
Token ID4188
Feature activation+7.70e-02

Pos logit contributions
 Ms+0.013
 while+0.012
 teach+0.012
 Mrs+0.011
 last+0.011
Neg logit contributions
vel-0.013
oll-0.013
nes-0.013
ode-0.013
ust-0.012
Tokenaky
Token ID15492
Feature activation+1.09e-02

Pos logit contributions
nes+0.080
ode+0.067
oll+0.062
alo+0.059
lor+0.057
Neg logit contributions
 while-0.135
 Ms-0.134
 last-0.129
 every-0.128
Every-0.127
Token replied
Token ID8712
Feature activation-6.49e-03

Pos logit contributions
nes+0.024
vel+0.023
oll+0.023
ode+0.022
ust+0.022
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.008
 teach-0.008
 Mrs-0.008
Token,
Token ID11
Feature activation-1.41e-02

Pos logit contributions
 Ms+0.006
 last+0.005
 teach+0.005
 Mrs+0.004
 praise+0.004
Neg logit contributions
vel-0.014
nes-0.014
oll-0.014
aces-0.013
onel-0.013
Token "
Token ID366
Feature activation-2.58e-02

Pos logit contributions
 Ms+0.016
 last+0.016
 praise+0.015
 teach+0.015
 visit+0.014
Neg logit contributions
vel-0.025
nes-0.024
oll-0.024
aces-0.023
ode-0.022
TokenYes
Token ID5297
Feature activation-1.51e-02

Pos logit contributions
 Ms+0.026
 Mrs+0.023
 last+0.023
 praise+0.023
 thanks+0.023
Neg logit contributions
nes-0.053
ust-0.048
ode-0.047
vel-0.047
oll-0.046
Token and
Token ID290
Feature activation+3.28e-05

Pos logit contributions
 Ms+0.005
 while+0.004
 teach+0.004
 last+0.004
 Mrs+0.004
Neg logit contributions
vel-0.008
nes-0.008
oll-0.008
ode-0.008
aces-0.008
Token the
Token ID262
Feature activation-1.42e-03

Pos logit contributions
nes+0.000
vel+0.000
oll+0.000
ode+0.000
aces+0.000
Neg logit contributions
 Ms-0.000
 while-0.000
 teach-0.000
 last-0.000
 Mrs-0.000
Token car
Token ID1097
Feature activation-1.42e-02

Pos logit contributions
 Ms+0.002
 while+0.002
 Mrs+0.002
 teach+0.001
 last+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.002
ode-0.002
aces-0.002
Token twisted
Token ID19074
Feature activation+1.29e-02

Pos logit contributions
 Ms+0.019
 Mrs+0.017
 while+0.016
 teach+0.016
 praise+0.016
Neg logit contributions
nes-0.025
vel-0.024
oll-0.023
aces-0.023
ode-0.023
Token in
Token ID287
Feature activation+3.41e-03

Pos logit contributions
nes+0.028
vel+0.027
oll+0.027
ode+0.026
ust+0.026
Neg logit contributions
 Ms-0.012
 while-0.011
 last-0.011
 Mr-0.010
 Mrs-0.010
Token an
Token ID281
Feature activation+7.69e-02

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.003
 while-0.003
 last-0.003
 teach-0.003
 Mrs-0.003
Token uncomfortable
Token ID12916
Feature activation+3.84e-02

Pos logit contributions
nes+0.168
vel+0.147
lor+0.146
emic+0.146
ust+0.146
Neg logit contributions
 last-0.085
 Every-0.077
 front-0.077
 teach-0.072
 the-0.071
Token way
Token ID835
Feature activation+2.11e-02

Pos logit contributions
nes+0.073
vel+0.073
oll+0.072
ust+0.071
onel+0.067
Neg logit contributions
 while-0.048
 last-0.045
 Ms-0.041
 visit-0.041
 before-0.040
Token.
Token ID13
Feature activation+2.95e-03

Pos logit contributions
nes+0.052
vel+0.051
oll+0.051
ode+0.049
aces+0.048
Neg logit contributions
 while-0.013
 Ms-0.012
 last-0.011
 before-0.010
 Mr-0.010
Token The
Token ID383
Feature activation-6.06e-03

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.006
aces+0.006
Neg logit contributions
 Ms-0.003
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token little
Token ID1310
Feature activation+6.37e-03

Pos logit contributions
 Ms+0.008
 Mrs+0.006
 while+0.006
 Mitt+0.006
 Mr+0.006
Neg logit contributions
nes-0.011
vel-0.011
oll-0.010
ode-0.010
aces-0.010
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
 Ms+0.022
 Mrs+0.019
 teach+0.018
 Mr+0.018
 last+0.018
Neg logit contributions
nes-0.026
vel-0.026
oll-0.025
aky-0.024
ode-0.024
Token The
Token ID383
Feature activation+2.46e-02

Pos logit contributions
 Ms+0.011
 while+0.010
 last+0.010
 teach+0.010
 praise+0.009
Neg logit contributions
vel-0.022
nes-0.022
aces-0.022
oll-0.022
ode-0.022
Token bird
Token ID6512
Feature activation+8.14e-03

Pos logit contributions
nes+0.052
vel+0.051
oll+0.048
aces+0.046
ode+0.046
Neg logit contributions
 last-0.027
 while-0.025
 Ms-0.024
 Mr-0.023
 visit-0.023
Token lived
Token ID5615
Feature activation-6.91e-03

Pos logit contributions
nes+0.016
vel+0.015
oll+0.015
ode+0.014
ust+0.014
Neg logit contributions
 Ms-0.009
 while-0.009
 last-0.008
 teach-0.008
 Mrs-0.008
Token in
Token ID287
Feature activation+8.23e-03

Pos logit contributions
 Ms+0.011
 Mrs+0.010
 while+0.010
 teach+0.010
 last+0.010
Neg logit contributions
vel-0.010
nes-0.010
oll-0.009
aky-0.009
ust-0.009
Token an
Token ID281
Feature activation+7.63e-02

Pos logit contributions
nes+0.019
vel+0.019
oll+0.018
ode+0.018
ust+0.018
Neg logit contributions
 Ms-0.006
 while-0.005
 last-0.005
 Mrs-0.005
 teach-0.005
Token ordinary
Token ID8850
Feature activation+4.80e-02

Pos logit contributions
nes+0.171
vel+0.165
oll+0.154
ust+0.148
lor+0.147
Neg logit contributions
 last-0.075
 Every-0.070
 the-0.068
 Mr-0.067
 every-0.066
Token tree
Token ID5509
Feature activation-1.63e-02

Pos logit contributions
oll+0.097
aces+0.091
nes+0.091
vel+0.090
onel+0.085
Neg logit contributions
 while-0.064
 visit-0.060
 last-0.058
 every-0.053
 Mr-0.053
Token with
Token ID351
Feature activation+5.55e-03

Pos logit contributions
 Ms+0.019
 Mrs+0.016
 Mr+0.015
 teach+0.015
 Mitt+0.015
Neg logit contributions
nes-0.031
vel-0.031
oll-0.030
aces-0.029
aky-0.029
Token its
Token ID663
Feature activation-1.21e-03

Pos logit contributions
nes+0.010
vel+0.010
oll+0.010
ode+0.010
ust+0.009
Neg logit contributions
 Ms-0.007
 while-0.006
 last-0.006
 teach-0.006
 Mrs-0.006
Token family
Token ID1641
Feature activation+1.36e-03

Pos logit contributions
 Ms+0.002
 while+0.002
 Mrs+0.002
 teach+0.002
 last+0.002
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
ust-0.002
Token and
Token ID290
Feature activation-1.68e-02

Pos logit contributions
nes+0.018
vel+0.018
oll+0.017
ode+0.017
ust+0.017
Neg logit contributions
 Ms-0.011
 while-0.010
 last-0.009
 teach-0.009
 Mrs-0.009
Token you
Token ID345
Feature activation-1.16e-03

Pos logit contributions
 Ms+0.020
 while+0.018
 Mrs+0.017
 last+0.016
 Mr+0.016
Neg logit contributions
nes-0.031
vel-0.030
oll-0.029
ode-0.028
onel-0.027
Token love
Token ID1842
Feature activation+5.33e-03

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 last+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
ust-0.002
Token it
Token ID340
Feature activation-1.35e-02

Pos logit contributions
nes+0.010
vel+0.010
oll+0.010
ode+0.009
ust+0.009
Neg logit contributions
 Ms-0.006
 while-0.006
 last-0.005
 teach-0.005
 Mrs-0.005
Token,
Token ID11
Feature activation+2.03e-03

Pos logit contributions
 Ms+0.015
 teach+0.013
 Mrs+0.013
 while+0.013
 praise+0.013
Neg logit contributions
nes-0.027
vel-0.026
oll-0.025
aces-0.024
ust-0.024
Token Sil
Token ID4243
Feature activation+7.62e-02

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.003
ust+0.003
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Tokenvia
Token ID8869
Feature activation-2.77e-02

Pos logit contributions
oll+0.094
ode+0.088
 ow+0.083
alo+0.080
 ankle+0.080
Neg logit contributions
 Ms-0.103
 before-0.103
 last-0.102
 while-0.102
 upon-0.101
Token!
Token ID0
Feature activation-1.99e-02

Pos logit contributions
 Ms+0.033
 Mrs+0.029
 Mr+0.027
 while+0.027
 last+0.027
Neg logit contributions
nes-0.053
vel-0.052
oll-0.048
anna-0.047
aces-0.047
Token Come
Token ID7911
Feature activation-9.82e-03

Pos logit contributions
 Ms+0.027
 while+0.027
 last+0.026
glers+0.025
 usual+0.024
Neg logit contributions
nes-0.032
vel-0.031
oll-0.030
ode-0.029
ust-0.028
Token now
Token ID783
Feature activation-1.32e-02

Pos logit contributions
 Ms+0.012
 Mrs+0.011
 while+0.011
 last+0.011
 Mr+0.010
Neg logit contributions
vel-0.017
nes-0.017
oll-0.017
ode-0.016
aces-0.015
Token,
Token ID11
Feature activation+1.85e-03

Pos logit contributions
 Ms+0.014
 Mrs+0.012
 while+0.012
 last+0.012
 Mr+0.011
Neg logit contributions
nes-0.026
vel-0.026
oll-0.024
ode-0.024
ust-0.023
Token was
Token ID373
Feature activation+4.71e-03

Pos logit contributions
nes+0.008
vel+0.008
oll+0.008
ode+0.008
ust+0.008
Neg logit contributions
 Ms-0.004
 while-0.004
 last-0.003
 teach-0.003
 Mrs-0.003
Token so
Token ID523
Feature activation-4.94e-04

Pos logit contributions
nes+0.008
vel+0.008
oll+0.008
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.006
 while-0.006
 last-0.006
 teach-0.006
 Mrs-0.006
Token excited
Token ID6568
Feature activation-1.21e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 teach+0.001
 Mrs+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ust-0.001
aces-0.001
Token to
Token ID284
Feature activation+1.29e-03

Pos logit contributions
 Ms+0.013
 teach+0.012
 last+0.011
 Mr+0.011
 praise+0.011
Neg logit contributions
nes-0.023
vel-0.022
oll-0.022
ust-0.021
aces-0.021
Token watch
Token ID2342
Feature activation-8.91e-03

Pos logit contributions
nes+0.003
vel+0.003
oll+0.003
ode+0.003
aces+0.003
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 last-0.001
 Mr-0.001
Token the
Token ID262
Feature activation-7.55e-02

Pos logit contributions
 Ms+0.007
 while+0.006
 last+0.006
 Mrs+0.005
 teach+0.005
Neg logit contributions
vel-0.020
nes-0.020
oll-0.019
aces-0.018
ode-0.018
Token new
Token ID649
Feature activation-7.52e-02

Pos logit contributions
thank+0.118
Thank+0.112
 hire+0.111
 Mitt+0.110
 Ms+0.110
Neg logit contributions
vel-0.073
nes-0.058
glass-0.057
oll-0.056
od-0.054
Token video
Token ID2008
Feature activation+5.78e-03

Pos logit contributions
Thank+0.121
 usual+0.119
Dr+0.116
glers+0.114
thank+0.114
Neg logit contributions
vel-0.072
nes-0.060
ont-0.057
glass-0.055
od-0.055
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
nes+0.013
vel+0.013
oll+0.012
ode+0.012
ust+0.012
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token But
Token ID887
Feature activation-3.93e-03

Pos logit contributions
 Ms+0.021
 last+0.021
 while+0.020
 teach+0.020
glers+0.019
Neg logit contributions
nes-0.023
oll-0.022
vel-0.022
aces-0.022
ust-0.020
Token when
Token ID618
Feature activation-2.08e-02

Pos logit contributions
 Ms+0.003
 teach+0.002
 last+0.002
 while+0.002
 praise+0.002
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
ust-0.008
ode-0.008
Token so
Token ID523
Feature activation-4.94e-04

Pos logit contributions
nes+0.008
vel+0.008
oll+0.008
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.006
 while-0.006
 last-0.006
 teach-0.006
 Mrs-0.006
Token excited
Token ID6568
Feature activation-1.21e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 teach+0.001
 Mrs+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ust-0.001
aces-0.001
Token to
Token ID284
Feature activation+1.29e-03

Pos logit contributions
 Ms+0.013
 teach+0.012
 last+0.011
 Mr+0.011
 praise+0.011
Neg logit contributions
nes-0.023
vel-0.022
oll-0.022
ust-0.021
aces-0.021
Token watch
Token ID2342
Feature activation-8.91e-03

Pos logit contributions
nes+0.003
vel+0.003
oll+0.003
ode+0.003
aces+0.003
Neg logit contributions
 Ms-0.001
 while-0.001
 Mrs-0.001
 last-0.001
 Mr-0.001
Token the
Token ID262
Feature activation-7.55e-02

Pos logit contributions
 Ms+0.007
 while+0.006
 last+0.006
 Mrs+0.005
 teach+0.005
Neg logit contributions
vel-0.020
nes-0.020
oll-0.019
aces-0.018
ode-0.018
Token new
Token ID649
Feature activation-7.52e-02

Pos logit contributions
thank+0.118
Thank+0.112
 hire+0.111
 Mitt+0.110
 Ms+0.110
Neg logit contributions
vel-0.073
nes-0.058
glass-0.057
oll-0.056
od-0.054
Token video
Token ID2008
Feature activation+5.78e-03

Pos logit contributions
Thank+0.121
 usual+0.119
Dr+0.116
glers+0.114
thank+0.114
Neg logit contributions
vel-0.072
nes-0.060
ont-0.057
glass-0.055
od-0.055
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
nes+0.013
vel+0.013
oll+0.012
ode+0.012
ust+0.012
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token But
Token ID887
Feature activation-3.93e-03

Pos logit contributions
 Ms+0.021
 last+0.021
 while+0.020
 teach+0.020
glers+0.019
Neg logit contributions
nes-0.023
oll-0.022
vel-0.022
aces-0.022
ust-0.020
Token when
Token ID618
Feature activation-2.08e-02

Pos logit contributions
 Ms+0.003
 teach+0.002
 last+0.002
 while+0.002
 praise+0.002
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
ust-0.008
ode-0.008
Token she
Token ID673
Feature activation+8.91e-03

Pos logit contributions
 Ms+0.025
 praise+0.024
 teach+0.024
 last+0.022
glers+0.021
Neg logit contributions
nes-0.033
vel-0.031
oll-0.031
ust-0.028
onel-0.027
Token One
Token ID1881
Feature activation+2.78e-04

Pos logit contributions
nes+0.017
vel+0.017
oll+0.016
ode+0.016
ust+0.016
Neg logit contributions
 Ms-0.006
 while-0.005
 last-0.005
 Mrs-0.005
 teach-0.005
Token day
Token ID1110
Feature activation-2.23e-02

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ust+0.001
ode+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 teach-0.000
 Mrs-0.000
 last-0.000
Token,
Token ID11
Feature activation-3.34e-02

Pos logit contributions
 Ms+0.025
 praise+0.023
 last+0.022
 teach+0.022
glers+0.022
Neg logit contributions
vel-0.039
oll-0.037
nes-0.036
aces-0.036
aund-0.035
Token he
Token ID339
Feature activation-1.00e-02

Pos logit contributions
 praise+0.035
glers+0.035
 Ms+0.035
 thanks+0.033
 teach+0.032
Neg logit contributions
nes-0.049
vel-0.049
aces-0.046
oll-0.045
glass-0.044
Token came
Token ID1625
Feature activation-2.92e-02

Pos logit contributions
 Ms+0.015
 while+0.014
 Mrs+0.013
 last+0.013
 teach+0.013
Neg logit contributions
nes-0.015
vel-0.015
oll-0.014
ode-0.014
aky-0.014
Token up
Token ID510
Feature activation-7.50e-02

Pos logit contributions
 Ms+0.046
glers+0.042
 Mrs+0.040
 Mitt+0.040
 Mr+0.040
Neg logit contributions
vel-0.041
ust-0.040
nes-0.039
aky-0.039
onel-0.037
Token with
Token ID351
Feature activation-1.16e-03

Pos logit contributions
glers+0.105
 Ms+0.102
 Mitt+0.097
 praise+0.095
 choice+0.094
Neg logit contributions
vel-0.105
ust-0.094
aky-0.092
nes-0.090
oll-0.085
Token an
Token ID281
Feature activation+5.98e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 teach+0.001
 Mrs+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
aces-0.002
Token idea
Token ID2126
Feature activation-5.32e-03

Pos logit contributions
nes+0.139
vel+0.123
lor+0.123
emic+0.119
ode+0.118
Neg logit contributions
 last-0.051
 teach-0.050
 Ms-0.048
 work-0.045
 Mrs-0.045
Token to
Token ID284
Feature activation-9.85e-03

Pos logit contributions
 Ms+0.005
 while+0.004
 last+0.004
 Mrs+0.004
 Mr+0.004
Neg logit contributions
vel-0.011
nes-0.011
oll-0.011
ode-0.010
aces-0.010
Token scare
Token ID19437
Feature activation+2.50e-02

Pos logit contributions
 Ms+0.010
 while+0.009
 Mrs+0.009
 Mr+0.008
 Mitt+0.008
Neg logit contributions
nes-0.020
vel-0.019
ode-0.018
oll-0.018
ust-0.018
Token.
Token ID13
Feature activation+4.17e-03

Pos logit contributions
 Ms+0.012
 Mrs+0.010
 last+0.010
 Mr+0.010
 teach+0.010
Neg logit contributions
nes-0.020
vel-0.019
oll-0.018
aces-0.018
ust-0.017
Token One
Token ID1881
Feature activation-9.16e-03

Pos logit contributions
nes+0.009
vel+0.009
oll+0.009
ode+0.008
ust+0.008
Neg logit contributions
 Ms-0.004
 while-0.003
 last-0.003
 teach-0.003
 Mrs-0.003
Token day
Token ID1110
Feature activation-1.94e-02

Pos logit contributions
 Ms+0.011
 Mrs+0.010
 teach+0.009
 Mr+0.009
 praise+0.009
Neg logit contributions
nes-0.017
oll-0.016
vel-0.016
ust-0.015
aky-0.015
Token he
Token ID339
Feature activation-1.19e-02

Pos logit contributions
 Ms+0.022
 teach+0.019
 last+0.018
 praise+0.018
 Mrs+0.018
Neg logit contributions
vel-0.036
oll-0.035
nes-0.034
aces-0.033
ode-0.032
Token came
Token ID1625
Feature activation-2.58e-02

Pos logit contributions
 Ms+0.017
 Mrs+0.015
 while+0.015
 last+0.015
 teach+0.014
Neg logit contributions
nes-0.020
vel-0.018
oll-0.018
ust-0.018
aky-0.018
Token up
Token ID510
Feature activation-7.25e-02

Pos logit contributions
 Ms+0.036
 Mrs+0.031
glers+0.031
 praise+0.031
 teach+0.031
Neg logit contributions
vel-0.041
nes-0.040
ust-0.040
aky-0.039
oll-0.038
Token with
Token ID351
Feature activation-4.88e-03

Pos logit contributions
glers+0.098
 Ms+0.097
 praise+0.095
 Mitt+0.091
 choice+0.091
Neg logit contributions
vel-0.099
nes-0.090
ust-0.089
aky-0.089
oll-0.083
Token a
Token ID257
Feature activation-1.75e-03

Pos logit contributions
 Ms+0.005
 while+0.005
 teach+0.004
 last+0.004
 Mrs+0.004
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
ode-0.009
aces-0.009
Token new
Token ID649
Feature activation-5.49e-03

Pos logit contributions
 Ms+0.002
 while+0.002
 last+0.002
 Mrs+0.002
 teach+0.002
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
ust-0.003
Token game
Token ID983
Feature activation-1.18e-02

Pos logit contributions
 Ms+0.008
 Mrs+0.007
 while+0.007
 Mr+0.007
 last+0.007
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
ode-0.008
aces-0.008
Token.
Token ID13
Feature activation+6.98e-04

Pos logit contributions
 Ms+0.012
 Mrs+0.010
 Mr+0.010
 while+0.010
 teach+0.010
Neg logit contributions
nes-0.024
vel-0.023
oll-0.023
aces-0.023
ode-0.022
Token Benny
Token ID44275
Feature activation-1.43e-02

Pos logit contributions
 praise+0.063
 Ms+0.061
 hire+0.059
 choice+0.057
 praises+0.056
Neg logit contributions
vel-0.050
nes-0.048
bowl-0.046
 anchor-0.044
oll-0.044
Token's
Token ID338
Feature activation+1.33e-02

Pos logit contributions
 Ms+0.020
 teach+0.018
 last+0.018
 Mrs+0.018
 praise+0.018
Neg logit contributions
nes-0.023
vel-0.022
oll-0.022
ode-0.021
aky-0.021
Token friend
Token ID1545
Feature activation-1.08e-02

Pos logit contributions
nes+0.029
vel+0.028
oll+0.027
ode+0.026
ust+0.026
Neg logit contributions
 Ms-0.012
 while-0.012
 last-0.011
 teach-0.011
 Mr-0.011
Token Sally
Token ID25737
Feature activation-2.31e-02

Pos logit contributions
 Ms+0.010
 last+0.009
 while+0.009
 teach+0.009
 praise+0.008
Neg logit contributions
nes-0.022
vel-0.021
oll-0.021
ode-0.020
ust-0.020
Token came
Token ID1625
Feature activation-1.60e-02

Pos logit contributions
 Ms+0.030
 last+0.027
 praise+0.027
 teach+0.026
 while+0.026
Neg logit contributions
nes-0.037
vel-0.037
oll-0.035
ode-0.034
aky-0.034
Token up
Token ID510
Feature activation-7.23e-02

Pos logit contributions
 Ms+0.021
 Mrs+0.018
 while+0.018
 teach+0.017
 Mr+0.017
Neg logit contributions
vel-0.027
nes-0.027
aky-0.026
ust-0.026
oll-0.025
Token to
Token ID284
Feature activation-2.54e-02

Pos logit contributions
glers+0.107
 Ms+0.103
 praise+0.101
 choice+0.101
 allows+0.097
Neg logit contributions
vel-0.088
aky-0.077
nes-0.076
ust-0.074
ful-0.071
Token him
Token ID683
Feature activation+2.56e-03

Pos logit contributions
 Ms+0.022
 last+0.019
glers+0.019
 while+0.019
 Mrs+0.017
Neg logit contributions
nes-0.049
vel-0.048
ode-0.045
aky-0.043
anna-0.043
Token and
Token ID290
Feature activation-3.40e-02

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.005
ust+0.005
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 Mrs-0.002
 teach-0.002
Token said
Token ID531
Feature activation+1.31e-02

Pos logit contributions
 last+0.040
 Ms+0.038
 while+0.036
 before+0.036
 praise+0.035
Neg logit contributions
nes-0.054
anna-0.049
vel-0.049
aky-0.047
ust-0.047
Token,
Token ID11
Feature activation+2.92e-03

Pos logit contributions
nes+0.033
vel+0.032
oll+0.031
ode+0.031
ust+0.030
Neg logit contributions
 Ms-0.008
 while-0.007
 Mr-0.006
 Mrs-0.006
 teach-0.006
Token "
Token ID366
Feature activation-1.21e-02

Pos logit contributions
 last+0.030
 Ms+0.028
 praise+0.028
 teach+0.028
 visit+0.026
Neg logit contributions
nes-0.042
vel-0.042
oll-0.042
anna-0.040
aces-0.040
TokenDon
Token ID3987
Feature activation-2.63e-02

Pos logit contributions
 Ms+0.015
 last+0.014
 teach+0.014
 while+0.013
 Mr+0.013
Neg logit contributions
nes-0.021
vel-0.021
ode-0.020
ust-0.019
aky-0.019
Token't
Token ID470
Feature activation-7.71e-03

Pos logit contributions
 Ms+0.020
 while+0.018
 praise+0.016
 Mrs+0.015
 teach+0.015
Neg logit contributions
nes-0.059
vel-0.056
oll-0.055
ode-0.053
ust-0.053
Token worry
Token ID5490
Feature activation-1.11e-02

Pos logit contributions
 Ms+0.011
 while+0.010
 Mrs+0.010
 last+0.010
 Mr+0.009
Neg logit contributions
nes-0.012
vel-0.012
ode-0.012
ust-0.011
oll-0.011
Token,
Token ID11
Feature activation-2.58e-02

Pos logit contributions
 Ms+0.010
 teach+0.009
 last+0.008
 while+0.008
 praise+0.008
Neg logit contributions
vel-0.023
oll-0.023
nes-0.022
ust-0.022
ode-0.021
Token little
Token ID1310
Feature activation-7.20e-02

Pos logit contributions
 Ms+0.029
glers+0.027
 praise+0.026
 while+0.026
 grace+0.025
Neg logit contributions
oll-0.042
vel-0.042
ode-0.041
nes-0.041
ust-0.041
Token mouse
Token ID10211
Feature activation-2.79e-02

Pos logit contributions
glers+0.111
 Ms+0.106
 usual+0.101
 early+0.101
bel+0.101
Neg logit contributions
 wing-0.087
ig-0.084
 egg-0.084
vel-0.083
ode-0.080
Token,
Token ID11
Feature activation-7.20e-03

Pos logit contributions
 Ms+0.034
 Mrs+0.028
 last+0.027
 particular+0.027
 praise+0.027
Neg logit contributions
nes-0.051
vel-0.050
oll-0.047
onel-0.047
ode-0.046
Token I
Token ID314
Feature activation-2.17e-02

Pos logit contributions
 Ms+0.008
 while+0.007
 last+0.007
 Mrs+0.007
 teach+0.007
Neg logit contributions
nes-0.013
vel-0.013
oll-0.013
ode-0.013
ust-0.012
Token will
Token ID481
Feature activation-2.68e-02

Pos logit contributions
 Ms+0.035
 Mrs+0.031
 while+0.030
 Mr+0.029
 before+0.029
Neg logit contributions
vel-0.033
nes-0.032
oll-0.031
onel-0.030
ode-0.030
Token rescue
Token ID9992
Feature activation-1.68e-02

Pos logit contributions
 Ms+0.027
glers+0.022
 while+0.021
 Mrs+0.020
s+0.019
Neg logit contributions
nes-0.055
vel-0.055
ode-0.052
onel-0.051
oll-0.049
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
 Ms+0.003
 while+0.003
 last+0.003
 teach+0.002
 Mrs+0.002
Neg logit contributions
nes-0.007
vel-0.007
oll-0.007
ode-0.007
aces-0.007
Token "
Token ID366
Feature activation-6.51e-03

Pos logit contributions
 last+0.023
 teach+0.021
 Ms+0.021
 praise+0.020
 while+0.020
Neg logit contributions
nes-0.036
oll-0.035
vel-0.035
anna-0.033
ust-0.033
TokenDon
Token ID3987
Feature activation-2.19e-02

Pos logit contributions
 Ms+0.008
 last+0.007
 while+0.007
 teach+0.007
 Mrs+0.007
Neg logit contributions
nes-0.012
vel-0.012
oll-0.011
ode-0.011
ust-0.011
Token't
Token ID470
Feature activation-7.65e-03

Pos logit contributions
 Ms+0.015
 while+0.013
 praise+0.011
 Mrs+0.011
 teach+0.011
Neg logit contributions
nes-0.051
vel-0.049
oll-0.048
ode-0.047
ust-0.046
Token worry
Token ID5490
Feature activation-4.73e-03

Pos logit contributions
 Ms+0.011
 while+0.010
 last+0.009
 Mrs+0.009
 teach+0.009
Neg logit contributions
nes-0.012
vel-0.012
ode-0.011
ust-0.011
oll-0.011
Token little
Token ID1310
Feature activation-7.08e-02

Pos logit contributions
 Ms+0.004
 while+0.003
 teach+0.003
 last+0.003
 visit+0.003
Neg logit contributions
nes-0.010
vel-0.010
oll-0.010
ust-0.010
ode-0.009
Token friend
Token ID1545
Feature activation-2.39e-02

Pos logit contributions
 early+0.107
glers+0.105
 Ms+0.103
 usual+0.100
bel+0.097
Neg logit contributions
 wing-0.077
vel-0.077
 egg-0.076
ode-0.075
oll-0.073
Token,
Token ID11
Feature activation-1.55e-02

Pos logit contributions
 Ms+0.026
 teach+0.022
 grace+0.022
 praise+0.021
glers+0.021
Neg logit contributions
nes-0.044
vel-0.044
ode-0.041
oll-0.041
ust-0.040
Token I
Token ID314
Feature activation-1.83e-02

Pos logit contributions
 Ms+0.017
 while+0.016
 last+0.015
 teach+0.015
glers+0.015
Neg logit contributions
nes-0.027
vel-0.026
oll-0.026
ode-0.026
ust-0.025
Token will
Token ID481
Feature activation-1.84e-02

Pos logit contributions
 Ms+0.031
 Mrs+0.027
 while+0.027
 Mr+0.026
glers+0.025
Neg logit contributions
nes-0.026
vel-0.025
oll-0.024
ust-0.024
ode-0.023
Token help
Token ID1037
Feature activation-2.98e-02

Pos logit contributions
 Ms+0.016
 while+0.013
 Mrs+0.012
glers+0.012
 last+0.011
Neg logit contributions
nes-0.040
vel-0.039
ode-0.038
onel-0.036
oll-0.036
Token searched
Token ID16499
Feature activation-2.54e-02

Pos logit contributions
oll+0.061
vel+0.059
nes+0.059
aces+0.058
ust+0.056
Neg logit contributions
 while-0.046
 Ms-0.043
 last-0.042
 every-0.041
 visit-0.040
Token until
Token ID1566
Feature activation-1.78e-03

Pos logit contributions
 Ms+0.032
 teach+0.028
 Mrs+0.027
 Mr+0.027
 hire+0.027
Neg logit contributions
vel-0.042
nes-0.041
aces-0.039
aky-0.039
oll-0.039
Token she
Token ID673
Feature activation+1.59e-02

Pos logit contributions
 Ms+0.002
 teach+0.001
 while+0.001
 last+0.001
 Mrs+0.001
Neg logit contributions
nes-0.004
vel-0.004
oll-0.004
ode-0.003
ust-0.003
Token finally
Token ID3443
Feature activation+1.05e-02

Pos logit contributions
nes+0.028
vel+0.028
oll+0.027
ode+0.026
aces+0.026
Neg logit contributions
 Ms-0.021
 while-0.020
 last-0.019
 Mrs-0.018
 Mr-0.018
Token found
Token ID1043
Feature activation+2.20e-03

Pos logit contributions
nes+0.016
vel+0.016
oll+0.016
ode+0.015
ust+0.015
Neg logit contributions
 Ms-0.016
 while-0.015
 last-0.014
 teach-0.014
 Mrs-0.014
Token the
Token ID262
Feature activation-7.06e-02

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.001
Token video
Token ID2008
Feature activation+2.14e-02

Pos logit contributions
Dr+0.101
 Ms+0.101
thank+0.100
Thank+0.099
mun+0.098
Neg logit contributions
vel-0.067
aky-0.061
nes-0.060
 pad-0.058
oll-0.057
Token.
Token ID13
Feature activation-9.78e-03

Pos logit contributions
nes+0.049
vel+0.048
oll+0.047
ode+0.047
ust+0.047
Neg logit contributions
 Ms-0.016
 while-0.015
 last-0.014
 teach-0.013
 Mrs-0.013
Token She
Token ID1375
Feature activation+6.79e-03

Pos logit contributions
 Ms+0.013
 last+0.012
 while+0.012
 teach+0.012
glers+0.012
Neg logit contributions
nes-0.016
oll-0.015
vel-0.015
aces-0.014
ust-0.014
Token was
Token ID373
Feature activation+1.87e-02

Pos logit contributions
nes+0.014
vel+0.014
oll+0.014
ode+0.013
ust+0.013
Neg logit contributions
 Ms-0.006
 while-0.006
 last-0.005
 teach-0.005
 Mrs-0.005
Token so
Token ID523
Feature activation+4.53e-03

Pos logit contributions
nes+0.031
vel+0.030
oll+0.030
ode+0.030
ust+0.029
Neg logit contributions
 Ms-0.026
 while-0.025
 last-0.024
 Mrs-0.024
 Mr-0.024
Token One
Token ID1881
Feature activation-7.31e-03

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
aces+0.001
Neg logit contributions
 Ms-0.001
 while-0.001
 last-0.001
 teach-0.001
 Mrs-0.001
Token day
Token ID1110
Feature activation-1.40e-02

Pos logit contributions
 Ms+0.009
 teach+0.007
 Mrs+0.007
 Mr+0.007
 praise+0.007
Neg logit contributions
nes-0.013
oll-0.013
vel-0.013
ust-0.012
aky-0.012
Token,
Token ID11
Feature activation-2.68e-02

Pos logit contributions
 Ms+0.013
 last+0.012
 teach+0.012
 praise+0.012
glers+0.011
Neg logit contributions
vel-0.028
nes-0.027
oll-0.027
aces-0.025
ode-0.024
Token she
Token ID673
Feature activation+1.14e-03

Pos logit contributions
 praise+0.028
 Ms+0.028
 thanks+0.028
 teach+0.027
 last+0.027
Neg logit contributions
nes-0.044
vel-0.042
oll-0.040
aces-0.038
onel-0.037
Token came
Token ID1625
Feature activation-2.05e-02

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ode+0.002
ust+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 last-0.001
 Mrs-0.001
 teach-0.001
Token up
Token ID510
Feature activation-7.06e-02

Pos logit contributions
 Ms+0.029
 Mr+0.025
 Mrs+0.025
 teach+0.025
glers+0.024
Neg logit contributions
vel-0.033
nes-0.033
ust-0.031
aky-0.030
oll-0.030
Token with
Token ID351
Feature activation-1.01e-02

Pos logit contributions
glers+0.094
 Ms+0.090
 praise+0.087
 choice+0.085
 Mitt+0.085
Neg logit contributions
vel-0.103
nes-0.088
aky-0.085
ust-0.085
per-0.083
Token a
Token ID257
Feature activation+3.35e-03

Pos logit contributions
 Ms+0.011
 while+0.010
 last+0.009
 teach+0.009
 Mr+0.009
Neg logit contributions
vel-0.019
nes-0.019
oll-0.019
aces-0.018
ode-0.017
Token great
Token ID1049
Feature activation+6.82e-03

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.006
ust+0.006
Neg logit contributions
 Ms-0.004
 while-0.003
 last-0.003
 teach-0.003
 Mrs-0.003
Token idea
Token ID2126
Feature activation-8.14e-04

Pos logit contributions
nes+0.013
vel+0.013
oll+0.012
ode+0.012
ust+0.012
Neg logit contributions
 Ms-0.008
 while-0.007
 last-0.007
 teach-0.007
 Mrs-0.007
Token.
Token ID13
Feature activation+2.16e-03

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 Mrs+0.001
 Mr+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
aces-0.002
Token park
Token ID3952
Feature activation-4.28e-02

Pos logit contributions
nes+0.083
vel+0.082
aces+0.080
oll+0.080
ust+0.077
Neg logit contributions
 Ms-0.035
 visit-0.034
 last-0.033
 Mitt-0.033
 while-0.032
Token when
Token ID618
Feature activation-1.26e-02

Pos logit contributions
 Ms+0.047
glers+0.041
 praise+0.040
 Mr+0.038
 Mrs+0.038
Neg logit contributions
vel-0.079
nes-0.077
adow-0.075
oll-0.073
aces-0.072
Token a
Token ID257
Feature activation-1.54e-03

Pos logit contributions
 Ms+0.014
 teach+0.013
 praise+0.012
 while+0.012
 last+0.012
Neg logit contributions
nes-0.023
vel-0.022
oll-0.022
aces-0.021
onel-0.021
Token boy
Token ID2933
Feature activation-2.55e-02

Pos logit contributions
 Ms+0.002
 while+0.001
 last+0.001
 teach+0.001
 Mrs+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
ust-0.003
Token came
Token ID1625
Feature activation-2.33e-02

Pos logit contributions
 Ms+0.036
 last+0.032
 Mrs+0.031
 praise+0.031
 teach+0.030
Neg logit contributions
nes-0.039
vel-0.039
oll-0.037
ode-0.037
aky-0.035
Token up
Token ID510
Feature activation-7.03e-02

Pos logit contributions
 Ms+0.031
 Mrs+0.027
 while+0.026
 teach+0.026
 praise+0.026
Neg logit contributions
nes-0.037
vel-0.037
aky-0.036
ust-0.036
ode-0.035
Token to
Token ID284
Feature activation-1.91e-02

Pos logit contributions
 Ms+0.100
glers+0.097
 Mitt+0.094
 allows+0.094
 praise+0.093
Neg logit contributions
vel-0.083
nes-0.081
aky-0.075
ust-0.074
ful-0.070
Token her
Token ID607
Feature activation-5.43e-03

Pos logit contributions
 Ms+0.021
 while+0.020
 last+0.019
glers+0.019
 Mitt+0.017
Neg logit contributions
nes-0.033
vel-0.032
ode-0.030
oll-0.029
anna-0.028
Token and
Token ID290
Feature activation-3.28e-02

Pos logit contributions
 Ms+0.006
 while+0.005
 Mrs+0.005
 teach+0.005
 last+0.005
Neg logit contributions
nes-0.011
oll-0.011
vel-0.010
ode-0.010
aces-0.010
Token asked
Token ID1965
Feature activation-5.59e-03

Pos logit contributions
 Ms+0.041
 last+0.040
 while+0.037
glers+0.036
 hire+0.036
Neg logit contributions
nes-0.049
anna-0.045
vel-0.044
aky-0.042
ode-0.041
Token if
Token ID611
Feature activation-2.58e-02

Pos logit contributions
 Ms+0.006
 last+0.006
 teach+0.005
 while+0.005
 praise+0.005
Neg logit contributions
vel-0.010
nes-0.010
oll-0.010
ust-0.009
aces-0.009
Token mom
Token ID1995
Feature activation-8.90e-03

Pos logit contributions
 Ms+0.008
 while+0.008
 Mrs+0.007
 teach+0.007
 last+0.007
Neg logit contributions
nes-0.011
oll-0.011
vel-0.010
aces-0.010
ode-0.010
Token.
Token ID13
Feature activation-7.60e-03

Pos logit contributions
 Ms+0.009
 while+0.007
 praise+0.007
 teach+0.007
 last+0.007
Neg logit contributions
nes-0.018
vel-0.017
oll-0.017
ode-0.017
aky-0.016
Token A
Token ID317
Feature activation+4.60e-03

Pos logit contributions
 Ms+0.010
 last+0.009
 while+0.009
 teach+0.009
 praise+0.008
Neg logit contributions
nes-0.013
vel-0.012
oll-0.012
aces-0.012
ust-0.011
Token stranger
Token ID16195
Feature activation-1.96e-02

Pos logit contributions
nes+0.010
vel+0.009
oll+0.009
ode+0.009
ust+0.009
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token came
Token ID1625
Feature activation-1.68e-02

Pos logit contributions
 Ms+0.025
 last+0.022
 while+0.022
 Mrs+0.021
 teach+0.021
Neg logit contributions
nes-0.033
vel-0.032
oll-0.032
ode-0.030
onel-0.029
Token up
Token ID510
Feature activation-7.02e-02

Pos logit contributions
 Ms+0.021
 teach+0.019
 Mrs+0.018
 praise+0.018
 Mr+0.018
Neg logit contributions
nes-0.029
vel-0.028
ust-0.027
aky-0.027
oll-0.026
Token to
Token ID284
Feature activation-2.27e-02

Pos logit contributions
glers+0.109
 expiration+0.104
 praise+0.103
mun+0.102
 Ms+0.101
Neg logit contributions
nes-0.071
vel-0.070
per-0.062
ful-0.062
aky-0.062
Token them
Token ID606
Feature activation-6.32e-03

Pos logit contributions
 Ms+0.026
 while+0.025
glers+0.024
 downward+0.024
 last+0.024
Neg logit contributions
nes-0.038
vel-0.035
ode-0.031
oll-0.031
anna-0.031
Token and
Token ID290
Feature activation-2.64e-02

Pos logit contributions
 Ms+0.007
 while+0.006
 last+0.006
 Mrs+0.006
 teach+0.005
Neg logit contributions
nes-0.012
vel-0.012
oll-0.012
ode-0.012
aces-0.012
Token asked
Token ID1965
Feature activation-1.51e-02

Pos logit contributions
 Ms+0.028
 last+0.027
 while+0.026
 praise+0.025
 teach+0.025
Neg logit contributions
nes-0.047
vel-0.042
oll-0.041
anna-0.040
ust-0.040
Token for
Token ID329
Feature activation-1.57e-02

Pos logit contributions
 Ms+0.017
 last+0.016
 teach+0.016
 praise+0.015
glers+0.015
Neg logit contributions
vel-0.026
nes-0.026
oll-0.025
ust-0.023
aces-0.023
Token up
Token ID510
Feature activation-2.90e-02

Pos logit contributions
 Ms+0.003
 while+0.003
 Mrs+0.003
 last+0.003
 Mr+0.003
Neg logit contributions
nes-0.005
vel-0.005
ode-0.005
oll-0.005
aces-0.005
Token,
Token ID11
Feature activation-2.37e-02

Pos logit contributions
 Ms+0.036
 teach+0.032
 Mitt+0.031
glers+0.030
 hire+0.030
Neg logit contributions
vel-0.049
nes-0.048
aces-0.048
oll-0.045
ust-0.045
Token then
Token ID788
Feature activation-2.69e-03

Pos logit contributions
 Ms+0.028
 last+0.024
 Mrs+0.024
 Mitt+0.024
 teach+0.023
Neg logit contributions
vel-0.040
nes-0.040
aces-0.038
ode-0.038
oll-0.038
Token come
Token ID1282
Feature activation-2.46e-02

Pos logit contributions
 Ms+0.003
 while+0.003
 last+0.003
 Mrs+0.003
 teach+0.003
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.005
aces-0.005
Token back
Token ID736
Feature activation-2.41e-02

Pos logit contributions
 Ms+0.043
 Mrs+0.037
 Mitt+0.036
 Mr+0.036
 praise+0.035
Neg logit contributions
nes-0.034
vel-0.033
ode-0.032
ust-0.031
aces-0.031
Token down
Token ID866
Feature activation-6.99e-02

Pos logit contributions
 Ms+0.030
 teach+0.026
 Mitt+0.025
 praise+0.025
glers+0.025
Neg logit contributions
nes-0.040
vel-0.040
oll-0.038
aces-0.038
adow-0.037
Token to
Token ID284
Feature activation-1.05e-03

Pos logit contributions
glers+0.099
 Mitt+0.098
 hire+0.091
mun+0.090
 Ms+0.090
Neg logit contributions
vel-0.083
 itself-0.075
aund-0.071
nes-0.071
onel-0.070
Token her
Token ID607
Feature activation+1.27e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 Mrs+0.001
 Mr+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
aces-0.002
Token tree
Token ID5509
Feature activation-4.78e-03

Pos logit contributions
nes+0.016
vel+0.015
oll+0.015
ode+0.014
ust+0.014
Neg logit contributions
 Ms-0.023
 while-0.023
 last-0.022
 teach-0.022
 Mr-0.021
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
 Ms+0.005
 while+0.004
 Mrs+0.004
 Mr+0.004
 teach+0.004
Neg logit contributions
nes-0.010
vel-0.010
oll-0.010
ode-0.009
aces-0.009
Token
Token ID198
Feature activation+3.93e-04

Pos logit contributions
 Ms+0.010
 while+0.009
 last+0.009
 teach+0.009
 Mrs+0.008
Neg logit contributions
nes-0.020
oll-0.019
vel-0.019
aces-0.019
ust-0.018
Token,
Token ID11
Feature activation-6.80e-04

Pos logit contributions
 Ms+0.006
 while+0.005
 last+0.005
 teach+0.005
 Mrs+0.005
Neg logit contributions
nes-0.011
vel-0.011
oll-0.010
ode-0.010
aces-0.010
Token named
Token ID3706
Feature activation-3.49e-03

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 teach+0.001
 Mrs+0.001
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
ust-0.001
Token Linda
Token ID21776
Feature activation-1.29e-02

Pos logit contributions
 Ms+0.004
 while+0.004
 last+0.003
 teach+0.003
 Mrs+0.003
Neg logit contributions
nes-0.007
vel-0.007
oll-0.006
ust-0.006
ode-0.006
Token,
Token ID11
Feature activation+3.10e-04

Pos logit contributions
 Ms+0.012
 while+0.010
 last+0.010
 teach+0.010
 Mrs+0.009
Neg logit contributions
nes-0.028
vel-0.026
oll-0.026
ode-0.025
aces-0.025
Token came
Token ID1625
Feature activation-1.83e-02

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
ust+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 last-0.000
 teach-0.000
 Mrs-0.000
Token up
Token ID510
Feature activation-6.96e-02

Pos logit contributions
 Ms+0.025
 Mrs+0.021
 Mr+0.021
 teach+0.021
 while+0.021
Neg logit contributions
nes-0.031
vel-0.031
ust-0.029
ode-0.028
oll-0.028
Token to
Token ID284
Feature activation-1.07e-02

Pos logit contributions
glers+0.094
 Ms+0.092
 Mitt+0.092
 praise+0.087
 allows+0.083
Neg logit contributions
vel-0.097
nes-0.085
ust-0.084
per-0.081
aky-0.079
Token him
Token ID683
Feature activation-3.46e-03

Pos logit contributions
 Ms+0.011
 while+0.010
 last+0.009
 Mrs+0.009
 Mr+0.009
Neg logit contributions
nes-0.020
vel-0.020
ode-0.019
oll-0.019
ust-0.018
Token and
Token ID290
Feature activation-9.02e-03

Pos logit contributions
 Ms+0.003
 while+0.003
 Mrs+0.003
 last+0.003
 Mr+0.003
Neg logit contributions
nes-0.007
vel-0.007
oll-0.007
ode-0.007
aces-0.007
Token asked
Token ID1965
Feature activation+5.80e-03

Pos logit contributions
 Ms+0.009
 last+0.008
 while+0.008
 teach+0.007
 Mr+0.007
Neg logit contributions
nes-0.018
vel-0.017
oll-0.017
ode-0.016
ust-0.016
Token if
Token ID611
Feature activation-1.59e-02

Pos logit contributions
nes+0.012
vel+0.011
oll+0.011
ode+0.011
ust+0.011
Neg logit contributions
 Ms-0.006
 while-0.006
 last-0.005
 teach-0.005
 Mrs-0.005
Token took
Token ID1718
Feature activation+2.41e-02

Pos logit contributions
nes+0.019
vel+0.019
oll+0.018
ode+0.018
ust+0.018
Neg logit contributions
 Ms-0.011
 while-0.010
 last-0.010
 teach-0.009
 Mrs-0.009
Token good
Token ID922
Feature activation-1.02e-02

Pos logit contributions
nes+0.056
ode+0.054
ust+0.053
vel+0.053
oll+0.052
Neg logit contributions
 Ms-0.020
 while-0.019
 last-0.018
 teach-0.017
 Mr-0.017
Token care
Token ID1337
Feature activation-4.14e-03

Pos logit contributions
 Ms+0.012
 Mrs+0.010
 while+0.010
 Mr+0.010
 last+0.009
Neg logit contributions
vel-0.019
oll-0.019
nes-0.019
ode-0.018
aces-0.018
Token of
Token ID286
Feature activation+2.16e-03

Pos logit contributions
 Ms+0.004
 Mrs+0.003
 while+0.003
 Mr+0.003
 teach+0.003
Neg logit contributions
nes-0.009
vel-0.009
oll-0.009
aces-0.008
ode-0.008
Token his
Token ID465
Feature activation-6.53e-02

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.003
 while-0.002
 last-0.002
 Mrs-0.002
 teach-0.002
Token new
Token ID649
Feature activation-6.96e-02

Pos logit contributions
Thank+0.095
glers+0.094
Dr+0.093
 Ms+0.092
thank+0.092
Neg logit contributions
vel-0.072
aky-0.067
oll-0.064
ocket-0.064
aces-0.063
Token tablet
Token ID14147
Feature activation+3.18e-03

Pos logit contributions
Thank+0.125
Dr+0.125
 Ms+0.118
glers+0.118
Grab+0.116
Neg logit contributions
vel-0.060
 pad-0.050
aky-0.049
ont-0.049
 luggage-0.045
Token so
Token ID523
Feature activation+7.21e-04

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.006
aces+0.006
Neg logit contributions
 Ms-0.003
 while-0.003
 last-0.002
 teach-0.002
 Mrs-0.002
Token it
Token ID340
Feature activation-5.20e-04

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
ust+0.001
Neg logit contributions
 Ms-0.001
 while-0.001
 teach-0.001
 last-0.001
 Mrs-0.001
Token wouldn
Token ID3636
Feature activation-3.95e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 last+0.001
 teach+0.001
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
ust-0.001
Token't
Token ID470
Feature activation+1.72e-03

Pos logit contributions
 Ms+0.037
 while+0.031
 praise+0.029
 Mrs+0.028
Thank+0.027
Neg logit contributions
nes-0.087
vel-0.080
oll-0.077
onel-0.076
ust-0.074
TokenOne
Token ID3198
Feature activation-9.20e-03

Pos logit contributions
nes+0.003
vel+0.003
oll+0.003
ust+0.003
ode+0.003
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 teach-0.002
 Mr-0.002
Token day
Token ID1110
Feature activation-2.05e-02

Pos logit contributions
 Ms+0.011
 teach+0.010
 Mrs+0.009
 praise+0.009
 Mr+0.009
Neg logit contributions
oll-0.016
nes-0.016
vel-0.016
ust-0.015
aky-0.015
Token,
Token ID11
Feature activation-3.15e-02

Pos logit contributions
 Ms+0.022
 praise+0.021
 teach+0.021
 last+0.021
glers+0.021
Neg logit contributions
vel-0.037
oll-0.036
nes-0.035
aces-0.033
aund-0.032
Token she
Token ID673
Feature activation-4.37e-03

Pos logit contributions
glers+0.042
 praise+0.041
 hire+0.040
 teach+0.040
 thanks+0.039
Neg logit contributions
nes-0.041
oll-0.039
vel-0.038
aces-0.037
alo-0.035
Token came
Token ID1625
Feature activation-2.94e-02

Pos logit contributions
 Ms+0.006
 while+0.005
 last+0.005
 teach+0.005
 Mrs+0.005
Neg logit contributions
nes-0.008
vel-0.007
oll-0.007
ode-0.007
ust-0.007
Token up
Token ID510
Feature activation-6.95e-02

Pos logit contributions
 Ms+0.044
glers+0.042
 Mrs+0.039
 Mr+0.038
 teach+0.038
Neg logit contributions
vel-0.044
nes-0.044
ust-0.042
aky-0.041
oll-0.039
Token with
Token ID351
Feature activation-1.28e-02

Pos logit contributions
glers+0.106
 Ms+0.094
 Mitt+0.093
 praise+0.092
 choice+0.092
Neg logit contributions
vel-0.094
nes-0.084
aky-0.078
ust-0.077
per-0.075
Token an
Token ID281
Feature activation+5.73e-02

Pos logit contributions
 Ms+0.014
 while+0.014
 teach+0.012
glers+0.012
 before+0.012
Neg logit contributions
vel-0.023
oll-0.023
nes-0.023
aces-0.021
onel-0.021
Token idea
Token ID2126
Feature activation-5.45e-03

Pos logit contributions
nes+0.131
lor+0.119
vel+0.119
ode+0.114
ust+0.114
Neg logit contributions
 last-0.049
 teach-0.048
 Ms-0.045
 while-0.043
 work-0.043
Token.
Token ID13
Feature activation+6.11e-03

Pos logit contributions
 Ms+0.005
 while+0.004
 Mr+0.004
 Mrs+0.004
 last+0.004
Neg logit contributions
nes-0.011
vel-0.011
oll-0.011
aces-0.011
ode-0.011
Token She
Token ID1375
Feature activation-4.23e-03

Pos logit contributions
nes+0.012
vel+0.011
oll+0.011
ode+0.011
ust+0.011
Neg logit contributions
 Ms-0.007
 while-0.007
 last-0.006
 teach-0.006
 Mrs-0.006
Token he
Token ID339
Feature activation-2.93e-03

Pos logit contributions
 Ms+0.006
 while+0.006
 last+0.005
 teach+0.005
 Mrs+0.005
Neg logit contributions
nes-0.011
vel-0.011
oll-0.011
ust-0.010
aces-0.010
Token had
Token ID550
Feature activation+1.45e-02

Pos logit contributions
 Ms+0.004
 while+0.004
 Mrs+0.003
 last+0.003
 Mr+0.003
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.005
ust-0.005
Token chosen
Token ID7147
Feature activation+5.36e-04

Pos logit contributions
nes+0.030
vel+0.030
oll+0.030
ode+0.029
ust+0.028
Neg logit contributions
 Ms-0.014
 while-0.013
 last-0.012
 teach-0.012
 Mrs-0.012
Token to
Token ID284
Feature activation-1.24e-03

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
ust+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 last-0.000
 Mrs-0.000
 teach-0.000
Token watch
Token ID2342
Feature activation-1.76e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 Mr+0.001
 last+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
ust-0.002
Token the
Token ID262
Feature activation-6.93e-02

Pos logit contributions
 Ms+0.016
 while+0.013
 last+0.013
 Mrs+0.012
 teach+0.012
Neg logit contributions
vel-0.036
nes-0.036
oll-0.034
aces-0.033
ode-0.033
Token fun
Token ID1257
Feature activation-1.74e-02

Pos logit contributions
 Ms+0.110
Dr+0.108
 while+0.106
bel+0.105
 Mrs+0.104
Neg logit contributions
vel-0.059
nes-0.049
od-0.048
orry-0.047
onel-0.044
Token and
Token ID290
Feature activation-4.77e-03

Pos logit contributions
 Ms+0.020
 last+0.017
 Mrs+0.017
 Mr+0.017
 while+0.017
Neg logit contributions
nes-0.032
vel-0.031
oll-0.029
ust-0.028
aces-0.028
Token easy
Token ID2562
Feature activation-3.24e-02

Pos logit contributions
 Ms+0.005
 while+0.004
 Mrs+0.004
 last+0.004
 Mr+0.004
Neg logit contributions
nes-0.010
vel-0.009
oll-0.009
ode-0.009
ust-0.009
Token video
Token ID2008
Feature activation+5.22e-03

Pos logit contributions
 Ms+0.040
 Mrs+0.036
 Mr+0.036
Thank+0.035
 Mitt+0.034
Neg logit contributions
vel-0.053
nes-0.051
oll-0.047
aces-0.045
onel-0.044
Token.
Token ID13
Feature activation-5.99e-03

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.011
ust+0.011
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token his
Token ID465
Feature activation-2.66e-02

Pos logit contributions
 Ms+0.005
 while+0.005
 last+0.005
 teach+0.004
 Mrs+0.004
Neg logit contributions
nes-0.007
vel-0.007
oll-0.006
ode-0.006
aces-0.006
Token dad
Token ID9955
Feature activation-1.31e-02

Pos logit contributions
 Ms+0.052
 while+0.049
 Mitt+0.047
glers+0.047
 Mrs+0.046
Neg logit contributions
oll-0.027
vel-0.023
nes-0.022
aky-0.021
ode-0.021
Token for
Token ID329
Feature activation+1.47e-02

Pos logit contributions
 Ms+0.014
 teach+0.012
 while+0.011
 last+0.011
 Mitt+0.011
Neg logit contributions
vel-0.025
nes-0.025
oll-0.025
ode-0.023
ust-0.023
Token replacing
Token ID13586
Feature activation-3.07e-02

Pos logit contributions
nes+0.033
vel+0.032
oll+0.032
ust+0.031
ode+0.031
Neg logit contributions
 Ms-0.013
 while-0.012
 teach-0.011
 last-0.011
 Mrs-0.011
Token his
Token ID465
Feature activation-5.35e-02

Pos logit contributions
 Ms+0.032
Thank+0.031
 while+0.031
 last+0.029
 visit+0.028
Neg logit contributions
vel-0.054
nes-0.052
oll-0.049
aces-0.049
aky-0.048
Token old
Token ID1468
Feature activation-6.91e-02

Pos logit contributions
Thank+0.083
 Ms+0.082
glers+0.082
thank+0.076
 Mrs+0.076
Neg logit contributions
vel-0.062
aky-0.058
aces-0.056
 pad-0.054
oll-0.054
Token tablet
Token ID14147
Feature activation-6.75e-03

Pos logit contributions
Thank+0.120
 Ms+0.118
glers+0.117
Grab+0.111
Yes+0.111
Neg logit contributions
vel-0.060
aces-0.056
 pad-0.056
aky-0.053
ont-0.052
Token with
Token ID351
Feature activation-2.72e-02

Pos logit contributions
 Ms+0.006
 while+0.005
 Mrs+0.005
 teach+0.005
 last+0.005
Neg logit contributions
nes-0.014
vel-0.014
oll-0.014
aces-0.013
ode-0.013
Token a
Token ID257
Feature activation-3.28e-02

Pos logit contributions
 Ms+0.035
 while+0.028
glers+0.028
 Mrs+0.028
 last+0.027
Neg logit contributions
vel-0.044
oll-0.042
nes-0.042
aces-0.042
aky-0.040
Token new
Token ID649
Feature activation-5.90e-02

Pos logit contributions
 Ms+0.041
glers+0.036
 Mrs+0.034
 last+0.034
 Mr+0.033
Neg logit contributions
vel-0.053
nes-0.049
aky-0.047
 pad-0.046
oll-0.046
Token one
Token ID530
Feature activation+2.66e-03

Pos logit contributions
 Ms+0.090
glers+0.089
Thank+0.086
Dr+0.084
 Mrs+0.082
Neg logit contributions
 pad-0.066
vel-0.065
nes-0.060
 mate-0.059
aky-0.058
Token then
Token ID788
Feature activation-1.34e-02

Pos logit contributions
 Ms+0.005
 teach+0.005
 last+0.004
 while+0.004
 praise+0.004
Neg logit contributions
nes-0.011
vel-0.010
oll-0.010
ust-0.010
ode-0.010
Token a
Token ID257
Feature activation-1.26e-02

Pos logit contributions
 Ms+0.012
 teach+0.011
 last+0.011
 praise+0.011
 while+0.010
Neg logit contributions
vel-0.027
oll-0.027
nes-0.026
ust-0.025
aces-0.024
Token big
Token ID1263
Feature activation+2.29e-02

Pos logit contributions
 Ms+0.015
 while+0.014
 Mrs+0.013
 teach+0.013
 last+0.013
Neg logit contributions
nes-0.021
vel-0.021
oll-0.021
aky-0.020
ode-0.020
Token cat
Token ID3797
Feature activation-3.96e-03

Pos logit contributions
nes+0.049
vel+0.047
oll+0.045
ust+0.045
ode+0.044
Neg logit contributions
 Ms-0.025
 while-0.024
 last-0.023
 teach-0.022
 Mr-0.022
Token comes
Token ID2058
Feature activation-2.92e-02

Pos logit contributions
 Ms+0.005
 while+0.005
 last+0.005
 Mrs+0.005
 Mr+0.005
Neg logit contributions
vel-0.007
nes-0.007
oll-0.007
ode-0.006
aces-0.006
Token and
Token ID290
Feature activation-6.80e-02

Pos logit contributions
 Ms+0.041
Thank+0.036
 teach+0.035
 praise+0.035
 Mrs+0.035
Neg logit contributions
vel-0.048
nes-0.045
ust-0.045
aky-0.044
aces-0.041
Token sees
Token ID7224
Feature activation+5.74e-03

Pos logit contributions
 hire+0.103
thank+0.100
 whenever+0.100
 choice+0.096
 praise+0.096
Neg logit contributions
vel-0.070
oll-0.068
ox-0.066
aund-0.064
aces-0.063
Token the
Token ID262
Feature activation-2.37e-02

Pos logit contributions
nes+0.012
vel+0.012
oll+0.012
ode+0.011
ust+0.011
Neg logit contributions
 Ms-0.006
 while-0.005
 last-0.005
 teach-0.005
 Mrs-0.005
Token toy
Token ID13373
Feature activation-2.54e-03

Pos logit contributions
 Ms+0.039
 while+0.034
 Mrs+0.033
 whenever+0.032
 every+0.032
Neg logit contributions
oll-0.030
ode-0.030
aces-0.029
vel-0.029
aky-0.029
Token birds
Token ID10087
Feature activation+3.00e-03

Pos logit contributions
 Ms+0.003
 while+0.002
 Mrs+0.002
 last+0.002
 teach+0.002
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
ode-0.005
aces-0.005
Token.
Token ID13
Feature activation+1.11e-02

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.006
ust+0.006
Neg logit contributions
 Ms-0.003
 while-0.003
 last-0.003
 Mrs-0.003
 teach-0.003
Token,
Token ID11
Feature activation-2.54e-02

Pos logit contributions
 Ms+0.014
 last+0.013
 teach+0.013
 praise+0.013
glers+0.013
Neg logit contributions
vel-0.031
nes-0.030
oll-0.030
aces-0.028
anna-0.027
Token a
Token ID257
Feature activation+3.28e-03

Pos logit contributions
 Ms+0.027
 teach+0.027
 praise+0.027
glers+0.027
 last+0.026
Neg logit contributions
nes-0.040
vel-0.040
oll-0.038
aces-0.036
onel-0.035
Token little
Token ID1310
Feature activation+5.17e-03

Pos logit contributions
nes+0.007
vel+0.007
oll+0.006
ode+0.006
ust+0.006
Neg logit contributions
 Ms-0.003
 while-0.003
 last-0.003
 teach-0.003
 Mrs-0.003
Token girl
Token ID2576
Feature activation-1.80e-02

Pos logit contributions
nes+0.010
vel+0.009
oll+0.009
ode+0.009
ust+0.009
Neg logit contributions
 Ms-0.006
 while-0.006
 last-0.006
 teach-0.005
 Mrs-0.005
Token came
Token ID1625
Feature activation-2.54e-02

Pos logit contributions
 Ms+0.024
 last+0.022
 teach+0.021
 while+0.021
 Mr+0.021
Neg logit contributions
nes-0.029
vel-0.029
oll-0.028
ode-0.027
aky-0.026
Token up
Token ID510
Feature activation-6.79e-02

Pos logit contributions
 Ms+0.034
 Mrs+0.029
 Mr+0.029
 teach+0.028
 praise+0.028
Neg logit contributions
vel-0.042
nes-0.041
ust-0.039
aky-0.039
adow-0.038
Token to
Token ID284
Feature activation-2.20e-02

Pos logit contributions
glers+0.103
 Ms+0.101
 praise+0.101
 choice+0.100
 teach+0.097
Neg logit contributions
vel-0.072
nes-0.066
ust-0.064
adow-0.062
aky-0.061
Token her
Token ID607
Feature activation-1.65e-02

Pos logit contributions
 Ms+0.021
glers+0.021
 while+0.020
 last+0.019
s+0.019
Neg logit contributions
nes-0.039
vel-0.038
ode-0.035
ocket-0.034
oll-0.034
Token,
Token ID11
Feature activation-3.01e-04

Pos logit contributions
 Ms+0.022
 while+0.019
glers+0.018
 Mrs+0.018
 praise+0.018
Neg logit contributions
nes-0.027
oll-0.026
vel-0.026
ode-0.025
adow-0.024
Token looked
Token ID3114
Feature activation-7.66e-03

Pos logit contributions
 Ms+0.000
 while+0.000
 last+0.000
 teach+0.000
 Mrs+0.000
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
ust-0.001
Token up
Token ID510
Feature activation-4.79e-02

Pos logit contributions
 Ms+0.010
 while+0.009
 teach+0.009
 last+0.009
 visit+0.008
Neg logit contributions
vel-0.013
oll-0.013
nes-0.013
ust-0.012
aky-0.012
Token high
Token ID1029
Feature activation-1.15e-02

Pos logit contributions
 Ms+0.036
 teach+0.034
 praise+0.032
 kiss+0.031
 hire+0.031
Neg logit contributions
nes-0.049
vel-0.046
oll-0.045
aces-0.044
ust-0.042
Token and
Token ID290
Feature activation-8.35e-03

Pos logit contributions
 Ms+0.014
 Mrs+0.012
 teach+0.012
 Mr+0.012
 praise+0.012
Neg logit contributions
nes-0.021
vel-0.020
oll-0.020
aces-0.020
ode-0.019
Token then
Token ID788
Feature activation+1.04e-02

Pos logit contributions
 Ms+0.011
 while+0.010
 Mrs+0.010
 teach+0.010
 last+0.009
Neg logit contributions
nes-0.014
vel-0.013
oll-0.013
ode-0.013
ust-0.013
Token come
Token ID1282
Feature activation-2.41e-02

Pos logit contributions
nes+0.019
vel+0.019
oll+0.019
ode+0.018
ust+0.018
Neg logit contributions
 Ms-0.013
 while-0.012
 last-0.011
 teach-0.011
 Mrs-0.011
Token back
Token ID736
Feature activation-1.90e-02

Pos logit contributions
 Ms+0.042
 Mrs+0.037
 Mr+0.036
 praise+0.036
 teach+0.035
Neg logit contributions
nes-0.032
vel-0.030
ode-0.029
aces-0.028
oll-0.028
Token down
Token ID866
Feature activation-6.78e-02

Pos logit contributions
 Ms+0.025
 teach+0.021
 Mrs+0.021
 praise+0.020
 Mr+0.020
Neg logit contributions
nes-0.032
vel-0.031
oll-0.031
ust-0.029
aces-0.029
Token.
Token ID13
Feature activation-3.58e-03

Pos logit contributions
glers+0.100
 Ms+0.099
 Mitt+0.098
mun+0.096
 praise+0.093
Neg logit contributions
nes-0.068
vel-0.065
 itself-0.060
oll-0.060
aw-0.060
Token It
Token ID632
Feature activation+1.75e-04

Pos logit contributions
 Ms+0.003
 while+0.003
 last+0.003
 teach+0.003
 Mrs+0.003
Neg logit contributions
nes-0.007
vel-0.007
oll-0.007
ode-0.007
ust-0.007
Token was
Token ID373
Feature activation-2.78e-02

Pos logit contributions
nes+0.000
vel+0.000
oll+0.000
ust+0.000
ode+0.000
Neg logit contributions
 Ms-0.000
 while-0.000
 teach-0.000
 Mrs-0.000
 last-0.000
Token so
Token ID523
Feature activation-1.23e-02

Pos logit contributions
 Ms+0.038
 teach+0.038
 praise+0.037
 hire+0.035
 kiss+0.034
Neg logit contributions
oll-0.040
aky-0.038
ust-0.037
vel-0.037
nes-0.036
Token much
Token ID881
Feature activation-1.85e-02

Pos logit contributions
 Ms+0.015
 teach+0.014
 while+0.013
 last+0.013
 praise+0.012
Neg logit contributions
nes-0.022
oll-0.022
ust-0.021
vel-0.021
aky-0.020
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
nes+0.021
vel+0.021
oll+0.020
ode+0.020
ust+0.020
Neg logit contributions
 Ms-0.007
 while-0.006
 last-0.006
 teach-0.005
 Mrs-0.005
Token "
Token ID366
Feature activation+3.75e-02

Pos logit contributions
nes+0.022
vel+0.022
oll+0.021
ode+0.021
ust+0.021
Neg logit contributions
 Ms-0.011
 while-0.010
 last-0.009
 Mr-0.009
 Mrs-0.009
TokenA
Token ID32
Feature activation+3.21e-02

Pos logit contributions
oll+0.079
nes+0.078
vel+0.077
onel+0.075
ode+0.075
Neg logit contributions
 while-0.035
 Ms-0.031
Thank-0.031
 last-0.030
Yes-0.029
Tokenbr
Token ID1671
Feature activation+5.20e-02

Pos logit contributions
nes+0.056
oll+0.056
ust+0.055
aces+0.054
ode+0.054
Neg logit contributions
 while-0.044
 last-0.040
 teach-0.039
 Ms-0.039
 visit-0.038
Tokenac
Token ID330
Feature activation+1.67e-02

Pos logit contributions
nes+0.082
vel+0.075
onel+0.075
ode+0.074
oll+0.073
Neg logit contributions
 last-0.071
 while-0.066
 Ms-0.062
 teach-0.061
 every-0.061
Tokenad
Token ID324
Feature activation+4.79e-02

Pos logit contributions
nes+0.030
vel+0.029
oll+0.028
ust+0.027
ode+0.027
Neg logit contributions
 Ms-0.020
 last-0.020
 teach-0.019
 while-0.019
 praise-0.018
Tokenab
Token ID397
Feature activation+1.24e-02

Pos logit contributions
nes+0.085
ust+0.083
ode+0.082
oll+0.079
vel+0.077
Neg logit contributions
 while-0.059
 last-0.056
 teach-0.056
 Ms-0.056
 thanks-0.052
Tokenra
Token ID430
Feature activation+8.53e-03

Pos logit contributions
nes+0.019
vel+0.018
oll+0.018
ode+0.017
ust+0.017
Neg logit contributions
 Ms-0.019
 while-0.018
 last-0.017
 teach-0.017
 Mr-0.017
Token!"
Token ID2474
Feature activation+5.58e-03

Pos logit contributions
nes+0.018
vel+0.017
oll+0.017
ode+0.017
ust+0.016
Neg logit contributions
 Ms-0.008
 while-0.007
 last-0.007
 teach-0.007
 Mrs-0.007
Token
Token ID198
Feature activation-5.82e-03

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.011
ust+0.010
Neg logit contributions
 Ms-0.006
 while-0.005
 last-0.005
 teach-0.005
 Mrs-0.005
Token
Token ID198
Feature activation+5.46e-03

Pos logit contributions
 Ms+0.006
 while+0.005
 teach+0.005
 last+0.005
 Mrs+0.005
Neg logit contributions
nes-0.012
vel-0.012
oll-0.012
ust-0.011
aces-0.011
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
nes+0.049
vel+0.048
oll+0.047
ode+0.046
ust+0.045
Neg logit contributions
 while-0.012
 Ms-0.011
 last-0.010
 Mr-0.009
 before-0.009
Token
Token ID220
Feature activation+2.79e-03

Pos logit contributions
vel+0.046
nes+0.046
oll+0.045
ode+0.044
ust+0.043
Neg logit contributions
 Ms-0.018
 while-0.016
 Mr-0.015
 Mrs-0.015
 Mitt-0.013
Token
Token ID198
Feature activation-3.57e-03

Pos logit contributions
nes+0.006
vel+0.006
oll+0.006
ode+0.006
ust+0.006
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token
Token ID198
Feature activation+9.66e-03

Pos logit contributions
 Ms+0.003
 while+0.003
 last+0.003
 teach+0.003
 Mrs+0.003
Neg logit contributions
nes-0.008
vel-0.007
oll-0.007
ust-0.007
ode-0.007
TokenThe
Token ID464
Feature activation-2.56e-02

Pos logit contributions
nes+0.020
oll+0.020
vel+0.020
ode+0.019
aces+0.019
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.008
 teach-0.008
 Mrs-0.008
Token people
Token ID661
Feature activation+4.99e-02

Pos logit contributions
 Ms+0.044
 Mrs+0.040
 while+0.039
 Mitt+0.038
 downward+0.037
Neg logit contributions
ode-0.031
nes-0.030
vel-0.029
aky-0.029
oll-0.028
Token around
Token ID1088
Feature activation+8.23e-03

Pos logit contributions
oll+0.107
nes+0.103
vel+0.103
aces+0.102
ode+0.100
Neg logit contributions
 while-0.055
 last-0.048
 when-0.044
 Ms-0.044
 every-0.043
Token the
Token ID262
Feature activation-1.38e-02

Pos logit contributions
nes+0.018
vel+0.018
oll+0.017
ode+0.017
ust+0.017
Neg logit contributions
 Ms-0.007
 while-0.007
 last-0.006
 teach-0.006
 Mrs-0.006
Token tank
Token ID6873
Feature activation+1.02e-02

Pos logit contributions
 Ms+0.023
 Mrs+0.021
 while+0.021
 Mitt+0.021
 last+0.021
Neg logit contributions
nes-0.018
ode-0.017
vel-0.017
aky-0.016
oll-0.015
Token were
Token ID547
Feature activation+4.61e-02

Pos logit contributions
nes+0.020
vel+0.020
oll+0.019
ode+0.018
ust+0.018
Neg logit contributions
 Ms-0.011
 while-0.011
 last-0.010
 teach-0.010
 Mrs-0.010
Token worried
Token ID7960
Feature activation+2.44e-02

Pos logit contributions
ode+0.117
oll+0.116
nes+0.115
vel+0.114
aces+0.111
Neg logit contributions
 while-0.024
 Ms-0.024
 last-0.024
 tired-0.023
 Mr-0.021
Token he
Token ID339
Feature activation-6.14e-03

Pos logit contributions
 Ms+0.030
 praise+0.029
 teach+0.028
 last+0.027
 while+0.026
Neg logit contributions
vel-0.036
oll-0.035
nes-0.035
ust-0.032
ode-0.032
Token had
Token ID550
Feature activation+2.60e-02

Pos logit contributions
 Ms+0.008
 last+0.007
 while+0.007
 Mrs+0.007
 teach+0.007
Neg logit contributions
nes-0.011
vel-0.010
oll-0.010
ode-0.010
ust-0.010
Token to
Token ID284
Feature activation-1.45e-02

Pos logit contributions
nes+0.057
oll+0.057
vel+0.056
ode+0.053
ust+0.053
Neg logit contributions
 Ms-0.022
 while-0.022
 last-0.021
 teach-0.020
 every-0.020
Token take
Token ID1011
Feature activation+2.39e-03

Pos logit contributions
 Ms+0.016
 while+0.014
 Mrs+0.014
 Mr+0.013
 Mitt+0.013
Neg logit contributions
nes-0.028
vel-0.027
ode-0.027
oll-0.026
aces-0.026
Token a
Token ID257
Feature activation+1.08e-02

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.005
aces+0.005
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token closer
Token ID5699
Feature activation+5.02e-02

Pos logit contributions
nes+0.023
vel+0.023
oll+0.022
aces+0.021
ode+0.021
Neg logit contributions
 Ms-0.010
 while-0.010
 last-0.009
 teach-0.009
 Mrs-0.008
Token look
Token ID804
Feature activation-1.05e-02

Pos logit contributions
nes+0.112
onel+0.107
vel+0.107
oll+0.107
ust+0.103
Neg logit contributions
 while-0.055
 last-0.050
 visit-0.046
 teach-0.042
 kiss-0.040
Token.
Token ID13
Feature activation-5.75e-03

Pos logit contributions
 Ms+0.012
 teach+0.011
 Mrs+0.011
 Mr+0.010
 Mitt+0.010
Neg logit contributions
vel-0.020
nes-0.020
oll-0.019
aces-0.018
ust-0.018
Token With
Token ID2080
Feature activation-2.50e-02

Pos logit contributions
 Ms+0.006
 last+0.006
 teach+0.006
 while+0.006
 praise+0.005
Neg logit contributions
nes-0.011
vel-0.010
oll-0.010
ode-0.010
ust-0.010
Token a
Token ID257
Feature activation-2.53e-03

Pos logit contributions
 Ms+0.029
 while+0.026
glers+0.025
 teach+0.025
 hire+0.024
Neg logit contributions
oll-0.043
vel-0.043
aces-0.039
nes-0.038
onel-0.038
Token big
Token ID1263
Feature activation+5.15e-03

Pos logit contributions
 Ms+0.003
 while+0.002
 Mrs+0.002
 last+0.002
 Mr+0.002
Neg logit contributions
vel-0.005
nes-0.005
oll-0.005
ode-0.005
ust-0.005
Token hair
Token ID4190
Feature activation-5.38e-03

Pos logit contributions
nes+0.076
vel+0.073
ust+0.069
onel+0.066
oll+0.066
Neg logit contributions
 last-0.058
 while-0.056
 visit-0.056
 teach-0.055
 Mr-0.053
Token and
Token ID290
Feature activation-5.61e-03

Pos logit contributions
 Ms+0.005
 Mrs+0.004
 while+0.004
 teach+0.004
 Mr+0.004
Neg logit contributions
nes-0.011
vel-0.011
oll-0.011
aces-0.011
ode-0.010
Token the
Token ID262
Feature activation+5.22e-03

Pos logit contributions
 Ms+0.006
 while+0.005
 teach+0.005
 last+0.005
 Mrs+0.005
Neg logit contributions
nes-0.011
vel-0.011
oll-0.011
aces-0.010
ode-0.010
Token sun
Token ID4252
Feature activation-7.98e-04

Pos logit contributions
nes+0.009
vel+0.008
oll+0.008
ode+0.008
ust+0.008
Neg logit contributions
 Ms-0.007
 while-0.007
 last-0.007
 teach-0.006
 Mrs-0.006
Token on
Token ID319
Feature activation+6.79e-03

Pos logit contributions
 Ms+0.001
 while+0.001
 teach+0.001
 Mrs+0.001
 last+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.001
aces-0.001
Token his
Token ID465
Feature activation+4.53e-02

Pos logit contributions
nes+0.013
vel+0.013
oll+0.012
ode+0.012
ust+0.012
Neg logit contributions
 Ms-0.008
 while-0.007
 last-0.007
 teach-0.007
 Mrs-0.007
Token face
Token ID1986
Feature activation+3.59e-03

Pos logit contributions
nes+0.081
vel+0.078
ust+0.074
oll+0.074
onel+0.073
Neg logit contributions
 last-0.062
 while-0.059
 teach-0.057
 before-0.056
 every-0.056
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
nes+0.008
vel+0.008
oll+0.008
ode+0.008
ust+0.007
Neg logit contributions
 Ms-0.003
 while-0.003
 last-0.002
 teach-0.002
 Mrs-0.002
Token He
Token ID679
Feature activation-8.96e-03

Pos logit contributions
nes+0.023
vel+0.022
oll+0.021
ode+0.021
ust+0.021
Neg logit contributions
 Ms-0.013
 while-0.012
 Mr-0.011
 Mrs-0.011
 last-0.011
Token liked
Token ID8288
Feature activation+4.56e-03

Pos logit contributions
 Ms+0.012
 while+0.010
 Mrs+0.010
 teach+0.010
 last+0.010
Neg logit contributions
nes-0.016
oll-0.015
vel-0.015
aces-0.014
ode-0.014
Token the
Token ID262
Feature activation-1.04e-02

Pos logit contributions
nes+0.010
vel+0.009
oll+0.009
ode+0.009
ust+0.009
Neg logit contributions
 Ms-0.004
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token,
Token ID11
Feature activation+2.28e-02

Pos logit contributions
nes+0.041
vel+0.040
oll+0.039
ode+0.039
ust+0.038
Neg logit contributions
 while-0.013
 Ms-0.013
 last-0.011
 before-0.010
 teach-0.010
Token and
Token ID290
Feature activation+1.66e-02

Pos logit contributions
nes+0.054
vel+0.051
oll+0.051
ust+0.049
ode+0.049
Neg logit contributions
 Ms-0.019
 while-0.019
 Mr-0.016
 Mrs-0.016
 before-0.016
Token then
Token ID788
Feature activation+5.40e-03

Pos logit contributions
nes+0.038
vel+0.037
oll+0.037
ode+0.036
ust+0.035
Neg logit contributions
 Ms-0.014
 while-0.013
 Mrs-0.011
 Mr-0.011
 last-0.011
Token he
Token ID339
Feature activation-4.21e-03

Pos logit contributions
nes+0.012
vel+0.011
oll+0.011
ode+0.011
ust+0.011
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token had
Token ID550
Feature activation+1.36e-02

Pos logit contributions
 Ms+0.005
 while+0.004
 last+0.004
 teach+0.004
 Mrs+0.004
Neg logit contributions
nes-0.008
vel-0.008
oll-0.008
ode-0.007
ust-0.007
Token an
Token ID281
Feature activation+4.58e-02

Pos logit contributions
nes+0.030
vel+0.029
oll+0.029
ode+0.028
ust+0.028
Neg logit contributions
 Ms-0.012
 while-0.011
 last-0.010
 Mrs-0.010
 teach-0.010
Token idea
Token ID2126
Feature activation+7.78e-03

Pos logit contributions
nes+0.105
vel+0.096
ode+0.093
lor+0.092
oll+0.091
Neg logit contributions
 last-0.040
 teach-0.039
 while-0.036
 Ms-0.035
 work-0.034
Token too
Token ID1165
Feature activation+1.02e-03

Pos logit contributions
nes+0.018
vel+0.018
oll+0.017
ode+0.017
ust+0.017
Neg logit contributions
 Ms-0.006
 while-0.005
 last-0.005
 teach-0.005
 Mrs-0.005
Token.
Token ID13
Feature activation+6.54e-03

Pos logit contributions
nes+0.002
vel+0.002
oll+0.002
ust+0.002
ode+0.002
Neg logit contributions
 Ms-0.001
 while-0.001
 last-0.001
 teach-0.001
 Mrs-0.001
Token He
Token ID679
Feature activation-8.29e-04

Pos logit contributions
nes+0.014
vel+0.013
oll+0.013
ode+0.013
ust+0.013
Neg logit contributions
 Ms-0.006
 while-0.006
 last-0.005
 Mrs-0.005
 teach-0.005
Token said
Token ID531
Feature activation+2.11e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 last+0.001
 Mrs+0.001
 teach+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.001
ust-0.001
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
nes+0.016
vel+0.016
oll+0.015
ode+0.015
ust+0.015
Neg logit contributions
 Ms-0.007
 while-0.007
 last-0.006
 teach-0.006
 Mrs-0.006
Token
Token ID198
Feature activation+9.85e-03

Pos logit contributions
nes+0.024
vel+0.024
oll+0.023
ode+0.023
aces+0.022
Neg logit contributions
 Ms-0.009
 while-0.008
 Mrs-0.007
 Mr-0.007
 last-0.007
Token
Token ID198
Feature activation+1.63e-02

Pos logit contributions
nes+0.019
vel+0.019
oll+0.018
ode+0.018
aces+0.018
Neg logit contributions
 Ms-0.011
 while-0.010
 last-0.010
 Mrs-0.009
 Mr-0.009
TokenSo
Token ID2396
Feature activation-1.65e-03

Pos logit contributions
nes+0.033
oll+0.033
vel+0.033
aces+0.032
ode+0.032
Neg logit contributions
 Ms-0.016
 while-0.015
 Mrs-0.013
 last-0.013
 Mr-0.013
Token Jack
Token ID3619
Feature activation+3.95e-03

Pos logit contributions
 Ms+0.001
 while+0.001
 teach+0.001
 last+0.001
 Mrs+0.001
Neg logit contributions
nes-0.004
vel-0.003
oll-0.003
ust-0.003
ode-0.003
Token and
Token ID290
Feature activation+9.71e-03

Pos logit contributions
nes+0.008
vel+0.008
oll+0.008
ode+0.008
ust+0.007
Neg logit contributions
 Ms-0.004
 while-0.004
 last-0.003
 teach-0.003
 Mrs-0.003
Token Lucy
Token ID22162
Feature activation+2.84e-02

Pos logit contributions
nes+0.019
vel+0.019
oll+0.018
ode+0.018
aces+0.017
Neg logit contributions
 Ms-0.011
 while-0.010
 last-0.009
 Mrs-0.009
 Mr-0.009
Token had
Token ID550
Feature activation+2.12e-02

Pos logit contributions
vel+0.055
nes+0.054
oll+0.054
aces+0.051
ode+0.051
Neg logit contributions
 Ms-0.031
 while-0.030
 last-0.029
 teach-0.029
 visit-0.027
Token a
Token ID257
Feature activation+1.41e-02

Pos logit contributions
nes+0.047
oll+0.047
vel+0.047
ode+0.045
onel+0.044
Neg logit contributions
 Ms-0.018
 while-0.017
 last-0.016
 Mr-0.016
 Mrs-0.016
Token special
Token ID2041
Feature activation+1.05e-02

Pos logit contributions
nes+0.027
vel+0.026
oll+0.026
aces+0.025
ode+0.025
Neg logit contributions
 Ms-0.017
 while-0.016
 last-0.015
 teach-0.015
 Mrs-0.015
Token surprise
Token ID5975
Feature activation+1.21e-02

Pos logit contributions
nes+0.018
vel+0.018
oll+0.017
ode+0.016
ust+0.016
Neg logit contributions
 Ms-0.014
 while-0.013
 last-0.013
 teach-0.013
 Mrs-0.012
Token car
Token ID1097
Feature activation+8.04e-03

Pos logit contributions
nes+0.017
vel+0.017
oll+0.016
ode+0.016
aces+0.016
Neg logit contributions
 Ms-0.014
 while-0.013
 last-0.012
 Mrs-0.012
 teach-0.012
Tokenpenter
Token ID26419
Feature activation-9.99e-03

Pos logit contributions
nes+0.017
vel+0.017
oll+0.016
ode+0.016
ust+0.016
Neg logit contributions
 Ms-0.008
 while-0.007
 last-0.007
 teach-0.006
 Mrs-0.006
Token's
Token ID338
Feature activation+1.05e-02

Pos logit contributions
 Ms+0.014
 while+0.012
 Mrs+0.012
 Mr+0.012
 Mitt+0.012
Neg logit contributions
nes-0.016
vel-0.016
oll-0.015
ode-0.015
aces-0.014
Token garage
Token ID15591
Feature activation-1.84e-02

Pos logit contributions
nes+0.018
vel+0.017
oll+0.017
ust+0.016
ode+0.016
Neg logit contributions
 Ms-0.015
 while-0.014
 last-0.013
 teach-0.013
 Mr-0.013
Token.
Token ID13
Feature activation-1.52e-03

Pos logit contributions
 Ms+0.019
 teach+0.016
 Mrs+0.016
 Mr+0.015
 Mitt+0.015
Neg logit contributions
nes-0.037
vel-0.035
oll-0.035
aces-0.034
ode-0.034
Token They
Token ID1119
Feature activation+4.86e-03

Pos logit contributions
 Ms+0.002
 while+0.001
 last+0.001
 teach+0.001
 Mrs+0.001
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
ust-0.003
Token saw
Token ID2497
Feature activation+1.99e-03

Pos logit contributions
nes+0.009
vel+0.009
oll+0.009
ode+0.008
ust+0.008
Neg logit contributions
 Ms-0.006
 while-0.005
 last-0.005
 teach-0.005
 Mrs-0.005
Token the
Token ID262
Feature activation-2.13e-02

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token car
Token ID1097
Feature activation-9.24e-03

Pos logit contributions
 Ms+0.034
 while+0.029
 Mrs+0.029
 teach+0.028
 Mitt+0.028
Neg logit contributions
ode-0.030
nes-0.029
oll-0.029
vel-0.029
aces-0.028
Tokenpenter
Token ID26419
Feature activation-7.58e-03

Pos logit contributions
 Ms+0.010
 while+0.009
 Mrs+0.009
 Mitt+0.008
 Mr+0.008
Neg logit contributions
nes-0.018
vel-0.017
oll-0.017
ode-0.017
aces-0.016
Token operating
Token ID5361
Feature activation-5.65e-03

Pos logit contributions
 Ms+0.008
 while+0.007
 Mrs+0.007
 last+0.007
 teach+0.007
Neg logit contributions
nes-0.015
vel-0.015
oll-0.014
ode-0.014
aces-0.013
Token took
Token ID1718
Feature activation+2.10e-03

Pos logit contributions
nes+0.011
vel+0.010
oll+0.010
ode+0.010
ust+0.010
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token his
Token ID465
Feature activation-1.07e-02

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token book
Token ID1492
Feature activation+5.19e-03

Pos logit contributions
 Ms+0.017
 while+0.016
 Mitt+0.015
 Mrs+0.015
 Mr+0.015
Neg logit contributions
vel-0.015
nes-0.015
oll-0.015
aces-0.014
ode-0.013
Token and
Token ID290
Feature activation-9.39e-03

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.010
ust+0.010
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Token a
Token ID257
Feature activation+1.04e-02

Pos logit contributions
 Ms+0.011
 while+0.010
 last+0.010
 teach+0.009
 Mrs+0.009
Neg logit contributions
nes-0.017
vel-0.016
oll-0.016
ode-0.016
aces-0.016
Token pen
Token ID3112
Feature activation+1.05e-02

Pos logit contributions
nes+0.019
vel+0.019
oll+0.018
ode+0.018
ust+0.018
Neg logit contributions
 Ms-0.013
 while-0.012
 last-0.011
 teach-0.011
 Mrs-0.011
Token and
Token ID290
Feature activation+2.16e-03

Pos logit contributions
nes+0.023
vel+0.023
oll+0.023
ode+0.022
ust+0.022
Neg logit contributions
 Ms-0.008
 while-0.008
 last-0.007
 teach-0.007
 Mrs-0.007
Token wrote
Token ID2630
Feature activation+7.12e-03

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token,
Token ID11
Feature activation-8.71e-03

Pos logit contributions
nes+0.014
vel+0.014
oll+0.014
ode+0.013
ust+0.013
Neg logit contributions
 Ms-0.008
 while-0.007
 last-0.007
 teach-0.007
 Mrs-0.007
Token "
Token ID366
Feature activation+7.20e-03

Pos logit contributions
 Ms+0.009
 last+0.008
 while+0.008
 teach+0.008
 Mrs+0.007
Neg logit contributions
vel-0.017
nes-0.016
oll-0.016
aces-0.016
ode-0.015
TokenTo
Token ID2514
Feature activation+1.48e-02

Pos logit contributions
nes+0.014
vel+0.013
oll+0.013
ode+0.012
ust+0.012
Neg logit contributions
 Ms-0.009
 while-0.008
 last-0.007
 teach-0.007
 Mrs-0.007
Token dog
Token ID3290
Feature activation+1.08e-02

Pos logit contributions
nes+0.011
vel+0.010
oll+0.010
ode+0.010
ust+0.010
Neg logit contributions
 Ms-0.009
 while-0.008
 last-0.008
 teach-0.008
 Mrs-0.008
Token wasn
Token ID2492
Feature activation-3.09e-03

Pos logit contributions
nes+0.023
vel+0.022
oll+0.022
ode+0.021
ust+0.021
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.009
 teach-0.008
 Mrs-0.008
Token't
Token ID470
Feature activation+4.71e-02

Pos logit contributions
 Ms+0.002
 while+0.001
 teach+0.001
 Mrs+0.001
 last+0.001
Neg logit contributions
nes-0.008
vel-0.008
oll-0.008
ust-0.007
ode-0.007
Token so
Token ID523
Feature activation+4.09e-03

Pos logit contributions
nes+0.102
ode+0.100
oll+0.099
vel+0.096
lor+0.094
Neg logit contributions
 while-0.047
 Ms-0.045
 Mr-0.043
 Mrs-0.043
 before-0.043
Token bad
Token ID2089
Feature activation+1.58e-02

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.005
 while-0.005
 last-0.004
 teach-0.004
 Mrs-0.004
Token after
Token ID706
Feature activation+1.13e-02

Pos logit contributions
nes+0.037
vel+0.036
oll+0.035
ode+0.035
ust+0.034
Neg logit contributions
 Ms-0.012
 while-0.011
 last-0.010
 Mr-0.010
 Mrs-0.009
Token all
Token ID477
Feature activation+8.25e-03

Pos logit contributions
nes+0.021
vel+0.020
oll+0.020
ode+0.019
ust+0.019
Neg logit contributions
 Ms-0.014
 while-0.013
 last-0.012
 Mr-0.012
 Mrs-0.012
Token!
Token ID0
Feature activation+2.24e-02

Pos logit contributions
nes+0.019
vel+0.019
oll+0.018
ode+0.018
ust+0.018
Neg logit contributions
 Ms-0.006
 while-0.006
 last-0.005
 Mrs-0.005
 teach-0.005
Token He
Token ID679
Feature activation+1.59e-02

Pos logit contributions
nes+0.047
vel+0.047
ode+0.045
oll+0.045
ust+0.043
Neg logit contributions
 Ms-0.024
 Mr-0.021
 Mrs-0.021
 while-0.020
 Mitt-0.019
Token stayed
Token ID9658
Feature activation+1.22e-02

Pos logit contributions
nes+0.032
vel+0.031
oll+0.030
ode+0.029
aces+0.029
Neg logit contributions
 Ms-0.017
 while-0.016
 last-0.015
 teach-0.015
 Mr-0.015
Token up
Token ID510
Feature activation-2.40e-02

Pos logit contributions
nes+0.022
vel+0.021
oll+0.021
ode+0.020
aces+0.020
Neg logit contributions
 Ms-0.015
 while-0.015
 last-0.014
 teach-0.014
 Mrs-0.013
Token be
Token ID307
Feature activation+9.67e-03

Pos logit contributions
nes+0.016
vel+0.016
oll+0.016
ode+0.015
ust+0.015
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.009
 teach-0.009
 Mrs-0.008
Token something
Token ID1223
Feature activation+1.96e-02

Pos logit contributions
nes+0.018
vel+0.018
oll+0.017
ode+0.017
ust+0.016
Neg logit contributions
 Ms-0.012
 while-0.011
 last-0.010
 Mrs-0.010
 Mr-0.010
Token special
Token ID2041
Feature activation+2.35e-02

Pos logit contributions
nes+0.040
vel+0.040
oll+0.039
ode+0.038
aces+0.038
Neg logit contributions
 Ms-0.020
 while-0.020
 last-0.019
 Mr-0.018
 Mrs-0.018
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
nes+0.056
vel+0.056
oll+0.054
ode+0.054
ust+0.052
Neg logit contributions
 while-0.016
 Ms-0.015
 last-0.014
 before-0.013
 teach-0.013
Token They
Token ID1119
Feature activation+2.46e-02

Pos logit contributions
nes+0.025
vel+0.025
oll+0.024
ode+0.024
aces+0.023
Neg logit contributions
 Ms-0.011
 while-0.009
 Mrs-0.009
 Mr-0.009
 last-0.008
Token decided
Token ID3066
Feature activation+2.09e-02

Pos logit contributions
vel+0.048
nes+0.047
oll+0.046
ode+0.044
aces+0.044
Neg logit contributions
 Ms-0.027
 while-0.026
 last-0.025
 teach-0.025
 Mrs-0.024
Token to
Token ID284
Feature activation-1.20e-03

Pos logit contributions
oll+0.052
vel+0.051
nes+0.051
ode+0.050
aces+0.049
Neg logit contributions
 while-0.013
 Ms-0.013
 last-0.011
 Mrs-0.011
 teach-0.011
Token wait
Token ID4043
Feature activation+7.07e-03

Pos logit contributions
 Ms+0.001
 while+0.001
 Mrs+0.001
 last+0.001
 Mr+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
ust-0.002
Token until
Token ID1566
Feature activation+5.36e-03

Pos logit contributions
nes+0.015
vel+0.015
oll+0.015
ode+0.014
ust+0.014
Neg logit contributions
 Ms-0.006
 while-0.006
 last-0.005
 teach-0.005
 Mrs-0.005
Token bed
Token ID3996
Feature activation-6.74e-03

Pos logit contributions
nes+0.012
vel+0.012
oll+0.011
ode+0.011
ust+0.011
Neg logit contributions
 Ms-0.005
 while-0.004
 last-0.004
 teach-0.004
 Mrs-0.004
Tokentime
Token ID2435
Feature activation+1.42e-03

Pos logit contributions
 Ms+0.007
 while+0.006
 teach+0.006
 Mrs+0.006
 last+0.006
Neg logit contributions
nes-0.014
vel-0.013
oll-0.013
aces-0.012
ust-0.012
Token It
Token ID632
Feature activation-2.03e-02

Pos logit contributions
 Ms+0.020
 while+0.020
 last+0.019
glers+0.018
 teach+0.018
Neg logit contributions
oll-0.026
nes-0.026
vel-0.025
aces-0.025
ode-0.024
Token was
Token ID373
Feature activation+3.77e-03

Pos logit contributions
 Ms+0.033
 while+0.032
 Mrs+0.030
 visit+0.029
 last+0.029
Neg logit contributions
nes-0.026
vel-0.026
oll-0.026
ust-0.025
ode-0.025
Token just
Token ID655
Feature activation+7.93e-03

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.004
 while-0.004
 last-0.003
 teach-0.003
 Mrs-0.003
Token pretend
Token ID16614
Feature activation-5.16e-03

Pos logit contributions
nes+0.017
vel+0.017
oll+0.016
ode+0.016
ust+0.016
Neg logit contributions
 Ms-0.007
 while-0.006
 last-0.006
 Mrs-0.006
 teach-0.006
Token.
Token ID13
Feature activation-9.91e-03

Pos logit contributions
 Ms+0.006
 while+0.005
 last+0.005
 Mrs+0.005
 teach+0.005
Neg logit contributions
nes-0.010
vel-0.010
oll-0.010
aces-0.009
ode-0.009
Token Let
Token ID3914
Feature activation+1.88e-03

Pos logit contributions
 Ms+0.013
 while+0.012
 last+0.012
 teach+0.011
 Mrs+0.011
Neg logit contributions
nes-0.017
oll-0.017
vel-0.017
aces-0.016
ode-0.016
Token's
Token ID338
Feature activation+1.91e-03

Pos logit contributions
nes+0.003
vel+0.003
oll+0.003
ode+0.002
ust+0.002
Neg logit contributions
 Ms-0.003
 while-0.003
 last-0.003
 teach-0.003
 Mrs-0.003
Token make
Token ID787
Feature activation+9.38e-03

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
ust+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.001
 Mrs-0.001
 Mr-0.001
Token some
Token ID617
Feature activation-3.55e-02

Pos logit contributions
nes+0.020
vel+0.019
oll+0.019
ode+0.019
ust+0.018
Neg logit contributions
 Ms-0.009
 while-0.008
 last-0.008
 Mr-0.007
 Mrs-0.007
Token more
Token ID517
Feature activation-1.93e-02

Pos logit contributions
 Ms+0.050
 Mrs+0.044
 visit+0.043
Thank+0.043
 last+0.043
Neg logit contributions
vel-0.053
oll-0.053
aces-0.048
nes-0.047
ode-0.046
Token food
Token ID2057
Feature activation+2.24e-02

Pos logit contributions
 Ms+0.027
 last+0.025
 Mrs+0.024
 while+0.023
 teach+0.023
Neg logit contributions
vel-0.031
oll-0.030
nes-0.029
aces-0.029
ode-0.027
Token there
Token ID612
Feature activation+9.12e-03

Pos logit contributions
nes+0.043
vel+0.041
ode+0.041
oll+0.041
aces+0.039
Neg logit contributions
 while-0.030
 Ms-0.028
 Mr-0.026
 last-0.026
 Mrs-0.025
Token were
Token ID547
Feature activation+3.89e-02

Pos logit contributions
nes+0.021
vel+0.021
oll+0.021
ode+0.020
ust+0.020
Neg logit contributions
 Ms-0.007
 while-0.006
 last-0.005
 Mr-0.005
 Mrs-0.005
Token two
Token ID734
Feature activation-2.64e-02

Pos logit contributions
nes+0.091
vel+0.090
ode+0.087
oll+0.087
aces+0.087
Neg logit contributions
 Ms-0.029
 Mr-0.026
 while-0.026
 Mrs-0.025
 last-0.024
Token friends
Token ID2460
Feature activation-5.91e-03

Pos logit contributions
 Ms+0.039
 while+0.036
 teach+0.036
 before+0.035
 visit+0.035
Neg logit contributions
oll-0.039
vel-0.037
aky-0.037
ode-0.037
ust-0.036
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
 Ms+0.007
 while+0.006
 last+0.006
 teach+0.006
 Mrs+0.006
Neg logit contributions
nes-0.010
vel-0.010
oll-0.010
ode-0.010
aky-0.010
Token Jack
Token ID3619
Feature activation-1.16e-03

Pos logit contributions
 Ms+0.011
 teach+0.010
 while+0.010
 last+0.010
 praise+0.009
Neg logit contributions
nes-0.021
oll-0.021
vel-0.020
aces-0.020
ode-0.020
Token and
Token ID290
Feature activation+1.57e-02

Pos logit contributions
 Ms+0.001
 while+0.001
 teach+0.001
 last+0.001
 Mrs+0.001
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ode-0.002
ust-0.002
Token Jill
Token ID22772
Feature activation-5.08e-03

Pos logit contributions
vel+0.034
nes+0.033
oll+0.033
ode+0.032
aces+0.032
Neg logit contributions
 Ms-0.015
 while-0.014
 Mrs-0.013
 Mr-0.013
 last-0.012
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
 Ms+0.005
 while+0.004
 teach+0.004
 Mrs+0.004
 last+0.004
Neg logit contributions
nes-0.011
vel-0.010
oll-0.010
ust-0.010
ode-0.010
Token They
Token ID1119
Feature activation-5.59e-03

Pos logit contributions
 Ms+0.018
 while+0.017
 last+0.017
 teach+0.016
 Mrs+0.016
Neg logit contributions
nes-0.022
vel-0.022
aces-0.021
oll-0.021
ode-0.021
Token lived
Token ID5615
Feature activation-1.64e-02

Pos logit contributions
 Ms+0.007
 while+0.007
 Mrs+0.006
 last+0.006
 teach+0.006
Neg logit contributions
nes-0.010
vel-0.009
oll-0.009
ust-0.009
aces-0.009
Tokenmy
Token ID1820
Feature activation-3.54e-03

Pos logit contributions
nes+0.040
vel+0.040
oll+0.039
ust+0.037
ode+0.037
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.008
 Mr-0.007
 Mrs-0.007
Token's
Token ID338
Feature activation-2.07e-02

Pos logit contributions
 Ms+0.005
 while+0.005
 last+0.004
 teach+0.004
 Mrs+0.004
Neg logit contributions
nes-0.006
vel-0.006
oll-0.006
ode-0.005
ust-0.005
Token teacher
Token ID4701
Feature activation-2.71e-02

Pos logit contributions
 Ms+0.029
 Mrs+0.024
 while+0.024
 Mr+0.024
glers+0.022
Neg logit contributions
oll-0.034
nes-0.032
ode-0.032
vel-0.031
aces-0.030
Token told
Token ID1297
Feature activation+1.77e-02

Pos logit contributions
 Ms+0.031
 last+0.030
 while+0.029
 praise+0.028
 grace+0.028
Neg logit contributions
nes-0.047
vel-0.045
oll-0.045
aces-0.041
anna-0.041
Token him
Token ID683
Feature activation+2.24e-02

Pos logit contributions
nes+0.038
vel+0.037
oll+0.036
ode+0.035
aces+0.035
Neg logit contributions
 Ms-0.019
 Mrs-0.017
 Mr-0.017
 while-0.017
 last-0.016
Token to
Token ID284
Feature activation-8.40e-03

Pos logit contributions
nes+0.051
vel+0.050
oll+0.049
ode+0.049
ust+0.048
Neg logit contributions
 Ms-0.018
 while-0.017
 Mr-0.015
 Mrs-0.015
 teach-0.015
Token write
Token ID3551
Feature activation-4.91e-03

Pos logit contributions
 Ms+0.010
 while+0.009
 Mrs+0.008
 Mr+0.008
 last+0.008
Neg logit contributions
nes-0.016
vel-0.015
oll-0.015
ode-0.015
ust-0.015
Token his
Token ID465
Feature activation-1.34e-02

Pos logit contributions
 Ms+0.005
 while+0.005
 last+0.004
 teach+0.004
 Mrs+0.004
Neg logit contributions
vel-0.010
nes-0.010
oll-0.009
aces-0.009
ode-0.009
Token name
Token ID1438
Feature activation+3.33e-04

Pos logit contributions
 Ms+0.017
 while+0.015
 Mrs+0.014
 Mr+0.014
 Mitt+0.014
Neg logit contributions
vel-0.024
oll-0.024
nes-0.023
aces-0.023
ode-0.022
Token on
Token ID319
Feature activation-5.12e-03

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
aces+0.001
Neg logit contributions
 Ms-0.000
 while-0.000
 last-0.000
 teach-0.000
 Mrs-0.000
Token a
Token ID257
Feature activation+1.67e-02

Pos logit contributions
 Ms+0.005
 while+0.004
 last+0.004
 teach+0.004
 Mrs+0.004
Neg logit contributions
nes-0.011
vel-0.011
oll-0.010
aces-0.010
ode-0.010
Token,
Token ID11
Feature activation+2.47e-03

Pos logit contributions
nes+0.012
vel+0.012
oll+0.011
ode+0.011
ust+0.011
Neg logit contributions
 Ms-0.005
 while-0.005
 last-0.005
 teach-0.004
 Mrs-0.004
Token and
Token ID290
Feature activation-7.63e-03

Pos logit contributions
nes+0.005
vel+0.005
oll+0.005
ode+0.005
aces+0.005
Neg logit contributions
 Ms-0.003
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token a
Token ID257
Feature activation-5.44e-06

Pos logit contributions
 Ms+0.008
 while+0.007
 last+0.007
 teach+0.007
 Mrs+0.007
Neg logit contributions
nes-0.015
vel-0.015
oll-0.014
ode-0.014
aces-0.014
Token hat
Token ID6877
Feature activation+1.66e-03

Feature not active on prev token;
no predictions were affected.
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
nes+0.004
vel+0.003
oll+0.003
ode+0.003
aces+0.003
Neg logit contributions
 Ms-0.002
 while-0.001
 last-0.001
 Mrs-0.001
 teach-0.001
Token They
Token ID1119
Feature activation-6.43e-03

Pos logit contributions
nes+0.027
vel+0.027
oll+0.026
ode+0.025
ust+0.025
Neg logit contributions
 Ms-0.015
 while-0.014
 Mrs-0.013
 Mr-0.013
 teach-0.013
Token make
Token ID787
Feature activation+7.96e-03

Pos logit contributions
 Ms+0.008
 while+0.007
 Mrs+0.006
 last+0.006
 Mr+0.006
Neg logit contributions
nes-0.012
vel-0.012
oll-0.011
ode-0.011
aces-0.011
Token a
Token ID257
Feature activation+1.91e-02

Pos logit contributions
nes+0.015
vel+0.015
oll+0.014
ode+0.014
ust+0.014
Neg logit contributions
 Ms-0.009
 while-0.009
 last-0.008
 teach-0.008
 Mrs-0.008
Token face
Token ID1986
Feature activation+1.40e-02

Pos logit contributions
nes+0.035
vel+0.033
oll+0.033
aces+0.032
ust+0.032
Neg logit contributions
 Ms-0.024
 while-0.023
 last-0.022
 teach-0.022
 Mr-0.022
Token and
Token ID290
Feature activation-1.61e-02

Pos logit contributions
nes+0.031
vel+0.030
oll+0.030
ode+0.029
ust+0.029
Neg logit contributions
 Ms-0.012
 while-0.011
 last-0.010
 teach-0.010
 Mrs-0.010
Token a
Token ID257
Feature activation+9.94e-03

Pos logit contributions
 Ms+0.017
 while+0.015
 last+0.014
 Mrs+0.014
 Mr+0.014
Neg logit contributions
nes-0.031
vel-0.030
oll-0.030
aces-0.029
ode-0.028
Token Betsy
Token ID45775
Feature activation-7.92e-03

Pos logit contributions
 last+0.025
 Ms+0.024
glers+0.023
 praise+0.022
 teach+0.022
Neg logit contributions
nes-0.038
vel-0.037
oll-0.036
anna-0.035
aces-0.033
Token nodded
Token ID14464
Feature activation-1.28e-02

Pos logit contributions
 Ms+0.008
 while+0.007
 last+0.007
 teach+0.007
 Mrs+0.006
Neg logit contributions
nes-0.016
vel-0.016
oll-0.015
ode-0.015
ust-0.015
Token her
Token ID607
Feature activation+1.00e-02

Pos logit contributions
 Ms+0.015
 last+0.014
 teach+0.013
 while+0.013
 praise+0.013
Neg logit contributions
vel-0.023
nes-0.022
oll-0.022
ust-0.021
aky-0.021
Token head
Token ID1182
Feature activation-1.81e-02

Pos logit contributions
nes+0.018
vel+0.017
oll+0.017
ode+0.016
ust+0.016
Neg logit contributions
 Ms-0.013
 while-0.012
 last-0.012
 teach-0.012
 Mr-0.011
Token and
Token ID290
Feature activation-6.17e-03

Pos logit contributions
 Ms+0.021
 teach+0.017
 Mrs+0.017
 Mr+0.017
 last+0.017
Neg logit contributions
vel-0.034
oll-0.033
nes-0.033
aces-0.031
ust-0.031
Token Lily
Token ID20037
Feature activation-1.34e-02

Pos logit contributions
 Ms+0.006
 last+0.005
 while+0.005
 teach+0.005
 Mrs+0.005
Neg logit contributions
nes-0.012
vel-0.012
oll-0.012
ode-0.011
ust-0.011
Token gently
Token ID15165
Feature activation-2.89e-02

Pos logit contributions
 Ms+0.015
 last+0.014
 while+0.014
 teach+0.013
 Mrs+0.013
Neg logit contributions
nes-0.025
vel-0.024
oll-0.024
ode-0.022
ust-0.022
Token stro
Token ID9660
Feature activation-9.53e-04

Pos logit contributions
 Ms+0.038
 last+0.035
 Mr+0.034
 Mrs+0.034
 while+0.034
Neg logit contributions
vel-0.046
nes-0.045
oll-0.044
ode-0.041
aces-0.040
Tokenked
Token ID9091
Feature activation-1.59e-02

Pos logit contributions
 Ms+0.002
 while+0.001
 last+0.001
 teach+0.001
 Mrs+0.001
Neg logit contributions
nes-0.001
vel-0.001
oll-0.001
ode-0.001
aces-0.001
Token her
Token ID607
Feature activation+3.23e-05

Pos logit contributions
 Ms+0.018
 last+0.017
 while+0.016
 teach+0.016
 visit+0.016
Neg logit contributions
vel-0.028
oll-0.027
nes-0.027
aces-0.026
ode-0.025
Token soft
Token ID2705
Feature activation-2.26e-02

Pos logit contributions
vel+0.000
nes+0.000
oll+0.000
ode+0.000
aces+0.000
Neg logit contributions
 Ms-0.000
 while-0.000
 last-0.000
 Mrs-0.000
 teach-0.000
Token and
Token ID290
Feature activation-2.56e-02

Pos logit contributions
 Ms+0.016
 Mrs+0.014
 Mr+0.014
 teach+0.013
 Mitt+0.013
Neg logit contributions
nes-0.025
vel-0.024
aces-0.024
oll-0.024
ode-0.023
Token a
Token ID257
Feature activation+1.90e-02

Pos logit contributions
 Ms+0.035
 Mrs+0.030
 teach+0.029
 last+0.028
 Mr+0.028
Neg logit contributions
nes-0.042
aces-0.039
ode-0.039
oll-0.038
vel-0.037
Token ball
Token ID2613
Feature activation-5.41e-03

Pos logit contributions
nes+0.035
vel+0.034
oll+0.034
ust+0.032
ode+0.032
Neg logit contributions
 Ms-0.023
 while-0.023
 last-0.021
 teach-0.020
 Mr-0.020
Token in
Token ID287
Feature activation+3.81e-03

Pos logit contributions
 Ms+0.005
 Mrs+0.004
 while+0.004
 Mr+0.004
 teach+0.004
Neg logit contributions
nes-0.011
vel-0.011
oll-0.011
aces-0.011
ode-0.011
Token the
Token ID262
Feature activation+2.87e-02

Pos logit contributions
nes+0.008
vel+0.008
oll+0.008
ode+0.008
ust+0.008
Neg logit contributions
 Ms-0.003
 while-0.003
 last-0.003
 teach-0.003
 Mrs-0.003
Token backyard
Token ID24296
Feature activation-3.62e-02

Pos logit contributions
nes+0.056
vel+0.055
oll+0.053
ust+0.052
onel+0.049
Neg logit contributions
 Ms-0.034
 last-0.034
 while-0.032
 Mr-0.032
 visit-0.031
Token.
Token ID13
Feature activation+5.58e-03

Pos logit contributions
 Ms+0.048
 Mrs+0.041
 Mitt+0.041
 Mr+0.039
 teach+0.036
Neg logit contributions
nes-0.062
vel-0.059
ode-0.056
aces-0.056
ust-0.056
Token Lily
Token ID20037
Feature activation-1.36e-02

Pos logit contributions
nes+0.012
vel+0.012
oll+0.011
ode+0.011
ust+0.011
Neg logit contributions
 Ms-0.005
 while-0.005
 last-0.005
 teach-0.004
 Mrs-0.004
Token was
Token ID373
Feature activation-3.10e-03

Pos logit contributions
 Ms+0.017
 last+0.015
 while+0.015
 Mrs+0.015
 teach+0.015
Neg logit contributions
nes-0.025
vel-0.023
oll-0.022
aces-0.022
aky-0.022
Token very
Token ID845
Feature activation-1.75e-02

Pos logit contributions
 Ms+0.004
 while+0.004
 last+0.004
 teach+0.004
 Mrs+0.004
Neg logit contributions
nes-0.005
vel-0.005
oll-0.005
aces-0.005
ust-0.004
Token good
Token ID922
Feature activation-1.36e-02

Pos logit contributions
 Ms+0.029
 last+0.027
 teach+0.026
 while+0.026
 visit+0.025
Neg logit contributions
nes-0.024
oll-0.023
vel-0.023
aky-0.022
ust-0.022
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
 Ms+0.009
 while+0.008
 last+0.008
 teach+0.008
 Mrs+0.007
Neg logit contributions
nes-0.020
vel-0.019
oll-0.019
aces-0.018
ust-0.018
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
 Ms+0.010
 last+0.009
 while+0.009
 teach+0.009
 Mrs+0.009
Neg logit contributions
nes-0.022
vel-0.021
oll-0.020
ust-0.019
ode-0.019
TokenT
Token ID51
Feature activation-1.64e-03

Pos logit contributions
 Ms+0.013
 last+0.012
 while+0.012
 Mrs+0.012
 teach+0.011
Neg logit contributions
nes-0.023
vel-0.023
oll-0.021
ust-0.021
ode-0.020
Tokeneddy
Token ID21874
Feature activation+2.20e-03

Pos logit contributions
 Ms+0.003
 while+0.002
 teach+0.002
 last+0.002
 Mrs+0.002
Neg logit contributions
nes-0.002
vel-0.002
oll-0.002
ust-0.002
ode-0.002
Token took
Token ID1718
Feature activation-1.83e-02

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
ust+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token the
Token ID262
Feature activation-3.71e-02

Pos logit contributions
 Ms+0.019
 while+0.017
 teach+0.016
glers+0.016
 last+0.015
Neg logit contributions
oll-0.035
nes-0.035
vel-0.035
aces-0.031
onel-0.031
Token medicine
Token ID9007
Feature activation+1.17e-02

Pos logit contributions
 Ms+0.064
glers+0.062
 Mrs+0.059
 Every+0.058
 thanks+0.057
Neg logit contributions
oll-0.041
vel-0.040
nes-0.037
 ankle-0.036
aky-0.035
Token and
Token ID290
Feature activation-2.09e-02

Pos logit contributions
nes+0.025
vel+0.025
oll+0.024
ode+0.024
ust+0.023
Neg logit contributions
 Ms-0.010
 while-0.010
 last-0.009
 Mrs-0.009
 teach-0.009
Token felt
Token ID2936
Feature activation-4.08e-03

Pos logit contributions
 Ms+0.023
 teach+0.020
 while+0.020
 last+0.020
 praise+0.018
Neg logit contributions
nes-0.039
oll-0.038
vel-0.037
ode-0.035
aces-0.034
Token much
Token ID881
Feature activation-1.81e-02

Pos logit contributions
 Ms+0.006
 while+0.005
 last+0.005
 teach+0.005
 Mrs+0.005
Neg logit contributions
nes-0.007
vel-0.006
oll-0.006
onel-0.006
ust-0.006
Token better
Token ID1365
Feature activation+1.10e-02

Pos logit contributions
 Ms+0.025
 Mrs+0.022
 Mitt+0.021
 while+0.021
 Mr+0.021
Neg logit contributions
oll-0.030
vel-0.029
nes-0.027
ode-0.027
ust-0.026
Token and
Token ID290
Feature activation-4.90e-03

Pos logit contributions
 Ms+0.024
 Mrs+0.020
 Mr+0.019
 Mitt+0.019
 while+0.019
Neg logit contributions
nes-0.032
vel-0.032
aces-0.031
oll-0.031
ode-0.030
Token towers
Token ID18028
Feature activation-1.15e-02

Pos logit contributions
 Ms+0.006
 while+0.006
 teach+0.005
 Mrs+0.005
 last+0.005
Neg logit contributions
nes-0.008
vel-0.008
oll-0.008
ode-0.008
aces-0.008
Token fall
Token ID2121
Feature activation+4.33e-03

Pos logit contributions
 Ms+0.013
 Mrs+0.011
 teach+0.010
 while+0.010
 Mitt+0.010
Neg logit contributions
nes-0.022
vel-0.022
oll-0.021
aces-0.021
ode-0.021
Token down
Token ID866
Feature activation-1.89e-02

Pos logit contributions
nes+0.008
vel+0.008
oll+0.008
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.005
 while-0.005
 last-0.005
 teach-0.005
 Mrs-0.005
Token.
Token ID13
Feature activation+2.09e-03

Pos logit contributions
 Ms+0.021
 teach+0.019
 Mrs+0.017
 praise+0.017
 Mitt+0.017
Neg logit contributions
nes-0.035
vel-0.035
aces-0.033
oll-0.033
ust-0.032
Token The
Token ID383
Feature activation-4.23e-02

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
aces+0.004
Neg logit contributions
 Ms-0.002
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token tubes
Token ID21103
Feature activation+1.11e-02

Pos logit contributions
 Ms+0.070
 Mrs+0.062
 Mr+0.058
 visit+0.058
 whenever+0.057
Neg logit contributions
aky-0.051
ode-0.050
nes-0.048
 pad-0.048
aces-0.047
Token are
Token ID389
Feature activation+5.82e-03

Pos logit contributions
nes+0.021
vel+0.020
oll+0.020
ode+0.019
ust+0.019
Neg logit contributions
 Ms-0.013
 while-0.012
 last-0.012
 teach-0.012
 Mr-0.011
Token broken
Token ID5445
Feature activation+7.86e-03

Pos logit contributions
nes+0.011
vel+0.011
oll+0.011
ode+0.010
ust+0.010
Neg logit contributions
 Ms-0.007
 while-0.006
 last-0.006
 teach-0.006
 Mrs-0.006
Token and
Token ID290
Feature activation-1.69e-03

Pos logit contributions
nes+0.017
vel+0.017
oll+0.017
ode+0.016
ust+0.016
Neg logit contributions
 Ms-0.007
 while-0.006
 last-0.006
 teach-0.005
 Mrs-0.005
Token bent
Token ID17157
Feature activation+2.16e-02

Pos logit contributions
 Ms+0.002
 while+0.002
 teach+0.002
 Mrs+0.002
 last+0.002
Neg logit contributions
nes-0.003
vel-0.003
oll-0.003
ode-0.003
aces-0.003
Token One
Token ID1881
Feature activation-2.18e-03

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.003
ust+0.003
Neg logit contributions
 Ms-0.001
 while-0.001
 last-0.001
 teach-0.001
 Mrs-0.001
Token day
Token ID1110
Feature activation-2.72e-02

Pos logit contributions
 Ms+0.002
 while+0.002
 teach+0.002
 Mrs+0.002
 Mr+0.002
Neg logit contributions
nes-0.004
oll-0.004
vel-0.004
ust-0.004
ode-0.004
Token,
Token ID11
Feature activation-3.47e-02

Pos logit contributions
 Ms+0.032
 teach+0.029
 praise+0.028
 last+0.027
glers+0.026
Neg logit contributions
vel-0.048
oll-0.048
nes-0.046
aces-0.043
aund-0.042
Token Lily
Token ID20037
Feature activation-2.23e-03

Pos logit contributions
 praise+0.047
 hire+0.046
 thanks+0.046
 Ms+0.046
 last+0.045
Neg logit contributions
nes-0.045
vel-0.041
oll-0.041
aund-0.037
aces-0.037
Token's
Token ID338
Feature activation-4.90e-03

Pos logit contributions
 Ms+0.003
 while+0.003
 last+0.003
 teach+0.003
 Mrs+0.003
Neg logit contributions
nes-0.004
vel-0.004
oll-0.004
ode-0.004
ust-0.003
Token mom
Token ID1995
Feature activation-3.68e-02

Pos logit contributions
 Ms+0.006
 while+0.005
 Mrs+0.005
 Mr+0.005
 last+0.005
Neg logit contributions
nes-0.009
oll-0.009
vel-0.009
ode-0.009
ust-0.009
Token told
Token ID1297
Feature activation+1.04e-02

Pos logit contributions
 Ms+0.047
 downward+0.046
 grace+0.045
 common+0.044
 usual+0.044
Neg logit contributions
nes-0.055
oll-0.054
aky-0.054
vel-0.053
ode-0.051
Token her
Token ID607
Feature activation+1.56e-02

Pos logit contributions
nes+0.021
vel+0.021
oll+0.021
ode+0.020
aces+0.020
Neg logit contributions
 Ms-0.011
 while-0.010
 last-0.009
 Mrs-0.009
 Mr-0.009
Token they
Token ID484
Feature activation+7.96e-03

Pos logit contributions
nes+0.034
vel+0.033
oll+0.033
ode+0.033
ust+0.032
Neg logit contributions
 Ms-0.014
 while-0.013
 Mrs-0.012
 teach-0.012
 last-0.012
Token were
Token ID547
Feature activation-7.49e-03

Pos logit contributions
nes+0.015
vel+0.014
oll+0.014
ode+0.014
ust+0.013
Neg logit contributions
 Ms-0.010
 while-0.009
 last-0.009
 teach-0.009
 Mrs-0.008
Token going
Token ID1016
Feature activation+4.42e-04

Pos logit contributions
 Ms+0.009
 while+0.008
 last+0.008
 teach+0.008
 praise+0.008
Neg logit contributions
nes-0.013
vel-0.013
oll-0.013
ust-0.012
onel-0.012
Token smiled
Token ID13541
Feature activation+6.55e-04

Pos logit contributions
 Ms+0.005
 while+0.005
 last+0.004
 teach+0.004
 Mrs+0.004
Neg logit contributions
nes-0.013
vel-0.013
oll-0.013
ode-0.012
ust-0.012
Token and
Token ID290
Feature activation+9.41e-03

Pos logit contributions
nes+0.001
vel+0.001
oll+0.001
ode+0.001
ust+0.001
Neg logit contributions
 Ms-0.001
 while-0.001
 last-0.001
 teach-0.001
 Mrs-0.001
Token gave
Token ID2921
Feature activation+2.16e-03

Pos logit contributions
nes+0.022
vel+0.021
oll+0.021
ode+0.020
aces+0.020
Neg logit contributions
 Ms-0.007
 while-0.007
 last-0.006
 Mrs-0.006
 Mr-0.006
Token him
Token ID683
Feature activation-2.93e-02

Pos logit contributions
nes+0.004
vel+0.004
oll+0.004
ode+0.004
ust+0.004
Neg logit contributions
 Ms-0.003
 while-0.003
 teach-0.002
 last-0.002
 Mrs-0.002
Token a
Token ID257
Feature activation-2.63e-02

Pos logit contributions
 Ms+0.033
 while+0.030
 teach+0.028
glers+0.028
 hire+0.027
Neg logit contributions
oll-0.053
nes-0.051
vel-0.051
aces-0.048
ode-0.048
Token big
Token ID1263
Feature activation-4.20e-02

Pos logit contributions
 Ms+0.025
 while+0.020
 before+0.020
glers+0.020
 Mrs+0.019
Neg logit contributions
oll-0.051
vel-0.051
nes-0.050
aky-0.047
ode-0.047
Token hug
Token ID16225
Feature activation+3.28e-03

Pos logit contributions
 Ms+0.040
glers+0.035
 whenever+0.033
 usual+0.032
 before+0.031
Neg logit contributions
oll-0.082
vel-0.082
ode-0.076
ocket-0.073
ig-0.073
Token.
Token ID13
Feature activation-9.05e-03

Pos logit contributions
nes+0.007
vel+0.007
oll+0.007
ode+0.007
ust+0.007
Neg logit contributions
 Ms-0.003
 while-0.002
 last-0.002
 teach-0.002
 Mrs-0.002
Token From
Token ID3574
Feature activation-2.44e-03

Pos logit contributions
 Ms+0.010
 last+0.009
 while+0.009
 teach+0.009
 praise+0.008
Neg logit contributions
nes-0.017
vel-0.016
oll-0.016
ust-0.015
aces-0.015
Token that
Token ID326
Feature activation-2.42e-02

Pos logit contributions
 Ms+0.003
 while+0.003
 teach+0.003
 last+0.003
 Mrs+0.003
Neg logit contributions
nes-0.004
oll-0.004
vel-0.004
ode-0.004
ust-0.004
Token day
Token ID1110
Feature activation+1.15e-02

Pos logit contributions
 Ms+0.034
glers+0.033
 praise+0.031
 teach+0.031
 hire+0.031
Neg logit contributions
vel-0.036
nes-0.033
oll-0.032
aky-0.031
anna-0.030