TOP ACTIVATIONS
MAX = 0.045

want you to respect their rules and their things . If
okay , but respecting other people 's opinions was also important
also help you in tough situations . From that day on
it come out of the seaw eed and walk around the
It was hiding under a seaw eed . Lily said ,
, " Good night , Jill and Tom . Sweet dreams
You have to respect the rules . You have to say
could help him in scary situations . He continued to balance
slide and looked through the ey ep iece . He saw
things can come from bad situations . <|endoftext|> Once upon a
should respect their mom 's rules . They hugged their mom
remember to abide by my rules and get a permit before
see shells , rocks , seaw eed , crabs and fish
tentacles to reach for the seaw eed , and he wrapped
you for the salad , ma ' am ," Lena said
honest , even in difficult situations . <|endoftext|> Once upon a
he decided to remove the seaw eed from the water so
said , " Thank you mum !" He was so thankful
permission and to respect the rules . She also learned that
He said , " Your kids marked on my barn with

MIN ACTIVATIONS
MIN = -0.055

saw a dusty old ash tr ay on the shelf .
they see a big ash tr ay near a bench .
. It was an ash tr ay . He wanted to
they also see an ash tr ay on the table .
backyard there was an ash tr ay . It was the
and fell . The ash tr ay flew out of her
room and saw an ash tr ay . She remembered her
to blow . The ash tr ay got really hot !
very tired . The ash tr ay watched as the little
on blowing until the ash tr ay was so dry that
he sp ied an ash tr ay . He picked up
, he saw an ash tr ay on the table .
. She put the ash tr ay away . She said
no , because the ash tr ay was not a toy
Sara found an empty ash tr ay on the table .
. He had an ash tr ay that could zip around
. The ash tr ay was ready to provide
, Lily found an ash tr ay in the world behind
notice how slippery the ash tr ay was . In no
and saw a big t eddy bear . It was brown

INTERVAL 0.020 - 0.045
CONTAINS 0.628%

From that day on , Joey was always excited to poke
. You are my sweet children and I love you ."
on the ground and held hands . They said , "
his sleepy eyes shifted from side to side . He felt
let go of your selfish ness and serve others . <|endoftext|>

INTERVAL -0.005 - 0.020
CONTAINS 74.681%

. <|endoftext|> Once upon a time , there was a big
, Lily . You are wrong . I am the king
lit up . Lucy was so happy !
, the judge would always listen and try to help .
so they decided to separate their eyes . The bear 's

INTERVAL -0.030 - -0.005
CONTAINS 24.649%

You are a good sister ." He finished sewing
M um , Tom questioned , " What is that high
went to visit their neighbor , Mrs . Lee . Mrs
Once upon a time , there was an old man who
What are you doing here ?" Lily asked .

INTERVAL -0.055 - -0.030
CONTAINS 0.043%

me , Lily !" Ben calls . " We can slide
â T eddy hugged Wire - Dog and
She also had a t eddy bear , a doll ,
She saw rain bows and unic orns and lots of other
. It hit the t eddy bear and bounced back .
Token want
Token ID765
Feature activation+1.01e-02

Pos logit contributions
ags+0.016
 yourself+0.014
 ey+0.013
 yourselves+0.013
+0.012
Neg logit contributions
s-0.018
t-0.015
z-0.014
 of-0.014
ls-0.013
Token you
Token ID345
Feature activation+3.62e-03

Pos logit contributions
ags+0.021
 yourself+0.019
 yourselves+0.019
 ey+0.017
34+0.017
Neg logit contributions
s-0.015
oped-0.013
t-0.012
z-0.012
ashed-0.012
Token to
Token ID284
Feature activation+5.66e-03

Pos logit contributions
ags+0.007
 yourself+0.007
 yourselves+0.006
+0.005
 upper+0.005
Neg logit contributions
s-0.004
z-0.004
 batch-0.004
rolled-0.004
oped-0.004
Token respect
Token ID2461
Feature activation+1.20e-02

Pos logit contributions
ags+0.010
 yourself+0.009
 yourselves+0.008
 ey+0.008
+0.008
Neg logit contributions
s-0.012
t-0.010
 of-0.009
ashed-0.009
ls-0.009
Token their
Token ID511
Feature activation-2.87e-03

Pos logit contributions
ags+0.024
 yourself+0.021
 ey+0.020
 yourselves+0.020
34+0.019
Neg logit contributions
s-0.021
t-0.018
 of-0.016
ashed-0.016
 while-0.015
Token rules
Token ID3173
Feature activation+4.47e-02

Pos logit contributions
s+0.006
t+0.006
 of+0.005
 they+0.005
 together+0.005
Neg logit contributions
ags-0.004
 ey-0.004
 yourself-0.004
 yourselves-0.004
34-0.004
Token and
Token ID290
Feature activation-3.94e-06

Pos logit contributions
ags+0.093
 yourself+0.081
34+0.078
 yourselves+0.077
 ey+0.072
Neg logit contributions
s-0.065
ashed-0.053
t-0.051
apy-0.051
rolled-0.050
Token their
Token ID511
Feature activation+6.36e-05

Feature not active on prev token;
no predictions were affected.
Token things
Token ID1243
Feature activation+2.60e-02

Pos logit contributions
ags+0.000
 ey+0.000
 yourself+0.000
 yourselves+0.000
34+0.000
Neg logit contributions
s-0.000
t-0.000
 of-0.000
z-0.000
 together-0.000
Token.
Token ID13
Feature activation+8.58e-03

Pos logit contributions
ags+0.052
 yourself+0.045
 yourselves+0.043
34+0.040
 ey+0.039
Neg logit contributions
s-0.042
t-0.034
ashed-0.033
ounce-0.033
 batch-0.032
Token If
Token ID1002
Feature activation-4.66e-03

Pos logit contributions
ags+0.020
 ey+0.018
ode+0.017
34+0.016
obb+0.016
Neg logit contributions
s-0.013
 they-0.010
 of-0.010
t-0.009
 together-0.009
Token okay
Token ID8788
Feature activation-1.35e-02

Pos logit contributions
ags+0.003
 yourself+0.003
 yourselves+0.003
34+0.003
 bluff+0.003
Neg logit contributions
s-0.004
t-0.003
z-0.003
 of-0.003
ashed-0.003
Token,
Token ID11
Feature activation-5.03e-03

Pos logit contributions
s+0.020
t+0.017
 mun+0.016
ashed+0.016
ls+0.015
Neg logit contributions
ags-0.027
 yourself-0.025
 yourselves-0.023
34-0.023
 ey-0.022
Token but
Token ID475
Feature activation-1.46e-02

Pos logit contributions
s+0.009
t+0.008
z+0.007
rolled+0.007
ashed+0.006
Neg logit contributions
ags-0.009
 yourself-0.008
34-0.008
 ey-0.008
 yourselves-0.008
Token respecting
Token ID30796
Feature activation+1.32e-02

Pos logit contributions
s+0.023
t+0.018
ashed+0.017
z+0.016
ounce+0.016
Neg logit contributions
ags-0.030
 yourself-0.027
 ey-0.027
 yourselves-0.026
34-0.026
Token other
Token ID584
Feature activation+2.09e-02

Pos logit contributions
ags+0.027
 ey+0.023
 yourself+0.022
 yourselves+0.022
34+0.022
Neg logit contributions
s-0.023
t-0.019
 of-0.018
 they-0.016
 while-0.016
Token people
Token ID661
Feature activation+4.42e-02

Pos logit contributions
ags+0.031
 yourself+0.031
 yourselves+0.028
 ey+0.025
34+0.024
Neg logit contributions
s-0.041
t-0.040
z-0.035
ashed-0.035
uttered-0.035
Token's
Token ID338
Feature activation+1.99e-02

Pos logit contributions
ags+0.083
 yourself+0.076
 yourselves+0.069
34+0.068
 ey+0.067
Neg logit contributions
s-0.066
t-0.060
ashed-0.055
 batch-0.054
z-0.053
Token opinions
Token ID9317
Feature activation+1.84e-02

Pos logit contributions
ags+0.037
 yourself+0.030
 ey+0.030
 yourselves+0.028
34+0.027
Neg logit contributions
s-0.041
t-0.035
 of-0.032
 they-0.030
 while-0.029
Token was
Token ID373
Feature activation+1.42e-02

Pos logit contributions
ags+0.042
 yourself+0.036
 ey+0.035
 yourselves+0.034
34+0.034
Neg logit contributions
s-0.026
t-0.021
ashed-0.018
ounce-0.018
z-0.017
Token also
Token ID635
Feature activation+1.61e-02

Pos logit contributions
ags+0.026
 yourself+0.023
 yourselves+0.021
34+0.020
 bluff+0.020
Neg logit contributions
s-0.028
t-0.023
 of-0.020
z-0.020
ashed-0.020
Token important
Token ID1593
Feature activation+3.46e-05

Pos logit contributions
ags+0.029
 yourself+0.024
34+0.022
 yourselves+0.022
 ey+0.022
Neg logit contributions
s-0.032
t-0.028
 of-0.025
ounce-0.023
z-0.023
Token also
Token ID635
Feature activation+5.30e-03

Pos logit contributions
s+0.014
ashed+0.011
t+0.011
 of+0.010
 batch+0.010
Neg logit contributions
ags-0.013
 yourself-0.011
 ey-0.010
34-0.010
 yourselves-0.010
Token help
Token ID1037
Feature activation+1.77e-02

Pos logit contributions
ags+0.008
 yourself+0.007
 ey+0.006
 yourselves+0.006
34+0.006
Neg logit contributions
s-0.012
t-0.010
ashed-0.009
 of-0.009
ls-0.009
Token you
Token ID345
Feature activation+1.39e-03

Pos logit contributions
ags+0.032
 yourself+0.031
 yourselves+0.027
 ey+0.025
34+0.024
Neg logit contributions
s-0.032
t-0.028
ashed-0.027
 batch-0.024
z-0.024
Token in
Token ID287
Feature activation+5.61e-03

Pos logit contributions
ags+0.003
 yourself+0.002
 yourselves+0.002
34+0.002
 ey+0.002
Neg logit contributions
s-0.002
t-0.002
ashed-0.002
 batch-0.002
 of-0.002
Token tough
Token ID5802
Feature activation+3.35e-03

Pos logit contributions
ags+0.011
 yourself+0.010
 yourselves+0.009
 ey+0.009
34+0.009
Neg logit contributions
s-0.010
t-0.008
ashed-0.008
z-0.007
ounce-0.007
Token situations
Token ID7445
Feature activation+4.41e-02

Pos logit contributions
ags+0.003
 yourself+0.003
 ey+0.002
 yourselves+0.002
34+0.002
Neg logit contributions
s-0.009
t-0.009
 they-0.008
 of-0.008
z-0.008
Token.
Token ID13
Feature activation-3.70e-04

Pos logit contributions
ags+0.091
 yourself+0.082
34+0.077
 yourselves+0.076
 ey+0.075
Neg logit contributions
s-0.070
t-0.057
ashed-0.052
 of-0.049
 they-0.048
Token From
Token ID3574
Feature activation+9.10e-03

Pos logit contributions
s+0.001
t+0.001
 of+0.001
 while+0.000
 they+0.000
Neg logit contributions
ags-0.001
 ey-0.001
 yourself-0.001
34-0.001
 yourselves-0.001
Token that
Token ID326
Feature activation-7.04e-03

Pos logit contributions
ags+0.020
 yourself+0.018
 ey+0.017
 yourselves+0.017
34+0.016
Neg logit contributions
s-0.014
t-0.012
 of-0.010
ashed-0.009
 while-0.009
Token day
Token ID1110
Feature activation+1.41e-02

Pos logit contributions
s+0.012
ashed+0.009
ounce+0.009
t+0.009
ls+0.009
Neg logit contributions
ags-0.012
 ey-0.010
 yourself-0.010
-0.010
 narrative-0.010
Token on
Token ID319
Feature activation+1.30e-02

Pos logit contributions
ags+0.039
 ey+0.037
 yourselves+0.034
 yourself+0.033
34+0.033
Neg logit contributions
s-0.016
 of-0.012
t-0.011
 they-0.010
 together-0.009
Token it
Token ID340
Feature activation-2.06e-03

Pos logit contributions
ags+0.002
 ey+0.002
 yourself+0.002
 yourselves+0.002
34+0.002
Neg logit contributions
t-0.003
s-0.003
z-0.002
ashed-0.002
ounce-0.002
Token come
Token ID1282
Feature activation-1.10e-03

Pos logit contributions
t+0.003
s+0.003
ashed+0.003
 batch+0.003
ounce+0.002
Neg logit contributions
ags-0.004
 ey-0.003
 yourself-0.003
34-0.003
ode-0.003
Token out
Token ID503
Feature activation-4.64e-03

Pos logit contributions
s+0.002
t+0.002
 batch+0.002
ashed+0.001
 mun+0.001
Neg logit contributions
ags-0.002
 ey-0.002
 yourself-0.002
34-0.002
 yourselves-0.001
Token of
Token ID286
Feature activation-7.55e-03

Pos logit contributions
s+0.005
t+0.004
ashed+0.003
 batch+0.003
z+0.002
Neg logit contributions
ags-0.012
 ey-0.011
34-0.010
 yourself-0.010
 yourselves-0.010
Token the
Token ID262
Feature activation+5.02e-03

Pos logit contributions
s+0.010
 of+0.008
 they+0.008
 together+0.007
 while+0.007
Neg logit contributions
ags-0.018
 ey-0.016
34-0.016
-0.016
ode-0.015
Token seaw
Token ID32085
Feature activation+4.38e-02

Pos logit contributions
ags+0.009
 ey+0.008
 yourself+0.007
34+0.007
 yourselves+0.007
Neg logit contributions
s-0.010
t-0.008
 of-0.008
 they-0.007
 while-0.007
Tokeneed
Token ID2308
Feature activation+1.47e-02

Pos logit contributions
ags+0.083
+0.069
 ey+0.067
 bluff+0.067
 yourselves+0.066
Neg logit contributions
s-0.091
t-0.068
 of-0.065
 while-0.062
ounce-0.059
Token and
Token ID290
Feature activation-6.82e-03

Pos logit contributions
ags+0.030
 ey+0.026
34+0.026
 yourself+0.025
ode+0.024
Neg logit contributions
s-0.022
t-0.019
 of-0.015
apy-0.015
ashed-0.015
Token walk
Token ID2513
Feature activation-1.02e-03

Pos logit contributions
s+0.013
t+0.012
 of+0.010
z+0.010
ounce+0.009
Neg logit contributions
ags-0.012
 ey-0.011
 yourself-0.010
 yourselves-0.010
 upper-0.009
Token around
Token ID1088
Feature activation-1.75e-02

Pos logit contributions
s+0.002
t+0.002
 batch+0.001
apy+0.001
ops+0.001
Neg logit contributions
ags-0.002
 ey-0.001
 yourself-0.001
34-0.001
iece-0.001
Token the
Token ID262
Feature activation+1.31e-03

Pos logit contributions
s+0.028
t+0.023
 of+0.021
 they+0.018
ashed+0.018
Neg logit contributions
ags-0.037
 ey-0.032
 yourself-0.031
34-0.031
 yourselves-0.030
Token It
Token ID632
Feature activation-9.43e-03

Pos logit contributions
ags+0.006
 ey+0.005
 yourself+0.005
34+0.005
 yourselves+0.005
Neg logit contributions
s-0.005
 of-0.004
t-0.004
 they-0.004
 while-0.004
Token was
Token ID373
Feature activation+1.12e-03

Pos logit contributions
t+0.013
s+0.013
less+0.012
 of+0.012
ounce+0.011
Neg logit contributions
 ey-0.015
ags-0.014
-0.013
34-0.013
iture-0.012
Token hiding
Token ID11816
Feature activation-3.18e-03

Pos logit contributions
ags+0.002
 ey+0.002
 yourself+0.002
34+0.002
 yourselves+0.002
Neg logit contributions
s-0.002
t-0.002
 of-0.002
 while-0.002
ounce-0.002
Token under
Token ID739
Feature activation-2.89e-03

Pos logit contributions
s+0.006
t+0.005
 of+0.004
 they+0.004
 while+0.004
Neg logit contributions
ags-0.006
 ey-0.005
 yourself-0.005
 yourselves-0.005
34-0.005
Token a
Token ID257
Feature activation-2.13e-03

Pos logit contributions
s+0.005
t+0.004
 of+0.004
ounce+0.003
 together+0.003
Neg logit contributions
ags-0.006
 ey-0.005
 yourself-0.005
 yourselves-0.005
34-0.005
Token seaw
Token ID32085
Feature activation+4.37e-02

Pos logit contributions
s+0.004
t+0.004
 of+0.004
 they+0.003
 together+0.003
Neg logit contributions
ags-0.004
 ey-0.003
 yourself-0.003
34-0.003
 yourselves-0.003
Tokeneed
Token ID2308
Feature activation+3.48e-03

Pos logit contributions
ags+0.075
 ey+0.060
+0.059
 yourselves+0.058
 yourself+0.058
Neg logit contributions
s-0.095
t-0.077
 of-0.074
 while-0.069
 they-0.065
Token.
Token ID13
Feature activation+3.96e-03

Pos logit contributions
ags+0.006
 ey+0.006
34+0.006
 yourself+0.006
ode+0.005
Neg logit contributions
s-0.006
t-0.005
 of-0.004
ashed-0.004
apy-0.004
Token Lily
Token ID20037
Feature activation+1.34e-03

Pos logit contributions
ags+0.008
 ey+0.006
34+0.006
 yourself+0.006
 yourselves+0.006
Neg logit contributions
s-0.008
t-0.006
 of-0.006
 they-0.006
 while-0.006
Token said
Token ID531
Feature activation+4.86e-03

Pos logit contributions
 ey+0.002
ags+0.002
 yourself+0.002
34+0.002
per+0.002
Neg logit contributions
s-0.002
t-0.002
 of-0.002
less-0.001
 while-0.001
Token,
Token ID11
Feature activation-4.42e-03

Pos logit contributions
ags+0.012
 ey+0.010
 yourself+0.010
34+0.010
 yourselves+0.010
Neg logit contributions
s-0.006
t-0.005
 of-0.004
 while-0.004
ounce-0.003
Token,
Token ID11
Feature activation+3.24e-03

Pos logit contributions
ags+0.004
 yourself+0.003
 yourselves+0.003
34+0.003
 ey+0.003
Neg logit contributions
s-0.002
t-0.002
ounce-0.002
 of-0.002
z-0.002
Token "
Token ID366
Feature activation+7.89e-03

Pos logit contributions
ags+0.008
 ey+0.007
 yourself+0.006
 yourselves+0.006
+0.006
Neg logit contributions
s-0.005
 they-0.004
 of-0.004
t-0.004
 while-0.004
TokenGood
Token ID10248
Feature activation-4.14e-03

Pos logit contributions
ags+0.018
 ey+0.016
34+0.014
ode+0.014
+0.013
Neg logit contributions
s-0.014
t-0.011
 they-0.011
 of-0.010
 together-0.010
Token night
Token ID1755
Feature activation+1.02e-02

Pos logit contributions
s+0.007
t+0.005
 of+0.005
 they+0.004
 while+0.004
Neg logit contributions
ags-0.009
 ey-0.008
 yourself-0.008
34-0.008
 yourselves-0.008
Token,
Token ID11
Feature activation-3.05e-03

Pos logit contributions
ags+0.020
34+0.019
 yourself+0.019
 yourselves+0.018
 ey+0.017
Neg logit contributions
s-0.015
t-0.013
ashed-0.011
z-0.010
rolled-0.010
Token Jill
Token ID22772
Feature activation+4.29e-02

Pos logit contributions
s+0.006
 of+0.005
t+0.004
 they+0.004
 while+0.004
Neg logit contributions
ags-0.006
 ey-0.005
34-0.005
 yourself-0.005
-0.005
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
ags+0.113
 ey+0.100
34+0.096
 yourself+0.096
 yourselves+0.094
Neg logit contributions
s-0.051
t-0.036
 of-0.034
 they-0.030
 while-0.028
Token Tom
Token ID4186
Feature activation+1.75e-02

Pos logit contributions
ags+0.028
 ey+0.023
 yourself+0.023
 yourselves+0.022
34+0.022
Neg logit contributions
s-0.029
t-0.024
 of-0.023
 while-0.021
 they-0.020
Token.
Token ID13
Feature activation+1.05e-03

Pos logit contributions
ags+0.035
 yourself+0.031
 yourselves+0.029
34+0.028
 ey+0.027
Neg logit contributions
s-0.029
t-0.025
 of-0.021
ounce-0.020
ashed-0.020
Token Sweet
Token ID15335
Feature activation-4.76e-03

Pos logit contributions
ags+0.002
 ey+0.002
34+0.002
 yourself+0.002
ode+0.002
Neg logit contributions
s-0.002
t-0.002
 of-0.002
 they-0.001
 together-0.001
Token dreams
Token ID10625
Feature activation+9.36e-03

Pos logit contributions
s+0.011
t+0.009
 of+0.008
 they+0.008
 while+0.008
Neg logit contributions
ags-0.008
 ey-0.006
 yourself-0.006
 yourselves-0.006
34-0.006
Token You
Token ID921
Feature activation-6.78e-03

Pos logit contributions
s+0.000
 of+0.000
 they+0.000
t+0.000
 together+0.000
Neg logit contributions
ags-0.000
 ey-0.000
ode-0.000
34-0.000
-0.000
Token have
Token ID423
Feature activation-7.60e-05

Pos logit contributions
s+0.011
t+0.008
 of+0.008
 while+0.007
 they+0.007
Neg logit contributions
ags-0.015
 ey-0.012
 yourself-0.012
34-0.012
 yourselves-0.012
Token to
Token ID284
Feature activation+1.04e-02

Pos logit contributions
s+0.000
t+0.000
 of+0.000
 they+0.000
 while+0.000
Neg logit contributions
ags-0.000
 ey-0.000
 yourself-0.000
 yourselves-0.000
34-0.000
Token respect
Token ID2461
Feature activation+1.15e-02

Pos logit contributions
ags+0.021
 ey+0.018
34+0.017
 yourself+0.017
 yourselves+0.016
Neg logit contributions
s-0.019
t-0.015
 of-0.015
 they-0.014
 while-0.014
Token the
Token ID262
Feature activation+3.42e-03

Pos logit contributions
ags+0.025
 ey+0.022
34+0.021
+0.020
 yourself+0.020
Neg logit contributions
s-0.019
 of-0.015
t-0.014
 they-0.014
 while-0.013
Token rules
Token ID3173
Feature activation+4.29e-02

Pos logit contributions
ags+0.006
 ey+0.005
34+0.005
 yourself+0.005
 yourselves+0.005
Neg logit contributions
s-0.007
t-0.006
 of-0.006
 they-0.005
 while-0.005
Token.
Token ID13
Feature activation+6.10e-03

Pos logit contributions
ags+0.095
 yourself+0.080
34+0.079
 ey+0.078
 yourselves+0.078
Neg logit contributions
s-0.065
t-0.049
 of-0.045
 they-0.043
 while-0.042
Token You
Token ID921
Feature activation-3.81e-03

Pos logit contributions
ags+0.012
 ey+0.011
ode+0.011
34+0.010
+0.010
Neg logit contributions
s-0.009
 of-0.008
t-0.007
 they-0.007
 together-0.007
Token have
Token ID423
Feature activation+2.35e-03

Pos logit contributions
s+0.006
t+0.005
 of+0.004
 while+0.004
z+0.004
Neg logit contributions
ags-0.008
 yourself-0.007
 ey-0.007
 yourselves-0.006
34-0.006
Token to
Token ID284
Feature activation+1.11e-02

Pos logit contributions
ags+0.007
 ey+0.006
 yourself+0.006
 yourselves+0.006
34+0.006
Neg logit contributions
s-0.002
t-0.001
 of-0.001
 they-0.001
 while-0.001
Token say
Token ID910
Feature activation+1.10e-02

Pos logit contributions
ags+0.022
 ey+0.019
34+0.018
 yourself+0.018
 yourselves+0.017
Neg logit contributions
s-0.020
t-0.016
 of-0.016
 they-0.015
 while-0.014
Token could
Token ID714
Feature activation-6.51e-04

Pos logit contributions
ags+0.012
 ey+0.010
 yourself+0.010
 yourselves+0.010
34+0.009
Neg logit contributions
s-0.009
t-0.007
 of-0.007
 they-0.006
 while-0.006
Token help
Token ID1037
Feature activation+2.02e-02

Pos logit contributions
s+0.001
ashed+0.001
ls+0.001
t+0.001
 batch+0.001
Neg logit contributions
ags-0.001
 yourself-0.001
 yourselves-0.001
ode-0.001
 ey-0.001
Token him
Token ID683
Feature activation-3.19e-04

Pos logit contributions
ags+0.038
 yourself+0.035
 yourselves+0.032
 ey+0.031
+0.028
Neg logit contributions
s-0.038
t-0.031
ashed-0.029
ls-0.027
 of-0.026
Token in
Token ID287
Feature activation+7.33e-03

Pos logit contributions
s+0.001
t+0.000
ashed+0.000
 batch+0.000
ls+0.000
Neg logit contributions
ags-0.001
 yourself-0.001
 ey-0.000
ode-0.000
 yourselves-0.000
Token scary
Token ID14343
Feature activation+2.40e-03

Pos logit contributions
ags+0.014
 yourself+0.012
 ey+0.012
 yourselves+0.011
34+0.011
Neg logit contributions
s-0.015
t-0.012
ashed-0.011
 of-0.010
 they-0.010
Token situations
Token ID7445
Feature activation+4.27e-02

Pos logit contributions
ags+0.004
 ey+0.003
 yourself+0.003
 yourselves+0.003
34+0.002
Neg logit contributions
s-0.006
t-0.005
 of-0.004
 they-0.004
 together-0.004
Token.
Token ID13
Feature activation+1.25e-03

Pos logit contributions
ags+0.081
 yourself+0.076
 yourselves+0.069
 ey+0.067
34+0.067
Neg logit contributions
s-0.071
t-0.056
ashed-0.056
 batch-0.052
apy-0.051
Token He
Token ID679
Feature activation+1.94e-03

Pos logit contributions
ags+0.003
 ey+0.002
 yourself+0.002
34+0.002
 yourselves+0.002
Neg logit contributions
s-0.002
t-0.001
 of-0.001
 they-0.001
 while-0.001
Token continued
Token ID3767
Feature activation-7.36e-03

Pos logit contributions
ags+0.003
 ey+0.003
ode+0.003
 yourself+0.003
+0.002
Neg logit contributions
s-0.003
t-0.003
less-0.003
 while-0.003
 of-0.003
Token to
Token ID284
Feature activation+2.12e-03

Pos logit contributions
s+0.010
t+0.009
 of+0.007
ounce+0.006
ashed+0.006
Neg logit contributions
ags-0.017
 ey-0.014
 yourself-0.013
34-0.012
 yourselves-0.012
Token balance
Token ID5236
Feature activation+8.39e-03

Pos logit contributions
ags+0.005
 ey+0.004
 yourself+0.004
34+0.004
 yourselves+0.004
Neg logit contributions
s-0.004
t-0.003
 of-0.003
 they-0.002
 while-0.002
Token slide
Token ID10649
Feature activation-1.22e-03

Pos logit contributions
ags+0.007
 ey+0.006
34+0.005
 yourself+0.005
 yourselves+0.005
Neg logit contributions
s-0.010
t-0.008
 of-0.008
 they-0.007
 while-0.007
Token and
Token ID290
Feature activation-4.16e-03

Pos logit contributions
s+0.002
t+0.001
 of+0.001
ounce+0.001
ls+0.001
Neg logit contributions
ags-0.003
34-0.002
 ey-0.002
 yourself-0.002
 upper-0.002
Token looked
Token ID3114
Feature activation-1.79e-03

Pos logit contributions
s+0.007
t+0.007
 of+0.006
ounce+0.005
less+0.005
Neg logit contributions
ags-0.008
 ey-0.007
 yourself-0.007
 yourselves-0.007
34-0.006
Token through
Token ID832
Feature activation-7.59e-03

Pos logit contributions
s+0.003
t+0.002
 of+0.002
 they+0.002
 while+0.002
Neg logit contributions
ags-0.004
 ey-0.004
34-0.003
 yourself-0.003
 yourselves-0.003
Token the
Token ID262
Feature activation+2.58e-03

Pos logit contributions
s+0.013
t+0.011
 of+0.009
 they+0.009
ashed+0.009
Neg logit contributions
ags-0.016
 ey-0.014
 yourself-0.013
34-0.013
 yourselves-0.013
Token ey
Token ID1926
Feature activation+4.26e-02

Pos logit contributions
ags+0.004
34+0.003
+0.003
 yourself+0.003
 ey+0.003
Neg logit contributions
s-0.006
 of-0.005
t-0.005
 they-0.005
 while-0.005
Tokenep
Token ID538
Feature activation-1.58e-02

Pos logit contributions
ags+0.065
 ey+0.055
34+0.050
+0.049
 yourself+0.047
Neg logit contributions
s-0.098
t-0.080
 of-0.080
 together-0.075
 they-0.075
Tokeniece
Token ID8535
Feature activation+2.00e-02

Pos logit contributions
s+0.033
t+0.030
 batch+0.028
ls+0.026
ashed+0.026
Neg logit contributions
ags-0.023
iece-0.020
34-0.018
 yourself-0.018
per-0.016
Token.
Token ID13
Feature activation+1.77e-03

Pos logit contributions
ags+0.037
iece+0.031
34+0.030
 yourself+0.029
 ey+0.028
Neg logit contributions
s-0.035
t-0.028
 batch-0.027
ashed-0.027
apy-0.026
Token He
Token ID679
Feature activation+3.25e-04

Pos logit contributions
ags+0.003
 ey+0.003
34+0.002
 yourself+0.002
 yourselves+0.002
Neg logit contributions
s-0.004
t-0.003
 of-0.003
 they-0.003
 while-0.003
Token saw
Token ID2497
Feature activation-1.40e-03

Pos logit contributions
ags+0.001
 ey+0.001
34+0.001
 yourself+0.000
 upper+0.000
Neg logit contributions
s-0.001
t-0.000
 of-0.000
less-0.000
 while-0.000
Token things
Token ID1243
Feature activation+2.81e-03

Pos logit contributions
s+0.004
t+0.003
 of+0.003
z+0.003
ounce+0.003
Neg logit contributions
ags-0.004
 ey-0.003
 yourself-0.003
 yourselves-0.003
34-0.003
Token can
Token ID460
Feature activation-3.53e-03

Pos logit contributions
ags+0.005
 yourself+0.004
 ey+0.004
 yourselves+0.004
34+0.004
Neg logit contributions
s-0.005
t-0.004
ashed-0.004
z-0.004
 batch-0.004
Token come
Token ID1282
Feature activation+2.25e-04

Pos logit contributions
s+0.007
ashed+0.006
t+0.006
 of+0.006
 while+0.005
Neg logit contributions
ags-0.006
 yourself-0.005
 ey-0.005
34-0.004
 yourselves-0.004
Token from
Token ID422
Feature activation+4.93e-03

Pos logit contributions
ags+0.000
 yourself+0.000
 yourselves+0.000
34+0.000
 ey+0.000
Neg logit contributions
s-0.000
 batch-0.000
 mun-0.000
t-0.000
apy-0.000
Token bad
Token ID2089
Feature activation-4.85e-03

Pos logit contributions
ags+0.009
 yourself+0.008
 yourselves+0.008
 ey+0.007
34+0.007
Neg logit contributions
s-0.010
t-0.009
 of-0.008
ashed-0.007
z-0.007
Token situations
Token ID7445
Feature activation+4.26e-02

Pos logit contributions
s+0.010
t+0.008
 of+0.008
 they+0.007
 while+0.007
Neg logit contributions
ags-0.009
 ey-0.008
 yourself-0.007
34-0.007
 yourselves-0.007
Token.
Token ID13
Feature activation-1.14e-03

Pos logit contributions
ags+0.086
 yourself+0.079
34+0.074
 yourselves+0.073
 ey+0.070
Neg logit contributions
s-0.063
t-0.052
ashed-0.048
apy-0.047
 mun-0.047
Token<|endoftext|>
Token ID50256
Feature activation-6.11e-03

Pos logit contributions
s+0.001
 of+0.001
t+0.001
 they+0.001
 while+0.001
Neg logit contributions
ags-0.003
 ey-0.003
34-0.003
 yourselves-0.003
 yourself-0.003
TokenOnce
Token ID7454
Feature activation+2.36e-04

Pos logit contributions
s+0.009
t+0.007
 of+0.006
 they+0.006
 while+0.005
Neg logit contributions
ags-0.015
 ey-0.013
34-0.012
 yourself-0.012
 yourselves-0.012
Token upon
Token ID2402
Feature activation+1.24e-02

Pos logit contributions
ags+0.001
 ey+0.000
34+0.000
 yourself+0.000
+0.000
Neg logit contributions
s-0.000
t-0.000
 while-0.000
ashed-0.000
 together-0.000
Token a
Token ID257
Feature activation+5.16e-03

Pos logit contributions
ags+0.028
 ey+0.024
 yourself+0.023
34+0.023
 yourselves+0.022
Neg logit contributions
s-0.019
t-0.015
 of-0.014
 they-0.013
 together-0.012
Token should
Token ID815
Feature activation+9.65e-04

Pos logit contributions
s+0.008
z+0.006
t+0.006
 batch+0.006
 while+0.006
Neg logit contributions
ags-0.008
 yourself-0.007
 ey-0.006
 yourselves-0.006
34-0.006
Token respect
Token ID2461
Feature activation+1.21e-02

Pos logit contributions
ags+0.002
 yourself+0.002
 ey+0.002
 yourselves+0.002
34+0.001
Neg logit contributions
s-0.002
t-0.001
 of-0.001
 while-0.001
 they-0.001
Token their
Token ID511
Feature activation-1.39e-05

Pos logit contributions
ags+0.026
 yourself+0.022
 ey+0.021
 yourselves+0.021
34+0.020
Neg logit contributions
s-0.020
t-0.017
 of-0.015
ashed-0.014
 while-0.014
Token mom
Token ID1995
Feature activation+1.97e-02

Pos logit contributions
s+0.000
t+0.000
 of+0.000
 they+0.000
 while+0.000
Neg logit contributions
ags-0.000
 ey-0.000
 yourself-0.000
 yourselves-0.000
34-0.000
Token's
Token ID338
Feature activation+1.14e-02

Pos logit contributions
ags+0.040
 yourself+0.035
34+0.033
 yourselves+0.032
 ey+0.032
Neg logit contributions
s-0.029
t-0.025
ashed-0.023
apy-0.022
rolled-0.022
Token rules
Token ID3173
Feature activation+4.25e-02

Pos logit contributions
ags+0.017
 yourself+0.013
 ey+0.013
 yourselves+0.012
34+0.011
Neg logit contributions
s-0.027
t-0.024
 of-0.022
 they-0.022
 while-0.021
Token.
Token ID13
Feature activation+7.05e-03

Pos logit contributions
ags+0.089
 yourself+0.077
34+0.074
 yourselves+0.072
 ey+0.069
Neg logit contributions
s-0.063
t-0.049
ashed-0.047
apy-0.047
 mun-0.046
Token They
Token ID1119
Feature activation+3.40e-03

Pos logit contributions
ags+0.018
 ey+0.016
34+0.015
ode+0.015
 yourself+0.015
Neg logit contributions
s-0.009
t-0.006
 of-0.006
 they-0.006
 together-0.005
Token hugged
Token ID41130
Feature activation-8.07e-03

Pos logit contributions
ags+0.007
 ey+0.006
34+0.006
 yourself+0.006
 yourselves+0.005
Neg logit contributions
s-0.006
t-0.005
 of-0.005
 they-0.004
 while-0.004
Token their
Token ID511
Feature activation-5.14e-04

Pos logit contributions
s+0.012
t+0.010
 of+0.009
 they+0.007
 while+0.007
Neg logit contributions
ags-0.019
 ey-0.016
 yourself-0.016
 yourselves-0.015
34-0.015
Token mom
Token ID1995
Feature activation+1.53e-02

Pos logit contributions
s+0.001
t+0.001
 of+0.001
 they+0.001
 together+0.001
Neg logit contributions
ags-0.001
 ey-0.001
 yourselves-0.001
 yourself-0.001
34-0.001
Token remember
Token ID3505
Feature activation-1.46e-02

Pos logit contributions
s+0.002
t+0.001
ashed+0.001
ls+0.001
 of+0.001
Neg logit contributions
ags-0.001
 yourself-0.001
 yourselves-0.001
 ey-0.001
34-0.001
Token to
Token ID284
Feature activation+7.00e-03

Pos logit contributions
s+0.021
t+0.016
ashed+0.014
 of+0.012
z+0.012
Neg logit contributions
ags-0.035
 yourself-0.033
 yourselves-0.031
 ey-0.030
34-0.029
Token abide
Token ID27851
Feature activation+8.64e-03

Pos logit contributions
ags+0.013
 ey+0.011
 yourself+0.011
34+0.010
 yourselves+0.010
Neg logit contributions
s-0.014
t-0.011
 of-0.011
 they-0.010
 while-0.010
Token by
Token ID416
Feature activation+4.60e-03

Pos logit contributions
ags+0.019
 ey+0.017
34+0.016
 yourself+0.015
 yourselves+0.015
Neg logit contributions
s-0.014
t-0.011
 of-0.011
 they-0.009
 together-0.009
Token my
Token ID616
Feature activation+2.09e-03

Pos logit contributions
ags+0.008
 yourself+0.007
 ey+0.006
 yourselves+0.006
34+0.006
Neg logit contributions
s-0.010
t-0.009
 of-0.008
ashed-0.008
 they-0.008
Token rules
Token ID3173
Feature activation+4.24e-02

Pos logit contributions
ags+0.004
 ey+0.003
 yourself+0.003
34+0.003
 yourselves+0.003
Neg logit contributions
s-0.004
t-0.004
 of-0.004
 they-0.003
 together-0.003
Token and
Token ID290
Feature activation+8.24e-03

Pos logit contributions
ags+0.086
 yourself+0.075
34+0.073
 yourselves+0.070
 ey+0.068
Neg logit contributions
s-0.070
t-0.053
ashed-0.051
 they-0.050
 of-0.050
Token get
Token ID651
Feature activation+6.24e-03

Pos logit contributions
ags+0.014
 yourself+0.014
 yourselves+0.013
 ey+0.012
34+0.011
Neg logit contributions
s-0.017
t-0.014
 of-0.012
ashed-0.012
ls-0.012
Token a
Token ID257
Feature activation-1.62e-03

Pos logit contributions
ags+0.011
 yourself+0.011
 yourselves+0.010
 bluff+0.007
 ey+0.007
Neg logit contributions
s-0.012
t-0.010
ops-0.010
uddy-0.009
z-0.009
Token permit
Token ID8749
Feature activation+8.55e-03

Pos logit contributions
s+0.003
t+0.002
 of+0.002
 they+0.002
ls+0.002
Neg logit contributions
ags-0.003
 ey-0.002
 yourself-0.002
34-0.002
 upper-0.002
Token before
Token ID878
Feature activation+6.69e-03

Pos logit contributions
ags+0.016
 yourself+0.014
 yourselves+0.013
34+0.013
+0.012
Neg logit contributions
s-0.016
ashed-0.012
apy-0.012
 batch-0.012
ls-0.011
Token see
Token ID766
Feature activation-9.78e-05

Pos logit contributions
s+0.004
t+0.003
 of+0.003
 while+0.002
 together+0.002
Neg logit contributions
ags-0.004
 ey-0.003
 yourself-0.003
34-0.003
 yourselves-0.003
Token shells
Token ID19679
Feature activation+1.08e-02

Pos logit contributions
s+0.000
t+0.000
z+0.000
ounce+0.000
 of+0.000
Neg logit contributions
ags-0.000
 ey-0.000
 yourself-0.000
 yourselves-0.000
34-0.000
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
ags+0.022
 ey+0.019
 yourself+0.018
34+0.018
 yourselves+0.017
Neg logit contributions
t-0.015
s-0.015
 batch-0.012
ashed-0.012
z-0.012
Token rocks
Token ID12586
Feature activation+1.88e-02

Pos logit contributions
s+0.023
t+0.019
 of+0.018
 they+0.017
 together+0.017
Neg logit contributions
ags-0.016
 ey-0.014
 yourself-0.013
34-0.012
 yourselves-0.012
Token,
Token ID11
Feature activation-4.90e-03

Pos logit contributions
ags+0.039
 ey+0.034
 yourself+0.033
34+0.033
 yourselves+0.031
Neg logit contributions
s-0.025
t-0.023
 batch-0.019
ashed-0.019
ounce-0.018
Token seaw
Token ID32085
Feature activation+4.20e-02

Pos logit contributions
s+0.009
 of+0.007
t+0.007
 they+0.007
 while+0.007
Neg logit contributions
ags-0.010
 ey-0.008
34-0.008
 yourself-0.008
-0.007
Tokeneed
Token ID2308
Feature activation+1.17e-02

Pos logit contributions
ags+0.073
+0.064
 ey+0.062
 bluff+0.060
34+0.058
Neg logit contributions
s-0.086
 of-0.066
t-0.065
 while-0.061
 they-0.059
Token,
Token ID11
Feature activation-6.34e-04

Pos logit contributions
ags+0.025
 ey+0.022
34+0.022
 yourself+0.022
 yourselves+0.020
Neg logit contributions
s-0.015
t-0.014
 of-0.011
ashed-0.011
 batch-0.011
Token crabs
Token ID49172
Feature activation+2.49e-02

Pos logit contributions
s+0.001
 of+0.001
t+0.001
 they+0.001
 while+0.001
Neg logit contributions
ags-0.002
 ey-0.001
34-0.001
 yourself-0.001
-0.001
Token and
Token ID290
Feature activation-7.88e-03

Pos logit contributions
ags+0.053
 ey+0.047
 yourself+0.047
34+0.046
 yourselves+0.043
Neg logit contributions
s-0.033
t-0.030
 of-0.024
ashed-0.023
 batch-0.023
Token fish
Token ID5916
Feature activation-6.54e-03

Pos logit contributions
s+0.018
t+0.017
ounce+0.014
 of+0.014
z+0.014
Neg logit contributions
ags-0.011
 ey-0.010
 yourself-0.009
 yourselves-0.009
34-0.008
Token tentacles
Token ID41192
Feature activation+3.58e-03

Pos logit contributions
ags+0.011
 ey+0.010
 yourself+0.009
34+0.009
 upper+0.008
Neg logit contributions
s-0.013
t-0.011
 of-0.010
 they-0.010
 together-0.010
Token to
Token ID284
Feature activation+5.41e-03

Pos logit contributions
ags+0.007
 ey+0.006
 yourself+0.006
aces+0.005
per+0.005
Neg logit contributions
s-0.004
 batch-0.004
t-0.004
apy-0.004
ounce-0.003
Token reach
Token ID3151
Feature activation+1.94e-02

Pos logit contributions
ags+0.010
 yourself+0.008
 ey+0.008
 yourselves+0.007
ode+0.007
Neg logit contributions
s-0.012
t-0.010
ls-0.008
ashed-0.008
 of-0.008
Token for
Token ID329
Feature activation-1.48e-03

Pos logit contributions
ags+0.038
 yourself+0.033
 ey+0.032
34+0.030
 yourselves+0.029
Neg logit contributions
s-0.034
t-0.031
ashed-0.026
z-0.024
 of-0.023
Token the
Token ID262
Feature activation-1.25e-03

Pos logit contributions
s+0.003
t+0.002
 of+0.002
ashed+0.002
ounce+0.002
Neg logit contributions
ags-0.003
 yourself-0.002
 ey-0.002
34-0.002
 yourselves-0.002
Token seaw
Token ID32085
Feature activation+4.18e-02

Pos logit contributions
s+0.004
 together+0.003
t+0.003
 they+0.003
 of+0.003
Neg logit contributions
ags-0.001
 ey-0.001
34-0.001
 yourself-0.001
 upper-0.001
Tokeneed
Token ID2308
Feature activation+1.03e-02

Pos logit contributions
ags+0.063
 ey+0.051
34+0.047
 yourself+0.047
 yourselves+0.045
Neg logit contributions
s-0.097
t-0.081
 of-0.080
 they-0.076
 while-0.074
Token,
Token ID11
Feature activation+6.70e-03

Pos logit contributions
ags+0.019
 yourself+0.017
34+0.016
per+0.016
ode+0.016
Neg logit contributions
s-0.014
t-0.012
apy-0.012
 batch-0.011
ashed-0.011
Token and
Token ID290
Feature activation+5.88e-03

Pos logit contributions
ags+0.013
 ey+0.012
 yourself+0.012
 yourselves+0.011
ode+0.011
Neg logit contributions
s-0.012
t-0.010
 of-0.008
z-0.008
ls-0.008
Token he
Token ID339
Feature activation-4.84e-03

Pos logit contributions
 yourself+0.008
 anchor+0.008
 ey+0.007
 yourselves+0.007
ags+0.007
Neg logit contributions
s-0.011
t-0.010
ounce-0.008
z-0.008
om-0.007
Token wrapped
Token ID12908
Feature activation-1.65e-02

Pos logit contributions
s+0.008
t+0.008
less+0.007
 batch+0.007
ounce+0.007
Neg logit contributions
per-0.006
ode-0.006
 anchor-0.006
 upper-0.005
 head-0.005
Token you
Token ID345
Feature activation+1.29e-02

Pos logit contributions
ags+0.038
 ey+0.035
34+0.032
+0.030
ode+0.030
Neg logit contributions
s-0.030
 of-0.024
t-0.023
 they-0.022
 together-0.021
Token for
Token ID329
Feature activation+4.44e-03

Pos logit contributions
ags+0.031
34+0.027
 yourself+0.026
 ey+0.025
 yourselves+0.025
Neg logit contributions
s-0.017
t-0.013
ashed-0.011
 of-0.010
 they-0.010
Token the
Token ID262
Feature activation-6.28e-03

Pos logit contributions
ags+0.010
 ey+0.009
34+0.008
+0.008
ode+0.008
Neg logit contributions
s-0.006
 of-0.005
 they-0.005
 together-0.005
 while-0.005
Token salad
Token ID20698
Feature activation+2.03e-02

Pos logit contributions
s+0.014
t+0.011
ounce+0.010
 of+0.010
z+0.010
Neg logit contributions
ags-0.010
 ey-0.009
 yourself-0.008
34-0.007
ode-0.007
Token,
Token ID11
Feature activation+3.01e-03

Pos logit contributions
ags+0.035
 yourself+0.032
 yourselves+0.029
34+0.029
 ma+0.026
Neg logit contributions
s-0.033
rolled-0.030
t-0.029
ounce-0.028
oped-0.028
Token ma
Token ID17266
Feature activation+4.16e-02

Pos logit contributions
ags+0.006
 ey+0.005
34+0.005
 yourself+0.005
 yourselves+0.004
Neg logit contributions
s-0.006
t-0.005
 of-0.004
 while-0.004
 together-0.004
Token'
Token ID6
Feature activation+9.41e-03

Pos logit contributions
ags+0.079
 ey+0.067
34+0.062
 yourself+0.061
 yourselves+0.060
Neg logit contributions
s-0.082
t-0.066
 of-0.065
 they-0.060
 while-0.059
Tokenam
Token ID321
Feature activation+1.85e-02

Pos logit contributions
ags+0.019
 yourself+0.015
 yourselves+0.014
 ey+0.014
34+0.013
Neg logit contributions
s-0.017
t-0.015
 while-0.014
 of-0.013
 they-0.013
Token,"
Token ID553
Feature activation-8.07e-03

Pos logit contributions
ags+0.037
 yourself+0.032
 yourselves+0.029
34+0.026
+0.026
Neg logit contributions
s-0.029
t-0.026
apy-0.024
 they-0.023
 batch-0.022
Token Lena
Token ID44131
Feature activation-2.41e-03

Pos logit contributions
s+0.015
t+0.014
 of+0.012
ashed+0.012
 while+0.011
Neg logit contributions
ags-0.014
 ey-0.012
34-0.012
 yourself-0.011
 yourselves-0.011
Token said
Token ID531
Feature activation-3.47e-03

Pos logit contributions
s+0.003
t+0.003
 of+0.002
less+0.002
 while+0.002
Neg logit contributions
ags-0.005
 ey-0.005
 yourself-0.005
34-0.005
 yourselves-0.004
Token honest
Token ID5508
Feature activation+1.51e-02

Pos logit contributions
ags+0.004
 ey+0.004
 yourself+0.004
34+0.004
 yourselves+0.003
Neg logit contributions
s-0.005
t-0.004
 of-0.004
 they-0.004
 while-0.003
Token,
Token ID11
Feature activation+4.96e-04

Pos logit contributions
ags+0.029
 yourself+0.028
 yourselves+0.025
34+0.024
 ey+0.023
Neg logit contributions
s-0.024
t-0.021
ls-0.019
apy-0.019
 mun-0.018
Token even
Token ID772
Feature activation+1.34e-04

Pos logit contributions
ags+0.001
 yourself+0.001
 ey+0.001
 yourselves+0.001
34+0.001
Neg logit contributions
s-0.001
t-0.001
ls-0.001
z-0.001
ashed-0.001
Token in
Token ID287
Feature activation+6.14e-03

Pos logit contributions
ags+0.000
 ey+0.000
 yourself+0.000
34+0.000
 yourselves+0.000
Neg logit contributions
s-0.000
t-0.000
 of-0.000
 they-0.000
 while-0.000
Token difficult
Token ID2408
Feature activation+1.24e-03

Pos logit contributions
ags+0.012
 yourself+0.011
 ey+0.010
 yourselves+0.010
34+0.010
Neg logit contributions
s-0.012
t-0.009
 of-0.008
ashed-0.008
 they-0.008
Token situations
Token ID7445
Feature activation+4.15e-02

Pos logit contributions
ags+0.002
 ey+0.001
 yourself+0.001
34+0.001
 yourselves+0.001
Neg logit contributions
s-0.003
t-0.002
 of-0.002
 they-0.002
ashed-0.002
Token.
Token ID13
Feature activation-2.72e-03

Pos logit contributions
ags+0.084
 yourself+0.076
34+0.073
 yourselves+0.071
 ey+0.071
Neg logit contributions
s-0.064
t-0.051
ashed-0.047
ounce-0.045
 of-0.044
Token<|endoftext|>
Token ID50256
Feature activation-5.35e-03

Pos logit contributions
s+0.004
t+0.003
 of+0.003
 they+0.003
 while+0.003
Neg logit contributions
ags-0.006
 ey-0.006
34-0.005
 yourself-0.005
 yourselves-0.005
TokenOnce
Token ID7454
Feature activation+1.49e-04

Pos logit contributions
s+0.008
t+0.006
 of+0.006
 they+0.006
 while+0.006
Neg logit contributions
ags-0.012
 ey-0.010
 yourself-0.010
34-0.010
 yourselves-0.009
Token upon
Token ID2402
Feature activation+1.25e-02

Pos logit contributions
ags+0.000
 ey+0.000
34+0.000
 yourself+0.000
+0.000
Neg logit contributions
s-0.000
t-0.000
 while-0.000
ashed-0.000
 of-0.000
Token a
Token ID257
Feature activation+6.96e-03

Pos logit contributions
ags+0.027
 ey+0.023
 yourself+0.022
34+0.022
 yourselves+0.022
Neg logit contributions
s-0.021
t-0.016
 of-0.015
 they-0.014
 together-0.013
Token he
Token ID339
Feature activation-2.62e-03

Pos logit contributions
ags+0.000
 ey+0.000
 yourself+0.000
 yourselves+0.000
+0.000
Neg logit contributions
s-0.000
t-0.000
ounce-0.000
z-0.000
by-0.000
Token decided
Token ID3066
Feature activation+8.44e-03

Pos logit contributions
s+0.004
t+0.004
 of+0.003
 they+0.003
 while+0.003
Neg logit contributions
ags-0.005
 ey-0.004
34-0.004
 yourself-0.004
ode-0.004
Token to
Token ID284
Feature activation+4.66e-03

Pos logit contributions
ags+0.020
 yourself+0.019
 yourselves+0.018
 ey+0.018
34+0.018
Neg logit contributions
s-0.010
t-0.007
 of-0.007
ashed-0.005
z-0.005
Token remove
Token ID4781
Feature activation+5.63e-03

Pos logit contributions
ags+0.008
 yourself+0.007
 ey+0.007
ode+0.006
 yourselves+0.006
Neg logit contributions
s-0.010
t-0.008
ls-0.007
 of-0.007
ashed-0.007
Token the
Token ID262
Feature activation+4.92e-03

Pos logit contributions
ags+0.011
 ey+0.010
 yourself+0.009
 yourselves+0.009
34+0.009
Neg logit contributions
s-0.010
t-0.009
 of-0.008
 they-0.007
 while-0.007
Token seaw
Token ID32085
Feature activation+4.14e-02

Pos logit contributions
ags+0.009
 ey+0.007
34+0.007
 yourself+0.007
 yourselves+0.007
Neg logit contributions
s-0.010
t-0.008
 of-0.008
 they-0.008
 while-0.007
Tokeneed
Token ID2308
Feature activation+1.18e-02

Pos logit contributions
ags+0.078
+0.062
 ey+0.062
 yourselves+0.062
 yourself+0.061
Neg logit contributions
s-0.087
t-0.068
 of-0.064
 while-0.060
ounce-0.056
Token from
Token ID422
Feature activation+4.68e-03

Pos logit contributions
ags+0.026
 ey+0.023
 yourself+0.023
34+0.022
 yourselves+0.022
Neg logit contributions
s-0.015
t-0.012
 of-0.011
ashed-0.010
 they-0.010
Token the
Token ID262
Feature activation+4.04e-03

Pos logit contributions
ags+0.009
 ey+0.008
 yourself+0.007
34+0.007
 yourselves+0.007
Neg logit contributions
s-0.009
t-0.007
 of-0.007
 they-0.006
 while-0.006
Token water
Token ID1660
Feature activation+2.49e-02

Pos logit contributions
ags+0.008
 ey+0.006
34+0.006
+0.006
 yourself+0.006
Neg logit contributions
s-0.008
t-0.006
 of-0.006
 they-0.006
 while-0.006
Token so
Token ID523
Feature activation-4.38e-03

Pos logit contributions
ags+0.054
 ey+0.047
 yourself+0.046
34+0.045
 yourselves+0.044
Neg logit contributions
s-0.035
t-0.029
 of-0.024
z-0.023
 they-0.023
Token said
Token ID531
Feature activation+7.40e-03

Pos logit contributions
ags+0.016
 yourself+0.014
 ey+0.014
+0.013
 yourselves+0.012
Neg logit contributions
s-0.014
t-0.013
 of-0.011
ounce-0.010
 while-0.009
Token,
Token ID11
Feature activation+7.30e-03

Pos logit contributions
ags+0.017
 ey+0.014
 yourself+0.014
34+0.014
 yourselves+0.014
Neg logit contributions
s-0.011
t-0.008
 of-0.007
 they-0.007
 while-0.007
Token "
Token ID366
Feature activation+1.06e-02

Pos logit contributions
ags+0.013
 ey+0.011
34+0.011
 yourself+0.010
 yourselves+0.010
Neg logit contributions
s-0.014
t-0.012
 of-0.011
 they-0.010
 while-0.010
TokenThank
Token ID10449
Feature activation+1.96e-02

Pos logit contributions
ags+0.018
 ey+0.015
34+0.014
 yourself+0.013
 yourselves+0.013
Neg logit contributions
s-0.023
t-0.019
 of-0.019
 they-0.018
 while-0.017
Token you
Token ID345
Feature activation+9.09e-03

Pos logit contributions
ags+0.043
 ey+0.038
34+0.035
+0.033
 yourselves+0.032
Neg logit contributions
s-0.033
 of-0.027
t-0.026
 they-0.024
 together-0.023
Token mum
Token ID25682
Feature activation+4.13e-02

Pos logit contributions
ags+0.020
34+0.018
 yourself+0.018
+0.017
 ey+0.017
Neg logit contributions
s-0.014
t-0.010
ashed-0.009
ls-0.008
 they-0.008
Token!"
Token ID2474
Feature activation-8.51e-03

Pos logit contributions
ags+0.075
 yourself+0.073
34+0.067
+0.064
 yourselves+0.064
Neg logit contributions
s-0.065
t-0.056
ashed-0.053
apy-0.052
uttered-0.049
Token He
Token ID679
Feature activation-3.44e-03

Pos logit contributions
s+0.016
t+0.014
 of+0.012
ls+0.012
 while+0.012
Neg logit contributions
ags-0.015
 ey-0.013
 yourself-0.012
-0.011
 yourselves-0.011
Token was
Token ID373
Feature activation+7.09e-03

Pos logit contributions
s+0.005
t+0.005
less+0.004
ounce+0.004
seeing+0.004
Neg logit contributions
ags-0.006
 ey-0.005
ode-0.005
rac-0.005
 yourself-0.005
Token so
Token ID523
Feature activation+7.20e-03

Pos logit contributions
ags+0.012
 yourself+0.010
 ey+0.010
+0.009
34+0.009
Neg logit contributions
s-0.015
t-0.012
ls-0.011
 while-0.010
ounce-0.010
Token thankful
Token ID25535
Feature activation+4.10e-03

Pos logit contributions
ags+0.011
 ey+0.009
 yourself+0.009
34+0.008
+0.008
Neg logit contributions
s-0.015
t-0.013
 while-0.011
ounce-0.011
ls-0.011
Token permission
Token ID7170
Feature activation+2.59e-02

Pos logit contributions
ags+0.016
 ey+0.013
34+0.013
 yourself+0.012
 yourselves+0.012
Neg logit contributions
s-0.015
t-0.012
 of-0.012
 they-0.011
 while-0.011
Token and
Token ID290
Feature activation-5.90e-03

Pos logit contributions
ags+0.062
 yourself+0.056
34+0.053
 ey+0.053
 yourselves+0.052
Neg logit contributions
s-0.035
t-0.025
 of-0.023
 they-0.022
ashed-0.022
Token to
Token ID284
Feature activation+1.04e-03

Pos logit contributions
s+0.011
t+0.008
ashed+0.008
ls+0.008
z+0.007
Neg logit contributions
ags-0.011
 yourself-0.011
 yourselves-0.010
 ey-0.010
ode-0.009
Token respect
Token ID2461
Feature activation+1.17e-02

Pos logit contributions
ags+0.002
 ey+0.002
34+0.002
 yourself+0.002
 yourselves+0.002
Neg logit contributions
s-0.002
t-0.001
 of-0.001
 they-0.001
 while-0.001
Token the
Token ID262
Feature activation+4.71e-04

Pos logit contributions
ags+0.024
 ey+0.020
 yourself+0.019
34+0.019
 yourselves+0.019
Neg logit contributions
s-0.021
t-0.017
 of-0.016
 they-0.015
 while-0.015
Token rules
Token ID3173
Feature activation+4.13e-02

Pos logit contributions
ags+0.001
 ey+0.001
 yourself+0.001
34+0.001
 yourselves+0.001
Neg logit contributions
s-0.001
t-0.001
 of-0.001
 they-0.001
ounce-0.001
Token.
Token ID13
Feature activation+2.18e-03

Pos logit contributions
ags+0.089
 yourself+0.080
34+0.077
 yourselves+0.074
 ey+0.072
Neg logit contributions
s-0.057
ashed-0.044
t-0.042
ounce-0.040
 mun-0.040
Token She
Token ID1375
Feature activation+1.47e-03

Pos logit contributions
ags+0.005
 ey+0.004
34+0.004
 yourself+0.004
 yourselves+0.004
Neg logit contributions
s-0.004
t-0.003
 of-0.003
 they-0.003
 while-0.003
Token also
Token ID635
Feature activation+6.07e-03

Pos logit contributions
ags+0.002
 ey+0.002
 yourself+0.002
34+0.002
ode+0.002
Neg logit contributions
s-0.003
t-0.002
 of-0.002
 while-0.002
less-0.002
Token learned
Token ID4499
Feature activation-4.15e-03

Pos logit contributions
ags+0.011
 ey+0.010
 yourself+0.010
34+0.009
 yourselves+0.009
Neg logit contributions
s-0.011
t-0.009
 of-0.008
 while-0.008
 they-0.008
Token that
Token ID326
Feature activation-1.41e-02

Pos logit contributions
s+0.006
t+0.004
ashed+0.004
z+0.004
 of+0.004
Neg logit contributions
ags-0.010
 yourself-0.009
 ey-0.008
 yourselves-0.008
34-0.008
Token He
Token ID679
Feature activation+8.47e-03

Pos logit contributions
ags+0.005
 ey+0.004
 yourself+0.004
34+0.004
ode+0.004
Neg logit contributions
s-0.003
t-0.002
 of-0.002
 they-0.002
 while-0.002
Token said
Token ID531
Feature activation+2.02e-04

Pos logit contributions
ags+0.016
 ey+0.014
34+0.014
 yourself+0.013
 yourselves+0.013
Neg logit contributions
s-0.013
t-0.011
 of-0.009
 while-0.009
ounce-0.009
Token,
Token ID11
Feature activation+1.39e-03

Pos logit contributions
ags+0.000
 yourself+0.000
34+0.000
 yourselves+0.000
 ey+0.000
Neg logit contributions
s-0.000
t-0.000
 batch-0.000
ounce-0.000
ashed-0.000
Token "
Token ID366
Feature activation+4.53e-03

Pos logit contributions
ags+0.003
 ey+0.003
 yourself+0.003
 yourselves+0.002
34+0.002
Neg logit contributions
s-0.002
 of-0.002
t-0.002
 they-0.002
 while-0.002
TokenYour
Token ID7120
Feature activation-2.63e-03

Pos logit contributions
ags+0.008
 ey+0.007
34+0.006
 yourself+0.006
+0.006
Neg logit contributions
s-0.010
t-0.008
 of-0.008
 they-0.008
 while-0.007
Token kids
Token ID3988
Feature activation+4.13e-02

Pos logit contributions
s+0.006
t+0.005
 of+0.005
 together+0.004
 while+0.004
Neg logit contributions
ags-0.004
 ey-0.004
 yourself-0.004
34-0.003
 yourselves-0.003
Token marked
Token ID7498
Feature activation+3.31e-03

Pos logit contributions
ags+0.071
 yourself+0.067
 yourselves+0.061
34+0.057
 ey+0.055
Neg logit contributions
s-0.073
t-0.062
ashed-0.058
 while-0.055
ls-0.054
Token on
Token ID319
Feature activation+4.47e-03

Pos logit contributions
ags+0.007
 yourself+0.006
 ey+0.006
 yourselves+0.005
34+0.005
Neg logit contributions
s-0.006
t-0.005
 of-0.005
 while-0.004
 they-0.004
Token my
Token ID616
Feature activation+1.58e-03

Pos logit contributions
ags+0.008
 yourself+0.007
 yourselves+0.007
 ey+0.007
34+0.007
Neg logit contributions
s-0.009
t-0.007
 of-0.007
ashed-0.006
 they-0.006
Token barn
Token ID25203
Feature activation-8.22e-04

Pos logit contributions
ags+0.003
 ey+0.002
 yourself+0.002
34+0.002
 yourselves+0.002
Neg logit contributions
s-0.003
 of-0.002
t-0.002
 they-0.002
 while-0.002
Token with
Token ID351
Feature activation+4.23e-03

Pos logit contributions
s+0.001
t+0.001
ashed+0.001
 of+0.001
ls+0.001
Neg logit contributions
ags-0.002
 yourself-0.001
 yourselves-0.001
34-0.001
 ey-0.001
Token saw
Token ID2497
Feature activation+4.75e-03

Pos logit contributions
s+0.003
t+0.003
othe+0.002
apy+0.002
less+0.002
Neg logit contributions
 ey-0.002
 yourself-0.002
ags-0.002
-0.002
P-0.002
Token a
Token ID257
Feature activation+2.33e-03

Pos logit contributions
ags+0.007
 yourself+0.007
 ey+0.006
 yourselves+0.006
34+0.006
Neg logit contributions
s-0.011
t-0.009
ounce-0.008
z-0.008
ashed-0.007
Token dusty
Token ID36972
Feature activation+4.89e-03

Pos logit contributions
 ey+0.004
ags+0.003
34+0.003
 yourself+0.003
 bluff+0.003
Neg logit contributions
s-0.005
t-0.004
 of-0.004
ashed-0.004
 together-0.003
Token old
Token ID1468
Feature activation+1.28e-02

Pos logit contributions
ags+0.007
 ey+0.007
 yourself+0.006
 yourselves+0.005
34+0.005
Neg logit contributions
s-0.011
t-0.010
 of-0.009
 they-0.009
 together-0.008
Token ash
Token ID12530
Feature activation-5.94e-03

Pos logit contributions
 ey+0.017
ags+0.017
 yourself+0.014
 yourselves+0.014
34+0.013
Neg logit contributions
s-0.029
t-0.026
ashed-0.023
 of-0.022
 together-0.022
Tokentr
Token ID2213
Feature activation-5.45e-02

Pos logit contributions
s+0.011
t+0.009
 of+0.009
 they+0.008
 while+0.008
Neg logit contributions
ags-0.011
 ey-0.010
34-0.009
 yourself-0.009
 yourselves-0.008
Tokenay
Token ID323
Feature activation-1.68e-02

Pos logit contributions
s+0.088
t+0.067
 of+0.065
 they+0.063
 while+0.061
Neg logit contributions
ags-0.120
 ey-0.104
34-0.098
 yourself-0.096
 yourselves-0.095
Token on
Token ID319
Feature activation-1.24e-02

Pos logit contributions
s+0.027
t+0.021
ashed+0.020
ounce+0.019
 of+0.018
Neg logit contributions
ags-0.034
34-0.029
 yourself-0.028
 ey-0.028
-0.026
Token the
Token ID262
Feature activation+4.79e-03

Pos logit contributions
s+0.020
t+0.016
 of+0.016
 they+0.014
 while+0.014
Neg logit contributions
ags-0.027
 ey-0.024
34-0.022
 yourself-0.022
 yourselves-0.022
Token shelf
Token ID18316
Feature activation-5.32e-03

Pos logit contributions
ags+0.008
 ey+0.006
34+0.006
 yourself+0.005
 yourselves+0.005
Neg logit contributions
s-0.011
t-0.009
 of-0.009
 they-0.008
 while-0.008
Token.
Token ID13
Feature activation+3.62e-03

Pos logit contributions
s+0.009
t+0.007
ashed+0.006
 they+0.006
 of+0.006
Neg logit contributions
ags-0.011
 ey-0.009
34-0.009
 yourself-0.009
 yourselves-0.009
Token they
Token ID484
Feature activation-3.76e-03

Pos logit contributions
ags+0.012
 yourselves+0.010
 ey+0.010
 yourself+0.010
34+0.010
Neg logit contributions
s-0.004
t-0.003
 of-0.002
 batch-0.002
ounce-0.002
Token see
Token ID766
Feature activation+3.73e-03

Pos logit contributions
s+0.007
t+0.005
 of+0.005
 they+0.005
 while+0.005
Neg logit contributions
ags-0.008
 ey-0.007
 yourself-0.006
34-0.006
 yourselves-0.006
Token a
Token ID257
Feature activation+4.01e-03

Pos logit contributions
ags+0.009
 ey+0.007
 yourself+0.007
34+0.007
 yourselves+0.007
Neg logit contributions
s-0.006
t-0.005
 of-0.004
 they-0.004
 while-0.004
Token big
Token ID1263
Feature activation-3.91e-04

Pos logit contributions
ags+0.008
 ey+0.007
34+0.007
 yourself+0.006
 yourselves+0.006
Neg logit contributions
s-0.007
t-0.006
 of-0.005
 together-0.005
 they-0.005
Token ash
Token ID12530
Feature activation-6.96e-03

Pos logit contributions
s+0.001
t+0.001
 of+0.001
 they+0.001
 together+0.001
Neg logit contributions
ags-0.001
 ey-0.001
34-0.001
 yourself-0.001
 yourselves-0.001
Tokentr
Token ID2213
Feature activation-5.27e-02

Pos logit contributions
s+0.013
t+0.010
 of+0.010
 they+0.009
 while+0.009
Neg logit contributions
ags-0.014
 ey-0.012
34-0.011
 yourself-0.011
 yourselves-0.011
Tokenay
Token ID323
Feature activation-1.55e-02

Pos logit contributions
s+0.083
t+0.064
 of+0.062
 they+0.058
 while+0.057
Neg logit contributions
ags-0.118
 ey-0.102
34-0.097
 yourself-0.096
 yourselves-0.094
Token near
Token ID1474
Feature activation+1.79e-02

Pos logit contributions
s+0.025
t+0.020
ashed+0.017
 of+0.017
 while+0.016
Neg logit contributions
ags-0.033
34-0.027
 ey-0.027
 yourself-0.026
 yourselves-0.026
Token a
Token ID257
Feature activation+7.24e-03

Pos logit contributions
ags+0.040
 ey+0.035
34+0.033
 yourself+0.033
 yourselves+0.032
Neg logit contributions
s-0.029
t-0.022
 of-0.022
 they-0.020
 while-0.019
Token bench
Token ID7624
Feature activation-5.43e-03

Pos logit contributions
ags+0.015
 ey+0.013
34+0.012
 yourself+0.012
 yourselves+0.012
Neg logit contributions
s-0.013
 of-0.010
t-0.010
 they-0.010
 while-0.009
Token.
Token ID13
Feature activation+1.26e-03

Pos logit contributions
s+0.009
t+0.007
 of+0.006
 they+0.006
 while+0.006
Neg logit contributions
ags-0.011
 ey-0.010
 yourself-0.009
 yourselves-0.009
34-0.009
Token.
Token ID13
Feature activation-2.67e-03

Pos logit contributions
ags+0.041
 ey+0.036
34+0.035
 yourself+0.035
ode+0.033
Neg logit contributions
s-0.034
t-0.026
ashed-0.024
 they-0.024
 together-0.023
Token It
Token ID632
Feature activation-9.88e-03

Pos logit contributions
s+0.005
t+0.003
 of+0.003
 they+0.003
 while+0.003
Neg logit contributions
ags-0.006
34-0.005
 ey-0.005
 yourself-0.005
 yourselves-0.005
Token was
Token ID373
Feature activation-3.58e-03

Pos logit contributions
s+0.011
t+0.008
 of+0.008
 while+0.007
ashed+0.007
Neg logit contributions
ags-0.023
 ey-0.022
 yourself-0.021
-0.021
34-0.020
Token an
Token ID281
Feature activation+5.92e-03

Pos logit contributions
s+0.006
t+0.005
 of+0.004
 they+0.004
 together+0.004
Neg logit contributions
ags-0.008
 ey-0.007
 yourself-0.006
34-0.006
 yourselves-0.006
Token ash
Token ID12530
Feature activation-3.28e-03

Pos logit contributions
ags+0.012
 ey+0.010
 yourself+0.009
34+0.009
 yourselves+0.009
Neg logit contributions
s-0.010
 of-0.010
t-0.008
 while-0.008
 they-0.007
Tokentr
Token ID2213
Feature activation-5.24e-02

Pos logit contributions
s+0.007
t+0.005
 of+0.005
 they+0.005
 together+0.005
Neg logit contributions
ags-0.006
 ey-0.005
34-0.005
 yourself-0.004
 yourselves-0.004
Tokenay
Token ID323
Feature activation-1.11e-02

Pos logit contributions
s+0.083
t+0.063
 of+0.061
 they+0.058
 while+0.056
Neg logit contributions
ags-0.118
 ey-0.102
34-0.097
 yourself-0.096
 yourselves-0.094
Token.
Token ID13
Feature activation+7.07e-04

Pos logit contributions
s+0.017
t+0.013
ashed+0.012
 of+0.012
 they+0.012
Neg logit contributions
ags-0.023
34-0.020
 yourself-0.020
 ey-0.019
 yourselves-0.019
Token He
Token ID679
Feature activation-9.97e-04

Pos logit contributions
ags+0.002
 ey+0.001
34+0.001
 yourself+0.001
 yourselves+0.001
Neg logit contributions
s-0.001
t-0.001
 of-0.001
 they-0.001
 while-0.001
Token wanted
Token ID2227
Feature activation+5.02e-03

Pos logit contributions
s+0.002
t+0.001
 of+0.001
 they+0.001
 together+0.001
Neg logit contributions
ags-0.002
 ey-0.002
 yourself-0.002
34-0.002
 yourselves-0.002
Token to
Token ID284
Feature activation+5.81e-03

Pos logit contributions
ags+0.013
 ey+0.011
 yourself+0.011
 yourselves+0.011
34+0.011
Neg logit contributions
s-0.006
t-0.004
ashed-0.004
 of-0.003
z-0.003
Token they
Token ID484
Feature activation-1.43e-03

Pos logit contributions
s+0.014
t+0.011
 of+0.010
ounce+0.008
ashed+0.007
Neg logit contributions
ags-0.026
 ey-0.023
 yourself-0.022
34-0.022
 yourselves-0.022
Token also
Token ID635
Feature activation+3.43e-03

Pos logit contributions
s+0.003
t+0.002
 of+0.002
 while+0.002
 together+0.002
Neg logit contributions
ags-0.003
 ey-0.002
 yourself-0.002
 yourselves-0.002
34-0.002
Token see
Token ID766
Feature activation+1.04e-03

Pos logit contributions
ags+0.007
 ey+0.006
 yourself+0.006
 yourselves+0.006
+0.006
Neg logit contributions
s-0.006
t-0.005
 of-0.005
 together-0.004
 they-0.004
Token an
Token ID281
Feature activation+5.22e-03

Pos logit contributions
ags+0.002
34+0.002
 yourself+0.002
 yourselves+0.002
 ey+0.002
Neg logit contributions
s-0.002
t-0.002
 of-0.001
ashed-0.001
ounce-0.001
Token ash
Token ID12530
Feature activation-6.15e-03

Pos logit contributions
ags+0.010
 ey+0.008
 yourself+0.008
34+0.008
 yourselves+0.008
Neg logit contributions
s-0.009
 of-0.009
t-0.007
 while-0.007
 they-0.007
Tokentr
Token ID2213
Feature activation-5.16e-02

Pos logit contributions
s+0.012
t+0.009
 of+0.009
 they+0.009
 while+0.008
Neg logit contributions
ags-0.012
 ey-0.010
34-0.010
 yourself-0.010
 yourselves-0.009
Tokenay
Token ID323
Feature activation-1.53e-02

Pos logit contributions
s+0.081
t+0.062
 of+0.060
 they+0.056
 while+0.054
Neg logit contributions
ags-0.117
 ey-0.101
34-0.096
 yourself-0.095
 yourselves-0.093
Token on
Token ID319
Feature activation-1.63e-02

Pos logit contributions
s+0.025
t+0.019
ashed+0.018
ounce+0.017
 mun+0.017
Neg logit contributions
ags-0.032
34-0.027
 yourself-0.026
 yourselves-0.025
 ey-0.024
Token the
Token ID262
Feature activation+2.19e-03

Pos logit contributions
s+0.027
t+0.021
 of+0.020
 they+0.019
 while+0.018
Neg logit contributions
ags-0.036
 ey-0.031
34-0.029
 yourself-0.029
 yourselves-0.028
Token table
Token ID3084
Feature activation-1.07e-02

Pos logit contributions
ags+0.004
 ey+0.003
34+0.003
 yourself+0.003
 yourselves+0.003
Neg logit contributions
s-0.004
 of-0.003
t-0.003
 while-0.003
 they-0.003
Token.
Token ID13
Feature activation+2.41e-03

Pos logit contributions
s+0.018
t+0.015
ounce+0.013
 of+0.013
ashed+0.012
Neg logit contributions
ags-0.022
 yourself-0.019
 ey-0.018
 yourselves-0.018
34-0.018
Token backyard
Token ID24296
Feature activation+1.65e-03

Pos logit contributions
ags+0.005
 ey+0.004
 yourself+0.004
34+0.004
 yourselves+0.004
Neg logit contributions
s-0.006
t-0.005
 of-0.005
 they-0.005
 together-0.005
Token there
Token ID612
Feature activation-1.06e-02

Pos logit contributions
ags+0.003
 ey+0.003
 yourself+0.003
 yourselves+0.003
34+0.003
Neg logit contributions
s-0.002
t-0.002
ashed-0.002
ounce-0.002
ls-0.002
Token was
Token ID373
Feature activation-1.98e-04

Pos logit contributions
s+0.016
t+0.012
 of+0.011
 they+0.010
 while+0.010
Neg logit contributions
ags-0.024
 ey-0.021
 yourself-0.020
34-0.020
 yourselves-0.020
Token an
Token ID281
Feature activation+6.68e-03

Pos logit contributions
s+0.000
t+0.000
ounce+0.000
ls+0.000
 of+0.000
Neg logit contributions
ags-0.000
 ey-0.000
 yourself-0.000
 yourselves-0.000
34-0.000
Token ash
Token ID12530
Feature activation-5.90e-03

Pos logit contributions
ags+0.014
 ey+0.011
 yourself+0.011
34+0.011
 Deal+0.011
Neg logit contributions
s-0.011
 of-0.010
 while-0.009
 they-0.008
 together-0.008
Tokentr
Token ID2213
Feature activation-5.14e-02

Pos logit contributions
s+0.012
t+0.010
 of+0.009
 they+0.009
 together+0.009
Neg logit contributions
ags-0.010
 ey-0.009
34-0.008
 yourself-0.008
 yourselves-0.008
Tokenay
Token ID323
Feature activation-1.47e-02

Pos logit contributions
s+0.085
t+0.065
 they+0.063
 while+0.063
 of+0.062
Neg logit contributions
ags-0.110
 ey-0.093
34-0.088
 yourself-0.086
 yourselves-0.085
Token.
Token ID13
Feature activation+1.60e-03

Pos logit contributions
s+0.025
t+0.019
ashed+0.017
ounce+0.017
by+0.016
Neg logit contributions
ags-0.029
34-0.025
 yourself-0.024
 yourselves-0.023
 ey-0.023
Token It
Token ID632
Feature activation-8.95e-03

Pos logit contributions
ags+0.003
 ey+0.003
34+0.003
 yourself+0.003
 yourselves+0.003
Neg logit contributions
s-0.003
 of-0.002
t-0.002
 they-0.002
 while-0.002
Token was
Token ID373
Feature activation-1.24e-03

Pos logit contributions
s+0.011
t+0.008
 of+0.007
ounce+0.007
 while+0.006
Neg logit contributions
ags-0.021
 ey-0.020
 yourself-0.019
-0.018
34-0.018
Token the
Token ID262
Feature activation-4.53e-03

Pos logit contributions
s+0.002
t+0.002
 of+0.002
 they+0.002
 together+0.002
Neg logit contributions
ags-0.002
 ey-0.002
 yourself-0.002
34-0.002
 yourselves-0.002
Token and
Token ID290
Feature activation-1.36e-02

Pos logit contributions
s+0.019
t+0.014
 of+0.013
 they+0.012
 while+0.012
Neg logit contributions
ags-0.035
 ey-0.030
 yourself-0.029
34-0.028
 yourselves-0.028
Token fell
Token ID3214
Feature activation-2.13e-02

Pos logit contributions
s+0.017
t+0.012
 of+0.012
 together+0.010
 batch+0.010
Neg logit contributions
ags-0.032
 yourself-0.029
 ey-0.029
 yourselves-0.028
-0.027
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
s+0.032
t+0.024
 of+0.023
 they+0.021
 while+0.020
Neg logit contributions
ags-0.050
 ey-0.043
 yourself-0.041
34-0.041
 yourselves-0.040
Token The
Token ID383
Feature activation-2.61e-03

Pos logit contributions
ags+0.021
 ey+0.018
34+0.017
 yourself+0.017
 yourselves+0.016
Neg logit contributions
s-0.019
t-0.015
 of-0.014
 they-0.014
 while-0.013
Token ash
Token ID12530
Feature activation-6.75e-03

Pos logit contributions
s+0.006
t+0.005
 together+0.005
 they+0.004
ounce+0.004
Neg logit contributions
ags-0.004
 ey-0.004
 yourself-0.003
 yourselves-0.003
 upper-0.003
Tokentr
Token ID2213
Feature activation-5.13e-02

Pos logit contributions
s+0.010
t+0.008
 of+0.008
 they+0.007
 together+0.007
Neg logit contributions
ags-0.015
 ey-0.013
 yourself-0.012
 yourselves-0.012
34-0.012
Tokenay
Token ID323
Feature activation-9.03e-03

Pos logit contributions
s+0.082
t+0.063
 of+0.061
 they+0.058
 while+0.058
Neg logit contributions
ags-0.113
 ey-0.098
34-0.092
 yourself-0.091
 yourselves-0.090
Token flew
Token ID13112
Feature activation-9.70e-03

Pos logit contributions
s+0.012
t+0.009
 of+0.009
ounce+0.008
 while+0.008
Neg logit contributions
ags-0.020
 yourself-0.017
 ey-0.017
 yourselves-0.017
34-0.017
Token out
Token ID503
Feature activation-1.10e-02

Pos logit contributions
s+0.016
t+0.012
 they+0.011
 of+0.011
 batch+0.010
Neg logit contributions
ags-0.021
 ey-0.017
 yourself-0.017
 yourselves-0.016
34-0.016
Token of
Token ID286
Feature activation-6.29e-03

Pos logit contributions
s+0.012
t+0.009
ashed+0.007
ounce+0.007
 batch+0.007
Neg logit contributions
ags-0.028
 yourself-0.024
 ey-0.024
34-0.023
 yourselves-0.023
Token her
Token ID607
Feature activation+1.19e-02

Pos logit contributions
s+0.010
t+0.008
 of+0.008
 they+0.007
 while+0.007
Neg logit contributions
ags-0.014
 ey-0.012
34-0.011
 yourself-0.011
 yourselves-0.011
Token room
Token ID2119
Feature activation-5.08e-03

Pos logit contributions
ags+0.010
 ey+0.008
 yourself+0.008
 yourselves+0.007
34+0.007
Neg logit contributions
s-0.015
t-0.013
 of-0.012
 they-0.012
 together-0.012
Token and
Token ID290
Feature activation+5.84e-03

Pos logit contributions
s+0.008
t+0.006
ounce+0.006
ls+0.006
rolled+0.006
Neg logit contributions
ags-0.009
 ey-0.008
34-0.008
 yourself-0.008
iece-0.007
Token saw
Token ID2497
Feature activation+4.04e-03

Pos logit contributions
ags+0.013
 ey+0.012
 yourself+0.011
 yourselves+0.011
34+0.011
Neg logit contributions
s-0.009
t-0.007
 of-0.007
ls-0.005
 they-0.005
Token an
Token ID281
Feature activation+7.07e-03

Pos logit contributions
ags+0.007
 yourself+0.006
 ey+0.006
 yourselves+0.006
34+0.005
Neg logit contributions
s-0.008
t-0.007
ounce-0.006
ashed-0.006
z-0.006
Token ash
Token ID12530
Feature activation-6.35e-03

Pos logit contributions
ags+0.013
 ey+0.011
34+0.010
 yourself+0.010
+0.010
Neg logit contributions
s-0.013
 of-0.012
t-0.011
 while-0.010
 they-0.010
Tokentr
Token ID2213
Feature activation-5.09e-02

Pos logit contributions
s+0.013
t+0.010
 of+0.010
 they+0.009
 while+0.009
Neg logit contributions
ags-0.011
 ey-0.010
34-0.009
 yourself-0.009
 yourselves-0.009
Tokenay
Token ID323
Feature activation-1.41e-02

Pos logit contributions
s+0.080
t+0.061
 of+0.060
 they+0.056
 while+0.054
Neg logit contributions
ags-0.114
 ey-0.099
34-0.094
 yourself-0.093
 yourselves-0.091
Token.
Token ID13
Feature activation+5.45e-03

Pos logit contributions
s+0.024
t+0.018
ashed+0.018
 of+0.016
 they+0.016
Neg logit contributions
ags-0.027
34-0.024
 yourself-0.024
 ey-0.023
 yourselves-0.022
Token She
Token ID1375
Feature activation+4.40e-04

Pos logit contributions
ags+0.012
 ey+0.010
34+0.009
 yourself+0.009
 yourselves+0.009
Neg logit contributions
s-0.010
t-0.008
 of-0.007
 they-0.007
 while-0.007
Token remembered
Token ID12086
Feature activation-3.14e-04

Pos logit contributions
ags+0.001
 ey+0.001
 yourself+0.001
ode+0.001
34+0.001
Neg logit contributions
s-0.001
t-0.001
 of-0.001
 together-0.000
 while-0.000
Token her
Token ID607
Feature activation+1.42e-03

Pos logit contributions
s+0.001
t+0.001
ashed+0.001
z+0.001
 of+0.000
Neg logit contributions
 yourself-0.000
ags-0.000
 ey-0.000
 yourselves-0.000
34-0.000
Token to
Token ID284
Feature activation+1.39e-03

Pos logit contributions
ags+0.002
 ey+0.002
 yourself+0.002
 yourselves+0.002
 upper+0.002
Neg logit contributions
s-0.001
t-0.001
 of-0.001
ls-0.001
ashed-0.001
Token blow
Token ID6611
Feature activation-3.75e-03

Pos logit contributions
ags+0.002
 ey+0.002
 yourself+0.002
 yourselves+0.002
ode+0.002
Neg logit contributions
s-0.003
t-0.003
 of-0.002
ls-0.002
ashed-0.002
Token.
Token ID13
Feature activation+4.22e-03

Pos logit contributions
s+0.006
t+0.006
ashed+0.004
ounce+0.004
 they+0.004
Neg logit contributions
ags-0.008
 yourself-0.007
 ey-0.007
 yourselves-0.006
34-0.006
Token The
Token ID383
Feature activation-1.68e-03

Pos logit contributions
ags+0.008
 ey+0.007
 yourself+0.007
34+0.007
 yourselves+0.007
Neg logit contributions
s-0.007
t-0.006
 of-0.006
 they-0.005
 while-0.005
Token ash
Token ID12530
Feature activation-6.93e-03

Pos logit contributions
s+0.004
t+0.004
 of+0.004
 they+0.003
 while+0.003
Neg logit contributions
ags-0.002
 ey-0.002
 yourself-0.001
34-0.001
 yourselves-0.001
Tokentr
Token ID2213
Feature activation-5.09e-02

Pos logit contributions
s+0.015
ounce+0.012
t+0.012
 of+0.012
 they+0.012
Neg logit contributions
ags-0.010
 ey-0.008
 upper-0.007
34-0.007
ode-0.007
Tokenay
Token ID323
Feature activation-7.86e-03

Pos logit contributions
s+0.084
t+0.064
 of+0.061
 they+0.061
 while+0.060
Neg logit contributions
ags-0.110
 ey-0.094
34-0.088
 yourself-0.087
 yourselves-0.086
Token got
Token ID1392
Feature activation-4.88e-03

Pos logit contributions
s+0.010
 batch+0.008
ounce+0.008
t+0.008
 while+0.008
Neg logit contributions
ags-0.014
 yourself-0.013
et-0.012
 ey-0.012
aces-0.012
Token really
Token ID1107
Feature activation+2.51e-03

Pos logit contributions
s+0.010
t+0.008
ounce+0.007
 batch+0.007
 they+0.007
Neg logit contributions
ags-0.008
 ey-0.006
 yourself-0.006
34-0.006
 yourselves-0.006
Token hot
Token ID3024
Feature activation+2.40e-03

Pos logit contributions
ags+0.004
 yourself+0.003
obb+0.003
 ey+0.003
 yourselves+0.003
Neg logit contributions
s-0.005
t-0.004
 of-0.004
ounce-0.004
uddy-0.004
Token!
Token ID0
Feature activation+4.18e-03

Pos logit contributions
ags+0.006
 ey+0.005
 yourself+0.005
34+0.005
 yourselves+0.005
Neg logit contributions
s-0.003
t-0.002
 of-0.002
 they-0.002
 while-0.002
Token very
Token ID845
Feature activation-1.06e-02

Pos logit contributions
ags+0.007
 yourself+0.006
34+0.005
 yourselves+0.005
 upper+0.005
Neg logit contributions
s-0.008
t-0.007
ls-0.006
ounce-0.005
 of-0.005
Token tired
Token ID10032
Feature activation+2.10e-03

Pos logit contributions
s+0.022
t+0.018
 of+0.016
ounce+0.015
z+0.015
Neg logit contributions
ags-0.018
 yourself-0.015
 yourselves-0.014
 ey-0.014
34-0.014
Token.
Token ID13
Feature activation-1.90e-03

Pos logit contributions
ags+0.004
34+0.003
 yourself+0.003
 ey+0.003
 yourselves+0.003
Neg logit contributions
s-0.004
t-0.003
ashed-0.003
z-0.003
rolled-0.002
Token The
Token ID383
Feature activation-2.21e-03

Pos logit contributions
s+0.004
t+0.003
 of+0.003
 they+0.003
 while+0.003
Neg logit contributions
ags-0.004
 ey-0.003
34-0.003
 yourself-0.003
 yourselves-0.003
Token ash
Token ID12530
Feature activation-4.28e-03

Pos logit contributions
s+0.004
t+0.003
 of+0.003
 they+0.003
 while+0.003
Neg logit contributions
ags-0.005
 ey-0.004
34-0.004
 yourself-0.004
 yourselves-0.004
Tokentr
Token ID2213
Feature activation-5.08e-02

Pos logit contributions
s+0.006
ounce+0.005
 of+0.005
 together+0.005
 batch+0.004
Neg logit contributions
ags-0.008
ode-0.007
 upper-0.007
 ey-0.007
-0.007
Tokenay
Token ID323
Feature activation-6.92e-03

Pos logit contributions
s+0.086
 they+0.065
t+0.065
 while+0.064
 of+0.062
Neg logit contributions
ags-0.108
 ey-0.092
34-0.084
 yourself-0.084
 yourselves-0.082
Token watched
Token ID7342
Feature activation+8.91e-03

Pos logit contributions
s+0.010
t+0.008
 batch+0.007
ounce+0.007
 of+0.007
Neg logit contributions
ags-0.013
 yourself-0.011
34-0.011
 ey-0.011
ode-0.011
Token as
Token ID355
Feature activation+1.07e-02

Pos logit contributions
ags+0.018
 yourself+0.015
34+0.014
 yourselves+0.014
 ey+0.014
Neg logit contributions
s-0.016
t-0.013
ashed-0.011
ounce-0.011
ls-0.011
Token the
Token ID262
Feature activation-9.32e-04

Pos logit contributions
ags+0.020
 yourself+0.018
 ey+0.018
34+0.018
 yourselves+0.018
Neg logit contributions
s-0.019
t-0.015
ashed-0.013
 of-0.013
ounce-0.013
Token little
Token ID1310
Feature activation+3.49e-03

Pos logit contributions
s+0.002
t+0.001
 of+0.001
 they+0.001
 while+0.001
Neg logit contributions
ags-0.002
 ey-0.002
34-0.002
 yourself-0.002
 yourselves-0.001
Token on
Token ID319
Feature activation+4.36e-03

Pos logit contributions
ags+0.021
 ey+0.018
 yourself+0.017
34+0.017
 yourselves+0.017
Neg logit contributions
s-0.017
t-0.014
 of-0.013
 they-0.012
 while-0.012
Token blowing
Token ID19280
Feature activation-5.05e-03

Pos logit contributions
ags+0.008
 ey+0.007
 yourself+0.007
 yourselves+0.007
34+0.007
Neg logit contributions
s-0.008
t-0.007
 of-0.006
 they-0.006
 while-0.006
Token until
Token ID1566
Feature activation-2.63e-03

Pos logit contributions
s+0.008
t+0.007
ounce+0.006
ashed+0.006
 of+0.006
Neg logit contributions
ags-0.011
 ey-0.009
 yourself-0.009
 yourselves-0.008
34-0.008
Token the
Token ID262
Feature activation-5.01e-04

Pos logit contributions
s+0.004
t+0.003
 of+0.003
ounce+0.003
ashed+0.003
Neg logit contributions
ags-0.005
 ey-0.005
 yourself-0.005
34-0.004
 yourselves-0.004
Token ash
Token ID12530
Feature activation-4.30e-03

Pos logit contributions
s+0.001
t+0.001
 of+0.001
 they+0.001
 while+0.001
Neg logit contributions
ags-0.001
 ey-0.001
 yourself-0.000
34-0.000
 yourselves-0.000
Tokentr
Token ID2213
Feature activation-5.07e-02

Pos logit contributions
s+0.009
 of+0.007
t+0.007
 they+0.007
 together+0.007
Neg logit contributions
ags-0.007
 ey-0.006
34-0.005
 yourself-0.005
-0.005
Tokenay
Token ID323
Feature activation-9.24e-03

Pos logit contributions
s+0.080
t+0.061
 of+0.059
 they+0.055
 while+0.054
Neg logit contributions
ags-0.114
 ey-0.098
34-0.093
 yourself-0.093
 yourselves-0.091
Token was
Token ID373
Feature activation+1.19e-03

Pos logit contributions
s+0.013
t+0.009
 of+0.009
ls+0.008
 batch+0.008
Neg logit contributions
ags-0.020
 yourself-0.016
34-0.016
-0.016
 ey-0.016
Token so
Token ID523
Feature activation+1.87e-03

Pos logit contributions
ags+0.002
 ey+0.002
 yourself+0.002
34+0.002
 yourselves+0.002
Neg logit contributions
s-0.002
t-0.002
 of-0.002
 they-0.002
 while-0.001
Token dry
Token ID5894
Feature activation-6.29e-03

Pos logit contributions
ags+0.004
 ey+0.003
 yourself+0.003
 yourselves+0.003
34+0.003
Neg logit contributions
s-0.004
t-0.003
 of-0.003
 they-0.003
 while-0.003
Token that
Token ID326
Feature activation-9.07e-03

Pos logit contributions
s+0.010
t+0.008
 of+0.007
ounce+0.006
ashed+0.006
Neg logit contributions
ags-0.013
 yourself-0.011
 ey-0.011
34-0.011
 yourselves-0.011
Token he
Token ID339
Feature activation-8.11e-03

Pos logit contributions
ags+0.001
 yourself+0.001
 yourselves+0.001
 ey+0.001
34+0.001
Neg logit contributions
s-0.001
t-0.001
ashed-0.001
ounce-0.001
z-0.001
Token sp
Token ID599
Feature activation-2.42e-03

Pos logit contributions
s+0.013
t+0.011
 of+0.011
 together+0.010
othe+0.010
Neg logit contributions
ags-0.013
 ey-0.012
 yourself-0.012
 upper-0.012
34-0.012
Tokenied
Token ID798
Feature activation-6.33e-03

Pos logit contributions
s+0.004
t+0.003
 they+0.003
 while+0.003
 of+0.003
Neg logit contributions
ags-0.006
 ey-0.005
34-0.004
 yourself-0.004
 yourselves-0.004
Token an
Token ID281
Feature activation+7.22e-03

Pos logit contributions
s+0.012
t+0.010
 of+0.009
 they+0.008
ashed+0.008
Neg logit contributions
ags-0.012
 yourself-0.010
 ey-0.010
34-0.010
 yourselves-0.010
Token ash
Token ID12530
Feature activation-5.26e-03

Pos logit contributions
ags+0.014
34+0.011
 ey+0.011
 yourself+0.011
 yourselves+0.011
Neg logit contributions
 of-0.012
s-0.011
t-0.009
 even-0.009
 while-0.009
Tokentr
Token ID2213
Feature activation-5.06e-02

Pos logit contributions
s+0.010
t+0.008
 of+0.008
 they+0.007
 while+0.007
Neg logit contributions
ags-0.010
 ey-0.009
34-0.008
 yourself-0.008
 yourselves-0.008
Tokenay
Token ID323
Feature activation-1.38e-02

Pos logit contributions
s+0.078
t+0.060
 of+0.058
 they+0.053
 while+0.051
Neg logit contributions
ags-0.115
 ey-0.101
34-0.095
 yourself-0.095
 yourselves-0.093
Token.
Token ID13
Feature activation-7.89e-04

Pos logit contributions
s+0.022
t+0.016
 of+0.016
 they+0.015
ls+0.015
Neg logit contributions
ags-0.028
34-0.024
 yourself-0.023
-0.023
 ey-0.023
Token He
Token ID679
Feature activation-9.77e-04

Pos logit contributions
s+0.001
t+0.001
 of+0.001
 they+0.001
 while+0.001
Neg logit contributions
ags-0.002
 ey-0.001
34-0.001
 yourself-0.001
 yourselves-0.001
Token picked
Token ID6497
Feature activation+5.18e-03

Pos logit contributions
s+0.002
t+0.002
less+0.001
 of+0.001
seeing+0.001
Neg logit contributions
 ey-0.001
ags-0.001
 upper-0.001
ode-0.001
 yourself-0.001
Token up
Token ID510
Feature activation-4.92e-03

Pos logit contributions
ags+0.012
 yourself+0.011
 ey+0.010
 yourselves+0.010
34+0.010
Neg logit contributions
s-0.007
t-0.006
 of-0.005
ounce-0.005
 they-0.005
Token,
Token ID11
Feature activation+6.20e-03

Pos logit contributions
ags+0.030
 ey+0.026
34+0.025
 yourself+0.025
 yourselves+0.025
Neg logit contributions
s-0.013
t-0.009
 of-0.008
 they-0.007
 together-0.007
Token he
Token ID339
Feature activation-4.59e-04

Pos logit contributions
ags+0.014
 ey+0.012
34+0.012
 yourself+0.012
 yourselves+0.011
Neg logit contributions
s-0.010
t-0.008
 of-0.007
 they-0.007
 while-0.007
Token saw
Token ID2497
Feature activation+4.53e-03

Pos logit contributions
s+0.001
t+0.001
othe+0.001
less+0.001
apy+0.001
Neg logit contributions
P-0.001
 yourself-0.001
ags-0.001
 ey-0.001
-0.001
Token an
Token ID281
Feature activation+7.86e-03

Pos logit contributions
ags+0.009
 ey+0.008
 yourself+0.008
 yourselves+0.008
34+0.008
Neg logit contributions
s-0.008
t-0.006
 of-0.006
 while-0.005
 they-0.005
Token ash
Token ID12530
Feature activation-4.55e-03

Pos logit contributions
ags+0.016
 ey+0.013
 yourself+0.013
 yourselves+0.013
34+0.012
Neg logit contributions
s-0.012
 of-0.012
 while-0.010
t-0.010
 they-0.009
Tokentr
Token ID2213
Feature activation-5.04e-02

Pos logit contributions
s+0.008
t+0.007
 of+0.007
 they+0.006
 while+0.006
Neg logit contributions
ags-0.009
 ey-0.008
 yourself-0.007
34-0.007
 yourselves-0.007
Tokenay
Token ID323
Feature activation-1.53e-02

Pos logit contributions
s+0.079
t+0.060
 of+0.059
 they+0.055
 while+0.053
Neg logit contributions
ags-0.114
 ey-0.099
34-0.093
 yourself-0.093
 yourselves-0.091
Token on
Token ID319
Feature activation-1.26e-02

Pos logit contributions
s+0.023
t+0.017
 of+0.016
 they+0.015
 while+0.014
Neg logit contributions
ags-0.034
 ey-0.029
34-0.029
 yourself-0.029
 yourselves-0.028
Token the
Token ID262
Feature activation+4.47e-03

Pos logit contributions
s+0.016
 of+0.013
 they+0.012
 while+0.011
t+0.011
Neg logit contributions
ags-0.032
 ey-0.028
34-0.027
ode-0.026
-0.026
Token table
Token ID3084
Feature activation-7.62e-03

Pos logit contributions
ags+0.008
 ey+0.007
34+0.007
 Deal+0.007
 yourself+0.006
Neg logit contributions
s-0.008
 of-0.007
t-0.007
 while-0.007
 they-0.007
Token.
Token ID13
Feature activation+5.67e-03

Pos logit contributions
s+0.013
t+0.010
 they+0.009
ls+0.009
ounce+0.009
Neg logit contributions
ags-0.016
 ey-0.013
 yourself-0.013
 yourselves-0.013
34-0.013
Token.
Token ID13
Feature activation-3.57e-03

Pos logit contributions
s+0.005
 of+0.004
t+0.003
 they+0.003
 while+0.003
Neg logit contributions
ags-0.012
 ey-0.011
 yourself-0.010
 yourselves-0.010
34-0.010
Token She
Token ID1375
Feature activation+4.13e-03

Pos logit contributions
s+0.005
 of+0.004
 they+0.004
t+0.004
 while+0.003
Neg logit contributions
ags-0.009
 ey-0.007
 yourself-0.007
ode-0.007
 yourselves-0.007
Token put
Token ID1234
Feature activation-1.40e-02

Pos logit contributions
ags+0.010
 ey+0.009
34+0.008
 yourself+0.008
+0.008
Neg logit contributions
s-0.005
t-0.004
 of-0.004
 while-0.003
 they-0.003
Token the
Token ID262
Feature activation-2.17e-03

Pos logit contributions
s+0.024
t+0.019
 of+0.017
 while+0.016
 they+0.015
Neg logit contributions
ags-0.032
 ey-0.025
 yourself-0.025
 yourselves-0.024
34-0.023
Token ash
Token ID12530
Feature activation-7.08e-03

Pos logit contributions
s+0.005
t+0.004
 of+0.004
 they+0.004
 together+0.003
Neg logit contributions
ags-0.004
 ey-0.003
 yourself-0.003
 yourselves-0.003
34-0.003
Tokentr
Token ID2213
Feature activation-5.03e-02

Pos logit contributions
s+0.012
t+0.010
 of+0.009
 they+0.009
 while+0.008
Neg logit contributions
ags-0.015
 ey-0.013
 yourself-0.012
34-0.012
 yourselves-0.012
Tokenay
Token ID323
Feature activation-7.68e-03

Pos logit contributions
s+0.080
t+0.061
 of+0.059
 they+0.057
 while+0.056
Neg logit contributions
ags-0.113
 ey-0.097
34-0.091
 yourself-0.090
 yourselves-0.088
Token away
Token ID1497
Feature activation+1.07e-03

Pos logit contributions
s+0.013
t+0.010
 batch+0.010
ounce+0.009
 of+0.009
Neg logit contributions
ags-0.016
 yourself-0.013
 yourselves-0.012
34-0.012
 ey-0.011
Token.
Token ID13
Feature activation+2.96e-03

Pos logit contributions
ags+0.002
 yourself+0.002
 ey+0.002
34+0.002
 yourselves+0.002
Neg logit contributions
s-0.002
t-0.001
 of-0.001
ounce-0.001
ashed-0.001
Token She
Token ID1375
Feature activation+3.76e-03

Pos logit contributions
ags+0.007
 ey+0.006
34+0.006
ode+0.006
 yourself+0.006
Neg logit contributions
s-0.005
 of-0.004
 they-0.003
t-0.003
 while-0.003
Token said
Token ID531
Feature activation+1.85e-04

Pos logit contributions
ags+0.007
 ey+0.006
34+0.006
 yourself+0.006
+0.006
Neg logit contributions
s-0.006
t-0.006
 of-0.005
 while-0.004
ounce-0.004
Token no
Token ID645
Feature activation+2.04e-03

Pos logit contributions
s+0.011
t+0.009
ounce+0.008
 of+0.008
ashed+0.008
Neg logit contributions
ags-0.009
 yourself-0.008
34-0.008
 ey-0.008
 yourselves-0.008
Token,
Token ID11
Feature activation-8.90e-03

Pos logit contributions
ags+0.004
 ey+0.004
 yourself+0.004
34+0.004
 yourselves+0.004
Neg logit contributions
s-0.003
t-0.002
 of-0.002
ashed-0.002
 they-0.002
Token because
Token ID780
Feature activation-1.88e-02

Pos logit contributions
s+0.015
t+0.011
 of+0.010
ashed+0.010
 together+0.010
Neg logit contributions
ags-0.018
 ey-0.016
 yourself-0.016
34-0.016
 yourselves-0.015
Token the
Token ID262
Feature activation-6.67e-03

Pos logit contributions
s+0.034
ashed+0.027
t+0.027
ounce+0.026
oped+0.025
Neg logit contributions
 yourself-0.034
 yourselves-0.033
ags-0.031
 ey-0.030
34-0.029
Token ash
Token ID12530
Feature activation-5.99e-03

Pos logit contributions
s+0.018
t+0.016
 of+0.015
 they+0.015
 together+0.014
Neg logit contributions
ags-0.007
 ey-0.006
 yourself-0.005
 yourselves-0.005
34-0.005
Tokentr
Token ID2213
Feature activation-5.00e-02

Pos logit contributions
s+0.011
 of+0.009
t+0.009
ounce+0.009
 while+0.009
Neg logit contributions
ags-0.010
 ey-0.008
34-0.008
 yourself-0.008
ode-0.007
Tokenay
Token ID323
Feature activation-1.08e-02

Pos logit contributions
s+0.082
t+0.063
 of+0.062
 while+0.062
 they+0.061
Neg logit contributions
ags-0.105
 ey-0.089
34-0.085
 yourself-0.083
 yourselves-0.082
Token was
Token ID373
Feature activation-1.23e-04

Pos logit contributions
s+0.014
ounce+0.012
ashed+0.012
t+0.011
 while+0.010
Neg logit contributions
ags-0.020
 yourself-0.019
 yourselves-0.018
 ey-0.017
34-0.017
Token not
Token ID407
Feature activation-1.51e-03

Pos logit contributions
s+0.000
t+0.000
 of+0.000
ounce+0.000
ashed+0.000
Neg logit contributions
ags-0.000
 yourself-0.000
 ey-0.000
34-0.000
 yourselves-0.000
Token a
Token ID257
Feature activation-1.32e-02

Pos logit contributions
s+0.002
t+0.002
 of+0.002
 they+0.002
 while+0.002
Neg logit contributions
ags-0.003
 ey-0.003
34-0.003
 yourself-0.003
 yourselves-0.003
Token toy
Token ID13373
Feature activation-1.39e-02

Pos logit contributions
s+0.026
t+0.021
 of+0.020
 they+0.019
ashed+0.018
Neg logit contributions
ags-0.022
 ey-0.020
 yourself-0.019
34-0.019
 yourselves-0.018
Token Sara
Token ID24799
Feature activation+6.45e-03

Pos logit contributions
ags+0.005
 ey+0.004
34+0.004
 yourself+0.004
 yourselves+0.004
Neg logit contributions
s-0.004
t-0.003
 of-0.003
 they-0.003
 while-0.003
Token found
Token ID1043
Feature activation-6.09e-04

Pos logit contributions
ags+0.011
 ey+0.010
yah+0.009
per+0.009
34+0.009
Neg logit contributions
s-0.010
t-0.010
 of-0.008
othe-0.008
apy-0.008
Token an
Token ID281
Feature activation+1.06e-02

Pos logit contributions
s+0.001
t+0.001
ounce+0.001
 while+0.001
z+0.001
Neg logit contributions
ags-0.001
 ey-0.001
 yourself-0.001
 yourselves-0.001
34-0.001
Token empty
Token ID6565
Feature activation+1.10e-02

Pos logit contributions
ags+0.018
 ey+0.015
34+0.014
 yourself+0.014
 yourselves+0.013
Neg logit contributions
s-0.023
t-0.019
 of-0.018
 they-0.018
 while-0.017
Token ash
Token ID12530
Feature activation-4.57e-03

Pos logit contributions
ags+0.014
 ey+0.014
 yourself+0.011
 yourselves+0.011
34+0.009
Neg logit contributions
s-0.027
t-0.023
 they-0.020
 of-0.020
ashed-0.020
Tokentr
Token ID2213
Feature activation-5.00e-02

Pos logit contributions
s+0.009
t+0.007
 of+0.007
 they+0.007
 while+0.006
Neg logit contributions
ags-0.008
 ey-0.007
34-0.007
 yourself-0.007
 yourselves-0.006
Tokenay
Token ID323
Feature activation-1.08e-02

Pos logit contributions
s+0.082
t+0.063
 of+0.061
 they+0.060
 while+0.059
Neg logit contributions
ags-0.107
 ey-0.091
34-0.087
 yourself-0.085
 yourselves-0.083
Token on
Token ID319
Feature activation-1.04e-02

Pos logit contributions
s+0.018
t+0.014
ashed+0.013
ounce+0.013
apy+0.013
Neg logit contributions
ags-0.021
34-0.018
 yourself-0.017
 yourselves-0.016
-0.015
Token the
Token ID262
Feature activation+4.11e-03

Pos logit contributions
s+0.019
t+0.016
 of+0.015
ashed+0.014
 together+0.013
Neg logit contributions
ags-0.020
 ey-0.017
 yourself-0.017
 yourselves-0.017
34-0.016
Token table
Token ID3084
Feature activation-7.61e-03

Pos logit contributions
ags+0.007
 ey+0.006
34+0.006
 yourself+0.006
 yourselves+0.005
Neg logit contributions
s-0.008
t-0.007
 of-0.007
 they-0.006
 while-0.006
Token.
Token ID13
Feature activation+9.22e-03

Pos logit contributions
s+0.014
t+0.011
ounce+0.010
apy+0.010
ls+0.010
Neg logit contributions
ags-0.014
 yourself-0.012
 ey-0.012
 yourselves-0.012
34-0.012
Token.
Token ID13
Feature activation-4.99e-03

Pos logit contributions
ags+0.019
 yourself+0.016
34+0.016
 ey+0.015
 yourselves+0.014
Neg logit contributions
s-0.024
t-0.021
ashed-0.018
apy-0.018
ounce-0.018
Token He
Token ID679
Feature activation+4.85e-03

Pos logit contributions
s+0.008
 of+0.007
t+0.006
 while+0.006
 they+0.006
Neg logit contributions
ags-0.011
 ey-0.009
34-0.009
 yourselves-0.009
 yourself-0.009
Token had
Token ID550
Feature activation-4.00e-03

Pos logit contributions
ags+0.011
 ey+0.010
 yourself+0.009
34+0.009
+0.009
Neg logit contributions
s-0.007
t-0.006
 of-0.005
 they-0.005
 while-0.005
Token an
Token ID281
Feature activation+8.05e-03

Pos logit contributions
s+0.008
t+0.006
ounce+0.005
 of+0.005
 they+0.005
Neg logit contributions
ags-0.008
 yourself-0.007
 ey-0.007
 yourselves-0.006
34-0.006
Token ash
Token ID12530
Feature activation-4.63e-03

Pos logit contributions
ags+0.016
 ey+0.014
 yourselves+0.014
 yourself+0.013
34+0.013
Neg logit contributions
s-0.013
 of-0.012
t-0.010
 while-0.010
 they-0.009
Tokentr
Token ID2213
Feature activation-4.99e-02

Pos logit contributions
s+0.009
t+0.007
 of+0.007
 they+0.007
 together+0.007
Neg logit contributions
ags-0.008
 ey-0.007
34-0.007
 yourself-0.007
 yourselves-0.006
Tokenay
Token ID323
Feature activation-8.91e-03

Pos logit contributions
s+0.086
 they+0.067
 while+0.065
t+0.064
 of+0.060
Neg logit contributions
ags-0.104
 ey-0.088
34-0.082
 yourself-0.080
 yourselves-0.078
Token that
Token ID326
Feature activation-3.79e-03

Pos logit contributions
s+0.013
t+0.010
 they+0.009
ashed+0.009
ounce+0.008
Neg logit contributions
ags-0.020
 yourself-0.017
34-0.017
 ey-0.017
 yourselves-0.016
Token could
Token ID714
Feature activation+3.73e-03

Pos logit contributions
s+0.006
t+0.004
ounce+0.004
 of+0.004
ashed+0.004
Neg logit contributions
ags-0.008
 ey-0.007
 yourself-0.007
34-0.007
 yourselves-0.007
Token zip
Token ID19974
Feature activation-2.84e-03

Pos logit contributions
ags+0.006
 yourself+0.005
 ey+0.005
 yourselves+0.004
 upper+0.004
Neg logit contributions
s-0.008
ashed-0.007
ounce-0.006
 together-0.006
t-0.006
Token around
Token ID1088
Feature activation-1.10e-02

Pos logit contributions
s+0.005
t+0.004
 batch+0.004
ashed+0.004
 mun+0.003
Neg logit contributions
ags-0.005
 yourself-0.005
34-0.004
 ey-0.004
 yourselves-0.004
Token.
Token ID13
Feature activation+1.93e-03

Pos logit contributions
ags+0.004
 ey+0.003
 yourself+0.003
34+0.003
ode+0.003
Neg logit contributions
s-0.003
t-0.002
ounce-0.002
apy-0.002
ls-0.002
Token
Token ID198
Feature activation-5.33e-03

Pos logit contributions
ags+0.004
 ey+0.003
34+0.003
 yourself+0.003
 yourselves+0.003
Neg logit contributions
s-0.004
t-0.003
 of-0.003
 they-0.003
 while-0.003
Token
Token ID198
Feature activation+6.32e-03

Pos logit contributions
s+0.009
t+0.007
 of+0.007
 they+0.006
ashed+0.006
Neg logit contributions
ags-0.011
 ey-0.009
 yourself-0.009
34-0.009
 yourselves-0.009
TokenThe
Token ID464
Feature activation+2.91e-04

Pos logit contributions
ags+0.011
 ey+0.010
 yourself+0.009
34+0.009
 yourselves+0.009
Neg logit contributions
s-0.013
t-0.011
 of-0.010
 they-0.010
 while-0.009
Token ash
Token ID12530
Feature activation-3.77e-03

Pos logit contributions
ags+0.001
 ey+0.001
 yourself+0.001
34+0.001
 yourselves+0.001
Neg logit contributions
s-0.000
 of-0.000
t-0.000
 they-0.000
 while-0.000
Tokentr
Token ID2213
Feature activation-4.98e-02

Pos logit contributions
s+0.008
 of+0.006
t+0.006
 together+0.006
 they+0.006
Neg logit contributions
ags-0.005
 ey-0.005
ode-0.005
 upper-0.004
34-0.004
Tokenay
Token ID323
Feature activation-6.79e-03

Pos logit contributions
s+0.084
t+0.063
 they+0.062
 while+0.062
 of+0.060
Neg logit contributions
ags-0.107
 ey-0.091
34-0.085
 yourself-0.084
 yourselves-0.082
Token was
Token ID373
Feature activation+2.08e-03

Pos logit contributions
s+0.010
t+0.008
 of+0.007
 batch+0.007
 while+0.007
Neg logit contributions
ags-0.012
34-0.012
 ey-0.011
ode-0.011
 yourself-0.011
Token ready
Token ID3492
Feature activation+4.45e-04

Pos logit contributions
ags+0.004
 ey+0.003
 yourself+0.003
34+0.003
 yourselves+0.003
Neg logit contributions
s-0.004
t-0.003
ls-0.003
 they-0.003
ounce-0.003
Token to
Token ID284
Feature activation+5.27e-03

Pos logit contributions
ags+0.001
 yourself+0.001
34+0.001
 ey+0.001
 yourselves+0.001
Neg logit contributions
s-0.001
t-0.000
ashed-0.000
ls-0.000
ounce-0.000
Token provide
Token ID2148
Feature activation+4.13e-03

Pos logit contributions
ags+0.010
 ey+0.009
 yourself+0.009
 yourselves+0.008
34+0.008
Neg logit contributions
s-0.010
t-0.008
 of-0.007
ashed-0.007
 they-0.007
Token,
Token ID11
Feature activation-1.60e-03

Pos logit contributions
ags+0.012
 ey+0.010
 yourself+0.010
34+0.010
 yourselves+0.010
Neg logit contributions
s-0.007
t-0.006
 of-0.005
ounce-0.005
z-0.004
Token Lily
Token ID20037
Feature activation-1.12e-03

Pos logit contributions
s+0.003
t+0.002
 of+0.002
ounce+0.002
z+0.002
Neg logit contributions
ags-0.003
 ey-0.002
 yourself-0.002
 yourselves-0.002
34-0.002
Token found
Token ID1043
Feature activation-2.35e-03

Pos logit contributions
s+0.002
t+0.002
apy+0.001
othe+0.001
 of+0.001
Neg logit contributions
ags-0.002
 ey-0.002
 yourself-0.002
 yourselves-0.002
-0.002
Token an
Token ID281
Feature activation+7.93e-03

Pos logit contributions
s+0.004
t+0.003
 of+0.003
 they+0.003
 while+0.002
Neg logit contributions
ags-0.005
 ey-0.005
 yourself-0.004
34-0.004
 yourselves-0.004
Token ash
Token ID12530
Feature activation-4.36e-03

Pos logit contributions
ags+0.015
 ey+0.013
 yourself+0.012
34+0.012
 yourselves+0.012
Neg logit contributions
s-0.014
 of-0.012
t-0.012
 while-0.011
 they-0.010
Tokentr
Token ID2213
Feature activation-4.98e-02

Pos logit contributions
s+0.008
t+0.007
 of+0.006
 they+0.006
 while+0.006
Neg logit contributions
ags-0.008
 ey-0.007
34-0.007
 yourself-0.007
 yourselves-0.006
Tokenay
Token ID323
Feature activation-1.30e-02

Pos logit contributions
s+0.079
t+0.060
 of+0.058
 they+0.055
 while+0.054
Neg logit contributions
ags-0.112
 ey-0.097
34-0.091
 yourself-0.091
 yourselves-0.089
Token in
Token ID287
Feature activation-1.15e-03

Pos logit contributions
s+0.021
t+0.015
 of+0.014
 they+0.013
ashed+0.013
Neg logit contributions
ags-0.028
 yourself-0.024
34-0.023
 ey-0.023
 yourselves-0.022
Token the
Token ID262
Feature activation+1.34e-03

Pos logit contributions
s+0.002
 of+0.001
t+0.001
 they+0.001
 while+0.001
Neg logit contributions
ags-0.003
 ey-0.002
34-0.002
-0.002
 yourself-0.002
Token world
Token ID995
Feature activation+1.72e-02

Pos logit contributions
ags+0.003
34+0.003
 Deal+0.002
 ey+0.002
avi+0.002
Neg logit contributions
s-0.002
 of-0.002
 lots-0.002
 to-0.002
 while-0.001
Token behind
Token ID2157
Feature activation-1.01e-03

Pos logit contributions
ags+0.038
 ey+0.033
 yourself+0.032
34+0.031
 yourselves+0.031
Neg logit contributions
s-0.026
t-0.020
 of-0.018
 they-0.018
 while-0.017
Token notice
Token ID4003
Feature activation-8.62e-03

Pos logit contributions
s+0.007
t+0.005
 batch+0.005
ls+0.005
 together+0.005
Neg logit contributions
ags-0.006
 yourself-0.006
 ey-0.005
34-0.005
-0.005
Token how
Token ID703
Feature activation-8.59e-03

Pos logit contributions
s+0.016
t+0.013
ashed+0.012
ounce+0.011
z+0.011
Neg logit contributions
ags-0.015
 yourself-0.015
 yourselves-0.014
 ey-0.014
34-0.013
Token slippery
Token ID32911
Feature activation-8.98e-03

Pos logit contributions
s+0.015
t+0.012
ashed+0.011
 while+0.011
 batch+0.011
Neg logit contributions
ags-0.014
 yourself-0.014
 ey-0.013
 yourselves-0.013
34-0.012
Token the
Token ID262
Feature activation-4.97e-03

Pos logit contributions
s+0.013
t+0.010
 of+0.009
ashed+0.008
ounce+0.008
Neg logit contributions
ags-0.020
 yourself-0.018
 ey-0.018
 yourselves-0.018
34-0.017
Token ash
Token ID12530
Feature activation-4.33e-03

Pos logit contributions
s+0.012
t+0.010
 together+0.009
 they+0.009
ashed+0.009
Neg logit contributions
ags-0.007
 ey-0.007
 yourself-0.006
34-0.006
ode-0.005
Tokentr
Token ID2213
Feature activation-4.98e-02

Pos logit contributions
s+0.008
t+0.006
 of+0.006
ounce+0.005
ashed+0.005
Neg logit contributions
ags-0.008
 ey-0.007
34-0.006
 yourself-0.006
-0.006
Tokenay
Token ID323
Feature activation-7.71e-03

Pos logit contributions
s+0.082
t+0.063
 of+0.061
 they+0.061
 while+0.060
Neg logit contributions
ags-0.105
 ey-0.090
34-0.086
 yourself-0.084
 yourselves-0.082
Token was
Token ID373
Feature activation-1.99e-03

Pos logit contributions
s+0.009
ashed+0.006
ounce+0.006
t+0.006
 of+0.006
Neg logit contributions
ags-0.017
 yourself-0.016
-0.015
 ey-0.015
34-0.015
Token.
Token ID13
Feature activation+2.75e-04

Pos logit contributions
s+0.004
t+0.003
ashed+0.003
 of+0.002
ounce+0.002
Neg logit contributions
ags-0.004
 yourself-0.003
 ey-0.003
 yourselves-0.003
34-0.003
Token In
Token ID554
Feature activation+1.15e-02

Pos logit contributions
ags+0.000
 ey+0.000
 yourself+0.000
34+0.000
 yourselves+0.000
Neg logit contributions
s-0.001
t-0.000
 of-0.000
 they-0.000
 while-0.000
Token no
Token ID645
Feature activation-6.39e-03

Pos logit contributions
ags+0.021
 ey+0.018
 yourself+0.017
 yourselves+0.017
34+0.016
Neg logit contributions
s-0.023
t-0.019
ashed-0.017
 of-0.017
ounce-0.016
Token and
Token ID290
Feature activation-7.92e-04

Pos logit contributions
t+0.003
s+0.003
 of+0.003
 they+0.002
z+0.002
Neg logit contributions
ags-0.005
 ey-0.004
34-0.004
 yourself-0.004
 yourselves-0.003
Token saw
Token ID2497
Feature activation-2.33e-04

Pos logit contributions
s+0.001
 of+0.001
t+0.001
 they+0.001
 together+0.000
Neg logit contributions
ags-0.002
 ey-0.002
34-0.002
 yourself-0.002
-0.002
Token a
Token ID257
Feature activation-4.81e-04

Pos logit contributions
s+0.000
t+0.000
ashed+0.000
ounce+0.000
z+0.000
Neg logit contributions
ags-0.000
 ey-0.000
34-0.000
 yourselves-0.000
 yourself-0.000
Token big
Token ID1263
Feature activation-2.92e-03

Pos logit contributions
s+0.001
t+0.001
 of+0.001
ashed+0.001
 together+0.001
Neg logit contributions
 ey-0.001
ags-0.001
34-0.001
 yourself-0.001
 yourselves-0.001
Token t
Token ID256
Feature activation-1.44e-02

Pos logit contributions
s+0.006
t+0.005
 of+0.004
 they+0.004
 together+0.004
Neg logit contributions
ags-0.005
 ey-0.005
34-0.004
 yourself-0.004
 yourselves-0.004
Tokeneddy
Token ID21874
Feature activation-4.98e-02

Pos logit contributions
s+0.023
t+0.018
 they+0.017
 of+0.017
 while+0.016
Neg logit contributions
ags-0.030
 ey-0.027
34-0.026
 yourself-0.025
 yourselves-0.025
Token bear
Token ID6842
Feature activation-1.16e-02

Pos logit contributions
s+0.093
t+0.075
 of+0.072
 they+0.068
 while+0.066
Neg logit contributions
ags-0.098
 ey-0.084
34-0.078
 yourself-0.078
 yourselves-0.076
Token.
Token ID13
Feature activation+2.37e-03

Pos logit contributions
s+0.020
t+0.018
ashed+0.016
by+0.014
 of+0.014
Neg logit contributions
ags-0.020
 ey-0.018
34-0.018
 yourself-0.018
ode-0.017
Token It
Token ID632
Feature activation-7.32e-03

Pos logit contributions
ags+0.006
ode+0.005
 ey+0.005
 yourself+0.005
34+0.005
Neg logit contributions
s-0.004
 of-0.003
 they-0.003
t-0.002
 while-0.002
Token was
Token ID373
Feature activation+2.25e-03

Pos logit contributions
s+0.009
t+0.008
 of+0.007
ashed+0.007
ounce+0.007
Neg logit contributions
 ey-0.014
ags-0.014
-0.013
34-0.013
 yourself-0.013
Token brown
Token ID7586
Feature activation-8.60e-03

Pos logit contributions
ags+0.004
 ey+0.004
 yourself+0.004
34+0.004
 yourselves+0.003
Neg logit contributions
s-0.004
t-0.004
 of-0.003
 they-0.003
 while-0.003
Token From
Token ID3574
Feature activation+8.40e-03

Pos logit contributions
ags+0.006
 ey+0.005
 yourself+0.004
34+0.004
 yourselves+0.004
Neg logit contributions
s-0.007
t-0.006
 of-0.006
 they-0.005
 while-0.005
Token that
Token ID326
Feature activation-8.01e-03

Pos logit contributions
ags+0.018
 yourself+0.015
 ey+0.015
 yourselves+0.015
34+0.014
Neg logit contributions
s-0.014
t-0.012
 of-0.010
ashed-0.009
 while-0.009
Token day
Token ID1110
Feature activation+1.31e-02

Pos logit contributions
s+0.013
t+0.010
ashed+0.009
ounce+0.009
 of+0.009
Neg logit contributions
ags-0.016
 ey-0.014
 yourself-0.014
-0.013
34-0.013
Token on
Token ID319
Feature activation+1.15e-02

Pos logit contributions
ags+0.034
 ey+0.031
 yourselves+0.028
 yourself+0.028
34+0.027
Neg logit contributions
s-0.018
 of-0.013
t-0.013
 they-0.012
 together-0.011
Token,
Token ID11
Feature activation-2.62e-03

Pos logit contributions
ags+0.026
 ey+0.023
 yourself+0.023
 yourselves+0.022
34+0.022
Neg logit contributions
s-0.016
t-0.013
ashed-0.011
 of-0.011
ounce-0.010
Token Joey
Token ID26154
Feature activation+2.19e-02

Pos logit contributions
s+0.004
t+0.003
ashed+0.003
 batch+0.003
ounce+0.003
Neg logit contributions
ags-0.005
 yourself-0.005
 ey-0.005
 yourselves-0.004
-0.004
Token was
Token ID373
Feature activation+3.28e-03

Pos logit contributions
ags+0.043
 yourself+0.040
 ey+0.038
ode+0.037
+0.036
Neg logit contributions
s-0.032
t-0.027
 of-0.024
 they-0.023
 while-0.023
Token always
Token ID1464
Feature activation+2.12e-03

Pos logit contributions
ags+0.006
 yourself+0.005
34+0.005
 ey+0.005
+0.005
Neg logit contributions
s-0.007
t-0.005
ls-0.005
 of-0.005
 they-0.004
Token excited
Token ID6568
Feature activation-1.40e-03

Pos logit contributions
ags+0.004
 ey+0.004
 yourself+0.004
34+0.003
 yourselves+0.003
Neg logit contributions
s-0.004
t-0.003
 of-0.003
 they-0.003
 while-0.003
Token to
Token ID284
Feature activation+2.43e-03

Pos logit contributions
s+0.002
apy+0.002
t+0.002
oped+0.001
ashed+0.001
Neg logit contributions
ags-0.003
 yourself-0.003
 ey-0.003
 ways-0.002
 yourselves-0.002
Token poke
Token ID22620
Feature activation+9.85e-03

Pos logit contributions
ags+0.006
 ey+0.005
34+0.005
 yourselves+0.005
+0.005
Neg logit contributions
s-0.003
 of-0.003
t-0.003
 they-0.002
 while-0.002
Token.
Token ID13
Feature activation+3.00e-03

Pos logit contributions
ags+0.007
 yourself+0.006
 yourselves+0.005
34+0.005
ode+0.005
Neg logit contributions
s-0.007
 batch-0.006
ashed-0.006
t-0.006
apy-0.006
Token You
Token ID921
Feature activation-1.48e-02

Pos logit contributions
ags+0.006
 ey+0.005
 yourself+0.005
34+0.005
 yourselves+0.005
Neg logit contributions
s-0.005
t-0.004
 of-0.004
 they-0.004
 while-0.004
Token are
Token ID389
Feature activation+5.73e-03

Pos logit contributions
s+0.028
ounce+0.024
 batch+0.024
z+0.023
ashed+0.023
Neg logit contributions
ags-0.022
 yourself-0.020
ode-0.018
34-0.018
 ey-0.017
Token my
Token ID616
Feature activation+2.28e-03

Pos logit contributions
ags+0.012
 ey+0.010
34+0.010
 yourself+0.010
 yourselves+0.009
Neg logit contributions
s-0.010
t-0.008
 of-0.008
 they-0.007
 while-0.007
Token sweet
Token ID6029
Feature activation+8.81e-03

Pos logit contributions
ags+0.004
 ey+0.004
34+0.004
 yourself+0.003
 yourselves+0.003
Neg logit contributions
s-0.004
t-0.004
 of-0.003
ashed-0.003
 they-0.003
Token children
Token ID1751
Feature activation+2.83e-02

Pos logit contributions
ags+0.015
 yourself+0.013
 ey+0.013
34+0.012
 yourselves+0.012
Neg logit contributions
s-0.018
t-0.016
rolled-0.014
apy-0.014
 they-0.013
Token and
Token ID290
Feature activation+1.24e-03

Pos logit contributions
ags+0.049
 yourself+0.045
 yourselves+0.040
34+0.040
ode+0.035
Neg logit contributions
s-0.050
t-0.043
ashed-0.041
ls-0.040
ounce-0.040
Token I
Token ID314
Feature activation-6.38e-04

Pos logit contributions
ags+0.002
 yourself+0.002
 ey+0.002
 yourselves+0.002
34+0.001
Neg logit contributions
s-0.003
t-0.002
ounce-0.002
 of-0.002
z-0.002
Token love
Token ID1842
Feature activation-7.70e-03

Pos logit contributions
s+0.001
ls+0.001
 batch+0.001
seeing+0.001
ounce+0.001
Neg logit contributions
ags-0.001
 yourself-0.001
-0.001
 bluff-0.001
Ju-0.001
Token you
Token ID345
Feature activation+3.71e-03

Pos logit contributions
s+0.018
t+0.016
ashed+0.016
ounce+0.015
oped+0.014
Neg logit contributions
ags-0.011
 yourself-0.010
 yourselves-0.009
 ey-0.008
34-0.008
Token."
Token ID526
Feature activation-7.61e-03

Pos logit contributions
ags+0.007
 yourself+0.006
34+0.006
 yourselves+0.006
+0.005
Neg logit contributions
s-0.006
t-0.005
ashed-0.005
ounce-0.005
 of-0.005
Token on
Token ID319
Feature activation-2.07e-03

Pos logit contributions
s+0.022
t+0.020
 of+0.016
ounce+0.016
ashed+0.015
Neg logit contributions
ags-0.027
 ey-0.021
34-0.021
 yourself-0.020
 yourselves-0.019
Token the
Token ID262
Feature activation+1.13e-03

Pos logit contributions
s+0.004
t+0.003
 of+0.003
 they+0.002
 while+0.002
Neg logit contributions
ags-0.004
 ey-0.004
 yourself-0.004
34-0.004
 yourselves-0.004
Token ground
Token ID2323
Feature activation+1.45e-02

Pos logit contributions
ags+0.002
 ey+0.002
34+0.002
 yourself+0.002
 yourselves+0.002
Neg logit contributions
s-0.002
t-0.002
 of-0.002
 they-0.002
 while-0.002
Token and
Token ID290
Feature activation-3.26e-03

Pos logit contributions
ags+0.026
34+0.021
 yourself+0.020
 ey+0.020
per+0.020
Neg logit contributions
s-0.024
t-0.022
apy-0.020
rolled-0.019
ounce-0.019
Token held
Token ID2714
Feature activation+7.32e-03

Pos logit contributions
s+0.005
t+0.005
 of+0.004
ounce+0.004
z+0.004
Neg logit contributions
ags-0.006
 ey-0.006
 yourself-0.005
 yourselves-0.005
34-0.005
Token hands
Token ID2832
Feature activation+2.11e-02

Pos logit contributions
ags+0.017
 yourself+0.013
34+0.013
 ey+0.013
 yourselves+0.013
Neg logit contributions
s-0.011
t-0.010
ounce-0.008
 of-0.008
 while-0.007
Token.
Token ID13
Feature activation+3.51e-03

Pos logit contributions
ags+0.029
34+0.023
 yourself+0.021
per+0.021
 ear+0.020
Neg logit contributions
t-0.035
 batch-0.034
s-0.033
ashed-0.032
apy-0.032
Token They
Token ID1119
Feature activation-3.20e-03

Pos logit contributions
ags+0.008
 ey+0.007
34+0.007
 yourself+0.007
 yourselves+0.006
Neg logit contributions
s-0.006
t-0.004
 of-0.004
 they-0.004
 while-0.004
Token said
Token ID531
Feature activation+5.89e-03

Pos logit contributions
s+0.005
t+0.005
 of+0.004
ounce+0.004
seeing+0.004
Neg logit contributions
ags-0.006
 ey-0.005
34-0.005
 upper-0.005
ode-0.005
Token,
Token ID11
Feature activation+1.03e-03

Pos logit contributions
ags+0.011
34+0.009
 yourself+0.009
 ey+0.009
 yourselves+0.009
Neg logit contributions
s-0.010
t-0.009
ounce-0.008
z-0.007
 of-0.007
Token "
Token ID366
Feature activation+1.80e-03

Pos logit contributions
ags+0.002
 ey+0.002
 yourself+0.002
 yourselves+0.002
+0.002
Neg logit contributions
s-0.002
 they-0.001
t-0.001
 of-0.001
 while-0.001
Token his
Token ID465
Feature activation+8.93e-03

Pos logit contributions
s+0.014
t+0.013
z+0.011
ls+0.011
ashed+0.011
Neg logit contributions
ags-0.012
 ey-0.011
 yourself-0.011
 ear-0.010
 yourselves-0.010
Token sleepy
Token ID42023
Feature activation+5.96e-03

Pos logit contributions
ags+0.015
 ey+0.013
 yourself+0.012
 ear+0.011
 yourselves+0.011
Neg logit contributions
s-0.019
t-0.017
ashed-0.014
 of-0.014
ounce-0.014
Token eyes
Token ID2951
Feature activation+1.49e-02

Pos logit contributions
ags+0.009
 yourself+0.009
 ey+0.008
 yourselves+0.008
ode+0.008
Neg logit contributions
s-0.011
t-0.009
ls-0.008
 together-0.008
uttered-0.008
Token shifted
Token ID14869
Feature activation+5.54e-04

Pos logit contributions
ags+0.028
 yourself+0.024
34+0.023
 upper+0.022
 ey+0.021
Neg logit contributions
s-0.024
ounce-0.019
t-0.018
ls-0.017
 batch-0.017
Token from
Token ID422
Feature activation+1.09e-02

Pos logit contributions
ags+0.001
 yourself+0.001
 ey+0.001
 yourselves+0.001
34+0.001
Neg logit contributions
s-0.001
t-0.001
 of-0.001
ashed-0.001
ounce-0.001
Token side
Token ID1735
Feature activation+2.03e-02

Pos logit contributions
ags+0.019
 yourself+0.016
 yourselves+0.015
 ey+0.015
34+0.013
Neg logit contributions
s-0.025
t-0.021
ashed-0.018
ounce-0.018
 of-0.017
Token to
Token ID284
Feature activation+5.67e-03

Pos logit contributions
ags+0.035
 yourself+0.030
 ey+0.028
34+0.028
 yourselves+0.028
Neg logit contributions
s-0.033
ashed-0.028
t-0.027
ounce-0.026
ls-0.025
Token side
Token ID1735
Feature activation+1.84e-02

Pos logit contributions
ags+0.011
 yourself+0.010
 ey+0.009
 yourselves+0.009
+0.008
Neg logit contributions
s-0.011
t-0.009
 of-0.008
ashed-0.008
ls-0.008
Token.
Token ID13
Feature activation+5.78e-03

Pos logit contributions
ags+0.033
 yourself+0.030
 ey+0.027
 yourselves+0.027
34+0.025
Neg logit contributions
s-0.030
t-0.026
ashed-0.026
ls-0.023
ounce-0.022
Token He
Token ID679
Feature activation+2.96e-03

Pos logit contributions
ags+0.009
 ey+0.008
 yourself+0.008
+0.008
 yourselves+0.007
Neg logit contributions
s-0.012
t-0.010
 of-0.009
ls-0.009
ies-0.009
Token felt
Token ID2936
Feature activation+3.47e-03

Pos logit contributions
 ey+0.004
ags+0.004
 yourself+0.004
avi+0.004
+0.004
Neg logit contributions
s-0.005
t-0.005
less-0.004
ounce-0.004
 of-0.004
Token let
Token ID1309
Feature activation+1.01e-03

Pos logit contributions
ags+0.013
 yourself+0.011
 ey+0.010
 yourselves+0.010
+0.010
Neg logit contributions
s-0.015
t-0.012
 of-0.011
 they-0.011
ls-0.011
Token go
Token ID467
Feature activation-8.27e-03

Pos logit contributions
ags+0.002
 yourself+0.002
 yourselves+0.001
 ey+0.001
+0.001
Neg logit contributions
s-0.002
t-0.002
ashed-0.002
 of-0.002
z-0.001
Token of
Token ID286
Feature activation-1.05e-02

Pos logit contributions
s+0.013
t+0.011
apy+0.010
 batch+0.009
ashed+0.009
Neg logit contributions
ags-0.018
 yourself-0.015
 yourselves-0.014
 ey-0.013
ode-0.013
Token your
Token ID534
Feature activation-1.07e-02

Pos logit contributions
s+0.021
t+0.018
 of+0.016
ashed+0.015
 they+0.014
Neg logit contributions
ags-0.020
 yourself-0.017
 ey-0.016
 yourselves-0.016
34-0.016
Token selfish
Token ID20363
Feature activation+6.73e-03

Pos logit contributions
s+0.020
t+0.018
 of+0.016
 they+0.015
 while+0.014
Neg logit contributions
ags-0.020
 ey-0.017
 yourself-0.016
34-0.015
 yourselves-0.015
Tokenness
Token ID1108
Feature activation+2.88e-02

Pos logit contributions
ags+0.011
 yourself+0.009
 ey+0.009
 yourselves+0.008
34+0.007
Neg logit contributions
s-0.014
t-0.012
 they-0.011
 together-0.011
z-0.011
Token and
Token ID290
Feature activation-4.88e-03

Pos logit contributions
ags+0.058
 yourself+0.055
 ey+0.049
 yourselves+0.048
ode+0.045
Neg logit contributions
s-0.042
t-0.035
ashed-0.031
apy-0.031
ls-0.031
Token serve
Token ID4691
Feature activation+3.25e-03

Pos logit contributions
s+0.009
t+0.008
z+0.007
ls+0.007
 of+0.007
Neg logit contributions
ags-0.008
 yourself-0.008
 yourselves-0.007
 ey-0.007
-0.006
Token others
Token ID1854
Feature activation+1.59e-02

Pos logit contributions
ags+0.006
 yourself+0.005
 yourselves+0.005
 ey+0.004
 ways+0.004
Neg logit contributions
s-0.006
t-0.006
z-0.005
ashed-0.005
oped-0.005
Token.
Token ID13
Feature activation+6.76e-04

Pos logit contributions
ags+0.033
 yourself+0.031
 yourselves+0.028
 ey+0.027
34+0.026
Neg logit contributions
s-0.023
t-0.020
ashed-0.018
ounce-0.017
 of-0.017
Token<|endoftext|>
Token ID50256
Feature activation-2.00e-03

Pos logit contributions
ags+0.002
 ey+0.001
 yourself+0.001
34+0.001
 yourselves+0.001
Neg logit contributions
s-0.001
t-0.001
 of-0.001
 they-0.001
 while-0.001
Token.
Token ID13
Feature activation+3.25e-03

Pos logit contributions
ags+0.026
 yourself+0.024
34+0.022
 yourselves+0.022
 ey+0.020
Neg logit contributions
s-0.025
t-0.021
ashed-0.019
apy-0.019
ounce-0.018
Token<|endoftext|>
Token ID50256
Feature activation-2.41e-03

Pos logit contributions
ags+0.007
 ey+0.006
 yourself+0.006
34+0.006
+0.006
Neg logit contributions
s-0.005
t-0.004
 of-0.004
 they-0.004
 while-0.004
TokenOnce
Token ID7454
Feature activation-1.52e-03

Pos logit contributions
s+0.004
t+0.003
 of+0.003
 they+0.003
 while+0.003
Neg logit contributions
ags-0.005
 ey-0.004
34-0.004
 yourself-0.004
 yourselves-0.004
Token upon
Token ID2402
Feature activation+1.12e-02

Pos logit contributions
s+0.002
t+0.001
 while+0.001
 of+0.001
ashed+0.001
Neg logit contributions
ags-0.003
 ey-0.003
34-0.003
 yourself-0.003
-0.003
Token a
Token ID257
Feature activation+5.59e-03

Pos logit contributions
ags+0.023
 yourself+0.020
 ey+0.019
 yourselves+0.018
34+0.018
Neg logit contributions
s-0.020
t-0.015
 of-0.014
 they-0.013
ashed-0.013
Token time
Token ID640
Feature activation+5.01e-04

Pos logit contributions
ags+0.011
 ey+0.010
34+0.009
 yourself+0.009
 yourselves+0.009
Neg logit contributions
s-0.010
t-0.008
 of-0.008
 they-0.008
 while-0.007
Token,
Token ID11
Feature activation-6.25e-03

Pos logit contributions
ags+0.001
 ey+0.001
 yourself+0.001
34+0.001
 yourselves+0.001
Neg logit contributions
s-0.001
t-0.000
 of-0.000
ashed-0.000
 batch-0.000
Token there
Token ID612
Feature activation-6.91e-03

Pos logit contributions
s+0.010
t+0.008
 of+0.007
 they+0.006
 while+0.006
Neg logit contributions
ags-0.014
 ey-0.011
 yourself-0.011
34-0.011
 yourselves-0.011
Token was
Token ID373
Feature activation+5.46e-03

Pos logit contributions
s+0.008
t+0.007
ashed+0.006
ls+0.005
ounce+0.005
Neg logit contributions
ags-0.015
 ey-0.014
 yourself-0.014
-0.013
 yourselves-0.013
Token a
Token ID257
Feature activation-2.64e-03

Pos logit contributions
ags+0.012
 ey+0.010
 yourself+0.009
 yourselves+0.009
34+0.009
Neg logit contributions
s-0.010
t-0.008
 of-0.007
 they-0.006
ounce-0.006
Token big
Token ID1263
Feature activation-3.40e-03

Pos logit contributions
s+0.004
 of+0.003
t+0.003
 they+0.003
 while+0.003
Neg logit contributions
ags-0.007
 ey-0.006
 yourself-0.005
 yourselves-0.005
34-0.005
Token,
Token ID11
Feature activation-1.05e-02

Pos logit contributions
s+0.010
t+0.007
 of+0.007
 they+0.006
 together+0.006
Neg logit contributions
ags-0.022
 ey-0.020
 yourself-0.019
34-0.019
 yourselves-0.019
Token Lily
Token ID20037
Feature activation+6.12e-03

Pos logit contributions
s+0.019
t+0.015
 of+0.014
 together+0.014
 while+0.013
Neg logit contributions
ags-0.020
 ey-0.018
 yourself-0.017
34-0.017
 yourselves-0.017
Token.
Token ID13
Feature activation-3.39e-03

Pos logit contributions
ags+0.014
 yourself+0.012
 ey+0.012
34+0.012
 yourselves+0.011
Neg logit contributions
s-0.008
t-0.007
 of-0.006
ashed-0.006
 while-0.006
Token You
Token ID921
Feature activation-1.19e-02

Pos logit contributions
s+0.007
t+0.006
 of+0.006
 they+0.006
 while+0.005
Neg logit contributions
ags-0.006
 ey-0.005
34-0.005
 yourself-0.004
 yourselves-0.004
Token are
Token ID389
Feature activation+4.87e-03

Pos logit contributions
s+0.024
z+0.021
 batch+0.020
 while+0.020
 together+0.019
Neg logit contributions
ags-0.016
 yourself-0.015
 yourselves-0.013
ode-0.013
 ey-0.013
Token wrong
Token ID2642
Feature activation+3.89e-03

Pos logit contributions
ags+0.011
 ey+0.010
34+0.009
+0.009
 yourself+0.008
Neg logit contributions
s-0.008
 of-0.006
t-0.006
 they-0.006
 together-0.005
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
ags+0.007
 yourself+0.006
 yourselves+0.006
34+0.006
 ey+0.006
Neg logit contributions
s-0.007
t-0.006
ashed-0.005
 batch-0.005
 mun-0.005
Token I
Token ID314
Feature activation-2.11e-03

Pos logit contributions
s+0.002
 of+0.002
t+0.002
 they+0.002
 together+0.002
Neg logit contributions
ags-0.003
 ey-0.002
34-0.002
ode-0.002
-0.002
Token am
Token ID716
Feature activation+2.04e-03

Pos logit contributions
s+0.004
seeing+0.004
cases+0.004
ls+0.004
 batch+0.004
Neg logit contributions
ags-0.002
Ju-0.002
 yourself-0.002
-0.002
 ey-0.002
Token the
Token ID262
Feature activation-1.60e-03

Pos logit contributions
ags+0.003
 yourself+0.002
 ey+0.002
 yourselves+0.002
34+0.002
Neg logit contributions
s-0.005
t-0.004
 of-0.004
 while-0.004
 they-0.004
Token king
Token ID5822
Feature activation+1.67e-02

Pos logit contributions
s+0.003
t+0.003
 of+0.002
 they+0.002
 while+0.002
Neg logit contributions
ags-0.003
 ey-0.003
 yourself-0.002
34-0.002
 yourselves-0.002
Token lit
Token ID6578
Feature activation-1.28e-02

Pos logit contributions
s+0.005
t+0.004
ounce+0.003
ls+0.003
 of+0.003
Neg logit contributions
 yourself-0.004
ags-0.004
 ey-0.004
 yourselves-0.004
-0.003
Token up
Token ID510
Feature activation-5.10e-03

Pos logit contributions
s+0.020
t+0.020
 of+0.016
ashed+0.016
ounce+0.015
Neg logit contributions
ags-0.025
 yourself-0.023
 yourselves-0.021
 ey-0.021
34-0.021
Token.
Token ID13
Feature activation+4.52e-03

Pos logit contributions
s+0.010
t+0.009
ashed+0.007
ounce+0.007
rolled+0.007
Neg logit contributions
ags-0.009
 yourself-0.008
 ey-0.008
 yourselves-0.008
34-0.007
Token Lucy
Token ID22162
Feature activation+2.61e-03

Pos logit contributions
ags+0.009
 ey+0.007
 yourself+0.007
+0.007
34+0.007
Neg logit contributions
s-0.008
t-0.007
 of-0.006
 they-0.006
 while-0.006
Token was
Token ID373
Feature activation+6.24e-03

Pos logit contributions
ags+0.004
 yourself+0.004
 ey+0.004
 upper+0.004
+0.004
Neg logit contributions
t-0.004
s-0.004
ls-0.003
 of-0.003
ounce-0.003
Token so
Token ID523
Feature activation+4.83e-03

Pos logit contributions
ags+0.010
 yourself+0.009
 ey+0.008
34+0.008
+0.008
Neg logit contributions
s-0.014
t-0.011
ls-0.010
 of-0.010
 they-0.009
Token happy
Token ID3772
Feature activation-1.06e-02

Pos logit contributions
ags+0.008
 yourself+0.007
 ey+0.007
 yourselves+0.007
34+0.007
Neg logit contributions
s-0.009
t-0.008
ls-0.007
ounce-0.007
 they-0.007
Token!
Token ID0
Feature activation+4.70e-03

Pos logit contributions
s+0.017
t+0.016
ashed+0.016
rolled+0.015
oped+0.014
Neg logit contributions
 yourself-0.017
 yourselves-0.015
 ey-0.015
ags-0.015
34-0.014
Token
Token ID220
Feature activation+3.32e-03

Pos logit contributions
ags+0.010
 ey+0.009
 yourself+0.008
34+0.008
 yourselves+0.008
Neg logit contributions
s-0.008
t-0.006
 of-0.006
 they-0.006
 while-0.006
Token
Token ID198
Feature activation+4.26e-04

Pos logit contributions
ags+0.007
 yourself+0.006
 ey+0.006
 yourselves+0.006
34+0.005
Neg logit contributions
s-0.005
t-0.004
 of-0.004
 while-0.004
 they-0.004
Token
Token ID198
Feature activation+8.32e-03

Pos logit contributions
ags+0.001
 ey+0.001
 yourself+0.001
34+0.001
 yourselves+0.001
Neg logit contributions
s-0.001
t-0.001
 of-0.001
 they-0.000
 while-0.000
Token,
Token ID11
Feature activation+8.87e-04

Pos logit contributions
s+0.001
t+0.001
ashed+0.001
ounce+0.001
 batch+0.001
Neg logit contributions
ags-0.002
 yourself-0.001
 ey-0.001
 yourselves-0.001
34-0.001
Token the
Token ID262
Feature activation-8.23e-04

Pos logit contributions
 yourself+0.001
 yourselves+0.001
ags+0.001
 ey+0.001
34+0.001
Neg logit contributions
s-0.002
ashed-0.001
t-0.001
ounce-0.001
z-0.001
Token judge
Token ID5052
Feature activation+1.71e-02

Pos logit contributions
s+0.002
t+0.002
 of+0.001
 together+0.001
 they+0.001
Neg logit contributions
ags-0.001
 ey-0.001
 yourself-0.001
34-0.001
 yourselves-0.001
Token would
Token ID561
Feature activation+5.54e-03

Pos logit contributions
 yourself+0.022
ags+0.021
ode+0.020
 narrative+0.019
aces+0.019
Neg logit contributions
s-0.028
ounce-0.025
t-0.024
 batch-0.024
apy-0.024
Token always
Token ID1464
Feature activation+4.12e-03

Pos logit contributions
ags+0.008
 yourself+0.008
 ey+0.007
 yourselves+0.006
ode+0.006
Neg logit contributions
s-0.013
ashed-0.010
ls-0.010
ounce-0.009
z-0.009
Token listen
Token ID6004
Feature activation+1.16e-02

Pos logit contributions
ags+0.007
 yourself+0.006
 ey+0.006
 yourselves+0.006
34+0.005
Neg logit contributions
s-0.009
t-0.007
 of-0.006
ashed-0.006
ls-0.006
Token and
Token ID290
Feature activation-1.50e-04

Pos logit contributions
ags+0.025
 yourself+0.022
34+0.021
 yourselves+0.020
 upper+0.020
Neg logit contributions
s-0.014
t-0.011
ounce-0.011
oped-0.011
apy-0.011
Token try
Token ID1949
Feature activation+1.31e-02

Pos logit contributions
s+0.000
t+0.000
ashed+0.000
 of+0.000
z+0.000
Neg logit contributions
ags-0.000
 yourself-0.000
 ey-0.000
 yourselves-0.000
34-0.000
Token to
Token ID284
Feature activation+4.44e-03

Pos logit contributions
ags+0.031
 yourself+0.028
 ey+0.027
34+0.026
 yourselves+0.025
Neg logit contributions
s-0.017
t-0.016
 of-0.012
ashed-0.012
z-0.011
Token help
Token ID1037
Feature activation+1.61e-02

Pos logit contributions
ags+0.008
 yourself+0.007
 ey+0.007
 yourselves+0.007
34+0.006
Neg logit contributions
s-0.009
t-0.007
ashed-0.007
 of-0.006
 they-0.006
Token.
Token ID13
Feature activation-5.61e-03

Pos logit contributions
ags+0.032
 ey+0.027
 yourself+0.026
34+0.026
 yourselves+0.025
Neg logit contributions
s-0.030
t-0.024
 of-0.023
 they-0.021
 together-0.020
Token so
Token ID523
Feature activation-7.32e-03

Pos logit contributions
s+0.003
t+0.003
z+0.002
ashed+0.002
ls+0.002
Neg logit contributions
ags-0.002
 ey-0.002
 yourself-0.002
 yourselves-0.002
34-0.002
Token they
Token ID484
Feature activation-7.56e-03

Pos logit contributions
t+0.010
s+0.009
oped+0.009
uggle+0.008
z+0.008
Neg logit contributions
 yourselves-0.011
 yourself-0.011
34-0.011
 ear-0.011
 anchor-0.010
Token decided
Token ID3066
Feature activation+6.46e-03

Pos logit contributions
s+0.013
t+0.011
 of+0.010
ounce+0.009
z+0.009
Neg logit contributions
ags-0.014
 ey-0.012
ode-0.012
 yourself-0.012
34-0.011
Token to
Token ID284
Feature activation+4.56e-03

Pos logit contributions
 yourself+0.012
ags+0.012
 yourselves+0.011
34+0.011
+0.010
Neg logit contributions
s-0.009
t-0.007
ashed-0.007
 batch-0.007
 of-0.006
Token separate
Token ID4553
Feature activation+8.66e-03

Pos logit contributions
ags+0.008
 yourself+0.007
 ey+0.007
 yourselves+0.006
ode+0.006
Neg logit contributions
s-0.010
t-0.008
ls-0.007
ashed-0.007
ounce-0.007
Token their
Token ID511
Feature activation-4.61e-03

Pos logit contributions
ags+0.017
 ey+0.014
 yourself+0.014
 yourselves+0.013
34+0.012
Neg logit contributions
s-0.016
t-0.015
ashed-0.012
z-0.012
ounce-0.012
Token eyes
Token ID2951
Feature activation+2.23e-02

Pos logit contributions
s+0.010
t+0.009
 of+0.008
 they+0.008
 while+0.007
Neg logit contributions
ags-0.008
 ey-0.007
 yourself-0.006
 yourselves-0.006
 upper-0.006
Token.
Token ID13
Feature activation+3.88e-03

Pos logit contributions
ags+0.042
 yourself+0.036
34+0.035
 ey+0.034
per+0.033
Neg logit contributions
s-0.030
 mun-0.025
t-0.025
ashed-0.025
ounce-0.023
Token The
Token ID383
Feature activation+4.33e-04

Pos logit contributions
ags+0.007
 ey+0.006
+0.005
34+0.005
 yourselves+0.005
Neg logit contributions
s-0.007
t-0.006
ls-0.006
 of-0.006
ies-0.005
Token bear
Token ID6842
Feature activation-5.61e-04

Pos logit contributions
ags+0.001
 ey+0.001
34+0.001
 yourself+0.001
ode+0.001
Neg logit contributions
s-0.001
t-0.001
 of-0.001
 they-0.001
 together-0.001
Token's
Token ID338
Feature activation+1.18e-02

Pos logit contributions
s+0.001
t+0.001
less+0.001
uggle+0.001
othe+0.001
Neg logit contributions
ode-0.001
yah-0.001
per-0.001
pe-0.001
et-0.001
Token You
Token ID921
Feature activation-1.25e-02

Pos logit contributions
s+0.011
 of+0.009
t+0.009
 they+0.009
 while+0.008
Neg logit contributions
ags-0.012
 ey-0.010
34-0.010
ode-0.009
 yourselves-0.009
Token are
Token ID389
Feature activation+3.14e-03

Pos logit contributions
s+0.022
t+0.017
 of+0.017
 they+0.015
 while+0.015
Neg logit contributions
ags-0.026
 ey-0.022
 yourself-0.021
34-0.021
 yourselves-0.021
Token a
Token ID257
Feature activation-9.95e-03

Pos logit contributions
 ey+0.008
ags+0.008
ode+0.007
34+0.007
 upper+0.007
Neg logit contributions
s-0.003
 together-0.003
 of-0.003
 they-0.003
 with-0.002
Token good
Token ID922
Feature activation+1.35e-03

Pos logit contributions
s+0.012
 of+0.009
 while+0.009
 they+0.009
 together+0.008
Neg logit contributions
ags-0.027
 ey-0.024
34-0.022
 yourselves-0.022
-0.021
Token sister
Token ID6621
Feature activation+1.07e-02

Pos logit contributions
ags+0.002
 ey+0.002
34+0.002
 yourself+0.002
 upper+0.002
Neg logit contributions
s-0.003
t-0.002
 they-0.002
 of-0.002
 together-0.002
Token."
Token ID526
Feature activation-2.12e-02

Pos logit contributions
ags+0.022
 yourself+0.019
34+0.019
 ey+0.018
 yourselves+0.018
Neg logit contributions
s-0.018
t-0.014
 of-0.013
 they-0.012
 while-0.012
Token
Token ID198
Feature activation-1.75e-02

Pos logit contributions
s+0.035
t+0.028
 of+0.026
 they+0.024
 while+0.024
Neg logit contributions
ags-0.046
 ey-0.039
 yourself-0.037
34-0.037
 yourselves-0.036
Token
Token ID198
Feature activation-7.60e-03

Pos logit contributions
s+0.027
t+0.023
 of+0.020
 while+0.020
 they+0.020
Neg logit contributions
ags-0.038
 ey-0.032
 yourself-0.031
 yourselves-0.030
34-0.030
TokenHe
Token ID1544
Feature activation-3.72e-04

Pos logit contributions
s+0.014
t+0.012
 of+0.011
 they+0.011
 while+0.010
Neg logit contributions
ags-0.015
 ey-0.012
 yourself-0.012
34-0.012
 yourselves-0.011
Token finished
Token ID5201
Feature activation-7.58e-03

Pos logit contributions
s+0.001
t+0.001
 of+0.000
less+0.000
 while+0.000
Neg logit contributions
ags-0.001
 ey-0.001
34-0.001
per-0.001
 yourself-0.001
Token sewing
Token ID35356
Feature activation-2.20e-02

Pos logit contributions
s+0.014
t+0.013
 of+0.011
ashed+0.010
 while+0.010
Neg logit contributions
ags-0.015
 ey-0.013
 yourself-0.012
 yourselves-0.012
34-0.011
TokenM
Token ID44
Feature activation+1.10e-02

Pos logit contributions
ags+0.008
 ey+0.006
 yourself+0.006
+0.006
 yourselves+0.006
Neg logit contributions
s-0.011
t-0.009
 of-0.009
 they-0.009
 together-0.009
Tokenum
Token ID388
Feature activation+2.47e-03

Pos logit contributions
ags+0.022
 ey+0.019
34+0.018
 yourself+0.018
 yourselves+0.017
Neg logit contributions
s-0.019
t-0.015
 of-0.014
 they-0.014
 while-0.014
Token,
Token ID11
Feature activation-1.14e-03

Pos logit contributions
ags+0.004
ode+0.004
 ey+0.004
34+0.003
per+0.003
Neg logit contributions
s-0.004
t-0.003
othe-0.003
 while-0.003
 whenever-0.003
Token Tom
Token ID4186
Feature activation-9.50e-04

Pos logit contributions
s+0.002
t+0.002
 of+0.002
 they+0.001
 together+0.001
Neg logit contributions
ags-0.002
 ey-0.002
34-0.002
 yourself-0.002
 yourselves-0.002
Token questioned
Token ID11434
Feature activation-4.96e-03

Pos logit contributions
s+0.002
t+0.001
 batch+0.001
uggle+0.001
apy+0.001
Neg logit contributions
per-0.001
yah-0.001
34-0.001
ode-0.001
 yourself-0.001
Token,
Token ID11
Feature activation-7.13e-03

Pos logit contributions
s+0.009
t+0.007
 of+0.006
ashed+0.006
ls+0.006
Neg logit contributions
ags-0.010
 yourself-0.008
 ey-0.008
 yourselves-0.008
34-0.008
Token "
Token ID366
Feature activation+2.48e-03

Pos logit contributions
s+0.013
t+0.010
 of+0.010
 they+0.010
 while+0.009
Neg logit contributions
ags-0.014
 ey-0.012
34-0.011
 yourself-0.011
 yourselves-0.011
TokenWhat
Token ID2061
Feature activation-1.59e-02

Pos logit contributions
ags+0.004
 ey+0.003
34+0.003
 yourself+0.002
 yourselves+0.002
Neg logit contributions
s-0.006
t-0.005
 of-0.005
 they-0.005
 while-0.005
Token is
Token ID318
Feature activation+5.90e-03

Pos logit contributions
s+0.028
t+0.024
ashed+0.021
 while+0.021
 of+0.021
Neg logit contributions
ags-0.028
 yourself-0.026
 ey-0.025
34-0.025
 yourselves-0.024
Token that
Token ID326
Feature activation+1.27e-02

Pos logit contributions
ags+0.016
 ey+0.014
34+0.014
 Deal+0.013
+0.013
Neg logit contributions
s-0.007
 they-0.005
 of-0.005
t-0.004
 together-0.004
Token high
Token ID1029
Feature activation+1.59e-02

Pos logit contributions
ags+0.023
 ey+0.019
34+0.018
 yourself+0.018
 yourselves+0.017
Neg logit contributions
s-0.026
t-0.021
 of-0.021
 they-0.019
 while-0.019
Token went
Token ID1816
Feature activation-1.26e-02

Pos logit contributions
s+0.009
t+0.007
 of+0.006
 they+0.006
 while+0.006
Neg logit contributions
ags-0.015
 ey-0.014
34-0.013
 yourself-0.013
 yourselves-0.013
Token to
Token ID284
Feature activation+1.22e-03

Pos logit contributions
s+0.014
t+0.011
 of+0.009
apy+0.008
 while+0.008
Neg logit contributions
ags-0.033
 ey-0.026
34-0.026
 yourself-0.026
 yourselves-0.025
Token visit
Token ID3187
Feature activation-8.55e-03

Pos logit contributions
ags+0.003
 ey+0.002
34+0.002
 yourself+0.002
 yourselves+0.002
Neg logit contributions
s-0.002
t-0.002
 of-0.002
 they-0.001
 while-0.001
Token their
Token ID511
Feature activation-8.10e-03

Pos logit contributions
s+0.012
 of+0.009
 they+0.008
t+0.008
 together+0.008
Neg logit contributions
ags-0.020
 ey-0.019
34-0.018
 yourself-0.017
 yourselves-0.017
Token neighbor
Token ID4780
Feature activation-2.25e-03

Pos logit contributions
s+0.014
t+0.011
 of+0.011
 they+0.010
 while+0.010
Neg logit contributions
ags-0.017
 ey-0.014
34-0.014
 yourself-0.014
 yourselves-0.014
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
s+0.004
t+0.003
apy+0.003
 of+0.003
ashed+0.003
Neg logit contributions
ags-0.004
 ey-0.004
 yourself-0.004
-0.004
34-0.004
Token Mrs
Token ID9074
Feature activation-1.35e-02

Pos logit contributions
s+0.020
t+0.016
 of+0.015
ashed+0.014
ls+0.014
Neg logit contributions
ags-0.020
 ey-0.016
 yourself-0.016
34-0.016
 yourselves-0.016
Token.
Token ID13
Feature activation-3.47e-04

Pos logit contributions
s+0.018
 of+0.012
t+0.012
 they+0.011
 while+0.010
Neg logit contributions
ags-0.035
 ey-0.032
 yourself-0.029
 yourselves-0.029
34-0.029
Token Lee
Token ID5741
Feature activation-2.45e-03

Pos logit contributions
s+0.001
t+0.000
 of+0.000
 they+0.000
 while+0.000
Neg logit contributions
ags-0.001
 ey-0.001
34-0.001
 yourself-0.001
 yourselves-0.001
Token.
Token ID13
Feature activation-3.16e-03

Pos logit contributions
s+0.003
t+0.003
apy+0.003
ashed+0.003
z+0.003
Neg logit contributions
ags-0.004
34-0.004
 ey-0.004
-0.003
 ma-0.003
Token Mrs
Token ID9074
Feature activation-1.36e-02

Pos logit contributions
s+0.005
t+0.004
 of+0.004
 they+0.004
 while+0.003
Neg logit contributions
ags-0.007
 ey-0.006
 yourself-0.006
34-0.006
 yourselves-0.006
TokenOnce
Token ID7454
Feature activation+3.65e-03

Pos logit contributions
s+0.001
t+0.001
 of+0.001
 they+0.001
 while+0.001
Neg logit contributions
ags-0.001
 ey-0.001
 yourself-0.001
34-0.001
-0.001
Token upon
Token ID2402
Feature activation+1.72e-02

Pos logit contributions
ags+0.008
 ey+0.007
34+0.007
+0.006
 yourself+0.006
Neg logit contributions
s-0.004
t-0.003
ashed-0.003
 while-0.003
 mun-0.003
Token a
Token ID257
Feature activation+7.35e-03

Pos logit contributions
ags+0.033
 yourself+0.029
 ey+0.028
 yourselves+0.027
34+0.026
Neg logit contributions
s-0.032
t-0.025
ashed-0.022
 of-0.022
ls-0.022
Token time
Token ID640
Feature activation+2.57e-03

Pos logit contributions
ags+0.013
 ey+0.011
34+0.010
 yourself+0.010
+0.010
Neg logit contributions
s-0.015
t-0.012
 of-0.012
 they-0.011
 together-0.011
Token,
Token ID11
Feature activation-1.88e-03

Pos logit contributions
ags+0.005
 ey+0.005
 yourself+0.004
34+0.004
+0.004
Neg logit contributions
s-0.004
t-0.003
ashed-0.003
 batch-0.003
 of-0.003
Token there
Token ID612
Feature activation-8.64e-03

Pos logit contributions
s+0.003
t+0.003
ounce+0.002
 batch+0.002
ashed+0.002
Neg logit contributions
ags-0.003
 yourself-0.003
 ey-0.003
 yourselves-0.003
-0.003
Token was
Token ID373
Feature activation+5.96e-03

Pos logit contributions
s+0.011
t+0.010
ls+0.008
ashed+0.008
ounce+0.008
Neg logit contributions
ags-0.017
 ey-0.017
-0.016
 yourself-0.016
 yourselves-0.015
Token an
Token ID281
Feature activation+9.79e-03

Pos logit contributions
ags+0.012
 ey+0.010
 yourself+0.009
+0.009
 yourselves+0.009
Neg logit contributions
s-0.012
t-0.009
 of-0.008
ounce-0.008
ls-0.008
Token old
Token ID1468
Feature activation+4.16e-03

Pos logit contributions
ags+0.020
 ey+0.017
34+0.017
 yourself+0.017
 Deal+0.017
Neg logit contributions
s-0.015
 of-0.014
 while-0.012
 they-0.011
 together-0.011
Token man
Token ID582
Feature activation+8.33e-03

Pos logit contributions
ags+0.007
 ey+0.006
 yourself+0.005
34+0.005
 yourselves+0.005
Neg logit contributions
s-0.009
t-0.008
 of-0.007
 together-0.006
ashed-0.006
Token who
Token ID508
Feature activation+9.61e-03

Pos logit contributions
ags+0.013
 ey+0.011
 yourself+0.011
34+0.010
ode+0.010
Neg logit contributions
s-0.017
t-0.014
ls-0.013
ashed-0.013
 they-0.012
Token What
Token ID1867
Feature activation-1.96e-02

Pos logit contributions
s+0.009
t+0.008
 of+0.008
 they+0.007
 while+0.007
Neg logit contributions
ags-0.007
 ey-0.006
34-0.005
 yourself-0.005
 yourselves-0.005
Token are
Token ID389
Feature activation+6.11e-03

Pos logit contributions
s+0.034
t+0.027
 of+0.025
 they+0.024
 while+0.023
Neg logit contributions
ags-0.040
 ey-0.035
 yourself-0.034
34-0.033
 yourselves-0.032
Token you
Token ID345
Feature activation+1.58e-02

Pos logit contributions
ags+0.013
 ey+0.011
34+0.010
 yourself+0.010
+0.010
Neg logit contributions
s-0.011
t-0.009
 of-0.009
 they-0.008
 while-0.008
Token doing
Token ID1804
Feature activation+8.34e-03

Pos logit contributions
ags+0.030
 ey+0.025
34+0.023
 yourselves+0.022
 yourself+0.022
Neg logit contributions
s-0.031
t-0.025
 of-0.025
 they-0.023
 while-0.023
Token here
Token ID994
Feature activation+2.25e-02

Pos logit contributions
ags+0.014
 yourself+0.011
 ey+0.011
 yourselves+0.010
34+0.010
Neg logit contributions
s-0.019
t-0.015
 of-0.014
 they-0.014
ashed-0.014
Token?"
Token ID1701
Feature activation-1.50e-02

Pos logit contributions
ags+0.029
 yourself+0.023
 ey+0.020
 yourselves+0.020
34+0.018
Neg logit contributions
s-0.058
t-0.050
 they-0.045
ashed-0.045
 of-0.044
Token Lily
Token ID20037
Feature activation-1.14e-02

Pos logit contributions
s+0.023
t+0.019
 of+0.017
ounce+0.016
ashed+0.016
Neg logit contributions
ags-0.030
 ey-0.026
-0.026
 yourself-0.026
 yourselves-0.025
Token asked
Token ID1965
Feature activation-1.37e-02

Pos logit contributions
s+0.014
t+0.008
 of+0.008
 they+0.008
 while+0.007
Neg logit contributions
ags-0.032
 ey-0.028
34-0.027
 yourselves-0.026
 yourself-0.026
Token.
Token ID13
Feature activation-6.21e-03

Pos logit contributions
s+0.021
t+0.016
 of+0.015
 they+0.014
 while+0.013
Neg logit contributions
ags-0.031
 ey-0.027
 yourself-0.026
34-0.026
-0.025
Token
Token ID198
Feature activation-8.11e-03

Pos logit contributions
s+0.009
t+0.007
 of+0.007
 they+0.006
 while+0.006
Neg logit contributions
ags-0.015
 ey-0.013
34-0.013
 yourself-0.013
 yourselves-0.013
Token
Token ID198
Feature activation+6.05e-03

Pos logit contributions
s+0.012
t+0.009
 of+0.008
 they+0.008
 while+0.007
Neg logit contributions
ags-0.020
 ey-0.017
34-0.016
 yourself-0.016
 yourselves-0.016
Token me
Token ID502
Feature activation+1.29e-02

Pos logit contributions
s+0.017
 of+0.015
t+0.015
 they+0.014
 while+0.014
Neg logit contributions
ags-0.017
 ey-0.015
34-0.013
-0.013
 yourself-0.013
Token,
Token ID11
Feature activation-1.30e-02

Pos logit contributions
ags+0.025
 yourself+0.022
34+0.021
 yourselves+0.020
+0.019
Neg logit contributions
s-0.022
apy-0.017
ashed-0.017
t-0.016
 they-0.016
Token Lily
Token ID20037
Feature activation+5.26e-03

Pos logit contributions
s+0.024
t+0.019
 of+0.017
 together+0.016
z+0.015
Neg logit contributions
ags-0.026
 ey-0.022
 yourself-0.022
34-0.021
 yourselves-0.021
Token!"
Token ID2474
Feature activation-9.66e-03

Pos logit contributions
ags+0.009
 yourself+0.009
 yourselves+0.008
34+0.008
+0.008
Neg logit contributions
s-0.009
t-0.008
ashed-0.007
apy-0.007
 of-0.007
Token Ben
Token ID3932
Feature activation-8.86e-03

Pos logit contributions
s+0.021
t+0.018
 of+0.017
 together+0.016
 while+0.016
Neg logit contributions
ags-0.015
 ey-0.013
 yourself-0.012
 yourselves-0.011
34-0.011
Token calls
Token ID3848
Feature activation-3.11e-02

Pos logit contributions
s+0.016
t+0.014
 of+0.013
 while+0.012
less+0.012
Neg logit contributions
ags-0.017
 ey-0.014
-0.013
 yourself-0.013
34-0.013
Token.
Token ID13
Feature activation-2.42e-02

Pos logit contributions
s+0.052
t+0.039
 of+0.035
 they+0.033
ashed+0.032
Neg logit contributions
ags-0.068
 ey-0.055
34-0.054
 yourself-0.054
-0.053
Token "
Token ID366
Feature activation-9.58e-03

Pos logit contributions
s+0.048
t+0.039
 of+0.038
 they+0.035
 together+0.034
Neg logit contributions
ags-0.044
 ey-0.037
34-0.035
 yourself-0.034
 yourselves-0.033
TokenWe
Token ID1135
Feature activation-1.33e-02

Pos logit contributions
s+0.019
t+0.017
 of+0.015
 while+0.015
 they+0.014
Neg logit contributions
ags-0.018
 ey-0.016
34-0.014
 yourself-0.013
 yourselves-0.013
Token can
Token ID460
Feature activation-7.07e-04

Pos logit contributions
s+0.025
t+0.019
 of+0.018
 they+0.018
 together+0.018
Neg logit contributions
ags-0.025
 ey-0.021
 yourself-0.020
34-0.020
ode-0.020
Token slide
Token ID10649
Feature activation-8.74e-03

Pos logit contributions
s+0.001
ashed+0.001
t+0.001
 batch+0.001
 of+0.001
Neg logit contributions
ags-0.001
 yourself-0.001
 yourselves-0.001
-0.001
34-0.001
Tokenâ
Token ID22940
Feature activation-2.46e-03

Pos logit contributions
ags+0.011
 ey+0.010
 yourself+0.009
34+0.009
 yourselves+0.009
Neg logit contributions
s-0.013
t-0.011
 of-0.010
 they-0.010
 while-0.010
Token
Token ID26391
Feature activation+1.42e-02

Pos logit contributions
s+0.006
t+0.006
 of+0.005
 together+0.005
 while+0.005
Neg logit contributions
 ey-0.003
34-0.003
ags-0.003
 yourselves-0.002
ode-0.002
Token
Token ID198
Feature activation-2.31e-03

Pos logit contributions
ags+0.029
 ey+0.025
 yourself+0.024
34+0.024
 yourselves+0.023
Neg logit contributions
s-0.025
t-0.020
 of-0.020
 they-0.018
 while-0.017
Token
Token ID198
Feature activation+5.28e-03

Pos logit contributions
s+0.004
t+0.003
 of+0.003
 they+0.003
 while+0.003
Neg logit contributions
ags-0.005
 ey-0.004
 yourself-0.004
34-0.004
 yourselves-0.004
TokenT
Token ID51
Feature activation+6.57e-04

Pos logit contributions
ags+0.009
 ey+0.007
 yourself+0.007
34+0.007
 yourselves+0.007
Neg logit contributions
s-0.011
t-0.009
 of-0.009
 they-0.009
 while-0.008
Tokeneddy
Token ID21874
Feature activation-3.48e-02

Pos logit contributions
ags+0.002
 ey+0.001
34+0.001
 yourself+0.001
 yourselves+0.001
Neg logit contributions
s-0.001
t-0.001
 of-0.001
 they-0.001
 while-0.001
Token hugged
Token ID41130
Feature activation-2.23e-03

Pos logit contributions
s+0.054
t+0.051
 of+0.045
 they+0.039
less+0.038
Neg logit contributions
ags-0.068
 yourself-0.060
 ey-0.059
-0.058
ode-0.056
Token Wire
Token ID14712
Feature activation-3.78e-03

Pos logit contributions
s+0.004
t+0.003
 of+0.003
 they+0.003
 together+0.003
Neg logit contributions
ags-0.004
 ey-0.004
 yourself-0.004
 yourselves-0.003
-0.003
Token-
Token ID12
Feature activation-7.86e-03

Pos logit contributions
s+0.005
t+0.005
 of+0.004
 they+0.004
ashed+0.004
Neg logit contributions
ags-0.008
 yourself-0.007
 ey-0.007
-0.007
34-0.007
TokenDog
Token ID32942
Feature activation-5.50e-03

Pos logit contributions
s+0.014
 of+0.011
 they+0.011
t+0.010
 together+0.010
Neg logit contributions
ags-0.014
 yourself-0.013
-0.012
 ey-0.012
 yourselves-0.012
Token and
Token ID290
Feature activation+9.12e-03

Pos logit contributions
s+0.008
t+0.007
 of+0.007
ls+0.006
 they+0.006
Neg logit contributions
ags-0.011
 yourself-0.010
-0.009
34-0.009
 ey-0.009
Token She
Token ID1375
Feature activation+6.16e-03

Pos logit contributions
ags+0.021
 yourself+0.018
 ey+0.018
 yourselves+0.017
+0.017
Neg logit contributions
s-0.026
t-0.023
 of-0.020
ls-0.020
ies-0.020
Token also
Token ID635
Feature activation+3.68e-04

Pos logit contributions
ags+0.009
 ey+0.009
 yourself+0.009
 yourselves+0.008
avi+0.007
Neg logit contributions
s-0.011
t-0.011
less-0.009
ounce-0.009
 while-0.008
Token had
Token ID550
Feature activation-9.98e-03

Pos logit contributions
ags+0.001
 ey+0.001
 yourself+0.001
 yourselves+0.001
34+0.000
Neg logit contributions
s-0.001
t-0.001
ounce-0.001
 of-0.001
 while-0.000
Token a
Token ID257
Feature activation-6.82e-03

Pos logit contributions
s+0.019
t+0.015
ounce+0.014
ls+0.013
 of+0.013
Neg logit contributions
ags-0.019
 yourself-0.017
 yourselves-0.016
34-0.015
 ey-0.015
Token t
Token ID256
Feature activation-1.88e-02

Pos logit contributions
s+0.014
t+0.012
 of+0.011
ounce+0.011
ashed+0.010
Neg logit contributions
ags-0.010
 ey-0.010
 upper-0.008
34-0.008
 bluff-0.008
Tokeneddy
Token ID21874
Feature activation-4.21e-02

Pos logit contributions
s+0.033
 they+0.031
 while+0.028
 together+0.028
 chat+0.027
Neg logit contributions
ags-0.030
 ey-0.025
 yourselves-0.024
34-0.023
ode-0.022
Token bear
Token ID6842
Feature activation-9.79e-03

Pos logit contributions
s+0.091
t+0.082
 of+0.074
ounce+0.071
 they+0.071
Neg logit contributions
ags-0.064
 yourself-0.050
 ey-0.049
34-0.047
 yourselves-0.047
Token,
Token ID11
Feature activation-7.46e-03

Pos logit contributions
s+0.016
t+0.014
ls+0.013
ashed+0.013
apy+0.012
Neg logit contributions
ags-0.015
 yourself-0.015
34-0.014
per-0.014
ode-0.014
Token a
Token ID257
Feature activation+1.82e-03

Pos logit contributions
s+0.014
t+0.013
ashed+0.011
ounce+0.010
ls+0.010
Neg logit contributions
ags-0.013
 yourself-0.012
 ey-0.012
 yourselves-0.011
34-0.011
Token doll
Token ID3654
Feature activation-2.53e-02

Pos logit contributions
ags+0.003
 ey+0.002
34+0.002
 yourself+0.002
 upper+0.002
Neg logit contributions
s-0.004
t-0.003
 of-0.003
ashed-0.003
ounce-0.003
Token,
Token ID11
Feature activation-4.90e-03

Pos logit contributions
s+0.038
t+0.034
ashed+0.031
ls+0.029
ounce+0.028
Neg logit contributions
ags-0.045
34-0.041
 yourself-0.040
per-0.038
 ey-0.038
Token She
Token ID1375
Feature activation+4.96e-03

Pos logit contributions
ags+0.011
 ey+0.009
 yourself+0.009
34+0.009
 yourselves+0.009
Neg logit contributions
s-0.012
t-0.010
 of-0.010
 they-0.009
 while-0.009
Token saw
Token ID2497
Feature activation+6.37e-03

Pos logit contributions
ags+0.008
 ey+0.008
34+0.007
 yourself+0.007
ode+0.007
Neg logit contributions
s-0.008
t-0.008
 of-0.007
less-0.006
 while-0.006
Token rain
Token ID6290
Feature activation+2.05e-03

Pos logit contributions
ags+0.011
 yourself+0.010
 yourselves+0.009
34+0.009
 ey+0.009
Neg logit contributions
s-0.012
t-0.012
ashed-0.010
ounce-0.010
z-0.010
Tokenbows
Token ID25435
Feature activation-8.35e-03

Pos logit contributions
ags+0.004
 ey+0.003
34+0.003
 yourself+0.003
 yourselves+0.003
Neg logit contributions
s-0.003
t-0.003
 of-0.002
ls-0.002
apy-0.002
Token and
Token ID290
Feature activation-6.44e-04

Pos logit contributions
s+0.012
t+0.012
ashed+0.010
rolled+0.010
ounce+0.010
Neg logit contributions
ags-0.014
34-0.013
 ey-0.012
 yourself-0.012
per-0.011
Token unic
Token ID28000
Feature activation-3.70e-02

Pos logit contributions
s+0.001
t+0.001
ounce+0.001
oped+0.001
z+0.001
Neg logit contributions
 ey-0.001
 yourself-0.001
 yourselves-0.001
ags-0.001
34-0.001
Tokenorns
Token ID19942
Feature activation+4.31e-03

Pos logit contributions
s+0.066
 of+0.054
 together+0.049
t+0.048
 they+0.044
Neg logit contributions
ags-0.069
 ey-0.064
-0.062
34-0.062
 yourselves-0.060
Token and
Token ID290
Feature activation+1.57e-03

Pos logit contributions
ags+0.008
 yourself+0.007
34+0.007
 ey+0.007
per+0.006
Neg logit contributions
s-0.006
t-0.005
 batch-0.005
ashed-0.005
apy-0.004
Token lots
Token ID6041
Feature activation-1.03e-02

Pos logit contributions
 yourself+0.002
 ey+0.002
ags+0.002
 yourselves+0.002
34+0.002
Neg logit contributions
s-0.003
t-0.003
ounce-0.003
z-0.003
oped-0.003
Token of
Token ID286
Feature activation-2.50e-02

Pos logit contributions
s+0.010
ounce+0.008
t+0.008
rolled+0.008
z+0.007
Neg logit contributions
ags-0.023
 ey-0.021
34-0.021
 yourself-0.020
 ways-0.019
Token other
Token ID584
Feature activation-1.26e-02

Pos logit contributions
s+0.053
t+0.047
ounce+0.043
z+0.039
ashed+0.039
Neg logit contributions
ags-0.041
 ey-0.037
 yourself-0.034
 yourselves-0.031
34-0.031
Token.
Token ID13
Feature activation+4.03e-03

Pos logit contributions
ags+0.007
34+0.006
per+0.005
 yourself+0.005
iece+0.005
Neg logit contributions
s-0.007
t-0.007
apy-0.006
 batch-0.006
ashed-0.006
Token It
Token ID632
Feature activation-9.28e-03

Pos logit contributions
ags+0.008
 ey+0.006
34+0.006
 yourself+0.006
 yourselves+0.006
Neg logit contributions
s-0.008
t-0.006
 of-0.006
 they-0.006
 while-0.005
Token hit
Token ID2277
Feature activation-1.04e-02

Pos logit contributions
s+0.015
t+0.014
less+0.013
 of+0.013
seeing+0.012
Neg logit contributions
ags-0.012
et-0.011
-0.011
 ey-0.010
iture-0.010
Token the
Token ID262
Feature activation+3.85e-03

Pos logit contributions
s+0.020
t+0.017
 of+0.016
 they+0.015
 while+0.014
Neg logit contributions
ags-0.020
 ey-0.016
 yourself-0.016
34-0.015
 yourselves-0.015
Token t
Token ID256
Feature activation-6.08e-03

Pos logit contributions
ags+0.007
 ey+0.006
 yourself+0.005
34+0.005
 yourselves+0.005
Neg logit contributions
s-0.008
t-0.007
 of-0.006
 they-0.006
 together-0.006
Tokeneddy
Token ID21874
Feature activation-3.40e-02

Pos logit contributions
s+0.010
 they+0.008
t+0.008
 while+0.008
 of+0.007
Neg logit contributions
ags-0.012
 ey-0.010
 yourselves-0.010
 yourself-0.010
34-0.010
Token bear
Token ID6842
Feature activation-6.85e-03

Pos logit contributions
s+0.065
t+0.055
 of+0.052
 they+0.049
 together+0.047
Neg logit contributions
ags-0.063
 ey-0.052
 yourself-0.050
34-0.049
 yourselves-0.048
Token and
Token ID290
Feature activation+5.30e-03

Pos logit contributions
s+0.010
t+0.009
ashed+0.008
 they+0.008
 batch+0.008
Neg logit contributions
ags-0.013
 yourself-0.011
34-0.010
 ey-0.010
per-0.010
Token bounced
Token ID31318
Feature activation-6.31e-03

Pos logit contributions
ags+0.009
 ey+0.008
 yourself+0.008
 yourselves+0.008
34+0.008
Neg logit contributions
s-0.009
t-0.008
 of-0.007
ounce-0.007
less-0.006
Token back
Token ID736
Feature activation-3.17e-03

Pos logit contributions
s+0.012
t+0.010
 of+0.009
 batch+0.009
ounce+0.009
Neg logit contributions
ags-0.011
 yourself-0.009
 ey-0.009
 yourselves-0.008
34-0.008
Token.
Token ID13
Feature activation+7.55e-03

Pos logit contributions
s+0.005
t+0.005
ashed+0.004
 mun+0.004
apy+0.004
Neg logit contributions
ags-0.006
 yourself-0.005
 yourselves-0.005
 ey-0.005
34-0.004