TOP ACTIVATIONS
MAX = 2.868

and they said their good by es . Big Bear learnt
amazed at the special nap kin ! The Royal
and they said their good by es . The rad ish
playing and said their good by es .
Uh - oh , our shirts are dirty !" Tom says
saw something shiny under a pile of swe aters . They
and looked at the ost ric hes . They saw one
they both said their good by es and went their separate
to be a good helper ," said Benny .
wanted to touch the ost ric hes . They reached out
. Ben touched the w affle maker . He screamed .
Then they said their good by es , and the little
see a man in a blue suit running to them .
sees something shiny under the pile . He reaches for it
They cleaned up the w affle and Lily 's mom made
, I can help you ," Max says . He takes
know , right ? Micro sc opes are amazing . They
mom and to the ost ric hes . They learned their
! What a nice surprise ," said M ummy .
They said their good by es until next time .

MIN ACTIVATIONS
MIN = -3.597

coats . They take their umb rell as .
N ina was panic - str icken . She screamed for
her . " If an earthquake happens , find a safe
coats . They take their umb rell as .
The town spe ople had to find a
she bowed , the town spe ople stopped to admire her
to find the land of Imp ressive Things . After walking
and stay inside during a storm . After the
got lost , the l ibr arians would help him find
their hands and spun their umb rell as . They had
never went outside during a storm again . <|endoftext|> Once upon
And I 'm sorry , ostr ich . Please come out
always wanted to see her ostr ich friend . <|endoftext|> Once
" Hi Miss O str ich ! That sounds funny
said , " Thank you Aunt ie !" <|endoftext|> Once upon
he would take the av oc ados and scatter them around
go to the land of Imp ressive Things .
its home in a big storm . The little pine tree
, a small bird named Bree . Bree saw the big
He smiled as the l ibr arians watched him having fun

INTERVAL 1.252 - 2.868
CONTAINS 0.690%

When he finally reached the end , he marvel ed when
e , bye , tiger ," Tom and Lily say .
and soon everyone saw the bone . They laughed at him
in the world , but also see you , the dancer
have two cups for free ," the man said .

INTERVAL -0.365 - 1.252
CONTAINS 77.396%

to play this piano too . After a few
be something special like a medal . But David had a
. It was a fun and delicious dinner for Lily and
her friends to join her and when they said yes she
if he could crawl there . His aunt said that he

INTERVAL -1.981 - -0.365
CONTAINS 21.902%

tell their mom and dad what happened . They held hands
They say thank you to Mom . They say they love
the other food around , " Who wants to win a
, you are too noisy ! I can 't hear myself
her . Every day the farmer would walk her around the

INTERVAL -3.597 - -1.981
CONTAINS 0.012%

The town spe ople had to find a
than usual . As Ch ir p flew higher , he
are astronauts . " Look , Lily , I
mom my loved her daughter very much . One day ,
the pond !" " OK !" Ben says .
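
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX, and the CONTAINS percentages sum to 100%. The sketch below checks that reading using only the numbers printed in this dump. The equal-width assumption and the names `bin_edges`, `ACT_MIN`, `ACT_MAX`, and `contains_pct` are illustrative and not taken from the tool that produced the dump.

```python
# Values copied from the MAX / MIN lines of this dump.
ACT_MIN = -3.597
ACT_MAX = 2.868
N_BINS = 4  # assumption: four equal-width intervals


def bin_edges(lo, hi, n_bins):
    """Return n_bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins)] + [hi]


edges = bin_edges(ACT_MIN, ACT_MAX, N_BINS)

# Percentages as listed under each CONTAINS line, highest interval first.
contains_pct = [0.690, 77.396, 21.902, 0.012]

# Print the intervals from lowest to highest activation.
for lo, hi in zip(edges[:-1], edges[1:]):
    print(f"INTERVAL {lo:.3f} - {hi:.3f}")
print(f"total = {sum(contains_pct):.3f}%")
```

Under this assumption the computed edges agree with the listed interval boundaries to within rounding of the three-decimal MIN and MAX values, and almost all sampled activations (about 99.3%) fall in the two middle intervals.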
Token and
Token ID290
Feature activation+9.54e-01

Pos logit contributions
icker+0.019
ulum+0.018
by+0.017
?+0.017
 pens+0.016
Neg logit contributions
 Gor-0.017
 gor-0.016
 seventy-0.016
n-0.016
 deliber-0.015
Token they
Token ID484
Feature activation+4.86e-01

Pos logit contributions
by+0.018
icker+0.017
?+0.016
ulum+0.015
 pens+0.014
Neg logit contributions
 gor-0.019
 Gor-0.019
n-0.018
 deliber-0.018
 seventy-0.018
Token said
Token ID531
Feature activation+2.03e-01

Pos logit contributions
icker+0.008
?+0.008
by+0.008
ulum+0.008
 something+0.007
Neg logit contributions
 Gor-0.010
n-0.010
 gor-0.010
 seventy-0.010
 gob-0.010
Token their
Token ID511
Feature activation-6.07e-01

Pos logit contributions
by+0.006
icker+0.006
ulum+0.006
?+0.006
 pens+0.006
Neg logit contributions
n-0.001
 seventy-0.001
 deliber-0.001
 delighted-0.001
 gor-0.001
Token good
Token ID922
Feature activation+4.47e-01

Pos logit contributions
 gor+0.012
 Gor+0.012
n+0.011
 deliber+0.011
 seventy+0.011
Neg logit contributions
icker-0.011
by-0.011
?-0.010
ulum-0.010
 something-0.009
Tokenby
Token ID1525
Feature activation+2.87e+00

Pos logit contributions
icker+0.006
ulum+0.006
important+0.005
ifying+0.005
ilts+0.005
Neg logit contributions
 deliber-0.010
 surprise-0.010
 ostr-0.010
rd-0.010
n-0.010
Tokenes
Token ID274
Feature activation+9.72e-01

Pos logit contributions
icker+0.061
ulum+0.061
?+0.056
by+0.054
important+0.051
Neg logit contributions
 gor-0.050
 Gor-0.050
n-0.049
oke-0.046
 seventy-0.045
Token.
Token ID13
Feature activation+4.34e-01

Pos logit contributions
icker+0.014
ulum+0.013
by+0.013
?+0.012
bie+0.011
Neg logit contributions
n-0.023
 gor-0.023
 Gor-0.022
 deliber-0.022
 gob-0.022
Token Big
Token ID4403
Feature activation-1.25e+00

Pos logit contributions
by+0.008
icker+0.008
?+0.007
ulum+0.007
 pens+0.007
Neg logit contributions
 Gor-0.009
 gor-0.008
 Once-0.008
 seventy-0.008
 deliber-0.008
Token Bear
Token ID14732
Feature activation+8.65e-01

Pos logit contributions
 Gor+0.019
 seventy+0.019
 spe+0.018
n+0.017
 gor+0.017
Neg logit contributions
icker-0.030
by-0.027
ulum-0.026
 something-0.026
?-0.025
Token learnt
Token ID26338
Feature activation+6.20e-01

Pos logit contributions
icker+0.019
?+0.018
ulum+0.017
by+0.017
 something+0.016
Neg logit contributions
n-0.015
 Gor-0.014
 seventy-0.014
 gob-0.014
 gor-0.014
Token amazed
Token ID24562
Feature activation+1.36e-02

Pos logit contributions
 gor+0.012
 Gor+0.012
riger+0.012
 deliber+0.012
n+0.011
Neg logit contributions
by-0.009
icker-0.009
?-0.008
 something-0.008
 each-0.008
Token at
Token ID379
Feature activation+2.20e-01

Pos logit contributions
icker+0.000
by+0.000
ulum+0.000
?+0.000
 pens+0.000
Neg logit contributions
 gor-0.000
 Gor-0.000
 deliber-0.000
n-0.000
oke-0.000
Token the
Token ID262
Feature activation-2.83e-01

Pos logit contributions
icker+0.005
by+0.004
ulum+0.004
ilts+0.004
?+0.004
Neg logit contributions
 gor-0.004
n-0.004
 Gor-0.004
 seventy-0.004
 ostr-0.003
Token special
Token ID2041
Feature activation-1.31e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
n+0.005
 deliber+0.005
 seventy+0.005
Neg logit contributions
icker-0.006
by-0.006
?-0.006
ulum-0.005
 each-0.005
Token nap
Token ID25422
Feature activation+1.42e+00

Pos logit contributions
 gor+0.002
 Gor+0.002
n+0.002
 deliber+0.002
 seventy+0.002
Neg logit contributions
by-0.003
icker-0.003
?-0.002
ulum-0.002
 something-0.002
Tokenkin
Token ID5116
Feature activation+2.59e+00

Pos logit contributions
by+0.029
icker+0.029
ulum+0.029
 Raz+0.027
important+0.026
Neg logit contributions
n-0.023
rd-0.022
 gor-0.021
oke-0.021
 delighted-0.021
Token!
Token ID0
Feature activation+5.10e-01

Pos logit contributions
icker+0.047
by+0.045
ulum+0.042
?+0.038
bie+0.037
Neg logit contributions
n-0.052
 Gor-0.051
 gor-0.050
 gob-0.050
 delighted-0.050
Token
Token ID198
Feature activation+1.77e-01

Pos logit contributions
by+0.012
icker+0.012
?+0.011
ulum+0.011
 pens+0.010
Neg logit contributions
 Gor-0.008
 gor-0.008
oke-0.008
n-0.007
 deliber-0.007
Token
Token ID198
Feature activation-7.32e-02

Pos logit contributions
by+0.003
icker+0.003
?+0.003
ulum+0.003
 something+0.002
Neg logit contributions
 gor-0.004
 Gor-0.004
 seventy-0.004
 deliber-0.003
riger-0.003
TokenThe
Token ID464
Feature activation-5.44e-01

Pos logit contributions
 gor+0.002
 Gor+0.002
n+0.002
 deliber+0.001
riger+0.001
Neg logit contributions
by-0.001
icker-0.001
?-0.001
ulum-0.001
bie-0.001
Token Royal
Token ID8111
Feature activation-8.30e-01

Pos logit contributions
 gor+0.008
n+0.008
 seventy+0.008
 Gor+0.008
 deliber+0.008
Neg logit contributions
by-0.012
icker-0.012
?-0.012
 pens-0.011
 something-0.010
Token and
Token ID290
Feature activation+7.27e-01

Pos logit contributions
 Gor+0.001
 gor+0.001
 seventy+0.001
n+0.000
 ostr+0.000
Neg logit contributions
icker-0.001
ulum-0.001
by-0.001
?-0.001
 gears-0.001
Token they
Token ID484
Feature activation+3.05e-01

Pos logit contributions
icker+0.015
by+0.015
ulum+0.014
?+0.014
own+0.013
Neg logit contributions
 Gor-0.013
 gor-0.013
 seventy-0.012
 deliber-0.012
n-0.012
Token said
Token ID531
Feature activation-1.75e-01

Pos logit contributions
icker+0.006
ulum+0.006
?+0.005
by+0.005
 pens+0.005
Neg logit contributions
 Gor-0.006
 gor-0.006
 gob-0.006
n-0.006
 seventy-0.006
Token their
Token ID511
Feature activation-5.21e-01

Pos logit contributions
 gor+0.001
 seventy+0.001
 Gor+0.001
n+0.001
 deliber+0.001
Neg logit contributions
icker-0.005
by-0.005
ulum-0.005
?-0.005
 pens-0.005
Token good
Token ID922
Feature activation+3.92e-01

Pos logit contributions
 gor+0.010
 deliber+0.010
 Gor+0.010
n+0.010
 seventy+0.010
Neg logit contributions
by-0.010
icker-0.009
?-0.009
 each-0.008
 something-0.008
Tokenby
Token ID1525
Feature activation+2.57e+00

Pos logit contributions
ulum+0.007
icker+0.007
ilts+0.005
Jason+0.005
'?+0.005
Neg logit contributions
 ostr-0.009
 gor-0.009
 Gor-0.009
 surprise-0.008
 pand-0.008
Tokenes
Token ID274
Feature activation+4.17e-01

Pos logit contributions
icker+0.062
ulum+0.062
?+0.054
pers+0.053
important+0.052
Neg logit contributions
 Gor-0.045
 gor-0.045
n-0.040
 seventy-0.040
 ostr-0.037
Token.
Token ID13
Feature activation+3.30e-02

Pos logit contributions
icker+0.008
ulum+0.008
by+0.007
important+0.007
 gears+0.007
Neg logit contributions
n-0.008
 gor-0.008
 gob-0.008
 Gor-0.008
 seventy-0.007
Token The
Token ID383
Feature activation-6.64e-01

Pos logit contributions
by+0.001
icker+0.001
?+0.001
 pens+0.001
ulum+0.001
Neg logit contributions
 Gor-0.001
 seventy-0.001
 gor-0.001
 Once-0.001
 Ph-0.001
Token rad
Token ID2511
Feature activation+1.69e-02

Pos logit contributions
 gor+0.012
 Gor+0.011
n+0.011
 deliber+0.011
 seventy+0.011
Neg logit contributions
by-0.014
icker-0.014
?-0.013
ulum-0.012
 pens-0.012
Tokenish
Token ID680
Feature activation+3.53e-01

Pos logit contributions
icker+0.000
ulum+0.000
?+0.000
by+0.000
own+0.000
Neg logit contributions
 Gor-0.000
 gor-0.000
 seventy-0.000
 ostr-0.000
n-0.000
Token playing
Token ID2712
Feature activation+1.15e+00

Pos logit contributions
 gor+0.005
 deliber+0.005
n+0.005
 following+0.005
 Gor+0.005
Neg logit contributions
icker-0.007
by-0.007
ulum-0.006
 pens-0.006
?-0.006
Token and
Token ID290
Feature activation+6.54e-01

Pos logit contributions
by+0.022
icker+0.021
bie+0.021
oppy+0.019
important+0.018
Neg logit contributions
 gor-0.022
 delighted-0.022
n-0.021
 deliber-0.020
 gob-0.020
Token said
Token ID531
Feature activation+2.83e-01

Pos logit contributions
icker+0.015
by+0.013
?+0.013
ulum+0.012
own+0.012
Neg logit contributions
 gor-0.012
 Gor-0.011
 gob-0.011
 ostr-0.011
n-0.011
Token their
Token ID511
Feature activation-6.27e-01

Pos logit contributions
icker+0.009
ulum+0.008
by+0.008
?+0.008
 pens+0.008
Neg logit contributions
 Gor-0.002
 gor-0.002
 seventy-0.002
 delighted-0.002
n-0.002
Token good
Token ID922
Feature activation+3.23e-01

Pos logit contributions
n+0.010
 gor+0.009
 ostr+0.009
 Gor+0.009
 delighted+0.009
Neg logit contributions
icker-0.014
by-0.013
ulum-0.013
?-0.013
important-0.012
Tokenby
Token ID1525
Feature activation+2.52e+00

Pos logit contributions
ulum+0.005
important+0.004
icker+0.004
pers+0.004
issors+0.004
Neg logit contributions
 ostr-0.007
 surprise-0.007
 invitation-0.007
 delight-0.007
 offer-0.007
Tokenes
Token ID274
Feature activation+7.96e-01

Pos logit contributions
icker+0.063
ulum+0.063
pers+0.054
important+0.054
?+0.052
Neg logit contributions
 Gor-0.041
 gor-0.041
n-0.038
oke-0.037
rd-0.035
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
icker+0.016
ulum+0.015
by+0.014
oppy+0.013
bie+0.013
Neg logit contributions
n-0.016
 gob-0.016
 gor-0.016
 Gor-0.015
 delighted-0.015
Token
Token ID220
Feature activation+2.67e-01

Pos logit contributions
 Ph+0.003
 Gor+0.003
 deliber+0.003
n+0.003
 Lib+0.003
Neg logit contributions
by-0.004
icker-0.004
ina-0.004
?-0.004
 pens-0.004
Token
Token ID198
Feature activation+6.93e-03

Pos logit contributions
by+0.007
icker+0.006
 pens+0.006
?+0.006
ulum+0.006
Neg logit contributions
n-0.004
oke-0.004
 gor-0.003
 Gor-0.003
riger-0.003
Token
Token ID198
Feature activation+3.80e-02

Pos logit contributions
by+0.000
icker+0.000
?+0.000
ulum+0.000
 pens+0.000
Neg logit contributions
 gor-0.000
n-0.000
 Gor-0.000
 deliber-0.000
 seventy-0.000
TokenUh
Token ID34653
Feature activation-1.95e-01

Pos logit contributions
 gor+0.028
 Gor+0.027
n+0.027
 deliber+0.026
riger+0.026
Neg logit contributions
by-0.023
icker-0.022
?-0.021
ulum-0.020
 each-0.019
Token-
Token ID12
Feature activation-8.20e-01

Pos logit contributions
n+0.004
 gor+0.003
 Gor+0.003
 deliber+0.003
riger+0.003
Neg logit contributions
icker-0.004
by-0.004
ulum-0.004
 pens-0.003
 Toby-0.003
Tokenoh
Token ID1219
Feature activation-2.10e-01

Pos logit contributions
n+0.009
riger+0.007
go+0.007
oke+0.006
 seventy+0.006
Neg logit contributions
icker-0.024
 journals-0.022
ulum-0.022
?-0.022
ilts-0.022
Token,
Token ID11
Feature activation-3.79e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
 deliber+0.005
fters+0.005
 seventy+0.005
Neg logit contributions
by-0.003
?-0.003
icker-0.003
 something-0.003
 each-0.003
Token our
Token ID674
Feature activation-1.18e-01

Pos logit contributions
 deliber+0.010
 Gor+0.010
 gor+0.009
 seventy+0.009
 spe+0.009
Neg logit contributions
 something-0.005
?-0.005
 each-0.004
by-0.004
 their-0.004
Token shirts
Token ID20858
Feature activation+2.52e+00

Pos logit contributions
 picnic+0.002
n+0.002
 ostr+0.002
riger+0.002
 amaz+0.002
Neg logit contributions
icker-0.003
important-0.002
by-0.002
isdom-0.002
 Oliver-0.002
Token are
Token ID389
Feature activation+1.27e+00

Pos logit contributions
icker+0.066
by+0.063
ulum+0.060
 Toby+0.059
Simon+0.058
Neg logit contributions
n-0.030
 gob-0.028
 seventy-0.027
oke-0.027
 gor-0.027
Token dirty
Token ID11841
Feature activation+6.59e-01

Pos logit contributions
icker+0.028
by+0.028
ulum+0.024
ina+0.023
Simon+0.023
Neg logit contributions
 gor-0.021
n-0.021
 spe-0.021
 seventy-0.021
 Gor-0.020
Token!"
Token ID2474
Feature activation+1.18e+00

Pos logit contributions
icker+0.013
by+0.013
ulum+0.011
?+0.011
 pens+0.010
Neg logit contributions
 gor-0.012
 Gor-0.012
 seventy-0.012
n-0.012
 deliber-0.012
Token Tom
Token ID4186
Feature activation+6.86e-01

Pos logit contributions
by+0.026
icker+0.025
?+0.022
ulum+0.022
important+0.022
Neg logit contributions
 gor-0.020
 deliber-0.019
 Gor-0.019
n-0.019
 seventy-0.018
Token says
Token ID1139
Feature activation+2.80e-01

Pos logit contributions
by+0.016
icker+0.016
?+0.015
ulum+0.014
 pens+0.014
Neg logit contributions
 gor-0.010
 Gor-0.010
n-0.010
 deliber-0.009
 seventy-0.009
Token saw
Token ID2497
Feature activation+7.19e-02

Pos logit contributions
 Gor+0.001
 gor+0.001
 seventy+0.001
n+0.001
 ostr+0.001
Neg logit contributions
icker-0.002
by-0.001
ulum-0.001
?-0.001
own-0.001
Token something
Token ID1223
Feature activation-2.27e-01

Pos logit contributions
by+0.001
icker+0.001
ilts+0.001
?+0.001
ulum+0.001
Neg logit contributions
 gor-0.001
 ostr-0.001
 Ph-0.001
 delighted-0.001
 Gor-0.001
Token shiny
Token ID22441
Feature activation+1.87e-01

Pos logit contributions
 gor+0.004
n+0.004
 Gor+0.004
 deliber+0.004
 ostr+0.004
Neg logit contributions
by-0.005
icker-0.005
ulum-0.004
?-0.004
bie-0.004
Token under
Token ID739
Feature activation+9.42e-01

Pos logit contributions
icker+0.005
by+0.005
bie+0.004
oppy+0.004
ilts+0.004
Neg logit contributions
 gor-0.003
 ostr-0.002
 spe-0.002
 Gor-0.002
 seventy-0.002
Token a
Token ID257
Feature activation+1.19e+00

Pos logit contributions
by+0.023
icker+0.021
bie+0.019
important+0.018
attered+0.018
Neg logit contributions
n-0.015
 gor-0.015
 spe-0.015
 Ph-0.014
 ostr-0.013
Token pile
Token ID14540
Feature activation+2.51e+00

Pos logit contributions
icker+0.030
by+0.029
ulum+0.026
bie+0.026
important+0.025
Neg logit contributions
 gor-0.016
 Gor-0.016
 spe-0.015
 ostr-0.015
riger-0.015
Token of
Token ID286
Feature activation+4.92e-01

Pos logit contributions
by+0.074
icker+0.070
ulum+0.067
?+0.064
bie+0.064
Neg logit contributions
n-0.022
 gor-0.021
 Gor-0.020
 deliber-0.019
 ostr-0.018
Token swe
Token ID3490
Feature activation+7.49e-01

Pos logit contributions
icker+0.011
by+0.010
?+0.009
ulum+0.009
own+0.008
Neg logit contributions
 gor-0.009
 Gor-0.009
 ostr-0.009
 spe-0.008
 seventy-0.008
Tokenaters
Token ID8605
Feature activation+1.80e+00

Pos logit contributions
by+0.010
icker+0.010
?+0.009
ulum+0.008
 something+0.008
Neg logit contributions
 gor-0.018
 Gor-0.018
n-0.017
 seventy-0.017
 deliber-0.017
Token.
Token ID13
Feature activation+5.37e-01

Pos logit contributions
by+0.028
icker+0.026
ulum+0.022
bie+0.021
?+0.021
Neg logit contributions
n-0.042
 gor-0.041
 Gor-0.040
 seventy-0.040
 ostr-0.039
Token They
Token ID1119
Feature activation-9.99e-02

Pos logit contributions
by+0.011
icker+0.010
?+0.009
important+0.009
ina+0.009
Neg logit contributions
 Ph-0.011
 Gor-0.010
 deliber-0.010
 delighted-0.010
 gor-0.010
Token and
Token ID290
Feature activation+6.28e-01

Pos logit contributions
 gor+0.009
 Gor+0.009
n+0.009
 deliber+0.009
 seventy+0.009
Neg logit contributions
by-0.008
icker-0.008
?-0.008
ulum-0.007
 pens-0.007
Token looked
Token ID3114
Feature activation+5.69e-01

Pos logit contributions
icker+0.014
by+0.013
?+0.012
utes+0.012
ilts+0.012
Neg logit contributions
 Gor-0.011
 gor-0.010
 gob-0.010
 ostr-0.010
n-0.010
Token at
Token ID379
Feature activation-6.92e-02

Pos logit contributions
icker+0.014
by+0.014
ulum+0.013
own+0.013
 pens+0.012
Neg logit contributions
 gor-0.008
 Gor-0.007
 delighted-0.007
 ostr-0.007
 deliber-0.007
Token the
Token ID262
Feature activation+2.79e-01

Pos logit contributions
 gor+0.001
 ostr+0.001
 Ph+0.001
 Gor+0.001
 spe+0.001
Neg logit contributions
by-0.001
icker-0.001
ilts-0.001
ulum-0.001
?-0.001
Token ost
Token ID23619
Feature activation-2.08e+00

Pos logit contributions
by+0.006
icker+0.006
?+0.006
 something+0.006
 each+0.005
Neg logit contributions
 deliber-0.004
 seventy-0.004
riger-0.004
 Gor-0.004
n-0.004
Tokenric
Token ID1173
Feature activation+2.51e+00

Pos logit contributions
 gor+0.057
n+0.055
 Gor+0.055
 ostr+0.053
oke+0.053
Neg logit contributions
by-0.023
icker-0.022
?-0.020
ulum-0.018
 something-0.018
Tokenhes
Token ID956
Feature activation+4.42e-01

Pos logit contributions
by+0.040
icker+0.035
ulum+0.034
?+0.032
 Toby+0.031
Neg logit contributions
n-0.058
 gor-0.058
 Gor-0.055
oke-0.055
 seventy-0.053
Token.
Token ID13
Feature activation+7.20e-01

Pos logit contributions
by+0.008
ulum+0.008
icker+0.008
bie+0.007
oppy+0.007
Neg logit contributions
n-0.009
 gob-0.009
 gor-0.008
oke-0.008
 spe-0.008
Token They
Token ID1119
Feature activation-2.98e-01

Pos logit contributions
by+0.012
?+0.010
icker+0.010
ina+0.009
eds+0.009
Neg logit contributions
 Gor-0.017
 gor-0.017
 Ph-0.017
 deliber-0.016
 spe-0.016
Token saw
Token ID2497
Feature activation-2.03e-01

Pos logit contributions
 gor+0.004
 Gor+0.004
 ostr+0.004
n+0.004
 gob+0.004
Neg logit contributions
icker-0.007
by-0.007
ulum-0.007
?-0.006
own-0.006
Token one
Token ID530
Feature activation+8.30e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
 ostr+0.005
n+0.005
 seventy+0.005
Neg logit contributions
by-0.003
icker-0.003
?-0.003
ulum-0.002
own-0.002
Token they
Token ID484
Feature activation+2.50e-01

Pos logit contributions
by+0.005
icker+0.005
?+0.004
ulum+0.004
 pens+0.004
Neg logit contributions
 gor-0.005
 Gor-0.005
n-0.005
 deliber-0.005
 seventy-0.005
Token both
Token ID1111
Feature activation+1.68e-02

Pos logit contributions
icker+0.004
ulum+0.004
by+0.004
?+0.004
 pens+0.003
Neg logit contributions
n-0.005
 gor-0.005
 Gor-0.005
 seventy-0.005
 gob-0.005
Token said
Token ID531
Feature activation-1.86e-02

Pos logit contributions
ulum+0.000
icker+0.000
 pens+0.000
?+0.000
important+0.000
Neg logit contributions
 gob-0.000
 ostr-0.000
 gor-0.000
 Gor-0.000
n-0.000
Token their
Token ID511
Feature activation-3.00e-01

Pos logit contributions
 seventy+0.000
 gor+0.000
n+0.000
 Gor+0.000
 deliber+0.000
Neg logit contributions
icker-0.001
by-0.001
ulum-0.000
?-0.000
 pens-0.000
Token good
Token ID922
Feature activation+4.43e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
n+0.005
 deliber+0.005
 seventy+0.005
Neg logit contributions
icker-0.006
by-0.006
?-0.006
ulum-0.005
ina-0.005
Tokenby
Token ID1525
Feature activation+2.48e+00

Pos logit contributions
ulum+0.007
icker+0.006
nam+0.006
important+0.006
ifying+0.005
Neg logit contributions
 ostr-0.010
 gor-0.010
 deliber-0.010
 Gor-0.010
n-0.009
Tokenes
Token ID274
Feature activation+4.50e-01

Pos logit contributions
ulum+0.064
icker+0.064
pers+0.057
?+0.056
important+0.056
Neg logit contributions
 Gor-0.038
 gor-0.037
n-0.036
 seventy-0.033
oke-0.032
Token and
Token ID290
Feature activation+6.48e-01

Pos logit contributions
icker+0.010
ulum+0.010
by+0.009
important+0.009
 gears+0.009
Neg logit contributions
n-0.007
 gor-0.007
 gob-0.007
 Gor-0.007
 delighted-0.007
Token went
Token ID1816
Feature activation-1.08e-01

Pos logit contributions
icker+0.013
ulum+0.011
?+0.011
by+0.011
own+0.010
Neg logit contributions
 Gor-0.013
 gor-0.013
 ostr-0.012
n-0.012
 gob-0.012
Token their
Token ID511
Feature activation-3.51e-01

Pos logit contributions
n+0.002
 gor+0.002
 deliber+0.001
riger+0.001
 seventy+0.001
Neg logit contributions
icker-0.002
ulum-0.002
by-0.002
important-0.002
?-0.002
Token separate
Token ID4553
Feature activation+3.86e-01

Pos logit contributions
 gor+0.007
 Gor+0.007
 seventy+0.007
 deliber+0.007
n+0.007
Neg logit contributions
by-0.007
icker-0.006
?-0.006
 each-0.006
 pens-0.006
Token to
Token ID284
Feature activation+4.17e-01

Pos logit contributions
icker+0.000
by+0.000
ulum+0.000
?+0.000
 pens+0.000
Neg logit contributions
 Gor-0.000
n-0.000
 seventy-0.000
 gor-0.000
 deliber-0.000
Token be
Token ID307
Feature activation+2.84e-01

Pos logit contributions
by+0.009
icker+0.009
?+0.008
ulum+0.008
 pens+0.007
Neg logit contributions
 gor-0.007
 Gor-0.007
n-0.007
 deliber-0.007
 seventy-0.007
Token a
Token ID257
Feature activation-2.43e-01

Pos logit contributions
icker+0.007
by+0.006
ulum+0.006
ina+0.006
 Brother+0.005
Neg logit contributions
n-0.004
 Gor-0.004
 gor-0.004
 seventy-0.004
 deliber-0.004
Token good
Token ID922
Feature activation+5.76e-01

Pos logit contributions
 Gor+0.004
 gor+0.004
 seventy+0.004
 ostr+0.004
 spe+0.004
Neg logit contributions
icker-0.005
ulum-0.005
by-0.005
?-0.005
 pens-0.004
Token helper
Token ID31904
Feature activation-9.90e-01

Pos logit contributions
icker+0.010
by+0.010
ulum+0.009
?+0.009
 pens+0.008
Neg logit contributions
 gor-0.012
 Gor-0.012
 seventy-0.011
n-0.011
 deliber-0.011
Token,"
Token ID553
Feature activation+2.42e+00

Pos logit contributions
 gor+0.024
 Gor+0.024
 deliber+0.024
riger+0.024
n+0.024
Neg logit contributions
by-0.013
?-0.012
icker-0.012
 each-0.010
ulum-0.010
Token said
Token ID531
Feature activation+3.14e-01

Pos logit contributions
icker+0.063
by+0.060
ulum+0.058
ilts+0.055
?+0.054
Neg logit contributions
n-0.031
 deliber-0.031
 gob-0.030
 Gor-0.028
 seventy-0.027
Token Benny
Token ID44275
Feature activation+1.00e+00

Pos logit contributions
icker+0.005
by+0.005
ulum+0.005
?+0.005
bie+0.004
Neg logit contributions
 deliber-0.006
 Gor-0.006
n-0.006
 seventy-0.006
 gor-0.006
Token.
Token ID13
Feature activation+2.97e-01

Pos logit contributions
icker+0.010
by+0.009
?+0.008
ulum+0.008
 pens+0.006
Neg logit contributions
 Gor-0.028
 gor-0.028
n-0.028
 seventy-0.028
 deliber-0.027
Token
Token ID198
Feature activation+2.85e-01

Pos logit contributions
by+0.005
icker+0.005
?+0.005
ulum+0.004
 Toby+0.004
Neg logit contributions
 gor-0.006
 Gor-0.006
n-0.006
riger-0.006
 deliber-0.006
Token
Token ID198
Feature activation+4.95e-01

Pos logit contributions
by+0.005
icker+0.005
?+0.005
ulum+0.005
 pens+0.004
Neg logit contributions
 gor-0.006
 Gor-0.005
n-0.005
 deliber-0.005
 seventy-0.005
Token wanted
Token ID2227
Feature activation+5.58e-01

Pos logit contributions
 gob+0.004
n+0.004
 ostr+0.004
 Gor+0.004
 gor+0.004
Neg logit contributions
icker-0.007
by-0.007
ulum-0.007
?-0.007
 chess-0.006
Token to
Token ID284
Feature activation-3.92e-01

Pos logit contributions
icker+0.012
ulum+0.012
ilts+0.011
ina+0.011
by+0.011
Neg logit contributions
 gor-0.010
 ostr-0.009
 Gor-0.009
n-0.009
 deliber-0.009
Token touch
Token ID3638
Feature activation+4.36e-01

Pos logit contributions
 gor+0.008
 Gor+0.008
 deliber+0.007
 seventy+0.007
riger+0.007
Neg logit contributions
by-0.007
icker-0.007
?-0.007
ulum-0.006
 each-0.006
Token the
Token ID262
Feature activation+8.74e-02

Pos logit contributions
icker+0.010
by+0.010
ulum+0.009
oppy+0.009
ilts+0.008
Neg logit contributions
 gor-0.008
 ostr-0.008
 Gor-0.007
 ost-0.007
 spe-0.007
Token ost
Token ID23619
Feature activation-2.09e+00

Pos logit contributions
by+0.002
icker+0.002
?+0.002
 each+0.002
 something+0.002
Neg logit contributions
 deliber-0.002
 seventy-0.002
riger-0.002
 Gor-0.002
n-0.001
Tokenric
Token ID1173
Feature activation+2.42e+00

Pos logit contributions
 gor+0.061
 Gor+0.061
 deliber+0.060
n+0.060
 seventy+0.060
Neg logit contributions
by-0.019
icker-0.018
?-0.015
ulum-0.013
 pens-0.011
Tokenhes
Token ID956
Feature activation+3.27e-01

Pos logit contributions
by+0.035
icker+0.032
ulum+0.029
?+0.029
 Toby+0.026
Neg logit contributions
n-0.058
 gor-0.058
 Gor-0.057
oke-0.055
 deliber-0.055
Token.
Token ID13
Feature activation-1.85e-01

Pos logit contributions
by+0.007
icker+0.007
ulum+0.006
bie+0.006
oppy+0.006
Neg logit contributions
n-0.006
 gor-0.006
 ostr-0.005
 gob-0.005
 Gor-0.005
Token They
Token ID1119
Feature activation-3.60e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
n+0.005
 deliber+0.005
riger+0.005
Neg logit contributions
by-0.002
icker-0.002
?-0.002
ulum-0.002
 pens-0.002
Token reached
Token ID4251
Feature activation+4.67e-01

Pos logit contributions
n+0.006
 gor+0.006
 Gor+0.006
 gob+0.006
 ostr+0.006
Neg logit contributions
icker-0.008
by-0.008
?-0.007
ulum-0.007
 chess-0.007
Token out
Token ID503
Feature activation+6.38e-02

Pos logit contributions
by+0.009
icker+0.009
ulum+0.009
important+0.009
own+0.009
Neg logit contributions
 gor-0.008
 seventy-0.008
 ostr-0.007
 Gor-0.007
 reaching-0.007
Token.
Token ID13
Feature activation+3.39e-01

Pos logit contributions
 gor+0.010
n+0.010
 Gor+0.010
 deliber+0.010
 seventy+0.010
Neg logit contributions
icker-0.004
by-0.004
ulum-0.003
?-0.003
 pens-0.003
Token Ben
Token ID3932
Feature activation+6.83e-01

Pos logit contributions
by+0.009
icker+0.007
important+0.007
ina+0.007
bie+0.007
Neg logit contributions
 seventy-0.006
 Gor-0.005
 deliber-0.005
 Ph-0.005
oke-0.005
Token touched
Token ID12615
Feature activation+4.94e-01

Pos logit contributions
icker+0.013
by+0.013
?+0.011
ulum+0.011
 pens+0.010
Neg logit contributions
 gor-0.013
 Gor-0.013
n-0.013
 seventy-0.013
 deliber-0.013
Token the
Token ID262
Feature activation-7.43e-02

Pos logit contributions
icker+0.014
by+0.012
ilts+0.011
ulum+0.011
utes+0.011
Neg logit contributions
n-0.006
 Ph-0.006
 seventy-0.006
 gor-0.006
 Sof-0.005
Token w
Token ID266
Feature activation+3.45e-01

Pos logit contributions
 gor+0.001
 Gor+0.001
 seventy+0.001
n+0.001
 deliber+0.001
Neg logit contributions
icker-0.002
by-0.002
ulum-0.001
?-0.001
ina-0.001
Tokenaffle
Token ID30697
Feature activation+2.41e+00

Pos logit contributions
by+0.008
icker+0.008
ulum+0.008
 each+0.008
important+0.008
Neg logit contributions
oke-0.005
 gor-0.004
n-0.004
fters-0.004
 deliber-0.004
Token maker
Token ID16009
Feature activation+2.61e-01

Pos logit contributions
by+0.051
icker+0.047
bie+0.044
ulum+0.041
 Tina+0.040
Neg logit contributions
 gob-0.046
n-0.045
 Gor-0.043
 gor-0.042
 seventy-0.042
Token.
Token ID13
Feature activation+2.96e-01

Pos logit contributions
by+0.004
icker+0.004
bie+0.003
ulum+0.003
?+0.003
Neg logit contributions
n-0.006
 seventy-0.006
 gor-0.006
 gob-0.006
 Gor-0.006
Token He
Token ID679
Feature activation+2.04e-02

Pos logit contributions
by+0.008
ina+0.006
icker+0.006
bie+0.006
eds+0.006
Neg logit contributions
 Gor-0.005
 deliber-0.005
 gor-0.005
 seventy-0.005
 Ph-0.005
Token screamed
Token ID25421
Feature activation+3.32e-01

Pos logit contributions
icker+0.001
by+0.000
ilts+0.000
utes+0.000
ulum+0.000
Neg logit contributions
 gob-0.000
 spe-0.000
 gor-0.000
n-0.000
 Gor-0.000
Token.
Token ID13
Feature activation-7.61e-02

Pos logit contributions
by+0.008
icker+0.008
bie+0.007
ulum+0.007
important+0.007
Neg logit contributions
 seventy-0.004
 Gor-0.004
 following-0.004
 reaching-0.004
 gob-0.004
Token Then
Token ID3244
Feature activation+1.58e-01

Pos logit contributions
 Gor+0.002
n+0.002
 deliber+0.001
 seventy+0.001
 Once+0.001
Neg logit contributions
by-0.002
icker-0.002
 pens-0.002
?-0.002
ulum-0.002
Token they
Token ID484
Feature activation+1.50e-01

Pos logit contributions
icker+0.004
by+0.003
ulum+0.003
?+0.003
 pens+0.003
Neg logit contributions
n-0.002
 Gor-0.002
 gor-0.002
 deliber-0.002
 Ph-0.002
Token said
Token ID531
Feature activation-3.19e-02

Pos logit contributions
icker+0.003
by+0.003
ulum+0.003
?+0.003
 pens+0.002
Neg logit contributions
 seventy-0.003
 Gor-0.003
 gor-0.003
n-0.003
 deliber-0.003
Token their
Token ID511
Feature activation-3.56e-01

Pos logit contributions
 gor+0.000
 Gor+0.000
n+0.000
 seventy+0.000
 deliber+0.000
Neg logit contributions
icker-0.001
by-0.001
?-0.001
ulum-0.001
 pens-0.001
Token good
Token ID922
Feature activation+4.81e-01

Pos logit contributions
 gor+0.007
 Gor+0.007
n+0.007
 deliber+0.007
 seventy+0.007
Neg logit contributions
by-0.007
icker-0.007
?-0.006
ulum-0.006
 pens-0.005
Tokenby
Token ID1525
Feature activation+2.41e+00

Pos logit contributions
icker+0.009
ulum+0.009
important+0.008
?+0.008
'?+0.008
Neg logit contributions
 ostr-0.009
 deliber-0.009
n-0.008
 delight-0.008
 surprise-0.008
Tokenes
Token ID274
Feature activation+5.76e-01

Pos logit contributions
icker+0.058
ulum+0.055
?+0.050
important+0.050
by+0.049
Neg logit contributions
 Gor-0.039
n-0.038
 gor-0.038
 seventy-0.037
oke-0.034
Token,
Token ID11
Feature activation-2.76e-01

Pos logit contributions
icker+0.013
by+0.012
ulum+0.011
?+0.011
important+0.011
Neg logit contributions
n-0.010
 gor-0.009
 Gor-0.009
 seventy-0.009
 gob-0.009
Token and
Token ID290
Feature activation+9.57e-01

Pos logit contributions
 gor+0.006
 Gor+0.006
n+0.006
 seventy+0.006
 deliber+0.006
Neg logit contributions
by-0.005
icker-0.005
?-0.004
ulum-0.004
 pens-0.004
Token the
Token ID262
Feature activation-3.23e-01

Pos logit contributions
by+0.015
icker+0.015
?+0.014
ulum+0.013
 pens+0.012
Neg logit contributions
 gor-0.021
 Gor-0.021
n-0.020
 deliber-0.020
 seventy-0.020
Token little
Token ID1310
Feature activation-3.79e-01

Pos logit contributions
 gor+0.006
 Gor+0.006
n+0.006
 deliber+0.006
 seventy+0.006
Neg logit contributions
by-0.006
icker-0.006
?-0.006
ulum-0.006
 pens-0.005
Token see
Token ID766
Feature activation-6.03e-01

Pos logit contributions
by+0.002
icker+0.002
?+0.002
ulum+0.002
 pens+0.002
Neg logit contributions
 gor-0.002
 Gor-0.002
n-0.001
 deliber-0.001
 seventy-0.001
Token a
Token ID257
Feature activation-7.88e-02

Pos logit contributions
 gor+0.014
 ostr+0.014
 Gor+0.013
n+0.013
 Ph+0.013
Neg logit contributions
icker-0.010
by-0.010
ulum-0.008
?-0.008
bie-0.007
Token man
Token ID582
Feature activation-1.85e-01

Pos logit contributions
 gor+0.001
 spe+0.001
 ostr+0.001
 Gor+0.001
oke+0.001
Neg logit contributions
icker-0.002
by-0.002
ulum-0.002
important-0.002
?-0.002
Token in
Token ID287
Feature activation-1.90e-01

Pos logit contributions
 Gor+0.004
 deliber+0.004
 gor+0.004
riger+0.004
 seventy+0.004
Neg logit contributions
?-0.003
by-0.003
icker-0.003
 something-0.002
ulum-0.002
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
 gor+0.005
 Gor+0.004
 spe+0.004
 seventy+0.004
 deliber+0.004
Neg logit contributions
by-0.003
icker-0.003
ulum-0.002
?-0.002
ina-0.002
Token blue
Token ID4171
Feature activation+2.40e+00

Pos logit contributions
icker+0.020
by+0.020
?+0.018
ulum+0.018
ina+0.017
Neg logit contributions
 gor-0.018
 Gor-0.018
riger-0.017
 seventy-0.017
 ostr-0.016
Token suit
Token ID6050
Feature activation+4.34e-01

Pos logit contributions
icker+0.051
by+0.051
important+0.045
 Toby+0.043
bie+0.043
Neg logit contributions
 ostr-0.039
 gor-0.039
 Gor-0.038
 seventy-0.038
n-0.035
Token running
Token ID2491
Feature activation+9.84e-01

Pos logit contributions
by+0.005
icker+0.005
?+0.003
ulum+0.003
bie+0.003
Neg logit contributions
 gor-0.012
n-0.012
 Gor-0.012
 seventy-0.012
 deliber-0.012
Token to
Token ID284
Feature activation+8.75e-01

Pos logit contributions
icker+0.021
by+0.020
?+0.019
ulum+0.018
bie+0.018
Neg logit contributions
 gor-0.017
 Gor-0.017
 ostr-0.016
n-0.016
go-0.015
Token them
Token ID606
Feature activation-2.87e-01

Pos logit contributions
icker+0.015
by+0.015
?+0.013
ulum+0.013
 pens+0.012
Neg logit contributions
 gor-0.019
 Gor-0.019
n-0.019
 deliber-0.018
 seventy-0.018
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
n+0.007
 gor+0.007
 Gor+0.007
 seventy+0.006
 gob+0.006
Neg logit contributions
icker-0.004
by-0.004
ulum-0.004
 pens-0.003
oppy-0.003
Token sees
Token ID7224
Feature activation-3.86e-01

Pos logit contributions
icker+0.017
by+0.016
important+0.014
own+0.014
ulum+0.014
Neg logit contributions
 gor-0.010
 Gor-0.010
 seventy-0.009
oke-0.009
riger-0.009
Token something
Token ID1223
Feature activation-5.39e-01

Pos logit contributions
 Ph+0.008
 gor+0.007
 ostr+0.007
 pand+0.007
 gn+0.007
Neg logit contributions
icker-0.008
by-0.008
bek-0.007
bie-0.007
ilts-0.006
Token shiny
Token ID22441
Feature activation-1.49e-02

Pos logit contributions
 gor+0.009
 Gor+0.009
n+0.009
 deliber+0.009
 spe+0.009
Neg logit contributions
by-0.012
icker-0.012
ulum-0.010
bie-0.010
own-0.010
Token under
Token ID739
Feature activation+5.98e-01

Pos logit contributions
 gor+0.000
 spe+0.000
 seventy+0.000
 Gor+0.000
 ostr+0.000
Neg logit contributions
icker-0.000
by-0.000
bie-0.000
oppy-0.000
Simon-0.000
Token the
Token ID262
Feature activation+1.03e-01

Pos logit contributions
by+0.015
icker+0.015
bie+0.013
important+0.013
attered+0.013
Neg logit contributions
 gor-0.009
 Ph-0.009
 spe-0.009
n-0.008
 Gor-0.008
Token pile
Token ID14540
Feature activation+2.40e+00

Pos logit contributions
by+0.003
icker+0.002
ulum+0.002
bie+0.002
ina+0.002
Neg logit contributions
 gor-0.002
 Gor-0.001
 spe-0.001
 ostr-0.001
 seventy-0.001
Token.
Token ID13
Feature activation+4.11e-01

Pos logit contributions
by+0.055
icker+0.051
ulum+0.048
bie+0.045
oppy+0.043
Neg logit contributions
n-0.039
oke-0.038
 gor-0.038
 Gor-0.037
 reaching-0.035
Token He
Token ID679
Feature activation+4.41e-01

Pos logit contributions
by+0.010
icker+0.009
eds+0.008
ina+0.007
important+0.007
Neg logit contributions
 Gor-0.008
 Ph-0.007
 gor-0.007
 deliber-0.007
 ostr-0.007
Token reaches
Token ID12229
Feature activation+5.99e-01

Pos logit contributions
icker+0.009
by+0.009
?+0.008
ulum+0.008
own+0.008
Neg logit contributions
 gor-0.008
 Gor-0.008
 seventy-0.007
n-0.007
 deliber-0.007
Token for
Token ID329
Feature activation+1.50e-01

Pos logit contributions
icker+0.014
by+0.014
ulum+0.013
important+0.013
own+0.012
Neg logit contributions
 seventy-0.009
 gor-0.009
n-0.008
 deliber-0.008
 Gor-0.008
Token it
Token ID340
Feature activation+6.57e-02

Pos logit contributions
by+0.002
icker+0.002
?+0.002
ulum+0.002
ina+0.002
Neg logit contributions
n-0.003
 deliber-0.003
 gor-0.003
 seventy-0.003
 Gor-0.003
Token They
Token ID1119
Feature activation-2.76e-01

Pos logit contributions
by+0.008
icker+0.008
?+0.007
ulum+0.006
 pens+0.006
Neg logit contributions
 Gor-0.008
n-0.008
 gor-0.008
 deliber-0.008
 seventy-0.008
Token cleaned
Token ID20750
Feature activation+1.01e+00

Pos logit contributions
 gob+0.005
 Gor+0.005
n+0.005
 gor+0.005
 seventy+0.005
Neg logit contributions
icker-0.006
by-0.006
ulum-0.006
?-0.005
own-0.005
Token up
Token ID510
Feature activation+5.06e-01

Pos logit contributions
icker+0.023
by+0.021
?+0.021
zo+0.020
ulum+0.019
Neg logit contributions
n-0.016
 gor-0.015
 Gor-0.015
 Ph-0.014
 Lib-0.013
Token the
Token ID262
Feature activation+5.86e-01

Pos logit contributions
icker+0.011
by+0.010
ulum+0.010
Simon+0.010
zo+0.010
Neg logit contributions
 gor-0.009
n-0.008
 cher-0.008
 spe-0.008
 bou-0.008
Token w
Token ID266
Feature activation+4.11e-01

Pos logit contributions
icker+0.012
by+0.012
?+0.011
ulum+0.011
 Toby+0.011
Neg logit contributions
 gor-0.010
 Gor-0.010
 ostr-0.010
n-0.009
 seventy-0.009
Tokenaffle
Token ID30697
Feature activation+2.40e+00

Pos logit contributions
by+0.009
?+0.009
ulum+0.009
 Toby+0.009
 something+0.009
Neg logit contributions
oke-0.007
n-0.006
fters-0.006
 gor-0.006
go-0.005
Token and
Token ID290
Feature activation+9.09e-01

Pos logit contributions
by+0.048
icker+0.046
?+0.043
ulum+0.043
bie+0.040
Neg logit contributions
n-0.044
 Gor-0.041
 gor-0.041
 gob-0.041
 deliber-0.040
Token Lily
Token ID20037
Feature activation-8.48e-02

Pos logit contributions
icker+0.019
by+0.018
?+0.017
ulum+0.016
own+0.016
Neg logit contributions
 gor-0.016
n-0.016
 Gor-0.016
 gob-0.015
 deliber-0.015
Token's
Token ID338
Feature activation+1.22e-01

Pos logit contributions
n+0.001
 gob+0.001
 delighted+0.001
 seventy+0.001
 deliber+0.001
Neg logit contributions
icker-0.002
ulum-0.002
?-0.002
by-0.002
utes-0.002
Token mom
Token ID1995
Feature activation-2.90e-01

Pos logit contributions
icker+0.002
by+0.002
?+0.002
ulum+0.002
ina+0.002
Neg logit contributions
 gor-0.002
 Gor-0.002
n-0.002
 seventy-0.002
 deliber-0.002
Token made
Token ID925
Feature activation+6.70e-02

Pos logit contributions
n+0.005
 Gor+0.005
 gor+0.005
 seventy+0.005
 deliber+0.005
Neg logit contributions
icker-0.006
by-0.006
?-0.005
ulum-0.005
 something-0.005
Token,
Token ID11
Feature activation+2.93e-01

Pos logit contributions
 gor+0.014
 Gor+0.014
n+0.014
 deliber+0.013
 seventy+0.013
Neg logit contributions
by-0.014
icker-0.013
?-0.012
ulum-0.012
 pens-0.011
Token I
Token ID314
Feature activation+1.37e+00

Pos logit contributions
?+0.004
 something+0.004
 Toby+0.004
 each+0.004
by+0.004
Neg logit contributions
 gor-0.007
 Gor-0.007
riger-0.007
 deliber-0.007
rd-0.006
Token can
Token ID460
Feature activation+3.26e-01

Pos logit contributions
by+0.023
icker+0.022
?+0.021
ulum+0.018
 something+0.018
Neg logit contributions
 gor-0.031
 Gor-0.030
 seventy-0.029
 deliber-0.029
n-0.029
Token help
Token ID1037
Feature activation-3.37e-01

Pos logit contributions
by+0.008
icker+0.007
ulum+0.007
bie+0.006
 pens+0.006
Neg logit contributions
n-0.005
 seventy-0.005
riger-0.005
 deliber-0.005
 Gor-0.005
Token you
Token ID345
Feature activation+1.36e-01

Pos logit contributions
n+0.005
 seventy+0.004
riger+0.004
 gor+0.004
fters+0.004
Neg logit contributions
icker-0.009
by-0.008
ulum-0.007
attered-0.007
bie-0.007
Token,"
Token ID553
Feature activation+2.40e+00

Pos logit contributions
by+0.003
icker+0.003
ulum+0.002
bie+0.002
 Brother+0.002
Neg logit contributions
n-0.003
 seventy-0.002
riger-0.002
 gor-0.002
 Gor-0.002
Token Max
Token ID5436
Feature activation+1.27e+00

Pos logit contributions
icker+0.050
by+0.045
ilts+0.042
own+0.041
attered+0.041
Neg logit contributions
 deliber-0.044
 seventy-0.043
n-0.043
 Gor-0.042
 gor-0.042
Token says
Token ID1139
Feature activation+2.57e-01

Pos logit contributions
by+0.030
icker+0.030
?+0.028
ulum+0.027
 pens+0.025
Neg logit contributions
 gor-0.019
 Gor-0.018
n-0.018
 deliber-0.018
 seventy-0.018
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
by+0.002
icker+0.002
?+0.002
ulum+0.001
 pens+0.001
Neg logit contributions
 gor-0.008
 Gor-0.008
n-0.008
 deliber-0.008
 seventy-0.008
Token He
Token ID679
Feature activation+1.10e-01

Pos logit contributions
 gor+0.003
 Gor+0.003
n+0.003
 deliber+0.003
 seventy+0.003
Neg logit contributions
by-0.002
icker-0.002
?-0.001
ulum-0.001
 pens-0.001
Token takes
Token ID2753
Feature activation-5.81e-02

Pos logit contributions
by+0.002
?+0.002
icker+0.002
 pens+0.002
 each+0.002
Neg logit contributions
 gor-0.002
 Gor-0.002
n-0.002
 deliber-0.002
riger-0.002
Token know
Token ID760
Feature activation-6.08e-01

Pos logit contributions
by+0.010
icker+0.010
?+0.009
ulum+0.008
 something+0.008
Neg logit contributions
 gor-0.011
 Gor-0.011
 seventy-0.010
n-0.010
 deliber-0.010
Token,
Token ID11
Feature activation-5.11e-02

Pos logit contributions
 gor+0.009
 seventy+0.008
n+0.008
 Gor+0.008
 ostr+0.008
Neg logit contributions
icker-0.016
by-0.015
attered-0.014
ulum-0.014
important-0.014
Token right
Token ID826
Feature activation-1.31e-01

Pos logit contributions
 gor+0.001
 Gor+0.001
 deliber+0.001
n+0.001
riger+0.001
Neg logit contributions
?-0.001
by-0.001
icker-0.001
 something-0.001
 Toby-0.001
Token?
Token ID30
Feature activation+2.08e-01

Pos logit contributions
 Gor+0.002
 gor+0.002
 ostr+0.002
riger+0.002
 spe+0.002
Neg logit contributions
icker-0.003
by-0.002
ilts-0.002
 pens-0.002
ulum-0.002
Token Micro
Token ID4527
Feature activation-5.48e-01

Pos logit contributions
by+0.005
icker+0.005
ulum+0.004
?+0.004
eds+0.004
Neg logit contributions
 Gor-0.003
 seventy-0.003
riger-0.003
 gor-0.003
oke-0.003
Tokensc
Token ID1416
Feature activation+2.39e+00

Pos logit contributions
n+0.010
riger+0.010
 Gor+0.009
 gor+0.009
 seventy+0.009
Neg logit contributions
by-0.012
icker-0.012
bie-0.011
ulum-0.010
 throats-0.010
Tokenopes
Token ID13920
Feature activation-1.89e-01

Pos logit contributions
by+0.023
icker+0.022
?+0.020
ulum+0.020
bie+0.017
Neg logit contributions
n-0.068
fters-0.061
 seventy-0.061
riger-0.061
 gob-0.061
Token are
Token ID389
Feature activation+1.19e+00

Pos logit contributions
n+0.003
 gor+0.003
riger+0.003
fters+0.003
 spe+0.003
Neg logit contributions
by-0.005
icker-0.004
ulum-0.004
bie-0.004
 Brother-0.004
Token amazing
Token ID4998
Feature activation-1.93e-01

Pos logit contributions
icker+0.026
by+0.024
ulum+0.022
?+0.020
ina+0.020
Neg logit contributions
 gor-0.023
 Gor-0.022
 spe-0.022
n-0.022
 seventy-0.021
Token.
Token ID13
Feature activation+3.32e-03

Pos logit contributions
 gor+0.004
 Gor+0.004
 spe+0.004
n+0.003
 ostr+0.003
Neg logit contributions
icker-0.004
by-0.004
ulum-0.003
bie-0.003
important-0.003
Token They
Token ID1119
Feature activation+2.39e-01

Pos logit contributions
by+0.000
icker+0.000
?+0.000
ulum+0.000
 pens+0.000
Neg logit contributions
 Gor-0.000
 gor-0.000
 seventy-0.000
n-0.000
 deliber-0.000
Token mom
Token ID1995
Feature activation-4.47e-01

Pos logit contributions
by+0.004
icker+0.004
ulum+0.003
 pens+0.003
?+0.003
Neg logit contributions
 gor-0.006
 ostr-0.005
 Gor-0.005
n-0.005
 spe-0.005
Token and
Token ID290
Feature activation+1.22e+00

Pos logit contributions
 gor+0.010
 Gor+0.010
n+0.010
 seventy+0.010
 deliber+0.010
Neg logit contributions
icker-0.007
by-0.007
?-0.006
ulum-0.006
 Toby-0.005
Token to
Token ID284
Feature activation+3.57e-01

Pos logit contributions
icker+0.022
by+0.022
ulum+0.021
?+0.019
own+0.018
Neg logit contributions
 gor-0.024
 Gor-0.024
 ostr-0.024
 deliber-0.024
 seventy-0.023
Token the
Token ID262
Feature activation-4.11e-01

Pos logit contributions
by+0.006
icker+0.005
ulum+0.005
?+0.004
 pens+0.004
Neg logit contributions
 gor-0.008
n-0.008
 ostr-0.008
 Gor-0.008
 seventy-0.008
Token ost
Token ID23619
Feature activation-1.86e+00

Pos logit contributions
 deliber+0.006
 seventy+0.006
riger+0.005
 Gor+0.005
rd+0.005
Neg logit contributions
by-0.010
icker-0.009
?-0.009
 something-0.009
 each-0.009
Tokenric
Token ID1173
Feature activation+2.39e+00

Pos logit contributions
 gor+0.053
 Gor+0.052
n+0.052
 seventy+0.051
 deliber+0.051
Neg logit contributions
by-0.018
icker-0.017
?-0.015
ulum-0.013
 pens-0.012
Tokenhes
Token ID956
Feature activation+4.49e-01

Pos logit contributions
by+0.034
icker+0.031
?+0.028
ulum+0.028
 Toby+0.026
Neg logit contributions
 gor-0.058
n-0.058
 Gor-0.056
 seventy-0.055
oke-0.054
Token.
Token ID13
Feature activation+2.00e-01

Pos logit contributions
ulum+0.009
icker+0.008
by+0.008
oppy+0.008
Simon+0.008
Neg logit contributions
n-0.009
 ostr-0.008
 gor-0.008
 Ph-0.008
oke-0.007
Token They
Token ID1119
Feature activation+3.56e-01

Pos logit contributions
by+0.003
icker+0.002
ulum+0.002
?+0.002
ina+0.002
Neg logit contributions
 gor-0.005
 Gor-0.005
 deliber-0.005
 ostr-0.005
 Ph-0.005
Token learned
Token ID4499
Feature activation-1.45e-01

Pos logit contributions
icker+0.007
by+0.007
ulum+0.007
?+0.007
own+0.006
Neg logit contributions
 Gor-0.006
 gor-0.006
n-0.006
 seventy-0.006
 deliber-0.006
Token their
Token ID511
Feature activation-2.67e-01

Pos logit contributions
 gor+0.003
 ostr+0.003
n+0.003
 Gor+0.003
 Lib+0.003
Neg logit contributions
by-0.002
icker-0.002
ulum-0.002
ito-0.002
ky-0.002
Token!
Token ID0
Feature activation-2.56e-01

Pos logit contributions
icker+0.007
by+0.007
ulum+0.006
?+0.006
 Toby+0.006
Neg logit contributions
 gor-0.006
n-0.006
 Gor-0.006
 seventy-0.006
riger-0.006
Token What
Token ID1867
Feature activation+7.21e-02

Pos logit contributions
 Gor+0.005
 seventy+0.005
n+0.005
 gor+0.005
 deliber+0.004
Neg logit contributions
by-0.006
icker-0.005
?-0.004
ulum-0.004
 pens-0.004
Token a
Token ID257
Feature activation+7.33e-01

Pos logit contributions
icker+0.002
by+0.002
ulum+0.002
ky+0.001
attered+0.001
Neg logit contributions
n-0.001
 gor-0.001
 seventy-0.001
 Gor-0.001
 spe-0.001
Token nice
Token ID3621
Feature activation+1.30e+00

Pos logit contributions
by+0.014
icker+0.014
?+0.013
ulum+0.012
 pens+0.012
Neg logit contributions
 gor-0.014
 Gor-0.014
n-0.013
 seventy-0.013
 deliber-0.013
Token surprise
Token ID5975
Feature activation-3.37e-01

Pos logit contributions
icker+0.027
by+0.026
ulum+0.024
?+0.023
 pens+0.023
Neg logit contributions
 gor-0.023
 Gor-0.022
 ostr-0.022
n-0.022
 seventy-0.022
Token,"
Token ID553
Feature activation+2.38e+00

Pos logit contributions
 gor+0.008
 Gor+0.008
n+0.008
 deliber+0.008
 seventy+0.007
Neg logit contributions
by-0.005
icker-0.005
?-0.004
ulum-0.004
 pens-0.004
Token said
Token ID531
Feature activation+6.71e-01

Pos logit contributions
by+0.057
icker+0.055
ulum+0.052
?+0.049
own+0.047
Neg logit contributions
n-0.036
 deliber-0.036
 delighted-0.033
 Gor-0.033
 gob-0.032
Token M
Token ID337
Feature activation-4.51e-01

Pos logit contributions
by+0.012
ulum+0.012
icker+0.012
ilts+0.011
eds+0.011
Neg logit contributions
 deliber-0.013
 Ph-0.013
 Gor-0.013
n-0.013
 delighted-0.012
Tokenummy
Token ID13513
Feature activation+9.25e-01

Pos logit contributions
 Gor+0.007
 deliber+0.007
oke+0.007
n+0.007
 gor+0.007
Neg logit contributions
icker-0.010
by-0.010
 something-0.009
 pens-0.009
ulum-0.009
Token.
Token ID13
Feature activation+2.50e-01

Pos logit contributions
icker+0.011
by+0.011
ulum+0.010
?+0.009
important+0.008
Neg logit contributions
n-0.023
 seventy-0.023
 Gor-0.023
 gor-0.023
 deliber-0.023
Token
Token ID198
Feature activation+5.12e-01

Pos logit contributions
by+0.005
icker+0.005
?+0.004
ulum+0.004
 pens+0.004
Neg logit contributions
 gor-0.005
 Gor-0.005
n-0.005
 deliber-0.005
 seventy-0.005
Token
Token ID198
Feature activation-1.02e-01

Pos logit contributions
by+0.006
icker+0.006
?+0.006
ulum+0.006
 pens+0.005
Neg logit contributions
n-0.005
 gor-0.005
 Gor-0.005
 seventy-0.005
 deliber-0.005
TokenThey
Token ID2990
Feature activation-2.14e-01

Pos logit contributions
 gor+0.002
riger+0.002
 Gor+0.002
 seventy+0.002
 spe+0.002
Neg logit contributions
by-0.001
icker-0.001
?-0.001
 each-0.001
They-0.001
Token said
Token ID531
Feature activation+6.37e-02

Pos logit contributions
 Gor+0.004
 gor+0.004
 gob+0.004
n+0.004
 seventy+0.004
Neg logit contributions
icker-0.004
ulum-0.004
?-0.004
by-0.004
 something-0.003
Token their
Token ID511
Feature activation-4.10e-01

Pos logit contributions
icker+0.002
by+0.002
?+0.002
ulum+0.002
 pens+0.002
Neg logit contributions
 gor-0.001
 Gor-0.001
 seventy-0.001
n-0.001
 deliber-0.001
Token good
Token ID922
Feature activation+5.34e-01

Pos logit contributions
fters+0.008
 deliber+0.008
n+0.008
 seventy+0.008
riger+0.008
Neg logit contributions
by-0.008
icker-0.007
?-0.007
 each-0.006
 something-0.006
Tokenby
Token ID1525
Feature activation+2.38e+00

Pos logit contributions
icker+0.006
ulum+0.006
?+0.005
'?+0.004
important+0.004
Neg logit contributions
 Gor-0.016
 gor-0.015
 ostr-0.015
 deliber-0.014
n-0.014
Tokenes
Token ID274
Feature activation+4.32e-01

Pos logit contributions
icker+0.057
ulum+0.054
?+0.048
by+0.048
pers+0.047
Neg logit contributions
 Gor-0.043
 gor-0.042
n-0.038
oke-0.036
 seventy-0.036
Token until
Token ID1566
Feature activation+1.67e-01

Pos logit contributions
icker+0.011
ulum+0.011
by+0.010
important+0.010
 elbows+0.010
Neg logit contributions
n-0.006
 gob-0.006
 gor-0.005
 Gor-0.005
 delighted-0.005
Token next
Token ID1306
Feature activation+1.17e+00

Pos logit contributions
by+0.003
icker+0.003
?+0.002
ulum+0.002
 pens+0.002
Neg logit contributions
 gor-0.004
 Gor-0.004
n-0.004
 deliber-0.004
 seventy-0.004
Token time
Token ID640
Feature activation+2.47e-01

Pos logit contributions
icker+0.031
by+0.029
ulum+0.029
 pens+0.028
important+0.028
Neg logit contributions
 deliber-0.014
n-0.013
 gor-0.012
 seventy-0.011
 Gor-0.011
Token.
Token ID13
Feature activation-9.31e-03

Pos logit contributions
icker+0.005
by+0.005
ulum+0.005
important+0.005
?+0.005
Neg logit contributions
 Gor-0.004
 gor-0.004
n-0.004
 seventy-0.004
 deliber-0.004
Token coats
Token ID30720
Feature activation-8.20e-01

Pos logit contributions
icker+0.010
by+0.010
?+0.009
own+0.009
important+0.008
Neg logit contributions
n-0.009
 gor-0.008
 ostr-0.008
riger-0.008
 deliber-0.008
Token.
Token ID13
Feature activation-4.54e-01

Pos logit contributions
n+0.020
 gor+0.019
 seventy+0.019
 Gor+0.019
riger+0.019
Neg logit contributions
by-0.011
icker-0.011
?-0.009
ulum-0.009
oppy-0.009
Token They
Token ID1119
Feature activation-5.06e-01

Pos logit contributions
 gor+0.011
 Gor+0.010
 seventy+0.010
n+0.010
 deliber+0.010
Neg logit contributions
by-0.007
icker-0.007
?-0.006
ulum-0.006
ina-0.005
Token take
Token ID1011
Feature activation-2.89e-01

Pos logit contributions
n+0.008
 gor+0.008
riger+0.007
 Gor+0.007
 seventy+0.007
Neg logit contributions
by-0.012
icker-0.012
ulum-0.011
?-0.011
own-0.010
Token their
Token ID511
Feature activation-3.91e-01

Pos logit contributions
n+0.006
 gor+0.005
 spe+0.005
 Ph+0.005
 Gor+0.005
Neg logit contributions
icker-0.006
?-0.005
by-0.005
ulum-0.005
ina-0.005
Token umb
Token ID20810
Feature activation-3.01e+00

Pos logit contributions
 seventy+0.010
 gor+0.009
 Gor+0.009
 deliber+0.009
riger+0.009
Neg logit contributions
by-0.006
icker-0.005
?-0.005
 pens-0.005
 something-0.005
Tokenrell
Token ID11252
Feature activation+2.18e-01

Pos logit contributions
 gor+0.042
 Gor+0.042
 seventy+0.039
 deliber+0.036
oke+0.035
Neg logit contributions
by-0.075
icker-0.074
?-0.073
ulum-0.072
 each-0.070
Tokenas
Token ID292
Feature activation-1.63e-01

Pos logit contributions
by+0.004
icker+0.004
?+0.003
ulum+0.003
 each+0.003
Neg logit contributions
 gor-0.005
 Gor-0.005
 seventy-0.005
 deliber-0.005
n-0.004
Token.
Token ID13
Feature activation-5.71e-01

Pos logit contributions
n+0.003
 gor+0.003
 Gor+0.003
riger+0.003
 ostr+0.003
Neg logit contributions
icker-0.003
by-0.003
?-0.002
ulum-0.002
 pens-0.002
Token
Token ID198
Feature activation+3.74e-01

Pos logit contributions
 gor+0.012
 Gor+0.012
 seventy+0.012
n+0.012
 deliber+0.012
Neg logit contributions
by-0.010
icker-0.009
?-0.008
ulum-0.008
ina-0.008
Token
Token ID198
Feature activation+7.53e-02

Pos logit contributions
by+0.006
icker+0.006
?+0.005
 something+0.005
 They+0.005
Neg logit contributions
 gor-0.009
 Gor-0.009
 ostr-0.008
riger-0.008
 seventy-0.008
TokenN
Token ID45
Feature activation-1.14e-01

Pos logit contributions
by+0.001
icker+0.001
ulum+0.001
 pens+0.001
?+0.001
Neg logit contributions
n-0.001
 Gor-0.001
 gor-0.001
 deliber-0.001
 seventy-0.001
Tokenina
Token ID1437
Feature activation-1.16e+00

Pos logit contributions
oke+0.002
n+0.002
 Gor+0.002
 seventy+0.002
 Fool+0.002
Neg logit contributions
 gears-0.002
 pens-0.002
ulum-0.002
icker-0.002
 each-0.002
Token was
Token ID373
Feature activation-8.32e-02

Pos logit contributions
n+0.020
 Gor+0.019
 gor+0.019
 gob+0.019
 ostr+0.018
Neg logit contributions
icker-0.026
by-0.024
ulum-0.022
?-0.022
 pens-0.022
Token panic
Token ID13619
Feature activation-2.28e-01

Pos logit contributions
 gor+0.002
 Gor+0.002
riger+0.002
 deliber+0.002
n+0.002
Neg logit contributions
by-0.001
icker-0.001
?-0.001
 something-0.001
ulum-0.001
Token-
Token ID12
Feature activation-9.48e-01

Pos logit contributions
 delighted+0.004
 reaching+0.004
 Gor+0.004
 following+0.004
 spe+0.003
Neg logit contributions
by-0.005
icker-0.005
ulum-0.005
ports-0.004
 pens-0.004
Tokenstr
Token ID2536
Feature activation-2.79e+00

Pos logit contributions
n+0.015
 Gor+0.014
go+0.014
 gor+0.014
riger+0.013
Neg logit contributions
?-0.020
by-0.020
icker-0.020
ulum-0.020
 pens-0.020
Tokenicken
Token ID5973
Feature activation-8.37e-01

Pos logit contributions
 gor+0.050
 Gor+0.049
n+0.049
 seventy+0.048
riger+0.047
Neg logit contributions
by-0.055
icker-0.054
?-0.051
ulum-0.047
 something-0.047
Token.
Token ID13
Feature activation+4.18e-02

Pos logit contributions
 gor+0.018
n+0.018
 Gor+0.017
 ostr+0.016
oke+0.016
Neg logit contributions
icker-0.015
by-0.014
ulum-0.012
bie-0.012
?-0.011
Token She
Token ID1375
Feature activation-3.40e-01

Pos logit contributions
by+0.001
icker+0.001
eds+0.001
 pens+0.001
ulum+0.001
Neg logit contributions
 Ph-0.001
 Gor-0.001
 Once-0.001
n-0.001
 Lib-0.001
Token screamed
Token ID25421
Feature activation+7.32e-02

Pos logit contributions
n+0.006
 gob+0.006
 Gor+0.006
 spe+0.005
 ostr+0.005
Neg logit contributions
icker-0.008
by-0.007
ulum-0.007
?-0.007
 chess-0.007
Token for
Token ID329
Feature activation+1.80e-01

Pos logit contributions
by+0.002
icker+0.002
ulum+0.002
 pens+0.001
important+0.001
Neg logit contributions
n-0.001
 delighted-0.001
 Gor-0.001
 seventy-0.001
 reaching-0.001
Token her
Token ID607
Feature activation+1.02e+00

Pos logit contributions
by+0.006
icker+0.006
ulum+0.005
?+0.005
bie+0.005
Neg logit contributions
n-0.010
 gor-0.009
 deliber-0.009
 Gor-0.009
fters-0.009
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
icker+0.024
ky+0.022
ulum+0.022
by+0.021
Simon+0.020
Neg logit contributions
n-0.021
rd-0.015
 following-0.014
 EX-0.014
 deliber-0.014
Token "
Token ID366
Feature activation-3.47e-01

Pos logit contributions
by+0.018
icker+0.017
?+0.016
ulum+0.014
bie+0.014
Neg logit contributions
 deliber-0.022
n-0.022
oke-0.021
 Ph-0.021
 Gor-0.021
TokenIf
Token ID1532
Feature activation+8.32e-02

Pos logit contributions
 gor+0.008
 Gor+0.008
n+0.007
 deliber+0.007
 seventy+0.007
Neg logit contributions
by-0.006
icker-0.006
?-0.005
ulum-0.005
 pens-0.004
Token an
Token ID281
Feature activation-1.89e-01

Pos logit contributions
icker+0.002
by+0.002
ulum+0.002
bie+0.002
ky+0.002
Neg logit contributions
n-0.001
 gor-0.001
 ostr-0.001
 Gor-0.001
 deliber-0.001
Token earthquake
Token ID16295
Feature activation-2.76e+00

Pos logit contributions
 gor+0.004
 Gor+0.004
 seventy+0.004
riger+0.004
n+0.004
Neg logit contributions
by-0.003
icker-0.003
?-0.003
ulum-0.003
 each-0.003
Token happens
Token ID4325
Feature activation-1.08e+00

Pos logit contributions
 gor+0.059
 Gor+0.058
 deliber+0.057
 seventy+0.057
riger+0.056
Neg logit contributions
by-0.047
?-0.045
icker-0.043
 something-0.037
 each-0.037
Token,
Token ID11
Feature activation+2.49e-02

Pos logit contributions
n+0.016
 gor+0.015
 Gor+0.015
 seventy+0.015
 deliber+0.014
Neg logit contributions
icker-0.026
by-0.025
ulum-0.023
 pens-0.023
 journals-0.021
Token find
Token ID1064
Feature activation-6.57e-01

Pos logit contributions
by+0.000
icker+0.000
?+0.000
ulum+0.000
 pens+0.000
Neg logit contributions
 gor-0.000
 Gor-0.000
n-0.000
 seventy-0.000
 deliber-0.000
Token a
Token ID257
Feature activation-4.99e-01

Pos logit contributions
n+0.015
 gor+0.014
 Gor+0.014
 ostr+0.014
 seventy+0.013
Neg logit contributions
by-0.011
icker-0.011
ulum-0.009
bie-0.009
ina-0.008
Token safe
Token ID3338
Feature activation+2.07e-01

Pos logit contributions
 gor+0.007
 Gor+0.007
 ostr+0.007
 spe+0.007
 inconvenience+0.006
Neg logit contributions
icker-0.012
by-0.012
ina-0.011
ulum-0.011
 pens-0.011
Token coats
Token ID30720
Feature activation-4.64e-01

Pos logit contributions
icker+0.010
by+0.010
?+0.009
own+0.008
ulum+0.008
Neg logit contributions
n-0.009
 gor-0.009
riger-0.009
 spe-0.009
 ostr-0.009
Token.
Token ID13
Feature activation-5.67e-01

Pos logit contributions
n+0.011
 gor+0.010
 seventy+0.010
 Gor+0.010
riger+0.010
Neg logit contributions
by-0.007
icker-0.007
?-0.006
ulum-0.006
oppy-0.006
Token They
Token ID1119
Feature activation-5.03e-01

Pos logit contributions
 gor+0.014
 Gor+0.014
 seventy+0.014
n+0.013
 deliber+0.013
Neg logit contributions
by-0.008
icker-0.008
?-0.007
ulum-0.006
 pens-0.006
Token take
Token ID1011
Feature activation-1.29e-01

Pos logit contributions
 gor+0.008
n+0.008
 Gor+0.008
riger+0.008
 seventy+0.007
Neg logit contributions
by-0.011
icker-0.011
?-0.010
ulum-0.010
own-0.010
Token their
Token ID511
Feature activation-2.88e-01

Pos logit contributions
n+0.003
 gor+0.003
 spe+0.002
 Gor+0.002
 Ph+0.002
Neg logit contributions
icker-0.002
by-0.002
?-0.002
ulum-0.002
ina-0.002
Token umb
Token ID20810
Feature activation-2.74e+00

Pos logit contributions
 gor+0.007
 Gor+0.007
 seventy+0.007
 deliber+0.007
riger+0.007
Neg logit contributions
by-0.004
icker-0.004
?-0.004
 pens-0.003
 each-0.003
Tokenrell
Token ID11252
Feature activation+3.57e-01

Pos logit contributions
 gor+0.035
 Gor+0.034
 seventy+0.032
 deliber+0.030
 spe+0.029
Neg logit contributions
by-0.071
icker-0.071
?-0.069
ulum-0.068
 each-0.066
Tokenas
Token ID292
Feature activation-1.77e-01

Pos logit contributions
by+0.006
icker+0.006
?+0.006
ulum+0.005
 pens+0.005
Neg logit contributions
 gor-0.007
 Gor-0.007
 deliber-0.007
 seventy-0.007
n-0.007
Token.
Token ID13
Feature activation-5.57e-01

Pos logit contributions
n+0.004
 gor+0.004
 Gor+0.004
riger+0.004
 seventy+0.004
Neg logit contributions
by-0.003
icker-0.003
?-0.003
ulum-0.003
 pens-0.002
Token
Token ID198
Feature activation+3.30e-01

Pos logit contributions
 gor+0.013
 Gor+0.012
n+0.012
 seventy+0.012
 deliber+0.012
Neg logit contributions
by-0.009
icker-0.009
?-0.008
ulum-0.007
 pens-0.007
Token
Token ID198
Feature activation+6.00e-02

Pos logit contributions
by+0.005
icker+0.005
?+0.005
 something+0.004
 each+0.004
Neg logit contributions
 gor-0.008
 Gor-0.008
riger-0.007
 seventy-0.007
 ostr-0.007
Token
Token ID220
Feature activation+4.07e-01

Pos logit contributions
n+0.010
 gor+0.008
 Gor+0.008
 deliber+0.008
 seventy+0.008
Neg logit contributions
by-0.037
icker-0.035
bie-0.032
 pens-0.032
ulum-0.032
Token
Token ID198
Feature activation+2.38e-02

Pos logit contributions
by+0.009
icker+0.009
ulum+0.009
 pens+0.009
?+0.009
Neg logit contributions
n-0.006
oke-0.005
 seventy-0.005
 delighted-0.005
 deliber-0.005
Token
Token ID198
Feature activation-3.94e-02

Pos logit contributions
icker+0.001
by+0.001
ulum+0.000
 pens+0.000
?+0.000
Neg logit contributions
n-0.000
 Gor-0.000
 gor-0.000
 deliber-0.000
 seventy-0.000
TokenThe
Token ID464
Feature activation-2.76e-01

Pos logit contributions
n+0.001
 Gor+0.001
 gor+0.001
 deliber+0.001
 seventy+0.001
Neg logit contributions
by-0.001
icker-0.001
ulum-0.001
?-0.001
 pens-0.001
Token town
Token ID3240
Feature activation-3.82e-01

Pos logit contributions
 gor+0.004
 ostr+0.004
 Gor+0.004
 seventy+0.004
n+0.004
Neg logit contributions
icker-0.007
by-0.006
ulum-0.006
?-0.006
own-0.006
Tokenspe
Token ID4125
Feature activation-2.73e+00

Pos logit contributions
n+0.006
 gor+0.006
 Gor+0.006
oke+0.006
 delighted+0.005
Neg logit contributions
icker-0.009
by-0.008
?-0.008
ulum-0.008
 pens-0.008
Tokenople
Token ID643
Feature activation-5.80e-01

Pos logit contributions
 gor+0.028
n+0.027
 Gor+0.027
 seventy+0.026
 deliber+0.025
Neg logit contributions
icker-0.076
by-0.074
?-0.071
 Toby-0.068
 something-0.067
Token had
Token ID550
Feature activation+2.93e-01

Pos logit contributions
n+0.010
 gor+0.009
 Gor+0.009
oke+0.009
 deliber+0.008
Neg logit contributions
icker-0.013
by-0.013
ulum-0.012
?-0.011
 pens-0.011
Token to
Token ID284
Feature activation-3.23e-01

Pos logit contributions
icker+0.007
oppy+0.006
by+0.006
ulum+0.005
ina+0.005
Neg logit contributions
n-0.006
 gor-0.005
 deliber-0.005
 delighted-0.005
 Gor-0.005
Token find
Token ID1064
Feature activation-2.41e-01

Pos logit contributions
n+0.005
 gor+0.005
 deliber+0.004
 Gor+0.004
riger+0.004
Neg logit contributions
icker-0.008
by-0.008
ulum-0.007
bie-0.007
 pens-0.007
Token a
Token ID257
Feature activation-2.99e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
n+0.005
 deliber+0.005
 ostr+0.005
Neg logit contributions
icker-0.004
by-0.004
ulum-0.004
oppy-0.004
?-0.003
Token she
Token ID673
Feature activation-9.14e-01

Pos logit contributions
 gor+0.013
 Gor+0.013
n+0.012
 ostr+0.012
 deliber+0.012
Neg logit contributions
icker-0.016
by-0.014
ilts-0.013
important-0.013
attered-0.013
Token bowed
Token ID35737
Feature activation-5.12e-01

Pos logit contributions
 gor+0.019
 Gor+0.019
n+0.018
 seventy+0.018
 deliber+0.018
Neg logit contributions
icker-0.016
by-0.015
?-0.014
ulum-0.014
 something-0.013
Token,
Token ID11
Feature activation-2.41e-02

Pos logit contributions
 Gor+0.008
 seventy+0.007
 gor+0.007
 delighted+0.007
 ostr+0.007
Neg logit contributions
icker-0.012
by-0.011
ulum-0.011
ilts-0.010
ito-0.010
Token the
Token ID262
Feature activation-5.29e-02

Pos logit contributions
 gor+0.000
n+0.000
 Gor+0.000
 deliber+0.000
 Ph+0.000
Neg logit contributions
icker-0.001
by-0.001
ulum-0.000
?-0.000
ilts-0.000
Token town
Token ID3240
Feature activation-8.26e-01

Pos logit contributions
 ostr+0.001
 gor+0.001
 Gor+0.001
 seventy+0.001
n+0.001
Neg logit contributions
icker-0.001
by-0.001
ulum-0.001
?-0.001
bie-0.001
Tokenspe
Token ID4125
Feature activation-2.70e+00

Pos logit contributions
n+0.015
 gor+0.015
 Gor+0.014
 ostr+0.014
oke+0.014
Neg logit contributions
icker-0.017
ulum-0.015
by-0.015
?-0.014
 Toby-0.014
Tokenople
Token ID643
Feature activation-7.29e-01

Pos logit contributions
 gor+0.029
 Gor+0.028
 seventy+0.027
n+0.027
 deliber+0.026
Neg logit contributions
icker-0.074
by-0.073
?-0.069
 Toby-0.066
 something-0.065
Token stopped
Token ID5025
Feature activation-6.73e-01

Pos logit contributions
n+0.013
 gor+0.013
 Gor+0.012
 seventy+0.012
 ostr+0.012
Neg logit contributions
icker-0.015
by-0.014
ulum-0.014
 pens-0.012
own-0.012
Token to
Token ID284
Feature activation+1.84e-01

Pos logit contributions
 gor+0.010
 following+0.010
 spe+0.010
 reaching+0.010
 ostr+0.010
Neg logit contributions
icker-0.015
by-0.014
ulum-0.014
oppy-0.013
ilts-0.013
Token admire
Token ID21099
Feature activation-1.29e-01

Pos logit contributions
icker+0.004
by+0.004
ulum+0.004
bie+0.004
?+0.004
Neg logit contributions
 gor-0.003
n-0.003
 seventy-0.003
 Gor-0.003
 deliber-0.003
Token her
Token ID607
Feature activation+2.65e-01

Pos logit contributions
 gor+0.002
 spe+0.002
n+0.002
 Gor+0.002
 amaz+0.002
Neg logit contributions
icker-0.003
by-0.003
ulum-0.003
oppy-0.002
attered-0.002
Token to
Token ID284
Feature activation+3.30e-01

Pos logit contributions
by+0.000
icker+0.000
ulum+0.000
important+0.000
bie+0.000
Neg logit contributions
 deliber-0.000
 gor-0.000
 Gor-0.000
 delighted-0.000
n-0.000
Token find
Token ID1064
Feature activation-1.27e-01

Pos logit contributions
icker+0.008
by+0.008
attered+0.007
ulum+0.007
ina+0.007
Neg logit contributions
n-0.005
 gor-0.005
 Gor-0.005
 ostr-0.004
 deliber-0.004
Token the
Token ID262
Feature activation-1.17e-01

Pos logit contributions
 gor+0.003
 Gor+0.003
 seventy+0.003
 deliber+0.003
 ostr+0.003
Neg logit contributions
icker-0.002
by-0.002
ulum-0.001
?-0.001
ina-0.001
Token land
Token ID1956
Feature activation+3.80e-01

Pos logit contributions
 gor+0.002
 Gor+0.002
 ostr+0.002
n+0.002
 seventy+0.002
Neg logit contributions
icker-0.003
by-0.002
ulum-0.002
?-0.002
bie-0.002
Token of
Token ID286
Feature activation+2.68e-02

Pos logit contributions
icker+0.008
ulum+0.008
by+0.008
 Brother+0.007
ilts+0.007
Neg logit contributions
n-0.006
 gor-0.006
 Gor-0.006
go-0.006
oke-0.006
Token Imp
Token ID9855
Feature activation-2.70e+00

Pos logit contributions
icker+0.000
by+0.000
ulum+0.000
attered+0.000
?+0.000
Neg logit contributions
 gor-0.001
 Gor-0.001
 ostr-0.001
 seventy-0.001
 deliber-0.001
Tokenressive
Token ID3314
Feature activation-6.31e-01

Pos logit contributions
 gor+0.054
 Gor+0.053
 deliber+0.052
oke+0.052
 seventy+0.051
Neg logit contributions
by-0.049
icker-0.045
?-0.045
ulum-0.044
own-0.040
Token Things
Token ID11597
Feature activation-2.64e-01

Pos logit contributions
 gor+0.013
 Gor+0.012
n+0.011
 ostr+0.011
riger+0.010
Neg logit contributions
icker-0.014
by-0.013
bie-0.011
oppy-0.011
ot-0.011
Token.
Token ID13
Feature activation+3.93e-01

Pos logit contributions
n+0.006
 gor+0.006
 deliber+0.005
riger+0.005
 Gor+0.005
Neg logit contributions
icker-0.005
by-0.005
ulum-0.004
bie-0.004
oppy-0.003
Token After
Token ID2293
Feature activation+2.33e-01

Pos logit contributions
by+0.007
icker+0.006
?+0.006
ina+0.005
ulum+0.005
Neg logit contributions
 Gor-0.010
 gor-0.009
 seventy-0.009
 deliber-0.009
n-0.008
Token walking
Token ID6155
Feature activation-1.43e-01

Pos logit contributions
by+0.005
icker+0.005
ulum+0.004
?+0.004
bie+0.004
Neg logit contributions
 gor-0.005
 deliber-0.005
n-0.005
 Gor-0.005
 seventy-0.004
Token and
Token ID290
Feature activation+2.50e-01

Pos logit contributions
 gor+0.003
n+0.003
 Gor+0.003
 following+0.003
 spe+0.003
Neg logit contributions
icker-0.006
by-0.005
ulum-0.005
ito-0.005
bie-0.005
Token stay
Token ID2652
Feature activation-4.69e-01

Pos logit contributions
by+0.004
icker+0.004
?+0.004
ulum+0.004
 pens+0.003
Neg logit contributions
 gor-0.005
 Gor-0.005
n-0.005
 deliber-0.005
 seventy-0.005
Token inside
Token ID2641
Feature activation-2.69e-01

Pos logit contributions
n+0.007
 gor+0.007
riger+0.007
 deliber+0.006
 Gor+0.006
Neg logit contributions
by-0.010
icker-0.010
ulum-0.010
 pens-0.009
 Toby-0.009
Token during
Token ID1141
Feature activation-2.87e-01

Pos logit contributions
n+0.004
 gor+0.004
 following+0.004
 spe+0.004
 deliber+0.004
Neg logit contributions
icker-0.006
by-0.006
ulum-0.006
bie-0.005
zo-0.005
Token a
Token ID257
Feature activation-6.94e-01

Pos logit contributions
n+0.006
 gor+0.005
 deliber+0.005
 spe+0.005
 inconvenience+0.005
Neg logit contributions
by-0.006
icker-0.005
ulum-0.005
bie-0.005
ina-0.005
Token storm
Token ID6388
Feature activation-2.70e+00

Pos logit contributions
 gor+0.014
 Gor+0.013
 seventy+0.013
n+0.013
 deliber+0.013
Neg logit contributions
by-0.013
icker-0.013
?-0.012
ulum-0.011
 something-0.011
Token.
Token ID13
Feature activation+3.18e-02

Pos logit contributions
 seventy+0.081
 Gor+0.081
 gor+0.079
 deliber+0.078
fters+0.076
Neg logit contributions
by-0.023
?-0.022
icker-0.021
 each-0.018
ulum-0.014
Token
Token ID198
Feature activation-1.01e-01

Pos logit contributions
by+0.001
icker+0.000
?+0.000
ulum+0.000
 Toby+0.000
Neg logit contributions
 gor-0.001
 Gor-0.001
 seventy-0.001
riger-0.001
n-0.001
Token
Token ID198
Feature activation+6.13e-02

Pos logit contributions
 Gor+0.003
 gor+0.003
 seventy+0.003
riger+0.003
 spe+0.002
Neg logit contributions
by-0.001
icker-0.001
?-0.001
 each-0.001
 something-0.001
TokenAfter
Token ID3260
Feature activation-6.60e-01

Pos logit contributions
by+0.001
icker+0.001
?+0.001
ulum+0.001
 something+0.001
Neg logit contributions
 gor-0.001
 Gor-0.001
 seventy-0.001
 deliber-0.001
n-0.001
Token the
Token ID262
Feature activation+2.38e-01

Pos logit contributions
n+0.014
 deliber+0.014
 gor+0.014
 Ph+0.013
 following+0.013
Neg logit contributions
by-0.013
icker-0.011
ulum-0.011
important-0.010
bie-0.010
Token got
Token ID1392
Feature activation-1.88e-01

Pos logit contributions
icker+0.005
by+0.004
ulum+0.004
?+0.004
 pens+0.004
Neg logit contributions
n-0.005
 Gor-0.005
 gor-0.005
 deliber-0.005
 seventy-0.005
Token lost
Token ID2626
Feature activation-1.05e-01

Pos logit contributions
n+0.003
 Gor+0.003
 gor+0.003
 seventy+0.003
 delighted+0.003
Neg logit contributions
icker-0.005
ulum-0.004
akes-0.004
ilts-0.004
ina-0.004
Token,
Token ID11
Feature activation+5.38e-02

Pos logit contributions
 Gor+0.002
 gor+0.002
n+0.001
 seventy+0.001
 spe+0.001
Neg logit contributions
icker-0.003
by-0.002
ulum-0.002
ilts-0.002
bie-0.002
Token the
Token ID262
Feature activation-2.14e-01

Pos logit contributions
icker+0.001
by+0.001
ulum+0.001
?+0.001
own+0.001
Neg logit contributions
 gor-0.001
 Gor-0.001
n-0.001
 deliber-0.001
 seventy-0.001
Token l
Token ID300
Feature activation-5.25e-02

Pos logit contributions
 gor+0.004
 Gor+0.004
n+0.004
 deliber+0.004
 seventy+0.004
Neg logit contributions
by-0.004
icker-0.004
?-0.004
 something-0.003
ulum-0.003
Tokenibr
Token ID2889
Feature activation-2.59e+00

Pos logit contributions
 gor+0.001
 Gor+0.001
n+0.000
 seventy+0.000
 deliber+0.000
Neg logit contributions
by-0.002
icker-0.001
?-0.001
ulum-0.001
 each-0.001
Tokenarians
Token ID13517
Feature activation-9.67e-01

Pos logit contributions
 gor+0.046
 deliber+0.046
 ju+0.045
 Ph+0.045
 ostr+0.045
Neg logit contributions
by-0.050
icker-0.047
?-0.045
ulum-0.042
ina-0.042
Token would
Token ID561
Feature activation+1.10e-01

Pos logit contributions
n+0.018
 Gor+0.016
 gor+0.016
 seventy+0.015
riger+0.015
Neg logit contributions
icker-0.021
ulum-0.020
by-0.019
bie-0.017
ilts-0.017
Token help
Token ID1037
Feature activation-6.66e-01

Pos logit contributions
icker+0.002
by+0.002
ulum+0.002
?+0.002
 pens+0.002
Neg logit contributions
 gor-0.002
n-0.002
 Gor-0.002
 seventy-0.002
 deliber-0.002
Token him
Token ID683
Feature activation+2.10e-01

Pos logit contributions
n+0.014
 gor+0.013
 Gor+0.013
 deliber+0.012
 seventy+0.012
Neg logit contributions
icker-0.013
by-0.011
ulum-0.011
bie-0.010
attered-0.009
Token find
Token ID1064
Feature activation-6.08e-01

Pos logit contributions
icker+0.005
ulum+0.005
by+0.005
 Brother+0.005
ito+0.005
Neg logit contributions
n-0.003
 arrive-0.002
 reaching-0.002
 convince-0.002
 Gor-0.002
Token their
Token ID511
Feature activation+1.83e-01

Pos logit contributions
icker+0.011
ulum+0.010
by+0.010
important+0.009
?+0.009
Neg logit contributions
 delighted-0.010
 gor-0.010
n-0.010
 seventy-0.010
 ostr-0.010
Token hands
Token ID2832
Feature activation-4.46e-02

Pos logit contributions
by+0.003
icker+0.003
?+0.002
ulum+0.002
 something+0.002
Neg logit contributions
 gor-0.004
 Gor-0.004
n-0.004
 seventy-0.004
 deliber-0.004
Token and
Token ID290
Feature activation+1.70e-01

Pos logit contributions
 seventy+0.001
 gor+0.001
n+0.001
 Gor+0.001
 deliber+0.001
Neg logit contributions
icker-0.001
by-0.001
ulum-0.001
?-0.001
 pens-0.001
Token spun
Token ID26843
Feature activation+2.37e-02

Pos logit contributions
icker+0.004
by+0.004
?+0.004
ulum+0.004
important+0.003
Neg logit contributions
 gor-0.002
n-0.002
 Gor-0.002
 spe-0.002
 deliber-0.002
Token their
Token ID511
Feature activation-8.90e-02

Pos logit contributions
icker+0.001
by+0.001
?+0.000
ulum+0.000
bie+0.000
Neg logit contributions
 gor-0.000
n-0.000
 Gor-0.000
 spe-0.000
riger-0.000
Token umb
Token ID20810
Feature activation-2.59e+00

Pos logit contributions
 gor+0.002
 Gor+0.002
 deliber+0.002
 seventy+0.002
n+0.002
Neg logit contributions
by-0.001
icker-0.001
?-0.001
ulum-0.001
 pens-0.001
Tokenrell
Token ID11252
Feature activation+1.98e-01

Pos logit contributions
 gor+0.026
 Gor+0.026
 seventy+0.024
 deliber+0.023
riger+0.022
Neg logit contributions
by-0.074
icker-0.072
?-0.069
ulum-0.068
 each-0.065
Tokenas
Token ID292
Feature activation-1.18e-01

Pos logit contributions
by+0.004
icker+0.004
?+0.004
ulum+0.003
 something+0.003
Neg logit contributions
 gor-0.004
 Gor-0.004
n-0.004
 seventy-0.004
riger-0.003
Token.
Token ID13
Feature activation-2.93e-01

Pos logit contributions
n+0.003
 gor+0.003
 Gor+0.003
 seventy+0.003
riger+0.003
Neg logit contributions
by-0.002
icker-0.002
?-0.002
ulum-0.001
 pens-0.001
Token They
Token ID1119
Feature activation-9.04e-01

Pos logit contributions
 gor+0.006
 Ph+0.006
 deliber+0.006
 Gor+0.006
n+0.006
Neg logit contributions
by-0.005
icker-0.004
?-0.004
important-0.004
ulum-0.004
Token had
Token ID550
Feature activation+3.60e-01

Pos logit contributions
 gor+0.015
n+0.015
 spe+0.015
 Gor+0.015
 seventy+0.014
Neg logit contributions
icker-0.019
by-0.019
ulum-0.018
?-0.018
own-0.016
Token never
Token ID1239
Feature activation+2.81e-01

Pos logit contributions
 gor+0.003
n+0.003
 Gor+0.003
 seventy+0.003
 deliber+0.003
Neg logit contributions
icker-0.003
by-0.003
?-0.003
ulum-0.003
 pens-0.002
Token went
Token ID1816
Feature activation-7.66e-01

Pos logit contributions
by+0.005
icker+0.005
?+0.004
ulum+0.004
 something+0.004
Neg logit contributions
 gor-0.006
 Gor-0.006
n-0.006
 seventy-0.006
riger-0.006
Token outside
Token ID2354
Feature activation-6.28e-01

Pos logit contributions
 gor+0.014
 Gor+0.013
n+0.013
 deliber+0.013
riger+0.012
Neg logit contributions
by-0.016
icker-0.016
ulum-0.014
?-0.014
 pens-0.013
Token during
Token ID1141
Feature activation-4.81e-01

Pos logit contributions
 gor+0.012
 Gor+0.012
 deliber+0.012
n+0.012
 seventy+0.011
Neg logit contributions
by-0.012
icker-0.012
ulum-0.011
?-0.010
 pens-0.010
Token a
Token ID257
Feature activation-7.10e-01

Pos logit contributions
 gor+0.009
 deliber+0.009
 spe+0.009
n+0.009
 Gor+0.008
Neg logit contributions
by-0.010
icker-0.009
ulum-0.008
important-0.008
?-0.008
Token storm
Token ID6388
Feature activation-2.57e+00

Pos logit contributions
 gor+0.014
 seventy+0.014
 Gor+0.014
n+0.013
 deliber+0.013
Neg logit contributions
by-0.013
icker-0.013
?-0.012
 something-0.011
ulum-0.011
Token again
Token ID757
Feature activation-5.37e-01

Pos logit contributions
 seventy+0.077
 Gor+0.075
fters+0.072
 eighty+0.072
 gor+0.071
Neg logit contributions
?-0.024
.-0.023
by-0.022
 each-0.020
icker-0.017
Token.
Token ID13
Feature activation-3.07e-01

Pos logit contributions
n+0.014
 gor+0.014
 Gor+0.013
 deliber+0.013
 spe+0.013
Neg logit contributions
icker-0.007
by-0.007
ulum-0.006
 pens-0.005
bie-0.005
Token<|endoftext|>
Token ID50256
Feature activation+2.75e-01

Pos logit contributions
 gor+0.006
 Gor+0.006
n+0.006
 deliber+0.006
 seventy+0.006
Neg logit contributions
by-0.006
icker-0.006
?-0.005
ulum-0.005
 pens-0.005
TokenOnce
Token ID7454
Feature activation-1.44e-01

Pos logit contributions
by+0.005
icker+0.005
?+0.004
ulum+0.004
 something+0.004
Neg logit contributions
 gor-0.006
 Gor-0.006
 seventy-0.005
 deliber-0.005
n-0.005
Token upon
Token ID2402
Feature activation-2.55e-01

Pos logit contributions
 Gor+0.004
 gor+0.004
riger+0.004
 seventy+0.004
 spe+0.004
Neg logit contributions
by-0.002
icker-0.002
?-0.002
bie-0.001
 each-0.001
Token And
Token ID843
Feature activation+5.36e-03

Pos logit contributions
by+0.001
icker+0.001
?+0.001
ulum+0.001
 pens+0.001
Neg logit contributions
 Gor-0.001
 gor-0.001
n-0.001
 seventy-0.001
 deliber-0.001
Token I
Token ID314
Feature activation+2.18e-01

Pos logit contributions
by+0.000
icker+0.000
?+0.000
ulum+0.000
 pens+0.000
Neg logit contributions
 gor-0.000
 Gor-0.000
n-0.000
 seventy-0.000
 deliber-0.000
Token'm
Token ID1101
Feature activation+1.02e-01

Pos logit contributions
icker+0.004
by+0.004
ulum+0.004
 pens+0.004
?+0.004
Neg logit contributions
n-0.004
 Gor-0.004
 gor-0.004
 deliber-0.004
riger-0.004
Token sorry
Token ID7926
Feature activation+1.50e-01

Pos logit contributions
by+0.002
icker+0.002
?+0.002
ulum+0.002
 pens+0.002
Neg logit contributions
 gor-0.002
 Gor-0.002
n-0.002
 seventy-0.002
 deliber-0.002
Token,
Token ID11
Feature activation-2.50e-01

Pos logit contributions
icker+0.003
by+0.003
ulum+0.003
 pens+0.003
important+0.002
Neg logit contributions
 gor-0.003
 Gor-0.003
n-0.003
 seventy-0.003
oke-0.003
Token ostr
Token ID42387
Feature activation-2.57e+00

Pos logit contributions
 gor+0.005
 deliber+0.005
 Gor+0.005
riger+0.005
n+0.005
Neg logit contributions
by-0.004
?-0.004
icker-0.004
 something-0.003
 each-0.003
Tokenich
Token ID488
Feature activation-7.35e-01

Pos logit contributions
 deliber+0.060
 gor+0.057
 Gor+0.057
riger+0.056
 seventy+0.056
Neg logit contributions
by-0.041
icker-0.039
?-0.037
ina-0.031
ulum-0.031
Token.
Token ID13
Feature activation+5.68e-01

Pos logit contributions
 gor+0.016
 Gor+0.016
n+0.016
 ostr+0.015
oke+0.015
Neg logit contributions
icker-0.014
ulum-0.013
by-0.012
 pens-0.012
 gears-0.011
Token Please
Token ID4222
Feature activation+1.36e-01

Pos logit contributions
icker+0.009
?+0.009
by+0.009
 They+0.008
ulum+0.008
Neg logit contributions
 gor-0.012
 deliber-0.012
fters-0.012
n-0.012
 seventy-0.012
Token come
Token ID1282
Feature activation-4.77e-01

Pos logit contributions
by+0.003
icker+0.003
ulum+0.003
 pens+0.003
?+0.003
Neg logit contributions
n-0.002
 gor-0.002
 Gor-0.002
 seventy-0.002
riger-0.002
Token out
Token ID503
Feature activation-1.02e+00

Pos logit contributions
 Gor+0.009
 gor+0.009
n+0.009
 seventy+0.008
riger+0.008
Neg logit contributions
icker-0.009
by-0.009
ulum-0.008
?-0.008
 pens-0.008
Token always
Token ID1464
Feature activation+2.04e-01

Pos logit contributions
icker+0.004
by+0.004
ulum+0.004
own+0.003
 pens+0.003
Neg logit contributions
 gor-0.004
 Gor-0.004
n-0.004
 ostr-0.004
 seventy-0.004
Token wanted
Token ID2227
Feature activation+3.80e-01

Pos logit contributions
icker+0.005
by+0.004
ulum+0.004
own+0.004
 pens+0.004
Neg logit contributions
 seventy-0.003
 Gor-0.003
 deliber-0.003
n-0.003
 gor-0.003
Token to
Token ID284
Feature activation-4.15e-01

Pos logit contributions
icker+0.009
ulum+0.008
by+0.008
ilts+0.007
?+0.007
Neg logit contributions
 gor-0.007
n-0.007
 Gor-0.007
 deliber-0.006
 ostr-0.006
Token see
Token ID766
Feature activation-4.65e-01

Pos logit contributions
n+0.007
 gor+0.006
 Gor+0.006
 ostr+0.006
 deliber+0.005
Neg logit contributions
icker-0.010
by-0.010
ulum-0.009
 pens-0.009
 Toby-0.009
Token her
Token ID607
Feature activation+8.56e-02

Pos logit contributions
 gor+0.011
 Gor+0.010
 ostr+0.010
n+0.010
 spe+0.009
Neg logit contributions
icker-0.008
by-0.008
ulum-0.007
?-0.006
own-0.006
Token ostr
Token ID42387
Feature activation-2.57e+00

Pos logit contributions
icker+0.002
by+0.002
?+0.001
ulum+0.001
 pens+0.001
Neg logit contributions
 gor-0.002
 Gor-0.002
n-0.002
 ostr-0.002
 deliber-0.002
Tokenich
Token ID488
Feature activation-5.45e-01

Pos logit contributions
 deliber+0.067
 seventy+0.064
riger+0.064
 gor+0.063
 Gor+0.062
Neg logit contributions
by-0.036
icker-0.034
?-0.030
bie-0.026
ina-0.026
Token friend
Token ID1545
Feature activation-2.27e-01

Pos logit contributions
 gor+0.011
 Gor+0.011
n+0.011
 ostr+0.010
 seventy+0.010
Neg logit contributions
icker-0.010
by-0.010
ulum-0.009
?-0.009
 pens-0.008
Token.
Token ID13
Feature activation+4.91e-01

Pos logit contributions
 gor+0.006
 Gor+0.005
n+0.005
 ostr+0.005
 seventy+0.005
Neg logit contributions
icker-0.003
by-0.003
ulum-0.003
?-0.003
 pens-0.002
Token<|endoftext|>
Token ID50256
Feature activation+5.50e-01

Pos logit contributions
by+0.010
icker+0.010
?+0.009
ulum+0.009
 pens+0.008
Neg logit contributions
 Gor-0.009
 gor-0.009
n-0.009
 deliber-0.009
 seventy-0.008
TokenOnce
Token ID7454
Feature activation-1.74e-01

Pos logit contributions
by+0.008
?+0.008
icker+0.008
 something+0.007
ulum+0.007
Neg logit contributions
 gor-0.013
 Gor-0.013
 seventy-0.013
riger-0.012
 ostr-0.012
Token�
Token ID95
Feature activation-2.96e-01

Pos logit contributions
n+0.005
riger+0.005
 Gor+0.005
 gor+0.004
 spe+0.004
Neg logit contributions
by-0.007
 pens-0.006
bie-0.006
icker-0.006
 pen-0.006
Token€
Token ID26391
Feature activation+5.20e-01

Pos logit contributions
n+0.005
 Gor+0.005
riger+0.005
 gor+0.005
 deliber+0.005
Neg logit contributions
by-0.007
icker-0.006
 pens-0.006
?-0.006
bie-0.006
TokenHi
Token ID17250
Feature activation+1.93e-01

Pos logit contributions
by+0.010
icker+0.010
?+0.009
ulum+0.008
 pens+0.008
Neg logit contributions
 gor-0.010
 Gor-0.010
n-0.010
 deliber-0.009
riger-0.009
Token Miss
Token ID4544
Feature activation-1.36e+00

Pos logit contributions
icker+0.004
by+0.004
ulum+0.004
 pens+0.003
important+0.003
Neg logit contributions
 Gor-0.004
 gor-0.004
n-0.003
 Ph-0.003
 seventy-0.003
Token O
Token ID440
Feature activation+3.25e-01

Pos logit contributions
 Gor+0.026
 gor+0.026
 seventy+0.024
oke+0.024
n+0.024
Neg logit contributions
icker-0.028
by-0.027
ulum-0.024
?-0.023
 pens-0.023
Tokenstr
Token ID2536
Feature activation-2.56e+00

Pos logit contributions
by+0.009
icker+0.009
 pens+0.009
 something+0.009
ulum+0.009
Neg logit contributions
n-0.003
oke-0.003
 Gor-0.003
 gor-0.003
riger-0.003
Tokenich
Token ID488
Feature activation-8.74e-01

Pos logit contributions
 gor+0.054
 deliber+0.053
 Gor+0.053
riger+0.052
 seventy+0.052
Neg logit contributions
by-0.044
icker-0.043
?-0.040
ulum-0.037
ina-0.035
Token!
Token ID0
Feature activation-7.58e-02

Pos logit contributions
 Gor+0.018
n+0.017
 gor+0.017
 ostr+0.017
oke+0.017
Neg logit contributions
icker-0.019
ulum-0.016
by-0.015
 pens-0.015
ilts-0.015
Token That
Token ID1320
Feature activation-2.21e-01

Pos logit contributions
 Gor+0.001
 gor+0.001
 seventy+0.001
n+0.001
 deliber+0.001
Neg logit contributions
by-0.002
icker-0.002
ulum-0.001
?-0.001
 pens-0.001
Token sounds
Token ID5238
Feature activation+3.22e-01

Pos logit contributions
 Gor+0.004
 gor+0.004
 ostr+0.004
n+0.003
 spe+0.003
Neg logit contributions
icker-0.005
by-0.005
ulum-0.005
?-0.004
 pens-0.004
Token funny
Token ID8258
Feature activation+3.53e-01

Pos logit contributions
icker+0.007
by+0.007
ulum+0.006
bie+0.006
?+0.006
Neg logit contributions
 gor-0.005
n-0.005
 Gor-0.005
 deliber-0.005
 delighted-0.005
Token said
Token ID531
Feature activation-7.25e-01

Pos logit contributions
icker+0.002
ulum+0.002
utes+0.002
ute+0.002
by+0.002
Neg logit contributions
 gob-0.001
 seventy-0.001
 deliber-0.001
 delighted-0.001
 spe-0.001
Token,
Token ID11
Feature activation-7.47e-01

Pos logit contributions
 gor+0.006
 seventy+0.006
 Gor+0.006
 deliber+0.005
 gob+0.005
Neg logit contributions
icker-0.022
ulum-0.022
by-0.022
bie-0.020
ky-0.020
Token "
Token ID366
Feature activation-2.28e-01

Pos logit contributions
 Gor+0.013
 gor+0.013
n+0.013
 seventy+0.013
 deliber+0.012
Neg logit contributions
by-0.016
icker-0.016
?-0.014
ulum-0.014
 pens-0.013
TokenThank
Token ID10449
Feature activation+6.76e-02

Pos logit contributions
 Gor+0.004
 gor+0.004
n+0.004
 seventy+0.004
 deliber+0.004
Neg logit contributions
icker-0.005
by-0.005
?-0.005
ulum-0.004
 pens-0.004
Token you
Token ID345
Feature activation+5.74e-01

Pos logit contributions
by+0.001
icker+0.001
?+0.001
ulum+0.001
 pens+0.001
Neg logit contributions
 gor-0.001
 Gor-0.001
n-0.001
 deliber-0.001
 seventy-0.001
Token Aunt
Token ID38074
Feature activation-2.55e+00

Pos logit contributions
icker+0.015
ulum+0.014
by+0.014
 pens+0.013
important+0.013
Neg logit contributions
 Gor-0.010
 gor-0.009
 seventy-0.007
 ostr-0.007
n-0.007
Tokenie
Token ID494
Feature activation-4.91e-01

Pos logit contributions
riger+0.068
 deliber+0.067
 gor+0.067
 seventy+0.066
n+0.066
Neg logit contributions
by-0.030
?-0.027
icker-0.026
ina-0.021
bie-0.021
Token!"
Token ID2474
Feature activation+1.23e+00

Pos logit contributions
 Gor+0.009
 gor+0.009
 ostr+0.008
n+0.008
oke+0.007
Neg logit contributions
icker-0.013
ulum-0.012
pers-0.010
 elbows-0.010
 pens-0.010
Token<|endoftext|>
Token ID50256
Feature activation+9.84e-02

Pos logit contributions
icker+0.030
by+0.028
oppy+0.027
important+0.026
eds+0.025
Neg logit contributions
 Gor-0.023
 gor-0.022
 Lib-0.022
 deliber-0.021
 Once-0.019
TokenOnce
Token ID7454
Feature activation+2.85e-01

Pos logit contributions
by+0.002
icker+0.002
?+0.002
ulum+0.002
 pens+0.002
Neg logit contributions
 gor-0.002
 Gor-0.002
n-0.002
 deliber-0.002
 seventy-0.002
Token upon
Token ID2402
Feature activation-1.99e-02

Pos logit contributions
by+0.004
icker+0.004
?+0.004
bie+0.003
ulum+0.003
Neg logit contributions
 gor-0.007
 Gor-0.007
 seventy-0.006
riger-0.006
 deliber-0.006
Token he
Token ID339
Feature activation-8.36e-02

Pos logit contributions
 gor+0.007
 Gor+0.006
n+0.006
 ostr+0.006
 deliber+0.006
Neg logit contributions
icker-0.009
important-0.008
ulum-0.008
by-0.008
ilts-0.008
Token would
Token ID561
Feature activation+1.88e-01

Pos logit contributions
 Gor+0.001
n+0.001
 gor+0.001
 gob+0.001
 seventy+0.001
Neg logit contributions
icker-0.002
ulum-0.002
 pens-0.002
 chess-0.002
 ink-0.002
Token take
Token ID1011
Feature activation-5.91e-01

Pos logit contributions
icker+0.005
by+0.004
ulum+0.004
 pens+0.004
attered+0.004
Neg logit contributions
n-0.003
 gor-0.003
 Gor-0.003
 seventy-0.003
 gob-0.003
Token the
Token ID262
Feature activation+1.24e-01

Pos logit contributions
n+0.013
 gor+0.013
 Gor+0.012
 deliber+0.012
 seventy+0.011
Neg logit contributions
icker-0.011
ulum-0.010
by-0.009
ina-0.009
?-0.009
Token av
Token ID1196
Feature activation-7.21e-01

Pos logit contributions
icker+0.003
by+0.002
ulum+0.002
?+0.002
own+0.002
Neg logit contributions
 gor-0.002
 Gor-0.002
 ostr-0.002
 seventy-0.002
 spe-0.002
Tokenoc
Token ID420
Feature activation-2.55e+00

Pos logit contributions
 Ph+0.010
 gor+0.010
 deliber+0.009
 inconvenience+0.009
n+0.009
Neg logit contributions
by-0.019
icker-0.017
?-0.016
 pens-0.015
 each-0.015
Tokenados
Token ID22484
Feature activation-5.82e-01

Pos logit contributions
 gor+0.048
 Gor+0.047
n+0.045
 seventy+0.045
 deliber+0.045
Neg logit contributions
by-0.050
icker-0.049
?-0.046
ulum-0.043
 something-0.041
Token and
Token ID290
Feature activation+2.11e-01

Pos logit contributions
n+0.011
 Gor+0.011
 gor+0.010
riger+0.010
 seventy+0.010
Neg logit contributions
icker-0.012
ulum-0.011
by-0.011
bie-0.009
 Raz-0.009
Token scatter
Token ID41058
Feature activation+2.87e-01

Pos logit contributions
icker+0.004
by+0.004
?+0.004
ulum+0.004
ina+0.004
Neg logit contributions
 gor-0.004
 Gor-0.004
n-0.004
 seventy-0.004
 deliber-0.004
Token them
Token ID606
Feature activation-9.34e-01

Pos logit contributions
icker+0.006
ulum+0.006
by+0.006
ina+0.005
?+0.005
Neg logit contributions
 gor-0.005
 Gor-0.005
n-0.005
 spe-0.005
 deliber-0.004
Token around
Token ID1088
Feature activation+3.74e-01

Pos logit contributions
 gor+0.015
n+0.015
 Gor+0.015
 seventy+0.014
 gob+0.014
Neg logit contributions
icker-0.021
ulum-0.020
by-0.019
 Brother-0.017
?-0.017
Token go
Token ID467
Feature activation-4.74e-01

Pos logit contributions
icker+0.005
by+0.004
ulum+0.004
?+0.004
bie+0.004
Neg logit contributions
n-0.004
 gor-0.004
 Gor-0.004
 deliber-0.004
 ostr-0.004
Token to
Token ID284
Feature activation+1.83e-01

Pos logit contributions
 gor+0.009
 Gor+0.009
n+0.009
 deliber+0.009
 seventy+0.008
Neg logit contributions
icker-0.009
by-0.009
ulum-0.008
?-0.008
 pens-0.007
Token the
Token ID262
Feature activation+9.32e-01

Pos logit contributions
icker+0.004
by+0.003
attered+0.003
ulum+0.003
important+0.003
Neg logit contributions
 gor-0.004
n-0.004
 Gor-0.004
 deliber-0.004
 seventy-0.003
Token land
Token ID1956
Feature activation+3.30e-02

Pos logit contributions
icker+0.024
by+0.024
bie+0.022
ulum+0.022
ot+0.021
Neg logit contributions
 gor-0.013
 Gor-0.013
 ostr-0.012
 seventy-0.010
riger-0.010
Token of
Token ID286
Feature activation+2.25e-01

Pos logit contributions
icker+0.001
by+0.001
ulum+0.001
 Brother+0.001
 pens+0.001
Neg logit contributions
 gor-0.000
 Gor-0.000
n-0.000
 deliber-0.000
 seventy-0.000
Token Imp
Token ID9855
Feature activation-2.52e+00

Pos logit contributions
by+0.002
icker+0.002
?+0.002
ulum+0.002
 something+0.002
Neg logit contributions
 gor-0.006
 Gor-0.006
n-0.006
 deliber-0.006
 seventy-0.006
Tokenressive
Token ID3314
Feature activation-5.93e-01

Pos logit contributions
 gor+0.050
 Gor+0.049
 deliber+0.048
oke+0.048
 seventy+0.047
Neg logit contributions
by-0.046
icker-0.043
?-0.042
ulum-0.041
own-0.038
Token Things
Token ID11597
Feature activation-4.78e-01

Pos logit contributions
 gor+0.012
 Gor+0.011
n+0.010
riger+0.010
 seventy+0.010
Neg logit contributions
icker-0.013
by-0.012
ot-0.010
bie-0.010
oppy-0.010
Token.
Token ID13
Feature activation-5.21e-02

Pos logit contributions
n+0.009
 gor+0.009
 Gor+0.009
 deliber+0.009
riger+0.009
Neg logit contributions
icker-0.010
by-0.009
ulum-0.008
bie-0.008
 pens-0.007
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
 gor+0.001
 Gor+0.001
n+0.001
 deliber+0.001
 seventy+0.001
Neg logit contributions
by-0.001
icker-0.001
?-0.001
ulum-0.001
 pens-0.001
Token
Token ID198
Feature activation-7.22e-02

Pos logit contributions
 gor+0.004
 Gor+0.004
riger+0.004
 seventy+0.004
 deliber+0.004
Neg logit contributions
by-0.003
icker-0.003
?-0.003
 something-0.002
 each-0.002
Token its
Token ID663
Feature activation-2.92e-01

Pos logit contributions
 gor+0.005
 spe+0.005
 Gor+0.005
 ostr+0.005
 seventy+0.004
Neg logit contributions
icker-0.004
by-0.004
ulum-0.003
ina-0.003
ico-0.003
Token home
Token ID1363
Feature activation-5.54e-01

Pos logit contributions
 gor+0.006
riger+0.006
 deliber+0.006
 Gor+0.006
 seventy+0.006
Neg logit contributions
by-0.005
icker-0.005
?-0.005
 something-0.004
ulum-0.004
Token in
Token ID287
Feature activation+1.25e-01

Pos logit contributions
n+0.012
 gor+0.012
 Gor+0.012
 seventy+0.011
 spe+0.011
Neg logit contributions
by-0.009
icker-0.009
ulum-0.008
Simon-0.007
 Toby-0.007
Token a
Token ID257
Feature activation+3.61e-01

Pos logit contributions
by+0.002
icker+0.002
?+0.002
ulum+0.002
 pens+0.001
Neg logit contributions
 gor-0.003
 Gor-0.003
n-0.003
 deliber-0.003
 seventy-0.003
Token big
Token ID1263
Feature activation-5.96e-01

Pos logit contributions
by+0.007
icker+0.007
?+0.007
ulum+0.006
 something+0.006
Neg logit contributions
 gor-0.007
 Gor-0.006
n-0.006
 deliber-0.006
 seventy-0.006
Token storm
Token ID6388
Feature activation-2.50e+00

Pos logit contributions
 gor+0.009
 Gor+0.008
n+0.008
 deliber+0.008
 seventy+0.008
Neg logit contributions
by-0.014
icker-0.014
?-0.013
ulum-0.013
 something-0.012
Token.
Token ID13
Feature activation+5.62e-02

Pos logit contributions
 gor+0.083
 Gor+0.083
 seventy+0.082
 deliber+0.082
riger+0.080
Neg logit contributions
by-0.013
?-0.011
icker-0.010
 each-0.006
ulum-0.006
Token The
Token ID383
Feature activation+1.50e-02

Pos logit contributions
by+0.001
icker+0.001
bie+0.001
?+0.001
ulum+0.001
Neg logit contributions
 Gor-0.001
n-0.001
 deliber-0.001
 gor-0.001
 seventy-0.001
Token little
Token ID1310
Feature activation-4.31e-01

Pos logit contributions
by+0.000
?+0.000
 something+0.000
 really+0.000
icker+0.000
Neg logit contributions
riger-0.000
 deliber-0.000
 seventy-0.000
Aud-0.000
oke-0.000
Token pine
Token ID20161
Feature activation-5.05e-01

Pos logit contributions
riger+0.008
 deliber+0.007
n+0.007
fters+0.007
 gor+0.007
Neg logit contributions
by-0.009
?-0.008
icker-0.008
 something-0.008
 each-0.008
Token tree
Token ID5509
Feature activation-4.37e-01

Pos logit contributions
riger+0.009
 seventy+0.009
 deliber+0.009
 gor+0.009
 Ph+0.008
Neg logit contributions
by-0.010
?-0.009
icker-0.009
 something-0.008
ulum-0.008
Token,
Token ID11
Feature activation-3.36e-01

Pos logit contributions
icker+0.008
by+0.008
ulum+0.007
?+0.007
important+0.007
Neg logit contributions
 gor-0.007
 Gor-0.007
n-0.006
 seventy-0.006
 deliber-0.006
Token a
Token ID257
Feature activation-5.17e-01

Pos logit contributions
 Gor+0.007
 gor+0.007
 Ph+0.006
 ostr+0.006
 Lib+0.006
Neg logit contributions
icker-0.008
by-0.007
ulum-0.007
attered-0.006
important-0.006
Token small
Token ID1402
Feature activation+1.63e-01

Pos logit contributions
 Gor+0.008
 gor+0.008
 ostr+0.007
 spe+0.007
 seventy+0.007
Neg logit contributions
icker-0.013
by-0.012
ulum-0.012
?-0.011
own-0.011
Token bird
Token ID6512
Feature activation-1.07e+00

Pos logit contributions
icker+0.004
by+0.004
ulum+0.004
?+0.003
own+0.003
Neg logit contributions
 gor-0.002
 Gor-0.002
 ostr-0.002
 seventy-0.002
 spe-0.002
Token named
Token ID3706
Feature activation-6.71e-01

Pos logit contributions
 gor+0.024
 Gor+0.024
n+0.023
 deliber+0.023
 seventy+0.023
Neg logit contributions
by-0.017
icker-0.016
?-0.015
ulum-0.014
 pens-0.013
Token Bree
Token ID40110
Feature activation-2.48e+00

Pos logit contributions
 gor+0.014
 Gor+0.014
n+0.014
 seventy+0.013
 deliber+0.013
Neg logit contributions
by-0.012
icker-0.012
?-0.010
ulum-0.010
 pens-0.009
Token.
Token ID13
Feature activation+3.78e-01

Pos logit contributions
 gor+0.056
 Gor+0.055
n+0.054
 seventy+0.053
 deliber+0.053
Neg logit contributions
icker-0.039
by-0.039
?-0.035
ulum-0.033
 pens-0.031
Token Bree
Token ID40110
Feature activation-2.02e+00

Pos logit contributions
by+0.008
icker+0.008
?+0.007
ulum+0.007
 pens+0.006
Neg logit contributions
 Gor-0.007
n-0.007
 gor-0.007
 deliber-0.007
 ostr-0.006
Token saw
Token ID2497
Feature activation-9.97e-02

Pos logit contributions
n+0.038
 Gor+0.037
 gor+0.036
 deliber+0.035
 seventy+0.035
Neg logit contributions
icker-0.042
by-0.039
?-0.036
ulum-0.035
 pens-0.033
Token the
Token ID262
Feature activation-1.40e-01

Pos logit contributions
 gor+0.002
 Gor+0.002
riger+0.002
 seventy+0.002
 deliber+0.002
Neg logit contributions
by-0.001
icker-0.001
?-0.001
 something-0.001
 each-0.001
Token big
Token ID1263
Feature activation-6.32e-01

Pos logit contributions
 deliber+0.003
riger+0.003
oke+0.003
n+0.003
 gor+0.003
Neg logit contributions
by-0.002
 pen-0.002
?-0.002
 pens-0.002
icker-0.002
Token He
Token ID679
Feature activation-5.18e-01

Pos logit contributions
 deliber+0.003
 Gor+0.003
n+0.003
 gor+0.003
 seventy+0.002
Neg logit contributions
by-0.003
icker-0.003
?-0.003
 pens-0.003
 pen-0.003
Token smiled
Token ID13541
Feature activation-4.23e-01

Pos logit contributions
n+0.009
 gob+0.009
 Gor+0.008
 gor+0.008
 delighted+0.008
Neg logit contributions
icker-0.012
by-0.011
?-0.011
ulum-0.010
own-0.009
Token as
Token ID355
Feature activation-2.36e-01

Pos logit contributions
 delighted+0.006
 Gor+0.006
 thrilled+0.005
 gor+0.005
 seventy+0.005
Neg logit contributions
icker-0.011
by-0.010
ulum-0.010
ison-0.009
oppy-0.009
Token the
Token ID262
Feature activation+2.47e-01

Pos logit contributions
 gor+0.004
 Gor+0.004
n+0.004
 spe+0.004
 delighted+0.004
Neg logit contributions
icker-0.005
by-0.005
ulum-0.005
ilts-0.005
eds-0.005
Token l
Token ID300
Feature activation+3.39e-03

Pos logit contributions
by+0.005
icker+0.005
?+0.005
ulum+0.005
 pens+0.004
Neg logit contributions
 gor-0.004
 Gor-0.004
n-0.004
 deliber-0.004
 seventy-0.004
Tokenibr
Token ID2889
Feature activation-2.47e+00

Pos logit contributions
by+0.000
icker+0.000
?+0.000
ulum+0.000
 something+0.000
Neg logit contributions
 gor-0.000
 Gor-0.000
n-0.000
 seventy-0.000
 deliber-0.000
Tokenarians
Token ID13517
Feature activation-7.60e-01

Pos logit contributions
 ju+0.044
 deliber+0.044
 gor+0.043
 Gor+0.043
 delighted+0.043
Neg logit contributions
by-0.048
icker-0.048
?-0.041
ulum-0.041
own-0.041
Token watched
Token ID7342
Feature activation+1.33e-02

Pos logit contributions
 gor+0.013
n+0.013
 Gor+0.013
 seventy+0.013
 deliber+0.013
Neg logit contributions
by-0.016
icker-0.015
?-0.014
ulum-0.014
 something-0.013
Token him
Token ID683
Feature activation+5.09e-01

Pos logit contributions
by+0.000
icker+0.000
ulum+0.000
?+0.000
important+0.000
Neg logit contributions
 gor-0.000
n-0.000
 Gor-0.000
 seventy-0.000
 deliber-0.000
Token having
Token ID1719
Feature activation+1.08e-01

Pos logit contributions
icker+0.011
by+0.011
ulum+0.010
?+0.010
 pens+0.009
Neg logit contributions
 gor-0.009
n-0.009
 Gor-0.008
 seventy-0.008
 gob-0.008
Token fun
Token ID1257
Feature activation-4.07e-01

Pos logit contributions
icker+0.003
by+0.003
ina+0.002
ulum+0.002
oppy+0.002
Neg logit contributions
 deliber-0.002
n-0.002
 gor-0.002
 delighted-0.001
 Gor-0.001
Token When
Token ID1649
Feature activation-3.27e-01

Pos logit contributions
 Gor+0.004
 seventy+0.004
 deliber+0.004
 gor+0.004
 Once+0.004
Neg logit contributions
by-0.005
icker-0.005
?-0.004
ulum-0.004
 pens-0.004
Token he
Token ID339
Feature activation+3.58e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
 deliber+0.005
 spe+0.005
n+0.005
Neg logit contributions
icker-0.008
by-0.008
attered-0.007
important-0.007
ulum-0.007
Token finally
Token ID3443
Feature activation+5.46e-01

Pos logit contributions
by+0.006
?+0.005
 really+0.005
 something+0.004
 each+0.004
Neg logit contributions
 gor-0.009
 inconvenience-0.008
 seventy-0.008
 Gor-0.008
umm-0.008
Token reached
Token ID4251
Feature activation-1.96e-01

Pos logit contributions
by+0.010
?+0.009
 something+0.008
icker+0.008
 each+0.008
Neg logit contributions
 gor-0.012
 Gor-0.012
 seventy-0.011
riger-0.011
go-0.011
Token the
Token ID262
Feature activation+5.82e-01

Pos logit contributions
 gor+0.004
 seventy+0.004
 Gor+0.004
 deliber+0.004
 spe+0.004
Neg logit contributions
icker-0.004
by-0.003
ulum-0.003
 pens-0.003
important-0.003
Token end
Token ID886
Feature activation+1.32e+00

Pos logit contributions
icker+0.013
by+0.013
bie+0.012
?+0.012
ulum+0.012
Neg logit contributions
 seventy-0.009
 spe-0.009
 ostr-0.009
 Gor-0.009
 gor-0.009
Token,
Token ID11
Feature activation-1.62e-02

Pos logit contributions
by+0.035
icker+0.034
ulum+0.033
?+0.033
 pens+0.031
Neg logit contributions
 gor-0.015
 Gor-0.014
n-0.014
 deliber-0.014
 seventy-0.013
Token he
Token ID339
Feature activation+7.30e-01

Pos logit contributions
 gor+0.000
 Gor+0.000
n+0.000
 deliber+0.000
 seventy+0.000
Neg logit contributions
icker-0.000
by-0.000
ulum-0.000
?-0.000
 pens-0.000
Token marvel
Token ID22672
Feature activation-2.56e-01

Pos logit contributions
by+0.013
icker+0.013
?+0.012
ulum+0.011
 something+0.011
Neg logit contributions
 gor-0.015
 Gor-0.015
n-0.014
 deliber-0.014
 seventy-0.014
Tokened
Token ID276
Feature activation+3.37e-01

Pos logit contributions
 seventy+0.003
 delighted+0.003
 Gor+0.003
 gor+0.003
oke+0.003
Neg logit contributions
ulum-0.007
icker-0.006
by-0.006
?-0.006
 pens-0.006
Token when
Token ID618
Feature activation+4.09e-01

Pos logit contributions
icker+0.008
by+0.008
ulum+0.007
?+0.007
 pens+0.007
Neg logit contributions
 gor-0.005
 Gor-0.005
 seventy-0.005
 spe-0.005
 delighted-0.005
Tokene
Token ID68
Feature activation-2.52e-01

Pos logit contributions
 gor+0.013
 Gor+0.013
 deliber+0.013
 seventy+0.012
n+0.012
Neg logit contributions
by-0.011
icker-0.010
?-0.010
ulum-0.009
 something-0.009
Token,
Token ID11
Feature activation-4.38e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
 deliber+0.005
 seventy+0.005
n+0.005
Neg logit contributions
by-0.005
icker-0.004
?-0.004
ulum-0.004
 something-0.004
Token bye
Token ID33847
Feature activation-7.75e-01

Pos logit contributions
 deliber+0.008
n+0.008
riger+0.008
fters+0.008
 spe+0.008
Neg logit contributions
?-0.008
by-0.008
 each-0.007
 something-0.007
icker-0.007
Token,
Token ID11
Feature activation-3.01e-01

Pos logit contributions
 gor+0.015
 Gor+0.014
n+0.014
 seventy+0.013
 deliber+0.013
Neg logit contributions
icker-0.015
by-0.015
?-0.014
ulum-0.013
 pens-0.013
Token tiger
Token ID26241
Feature activation-2.64e-01

Pos logit contributions
 deliber+0.006
riger+0.006
n+0.006
fters+0.006
rd+0.005
Neg logit contributions
?-0.006
by-0.005
 something-0.005
 each-0.005
 Toby-0.005
Token,"
Token ID553
Feature activation+1.47e+00

Pos logit contributions
 gor+0.005
 Gor+0.005
n+0.005
 ostr+0.005
 seventy+0.005
Neg logit contributions
icker-0.005
by-0.005
ulum-0.005
 pens-0.005
?-0.004
Token Tom
Token ID4186
Feature activation+7.10e-01

Pos logit contributions
by+0.036
icker+0.035
bie+0.031
ina+0.030
ulum+0.030
Neg logit contributions
 gor-0.024
 ostr-0.022
 Gor-0.022
 deliber-0.022
 Ph-0.022
Token and
Token ID290
Feature activation-1.21e-01

Pos logit contributions
by+0.011
?+0.010
icker+0.010
ulum+0.009
 each+0.008
Neg logit contributions
 gor-0.016
 Gor-0.016
riger-0.016
 deliber-0.016
n-0.016
Token Lily
Token ID20037
Feature activation+2.86e-01

Pos logit contributions
 gor+0.003
 Gor+0.003
n+0.003
riger+0.003
 deliber+0.003
Neg logit contributions
by-0.002
icker-0.001
?-0.001
 each-0.001
 something-0.001
Token say
Token ID910
Feature activation-1.92e-01

Pos logit contributions
by+0.006
icker+0.006
?+0.006
ulum+0.005
 each+0.005
Neg logit contributions
 gor-0.005
 Gor-0.005
 deliber-0.005
n-0.005
 seventy-0.005
Token.
Token ID13
Feature activation-4.23e-01

Pos logit contributions
n+0.005
 seventy+0.005
 gor+0.005
 Gor+0.004
 deliber+0.004
Neg logit contributions
by-0.003
ulum-0.002
icker-0.002
 pens-0.002
bie-0.002
Token and
Token ID290
Feature activation+4.69e-01

Pos logit contributions
 Gor+0.004
 gor+0.004
 deliber+0.004
 delighted+0.004
 following+0.004
Neg logit contributions
icker-0.009
ulum-0.008
ilts-0.008
oppy-0.008
 pens-0.007
Token soon
Token ID2582
Feature activation+3.73e-01

Pos logit contributions
icker+0.009
by+0.009
?+0.008
ulum+0.008
 pens+0.007
Neg logit contributions
 gor-0.009
 Gor-0.009
n-0.009
 deliber-0.009
 seventy-0.009
Token everyone
Token ID2506
Feature activation-2.25e-01

Pos logit contributions
by+0.006
icker+0.006
?+0.005
ulum+0.005
 something+0.005
Neg logit contributions
 gor-0.008
 Gor-0.008
n-0.008
 seventy-0.008
 deliber-0.008
Token saw
Token ID2497
Feature activation+3.88e-01

Pos logit contributions
 gor+0.005
 seventy+0.005
 Gor+0.005
riger+0.005
 deliber+0.005
Neg logit contributions
by-0.004
icker-0.003
?-0.003
 each-0.003
 something-0.003
Token the
Token ID262
Feature activation-1.10e-01

Pos logit contributions
icker+0.007
by+0.007
ilts+0.006
ulum+0.006
bie+0.006
Neg logit contributions
 gor-0.008
 Gor-0.008
 amaz-0.007
 ostr-0.007
 delighted-0.007
Token bone
Token ID9970
Feature activation+1.59e+00

Pos logit contributions
 gor+0.003
n+0.003
 Gor+0.003
riger+0.003
 seventy+0.003
Neg logit contributions
by-0.001
?-0.001
icker-0.001
ulum-0.001
 something-0.001
Token.
Token ID13
Feature activation+6.74e-01

Pos logit contributions
icker+0.021
by+0.021
ulum+0.016
bie+0.016
?+0.016
Neg logit contributions
 gor-0.040
 Gor-0.040
n-0.039
 seventy-0.039
 gob-0.039
Token They
Token ID1119
Feature activation-3.38e-01

Pos logit contributions
by+0.015
icker+0.014
eds+0.013
oppy+0.012
bie+0.012
Neg logit contributions
 Gor-0.012
 deliber-0.011
 amaz-0.011
 delighted-0.011
 Once-0.011
Token laughed
Token ID13818
Feature activation+1.12e+00

Pos logit contributions
 gor+0.006
 Gor+0.006
 seventy+0.006
n+0.006
 deliber+0.006
Neg logit contributions
icker-0.007
by-0.007
?-0.006
ulum-0.006
own-0.006
Token at
Token ID379
Feature activation+2.02e-01

Pos logit contributions
by+0.030
icker+0.030
bie+0.028
oppy+0.027
ison+0.026
Neg logit contributions
 delighted-0.015
 thrilled-0.013
 Gor-0.013
 gor-0.012
 gob-0.012
Token him
Token ID683
Feature activation+4.65e-01

Pos logit contributions
icker+0.003
by+0.003
ulum+0.003
?+0.003
ilts+0.003
Neg logit contributions
n-0.004
 gor-0.004
 deliber-0.004
 Gor-0.004
 amaz-0.004
Token in
Token ID287
Feature activation+4.96e-01

Pos logit contributions
by+0.021
icker+0.020
ulum+0.019
 pens+0.017
?+0.017
Neg logit contributions
 gor-0.019
n-0.019
 seventy-0.018
 Gor-0.018
riger-0.018
Token the
Token ID262
Feature activation+5.80e-01

Pos logit contributions
icker+0.010
by+0.010
ulum+0.008
attered+0.008
bie+0.008
Neg logit contributions
 gor-0.010
 spe-0.009
 seventy-0.009
 Gor-0.009
n-0.009
Token world
Token ID995
Feature activation-2.84e-01

Pos logit contributions
icker+0.014
by+0.014
isdom+0.012
ulum+0.012
important+0.012
Neg logit contributions
 gor-0.008
 ostr-0.008
riger-0.008
 spe-0.007
 Gor-0.007
Token,
Token ID11
Feature activation+3.18e-01

Pos logit contributions
 gor+0.007
 Gor+0.006
n+0.006
 seventy+0.006
riger+0.006
Neg logit contributions
by-0.004
icker-0.004
ulum-0.004
?-0.004
 pens-0.003
Token but
Token ID475
Feature activation-3.56e-01

Pos logit contributions
icker+0.007
by+0.007
ulum+0.006
?+0.005
own+0.005
Neg logit contributions
 gor-0.006
 Gor-0.006
 seventy-0.006
n-0.006
 ostr-0.005
Token also
Token ID635
Feature activation+1.72e+00

Pos logit contributions
 gor+0.007
 Gor+0.007
 deliber+0.007
n+0.007
 seventy+0.007
Neg logit contributions
by-0.006
icker-0.006
?-0.006
ulum-0.005
 something-0.005
Token see
Token ID766
Feature activation-4.85e-01

Pos logit contributions
icker+0.037
ulum+0.036
by+0.036
 gears+0.030
own+0.030
Neg logit contributions
 gor-0.030
n-0.029
 Gor-0.028
 seventy-0.028
 spe-0.027
Token you
Token ID345
Feature activation+4.24e-02

Pos logit contributions
 gor+0.010
 Gor+0.009
n+0.009
 ostr+0.009
 seventy+0.009
Neg logit contributions
icker-0.010
by-0.009
ulum-0.009
attered-0.007
own-0.007
Token,
Token ID11
Feature activation+1.56e-01

Pos logit contributions
icker+0.001
ulum+0.001
by+0.001
 pens+0.001
 gears+0.001
Neg logit contributions
n-0.001
 gor-0.001
 seventy-0.001
 Gor-0.001
fters-0.001
Token the
Token ID262
Feature activation-1.01e-01

Pos logit contributions
icker+0.003
by+0.003
ulum+0.003
?+0.003
 pens+0.003
Neg logit contributions
 gor-0.003
 Gor-0.003
n-0.003
 seventy-0.003
 ostr-0.003
Token dancer
Token ID38619
Feature activation+2.49e-01

Pos logit contributions
 gor+0.002
 ostr+0.002
 Gor+0.002
n+0.002
 seventy+0.002
Neg logit contributions
icker-0.002
by-0.002
ulum-0.002
?-0.002
oppy-0.002
Token have
Token ID423
Feature activation+3.39e-01

Pos logit contributions
n+0.002
 gor+0.002
 seventy+0.002
 Gor+0.002
 gob+0.002
Neg logit contributions
by-0.003
icker-0.003
ulum-0.002
bie-0.002
 pens-0.002
Token two
Token ID734
Feature activation-6.77e-01

Pos logit contributions
icker+0.008
ulum+0.006
by+0.006
ina+0.006
oppy+0.006
Neg logit contributions
n-0.007
 gor-0.007
 seventy-0.006
 spe-0.006
 Gor-0.006
Token cups
Token ID14180
Feature activation+5.86e-01

Pos logit contributions
 gor+0.013
 spe+0.012
n+0.012
 Gor+0.012
 ostr+0.012
Neg logit contributions
icker-0.014
by-0.013
ulum-0.013
?-0.011
own-0.011
Token for
Token ID329
Feature activation-6.96e-01

Pos logit contributions
icker+0.012
by+0.012
ulum+0.011
 Toby+0.009
 Raz+0.009
Neg logit contributions
n-0.010
 gor-0.010
 seventy-0.010
 Gor-0.010
riger-0.010
Token free
Token ID1479
Feature activation+2.40e-01

Pos logit contributions
n+0.013
 gor+0.012
 seventy+0.012
 deliber+0.011
 Gor+0.011
Neg logit contributions
icker-0.016
by-0.015
ulum-0.014
attered-0.013
ina-0.013
Token,"
Token ID553
Feature activation+1.43e+00

Pos logit contributions
icker+0.005
by+0.004
ulum+0.004
oppy+0.004
bie+0.004
Neg logit contributions
 gor-0.005
n-0.005
 Gor-0.005
 ostr-0.004
 spe-0.004
Token the
Token ID262
Feature activation-1.08e+00

Pos logit contributions
icker+0.027
by+0.027
ulum+0.025
ilts+0.023
own+0.023
Neg logit contributions
 gor-0.029
 Gor-0.028
 deliber-0.028
 seventy-0.028
 ostr-0.027
Token man
Token ID582
Feature activation+3.89e-01

Pos logit contributions
 gor+0.015
 Gor+0.015
n+0.014
riger+0.014
 deliber+0.014
Neg logit contributions
icker-0.027
by-0.027
?-0.025
ulum-0.024
own-0.023
Token said
Token ID531
Feature activation+4.93e-01

Pos logit contributions
by+0.006
?+0.006
icker+0.005
 each+0.005
 something+0.005
Neg logit contributions
 gor-0.009
riger-0.009
fters-0.009
 Gor-0.009
 inconvenience-0.008
Token.
Token ID13
Feature activation-6.51e-02

Pos logit contributions
by+0.005
icker+0.004
?+0.004
ulum+0.004
 pens+0.003
Neg logit contributions
 seventy-0.014
n-0.014
 gor-0.014
 Gor-0.014
 deliber-0.014
Token\n
Token ID198
Feature activation+2.48e-01

Pos logit contributions
 gor+0.001
 Gor+0.001
n+0.001
 deliber+0.001
 seventy+0.001
Neg logit contributions
by-0.001
icker-0.001
?-0.001
ulum-0.001
 something-0.001
Token to
Token ID284
Feature activation-3.49e-01

Pos logit contributions
icker+0.007
by+0.006
ulum+0.006
?+0.005
important+0.005
Neg logit contributions
 deliber-0.007
 gor-0.007
 Gor-0.007
n-0.007
 seventy-0.007
Token play
Token ID711
Feature activation+9.24e-01

Pos logit contributions
 gor+0.006
 Gor+0.006
n+0.006
 deliber+0.006
 seventy+0.006
Neg logit contributions
by-0.007
icker-0.007
?-0.007
ulum-0.006
 pens-0.006
Token this
Token ID428
Feature activation+9.65e-02

Pos logit contributions
by+0.019
icker+0.018
bie+0.016
?+0.015
ulum+0.015
Neg logit contributions
n-0.017
 gor-0.017
 deliber-0.017
riger-0.017
 Gor-0.017
Token piano
Token ID19132
Feature activation+1.24e-01

Pos logit contributions
by+0.002
icker+0.002
?+0.002
ulum+0.001
 something+0.001
Neg logit contributions
 gor-0.002
 Gor-0.002
 seventy-0.002
 deliber-0.002
n-0.002
Token too
Token ID1165
Feature activation-4.62e-01

Pos logit contributions
icker+0.003
by+0.003
bie+0.002
ulum+0.002
oppy+0.002
Neg logit contributions
n-0.002
 gor-0.002
 Gor-0.002
 seventy-0.002
riger-0.002
Token.
Token ID13
Feature activation-5.88e-02

Pos logit contributions
n+0.007
hel+0.007
 deliber+0.007
 delighted+0.007
 spe+0.007
Neg logit contributions
by-0.010
icker-0.009
 notebooks-0.009
 elbows-0.009
ports-0.009
Token\n
Token ID198
Feature activation+1.48e-01

Pos logit contributions
 Gor+0.001
 gor+0.001
n+0.001
 deliber+0.001
 seventy+0.001
Neg logit contributions
by-0.001
icker-0.001
?-0.001
ulum-0.001
 pens-0.001
Token\n
Token ID198
Feature activation-4.40e-05

Pos logit contributions
by+0.002
icker+0.002
?+0.002
 each+0.002
 something+0.002
Neg logit contributions
 Gor-0.004
 gor-0.004
 spe-0.004
riger-0.004
 seventy-0.004
TokenAfter
Token ID3260
Feature activation-1.80e-01

Pos logit contributions
 gor+0.000
 Gor+0.000
 spe+0.000
 ostr+0.000
 seventy+0.000
Neg logit contributions
icker-0.000
?-0.000
by-0.000
They-0.000
ulum-0.000
Token a
Token ID257
Feature activation-1.82e-01

Pos logit contributions
 deliber+0.004
n+0.003
 gor+0.003
 Ph+0.003
 seventy+0.003
Neg logit contributions
by-0.004
icker-0.004
important-0.003
bie-0.003
?-0.003
Token few
Token ID1178
Feature activation-8.96e-01

Pos logit contributions
 gor+0.004
 Gor+0.004
 seventy+0.004
n+0.004
fters+0.004
Neg logit contributions
by-0.003
icker-0.003
?-0.002
 something-0.002
 each-0.002
Token be
Token ID307
Feature activation+3.26e-01

Pos logit contributions
icker+0.013
by+0.013
ulum+0.012
 pens+0.012
bie+0.012
Neg logit contributions
n-0.007
 deliber-0.004
 thank-0.004
 gob-0.004
 Gor-0.004
Token something
Token ID1223
Feature activation-1.09e+00

Pos logit contributions
icker+0.009
ulum+0.008
ute+0.007
by+0.007
utes+0.007
Neg logit contributions
 delighted-0.005
 gor-0.005
n-0.005
 deliber-0.005
 spe-0.005
Token special
Token ID2041
Feature activation-5.56e-02

Pos logit contributions
n+0.017
 gor+0.017
 Gor+0.016
 deliber+0.016
 spe+0.016
Neg logit contributions
icker-0.026
ulum-0.025
by-0.024
own-0.021
bie-0.021
Token like
Token ID588
Feature activation+3.00e-01

Pos logit contributions
n+0.001
 Gor+0.001
 gor+0.001
 deliber+0.001
 seventy+0.001
Neg logit contributions
icker-0.001
by-0.001
ulum-0.001
ilts-0.001
bie-0.001
Token a
Token ID257
Feature activation-8.72e-02

Pos logit contributions
icker+0.007
by+0.006
ulum+0.006
bie+0.005
important+0.005
Neg logit contributions
 gor-0.006
n-0.006
 Gor-0.005
 spe-0.005
 deliber-0.005
Token medal
Token ID18279
Feature activation+9.59e-01

Pos logit contributions
 Gor+0.001
 gor+0.001
 spe+0.001
 ostr+0.001
 gob+0.001
Neg logit contributions
icker-0.002
ulum-0.002
by-0.002
attered-0.002
uds-0.002
Token.
Token ID13
Feature activation+2.37e-02

Pos logit contributions
icker+0.017
by+0.017
ulum+0.015
bie+0.014
 Raz+0.013
Neg logit contributions
n-0.020
 gor-0.020
 Gor-0.020
riger-0.019
 deliber-0.019
Token But
Token ID887
Feature activation+2.46e-01

Pos logit contributions
icker+0.001
by+0.001
ulum+0.000
eds+0.000
 pens+0.000
Neg logit contributions
 Once-0.000
 Gor-0.000
n-0.000
 deliber-0.000
oke-0.000
Token David
Token ID3271
Feature activation-2.94e-01

Pos logit contributions
icker+0.005
by+0.005
ulum+0.004
ilts+0.004
important+0.004
Neg logit contributions
n-0.005
 Gor-0.005
 deliber-0.005
 seventy-0.004
 gor-0.004
Token had
Token ID550
Feature activation-2.16e-02

Pos logit contributions
n+0.005
 Gor+0.005
 gob+0.005
 deliber+0.005
 gor+0.004
Neg logit contributions
icker-0.007
by-0.006
ulum-0.006
ilts-0.006
own-0.006
Token a
Token ID257
Feature activation-6.76e-01

Pos logit contributions
n+0.000
 deliber+0.000
 seventy+0.000
 delighted+0.000
 hes+0.000
Neg logit contributions
icker-0.001
ulum-0.000
by-0.000
ucer-0.000
ute-0.000
Token.
Token ID13
Feature activation-2.26e-01

Pos logit contributions
n+0.007
 gor+0.007
 Gor+0.007
 gob+0.006
 delighted+0.006
Neg logit contributions
icker-0.005
by-0.005
ulum-0.005
 Raz-0.004
 pens-0.004
Token It
Token ID632
Feature activation-7.12e-01

Pos logit contributions
 Gor+0.004
n+0.004
 gor+0.004
 Ph+0.004
 seventy+0.004
Neg logit contributions
by-0.005
icker-0.005
 pens-0.005
ulum-0.004
 pen-0.004
Token was
Token ID373
Feature activation+8.18e-02

Pos logit contributions
 gor+0.014
 Gor+0.014
n+0.014
 seventy+0.013
 deliber+0.013
Neg logit contributions
icker-0.014
by-0.013
?-0.012
ulum-0.012
 pens-0.011
Token a
Token ID257
Feature activation+4.68e-02

Pos logit contributions
icker+0.002
by+0.002
ulum+0.001
?+0.001
oppy+0.001
Neg logit contributions
 gor-0.002
 Gor-0.002
n-0.002
 seventy-0.002
 spe-0.002
Token fun
Token ID1257
Feature activation+7.59e-01

Pos logit contributions
icker+0.001
by+0.001
ulum+0.001
?+0.001
ina+0.001
Neg logit contributions
 gor-0.001
 Gor-0.001
 spe-0.001
 ostr-0.001
 seventy-0.001
Token and
Token ID290
Feature activation+4.67e-01

Pos logit contributions
icker+0.018
by+0.017
ulum+0.015
important+0.015
?+0.015
Neg logit contributions
 gor-0.011
n-0.011
 deliber-0.011
 Gor-0.011
 ostr-0.010
Token delicious
Token ID12625
Feature activation+7.59e-01

Pos logit contributions
icker+0.010
by+0.009
ulum+0.008
?+0.008
own+0.008
Neg logit contributions
 gor-0.009
 Gor-0.009
n-0.008
 deliber-0.008
 delighted-0.008
Token dinner
Token ID8073
Feature activation+4.88e-01

Pos logit contributions
icker+0.017
by+0.016
?+0.015
ulum+0.015
ina+0.015
Neg logit contributions
 gor-0.013
 Gor-0.012
 deliber-0.012
n-0.011
 spe-0.011
Token for
Token ID329
Feature activation-1.83e-01

Pos logit contributions
icker+0.009
by+0.009
ulum+0.008
bie+0.007
?+0.007
Neg logit contributions
n-0.010
 gor-0.010
 Gor-0.010
 deliber-0.009
 spe-0.009
Token Lily
Token ID20037
Feature activation+3.58e-01

Pos logit contributions
 gor+0.004
n+0.003
 Gor+0.003
 deliber+0.003
 gob+0.003
Neg logit contributions
by-0.004
icker-0.004
attered-0.004
ulum-0.004
important-0.003
Token and
Token ID290
Feature activation+5.33e-02

Pos logit contributions
icker+0.008
ulum+0.007
by+0.006
important+0.006
 gears+0.006
Neg logit contributions
n-0.007
 Gor-0.006
 gor-0.006
 gob-0.006
 delighted-0.006
Token her
Token ID607
Feature activation-1.27e-01

Pos logit contributions
icker+0.004
by+0.004
ulum+0.004
?+0.003
 pens+0.003
Neg logit contributions
 gor-0.006
n-0.006
 Gor-0.006
 seventy-0.006
 deliber-0.006
Token friends
Token ID2460
Feature activation+2.38e-01

Pos logit contributions
 gor+0.003
 Gor+0.003
n+0.003
 seventy+0.003
 deliber+0.003
Neg logit contributions
icker-0.002
by-0.002
ulum-0.002
?-0.002
own-0.002
Token to
Token ID284
Feature activation+1.13e+00

Pos logit contributions
icker+0.006
ulum+0.006
by+0.005
important+0.005
attered+0.005
Neg logit contributions
 gor-0.003
n-0.003
 Gor-0.003
 seventy-0.003
oke-0.003
Token join
Token ID4654
Feature activation+1.00e+00

Pos logit contributions
by+0.027
icker+0.027
ulum+0.025
?+0.024
bie+0.023
Neg logit contributions
n-0.016
 gor-0.015
 deliber-0.015
 seventy-0.015
 Gor-0.015
Token her
Token ID607
Feature activation-4.58e-01

Pos logit contributions
icker+0.021
by+0.021
ulum+0.020
bie+0.018
important+0.018
Neg logit contributions
n-0.018
 deliber-0.017
 gor-0.016
 Ph-0.016
 seventy-0.015
Token and
Token ID290
Feature activation+8.14e-01

Pos logit contributions
 gor+0.009
n+0.009
 Gor+0.009
 deliber+0.009
 seventy+0.008
Neg logit contributions
icker-0.009
by-0.008
ulum-0.008
important-0.007
bie-0.007
Token when
Token ID618
Feature activation-4.48e-01

Pos logit contributions
icker+0.017
by+0.016
ulum+0.016
?+0.015
own+0.014
Neg logit contributions
n-0.015
 gor-0.014
 Gor-0.014
 deliber-0.014
 seventy-0.013
Token they
Token ID484
Feature activation+2.38e-01

Pos logit contributions
 Ph+0.007
 deliber+0.007
 Lib+0.007
 gor+0.007
n+0.007
Neg logit contributions
icker-0.012
by-0.010
ulum-0.010
eds-0.010
aker-0.010
Token said
Token ID531
Feature activation-3.07e-01

Pos logit contributions
icker+0.005
by+0.005
ulum+0.004
?+0.004
 pens+0.004
Neg logit contributions
 gor-0.004
 Gor-0.004
 deliber-0.004
n-0.004
 seventy-0.004
Token yes
Token ID3763
Feature activation+9.64e-01

Pos logit contributions
 gor+0.005
 seventy+0.005
 delighted+0.005
n+0.005
 Gor+0.005
Neg logit contributions
icker-0.007
by-0.007
ulum-0.007
bie-0.007
eds-0.006
Token she
Token ID673
Feature activation+1.19e-01

Pos logit contributions
icker+0.021
ulum+0.018
by+0.017
bie+0.017
 pens+0.017
Neg logit contributions
n-0.019
oke-0.018
 Gor-0.018
 gor-0.017
 deliber-0.017
Token if
Token ID611
Feature activation+3.70e-01

Pos logit contributions
icker+0.009
by+0.008
ulum+0.008
?+0.007
ilts+0.006
Neg logit contributions
 gor-0.015
 Gor-0.015
n-0.014
 seventy-0.014
 deliber-0.014
Token he
Token ID339
Feature activation+5.05e-01

Pos logit contributions
icker+0.008
by+0.007
ulum+0.007
ilts+0.007
?+0.007
Neg logit contributions
n-0.007
 Gor-0.006
 gor-0.006
 deliber-0.006
 seventy-0.006
Token could
Token ID714
Feature activation+7.59e-01

Pos logit contributions
icker+0.012
ulum+0.011
by+0.010
ilts+0.010
own+0.009
Neg logit contributions
 Gor-0.009
 seventy-0.008
n-0.008
 gor-0.008
 deliber-0.008
Token crawl
Token ID27318
Feature activation-4.16e-01

Pos logit contributions
by+0.019
icker+0.018
ulum+0.017
 pens+0.017
ky+0.016
Neg logit contributions
n-0.012
 Gor-0.010
 gor-0.010
 seventy-0.010
 deliber-0.010
Token there
Token ID612
Feature activation-8.54e-01

Pos logit contributions
n+0.006
 gor+0.006
 Ph+0.006
 seventy+0.006
 Gor+0.005
Neg logit contributions
icker-0.010
by-0.010
ulum-0.009
bie-0.009
 Brother-0.009
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
 Gor+0.016
 gor+0.016
 seventy+0.015
n+0.015
 deliber+0.015
Neg logit contributions
icker-0.018
by-0.017
ulum-0.016
ilts-0.015
oppy-0.014
Token His
Token ID2399
Feature activation-1.63e-02

Pos logit contributions
icker+0.000
by+0.000
important+0.000
?+0.000
ulum+0.000
Neg logit contributions
n-0.000
 Once-0.000
 deliber-0.000
 Gor-0.000
riger-0.000
Token aunt
Token ID25949
Feature activation-1.08e+00

Pos logit contributions
 gor+0.000
fters+0.000
 seventy+0.000
riger+0.000
 deliber+0.000
Neg logit contributions
by-0.000
icker-0.000
?-0.000
 each-0.000
 something-0.000
Token said
Token ID531
Feature activation-7.40e-01

Pos logit contributions
 gor+0.019
 Gor+0.019
n+0.019
 seventy+0.018
 deliber+0.018
Neg logit contributions
icker-0.022
by-0.022
?-0.020
ulum-0.020
 pens-0.019
Token that
Token ID326
Feature activation+2.78e-01

Pos logit contributions
 gor+0.009
 Gor+0.008
n+0.008
 seventy+0.008
 deliber+0.008
Neg logit contributions
by-0.020
icker-0.020
?-0.018
ulum-0.018
 pens-0.017
Token he
Token ID339
Feature activation+2.91e-01

Pos logit contributions
icker+0.005
by+0.005
?+0.004
ulum+0.004
own+0.004
Neg logit contributions
 gor-0.006
 Gor-0.006
n-0.006
 deliber-0.006
 seventy-0.006
Token tell
Token ID1560
Feature activation-1.00e-02

Pos logit contributions
by+0.013
icker+0.013
?+0.012
ulum+0.011
 something+0.011
Neg logit contributions
 gor-0.014
 Gor-0.014
 deliber-0.014
 seventy-0.014
n-0.013
Token their
Token ID511
Feature activation-5.19e-01

Pos logit contributions
 gor+0.000
 Gor+0.000
n+0.000
 ostr+0.000
 thank+0.000
Neg logit contributions
icker-0.000
by-0.000
ulum-0.000
bie-0.000
ina-0.000
Token mom
Token ID1995
Feature activation+1.12e-01

Pos logit contributions
 seventy+0.013
 gor+0.013
 deliber+0.013
n+0.013
 Gor+0.013
Neg logit contributions
by-0.007
icker-0.006
?-0.006
 each-0.005
 something-0.005
Token and
Token ID290
Feature activation+6.09e-01

Pos logit contributions
icker+0.003
ulum+0.003
ky+0.003
attered+0.003
important+0.003
Neg logit contributions
 gor-0.002
 Gor-0.001
n-0.001
 ostr-0.001
oke-0.001
Token dad
Token ID9955
Feature activation+6.21e-01

Pos logit contributions
by+0.007
icker+0.007
?+0.006
ulum+0.005
 pens+0.005
Neg logit contributions
 gor-0.017
 Gor-0.016
n-0.016
 seventy-0.016
 deliber-0.016
Token what
Token ID644
Feature activation-4.89e-01

Pos logit contributions
icker+0.017
ulum+0.017
important+0.015
ky+0.015
by+0.015
Neg logit contributions
 gor-0.009
 Gor-0.007
n-0.007
oke-0.007
 ostr-0.007
Token happened
Token ID3022
Feature activation-2.73e-01

Pos logit contributions
 gor+0.007
 deliber+0.007
n+0.007
 Gor+0.007
 spe+0.006
Neg logit contributions
icker-0.012
by-0.011
ulum-0.011
ky-0.010
bie-0.010
Token.
Token ID13
Feature activation+5.09e-01

Pos logit contributions
n+0.007
 deliber+0.006
 gor+0.006
 Gor+0.006
 seventy+0.006
Neg logit contributions
icker-0.004
by-0.004
ulum-0.004
bie-0.003
 pens-0.003
Token They
Token ID1119
Feature activation-2.60e-01

Pos logit contributions
by+0.008
icker+0.007
?+0.006
ina+0.005
important+0.005
Neg logit contributions
 deliber-0.012
 Gor-0.012
 gor-0.012
 Ph-0.012
n-0.012
Token held
Token ID2714
Feature activation+3.38e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
n+0.005
 seventy+0.005
 deliber+0.005
Neg logit contributions
icker-0.005
by-0.005
?-0.005
ulum-0.004
 pens-0.004
Token hands
Token ID2832
Feature activation-3.42e-01

Pos logit contributions
icker+0.006
by+0.006
ina+0.005
?+0.005
ulum+0.005
Neg logit contributions
 seventy-0.007
 Gor-0.007
 gor-0.007
n-0.006
 spe-0.006
Token They
Token ID1119
Feature activation-4.54e-01

Pos logit contributions
 gor+0.016
 Gor+0.016
 Ph+0.016
 deliber+0.015
n+0.015
Neg logit contributions
by-0.011
icker-0.009
ulum-0.008
ky-0.008
ina-0.008
Token say
Token ID910
Feature activation-6.45e-01

Pos logit contributions
 gor+0.007
 Gor+0.007
 gob+0.006
n+0.006
riger+0.006
Neg logit contributions
icker-0.011
by-0.011
ulum-0.010
?-0.009
own-0.009
Token thank
Token ID5875
Feature activation+2.32e-02

Pos logit contributions
 gor+0.007
 seventy+0.007
 Gor+0.007
 deliber+0.007
n+0.007
Neg logit contributions
by-0.018
icker-0.018
ulum-0.016
?-0.016
bie-0.016
Token you
Token ID345
Feature activation+6.85e-01

Pos logit contributions
by+0.001
ulum+0.001
bie+0.001
icker+0.001
 gears+0.001
Neg logit contributions
n-0.000
go-0.000
 deliber-0.000
riger-0.000
 seventy-0.000
Token to
Token ID284
Feature activation+2.77e-01

Pos logit contributions
icker+0.017
ulum+0.017
important+0.016
ined+0.015
ording+0.014
Neg logit contributions
 Gor-0.010
 gor-0.009
n-0.009
 ostr-0.008
rd-0.008
Token Mom
Token ID11254
Feature activation-5.15e-01

Pos logit contributions
by+0.005
icker+0.005
ulum+0.005
ilts+0.005
attered+0.005
Neg logit contributions
 gor-0.005
 Gor-0.005
 ostr-0.005
 gob-0.005
 seventy-0.005
Token.
Token ID13
Feature activation-4.51e-01

Pos logit contributions
 gor+0.010
 Gor+0.010
n+0.009
 seventy+0.009
oke+0.009
Neg logit contributions
icker-0.011
ulum-0.010
Simon-0.009
by-0.009
important-0.008
Token They
Token ID1119
Feature activation+3.88e-02

Pos logit contributions
 Gor+0.010
 gor+0.010
 deliber+0.010
 Ph+0.010
 seventy+0.010
Neg logit contributions
by-0.008
icker-0.006
ulum-0.006
ina-0.005
ito-0.005
Token say
Token ID910
Feature activation-6.51e-01

Pos logit contributions
icker+0.001
by+0.001
ulum+0.001
?+0.001
own+0.001
Neg logit contributions
 gor-0.001
 Gor-0.001
n-0.001
 gob-0.001
 seventy-0.001
Token they
Token ID484
Feature activation+1.70e-01

Pos logit contributions
 gor+0.009
 seventy+0.009
n+0.009
 deliber+0.009
 Gor+0.009
Neg logit contributions
by-0.016
icker-0.016
ulum-0.015
?-0.015
bie-0.014
Token love
Token ID1842
Feature activation+9.09e-01

Pos logit contributions
icker+0.004
ulum+0.004
by+0.004
 Toby+0.004
 Brother+0.004
Neg logit contributions
n-0.002
 gor-0.002
 seventy-0.002
riger-0.002
 Gor-0.002
Token the
Token ID262
Feature activation-5.05e-01

Pos logit contributions
icker+0.013
ulum+0.012
by+0.012
ilts+0.011
attered+0.011
Neg logit contributions
 Gor-0.012
 gor-0.011
 seventy-0.011
 ostr-0.010
n-0.010
Token other
Token ID584
Feature activation-7.09e-01

Pos logit contributions
 gor+0.010
 Gor+0.010
 ostr+0.009
n+0.009
 seventy+0.009
Neg logit contributions
by-0.009
icker-0.009
?-0.009
ulum-0.008
 Toby-0.008
Token food
Token ID2057
Feature activation-1.05e-01

Pos logit contributions
 gor+0.013
 Gor+0.013
 ostr+0.012
n+0.012
 seventy+0.012
Neg logit contributions
icker-0.014
by-0.014
ulum-0.013
?-0.013
own-0.011
Token around
Token ID1088
Feature activation+4.83e-01

Pos logit contributions
 gor+0.002
 Gor+0.002
 ostr+0.002
oke+0.002
n+0.002
Neg logit contributions
ulum-0.002
icker-0.002
by-0.002
ilts-0.002
ky-0.002
Token,
Token ID11
Feature activation+4.00e-01

Pos logit contributions
icker+0.011
by+0.011
ilts+0.010
ulum+0.010
ucer+0.010
Neg logit contributions
n-0.008
 gor-0.008
 reaching-0.008
 following-0.007
 gob-0.007
Token "
Token ID366
Feature activation-6.79e-01

Pos logit contributions
by+0.013
icker+0.012
ulum+0.011
important+0.011
oppy+0.010
Neg logit contributions
 Gor-0.004
n-0.004
 delighted-0.004
 reaching-0.004
 Ph-0.004
TokenWho
Token ID8241
Feature activation-8.86e-01

Pos logit contributions
 gor+0.012
 Gor+0.012
n+0.012
 seventy+0.012
 deliber+0.011
Neg logit contributions
by-0.014
icker-0.014
?-0.013
ulum-0.012
 pens-0.011
Token wants
Token ID3382
Feature activation+3.51e-01

Pos logit contributions
 gor+0.015
n+0.015
 Gor+0.015
 seventy+0.014
 ostr+0.013
Neg logit contributions
icker-0.020
by-0.019
ulum-0.018
 Toby-0.016
?-0.016
Token to
Token ID284
Feature activation+4.36e-01

Pos logit contributions
icker+0.007
by+0.007
ina+0.006
ulum+0.006
bie+0.006
Neg logit contributions
n-0.007
 gor-0.007
 Gor-0.007
 deliber-0.006
 seventy-0.006
Token win
Token ID1592
Feature activation-4.19e-02

Pos logit contributions
?+0.009
by+0.008
icker+0.008
 something+0.008
 each+0.007
Neg logit contributions
 gor-0.008
 Gor-0.008
riger-0.008
oke-0.008
 deliber-0.008
Token a
Token ID257
Feature activation-5.01e-01

Pos logit contributions
n+0.001
 gor+0.001
 seventy+0.001
 Gor+0.001
 deliber+0.001
Neg logit contributions
by-0.001
icker-0.001
bie-0.001
ulum-0.001
attered-0.000
Token,
Token ID11
Feature activation+6.38e-01

Pos logit contributions
icker+0.001
by+0.001
ulum+0.001
 journals+0.001
 pens+0.001
Neg logit contributions
n-0.000
oke-0.000
 Gor-0.000
 seventy-0.000
 gor-0.000
Token you
Token ID345
Feature activation+5.04e-01

Pos logit contributions
by+0.012
icker+0.012
?+0.011
ulum+0.010
 something+0.010
Neg logit contributions
 gor-0.013
 Gor-0.012
n-0.012
 deliber-0.012
 seventy-0.012
Token are
Token ID389
Feature activation+8.35e-01

Pos logit contributions
by+0.011
icker+0.011
ulum+0.010
?+0.009
 early+0.009
Neg logit contributions
n-0.009
 gor-0.008
 Gor-0.008
riger-0.008
oke-0.008
Token too
Token ID1165
Feature activation-7.46e-02

Pos logit contributions
by+0.018
icker+0.017
ulum+0.014
?+0.014
bie+0.014
Neg logit contributions
 gor-0.016
 Gor-0.016
n-0.015
 seventy-0.015
riger-0.015
Token noisy
Token ID31210
Feature activation+3.29e-02

Pos logit contributions
 gor+0.001
oke+0.001
 spe+0.001
 deliber+0.001
 Gor+0.001
Neg logit contributions
by-0.002
ulum-0.002
icker-0.001
 pens-0.001
 journals-0.001
Token!
Token ID0
Feature activation-6.12e-01

Pos logit contributions
icker+0.001
by+0.001
ulum+0.000
bie+0.000
oppy+0.000
Neg logit contributions
 gor-0.001
n-0.001
 deliber-0.001
 Gor-0.001
riger-0.001
Token I
Token ID314
Feature activation+4.26e-01

Pos logit contributions
 Gor+0.009
oke+0.009
n+0.009
 seventy+0.008
 gor+0.008
Neg logit contributions
by-0.017
icker-0.015
 pens-0.013
?-0.012
 early-0.012
Token can
Token ID460
Feature activation-7.51e-01

Pos logit contributions
by+0.009
icker+0.009
ulum+0.008
 pens+0.008
?+0.008
Neg logit contributions
n-0.007
 gor-0.007
 Gor-0.007
riger-0.007
 deliber-0.007
Token't
Token ID470
Feature activation+1.24e-01

Pos logit contributions
n+0.008
riger+0.006
fters+0.006
 gor+0.006
 seventy+0.006
Neg logit contributions
by-0.022
icker-0.020
bie-0.020
ulum-0.019
 pens-0.019
Token hear
Token ID3285
Feature activation-1.09e+00

Pos logit contributions
by+0.003
icker+0.003
bie+0.003
 pens+0.003
ulum+0.003
Neg logit contributions
n-0.002
riger-0.002
fters-0.002
go-0.002
 deliber-0.002
Token myself
Token ID3589
Feature activation+7.33e-01

Pos logit contributions
 gor+0.020
n+0.020
 Gor+0.019
 seventy+0.018
 ostr+0.018
Neg logit contributions
icker-0.024
by-0.023
ina-0.019
oppy-0.019
important-0.019
Token her
Token ID607
Feature activation-2.10e-02

Pos logit contributions
 gor+0.003
n+0.002
 Gor+0.002
 spe+0.002
fters+0.002
Neg logit contributions
icker-0.003
by-0.002
ulum-0.002
utes-0.002
bie-0.002
Token.
Token ID13
Feature activation-8.92e-01

Pos logit contributions
 gor+0.001
 Gor+0.001
n+0.001
 ostr+0.001
 spe+0.001
Neg logit contributions
icker-0.000
by-0.000
ulum-0.000
?-0.000
 Toby-0.000
Token Every
Token ID3887
Feature activation+1.17e+00

Pos logit contributions
 Gor+0.016
 gor+0.016
n+0.015
 deliber+0.015
 seventy+0.015
Neg logit contributions
by-0.019
icker-0.019
?-0.017
ulum-0.016
 pens-0.016
Token day
Token ID1110
Feature activation+2.55e-01

Pos logit contributions
icker+0.032
by+0.030
ulum+0.029
 pens+0.029
important+0.028
Neg logit contributions
 gor-0.015
 Gor-0.014
n-0.014
 deliber-0.014
 ostr-0.013
Token the
Token ID262
Feature activation-2.70e-02

Pos logit contributions
icker+0.005
by+0.005
ulum+0.005
?+0.005
important+0.005
Neg logit contributions
 gor-0.005
 Gor-0.005
 ostr-0.004
n-0.004
 deliber-0.004
Token farmer
Token ID18739
Feature activation-6.49e-01

Pos logit contributions
 seventy+0.001
riger+0.001
oke+0.001
n+0.001
 deliber+0.001
Neg logit contributions
by-0.000
 each-0.000
?-0.000
 something-0.000
icker-0.000
Token would
Token ID561
Feature activation+6.49e-02

Pos logit contributions
 gor+0.015
 Gor+0.014
n+0.014
 seventy+0.014
 ostr+0.014
Neg logit contributions
icker-0.011
by-0.010
ulum-0.009
?-0.009
own-0.008
Token walk
Token ID2513
Feature activation-6.51e-01

Pos logit contributions
by+0.001
icker+0.001
?+0.001
ulum+0.001
 pens+0.001
Neg logit contributions
 gor-0.001
n-0.001
 Gor-0.001
 seventy-0.001
 deliber-0.001
Token her
Token ID607
Feature activation+4.75e-01

Pos logit contributions
 Gor+0.009
 gor+0.009
 deliber+0.007
 Lib+0.007
go+0.007
Neg logit contributions
icker-0.018
ulum-0.016
oppy-0.016
ilts-0.015
ico-0.015
Token around
Token ID1088
Feature activation+1.32e-01

Pos logit contributions
by+0.009
icker+0.009
?+0.008
ulum+0.008
 each+0.008
Neg logit contributions
 gor-0.009
 Gor-0.009
 deliber-0.009
 seventy-0.009
n-0.009
Token the
Token ID262
Feature activation+7.62e-01

Pos logit contributions
icker+0.003
by+0.002
ulum+0.002
important+0.002
?+0.002
Neg logit contributions
 gor-0.003
 Gor-0.003
 deliber-0.003
 spe-0.002
 seventy-0.002
Token
Token ID220
Feature activation+4.07e-01

Pos logit contributions
n+0.010
 gor+0.008
 Gor+0.008
 deliber+0.008
 seventy+0.008
Neg logit contributions
by-0.037
icker-0.035
bie-0.032
 pens-0.032
ulum-0.032
Token
Token ID198
Feature activation+2.38e-02

Pos logit contributions
by+0.009
icker+0.009
ulum+0.009
 pens+0.009
?+0.009
Neg logit contributions
n-0.006
oke-0.005
 seventy-0.005
 delighted-0.005
 deliber-0.005
Token
Token ID198
Feature activation-3.94e-02

Pos logit contributions
icker+0.001
by+0.001
ulum+0.000
 pens+0.000
?+0.000
Neg logit contributions
n-0.000
 Gor-0.000
 gor-0.000
 deliber-0.000
 seventy-0.000
TokenThe
Token ID464
Feature activation-2.76e-01

Pos logit contributions
n+0.001
 Gor+0.001
 gor+0.001
 deliber+0.001
 seventy+0.001
Neg logit contributions
by-0.001
icker-0.001
ulum-0.001
?-0.001
 pens-0.001
Token town
Token ID3240
Feature activation-3.82e-01

Pos logit contributions
 gor+0.004
 ostr+0.004
 Gor+0.004
 seventy+0.004
n+0.004
Neg logit contributions
icker-0.007
by-0.006
ulum-0.006
?-0.006
own-0.006
Tokenspe
Token ID4125
Feature activation-2.73e+00

Pos logit contributions
n+0.006
 gor+0.006
 Gor+0.006
oke+0.006
 delighted+0.005
Neg logit contributions
icker-0.009
by-0.008
?-0.008
ulum-0.008
 pens-0.008
Tokenople
Token ID643
Feature activation-5.80e-01

Pos logit contributions
 gor+0.028
n+0.027
 Gor+0.027
 seventy+0.026
 deliber+0.025
Neg logit contributions
icker-0.076
by-0.074
?-0.071
 Toby-0.068
 something-0.067
Token had
Token ID550
Feature activation+2.93e-01

Pos logit contributions
n+0.010
 gor+0.009
 Gor+0.009
oke+0.009
 deliber+0.008
Neg logit contributions
icker-0.013
by-0.013
ulum-0.012
?-0.011
 pens-0.011
Token to
Token ID284
Feature activation-3.23e-01

Pos logit contributions
icker+0.007
oppy+0.006
by+0.006
ulum+0.005
ina+0.005
Neg logit contributions
n-0.006
 gor-0.005
 deliber-0.005
 delighted-0.005
 Gor-0.005
Token find
Token ID1064
Feature activation-2.41e-01

Pos logit contributions
n+0.005
 gor+0.005
 deliber+0.004
 Gor+0.004
riger+0.004
Neg logit contributions
icker-0.008
by-0.008
ulum-0.007
bie-0.007
 pens-0.007
Token a
Token ID257
Feature activation-2.99e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
n+0.005
 deliber+0.005
 ostr+0.005
Neg logit contributions
icker-0.004
by-0.004
ulum-0.004
oppy-0.004
?-0.003
Token than
Token ID621
Feature activation-2.56e-01

Pos logit contributions
 gor+0.006
n+0.006
 Gor+0.006
 ostr+0.005
 spe+0.005
Neg logit contributions
icker-0.006
by-0.006
ulum-0.006
 pens-0.005
 Toby-0.005
Token usual
Token ID6678
Feature activation-2.87e-02

Pos logit contributions
 gor+0.005
n+0.005
 Gor+0.005
 spe+0.004
 ostr+0.004
Neg logit contributions
icker-0.005
by-0.005
ulum-0.005
ilts-0.005
?-0.005
Token.
Token ID13
Feature activation+5.00e-02

Pos logit contributions
 gor+0.001
 Gor+0.001
n+0.001
 seventy+0.001
 spe+0.001
Neg logit contributions
icker-0.000
by-0.000
ulum-0.000
?-0.000
bie-0.000
Token As
Token ID1081
Feature activation-1.61e-01

Pos logit contributions
by+0.001
icker+0.001
?+0.001
ulum+0.001
 pens+0.001
Neg logit contributions
 Gor-0.001
n-0.001
 gor-0.001
 deliber-0.001
 seventy-0.001
Token Ch
Token ID609
Feature activation-1.15e+00

Pos logit contributions
 gor+0.003
n+0.003
 Gor+0.003
 spe+0.003
 deliber+0.003
Neg logit contributions
icker-0.003
by-0.003
ulum-0.003
?-0.003
 pens-0.003
Tokenir
Token ID343
Feature activation-2.30e+00

Pos logit contributions
 gor+0.018
 Gor+0.018
n+0.017
 deliber+0.017
 seventy+0.017
Neg logit contributions
by-0.026
icker-0.025
?-0.024
ulum-0.023
 pens-0.022
Tokenp
Token ID79
Feature activation-1.48e+00

Pos logit contributions
 gor+0.033
n+0.032
 Gor+0.031
 ostr+0.030
 seventy+0.030
Neg logit contributions
icker-0.056
by-0.055
?-0.052
ulum-0.050
 pens-0.049
Token flew
Token ID13112
Feature activation-7.20e-01

Pos logit contributions
 gor+0.027
 Gor+0.026
n+0.026
 deliber+0.026
 seventy+0.026
Neg logit contributions
by-0.029
icker-0.029
?-0.027
ulum-0.026
 pens-0.024
Token higher
Token ID2440
Feature activation-5.13e-01

Pos logit contributions
 gor+0.012
 Gor+0.012
n+0.012
 seventy+0.012
 deliber+0.012
Neg logit contributions
icker-0.015
by-0.015
ulum-0.014
?-0.014
 pens-0.013
Token,
Token ID11
Feature activation-7.98e-01

Pos logit contributions
 gor+0.009
 Gor+0.009
n+0.009
 seventy+0.009
 deliber+0.009
Neg logit contributions
icker-0.010
by-0.010
?-0.009
ulum-0.009
 pens-0.009
Token he
Token ID339
Feature activation+3.52e-02

Pos logit contributions
 gor+0.013
n+0.013
 Gor+0.013
 Ph+0.012
 spe+0.012
Neg logit contributions
icker-0.019
by-0.018
ulum-0.017
?-0.017
own-0.016
Token are
Token ID389
Feature activation+5.79e-01

Pos logit contributions
n+0.001
 gor+0.001
 Gor+0.001
riger+0.001
 seventy+0.001
Neg logit contributions
icker-0.002
by-0.002
ulum-0.002
?-0.002
 Toby-0.002
Token astronauts
Token ID26835
Feature activation-4.62e-01

Pos logit contributions
by+0.012
icker+0.012
ulum+0.011
?+0.010
bie+0.010
Neg logit contributions
 gor-0.011
 Gor-0.010
n-0.010
 seventy-0.010
 spe-0.009
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
 gor+0.012
 Gor+0.012
n+0.011
 seventy+0.011
riger+0.011
Neg logit contributions
icker-0.006
by-0.006
ulum-0.005
?-0.005
 pens-0.004
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
 gor+0.026
 Gor+0.025
n+0.025
 deliber+0.025
 seventy+0.024
Neg logit contributions
by-0.023
icker-0.023
?-0.021
ulum-0.020
 pens-0.019
Token
Token ID198
Feature activation-2.00e-01

Pos logit contributions
by+0.002
icker+0.002
?+0.002
 something+0.002
ulum+0.002
Neg logit contributions
 gor-0.003
 Gor-0.003
riger-0.003
 ostr-0.003
 seventy-0.003
Token"
Token ID1
Feature activation-2.08e+00

Pos logit contributions
 gor+0.004
 Gor+0.004
 seventy+0.004
 deliber+0.004
n+0.004
Neg logit contributions
by-0.003
icker-0.003
?-0.003
ulum-0.003
 something-0.003
TokenLook
Token ID8567
Feature activation-3.99e-01

Pos logit contributions
 gor+0.049
oke+0.048
 deliber+0.048
 Gor+0.047
riger+0.047
Neg logit contributions
by-0.030
icker-0.027
?-0.026
 each-0.025
ulum-0.025
Token,
Token ID11
Feature activation-5.56e-01

Pos logit contributions
 gor+0.008
 Gor+0.008
 deliber+0.008
n+0.008
 seventy+0.007
Neg logit contributions
by-0.007
icker-0.007
?-0.007
ulum-0.006
 something-0.006
Token Lily
Token ID20037
Feature activation-2.50e-01

Pos logit contributions
 gor+0.014
 deliber+0.014
 Gor+0.014
fters+0.013
riger+0.013
Neg logit contributions
?-0.007
 something-0.007
by-0.006
 chess-0.006
 each-0.006
Token,
Token ID11
Feature activation-4.54e-01

Pos logit contributions
 gor+0.005
 Gor+0.005
n+0.005
 seventy+0.005
 deliber+0.004
Neg logit contributions
icker-0.005
by-0.005
ulum-0.004
?-0.004
 pens-0.004
Token I
Token ID314
Feature activation+7.95e-01

Pos logit contributions
 gor+0.011
 deliber+0.011
 Gor+0.010
fters+0.010
n+0.010
Neg logit contributions
?-0.006
by-0.006
 something-0.006
icker-0.006
 each-0.006
Token mom
Token ID1995
Feature activation-7.30e-01

Pos logit contributions
 gor+0.001
 Gor+0.001
n+0.001
riger+0.001
 deliber+0.001
Neg logit contributions
by-0.001
icker-0.001
?-0.001
ulum-0.001
 something-0.001
Tokenmy
Token ID1820
Feature activation-2.31e-01

Pos logit contributions
 gor+0.017
 Gor+0.017
n+0.017
 seventy+0.017
 deliber+0.017
Neg logit contributions
icker-0.011
by-0.010
?-0.009
ulum-0.009
 something-0.008
Token loved
Token ID6151
Feature activation+7.30e-01

Pos logit contributions
n+0.004
 ostr+0.004
 Gor+0.004
go+0.003
 gob+0.003
Neg logit contributions
icker-0.005
by-0.005
ulum-0.005
own-0.004
 opposite-0.004
Token her
Token ID607
Feature activation-7.18e-01

Pos logit contributions
icker+0.012
by+0.012
?+0.011
ulum+0.010
own+0.009
Neg logit contributions
 gor-0.017
 Gor-0.016
 deliber-0.016
n-0.016
 ostr-0.016
Token daughter
Token ID4957
Feature activation-5.76e-02

Pos logit contributions
 Gor+0.009
 ostr+0.009
 gor+0.009
 delighted+0.009
 deliber+0.008
Neg logit contributions
icker-0.018
by-0.018
ulum-0.018
eds-0.017
Simon-0.016
Token very
Token ID845
Feature activation-2.17e+00

Pos logit contributions
 delighted+0.001
 Gor+0.001
 thrilled+0.001
 gor+0.001
n+0.001
Neg logit contributions
icker-0.001
ulum-0.001
Simon-0.001
by-0.001
ucer-0.001
Token much
Token ID881
Feature activation-1.23e+00

Pos logit contributions
 gor+0.046
 deliber+0.045
rd+0.045
n+0.044
 delighted+0.044
Neg logit contributions
ulum-0.038
by-0.036
 Toby-0.034
fired-0.033
 pens-0.033
Token.
Token ID13
Feature activation-6.99e-01

Pos logit contributions
 gor+0.026
 Gor+0.025
 deliber+0.025
riger+0.023
 spe+0.023
Neg logit contributions
icker-0.023
ulum-0.022
by-0.021
Simon-0.019
Harry-0.019
Token One
Token ID1881
Feature activation+7.86e-01

Pos logit contributions
 gor+0.018
 Gor+0.018
n+0.017
 deliber+0.017
 seventy+0.017
Neg logit contributions
by-0.009
icker-0.009
?-0.008
ulum-0.007
 pens-0.007
Token day
Token ID1110
Feature activation+2.08e-01

Pos logit contributions
icker+0.020
by+0.019
ulum+0.019
 pens+0.018
 Toby+0.018
Neg logit contributions
 ostr-0.010
 Gor-0.010
 gor-0.010
n-0.009
go-0.009
Token,
Token ID11
Feature activation-2.37e-01

Pos logit contributions
by+0.004
icker+0.004
?+0.004
ulum+0.004
 pens+0.003
Neg logit contributions
 gor-0.004
 Gor-0.004
n-0.004
 deliber-0.004
 seventy-0.004
Token the
Token ID262
Feature activation-1.00e-01

Pos logit contributions
icker+0.013
by+0.013
?+0.011
ulum+0.011
own+0.010
Neg logit contributions
 gor-0.017
 Gor-0.016
 seventy-0.016
n-0.016
 ostr-0.015
Token pond
Token ID16723
Feature activation+5.34e-01

Pos logit contributions
 gor+0.002
 Gor+0.002
 ostr+0.002
 seventy+0.001
n+0.001
Neg logit contributions
icker-0.002
by-0.002
?-0.002
ulum-0.002
own-0.002
Token!"
Token ID2474
Feature activation+5.48e-01

Pos logit contributions
icker+0.010
by+0.010
ulum+0.009
 Toby+0.008
 Raz+0.008
Neg logit contributions
 gor-0.011
 Gor-0.011
n-0.010
 seventy-0.010
 spe-0.010
Token
Token ID198
Feature activation-2.73e-01

Pos logit contributions
by+0.015
important+0.014
icker+0.014
eds+0.013
ulum+0.013
Neg logit contributions
 Ph-0.006
n-0.006
 deliber-0.006
 Lib-0.005
 Once-0.005
Token
Token ID198
Feature activation-1.90e-01

Pos logit contributions
 gor+0.006
 Gor+0.006
 seventy+0.006
riger+0.006
 deliber+0.006
Neg logit contributions
by-0.004
icker-0.004
?-0.004
 each-0.004
 something-0.003
Token"
Token ID1
Feature activation-2.06e+00

Pos logit contributions
n+0.003
riger+0.003
fters+0.003
 gor+0.003
 seventy+0.003
Neg logit contributions
by-0.004
icker-0.004
ulum-0.004
?-0.004
bie-0.004
TokenOK
Token ID11380
Feature activation-3.74e-01

Pos logit contributions
 gor+0.043
 Gor+0.042
n+0.041
 deliber+0.041
 seventy+0.041
Neg logit contributions
by-0.036
icker-0.035
?-0.033
ulum-0.030
 pens-0.029
Token!"
Token ID2474
Feature activation+1.39e+00

Pos logit contributions
 gor+0.007
 Gor+0.007
n+0.007
 deliber+0.007
 seventy+0.007
Neg logit contributions
by-0.007
icker-0.007
?-0.006
ulum-0.006
 pens-0.006
Token Ben
Token ID3932
Feature activation+8.42e-01

Pos logit contributions
by+0.029
icker+0.028
eds+0.028
important+0.027
ulum+0.027
Neg logit contributions
 deliber-0.025
 Ph-0.025
n-0.023
 ostr-0.022
 Lib-0.022
Token says
Token ID1139
Feature activation-3.17e-03

Pos logit contributions
by+0.018
icker+0.017
?+0.017
ulum+0.015
 each+0.015
Neg logit contributions
 gor-0.014
 Gor-0.014
n-0.014
 deliber-0.013
riger-0.013
Token.
Token ID13
Feature activation-5.90e-01

Pos logit contributions
n+0.000
 deliber+0.000
 seventy+0.000
 Gor+0.000
 gor+0.000
Neg logit contributions
by-0.000
icker-0.000
ulum-0.000
?-0.000
 pens-0.000