TOP ACTIVATIONS
MAX = 3.861

friends stopped and watched in amaz ement . The dog too
her mom my watched in amaz ement as the machine flew
the zipper and gasped with amaz ement at all the treasures
and M ummy gasped with amaz ement . "
ered in , blinking in amaz ement . But
. Tim my watched with amaz ement as he saw stars
eddy bear and watched in amaz ement as it shrunk an
as the woman watched in amaz ement . She saw the
led as if it was brand new . Max was proud
sun as if it was brand new . Fe
. To her amaz ement , the clothes came
T ina waited with amaz ement as the old man
It was fixed and looked brand new ! Tim my was
air . He watched with amaz ement as the z ig
ly . Jill nodded in amaz ement .
L ucky watched in amaz ement as the frog zoom
, her eyes wide with amaz ement . Her
on her mom my 's brand new table !
wardrobe changed from ancient to brand new ! They looked around
Sam gasped in amaz ement . He had never

MIN ACTIVATIONS
MIN = -3.747

things . Gr ac ie saw bright yellow flowers
the mountain lion . Mir ac ulously , the lion backed
said , " A br ac ad ab ra !" But
air . Gr ac ie was walking with her
the mess and the qu arre ling twins . She is
air . " A br ac ad ab ra !"
said , " A br ac ad ab ra ! I
he found a rabbit qu arre lling with a squirrel .
it !" Isa ac gul ped and said ,
said , " A br ac ad ab ra !" Suddenly
his wet radio . Mir ac ulously , it made a
tidy . Gr ac ie felt very proud of
the story is : Qu arre lling over small things is
and said " A br ac ad ab ra !" but
cry . Gr ac ie 's mom was so
said , " A br ac ad ab ra , make
around . " A br ac ad ab ra !"
face . Gr ac ie started to cry and
her . Gr ac ie sat and listened to
wand . " A br ac ad ab ra !"

INTERVAL 1.959 to 3.861
CONTAINS 0.023%

were both happy that it wasn 't lost forever . <|endoftext|>
sweet dreams about carrots and sunshine . <|endoftext|> Once upon a
sorry and asked if there was anything he could do to
, but the stove just wasn 't there . They looked
stared at the pie in amaz ement . He just couldn

INTERVAL 0.057 to 1.959
CONTAINS 45.356%

a voice . It was one of the team 's moms
happy and felt lucky to have such great friends to play
the scenes . Anna 's friends practiced with her
and he could run and play again soon . <|endoftext|> Once
from Jill and Jack . He said he would give them

INTERVAL -1.845 to 0.057
CONTAINS 54.478%

a big bag of candy from her mom . She was
children . The end . <|endoftext|> One day , a little
Once upon a time , there was a little girl named
Tom learned their lesson . They learned that rain can be
should we do with the iron ?" Jane asked .

INTERVAL -3.747 to -1.845
CONTAINS 0.143%

to the beautiful poses she saw . Her Mom
. They have a big field near their house . They
notice . They dragged the kay ak to the shore and
on a branch , ch ir ping away .
They talked about what they saw and learned in the museum
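
The four interval boundaries listed above are consistent with splitting the range from MIN to MAX into four equal-width bins. The sketch below reproduces that arithmetic. It is an illustration only: the helper name `equal_width_edges` and the equal-width assumption are not stated in the source, and the CONTAINS percentages are copied from the listing rather than recomputed.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi (hypothetical helper)."""
    step = (hi - lo) / n_bins
    return [lo + i * step for i in range(n_bins + 1)]


MIN_ACT = -3.747  # MIN reported in the listing
MAX_ACT = 3.861   # MAX reported in the listing

# Assumption: the dashboard uses four equal-width intervals.
edges = [round(e, 3) for e in equal_width_edges(MIN_ACT, MAX_ACT, 4)]

# CONTAINS values copied from the listing, lowest interval first.
shares = [0.143, 54.478, 45.356, 0.023]
total_share = round(sum(shares), 3)

print(edges)
print(total_share)
```

Under this assumption each interval is 1.902 wide, and the four CONTAINS percentages account for all sampled activations.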
Token friends
Token ID2460
Feature activation+2.57e-01

Pos logit contributions
 least+0.001
 reality+0.001
 wishing+0.001
 heaven+0.001
 Gray+0.001
Neg logit contributions
adoes-0.001
 face-0.001
 structures-0.001
stals-0.001
ag-0.001
Token stopped
Token ID5025
Feature activation+9.11e-01

Pos logit contributions
 least+0.007
 reality+0.006
 wishing+0.006
 Gray+0.006
 heaven+0.006
Neg logit contributions
adoes-0.008
 structures-0.008
 face-0.007
stals-0.007
ag-0.007
Token and
Token ID290
Feature activation+5.41e-01

Pos logit contributions
 least+0.026
 reality+0.025
 wishing+0.025
 heaven+0.024
 knowing+0.024
Neg logit contributions
adoes-0.026
 face-0.025
stals-0.024
rac-0.024
 structures-0.024
Token watched
Token ID7342
Feature activation-3.09e-01

Pos logit contributions
 least+0.014
 reality+0.014
 Tooth+0.014
 heaven+0.014
rob+0.013
Neg logit contributions
 face-0.016
adoes-0.015
 structures-0.014
rac-0.014
asuring-0.014
Token in
Token ID287
Feature activation-7.96e-01

Pos logit contributions
adoes+0.010
rac+0.009
 structures+0.009
 face+0.009
stals+0.009
Neg logit contributions
 reality-0.007
 least-0.007
 wishing-0.007
 heaven-0.007
 knowing-0.007
Token amaz
Token ID40642
Feature activation+3.35e+00

Pos logit contributions
 face+0.024
adoes+0.022
 structures+0.021
ags+0.020
ag+0.020
Neg logit contributions
rob-0.022
 least-0.021
 reality-0.021
 Gray-0.020
 Tooth-0.020
Tokenement
Token ID972
Feature activation+7.82e-01

Pos logit contributions
 least+0.094
 reality+0.092
 Gray+0.090
 heaven+0.089
 Tooth+0.088
Neg logit contributions
 face-0.095
adoes-0.094
 structures-0.090
ag-0.087
rac-0.086
Token.
Token ID13
Feature activation+2.46e-01

Pos logit contributions
 least+0.019
 reality+0.019
 Tooth+0.018
rob+0.018
 Gray+0.018
Neg logit contributions
 face-0.025
adoes-0.025
 structures-0.024
ag-0.023
asuring-0.023
Token The
Token ID383
Feature activation+3.76e-01

Pos logit contributions
 least+0.008
 reality+0.008
 Gray+0.008
 heaven+0.008
 wishing+0.008
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.005
ag-0.005
rac-0.005
Token dog
Token ID3290
Feature activation-1.46e-01

Pos logit contributions
 least+0.012
 reality+0.012
 wishing+0.011
 Tooth+0.011
 heaven+0.011
Neg logit contributions
adoes-0.010
rac-0.009
ag-0.009
stals-0.009
 face-0.009
Token too
Token ID1165
Feature activation-1.14e+00

Pos logit contributions
adoes+0.004
 face+0.004
 structures+0.004
ag+0.004
rac+0.004
Neg logit contributions
 least-0.004
 reality-0.004
 Gray-0.004
 wishing-0.004
 heaven-0.004
Token her
Token ID607
Feature activation+2.33e-01

Pos logit contributions
 face+0.004
adoes+0.003
 structures+0.003
 in+0.003
ag+0.003
Neg logit contributions
rob-0.004
 least-0.004
 North-0.004
 reality-0.004
isc-0.004
Token mom
Token ID1995
Feature activation-3.68e-02

Pos logit contributions
rob+0.008
 least+0.008
 reality+0.008
ge+0.008
 Gray+0.008
Neg logit contributions
 face-0.006
adoes-0.005
 structures-0.004
ag-0.004
ap-0.004
Tokenmy
Token ID1820
Feature activation-2.37e-01

Pos logit contributions
 face+0.001
ap+0.001
adoes+0.001
asuring+0.001
ag+0.001
Neg logit contributions
 reality-0.001
rob-0.001
 certainty-0.001
 Gray-0.001
 outer-0.001
Token watched
Token ID7342
Feature activation-9.39e-01

Pos logit contributions
 face+0.006
adoes+0.005
ap+0.005
 in+0.005
asuring+0.005
Neg logit contributions
 reality-0.008
rob-0.008
 certainty-0.007
 least-0.007
 outer-0.007
Token in
Token ID287
Feature activation-1.21e+00

Pos logit contributions
 face+0.028
adoes+0.027
 structures+0.026
ag+0.025
rac+0.025
Neg logit contributions
 least-0.025
 reality-0.025
 Gray-0.023
 heaven-0.023
 wishing-0.023
Token amaz
Token ID40642
Feature activation+3.22e+00

Pos logit contributions
 face+0.031
adoes+0.028
 structures+0.027
ags+0.024
ap+0.024
Neg logit contributions
rob-0.040
 least-0.038
 Gray-0.037
 reality-0.037
 Tooth-0.037
Tokenement
Token ID972
Feature activation+3.73e-01

Pos logit contributions
 least+0.094
 Gray+0.092
 reality+0.092
 wishing+0.089
 Tooth+0.089
Neg logit contributions
 face-0.087
adoes-0.086
 structures-0.081
ag-0.080
rac-0.078
Token as
Token ID355
Feature activation+6.01e-01

Pos logit contributions
 least+0.010
 reality+0.010
rob+0.009
 Tooth+0.009
 Gray+0.009
Neg logit contributions
 face-0.011
adoes-0.011
 structures-0.011
asuring-0.010
ag-0.010
Token the
Token ID262
Feature activation-2.15e-01

Pos logit contributions
rob+0.017
 least+0.017
ge+0.015
ited+0.015
 Gray+0.015
Neg logit contributions
 face-0.018
adoes-0.017
 structures-0.016
 in-0.015
 squash-0.015
Token machine
Token ID4572
Feature activation+4.97e-02

Pos logit contributions
adoes+0.006
 face+0.005
rac+0.005
 structures+0.005
ag+0.005
Neg logit contributions
 reality-0.007
 least-0.007
 wishing-0.006
 Tooth-0.006
 heaven-0.006
Token flew
Token ID13112
Feature activation+7.68e-01

Pos logit contributions
 reality+0.002
 least+0.001
 Tooth+0.001
 heaven+0.001
rob+0.001
Neg logit contributions
 face-0.001
adoes-0.001
 structures-0.001
ag-0.001
asuring-0.001
Token the
Token ID262
Feature activation+1.23e-01

Pos logit contributions
 least+0.009
 reality+0.009
 heaven+0.009
rob+0.009
 Tooth+0.008
Neg logit contributions
 face-0.007
adoes-0.007
 structures-0.006
rac-0.006
ag-0.006
Token zipper
Token ID48992
Feature activation+8.46e-01

Pos logit contributions
 heaven+0.004
 reality+0.004
 least+0.004
rob+0.004
 someday+0.003
Neg logit contributions
 face-0.003
adoes-0.003
 structures-0.003
rac-0.003
stals-0.003
Token and
Token ID290
Feature activation+2.05e-01

Pos logit contributions
 reality+0.035
 heaven+0.034
 least+0.033
 certainty+0.033
 Tooth+0.033
Neg logit contributions
 face-0.015
adoes-0.012
 in-0.011
 structures-0.011
uting-0.011
Token gasped
Token ID45236
Feature activation+1.15e+00

Pos logit contributions
 least+0.006
 reality+0.006
 heaven+0.006
rob+0.006
 Tooth+0.006
Neg logit contributions
 face-0.005
adoes-0.005
rac-0.005
 structures-0.005
uting-0.005
Token with
Token ID351
Feature activation+6.44e-01

Pos logit contributions
 least+0.038
rob+0.037
 reality+0.036
 heaven+0.036
 Tooth+0.036
Neg logit contributions
 face-0.028
adoes-0.027
 structures-0.025
uting-0.024
asuring-0.024
Token amaz
Token ID40642
Feature activation+3.17e+00

Pos logit contributions
ge+0.021
rob+0.021
 least+0.020
 reality+0.020
 Gray+0.020
Neg logit contributions
 face-0.017
adoes-0.015
 structures-0.014
rac-0.014
ag-0.013
Tokenement
Token ID972
Feature activation+9.06e-01

Pos logit contributions
 least+0.094
 reality+0.091
 Gray+0.090
 heaven+0.090
 Tooth+0.088
Neg logit contributions
 face-0.086
adoes-0.085
 structures-0.079
ag-0.078
rac-0.077
Token at
Token ID379
Feature activation+2.40e-01

Pos logit contributions
 least+0.022
 reality+0.021
rob+0.021
 Tooth+0.020
 heaven+0.020
Neg logit contributions
 face-0.031
adoes-0.029
 structures-0.028
asuring-0.027
ag-0.027
Token all
Token ID477
Feature activation+3.61e-01

Pos logit contributions
rob+0.009
tail+0.009
 heaven+0.009
ited+0.009
 therapist+0.009
Neg logit contributions
 face-0.005
adoes-0.004
 structures-0.004
rac-0.004
uting-0.004
Token the
Token ID262
Feature activation+7.99e-02

Pos logit contributions
 least+0.011
 reality+0.011
 heaven+0.010
 Tooth+0.010
 Gray+0.010
Neg logit contributions
 face-0.010
adoes-0.010
 structures-0.009
ag-0.009
rac-0.009
Token treasures
Token ID29561
Feature activation+9.73e-01

Pos logit contributions
 least+0.002
 reality+0.002
 heaven+0.002
 Gray+0.002
 someday+0.002
Neg logit contributions
 face-0.002
adoes-0.002
 structures-0.002
ag-0.002
stals-0.002
Token and
Token ID290
Feature activation-1.00e-01

Pos logit contributions
 face+0.005
adoes+0.004
rac+0.004
asuring+0.004
 structures+0.004
Neg logit contributions
 reality-0.005
 least-0.005
 heaven-0.005
rob-0.005
 Tooth-0.005
Token M
Token ID337
Feature activation-4.22e-01

Pos logit contributions
 face+0.003
adoes+0.003
rac+0.003
 structures+0.003
asuring+0.003
Neg logit contributions
 least-0.003
rob-0.003
 reality-0.003
 heaven-0.002
ge-0.002
Tokenummy
Token ID13513
Feature activation+1.01e+00

Pos logit contributions
adoes+0.009
 face+0.009
 structures+0.008
rac+0.008
ag+0.008
Neg logit contributions
 least-0.015
 reality-0.014
 heaven-0.014
 Gray-0.014
 Tooth-0.014
Token gasped
Token ID45236
Feature activation+9.99e-01

Pos logit contributions
 reality+0.030
 least+0.030
 heaven+0.030
rob+0.029
 Gray+0.029
Neg logit contributions
 face-0.028
adoes-0.024
 structures-0.024
asuring-0.023
rac-0.023
Token with
Token ID351
Feature activation+6.36e-01

Pos logit contributions
 least+0.035
 reality+0.035
 heaven+0.034
 Gray+0.033
 wishing+0.033
Neg logit contributions
 face-0.021
adoes-0.020
 structures-0.019
ag-0.018
rac-0.018
Token amaz
Token ID40642
Feature activation+3.12e+00

Pos logit contributions
 least+0.017
 reality+0.017
rob+0.016
ge+0.016
 Gray+0.016
Neg logit contributions
 face-0.019
adoes-0.018
 structures-0.017
rac-0.017
ag-0.017
Tokenement
Token ID972
Feature activation+8.12e-01

Pos logit contributions
 least+0.086
 reality+0.084
 heaven+0.080
 Gray+0.080
 Tooth+0.079
Neg logit contributions
 face-0.090
adoes-0.090
 structures-0.084
ag-0.082
rac-0.082
Token.
Token ID13
Feature activation-1.32e-01

Pos logit contributions
 least+0.018
 reality+0.018
 heaven+0.017
 Tooth+0.017
 Gray+0.017
Neg logit contributions
 face-0.028
adoes-0.027
 structures-0.026
ag-0.025
asuring-0.025
Token\n
Token ID198
Feature activation-5.49e-02

Pos logit contributions
adoes+0.004
 face+0.004
ag+0.004
 structures+0.004
rac+0.004
Neg logit contributions
 least-0.003
 Gray-0.003
 reality-0.003
 wishing-0.003
 anytime-0.003
Token\n
Token ID198
Feature activation+9.30e-02

Pos logit contributions
adoes+0.002
 face+0.002
 structures+0.002
asuring+0.002
ag+0.002
Neg logit contributions
 reality-0.001
 least-0.001
 Gray-0.001
 anytime-0.001
 heaven-0.001
Token"
Token ID1
Feature activation+1.04e-01

Pos logit contributions
T+0.002
 reality+0.002
 anytime+0.002
Charl+0.002
 she+0.002
Neg logit contributions
adoes-0.003
 face-0.003
ags-0.003
ag-0.003
 structures-0.003
Tokenered
Token ID1068
Feature activation-6.59e-01

Pos logit contributions
 face+0.011
adoes+0.011
 structures+0.011
stals+0.010
ag+0.010
Neg logit contributions
 least-0.022
 reality-0.021
ge-0.020
 wishing-0.020
 heaven-0.020
Token in
Token ID287
Feature activation-1.13e+00

Pos logit contributions
adoes+0.014
 face+0.014
 structures+0.013
rac+0.013
ag+0.013
Neg logit contributions
 least-0.022
 reality-0.022
 heaven-0.021
 wishing-0.021
 Gray-0.021
Token,
Token ID11
Feature activation-7.88e-02

Pos logit contributions
 face+0.030
adoes+0.027
 structures+0.027
ag+0.025
asuring+0.024
Neg logit contributions
 least-0.035
rob-0.035
 reality-0.034
 Tooth-0.033
 Gray-0.033
Token blinking
Token ID43196
Feature activation-5.46e-02

Pos logit contributions
 face+0.002
adoes+0.002
 structures+0.002
rac+0.002
ag+0.002
Neg logit contributions
 least-0.002
 reality-0.002
 heaven-0.002
rob-0.002
 Tooth-0.002
Token in
Token ID287
Feature activation-6.51e-01

Pos logit contributions
adoes+0.001
 face+0.001
 structures+0.001
ag+0.001
asuring+0.001
Neg logit contributions
 reality-0.002
 least-0.002
 wishing-0.002
 Gray-0.002
 heaven-0.002
Token amaz
Token ID40642
Feature activation+3.10e+00

Pos logit contributions
 face+0.017
adoes+0.016
 structures+0.015
ag+0.014
rac+0.013
Neg logit contributions
 least-0.021
rob-0.021
 reality-0.020
 Tooth-0.019
 Gray-0.019
Tokenement
Token ID972
Feature activation+2.24e-01

Pos logit contributions
 least+0.090
 reality+0.087
 Gray+0.086
 heaven+0.084
 Tooth+0.084
Neg logit contributions
 face-0.086
adoes-0.085
 structures-0.080
ag-0.079
rac-0.077
Token.
Token ID13
Feature activation-3.93e-01

Pos logit contributions
 least+0.006
 reality+0.005
rob+0.005
 Tooth+0.005
 heaven+0.005
Neg logit contributions
 face-0.007
adoes-0.007
 structures-0.007
ag-0.007
asuring-0.007
Token\n
Token ID198
Feature activation-9.43e-02

Pos logit contributions
 face+0.010
adoes+0.010
 structures+0.010
rac+0.009
ag+0.009
Neg logit contributions
 least-0.012
 reality-0.012
 heaven-0.011
 Gray-0.011
 wishing-0.011
Token\n
Token ID198
Feature activation+1.06e-01

Pos logit contributions
 face+0.003
adoes+0.003
 structures+0.003
ag+0.002
rac+0.002
Neg logit contributions
 least-0.003
 reality-0.003
 heaven-0.002
 Gray-0.002
 wishing-0.002
TokenBut
Token ID1537
Feature activation+1.25e+00

Pos logit contributions
 reality+0.003
 least+0.003
 Gray+0.003
 heaven+0.003
 anytime+0.003
Neg logit contributions
adoes-0.003
 face-0.003
 structures-0.003
ags-0.003
ag-0.003
Token.
Token ID13
Feature activation-5.14e-01

Pos logit contributions
 face+0.017
adoes+0.016
asuring+0.015
ag+0.015
 structures+0.015
Neg logit contributions
rob-0.022
 reality-0.022
 Tooth-0.021
 least-0.021
 Gray-0.021
Token Tim
Token ID5045
Feature activation-8.88e-02

Pos logit contributions
 face+0.013
adoes+0.013
 structures+0.012
rac+0.012
ag+0.011
Neg logit contributions
 least-0.016
 reality-0.016
 heaven-0.015
 wishing-0.015
 Tooth-0.015
Tokenmy
Token ID1820
Feature activation-6.82e-01

Pos logit contributions
 face+0.002
ag+0.002
arre+0.002
adoes+0.002
ap+0.002
Neg logit contributions
 reality-0.003
 possible-0.003
 outer-0.003
 heaven-0.003
rob-0.003
Token watched
Token ID7342
Feature activation-7.22e-01

Pos logit contributions
 face+0.017
adoes+0.016
 structures+0.014
rac+0.014
ag+0.014
Neg logit contributions
 reality-0.022
 least-0.022
 Tooth-0.021
 heaven-0.021
rob-0.021
Token with
Token ID351
Feature activation+4.13e-01

Pos logit contributions
adoes+0.024
 structures+0.023
stals+0.023
rac+0.023
 face+0.023
Neg logit contributions
 reality-0.016
 least-0.015
 wishing-0.014
 Gray-0.014
 heaven-0.014
Token amaz
Token ID40642
Feature activation+3.10e+00

Pos logit contributions
rob+0.015
ge+0.014
iko+0.014
 least+0.014
 Tooth+0.014
Neg logit contributions
 face-0.010
adoes-0.009
 structures-0.008
 squash-0.007
ag-0.007
Tokenement
Token ID972
Feature activation+5.30e-01

Pos logit contributions
 least+0.088
 reality+0.086
 heaven+0.084
 Gray+0.084
 wishing+0.082
Neg logit contributions
 face-0.087
adoes-0.086
 structures-0.082
ag-0.080
rac-0.079
Token as
Token ID355
Feature activation+5.67e-01

Pos logit contributions
 least+0.015
 reality+0.015
 Tooth+0.014
rob+0.014
 heaven+0.014
Neg logit contributions
 face-0.015
adoes-0.015
 structures-0.014
asuring-0.014
ag-0.013
Token he
Token ID339
Feature activation+1.72e-01

Pos logit contributions
rob+0.019
 least+0.019
ited+0.018
tail+0.017
 heaven+0.017
Neg logit contributions
 face-0.014
adoes-0.013
 structures-0.013
 in-0.012
 towers-0.011
Token saw
Token ID2497
Feature activation-1.25e+00

Pos logit contributions
 heaven+0.006
 Tooth+0.006
rob+0.006
 reality+0.006
 possible+0.006
Neg logit contributions
 face-0.004
rac-0.003
asuring-0.003
 in-0.003
adoes-0.003
Token stars
Token ID5788
Feature activation+7.37e-01

Pos logit contributions
 face+0.029
adoes+0.028
 structures+0.027
 towers+0.024
ag+0.022
Neg logit contributions
 least-0.043
ge-0.043
 heaven-0.042
tail-0.041
 reality-0.041
Tokeneddy
Token ID21874
Feature activation-1.82e-01

Pos logit contributions
 least+0.012
 wishing+0.012
 reality+0.011
 Gray+0.011
 heaven+0.011
Neg logit contributions
adoes-0.014
 face-0.014
rac-0.013
 structures-0.013
ag-0.013
Token bear
Token ID6842
Feature activation+4.96e-02

Pos logit contributions
 face+0.005
adoes+0.005
stals+0.004
 structures+0.004
rac+0.004
Neg logit contributions
 reality-0.006
ge-0.005
 suitable-0.005
 least-0.005
 certainty-0.005
Token and
Token ID290
Feature activation+3.23e-01

Pos logit contributions
 reality+0.002
rob+0.002
 least+0.002
 certainty+0.002
 Tooth+0.002
Neg logit contributions
 face-0.001
adoes-0.001
 structures-0.001
asuring-0.001
ap-0.001
Token watched
Token ID7342
Feature activation-6.87e-01

Pos logit contributions
 least+0.009
rob+0.009
ge+0.009
 reality+0.008
 anytime+0.008
Neg logit contributions
 face-0.010
adoes-0.009
 structures-0.009
 squash-0.008
 faces-0.008
Token in
Token ID287
Feature activation-1.26e+00

Pos logit contributions
 face+0.019
adoes+0.019
 structures+0.018
ag+0.017
rac+0.017
Neg logit contributions
 least-0.020
 reality-0.019
rob-0.018
 heaven-0.018
 Gray-0.018
Token amaz
Token ID40642
Feature activation+3.04e+00

Pos logit contributions
 face+0.034
 structures+0.027
adoes+0.026
 in+0.026
 faces+0.025
Neg logit contributions
rob-0.047
iko-0.040
 least-0.039
isc-0.038
 Gray-0.038
Tokenement
Token ID972
Feature activation+4.49e-01

Pos logit contributions
 least+0.098
 Gray+0.097
 wishing+0.094
 reality+0.094
 Tooth+0.093
Neg logit contributions
 face-0.076
adoes-0.075
ag-0.069
 structures-0.069
asuring-0.067
Token as
Token ID355
Feature activation+4.02e-01

Pos logit contributions
 least+0.011
 reality+0.011
rob+0.011
 Tooth+0.011
 Gray+0.010
Neg logit contributions
 face-0.015
adoes-0.014
 structures-0.013
ag-0.013
asuring-0.013
Token it
Token ID340
Feature activation+1.98e-01

Pos logit contributions
rob+0.014
 least+0.013
ited+0.013
ge+0.012
iko+0.011
Neg logit contributions
 face-0.010
adoes-0.009
 in-0.009
 structures-0.009
 squash-0.008
Token shrunk
Token ID38900
Feature activation+9.30e-01

Pos logit contributions
 least+0.006
 reality+0.006
 heaven+0.006
rob+0.006
 Tooth+0.006
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.004
ag-0.004
anned-0.004
Token an
Token ID281
Feature activation+5.78e-01

Pos logit contributions
 least+0.029
rob+0.028
 Tooth+0.027
 wishing+0.027
 reality+0.027
Neg logit contributions
adoes-0.026
 face-0.025
 structures-0.023
ag-0.022
 in-0.021
Token as
Token ID355
Feature activation+8.52e-01

Pos logit contributions
 reality+0.011
 least+0.010
 Tooth+0.010
 heaven+0.010
 certainty+0.010
Neg logit contributions
 face-0.005
adoes-0.005
ag-0.005
 structures-0.004
rac-0.004
Token the
Token ID262
Feature activation-9.44e-02

Pos logit contributions
rob+0.028
 least+0.028
 heaven+0.027
tail+0.026
 Tooth+0.025
Neg logit contributions
 in-0.020
 face-0.019
adoes-0.018
 structures-0.018
ull-0.018
Token woman
Token ID2415
Feature activation+3.90e-02

Pos logit contributions
 face+0.002
adoes+0.002
 structures+0.002
ag+0.002
rac+0.002
Neg logit contributions
 least-0.003
 reality-0.003
 heaven-0.003
 Gray-0.003
 wishing-0.003
Token watched
Token ID7342
Feature activation-4.90e-01

Pos logit contributions
 reality+0.001
 Tooth+0.001
rob+0.001
 heaven+0.001
 least+0.001
Neg logit contributions
 face-0.001
 in-0.001
ag-0.001
ap-0.001
rac-0.001
Token in
Token ID287
Feature activation-8.40e-01

Pos logit contributions
 face+0.014
adoes+0.014
 structures+0.013
ag+0.013
rac+0.012
Neg logit contributions
 least-0.014
 reality-0.013
rob-0.013
 heaven-0.013
 Tooth-0.012
Token amaz
Token ID40642
Feature activation+3.01e+00

Pos logit contributions
 face+0.022
 structures+0.017
 pitch+0.017
adoes+0.016
ap+0.016
Neg logit contributions
rob-0.031
 Tooth-0.027
och-0.027
ilda-0.026
ge-0.026
Tokenement
Token ID972
Feature activation+6.81e-01

Pos logit contributions
 least+0.088
 reality+0.085
 Gray+0.085
 heaven+0.083
 Tooth+0.083
Neg logit contributions
 face-0.083
adoes-0.082
 structures-0.077
ag-0.076
rac-0.075
Token.
Token ID13
Feature activation+2.13e-01

Pos logit contributions
 least+0.019
rob+0.019
 reality+0.019
 Tooth+0.018
 heaven+0.018
Neg logit contributions
 face-0.020
adoes-0.019
 structures-0.018
ag-0.018
ap-0.018
Token She
Token ID1375
Feature activation+3.46e-01

Pos logit contributions
rob+0.007
 least+0.007
 reality+0.007
ge+0.007
 heaven+0.007
Neg logit contributions
 face-0.005
adoes-0.004
uting-0.004
 structures-0.004
rac-0.004
Token saw
Token ID2497
Feature activation-1.15e+00

Pos logit contributions
 least+0.010
 heaven+0.010
 reality+0.010
 Tooth+0.010
rob+0.010
Neg logit contributions
 face-0.009
adoes-0.008
rac-0.008
ag-0.008
 structures-0.008
Token the
Token ID262
Feature activation-4.85e-01

Pos logit contributions
 face+0.027
adoes+0.025
 structures+0.024
ag+0.022
 jokes+0.021
Neg logit contributions
 least-0.039
ge-0.038
rob-0.037
 reality-0.037
 heaven-0.037
Tokenled
Token ID992
Feature activation-4.15e-03

Pos logit contributions
 least+0.058
 reality+0.056
 heaven+0.055
 Tooth+0.055
 wishing+0.055
Neg logit contributions
 face-0.035
adoes-0.035
ag-0.031
 structures-0.030
stals-0.030
Token as
Token ID355
Feature activation+1.14e+00

Pos logit contributions
 face+0.000
adoes+0.000
 structures+0.000
uting+0.000
asuring+0.000
Neg logit contributions
rob-0.000
 least-0.000
 Tooth-0.000
 Gray-0.000
 heaven-0.000
Token if
Token ID611
Feature activation-5.85e-01

Pos logit contributions
rob+0.035
 least+0.035
 heaven+0.033
tail+0.033
ited+0.033
Neg logit contributions
adoes-0.030
 face-0.030
 structures-0.028
stals-0.027
uting-0.025
Token it
Token ID340
Feature activation+4.21e-01

Pos logit contributions
 face+0.016
adoes+0.016
 structures+0.015
stals+0.014
rac+0.014
Neg logit contributions
 least-0.017
rob-0.017
 reality-0.016
 heaven-0.016
ge-0.016
Token was
Token ID373
Feature activation+9.16e-01

Pos logit contributions
 reality+0.014
 heaven+0.014
 least+0.014
rob+0.013
 wishing+0.013
Neg logit contributions
 face-0.010
adoes-0.009
 structures-0.008
asuring-0.008
uting-0.008
Token brand
Token ID4508
Feature activation+2.96e+00

Pos logit contributions
rob+0.028
 least+0.028
 reality+0.027
 Gray+0.027
 heaven+0.026
Neg logit contributions
 face-0.024
adoes-0.022
 structures-0.021
ag-0.021
asuring-0.020
Token new
Token ID649
Feature activation+1.01e+00

Pos logit contributions
rob+0.092
 Gray+0.087
 heaven+0.086
 least+0.086
 Tooth+0.083
Neg logit contributions
 face-0.085
 structures-0.075
adoes-0.075
ag-0.073
uting-0.071
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
rob+0.033
 Tooth+0.032
 least+0.032
 Gray+0.032
 reality+0.031
Neg logit contributions
 face-0.024
adoes-0.024
 structures-0.023
asuring-0.021
 in-0.019
Token Max
Token ID5436
Feature activation+2.35e-01

Pos logit contributions
 face+0.008
adoes+0.008
 structures+0.007
rac+0.007
ag+0.007
Neg logit contributions
 least-0.009
 reality-0.009
 heaven-0.008
 wishing-0.008
 Tooth-0.008
Token was
Token ID373
Feature activation-1.70e-01

Pos logit contributions
 least+0.007
 reality+0.007
 heaven+0.006
 Tooth+0.006
 Gray+0.006
Neg logit contributions
 face-0.007
adoes-0.006
 structures-0.006
rac-0.006
ag-0.006
Token proud
Token ID6613
Feature activation+4.56e-01

Pos logit contributions
 face+0.005
adoes+0.004
 structures+0.004
asuring+0.004
ag+0.004
Neg logit contributions
 reality-0.005
 least-0.005
rob-0.005
 Gray-0.005
 Tooth-0.005
Token sun
Token ID4252
Feature activation+6.36e-01

Pos logit contributions
 least+0.011
 reality+0.011
 Tooth+0.011
 wishing+0.011
 heaven+0.011
Neg logit contributions
adoes-0.008
asuring-0.007
 structures-0.007
rac-0.007
 face-0.007
Token as
Token ID355
Feature activation+1.00e+00

Pos logit contributions
 reality+0.023
 least+0.023
 heaven+0.022
 certainty+0.022
rob+0.022
Neg logit contributions
 face-0.014
adoes-0.012
ap-0.011
 structures-0.011
uting-0.011
Token if
Token ID611
Feature activation-7.06e-01

Pos logit contributions
 least+0.024
rob+0.024
 Gray+0.022
 heaven+0.022
ge+0.021
Neg logit contributions
 face-0.033
adoes-0.033
 structures-0.032
asuring-0.030
stals-0.030
Token it
Token ID340
Feature activation+3.70e-01

Pos logit contributions
adoes+0.023
 face+0.023
 structures+0.022
ag+0.022
rac+0.021
Neg logit contributions
 least-0.016
 reality-0.016
 heaven-0.015
 Gray-0.015
 wishing-0.015
Token was
Token ID373
Feature activation+1.02e+00

Pos logit contributions
 reality+0.013
 least+0.013
 heaven+0.013
rob+0.012
 certainty+0.012
Neg logit contributions
 face-0.008
adoes-0.007
rac-0.006
anned-0.006
asuring-0.006
Token brand
Token ID4508
Feature activation+2.96e+00

Pos logit contributions
rob+0.036
 least+0.032
 Gray+0.031
 North+0.030
 heaven+0.030
Neg logit contributions
 face-0.027
adoes-0.023
ag-0.022
asuring-0.021
ags-0.021
Token new
Token ID649
Feature activation+1.19e+00

Pos logit contributions
rob+0.092
 least+0.090
 heaven+0.089
 Gray+0.089
 dreaming+0.086
Neg logit contributions
 face-0.083
adoes-0.073
ag-0.073
 structures-0.073
ades-0.068
Token.
Token ID13
Feature activation-3.80e-01

Pos logit contributions
rob+0.045
 Tooth+0.043
 least+0.042
 Gray+0.042
Ba+0.042
Neg logit contributions
 face-0.024
adoes-0.023
 structures-0.021
 in-0.020
asuring-0.019
Token\n
Token ID198
Feature activation-3.20e-01

Pos logit contributions
adoes+0.012
ag+0.011
 face+0.011
stals+0.011
rac+0.011
Neg logit contributions
 least-0.009
 reality-0.009
 anytime-0.009
 heaven-0.009
 wishing-0.009
Token\n
Token ID198
Feature activation-1.55e-01

Pos logit contributions
adoes+0.010
 face+0.010
 structures+0.009
rac+0.009
ag+0.009
Neg logit contributions
 least-0.008
 reality-0.008
 heaven-0.008
 Gray-0.008
 wishing-0.008
TokenFe
Token ID14304
Feature activation-3.61e-01

Pos logit contributions
adoes+0.006
ags+0.005
 face+0.005
ag+0.005
stals+0.005
Neg logit contributions
 she-0.003
 reality-0.003
 anytime-0.003
 someday-0.003
 knowing-0.003
Token.
Token ID13
Feature activation-3.40e-01

Pos logit contributions
 face+0.000
adoes+0.000
 structures+0.000
ag+0.000
ap+0.000
Neg logit contributions
 reality-0.000
 least-0.000
rob-0.000
 Tooth-0.000
 certainty-0.000
Token\n
Token ID198
Feature activation+1.68e-01

Pos logit contributions
 face+0.008
adoes+0.008
 structures+0.007
rac+0.007
asuring+0.007
Neg logit contributions
 least-0.011
 reality-0.011
rob-0.010
 wishing-0.010
 heaven-0.010
Token\n
Token ID198
Feature activation+3.62e-01

Pos logit contributions
 least+0.005
 reality+0.005
 heaven+0.005
 wishing+0.005
 Gray+0.005
Neg logit contributions
 face-0.004
adoes-0.004
 structures-0.004
ag-0.004
rac-0.004
TokenTo
Token ID2514
Feature activation+2.14e-01

Pos logit contributions
 reality+0.010
 least+0.010
 heaven+0.010
 Gray+0.010
 Tooth+0.010
Neg logit contributions
adoes-0.010
 face-0.010
 structures-0.009
ag-0.009
stals-0.009
Token her
Token ID607
Feature activation+2.25e-01

Pos logit contributions
rob+0.006
 reality+0.006
 least+0.006
 wishing+0.006
ge+0.006
Neg logit contributions
 face-0.006
adoes-0.006
rac-0.005
 structures-0.005
ap-0.005
Token amaz
Token ID40642
Feature activation+2.96e+00

Pos logit contributions
rob+0.008
ge+0.007
 least+0.007
 reality+0.007
 wishing+0.007
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.005
ap-0.004
rac-0.004
Tokenement
Token ID972
Feature activation+4.95e-01

Pos logit contributions
 least+0.077
 reality+0.076
 heaven+0.071
 Gray+0.070
 wishing+0.070
Neg logit contributions
 face-0.088
adoes-0.088
 structures-0.084
ag-0.081
rac-0.081
Token,
Token ID11
Feature activation-6.06e-03

Pos logit contributions
 least+0.015
 reality+0.014
rob+0.014
 heaven+0.014
 Tooth+0.014
Neg logit contributions
 face-0.013
adoes-0.013
 structures-0.012
ag-0.012
asuring-0.012
Token the
Token ID262
Feature activation-2.37e-01

Pos logit contributions
 face+0.000
adoes+0.000
 structures+0.000
rac+0.000
asuring+0.000
Neg logit contributions
 least-0.000
 reality-0.000
rob-0.000
 heaven-0.000
ge-0.000
Token clothes
Token ID8242
Feature activation+1.33e-01

Pos logit contributions
adoes+0.006
 structures+0.006
rac+0.006
ag+0.006
 face+0.006
Neg logit contributions
 reality-0.007
 least-0.007
 wishing-0.007
 Tooth-0.007
 Gray-0.007
Token came
Token ID1625
Feature activation+1.72e-01

Pos logit contributions
 least+0.004
 reality+0.004
 wishing+0.004
 heaven+0.003
 Gray+0.003
Neg logit contributions
adoes-0.004
 face-0.004
 structures-0.003
ag-0.003
asuring-0.003
Token\n
Token ID198
Feature activation+1.41e-01

Pos logit contributions
 face+0.001
<+0.001
ag+0.001
Everyone+0.001
ades+0.001
Neg logit contributions
 least-0.002
 reality-0.002
ited-0.002
 wishing-0.002
 heaven-0.002
TokenT
Token ID51
Feature activation-4.59e-01

Pos logit contributions
 least+0.003
rob+0.003
 wishing+0.003
 heaven+0.003
 reality+0.003
Neg logit contributions
 face-0.004
adoes-0.004
rac-0.004
ades-0.004
stals-0.004
Tokenina
Token ID1437
Feature activation-4.23e-01

Pos logit contributions
adoes+0.016
 face+0.015
stals+0.015
uting+0.015
arre+0.014
Neg logit contributions
 heaven-0.011
 wishing-0.010
 least-0.010
tail-0.010
 reality-0.010
Token waited
Token ID13488
Feature activation-1.23e-01

Pos logit contributions
adoes+0.013
 structures+0.013
 face+0.013
ag+0.013
ags+0.012
Neg logit contributions
 least-0.010
 reality-0.010
 wishing-0.009
 Gray-0.009
 heaven-0.009
Token with
Token ID351
Feature activation+3.59e-01

Pos logit contributions
 face+0.004
adoes+0.003
 structures+0.003
uting+0.003
ag+0.003
Neg logit contributions
 least-0.003
rob-0.003
 reality-0.003
 heaven-0.003
 Gray-0.003
Token amaz
Token ID40642
Feature activation+2.95e+00

Pos logit contributions
rob+0.014
ge+0.013
iw+0.013
och+0.012
pp+0.012
Neg logit contributions
 face-0.009
ags-0.007
wered-0.007
 eyes-0.007
 jokes-0.007
Tokenement
Token ID972
Feature activation+1.07e+00

Pos logit contributions
 least+0.082
 reality+0.081
 heaven+0.077
 Gray+0.077
 wishing+0.076
Neg logit contributions
adoes-0.085
 face-0.084
 structures-0.079
ag-0.078
rac-0.077
Token as
Token ID355
Feature activation+6.74e-01

Pos logit contributions
rob+0.036
 least+0.035
 reality+0.035
 Tooth+0.033
 heaven+0.033
Neg logit contributions
 face-0.028
adoes-0.025
 structures-0.023
asuring-0.023
ap-0.023
Token the
Token ID262
Feature activation-5.04e-01

Pos logit contributions
rob+0.023
ited+0.021
 least+0.021
handed+0.019
tail+0.019
Neg logit contributions
adoes-0.018
 face-0.017
 in-0.017
stals-0.016
 structures-0.014
Token old
Token ID1468
Feature activation+6.92e-01

Pos logit contributions
 face+0.014
adoes+0.014
 structures+0.013
ag+0.013
rac+0.012
Neg logit contributions
 least-0.015
 reality-0.014
 heaven-0.014
 Gray-0.013
rob-0.013
Token man
Token ID582
Feature activation+2.70e-01

Pos logit contributions
 least+0.020
 reality+0.019
 heaven+0.018
 Gray+0.018
rob+0.018
Neg logit contributions
 face-0.020
adoes-0.019
 structures-0.018
ag-0.017
rac-0.017
Token It
Token ID632
Feature activation+7.81e-01

Pos logit contributions
 least+0.002
 reality+0.002
 heaven+0.002
 Gray+0.002
 wishing+0.002
Neg logit contributions
 face-0.002
adoes-0.002
 structures-0.002
ag-0.002
rac-0.002
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 heaven+0.039
 reality+0.036
 certainty+0.036
 supervision+0.035
iley+0.035
Neg logit contributions
 face-0.009
 in-0.007
adoes-0.006
 crossed-0.006
uate-0.005
Token fixed
Token ID5969
Feature activation+1.13e+00

Pos logit contributions
rob+0.033
 least+0.033
 heaven+0.032
 reality+0.032
 therapist+0.032
Neg logit contributions
 face-0.025
adoes-0.022
 structures-0.020
ag-0.020
asuring-0.020
Token and
Token ID290
Feature activation+7.23e-01

Pos logit contributions
 least+0.044
 heaven+0.044
 certainty+0.044
rob+0.043
 reality+0.043
Neg logit contributions
adoes-0.021
 face-0.020
 in-0.018
 structures-0.017
uting-0.016
Token looked
Token ID3114
Feature activation+3.63e-01

Pos logit contributions
 least+0.022
 reality+0.022
 heaven+0.021
 Gray+0.021
 Tooth+0.021
Neg logit contributions
 face-0.019
adoes-0.018
 structures-0.017
rac-0.016
ag-0.016
Token brand
Token ID4508
Feature activation+2.93e+00

Pos logit contributions
rob+0.013
 therapist+0.012
iko+0.012
 least+0.012
 Tooth+0.012
Neg logit contributions
 face-0.009
adoes-0.008
 in-0.007
 structures-0.007
ag-0.007
Token new
Token ID649
Feature activation+1.02e+00

Pos logit contributions
 least+0.079
 heaven+0.078
 reality+0.077
 Gray+0.074
ge+0.073
Neg logit contributions
 face-0.088
adoes-0.085
 structures-0.082
ag-0.079
stals-0.077
Token!
Token ID0
Feature activation-2.89e-01

Pos logit contributions
rob+0.035
 Tooth+0.034
Ba+0.033
 least+0.032
 certainty+0.032
Neg logit contributions
 in-0.024
 face-0.024
adoes-0.024
asuring-0.024
 structures-0.023
Token Tim
Token ID5045
Feature activation-1.00e-01

Pos logit contributions
adoes+0.008
ag+0.008
rac+0.008
 face+0.008
ags+0.008
Neg logit contributions
 least-0.008
 Gray-0.007
 wishing-0.007
 reality-0.007
 Tooth-0.007
Tokenmy
Token ID1820
Feature activation-5.14e-01

Pos logit contributions
 face+0.002
adoes+0.002
arre+0.002
rac+0.002
asuring+0.002
Neg logit contributions
 reality-0.003
 heaven-0.003
 outer-0.003
 possible-0.003
 certainty-0.003
Token was
Token ID373
Feature activation-2.51e-01

Pos logit contributions
 face+0.012
rac+0.009
adoes+0.009
asuring+0.009
 structures+0.008
Neg logit contributions
 reality-0.019
 heaven-0.018
 possible-0.018
rob-0.018
 Tooth-0.017
Token air
Token ID1633
Feature activation+4.36e-01

Pos logit contributions
adoes+0.029
 face+0.028
 structures+0.026
 squash+0.024
 Toys+0.023
Neg logit contributions
 least-0.043
ge-0.042
 reality-0.041
ited-0.040
rob-0.039
Token.
Token ID13
Feature activation-6.81e-01

Pos logit contributions
 least+0.012
 reality+0.012
 heaven+0.011
 wishing+0.011
 Gray+0.011
Neg logit contributions
adoes-0.012
 face-0.012
 structures-0.012
rac-0.011
ag-0.011
Token He
Token ID679
Feature activation-5.03e-01

Pos logit contributions
 face+0.017
adoes+0.016
 structures+0.015
rac+0.015
asuring+0.014
Neg logit contributions
 least-0.022
 reality-0.022
rob-0.020
 wishing-0.020
 heaven-0.020
Token watched
Token ID7342
Feature activation-9.97e-01

Pos logit contributions
 face+0.014
adoes+0.014
 structures+0.013
rac+0.013
ag+0.013
Neg logit contributions
 least-0.014
 reality-0.014
 heaven-0.013
 Gray-0.013
 Tooth-0.013
Token with
Token ID351
Feature activation-9.03e-02

Pos logit contributions
adoes+0.030
 structures+0.029
asuring+0.028
 face+0.028
rac+0.028
Neg logit contributions
 reality-0.025
 least-0.024
 Gray-0.023
 heaven-0.023
 wishing-0.023
Token amaz
Token ID40642
Feature activation+2.87e+00

Pos logit contributions
 face+0.002
adoes+0.002
 structures+0.002
ag+0.002
ags+0.002
Neg logit contributions
rob-0.003
ge-0.003
 least-0.003
 Tooth-0.003
 reality-0.003
Tokenement
Token ID972
Feature activation+1.92e-01

Pos logit contributions
 least+0.083
 reality+0.081
 Gray+0.079
 heaven+0.079
 Tooth+0.078
Neg logit contributions
adoes-0.079
 face-0.079
 structures-0.074
ag-0.073
rac-0.072
Token as
Token ID355
Feature activation+3.86e-01

Pos logit contributions
 least+0.006
 reality+0.005
 Tooth+0.005
rob+0.005
 heaven+0.005
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.005
asuring-0.005
ag-0.005
Token the
Token ID262
Feature activation-4.93e-01

Pos logit contributions
rob+0.012
 least+0.012
ited+0.011
tail+0.011
ge+0.011
Neg logit contributions
 face-0.010
adoes-0.009
 structures-0.009
 in-0.008
 towers-0.008
Token z
Token ID1976
Feature activation+1.49e-01

Pos logit contributions
adoes+0.013
rac+0.012
uting+0.012
 face+0.012
 structures+0.011
Neg logit contributions
 reality-0.015
 least-0.015
 wishing-0.015
 Tooth-0.014
 Gray-0.014
Tokenig
Token ID328
Feature activation-3.56e-01

Pos logit contributions
 reality+0.004
 least+0.004
 heaven+0.004
 wishing+0.004
 Gray+0.004
Neg logit contributions
 face-0.004
adoes-0.004
rac-0.004
ag-0.004
 structures-0.004
Tokenly
Token ID306
Feature activation+8.12e-01

Pos logit contributions
 least+0.070
 reality+0.067
 heaven+0.066
 North+0.066
 Tooth+0.066
Neg logit contributions
adoes-0.037
 face-0.035
ag-0.032
stals-0.030
 structures-0.029
Token.
Token ID13
Feature activation+1.30e-01

Pos logit contributions
 least+0.022
 reality+0.021
rob+0.020
 Gray+0.020
 Tooth+0.020
Neg logit contributions
 face-0.025
adoes-0.024
 structures-0.023
asuring-0.021
ag-0.021
Token Jill
Token ID22772
Feature activation+2.35e-01

Pos logit contributions
 least+0.004
 reality+0.004
 heaven+0.004
rob+0.004
 Gray+0.004
Neg logit contributions
 face-0.003
adoes-0.003
 structures-0.003
rac-0.003
ag-0.003
Token nodded
Token ID14464
Feature activation+6.29e-01

Pos logit contributions
 least+0.006
 reality+0.006
 heaven+0.005
 wishing+0.005
 Gray+0.005
Neg logit contributions
adoes-0.007
 face-0.007
 structures-0.007
ag-0.007
rac-0.007
Token in
Token ID287
Feature activation-1.93e-01

Pos logit contributions
 least+0.018
rob+0.018
 reality+0.018
 Gray+0.017
 heaven+0.017
Neg logit contributions
 face-0.019
adoes-0.017
rac-0.016
ag-0.016
asuring-0.016
Token amaz
Token ID40642
Feature activation+2.87e+00

Pos logit contributions
 face+0.005
adoes+0.005
 structures+0.005
ag+0.005
rac+0.005
Neg logit contributions
 least-0.006
 reality-0.005
 heaven-0.005
 Gray-0.005
 wishing-0.005
Tokenement
Token ID972
Feature activation+6.16e-01

Pos logit contributions
 least+0.086
 Gray+0.084
 reality+0.084
 heaven+0.082
 Tooth+0.081
Neg logit contributions
 face-0.077
adoes-0.076
 structures-0.071
ag-0.070
rac-0.069
Token.
Token ID13
Feature activation+4.88e-02

Pos logit contributions
 least+0.017
rob+0.017
 reality+0.017
 Tooth+0.016
 Gray+0.016
Neg logit contributions
 face-0.019
adoes-0.018
 structures-0.017
asuring-0.016
ag-0.016
Token
Token ID220
Feature activation+5.61e-01

Pos logit contributions
 reality+0.001
 least+0.001
rob+0.001
 heaven+0.001
ge+0.001
Neg logit contributions
 face-0.001
adoes-0.001
 structures-0.001
rac-0.001
uting-0.001
Token\n
Token ID198
Feature activation-1.20e-01

Pos logit contributions
 reality+0.018
 least+0.018
 heaven+0.017
 wishing+0.017
rob+0.017
Neg logit contributions
 face-0.014
adoes-0.013
 structures-0.013
ag-0.012
rac-0.012
Token\n
Token ID198
Feature activation+5.96e-02

Pos logit contributions
 face+0.003
adoes+0.003
 structures+0.003
rac+0.003
ag+0.003
Neg logit contributions
 least-0.004
 reality-0.004
 heaven-0.004
 wishing-0.003
 Tooth-0.003
Token\n
Token ID198
Feature activation+4.28e-01

Pos logit contributions
 least+0.008
 reality+0.008
 wishing+0.007
 heaven+0.007
 Tooth+0.007
Neg logit contributions
 face-0.007
adoes-0.007
 structures-0.007
ag-0.007
rac-0.007
TokenL
Token ID43
Feature activation+4.86e-01

Pos logit contributions
 reality+0.012
 least+0.012
 Gray+0.012
 heaven+0.012
 anytime+0.012
Neg logit contributions
adoes-0.012
 face-0.012
 structures-0.011
stals-0.010
ag-0.010
Tokenucky
Token ID5309
Feature activation-2.48e-01

Pos logit contributions
 least+0.019
 reality+0.019
 wishing+0.019
 Gray+0.019
 heaven+0.019
Neg logit contributions
 face-0.008
adoes-0.008
 structures-0.007
ag-0.007
asuring-0.006
Token watched
Token ID7342
Feature activation-5.30e-01

Pos logit contributions
adoes+0.007
 face+0.007
 structures+0.007
ag+0.007
rac+0.007
Neg logit contributions
 least-0.007
 reality-0.006
 Gray-0.006
 wishing-0.006
 heaven-0.006
Token in
Token ID287
Feature activation-5.97e-01

Pos logit contributions
adoes+0.017
 structures+0.017
rac+0.016
stals+0.016
 face+0.016
Neg logit contributions
 least-0.012
 reality-0.012
 wishing-0.012
 heaven-0.012
 knowing-0.012
Token amaz
Token ID40642
Feature activation+2.84e+00

Pos logit contributions
 face+0.018
adoes+0.016
 structures+0.016
ag+0.015
ap+0.015
Neg logit contributions
rob-0.017
 least-0.016
 reality-0.016
 Tooth-0.015
ge-0.015
Tokenement
Token ID972
Feature activation+8.22e-01

Pos logit contributions
 least+0.076
 reality+0.074
 heaven+0.071
 Gray+0.070
 wishing+0.070
Neg logit contributions
 face-0.084
adoes-0.083
 structures-0.079
ag-0.077
rac-0.076
Token as
Token ID355
Feature activation+7.44e-01

Pos logit contributions
 least+0.023
 reality+0.022
rob+0.021
 Tooth+0.021
 heaven+0.021
Neg logit contributions
 face-0.025
adoes-0.024
 structures-0.022
ag-0.022
asuring-0.021
Token the
Token ID262
Feature activation-7.68e-02

Pos logit contributions
rob+0.026
 least+0.026
ited+0.024
tail+0.023
 heaven+0.023
Neg logit contributions
 face-0.017
adoes-0.016
 structures-0.015
 in-0.015
 squash-0.015
Token frog
Token ID21264
Feature activation+5.08e-02

Pos logit contributions
adoes+0.002
rac+0.002
uting+0.002
stals+0.002
 structures+0.002
Neg logit contributions
 reality-0.002
 least-0.002
 wishing-0.002
 Tooth-0.002
 Gray-0.002
Token zoom
Token ID19792
Feature activation+6.83e-01

Pos logit contributions
 reality+0.002
 least+0.002
 Tooth+0.002
rob+0.001
 heaven+0.001
Neg logit contributions
 face-0.001
adoes-0.001
 structures-0.001
ag-0.001
asuring-0.001
Token,
Token ID11
Feature activation+6.41e-01

Pos logit contributions
rob+0.025
 reality+0.024
 least+0.023
 North+0.022
 certainty+0.022
Neg logit contributions
 face-0.022
ap-0.016
ag-0.016
 in-0.016
adoes-0.016
Token her
Token ID607
Feature activation+1.96e-01

Pos logit contributions
 least+0.018
 reality+0.018
 wishing+0.018
 heaven+0.018
 Gray+0.017
Neg logit contributions
adoes-0.017
 face-0.017
 structures-0.016
ag-0.016
rac-0.016
Token eyes
Token ID2951
Feature activation+4.19e-01

Pos logit contributions
 reality+0.007
 least+0.007
 wishing+0.006
 Gray+0.006
 Tooth+0.006
Neg logit contributions
adoes-0.004
 structures-0.004
 face-0.004
ag-0.004
rac-0.004
Token wide
Token ID3094
Feature activation+1.15e+00

Pos logit contributions
 least+0.013
rob+0.013
 reality+0.013
 Gray+0.012
 heaven+0.012
Neg logit contributions
 face-0.012
adoes-0.010
ag-0.010
uting-0.009
rac-0.009
Token with
Token ID351
Feature activation+5.00e-01

Pos logit contributions
 least+0.031
rob+0.030
 reality+0.029
 Gray+0.027
 heaven+0.027
Neg logit contributions
 face-0.037
adoes-0.035
ag-0.032
 structures-0.032
asuring-0.032
Token amaz
Token ID40642
Feature activation+2.83e+00

Pos logit contributions
rob+0.017
 least+0.016
ge+0.016
 reality+0.016
 anytime+0.015
Neg logit contributions
 face-0.014
adoes-0.012
 structures-0.011
ag-0.011
rac-0.011
Tokenement
Token ID972
Feature activation+3.17e-01

Pos logit contributions
 least+0.079
 reality+0.077
 Gray+0.074
 heaven+0.074
 Tooth+0.073
Neg logit contributions
 face-0.081
adoes-0.081
 structures-0.075
ag-0.074
rac-0.073
Token.
Token ID13
Feature activation+3.72e-01

Pos logit contributions
 least+0.008
 reality+0.007
rob+0.007
 Tooth+0.007
 Gray+0.007
Neg logit contributions
 face-0.011
adoes-0.010
 structures-0.010
asuring-0.010
ag-0.009
Token\n
Token ID198
Feature activation+1.92e-01

Pos logit contributions
 least+0.012
 reality+0.012
rob+0.011
 heaven+0.011
ge+0.011
Neg logit contributions
 face-0.009
adoes-0.009
 structures-0.008
rac-0.008
stals-0.008
Token\n
Token ID198
Feature activation+2.40e-01

Pos logit contributions
 least+0.006
 reality+0.006
 wishing+0.006
 heaven+0.006
rob+0.005
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.004
ag-0.004
rac-0.004
TokenHer
Token ID9360
Feature activation+4.97e-02

Pos logit contributions
 reality+0.006
 least+0.006
 anytime+0.006
 heaven+0.006
 someday+0.006
Neg logit contributions
adoes-0.007
 face-0.007
 structures-0.006
ag-0.006
ags-0.006
Token on
Token ID319
Feature activation-4.93e-01

Pos logit contributions
 least+0.004
 reality+0.004
rob+0.004
 heaven+0.004
ge+0.004
Neg logit contributions
 face-0.006
adoes-0.006
 structures-0.005
rac-0.005
ag-0.005
Token her
Token ID607
Feature activation+4.80e-01

Pos logit contributions
 face+0.014
adoes+0.014
 structures+0.013
rac+0.012
ag+0.012
Neg logit contributions
 least-0.014
rob-0.014
 reality-0.014
ge-0.013
 someday-0.013
Token mom
Token ID1995
Feature activation+4.89e-01

Pos logit contributions
 reality+0.015
 least+0.015
ge+0.014
 heaven+0.014
rob+0.014
Neg logit contributions
 face-0.013
adoes-0.012
 structures-0.011
rac-0.011
ag-0.011
Tokenmy
Token ID1820
Feature activation+2.46e-01

Pos logit contributions
 reality+0.019
rob+0.018
 least+0.018
 certainty+0.017
 heaven+0.017
Neg logit contributions
 face-0.010
ag-0.008
adoes-0.008
 in-0.008
rac-0.007
Token's
Token ID338
Feature activation+3.64e-01

Pos logit contributions
rob+0.011
 reality+0.011
 least+0.011
 North+0.010
 certainty+0.010
Neg logit contributions
 face-0.003
adoes-0.002
 in-0.002
ag-0.002
ap-0.002
Token brand
Token ID4508
Feature activation+2.83e+00

Pos logit contributions
rob+0.012
 least+0.012
 reality+0.012
ge+0.011
 Gray+0.011
Neg logit contributions
 face-0.010
adoes-0.008
 structures-0.008
rac-0.008
ag-0.008
Token new
Token ID649
Feature activation+1.53e+00

Pos logit contributions
 least+0.072
 reality+0.070
 heaven+0.067
 Gray+0.066
 wishing+0.066
Neg logit contributions
 face-0.089
adoes-0.087
 structures-0.083
ag-0.081
rac-0.080
Token table
Token ID3084
Feature activation-4.00e-01

Pos logit contributions
 least+0.047
 reality+0.046
 Gray+0.044
 heaven+0.044
 wishing+0.043
Neg logit contributions
 face-0.040
adoes-0.039
 structures-0.037
ag-0.035
rac-0.035
Token!
Token ID0
Feature activation-3.31e-01

Pos logit contributions
 face+0.010
adoes+0.009
 structures+0.008
 in+0.008
asuring+0.008
Neg logit contributions
rob-0.014
 least-0.013
 reality-0.013
Ba-0.012
 Gray-0.012
Token\n
Token ID198
Feature activation+2.11e-02

Pos logit contributions
 face+0.010
adoes+0.010
 structures+0.010
rac+0.009
ag+0.009
Neg logit contributions
 least-0.008
 reality-0.008
 heaven-0.008
 Gray-0.008
 wishing-0.008
Token\n
Token ID198
Feature activation+1.37e-01

Pos logit contributions
 least+0.001
 reality+0.001
 heaven+0.001
 wishing+0.001
 Tooth+0.001
Neg logit contributions
 face-0.001
adoes-0.000
ag-0.000
 structures-0.000
rac-0.000
Token wardrobe
Token ID35593
Feature activation+3.40e-01

Pos logit contributions
 face+0.008
adoes+0.007
 structures+0.007
stals+0.007
ag+0.006
Neg logit contributions
 least-0.009
 reality-0.009
rob-0.009
ge-0.009
 heaven-0.009
Token changed
Token ID3421
Feature activation+7.19e-01

Pos logit contributions
 least+0.010
 reality+0.010
 heaven+0.010
 Tooth+0.009
 Gray+0.009
Neg logit contributions
 face-0.009
adoes-0.009
 structures-0.008
rac-0.008
ag-0.008
Token from
Token ID422
Feature activation-4.10e-01

Pos logit contributions
 least+0.024
rob+0.023
 reality+0.023
 Tooth+0.023
 Gray+0.022
Neg logit contributions
 face-0.017
adoes-0.017
 structures-0.016
ag-0.014
stals-0.014
Token ancient
Token ID6156
Feature activation+8.56e-01

Pos logit contributions
 face+0.008
adoes+0.008
 structures+0.008
 in+0.007
 pitch+0.007
Neg logit contributions
rob-0.016
 Tooth-0.014
 therapist-0.014
 least-0.014
och-0.014
Token to
Token ID284
Feature activation+8.76e-01

Pos logit contributions
 least+0.032
rob+0.031
uka+0.029
 Tooth+0.029
 Gray+0.028
Neg logit contributions
 face-0.020
adoes-0.017
 structures-0.017
 in-0.014
 faces-0.014
Token brand
Token ID4508
Feature activation+2.83e+00

Pos logit contributions
rob+0.035
 therapist+0.031
 Gray+0.030
Charl+0.030
ge+0.030
Neg logit contributions
 face-0.021
 structures-0.017
adoes-0.016
ap-0.015
ag-0.015
Token new
Token ID649
Feature activation+1.61e+00

Pos logit contributions
 reality+0.062
 least+0.062
 heaven+0.057
 Gray+0.056
 wishing+0.056
Neg logit contributions
adoes-0.095
 face-0.094
 structures-0.089
rac-0.088
asuring-0.088
Token!
Token ID0
Feature activation-5.19e-01

Pos logit contributions
rob+0.064
 least+0.061
 Tooth+0.060
uka+0.059
och+0.059
Neg logit contributions
 face-0.034
adoes-0.030
 structures-0.030
 faces-0.026
 eyes-0.025
Token They
Token ID1119
Feature activation+3.87e-01

Pos logit contributions
adoes+0.016
 face+0.015
ag+0.015
asuring+0.014
ags+0.014
Neg logit contributions
 least-0.014
 heaven-0.013
 Gray-0.013
 anytime-0.013
 reality-0.012
Token looked
Token ID3114
Feature activation-2.91e-01

Pos logit contributions
 least+0.011
 reality+0.011
 heaven+0.011
rob+0.011
 Tooth+0.011
Neg logit contributions
 face-0.010
adoes-0.010
 structures-0.009
rac-0.009
asuring-0.009
Token around
Token ID1088
Feature activation-6.55e-01

Pos logit contributions
adoes+0.008
rac+0.007
anish+0.007
 face+0.007
 structures+0.007
Neg logit contributions
 reality-0.009
 least-0.009
 wishing-0.009
 knowing-0.009
 anytime-0.008
Token\n
Token ID198
Feature activation+2.27e-01

Pos logit contributions
adoes+0.011
 face+0.011
 structures+0.011
ag+0.011
ags+0.010
Neg logit contributions
 least-0.010
 reality-0.010
 Gray-0.009
 heaven-0.009
 wishing-0.009
Token\n
Token ID198
Feature activation+2.49e-01

Pos logit contributions
 reality+0.005
 least+0.005
 Gray+0.005
 Tooth+0.005
 heaven+0.005
Neg logit contributions
adoes-0.007
 face-0.007
 structures-0.007
asuring-0.007
rac-0.007
TokenSam
Token ID16305
Feature activation+1.99e-01

Pos logit contributions
 reality+0.007
 least+0.007
 Gray+0.007
 Tooth+0.007
 heaven+0.007
Neg logit contributions
adoes-0.007
 face-0.007
 structures-0.007
ag-0.006
ags-0.006
Token gasped
Token ID45236
Feature activation+8.83e-01

Pos logit contributions
 reality+0.006
 least+0.006
 heaven+0.006
 Tooth+0.006
 wishing+0.006
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.005
rac-0.005
ag-0.005
Token in
Token ID287
Feature activation+5.09e-02

Pos logit contributions
 least+0.026
 reality+0.025
 wishing+0.024
 heaven+0.024
 Gray+0.024
Neg logit contributions
adoes-0.024
 face-0.023
 structures-0.022
ag-0.022
rac-0.022
Token amaz
Token ID40642
Feature activation+2.82e+00

Pos logit contributions
 least+0.001
 reality+0.001
 heaven+0.001
 Gray+0.001
 wishing+0.001
Neg logit contributions
adoes-0.002
 face-0.002
 structures-0.001
ag-0.001
rac-0.001
Tokenement
Token ID972
Feature activation+5.76e-01

Pos logit contributions
 least+0.076
 reality+0.075
 heaven+0.071
 Gray+0.071
 wishing+0.071
Neg logit contributions
 face-0.082
adoes-0.082
 structures-0.078
ag-0.076
rac-0.075
Token.
Token ID13
Feature activation+1.44e-01

Pos logit contributions
 least+0.014
 reality+0.014
 Tooth+0.013
rob+0.013
 heaven+0.013
Neg logit contributions
 face-0.019
adoes-0.019
 structures-0.018
ag-0.017
asuring-0.017
Token He
Token ID679
Feature activation+2.09e-01

Pos logit contributions
 least+0.005
 reality+0.005
 heaven+0.004
 Gray+0.004
 wishing+0.004
Neg logit contributions
 face-0.003
adoes-0.003
 structures-0.003
rac-0.003
ag-0.003
Token had
Token ID550
Feature activation-2.17e-01

Pos logit contributions
 least+0.006
 reality+0.006
 heaven+0.005
 Tooth+0.005
 Gray+0.005
Neg logit contributions
 face-0.006
adoes-0.006
 structures-0.006
rac-0.005
ag-0.005
Token never
Token ID1239
Feature activation-6.62e-01

Pos logit contributions
 face+0.006
adoes+0.005
 structures+0.005
rac+0.005
asuring+0.005
Neg logit contributions
 least-0.006
 reality-0.006
rob-0.006
 Gray-0.006
 heaven-0.006
Token things
Token ID1243
Feature activation-9.50e-02

Pos logit contributions
 reality+0.003
 least+0.003
 wishing+0.003
 anytime+0.003
rob+0.003
Neg logit contributions
rac-0.002
 face-0.002
adoes-0.002
asuring-0.002
ag-0.002
Token.
Token ID13
Feature activation-2.55e-01

Pos logit contributions
 face+0.003
adoes+0.003
 structures+0.003
asuring+0.003
ag+0.003
Neg logit contributions
 reality-0.003
 Tooth-0.003
 least-0.003
 Gray-0.002
 heaven-0.002
Token\n
Token ID198
Feature activation-2.19e-01

Pos logit contributions
 face+0.006
adoes+0.006
 structures+0.005
rac+0.005
stals+0.005
Neg logit contributions
 least-0.008
 reality-0.008
rob-0.008
 heaven-0.008
 Tooth-0.008
Token\n
Token ID198
Feature activation-8.34e-02

Pos logit contributions
 face+0.005
adoes+0.005
rac+0.005
ag+0.005
uting+0.005
Neg logit contributions
 least-0.007
 reality-0.007
 heaven-0.007
rob-0.007
 Tooth-0.007
TokenGr
Token ID8642
Feature activation-1.37e+00

Pos logit contributions
 face+0.002
adoes+0.002
 structures+0.002
rac+0.002
ag+0.002
Neg logit contributions
 least-0.002
 reality-0.002
 heaven-0.002
 Gray-0.002
 wishing-0.002
Tokenac
Token ID330
Feature activation-3.75e+00

Pos logit contributions
 face+0.025
adoes+0.024
ag+0.022
 structures+0.021
ap+0.021
Neg logit contributions
 least-0.051
 wishing-0.051
rob-0.051
 reality-0.051
 Gray-0.051
Tokenie
Token ID494
Feature activation-6.53e-01

Pos logit contributions
adoes+0.138
 face+0.137
 structures+0.132
stals+0.129
asuring+0.127
Neg logit contributions
 least-0.073
 reality-0.069
 Tooth-0.063
 heaven-0.061
 North-0.060
Token saw
Token ID2497
Feature activation-1.08e+00

Pos logit contributions
adoes+0.021
 face+0.020
 structures+0.020
ags+0.019
ag+0.019
Neg logit contributions
 least-0.016
 reality-0.015
 wishing-0.015
 Gray-0.015
 anytime-0.014
Token bright
Token ID6016
Feature activation-5.47e-02

Pos logit contributions
 face+0.028
adoes+0.027
 structures+0.026
ag+0.024
rac+0.024
Neg logit contributions
 least-0.034
ge-0.033
 reality-0.032
 heaven-0.031
 Tooth-0.031
Token yellow
Token ID7872
Feature activation-2.87e-01

Pos logit contributions
rac+0.002
anish+0.001
asuring+0.001
agged+0.001
issors+0.001
Neg logit contributions
 reality-0.001
 North-0.001
 wishing-0.001
 Gray-0.001
 anytime-0.001
Token flowers
Token ID12734
Feature activation+1.59e-01

Pos logit contributions
rac+0.007
adoes+0.007
ags+0.007
asuring+0.007
agged+0.007
Neg logit contributions
 wishing-0.008
 reality-0.008
 least-0.008
 North-0.008
 someday-0.008
Token the
Token ID262
Feature activation+7.28e-01

Pos logit contributions
 least+0.000
 reality+0.000
 Gray+0.000
 heaven+0.000
 Tooth+0.000
Neg logit contributions
 face-0.000
adoes-0.000
 structures-0.000
rac-0.000
ag-0.000
Token mountain
Token ID8598
Feature activation-3.35e-01

Pos logit contributions
 least+0.022
 reality+0.022
 heaven+0.021
 Gray+0.021
 wishing+0.021
Neg logit contributions
 face-0.018
adoes-0.018
 structures-0.017
ag-0.016
rac-0.016
Token lion
Token ID18744
Feature activation-5.04e-01

Pos logit contributions
 face+0.009
adoes+0.008
 structures+0.008
asuring+0.008
ap+0.008
Neg logit contributions
 reality-0.010
 Tooth-0.010
 least-0.010
 Gray-0.010
rob-0.010
Token.
Token ID13
Feature activation-2.51e-01

Pos logit contributions
 face+0.014
adoes+0.013
 structures+0.013
asuring+0.012
ag+0.012
Neg logit contributions
 reality-0.015
 least-0.015
 Tooth-0.014
rob-0.014
 certainty-0.014
Token Mir
Token ID7381
Feature activation-6.43e-01

Pos logit contributions
 face+0.006
adoes+0.006
 structures+0.005
rac+0.005
ag+0.005
Neg logit contributions
 least-0.008
 reality-0.008
 heaven-0.008
 wishing-0.008
 Tooth-0.008
Tokenac
Token ID330
Feature activation-3.70e+00

Pos logit contributions
 face+0.018
ag+0.017
rac+0.017
asuring+0.016
ap+0.016
Neg logit contributions
 reality-0.018
 Gray-0.018
 least-0.018
 wishing-0.017
 Tooth-0.017
Tokenulously
Token ID18117
Feature activation-3.94e-01

Pos logit contributions
adoes+0.103
 face+0.102
 structures+0.096
stals+0.094
uting+0.092
Neg logit contributions
 least-0.105
 reality-0.100
 Tooth-0.095
rob-0.094
 Gray-0.094
Token,
Token ID11
Feature activation-1.26e-01

Pos logit contributions
 face+0.010
adoes+0.009
 structures+0.008
ag+0.008
rac+0.008
Neg logit contributions
 least-0.013
 reality-0.013
rob-0.012
 Tooth-0.012
 Gray-0.012
Token the
Token ID262
Feature activation-2.95e-01

Pos logit contributions
 face+0.003
adoes+0.003
 structures+0.003
rac+0.003
asuring+0.003
Neg logit contributions
 least-0.004
 reality-0.003
rob-0.003
 certainty-0.003
 heaven-0.003
Token lion
Token ID18744
Feature activation+2.34e-01

Pos logit contributions
adoes+0.008
 face+0.008
 structures+0.007
ag+0.007
rac+0.007
Neg logit contributions
 least-0.009
 reality-0.009
 heaven-0.008
 Gray-0.008
 wishing-0.008
Token backed
Token ID9763
Feature activation+2.19e-01

Pos logit contributions
 least+0.006
 reality+0.006
 heaven+0.006
 Gray+0.006
 wishing+0.006
Neg logit contributions
adoes-0.007
 face-0.007
 structures-0.006
ag-0.006
rac-0.006
Token said
Token ID531
Feature activation-2.03e-01

Pos logit contributions
 face+0.007
adoes+0.006
rac+0.006
asuring+0.006
 structures+0.006
Neg logit contributions
 reality-0.010
 least-0.010
 heaven-0.009
rob-0.009
 possible-0.009
Token,
Token ID11
Feature activation-3.09e-01

Pos logit contributions
adoes+0.006
 structures+0.006
 face+0.006
ag+0.006
uting+0.006
Neg logit contributions
 Gray-0.005
 reality-0.005
 least-0.005
 wishing-0.005
 heaven-0.005
Token "
Token ID366
Feature activation-6.55e-01

Pos logit contributions
adoes+0.008
 structures+0.007
 face+0.007
ag+0.007
rac+0.007
Neg logit contributions
 least-0.010
 wishing-0.010
 Gray-0.010
 reality-0.009
 Tooth-0.009
TokenA
Token ID32
Feature activation-1.09e-02

Pos logit contributions
adoes+0.015
 face+0.015
rac+0.014
 structures+0.014
asuring+0.014
Neg logit contributions
 reality-0.022
 least-0.021
 wishing-0.020
 heaven-0.020
 dreaming-0.020
Tokenbr
Token ID1671
Feature activation-7.04e-01

Pos logit contributions
 face+0.000
adoes+0.000
 structures+0.000
stals+0.000
ag+0.000
Neg logit contributions
 least-0.000
 reality-0.000
 heaven-0.000
rob-0.000
ge-0.000
Tokenac
Token ID330
Feature activation-3.68e+00

Pos logit contributions
adoes+0.012
 face+0.011
 structures+0.011
stals+0.009
 squash+0.009
Neg logit contributions
 least-0.028
 reality-0.028
ge-0.027
 someday-0.026
 heaven-0.026
Tokenad
Token ID324
Feature activation-7.93e-01

Pos logit contributions
uting+0.128
 structures+0.126
 face+0.125
adoes+0.124
stals+0.123
Neg logit contributions
 least-0.072
rob-0.066
 Tooth-0.061
 heaven-0.061
 certainty-0.059
Tokenab
Token ID397
Feature activation-1.46e+00

Pos logit contributions
 face+0.022
adoes+0.022
ag+0.021
 structures+0.021
rac+0.020
Neg logit contributions
 least-0.022
 reality-0.022
 wishing-0.020
 Tooth-0.020
 Gray-0.020
Tokenra
Token ID430
Feature activation-9.43e-01

Pos logit contributions
 structures+0.047
adoes+0.045
arre+0.044
ag+0.043
 face+0.043
Neg logit contributions
 least-0.034
 reality-0.032
 anytime-0.031
 Gray-0.031
 someday-0.030
Token!"
Token ID2474
Feature activation-1.77e-01

Pos logit contributions
 face+0.023
adoes+0.022
rac+0.022
asuring+0.021
ag+0.021
Neg logit contributions
 reality-0.031
 least-0.030
 wishing-0.028
 heaven-0.028
 Tooth-0.028
Token But
Token ID887
Feature activation+1.10e+00

Pos logit contributions
 face+0.004
rac+0.003
ags+0.003
adoes+0.003
asuring+0.003
Neg logit contributions
 reality-0.007
 least-0.007
rob-0.007
ge-0.007
 North-0.006
Token air
Token ID1633
Feature activation-4.94e-01

Pos logit contributions
 least+0.023
 reality+0.023
 heaven+0.022
 wishing+0.021
 Tooth+0.021
Neg logit contributions
adoes-0.016
 face-0.015
 structures-0.014
rac-0.014
ag-0.014
Token.
Token ID13
Feature activation-4.62e-01

Pos logit contributions
 face+0.016
adoes+0.015
 structures+0.014
ap+0.014
ag+0.014
Neg logit contributions
 least-0.012
rob-0.012
 reality-0.012
 Gray-0.012
 Tooth-0.012
Token\n
Token ID198
Feature activation-2.36e-01

Pos logit contributions
adoes+0.011
 face+0.011
 structures+0.010
ag+0.010
rac+0.010
Neg logit contributions
 least-0.015
 reality-0.015
 Gray-0.014
 heaven-0.014
 wishing-0.014
Token\n
Token ID198
Feature activation-1.04e-01

Pos logit contributions
 face+0.006
adoes+0.006
 structures+0.006
rac+0.005
ag+0.005
Neg logit contributions
 least-0.007
 reality-0.007
 heaven-0.007
 Tooth-0.007
 wishing-0.007
TokenGr
Token ID8642
Feature activation-1.40e+00

Pos logit contributions
adoes+0.003
 face+0.003
 structures+0.003
ags+0.003
ag+0.003
Neg logit contributions
 reality-0.003
 least-0.003
 Gray-0.003
 wishing-0.003
 heaven-0.003
Tokenac
Token ID330
Feature activation-3.68e+00

Pos logit contributions
 face+0.028
adoes+0.027
ag+0.025
ap+0.025
rac+0.024
Neg logit contributions
rob-0.050
 wishing-0.050
 least-0.049
 Gray-0.049
 reality-0.048
Tokenie
Token ID494
Feature activation-7.84e-01

Pos logit contributions
adoes+0.128
 face+0.127
 structures+0.122
stals+0.118
asuring+0.118
Neg logit contributions
 least-0.079
 reality-0.076
 heaven-0.070
 Tooth-0.070
 Gray-0.069
Token was
Token ID373
Feature activation-3.15e-01

Pos logit contributions
 face+0.022
adoes+0.022
 structures+0.020
ag+0.020
rac+0.020
Neg logit contributions
 least-0.022
 reality-0.022
 heaven-0.021
 Gray-0.021
 wishing-0.020
Token walking
Token ID6155
Feature activation-5.13e-01

Pos logit contributions
 face+0.009
adoes+0.009
 structures+0.008
ag+0.008
rac+0.008
Neg logit contributions
 least-0.009
 reality-0.009
 heaven-0.008
 Gray-0.008
 Tooth-0.008
Token with
Token ID351
Feature activation+5.44e-01

Pos logit contributions
adoes+0.012
 face+0.012
 structures+0.011
stals+0.011
rac+0.011
Neg logit contributions
 least-0.017
 reality-0.016
 wishing-0.016
 anytime-0.016
 someday-0.016
Token her
Token ID607
Feature activation+1.62e-01

Pos logit contributions
rob+0.017
 least+0.016
ge+0.016
 reality+0.015
och+0.015
Neg logit contributions
 face-0.016
adoes-0.015
 structures-0.014
ag-0.013
ags-0.013
Token the
Token ID262
Feature activation-8.49e-01

Pos logit contributions
 face+0.043
adoes+0.043
 structures+0.041
rac+0.038
ag+0.038
Neg logit contributions
 least-0.050
 reality-0.050
 Tooth-0.047
 wishing-0.047
 heaven-0.047
Token mess
Token ID2085
Feature activation-4.62e-01

Pos logit contributions
rac+0.030
adoes+0.028
ag+0.028
agged+0.027
ap+0.027
Neg logit contributions
 wishing-0.020
 least-0.019
 Tooth-0.019
 reality-0.018
 dreaming-0.018
Token and
Token ID290
Feature activation-1.35e+00

Pos logit contributions
adoes+0.012
 face+0.012
 structures+0.011
rac+0.011
ag+0.011
Neg logit contributions
 least-0.014
 reality-0.013
 heaven-0.013
 Gray-0.013
 wishing-0.013
Token the
Token ID262
Feature activation-5.75e-01

Pos logit contributions
adoes+0.038
 face+0.038
 structures+0.035
ag+0.035
rac+0.034
Neg logit contributions
 least-0.038
 reality-0.037
 heaven-0.035
 Gray-0.035
 wishing-0.035
Token qu
Token ID627
Feature activation-2.85e+00

Pos logit contributions
rac+0.018
adoes+0.018
agged+0.017
ag+0.017
uting+0.017
Neg logit contributions
 wishing-0.014
 least-0.014
 reality-0.014
 Tooth-0.014
 dreaming-0.014
Tokenarre
Token ID9624
Feature activation-3.67e+00

Pos logit contributions
adoes+0.081
rac+0.078
asuring+0.077
 face+0.072
 squash+0.068
Neg logit contributions
 least-0.067
 heaven-0.064
ge-0.063
rob-0.063
handed-0.061
Tokenling
Token ID1359
Feature activation-1.77e+00

Pos logit contributions
adoes+0.098
rac+0.098
asuring+0.094
agged+0.094
 face+0.091
Neg logit contributions
ge-0.106
rob-0.095
 least-0.092
 reality-0.092
 knowing-0.088
Token twins
Token ID20345
Feature activation+1.90e-01

Pos logit contributions
rac+0.053
asuring+0.049
agged+0.048
ags+0.047
ag+0.046
Neg logit contributions
 reality-0.046
 heaven-0.046
 she-0.043
 therapist-0.043
 wishing-0.042
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
 least+0.004
 reality+0.004
 heaven+0.003
 wishing+0.003
 Gray+0.003
Neg logit contributions
adoes-0.007
 face-0.007
 structures-0.007
rac-0.006
ag-0.006
Token She
Token ID1375
Feature activation-3.05e-01

Pos logit contributions
 Gray+0.003
 heaven+0.002
 wishing+0.002
 least+0.002
 Tooth+0.002
Neg logit contributions
adoes-0.006
ag-0.006
 face-0.006
 structures-0.006
asuring-0.006
Token is
Token ID318
Feature activation-7.15e-01

Pos logit contributions
adoes+0.010
 face+0.010
ag+0.010
uting+0.009
rac+0.009
Neg logit contributions
 least-0.007
 wishing-0.007
 Gray-0.007
 reality-0.006
 certainty-0.006
Token air
Token ID1633
Feature activation-6.93e-01

Pos logit contributions
 least+0.010
 reality+0.009
 heaven+0.009
 wishing+0.009
 Gray+0.009
Neg logit contributions
adoes-0.010
asuring-0.010
rac-0.010
 structures-0.009
ag-0.009
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
 face+0.024
adoes+0.022
 structures+0.021
ag+0.021
ap+0.020
Neg logit contributions
 least-0.017
 reality-0.017
rob-0.016
 Gray-0.016
 Tooth-0.016
Token "
Token ID366
Feature activation-4.74e-01

Pos logit contributions
adoes+0.006
 face+0.005
ag+0.005
 structures+0.005
rac+0.005
Neg logit contributions
 Gray-0.006
 anytime-0.006
 least-0.006
 reality-0.005
 heaven-0.005
TokenA
Token ID32
Feature activation+2.11e-01

Pos logit contributions
 face+0.011
adoes+0.011
 structures+0.010
rac+0.010
ag+0.010
Neg logit contributions
 least-0.015
 reality-0.015
 heaven-0.014
 Gray-0.014
 wishing-0.014
Tokenbr
Token ID1671
Feature activation-9.30e-01

Pos logit contributions
 least+0.007
 reality+0.006
ge+0.006
 heaven+0.006
 certainty+0.006
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.005
stals-0.004
ag-0.004
Tokenac
Token ID330
Feature activation-3.64e+00

Pos logit contributions
adoes+0.019
 face+0.018
 structures+0.017
 squash+0.015
stals+0.015
Neg logit contributions
 least-0.035
 reality-0.034
ge-0.032
 someday-0.032
 anytime-0.031
Tokenad
Token ID324
Feature activation-8.86e-01

Pos logit contributions
 face+0.122
adoes+0.122
uting+0.122
 structures+0.118
stals+0.118
Neg logit contributions
 least-0.083
rob-0.074
 reality-0.069
 heaven-0.068
 Tooth-0.066
Tokenab
Token ID397
Feature activation-1.59e+00

Pos logit contributions
 face+0.025
adoes+0.025
 structures+0.024
ag+0.024
rac+0.023
Neg logit contributions
 reality-0.024
 least-0.024
 wishing-0.023
 Tooth-0.022
 Gray-0.022
Tokenra
Token ID430
Feature activation-9.39e-01

Pos logit contributions
 structures+0.048
adoes+0.047
 face+0.045
arre+0.045
ag+0.044
Neg logit contributions
 least-0.040
 reality-0.038
 anytime-0.036
 Gray-0.036
 someday-0.036
Token!"
Token ID2474
Feature activation-1.94e-01

Pos logit contributions
 face+0.023
adoes+0.021
rac+0.021
ag+0.021
asuring+0.021
Neg logit contributions
 reality-0.031
 least-0.030
 wishing-0.029
 heaven-0.028
 Tooth-0.028
Token\n
Token ID198
Feature activation-2.10e-01

Pos logit contributions
adoes+0.005
 face+0.005
 structures+0.005
ag+0.005
rac+0.005
Neg logit contributions
 least-0.005
 reality-0.005
 heaven-0.005
 Gray-0.005
 wishing-0.005
Token said
Token ID531
Feature activation+1.51e-01

Pos logit contributions
 face+0.005
adoes+0.004
 structures+0.004
rac+0.004
stals+0.004
Neg logit contributions
 reality-0.006
 least-0.006
 heaven-0.005
 North-0.005
rob-0.005
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
 she+0.003
 wishing+0.003
 knowing+0.003
 reality+0.003
 Gray+0.003
Neg logit contributions
adoes-0.005
ags-0.005
 structures-0.005
ag-0.005
apers-0.005
Token "
Token ID366
Feature activation-2.76e-01

Pos logit contributions
adoes+0.003
 face+0.003
 structures+0.003
ag+0.003
rac+0.002
Neg logit contributions
 least-0.004
 wishing-0.004
 reality-0.004
 Gray-0.004
 knowing-0.004
TokenA
Token ID32
Feature activation+1.62e-01

Pos logit contributions
adoes+0.007
 face+0.007
rac+0.006
 structures+0.006
uting+0.006
Neg logit contributions
 reality-0.009
 least-0.009
 heaven-0.008
 Gray-0.008
 wishing-0.008
Tokenbr
Token ID1671
Feature activation-7.15e-01

Pos logit contributions
 least+0.005
rob+0.005
 heaven+0.005
ge+0.005
 reality+0.005
Neg logit contributions
 face-0.004
adoes-0.004
ag-0.003
stals-0.003
 structures-0.003
Tokenac
Token ID330
Feature activation-3.61e+00

Pos logit contributions
adoes+0.012
 face+0.011
 structures+0.011
stals+0.009
uting+0.009
Neg logit contributions
 least-0.029
 reality-0.029
ge-0.027
 heaven-0.027
 someday-0.027
Tokenad
Token ID324
Feature activation-6.87e-01

Pos logit contributions
 face+0.115
adoes+0.115
uting+0.115
 structures+0.114
stals+0.110
Neg logit contributions
 least-0.083
 reality-0.075
 Tooth-0.074
 certainty-0.072
rob-0.071
Tokenab
Token ID397
Feature activation-1.32e+00

Pos logit contributions
 face+0.019
adoes+0.019
ag+0.018
 structures+0.018
asuring+0.018
Neg logit contributions
 reality-0.019
 least-0.019
 wishing-0.018
 Tooth-0.018
 Gray-0.018
Tokenra
Token ID430
Feature activation-8.83e-01

Pos logit contributions
 structures+0.041
adoes+0.039
 face+0.038
arre+0.037
ag+0.037
Neg logit contributions
 least-0.033
 reality-0.032
 anytime-0.030
 someday-0.030
 heaven-0.029
Token!
Token ID0
Feature activation-5.49e-01

Pos logit contributions
 face+0.023
adoes+0.021
rac+0.021
asuring+0.020
ap+0.020
Neg logit contributions
 reality-0.028
 least-0.027
 heaven-0.026
 wishing-0.026
 Tooth-0.025
Token I
Token ID314
Feature activation+6.12e-01

Pos logit contributions
adoes+0.014
 face+0.014
 structures+0.013
ag+0.013
ags+0.012
Neg logit contributions
 least-0.017
 reality-0.017
 Gray-0.016
 Tooth-0.016
 heaven-0.016
Token he
Token ID339
Feature activation-4.22e-02

Pos logit contributions
 face+0.012
rac+0.012
adoes+0.012
stals+0.011
 structures+0.011
Neg logit contributions
 heaven-0.009
 Tooth-0.009
 least-0.009
 reality-0.009
rob-0.009
Token found
Token ID1043
Feature activation-9.12e-01

Pos logit contributions
 face+0.001
adoes+0.001
rac+0.001
uting+0.001
 structures+0.001
Neg logit contributions
 reality-0.001
 least-0.001
rob-0.001
 heaven-0.001
 Tooth-0.001
Token a
Token ID257
Feature activation-4.35e-01

Pos logit contributions
 face+0.018
adoes+0.015
 structures+0.014
 in+0.014
ull+0.014
Neg logit contributions
ge-0.036
 reality-0.034
ited-0.034
 heaven-0.034
rob-0.034
Token rabbit
Token ID22746
Feature activation-8.89e-01

Pos logit contributions
 face+0.011
adoes+0.010
 structures+0.009
ag+0.009
rac+0.009
Neg logit contributions
 least-0.014
 reality-0.014
 heaven-0.013
rob-0.013
 Gray-0.013
Token qu
Token ID627
Feature activation-2.56e+00

Pos logit contributions
 face+0.023
adoes+0.023
 structures+0.021
ag+0.021
asuring+0.020
Neg logit contributions
 least-0.027
 reality-0.027
 heaven-0.025
 Gray-0.025
 Tooth-0.025
Tokenarre
Token ID9624
Feature activation-3.57e+00

Pos logit contributions
adoes+0.078
 structures+0.070
asuring+0.069
 face+0.069
rac+0.068
Neg logit contributions
ited-0.068
 least-0.066
 heaven-0.061
 reality-0.060
alia-0.060
Tokenlling
Token ID2680
Feature activation-1.72e+00

Pos logit contributions
adoes+0.094
 face+0.092
rac+0.092
asuring+0.087
 structures+0.087
Neg logit contributions
ge-0.107
rob-0.100
 reality-0.099
 least-0.098
 knowing-0.097
Token with
Token ID351
Feature activation+2.21e-01

Pos logit contributions
adoes+0.034
 face+0.033
 structures+0.031
rac+0.031
stals+0.030
Neg logit contributions
 least-0.061
 reality-0.061
 wishing-0.059
 heaven-0.059
 Gray-0.058
Token a
Token ID257
Feature activation+2.96e-01

Pos logit contributions
 least+0.008
rob+0.008
ge+0.008
 someday+0.007
 reality+0.007
Neg logit contributions
 face-0.006
adoes-0.005
 structures-0.005
 squash-0.004
 jokes-0.004
Token squirrel
Token ID33039
Feature activation-4.64e-01

Pos logit contributions
 reality+0.009
 least+0.009
 wishing+0.009
 Gray+0.009
 heaven+0.008
Neg logit contributions
adoes-0.008
 face-0.007
 structures-0.007
rac-0.007
ag-0.007
Token.
Token ID13
Feature activation-1.49e-01

Pos logit contributions
 face+0.014
adoes+0.013
ag+0.013
 structures+0.013
asuring+0.013
Neg logit contributions
 reality-0.013
 least-0.012
 Tooth-0.011
rob-0.011
 heaven-0.011
Token it
Token ID340
Feature activation+3.21e-01

Pos logit contributions
 least+0.009
 reality+0.009
 heaven+0.008
 wishing+0.008
rob+0.008
Neg logit contributions
adoes-0.006
 face-0.006
 structures-0.006
asuring-0.006
rac-0.006
Token!"
Token ID2474
Feature activation+9.24e-02

Pos logit contributions
 least+0.009
 reality+0.009
rob+0.009
 wishing+0.009
 certainty+0.009
Neg logit contributions
 face-0.008
adoes-0.008
asuring-0.008
 structures-0.008
rac-0.007
Token\n
Token ID198
Feature activation+2.02e-02

Pos logit contributions
 reality+0.003
 least+0.003
ge+0.003
 certainty+0.003
 possible+0.003
Neg logit contributions
 face-0.002
rac-0.002
ap-0.002
asuring-0.002
 Everyone-0.002
Token\n
Token ID198
Feature activation+1.98e-01

Pos logit contributions
 least+0.001
 reality+0.001
 wishing+0.001
ge+0.001
 heaven+0.001
Neg logit contributions
 face-0.000
rac-0.000
ag-0.000
ap-0.000
<-0.000
TokenIsa
Token ID39443
Feature activation-1.10e+00

Pos logit contributions
 least+0.006
 reality+0.006
 wishing+0.006
 heaven+0.006
 certainty+0.006
Neg logit contributions
 face-0.005
rac-0.005
asuring-0.004
ap-0.004
ag-0.004
Tokenac
Token ID330
Feature activation-3.57e+00

Pos logit contributions
 face+0.017
rac+0.016
ap+0.016
ag+0.016
adoes+0.015
Neg logit contributions
 wishing-0.044
 Gray-0.043
 least-0.043
rob-0.043
 reality-0.042
Token gul
Token ID47161
Feature activation-1.05e+00

Pos logit contributions
adoes+0.102
 face+0.100
 structures+0.096
stals+0.093
uting+0.092
Neg logit contributions
 least-0.099
 reality-0.094
 Tooth-0.089
 heaven-0.088
 Gray-0.088
Tokenped
Token ID9124
Feature activation-2.45e-01

Pos logit contributions
 structures+0.026
adoes+0.025
 face+0.024
asuring+0.023
stals+0.023
Neg logit contributions
 least-0.031
 reality-0.030
 heaven-0.030
 Gray-0.030
 Tooth-0.029
Token and
Token ID290
Feature activation+8.59e-01

Pos logit contributions
 face+0.006
adoes+0.006
 structures+0.005
rac+0.005
ag+0.005
Neg logit contributions
 least-0.008
 reality-0.008
rob-0.008
 heaven-0.007
 Tooth-0.007
Token said
Token ID531
Feature activation+6.43e-01

Pos logit contributions
 reality+0.028
 certainty+0.026
 suitable+0.026
 heaven+0.026
 least+0.026
Neg logit contributions
 face-0.023
rac-0.020
adoes-0.019
ap-0.019
asuring-0.018
Token,
Token ID11
Feature activation+5.54e-01

Pos logit contributions
 least+0.018
 reality+0.018
 Gray+0.017
 wishing+0.017
 heaven+0.017
Neg logit contributions
adoes-0.018
 face-0.018
 structures-0.017
ag-0.016
rac-0.016
Token said
Token ID531
Feature activation+6.21e-02

Pos logit contributions
 face+0.005
adoes+0.004
 structures+0.004
rac+0.004
 squash+0.004
Neg logit contributions
 reality-0.006
 least-0.005
 heaven-0.005
rob-0.005
 North-0.005
Token,
Token ID11
Feature activation-2.06e-01

Pos logit contributions
 reality+0.001
 wishing+0.001
 she+0.001
 knowing+0.001
 Gray+0.001
Neg logit contributions
adoes-0.002
 structures-0.002
ags-0.002
ag-0.002
uting-0.002
Token "
Token ID366
Feature activation-3.65e-01

Pos logit contributions
adoes+0.005
 face+0.005
 structures+0.005
ag+0.004
rac+0.004
Neg logit contributions
 least-0.007
 wishing-0.007
 reality-0.007
 Gray-0.006
 knowing-0.006
TokenA
Token ID32
Feature activation+1.38e-01

Pos logit contributions
adoes+0.009
 face+0.008
rac+0.008
uting+0.008
 structures+0.008
Neg logit contributions
 reality-0.012
 least-0.012
 heaven-0.011
 Gray-0.011
 wishing-0.011
Tokenbr
Token ID1671
Feature activation-6.85e-01

Pos logit contributions
 least+0.005
rob+0.004
 heaven+0.004
ge+0.004
 certainty+0.004
Neg logit contributions
 face-0.003
stals-0.003
adoes-0.003
ag-0.003
ap-0.003
Tokenac
Token ID330
Feature activation-3.56e+00

Pos logit contributions
adoes+0.010
 face+0.010
 structures+0.009
stals+0.008
ag+0.008
Neg logit contributions
 least-0.029
 reality-0.028
ge-0.027
 someday-0.027
 heaven-0.027
Tokenad
Token ID324
Feature activation-7.48e-01

Pos logit contributions
 face+0.118
adoes+0.118
uting+0.118
 structures+0.118
stals+0.112
Neg logit contributions
 least-0.079
 reality-0.070
 Tooth-0.070
rob-0.066
 certainty-0.065
Tokenab
Token ID397
Feature activation-1.31e+00

Pos logit contributions
 face+0.021
adoes+0.021
ag+0.020
 structures+0.020
asuring+0.019
Neg logit contributions
 least-0.020
 reality-0.020
 wishing-0.019
 Gray-0.019
 Tooth-0.019
Tokenra
Token ID430
Feature activation-8.72e-01

Pos logit contributions
 structures+0.039
adoes+0.039
 face+0.037
ag+0.036
arre+0.036
Neg logit contributions
 least-0.034
 reality-0.033
 anytime-0.031
 someday-0.030
 Gray-0.030
Token!"
Token ID2474
Feature activation-2.18e-01

Pos logit contributions
 face+0.023
adoes+0.022
rac+0.021
asuring+0.021
ap+0.021
Neg logit contributions
 reality-0.027
 least-0.026
 heaven-0.025
 wishing-0.025
 North-0.025
Token Suddenly
Token ID24975
Feature activation+1.01e-01

Pos logit contributions
 face+0.006
adoes+0.006
 structures+0.005
rac+0.005
ag+0.005
Neg logit contributions
 reality-0.007
 least-0.006
ge-0.006
rob-0.006
 heaven-0.006
Token his
Token ID465
Feature activation+1.26e-01

Pos logit contributions
 face+0.005
 structures+0.005
 in+0.005
adoes+0.004
 squash+0.004
Neg logit contributions
rob-0.011
 reality-0.011
iko-0.011
 Tooth-0.011
 certainty-0.011
Token wet
Token ID9583
Feature activation+1.10e-01

Pos logit contributions
 reality+0.004
 least+0.004
rob+0.004
ge+0.004
 heaven+0.004
Neg logit contributions
 face-0.003
adoes-0.003
 structures-0.003
rac-0.003
ag-0.002
Token radio
Token ID5243
Feature activation+6.32e-01

Pos logit contributions
 least+0.004
 reality+0.003
 heaven+0.003
 Gray+0.003
 wishing+0.003
Neg logit contributions
adoes-0.003
 face-0.003
 structures-0.002
ag-0.002
asuring-0.002
Token.
Token ID13
Feature activation-1.80e-01

Pos logit contributions
 reality+0.019
 least+0.019
rob+0.018
 certainty+0.018
 Tooth+0.017
Neg logit contributions
 face-0.018
adoes-0.017
 structures-0.016
uting-0.015
ag-0.015
Token Mir
Token ID7381
Feature activation-1.12e+00

Pos logit contributions
 face+0.004
adoes+0.004
 structures+0.004
rac+0.004
stals+0.003
Neg logit contributions
 least-0.006
 reality-0.006
 heaven-0.006
rob-0.006
 wishing-0.006
Tokenac
Token ID330
Feature activation-3.56e+00

Pos logit contributions
adoes+0.028
 face+0.028
 structures+0.027
stals+0.025
uting+0.025
Neg logit contributions
 least-0.035
 reality-0.035
 heaven-0.033
 Tooth-0.032
 someday-0.032
Tokenulously
Token ID18117
Feature activation-6.48e-01

Pos logit contributions
 face+0.108
adoes+0.106
 structures+0.101
stals+0.101
asuring+0.099
Neg logit contributions
 least-0.092
 reality-0.085
rob-0.081
 Tooth-0.080
ge-0.080
Token,
Token ID11
Feature activation-6.14e-01

Pos logit contributions
 face+0.016
adoes+0.015
 structures+0.014
stals+0.013
ag+0.013
Neg logit contributions
 least-0.021
 reality-0.021
 Tooth-0.020
rob-0.020
 heaven-0.020
Token it
Token ID340
Feature activation+4.18e-02

Pos logit contributions
 face+0.017
adoes+0.017
 structures+0.015
 squash+0.014
asuring+0.014
Neg logit contributions
 least-0.017
rob-0.017
 reality-0.017
 heaven-0.016
 certainty-0.016
Token made
Token ID925
Feature activation+4.92e-01

Pos logit contributions
 least+0.001
 reality+0.001
 Gray+0.001
 wishing+0.001
 Tooth+0.001
Neg logit contributions
adoes-0.001
 face-0.001
 structures-0.001
rac-0.001
ag-0.001
Token a
Token ID257
Feature activation+3.52e-01

Pos logit contributions
rob+0.017
 least+0.017
ge+0.017
och+0.016
 wishing+0.016
Neg logit contributions
 face-0.012
adoes-0.011
 structures-0.010
ag-0.010
 faces-0.009
Token tidy
Token ID43044
Feature activation+5.97e-01

Pos logit contributions
 least+0.031
 heaven+0.031
ge+0.030
 reality+0.030
rob+0.030
Neg logit contributions
 face-0.029
adoes-0.027
rac-0.024
 structures-0.024
stals-0.024
Token.
Token ID13
Feature activation-9.49e-02

Pos logit contributions
 least+0.018
rob+0.018
Ba+0.018
 Gray+0.017
iko+0.017
Neg logit contributions
 face-0.017
adoes-0.016
 structures-0.014
 in-0.014
stals-0.013
Token\n
Token ID198
Feature activation+3.36e-02

Pos logit contributions
 face+0.002
 Everyone+0.002
stals+0.002
adoes+0.002
 Toys+0.002
Neg logit contributions
rob-0.003
ge-0.003
 reality-0.003
 least-0.003
ited-0.003
Token\n
Token ID198
Feature activation+1.48e-01

Pos logit contributions
 least+0.001
 reality+0.001
 heaven+0.001
rob+0.001
 Tooth+0.001
Neg logit contributions
 face-0.001
ag-0.001
adoes-0.000
<-0.000
stals-0.000
TokenGr
Token ID8642
Feature activation-1.30e+00

Pos logit contributions
 least+0.004
 reality+0.004
 heaven+0.004
 Gray+0.004
 Tooth+0.004
Neg logit contributions
 face-0.004
adoes-0.004
stals-0.004
rac-0.004
 structures-0.004
Tokenac
Token ID330
Feature activation-3.53e+00

Pos logit contributions
 face+0.018
adoes+0.017
ag+0.015
ap+0.015
 structures+0.015
Neg logit contributions
rob-0.054
 wishing-0.054
 Gray-0.054
 least-0.053
 reality-0.053
Tokenie
Token ID494
Feature activation-6.18e-01

Pos logit contributions
adoes+0.124
 face+0.123
 structures+0.118
stals+0.114
asuring+0.113
Neg logit contributions
 least-0.075
 reality-0.072
 Tooth-0.066
 heaven-0.066
 Gray-0.065
Token felt
Token ID2936
Feature activation+1.06e-01

Pos logit contributions
 face+0.017
adoes+0.016
 structures+0.015
rac+0.015
ag+0.015
Neg logit contributions
 reality-0.018
 least-0.017
 heaven-0.017
 Gray-0.016
 Tooth-0.016
Token very
Token ID845
Feature activation-5.76e-01

Pos logit contributions
rob+0.004
ge+0.004
iko+0.004
 Gray+0.004
 reality+0.003
Neg logit contributions
 face-0.003
ag-0.002
 in-0.002
asuring-0.002
rac-0.002
Token proud
Token ID6613
Feature activation+3.12e-01

Pos logit contributions
 face+0.017
rac+0.014
ag+0.014
ap+0.014
asuring+0.014
Neg logit contributions
rob-0.017
 Gray-0.017
 least-0.016
 reality-0.016
 heaven-0.016
Token of
Token ID286
Feature activation-3.91e-01

Pos logit contributions
rob+0.014
 least+0.013
 reality+0.013
 Gray+0.013
ge+0.013
Neg logit contributions
 face-0.005
adoes-0.003
asuring-0.003
 structures-0.003
ap-0.003
Token the
Token ID262
Feature activation-6.71e-01

Pos logit contributions
 face+0.024
adoes+0.024
 structures+0.022
rac+0.022
ag+0.022
Neg logit contributions
 reality-0.030
 least-0.030
 wishing-0.029
 heaven-0.028
ge-0.028
Token story
Token ID1621
Feature activation-1.50e+00

Pos logit contributions
adoes+0.021
agged+0.021
uting+0.020
ags+0.020
rac+0.019
Neg logit contributions
 Tooth-0.021
 Gray-0.020
 North-0.019
 wishing-0.019
 least-0.019
Token is
Token ID318
Feature activation-1.08e-01

Pos logit contributions
 face+0.052
adoes+0.050
ags+0.050
ag+0.049
agged+0.049
Neg logit contributions
 least-0.032
 someday-0.030
 anytime-0.030
 reality-0.028
 Gray-0.028
Token:
Token ID25
Feature activation-2.97e-01

Pos logit contributions
adoes+0.005
agged+0.004
 structures+0.004
 face+0.004
ags+0.004
Neg logit contributions
 least-0.002
 knowing-0.002
 H-0.002
 wishing-0.002
 she-0.002
Token Qu
Token ID2264
Feature activation-1.54e+00

Pos logit contributions
agged+0.010
adoes+0.010
ags+0.009
ag+0.009
anned+0.009
Neg logit contributions
 least-0.008
 someday-0.007
 anytime-0.007
 wishing-0.007
 Gray-0.007
Tokenarre
Token ID9624
Feature activation-3.52e+00

Pos logit contributions
adoes+0.039
 face+0.037
 structures+0.034
rac+0.033
asuring+0.032
Neg logit contributions
 least-0.051
 reality-0.049
 heaven-0.047
 anytime-0.046
 Gray-0.044
Tokenlling
Token ID2680
Feature activation-1.13e+00

Pos logit contributions
 face+0.107
adoes+0.097
 palms+0.096
wered+0.094
rac+0.093
Neg logit contributions
 least-0.086
ge-0.083
 reality-0.081
able-0.079
 anytime-0.077
Token over
Token ID625
Feature activation-1.72e-01

Pos logit contributions
adoes+0.031
 face+0.030
 structures+0.029
rac+0.029
agged+0.029
Neg logit contributions
 least-0.032
 reality-0.030
 Gray-0.028
 heaven-0.028
 wishing-0.028
Token small
Token ID1402
Feature activation+1.04e+00

Pos logit contributions
 face+0.004
adoes+0.004
 structures+0.004
rac+0.004
asuring+0.004
Neg logit contributions
 reality-0.005
 least-0.005
rob-0.005
 Tooth-0.005
 Gray-0.005
Token things
Token ID1243
Feature activation+1.61e-01

Pos logit contributions
 least+0.032
 reality+0.032
 heaven+0.030
 wishing+0.029
 anytime+0.029
Neg logit contributions
adoes-0.026
 face-0.025
agged-0.025
rac-0.025
ag-0.024
Token is
Token ID318
Feature activation-3.64e-01

Pos logit contributions
 reality+0.005
 Tooth+0.005
rob+0.005
 least+0.005
 heaven+0.005
Neg logit contributions
 face-0.004
adoes-0.004
 structures-0.003
ag-0.003
asuring-0.003
Token and
Token ID290
Feature activation-2.13e-01

Pos logit contributions
 face+0.012
adoes+0.012
 structures+0.011
ag+0.010
rac+0.010
Neg logit contributions
 least-0.015
 reality-0.015
rob-0.015
 Tooth-0.014
 Gray-0.014
Token said
Token ID531
Feature activation+1.57e-01

Pos logit contributions
 face+0.006
adoes+0.005
 in+0.004
 structures+0.004
 faces+0.004
Neg logit contributions
 reality-0.007
 least-0.007
 heaven-0.007
rob-0.007
 North-0.007
Token "
Token ID366
Feature activation-1.05e-01

Pos logit contributions
 reality+0.003
 wishing+0.003
 Gray+0.003
 she+0.003
 knowing+0.003
Neg logit contributions
adoes-0.005
ags-0.005
 structures-0.005
ag-0.005
agged-0.005
TokenA
Token ID32
Feature activation+1.22e-01

Pos logit contributions
adoes+0.003
 face+0.002
rac+0.002
uting+0.002
ag+0.002
Neg logit contributions
 least-0.003
 reality-0.003
 heaven-0.003
 wishing-0.003
 Gray-0.003
Tokenbr
Token ID1671
Feature activation-5.92e-01

Pos logit contributions
 least+0.004
rob+0.004
 heaven+0.004
ge+0.004
 certainty+0.004
Neg logit contributions
 face-0.003
adoes-0.003
stals-0.003
ag-0.002
ap-0.002
Tokenac
Token ID330
Feature activation-3.52e+00

Pos logit contributions
adoes+0.009
 face+0.009
 structures+0.008
stals+0.007
ag+0.007
Neg logit contributions
 least-0.025
 reality-0.024
 heaven-0.023
ge-0.023
 someday-0.023
Tokenad
Token ID324
Feature activation-8.14e-01

Pos logit contributions
 face+0.107
adoes+0.106
 structures+0.103
uting+0.102
stals+0.101
Neg logit contributions
 least-0.090
 reality-0.085
 Tooth-0.081
 heaven-0.081
rob-0.080
Tokenab
Token ID397
Feature activation-1.24e+00

Pos logit contributions
 face+0.023
adoes+0.023
ag+0.022
 structures+0.021
rac+0.021
Neg logit contributions
 least-0.023
 reality-0.022
 wishing-0.021
 Tooth-0.021
 Gray-0.021
Tokenra
Token ID430
Feature activation-9.33e-01

Pos logit contributions
 structures+0.037
adoes+0.036
 face+0.035
ag+0.035
arre+0.034
Neg logit contributions
 least-0.032
 reality-0.031
 anytime-0.029
 Gray-0.029
 someday-0.029
Token!"
Token ID2474
Feature activation-1.88e-01

Pos logit contributions
 face+0.024
adoes+0.023
rac+0.022
asuring+0.022
ap+0.021
Neg logit contributions
 reality-0.029
 least-0.028
 heaven-0.027
 Tooth-0.027
 wishing-0.027
Token but
Token ID475
Feature activation+8.69e-01

Pos logit contributions
 face+0.005
adoes+0.005
 structures+0.005
rac+0.005
ag+0.005
Neg logit contributions
 least-0.005
 reality-0.005
 heaven-0.005
 wishing-0.005
 Gray-0.005
Token cry
Token ID3960
Feature activation+4.16e-01

Pos logit contributions
adoes+0.003
 structures+0.002
 face+0.002
rac+0.002
ag+0.002
Neg logit contributions
 least-0.003
 reality-0.003
 heaven-0.003
 wishing-0.003
 Gray-0.003
Token.
Token ID13
Feature activation-2.79e-01

Pos logit contributions
rob+0.013
 least+0.012
 certainty+0.011
 North+0.011
 reality+0.011
Neg logit contributions
 face-0.014
adoes-0.012
ap-0.011
 structures-0.011
uting-0.010
Token\n
Token ID198
Feature activation-3.30e-01

Pos logit contributions
 face+0.007
adoes+0.006
rac+0.006
stals+0.006
 Toys+0.006
Neg logit contributions
 reality-0.009
rob-0.009
 least-0.009
ge-0.009
 heaven-0.009
Token\n
Token ID198
Feature activation-1.57e-01

Pos logit contributions
 face+0.008
adoes+0.007
rac+0.007
ag+0.007
 structures+0.007
Neg logit contributions
 least-0.011
 reality-0.011
rob-0.010
 heaven-0.010
 wishing-0.010
TokenGr
Token ID8642
Feature activation-1.28e+00

Pos logit contributions
adoes+0.006
 structures+0.006
 face+0.006
ag+0.005
asuring+0.005
Neg logit contributions
 reality-0.003
 least-0.003
 Gray-0.003
 wishing-0.003
 heaven-0.003
Tokenac
Token ID330
Feature activation-3.51e+00

Pos logit contributions
 face+0.020
adoes+0.018
ap+0.016
ag+0.016
anish+0.015
Neg logit contributions
rob-0.055
 Gray-0.052
 wishing-0.052
 least-0.051
 dreaming-0.051
Tokenie
Token ID494
Feature activation-7.53e-01

Pos logit contributions
adoes+0.121
 face+0.121
 structures+0.116
stals+0.112
asuring+0.111
Neg logit contributions
 least-0.076
 reality-0.073
 Tooth-0.068
 heaven-0.068
 Gray-0.067
Token's
Token ID338
Feature activation-6.14e-01

Pos logit contributions
adoes+0.025
 structures+0.024
ags+0.023
stals+0.023
ag+0.023
Neg logit contributions
 least-0.018
 reality-0.017
 Gray-0.017
 wishing-0.017
 heaven-0.017
Token mom
Token ID1995
Feature activation-9.98e-02

Pos logit contributions
adoes+0.015
stals+0.013
 structures+0.013
ag+0.013
uting+0.013
Neg logit contributions
 reality-0.020
 least-0.020
 wishing-0.020
 Tooth-0.019
 heaven-0.019
Token was
Token ID373
Feature activation-5.04e-01

Pos logit contributions
 face+0.003
adoes+0.003
 structures+0.003
ag+0.003
rac+0.003
Neg logit contributions
 least-0.003
 reality-0.003
 heaven-0.003
 Gray-0.003
 Tooth-0.003
Token so
Token ID523
Feature activation-3.95e-01

Pos logit contributions
adoes+0.020
 structures+0.019
stals+0.018
anish+0.017
rac+0.017
Neg logit contributions
 least-0.011
 wishing-0.011
 dreaming-0.010
 heaven-0.010
 knowing-0.010
Token said
Token ID531
Feature activation+1.75e-02

Pos logit contributions
 face+0.023
adoes+0.022
 structures+0.021
rac+0.020
asuring+0.020
Neg logit contributions
 reality-0.023
 least-0.022
ge-0.021
 heaven-0.021
 certainty-0.021
Token,
Token ID11
Feature activation-1.40e-01

Pos logit contributions
 least+0.001
 reality+0.001
 heaven+0.000
 Tooth+0.000
 Gray+0.000
Neg logit contributions
 face-0.000
adoes-0.000
 structures-0.000
rac-0.000
ag-0.000
Token "
Token ID366
Feature activation-3.77e-01

Pos logit contributions
adoes+0.003
 face+0.003
 structures+0.003
ag+0.003
rac+0.003
Neg logit contributions
 least-0.005
 reality-0.005
 wishing-0.005
 Gray-0.005
 heaven-0.005
TokenA
Token ID32
Feature activation+1.38e-01

Pos logit contributions
rac+0.008
adoes+0.008
 face+0.008
Everyone+0.008
asuring+0.008
Neg logit contributions
 reality-0.013
 least-0.013
ge-0.012
 heaven-0.012
 wishing-0.012
Tokenbr
Token ID1671
Feature activation-5.40e-01

Pos logit contributions
 least+0.005
rob+0.005
ge+0.005
 certainty+0.005
 wishing+0.004
Neg logit contributions
 face-0.003
ap-0.003
ag-0.003
adoes-0.003
 pitch-0.003
Tokenac
Token ID330
Feature activation-3.51e+00

Pos logit contributions
adoes+0.008
 face+0.007
 structures+0.007
stals+0.006
ag+0.006
Neg logit contributions
 least-0.023
 reality-0.023
 heaven-0.022
 Gray-0.022
 someday-0.021
Tokenad
Token ID324
Feature activation-7.40e-01

Pos logit contributions
adoes+0.112
uting+0.112
 face+0.111
 structures+0.110
stals+0.109
Neg logit contributions
 least-0.083
 heaven-0.075
 reality-0.075
rob-0.074
 Tooth-0.072
Tokenab
Token ID397
Feature activation-1.25e+00

Pos logit contributions
 face+0.019
ag+0.018
adoes+0.018
asuring+0.018
ap+0.018
Neg logit contributions
 reality-0.023
 least-0.022
 wishing-0.021
 Tooth-0.021
ge-0.021
Tokenra
Token ID430
Feature activation-8.50e-01

Pos logit contributions
adoes+0.033
 face+0.033
 structures+0.032
ag+0.030
stals+0.030
Neg logit contributions
 least-0.036
 reality-0.036
 heaven-0.034
 Gray-0.034
 anytime-0.033
Token,
Token ID11
Feature activation-1.21e-01

Pos logit contributions
 face+0.018
rac+0.017
ap+0.017
adoes+0.016
asuring+0.016
Neg logit contributions
 reality-0.031
 least-0.029
 wishing-0.029
 heaven-0.028
 certainty-0.028
Token make
Token ID787
Feature activation-1.20e-01

Pos logit contributions
 face+0.003
adoes+0.003
 structures+0.003
rac+0.003
asuring+0.003
Neg logit contributions
 least-0.004
 reality-0.004
rob-0.003
ge-0.003
 heaven-0.003
Token around
Token ID1088
Feature activation-4.35e-01

Pos logit contributions
 least+0.002
 reality+0.002
rob+0.002
 Tooth+0.002
 heaven+0.002
Neg logit contributions
 face-0.001
adoes-0.001
 structures-0.001
ag-0.001
rac-0.001
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
 face+0.015
adoes+0.014
 structures+0.014
ag+0.013
ap+0.013
Neg logit contributions
 least-0.011
rob-0.011
 reality-0.010
 Tooth-0.010
 North-0.009
Token "
Token ID366
Feature activation-5.34e-01

Pos logit contributions
 face+0.008
adoes+0.008
 structures+0.007
rac+0.007
ag+0.007
Neg logit contributions
 least-0.008
 reality-0.008
 heaven-0.008
 Gray-0.008
 wishing-0.008
TokenA
Token ID32
Feature activation+1.84e-01

Pos logit contributions
adoes+0.015
 face+0.015
 structures+0.014
ag+0.014
rac+0.014
Neg logit contributions
 least-0.015
 reality-0.014
 heaven-0.014
 Gray-0.014
 Tooth-0.014
Tokenbr
Token ID1671
Feature activation-7.51e-01

Pos logit contributions
 least+0.007
ge+0.007
 reality+0.006
 heaven+0.006
 wishing+0.006
Neg logit contributions
 face-0.004
ag-0.003
stals-0.003
adoes-0.003
 structures-0.003
Tokenac
Token ID330
Feature activation-3.51e+00

Pos logit contributions
adoes+0.013
 face+0.013
 structures+0.012
stals+0.011
uting+0.011
Neg logit contributions
 least-0.029
 reality-0.029
ge-0.027
 heaven-0.027
 someday-0.027
Tokenad
Token ID324
Feature activation-9.15e-01

Pos logit contributions
uting+0.115
adoes+0.115
 face+0.114
stals+0.113
 structures+0.113
Neg logit contributions
 least-0.079
rob-0.073
 reality-0.070
 heaven-0.070
 North-0.068
Tokenab
Token ID397
Feature activation-1.42e+00

Pos logit contributions
 face+0.027
adoes+0.026
ag+0.025
 structures+0.025
rac+0.024
Neg logit contributions
 least-0.025
 reality-0.025
 wishing-0.023
 Tooth-0.023
 Gray-0.023
Tokenra
Token ID430
Feature activation-9.42e-01

Pos logit contributions
 structures+0.040
adoes+0.040
 face+0.039
ag+0.037
stals+0.037
Neg logit contributions
 least-0.038
 reality-0.037
 anytime-0.035
 Gray-0.035
 heaven-0.035
Token!"
Token ID2474
Feature activation-3.01e-01

Pos logit contributions
 face+0.023
adoes+0.022
rac+0.021
ag+0.021
asuring+0.021
Neg logit contributions
 reality-0.031
 least-0.030
 heaven-0.029
 Tooth-0.028
 wishing-0.028
Token\n
Token ID198
Feature activation-3.20e-01

Pos logit contributions
 face+0.013
adoes+0.013
rac+0.012
 structures+0.012
ag+0.012
Neg logit contributions
 reality-0.004
 least-0.004
ge-0.004
rob-0.004
 North-0.003
Token face
Token ID1986
Feature activation+4.23e-01

Pos logit contributions
rob+0.018
 reality+0.017
 least+0.017
 Gray+0.016
ge+0.016
Neg logit contributions
 face-0.014
adoes-0.012
 structures-0.011
ag-0.011
rac-0.011
Token.
Token ID13
Feature activation-3.19e-01

Pos logit contributions
rob+0.014
 least+0.013
 reality+0.013
Ba+0.012
 certainty+0.012
Neg logit contributions
 face-0.012
adoes-0.011
 in-0.010
asuring-0.010
ag-0.009
Token\n
Token ID198
Feature activation-1.64e-01

Pos logit contributions
 face+0.007
adoes+0.007
stals+0.007
 Toys+0.006
rac+0.006
Neg logit contributions
 reality-0.011
rob-0.011
ge-0.011
 least-0.010
 heaven-0.010
Token\n
Token ID198
Feature activation+4.43e-02

Pos logit contributions
 face+0.003
ag+0.003
<+0.003
rac+0.003
stals+0.003
Neg logit contributions
 least-0.006
 reality-0.006
 heaven-0.005
 wishing-0.005
 Tooth-0.005
TokenGr
Token ID8642
Feature activation-1.23e+00

Pos logit contributions
 least+0.001
 reality+0.001
 heaven+0.001
 Gray+0.001
rob+0.001
Neg logit contributions
 face-0.002
adoes-0.001
 structures-0.001
rac-0.001
ag-0.001
Tokenac
Token ID330
Feature activation-3.50e+00

Pos logit contributions
 face+0.020
adoes+0.019
ag+0.017
ap+0.017
 structures+0.016
Neg logit contributions
rob-0.050
 wishing-0.049
 Gray-0.049
 least-0.048
 dreaming-0.047
Tokenie
Token ID494
Feature activation-5.36e-01

Pos logit contributions
adoes+0.125
 face+0.125
 structures+0.119
stals+0.116
asuring+0.115
Neg logit contributions
 least-0.072
 reality-0.069
 Tooth-0.064
 heaven-0.064
 Gray-0.063
Token started
Token ID2067
Feature activation+2.40e-02

Pos logit contributions
adoes+0.018
 structures+0.017
ags+0.017
stals+0.016
ag+0.016
Neg logit contributions
 least-0.013
 reality-0.012
 wishing-0.012
 Gray-0.012
 heaven-0.012
Token to
Token ID284
Feature activation-1.76e-01

Pos logit contributions
 least+0.001
 wishing+0.001
 reality+0.001
 heaven+0.001
 dreaming+0.001
Neg logit contributions
adoes-0.001
 structures-0.001
 face-0.001
stals-0.001
rac-0.001
Token cry
Token ID3960
Feature activation+5.61e-01

Pos logit contributions
adoes+0.005
 face+0.005
 structures+0.005
rac+0.004
ag+0.004
Neg logit contributions
 least-0.005
 reality-0.005
 heaven-0.005
 wishing-0.005
 Gray-0.005
Token and
Token ID290
Feature activation+5.75e-01

Pos logit contributions
rob+0.020
 least+0.018
 certainty+0.018
 reality+0.017
 North+0.017
Neg logit contributions
 face-0.016
adoes-0.014
ap-0.012
asuring-0.012
uting-0.011
Token her
Token ID607
Feature activation+3.39e-01

Pos logit contributions
 face+0.016
adoes+0.015
ag+0.014
lasses+0.014
rac+0.014
Neg logit contributions
 reality-0.012
 least-0.012
rob-0.012
ge-0.012
 Tooth-0.011
Token.
Token ID13
Feature activation-2.78e-01

Pos logit contributions
 least+0.009
 reality+0.009
rob+0.009
 Tooth+0.009
 Gray+0.008
Neg logit contributions
 face-0.010
adoes-0.010
 structures-0.009
ag-0.009
rac-0.009
Token
Token ID198
Feature activation+2.56e-02

Pos logit contributions
 Everyone+0.005
 In+0.004
 face+0.004
 Everybody+0.004
 Toys+0.004
Neg logit contributions
ge-0.012
 reality-0.012
rob-0.011
ited-0.011
 least-0.011
Token
Token ID198
Feature activation+1.57e-01

Pos logit contributions
 least+0.001
 reality+0.001
 heaven+0.001
ge+0.001
 Tooth+0.001
Neg logit contributions
 face-0.000
ag-0.000
<-0.000
rac-0.000
stals-0.000
TokenGr
Token ID8642
Feature activation-1.28e+00

Pos logit contributions
 least+0.005
 reality+0.005
 heaven+0.005
 Gray+0.005
 wishing+0.005
Neg logit contributions
 face-0.004
adoes-0.004
rac-0.004
ag-0.003
stals-0.003
Tokenac
Token ID330
Feature activation-3.50e+00

Pos logit contributions
 face+0.016
adoes+0.015
ap+0.014
anish+0.013
ag+0.013
Neg logit contributions
rob-0.057
 wishing-0.056
 Gray-0.055
 dreaming-0.054
 least-0.054
Tokenie
Token ID494
Feature activation-4.53e-01

Pos logit contributions
adoes+0.121
 face+0.121
 structures+0.115
rac+0.111
ag+0.111
Neg logit contributions
 least-0.076
 reality-0.074
 heaven-0.068
 Gray-0.068
 Tooth-0.067
Token sat
Token ID3332
Feature activation-8.63e-02

Pos logit contributions
 face+0.013
adoes+0.012
rac+0.012
 structures+0.012
ag+0.011
Neg logit contributions
 reality-0.012
 least-0.012
 heaven-0.012
 Tooth-0.012
 Gray-0.012
Token and
Token ID290
Feature activation+5.14e-01

Pos logit contributions
adoes+0.002
 face+0.002
 structures+0.002
ag+0.002
rac+0.002
Neg logit contributions
 least-0.003
 reality-0.003
 heaven-0.003
 wishing-0.003
 Gray-0.003
Token listened
Token ID16399
Feature activation-3.88e-01

Pos logit contributions
 reality+0.016
 least+0.015
ge+0.014
 heaven+0.014
rob+0.014
Neg logit contributions
 face-0.014
rac-0.012
adoes-0.012
asuring-0.011
ap-0.011
Token to
Token ID284
Feature activation+5.11e-01

Pos logit contributions
 face+0.009
adoes+0.008
ag+0.007
rac+0.007
uting+0.007
Neg logit contributions
 least-0.014
rob-0.014
 reality-0.013
 heaven-0.013
 Tooth-0.013
Token wand
Token ID11569
Feature activation+5.26e-01

Pos logit contributions
 least+0.010
 reality+0.010
 heaven+0.009
 wishing+0.009
rob+0.009
Neg logit contributions
 face-0.009
ag-0.008
adoes-0.008
 structures-0.007
ap-0.007
Token.
Token ID13
Feature activation-5.46e-01

Pos logit contributions
 reality+0.017
rob+0.016
 least+0.016
 Tooth+0.016
 certainty+0.015
Neg logit contributions
 face-0.014
adoes-0.013
ag-0.012
ap-0.012
asuring-0.012
Token "
Token ID366
Feature activation-6.21e-01

Pos logit contributions
adoes+0.016
 face+0.015
 structures+0.015
ag+0.014
rac+0.014
Neg logit contributions
 least-0.015
 reality-0.015
 Gray-0.014
 heaven-0.014
 wishing-0.014
TokenA
Token ID32
Feature activation+1.69e-01

Pos logit contributions
adoes+0.017
 face+0.017
 structures+0.016
rac+0.015
ag+0.015
Neg logit contributions
 least-0.018
 reality-0.018
 heaven-0.017
 wishing-0.017
 Gray-0.017
Tokenbr
Token ID1671
Feature activation-6.49e-01

Pos logit contributions
 least+0.007
ge+0.006
 reality+0.006
 heaven+0.006
 wishing+0.006
Neg logit contributions
 face-0.003
ag-0.003
stals-0.003
adoes-0.003
 structures-0.002
Tokenac
Token ID330
Feature activation-3.49e+00

Pos logit contributions
adoes+0.010
 face+0.010
 structures+0.009
stals+0.008
uting+0.008
Neg logit contributions
 least-0.027
 reality-0.026
 heaven-0.025
ge-0.025
 someday-0.025
Tokenad
Token ID324
Feature activation-7.58e-01

Pos logit contributions
adoes+0.111
 face+0.111
uting+0.110
 structures+0.109
stals+0.106
Neg logit contributions
 least-0.081
rob-0.075
 reality-0.074
 heaven-0.073
 Tooth-0.071
Tokenab
Token ID397
Feature activation-1.41e+00

Pos logit contributions
 face+0.023
adoes+0.022
 structures+0.021
ag+0.021
rac+0.021
Neg logit contributions
 least-0.020
 reality-0.020
 wishing-0.018
 Gray-0.018
 heaven-0.018
Tokenra
Token ID430
Feature activation-8.26e-01

Pos logit contributions
 structures+0.040
adoes+0.040
 face+0.039
ag+0.037
uting+0.037
Neg logit contributions
 least-0.037
 reality-0.036
 Gray-0.034
 heaven-0.034
 anytime-0.034
Token!"
Token ID2474
Feature activation-3.04e-01

Pos logit contributions
 face+0.021
adoes+0.020
rac+0.019
ag+0.019
asuring+0.018
Neg logit contributions
 reality-0.026
 least-0.026
 heaven-0.025
 wishing-0.024
 Tooth-0.024
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
 face+0.011
adoes+0.011
 structures+0.011
rac+0.010
ag+0.010
Neg logit contributions
 least-0.006
 reality-0.006
 heaven-0.005
 wishing-0.005
 Gray-0.005
Token were
Token ID547
Feature activation-5.32e-03

Pos logit contributions
 least+0.014
 reality+0.013
 wishing+0.012
 Gray+0.012
 heaven+0.012
Neg logit contributions
adoes-0.013
 structures-0.012
 face-0.012
ag-0.011
stals-0.011
Token both
Token ID1111
Feature activation-1.06e+00

Pos logit contributions
adoes+0.000
 face+0.000
 structures+0.000
rac+0.000
stals+0.000
Neg logit contributions
 least-0.000
 reality-0.000
 wishing-0.000
 heaven-0.000
 dreaming-0.000
Token happy
Token ID3772
Feature activation-1.75e-01

Pos logit contributions
 face+0.030
 structures+0.025
ap+0.025
adoes+0.025
asuring+0.024
Neg logit contributions
rob-0.034
 Tooth-0.032
 Gray-0.031
 reality-0.030
 least-0.030
Token that
Token ID326
Feature activation-6.16e-01

Pos logit contributions
adoes+0.006
 face+0.006
 structures+0.005
rac+0.005
ag+0.005
Neg logit contributions
 least-0.004
 reality-0.004
 heaven-0.004
 wishing-0.004
 Gray-0.004
Token it
Token ID340
Feature activation+3.48e-01

Pos logit contributions
 face+0.015
adoes+0.015
 structures+0.014
asuring+0.013
rac+0.013
Neg logit contributions
 least-0.020
 reality-0.019
 heaven-0.019
rob-0.019
 Tooth-0.019
Token wasn
Token ID2492
Feature activation+2.02e+00

Pos logit contributions
 reality+0.010
 least+0.010
 heaven+0.010
 Gray+0.010
 Tooth+0.010
Neg logit contributions
 face-0.009
adoes-0.009
 structures-0.008
ag-0.008
rac-0.008
Token't
Token ID470
Feature activation-3.18e-01

Pos logit contributions
 least+0.072
 reality+0.070
 Gray+0.068
 heaven+0.067
 wishing+0.067
Neg logit contributions
adoes-0.042
 face-0.042
 structures-0.040
rac-0.037
ag-0.036
Token lost
Token ID2626
Feature activation+8.35e-01

Pos logit contributions
 face+0.009
adoes+0.008
 structures+0.008
ag+0.008
asuring+0.008
Neg logit contributions
 least-0.009
 reality-0.009
rob-0.009
 Gray-0.009
 Tooth-0.009
Token forever
Token ID8097
Feature activation+4.99e-01

Pos logit contributions
rob+0.026
 least+0.026
 Tooth+0.025
 reality+0.025
 Gray+0.025
Neg logit contributions
 face-0.021
adoes-0.021
 structures-0.020
ag-0.018
uting-0.018
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
 least+0.011
 reality+0.011
rob+0.010
 Gray+0.010
 Tooth+0.010
Neg logit contributions
 face-0.017
adoes-0.017
 structures-0.016
ag-0.015
asuring-0.015
Token<|endoftext|>
Token ID50256
Feature activation-1.82e-01

Pos logit contributions
 face+0.009
adoes+0.009
 structures+0.009
rac+0.008
ag+0.008
Neg logit contributions
 least-0.010
 reality-0.010
 heaven-0.009
 Gray-0.009
 Tooth-0.009
Token sweet
Token ID6029
Feature activation+8.28e-01

Pos logit contributions
ge+0.026
och+0.025
 least+0.025
rob+0.024
iw+0.023
Neg logit contributions
 face-0.013
ag-0.010
 jokes-0.010
adoes-0.009
 Toys-0.009
Token dreams
Token ID10625
Feature activation+1.43e+00

Pos logit contributions
 least+0.021
 reality+0.020
ge+0.019
 Gray+0.019
rob+0.019
Neg logit contributions
 face-0.027
adoes-0.025
ag-0.024
 structures-0.024
ap-0.023
Token about
Token ID546
Feature activation+1.00e+00

Pos logit contributions
 least+0.050
rob+0.049
 certainty+0.048
 reality+0.047
 Tooth+0.046
Neg logit contributions
 face-0.033
ag-0.030
adoes-0.027
ap-0.026
 in-0.026
Token carrots
Token ID34397
Feature activation+1.96e-01

Pos logit contributions
 least+0.033
ited+0.032
rob+0.031
isc+0.031
Ba+0.031
Neg logit contributions
 face-0.024
adoes-0.023
 structures-0.022
 jokes-0.020
 squash-0.019
Token and
Token ID290
Feature activation+6.34e-01

Pos logit contributions
 certainty+0.007
Ba+0.007
 reality+0.006
rob+0.006
 Tooth+0.006
Neg logit contributions
 in-0.004
 face-0.004
asuring-0.004
 squash-0.003
ag-0.003
Token sunshine
Token ID34488
Feature activation+2.13e+00

Pos logit contributions
 least+0.021
 reality+0.020
ge+0.020
rob+0.019
 heaven+0.019
Neg logit contributions
 face-0.015
adoes-0.015
 structures-0.014
stals-0.012
 squash-0.012
Token.
Token ID13
Feature activation-1.06e-01

Pos logit contributions
 least+0.066
 reality+0.063
 certainty+0.063
Ba+0.059
ge+0.059
Neg logit contributions
 face-0.057
adoes-0.054
 in-0.051
ag-0.049
 structures-0.049
Token<|endoftext|>
Token ID50256
Feature activation+2.67e-02

Pos logit contributions
 face+0.002
adoes+0.002
stals+0.002
 Everyone+0.002
 Toys+0.002
Neg logit contributions
ge-0.004
 reality-0.004
 least-0.004
rob-0.004
 heaven-0.004
TokenOnce
Token ID7454
Feature activation-5.08e-01

Pos logit contributions
 reality+0.001
 least+0.001
 wishing+0.001
ge+0.001
 heaven+0.001
Neg logit contributions
 face-0.001
adoes-0.001
ag-0.001
 structures-0.000
 Toys-0.000
Token upon
Token ID2402
Feature activation+2.98e-01

Pos logit contributions
 face+0.010
adoes+0.009
ag+0.009
 structures+0.009
rac+0.009
Neg logit contributions
 least-0.019
 reality-0.018
rob-0.018
 Tooth-0.018
 heaven-0.018
Token a
Token ID257
Feature activation+1.02e+00

Pos logit contributions
 least+0.010
 reality+0.010
rob+0.010
ge+0.009
 North+0.009
Neg logit contributions
 face-0.007
ag-0.006
adoes-0.006
 structures-0.006
<-0.006
Token sorry
Token ID7926
Feature activation+3.88e-01

Pos logit contributions
 she+0.011
 wishing+0.010
 Gray+0.010
 knowing+0.010
 someday+0.009
Neg logit contributions
ags-0.018
adoes-0.018
 structures-0.017
uting-0.017
apers-0.017
Token and
Token ID290
Feature activation+9.63e-01

Pos logit contributions
 reality+0.015
 least+0.015
rob+0.014
 heaven+0.014
 Tooth+0.014
Neg logit contributions
 face-0.008
adoes-0.007
 structures-0.006
ag-0.006
asuring-0.006
Token asked
Token ID1965
Feature activation+4.27e-01

Pos logit contributions
 reality+0.028
 least+0.027
rob+0.026
ge+0.026
 North+0.026
Neg logit contributions
 face-0.027
adoes-0.026
 structures-0.024
rac-0.023
asuring-0.023
Token if
Token ID611
Feature activation-4.18e-01

Pos logit contributions
rob+0.014
 least+0.014
 reality+0.013
ited+0.013
 North+0.013
Neg logit contributions
 face-0.012
adoes-0.010
 in-0.010
 structures-0.010
ag-0.010
Token there
Token ID612
Feature activation+5.98e-01

Pos logit contributions
 face+0.014
adoes+0.013
 structures+0.012
stals+0.012
lasses+0.012
Neg logit contributions
rob-0.009
 least-0.009
ge-0.009
 reality-0.009
 North-0.009
Token was
Token ID373
Feature activation+2.16e+00

Pos logit contributions
 reality+0.021
 least+0.021
 Tooth+0.020
 heaven+0.020
rob+0.020
Neg logit contributions
 face-0.013
adoes-0.013
 structures-0.012
ag-0.011
rac-0.011
Token anything
Token ID1997
Feature activation+1.89e+00

Pos logit contributions
rob+0.063
 least+0.062
ge+0.061
 reality+0.059
 wishing+0.059
Neg logit contributions
 face-0.061
adoes-0.058
 structures-0.054
ag-0.051
rac-0.050
Token he
Token ID339
Feature activation-3.24e-01

Pos logit contributions
rob+0.067
 least+0.067
 reality+0.067
 North+0.065
 Tooth+0.064
Neg logit contributions
 face-0.041
adoes-0.037
ag-0.035
asuring-0.035
 structures-0.035
Token could
Token ID714
Feature activation-1.06e+00

Pos logit contributions
 face+0.006
 in+0.005
ag+0.005
 did+0.005
asuring+0.004
Neg logit contributions
 reality-0.014
rob-0.014
 therapist-0.013
 North-0.013
 fascination-0.013
Token do
Token ID466
Feature activation-7.33e-01

Pos logit contributions
 face+0.023
adoes+0.021
ag+0.019
 structures+0.019
asuring+0.018
Neg logit contributions
rob-0.038
 least-0.038
 reality-0.038
 Gray-0.037
 Tooth-0.037
Token to
Token ID284
Feature activation+4.14e-01

Pos logit contributions
 face+0.018
ag+0.017
 structures+0.017
adoes+0.017
asuring+0.016
Neg logit contributions
 North-0.023
 Gray-0.023
 Tooth-0.023
 reality-0.023
 least-0.023
Token,
Token ID11
Feature activation-1.52e-01

Pos logit contributions
adoes+0.012
 face+0.011
 structures+0.011
rac+0.011
ag+0.011
Neg logit contributions
 least-0.008
 reality-0.007
 heaven-0.007
 Gray-0.007
 wishing-0.007
Token but
Token ID475
Feature activation+7.51e-01

Pos logit contributions
adoes+0.004
 face+0.004
 structures+0.003
ag+0.003
rac+0.003
Neg logit contributions
 least-0.005
 reality-0.005
 heaven-0.005
 wishing-0.005
 Gray-0.005
Token the
Token ID262
Feature activation-4.44e-01

Pos logit contributions
 least+0.019
 reality+0.018
 wishing+0.018
 someday+0.017
 Gray+0.017
Neg logit contributions
adoes-0.024
 face-0.023
ag-0.023
 structures-0.023
ags-0.023
Token stove
Token ID27635
Feature activation-5.59e-01

Pos logit contributions
ag+0.014
agged+0.014
adoes+0.013
rac+0.013
anned+0.013
Neg logit contributions
 least-0.012
 wishing-0.011
 reality-0.011
 Tooth-0.011
 dreaming-0.011
Token just
Token ID655
Feature activation+9.86e-02

Pos logit contributions
adoes+0.016
 face+0.016
 structures+0.015
ag+0.014
rac+0.014
Neg logit contributions
 least-0.015
 reality-0.015
 heaven-0.014
 wishing-0.014
 Gray-0.014
Token wasn
Token ID2492
Feature activation+2.08e+00

Pos logit contributions
 least+0.003
 reality+0.003
 heaven+0.003
rob+0.003
 Gray+0.003
Neg logit contributions
 face-0.003
adoes-0.003
 structures-0.002
rac-0.002
asuring-0.002
Token't
Token ID470
Feature activation-5.24e-01

Pos logit contributions
 reality+0.082
 least+0.081
rob+0.080
 heaven+0.079
 Tooth+0.078
Neg logit contributions
 face-0.033
adoes-0.032
rac-0.029
ag-0.028
 structures-0.028
Token there
Token ID612
Feature activation+1.06e-01

Pos logit contributions
 face+0.012
adoes+0.012
ag+0.011
asuring+0.010
 structures+0.010
Neg logit contributions
rob-0.018
 least-0.018
 reality-0.016
 Gray-0.016
 certainty-0.016
Token.
Token ID13
Feature activation-4.76e-02

Pos logit contributions
rob+0.004
Ba+0.004
 Tooth+0.004
 Gray+0.004
Charl+0.003
Neg logit contributions
 in-0.002
 face-0.002
 about-0.002
 structures-0.002
ag-0.002
Token They
Token ID1119
Feature activation+1.79e-01

Pos logit contributions
ag+0.001
adoes+0.001
rac+0.001
 face+0.001
ags+0.001
Neg logit contributions
 least-0.001
 heaven-0.001
 reality-0.001
 wishing-0.001
 Gray-0.001
Token looked
Token ID3114
Feature activation-2.75e-01

Pos logit contributions
 least+0.005
 reality+0.005
 heaven+0.005
 Gray+0.005
 Tooth+0.005
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.005
rac-0.005
ag-0.005
Token stared
Token ID18484
Feature activation-2.06e-01

Pos logit contributions
 reality+0.012
 least+0.012
 heaven+0.011
 Tooth+0.011
rob+0.011
Neg logit contributions
 face-0.012
adoes-0.011
 structures-0.010
rac-0.010
asuring-0.010
Token at
Token ID379
Feature activation-8.93e-02

Pos logit contributions
adoes+0.004
 face+0.004
 structures+0.004
rac+0.004
ag+0.004
Neg logit contributions
 reality-0.007
 least-0.007
 wishing-0.007
 knowing-0.007
 heaven-0.007
Token the
Token ID262
Feature activation-2.23e-01

Pos logit contributions
 face+0.002
adoes+0.002
 structures+0.002
 squash+0.001
rac+0.001
Neg logit contributions
rob-0.003
 reality-0.003
 Gray-0.003
 Tooth-0.003
Ba-0.003
Token pie
Token ID2508
Feature activation-2.09e-01

Pos logit contributions
adoes+0.006
rac+0.006
ag+0.006
agged+0.006
stals+0.006
Neg logit contributions
 least-0.007
 reality-0.006
 wishing-0.006
 Tooth-0.006
 knowing-0.006
Token in
Token ID287
Feature activation-6.50e-01

Pos logit contributions
 face+0.006
adoes+0.006
 structures+0.006
asuring+0.005
ag+0.005
Neg logit contributions
 reality-0.006
 least-0.006
 heaven-0.006
 Tooth-0.005
rob-0.005
Token amaz
Token ID40642
Feature activation+2.20e+00

Pos logit contributions
 face+0.018
adoes+0.016
 structures+0.016
uting+0.014
rac+0.014
Neg logit contributions
rob-0.020
 least-0.019
 Gray-0.018
 reality-0.018
ge-0.018
Tokenement
Token ID972
Feature activation+2.47e-01

Pos logit contributions
 least+0.058
 reality+0.057
 heaven+0.054
 Gray+0.054
 wishing+0.053
Neg logit contributions
 face-0.065
adoes-0.065
 structures-0.062
ag-0.060
rac-0.059
Token.
Token ID13
Feature activation-2.24e-01

Pos logit contributions
 least+0.006
 reality+0.006
 Tooth+0.005
rob+0.005
 Gray+0.005
Neg logit contributions
 face-0.009
adoes-0.008
 structures-0.008
asuring-0.008
ag-0.008
Token He
Token ID679
Feature activation+2.06e-01

Pos logit contributions
adoes+0.006
 face+0.006
ag+0.006
 structures+0.006
rac+0.006
Neg logit contributions
 least-0.006
 Gray-0.006
 reality-0.006
 wishing-0.006
 Tooth-0.006
Token just
Token ID655
Feature activation-4.65e-01

Pos logit contributions
 least+0.005
 reality+0.005
 wishing+0.005
 Gray+0.005
 Tooth+0.005
Neg logit contributions
adoes-0.007
 face-0.006
 structures-0.006
ag-0.006
ags-0.006
Token couldn
Token ID3521
Feature activation+7.71e-02

Pos logit contributions
 face+0.012
asuring+0.010
adoes+0.010
rac+0.010
uting+0.009
Neg logit contributions
 reality-0.015
rob-0.015
 heaven-0.015
 least-0.014
 Gray-0.014
Token a
Token ID257
Feature activation+3.33e-01

Pos logit contributions
adoes+0.031
 face+0.030
 structures+0.029
rac+0.029
stals+0.028
Neg logit contributions
 reality-0.024
 least-0.024
 heaven-0.023
 Gray-0.023
 Tooth-0.022
Token voice
Token ID3809
Feature activation+5.23e-01

Pos logit contributions
 reality+0.011
 least+0.011
 wishing+0.011
 Gray+0.011
 heaven+0.011
Neg logit contributions
adoes-0.008
 face-0.007
 structures-0.007
rac-0.007
ag-0.007
Token.
Token ID13
Feature activation-5.57e-03

Pos logit contributions
 reality+0.014
 least+0.014
 Gray+0.013
 Tooth+0.013
 heaven+0.013
Neg logit contributions
 face-0.016
adoes-0.015
ag-0.015
 structures-0.014
asuring-0.014
Token It
Token ID632
Feature activation+5.78e-01

Pos logit contributions
adoes+0.000
 face+0.000
 structures+0.000
ag+0.000
asuring+0.000
Neg logit contributions
 Gray-0.000
 least-0.000
 reality-0.000
 wishing-0.000
 heaven-0.000
Token was
Token ID373
Feature activation+1.17e+00

Pos logit contributions
 least+0.019
 reality+0.019
 heaven+0.018
 Tooth+0.018
ge+0.018
Neg logit contributions
 face-0.014
adoes-0.013
ag-0.011
rac-0.011
asuring-0.011
Token one
Token ID530
Feature activation+7.68e-01

Pos logit contributions
 least+0.032
 reality+0.032
 Gray+0.030
rob+0.030
 heaven+0.030
Neg logit contributions
 face-0.034
adoes-0.033
 structures-0.031
ag-0.031
rac-0.030
Token of
Token ID286
Feature activation-1.24e-01

Pos logit contributions
 least+0.028
 reality+0.028
rob+0.027
 Gray+0.027
 heaven+0.027
Neg logit contributions
 face-0.015
adoes-0.014
 structures-0.013
ag-0.013
asuring-0.012
Token the
Token ID262
Feature activation-2.06e-01

Pos logit contributions
adoes+0.003
 face+0.003
 structures+0.003
ag+0.003
rac+0.003
Neg logit contributions
 least-0.004
 reality-0.004
 heaven-0.003
 Gray-0.003
 wishing-0.003
Token team
Token ID1074
Feature activation-1.21e+00

Pos logit contributions
adoes+0.005
 face+0.005
 structures+0.005
rac+0.005
ag+0.004
Neg logit contributions
 least-0.007
 reality-0.007
 wishing-0.006
 heaven-0.006
 Tooth-0.006
Token's
Token ID338
Feature activation+7.10e-02

Pos logit contributions
 face+0.032
adoes+0.032
 structures+0.030
ag+0.029
rac+0.029
Neg logit contributions
 least-0.036
 reality-0.035
 heaven-0.033
 Gray-0.033
 Tooth-0.033
Token moms
Token ID34015
Feature activation+1.20e-01

Pos logit contributions
 least+0.002
 reality+0.002
 heaven+0.002
 Gray+0.002
 wishing+0.002
Neg logit contributions
adoes-0.002
 face-0.002
 structures-0.002
ag-0.001
rac-0.001
Token happy
Token ID3772
Feature activation-3.31e-01

Pos logit contributions
 face+0.013
adoes+0.012
ag+0.011
 structures+0.011
asuring+0.011
Neg logit contributions
 least-0.014
 reality-0.014
rob-0.014
 heaven-0.014
 Tooth-0.014
Token and
Token ID290
Feature activation+8.12e-01

Pos logit contributions
 face+0.008
adoes+0.007
 structures+0.007
asuring+0.006
ag+0.006
Neg logit contributions
 least-0.011
 reality-0.011
rob-0.011
 Tooth-0.011
 heaven-0.011
Token felt
Token ID2936
Feature activation+6.27e-01

Pos logit contributions
 heaven+0.025
 reality+0.025
 Tooth+0.024
 least+0.024
ge+0.024
Neg logit contributions
 face-0.022
adoes-0.019
rac-0.019
ap-0.018
 structures-0.018
Token lucky
Token ID9670
Feature activation-1.86e-02

Pos logit contributions
rob+0.021
ge+0.021
 Tooth+0.020
iko+0.020
 reality+0.020
Neg logit contributions
 face-0.015
ag-0.014
 in-0.013
ap-0.012
adoes-0.012
Token to
Token ID284
Feature activation+1.52e-01

Pos logit contributions
 face+0.000
adoes+0.000
 structures+0.000
asuring+0.000
 in+0.000
Neg logit contributions
rob-0.001
 least-0.001
 heaven-0.001
 Tooth-0.001
 reality-0.001
Token have
Token ID423
Feature activation+1.56e-01

Pos logit contributions
 least+0.006
 reality+0.006
 heaven+0.005
 wishing+0.005
 Gray+0.005
Neg logit contributions
adoes-0.003
 face-0.003
 structures-0.003
rac-0.002
ag-0.002
Token such
Token ID884
Feature activation-1.06e-01

Pos logit contributions
 least+0.005
 Tooth+0.005
 reality+0.005
ge+0.005
 North+0.005
Neg logit contributions
 face-0.004
adoes-0.004
ag-0.004
 structures-0.004
asuring-0.003
Token great
Token ID1049
Feature activation+1.26e-01

Pos logit contributions
 face+0.003
adoes+0.003
 structures+0.002
asuring+0.002
ag+0.002
Neg logit contributions
rob-0.003
 least-0.003
ge-0.003
 Gray-0.003
 Tooth-0.003
Token friends
Token ID2460
Feature activation-3.58e-01

Pos logit contributions
 least+0.005
 reality+0.005
 heaven+0.005
 wishing+0.005
 Gray+0.005
Neg logit contributions
adoes-0.002
 face-0.002
 structures-0.002
rac-0.002
ag-0.002
Token to
Token ID284
Feature activation+5.65e-03

Pos logit contributions
 face+0.010
adoes+0.010
asuring+0.009
ag+0.009
 structures+0.009
Neg logit contributions
 reality-0.010
 least-0.010
 Tooth-0.009
 Gray-0.009
 heaven-0.009
Token play
Token ID711
Feature activation+7.77e-02

Pos logit contributions
 least+0.000
 reality+0.000
 heaven+0.000
 Gray+0.000
 Tooth+0.000
Neg logit contributions
 face-0.000
adoes-0.000
 structures-0.000
ag-0.000
rac-0.000
Token the
Token ID262
Feature activation+4.35e-02

Pos logit contributions
 least+0.052
 Tooth+0.050
ited+0.048
rob+0.048
 North+0.047
Neg logit contributions
 face-0.035
 structures-0.035
adoes-0.035
ag-0.031
 in-0.031
Token scenes
Token ID8188
Feature activation+1.11e+00

Pos logit contributions
 least+0.001
 reality+0.001
 heaven+0.001
 Gray+0.001
 Tooth+0.001
Neg logit contributions
 face-0.001
adoes-0.001
 structures-0.001
ag-0.001
rac-0.001
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
rob+0.037
 Tooth+0.036
 least+0.036
 North+0.035
 reality+0.034
Neg logit contributions
 face-0.029
 structures-0.025
 in-0.025
adoes-0.025
ag-0.025
Token
Token ID198
Feature activation-1.89e-02

Pos logit contributions
 face+0.004
 structures+0.003
adoes+0.003
 Toys+0.003
rac+0.003
Neg logit contributions
 reality-0.005
 least-0.005
ge-0.005
rob-0.005
 wishing-0.005
Token
Token ID198
Feature activation+1.32e-01

Pos logit contributions
 face+0.000
ag+0.000
<+0.000
rac+0.000
ggies+0.000
Neg logit contributions
 least-0.001
 reality-0.001
 Tooth-0.001
 wishing-0.001
 heaven-0.001
TokenAnna
Token ID31160
Feature activation+3.57e-01

Pos logit contributions
 least+0.004
 reality+0.004
 heaven+0.004
 Gray+0.004
 Tooth+0.003
Neg logit contributions
adoes-0.004
 face-0.004
 structures-0.003
ag-0.003
rac-0.003
Token's
Token ID338
Feature activation-2.73e-01

Pos logit contributions
 reality+0.011
 least+0.010
 heaven+0.010
 Tooth+0.010
 Gray+0.010
Neg logit contributions
 face-0.009
adoes-0.009
rac-0.009
 structures-0.009
asuring-0.008
Token friends
Token ID2460
Feature activation+7.62e-02

Pos logit contributions
adoes+0.006
 face+0.006
 structures+0.006
ag+0.005
rac+0.005
Neg logit contributions
 least-0.009
 reality-0.009
 heaven-0.009
 wishing-0.009
 Gray-0.009
Token practiced
Token ID19893
Feature activation+5.42e-01

Pos logit contributions
 reality+0.002
 Tooth+0.002
 least+0.002
 heaven+0.002
rob+0.002
Neg logit contributions
 face-0.002
adoes-0.002
 structures-0.002
asuring-0.002
ag-0.002
Token with
Token ID351
Feature activation+8.36e-01

Pos logit contributions
 Tooth+0.019
 least+0.019
isc+0.017
 reality+0.017
 North+0.017
Neg logit contributions
 face-0.012
 structures-0.012
adoes-0.012
ap-0.011
ag-0.011
Token her
Token ID607
Feature activation+3.75e-01

Pos logit contributions
 least+0.031
rob+0.030
 Tooth+0.030
 North+0.030
ge+0.029
Neg logit contributions
 face-0.018
 structures-0.017
adoes-0.017
 jokes-0.015
ag-0.014
Token and
Token ID290
Feature activation+4.02e-01

Pos logit contributions
 certainty+0.035
 section+0.035
ited+0.034
rob+0.033
 reality+0.033
Neg logit contributions
 face-0.033
adoes-0.033
 in-0.028
ap-0.027
asuring-0.027
Token he
Token ID339
Feature activation+6.66e-01

Pos logit contributions
 least+0.011
 reality+0.011
rob+0.010
ge+0.010
 heaven+0.010
Neg logit contributions
 face-0.012
adoes-0.012
 structures-0.010
rac-0.010
asuring-0.010
Token could
Token ID714
Feature activation-8.78e-01

Pos logit contributions
 reality+0.024
rob+0.024
 outer+0.024
ge+0.024
och+0.024
Neg logit contributions
 face-0.015
 in-0.010
rac-0.010
adoes-0.010
 jokes-0.010
Token run
Token ID1057
Feature activation+2.29e-01

Pos logit contributions
 face+0.021
adoes+0.018
ag+0.017
ap+0.017
rac+0.016
Neg logit contributions
 reality-0.030
rob-0.030
 least-0.029
 heaven-0.028
 Tooth-0.028
Token and
Token ID290
Feature activation+4.27e-01

Pos logit contributions
 Tooth+0.009
 certainty+0.008
 reality+0.008
 least+0.008
 supervision+0.008
Neg logit contributions
 face-0.005
ag-0.004
 in-0.004
adoes-0.004
 structures-0.004
Token play
Token ID711
Feature activation+1.79e-01

Pos logit contributions
 reality+0.014
 least+0.013
rob+0.013
 heaven+0.013
 Tooth+0.013
Neg logit contributions
 face-0.011
adoes-0.010
 structures-0.009
rac-0.009
ag-0.009
Token again
Token ID757
Feature activation-3.83e-02

Pos logit contributions
 least+0.006
 reality+0.006
 heaven+0.006
 Gray+0.006
 wishing+0.006
Neg logit contributions
 face-0.004
adoes-0.004
 structures-0.004
ag-0.004
rac-0.004
Token soon
Token ID2582
Feature activation-1.81e-02

Pos logit contributions
 in+0.001
 face+0.001
adoes+0.001
ap+0.001
 structures+0.001
Neg logit contributions
 certainty-0.001
rob-0.001
isc-0.001
 therapist-0.001
ited-0.001
Token.
Token ID13
Feature activation-2.91e-01

Pos logit contributions
 face+0.000
 in+0.000
adoes+0.000
 structures+0.000
ap+0.000
Neg logit contributions
rob-0.001
 certainty-0.001
 least-0.001
 Tooth-0.001
Charl-0.001
Token<|endoftext|>
Token ID50256
Feature activation-1.03e-01

Pos logit contributions
adoes+0.009
 face+0.008
 structures+0.008
ag+0.008
rac+0.008
Neg logit contributions
 least-0.008
 reality-0.008
 Gray-0.007
 heaven-0.007
 wishing-0.007
TokenOnce
Token ID7454
Feature activation-5.19e-01

Pos logit contributions
 face+0.002
adoes+0.002
ag+0.002
 structures+0.002
uting+0.002
Neg logit contributions
 reality-0.004
 least-0.003
 wishing-0.003
 heaven-0.003
 certainty-0.003
Token from
Token ID422
Feature activation-8.66e-01

Pos logit contributions
 reality+0.004
 least+0.004
 Tooth+0.004
 wishing+0.004
 heaven+0.004
Neg logit contributions
 face-0.003
adoes-0.003
 structures-0.003
ag-0.003
asuring-0.003
Token Jill
Token ID22772
Feature activation-1.40e-01

Pos logit contributions
 face+0.018
 structures+0.017
adoes+0.017
 squash+0.015
ag+0.015
Neg logit contributions
rob-0.030
 certainty-0.029
 least-0.029
 Gray-0.029
Charl-0.028
Token and
Token ID290
Feature activation+9.58e-03

Pos logit contributions
 face+0.003
adoes+0.003
 structures+0.003
ag+0.003
asuring+0.003
Neg logit contributions
 reality-0.005
rob-0.005
 least-0.005
 certainty-0.004
 Tooth-0.004
Token Jack
Token ID3619
Feature activation+3.74e-01

Pos logit contributions
 least+0.000
 reality+0.000
rob+0.000
ge+0.000
 wishing+0.000
Neg logit contributions
 face-0.000
adoes-0.000
 structures-0.000
rac-0.000
asuring-0.000
Token.
Token ID13
Feature activation+4.31e-02

Pos logit contributions
 reality+0.013
rob+0.013
 Tooth+0.013
 North+0.012
 least+0.012
Neg logit contributions
 face-0.009
 in-0.008
ag-0.008
asuring-0.008
ap-0.007
Token He
Token ID679
Feature activation+1.29e-01

Pos logit contributions
 least+0.001
 reality+0.001
 wishing+0.001
 heaven+0.001
 Gray+0.001
Neg logit contributions
adoes-0.001
 face-0.001
 structures-0.001
ag-0.001
rac-0.001
Token said
Token ID531
Feature activation+1.75e-01

Pos logit contributions
 least+0.003
 reality+0.002
 Gray+0.002
 wishing+0.002
 Tooth+0.002
Neg logit contributions
adoes-0.005
 face-0.005
 structures-0.005
ags-0.005
ag-0.005
Token he
Token ID339
Feature activation-4.44e-02

Pos logit contributions
 reality+0.003
 someday+0.003
 she+0.003
 wishing+0.003
 least+0.003
Neg logit contributions
adoes-0.007
ags-0.007
uting-0.007
 face-0.007
ag-0.006
Token would
Token ID561
Feature activation-9.62e-01

Pos logit contributions
 face+0.001
adoes+0.001
 structures+0.001
asuring+0.001
rac+0.001
Neg logit contributions
 reality-0.001
 least-0.001
 heaven-0.001
 Tooth-0.001
rob-0.001
Token give
Token ID1577
Feature activation+5.38e-02

Pos logit contributions
 face+0.025
adoes+0.023
 structures+0.021
ag+0.021
rac+0.021
Neg logit contributions
 reality-0.030
 least-0.030
rob-0.029
 Gray-0.029
 Tooth-0.028
Token them
Token ID606
Feature activation-3.83e-01

Pos logit contributions
 least+0.002
 reality+0.002
 heaven+0.002
 Gray+0.002
 Tooth+0.002
Neg logit contributions
adoes-0.001
 face-0.001
 structures-0.001
rac-0.001
asuring-0.001
Token a
Token ID257
Feature activation-4.58e-01

Pos logit contributions
adoes+0.016
 face+0.015
 structures+0.015
stals+0.014
rac+0.014
Neg logit contributions
 least-0.012
 reality-0.011
 heaven-0.011
 wishing-0.011
 Tooth-0.011
Token big
Token ID1263
Feature activation+2.99e-01

Pos logit contributions
 face+0.012
adoes+0.010
 structures+0.010
 squash+0.009
ag+0.009
Neg logit contributions
 least-0.015
rob-0.014
 reality-0.014
 heaven-0.014
 someday-0.014
Token bag
Token ID6131
Feature activation-1.08e+00

Pos logit contributions
 least+0.009
 reality+0.009
 heaven+0.008
 Gray+0.008
 wishing+0.008
Neg logit contributions
 face-0.008
adoes-0.008
 structures-0.007
ag-0.007
rac-0.007
Token of
Token ID286
Feature activation+4.48e-01

Pos logit contributions
adoes+0.020
 face+0.019
 structures+0.018
rac+0.017
stals+0.017
Neg logit contributions
 least-0.040
 reality-0.040
 heaven-0.039
 wishing-0.038
 Tooth-0.038
Token candy
Token ID18550
Feature activation+1.95e-01

Pos logit contributions
 least+0.014
 reality+0.014
ge+0.013
 someday+0.013
 anytime+0.013
Neg logit contributions
 face-0.013
adoes-0.010
 squash-0.010
ag-0.010
 structures-0.010
Token from
Token ID422
Feature activation-7.62e-01

Pos logit contributions
 reality+0.008
 certainty+0.008
 least+0.008
 North+0.008
 fascination+0.007
Neg logit contributions
 face-0.004
 in-0.003
adoes-0.003
asuring-0.003
 squash-0.003
Token her
Token ID607
Feature activation+6.62e-01

Pos logit contributions
 face+0.023
adoes+0.021
 structures+0.021
 in+0.021
ags+0.021
Neg logit contributions
 lifetime-0.022
 reality-0.021
odic-0.020
rob-0.020
isc-0.019
Token mom
Token ID1995
Feature activation+5.46e-02

Pos logit contributions
rob+0.032
ge+0.030
ited+0.029
 reality+0.029
 possible+0.029
Neg logit contributions
 face-0.012
 squash-0.007
 hands-0.006
 structures-0.006
 Toys-0.006
Token.
Token ID13
Feature activation-8.20e-02

Pos logit contributions
 reality+0.002
rob+0.002
 certainty+0.002
 least+0.002
 wishing+0.002
Neg logit contributions
 face-0.001
adoes-0.001
ag-0.001
 structures-0.001
asuring-0.001
Token She
Token ID1375
Feature activation+3.29e-01

Pos logit contributions
adoes+0.003
 structures+0.003
ag+0.003
asuring+0.003
 face+0.003
Neg logit contributions
 Gray-0.002
 heaven-0.002
 anytime-0.002
 least-0.002
 Tooth-0.002
Token was
Token ID373
Feature activation-3.75e-01

Pos logit contributions
 least+0.009
 reality+0.009
 heaven+0.008
 Gray+0.008
 wishing+0.008
Neg logit contributions
adoes-0.009
 face-0.009
 structures-0.009
ag-0.009
rac-0.009
Token children
Token ID1751
Feature activation+9.88e-02

Pos logit contributions
rob+0.007
 least+0.007
 reality+0.007
ge+0.006
 North+0.006
Neg logit contributions
 face-0.005
 structures-0.004
adoes-0.004
 squash-0.004
asuring-0.003
Token.
Token ID13
Feature activation-6.82e-02

Pos logit contributions
 reality+0.002
 least+0.002
rob+0.002
 Tooth+0.002
 Gray+0.002
Neg logit contributions
 face-0.003
adoes-0.003
 structures-0.003
asuring-0.003
ag-0.003
Token The
Token ID383
Feature activation+3.02e-03

Pos logit contributions
 face+0.001
adoes+0.001
 structures+0.001
rac+0.001
stals+0.001
Neg logit contributions
 reality-0.003
ge-0.002
 least-0.002
 heaven-0.002
 wishing-0.002
Token end
Token ID886
Feature activation+1.12e+00

Pos logit contributions
 least+0.000
 reality+0.000
 heaven+0.000
 Gray+0.000
 wishing+0.000
Neg logit contributions
 face-0.000
adoes-0.000
 structures-0.000
ag-0.000
rac-0.000
Token.
Token ID13
Feature activation-5.95e-02

Pos logit contributions
 least+0.024
 reality+0.024
 wishing+0.022
rob+0.022
 heaven+0.022
Neg logit contributions
 face-0.039
adoes-0.038
 structures-0.037
ag-0.036
asuring-0.035
Token<|endoftext|>
Token ID50256
Feature activation-1.69e-01

Pos logit contributions
 face+0.001
rac+0.001
 Toys+0.001
adoes+0.001
stals+0.001
Neg logit contributions
ge-0.002
 reality-0.002
 least-0.002
 wishing-0.002
ited-0.002
TokenOne
Token ID3198
Feature activation+7.04e-01

Pos logit contributions
 face+0.004
adoes+0.004
 structures+0.004
ag+0.004
rac+0.004
Neg logit contributions
 reality-0.005
 least-0.005
 wishing-0.005
 heaven-0.005
ge-0.005
Token day
Token ID1110
Feature activation-8.04e-01

Pos logit contributions
 least+0.020
 reality+0.020
 Gray+0.019
 heaven+0.019
rob+0.019
Neg logit contributions
 face-0.020
adoes-0.019
 structures-0.018
ag-0.018
rac-0.018
Token,
Token ID11
Feature activation-4.84e-01

Pos logit contributions
adoes+0.029
 structures+0.028
 face+0.027
ags+0.026
ag+0.026
Neg logit contributions
 Gray-0.016
 least-0.016
 reality-0.016
 wishing-0.016
 dreaming-0.015
Token a
Token ID257
Feature activation+2.72e-01

Pos logit contributions
adoes+0.018
 face+0.017
 structures+0.017
stals+0.016
ags+0.016
Neg logit contributions
 least-0.009
 wishing-0.009
 Gray-0.009
 reality-0.009
 heaven-0.009
Token little
Token ID1310
Feature activation-4.32e-01

Pos logit contributions
 least+0.010
 reality+0.010
 Gray+0.009
 wishing+0.009
 Tooth+0.009
Neg logit contributions
adoes-0.006
 face-0.005
 structures-0.005
rac-0.005
ag-0.005
TokenOnce
Token ID7454
Feature activation-4.48e-01

Pos logit contributions
 face+0.002
adoes+0.002
 structures+0.002
ag+0.002
rac+0.002
Neg logit contributions
 least-0.002
 reality-0.002
 heaven-0.002
 wishing-0.002
 Tooth-0.002
Token upon
Token ID2402
Feature activation+1.77e-01

Pos logit contributions
 face+0.010
adoes+0.010
 structures+0.009
ag+0.009
rac+0.009
Neg logit contributions
 least-0.015
 reality-0.015
 heaven-0.014
 Tooth-0.014
rob-0.014
Token a
Token ID257
Feature activation+1.04e+00

Pos logit contributions
 least+0.005
 reality+0.005
rob+0.005
 Gray+0.005
ge+0.005
Neg logit contributions
 face-0.005
adoes-0.004
ag-0.004
 structures-0.004
asuring-0.004
Token time
Token ID640
Feature activation-8.95e-01

Pos logit contributions
 least+0.040
 reality+0.040
 heaven+0.039
 wishing+0.038
 Tooth+0.038
Neg logit contributions
adoes-0.019
 face-0.017
 structures-0.016
rac-0.015
stals-0.015
Token,
Token ID11
Feature activation-1.63e-01

Pos logit contributions
adoes+0.030
 face+0.030
 structures+0.028
ags+0.028
ag+0.027
Neg logit contributions
 least-0.020
 reality-0.019
 anytime-0.019
 Gray-0.019
 someday-0.019
Token there
Token ID612
Feature activation-3.74e-01

Pos logit contributions
adoes+0.005
 face+0.005
 structures+0.005
 aids+0.004
ags+0.004
Neg logit contributions
 she-0.004
 least-0.004
 someday-0.004
 wishing-0.004
 anytime-0.004
Token was
Token ID373
Feature activation+1.05e+00

Pos logit contributions
 face+0.009
adoes+0.009
 structures+0.008
rac+0.008
ag+0.008
Neg logit contributions
 reality-0.012
 least-0.012
 heaven-0.011
 Tooth-0.011
 Gray-0.011
Token a
Token ID257
Feature activation+1.40e-01

Pos logit contributions
 least+0.029
 reality+0.028
ge+0.026
 Gray+0.026
 wishing+0.026
Neg logit contributions
 face-0.030
adoes-0.030
 structures-0.028
ag-0.027
asuring-0.027
Token little
Token ID1310
Feature activation-3.03e-01

Pos logit contributions
 reality+0.005
 least+0.005
 Gray+0.005
 wishing+0.005
 Tooth+0.004
Neg logit contributions
adoes-0.003
 face-0.003
 structures-0.003
ag-0.003
rac-0.003
Token girl
Token ID2576
Feature activation+1.05e-01

Pos logit contributions
adoes+0.009
 face+0.009
 structures+0.009
rac+0.009
ag+0.008
Neg logit contributions
 least-0.008
 reality-0.008
 Gray-0.007
 heaven-0.007
 Tooth-0.007
Token named
Token ID3706
Feature activation-8.42e-02

Pos logit contributions
 reality+0.003
 least+0.003
rob+0.003
 Gray+0.003
ge+0.003
Neg logit contributions
 face-0.003
adoes-0.002
 structures-0.002
ag-0.002
ap-0.002
Token Tom
Token ID4186
Feature activation+3.31e-01

Pos logit contributions
 face+0.003
adoes+0.003
 structures+0.003
ag+0.003
rac+0.003
Neg logit contributions
 reality-0.005
 least-0.005
 wishing-0.005
rob-0.005
ge-0.005
Token learned
Token ID4499
Feature activation+3.95e-01

Pos logit contributions
 reality+0.010
rob+0.010
 least+0.010
 Gray+0.010
 heaven+0.010
Neg logit contributions
 face-0.009
adoes-0.008
rac-0.008
asuring-0.007
ag-0.007
Token their
Token ID511
Feature activation-4.47e-01

Pos logit contributions
 least+0.013
 reality+0.013
rob+0.013
ge+0.013
ited+0.013
Neg logit contributions
 face-0.009
adoes-0.009
 structures-0.008
ag-0.008
asuring-0.008
Token lesson
Token ID11483
Feature activation-4.01e-03

Pos logit contributions
 face+0.010
adoes+0.010
 structures+0.009
ag+0.009
rac+0.009
Neg logit contributions
 least-0.015
 reality-0.015
 heaven-0.015
 Gray-0.014
 wishing-0.014
Token.
Token ID13
Feature activation+3.84e-01

Pos logit contributions
 face+0.000
adoes+0.000
 structures+0.000
ag+0.000
asuring+0.000
Neg logit contributions
 reality-0.000
 least-0.000
rob-0.000
 Tooth-0.000
 Gray-0.000
Token They
Token ID1119
Feature activation-2.65e-02

Pos logit contributions
rob+0.019
 least+0.018
 reality+0.018
ge+0.018
ited+0.018
Neg logit contributions
adoes-0.004
 face-0.004
 structures-0.003
 Toys-0.003
stals-0.003
Token learned
Token ID4499
Feature activation+1.67e-01

Pos logit contributions
 face+0.001
asuring+0.001
adoes+0.001
rac+0.001
ag+0.001
Neg logit contributions
 reality-0.001
rob-0.001
ge-0.001
 heaven-0.001
 Gray-0.001
Token that
Token ID326
Feature activation-8.18e-01

Pos logit contributions
 least+0.005
 reality+0.005
 heaven+0.005
 Gray+0.005
 wishing+0.005
Neg logit contributions
 face-0.004
adoes-0.004
 structures-0.004
ag-0.004
rac-0.004
Token rain
Token ID6290
Feature activation+1.36e+00

Pos logit contributions
 face+0.022
adoes+0.022
 structures+0.021
ag+0.020
rac+0.020
Neg logit contributions
 least-0.024
 reality-0.023
 heaven-0.022
 Gray-0.022
 wishing-0.022
Token can
Token ID460
Feature activation+1.78e-01

Pos logit contributions
 least+0.034
 anytime+0.031
 someday+0.031
 Gray+0.030
 wishing+0.029
Neg logit contributions
 face-0.042
rac-0.039
adoes-0.038
 structures-0.038
ophers-0.038
Token be
Token ID307
Feature activation+1.86e-02

Pos logit contributions
 least+0.006
 reality+0.006
 wishing+0.005
 someday+0.005
 Gray+0.005
Neg logit contributions
adoes-0.004
 face-0.004
 structures-0.004
agged-0.004
rac-0.004
Token should
Token ID815
Feature activation-1.31e-01

Pos logit contributions
 reality+0.011
 least+0.011
 heaven+0.010
 someday+0.010
 Gray+0.010
Neg logit contributions
adoes-0.011
agged-0.010
 face-0.010
ags-0.010
ag-0.010
Token we
Token ID356
Feature activation+2.10e-01

Pos logit contributions
adoes+0.004
ag+0.004
agged+0.004
ags+0.004
 face+0.004
Neg logit contributions
 least-0.004
 reality-0.003
 she-0.003
 heaven-0.003
 someday-0.003
Token do
Token ID466
Feature activation-1.09e+00

Pos logit contributions
 least+0.006
 reality+0.005
 wishing+0.005
 someday+0.005
 hope+0.005
Neg logit contributions
adoes-0.007
agged-0.007
ggies-0.006
 structures-0.006
 aids-0.006
Token with
Token ID351
Feature activation+2.20e-01

Pos logit contributions
adoes+0.035
 face+0.032
agged+0.032
 structures+0.031
ag+0.030
Neg logit contributions
 least-0.030
 someday-0.029
 anytime-0.028
 reality-0.028
 dreaming-0.026
Token the
Token ID262
Feature activation+6.44e-02

Pos logit contributions
 least+0.006
 reality+0.006
 anytime+0.005
 heaven+0.005
 someday+0.005
Neg logit contributions
adoes-0.007
 face-0.006
agged-0.006
ag-0.006
 structures-0.006
Token iron
Token ID6953
Feature activation-3.37e-02

Pos logit contributions
 least+0.002
 reality+0.002
 wishing+0.002
 Tooth+0.002
 heaven+0.002
Neg logit contributions
adoes-0.002
 face-0.002
 structures-0.001
rac-0.001
ag-0.001
Token?"
Token ID1701
Feature activation-1.26e-01

Pos logit contributions
adoes+0.001
 face+0.001
 structures+0.001
ag+0.001
agged+0.001
Neg logit contributions
 least-0.001
 reality-0.001
 someday-0.001
 Tooth-0.001
 anytime-0.001
Token Jane
Token ID12091
Feature activation-3.86e-01

Pos logit contributions
adoes+0.004
 face+0.004
 structures+0.003
ag+0.003
stals+0.003
Neg logit contributions
 least-0.003
 reality-0.003
 wishing-0.003
 heaven-0.003
 Gray-0.003
Token asked
Token ID1965
Feature activation+1.06e-01

Pos logit contributions
 face+0.012
adoes+0.011
rac+0.011
asuring+0.011
 structures+0.011
Neg logit contributions
 reality-0.010
 least-0.010
rob-0.009
 heaven-0.009
 Gray-0.009
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
 least+0.003
 reality+0.003
rob+0.002
 North+0.002
 Tooth+0.002
Neg logit contributions
 face-0.004
adoes-0.003
 structures-0.003
rac-0.003
ag-0.003
Token
Token ID198
Feature activation-2.55e-03

Pos logit contributions
 least+0.006
 reality+0.006
 Gray+0.006
 heaven+0.006
 wishing+0.006
Neg logit contributions
adoes-0.007
 face-0.007
 structures-0.007
ag-0.007
rac-0.007
Token to
Token ID284
Feature activation+2.21e-01

Pos logit contributions
 face+0.013
adoes+0.013
 structures+0.012
ag+0.011
rac+0.011
Neg logit contributions
 least-0.022
 reality-0.022
 heaven-0.021
 Gray-0.021
 wishing-0.021
Token the
Token ID262
Feature activation-5.59e-01

Pos logit contributions
rob+0.008
 least+0.008
ge+0.008
och+0.008
 Tooth+0.008
Neg logit contributions
 face-0.005
 structures-0.004
adoes-0.004
 jokes-0.004
 pitch-0.004
Token beautiful
Token ID4950
Feature activation-2.86e-02

Pos logit contributions
 face+0.015
adoes+0.014
 structures+0.013
ag+0.013
asuring+0.012
Neg logit contributions
 least-0.017
 reality-0.017
rob-0.016
ge-0.016
 heaven-0.016
Token poses
Token ID17313
Feature activation-1.23e-01

Pos logit contributions
 face+0.001
adoes+0.001
 structures+0.001
ag+0.001
rac+0.001
Neg logit contributions
 least-0.001
 reality-0.001
 heaven-0.001
 Gray-0.001
 wishing-0.001
Token she
Token ID673
Feature activation+4.06e-02

Pos logit contributions
 face+0.003
adoes+0.003
asuring+0.003
ag+0.003
 structures+0.003
Neg logit contributions
 least-0.004
 reality-0.004
rob-0.003
 Tooth-0.003
 Gray-0.003
Token saw
Token ID2497
Feature activation-1.97e+00

Pos logit contributions
 reality+0.001
ge+0.001
 heaven+0.001
rob+0.001
 least+0.001
Neg logit contributions
 face-0.001
ag-0.001
rac-0.001
asuring-0.001
 structures-0.001
Token.
Token ID13
Feature activation-4.41e-02

Pos logit contributions
 face+0.057
adoes+0.054
 structures+0.051
ag+0.050
asuring+0.050
Neg logit contributions
 least-0.057
 reality-0.057
 Gray-0.053
rob-0.052
 Tooth-0.052
Token
Token ID198
Feature activation-1.89e-01

Pos logit contributions
adoes+0.002
ag+0.001
 structures+0.001
stals+0.001
 face+0.001
Neg logit contributions
 Gray-0.001
 wishing-0.001
 she-0.001
 least-0.001
 anytime-0.001
Token
Token ID198
Feature activation-8.32e-02

Pos logit contributions
adoes+0.006
 face+0.006
 structures+0.006
stals+0.006
ag+0.006
Neg logit contributions
 least-0.004
 reality-0.004
 Gray-0.004
 heaven-0.004
 anytime-0.004
TokenHer
Token ID9360
Feature activation-1.43e-01

Pos logit contributions
adoes+0.003
stals+0.003
ag+0.003
ags+0.003
 structures+0.003
Neg logit contributions
 she-0.002
 someday-0.002
 anytime-0.002
 heaven-0.002
 soon-0.002
Token Mom
Token ID11254
Feature activation+4.43e-01

Pos logit contributions
adoes+0.003
stals+0.003
 structures+0.003
ag+0.003
uting+0.003
Neg logit contributions
 reality-0.005
 least-0.005
 Gray-0.004
 wishing-0.004
 Tooth-0.004
Token.
Token ID13
Feature activation+2.67e-02

Pos logit contributions
 face+0.003
ag+0.003
 structures+0.003
adoes+0.003
ap+0.003
Neg logit contributions
 reality-0.003
 Tooth-0.003
 least-0.003
 certainty-0.003
 wishing-0.003
Token They
Token ID1119
Feature activation+4.42e-01

Pos logit contributions
 least+0.001
 anytime+0.001
 heaven+0.001
 someday+0.001
 Gray+0.001
Neg logit contributions
adoes-0.001
rac-0.001
 face-0.001
ag-0.001
asuring-0.001
Token have
Token ID423
Feature activation-2.14e-01

Pos logit contributions
 least+0.015
 reality+0.014
 Gray+0.014
 wishing+0.014
 heaven+0.014
Neg logit contributions
adoes-0.010
 face-0.010
 structures-0.009
rac-0.009
ag-0.009
Token a
Token ID257
Feature activation-2.41e-01

Pos logit contributions
adoes+0.007
 face+0.007
 structures+0.006
rac+0.006
ag+0.006
Neg logit contributions
 least-0.005
 reality-0.005
 heaven-0.005
 wishing-0.005
 Gray-0.005
Token big
Token ID1263
Feature activation+2.77e-01

Pos logit contributions
adoes+0.006
 face+0.005
 structures+0.005
rac+0.005
ag+0.005
Neg logit contributions
 reality-0.008
 least-0.008
 wishing-0.007
 Gray-0.007
 heaven-0.007
Token field
Token ID2214
Feature activation-1.85e+00

Pos logit contributions
 least+0.009
 reality+0.009
 wishing+0.009
 Gray+0.009
 heaven+0.008
Neg logit contributions
adoes-0.007
rac-0.006
uting-0.006
asuring-0.006
anned-0.006
Token near
Token ID1474
Feature activation-2.49e-01

Pos logit contributions
adoes+0.054
 face+0.049
stals+0.048
rac+0.047
 structures+0.047
Neg logit contributions
 least-0.051
 reality-0.049
 heaven-0.049
 someday-0.049
 anytime-0.048
Token their
Token ID511
Feature activation+2.41e-01

Pos logit contributions
 face+0.006
 structures+0.006
adoes+0.006
rac+0.005
ag+0.005
Neg logit contributions
 reality-0.008
 least-0.008
rob-0.008
 Tooth-0.008
 wishing-0.008
Token house
Token ID2156
Feature activation-8.82e-01

Pos logit contributions
 reality+0.008
 least+0.008
 Tooth+0.008
 heaven+0.008
 Gray+0.008
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.005
rac-0.005
ag-0.005
Token.
Token ID13
Feature activation-2.86e-01

Pos logit contributions
 face+0.028
adoes+0.027
 structures+0.026
ap+0.025
asuring+0.025
Neg logit contributions
 reality-0.023
 least-0.022
 Tooth-0.022
rob-0.022
 Gray-0.022
Token They
Token ID1119
Feature activation+5.23e-02

Pos logit contributions
adoes+0.011
ag+0.010
 face+0.010
rac+0.010
 structures+0.009
Neg logit contributions
 anytime-0.006
 Gray-0.006
 least-0.006
 heaven-0.006
 someday-0.006
Token notice
Token ID4003
Feature activation-1.67e+00

Pos logit contributions
adoes+0.042
 structures+0.038
rac+0.038
stals+0.038
agged+0.038
Neg logit contributions
 least-0.029
 someday-0.027
 reality-0.026
 knowing-0.026
 wishing-0.026
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
adoes+0.037
 face+0.036
 structures+0.032
asuring+0.030
 in+0.030
Neg logit contributions
rob-0.062
tail-0.058
 Tooth-0.057
 Gray-0.056
 reality-0.055
Token They
Token ID1119
Feature activation-5.25e-01

Pos logit contributions
adoes+0.004
 face+0.004
ag+0.004
 structures+0.004
asuring+0.003
Neg logit contributions
 least-0.005
 heaven-0.005
 Gray-0.005
 reality-0.005
 wishing-0.005
Token dragged
Token ID17901
Feature activation-1.70e-01

Pos logit contributions
 face+0.014
adoes+0.013
 structures+0.013
rac+0.012
ag+0.012
Neg logit contributions
 reality-0.016
 least-0.015
 heaven-0.015
 Tooth-0.015
 Gray-0.015
Token the
Token ID262
Feature activation-9.85e-01

Pos logit contributions
adoes+0.005
 face+0.005
stals+0.005
asuring+0.005
 structures+0.005
Neg logit contributions
 least-0.004
 heaven-0.004
 Gray-0.004
 anytime-0.004
 reality-0.004
Token kay
Token ID34681
Feature activation-1.85e+00

Pos logit contributions
adoes+0.034
asuring+0.031
agged+0.030
anish+0.030
 structures+0.030
Neg logit contributions
 least-0.024
 wishing-0.023
 North-0.023
 reality-0.023
ge-0.021
Tokenak
Token ID461
Feature activation-1.56e+00

Pos logit contributions
adoes+0.050
 face+0.048
 structures+0.043
stals+0.041
 squash+0.040
Neg logit contributions
 least-0.057
rob-0.057
ge-0.055
 anytime-0.054
 reality-0.052
Token to
Token ID284
Feature activation-2.31e-01

Pos logit contributions
 face+0.031
adoes+0.029
 structures+0.027
ag+0.026
rac+0.025
Neg logit contributions
 reality-0.059
 least-0.058
 Tooth-0.057
rob-0.056
 heaven-0.056
Token the
Token ID262
Feature activation-7.22e-01

Pos logit contributions
 face+0.005
adoes+0.005
 structures+0.005
ag+0.004
rac+0.004
Neg logit contributions
 reality-0.008
rob-0.008
 least-0.008
 Tooth-0.008
 Gray-0.008
Token shore
Token ID15191
Feature activation-7.84e-02

Pos logit contributions
adoes+0.020
 face+0.018
 structures+0.018
ag+0.018
rac+0.018
Neg logit contributions
 least-0.022
 reality-0.021
 wishing-0.020
 North-0.020
 Gray-0.020
Token and
Token ID290
Feature activation-1.15e+00

Pos logit contributions
 face+0.002
adoes+0.002
 structures+0.002
ap+0.001
asuring+0.001
Neg logit contributions
 reality-0.003
 Tooth-0.003
rob-0.003
 least-0.003
 Gray-0.003
Token on
Token ID319
Feature activation+1.58e-01

Pos logit contributions
 face+0.022
adoes+0.022
 structures+0.019
ag+0.018
rac+0.018
Neg logit contributions
 least-0.052
 reality-0.052
 heaven-0.050
 Gray-0.050
 wishing-0.050
Token a
Token ID257
Feature activation+1.18e+00

Pos logit contributions
 reality+0.005
ge+0.005
 least+0.005
rob+0.005
 certainty+0.005
Neg logit contributions
 face-0.004
adoes-0.004
 structures-0.003
 squash-0.003
 pitch-0.003
Token branch
Token ID8478
Feature activation-2.33e-01

Pos logit contributions
 least+0.032
 reality+0.031
 heaven+0.029
rob+0.029
 Gray+0.029
Neg logit contributions
 face-0.035
adoes-0.033
 structures-0.032
ag-0.032
rac-0.031
Token,
Token ID11
Feature activation+7.02e-01

Pos logit contributions
 face+0.006
adoes+0.006
ag+0.006
 structures+0.005
asuring+0.005
Neg logit contributions
 reality-0.007
 least-0.007
rob-0.007
 Tooth-0.007
 Gray-0.007
Token ch
Token ID442
Feature activation-2.79e-01

Pos logit contributions
 least+0.018
 reality+0.018
 heaven+0.017
 Gray+0.017
 wishing+0.017
Neg logit contributions
 face-0.021
adoes-0.021
 structures-0.020
ag-0.019
rac-0.019
Tokenir
Token ID343
Feature activation-2.20e+00

Pos logit contributions
adoes+0.008
 face+0.008
 structures+0.008
asuring+0.007
stals+0.007
Neg logit contributions
 reality-0.007
 least-0.007
ge-0.007
 certainty-0.007
 heaven-0.007
Tokenping
Token ID13886
Feature activation-1.27e+00

Pos logit contributions
 face+0.043
adoes+0.043
 structures+0.038
ag+0.038
rac+0.036
Neg logit contributions
 least-0.082
 reality-0.082
 heaven-0.078
 Gray-0.077
 Tooth-0.077
Token away
Token ID1497
Feature activation+4.71e-02

Pos logit contributions
adoes+0.032
 face+0.032
rac+0.032
 structures+0.031
asuring+0.031
Neg logit contributions
 reality-0.037
 wishing-0.036
 Gray-0.035
 least-0.035
 heaven-0.035
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
 least+0.001
 reality+0.001
 heaven+0.001
 Tooth+0.001
rob+0.001
Neg logit contributions
 face-0.001
adoes-0.001
 structures-0.001
asuring-0.001
ag-0.001
Token
Token ID198
Feature activation-5.72e-02

Pos logit contributions
adoes+0.005
 face+0.005
 structures+0.004
ag+0.004
rac+0.004
Neg logit contributions
 least-0.004
 reality-0.004
 Gray-0.004
 Tooth-0.004
 wishing-0.004
Token
Token ID198
Feature activation+8.89e-02

Pos logit contributions
adoes+0.002
 face+0.002
 structures+0.002
asuring+0.002
rac+0.002
Neg logit contributions
 least-0.001
 reality-0.001
 Gray-0.001
 Tooth-0.001
 anytime-0.001
Token They
Token ID1119
Feature activation-7.31e-02

Pos logit contributions
 least+0.010
 reality+0.010
rob+0.010
ge+0.010
 wishing+0.010
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.004
stals-0.004
rac-0.004
Token talked
Token ID6619
Feature activation-7.02e-01

Pos logit contributions
 face+0.001
rac+0.001
 in+0.001
 crossed+0.001
 did+0.001
Neg logit contributions
 reality-0.003
 certainty-0.003
 possible-0.003
 Tooth-0.003
 heaven-0.003
Token about
Token ID546
Feature activation+1.01e+00

Pos logit contributions
 face+0.013
adoes+0.013
 structures+0.012
ag+0.011
rac+0.011
Neg logit contributions
 least-0.026
 reality-0.026
 heaven-0.025
 Tooth-0.025
 Gray-0.025
Token what
Token ID644
Feature activation-4.35e-02

Pos logit contributions
rob+0.035
 least+0.035
 Tooth+0.034
 reality+0.033
ited+0.033
Neg logit contributions
 face-0.022
 structures-0.022
adoes-0.021
 squash-0.018
 jokes-0.017
Token they
Token ID484
Feature activation-1.61e-01

Pos logit contributions
adoes+0.001
 face+0.001
ag+0.001
ags+0.001
rac+0.001
Neg logit contributions
 reality-0.001
 least-0.001
 Gray-0.001
 heaven-0.001
 wishing-0.001
Token saw
Token ID2497
Feature activation-1.88e+00

Pos logit contributions
 face+0.003
asuring+0.002
ag+0.002
rac+0.002
agged+0.002
Neg logit contributions
 reality-0.006
ge-0.006
 heaven-0.006
 Tooth-0.006
 Gray-0.006
Token and
Token ID290
Feature activation-2.95e-01

Pos logit contributions
 in+0.036
 face+0.036
adoes+0.034
 structures+0.032
ag+0.031
Neg logit contributions
 reality-0.071
 least-0.069
 Tooth-0.068
Ba-0.067
rob-0.067
Token learned
Token ID4499
Feature activation+2.45e-01

Pos logit contributions
 face+0.008
adoes+0.008
 structures+0.008
rac+0.007
asuring+0.007
Neg logit contributions
 least-0.009
 reality-0.008
 heaven-0.008
 Gray-0.008
 Tooth-0.008
Token in
Token ID287
Feature activation-9.76e-01

Pos logit contributions
 least+0.009
 reality+0.008
rob+0.008
 Tooth+0.008
 Gray+0.008
Neg logit contributions
 face-0.005
adoes-0.005
 structures-0.005
ag-0.005
asuring-0.004
Token the
Token ID262
Feature activation+2.79e-01

Pos logit contributions
 face+0.017
 structures+0.015
adoes+0.014
 pitch+0.013
 differences+0.013
Neg logit contributions
rob-0.041
ited-0.037
 therapist-0.037
 least-0.037
isc-0.036
Token museum
Token ID13257
Feature activation+8.60e-01

Pos logit contributions
 least+0.007
 reality+0.007
 heaven+0.007
 wishing+0.006
 Gray+0.006
Neg logit contributions
adoes-0.009
 face-0.009
 structures-0.008
rac-0.008
ag-0.008