TOP ACTIVATIONS
MAX = 0.086

her mom my watched in amaz ement as the machine flew
animals were watching him with amaz ement .
. To her amaz ement , the clothes came
at the bright colours with amaz ement and awe . He
eddy bear and watched in amaz ement as it shrunk an
. Tim my watched with amaz ement as he saw stars
her eyes grew wide with amaz ement . She wanted to
air . He watched with amaz ement as the z ig
as the woman watched in amaz ement . She saw the
thin stone wasn â t good enough to join
realised he wasn â t scared to go to
go ! Everyone watched in amaz ement as it took off
knew she wasn â t a threat and it
looked at the bug with amaz ement .
friends stopped and watched in amaz ement . The dog too
and it wasn â t long until Chloe felt
The little girl watched with amaz ement when her mom my
egg ! He watched in amaz ement as the egg began
watched the k ite in amaz ement , and Little Joe
like he wasn â t successful . <|endoftext|> Once

MIN ACTIVATIONS
MIN = -0.081

, sauce , cheese and topp ings . She showed them
He washed it with so apy water and dried it with
a big bucket of so apy water . She said ,
in the warm , so apy water . She spl ashed
in the warm , so apy water . His mom and
. It 's all so apy and wide !
a cloth and some so apy water and got to work
it with lots of so apy bubbles . Finally she polished
of cheese and sauce and topp ings . The people who
a toy symbol in so apy water , it would become
brush and bucket of so apy water . Sus ie scrub
his bubble maker with so apy water and blew . Bub
inside ! There were ch ocol ates , cookies , and
full of warm , so apy water and left him to
out the towels and so apy water . Little one brushed
and filled it with so apy water . He went back
cakes and cookies and ch ocol ates and ice cream .
a big bowl of so apy water and she was shaking
into the bucket of so apy water and started turning around
They also see some fancy topp ings . They see sprink

INTERVAL 0.044 - 0.086
CONTAINS 0.074%

he couldn 't ride his bicycle . He wob bled around
back , Mom my asked Mary to say thank you .
but he couldn â t find anything .
so Mama said , " Sarah , it is selfish not
said " Let â s go buy some tea

INTERVAL 0.003 - 0.044
CONTAINS 40.478%

were breath less . G ail smiled at Sue .
a stepped out and began to speak . The
the human went was so beautiful and colorful . The human
" No , you won 't . You can 't do
. Tim my looked at the envelope and said

INTERVAL -0.039 - 0.003
CONTAINS 59.254%

poured some wine on Ben 's tower . The wine made
it forever . <|endoftext|> One dark night , Jack gazed up
sticker !" Mom and dad smiled and hugged him
he was helping people by giving them medals . He felt
was inside . They opened the big door and

INTERVAL -0.081 - -0.039
CONTAINS 0.194%

We are sorry . Please forgive us ." The
and finding shortcuts . The end . <|endoftext|> Once upon a
with his ball . The end . <|endoftext|> Once upon a
songs . They had a very happy time together . <|endoftext|>
time , there was a hungry little girl . Her tum
Token her
Token ID607
Feature activation+9.40e-03

Pos logit contributions
 spaghetti+0.031
bat+0.029
 poems+0.029
 elbows+0.028
 sauce+0.028
Neg logit contributions
stru-0.023
 clim-0.021
 of-0.020
away-0.020
or-0.019
Token mom
Token ID1995
Feature activation-6.68e-03

Pos logit contributions
stru+0.013
 clim+0.012
 of+0.011
away+0.011
 year+0.011
Neg logit contributions
 spaghetti-0.017
 sauce-0.016
 poems-0.016
 noodles-0.016
ems-0.016
Tokenmy
Token ID1820
Feature activation-4.94e-03

Pos logit contributions
 spaghetti+0.013
 poems+0.013
 masks+0.012
 noodles+0.012
 sauce+0.012
Neg logit contributions
stru-0.008
 clim-0.008
 of-0.007
or-0.007
ris-0.007
Token watched
Token ID7342
Feature activation+1.02e-02

Pos logit contributions
 poems+0.010
 spaghetti+0.010
 masks+0.010
 sauce+0.010
 noodles+0.010
Neg logit contributions
stru-0.006
 clim-0.005
 of-0.005
or-0.005
ester-0.005
Token in
Token ID287
Feature activation-6.62e-03

Pos logit contributions
stru+0.010
 of+0.010
away+0.009
 forward+0.009
or+0.009
Neg logit contributions
 poems-0.021
ccoli-0.021
 spaghetti-0.021
ems-0.021
 sauce-0.021
Token amaz
Token ID40642
Feature activation+8.62e-02

Pos logit contributions
 poems+0.013
 spaghetti+0.013
ccoli+0.013
ems+0.013
 sauce+0.013
Neg logit contributions
stru-0.007
away-0.007
 of-0.007
 clim-0.007
 forward-0.006
Tokenement
Token ID972
Feature activation+3.06e-02

Pos logit contributions
stru+0.088
 clim+0.075
ester+0.065
ur+0.062
ris+0.062
Neg logit contributions
 spaghetti-0.205
 poems-0.199
 sauce-0.190
 gum-0.187
 noodles-0.186
Token as
Token ID355
Feature activation-6.09e-03

Pos logit contributions
stru+0.026
 of+0.023
 clim+0.023
away+0.021
or+0.021
Neg logit contributions
 spaghetti-0.072
 sauce-0.071
 poems-0.070
 gum-0.069
 noodles-0.068
Token the
Token ID262
Feature activation+1.52e-02

Pos logit contributions
 spaghetti+0.012
 poems+0.012
 sauce+0.012
 noodles+0.011
bat+0.011
Neg logit contributions
stru-0.008
 clim-0.007
 of-0.007
away-0.007
 ever-0.006
Token machine
Token ID4572
Feature activation+9.99e-03

Pos logit contributions
stru+0.022
 clim+0.021
 year+0.020
 of+0.020
away+0.019
Neg logit contributions
 spaghetti-0.027
 poems-0.026
ccoli-0.025
 sauce-0.025
ems-0.025
Token flew
Token ID13112
Feature activation+2.19e-02

Pos logit contributions
stru+0.012
 of+0.011
 clim+0.011
or+0.011
 ever+0.010
Neg logit contributions
 spaghetti-0.021
 poems-0.020
 sauce-0.019
 noodles-0.019
 gum-0.019
Token animals
Token ID4695
Feature activation+4.74e-03

Pos logit contributions
 spaghetti+0.017
 poems+0.017
 sauce+0.016
 noodles+0.015
 gum+0.015
Neg logit contributions
stru-0.025
 clim-0.023
away-0.022
ester-0.022
or-0.022
Token were
Token ID547
Feature activation+7.94e-03

Pos logit contributions
stru+0.006
 clim+0.005
away+0.005
or+0.005
ester+0.005
Neg logit contributions
 spaghetti-0.010
 poems-0.010
 sauce-0.009
 noodles-0.009
 gum-0.009
Token watching
Token ID4964
Feature activation+9.64e-03

Pos logit contributions
stru+0.008
 clim+0.008
 of+0.007
 able+0.007
 victorious+0.007
Neg logit contributions
 spaghetti-0.018
 poems-0.017
ems-0.017
 sauce-0.017
ccoli-0.017
Token him
Token ID683
Feature activation-4.08e-03

Pos logit contributions
 of+0.009
stru+0.008
-+0.008
 forward+0.007
away+0.007
Neg logit contributions
 elbows-0.021
ems-0.021
ccoli-0.021
 fiber-0.021
irms-0.020
Token with
Token ID351
Feature activation+8.72e-03

Pos logit contributions
 spaghetti+0.009
 poems+0.009
 sauce+0.009
 noodles+0.008
 gum+0.008
Neg logit contributions
stru-0.005
 clim-0.004
away-0.004
 of-0.004
or-0.004
Token amaz
Token ID40642
Feature activation+8.52e-02

Pos logit contributions
 of+0.010
stru+0.009
 clim+0.009
 forward+0.009
 ever+0.009
Neg logit contributions
aghetti-0.017
ems-0.017
ccoli-0.017
 poems-0.017
 spaghetti-0.016
Tokenement
Token ID972
Feature activation+3.22e-02

Pos logit contributions
stru+0.068
 clim+0.055
away+0.048
ester+0.047
ris+0.046
Neg logit contributions
 spaghetti-0.217
 poems-0.211
 sauce-0.207
 noodles-0.203
 gum-0.201
Token.
Token ID13
Feature activation-7.33e-03

Pos logit contributions
stru+0.027
 of+0.024
 clim+0.024
away+0.021
 ever+0.021
Neg logit contributions
 spaghetti-0.077
 poems-0.075
 sauce-0.074
 gum-0.073
 noodles-0.072
Token␣
Token ID220
Feature activation+1.37e-02

Pos logit contributions
 poems+0.016
 spaghetti+0.015
 sauce+0.015
 masks+0.014
 noodles+0.014
Neg logit contributions
stru-0.008
 clim-0.008
ris-0.008
 of-0.007
 victorious-0.007
Token\n
Token ID198
Feature activation+3.08e-03

Pos logit contributions
stru+0.015
or+0.012
 clim+0.012
 of+0.012
old+0.012
Neg logit contributions
 spaghetti-0.031
 poems-0.031
 sauce-0.030
 noodles-0.030
 masks-0.029
Token\n
Token ID198
Feature activation+8.10e-03

Pos logit contributions
stru+0.003
ris+0.003
 clim+0.003
or+0.003
away+0.003
Neg logit contributions
 spaghetti-0.007
 poems-0.007
 sauce-0.007
 noodles-0.006
 masks-0.006
Token.
Token ID13
Feature activation-5.19e-03

Pos logit contributions
 of+0.007
stru+0.007
or+0.006
 clim+0.006
away+0.006
Neg logit contributions
 spaghetti-0.022
 poems-0.022
 sauce-0.020
 noodles-0.020
ccoli-0.020
Token\n
Token ID198
Feature activation+5.24e-03

Pos logit contributions
 poems+0.011
 spaghetti+0.010
 masks+0.010
 sauce+0.010
 noodles+0.010
Neg logit contributions
stru-0.006
 clim-0.006
 of-0.005
ur-0.005
old-0.005
Token\n
Token ID198
Feature activation+1.05e-02

Pos logit contributions
stru+0.006
 clim+0.006
ris+0.005
away+0.005
or+0.005
Neg logit contributions
 spaghetti-0.011
 poems-0.011
 sauce-0.010
 noodles-0.010
 masks-0.010
TokenTo
Token ID2514
Feature activation+1.57e-02

Pos logit contributions
stru+0.015
 clim+0.014
away+0.013
ris+0.013
or+0.013
Neg logit contributions
 spaghetti-0.020
 poems-0.019
 sauce-0.019
 noodles-0.019
 masks-0.018
Token her
Token ID607
Feature activation+2.08e-02

Pos logit contributions
stru+0.023
 clim+0.021
away+0.020
or+0.020
ris+0.020
Neg logit contributions
 spaghetti-0.029
 poems-0.028
 sauce-0.027
 noodles-0.027
 gum-0.026
Token amaz
Token ID40642
Feature activation+8.50e-02

Pos logit contributions
stru+0.033
 clim+0.029
away+0.029
or+0.028
ester+0.028
Neg logit contributions
 spaghetti-0.038
 poems-0.036
 sauce-0.036
 gum-0.034
 masks-0.034
Tokenement
Token ID972
Feature activation+3.62e-02

Pos logit contributions
stru+0.085
 clim+0.070
ur+0.060
ester+0.059
ris+0.058
Neg logit contributions
 spaghetti-0.200
 poems-0.198
 sauce-0.190
 gum-0.188
 noodles-0.184
Token,
Token ID11
Feature activation-3.15e-03

Pos logit contributions
stru+0.029
 clim+0.025
away+0.022
or+0.022
ester+0.021
Neg logit contributions
 spaghetti-0.091
 poems-0.088
 sauce-0.087
 noodles-0.085
 masks-0.085
Token the
Token ID262
Feature activation+8.79e-03

Pos logit contributions
 spaghetti+0.007
 sauce+0.007
 poems+0.007
 noodles+0.006
ccoli+0.006
Neg logit contributions
stru-0.003
 clim-0.003
 of-0.003
away-0.003
 forward-0.003
Token clothes
Token ID8242
Feature activation+1.97e-02

Pos logit contributions
stru+0.010
 year+0.010
 clim+0.009
 of+0.009
away+0.009
Neg logit contributions
 spaghetti-0.017
 poems-0.017
bat-0.017
ems-0.017
ccoli-0.016
Token came
Token ID1625
Feature activation+1.63e-02

Pos logit contributions
stru+0.027
 clim+0.023
or+0.023
away+0.023
ester+0.022
Neg logit contributions
 spaghetti-0.040
 poems-0.038
 sauce-0.037
 noodles-0.036
 gum-0.035
Token at
Token ID379
Feature activation+2.70e-02

Pos logit contributions
stru+0.025
 clim+0.023
ris+0.022
or+0.022
ester+0.022
Neg logit contributions
 spaghetti-0.040
 poems-0.037
 sauce-0.037
 masks-0.036
 noodles-0.035
Token the
Token ID262
Feature activation+2.58e-04

Pos logit contributions
stru+0.042
 clim+0.038
away+0.036
ester+0.036
or+0.035
Neg logit contributions
 spaghetti-0.049
 poems-0.047
 sauce-0.046
 noodles-0.044
 masks-0.044
Token bright
Token ID6016
Feature activation+6.48e-03

Pos logit contributions
stru+0.000
 clim+0.000
ester+0.000
 of+0.000
 year+0.000
Neg logit contributions
 spaghetti-0.001
 poems-0.000
 noodles-0.000
ccoli-0.000
bat-0.000
Token colours
Token ID18915
Feature activation+1.26e-02

Pos logit contributions
stru+0.011
 of+0.010
 clim+0.010
away+0.010
ester+0.010
Neg logit contributions
 spaghetti-0.010
 poems-0.010
ccoli-0.010
 noodles-0.009
 masks-0.009
Token with
Token ID351
Feature activation+1.91e-02

Pos logit contributions
stru+0.015
 clim+0.013
away+0.012
ester+0.012
or+0.012
Neg logit contributions
 spaghetti-0.027
 poems-0.026
 sauce-0.026
 noodles-0.025
 masks-0.025
Token amaz
Token ID40642
Feature activation+8.31e-02

Pos logit contributions
stru+0.025
 of+0.024
 clim+0.023
away+0.022
 forward+0.022
Neg logit contributions
 spaghetti-0.036
ccoli-0.036
aghetti-0.035
ems-0.034
 poems-0.034
Tokenement
Token ID972
Feature activation+3.03e-02

Pos logit contributions
stru+0.105
 clim+0.089
ur+0.082
urry+0.079
ester+0.077
Neg logit contributions
 spaghetti-0.173
 poems-0.172
 sauce-0.167
 gum-0.166
 dough-0.162
Token and
Token ID290
Feature activation+1.41e-04

Pos logit contributions
stru+0.030
 clim+0.026
away+0.024
or+0.023
ester+0.023
Neg logit contributions
 spaghetti-0.070
 poems-0.068
 sauce-0.067
 noodles-0.066
 masks-0.065
Token awe
Token ID25030
Feature activation+3.81e-02

Pos logit contributions
stru+0.000
 clim+0.000
 of+0.000
 victorious+0.000
away+0.000
Neg logit contributions
 spaghetti-0.000
aghetti-0.000
 poems-0.000
 sauce-0.000
ccoli-0.000
Token.
Token ID13
Feature activation-1.39e-02

Pos logit contributions
stru+0.049
 clim+0.043
ester+0.040
ris+0.040
or+0.040
Neg logit contributions
 spaghetti-0.078
 poems-0.076
 sauce-0.074
 noodles-0.072
 masks-0.072
Token He
Token ID679
Feature activation-4.56e-03

Pos logit contributions
 poems+0.028
 spaghetti+0.028
 masks+0.027
 sauce+0.027
 noodles+0.027
Neg logit contributions
stru-0.017
 clim-0.015
ris-0.014
ester-0.014
old-0.014
Tokeneddy
Token ID21874
Feature activation-1.29e-02

Pos logit contributions
stru+0.025
 clim+0.023
 victorious+0.021
 forward+0.021
ris+0.020
Neg logit contributions
 spaghetti-0.022
 poems-0.021
 sauce-0.021
bat-0.020
 noodles-0.020
Token bear
Token ID6842
Feature activation-6.19e-03

Pos logit contributions
 spaghetti+0.025
 poems+0.024
 sauce+0.023
 mustard+0.023
 elbows+0.023
Neg logit contributions
stru-0.015
or-0.014
away-0.014
 clim-0.014
--0.014
Token and
Token ID290
Feature activation-6.75e-03

Pos logit contributions
 poems+0.014
 spaghetti+0.014
 noodles+0.013
 sauce+0.013
irms+0.013
Neg logit contributions
stru-0.006
 clim-0.006
or-0.005
 of-0.005
away-0.005
Token watched
Token ID7342
Feature activation+1.63e-02

Pos logit contributions
aghetti+0.012
 spaghetti+0.012
bat+0.012
 poems+0.011
ksh+0.011
Neg logit contributions
stru-0.008
 clim-0.008
 of-0.008
 returned-0.007
 ever-0.007
Token in
Token ID287
Feature activation-9.90e-04

Pos logit contributions
stru+0.017
 of+0.015
away+0.015
 clim+0.014
ively+0.014
Neg logit contributions
 spaghetti-0.035
 poems-0.034
 masks-0.034
ems-0.034
 sauce-0.034
Token amaz
Token ID40642
Feature activation+8.12e-02

Pos logit contributions
 spaghetti+0.002
 poems+0.002
 sauce+0.002
 noodles+0.002
ccoli+0.002
Neg logit contributions
stru-0.001
 clim-0.001
away-0.001
ris-0.001
 of-0.001
Tokenement
Token ID972
Feature activation+3.63e-02

Pos logit contributions
stru+0.087
 clim+0.074
ester+0.065
ur+0.063
ris+0.061
Neg logit contributions
 spaghetti-0.188
 poems-0.183
 sauce-0.175
 gum-0.173
 noodles-0.170
Token as
Token ID355
Feature activation-5.37e-03

Pos logit contributions
stru+0.035
 clim+0.031
away+0.029
 of+0.029
or+0.028
Neg logit contributions
 spaghetti-0.085
 sauce-0.082
 poems-0.081
 noodles-0.079
 gum-0.079
Token it
Token ID340
Feature activation-1.53e-02

Pos logit contributions
 spaghetti+0.011
 poems+0.010
 sauce+0.010
 noodles+0.010
 masks+0.010
Neg logit contributions
stru-0.007
 clim-0.006
away-0.006
or-0.006
 of-0.006
Token shrunk
Token ID38900
Feature activation-1.93e-03

Pos logit contributions
 spaghetti+0.033
 poems+0.032
 sauce+0.030
 gum+0.030
 noodles+0.030
Neg logit contributions
stru-0.019
 clim-0.017
 of-0.016
 ever-0.016
 returned-0.015
Token an
Token ID281
Feature activation+1.71e-02

Pos logit contributions
 spaghetti+0.005
 poems+0.004
 sauce+0.004
 noodles+0.004
 gum+0.004
Neg logit contributions
stru-0.002
away-0.002
 clim-0.002
or-0.001
 forward-0.001
Token.
Token ID13
Feature activation-3.12e-03

Pos logit contributions
stru+0.039
away+0.034
 clim+0.034
or+0.033
ris+0.033
Neg logit contributions
 spaghetti-0.054
 poems-0.051
 sauce-0.051
 gum-0.049
 noodles-0.049
Token Tim
Token ID5045
Feature activation+1.33e-03

Pos logit contributions
 poems+0.006
 spaghetti+0.006
 sauce+0.006
 noodles+0.006
 masks+0.006
Neg logit contributions
stru-0.004
 clim-0.003
 of-0.003
old-0.003
ur-0.003
Tokenmy
Token ID1820
Feature activation-2.08e-03

Pos logit contributions
stru+0.002
 clim+0.002
old+0.002
ris+0.002
or+0.002
Neg logit contributions
 spaghetti-0.002
 poems-0.002
 sauce-0.002
 noodles-0.002
 gum-0.002
Token watched
Token ID7342
Feature activation+1.00e-02

Pos logit contributions
 spaghetti+0.004
 poems+0.004
 sauce+0.004
 noodles+0.004
 gum+0.004
Neg logit contributions
stru-0.002
 clim-0.002
old-0.002
or-0.002
ris-0.002
Token with
Token ID351
Feature activation+7.59e-03

Pos logit contributions
stru+0.010
 of+0.010
away+0.009
ively+0.009
 clim+0.009
Neg logit contributions
ems-0.021
 poems-0.021
 spaghetti-0.021
ccoli-0.021
 masks-0.021
Token amaz
Token ID40642
Feature activation+8.12e-02

Pos logit contributions
stru+0.010
 clim+0.010
away+0.009
ris+0.009
ester+0.009
Neg logit contributions
 spaghetti-0.014
 poems-0.014
 sauce-0.014
 noodles-0.013
ems-0.013
Tokenement
Token ID972
Feature activation+3.15e-02

Pos logit contributions
stru+0.082
 clim+0.072
ester+0.061
away+0.060
ris+0.060
Neg logit contributions
 spaghetti-0.189
 poems-0.186
 sauce-0.178
 gum-0.176
 noodles-0.172
Token as
Token ID355
Feature activation-7.89e-03

Pos logit contributions
stru+0.031
 clim+0.027
away+0.025
or+0.024
ester+0.023
Neg logit contributions
 spaghetti-0.074
 poems-0.071
 sauce-0.071
 noodles-0.069
 masks-0.068
Token he
Token ID339
Feature activation-2.67e-03

Pos logit contributions
 spaghetti+0.016
 poems+0.015
 sauce+0.015
 noodles+0.015
 masks+0.014
Neg logit contributions
stru-0.010
 clim-0.009
away-0.009
or-0.008
ester-0.008
Token saw
Token ID2497
Feature activation+4.01e-03

Pos logit contributions
 spaghetti+0.005
 poems+0.005
 sauce+0.005
 noodles+0.004
 masks+0.004
Neg logit contributions
stru-0.004
 clim-0.004
away-0.003
ester-0.003
or-0.003
Token stars
Token ID5788
Feature activation+1.47e-02

Pos logit contributions
stru+0.005
 clim+0.005
 of+0.005
away+0.005
ris+0.005
Neg logit contributions
 spaghetti-0.008
 poems-0.007
ccoli-0.007
 sauce-0.007
 noodles-0.007
Token her
Token ID607
Feature activation+8.77e-03

Pos logit contributions
 spaghetti+0.005
 poems+0.005
ccoli+0.005
aghetti+0.005
 sauce+0.005
Neg logit contributions
stru-0.002
 clim-0.002
 of-0.002
 forward-0.001
 ever-0.001
Token eyes
Token ID2951
Feature activation+6.84e-04

Pos logit contributions
stru+0.012
 clim+0.011
away+0.011
ester+0.011
ris+0.010
Neg logit contributions
 spaghetti-0.017
 poems-0.016
 sauce-0.016
 noodles-0.015
 masks-0.015
Token grew
Token ID6348
Feature activation+7.04e-05

Pos logit contributions
stru+0.001
 clim+0.001
away+0.001
or+0.001
ester+0.001
Neg logit contributions
 spaghetti-0.001
 poems-0.001
 sauce-0.001
 noodles-0.001
 masks-0.001
Token wide
Token ID3094
Feature activation+3.18e-02

Pos logit contributions
stru+0.000
 clim+0.000
away+0.000
ester+0.000
or+0.000
Neg logit contributions
 spaghetti-0.000
 poems-0.000
 sauce-0.000
 noodles-0.000
 masks-0.000
Token with
Token ID351
Feature activation+9.63e-03

Pos logit contributions
stru+0.039
 clim+0.034
away+0.032
ester+0.032
or+0.032
Neg logit contributions
 spaghetti-0.067
 poems-0.065
 sauce-0.064
 noodles-0.062
 masks-0.061
Token amaz
Token ID40642
Feature activation+8.07e-02

Pos logit contributions
stru+0.013
 of+0.012
 clim+0.012
away+0.012
 forward+0.012
Neg logit contributions
ccoli-0.018
 spaghetti-0.017
ems-0.017
aghetti-0.017
 poems-0.017
Tokenement
Token ID972
Feature activation+3.09e-02

Pos logit contributions
stru+0.101
 clim+0.087
ur+0.079
ester+0.076
urry+0.076
Neg logit contributions
 spaghetti-0.169
 poems-0.167
 gum-0.159
 sauce-0.159
 noodles-0.153
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
stru+0.034
 clim+0.030
away+0.028
ester+0.027
or+0.027
Neg logit contributions
 spaghetti-0.068
 poems-0.066
 sauce-0.065
 noodles-0.064
 masks-0.063
Token She
Token ID1375
Feature activation-7.17e-04

Pos logit contributions
 spaghetti+0.020
 poems+0.020
 sauce+0.019
 masks+0.019
 noodles+0.019
Neg logit contributions
stru-0.014
 clim-0.012
ester-0.011
or-0.011
away-0.011
Token wanted
Token ID2227
Feature activation+7.72e-03

Pos logit contributions
 spaghetti+0.001
 poems+0.001
 sauce+0.001
 noodles+0.001
 gum+0.001
Neg logit contributions
stru-0.001
 clim-0.001
away-0.001
ris-0.001
ester-0.001
Token to
Token ID284
Feature activation+2.24e-03

Pos logit contributions
stru+0.007
 clim+0.006
away+0.006
 of+0.006
 ever+0.005
Neg logit contributions
 spaghetti-0.018
 poems-0.018
 sauce-0.017
ccoli-0.017
bat-0.017
Token air
Token ID1633
Feature activation+2.67e-03

Pos logit contributions
 spaghetti+0.014
 poems+0.014
 sauce+0.014
 noodles+0.013
ccoli+0.013
Neg logit contributions
stru-0.011
 clim-0.010
 of-0.010
ively-0.009
away-0.009
Token.
Token ID13
Feature activation-6.94e-03

Pos logit contributions
stru+0.002
 of+0.002
away+0.002
 forward+0.002
 clim+0.002
Neg logit contributions
 spaghetti-0.006
 poems-0.006
ccoli-0.006
 sauce-0.006
 noodles-0.006
Token He
Token ID679
Feature activation-9.84e-03

Pos logit contributions
 poems+0.014
 spaghetti+0.014
 masks+0.013
 sauce+0.013
 noodles+0.013
Neg logit contributions
 of-0.008
stru-0.007
 clim-0.007
ur-0.007
old-0.007
Token watched
Token ID7342
Feature activation+1.04e-02

Pos logit contributions
 spaghetti+0.019
 poems+0.018
 sauce+0.017
 gum+0.017
bat+0.017
Neg logit contributions
stru-0.013
 clim-0.012
ris-0.011
 of-0.011
or-0.011
Token with
Token ID351
Feature activation+3.94e-03

Pos logit contributions
 of+0.010
stru+0.010
away+0.009
ively+0.009
 forward+0.009
Neg logit contributions
ems-0.022
ccoli-0.022
 poems-0.022
 spaghetti-0.022
bat-0.022
Token amaz
Token ID40642
Feature activation+8.07e-02

Pos logit contributions
stru+0.005
 clim+0.005
away+0.005
 of+0.005
 forward+0.005
Neg logit contributions
 spaghetti-0.007
 poems-0.007
 sauce-0.007
bat-0.007
 noodles-0.007
Tokenement
Token ID972
Feature activation+3.07e-02

Pos logit contributions
stru+0.080
 clim+0.069
ester+0.061
ris+0.058
away+0.057
Neg logit contributions
 spaghetti-0.188
 poems-0.185
 sauce-0.177
 gum-0.175
 noodles-0.173
Token as
Token ID355
Feature activation-9.56e-03

Pos logit contributions
stru+0.026
 clim+0.023
 of+0.021
away+0.021
or+0.020
Neg logit contributions
 spaghetti-0.074
 poems-0.072
 sauce-0.072
 gum-0.070
 noodles-0.069
Token the
Token ID262
Feature activation+1.56e-02

Pos logit contributions
 spaghetti+0.019
 poems+0.019
 sauce+0.019
 noodles+0.018
 masks+0.018
Neg logit contributions
stru-0.012
 clim-0.011
away-0.010
ris-0.010
or-0.010
Token z
Token ID1976
Feature activation+2.25e-02

Pos logit contributions
stru+0.022
 clim+0.022
 year+0.021
 of+0.021
away+0.020
Neg logit contributions
 spaghetti-0.027
ccoli-0.026
 poems-0.026
ems-0.025
bat-0.025
Tokenig
Token ID328
Feature activation+1.76e-02

Pos logit contributions
stru+0.027
away+0.021
 clim+0.020
 forward+0.020
old+0.020
Neg logit contributions
 poems-0.045
 sauce-0.045
 spaghetti-0.044
 gum-0.044
 noodles-0.043
Token as
Token ID355
Feature activation-8.25e-03

Pos logit contributions
stru+0.013
 clim+0.011
ester+0.011
away+0.011
ris+0.010
Neg logit contributions
 spaghetti-0.023
 poems-0.022
 sauce-0.022
 masks-0.021
 noodles-0.021
Token the
Token ID262
Feature activation+1.53e-02

Pos logit contributions
 spaghetti+0.016
 poems+0.015
 sauce+0.015
 noodles+0.014
 gum+0.014
Neg logit contributions
stru-0.012
 clim-0.010
ester-0.010
away-0.010
or-0.010
Token woman
Token ID2415
Feature activation+4.11e-02

Pos logit contributions
stru+0.020
 clim+0.020
 of+0.019
 year+0.019
away+0.019
Neg logit contributions
 spaghetti-0.028
ccoli-0.027
 poems-0.027
ems-0.026
 sauce-0.026
Token watched
Token ID7342
Feature activation+1.55e-02

Pos logit contributions
stru+0.059
 clim+0.053
away+0.051
or+0.050
ester+0.050
Neg logit contributions
 spaghetti-0.078
 poems-0.075
 sauce-0.073
 noodles-0.071
 masks-0.070
Token in
Token ID287
Feature activation-3.62e-03

Pos logit contributions
stru+0.018
 clim+0.016
away+0.016
or+0.015
 of+0.015
Neg logit contributions
 spaghetti-0.033
 poems-0.032
 sauce-0.031
 masks-0.031
 noodles-0.031
Token amaz
Token ID40642
Feature activation+8.02e-02

Pos logit contributions
 spaghetti+0.007
 poems+0.007
 sauce+0.007
 noodles+0.007
ccoli+0.006
Neg logit contributions
stru-0.005
 clim-0.004
away-0.004
ris-0.004
 forward-0.004
Tokenement
Token ID972
Feature activation+3.68e-02

Pos logit contributions
stru+0.086
 clim+0.073
ester+0.066
ris+0.062
ur+0.060
Neg logit contributions
 spaghetti-0.181
 poems-0.179
 sauce-0.170
 gum-0.168
 noodles-0.167
Token.
Token ID13
Feature activation-1.31e-03

Pos logit contributions
stru+0.034
 clim+0.029
away+0.028
or+0.026
 of+0.026
Neg logit contributions
 spaghetti-0.089
 sauce-0.085
 poems-0.085
 noodles-0.083
 gum-0.082
Token She
Token ID1375
Feature activation+2.41e-03

Pos logit contributions
 spaghetti+0.003
 poems+0.003
 sauce+0.003
 masks+0.003
 noodles+0.003
Neg logit contributions
stru-0.002
 clim-0.002
ris-0.001
ur-0.001
 forward-0.001
Token saw
Token ID2497
Feature activation-1.89e-03

Pos logit contributions
stru+0.003
 clim+0.003
away+0.003
or+0.003
ester+0.003
Neg logit contributions
 spaghetti-0.005
 poems-0.005
 sauce-0.004
 noodles-0.004
 gum-0.004
Token the
Token ID262
Feature activation+8.71e-03

Pos logit contributions
 spaghetti+0.004
 sauce+0.004
 poems+0.004
ccoli+0.004
 noodles+0.004
Neg logit contributions
stru-0.002
 clim-0.002
 of-0.002
away-0.002
old-0.002
Token thin
Token ID7888
Feature activation-1.05e-02

Pos logit contributions
stru+0.005
 clim+0.004
away+0.004
 ever+0.004
 victorious+0.004
Neg logit contributions
 spaghetti-0.007
bat-0.007
aghetti-0.007
 poems-0.006
ccoli-0.006
Token stone
Token ID7815
Feature activation+8.98e-03

Pos logit contributions
 spaghetti+0.019
 poems+0.018
bat+0.017
 sauce+0.017
 masks+0.017
Neg logit contributions
stru-0.015
 clim-0.015
away-0.014
or-0.013
 of-0.013
Token wasn
Token ID2492
Feature activation+3.26e-02

Pos logit contributions
stru+0.011
 clim+0.010
away+0.009
ester+0.009
or+0.009
Neg logit contributions
 spaghetti-0.019
 poems-0.018
 sauce-0.018
 noodles-0.017
 masks-0.017
Tokenâ
Token ID22940
Feature activation+2.64e-02

Pos logit contributions
stru+0.049
away+0.044
 clim+0.043
ester+0.043
ris+0.041
Neg logit contributions
 spaghetti-0.059
 poems-0.056
 sauce-0.055
 gum-0.054
 noodles-0.053
Token
Token ID26391
Feature activation+5.18e-02

Pos logit contributions
stru+0.044
 clim+0.040
away+0.038
ester+0.038
ris+0.037
Neg logit contributions
 spaghetti-0.043
 poems-0.042
 sauce-0.041
 noodles-0.039
 gum-0.039
Token
Token ID8151
Feature activation+8.02e-02

Pos logit contributions
stru+0.074
 clim+0.067
away+0.065
ester+0.063
or+0.062
Neg logit contributions
 spaghetti-0.097
 poems-0.094
 sauce-0.092
 noodles-0.089
 gum-0.088
Tokent
Token ID83
Feature activation-5.68e-03

Pos logit contributions
stru+0.083
 clim+0.074
away+0.072
 adventurer+0.064
 checkpoint+0.063
Neg logit contributions
 spaghetti-0.175
 sauce-0.168
 poems-0.166
 gum-0.165
 dough-0.160
Token good
Token ID922
Feature activation+3.61e-03

Pos logit contributions
 spaghetti+0.012
 poems+0.012
 sauce+0.012
 noodles+0.012
 elbows+0.012
Neg logit contributions
stru-0.006
 clim-0.006
away-0.005
 of-0.005
 ever-0.005
Token enough
Token ID1576
Feature activation-4.00e-03

Pos logit contributions
stru+0.004
 clim+0.004
away+0.003
ester+0.003
or+0.003
Neg logit contributions
 spaghetti-0.008
 poems-0.008
 sauce-0.008
 noodles-0.007
 masks-0.007
Token to
Token ID284
Feature activation+2.35e-03

Pos logit contributions
 spaghetti+0.009
 poems+0.008
 sauce+0.008
 noodles+0.008
 masks+0.008
Neg logit contributions
stru-0.005
 clim-0.004
away-0.004
ester-0.004
or-0.004
Token join
Token ID4654
Feature activation-1.89e-02

Pos logit contributions
stru+0.003
 clim+0.003
away+0.003
or+0.002
ris+0.002
Neg logit contributions
 spaghetti-0.005
 poems-0.005
 sauce-0.005
 noodles-0.005
 gum-0.005
Token realised
Token ID19169
Feature activation+3.21e-03

Pos logit contributions
 spaghetti+0.016
 poems+0.016
 sauce+0.016
 noodles+0.015
 masks+0.015
Neg logit contributions
stru-0.010
 clim-0.009
away-0.009
or-0.009
ester-0.009
Token he
Token ID339
Feature activation+3.82e-03

Pos logit contributions
stru+0.003
 clim+0.002
away+0.002
ester+0.002
or+0.002
Neg logit contributions
 spaghetti-0.008
 poems-0.008
 sauce-0.008
 noodles-0.007
 masks-0.007
Token wasn
Token ID2492
Feature activation+2.75e-02

Pos logit contributions
stru+0.006
 clim+0.006
away+0.006
or+0.005
ester+0.005
Neg logit contributions
 spaghetti-0.007
 poems-0.006
 sauce-0.006
 masks-0.006
 noodles-0.006
Tokenâ
Token ID22940
Feature activation+1.61e-02

Pos logit contributions
stru+0.049
away+0.045
 clim+0.043
ester+0.042
ris+0.041
Neg logit contributions
 spaghetti-0.041
 sauce-0.038
 gum-0.038
 poems-0.037
 yarn-0.036
Token
Token ID26391
Feature activation+4.14e-02

Pos logit contributions
stru+0.026
 clim+0.024
away+0.022
ester+0.022
or+0.021
Neg logit contributions
 spaghetti-0.027
 sauce-0.026
 poems-0.025
 gum-0.024
 noodles-0.024
Token
Token ID8151
Feature activation+7.99e-02

Pos logit contributions
stru+0.067
 clim+0.060
away+0.058
ester+0.057
or+0.057
Neg logit contributions
 spaghetti-0.072
 poems-0.069
 sauce-0.067
 noodles-0.065
 masks-0.064
Tokent
Token ID83
Feature activation+1.14e-03

Pos logit contributions
stru+0.096
away+0.091
 clim+0.089
 adventurer+0.087
 checkpoint+0.083
Neg logit contributions
 spaghetti-0.155
 gum-0.146
 sauce-0.145
 poems-0.142
bat-0.142
Token scared
Token ID12008
Feature activation+2.13e-02

Pos logit contributions
stru+0.001
 clim+0.001
away+0.001
ester+0.001
 of+0.001
Neg logit contributions
 spaghetti-0.002
 poems-0.002
 sauce-0.002
 noodles-0.002
 masks-0.002
Token to
Token ID284
Feature activation-1.30e-03

Pos logit contributions
stru+0.027
ris+0.023
 clim+0.022
or+0.022
ester+0.021
Neg logit contributions
 spaghetti-0.045
 sauce-0.042
 poems-0.042
 gum-0.040
 noodles-0.040
Token go
Token ID467
Feature activation+8.53e-03

Pos logit contributions
 spaghetti+0.003
 poems+0.003
 sauce+0.003
 noodles+0.002
 masks+0.002
Neg logit contributions
stru-0.002
 clim-0.002
away-0.001
or-0.001
ris-0.001
Token to
Token ID284
Feature activation-3.20e-03

Pos logit contributions
stru+0.013
 clim+0.011
ris+0.011
ester+0.010
ittle+0.010
Neg logit contributions
 spaghetti-0.015
 sauce-0.015
 masks-0.015
 gum-0.014
 yarn-0.014
Token go
Token ID467
Feature activation+2.69e-03

Pos logit contributions
 spaghetti+0.024
 poems+0.024
ccoli+0.024
aghetti+0.023
 sauce+0.023
Neg logit contributions
 clim-0.012
stru-0.011
 of-0.011
away-0.011
 return-0.011
Token!
Token ID0
Feature activation+4.15e-04

Pos logit contributions
stru+0.004
 clim+0.003
ester+0.003
ris+0.003
ively+0.003
Neg logit contributions
 spaghetti-0.005
 masks-0.005
 sauce-0.005
 poems-0.005
 noodles-0.005
Token Everyone
Token ID11075
Feature activation+7.72e-03

Pos logit contributions
stru+0.001
 clim+0.000
 of+0.000
ris+0.000
or+0.000
Neg logit contributions
 spaghetti-0.001
 poems-0.001
 sauce-0.001
 noodles-0.001
 masks-0.001
Token watched
Token ID7342
Feature activation+9.25e-03

Pos logit contributions
stru+0.011
 clim+0.009
away+0.009
ester+0.009
ris+0.009
Neg logit contributions
 spaghetti-0.015
 poems-0.014
 sauce-0.014
 noodles-0.014
 masks-0.014
Token in
Token ID287
Feature activation-1.18e-02

Pos logit contributions
stru+0.010
 of+0.009
away+0.009
 clim+0.009
 forward+0.008
Neg logit contributions
 spaghetti-0.020
 poems-0.020
 sauce-0.020
 gum-0.019
ccoli-0.019
Token amaz
Token ID40642
Feature activation+7.86e-02

Pos logit contributions
 spaghetti+0.024
 poems+0.024
 sauce+0.023
ccoli+0.023
ems+0.023
Neg logit contributions
stru-0.014
 clim-0.013
away-0.012
 forward-0.012
 of-0.012
Tokenement
Token ID972
Feature activation+3.30e-02

Pos logit contributions
stru+0.082
 clim+0.071
ester+0.063
ur+0.060
ris+0.059
Neg logit contributions
 spaghetti-0.180
 poems-0.176
 sauce-0.167
 masks-0.164
 noodles-0.164
Token as
Token ID355
Feature activation-1.13e-02

Pos logit contributions
stru+0.031
 clim+0.027
away+0.025
 of+0.024
or+0.024
Neg logit contributions
 spaghetti-0.079
 sauce-0.076
 poems-0.076
 gum-0.074
 noodles-0.073
Token it
Token ID340
Feature activation-1.64e-02

Pos logit contributions
 spaghetti+0.022
 poems+0.022
 sauce+0.021
 noodles+0.021
 masks+0.020
Neg logit contributions
stru-0.015
 clim-0.014
away-0.013
or-0.012
ris-0.012
Token took
Token ID1718
Feature activation-1.22e-02

Pos logit contributions
 spaghetti+0.035
 poems+0.033
 sauce+0.033
 gum+0.032
 noodles+0.032
Neg logit contributions
stru-0.021
 clim-0.019
or-0.017
away-0.017
 of-0.017
Token off
Token ID572
Feature activation-1.59e-03

Pos logit contributions
 spaghetti+0.025
 poems+0.024
 sauce+0.024
 noodles+0.023
ccoli+0.023
Neg logit contributions
stru-0.015
away-0.014
 clim-0.014
 forward-0.013
ris-0.013
Token knew
Token ID2993
Feature activation+9.39e-03

Pos logit contributions
 spaghetti+0.012
 poems+0.011
 sauce+0.011
 noodles+0.011
 gum+0.011
Neg logit contributions
stru-0.008
 clim-0.007
away-0.007
ester-0.007
or-0.007
Token she
Token ID673
Feature activation+1.73e-02

Pos logit contributions
stru+0.014
 clim+0.012
away+0.012
ris+0.012
ester+0.012
Neg logit contributions
 spaghetti-0.018
 poems-0.017
 sauce-0.016
 masks-0.016
 gum-0.016
Token wasn
Token ID2492
Feature activation+2.65e-02

Pos logit contributions
stru+0.029
away+0.026
 clim+0.026
or+0.025
ester+0.025
Neg logit contributions
 spaghetti-0.029
 poems-0.027
 sauce-0.027
 masks-0.026
 noodles-0.026
Tokenâ
Token ID22940
Feature activation+1.52e-02

Pos logit contributions
stru+0.052
away+0.049
ester+0.046
 clim+0.046
ry+0.045
Neg logit contributions
 spaghetti-0.033
 gum-0.031
 sauce-0.030
 poems-0.029
Ol-0.029
Token
Token ID26391
Feature activation+3.98e-02

Pos logit contributions
stru+0.028
 clim+0.025
ester+0.024
away+0.024
 victorious+0.023
Neg logit contributions
 spaghetti-0.022
 poems-0.021
 sauce-0.021
 gum-0.020
ems-0.020
Token
Token ID8151
Feature activation+7.86e-02

Pos logit contributions
stru+0.063
 clim+0.057
away+0.055
or+0.054
ester+0.054
Neg logit contributions
 spaghetti-0.069
 poems-0.067
 sauce-0.065
 noodles-0.063
 masks-0.062
Tokent
Token ID83
Feature activation+8.41e-04

Pos logit contributions
stru+0.096
away+0.088
 clim+0.086
 adventurer+0.085
 explorer+0.082
Neg logit contributions
 spaghetti-0.154
 gum-0.144
 sauce-0.143
 poems-0.142
bat-0.141
Token a
Token ID257
Feature activation+2.36e-02

Pos logit contributions
stru+0.001
 clim+0.001
away+0.001
or+0.001
ris+0.001
Neg logit contributions
 spaghetti-0.001
 poems-0.001
 sauce-0.001
 noodles-0.001
 masks-0.001
Token threat
Token ID2372
Feature activation+1.17e-03

Pos logit contributions
stru+0.041
or+0.037
ester+0.036
ively+0.036
ris+0.035
Neg logit contributions
 spaghetti-0.039
 sauce-0.037
 poems-0.036
 gum-0.035
 noodles-0.034
Token and
Token ID290
Feature activation+1.24e-02

Pos logit contributions
stru+0.001
 clim+0.001
away+0.001
ester+0.001
or+0.001
Neg logit contributions
 spaghetti-0.003
 poems-0.002
 sauce-0.002
 noodles-0.002
 masks-0.002
Token it
Token ID340
Feature activation-1.21e-02

Pos logit contributions
stru+0.018
 clim+0.016
away+0.015
ris+0.015
or+0.015
Neg logit contributions
 spaghetti-0.023
 poems-0.023
 sauce-0.022
 noodles-0.021
 masks-0.021
Token looked
Token ID3114
Feature activation+1.64e-02

Pos logit contributions
 spaghetti+0.009
 poems+0.009
 sauce+0.009
 noodles+0.009
 gum+0.009
Neg logit contributions
stru-0.008
 clim-0.007
ester-0.007
or-0.007
ris-0.007
Token at
Token ID379
Feature activation+2.95e-02

Pos logit contributions
 clim+0.027
ively+0.026
or+0.026
stru+0.026
 checkpoint+0.025
Neg logit contributions
 spaghetti-0.029
 sauce-0.026
 masks-0.025
 poems-0.025
 noodles-0.024
Token the
Token ID262
Feature activation+2.04e-03

Pos logit contributions
stru+0.043
 clim+0.039
ester+0.036
away+0.036
ris+0.036
Neg logit contributions
 spaghetti-0.057
 poems-0.055
 sauce-0.054
 noodles-0.052
 masks-0.051
Token bug
Token ID5434
Feature activation-2.30e-02

Pos logit contributions
 of+0.003
stru+0.003
 clim+0.003
 year+0.002
way+0.002
Neg logit contributions
ccoli-0.004
bat-0.004
 apr-0.004
 poems-0.004
 elbows-0.003
Token with
Token ID351
Feature activation+1.38e-02

Pos logit contributions
 spaghetti+0.048
 poems+0.047
 sauce+0.046
 noodles+0.045
 masks+0.044
Neg logit contributions
stru-0.029
 clim-0.025
away-0.024
ester-0.023
or-0.023
Token amaz
Token ID40642
Feature activation+7.82e-02

Pos logit contributions
stru+0.019
 clim+0.017
 of+0.017
away+0.016
 victorious+0.016
Neg logit contributions
 spaghetti-0.025
ccoli-0.025
aghetti-0.024
 poems-0.024
ems-0.024
Tokenement
Token ID972
Feature activation+2.82e-02

Pos logit contributions
stru+0.112
 clim+0.092
ur+0.087
urry+0.086
 victorious+0.084
Neg logit contributions
 spaghetti-0.159
 poems-0.154
 gum-0.154
 sauce-0.152
 dough-0.144
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
stru+0.033
 clim+0.029
away+0.027
ester+0.027
or+0.026
Neg logit contributions
 spaghetti-0.061
 poems-0.059
 sauce-0.058
 noodles-0.057
 masks-0.056
Token
Token ID220
Feature activation+1.29e-03

Pos logit contributions
 poems+0.024
 spaghetti+0.024
 sauce+0.023
 noodles+0.023
 masks+0.023
Neg logit contributions
stru-0.015
 clim-0.012
or-0.012
ester-0.012
away-0.012
Token
Token ID198
Feature activation+1.71e-04

Pos logit contributions
stru+0.002
 clim+0.002
away+0.002
ester+0.002
or+0.002
Neg logit contributions
 spaghetti-0.002
 poems-0.002
 sauce-0.002
 noodles-0.002
 masks-0.002
Token
Token ID198
Feature activation+7.33e-03

Pos logit contributions
stru+0.000
 clim+0.000
ester+0.000
away+0.000
ris+0.000
Neg logit contributions
 spaghetti-0.000
 poems-0.000
 sauce-0.000
 gum-0.000
 masks-0.000
Token friends
Token ID2460
Feature activation-3.02e-02

Pos logit contributions
stru+0.015
 clim+0.014
away+0.013
ester+0.013
or+0.013
Neg logit contributions
 spaghetti-0.018
 poems-0.017
 sauce-0.017
 noodles-0.016
 masks-0.016
Token stopped
Token ID5025
Feature activation+1.62e-02

Pos logit contributions
 spaghetti+0.060
 poems+0.058
 sauce+0.056
 noodles+0.055
 gum+0.054
Neg logit contributions
stru-0.042
 clim-0.038
or-0.036
ris-0.035
away-0.035
Token and
Token ID290
Feature activation+1.13e-02

Pos logit contributions
stru+0.020
 clim+0.018
away+0.017
 of+0.017
 forward+0.016
Neg logit contributions
 spaghetti-0.034
 poems-0.033
 sauce-0.032
 noodles-0.032
ems-0.031
Token watched
Token ID7342
Feature activation+1.34e-02

Pos logit contributions
stru+0.016
 clim+0.015
or+0.013
away+0.013
ester+0.013
Neg logit contributions
 spaghetti-0.021
 poems-0.021
 sauce-0.020
 gum-0.020
 noodles-0.020
Token in
Token ID287
Feature activation-3.70e-03

Pos logit contributions
stru+0.014
away+0.012
 clim+0.012
 of+0.011
 forward+0.011
Neg logit contributions
 spaghetti-0.029
 poems-0.029
 sauce-0.028
ems-0.028
 masks-0.028
Token amaz
Token ID40642
Feature activation+7.81e-02

Pos logit contributions
 spaghetti+0.008
 poems+0.007
 sauce+0.007
 noodles+0.007
ccoli+0.007
Neg logit contributions
stru-0.004
 clim-0.004
away-0.004
ris-0.004
 forward-0.004
Tokenement
Token ID972
Feature activation+3.49e-02

Pos logit contributions
stru+0.085
 clim+0.074
ester+0.064
ur+0.062
 reun+0.060
Neg logit contributions
 spaghetti-0.180
 poems-0.176
 sauce-0.167
 gum-0.164
 masks-0.164
Token.
Token ID13
Feature activation-2.06e-03

Pos logit contributions
stru+0.032
 clim+0.027
away+0.026
 of+0.025
or+0.025
Neg logit contributions
 spaghetti-0.083
 poems-0.080
 sauce-0.080
 gum-0.078
 noodles-0.078
Token The
Token ID383
Feature activation+4.90e-03

Pos logit contributions
 poems+0.004
 spaghetti+0.004
 sauce+0.004
 masks+0.004
 elbows+0.004
Neg logit contributions
stru-0.003
 clim-0.003
 forward-0.003
ur-0.002
ris-0.002
Token dog
Token ID3290
Feature activation-2.78e-05

Pos logit contributions
 clim+0.006
stru+0.006
 year+0.006
 of+0.006
 victorious+0.006
Neg logit contributions
 spaghetti-0.009
ems-0.009
 poems-0.009
bat-0.008
ccoli-0.008
Token too
Token ID1165
Feature activation-3.42e-02

Pos logit contributions
 spaghetti+0.000
 poems+0.000
 sauce+0.000
 noodles+0.000
 elbows+0.000
Neg logit contributions
stru-0.000
 clim-0.000
 returned-0.000
 of-0.000
 ever-0.000
Token and
Token ID290
Feature activation-1.62e-03

Pos logit contributions
stru+0.014
 clim+0.013
ester+0.012
away+0.012
ris+0.012
Neg logit contributions
 spaghetti-0.018
 poems-0.018
 sauce-0.017
 noodles-0.017
 masks-0.017
Token it
Token ID340
Feature activation-2.24e-02

Pos logit contributions
 spaghetti+0.003
 poems+0.003
 sauce+0.003
 noodles+0.003
 masks+0.003
Neg logit contributions
stru-0.002
 clim-0.002
or-0.002
 of-0.002
ester-0.002
Token wasn
Token ID2492
Feature activation+1.84e-02

Pos logit contributions
 spaghetti+0.047
 poems+0.046
 sauce+0.045
 gum+0.044
 noodles+0.044
Neg logit contributions
stru-0.028
 clim-0.024
or-0.021
 of-0.021
ester-0.021
Tokenâ
Token ID22940
Feature activation+2.09e-02

Pos logit contributions
stru+0.028
away+0.025
 clim+0.024
ester+0.024
or+0.023
Neg logit contributions
 spaghetti-0.034
 poems-0.031
 sauce-0.031
 gum-0.031
 noodles-0.030
Token
Token ID26391
Feature activation+4.28e-02

Pos logit contributions
stru+0.037
 clim+0.034
away+0.033
ester+0.032
ris+0.032
Neg logit contributions
 spaghetti-0.032
 poems-0.031
 sauce-0.030
 noodles-0.029
 gum-0.029
Token
Token ID8151
Feature activation+7.79e-02

Pos logit contributions
stru+0.062
 clim+0.056
away+0.052
ester+0.052
or+0.052
Neg logit contributions
 spaghetti-0.081
 poems-0.078
 sauce-0.077
 noodles-0.075
 masks-0.074
Tokent
Token ID83
Feature activation-6.00e-03

Pos logit contributions
stru+0.091
away+0.086
 clim+0.082
 adventurer+0.080
 explorer+0.077
Neg logit contributions
 spaghetti-0.156
 sauce-0.147
 gum-0.146
bat-0.145
 poems-0.144
Token long
Token ID890
Feature activation+1.27e-02

Pos logit contributions
 poems+0.012
 sauce+0.012
 spaghetti+0.012
 salsa+0.012
 masks+0.011
Neg logit contributions
 clim-0.007
stru-0.007
 of-0.007
 ever-0.007
 year-0.006
Token until
Token ID1566
Feature activation+1.89e-02

Pos logit contributions
stru+0.013
 clim+0.012
away+0.011
 of+0.010
or+0.010
Neg logit contributions
 spaghetti-0.029
 poems-0.028
 sauce-0.027
 noodles-0.027
 masks-0.027
Token Chloe
Token ID29476
Feature activation+3.41e-02

Pos logit contributions
stru+0.023
 clim+0.021
away+0.020
or+0.019
ester+0.019
Neg logit contributions
 spaghetti-0.039
 poems-0.038
 sauce-0.037
 noodles-0.036
 masks-0.036
Token felt
Token ID2936
Feature activation+1.76e-02

Pos logit contributions
stru+0.048
 clim+0.043
away+0.042
ester+0.040
or+0.040
Neg logit contributions
 spaghetti-0.065
 poems-0.062
 sauce-0.061
 noodles-0.060
 masks-0.059
Token The
Token ID383
Feature activation-2.88e-04

Pos logit contributions
 poems+0.012
 spaghetti+0.011
 masks+0.011
 noodles+0.011
 sauce+0.011
Neg logit contributions
stru-0.007
 clim-0.006
 of-0.006
old-0.006
ur-0.006
Token little
Token ID1310
Feature activation+1.17e-02

Pos logit contributions
ems+0.000
bat+0.000
ccoli+0.000
 elbows+0.000
aghetti+0.000
Neg logit contributions
 year-0.000
 of-0.000
 taxi-0.000
 clim-0.000
 palace-0.000
Token girl
Token ID2576
Feature activation+9.77e-03

Pos logit contributions
 clim+0.010
 of+0.010
 year+0.009
-+0.009
stru+0.008
Neg logit contributions
ccoli-0.026
aghetti-0.026
 poems-0.026
 elbows-0.025
ems-0.025
Token watched
Token ID7342
Feature activation+1.22e-02

Pos logit contributions
stru+0.012
 clim+0.011
 of+0.010
 year+0.010
 ever+0.010
Neg logit contributions
 spaghetti-0.020
 poems-0.020
 masks-0.019
 sauce-0.019
 elbows-0.019
Token with
Token ID351
Feature activation+6.58e-03

Pos logit contributions
stru+0.011
 of+0.011
 forward+0.010
ively+0.010
 clim+0.010
Neg logit contributions
ems-0.026
ccoli-0.026
 spaghetti-0.026
 masks-0.026
 elbows-0.026
Token amaz
Token ID40642
Feature activation+7.78e-02

Pos logit contributions
stru+0.008
 clim+0.008
 of+0.007
 forward+0.007
away+0.007
Neg logit contributions
 poems-0.012
ems-0.012
 spaghetti-0.012
ccoli-0.012
 sauce-0.012
Tokenement
Token ID972
Feature activation+3.41e-02

Pos logit contributions
stru+0.073
 clim+0.062
ester+0.053
ris+0.053
away+0.053
Neg logit contributions
 spaghetti-0.189
 poems-0.186
 sauce-0.179
 gum-0.175
 noodles-0.174
Token when
Token ID618
Feature activation+5.83e-03

Pos logit contributions
stru+0.031
 clim+0.027
 of+0.025
away+0.024
or+0.023
Neg logit contributions
 spaghetti-0.080
 poems-0.078
 sauce-0.078
 masks-0.076
 noodles-0.076
Token her
Token ID607
Feature activation+1.25e-02

Pos logit contributions
stru+0.006
 clim+0.005
 ever+0.005
 of+0.005
 forward+0.005
Neg logit contributions
 spaghetti-0.013
 poems-0.013
 sauce-0.013
ccoli-0.013
 noodles-0.012
Token mom
Token ID1995
Feature activation-8.15e-03

Pos logit contributions
 of+0.013
stru+0.013
-+0.013
 year+0.013
 clim+0.012
Neg logit contributions
ems-0.024
bat-0.024
aghetti-0.023
 elbows-0.023
 spaghetti-0.023
Tokenmy
Token ID1820
Feature activation+5.09e-03

Pos logit contributions
 poems+0.016
 spaghetti+0.015
 noodles+0.015
 elbows+0.015
 sauce+0.015
Neg logit contributions
stru-0.011
 clim-0.010
 of-0.010
ris-0.010
ester-0.010
Token egg
Token ID5935
Feature activation-2.16e-02

Pos logit contributions
 spaghetti+0.008
 poems+0.008
 sauce+0.008
 masks+0.008
 noodles+0.008
Neg logit contributions
stru-0.006
 clim-0.006
away-0.005
ris-0.005
 of-0.005
Token!
Token ID0
Feature activation+1.41e-03

Pos logit contributions
 poems+0.049
 spaghetti+0.048
 masks+0.046
 sauce+0.045
 noodles+0.045
Neg logit contributions
stru-0.021
 clim-0.020
 of-0.019
 ever-0.018
away-0.017
Token He
Token ID679
Feature activation-1.20e-03

Pos logit contributions
stru+0.002
 clim+0.002
 of+0.002
old+0.002
ris+0.002
Neg logit contributions
 poems-0.003
 spaghetti-0.003
 masks-0.003
 sauce-0.003
 noodles-0.003
Token watched
Token ID7342
Feature activation+1.33e-02

Pos logit contributions
 spaghetti+0.003
 poems+0.002
 sauce+0.002
 gum+0.002
 masks+0.002
Neg logit contributions
stru-0.001
 clim-0.001
ris-0.001
 of-0.001
away-0.001
Token in
Token ID287
Feature activation-1.03e-02

Pos logit contributions
stru+0.014
 clim+0.012
 of+0.012
away+0.012
ively+0.011
Neg logit contributions
 spaghetti-0.029
 poems-0.029
 masks-0.028
ems-0.028
 sauce-0.028
Token amaz
Token ID40642
Feature activation+7.78e-02

Pos logit contributions
 spaghetti+0.021
 poems+0.021
 sauce+0.020
ems+0.020
ccoli+0.020
Neg logit contributions
stru-0.012
 clim-0.011
away-0.011
 of-0.010
 forward-0.010
Tokenement
Token ID972
Feature activation+3.19e-02

Pos logit contributions
stru+0.077
 clim+0.065
ester+0.060
or+0.057
ris+0.056
Neg logit contributions
 spaghetti-0.184
 poems-0.177
 sauce-0.172
 noodles-0.168
 gum-0.168
Token as
Token ID355
Feature activation-1.21e-02

Pos logit contributions
stru+0.028
 clim+0.024
away+0.022
 of+0.021
or+0.020
Neg logit contributions
 spaghetti-0.078
 poems-0.076
 sauce-0.075
 gum-0.073
 masks-0.073
Token the
Token ID262
Feature activation+1.29e-02

Pos logit contributions
 spaghetti+0.025
 poems+0.024
 sauce+0.023
 noodles+0.023
 masks+0.023
Neg logit contributions
stru-0.015
 clim-0.014
away-0.013
or-0.012
ris-0.012
Token egg
Token ID5935
Feature activation+1.17e-03

Pos logit contributions
stru+0.019
 clim+0.018
 year+0.017
 of+0.016
away+0.016
Neg logit contributions
 poems-0.023
 spaghetti-0.023
ems-0.022
 masks-0.022
 sauce-0.021
Token began
Token ID2540
Feature activation+7.04e-03

Pos logit contributions
stru+0.001
 clim+0.001
 of+0.001
 ever+0.001
or+0.001
Neg logit contributions
 spaghetti-0.002
 poems-0.002
 sauce-0.002
 masks-0.002
 gum-0.002
Token watched
Token ID7342
Feature activation+1.47e-02

Pos logit contributions
stru+0.033
away+0.030
 clim+0.030
ester+0.030
or+0.028
Neg logit contributions
 spaghetti-0.031
 poems-0.030
 sauce-0.030
 masks-0.029
 noodles-0.029
Token the
Token ID262
Feature activation-3.39e-03

Pos logit contributions
stru+0.019
 clim+0.016
away+0.016
ester+0.015
or+0.015
Neg logit contributions
 spaghetti-0.030
 poems-0.029
 sauce-0.029
 noodles-0.028
 masks-0.028
Token k
Token ID479
Feature activation+5.77e-03

Pos logit contributions
 spaghetti+0.006
 poems+0.006
 sauce+0.006
 noodles+0.006
ccoli+0.006
Neg logit contributions
stru-0.005
 clim-0.005
away-0.004
ester-0.004
or-0.004
Tokenite
Token ID578
Feature activation+1.42e-02

Pos logit contributions
stru+0.007
 clim+0.006
away+0.006
ris+0.006
or+0.006
Neg logit contributions
 spaghetti-0.011
bat-0.011
 poems-0.011
 gum-0.011
 yarn-0.011
Token in
Token ID287
Feature activation+5.61e-03

Pos logit contributions
stru+0.020
 clim+0.018
away+0.017
or+0.016
ester+0.016
Neg logit contributions
 spaghetti-0.028
 poems-0.027
 sauce-0.026
 noodles-0.026
 masks-0.025
Token amaz
Token ID40642
Feature activation+7.77e-02

Pos logit contributions
stru+0.008
 clim+0.007
away+0.006
ester+0.006
or+0.006
Neg logit contributions
 spaghetti-0.011
 poems-0.011
 sauce-0.011
 noodles-0.010
 masks-0.010
Tokenement
Token ID972
Feature activation+3.66e-02

Pos logit contributions
stru+0.101
 clim+0.086
ur+0.079
ester+0.078
urry+0.074
Neg logit contributions
 spaghetti-0.158
 poems-0.154
 sauce-0.146
 gum-0.146
 dough-0.146
Token,
Token ID11
Feature activation-9.69e-03

Pos logit contributions
stru+0.038
 clim+0.033
away+0.031
or+0.029
ester+0.029
Neg logit contributions
 spaghetti-0.085
 poems-0.081
 sauce-0.081
 noodles-0.079
 masks-0.078
Token and
Token ID290
Feature activation-2.09e-03

Pos logit contributions
 spaghetti+0.021
 poems+0.020
 sauce+0.020
 noodles+0.019
 gum+0.019
Neg logit contributions
stru-0.012
 clim-0.010
 forward-0.010
away-0.009
 victorious-0.009
Token Little
Token ID7703
Feature activation+1.34e-02

Pos logit contributions
 spaghetti+0.004
 poems+0.004
 sauce+0.004
 noodles+0.004
 gum+0.004
Neg logit contributions
stru-0.003
 clim-0.002
away-0.002
or-0.002
ester-0.002
Token Joe
Token ID5689
Feature activation+1.46e-02

Pos logit contributions
stru+0.016
 clim+0.014
 ever+0.013
way+0.012
 year+0.012
Neg logit contributions
 poems-0.030
 spaghetti-0.029
 sauce-0.028
 pens-0.027
 elbows-0.027
Token like
Token ID588
Feature activation-4.64e-03

Pos logit contributions
stru+0.023
 clim+0.020
away+0.020
ester+0.019
or+0.019
Neg logit contributions
 spaghetti-0.037
 poems-0.036
 sauce-0.035
 noodles-0.034
 masks-0.034
Token he
Token ID339
Feature activation+3.68e-03

Pos logit contributions
 spaghetti+0.009
 poems+0.009
 sauce+0.008
ccoli+0.008
 noodles+0.008
Neg logit contributions
stru-0.006
 clim-0.006
away-0.005
 of-0.005
or-0.005
Token wasn
Token ID2492
Feature activation+2.42e-02

Pos logit contributions
stru+0.004
 clim+0.004
ris+0.004
away+0.004
or+0.004
Neg logit contributions
 spaghetti-0.008
 poems-0.008
 sauce-0.007
 noodles-0.007
 masks-0.007
Tokenâ
Token ID22940
Feature activation+2.28e-02

Pos logit contributions
stru+0.039
away+0.035
 clim+0.034
ester+0.033
ris+0.033
Neg logit contributions
 spaghetti-0.040
 poems-0.038
 gum-0.038
 sauce-0.038
 yarn-0.036
Token
Token ID26391
Feature activation+4.99e-02

Pos logit contributions
stru+0.042
 clim+0.038
away+0.037
ester+0.036
ris+0.036
Neg logit contributions
 spaghetti-0.033
 poems-0.032
 sauce-0.031
 gum-0.030
 noodles-0.029
Token
Token ID8151
Feature activation+7.76e-02

Pos logit contributions
stru+0.073
 clim+0.066
or+0.062
ester+0.062
away+0.061
Neg logit contributions
 spaghetti-0.094
 poems-0.090
 sauce-0.089
 noodles-0.087
 masks-0.085
Tokent
Token ID83
Feature activation-6.20e-03

Pos logit contributions
stru+0.081
 clim+0.070
away+0.070
 checkpoint+0.061
ris+0.059
Neg logit contributions
 spaghetti-0.172
 sauce-0.164
 poems-0.163
 gum-0.162
bat-0.158
Token successful
Token ID4388
Feature activation+1.01e-02

Pos logit contributions
 spaghetti+0.012
 poems+0.012
 sauce+0.012
 noodles+0.012
 masks+0.011
Neg logit contributions
stru-0.008
 clim-0.008
away-0.007
ester-0.007
 forward-0.007
Token.
Token ID13
Feature activation-3.84e-03

Pos logit contributions
stru+0.010
 clim+0.009
away+0.009
or+0.009
ester+0.008
Neg logit contributions
 spaghetti-0.023
 poems-0.022
 sauce-0.022
 noodles-0.021
 masks-0.021
Token<|endoftext|>
Token ID50256
Feature activation-1.96e-02

Pos logit contributions
 spaghetti+0.007
 poems+0.007
 sauce+0.007
 noodles+0.007
 masks+0.007
Neg logit contributions
stru-0.006
 clim-0.005
away-0.005
ester-0.005
or-0.005
TokenOnce
Token ID7454
Feature activation+1.49e-02

Pos logit contributions
 poems+0.039
 spaghetti+0.039
 sauce+0.037
 noodles+0.036
 masks+0.036
Neg logit contributions
stru-0.025
 clim-0.022
or-0.022
ris-0.021
 of-0.021
Token,
Token ID11
Feature activation-7.13e-03

Pos logit contributions
 elbows+0.070
irms+0.068
ems+0.068
 poems+0.067
 guitars+0.066
Neg logit contributions
 of-0.020
--0.016
 ever-0.014
 clim-0.013
ester-0.013
Token sauce
Token ID10746
Feature activation-3.84e-02

Pos logit contributions
bat+0.012
 elbows+0.011
ems+0.011
irms+0.011
bows+0.010
Neg logit contributions
 of-0.012
 where-0.010
 that-0.009
,-0.008
 as-0.008
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
 elbows+0.100
 poems+0.095
irms+0.095
ems+0.094
bat+0.094
Neg logit contributions
 of-0.027
 ever-0.019
--0.019
 clim-0.018
or-0.016
Token cheese
Token ID9891
Feature activation-3.69e-02

Pos logit contributions
 elbows+0.027
bat+0.027
irms+0.026
 poems+0.026
ems+0.026
Neg logit contributions
 of-0.016
 ever-0.012
--0.012
 where-0.012
stru-0.011
Token and
Token ID290
Feature activation-7.44e-03

Pos logit contributions
 elbows+0.107
irms+0.102
ems+0.100
 guitars+0.100
 poems+0.099
Neg logit contributions
 of-0.016
--0.010
,-0.008
 clim-0.007
 ever-0.007
Token topp
Token ID23126
Feature activation-8.08e-02

Pos logit contributions
bat+0.011
ems+0.010
 elbows+0.010
aghetti+0.010
irms+0.010
Neg logit contributions
 of-0.013
 ever-0.011
 that-0.011
 return-0.010
 clim-0.010
Tokenings
Token ID654
Feature activation-2.59e-02

Pos logit contributions
 spaghetti+0.176
 poems+0.173
ems+0.168
 knit+0.167
 masks+0.166
Neg logit contributions
--0.081
or-0.074
stru-0.074
ook-0.066
 of-0.066
Token.
Token ID13
Feature activation+2.15e-03

Pos logit contributions
 elbows+0.066
 poems+0.061
irms+0.060
ems+0.060
bat+0.059
Neg logit contributions
 of-0.021
--0.016
 forward-0.016
 ever-0.014
or-0.013
Token She
Token ID1375
Feature activation-1.45e-03

Pos logit contributions
 of+0.003
 forward+0.002
stru+0.002
 Once+0.002
 sadly+0.002
Neg logit contributions
 poems-0.004
 elbows-0.004
bat-0.004
ccoli-0.004
aghetti-0.004
Token showed
Token ID3751
Feature activation+6.99e-03

Pos logit contributions
 elbows+0.003
 poems+0.003
bat+0.002
ems+0.002
 spaghetti+0.002
Neg logit contributions
stru-0.002
 of-0.002
 clim-0.002
ris-0.002
 ever-0.002
Token them
Token ID606
Feature activation+2.72e-03

Pos logit contributions
 of+0.008
stru+0.008
or+0.007
ively+0.007
away+0.007
Neg logit contributions
 elbows-0.014
bat-0.014
ccoli-0.014
 poems-0.013
ems-0.013
Token He
Token ID679
Feature activation-1.03e-02

Pos logit contributions
 spaghetti+0.009
 poems+0.009
 sauce+0.009
 masks+0.009
 noodles+0.009
Neg logit contributions
stru-0.008
 clim-0.007
or-0.006
ester-0.006
ris-0.006
Token washed
Token ID18989
Feature activation-3.31e-02

Pos logit contributions
 poems+0.020
 spaghetti+0.019
bat+0.019
 elbows+0.019
ems+0.019
Neg logit contributions
stru-0.013
 clim-0.012
ris-0.012
 of-0.012
 ever-0.011
Token it
Token ID340
Feature activation-3.46e-02

Pos logit contributions
 poems+0.074
 spaghetti+0.073
ccoli+0.070
ems+0.070
 masks+0.068
Neg logit contributions
stru-0.032
or-0.031
away-0.029
 forward-0.029
 of-0.027
Token with
Token ID351
Feature activation+4.68e-03

Pos logit contributions
 poems+0.079
 spaghetti+0.077
ems+0.074
 masks+0.074
 sauce+0.073
Neg logit contributions
stru-0.034
 clim-0.029
away-0.028
 of-0.027
or-0.027
Token so
Token ID523
Feature activation-1.41e-02

Pos logit contributions
 of+0.006
stru+0.005
 return+0.005
 home+0.005
 forward+0.005
Neg logit contributions
ems-0.010
 poems-0.009
ccoli-0.009
aghetti-0.009
bat-0.009
Tokenapy
Token ID12826
Feature activation-7.66e-02

Pos logit contributions
 spaghetti+0.030
 poems+0.029
 sauce+0.028
 noodles+0.027
 masks+0.027
Neg logit contributions
stru-0.018
 clim-0.015
away-0.015
or-0.014
ester-0.014
Token water
Token ID1660
Feature activation-2.21e-02

Pos logit contributions
bat+0.138
 poems+0.135
ccoli+0.133
ems+0.128
aghetti+0.125
Neg logit contributions
 of-0.116
--0.097
 ever-0.092
away-0.090
 year-0.090
Token and
Token ID290
Feature activation-3.80e-03

Pos logit contributions
 poems+0.052
 spaghetti+0.052
 elbows+0.049
 masks+0.048
ems+0.048
Neg logit contributions
stru-0.018
 of-0.017
 clim-0.016
or-0.015
 forward-0.015
Token dried
Token ID16577
Feature activation-8.16e-04

Pos logit contributions
bat+0.006
ems+0.006
 poems+0.006
ccoli+0.006
aghetti+0.006
Neg logit contributions
 of-0.005
stru-0.005
 ever-0.005
 returned-0.005
 clim-0.005
Token it
Token ID340
Feature activation-2.17e-02

Pos logit contributions
 spaghetti+0.002
 poems+0.002
 sauce+0.002
 noodles+0.002
 masks+0.002
Neg logit contributions
stru-0.001
away-0.001
or-0.001
 clim-0.001
ester-0.001
Token with
Token ID351
Feature activation+1.13e-02

Pos logit contributions
 spaghetti+0.049
 poems+0.048
 sauce+0.046
 noodles+0.046
 masks+0.045
Neg logit contributions
stru-0.023
 clim-0.019
away-0.019
or-0.018
 forward-0.017
Token a
Token ID257
Feature activation+5.18e-03

Pos logit contributions
stru+0.000
 clim+0.000
 ever+0.000
 returned+0.000
 of+0.000
Neg logit contributions
 poems-0.000
ccoli-0.000
 spaghetti-0.000
ems-0.000
aghetti-0.000
Token big
Token ID1263
Feature activation-1.05e-02

Pos logit contributions
stru+0.007
 clim+0.007
away+0.006
ester+0.006
ris+0.006
Neg logit contributions
 spaghetti-0.010
 poems-0.010
 sauce-0.009
 noodles-0.009
 masks-0.009
Token bucket
Token ID19236
Feature activation-2.53e-02

Pos logit contributions
 spaghetti+0.019
 poems+0.019
 sauce+0.018
 noodles+0.018
 masks+0.017
Neg logit contributions
stru-0.015
 clim-0.014
away-0.013
ester-0.013
ris-0.013
Token of
Token ID286
Feature activation-1.64e-02

Pos logit contributions
 spaghetti+0.062
 poems+0.060
 sauce+0.059
 noodles+0.058
 masks+0.057
Neg logit contributions
stru-0.022
 clim-0.019
away-0.017
ester-0.016
or-0.016
Token so
Token ID523
Feature activation-1.76e-02

Pos logit contributions
 poems+0.026
ccoli+0.025
ems+0.025
 spaghetti+0.024
bat+0.023
Neg logit contributions
stru-0.027
 of-0.026
away-0.025
 clim-0.024
 year-0.023
Tokenapy
Token ID12826
Feature activation-7.50e-02

Pos logit contributions
 spaghetti+0.034
 poems+0.032
 sauce+0.032
 noodles+0.031
 gum+0.031
Neg logit contributions
stru-0.025
 clim-0.022
away-0.021
ester-0.021
ris-0.021
Token water
Token ID1660
Feature activation-1.59e-02

Pos logit contributions
 poems+0.113
 spaghetti+0.109
ccoli+0.104
ems+0.098
bat+0.098
Neg logit contributions
stru-0.129
 of-0.123
 clim-0.121
away-0.120
or-0.115
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
 poems+0.037
 spaghetti+0.037
 sauce+0.034
 noodles+0.034
ccoli+0.034
Neg logit contributions
stru-0.014
 clim-0.013
 of-0.012
away-0.012
or-0.012
Token She
Token ID1375
Feature activation-2.64e-03

Pos logit contributions
 poems+0.021
 spaghetti+0.020
 masks+0.020
 gum+0.020
bat+0.020
Neg logit contributions
stru-0.017
 clim-0.015
old-0.014
 forward-0.014
 of-0.014
Token said
Token ID531
Feature activation+2.33e-03

Pos logit contributions
 spaghetti+0.005
 poems+0.004
 sauce+0.004
 noodles+0.004
 gum+0.004
Neg logit contributions
stru-0.004
 clim-0.004
ester-0.004
ris-0.004
away-0.004
Token,
Token ID11
Feature activation-3.22e-02

Pos logit contributions
stru+0.002
 clim+0.001
away+0.001
or+0.001
 of+0.001
Neg logit contributions
 spaghetti-0.006
 poems-0.006
 sauce-0.006
 masks-0.006
 noodles-0.006
Token in
Token ID287
Feature activation+1.47e-02

Pos logit contributions
stru+0.021
 clim+0.019
 forward+0.017
 of+0.017
 ever+0.017
Neg logit contributions
 spaghetti-0.057
 poems-0.056
 sauce-0.056
 noodles-0.054
 gum-0.054
Token the
Token ID262
Feature activation+2.80e-03

Pos logit contributions
stru+0.019
 clim+0.017
away+0.017
 of+0.016
ris+0.016
Neg logit contributions
 poems-0.028
 spaghetti-0.028
 sauce-0.027
ccoli-0.027
 noodles-0.027
Token warm
Token ID5814
Feature activation-1.44e-02

Pos logit contributions
stru+0.004
 of+0.004
away+0.004
 forward+0.004
 year+0.004
Neg logit contributions
 poems-0.005
ccoli-0.004
ems-0.004
 spaghetti-0.004
bat-0.004
Token,
Token ID11
Feature activation-2.76e-03

Pos logit contributions
 spaghetti+0.020
 poems+0.019
 sauce+0.018
 noodles+0.017
 masks+0.017
Neg logit contributions
stru-0.028
 clim-0.026
away-0.025
or-0.025
ester-0.025
Token so
Token ID523
Feature activation-4.09e-03

Pos logit contributions
 poems+0.005
ems+0.005
ccoli+0.005
bat+0.005
 spaghetti+0.004
Neg logit contributions
stru-0.004
 of-0.004
 clim-0.004
 forward-0.004
away-0.003
Tokenapy
Token ID12826
Feature activation-7.43e-02

Pos logit contributions
 spaghetti+0.008
 poems+0.008
 sauce+0.008
 noodles+0.008
 masks+0.008
Neg logit contributions
stru-0.005
 clim-0.005
away-0.005
or-0.004
ester-0.004
Token water
Token ID1660
Feature activation-1.56e-02

Pos logit contributions
 poems+0.124
ccoli+0.122
bat+0.116
 spaghetti+0.114
ems+0.113
Neg logit contributions
 of-0.118
stru-0.105
away-0.103
--0.102
 ever-0.101
Token.
Token ID13
Feature activation-9.40e-03

Pos logit contributions
 poems+0.036
 spaghetti+0.036
ccoli+0.034
 sauce+0.033
 gum+0.033
Neg logit contributions
stru-0.014
 of-0.013
 clim-0.013
or-0.012
 ever-0.012
Token She
Token ID1375
Feature activation-3.08e-03

Pos logit contributions
 spaghetti+0.018
 poems+0.017
 sauce+0.017
 noodles+0.016
 masks+0.016
Neg logit contributions
stru-0.014
 clim-0.012
away-0.012
ester-0.012
or-0.012
Token spl
Token ID4328
Feature activation-3.03e-02

Pos logit contributions
 poems+0.006
 spaghetti+0.006
 sauce+0.006
 gum+0.006
bat+0.006
Neg logit contributions
stru-0.004
 clim-0.003
or-0.003
 of-0.003
 ever-0.003
Tokenashed
Token ID5263
Feature activation-3.90e-02

Pos logit contributions
 poems+0.062
 spaghetti+0.062
 sauce+0.058
 gum+0.058
 masks+0.057
Neg logit contributions
stru-0.040
or-0.035
ester-0.034
away-0.033
 clim-0.033
Token in
Token ID287
Feature activation+7.67e-03

Pos logit contributions
stru+0.006
 clim+0.006
away+0.005
ester+0.005
or+0.005
Neg logit contributions
 spaghetti-0.012
 poems-0.011
 sauce-0.011
 noodles-0.011
 masks-0.011
Token the
Token ID262
Feature activation+6.69e-03

Pos logit contributions
stru+0.012
 clim+0.011
away+0.010
ester+0.010
or+0.010
Neg logit contributions
 spaghetti-0.014
 poems-0.014
 sauce-0.013
 noodles-0.013
 masks-0.013
Token warm
Token ID5814
Feature activation-1.35e-02

Pos logit contributions
stru+0.012
 clim+0.011
away+0.011
or+0.011
ester+0.011
Neg logit contributions
 poems-0.009
 spaghetti-0.009
ccoli-0.009
ems-0.009
 sauce-0.009
Token,
Token ID11
Feature activation-8.85e-03

Pos logit contributions
 spaghetti+0.017
 sauce+0.016
 poems+0.015
 noodles+0.015
 masks+0.014
Neg logit contributions
stru-0.029
 clim-0.027
ris-0.026
away-0.025
ester-0.025
Token so
Token ID523
Feature activation-3.29e-03

Pos logit contributions
 poems+0.016
ems+0.016
ccoli+0.016
aghetti+0.015
 spaghetti+0.015
Neg logit contributions
stru-0.012
 of-0.011
 clim-0.011
 forward-0.011
 ever-0.010
Tokenapy
Token ID12826
Feature activation-7.37e-02

Pos logit contributions
 spaghetti+0.008
 poems+0.007
 sauce+0.007
 masks+0.007
 noodles+0.007
Neg logit contributions
stru-0.003
 clim-0.003
 of-0.003
away-0.003
or-0.003
Token water
Token ID1660
Feature activation-1.84e-02

Pos logit contributions
 poems+0.118
ccoli+0.116
 spaghetti+0.111
bat+0.110
ems+0.108
Neg logit contributions
 of-0.118
stru-0.110
away-0.106
 clim-0.106
 ever-0.105
Token.
Token ID13
Feature activation-5.18e-03

Pos logit contributions
 spaghetti+0.042
 poems+0.042
 sauce+0.039
ccoli+0.039
 noodles+0.039
Neg logit contributions
stru-0.017
 clim-0.015
 of-0.015
away-0.014
or-0.014
Token His
Token ID2399
Feature activation+4.43e-03

Pos logit contributions
 poems+0.011
 masks+0.011
 elbows+0.011
ems+0.011
 spaghetti+0.011
Neg logit contributions
stru-0.005
 clim-0.005
 of-0.005
ris-0.004
way-0.004
Token mom
Token ID1995
Feature activation-1.86e-02

Pos logit contributions
stru+0.006
 clim+0.005
ester+0.005
 year+0.005
away+0.005
Neg logit contributions
 spaghetti-0.008
 poems-0.008
ems-0.008
 sauce-0.008
 noodles-0.008
Token and
Token ID290
Feature activation-1.97e-04

Pos logit contributions
 spaghetti+0.037
 poems+0.036
 elbows+0.035
 sauce+0.035
 noodles+0.035
Neg logit contributions
stru-0.025
 clim-0.023
 of-0.022
ester-0.021
ris-0.021
Token.
Token ID13
Feature activation-6.12e-03

Pos logit contributions
stru+0.013
ris+0.012
 clim+0.012
away+0.012
ester+0.011
Neg logit contributions
 spaghetti-0.017
 poems-0.016
 sauce-0.016
 gum-0.016
 noodles-0.015
Token It
Token ID632
Feature activation-1.45e-02

Pos logit contributions
 spaghetti+0.010
 poems+0.010
 sauce+0.010
 noodles+0.010
 masks+0.010
Neg logit contributions
stru-0.010
 clim-0.009
away-0.008
or-0.008
ester-0.008
Token's
Token ID338
Feature activation-1.05e-02

Pos logit contributions
 poems+0.041
 spaghetti+0.040
ems+0.039
ities+0.039
 elbows+0.039
Neg logit contributions
stru-0.007
 clim-0.005
 ever-0.005
 returned-0.005
 of-0.004
Token all
Token ID477
Feature activation+2.38e-03

Pos logit contributions
ems+0.021
 poems+0.021
 spaghetti+0.020
 sauce+0.020
 noodles+0.020
Neg logit contributions
 of-0.012
 clim-0.011
stru-0.011
 forward-0.011
 luckily-0.010
Token so
Token ID523
Feature activation-2.07e-02

Pos logit contributions
stru+0.003
 of+0.003
 clim+0.003
 ever+0.002
 forward+0.002
Neg logit contributions
 poems-0.005
ities-0.005
 elbows-0.005
 spaghetti-0.005
ems-0.005
Tokenapy
Token ID12826
Feature activation-7.34e-02

Pos logit contributions
 poems+0.037
 spaghetti+0.036
ems+0.036
 elbows+0.035
 sauce+0.035
Neg logit contributions
stru-0.031
 of-0.028
 clim-0.028
away-0.027
ester-0.026
Token and
Token ID290
Feature activation-5.73e-03

Pos logit contributions
ccoli+0.137
 poems+0.137
bat+0.132
ems+0.131
 knit+0.127
Neg logit contributions
 of-0.101
 ever-0.092
stru-0.086
 clim-0.086
away-0.085
Token wide
Token ID3094
Feature activation+2.31e-02

Pos logit contributions
ems+0.011
ccoli+0.010
bat+0.010
 poems+0.010
ksh+0.010
Neg logit contributions
 ever-0.007
 of-0.007
 now-0.006
 luckily-0.006
 quickly-0.006
Token!
Token ID0
Feature activation+7.10e-03

Pos logit contributions
stru+0.017
 clim+0.016
 of+0.014
 ever+0.013
away+0.013
Neg logit contributions
 spaghetti-0.056
 poems-0.056
 sauce-0.054
ems-0.054
 apr-0.053
Token \n
Token ID198
Feature activation+4.72e-03

Pos logit contributions
stru+0.010
 clim+0.009
ris+0.009
 forward+0.009
 of+0.008
Neg logit contributions
 poems-0.013
 spaghetti-0.013
 sauce-0.012
 masks-0.012
 noodles-0.012
Token \n
Token ID198
Feature activation+9.34e-03

Pos logit contributions
stru+0.006
 clim+0.006
away+0.005
or+0.005
ris+0.005
Neg logit contributions
 spaghetti-0.009
 poems-0.009
 sauce-0.009
 noodles-0.009
 masks-0.008
Token a
Token ID257
Feature activation+9.11e-03

Pos logit contributions
 spaghetti+0.044
 poems+0.043
 sauce+0.041
 noodles+0.041
 masks+0.040
Neg logit contributions
stru-0.034
 clim-0.030
away-0.029
or-0.029
ester-0.028
Token cloth
Token ID16270
Feature activation-1.54e-02

Pos logit contributions
stru+0.018
 clim+0.017
away+0.016
or+0.016
 of+0.016
Neg logit contributions
 poems-0.012
 spaghetti-0.012
 noodles-0.011
 sauce-0.011
 masks-0.010
Token and
Token ID290
Feature activation+3.13e-03

Pos logit contributions
 spaghetti+0.031
 poems+0.030
 sauce+0.029
 noodles+0.028
 masks+0.028
Neg logit contributions
stru-0.020
 clim-0.018
away-0.017
ester-0.017
or-0.017
Token some
Token ID617
Feature activation-2.34e-02

Pos logit contributions
stru+0.004
 clim+0.004
away+0.004
or+0.004
ester+0.004
Neg logit contributions
 spaghetti-0.006
 poems-0.006
 sauce-0.006
 noodles-0.005
 masks-0.005
Token so
Token ID523
Feature activation-9.11e-03

Pos logit contributions
ccoli+0.029
bat+0.028
 poems+0.028
ems+0.027
 spaghetti+0.027
Neg logit contributions
 of-0.047
stru-0.044
 clim-0.041
--0.041
away-0.040
Tokenapy
Token ID12826
Feature activation-7.22e-02

Pos logit contributions
 spaghetti+0.016
 poems+0.015
 sauce+0.015
 noodles+0.014
 masks+0.014
Neg logit contributions
stru-0.014
 clim-0.013
away-0.013
ester-0.012
or-0.012
Token water
Token ID1660
Feature activation-1.48e-02

Pos logit contributions
 poems+0.113
ccoli+0.110
 spaghetti+0.106
bat+0.105
ems+0.100
Neg logit contributions
 of-0.120
stru-0.113
--0.107
 clim-0.107
away-0.107
Token and
Token ID290
Feature activation+4.51e-03

Pos logit contributions
 poems+0.035
 spaghetti+0.035
 noodles+0.032
 sauce+0.032
ccoli+0.032
Neg logit contributions
stru-0.013
 clim-0.012
 of-0.011
or-0.011
away-0.010
Token got
Token ID1392
Feature activation+5.78e-04

Pos logit contributions
stru+0.006
 of+0.005
 clim+0.005
-+0.005
 ever+0.005
Neg logit contributions
 poems-0.009
 spaghetti-0.008
ccoli-0.008
bat-0.008
ems-0.008
Token to
Token ID284
Feature activation+1.68e-02

Pos logit contributions
stru+0.001
 clim+0.001
 forward+0.001
 of+0.001
or+0.001
Neg logit contributions
 poems-0.001
 spaghetti-0.001
ems-0.001
ccoli-0.001
 sauce-0.001
Token work
Token ID670
Feature activation-1.49e-03

Pos logit contributions
stru+0.023
 clim+0.021
away+0.021
or+0.019
 reun+0.019
Neg logit contributions
 poems-0.032
 spaghetti-0.031
 noodles-0.029
 masks-0.029
 sauce-0.029
Token it
Token ID340
Feature activation-2.71e-02

Pos logit contributions
 poems+0.022
 spaghetti+0.022
ems+0.022
ccoli+0.021
 masks+0.021
Neg logit contributions
stru-0.008
 of-0.008
--0.008
 forward-0.008
or-0.007
Token with
Token ID351
Feature activation+1.49e-02

Pos logit contributions
 spaghetti+0.059
 poems+0.058
 sauce+0.057
 noodles+0.056
 masks+0.055
Neg logit contributions
stru-0.030
 clim-0.027
away-0.025
or-0.024
ester-0.024
Token lots
Token ID6041
Feature activation-1.06e-02

Pos logit contributions
 of+0.017
stru+0.014
 forward+0.014
 ever+0.013
 home+0.013
Neg logit contributions
ems-0.032
ccoli-0.032
aghetti-0.032
 poems-0.030
bat-0.030
Token of
Token ID286
Feature activation-2.00e-02

Pos logit contributions
 spaghetti+0.029
 sauce+0.028
 poems+0.027
 gum+0.027
 yarn+0.027
Neg logit contributions
stru-0.008
 clim-0.006
or-0.005
ester-0.005
away-0.005
Token so
Token ID523
Feature activation-1.02e-02

Pos logit contributions
ccoli+0.029
bat+0.029
ems+0.027
 poems+0.026
aghetti+0.026
Neg logit contributions
 of-0.038
stru-0.034
--0.033
old-0.032
 year-0.032
Tokenapy
Token ID12826
Feature activation-7.17e-02

Pos logit contributions
 spaghetti+0.021
 poems+0.021
 sauce+0.020
 noodles+0.020
 masks+0.020
Neg logit contributions
stru-0.013
 clim-0.011
away-0.011
ester-0.010
or-0.010
Token bubbles
Token ID25037
Feature activation-2.53e-02

Pos logit contributions
ccoli+0.123
 poems+0.118
bat+0.117
aghetti+0.114
ems+0.114
Neg logit contributions
 of-0.115
--0.099
 clim-0.095
stru-0.094
 ever-0.093
Token.
Token ID13
Feature activation-6.08e-03

Pos logit contributions
 spaghetti+0.059
 poems+0.059
ccoli+0.057
 noodles+0.056
 sauce+0.056
Neg logit contributions
stru-0.020
 of-0.020
 clim-0.020
 ever-0.018
away-0.016
Token Finally
Token ID9461
Feature activation-2.88e-03

Pos logit contributions
 poems+0.013
 spaghetti+0.012
 masks+0.012
 sauce+0.012
 noodles+0.012
Neg logit contributions
stru-0.007
 clim-0.006
 of-0.006
way-0.006
 year-0.006
Token she
Token ID673
Feature activation+6.70e-03

Pos logit contributions
 spaghetti+0.007
 sauce+0.006
 poems+0.006
 noodles+0.006
 gum+0.006
Neg logit contributions
stru-0.003
away-0.003
ester-0.003
ris-0.003
 clim-0.003
Token polished
Token ID23895
Feature activation-2.19e-04

Pos logit contributions
stru+0.010
 clim+0.009
ris+0.009
ester+0.008
or+0.008
Neg logit contributions
 spaghetti-0.012
 poems-0.012
 sauce-0.011
 masks-0.011
 noodles-0.011
Token of
Token ID286
Feature activation-1.82e-02

Pos logit contributions
 spaghetti+0.107
 poems+0.105
 sauce+0.102
 noodles+0.101
 masks+0.101
Neg logit contributions
stru-0.010
 clim-0.006
away-0.004
ris-0.004
or-0.003
Token cheese
Token ID9891
Feature activation-2.48e-02

Pos logit contributions
bat+0.034
 spaghetti+0.032
ccoli+0.032
 poems+0.032
ems+0.031
Neg logit contributions
stru-0.024
 of-0.024
 year-0.023
ris-0.022
away-0.022
Token and
Token ID290
Feature activation-4.46e-03

Pos logit contributions
 spaghetti+0.058
 poems+0.057
 elbows+0.057
bat+0.055
Ol+0.055
Neg logit contributions
 of-0.019
 clim-0.018
 ever-0.017
away-0.016
or-0.016
Token sauce
Token ID10746
Feature activation-2.66e-02

Pos logit contributions
bat+0.006
aghetti+0.006
ems+0.005
Ol+0.005
 poems+0.005
Neg logit contributions
 of-0.008
 ever-0.008
 return-0.007
stru-0.007
 clim-0.007
Token and
Token ID290
Feature activation-7.25e-03

Pos logit contributions
 spaghetti+0.061
 elbows+0.061
 poems+0.061
bat+0.059
 gum+0.057
Neg logit contributions
 of-0.022
 clim-0.020
 ever-0.019
or-0.019
stru-0.019
Token topp
Token ID23126
Feature activation-7.16e-02

Pos logit contributions
bat+0.009
aghetti+0.009
ems+0.009
Ol+0.008
irms+0.008
Neg logit contributions
 of-0.013
 ever-0.013
 return-0.012
 clim-0.011
 that-0.011
Tokenings
Token ID654
Feature activation-2.25e-02

Pos logit contributions
 spaghetti+0.156
 poems+0.150
 knit+0.146
 yarn+0.146
ems+0.145
Neg logit contributions
--0.074
or-0.068
stru-0.067
away-0.063
 clim-0.060
Token.
Token ID13
Feature activation+2.97e-03

Pos logit contributions
 poems+0.052
 spaghetti+0.052
 elbows+0.052
bat+0.050
Ol+0.049
Neg logit contributions
 of-0.018
 forward-0.017
or-0.016
 ever-0.016
away-0.015
Token The
Token ID383
Feature activation+1.13e-03

Pos logit contributions
stru+0.004
 clim+0.004
 of+0.004
ris+0.004
 forward+0.003
Neg logit contributions
 poems-0.005
 spaghetti-0.005
 masks-0.005
 elbows-0.005
bat-0.005
Token people
Token ID661
Feature activation+1.02e-02

Pos logit contributions
 year+0.002
stru+0.002
 clim+0.002
 of+0.002
ris+0.002
Neg logit contributions
 poems-0.002
 spaghetti-0.002
bat-0.002
ems-0.002
 elbows-0.002
Token who
Token ID508
Feature activation+2.51e-02

Pos logit contributions
stru+0.014
 clim+0.013
away+0.012
or+0.012
ris+0.012
Neg logit contributions
 spaghetti-0.020
 poems-0.019
 sauce-0.018
 noodles-0.018
 masks-0.018
Token a
Token ID257
Feature activation+1.24e-02

Pos logit contributions
ccoli+0.037
bat+0.036
aghetti+0.036
 spaghetti+0.036
 poems+0.035
Neg logit contributions
 forward-0.017
away-0.017
 of-0.016
or-0.016
--0.016
Token toy
Token ID13373
Feature activation-4.53e-03

Pos logit contributions
stru+0.016
away+0.016
 clim+0.016
 year+0.015
 of+0.015
Neg logit contributions
 poems-0.022
 spaghetti-0.022
ems-0.022
bat-0.021
 masks-0.021
Token symbol
Token ID6194
Feature activation-2.85e-03

Pos logit contributions
 spaghetti+0.010
 poems+0.010
 sauce+0.009
ccoli+0.009
bat+0.009
Neg logit contributions
stru-0.004
 clim-0.004
away-0.004
 of-0.004
or-0.004
Token in
Token ID287
Feature activation+9.96e-03

Pos logit contributions
 spaghetti+0.007
 poems+0.006
 elbows+0.006
 noodles+0.006
bat+0.006
Neg logit contributions
stru-0.003
 clim-0.002
 of-0.002
away-0.002
or-0.002
Token so
Token ID523
Feature activation-1.91e-03

Pos logit contributions
stru+0.013
away+0.013
 clim+0.012
 of+0.012
 year+0.012
Neg logit contributions
 spaghetti-0.019
 poems-0.018
ccoli-0.018
 noodles-0.018
bat-0.017
Tokenapy
Token ID12826
Feature activation-7.11e-02

Pos logit contributions
 spaghetti+0.004
 poems+0.004
 sauce+0.004
 noodles+0.004
 masks+0.004
Neg logit contributions
stru-0.002
 clim-0.002
away-0.002
or-0.002
 of-0.002
Token water
Token ID1660
Feature activation-6.75e-03

Pos logit contributions
ccoli+0.153
bat+0.150
 poems+0.149
 spaghetti+0.147
aghetti+0.146
Neg logit contributions
 of-0.082
away-0.071
stru-0.068
 ever-0.068
--0.065
Token,
Token ID11
Feature activation-2.19e-04

Pos logit contributions
 poems+0.017
 spaghetti+0.017
 elbows+0.016
ccoli+0.016
 noodles+0.016
Neg logit contributions
 of-0.004
stru-0.004
or-0.004
 clim-0.004
 ever-0.004
Token it
Token ID340
Feature activation-2.24e-02

Pos logit contributions
aghetti+0.001
ccoli+0.001
bat+0.001
ksh+0.001
irms+0.000
Neg logit contributions
 of-0.000
 ever-0.000
 forward-0.000
 luckily-0.000
stru-0.000
Token would
Token ID561
Feature activation-7.58e-03

Pos logit contributions
 spaghetti+0.047
 poems+0.046
 sauce+0.043
 noodles+0.043
 gum+0.042
Neg logit contributions
stru-0.028
 clim-0.025
or-0.022
away-0.022
ester-0.022
Token become
Token ID1716
Feature activation+7.98e-03

Pos logit contributions
 spaghetti+0.015
 poems+0.015
ccoli+0.014
ems+0.014
aghetti+0.014
Neg logit contributions
stru-0.009
 ever-0.009
 of-0.009
away-0.009
 clim-0.009
Token brush
Token ID14093
Feature activation-2.68e-03

Pos logit contributions
stru+0.014
 clim+0.014
 of+0.012
away+0.012
ris+0.012
Neg logit contributions
 poems-0.019
 spaghetti-0.018
 elbows-0.018
 noodles-0.018
 sauce-0.018
Token and
Token ID290
Feature activation-1.24e-03

Pos logit contributions
 spaghetti+0.006
 poems+0.006
 sauce+0.006
 noodles+0.006
 masks+0.006
Neg logit contributions
stru-0.002
 of-0.002
 clim-0.002
away-0.002
or-0.002
Token bucket
Token ID19236
Feature activation-2.26e-02

Pos logit contributions
aghetti+0.002
 poems+0.002
bat+0.002
ccoli+0.002
 spaghetti+0.002
Neg logit contributions
stru-0.002
 of-0.002
 clim-0.002
old-0.001
 able-0.001
Token of
Token ID286
Feature activation-6.83e-03

Pos logit contributions
 spaghetti+0.054
 poems+0.054
ccoli+0.052
ems+0.051
 noodles+0.051
Neg logit contributions
 of-0.018
stru-0.018
 clim-0.016
ester-0.014
away-0.014
Token so
Token ID523
Feature activation-3.32e-03

Pos logit contributions
ccoli+0.012
ems+0.011
aghetti+0.011
bat+0.010
ities+0.010
Neg logit contributions
 of-0.012
 year-0.010
--0.009
 taxi-0.009
 home-0.009
Tokenapy
Token ID12826
Feature activation-7.05e-02

Pos logit contributions
 spaghetti+0.006
 poems+0.006
 sauce+0.006
 noodles+0.006
 masks+0.006
Neg logit contributions
stru-0.005
 clim-0.004
away-0.004
ester-0.004
or-0.004
Token water
Token ID1660
Feature activation-1.26e-02

Pos logit contributions
ccoli+0.108
bat+0.104
 poems+0.103
ems+0.098
ities+0.098
Neg logit contributions
 of-0.128
--0.108
 ever-0.101
 year-0.099
,-0.097
Token.
Token ID13
Feature activation-9.71e-03

Pos logit contributions
 poems+0.031
 spaghetti+0.030
 elbows+0.029
 noodles+0.029
 sauce+0.029
Neg logit contributions
stru-0.008
 of-0.008
 clim-0.008
or-0.008
 ever-0.008
Token Sus
Token ID8932
Feature activation+1.56e-02

Pos logit contributions
 spaghetti+0.015
 sauce+0.014
 poems+0.014
 gum+0.013
 yarn+0.013
Neg logit contributions
stru-0.017
away-0.016
 clim-0.016
ester-0.015
or-0.015
Tokenie
Token ID494
Feature activation-4.13e-03

Pos logit contributions
stru+0.022
 victorious+0.020
 clim+0.020
 forward+0.018
away+0.018
Neg logit contributions
 spaghetti-0.029
 poems-0.028
 sauce-0.027
 noodles-0.026
 masks-0.026
Token scrub
Token ID27268
Feature activation-3.58e-02

Pos logit contributions
 spaghetti+0.009
 poems+0.009
 sauce+0.008
 noodles+0.008
 gum+0.008
Neg logit contributions
stru-0.005
 clim-0.004
or-0.004
ester-0.004
ris-0.004
Token his
Token ID465
Feature activation-1.16e-03

Pos logit contributions
ems+0.005
ccoli+0.005
 poems+0.005
ities+0.005
 spaghetti+0.005
Neg logit contributions
stru-0.002
away-0.002
 of-0.002
 clim-0.002
--0.002
Token bubble
Token ID14310
Feature activation-3.40e-02

Pos logit contributions
ems+0.002
 poems+0.002
 spaghetti+0.002
bat+0.002
aghetti+0.002
Neg logit contributions
stru-0.002
 home-0.002
 of-0.002
 year-0.002
ester-0.001
Token maker
Token ID16009
Feature activation-1.66e-02

Pos logit contributions
 spaghetti+0.061
 poems+0.059
ems+0.059
ccoli+0.057
 mustard+0.057
Neg logit contributions
 of-0.044
stru-0.044
 clim-0.042
away-0.042
or-0.042
Token with
Token ID351
Feature activation+1.52e-02

Pos logit contributions
 spaghetti+0.036
 poems+0.036
 sauce+0.035
ems+0.034
 noodles+0.034
Neg logit contributions
stru-0.017
 clim-0.015
or-0.014
 of-0.014
away-0.014
Token so
Token ID523
Feature activation-4.51e-03

Pos logit contributions
 of+0.021
stru+0.018
 ever+0.017
 forward+0.017
 clim+0.017
Neg logit contributions
ems-0.030
ccoli-0.029
aghetti-0.028
 poems-0.027
bat-0.026
Tokenapy
Token ID12826
Feature activation-6.99e-02

Pos logit contributions
 spaghetti+0.009
 poems+0.009
 sauce+0.009
 noodles+0.009
 masks+0.009
Neg logit contributions
stru-0.006
 clim-0.005
away-0.005
ester-0.005
or-0.005
Token water
Token ID1660
Feature activation-2.05e-02

Pos logit contributions
ccoli+0.116
bat+0.110
ems+0.110
 poems+0.109
ities+0.105
Neg logit contributions
 of-0.123
--0.104
 ever-0.096
 year-0.092
away-0.090
Token and
Token ID290
Feature activation+4.81e-04

Pos logit contributions
 poems+0.048
 spaghetti+0.048
ems+0.045
ccoli+0.045
 sauce+0.045
Neg logit contributions
stru-0.017
 of-0.016
 clim-0.015
or-0.014
away-0.014
Token blew
Token ID17948
Feature activation-4.86e-03

Pos logit contributions
 of+0.001
stru+0.001
 clim+0.001
 ever+0.001
 quickly+0.001
Neg logit contributions
ems-0.001
 poems-0.001
ccoli-0.001
ities-0.001
bat-0.001
Token.
Token ID13
Feature activation-7.03e-03

Pos logit contributions
ems+0.011
ccoli+0.011
 poems+0.011
 knit+0.010
aghetti+0.010
Neg logit contributions
 forward-0.005
 of-0.005
--0.004
away-0.004
stru-0.004
Token Bub
Token ID19645
Feature activation-9.40e-03

Pos logit contributions
 poems+0.014
 spaghetti+0.014
 masks+0.014
ems+0.014
 sauce+0.013
Neg logit contributions
stru-0.009
 clim-0.008
 of-0.008
ris-0.007
way-0.007
Token inside
Token ID2641
Feature activation+7.48e-03

Pos logit contributions
 spaghetti+0.007
 poems+0.007
 sauce+0.007
 noodles+0.007
 elbows+0.007
Neg logit contributions
stru-0.003
 clim-0.003
away-0.003
ris-0.003
ester-0.003
Token!
Token ID0
Feature activation+1.91e-03

Pos logit contributions
stru+0.009
 clim+0.008
away+0.007
or+0.007
ester+0.007
Neg logit contributions
 spaghetti-0.016
 poems-0.016
 sauce-0.015
 noodles-0.015
 masks-0.015
Token There
Token ID1318
Feature activation+9.13e-03

Pos logit contributions
stru+0.003
 clim+0.003
away+0.003
or+0.003
ester+0.003
Neg logit contributions
 spaghetti-0.003
 poems-0.003
 sauce-0.003
 masks-0.003
 noodles-0.003
Token were
Token ID547
Feature activation-1.03e-02

Pos logit contributions
stru+0.013
 clim+0.012
away+0.012
ester+0.011
ris+0.011
Neg logit contributions
 spaghetti-0.017
 poems-0.017
 sauce-0.016
 noodles-0.016
 masks-0.016
Token ch
Token ID442
Feature activation-5.65e-03

Pos logit contributions
ems+0.017
ccoli+0.017
 elbows+0.016
 spaghetti+0.016
 poems+0.016
Neg logit contributions
 of-0.016
stru-0.016
 clim-0.014
 year-0.014
 ever-0.014
Tokenocol
Token ID4668
Feature activation-6.97e-02

Pos logit contributions
 spaghetti+0.011
 poems+0.011
 sauce+0.010
 noodles+0.010
 masks+0.010
Neg logit contributions
stru-0.008
 clim-0.007
away-0.007
ester-0.007
or-0.007
Tokenates
Token ID689
Feature activation+6.65e-04

Pos logit contributions
 spaghetti+0.105
 poems+0.100
 sauce+0.095
 noodles+0.093
 masks+0.091
Neg logit contributions
stru-0.126
 clim-0.116
ris-0.115
or-0.112
ester-0.111
Token,
Token ID11
Feature activation-1.25e-02

Pos logit contributions
stru+0.001
 of+0.000
 clim+0.000
or+0.000
ester+0.000
Neg logit contributions
 spaghetti-0.002
 poems-0.002
 elbows-0.002
 sauce-0.002
 masks-0.002
Token cookies
Token ID14746
Feature activation-3.28e-02

Pos logit contributions
 elbows+0.018
ems+0.017
irms+0.017
ccoli+0.017
aghetti+0.016
Neg logit contributions
 of-0.022
 ever-0.019
stru-0.019
 clim-0.019
 year-0.019
Token,
Token ID11
Feature activation-1.90e-02

Pos logit contributions
 spaghetti+0.084
 poems+0.084
 elbows+0.082
irms+0.080
 noodles+0.080
Neg logit contributions
stru-0.019
 of-0.018
 clim-0.018
 ever-0.015
ris-0.014
Token and
Token ID290
Feature activation-6.51e-03

Pos logit contributions
 elbows+0.041
irms+0.039
ems+0.039
ccoli+0.038
 poems+0.038
Neg logit contributions
 of-0.018
stru-0.017
 clim-0.016
 ever-0.015
 year-0.015
Token full
Token ID1336
Feature activation-5.51e-03

Pos logit contributions
 poems+0.079
 spaghetti+0.078
 masks+0.075
 sauce+0.074
 noodles+0.074
Neg logit contributions
stru-0.031
 of-0.029
 clim-0.027
away-0.026
or-0.025
Token of
Token ID286
Feature activation-8.50e-03

Pos logit contributions
 poems+0.017
 spaghetti+0.017
ems+0.017
 masks+0.017
bat+0.017
Neg logit contributions
stru-0.000
 of-0.000
 clim+0.000
 year+0.000
away+0.000
Token warm
Token ID5814
Feature activation-2.91e-02

Pos logit contributions
ems+0.014
ccoli+0.012
bat+0.012
 poems+0.012
 keyboard+0.011
Neg logit contributions
 of-0.016
 year-0.014
stru-0.013
 ever-0.013
--0.013
Token,
Token ID11
Feature activation-1.68e-02

Pos logit contributions
 poems+0.043
ems+0.041
ccoli+0.041
 spaghetti+0.039
 keyboard+0.038
Neg logit contributions
 of-0.049
stru-0.048
away-0.046
--0.046
 year-0.045
Token so
Token ID523
Feature activation-1.20e-02

Pos logit contributions
 poems+0.033
ems+0.032
 elbows+0.031
bat+0.031
 masks+0.031
Neg logit contributions
 of-0.019
stru-0.019
 clim-0.017
 ever-0.016
--0.016
Tokenapy
Token ID12826
Feature activation-6.88e-02

Pos logit contributions
 spaghetti+0.026
 poems+0.025
 sauce+0.025
 masks+0.024
 noodles+0.024
Neg logit contributions
stru-0.013
 clim-0.012
 of-0.012
 ever-0.011
away-0.011
Token water
Token ID1660
Feature activation-2.29e-02

Pos logit contributions
 poems+0.121
ccoli+0.116
ems+0.116
bat+0.113
 keyboard+0.109
Neg logit contributions
 of-0.106
 ever-0.092
--0.089
 year-0.089
 clim-0.088
Token and
Token ID290
Feature activation-4.17e-03

Pos logit contributions
 poems+0.054
 spaghetti+0.053
 sauce+0.050
 noodles+0.050
 gum+0.049
Neg logit contributions
stru-0.020
 clim-0.018
 of-0.018
away-0.017
 ever-0.016
Token left
Token ID1364
Feature activation-1.57e-03

Pos logit contributions
ems+0.007
 poems+0.007
aghetti+0.006
bat+0.006
ities+0.006
Neg logit contributions
 of-0.006
 ever-0.006
 returned-0.006
 clim-0.005
stru-0.005
Token him
Token ID683
Feature activation-7.73e-03

Pos logit contributions
 poems+0.003
 spaghetti+0.003
 sauce+0.003
 noodles+0.003
 masks+0.003
Neg logit contributions
stru-0.002
 clim-0.002
away-0.002
or-0.002
 of-0.002
Token to
Token ID284
Feature activation+2.81e-04

Pos logit contributions
 spaghetti+0.017
 poems+0.017
 sauce+0.017
 noodles+0.016
 masks+0.016
Neg logit contributions
stru-0.008
 clim-0.007
away-0.007
or-0.006
ester-0.006
Token out
Token ID503
Feature activation+1.00e-02

Pos logit contributions
 spaghetti+0.000
 poems+0.000
 sauce+0.000
 noodles+0.000
ems+0.000
Neg logit contributions
stru-0.000
 clim-0.000
away-0.000
or-0.000
ester-0.000
Token the
Token ID262
Feature activation+7.25e-03

Pos logit contributions
stru+0.014
 clim+0.013
away+0.013
or+0.012
ester+0.012
Neg logit contributions
 spaghetti-0.020
 sauce-0.018
 poems-0.018
 noodles-0.018
 masks-0.018
Token towels
Token ID38494
Feature activation-2.13e-03

Pos logit contributions
stru+0.012
 clim+0.011
away+0.010
or+0.010
ris+0.010
Neg logit contributions
 spaghetti-0.012
 poems-0.012
 sauce-0.012
 noodles-0.011
 masks-0.011
Token and
Token ID290
Feature activation+4.77e-03

Pos logit contributions
 spaghetti+0.005
 poems+0.004
 sauce+0.004
 noodles+0.004
 masks+0.004
Neg logit contributions
stru-0.003
 clim-0.002
away-0.002
or-0.002
ester-0.002
Token so
Token ID523
Feature activation-2.10e-03

Pos logit contributions
stru+0.006
 clim+0.005
 of+0.005
or+0.005
ris+0.005
Neg logit contributions
 poems-0.009
 spaghetti-0.009
 sauce-0.009
ems-0.009
ccoli-0.009
Tokenapy
Token ID12826
Feature activation-6.84e-02

Pos logit contributions
 spaghetti+0.004
 poems+0.004
 sauce+0.004
 noodles+0.004
 masks+0.004
Neg logit contributions
stru-0.003
 clim-0.003
away-0.002
or-0.002
ester-0.002
Token water
Token ID1660
Feature activation-1.41e-02

Pos logit contributions
 poems+0.116
ccoli+0.113
ems+0.111
bat+0.108
 spaghetti+0.107
Neg logit contributions
 of-0.104
--0.094
 clim-0.094
stru-0.091
 ever-0.091
Token.
Token ID13
Feature activation-1.64e-02

Pos logit contributions
 poems+0.032
 spaghetti+0.032
 sauce+0.030
 noodles+0.030
 elbows+0.030
Neg logit contributions
stru-0.013
 clim-0.012
 of-0.011
or-0.011
away-0.011
Token Little
Token ID7703
Feature activation+8.59e-03

Pos logit contributions
 poems+0.032
 spaghetti+0.031
 sauce+0.030
 masks+0.030
 noodles+0.029
Neg logit contributions
stru-0.023
 clim-0.019
ris-0.019
 forward-0.018
 of-0.018
Token one
Token ID530
Feature activation-1.17e-02

Pos logit contributions
stru+0.012
 clim+0.011
-+0.011
 ever+0.010
ester+0.010
Neg logit contributions
 poems-0.016
 spaghetti-0.015
 sauce-0.015
 noodles-0.015
 salsa-0.015
Token brushed
Token ID31782
Feature activation-5.26e-03

Pos logit contributions
 spaghetti+0.024
 poems+0.022
 sauce+0.022
 noodles+0.022
 gum+0.021
Neg logit contributions
stru-0.015
 clim-0.013
away-0.013
or-0.013
ester-0.013
Token and
Token ID290
Feature activation+1.31e-03

Pos logit contributions
 spaghetti+0.040
 sauce+0.039
 poems+0.037
 noodles+0.037
 gum+0.036
Neg logit contributions
stru-0.018
 clim-0.015
away-0.014
or-0.013
ris-0.013
Token filled
Token ID5901
Feature activation+6.32e-03

Pos logit contributions
stru+0.002
 clim+0.002
away+0.002
ester+0.002
ris+0.002
Neg logit contributions
 poems-0.002
 spaghetti-0.002
 sauce-0.002
 noodles-0.002
 masks-0.002
Token it
Token ID340
Feature activation-2.23e-02

Pos logit contributions
stru+0.008
 clim+0.007
away+0.007
or+0.006
ester+0.006
Neg logit contributions
 spaghetti-0.013
 poems-0.013
 sauce-0.012
 noodles-0.012
 masks-0.012
Token with
Token ID351
Feature activation+2.04e-02

Pos logit contributions
 spaghetti+0.045
 poems+0.043
 sauce+0.043
 noodles+0.042
 masks+0.041
Neg logit contributions
stru-0.029
 clim-0.026
away-0.025
ester-0.024
or-0.024
Token so
Token ID523
Feature activation-6.07e-03

Pos logit contributions
stru+0.031
 of+0.029
 clim+0.029
away+0.028
 ever+0.027
Neg logit contributions
 poems-0.034
ems-0.034
ccoli-0.033
 spaghetti-0.032
aghetti-0.031
Tokenapy
Token ID12826
Feature activation-6.80e-02

Pos logit contributions
 spaghetti+0.012
 poems+0.012
 sauce+0.012
 noodles+0.011
 masks+0.011
Neg logit contributions
stru-0.008
 clim-0.007
away-0.007
ester-0.007
or-0.007
Token water
Token ID1660
Feature activation-1.51e-02

Pos logit contributions
 poems+0.112
ccoli+0.107
bat+0.105
 spaghetti+0.103
ems+0.103
Neg logit contributions
 of-0.107
stru-0.102
away-0.097
--0.094
 clim-0.094
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
 poems+0.035
 spaghetti+0.035
 sauce+0.033
 noodles+0.033
 masks+0.032
Neg logit contributions
stru-0.014
 clim-0.012
 of-0.012
away-0.012
or-0.011
Token He
Token ID679
Feature activation-5.24e-03

Pos logit contributions
 poems+0.026
 spaghetti+0.026
 masks+0.025
 sauce+0.025
 noodles+0.024
Neg logit contributions
stru-0.017
 clim-0.015
ris-0.014
or-0.014
ester-0.014
Token went
Token ID1816
Feature activation+1.17e-02

Pos logit contributions
 spaghetti+0.010
 poems+0.010
 sauce+0.009
 noodles+0.009
 gum+0.009
Neg logit contributions
stru-0.007
 clim-0.007
ris-0.006
ester-0.006
away-0.006
Token back
Token ID736
Feature activation-8.17e-03

Pos logit contributions
stru+0.017
 clim+0.015
ively+0.014
ris+0.014
ester+0.014
Neg logit contributions
 spaghetti-0.023
 sauce-0.021
 masks-0.021
 noodles-0.020
 gum-0.020
Token cakes
Token ID30849
Feature activation-1.74e-02

Pos logit contributions
ems+0.039
ccoli+0.037
aghetti+0.034
 knit+0.034
 poems+0.034
Neg logit contributions
 of-0.027
 clim-0.025
 ever-0.025
stru-0.025
 year-0.024
Token and
Token ID290
Feature activation-9.34e-03

Pos logit contributions
 spaghetti+0.044
 poems+0.043
 noodles+0.041
 sauce+0.041
 elbows+0.041
Neg logit contributions
stru-0.011
 clim-0.011
 of-0.011
ris-0.009
or-0.009
Token cookies
Token ID14746
Feature activation-2.86e-02

Pos logit contributions
ems+0.013
ccoli+0.013
irms+0.012
bat+0.011
aghetti+0.011
Neg logit contributions
 of-0.016
 ever-0.015
 year-0.015
stru-0.015
 clim-0.015
Token and
Token ID290
Feature activation-9.65e-03

Pos logit contributions
 poems+0.067
 spaghetti+0.067
 elbows+0.065
 noodles+0.064
ccoli+0.064
Neg logit contributions
 of-0.022
 clim-0.021
stru-0.020
 ever-0.020
ris-0.019
Token ch
Token ID442
Feature activation-2.69e-03

Pos logit contributions
ccoli+0.012
ems+0.012
bat+0.012
aghetti+0.011
irms+0.011
Neg logit contributions
 of-0.018
 ever-0.017
 clim-0.016
stru-0.016
 year-0.016
Tokenocol
Token ID4668
Feature activation-6.80e-02

Pos logit contributions
 spaghetti+0.006
 poems+0.005
 sauce+0.005
 noodles+0.005
 masks+0.005
Neg logit contributions
stru-0.003
 clim-0.003
away-0.003
ester-0.003
or-0.003
Tokenates
Token ID689
Feature activation-2.36e-03

Pos logit contributions
 spaghetti+0.105
 poems+0.097
 sauce+0.093
 noodles+0.091
 masks+0.090
Neg logit contributions
stru-0.122
 clim-0.113
ris-0.111
ester-0.108
or-0.108
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
 spaghetti+0.006
 poems+0.006
 elbows+0.005
irms+0.005
ems+0.005
Neg logit contributions
 of-0.002
stru-0.002
 ever-0.002
 clim-0.001
or-0.001
Token ice
Token ID4771
Feature activation-2.31e-02

Pos logit contributions
bat+0.019
ems+0.019
aghetti+0.019
ccoli+0.018
Ol+0.018
Neg logit contributions
 of-0.024
 ever-0.024
 clim-0.023
stru-0.022
 year-0.021
Token cream
Token ID8566
Feature activation-5.86e-03

Pos logit contributions
 spaghetti+0.031
 poems+0.031
ems+0.029
 elbows+0.029
 keyboard+0.028
Neg logit contributions
stru-0.041
--0.041
 clim-0.039
or-0.039
away-0.039
Token.
Token ID13
Feature activation-2.16e-03

Pos logit contributions
 poems+0.014
 spaghetti+0.014
 elbows+0.014
ems+0.013
irms+0.013
Neg logit contributions
 clim-0.004
stru-0.004
 of-0.004
 ever-0.004
--0.004
Token a
Token ID257
Feature activation+7.73e-03

Pos logit contributions
 of+0.004
stru+0.004
 ever+0.004
 able+0.004
 year+0.004
Neg logit contributions
ems-0.008
ccoli-0.008
bat-0.008
aghetti-0.008
 poems-0.008
Token big
Token ID1263
Feature activation-5.70e-03

Pos logit contributions
stru+0.010
 clim+0.009
away+0.009
 year+0.009
ester+0.008
Neg logit contributions
 poems-0.015
 spaghetti-0.014
ems-0.014
bat-0.014
 noodles-0.014
Token bowl
Token ID9396
Feature activation-2.65e-02

Pos logit contributions
 poems+0.009
 spaghetti+0.009
ems+0.008
bat+0.008
ccoli+0.008
Neg logit contributions
stru-0.009
 clim-0.009
 year-0.008
away-0.008
ester-0.008
Token of
Token ID286
Feature activation-1.48e-02

Pos logit contributions
 spaghetti+0.070
 poems+0.069
 sauce+0.067
 noodles+0.066
 masks+0.066
Neg logit contributions
stru-0.017
 clim-0.014
away-0.012
ester-0.012
or-0.011
Token so
Token ID523
Feature activation-1.03e-02

Pos logit contributions
ems+0.024
ccoli+0.023
 poems+0.021
bat+0.020
 elbows+0.020
Neg logit contributions
 of-0.027
 year-0.024
--0.022
stru-0.022
 home-0.022
Tokenapy
Token ID12826
Feature activation-6.80e-02

Pos logit contributions
 spaghetti+0.018
 poems+0.018
 sauce+0.017
 masks+0.017
 noodles+0.017
Neg logit contributions
stru-0.015
 clim-0.014
away-0.013
or-0.013
ester-0.013
Token water
Token ID1660
Feature activation-1.26e-02

Pos logit contributions
 poems+0.118
ccoli+0.115
ems+0.113
bat+0.112
 keyboard+0.104
Neg logit contributions
 of-0.110
--0.093
 year-0.090
 ever-0.087
away-0.083
Token and
Token ID290
Feature activation+1.09e-03

Pos logit contributions
 poems+0.031
 spaghetti+0.029
 elbows+0.029
ems+0.028
 masks+0.028
Neg logit contributions
 of-0.009
stru-0.009
 clim-0.008
 ever-0.008
or-0.008
Token she
Token ID673
Feature activation+6.83e-03

Pos logit contributions
 of+0.002
stru+0.001
 ever+0.001
 clim+0.001
 able+0.001
Neg logit contributions
ems-0.002
bat-0.002
 poems-0.002
ccoli-0.002
irms-0.002
Token was
Token ID373
Feature activation+8.54e-03

Pos logit contributions
stru+0.008
 clim+0.007
 of+0.007
ester+0.007
 ever+0.006
Neg logit contributions
 poems-0.014
 spaghetti-0.014
 masks-0.013
bat-0.013
 sauce-0.013
Token shaking
Token ID17275
Feature activation-8.82e-03

Pos logit contributions
stru+0.010
 clim+0.008
away+0.008
ester+0.008
ris+0.007
Neg logit contributions
 spaghetti-0.019
 poems-0.018
 sauce-0.018
 noodles-0.018
 masks-0.018
Token into
Token ID656
Feature activation-1.39e-02

Pos logit contributions
stru+0.008
 clim+0.007
ester+0.007
ris+0.006
old+0.006
Neg logit contributions
 spaghetti-0.012
 sauce-0.012
 poems-0.012
 masks-0.012
 noodles-0.011
Token the
Token ID262
Feature activation+2.40e-03

Pos logit contributions
 poems+0.030
ems+0.030
 spaghetti+0.029
ccoli+0.029
 sauce+0.028
Neg logit contributions
stru-0.015
 clim-0.014
away-0.013
 of-0.013
 victorious-0.012
Token bucket
Token ID19236
Feature activation-1.29e-02

Pos logit contributions
 home+0.003
away+0.003
 of+0.003
 single+0.003
 clim+0.003
Neg logit contributions
 poems-0.005
ems-0.005
ccoli-0.004
ities-0.004
aghetti-0.004
Token of
Token ID286
Feature activation-3.16e-03

Pos logit contributions
 spaghetti+0.028
 poems+0.028
 sauce+0.027
 noodles+0.026
 masks+0.026
Neg logit contributions
stru-0.015
 clim-0.013
away-0.012
ester-0.012
or-0.012
Token so
Token ID523
Feature activation-1.02e-02

Pos logit contributions
ems+0.006
ccoli+0.006
aghetti+0.005
 poems+0.005
ities+0.005
Neg logit contributions
 of-0.005
 year-0.004
 home-0.004
away-0.004
 clim-0.004
Tokenapy
Token ID12826
Feature activation-6.78e-02

Pos logit contributions
 spaghetti+0.020
 poems+0.020
 sauce+0.019
 noodles+0.019
 masks+0.019
Neg logit contributions
stru-0.014
 clim-0.012
away-0.012
ester-0.011
or-0.011
Token water
Token ID1660
Feature activation-1.72e-02

Pos logit contributions
 poems+0.109
ccoli+0.103
 spaghetti+0.102
ems+0.098
bat+0.097
Neg logit contributions
 of-0.110
stru-0.109
 clim-0.104
away-0.103
 ever-0.100
Token and
Token ID290
Feature activation-1.92e-03

Pos logit contributions
 spaghetti+0.039
 poems+0.039
 sauce+0.036
 noodles+0.036
ccoli+0.036
Neg logit contributions
stru-0.018
 clim-0.015
away-0.015
 of-0.014
or-0.014
Token started
Token ID2067
Feature activation-2.48e-03

Pos logit contributions
 poems+0.004
ems+0.004
aghetti+0.004
 spaghetti+0.004
ities+0.004
Neg logit contributions
stru-0.002
 of-0.002
 ever-0.002
 clim-0.002
 luckily-0.002
Token turning
Token ID6225
Feature activation+1.38e-02

Pos logit contributions
 spaghetti+0.005
 poems+0.005
 sauce+0.005
ems+0.005
 noodles+0.005
Neg logit contributions
stru-0.003
 clim-0.002
away-0.002
ester-0.002
 of-0.002
Token around
Token ID1088
Feature activation+5.56e-03

Pos logit contributions
stru+0.013
 forward+0.013
away+0.012
 of+0.012
 clim+0.011
Neg logit contributions
 poems-0.031
 spaghetti-0.031
ems-0.030
ccoli-0.030
 sauce-0.029
Token They
Token ID1119
Feature activation-6.82e-03

Pos logit contributions
 poems+0.003
 spaghetti+0.003
 masks+0.003
 elbows+0.003
 sauce+0.003
Neg logit contributions
stru-0.002
 of-0.002
 clim-0.002
ris-0.002
away-0.002
Token also
Token ID635
Feature activation-1.82e-02

Pos logit contributions
 poems+0.014
 spaghetti+0.014
 masks+0.013
 elbows+0.013
 sauce+0.012
Neg logit contributions
 clim-0.008
stru-0.008
 of-0.007
ris-0.007
 ever-0.007
Token see
Token ID766
Feature activation+1.37e-03

Pos logit contributions
 spaghetti+0.036
 poems+0.036
bat+0.034
 masks+0.034
 sauce+0.034
Neg logit contributions
stru-0.022
 clim-0.021
 of-0.019
 forward-0.019
ris-0.018
Token some
Token ID617
Feature activation-3.07e-02

Pos logit contributions
 of+0.002
stru+0.002
 clim+0.002
 taxi+0.002
 forward+0.002
Neg logit contributions
ems-0.002
ccoli-0.002
 spaghetti-0.002
 poems-0.002
aghetti-0.002
Token fancy
Token ID14996
Feature activation-4.58e-03

Pos logit contributions
 spaghetti+0.050
 poems+0.049
ccoli+0.048
ems+0.046
 masks+0.046
Neg logit contributions
stru-0.049
 of-0.046
 clim-0.046
away-0.044
 year-0.043
Token topp
Token ID23126
Feature activation-6.73e-02

Pos logit contributions
 spaghetti+0.006
 poems+0.006
ccoli+0.006
ems+0.006
 masks+0.005
Neg logit contributions
stru-0.009
 clim-0.008
 of-0.008
away-0.008
--0.008
Tokenings
Token ID654
Feature activation-2.59e-02

Pos logit contributions
 spaghetti+0.142
 poems+0.138
ems+0.131
 masks+0.131
 yarn+0.129
Neg logit contributions
stru-0.078
 clim-0.072
or-0.071
--0.068
away-0.068
Token.
Token ID13
Feature activation-4.50e-04

Pos logit contributions
 elbows+0.065
 spaghetti+0.063
 poems+0.063
 apr+0.061
ccoli+0.061
Neg logit contributions
 of-0.021
 forward-0.018
or-0.016
--0.015
 clim-0.015
Token They
Token ID1119
Feature activation-7.79e-03

Pos logit contributions
 poems+0.001
 spaghetti+0.001
 masks+0.001
ems+0.001
 elbows+0.001
Neg logit contributions
 of-0.001
stru-0.001
 clim-0.000
ris-0.000
 tow-0.000
Token see
Token ID766
Feature activation+2.00e-04

Pos logit contributions
 spaghetti+0.016
 poems+0.016
 gum+0.015
 sauce+0.015
 masks+0.015
Neg logit contributions
stru-0.009
 clim-0.009
ris-0.008
 of-0.008
 ever-0.008
Token sprink
Token ID27562
Feature activation-1.12e-02

Pos logit contributions
 of+0.000
stru+0.000
 clim+0.000
 taxi+0.000
 forward+0.000
Neg logit contributions
ems-0.000
ccoli-0.000
aghetti-0.000
 spaghetti-0.000
 poems-0.000
Token he
Token ID339
Feature activation+2.75e-03

Pos logit contributions
stru+0.010
 clim+0.009
 of+0.009
away+0.008
or+0.008
Neg logit contributions
 spaghetti-0.020
 poems-0.020
 sauce-0.020
ems-0.019
ccoli-0.019
Token couldn
Token ID3521
Feature activation+1.27e-02

Pos logit contributions
stru+0.003
 clim+0.003
 ever+0.003
 of+0.002
 year+0.002
Neg logit contributions
 spaghetti-0.006
 poems-0.006
bat-0.006
 sauce-0.006
 noodles-0.006
Token't
Token ID470
Feature activation+2.37e-03

Pos logit contributions
stru+0.013
 clim+0.012
or+0.011
away+0.011
ester+0.011
Neg logit contributions
 spaghetti-0.029
 poems-0.028
 sauce-0.027
 noodles-0.027
 masks-0.026
Token ride
Token ID6594
Feature activation+3.18e-02

Pos logit contributions
stru+0.003
 clim+0.003
away+0.003
ester+0.003
or+0.003
Neg logit contributions
 spaghetti-0.005
 poems-0.005
 sauce-0.004
 noodles-0.004
 masks-0.004
Token his
Token ID465
Feature activation+1.10e-02

Pos logit contributions
stru+0.032
away+0.032
 forward+0.031
 clim+0.031
or+0.030
Neg logit contributions
 spaghetti-0.069
 sauce-0.067
 noodles-0.066
 poems-0.065
ccoli-0.065
Token bicycle
Token ID17026
Feature activation+4.45e-02

Pos logit contributions
stru+0.017
 clim+0.015
away+0.015
or+0.015
ester+0.014
Neg logit contributions
 spaghetti-0.019
 poems-0.018
 sauce-0.018
 noodles-0.018
 masks-0.017
Token.
Token ID13
Feature activation+1.83e-03

Pos logit contributions
stru+0.053
 clim+0.046
away+0.044
or+0.042
ester+0.042
Neg logit contributions
 spaghetti-0.095
 poems-0.092
 sauce-0.091
 noodles-0.088
 masks-0.087
Token He
Token ID679
Feature activation+5.22e-04

Pos logit contributions
stru+0.002
 clim+0.002
or+0.002
ris+0.002
 year+0.002
Neg logit contributions
 poems-0.004
 spaghetti-0.004
 sauce-0.003
 masks-0.003
 noodles-0.003
Token wob
Token ID46867
Feature activation-6.95e-03

Pos logit contributions
stru+0.001
 clim+0.001
ris+0.001
away+0.001
or+0.001
Neg logit contributions
 spaghetti-0.001
 poems-0.001
 sauce-0.001
 noodles-0.001
 gum-0.001
Tokenbled
Token ID9342
Feature activation-1.31e-02

Pos logit contributions
 spaghetti+0.011
 poems+0.011
 sauce+0.011
 noodles+0.010
 masks+0.010
Neg logit contributions
stru-0.012
 clim-0.011
away-0.010
ester-0.010
ris-0.010
Token around
Token ID1088
Feature activation+9.37e-05

Pos logit contributions
 spaghetti+0.031
 poems+0.030
 sauce+0.030
 noodles+0.029
ccoli+0.029
Neg logit contributions
stru-0.011
 forward-0.011
away-0.011
 clim-0.011
or-0.010
Token back
Token ID736
Feature activation-6.20e-03

Pos logit contributions
 spaghetti+0.030
 poems+0.029
 sauce+0.029
 noodles+0.028
 masks+0.028
Neg logit contributions
stru-0.014
 clim-0.012
away-0.012
ris-0.011
ester-0.011
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
 spaghetti+0.013
 poems+0.013
 sauce+0.013
 gum+0.012
 masks+0.012
Neg logit contributions
stru-0.007
 clim-0.006
away-0.006
ester-0.006
ris-0.006
Token Mom
Token ID11254
Feature activation-2.38e-02

Pos logit contributions
 spaghetti+0.022
 poems+0.021
 sauce+0.021
 noodles+0.021
 masks+0.021
Neg logit contributions
stru-0.012
 clim-0.011
 of-0.010
 forward-0.010
 ever-0.009
Tokenmy
Token ID1820
Feature activation-7.35e-03

Pos logit contributions
 poems+0.043
 spaghetti+0.042
 noodles+0.040
 elbows+0.040
 masks+0.040
Neg logit contributions
stru-0.035
 of-0.034
 clim-0.033
ester-0.032
old-0.031
Token asked
Token ID1965
Feature activation-3.12e-03

Pos logit contributions
 spaghetti+0.013
 poems+0.013
 sauce+0.012
 noodles+0.012
 masks+0.012
Neg logit contributions
stru-0.011
 clim-0.010
away-0.010
ester-0.010
or-0.010
Token Mary
Token ID5335
Feature activation+5.53e-02

Pos logit contributions
 spaghetti+0.006
 poems+0.006
 sauce+0.006
 noodles+0.006
 gum+0.005
Neg logit contributions
stru-0.004
 clim-0.004
away-0.004
or-0.004
 forward-0.004
Token to
Token ID284
Feature activation+7.60e-04

Pos logit contributions
stru+0.064
 clim+0.056
away+0.053
ester+0.052
or+0.051
Neg logit contributions
 spaghetti-0.120
 poems-0.116
 sauce-0.114
 noodles-0.111
 masks-0.110
Token say
Token ID910
Feature activation+1.45e-02

Pos logit contributions
stru+0.001
 clim+0.001
away+0.001
or+0.001
ester+0.001
Neg logit contributions
 spaghetti-0.002
 poems-0.002
 sauce-0.002
 noodles-0.002
 masks-0.001
Token thank
Token ID5875
Feature activation-1.04e-02

Pos logit contributions
stru+0.016
 clim+0.014
 forward+0.013
away+0.013
or+0.013
Neg logit contributions
 spaghetti-0.032
 sauce-0.031
 poems-0.030
 noodles-0.030
 masks-0.030
Token you
Token ID345
Feature activation-1.95e-02

Pos logit contributions
 spaghetti+0.019
 poems+0.019
 sauce+0.018
 noodles+0.018
 masks+0.018
Neg logit contributions
stru-0.015
 clim-0.014
away-0.013
or-0.013
ester-0.013
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
 spaghetti+0.042
 poems+0.041
 sauce+0.041
 noodles+0.040
 masks+0.039
Neg logit contributions
stru-0.022
 clim-0.019
away-0.019
or-0.018
ester-0.018
Token but
Token ID475
Feature activation+3.21e-02

Pos logit contributions
 poems+0.012
ems+0.011
 spaghetti+0.011
ccoli+0.011
aghetti+0.011
Neg logit contributions
stru-0.005
 of-0.005
 clim-0.005
 ever-0.005
 forward-0.005
Token he
Token ID339
Feature activation+8.20e-03

Pos logit contributions
stru+0.040
 clim+0.035
away+0.033
or+0.032
ester+0.031
Neg logit contributions
 spaghetti-0.067
 poems-0.065
 sauce-0.064
 noodles-0.062
 masks-0.062
Token couldn
Token ID3521
Feature activation+1.26e-02

Pos logit contributions
stru+0.010
 clim+0.009
away+0.009
ester+0.009
or+0.009
Neg logit contributions
 spaghetti-0.017
 poems-0.016
 sauce-0.016
 noodles-0.016
 masks-0.015
Tokenâ
Token ID22940
Feature activation+6.94e-04

Pos logit contributions
stru+0.020
away+0.018
 clim+0.018
ester+0.017
ris+0.017
Neg logit contributions
 spaghetti-0.023
 sauce-0.021
 gum-0.021
 poems-0.020
 noodles-0.020
Token
Token ID26391
Feature activation+3.24e-02

Pos logit contributions
stru+0.001
 clim+0.001
ester+0.001
away+0.001
ris+0.001
Neg logit contributions
 spaghetti-0.001
 sauce-0.001
 poems-0.001
 gum-0.001
 noodles-0.001
Token
Token ID8151
Feature activation+6.68e-02

Pos logit contributions
stru+0.054
away+0.049
 clim+0.049
ester+0.047
or+0.046
Neg logit contributions
 spaghetti-0.054
 poems-0.052
 sauce-0.050
 noodles-0.048
 masks-0.048
Tokent
Token ID83
Feature activation-3.90e-03

Pos logit contributions
stru+0.080
 clim+0.071
away+0.071
 adventurer+0.068
 explorer+0.066
Neg logit contributions
 spaghetti-0.135
 sauce-0.127
 gum-0.126
bat-0.124
 poems-0.124
Token find
Token ID1064
Feature activation-2.32e-03

Pos logit contributions
 spaghetti+0.007
 sauce+0.007
 noodles+0.006
 knit+0.006
 poems+0.006
Neg logit contributions
stru-0.006
ester-0.005
ris-0.005
or-0.005
 clim-0.005
Token anything
Token ID1997
Feature activation+2.20e-03

Pos logit contributions
 spaghetti+0.004
 sauce+0.003
 gum+0.003
 noodles+0.003
 yarn+0.003
Neg logit contributions
stru-0.005
ester-0.004
ris-0.004
or-0.004
imp-0.004
Token.
Token ID13
Feature activation-7.24e-04

Pos logit contributions
stru+0.002
 clim+0.002
away+0.002
or+0.002
ester+0.002
Neg logit contributions
 spaghetti-0.005
 poems-0.005
 sauce-0.005
 noodles-0.005
 masks-0.004
Token
Token ID198
Feature activation-2.18e-03

Pos logit contributions
 spaghetti+0.001
 poems+0.001
 sauce+0.001
 noodles+0.001
 masks+0.001
Neg logit contributions
stru-0.001
 clim-0.001
away-0.001
or-0.001
ester-0.001
Token so
Token ID523
Feature activation+4.25e-03

Pos logit contributions
 poems+0.028
 spaghetti+0.028
aghetti+0.028
ccoli+0.028
 sauce+0.027
Neg logit contributions
stru-0.010
 of-0.010
 year-0.009
 forward-0.009
 sadly-0.009
Token Mama
Token ID40084
Feature activation-1.96e-03

Pos logit contributions
stru+0.005
 of+0.004
 clim+0.004
 forward+0.004
 year+0.004
Neg logit contributions
 poems-0.009
 spaghetti-0.009
 sauce-0.009
ems-0.009
 masks-0.009
Token said
Token ID531
Feature activation+8.63e-03

Pos logit contributions
 poems+0.004
 spaghetti+0.004
 elbows+0.004
 noodles+0.004
 sauce+0.004
Neg logit contributions
stru-0.003
 of-0.002
ester-0.002
 clim-0.002
 returned-0.002
Token,
Token ID11
Feature activation-2.04e-02

Pos logit contributions
stru+0.007
 clim+0.006
 of+0.006
or+0.005
 forward+0.005
Neg logit contributions
 spaghetti-0.021
 poems-0.021
 sauce-0.020
 masks-0.020
 noodles-0.020
Token "
Token ID366
Feature activation+1.38e-02

Pos logit contributions
 spaghetti+0.042
 poems+0.042
 elbows+0.042
 masks+0.042
irms+0.041
Neg logit contributions
stru-0.024
 forward-0.021
 of-0.020
 clim-0.019
or-0.019
TokenSarah
Token ID29284
Feature activation+5.07e-02

Pos logit contributions
stru+0.021
away+0.018
or+0.018
 ever+0.017
 year+0.017
Neg logit contributions
 spaghetti-0.027
 poems-0.026
 masks-0.025
 noodles-0.025
 sauce-0.025
Token,
Token ID11
Feature activation-9.80e-03

Pos logit contributions
stru+0.040
 clim+0.033
ester+0.029
or+0.029
away+0.028
Neg logit contributions
 spaghetti-0.128
 poems-0.126
 sauce-0.122
 noodles-0.121
 masks-0.121
Token it
Token ID340
Feature activation-1.84e-02

Pos logit contributions
 poems+0.020
 spaghetti+0.020
bat+0.019
 elbows+0.019
 masks+0.019
Neg logit contributions
stru-0.012
 of-0.010
 clim-0.010
 ever-0.009
away-0.009
Token is
Token ID318
Feature activation+1.68e-03

Pos logit contributions
 spaghetti+0.039
 poems+0.038
 sauce+0.036
 noodles+0.036
 elbows+0.036
Neg logit contributions
stru-0.023
 clim-0.020
 of-0.018
or-0.018
ester-0.018
Token selfish
Token ID20363
Feature activation-2.84e-02

Pos logit contributions
 ever+0.002
 of+0.002
 forward+0.002
 year+0.001
 able+0.001
Neg logit contributions
bat-0.004
 elbows-0.004
ems-0.004
 shirts-0.004
 poems-0.003
Token not
Token ID407
Feature activation-3.90e-03

Pos logit contributions
 spaghetti+0.066
 poems+0.066
 elbows+0.065
bat+0.063
 noodles+0.063
Neg logit contributions
 of-0.023
stru-0.022
 clim-0.020
away-0.019
--0.019
Token said
Token ID531
Feature activation+1.61e-02

Pos logit contributions
 spaghetti+0.005
 poems+0.005
 sauce+0.005
 noodles+0.004
 gum+0.004
Neg logit contributions
stru-0.005
 clim-0.004
ester-0.004
ris-0.004
or-0.004
Token "
Token ID366
Feature activation+1.17e-02

Pos logit contributions
stru+0.014
 clim+0.012
or+0.010
away+0.010
 of+0.010
Neg logit contributions
 spaghetti-0.039
 poems-0.038
 sauce-0.038
 noodles-0.037
 masks-0.037
TokenLet
Token ID5756
Feature activation+7.54e-03

Pos logit contributions
stru+0.017
 year+0.016
away+0.015
 ever+0.015
 clim+0.014
Neg logit contributions
 masks-0.023
 poems-0.023
 noodles-0.022
 spaghetti-0.022
 sauce-0.022
Tokenâ
Token ID22940
Feature activation-7.68e-03

Pos logit contributions
stru+0.010
 clim+0.008
away+0.008
or+0.008
ester+0.008
Neg logit contributions
 spaghetti-0.015
 poems-0.015
 sauce-0.015
 noodles-0.015
 masks-0.014
Token
Token ID26391
Feature activation+3.07e-02

Pos logit contributions
 spaghetti+0.013
 sauce+0.013
 poems+0.013
 gum+0.012
 noodles+0.012
Neg logit contributions
stru-0.012
 clim-0.011
away-0.010
ester-0.010
ris-0.010
Token
Token ID8151
Feature activation+5.60e-02

Pos logit contributions
stru+0.051
away+0.048
ively+0.046
 clim+0.046
or+0.045
Neg logit contributions
 spaghetti-0.051
 poems-0.047
 sauce-0.046
bat-0.045
 masks-0.045
Tokens
Token ID82
Feature activation+3.51e-03

Pos logit contributions
stru+0.088
away+0.085
 clim+0.081
 adventurer+0.081
 explorer+0.079
Neg logit contributions
 spaghetti-0.092
bat-0.086
 sauce-0.086
 gum-0.084
 dough-0.082
Token go
Token ID467
Feature activation+5.64e-03

Pos logit contributions
stru+0.004
 clim+0.004
away+0.004
or+0.004
ris+0.003
Neg logit contributions
 spaghetti-0.007
 poems-0.007
 sauce-0.007
 noodles-0.007
 masks-0.007
Token buy
Token ID2822
Feature activation+3.91e-03

Pos logit contributions
stru+0.010
 clim+0.009
spe+0.009
ively+0.008
year+0.008
Neg logit contributions
 spaghetti-0.010
 sauce-0.009
 masks-0.009
 gum-0.008
 knit-0.008
Token some
Token ID617
Feature activation-1.81e-02

Pos logit contributions
stru+0.006
 of+0.005
or+0.005
away+0.005
-+0.005
Neg logit contributions
 poems-0.006
ccoli-0.006
 elbows-0.006
ems-0.006
 spaghetti-0.006
Token tea
Token ID8887
Feature activation-1.46e-02

Pos logit contributions
 poems+0.023
 spaghetti+0.023
ccoli+0.022
 elbows+0.022
ems+0.022
Neg logit contributions
stru-0.033
 clim-0.032
 of-0.031
away-0.031
ester-0.030
Token were
Token ID547
Feature activation+6.11e-03

Pos logit contributions
 spaghetti+0.008
 poems+0.008
 sauce+0.007
 noodles+0.007
 gum+0.007
Neg logit contributions
stru-0.006
 clim-0.006
away-0.006
or-0.005
ester-0.005
Token breath
Token ID8033
Feature activation+1.99e-02

Pos logit contributions
stru+0.009
 clim+0.008
away+0.008
or+0.007
ester+0.007
Neg logit contributions
 spaghetti-0.012
 poems-0.011
 sauce-0.011
 noodles-0.011
 masks-0.011
Tokenless
Token ID1203
Feature activation-1.15e-02

Pos logit contributions
stru+0.024
 clim+0.021
away+0.020
ester+0.020
or+0.019
Neg logit contributions
 spaghetti-0.042
 poems-0.040
 sauce-0.040
 noodles-0.039
 masks-0.038
Token.
Token ID13
Feature activation-2.77e-03

Pos logit contributions
 spaghetti+0.027
 poems+0.027
 sauce+0.026
 noodles+0.026
ems+0.026
Neg logit contributions
stru-0.009
 clim-0.009
 ever-0.008
 of-0.008
 forward-0.008
Token G
Token ID402
Feature activation+2.14e-02

Pos logit contributions
 spaghetti+0.005
 poems+0.005
 sauce+0.004
 noodles+0.004
 masks+0.004
Neg logit contributions
stru-0.004
 clim-0.004
ris-0.004
or-0.004
away-0.004
Tokenail
Token ID603
Feature activation+6.89e-03

Pos logit contributions
stru+0.026
away+0.023
 clim+0.023
 forward+0.021
 victorious+0.021
Neg logit contributions
 spaghetti-0.042
 sauce-0.041
 poems-0.040
ccoli-0.040
ems-0.040
Token smiled
Token ID13541
Feature activation+1.31e-02

Pos logit contributions
stru+0.010
 clim+0.009
away+0.009
ester+0.009
or+0.008
Neg logit contributions
 spaghetti-0.013
 poems-0.012
 sauce-0.012
 noodles-0.012
 masks-0.012
Token at
Token ID379
Feature activation+2.42e-02

Pos logit contributions
stru+0.016
 clim+0.014
away+0.013
ester+0.013
or+0.013
Neg logit contributions
 spaghetti-0.028
 poems-0.027
 sauce-0.027
 noodles-0.026
 masks-0.026
Token Sue
Token ID26089
Feature activation+1.16e-02

Pos logit contributions
stru+0.039
 clim+0.036
away+0.034
ester+0.034
ris+0.033
Neg logit contributions
 spaghetti-0.041
 poems-0.040
 sauce-0.039
 masks-0.038
 noodles-0.037
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
stru+0.012
 clim+0.011
ester+0.010
or+0.010
ris+0.010
Neg logit contributions
 spaghetti-0.026
 poems-0.025
 sauce-0.025
 noodles-0.025
 masks-0.024
Token
Token ID6184
Feature activation-1.06e-02

Pos logit contributions
 spaghetti+0.025
 poems+0.024
 noodles+0.024
 sauce+0.023
 masks+0.023
Neg logit contributions
stru-0.018
 clim-0.016
or-0.015
ur-0.015
ester-0.015
Tokena
Token ID64
Feature activation-8.05e-03

Pos logit contributions
 spaghetti+0.077
 poems+0.073
 sauce+0.072
 noodles+0.070
 masks+0.069
Neg logit contributions
stru-0.042
 clim-0.038
or-0.035
away-0.035
ester-0.034
Token stepped
Token ID10764
Feature activation+1.98e-02

Pos logit contributions
 spaghetti+0.016
 poems+0.015
 sauce+0.015
 noodles+0.015
 gum+0.015
Neg logit contributions
stru-0.011
 clim-0.010
away-0.009
or-0.009
ester-0.009
Token out
Token ID503
Feature activation+2.61e-03

Pos logit contributions
stru+0.023
 clim+0.021
ester+0.019
away+0.019
ris+0.019
Neg logit contributions
 spaghetti-0.043
 poems-0.041
 sauce-0.041
 noodles-0.040
 masks-0.039
Token and
Token ID290
Feature activation-7.25e-03

Pos logit contributions
stru+0.003
 clim+0.002
ester+0.002
or+0.002
ris+0.002
Neg logit contributions
 spaghetti-0.006
 poems-0.006
 sauce-0.006
 noodles-0.006
 masks-0.006
Token began
Token ID2540
Feature activation+5.50e-03

Pos logit contributions
 spaghetti+0.015
 poems+0.015
 sauce+0.014
 noodles+0.014
aghetti+0.014
Neg logit contributions
stru-0.010
 clim-0.008
away-0.008
 forward-0.008
 returned-0.007
Token to
Token ID284
Feature activation+3.09e-03

Pos logit contributions
stru+0.006
 clim+0.005
away+0.005
or+0.004
ester+0.004
Neg logit contributions
 spaghetti-0.013
 poems-0.012
 sauce-0.012
 noodles-0.012
 gum-0.012
Token speak
Token ID2740
Feature activation+1.50e-02

Pos logit contributions
stru+0.004
 clim+0.004
 reun+0.003
away+0.003
 return+0.003
Neg logit contributions
 spaghetti-0.007
 poems-0.006
 sauce-0.006
 noodles-0.006
aghetti-0.006
Token.
Token ID13
Feature activation-5.27e-03

Pos logit contributions
stru+0.015
away+0.014
 clim+0.013
 forward+0.013
 of+0.012
Neg logit contributions
 spaghetti-0.035
 sauce-0.034
 poems-0.033
 noodles-0.033
 gum-0.033
Token
Token ID198
Feature activation+4.71e-03

Pos logit contributions
 spaghetti+0.010
 poems+0.010
 sauce+0.010
 noodles+0.009
 masks+0.009
Neg logit contributions
stru-0.007
 clim-0.006
or-0.006
away-0.006
ris-0.006
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
stru+0.007
 clim+0.006
away+0.006
ester+0.006
or+0.006
Neg logit contributions
 spaghetti-0.009
 poems-0.009
 sauce-0.008
 noodles-0.008
 masks-0.008
TokenThe
Token ID464
Feature activation-6.35e-03

Pos logit contributions
stru+0.016
 clim+0.014
away+0.014
or+0.013
ris+0.013
Neg logit contributions
 spaghetti-0.023
 poems-0.022
 sauce-0.021
 noodles-0.021
 masks-0.021
Token the
Token ID262
Feature activation+6.62e-03

Pos logit contributions
stru+0.032
 clim+0.031
 of+0.030
 ever+0.028
 year+0.027
Neg logit contributions
 spaghetti-0.079
 poems-0.079
bat-0.077
 sauce-0.077
 elbows-0.077
Token human
Token ID1692
Feature activation+2.54e-04

Pos logit contributions
 of+0.009
stru+0.008
 year+0.008
-+0.008
 forward+0.008
Neg logit contributions
 elbows-0.012
bat-0.011
ems-0.011
 shorts-0.011
ccoli-0.011
Token went
Token ID1816
Feature activation+7.34e-03

Pos logit contributions
stru+0.000
 clim+0.000
or+0.000
 year+0.000
ester+0.000
Neg logit contributions
 spaghetti-0.001
 poems-0.001
 sauce-0.001
 gum-0.001
 mustard-0.001
Token was
Token ID373
Feature activation+6.84e-03

Pos logit contributions
stru+0.004
 clim+0.004
away+0.004
 of+0.003
 forward+0.003
Neg logit contributions
 spaghetti-0.020
 poems-0.019
ccoli-0.019
 sauce-0.019
 apr-0.019
Token so
Token ID523
Feature activation+6.75e-03

Pos logit contributions
stru+0.008
 clim+0.007
 of+0.007
 ever+0.007
 forward+0.006
Neg logit contributions
ems-0.014
 spaghetti-0.014
 poems-0.014
 elbows-0.013
 noodles-0.013
Token beautiful
Token ID4950
Feature activation+6.75e-03

Pos logit contributions
stru+0.009
 clim+0.009
away+0.008
ris+0.008
or+0.008
Neg logit contributions
 spaghetti-0.013
 poems-0.012
 sauce-0.012
 noodles-0.012
 masks-0.011
Token and
Token ID290
Feature activation+7.90e-03

Pos logit contributions
stru+0.006
 clim+0.006
away+0.005
 of+0.005
ester+0.005
Neg logit contributions
 spaghetti-0.015
 poems-0.015
 sauce-0.015
 elbows-0.015
ccoli-0.015
Token colorful
Token ID20239
Feature activation-3.32e-03

Pos logit contributions
stru+0.010
 clim+0.009
 of+0.009
 ever+0.008
 forward+0.008
Neg logit contributions
 spaghetti-0.015
 elbows-0.015
 poems-0.015
ems-0.015
ccoli-0.015
Token.
Token ID13
Feature activation-6.48e-03

Pos logit contributions
 spaghetti+0.008
 poems+0.007
ccoli+0.007
 sauce+0.007
 elbows+0.007
Neg logit contributions
stru-0.003
 clim-0.003
 of-0.003
away-0.003
ester-0.002
Token The
Token ID383
Feature activation+7.82e-04

Pos logit contributions
 poems+0.014
 masks+0.014
 elbows+0.014
 spaghetti+0.014
 sauce+0.014
Neg logit contributions
stru-0.006
 clim-0.006
 of-0.006
ris-0.005
 year-0.005
Token human
Token ID1692
Feature activation-9.70e-03

Pos logit contributions
stru+0.001
 forward+0.001
 of+0.001
 clim+0.001
 year+0.001
Neg logit contributions
bat-0.001
 elbows-0.001
ems-0.001
ccoli-0.001
 shorts-0.001
Token "
Token ID366
Feature activation+5.74e-03

Pos logit contributions
bat+0.057
aghetti+0.056
 spaghetti+0.056
 noodles+0.056
 poems+0.056
Neg logit contributions
 of-0.014
 where-0.013
 sadly-0.013
:-0.013
stru-0.012
TokenNo
Token ID2949
Feature activation-9.62e-04

Pos logit contributions
stru+0.008
away+0.007
 clim+0.007
or+0.007
ester+0.007
Neg logit contributions
 spaghetti-0.011
 noodles-0.011
ccoli-0.011
 sauce-0.010
 poems-0.010
Token,
Token ID11
Feature activation+1.70e-03

Pos logit contributions
 spaghetti+0.003
 poems+0.002
 sauce+0.002
 noodles+0.002
bat+0.002
Neg logit contributions
stru-0.001
 clim-0.000
away-0.000
or-0.000
 of-0.000
Token you
Token ID345
Feature activation-1.39e-03

Pos logit contributions
stru+0.002
 clim+0.002
away+0.002
ester+0.002
 of+0.002
Neg logit contributions
 spaghetti-0.003
 poems-0.003
 sauce-0.003
 noodles-0.003
bat-0.003
Token won
Token ID1839
Feature activation-1.54e-02

Pos logit contributions
 spaghetti+0.002
 poems+0.002
 sauce+0.002
 noodles+0.002
 masks+0.002
Neg logit contributions
stru-0.002
 clim-0.002
away-0.002
ester-0.002
or-0.002
Token't
Token ID470
Feature activation+3.49e-03

Pos logit contributions
 spaghetti+0.035
 poems+0.034
 sauce+0.033
 noodles+0.033
 masks+0.032
Neg logit contributions
stru-0.015
 clim-0.014
away-0.012
ester-0.012
or-0.012
Token.
Token ID13
Feature activation+2.76e-02

Pos logit contributions
stru+0.004
 clim+0.004
away+0.004
or+0.003
ester+0.003
Neg logit contributions
 spaghetti-0.008
 poems-0.007
 sauce-0.007
 noodles-0.007
 masks-0.007
Token You
Token ID921
Feature activation+2.45e-02

Pos logit contributions
stru+0.040
 clim+0.038
ester+0.035
or+0.035
away+0.034
Neg logit contributions
 spaghetti-0.049
 poems-0.048
 masks-0.046
 sauce-0.046
 noodles-0.046
Token can
Token ID460
Feature activation-1.83e-03

Pos logit contributions
stru+0.037
 clim+0.034
away+0.032
or+0.032
ris+0.031
Neg logit contributions
 spaghetti-0.045
 poems-0.043
 sauce-0.042
 noodles-0.041
 masks-0.040
Token't
Token ID470
Feature activation+1.35e-03

Pos logit contributions
 spaghetti+0.004
 poems+0.004
 sauce+0.004
 noodles+0.003
 masks+0.003
Neg logit contributions
stru-0.002
 clim-0.002
away-0.002
ester-0.002
or-0.002
Token do
Token ID466
Feature activation-2.07e-03

Pos logit contributions
stru+0.002
 clim+0.002
away+0.002
or+0.002
ris+0.001
Neg logit contributions
 spaghetti-0.003
 poems-0.003
 sauce-0.003
 noodles-0.003
 masks-0.002
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
stru+0.012
 clim+0.011
away+0.011
ester+0.010
or+0.010
Neg logit contributions
 spaghetti-0.029
 poems-0.028
 sauce-0.027
 noodles-0.027
 masks-0.027
Token
Token ID198
Feature activation+1.09e-02

Pos logit contributions
 poems+0.034
 spaghetti+0.032
 noodles+0.032
 masks+0.032
 sauce+0.031
Neg logit contributions
stru-0.014
 of-0.012
 clim-0.012
ur-0.011
way-0.011
Token
Token ID198
Feature activation+1.41e-02

Pos logit contributions
stru+0.015
 clim+0.014
away+0.013
ester+0.013
or+0.013
Neg logit contributions
 spaghetti-0.021
 poems-0.020
 sauce-0.020
 noodles-0.019
 masks-0.019
TokenTim
Token ID14967
Feature activation+3.31e-03

Pos logit contributions
stru+0.021
 clim+0.019
away+0.019
or+0.018
ris+0.018
Neg logit contributions
 spaghetti-0.026
 poems-0.025
 noodles-0.024
 sauce-0.024
 masks-0.024
Tokenmy
Token ID1820
Feature activation-4.98e-04

Pos logit contributions
stru+0.004
ris+0.004
ester+0.004
 clim+0.004
-+0.004
Neg logit contributions
 spaghetti-0.006
 salsa-0.006
 poems-0.006
 noodles-0.006
 mustard-0.006
Token looked
Token ID3114
Feature activation+1.47e-02

Pos logit contributions
 spaghetti+0.001
 poems+0.001
 sauce+0.001
 noodles+0.001
 gum+0.001
Neg logit contributions
stru-0.001
 clim-0.001
ris-0.001
ester-0.001
or-0.001
Token at
Token ID379
Feature activation+2.89e-02

Pos logit contributions
stru+0.022
 clim+0.022
handed+0.021
ris+0.021
ively+0.020
Neg logit contributions
 spaghetti-0.027
 poems-0.024
 masks-0.024
 sauce-0.024
 noodles-0.023
Token the
Token ID262
Feature activation+3.28e-03

Pos logit contributions
stru+0.063
 clim+0.056
or+0.053
ris+0.053
ester+0.053
Neg logit contributions
 spaghetti-0.039
 yarn-0.037
 sauce-0.036
 poems-0.034
 masks-0.034
Token envelope
Token ID22878
Feature activation+1.60e-02

Pos logit contributions
stru+0.006
 clim+0.006
ester+0.006
or+0.006
ur+0.006
Neg logit contributions
 spaghetti-0.005
 poems-0.005
 sauce-0.004
 masks-0.004
 gum-0.004
Token and
Token ID290
Feature activation+1.15e-03

Pos logit contributions
stru+0.018
 clim+0.016
away+0.015
or+0.014
ester+0.014
Neg logit contributions
 spaghetti-0.035
 poems-0.034
 sauce-0.033
 noodles-0.033
 masks-0.032
Token said
Token ID531
Feature activation+8.38e-03

Pos logit contributions
stru+0.002
 clim+0.001
away+0.001
 luckily+0.001
ester+0.001
Neg logit contributions
 spaghetti-0.002
 poems-0.002
 sauce-0.002
 noodles-0.002
 gum-0.002
Token poured
Token ID19036
Feature activation-2.17e-02

Pos logit contributions
 poems+0.012
bat+0.012
 spaghetti+0.012
ccoli+0.011
 elbows+0.011
Neg logit contributions
stru-0.009
 clim-0.008
 of-0.008
 returned-0.008
ester-0.007
Token some
Token ID617
Feature activation-1.97e-02

Pos logit contributions
 poems+0.048
ems+0.046
 spaghetti+0.046
ccoli+0.046
 masks+0.045
Neg logit contributions
 of-0.021
stru-0.021
 forward-0.020
or-0.019
away-0.019
Token wine
Token ID8237
Feature activation-2.16e-02

Pos logit contributions
bat+0.036
 poems+0.036
ccoli+0.035
 elbows+0.035
ems+0.034
Neg logit contributions
 of-0.028
stru-0.025
away-0.025
--0.024
or-0.023
Token on
Token ID319
Feature activation+1.40e-02

Pos logit contributions
 spaghetti+0.050
 poems+0.049
 noodles+0.047
 masks+0.047
 sauce+0.046
Neg logit contributions
stru-0.020
 clim-0.018
away-0.017
or-0.016
 of-0.016
Token Ben
Token ID3932
Feature activation-1.12e-02

Pos logit contributions
away+0.011
 of+0.010
stru+0.010
 clim+0.010
or+0.010
Neg logit contributions
ems-0.031
 poems-0.031
bat-0.031
ccoli-0.031
 elbows-0.030
Token's
Token ID338
Feature activation-4.69e-03

Pos logit contributions
 spaghetti+0.025
 elbows+0.025
 poems+0.025
bat+0.025
irms+0.025
Neg logit contributions
stru-0.009
 of-0.009
--0.008
or-0.008
 clim-0.008
Token tower
Token ID10580
Feature activation-1.21e-03

Pos logit contributions
bat+0.007
ems+0.007
 spaghetti+0.007
 poems+0.007
 masks+0.006
Neg logit contributions
 of-0.007
stru-0.007
 clim-0.007
away-0.007
--0.007
Token.
Token ID13
Feature activation-1.62e-02

Pos logit contributions
 spaghetti+0.003
 poems+0.003
 sauce+0.003
 noodles+0.003
 masks+0.003
Neg logit contributions
stru-0.001
 clim-0.001
away-0.001
ris-0.001
ester-0.001
Token The
Token ID383
Feature activation-1.06e-02

Pos logit contributions
 poems+0.037
 masks+0.035
 spaghetti+0.035
 noodles+0.035
 elbows+0.035
Neg logit contributions
stru-0.015
 of-0.014
or-0.012
 clim-0.012
 forward-0.012
Token wine
Token ID8237
Feature activation-3.41e-02

Pos logit contributions
bat+0.014
 elbows+0.013
ems+0.013
ccoli+0.013
ksh+0.013
Neg logit contributions
 of-0.020
 year-0.018
 clim-0.017
 prize-0.017
 return-0.017
Token made
Token ID925
Feature activation-1.96e-02

Pos logit contributions
 poems+0.073
 spaghetti+0.072
 elbows+0.070
ems+0.070
bat+0.070
Neg logit contributions
 of-0.033
stru-0.033
 ever-0.032
 clim-0.031
 returned-0.028
Token it
Token ID340
Feature activation-1.78e-02

Pos logit contributions
stru+0.011
 forward+0.011
away+0.011
 of+0.011
or+0.010
Neg logit contributions
 poems-0.028
 spaghetti-0.028
ems-0.028
 elbows-0.028
 fiber-0.028
Token forever
Token ID8097
Feature activation-2.42e-03

Pos logit contributions
 spaghetti+0.037
 poems+0.037
 sauce+0.036
 gum+0.035
 noodles+0.035
Neg logit contributions
stru-0.021
 clim-0.019
away-0.019
 of-0.018
 forward-0.018
Token.
Token ID13
Feature activation-5.42e-03

Pos logit contributions
 spaghetti+0.006
 poems+0.006
 sauce+0.005
 noodles+0.005
 masks+0.005
Neg logit contributions
stru-0.002
 clim-0.002
away-0.002
 of-0.002
ris-0.002
Token<|endoftext|>
Token ID50256
Feature activation-2.07e-02

Pos logit contributions
 spaghetti+0.009
 poems+0.009
 sauce+0.009
 noodles+0.008
 gum+0.008
Neg logit contributions
stru-0.009
 clim-0.008
away-0.008
ester-0.008
or-0.007
TokenOne
Token ID3198
Feature activation-8.63e-03

Pos logit contributions
 poems+0.042
 spaghetti+0.041
 sauce+0.039
 noodles+0.039
 masks+0.038
Neg logit contributions
stru-0.026
 clim-0.023
or-0.023
ris-0.023
 of-0.022
Token dark
Token ID3223
Feature activation-1.96e-02

Pos logit contributions
 spaghetti+0.019
 poems+0.019
 sauce+0.018
 noodles+0.018
 masks+0.018
Neg logit contributions
stru-0.010
 clim-0.009
away-0.008
ester-0.008
ris-0.008
Token night
Token ID1755
Feature activation-8.57e-04

Pos logit contributions
 poems+0.042
 spaghetti+0.041
 sauce+0.040
 elbows+0.039
 noodles+0.039
Neg logit contributions
stru-0.021
 of-0.021
 clim-0.020
 year-0.020
way-0.019
Token,
Token ID11
Feature activation+1.68e-03

Pos logit contributions
 poems+0.002
 spaghetti+0.002
 elbows+0.002
ccoli+0.002
ems+0.002
Neg logit contributions
stru-0.000
 of-0.000
 clim-0.000
or-0.000
way-0.000
Token Jack
Token ID3619
Feature activation+1.07e-03

Pos logit contributions
stru+0.002
 clim+0.002
 of+0.002
 ever+0.002
 year+0.002
Neg logit contributions
 poems-0.003
ems-0.003
 spaghetti-0.003
 sauce-0.003
 noodles-0.003
Token gazed
Token ID50255
Feature activation+4.25e-02

Pos logit contributions
stru+0.001
 clim+0.001
away+0.001
ester+0.001
or+0.001
Neg logit contributions
 spaghetti-0.002
 poems-0.002
 sauce-0.002
 noodles-0.002
 masks-0.002
Token up
Token ID510
Feature activation+2.76e-02

Pos logit contributions
stru+0.053
 clim+0.046
ris+0.044
ester+0.043
or+0.042
Neg logit contributions
 spaghetti-0.090
 poems-0.086
 sauce-0.085
 noodles-0.083
 masks-0.082
Token sticker
Token ID27735
Feature activation-2.92e-03

Pos logit contributions
bat+0.003
 elbows+0.003
ems+0.003
ities+0.003
 spaghetti+0.003
Neg logit contributions
 of-0.002
 clim-0.002
stru-0.002
ris-0.002
 taxi-0.002
Token!"
Token ID2474
Feature activation+5.75e-04

Pos logit contributions
 spaghetti+0.006
 elbows+0.006
 poems+0.006
 sauce+0.006
 masks+0.006
Neg logit contributions
 of-0.003
 clim-0.003
stru-0.003
 ever-0.003
ris-0.003
Token\n
Token ID198
Feature activation+1.19e-02

Pos logit contributions
stru+0.001
 of+0.001
 clim+0.001
away+0.001
or+0.000
Neg logit contributions
 spaghetti-0.001
 poems-0.001
 sauce-0.001
 noodles-0.001
ccoli-0.001
Token\n
Token ID198
Feature activation+1.65e-02

Pos logit contributions
stru+0.019
 clim+0.017
away+0.016
ester+0.016
or+0.016
Neg logit contributions
 spaghetti-0.021
 poems-0.020
 sauce-0.020
 masks-0.019
 noodles-0.019
TokenMom
Token ID29252
Feature activation-1.02e-02

Pos logit contributions
stru+0.027
 clim+0.025
away+0.025
or+0.024
ris+0.024
Neg logit contributions
 spaghetti-0.028
 poems-0.027
 sauce-0.026
 noodles-0.025
 masks-0.025
Token and
Token ID290
Feature activation-9.82e-03

Pos logit contributions
 spaghetti+0.022
 poems+0.022
 sauce+0.020
 noodles+0.020
 gum+0.020
Neg logit contributions
stru-0.012
 clim-0.010
ris-0.010
 of-0.010
ester-0.010
Token dad
Token ID9955
Feature activation-3.33e-03

Pos logit contributions
 poems+0.018
 spaghetti+0.018
 sauce+0.017
 noodles+0.017
 gum+0.017
Neg logit contributions
stru-0.013
 clim-0.012
away-0.011
 of-0.011
ester-0.010
Token smiled
Token ID13541
Feature activation+2.00e-02

Pos logit contributions
 spaghetti+0.005
 sauce+0.005
 poems+0.005
 noodles+0.005
 masks+0.005
Neg logit contributions
stru-0.006
 clim-0.005
away-0.005
ester-0.005
or-0.005
Token and
Token ID290
Feature activation-3.06e-03

Pos logit contributions
stru+0.026
 clim+0.023
away+0.022
or+0.022
ester+0.022
Neg logit contributions
 spaghetti-0.041
 poems-0.039
 sauce-0.039
 noodles-0.037
 masks-0.037
Token hugged
Token ID41130
Feature activation+1.15e-02

Pos logit contributions
 spaghetti+0.005
 poems+0.005
 sauce+0.005
 noodles+0.004
 masks+0.004
Neg logit contributions
stru-0.005
 clim-0.005
away-0.005
or-0.005
ester-0.005
Token him
Token ID683
Feature activation-5.67e-03

Pos logit contributions
stru+0.016
away+0.015
 clim+0.015
or+0.014
 forward+0.014
Neg logit contributions
 spaghetti-0.022
 poems-0.021
 sauce-0.021
 noodles-0.020
ccoli-0.020
Token he
Token ID339
Feature activation+2.57e-03

Pos logit contributions
 spaghetti+0.021
 poems+0.020
ccoli+0.020
 elbows+0.020
bat+0.020
Neg logit contributions
stru-0.010
 clim-0.009
 of-0.009
 ever-0.009
 forward-0.008
Token was
Token ID373
Feature activation+1.29e-02

Pos logit contributions
stru+0.004
 clim+0.004
away+0.004
or+0.004
ester+0.004
Neg logit contributions
 spaghetti-0.004
 poems-0.004
 sauce-0.004
 masks-0.004
 noodles-0.004
Token helping
Token ID5742
Feature activation-1.34e-02

Pos logit contributions
stru+0.019
away+0.017
ris+0.017
 clim+0.017
or+0.017
Neg logit contributions
 spaghetti-0.023
 poems-0.022
 sauce-0.022
 gum-0.021
 noodles-0.021
Token people
Token ID661
Feature activation+1.79e-02

Pos logit contributions
 spaghetti+0.025
 poems+0.024
ccoli+0.023
 noodles+0.023
 sauce+0.023
Neg logit contributions
stru-0.019
away-0.018
 of-0.017
or-0.017
 clim-0.016
Token by
Token ID416
Feature activation+2.23e-03

Pos logit contributions
stru+0.021
 clim+0.018
away+0.017
or+0.017
ester+0.016
Neg logit contributions
 spaghetti-0.039
 poems-0.038
 sauce-0.037
 noodles-0.037
 masks-0.036
Token giving
Token ID3501
Feature activation-1.07e-02

Pos logit contributions
stru+0.003
 of+0.003
away+0.003
ively+0.003
 clim+0.003
Neg logit contributions
 spaghetti-0.004
 poems-0.004
ccoli-0.004
 sauce-0.004
 noodles-0.004
Token them
Token ID606
Feature activation+1.09e-02

Pos logit contributions
 spaghetti+0.021
 poems+0.020
 sauce+0.020
 noodles+0.020
ccoli+0.019
Neg logit contributions
stru-0.015
away-0.013
or-0.013
 forward-0.013
 clim-0.012
Token medals
Token ID28057
Feature activation+2.56e-02

Pos logit contributions
stru+0.015
away+0.014
 clim+0.013
or+0.013
 forward+0.013
Neg logit contributions
 spaghetti-0.021
 poems-0.020
 sauce-0.019
 elbows-0.019
 noodles-0.019
Token.
Token ID13
Feature activation+1.78e-04

Pos logit contributions
stru+0.027
away+0.023
 clim+0.023
or+0.022
 of+0.021
Neg logit contributions
 spaghetti-0.057
 poems-0.055
 sauce-0.055
 noodles-0.054
 gum-0.052
Token He
Token ID679
Feature activation-5.23e-03

Pos logit contributions
stru+0.000
 clim+0.000
away+0.000
ester+0.000
or+0.000
Neg logit contributions
 spaghetti-0.000
 sauce-0.000
 poems-0.000
 noodles-0.000
 gum-0.000
Token felt
Token ID2936
Feature activation+9.89e-03

Pos logit contributions
 spaghetti+0.010
 poems+0.010
 sauce+0.010
 noodles+0.009
 masks+0.009
Neg logit contributions
stru-0.007
 clim-0.006
away-0.006
or-0.006
ester-0.006
Token was
Token ID373
Feature activation+2.97e-02

Pos logit contributions
stru+0.008
 clim+0.007
 of+0.007
 year+0.006
away+0.006
Neg logit contributions
 spaghetti-0.016
 poems-0.015
 sauce-0.015
ems-0.015
 elbows-0.015
Token inside
Token ID2641
Feature activation+1.92e-02

Pos logit contributions
stru+0.034
 clim+0.029
away+0.027
ester+0.027
or+0.027
Neg logit contributions
 spaghetti-0.066
 poems-0.063
 sauce-0.063
 noodles-0.061
 masks-0.061
Token.
Token ID13
Feature activation-4.44e-03

Pos logit contributions
stru+0.027
 clim+0.025
away+0.023
ester+0.023
ris+0.023
Neg logit contributions
 spaghetti-0.037
 poems-0.036
 sauce-0.035
 noodles-0.034
 masks-0.034
Token\n
Token ID198
Feature activation+3.66e-03

Pos logit contributions
 poems+0.008
 spaghetti+0.008
 sauce+0.008
 noodles+0.007
 masks+0.007
Neg logit contributions
stru-0.006
 clim-0.006
or-0.006
ester-0.005
away-0.005
Token\n
Token ID198
Feature activation+1.07e-02

Pos logit contributions
stru+0.006
 clim+0.005
away+0.005
ester+0.005
or+0.005
Neg logit contributions
 spaghetti-0.006
 poems-0.006
 sauce-0.006
 masks-0.006
 gum-0.006
TokenThey
Token ID2990
Feature activation-1.67e-02

Pos logit contributions
stru+0.015
 clim+0.014
away+0.013
or+0.013
ris+0.013
Neg logit contributions
 spaghetti-0.020
 poems-0.020
 sauce-0.019
 noodles-0.019
 masks-0.018
Token opened
Token ID4721
Feature activation+1.68e-02

Pos logit contributions
 spaghetti+0.032
 poems+0.030
 sauce+0.030
 noodles+0.029
 masks+0.029
Neg logit contributions
stru-0.024
 clim-0.021
away-0.021
ester-0.020
or-0.020
Token the
Token ID262
Feature activation+5.23e-03

Pos logit contributions
stru+0.025
 clim+0.022
away+0.022
ester+0.021
or+0.021
Neg logit contributions
 spaghetti-0.031
 poems-0.030
 sauce-0.030
 noodles-0.029
 masks-0.028
Token big
Token ID1263
Feature activation-6.44e-03

Pos logit contributions
stru+0.007
 clim+0.007
away+0.006
or+0.006
ris+0.006
Neg logit contributions
 spaghetti-0.010
 poems-0.010
 sauce-0.009
 noodles-0.009
 gum-0.009
Token door
Token ID3420
Feature activation+1.60e-02

Pos logit contributions
 poems+0.012
 spaghetti+0.012
ccoli+0.012
aghetti+0.011
 sauce+0.011
Neg logit contributions
stru-0.008
away-0.007
 clim-0.007
--0.007
 of-0.007
Token and
Token ID290
Feature activation+9.10e-03

Pos logit contributions
stru+0.024
 clim+0.022
ester+0.021
away+0.021
or+0.021
Neg logit contributions
 spaghetti-0.029
 poems-0.027
 sauce-0.027
 masks-0.026
 noodles-0.026
Token We
Token ID775
Feature activation+1.87e-02

Pos logit contributions
stru+0.026
 clim+0.023
away+0.022
ris+0.021
or+0.021
Neg logit contributions
 spaghetti-0.044
 poems-0.043
 sauce-0.041
 masks-0.041
 noodles-0.041
Token are
Token ID389
Feature activation-7.13e-03

Pos logit contributions
stru+0.023
 clim+0.022
ris+0.021
 ever+0.021
or+0.020
Neg logit contributions
 spaghetti-0.039
 poems-0.038
 noodles-0.037
 sauce-0.035
 masks-0.034
Token sorry
Token ID7926
Feature activation-7.15e-03

Pos logit contributions
bat+0.014
 spaghetti+0.014
ems+0.014
 poems+0.014
 noodles+0.014
Neg logit contributions
 clim-0.008
stru-0.008
 able-0.007
 ever-0.007
 lucky-0.007
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
 spaghetti+0.017
 poems+0.017
 elbows+0.017
 apr+0.017
 sauce+0.017
Neg logit contributions
 of-0.006
stru-0.005
 clim-0.005
--0.005
 forward-0.005
Token Please
Token ID4222
Feature activation-3.93e-02

Pos logit contributions
stru+0.024
 clim+0.021
away+0.021
ris+0.020
or+0.020
Neg logit contributions
 spaghetti-0.036
 poems-0.035
 sauce-0.034
 noodles-0.033
 masks-0.033
Token forgive
Token ID20927
Feature activation-4.12e-02

Pos logit contributions
 spaghetti+0.087
 poems+0.085
 noodles+0.081
 sauce+0.080
 masks+0.078
Neg logit contributions
stru-0.045
 clim-0.040
away-0.037
or-0.037
ester-0.037
Token us
Token ID514
Feature activation-4.38e-03

Pos logit contributions
 spaghetti+0.086
ccoli+0.085
 noodles+0.083
bows+0.081
 poems+0.081
Neg logit contributions
--0.040
away-0.037
or-0.033
 of-0.033
ively-0.032
Token."
Token ID526
Feature activation-3.23e-04

Pos logit contributions
 spaghetti+0.009
 poems+0.009
 sauce+0.009
 noodles+0.009
 masks+0.009
Neg logit contributions
stru-0.005
 clim-0.005
away-0.004
ester-0.004
or-0.004
Token\n
Token ID198
Feature activation+1.26e-02

Pos logit contributions
 spaghetti+0.001
 poems+0.001
 sauce+0.001
 noodles+0.001
 masks+0.001
Neg logit contributions
stru-0.000
 clim-0.000
away-0.000
ris-0.000
ester-0.000
Token\n
Token ID198
Feature activation+2.12e-02

Pos logit contributions
stru+0.018
 clim+0.017
away+0.016
ester+0.016
or+0.015
Neg logit contributions
 spaghetti-0.024
 poems-0.023
 sauce-0.022
 noodles-0.021
 gum-0.021
TokenThe
Token ID464
Feature activation-1.94e-02

Pos logit contributions
stru+0.031
 clim+0.027
away+0.027
ris+0.026
ester+0.026
Neg logit contributions
 spaghetti-0.041
 poems-0.039
 sauce-0.038
 noodles-0.037
 masks-0.036
Token and
Token ID290
Feature activation+6.34e-03

Pos logit contributions
stru+0.001
 of+0.001
 clim+0.001
away+0.001
 forward+0.001
Neg logit contributions
 spaghetti-0.003
 poems-0.003
 sauce-0.003
 elbows-0.003
 noodles-0.003
Token finding
Token ID4917
Feature activation+4.55e-03

Pos logit contributions
stru+0.009
 clim+0.008
away+0.007
 forward+0.007
ester+0.007
Neg logit contributions
 spaghetti-0.013
 poems-0.012
 sauce-0.012
 noodles-0.012
 masks-0.011
Token shortcuts
Token ID32953
Feature activation+1.57e-03

Pos logit contributions
stru+0.007
away+0.006
 clim+0.006
or+0.006
ris+0.006
Neg logit contributions
 spaghetti-0.008
 poems-0.008
 sauce-0.008
 noodles-0.008
bat-0.008
Token.
Token ID13
Feature activation-7.32e-03

Pos logit contributions
stru+0.002
 clim+0.002
away+0.002
or+0.001
 of+0.001
Neg logit contributions
 spaghetti-0.003
 poems-0.003
 sauce-0.003
 noodles-0.003
 gum-0.003
Token The
Token ID383
Feature activation-2.05e-03

Pos logit contributions
 spaghetti+0.014
 poems+0.014
 sauce+0.014
 noodles+0.013
 masks+0.013
Neg logit contributions
stru-0.010
 clim-0.009
away-0.008
ester-0.008
ris-0.008
Token end
Token ID886
Feature activation-4.34e-02

Pos logit contributions
 spaghetti+0.005
 poems+0.004
 sauce+0.004
 noodles+0.004
ccoli+0.004
Neg logit contributions
stru-0.002
 clim-0.002
away-0.002
ester-0.002
 forward-0.002
Token.
Token ID13
Feature activation-6.65e-03

Pos logit contributions
 spaghetti+0.098
 poems+0.095
 sauce+0.094
 noodles+0.093
 masks+0.091
Neg logit contributions
stru-0.045
 clim-0.040
away-0.038
ris-0.038
 of-0.036
Token<|endoftext|>
Token ID50256
Feature activation-2.10e-02

Pos logit contributions
 spaghetti+0.012
 poems+0.012
 sauce+0.012
 noodles+0.011
 gum+0.011
Neg logit contributions
stru-0.010
 clim-0.009
away-0.009
or-0.008
ester-0.008
TokenOnce
Token ID7454
Feature activation+1.25e-02

Pos logit contributions
 spaghetti+0.042
 poems+0.041
 sauce+0.040
 noodles+0.039
 masks+0.039
Neg logit contributions
stru-0.027
 clim-0.024
away-0.023
ris-0.023
or-0.023
Token upon
Token ID2402
Feature activation-2.12e-02

Pos logit contributions
stru+0.025
 clim+0.023
away+0.022
handed+0.021
ris+0.021
Neg logit contributions
 spaghetti-0.017
 poems-0.015
 sauce-0.015
 masks-0.014
 gum-0.014
Token a
Token ID257
Feature activation-2.61e-03

Pos logit contributions
 spaghetti+0.036
 poems+0.036
 sauce+0.035
 noodles+0.034
ccoli+0.034
Neg logit contributions
stru-0.031
 clim-0.028
away-0.027
 of-0.027
 year-0.026
Token with
Token ID351
Feature activation+2.29e-02

Pos logit contributions
ccoli+0.021
ems+0.021
 poems+0.021
 spaghetti+0.021
bat+0.021
Neg logit contributions
away-0.007
 clim-0.007
 of-0.007
 forward-0.006
stru-0.006
Token his
Token ID465
Feature activation+3.54e-03

Pos logit contributions
 clim+0.027
stru+0.027
 of+0.027
away+0.026
 ever+0.024
Neg logit contributions
 poems-0.043
 spaghetti-0.042
ems-0.042
bat-0.042
ccoli-0.042
Token ball
Token ID2613
Feature activation-1.01e-02

Pos logit contributions
 clim+0.005
stru+0.005
away+0.004
ester+0.004
 year+0.004
Neg logit contributions
ems-0.006
 spaghetti-0.006
 poems-0.006
bat-0.006
 sauce-0.006
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
 spaghetti+0.023
 poems+0.022
 sauce+0.022
 noodles+0.021
ccoli+0.021
Neg logit contributions
stru-0.010
 clim-0.009
away-0.009
or-0.008
 of-0.008
Token The
Token ID383
Feature activation-2.47e-03

Pos logit contributions
 poems+0.028
 spaghetti+0.028
 sauce+0.027
 masks+0.027
 noodles+0.027
Neg logit contributions
stru-0.018
 clim-0.017
ris-0.016
ester-0.015
or-0.015
Token end
Token ID886
Feature activation-4.46e-02

Pos logit contributions
 spaghetti+0.005
 poems+0.005
bat+0.005
ems+0.005
 sauce+0.005
Neg logit contributions
stru-0.003
 clim-0.003
 year-0.002
away-0.002
 of-0.002
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
 spaghetti+0.096
 poems+0.093
 sauce+0.092
 noodles+0.090
 masks+0.088
Neg logit contributions
stru-0.052
 clim-0.046
away-0.044
ester-0.042
ris-0.042
Token<|endoftext|>
Token ID50256
Feature activation-2.52e-02

Pos logit contributions
 spaghetti+0.021
 poems+0.020
 sauce+0.019
 noodles+0.019
 gum+0.018
Neg logit contributions
stru-0.016
 clim-0.015
away-0.014
or-0.014
ester-0.014
TokenOnce
Token ID7454
Feature activation+1.35e-02

Pos logit contributions
 spaghetti+0.048
 poems+0.046
 sauce+0.045
 noodles+0.044
 masks+0.043
Neg logit contributions
stru-0.036
 clim-0.032
away-0.031
ester-0.030
or-0.030
Token upon
Token ID2402
Feature activation-2.21e-02

Pos logit contributions
stru+0.028
 clim+0.027
away+0.026
 reun+0.025
ris+0.025
Neg logit contributions
 spaghetti-0.016
 poems-0.014
 sauce-0.014
 masks-0.013
 Ben-0.013
Token a
Token ID257
Feature activation-5.85e-03

Pos logit contributions
 spaghetti+0.037
 poems+0.036
 sauce+0.036
 noodles+0.035
ccoli+0.035
Neg logit contributions
stru-0.034
 clim-0.031
away-0.029
 of-0.029
 year-0.028
Token songs
Token ID7259
Feature activation+1.67e-02

Pos logit contributions
ccoli+0.031
bat+0.031
 spaghetti+0.031
 elbows+0.030
ems+0.029
Neg logit contributions
stru-0.040
 of-0.040
away-0.038
 clim-0.037
 year-0.037
Token.
Token ID13
Feature activation+1.15e-03

Pos logit contributions
 of+0.012
 forward+0.010
 clim+0.010
stru+0.010
 ever+0.009
Neg logit contributions
 elbows-0.043
 spaghetti-0.042
 sauce-0.041
 mustard-0.040
 salsa-0.040
Token They
Token ID1119
Feature activation-1.28e-02

Pos logit contributions
stru+0.002
ris+0.002
 clim+0.002
 forward+0.002
away+0.002
Neg logit contributions
 poems-0.002
 spaghetti-0.002
 sauce-0.002
 noodles-0.002
 masks-0.002
Token had
Token ID550
Feature activation-5.38e-04

Pos logit contributions
 spaghetti+0.027
 gum+0.025
 sauce+0.025
 poems+0.025
 salsa+0.025
Neg logit contributions
stru-0.015
 ever-0.014
 clim-0.014
ris-0.013
 returned-0.013
Token a
Token ID257
Feature activation+2.66e-03

Pos logit contributions
ems+0.001
ccoli+0.001
aghetti+0.001
 salsa+0.001
bat+0.001
Neg logit contributions
 ever-0.001
stru-0.001
 able-0.000
 clim-0.000
 of-0.000
Token very
Token ID845
Feature activation-4.25e-02

Pos logit contributions
stru+0.004
 clim+0.003
or+0.003
away+0.003
ester+0.003
Neg logit contributions
 spaghetti-0.005
 poems-0.005
 sauce-0.005
 masks-0.005
 noodles-0.005
Token happy
Token ID3772
Feature activation+7.71e-03

Pos logit contributions
 spaghetti+0.073
 poems+0.071
 sauce+0.068
 noodles+0.066
 masks+0.065
Neg logit contributions
stru-0.070
 clim-0.063
ively-0.061
or-0.060
ester-0.060
Token time
Token ID640
Feature activation-2.28e-03

Pos logit contributions
stru+0.009
 clim+0.008
away+0.008
 year+0.007
 of+0.007
Neg logit contributions
 spaghetti-0.016
 poems-0.016
 sauce-0.016
 noodles-0.015
ccoli-0.015
Token together
Token ID1978
Feature activation-3.46e-03

Pos logit contributions
 spaghetti+0.005
 poems+0.005
 sauce+0.005
 noodles+0.005
 masks+0.005
Neg logit contributions
stru-0.002
 clim-0.002
 of-0.002
ris-0.002
ester-0.002
Token.
Token ID13
Feature activation-4.65e-03

Pos logit contributions
 spaghetti+0.008
 poems+0.008
 sauce+0.008
 noodles+0.007
 masks+0.007
Neg logit contributions
stru-0.003
 clim-0.003
away-0.003
or-0.003
ris-0.003
Token<|endoftext|>
Token ID50256
Feature activation-2.29e-02

Pos logit contributions
 spaghetti+0.009
 poems+0.009
 sauce+0.008
 noodles+0.008
 masks+0.008
Neg logit contributions
stru-0.006
 clim-0.006
ris-0.006
away-0.005
or-0.005
Token time
Token ID640
Feature activation-1.38e-02

Pos logit contributions
 spaghetti+0.010
 poems+0.010
 sauce+0.009
 noodles+0.009
 gum+0.009
Neg logit contributions
stru-0.010
 clim-0.009
away-0.008
or-0.008
ester-0.008
Token,
Token ID11
Feature activation+4.47e-03

Pos logit contributions
 spaghetti+0.027
 sauce+0.024
 gum+0.024
 poems+0.024
 masks+0.024
Neg logit contributions
stru-0.019
away-0.018
ester-0.017
ris-0.017
or-0.016
Token there
Token ID612
Feature activation+1.43e-02

Pos logit contributions
stru+0.006
 clim+0.005
away+0.005
ester+0.005
or+0.005
Neg logit contributions
 spaghetti-0.009
 poems-0.009
 sauce-0.009
 noodles-0.008
 masks-0.008
Token was
Token ID373
Feature activation-2.41e-05

Pos logit contributions
stru+0.023
away+0.021
ester+0.020
 clim+0.020
or+0.019
Neg logit contributions
 spaghetti-0.025
 sauce-0.024
 gum-0.023
 poems-0.022
 noodles-0.022
Token a
Token ID257
Feature activation-6.88e-03

Pos logit contributions
 poems+0.000
 spaghetti+0.000
ems+0.000
 sauce+0.000
ccoli+0.000
Neg logit contributions
stru-0.000
 clim-0.000
 of-0.000
away-0.000
ris-0.000
Token hungry
Token ID14720
Feature activation-4.97e-02

Pos logit contributions
 spaghetti+0.014
 poems+0.013
 sauce+0.013
 noodles+0.013
 gum+0.013
Neg logit contributions
stru-0.010
or-0.008
 clim-0.008
away-0.008
ively-0.008
Token little
Token ID1310
Feature activation+4.27e-03

Pos logit contributions
 elbows+0.090
 poems+0.087
 spaghetti+0.085
 masks+0.083
ems+0.082
Neg logit contributions
 clim-0.070
stru-0.064
 year-0.063
 of-0.063
away-0.062
Token girl
Token ID2576
Feature activation-2.60e-03

Pos logit contributions
 clim+0.006
stru+0.006
away+0.005
ester+0.005
 year+0.005
Neg logit contributions
 poems-0.008
 spaghetti-0.008
ems-0.007
 elbows-0.007
 masks-0.007
Token.
Token ID13
Feature activation+9.27e-04

Pos logit contributions
 spaghetti+0.005
 sauce+0.005
 poems+0.005
 dough+0.004
 yarn+0.004
Neg logit contributions
stru-0.004
 victorious-0.003
or-0.003
ively-0.003
away-0.003
Token Her
Token ID2332
Feature activation+1.89e-02

Pos logit contributions
stru+0.001
 clim+0.001
away+0.001
 of+0.001
ris+0.001
Neg logit contributions
 spaghetti-0.002
 poems-0.002
 sauce-0.002
 masks-0.002
 noodles-0.001
Token tum
Token ID11814
Feature activation-4.61e-02

Pos logit contributions
stru+0.031
 clim+0.029
ester+0.027
away+0.027
 forward+0.026
Neg logit contributions
 spaghetti-0.030
 poems-0.029
 sauce-0.028
 noodles-0.028
 masks-0.027