TOP ACTIVATIONS
MAX = 3.588

with lots of sand and palm trees . There was a
us ." The boss looked at Tom and Lily
and pige ons . P ige ons are birds that have
Yes , today is a good day because now we can
said yes . Re becca ran back outside and
" Hello , Mister P ige on !" The
finished , Mrs . P ige on said Jack had done
From then on , the naughty boy was more obedient and
big one for only two dollars . â Tina was
happy . The nos y metal robot realized that
. <|endoftext|> Tom was a smart boy who liked to wear
No ! I am the boss of the slide . You
, " This is a good lesson for us , Tommy
mean ." The mean kid feels angry . He
kind ." The mean boy laughs and says ,
" What is a price ?" Ben asked .
We have apple pie , cherry pie , pumpkin pie ,
the sun . Re becca felt excited and ran
. The nos y metal robot thought it
playing happily together with the pick . <|endoftext|> Once there was

MIN ACTIVATIONS
MIN = -3.863

would play tag , hop sc ot ch , and many
play a game of hop sc ot ch , but it
together . They played hop sc ot ch and picked more
and - seek and hop sc ot ch . Then ,
They started to play hop sc ot ch and had lots
knew how to play hop sc ot ch .
She played tag and hop sc ot ch and pretended to
love . They played hop sc ot ch , jumped rope
day , they play hop sc ot ch . They draw
hide and seek , hop sc ot ch and much more
group of girls playing hop sc ot ch . They joined
stars . He reached his little hand up to touch them
play a game of hop sc ot ch . Tim took
. You can play hop sc ot ch again . But
! Tim my cried a little bit , but then he
also played tag and hop sc ot ch . They laughed
it , a big gust of wind came by and blew
, with a sudden gust of wind , the singing rock
felt sad and cried a little bit .
Suddenly , a gust of strong wind blew and the

INTERVAL 1.725 - 3.588
CONTAINS 0.352%

Tom says . S ue rub s the butter
said , " I am not rude , I am telling
together . The silly refrigerator and the little girl
was glad that she was able to find swimming goggles that
Let 's use our grocery list to find what we need

INTERVAL -0.138 - 1.725
CONTAINS 58.913%

thanked her dad for being so kind . From then on
her eye with water . She put a band - aid
John was happy so he decided to show off by ped
made a face . She did not like the frog .
fence . Now he could explore the other side of the

INTERVAL -2.000 - -0.138
CONTAINS 40.679%

to him and said , " Hi , what 's your
that some children had very little and were not happy .
tasted even better than if Jimmy 's mom had cooked it
strange on the ground . It was a button ! She
adventurous three year old was given a letter . She was

INTERVAL -3.863 - -2.000
CONTAINS 0.056%

around and playing . Soon night time came and the bird
is that even when we feel tired , doing things we
secret she had written the night before was that she could
ear . The cat could feel the music through the string
a high - p itched noise coming from a nearby creek

TOKEN DETAILS

Token with
Token ID351
Feature activation+7.60e-01

Pos logit contributions
Charl+0.026
use+0.024
 forward+0.023
ifying+0.021
ppy+0.020
Neg logit contributions
 pige-0.034
 sandwiches-0.033
 cakes-0.032
 ducks-0.031
 balloons-0.031
Token lots
Token ID6041
Feature activation+1.05e+00

Pos logit contributions
 pige+0.026
 pie+0.023
 cakes+0.023
 sandwiches+0.023
mm+0.023
Neg logit contributions
 forward-0.024
ifying-0.023
Charl-0.023
ky-0.021
use-0.020
Token of
Token ID286
Feature activation+5.89e-01

Pos logit contributions
 pige+0.041
 sandwiches+0.040
 cakes+0.039
 ducks+0.037
 swings+0.036
Neg logit contributions
Charl-0.031
 forward-0.030
use-0.029
ifying-0.027
sc-0.025
Token sand
Token ID6450
Feature activation+9.26e-02

Pos logit contributions
 pige+0.014
 pie+0.014
 sandwiches+0.013
 cakes+0.013
 Sue+0.013
Neg logit contributions
 forward-0.022
Charl-0.021
ifying-0.021
use-0.021
sc-0.020
Token and
Token ID290
Feature activation+2.26e+00

Pos logit contributions
 pige+0.004
 sandwiches+0.004
 cakes+0.004
 ducks+0.004
 balloons+0.003
Neg logit contributions
Charl-0.002
use-0.002
 forward-0.002
ifying-0.002
ppy-0.002
Token palm
Token ID18057
Feature activation+3.59e+00

Pos logit contributions
 pige+0.059
 sandwiches+0.052
 pie+0.051
 cakes+0.049
mm+0.048
Neg logit contributions
Charl-0.081
 forward-0.081
ifying-0.079
sc-0.076
use-0.075
Token trees
Token ID7150
Feature activation+1.68e+00

Pos logit contributions
 pige+0.107
 cakes+0.103
 sandwiches+0.101
 pie+0.092
 Sue+0.091
Neg logit contributions
Charl-0.117
ifying-0.111
use-0.111
 forward-0.111
ky-0.108
Token.
Token ID13
Feature activation+6.29e-01

Pos logit contributions
 pige+0.078
 sandwiches+0.075
 cakes+0.073
 ducks+0.068
mm+0.068
Neg logit contributions
Charl-0.037
 forward-0.036
use-0.033
ifying-0.032
oping-0.030
Token There
Token ID1318
Feature activation+1.36e+00

Pos logit contributions
 pige+0.018
 sandwiches+0.017
 cakes+0.016
 balloons+0.015
 sandwich+0.014
Neg logit contributions
Charl-0.025
 forward-0.023
use-0.022
sc-0.022
 Until-0.022
Token was
Token ID373
Feature activation+1.01e+00

Pos logit contributions
 pige+0.058
 sandwiches+0.056
 cakes+0.053
 balloons+0.052
 swings+0.050
Neg logit contributions
 forward-0.036
Charl-0.035
use-0.034
oping-0.030
ley-0.029
Token a
Token ID257
Feature activation+9.85e-01

Pos logit contributions
 pige+0.035
 sandwiches+0.033
 cakes+0.032
 balloons+0.030
 pie+0.030
Neg logit contributions
Charl-0.034
 forward-0.034
ifying-0.031
use-0.030
sc-0.028
Token us
Token ID514
Feature activation+3.75e-01

Pos logit contributions
use+0.005
Charl+0.005
 forward+0.005
ppy+0.005
ky+0.004
Neg logit contributions
 sandwiches-0.006
 pige-0.006
 cakes-0.006
 balloons-0.006
 swings-0.005
Token."
Token ID526
Feature activation+2.62e-01

Pos logit contributions
 pige+0.012
 sandwiches+0.012
 cakes+0.011
 ducks+0.011
 balloons+0.011
Neg logit contributions
Charl-0.015
use-0.014
 forward-0.014
ifying-0.013
ppy-0.013
Token\n
Token ID198
Feature activation+6.04e-01

Pos logit contributions
 pige+0.010
 sandwiches+0.010
 cakes+0.009
 ducks+0.009
 balloons+0.009
Neg logit contributions
Charl-0.009
use-0.008
 forward-0.008
ifying-0.008
ppy-0.007
Token\n
Token ID198
Feature activation+2.21e-01

Pos logit contributions
 pige+0.024
 sandwiches+0.024
 cakes+0.023
 ducks+0.022
 balloons+0.022
Neg logit contributions
Charl-0.020
use-0.018
 forward-0.017
ifying-0.016
ppy-0.015
TokenThe
Token ID464
Feature activation+1.52e-01

Pos logit contributions
 pige+0.007
 sandwiches+0.007
 cakes+0.006
 ducks+0.006
 balloons+0.006
Neg logit contributions
Charl-0.009
use-0.009
 forward-0.008
ifying-0.008
ppy-0.008
Token boss
Token ID6478
Feature activation+3.40e+00

Pos logit contributions
 pige+0.005
 sandwiches+0.004
 ducks+0.004
 cakes+0.004
 balloons+0.004
Neg logit contributions
Charl-0.007
use-0.006
 forward-0.006
ifying-0.006
ppy-0.006
Token looked
Token ID3114
Feature activation+1.22e+00

Pos logit contributions
 sandwiches+0.172
 pige+0.171
 cakes+0.164
 ducks+0.158
 swings+0.157
Neg logit contributions
Charl-0.070
use-0.064
 forward-0.061
ifying-0.054
ppy-0.054
Token at
Token ID379
Feature activation+2.39e+00

Pos logit contributions
 pige+0.041
 sandwiches+0.040
 ducks+0.038
 balloons+0.036
 cakes+0.036
Neg logit contributions
Charl-0.051
use-0.046
ppy-0.042
acy-0.042
ifying-0.042
Token Tom
Token ID4186
Feature activation+5.49e-01

Pos logit contributions
 pige+0.063
 sandwiches+0.062
 cakes+0.058
 pie+0.052
 ducks+0.052
Neg logit contributions
Charl-0.104
 forward-0.099
use-0.098
ifying-0.092
ky-0.091
Token and
Token ID290
Feature activation-4.16e-01

Pos logit contributions
 pige+0.018
 sandwiches+0.018
 cakes+0.016
 ducks+0.016
 balloons+0.016
Neg logit contributions
Charl-0.023
use-0.021
ifying-0.020
issy-0.018
Ru-0.018
Token Lily
Token ID20037
Feature activation+3.04e-01

Pos logit contributions
Charl+0.019
use+0.018
 forward+0.018
ifying+0.017
ppy+0.017
Neg logit contributions
 pige-0.011
 sandwiches-0.011
 cakes-0.010
 ducks-0.009
 balloons-0.009
Token and
Token ID290
Feature activation+1.63e+00

Pos logit contributions
 pige+0.008
 sandwiches+0.008
 cakes+0.007
 ducks+0.007
 balloons+0.007
Neg logit contributions
Charl-0.006
use-0.006
 forward-0.006
ppy-0.005
ifying-0.005
Token pige
Token ID32372
Feature activation+2.33e+00

Pos logit contributions
 pige+0.039
 sandwiches+0.039
 cakes+0.034
 pie+0.033
mm+0.033
Neg logit contributions
use-0.067
Charl-0.066
 forward-0.064
sc-0.062
ifying-0.059
Tokenons
Token ID684
Feature activation+1.10e+00

Pos logit contributions
 sandwiches+0.078
 pige+0.075
 cakes+0.072
 balloons+0.070
 ducks+0.069
Neg logit contributions
Charl-0.087
use-0.082
 forward-0.074
ppy-0.073
issy-0.070
Token.
Token ID13
Feature activation+8.18e-01

Pos logit contributions
 pige+0.041
 sandwiches+0.041
 cakes+0.038
 ducks+0.037
 balloons+0.037
Neg logit contributions
Charl-0.040
use-0.037
 forward-0.035
ifying-0.033
ppy-0.032
Token P
Token ID350
Feature activation+1.26e+00

Pos logit contributions
 pige+0.037
 sandwiches+0.036
 pie+0.034
castle+0.033
 cakes+0.033
Neg logit contributions
Charl-0.020
 forward-0.019
use-0.017
sc-0.016
ifying-0.016
Tokenige
Token ID10045
Feature activation+3.24e+00

Pos logit contributions
 pige+0.050
 sandwiches+0.048
 ducks+0.045
 cakes+0.045
 balloons+0.045
Neg logit contributions
Charl-0.042
use-0.039
 forward-0.037
ppy-0.036
ifying-0.034
Tokenons
Token ID684
Feature activation+3.82e-01

Pos logit contributions
 sandwiches+0.119
 pige+0.118
 cakes+0.111
 ducks+0.106
 balloons+0.106
Neg logit contributions
Charl-0.106
use-0.102
ppy-0.094
 forward-0.093
issy-0.088
Token are
Token ID389
Feature activation+2.69e-01

Pos logit contributions
 sandwiches+0.012
 pige+0.012
 balloons+0.011
 ducks+0.011
 cakes+0.011
Neg logit contributions
Charl-0.017
use-0.015
 forward-0.014
ifying-0.014
uster-0.014
Token birds
Token ID10087
Feature activation-2.03e-01

Pos logit contributions
 pige+0.010
 sandwiches+0.009
 cakes+0.009
 balloons+0.009
 ducks+0.009
Neg logit contributions
Charl-0.008
 forward-0.008
use-0.007
ppy-0.007
ifying-0.007
Token that
Token ID326
Feature activation+6.99e-01

Pos logit contributions
Charl+0.007
use+0.006
 forward+0.006
ifying+0.006
ppy+0.005
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 cakes-0.007
 ducks-0.007
 balloons-0.007
Token have
Token ID423
Feature activation+1.87e+00

Pos logit contributions
 pige+0.024
 sandwiches+0.023
 ducks+0.023
 cakes+0.022
 balloons+0.022
Neg logit contributions
Charl-0.029
 forward-0.025
use-0.025
ifying-0.024
uster-0.023
TokenYes
Token ID5297
Feature activation+3.36e-01

Pos logit contributions
Charl+0.005
use+0.005
 forward+0.004
ppy+0.004
ifying+0.004
Neg logit contributions
 pige-0.005
 sandwiches-0.005
 cakes-0.005
 ducks-0.005
 balloons-0.005
Token,
Token ID11
Feature activation+2.24e-01

Pos logit contributions
 pige+0.015
 sandwiches+0.014
 ducks+0.013
 swings+0.013
 balloons+0.013
Neg logit contributions
 forward-0.009
Charl-0.009
use-0.009
ifying-0.008
ppy-0.008
Token today
Token ID1909
Feature activation-2.07e-01

Pos logit contributions
 pige+0.007
 sandwiches+0.007
 cakes+0.006
 ducks+0.006
 balloons+0.006
Neg logit contributions
Charl-0.009
 forward-0.008
use-0.008
ifying-0.008
ppy-0.008
Token is
Token ID318
Feature activation+1.22e+00

Pos logit contributions
use+0.006
 forward+0.006
Charl+0.006
ifying+0.005
sc+0.005
Neg logit contributions
 pige-0.008
 sandwiches-0.007
 ducks-0.007
 cakes-0.007
 swings-0.007
Token a
Token ID257
Feature activation+1.83e+00

Pos logit contributions
 pige+0.045
 sandwiches+0.039
 ducks+0.037
 balloons+0.036
 swings+0.035
Neg logit contributions
 forward-0.044
Charl-0.042
use-0.040
ifying-0.040
well-0.036
Token good
Token ID922
Feature activation+3.24e+00

Pos logit contributions
 pige+0.056
 sandwiches+0.055
 cakes+0.051
 ducks+0.050
 balloons+0.049
Neg logit contributions
Charl-0.076
use-0.072
 forward-0.069
ifying-0.066
ppy-0.065
Token day
Token ID1110
Feature activation-7.04e-01

Pos logit contributions
 pige+0.107
 sandwiches+0.100
 cakes+0.093
 balloons+0.092
 ducks+0.091
Neg logit contributions
Charl-0.120
 forward-0.118
use-0.117
well-0.100
sc-0.099
Token because
Token ID780
Feature activation+1.75e+00

Pos logit contributions
Charl+0.029
use+0.027
 forward+0.027
ifying+0.025
ppy+0.024
Neg logit contributions
 pige-0.022
 sandwiches-0.022
 cakes-0.020
 ducks-0.020
 balloons-0.019
Token now
Token ID783
Feature activation+5.84e-01

Pos logit contributions
 pige+0.060
 sandwiches+0.055
 cakes+0.050
 swings+0.049
 ducks+0.049
Neg logit contributions
 forward-0.062
Charl-0.060
use-0.058
ifying-0.057
well-0.053
Token we
Token ID356
Feature activation+5.59e-01

Pos logit contributions
 pige+0.021
 sandwiches+0.019
 ducks+0.017
 cakes+0.017
 balloons+0.016
Neg logit contributions
Charl-0.020
use-0.020
 forward-0.020
ifying-0.017
 Taylor-0.017
Token can
Token ID460
Feature activation+1.64e-02

Pos logit contributions
 sandwiches+0.021
 pige+0.021
 ducks+0.021
 cakes+0.020
 balloons+0.020
Neg logit contributions
Charl-0.019
use-0.016
 forward-0.015
Ru-0.015
och-0.014
Token said
Token ID531
Feature activation+5.48e-02

Pos logit contributions
 pige+0.004
 sandwiches+0.004
 cakes+0.004
 ducks+0.004
 balloons+0.004
Neg logit contributions
Charl-0.004
use-0.004
 forward-0.004
ifying-0.004
sc-0.003
Token yes
Token ID3763
Feature activation+1.62e-01

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 cakes+0.002
 ducks+0.002
 balloons+0.002
Neg logit contributions
Charl-0.002
use-0.002
 forward-0.002
ifying-0.002
ppy-0.002
Token.
Token ID13
Feature activation+9.90e-01

Pos logit contributions
 pige+0.006
 sandwiches+0.006
 cakes+0.005
 ducks+0.005
 balloons+0.005
Neg logit contributions
Charl-0.006
use-0.006
ifying-0.005
 forward-0.005
Ru-0.005
Token\n
Token ID198
Feature activation+1.06e+00

Pos logit contributions
 pige+0.036
 sandwiches+0.034
 ducks+0.032
 cakes+0.032
 balloons+0.032
Neg logit contributions
Charl-0.036
use-0.034
 forward-0.031
ppy-0.031
ifying-0.030
Token\n
Token ID198
Feature activation+6.31e-01

Pos logit contributions
 pige+0.035
 sandwiches+0.033
 ducks+0.031
 balloons+0.030
 swings+0.030
Neg logit contributions
Charl-0.040
use-0.040
ifying-0.037
 forward-0.037
ppy-0.036
TokenRe
Token ID3041
Feature activation+3.21e+00

Pos logit contributions
 pige+0.025
 sandwiches+0.024
 ducks+0.022
 cakes+0.022
 balloons+0.022
Neg logit contributions
Charl-0.021
use-0.020
 forward-0.018
ppy-0.018
ifying-0.018
Tokenbecca
Token ID20627
Feature activation+1.01e+00

Pos logit contributions
 sandwiches+0.107
 pige+0.105
 cakes+0.098
 ducks+0.097
 pie+0.094
Neg logit contributions
Charl-0.126
 forward-0.114
use-0.114
well-0.105
ky-0.104
Token ran
Token ID4966
Feature activation+4.90e-01

Pos logit contributions
 pige+0.036
 sandwiches+0.035
 cakes+0.033
 ducks+0.033
 balloons+0.032
Neg logit contributions
Charl-0.037
use-0.035
 forward-0.034
ifying-0.032
ppy-0.030
Token back
Token ID736
Feature activation+1.43e+00

Pos logit contributions
 sandwiches+0.017
 pige+0.017
 balloons+0.016
 cakes+0.016
 ice+0.016
Neg logit contributions
Charl-0.018
use-0.017
ifying-0.016
ppy-0.016
issy-0.015
Token outside
Token ID2354
Feature activation-8.08e-02

Pos logit contributions
 sandwiches+0.040
 pige+0.039
 balloons+0.038
 ducks+0.036
 ice+0.036
Neg logit contributions
Charl-0.063
use-0.059
ifying-0.057
ppy-0.057
issy-0.055
Token and
Token ID290
Feature activation-2.75e-02

Pos logit contributions
Charl+0.003
use+0.003
ifying+0.003
ppy+0.003
 forward+0.003
Neg logit contributions
 sandwiches-0.003
 pige-0.003
 cakes-0.003
 balloons-0.003
 ducks-0.003
Token "
Token ID366
Feature activation-6.65e-01

Pos logit contributions
Charl+0.019
use+0.018
ifying+0.017
 forward+0.017
ppy+0.017
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 cakes-0.007
 ducks-0.007
 balloons-0.007
TokenHello
Token ID15496
Feature activation-1.31e-01

Pos logit contributions
Charl+0.021
 forward+0.020
use+0.020
ppy+0.018
ifying+0.018
Neg logit contributions
 pige-0.027
 ducks-0.025
 sandwiches-0.024
 balloons-0.024
 cakes-0.024
Token,
Token ID11
Feature activation+5.36e-01

Pos logit contributions
Charl+0.003
use+0.003
 forward+0.003
ifying+0.003
Ru+0.002
Neg logit contributions
 sandwiches-0.005
 pies-0.005
 cakes-0.005
 swings-0.005
 structures-0.005
Token Mister
Token ID43438
Feature activation+1.24e+00

Pos logit contributions
 sandwiches+0.019
 cakes+0.017
 pies+0.016
 pige+0.016
 flavors+0.016
Neg logit contributions
Charl-0.020
 forward-0.018
use-0.018
ifying-0.017
och-0.016
Token P
Token ID350
Feature activation+8.65e-01

Pos logit contributions
 sandwiches+0.050
 cakes+0.047
 pige+0.045
 swings+0.044
 pies+0.044
Neg logit contributions
Charl-0.039
use-0.037
ky-0.033
ley-0.032
ifying-0.032
Tokenige
Token ID10045
Feature activation+3.19e+00

Pos logit contributions
 pige+0.025
 sandwiches+0.025
 cakes+0.023
 ducks+0.023
 balloons+0.023
Neg logit contributions
Charl-0.037
use-0.036
 forward-0.033
ppy-0.033
issy-0.032
Tokenon
Token ID261
Feature activation+2.99e-01

Pos logit contributions
 sandwiches+0.127
 pige+0.118
 cakes+0.115
 ducks+0.112
 swings+0.111
Neg logit contributions
Charl-0.093
use-0.091
 forward-0.086
ppy-0.084
och-0.081
Token!"
Token ID2474
Feature activation-1.83e-01

Pos logit contributions
 pige+0.011
 sandwiches+0.010
 cakes+0.010
 ducks+0.009
 balloons+0.009
Neg logit contributions
Charl-0.011
use-0.010
 forward-0.010
ifying-0.009
ppy-0.009
Token\n
Token ID198
Feature activation-1.63e-01

Pos logit contributions
Charl+0.008
use+0.008
 forward+0.007
ifying+0.007
ppy+0.007
Neg logit contributions
 pige-0.005
 sandwiches-0.005
 cakes-0.005
 ducks-0.005
 balloons-0.005
Token\n
Token ID198
Feature activation-5.33e-01

Pos logit contributions
Charl+0.006
use+0.006
 forward+0.006
ifying+0.006
ppy+0.005
Neg logit contributions
 pige-0.005
 sandwiches-0.005
 ducks-0.005
 cakes-0.005
 balloons-0.005
TokenThe
Token ID464
Feature activation-1.25e-01

Pos logit contributions
use+0.021
Charl+0.021
 forward+0.020
ifying+0.019
ley+0.019
Neg logit contributions
 pige-0.017
 ducks-0.016
 sandwiches-0.015
 cakes-0.015
 balloons-0.015
Token finished
Token ID5201
Feature activation-1.23e+00

Pos logit contributions
Charl+0.012
use+0.011
 forward+0.010
ifying+0.010
Ru+0.009
Neg logit contributions
 sandwiches-0.010
 pige-0.010
 cakes-0.009
 ducks-0.009
 balloons-0.009
Token,
Token ID11
Feature activation+5.09e-01

Pos logit contributions
Charl+0.045
use+0.043
 forward+0.042
ifying+0.039
ppy+0.038
Neg logit contributions
 pige-0.044
 sandwiches-0.042
 cakes-0.039
 ducks-0.039
 balloons-0.038
Token Mrs
Token ID9074
Feature activation+1.41e+00

Pos logit contributions
 pige+0.013
 sandwiches+0.013
 cakes+0.012
 ducks+0.011
 balloons+0.011
Neg logit contributions
Charl-0.024
use-0.023
 forward-0.022
ifying-0.021
ppy-0.021
Token.
Token ID13
Feature activation+2.06e+00

Pos logit contributions
 pige+0.058
 sandwiches+0.057
 cakes+0.054
 ducks+0.053
 balloons+0.053
Neg logit contributions
Charl-0.044
use-0.041
 forward-0.039
ppy-0.037
ifying-0.036
Token P
Token ID350
Feature activation+1.78e+00

Pos logit contributions
 pige+0.069
 sandwiches+0.065
 cakes+0.062
 ducks+0.061
 Sue+0.061
Neg logit contributions
Charl-0.075
use-0.072
 forward-0.069
sc-0.068
ppy-0.066
Tokenige
Token ID10045
Feature activation+3.18e+00

Pos logit contributions
 pige+0.040
 sandwiches+0.039
 cakes+0.035
 ducks+0.033
 balloons+0.032
Neg logit contributions
Charl-0.091
use-0.085
 forward-0.083
ifying-0.080
ppy-0.078
Tokenon
Token ID261
Feature activation+1.99e-01

Pos logit contributions
 pige+0.122
 sandwiches+0.117
 cakes+0.110
 ducks+0.109
 balloons+0.106
Neg logit contributions
Charl-0.105
use-0.098
ppy-0.095
 forward-0.094
issy-0.089
Token said
Token ID531
Feature activation-2.88e-01

Pos logit contributions
 pige+0.008
 sandwiches+0.007
 cakes+0.007
 ducks+0.007
 balloons+0.007
Neg logit contributions
Charl-0.007
use-0.006
 forward-0.006
ppy-0.006
ifying-0.006
Token Jack
Token ID3619
Feature activation-6.23e-01

Pos logit contributions
Charl+0.012
 forward+0.012
use+0.011
ifying+0.011
ppy+0.010
Neg logit contributions
 pige-0.009
 sandwiches-0.008
 cakes-0.008
 ducks-0.008
 swings-0.007
Token had
Token ID550
Feature activation+3.57e-01

Pos logit contributions
Charl+0.024
use+0.023
 forward+0.022
ifying+0.021
ppy+0.020
Neg logit contributions
 sandwiches-0.021
 pige-0.021
 cakes-0.019
 ducks-0.019
 balloons-0.019
Token done
Token ID1760
Feature activation+2.05e-01

Pos logit contributions
 pige+0.016
 sandwiches+0.014
 cakes+0.014
 ducks+0.014
 swings+0.013
Neg logit contributions
Charl-0.010
use-0.010
 forward-0.009
ifying-0.008
ppy-0.008
Token From
Token ID3574
Feature activation-8.36e-01

Pos logit contributions
 pige+0.018
 sandwiches+0.017
 cakes+0.017
 ducks+0.016
 balloons+0.016
Neg logit contributions
Charl-0.012
use-0.012
 forward-0.011
ifying-0.010
ppy-0.010
Token then
Token ID788
Feature activation-2.56e-02

Pos logit contributions
Charl+0.035
use+0.033
 forward+0.032
ifying+0.030
ppy+0.029
Neg logit contributions
 pige-0.026
 sandwiches-0.026
 cakes-0.024
 ducks-0.024
 balloons-0.023
Token on
Token ID319
Feature activation+9.55e-01

Pos logit contributions
Charl+0.001
use+0.001
ppy+0.001
ifying+0.001
ky+0.001
Neg logit contributions
 sandwiches-0.001
 pige-0.001
 balloons-0.001
 ice-0.001
 cakes-0.000
Token,
Token ID11
Feature activation+9.04e-01

Pos logit contributions
 sandwiches+0.031
 pige+0.030
 cakes+0.028
 balloons+0.027
 ducks+0.027
Neg logit contributions
Charl-0.039
use-0.037
ifying-0.034
 forward-0.034
ppy-0.033
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
 pige+0.024
 sandwiches+0.023
 cakes+0.021
 ducks+0.021
 balloons+0.020
Neg logit contributions
Charl-0.043
use-0.041
 forward-0.038
ifying-0.038
ppy-0.038
Token naughty
Token ID45128
Feature activation+3.17e+00

Pos logit contributions
 pige+0.035
 sandwiches+0.035
 cakes+0.034
 Sue+0.030
 balloons+0.029
Neg logit contributions
 forward-0.040
Charl-0.039
use-0.038
 que-0.035
ifying-0.035
Token boy
Token ID2933
Feature activation-2.17e-01

Pos logit contributions
 sandwiches+0.101
 cakes+0.098
 pige+0.097
 pie+0.086
 balloons+0.085
Neg logit contributions
use-0.111
 forward-0.107
Charl-0.106
ley-0.102
ian-0.098
Token was
Token ID373
Feature activation+4.62e-01

Pos logit contributions
Charl+0.009
use+0.008
ifying+0.008
arl+0.008
 forward+0.007
Neg logit contributions
 pige-0.007
 ducks-0.007
 sandwiches-0.007
 ice-0.006
 swings-0.006
Token more
Token ID517
Feature activation+1.29e+00

Pos logit contributions
 pige+0.018
 sandwiches+0.018
 cakes+0.017
 ducks+0.016
 balloons+0.016
Neg logit contributions
Charl-0.016
use-0.015
 forward-0.014
ifying-0.013
ppy-0.013
Token obedient
Token ID41652
Feature activation+1.41e+00

Pos logit contributions
 pige+0.036
 ducks+0.034
 sandwiches+0.032
 ice+0.031
 balloons+0.030
Neg logit contributions
use-0.059
ifying-0.057
Charl-0.056
sc-0.052
issy-0.052
Token and
Token ID290
Feature activation+2.72e-01

Pos logit contributions
 pige+0.053
 sandwiches+0.052
 ducks+0.048
 cakes+0.048
 balloons+0.048
Neg logit contributions
Charl-0.052
use-0.048
ifying-0.044
 forward-0.043
ppy-0.042
Token big
Token ID1263
Feature activation+1.73e+00

Pos logit contributions
 pige+0.051
 sandwiches+0.050
 cakes+0.047
 ducks+0.046
 balloons+0.045
Neg logit contributions
Charl-0.071
use-0.066
 forward-0.064
ifying-0.061
ppy-0.060
Token one
Token ID530
Feature activation+1.20e+00

Pos logit contributions
 pige+0.048
 sandwiches+0.047
 cakes+0.043
 ducks+0.041
 balloons+0.041
Neg logit contributions
Charl-0.074
use-0.073
 forward-0.070
ifying-0.064
well-0.063
Token for
Token ID329
Feature activation+1.45e+00

Pos logit contributions
 pige+0.040
 sandwiches+0.040
 cakes+0.037
 ducks+0.036
 balloons+0.036
Neg logit contributions
Charl-0.046
use-0.043
 forward-0.042
ifying-0.040
ppy-0.039
Token only
Token ID691
Feature activation+1.56e+00

Pos logit contributions
 pige+0.046
 sandwiches+0.044
 cakes+0.041
 ducks+0.040
 pie+0.039
Neg logit contributions
Charl-0.056
 forward-0.052
use-0.052
ifying-0.048
ppy-0.047
Token two
Token ID734
Feature activation+1.42e+00

Pos logit contributions
 pige+0.054
 sandwiches+0.053
 cakes+0.048
 ducks+0.047
 balloons+0.046
Neg logit contributions
use-0.053
 forward-0.051
Charl-0.050
ifying-0.047
sc-0.045
Token dollars
Token ID5054
Feature activation+3.17e+00

Pos logit contributions
 sandwiches+0.035
 pige+0.035
 cakes+0.031
 ducks+0.029
 swings+0.029
Neg logit contributions
use-0.060
 forward-0.060
Charl-0.059
ifying-0.055
well-0.053
Token.
Token ID13
Feature activation+6.85e-01

Pos logit contributions
 pige+0.102
 sandwiches+0.100
 cakes+0.093
 ducks+0.091
 balloons+0.090
Neg logit contributions
Charl-0.131
use-0.121
 forward-0.116
ifying-0.111
ppy-0.110
Tokenâ
Token ID22940
Feature activation+1.18e+00

Pos logit contributions
 pige+0.025
 sandwiches+0.024
 cakes+0.023
 ducks+0.022
 balloons+0.022
Neg logit contributions
Charl-0.025
use-0.023
 forward-0.022
ppy-0.021
ifying-0.021
Token
Token ID26391
Feature activation+2.24e+00

Pos logit contributions
 pige+0.042
 sandwiches+0.041
 cakes+0.039
 ducks+0.038
 balloons+0.038
Neg logit contributions
Charl-0.044
use-0.041
 forward-0.039
ifying-0.037
ppy-0.036
Token Tina
Token ID35903
Feature activation+4.15e-01

Pos logit contributions
 sandwiches+0.090
 pige+0.089
 cakes+0.084
 ducks+0.082
 pie+0.081
Neg logit contributions
Charl-0.075
use-0.068
 forward-0.066
oping-0.060
ifying-0.059
Token was
Token ID373
Feature activation+7.96e-01

Pos logit contributions
 pige+0.015
 sandwiches+0.015
 cakes+0.014
 ducks+0.014
 balloons+0.013
Neg logit contributions
Charl-0.015
use-0.014
 forward-0.014
ifying-0.013
ppy-0.012
Token happy
Token ID3772
Feature activation-1.62e-01

Pos logit contributions
Charl+0.006
 forward+0.006
use+0.005
ifying+0.005
well+0.005
Neg logit contributions
 pige-0.006
 sandwiches-0.006
 cakes-0.005
 ducks-0.005
 balloons-0.005
Token.
Token ID13
Feature activation+6.22e-01

Pos logit contributions
Charl+0.005
 forward+0.005
use+0.005
ifying+0.004
well+0.004
Neg logit contributions
 pige-0.006
 sandwiches-0.006
 ducks-0.006
 cakes-0.006
 balloons-0.006
Token\n
Token ID198
Feature activation+5.18e-01

Pos logit contributions
 pige+0.022
 sandwiches+0.022
 cakes+0.021
 ducks+0.020
 balloons+0.020
Neg logit contributions
Charl-0.023
use-0.021
 forward-0.020
ifying-0.019
ppy-0.019
Token\n
Token ID198
Feature activation+1.29e-01

Pos logit contributions
 pige+0.020
 sandwiches+0.019
 cakes+0.018
 ducks+0.018
 balloons+0.018
Neg logit contributions
Charl-0.018
use-0.016
 forward-0.016
ifying-0.015
ppy-0.015
TokenThe
Token ID464
Feature activation+5.88e-01

Pos logit contributions
 pige+0.005
 sandwiches+0.005
 ducks+0.005
 cakes+0.005
 balloons+0.005
Neg logit contributions
Charl-0.004
use-0.004
 forward-0.004
ifying-0.004
ppy-0.003
Token nos
Token ID43630
Feature activation+3.16e+00

Pos logit contributions
 pige+0.017
 sandwiches+0.016
 cakes+0.015
 ducks+0.014
 balloons+0.014
Neg logit contributions
Charl-0.023
 forward-0.023
use-0.022
ifying-0.020
acy-0.020
Tokeny
Token ID88
Feature activation+1.85e+00

Pos logit contributions
 sandwiches+0.177
 pige+0.173
 cakes+0.169
 swings+0.163
 balloons+0.162
Neg logit contributions
Charl-0.049
use-0.042
 forward-0.041
ifying-0.035
ppy-0.034
Token metal
Token ID6147
Feature activation+9.87e-01

Pos logit contributions
 pige+0.050
 sandwiches+0.049
 cakes+0.044
 ducks+0.043
 balloons+0.043
Neg logit contributions
Charl-0.085
use-0.080
 forward-0.078
ifying-0.074
ppy-0.073
Token robot
Token ID9379
Feature activation-2.75e-01

Pos logit contributions
 pige+0.031
 sandwiches+0.030
 cakes+0.028
 ducks+0.027
 balloons+0.027
Neg logit contributions
Charl-0.040
use-0.038
 forward-0.037
ifying-0.035
ppy-0.034
Token realized
Token ID6939
Feature activation-2.42e-02

Pos logit contributions
Charl+0.010
use+0.009
 forward+0.009
ifying+0.008
ppy+0.008
Neg logit contributions
 pige-0.010
 sandwiches-0.010
 cakes-0.009
 ducks-0.009
 balloons-0.009
Token that
Token ID326
Feature activation+3.13e-01

Pos logit contributions
Charl+0.001
use+0.001
 forward+0.001
ifying+0.001
ppy+0.001
Neg logit contributions
 pige-0.001
 sandwiches-0.001
 cakes-0.001
 ducks-0.001
 balloons-0.001
Token.
Token ID13
Feature activation-8.28e-01

Pos logit contributions
Charl+0.015
use+0.014
 forward+0.013
ifying+0.012
ppy+0.012
Neg logit contributions
 pige-0.018
 sandwiches-0.018
 cakes-0.017
 ducks-0.017
 balloons-0.017
Token<|endoftext|>
Token ID50256
Feature activation+2.48e-01

Pos logit contributions
Charl+0.028
use+0.026
 forward+0.025
ifying+0.024
ppy+0.023
Neg logit contributions
 pige-0.032
 sandwiches-0.032
 cakes-0.030
 balloons-0.029
 ducks-0.029
TokenTom
Token ID13787
Feature activation+6.93e-01

Pos logit contributions
 pige+0.011
 sandwiches+0.011
 cakes+0.011
 ducks+0.010
 balloons+0.010
Neg logit contributions
Charl-0.007
use-0.006
 forward-0.006
ifying-0.005
ppy-0.005
Token was
Token ID373
Feature activation+8.67e-01

Pos logit contributions
 sandwiches+0.023
 pige+0.022
 ducks+0.022
 ice+0.022
 cakes+0.022
Neg logit contributions
Charl-0.026
 forward-0.025
ifying-0.024
use-0.023
sc-0.023
Token a
Token ID257
Feature activation+2.35e+00

Pos logit contributions
 sandwiches+0.028
 pige+0.028
 cakes+0.026
 ducks+0.026
 balloons+0.025
Neg logit contributions
Charl-0.035
use-0.034
 forward-0.030
ifying-0.030
ppy-0.030
Token smart
Token ID4451
Feature activation+3.15e+00

Pos logit contributions
 pige+0.075
 sandwiches+0.071
 ducks+0.068
 cakes+0.067
 balloons+0.066
Neg logit contributions
Charl-0.096
use-0.092
 forward-0.086
ifying-0.083
ppy-0.082
Token boy
Token ID2933
Feature activation+3.50e-01

Pos logit contributions
 pige+0.084
 sandwiches+0.084
 cakes+0.076
 ducks+0.072
 balloons+0.072
Neg logit contributions
Charl-0.144
use-0.135
 forward-0.132
ifying-0.124
ppy-0.124
Token who
Token ID508
Feature activation-8.18e-02

Pos logit contributions
 pige+0.013
 sandwiches+0.013
 cakes+0.012
 ducks+0.012
 balloons+0.012
Neg logit contributions
Charl-0.012
use-0.011
 forward-0.011
ifying-0.010
ppy-0.010
Token liked
Token ID8288
Feature activation+3.39e-01

Pos logit contributions
Charl+0.003
use+0.003
 forward+0.003
ifying+0.003
ppy+0.003
Neg logit contributions
 pige-0.003
 sandwiches-0.003
 cakes-0.003
 ducks-0.003
 swings-0.002
Token to
Token ID284
Feature activation+1.34e+00

Pos logit contributions
 pige+0.014
 sandwiches+0.014
 cakes+0.013
 ducks+0.013
 swings+0.012
Neg logit contributions
Charl-0.010
use-0.009
 forward-0.009
ifying-0.008
Ru-0.007
Token wear
Token ID5806
Feature activation-9.44e-01

Pos logit contributions
 sandwiches+0.047
 pige+0.047
 cakes+0.046
 balloons+0.044
 swings+0.044
Neg logit contributions
Charl-0.051
 forward-0.046
use-0.046
ifying-0.045
ppy-0.041
TokenNo
Token ID2949
Feature activation+1.68e+00

Pos logit contributions
Charl+0.003
use+0.002
 forward+0.002
ifying+0.002
ppy+0.002
Neg logit contributions
 pige-0.003
 sandwiches-0.003
 cakes-0.003
 ducks-0.003
 balloons-0.003
Token!
Token ID0
Feature activation-3.24e-02

Pos logit contributions
 pige+0.083
 sandwiches+0.076
 ducks+0.072
 balloons+0.072
 cakes+0.072
Neg logit contributions
Charl-0.036
use-0.035
 forward-0.033
 Taylor-0.028
ppy-0.027
Token I
Token ID314
Feature activation-1.43e-01

Pos logit contributions
Charl+0.001
 forward+0.001
use+0.001
ifying+0.001
ppy+0.001
Neg logit contributions
 sandwiches-0.001
 pige-0.001
 cakes-0.001
 balloons-0.001
 flavors-0.001
Token am
Token ID716
Feature activation+3.25e-01

Pos logit contributions
Charl+0.005
 forward+0.004
ifying+0.004
ifully+0.004
ky+0.004
Neg logit contributions
 pige-0.005
 ducks-0.005
 sandwiches-0.005
 ice-0.005
 Sue-0.005
Token the
Token ID262
Feature activation+1.68e+00

Pos logit contributions
 pige+0.012
 sandwiches+0.012
 cakes+0.011
 ducks+0.010
 balloons+0.010
Neg logit contributions
Charl-0.012
 forward-0.011
use-0.010
ppy-0.010
ifying-0.009
Token boss
Token ID6478
Feature activation+3.14e+00

Pos logit contributions
 sandwiches+0.047
 pige+0.045
 cakes+0.042
 balloons+0.039
 ducks+0.036
Neg logit contributions
 forward-0.069
Charl-0.067
use-0.065
ifying-0.060
Ru-0.060
Token of
Token ID286
Feature activation+1.34e+00

Pos logit contributions
 sandwiches+0.134
 pige+0.131
 cakes+0.126
 swings+0.122
 pie+0.117
Neg logit contributions
 forward-0.082
Charl-0.081
use-0.076
ppy-0.073
ifying-0.071
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
 sandwiches+0.034
 pige+0.031
 cakes+0.029
 flavors+0.026
 sandwich+0.026
Neg logit contributions
 forward-0.057
use-0.054
Charl-0.052
arl-0.051
ifying-0.048
Token slide
Token ID10649
Feature activation+8.46e-01

Pos logit contributions
 sandwiches+0.025
 pige+0.023
 cakes+0.020
 balloons+0.019
 ducks+0.018
Neg logit contributions
Charl-0.045
 forward-0.045
use-0.044
ifying-0.040
well-0.040
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
 pige+0.028
 sandwiches+0.028
 cakes+0.026
 ducks+0.026
 balloons+0.026
Neg logit contributions
Charl-0.034
use-0.031
 forward-0.029
ifying-0.028
Ru-0.027
Token You
Token ID921
Feature activation+1.28e-01

Pos logit contributions
Charl+0.009
 forward+0.009
use+0.009
ifying+0.008
ppy+0.008
Neg logit contributions
 sandwiches-0.009
 pige-0.009
 cakes-0.008
 balloons-0.008
 ducks-0.008
Token,
Token ID11
Feature activation-1.75e-01

Pos logit contributions
 pige+0.013
 sandwiches+0.013
 cakes+0.012
 ducks+0.012
 balloons+0.012
Neg logit contributions
Charl-0.016
use-0.015
 forward-0.015
ifying-0.014
ppy-0.014
Token "
Token ID366
Feature activation-2.30e-02

Pos logit contributions
Charl+0.007
use+0.006
 forward+0.006
ifying+0.006
ppy+0.006
Neg logit contributions
 pige-0.006
 sandwiches-0.006
 cakes-0.006
 ducks-0.006
 balloons-0.006
TokenThis
Token ID1212
Feature activation+1.58e+00

Pos logit contributions
Charl+0.001
use+0.001
 forward+0.001
ppy+0.001
ifying+0.001
Neg logit contributions
 pige-0.001
 sandwiches-0.001
 ducks-0.001
 cakes-0.001
 balloons-0.001
Token is
Token ID318
Feature activation+5.00e-01

Pos logit contributions
 pige+0.048
 sandwiches+0.046
 cakes+0.041
 ducks+0.040
 balloons+0.040
Neg logit contributions
use-0.058
Charl-0.058
 forward-0.057
well-0.052
 particularly-0.051
Token a
Token ID257
Feature activation+1.72e+00

Pos logit contributions
 pige+0.015
 sandwiches+0.014
 cakes+0.013
 ducks+0.013
 balloons+0.013
Neg logit contributions
Charl-0.022
use-0.021
 forward-0.020
ifying-0.019
ppy-0.019
Token good
Token ID922
Feature activation+3.13e+00

Pos logit contributions
 pige+0.056
 sandwiches+0.055
 cakes+0.051
 ducks+0.050
 balloons+0.049
Neg logit contributions
Charl-0.069
use-0.065
 forward-0.062
ifying-0.059
ppy-0.059
Token lesson
Token ID11483
Feature activation+1.57e+00

Pos logit contributions
 pige+0.084
 sandwiches+0.076
 cakes+0.069
 balloons+0.068
 lions+0.067
Neg logit contributions
 forward-0.132
use-0.130
Charl-0.129
well-0.120
sc-0.115
Token for
Token ID329
Feature activation+2.40e-01

Pos logit contributions
 pige+0.058
 sandwiches+0.057
 cakes+0.053
 ducks+0.053
 balloons+0.051
Neg logit contributions
Charl-0.055
use-0.052
 forward-0.051
ifying-0.046
ppy-0.046
Token us
Token ID514
Feature activation+5.25e-01

Pos logit contributions
 pige+0.007
 sandwiches+0.007
 cakes+0.006
 ducks+0.006
 balloons+0.006
Neg logit contributions
Charl-0.010
use-0.010
 forward-0.010
ifying-0.009
ppy-0.009
Token,
Token ID11
Feature activation+1.20e+00

Pos logit contributions
 pige+0.017
 sandwiches+0.017
 cakes+0.016
 ducks+0.016
 balloons+0.016
Neg logit contributions
Charl-0.021
use-0.019
 forward-0.018
Ru-0.018
ifying-0.018
Token Tommy
Token ID19919
Feature activation+7.00e-01

Pos logit contributions
 pige+0.039
 sandwiches+0.039
 cakes+0.036
 ducks+0.035
 balloons+0.035
Neg logit contributions
Charl-0.048
use-0.045
 forward-0.043
ifying-0.041
ppy-0.040
Token mean
Token ID1612
Feature activation+1.10e+00

Pos logit contributions
 pige+0.001
 sandwiches+0.001
 cakes+0.001
 ducks+0.001
 balloons+0.001
Neg logit contributions
Charl-0.002
use-0.002
 forward-0.001
ifying-0.001
ppy-0.001
Token."
Token ID526
Feature activation-2.96e-01

Pos logit contributions
 pige+0.039
 sandwiches+0.038
 cakes+0.036
 ducks+0.035
 balloons+0.034
Neg logit contributions
Charl-0.041
use-0.038
 forward-0.037
ifying-0.035
ppy-0.034
Token
Token ID198
Feature activation+3.28e-01

Pos logit contributions
Charl+0.011
use+0.010
 forward+0.010
ifying+0.009
ppy+0.009
Neg logit contributions
 pige-0.011
 sandwiches-0.011
 cakes-0.010
 ducks-0.010
 balloons-0.010
Token
Token ID198
Feature activation-2.05e-01

Pos logit contributions
 pige+0.014
 sandwiches+0.014
 cakes+0.013
 balloons+0.013
 ducks+0.012
Neg logit contributions
Charl-0.010
use-0.009
 forward-0.009
ifying-0.008
Ru-0.008
TokenThe
Token ID464
Feature activation+7.04e-01

Pos logit contributions
Charl+0.007
use+0.007
 forward+0.007
ifying+0.006
ppy+0.006
Neg logit contributions
 pige-0.008
 sandwiches-0.007
 cakes-0.007
 ducks-0.007
 balloons-0.007
Token mean
Token ID1612
Feature activation+3.12e+00

Pos logit contributions
 pige+0.025
 sandwiches+0.024
 cakes+0.023
 balloons+0.021
 ducks+0.021
Neg logit contributions
Charl-0.024
 forward-0.024
use-0.023
Ru-0.021
ifying-0.020
Token kid
Token ID5141
Feature activation+9.31e-01

Pos logit contributions
 pige+0.085
 sandwiches+0.082
 cakes+0.077
 pie+0.070
 balloons+0.068
Neg logit contributions
 forward-0.127
use-0.120
Charl-0.117
ppy-0.110
acy-0.108
Token feels
Token ID5300
Feature activation-6.48e-01

Pos logit contributions
 pige+0.032
 sandwiches+0.031
 cakes+0.029
 ducks+0.028
 balloons+0.028
Neg logit contributions
Charl-0.036
use-0.034
 forward-0.032
ifying-0.031
ppy-0.030
Token angry
Token ID7954
Feature activation+1.91e-01

Pos logit contributions
Charl+0.028
use+0.027
ppy+0.025
ifying+0.024
issy+0.024
Neg logit contributions
 pige-0.020
 sandwiches-0.020
 ducks-0.019
 balloons-0.018
 swings-0.017
Token.
Token ID13
Feature activation-4.18e-01

Pos logit contributions
 pige+0.007
 sandwiches+0.007
 cakes+0.007
 ducks+0.007
 balloons+0.007
Neg logit contributions
Charl-0.007
use-0.006
 forward-0.006
ifying-0.005
ppy-0.005
Token He
Token ID679
Feature activation+2.26e-01

Pos logit contributions
Charl+0.014
use+0.012
 forward+0.012
ifying+0.011
ppy+0.011
Neg logit contributions
 pige-0.017
 sandwiches-0.017
 cakes-0.016
 ducks-0.015
 balloons-0.015
Token kind
Token ID1611
Feature activation+1.08e+00

Pos logit contributions
Charl+0.009
use+0.008
 forward+0.008
ifying+0.007
ley+0.007
Neg logit contributions
 pige-0.007
 sandwiches-0.007
 cakes-0.006
 ducks-0.006
 ice-0.006
Token."
Token ID526
Feature activation+6.16e-01

Pos logit contributions
 pige+0.037
 sandwiches+0.036
 ducks+0.034
 cakes+0.033
 balloons+0.033
Neg logit contributions
Charl-0.044
use-0.039
 forward-0.037
ifying-0.037
Ru-0.036
Token
Token ID198
Feature activation+7.88e-01

Pos logit contributions
 pige+0.019
 sandwiches+0.019
 Sue+0.017
 ducks+0.017
 cakes+0.017
Neg logit contributions
Charl-0.026
use-0.024
ifying-0.024
 forward-0.024
ppy-0.023
Token
Token ID198
Feature activation+6.74e-01

Pos logit contributions
 pige+0.030
 sandwiches+0.029
 cakes+0.027
 ducks+0.027
 balloons+0.026
Neg logit contributions
Charl-0.028
use-0.026
 forward-0.025
ifying-0.023
ppy-0.023
TokenThe
Token ID464
Feature activation+6.68e-01

Pos logit contributions
 pige+0.025
 sandwiches+0.024
 ducks+0.023
 cakes+0.023
 swings+0.022
Neg logit contributions
Charl-0.024
use-0.023
 forward-0.022
ifying-0.021
ppy-0.020
Token mean
Token ID1612
Feature activation+3.10e+00

Pos logit contributions
 pige+0.026
 sandwiches+0.025
 cakes+0.024
 ducks+0.023
 balloons+0.023
Neg logit contributions
Charl-0.022
use-0.020
 forward-0.020
ifying-0.018
Ru-0.017
Token boy
Token ID2933
Feature activation-2.62e-01

Pos logit contributions
 pige+0.085
 sandwiches+0.084
 cakes+0.077
 ducks+0.071
 balloons+0.070
Neg logit contributions
 forward-0.124
use-0.124
Charl-0.121
ppy-0.112
issy-0.111
Token laughs
Token ID22051
Feature activation-3.24e-01

Pos logit contributions
Charl+0.009
use+0.008
 forward+0.008
ppy+0.007
Ru+0.007
Neg logit contributions
 pige-0.010
 sandwiches-0.010
 cakes-0.009
 balloons-0.009
 ducks-0.009
Token and
Token ID290
Feature activation-1.35e-01

Pos logit contributions
Charl+0.013
use+0.012
ifying+0.011
oris+0.011
acy+0.011
Neg logit contributions
 sandwiches-0.011
 pige-0.011
 ducks-0.010
 balloons-0.010
 cakes-0.010
Token says
Token ID1139
Feature activation-2.74e-01

Pos logit contributions
Charl+0.005
use+0.005
 forward+0.005
Ru+0.004
ppy+0.004
Neg logit contributions
 pige-0.005
 sandwiches-0.004
 cakes-0.004
 balloons-0.004
 ducks-0.004
Token,
Token ID11
Feature activation-2.51e-01

Pos logit contributions
Charl+0.008
 forward+0.008
use+0.008
ppy+0.007
ifying+0.007
Neg logit contributions
 pige-0.012
 sandwiches-0.011
 cakes-0.011
 ducks-0.011
 balloons-0.010
Token
Token ID198
Feature activation+4.82e-01

Pos logit contributions
 pige+0.027
 sandwiches+0.026
 cakes+0.025
 ducks+0.024
 balloons+0.024
Neg logit contributions
Charl-0.024
use-0.022
 forward-0.021
ifying-0.020
ppy-0.019
Token"
Token ID1
Feature activation-1.87e-02

Pos logit contributions
 pige+0.016
 sandwiches+0.016
 ducks+0.015
 cakes+0.015
 swings+0.014
Neg logit contributions
Charl-0.019
use-0.018
 forward-0.017
ifying-0.017
ppy-0.016
TokenWhat
Token ID2061
Feature activation+7.01e-02

Pos logit contributions
Charl+0.001
use+0.001
 forward+0.001
ifying+0.001
ppy+0.001
Neg logit contributions
 pige-0.001
 sandwiches-0.001
 ducks-0.001
 cakes-0.001
 balloons-0.001
Token is
Token ID318
Feature activation+2.18e+00

Pos logit contributions
 pige+0.003
 sandwiches+0.003
 ducks+0.002
 cakes+0.002
 swings+0.002
Neg logit contributions
Charl-0.002
use-0.002
 forward-0.002
Ru-0.002
ppy-0.002
Token a
Token ID257
Feature activation+1.49e+00

Pos logit contributions
 pige+0.075
 sandwiches+0.073
 cakes+0.069
 ducks+0.068
 balloons+0.066
Neg logit contributions
Charl-0.080
 forward-0.076
use-0.074
Ru-0.067
ifying-0.064
Token price
Token ID2756
Feature activation+3.10e+00

Pos logit contributions
 pige+0.044
 sandwiches+0.043
 cakes+0.040
 ducks+0.039
 balloons+0.039
Neg logit contributions
Charl-0.065
use-0.061
 forward-0.059
ifying-0.057
ppy-0.056
Token?"
Token ID1701
Feature activation+9.66e-02

Pos logit contributions
 pige+0.117
 sandwiches+0.113
 cakes+0.107
 ducks+0.106
 balloons+0.103
Neg logit contributions
Charl-0.104
use-0.101
 forward-0.097
ifying-0.088
ppy-0.087
Token Ben
Token ID3932
Feature activation+5.22e-01

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 ducks+0.002
 swings+0.002
 balloons+0.001
Neg logit contributions
Charl-0.005
use-0.005
 forward-0.005
ifying-0.005
ppy-0.005
Token asked
Token ID1965
Feature activation+1.12e+00

Pos logit contributions
 pige+0.021
 sandwiches+0.020
 cakes+0.019
 ducks+0.019
 balloons+0.019
Neg logit contributions
Charl-0.017
use-0.016
 forward-0.015
ifying-0.014
ppy-0.014
Token.
Token ID13
Feature activation+4.43e-01

Pos logit contributions
 pige+0.043
 sandwiches+0.042
 cakes+0.040
 ducks+0.039
 balloons+0.039
Neg logit contributions
Charl-0.039
use-0.036
 forward-0.035
ifying-0.033
ppy-0.032
Token
Token ID198
Feature activation+1.74e-01

Pos logit contributions
 pige+0.011
 Sue+0.010
 ducks+0.010
 sandwiches+0.010
 swings+0.010
Neg logit contributions
use-0.021
Charl-0.021
ifying-0.019
 forward-0.019
ppy-0.019
Token We
Token ID775
Feature activation+4.80e-01

Pos logit contributions
Charl+0.015
use+0.014
 forward+0.013
ppy+0.013
ley+0.012
Neg logit contributions
 pige-0.011
 sandwiches-0.010
 cakes-0.010
 ducks-0.010
 balloons-0.009
Token have
Token ID423
Feature activation+7.24e-01

Pos logit contributions
 pige+0.018
 sandwiches+0.017
 ducks+0.017
 balloons+0.017
 cakes+0.017
Neg logit contributions
Charl-0.017
use-0.014
Ru-0.014
 forward-0.014
ifying-0.014
Token apple
Token ID17180
Feature activation+2.10e+00

Pos logit contributions
 pige+0.026
mm+0.025
castle+0.024
 monkeys+0.023
 swings+0.023
Neg logit contributions
Charl-0.021
use-0.021
 forward-0.019
ifying-0.018
ky-0.016
Token pie
Token ID2508
Feature activation+1.28e+00

Pos logit contributions
 pige+0.035
 sandwiches+0.030
 ducks+0.028
 cakes+0.028
 swings+0.026
Neg logit contributions
use-0.108
Charl-0.107
 forward-0.101
ppy-0.098
sc-0.096
Token,
Token ID11
Feature activation-7.58e-02

Pos logit contributions
 pige+0.046
 sandwiches+0.045
 cakes+0.042
 ducks+0.041
 balloons+0.041
Neg logit contributions
Charl-0.046
use-0.043
 forward-0.042
ifying-0.039
ppy-0.039
Token cherry
Token ID23612
Feature activation+3.08e+00

Pos logit contributions
Charl+0.003
use+0.003
 forward+0.002
ifying+0.002
ppy+0.002
Neg logit contributions
 pige-0.003
 sandwiches-0.002
 ducks-0.002
 cakes-0.002
 swings-0.002
Token pie
Token ID2508
Feature activation+1.72e+00

Pos logit contributions
 pige+0.088
 sandwiches+0.084
 cakes+0.078
 ducks+0.077
 swings+0.075
Neg logit contributions
use-0.122
Charl-0.120
 forward-0.112
ppy-0.107
ifying-0.106
Token,
Token ID11
Feature activation-9.43e-02

Pos logit contributions
 pige+0.059
 sandwiches+0.058
 cakes+0.054
 ducks+0.053
 balloons+0.052
Neg logit contributions
Charl-0.065
use-0.061
 forward-0.059
ifying-0.055
ppy-0.055
Token pumpkin
Token ID30089
Feature activation+1.82e+00

Pos logit contributions
Charl+0.003
use+0.003
 forward+0.003
ppy+0.002
ifying+0.002
Neg logit contributions
 pige-0.004
 sandwiches-0.003
 ducks-0.003
 cakes-0.003
 balloons-0.003
Token pie
Token ID2508
Feature activation+1.74e+00

Pos logit contributions
 pige+0.059
 sandwiches+0.057
 cakes+0.053
 ducks+0.052
 balloons+0.051
Neg logit contributions
Charl-0.071
use-0.068
 forward-0.065
ppy-0.062
ifying-0.061
Token,
Token ID11
Feature activation-2.02e-01

Pos logit contributions
 pige+0.061
 sandwiches+0.059
 cakes+0.056
 ducks+0.055
 balloons+0.054
Neg logit contributions
Charl-0.064
use-0.060
 forward-0.057
ppy-0.054
ifying-0.054
Token the
Token ID262
Feature activation+1.44e-01

Pos logit contributions
Charl+0.023
use+0.022
 forward+0.021
ppy+0.020
ifying+0.020
Neg logit contributions
 pige-0.013
 sandwiches-0.013
 cakes-0.012
 ducks-0.012
 balloons-0.011
Token sun
Token ID4252
Feature activation-1.05e+00

Pos logit contributions
 pige+0.004
 sandwiches+0.004
 cakes+0.004
 swings+0.004
 balloons+0.004
Neg logit contributions
Charl-0.006
ppy-0.006
use-0.006
acy-0.005
Ru-0.005
Token.
Token ID13
Feature activation+5.61e-01

Pos logit contributions
Charl+0.040
use+0.037
 forward+0.034
Ru+0.032
ppy+0.032
Neg logit contributions
 sandwiches-0.038
 pige-0.037
 cakes-0.036
 ducks-0.035
 balloons-0.035
Token
Token ID198
Feature activation+2.91e-01

Pos logit contributions
 pige+0.021
 sandwiches+0.020
 cakes+0.019
 ducks+0.019
 balloons+0.019
Neg logit contributions
Charl-0.020
use-0.019
 forward-0.018
ifying-0.017
ppy-0.017
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
 pige+0.010
 sandwiches+0.010
 ducks+0.009
 cakes+0.009
 balloons+0.009
Neg logit contributions
Charl-0.011
use-0.010
 forward-0.010
ifying-0.009
ppy-0.009
TokenRe
Token ID3041
Feature activation+3.07e+00

Pos logit contributions
 pige+0.004
 sandwiches+0.004
 cakes+0.004
 ducks+0.004
 balloons+0.004
Neg logit contributions
Charl-0.004
use-0.003
 forward-0.003
ifying-0.003
ppy-0.003
Tokenbecca
Token ID20627
Feature activation+9.10e-01

Pos logit contributions
 sandwiches+0.109
 pige+0.103
 cakes+0.099
 pie+0.097
 ducks+0.097
Neg logit contributions
Charl-0.113
 forward-0.102
use-0.100
ky-0.095
well-0.094
Token felt
Token ID2936
Feature activation-1.41e+00

Pos logit contributions
 pige+0.033
 sandwiches+0.033
 cakes+0.031
 ducks+0.030
 balloons+0.030
Neg logit contributions
Charl-0.033
use-0.030
 forward-0.029
ifying-0.028
ppy-0.027
Token excited
Token ID6568
Feature activation-8.21e-01

Pos logit contributions
use+0.064
Charl+0.064
issy+0.061
ppy+0.060
ley+0.059
Neg logit contributions
 pige-0.037
 sandwiches-0.036
 ice-0.036
 ducks-0.035
 balloons-0.031
Token and
Token ID290
Feature activation-5.50e-01

Pos logit contributions
Charl+0.026
use+0.024
 forward+0.023
ifying+0.021
ppy+0.021
Neg logit contributions
 pige-0.033
 sandwiches-0.033
 cakes-0.031
 ducks-0.031
 balloons-0.030
Token ran
Token ID4966
Feature activation+2.25e-01

Pos logit contributions
Charl+0.022
use+0.020
ifying+0.019
 forward+0.019
ppy+0.018
Neg logit contributions
 pige-0.018
 sandwiches-0.018
 ducks-0.017
 cakes-0.017
 swings-0.017
Token.
Token ID13
Feature activation-1.87e-01

Pos logit contributions
Charl+0.007
use+0.007
ifying+0.006
acy+0.006
issy+0.006
Neg logit contributions
 sandwiches-0.006
 pige-0.006
 cakes-0.006
 ducks-0.006
 balloons-0.006
Token
Token ID220
Feature activation+2.51e-01

Pos logit contributions
Charl+0.006
use+0.006
 forward+0.005
ifying+0.005
ppy+0.005
Neg logit contributions
 pige-0.007
 sandwiches-0.007
 cakes-0.007
 ducks-0.007
 balloons-0.007
Token
Token ID198
Feature activation-1.25e-01

Pos logit contributions
 pige+0.010
 sandwiches+0.009
 cakes+0.009
 ducks+0.009
 balloons+0.009
Neg logit contributions
Charl-0.009
use-0.008
 forward-0.008
ifying-0.007
ppy-0.007
Token
Token ID198
Feature activation-3.60e-01

Pos logit contributions
Charl+0.005
use+0.005
ifying+0.004
 forward+0.004
ppy+0.004
Neg logit contributions
 pige-0.004
 sandwiches-0.004
 ducks-0.004
 cakes-0.004
 balloons-0.004
TokenThe
Token ID464
Feature activation+2.61e-01

Pos logit contributions
use+0.014
Charl+0.014
 forward+0.013
ifying+0.013
ley+0.012
Neg logit contributions
 pige-0.012
 ducks-0.011
 sandwiches-0.011
 swings-0.010
 cakes-0.010
Token nos
Token ID43630
Feature activation+3.06e+00

Pos logit contributions
 pige+0.009
 sandwiches+0.009
 cakes+0.008
 ducks+0.007
mm+0.007
Neg logit contributions
 forward-0.009
Charl-0.009
use-0.008
 Taylor-0.008
Ru-0.008
Tokeny
Token ID88
Feature activation+1.42e+00

Pos logit contributions
 sandwiches+0.171
 pige+0.168
 cakes+0.164
 balloons+0.158
 swings+0.158
Neg logit contributions
Charl-0.047
 forward-0.041
use-0.040
ppy-0.034
ifying-0.033
Token metal
Token ID6147
Feature activation+4.85e-01

Pos logit contributions
 pige+0.039
 sandwiches+0.038
 cakes+0.034
 ducks+0.034
 balloons+0.033
Neg logit contributions
Charl-0.065
use-0.061
 forward-0.060
ifying-0.057
ppy-0.056
Token robot
Token ID9379
Feature activation-2.24e-01

Pos logit contributions
 pige+0.014
 sandwiches+0.014
 cakes+0.013
 ducks+0.013
 balloons+0.012
Neg logit contributions
Charl-0.021
use-0.020
 forward-0.019
ifying-0.019
ppy-0.018
Token thought
Token ID1807
Feature activation-1.55e-01

Pos logit contributions
Charl+0.008
use+0.007
 forward+0.007
ifying+0.007
ppy+0.006
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 cakes-0.008
 ducks-0.008
 balloons-0.007
Token it
Token ID340
Feature activation+4.36e-01

Pos logit contributions
Charl+0.006
 forward+0.006
use+0.006
ppy+0.006
sc+0.005
Neg logit contributions
 pige-0.005
 sandwiches-0.004
 cakes-0.004
 ducks-0.004
 balloons-0.004
Token playing
Token ID2712
Feature activation+7.60e-01

Pos logit contributions
 sandwiches+0.000
 pige+0.000
 ducks+0.000
 cakes+0.000
 swings+0.000
Neg logit contributions
Charl-0.000
 forward-0.000
use-0.000
well-0.000
oping-0.000
Token happily
Token ID18177
Feature activation+3.44e-01

Pos logit contributions
 pige+0.030
 sandwiches+0.028
 cakes+0.027
 ducks+0.026
 balloons+0.026
Neg logit contributions
Charl-0.025
 forward-0.023
use-0.022
sc-0.020
ifying-0.020
Token together
Token ID1978
Feature activation+3.39e-01

Pos logit contributions
 pige+0.012
 sandwiches+0.012
 cakes+0.011
 ducks+0.011
 balloons+0.011
Neg logit contributions
Charl-0.013
use-0.012
 forward-0.011
ifying-0.011
ppy-0.010
Token with
Token ID351
Feature activation-7.17e-01

Pos logit contributions
 pige+0.012
 sandwiches+0.012
 cakes+0.011
 balloons+0.011
 ducks+0.011
Neg logit contributions
Charl-0.013
use-0.012
 forward-0.011
ifying-0.011
ppy-0.011
Token the
Token ID262
Feature activation+7.09e-01

Pos logit contributions
Charl+0.030
use+0.028
 forward+0.027
ifying+0.026
ppy+0.025
Neg logit contributions
 pige-0.022
 sandwiches-0.022
 cakes-0.020
 ducks-0.020
 balloons-0.019
Token pick
Token ID2298
Feature activation+3.06e+00

Pos logit contributions
 pige+0.022
 sandwiches+0.022
 cakes+0.020
 ducks+0.020
 balloons+0.020
Neg logit contributions
Charl-0.029
use-0.027
 forward-0.026
ifying-0.024
ppy-0.024
Token.
Token ID13
Feature activation+4.98e-01

Pos logit contributions
 pige+0.107
 sandwiches+0.106
 cakes+0.098
 ducks+0.097
 balloons+0.095
Neg logit contributions
Charl-0.119
use-0.111
 forward-0.103
ifying-0.100
ppy-0.098
Token<|endoftext|>
Token ID50256
Feature activation+3.91e-01

Pos logit contributions
 pige+0.018
 sandwiches+0.018
 cakes+0.016
 ducks+0.016
 balloons+0.016
Neg logit contributions
Charl-0.018
use-0.017
 forward-0.016
ifying-0.015
ppy-0.015
TokenOnce
Token ID7454
Feature activation-4.67e-01

Pos logit contributions
 pige+0.017
 sandwiches+0.017
 cakes+0.016
 ducks+0.016
 balloons+0.016
Neg logit contributions
Charl-0.011
use-0.010
 forward-0.010
ifying-0.009
ppy-0.009
Token there
Token ID612
Feature activation-2.05e-01

Pos logit contributions
Charl+0.008
use+0.007
 forward+0.007
ifying+0.006
ppy+0.006
Neg logit contributions
 pige-0.025
 sandwiches-0.025
 cakes-0.024
 ducks-0.024
 balloons-0.024
Token was
Token ID373
Feature activation-4.14e-01

Pos logit contributions
Charl+0.006
use+0.005
 forward+0.005
oping+0.005
well+0.005
Neg logit contributions
 pige-0.009
 sandwiches-0.009
 cakes-0.008
 balloons-0.008
 ducks-0.008
Token would
Token ID561
Feature activation-2.35e-01

Pos logit contributions
 pige+0.010
 sandwiches+0.010
 ducks+0.010
 cakes+0.010
 swings+0.009
Neg logit contributions
Charl-0.010
use-0.009
 forward-0.009
ifying-0.009
och-0.008
Token play
Token ID711
Feature activation+5.81e-02

Pos logit contributions
Charl+0.010
use+0.009
ifying+0.009
och+0.009
 forward+0.009
Neg logit contributions
 swings-0.007
 cakes-0.007
 sandwiches-0.007
 pige-0.007
 balloons-0.007
Token tag
Token ID7621
Feature activation+3.02e-01

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 cakes+0.002
 ducks+0.002
 balloons+0.002
Neg logit contributions
Charl-0.002
use-0.002
 forward-0.002
ifying-0.002
ppy-0.002
Token,
Token ID11
Feature activation-5.46e-01

Pos logit contributions
 sandwiches+0.011
 pige+0.010
 cakes+0.010
 balloons+0.010
 swings+0.010
Neg logit contributions
Charl-0.012
use-0.011
ifying-0.011
 forward-0.010
och-0.010
Token hop
Token ID1725
Feature activation-5.07e-01

Pos logit contributions
Charl+0.023
use+0.021
 forward+0.021
ifying+0.020
ppy+0.019
Neg logit contributions
 pige-0.017
 sandwiches-0.017
 cakes-0.016
 ducks-0.016
 balloons-0.016
Tokensc
Token ID1416
Feature activation-3.86e+00

Pos logit contributions
use+0.018
Charl+0.018
ifying+0.017
enna+0.016
och+0.016
Neg logit contributions
 sandwiches-0.019
 swings-0.018
 balloons-0.017
 pige-0.017
 ducks-0.017
Tokenot
Token ID313
Feature activation+1.10e-01

Pos logit contributions
Charl+0.106
use+0.094
ifying+0.080
 forward+0.080
Ru+0.073
Neg logit contributions
 sandwiches-0.182
 pige-0.174
 cakes-0.168
 ducks-0.165
 balloons-0.164
Tokench
Token ID354
Feature activation-2.63e-02

Pos logit contributions
 sandwiches+0.004
 pige+0.004
 balloons+0.004
 cakes+0.004
 ducks+0.004
Neg logit contributions
Charl-0.004
use-0.004
 forward-0.003
ifying-0.003
ley-0.003
Token,
Token ID11
Feature activation-4.77e-01

Pos logit contributions
Charl+0.001
use+0.001
 forward+0.001
ppy+0.001
ifying+0.001
Neg logit contributions
 pige-0.001
 sandwiches-0.001
 cakes-0.001
 ducks-0.001
 balloons-0.001
Token and
Token ID290
Feature activation+3.63e-01

Pos logit contributions
Charl+0.019
use+0.018
ifying+0.017
issy+0.016
 forward+0.016
Neg logit contributions
 sandwiches-0.017
 pige-0.016
 swings-0.016
 balloons-0.016
 cakes-0.016
Token many
Token ID867
Feature activation+1.35e+00

Pos logit contributions
 pige+0.012
 sandwiches+0.012
 cakes+0.011
 balloons+0.011
 ducks+0.011
Neg logit contributions
Charl-0.015
use-0.014
 forward-0.013
ifying-0.013
ppy-0.012
Token play
Token ID711
Feature activation-1.52e-01

Pos logit contributions
Charl+0.008
use+0.007
 forward+0.007
ifying+0.007
och+0.007
Neg logit contributions
 sandwiches-0.005
 ducks-0.005
 cakes-0.005
 pige-0.005
 ice-0.005
Token a
Token ID257
Feature activation-3.83e-01

Pos logit contributions
Charl+0.005
 forward+0.004
use+0.004
ppy+0.004
sc+0.004
Neg logit contributions
 pige-0.007
 sandwiches-0.006
 cakes-0.006
 balloons-0.006
 ducks-0.006
Token game
Token ID983
Feature activation-4.22e-01

Pos logit contributions
Charl+0.017
use+0.016
 forward+0.016
sc+0.015
ppy+0.014
Neg logit contributions
 pige-0.010
 sandwiches-0.010
 cakes-0.009
 balloons-0.009
 ducks-0.009
Token of
Token ID286
Feature activation+2.09e-01

Pos logit contributions
Charl+0.019
use+0.018
ifying+0.017
 forward+0.016
ley+0.016
Neg logit contributions
 sandwiches-0.013
 pige-0.012
 swings-0.011
 cakes-0.011
 ducks-0.011
Token hop
Token ID1725
Feature activation-6.44e-01

Pos logit contributions
 pige+0.008
 sandwiches+0.007
mm+0.007
 cakes+0.007
 balloons+0.007
Neg logit contributions
Charl-0.007
sc-0.006
 forward-0.006
use-0.006
Ru-0.005
Tokensc
Token ID1416
Feature activation-3.76e+00

Pos logit contributions
Charl+0.025
use+0.024
enna+0.023
ifying+0.022
issy+0.022
Neg logit contributions
 sandwiches-0.022
 ducks-0.021
 swings-0.021
 pige-0.020
 ice-0.020
Tokenot
Token ID313
Feature activation-3.28e-01

Pos logit contributions
Charl+0.097
use+0.087
 forward+0.076
ifying+0.074
Ru+0.068
Neg logit contributions
 sandwiches-0.179
 pige-0.175
 cakes-0.166
 ducks-0.165
 balloons-0.162
Tokench
Token ID354
Feature activation-1.70e-01

Pos logit contributions
Charl+0.012
use+0.012
ifying+0.011
och+0.010
 forward+0.010
Neg logit contributions
 sandwiches-0.011
 pige-0.011
 ducks-0.011
 balloons-0.010
 ice-0.010
Token,
Token ID11
Feature activation-3.32e-01

Pos logit contributions
Charl+0.006
use+0.006
 forward+0.006
ifying+0.005
ppy+0.005
Neg logit contributions
 pige-0.006
 sandwiches-0.006
 cakes-0.006
 ducks-0.006
 balloons-0.006
Token but
Token ID475
Feature activation-1.40e-01

Pos logit contributions
Charl+0.013
use+0.012
ifying+0.011
 forward+0.011
issy+0.011
Neg logit contributions
 sandwiches-0.011
 pige-0.011
 ducks-0.011
 swings-0.011
 cakes-0.010
Token it
Token ID340
Feature activation-3.75e-01

Pos logit contributions
Charl+0.006
use+0.005
 forward+0.005
ifying+0.005
ppy+0.005
Neg logit contributions
 pige-0.005
 sandwiches-0.005
 cakes-0.004
 ducks-0.004
 balloons-0.004
Token together
Token ID1978
Feature activation+6.42e-01

Pos logit contributions
 pige+0.026
 sandwiches+0.025
 cakes+0.024
 ducks+0.024
 balloons+0.023
Neg logit contributions
Charl-0.015
 forward-0.015
use-0.014
sc-0.013
bs-0.013
Token.
Token ID13
Feature activation+3.04e-01

Pos logit contributions
 pige+0.029
 sandwiches+0.028
 cakes+0.027
 ducks+0.026
 balloons+0.026
Neg logit contributions
Charl-0.017
 forward-0.016
use-0.015
ifying-0.014
ppy-0.014
Token They
Token ID1119
Feature activation+7.49e-01

Pos logit contributions
 sandwiches+0.010
 pige+0.010
 swings+0.010
 cakes+0.010
 ducks+0.010
Neg logit contributions
Charl-0.011
use-0.011
 forward-0.010
ifying-0.010
ppy-0.010
Token played
Token ID2826
Feature activation+2.22e-01

Pos logit contributions
 pige+0.024
 sandwiches+0.023
 cakes+0.021
 ducks+0.021
 balloons+0.021
Neg logit contributions
Charl-0.031
use-0.029
 forward-0.028
ifying-0.027
ppy-0.026
Token hop
Token ID1725
Feature activation-3.15e-01

Pos logit contributions
 pige+0.009
 flavors+0.008
 sandwiches+0.008
 cakes+0.008
 balloons+0.008
Neg logit contributions
Charl-0.006
 forward-0.006
sc-0.006
Ru-0.005
use-0.005
Tokensc
Token ID1416
Feature activation-3.67e+00

Pos logit contributions
Charl+0.010
use+0.009
ifying+0.008
 forward+0.008
och+0.007
Neg logit contributions
 sandwiches-0.014
 pige-0.013
 ducks-0.013
 balloons-0.013
 swings-0.012
Tokenot
Token ID313
Feature activation-8.20e-02

Pos logit contributions
Charl+0.074
use+0.065
 forward+0.061
ppy+0.053
ifying+0.053
Neg logit contributions
 pige-0.192
 sandwiches-0.188
 cakes-0.181
 ducks-0.179
 balloons-0.177
Tokench
Token ID354
Feature activation-5.93e-02

Pos logit contributions
Charl+0.003
use+0.002
 forward+0.002
ifying+0.002
ppy+0.002
Neg logit contributions
 pige-0.003
 sandwiches-0.003
 cakes-0.003
 ducks-0.003
 balloons-0.003
Token and
Token ID290
Feature activation+6.35e-01

Pos logit contributions
Charl+0.002
 forward+0.002
sc+0.002
ppy+0.002
use+0.002
Neg logit contributions
 pige-0.002
 sandwiches-0.002
 cakes-0.002
 ducks-0.002
 flavors-0.002
Token picked
Token ID6497
Feature activation+9.06e-01

Pos logit contributions
 pige+0.025
 sandwiches+0.024
 cakes+0.022
 ducks+0.022
 balloons+0.021
Neg logit contributions
Charl-0.021
use-0.020
 forward-0.019
ppy-0.018
sc-0.018
Token more
Token ID517
Feature activation+6.62e-01

Pos logit contributions
 pige+0.030
 cakes+0.026
 sandwiches+0.026
 pie+0.025
 balloons+0.025
Neg logit contributions
 forward-0.031
Charl-0.030
sc-0.029
use-0.027
ky-0.027
Tokenand
Token ID392
Feature activation-1.31e+00

Pos logit contributions
 pige+0.014
 sandwiches+0.013
 cakes+0.012
 ducks+0.012
 balloons+0.012
Neg logit contributions
Charl-0.012
use-0.011
 forward-0.011
ifying-0.010
ppy-0.010
Token-
Token ID12
Feature activation+4.90e-01

Pos logit contributions
Charl+0.038
use+0.033
 forward+0.033
ppy+0.032
bs+0.029
Neg logit contributions
 pige-0.055
 sandwiches-0.054
 ducks-0.052
 cakes-0.051
 balloons-0.051
Tokenseek
Token ID36163
Feature activation-4.04e-01

Pos logit contributions
 pige+0.016
 sandwiches+0.015
 cakes+0.014
 ducks+0.014
 balloons+0.014
Neg logit contributions
Charl-0.020
use-0.019
 forward-0.018
ifying-0.018
ppy-0.017
Token and
Token ID290
Feature activation+8.58e-01

Pos logit contributions
Charl+0.014
use+0.013
 forward+0.012
ifying+0.012
ppy+0.011
Neg logit contributions
 sandwiches-0.016
 pige-0.016
 cakes-0.015
 ducks-0.014
 balloons-0.014
Token hop
Token ID1725
Feature activation+1.98e-02

Pos logit contributions
 pige+0.033
 sandwiches+0.031
 cakes+0.029
 ducks+0.029
 balloons+0.028
Neg logit contributions
Charl-0.030
use-0.028
 forward-0.027
ppy-0.025
sc-0.024
Tokensc
Token ID1416
Feature activation-3.53e+00

Pos logit contributions
 sandwiches+0.001
 pige+0.001
 ducks+0.001
 swings+0.001
 balloons+0.001
Neg logit contributions
Charl-0.001
use-0.001
ifying-0.001
 forward-0.001
och-0.001
Tokenot
Token ID313
Feature activation+1.28e-01

Pos logit contributions
Charl+0.091
use+0.082
 forward+0.074
ifying+0.070
Ru+0.064
Neg logit contributions
 sandwiches-0.168
 pige-0.164
 cakes-0.158
 ducks-0.155
 balloons-0.153
Tokench
Token ID354
Feature activation+1.54e-01

Pos logit contributions
 sandwiches+0.005
 pige+0.005
 cakes+0.005
 balloons+0.005
 ducks+0.005
Neg logit contributions
Charl-0.004
use-0.004
 forward-0.004
ifying-0.004
Ru-0.003
Token.
Token ID13
Feature activation-3.54e-01

Pos logit contributions
 pige+0.006
 sandwiches+0.006
 cakes+0.006
 ducks+0.006
 balloons+0.006
Neg logit contributions
Charl-0.005
 forward-0.004
use-0.004
ppy-0.004
sc-0.004
Token Then
Token ID3244
Feature activation-1.72e+00

Pos logit contributions
Charl+0.012
use+0.012
 forward+0.011
ppy+0.010
ifying+0.010
Neg logit contributions
 sandwiches-0.013
 pige-0.013
 ducks-0.012
 cakes-0.012
 swings-0.012
Token,
Token ID11
Feature activation-9.11e-01

Pos logit contributions
Charl+0.060
use+0.055
 forward+0.053
ifying+0.050
ppy+0.049
Neg logit contributions
 pige-0.065
 sandwiches-0.064
 cakes-0.060
 ducks-0.060
 balloons-0.059
Token They
Token ID1119
Feature activation+4.41e-01

Pos logit contributions
 pige+0.007
 sandwiches+0.007
 cakes+0.007
 ducks+0.006
 balloons+0.006
Neg logit contributions
Charl-0.007
use-0.006
 forward-0.006
ifying-0.006
ppy-0.006
Token started
Token ID2067
Feature activation+4.66e-02

Pos logit contributions
 pige+0.018
 sandwiches+0.017
 cakes+0.016
 ducks+0.016
 balloons+0.016
Neg logit contributions
Charl-0.014
use-0.013
 forward-0.013
ppy-0.012
ifying-0.012
Token to
Token ID284
Feature activation+2.39e-01

Pos logit contributions
 pige+0.002
 sandwiches+0.002
mm+0.002
 cakes+0.002
 ducks+0.002
Neg logit contributions
Charl-0.001
 forward-0.001
use-0.001
Ru-0.001
sc-0.001
Token play
Token ID711
Feature activation+1.45e-01

Pos logit contributions
 pige+0.009
 sandwiches+0.008
 cakes+0.008
 ducks+0.008
 balloons+0.008
Neg logit contributions
Charl-0.009
use-0.008
 forward-0.008
ifying-0.007
ppy-0.007
Token hop
Token ID1725
Feature activation-3.17e-01

Pos logit contributions
 pige+0.006
 sandwiches+0.006
 cakes+0.005
 ducks+0.005
 balloons+0.005
Neg logit contributions
Charl-0.004
 forward-0.004
use-0.004
sc-0.004
Ru-0.003
Tokensc
Token ID1416
Feature activation-3.53e+00

Pos logit contributions
Charl+0.011
use+0.011
ifying+0.010
acy+0.009
och+0.009
Neg logit contributions
 sandwiches-0.012
 pige-0.012
 swings-0.012
 ducks-0.011
 balloons-0.011
Tokenot
Token ID313
Feature activation-1.16e-02

Pos logit contributions
Charl+0.083
use+0.074
 forward+0.067
ifying+0.062
ppy+0.057
Neg logit contributions
 pige-0.174
 sandwiches-0.173
 cakes-0.164
 ducks-0.162
 balloons-0.160
Tokench
Token ID354
Feature activation-5.50e-02

Pos logit contributions
Charl+0.000
use+0.000
ifying+0.000
 forward+0.000
ley+0.000
Neg logit contributions
 sandwiches-0.000
 pige-0.000
 balloons-0.000
 ducks-0.000
 cakes-0.000
Token and
Token ID290
Feature activation+5.26e-01

Pos logit contributions
Charl+0.002
 forward+0.001
use+0.001
ppy+0.001
sc+0.001
Neg logit contributions
 pige-0.002
 sandwiches-0.002
 cakes-0.002
 ducks-0.002
 balloons-0.002
Token had
Token ID550
Feature activation+4.37e-01

Pos logit contributions
 pige+0.019
 sandwiches+0.018
 cakes+0.017
 ducks+0.017
 balloons+0.016
Neg logit contributions
Charl-0.019
use-0.018
 forward-0.018
ppy-0.016
ifying-0.016
Token lots
Token ID6041
Feature activation+7.79e-01

Pos logit contributions
 pige+0.019
mm+0.017
ries+0.016
eros+0.016
 cakes+0.016
Neg logit contributions
Charl-0.011
sc-0.010
 rather-0.010
 forward-0.010
 particularly-0.009
Token knew
Token ID2993
Feature activation-5.99e-01

Pos logit contributions
Charl+0.042
use+0.039
 forward+0.037
ifying+0.035
ppy+0.034
Neg logit contributions
 pige-0.047
 sandwiches-0.046
 cakes-0.043
 ducks-0.043
 balloons-0.042
Token how
Token ID703
Feature activation-4.50e-01

Pos logit contributions
Charl+0.027
use+0.025
 forward+0.023
ifying+0.023
well+0.022
Neg logit contributions
 sandwiches-0.017
 pige-0.017
 cakes-0.016
 ducks-0.016
 balloons-0.016
Token to
Token ID284
Feature activation-8.05e-01

Pos logit contributions
Charl+0.013
 forward+0.013
use+0.013
ifying+0.011
ppy+0.010
Neg logit contributions
 pige-0.018
 sandwiches-0.018
 cakes-0.017
 ducks-0.017
 balloons-0.016
Token play
Token ID711
Feature activation-4.43e-01

Pos logit contributions
Charl+0.031
use+0.028
 forward+0.028
ifying+0.026
ppy+0.025
Neg logit contributions
 pige-0.028
 sandwiches-0.027
 ducks-0.026
 cakes-0.026
 balloons-0.026
Token hop
Token ID1725
Feature activation-7.70e-01

Pos logit contributions
Charl+0.015
 forward+0.013
use+0.013
sc+0.012
ppy+0.012
Neg logit contributions
 pige-0.017
 sandwiches-0.017
 cakes-0.016
 ducks-0.016
 balloons-0.015
Tokensc
Token ID1416
Feature activation-3.46e+00

Pos logit contributions
Charl+0.033
use+0.029
issy+0.028
ifying+0.028
acy+0.028
Neg logit contributions
 sandwiches-0.024
 pige-0.023
 ducks-0.022
 balloons-0.022
 ice-0.022
Tokenot
Token ID313
Feature activation-1.85e-01

Pos logit contributions
Charl+0.110
use+0.099
ifying+0.089
 forward+0.082
 Taylor+0.080
Neg logit contributions
 sandwiches-0.147
 pige-0.138
 cakes-0.131
 ducks-0.130
 balloons-0.129
Tokench
Token ID354
Feature activation-4.28e-01

Pos logit contributions
Charl+0.007
use+0.006
ifying+0.006
acy+0.006
ley+0.006
Neg logit contributions
 sandwiches-0.006
 pige-0.006
 balloons-0.006
 ducks-0.006
 ice-0.006
Token.
Token ID13
Feature activation+2.07e-01

Pos logit contributions
Charl+0.016
use+0.015
 forward+0.014
ifying+0.013
ppy+0.012
Neg logit contributions
 pige-0.016
 sandwiches-0.016
 cakes-0.014
 ducks-0.014
 balloons-0.014
Token
Token ID198
Feature activation+1.84e-01

Pos logit contributions
 pige+0.008
 sandwiches+0.008
 cakes+0.007
 ducks+0.007
 balloons+0.007
Neg logit contributions
Charl-0.007
use-0.007
 forward-0.006
ifying-0.006
ppy-0.006
Token
Token ID198
Feature activation-1.23e-01

Pos logit contributions
 pige+0.006
 sandwiches+0.006
 cakes+0.006
 ducks+0.006
 balloons+0.006
Neg logit contributions
Charl-0.007
use-0.006
 forward-0.006
ifying-0.006
ppy-0.006
Token She
Token ID1375
Feature activation+1.84e-01

Pos logit contributions
 pige+0.036
 sandwiches+0.035
 cakes+0.033
 ducks+0.033
 balloons+0.033
Neg logit contributions
Charl-0.030
use-0.028
 forward-0.027
ifying-0.025
ppy-0.024
Token played
Token ID2826
Feature activation+2.78e-01

Pos logit contributions
 pige+0.006
 sandwiches+0.006
 ducks+0.006
 swings+0.006
 balloons+0.006
Neg logit contributions
Charl-0.007
use-0.007
 forward-0.006
ifying-0.006
arl-0.006
Token tag
Token ID7621
Feature activation+3.80e-01

Pos logit contributions
 pige+0.010
 sandwiches+0.009
 cakes+0.009
 ducks+0.009
 balloons+0.009
Neg logit contributions
Charl-0.010
 forward-0.009
use-0.009
sc-0.008
ppy-0.008
Token and
Token ID290
Feature activation+1.02e+00

Pos logit contributions
 sandwiches+0.014
 pige+0.014
 cakes+0.013
 balloons+0.013
 ducks+0.013
Neg logit contributions
Charl-0.014
use-0.013
ifying-0.012
 forward-0.012
ppy-0.011
Token hop
Token ID1725
Feature activation-5.92e-02

Pos logit contributions
 pige+0.034
 sandwiches+0.033
 cakes+0.031
 ducks+0.030
 balloons+0.030
Neg logit contributions
Charl-0.040
use-0.038
 forward-0.036
ifying-0.034
ppy-0.034
Tokensc
Token ID1416
Feature activation-3.43e+00

Pos logit contributions
use+0.002
Charl+0.002
ifying+0.002
issy+0.002
och+0.002
Neg logit contributions
 sandwiches-0.002
 pige-0.002
 balloons-0.002
 swings-0.002
 ducks-0.002
Tokenot
Token ID313
Feature activation+2.78e-01

Pos logit contributions
Charl+0.084
use+0.076
 forward+0.068
ifying+0.063
Ru+0.059
Neg logit contributions
 sandwiches-0.168
 pige-0.165
 cakes-0.157
 ducks-0.154
 balloons-0.154
Tokench
Token ID354
Feature activation+2.57e-01

Pos logit contributions
 sandwiches+0.011
 pige+0.011
 balloons+0.010
 cakes+0.010
 ducks+0.010
Neg logit contributions
Charl-0.009
use-0.009
ifying-0.008
 forward-0.008
Ru-0.008
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
 pige+0.011
 sandwiches+0.010
 cakes+0.010
 ducks+0.010
 balloons+0.009
Neg logit contributions
Charl-0.008
 forward-0.007
use-0.007
ppy-0.007
sc-0.006
Token pretended
Token ID36766
Feature activation+4.15e-01

Pos logit contributions
 pige+0.035
 sandwiches+0.034
 cakes+0.032
 ducks+0.031
 balloons+0.030
Neg logit contributions
Charl-0.041
use-0.038
 forward-0.037
ifying-0.035
ppy-0.034
Token to
Token ID284
Feature activation-9.06e-02

Pos logit contributions
 pige+0.016
 sandwiches+0.016
 ducks+0.015
 cakes+0.015
 balloons+0.014
Neg logit contributions
Charl-0.013
 forward-0.012
use-0.012
ifying-0.011
sc-0.010
Token love
Token ID1842
Feature activation+4.72e-02

Pos logit contributions
Charl+0.006
 forward+0.006
use+0.006
ifying+0.005
sc+0.005
Neg logit contributions
 pige-0.004
 sandwiches-0.004
 cakes-0.003
 ducks-0.003
 balloons-0.003
Token.
Token ID13
Feature activation+3.34e-01

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 cakes+0.002
 ducks+0.002
 balloons+0.002
Neg logit contributions
Charl-0.002
use-0.002
 forward-0.001
ifying-0.001
ppy-0.001
Token They
Token ID1119
Feature activation+6.66e-01

Pos logit contributions
 pige+0.009
 sandwiches+0.009
 cakes+0.008
 Sue+0.008
 ducks+0.008
Neg logit contributions
Charl-0.015
use-0.015
ppy-0.013
 forward-0.013
ley-0.013
Token played
Token ID2826
Feature activation+4.87e-01

Pos logit contributions
 pige+0.021
 sandwiches+0.020
 ducks+0.020
 balloons+0.020
 cakes+0.019
Neg logit contributions
Charl-0.027
use-0.025
 forward-0.024
ifying-0.024
Ru-0.023
Token hop
Token ID1725
Feature activation-2.63e-01

Pos logit contributions
 pige+0.019
 sandwiches+0.016
 cakes+0.016
 flavors+0.016
 ducks+0.016
Neg logit contributions
 forward-0.016
Charl-0.016
sc-0.014
use-0.013
ky-0.012
Tokensc
Token ID1416
Feature activation-3.42e+00

Pos logit contributions
Charl+0.008
use+0.008
ifying+0.007
 forward+0.007
ppy+0.007
Neg logit contributions
 sandwiches-0.011
 pige-0.011
 balloons-0.010
 ducks-0.010
 swings-0.010
Tokenot
Token ID313
Feature activation-4.03e-02

Pos logit contributions
Charl+0.075
use+0.066
 forward+0.061
ifying+0.055
ppy+0.052
Neg logit contributions
 pige-0.173
 sandwiches-0.172
 cakes-0.164
 ducks-0.162
 balloons-0.161
Tokench
Token ID354
Feature activation+9.03e-02

Pos logit contributions
Charl+0.001
use+0.001
ifying+0.001
 forward+0.001
Ru+0.001
Neg logit contributions
 sandwiches-0.002
 pige-0.002
 balloons-0.001
 ducks-0.001
 cakes-0.001
Token,
Token ID11
Feature activation-1.01e-01

Pos logit contributions
 pige+0.003
 sandwiches+0.003
 cakes+0.003
 ducks+0.003
 balloons+0.003
Neg logit contributions
Charl-0.003
 forward-0.003
use-0.003
ppy-0.003
sc-0.003
Token jumped
Token ID11687
Feature activation-3.67e-01

Pos logit contributions
Charl+0.004
use+0.004
 forward+0.003
ifying+0.003
ppy+0.003
Neg logit contributions
 pige-0.004
 sandwiches-0.004
 cakes-0.003
 ducks-0.003
 balloons-0.003
Token rope
Token ID17182
Feature activation+5.95e-01

Pos logit contributions
Charl+0.012
use+0.011
 forward+0.011
ifying+0.010
ppy+0.010
Neg logit contributions
 pige-0.015
 sandwiches-0.014
 cakes-0.014
 ducks-0.013
 balloons-0.013
Token day
Token ID1110
Feature activation+2.94e-01

Pos logit contributions
 pige+0.003
 sandwiches+0.003
 cakes+0.003
 ducks+0.003
 balloons+0.003
Neg logit contributions
Charl-0.003
use-0.003
 forward-0.003
ifying-0.003
ppy-0.003
Token,
Token ID11
Feature activation+1.63e+00

Pos logit contributions
 pige+0.015
 sandwiches+0.014
 cakes+0.013
 swings+0.013
 ducks+0.013
Neg logit contributions
 forward-0.006
Charl-0.006
use-0.006
ifying-0.005
bs-0.005
Token they
Token ID484
Feature activation+8.75e-01

Pos logit contributions
 pige+0.048
 sandwiches+0.042
 pie+0.038
 ducks+0.037
 cakes+0.037
Neg logit contributions
 forward-0.065
Charl-0.063
ifying-0.061
use-0.057
bs-0.057
Token play
Token ID711
Feature activation+7.59e-01

Pos logit contributions
 pige+0.031
 ducks+0.030
 swings+0.029
 sandwiches+0.029
 balloons+0.029
Neg logit contributions
Charl-0.035
uster-0.029
ifying-0.029
och-0.029
oris-0.028
Token hop
Token ID1725
Feature activation-4.14e-01

Pos logit contributions
 pige+0.024
 sandwiches+0.023
 cakes+0.021
 ducks+0.021
 balloons+0.021
Neg logit contributions
Charl-0.032
use-0.029
 forward-0.029
ifying-0.027
ppy-0.027
Tokensc
Token ID1416
Feature activation-3.39e+00

Pos logit contributions
Charl+0.017
use+0.016
och+0.015
enna+0.015
issy+0.015
Neg logit contributions
 sandwiches-0.014
 ducks-0.013
 swings-0.013
 pige-0.013
 balloons-0.012
Tokenot
Token ID313
Feature activation-1.03e-01

Pos logit contributions
Charl+0.081
use+0.072
 forward+0.066
ifying+0.061
ppy+0.057
Neg logit contributions
 pige-0.166
 sandwiches-0.166
 cakes-0.157
 ducks-0.155
 balloons-0.154
Tokench
Token ID354
Feature activation+1.17e-01

Pos logit contributions
Charl+0.004
use+0.003
ifying+0.003
 forward+0.003
ley+0.003
Neg logit contributions
 pige-0.004
 sandwiches-0.004
 ducks-0.004
 balloons-0.004
 cakes-0.004
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
 pige+0.005
 sandwiches+0.004
 cakes+0.004
 ducks+0.004
 balloons+0.004
Neg logit contributions
Charl-0.004
 forward-0.004
use-0.004
ppy-0.003
sc-0.003
Token They
Token ID1119
Feature activation+1.58e-01

Pos logit contributions
 pige+0.051
 sandwiches+0.050
 pie+0.045
 cakes+0.044
 sandwich+0.042
Neg logit contributions
ifying-0.034
 forward-0.033
Charl-0.032
sc-0.031
use-0.030
Token draw
Token ID3197
Feature activation+1.21e-01

Pos logit contributions
 swings+0.005
 ducks+0.005
 pige+0.005
 sandwiches+0.004
 balloons+0.004
Neg logit contributions
Charl-0.007
ifying-0.006
och-0.006
uster-0.006
use-0.006
Token hide
Token ID7808
Feature activation-4.24e-02

Pos logit contributions
Charl+0.011
use+0.010
 forward+0.010
ifying+0.009
ppy+0.009
Neg logit contributions
 pige-0.010
 sandwiches-0.010
 cakes-0.009
 ducks-0.009
 balloons-0.009
Token and
Token ID290
Feature activation+9.26e-01

Pos logit contributions
Charl+0.002
use+0.001
 forward+0.001
ifying+0.001
issy+0.001
Neg logit contributions
 sandwiches-0.002
 pige-0.002
 ducks-0.001
 balloons-0.001
 cakes-0.001
Token seek
Token ID5380
Feature activation+9.04e-01

Pos logit contributions
 pige+0.030
 sandwiches+0.029
 cakes+0.027
 ducks+0.027
 balloons+0.026
Neg logit contributions
Charl-0.037
use-0.035
 forward-0.034
ifying-0.032
ppy-0.031
Token,
Token ID11
Feature activation+4.68e-02

Pos logit contributions
 sandwiches+0.033
 pige+0.032
 cakes+0.030
 ducks+0.030
 balloons+0.030
Neg logit contributions
Charl-0.033
use-0.031
 forward-0.029
ifying-0.028
ppy-0.027
Token hop
Token ID1725
Feature activation-5.82e-02

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 cakes+0.002
 ducks+0.002
 balloons+0.002
Neg logit contributions
Charl-0.002
use-0.002
 forward-0.001
ifying-0.001
ppy-0.001
Tokensc
Token ID1416
Feature activation-3.39e+00

Pos logit contributions
Charl+0.002
use+0.002
issy+0.002
ifying+0.002
och+0.002
Neg logit contributions
 sandwiches-0.002
 pige-0.002
 balloons-0.002
 ducks-0.002
 swings-0.002
Tokenot
Token ID313
Feature activation+6.22e-01

Pos logit contributions
Charl+0.089
use+0.080
 forward+0.069
ifying+0.066
Ru+0.063
Neg logit contributions
 sandwiches-0.162
 pige-0.156
 cakes-0.150
 ducks-0.148
 balloons-0.146
Tokench
Token ID354
Feature activation-8.88e-02

Pos logit contributions
 sandwiches+0.024
 pige+0.024
 balloons+0.023
 cakes+0.023
 ducks+0.023
Neg logit contributions
Charl-0.020
use-0.019
 forward-0.018
ifying-0.017
Ru-0.016
Token and
Token ID290
Feature activation+9.76e-01

Pos logit contributions
Charl+0.003
use+0.003
 forward+0.003
ifying+0.002
ppy+0.002
Neg logit contributions
 pige-0.003
 sandwiches-0.003
 cakes-0.003
 ducks-0.003
 balloons-0.003
Token much
Token ID881
Feature activation+8.50e-01

Pos logit contributions
 pige+0.032
 sandwiches+0.032
 cakes+0.030
 ducks+0.029
 balloons+0.029
Neg logit contributions
Charl-0.039
use-0.036
 forward-0.035
ifying-0.033
ppy-0.032
Token more
Token ID517
Feature activation+1.04e+00

Pos logit contributions
 sandwiches+0.027
 pige+0.026
 cakes+0.025
 ducks+0.024
 balloons+0.024
Neg logit contributions
Charl-0.037
use-0.034
 forward-0.032
ppy-0.031
ifying-0.031
Token group
Token ID1448
Feature activation+1.16e+00

Pos logit contributions
 pige+0.015
 sandwiches+0.015
 cakes+0.014
 ducks+0.013
 balloons+0.013
Neg logit contributions
Charl-0.022
use-0.020
 forward-0.020
ifying-0.019
ppy-0.018
Token of
Token ID286
Feature activation+5.80e-01

Pos logit contributions
 pige+0.043
 sandwiches+0.043
 cakes+0.041
 ducks+0.039
 balloons+0.039
Neg logit contributions
Charl-0.039
 forward-0.036
use-0.036
ifying-0.032
sc-0.032
Token girls
Token ID4813
Feature activation+2.66e-01

Pos logit contributions
 sandwiches+0.013
 pige+0.013
 cakes+0.012
 pie+0.011
 balloons+0.011
Neg logit contributions
Charl-0.027
 forward-0.025
use-0.025
sc-0.023
ifying-0.023
Token playing
Token ID2712
Feature activation+7.11e-01

Pos logit contributions
 pige+0.010
 sandwiches+0.010
 cakes+0.009
 ducks+0.009
 balloons+0.009
Neg logit contributions
Charl-0.010
use-0.009
 forward-0.009
ifying-0.008
ppy-0.008
Token hop
Token ID1725
Feature activation-4.64e-03

Pos logit contributions
 pige+0.024
 sandwiches+0.023
 cakes+0.022
 ducks+0.021
 balloons+0.021
Neg logit contributions
Charl-0.028
use-0.026
 forward-0.025
ifying-0.024
ppy-0.023
Tokensc
Token ID1416
Feature activation-3.36e+00

Pos logit contributions
Charl+0.000
use+0.000
acy+0.000
issy+0.000
ifying+0.000
Neg logit contributions
 sandwiches-0.000
 pige-0.000
 ducks-0.000
 balloons-0.000
 swings-0.000
Tokenot
Token ID313
Feature activation+4.71e-01

Pos logit contributions
Charl+0.092
use+0.082
 forward+0.074
ifying+0.071
Ru+0.065
Neg logit contributions
 sandwiches-0.155
 pige-0.154
 cakes-0.145
 ducks-0.143
 balloons-0.141
Tokench
Token ID354
Feature activation+4.08e-01

Pos logit contributions
 sandwiches+0.020
 pige+0.020
 balloons+0.018
 ducks+0.018
 cakes+0.018
Neg logit contributions
Charl-0.014
use-0.013
 forward-0.012
ifying-0.012
acy-0.011
Token.
Token ID13
Feature activation+1.73e-01

Pos logit contributions
 pige+0.016
 sandwiches+0.016
 cakes+0.015
 ducks+0.015
 balloons+0.014
Neg logit contributions
Charl-0.013
 forward-0.012
use-0.012
sc-0.011
ppy-0.011
Token They
Token ID1119
Feature activation+3.88e-01

Pos logit contributions
 pige+0.006
 sandwiches+0.006
 cakes+0.005
 ducks+0.005
 balloons+0.005
Neg logit contributions
Charl-0.007
use-0.006
 forward-0.006
ppy-0.006
ifying-0.006
Token joined
Token ID5399
Feature activation+4.10e-01

Pos logit contributions
 pige+0.012
 sandwiches+0.012
 ducks+0.011
 balloons+0.011
 cakes+0.011
Neg logit contributions
Charl-0.016
use-0.015
 forward-0.014
ifying-0.014
och-0.013
Token stars
Token ID5788
Feature activation-1.61e+00

Pos logit contributions
Charl+0.012
 forward+0.011
use+0.011
ifying+0.010
ppy+0.010
Neg logit contributions
 pige-0.013
 sandwiches-0.013
 cakes-0.012
 ducks-0.012
 balloons-0.012
Token.
Token ID13
Feature activation+4.96e-01

Pos logit contributions
Charl+0.059
use+0.055
 forward+0.051
ifying+0.048
ppy+0.048
Neg logit contributions
 pige-0.058
 sandwiches-0.058
 cakes-0.054
 ducks-0.054
 balloons-0.053
Token He
Token ID679
Feature activation-2.48e-01

Pos logit contributions
 pige+0.024
 sandwiches+0.023
 cakes+0.021
 ducks+0.021
 balloons+0.021
Neg logit contributions
 forward-0.011
Charl-0.011
use-0.010
ifying-0.010
 Taylor-0.009
Token reached
Token ID4251
Feature activation-2.93e-01

Pos logit contributions
Charl+0.009
use+0.008
 forward+0.008
ifying+0.007
ppy+0.007
Neg logit contributions
 pige-0.010
 sandwiches-0.009
 cakes-0.009
 ducks-0.009
 balloons-0.009
Token his
Token ID465
Feature activation-1.09e+00

Pos logit contributions
Charl+0.011
use+0.010
ifying+0.009
och+0.009
ppy+0.008
Neg logit contributions
 sandwiches-0.011
 balloons-0.010
 ducks-0.010
 ice-0.010
 cakes-0.010
Token little
Token ID1310
Feature activation-3.32e+00

Pos logit contributions
Charl+0.049
use+0.046
 forward+0.042
ppy+0.042
och+0.042
Neg logit contributions
 sandwiches-0.033
 pige-0.030
 swings-0.029
 balloons-0.029
 ducks-0.029
Token hand
Token ID1021
Feature activation-2.40e-01

Pos logit contributions
Charl+0.145
use+0.132
 forward+0.125
ppy+0.120
ifying+0.120
Neg logit contributions
 pige-0.101
 sandwiches-0.099
 ducks-0.092
 balloons-0.092
 cakes-0.092
Token up
Token ID510
Feature activation-3.64e-01

Pos logit contributions
Charl+0.010
use+0.009
acy+0.008
issy+0.008
 forward+0.008
Neg logit contributions
 sandwiches-0.008
 pige-0.008
 ducks-0.007
 ice-0.007
 balloons-0.007
Token to
Token ID284
Feature activation+3.37e-01

Pos logit contributions
Charl+0.017
use+0.016
ppy+0.015
ifying+0.015
issy+0.015
Neg logit contributions
 sandwiches-0.010
 ice-0.010
 pige-0.009
 balloons-0.009
 Sue-0.009
Token touch
Token ID3638
Feature activation-4.63e-02

Pos logit contributions
 pige+0.010
 sandwiches+0.009
 ducks+0.009
 cakes+0.009
 balloons+0.009
Neg logit contributions
Charl-0.015
use-0.014
 forward-0.014
ifying-0.013
ppy-0.013
Token them
Token ID606
Feature activation-4.38e-01

Pos logit contributions
Charl+0.002
use+0.002
 forward+0.002
ppy+0.002
ifying+0.002
Neg logit contributions
 pige-0.001
 sandwiches-0.001
 ducks-0.001
 balloons-0.001
 cakes-0.001
Token play
Token ID711
Feature activation+1.64e-01

Pos logit contributions
 pige+0.011
 sandwiches+0.011
 cakes+0.010
 ducks+0.010
 balloons+0.010
Neg logit contributions
Charl-0.011
use-0.011
 forward-0.010
ifying-0.010
ppy-0.009
Token a
Token ID257
Feature activation-9.34e-02

Pos logit contributions
 pige+0.006
 sandwiches+0.006
 cakes+0.005
 balloons+0.005
 ducks+0.005
Neg logit contributions
Charl-0.006
 forward-0.005
use-0.005
sc-0.005
ppy-0.005
Token game
Token ID983
Feature activation-1.01e-01

Pos logit contributions
Charl+0.004
use+0.004
 forward+0.004
sc+0.004
ifying+0.004
Neg logit contributions
 pige-0.002
 sandwiches-0.002
 cakes-0.002
 ducks-0.002
 balloons-0.002
Token of
Token ID286
Feature activation+1.94e-01

Pos logit contributions
Charl+0.004
use+0.004
ifying+0.004
 forward+0.004
ppy+0.003
Neg logit contributions
 sandwiches-0.003
 pige-0.003
 cakes-0.003
 ducks-0.003
 balloons-0.003
Token hop
Token ID1725
Feature activation-4.18e-01

Pos logit contributions
 pige+0.006
 sandwiches+0.006
mm+0.005
 cakes+0.005
 balloons+0.005
Neg logit contributions
Charl-0.007
 forward-0.007
sc-0.007
use-0.007
Ru-0.006
Tokensc
Token ID1416
Feature activation-3.32e+00

Pos logit contributions
Charl+0.012
use+0.010
ifying+0.009
issy+0.009
och+0.009
Neg logit contributions
 sandwiches-0.019
 pige-0.018
 ducks-0.018
 balloons-0.018
 swings-0.018
Tokenot
Token ID313
Feature activation-2.74e-01

Pos logit contributions
Charl+0.091
use+0.081
 forward+0.071
ifying+0.069
Ru+0.064
Neg logit contributions
 sandwiches-0.153
 pige-0.150
 cakes-0.142
 ducks-0.141
 balloons-0.140
Tokench
Token ID354
Feature activation-1.11e-01

Pos logit contributions
Charl+0.010
use+0.010
ifying+0.009
och+0.008
ley+0.008
Neg logit contributions
 sandwiches-0.010
 pige-0.009
 balloons-0.009
 ducks-0.009
 ice-0.009
Token.
Token ID13
Feature activation+7.66e-01

Pos logit contributions
Charl+0.004
use+0.004
 forward+0.004
ifying+0.003
ppy+0.003
Neg logit contributions
 pige-0.004
 sandwiches-0.004
 cakes-0.004
 ducks-0.004
 balloons-0.004
Token Tim
Token ID5045
Feature activation-6.29e-01

Pos logit contributions
 pige+0.028
 sandwiches+0.028
 cakes+0.026
 balloons+0.025
 ducks+0.025
Neg logit contributions
Charl-0.027
use-0.025
 forward-0.024
sc-0.023
ifying-0.023
Token took
Token ID1718
Feature activation-9.06e-01

Pos logit contributions
Charl+0.025
use+0.023
ifying+0.023
 forward+0.023
och+0.021
Neg logit contributions
 pige-0.020
 sandwiches-0.020
 ducks-0.019
 swings-0.019
 cakes-0.018
Token.
Token ID13
Feature activation-8.43e-01

Pos logit contributions
 pige+0.028
 sandwiches+0.027
 cakes+0.025
 ducks+0.025
 swings+0.024
Neg logit contributions
Charl-0.025
 forward-0.025
use-0.024
ifying-0.022
ppy-0.021
Token You
Token ID921
Feature activation-2.86e-01

Pos logit contributions
Charl+0.033
use+0.032
 forward+0.031
ppy+0.029
ifying+0.028
Neg logit contributions
 pige-0.026
 sandwiches-0.026
 ducks-0.024
 cakes-0.024
 balloons-0.023
Token can
Token ID460
Feature activation-7.92e-01

Pos logit contributions
Charl+0.010
use+0.009
 forward+0.009
ifying+0.008
ppy+0.008
Neg logit contributions
 pige-0.011
 sandwiches-0.011
 cakes-0.010
 ducks-0.010
 balloons-0.010
Token play
Token ID711
Feature activation+1.41e-03

Pos logit contributions
Charl+0.034
 forward+0.031
use+0.030
sc+0.030
ifying+0.029
Neg logit contributions
 sandwiches-0.022
 swings-0.022
 ducks-0.022
 ice-0.022
 pige-0.022
Token hop
Token ID1725
Feature activation-5.55e-01

Pos logit contributions
 pige+0.000
 sandwiches+0.000
 cakes+0.000
 ducks+0.000
 balloons+0.000
Neg logit contributions
Charl-0.000
 forward-0.000
use-0.000
sc-0.000
ppy-0.000
Tokensc
Token ID1416
Feature activation-3.25e+00

Pos logit contributions
Charl+0.020
use+0.018
issy+0.016
ifying+0.016
acy+0.016
Neg logit contributions
 sandwiches-0.021
 pige-0.021
 ducks-0.019
 balloons-0.019
 swings-0.019
Tokenot
Token ID313
Feature activation-1.64e-01

Pos logit contributions
Charl+0.086
use+0.074
 forward+0.069
ifying+0.066
Ru+0.060
Neg logit contributions
 sandwiches-0.155
 pige-0.152
 cakes-0.144
 ducks-0.142
 balloons-0.141
Tokench
Token ID354
Feature activation-3.07e-01

Pos logit contributions
Charl+0.006
use+0.005
ifying+0.005
 forward+0.005
acy+0.005
Neg logit contributions
 sandwiches-0.006
 pige-0.006
 balloons-0.006
 ducks-0.006
 cakes-0.005
Token again
Token ID757
Feature activation+6.57e-02

Pos logit contributions
Charl+0.012
use+0.011
 forward+0.010
ifying+0.010
ppy+0.010
Neg logit contributions
 pige-0.011
 sandwiches-0.011
 cakes-0.010
 ducks-0.010
 balloons-0.009
Token.
Token ID13
Feature activation-9.16e-01

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 ducks+0.001
 cakes+0.001
 balloons+0.001
Neg logit contributions
Charl-0.003
use-0.003
 forward-0.003
ifying-0.003
issy-0.003
Token But
Token ID887
Feature activation-2.79e-01

Pos logit contributions
Charl+0.033
use+0.032
 forward+0.030
ifying+0.028
ley+0.028
Neg logit contributions
 pige-0.032
 sandwiches-0.031
 ducks-0.030
 cakes-0.029
 swings-0.029
Token!
Token ID0
Feature activation-1.80e-01

Pos logit contributions
Charl+0.011
use+0.010
 forward+0.009
ifying+0.009
ppy+0.008
Neg logit contributions
 pige-0.012
 sandwiches-0.012
 cakes-0.011
 ducks-0.011
 balloons-0.010
Token Tim
Token ID5045
Feature activation-4.99e-01

Pos logit contributions
Charl+0.006
use+0.006
 forward+0.006
ifying+0.005
Ru+0.005
Neg logit contributions
 pige-0.007
 sandwiches-0.007
 cakes-0.006
 balloons-0.006
 ducks-0.006
Tokenmy
Token ID1820
Feature activation-1.60e+00

Pos logit contributions
Charl+0.013
use+0.012
 forward+0.011
ifying+0.011
sc+0.009
Neg logit contributions
 sandwiches-0.023
 pige-0.023
 cakes-0.022
 ducks-0.021
 swings-0.021
Token cried
Token ID16896
Feature activation-1.64e+00

Pos logit contributions
Charl+0.054
use+0.050
 forward+0.048
ifying+0.045
ppy+0.044
Neg logit contributions
 pige-0.063
 sandwiches-0.061
 cakes-0.058
 ducks-0.057
 balloons-0.056
Token a
Token ID257
Feature activation-8.21e-01

Pos logit contributions
use+0.072
Charl+0.071
ifying+0.063
acy+0.062
och+0.062
Neg logit contributions
 sandwiches-0.049
 ice-0.048
 cakes-0.042
 pie-0.042
 ducks-0.042
Token little
Token ID1310
Feature activation-3.19e+00

Pos logit contributions
use+0.038
Charl+0.037
acy+0.036
well+0.036
ifying+0.036
Neg logit contributions
 ice-0.018
 sandwiches-0.017
 pige-0.016
 pie-0.016
 ducks-0.015
Token bit
Token ID1643
Feature activation-1.20e+00

Pos logit contributions
Charl+0.127
use+0.111
acy+0.109
ifying+0.109
issy+0.105
Neg logit contributions
 pige-0.112
 sandwiches-0.108
 ducks-0.107
 ice-0.104
 pie-0.102
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
Charl+0.055
ifying+0.052
use+0.050
acy+0.049
issy+0.047
Neg logit contributions
 ducks-0.032
 ice-0.032
 pige-0.031
 Sue-0.031
 sandwiches-0.031
Token but
Token ID475
Feature activation-7.97e-01

Pos logit contributions
Charl+0.035
use+0.032
 forward+0.031
ifying+0.030
ppy+0.028
Neg logit contributions
 pige-0.040
 sandwiches-0.039
 cakes-0.037
 ducks-0.037
 balloons-0.036
Token then
Token ID788
Feature activation-1.58e+00

Pos logit contributions
Charl+0.034
 forward+0.032
use+0.031
ifying+0.030
ppy+0.029
Neg logit contributions
 pige-0.025
 sandwiches-0.023
 cakes-0.022
 balloons-0.021
 ducks-0.021
Token he
Token ID339
Feature activation-6.75e-01

Pos logit contributions
Charl+0.056
 forward+0.051
use+0.051
 Taylor+0.047
ifying+0.046
Neg logit contributions
 pige-0.060
 sandwiches-0.056
 cakes-0.052
 ducks-0.052
 balloons-0.051
Token also
Token ID635
Feature activation+4.24e-01

Pos logit contributions
 pige+0.014
 sandwiches+0.014
 cakes+0.013
 swings+0.012
 flavors+0.012
Neg logit contributions
 forward-0.026
Charl-0.025
use-0.025
ppy-0.024
bs-0.023
Token played
Token ID2826
Feature activation+2.22e-01

Pos logit contributions
 pige+0.016
 sandwiches+0.016
 ducks+0.015
 cakes+0.015
 balloons+0.014
Neg logit contributions
Charl-0.015
use-0.014
 forward-0.013
ifying-0.013
ppy-0.012
Token tag
Token ID7621
Feature activation+3.45e-01

Pos logit contributions
 pige+0.008
 sandwiches+0.008
 cakes+0.007
 balloons+0.007
 ducks+0.007
Neg logit contributions
Charl-0.007
 forward-0.007
use-0.006
sc-0.006
ppy-0.006
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
 sandwiches+0.013
 pige+0.013
 cakes+0.012
 ducks+0.012
 balloons+0.012
Neg logit contributions
Charl-0.013
use-0.012
 forward-0.011
ifying-0.011
ppy-0.010
Token hop
Token ID1725
Feature activation-3.89e-02

Pos logit contributions
 pige+0.044
 sandwiches+0.043
 cakes+0.040
 ducks+0.039
 balloons+0.039
Neg logit contributions
Charl-0.047
use-0.044
 forward-0.043
ifying-0.040
ppy-0.040
Tokensc
Token ID1416
Feature activation-3.19e+00

Pos logit contributions
Charl+0.001
use+0.001
ifying+0.001
 forward+0.001
acy+0.001
Neg logit contributions
 sandwiches-0.002
 pige-0.001
 ducks-0.001
 balloons-0.001
 cakes-0.001
Tokenot
Token ID313
Feature activation+2.04e-01

Pos logit contributions
Charl+0.079
use+0.070
 forward+0.064
ifying+0.059
ppy+0.054
Neg logit contributions
 sandwiches-0.155
 pige-0.154
 cakes-0.147
 ducks-0.144
 balloons-0.143
Tokench
Token ID354
Feature activation+1.27e-01

Pos logit contributions
 sandwiches+0.008
 pige+0.008
 balloons+0.008
 cakes+0.007
 ducks+0.007
Neg logit contributions
Charl-0.007
use-0.006
 forward-0.006
ifying-0.006
Ru-0.005
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
 pige+0.005
 sandwiches+0.005
 cakes+0.005
 ducks+0.004
 balloons+0.004
Neg logit contributions
Charl-0.004
 forward-0.004
use-0.004
ppy-0.004
sc-0.003
Token They
Token ID1119
Feature activation+3.68e-01

Pos logit contributions
 pige+0.007
 sandwiches+0.006
 ducks+0.006
 Sue+0.006
 cakes+0.006
Neg logit contributions
Charl-0.011
use-0.011
 forward-0.010
ppy-0.010
well-0.010
Token laughed
Token ID13818
Feature activation-1.77e-01

Pos logit contributions
 sandwiches+0.012
 pige+0.011
 ducks+0.011
 cakes+0.011
 balloons+0.011
Neg logit contributions
Charl-0.015
use-0.014
 forward-0.014
ifying-0.013
arl-0.013
Token it
Token ID340
Feature activation-4.10e-01

Pos logit contributions
Charl+0.006
use+0.005
ifying+0.005
ppy+0.005
well+0.005
Neg logit contributions
 sandwiches-0.001
 pige-0.001
 ice-0.001
 ducks-0.001
 cakes-0.001
Token,
Token ID11
Feature activation-4.79e-01

Pos logit contributions
Charl+0.020
use+0.018
ifying+0.016
ppy+0.015
Ru+0.015
Neg logit contributions
 sandwiches-0.011
 Sue-0.011
 ice-0.010
 swings-0.010
 cakes-0.010
Token a
Token ID257
Feature activation-3.92e-01

Pos logit contributions
Charl+0.023
use+0.022
 forward+0.021
ppy+0.020
ifying+0.020
Neg logit contributions
 pige-0.012
 sandwiches-0.012
 ducks-0.011
 cakes-0.011
 balloons-0.011
Token big
Token ID1263
Feature activation-3.74e-01

Pos logit contributions
Charl+0.016
use+0.015
 forward+0.015
ppy+0.014
ifying+0.014
Neg logit contributions
 pige-0.012
 sandwiches-0.012
 ducks-0.011
 cakes-0.011
 balloons-0.011
Token gust
Token ID35253
Feature activation-4.28e-01

Pos logit contributions
Charl+0.016
use+0.015
 forward+0.014
ifying+0.013
ppy+0.013
Neg logit contributions
 pige-0.012
 sandwiches-0.011
 cakes-0.011
 ducks-0.010
 balloons-0.010
Token of
Token ID286
Feature activation-3.19e+00

Pos logit contributions
Charl+0.023
use+0.021
 Taylor+0.019
 Piper+0.019
 Melanie+0.019
Neg logit contributions
 pige-0.009
 ducks-0.009
 ice-0.008
 monkeys-0.008
 sandwiches-0.008
Token wind
Token ID2344
Feature activation-8.03e-01

Pos logit contributions
Charl+0.167
use+0.157
ifying+0.148
 forward+0.147
ppy+0.146
Neg logit contributions
 pige-0.068
 sandwiches-0.066
 ducks-0.063
 swings-0.059
 ice-0.059
Token came
Token ID1625
Feature activation-8.30e-03

Pos logit contributions
Charl+0.030
use+0.027
 forward+0.026
ifying+0.024
ppy+0.024
Neg logit contributions
 pige-0.029
 sandwiches-0.029
 ducks-0.027
 cakes-0.027
 balloons-0.027
Token by
Token ID416
Feature activation-1.03e+00

Pos logit contributions
Charl+0.000
use+0.000
acy+0.000
och+0.000
ifying+0.000
Neg logit contributions
 ducks-0.000
 sandwiches-0.000
 pige-0.000
 ice-0.000
 cakes-0.000
Token and
Token ID290
Feature activation+3.95e-01

Pos logit contributions
Charl+0.039
use+0.036
 forward+0.033
ifying+0.032
ppy+0.032
Neg logit contributions
 pige-0.037
 sandwiches-0.037
 ducks-0.034
 cakes-0.034
 balloons-0.033
Token blew
Token ID17948
Feature activation-1.51e-01

Pos logit contributions
 pige+0.013
 sandwiches+0.013
 ducks+0.012
 cakes+0.012
 balloons+0.012
Neg logit contributions
Charl-0.016
use-0.015
 forward-0.014
ppy-0.014
acy-0.014
Token,
Token ID11
Feature activation-2.70e-01

Pos logit contributions
Charl+0.057
use+0.053
 forward+0.053
ifying+0.049
ppy+0.049
Neg logit contributions
 pige-0.042
 sandwiches-0.041
 cakes-0.038
 ducks-0.036
 balloons-0.036
Token with
Token ID351
Feature activation-1.41e+00

Pos logit contributions
Charl+0.013
use+0.013
 forward+0.012
ifying+0.012
ppy+0.012
Neg logit contributions
 pige-0.006
 sandwiches-0.006
 cakes-0.006
 ducks-0.006
 balloons-0.005
Token a
Token ID257
Feature activation-3.63e-01

Pos logit contributions
use+0.070
Charl+0.070
ppy+0.063
sc+0.063
och+0.062
Neg logit contributions
 sandwiches-0.033
 pige-0.031
 ducks-0.029
 balloons-0.029
 ice-0.029
Token sudden
Token ID4802
Feature activation-5.59e-01

Pos logit contributions
Charl+0.015
use+0.014
ppy+0.013
 forward+0.013
ley+0.013
Neg logit contributions
 pige-0.011
 sandwiches-0.011
 ducks-0.010
 cakes-0.010
 pie-0.010
Token gust
Token ID35253
Feature activation-2.06e-01

Pos logit contributions
Charl+0.021
use+0.019
 forward+0.018
ifying+0.018
ppy+0.017
Neg logit contributions
 pige-0.020
 sandwiches-0.020
 cakes-0.018
 ducks-0.018
 balloons-0.018
Token of
Token ID286
Feature activation-3.11e+00

Pos logit contributions
Charl+0.011
use+0.010
 Taylor+0.009
ley+0.009
ifying+0.009
Neg logit contributions
 pige-0.004
 ducks-0.004
 monkeys-0.004
 ice-0.004
 lions-0.004
Token wind
Token ID2344
Feature activation-6.02e-01

Pos logit contributions
Charl+0.154
use+0.144
 forward+0.139
ifying+0.137
ppy+0.134
Neg logit contributions
 pige-0.076
 sandwiches-0.075
 ducks-0.069
 balloons-0.067
 cakes-0.066
Token,
Token ID11
Feature activation-6.90e-01

Pos logit contributions
Charl+0.022
use+0.020
 forward+0.019
ifying+0.018
Ru+0.018
Neg logit contributions
 pige-0.022
 sandwiches-0.021
 ducks-0.020
 balloons-0.020
 cakes-0.020
Token the
Token ID262
Feature activation-1.06e+00

Pos logit contributions
Charl+0.036
use+0.035
 forward+0.033
ifying+0.032
ppy+0.032
Neg logit contributions
 pige-0.015
 sandwiches-0.014
 ducks-0.013
 cakes-0.012
 balloons-0.012
Token singing
Token ID13777
Feature activation-6.16e-01

Pos logit contributions
Charl+0.042
use+0.040
 forward+0.039
ifying+0.037
ppy+0.036
Neg logit contributions
 sandwiches-0.033
 pige-0.032
 cakes-0.031
 balloons-0.028
 ducks-0.027
Token rock
Token ID3881
Feature activation-4.51e-01

Pos logit contributions
Charl+0.023
use+0.021
 forward+0.020
ppy+0.018
ifying+0.018
Neg logit contributions
 pige-0.022
 sandwiches-0.022
 ducks-0.021
 cakes-0.020
 balloons-0.020
Token felt
Token ID2936
Feature activation-1.37e+00

Pos logit contributions
Charl+0.046
use+0.041
 forward+0.040
ppy+0.038
well+0.036
Neg logit contributions
 pige-0.064
 sandwiches-0.063
 cakes-0.061
 balloons-0.059
 ducks-0.058
Token sad
Token ID6507
Feature activation-1.60e+00

Pos logit contributions
Charl+0.060
use+0.059
issy+0.052
oris+0.051
ley+0.051
Neg logit contributions
 pige-0.041
 sandwiches-0.041
 ducks-0.038
 ice-0.038
 balloons-0.036
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
Charl+0.075
use+0.072
ifying+0.067
Ru+0.062
acy+0.061
Neg logit contributions
 sandwiches-0.046
 ice-0.042
 pige-0.042
 ducks-0.039
 cakes-0.038
Token cried
Token ID16896
Feature activation-1.71e+00

Pos logit contributions
Charl+0.032
use+0.030
 forward+0.029
ppy+0.026
ifying+0.026
Neg logit contributions
 pige-0.042
 sandwiches-0.041
 cakes-0.039
 balloons-0.038
 ducks-0.038
Token a
Token ID257
Feature activation-6.41e-01

Pos logit contributions
Charl+0.075
use+0.074
ifying+0.063
och+0.062
Ru+0.061
Neg logit contributions
 sandwiches-0.053
 ice-0.048
 cakes-0.048
 pige-0.047
 ducks-0.046
Token little
Token ID1310
Feature activation-3.11e+00

Pos logit contributions
use+0.026
Charl+0.025
ifying+0.025
acy+0.025
 forward+0.025
Neg logit contributions
 ice-0.016
 pige-0.016
 sandwiches-0.016
 pie-0.015
 cake-0.015
Token bit
Token ID1643
Feature activation-1.02e+00

Pos logit contributions
Charl+0.127
use+0.113
ifying+0.108
acy+0.105
issy+0.104
Neg logit contributions
 pige-0.104
 sandwiches-0.101
 ducks-0.097
 ice-0.093
 cakes-0.093
Token.
Token ID13
Feature activation-5.44e-01

Pos logit contributions
Charl+0.048
use+0.043
ifying+0.043
acy+0.041
issy+0.040
Neg logit contributions
 sandwiches-0.026
 ducks-0.026
 pige-0.026
 ice-0.025
 Sue-0.023
Token
Token ID220
Feature activation+5.82e-02

Pos logit contributions
Charl+0.015
use+0.013
 forward+0.012
 Taylor+0.011
Ru+0.011
Neg logit contributions
 pige-0.026
 sandwiches-0.025
 balloons-0.023
 cakes-0.023
 ducks-0.023
Token
Token ID198
Feature activation-6.99e-01

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 cakes+0.002
 ducks+0.002
 balloons+0.002
Neg logit contributions
Charl-0.002
use-0.002
 forward-0.002
ifying-0.002
ppy-0.002
Token
Token ID198
Feature activation-7.65e-01

Pos logit contributions
Charl+0.024
use+0.022
 forward+0.022
ifying+0.020
ppy+0.020
Neg logit contributions
 pige-0.027
 sandwiches-0.026
 cakes-0.025
 ducks-0.024
 balloons-0.024
Token
Token ID198
Feature activation-6.98e-01

Pos logit contributions
Charl+0.012
use+0.011
 forward+0.011
ifying+0.010
ppy+0.010
Neg logit contributions
 pige-0.014
 sandwiches-0.014
 cakes-0.013
 ducks-0.013
 balloons-0.013
TokenSuddenly
Token ID38582
Feature activation-1.03e+00

Pos logit contributions
Charl+0.021
use+0.020
 forward+0.019
ifying+0.018
ppy+0.017
Neg logit contributions
 pige-0.029
 sandwiches-0.029
 cakes-0.027
 ducks-0.027
 balloons-0.027
Token,
Token ID11
Feature activation-4.27e-01

Pos logit contributions
Charl+0.028
 forward+0.026
use+0.026
ppy+0.023
ifying+0.022
Neg logit contributions
 pige-0.046
 sandwiches-0.044
 cakes-0.043
 ducks-0.041
 balloons-0.041
Token a
Token ID257
Feature activation-3.99e-01

Pos logit contributions
Charl+0.017
 forward+0.016
use+0.015
ifying+0.015
ppy+0.014
Neg logit contributions
 pige-0.014
 sandwiches-0.013
 cakes-0.013
 ducks-0.012
 balloons-0.012
Token gust
Token ID35253
Feature activation-4.44e-01

Pos logit contributions
Charl+0.016
use+0.015
 forward+0.015
ifying+0.014
ppy+0.014
Neg logit contributions
 pige-0.013
 sandwiches-0.013
 cakes-0.012
 ducks-0.012
 balloons-0.011
Token of
Token ID286
Feature activation-3.08e+00

Pos logit contributions
Charl+0.021
use+0.020
ifying+0.018
 forward+0.018
 Taylor+0.018
Neg logit contributions
 pige-0.012
 ducks-0.011
 sandwiches-0.010
 monkeys-0.010
 ice-0.010
Token strong
Token ID1913
Feature activation+5.03e-01

Pos logit contributions
Charl+0.157
use+0.146
ifying+0.141
 forward+0.139
ppy+0.135
Neg logit contributions
 pige-0.071
 sandwiches-0.070
 ducks-0.066
 cakes-0.063
 balloons-0.062
Token wind
Token ID2344
Feature activation-8.49e-01

Pos logit contributions
 pige+0.016
 sandwiches+0.016
 cakes+0.015
 ducks+0.014
 balloons+0.014
Neg logit contributions
Charl-0.020
use-0.019
 forward-0.019
ifying-0.017
ppy-0.017
Token blew
Token ID17948
Feature activation-4.68e-01

Pos logit contributions
Charl+0.030
use+0.027
 forward+0.026
ifying+0.024
ppy+0.024
Neg logit contributions
 pige-0.032
 sandwiches-0.032
 ducks-0.030
 cakes-0.030
 balloons-0.029
Token and
Token ID290
Feature activation+2.20e-01

Pos logit contributions
Charl+0.019
use+0.018
ifying+0.015
acy+0.015
och+0.015
Neg logit contributions
 sandwiches-0.016
 pige-0.015
 ducks-0.015
 ice-0.014
 balloons-0.014
Token the
Token ID262
Feature activation-4.67e-01

Pos logit contributions
 sandwiches+0.006
 pige+0.006
 ducks+0.005
 cakes+0.005
 balloons+0.005
Neg logit contributions
Charl-0.010
use-0.010
 forward-0.009
ifying-0.009
ppy-0.009
Token Tom
Token ID4186
Feature activation+3.98e-01

Pos logit contributions
Charl+0.014
use+0.013
 forward+0.013
Ru+0.012
 Taylor+0.012
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 cakes-0.007
 balloons-0.007
 flavors-0.007
Token says
Token ID1139
Feature activation+9.67e-02

Pos logit contributions
 pige+0.011
 sandwiches+0.011
 cakes+0.010
 ducks+0.010
 balloons+0.009
Neg logit contributions
Charl-0.018
use-0.017
 forward-0.017
ifying-0.016
ppy-0.016
Token.
Token ID13
Feature activation+5.54e-01

Pos logit contributions
 sandwiches+0.003
 pige+0.003
 cakes+0.003
 balloons+0.003
 ducks+0.003
Neg logit contributions
Charl-0.004
use-0.003
 forward-0.003
ifying-0.003
ppy-0.003
Token
Token ID198
Feature activation+4.24e-01

Pos logit contributions
 pige+0.019
 sandwiches+0.019
 ducks+0.018
 cakes+0.017
 balloons+0.017
Neg logit contributions
Charl-0.021
use-0.020
 forward-0.019
ifying-0.019
ppy-0.018
Token
Token ID198
Feature activation+3.10e-01

Pos logit contributions
 pige+0.017
 sandwiches+0.017
 cakes+0.016
 balloons+0.016
 ducks+0.016
Neg logit contributions
Charl-0.014
use-0.012
 forward-0.012
ifying-0.011
ppy-0.011
TokenS
Token ID50
Feature activation+1.79e+00

Pos logit contributions
 pige+0.012
 sandwiches+0.012
 cakes+0.011
 balloons+0.011
 ducks+0.011
Neg logit contributions
Charl-0.010
use-0.010
 forward-0.009
ifying-0.009
ppy-0.008
Tokenue
Token ID518
Feature activation+7.39e-01

Pos logit contributions
 pige+0.051
 sandwiches+0.046
 ducks+0.044
 balloons+0.043
 cakes+0.042
Neg logit contributions
use-0.077
Charl-0.072
 forward-0.068
arl-0.066
issy-0.066
Token rub
Token ID6437
Feature activation+9.14e-03

Pos logit contributions
 sandwiches+0.025
 pige+0.024
 cakes+0.023
 ducks+0.023
 swings+0.023
Neg logit contributions
Charl-0.029
use-0.027
 forward-0.026
ifying-0.026
ppy-0.024
Tokens
Token ID82
Feature activation-2.68e-01

Pos logit contributions
 sandwiches+0.000
 pige+0.000
 ducks+0.000
 balloons+0.000
 cakes+0.000
Neg logit contributions
Charl-0.000
use-0.000
 forward-0.000
ifying-0.000
ppy-0.000
Token the
Token ID262
Feature activation-7.16e-01

Pos logit contributions
Charl+0.013
use+0.013
issy+0.012
ifying+0.012
och+0.011
Neg logit contributions
 sandwiches-0.007
 ice-0.006
 balloons-0.006
 cakes-0.006
 ducks-0.006
Token butter
Token ID9215
Feature activation+6.96e-01

Pos logit contributions
Charl+0.035
use+0.033
 forward+0.032
ifying+0.031
ppy+0.030
Neg logit contributions
 sandwiches-0.018
 pige-0.018
 cakes-0.016
 ducks-0.016
 balloons-0.016
Token said
Token ID531
Feature activation-4.20e-01

Pos logit contributions
Charl+0.016
use+0.015
 forward+0.014
ppy+0.014
sc+0.013
Neg logit contributions
 pige-0.018
 sandwiches-0.018
 cakes-0.017
 balloons-0.017
 ducks-0.017
Token,
Token ID11
Feature activation-1.19e-01

Pos logit contributions
Charl+0.013
use+0.012
 forward+0.012
ppy+0.010
ifying+0.010
Neg logit contributions
 pige-0.018
 sandwiches-0.017
 cakes-0.016
 ducks-0.016
 balloons-0.016
Token "
Token ID366
Feature activation-6.67e-01

Pos logit contributions
Charl+0.006
use+0.006
 forward+0.005
ifying+0.005
ppy+0.005
Neg logit contributions
 pige-0.003
 sandwiches-0.003
 cakes-0.003
 ducks-0.002
 balloons-0.002
TokenI
Token ID40
Feature activation-2.02e-01

Pos logit contributions
Charl+0.023
use+0.021
 forward+0.020
ifying+0.019
ppy+0.019
Neg logit contributions
 pige-0.026
 sandwiches-0.025
 ducks-0.024
 cakes-0.024
 balloons-0.023
Token am
Token ID716
Feature activation+4.19e-01

Pos logit contributions
Charl+0.008
ifying+0.007
 forward+0.007
uster+0.007
ky+0.007
Neg logit contributions
 pige-0.007
 ducks-0.007
 sandwiches-0.007
 Sue-0.006
 ice-0.006
Token not
Token ID407
Feature activation+1.86e+00

Pos logit contributions
 sandwiches+0.016
 pige+0.016
 cakes+0.015
 balloons+0.015
 ducks+0.015
Neg logit contributions
Charl-0.014
 forward-0.013
use-0.012
Ru-0.011
ppy-0.011
Token rude
Token ID22066
Feature activation+5.97e-01

Pos logit contributions
 sandwiches+0.077
 pige+0.072
 cakes+0.070
 balloons+0.070
 swings+0.069
Neg logit contributions
Charl-0.055
 forward-0.053
use-0.051
ppy-0.049
Ru-0.044
Token,
Token ID11
Feature activation+5.75e-01

Pos logit contributions
 pige+0.023
 sandwiches+0.022
 cakes+0.021
 ducks+0.021
 balloons+0.020
Neg logit contributions
Charl-0.021
use-0.020
 forward-0.019
ifying-0.018
ppy-0.017
Token I
Token ID314
Feature activation+2.64e-01

Pos logit contributions
 pige+0.016
 sandwiches+0.016
 cakes+0.015
 balloons+0.014
 ducks+0.014
Neg logit contributions
Charl-0.025
use-0.023
 forward-0.023
ppy-0.022
ifying-0.021
Token am
Token ID716
Feature activation+5.75e-01

Pos logit contributions
 pige+0.008
 ducks+0.008
 sandwiches+0.007
 swings+0.007
 ice+0.007
Neg logit contributions
Charl-0.012
ifying-0.010
uster-0.010
 forward-0.010
ley-0.010
Token telling
Token ID5149
Feature activation+5.23e-02

Pos logit contributions
 pige+0.020
 sandwiches+0.020
 cakes+0.018
 balloons+0.018
 ducks+0.017
Neg logit contributions
Charl-0.022
 forward-0.020
use-0.020
ppy-0.018
ifying-0.018
Token together
Token ID1978
Feature activation+6.47e-01

Pos logit contributions
Charl+0.021
use+0.019
ppy+0.017
 forward+0.017
ifying+0.017
Neg logit contributions
 pige-0.017
 sandwiches-0.017
 cakes-0.016
 balloons-0.016
 ducks-0.016
Token.
Token ID13
Feature activation+7.45e-01

Pos logit contributions
 pige+0.022
 sandwiches+0.022
 cakes+0.021
 ducks+0.021
 balloons+0.021
Neg logit contributions
Charl-0.025
use-0.023
 forward-0.021
ifying-0.021
ppy-0.020
Token
Token ID198
Feature activation+9.84e-01

Pos logit contributions
 pige+0.028
 sandwiches+0.028
 cakes+0.026
 ducks+0.026
 balloons+0.025
Neg logit contributions
Charl-0.026
use-0.024
 forward-0.023
ifying-0.022
ppy-0.021
Token
Token ID198
Feature activation+8.00e-01

Pos logit contributions
 pige+0.036
 sandwiches+0.035
 cakes+0.034
 balloons+0.033
 ducks+0.033
Neg logit contributions
Charl-0.036
 forward-0.032
use-0.031
Ru-0.028
ppy-0.028
TokenThe
Token ID464
Feature activation+1.60e-01

Pos logit contributions
 pige+0.032
 sandwiches+0.032
 cakes+0.030
 ducks+0.029
 balloons+0.029
Neg logit contributions
Charl-0.026
use-0.024
 forward-0.023
ifying-0.022
ppy-0.021
Token silly
Token ID14397
Feature activation+1.83e+00

Pos logit contributions
 pige+0.006
 sandwiches+0.006
 cakes+0.006
 ducks+0.005
 balloons+0.005
Neg logit contributions
Charl-0.006
use-0.005
 forward-0.005
ifying-0.005
ppy-0.005
Token refrigerator
Token ID30500
Feature activation+4.40e-01

Pos logit contributions
 pige+0.066
 sandwiches+0.063
 ducks+0.058
 cakes+0.058
 balloons+0.057
Neg logit contributions
Charl-0.069
use-0.064
 forward-0.062
ifying-0.057
ppy-0.056
Token and
Token ID290
Feature activation+1.14e+00

Pos logit contributions
 pige+0.015
 sandwiches+0.015
 cakes+0.014
 balloons+0.013
 ducks+0.013
Neg logit contributions
Charl-0.018
use-0.016
 forward-0.015
ifying-0.015
ppy-0.015
Token the
Token ID262
Feature activation+2.90e-01

Pos logit contributions
 pige+0.033
 sandwiches+0.032
 cakes+0.030
 ducks+0.028
 balloons+0.028
Neg logit contributions
Charl-0.048
 forward-0.045
use-0.044
ifying-0.042
ppy-0.041
Token little
Token ID1310
Feature activation+5.20e-01

Pos logit contributions
 pige+0.009
 sandwiches+0.009
 cakes+0.008
 ducks+0.008
 balloons+0.008
Neg logit contributions
Charl-0.012
use-0.012
 forward-0.011
ifying-0.011
ppy-0.011
Token girl
Token ID2576
Feature activation-3.61e-01

Pos logit contributions
 pige+0.020
 sandwiches+0.019
 cakes+0.018
 ducks+0.018
 balloons+0.017
Neg logit contributions
Charl-0.018
use-0.017
 forward-0.016
ifying-0.015
ppy-0.015
Token was
Token ID373
Feature activation+1.34e-01

Pos logit contributions
Charl+0.002
use+0.002
och+0.002
 forward+0.002
ifying+0.002
Neg logit contributions
 ducks-0.002
 sandwiches-0.002
 swings-0.002
 pige-0.002
 balloons-0.002
Token glad
Token ID9675
Feature activation-1.93e-01

Pos logit contributions
 pige+0.005
 sandwiches+0.005
 cakes+0.004
 ducks+0.004
 balloons+0.004
Neg logit contributions
Charl-0.005
use-0.005
 forward-0.005
ifying-0.004
ppy-0.004
Token that
Token ID326
Feature activation+6.40e-01

Pos logit contributions
Charl+0.007
use+0.006
 forward+0.006
ifying+0.006
ppy+0.006
Neg logit contributions
 pige-0.007
 sandwiches-0.007
 cakes-0.007
 ducks-0.007
 balloons-0.006
Token she
Token ID673
Feature activation+5.11e-01

Pos logit contributions
 pige+0.024
 sandwiches+0.024
 cakes+0.023
 ducks+0.022
 balloons+0.022
Neg logit contributions
Charl-0.022
use-0.021
 forward-0.019
ifying-0.019
ppy-0.018
Token was
Token ID373
Feature activation+2.67e-01

Pos logit contributions
 pige+0.016
 sandwiches+0.016
 ducks+0.015
 balloons+0.015
 swings+0.015
Neg logit contributions
Charl-0.020
use-0.019
 forward-0.018
ifying-0.018
arl-0.017
Token able
Token ID1498
Feature activation+2.28e+00

Pos logit contributions
 pige+0.010
 sandwiches+0.010
 cakes+0.009
 ducks+0.009
 balloons+0.009
Neg logit contributions
Charl-0.009
 forward-0.009
use-0.008
ppy-0.008
ifying-0.008
Token to
Token ID284
Feature activation-7.00e-01

Pos logit contributions
 pige+0.092
 sandwiches+0.091
 cakes+0.087
 ducks+0.084
 swings+0.083
Neg logit contributions
Charl-0.071
 forward-0.066
use-0.065
ifying-0.058
ppy-0.056
Token find
Token ID1064
Feature activation-4.85e-01

Pos logit contributions
Charl+0.031
use+0.028
ifying+0.027
 forward+0.027
och+0.027
Neg logit contributions
 sandwiches-0.020
 ducks-0.020
 pige-0.019
 swings-0.019
 ice-0.018
Token swimming
Token ID14899
Feature activation-1.04e+00

Pos logit contributions
Charl+0.022
use+0.022
ifying+0.020
oping+0.020
well+0.019
Neg logit contributions
 sandwiches-0.015
 pige-0.013
 ducks-0.013
 balloons-0.013
 cakes-0.013
Token goggles
Token ID42762
Feature activation-1.02e+00

Pos logit contributions
Charl+0.042
use+0.038
issy+0.036
ifying+0.035
och+0.035
Neg logit contributions
 sandwiches-0.036
 pige-0.034
 ducks-0.033
 balloons-0.031
 swings-0.031
Token that
Token ID326
Feature activation+3.28e-01

Pos logit contributions
Charl+0.044
use+0.041
issy+0.037
Ru+0.037
Case+0.036
Neg logit contributions
 sandwiches-0.034
 balloons-0.030
 pige-0.030
 ducks-0.029
 ice-0.029
Token Let
Token ID3914
Feature activation+6.94e-02

Pos logit contributions
 pige+0.005
 sandwiches+0.005
 cakes+0.004
 ducks+0.004
 balloons+0.004
Neg logit contributions
Charl-0.006
use-0.006
 forward-0.005
ppy-0.005
ifying-0.005
Token's
Token ID338
Feature activation-2.25e-01

Pos logit contributions
 pige+0.003
 sandwiches+0.003
 cakes+0.003
 balloons+0.002
 ducks+0.002
Neg logit contributions
Charl-0.002
use-0.002
 forward-0.002
ppy-0.002
ley-0.002
Token use
Token ID779
Feature activation-6.77e-01

Pos logit contributions
Charl+0.009
use+0.008
 forward+0.007
ppy+0.007
ifying+0.007
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 cakes-0.007
 ducks-0.007
 balloons-0.007
Token our
Token ID674
Feature activation-4.88e-01

Pos logit contributions
Charl+0.028
use+0.026
 forward+0.026
ifying+0.025
sc+0.023
Neg logit contributions
 pige-0.021
 sandwiches-0.018
 cakes-0.018
 ducks-0.017
 swings-0.017
Token grocery
Token ID16918
Feature activation+8.48e-01

Pos logit contributions
Charl+0.021
use+0.020
 forward+0.019
ifying+0.018
ppy+0.017
Neg logit contributions
 pige-0.014
 sandwiches-0.014
 cakes-0.013
 ducks-0.013
 balloons-0.012
Token list
Token ID1351
Feature activation+1.74e+00

Pos logit contributions
 pige+0.028
 sandwiches+0.028
 cakes+0.026
 ducks+0.025
 balloons+0.025
Neg logit contributions
Charl-0.034
use-0.031
 forward-0.030
ifying-0.029
ppy-0.028
Token to
Token ID284
Feature activation+3.04e-01

Pos logit contributions
 pige+0.059
 sandwiches+0.059
 cakes+0.055
 ducks+0.053
 balloons+0.053
Neg logit contributions
Charl-0.068
use-0.063
 forward-0.060
ifying-0.057
ppy-0.056
Token find
Token ID1064
Feature activation+5.93e-01

Pos logit contributions
 pige+0.012
 sandwiches+0.012
 cakes+0.011
 ducks+0.011
 balloons+0.011
Neg logit contributions
Charl-0.010
use-0.009
 forward-0.009
ifying-0.008
ppy-0.008
Token what
Token ID644
Feature activation+5.86e-01

Pos logit contributions
 pige+0.017
 sandwiches+0.017
 cakes+0.015
 ducks+0.015
 balloons+0.015
Neg logit contributions
Charl-0.026
use-0.024
 forward-0.024
ifying-0.022
ppy-0.022
Token we
Token ID356
Feature activation+6.13e-01

Pos logit contributions
 sandwiches+0.020
 pige+0.020
 monkeys+0.019
castle+0.019
 ducks+0.018
Neg logit contributions
Charl-0.017
use-0.017
 forward-0.015
-0.014
sc-0.014
Token need
Token ID761
Feature activation+2.17e-02

Pos logit contributions
 pige+0.024
 sandwiches+0.023
 cakes+0.023
 ducks+0.023
 ice+0.022
Neg logit contributions
Charl-0.022
 forward-0.016
ifying-0.016
Ru-0.016
uster-0.016
Token thanked
Token ID26280
Feature activation-3.54e-01

Pos logit contributions
 pige+0.005
 sandwiches+0.005
 cakes+0.005
 ducks+0.005
 balloons+0.005
Neg logit contributions
Charl-0.005
use-0.005
 forward-0.005
ifying-0.004
ppy-0.004
Token her
Token ID607
Feature activation+2.41e-01

Pos logit contributions
Charl+0.011
use+0.010
 forward+0.010
ifying+0.009
ppy+0.009
Neg logit contributions
 pige-0.015
 sandwiches-0.014
 cakes-0.014
 ducks-0.013
 balloons-0.013
Token dad
Token ID9955
Feature activation-3.77e-01

Pos logit contributions
 pige+0.006
 sandwiches+0.006
 cakes+0.005
 ducks+0.005
 balloons+0.005
Neg logit contributions
Charl-0.011
use-0.011
 forward-0.011
ifying-0.010
ppy-0.010
Token for
Token ID329
Feature activation-5.45e-01

Pos logit contributions
Charl+0.016
use+0.015
ifying+0.013
 forward+0.013
Ru+0.013
Neg logit contributions
 sandwiches-0.012
 pige-0.012
 cakes-0.011
 balloons-0.010
 ducks-0.010
Token being
Token ID852
Feature activation-5.55e-01

Pos logit contributions
Charl+0.022
use+0.021
 forward+0.020
ifying+0.019
ppy+0.019
Neg logit contributions
 pige-0.017
 sandwiches-0.017
 cakes-0.016
 ducks-0.016
 balloons-0.015
Token so
Token ID523
Feature activation-1.37e-01

Pos logit contributions
Charl+0.022
 forward+0.020
use+0.020
ifying+0.018
ppy+0.018
Neg logit contributions
 pige-0.019
 sandwiches-0.018
 cakes-0.017
 ducks-0.017
 balloons-0.016
Token kind
Token ID1611
Feature activation+6.99e-01

Pos logit contributions
Charl+0.006
use+0.005
 forward+0.005
ifying+0.005
ppy+0.005
Neg logit contributions
 pige-0.004
 sandwiches-0.004
 cakes-0.004
 ducks-0.004
 balloons-0.004
Token.
Token ID13
Feature activation+5.44e-01

Pos logit contributions
 sandwiches+0.022
 pige+0.022
 ducks+0.020
 balloons+0.020
 cakes+0.019
Neg logit contributions
Charl-0.030
use-0.028
ifying-0.027
Ru-0.025
acy-0.025
Token From
Token ID3574
Feature activation-1.14e+00

Pos logit contributions
 pige+0.020
 sandwiches+0.020
 cakes+0.019
 ducks+0.019
 Sue+0.018
Neg logit contributions
Charl-0.018
use-0.018
 forward-0.016
ifying-0.016
ppy-0.016
Token then
Token ID788
Feature activation-1.48e-01

Pos logit contributions
Charl+0.049
use+0.046
 forward+0.044
ifying+0.042
ppy+0.041
Neg logit contributions
 pige-0.034
 sandwiches-0.033
 cakes-0.031
 ducks-0.030
 balloons-0.030
Token on
Token ID319
Feature activation+9.64e-01

Pos logit contributions
Charl+0.007
use+0.007
ifying+0.006
ppy+0.006
ky+0.006
Neg logit contributions
 sandwiches-0.003
 pige-0.003
 ice-0.003
 cakes-0.003
 balloons-0.003
Token her
Token ID607
Feature activation-5.15e-01

Pos logit contributions
 sandwiches+0.015
 balloons+0.013
 cakes+0.013
 pige+0.013
 ducks+0.013
Neg logit contributions
Charl-0.020
use-0.019
ifying-0.017
Ru-0.017
 forward-0.017
Token eye
Token ID4151
Feature activation-3.65e-01

Pos logit contributions
Charl+0.023
use+0.022
 forward+0.021
ifying+0.019
Ru+0.019
Neg logit contributions
 sandwiches-0.015
 pige-0.015
 cakes-0.014
 balloons-0.013
 ducks-0.013
Token with
Token ID351
Feature activation-1.07e+00

Pos logit contributions
Charl+0.017
use+0.016
Ru+0.014
oris+0.014
 forward+0.014
Neg logit contributions
 sandwiches-0.011
 cakes-0.010
 pige-0.010
 balloons-0.010
 ice-0.010
Token water
Token ID1660
Feature activation-5.11e-01

Pos logit contributions
Charl+0.056
use+0.054
 forward+0.049
ppy+0.049
oris+0.049
Neg logit contributions
 sandwiches-0.027
 balloons-0.023
 ice-0.023
 cakes-0.022
 cookies-0.022
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
Charl+0.024
use+0.021
Ru+0.020
acy+0.019
 forward+0.019
Neg logit contributions
 sandwiches-0.016
 pige-0.015
 ducks-0.015
 balloons-0.014
 cakes-0.014
Token She
Token ID1375
Feature activation+9.10e-02

Pos logit contributions
use+0.005
Charl+0.004
 forward+0.004
ppy+0.004
ley+0.004
Neg logit contributions
 sandwiches-0.005
 pige-0.005
 Sue-0.005
 cakes-0.005
 ducks-0.004
Token put
Token ID1234
Feature activation-4.77e-01

Pos logit contributions
 sandwiches+0.003
 pige+0.003
 ducks+0.003
 cakes+0.003
 balloons+0.003
Neg logit contributions
Charl-0.003
use-0.003
 forward-0.003
ifying-0.003
och-0.003
Token a
Token ID257
Feature activation-9.55e-01

Pos logit contributions
Charl+0.024
use+0.023
oping+0.021
Ru+0.021
oris+0.020
Neg logit contributions
 sandwiches-0.012
 balloons-0.011
 cakes-0.010
 pige-0.010
 ducks-0.010
Token band
Token ID4097
Feature activation+1.64e-01

Pos logit contributions
Charl+0.045
use+0.044
 forward+0.041
ppy+0.040
enna+0.040
Neg logit contributions
 sandwiches-0.027
 pige-0.023
 ice-0.023
 ducks-0.023
 balloons-0.023
Token-
Token ID12
Feature activation+3.96e-01

Pos logit contributions
 sandwiches+0.005
 pige+0.005
 cakes+0.005
 balloons+0.005
 ducks+0.005
Neg logit contributions
Charl-0.006
use-0.006
ifying-0.006
Ru-0.006
 forward-0.006
Tokenaid
Token ID1698
Feature activation+3.14e-01

Pos logit contributions
 pige+0.013
 sandwiches+0.013
 cakes+0.012
 ducks+0.012
 balloons+0.012
Neg logit contributions
Charl-0.016
use-0.015
 forward-0.014
ifying-0.013
Ru-0.013
TokenJohn
Token ID7554
Feature activation-2.16e-01

Pos logit contributions
 sandwiches+0.023
 pige+0.023
 cakes+0.020
 ducks+0.020
 balloons+0.020
Neg logit contributions
Charl-0.021
 forward-0.019
use-0.018
ifying-0.017
well-0.016
Token was
Token ID373
Feature activation+1.21e-01

Pos logit contributions
Charl+0.008
use+0.008
 forward+0.007
ifying+0.007
och+0.007
Neg logit contributions
 pige-0.007
 sandwiches-0.007
 ducks-0.007
 cakes-0.007
 balloons-0.007
Token happy
Token ID3772
Feature activation+4.95e-02

Pos logit contributions
 pige+0.004
 sandwiches+0.004
 cakes+0.004
 ducks+0.004
 balloons+0.004
Neg logit contributions
Charl-0.004
 forward-0.004
use-0.004
ifying-0.004
ppy-0.003
Token so
Token ID523
Feature activation-7.00e-02

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 cakes+0.002
 ducks+0.002
 balloons+0.002
Neg logit contributions
Charl-0.002
 forward-0.001
use-0.001
ifying-0.001
ppy-0.001
Token he
Token ID339
Feature activation+4.43e-01

Pos logit contributions
Charl+0.003
use+0.003
 forward+0.003
ppy+0.003
ifying+0.003
Neg logit contributions
 pige-0.002
 sandwiches-0.002
 cakes-0.002
 ducks-0.002
 balloons-0.002
Token decided
Token ID3066
Feature activation+2.75e-01

Pos logit contributions
 pige+0.015
 sandwiches+0.015
 ducks+0.014
 balloons+0.014
 cakes+0.014
Neg logit contributions
Charl-0.017
use-0.016
 forward-0.015
ifying-0.015
ppy-0.014
Token to
Token ID284
Feature activation-9.28e-02

Pos logit contributions
 pige+0.011
 sandwiches+0.011
 cakes+0.011
 ducks+0.011
 balloons+0.010
Neg logit contributions
Charl-0.008
 forward-0.008
use-0.008
ifying-0.006
 impuls-0.006
Token show
Token ID905
Feature activation-4.08e-01

Pos logit contributions
Charl+0.004
use+0.004
ifying+0.003
 forward+0.003
ppy+0.003
Neg logit contributions
 swings-0.003
 sandwiches-0.003
 balloons-0.003
 pige-0.003
 cakes-0.003
Token off
Token ID572
Feature activation+8.40e-01

Pos logit contributions
 forward+0.013
Charl+0.013
use+0.011
 Taylor+0.011
ley+0.010
Neg logit contributions
 pige-0.017
 sandwiches-0.015
 ducks-0.014
 cakes-0.014
 swings-0.014
Token by
Token ID416
Feature activation-1.93e-01

Pos logit contributions
 pige+0.035
 sandwiches+0.032
 ducks+0.031
 cakes+0.030
 swings+0.029
Neg logit contributions
Charl-0.026
 forward-0.026
use-0.022
 Taylor-0.021
ifying-0.021
Token ped
Token ID7190
Feature activation+2.19e-01

Pos logit contributions
Charl+0.008
 forward+0.007
use+0.007
ifying+0.006
 Taylor+0.006
Neg logit contributions
 pige-0.006
 sandwiches-0.006
 cakes-0.006
 ducks-0.005
 balloons-0.005
Token made
Token ID925
Feature activation+3.67e-01

Pos logit contributions
Charl+0.002
use+0.002
 forward+0.002
ppy+0.001
 Taylor+0.001
Neg logit contributions
 sandwiches-0.002
 pige-0.002
 cakes-0.002
 balloons-0.002
 ducks-0.002
Token a
Token ID257
Feature activation-5.93e-01

Pos logit contributions
 pige+0.013
 sandwiches+0.012
 cakes+0.012
 balloons+0.011
mm+0.011
Neg logit contributions
 forward-0.012
Charl-0.012
bs-0.010
 Taylor-0.010
use-0.010
Token face
Token ID1986
Feature activation+2.57e-01

Pos logit contributions
Charl+0.022
 forward+0.020
use+0.020
ifying+0.019
ppy+0.018
Neg logit contributions
 pige-0.021
 sandwiches-0.020
 cakes-0.019
 balloons-0.018
 ducks-0.018
Token.
Token ID13
Feature activation+7.77e-01

Pos logit contributions
 pige+0.010
 sandwiches+0.010
 cakes+0.009
 ducks+0.009
 balloons+0.009
Neg logit contributions
Charl-0.009
use-0.008
 forward-0.008
ifying-0.007
ppy-0.007
Token She
Token ID1375
Feature activation+6.06e-02

Pos logit contributions
 pige+0.036
 sandwiches+0.035
 cakes+0.033
 ducks+0.033
 balloons+0.032
Neg logit contributions
Charl-0.020
use-0.018
 forward-0.018
ifying-0.016
ppy-0.015
Token did
Token ID750
Feature activation+4.36e-01

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 cakes+0.002
 ducks+0.002
 balloons+0.002
Neg logit contributions
Charl-0.002
use-0.002
 forward-0.002
ppy-0.002
ifying-0.002
Token not
Token ID407
Feature activation+4.46e-02

Pos logit contributions
 pige+0.014
 sandwiches+0.013
 cakes+0.012
 ducks+0.012
 balloons+0.012
Neg logit contributions
Charl-0.018
 forward-0.017
use-0.016
ifying-0.015
ppy-0.015
Token like
Token ID588
Feature activation-1.05e+00

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 ducks+0.002
 cakes+0.002
 balloons+0.002
Neg logit contributions
Charl-0.001
use-0.001
ifying-0.001
 forward-0.001
well-0.001
Token the
Token ID262
Feature activation-3.27e-01

Pos logit contributions
Charl+0.046
use+0.043
 forward+0.042
ppy+0.039
ifying+0.039
Neg logit contributions
 pige-0.028
 sandwiches-0.028
 cakes-0.025
 ducks-0.024
 balloons-0.024
Token frog
Token ID21264
Feature activation-6.70e-01

Pos logit contributions
Charl+0.015
use+0.014
 forward+0.014
ifying+0.013
ppy+0.013
Neg logit contributions
 pige-0.009
 sandwiches-0.009
 cakes-0.008
 balloons-0.008
 ducks-0.007
Token.
Token ID13
Feature activation+3.12e-01

Pos logit contributions
Charl+0.022
use+0.021
 forward+0.020
ppy+0.018
ifying+0.018
Neg logit contributions
 pige-0.026
 sandwiches-0.026
 cakes-0.024
 ducks-0.024
 balloons-0.024
Token fence
Token ID13990
Feature activation+6.70e-01

Pos logit contributions
Charl+0.036
use+0.034
 forward+0.033
ifying+0.031
sc+0.030
Neg logit contributions
 pige-0.034
 sandwiches-0.033
 cakes-0.031
 balloons-0.030
 ducks-0.029
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
 sandwiches+0.020
 pige+0.019
 balloons+0.019
 ducks+0.019
 cakes+0.019
Neg logit contributions
Charl-0.029
use-0.028
ppy-0.024
Ru-0.024
issy-0.024
Token Now
Token ID2735
Feature activation-6.16e-01

Pos logit contributions
 pige+0.042
 sandwiches+0.042
 cakes+0.040
 ducks+0.039
 balloons+0.038
Neg logit contributions
Charl-0.046
use-0.044
 forward-0.041
ppy-0.040
ifying-0.039
Token he
Token ID339
Feature activation-5.22e-01

Pos logit contributions
Charl+0.023
 forward+0.022
use+0.022
ifying+0.020
ppy+0.020
Neg logit contributions
 pige-0.022
 sandwiches-0.021
 cakes-0.019
 ducks-0.019
 balloons-0.019
Token could
Token ID714
Feature activation-4.23e-01

Pos logit contributions
Charl+0.021
 forward+0.018
use+0.018
arl+0.018
ifying+0.018
Neg logit contributions
 ducks-0.016
 pige-0.016
 sandwiches-0.015
 swings-0.015
 balloons-0.015
Token explore
Token ID7301
Feature activation+3.10e-01

Pos logit contributions
Charl+0.016
use+0.015
 forward+0.014
ifying+0.014
ppy+0.013
Neg logit contributions
 pige-0.015
 sandwiches-0.015
 cakes-0.014
 ducks-0.014
 balloons-0.013
Token the
Token ID262
Feature activation-5.23e-01

Pos logit contributions
 sandwiches+0.007
 pige+0.007
 ducks+0.007
 cakes+0.007
 balloons+0.007
Neg logit contributions
Charl-0.015
use-0.015
ifying-0.014
ppy-0.014
 forward-0.014
Token other
Token ID584
Feature activation-3.56e-02

Pos logit contributions
Charl+0.021
use+0.020
 forward+0.019
ifying+0.018
ppy+0.018
Neg logit contributions
 pige-0.017
 sandwiches-0.017
 cakes-0.016
 ducks-0.015
 balloons-0.015
Token side
Token ID1735
Feature activation+3.43e-01

Pos logit contributions
Charl+0.001
use+0.001
 forward+0.001
ifying+0.001
ppy+0.001
Neg logit contributions
 pige-0.001
 sandwiches-0.001
 cakes-0.001
 balloons-0.001
 ducks-0.001
Token of
Token ID286
Feature activation-8.30e-01

Pos logit contributions
 pige+0.010
 sandwiches+0.010
 cakes+0.010
 ducks+0.010
 balloons+0.010
Neg logit contributions
Charl-0.015
use-0.014
ppy-0.012
Ru-0.012
ifying-0.012
Token the
Token ID262
Feature activation-6.80e-01

Pos logit contributions
Charl+0.042
use+0.040
 forward+0.038
ifying+0.037
ppy+0.037
Neg logit contributions
 pige-0.019
 sandwiches-0.019
 cakes-0.017
 ducks-0.017
 balloons-0.016
Token to
Token ID284
Feature activation+8.96e-01

Pos logit contributions
 pige+0.005
 sandwiches+0.005
 balloons+0.005
 ducks+0.005
 cakes+0.005
Neg logit contributions
Charl-0.007
use-0.007
ifying-0.006
 forward-0.006
ppy-0.006
Token him
Token ID683
Feature activation+2.48e-01

Pos logit contributions
 pige+0.026
 sandwiches+0.026
 cakes+0.024
 ducks+0.023
 balloons+0.023
Neg logit contributions
Charl-0.039
use-0.037
 forward-0.036
ifying-0.035
ppy-0.034
Token and
Token ID290
Feature activation-4.10e-01

Pos logit contributions
 sandwiches+0.008
 pige+0.008
 ice+0.008
 balloons+0.008
 cakes+0.008
Neg logit contributions
Charl-0.010
use-0.009
 forward-0.009
ifying-0.009
acy-0.008
Token said
Token ID531
Feature activation-6.41e-01

Pos logit contributions
Charl+0.016
use+0.015
 forward+0.014
ifying+0.014
ppy+0.013
Neg logit contributions
 pige-0.014
 sandwiches-0.014
 cakes-0.013
 ducks-0.013
 balloons-0.012
Token,
Token ID11
Feature activation-4.71e-01

Pos logit contributions
Charl+0.014
 forward+0.014
use+0.013
ppy+0.010
bs+0.010
Neg logit contributions
 pige-0.030
 sandwiches-0.030
 ducks-0.028
 flavors-0.028
 cakes-0.028
Token "
Token ID366
Feature activation-7.82e-01

Pos logit contributions
Charl+0.023
use+0.022
 forward+0.021
ifying+0.020
ppy+0.020
Neg logit contributions
 pige-0.011
 sandwiches-0.011
 cakes-0.010
 ducks-0.010
 balloons-0.010
TokenHi
Token ID17250
Feature activation-1.52e-01

Pos logit contributions
Charl+0.025
 forward+0.023
use+0.023
ppy+0.021
ifying+0.021
Neg logit contributions
 pige-0.032
 ducks-0.029
 sandwiches-0.029
 balloons-0.028
 cakes-0.028
Token,
Token ID11
Feature activation+4.17e-01

Pos logit contributions
Charl+0.003
 forward+0.003
use+0.002
 Taylor+0.002
ley+0.002
Neg logit contributions
 pige-0.007
 sandwiches-0.007
 cakes-0.007
 flavors-0.007
 pies-0.007
Token what
Token ID644
Feature activation+1.17e-01

Pos logit contributions
 sandwiches+0.015
 pige+0.015
 cakes+0.014
 balloons+0.013
 flavors+0.013
Neg logit contributions
Charl-0.015
 forward-0.014
use-0.013
ifying-0.012
ppy-0.012
Token's
Token ID338
Feature activation+4.76e-01

Pos logit contributions
 sandwiches+0.005
 pige+0.005
 cakes+0.005
 pie+0.005
 ducks+0.005
Neg logit contributions
Charl-0.003
use-0.003
 forward-0.003
Ru-0.003
ifying-0.003
Token your
Token ID534
Feature activation-7.11e-01

Pos logit contributions
 pige+0.018
 sandwiches+0.017
 cakes+0.016
 ducks+0.016
 balloons+0.016
Neg logit contributions
Charl-0.017
 forward-0.016
use-0.015
ifying-0.014
ppy-0.013
Token that
Token ID326
Feature activation-9.89e-02

Pos logit contributions
Charl+0.029
use+0.027
 forward+0.026
ifying+0.025
ppy+0.025
Neg logit contributions
 pige-0.019
 sandwiches-0.019
 cakes-0.017
 ducks-0.017
 balloons-0.017
Token some
Token ID617
Feature activation+1.04e+00

Pos logit contributions
Charl+0.004
use+0.004
 forward+0.004
ifying+0.004
ppy+0.004
Neg logit contributions
 pige-0.003
 sandwiches-0.003
 cakes-0.002
 ducks-0.002
 balloons-0.002
Token children
Token ID1751
Feature activation-6.11e-01

Pos logit contributions
 pige+0.028
 sandwiches+0.028
 cakes+0.025
 ducks+0.025
 balloons+0.024
Neg logit contributions
Charl-0.047
use-0.044
 forward-0.043
ifying-0.041
ppy-0.040
Token had
Token ID550
Feature activation+7.55e-01

Pos logit contributions
Charl+0.023
use+0.021
 forward+0.020
ifying+0.020
Ru+0.019
Neg logit contributions
 pige-0.022
 sandwiches-0.021
 cakes-0.020
 ducks-0.020
 balloons-0.020
Token very
Token ID845
Feature activation-2.28e-01

Pos logit contributions
 pige+0.023
 sandwiches+0.023
 cakes+0.021
 ducks+0.020
 balloons+0.020
Neg logit contributions
Charl-0.032
use-0.030
 forward-0.029
ifying-0.027
ppy-0.027
Token little
Token ID1310
Feature activation-5.22e-01

Pos logit contributions
Charl+0.009
use+0.009
 forward+0.008
ifying+0.008
ppy+0.008
Neg logit contributions
 pige-0.007
 sandwiches-0.007
 cakes-0.007
 ducks-0.006
 balloons-0.006
Token and
Token ID290
Feature activation+7.45e-01

Pos logit contributions
Charl+0.027
use+0.025
 forward+0.024
ifying+0.023
ppy+0.023
Neg logit contributions
 pige-0.012
 sandwiches-0.012
 cakes-0.011
 balloons-0.010
 ducks-0.010
Token were
Token ID547
Feature activation+3.91e-01

Pos logit contributions
 pige+0.022
 sandwiches+0.022
 cakes+0.020
 ducks+0.020
 balloons+0.020
Neg logit contributions
Charl-0.033
use-0.031
 forward-0.030
ifying-0.028
ppy-0.028
Token not
Token ID407
Feature activation+1.59e+00

Pos logit contributions
 pige+0.014
 sandwiches+0.013
 cakes+0.013
 ducks+0.012
 balloons+0.012
Neg logit contributions
Charl-0.015
use-0.014
 forward-0.013
ifying-0.013
ppy-0.013
Token happy
Token ID3772
Feature activation-7.12e-01

Pos logit contributions
 pige+0.053
 sandwiches+0.052
 cakes+0.048
 ducks+0.047
 balloons+0.046
Neg logit contributions
Charl-0.062
use-0.058
 forward-0.057
ifying-0.053
ppy-0.052
Token.
Token ID13
Feature activation+5.77e-02

Pos logit contributions
Charl+0.026
use+0.024
 forward+0.023
ifying+0.022
ppy+0.021
Neg logit contributions
 pige-0.026
 sandwiches-0.025
 cakes-0.024
 ducks-0.023
 balloons-0.023
Token tasted
Token ID29187
Feature activation+4.99e-01

Pos logit contributions
 sandwiches+0.018
 pige+0.018
 cakes+0.017
 ducks+0.017
 balloons+0.016
Neg logit contributions
Charl-0.024
use-0.023
 forward-0.021
ifying-0.021
ppy-0.020
Token even
Token ID772
Feature activation-4.34e-01

Pos logit contributions
 pige+0.014
 sandwiches+0.014
 cakes+0.012
 ducks+0.012
 balloons+0.012
Neg logit contributions
Charl-0.023
use-0.021
 forward-0.020
ifying-0.020
ppy-0.019
Token better
Token ID1365
Feature activation-1.39e-01

Pos logit contributions
Charl+0.019
use+0.017
ifying+0.016
 forward+0.016
ppy+0.015
Neg logit contributions
 sandwiches-0.014
 pige-0.013
 cakes-0.013
 ducks-0.012
 balloons-0.012
Token than
Token ID621
Feature activation+9.37e-01

Pos logit contributions
Charl+0.005
use+0.005
 forward+0.004
ppy+0.004
ifying+0.004
Neg logit contributions
 sandwiches-0.005
 pige-0.005
 cakes-0.005
 ducks-0.005
 balloons-0.005
Token if
Token ID611
Feature activation-3.36e-02

Pos logit contributions
 pige+0.032
 sandwiches+0.032
 cakes+0.030
 ducks+0.029
 balloons+0.029
Neg logit contributions
Charl-0.037
use-0.034
 forward-0.033
ifying-0.031
ppy-0.030
Token Jimmy
Token ID12963
Feature activation-6.15e-01

Pos logit contributions
Charl+0.002
use+0.001
 forward+0.001
ifying+0.001
ppy+0.001
Neg logit contributions
 pige-0.001
 sandwiches-0.001
 cakes-0.001
 ducks-0.001
 swings-0.001
Token's
Token ID338
Feature activation-2.77e-01

Pos logit contributions
Charl+0.024
 forward+0.021
use+0.021
ifying+0.020
Ru+0.020
Neg logit contributions
 pige-0.021
 sandwiches-0.021
 ducks-0.019
 cakes-0.019
 balloons-0.019
Token mom
Token ID1995
Feature activation-5.23e-01

Pos logit contributions
Charl+0.012
use+0.011
 forward+0.011
ifying+0.011
ppy+0.010
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 cakes-0.007
 ducks-0.007
 balloons-0.007
Token had
Token ID550
Feature activation+3.96e-01

Pos logit contributions
Charl+0.022
 forward+0.019
use+0.019
ifying+0.018
Ru+0.018
Neg logit contributions
 pige-0.016
 sandwiches-0.016
 cakes-0.016
 ducks-0.015
 balloons-0.015
Token cooked
Token ID15847
Feature activation-5.32e-01

Pos logit contributions
 pige+0.014
 sandwiches+0.014
 cakes+0.013
 ducks+0.013
 balloons+0.013
Neg logit contributions
Charl-0.014
use-0.013
 forward-0.013
ifying-0.012
ppy-0.012
Token it
Token ID340
Feature activation-1.24e-01

Pos logit contributions
Charl+0.023
use+0.021
 forward+0.020
ifying+0.020
ppy+0.019
Neg logit contributions
 sandwiches-0.016
 pige-0.016
 cakes-0.015
 ducks-0.014
 balloons-0.014
Token strange
Token ID6283
Feature activation-5.63e-01

Pos logit contributions
Charl+0.016
use+0.015
 forward+0.014
ppy+0.013
ifying+0.013
Neg logit contributions
 pige-0.013
 sandwiches-0.013
 cakes-0.012
 ducks-0.012
 balloons-0.012
Token on
Token ID319
Feature activation+8.11e-01

Pos logit contributions
Charl+0.022
use+0.020
 forward+0.019
ppy+0.018
ifying+0.018
Neg logit contributions
 sandwiches-0.020
 pige-0.019
 cakes-0.018
 ducks-0.018
 balloons-0.018
Token the
Token ID262
Feature activation-6.24e-01

Pos logit contributions
 sandwiches+0.015
 pige+0.015
 balloons+0.014
 ducks+0.014
 cakes+0.013
Neg logit contributions
Charl-0.044
use-0.043
 forward-0.040
ppy-0.040
sc-0.038
Token ground
Token ID2323
Feature activation-9.94e-01

Pos logit contributions
Charl+0.028
use+0.026
 forward+0.025
ppy+0.025
Ru+0.024
Neg logit contributions
 pige-0.019
 sandwiches-0.017
 cakes-0.017
 ducks-0.017
 swings-0.016
Token.
Token ID13
Feature activation-9.26e-02

Pos logit contributions
Charl+0.046
use+0.041
 forward+0.038
Ru+0.037
ppy+0.037
Neg logit contributions
 sandwiches-0.029
 pige-0.029
 ducks-0.027
 swings-0.027
 cakes-0.027
Token It
Token ID632
Feature activation-5.67e-01

Pos logit contributions
Charl+0.004
use+0.004
 forward+0.003
ppy+0.003
ley+0.003
Neg logit contributions
 pige-0.003
 sandwiches-0.003
 cakes-0.003
 ducks-0.003
 Sue-0.003
Token was
Token ID373
Feature activation-4.54e-01

Pos logit contributions
Charl+0.025
use+0.024
 forward+0.023
ifying+0.021
arl+0.021
Neg logit contributions
 sandwiches-0.016
 pige-0.016
 ducks-0.015
 swings-0.015
 cakes-0.014
Token a
Token ID257
Feature activation-2.73e-02

Pos logit contributions
use+0.022
Charl+0.022
 forward+0.020
ppy+0.020
och+0.019
Neg logit contributions
 sandwiches-0.012
 pige-0.011
 ducks-0.011
 cakes-0.011
 ice-0.010
Token button
Token ID4936
Feature activation-3.44e-01

Pos logit contributions
Charl+0.001
use+0.001
ppy+0.001
 forward+0.001
issy+0.001
Neg logit contributions
 pige-0.001
 sandwiches-0.001
 ducks-0.001
 cakes-0.001
 pie-0.001
Token!
Token ID0
Feature activation-4.75e-03

Pos logit contributions
Charl+0.014
use+0.013
 forward+0.012
Ru+0.012
ppy+0.012
Neg logit contributions
 sandwiches-0.011
 pige-0.011
 cakes-0.011
 balloons-0.010
 swings-0.010
Token She
Token ID1375
Feature activation-1.07e-01

Pos logit contributions
Charl+0.000
use+0.000
 forward+0.000
ppy+0.000
ifying+0.000
Neg logit contributions
 pige-0.000
 sandwiches-0.000
 ducks-0.000
 cakes-0.000
 Sue-0.000
Token adventurous
Token ID39770
Feature activation+1.08e+00

Pos logit contributions
 sandwiches+0.036
 cakes+0.034
 pige+0.034
 balloons+0.032
 pies+0.030
Neg logit contributions
Charl-0.041
 forward-0.039
ky-0.037
ppy-0.037
use-0.036
Token three
Token ID1115
Feature activation+4.77e-01

Pos logit contributions
 sandwiches+0.034
 pige+0.034
 cakes+0.032
 balloons+0.030
 ducks+0.029
Neg logit contributions
Charl-0.044
use-0.040
 forward-0.040
ppy-0.037
ley-0.036
Token year
Token ID614
Feature activation-7.14e-01

Pos logit contributions
 sandwiches+0.022
 pige+0.021
 cakes+0.020
 swings+0.020
 balloons+0.020
Neg logit contributions
Charl-0.011
 forward-0.009
ley-0.009
use-0.009
ppy-0.008
Token old
Token ID1468
Feature activation-2.24e-01

Pos logit contributions
Charl+0.024
 forward+0.024
use+0.023
ley+0.021
issy+0.021
Neg logit contributions
 pige-0.025
 sandwiches-0.025
 cakes-0.023
 ducks-0.022
 swings-0.022
Token was
Token ID373
Feature activation+1.60e-01

Pos logit contributions
Charl+0.008
ley+0.007
use+0.007
 forward+0.006
 Taylor+0.006
Neg logit contributions
 sandwiches-0.009
 pige-0.008
 cakes-0.008
 balloons-0.007
 swings-0.007
Token given
Token ID1813
Feature activation-3.87e-01

Pos logit contributions
 pige+0.006
 sandwiches+0.006
 cakes+0.005
 ducks+0.005
 balloons+0.005
Neg logit contributions
Charl-0.006
use-0.006
 forward-0.005
ifying-0.005
ppy-0.005
Token a
Token ID257
Feature activation+1.70e-01

Pos logit contributions
Charl+0.013
 forward+0.012
use+0.011
sc+0.011
ky+0.011
Neg logit contributions
 pige-0.014
 sandwiches-0.014
 cakes-0.013
 swings-0.013
 balloons-0.013
Token letter
Token ID3850
Feature activation-1.95e-01

Pos logit contributions
 pige+0.005
 sandwiches+0.005
 cakes+0.005
 ducks+0.005
 balloons+0.005
Neg logit contributions
Charl-0.007
use-0.007
 forward-0.006
ppy-0.006
ifying-0.006
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
Charl+0.007
use+0.007
 forward+0.006
ifying+0.006
ppy+0.006
Neg logit contributions
 pige-0.007
 sandwiches-0.007
 cakes-0.006
 ducks-0.006
 balloons-0.006
Token She
Token ID1375
Feature activation-3.93e-01

Pos logit contributions
Charl+0.004
use+0.004
 forward+0.004
ifying+0.003
 Taylor+0.003
Neg logit contributions
 pige-0.004
 sandwiches-0.004
 cakes-0.004
 ducks-0.004
 balloons-0.004
Token was
Token ID373
Feature activation-1.61e-03

Pos logit contributions
Charl+0.015
use+0.014
 forward+0.014
ifying+0.013
ppy+0.013
Neg logit contributions
 pige-0.014
 sandwiches-0.013
 cakes-0.012
 ducks-0.012
 balloons-0.012
Token around
Token ID1088
Feature activation-2.52e-01

Pos logit contributions
Charl+0.003
use+0.003
ifying+0.003
issy+0.003
acy+0.003
Neg logit contributions
 sandwiches-0.003
 pige-0.003
 ducks-0.003
 swings-0.003
 balloons-0.003
Token and
Token ID290
Feature activation+3.03e-01

Pos logit contributions
Charl+0.011
use+0.010
ifying+0.009
 forward+0.009
ppy+0.009
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 ducks-0.007
 cakes-0.007
 balloons-0.007
Token playing
Token ID2712
Feature activation-2.48e-01

Pos logit contributions
 pige+0.009
 sandwiches+0.009
 cakes+0.008
 ducks+0.008
 balloons+0.008
Neg logit contributions
Charl-0.013
use-0.012
 forward-0.012
ifying-0.011
ppy-0.011
Token.
Token ID13
Feature activation-6.63e-02

Pos logit contributions
Charl+0.008
 forward+0.007
use+0.007
sc+0.006
Ru+0.006
Neg logit contributions
 pige-0.010
 sandwiches-0.010
 cakes-0.009
 balloons-0.009
 ducks-0.009
Token Soon
Token ID15894
Feature activation-3.78e-01

Pos logit contributions
Charl+0.002
use+0.002
 forward+0.002
ifying+0.002
ppy+0.002
Neg logit contributions
 pige-0.003
 sandwiches-0.003
 cakes-0.003
 ducks-0.003
 balloons-0.002
Token night
Token ID1755
Feature activation-2.01e+00

Pos logit contributions
 forward+0.007
use+0.005
 que+0.004
Charl+0.004
bs+0.004
Neg logit contributions
 pige-0.019
 cakes-0.019
 sandwiches-0.018
 flavors-0.018
 pie-0.018
Token time
Token ID640
Feature activation-2.11e-01

Pos logit contributions
Charl+0.063
use+0.059
 forward+0.057
ifying+0.053
ppy+0.051
Neg logit contributions
 pige-0.081
 sandwiches-0.080
 cakes-0.076
 ducks-0.074
 balloons-0.073
Token came
Token ID1625
Feature activation-2.24e-01

Pos logit contributions
Charl+0.007
 forward+0.006
use+0.006
oping+0.005
ppy+0.005
Neg logit contributions
 pige-0.009
 sandwiches-0.008
 cakes-0.008
 ducks-0.008
 balloons-0.008
Token and
Token ID290
Feature activation-2.45e-01

Pos logit contributions
Charl+0.008
use+0.008
 forward+0.007
ifying+0.007
ppy+0.007
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 ducks-0.008
 cakes-0.007
 balloons-0.007
Token the
Token ID262
Feature activation-4.99e-01

Pos logit contributions
Charl+0.010
use+0.009
 forward+0.009
ifying+0.008
ppy+0.008
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 cakes-0.007
 ducks-0.007
 balloons-0.007
Token bird
Token ID6512
Feature activation-3.15e-01

Pos logit contributions
use+0.016
 forward+0.016
Charl+0.016
well+0.015
 que+0.015
Neg logit contributions
 sandwiches-0.017
 cakes-0.015
 pige-0.014
mm-0.014
 flavors-0.013
Token is
Token ID318
Feature activation+4.62e-01

Pos logit contributions
 pige+0.009
 sandwiches+0.009
 cakes+0.008
 balloons+0.008
 ducks+0.008
Neg logit contributions
Charl-0.014
use-0.013
 forward-0.012
ifying-0.012
ppy-0.012
Token that
Token ID326
Feature activation+4.12e-01

Pos logit contributions
 pige+0.012
 sandwiches+0.012
 ducks+0.011
 cakes+0.011
 balloons+0.010
Neg logit contributions
Charl-0.022
use-0.020
 forward-0.019
ifying-0.019
sc-0.019
Token even
Token ID772
Feature activation-3.09e-02

Pos logit contributions
 pige+0.013
 sandwiches+0.012
 cakes+0.012
 ducks+0.011
 balloons+0.011
Neg logit contributions
Charl-0.017
use-0.016
 forward-0.016
ifying-0.014
ppy-0.014
Token when
Token ID618
Feature activation-5.38e-01

Pos logit contributions
Charl+0.001
ifying+0.001
use+0.001
oping+0.001
uster+0.001
Neg logit contributions
 pige-0.001
 sandwiches-0.001
 ducks-0.001
 cakes-0.001
 ice-0.001
Token we
Token ID356
Feature activation-1.13e+00

Pos logit contributions
Charl+0.023
use+0.022
 forward+0.021
ifying+0.020
ppy+0.020
Neg logit contributions
 pige-0.015
 sandwiches-0.015
 cakes-0.014
 ducks-0.013
 balloons-0.013
Token feel
Token ID1254
Feature activation-2.15e+00

Pos logit contributions
Charl+0.049
och+0.040
uster+0.039
sc+0.039
ky+0.039
Neg logit contributions
 sandwiches-0.035
 pige-0.035
 ducks-0.035
 ice-0.033
 balloons-0.032
Token tired
Token ID10032
Feature activation-8.97e-01

Pos logit contributions
Charl+0.095
use+0.088
oris+0.086
issy+0.084
och+0.083
Neg logit contributions
 sandwiches-0.067
 pige-0.064
 ducks-0.062
 ice-0.061
 balloons-0.058
Token,
Token ID11
Feature activation-6.98e-01

Pos logit contributions
Charl+0.043
use+0.037
Ru+0.036
oris+0.035
Taylor+0.035
Neg logit contributions
 sandwiches-0.026
 pige-0.025
 ducks-0.024
 ice-0.023
 cakes-0.023
Token doing
Token ID1804
Feature activation+2.94e-01

Pos logit contributions
Charl+0.033
use+0.032
 forward+0.031
ifying+0.029
ppy+0.029
Neg logit contributions
 pige-0.017
 sandwiches-0.017
 cakes-0.015
 ducks-0.015
 balloons-0.015
Token things
Token ID1243
Feature activation-5.94e-01

Pos logit contributions
 pige+0.008
 sandwiches+0.008
 cakes+0.007
 ducks+0.007
 balloons+0.007
Neg logit contributions
Charl-0.013
use-0.012
 forward-0.012
ifying-0.011
ppy-0.011
Token we
Token ID356
Feature activation+5.69e-02

Pos logit contributions
Charl+0.026
use+0.022
ifying+0.022
 forward+0.022
uster+0.022
Neg logit contributions
 sandwiches-0.017
 pige-0.017
 cakes-0.016
 balloons-0.016
 ducks-0.016
Token secret
Token ID3200
Feature activation-2.39e-01

Pos logit contributions
Charl+0.003
use+0.003
 forward+0.003
ifying+0.003
ppy+0.003
Neg logit contributions
 pige-0.002
 sandwiches-0.002
 cakes-0.002
 balloons-0.002
 ducks-0.002
Token she
Token ID673
Feature activation+7.43e-01

Pos logit contributions
Charl+0.010
use+0.009
 forward+0.009
ppy+0.009
ifying+0.008
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 cakes-0.007
 balloons-0.007
 ducks-0.007
Token had
Token ID550
Feature activation+7.27e-01

Pos logit contributions
 pige+0.024
 sandwiches+0.023
 swings+0.022
 balloons+0.022
 cakes+0.022
Neg logit contributions
Charl-0.030
use-0.027
ifying-0.026
 forward-0.026
arl-0.026
Token written
Token ID3194
Feature activation-3.90e-01

Pos logit contributions
 pige+0.024
 sandwiches+0.024
 balloons+0.023
 swings+0.022
 cakes+0.022
Neg logit contributions
Charl-0.030
use-0.028
 forward-0.026
uster-0.025
arl-0.025
Token the
Token ID262
Feature activation-3.61e-01

Pos logit contributions
Charl+0.017
use+0.017
ppy+0.014
sc+0.014
 forward+0.014
Neg logit contributions
 sandwiches-0.011
 pige-0.011
 balloons-0.011
 cakes-0.011
 ducks-0.010
Token night
Token ID1755
Feature activation-2.35e+00

Pos logit contributions
Charl+0.017
use+0.016
ppy+0.015
oping+0.015
uster+0.015
Neg logit contributions
 sandwiches-0.010
 balloons-0.009
 cakes-0.009
 pige-0.009
 cake-0.009
Token before
Token ID878
Feature activation-7.15e-02

Pos logit contributions
Charl+0.094
use+0.087
 forward+0.081
ppy+0.078
Ru+0.077
Neg logit contributions
 pige-0.081
 sandwiches-0.080
 cakes-0.077
 balloons-0.075
 ducks-0.072
Token was
Token ID373
Feature activation+6.55e-04

Pos logit contributions
Charl+0.003
use+0.003
 forward+0.003
ppy+0.003
ifying+0.003
Neg logit contributions
 sandwiches-0.002
 pige-0.002
 balloons-0.002
 cakes-0.002
 ducks-0.002
Token that
Token ID326
Feature activation+3.81e-01

Pos logit contributions
 sandwiches+0.000
 pige+0.000
 cakes+0.000
 balloons+0.000
 ducks+0.000
Neg logit contributions
Charl-0.000
use-0.000
ifying-0.000
sc-0.000
ppy-0.000
Token she
Token ID673
Feature activation+1.73e-01

Pos logit contributions
 sandwiches+0.013
 pige+0.012
 cakes+0.012
 balloons+0.012
 ducks+0.011
Neg logit contributions
Charl-0.016
use-0.015
ppy-0.013
 forward-0.013
ifying-0.013
Token could
Token ID714
Feature activation-6.76e-01

Pos logit contributions
 pige+0.005
 sandwiches+0.005
 swings+0.004
 balloons+0.004
 cakes+0.004
Neg logit contributions
Charl-0.008
arl-0.007
uster-0.007
 forward-0.007
use-0.007
Token ear
Token ID1027
Feature activation+1.31e-01

Pos logit contributions
Charl+0.009
use+0.008
 forward+0.008
ifying+0.007
ppy+0.007
Neg logit contributions
 pige-0.007
 sandwiches-0.007
 ducks-0.006
 cakes-0.006
 balloons-0.006
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
 sandwiches+0.003
 balloons+0.003
 pige+0.003
 cakes+0.003
 swings+0.003
Neg logit contributions
Charl-0.006
use-0.006
Ru-0.005
Fel-0.005
Case-0.005
Token The
Token ID383
Feature activation-9.16e-01

Pos logit contributions
Charl+0.003
use+0.003
 forward+0.003
ifying+0.003
ppy+0.003
Neg logit contributions
 pige-0.005
 sandwiches-0.005
 cakes-0.005
 ducks-0.005
 balloons-0.005
Token cat
Token ID3797
Feature activation-5.86e-02

Pos logit contributions
Charl+0.040
use+0.037
 forward+0.036
ifying+0.034
ppy+0.034
Neg logit contributions
 pige-0.027
 sandwiches-0.026
 cakes-0.024
 ducks-0.023
 balloons-0.023
Token could
Token ID714
Feature activation-5.99e-01

Pos logit contributions
Charl+0.002
use+0.002
 forward+0.002
ifying+0.002
ppy+0.002
Neg logit contributions
 pige-0.002
 sandwiches-0.002
 cakes-0.002
 ducks-0.002
 balloons-0.002
Token feel
Token ID1254
Feature activation-2.47e+00

Pos logit contributions
Charl+0.024
use+0.023
 forward+0.022
ifying+0.022
och+0.021
Neg logit contributions
 sandwiches-0.019
 pige-0.019
 ducks-0.018
 swings-0.018
 balloons-0.018
Token the
Token ID262
Feature activation-7.57e-01

Pos logit contributions
use+0.145
Charl+0.141
ley+0.129
issy+0.129
ifying+0.129
Neg logit contributions
 sandwiches-0.040
 ducks-0.038
 pige-0.037
 balloons-0.034
 ice-0.033
Token music
Token ID2647
Feature activation-1.14e+00

Pos logit contributions
Charl+0.035
use+0.035
 forward+0.032
ppy+0.032
ifying+0.031
Neg logit contributions
 pige-0.020
 sandwiches-0.020
 ducks-0.019
 balloons-0.019
 swings-0.018
Token through
Token ID832
Feature activation+6.70e-02

Pos logit contributions
Charl+0.045
use+0.044
ppy+0.037
Ru+0.037
ifying+0.037
Neg logit contributions
 sandwiches-0.038
 pige-0.038
 balloons-0.036
 cakes-0.036
 ducks-0.035
Token the
Token ID262
Feature activation-1.01e+00

Pos logit contributions
 pige+0.002
 sandwiches+0.002
 ducks+0.002
 balloons+0.002
 cakes+0.002
Neg logit contributions
Charl-0.003
use-0.003
 forward-0.003
ifying-0.003
ppy-0.002
Token string
Token ID4731
Feature activation-7.51e-02

Pos logit contributions
Charl+0.041
use+0.038
 forward+0.037
ppy+0.035
ifying+0.034
Neg logit contributions
 pige-0.033
 sandwiches-0.032
 ducks-0.030
 cakes-0.030
 balloons-0.030
Token a
Token ID257
Feature activation-1.21e+00

Pos logit contributions
Charl+0.012
 forward+0.011
use+0.011
ifying+0.010
ppy+0.010
Neg logit contributions
 pige-0.010
 sandwiches-0.010
 cakes-0.009
 ducks-0.009
 balloons-0.008
Token high
Token ID1029
Feature activation-5.69e-01

Pos logit contributions
Charl+0.051
use+0.048
 forward+0.046
ifying+0.044
ppy+0.044
Neg logit contributions
 pige-0.037
 sandwiches-0.036
 cakes-0.034
 ducks-0.033
 balloons-0.032
Token-
Token ID12
Feature activation-3.55e-01

Pos logit contributions
Charl+0.024
use+0.023
 forward+0.022
ifying+0.021
ppy+0.021
Neg logit contributions
 pige-0.017
 sandwiches-0.017
 cakes-0.015
 ducks-0.015
 balloons-0.015
Tokenp
Token ID79
Feature activation-9.23e-01

Pos logit contributions
Charl+0.015
use+0.014
ifying+0.013
 forward+0.013
ppy+0.012
Neg logit contributions
 pige-0.011
 sandwiches-0.010
 ducks-0.010
 pie-0.010
 cakes-0.010
Tokenitched
Token ID10981
Feature activation-6.50e-01

Pos logit contributions
Charl+0.038
use+0.036
 forward+0.031
Ru+0.030
och+0.030
Neg logit contributions
 sandwiches-0.031
 pige-0.030
 ducks-0.029
 cakes-0.028
 monkeys-0.027
Token noise
Token ID7838
Feature activation-2.05e+00

Pos logit contributions
Charl+0.029
use+0.028
ifying+0.025
 forward+0.025
och+0.024
Neg logit contributions
 sandwiches-0.019
 pige-0.018
 cakes-0.018
 ducks-0.017
 balloons-0.017
Token coming
Token ID2406
Feature activation-7.05e-01

Pos logit contributions
Charl+0.083
use+0.079
ifying+0.070
Ru+0.069
acy+0.068
Neg logit contributions
 sandwiches-0.067
 pige-0.066
 cakes-0.064
 ice-0.063
 ducks-0.062
Token from
Token ID422
Feature activation-5.48e-02

Pos logit contributions
Charl+0.030
use+0.026
och+0.023
ifying+0.023
acy+0.023
Neg logit contributions
 sandwiches-0.023
 ducks-0.023
 pige-0.023
 ice-0.022
 Sue-0.022
Token a
Token ID257
Feature activation-2.56e-01

Pos logit contributions
Charl+0.003
use+0.002
ppy+0.002
ifying+0.002
 forward+0.002
Neg logit contributions
 sandwiches-0.001
 pige-0.001
 ducks-0.001
 swings-0.001
 ice-0.001
Token nearby
Token ID6716
Feature activation+4.86e-01

Pos logit contributions
Charl+0.010
use+0.010
 forward+0.009
ppy+0.009
acy+0.009
Neg logit contributions
 pige-0.008
 sandwiches-0.008
 ducks-0.008
 cakes-0.007
 swings-0.007
Token creek
Token ID39459
Feature activation-8.07e-01

Pos logit contributions
 pige+0.014
 sandwiches+0.014
 cakes+0.013
 ducks+0.013
 balloons+0.012
Neg logit contributions
Charl-0.021
use-0.020
 forward-0.019
ifying-0.018
ppy-0.018