TOP ACTIVATIONS
MAX = 0.061

it to me for safe keeping . He told me to
grandma 's house for safe keeping . When
. " Which locked door ?" uncle Bob asked
got out their coats and umb rell as and they held
. What 's going on here ?" L ily
friends cheered for him . Spe edy said , " Good
" What is going on here ?" she asked . "
, what are you doing here ?" he says . "
" What is going on here ?" Lily 's mom asked
" What is going on here ? Why are you not
" What is going on here ?" Mom asks .
" What is going on here ?" Mom asks .
" What is going on here ? Why are you fighting
of the truck . He slammed the door and locked it
jackets became heavy and their umb rell as began to dro
" What is going on here ?" she asks . She
ended up in such bad situations . The
are still many things to explore and have fun with ."
. What is going on here ?" Tim and
, what are you doing here ?" It was Dad .

MIN ACTIVATIONS
MIN = -0.057

and looked at the ost ric hes . They saw one
that said " O st ric hes ". Sara and Tom
John chose some carrots and cuc umbers to take home .
big salad with carrots , cuc umbers , and tomatoes .
cried . " The ost ric hes are mean ! They
and the peppers from the cuc umbers . It took a
They do not pack ost ric hes anymore . They pack
but be careful . Ost ric hes can be gr umpy
village would raise a fire work into the sky to celebrate
some y ummy carrots and cuc umbers from the farmer 's
ie . Unlike other ost ric hes , his feathers gl
has lettuce and tomatoes and cuc umbers . And it has
cuc umber . Mama puts cuc umbers in our salad .
too close . The ost ric hes might pe ck you
go outside and fold orig ami animals in her garden .
. Tom ro ars and breat hes fire . Anna screams
is about a dragon that breat hes fire and ch ases
house . They wash the cuc umber and cut it into
mom and to the ost ric hes . They learned their
and run to the ost ric hes . "

INTERVAL 0.031 - 0.061
CONTAINS 0.106%

" No , it is mine !" Ben says . He
Tim my had hurt himself badly and had to go to
wanted the white spot so badly , so he decided to
distance and they wanted to explore it . As
party . Is it your party ?" Lily said .

INTERVAL 0.002 - 0.031
CONTAINS 42.894%

got you ." Ben reaches the bottom . Anna
they all lived happily ever after . <|endoftext|> Once upon a
. He was extraordinary . But not everyone liked
Sam are scared . They drop the shiny things and run
they finished all the laundry . When they were

INTERVAL -0.028 - 0.002
CONTAINS 56.631%

can sort it ." The second girl asked ,
prepare the garden for new flowers ." Lily was very excited
a bad wreck and had been alone ever since . The
big boy nodded one last time . He clut ched his
From that day on , Lily loved to touch

INTERVAL -0.057 - -0.028
CONTAINS 0.369%

shoes and a smile that made everyone around her happy .
whist led every morning when he went to his office .
day and didn 't like to do anything . One day
was a blue spike . It liked to study all day
design . They make a sun , a heart , and
Token it
Token ID340
Feature activation-1.36e-02

Pos logit contributions
ibi+0.005
edy+0.005
 sidewalk+0.005
oma+0.004
 Karen+0.004
Neg logit contributions
Justin-0.009
 Justin-0.009
 Mason-0.009
 Manny-0.009
 Macy-0.009
Token to
Token ID284
Feature activation+1.24e-02

Pos logit contributions
Justin+0.026
 Mason+0.025
 Justin+0.025
 Macy+0.024
ubb+0.024
Neg logit contributions
."-0.015
edy-0.015
ibi-0.015
!"-0.015
 Kim-0.014
Token me
Token ID502
Feature activation+8.49e-03

Pos logit contributions
ibi+0.018
edy+0.017
ongo+0.016
oot+0.016
 sidewalk+0.016
Neg logit contributions
 Justin-0.020
Justin-0.020
 Mason-0.019
 pap-0.018
 Manny-0.018
Token for
Token ID329
Feature activation+6.66e-03

Pos logit contributions
ibi+0.009
 sidewalk+0.009
oma+0.008
 Timothy+0.008
 Karen+0.008
Neg logit contributions
Justin-0.016
 Justin-0.016
 Mason-0.016
 pap-0.015
 Manny-0.015
Token safe
Token ID3338
Feature activation+1.34e-02

Pos logit contributions
ibi+0.010
edy+0.009
oma+0.009
round+0.009
 Karen+0.009
Neg logit contributions
Justin-0.010
 Justin-0.010
 Mason-0.010
 pap-0.009
 mer-0.009
Tokenkeeping
Token ID19934
Feature activation+6.09e-02

Pos logit contributions
ibi+0.014
edy+0.012
oma+0.011
 Karen+0.011
ongo+0.011
Neg logit contributions
 Justin-0.028
Justin-0.028
 mer-0.027
 Mason-0.027
 pap-0.026
Token.
Token ID13
Feature activation+2.85e-03

Pos logit contributions
ibi+0.088
 Timothy+0.085
 Su+0.081
Kim+0.080
anya+0.080
Neg logit contributions
 nood-0.082
 mer-0.080
 Mason-0.074
 Justin-0.074
 pap-0.073
Token He
Token ID679
Feature activation+7.74e-04

Pos logit contributions
edy+0.004
ibi+0.004
 sidewalk+0.004
oma+0.004
oot+0.004
Neg logit contributions
 Justin-0.004
Justin-0.004
 Mason-0.004
 Manny-0.004
 Macy-0.004
Token told
Token ID1297
Feature activation-8.27e-04

Pos logit contributions
ibi+0.001
 sidewalk+0.001
edy+0.001
 Karen+0.001
 Kim+0.001
Neg logit contributions
Justin-0.002
 Justin-0.002
 Mason-0.002
 pap-0.002
 mer-0.001
Token me
Token ID502
Feature activation+2.20e-02

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
 pap+0.001
 Macy+0.001
Neg logit contributions
ibi-0.001
edy-0.001
 sidewalk-0.001
oma-0.001
 Karen-0.001
Token to
Token ID284
Feature activation+9.02e-03

Pos logit contributions
ibi+0.027
edy+0.026
oma+0.025
 sidewalk+0.025
 Karen+0.024
Neg logit contributions
Justin-0.039
 Justin-0.038
 Mason-0.037
 Manny-0.036
 Macy-0.036
Token grandma
Token ID49890
Feature activation-6.77e-03

Pos logit contributions
 pap+0.008
Justin+0.008
 Justin+0.008
 Mason+0.007
 Manny+0.007
Neg logit contributions
ibi-0.006
round-0.006
edy-0.006
 Karen-0.006
Tom-0.006
Token's
Token ID338
Feature activation+2.27e-03

Pos logit contributions
 Justin+0.012
Justin+0.012
 Mason+0.012
 Manny+0.011
 pap+0.011
Neg logit contributions
ibi-0.008
edy-0.007
 sidewalk-0.007
Tom-0.007
oma-0.007
Token house
Token ID2156
Feature activation+2.08e-02

Pos logit contributions
ibi+0.003
edy+0.003
 Karen+0.003
round+0.003
 Kim+0.003
Neg logit contributions
Justin-0.004
 Justin-0.004
 pap-0.004
 Mason-0.004
 Manny-0.004
Token for
Token ID329
Feature activation+9.96e-03

Pos logit contributions
ibi+0.021
edy+0.019
oma+0.019
Tom+0.018
uri+0.018
Neg logit contributions
 Justin-0.039
Justin-0.038
 Mason-0.038
 Manny-0.037
 pap-0.037
Token safe
Token ID3338
Feature activation+1.43e-02

Pos logit contributions
ibi+0.017
round+0.016
oma+0.016
edy+0.016
 sidewalk+0.015
Neg logit contributions
 Justin-0.012
Justin-0.012
 Mason-0.012
 Manny-0.012
 pap-0.011
Tokenkeeping
Token ID19934
Feature activation+5.91e-02

Pos logit contributions
ibi+0.015
edy+0.014
oma+0.013
round+0.013
Tom+0.013
Neg logit contributions
 Justin-0.026
 mer-0.026
 Mason-0.026
 Manny-0.026
 pap-0.025
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
ibi+0.078
oma+0.078
anya+0.074
Tom+0.074
 Kim+0.074
Neg logit contributions
 nood-0.079
 mer-0.079
 Mason-0.077
light-0.076
 Justin-0.076
Token
Token ID220
Feature activation+7.27e-03

Pos logit contributions
ves+0.018
edy+0.018
oma+0.017
anya+0.017
istle+0.017
Neg logit contributions
 Manny-0.018
 Mason-0.017
 Justin-0.017
 pap-0.016
 Macy-0.016
Token
Token ID198
Feature activation+9.22e-03

Pos logit contributions
round+0.011
 chain+0.011
 sidewalk+0.011
edy+0.011
er+0.010
Neg logit contributions
Justin-0.009
 Manny-0.009
 Justin-0.009
 Mason-0.009
 Macy-0.008
Token
Token ID198
Feature activation+5.32e-03

Pos logit contributions
edy+0.015
 automobile+0.015
er+0.015
 automobiles+0.015
 chain+0.014
Neg logit contributions
Justin-0.012
 Mason-0.011
 Manny-0.011
 Justin-0.011
Beck-0.011
TokenWhen
Token ID2215
Feature activation-9.90e-03

Pos logit contributions
."+0.006
ibi+0.006
oma+0.006
edy+0.006
 sidewalk+0.006
Neg logit contributions
Justin-0.009
where-0.009
 Mason-0.008
 Justin-0.008
Beck-0.008
Token.
Token ID13
Feature activation+4.61e-04

Pos logit contributions
ibi+0.001
edy+0.001
 sidewalk+0.001
oma+0.001
 Kim+0.001
Neg logit contributions
Justin-0.004
 Justin-0.004
 Mason-0.004
 pap-0.004
 Manny-0.004
Token
Token ID198
Feature activation+4.99e-03

Pos logit contributions
edy+0.001
ibi+0.001
round+0.000
 sidewalk+0.000
!"+0.000
Neg logit contributions
Justin-0.001
 Justin-0.001
 Mason-0.001
 pap-0.001
Beck-0.001
Token
Token ID198
Feature activation+2.16e-03

Pos logit contributions
edy+0.008
ibi+0.007
round+0.007
er+0.007
 sidewalk+0.007
Neg logit contributions
Justin-0.007
 Justin-0.007
 Mason-0.007
Beck-0.007
Lou-0.007
Token"
Token ID1
Feature activation-5.27e-03

Pos logit contributions
ibi+0.002
edy+0.002
 sidewalk+0.002
."+0.002
oma+0.002
Neg logit contributions
Justin-0.004
 Justin-0.004
 Mason-0.004
 pap-0.004
Louis-0.004
TokenWhich
Token ID13828
Feature activation+1.80e-02

Pos logit contributions
 Justin+0.009
ubb+0.009
Justin+0.009
 Mason+0.009
 Manny+0.009
Neg logit contributions
Tom-0.007
ibi-0.006
edy-0.006
round-0.006
 Karen-0.006
Token locked
Token ID8970
Feature activation+5.73e-02

Pos logit contributions
ibi+0.027
edy+0.026
 sidewalk+0.024
round+0.024
oma+0.024
Neg logit contributions
 pap-0.027
Justin-0.027
 Justin-0.026
where-0.026
 Mason-0.026
Token door
Token ID3420
Feature activation+3.58e-02

Pos logit contributions
ibi+0.068
edy+0.057
keeping+0.054
round+0.054
Twe+0.053
Neg logit contributions
 Justin-0.103
 Manny-0.100
 pap-0.098
Justin-0.098
 Mason-0.097
Token?"
Token ID1701
Feature activation-3.56e-03

Pos logit contributions
ibi+0.051
keeping+0.046
oma+0.046
edy+0.046
 Timothy+0.043
Neg logit contributions
Justin-0.052
 Manny-0.051
 Justin-0.051
 pap-0.051
 Mason-0.051
Token uncle
Token ID7711
Feature activation+1.90e-02

Pos logit contributions
Justin+0.008
 Justin+0.007
 Mason+0.007
 Macy+0.007
 pap+0.007
Neg logit contributions
ibi-0.003
edy-0.003
 Kim-0.003
 sidewalk-0.003
."-0.003
Token Bob
Token ID5811
Feature activation+1.09e-02

Pos logit contributions
 sidewalk+0.018
ibi+0.017
edy+0.017
keeping+0.017
."+0.016
Neg logit contributions
 Justin-0.039
 Mason-0.038
Justin-0.038
 Macy-0.036
 Manny-0.036
Token asked
Token ID1965
Feature activation+2.65e-03

Pos logit contributions
ibi+0.008
edy+0.008
 sidewalk+0.008
 Kim+0.008
."+0.007
Neg logit contributions
Justin-0.024
 Justin-0.024
 Mason-0.023
 pap-0.023
 Macy-0.022
Token got
Token ID1392
Feature activation-1.42e-02

Pos logit contributions
ibi+0.004
edy+0.004
 sidewalk+0.004
oma+0.004
."+0.004
Neg logit contributions
Justin-0.007
 Justin-0.007
 Mason-0.007
 pap-0.006
 Manny-0.006
Token out
Token ID503
Feature activation-2.19e-02

Pos logit contributions
Justin+0.029
 Justin+0.027
Louis+0.027
Dean+0.027
 Mason+0.027
Neg logit contributions
 Kim-0.015
 Sue-0.015
 Su-0.015
 Tommy-0.014
,"-0.014
Token their
Token ID511
Feature activation-2.52e-03

Pos logit contributions
Justin+0.048
 Justin+0.046
 Mason+0.045
Louis+0.045
Dean+0.043
Neg logit contributions
 Kim-0.020
 Su-0.019
 Sue-0.018
."-0.018
ibi-0.018
Token coats
Token ID30720
Feature activation+2.10e-02

Pos logit contributions
Justin+0.004
 Justin+0.004
 Mason+0.004
Louis+0.004
 Macy+0.004
Neg logit contributions
ibi-0.003
edy-0.003
 sidewalk-0.003
 Kim-0.003
."-0.003
Token and
Token ID290
Feature activation+7.79e-03

Pos logit contributions
ibi+0.016
oma+0.015
edy+0.015
anya+0.014
Tom+0.014
Neg logit contributions
 Mason-0.042
Justin-0.042
 Justin-0.042
 Manny-0.042
 mer-0.042
Token umb
Token ID20810
Feature activation+5.51e-02

Pos logit contributions
oma+0.010
ibi+0.010
edy+0.010
."+0.009
 sidewalk+0.009
Neg logit contributions
Justin-0.012
 Justin-0.012
 Mason-0.012
 pap-0.012
 Manny-0.012
Tokenrell
Token ID11252
Feature activation+1.32e-02

Pos logit contributions
oma+0.076
?"+0.074
 mistake+0.074
anya+0.071
edy+0.070
Neg logit contributions
ifying-0.071
 Manny-0.069
arse-0.064
light-0.063
 Mason-0.063
Tokenas
Token ID292
Feature activation+1.56e-04

Pos logit contributions
ibi+0.016
edy+0.016
 Kim+0.015
 Su+0.014
 sidewalk+0.014
Neg logit contributions
 Justin-0.025
Justin-0.025
 Mason-0.024
 pap-0.023
Louis-0.023
Token and
Token ID290
Feature activation+8.49e-03

Pos logit contributions
ibi+0.000
oma+0.000
edy+0.000
 sidewalk+0.000
Tom+0.000
Neg logit contributions
Justin-0.000
 Mason-0.000
 Justin-0.000
 Manny-0.000
 mer-0.000
Token they
Token ID484
Feature activation-8.12e-03

Pos logit contributions
oma+0.011
ibi+0.011
edy+0.011
."+0.010
 sidewalk+0.010
Neg logit contributions
Justin-0.014
 Mason-0.013
 Justin-0.013
 pap-0.013
 Manny-0.013
Token held
Token ID2714
Feature activation+9.21e-03

Pos logit contributions
Justin+0.014
 Mason+0.013
 Justin+0.013
 Manny+0.013
 pap+0.013
Neg logit contributions
ibi-0.010
oma-0.010
 sidewalk-0.010
edy-0.010
."-0.009
Token.
Token ID13
Feature activation-4.49e-03

Pos logit contributions
ibi+0.012
edy+0.012
anya+0.011
oma+0.011
 Karen+0.011
Neg logit contributions
Justin-0.021
 Justin-0.020
 Mason-0.020
where-0.019
 Manny-0.019
Token What
Token ID1867
Feature activation-3.07e-03

Pos logit contributions
Justin+0.008
 Justin+0.007
 Mason+0.007
 Macy+0.007
 pap+0.007
Neg logit contributions
ibi-0.006
edy-0.006
 sidewalk-0.005
 Kim-0.005
."-0.005
Token's
Token ID338
Feature activation+6.90e-03

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 Macy+0.004
Dean+0.004
Neg logit contributions
ibi-0.004
edy-0.004
 sidewalk-0.004
round-0.004
 Karen-0.004
Token going
Token ID1016
Feature activation+6.89e-03

Pos logit contributions
ibi+0.009
round+0.009
 sidewalk+0.008
 Karen+0.008
ongo+0.008
Neg logit contributions
Justin-0.011
 Justin-0.011
 Mason-0.010
 pap-0.010
where-0.010
Token on
Token ID319
Feature activation+1.76e-02

Pos logit contributions
ibi+0.012
edy+0.012
aron+0.011
 Karen+0.011
oma+0.011
Neg logit contributions
where-0.009
Justin-0.008
 Justin-0.008
 Mason-0.008
 mer-0.008
Token here
Token ID994
Feature activation+5.42e-02

Pos logit contributions
ibi+0.025
edy+0.024
round+0.022
 sidewalk+0.021
ongo+0.021
Neg logit contributions
Justin-0.029
 Justin-0.029
where-0.029
 Mason-0.028
 pap-0.026
Token?"
Token ID1701
Feature activation+2.05e-04

Pos logit contributions
ibi+0.082
edy+0.075
anya+0.075
 Karen+0.074
 Kim+0.072
Neg logit contributions
where-0.087
Justin-0.083
 Justin-0.081
 Mason-0.079
 nood-0.077
Token
Token ID198
Feature activation-1.45e-03

Pos logit contributions
edy+0.000
ibi+0.000
round+0.000
 sidewalk+0.000
oma+0.000
Neg logit contributions
Justin-0.000
 Justin-0.000
 Mason-0.000
 Macy-0.000
 Manny-0.000
Token
Token ID198
Feature activation-4.11e-03

Pos logit contributions
Justin+0.002
 Justin+0.002
Beck+0.002
 Macy+0.002
 Mason+0.002
Neg logit contributions
round-0.002
edy-0.002
ibi-0.002
 sidewalk-0.002
 automobiles-0.002
TokenL
Token ID43
Feature activation-5.39e-03

Pos logit contributions
Justin+0.008
 Justin+0.007
 Mason+0.007
 Macy+0.007
 pap+0.007
Neg logit contributions
ibi-0.005
edy-0.005
 Kim-0.004
 sidewalk-0.004
."-0.004
Tokenily
Token ID813
Feature activation+2.26e-02

Pos logit contributions
Justin+0.008
 Justin+0.008
acy+0.008
 Macy+0.007
 Mason+0.007
Neg logit contributions
ller-0.009
 Karen-0.009
 sidewalk-0.008
 Twe-0.008
ibi-0.008
Token friends
Token ID2460
Feature activation+1.83e-02

Pos logit contributions
ibi+0.023
edy+0.022
oma+0.022
 Kim+0.022
."+0.021
Neg logit contributions
Justin-0.024
 Justin-0.023
 Mason-0.023
 pap-0.022
 Manny-0.022
Token cheered
Token ID34767
Feature activation+3.17e-03

Pos logit contributions
ibi+0.024
edy+0.023
 Kim+0.022
oma+0.022
 sidewalk+0.022
Neg logit contributions
Justin-0.031
 Justin-0.030
 Mason-0.030
 mer-0.029
 Manny-0.028
Token for
Token ID329
Feature activation+1.54e-02

Pos logit contributions
ibi+0.002
edy+0.002
 sidewalk+0.002
 Kim+0.002
oma+0.002
Neg logit contributions
Justin-0.007
 Justin-0.007
 Mason-0.007
 Macy-0.007
 pap-0.007
Token him
Token ID683
Feature activation+1.21e-02

Pos logit contributions
ibi+0.024
edy+0.024
 Kim+0.023
 sidewalk+0.023
."+0.023
Neg logit contributions
Justin-0.021
 Justin-0.021
 Mason-0.020
 Macy-0.019
 pap-0.019
Token.
Token ID13
Feature activation+6.72e-03

Pos logit contributions
ibi+0.011
 sidewalk+0.010
edy+0.010
oma+0.010
Tom+0.010
Neg logit contributions
Justin-0.024
 Justin-0.024
 Mason-0.024
 Manny-0.023
 pap-0.023
Token Spe
Token ID2531
Feature activation+5.37e-02

Pos logit contributions
edy+0.009
ibi+0.009
 chain+0.009
anya+0.008
 sidewalk+0.008
Neg logit contributions
 Justin-0.012
 Mason-0.011
Justin-0.011
 Manny-0.011
 Macy-0.010
Tokenedy
Token ID4716
Feature activation+2.41e-02

Pos logit contributions
."+0.042
 Sue+0.040
keeping+0.039
 Sally+0.038
 Kim+0.037
Neg logit contributions
years-0.112
Justin-0.103
where-0.097
Louis-0.095
 Justin-0.095
Token said
Token ID531
Feature activation-1.07e-03

Pos logit contributions
ibi+0.028
 Kim+0.028
 sidewalk+0.027
anya+0.027
 Sue+0.027
Neg logit contributions
Justin-0.041
 Mason-0.040
 Justin-0.040
Dean-0.037
 Macy-0.037
Token,
Token ID11
Feature activation+6.99e-04

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
 mer+0.002
 Manny+0.002
Neg logit contributions
ibi-0.001
edy-0.001
 sidewalk-0.001
oma-0.001
 Kim-0.001
Token "
Token ID366
Feature activation+1.51e-02

Pos logit contributions
ibi+0.001
edy+0.001
oma+0.001
 sidewalk+0.001
keeping+0.001
Neg logit contributions
 Justin-0.001
Justin-0.001
 Mason-0.001
 pap-0.001
 mer-0.001
TokenGood
Token ID10248
Feature activation+5.84e-03

Pos logit contributions
ibi+0.021
edy+0.021
 Kim+0.021
 sidewalk+0.021
."+0.021
Neg logit contributions
Justin-0.023
 Justin-0.022
 Mason-0.022
 Macy-0.021
Louis-0.021
Token"
Token ID1
Feature activation-4.53e-03

Pos logit contributions
ibi+0.006
edy+0.005
ller+0.005
 sidewalk+0.005
anya+0.005
Neg logit contributions
Justin-0.009
 Mason-0.009
 Justin-0.009
Louis-0.009
 Macy-0.009
TokenWhat
Token ID2061
Feature activation-3.05e-03

Pos logit contributions
Justin+0.006
 Justin+0.006
 Mason+0.006
Louis+0.005
Beck+0.005
Neg logit contributions
ibi-0.008
edy-0.007
."-0.007
 sidewalk-0.007
 Kim-0.007
Token is
Token ID318
Feature activation+8.80e-03

Pos logit contributions
Justin+0.005
 Justin+0.004
 Mason+0.004
where+0.004
 mer+0.004
Neg logit contributions
ibi-0.005
edy-0.005
 sidewalk-0.004
round-0.004
 Timothy-0.004
Token going
Token ID1016
Feature activation+2.83e-03

Pos logit contributions
ibi+0.013
round+0.011
edy+0.011
oma+0.011
ongo+0.011
Neg logit contributions
 Justin-0.014
Justin-0.014
 Mason-0.013
 mer-0.013
where-0.013
Token on
Token ID319
Feature activation+1.41e-02

Pos logit contributions
ibi+0.005
edy+0.005
aron+0.004
anya+0.004
 sidewalk+0.004
Neg logit contributions
where-0.004
Justin-0.004
 Justin-0.003
 Mason-0.003
 mer-0.003
Token here
Token ID994
Feature activation+5.34e-02

Pos logit contributions
ibi+0.023
edy+0.021
anya+0.020
 sidewalk+0.020
ongo+0.020
Neg logit contributions
Justin-0.021
 Justin-0.021
where-0.020
 Mason-0.020
 mer-0.018
Token?"
Token ID1701
Feature activation+4.91e-03

Pos logit contributions
ibi+0.065
anya+0.055
edy+0.055
rix+0.053
 Kim+0.052
Neg logit contributions
where-0.102
Justin-0.102
 Justin-0.102
 Mason-0.097
 mer-0.092
Token she
Token ID673
Feature activation+2.51e-02

Pos logit contributions
edy+0.008
ibi+0.008
round+0.007
keeping+0.007
 sidewalk+0.007
Neg logit contributions
 Mason-0.007
 Justin-0.007
 Macy-0.007
Justin-0.007
 pap-0.006
Token asked
Token ID1965
Feature activation+5.33e-03

Pos logit contributions
ibi+0.036
edy+0.034
round+0.034
 sidewalk+0.032
ongo+0.031
Neg logit contributions
 Justin-0.042
Justin-0.042
 pap-0.041
 Mason-0.041
 Macy-0.039
Token.
Token ID13
Feature activation+5.09e-03

Pos logit contributions
ibi+0.006
round+0.005
edy+0.005
 sidewalk+0.005
ller+0.005
Neg logit contributions
 Justin-0.010
Justin-0.010
 Mason-0.010
 Macy-0.010
 Manny-0.009
Token "
Token ID366
Feature activation+2.44e-03

Pos logit contributions
edy+0.008
ibi+0.007
 chain+0.007
round+0.007
 sidewalk+0.007
Neg logit contributions
 Mason-0.008
 Justin-0.008
Justin-0.008
 Dean-0.008
 Macy-0.008
Token,
Token ID11
Feature activation+3.64e-03

Pos logit contributions
ibi+0.021
edy+0.020
 sidewalk+0.019
oma+0.018
 Kim+0.018
Neg logit contributions
Justin-0.041
 Justin-0.040
 Mason-0.039
 pap-0.038
 mer-0.037
Token what
Token ID644
Feature activation-1.03e-02

Pos logit contributions
edy+0.004
 Kim+0.004
 Sue+0.004
."+0.004
!"+0.004
Neg logit contributions
Justin-0.007
 Justin-0.007
 Mason-0.007
 Macy-0.007
Beck-0.006
Token are
Token ID389
Feature activation-1.89e-03

Pos logit contributions
Justin+0.017
 Justin+0.017
 Mason+0.017
 Macy+0.016
 pap+0.016
Neg logit contributions
ibi-0.013
edy-0.013
."-0.013
 Kim-0.013
!"-0.013
Token you
Token ID345
Feature activation+3.00e-02

Pos logit contributions
Justin+0.003
 Justin+0.003
 Mason+0.003
ubb+0.003
 Macy+0.003
Neg logit contributions
edy-0.002
."-0.002
 Kim-0.002
!"-0.002
 Sue-0.002
Token doing
Token ID1804
Feature activation+2.34e-02

Pos logit contributions
ibi+0.047
edy+0.046
round+0.043
 sidewalk+0.043
 Kim+0.042
Neg logit contributions
Justin-0.044
 Justin-0.043
 Mason-0.042
 pap-0.040
 mer-0.040
Token here
Token ID994
Feature activation+5.32e-02

Pos logit contributions
ibi+0.031
edy+0.027
ongo+0.025
 sidewalk+0.025
round+0.025
Neg logit contributions
Justin-0.043
 Justin-0.043
 Mason-0.041
where-0.041
 mer-0.041
Token?"
Token ID1701
Feature activation-1.23e-03

Pos logit contributions
ibi+0.060
edy+0.052
anya+0.048
 sidewalk+0.047
rix+0.047
Neg logit contributions
Justin-0.107
 Justin-0.107
where-0.105
 Mason-0.102
 mer-0.100
Token he
Token ID339
Feature activation+1.96e-02

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
 pap+0.002
 Macy+0.002
Neg logit contributions
ibi-0.002
edy-0.002
 sidewalk-0.002
 Kim-0.002
oma-0.002
Token says
Token ID1139
Feature activation-1.16e-02

Pos logit contributions
ibi+0.022
edy+0.021
 sidewalk+0.019
round+0.019
 Su+0.019
Neg logit contributions
Justin-0.038
 Justin-0.037
 Mason-0.036
 pap-0.035
 Macy-0.035
Token.
Token ID13
Feature activation-4.46e-03

Pos logit contributions
Justin+0.027
 Justin+0.026
 Mason+0.026
 Macy+0.025
 pap+0.025
Neg logit contributions
ibi-0.008
edy-0.008
 sidewalk-0.007
 Kim-0.007
oma-0.007
Token "
Token ID366
Feature activation+4.74e-03

Pos logit contributions
Justin+0.009
 Justin+0.008
 Mason+0.008
 pap+0.008
 mer+0.008
Neg logit contributions
ibi-0.005
edy-0.005
 sidewalk-0.005
oma-0.004
 Karen-0.004
Token"
Token ID1
Feature activation-9.24e-03

Pos logit contributions
ibi+0.001
edy+0.001
round+0.001
 Karen+0.001
oma+0.001
Neg logit contributions
Justin-0.001
 Justin-0.001
 Mason-0.001
Beck-0.001
 Macy-0.001
TokenWhat
Token ID2061
Feature activation-3.43e-03

Pos logit contributions
Justin+0.014
 Justin+0.014
 Mason+0.014
 pap+0.013
 Macy+0.013
Neg logit contributions
ibi-0.013
edy-0.013
 Kim-0.013
 sidewalk-0.012
Tom-0.012
Token is
Token ID318
Feature activation+1.07e-02

Pos logit contributions
Justin+0.006
 Justin+0.006
 Mason+0.006
 Macy+0.006
 pap+0.006
Neg logit contributions
ibi-0.004
edy-0.004
 sidewalk-0.004
 Karen-0.004
round-0.004
Token going
Token ID1016
Feature activation+2.98e-03

Pos logit contributions
ibi+0.015
round+0.014
oma+0.014
edy+0.013
 Karen+0.013
Neg logit contributions
Justin-0.016
 Justin-0.016
 Mason-0.016
 mer-0.015
where-0.015
Token on
Token ID319
Feature activation+1.64e-02

Pos logit contributions
ibi+0.005
edy+0.004
oma+0.004
 sidewalk+0.004
 Karen+0.004
Neg logit contributions
Justin-0.004
 Justin-0.004
where-0.004
 Mason-0.004
 mer-0.004
Token here
Token ID994
Feature activation+5.30e-02

Pos logit contributions
ibi+0.026
edy+0.024
ongo+0.023
round+0.023
anya+0.022
Neg logit contributions
Justin-0.025
 Justin-0.025
 Mason-0.023
where-0.023
 mer-0.022
Token?"
Token ID1701
Feature activation-4.71e-03

Pos logit contributions
ibi+0.073
anya+0.065
edy+0.064
rix+0.063
round+0.062
Neg logit contributions
Justin-0.091
 Justin-0.091
where-0.090
 Mason-0.087
 nood-0.082
Token Lily
Token ID20037
Feature activation+2.81e-02

Pos logit contributions
Justin+0.009
 Justin+0.009
 Mason+0.009
 Manny+0.008
 pap+0.008
Neg logit contributions
ibi-0.005
edy-0.005
round-0.005
oma-0.005
 sidewalk-0.005
Token's
Token ID338
Feature activation+6.48e-03

Pos logit contributions
ibi+0.038
edy+0.036
 Timothy+0.034
round+0.034
 Su+0.033
Neg logit contributions
Justin-0.049
 Justin-0.048
 Mason-0.048
 pap-0.046
 Macy-0.044
Token mom
Token ID1995
Feature activation+2.03e-02

Pos logit contributions
ibi+0.007
edy+0.006
 sidewalk+0.006
."+0.006
 Karen+0.006
Neg logit contributions
Justin-0.012
 Justin-0.012
 Mason-0.012
 pap-0.011
 mer-0.011
Token asked
Token ID1965
Feature activation-4.39e-03

Pos logit contributions
ibi+0.020
edy+0.019
 sidewalk+0.018
."+0.017
 Karen+0.017
Neg logit contributions
Justin-0.041
 Justin-0.040
 Mason-0.040
 Macy-0.038
 pap-0.038
Token "
Token ID366
Feature activation+5.99e-03

Pos logit contributions
Justin+0.008
 Justin+0.008
 pap+0.008
 Mason+0.008
 nood+0.007
Neg logit contributions
ibi-0.006
edy-0.006
round-0.006
keeping-0.006
oma-0.006
TokenWhat
Token ID2061
Feature activation-2.00e-03

Pos logit contributions
ibi+0.009
edy+0.008
 Kim+0.008
 sidewalk+0.008
."+0.008
Neg logit contributions
Justin-0.009
 Justin-0.009
 Mason-0.009
 Macy-0.008
 pap-0.008
Token is
Token ID318
Feature activation+6.59e-03

Pos logit contributions
Justin+0.004
 Justin+0.004
 Mason+0.003
 Macy+0.003
 pap+0.003
Neg logit contributions
ibi-0.002
edy-0.002
 sidewalk-0.002
 Karen-0.002
round-0.002
Token going
Token ID1016
Feature activation+6.02e-03

Pos logit contributions
ibi+0.010
round+0.009
oma+0.009
keeping+0.009
 Su+0.009
Neg logit contributions
Justin-0.010
 Justin-0.010
 Mason-0.009
 mer-0.009
where-0.009
Token on
Token ID319
Feature activation+1.56e-02

Pos logit contributions
ibi+0.010
edy+0.009
oma+0.009
 Karen+0.009
anya+0.009
Neg logit contributions
Justin-0.008
 Justin-0.008
where-0.008
 Mason-0.007
 mer-0.007
Token here
Token ID994
Feature activation+5.28e-02

Pos logit contributions
ibi+0.025
edy+0.023
anya+0.022
round+0.022
rix+0.022
Neg logit contributions
Justin-0.024
 Justin-0.024
 Mason-0.023
where-0.022
 mer-0.020
Token?
Token ID30
Feature activation-2.63e-03

Pos logit contributions
ibi+0.082
anya+0.075
edy+0.075
rix+0.073
 Karen+0.072
Neg logit contributions
where-0.081
 Justin-0.080
Justin-0.080
 Mason-0.077
 nood-0.073
Token Why
Token ID4162
Feature activation+5.16e-03

Pos logit contributions
Justin+0.004
 Justin+0.004
 Mason+0.004
 Macy+0.003
 Manny+0.003
Neg logit contributions
ibi-0.004
edy-0.004
 sidewalk-0.004
oma-0.004
 Karen-0.004
Token are
Token ID389
Feature activation-1.98e-02

Pos logit contributions
round+0.011
keeping+0.011
ibi+0.011
 sidewalk+0.010
oot+0.010
Neg logit contributions
Justin-0.005
 Justin-0.005
 Mason-0.004
 nood-0.004
 Macy-0.004
Token you
Token ID345
Feature activation+2.95e-02

Pos logit contributions
Justin+0.034
 Justin+0.032
 Mason+0.032
Beck+0.031
ubb+0.031
Neg logit contributions
edy-0.025
 Kim-0.025
."-0.024
 Sue-0.024
ibi-0.024
Token not
Token ID407
Feature activation+1.64e-02

Pos logit contributions
ibi+0.044
edy+0.041
round+0.041
 sidewalk+0.040
 Karen+0.040
Neg logit contributions
Justin-0.045
 Justin-0.044
 Mason-0.043
 pap-0.041
 Manny-0.040
Token"
Token ID1
Feature activation-1.02e-02

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
 Macy+0.002
Beck+0.002
Neg logit contributions
edy-0.001
ibi-0.001
 Karen-0.001
 sidewalk-0.001
oma-0.001
TokenWhat
Token ID2061
Feature activation-2.92e-03

Pos logit contributions
 Mason+0.017
 Justin+0.017
Justin+0.017
 pap+0.017
ubb+0.016
Neg logit contributions
Tom-0.014
 Kim-0.012
ibi-0.012
edy-0.012
 Sue-0.012
Token is
Token ID318
Feature activation+6.60e-03

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 Macy+0.005
 pap+0.005
Neg logit contributions
ibi-0.003
edy-0.003
 sidewalk-0.003
 Kim-0.003
 Karen-0.003
Token going
Token ID1016
Feature activation+4.13e-03

Pos logit contributions
ibi+0.008
edy+0.008
 Kim+0.007
 sidewalk+0.007
."+0.007
Neg logit contributions
Justin-0.012
 Justin-0.012
 Mason-0.011
 Macy-0.011
 pap-0.011
Token on
Token ID319
Feature activation+1.59e-02

Pos logit contributions
ibi+0.006
edy+0.006
 sidewalk+0.006
oma+0.005
 Karen+0.005
Neg logit contributions
Justin-0.006
 Justin-0.006
 Mason-0.006
where-0.006
 mer-0.006
Token here
Token ID994
Feature activation+5.25e-02

Pos logit contributions
ibi+0.021
edy+0.020
 sidewalk+0.018
 Kim+0.018
round+0.018
Neg logit contributions
Justin-0.028
 Justin-0.028
 Mason-0.027
where-0.026
 pap-0.025
Token?"
Token ID1701
Feature activation-5.42e-03

Pos logit contributions
ibi+0.067
edy+0.061
round+0.058
anya+0.058
rix+0.057
Neg logit contributions
Justin-0.097
 Justin-0.097
where-0.094
 Mason-0.093
 mer-0.087
Token Mom
Token ID11254
Feature activation+1.01e-02

Pos logit contributions
Justin+0.010
 Justin+0.010
 Mason+0.009
 Macy+0.009
 pap+0.009
Neg logit contributions
edy-0.007
ibi-0.006
 sidewalk-0.006
oma-0.006
round-0.006
Token asks
Token ID7893
Feature activation-5.27e-03

Pos logit contributions
edy+0.013
ibi+0.013
 Timothy+0.012
 sidewalk+0.012
 Twe+0.012
Neg logit contributions
Justin-0.018
 Justin-0.017
 Mason-0.017
 Macy-0.016
 Manny-0.016
Token.
Token ID13
Feature activation+1.18e-03

Pos logit contributions
Justin+0.010
 Justin+0.010
 Mason+0.010
 Macy+0.010
 Manny+0.009
Neg logit contributions
ibi-0.006
edy-0.006
round-0.005
 sidewalk-0.005
ller-0.005
Token
Token ID198
Feature activation+1.27e-03

Pos logit contributions
edy+0.002
ibi+0.002
round+0.002
istle+0.002
oot+0.002
Neg logit contributions
Justin-0.002
 Justin-0.002
 Mason-0.002
 Macy-0.002
 pap-0.002
Token"
Token ID1
Feature activation-1.23e-02

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 pap+0.005
 Macy+0.005
Neg logit contributions
ibi-0.001
 Kim-0.001
Tom-0.001
."-0.001
edy-0.001
TokenWhat
Token ID2061
Feature activation-1.28e-03

Pos logit contributions
 pap+0.023
ubb+0.023
 Mason+0.023
 Justin+0.022
 Manny+0.022
Neg logit contributions
Tom-0.016
Tim-0.013
 Sue-0.012
 Tom-0.012
 lady-0.011
Token is
Token ID318
Feature activation+9.24e-03

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
 Macy+0.002
 pap+0.002
Neg logit contributions
ibi-0.001
edy-0.001
 Kim-0.001
 sidewalk-0.001
."-0.001
Token going
Token ID1016
Feature activation+5.15e-03

Pos logit contributions
ibi+0.013
edy+0.012
oma+0.011
 Karen+0.011
 Kim+0.011
Neg logit contributions
Justin-0.015
 Justin-0.015
 Mason-0.014
 Macy-0.014
 pap-0.014
Token on
Token ID319
Feature activation+1.76e-02

Pos logit contributions
ibi+0.008
edy+0.008
oma+0.007
 Karen+0.007
 sidewalk+0.007
Neg logit contributions
Justin-0.007
 Justin-0.007
where-0.007
 Mason-0.007
 mer-0.007
Token here
Token ID994
Feature activation+5.22e-02

Pos logit contributions
ibi+0.023
edy+0.021
anya+0.019
round+0.019
 sidewalk+0.019
Neg logit contributions
Justin-0.032
 Justin-0.032
 Mason-0.031
where-0.030
 Macy-0.029
Token?"
Token ID1701
Feature activation-7.80e-03

Pos logit contributions
ibi+0.064
edy+0.058
anya+0.056
rix+0.054
round+0.054
Neg logit contributions
Justin-0.097
 Justin-0.097
where-0.095
 Mason-0.094
 nood-0.089
Token Mom
Token ID11254
Feature activation+8.55e-03

Pos logit contributions
Justin+0.017
ifying+0.017
ubb+0.016
Louis+0.016
 Justin+0.016
Neg logit contributions
 Sue-0.007
 Kim-0.007
 Tom-0.007
 Sally-0.006
Tom-0.006
Token asks
Token ID7893
Feature activation-9.27e-03

Pos logit contributions
ibi+0.009
edy+0.009
 sidewalk+0.008
 Kim+0.008
 Karen+0.008
Neg logit contributions
Justin-0.017
 Justin-0.016
 Mason-0.016
 Macy-0.015
 pap-0.015
Token.
Token ID13
Feature activation-2.31e-03

Pos logit contributions
Justin+0.023
 Justin+0.022
 Mason+0.022
Louis+0.021
 pap+0.021
Neg logit contributions
."-0.004
 Kim-0.004
ibi-0.004
 Sue-0.004
edy-0.004
Token
Token ID198
Feature activation-5.16e-04

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 pap+0.004
 Macy+0.004
Neg logit contributions
edy-0.002
ibi-0.002
 sidewalk-0.002
round-0.002
 Karen-0.002
Token "
Token ID366
Feature activation+9.88e-03

Pos logit contributions
Justin+0.005
 pap+0.005
 Mason+0.005
 Justin+0.005
 nood+0.005
Neg logit contributions
ibi-0.004
edy-0.004
round-0.003
oma-0.003
keeping-0.003
TokenWhat
Token ID2061
Feature activation-4.92e-04

Pos logit contributions
ibi+0.015
edy+0.015
."+0.014
 sidewalk+0.014
 Kim+0.014
Neg logit contributions
Justin-0.015
 Justin-0.014
 Mason-0.014
 Macy-0.013
Louis-0.013
Token is
Token ID318
Feature activation+7.90e-03

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
 Macy+0.001
 pap+0.001
Neg logit contributions
ibi-0.001
edy-0.001
 sidewalk-0.001
round-0.001
 Karen-0.001
Token going
Token ID1016
Feature activation+4.96e-03

Pos logit contributions
ibi+0.011
edy+0.010
round+0.010
oma+0.010
 Karen+0.010
Neg logit contributions
Justin-0.013
 Justin-0.013
 Mason-0.012
 mer-0.012
 pap-0.012
Token on
Token ID319
Feature activation+1.73e-02

Pos logit contributions
ibi+0.008
edy+0.008
oma+0.008
aron+0.007
 Karen+0.007
Neg logit contributions
Justin-0.006
where-0.006
 Justin-0.006
 Mason-0.006
 mer-0.006
Token here
Token ID994
Feature activation+5.16e-02

Pos logit contributions
ibi+0.028
edy+0.025
ongo+0.024
anya+0.024
rix+0.023
Neg logit contributions
 Justin-0.026
Justin-0.026
 Mason-0.025
where-0.025
 mer-0.023
Token?
Token ID30
Feature activation-3.28e-03

Pos logit contributions
ibi+0.078
edy+0.073
anya+0.071
rix+0.069
ongo+0.069
Neg logit contributions
where-0.080
 Justin-0.078
Justin-0.078
 Mason-0.076
 mer-0.071
Token Why
Token ID4162
Feature activation+5.04e-03

Pos logit contributions
Justin+0.005
 Justin+0.004
 Mason+0.004
 Macy+0.004
 Manny+0.004
Neg logit contributions
ibi-0.005
edy-0.005
 sidewalk-0.005
oma-0.005
 Karen-0.005
Token are
Token ID389
Feature activation-1.72e-02

Pos logit contributions
round+0.010
ibi+0.010
 sidewalk+0.010
oot+0.010
keeping+0.010
Neg logit contributions
Justin-0.005
 Justin-0.005
 Mason-0.005
 Macy-0.004
 nood-0.004
Token you
Token ID345
Feature activation+3.23e-02

Pos logit contributions
Justin+0.032
 Justin+0.030
 Mason+0.030
ubb+0.029
Beck+0.029
Neg logit contributions
 Kim-0.020
edy-0.020
 Sue-0.020
."-0.020
 Tommy-0.019
Token fighting
Token ID4330
Feature activation+3.26e-02

Pos logit contributions
ibi+0.049
edy+0.046
round+0.046
 sidewalk+0.044
 Karen+0.044
Neg logit contributions
Justin-0.048
 Justin-0.046
 Mason-0.046
 pap-0.045
 mer-0.043
Token of
Token ID286
Feature activation+8.86e-04

Pos logit contributions
edy+0.024
ibi+0.023
 Timothy+0.021
oma+0.021
 Karen+0.021
Neg logit contributions
 Mason-0.032
Justin-0.032
 Manny-0.031
 Justin-0.031
 mer-0.031
Token the
Token ID262
Feature activation-6.50e-04

Pos logit contributions
ibi+0.001
edy+0.001
ongo+0.001
oot+0.001
ller+0.001
Neg logit contributions
 Justin-0.001
 Mason-0.001
 mer-0.001
Justin-0.001
 pap-0.001
Token truck
Token ID7779
Feature activation+4.38e-03

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
where+0.001
 pap+0.001
Neg logit contributions
ibi-0.001
edy-0.001
oot-0.001
 Karen-0.001
oma-0.001
Token.
Token ID13
Feature activation+2.24e-03

Pos logit contributions
ibi+0.003
edy+0.003
 Karen+0.003
oma+0.003
 Kim+0.003
Neg logit contributions
Justin-0.009
 Justin-0.009
 Mason-0.009
 Manny-0.009
 mer-0.009
Token He
Token ID679
Feature activation+1.17e-02

Pos logit contributions
ibi+0.003
edy+0.003
 Karen+0.003
 sidewalk+0.003
oma+0.003
Neg logit contributions
Justin-0.004
 Justin-0.004
 Mason-0.004
 Macy-0.004
 Manny-0.004
Token slammed
Token ID21376
Feature activation+5.13e-02

Pos logit contributions
ibi+0.016
 Karen+0.015
 sidewalk+0.015
 Timothy+0.014
 Su+0.014
Neg logit contributions
Justin-0.019
 Justin-0.019
 mer-0.018
 nood-0.018
 Mason-0.018
Token the
Token ID262
Feature activation-3.52e-04

Pos logit contributions
ibi+0.086
ongo+0.081
anya+0.074
 sidewalk+0.073
ooters+0.073
Neg logit contributions
 Justin-0.068
 mer-0.067
 Mason-0.065
Justin-0.064
 Manny-0.063
Token door
Token ID3420
Feature activation+1.95e-02

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
 mer+0.001
 pap+0.001
Neg logit contributions
ibi-0.000
edy-0.000
 Karen-0.000
 Su-0.000
 Timothy-0.000
Token and
Token ID290
Feature activation+6.89e-03

Pos logit contributions
ibi+0.020
edy+0.019
oma+0.018
 Timothy+0.017
 Karen+0.017
Neg logit contributions
Justin-0.036
 Justin-0.036
 Mason-0.035
 Manny-0.035
 mer-0.035
Token locked
Token ID8970
Feature activation+4.60e-02

Pos logit contributions
ibi+0.009
edy+0.008
oma+0.008
 sidewalk+0.008
."+0.007
Neg logit contributions
Justin-0.012
 Justin-0.012
 Mason-0.012
 mer-0.011
 pap-0.011
Token it
Token ID340
Feature activation-1.95e-03

Pos logit contributions
ibi+0.068
ongo+0.065
round+0.059
 sidewalk+0.059
grand+0.059
Neg logit contributions
 Justin-0.073
 Mason-0.071
 Manny-0.071
Justin-0.067
 Macy-0.067
Token jackets
Token ID35129
Feature activation+2.65e-02

Pos logit contributions
edy+0.033
ibi+0.032
oma+0.032
 Kim+0.031
 Karen+0.031
Neg logit contributions
Justin-0.049
 Mason-0.047
 Justin-0.047
 mer-0.047
where-0.046
Token became
Token ID2627
Feature activation-1.38e-02

Pos logit contributions
ibi+0.036
oma+0.035
edy+0.035
 Kim+0.035
anya+0.035
Neg logit contributions
Justin-0.038
 Mason-0.037
 mer-0.037
 Justin-0.037
 nood-0.036
Token heavy
Token ID4334
Feature activation+8.62e-03

Pos logit contributions
Justin+0.021
 Justin+0.021
 Mason+0.020
 pap+0.019
 mer+0.019
Neg logit contributions
ibi-0.020
edy-0.019
oma-0.019
 sidewalk-0.019
 Kim-0.018
Token and
Token ID290
Feature activation+1.69e-04

Pos logit contributions
ibi+0.006
edy+0.005
 Kim+0.005
 sidewalk+0.005
oma+0.005
Neg logit contributions
Justin-0.020
 Justin-0.019
 Mason-0.019
 pap-0.019
 Macy-0.019
Token their
Token ID511
Feature activation+9.65e-03

Pos logit contributions
ibi+0.000
edy+0.000
 sidewalk+0.000
oma+0.000
."+0.000
Neg logit contributions
Justin-0.000
 Justin-0.000
 Mason-0.000
 pap-0.000
 Macy-0.000
Token umb
Token ID20810
Feature activation+5.13e-02

Pos logit contributions
ibi+0.013
edy+0.013
 Kim+0.012
."+0.012
oma+0.012
Neg logit contributions
Justin-0.016
 Justin-0.015
 Mason-0.015
 pap-0.014
 Macy-0.014
Tokenrell
Token ID11252
Feature activation+1.64e-02

Pos logit contributions
oma+0.066
edy+0.064
anya+0.064
?"+0.063
 mistake+0.062
Neg logit contributions
ifying-0.077
ubb-0.066
 Manny-0.066
light-0.065
years-0.063
Tokenas
Token ID292
Feature activation-1.82e-04

Pos logit contributions
ibi+0.021
edy+0.020
 Kim+0.019
."+0.019
 Su+0.018
Neg logit contributions
Justin-0.030
 Justin-0.030
 Mason-0.029
 pap-0.027
 Macy-0.027
Token began
Token ID2540
Feature activation-1.50e-02

Pos logit contributions
Justin+0.000
 Justin+0.000
 Mason+0.000
 Manny+0.000
 mer+0.000
Neg logit contributions
ibi-0.000
edy-0.000
oma-0.000
 sidewalk-0.000
 Kim-0.000
Token to
Token ID284
Feature activation-2.54e-02

Pos logit contributions
Justin+0.030
 Justin+0.028
 Mason+0.027
Louis+0.027
Dean+0.026
Neg logit contributions
."-0.016
ibi-0.016
 Kim-0.016
 Su-0.016
edy-0.015
Token dro
Token ID3102
Feature activation+2.16e-03

Pos logit contributions
Justin+0.043
 Justin+0.042
 Mason+0.040
Louis+0.038
years+0.038
Neg logit contributions
."-0.033
ibi-0.033
!"-0.032
 sidewalk-0.032
 Kim-0.032
Token"
Token ID1
Feature activation-8.27e-03

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
ubb+0.002
 mer+0.002
Neg logit contributions
Tom-0.000
 Kim-0.000
 Sue-0.000
."-0.000
!"-0.000
TokenWhat
Token ID2061
Feature activation-4.21e-03

Pos logit contributions
ubb+0.015
oss+0.015
ifying+0.015
 Mason+0.015
 pap+0.015
Neg logit contributions
Tom-0.011
Tim-0.009
 Sue-0.009
 Kim-0.008
 Tom-0.008
Token is
Token ID318
Feature activation+8.45e-03

Pos logit contributions
Justin+0.008
 Justin+0.008
 Mason+0.007
 Macy+0.007
 pap+0.007
Neg logit contributions
ibi-0.005
edy-0.005
 Kim-0.004
."-0.004
 sidewalk-0.004
Token going
Token ID1016
Feature activation+4.60e-03

Pos logit contributions
ibi+0.010
edy+0.010
 Kim+0.009
 sidewalk+0.009
."+0.009
Neg logit contributions
Justin-0.015
 Justin-0.015
 Mason-0.015
 Macy-0.014
 pap-0.014
Token on
Token ID319
Feature activation+1.64e-02

Pos logit contributions
ibi+0.007
edy+0.007
 sidewalk+0.006
oma+0.006
 Karen+0.006
Neg logit contributions
Justin-0.007
 Justin-0.007
where-0.006
 Mason-0.006
 mer-0.006
Token here
Token ID994
Feature activation+5.11e-02

Pos logit contributions
ibi+0.023
edy+0.022
 sidewalk+0.021
 Kim+0.020
round+0.020
Neg logit contributions
Justin-0.027
 Justin-0.027
 Mason-0.026
where-0.025
 Macy-0.024
Token?"
Token ID1701
Feature activation-7.72e-03

Pos logit contributions
ibi+0.066
edy+0.061
anya+0.057
rix+0.057
round+0.056
Neg logit contributions
Justin-0.092
 Justin-0.092
where-0.089
 Mason-0.088
 nood-0.083
Token she
Token ID673
Feature activation+1.85e-02

Pos logit contributions
ifying+0.015
Justin+0.015
ubb+0.015
 mer+0.014
 nood+0.014
Neg logit contributions
 Sue-0.009
 Kim-0.009
 Tom-0.009
,"-0.008
Tom-0.008
Token asks
Token ID7893
Feature activation-5.73e-03

Pos logit contributions
ibi+0.023
edy+0.023
round+0.022
 sidewalk+0.021
 Timothy+0.021
Neg logit contributions
Justin-0.033
 Justin-0.033
 Mason-0.032
 pap-0.031
 Macy-0.031
Token.
Token ID13
Feature activation-4.48e-03

Pos logit contributions
Justin+0.014
 Justin+0.014
 Mason+0.014
ifying+0.013
Louis+0.013
Neg logit contributions
 Kim-0.003
 Sue-0.003
."-0.003
,"-0.002
ibi-0.002
Token She
Token ID1375
Feature activation+1.60e-02

Pos logit contributions
Justin+0.010
 Justin+0.010
 Mason+0.009
 Macy+0.009
 pap+0.009
Neg logit contributions
ibi-0.003
edy-0.003
 Kim-0.003
."-0.003
 sidewalk-0.003
Token ended
Token ID4444
Feature activation+1.51e-02

Pos logit contributions
Justin+0.012
 Justin+0.012
 Mason+0.011
where+0.011
 pap+0.011
Neg logit contributions
ibi-0.014
oma-0.013
round-0.013
edy-0.013
 sidewalk-0.013
Token up
Token ID510
Feature activation-6.50e-03

Pos logit contributions
ibi+0.024
round+0.022
anya+0.022
ongo+0.022
Tom+0.021
Neg logit contributions
 Justin-0.020
 Manny-0.020
Justin-0.019
 Mason-0.019
 mer-0.019
Token in
Token ID287
Feature activation+1.40e-03

Pos logit contributions
Justin+0.011
 Justin+0.011
 Mason+0.011
 mer+0.010
where+0.010
Neg logit contributions
ibi-0.008
edy-0.008
oma-0.007
anya-0.007
round-0.007
Token such
Token ID884
Feature activation+1.42e-02

Pos logit contributions
ibi+0.002
edy+0.002
 sidewalk+0.001
oma+0.001
 Kim+0.001
Neg logit contributions
Justin-0.003
 Justin-0.003
 Mason-0.003
 pap-0.002
 Manny-0.002
Token bad
Token ID2089
Feature activation+1.65e-02

Pos logit contributions
ibi+0.022
edy+0.021
 sidewalk+0.021
oma+0.021
 Kim+0.021
Neg logit contributions
Justin-0.020
 Justin-0.020
 Mason-0.019
 pap-0.018
 Manny-0.018
Token situations
Token ID7445
Feature activation+5.10e-02

Pos logit contributions
ibi+0.019
edy+0.019
 Kim+0.018
 sidewalk+0.018
."+0.018
Neg logit contributions
Justin-0.030
 Justin-0.029
 Mason-0.029
 pap-0.027
 Macy-0.027
Token.
Token ID13
Feature activation+4.14e-03

Pos logit contributions
ibi+0.056
oma+0.051
edy+0.050
anya+0.049
 Kim+0.047
Neg logit contributions
Justin-0.095
 Justin-0.093
 Mason-0.091
where-0.091
 mer-0.090
Token
Token ID220
Feature activation+8.47e-03

Pos logit contributions
edy+0.006
ibi+0.005
 sidewalk+0.005
oot+0.005
anya+0.005
Neg logit contributions
 Justin-0.007
Justin-0.007
 Mason-0.007
 Manny-0.007
 Macy-0.006
Token
Token ID198
Feature activation+5.11e-03

Pos logit contributions
edy+0.012
round+0.012
 chain+0.012
 sidewalk+0.012
er+0.012
Neg logit contributions
Justin-0.013
 Justin-0.012
 Mason-0.012
 Manny-0.012
 Macy-0.011
Token
Token ID198
Feature activation-2.64e-03

Pos logit contributions
edy+0.007
 sidewalk+0.006
 automobiles+0.006
er+0.006
ibi+0.006
Neg logit contributions
Justin-0.009
 Justin-0.009
Beck-0.008
 Mason-0.008
 Manny-0.008
TokenThe
Token ID464
Feature activation+2.91e-03

Pos logit contributions
Justin+0.004
 Justin+0.004
 Mason+0.004
Beck+0.004
Louis+0.004
Neg logit contributions
ibi-0.004
edy-0.004
 sidewalk-0.004
."-0.003
 Kim-0.003
Token are
Token ID389
Feature activation-1.38e-02

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 pap+0.005
where+0.005
Neg logit contributions
ibi-0.004
edy-0.004
oma-0.004
anya-0.004
 sidewalk-0.004
Token still
Token ID991
Feature activation+7.30e-03

Pos logit contributions
Justin+0.020
 Justin+0.019
 Mason+0.018
 pap+0.017
 Macy+0.017
Neg logit contributions
ibi-0.022
edy-0.021
oma-0.020
 Kim-0.020
 sidewalk-0.020
Token many
Token ID867
Feature activation+2.39e-02

Pos logit contributions
ibi+0.013
ongo+0.012
edy+0.012
oot+0.012
anya+0.012
Neg logit contributions
 mer-0.009
Justin-0.009
where-0.009
 Justin-0.008
 pap-0.008
Token things
Token ID1243
Feature activation+1.71e-02

Pos logit contributions
ibi+0.038
edy+0.037
ongo+0.036
oma+0.034
ller+0.034
Neg logit contributions
 mer-0.032
Justin-0.031
 pap-0.029
ubb-0.029
where-0.029
Token to
Token ID284
Feature activation+7.21e-04

Pos logit contributions
ibi+0.027
oma+0.024
anya+0.024
 Kim+0.023
edy+0.023
Neg logit contributions
Justin-0.025
 Justin-0.025
 Mason-0.024
 mer-0.024
 nood-0.023
Token explore
Token ID7301
Feature activation+5.09e-02

Pos logit contributions
ibi+0.001
oma+0.001
edy+0.001
/+0.001
rix+0.001
Neg logit contributions
Justin-0.001
 Justin-0.001
 Mason-0.001
 mer-0.001
where-0.001
Token and
Token ID290
Feature activation+1.08e-02

Pos logit contributions
ibi+0.080
ongo+0.070
anya+0.069
edy+0.065
oma+0.064
Neg logit contributions
where-0.085
 Justin-0.083
Justin-0.082
 mer-0.080
 Mason-0.078
Token have
Token ID423
Feature activation+1.25e-02

Pos logit contributions
ibi+0.015
edy+0.014
oma+0.014
 Kim+0.013
 Karen+0.013
Neg logit contributions
Justin-0.017
 Justin-0.017
 Mason-0.016
 mer-0.016
 pap-0.016
Token fun
Token ID1257
Feature activation+1.86e-02

Pos logit contributions
ibi+0.018
oma+0.017
edy+0.017
round+0.016
oot+0.016
Neg logit contributions
Justin-0.019
 Justin-0.019
 mer-0.018
 Mason-0.018
 pap-0.018
Token with
Token ID351
Feature activation+1.68e-02

Pos logit contributions
ibi+0.020
edy+0.019
oma+0.018
anya+0.018
ongo+0.018
Neg logit contributions
 mer-0.036
Justin-0.035
 Justin-0.034
 Mason-0.034
where-0.034
Token."
Token ID526
Feature activation-1.72e-03

Pos logit contributions
ibi+0.026
ongo+0.024
edy+0.023
round+0.023
 sidewalk+0.023
Neg logit contributions
 Justin-0.026
Justin-0.026
 Mason-0.025
 pap-0.025
 Macy-0.024
Token.
Token ID13
Feature activation-3.00e-03

Pos logit contributions
ibi+0.008
edy+0.007
 Karen+0.007
oma+0.007
anya+0.007
Neg logit contributions
Justin-0.015
 Justin-0.015
 Mason-0.015
 Manny-0.014
where-0.014
Token What
Token ID1867
Feature activation-2.16e-03

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 pap+0.005
 Macy+0.005
Neg logit contributions
ibi-0.003
 Kim-0.003
edy-0.003
."-0.003
 sidewalk-0.003
Token is
Token ID318
Feature activation+9.53e-04

Pos logit contributions
Justin+0.004
 Justin+0.004
 Mason+0.003
 Macy+0.003
 pap+0.003
Neg logit contributions
ibi-0.003
edy-0.003
 sidewalk-0.003
 Karen-0.003
oma-0.003
Token going
Token ID1016
Feature activation+3.84e-03

Pos logit contributions
ibi+0.001
edy+0.001
 Kim+0.001
 sidewalk+0.001
."+0.001
Neg logit contributions
Justin-0.002
 Justin-0.002
 Mason-0.002
 Macy-0.002
 pap-0.002
Token on
Token ID319
Feature activation+1.26e-02

Pos logit contributions
ibi+0.006
edy+0.006
oma+0.005
 Karen+0.005
 sidewalk+0.005
Neg logit contributions
Justin-0.006
 Justin-0.005
where-0.005
 Mason-0.005
 mer-0.005
Token here
Token ID994
Feature activation+5.08e-02

Pos logit contributions
ibi+0.018
edy+0.017
round+0.016
 sidewalk+0.015
rix+0.015
Neg logit contributions
Justin-0.021
 Justin-0.021
 Mason-0.020
where-0.020
 Macy-0.019
Token?"
Token ID1701
Feature activation-1.92e-04

Pos logit contributions
ibi+0.078
edy+0.071
 Karen+0.071
anya+0.071
ongo+0.068
Neg logit contributions
where-0.079
Justin-0.077
 Justin-0.076
 Mason-0.074
 nood-0.071
Token
Token ID198
Feature activation+2.85e-03

Pos logit contributions
Justin+0.000
 Justin+0.000
 Mason+0.000
 Manny+0.000
 Macy+0.000
Neg logit contributions
edy-0.000
round-0.000
ibi-0.000
oma-0.000
er-0.000
Token
Token ID198
Feature activation+1.49e-03

Pos logit contributions
round+0.005
edy+0.005
ibi+0.004
er+0.004
aron+0.004
Neg logit contributions
Justin-0.004
Beck-0.004
 Justin-0.004
 Manny-0.004
 Milo-0.004
TokenTim
Token ID14967
Feature activation+1.88e-02

Pos logit contributions
edy+0.002
ibi+0.002
."+0.002
oma+0.002
round+0.002
Neg logit contributions
Justin-0.003
 Justin-0.003
 Mason-0.003
Beck-0.003
 Manny-0.003
Token and
Token ID290
Feature activation-7.77e-03

Pos logit contributions
 Timothy+0.018
 Twe+0.017
edy+0.017
 Karen+0.017
ibi+0.017
Neg logit contributions
Justin-0.040
 Justin-0.038
 Mason-0.037
Dean-0.036
 Dean-0.035
Token,
Token ID11
Feature activation+4.58e-03

Pos logit contributions
ibi+0.013
edy+0.012
anya+0.012
 sidewalk+0.012
oma+0.011
Neg logit contributions
Justin-0.017
 Mason-0.017
 Justin-0.017
 nood-0.017
 mer-0.016
Token what
Token ID644
Feature activation-7.39e-03

Pos logit contributions
edy+0.005
 Kim+0.005
 Sue+0.005
."+0.005
!"+0.005
Neg logit contributions
Justin-0.008
 Justin-0.008
 Mason-0.008
 Macy-0.008
Beck-0.008
Token are
Token ID389
Feature activation-3.21e-03

Pos logit contributions
Justin+0.012
 Justin+0.012
 Mason+0.012
 Macy+0.011
 pap+0.011
Neg logit contributions
ibi-0.010
edy-0.010
 sidewalk-0.009
 Kim-0.009
."-0.009
Token you
Token ID345
Feature activation+3.16e-02

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 Macy+0.004
 pap+0.004
Neg logit contributions
ibi-0.005
edy-0.005
 Kim-0.004
 sidewalk-0.004
."-0.004
Token doing
Token ID1804
Feature activation+2.26e-02

Pos logit contributions
ibi+0.049
edy+0.048
round+0.046
 sidewalk+0.045
 Kim+0.045
Neg logit contributions
Justin-0.046
 Mason-0.044
 Justin-0.044
 mer-0.042
 pap-0.042
Token here
Token ID994
Feature activation+5.07e-02

Pos logit contributions
ibi+0.027
edy+0.023
ongo+0.022
 sidewalk+0.021
 Kim+0.021
Neg logit contributions
Justin-0.044
 Justin-0.044
where-0.042
 mer-0.042
 Mason-0.042
Token?"
Token ID1701
Feature activation-1.51e-03

Pos logit contributions
ibi+0.054
edy+0.046
anya+0.045
 sidewalk+0.043
rix+0.042
Neg logit contributions
Justin-0.103
 Justin-0.103
where-0.101
 Mason-0.100
 mer-0.097
Token It
Token ID632
Feature activation+5.82e-03

Pos logit contributions
Justin+0.003
 Justin+0.003
 Mason+0.003
 Manny+0.003
 pap+0.003
Neg logit contributions
ibi-0.001
edy-0.001
oma-0.001
 sidewalk-0.001
round-0.001
Token was
Token ID373
Feature activation-4.18e-03

Pos logit contributions
ibi+0.006
edy+0.006
 sidewalk+0.006
 Karen+0.006
 Timothy+0.006
Neg logit contributions
Justin-0.011
 Justin-0.011
 Mason-0.011
 mer-0.010
 pap-0.010
Token Dad
Token ID17415
Feature activation-2.10e-03

Pos logit contributions
Justin+0.007
 Justin+0.007
 Mason+0.007
 pap+0.007
 mer+0.006
Neg logit contributions
edy-0.006
ibi-0.006
round-0.005
 sidewalk-0.005
."-0.005
Token.
Token ID13
Feature activation-3.93e-03

Pos logit contributions
Justin+0.005
 Justin+0.004
 Mason+0.004
 Macy+0.004
Louis+0.004
Neg logit contributions
."-0.002
ibi-0.002
edy-0.002
 Kim-0.002
!"-0.002
Token and
Token ID290
Feature activation-2.57e-03

Pos logit contributions
Justin+0.033
 Justin+0.032
 Mason+0.031
 Macy+0.031
Louis+0.030
Neg logit contributions
 Kim-0.015
ibi-0.015
."-0.014
 Sue-0.014
edy-0.014
Token looked
Token ID3114
Feature activation+9.65e-03

Pos logit contributions
Justin+0.005
 Justin+0.004
 Mason+0.004
ubb+0.004
Louis+0.004
Neg logit contributions
edy-0.003
ibi-0.003
 Kim-0.003
 Sue-0.003
 Karen-0.003
Token at
Token ID379
Feature activation-5.32e-04

Pos logit contributions
ibi+0.016
oma+0.015
Tom+0.015
edy+0.015
 sidewalk+0.015
Neg logit contributions
Justin-0.013
 Justin-0.013
 Mason-0.012
 mer-0.012
 pap-0.012
Token the
Token ID262
Feature activation+8.61e-03

Pos logit contributions
 Justin+0.001
 Mason+0.001
Justin+0.001
 pap+0.001
 Manny+0.001
Neg logit contributions
edy-0.001
ibi-0.001
 sidewalk-0.001
ongo-0.001
round-0.001
Token ost
Token ID23619
Feature activation-9.96e-03

Pos logit contributions
edy+0.013
ibi+0.012
 Jerry+0.012
 Kim+0.012
 Timothy+0.012
Neg logit contributions
 pap-0.013
 Mason-0.012
Justin-0.012
 mer-0.012
ubb-0.012
Tokenric
Token ID1173
Feature activation-5.74e-02

Pos logit contributions
Justin+0.011
 Justin+0.010
 Mason+0.010
 pap+0.010
 Manny+0.010
Neg logit contributions
ibi-0.018
edy-0.018
 Kim-0.017
 sidewalk-0.017
Tom-0.017
Tokenhes
Token ID956
Feature activation-1.87e-02

Pos logit contributions
Justin+0.092
 Justin+0.088
 Mason+0.086
 Macy+0.082
Dean+0.081
Neg logit contributions
ibi-0.082
edy-0.080
 Karen-0.078
 sidewalk-0.076
 Timothy-0.076
Token.
Token ID13
Feature activation-1.51e-03

Pos logit contributions
Justin+0.038
 Justin+0.038
 Mason+0.038
 mer+0.036
 Macy+0.036
Neg logit contributions
ibi-0.017
edy-0.016
 sidewalk-0.015
 Karen-0.014
oma-0.014
Token They
Token ID1119
Feature activation+1.29e-02

Pos logit contributions
 Justin+0.002
 Mason+0.002
Justin+0.002
 Manny+0.002
 Macy+0.002
Neg logit contributions
ibi-0.002
oot-0.002
edy-0.002
ongo-0.002
!"-0.002
Token saw
Token ID2497
Feature activation-1.38e-03

Pos logit contributions
ibi+0.018
edy+0.018
 sidewalk+0.017
round+0.017
oma+0.017
Neg logit contributions
Justin-0.020
 Justin-0.020
 Mason-0.020
 pap-0.019
 mer-0.018
Token one
Token ID530
Feature activation-1.23e-02

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
 Macy+0.002
 pap+0.002
Neg logit contributions
ibi-0.002
edy-0.002
 Kim-0.002
 sidewalk-0.002
."-0.002
Token that
Token ID326
Feature activation+5.96e-03

Pos logit contributions
 mer+0.005
 Justin+0.005
 Mason+0.005
ifying+0.005
Justin+0.005
Neg logit contributions
ibi-0.004
edy-0.004
ongo-0.004
oma-0.004
 Timothy-0.004
Token said
Token ID531
Feature activation+2.90e-03

Pos logit contributions
ibi+0.007
edy+0.006
round+0.006
ongo+0.006
oma+0.006
Neg logit contributions
 Mason-0.011
 pap-0.011
 Justin-0.011
 mer-0.010
Justin-0.010
Token "
Token ID366
Feature activation-3.07e-03

Pos logit contributions
ibi+0.003
edy+0.003
 sidewalk+0.003
round+0.003
ongo+0.003
Neg logit contributions
 Mason-0.005
 Justin-0.005
Justin-0.005
 pap-0.005
 mer-0.005
TokenO
Token ID46
Feature activation+2.35e-02

Pos logit contributions
Justin+0.004
 Mason+0.004
 Justin+0.004
where+0.004
Louis+0.004
Neg logit contributions
edy-0.005
ibi-0.004
 sidewalk-0.004
 Kim-0.004
."-0.004
Tokenst
Token ID301
Feature activation-8.43e-03

Pos logit contributions
 sidewalk+0.031
ibi+0.031
edy+0.031
 Kim+0.030
 Sue+0.030
Neg logit contributions
Justin-0.034
 Mason-0.034
where-0.032
 Justin-0.032
 pap-0.031
Tokenric
Token ID1173
Feature activation-5.68e-02

Pos logit contributions
 Mason+0.013
Justin+0.013
 Justin+0.013
 pap+0.012
 Macy+0.012
Neg logit contributions
ibi-0.012
edy-0.011
Tom-0.011
 sidewalk-0.011
 Karen-0.011
Tokenhes
Token ID956
Feature activation-2.08e-02

Pos logit contributions
Justin+0.092
 Justin+0.088
 Mason+0.086
 Macy+0.081
Louis+0.081
Neg logit contributions
ibi-0.083
edy-0.079
 Karen-0.077
 sidewalk-0.076
 Kim-0.076
Token".
Token ID1911
Feature activation+1.25e-02

Pos logit contributions
 Mason+0.036
Justin+0.036
 Justin+0.036
where+0.034
ifying+0.034
Neg logit contributions
ibi-0.025
edy-0.025
 sidewalk-0.023
anya-0.022
 Karen-0.022
Token Sara
Token ID24799
Feature activation+1.92e-02

Pos logit contributions
edy+0.015
ibi+0.015
oot+0.015
istle+0.014
anya+0.014
Neg logit contributions
 Mason-0.022
 Justin-0.020
Justin-0.020
 pap-0.020
 Macy-0.020
Token and
Token ID290
Feature activation+9.72e-03

Pos logit contributions
 Timothy+0.021
ibi+0.021
 Kim+0.020
 sidewalk+0.020
edy+0.020
Neg logit contributions
Justin-0.036
 Mason-0.035
 pap-0.034
 Justin-0.034
 Macy-0.032
Token Tom
Token ID4186
Feature activation-8.79e-03

Pos logit contributions
edy+0.006
."+0.006
er+0.006
Twe+0.006
rix+0.006
Neg logit contributions
 Justin-0.022
 Mason-0.022
Justin-0.021
 pap-0.021
 Dean-0.020
Token John
Token ID1757
Feature activation-7.19e-03

Pos logit contributions
Justin+0.024
 Justin+0.023
 Mason+0.023
 Macy+0.022
 pap+0.022
Neg logit contributions
ibi-0.016
edy-0.016
 Kim-0.015
 sidewalk-0.015
."-0.015
Token chose
Token ID7690
Feature activation-7.24e-03

Pos logit contributions
Justin+0.014
 Justin+0.014
 Mason+0.013
 pap+0.013
 Manny+0.013
Neg logit contributions
ibi-0.007
edy-0.007
 sidewalk-0.007
 Kim-0.007
oma-0.007
Token some
Token ID617
Feature activation+7.60e-03

Pos logit contributions
Justin+0.011
 Justin+0.011
 Mason+0.011
 pap+0.010
 Macy+0.010
Neg logit contributions
ibi-0.010
edy-0.010
 sidewalk-0.010
oma-0.010
 Kim-0.010
Token carrots
Token ID34397
Feature activation-2.16e-03

Pos logit contributions
ibi+0.009
edy+0.009
."+0.009
 sidewalk+0.009
 Kim+0.009
Neg logit contributions
Justin-0.013
 Justin-0.013
 Mason-0.013
 Macy-0.012
 pap-0.012
Token and
Token ID290
Feature activation-5.96e-04

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 Macy+0.004
 pap+0.004
Neg logit contributions
ibi-0.002
edy-0.002
."-0.002
 Kim-0.001
 sidewalk-0.001
Token cuc
Token ID38421
Feature activation-5.58e-02

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
ubb+0.001
Louis+0.001
Neg logit contributions
."-0.001
edy-0.001
 Sue-0.001
ibi-0.001
!"-0.001
Tokenumbers
Token ID17024
Feature activation-1.82e-03

Pos logit contributions
 Justin+0.082
 pap+0.082
 mer+0.081
 Manny+0.081
Justin+0.079
Neg logit contributions
."-0.076
!"-0.075
edy-0.075
,"-0.072
er-0.069
Token to
Token ID284
Feature activation-3.16e-03

Pos logit contributions
Justin+0.004
 Justin+0.004
 Mason+0.004
 Macy+0.004
 pap+0.004
Neg logit contributions
ibi-0.002
edy-0.002
 Kim-0.001
 sidewalk-0.001
."-0.001
Token take
Token ID1011
Feature activation+4.59e-03

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
Louis+0.005
 Manny+0.005
Neg logit contributions
."-0.004
!"-0.004
edy-0.004
 sidewalk-0.004
 Sue-0.004
Token home
Token ID1363
Feature activation+1.56e-02

Pos logit contributions
ibi+0.006
edy+0.006
."+0.005
 Kim+0.005
!"+0.005
Neg logit contributions
Justin-0.008
 Justin-0.008
 Mason-0.007
 pap-0.007
 Macy-0.007
Token.
Token ID13
Feature activation+4.05e-03

Pos logit contributions
ibi+0.013
edy+0.013
."+0.012
 Kim+0.012
 sidewalk+0.012
Neg logit contributions
Justin-0.033
 Justin-0.032
 Mason-0.032
 Macy-0.031
 pap-0.030
Token big
Token ID1263
Feature activation-2.80e-02

Pos logit contributions
years+0.033
Justin+0.033
ubb+0.032
ifying+0.032
 Justin+0.032
Neg logit contributions
!"-0.018
."-0.018
,"-0.018
?"-0.018
 man-0.018
Token salad
Token ID20698
Feature activation-1.69e-02

Pos logit contributions
Justin+0.048
 Justin+0.045
 Mason+0.045
Beck+0.044
 Macy+0.044
Neg logit contributions
!"-0.037
."-0.037
,"-0.036
?"-0.035
 sidewalk-0.034
Token with
Token ID351
Feature activation-2.25e-02

Pos logit contributions
Justin+0.037
 Justin+0.036
 Mason+0.035
 Macy+0.034
ubb+0.034
Neg logit contributions
."-0.014
!"-0.013
edy-0.013
ibi-0.013
 Sue-0.012
Token carrots
Token ID34397
Feature activation-1.10e-02

Pos logit contributions
Justin+0.042
 Justin+0.040
 Mason+0.039
Louis+0.039
Dean+0.039
Neg logit contributions
."-0.027
!"-0.025
 Sue-0.025
,"-0.025
 Twe-0.024
Token,
Token ID11
Feature activation-1.78e-03

Pos logit contributions
Justin+0.023
 Justin+0.022
 Mason+0.022
 nood+0.021
 pap+0.021
Neg logit contributions
ibi-0.010
edy-0.009
 Kim-0.009
oma-0.009
 Karen-0.009
Token cuc
Token ID38421
Feature activation-5.57e-02

Pos logit contributions
Justin+0.003
 Justin+0.003
 Mason+0.003
 Macy+0.003
 pap+0.003
Neg logit contributions
ibi-0.002
edy-0.002
 Kim-0.002
."-0.002
 sidewalk-0.002
Tokenumbers
Token ID17024
Feature activation-1.19e-02

Pos logit contributions
 Manny+0.102
 Justin+0.102
 pap+0.101
 Louie+0.099
 fascination+0.098
Neg logit contributions
uri-0.055
edy-0.053
."-0.052
!"-0.052
ongo-0.051
Token,
Token ID11
Feature activation-4.50e-03

Pos logit contributions
Justin+0.025
 Mason+0.024
 Justin+0.024
 pap+0.023
 Macy+0.023
Neg logit contributions
ibi-0.011
edy-0.010
oma-0.010
 sidewalk-0.010
 Karen-0.010
Token and
Token ID290
Feature activation-3.93e-03

Pos logit contributions
Justin+0.010
 Justin+0.010
 Mason+0.010
Louis+0.009
 Macy+0.009
Neg logit contributions
."-0.003
!"-0.003
edy-0.003
 sidewalk-0.003
,"-0.003
Token tomatoes
Token ID23972
Feature activation-2.48e-02

Pos logit contributions
Justin+0.007
ubb+0.007
 Justin+0.007
 Mason+0.007
Louis+0.007
Neg logit contributions
."-0.005
 Sue-0.005
,"-0.005
 Kim-0.004
 Sally-0.004
Token.
Token ID13
Feature activation-4.61e-03

Pos logit contributions
Justin+0.056
 Justin+0.055
 Mason+0.054
 Macy+0.052
 pap+0.052
Neg logit contributions
ibi-0.017
edy-0.017
."-0.016
 sidewalk-0.016
 Kim-0.016
Token cried
Token ID16896
Feature activation-1.76e-03

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
ubb+0.002
where+0.002
Neg logit contributions
ibi-0.000
."-0.000
edy-0.000
!"-0.000
 Kim-0.000
Token.
Token ID13
Feature activation-1.13e-03

Pos logit contributions
Justin+0.004
 Justin+0.004
 Mason+0.004
Louis+0.004
 Macy+0.004
Neg logit contributions
."-0.001
ibi-0.001
 Kim-0.001
edy-0.001
 Sue-0.001
Token "
Token ID366
Feature activation+3.82e-03

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
 pap+0.002
 Manny+0.002
Neg logit contributions
edy-0.002
ibi-0.002
oot-0.001
round-0.001
ongo-0.001
TokenThe
Token ID464
Feature activation+3.15e-03

Pos logit contributions
Tom+0.005
ibi+0.004
."+0.004
 sidewalk+0.004
edy+0.004
Neg logit contributions
 Justin-0.007
 Mason-0.007
Justin-0.007
 pap-0.006
 Manny-0.006
Token ost
Token ID23619
Feature activation-1.19e-02

Pos logit contributions
ibi+0.004
edy+0.004
 Kim+0.003
 Karen+0.003
oma+0.003
Neg logit contributions
Justin-0.006
 Justin-0.006
 Mason-0.005
 pap-0.005
 mer-0.005
Tokenric
Token ID1173
Feature activation-5.55e-02

Pos logit contributions
Justin+0.013
 Justin+0.013
 Mason+0.013
 pap+0.012
 Manny+0.011
Neg logit contributions
ibi-0.022
edy-0.021
 sidewalk-0.021
 Karen-0.021
 Kim-0.021
Tokenhes
Token ID956
Feature activation-2.34e-02

Pos logit contributions
Justin+0.094
 Justin+0.090
 Mason+0.088
 Macy+0.084
Louis+0.083
Neg logit contributions
ibi-0.073
edy-0.071
 Karen-0.069
 sidewalk-0.068
 Kim-0.068
Token are
Token ID389
Feature activation-2.47e-02

Pos logit contributions
Justin+0.039
 Justin+0.038
 Mason+0.037
where+0.036
 Macy+0.035
Neg logit contributions
ibi-0.030
edy-0.030
 sidewalk-0.028
 Karen-0.028
oma-0.028
Token mean
Token ID1612
Feature activation+2.51e-03

Pos logit contributions
Justin+0.044
 Justin+0.042
 Mason+0.042
 pap+0.040
 Macy+0.040
Neg logit contributions
ibi-0.029
edy-0.029
 sidewalk-0.028
 Kim-0.028
."-0.027
Token!
Token ID0
Feature activation-1.14e-02

Pos logit contributions
ibi+0.002
edy+0.002
 Karen+0.001
 sidewalk+0.001
oma+0.001
Neg logit contributions
Justin-0.006
 Justin-0.006
 Mason-0.005
 mer-0.005
 nood-0.005
Token They
Token ID1119
Feature activation-4.82e-03

Pos logit contributions
Justin+0.018
 Mason+0.018
 Justin+0.018
 Manny+0.017
 pap+0.017
Neg logit contributions
edy-0.017
ibi-0.016
 sidewalk-0.015
oot-0.014
 chain-0.014
Token and
Token ID290
Feature activation-1.95e-03

Pos logit contributions
ibi+0.001
edy+0.001
 Kim+0.001
 sidewalk+0.001
oma+0.001
Neg logit contributions
Justin-0.005
 Justin-0.005
 Mason-0.004
 pap-0.004
 Macy-0.004
Token the
Token ID262
Feature activation-3.54e-04

Pos logit contributions
Justin+0.004
 Justin+0.003
 Mason+0.003
Louis+0.003
 Macy+0.003
Neg logit contributions
edy-0.002
 Kim-0.002
."-0.002
ibi-0.002
 Sue-0.002
Token peppers
Token ID31804
Feature activation-2.89e-02

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
 Manny+0.001
 Macy+0.001
Neg logit contributions
edy-0.000
 sidewalk-0.000
."-0.000
ibi-0.000
!"-0.000
Token from
Token ID422
Feature activation-1.88e-03

Pos logit contributions
Justin+0.056
 Justin+0.054
 Mason+0.054
 Macy+0.051
 Manny+0.051
Neg logit contributions
ibi-0.030
edy-0.029
 Kim-0.028
oma-0.027
 sidewalk-0.027
Token the
Token ID262
Feature activation-1.44e-02

Pos logit contributions
Justin+0.003
 Justin+0.003
 Mason+0.003
 Macy+0.003
 pap+0.003
Neg logit contributions
ibi-0.002
edy-0.002
oma-0.002
round-0.002
ongo-0.002
Token cuc
Token ID38421
Feature activation-5.54e-02

Pos logit contributions
Justin+0.025
 Justin+0.024
 Mason+0.024
ubb+0.023
Louis+0.023
Neg logit contributions
 sidewalk-0.017
."-0.017
!"-0.016
edy-0.016
,"-0.015
Tokenumbers
Token ID17024
Feature activation-1.47e-02

Pos logit contributions
 Manny+0.102
 Justin+0.101
 mer+0.099
 imag+0.099
 pap+0.099
Neg logit contributions
."-0.055
edy-0.053
er-0.052
!"-0.051
uri-0.051
Token.
Token ID13
Feature activation-8.02e-04

Pos logit contributions
Justin+0.029
 Justin+0.029
 Mason+0.029
 Manny+0.028
 Macy+0.028
Neg logit contributions
ibi-0.014
edy-0.013
oma-0.013
 Karen-0.012
Tom-0.012
Token It
Token ID632
Feature activation-1.92e-02

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
 Macy+0.001
 Manny+0.001
Neg logit contributions
edy-0.001
ibi-0.001
oma-0.001
!"-0.001
 sidewalk-0.001
Token took
Token ID1718
Feature activation-9.62e-03

Pos logit contributions
Justin+0.043
 Justin+0.042
 Mason+0.041
ifying+0.040
Dean+0.039
Neg logit contributions
edy-0.013
."-0.013
 Sue-0.013
!"-0.012
ibi-0.012
Token a
Token ID257
Feature activation-9.10e-03

Pos logit contributions
Justin+0.018
 Justin+0.017
 Mason+0.017
 pap+0.017
 Macy+0.016
Neg logit contributions
 Sue-0.010
."-0.010
edy-0.010
 Kim-0.010
!"-0.010
Token They
Token ID1119
Feature activation+1.10e-02

Pos logit contributions
ibi+0.007
edy+0.007
oot+0.007
oma+0.007
ves+0.007
Neg logit contributions
 Justin-0.007
 Mason-0.007
Justin-0.007
 Manny-0.007
 pap-0.007
Token do
Token ID466
Feature activation+5.33e-03

Pos logit contributions
ibi+0.014
edy+0.013
 Kim+0.013
oma+0.013
 Karen+0.013
Neg logit contributions
Justin-0.019
 Justin-0.018
 Mason-0.018
 pap-0.017
Louis-0.017
Token not
Token ID407
Feature activation+7.96e-03

Pos logit contributions
ibi+0.006
 Karen+0.006
 Su+0.006
 Timothy+0.006
round+0.006
Neg logit contributions
Justin-0.010
 Justin-0.009
 Mason-0.009
where-0.009
Dean-0.009
Token pack
Token ID2353
Feature activation+5.58e-03

Pos logit contributions
ibi+0.011
edy+0.011
oma+0.011
 Karen+0.010
 sidewalk+0.010
Neg logit contributions
Justin-0.012
 Justin-0.012
 Mason-0.012
 pap-0.011
Dean-0.011
Token ost
Token ID23619
Feature activation-1.69e-02

Pos logit contributions
ibi+0.009
oma+0.008
 sidewalk+0.008
round+0.008
 Karen+0.008
Neg logit contributions
Justin-0.008
 Justin-0.008
 Mason-0.008
 Manny-0.007
 mer-0.007
Tokenric
Token ID1173
Feature activation-5.52e-02

Pos logit contributions
Justin+0.025
 Justin+0.024
 Mason+0.023
 Macy+0.022
Louis+0.022
Neg logit contributions
ibi-0.026
edy-0.026
 Kim-0.025
."-0.025
oma-0.025
Tokenhes
Token ID956
Feature activation-1.92e-02

Pos logit contributions
Justin+0.091
 Justin+0.086
 Mason+0.084
 Macy+0.080
Dean+0.079
Neg logit contributions
ibi-0.077
edy-0.072
 Karen-0.072
 sidewalk-0.072
oma-0.070
Token anymore
Token ID7471
Feature activation+2.04e-02

Pos logit contributions
 Mason+0.030
Justin+0.030
 Justin+0.030
 pap+0.029
 Manny+0.029
Neg logit contributions
ibi-0.025
edy-0.024
oma-0.024
 sidewalk-0.023
ongo-0.023
Token.
Token ID13
Feature activation+4.06e-03

Pos logit contributions
ibi+0.025
edy+0.022
 Karen+0.022
oma+0.022
 Timothy+0.022
Neg logit contributions
Justin-0.036
 Justin-0.036
 nood-0.035
 Mason-0.035
 pap-0.034
Token They
Token ID1119
Feature activation+8.14e-04

Pos logit contributions
edy+0.006
ibi+0.006
oot+0.006
er+0.006
round+0.006
Neg logit contributions
 Justin-0.006
Justin-0.006
 Mason-0.006
 Manny-0.005
 pap-0.005
Token pack
Token ID2353
Feature activation+6.13e-03

Pos logit contributions
ibi+0.001
edy+0.001
 Kim+0.001
 sidewalk+0.001
."+0.001
Neg logit contributions
Justin-0.001
 Justin-0.001
 Mason-0.001
 Macy-0.001
 pap-0.001
Token but
Token ID475
Feature activation-1.15e-05

Pos logit contributions
ibi+0.011
edy+0.011
 Kim+0.010
."+0.010
 sidewalk+0.010
Neg logit contributions
Justin-0.014
 Justin-0.013
 Mason-0.013
 Macy-0.012
 pap-0.012
Token be
Token ID307
Feature activation+1.19e-02

Pos logit contributions
Justin+0.000
 Justin+0.000
 Mason+0.000
 Macy+0.000
 Manny+0.000
Neg logit contributions
ibi-0.000
edy-0.000
 Kim-0.000
."-0.000
 sidewalk-0.000
Token careful
Token ID8161
Feature activation+4.65e-03

Pos logit contributions
ibi+0.018
edy+0.018
 Kim+0.017
 sidewalk+0.017
."+0.017
Neg logit contributions
Justin-0.018
 Justin-0.017
 Mason-0.017
 Macy-0.016
 pap-0.016
Token.
Token ID13
Feature activation-6.16e-03

Pos logit contributions
ibi+0.004
edy+0.004
 Karen+0.004
oma+0.004
 sidewalk+0.004
Neg logit contributions
Justin-0.009
 Justin-0.009
 Mason-0.009
 pap-0.009
 Manny-0.009
Token Ost
Token ID38919
Feature activation-6.63e-03

Pos logit contributions
Justin+0.011
 Justin+0.011
 Mason+0.010
 Macy+0.010
 pap+0.010
Neg logit contributions
ibi-0.007
edy-0.007
 Kim-0.007
."-0.007
 sidewalk-0.007
Tokenric
Token ID1173
Feature activation-5.49e-02

Pos logit contributions
Justin+0.007
 Mason+0.007
 Justin+0.007
where+0.006
 pap+0.006
Neg logit contributions
ibi-0.013
edy-0.013
 Kim-0.012
 Karen-0.012
 Su-0.012
Tokenhes
Token ID956
Feature activation-3.40e-02

Pos logit contributions
Justin+0.089
 Justin+0.085
 Mason+0.083
Dean+0.079
Louis+0.078
Neg logit contributions
ibi-0.078
edy-0.075
 Karen-0.075
 sidewalk-0.073
 Timothy-0.072
Token can
Token ID460
Feature activation-6.09e-03

Pos logit contributions
Justin+0.059
 Mason+0.058
 Justin+0.057
 pap+0.055
 Macy+0.054
Neg logit contributions
ibi-0.043
edy-0.041
 Karen-0.039
 sidewalk-0.039
oma-0.038
Token be
Token ID307
Feature activation-6.24e-03

Pos logit contributions
Justin+0.010
 Justin+0.009
 Mason+0.009
 pap+0.009
 Macy+0.009
Neg logit contributions
ibi-0.008
edy-0.008
 Kim-0.008
 sidewalk-0.008
oma-0.008
Token gr
Token ID1036
Feature activation+5.59e-04

Pos logit contributions
Justin+0.009
 Justin+0.009
 Mason+0.009
 pap+0.009
 nood+0.008
Neg logit contributions
ibi-0.009
edy-0.009
 sidewalk-0.009
oma-0.009
 Karen-0.009
Tokenumpy
Token ID32152
Feature activation+6.76e-03

Pos logit contributions
edy+0.001
 Karen+0.001
ibi+0.001
 Kim+0.001
 Timothy+0.001
Neg logit contributions
Justin-0.001
 Justin-0.001
 Mason-0.001
ubb-0.001
Dean-0.001
Token village
Token ID7404
Feature activation-4.57e-04

Pos logit contributions
Justin+0.000
 Justin+0.000
 Mason+0.000
 Macy+0.000
Dean+0.000
Neg logit contributions
 sidewalk-0.000
."-0.000
!"-0.000
edy-0.000
 Karen-0.000
Token would
Token ID561
Feature activation-1.44e-02

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
 Manny+0.001
 pap+0.001
Neg logit contributions
ibi-0.001
edy-0.001
oma-0.000
 Kim-0.000
 sidewalk-0.000
Token raise
Token ID5298
Feature activation-1.89e-02

Pos logit contributions
Justin+0.025
 Justin+0.024
 Mason+0.023
 Macy+0.023
 Manny+0.023
Neg logit contributions
 sidewalk-0.018
."-0.018
!"-0.017
 Sue-0.017
edy-0.017
Token a
Token ID257
Feature activation-8.89e-04

Pos logit contributions
Justin+0.033
 Justin+0.032
 Mason+0.032
 pap+0.030
 Manny+0.030
Neg logit contributions
ibi-0.024
edy-0.023
oma-0.022
 Kim-0.022
 sidewalk-0.021
Token fire
Token ID2046
Feature activation-2.60e-02

Pos logit contributions
Justin+0.002
 Justin+0.001
 Mason+0.001
 pap+0.001
 Macy+0.001
Neg logit contributions
ibi-0.001
edy-0.001
 Kim-0.001
."-0.001
 sidewalk-0.001
Tokenwork
Token ID1818
Feature activation-5.46e-02

Pos logit contributions
Justin+0.052
 Justin+0.050
 Mason+0.049
 Macy+0.047
Louis+0.047
Neg logit contributions
."-0.026
edy-0.025
ibi-0.025
 sidewalk-0.025
!"-0.025
Token into
Token ID656
Feature activation-9.06e-03

Pos logit contributions
Justin+0.117
 Justin+0.112
 Mason+0.110
years+0.107
 Macy+0.107
Neg logit contributions
."-0.051
!"-0.049
 sidewalk-0.046
,"-0.045
edy-0.044
Token the
Token ID262
Feature activation-1.07e-02

Pos logit contributions
Justin+0.018
 Justin+0.017
 Mason+0.017
Louis+0.016
 Macy+0.016
Neg logit contributions
."-0.009
 sidewalk-0.009
 Kim-0.009
!"-0.008
edy-0.008
Token sky
Token ID6766
Feature activation-1.94e-02

Pos logit contributions
Justin+0.018
 Justin+0.017
 Mason+0.017
 Macy+0.016
Louis+0.016
Neg logit contributions
 sidewalk-0.015
."-0.014
!"-0.013
 automobile-0.013
,"-0.013
Token to
Token ID284
Feature activation-6.38e-03

Pos logit contributions
Justin+0.049
 Justin+0.048
 Mason+0.047
Louis+0.046
 Macy+0.046
Neg logit contributions
."-0.010
!"-0.008
ibi-0.008
 sidewalk-0.008
,"-0.007
Token celebrate
Token ID10648
Feature activation-4.41e-03

Pos logit contributions
Justin+0.009
 Justin+0.009
 Mason+0.009
 mer+0.008
 pap+0.008
Neg logit contributions
ibi-0.010
edy-0.010
oma-0.009
 Karen-0.009
 Kim-0.009
Token some
Token ID617
Feature activation+8.81e-03

Pos logit contributions
ibi+0.010
edy+0.010
."+0.010
 Sue+0.010
 Kim+0.010
Neg logit contributions
Justin-0.014
 Justin-0.014
 Mason-0.013
 pap-0.013
 Macy-0.013
Token y
Token ID331
Feature activation-8.75e-03

Pos logit contributions
ibi+0.012
edy+0.012
oma+0.011
 Kim+0.011
 Karen+0.011
Neg logit contributions
Justin-0.014
 Justin-0.014
 Mason-0.014
 pap-0.013
 mer-0.013
Tokenummy
Token ID13513
Feature activation-1.03e-02

Pos logit contributions
Justin+0.020
 Justin+0.019
Beck+0.018
Louis+0.018
Dean+0.018
Neg logit contributions
."-0.008
er-0.008
!"-0.007
 Sally-0.007
 Twe-0.007
Token carrots
Token ID34397
Feature activation+1.53e-03

Pos logit contributions
Justin+0.019
 Justin+0.018
 Mason+0.018
Louis+0.017
 Macy+0.017
Neg logit contributions
."-0.012
!"-0.012
ibi-0.011
 Sue-0.011
,"-0.011
Token and
Token ID290
Feature activation-8.87e-04

Pos logit contributions
ibi+0.001
edy+0.001
."+0.001
 Kim+0.001
 sidewalk+0.001
Neg logit contributions
Justin-0.003
 Justin-0.003
 Mason-0.003
 Macy-0.003
 pap-0.003
Token cuc
Token ID38421
Feature activation-5.44e-02

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
Louis+0.001
 Macy+0.001
Neg logit contributions
."-0.001
 sidewalk-0.001
 Sue-0.001
!"-0.001
edy-0.001
Tokenumbers
Token ID17024
Feature activation-4.60e-03

Pos logit contributions
 pap+0.097
 Justin+0.094
 Manny+0.094
 awe+0.090
 fascination+0.090
Neg logit contributions
edy-0.058
oot-0.058
ongo-0.057
."-0.055
!"-0.055
Token from
Token ID422
Feature activation+1.64e-03

Pos logit contributions
Justin+0.010
 Justin+0.010
 Mason+0.010
 pap+0.009
 Macy+0.009
Neg logit contributions
ibi-0.003
edy-0.003
 sidewalk-0.003
 Kim-0.003
oma-0.003
Token the
Token ID262
Feature activation-9.03e-03

Pos logit contributions
ibi+0.002
edy+0.002
oma+0.002
Tom+0.002
 Kim+0.002
Neg logit contributions
 Justin-0.003
Justin-0.003
 Mason-0.003
 pap-0.003
where-0.002
Token farmer
Token ID18739
Feature activation-1.70e-02

Pos logit contributions
Justin+0.017
 Justin+0.016
 Mason+0.016
ubb+0.016
 Macy+0.015
Neg logit contributions
 sidewalk-0.010
."-0.010
!"-0.010
edy-0.009
,"-0.009
Token's
Token ID338
Feature activation-1.41e-02

Pos logit contributions
Justin+0.033
 Justin+0.032
 Mason+0.031
ubb+0.031
Louis+0.030
Neg logit contributions
."-0.018
!"-0.017
,"-0.016
edy-0.016
 sidewalk-0.016
Tokenie
Token ID494
Feature activation-4.63e-03

Pos logit contributions
Justin+0.027
 Justin+0.026
 Mason+0.025
 Macy+0.024
 pap+0.024
Neg logit contributions
ibi-0.024
edy-0.023
."-0.022
 Kim-0.022
 sidewalk-0.022
Token.
Token ID13
Feature activation-8.56e-03

Pos logit contributions
 pap+0.008
 Mason+0.008
 Justin+0.008
Justin+0.008
 nood+0.008
Neg logit contributions
ibi-0.005
 sidewalk-0.005
edy-0.005
 Karen-0.005
keeping-0.005
Token Unlike
Token ID12101
Feature activation-2.98e-03

Pos logit contributions
Justin+0.015
 Justin+0.014
 Mason+0.014
 Macy+0.014
 pap+0.013
Neg logit contributions
ibi-0.011
edy-0.010
 Kim-0.010
 sidewalk-0.010
."-0.010
Token other
Token ID584
Feature activation-6.01e-03

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 Macy+0.005
 pap+0.005
Neg logit contributions
ibi-0.004
edy-0.004
 Kim-0.004
 sidewalk-0.004
."-0.004
Token ost
Token ID23619
Feature activation-5.70e-03

Pos logit contributions
Justin+0.012
 Justin+0.011
 Mason+0.011
 Macy+0.011
 Manny+0.011
Neg logit contributions
ibi-0.006
edy-0.006
 Kim-0.006
."-0.006
 sidewalk-0.006
Tokenric
Token ID1173
Feature activation-5.43e-02

Pos logit contributions
Justin+0.006
 Justin+0.006
 Mason+0.006
 pap+0.006
 Manny+0.005
Neg logit contributions
ibi-0.010
edy-0.010
 Kim-0.010
 sidewalk-0.010
."-0.010
Tokenhes
Token ID956
Feature activation-2.87e-02

Pos logit contributions
Justin+0.089
 Justin+0.085
 Mason+0.082
 Macy+0.078
Louis+0.078
Neg logit contributions
ibi-0.076
edy-0.073
 Karen-0.072
 sidewalk-0.071
 Kim-0.071
Token,
Token ID11
Feature activation+6.45e-03

Pos logit contributions
Justin+0.053
 Justin+0.052
 Mason+0.051
 Macy+0.049
 pap+0.049
Neg logit contributions
ibi-0.032
edy-0.031
 sidewalk-0.030
 Kim-0.029
oma-0.029
Token his
Token ID465
Feature activation+4.90e-03

Pos logit contributions
ibi+0.009
edy+0.009
oma+0.008
 sidewalk+0.008
round+0.008
Neg logit contributions
 Mason-0.010
Justin-0.010
 Justin-0.010
 pap-0.010
 Manny-0.010
Token feathers
Token ID27400
Feature activation+3.78e-03

Pos logit contributions
ibi+0.006
edy+0.006
 Kim+0.006
 sidewalk+0.006
."+0.006
Neg logit contributions
Justin-0.008
 Justin-0.008
 Mason-0.008
 pap-0.008
 Macy-0.007
Token gl
Token ID1278
Feature activation-3.26e-02

Pos logit contributions
ibi+0.004
edy+0.004
 Kim+0.004
oma+0.004
 Karen+0.004
Neg logit contributions
Justin-0.007
 Justin-0.006
 Mason-0.006
 mer-0.006
 pap-0.006
Token has
Token ID468
Feature activation-1.14e-02

Pos logit contributions
Justin+0.035
 Mason+0.034
 Justin+0.034
ubb+0.033
years+0.033
Neg logit contributions
."-0.008
!"-0.007
,"-0.007
?"-0.007
 Sue-0.006
Token lettuce
Token ID39406
Feature activation-4.40e-03

Pos logit contributions
Justin+0.021
 Mason+0.020
 Justin+0.020
where+0.019
Louis+0.019
Neg logit contributions
."-0.014
!"-0.013
 Sue-0.013
,"-0.012
 sidewalk-0.012
Token and
Token ID290
Feature activation-7.61e-03

Pos logit contributions
Justin+0.011
 Justin+0.011
 Mason+0.011
 pap+0.010
 Macy+0.010
Neg logit contributions
ibi-0.002
edy-0.002
 Kim-0.002
 sidewalk-0.002
 Karen-0.002
Token tomatoes
Token ID23972
Feature activation-1.15e-02

Pos logit contributions
Justin+0.012
 Mason+0.012
where+0.011
ubb+0.011
itting+0.011
Neg logit contributions
."-0.011
,"-0.010
!"-0.010
 Sue-0.010
 Twe-0.010
Token and
Token ID290
Feature activation-4.15e-03

Pos logit contributions
Justin+0.025
 Justin+0.025
 Mason+0.025
 mer+0.024
 nood+0.024
Neg logit contributions
ibi-0.009
edy-0.008
 Kim-0.008
 Karen-0.008
oma-0.008
Token cuc
Token ID38421
Feature activation-5.42e-02

Pos logit contributions
Justin+0.006
 Mason+0.006
where+0.006
 Justin+0.006
ubb+0.006
Neg logit contributions
."-0.006
,"-0.006
!"-0.006
 Sue-0.006
 sidewalk-0.005
Tokenumbers
Token ID17024
Feature activation+8.44e-04

Pos logit contributions
 Manny+0.085
 pap+0.085
 Justin+0.084
where+0.081
 mer+0.081
Neg logit contributions
."-0.073
!"-0.073
edy-0.073
,"-0.069
/-0.067
Token.
Token ID13
Feature activation+1.25e-03

Pos logit contributions
ibi+0.001
 Karen+0.001
edy+0.001
oma+0.001
 sidewalk+0.001
Neg logit contributions
Justin-0.002
 Justin-0.002
 Mason-0.002
 nood-0.002
 mer-0.002
Token And
Token ID843
Feature activation-8.16e-03

Pos logit contributions
edy+0.002
ibi+0.002
oot+0.002
oma+0.002
round+0.002
Neg logit contributions
 Justin-0.002
Justin-0.002
 Mason-0.002
 Manny-0.002
 Macy-0.002
Token it
Token ID340
Feature activation-2.07e-02

Pos logit contributions
Justin+0.016
 Justin+0.015
 Mason+0.015
Beck+0.015
Dean+0.014
Neg logit contributions
 Sue-0.008
."-0.008
 Kim-0.008
edy-0.008
!"-0.008
Token has
Token ID468
Feature activation-1.39e-02

Pos logit contributions
Justin+0.051
years+0.051
ifying+0.050
 Mason+0.050
where+0.050
Neg logit contributions
."-0.011
!"-0.010
?"-0.009
,"-0.009
 and-0.008
Token cuc
Token ID38421
Feature activation-4.99e-02

Pos logit contributions
Justin+0.012
years+0.012
 Justin+0.011
 Mason+0.011
 Manny+0.011
Neg logit contributions
."-0.012
!"-0.012
,"-0.011
?"-0.011
 sidewalk-0.011
Tokenumber
Token ID4494
Feature activation-1.53e-02

Pos logit contributions
 Manny+0.102
 Justin+0.100
 imag+0.098
 fascination+0.096
Beck+0.096
Neg logit contributions
."-0.044
!"-0.041
er-0.041
,"-0.039
?"-0.037
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
Justin+0.035
 Justin+0.034
 Mason+0.034
 pap+0.033
 Macy+0.032
Neg logit contributions
ibi-0.010
edy-0.010
 sidewalk-0.009
 Kim-0.009
oma-0.009
Token Mama
Token ID40084
Feature activation-4.57e-03

Pos logit contributions
Justin+0.024
 Justin+0.024
 Mason+0.023
Louis+0.022
 Macy+0.022
Neg logit contributions
."-0.010
 Kim-0.010
ibi-0.010
edy-0.010
 Sue-0.010
Token puts
Token ID7584
Feature activation-1.65e-02

Pos logit contributions
Justin+0.008
 Justin+0.008
 Mason+0.008
 pap+0.008
 Macy+0.008
Neg logit contributions
ibi-0.005
edy-0.005
 sidewalk-0.005
 Kim-0.005
 Karen-0.005
Token cuc
Token ID38421
Feature activation-5.38e-02

Pos logit contributions
Justin+0.034
 Justin+0.033
 Mason+0.032
 Macy+0.031
Louis+0.031
Neg logit contributions
edy-0.015
."-0.015
ibi-0.015
 Kim-0.014
!"-0.014
Tokenumbers
Token ID17024
Feature activation-1.39e-02

Pos logit contributions
 Justin+0.101
 Manny+0.099
 mer+0.097
 pap+0.096
Justin+0.095
Neg logit contributions
."-0.053
er-0.052
,"-0.051
!"-0.050
?"-0.048
Token in
Token ID287
Feature activation-1.43e-02

Pos logit contributions
Justin+0.026
 Justin+0.025
 Mason+0.025
 Macy+0.024
 pap+0.024
Neg logit contributions
ibi-0.016
edy-0.015
oma-0.015
 sidewalk-0.015
 Karen-0.014
Token our
Token ID674
Feature activation-3.22e-02

Pos logit contributions
Justin+0.028
 Justin+0.027
 Mason+0.026
 Macy+0.025
 pap+0.025
Neg logit contributions
ibi-0.015
edy-0.015
 sidewalk-0.014
 Kim-0.014
."-0.014
Token salad
Token ID20698
Feature activation-1.01e-02

Pos logit contributions
Justin+0.052
 Justin+0.049
Louis+0.048
 Mason+0.048
ubb+0.048
Neg logit contributions
."-0.043
 sidewalk-0.043
!"-0.040
,"-0.040
 Sue-0.039
Token.
Token ID13
Feature activation-6.09e-03

Pos logit contributions
Justin+0.022
 Justin+0.022
 Mason+0.021
 pap+0.021
 Macy+0.021
Neg logit contributions
ibi-0.008
edy-0.007
 Karen-0.007
 Kim-0.007
oma-0.007
Token too
Token ID1165
Feature activation+6.02e-03

Pos logit contributions
where+0.002
 Mason+0.002
Justin+0.002
 Justin+0.002
 mer+0.002
Neg logit contributions
ibi-0.002
anya-0.002
edy-0.002
oma-0.002
 sidewalk-0.002
Token close
Token ID1969
Feature activation+5.96e-03

Pos logit contributions
ibi+0.012
edy+0.011
 sidewalk+0.011
oma+0.011
anya+0.011
Neg logit contributions
Justin-0.006
 Justin-0.006
 Mason-0.006
where-0.006
 pap-0.005
Token.
Token ID13
Feature activation-5.42e-03

Pos logit contributions
ibi+0.009
edy+0.009
anya+0.009
ongo+0.009
oma+0.009
Neg logit contributions
 Mason-0.008
 Justin-0.008
where-0.008
Justin-0.008
 Manny-0.008
Token The
Token ID383
Feature activation+7.99e-03

Pos logit contributions
Justin+0.009
 Justin+0.009
 Mason+0.009
 pap+0.008
 Manny+0.008
Neg logit contributions
ibi-0.007
edy-0.007
 sidewalk-0.007
oma-0.007
!"-0.006
Token ost
Token ID23619
Feature activation-1.15e-03

Pos logit contributions
edy+0.011
ibi+0.011
oot+0.011
ongo+0.011
anya+0.010
Neg logit contributions
 pap-0.012
where-0.012
 mer-0.012
 Mason-0.012
Justin-0.012
Tokenric
Token ID1173
Feature activation-5.37e-02

Pos logit contributions
Justin+0.001
 Mason+0.001
 Justin+0.001
 pap+0.001
ifying+0.001
Neg logit contributions
ibi-0.002
edy-0.002
Tom-0.002
 Timothy-0.002
 Karen-0.002
Tokenhes
Token ID956
Feature activation-2.06e-02

Pos logit contributions
Justin+0.084
 Justin+0.079
 Mason+0.077
Dean+0.074
Louis+0.073
Neg logit contributions
ibi-0.080
edy-0.077
 Karen-0.076
 Timothy-0.075
 sidewalk-0.074
Token might
Token ID1244
Feature activation+4.74e-03

Pos logit contributions
Justin+0.031
 Mason+0.030
 Justin+0.030
where+0.029
ifying+0.028
Neg logit contributions
ibi-0.030
edy-0.029
anya-0.028
 sidewalk-0.027
ongo-0.027
Token pe
Token ID613
Feature activation-2.34e-02

Pos logit contributions
ibi+0.007
edy+0.007
anya+0.007
oma+0.007
 Timothy+0.007
Neg logit contributions
Justin-0.006
 Mason-0.006
 pap-0.006
 Justin-0.006
where-0.006
Tokenck
Token ID694
Feature activation-2.38e-02

Pos logit contributions
Justin+0.042
 Justin+0.042
 Mason+0.040
 Macy+0.039
 Manny+0.038
Neg logit contributions
edy-0.027
ibi-0.027
."-0.026
!"-0.026
 Kim-0.025
Token you
Token ID345
Feature activation-3.39e-03

Pos logit contributions
Justin+0.045
 Justin+0.044
 Mason+0.043
 Macy+0.041
 mer+0.041
Neg logit contributions
ibi-0.026
edy-0.025
 sidewalk-0.024
oma-0.024
 Karen-0.024
Token go
Token ID467
Feature activation-8.55e-03

Pos logit contributions
Justin+0.026
 Justin+0.025
 Mason+0.024
ubb+0.024
 Macy+0.024
Neg logit contributions
."-0.015
 sidewalk-0.015
!"-0.014
?"-0.014
 and-0.014
Token outside
Token ID2354
Feature activation-1.94e-02

Pos logit contributions
Justin+0.016
 Justin+0.015
 Mason+0.015
ubb+0.015
 pap+0.014
Neg logit contributions
."-0.010
ibi-0.010
edy-0.009
 Kim-0.009
 sidewalk-0.009
Token and
Token ID290
Feature activation-1.02e-02

Pos logit contributions
Justin+0.056
ubb+0.054
ifying+0.053
 Justin+0.053
Louis+0.052
Neg logit contributions
 and-0.013
."-0.009
,"-0.007
?"-0.007
!"-0.007
Token fold
Token ID5591
Feature activation-1.47e-02

Pos logit contributions
Justin+0.019
ubb+0.019
 Justin+0.018
 Mason+0.018
Louis+0.017
Neg logit contributions
 sidewalk-0.011
 Sue-0.011
."-0.011
?"-0.011
 Kim-0.011
Token orig
Token ID1796
Feature activation-2.11e-02

Pos logit contributions
Justin+0.026
 Justin+0.024
 Mason+0.024
ubb+0.024
Louis+0.024
Neg logit contributions
."-0.018
!"-0.018
,"-0.018
ibi-0.017
 Sue-0.017
Tokenami
Token ID6277
Feature activation-5.34e-02

Pos logit contributions
Justin+0.037
 Justin+0.037
 Mason+0.036
 pap+0.034
 Macy+0.034
Neg logit contributions
er-0.027
ibi-0.026
oma-0.025
oot-0.025
anya-0.025
Token animals
Token ID4695
Feature activation-8.61e-03

Pos logit contributions
Justin+0.112
 Justin+0.107
 Mason+0.104
Louis+0.103
Dean+0.101
Neg logit contributions
."-0.051
!"-0.048
,"-0.045
edy-0.045
 sidewalk-0.044
Token in
Token ID287
Feature activation-7.54e-03

Pos logit contributions
Justin+0.019
 Justin+0.019
 Mason+0.018
 pap+0.018
Louis+0.018
Neg logit contributions
."-0.007
!"-0.006
edy-0.006
ibi-0.006
 sidewalk-0.006
Token her
Token ID607
Feature activation-7.25e-03

Pos logit contributions
Justin+0.013
 Justin+0.013
 Mason+0.013
Louis+0.012
 pap+0.012
Neg logit contributions
 Kim-0.009
."-0.009
 Sue-0.009
ibi-0.009
 sidewalk-0.009
Token garden
Token ID11376
Feature activation+1.61e-03

Pos logit contributions
Justin+0.014
 Justin+0.013
 Mason+0.013
Louis+0.012
ubb+0.012
Neg logit contributions
 sidewalk-0.008
."-0.008
ibi-0.008
edy-0.008
 Kim-0.008
Token.
Token ID13
Feature activation-3.11e-03

Pos logit contributions
ibi+0.001
."+0.001
edy+0.001
 sidewalk+0.001
 Kim+0.001
Neg logit contributions
Justin-0.004
 Justin-0.004
 Mason-0.004
 Macy-0.004
Louis-0.004
Token.
Token ID13
Feature activation-1.35e-04

Pos logit contributions
Justin+0.034
 Justin+0.034
 Mason+0.033
 Macy+0.032
 mer+0.032
Neg logit contributions
ibi-0.013
edy-0.013
 sidewalk-0.012
 Kim-0.012
 Karen-0.012
Token Tom
Token ID4186
Feature activation+7.02e-03

Pos logit contributions
 Justin+0.000
Justin+0.000
 Mason+0.000
 Macy+0.000
 pap+0.000
Neg logit contributions
edy-0.000
ibi-0.000
oot-0.000
er-0.000
!"-0.000
Token ro
Token ID686
Feature activation-1.37e-02

Pos logit contributions
ibi+0.006
edy+0.006
 Kim+0.006
 sidewalk+0.006
 Karen+0.005
Neg logit contributions
Justin-0.015
 Justin-0.015
 Mason-0.014
 pap-0.014
 Macy-0.014
Tokenars
Token ID945
Feature activation-1.51e-02

Pos logit contributions
Justin+0.024
 Justin+0.023
 Mason+0.023
 pap+0.022
 Manny+0.022
Neg logit contributions
edy-0.017
ibi-0.017
."-0.016
!"-0.016
oma-0.016
Token and
Token ID290
Feature activation-5.12e-03

Pos logit contributions
Justin+0.040
 Justin+0.039
 Mason+0.039
Beck+0.038
 pap+0.038
Neg logit contributions
."-0.006
 and-0.005
!"-0.005
 Kim-0.004
 Sue-0.004
Token breat
Token ID4882
Feature activation-5.33e-02

Pos logit contributions
Justin+0.009
 Justin+0.009
 Mason+0.009
ubb+0.008
years+0.008
Neg logit contributions
 Kim-0.006
 sidewalk-0.006
."-0.006
 Sue-0.006
ibi-0.006
Tokenhes
Token ID956
Feature activation-3.98e-02

Pos logit contributions
Justin+0.096
 Justin+0.093
 Mason+0.092
 Macy+0.087
 pap+0.087
Neg logit contributions
ibi-0.063
edy-0.062
 Kim-0.060
 sidewalk-0.059
."-0.058
Token fire
Token ID2046
Feature activation-1.91e-02

Pos logit contributions
Justin+0.084
 Justin+0.081
 Mason+0.079
Louis+0.077
 Macy+0.077
Neg logit contributions
."-0.035
 Kim-0.034
ibi-0.034
edy-0.034
 sidewalk-0.033
Token.
Token ID13
Feature activation-6.05e-03

Pos logit contributions
Justin+0.048
 Justin+0.046
 Mason+0.045
Louis+0.044
Beck+0.044
Neg logit contributions
."-0.012
!"-0.011
 sidewalk-0.011
er-0.010
,"-0.010
Token Anna
Token ID11735
Feature activation+4.66e-04

Pos logit contributions
Justin+0.011
 Justin+0.011
 Mason+0.011
 Macy+0.010
 pap+0.010
Neg logit contributions
ibi-0.007
edy-0.007
 sidewalk-0.006
."-0.006
 Kim-0.006
Token screams
Token ID26557
Feature activation-4.40e-03

Pos logit contributions
."+0.000
!"+0.000
ibi+0.000
oma+0.000
 and+0.000
Neg logit contributions
Justin-0.001
 Justin-0.001
 Mason-0.001
ubb-0.001
Beck-0.001
Token is
Token ID318
Feature activation-1.15e-02

Pos logit contributions
Justin+0.041
 Justin+0.040
 Mason+0.039
 pap+0.038
 Macy+0.037
Neg logit contributions
ibi-0.030
edy-0.030
 sidewalk-0.029
 Kim-0.029
."-0.028
Token about
Token ID546
Feature activation+7.30e-03

Pos logit contributions
 Justin+0.017
Justin+0.017
 pap+0.017
 Mason+0.016
 Macy+0.015
Neg logit contributions
ibi-0.018
edy-0.017
round-0.017
Tom-0.016
oma-0.016
Token a
Token ID257
Feature activation-2.96e-03

Pos logit contributions
ibi+0.011
edy+0.011
oma+0.010
 Kim+0.010
 Karen+0.010
Neg logit contributions
Justin-0.011
 Justin-0.011
 Mason-0.011
 pap-0.010
 mer-0.010
Token dragon
Token ID10441
Feature activation-1.62e-02

Pos logit contributions
Justin+0.005
 Justin+0.004
 Mason+0.004
 pap+0.004
 Macy+0.004
Neg logit contributions
ibi-0.004
edy-0.004
 Kim-0.004
 sidewalk-0.004
."-0.004
Token that
Token ID326
Feature activation-2.01e-02

Pos logit contributions
Justin+0.035
 Justin+0.035
 Mason+0.034
 pap+0.033
 mer+0.033
Neg logit contributions
ibi-0.014
edy-0.013
 sidewalk-0.012
 Kim-0.012
 Karen-0.012
Token breat
Token ID4882
Feature activation-5.31e-02

Pos logit contributions
Justin+0.039
 Justin+0.038
 Mason+0.037
ubb+0.036
 Manny+0.036
Neg logit contributions
."-0.021
!"-0.020
 Sue-0.019
edy-0.019
?"-0.018
Tokenhes
Token ID956
Feature activation-4.18e-02

Pos logit contributions
Justin+0.094
 Justin+0.090
 Mason+0.087
 Macy+0.084
Dean+0.084
Neg logit contributions
ibi-0.072
edy-0.068
 Karen-0.067
 Kim-0.066
 sidewalk-0.064
Token fire
Token ID2046
Feature activation-1.69e-02

Pos logit contributions
Justin+0.080
 Justin+0.078
 Mason+0.077
 Macy+0.074
 pap+0.073
Neg logit contributions
ibi-0.045
edy-0.043
oma-0.041
 Karen-0.041
 sidewalk-0.040
Token and
Token ID290
Feature activation-2.08e-02

Pos logit contributions
Justin+0.040
 Justin+0.039
 Mason+0.038
 Macy+0.037
 pap+0.037
Neg logit contributions
ibi-0.010
edy-0.010
."-0.009
 sidewalk-0.009
 Kim-0.009
Token ch
Token ID442
Feature activation-2.28e-02

Pos logit contributions
Justin+0.036
 Justin+0.035
 Mason+0.034
Louis+0.033
 Macy+0.033
Neg logit contributions
."-0.027
!"-0.026
 sidewalk-0.025
 Kim-0.025
,"-0.025
Tokenases
Token ID1386
Feature activation+9.29e-03

Pos logit contributions
Justin+0.033
 Justin+0.032
 Mason+0.032
 pap+0.030
 Macy+0.030
Neg logit contributions
ibi-0.034
edy-0.034
."-0.033
 Kim-0.033
!"-0.033
Token house
Token ID2156
Feature activation+1.22e-02

Pos logit contributions
Justin+0.039
 Justin+0.037
 Mason+0.037
ubb+0.037
years+0.036
Neg logit contributions
 sidewalk-0.027
."-0.025
?"-0.024
!"-0.024
 Sue-0.023
Token.
Token ID13
Feature activation+2.67e-03

Pos logit contributions
ibi+0.008
edy+0.008
 Kim+0.007
 sidewalk+0.007
oma+0.007
Neg logit contributions
Justin-0.028
 Justin-0.027
 Mason-0.027
 Macy-0.026
 pap-0.026
Token They
Token ID1119
Feature activation-6.34e-04

Pos logit contributions
edy+0.003
ibi+0.003
 sidewalk+0.002
oma+0.002
."+0.002
Neg logit contributions
Justin-0.005
 Justin-0.005
 Mason-0.005
 pap-0.005
 Macy-0.005
Token wash
Token ID13502
Feature activation+7.30e-03

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
ubb+0.001
 Manny+0.001
Neg logit contributions
."-0.001
!"-0.001
edy-0.001
,"-0.001
 sidewalk-0.001
Token the
Token ID262
Feature activation-9.68e-03

Pos logit contributions
."+0.008
edy+0.007
 Kim+0.007
ibi+0.007
!"+0.007
Neg logit contributions
Justin-0.014
 Justin-0.013
 Mason-0.013
Louis-0.013
Dean-0.013
Token cuc
Token ID38421
Feature activation-5.30e-02

Pos logit contributions
Justin+0.015
 Justin+0.015
 Mason+0.015
Louis+0.014
 Macy+0.014
Neg logit contributions
 sidewalk-0.014
."-0.014
!"-0.013
,"-0.013
edy-0.012
Tokenumber
Token ID4494
Feature activation-1.41e-02

Pos logit contributions
 Justin+0.106
where+0.101
 imag+0.100
Justin+0.100
 Manny+0.099
Neg logit contributions
er-0.049
."-0.047
!"-0.044
edy-0.043
ongo-0.042
Token and
Token ID290
Feature activation-9.23e-03

Pos logit contributions
Justin+0.036
 Justin+0.035
 Mason+0.034
Louis+0.034
years+0.034
Neg logit contributions
."-0.008
,"-0.007
!"-0.006
 and-0.006
er-0.006
Token cut
Token ID2005
Feature activation-3.84e-03

Pos logit contributions
Justin+0.017
 Justin+0.016
 Mason+0.016
ubb+0.015
Louis+0.015
Neg logit contributions
 Kim-0.010
."-0.010
 Sue-0.010
 sidewalk-0.010
edy-0.010
Token it
Token ID340
Feature activation-1.62e-02

Pos logit contributions
Justin+0.007
 Justin+0.007
 Mason+0.007
 Macy+0.006
 pap+0.006
Neg logit contributions
ibi-0.004
edy-0.004
 sidewalk-0.004
 Kim-0.004
oma-0.004
Token into
Token ID656
Feature activation-1.09e-02

Pos logit contributions
Justin+0.036
 Justin+0.035
 Mason+0.035
years+0.035
Dean+0.034
Neg logit contributions
."-0.015
,"-0.014
 and-0.013
!"-0.013
er-0.012
Token mom
Token ID1995
Feature activation-6.23e-03

Pos logit contributions
ibi+0.023
keeping+0.022
edy+0.022
ongo+0.022
oot+0.021
Neg logit contributions
 pap-0.018
 Mason-0.016
where-0.015
 Justin-0.015
 Milo-0.015
Token and
Token ID290
Feature activation+9.84e-03

Pos logit contributions
 Justin+0.012
 pap+0.012
Justin+0.012
 Mason+0.012
 mer+0.011
Neg logit contributions
ibi-0.006
edy-0.006
 sidewalk-0.006
oma-0.005
 Timothy-0.005
Token to
Token ID284
Feature activation+1.49e-02

Pos logit contributions
ibi+0.013
edy+0.013
oma+0.013
 sidewalk+0.012
."+0.012
Neg logit contributions
Justin-0.016
 Justin-0.016
 Mason-0.015
 pap-0.015
 Macy-0.014
Token the
Token ID262
Feature activation+1.59e-04

Pos logit contributions
ibi+0.020
edy+0.020
oma+0.019
!"+0.019
round+0.019
Neg logit contributions
Justin-0.023
 Justin-0.023
 Mason-0.023
 pap-0.023
 mer-0.022
Token ost
Token ID23619
Feature activation-1.13e-02

Pos logit contributions
ibi+0.000
edy+0.000
ongo+0.000
 Kim+0.000
 Jerry+0.000
Neg logit contributions
 pap-0.000
Justin-0.000
 Mason-0.000
 mer-0.000
 Justin-0.000
Tokenric
Token ID1173
Feature activation-5.28e-02

Pos logit contributions
Justin+0.010
 Mason+0.009
 pap+0.009
 Justin+0.009
ubb+0.009
Neg logit contributions
edy-0.022
ibi-0.022
Tom-0.022
 Timothy-0.022
 sidewalk-0.022
Tokenhes
Token ID956
Feature activation-1.67e-02

Pos logit contributions
Justin+0.078
 Justin+0.074
 Mason+0.071
Dean+0.068
Louis+0.068
Neg logit contributions
ibi-0.082
 Karen-0.079
edy-0.078
 Timothy-0.077
 sidewalk-0.077
Token.
Token ID13
Feature activation+2.52e-03

Pos logit contributions
 Mason+0.027
 Justin+0.026
Justin+0.026
 pap+0.025
 Macy+0.025
Neg logit contributions
edy-0.021
ibi-0.021
ongo-0.020
 sidewalk-0.020
ller-0.020
Token They
Token ID1119
Feature activation+1.74e-02

Pos logit contributions
edy+0.004
oot+0.003
ibi+0.003
round+0.003
!"+0.003
Neg logit contributions
 pap-0.004
Justin-0.004
 Justin-0.004
 Mason-0.004
 Manny-0.003
Token learned
Token ID4499
Feature activation+1.38e-02

Pos logit contributions
ibi+0.021
 sidewalk+0.021
round+0.021
ller+0.021
oma+0.020
Neg logit contributions
Justin-0.030
 Justin-0.028
 pap-0.028
 mer-0.028
 Mason-0.028
Token their
Token ID511
Feature activation+3.29e-03

Pos logit contributions
ibi+0.020
edy+0.020
 sidewalk+0.019
oma+0.019
round+0.019
Neg logit contributions
Justin-0.020
 Justin-0.019
 Mason-0.019
 pap-0.019
 mer-0.018
Token and
Token ID290
Feature activation+5.25e-03

Pos logit contributions
ibi+0.002
edy+0.002
 Kim+0.002
 sidewalk+0.002
."+0.002
Neg logit contributions
Justin-0.010
 Justin-0.010
 Mason-0.010
 Macy-0.010
 pap-0.010
Token run
Token ID1057
Feature activation+1.63e-02

Pos logit contributions
."+0.006
 sidewalk+0.006
 Kim+0.006
 Sue+0.006
!"+0.006
Neg logit contributions
Justin-0.009
 Justin-0.009
 Mason-0.009
ubb-0.009
years-0.009
Token to
Token ID284
Feature activation+1.42e-03

Pos logit contributions
ibi+0.022
edy+0.021
oma+0.020
 Kim+0.020
 sidewalk+0.020
Neg logit contributions
Justin-0.027
 Justin-0.026
 Mason-0.026
 mer-0.025
Louis-0.024
Token the
Token ID262
Feature activation+4.33e-04

Pos logit contributions
ibi+0.002
edy+0.002
oma+0.002
ongo+0.002
 sidewalk+0.002
Neg logit contributions
 Justin-0.002
Justin-0.002
 Mason-0.002
 pap-0.002
 Manny-0.002
Token ost
Token ID23619
Feature activation-1.08e-02

Pos logit contributions
ibi+0.000
edy+0.000
 Timothy+0.000
 Karen+0.000
 Kim+0.000
Neg logit contributions
Justin-0.001
 Justin-0.001
 Mason-0.001
 pap-0.001
 mer-0.001
Tokenric
Token ID1173
Feature activation-5.26e-02

Pos logit contributions
Justin+0.012
 Justin+0.011
 Mason+0.011
 pap+0.011
 Manny+0.010
Neg logit contributions
ibi-0.020
edy-0.019
 Kim-0.019
 Karen-0.019
 sidewalk-0.019
Tokenhes
Token ID956
Feature activation-2.08e-02

Pos logit contributions
Justin+0.082
 Justin+0.078
 Mason+0.076
Dean+0.073
 Macy+0.072
Neg logit contributions
ibi-0.076
edy-0.075
 Karen-0.073
 Timothy-0.072
 sidewalk-0.072
Token.
Token ID13
Feature activation+6.10e-03

Pos logit contributions
Justin+0.043
 Justin+0.042
 Mason+0.042
 Macy+0.040
 Manny+0.040
Neg logit contributions
ibi-0.019
edy-0.018
 sidewalk-0.017
 Karen-0.017
oma-0.017
Token
Token ID198
Feature activation+6.59e-03

Pos logit contributions
edy+0.008
ibi+0.008
oot+0.008
anya+0.008
oma+0.008
Neg logit contributions
 Mason-0.010
Justin-0.009
 Justin-0.009
 Manny-0.009
 Macy-0.009
Token
Token ID198
Feature activation+2.41e-03

Pos logit contributions
edy+0.011
er+0.011
 automobiles+0.011
ibi+0.011
 automobile+0.011
Neg logit contributions
Justin-0.008
 Mason-0.008
 Justin-0.007
Beck-0.007
Larry-0.007
Token"
Token ID1
Feature activation-6.34e-03

Pos logit contributions
ibi+0.003
anya+0.003
edy+0.003
 sidewalk+0.003
oot+0.003
Neg logit contributions
Justin-0.004
 Mason-0.004
where-0.004
 Justin-0.004
Louis-0.004
Token"
Token ID1
Feature activation-7.89e-03

Pos logit contributions
Justin+0.017
 Justin+0.016
 Mason+0.016
 Macy+0.015
 pap+0.015
Neg logit contributions
ibi-0.005
edy-0.005
 Kim-0.005
 sidewalk-0.004
."-0.004
TokenNo
Token ID2949
Feature activation+2.58e-02

Pos logit contributions
 Justin+0.013
 Manny+0.013
Justin+0.013
 Mason+0.012
 pap+0.012
Neg logit contributions
Tom-0.011
ibi-0.010
edy-0.010
 sidewalk-0.010
."-0.010
Token,
Token ID11
Feature activation-6.63e-03

Pos logit contributions
ibi+0.032
edy+0.031
 sidewalk+0.031
keeping+0.031
oma+0.030
Neg logit contributions
 Justin-0.043
 Mason-0.043
Justin-0.042
 pap-0.042
 Manny-0.039
Token it
Token ID340
Feature activation-2.11e-02

Pos logit contributions
Justin+0.016
 Justin+0.015
Beck+0.015
 Mason+0.015
 Manny+0.015
Neg logit contributions
."-0.004
!"-0.004
edy-0.004
 Kim-0.004
 Sue-0.004
Token is
Token ID318
Feature activation-9.07e-03

Pos logit contributions
Justin+0.047
 Justin+0.046
 Mason+0.046
 Macy+0.045
years+0.045
Neg logit contributions
!"-0.017
."-0.017
,"-0.015
edy-0.015
/-0.015
Token mine
Token ID6164
Feature activation+3.20e-02

Pos logit contributions
 Justin+0.015
Justin+0.014
 pap+0.014
 Mason+0.014
 nood+0.013
Neg logit contributions
ibi-0.012
 sidewalk-0.012
anya-0.012
round-0.011
keeping-0.011
Token!"
Token ID2474
Feature activation-1.33e-02

Pos logit contributions
ibi+0.020
 sidewalk+0.018
edy+0.017
uri+0.017
anya+0.017
Neg logit contributions
Justin-0.073
 Justin-0.072
 Mason-0.071
 Manny-0.070
 Macy-0.068
Token Ben
Token ID3932
Feature activation+1.23e-03

Pos logit contributions
Justin+0.024
ubb+0.023
 Justin+0.023
Beck+0.023
 Mason+0.023
Neg logit contributions
 Tom-0.015
 Sue-0.015
 Tommy-0.015
 Kim-0.015
Tom-0.015
Token says
Token ID1139
Feature activation-2.04e-02

Pos logit contributions
ibi+0.001
edy+0.001
 Kim+0.001
 sidewalk+0.001
."+0.001
Neg logit contributions
Justin-0.003
 Justin-0.003
 Mason-0.003
 Macy-0.003
 pap-0.003
Token.
Token ID13
Feature activation-9.18e-03

Pos logit contributions
Justin+0.048
 Justin+0.047
 Mason+0.046
 Macy+0.045
 pap+0.045
Neg logit contributions
ibi-0.013
edy-0.012
 Kim-0.011
 sidewalk-0.011
."-0.011
Token He
Token ID679
Feature activation+7.50e-03

Pos logit contributions
Justin+0.018
 Justin+0.017
 Mason+0.017
 Manny+0.017
 pap+0.017
Neg logit contributions
ibi-0.009
."-0.009
 Kim-0.009
 Sue-0.009
edy-0.009
Token Tim
Token ID5045
Feature activation+4.67e-03

Pos logit contributions
ibi+0.025
oma+0.024
edy+0.022
anya+0.022
round+0.022
Neg logit contributions
 Manny-0.024
 Justin-0.024
where-0.024
 mer-0.023
Justin-0.023
Tokenmy
Token ID1820
Feature activation+3.84e-04

Pos logit contributions
 sidewalk+0.008
round+0.007
 Karen+0.007
edy+0.007
 Twe+0.007
Neg logit contributions
Justin-0.006
 Justin-0.006
 Mason-0.005
 Manny-0.005
Dean-0.005
Token had
Token ID550
Feature activation+7.53e-03

Pos logit contributions
ibi+0.001
round+0.001
 sidewalk+0.001
 Karen+0.001
edy+0.000
Neg logit contributions
Justin-0.001
 Justin-0.001
 Mason-0.001
 pap-0.001
Dean-0.001
Token hurt
Token ID5938
Feature activation+1.40e-02

Pos logit contributions
ibi+0.011
edy+0.011
round+0.011
 sidewalk+0.011
oma+0.010
Neg logit contributions
Justin-0.011
 Justin-0.011
 pap-0.011
 Mason-0.011
 nood-0.010
Token himself
Token ID2241
Feature activation+1.87e-02

Pos logit contributions
ibi+0.022
edy+0.022
round+0.021
 sidewalk+0.021
oma+0.021
Neg logit contributions
Justin-0.019
 Justin-0.018
 Mason-0.018
 Manny-0.017
 pap-0.017
Token badly
Token ID11234
Feature activation+3.34e-02

Pos logit contributions
ibi+0.018
 Kim+0.016
edy+0.016
 Karen+0.016
oma+0.016
Neg logit contributions
Justin-0.038
 Justin-0.037
 Mason-0.036
where-0.035
 pap-0.035
Token and
Token ID290
Feature activation-2.22e-03

Pos logit contributions
ibi+0.036
oma+0.032
 Kim+0.032
edy+0.031
 Su+0.031
Neg logit contributions
Justin-0.063
 Justin-0.060
 mer-0.060
 Mason-0.059
 nood-0.059
Token had
Token ID550
Feature activation+2.87e-03

Pos logit contributions
 Justin+0.003
Justin+0.003
 Manny+0.003
 Mason+0.003
 pap+0.003
Neg logit contributions
ibi-0.003
oma-0.003
round-0.003
edy-0.003
uri-0.003
Token to
Token ID284
Feature activation-7.25e-03

Pos logit contributions
ibi+0.004
edy+0.004
 sidewalk+0.004
oma+0.004
 Kim+0.004
Neg logit contributions
Justin-0.005
 Justin-0.004
 Mason-0.004
 pap-0.004
 Manny-0.004
Token go
Token ID467
Feature activation+6.35e-03

Pos logit contributions
Justin+0.010
 Justin+0.009
 Mason+0.009
 pap+0.009
 mer+0.009
Neg logit contributions
ibi-0.011
oma-0.011
edy-0.011
round-0.011
 sidewalk-0.011
Token to
Token ID284
Feature activation-2.22e-03

Pos logit contributions
ibi+0.010
oma+0.009
edy+0.009
anya+0.009
Tom+0.009
Neg logit contributions
Justin-0.008
 Mason-0.008
 Justin-0.008
where-0.008
 mer-0.008
Token wanted
Token ID2227
Feature activation+1.12e-02

Pos logit contributions
Justin+0.003
 Justin+0.003
 Mason+0.003
 pap+0.003
 Macy+0.003
Neg logit contributions
ibi-0.002
edy-0.002
 sidewalk-0.002
 Kim-0.002
."-0.002
Token the
Token ID262
Feature activation+1.69e-02

Pos logit contributions
ibi+0.014
edy+0.014
 Kim+0.013
."+0.013
 sidewalk+0.013
Neg logit contributions
Justin-0.020
 Justin-0.019
 Mason-0.019
 Macy-0.018
Louis-0.018
Token white
Token ID2330
Feature activation+1.82e-02

Pos logit contributions
edy+0.020
ibi+0.020
 Kim+0.019
."+0.019
oma+0.019
Neg logit contributions
Justin-0.029
 Justin-0.028
 Mason-0.028
 pap-0.027
 mer-0.027
Token spot
Token ID4136
Feature activation+1.64e-03

Pos logit contributions
edy+0.020
/+0.018
."+0.018
 Su+0.018
?"+0.018
Neg logit contributions
 nood-0.030
where-0.029
 pap-0.029
 mer-0.028
 Mason-0.028
Token so
Token ID523
Feature activation+1.59e-02

Pos logit contributions
ibi+0.002
edy+0.002
 sidewalk+0.002
 Kim+0.002
."+0.002
Neg logit contributions
Justin-0.003
 Justin-0.003
 Mason-0.003
 Macy-0.003
 pap-0.003
Token badly
Token ID11234
Feature activation+3.98e-02

Pos logit contributions
ibi+0.020
edy+0.020
 Kim+0.019
 sidewalk+0.019
."+0.019
Neg logit contributions
Justin-0.027
 Justin-0.026
 Mason-0.026
 Macy-0.025
 pap-0.025
Token,
Token ID11
Feature activation+9.36e-03

Pos logit contributions
ibi+0.048
oma+0.043
anya+0.042
 Kim+0.042
uri+0.041
Neg logit contributions
 Justin-0.071
Justin-0.070
 Mason-0.069
 mer-0.067
where-0.066
Token so
Token ID523
Feature activation+9.05e-03

Pos logit contributions
ibi+0.009
edy+0.008
 Kim+0.008
 sidewalk+0.008
."+0.008
Neg logit contributions
Justin-0.019
 Justin-0.019
 Mason-0.019
 Macy-0.018
 pap-0.018
Token he
Token ID339
Feature activation+2.43e-03

Pos logit contributions
ibi+0.015
edy+0.015
 sidewalk+0.014
oma+0.014
."+0.014
Neg logit contributions
Justin-0.012
 Justin-0.012
 Mason-0.011
 pap-0.011
 Manny-0.011
Token decided
Token ID3066
Feature activation+1.71e-02

Pos logit contributions
ibi+0.003
 sidewalk+0.003
 Kim+0.003
edy+0.003
oma+0.003
Neg logit contributions
Justin-0.005
 Justin-0.004
 Mason-0.004
 mer-0.004
 pap-0.004
Token to
Token ID284
Feature activation-1.63e-04

Pos logit contributions
ibi+0.033
oma+0.031
uri+0.030
edy+0.029
istle+0.029
Neg logit contributions
 Justin-0.018
Justin-0.018
 Mason-0.017
 mer-0.017
where-0.016
Token distance
Token ID5253
Feature activation+2.43e-02

Pos logit contributions
Justin+0.004
 Justin+0.003
 Mason+0.003
 Macy+0.003
 pap+0.003
Neg logit contributions
ibi-0.004
edy-0.003
 sidewalk-0.003
."-0.003
 Kim-0.003
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
edy+0.035
ibi+0.034
ongo+0.032
anya+0.031
uri+0.031
Neg logit contributions
 mer-0.035
 Mason-0.034
 Manny-0.034
 Justin-0.033
where-0.032
Token they
Token ID484
Feature activation-6.58e-03

Pos logit contributions
ibi+0.015
oma+0.014
edy+0.014
ongo+0.013
anya+0.013
Neg logit contributions
 Justin-0.020
 Mason-0.020
 mer-0.019
Justin-0.019
 Manny-0.018
Token wanted
Token ID2227
Feature activation+1.91e-02

Pos logit contributions
Justin+0.012
 Justin+0.011
 Mason+0.011
 mer+0.011
 pap+0.011
Neg logit contributions
ibi-0.008
oma-0.007
edy-0.007
 Kim-0.007
 sidewalk-0.007
Token to
Token ID284
Feature activation+1.37e-02

Pos logit contributions
ibi+0.032
edy+0.030
oma+0.030
 sidewalk+0.029
round+0.029
Neg logit contributions
 Justin-0.025
Justin-0.025
 Mason-0.024
 mer-0.023
 Manny-0.023
Token explore
Token ID7301
Feature activation+3.92e-02

Pos logit contributions
ibi+0.024
oma+0.024
edy+0.023
anya+0.022
ller+0.022
Neg logit contributions
 Mason-0.016
 mer-0.016
Justin-0.016
 Justin-0.015
where-0.015
Token it
Token ID340
Feature activation+4.11e-03

Pos logit contributions
ibi+0.069
ongo+0.063
oma+0.059
anya+0.058
edy+0.057
Neg logit contributions
 mer-0.057
 coral-0.054
where-0.052
 Justin-0.052
 Manny-0.050
Token.
Token ID13
Feature activation+3.43e-03

Pos logit contributions
ibi+0.004
edy+0.003
 Kim+0.003
oma+0.003
 sidewalk+0.003
Neg logit contributions
Justin-0.009
 Justin-0.009
 Mason-0.008
 Manny-0.008
 Macy-0.008
Token
Token ID198
Feature activation+9.63e-03

Pos logit contributions
ibi+0.005
edy+0.005
anya+0.005
oot+0.004
oma+0.004
Neg logit contributions
 Justin-0.005
 Mason-0.005
Justin-0.005
 Manny-0.005
where-0.005
Token
Token ID198
Feature activation-1.58e-03

Pos logit contributions
edy+0.014
 sidewalk+0.014
ibi+0.013
 automobiles+0.013
er+0.013
Neg logit contributions
Justin-0.015
 Mason-0.014
 Justin-0.014
where-0.014
Beck-0.014
TokenAs
Token ID1722
Feature activation-1.67e-02

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
where+0.002
Louis+0.002
Neg logit contributions
ibi-0.002
edy-0.002
 Kim-0.002
."-0.002
 sidewalk-0.002
Token party
Token ID2151
Feature activation+2.27e-02

Pos logit contributions
ibi+0.007
edy+0.007
 Kim+0.007
oot+0.007
 Timothy+0.007
Neg logit contributions
Justin-0.012
 Justin-0.011
 Mason-0.011
 mer-0.011
 pap-0.011
Token.
Token ID13
Feature activation+1.42e-03

Pos logit contributions
ibi+0.022
edy+0.019
 Timothy+0.019
anya+0.019
Twe+0.019
Neg logit contributions
 Justin-0.043
Justin-0.042
 Mason-0.042
 mer-0.041
 Manny-0.040
Token Is
Token ID1148
Feature activation-1.43e-02

Pos logit contributions
edy+0.002
ibi+0.002
oma+0.002
 sidewalk+0.002
anya+0.002
Neg logit contributions
 Justin-0.002
 Mason-0.002
Justin-0.002
 Macy-0.002
 Manny-0.002
Token it
Token ID340
Feature activation-1.77e-02

Pos logit contributions
 Justin+0.025
Justin+0.025
 Mason+0.024
 Macy+0.023
where+0.023
Neg logit contributions
ibi-0.018
 sidewalk-0.017
aron-0.016
ongo-0.015
oma-0.015
Token your
Token ID534
Feature activation+8.93e-03

Pos logit contributions
Justin+0.026
 Justin+0.026
 Mason+0.025
 pap+0.024
where+0.024
Neg logit contributions
ibi-0.027
edy-0.025
 sidewalk-0.025
oma-0.025
Tom-0.025
Token party
Token ID2151
Feature activation+3.33e-02

Pos logit contributions
ibi+0.013
edy+0.013
Tom+0.012
 Kim+0.012
oma+0.012
Neg logit contributions
Justin-0.013
 Justin-0.013
 Mason-0.013
 pap-0.013
 mer-0.012
Token?"
Token ID1701
Feature activation-6.28e-04

Pos logit contributions
ibi+0.036
 Timothy+0.032
edy+0.032
 Su+0.032
rix+0.031
Neg logit contributions
Justin-0.061
where-0.060
 Justin-0.058
 mer-0.057
 pap-0.056
Token Lily
Token ID20037
Feature activation+1.92e-02

Pos logit contributions
 Justin+0.001
Justin+0.001
 Mason+0.001
 Manny+0.001
 Macy+0.001
Neg logit contributions
edy-0.001
ibi-0.001
 sidewalk-0.001
oot-0.001
oma-0.001
Token said
Token ID531
Feature activation-1.37e-02

Pos logit contributions
ibi+0.026
edy+0.026
 sidewalk+0.024
 Timothy+0.024
 Kim+0.024
Neg logit contributions
 Justin-0.033
Justin-0.033
 Mason-0.032
 pap-0.031
 Macy-0.030
Token.
Token ID13
Feature activation-9.58e-05

Pos logit contributions
Justin+0.032
 Justin+0.031
 Mason+0.031
 Macy+0.030
 pap+0.030
Neg logit contributions
ibi-0.008
edy-0.008
."-0.008
 Kim-0.008
 sidewalk-0.007
Token
Token ID198
Feature activation-1.14e-03

Pos logit contributions
Justin+0.000
 Justin+0.000
 Mason+0.000
 Dean+0.000
 Macy+0.000
Neg logit contributions
edy-0.000
ibi-0.000
 sidewalk-0.000
keeping-0.000
round-0.000
Token got
Token ID1392
Feature activation-1.20e-02

Pos logit contributions
ibi+0.009
edy+0.009
 sidewalk+0.009
 Kim+0.008
oma+0.008
Neg logit contributions
Justin-0.010
 Justin-0.010
 Mason-0.009
 pap-0.009
 Macy-0.009
Token you
Token ID345
Feature activation+1.80e-03

Pos logit contributions
Justin+0.023
 Justin+0.022
 Mason+0.022
Beck+0.022
Dean+0.021
Neg logit contributions
."-0.014
!"-0.013
,"-0.013
 Sue-0.012
 Kim-0.012
Token."
Token ID526
Feature activation+1.78e-03

Pos logit contributions
ibi+0.002
edy+0.002
oma+0.002
 sidewalk+0.002
 Kim+0.002
Neg logit contributions
Justin-0.003
 Justin-0.003
 Mason-0.003
 pap-0.003
 mer-0.003
Token
Token ID198
Feature activation-4.62e-03

Pos logit contributions
ibi+0.001
."+0.001
 Kim+0.001
 Sue+0.001
edy+0.001
Neg logit contributions
Justin-0.004
 Justin-0.004
 Mason-0.004
ubb-0.004
 pap-0.004
Token
Token ID198
Feature activation-6.86e-03

Pos logit contributions
Justin+0.009
 Justin+0.008
 Mason+0.008
 Macy+0.008
Louis+0.008
Neg logit contributions
edy-0.006
ibi-0.005
 Karen-0.005
round-0.005
 Timothy-0.005
TokenBen
Token ID11696
Feature activation+4.02e-03

Pos logit contributions
ubb+0.014
years+0.014
 Manny+0.014
 pap+0.014
 imag+0.013
Neg logit contributions
Tom-0.008
."-0.006
!"-0.006
,"-0.005
Tim-0.005
Token reaches
Token ID12229
Feature activation-5.32e-03

Pos logit contributions
ibi+0.005
edy+0.005
 Kim+0.005
 sidewalk+0.005
 Karen+0.005
Neg logit contributions
Justin-0.007
 Justin-0.007
 Mason-0.007
 pap-0.006
 Macy-0.006
Token the
Token ID262
Feature activation+1.65e-03

Pos logit contributions
Justin+0.010
 Justin+0.010
 Mason+0.010
Louis+0.010
ubb+0.009
Neg logit contributions
."-0.005
 Kim-0.005
ibi-0.005
 Sue-0.005
!"-0.005
Token bottom
Token ID4220
Feature activation+4.58e-03

Pos logit contributions
edy+0.002
ibi+0.002
 Kim+0.002
 Karen+0.002
oma+0.002
Neg logit contributions
Justin-0.003
 Justin-0.002
 Mason-0.002
 Macy-0.002
Louis-0.002
Token.
Token ID13
Feature activation-5.75e-04

Pos logit contributions
edy+0.005
ibi+0.005
 Karen+0.005
 sidewalk+0.004
oma+0.004
Neg logit contributions
Justin-0.008
 Justin-0.008
 Mason-0.008
 Manny-0.008
 Macy-0.008
Token Anna
Token ID11735
Feature activation-1.79e-03

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
ubb+0.001
 pap+0.001
Neg logit contributions
 Sue-0.001
."-0.001
 Kim-0.001
ibi-0.001
 sidewalk-0.001
Token they
Token ID484
Feature activation+1.44e-03

Pos logit contributions
Justin+0.007
 Justin+0.007
 Mason+0.007
 Macy+0.006
 pap+0.006
Neg logit contributions
ibi-0.005
edy-0.005
 sidewalk-0.005
."-0.005
oma-0.005
Token all
Token ID477
Feature activation+1.16e-02

Pos logit contributions
ibi+0.002
 sidewalk+0.002
edy+0.002
 Kim+0.002
oma+0.002
Neg logit contributions
Justin-0.002
 Justin-0.002
 Mason-0.002
Louis-0.002
 mer-0.002
Token lived
Token ID5615
Feature activation+6.30e-03

Pos logit contributions
ibi+0.013
edy+0.012
 sidewalk+0.012
oma+0.012
 Kim+0.012
Neg logit contributions
Justin-0.021
 Justin-0.021
 Mason-0.021
 mer-0.020
 Manny-0.020
Token happily
Token ID18177
Feature activation+4.37e-03

Pos logit contributions
ibi+0.010
edy+0.009
 sidewalk+0.009
oma+0.009
round+0.009
Neg logit contributions
Justin-0.010
 Justin-0.009
 Mason-0.009
Louis-0.009
 mer-0.009
Token ever
Token ID1683
Feature activation-3.78e-03

Pos logit contributions
ibi+0.007
edy+0.007
 Kim+0.006
 sidewalk+0.006
."+0.006
Neg logit contributions
Justin-0.006
 Justin-0.006
 Mason-0.006
 Macy-0.006
 pap-0.006
Token after
Token ID706
Feature activation+9.76e-03

Pos logit contributions
Justin+0.007
 Mason+0.007
 Justin+0.007
 Manny+0.007
 Macy+0.007
Neg logit contributions
."-0.004
!"-0.004
edy-0.004
ibi-0.004
 sidewalk-0.004
Token.
Token ID13
Feature activation+9.65e-03

Pos logit contributions
ibi+0.013
round+0.012
edy+0.012
anya+0.011
uri+0.011
Neg logit contributions
 Justin-0.017
Justin-0.017
where-0.016
 mer-0.016
 Mason-0.016
Token<|endoftext|>
Token ID50256
Feature activation-4.59e-03

Pos logit contributions
edy+0.012
 sidewalk+0.011
ibi+0.011
round+0.011
er+0.011
Neg logit contributions
Justin-0.016
 Justin-0.016
where-0.016
 Manny-0.015
 Mason-0.015
TokenOnce
Token ID7454
Feature activation-8.54e-03

Pos logit contributions
Justin+0.006
 Justin+0.006
where+0.006
Beck+0.006
 Mason+0.006
Neg logit contributions
edy-0.008
ibi-0.008
round-0.007
oot-0.007
 sidewalk-0.007
Token upon
Token ID2402
Feature activation+6.42e-03

Pos logit contributions
Justin+0.008
 Justin+0.007
 Mason+0.007
 Manny+0.007
 Macy+0.007
Neg logit contributions
ibi-0.017
edy-0.017
 sidewalk-0.017
oma-0.017
 Kim-0.017
Token a
Token ID257
Feature activation+2.69e-03

Pos logit contributions
ibi+0.009
edy+0.009
."+0.008
 Kim+0.008
!"+0.008
Neg logit contributions
Justin-0.010
 Justin-0.010
 Mason-0.010
 pap-0.009
 Macy-0.009
Token.
Token ID13
Feature activation-1.75e-03

Pos logit contributions
Justin+0.038
 Justin+0.037
 Mason+0.037
 Macy+0.035
 pap+0.035
Neg logit contributions
ibi-0.011
edy-0.011
 Kim-0.010
 sidewalk-0.010
."-0.010
Token He
Token ID679
Feature activation-4.19e-03

Pos logit contributions
Justin+0.003
 Justin+0.003
 Mason+0.003
 Macy+0.003
 pap+0.003
Neg logit contributions
edy-0.002
ibi-0.002
 sidewalk-0.002
oma-0.002
!"-0.002
Token was
Token ID373
Feature activation-6.33e-03

Pos logit contributions
Justin+0.008
 Justin+0.008
 Mason+0.007
 mer+0.007
 pap+0.007
Neg logit contributions
ibi-0.005
 sidewalk-0.005
edy-0.004
 Kim-0.004
 Timothy-0.004
Token extraordinary
Token ID11359
Feature activation-2.23e-03

Pos logit contributions
Justin+0.011
 Justin+0.011
 Mason+0.011
 Macy+0.010
 Manny+0.010
Neg logit contributions
ibi-0.008
edy-0.007
."-0.007
 Kim-0.007
 Sue-0.007
Token.
Token ID13
Feature activation+1.86e-03

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 Macy+0.005
 pap+0.005
Neg logit contributions
ibi-0.001
edy-0.001
 Kim-0.001
 sidewalk-0.001
."-0.001
Token
Token ID198
Feature activation+5.12e-03

Pos logit contributions
edy+0.002
ibi+0.002
 sidewalk+0.002
oma+0.002
!"+0.002
Neg logit contributions
Justin-0.004
 Justin-0.003
 Mason-0.003
 Macy-0.003
 pap-0.003
Token
Token ID198
Feature activation+4.27e-03

Pos logit contributions
edy+0.008
ibi+0.007
round+0.007
 sidewalk+0.007
er+0.007
Neg logit contributions
Justin-0.007
 Justin-0.007
 Mason-0.007
 Manny-0.007
Louis-0.006
TokenBut
Token ID1537
Feature activation+1.64e-02

Pos logit contributions
ibi+0.004
edy+0.004
 Kim+0.004
 sidewalk+0.004
."+0.004
Neg logit contributions
Justin-0.008
 Justin-0.008
 Mason-0.008
Louis-0.008
 pap-0.008
Token not
Token ID407
Feature activation-6.04e-03

Pos logit contributions
 Sue+0.015
ibi+0.015
 Kim+0.015
 Karen+0.015
edy+0.015
Neg logit contributions
Justin-0.033
 Justin-0.032
 Mason-0.032
 Macy-0.031
ubb-0.030
Token everyone
Token ID2506
Feature activation-2.23e-04

Pos logit contributions
Justin+0.011
 Justin+0.011
 Mason+0.011
 Macy+0.010
 pap+0.010
Neg logit contributions
ibi-0.007
edy-0.007
 Kim-0.006
."-0.006
 sidewalk-0.006
Token liked
Token ID8288
Feature activation+4.27e-03

Pos logit contributions
Justin+0.000
 Justin+0.000
 Mason+0.000
 Macy+0.000
 pap+0.000
Neg logit contributions
ibi-0.000
edy-0.000
 sidewalk-0.000
 Kim-0.000
 Karen-0.000
Token Sam
Token ID3409
Feature activation-2.99e-03

Pos logit contributions
ibi+0.000
edy+0.000
 Kim+0.000
 sidewalk+0.000
 Sue+0.000
Neg logit contributions
Justin-0.000
 Justin-0.000
 Mason-0.000
 Macy-0.000
Louis-0.000
Token are
Token ID389
Feature activation-5.40e-03

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 Macy+0.004
 pap+0.004
Neg logit contributions
ibi-0.004
edy-0.004
 sidewalk-0.004
 Kim-0.004
."-0.004
Token scared
Token ID12008
Feature activation+8.05e-03

Pos logit contributions
Justin+0.009
 Justin+0.008
 Mason+0.008
 pap+0.008
 Macy+0.008
Neg logit contributions
ibi-0.007
edy-0.007
 sidewalk-0.007
 Kim-0.007
."-0.007
Token.
Token ID13
Feature activation-3.38e-03

Pos logit contributions
ibi+0.004
edy+0.004
."+0.004
 Kim+0.004
 sidewalk+0.004
Neg logit contributions
Justin-0.020
 Justin-0.019
 Mason-0.019
 Macy-0.018
 pap-0.018
Token They
Token ID1119
Feature activation+9.01e-03

Pos logit contributions
 Mason+0.005
 Justin+0.005
Justin+0.005
 Manny+0.005
 Dean+0.005
Neg logit contributions
edy-0.005
ibi-0.005
oot-0.005
round-0.005
ongo-0.004
Token drop
Token ID4268
Feature activation+1.80e-02

Pos logit contributions
ibi+0.013
edy+0.012
oma+0.012
 Kim+0.012
 sidewalk+0.012
Neg logit contributions
Justin-0.014
 Justin-0.013
 Mason-0.013
 mer-0.012
 Macy-0.012
Token the
Token ID262
Feature activation+2.49e-03

Pos logit contributions
ibi+0.018
edy+0.018
 sidewalk+0.017
 Kim+0.017
."+0.017
Neg logit contributions
Justin-0.035
 Justin-0.034
 Mason-0.034
 Macy-0.032
 pap-0.032
Token shiny
Token ID22441
Feature activation+6.37e-03

Pos logit contributions
ibi+0.003
edy+0.003
 Kim+0.003
 Karen+0.003
oma+0.003
Neg logit contributions
Justin-0.005
 Justin-0.004
 Mason-0.004
 mer-0.004
 pap-0.004
Token things
Token ID1243
Feature activation+4.93e-03

Pos logit contributions
ibi+0.010
 Karen+0.009
 Su+0.009
 Kim+0.009
 Timothy+0.009
Neg logit contributions
 mer-0.009
Justin-0.009
 Justin-0.008
 nood-0.008
Louis-0.008
Token and
Token ID290
Feature activation+5.91e-03

Pos logit contributions
ibi+0.004
edy+0.004
oma+0.004
 sidewalk+0.004
 Kim+0.004
Neg logit contributions
Justin-0.010
 Justin-0.010
 Mason-0.010
 mer-0.009
 Manny-0.009
Token run
Token ID1057
Feature activation+2.17e-02

Pos logit contributions
ibi+0.009
oma+0.008
edy+0.008
 Karen+0.008
oot+0.008
Neg logit contributions
Justin-0.009
 Justin-0.009
 Mason-0.009
 pap-0.008
 mer-0.008
Token they
Token ID484
Feature activation-2.87e-03

Pos logit contributions
ibi+0.002
edy+0.002
 Kim+0.002
 sidewalk+0.002
."+0.002
Neg logit contributions
Justin-0.003
 Justin-0.003
 Mason-0.003
 Macy-0.003
 pap-0.003
Token finished
Token ID5201
Feature activation-3.43e-03

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 pap+0.005
 Macy+0.005
Neg logit contributions
edy-0.003
ibi-0.003
 Kim-0.003
 Su-0.003
."-0.003
Token all
Token ID477
Feature activation+1.25e-02

Pos logit contributions
Justin+0.006
 Justin+0.006
 Mason+0.006
Louis+0.006
Beck+0.006
Neg logit contributions
ibi-0.004
edy-0.004
."-0.004
 Kim-0.004
 sidewalk-0.004
Token the
Token ID262
Feature activation+1.04e-02

Pos logit contributions
 Sue+0.011
 Su+0.011
ibi+0.011
 Kim+0.011
."+0.011
Neg logit contributions
Justin-0.027
 Justin-0.026
 Mason-0.025
Louis-0.025
Dean-0.024
Token laundry
Token ID25724
Feature activation+1.86e-02

Pos logit contributions
 sidewalk+0.010
edy+0.010
ibi+0.010
!"+0.010
."+0.010
Neg logit contributions
Justin-0.020
ifying-0.020
 Justin-0.020
 Mason-0.019
Louis-0.019
Token.
Token ID13
Feature activation+8.82e-03

Pos logit contributions
ibi+0.014
edy+0.013
 Kim+0.012
oma+0.012
 Karen+0.012
Neg logit contributions
Justin-0.041
 Justin-0.040
 Mason-0.040
 Manny-0.039
 Macy-0.039
Token
Token ID198
Feature activation+3.93e-03

Pos logit contributions
edy+0.012
ibi+0.011
 sidewalk+0.011
 chain+0.011
!"+0.010
Neg logit contributions
 Justin-0.015
 Mason-0.015
Justin-0.015
 Macy-0.015
 Manny-0.015
Token
Token ID198
Feature activation-2.67e-03

Pos logit contributions
edy+0.005
 sidewalk+0.005
ibi+0.005
er+0.005
round+0.005
Neg logit contributions
Justin-0.007
 Justin-0.007
 Mason-0.007
Beck-0.006
 Manny-0.006
TokenWhen
Token ID2215
Feature activation-1.34e-02

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 pap+0.004
 Macy+0.004
Neg logit contributions
ibi-0.003
edy-0.003
Tom-0.003
 Kim-0.003
 sidewalk-0.003
Token they
Token ID484
Feature activation-1.48e-02

Pos logit contributions
Justin+0.027
 Justin+0.026
 Mason+0.026
Louis+0.025
 pap+0.025
Neg logit contributions
 Kim-0.012
ibi-0.012
 Sue-0.012
 Karen-0.012
edy-0.012
Token were
Token ID547
Feature activation-8.77e-03

Pos logit contributions
Justin+0.028
 Justin+0.027
 Mason+0.026
 pap+0.025
 Macy+0.025
Neg logit contributions
ibi-0.016
edy-0.016
."-0.015
 Kim-0.015
 Sue-0.015
Token can
Token ID460
Feature activation-2.29e-03

Pos logit contributions
ibi+0.011
oma+0.011
 sidewalk+0.011
 Kim+0.010
 Karen+0.010
Neg logit contributions
Justin-0.011
 Justin-0.011
 Mason-0.010
Louis-0.010
 pap-0.010
Token sort
Token ID3297
Feature activation+1.71e-02

Pos logit contributions
Justin+0.004
 Justin+0.004
 Mason+0.004
 pap+0.003
Louis+0.003
Neg logit contributions
ibi-0.003
edy-0.003
 Kim-0.003
oma-0.003
 sidewalk-0.003
Token it
Token ID340
Feature activation-5.20e-03

Pos logit contributions
ibi+0.017
oma+0.016
edy+0.016
 sidewalk+0.015
anya+0.014
Neg logit contributions
Justin-0.034
 Justin-0.033
 Mason-0.033
 Manny-0.032
 Macy-0.032
Token."
Token ID526
Feature activation+3.30e-04

Pos logit contributions
Justin+0.008
 Justin+0.008
 Mason+0.008
 Manny+0.008
 Macy+0.008
Neg logit contributions
ibi-0.007
oma-0.007
anya-0.007
Tom-0.007
 Karen-0.007
Token
Token ID198
Feature activation+1.75e-03

Pos logit contributions
edy+0.000
ibi+0.000
oma+0.000
 sidewalk+0.000
!"+0.000
Neg logit contributions
Justin-0.001
 Justin-0.001
 Mason-0.001
 Macy-0.001
 Manny-0.001
Token
Token ID198
Feature activation-2.29e-03

Pos logit contributions
edy+0.002
 sidewalk+0.002
er+0.002
ibi+0.002
 automobiles+0.002
Neg logit contributions
Justin-0.003
 Justin-0.003
 Mason-0.003
Beck-0.003
Louis-0.003
TokenThe
Token ID464
Feature activation+1.85e-03

Pos logit contributions
Justin+0.004
 Justin+0.004
 Mason+0.004
 Macy+0.004
 pap+0.004
Neg logit contributions
ibi-0.003
edy-0.003
 Kim-0.003
 sidewalk-0.003
."-0.003
Token second
Token ID1218
Feature activation-1.72e-03

Pos logit contributions
edy+0.002
 sidewalk+0.002
."+0.002
ibi+0.002
oma+0.002
Neg logit contributions
Justin-0.003
 Mason-0.003
 Justin-0.003
ifying-0.003
 Manny-0.003
Token girl
Token ID2576
Feature activation-4.93e-04

Pos logit contributions
Justin+0.003
 Justin+0.003
 Mason+0.003
 pap+0.003
 Manny+0.003
Neg logit contributions
ibi-0.002
edy-0.002
."-0.002
 Sue-0.002
!"-0.002
Token asked
Token ID1965
Feature activation-7.59e-03

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
 pap+0.001
 Manny+0.001
Neg logit contributions
ibi-0.001
edy-0.000
 sidewalk-0.000
."-0.000
 Kim-0.000
Token,
Token ID11
Feature activation-8.81e-03

Pos logit contributions
Justin+0.015
 Justin+0.015
 Mason+0.015
 Macy+0.014
 pap+0.014
Neg logit contributions
ibi-0.007
edy-0.007
 sidewalk-0.007
 Kim-0.007
oma-0.007
Token prepare
Token ID8335
Feature activation-1.27e-02

Pos logit contributions
Justin+0.028
 Justin+0.027
 Mason+0.027
 pap+0.025
 mer+0.025
Neg logit contributions
ibi-0.026
edy-0.026
oma-0.025
 sidewalk-0.024
 Karen-0.024
Token the
Token ID262
Feature activation-1.26e-02

Pos logit contributions
Justin+0.021
 Justin+0.020
 Mason+0.020
 Macy+0.019
 pap+0.019
Neg logit contributions
ibi-0.017
edy-0.017
 sidewalk-0.016
oma-0.016
 Kim-0.016
Token garden
Token ID11376
Feature activation-1.15e-02

Pos logit contributions
Justin+0.021
 Justin+0.020
 Mason+0.019
 Macy+0.019
Louis+0.018
Neg logit contributions
ibi-0.016
."-0.016
edy-0.016
 sidewalk-0.016
!"-0.016
Token for
Token ID329
Feature activation+2.75e-03

Pos logit contributions
Justin+0.022
 Justin+0.021
 Mason+0.021
 Manny+0.020
 Macy+0.020
Neg logit contributions
ibi-0.013
edy-0.012
oma-0.012
 Karen-0.011
Tom-0.011
Token new
Token ID649
Feature activation-1.37e-03

Pos logit contributions
ibi+0.004
edy+0.004
oma+0.004
round+0.004
ongo+0.004
Neg logit contributions
Justin-0.004
 Justin-0.004
 Mason-0.004
 pap-0.004
 Macy-0.004
Token flowers
Token ID12734
Feature activation-6.58e-03

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
 pap+0.002
 mer+0.002
Neg logit contributions
ibi-0.002
edy-0.002
oma-0.002
 Kim-0.002
Tom-0.002
Token."
Token ID526
Feature activation+1.17e-03

Pos logit contributions
Justin+0.014
 Justin+0.014
 Mason+0.014
 Macy+0.013
 mer+0.013
Neg logit contributions
ibi-0.005
edy-0.005
oma-0.005
 Kim-0.005
 sidewalk-0.005
Token Lily
Token ID20037
Feature activation+2.91e-03

Pos logit contributions
edy+0.002
 chain+0.002
ibi+0.001
 sidewalk+0.001
oot+0.001
Neg logit contributions
 Justin-0.002
 Mason-0.002
Justin-0.002
 Macy-0.002
 Manny-0.002
Token was
Token ID373
Feature activation-1.98e-03

Pos logit contributions
ibi+0.003
edy+0.003
 Kim+0.003
 sidewalk+0.003
 Karen+0.003
Neg logit contributions
Justin-0.006
 Justin-0.005
 Mason-0.005
 pap-0.005
 Macy-0.005
Token very
Token ID845
Feature activation-3.76e-03

Pos logit contributions
Justin+0.003
 Justin+0.003
 Mason+0.003
 pap+0.003
 mer+0.002
Neg logit contributions
edy-0.003
ibi-0.003
 sidewalk-0.003
round-0.003
oot-0.003
Token excited
Token ID6568
Feature activation-1.21e-02

Pos logit contributions
Justin+0.006
 Justin+0.006
 Mason+0.006
 pap+0.006
 Macy+0.006
Neg logit contributions
ibi-0.005
edy-0.005
 sidewalk-0.005
 Kim-0.005
oma-0.005
Token a
Token ID257
Feature activation-2.61e-02

Pos logit contributions
Justin+0.004
 Justin+0.004
 Mason+0.004
 pap+0.004
where+0.004
Neg logit contributions
edy-0.003
ibi-0.003
 sidewalk-0.003
round-0.003
oma-0.003
Token bad
Token ID2089
Feature activation-9.96e-03

Pos logit contributions
Justin+0.045
 Mason+0.044
 Justin+0.044
 Macy+0.043
 Manny+0.042
Neg logit contributions
."-0.032
!"-0.031
edy-0.031
,"-0.030
 sidewalk-0.029
Token wreck
Token ID15228
Feature activation-4.87e-03

Pos logit contributions
Justin+0.015
 Justin+0.015
 Mason+0.014
 pap+0.014
 mer+0.013
Neg logit contributions
ibi-0.015
edy-0.014
 Kim-0.014
 Karen-0.014
 sidewalk-0.014
Token and
Token ID290
Feature activation-7.96e-03

Pos logit contributions
Justin+0.010
 Justin+0.010
 Mason+0.010
 mer+0.009
 Macy+0.009
Neg logit contributions
ibi-0.004
edy-0.004
 Kim-0.004
oma-0.004
 Karen-0.004
Token had
Token ID550
Feature activation-2.44e-05

Pos logit contributions
Justin+0.014
 Justin+0.014
 Mason+0.014
 pap+0.013
 Macy+0.013
Neg logit contributions
ibi-0.009
edy-0.009
 sidewalk-0.009
oma-0.008
 Kim-0.008
Token been
Token ID587
Feature activation-3.53e-04

Pos logit contributions
Justin+0.000
 Justin+0.000
 Mason+0.000
 pap+0.000
 Macy+0.000
Neg logit contributions
ibi-0.000
edy-0.000
 sidewalk-0.000
oma-0.000
 Kim-0.000
Token alone
Token ID3436
Feature activation+9.79e-03

Pos logit contributions
Justin+0.001
 Justin+0.001
 Mason+0.001
 pap+0.001
 Macy+0.001
Neg logit contributions
ibi-0.000
edy-0.000
 sidewalk-0.000
 Kim-0.000
oma-0.000
Token ever
Token ID1683
Feature activation-4.99e-03

Pos logit contributions
ibi+0.010
edy+0.009
 sidewalk+0.009
 Kim+0.009
oma+0.009
Neg logit contributions
Justin-0.019
 Justin-0.019
 Mason-0.019
 mer-0.018
 Manny-0.018
Token since
Token ID1201
Feature activation+5.38e-03

Pos logit contributions
Justin+0.009
 Justin+0.009
 Mason+0.009
 mer+0.008
 Macy+0.008
Neg logit contributions
ibi-0.006
edy-0.006
 Kim-0.005
 sidewalk-0.005
 Karen-0.005
Token.
Token ID13
Feature activation-1.95e-03

Pos logit contributions
ibi+0.007
edy+0.007
 sidewalk+0.006
 Kim+0.006
rix+0.006
Neg logit contributions
where-0.009
 Justin-0.009
 mer-0.009
Justin-0.009
 Mason-0.008
Token The
Token ID383
Feature activation+1.58e-03

Pos logit contributions
Justin+0.003
 Justin+0.003
 Mason+0.003
 Macy+0.003
 Manny+0.003
Neg logit contributions
edy-0.003
ibi-0.003
 sidewalk-0.002
 chain-0.002
!"-0.002
Token big
Token ID1263
Feature activation-6.82e-05

Pos logit contributions
ibi+0.019
round+0.018
oot+0.018
edy+0.018
aron+0.018
Neg logit contributions
Justin-0.022
 Justin-0.022
 mer-0.021
 Mason-0.020
where-0.020
Token boy
Token ID2933
Feature activation-6.59e-03

Pos logit contributions
 Justin+0.000
Justin+0.000
 mer+0.000
 pap+0.000
 Mason+0.000
Neg logit contributions
ibi-0.000
edy-0.000
oot-0.000
anya-0.000
ves-0.000
Token nodded
Token ID14464
Feature activation-1.24e-02

Pos logit contributions
Justin+0.012
 Justin+0.012
 Mason+0.011
 pap+0.011
 Macy+0.011
Neg logit contributions
ibi-0.008
 sidewalk-0.008
edy-0.007
oma-0.007
 Kim-0.007
Token one
Token ID530
Feature activation-2.48e-02

Pos logit contributions
Justin+0.030
 Justin+0.028
 Mason+0.028
 Macy+0.027
Louis+0.027
Neg logit contributions
."-0.007
ibi-0.007
edy-0.007
 Kim-0.007
 Sue-0.007
Token last
Token ID938
Feature activation-1.31e-02

Pos logit contributions
Justin+0.044
 Justin+0.042
 Mason+0.041
 Macy+0.039
Beck+0.039
Neg logit contributions
."-0.031
ibi-0.031
!"-0.030
edy-0.030
,"-0.030
Token time
Token ID640
Feature activation-1.10e-02

Pos logit contributions
Justin+0.028
 Justin+0.027
 Mason+0.027
 Macy+0.026
 pap+0.026
Neg logit contributions
ibi-0.011
edy-0.011
."-0.011
!"-0.010
 Kim-0.010
Token.
Token ID13
Feature activation+6.20e-03

Pos logit contributions
Justin+0.022
 Justin+0.022
 Mason+0.022
 Macy+0.021
 pap+0.021
Neg logit contributions
ibi-0.010
edy-0.010
 Kim-0.009
 sidewalk-0.009
."-0.009
Token He
Token ID679
Feature activation+5.92e-03

Pos logit contributions
edy+0.008
ibi+0.008
 chain+0.008
 sidewalk+0.008
anya+0.008
Neg logit contributions
 Justin-0.011
Justin-0.010
 Mason-0.010
 Manny-0.010
 Macy-0.009
Token clut
Token ID32170
Feature activation+1.40e-02

Pos logit contributions
 sidewalk+0.008
 automobiles+0.007
ibi+0.007
 Kim+0.007
edy+0.007
Neg logit contributions
Justin-0.010
 Justin-0.010
 Mason-0.010
 pap-0.010
 mer-0.009
Tokenched
Token ID1740
Feature activation+5.49e-03

Pos logit contributions
 sidewalk+0.024
ibi+0.024
 Kim+0.023
oma+0.023
 Sue+0.023
Neg logit contributions
Justin-0.018
 Justin-0.017
 Mason-0.017
 mer-0.016
Louis-0.015
Token his
Token ID465
Feature activation-2.74e-04

Pos logit contributions
ibi+0.007
edy+0.007
 sidewalk+0.007
oma+0.007
."+0.007
Neg logit contributions
Justin-0.009
 Justin-0.009
 Mason-0.009
 Macy-0.008
 pap-0.008
Token
Token ID198
Feature activation-1.90e-03

Pos logit contributions
Justin+0.014
 Justin+0.013
 Mason+0.013
 Macy+0.013
 Manny+0.012
Neg logit contributions
edy-0.015
round-0.015
 sidewalk-0.014
 chain-0.014
 automobile-0.013
Token
Token ID198
Feature activation-6.98e-03

Pos logit contributions
Justin+0.003
 Justin+0.003
 Mason+0.003
Beck+0.003
 Macy+0.003
Neg logit contributions
edy-0.003
ibi-0.002
round-0.002
 sidewalk-0.002
 automobile-0.002
TokenFrom
Token ID4863
Feature activation+1.51e-02

Pos logit contributions
Justin+0.013
 Justin+0.013
 Mason+0.012
 pap+0.012
 Manny+0.012
Neg logit contributions
ibi-0.008
edy-0.008
 sidewalk-0.008
Tom-0.008
."-0.007
Token that
Token ID326
Feature activation-1.33e-02

Pos logit contributions
ibi+0.024
keeping+0.024
 Kim+0.023
 sidewalk+0.023
oma+0.023
Neg logit contributions
where-0.018
 mer-0.017
 Justin-0.017
Justin-0.017
 pap-0.017
Token day
Token ID1110
Feature activation-1.86e-02

Pos logit contributions
Justin+0.025
 Justin+0.024
 Mason+0.024
 Macy+0.023
ubb+0.023
Neg logit contributions
ibi-0.015
."-0.014
!"-0.014
edy-0.014
 sidewalk-0.014
Token on
Token ID319
Feature activation-1.88e-03

Pos logit contributions
Justin+0.036
 Justin+0.035
 Mason+0.034
 pap+0.033
 Macy+0.033
Neg logit contributions
 Kim-0.019
ibi-0.019
."-0.019
edy-0.018
!"-0.018
Token,
Token ID11
Feature activation-1.13e-02

Pos logit contributions
Justin+0.004
 Justin+0.003
 Mason+0.003
 Macy+0.003
 pap+0.003
Neg logit contributions
ibi-0.002
edy-0.002
 sidewalk-0.002
oma-0.002
round-0.002
Token Lily
Token ID20037
Feature activation+5.14e-03

Pos logit contributions
Justin+0.030
 Justin+0.029
 Mason+0.029
Louis+0.028
 pap+0.028
Neg logit contributions
."-0.004
ibi-0.003
 Kim-0.003
edy-0.003
 Sue-0.003
Token loved
Token ID6151
Feature activation+5.49e-03

Pos logit contributions
edy+0.007
 Timothy+0.006
ibi+0.006
 Karen+0.006
 Kim+0.006
Neg logit contributions
Justin-0.009
 Mason-0.008
 Justin-0.008
 pap-0.008
where-0.008
Token to
Token ID284
Feature activation-5.39e-03

Pos logit contributions
edy+0.008
ibi+0.008
oma+0.007
 Kim+0.007
 sidewalk+0.007
Neg logit contributions
Justin-0.008
 Justin-0.008
 Mason-0.008
 pap-0.008
 mer-0.008
Token touch
Token ID3638
Feature activation+1.38e-02

Pos logit contributions
Justin+0.007
 Justin+0.007
 Mason+0.007
 pap+0.006
 Macy+0.006
Neg logit contributions
edy-0.009
ibi-0.009
oma-0.008
 Kim-0.008
 sidewalk-0.008
Token shoes
Token ID10012
Feature activation+8.34e-03

Pos logit contributions
Justin+0.037
 Justin+0.036
 Mason+0.036
 Macy+0.034
 Manny+0.034
Neg logit contributions
ibi-0.022
edy-0.021
 sidewalk-0.021
."-0.021
 Kim-0.021
Token and
Token ID290
Feature activation+5.85e-03

Pos logit contributions
ibi+0.008
edy+0.008
 Kim+0.008
oma+0.007
 Karen+0.007
Neg logit contributions
Justin-0.016
 Mason-0.015
 Justin-0.015
 nood-0.015
 mer-0.015
Token a
Token ID257
Feature activation+3.84e-05

Pos logit contributions
ibi+0.008
edy+0.008
 Kim+0.008
 sidewalk+0.008
."+0.008
Neg logit contributions
Justin-0.009
 Justin-0.009
 Mason-0.009
 Macy-0.008
 pap-0.008
Token smile
Token ID8212
Feature activation-2.65e-02

Pos logit contributions
ibi+0.000
edy+0.000
 sidewalk+0.000
."+0.000
 Kim+0.000
Neg logit contributions
Justin-0.000
 Justin-0.000
 Mason-0.000
 Macy-0.000
 Manny-0.000
Token that
Token ID326
Feature activation-2.23e-02

Pos logit contributions
Justin+0.044
 Mason+0.044
 Justin+0.043
 mer+0.043
 pap+0.043
Neg logit contributions
edy-0.032
ibi-0.030
 sidewalk-0.030
 Karen-0.029
Tom-0.029
Token made
Token ID925
Feature activation-2.81e-02

Pos logit contributions
Justin+0.038
 Justin+0.036
 Mason+0.035
Louis+0.033
ubb+0.033
Neg logit contributions
ibi-0.030
 Kim-0.029
 Sue-0.028
edy-0.028
."-0.028
Token everyone
Token ID2506
Feature activation-4.02e-03

Pos logit contributions
Justin+0.058
 Justin+0.053
ifying+0.053
Louis+0.053
Dean+0.052
Neg logit contributions
 Kim-0.032
 Sue-0.030
 Su-0.029
."-0.028
ibi-0.028
Token around
Token ID1088
Feature activation-1.22e-02

Pos logit contributions
Justin+0.007
 Justin+0.007
 Mason+0.007
Dean+0.007
Louis+0.007
Neg logit contributions
."-0.005
!"-0.005
,"-0.005
ibi-0.005
 sidewalk-0.005
Token her
Token ID607
Feature activation+6.37e-03

Pos logit contributions
Justin+0.023
 Justin+0.022
 Mason+0.022
 Macy+0.021
 pap+0.021
Neg logit contributions
ibi-0.013
edy-0.013
 Kim-0.012
 sidewalk-0.012
."-0.012
Token happy
Token ID3772
Feature activation-2.18e-02

Pos logit contributions
ibi+0.006
edy+0.006
 Timothy+0.006
 Kim+0.006
round+0.006
Neg logit contributions
Justin-0.012
 Justin-0.012
 Mason-0.012
 pap-0.012
 mer-0.012
Token.
Token ID13
Feature activation-5.29e-03

Pos logit contributions
Justin+0.042
 Justin+0.041
 Mason+0.041
 pap+0.039
 mer+0.039
Neg logit contributions
ibi-0.022
edy-0.022
 Kim-0.021
 Karen-0.020
oma-0.020
Token whist
Token ID32262
Feature activation-1.89e-02

Pos logit contributions
Justin+0.028
 Justin+0.027
 Mason+0.027
 pap+0.025
 Macy+0.025
Neg logit contributions
ibi-0.018
edy-0.018
 sidewalk-0.017
 Kim-0.017
 Karen-0.017
Tokenled
Token ID992
Feature activation-2.93e-02

Pos logit contributions
Justin+0.022
 Justin+0.021
 Mason+0.019
Dean+0.018
 Macy+0.018
Neg logit contributions
ibi-0.035
 Karen-0.035
 Kim-0.034
 Sue-0.034
 Su-0.034
Token every
Token ID790
Feature activation-2.39e-02

Pos logit contributions
Justin+0.061
 Mason+0.059
 Justin+0.059
 Macy+0.057
ubb+0.057
Neg logit contributions
."-0.028
!"-0.026
edy-0.025
,"-0.025
 and-0.025
Token morning
Token ID3329
Feature activation-5.14e-03

Pos logit contributions
Justin+0.044
 Justin+0.043
 Mason+0.042
 Manny+0.041
 Macy+0.041
Neg logit contributions
."-0.027
ibi-0.026
!"-0.026
 sidewalk-0.025
edy-0.025
Token when
Token ID618
Feature activation-2.50e-02

Pos logit contributions
 Justin+0.010
Justin+0.010
 Mason+0.010
 pap+0.010
 mer+0.010
Neg logit contributions
ibi-0.005
edy-0.005
 Timothy-0.004
 Karen-0.004
oma-0.004
Token he
Token ID339
Feature activation-3.13e-02

Pos logit contributions
Justin+0.041
 Justin+0.039
 Mason+0.038
 Macy+0.036
Louis+0.036
Neg logit contributions
ibi-0.033
edy-0.033
 Kim-0.033
."-0.032
 sidewalk-0.032
Token went
Token ID1816
Feature activation-1.42e-02

Pos logit contributions
Justin+0.053
 Justin+0.051
 Mason+0.050
 pap+0.048
 Macy+0.048
Neg logit contributions
ibi-0.040
edy-0.039
 Kim-0.038
 sidewalk-0.038
 Karen-0.037
Token to
Token ID284
Feature activation-8.35e-03

Pos logit contributions
Justin+0.026
 Justin+0.025
 Mason+0.025
 Macy+0.023
 pap+0.023
Neg logit contributions
ibi-0.016
edy-0.016
."-0.016
 sidewalk-0.016
 Kim-0.016
Token his
Token ID465
Feature activation-2.49e-02

Pos logit contributions
Justin+0.014
 Justin+0.014
 Mason+0.013
 Macy+0.013
 pap+0.013
Neg logit contributions
ibi-0.011
edy-0.011
oma-0.010
 Kim-0.010
 Karen-0.010
Token office
Token ID2607
Feature activation+2.71e-03

Pos logit contributions
Justin+0.053
 Justin+0.051
ubb+0.050
 Mason+0.050
Beck+0.050
Neg logit contributions
 sidewalk-0.021
."-0.021
oma-0.020
ibi-0.020
edy-0.020
Token.
Token ID13
Feature activation-5.12e-03

Pos logit contributions
ibi+0.003
 Timothy+0.003
edy+0.003
 Karen+0.003
 Su+0.003
Neg logit contributions
 Justin-0.005
Justin-0.005
 Mason-0.005
where-0.005
 Macy-0.005
Token day
Token ID1110
Feature activation-2.74e-03

Pos logit contributions
Justin+0.011
 Justin+0.010
 Mason+0.010
Louis+0.010
 pap+0.010
Neg logit contributions
."-0.005
 Kim-0.005
 Sue-0.005
!"-0.005
 sidewalk-0.005
Token and
Token ID290
Feature activation-5.09e-03

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 pap+0.005
 Macy+0.005
Neg logit contributions
edy-0.003
."-0.003
ibi-0.003
 sidewalk-0.003
!"-0.003
Token didn
Token ID1422
Feature activation-1.02e-02

Pos logit contributions
Justin+0.009
 Justin+0.008
 Mason+0.008
ubb+0.008
 Macy+0.008
Neg logit contributions
."-0.006
 sidewalk-0.006
 Kim-0.006
edy-0.006
ibi-0.006
Token't
Token ID470
Feature activation+1.63e-03

Pos logit contributions
Justin+0.018
 Justin+0.017
 Mason+0.017
 pap+0.016
 Macy+0.016
Neg logit contributions
ibi-0.012
edy-0.012
."-0.012
 Kim-0.012
 sidewalk-0.012
Token like
Token ID588
Feature activation-5.13e-03

Pos logit contributions
."+0.002
ibi+0.002
edy+0.002
!"+0.002
 Kim+0.002
Neg logit contributions
Justin-0.003
 Justin-0.003
 Mason-0.003
 pap-0.003
 Manny-0.003
Token to
Token ID284
Feature activation-2.99e-02

Pos logit contributions
Justin+0.010
 Justin+0.010
Beck+0.010
 Mason+0.010
Louis+0.009
Neg logit contributions
."-0.006
!"-0.005
 Sue-0.005
 sidewalk-0.005
 Kim-0.005
Token do
Token ID466
Feature activation-1.98e-02

Pos logit contributions
Justin+0.048
 Justin+0.046
 Mason+0.045
 Manny+0.043
years+0.043
Neg logit contributions
."-0.043
!"-0.041
,"-0.040
 sidewalk-0.040
ibi-0.040
Token anything
Token ID1997
Feature activation-2.95e-03

Pos logit contributions
Justin+0.033
 Justin+0.032
 Mason+0.031
 pap+0.030
 Macy+0.030
Neg logit contributions
edy-0.025
."-0.025
ibi-0.025
 Kim-0.025
 sidewalk-0.024
Token.
Token ID13
Feature activation-1.62e-03

Pos logit contributions
Justin+0.006
 Justin+0.006
 Mason+0.006
 Macy+0.006
 pap+0.006
Neg logit contributions
ibi-0.003
edy-0.003
 Kim-0.002
 sidewalk-0.002
oma-0.002
Token One
Token ID1881
Feature activation+1.51e-03

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
 Manny+0.002
 pap+0.002
Neg logit contributions
edy-0.003
ibi-0.002
oma-0.002
oot-0.002
anya-0.002
Token day
Token ID1110
Feature activation-3.73e-03

Pos logit contributions
ibi+0.002
 Sue+0.001
."+0.001
 Kim+0.001
Tom+0.001
Neg logit contributions
Justin-0.003
 Justin-0.003
 Mason-0.003
ubb-0.003
Louis-0.003
Token was
Token ID373
Feature activation+2.36e-03

Pos logit contributions
edy+0.012
ibi+0.012
 sidewalk+0.011
round+0.011
oot+0.010
Neg logit contributions
Justin-0.020
 Justin-0.020
 Mason-0.020
where-0.019
 Manny-0.019
Token a
Token ID257
Feature activation-5.14e-03

Pos logit contributions
ibi+0.003
 Sue+0.003
."+0.003
edy+0.003
Tom+0.003
Neg logit contributions
Justin-0.004
 Mason-0.004
 Justin-0.004
ubb-0.004
 pap-0.004
Token blue
Token ID4171
Feature activation-2.58e-02

Pos logit contributions
Justin+0.009
 Justin+0.008
 Mason+0.008
 pap+0.008
 mer+0.008
Neg logit contributions
ibi-0.007
edy-0.006
 Kim-0.006
 sidewalk-0.006
 Karen-0.006
Token spike
Token ID20240
Feature activation-2.28e-02

Pos logit contributions
Justin+0.048
 Justin+0.046
 Mason+0.045
 Macy+0.043
Beck+0.043
Neg logit contributions
ibi-0.030
."-0.029
 sidewalk-0.029
edy-0.028
!"-0.028
Token.
Token ID13
Feature activation-1.64e-02

Pos logit contributions
Justin+0.053
 Justin+0.051
 Mason+0.050
 Macy+0.049
 pap+0.048
Neg logit contributions
ibi-0.015
edy-0.015
."-0.015
 sidewalk-0.014
 Kim-0.014
Token It
Token ID632
Feature activation-2.89e-02

Pos logit contributions
 Justin+0.028
Justin+0.028
 Mason+0.027
 Manny+0.026
 Macy+0.026
Neg logit contributions
edy-0.022
ibi-0.021
oma-0.019
oot-0.018
 sidewalk-0.018
Token liked
Token ID8288
Feature activation+3.72e-05

Pos logit contributions
Justin+0.065
 Justin+0.063
 Mason+0.062
 Macy+0.060
 pap+0.060
Neg logit contributions
ibi-0.021
edy-0.021
 Kim-0.019
."-0.019
 sidewalk-0.019
Token to
Token ID284
Feature activation-2.11e-02

Pos logit contributions
."+0.000
 Sue+0.000
 sidewalk+0.000
 Kim+0.000
!"+0.000
Neg logit contributions
Justin-0.000
 Justin-0.000
 Mason-0.000
Beck-0.000
Louis-0.000
Token study
Token ID2050
Feature activation+4.33e-03

Pos logit contributions
Justin+0.032
 Justin+0.031
 Mason+0.030
 Macy+0.029
 Manny+0.029
Neg logit contributions
ibi-0.030
 sidewalk-0.030
."-0.030
edy-0.030
 Kim-0.029
Token all
Token ID477
Feature activation-2.99e-03

Pos logit contributions
ibi+0.007
ongo+0.007
 Timothy+0.006
edy+0.006
oma+0.006
Neg logit contributions
 Manny-0.006
 Mason-0.006
 Justin-0.006
 mer-0.006
 Macy-0.006
Token day
Token ID1110
Feature activation-2.05e-03

Pos logit contributions
Justin+0.005
 Justin+0.005
 Mason+0.005
 Macy+0.005
 pap+0.005
Neg logit contributions
ibi-0.003
edy-0.003
 sidewalk-0.003
 Kim-0.003
oma-0.003
Token design
Token ID1486
Feature activation-5.79e-03

Pos logit contributions
edy+0.000
ibi+0.000
 Karen+0.000
 Timothy+0.000
 Kim+0.000
Neg logit contributions
Justin-0.000
 Justin-0.000
 Mason-0.000
 pap-0.000
 mer-0.000
Token.
Token ID13
Feature activation+3.10e-03

Pos logit contributions
Justin+0.011
 Justin+0.011
 Mason+0.011
 Manny+0.010
 mer+0.010
Neg logit contributions
ibi-0.006
 Karen-0.006
edy-0.006
 Timothy-0.006
oma-0.005
Token They
Token ID1119
Feature activation-1.25e-03

Pos logit contributions
ibi+0.002
edy+0.002
 Kim+0.002
."+0.002
 sidewalk+0.002
Neg logit contributions
Justin-0.007
 Justin-0.007
 Mason-0.006
 pap-0.006
 Macy-0.006
Token make
Token ID787
Feature activation-1.48e-02

Pos logit contributions
Justin+0.002
 Justin+0.002
 Mason+0.002
 pap+0.002
 Macy+0.002
Neg logit contributions
edy-0.001
."-0.001
ibi-0.001
 sidewalk-0.001
!"-0.001
Token a
Token ID257
Feature activation-1.34e-02

Pos logit contributions
Justin+0.027
 Justin+0.026
 Mason+0.025
Louis+0.025
where+0.024
Neg logit contributions
edy-0.017
."-0.017
ibi-0.017
 Kim-0.017
!"-0.016
Token sun
Token ID4252
Feature activation-3.16e-02

Pos logit contributions
Justin+0.026
 Justin+0.025
 Mason+0.025
years+0.024
 Macy+0.024
Neg logit contributions
 sidewalk-0.014
ibi-0.013
."-0.013
edy-0.013
!"-0.013
Token,
Token ID11
Feature activation-2.00e-04

Pos logit contributions
Justin+0.069
 Justin+0.067
 Mason+0.065
 Macy+0.063
Louis+0.063
Neg logit contributions
ibi-0.026
edy-0.025
."-0.025
 Kim-0.025
 sidewalk-0.024
Token a
Token ID257
Feature activation-9.80e-03

Pos logit contributions
Justin+0.000
 Justin+0.000
 Mason+0.000
 pap+0.000
 Manny+0.000
Neg logit contributions
ibi-0.000
edy-0.000
oma-0.000
 Karen-0.000
 Timothy-0.000
Token heart
Token ID2612
Feature activation-7.97e-03

Pos logit contributions
Justin+0.017
 Justin+0.016
 Mason+0.016
 Macy+0.015
 Manny+0.015
Neg logit contributions
ibi-0.012
edy-0.012
 sidewalk-0.012
 Kim-0.011
."-0.011
Token,
Token ID11
Feature activation+7.82e-04

Pos logit contributions
Justin+0.015
 Justin+0.015
 Mason+0.015
 pap+0.015
 Manny+0.014
Neg logit contributions
ibi-0.008
edy-0.008
 Timothy-0.008
 Karen-0.007
 Su-0.007
Token and
Token ID290
Feature activation-1.50e-03

Pos logit contributions
ibi+0.001
edy+0.001
oma+0.001
 Karen+0.001
 Timothy+0.001
Neg logit contributions
Justin-0.002
 Justin-0.002
 Mason-0.001
 pap-0.001
 mer-0.001