TOP ACTIVATIONS
MAX = 0.081

his head , understanding what his parents were telling him .
the top . Tom thought her ladder was silly , and
him it was Spanish . J ax asked them to teach
didn 't fully understand what her dad meant , but she
L ily did what her dad said . She held
" Can I have these bricks , Mom my ?" Larry
have some of your pretty bricks , Sara ?" Tom asked
ladder but she remembered what her mom told her about being
First ". Lily remembered what her mom said about the fence
L ily thinks her mom 's job is easy
cleaning , she remembered what her dad said earlier . He
M andy thought her special gift would make her
the cake . She thought her mom would not notice .
ams ? Where are your toys ? Did you nap ?"
better . She remembered what her mom said and knew it
Lily didn 't understand what her mom meant by " spell
idea , sweet ie ," her Mom my said . "
bed , he remembered what his dad said . He realized
was worried . She thought her trumpet was broken . She
very happy . She thinks her mom and dad bought it

MIN ACTIVATIONS
MIN = -0.065

owner took him to the vet because Max wasn 't feeling
was a clever fox named Fox y . Fox y liked
was a small bird called Small Bird .
was a little fox named Fox y . Fox y was
large open space . In that space there were two friends
owner took him to the vet . The vet said Max
were two good friends called Marty and Tim my . They
was a clever fox named Fox y . Fox y had
and took it to the vet . The vet
always wanted to fly in one . One day , Tim
took the bird to the vet . Ros ie wanted to
there was a boy named Mario . He was very eager
young re ind eer named Elise . She was running around
was a smart fox named Fox y . Fox y was
with many rooms . In one room , there was a
to the vet in the morning . The next
took the bird to the vet . The vet put a
and spun around in the night sky . One
woke up early in the morning . She felt cold ,
had a cob web in one corner and the walls were

INTERVAL 0.044 - 0.081
CONTAINS 0.132%

with a purple door . Inside the hut lived a little
make a big surprise . J ax was so excited to
eddy bear . Lu cy was excited to give
". He put down the oranges and began running around the
. You have your own toys . Here , this is

INTERVAL 0.008 - 0.044
CONTAINS 24.735%

out on the table . Max and his dad both laughed
Jimmy a reward . Jimmy smiled and said , " The
wanted to stop playing . " L ily ,
Lily , but she holds on tight . They start to
are you crying ?" The little girl said , " I

INTERVAL -0.029 - 0.008
CONTAINS 73.925%

. The goose thinks they have no more bread . The
little scratch . You were very brave and did a great
said , " Please can I take it home ? I
. Lily accidentally touched the pan with her heel and it
if you want ," Ben said . " Sorry

INTERVAL -0.065 - -0.029
CONTAINS 1.208%

. He will be gentle and kind . And the hospital
want to join me in my spaceship ?" Tom said ,
. Max could jump very far , but Lisa could jump
time there was a small girl called Lily who was very
my 's mom asked him to clean his room . Tim

Token his
Token ID465
Feature activation+3.60e-02

Pos logit contributions
uct+0.002
 John+0.002
 tears+0.002
 Tommy+0.002
colm+0.002
Neg logit contributions
 mate-0.006
hetic-0.006
Ch-0.006
use-0.005
oise-0.005
Token head
Token ID1182
Feature activation-6.37e-03

Pos logit contributions
uct+0.084
 Lib+0.075
ons+0.073
 Jerry+0.073
case+0.073
Neg logit contributions
 mate-0.033
 leader-0.028
 crew-0.026
 longer-0.026
sing-0.024
Token,
Token ID11
Feature activation-5.46e-03

Pos logit contributions
 mate+0.014
use+0.013
Ch+0.013
Ro+0.013
hetic+0.013
Neg logit contributions
uct-0.006
 tears-0.005
 John-0.005
 Tommy-0.005
 Tears-0.005
Token understanding
Token ID4547
Feature activation-4.07e-03

Pos logit contributions
 mate+0.010
hetic+0.009
Ch+0.008
king+0.008
use+0.008
Neg logit contributions
uct-0.008
colm-0.007
case-0.007
 John-0.007
 Tommy-0.006
Token what
Token ID644
Feature activation+5.61e-03

Pos logit contributions
 mate+0.007
hetic+0.007
Ch+0.006
use+0.006
king+0.006
Neg logit contributions
uct-0.006
colm-0.005
case-0.005
 John-0.005
 tears-0.005
Token his
Token ID465
Feature activation+8.08e-02

Pos logit contributions
uct+0.006
colm+0.005
 John+0.004
 tears+0.004
case+0.004
Neg logit contributions
 mate-0.012
hetic-0.011
Ch-0.011
use-0.011
 crew-0.011
Token parents
Token ID3397
Feature activation+4.35e-03

Pos logit contributions
uct+0.131
case+0.112
 Jerry+0.111
colm+0.104
 unlocks+0.103
Neg logit contributions
 mate-0.117
 longer-0.115
 someday-0.107
 crew-0.103
 permission-0.102
Token were
Token ID547
Feature activation+5.89e-03

Pos logit contributions
uct+0.006
colm+0.005
 Tears+0.005
 tears+0.005
 John+0.005
Neg logit contributions
 mate-0.007
use-0.007
hetic-0.007
be-0.006
oise-0.006
Token telling
Token ID5149
Feature activation-1.87e-02

Pos logit contributions
uct+0.008
colm+0.007
case+0.007
 John+0.007
 Tears+0.007
Neg logit contributions
 mate-0.010
hetic-0.009
use-0.009
be-0.009
king-0.009
Token him
Token ID683
Feature activation-4.86e-03

Pos logit contributions
 mate+0.033
hetic+0.030
Ch+0.029
use+0.029
king+0.028
Neg logit contributions
uct-0.027
colm-0.023
 John-0.022
 tears-0.022
case-0.022
Token.
Token ID13
Feature activation-2.41e-03

Pos logit contributions
 mate+0.009
hetic+0.008
use+0.007
 crew+0.007
ppy+0.007
Neg logit contributions
uct-0.007
colm-0.006
 tears-0.006
 John-0.006
case-0.006
Token the
Token ID262
Feature activation-4.63e-04

Pos logit contributions
uct+0.019
colm+0.015
 Tears+0.014
case+0.014
 tears+0.014
Neg logit contributions
 mate-0.023
Ch-0.020
 longer-0.020
hetic-0.020
use-0.020
Token top
Token ID1353
Feature activation+5.17e-03

Pos logit contributions
 mate+0.001
hetic+0.001
Ch+0.001
be+0.001
use+0.001
Neg logit contributions
uct-0.001
colm-0.001
 John-0.001
 Tommy-0.001
 Tears-0.001
Token.
Token ID13
Feature activation+1.79e-02

Pos logit contributions
uct+0.008
colm+0.007
 tears+0.007
 John+0.007
 Tommy+0.007
Neg logit contributions
 mate-0.008
Ch-0.007
hetic-0.007
be-0.007
use-0.007
Token Tom
Token ID4186
Feature activation+1.46e-02

Pos logit contributions
uct+0.029
colm+0.027
 ey+0.027
 tears+0.027
 done+0.025
Neg logit contributions
 Some-0.021
Some-0.020
Cow-0.020
 mate-0.020
Ch-0.020
Token thought
Token ID1807
Feature activation+2.00e-02

Pos logit contributions
uct+0.023
colm+0.020
 Jerry+0.019
 Tears+0.019
 Tommy+0.018
Neg logit contributions
 mate-0.022
be-0.022
Ch-0.022
ppy-0.020
hetic-0.020
Token her
Token ID607
Feature activation+7.70e-02

Pos logit contributions
uct+0.030
colm+0.027
 Tears+0.024
case+0.022
 tears+0.022
Neg logit contributions
 mate-0.036
hetic-0.031
 longer-0.031
Ch-0.029
 someday-0.029
Token ladder
Token ID18002
Feature activation+3.66e-02

Pos logit contributions
uct+0.126
colm+0.116
 Jerry+0.116
 Tommy+0.112
keeping+0.108
Neg logit contributions
 mate-0.093
 sheep-0.092
 leader-0.092
 longer-0.092
 help-0.092
Token was
Token ID373
Feature activation+4.16e-03

Pos logit contributions
uct+0.049
colm+0.038
 John+0.037
 tears+0.036
 Tears+0.036
Neg logit contributions
 mate-0.069
hetic-0.063
Ch-0.063
be-0.062
use-0.060
Token silly
Token ID14397
Feature activation+1.49e-02

Pos logit contributions
uct+0.006
colm+0.006
 ey+0.005
 Tears+0.005
 tears+0.005
Neg logit contributions
 mate-0.007
 longer-0.006
hetic-0.006
 engineers-0.006
be-0.006
Token,
Token ID11
Feature activation+8.33e-03

Pos logit contributions
uct+0.023
colm+0.020
 tears+0.018
 Tears+0.018
case+0.018
Neg logit contributions
 mate-0.026
hetic-0.023
king-0.022
 longer-0.022
be-0.022
Token and
Token ID290
Feature activation+1.76e-02

Pos logit contributions
uct+0.014
colm+0.012
case+0.012
 Tears+0.011
 tears+0.011
Neg logit contributions
 mate-0.013
hetic-0.011
 longer-0.011
king-0.011
Ch-0.011
Token him
Token ID683
Feature activation+1.98e-03

Pos logit contributions
 mate+0.010
hetic+0.009
Ch+0.009
use+0.009
oise+0.009
Neg logit contributions
uct-0.008
 John-0.007
 Tommy-0.007
 tears-0.006
colm-0.006
Token it
Token ID340
Feature activation-1.11e-03

Pos logit contributions
uct+0.003
colm+0.002
 John+0.002
 tears+0.002
 Tommy+0.002
Neg logit contributions
 mate-0.004
hetic-0.003
Ch-0.003
use-0.003
 crew-0.003
Token was
Token ID373
Feature activation-1.90e-02

Pos logit contributions
 mate+0.002
hetic+0.002
Ch+0.002
use+0.002
Cow+0.002
Neg logit contributions
uct-0.001
 John-0.001
 tears-0.001
colm-0.001
 Tommy-0.001
Token Spanish
Token ID7897
Feature activation-9.10e-03

Pos logit contributions
 mate+0.036
hetic+0.033
Ch+0.033
use+0.032
Cow+0.031
Neg logit contributions
uct-0.025
 John-0.021
 Tommy-0.021
 tears-0.021
colm-0.021
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
 mate+0.017
hetic+0.015
Ch+0.015
use+0.015
Cow+0.015
Neg logit contributions
uct-0.012
 John-0.011
 tears-0.010
colm-0.010
 Tommy-0.010
Token J
Token ID449
Feature activation+7.56e-02

Pos logit contributions
uct+0.033
colm+0.028
 John+0.028
 tears+0.027
 Tommy+0.027
Neg logit contributions
 mate-0.036
hetic-0.031
Ch-0.031
use-0.030
Cow-0.030
Tokenax
Token ID897
Feature activation+7.45e-03

Pos logit contributions
case+0.115
 tears+0.108
 Tears+0.108
 Amy+0.103
 Alice+0.102
Neg logit contributions
 mate-0.109
ppy-0.105
aa-0.105
be-0.104
ub-0.103
Token asked
Token ID1965
Feature activation-2.31e-03

Pos logit contributions
uct+0.010
colm+0.008
 John+0.008
 Tommy+0.008
 tears+0.008
Neg logit contributions
 mate-0.015
hetic-0.013
Ch-0.013
be-0.013
use-0.013
Token them
Token ID606
Feature activation-5.72e-03

Pos logit contributions
 mate+0.005
hetic+0.005
Ch+0.005
use+0.005
king+0.005
Neg logit contributions
uct-0.002
 John-0.002
 Tommy-0.002
 tears-0.002
colm-0.002
Token to
Token ID284
Feature activation-7.82e-03

Pos logit contributions
 mate+0.010
Ch+0.010
hetic+0.010
use+0.010
Cow+0.009
Neg logit contributions
uct-0.007
 John-0.007
 Tommy-0.006
 Jerry-0.006
 tears-0.006
Token teach
Token ID4545
Feature activation+1.66e-03

Pos logit contributions
 mate+0.010
hetic+0.009
Ch+0.009
use+0.008
 chick+0.008
Neg logit contributions
uct-0.015
 John-0.014
 Tommy-0.013
 tears-0.013
 Jerry-0.013
Token didn
Token ID1422
Feature activation+1.94e-02

Pos logit contributions
uct+0.023
 John+0.020
 Jerry+0.020
 Tommy+0.020
 Lib+0.019
Neg logit contributions
 mate-0.024
be-0.023
Ch-0.023
ppy-0.023
hetic-0.022
Token't
Token ID470
Feature activation+1.41e-02

Pos logit contributions
uct+0.037
colm+0.033
 tears+0.032
 John+0.031
 Tears+0.031
Neg logit contributions
 mate-0.024
Ch-0.022
hetic-0.022
be-0.021
Cow-0.021
Token fully
Token ID3938
Feature activation-1.61e-03

Pos logit contributions
uct+0.025
 Tommy+0.022
 John+0.022
 tears+0.022
 Jerry+0.021
Neg logit contributions
 mate-0.020
Ch-0.019
hetic-0.018
 chick-0.017
Cow-0.017
Token understand
Token ID1833
Feature activation-5.78e-03

Pos logit contributions
 mate+0.002
hetic+0.002
Ch+0.002
use+0.002
Cow+0.002
Neg logit contributions
uct-0.003
 John-0.003
colm-0.003
 tears-0.002
 Tommy-0.002
Token what
Token ID644
Feature activation+8.92e-03

Pos logit contributions
 mate+0.009
hetic+0.008
use+0.008
Ch+0.008
king+0.008
Neg logit contributions
uct-0.010
colm-0.008
case-0.008
 John-0.008
 Tears-0.008
Token her
Token ID607
Feature activation+7.52e-02

Pos logit contributions
uct+0.012
colm+0.010
 John+0.010
 tears+0.010
case+0.009
Neg logit contributions
 mate-0.017
hetic-0.015
Ch-0.015
use-0.015
Cow-0.015
Token dad
Token ID9955
Feature activation+1.04e-02

Pos logit contributions
uct+0.135
 Jerry+0.117
case+0.116
 Tears+0.111
colm+0.107
Neg logit contributions
 mate-0.109
 crew-0.099
 permission-0.095
 spy-0.093
 sheep-0.093
Token meant
Token ID4001
Feature activation+2.16e-02

Pos logit contributions
uct+0.013
colm+0.011
 tears+0.011
 Tears+0.011
 John+0.010
Neg logit contributions
 mate-0.019
hetic-0.018
use-0.017
Ch-0.017
be-0.017
Token,
Token ID11
Feature activation+4.19e-03

Pos logit contributions
uct+0.032
colm+0.027
 tears+0.026
 John+0.026
case+0.025
Neg logit contributions
 mate-0.038
hetic-0.034
use-0.033
Ch-0.033
be-0.032
Token but
Token ID475
Feature activation+9.54e-03

Pos logit contributions
uct+0.007
colm+0.006
case+0.006
 John+0.005
 Tears+0.005
Neg logit contributions
 mate-0.006
 longer-0.006
hetic-0.006
Ch-0.006
king-0.006
Token she
Token ID673
Feature activation+1.02e-02

Pos logit contributions
uct+0.016
colm+0.015
case+0.015
icycle+0.014
 Tears+0.014
Neg logit contributions
 mate-0.013
 longer-0.012
 someday-0.012
 chick-0.011
 lear-0.011
Token\n
Token ID198
Feature activation-1.55e-02

Pos logit contributions
 mate+0.028
Ch+0.027
Gr+0.026
Cow+0.025
use+0.025
Neg logit contributions
uct-0.021
colm-0.017
 John-0.017
 tears-0.016
case-0.015
TokenL
Token ID43
Feature activation+1.41e-02

Pos logit contributions
 mate+0.019
hetic+0.016
use+0.015
Ch+0.015
king+0.015
Neg logit contributions
uct-0.032
colm-0.028
 John-0.028
 tears-0.028
 Tommy-0.028
Tokenily
Token ID813
Feature activation+2.05e-02

Pos logit contributions
uct+0.025
colm+0.023
 John+0.023
 Jerry+0.022
 tears+0.022
Neg logit contributions
 mate-0.020
ppy-0.018
hetic-0.017
Ch-0.017
Gr-0.017
Token did
Token ID750
Feature activation+2.14e-02

Pos logit contributions
uct+0.031
 John+0.027
 Timothy+0.027
 Jerry+0.027
colm+0.027
Neg logit contributions
Ch-0.029
 mate-0.029
ppy-0.029
be-0.028
Qu-0.027
Token what
Token ID644
Feature activation+5.33e-03

Pos logit contributions
uct+0.030
colm+0.029
case+0.025
 John+0.024
 Tears+0.024
Neg logit contributions
 mate-0.036
hetic-0.034
 longer-0.034
king-0.032
Ch-0.032
Token her
Token ID607
Feature activation+7.47e-02

Pos logit contributions
uct+0.007
colm+0.006
case+0.005
 tears+0.005
 ey+0.005
Neg logit contributions
 mate-0.010
Ch-0.009
hetic-0.009
ppy-0.009
 sheep-0.009
Token dad
Token ID9955
Feature activation+1.58e-02

Pos logit contributions
uct+0.135
case+0.119
 Tears+0.115
 Jerry+0.113
colm+0.112
Neg logit contributions
 mate-0.096
 permission-0.088
 sheep-0.088
 spy-0.087
 crew-0.086
Token said
Token ID531
Feature activation-5.57e-03

Pos logit contributions
uct+0.021
 Tears+0.018
colm+0.018
 tears+0.017
 John+0.017
Neg logit contributions
 mate-0.027
Ch-0.025
use-0.025
hetic-0.025
ppy-0.025
Token.
Token ID13
Feature activation+9.67e-03

Pos logit contributions
 mate+0.011
Ch+0.010
hetic+0.010
use+0.010
Cow+0.010
Neg logit contributions
uct-0.007
 tears-0.006
 John-0.006
 Tommy-0.006
colm-0.006
Token She
Token ID1375
Feature activation-1.15e-03

Pos logit contributions
uct+0.016
colm+0.014
 done+0.013
case+0.012
 tears+0.012
Neg logit contributions
 mate-0.015
Ch-0.013
Some-0.013
Cow-0.013
Gr-0.012
Token held
Token ID2714
Feature activation+1.29e-02

Pos logit contributions
 mate+0.002
Ch+0.002
hetic+0.002
ppy+0.002
 spy+0.002
Neg logit contributions
uct-0.002
 John-0.001
 Tears-0.001
colm-0.001
 Jerry-0.001
Token"
Token ID1
Feature activation-1.11e-02

Pos logit contributions
 mate+0.020
hetic+0.018
use+0.017
Ch+0.017
king+0.017
Neg logit contributions
uct-0.017
 John-0.015
colm-0.015
 Tommy-0.014
 tears-0.014
TokenCan
Token ID6090
Feature activation-1.10e-02

Pos logit contributions
 mate+0.018
Ch+0.016
hetic+0.016
use+0.016
Cow+0.016
Neg logit contributions
uct-0.018
 tears-0.016
 John-0.015
colm-0.015
 Tears-0.015
Token I
Token ID314
Feature activation-9.11e-03

Pos logit contributions
 mate+0.017
Ch+0.014
use+0.014
hetic+0.014
Cow+0.013
Neg logit contributions
uct-0.020
colm-0.017
 tears-0.017
 Tears-0.016
 John-0.016
Token have
Token ID423
Feature activation+2.42e-03

Pos logit contributions
 mate+0.014
hetic+0.013
Ch+0.012
king+0.012
use+0.012
Neg logit contributions
uct-0.015
 John-0.014
 Tommy-0.013
case-0.013
colm-0.013
Token these
Token ID777
Feature activation+1.56e-02

Pos logit contributions
uct+0.004
 John+0.003
 Tommy+0.003
 tears+0.003
 Jerry+0.003
Neg logit contributions
 mate-0.004
hetic-0.004
Ch-0.004
king-0.004
ppy-0.004
Token bricks
Token ID28902
Feature activation+7.45e-02

Pos logit contributions
uct+0.026
colm+0.022
 Tears+0.022
 tears+0.020
 Heidi+0.020
Neg logit contributions
 mate-0.026
use-0.022
 longer-0.022
 engineers-0.021
be-0.021
Token,
Token ID11
Feature activation-1.95e-04

Pos logit contributions
uct+0.136
colm+0.127
 Tears+0.124
 tears+0.123
 John+0.110
Neg logit contributions
 mate-0.106
use-0.096
be-0.091
 longer-0.090
 someday-0.089
Token Mom
Token ID11254
Feature activation+2.35e-02

Pos logit contributions
 mate+0.000
 crew+0.000
Ch+0.000
 engineers+0.000
 spy+0.000
Neg logit contributions
uct-0.000
colm-0.000
 Tears-0.000
 tears-0.000
case-0.000
Tokenmy
Token ID1820
Feature activation+1.97e-02

Pos logit contributions
uct+0.039
 tears+0.035
colm+0.035
 Tears+0.034
case+0.033
Neg logit contributions
 mate-0.037
use-0.033
be-0.032
ppy-0.031
hetic-0.031
Token?"
Token ID1701
Feature activation-1.11e-02

Pos logit contributions
uct+0.031
colm+0.027
 tears+0.027
 Tears+0.026
 John+0.026
Neg logit contributions
 mate-0.033
use-0.029
hetic-0.029
Ch-0.028
be-0.028
Token Larry
Token ID13633
Feature activation+8.08e-04

Pos logit contributions
 mate+0.021
Ch+0.019
Gr+0.018
Cow+0.018
use+0.018
Neg logit contributions
uct-0.015
colm-0.013
case-0.012
 Tears-0.011
 tears-0.011
Token have
Token ID423
Feature activation-3.89e-03

Pos logit contributions
 mate+0.024
hetic+0.022
Ch+0.022
king+0.021
Gr+0.020
Neg logit contributions
uct-0.021
 John-0.019
 Tommy-0.018
 Jerry-0.018
case-0.018
Token some
Token ID617
Feature activation-1.43e-03

Pos logit contributions
 mate+0.007
hetic+0.007
Ch+0.006
king+0.006
ppy+0.006
Neg logit contributions
uct-0.005
 John-0.005
 Tommy-0.005
 tears-0.005
 Jerry-0.005
Token of
Token ID286
Feature activation+8.05e-03

Pos logit contributions
 mate+0.002
use+0.002
 longer+0.002
be+0.002
 engineers+0.002
Neg logit contributions
uct-0.002
colm-0.002
 Tears-0.002
 John-0.002
 tears-0.002
Token your
Token ID534
Feature activation-5.76e-03

Pos logit contributions
uct+0.011
colm+0.009
 John+0.009
 tears+0.009
 Tommy+0.008
Neg logit contributions
 mate-0.016
hetic-0.014
Ch-0.014
use-0.014
Cow-0.013
Token pretty
Token ID2495
Feature activation+5.28e-04

Pos logit contributions
 mate+0.011
hetic+0.010
use+0.010
Ch+0.010
 crew+0.009
Neg logit contributions
uct-0.008
colm-0.006
 John-0.006
 tears-0.006
 Tommy-0.006
Token bricks
Token ID28902
Feature activation+7.40e-02

Pos logit contributions
uct+0.001
colm+0.001
 John+0.001
 tears+0.001
 Tommy+0.001
Neg logit contributions
 mate-0.001
hetic-0.001
use-0.001
Ch-0.001
be-0.001
Token,
Token ID11
Feature activation-8.59e-04

Pos logit contributions
uct+0.144
colm+0.129
 Tears+0.129
 tears+0.127
 John+0.118
Neg logit contributions
 mate-0.101
use-0.089
be-0.087
 longer-0.084
 someday-0.083
Token Sara
Token ID24799
Feature activation+1.35e-02

Pos logit contributions
 mate+0.002
Ch+0.001
Cow+0.001
 spy+0.001
 longer+0.001
Neg logit contributions
uct-0.001
colm-0.001
 Tears-0.001
 tears-0.001
case-0.001
Token?"
Token ID1701
Feature activation-2.12e-03

Pos logit contributions
uct+0.021
colm+0.017
 John+0.017
 tears+0.017
 Tears+0.017
Neg logit contributions
 mate-0.023
hetic-0.021
use-0.020
Ch-0.020
ppy-0.020
Token Tom
Token ID4186
Feature activation+1.17e-02

Pos logit contributions
 mate+0.004
Ch+0.004
Cow+0.004
 spy+0.004
use+0.004
Neg logit contributions
uct-0.003
case-0.002
colm-0.002
 Tears-0.002
 tears-0.002
Token asked
Token ID1965
Feature activation-4.17e-03

Pos logit contributions
uct+0.018
colm+0.016
 John+0.016
case+0.015
 tears+0.015
Neg logit contributions
 mate-0.020
Ch-0.017
hetic-0.017
use-0.017
ppy-0.017
Token ladder
Token ID18002
Feature activation+1.06e-02

Pos logit contributions
uct+0.012
colm+0.010
 John+0.010
 Tommy+0.010
 Jerry+0.009
Neg logit contributions
 mate-0.014
be-0.014
 longer-0.013
Ch-0.013
 clinic-0.013
Token but
Token ID475
Feature activation+2.33e-03

Pos logit contributions
uct+0.017
colm+0.015
 tears+0.014
 Tears+0.013
 John+0.013
Neg logit contributions
 mate-0.018
 longer-0.015
be-0.015
 someday-0.015
use-0.015
Token she
Token ID673
Feature activation+1.60e-02

Pos logit contributions
uct+0.004
colm+0.004
case+0.003
 tears+0.003
 Tears+0.003
Neg logit contributions
 mate-0.003
 someday-0.003
 longer-0.003
hetic-0.003
 spy-0.003
Token remembered
Token ID12086
Feature activation+1.16e-02

Pos logit contributions
uct+0.024
 John+0.020
 Jerry+0.020
 Tommy+0.019
colm+0.019
Neg logit contributions
 mate-0.027
Ch-0.025
be-0.025
ppy-0.025
hetic-0.024
Token what
Token ID644
Feature activation+8.20e-03

Pos logit contributions
uct+0.016
colm+0.014
case+0.014
 John+0.013
 tears+0.013
Neg logit contributions
 mate-0.021
hetic-0.019
Ch-0.018
use-0.018
 chick-0.018
Token her
Token ID607
Feature activation+7.36e-02

Pos logit contributions
uct+0.013
colm+0.011
case+0.011
 tears+0.010
 ey+0.010
Neg logit contributions
 mate-0.013
Ch-0.012
 sheep-0.012
 chick-0.011
 nurses-0.011
Token mom
Token ID1995
Feature activation+2.04e-02

Pos logit contributions
uct+0.141
 Jerry+0.124
colm+0.122
 Tears+0.122
 Timothy+0.119
Neg logit contributions
 mate-0.086
 sheep-0.082
 permission-0.077
 crew-0.077
 engineers-0.076
Token told
Token ID1297
Feature activation-1.83e-02

Pos logit contributions
uct+0.030
 Tears+0.027
 tears+0.026
colm+0.026
 ey+0.025
Neg logit contributions
 mate-0.031
use-0.031
ppy-0.030
be-0.030
Ch-0.029
Token her
Token ID607
Feature activation+2.74e-02

Pos logit contributions
 mate+0.034
Ch+0.031
hetic+0.031
use+0.030
ppy+0.030
Neg logit contributions
uct-0.024
colm-0.021
 John-0.020
 tears-0.019
case-0.019
Token about
Token ID546
Feature activation+2.00e-02

Pos logit contributions
uct+0.048
colm+0.042
 ey+0.039
 tears+0.039
 Jerry+0.038
Neg logit contributions
 mate-0.042
hetic-0.034
 crew-0.033
be-0.033
 someday-0.033
Token being
Token ID852
Feature activation-1.29e-02

Pos logit contributions
uct+0.025
colm+0.021
 tears+0.020
 John+0.019
case+0.019
Neg logit contributions
 mate-0.039
Ch-0.036
hetic-0.036
use-0.034
Cow-0.034
Token First
Token ID3274
Feature activation+5.38e-04

Pos logit contributions
 mate+0.003
Ch+0.003
hetic+0.003
Gr+0.002
Cow+0.002
Neg logit contributions
uct-0.003
 John-0.002
 Tommy-0.002
colm-0.002
case-0.002
Token".
Token ID1911
Feature activation-1.67e-02

Pos logit contributions
uct+0.001
 John+0.001
 Tommy+0.001
 tears+0.001
colm+0.001
Neg logit contributions
 mate-0.001
Ch-0.001
hetic-0.001
use-0.001
Gr-0.001
Token Lily
Token ID20037
Feature activation+1.23e-02

Pos logit contributions
Ch+0.025
 mate+0.024
Gr+0.024
Cow+0.022
 clinic+0.022
Neg logit contributions
uct-0.026
colm-0.026
 ey-0.021
 John-0.021
 tears-0.021
Token remembered
Token ID12086
Feature activation+6.96e-03

Pos logit contributions
uct+0.014
colm+0.011
 tears+0.011
 John+0.011
case+0.011
Neg logit contributions
 mate-0.026
hetic-0.024
use-0.023
Ch-0.023
king-0.023
Token what
Token ID644
Feature activation+8.56e-03

Pos logit contributions
uct+0.009
 John+0.008
colm+0.007
 tears+0.007
 Tommy+0.007
Neg logit contributions
 mate-0.014
hetic-0.012
Ch-0.012
use-0.012
Cow-0.012
Token her
Token ID607
Feature activation+7.35e-02

Pos logit contributions
uct+0.013
colm+0.011
 tears+0.010
case+0.010
 John+0.010
Neg logit contributions
 mate-0.015
hetic-0.014
Ch-0.014
 chick-0.013
 sheep-0.013
Token mom
Token ID1995
Feature activation+2.16e-02

Pos logit contributions
uct+0.131
colm+0.111
 Jerry+0.104
 Robert+0.101
 Tears+0.101
Neg logit contributions
 mate-0.115
 crew-0.099
 engineers-0.095
 sheep-0.094
 leader-0.094
Token said
Token ID531
Feature activation-1.26e-03

Pos logit contributions
uct+0.030
colm+0.025
 Tears+0.025
 tears+0.025
 John+0.024
Neg logit contributions
 mate-0.038
use-0.035
hetic-0.034
be-0.034
Ch-0.034
Token about
Token ID546
Feature activation+1.59e-02

Pos logit contributions
 mate+0.003
hetic+0.002
Ch+0.002
use+0.002
Cow+0.002
Neg logit contributions
uct-0.001
 John-0.001
 tears-0.001
 Tommy-0.001
colm-0.001
Token the
Token ID262
Feature activation+7.07e-03

Pos logit contributions
uct+0.019
 John+0.016
 tears+0.015
colm+0.015
 Tommy+0.015
Neg logit contributions
 mate-0.033
hetic-0.030
Ch-0.029
use-0.029
king-0.028
Token fence
Token ID13990
Feature activation+1.09e-02

Pos logit contributions
uct+0.009
 tears+0.008
case+0.008
 John+0.007
 Jerry+0.007
Neg logit contributions
 mate-0.014
ppy-0.013
Ch-0.013
hetic-0.013
Gr-0.013
Token\n
Token ID198
Feature activation-5.86e-04

Pos logit contributions
uct+0.009
colm+0.007
 tears+0.007
 John+0.007
case+0.007
Neg logit contributions
 mate-0.014
Ch-0.013
Cow-0.012
Gr-0.012
hetic-0.012
Token\n
Token ID198
Feature activation-4.82e-03

Pos logit contributions
 mate+0.001
Ch+0.001
Gr+0.001
Cow+0.001
use+0.001
Neg logit contributions
uct-0.001
colm-0.001
 tears-0.001
 John-0.001
 Tears-0.001
TokenL
Token ID43
Feature activation+1.50e-02

Pos logit contributions
 mate+0.008
 crew+0.007
hetic+0.006
 chick+0.006
use+0.006
Neg logit contributions
uct-0.008
case-0.007
 Tommy-0.007
 tears-0.007
colm-0.007
Tokenily
Token ID813
Feature activation+2.04e-02

Pos logit contributions
uct+0.024
colm+0.021
 John+0.021
 tears+0.021
 Tommy+0.020
Neg logit contributions
 mate-0.025
hetic-0.021
Ch-0.021
ppy-0.021
use-0.020
Token thinks
Token ID6834
Feature activation+8.81e-03

Pos logit contributions
uct+0.034
 Timothy+0.030
 Jerry+0.029
 John+0.029
 Lib+0.028
Neg logit contributions
 mate-0.030
hetic-0.028
be-0.028
use-0.027
 spy-0.027
Token her
Token ID607
Feature activation+7.34e-02

Pos logit contributions
uct+0.013
colm+0.011
 John+0.010
 Tears+0.010
 tears+0.010
Neg logit contributions
 mate-0.016
hetic-0.014
Ch-0.014
use-0.013
 crew-0.013
Token mom
Token ID1995
Feature activation+2.18e-02

Pos logit contributions
uct+0.128
 John+0.101
 Jerry+0.101
colm+0.099
 Robert+0.095
Neg logit contributions
 mate-0.116
 spy-0.105
 clinic-0.099
 sheep-0.099
 crew-0.099
Token's
Token ID338
Feature activation+3.00e-02

Pos logit contributions
uct+0.032
 John+0.026
 Tears+0.026
 Tommy+0.025
 Jerry+0.025
Neg logit contributions
 mate-0.036
use-0.033
be-0.033
hetic-0.032
Ch-0.032
Token job
Token ID1693
Feature activation+3.42e-03

Pos logit contributions
uct+0.043
 John+0.035
colm+0.035
 Jerry+0.034
 Tommy+0.033
Neg logit contributions
 mate-0.056
hetic-0.050
Ch-0.049
use-0.048
 crew-0.047
Token is
Token ID318
Feature activation-7.18e-03

Pos logit contributions
uct+0.005
colm+0.004
 John+0.004
 tears+0.004
 Jerry+0.004
Neg logit contributions
 mate-0.006
hetic-0.006
Ch-0.006
be-0.005
use-0.005
Token easy
Token ID2562
Feature activation+2.16e-04

Pos logit contributions
 mate+0.013
hetic+0.012
Ch+0.012
use+0.012
be+0.011
Neg logit contributions
uct-0.010
colm-0.008
 John-0.008
 tears-0.008
 Tommy-0.008
Token cleaning
Token ID12724
Feature activation+1.72e-02

Pos logit contributions
uct+0.020
 John+0.017
colm+0.017
 tears+0.017
 Tommy+0.016
Neg logit contributions
 mate-0.027
hetic-0.025
Ch-0.024
use-0.024
Cow-0.023
Token,
Token ID11
Feature activation+6.17e-03

Pos logit contributions
uct+0.031
colm+0.025
case+0.024
 Jerry+0.023
 Tears+0.023
Neg logit contributions
 mate-0.024
 longer-0.023
hetic-0.023
use-0.022
Ch-0.022
Token she
Token ID673
Feature activation-4.48e-03

Pos logit contributions
uct+0.010
colm+0.009
case+0.009
 John+0.008
 ey+0.008
Neg logit contributions
 mate-0.009
 longer-0.009
Ch-0.008
Gr-0.008
hetic-0.008
Token remembered
Token ID12086
Feature activation+6.50e-03

Pos logit contributions
 mate+0.008
hetic+0.007
Ch+0.007
use+0.007
Cow+0.007
Neg logit contributions
uct-0.006
 tears-0.005
 John-0.005
colm-0.005
 Tommy-0.005
Token what
Token ID644
Feature activation+3.04e-03

Pos logit contributions
uct+0.009
colm+0.008
case+0.008
 John+0.008
 tears+0.008
Neg logit contributions
 mate-0.012
hetic-0.010
Ch-0.010
use-0.010
 peac-0.010
Token her
Token ID607
Feature activation+7.32e-02

Pos logit contributions
uct+0.005
colm+0.004
case+0.004
 ey+0.004
 tears+0.004
Neg logit contributions
 mate-0.005
Ch-0.004
Gr-0.004
 nurses-0.004
 sheep-0.004
Token dad
Token ID9955
Feature activation+6.60e-03

Pos logit contributions
uct+0.144
 Tears+0.119
case+0.118
 Jerry+0.114
 unlocks+0.114
Neg logit contributions
 mate-0.095
 sheep-0.089
 crew-0.086
 spy-0.083
 permission-0.079
Token said
Token ID531
Feature activation-2.65e-03

Pos logit contributions
uct+0.009
 Tears+0.008
colm+0.008
 tears+0.008
 John+0.007
Neg logit contributions
 mate-0.011
use-0.010
hetic-0.010
be-0.010
ppy-0.010
Token earlier
Token ID2961
Feature activation-2.47e-03

Pos logit contributions
 mate+0.005
hetic+0.005
Ch+0.005
use+0.005
Cow+0.004
Neg logit contributions
uct-0.003
 John-0.003
colm-0.003
 tears-0.003
 Tommy-0.003
Token.
Token ID13
Feature activation-3.31e-03

Pos logit contributions
 mate+0.005
hetic+0.004
Ch+0.004
use+0.004
Cow+0.004
Neg logit contributions
uct-0.003
colm-0.003
 John-0.003
 tears-0.003
 Tommy-0.003
Token He
Token ID679
Feature activation-2.38e-03

Pos logit contributions
Ch+0.005
 mate+0.005
Gr+0.005
Cow+0.004
cient+0.004
Neg logit contributions
uct-0.005
colm-0.005
 tears-0.004
 done-0.004
 ey-0.004
Token\n
Token ID198
Feature activation+1.62e-03

Pos logit contributions
 mate+0.002
Ch+0.002
hetic+0.002
Gr+0.002
Cow+0.002
Neg logit contributions
uct-0.002
colm-0.002
 tears-0.001
case-0.001
 Jerry-0.001
Token\n
Token ID198
Feature activation-6.55e-03

Pos logit contributions
uct+0.002
colm+0.001
 John+0.001
 tears+0.001
 Tommy+0.001
Neg logit contributions
 mate-0.004
Ch-0.003
hetic-0.003
use-0.003
Cow-0.003
TokenM
Token ID44
Feature activation+2.41e-02

Pos logit contributions
 mate+0.012
use+0.011
hetic+0.011
king+0.011
urnal+0.011
Neg logit contributions
uct-0.010
 Tears-0.007
case-0.007
 John-0.007
colm-0.007
Tokenandy
Token ID10757
Feature activation+9.51e-03

Pos logit contributions
uct+0.041
colm+0.035
 John+0.034
 tears+0.034
 Tears+0.033
Neg logit contributions
 mate-0.039
hetic-0.034
Ch-0.033
king-0.033
use-0.033
Token thought
Token ID1807
Feature activation+1.55e-02

Pos logit contributions
uct+0.012
 Tommy+0.010
 John+0.010
 Jerry+0.010
colm+0.010
Neg logit contributions
 mate-0.018
hetic-0.016
Ch-0.016
be-0.016
use-0.016
Token her
Token ID607
Feature activation+7.28e-02

Pos logit contributions
uct+0.021
colm+0.018
 tears+0.017
 John+0.017
 Tommy+0.016
Neg logit contributions
 mate-0.029
hetic-0.026
Ch-0.025
use-0.025
 crew-0.024
Token special
Token ID2041
Feature activation+1.68e-02

Pos logit contributions
uct+0.116
 Jerry+0.097
 John+0.096
 Timothy+0.096
colm+0.095
Neg logit contributions
 mate-0.119
 crew-0.104
 chick-0.104
 peac-0.102
 leader-0.101
Token gift
Token ID6979
Feature activation+1.29e-02

Pos logit contributions
uct+0.025
 John+0.022
colm+0.021
 tears+0.021
 Tommy+0.021
Neg logit contributions
 mate-0.029
hetic-0.026
Ch-0.026
use-0.025
king-0.025
Token would
Token ID561
Feature activation+1.13e-02

Pos logit contributions
uct+0.017
colm+0.013
 Tommy+0.013
 John+0.012
 Tears+0.012
Neg logit contributions
 mate-0.025
 chick-0.022
Ch-0.022
 crew-0.022
hetic-0.022
Token make
Token ID787
Feature activation-1.47e-02

Pos logit contributions
uct+0.017
 John+0.015
 tears+0.015
 Tommy+0.015
case+0.014
Neg logit contributions
 mate-0.019
hetic-0.018
Ch-0.018
king-0.017
oise-0.017
Token her
Token ID607
Feature activation+1.50e-02

Pos logit contributions
 mate+0.027
hetic+0.024
Ch+0.024
 crew+0.024
 chick+0.024
Neg logit contributions
uct-0.020
colm-0.017
case-0.016
 John-0.016
 Tears-0.015
Token the
Token ID262
Feature activation+2.70e-02

Pos logit contributions
uct+0.017
colm+0.015
case+0.015
 John+0.013
 Tears+0.013
Neg logit contributions
 mate-0.023
 longer-0.021
hetic-0.021
use-0.020
 chick-0.020
Token cake
Token ID12187
Feature activation+2.27e-02

Pos logit contributions
uct+0.038
colm+0.030
 Tears+0.029
 Tommy+0.028
 John+0.028
Neg logit contributions
 mate-0.051
 chick-0.045
use-0.044
be-0.044
hetic-0.044
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
uct+0.040
colm+0.036
 Tears+0.034
 tears+0.034
 John+0.032
Neg logit contributions
 mate-0.032
 longer-0.030
 someday-0.028
ppy-0.027
 nood-0.027
Token She
Token ID1375
Feature activation+1.57e-02

Pos logit contributions
uct+0.018
colm+0.016
 done+0.015
lf+0.015
 tears+0.014
Neg logit contributions
Ch-0.012
Cow-0.011
use-0.011
Some-0.010
Gr-0.010
Token thought
Token ID1807
Feature activation+2.03e-02

Pos logit contributions
uct+0.026
 John+0.024
case+0.023
 Tears+0.023
 Jerry+0.023
Neg logit contributions
 mate-0.022
ppy-0.020
Ch-0.020
use-0.020
hetic-0.020
Token her
Token ID607
Feature activation+7.23e-02

Pos logit contributions
uct+0.032
colm+0.029
case+0.024
 Tears+0.024
icycle+0.024
Neg logit contributions
 mate-0.032
 longer-0.030
hetic-0.029
 nood-0.029
ppy-0.028
Token mom
Token ID1995
Feature activation+2.04e-02

Pos logit contributions
uct+0.136
 Timothy+0.107
 Lib+0.105
colm+0.103
 Jerry+0.103
Neg logit contributions
 mate-0.105
 chick-0.093
 crew-0.092
 nood-0.091
 sheep-0.088
Token would
Token ID561
Feature activation+1.56e-02

Pos logit contributions
uct+0.034
 ey+0.029
colm+0.029
 tears+0.028
 Tears+0.028
Neg logit contributions
 mate-0.028
ppy-0.027
use-0.026
Ch-0.026
be-0.025
Token not
Token ID407
Feature activation+2.31e-02

Pos logit contributions
uct+0.024
 John+0.021
 tears+0.021
colm+0.021
 Tommy+0.020
Neg logit contributions
 mate-0.026
hetic-0.024
Ch-0.024
use-0.022
king-0.022
Token notice
Token ID4003
Feature activation+7.92e-03

Pos logit contributions
uct+0.044
colm+0.039
 John+0.039
case+0.038
 Tears+0.038
Neg logit contributions
 mate-0.032
use-0.026
hetic-0.026
be-0.025
ppy-0.025
Token.
Token ID13
Feature activation+5.95e-03

Pos logit contributions
uct+0.013
colm+0.012
case+0.012
 Jerry+0.011
 Tears+0.011
Neg logit contributions
 mate-0.011
hetic-0.011
Ch-0.010
 longer-0.010
use-0.010
Tokenams
Token ID4105
Feature activation+3.97e-02

Pos logit contributions
 mate+0.002
be+0.002
use+0.002
hetic+0.002
oise+0.002
Neg logit contributions
uct-0.002
 tears-0.002
 John-0.002
colm-0.002
 Jerry-0.002
Token?
Token ID30
Feature activation-4.14e-03

Pos logit contributions
uct+0.069
colm+0.062
 tears+0.062
 Tears+0.060
 John+0.057
Neg logit contributions
 mate-0.059
use-0.053
be-0.051
ppy-0.049
cient-0.048
Token Where
Token ID6350
Feature activation+1.27e-02

Pos logit contributions
 mate+0.008
hetic+0.007
Ch+0.007
use+0.007
king+0.007
Neg logit contributions
uct-0.006
 John-0.005
 Tommy-0.005
colm-0.005
case-0.005
Token are
Token ID389
Feature activation+4.10e-02

Pos logit contributions
uct+0.021
colm+0.018
 tears+0.017
 Tears+0.017
 John+0.016
Neg logit contributions
 mate-0.022
hetic-0.018
Ch-0.018
use-0.018
ppy-0.018
Token your
Token ID534
Feature activation+3.57e-02

Pos logit contributions
uct+0.071
colm+0.062
 tears+0.060
case+0.059
 Tears+0.059
Neg logit contributions
 mate-0.064
hetic-0.053
use-0.052
Ch-0.052
be-0.051
Token toys
Token ID14958
Feature activation+7.18e-02

Pos logit contributions
uct+0.057
colm+0.046
case+0.046
 Tears+0.046
 Jerry+0.045
Neg logit contributions
 mate-0.063
 crew-0.053
use-0.053
be-0.052
 peac-0.052
Token?
Token ID30
Feature activation-5.40e-03

Pos logit contributions
uct+0.127
colm+0.113
 Tears+0.108
 tears+0.108
 John+0.103
Neg logit contributions
 mate-0.110
use-0.097
be-0.092
 spy-0.088
 longer-0.088
Token Did
Token ID7731
Feature activation-3.91e-03

Pos logit contributions
 mate+0.010
hetic+0.009
Ch+0.009
use+0.009
king+0.008
Neg logit contributions
uct-0.008
 John-0.006
colm-0.006
 Tommy-0.006
 tears-0.006
Token you
Token ID345
Feature activation+1.54e-02

Pos logit contributions
 mate+0.006
Ch+0.005
hetic+0.005
use+0.005
Cow+0.005
Neg logit contributions
uct-0.007
colm-0.006
 tears-0.006
 Tears-0.006
 John-0.006
Token nap
Token ID25422
Feature activation+2.10e-02

Pos logit contributions
uct+0.023
 John+0.021
 Tommy+0.021
case+0.020
 Jerry+0.020
Neg logit contributions
 mate-0.024
Ch-0.024
hetic-0.024
king-0.023
Gr-0.022
Token?"
Token ID1701
Feature activation+4.26e-03

Pos logit contributions
uct+0.031
colm+0.026
 John+0.026
 tears+0.025
 Tears+0.025
Neg logit contributions
 mate-0.038
hetic-0.033
Ch-0.033
use-0.033
ppy-0.032
Token better
Token ID1365
Feature activation+1.19e-02

Pos logit contributions
 mate+0.053
Ch+0.047
 crew+0.045
 engineers+0.045
oise+0.045
Neg logit contributions
uct-0.032
 tears-0.031
 Tommy-0.027
 Tears-0.026
 John-0.025
Token.
Token ID13
Feature activation-1.32e-03

Pos logit contributions
uct+0.017
 John+0.015
colm+0.015
 tears+0.015
 Tommy+0.014
Neg logit contributions
 mate-0.021
hetic-0.019
Ch-0.019
use-0.019
Cow-0.018
Token She
Token ID1375
Feature activation+4.58e-03

Pos logit contributions
 mate+0.002
Gr+0.002
Ch+0.002
hetic+0.002
Cow+0.002
Neg logit contributions
uct-0.002
colm-0.002
 tears-0.001
case-0.001
 Tommy-0.001
Token remembered
Token ID12086
Feature activation+1.16e-02

Pos logit contributions
uct+0.007
 John+0.006
 Jerry+0.006
colm+0.006
 Lib+0.006
Neg logit contributions
 mate-0.007
hetic-0.006
Ch-0.006
be-0.006
ppy-0.006
Token what
Token ID644
Feature activation+8.03e-03

Pos logit contributions
uct+0.017
colm+0.015
case+0.015
 Tears+0.014
 tears+0.013
Neg logit contributions
 mate-0.020
hetic-0.018
use-0.017
 peac-0.017
Ch-0.017
Token her
Token ID607
Feature activation+7.18e-02

Pos logit contributions
uct+0.012
colm+0.011
case+0.010
 tears+0.010
 Tears+0.009
Neg logit contributions
 mate-0.013
 chick-0.012
 peac-0.012
Gr-0.012
Ch-0.012
Token mom
Token ID1995
Feature activation+2.47e-02

Pos logit contributions
uct+0.134
colm+0.120
 Lib+0.112
 John+0.107
 unlocks+0.106
Neg logit contributions
 mate-0.094
 spy-0.083
 longer-0.081
 sheep-0.079
 leader-0.077
Token said
Token ID531
Feature activation-2.72e-03

Pos logit contributions
uct+0.036
colm+0.031
 Tears+0.029
 tears+0.029
 John+0.029
Neg logit contributions
 mate-0.041
use-0.038
hetic-0.038
be-0.038
Ch-0.037
Token and
Token ID290
Feature activation-1.21e-03

Pos logit contributions
 mate+0.006
hetic+0.005
Ch+0.005
use+0.005
Cow+0.005
Neg logit contributions
uct-0.003
 John-0.003
 tears-0.003
colm-0.003
 Tommy-0.003
Token knew
Token ID2993
Feature activation+1.21e-02

Pos logit contributions
 mate+0.002
hetic+0.002
 crew+0.002
 chick+0.002
 peac+0.002
Neg logit contributions
uct-0.002
colm-0.002
case-0.002
 Tears-0.002
 John-0.002
Token it
Token ID340
Feature activation+9.63e-03

Pos logit contributions
uct+0.018
colm+0.016
case+0.015
 tears+0.015
 John+0.014
Neg logit contributions
 mate-0.021
hetic-0.018
Ch-0.018
 chick-0.018
 peac-0.018
Token Lily
Token ID20037
Feature activation+2.31e-02

Pos logit contributions
 mate+0.022
Ch+0.021
hetic+0.020
use+0.020
Gr+0.020
Neg logit contributions
uct-0.015
colm-0.014
 John-0.012
case-0.012
 tears-0.012
Token didn
Token ID1422
Feature activation+2.38e-02

Pos logit contributions
uct+0.030
colm+0.024
 John+0.024
 tears+0.023
 Tommy+0.023
Neg logit contributions
 mate-0.045
hetic-0.041
Ch-0.040
use-0.040
be-0.039
Token't
Token ID470
Feature activation+9.83e-03

Pos logit contributions
uct+0.049
colm+0.045
 tears+0.042
 John+0.042
 Tears+0.042
Neg logit contributions
 mate-0.024
Ch-0.024
hetic-0.022
Cow-0.021
be-0.021
Token understand
Token ID1833
Feature activation+3.18e-03

Pos logit contributions
uct+0.017
colm+0.015
 John+0.015
 tears+0.015
 Tommy+0.014
Neg logit contributions
 mate-0.015
hetic-0.013
Ch-0.013
use-0.012
ppy-0.012
Token what
Token ID644
Feature activation+8.75e-03

Pos logit contributions
uct+0.006
colm+0.005
case+0.005
 Tears+0.005
 John+0.005
Neg logit contributions
 mate-0.004
hetic-0.004
use-0.004
ppy-0.004
Ch-0.004
Token her
Token ID607
Feature activation+7.15e-02

Pos logit contributions
uct+0.012
colm+0.010
 John+0.009
 tears+0.009
case+0.009
Neg logit contributions
 mate-0.017
hetic-0.015
Ch-0.015
use-0.015
Cow-0.015
Token mom
Token ID1995
Feature activation+3.55e-02

Pos logit contributions
uct+0.141
colm+0.116
 Jerry+0.113
 Tears+0.111
 John+0.110
Neg logit contributions
 mate-0.100
 spy-0.085
 crew-0.081
 leader-0.081
 sheep-0.079
Token meant
Token ID4001
Feature activation+2.61e-02

Pos logit contributions
uct+0.050
colm+0.043
 Tears+0.043
 tears+0.043
 ey+0.042
Neg logit contributions
 mate-0.056
ppy-0.054
use-0.053
Ch-0.052
be-0.052
Token by
Token ID416
Feature activation+1.80e-02

Pos logit contributions
uct+0.040
colm+0.034
 tears+0.033
 John+0.032
 Tears+0.032
Neg logit contributions
 mate-0.046
hetic-0.040
use-0.039
Ch-0.039
be-0.038
Token "
Token ID366
Feature activation-4.16e-03

Pos logit contributions
uct+0.025
colm+0.021
 John+0.021
 tears+0.020
 Tommy+0.020
Neg logit contributions
 mate-0.033
hetic-0.030
Ch-0.030
use-0.029
Cow-0.028
Tokenspell
Token ID46143
Feature activation-2.12e-02

Pos logit contributions
 mate+0.006
hetic+0.006
Ch+0.005
use+0.005
 crew+0.005
Neg logit contributions
uct-0.007
colm-0.006
 John-0.006
 tears-0.006
 Tommy-0.006
Token idea
Token ID2126
Feature activation+2.84e-02

Pos logit contributions
uct+0.037
colm+0.030
 John+0.029
 Tears+0.029
 Tommy+0.028
Neg logit contributions
 mate-0.053
hetic-0.047
use-0.046
 crew-0.045
 peac-0.045
Token,
Token ID11
Feature activation+4.72e-03

Pos logit contributions
uct+0.049
colm+0.043
 Tears+0.040
 tears+0.040
case+0.039
Neg logit contributions
 mate-0.045
use-0.039
hetic-0.038
be-0.037
Ch-0.036
Token sweet
Token ID6029
Feature activation+4.50e-04

Pos logit contributions
uct+0.007
colm+0.007
case+0.006
 Tears+0.006
 tears+0.006
Neg logit contributions
 mate-0.008
 longer-0.007
hetic-0.007
 chick-0.007
Ch-0.007
Tokenie
Token ID494
Feature activation-8.73e-03

Pos logit contributions
uct+0.001
colm+0.001
 John+0.001
 tears+0.001
 Tears+0.001
Neg logit contributions
 mate-0.000
hetic-0.000
Ch-0.000
use-0.000
 chick-0.000
Token,"
Token ID553
Feature activation-4.00e-04

Pos logit contributions
 mate+0.017
Ch+0.015
hetic+0.015
use+0.014
Cow+0.014
Neg logit contributions
uct-0.011
 John-0.010
 Tommy-0.009
colm-0.009
case-0.009
Token her
Token ID607
Feature activation+7.15e-02

Pos logit contributions
 mate+0.001
Ch+0.001
 longer+0.001
Gr+0.001
Cow+0.001
Neg logit contributions
uct-0.001
colm-0.001
case-0.001
 Tears-0.000
icycle-0.000
Token Mom
Token ID11254
Feature activation+2.32e-02

Pos logit contributions
uct+0.123
colm+0.118
keeping+0.108
 Jerry+0.106
 Heidi+0.102
Neg logit contributions
 mate-0.091
 longer-0.088
 spy-0.076
be-0.075
Ch-0.075
Tokenmy
Token ID1820
Feature activation+6.23e-03

Pos logit contributions
uct+0.034
colm+0.030
 tears+0.029
 John+0.029
case+0.028
Neg logit contributions
 mate-0.040
Ch-0.036
hetic-0.036
use-0.035
be-0.035
Token said
Token ID531
Feature activation-1.27e-02

Pos logit contributions
uct+0.007
colm+0.005
 John+0.005
 tears+0.005
 Tommy+0.005
Neg logit contributions
 mate-0.014
hetic-0.012
Ch-0.012
use-0.012
be-0.012
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
 mate+0.025
use+0.022
Ch+0.022
hetic+0.022
Cow+0.022
Neg logit contributions
uct-0.017
 John-0.014
 tears-0.014
 Tommy-0.014
 Tears-0.013
Token "
Token ID366
Feature activation-5.91e-03

Pos logit contributions
 mate+0.017
Ch+0.016
hetic+0.015
Gr+0.015
Cow+0.015
Neg logit contributions
uct-0.017
colm-0.015
 tears-0.014
 John-0.014
case-0.014
Token bed
Token ID3996
Feature activation-6.30e-03

Pos logit contributions
uct+0.002
colm+0.001
case+0.001
 Tears+0.001
 John+0.001
Neg logit contributions
 mate-0.002
hetic-0.002
Ch-0.002
 crew-0.002
use-0.002
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
 mate+0.011
hetic+0.010
Ch+0.010
use+0.010
king+0.009
Neg logit contributions
uct-0.009
colm-0.008
 John-0.008
 tears-0.008
 Tommy-0.007
Token he
Token ID339
Feature activation-7.72e-03

Pos logit contributions
uct+0.017
colm+0.014
case+0.013
 ey+0.012
 tears+0.012
Neg logit contributions
 mate-0.021
 longer-0.019
Ch-0.018
 nurses-0.018
 crew-0.018
Token remembered
Token ID12086
Feature activation+3.17e-03

Pos logit contributions
 mate+0.013
hetic+0.011
Ch+0.011
be+0.011
ppy+0.011
Neg logit contributions
uct-0.012
colm-0.010
 John-0.010
 Jerry-0.010
 Tears-0.010
Token what
Token ID644
Feature activation+2.62e-03

Pos logit contributions
uct+0.005
colm+0.004
case+0.004
 Tears+0.004
 tears+0.003
Neg logit contributions
 mate-0.006
hetic-0.005
 crew-0.005
use-0.005
 spy-0.005
Token his
Token ID465
Feature activation+7.12e-02

Pos logit contributions
uct+0.004
colm+0.003
case+0.003
 ey+0.002
 Tears+0.002
Neg logit contributions
 mate-0.005
 spy-0.005
 crew-0.005
Ch-0.005
 nurses-0.004
Token dad
Token ID9955
Feature activation+5.83e-03

Pos logit contributions
uct+0.131
case+0.108
colm+0.107
 Tears+0.106
 Jerry+0.105
Neg logit contributions
 mate-0.103
 spy-0.093
 crew-0.089
 longer-0.088
 leader-0.085
Token said
Token ID531
Feature activation-2.68e-03

Pos logit contributions
uct+0.008
 Tears+0.007
colm+0.007
 tears+0.007
 ey+0.006
Neg logit contributions
 mate-0.010
use-0.009
hetic-0.009
be-0.009
Ch-0.009
Token.
Token ID13
Feature activation-6.03e-03

Pos logit contributions
 mate+0.005
hetic+0.005
Ch+0.005
use+0.005
Cow+0.004
Neg logit contributions
uct-0.004
 John-0.003
colm-0.003
 tears-0.003
 Tommy-0.003
Token He
Token ID679
Feature activation+4.17e-04

Pos logit contributions
 mate+0.010
Ch+0.009
Cow+0.009
cient+0.009
hetic+0.009
Neg logit contributions
uct-0.008
colm-0.007
 tears-0.006
 ey-0.006
ries-0.006
Token realized
Token ID6939
Feature activation+2.64e-02

Pos logit contributions
uct+0.001
 Jerry+0.001
 Tears+0.001
 John+0.001
colm+0.001
Neg logit contributions
 mate-0.001
Ch-0.001
be-0.001
hetic-0.001
ppy-0.001
Token was
Token ID373
Feature activation+1.37e-02

Pos logit contributions
uct+0.019
 John+0.016
 Jerry+0.016
colm+0.016
case+0.016
Neg logit contributions
 mate-0.021
Ch-0.019
hetic-0.018
ppy-0.018
be-0.018
Token worried
Token ID7960
Feature activation+4.01e-03

Pos logit contributions
uct+0.022
colm+0.020
 John+0.019
case+0.019
 Tommy+0.018
Neg logit contributions
 mate-0.022
hetic-0.019
Ch-0.019
king-0.018
be-0.018
Token.
Token ID13
Feature activation+5.49e-03

Pos logit contributions
uct+0.006
colm+0.005
 John+0.005
 tears+0.005
case+0.005
Neg logit contributions
 mate-0.007
hetic-0.006
Ch-0.006
be-0.006
use-0.006
Token She
Token ID1375
Feature activation+9.20e-03

Pos logit contributions
uct+0.008
colm+0.007
 John+0.006
 tears+0.006
case+0.006
Neg logit contributions
Ch-0.009
 mate-0.009
Gr-0.008
Cow-0.008
hetic-0.008
Token thought
Token ID1807
Feature activation+1.46e-02

Pos logit contributions
uct+0.014
 John+0.012
colm+0.011
 Tears+0.011
case+0.011
Neg logit contributions
 mate-0.016
Ch-0.014
hetic-0.014
use-0.014
ppy-0.014
Token her
Token ID607
Feature activation+7.11e-02

Pos logit contributions
uct+0.021
colm+0.018
 John+0.015
 Jerry+0.015
case+0.015
Neg logit contributions
 mate-0.026
Ch-0.024
hetic-0.024
 longer-0.023
ppy-0.023
Token trumpet
Token ID40334
Feature activation+9.65e-03

Pos logit contributions
uct+0.125
 Lib+0.103
 Jerry+0.103
 Heidi+0.099
 John+0.099
Neg logit contributions
 mate-0.110
Ch-0.102
 spy-0.099
 crew-0.097
be-0.097
Token was
Token ID373
Feature activation+2.05e-04

Pos logit contributions
uct+0.013
colm+0.011
 John+0.011
 tears+0.010
 Tommy+0.010
Neg logit contributions
 mate-0.018
Ch-0.016
hetic-0.016
be-0.016
use-0.016
Token broken
Token ID5445
Feature activation+2.31e-05

Pos logit contributions
uct+0.000
colm+0.000
case+0.000
 John+0.000
 Lib+0.000
Neg logit contributions
 mate-0.000
 sheep-0.000
 chick-0.000
 longer-0.000
ppy-0.000
Token.
Token ID13
Feature activation+6.08e-03

Pos logit contributions
uct+0.000
colm+0.000
 John+0.000
 tears+0.000
 Tears+0.000
Neg logit contributions
 mate-0.000
hetic-0.000
Ch-0.000
 longer-0.000
king-0.000
Token She
Token ID1375
Feature activation+8.53e-03

Pos logit contributions
uct+0.009
colm+0.008
 John+0.007
 tears+0.007
case+0.007
Neg logit contributions
Ch-0.010
 mate-0.010
Gr-0.010
Cow-0.009
hetic-0.009
Token very
Token ID845
Feature activation+7.71e-03

Pos logit contributions
uct+0.031
colm+0.029
case+0.026
 John+0.025
 bicycles+0.025
Neg logit contributions
 mate-0.028
be-0.024
 longer-0.023
king-0.022
 spy-0.022
Token happy
Token ID3772
Feature activation+3.16e-03

Pos logit contributions
uct+0.013
colm+0.012
 John+0.012
 Tears+0.011
 Jerry+0.011
Neg logit contributions
 mate-0.011
be-0.010
hetic-0.010
 leader-0.009
 spy-0.009
Token.
Token ID13
Feature activation+9.41e-03

Pos logit contributions
uct+0.006
colm+0.005
case+0.005
 Tears+0.005
 tears+0.005
Neg logit contributions
 mate-0.004
 longer-0.004
 someday-0.004
 spy-0.004
 nood-0.004
Token She
Token ID1375
Feature activation+7.65e-03

Pos logit contributions
uct+0.015
colm+0.013
 Timothy+0.013
 done+0.013
 Jerry+0.012
Neg logit contributions
 mate-0.014
Ch-0.013
 clinic-0.012
 nood-0.012
Cow-0.012
Token thinks
Token ID6834
Feature activation+1.02e-02

Pos logit contributions
uct+0.014
colm+0.012
 Timothy+0.012
 Tears+0.012
 John+0.012
Neg logit contributions
 mate-0.010
use-0.009
 spy-0.009
hetic-0.009
 lear-0.008
Token her
Token ID607
Feature activation+7.11e-02

Pos logit contributions
uct+0.017
colm+0.016
icycle+0.013
case+0.013
 Tears+0.013
Neg logit contributions
 mate-0.016
 longer-0.014
hetic-0.014
 nood-0.014
 sheep-0.013
Token mom
Token ID1995
Feature activation+1.27e-02

Pos logit contributions
uct+0.135
 Timothy+0.107
 Jerry+0.107
 John+0.105
 Lib+0.105
Neg logit contributions
 mate-0.097
 nood-0.095
 spy-0.092
 sheep-0.091
 chick-0.089
Token and
Token ID290
Feature activation+5.38e-03

Pos logit contributions
uct+0.022
 Tears+0.020
 tears+0.019
colm+0.018
 ey+0.018
Neg logit contributions
use-0.018
ppy-0.016
 mate-0.016
oise-0.015
be-0.015
Token dad
Token ID9955
Feature activation-7.00e-03

Pos logit contributions
uct+0.009
case+0.009
colm+0.008
 Tears+0.008
 Timothy+0.008
Neg logit contributions
 mate-0.008
 spy-0.007
 nurses-0.007
 sheep-0.007
 permission-0.007
Token bought
Token ID5839
Feature activation+1.40e-02

Pos logit contributions
 mate+0.011
use+0.009
hetic+0.009
be+0.009
ppy+0.008
Neg logit contributions
uct-0.012
colm-0.011
 Tears-0.010
 tears-0.010
 John-0.010
Token it
Token ID340
Feature activation+3.34e-04

Pos logit contributions
uct+0.020
colm+0.017
case+0.017
 Tears+0.016
 tears+0.015
Neg logit contributions
 mate-0.024
use-0.022
hetic-0.021
Ch-0.021
 chick-0.021
Token owner
Token ID4870
Feature activation-1.47e-02

Pos logit contributions
uct+0.000
 John+0.000
 Lib+0.000
colm+0.000
 Tears+0.000
Neg logit contributions
 mate-0.000
 crew-0.000
 spy-0.000
 leader-0.000
 chick-0.000
Token took
Token ID1718
Feature activation-2.35e-02

Pos logit contributions
 mate+0.018
use+0.016
hetic+0.015
be+0.015
ppy+0.014
Neg logit contributions
uct-0.030
colm-0.026
 Tears-0.026
case-0.025
 tears-0.025
Token him
Token ID683
Feature activation-3.53e-02

Pos logit contributions
 mate+0.041
Ch+0.037
hetic+0.036
use+0.036
Cow+0.035
Neg logit contributions
uct-0.036
colm-0.030
case-0.029
 Tears-0.028
 tears-0.027
Token to
Token ID284
Feature activation-2.55e-02

Pos logit contributions
 mate+0.052
hetic+0.044
use+0.043
Ch+0.043
ppy+0.041
Neg logit contributions
uct-0.065
colm-0.056
 John-0.054
 tears-0.054
 Tears-0.053
Token the
Token ID262
Feature activation-2.67e-02

Pos logit contributions
 mate+0.054
hetic+0.049
Ch+0.048
use+0.048
Cow+0.047
Neg logit contributions
uct-0.029
colm-0.023
 John-0.023
 tears-0.022
case-0.022
Token vet
Token ID20202
Feature activation-6.54e-02

Pos logit contributions
 mate+0.044
hetic+0.038
Ch+0.038
use+0.037
Cow+0.037
Neg logit contributions
uct-0.043
 John-0.037
colm-0.036
 tears-0.035
 Tears-0.035
Token because
Token ID780
Feature activation-5.32e-03

Pos logit contributions
 mate+0.149
Ro+0.145
hetic+0.144
Qu+0.144
 conclusions+0.142
Neg logit contributions
 John-0.052
 Tommy-0.051
uct-0.050
 Tom-0.046
 Jerry-0.045
Token Max
Token ID5436
Feature activation+6.28e-05

Pos logit contributions
 mate+0.010
Ch+0.009
hetic+0.009
 engineers+0.009
 nurses+0.009
Neg logit contributions
uct-0.007
colm-0.006
 Tears-0.005
 tears-0.005
case-0.005
Token wasn
Token ID2492
Feature activation-3.57e-03

Pos logit contributions
uct+0.000
 John+0.000
colm+0.000
 Tears+0.000
 tears+0.000
Neg logit contributions
 mate-0.000
hetic-0.000
Ch-0.000
use-0.000
ppy-0.000
Token't
Token ID470
Feature activation-1.31e-03

Pos logit contributions
 mate+0.006
hetic+0.005
Ch+0.005
use+0.005
king+0.005
Neg logit contributions
uct-0.006
 John-0.005
colm-0.005
 tears-0.005
 Tommy-0.005
Token feeling
Token ID4203
Feature activation-2.65e-02

Pos logit contributions
 mate+0.002
hetic+0.002
 chick+0.002
be+0.002
king+0.002
Neg logit contributions
uct-0.002
colm-0.002
case-0.002
 Tears-0.002
 tears-0.002
Token was
Token ID373
Feature activation-9.80e-03

Pos logit contributions
uct+0.003
colm+0.003
 ey+0.003
 tears+0.003
 Jerry+0.003
Neg logit contributions
 mate-0.002
Ch-0.002
be-0.002
Qu-0.002
 nood-0.002
Token a
Token ID257
Feature activation+4.50e-03

Pos logit contributions
 mate+0.016
Ch+0.014
 chick+0.014
Gr+0.013
 longer+0.013
Neg logit contributions
uct-0.016
colm-0.014
 tears-0.013
 Tears-0.013
case-0.013
Token clever
Token ID14169
Feature activation+2.16e-03

Pos logit contributions
uct+0.004
 Tommy+0.004
 John+0.004
colm+0.004
 tears+0.004
Neg logit contributions
 mate-0.009
use-0.009
hetic-0.009
Ch-0.008
ppy-0.008
Token fox
Token ID21831
Feature activation-3.52e-02

Pos logit contributions
uct+0.003
 John+0.003
colm+0.003
 Tommy+0.003
 tears+0.003
Neg logit contributions
 mate-0.003
hetic-0.003
Ch-0.003
use-0.003
king-0.003
Token named
Token ID3706
Feature activation-4.40e-02

Pos logit contributions
use+0.059
 mate+0.056
hetic+0.055
 conclusions+0.055
 engineers+0.053
Neg logit contributions
 Jerry-0.043
 John-0.042
 Tom-0.042
 Tommy-0.041
 Sue-0.041
Token Fox
Token ID5426
Feature activation-6.46e-02

Pos logit contributions
 mate+0.088
Ch+0.079
hetic+0.077
ppy+0.075
Cow+0.075
Neg logit contributions
uct-0.057
 tears-0.047
colm-0.046
case-0.045
 Tears-0.044
Tokeny
Token ID88
Feature activation-2.40e-02

Pos logit contributions
hetic+0.106
Ch+0.106
 mate+0.103
use+0.102
 conclusions+0.102
Neg logit contributions
 Tommy-0.074
 John-0.071
colm-0.069
 Tom-0.066
uct-0.066
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
use+0.054
 mate+0.051
 conclusions+0.051
hetic+0.050
urnal+0.049
Neg logit contributions
 Tommy-0.017
 John-0.017
 Jerry-0.016
colm-0.015
uct-0.015
Token Fox
Token ID5426
Feature activation-3.48e-02

Pos logit contributions
 mate+0.039
hetic+0.036
Ch+0.035
use+0.034
be+0.034
Neg logit contributions
uct-0.025
colm-0.023
 John-0.023
 Jerry-0.022
 Tommy-0.022
Tokeny
Token ID88
Feature activation+5.70e-03

Pos logit contributions
 mate+0.031
hetic+0.025
Ch+0.024
use+0.023
Cow+0.022
Neg logit contributions
uct-0.082
 John-0.074
colm-0.073
 tears-0.073
 Tommy-0.072
Token liked
Token ID8288
Feature activation-6.53e-03

Pos logit contributions
uct+0.008
 tears+0.007
 John+0.007
colm+0.007
 Tears+0.007
Neg logit contributions
 mate-0.010
Ch-0.009
hetic-0.009
use-0.009
 peac-0.009
Token was
Token ID373
Feature activation-1.34e-02

Pos logit contributions
 mate+0.053
hetic+0.049
use+0.048
Ch+0.047
king+0.047
Neg logit contributions
uct-0.024
colm-0.021
 John-0.021
 tears-0.020
 Tommy-0.020
Token a
Token ID257
Feature activation+1.41e-02

Pos logit contributions
 mate+0.026
hetic+0.025
use+0.025
Ch+0.024
 conclusions+0.023
Neg logit contributions
uct-0.016
 John-0.014
 Tommy-0.014
 tears-0.014
 Jerry-0.013
Token small
Token ID1402
Feature activation-1.65e-02

Pos logit contributions
 John+0.013
 Tommy+0.012
 Jerry+0.012
 tears+0.012
colm+0.012
Neg logit contributions
 mate-0.028
use-0.027
hetic-0.027
king-0.027
Ch-0.026
Token bird
Token ID6512
Feature activation-2.32e-02

Pos logit contributions
Ch+0.028
 mate+0.027
hetic+0.027
Gr+0.026
use+0.025
Neg logit contributions
 tears-0.021
 John-0.021
 Tommy-0.020
 Tears-0.019
colm-0.019
Token called
Token ID1444
Feature activation-3.90e-02

Pos logit contributions
 mate+0.049
use+0.049
hetic+0.047
 engineers+0.046
Ch+0.045
Neg logit contributions
uct-0.020
case-0.019
 John-0.018
 Jerry-0.018
 Tears-0.017
Token Small
Token ID10452
Feature activation-6.39e-02

Pos logit contributions
 mate+0.068
hetic+0.062
use+0.062
Ch+0.061
be+0.060
Neg logit contributions
uct-0.054
 John-0.050
 Tommy-0.049
 Jerry-0.047
colm-0.047
Token Bird
Token ID14506
Feature activation-4.96e-02

Pos logit contributions
 mate+0.114
use+0.112
 conclusions+0.111
hetic+0.108
 peac+0.108
Neg logit contributions
 John-0.078
 Tommy-0.072
uct-0.068
colm-0.066
 Tom-0.065
Token.
Token ID13
Feature activation-7.09e-03

Pos logit contributions
use+0.107
Ro+0.106
 mate+0.103
Ch+0.102
 conclusions+0.102
Neg logit contributions
 John-0.038
 Tommy-0.033
 Bob-0.031
 about-0.030
 Tears-0.029
Token
Token ID220
Feature activation-2.66e-02

Pos logit contributions
 mate+0.015
hetic+0.014
use+0.014
oise+0.013
king+0.013
Neg logit contributions
uct-0.008
 John-0.007
 Tommy-0.006
 Tears-0.006
colm-0.006
Token
Token ID198
Feature activation-5.03e-03

Pos logit contributions
 mate+0.050
Ch+0.046
Gr+0.042
Cow+0.042
be+0.042
Neg logit contributions
uct-0.037
colm-0.031
 tears-0.031
 John-0.027
 wrapper-0.027
Token
Token ID198
Feature activation-1.76e-02

Pos logit contributions
 mate+0.013
hetic+0.012
use+0.012
king+0.012
ppy+0.012
Neg logit contributions
uct-0.003
 John-0.003
 Tommy-0.002
colm-0.002
case-0.002
Token was
Token ID373
Feature activation-1.06e-02

Pos logit contributions
uct+0.001
colm+0.001
 tears+0.001
 Tears+0.001
 Jerry+0.001
Neg logit contributions
 mate-0.001
Ch-0.001
be-0.000
Qu-0.000
 nood-0.000
Token a
Token ID257
Feature activation+6.45e-03

Pos logit contributions
 mate+0.017
Ch+0.015
 chick+0.015
Gr+0.014
 longer+0.014
Neg logit contributions
uct-0.018
colm-0.015
 tears-0.014
 Tears-0.014
case-0.014
Token little
Token ID1310
Feature activation-6.56e-03

Pos logit contributions
uct+0.006
 Tommy+0.006
colm+0.005
 John+0.005
 tears+0.005
Neg logit contributions
 mate-0.014
use-0.013
hetic-0.013
Ch-0.012
oise-0.012
Token fox
Token ID21831
Feature activation-3.93e-02

Pos logit contributions
 mate+0.012
hetic+0.012
use+0.011
Ch+0.011
king+0.011
Neg logit contributions
 Tommy-0.007
 John-0.007
 Jerry-0.007
colm-0.006
 tears-0.006
Token named
Token ID3706
Feature activation-4.24e-02

Pos logit contributions
use+0.072
 mate+0.068
hetic+0.066
 engineers+0.064
 conclusions+0.064
Neg logit contributions
 Tom-0.042
 Jerry-0.042
 John-0.042
 about-0.041
 Tommy-0.041
Token Fox
Token ID5426
Feature activation-6.32e-02

Pos logit contributions
 mate+0.080
Ch+0.072
hetic+0.070
Cow+0.069
use+0.069
Neg logit contributions
uct-0.060
 tears-0.049
colm-0.049
case-0.047
 Tears-0.047
Tokeny
Token ID88
Feature activation-2.12e-02

Pos logit contributions
hetic+0.106
Ch+0.104
 mate+0.104
use+0.103
 engineers+0.102
Neg logit contributions
 Tommy-0.071
 John-0.067
colm-0.067
uct-0.064
 Tom-0.063
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
use+0.048
 mate+0.047
hetic+0.045
 conclusions+0.044
urnal+0.044
Neg logit contributions
 Tommy-0.014
 John-0.014
colm-0.014
uct-0.014
 tears-0.013
Token Fox
Token ID5426
Feature activation-3.30e-02

Pos logit contributions
 mate+0.033
hetic+0.030
Ch+0.029
use+0.029
be+0.028
Neg logit contributions
uct-0.031
colm-0.028
 John-0.028
 Tommy-0.027
 Jerry-0.026
Tokeny
Token ID88
Feature activation+7.47e-03

Pos logit contributions
 mate+0.030
hetic+0.024
Ch+0.023
use+0.022
Cow+0.021
Neg logit contributions
uct-0.078
 John-0.070
colm-0.070
 tears-0.069
 Tommy-0.068
Token was
Token ID373
Feature activation-5.73e-03

Pos logit contributions
uct+0.012
 tears+0.010
 John+0.010
colm+0.010
 Tears+0.009
Neg logit contributions
 mate-0.013
Ch-0.011
hetic-0.011
ppy-0.011
 peac-0.011
Token large
Token ID1588
Feature activation+5.39e-03

Pos logit contributions
uct+0.007
 Tommy+0.006
 John+0.006
colm+0.006
 tears+0.006
Neg logit contributions
 mate-0.015
use-0.014
hetic-0.014
Ch-0.013
ppy-0.013
Token open
Token ID1280
Feature activation+1.17e-02

Pos logit contributions
uct+0.009
 John+0.008
colm+0.008
 tears+0.008
 Tommy+0.008
Neg logit contributions
 mate-0.008
hetic-0.008
Ch-0.007
use-0.007
king-0.007
Token space
Token ID2272
Feature activation-1.86e-02

Pos logit contributions
uct+0.019
colm+0.016
 John+0.016
 tears+0.016
 Tommy+0.016
Neg logit contributions
 mate-0.019
hetic-0.017
Ch-0.016
use-0.016
 peac-0.016
Token.
Token ID13
Feature activation-3.49e-03

Pos logit contributions
 mate+0.035
hetic+0.033
Ch+0.033
use+0.033
Cow+0.032
Neg logit contributions
uct-0.023
case-0.020
 John-0.020
colm-0.019
 tears-0.019
Token In
Token ID554
Feature activation+1.44e-02

Pos logit contributions
 mate+0.007
Ch+0.006
hetic+0.006
Cow+0.006
use+0.006
Neg logit contributions
uct-0.005
colm-0.004
 tears-0.004
 John-0.003
 Tommy-0.003
Token that
Token ID326
Feature activation-6.30e-02

Pos logit contributions
uct+0.022
colm+0.017
 Tears+0.016
 tears+0.016
 John+0.016
Neg logit contributions
 mate-0.027
Ch-0.023
 peac-0.022
hetic-0.022
 chick-0.022
Token space
Token ID2272
Feature activation-1.91e-02

Pos logit contributions
 mate+0.125
hetic+0.118
king+0.115
ppy+0.114
ulent+0.113
Neg logit contributions
uct-0.068
case-0.064
 John-0.063
colm-0.062
 Tommy-0.061
Token there
Token ID612
Feature activation+3.15e-03

Pos logit contributions
 mate+0.036
hetic+0.032
Ch+0.031
use+0.031
 crew+0.030
Neg logit contributions
uct-0.027
colm-0.022
 John-0.022
 tears-0.021
 Tommy-0.021
Token were
Token ID547
Feature activation-1.24e-02

Pos logit contributions
uct+0.005
colm+0.004
 tears+0.004
 John+0.004
 Tears+0.004
Neg logit contributions
 mate-0.005
Ch-0.005
hetic-0.005
be-0.005
use-0.004
Token two
Token ID734
Feature activation-1.60e-02

Pos logit contributions
 mate+0.022
Ch+0.019
hetic+0.019
 crew+0.018
 peac+0.018
Neg logit contributions
uct-0.020
colm-0.016
 John-0.016
 tears-0.016
 Tommy-0.016
Token friends
Token ID2460
Feature activation-5.02e-03

Pos logit contributions
 mate+0.030
hetic+0.027
Ch+0.026
use+0.026
 crew+0.025
Neg logit contributions
uct-0.023
colm-0.019
 John-0.019
 tears-0.018
 Tommy-0.018
Token owner
Token ID4870
Feature activation-1.54e-02

Pos logit contributions
 mate+0.003
Ch+0.002
hetic+0.002
Cow+0.002
king+0.002
Neg logit contributions
uct-0.003
 John-0.002
colm-0.002
 tears-0.002
 Tears-0.002
Token took
Token ID1718
Feature activation-2.23e-02

Pos logit contributions
 mate+0.023
use+0.019
ppy+0.019
Ch+0.019
hetic+0.018
Neg logit contributions
uct-0.029
colm-0.024
 Tears-0.024
 tears-0.024
 ey-0.023
Token him
Token ID683
Feature activation-3.79e-02

Pos logit contributions
 mate+0.036
Ch+0.033
hetic+0.032
ch+0.031
use+0.031
Neg logit contributions
uct-0.037
colm-0.030
case-0.030
 Tears-0.029
 tears-0.028
Token to
Token ID284
Feature activation-2.62e-02

Pos logit contributions
 mate+0.061
hetic+0.054
Ch+0.053
use+0.052
Cow+0.051
Neg logit contributions
uct-0.062
 John-0.053
colm-0.053
 tears-0.052
 Tommy-0.052
Token the
Token ID262
Feature activation-3.43e-02

Pos logit contributions
 mate+0.054
hetic+0.049
Ch+0.048
use+0.047
Cow+0.047
Neg logit contributions
uct-0.032
colm-0.025
 John-0.024
 tears-0.024
case-0.024
Token vet
Token ID20202
Feature activation-6.24e-02

Pos logit contributions
 mate+0.067
hetic+0.063
Ch+0.060
use+0.060
ppy+0.060
Neg logit contributions
uct-0.043
colm-0.036
 Tommy-0.036
 John-0.036
 tears-0.035
Token.
Token ID13
Feature activation-4.59e-03

Pos logit contributions
 conclusions+0.142
Ro+0.142
 mate+0.141
hetic+0.140
Qu+0.140
Neg logit contributions
 Tommy-0.048
 John-0.046
 Tom-0.043
 Jerry-0.042
uct-0.039
Token The
Token ID383
Feature activation-3.47e-02

Pos logit contributions
 mate+0.008
Ch+0.008
Cow+0.007
 engineers+0.007
use+0.007
Neg logit contributions
uct-0.007
colm-0.006
 tears-0.005
 ey-0.005
 Lib-0.005
Token vet
Token ID20202
Feature activation-3.30e-02

Pos logit contributions
 mate+0.047
ppy+0.044
hetic+0.044
oise+0.041
mer+0.040
Neg logit contributions
 tears-0.061
uct-0.060
colm-0.059
case-0.058
 Tommy-0.057
Token said
Token ID531
Feature activation-6.83e-03

Pos logit contributions
 mate+0.070
hetic+0.063
Ch+0.063
use+0.062
king+0.061
Neg logit contributions
uct-0.037
 John-0.031
colm-0.030
 tears-0.030
 Tommy-0.030
Token Max
Token ID5436
Feature activation-1.43e-02

Pos logit contributions
 mate+0.014
hetic+0.012
Ch+0.012
use+0.012
king+0.012
Neg logit contributions
uct-0.009
 John-0.007
 Tommy-0.007
colm-0.007
 tears-0.007
Token were
Token ID547
Feature activation-3.90e-03

Pos logit contributions
uct+0.002
colm+0.001
 tears+0.001
 Jerry+0.001
 ey+0.001
Neg logit contributions
Ch-0.001
 mate-0.001
Qu-0.001
be-0.001
 nood-0.001
Token two
Token ID734
Feature activation-1.20e-02

Pos logit contributions
 mate+0.007
be+0.006
Ch+0.006
Gr+0.006
 chick+0.006
Neg logit contributions
uct-0.006
colm-0.005
 tears-0.004
case-0.004
 Tears-0.004
Token good
Token ID922
Feature activation-6.69e-03

Pos logit contributions
 mate+0.025
hetic+0.023
Ch+0.022
use+0.022
Cow+0.022
Neg logit contributions
uct-0.014
 tears-0.012
 John-0.012
 Tommy-0.011
colm-0.011
Token friends
Token ID2460
Feature activation-1.07e-02

Pos logit contributions
 mate+0.014
hetic+0.013
use+0.013
Ch+0.013
king+0.012
Neg logit contributions
uct-0.007
 John-0.006
 tears-0.006
 Tommy-0.006
colm-0.006
Token called
Token ID1444
Feature activation-3.45e-02

Pos logit contributions
 mate+0.020
use+0.020
hetic+0.020
Ch+0.019
Ro+0.019
Neg logit contributions
 John-0.013
 Tommy-0.012
 Jerry-0.012
uct-0.012
 tears-0.011
Token Marty
Token ID29876
Feature activation-6.20e-02

Pos logit contributions
 mate+0.085
Ch+0.075
hetic+0.074
 crew+0.073
 peac+0.072
Neg logit contributions
uct-0.033
colm-0.022
 tears-0.022
case-0.020
 Tears-0.020
Token and
Token ID290
Feature activation-2.08e-02

Pos logit contributions
 mate+0.137
use+0.136
Ro+0.132
Ch+0.132
 conclusions+0.131
Neg logit contributions
uct-0.047
 John-0.046
 Tommy-0.042
 tears-0.041
 Jerry-0.039
Token Tim
Token ID5045
Feature activation-1.02e-02

Pos logit contributions
 mate+0.040
hetic+0.037
use+0.037
Ch+0.037
king+0.036
Neg logit contributions
uct-0.025
 John-0.023
 Tommy-0.022
colm-0.021
 Jerry-0.021
Tokenmy
Token ID1820
Feature activation-2.64e-02

Pos logit contributions
 mate+0.019
hetic+0.017
Ch+0.016
use+0.016
Cow+0.016
Neg logit contributions
uct-0.015
 John-0.012
colm-0.012
 tears-0.012
 Tommy-0.012
Token.
Token ID13
Feature activation-2.39e-02

Pos logit contributions
 mate+0.051
use+0.046
hetic+0.046
Ch+0.045
Cow+0.044
Neg logit contributions
uct-0.033
 John-0.030
 Tommy-0.029
 tears-0.029
colm-0.028
Token They
Token ID1119
Feature activation-1.88e-02

Pos logit contributions
 mate+0.045
hetic+0.040
Ch+0.039
use+0.039
Cow+0.038
Neg logit contributions
uct-0.034
colm-0.028
 John-0.027
 tears-0.027
 Tommy-0.027
Token was
Token ID373
Feature activation-1.20e-02

Pos logit contributions
uct+0.002
colm+0.002
 tears+0.002
 Jerry+0.002
 Tears+0.002
Neg logit contributions
 mate-0.002
Ch-0.002
be-0.002
Qu-0.002
 spy-0.002
Token a
Token ID257
Feature activation+9.27e-03

Pos logit contributions
 mate+0.019
Ch+0.017
 chick+0.016
Gr+0.016
 longer+0.016
Neg logit contributions
uct-0.021
colm-0.017
 tears-0.016
 Tears-0.016
case-0.016
Token clever
Token ID14169
Feature activation+2.88e-03

Pos logit contributions
uct+0.009
 Tommy+0.008
 John+0.008
colm+0.008
 tears+0.008
Neg logit contributions
 mate-0.019
use-0.018
hetic-0.018
Ch-0.017
king-0.017
Token fox
Token ID21831
Feature activation-3.39e-02

Pos logit contributions
uct+0.004
 John+0.004
 Tommy+0.004
colm+0.004
 tears+0.004
Neg logit contributions
 mate-0.005
hetic-0.004
use-0.004
Ch-0.004
king-0.004
Token named
Token ID3706
Feature activation-4.28e-02

Pos logit contributions
use+0.060
 mate+0.058
hetic+0.057
 engineers+0.056
urnal+0.055
Neg logit contributions
 John-0.037
 Jerry-0.037
 Tom-0.037
uct-0.036
case-0.036
Token Fox
Token ID5426
Feature activation-6.20e-02

Pos logit contributions
 mate+0.094
Ch+0.085
hetic+0.083
use+0.082
Cow+0.081
Neg logit contributions
uct-0.048
 tears-0.038
colm-0.036
 Tears-0.035
case-0.035
Tokeny
Token ID88
Feature activation-2.04e-02

Pos logit contributions
hetic+0.089
 mate+0.089
Ch+0.088
ulent+0.084
 engineers+0.084
Neg logit contributions
 Tommy-0.085
uct-0.082
 John-0.082
colm-0.081
 Tom-0.076
Token.
Token ID13
Feature activation-1.71e-02

Pos logit contributions
 mate+0.044
use+0.043
hetic+0.042
Ch+0.041
urnal+0.041
Neg logit contributions
 Tommy-0.016
uct-0.016
 John-0.016
colm-0.016
 tears-0.014
Token Fox
Token ID5426
Feature activation-3.29e-02

Pos logit contributions
 mate+0.032
hetic+0.029
Ch+0.028
use+0.028
king+0.028
Neg logit contributions
uct-0.022
 John-0.020
colm-0.020
 Tommy-0.019
 Jerry-0.019
Tokeny
Token ID88
Feature activation+7.60e-03

Pos logit contributions
 mate+0.031
hetic+0.025
Ch+0.024
use+0.023
Cow+0.021
Neg logit contributions
uct-0.076
colm-0.068
 John-0.068
 tears-0.067
 Tommy-0.067
Token had
Token ID550
Feature activation+1.02e-03

Pos logit contributions
uct+0.012
 tears+0.010
 John+0.010
colm+0.010
 Tears+0.010
Neg logit contributions
 mate-0.013
Ch-0.011
hetic-0.011
use-0.011
ppy-0.011
Token and
Token ID290
Feature activation-8.47e-04

Pos logit contributions
 mate+0.046
hetic+0.042
Ch+0.041
use+0.040
Cow+0.039
Neg logit contributions
uct-0.032
 John-0.026
 tears-0.026
colm-0.026
case-0.025
Token took
Token ID1718
Feature activation-1.22e-02

Pos logit contributions
 mate+0.002
hetic+0.002
Ch+0.001
use+0.001
Cow+0.001
Neg logit contributions
uct-0.001
 John-0.001
colm-0.001
 tears-0.001
 Tommy-0.001
Token it
Token ID340
Feature activation-2.24e-02

Pos logit contributions
 mate+0.020
ch+0.019
Ch+0.018
be+0.018
 longer+0.018
Neg logit contributions
uct-0.020
case-0.016
colm-0.016
 ey-0.015
 Tears-0.015
Token to
Token ID284
Feature activation-1.98e-02

Pos logit contributions
 mate+0.038
hetic+0.033
Ch+0.033
use+0.032
king+0.032
Neg logit contributions
uct-0.035
colm-0.030
 John-0.030
 tears-0.029
 Tommy-0.029
Token the
Token ID262
Feature activation-1.91e-02

Pos logit contributions
 mate+0.046
hetic+0.043
Ch+0.042
use+0.042
ppy+0.042
Neg logit contributions
uct-0.018
 John-0.015
 Tommy-0.015
 tears-0.015
 Jerry-0.014
Token vet
Token ID20202
Feature activation-6.19e-02

Pos logit contributions
 mate+0.030
hetic+0.027
Ch+0.026
use+0.026
ppy+0.026
Neg logit contributions
uct-0.032
 tears-0.029
colm-0.028
 John-0.028
case-0.027
Token.
Token ID13
Feature activation-2.72e-04

Pos logit contributions
 mate+0.145
Ro+0.141
Qu+0.137
Ch+0.136
hetic+0.134
Neg logit contributions
uct-0.047
 John-0.046
 Tommy-0.043
 tears-0.040
 Jerry-0.040
Token\n
Token ID198
Feature activation+6.68e-04

Pos logit contributions
Ch+0.000
 mate+0.000
Cow+0.000
cient+0.000
Gr+0.000
Neg logit contributions
uct-0.000
colm-0.000
 tears-0.000
 ey-0.000
 done-0.000
Token\n
Token ID198
Feature activation-6.15e-03

Pos logit contributions
uct+0.001
 tears+0.001
colm+0.001
 ey+0.001
 Tears+0.001
Neg logit contributions
Ch-0.001
 mate-0.001
Cow-0.001
Gr-0.001
be-0.001
TokenThe
Token ID464
Feature activation-1.70e-02

Pos logit contributions
 mate+0.010
hetic+0.009
Ch+0.009
use+0.009
king+0.009
Neg logit contributions
uct-0.010
 John-0.008
colm-0.008
 tears-0.008
 Tommy-0.008
Token vet
Token ID20202
Feature activation-3.29e-02

Pos logit contributions
 mate+0.026
hetic+0.025
ppy+0.024
Ch+0.024
use+0.024
Neg logit contributions
uct-0.027
 tears-0.026
colm-0.025
case-0.024
 Jerry-0.023
Token always
Token ID1464
Feature activation-2.13e-02

Pos logit contributions
uct+0.003
 John+0.003
 Tommy+0.003
 Jerry+0.003
 tears+0.003
Neg logit contributions
 mate-0.003
use-0.003
king-0.003
mer-0.003
hetic-0.003
Token wanted
Token ID2227
Feature activation-9.02e-03

Pos logit contributions
 mate+0.043
 conclusions+0.040
hetic+0.039
Ch+0.039
 chick+0.039
Neg logit contributions
uct-0.023
 Tommy-0.022
 John-0.021
colm-0.020
 Jerry-0.018
Token to
Token ID284
Feature activation+9.97e-04

Pos logit contributions
hetic+0.017
use+0.017
 mate+0.017
ppy+0.017
oise+0.017
Neg logit contributions
 John-0.009
 Tommy-0.009
 tears-0.009
uct-0.009
 bicycles-0.008
Token fly
Token ID6129
Feature activation-2.30e-02

Pos logit contributions
uct+0.002
 John+0.001
 Tommy+0.001
 tears+0.001
 Jerry+0.001
Neg logit contributions
hetic-0.001
 mate-0.001
ppy-0.001
use-0.001
Ch-0.001
Token in
Token ID287
Feature activation+6.07e-03

Pos logit contributions
 mate+0.044
hetic+0.042
Ch+0.040
use+0.040
Cow+0.039
Neg logit contributions
uct-0.027
 John-0.024
 Tommy-0.023
 tears-0.023
 Jerry-0.023
Token one
Token ID530
Feature activation-6.16e-02

Pos logit contributions
uct+0.006
 John+0.005
 tears+0.005
 Tommy+0.005
colm+0.005
Neg logit contributions
 mate-0.013
hetic-0.012
use-0.012
Ch-0.012
king-0.012
Token.
Token ID13
Feature activation-3.04e-03

Pos logit contributions
hetic+0.108
 mate+0.106
use+0.100
king+0.097
Ch+0.097
Neg logit contributions
uct-0.083
case-0.074
 John-0.071
colm-0.071
 Tommy-0.070
Token One
Token ID1881
Feature activation-1.15e-02

Pos logit contributions
 mate+0.004
Ch+0.003
Cow+0.003
hetic+0.003
use+0.003
Neg logit contributions
uct-0.006
 tears-0.005
colm-0.005
 Tears-0.005
case-0.005
Token day
Token ID1110
Feature activation-8.35e-03

Pos logit contributions
 mate+0.022
hetic+0.021
use+0.020
king+0.019
Ch+0.019
Neg logit contributions
uct-0.015
 John-0.013
 Tommy-0.013
colm-0.013
 tears-0.012
Token,
Token ID11
Feature activation+9.87e-03

Pos logit contributions
 mate+0.014
hetic+0.012
Ch+0.012
use+0.011
be+0.011
Neg logit contributions
uct-0.014
colm-0.012
 tears-0.012
case-0.011
 John-0.011
Token Tim
Token ID5045
Feature activation+1.57e-02

Pos logit contributions
uct+0.013
colm+0.010
case+0.010
 tears+0.009
 Tears+0.009
Neg logit contributions
 mate-0.020
Ch-0.018
 longer-0.018
Cow-0.017
hetic-0.017
Token took
Token ID1718
Feature activation-2.12e-02

Pos logit contributions
 mate+0.018
 conclusions+0.016
hetic+0.016
king+0.015
urnal+0.015
Neg logit contributions
uct-0.009
 John-0.008
 tears-0.007
 Tommy-0.007
 Jerry-0.007
Token the
Token ID262
Feature activation-2.61e-02

Pos logit contributions
 mate+0.041
hetic+0.037
Ch+0.037
use+0.036
king+0.035
Neg logit contributions
uct-0.028
 John-0.023
 tears-0.023
colm-0.023
 Tommy-0.022
Token bird
Token ID6512
Feature activation-3.35e-02

Pos logit contributions
 mate+0.043
hetic+0.039
Ch+0.039
use+0.037
ppy+0.037
Neg logit contributions
uct-0.039
 tears-0.039
colm-0.034
 Tommy-0.034
case-0.034
Token to
Token ID284
Feature activation-1.84e-02

Pos logit contributions
 mate+0.053
hetic+0.053
Ch+0.050
 lear+0.048
Qu+0.047
Neg logit contributions
uct-0.047
 tears-0.045
 Jerry-0.045
 up-0.044
 John-0.043
Token the
Token ID262
Feature activation-3.14e-02

Pos logit contributions
 mate+0.043
hetic+0.041
oise+0.040
Ch+0.039
ppy+0.039
Neg logit contributions
uct-0.017
 tears-0.015
 John-0.015
 Tommy-0.014
 Jerry-0.014
Token vet
Token ID20202
Feature activation-6.16e-02

Pos logit contributions
 mate+0.050
hetic+0.047
ppy+0.046
Ch+0.046
use+0.044
Neg logit contributions
uct-0.050
 tears-0.049
case-0.045
colm-0.044
 Tears-0.043
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
 mate+0.147
Ro+0.145
Ch+0.142
Qu+0.142
hetic+0.140
Neg logit contributions
uct-0.043
 John-0.041
 Tommy-0.038
 tears-0.037
 Tom-0.036
Token Ros
Token ID10018
Feature activation-2.24e-04

Pos logit contributions
 mate+0.036
oise+0.032
hetic+0.031
use+0.031
king+0.031
Neg logit contributions
uct-0.015
 Tommy-0.013
 John-0.013
 tears-0.013
 Tears-0.012
Tokenie
Token ID494
Feature activation-4.02e-03

Pos logit contributions
 mate+0.000
hetic+0.000
Ch+0.000
urnal+0.000
use+0.000
Neg logit contributions
uct-0.000
colm-0.000
 tears-0.000
 John-0.000
 Tommy-0.000
Token wanted
Token ID2227
Feature activation+6.36e-04

Pos logit contributions
 mate+0.009
hetic+0.008
Ch+0.008
use+0.008
urnal+0.008
Neg logit contributions
uct-0.004
 tears-0.003
colm-0.003
 John-0.003
case-0.003
Token to
Token ID284
Feature activation-8.56e-03

Pos logit contributions
uct+0.001
 Tommy+0.001
 tears+0.001
 John+0.001
 Jerry+0.001
Neg logit contributions
 mate-0.001
hetic-0.001
Ch-0.001
oise-0.001
king-0.001
Token there
Token ID612
Feature activation+2.30e-03

Pos logit contributions
 mate+0.011
Ch+0.010
hetic+0.009
ppy+0.009
be+0.009
Neg logit contributions
uct-0.008
colm-0.006
 tears-0.006
 John-0.006
 Tears-0.006
Token was
Token ID373
Feature activation-1.27e-02

Pos logit contributions
uct+0.004
colm+0.003
 tears+0.003
 Jerry+0.003
 ey+0.003
Neg logit contributions
 mate-0.003
Ch-0.003
be-0.003
Qu-0.003
 nood-0.003
Token a
Token ID257
Feature activation+4.86e-03

Pos logit contributions
 mate+0.021
Ch+0.019
 chick+0.018
Gr+0.018
 longer+0.018
Neg logit contributions
uct-0.021
colm-0.018
 tears-0.016
case-0.016
 Tears-0.016
Token boy
Token ID2933
Feature activation-2.79e-02

Pos logit contributions
uct+0.005
 Tommy+0.004
 John+0.004
 tears+0.004
colm+0.004
Neg logit contributions
 mate-0.010
use-0.010
hetic-0.009
Ch-0.009
king-0.009
Token named
Token ID3706
Feature activation-3.68e-02

Pos logit contributions
 mate+0.048
hetic+0.046
urnal+0.044
use+0.044
mic+0.042
Neg logit contributions
 Tommy-0.039
 Tom-0.038
 John-0.037
 Jerry-0.035
 Sue-0.035
Token Mario
Token ID10682
Feature activation-6.15e-02

Pos logit contributions
 mate+0.084
Ch+0.074
hetic+0.071
use+0.070
Cow+0.070
Neg logit contributions
uct-0.041
 tears-0.032
colm-0.032
 Tears-0.030
case-0.029
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
 mate+0.134
use+0.127
Ro+0.126
 conclusions+0.125
 tweet+0.124
Neg logit contributions
 Tommy-0.056
 John-0.054
uct-0.052
colm-0.048
 Jerry-0.047
Token He
Token ID679
Feature activation-1.34e-02

Pos logit contributions
 mate+0.027
hetic+0.024
Ch+0.024
use+0.024
Cow+0.023
Neg logit contributions
uct-0.015
 tears-0.012
colm-0.012
 John-0.012
case-0.011
Token was
Token ID373
Feature activation-1.64e-03

Pos logit contributions
 mate+0.021
Ch+0.020
hetic+0.018
be+0.018
Gr+0.018
Neg logit contributions
uct-0.023
 John-0.019
 tears-0.019
 Tears-0.019
colm-0.018
Token very
Token ID845
Feature activation-3.19e-03

Pos logit contributions
 mate+0.003
use+0.003
hetic+0.003
Ch+0.003
king+0.003
Neg logit contributions
uct-0.002
 John-0.002
 Tommy-0.002
 tears-0.002
colm-0.002
Token eager
Token ID11069
Feature activation-8.43e-03

Pos logit contributions
 mate+0.006
use+0.006
Ch+0.006
hetic+0.006
king+0.006
Neg logit contributions
uct-0.004
 Tommy-0.003
 John-0.003
 tears-0.003
colm-0.003
Token young
Token ID1862
Feature activation-3.97e-03

Pos logit contributions
uct+0.005
 Tommy+0.004
 John+0.004
colm+0.004
 tears+0.004
Neg logit contributions
 mate-0.010
use-0.010
hetic-0.010
Ch-0.009
king-0.009
Token re
Token ID302
Feature activation-1.71e-02

Pos logit contributions
 mate+0.007
hetic+0.006
use+0.006
Ch+0.006
king+0.006
Neg logit contributions
uct-0.005
 John-0.005
 Tommy-0.005
colm-0.005
 tears-0.005
Tokenind
Token ID521
Feature activation-5.62e-03

Pos logit contributions
 mate+0.021
use+0.019
hetic+0.018
Ch+0.017
urnal+0.017
Neg logit contributions
uct-0.032
colm-0.031
case-0.029
 John-0.028
 tears-0.027
Tokeneer
Token ID28153
Feature activation-3.57e-02

Pos logit contributions
 mate+0.007
hetic+0.006
use+0.006
Ch+0.006
king+0.006
Neg logit contributions
uct-0.010
colm-0.010
 John-0.010
 tears-0.010
 Tommy-0.010
Token named
Token ID3706
Feature activation-4.48e-02

Pos logit contributions
use+0.066
 mate+0.064
hetic+0.059
urnal+0.059
 conclusions+0.057
Neg logit contributions
 on-0.038
 Tears-0.038
 John-0.038
case-0.037
 Tommy-0.036
Token Elise
Token ID46174
Feature activation-6.12e-02

Pos logit contributions
 mate+0.084
hetic+0.076
Ch+0.075
use+0.074
king+0.072
Neg logit contributions
uct-0.061
 John-0.051
colm-0.051
 Tommy-0.050
 tears-0.049
Token.
Token ID13
Feature activation-1.58e-02

Pos logit contributions
use+0.118
 mate+0.117
hetic+0.112
Ro+0.112
 engineers+0.110
Neg logit contributions
 John-0.058
uct-0.057
 Tommy-0.054
colm-0.054
case-0.048
Token She
Token ID1375
Feature activation-8.84e-03

Pos logit contributions
 mate+0.027
hetic+0.026
use+0.025
king+0.024
oise+0.024
Neg logit contributions
uct-0.022
 John-0.020
colm-0.020
 Tommy-0.020
 Tears-0.019
Token was
Token ID373
Feature activation-4.08e-05

Pos logit contributions
 mate+0.015
Ch+0.014
hetic+0.014
use+0.013
Cow+0.013
Neg logit contributions
uct-0.014
 John-0.011
colm-0.011
 tears-0.011
 Tommy-0.011
Token running
Token ID2491
Feature activation-1.08e-02

Pos logit contributions
 mate+0.000
hetic+0.000
Ch+0.000
be+0.000
Cow+0.000
Neg logit contributions
uct-0.000
colm-0.000
 John-0.000
 tears-0.000
 Tommy-0.000
Token around
Token ID1088
Feature activation-1.75e-02

Pos logit contributions
 mate+0.024
hetic+0.022
use+0.022
Ch+0.021
 chick+0.021
Neg logit contributions
uct-0.011
 John-0.009
 tears-0.009
 Tommy-0.009
colm-0.009
Token was
Token ID373
Feature activation-1.58e-02

Pos logit contributions
 mate+0.028
Ch+0.026
be+0.024
Gr+0.023
Cow+0.023
Neg logit contributions
uct-0.022
colm-0.017
 tears-0.017
 Jerry-0.016
 Tears-0.016
Token a
Token ID257
Feature activation-1.76e-04

Pos logit contributions
 mate+0.026
Ch+0.023
 chick+0.022
Gr+0.022
ch+0.021
Neg logit contributions
uct-0.027
colm-0.023
 tears-0.022
 Tears-0.021
case-0.021
Token smart
Token ID4451
Feature activation-1.12e-02

Pos logit contributions
 mate+0.000
use+0.000
hetic+0.000
Ch+0.000
king+0.000
Neg logit contributions
uct-0.000
 Tommy-0.000
colm-0.000
 John-0.000
 tears-0.000
Token fox
Token ID21831
Feature activation-4.05e-02

Pos logit contributions
 mate+0.019
hetic+0.017
use+0.017
Ch+0.017
king+0.016
Neg logit contributions
uct-0.016
 John-0.015
 Tommy-0.015
colm-0.015
 tears-0.014
Token named
Token ID3706
Feature activation-4.81e-02

Pos logit contributions
 mate+0.066
use+0.066
hetic+0.064
urnal+0.062
 engineers+0.061
Neg logit contributions
 John-0.050
uct-0.050
 Tommy-0.049
case-0.049
 Tom-0.049
Token Fox
Token ID5426
Feature activation-6.09e-02

Pos logit contributions
 mate+0.096
Ch+0.085
hetic+0.083
ppy+0.082
Cow+0.081
Neg logit contributions
uct-0.065
 tears-0.054
colm-0.051
 Tears-0.050
case-0.050
Tokeny
Token ID88
Feature activation-1.69e-02

Pos logit contributions
 mate+0.087
hetic+0.081
Ch+0.080
use+0.076
king+0.075
Neg logit contributions
uct-0.097
 Tommy-0.092
colm-0.092
 John-0.091
 tears-0.086
Token.
Token ID13
Feature activation-2.09e-02

Pos logit contributions
 mate+0.033
hetic+0.030
use+0.030
Ch+0.030
king+0.029
Neg logit contributions
uct-0.020
colm-0.018
 John-0.018
 Tommy-0.018
 tears-0.017
Token Fox
Token ID5426
Feature activation-3.61e-02

Pos logit contributions
 mate+0.032
hetic+0.030
use+0.028
Ch+0.028
be+0.028
Neg logit contributions
uct-0.033
 John-0.031
colm-0.031
 Tommy-0.030
 Jerry-0.030
Tokeny
Token ID88
Feature activation+7.65e-03

Pos logit contributions
 mate+0.032
hetic+0.024
Ch+0.024
use+0.023
ppy+0.022
Neg logit contributions
uct-0.087
 tears-0.077
 John-0.077
colm-0.077
 Tears-0.075
Token was
Token ID373
Feature activation-5.63e-03

Pos logit contributions
uct+0.013
 tears+0.010
 John+0.010
 Tears+0.010
colm+0.010
Neg logit contributions
 mate-0.013
Ch-0.011
hetic-0.011
ppy-0.010
 crew-0.010
Token with
Token ID351
Feature activation-1.37e-02

Pos logit contributions
uct+0.008
colm+0.007
 John+0.007
 tears+0.007
 Tommy+0.006
Neg logit contributions
 mate-0.011
hetic-0.010
Ch-0.010
use-0.009
be-0.009
Token many
Token ID867
Feature activation-2.24e-02

Pos logit contributions
 mate+0.026
Ch+0.023
hetic+0.023
Cow+0.022
use+0.022
Neg logit contributions
uct-0.020
colm-0.016
 John-0.015
 tears-0.015
 Tears-0.015
Token rooms
Token ID9519
Feature activation+6.40e-03

Pos logit contributions
 mate+0.044
hetic+0.040
Ch+0.039
use+0.038
be+0.038
Neg logit contributions
uct-0.029
colm-0.024
 John-0.023
 Tommy-0.022
 tears-0.022
Token.
Token ID13
Feature activation+9.40e-03

Pos logit contributions
uct+0.009
 John+0.007
 Tommy+0.007
 tears+0.007
case+0.007
Neg logit contributions
 mate-0.012
hetic-0.011
Ch-0.011
use-0.011
Cow-0.011
Token In
Token ID554
Feature activation+2.04e-02

Pos logit contributions
uct+0.014
colm+0.013
 tears+0.011
 ey+0.011
 Kyle+0.011
Neg logit contributions
Ch-0.015
 mate-0.015
Cow-0.014
Gr-0.013
 clinic-0.013
Token one
Token ID530
Feature activation-6.08e-02

Pos logit contributions
uct+0.026
colm+0.020
 John+0.019
 tears+0.019
 Tears+0.019
Neg logit contributions
 mate-0.041
Ch-0.037
hetic-0.037
use-0.036
Cow-0.035
Token room
Token ID2119
Feature activation+8.50e-03

Pos logit contributions
 mate+0.125
hetic+0.120
king+0.116
Ch+0.113
use+0.112
Neg logit contributions
uct-0.067
 tears-0.061
 John-0.059
case-0.059
 Tommy-0.057
Token,
Token ID11
Feature activation+5.57e-03

Pos logit contributions
uct+0.010
 John+0.009
 Tommy+0.009
 tears+0.008
case+0.008
Neg logit contributions
 mate-0.017
hetic-0.016
Ch-0.016
king-0.015
use-0.015
Token there
Token ID612
Feature activation+1.35e-02

Pos logit contributions
uct+0.008
colm+0.007
 tears+0.007
 ey+0.006
 Tears+0.006
Neg logit contributions
 mate-0.010
Ch-0.009
 nurses-0.009
 chick-0.008
Gr-0.008
Token was
Token ID373
Feature activation-6.52e-03

Pos logit contributions
uct+0.020
colm+0.017
 tears+0.015
 Tears+0.015
 Jerry+0.015
Neg logit contributions
 mate-0.024
Ch-0.022
be-0.020
Cow-0.020
 chick-0.020
Token a
Token ID257
Feature activation+1.17e-02

Pos logit contributions
 mate+0.012
Ch+0.011
hetic+0.011
use+0.010
Cow+0.010
Neg logit contributions
uct-0.009
colm-0.008
 John-0.007
 tears-0.007
 Tears-0.007
Token to
Token ID284
Feature activation-1.02e-02

Pos logit contributions
 mate+0.046
hetic+0.044
Ch+0.043
Cow+0.041
use+0.041
Neg logit contributions
uct-0.036
 John-0.031
 tears-0.031
 Jerry-0.031
case-0.030
Token the
Token ID262
Feature activation-4.00e-02

Pos logit contributions
 mate+0.020
hetic+0.019
Ch+0.019
king+0.018
use+0.018
Neg logit contributions
uct-0.013
 John-0.011
 tears-0.011
 Tommy-0.011
 Jerry-0.010
Token vet
Token ID20202
Feature activation-5.10e-02

Pos logit contributions
 mate+0.072
hetic+0.069
Ch+0.066
ppy+0.066
king+0.065
Neg logit contributions
uct-0.054
 tears-0.048
colm-0.047
case-0.046
 John-0.046
Token in
Token ID287
Feature activation-4.66e-03

Pos logit contributions
 mate+0.109
Ch+0.106
hetic+0.105
Ro+0.104
Qu+0.103
Neg logit contributions
uct-0.048
 John-0.045
 Tommy-0.042
case-0.041
 Jerry-0.040
Token the
Token ID262
Feature activation-3.86e-02

Pos logit contributions
 mate+0.009
hetic+0.009
Ch+0.008
use+0.008
king+0.008
Neg logit contributions
uct-0.006
 John-0.005
 tears-0.005
colm-0.005
 Tommy-0.005
Token morning
Token ID3329
Feature activation-6.08e-02

Pos logit contributions
 mate+0.065
hetic+0.059
Ch+0.057
use+0.055
king+0.055
Neg logit contributions
uct-0.061
 tears-0.052
colm-0.052
 John-0.051
 Tommy-0.050
Token.
Token ID13
Feature activation-2.54e-03

Pos logit contributions
 mate+0.118
hetic+0.110
Ch+0.106
oise+0.104
use+0.104
Neg logit contributions
uct-0.075
 John-0.068
 Tommy-0.065
 tears-0.064
case-0.064
Token\n
Token ID198
Feature activation-4.07e-03

Pos logit contributions
 mate+0.004
Ch+0.004
Cow+0.004
Gr+0.004
use+0.004
Neg logit contributions
uct-0.004
colm-0.003
 tears-0.003
case-0.003
 bicycles-0.003
Token\n
Token ID198
Feature activation-1.11e-02

Pos logit contributions
 mate+0.008
Ch+0.008
Gr+0.007
Cow+0.007
hetic+0.007
Neg logit contributions
uct-0.005
colm-0.004
 tears-0.004
 John-0.004
 Tommy-0.004
TokenThe
Token ID464
Feature activation-2.43e-02

Pos logit contributions
 mate+0.022
hetic+0.019
use+0.019
 crew+0.019
king+0.018
Neg logit contributions
uct-0.016
case-0.013
 John-0.012
 Jerry-0.012
colm-0.012
Token next
Token ID1306
Feature activation-2.17e-03

Pos logit contributions
 mate+0.042
hetic+0.037
Ch+0.036
use+0.036
king+0.036
Neg logit contributions
uct-0.036
 tears-0.034
colm-0.032
case-0.030
 Jerry-0.029
Token took
Token ID1718
Feature activation-1.27e-02

Pos logit contributions
 mate+0.045
hetic+0.041
Ch+0.040
use+0.040
Cow+0.039
Neg logit contributions
uct-0.030
colm-0.024
 John-0.024
 tears-0.024
 Tommy-0.023
Token the
Token ID262
Feature activation-2.01e-02

Pos logit contributions
 mate+0.023
 chick+0.021
Ch+0.021
hetic+0.021
Cow+0.021
Neg logit contributions
uct-0.018
case-0.014
colm-0.014
 Tears-0.014
 John-0.013
Token bird
Token ID6512
Feature activation-2.85e-02

Pos logit contributions
 mate+0.031
hetic+0.029
Ch+0.029
ppy+0.028
use+0.027
Neg logit contributions
uct-0.031
 tears-0.031
colm-0.028
 Jerry-0.027
case-0.027
Token to
Token ID284
Feature activation-1.93e-02

Pos logit contributions
 mate+0.053
hetic+0.050
Ch+0.049
use+0.047
Qu+0.046
Neg logit contributions
uct-0.037
 tears-0.033
 John-0.032
 Jerry-0.031
case-0.031
Token the
Token ID262
Feature activation-3.48e-02

Pos logit contributions
 mate+0.041
hetic+0.038
Ch+0.038
king+0.037
use+0.037
Neg logit contributions
uct-0.021
 John-0.018
 tears-0.018
 Tommy-0.018
 Jerry-0.017
Token vet
Token ID20202
Feature activation-6.07e-02

Pos logit contributions
 mate+0.045
hetic+0.042
ppy+0.041
Ch+0.039
oise+0.039
Neg logit contributions
uct-0.064
 tears-0.062
colm-0.061
case-0.060
 Tommy-0.058
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
 mate+0.145
Ro+0.143
Qu+0.140
Ch+0.139
hetic+0.137
Neg logit contributions
uct-0.042
 John-0.041
 Tommy-0.038
 tears-0.038
 Tom-0.035
Token The
Token ID383
Feature activation-3.09e-02

Pos logit contributions
 mate+0.023
Ch+0.021
hetic+0.021
Cow+0.020
use+0.020
Neg logit contributions
uct-0.023
colm-0.020
 tears-0.019
 John-0.018
case-0.018
Token vet
Token ID20202
Feature activation-3.50e-02

Pos logit contributions
 mate+0.039
hetic+0.037
ppy+0.037
use+0.035
Ch+0.035
Neg logit contributions
 tears-0.057
uct-0.055
colm-0.053
case-0.051
 wrapper-0.050
Token put
Token ID1234
Feature activation-4.38e-03

Pos logit contributions
 mate+0.073
hetic+0.066
Ch+0.065
use+0.064
king+0.064
Neg logit contributions
uct-0.040
 John-0.034
 tears-0.033
colm-0.033
 Tommy-0.033
Token a
Token ID257
Feature activation-6.56e-03

Pos logit contributions
 mate+0.009
hetic+0.008
Ch+0.008
use+0.008
 chick+0.008
Neg logit contributions
uct-0.005
colm-0.004
 John-0.004
case-0.004
 tears-0.004
Token and
Token ID290
Feature activation-8.12e-03

Pos logit contributions
 mate+0.038
mer+0.035
hetic+0.034
king+0.034
Ch+0.034
Neg logit contributions
uct-0.033
colm-0.026
 John-0.026
 Tommy-0.026
 tears-0.026
Token spun
Token ID26843
Feature activation-1.31e-02

Pos logit contributions
 mate+0.018
hetic+0.017
Ch+0.017
use+0.016
be+0.016
Neg logit contributions
uct-0.008
colm-0.006
 John-0.006
 tears-0.006
 Tommy-0.006
Token around
Token ID1088
Feature activation-1.63e-02

Pos logit contributions
 mate+0.025
Ch+0.022
hetic+0.022
Gr+0.021
king+0.021
Neg logit contributions
uct-0.018
colm-0.015
 John-0.015
 tears-0.014
 Tommy-0.014
Token in
Token ID287
Feature activation-4.52e-03

Pos logit contributions
 mate+0.038
use+0.035
hetic+0.035
 crew+0.033
ppy+0.033
Neg logit contributions
uct-0.015
 John-0.013
 tears-0.013
 Tommy-0.012
 Tears-0.011
Token the
Token ID262
Feature activation-3.15e-02

Pos logit contributions
 mate+0.009
hetic+0.008
Ch+0.008
use+0.008
king+0.008
Neg logit contributions
uct-0.006
 John-0.005
colm-0.005
 tears-0.005
 Tommy-0.004
Token night
Token ID1755
Feature activation-6.05e-02

Pos logit contributions
 mate+0.063
hetic+0.057
Ch+0.057
use+0.056
Cow+0.054
Neg logit contributions
uct-0.039
colm-0.032
 John-0.032
 tears-0.031
 Tommy-0.031
Token sky
Token ID6766
Feature activation-2.26e-02

Pos logit contributions
 mate+0.128
use+0.118
hetic+0.116
oise+0.115
Ch+0.112
Neg logit contributions
uct-0.061
 tears-0.056
case-0.055
 John-0.055
 Tommy-0.054
Token.
Token ID13
Feature activation+2.51e-03

Pos logit contributions
 mate+0.054
use+0.053
hetic+0.052
oise+0.050
Cow+0.050
Neg logit contributions
uct-0.015
 John-0.014
 tears-0.014
 Tears-0.013
case-0.012
Token\n
Token ID198
Feature activation+3.47e-03

Pos logit contributions
uct+0.003
colm+0.002
 tears+0.002
 John+0.002
 Tommy+0.002
Neg logit contributions
 mate-0.005
Ch-0.005
hetic-0.005
use-0.005
Cow-0.005
Token\n
Token ID198
Feature activation-1.07e-02

Pos logit contributions
uct+0.004
colm+0.003
 tears+0.003
 John+0.003
 Tommy+0.003
Neg logit contributions
 mate-0.007
Ch-0.007
hetic-0.007
use-0.006
Cow-0.006
TokenOne
Token ID3198
Feature activation-1.94e-02

Pos logit contributions
 mate+0.016
hetic+0.015
 crew+0.015
use+0.014
mer+0.014
Neg logit contributions
uct-0.021
case-0.019
John-0.017
 Tears-0.017
 John-0.016
Token woke
Token ID19092
Feature activation-7.59e-03

Pos logit contributions
uct+0.003
colm+0.003
 John+0.003
 tears+0.003
 Tears+0.003
Neg logit contributions
 mate-0.004
Ch-0.004
hetic-0.004
use-0.004
be-0.004
Token up
Token ID510
Feature activation-3.02e-02

Pos logit contributions
 mate+0.016
hetic+0.016
 chick+0.015
 longer+0.015
cient+0.015
Neg logit contributions
uct-0.007
colm-0.006
case-0.005
 John-0.005
 Tears-0.005
Token early
Token ID1903
Feature activation-1.41e-02

Pos logit contributions
 mate+0.053
hetic+0.048
Ch+0.047
king+0.046
be+0.046
Neg logit contributions
uct-0.044
colm-0.037
 John-0.036
case-0.035
 Tommy-0.035
Token in
Token ID287
Feature activation-1.81e-03

Pos logit contributions
 mate+0.027
hetic+0.025
Ch+0.024
use+0.024
Cow+0.023
Neg logit contributions
uct-0.019
 John-0.015
colm-0.015
 tears-0.015
 Tommy-0.015
Token the
Token ID262
Feature activation-2.16e-02

Pos logit contributions
 mate+0.003
Ch+0.003
use+0.003
hetic+0.003
Gr+0.003
Neg logit contributions
uct-0.003
colm-0.002
case-0.002
 Tears-0.002
 John-0.002
Token morning
Token ID3329
Feature activation-6.04e-02

Pos logit contributions
 mate+0.032
Ch+0.030
hetic+0.028
Cow+0.028
Gr+0.027
Neg logit contributions
uct-0.037
colm-0.032
 John-0.031
 tears-0.030
 Jerry-0.030
Token.
Token ID13
Feature activation+4.49e-03

Pos logit contributions
 mate+0.122
hetic+0.114
oise+0.109
use+0.109
 crew+0.107
Neg logit contributions
uct-0.072
 John-0.065
 Tommy-0.064
 tears-0.062
 Tears-0.061
Token She
Token ID1375
Feature activation-7.87e-03

Pos logit contributions
uct+0.008
colm+0.007
 Kyle+0.006
icycle+0.006
 tears+0.006
Neg logit contributions
Ch-0.006
 mate-0.006
Cow-0.006
Gr-0.006
 nood-0.005
Token felt
Token ID2936
Feature activation-1.37e-02

Pos logit contributions
Ch+0.011
 mate+0.011
be+0.010
Gr+0.010
hetic+0.010
Neg logit contributions
uct-0.014
 Jerry-0.012
colm-0.012
 John-0.011
 Tears-0.011
Token cold
Token ID4692
Feature activation-8.62e-03

Pos logit contributions
 mate+0.025
hetic+0.022
Ch+0.022
use+0.021
Cow+0.021
Neg logit contributions
uct-0.020
colm-0.017
 John-0.017
 tears-0.016
 Tommy-0.016
Token,
Token ID11
Feature activation-7.83e-03

Pos logit contributions
 mate+0.015
Ch+0.014
hetic+0.014
be+0.014
king+0.013
Neg logit contributions
uct-0.013
colm-0.011
 John-0.010
 tears-0.010
 Jerry-0.010
Token had
Token ID550
Feature activation+7.73e-03

Pos logit contributions
 mate+0.025
hetic+0.023
Ch+0.022
use+0.022
king+0.022
Neg logit contributions
uct-0.012
 John-0.009
colm-0.009
 tears-0.009
 Tommy-0.009
Token a
Token ID257
Feature activation+2.19e-02

Pos logit contributions
uct+0.013
colm+0.011
 John+0.010
 Tommy+0.010
 Tears+0.010
Neg logit contributions
 mate-0.013
Ch-0.011
be-0.011
 peac-0.011
hetic-0.011
Token cob
Token ID22843
Feature activation-3.81e-03

Pos logit contributions
uct+0.033
colm+0.026
 John+0.026
 Tommy+0.024
 Jerry+0.024
Neg logit contributions
 mate-0.041
 peac-0.035
hetic-0.035
 crew-0.035
Ch-0.035
Tokenweb
Token ID12384
Feature activation-6.16e-04

Pos logit contributions
 mate+0.006
hetic+0.006
use+0.005
Ch+0.005
Cow+0.005
Neg logit contributions
uct-0.006
colm-0.005
 tears-0.005
 John-0.005
case-0.005
Token in
Token ID287
Feature activation+4.22e-03

Pos logit contributions
 mate+0.001
hetic+0.001
use+0.001
Ch+0.001
Cow+0.001
Neg logit contributions
uct-0.001
case-0.001
 tears-0.001
 John-0.001
colm-0.001
Token one
Token ID530
Feature activation-6.04e-02

Pos logit contributions
uct+0.006
colm+0.005
 John+0.004
 tears+0.004
case+0.004
Neg logit contributions
 mate-0.008
Ch-0.007
hetic-0.007
 peac-0.007
Cow-0.007
Token corner
Token ID5228
Feature activation-1.93e-02

Pos logit contributions
 mate+0.118
hetic+0.113
use+0.107
Ch+0.106
king+0.106
Neg logit contributions
uct-0.075
case-0.064
 John-0.062
 tears-0.062
 Tears-0.061
Token and
Token ID290
Feature activation+5.94e-03

Pos logit contributions
 mate+0.038
hetic+0.036
use+0.036
Ch+0.035
Cow+0.035
Neg logit contributions
uct-0.023
 John-0.020
case-0.020
 tears-0.019
 Tears-0.019
Token the
Token ID262
Feature activation-7.56e-03

Pos logit contributions
uct+0.008
colm+0.007
 John+0.007
 tears+0.007
 Tommy+0.007
Neg logit contributions
 mate-0.011
hetic-0.010
Ch-0.010
use-0.009
Cow-0.009
Token walls
Token ID7714
Feature activation+2.30e-02

Pos logit contributions
 mate+0.014
use+0.013
hetic+0.013
king+0.012
ppy+0.012
Neg logit contributions
uct-0.010
 tears-0.009
 John-0.009
 Tommy-0.009
colm-0.008
Token were
Token ID547
Feature activation-4.60e-03

Pos logit contributions
uct+0.031
 John+0.026
 tears+0.026
case+0.025
colm+0.025
Neg logit contributions
 mate-0.044
hetic-0.039
Ch-0.039
use-0.038
Cow-0.038
Token with
Token ID351
Feature activation-1.25e-02

Pos logit contributions
 mate+0.011
hetic+0.010
use+0.010
Ch+0.010
Gr+0.010
Neg logit contributions
uct-0.007
case-0.006
 Tears-0.006
 John-0.006
 tears-0.006
Token a
Token ID257
Feature activation+1.29e-02

Pos logit contributions
 mate+0.025
hetic+0.023
use+0.023
Ch+0.023
king+0.022
Neg logit contributions
uct-0.015
 John-0.013
 tears-0.013
colm-0.013
 Tommy-0.012
Token purple
Token ID14032
Feature activation-4.54e-03

Pos logit contributions
uct+0.020
colm+0.016
 John+0.016
 Tommy+0.015
 Jerry+0.015
Neg logit contributions
 mate-0.023
hetic-0.021
Ch-0.020
 peac-0.020
 crew-0.020
Token door
Token ID3420
Feature activation+1.52e-02

Pos logit contributions
 mate+0.008
hetic+0.007
Ch+0.007
use+0.007
king+0.007
Neg logit contributions
uct-0.007
 John-0.006
colm-0.006
 tears-0.006
 Tommy-0.006
Token.
Token ID13
Feature activation+4.26e-03

Pos logit contributions
uct+0.024
colm+0.021
 John+0.020
 tears+0.020
 Tommy+0.020
Neg logit contributions
 mate-0.025
hetic-0.022
Ch-0.022
use-0.021
be-0.021
Token Inside
Token ID14384
Feature activation+4.76e-02

Pos logit contributions
uct+0.007
colm+0.006
 tears+0.005
 John+0.005
case+0.005
Neg logit contributions
 mate-0.007
Ch-0.007
Cow-0.006
hetic-0.006
Gr-0.006
Token the
Token ID262
Feature activation-9.57e-03

Pos logit contributions
uct+0.062
colm+0.050
 tears+0.049
 Tears+0.047
 John+0.046
Neg logit contributions
 mate-0.093
Ch-0.083
hetic-0.082
use-0.081
 chick-0.080
Token hut
Token ID40812
Feature activation+3.00e-02

Pos logit contributions
 mate+0.015
 chick+0.012
 crew+0.012
use+0.012
hetic+0.012
Neg logit contributions
uct-0.018
 Tommy-0.015
colm-0.015
 John-0.014
 Tears-0.014
Token lived
Token ID5615
Feature activation-7.20e-03

Pos logit contributions
uct+0.040
colm+0.032
 tears+0.031
 John+0.031
 Tommy+0.031
Neg logit contributions
 mate-0.058
hetic-0.051
Ch-0.051
use-0.051
be-0.050
Token a
Token ID257
Feature activation+1.66e-02

Pos logit contributions
 mate+0.015
hetic+0.014
use+0.014
Ch+0.014
king+0.013
Neg logit contributions
uct-0.008
 John-0.007
 tears-0.006
colm-0.006
 Tommy-0.006
Token little
Token ID1310
Feature activation-4.15e-03

Pos logit contributions
uct+0.024
colm+0.020
 John+0.020
 tears+0.019
 Tommy+0.019
Neg logit contributions
 mate-0.030
hetic-0.027
Ch-0.027
use-0.026
Cow-0.026
Token make
Token ID787
Feature activation-2.22e-02

Pos logit contributions
 mate+0.016
hetic+0.016
king+0.015
Ch+0.015
use+0.015
Neg logit contributions
uct-0.015
 John-0.014
 tears-0.014
 Tommy-0.014
case-0.013
Token a
Token ID257
Feature activation-4.11e-03

Pos logit contributions
 mate+0.044
hetic+0.039
Ch+0.039
use+0.038
be+0.037
Neg logit contributions
uct-0.029
colm-0.023
 John-0.022
 tears-0.022
 Tommy-0.022
Token big
Token ID1263
Feature activation+8.83e-03

Pos logit contributions
 mate+0.008
hetic+0.007
Ch+0.007
use+0.007
 crew+0.006
Neg logit contributions
uct-0.006
colm-0.005
 John-0.005
 Tommy-0.005
 tears-0.005
Token surprise
Token ID5975
Feature activation-7.37e-03

Pos logit contributions
uct+0.013
colm+0.011
 John+0.010
 tears+0.010
 Tommy+0.010
Neg logit contributions
 mate-0.016
hetic-0.014
Ch-0.014
use-0.014
be-0.013
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
 mate+0.011
hetic+0.009
Ch+0.009
use+0.009
be+0.009
Neg logit contributions
uct-0.013
colm-0.011
 John-0.011
 tears-0.011
 Tommy-0.011
Token J
Token ID449
Feature activation+5.58e-02

Pos logit contributions
uct+0.020
colm+0.018
 tears+0.017
case+0.016
 Jerry+0.015
Neg logit contributions
 mate-0.025
Ch-0.023
Cow-0.022
hetic-0.022
be-0.022
Tokenax
Token ID897
Feature activation-2.49e-03

Pos logit contributions
 tears+0.081
 Tears+0.077
case+0.077
uct+0.071
colm+0.065
Neg logit contributions
 mate-0.091
ppy-0.085
aa-0.084
be-0.082
ub-0.082
Token was
Token ID373
Feature activation+5.54e-03

Pos logit contributions
 mate+0.005
hetic+0.005
Ch+0.005
use+0.004
Cow+0.004
Neg logit contributions
uct-0.003
 John-0.002
colm-0.002
 tears-0.002
 Tommy-0.002
Token so
Token ID523
Feature activation+5.22e-03

Pos logit contributions
uct+0.007
colm+0.006
 Tommy+0.005
 Jerry+0.005
 tears+0.005
Neg logit contributions
 mate-0.011
hetic-0.010
be-0.010
Ch-0.010
 crew-0.009
Token excited
Token ID6568
Feature activation+1.43e-02

Pos logit contributions
uct+0.009
colm+0.009
 Jerry+0.008
 bicycles+0.008
 tears+0.008
Neg logit contributions
 mate-0.007
be-0.006
hetic-0.006
 crew-0.006
 longer-0.006
Token to
Token ID284
Feature activation+1.29e-02

Pos logit contributions
uct+0.026
colm+0.023
 tears+0.022
case+0.022
 Tears+0.021
Neg logit contributions
 mate-0.020
 longer-0.017
king-0.017
 someday-0.016
hetic-0.016
Tokeneddy
Token ID21874
Feature activation+2.72e-03

Pos logit contributions
uct+0.010
 tears+0.009
 John+0.009
colm+0.009
 Tommy+0.008
Neg logit contributions
 mate-0.011
Ch-0.010
hetic-0.009
be-0.009
use-0.009
Token bear
Token ID6842
Feature activation-7.31e-03

Pos logit contributions
uct+0.004
 John+0.004
 Tommy+0.004
 tears+0.004
colm+0.004
Neg logit contributions
 mate-0.004
hetic-0.004
Ch-0.004
king-0.004
be-0.004
Token.
Token ID13
Feature activation+7.63e-03

Pos logit contributions
 mate+0.014
hetic+0.013
Ch+0.013
use+0.012
king+0.012
Neg logit contributions
uct-0.009
 Tommy-0.008
 John-0.008
 tears-0.008
colm-0.008
Token\n
Token ID198
Feature activation+2.29e-03

Pos logit contributions
uct+0.012
colm+0.011
 ey+0.010
 tears+0.010
 done+0.009
Neg logit contributions
Ch-0.012
 mate-0.011
Cow-0.011
Gr-0.011
 chick-0.010
Token\n
Token ID198
Feature activation-5.54e-03

Pos logit contributions
uct+0.004
 tears+0.003
colm+0.003
 ey+0.003
 Tears+0.003
Neg logit contributions
Ch-0.004
 mate-0.004
Gr-0.004
Cow-0.004
be-0.003
TokenLu
Token ID25596
Feature activation+4.71e-02

Pos logit contributions
 mate+0.011
hetic+0.010
use+0.010
 crew+0.009
 peac+0.009
Neg logit contributions
uct-0.007
 John-0.006
case-0.006
colm-0.006
 Tommy-0.006
Tokency
Token ID948
Feature activation+2.18e-02

Pos logit contributions
uct+0.112
 tears+0.100
case+0.099
colm+0.097
 Jerry+0.096
Neg logit contributions
 mate-0.039
use-0.036
Ch-0.036
 chick-0.033
oise-0.033
Token was
Token ID373
Feature activation+6.43e-03

Pos logit contributions
uct+0.028
 John+0.023
colm+0.023
 tears+0.022
 Tommy+0.022
Neg logit contributions
 mate-0.042
hetic-0.038
Ch-0.038
use-0.037
be-0.036
Token excited
Token ID6568
Feature activation+4.69e-03

Pos logit contributions
uct+0.010
colm+0.009
case+0.008
 John+0.008
 ey+0.008
Neg logit contributions
 mate-0.011
hetic-0.009
 longer-0.009
be-0.009
Ch-0.009
Token to
Token ID284
Feature activation+1.49e-02

Pos logit contributions
uct+0.009
colm+0.008
case+0.007
 Tears+0.007
 tears+0.007
Neg logit contributions
 mate-0.006
 longer-0.006
king-0.005
 someday-0.005
be-0.005
Token give
Token ID1577
Feature activation+6.98e-03

Pos logit contributions
uct+0.029
 John+0.026
colm+0.026
 tears+0.026
 Tommy+0.026
Neg logit contributions
 mate-0.019
hetic-0.016
Ch-0.016
use-0.015
king-0.015
Token".
Token ID1911
Feature activation-6.46e-03

Pos logit contributions
 mate+0.016
hetic+0.015
Ch+0.015
use+0.014
Cow+0.014
Neg logit contributions
uct-0.013
 John-0.011
colm-0.010
 Tommy-0.010
 tears-0.010
Token He
Token ID679
Feature activation+4.25e-04

Pos logit contributions
Ch+0.009
 mate+0.009
Cow+0.009
Gr+0.009
 clinic+0.009
Neg logit contributions
uct-0.010
 ey-0.008
colm-0.008
 done-0.008
 unlocks-0.008
Token put
Token ID1234
Feature activation+1.38e-02

Pos logit contributions
uct+0.001
 John+0.001
colm+0.001
 Tears+0.001
 Tommy+0.001
Neg logit contributions
 mate-0.001
Ch-0.001
hetic-0.001
be-0.001
ppy-0.001
Token down
Token ID866
Feature activation-4.86e-03

Pos logit contributions
uct+0.020
case+0.018
colm+0.016
 Tears+0.015
 John+0.014
Neg logit contributions
 mate-0.024
use-0.022
 longer-0.021
 chick-0.021
Ch-0.021
Token the
Token ID262
Feature activation-1.42e-03

Pos logit contributions
 mate+0.010
hetic+0.009
Ch+0.009
 longer+0.008
use+0.008
Neg logit contributions
uct-0.006
colm-0.005
case-0.005
 Tears-0.005
 John-0.005
Token oranges
Token ID48389
Feature activation+4.65e-02

Pos logit contributions
 mate+0.002
use+0.002
 sheep+0.002
be+0.002
hetic+0.002
Neg logit contributions
uct-0.002
colm-0.002
 John-0.002
 Tears-0.002
case-0.002
Token and
Token ID290
Feature activation+2.71e-03

Pos logit contributions
uct+0.065
colm+0.054
 John+0.052
 tears+0.051
 Tears+0.051
Neg logit contributions
 mate-0.086
hetic-0.076
Ch-0.075
use-0.075
be-0.075
Token began
Token ID2540
Feature activation-1.36e-02

Pos logit contributions
uct+0.004
case+0.004
colm+0.004
 ey+0.004
 Tears+0.004
Neg logit contributions
 mate-0.004
 longer-0.004
hetic-0.003
Ch-0.003
 sheep-0.003
Token running
Token ID2491
Feature activation-1.79e-02

Pos logit contributions
 mate+0.023
hetic+0.020
Ch+0.020
use+0.020
ppy+0.019
Neg logit contributions
uct-0.021
 John-0.018
 tears-0.018
 Tommy-0.018
colm-0.018
Token around
Token ID1088
Feature activation-1.56e-02

Pos logit contributions
 mate+0.030
hetic+0.026
 longer+0.026
Ch+0.026
king+0.026
Neg logit contributions
uct-0.029
colm-0.025
 John-0.023
 Tears-0.023
case-0.023
Token the
Token ID262
Feature activation-2.56e-02

Pos logit contributions
 mate+0.031
hetic+0.028
Ch+0.028
use+0.027
king+0.027
Neg logit contributions
uct-0.020
colm-0.016
 John-0.016
 tears-0.016
 Tommy-0.015
Token.
Token ID13
Feature activation+4.15e-03

Pos logit contributions
uct+0.083
colm+0.071
 tears+0.068
 Tears+0.067
 John+0.067
Neg logit contributions
 mate-0.078
hetic-0.065
use-0.065
Ch-0.064
be-0.063
Token You
Token ID921
Feature activation-1.13e-02

Pos logit contributions
uct+0.007
colm+0.006
 tears+0.006
 John+0.005
case+0.005
Neg logit contributions
 mate-0.007
Ch-0.006
use-0.006
hetic-0.006
Cow-0.006
Token have
Token ID423
Feature activation+1.18e-02

Pos logit contributions
 mate+0.018
use+0.015
ppy+0.015
be+0.015
hetic+0.014
Neg logit contributions
uct-0.020
colm-0.017
 Tears-0.017
 John-0.017
 tears-0.016
Token your
Token ID534
Feature activation+1.04e-02

Pos logit contributions
uct+0.021
colm+0.018
 John+0.018
case+0.017
 Tears+0.017
Neg logit contributions
 mate-0.018
hetic-0.015
use-0.015
Ch-0.015
be-0.015
Token own
Token ID898
Feature activation+5.85e-03

Pos logit contributions
uct+0.018
colm+0.015
 John+0.014
 Tears+0.014
case+0.014
Neg logit contributions
 mate-0.018
use-0.015
 spy-0.014
 crew-0.014
hetic-0.014
Token toys
Token ID14958
Feature activation+5.07e-02

Pos logit contributions
uct+0.009
colm+0.007
 John+0.007
 Tears+0.007
 Tommy+0.007
Neg logit contributions
 mate-0.011
use-0.009
hetic-0.009
 spy-0.009
 crew-0.009
Token.
Token ID13
Feature activation+4.76e-03

Pos logit contributions
uct+0.077
colm+0.065
 John+0.064
 tears+0.063
 Tears+0.062
Neg logit contributions
 mate-0.088
hetic-0.078
use-0.076
Ch-0.076
be-0.074
Token Here
Token ID3423
Feature activation+2.39e-03

Pos logit contributions
uct+0.006
 John+0.006
 Tommy+0.005
colm+0.005
 tears+0.005
Neg logit contributions
 mate-0.009
hetic-0.008
Ch-0.008
use-0.008
king-0.008
Token,
Token ID11
Feature activation-5.32e-03

Pos logit contributions
uct+0.004
colm+0.003
 Tears+0.003
 John+0.003
 tears+0.003
Neg logit contributions
 mate-0.004
use-0.003
hetic-0.003
Ch-0.003
 peac-0.003
Token this
Token ID428
Feature activation-8.79e-03

Pos logit contributions
 mate+0.008
use+0.006
hetic+0.006
Ch+0.006
 spy+0.006
Neg logit contributions
uct-0.010
colm-0.009
 tears-0.008
 Tears-0.008
 John-0.008
Token is
Token ID318
Feature activation-3.49e-03

Pos logit contributions
 mate+0.017
use+0.014
 spy+0.014
 leader+0.013
hetic+0.013
Neg logit contributions
uct-0.013
colm-0.011
 Tears-0.011
 John-0.011
 Jerry-0.010
Token out
Token ID503
Feature activation-1.82e-02

Pos logit contributions
 mate+0.017
 chick+0.015
king+0.015
hetic+0.015
ppy+0.015
Neg logit contributions
uct-0.013
colm-0.010
case-0.010
 tears-0.009
 Tears-0.009
Token on
Token ID319
Feature activation+1.30e-02

Pos logit contributions
 mate+0.030
hetic+0.026
Ch+0.026
use+0.025
king+0.025
Neg logit contributions
uct-0.029
colm-0.026
 tears-0.024
 John-0.024
case-0.024
Token the
Token ID262
Feature activation-1.38e-02

Pos logit contributions
uct+0.018
colm+0.015
case+0.014
 Tears+0.014
 tears+0.013
Neg logit contributions
 mate-0.024
Ch-0.022
hetic-0.022
 longer-0.021
 chick-0.021
Token table
Token ID3084
Feature activation-9.59e-04

Pos logit contributions
 mate+0.028
hetic+0.025
Ch+0.025
use+0.025
 peac+0.024
Neg logit contributions
uct-0.017
 tears-0.014
 John-0.014
colm-0.014
case-0.014
Token.
Token ID13
Feature activation+3.98e-03

Pos logit contributions
 mate+0.002
hetic+0.002
Ch+0.002
use+0.002
Cow+0.002
Neg logit contributions
uct-0.001
 John-0.001
 tears-0.001
colm-0.001
case-0.001
Token Max
Token ID5436
Feature activation+9.29e-03

Pos logit contributions
uct+0.007
colm+0.006
 done+0.005
ries+0.005
 ey+0.005
Neg logit contributions
Ch-0.006
Cow-0.005
 mate-0.005
Gr-0.005
Some-0.005
Token and
Token ID290
Feature activation+2.57e-02

Pos logit contributions
uct+0.013
colm+0.011
 John+0.011
 Tommy+0.010
 tears+0.010
Neg logit contributions
 mate-0.017
hetic-0.015
Ch-0.015
be-0.015
Cow-0.015
Token his
Token ID465
Feature activation+4.78e-02

Pos logit contributions
case+0.042
uct+0.040
colm+0.034
icycle+0.033
 wrapper+0.031
Neg logit contributions
 mate-0.041
 sheep-0.040
 act-0.037
 spy-0.037
 nurses-0.036
Token dad
Token ID9955
Feature activation+4.13e-03

Pos logit contributions
uct+0.084
case+0.065
colm+0.064
 Jerry+0.062
 Tears+0.062
Neg logit contributions
 mate-0.079
 crew-0.072
 sheep-0.070
 leader-0.067
 chick-0.066
Token both
Token ID1111
Feature activation+3.40e-03

Pos logit contributions
uct+0.005
 John+0.005
 tears+0.004
 Tommy+0.004
colm+0.004
Neg logit contributions
 mate-0.008
hetic-0.007
Ch-0.007
use-0.007
Cow-0.007
Token laughed
Token ID13818
Feature activation+7.58e-03

Pos logit contributions
uct+0.005
colm+0.004
 John+0.004
 tears+0.004
 Tommy+0.004
Neg logit contributions
 mate-0.007
hetic-0.006
Ch-0.006
use-0.006
be-0.006
Token Jimmy
Token ID12963
Feature activation-9.88e-03

Pos logit contributions
uct+0.037
case+0.035
 ey+0.034
colm+0.033
 Tears+0.032
Neg logit contributions
 mate-0.046
Ch-0.042
 longer-0.041
ppy-0.041
Gr-0.041
Token a
Token ID257
Feature activation+5.21e-03

Pos logit contributions
 mate+0.018
hetic+0.016
Ch+0.016
ppy+0.015
be+0.015
Neg logit contributions
uct-0.015
colm-0.012
case-0.012
 tears-0.011
 John-0.011
Token reward
Token ID6721
Feature activation-3.21e-03

Pos logit contributions
uct+0.008
colm+0.006
 John+0.006
 Tommy+0.006
 Tears+0.006
Neg logit contributions
 mate-0.010
hetic-0.008
Ch-0.008
use-0.008
 chick-0.008
Token.
Token ID13
Feature activation+8.52e-03

Pos logit contributions
 mate+0.006
hetic+0.005
Ch+0.005
use+0.005
Cow+0.005
Neg logit contributions
uct-0.005
 John-0.004
colm-0.004
 Tommy-0.004
 tears-0.004
Token Jimmy
Token ID12963
Feature activation+1.16e-02

Pos logit contributions
uct+0.011
colm+0.010
 done+0.009
 tears+0.009
 bicycles+0.008
Neg logit contributions
 mate-0.015
Ch-0.014
Gr-0.014
hetic-0.014
 clinic-0.013
Token smiled
Token ID13541
Feature activation+1.04e-02

Pos logit contributions
uct+0.015
colm+0.012
 Tears+0.011
 Tommy+0.011
 Jerry+0.011
Neg logit contributions
 mate-0.022
hetic-0.020
Ch-0.020
be-0.020
 crew-0.020
Token and
Token ID290
Feature activation-1.95e-03

Pos logit contributions
uct+0.014
colm+0.011
 John+0.011
 tears+0.011
case+0.011
Neg logit contributions
 mate-0.020
hetic-0.018
king-0.018
Ch-0.018
 crew-0.017
Token said
Token ID531
Feature activation-3.97e-03

Pos logit contributions
 mate+0.003
 crew+0.003
 chick+0.003
Ch+0.003
hetic+0.003
Neg logit contributions
uct-0.003
case-0.003
 ey-0.003
colm-0.002
icycle-0.002
Token,
Token ID11
Feature activation-2.79e-04

Pos logit contributions
 mate+0.007
hetic+0.006
Ch+0.006
 crew+0.006
use+0.006
Neg logit contributions
uct-0.006
colm-0.005
 tears-0.005
case-0.005
 Tears-0.005
Token "
Token ID366
Feature activation-1.42e-02

Pos logit contributions
 mate+0.001
hetic+0.000
Ch+0.000
use+0.000
Cow+0.000
Neg logit contributions
uct-0.000
 John-0.000
colm-0.000
 tears-0.000
 Tommy-0.000
TokenThe
Token ID464
Feature activation-3.22e-02

Pos logit contributions
 mate+0.024
hetic+0.021
Ch+0.021
use+0.020
Cow+0.020
Neg logit contributions
uct-0.022
colm-0.019
 John-0.019
 tears-0.019
 Tommy-0.018
Token wanted
Token ID2227
Feature activation+9.72e-03

Pos logit contributions
uct+0.011
 John+0.009
colm+0.009
 Jerry+0.009
 Tommy+0.009
Neg logit contributions
 mate-0.013
Ch-0.012
hetic-0.012
ppy-0.011
use-0.011
Token to
Token ID284
Feature activation+4.56e-03

Pos logit contributions
uct+0.018
colm+0.016
case+0.015
 Tears+0.015
 John+0.014
Neg logit contributions
 mate-0.014
Ch-0.011
use-0.011
hetic-0.011
 spy-0.011
Token stop
Token ID2245
Feature activation-6.68e-03

Pos logit contributions
uct+0.010
colm+0.009
 Tears+0.009
 John+0.009
 tears+0.009
Neg logit contributions
 mate-0.005
 spy-0.004
use-0.003
Ch-0.003
Cow-0.003
Token playing
Token ID2712
Feature activation+7.34e-03

Pos logit contributions
 mate+0.012
hetic+0.010
Ch+0.010
use+0.010
king+0.010
Neg logit contributions
uct-0.010
colm-0.008
 John-0.008
 tears-0.008
case-0.008
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
uct+0.011
colm+0.010
 John+0.009
 Tears+0.009
 tears+0.009
Neg logit contributions
 mate-0.013
hetic-0.011
Ch-0.011
use-0.011
 longer-0.011
Token\n
Token ID198
Feature activation+8.75e-03

Pos logit contributions
uct+0.018
colm+0.016
 ey+0.015
 tears+0.015
 bicycles+0.014
Neg logit contributions
Ch-0.016
 mate-0.016
Cow-0.015
Gr-0.014
ch-0.014
Token\n
Token ID198
Feature activation+3.22e-03

Pos logit contributions
uct+0.012
colm+0.010
 tears+0.010
 John+0.009
 Tears+0.009
Neg logit contributions
 mate-0.016
Ch-0.016
Gr-0.015
Cow-0.015
use-0.014
Token"
Token ID1
Feature activation-8.99e-03

Pos logit contributions
uct+0.005
colm+0.005
 John+0.005
case+0.004
 Tommy+0.004
Neg logit contributions
 mate-0.005
hetic-0.005
use-0.004
 peac-0.004
 crew-0.004
TokenL
Token ID43
Feature activation+6.66e-03

Pos logit contributions
 mate+0.014
hetic+0.012
Ch+0.012
use+0.012
Cow+0.011
Neg logit contributions
uct-0.016
 John-0.014
colm-0.014
 tears-0.013
 Tommy-0.013
Tokenily
Token ID813
Feature activation+3.68e-03

Pos logit contributions
uct+0.011
colm+0.009
 John+0.009
 tears+0.009
 Tears+0.009
Neg logit contributions
 mate-0.011
hetic-0.010
Ch-0.009
use-0.009
ppy-0.009
Token,
Token ID11
Feature activation-7.51e-04

Pos logit contributions
uct+0.007
colm+0.006
 tears+0.005
 Tears+0.005
case+0.005
Neg logit contributions
 mate-0.006
use-0.005
hetic-0.005
ppy-0.005
be-0.005
Token Lily
Token ID20037
Feature activation-1.18e-03

Pos logit contributions
uct+0.013
 John+0.010
colm+0.010
 tears+0.010
 Tommy+0.010
Neg logit contributions
 mate-0.026
hetic-0.024
Ch-0.024
use-0.023
Cow-0.023
Token,
Token ID11
Feature activation-3.57e-03

Pos logit contributions
 mate+0.002
hetic+0.002
Ch+0.002
use+0.002
Cow+0.002
Neg logit contributions
uct-0.001
 John-0.001
 Tommy-0.001
 tears-0.001
colm-0.001
Token but
Token ID475
Feature activation-9.81e-03

Pos logit contributions
 mate+0.006
hetic+0.005
Ch+0.005
use+0.005
Cow+0.005
Neg logit contributions
uct-0.006
 John-0.005
 tears-0.005
 Tommy-0.005
colm-0.005
Token she
Token ID673
Feature activation+9.56e-03

Pos logit contributions
 mate+0.018
Ch+0.016
hetic+0.016
use+0.015
 longer+0.015
Neg logit contributions
uct-0.014
colm-0.012
 John-0.011
 Tears-0.011
case-0.011
Token holds
Token ID6622
Feature activation+4.48e-03

Pos logit contributions
uct+0.015
 John+0.013
 Jerry+0.012
colm+0.012
 Robert+0.012
Neg logit contributions
 mate-0.016
Ch-0.015
hetic-0.015
ppy-0.014
be-0.014
Token on
Token ID319
Feature activation+1.48e-02

Pos logit contributions
uct+0.007
case+0.006
colm+0.005
 Tears+0.005
 John+0.005
Neg logit contributions
 mate-0.008
Ch-0.007
hetic-0.007
ppy-0.007
 chick-0.007
Token tight
Token ID5381
Feature activation-1.36e-02

Pos logit contributions
uct+0.021
colm+0.017
case+0.016
 John+0.015
 Tears+0.015
Neg logit contributions
 mate-0.027
Ch-0.025
hetic-0.025
king-0.024
be-0.023
Token.
Token ID13
Feature activation+1.13e-03

Pos logit contributions
 mate+0.023
hetic+0.020
Ch+0.020
use+0.019
king+0.019
Neg logit contributions
uct-0.022
colm-0.018
 John-0.018
case-0.018
 tears-0.018
Token They
Token ID1119
Feature activation-5.03e-04

Pos logit contributions
uct+0.002
colm+0.001
 tears+0.001
 John+0.001
case+0.001
Neg logit contributions
 mate-0.002
Ch-0.002
Cow-0.002
hetic-0.002
Gr-0.002
Token start
Token ID923
Feature activation-7.62e-03

Pos logit contributions
 mate+0.001
hetic+0.001
Ch+0.001
use+0.001
Cow+0.001
Neg logit contributions
uct-0.001
colm-0.001
 John-0.001
 tears-0.001
 Tommy-0.001
Token to
Token ID284
Feature activation-8.47e-03

Pos logit contributions
 mate+0.016
use+0.014
 crew+0.014
hetic+0.014
 peac+0.014
Neg logit contributions
uct-0.009
 tears-0.009
 Tommy-0.008
 John-0.008
 Tears-0.008
Token are
Token ID389
Feature activation+2.37e-03

Pos logit contributions
 mate+0.056
hetic+0.045
 spy+0.045
ppy+0.044
cient+0.044
Neg logit contributions
uct-0.086
colm-0.071
 tears-0.067
 John-0.066
 Tears-0.065
Token you
Token ID345
Feature activation-9.17e-03

Pos logit contributions
uct+0.004
colm+0.004
 John+0.004
 tears+0.004
case+0.004
Neg logit contributions
 mate-0.004
hetic-0.003
Ch-0.003
use-0.003
 peac-0.003
Token crying
Token ID13774
Feature activation-7.54e-03

Pos logit contributions
 mate+0.016
hetic+0.014
Ch+0.014
use+0.014
king+0.014
Neg logit contributions
uct-0.014
colm-0.012
 John-0.012
 tears-0.011
 Tommy-0.011
Token?"
Token ID1701
Feature activation-1.39e-02

Pos logit contributions
 mate+0.014
hetic+0.012
Ch+0.012
use+0.012
Cow+0.012
Neg logit contributions
uct-0.011
 John-0.009
colm-0.009
 tears-0.009
 Tommy-0.009
Token The
Token ID383
Feature activation-1.40e-02

Pos logit contributions
 mate+0.023
Ch+0.022
Cow+0.021
use+0.021
hetic+0.021
Neg logit contributions
uct-0.020
colm-0.018
case-0.017
 tears-0.017
 Tears-0.016
Token little
Token ID1310
Feature activation+8.80e-03

Pos logit contributions
 mate+0.022
Ch+0.020
hetic+0.019
Gr+0.019
 chick+0.019
Neg logit contributions
uct-0.023
colm-0.020
 John-0.020
 Tommy-0.019
 Tears-0.019
Token girl
Token ID2576
Feature activation-4.23e-03

Pos logit contributions
uct+0.015
colm+0.013
 John+0.013
 tears+0.013
 Tommy+0.013
Neg logit contributions
 mate-0.013
hetic-0.012
Ch-0.012
use-0.011
Cow-0.011
Token said
Token ID531
Feature activation+5.03e-03

Pos logit contributions
 mate+0.009
hetic+0.009
Ch+0.008
use+0.008
king+0.008
Neg logit contributions
uct-0.005
 John-0.004
colm-0.003
 tears-0.003
 Tommy-0.003
Token,
Token ID11
Feature activation-1.12e-03

Pos logit contributions
uct+0.004
 John+0.004
 tears+0.004
 Tommy+0.004
 Jerry+0.004
Neg logit contributions
 mate-0.012
Ch-0.011
oise-0.011
use-0.011
 engineers-0.011
Token "
Token ID366
Feature activation-5.18e-03

Pos logit contributions
 mate+0.002
hetic+0.002
use+0.002
Ch+0.002
oise+0.002
Neg logit contributions
uct-0.001
 John-0.001
 Tommy-0.001
 tears-0.001
 Tears-0.001
TokenI
Token ID40
Feature activation-2.48e-03

Pos logit contributions
 mate+0.008
Ch+0.007
hetic+0.007
use+0.007
Cow+0.007
Neg logit contributions
uct-0.009
colm-0.007
 John-0.007
 tears-0.007
 Tommy-0.007
Token.
Token ID13
Feature activation+7.16e-03

Pos logit contributions
uct+0.019
colm+0.016
 John+0.015
 tears+0.015
 Tommy+0.015
Neg logit contributions
 mate-0.023
hetic-0.020
Ch-0.020
use-0.020
king-0.019
Token The
Token ID383
Feature activation+2.56e-03

Pos logit contributions
uct+0.009
colm+0.007
 tears+0.007
case+0.006
 John+0.006
Neg logit contributions
 mate-0.015
Ch-0.014
hetic-0.013
Cow-0.013
Gr-0.013
Token goose
Token ID37246
Feature activation-3.75e-03

Pos logit contributions
uct+0.004
 tears+0.004
colm+0.004
 Tommy+0.004
 John+0.004
Neg logit contributions
 mate-0.004
hetic-0.004
Ch-0.004
ppy-0.004
use-0.004
Token thinks
Token ID6834
Feature activation-1.51e-03

Pos logit contributions
 mate+0.007
hetic+0.006
use+0.006
Ch+0.006
be+0.006
Neg logit contributions
uct-0.005
 John-0.004
colm-0.004
 Tommy-0.004
 Tears-0.004
Token they
Token ID484
Feature activation-2.44e-03

Pos logit contributions
 mate+0.003
hetic+0.003
Ch+0.003
use+0.003
Cow+0.003
Neg logit contributions
uct-0.002
colm-0.002
 John-0.002
 tears-0.002
 Tommy-0.002
Token have
Token ID423
Feature activation+5.71e-03

Pos logit contributions
 mate+0.004
hetic+0.003
use+0.003
Ch+0.003
be+0.003
Neg logit contributions
uct-0.004
 John-0.003
colm-0.003
 tears-0.003
 Tommy-0.003
Token no
Token ID645
Feature activation-4.83e-03

Pos logit contributions
uct+0.009
colm+0.007
 John+0.007
case+0.007
 Tears+0.007
Neg logit contributions
 mate-0.010
hetic-0.009
Ch-0.009
use-0.009
be-0.008
Token more
Token ID517
Feature activation+7.64e-03

Pos logit contributions
 mate+0.009
Ch+0.009
hetic+0.009
ppy+0.009
Gr+0.009
Neg logit contributions
uct-0.005
 tears-0.005
 Tommy-0.005
 John-0.005
 Jerry-0.005
Token bread
Token ID8509
Feature activation+5.04e-03

Pos logit contributions
uct+0.012
colm+0.010
 John+0.010
case+0.009
 Tears+0.009
Neg logit contributions
 mate-0.014
hetic-0.012
use-0.011
 peac-0.011
be-0.011
Token.
Token ID13
Feature activation+4.56e-03

Pos logit contributions
uct+0.008
colm+0.006
 John+0.006
 tears+0.006
 Tommy+0.006
Neg logit contributions
 mate-0.009
hetic-0.008
Ch-0.008
use-0.008
king-0.007
Token The
Token ID383
Feature activation+1.57e-03

Pos logit contributions
uct+0.006
colm+0.005
 tears+0.005
 John+0.004
case+0.004
Neg logit contributions
 mate-0.009
Ch-0.008
hetic-0.008
Cow-0.008
use-0.008
Token little
Token ID1310
Feature activation-5.68e-03

Pos logit contributions
uct+0.021
 tears+0.018
 John+0.018
 Tommy+0.018
colm+0.018
Neg logit contributions
 mate-0.027
hetic-0.025
Ch-0.025
use-0.024
king-0.024
Token scratch
Token ID12692
Feature activation-1.20e-02

Pos logit contributions
 mate+0.008
Ch+0.008
hetic+0.008
Cow+0.008
Gr+0.008
Neg logit contributions
 tears-0.009
uct-0.008
 John-0.008
 Tommy-0.008
 Tears-0.008
Token.
Token ID13
Feature activation-1.97e-03

Pos logit contributions
 mate+0.025
Ch+0.022
hetic+0.022
Gr+0.022
Cow+0.022
Neg logit contributions
uct-0.013
 Tommy-0.012
 John-0.012
case-0.011
 Jerry-0.011
Token You
Token ID921
Feature activation-1.52e-02

Pos logit contributions
 mate+0.004
hetic+0.004
use+0.004
king+0.004
Ch+0.004
Neg logit contributions
uct-0.002
 John-0.002
 Tommy-0.002
colm-0.002
 Tears-0.002
Token were
Token ID547
Feature activation+1.21e-02

Pos logit contributions
 mate+0.024
hetic+0.021
Ch+0.021
use+0.020
king+0.020
Neg logit contributions
uct-0.026
 John-0.022
colm-0.022
 tears-0.022
 Tommy-0.022
Token very
Token ID845
Feature activation-3.08e-03

Pos logit contributions
uct+0.019
colm+0.017
case+0.016
 John+0.016
 Tears+0.016
Neg logit contributions
 mate-0.021
hetic-0.018
 chick-0.017
king-0.017
ppy-0.017
Token brave
Token ID14802
Feature activation+4.60e-04

Pos logit contributions
 mate+0.006
Ch+0.005
hetic+0.005
use+0.005
king+0.005
Neg logit contributions
uct-0.004
 John-0.004
 tears-0.004
 Tommy-0.004
colm-0.004
Token and
Token ID290
Feature activation-8.43e-03

Pos logit contributions
uct+0.001
 John+0.000
 Tommy+0.000
 tears+0.000
colm+0.000
Neg logit contributions
 mate-0.001
Ch-0.001
hetic-0.001
Cow-0.001
use-0.001
Token did
Token ID750
Feature activation+9.58e-03

Pos logit contributions
 mate+0.015
Ch+0.014
hetic+0.014
use+0.013
king+0.013
Neg logit contributions
uct-0.012
 John-0.010
 Tommy-0.010
 tears-0.010
 Jerry-0.010
Token a
Token ID257
Feature activation+4.11e-03

Pos logit contributions
uct+0.009
 tears+0.007
 John+0.007
 Tommy+0.007
colm+0.007
Neg logit contributions
 mate-0.022
Ch-0.020
hetic-0.020
use-0.020
Cow-0.019
Token great
Token ID1049
Feature activation+2.13e-02

Pos logit contributions
uct+0.003
 tears+0.003
 Tommy+0.003
colm+0.003
 John+0.003
Neg logit contributions
 mate-0.009
Ch-0.009
use-0.009
hetic-0.008
oise-0.008
Token said
Token ID531
Feature activation+8.67e-03

Pos logit contributions
uct+0.013
colm+0.011
 John+0.011
 tears+0.011
 Tommy+0.010
Neg logit contributions
 mate-0.018
hetic-0.016
Ch-0.016
use-0.016
Cow-0.015
Token,
Token ID11
Feature activation+7.57e-03

Pos logit contributions
uct+0.009
 John+0.008
 tears+0.008
 Tommy+0.008
 Jerry+0.008
Neg logit contributions
 mate-0.018
Ch-0.018
oise-0.017
use-0.017
hetic-0.017
Token "
Token ID366
Feature activation-1.12e-02

Pos logit contributions
uct+0.008
 John+0.007
 Tommy+0.006
 tears+0.006
 Jerry+0.006
Neg logit contributions
 mate-0.017
oise-0.016
use-0.016
hetic-0.016
 conclusions-0.015
TokenPlease
Token ID5492
Feature activation-1.82e-02

Pos logit contributions
 mate+0.021
hetic+0.019
use+0.018
 crew+0.018
ppy+0.018
Neg logit contributions
uct-0.016
colm-0.013
 John-0.013
 Tommy-0.013
 tears-0.012
Token can
Token ID460
Feature activation-9.28e-03

Pos logit contributions
 mate+0.034
Ch+0.031
hetic+0.031
king+0.031
Gr+0.030
Neg logit contributions
uct-0.023
 Tommy-0.022
 John-0.021
 tears-0.020
 Jerry-0.020
Token I
Token ID314
Feature activation-1.30e-02

Pos logit contributions
 mate+0.016
hetic+0.014
Ch+0.014
use+0.013
Cow+0.013
Neg logit contributions
uct-0.015
colm-0.013
 John-0.012
 tears-0.012
 Tommy-0.012
Token take
Token ID1011
Feature activation-7.54e-03

Pos logit contributions
 mate+0.021
hetic+0.020
king+0.019
Ch+0.019
 crew+0.018
Neg logit contributions
uct-0.020
 Tommy-0.019
case-0.018
 John-0.018
 tears-0.017
Token it
Token ID340
Feature activation+2.31e-03

Pos logit contributions
 mate+0.014
hetic+0.013
Ch+0.013
king+0.013
use+0.013
Neg logit contributions
uct-0.010
 tears-0.009
 John-0.008
 Tommy-0.008
colm-0.008
Token home
Token ID1363
Feature activation+5.23e-03

Pos logit contributions
uct+0.003
 John+0.003
 Tommy+0.003
case+0.003
 tears+0.003
Neg logit contributions
 mate-0.004
hetic-0.004
Ch-0.004
king-0.004
use-0.004
Token?
Token ID30
Feature activation-1.82e-02

Pos logit contributions
uct+0.007
 John+0.006
 Tommy+0.006
colm+0.005
case+0.005
Neg logit contributions
 mate-0.010
hetic-0.009
Ch-0.009
use-0.009
 peac-0.009
Token I
Token ID314
Feature activation+2.35e-03

Pos logit contributions
 mate+0.036
hetic+0.033
use+0.032
king+0.032
 peac+0.031
Neg logit contributions
uct-0.023
 John-0.021
 Tommy-0.020
 Tears-0.019
case-0.019
Token.
Token ID13
Feature activation-5.78e-03

Pos logit contributions
 mate+0.032
hetic+0.029
Ch+0.029
use+0.028
Gr+0.027
Neg logit contributions
uct-0.025
 John-0.021
 tears-0.021
 Tommy-0.020
colm-0.020
Token Lily
Token ID20037
Feature activation+1.94e-02

Pos logit contributions
 mate+0.009
Ch+0.009
hetic+0.009
Cow+0.008
Gr+0.008
Neg logit contributions
uct-0.008
colm-0.008
 John-0.007
 tears-0.007
 Jerry-0.006
Token accidentally
Token ID14716
Feature activation-4.76e-03

Pos logit contributions
uct+0.026
 John+0.022
colm+0.022
 Tommy+0.021
 tears+0.021
Neg logit contributions
 mate-0.036
hetic-0.033
Ch-0.033
use-0.032
be-0.031
Token touched
Token ID12615
Feature activation+1.99e-03

Pos logit contributions
 mate+0.008
hetic+0.007
Ch+0.007
use+0.007
king+0.007
Neg logit contributions
uct-0.007
 John-0.006
colm-0.006
 tears-0.006
 Tommy-0.006
Token the
Token ID262
Feature activation+6.62e-03

Pos logit contributions
uct+0.002
colm+0.002
case+0.002
 John+0.002
 tears+0.002
Neg logit contributions
 mate-0.004
Ch-0.004
hetic-0.004
 chick-0.003
use-0.003
Token pan
Token ID3425
Feature activation-5.74e-03

Pos logit contributions
uct+0.008
 tears+0.007
case+0.006
 John+0.006
 Tears+0.006
Neg logit contributions
 mate-0.014
hetic-0.012
Ch-0.012
use-0.012
Gr-0.012
Token with
Token ID351
Feature activation+6.22e-03

Pos logit contributions
 mate+0.011
hetic+0.010
Ch+0.010
use+0.010
king+0.010
Neg logit contributions
uct-0.007
colm-0.006
 John-0.006
 tears-0.006
 Tommy-0.006
Token her
Token ID607
Feature activation+2.73e-02

Pos logit contributions
uct+0.008
colm+0.006
 John+0.006
 tears+0.006
 Tommy+0.006
Neg logit contributions
 mate-0.013
hetic-0.011
Ch-0.011
use-0.011
Cow-0.011
Token heel
Token ID24703
Feature activation+9.69e-03

Pos logit contributions
uct+0.038
colm+0.031
 John+0.030
 Tommy+0.029
 tears+0.029
Neg logit contributions
 mate-0.052
hetic-0.046
Ch-0.046
use-0.045
be-0.044
Token and
Token ID290
Feature activation+9.56e-03

Pos logit contributions
uct+0.014
colm+0.012
 John+0.011
 Tommy+0.011
 tears+0.011
Neg logit contributions
 mate-0.017
hetic-0.016
Ch-0.016
be-0.015
king-0.015
Token it
Token ID340
Feature activation+6.00e-03

Pos logit contributions
uct+0.014
colm+0.013
case+0.012
 John+0.011
 Jerry+0.011
Neg logit contributions
 mate-0.016
hetic-0.015
Ch-0.014
 chick-0.014
 longer-0.014
Token if
Token ID611
Feature activation-7.93e-03

Pos logit contributions
uct+0.006
 John+0.005
case+0.005
 Tommy+0.005
 tears+0.005
Neg logit contributions
 mate-0.009
hetic-0.008
Ch-0.008
Gr-0.008
Cow-0.008
Token you
Token ID345
Feature activation-7.06e-03

Pos logit contributions
 mate+0.013
hetic+0.012
Ch+0.011
use+0.011
Cow+0.011
Neg logit contributions
uct-0.013
 John-0.011
colm-0.011
 tears-0.011
 Tommy-0.011
Token want
Token ID765
Feature activation+4.15e-03

Pos logit contributions
 mate+0.011
hetic+0.010
Ch+0.010
Gr+0.009
 chick+0.009
Neg logit contributions
uct-0.012
 Tommy-0.010
case-0.010
 John-0.010
 tears-0.010
Token,"
Token ID553
Feature activation+4.87e-03

Pos logit contributions
uct+0.006
 John+0.005
 Tommy+0.005
 tears+0.005
 Jerry+0.005
Neg logit contributions
 mate-0.008
hetic-0.007
Ch-0.007
king-0.007
oise-0.007
Token Ben
Token ID3932
Feature activation-4.28e-03

Pos logit contributions
uct+0.005
 Tommy+0.005
 John+0.005
 tears+0.004
colm+0.004
Neg logit contributions
 mate-0.011
hetic-0.010
Ch-0.009
use-0.009
oise-0.009
Token said
Token ID531
Feature activation-4.53e-03

Pos logit contributions
 mate+0.009
Ch+0.008
hetic+0.008
use+0.008
be+0.008
Neg logit contributions
uct-0.005
colm-0.004
 John-0.004
 Tears-0.004
case-0.004
Token.
Token ID13
Feature activation+5.17e-04

Pos logit contributions
 mate+0.009
Ch+0.008
hetic+0.008
use+0.008
Cow+0.008
Neg logit contributions
uct-0.006
 tears-0.005
 John-0.005
 Tommy-0.005
colm-0.005
Token\n
Token ID198
Feature activation-2.89e-04

Pos logit contributions
uct+0.001
colm+0.001
 John+0.001
 tears+0.001
case+0.001
Neg logit contributions
 mate-0.001
Ch-0.001
Cow-0.001
Gr-0.001
hetic-0.001
Token\n
Token ID198
Feature activation-2.49e-03

Pos logit contributions
 mate+0.001
Ch+0.001
hetic+0.001
use+0.001
Cow+0.001
Neg logit contributions
uct-0.000
 John-0.000
colm-0.000
 tears-0.000
 Tears-0.000
Token"
Token ID1
Feature activation-6.81e-03

Pos logit contributions
 mate+0.004
hetic+0.004
use+0.004
cient+0.004
 peac+0.004
Neg logit contributions
uct-0.004
colm-0.003
 Tommy-0.003
 tears-0.003
case-0.003
TokenSorry
Token ID14385
Feature activation+2.36e-03

Pos logit contributions
 mate+0.012
hetic+0.010
use+0.010
ppy+0.010
 crew+0.010
Neg logit contributions
uct-0.011
 Tommy-0.009
colm-0.009
 John-0.009
 tears-0.009
Token.
Token ID13
Feature activation-9.73e-03

Pos logit contributions
uct+0.013
 John+0.011
colm+0.010
 tears+0.010
 Tommy+0.010
Neg logit contributions
 mate-0.018
hetic-0.017
Ch-0.017
use-0.016
Cow-0.016
Token He
Token ID679
Feature activation-1.79e-02

Pos logit contributions
 mate+0.023
hetic+0.021
Ch+0.021
use+0.021
king+0.021
Neg logit contributions
uct-0.008
 John-0.007
 Tommy-0.006
colm-0.006
 Tears-0.006
Token will
Token ID481
Feature activation-1.57e-02

Pos logit contributions
 mate+0.035
Ch+0.031
hetic+0.031
king+0.030
Cow+0.030
Neg logit contributions
uct-0.023
 tears-0.019
colm-0.019
 John-0.019
 Tommy-0.018
Token be
Token ID307
Feature activation-2.67e-02

Pos logit contributions
hetic+0.026
 mate+0.025
oise+0.025
king+0.025
mer+0.024
Neg logit contributions
uct-0.021
 tears-0.021
 Tommy-0.021
 John-0.020
case-0.019
Token gentle
Token ID10296
Feature activation-1.64e-02

Pos logit contributions
 mate+0.044
hetic+0.039
Ch+0.038
use+0.037
king+0.037
Neg logit contributions
uct-0.043
colm-0.037
 John-0.036
case-0.036
 tears-0.035
Token and
Token ID290
Feature activation-3.03e-02

Pos logit contributions
 mate+0.039
hetic+0.037
Ch+0.036
use+0.036
Cow+0.036
Neg logit contributions
uct-0.012
 John-0.011
 Tommy-0.010
 tears-0.010
case-0.010
Token kind
Token ID1611
Feature activation-1.49e-02

Pos logit contributions
 mate+0.052
Ch+0.048
hetic+0.048
oise+0.047
king+0.046
Neg logit contributions
uct-0.044
 tears-0.040
 Tommy-0.038
 John-0.037
colm-0.035
Token.
Token ID13
Feature activation-5.74e-03

Pos logit contributions
 mate+0.029
hetic+0.027
Ch+0.027
Cow+0.026
Gr+0.026
Neg logit contributions
uct-0.017
 John-0.016
 Tommy-0.015
 tears-0.015
colm-0.014
Token And
Token ID843
Feature activation-2.78e-02

Pos logit contributions
 mate+0.015
hetic+0.014
oise+0.014
king+0.014
use+0.014
Neg logit contributions
uct-0.003
 John-0.003
 Tommy-0.003
colm-0.002
 Tears-0.002
Token the
Token ID262
Feature activation-1.83e-02

Pos logit contributions
 mate+0.047
hetic+0.043
Ch+0.042
use+0.041
king+0.040
Neg logit contributions
uct-0.042
 John-0.036
colm-0.036
 tears-0.036
 Tommy-0.035
Token hospital
Token ID4436
Feature activation-1.37e-02

Pos logit contributions
 mate+0.029
ppy+0.027
king+0.027
hetic+0.026
use+0.026
Neg logit contributions
 tears-0.028
uct-0.027
 Tommy-0.025
case-0.025
 Tears-0.024
Token want
Token ID765
Feature activation-1.01e-02

Pos logit contributions
Ch+0.022
hetic+0.021
 mate+0.020
king+0.020
Gr+0.020
Neg logit contributions
uct-0.024
 Tommy-0.023
 John-0.022
 Jerry-0.022
 tears-0.022
Token to
Token ID284
Feature activation-3.57e-03

Pos logit contributions
hetic+0.022
king+0.021
oise+0.021
mer+0.021
Ch+0.021
Neg logit contributions
 John-0.009
 Tommy-0.008
 Jerry-0.008
 tears-0.008
 presents-0.008
Token join
Token ID4654
Feature activation-2.34e-02

Pos logit contributions
hetic+0.005
Ch+0.005
king+0.005
 mate+0.005
 chick+0.005
Neg logit contributions
uct-0.006
 Tommy-0.005
 John-0.005
case-0.005
 Jerry-0.005
Token me
Token ID502
Feature activation-1.60e-02

Pos logit contributions
hetic+0.049
 mate+0.048
Ch+0.045
king+0.045
use+0.045
Neg logit contributions
uct-0.023
 John-0.021
 Tommy-0.021
 tears-0.020
 Jerry-0.019
Token in
Token ID287
Feature activation-4.98e-04

Pos logit contributions
hetic+0.034
Ch+0.032
 mate+0.032
Gr+0.031
Pr+0.031
Neg logit contributions
uct-0.014
case-0.014
 John-0.014
 Tommy-0.013
 Jerry-0.012
Token my
Token ID616
Feature activation-3.01e-02

Pos logit contributions
 mate+0.001
hetic+0.001
king+0.001
Ch+0.001
ppy+0.001
Neg logit contributions
uct-0.000
 John-0.000
 Tommy-0.000
 tears-0.000
 Jerry-0.000
Token spaceship
Token ID40663
Feature activation-1.80e-02

Pos logit contributions
 mate+0.048
hetic+0.046
Ch+0.045
king+0.045
Cow+0.043
Neg logit contributions
uct-0.044
 tears-0.041
 John-0.040
colm-0.040
case-0.040
Token?"
Token ID1701
Feature activation+4.17e-03

Pos logit contributions
hetic+0.038
Ch+0.036
Pr+0.036
 mate+0.035
Gr+0.035
Neg logit contributions
uct-0.018
case-0.017
 John-0.015
 Tommy-0.015
 Jerry-0.014
Token Tom
Token ID4186
Feature activation+8.78e-03

Pos logit contributions
uct+0.005
 tears+0.005
colm+0.005
case+0.004
 ey+0.004
Neg logit contributions
 mate-0.007
Ch-0.007
Cow-0.007
 someday-0.007
Some-0.007
Token said
Token ID531
Feature activation-3.06e-03

Pos logit contributions
uct+0.011
colm+0.009
 tears+0.009
 Tears+0.009
 John+0.009
Neg logit contributions
 mate-0.017
Ch-0.016
ppy-0.015
hetic-0.015
use-0.015
Token,
Token ID11
Feature activation+9.23e-04

Pos logit contributions
 mate+0.007
use+0.007
oise+0.007
 engineers+0.007
 conclusions+0.007
Neg logit contributions
 John-0.003
uct-0.002
 Tommy-0.002
 tears-0.002
 Jerry-0.002
Token.
Token ID13
Feature activation+8.27e-04

Pos logit contributions
 mate+0.021
 longer+0.018
Ch+0.018
hetic+0.017
 engineers+0.017
Neg logit contributions
uct-0.023
colm-0.019
case-0.018
 Tears-0.017
 tears-0.017
Token Max
Token ID5436
Feature activation-1.90e-03

Pos logit contributions
uct+0.001
colm+0.001
 tears+0.001
 ey+0.001
 Lib+0.001
Neg logit contributions
 mate-0.001
Ch-0.001
hetic-0.001
Cow-0.001
 engineers-0.001
Token could
Token ID714
Feature activation+9.16e-03

Pos logit contributions
 mate+0.003
Ch+0.003
be+0.003
hetic+0.003
ppy+0.003
Neg logit contributions
uct-0.003
colm-0.002
 Tears-0.002
 tears-0.002
 Tommy-0.002
Token jump
Token ID4391
Feature activation-6.13e-03

Pos logit contributions
uct+0.017
colm+0.015
 John+0.015
 tears+0.014
 Tommy+0.014
Neg logit contributions
 mate-0.013
hetic-0.011
Ch-0.011
use-0.011
be-0.010
Token very
Token ID845
Feature activation-8.94e-03

Pos logit contributions
 mate+0.010
Ch+0.009
 longer+0.009
ch+0.009
king+0.009
Neg logit contributions
uct-0.010
colm-0.008
 Tears-0.008
 tears-0.007
 John-0.007
Token far
Token ID1290
Feature activation-3.31e-02

Pos logit contributions
 mate+0.016
hetic+0.015
Ch+0.015
use+0.014
king+0.014
Neg logit contributions
uct-0.013
colm-0.011
 John-0.010
 tears-0.010
 Tommy-0.010
Token,
Token ID11
Feature activation-8.13e-03

Pos logit contributions
 mate+0.059
hetic+0.053
king+0.051
 longer+0.051
mer+0.051
Neg logit contributions
uct-0.051
colm-0.041
 Tears-0.040
case-0.039
 tears-0.039
Token but
Token ID475
Feature activation+1.34e-03

Pos logit contributions
 mate+0.015
hetic+0.014
Ch+0.013
use+0.013
king+0.013
Neg logit contributions
uct-0.011
colm-0.009
 John-0.009
 tears-0.009
 Tommy-0.009
Token Lisa
Token ID15378
Feature activation+6.38e-03

Pos logit contributions
uct+0.002
colm+0.001
 tears+0.001
 John+0.001
 Tears+0.001
Neg logit contributions
 mate-0.003
hetic-0.002
Ch-0.002
use-0.002
 crew-0.002
Token could
Token ID714
Feature activation+1.67e-02

Pos logit contributions
uct+0.009
 Tears+0.008
colm+0.008
 John+0.008
 tears+0.008
Neg logit contributions
 mate-0.011
Ch-0.010
ppy-0.010
be-0.010
hetic-0.010
Token jump
Token ID4391
Feature activation-1.14e-03

Pos logit contributions
uct+0.027
colm+0.023
 John+0.022
 Tears+0.022
 tears+0.022
Neg logit contributions
 mate-0.028
hetic-0.024
Ch-0.024
use-0.023
 spy-0.023
Token time
Token ID640
Feature activation+6.70e-03

Pos logit contributions
 mate+0.019
Ch+0.017
hetic+0.017
use+0.017
ppy+0.017
Neg logit contributions
uct-0.009
colm-0.007
 John-0.007
 tears-0.007
 Tears-0.007
Token there
Token ID612
Feature activation+1.60e-03

Pos logit contributions
uct+0.012
colm+0.010
 tears+0.010
 Tears+0.009
case+0.009
Neg logit contributions
 mate-0.011
Ch-0.009
hetic-0.009
use-0.009
be-0.009
Token was
Token ID373
Feature activation-9.23e-03

Pos logit contributions
uct+0.003
colm+0.002
 tears+0.002
 Jerry+0.002
 Tears+0.002
Neg logit contributions
 mate-0.002
Ch-0.002
be-0.002
Qu-0.002
 someday-0.002
Token a
Token ID257
Feature activation+6.41e-03

Pos logit contributions
 mate+0.017
Ch+0.015
hetic+0.014
 chick+0.014
Gr+0.014
Neg logit contributions
uct-0.014
colm-0.012
 tears-0.011
 Tears-0.011
 John-0.011
Token small
Token ID1402
Feature activation-3.51e-03

Pos logit contributions
uct+0.006
 Tommy+0.006
colm+0.005
 John+0.005
 tears+0.005
Neg logit contributions
 mate-0.014
use-0.013
hetic-0.013
Ch-0.012
king-0.012
Token girl
Token ID2576
Feature activation-3.09e-02

Pos logit contributions
 mate+0.006
hetic+0.005
Ch+0.005
use+0.005
king+0.005
Neg logit contributions
uct-0.005
 John-0.005
 Tommy-0.005
colm-0.005
 tears-0.005
Token called
Token ID1444
Feature activation-2.71e-02

Pos logit contributions
 mate+0.055
hetic+0.050
urnal+0.050
king+0.048
use+0.048
Neg logit contributions
 Tommy-0.039
 Tom-0.038
 John-0.038
uct-0.038
 Sue-0.036
Token Lily
Token ID20037
Feature activation-1.58e-02

Pos logit contributions
 mate+0.064
hetic+0.062
king+0.061
use+0.060
be+0.060
Neg logit contributions
uct-0.020
 John-0.019
 Tommy-0.018
 Jerry-0.017
 Tom-0.015
Token who
Token ID508
Feature activation-2.82e-02

Pos logit contributions
 mate+0.032
hetic+0.029
use+0.029
Cow+0.028
Ch+0.028
Neg logit contributions
uct-0.019
 John-0.016
 Tommy-0.016
colm-0.016
 tears-0.015
Token was
Token ID373
Feature activation-7.54e-03

Pos logit contributions
 mate+0.045
Ch+0.041
hetic+0.040
use+0.039
Cow+0.038
Neg logit contributions
uct-0.047
 John-0.039
colm-0.039
 tears-0.039
 Tommy-0.038
Token very
Token ID845
Feature activation-2.18e-03

Pos logit contributions
 mate+0.017
use+0.016
hetic+0.016
 conclusions+0.015
Ch+0.015
Neg logit contributions
uct-0.007
 Tommy-0.006
 John-0.006
 tears-0.006
 Tears-0.006
Tokenmy
Token ID1820
Feature activation-1.15e-03

Pos logit contributions
uct+0.045
 Lib+0.039
 John+0.039
case+0.038
 Tears+0.038
Neg logit contributions
 mate-0.027
ppy-0.027
be-0.024
oise-0.024
use-0.024
Token's
Token ID338
Feature activation+1.12e-03

Pos logit contributions
 mate+0.002
hetic+0.002
use+0.001
be+0.001
ppy+0.001
Neg logit contributions
uct-0.002
 John-0.002
colm-0.002
 Tears-0.002
 Jerry-0.002
Token mom
Token ID1995
Feature activation-3.16e-03

Pos logit contributions
uct+0.002
 Lib+0.002
colm+0.002
 John+0.002
 Jerry+0.002
Neg logit contributions
 mate-0.002
 spy-0.002
 crew-0.001
 leader-0.001
 sheep-0.001
Token asked
Token ID1965
Feature activation-4.28e-02

Pos logit contributions
 mate+0.005
hetic+0.004
use+0.004
Ch+0.004
be+0.004
Neg logit contributions
uct-0.005
colm-0.005
 Tears-0.005
 John-0.005
 tears-0.004
Token him
Token ID683
Feature activation-3.19e-02

Pos logit contributions
 mate+0.080
hetic+0.071
Ch+0.070
use+0.069
Cow+0.067
Neg logit contributions
uct-0.061
colm-0.050
 John-0.050
 tears-0.049
 Tommy-0.048
Token to
Token ID284
Feature activation-3.23e-02

Pos logit contributions
 mate+0.056
Ch+0.054
hetic+0.053
use+0.051
be+0.051
Neg logit contributions
uct-0.043
 John-0.040
 Tommy-0.039
 Jerry-0.038
 tears-0.037
Token clean
Token ID3424
Feature activation-1.03e-03

Pos logit contributions
 mate+0.041
hetic+0.034
Ch+0.034
use+0.033
Cow+0.032
Neg logit contributions
uct-0.064
colm-0.057
 John-0.056
 tears-0.055
 Tears-0.055
Token his
Token ID465
Feature activation-1.65e-02

Pos logit contributions
 mate+0.002
 sheep+0.002
 chick+0.002
 longer+0.002
hetic+0.002
Neg logit contributions
uct-0.001
case-0.001
colm-0.001
 Tears-0.001
 Lib-0.001
Token room
Token ID2119
Feature activation-8.29e-04

Pos logit contributions
 mate+0.031
 clinic+0.028
 chick+0.028
 spy+0.028
 crew+0.027
Neg logit contributions
uct-0.022
 John-0.018
 Tommy-0.017
colm-0.017
 Robert-0.017
Token.
Token ID13
Feature activation+1.45e-03

Pos logit contributions
 mate+0.001
hetic+0.001
Ch+0.001
use+0.001
king+0.001
Neg logit contributions
uct-0.001
colm-0.001
 John-0.001
 tears-0.001
 Tommy-0.001
Token Tim
Token ID5045
Feature activation+3.49e-02

Pos logit contributions
uct+0.002
colm+0.001
 tears+0.001
case+0.001
 Tears+0.001
Neg logit contributions
 mate-0.003
Ch-0.003
hetic-0.003
Cow-0.003
use-0.003