TOP ACTIVATIONS
MAX = 0.052

This taught her an important lesson . She learned
to explain that it was an accident , but his mom
, " Let 's play an adventure game ". Kate was
Tim found a magic f auc et in his yard .
naughty people had taught everyone an important lesson . Even though
my 's mom showed him an empty box . She said
and happy . It learned an important lesson that day ,
okay , but he learned an important lesson . He needed
saw her and gave her an orange umbrella to keep her
Henry 's mom gave him an cookies as a snack .
each other , Jake smiled an enthusiastic smile . Clara was
Next , Daddy taught Jane an action that she had to
beautiful and it gl owed an even brighter purple than before
saw a little bird singing an ancient song . The ant
mum told her it was an earthquake .
back and gave the bird an open box . Inside the
The bunny taught us an important lesson that day
she saw a beautiful f auc et .
. He was wearing his brand new icy blue jacket as
's mom gave them a brand new basketball and they played

MIN ACTIVATIONS
MIN = -0.076

could even turn flip - fl ops ! Everyone was very
had been decorated with snow fl akes . Thomas couldn 't
stretch out and enormous sun fl owers would show their bright
, roses , and sun fl owers . They waved to
me adow full of wild fl owers . He spun around
. They were called ost ric hes . Lily and Ben
girl who didn 't like av oc ados . She always
to send soft white snow fl akes into the sky .
a garden full of wild fl owers . He decided to
back with lots of sun fl owers to fill the prince
who liked to collect or nam ents . Everywhere she went
John chose some carrots and cuc umbers to take home .
<|endoftext|> L ily loved sun fl owers . She liked to
tree could see lots of av oc ados that the supplier
chains , glitter y snow fl akes , and a picture
for some funny little sun fl owers . Her m ama
da is ies , sun fl owers , and tul ips
a big field of sun fl owers . The sun flower
big cage with lots of gor illas inside !"
lots of trees and wild fl owers . As he stepped

INTERVAL 0.020 - 0.052
CONTAINS 1.113%

she says . " Someone must have left them here .
uddle , making a big mess . They did not see
The family spent the next week playing games , lou
. " L ily , can I play
loved his new toy the most . He couldn 't tell

INTERVAL -0.012 - 0.020
CONTAINS 91.705%

. They had a big drum that made loud noises when
passed , and Benny went back to playing in the sunshine
the park . They looked up at the sky and saw
Lily . One day , Lily 's mom my made some
she stumbled around the park until the sun started to go

INTERVAL -0.044 - -0.012
CONTAINS 7.179%

ran around the room , laughing and smiling . They had
the van and wanted to count how many things were inside
the birds would love to build a nest with it !
It saw green trees , yellow flowers , and blue birds
! It was done . The spark les

INTERVAL -0.076 - -0.044
CONTAINS 0.003%

She saw lots of snow fl akes , glitter ing in
working and May kept on pe eking . Soon the bird
, like lions , gir aff es , and elephants .
a stream , some wild fl owers and a huge old
house . They delivered more rad ishes and made more people
Token
Token ID198
Feature activation+6.88e-03

Pos logit contributions
 heights+0.019
 hesitation+0.019
used+0.018
 competitors+0.017
joy+0.017
Neg logit contributions
grand-0.007
 lived-0.006
L-0.006
 woke-0.005
tr-0.005
Token
Token ID198
Feature activation-4.81e-03

Pos logit contributions
grand+0.012
tr+0.011
 hyd+0.010
 ash+0.010
oc+0.010
Neg logit contributions
used-0.011
 heights-0.011
 practicing-0.011
 hesitation-0.011
joy-0.010
TokenThis
Token ID1212
Feature activation-4.93e-03

Pos logit contributions
 heights+0.008
used+0.008
 practicing+0.008
ths+0.007
 competitors+0.007
Neg logit contributions
grand-0.008
rost-0.008
 woke-0.007
tr-0.007
 lived-0.007
Token taught
Token ID7817
Feature activation+4.20e-03

Pos logit contributions
 heights+0.011
 hesitation+0.011
used+0.010
 practicing+0.010
 attempts+0.010
Neg logit contributions
grand-0.007
L-0.006
Twe-0.006
rost-0.006
oc-0.006
Token her
Token ID607
Feature activation+1.11e-02

Pos logit contributions
grand+0.006
 lived+0.006
L+0.005
 woke+0.005
 was+0.005
Neg logit contributions
 heights-0.009
used-0.009
 hesitation-0.009
joy-0.008
 competitors-0.008
Token an
Token ID281
Feature activation+4.83e-02

Pos logit contributions
grand+0.019
 lived+0.018
 woke+0.017
L+0.016
oc+0.016
Neg logit contributions
 heights-0.023
used-0.022
 hesitation-0.021
 competitors-0.019
joy-0.019
Token important
Token ID1593
Feature activation+2.30e-02

Pos logit contributions
grand+0.069
 woke+0.065
 lived+0.064
L+0.061
oc+0.058
Neg logit contributions
 heights-0.108
 hesitation-0.098
used-0.096
 competitors-0.094
 practicing-0.091
Token lesson
Token ID11483
Feature activation+4.96e-03

Pos logit contributions
grand+0.026
 lived+0.025
 woke+0.022
L+0.022
oc+0.020
Neg logit contributions
 heights-0.057
 hesitation-0.053
used-0.052
 competitors-0.051
 practicing-0.050
Token.
Token ID13
Feature activation+2.70e-04

Pos logit contributions
grand+0.008
 lived+0.007
 woke+0.007
L+0.006
oc+0.006
Neg logit contributions
 heights-0.011
used-0.010
 hesitation-0.010
 competitors-0.009
joy-0.009
Token She
Token ID1375
Feature activation-2.23e-03

Pos logit contributions
grand+0.000
 lived+0.000
 woke+0.000
L+0.000
oc+0.000
Neg logit contributions
 heights-0.001
 hesitation-0.001
used-0.001
 competitors-0.001
joy-0.001
Token learned
Token ID4499
Feature activation+1.10e-03

Pos logit contributions
 heights+0.006
used+0.005
 hesitation+0.005
 competitors+0.005
 practicing+0.005
Neg logit contributions
grand-0.003
 lived-0.002
 woke-0.002
L-0.002
oc-0.002
Token to
Token ID284
Feature activation+1.59e-02

Pos logit contributions
grand+0.005
 lived+0.005
 woke+0.004
L+0.004
oc+0.004
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
 competitors-0.005
joy-0.005
Token explain
Token ID4727
Feature activation+3.66e-03

Pos logit contributions
grand+0.024
 lived+0.022
oc+0.020
 woke+0.019
tr+0.019
Neg logit contributions
 heights-0.036
 hesitation-0.034
used-0.034
 practicing-0.031
 competitors-0.031
Token that
Token ID326
Feature activation+1.05e-02

Pos logit contributions
grand+0.006
 lived+0.005
 woke+0.005
L+0.005
oc+0.005
Neg logit contributions
 heights-0.008
used-0.008
 hesitation-0.007
 competitors-0.007
joy-0.007
Token it
Token ID340
Feature activation+3.95e-03

Pos logit contributions
grand+0.018
 lived+0.017
 woke+0.016
L+0.015
oc+0.015
Neg logit contributions
 heights-0.021
used-0.020
 hesitation-0.020
 competitors-0.018
joy-0.018
Token was
Token ID373
Feature activation+7.37e-03

Pos logit contributions
grand+0.004
 lived+0.003
 woke+0.003
L+0.003
oc+0.003
Neg logit contributions
 heights-0.011
used-0.010
 hesitation-0.010
 competitors-0.010
 practicing-0.010
Token an
Token ID281
Feature activation+4.79e-02

Pos logit contributions
grand+0.013
 lived+0.011
 woke+0.011
oc+0.011
L+0.010
Neg logit contributions
 heights-0.015
 hesitation-0.014
used-0.014
 practicing-0.013
joy-0.013
Token accident
Token ID5778
Feature activation+1.94e-02

Pos logit contributions
grand+0.081
 lived+0.080
L+0.077
 woke+0.076
|+0.073
Neg logit contributions
 heights-0.092
 hesitation-0.087
iously-0.083
used-0.082
 muscles-0.080
Token,
Token ID11
Feature activation+7.32e-03

Pos logit contributions
grand+0.027
 lived+0.025
 woke+0.025
L+0.023
tr+0.022
Neg logit contributions
 heights-0.046
used-0.044
 hesitation-0.043
joy-0.040
 competitors-0.039
Token but
Token ID475
Feature activation+1.50e-02

Pos logit contributions
grand+0.015
 lived+0.014
 woke+0.013
oc+0.013
L+0.013
Neg logit contributions
 heights-0.013
 hesitation-0.012
used-0.012
 practicing-0.010
joy-0.010
Token his
Token ID465
Feature activation+6.77e-03

Pos logit contributions
grand+0.026
 lived+0.023
oc+0.022
 woke+0.022
L+0.021
Neg logit contributions
 heights-0.030
used-0.028
 hesitation-0.028
 practicing-0.026
joy-0.025
Token mom
Token ID1995
Feature activation+1.84e-02

Pos logit contributions
grand+0.011
 lived+0.011
 woke+0.010
L+0.010
tr+0.009
Neg logit contributions
 heights-0.014
used-0.013
 hesitation-0.013
 competitors-0.012
joy-0.012
Token,
Token ID11
Feature activation+1.28e-03

Pos logit contributions
 heights+0.027
used+0.025
 competitors+0.024
 hesitation+0.023
 runners+0.023
Neg logit contributions
 lived-0.013
grand-0.013
 woke-0.012
 was-0.012
oc-0.011
Token "
Token ID366
Feature activation+3.87e-03

Pos logit contributions
 lived+0.002
grand+0.001
 woke+0.001
L+0.001
 was+0.001
Neg logit contributions
 heights-0.003
used-0.003
 hesitation-0.003
 competitors-0.003
 breaths-0.003
TokenLet
Token ID5756
Feature activation-2.58e-03

Pos logit contributions
grand+0.006
 lived+0.006
ork+0.006
 woke+0.005
 report+0.005
Neg logit contributions
used-0.007
 heights-0.007
 competitors-0.006
joy-0.006
yles-0.006
Token's
Token ID338
Feature activation+1.20e-02

Pos logit contributions
 heights+0.006
used+0.006
 hesitation+0.006
joy+0.005
 competitors+0.005
Neg logit contributions
grand-0.004
 lived-0.003
 woke-0.003
L-0.003
oc-0.003
Token play
Token ID711
Feature activation+8.13e-03

Pos logit contributions
grand+0.014
 lived+0.013
 woke+0.012
L+0.011
tr+0.011
Neg logit contributions
 heights-0.031
 hesitation-0.029
used-0.029
joy-0.028
 competitors-0.027
Token an
Token ID281
Feature activation+4.72e-02

Pos logit contributions
grand+0.014
 lived+0.012
 woke+0.012
L+0.011
tr+0.011
Neg logit contributions
 heights-0.017
used-0.015
 hesitation-0.015
 competitors-0.014
 practicing-0.014
Token adventure
Token ID8855
Feature activation+1.29e-02

Pos logit contributions
grand+0.085
 woke+0.083
 lived+0.081
ork+0.077
L+0.077
Neg logit contributions
 heights-0.086
 hesitation-0.075
used-0.075
 practicing-0.072
 competitors-0.070
Token game
Token ID983
Feature activation+1.87e-03

Pos logit contributions
grand+0.018
 lived+0.018
 woke+0.016
L+0.016
 was+0.015
Neg logit contributions
 hesitation-0.029
 heights-0.029
used-0.028
joy-0.026
 attempts-0.025
Token".
Token ID1911
Feature activation-1.21e-03

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.002
L+0.002
tr+0.002
Neg logit contributions
 heights-0.004
 hesitation-0.004
used-0.004
joy-0.004
iously-0.004
Token Kate
Token ID16693
Feature activation+2.20e-03

Pos logit contributions
 heights+0.002
used+0.002
 hesitation+0.002
 competitors+0.002
joy+0.002
Neg logit contributions
grand-0.002
 lived-0.002
 woke-0.002
L-0.002
oc-0.002
Token was
Token ID373
Feature activation-4.14e-03

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.002
L+0.002
oc+0.002
Neg logit contributions
 heights-0.005
used-0.005
 hesitation-0.005
 competitors-0.005
 practicing-0.005
Token Tim
Token ID5045
Feature activation+1.90e-02

Pos logit contributions
grand+0.040
ork+0.032
oc+0.032
rim+0.032
L+0.031
Neg logit contributions
used-0.039
 heights-0.037
 hesitation-0.035
 muscles-0.034
 toes-0.033
Token found
Token ID1043
Feature activation+3.73e-03

Pos logit contributions
grand+0.033
ork+0.032
tr+0.031
rost+0.029
rim+0.029
Neg logit contributions
used-0.029
 practicing-0.027
 heights-0.026
 muscles-0.025
joy-0.024
Token a
Token ID257
Feature activation+4.97e-03

Pos logit contributions
grand+0.008
 lived+0.007
oc+0.007
 woke+0.006
tr+0.006
Neg logit contributions
 heights-0.006
 hesitation-0.006
used-0.006
 practicing-0.005
joy-0.005
Token magic
Token ID5536
Feature activation+1.36e-02

Pos logit contributions
grand+0.009
 lived+0.008
oc+0.007
|+0.007
L+0.007
Neg logit contributions
 heights-0.009
 hesitation-0.009
joy-0.008
used-0.008
 practicing-0.008
Token f
Token ID277
Feature activation+1.45e-02

Pos logit contributions
grand+0.024
 lived+0.021
L+0.021
 woke+0.020
oc+0.020
Neg logit contributions
 heights-0.027
 hesitation-0.025
used-0.024
 practicing-0.022
joy-0.022
Tokenauc
Token ID14272
Feature activation+4.67e-02

Pos logit contributions
grand+0.013
 lived+0.012
 woke+0.011
oc+0.010
L+0.010
Neg logit contributions
 heights-0.040
used-0.039
 hesitation-0.038
 competitors-0.036
joy-0.035
Tokenet
Token ID316
Feature activation+6.26e-03

Pos logit contributions
grand+0.101
ork+0.087
 CA+0.087
jar+0.087
 lived+0.086
Neg logit contributions
 heights-0.066
 breaths-0.060
used-0.052
ipping-0.052
iously-0.050
Token in
Token ID287
Feature activation+7.59e-03

Pos logit contributions
grand+0.011
 lived+0.009
L+0.009
 woke+0.009
oc+0.009
Neg logit contributions
 heights-0.013
used-0.012
 hesitation-0.012
 practicing-0.011
 competitors-0.011
Token his
Token ID465
Feature activation+4.73e-03

Pos logit contributions
grand+0.011
 lived+0.010
tr+0.010
 woke+0.010
oc+0.009
Neg logit contributions
 heights-0.017
 hesitation-0.015
used-0.015
 muscles-0.014
 practicing-0.014
Token yard
Token ID12699
Feature activation+1.04e-02

Pos logit contributions
grand+0.006
 lived+0.006
 woke+0.005
L+0.005
oc+0.005
Neg logit contributions
 heights-0.011
used-0.010
 hesitation-0.010
 practicing-0.010
joy-0.009
Token.
Token ID13
Feature activation-8.32e-03

Pos logit contributions
grand+0.020
 woke+0.017
 lived+0.017
L+0.017
oc+0.017
Neg logit contributions
 heights-0.018
used-0.017
 hesitation-0.016
 practicing-0.015
joy-0.015
Token naughty
Token ID45128
Feature activation-8.09e-03

Pos logit contributions
 heights+0.013
 hesitation+0.013
 competitors+0.012
used+0.012
 runners+0.011
Neg logit contributions
grand-0.012
 lived-0.010
 woke-0.010
tr-0.010
L-0.010
Token people
Token ID661
Feature activation+1.86e-03

Pos logit contributions
 heights+0.015
 hesitation+0.014
used+0.014
 competitors+0.014
 runners+0.013
Neg logit contributions
grand-0.014
 lived-0.013
tr-0.013
L-0.013
 woke-0.012
Token had
Token ID550
Feature activation-5.76e-04

Pos logit contributions
grand+0.003
L+0.002
 lived+0.002
tr+0.002
 woke+0.002
Neg logit contributions
used-0.004
 heights-0.004
 hesitation-0.004
 competitors-0.004
 practicing-0.004
Token taught
Token ID7817
Feature activation-2.14e-03

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 competitors+0.001
joy+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
 was-0.001
Token everyone
Token ID2506
Feature activation+3.56e-03

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.005
joy+0.004
 competitors+0.004
Neg logit contributions
grand-0.003
 lived-0.003
 woke-0.003
L-0.002
oc-0.002
Token an
Token ID281
Feature activation+4.62e-02

Pos logit contributions
grand+0.005
 lived+0.005
 woke+0.005
L+0.005
 was+0.004
Neg logit contributions
 heights-0.008
used-0.008
 hesitation-0.007
 competitors-0.007
joy-0.007
Token important
Token ID1593
Feature activation+1.87e-02

Pos logit contributions
grand+0.067
 woke+0.065
 lived+0.064
L+0.064
tr+0.058
Neg logit contributions
 heights-0.100
 hesitation-0.093
 competitors-0.091
used-0.088
 runners-0.087
Token lesson
Token ID11483
Feature activation+3.06e-03

Pos logit contributions
grand+0.020
 lived+0.018
 woke+0.017
L+0.017
oc+0.015
Neg logit contributions
 heights-0.048
 hesitation-0.045
used-0.045
 competitors-0.043
joy-0.042
Token.
Token ID13
Feature activation-1.00e-03

Pos logit contributions
grand+0.005
 lived+0.004
 woke+0.004
L+0.004
oc+0.004
Neg logit contributions
 heights-0.007
used-0.006
 hesitation-0.006
 competitors-0.006
joy-0.006
Token Even
Token ID3412
Feature activation-1.28e-02

Pos logit contributions
 heights+0.002
used+0.002
 hesitation+0.002
 competitors+0.002
joy+0.002
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
oc-0.001
Token though
Token ID996
Feature activation-1.79e-02

Pos logit contributions
used+0.030
 heights+0.029
 hesitation+0.028
joy+0.027
iously+0.026
Neg logit contributions
 lived-0.017
grand-0.016
L-0.015
 woke-0.015
 was-0.014
Tokenmy
Token ID1820
Feature activation+1.40e-02

Pos logit contributions
grand+0.010
 lived+0.009
L+0.008
oc+0.008
 woke+0.008
Neg logit contributions
 heights-0.014
used-0.013
 hesitation-0.013
 practicing-0.012
 competitors-0.012
Token's
Token ID338
Feature activation+9.32e-03

Pos logit contributions
grand+0.022
 lived+0.019
L+0.018
tr+0.018
rost+0.017
Neg logit contributions
 heights-0.029
used-0.028
 hesitation-0.027
 practicing-0.026
joy-0.025
Token mom
Token ID1995
Feature activation+1.43e-02

Pos logit contributions
grand+0.017
 lived+0.016
L+0.015
 woke+0.015
oc+0.015
Neg logit contributions
 heights-0.018
 hesitation-0.016
used-0.016
joy-0.015
 competitors-0.015
Token showed
Token ID3751
Feature activation+8.19e-03

Pos logit contributions
grand+0.022
 lived+0.018
rost+0.018
ork+0.018
tr+0.018
Neg logit contributions
used-0.028
 hesitation-0.028
 practicing-0.027
 heights-0.027
joy-0.025
Token him
Token ID683
Feature activation+5.71e-03

Pos logit contributions
grand+0.016
 lived+0.015
tr+0.015
oc+0.014
 woke+0.014
Neg logit contributions
 heights-0.015
 hesitation-0.013
used-0.013
joy-0.011
 enough-0.011
Token an
Token ID281
Feature activation+4.53e-02

Pos logit contributions
grand+0.010
 lived+0.009
 woke+0.009
L+0.009
oc+0.008
Neg logit contributions
 heights-0.011
used-0.011
 hesitation-0.010
 competitors-0.010
joy-0.009
Token empty
Token ID6565
Feature activation+1.94e-02

Pos logit contributions
grand+0.080
 lived+0.076
L+0.076
 woke+0.074
|+0.073
Neg logit contributions
 heights-0.087
 hesitation-0.084
used-0.079
iously-0.074
 muscles-0.073
Token box
Token ID3091
Feature activation+1.34e-02

Pos logit contributions
grand+0.031
 lived+0.028
L+0.026
 woke+0.025
oc+0.025
Neg logit contributions
 heights-0.042
 hesitation-0.039
used-0.038
joy-0.036
 practicing-0.035
Token.
Token ID13
Feature activation+9.58e-03

Pos logit contributions
grand+0.023
 lived+0.022
oc+0.020
L+0.020
 woke+0.020
Neg logit contributions
 heights-0.025
 hesitation-0.024
used-0.024
 practicing-0.022
joy-0.022
Token She
Token ID1375
Feature activation+3.76e-03

Pos logit contributions
grand+0.016
 lived+0.014
 woke+0.014
oc+0.014
L+0.014
Neg logit contributions
 heights-0.019
used-0.018
 hesitation-0.016
joy-0.016
 muscles-0.016
Token said
Token ID531
Feature activation+7.35e-03

Pos logit contributions
grand+0.006
|+0.005
rost+0.005
oc+0.005
tr+0.005
Neg logit contributions
used-0.007
 heights-0.007
 hesitation-0.007
 practicing-0.007
ths-0.006
Token and
Token ID290
Feature activation-1.37e-03

Pos logit contributions
used+0.010
 heights+0.010
 hesitation+0.010
 competitors+0.009
 Majesty+0.009
Neg logit contributions
grand-0.004
 lived-0.004
 woke-0.003
L-0.003
 was-0.003
Token happy
Token ID3772
Feature activation-4.98e-03

Pos logit contributions
 heights+0.003
used+0.003
 hesitation+0.003
 practicing+0.003
 competitors+0.003
Neg logit contributions
grand-0.002
 lived-0.002
L-0.002
 woke-0.002
oc-0.002
Token.
Token ID13
Feature activation+4.51e-04

Pos logit contributions
used+0.013
 hesitation+0.013
 heights+0.013
 competitors+0.012
 runners+0.011
Neg logit contributions
 lived-0.005
grand-0.005
 woke-0.005
 was-0.005
L-0.004
Token It
Token ID632
Feature activation+4.08e-03

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.000
L+0.000
oc+0.000
Neg logit contributions
 heights-0.001
 hesitation-0.001
used-0.001
 competitors-0.001
joy-0.001
Token learned
Token ID4499
Feature activation+4.84e-03

Pos logit contributions
grand+0.005
rost+0.004
tr+0.004
oc+0.004
L+0.004
Neg logit contributions
 heights-0.010
used-0.009
 hesitation-0.009
 practicing-0.009
 competitors-0.008
Token an
Token ID281
Feature activation+4.50e-02

Pos logit contributions
grand+0.009
 lived+0.008
 woke+0.008
L+0.008
oc+0.008
Neg logit contributions
 heights-0.009
used-0.008
 hesitation-0.008
 practicing-0.007
 competitors-0.007
Token important
Token ID1593
Feature activation+2.25e-02

Pos logit contributions
grand+0.074
 woke+0.071
tr+0.068
 lived+0.068
L+0.067
Neg logit contributions
 heights-0.087
 hesitation-0.079
 competitors-0.075
 runners-0.072
iously-0.071
Token lesson
Token ID11483
Feature activation+6.00e-03

Pos logit contributions
grand+0.031
 lived+0.030
L+0.027
 woke+0.027
rim+0.027
Neg logit contributions
 heights-0.046
 hesitation-0.042
 practicing-0.041
 competitors-0.041
 attempts-0.040
Token that
Token ID326
Feature activation+2.19e-02

Pos logit contributions
grand+0.011
 lived+0.010
 woke+0.010
L+0.010
tr+0.009
Neg logit contributions
 heights-0.011
used-0.011
 hesitation-0.010
 competitors-0.009
 practicing-0.009
Token day
Token ID1110
Feature activation+1.48e-02

Pos logit contributions
grand+0.033
L+0.028
tr+0.028
ork+0.028
oc+0.027
Neg logit contributions
 heights-0.047
 hesitation-0.043
 practicing-0.043
used-0.042
 competitors-0.039
Token,
Token ID11
Feature activation-1.33e-05

Pos logit contributions
grand+0.019
 lived+0.019
 woke+0.017
L+0.016
oc+0.016
Neg logit contributions
 heights-0.035
used-0.033
 hesitation-0.033
 competitors-0.031
joy-0.030
Token okay
Token ID8788
Feature activation-1.17e-02

Pos logit contributions
 heights+0.016
used+0.015
 competitors+0.014
 hesitation+0.014
iously+0.014
Neg logit contributions
 lived-0.011
 woke-0.010
 named-0.010
L-0.009
grand-0.009
Token,
Token ID11
Feature activation-4.89e-03

Pos logit contributions
 heights+0.032
used+0.031
 competitors+0.030
 hesitation+0.029
 runners+0.028
Neg logit contributions
 woke-0.010
 lived-0.010
 was-0.009
 named-0.008
L-0.008
Token but
Token ID475
Feature activation+1.23e-02

Pos logit contributions
 heights+0.011
used+0.011
 hesitation+0.011
joy+0.010
 practicing+0.010
Neg logit contributions
grand-0.007
 lived-0.006
L-0.006
 woke-0.006
tr-0.006
Token he
Token ID339
Feature activation+9.95e-04

Pos logit contributions
grand+0.022
 lived+0.020
 woke+0.019
L+0.019
oc+0.018
Neg logit contributions
 heights-0.024
used-0.023
 hesitation-0.023
 practicing-0.020
joy-0.020
Token learned
Token ID4499
Feature activation+5.30e-04

Pos logit contributions
grand+0.001
L+0.001
 lived+0.001
rost+0.001
tr+0.001
Neg logit contributions
 heights-0.002
used-0.002
 hesitation-0.002
 practicing-0.002
 competitors-0.002
Token an
Token ID281
Feature activation+4.45e-02

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
L+0.001
oc+0.001
Neg logit contributions
 heights-0.001
used-0.001
 hesitation-0.001
 competitors-0.001
joy-0.001
Token important
Token ID1593
Feature activation+2.12e-02

Pos logit contributions
grand+0.064
 lived+0.059
 woke+0.058
L+0.057
tr+0.055
Neg logit contributions
 heights-0.099
 hesitation-0.093
used-0.089
 competitors-0.086
iously-0.085
Token lesson
Token ID11483
Feature activation+5.36e-03

Pos logit contributions
grand+0.026
 lived+0.023
 woke+0.022
L+0.021
|+0.019
Neg logit contributions
 heights-0.051
 hesitation-0.048
used-0.047
 practicing-0.045
 competitors-0.045
Token.
Token ID13
Feature activation+8.41e-03

Pos logit contributions
grand+0.009
 lived+0.008
 woke+0.008
L+0.008
tr+0.008
Neg logit contributions
 heights-0.011
used-0.010
 hesitation-0.010
 competitors-0.009
 practicing-0.009
Token He
Token ID679
Feature activation+5.45e-03

Pos logit contributions
grand+0.011
 lived+0.010
 woke+0.009
L+0.009
oc+0.009
Neg logit contributions
 heights-0.021
used-0.020
 hesitation-0.020
 competitors-0.018
joy-0.018
Token needed
Token ID2622
Feature activation+1.64e-02

Pos logit contributions
grand+0.007
 lived+0.005
L+0.005
 woke+0.005
tr+0.005
Neg logit contributions
 heights-0.014
used-0.013
 hesitation-0.013
 practicing-0.012
 competitors-0.012
Token saw
Token ID2497
Feature activation+6.46e-03

Pos logit contributions
grand+0.025
 lived+0.021
oc+0.020
tr+0.020
ork+0.019
Neg logit contributions
 heights-0.037
used-0.037
 hesitation-0.036
 practicing-0.034
 competitors-0.032
Token her
Token ID607
Feature activation+1.15e-02

Pos logit contributions
grand+0.012
 lived+0.011
oc+0.011
tr+0.011
 woke+0.010
Neg logit contributions
 heights-0.012
 hesitation-0.011
used-0.011
 practicing-0.010
joy-0.010
Token and
Token ID290
Feature activation+2.06e-02

Pos logit contributions
grand+0.018
 lived+0.017
 woke+0.016
L+0.016
oc+0.015
Neg logit contributions
 heights-0.025
used-0.023
 hesitation-0.023
 competitors-0.021
joy-0.021
Token gave
Token ID2921
Feature activation+1.75e-02

Pos logit contributions
grand+0.028
 lived+0.026
 woke+0.024
L+0.023
oc+0.023
Neg logit contributions
 heights-0.048
used-0.046
 hesitation-0.046
 competitors-0.042
joy-0.041
Token her
Token ID607
Feature activation+5.85e-03

Pos logit contributions
grand+0.039
oc+0.036
 lived+0.033
tr+0.033
man+0.032
Neg logit contributions
 heights-0.027
 hesitation-0.026
used-0.025
 breaths-0.023
 practicing-0.022
Token an
Token ID281
Feature activation+4.44e-02

Pos logit contributions
grand+0.009
 lived+0.009
 woke+0.008
L+0.008
oc+0.008
Neg logit contributions
 heights-0.013
used-0.012
 hesitation-0.012
 competitors-0.011
joy-0.011
Token orange
Token ID10912
Feature activation+3.86e-03

Pos logit contributions
grand+0.095
oc+0.091
 lived+0.090
|+0.086
 woke+0.086
Neg logit contributions
 hesitation-0.066
 heights-0.066
used-0.058
 muscles-0.058
 breaths-0.055
Token umbrella
Token ID25510
Feature activation+1.98e-02

Pos logit contributions
grand+0.007
 lived+0.006
oc+0.006
L+0.005
|+0.005
Neg logit contributions
 heights-0.008
 hesitation-0.007
used-0.007
 muscles-0.007
 breaths-0.006
Token to
Token ID284
Feature activation+1.39e-02

Pos logit contributions
grand+0.038
 lived+0.036
oc+0.034
 woke+0.033
rost+0.033
Neg logit contributions
 heights-0.032
used-0.031
 hesitation-0.030
iously-0.027
 practicing-0.027
Token keep
Token ID1394
Feature activation+1.30e-02

Pos logit contributions
grand+0.026
oc+0.025
man+0.023
tr+0.021
ork+0.021
Neg logit contributions
 heights-0.023
 hesitation-0.023
used-0.020
 practicing-0.019
 muscles-0.018
Token her
Token ID607
Feature activation+6.57e-03

Pos logit contributions
grand+0.023
 lived+0.021
 woke+0.020
oc+0.020
L+0.020
Neg logit contributions
 heights-0.026
used-0.024
 hesitation-0.024
 practicing-0.022
joy-0.021
Token Henry
Token ID8616
Feature activation+2.18e-03

Pos logit contributions
 heights+0.013
used+0.012
 hesitation+0.012
 competitors+0.012
 Majesty+0.011
Neg logit contributions
 lived-0.007
 woke-0.007
 was-0.006
grand-0.006
L-0.006
Token's
Token ID338
Feature activation+9.05e-03

Pos logit contributions
grand+0.002
 lived+0.002
 woke+0.002
L+0.001
oc+0.001
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
 competitors-0.006
joy-0.005
Token mom
Token ID1995
Feature activation+1.85e-02

Pos logit contributions
 lived+0.012
grand+0.012
L+0.012
 woke+0.011
tr+0.011
Neg logit contributions
used-0.020
 heights-0.020
 hesitation-0.019
iously-0.018
acies-0.017
Token gave
Token ID2921
Feature activation+6.89e-03

Pos logit contributions
grand+0.024
 lived+0.021
oc+0.019
L+0.019
 woke+0.019
Neg logit contributions
 heights-0.043
used-0.041
 hesitation-0.041
 practicing-0.039
 competitors-0.038
Token him
Token ID683
Feature activation-3.72e-03

Pos logit contributions
 lived+0.006
 woke+0.006
grand+0.006
 was+0.005
L+0.005
Neg logit contributions
 heights-0.019
used-0.018
 hesitation-0.018
 competitors-0.017
 attempts-0.017
Token an
Token ID281
Feature activation+4.44e-02

Pos logit contributions
 heights+0.009
used+0.008
 hesitation+0.008
 competitors+0.007
joy+0.007
Neg logit contributions
grand-0.005
 lived-0.005
 woke-0.005
L-0.005
tr-0.004
Token cookies
Token ID14746
Feature activation+8.26e-04

Pos logit contributions
grand+0.077
 lived+0.071
L+0.070
 woke+0.069
oc+0.068
Neg logit contributions
 heights-0.086
 hesitation-0.083
used-0.082
 muscles-0.074
 competitors-0.073
Token as
Token ID355
Feature activation+2.16e-03

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
L+0.001
oc+0.001
Neg logit contributions
 heights-0.002
 hesitation-0.002
used-0.002
 practicing-0.001
 competitors-0.001
Token a
Token ID257
Feature activation+1.17e-02

Pos logit contributions
grand+0.002
 lived+0.002
 woke+0.002
L+0.002
 was+0.002
Neg logit contributions
used-0.006
 heights-0.006
 hesitation-0.005
 competitors-0.005
joy-0.005
Token snack
Token ID26906
Feature activation-3.34e-03

Pos logit contributions
grand+0.017
 lived+0.016
 woke+0.015
L+0.014
oc+0.014
Neg logit contributions
 heights-0.026
 hesitation-0.025
used-0.025
 competitors-0.023
joy-0.023
Token.
Token ID13
Feature activation-2.06e-03

Pos logit contributions
 heights+0.007
 hesitation+0.006
used+0.006
 competitors+0.006
 practicing+0.006
Neg logit contributions
grand-0.005
 lived-0.005
 woke-0.005
L-0.005
oc-0.004
Token each
Token ID1123
Feature activation+1.74e-03

Pos logit contributions
 lived+0.005
grand+0.005
 woke+0.005
L+0.004
 was+0.004
Neg logit contributions
 heights-0.007
 hesitation-0.007
used-0.007
 competitors-0.006
 practicing-0.006
Token other
Token ID584
Feature activation-4.21e-03

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.003
L+0.003
tr+0.003
Neg logit contributions
 heights-0.003
used-0.003
 hesitation-0.003
 competitors-0.003
 practicing-0.003
Token,
Token ID11
Feature activation+2.88e-03

Pos logit contributions
 heights+0.011
used+0.010
 hesitation+0.010
joy+0.009
 attempts+0.009
Neg logit contributions
 lived-0.005
 was-0.004
 woke-0.004
grand-0.004
man-0.004
Token Jake
Token ID14757
Feature activation+1.31e-02

Pos logit contributions
 lived+0.005
grand+0.005
 woke+0.005
L+0.005
 was+0.004
Neg logit contributions
 heights-0.006
used-0.005
 hesitation-0.005
 competitors-0.005
 practicing-0.005
Token smiled
Token ID13541
Feature activation-1.15e-03

Pos logit contributions
grand+0.015
 lived+0.015
 woke+0.014
L+0.012
oc+0.012
Neg logit contributions
 heights-0.034
used-0.032
 hesitation-0.032
 competitors-0.029
joy-0.029
Token an
Token ID281
Feature activation+4.41e-02

Pos logit contributions
 heights+0.003
used+0.003
 hesitation+0.003
 competitors+0.003
 practicing+0.003
Neg logit contributions
 lived-0.001
grand-0.001
 woke-0.001
L-0.001
tr-0.001
Token enthusiastic
Token ID21848
Feature activation+7.33e-03

Pos logit contributions
L+0.078
 lived+0.078
grand+0.078
tr+0.077
ork+0.075
Neg logit contributions
 hesitation-0.079
 heights-0.079
used-0.074
iously-0.069
 competitors-0.067
Token smile
Token ID8212
Feature activation-4.34e-03

Pos logit contributions
grand+0.012
 lived+0.011
L+0.010
 woke+0.010
oc+0.010
Neg logit contributions
 heights-0.016
 hesitation-0.015
used-0.015
joy-0.014
 competitors-0.013
Token.
Token ID13
Feature activation+2.85e-03

Pos logit contributions
 heights+0.010
used+0.009
 hesitation+0.009
 competitors+0.009
joy+0.008
Neg logit contributions
grand-0.006
 lived-0.006
 woke-0.006
L-0.005
oc-0.005
Token Clara
Token ID27443
Feature activation+9.48e-03

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.003
L+0.003
 was+0.002
Neg logit contributions
 heights-0.008
used-0.008
 hesitation-0.007
 competitors-0.007
 practicing-0.007
Token was
Token ID373
Feature activation-3.88e-03

Pos logit contributions
grand+0.011
 lived+0.009
 woke+0.009
L+0.009
oc+0.008
Neg logit contributions
 heights-0.024
used-0.023
 hesitation-0.023
 competitors-0.021
joy-0.021
TokenNext
Token ID10019
Feature activation-3.17e-03

Pos logit contributions
used+0.006
 heights+0.005
ths+0.005
 competitors+0.005
 hesitation+0.005
Neg logit contributions
grand-0.006
tr-0.005
 lived-0.005
 woke-0.005
oc-0.005
Token,
Token ID11
Feature activation+2.12e-03

Pos logit contributions
 heights+0.007
used+0.007
 hesitation+0.007
 competitors+0.006
 Majesty+0.006
Neg logit contributions
 lived-0.005
grand-0.004
 woke-0.004
L-0.004
 was-0.004
Token Daddy
Token ID26926
Feature activation+1.11e-02

Pos logit contributions
 lived+0.003
grand+0.003
 woke+0.003
 was+0.003
L+0.003
Neg logit contributions
 heights-0.005
 hesitation-0.004
used-0.004
 competitors-0.004
joy-0.004
Token taught
Token ID7817
Feature activation+5.38e-03

Pos logit contributions
grand+0.012
 lived+0.011
 woke+0.011
L+0.009
oc+0.009
Neg logit contributions
 heights-0.030
used-0.029
 hesitation-0.028
joy-0.026
 competitors-0.026
Token Jane
Token ID12091
Feature activation+3.97e-03

Pos logit contributions
grand+0.008
 lived+0.008
 woke+0.007
L+0.007
 was+0.006
Neg logit contributions
 heights-0.012
used-0.011
 hesitation-0.011
 competitors-0.010
 practicing-0.010
Token an
Token ID281
Feature activation+4.40e-02

Pos logit contributions
grand+0.007
 lived+0.006
 woke+0.006
L+0.006
 was+0.005
Neg logit contributions
 heights-0.008
used-0.008
 hesitation-0.008
joy-0.007
 competitors-0.007
Token action
Token ID2223
Feature activation+1.10e-03

Pos logit contributions
grand+0.061
 lived+0.056
 woke+0.053
L+0.052
oc+0.049
Neg logit contributions
 heights-0.103
used-0.098
 hesitation-0.097
 competitors-0.090
 practicing-0.088
Token that
Token ID326
Feature activation+2.57e-03

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
 was+0.001
tr+0.001
Neg logit contributions
 heights-0.003
used-0.003
 hesitation-0.002
joy-0.002
 competitors-0.002
Token she
Token ID673
Feature activation+5.16e-03

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.003
L+0.003
tr+0.003
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
joy-0.005
 competitors-0.005
Token had
Token ID550
Feature activation+2.33e-03

Pos logit contributions
grand+0.006
 lived+0.005
L+0.005
 woke+0.005
oc+0.005
Neg logit contributions
 heights-0.013
used-0.012
 hesitation-0.012
 competitors-0.012
 practicing-0.011
Token to
Token ID284
Feature activation+1.05e-02

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.003
L+0.003
 was+0.002
Neg logit contributions
 heights-0.006
used-0.005
 hesitation-0.005
 competitors-0.005
joy-0.005
Token beautiful
Token ID4950
Feature activation+2.12e-03

Pos logit contributions
grand+0.001
oc+0.001
L+0.001
 lived+0.001
 woke+0.001
Neg logit contributions
 heights-0.001
 hesitation-0.001
used-0.001
joy-0.001
 practicing-0.001
Token and
Token ID290
Feature activation+1.90e-03

Pos logit contributions
 lived+0.001
 was+0.001
 woke+0.001
grand+0.001
L+0.001
Neg logit contributions
 hesitation-0.006
used-0.006
 heights-0.006
 competitors-0.006
 attempts-0.006
Token it
Token ID340
Feature activation+1.02e-02

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.002
L+0.002
oc+0.002
Neg logit contributions
 heights-0.004
used-0.004
 hesitation-0.004
 competitors-0.004
joy-0.004
Token gl
Token ID1278
Feature activation+2.02e-02

Pos logit contributions
grand+0.013
oc+0.010
L+0.010
tr+0.009
rost+0.009
Neg logit contributions
 heights-0.025
used-0.024
 hesitation-0.023
 practicing-0.022
 competitors-0.021
Tokenowed
Token ID6972
Feature activation+1.04e-02

Pos logit contributions
grand+0.043
 lived+0.038
 woke+0.038
L+0.036
oc+0.036
Neg logit contributions
used-0.033
 heights-0.031
iously-0.029
 hesitation-0.028
 competitors-0.027
Token an
Token ID281
Feature activation+4.40e-02

Pos logit contributions
grand+0.016
 lived+0.015
 woke+0.014
L+0.014
oc+0.013
Neg logit contributions
 heights-0.022
used-0.021
 hesitation-0.021
 competitors-0.019
joy-0.019
Token even
Token ID772
Feature activation+1.31e-02

Pos logit contributions
grand+0.074
L+0.074
ork+0.072
tr+0.072
 lived+0.071
Neg logit contributions
 heights-0.085
 hesitation-0.081
used-0.079
iously-0.075
 competitors-0.070
Token brighter
Token ID26083
Feature activation+6.62e-03

Pos logit contributions
grand+0.027
 lived+0.025
 woke+0.024
L+0.023
oc+0.023
Neg logit contributions
 heights-0.022
used-0.020
 hesitation-0.020
 competitors-0.017
 practicing-0.017
Token purple
Token ID14032
Feature activation+2.25e-03

Pos logit contributions
grand+0.008
 lived+0.007
 woke+0.007
L+0.006
oc+0.006
Neg logit contributions
 heights-0.017
used-0.016
 hesitation-0.016
 competitors-0.015
joy-0.015
Token than
Token ID621
Feature activation+7.28e-03

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.003
L+0.003
oc+0.003
Neg logit contributions
 heights-0.005
used-0.005
 hesitation-0.005
 competitors-0.004
joy-0.004
Token before
Token ID878
Feature activation+1.45e-03

Pos logit contributions
grand+0.011
 lived+0.011
 woke+0.011
L+0.010
tr+0.009
Neg logit contributions
 heights-0.015
used-0.015
 hesitation-0.014
 competitors-0.013
joy-0.013
Token saw
Token ID2497
Feature activation+6.62e-03

Pos logit contributions
grand+0.016
 lived+0.013
oc+0.012
L+0.012
 woke+0.011
Neg logit contributions
 heights-0.025
used-0.024
 hesitation-0.023
 practicing-0.021
joy-0.021
Token a
Token ID257
Feature activation+1.47e-02

Pos logit contributions
grand+0.011
 lived+0.010
 woke+0.009
L+0.009
oc+0.009
Neg logit contributions
 heights-0.014
used-0.013
 hesitation-0.013
 competitors-0.012
joy-0.012
Token little
Token ID1310
Feature activation-5.72e-03

Pos logit contributions
grand+0.022
 lived+0.021
 woke+0.020
L+0.019
oc+0.018
Neg logit contributions
 heights-0.032
used-0.031
 hesitation-0.030
 competitors-0.028
joy-0.028
Token bird
Token ID6512
Feature activation+2.59e-03

Pos logit contributions
used+0.013
 heights+0.013
 hesitation+0.013
 competitors+0.012
iously+0.012
Neg logit contributions
 woke-0.007
oc-0.006
grand-0.006
L-0.006
 lived-0.006
Token singing
Token ID13777
Feature activation+5.12e-03

Pos logit contributions
grand+0.005
 lived+0.004
L+0.004
 woke+0.004
tr+0.004
Neg logit contributions
 heights-0.005
used-0.005
 hesitation-0.005
 practicing-0.004
joy-0.004
Token an
Token ID281
Feature activation+4.39e-02

Pos logit contributions
grand+0.010
 lived+0.008
oc+0.008
L+0.008
 woke+0.007
Neg logit contributions
 heights-0.010
used-0.009
 hesitation-0.009
joy-0.009
 practicing-0.008
Token ancient
Token ID6156
Feature activation+1.77e-02

Pos logit contributions
grand+0.078
L+0.076
 lived+0.076
ork+0.074
 woke+0.074
Neg logit contributions
 heights-0.082
 hesitation-0.078
used-0.076
iously-0.070
 practicing-0.066
Token song
Token ID3496
Feature activation+9.58e-03

Pos logit contributions
grand+0.027
 lived+0.025
 woke+0.024
L+0.023
oc+0.022
Neg logit contributions
 heights-0.039
used-0.037
 hesitation-0.036
 competitors-0.033
joy-0.033
Token.
Token ID13
Feature activation-2.45e-03

Pos logit contributions
grand+0.013
 lived+0.012
 woke+0.012
L+0.011
tr+0.011
Neg logit contributions
 heights-0.022
used-0.021
 hesitation-0.021
 competitors-0.019
joy-0.019
Token The
Token ID383
Feature activation-3.69e-03

Pos logit contributions
 heights+0.006
used+0.005
 hesitation+0.005
 competitors+0.005
joy+0.005
Neg logit contributions
grand-0.003
 lived-0.003
 woke-0.003
L-0.003
oc-0.003
Token ant
Token ID1885
Feature activation+1.58e-04

Pos logit contributions
 heights+0.007
used+0.007
 hesitation+0.007
 competitors+0.006
joy+0.006
Neg logit contributions
grand-0.006
 lived-0.006
 woke-0.006
L-0.006
oc-0.005
Token mum
Token ID25682
Feature activation-5.01e-03

Pos logit contributions
 heights+0.003
used+0.003
ner+0.003
 hesitation+0.003
 competitors+0.003
Neg logit contributions
 lived-0.003
 woke-0.003
 was-0.003
grand-0.003
 ash-0.003
Token told
Token ID1297
Feature activation+1.18e-03

Pos logit contributions
 heights+0.011
used+0.011
 hesitation+0.011
 competitors+0.010
 practicing+0.010
Neg logit contributions
grand-0.007
 lived-0.006
 woke-0.006
L-0.006
oc-0.006
Token her
Token ID607
Feature activation+9.15e-03

Pos logit contributions
grand+0.002
oc+0.002
 lived+0.002
tr+0.002
L+0.002
Neg logit contributions
 heights-0.002
 hesitation-0.002
used-0.002
 practicing-0.002
 competitors-0.002
Token it
Token ID340
Feature activation+6.98e-04

Pos logit contributions
grand+0.013
 lived+0.013
 woke+0.012
 was+0.011
ork+0.010
Neg logit contributions
 heights-0.022
used-0.021
 hesitation-0.020
 competitors-0.019
joy-0.018
Token was
Token ID373
Feature activation+6.87e-03

Pos logit contributions
grand+0.001
oc+0.000
L+0.000
rost+0.000
 lived+0.000
Neg logit contributions
 heights-0.002
used-0.002
 hesitation-0.002
 practicing-0.002
 competitors-0.002
Token an
Token ID281
Feature activation+4.39e-02

Pos logit contributions
grand+0.011
 lived+0.010
oc+0.010
 woke+0.009
L+0.009
Neg logit contributions
 heights-0.015
 hesitation-0.014
used-0.014
 practicing-0.013
joy-0.012
Token earthquake
Token ID16295
Feature activation+3.13e-03

Pos logit contributions
L+0.073
grand+0.073
 lived+0.070
 woke+0.067
oc+0.066
Neg logit contributions
 heights-0.089
 hesitation-0.081
used-0.080
iously-0.075
 muscles-0.071
Token.
Token ID13
Feature activation-1.09e-03

Pos logit contributions
grand+0.004
 lived+0.004
 woke+0.003
tr+0.003
L+0.003
Neg logit contributions
 heights-0.008
used-0.008
 hesitation-0.008
joy-0.007
 competitors-0.007
Token
Token ID220
Feature activation-1.51e-02

Pos logit contributions
 heights+0.003
 hesitation+0.003
used+0.003
 competitors+0.002
joy+0.002
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
oc-0.001
Token
Token ID198
Feature activation+3.68e-03

Pos logit contributions
 heights+0.036
used+0.035
 hesitation+0.034
 competitors+0.032
joy+0.031
Neg logit contributions
grand-0.019
 lived-0.017
 woke-0.016
L-0.015
oc-0.015
Token
Token ID198
Feature activation-7.34e-03

Pos logit contributions
grand+0.005
 lived+0.005
 woke+0.004
tr+0.004
oc+0.004
Neg logit contributions
 heights-0.008
used-0.008
 hesitation-0.008
 competitors-0.007
 practicing-0.007
Token back
Token ID736
Feature activation+7.15e-03

Pos logit contributions
 heights+0.051
 hesitation+0.048
used+0.046
 competitors+0.043
 attempts+0.042
Neg logit contributions
 lived-0.030
 woke-0.030
grand-0.030
 was-0.028
L-0.027
Token and
Token ID290
Feature activation+1.91e-03

Pos logit contributions
grand+0.011
 lived+0.010
 woke+0.009
L+0.009
oc+0.009
Neg logit contributions
 heights-0.016
used-0.015
 hesitation-0.015
 competitors-0.014
joy-0.013
Token gave
Token ID2921
Feature activation+1.44e-02

Pos logit contributions
grand+0.003
 lived+0.002
L+0.002
tr+0.002
 woke+0.002
Neg logit contributions
 heights-0.004
used-0.004
 hesitation-0.004
 practicing-0.004
joy-0.004
Token the
Token ID262
Feature activation+2.32e-03

Pos logit contributions
grand+0.024
 lived+0.022
 woke+0.021
L+0.021
tr+0.020
Neg logit contributions
 heights-0.029
used-0.028
 hesitation-0.027
 competitors-0.025
joy-0.025
Token bird
Token ID6512
Feature activation-1.51e-02

Pos logit contributions
grand+0.003
|+0.003
rost+0.002
rim+0.002
 woke+0.002
Neg logit contributions
 heights-0.005
 hesitation-0.005
 competitors-0.005
used-0.005
joy-0.005
Token an
Token ID281
Feature activation+4.38e-02

Pos logit contributions
 heights+0.030
used+0.028
 hesitation+0.028
 competitors+0.025
joy+0.025
Neg logit contributions
grand-0.025
 lived-0.023
 woke-0.022
L-0.022
oc-0.021
Token open
Token ID1280
Feature activation-1.40e-03

Pos logit contributions
|+0.072
L+0.071
grand+0.068
 woke+0.065
 lived+0.065
Neg logit contributions
 hesitation-0.084
 heights-0.080
used-0.079
iously-0.075
 luck-0.073
Token box
Token ID3091
Feature activation+5.52e-03

Pos logit contributions
 heights+0.003
used+0.003
 hesitation+0.003
joy+0.003
 competitors+0.003
Neg logit contributions
grand-0.002
 lived-0.002
L-0.002
 woke-0.002
rost-0.002
Token.
Token ID13
Feature activation+6.78e-03

Pos logit contributions
grand+0.008
 lived+0.008
 woke+0.007
L+0.007
oc+0.007
Neg logit contributions
 heights-0.012
used-0.011
 hesitation-0.011
 competitors-0.010
joy-0.010
Token Inside
Token ID14384
Feature activation-1.15e-02

Pos logit contributions
grand+0.010
 lived+0.009
 woke+0.008
L+0.008
oc+0.008
Neg logit contributions
 heights-0.015
used-0.015
 hesitation-0.014
 competitors-0.013
joy-0.013
Token the
Token ID262
Feature activation+1.24e-03

Pos logit contributions
 heights+0.023
used+0.022
 hesitation+0.021
joy+0.019
 competitors+0.019
Neg logit contributions
grand-0.020
 lived-0.018
L-0.017
 woke-0.017
tr-0.017
Token
Token ID198
Feature activation-2.76e-03

Pos logit contributions
grand+0.009
 lived+0.008
 woke+0.008
tr+0.008
oc+0.007
Neg logit contributions
 heights-0.015
used-0.015
 hesitation-0.014
 practicing-0.013
 competitors-0.013
TokenThe
Token ID464
Feature activation-7.63e-03

Pos logit contributions
 heights+0.006
used+0.006
 hesitation+0.006
 competitors+0.005
 practicing+0.005
Neg logit contributions
grand-0.004
 lived-0.004
 woke-0.004
tr-0.003
oc-0.003
Token bunny
Token ID44915
Feature activation-4.88e-03

Pos logit contributions
 heights+0.017
used+0.016
 hesitation+0.015
iously+0.014
joy+0.014
Neg logit contributions
 lived-0.011
L-0.011
grand-0.011
 woke-0.011
oc-0.010
Token taught
Token ID7817
Feature activation-3.48e-03

Pos logit contributions
 heights+0.014
used+0.013
 hesitation+0.013
 competitors+0.012
joy+0.012
Neg logit contributions
grand-0.005
 lived-0.004
 woke-0.004
L-0.004
oc-0.004
Token us
Token ID514
Feature activation+8.78e-03

Pos logit contributions
used+0.008
 heights+0.008
 hesitation+0.008
joy+0.007
 competitors+0.007
Neg logit contributions
grand-0.004
 lived-0.004
L-0.004
 woke-0.004
 was-0.003
Token an
Token ID281
Feature activation+4.38e-02

Pos logit contributions
 lived+0.014
grand+0.013
 woke+0.012
 was+0.012
L+0.012
Neg logit contributions
 heights-0.018
used-0.018
 hesitation-0.017
joy-0.016
 competitors-0.016
Token important
Token ID1593
Feature activation+1.92e-02

Pos logit contributions
grand+0.057
 lived+0.051
 woke+0.050
L+0.050
tr+0.046
Neg logit contributions
 heights-0.105
 hesitation-0.100
used-0.099
 competitors-0.093
iously-0.091
Token lesson
Token ID11483
Feature activation+2.05e-03

Pos logit contributions
grand+0.021
 lived+0.018
 woke+0.017
L+0.016
oc+0.015
Neg logit contributions
 heights-0.051
 hesitation-0.048
used-0.047
 competitors-0.045
joy-0.045
Token that
Token ID326
Feature activation+1.70e-02

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.003
L+0.003
oc+0.003
Neg logit contributions
 heights-0.004
used-0.004
 hesitation-0.004
 competitors-0.004
joy-0.004
Token day
Token ID1110
Feature activation+1.18e-02

Pos logit contributions
grand+0.021
 lived+0.018
L+0.017
 woke+0.017
tr+0.016
Neg logit contributions
 heights-0.042
used-0.040
 hesitation-0.040
 practicing-0.037
 competitors-0.037
Token
Token ID6184
Feature activation-8.52e-03

Pos logit contributions
 lived+0.017
grand+0.016
 woke+0.015
 was+0.014
L+0.014
Neg logit contributions
 heights-0.027
 hesitation-0.025
used-0.025
 competitors-0.024
 Majesty-0.023
Token she
Token ID673
Feature activation+4.49e-03

Pos logit contributions
grand+0.020
 lived+0.018
oc+0.017
 woke+0.017
tr+0.017
Neg logit contributions
 heights-0.018
used-0.017
 hesitation-0.017
joy-0.015
 practicing-0.015
Token saw
Token ID2497
Feature activation+5.05e-03

Pos logit contributions
grand+0.008
oc+0.006
tr+0.006
rost+0.006
 lived+0.006
Neg logit contributions
 heights-0.008
used-0.008
 practicing-0.008
 hesitation-0.008
joy-0.008
Token a
Token ID257
Feature activation+1.25e-02

Pos logit contributions
grand+0.009
oc+0.008
 lived+0.008
 woke+0.008
tr+0.007
Neg logit contributions
 heights-0.010
 hesitation-0.009
used-0.009
 practicing-0.008
joy-0.008
Token beautiful
Token ID4950
Feature activation+7.27e-03

Pos logit contributions
grand+0.022
 lived+0.019
|+0.017
 woke+0.017
oc+0.017
Neg logit contributions
 heights-0.024
 hesitation-0.023
used-0.022
joy-0.022
 practicing-0.021
Token f
Token ID277
Feature activation+6.72e-03

Pos logit contributions
grand+0.015
 lived+0.013
rost+0.012
|+0.012
oc+0.012
Neg logit contributions
 heights-0.012
used-0.011
 hesitation-0.011
joy-0.011
 practicing-0.011
Tokenauc
Token ID14272
Feature activation+4.37e-02

Pos logit contributions
grand+0.007
 lived+0.006
 woke+0.005
oc+0.005
L+0.005
Neg logit contributions
used-0.018
 heights-0.018
 hesitation-0.017
ipping-0.016
 practicing-0.016
Tokenet
Token ID316
Feature activation+2.27e-03

Pos logit contributions
grand+0.097
 lived+0.089
ork+0.087
L+0.085
jar+0.084
Neg logit contributions
 heights-0.054
 breaths-0.053
iously-0.043
used-0.043
 hesitation-0.043
Token.
Token ID13
Feature activation-1.73e-03

Pos logit contributions
grand+0.004
 lived+0.004
L+0.003
 woke+0.003
oc+0.003
Neg logit contributions
 heights-0.004
used-0.004
 hesitation-0.004
 practicing-0.004
 competitors-0.004
Token
Token ID220
Feature activation-6.64e-03

Pos logit contributions
 heights+0.003
used+0.003
 hesitation+0.003
joy+0.003
 practicing+0.002
Neg logit contributions
grand-0.003
oc-0.003
 lived-0.003
 woke-0.003
rost-0.003
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
 heights+0.012
used+0.012
 hesitation+0.011
joy+0.011
 practicing+0.011
Neg logit contributions
grand-0.011
 woke-0.010
 lived-0.010
oc-0.009
tr-0.009
Token
Token ID198
Feature activation-1.03e-03

Pos logit contributions
grand+0.018
 lived+0.016
tr+0.016
oc+0.015
 woke+0.015
Neg logit contributions
 heights-0.018
used-0.018
 hesitation-0.017
joy-0.017
 practicing-0.016
Token.
Token ID13
Feature activation-7.86e-03

Pos logit contributions
grand+0.016
 lived+0.014
 woke+0.013
L+0.013
tr+0.012
Neg logit contributions
 heights-0.032
used-0.031
 hesitation-0.031
 breaths-0.027
joy-0.027
Token He
Token ID679
Feature activation+2.54e-03

Pos logit contributions
 heights+0.018
used+0.018
 hesitation+0.018
 competitors+0.016
 practicing+0.016
Neg logit contributions
grand-0.011
 lived-0.010
 woke-0.009
L-0.009
oc-0.009
Token was
Token ID373
Feature activation-6.46e-03

Pos logit contributions
grand+0.003
oc+0.003
 lived+0.002
L+0.002
tr+0.002
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
 competitors-0.005
 practicing-0.005
Token wearing
Token ID5762
Feature activation+1.22e-02

Pos logit contributions
 heights+0.014
used+0.013
 hesitation+0.012
 competitors+0.012
iously+0.011
Neg logit contributions
grand-0.010
 lived-0.010
 woke-0.009
L-0.009
 was-0.009
Token his
Token ID465
Feature activation+3.27e-03

Pos logit contributions
grand+0.014
 lived+0.013
 woke+0.012
L+0.012
tr+0.010
Neg logit contributions
 heights-0.031
used-0.031
 hesitation-0.029
 competitors-0.027
iously-0.027
Token brand
Token ID4508
Feature activation+4.37e-02

Pos logit contributions
grand+0.006
 lived+0.005
 woke+0.005
L+0.005
oc+0.005
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
 competitors-0.005
joy-0.005
Token new
Token ID649
Feature activation+1.46e-02

Pos logit contributions
grand+0.095
 lived+0.091
oc+0.088
 woke+0.088
|+0.086
Neg logit contributions
 heights-0.062
 hesitation-0.050
 practicing-0.047
used-0.046
joy-0.046
Token icy
Token ID30284
Feature activation+2.44e-02

Pos logit contributions
grand+0.024
 lived+0.022
 woke+0.021
oc+0.021
L+0.020
Neg logit contributions
 heights-0.030
 hesitation-0.028
used-0.028
joy-0.026
 competitors-0.026
Token blue
Token ID4171
Feature activation+6.22e-03

Pos logit contributions
grand+0.047
 lived+0.043
oc+0.042
L+0.040
 woke+0.038
Neg logit contributions
 heights-0.043
 hesitation-0.039
used-0.038
joy-0.037
 muscles-0.037
Token jacket
Token ID15224
Feature activation+2.49e-02

Pos logit contributions
grand+0.010
 lived+0.009
oc+0.008
 woke+0.008
L+0.008
Neg logit contributions
 heights-0.013
used-0.012
 hesitation-0.012
joy-0.011
 competitors-0.011
Token as
Token ID355
Feature activation+6.69e-03

Pos logit contributions
grand+0.035
 lived+0.033
 woke+0.031
L+0.029
tr+0.028
Neg logit contributions
 heights-0.058
used-0.055
 hesitation-0.054
 competitors-0.050
joy-0.049
Token's
Token ID338
Feature activation+6.12e-03

Pos logit contributions
grand+0.022
 lived+0.018
L+0.017
 woke+0.016
tr+0.016
Neg logit contributions
 heights-0.038
used-0.037
 hesitation-0.037
 practicing-0.034
 competitors-0.034
Token mom
Token ID1995
Feature activation+1.75e-02

Pos logit contributions
grand+0.011
 lived+0.010
 woke+0.010
L+0.009
oc+0.009
Neg logit contributions
 heights-0.012
 hesitation-0.011
used-0.011
joy-0.010
 competitors-0.010
Token gave
Token ID2921
Feature activation+1.41e-02

Pos logit contributions
grand+0.025
 lived+0.020
 woke+0.019
tr+0.019
L+0.019
Neg logit contributions
used-0.039
 heights-0.039
 hesitation-0.038
 practicing-0.036
 competitors-0.035
Token them
Token ID606
Feature activation-9.76e-03

Pos logit contributions
grand+0.028
 lived+0.025
tr+0.024
oc+0.024
 woke+0.024
Neg logit contributions
 heights-0.026
 hesitation-0.024
used-0.024
joy-0.022
 practicing-0.021
Token a
Token ID257
Feature activation+1.24e-02

Pos logit contributions
 heights+0.019
used+0.018
 hesitation+0.018
 competitors+0.016
joy+0.016
Neg logit contributions
grand-0.017
 lived-0.016
 woke-0.015
L-0.015
oc-0.014
Token brand
Token ID4508
Feature activation+4.37e-02

Pos logit contributions
grand+0.022
 lived+0.020
oc+0.018
 woke+0.017
L+0.017
Neg logit contributions
 hesitation-0.024
 heights-0.023
used-0.022
joy-0.022
 practicing-0.021
Token new
Token ID649
Feature activation+1.84e-02

Pos logit contributions
grand+0.097
 lived+0.090
 woke+0.088
|+0.083
tr+0.083
Neg logit contributions
 heights-0.064
 hesitation-0.054
used-0.053
joy-0.051
 practicing-0.049
Token basketball
Token ID9669
Feature activation-4.16e-03

Pos logit contributions
grand+0.030
 lived+0.028
 woke+0.027
L+0.026
oc+0.025
Neg logit contributions
 heights-0.039
used-0.037
 hesitation-0.036
 competitors-0.033
joy-0.032
Token and
Token ID290
Feature activation+8.23e-03

Pos logit contributions
 heights+0.009
used+0.009
 hesitation+0.009
 competitors+0.008
joy+0.008
Neg logit contributions
grand-0.006
 lived-0.006
 woke-0.005
L-0.005
oc-0.005
Token they
Token ID484
Feature activation-2.60e-03

Pos logit contributions
grand+0.014
 lived+0.013
 woke+0.013
L+0.012
oc+0.012
Neg logit contributions
 heights-0.016
used-0.015
 hesitation-0.015
 practicing-0.014
 competitors-0.014
Token played
Token ID2826
Feature activation+1.10e-02

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.005
 practicing+0.004
 competitors+0.004
Neg logit contributions
grand-0.004
 lived-0.004
L-0.004
oc-0.004
 woke-0.004
Token could
Token ID714
Feature activation-5.78e-03

Pos logit contributions
 heights+0.006
used+0.005
 hesitation+0.005
 competitors+0.005
 practicing+0.005
Neg logit contributions
grand-0.003
 lived-0.002
L-0.002
 woke-0.002
oc-0.002
Token even
Token ID772
Feature activation-6.58e-03

Pos logit contributions
 heights+0.013
used+0.013
 hesitation+0.013
joy+0.012
 competitors+0.012
Neg logit contributions
grand-0.008
 lived-0.007
 woke-0.007
L-0.006
tr-0.006
Token turn
Token ID1210
Feature activation-4.25e-03

Pos logit contributions
 heights+0.015
used+0.015
 hesitation+0.014
joy+0.014
 competitors+0.013
Neg logit contributions
grand-0.009
 lived-0.008
 woke-0.008
L-0.007
tr-0.007
Token flip
Token ID14283
Feature activation-2.22e-02

Pos logit contributions
 heights+0.009
used+0.009
 hesitation+0.009
 competitors+0.008
iously+0.008
Neg logit contributions
grand-0.006
 woke-0.006
 lived-0.006
L-0.005
 was-0.005
Token-
Token ID12
Feature activation-6.12e-03

Pos logit contributions
 heights+0.048
used+0.047
 hesitation+0.047
 practicing+0.042
joy+0.042
Neg logit contributions
grand-0.031
 lived-0.031
 woke-0.029
L-0.029
tr-0.027
Tokenfl
Token ID2704
Feature activation-7.65e-02

Pos logit contributions
 hesitation+0.013
 heights+0.013
used+0.012
 attempts+0.011
joy+0.011
Neg logit contributions
grand-0.010
L-0.009
 woke-0.009
 lived-0.009
oc-0.008
Tokenops
Token ID2840
Feature activation-1.73e-02

Pos logit contributions
 hesitation+0.160
 heights+0.144
joy+0.142
used+0.141
 attempts+0.132
Neg logit contributions
tr-0.138
 lived-0.122
grand-0.120
room-0.118
man-0.115
Token!
Token ID0
Feature activation-2.16e-04

Pos logit contributions
 heights+0.046
 hesitation+0.044
used+0.044
joy+0.043
 attempts+0.041
Neg logit contributions
grand-0.016
 woke-0.015
 lived-0.015
tr-0.015
L-0.014
Token Everyone
Token ID11075
Feature activation-2.23e-03

Pos logit contributions
 heights+0.001
used+0.000
 hesitation+0.000
 competitors+0.000
joy+0.000
Neg logit contributions
grand-0.000
 lived-0.000
 woke-0.000
L-0.000
oc-0.000
Token was
Token ID373
Feature activation+1.15e-03

Pos logit contributions
 heights+0.006
used+0.006
 hesitation+0.006
 competitors+0.005
joy+0.005
Neg logit contributions
grand-0.002
 lived-0.002
 woke-0.002
L-0.002
oc-0.002
Token very
Token ID845
Feature activation+2.08e-03

Pos logit contributions
 lived+0.002
grand+0.001
 was+0.001
 woke+0.001
L+0.001
Neg logit contributions
 heights-0.003
used-0.002
 hesitation-0.002
iously-0.002
 breaths-0.002
Token had
Token ID550
Feature activation+2.50e-03

Pos logit contributions
grand+0.004
 lived+0.004
 woke+0.003
L+0.003
oc+0.003
Neg logit contributions
 heights-0.017
used-0.016
 hesitation-0.016
 competitors-0.015
joy-0.015
Token been
Token ID587
Feature activation-9.80e-03

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.003
L+0.003
oc+0.003
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.005
 competitors-0.005
joy-0.005
Token decorated
Token ID24789
Feature activation+5.95e-04

Pos logit contributions
 heights+0.021
used+0.020
 hesitation+0.019
 competitors+0.018
joy+0.017
Neg logit contributions
grand-0.015
 lived-0.014
 woke-0.014
L-0.013
tr-0.012
Token with
Token ID351
Feature activation-5.15e-03

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
L+0.001
oc+0.001
Neg logit contributions
 heights-0.001
used-0.001
 hesitation-0.001
 competitors-0.001
joy-0.001
Token snow
Token ID6729
Feature activation-2.65e-03

Pos logit contributions
 heights+0.011
used+0.010
 hesitation+0.010
 competitors+0.009
joy+0.009
Neg logit contributions
grand-0.009
 lived-0.008
 woke-0.008
L-0.007
oc-0.007
Tokenfl
Token ID2704
Feature activation-7.04e-02

Pos logit contributions
 heights+0.006
used+0.006
 hesitation+0.006
joy+0.006
 competitors+0.006
Neg logit contributions
grand-0.003
 lived-0.003
 woke-0.003
tr-0.003
man-0.003
Tokenakes
Token ID1124
Feature activation-1.29e-02

Pos logit contributions
 hesitation+0.140
 heights+0.137
used+0.127
joy+0.118
 attempts+0.116
Neg logit contributions
grand-0.122
tr-0.118
 lived-0.117
 woke-0.107
man-0.102
Token.
Token ID13
Feature activation-1.64e-04

Pos logit contributions
 heights+0.033
used+0.032
 hesitation+0.031
 competitors+0.030
joy+0.030
Neg logit contributions
grand-0.014
 lived-0.013
 woke-0.013
 was-0.012
oc-0.011
Token Thomas
Token ID5658
Feature activation+8.18e-03

Pos logit contributions
 heights+0.000
 hesitation+0.000
used+0.000
 competitors+0.000
 practicing+0.000
Neg logit contributions
grand-0.000
 lived-0.000
 woke-0.000
L-0.000
oc-0.000
Token couldn
Token ID3521
Feature activation-3.78e-03

Pos logit contributions
grand+0.008
 lived+0.007
 woke+0.007
L+0.006
oc+0.006
Neg logit contributions
 heights-0.023
used-0.022
 hesitation-0.021
 competitors-0.020
joy-0.020
Token't
Token ID470
Feature activation+6.54e-03

Pos logit contributions
 heights+0.007
used+0.007
 hesitation+0.007
 competitors+0.006
 practicing+0.006
Neg logit contributions
grand-0.007
 lived-0.006
 woke-0.006
L-0.006
oc-0.005
Token stretch
Token ID7539
Feature activation-7.57e-03

Pos logit contributions
grand+0.010
 lived+0.009
 woke+0.009
L+0.008
oc+0.008
Neg logit contributions
 heights-0.017
used-0.016
 hesitation-0.016
 competitors-0.015
joy-0.015
Token out
Token ID503
Feature activation-4.40e-04

Pos logit contributions
 heights+0.017
used+0.016
 hesitation+0.016
 competitors+0.015
joy+0.014
Neg logit contributions
grand-0.011
 lived-0.010
 woke-0.010
L-0.010
oc-0.009
Token and
Token ID290
Feature activation-3.22e-03

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 competitors+0.001
joy+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.000
oc-0.000
Token enormous
Token ID9812
Feature activation-5.49e-03

Pos logit contributions
 heights+0.007
used+0.007
joy+0.007
 hesitation+0.007
 competitors+0.007
Neg logit contributions
 lived-0.004
 woke-0.004
grand-0.004
 was-0.004
tr-0.003
Token sun
Token ID4252
Feature activation-4.84e-03

Pos logit contributions
used+0.010
 heights+0.009
 hesitation+0.009
 competitors+0.008
joy+0.008
Neg logit contributions
 lived-0.010
 woke-0.010
grand-0.009
 was-0.009
 ash-0.008
Tokenfl
Token ID2704
Feature activation-7.03e-02

Pos logit contributions
 heights+0.011
 hesitation+0.010
used+0.010
joy+0.009
 competitors+0.009
Neg logit contributions
 lived-0.007
grand-0.007
 woke-0.007
tr-0.007
man-0.007
Tokenowers
Token ID3618
Feature activation-2.97e-02

Pos logit contributions
 hesitation+0.131
 heights+0.121
used+0.111
joy+0.109
 attempts+0.108
Neg logit contributions
 lived-0.137
tr-0.128
grand-0.123
 woke-0.121
ase-0.116
Token would
Token ID561
Feature activation+1.13e-02

Pos logit contributions
 heights+0.075
 hesitation+0.070
used+0.070
joy+0.069
 competitors+0.067
Neg logit contributions
 lived-0.037
 woke-0.033
grand-0.033
 was-0.030
 belonged-0.028
Token show
Token ID905
Feature activation+2.16e-03

Pos logit contributions
grand+0.015
 lived+0.014
 woke+0.013
L+0.013
oc+0.012
Neg logit contributions
 heights-0.026
used-0.025
 hesitation-0.025
 competitors-0.023
joy-0.023
Token their
Token ID511
Feature activation+5.43e-03

Pos logit contributions
grand+0.003
 lived+0.003
 woke+0.003
L+0.003
oc+0.003
Neg logit contributions
 heights-0.005
used-0.005
 hesitation-0.005
joy-0.004
 competitors-0.004
Token bright
Token ID6016
Feature activation+1.43e-02

Pos logit contributions
grand+0.009
 lived+0.009
 woke+0.008
L+0.008
 was+0.007
Neg logit contributions
used-0.011
 heights-0.011
 hesitation-0.010
joy-0.009
 competitors-0.009
Token,
Token ID11
Feature activation-4.32e-03

Pos logit contributions
 heights+0.062
used+0.059
 hesitation+0.057
 competitors+0.055
joy+0.055
Neg logit contributions
 lived-0.025
 woke-0.022
 named-0.021
grand-0.020
 was-0.020
Token roses
Token ID42152
Feature activation-9.37e-03

Pos logit contributions
 heights+0.010
used+0.009
 hesitation+0.009
 competitors+0.008
joy+0.008
Neg logit contributions
grand-0.007
 lived-0.006
 woke-0.006
L-0.006
oc-0.005
Token,
Token ID11
Feature activation-3.90e-03

Pos logit contributions
 heights+0.023
used+0.021
 hesitation+0.021
joy+0.020
 competitors+0.020
Neg logit contributions
 lived-0.012
grand-0.012
 woke-0.011
oc-0.010
 was-0.010
Token and
Token ID290
Feature activation-6.10e-04

Pos logit contributions
 heights+0.009
used+0.009
 hesitation+0.009
 competitors+0.008
joy+0.008
Neg logit contributions
grand-0.006
 lived-0.005
L-0.005
 woke-0.005
oc-0.004
Token sun
Token ID4252
Feature activation-3.10e-03

Pos logit contributions
used+0.001
 heights+0.001
 hesitation+0.001
 competitors+0.001
joy+0.001
Neg logit contributions
 lived-0.001
grand-0.001
 woke-0.001
L-0.001
 was-0.001
Tokenfl
Token ID2704
Feature activation-7.02e-02

Pos logit contributions
 heights+0.006
 hesitation+0.006
used+0.006
joy+0.006
 competitors+0.006
Neg logit contributions
grand-0.005
 lived-0.005
 woke-0.005
tr-0.005
man-0.004
Tokenowers
Token ID3618
Feature activation-2.75e-02

Pos logit contributions
 hesitation+0.123
 heights+0.123
used+0.109
joy+0.105
 attempts+0.102
Neg logit contributions
 lived-0.137
grand-0.131
tr-0.131
 woke-0.124
oc-0.117
Token.
Token ID13
Feature activation+1.25e-04

Pos logit contributions
 heights+0.073
used+0.070
 hesitation+0.068
joy+0.067
 competitors+0.066
Neg logit contributions
 lived-0.030
grand-0.028
 woke-0.026
 was-0.024
 belonged-0.021
Token They
Token ID1119
Feature activation+2.29e-03

Pos logit contributions
grand+0.000
 lived+0.000
L+0.000
 woke+0.000
oc+0.000
Neg logit contributions
 heights-0.000
used-0.000
 hesitation-0.000
 competitors-0.000
 practicing-0.000
Token waved
Token ID26834
Feature activation+8.98e-03

Pos logit contributions
grand+0.003
 lived+0.002
 woke+0.002
L+0.002
oc+0.002
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
 competitors-0.005
joy-0.005
Token to
Token ID284
Feature activation-4.71e-03

Pos logit contributions
 lived+0.012
 woke+0.012
grand+0.011
L+0.011
 was+0.010
Neg logit contributions
 heights-0.021
used-0.020
 hesitation-0.019
 competitors-0.018
 practicing-0.018
Token me
Token ID502
Feature activation-6.74e-03

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.000
L+0.000
oc+0.000
Neg logit contributions
 heights-0.001
used-0.001
 hesitation-0.001
 competitors-0.001
joy-0.001
Tokenadow
Token ID4584
Feature activation+6.21e-04

Pos logit contributions
 heights+0.012
 hesitation+0.011
used+0.011
 competitors+0.010
joy+0.010
Neg logit contributions
grand-0.012
 lived-0.012
 woke-0.011
tr-0.011
oc-0.011
Token full
Token ID1336
Feature activation-2.12e-03

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
L+0.001
oc+0.001
Neg logit contributions
 heights-0.001
used-0.001
 hesitation-0.001
 competitors-0.001
 practicing-0.001
Token of
Token ID286
Feature activation-7.55e-03

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.005
 competitors+0.005
 practicing+0.005
Neg logit contributions
grand-0.002
 lived-0.002
 woke-0.002
L-0.002
oc-0.002
Token wild
Token ID4295
Feature activation-1.49e-02

Pos logit contributions
used+0.016
 heights+0.015
 hesitation+0.014
iously+0.013
 competitors+0.013
Neg logit contributions
 lived-0.012
grand-0.011
 woke-0.011
 ash-0.010
 was-0.010
Tokenfl
Token ID2704
Feature activation-6.87e-02

Pos logit contributions
used+0.032
 hesitation+0.029
 heights+0.029
yles+0.027
ner+0.027
Neg logit contributions
 lived-0.022
 woke-0.021
 was-0.020
oc-0.019
grand-0.019
Tokenowers
Token ID3618
Feature activation-3.39e-02

Pos logit contributions
 hesitation+0.133
 heights+0.123
joy+0.118
used+0.115
 attempts+0.112
Neg logit contributions
 lived-0.115
tr-0.115
grand-0.111
man-0.107
rim-0.106
Token.
Token ID13
Feature activation-4.41e-04

Pos logit contributions
 heights+0.097
used+0.095
joy+0.091
 Majesty+0.090
 hesitation+0.089
Neg logit contributions
 lived-0.025
grand-0.024
 was-0.022
 woke-0.020
oc-0.018
Token He
Token ID679
Feature activation-1.26e-03

Pos logit contributions
 heights+0.001
 hesitation+0.001
used+0.001
 competitors+0.001
 practicing+0.001
Neg logit contributions
grand-0.001
 lived-0.000
 woke-0.000
L-0.000
oc-0.000
Token spun
Token ID26843
Feature activation-7.26e-03

Pos logit contributions
 heights+0.003
used+0.003
 hesitation+0.003
 competitors+0.003
joy+0.003
Neg logit contributions
grand-0.002
 lived-0.001
 woke-0.001
L-0.001
oc-0.001
Token around
Token ID1088
Feature activation+6.87e-03

Pos logit contributions
 heights+0.016
used+0.016
 hesitation+0.015
 competitors+0.014
 practicing+0.014
Neg logit contributions
 lived-0.010
grand-0.010
L-0.010
 woke-0.010
oc-0.009
Token.
Token ID13
Feature activation-6.59e-03

Pos logit contributions
 heights+0.008
used+0.007
 hesitation+0.007
 competitors+0.007
joy+0.007
Neg logit contributions
 lived-0.003
grand-0.003
 woke-0.003
 was-0.002
tr-0.002
Token They
Token ID1119
Feature activation-4.83e-04

Pos logit contributions
 heights+0.016
used+0.015
 hesitation+0.015
 competitors+0.014
 practicing+0.014
Neg logit contributions
grand-0.008
 lived-0.008
 woke-0.007
L-0.007
oc-0.007
Token were
Token ID547
Feature activation-1.28e-02

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
joy+0.001
 competitors+0.001
Neg logit contributions
 lived-0.000
grand-0.000
 woke-0.000
L-0.000
 was-0.000
Token called
Token ID1444
Feature activation-1.40e-03

Pos logit contributions
used+0.030
 heights+0.030
 hesitation+0.028
ipping+0.027
 competitors+0.027
Neg logit contributions
 lived-0.017
 named-0.014
 was-0.014
 woke-0.013
 ruled-0.013
Token ost
Token ID23619
Feature activation-3.00e-02

Pos logit contributions
 heights+0.003
used+0.003
 hesitation+0.003
 practicing+0.003
 competitors+0.003
Neg logit contributions
 lived-0.002
grand-0.002
 woke-0.002
tr-0.002
L-0.002
Tokenric
Token ID1173
Feature activation-6.80e-02

Pos logit contributions
 hesitation+0.059
 heights+0.058
used+0.054
joy+0.053
 attempts+0.052
Neg logit contributions
 lived-0.045
grand-0.043
tr-0.042
man-0.042
 woke-0.039
Tokenhes
Token ID956
Feature activation-8.32e-03

Pos logit contributions
 heights+0.194
 hesitation+0.186
used+0.185
joy+0.173
 competitors+0.172
Neg logit contributions
grand-0.057
 lived-0.056
 woke-0.048
L-0.043
tr-0.040
Token.
Token ID13
Feature activation-7.25e-03

Pos logit contributions
 heights+0.018
used+0.018
 hesitation+0.017
 competitors+0.016
 practicing+0.015
Neg logit contributions
grand-0.013
 lived-0.011
L-0.011
 woke-0.010
oc-0.010
Token Lily
Token ID20037
Feature activation+1.09e-02

Pos logit contributions
 heights+0.017
used+0.016
 hesitation+0.016
 competitors+0.015
joy+0.015
Neg logit contributions
grand-0.010
 lived-0.009
 woke-0.009
L-0.008
oc-0.008
Token and
Token ID290
Feature activation+5.29e-03

Pos logit contributions
grand+0.011
 lived+0.010
 woke+0.009
L+0.008
oc+0.008
Neg logit contributions
 heights-0.030
used-0.029
 hesitation-0.028
 competitors-0.026
joy-0.026
Token Ben
Token ID3932
Feature activation-2.48e-03

Pos logit contributions
 lived+0.008
grand+0.007
 woke+0.007
L+0.007
 was+0.006
Neg logit contributions
used-0.012
 heights-0.011
 hesitation-0.011
ipping-0.011
 competitors-0.010
Token girl
Token ID2576
Feature activation+1.18e-02

Pos logit contributions
 heights+0.012
used+0.011
 hesitation+0.011
 competitors+0.010
iously+0.010
Neg logit contributions
grand-0.007
 lived-0.006
 woke-0.006
L-0.006
oc-0.006
Token who
Token ID508
Feature activation+5.39e-04

Pos logit contributions
grand+0.011
 lived+0.009
ork+0.008
rost+0.008
 woke+0.008
Neg logit contributions
 heights-0.030
used-0.030
 hesitation-0.028
 practicing-0.028
joy-0.027
Token didn
Token ID1422
Feature activation-1.76e-02

Pos logit contributions
grand+0.001
ork+0.001
rim+0.001
rost+0.001
izer+0.001
Neg logit contributions
used-0.001
 practicing-0.001
 muscles-0.001
 heights-0.001
joy-0.001
Token't
Token ID470
Feature activation+1.06e-03

Pos logit contributions
used+0.027
 heights+0.027
 practicing+0.023
 hesitation+0.023
ner+0.022
Neg logit contributions
grand-0.036
 lived-0.034
oc-0.033
 woke-0.032
L-0.031
Token like
Token ID588
Feature activation-3.11e-03

Pos logit contributions
grand+0.002
tr+0.002
man+0.002
oc+0.002
 lived+0.002
Neg logit contributions
 heights-0.002
 hesitation-0.002
used-0.002
 muscles-0.002
 practicing-0.002
Token av
Token ID1196
Feature activation-6.54e-02

Pos logit contributions
used+0.007
 heights+0.006
 hesitation+0.006
 competitors+0.006
joy+0.006
Neg logit contributions
grand-0.005
 lived-0.005
 woke-0.005
L-0.004
 ash-0.004
Tokenoc
Token ID420
Feature activation-2.17e-02

Pos logit contributions
 heights+0.219
used+0.217
 hesitation+0.215
 competitors+0.202
joy+0.201
Neg logit contributions
grand-0.022
oc-0.017
 lived-0.013
 woke-0.010
L-0.007
Tokenados
Token ID22484
Feature activation-3.20e-02

Pos logit contributions
 hesitation+0.030
used+0.027
 heights+0.026
 Riley+0.025
 Leah+0.024
Neg logit contributions
grand-0.046
oc-0.042
 lived-0.040
 woke-0.039
 ash-0.039
Token.
Token ID13
Feature activation-1.71e-02

Pos logit contributions
used+0.084
 heights+0.084
joy+0.080
 hesitation+0.079
 Majesty+0.076
Neg logit contributions
 lived-0.033
grand-0.033
 woke-0.029
 was-0.026
L-0.025
Token She
Token ID1375
Feature activation-1.90e-03

Pos logit contributions
 heights+0.033
used+0.031
 hesitation+0.030
 competitors+0.028
joy+0.027
Neg logit contributions
grand-0.031
 lived-0.029
 woke-0.027
L-0.027
oc-0.026
Token always
Token ID1464
Feature activation+3.88e-03

Pos logit contributions
 heights+0.004
used+0.004
 practicing+0.004
 muscles+0.004
 hesitation+0.004
Neg logit contributions
grand-0.003
izer-0.002
oc-0.002
tr-0.002
rost-0.002
Token to
Token ID284
Feature activation-7.17e-03

Pos logit contributions
 heights+0.024
used+0.023
 hesitation+0.022
 competitors+0.022
 runners+0.021
Neg logit contributions
 lived-0.016
 woke-0.016
grand-0.015
 was-0.013
 ruled-0.013
Token send
Token ID3758
Feature activation-1.44e-03

Pos logit contributions
 heights+0.017
used+0.016
 hesitation+0.016
 competitors+0.015
joy+0.015
Neg logit contributions
grand-0.010
 lived-0.010
 woke-0.009
oc-0.008
L-0.008
Token soft
Token ID2705
Feature activation-1.68e-02

Pos logit contributions
 heights+0.003
used+0.003
 hesitation+0.003
 competitors+0.003
joy+0.003
Neg logit contributions
grand-0.002
 lived-0.002
 woke-0.002
L-0.002
oc-0.002
Token white
Token ID2330
Feature activation-1.41e-02

Pos logit contributions
 heights+0.032
used+0.032
 hesitation+0.030
 competitors+0.028
 practicing+0.027
Neg logit contributions
grand-0.029
 lived-0.028
 woke-0.027
oc-0.025
L-0.024
Token snow
Token ID6729
Feature activation-4.61e-03

Pos logit contributions
 heights+0.027
used+0.026
 hesitation+0.025
 competitors+0.023
joy+0.022
Neg logit contributions
grand-0.025
 lived-0.024
 woke-0.023
oc-0.022
L-0.022
Tokenfl
Token ID2704
Feature activation-6.53e-02

Pos logit contributions
used+0.010
 heights+0.010
 hesitation+0.010
joy+0.009
 competitors+0.009
Neg logit contributions
grand-0.007
 lived-0.007
man-0.007
tr-0.007
 woke-0.006
Tokenakes
Token ID1124
Feature activation-1.64e-02

Pos logit contributions
 hesitation+0.116
 heights+0.112
used+0.108
joy+0.099
 attempts+0.095
Neg logit contributions
grand-0.124
 lived-0.122
tr-0.119
 woke-0.111
oc-0.110
Token into
Token ID656
Feature activation-1.07e-02

Pos logit contributions
 heights+0.041
used+0.039
 hesitation+0.038
 competitors+0.037
joy+0.036
Neg logit contributions
grand-0.019
 lived-0.018
 woke-0.018
tr-0.015
 was-0.015
Token the
Token ID262
Feature activation+1.25e-03

Pos logit contributions
 heights+0.024
used+0.024
 hesitation+0.023
 competitors+0.021
joy+0.021
Neg logit contributions
grand-0.015
 lived-0.014
 woke-0.014
L-0.013
oc-0.012
Token sky
Token ID6766
Feature activation-1.12e-02

Pos logit contributions
grand+0.002
 lived+0.002
 woke+0.002
L+0.002
oc+0.002
Neg logit contributions
 heights-0.003
used-0.002
 hesitation-0.002
joy-0.002
 competitors-0.002
Token.
Token ID13
Feature activation+2.71e-03

Pos logit contributions
 heights+0.030
used+0.029
 hesitation+0.029
 competitors+0.027
 runners+0.026
Neg logit contributions
 lived-0.011
grand-0.010
 woke-0.010
 was-0.009
tr-0.009
Token a
Token ID257
Feature activation-1.81e-03

Pos logit contributions
 heights+0.006
used+0.006
 hesitation+0.006
 competitors+0.005
joy+0.005
Neg logit contributions
grand-0.005
 lived-0.004
 woke-0.004
L-0.004
oc-0.004
Token garden
Token ID11376
Feature activation-9.37e-03

Pos logit contributions
 heights+0.004
used+0.004
 hesitation+0.004
 competitors+0.004
joy+0.003
Neg logit contributions
grand-0.003
 lived-0.002
 woke-0.002
L-0.002
oc-0.002
Token full
Token ID1336
Feature activation+1.78e-03

Pos logit contributions
 heights+0.021
used+0.020
 hesitation+0.019
joy+0.018
 competitors+0.018
Neg logit contributions
grand-0.014
 lived-0.013
 woke-0.012
L-0.012
oc-0.011
Token of
Token ID286
Feature activation-4.40e-03

Pos logit contributions
grand+0.003
 lived+0.002
L+0.002
oc+0.002
 woke+0.002
Neg logit contributions
 heights-0.004
 hesitation-0.004
used-0.004
 competitors-0.003
joy-0.003
Token wild
Token ID4295
Feature activation-9.07e-03

Pos logit contributions
 heights+0.008
 hesitation+0.008
used+0.008
 competitors+0.007
joy+0.007
Neg logit contributions
grand-0.008
 lived-0.007
 woke-0.007
L-0.007
oc-0.007
Tokenfl
Token ID2704
Feature activation-6.52e-02

Pos logit contributions
used+0.017
 heights+0.017
 hesitation+0.017
 competitors+0.015
 practicing+0.015
Neg logit contributions
grand-0.016
 lived-0.016
 woke-0.015
L-0.014
oc-0.014
Tokenowers
Token ID3618
Feature activation-2.94e-02

Pos logit contributions
 hesitation+0.113
 heights+0.108
used+0.100
joy+0.094
 attempts+0.091
Neg logit contributions
grand-0.132
 lived-0.129
tr-0.124
 woke-0.116
oc-0.115
Token.
Token ID13
Feature activation+1.69e-03

Pos logit contributions
 heights+0.081
used+0.078
 hesitation+0.075
 competitors+0.073
joy+0.072
Neg logit contributions
grand-0.030
 lived-0.028
 woke-0.024
 was-0.022
oc-0.021
Token He
Token ID679
Feature activation-1.67e-03

Pos logit contributions
grand+0.002
 lived+0.002
 woke+0.002
L+0.002
oc+0.002
Neg logit contributions
 heights-0.004
used-0.004
 hesitation-0.004
 competitors-0.004
 practicing-0.004
Token decided
Token ID3066
Feature activation+4.71e-03

Pos logit contributions
 heights+0.004
used+0.004
 hesitation+0.004
 competitors+0.003
 practicing+0.003
Neg logit contributions
grand-0.002
 lived-0.002
L-0.002
tr-0.002
oc-0.002
Token to
Token ID284
Feature activation+5.74e-03

Pos logit contributions
grand+0.007
 woke+0.006
 lived+0.006
L+0.005
 was+0.005
Neg logit contributions
 heights-0.011
used-0.011
 hesitation-0.010
 competitors-0.010
 breaths-0.009
Token back
Token ID736
Feature activation+6.57e-03

Pos logit contributions
 heights+0.053
used+0.049
 hesitation+0.049
 competitors+0.048
 attempts+0.046
Neg logit contributions
grand-0.025
 lived-0.025
 woke-0.025
 was-0.024
L-0.024
Token with
Token ID351
Feature activation+6.30e-03

Pos logit contributions
grand+0.008
 lived+0.008
 woke+0.007
L+0.007
oc+0.006
Neg logit contributions
 heights-0.016
used-0.015
 hesitation-0.015
 competitors-0.014
 runners-0.014
Token lots
Token ID6041
Feature activation-5.87e-04

Pos logit contributions
grand+0.009
 lived+0.009
 woke+0.008
L+0.008
oc+0.007
Neg logit contributions
 heights-0.014
used-0.014
 hesitation-0.013
 competitors-0.012
 practicing-0.012
Token of
Token ID286
Feature activation-7.35e-03

Pos logit contributions
 heights+0.002
used+0.002
 hesitation+0.001
 competitors+0.001
 practicing+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.000
oc-0.000
Token sun
Token ID4252
Feature activation-1.83e-03

Pos logit contributions
 heights+0.014
used+0.014
 hesitation+0.013
 competitors+0.012
joy+0.012
Neg logit contributions
grand-0.013
 lived-0.012
 woke-0.011
L-0.011
oc-0.010
Tokenfl
Token ID2704
Feature activation-6.50e-02

Pos logit contributions
 heights+0.004
used+0.004
 hesitation+0.004
 competitors+0.004
 runners+0.004
Neg logit contributions
 lived-0.003
grand-0.003
 woke-0.002
 was-0.002
man-0.002
Tokenowers
Token ID3618
Feature activation-2.94e-02

Pos logit contributions
 heights+0.108
 hesitation+0.107
used+0.095
 attempts+0.089
joy+0.088
Neg logit contributions
 lived-0.131
grand-0.130
tr-0.120
 woke-0.117
oc-0.116
Token to
Token ID284
Feature activation-1.28e-03

Pos logit contributions
 heights+0.077
used+0.073
 hesitation+0.070
 competitors+0.070
joy+0.068
Neg logit contributions
 lived-0.033
grand-0.031
 woke-0.030
 was-0.027
 belonged-0.025
Token fill
Token ID6070
Feature activation-3.69e-03

Pos logit contributions
 heights+0.003
used+0.003
 hesitation+0.003
 competitors+0.002
joy+0.002
Neg logit contributions
grand-0.002
 lived-0.002
 woke-0.002
L-0.002
oc-0.002
Token the
Token ID262
Feature activation+8.88e-03

Pos logit contributions
 heights+0.008
used+0.007
 hesitation+0.007
joy+0.006
 practicing+0.006
Neg logit contributions
grand-0.006
 lived-0.005
 woke-0.005
tr-0.005
L-0.005
Token prince
Token ID19716
Feature activation-1.24e-02

Pos logit contributions
grand+0.015
 lived+0.014
 woke+0.014
L+0.013
oc+0.013
Neg logit contributions
 heights-0.017
used-0.017
 hesitation-0.016
 competitors-0.015
joy-0.015
Token who
Token ID508
Feature activation+1.90e-03

Pos logit contributions
grand+0.009
 lived+0.007
 woke+0.006
ork+0.005
tr+0.005
Neg logit contributions
 heights-0.029
used-0.029
 hesitation-0.028
 practicing-0.027
joy-0.026
Token liked
Token ID8288
Feature activation-1.09e-02

Pos logit contributions
grand+0.003
ork+0.002
rim+0.002
rost+0.002
auc+0.002
Neg logit contributions
used-0.004
 practicing-0.004
 heights-0.003
 muscles-0.003
 hesitation-0.003
Token to
Token ID284
Feature activation-1.19e-02

Pos logit contributions
used+0.025
 heights+0.024
 hesitation+0.024
 competitors+0.022
iously+0.021
Neg logit contributions
grand-0.016
 woke-0.014
 lived-0.014
L-0.013
tr-0.012
Token collect
Token ID2824
Feature activation-1.01e-02

Pos logit contributions
 heights+0.027
 hesitation+0.025
used+0.025
 muscles+0.023
 competitors+0.023
Neg logit contributions
grand-0.017
 lived-0.015
oc-0.015
L-0.015
 woke-0.014
Token or
Token ID393
Feature activation-1.21e-02

Pos logit contributions
 heights+0.019
 hesitation+0.017
used+0.017
 competitors+0.015
joy+0.015
Neg logit contributions
grand-0.019
 lived-0.018
 woke-0.017
L-0.016
tr-0.016
Tokennam
Token ID7402
Feature activation-6.49e-02

Pos logit contributions
 heights+0.017
used+0.016
 hesitation+0.016
 competitors+0.014
joy+0.014
Neg logit contributions
grand-0.028
 lived-0.026
 woke-0.025
L-0.025
oc-0.024
Tokenents
Token ID658
Feature activation-2.16e-02

Pos logit contributions
 heights+0.073
 hesitation+0.069
used+0.063
joy+0.053
 competitors+0.051
Neg logit contributions
grand-0.173
 lived-0.160
L-0.157
oc-0.154
 woke-0.153
Token.
Token ID13
Feature activation-9.46e-03

Pos logit contributions
 heights+0.048
used+0.046
 hesitation+0.046
 competitors+0.042
joy+0.041
Neg logit contributions
grand-0.032
 lived-0.029
 woke-0.027
L-0.027
oc-0.026
Token Everywhere
Token ID48201
Feature activation-2.37e-03

Pos logit contributions
 heights+0.018
used+0.017
 hesitation+0.016
 competitors+0.015
joy+0.015
Neg logit contributions
grand-0.017
 lived-0.016
 woke-0.016
oc-0.015
L-0.015
Token she
Token ID673
Feature activation+1.73e-02

Pos logit contributions
 heights+0.004
used+0.004
 hesitation+0.003
 competitors+0.003
joy+0.003
Neg logit contributions
grand-0.005
 lived-0.005
 woke-0.005
L-0.005
oc-0.004
Token went
Token ID1816
Feature activation-8.35e-03

Pos logit contributions
grand+0.024
tr+0.020
rim+0.019
oc+0.019
rost+0.019
Neg logit contributions
 heights-0.034
used-0.033
 competitors-0.032
 breaths-0.031
 practicing-0.031
Token John
Token ID1757
Feature activation+9.58e-05

Pos logit contributions
 heights+0.021
 hesitation+0.020
used+0.020
joy+0.019
 competitors+0.018
Neg logit contributions
 lived-0.008
 woke-0.007
grand-0.007
 was-0.006
L-0.006
Token chose
Token ID7690
Feature activation-7.84e-03

Pos logit contributions
 lived+0.000
grand+0.000
 woke+0.000
 was+0.000
oc+0.000
Neg logit contributions
 heights-0.000
 hesitation-0.000
used-0.000
 competitors-0.000
joy-0.000
Token some
Token ID617
Feature activation-6.78e-04

Pos logit contributions
 heights+0.018
used+0.017
 hesitation+0.017
 competitors+0.015
joy+0.015
Neg logit contributions
grand-0.011
 lived-0.011
 woke-0.010
L-0.009
 was-0.009
Token carrots
Token ID34397
Feature activation-2.24e-02

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 competitors+0.001
joy+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
oc-0.001
Token and
Token ID290
Feature activation+4.28e-03

Pos logit contributions
 heights+0.055
used+0.052
 hesitation+0.051
joy+0.048
 competitors+0.048
Neg logit contributions
grand-0.029
 lived-0.028
 woke-0.025
L-0.023
oc-0.023
Token cuc
Token ID38421
Feature activation-6.37e-02

Pos logit contributions
 lived+0.006
grand+0.006
 woke+0.006
 was+0.006
oc+0.005
Neg logit contributions
 heights-0.010
used-0.009
 hesitation-0.009
joy-0.009
 competitors-0.008
Tokenumbers
Token ID17024
Feature activation-1.50e-02

Pos logit contributions
 heights+0.130
 hesitation+0.121
used+0.109
joy+0.109
 attempts+0.106
Neg logit contributions
grand-0.109
oc-0.105
L-0.098
 lived-0.096
aker-0.089
Token to
Token ID284
Feature activation+5.30e-03

Pos logit contributions
 heights+0.035
used+0.033
 hesitation+0.032
 competitors+0.030
joy+0.030
Neg logit contributions
grand-0.021
 lived-0.020
 woke-0.019
L-0.018
oc-0.017
Token take
Token ID1011
Feature activation+1.54e-03

Pos logit contributions
grand+0.007
 lived+0.006
 woke+0.006
L+0.006
oc+0.005
Neg logit contributions
 heights-0.013
used-0.012
 hesitation-0.012
 competitors-0.011
joy-0.011
Token home
Token ID1363
Feature activation+1.50e-02

Pos logit contributions
grand+0.002
 lived+0.002
 woke+0.002
L+0.002
oc+0.002
Neg logit contributions
 heights-0.003
used-0.003
 hesitation-0.003
 competitors-0.003
joy-0.003
Token.
Token ID13
Feature activation+6.49e-05

Pos logit contributions
grand+0.019
 lived+0.017
L+0.016
 woke+0.016
oc+0.016
Neg logit contributions
 heights-0.037
used-0.035
 hesitation-0.034
 competitors-0.032
joy-0.032
Token<|endoftext|>
Token ID50256
Feature activation-8.08e-03

Pos logit contributions
grand+0.012
 lived+0.011
 woke+0.010
L+0.010
oc+0.010
Neg logit contributions
 heights-0.020
used-0.019
 hesitation-0.019
 competitors-0.017
joy-0.017
TokenL
Token ID43
Feature activation+2.93e-02

Pos logit contributions
 heights+0.015
used+0.015
 hesitation+0.014
joy+0.014
 practicing+0.013
Neg logit contributions
grand-0.015
 lived-0.012
 woke-0.012
tr-0.012
L-0.012
Tokenily
Token ID813
Feature activation+2.43e-02

Pos logit contributions
grand+0.049
oc+0.047
 lived+0.047
tr+0.046
 hyd+0.044
Neg logit contributions
 hesitation-0.051
 heights-0.050
iley-0.048
used-0.048
joy-0.046
Token loved
Token ID6151
Feature activation-2.55e-03

Pos logit contributions
grand+0.033
 lived+0.028
L+0.027
tr+0.026
rost+0.026
Neg logit contributions
 heights-0.055
used-0.054
 hesitation-0.052
 practicing-0.049
joy-0.049
Token sun
Token ID4252
Feature activation+5.52e-03

Pos logit contributions
used+0.006
 heights+0.005
 hesitation+0.005
 competitors+0.005
joy+0.005
Neg logit contributions
grand-0.004
 lived-0.004
 woke-0.004
L-0.003
 ash-0.003
Tokenfl
Token ID2704
Feature activation-6.36e-02

Pos logit contributions
 lived+0.008
grand+0.008
 woke+0.008
man+0.007
L+0.007
Neg logit contributions
 heights-0.012
used-0.012
 hesitation-0.012
 competitors-0.011
 attempts-0.010
Tokenowers
Token ID3618
Feature activation-1.67e-02

Pos logit contributions
 heights+0.097
 hesitation+0.095
used+0.084
 attempts+0.076
 competitors+0.075
Neg logit contributions
 lived-0.141
grand-0.136
tr-0.128
 woke-0.128
L-0.125
Token.
Token ID13
Feature activation-1.85e-03

Pos logit contributions
 heights+0.042
used+0.041
 hesitation+0.040
 competitors+0.038
joy+0.037
Neg logit contributions
 lived-0.020
grand-0.019
 woke-0.017
L-0.016
 was-0.015
Token She
Token ID1375
Feature activation+1.30e-02

Pos logit contributions
 heights+0.004
used+0.004
 hesitation+0.004
joy+0.003
 competitors+0.003
Neg logit contributions
grand-0.003
 lived-0.003
 woke-0.003
L-0.002
oc-0.002
Token liked
Token ID8288
Feature activation-1.26e-02

Pos logit contributions
grand+0.014
oc+0.011
rost+0.010
tr+0.010
L+0.010
Neg logit contributions
 heights-0.032
used-0.030
 hesitation-0.030
 practicing-0.029
 competitors-0.028
Token to
Token ID284
Feature activation-5.53e-03

Pos logit contributions
used+0.030
 hesitation+0.029
 heights+0.028
 competitors+0.027
joy+0.026
Neg logit contributions
grand-0.016
 lived-0.016
 woke-0.015
 was-0.013
 ash-0.013
Token tree
Token ID5509
Feature activation-2.30e-02

Pos logit contributions
 heights+0.068
used+0.066
 hesitation+0.065
 competitors+0.060
joy+0.060
Neg logit contributions
grand-0.029
 lived-0.028
 woke-0.025
L-0.023
oc-0.022
Token could
Token ID714
Feature activation+9.30e-04

Pos logit contributions
 heights+0.060
used+0.058
 hesitation+0.057
 competitors+0.053
joy+0.053
Neg logit contributions
grand-0.025
 lived-0.022
 woke-0.021
L-0.020
oc-0.019
Token see
Token ID766
Feature activation+5.86e-03

Pos logit contributions
grand+0.001
L+0.001
 lived+0.001
 woke+0.001
oc+0.001
Neg logit contributions
 heights-0.002
used-0.002
 hesitation-0.002
 practicing-0.002
 competitors-0.002
Token lots
Token ID6041
Feature activation+2.03e-03

Pos logit contributions
grand+0.009
 lived+0.009
 woke+0.008
oc+0.008
L+0.008
Neg logit contributions
 heights-0.013
used-0.012
 hesitation-0.012
 competitors-0.011
joy-0.011
Token of
Token ID286
Feature activation-4.71e-03

Pos logit contributions
grand+0.002
 lived+0.002
 woke+0.002
oc+0.001
L+0.001
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.005
 competitors-0.005
joy-0.005
Token av
Token ID1196
Feature activation-6.34e-02

Pos logit contributions
 heights+0.010
used+0.009
 hesitation+0.009
 competitors+0.008
joy+0.008
Neg logit contributions
grand-0.008
 lived-0.007
 woke-0.007
L-0.007
oc-0.007
Tokenoc
Token ID420
Feature activation-2.74e-02

Pos logit contributions
 hesitation+0.236
 heights+0.235
used+0.232
 competitors+0.219
 Riley+0.219
Neg logit contributions
oc-0.006
grand+0.004
 lived+0.015
 woke+0.019
L+0.020
Tokenados
Token ID22484
Feature activation-3.29e-02

Pos logit contributions
 hesitation+0.039
 Riley+0.035
 heights+0.034
used+0.034
 Leah+0.032
Neg logit contributions
grand-0.059
oc-0.058
 lived-0.050
ork-0.049
 ash-0.049
Token that
Token ID326
Feature activation+2.75e-03

Pos logit contributions
 heights+0.087
used+0.085
 hesitation+0.083
joy+0.082
 attempts+0.079
Neg logit contributions
 lived-0.030
grand-0.030
 was-0.028
 woke-0.026
oc-0.024
Token the
Token ID262
Feature activation-2.51e-03

Pos logit contributions
grand+0.004
L+0.003
 woke+0.003
tr+0.003
 lived+0.003
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
 practicing-0.005
 competitors-0.005
Token supplier
Token ID22693
Feature activation-8.99e-03

Pos logit contributions
used+0.005
 heights+0.005
 hesitation+0.005
 practicing+0.005
 competitors+0.005
Neg logit contributions
grand-0.004
 lived-0.004
 woke-0.003
L-0.003
oc-0.003
Token chains
Token ID14659
Feature activation-1.28e-02

Pos logit contributions
 heights+0.008
used+0.008
 hesitation+0.008
 competitors+0.007
joy+0.007
Neg logit contributions
grand-0.005
 lived-0.004
 woke-0.004
L-0.004
tr-0.004
Token,
Token ID11
Feature activation-1.18e-02

Pos logit contributions
 heights+0.030
used+0.029
 hesitation+0.028
 competitors+0.026
joy+0.026
Neg logit contributions
grand-0.017
 lived-0.016
 woke-0.015
L-0.015
oc-0.014
Token glitter
Token ID31133
Feature activation-2.87e-03

Pos logit contributions
 heights+0.023
used+0.022
 hesitation+0.021
 competitors+0.019
joy+0.019
Neg logit contributions
grand-0.021
 lived-0.020
 woke-0.019
L-0.019
oc-0.018
Tokeny
Token ID88
Feature activation-1.64e-02

Pos logit contributions
 heights+0.007
used+0.007
 hesitation+0.007
 competitors+0.006
joy+0.006
Neg logit contributions
 lived-0.003
grand-0.003
 woke-0.003
 was-0.003
L-0.003
Token snow
Token ID6729
Feature activation-6.60e-04

Pos logit contributions
 heights+0.036
used+0.036
 hesitation+0.035
joy+0.032
 competitors+0.032
Neg logit contributions
grand-0.024
 lived-0.023
 woke-0.022
oc-0.021
tr-0.021
Tokenfl
Token ID2704
Feature activation-6.32e-02

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
joy+0.001
 competitors+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
tr-0.001
man-0.001
Tokenakes
Token ID1124
Feature activation-7.94e-03

Pos logit contributions
 hesitation+0.131
 heights+0.129
used+0.123
joy+0.115
 competitors+0.109
Neg logit contributions
 lived-0.102
grand-0.102
tr-0.100
 woke-0.093
oc-0.091
Token,
Token ID11
Feature activation-8.28e-03

Pos logit contributions
 heights+0.020
used+0.020
 hesitation+0.019
joy+0.019
 competitors+0.018
Neg logit contributions
 lived-0.009
 woke-0.008
grand-0.008
 was-0.007
tr-0.007
Token and
Token ID290
Feature activation-5.58e-03

Pos logit contributions
 heights+0.020
used+0.019
 hesitation+0.019
 competitors+0.017
joy+0.017
Neg logit contributions
grand-0.011
 lived-0.010
 woke-0.009
L-0.009
oc-0.008
Token a
Token ID257
Feature activation+3.38e-04

Pos logit contributions
used+0.012
 heights+0.012
 hesitation+0.011
joy+0.010
 competitors+0.010
Neg logit contributions
grand-0.009
 lived-0.008
 woke-0.008
L-0.007
tr-0.007
Token picture
Token ID4286
Feature activation-1.62e-03

Pos logit contributions
grand+0.001
 lived+0.000
 woke+0.000
L+0.000
oc+0.000
Neg logit contributions
 heights-0.001
used-0.001
 hesitation-0.001
 competitors-0.001
joy-0.001
Token for
Token ID329
Feature activation+1.05e-02

Pos logit contributions
 heights+0.022
used+0.020
 hesitation+0.019
 competitors+0.018
 attempts+0.018
Neg logit contributions
grand-0.014
 lived-0.014
 woke-0.013
L-0.012
 was-0.012
Token some
Token ID617
Feature activation-4.48e-03

Pos logit contributions
grand+0.015
 lived+0.014
 woke+0.013
L+0.013
 ash+0.012
Neg logit contributions
 heights-0.023
used-0.023
 hesitation-0.021
 competitors-0.021
joy-0.020
Token funny
Token ID8258
Feature activation-1.04e-02

Pos logit contributions
 heights+0.010
used+0.009
 hesitation+0.009
 competitors+0.008
joy+0.008
Neg logit contributions
grand-0.007
 lived-0.007
 woke-0.006
L-0.006
oc-0.006
Token little
Token ID1310
Feature activation-1.65e-02

Pos logit contributions
 heights+0.021
used+0.021
 hesitation+0.020
 competitors+0.018
joy+0.018
Neg logit contributions
grand-0.017
 lived-0.016
 woke-0.016
L-0.014
oc-0.014
Token sun
Token ID4252
Feature activation-4.88e-03

Pos logit contributions
 heights+0.034
used+0.033
 hesitation+0.030
yles+0.029
 competitors+0.028
Neg logit contributions
grand-0.025
 woke-0.024
 was-0.023
 lived-0.023
oc-0.022
Tokenfl
Token ID2704
Feature activation-6.31e-02

Pos logit contributions
 heights+0.012
used+0.011
 hesitation+0.011
 competitors+0.011
joy+0.010
Neg logit contributions
grand-0.006
 lived-0.006
 woke-0.005
tr-0.005
man-0.005
Tokenowers
Token ID3618
Feature activation-3.39e-02

Pos logit contributions
 hesitation+0.115
 heights+0.112
used+0.097
 attempts+0.095
joy+0.094
Neg logit contributions
 lived-0.119
tr-0.115
grand-0.115
oc-0.107
ase-0.104
Token.
Token ID13
Feature activation+1.01e-03

Pos logit contributions
 heights+0.087
used+0.082
 hesitation+0.078
 competitors+0.077
joy+0.076
Neg logit contributions
 lived-0.040
grand-0.038
 woke-0.035
 was-0.033
 belonged-0.031
Token Her
Token ID2332
Feature activation-6.27e-03

Pos logit contributions
grand+0.001
 lived+0.001
L+0.001
 woke+0.001
oc+0.001
Neg logit contributions
 heights-0.002
 hesitation-0.002
used-0.002
 competitors-0.002
 practicing-0.002
Token m
Token ID285
Feature activation-8.50e-03

Pos logit contributions
 heights+0.012
used+0.012
 hesitation+0.011
 competitors+0.011
 runners+0.011
Neg logit contributions
 lived-0.010
grand-0.010
 was-0.010
 woke-0.010
tr-0.009
Tokenama
Token ID1689
Feature activation+2.91e-03

Pos logit contributions
 heights+0.014
 hesitation+0.013
used+0.012
 competitors+0.011
joy+0.011
Neg logit contributions
grand-0.018
 lived-0.017
oc-0.016
tr-0.015
 woke-0.015
Token da
Token ID12379
Feature activation-1.22e-02

Pos logit contributions
 heights+0.008
used+0.007
 hesitation+0.007
 competitors+0.007
joy+0.006
Neg logit contributions
grand-0.006
 lived-0.006
 woke-0.006
L-0.005
oc-0.005
Tokenis
Token ID271
Feature activation-1.67e-02

Pos logit contributions
 heights+0.024
used+0.024
 hesitation+0.023
 competitors+0.021
joy+0.021
Neg logit contributions
grand-0.021
 lived-0.020
 woke-0.019
L-0.018
oc-0.018
Tokenies
Token ID444
Feature activation-8.72e-03

Pos logit contributions
 heights+0.033
used+0.033
 hesitation+0.031
joy+0.030
 competitors+0.029
Neg logit contributions
grand-0.027
 lived-0.027
 woke-0.026
L-0.025
oc-0.023
Token,
Token ID11
Feature activation-2.27e-03

Pos logit contributions
 heights+0.021
used+0.019
 hesitation+0.019
joy+0.019
 competitors+0.019
Neg logit contributions
 lived-0.012
 woke-0.011
grand-0.010
 was-0.010
 named-0.009
Token sun
Token ID4252
Feature activation-2.42e-04

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.005
 competitors+0.004
joy+0.004
Neg logit contributions
grand-0.003
 lived-0.003
 woke-0.003
L-0.003
oc-0.003
Tokenfl
Token ID2704
Feature activation-6.29e-02

Pos logit contributions
 heights+0.000
 hesitation+0.000
used+0.000
joy+0.000
 competitors+0.000
Neg logit contributions
 lived-0.000
grand-0.000
 woke-0.000
tr-0.000
 was-0.000
Tokenowers
Token ID3618
Feature activation-2.45e-02

Pos logit contributions
 hesitation+0.106
 heights+0.103
used+0.093
 attempts+0.091
joy+0.090
Neg logit contributions
 lived-0.130
grand-0.123
 woke-0.116
tr-0.115
oc-0.113
Token,
Token ID11
Feature activation-8.52e-04

Pos logit contributions
 heights+0.059
used+0.055
joy+0.054
 hesitation+0.054
 competitors+0.053
Neg logit contributions
 lived-0.034
grand-0.031
 woke-0.030
 was-0.028
 belonged-0.025
Token and
Token ID290
Feature activation-9.05e-04

Pos logit contributions
 heights+0.002
used+0.002
 hesitation+0.002
 competitors+0.002
joy+0.002
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
oc-0.001
Token tul
Token ID48373
Feature activation-4.15e-02

Pos logit contributions
used+0.002
 heights+0.002
 hesitation+0.002
joy+0.002
 competitors+0.002
Neg logit contributions
 lived-0.001
grand-0.001
 woke-0.001
 was-0.001
L-0.001
Tokenips
Token ID2419
Feature activation-1.77e-03

Pos logit contributions
 heights+0.070
 hesitation+0.069
used+0.066
 attempts+0.063
 competitors+0.062
Neg logit contributions
grand-0.074
 lived-0.070
L-0.069
 woke-0.069
oc-0.068
Token a
Token ID257
Feature activation+6.79e-03

Pos logit contributions
 heights+0.001
 hesitation+0.001
used+0.001
joy+0.001
 competitors+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
oc-0.001
Token big
Token ID1263
Feature activation+2.87e-03

Pos logit contributions
grand+0.011
 lived+0.010
 woke+0.009
L+0.009
oc+0.008
Neg logit contributions
 heights-0.015
used-0.014
 hesitation-0.014
 competitors-0.013
joy-0.013
Token field
Token ID2214
Feature activation-2.94e-03

Pos logit contributions
grand+0.005
 lived+0.004
 woke+0.004
L+0.004
oc+0.004
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
 competitors-0.005
 practicing-0.005
Token of
Token ID286
Feature activation+5.01e-03

Pos logit contributions
 heights+0.007
used+0.006
 hesitation+0.006
joy+0.006
 competitors+0.006
Neg logit contributions
grand-0.005
 lived-0.004
 woke-0.004
L-0.004
oc-0.004
Token sun
Token ID4252
Feature activation-3.67e-03

Pos logit contributions
grand+0.009
 lived+0.008
 woke+0.008
L+0.008
oc+0.008
Neg logit contributions
 heights-0.010
used-0.009
 hesitation-0.009
 competitors-0.008
joy-0.008
Tokenfl
Token ID2704
Feature activation-6.27e-02

Pos logit contributions
 heights+0.008
 hesitation+0.008
used+0.007
 competitors+0.007
 attempts+0.007
Neg logit contributions
 lived-0.005
 woke-0.005
grand-0.005
man-0.005
 was-0.005
Tokenowers
Token ID3618
Feature activation-2.82e-02

Pos logit contributions
 hesitation+0.114
 heights+0.106
used+0.095
 attempts+0.094
joy+0.090
Neg logit contributions
 lived-0.122
tr-0.113
grand-0.110
 woke-0.107
ase-0.107
Token.
Token ID13
Feature activation-1.24e-03

Pos logit contributions
 heights+0.074
 hesitation+0.069
used+0.069
 Majesty+0.068
 competitors+0.067
Neg logit contributions
 lived-0.031
 woke-0.027
 was-0.026
grand-0.025
 belonged-0.022
Token The
Token ID383
Feature activation-7.11e-03

Pos logit contributions
 heights+0.003
 hesitation+0.003
used+0.003
 competitors+0.003
 practicing+0.003
Neg logit contributions
grand-0.001
 lived-0.001
L-0.001
 woke-0.001
oc-0.001
Token sun
Token ID4252
Feature activation-1.25e-03

Pos logit contributions
used+0.014
 heights+0.014
 hesitation+0.013
 practicing+0.013
 competitors+0.012
Neg logit contributions
 lived-0.011
grand-0.011
 woke-0.010
L-0.010
 ash-0.010
Tokenflower
Token ID25547
Feature activation-1.35e-02

Pos logit contributions
 heights+0.003
 hesitation+0.003
used+0.003
 competitors+0.002
joy+0.002
Neg logit contributions
 lived-0.002
grand-0.002
 woke-0.002
L-0.002
tr-0.002
Token big
Token ID1263
Feature activation+5.05e-03

Pos logit contributions
grand+0.015
 lived+0.014
 woke+0.013
L+0.012
oc+0.012
Neg logit contributions
 heights-0.024
used-0.023
 hesitation-0.023
 competitors-0.021
joy-0.021
Token cage
Token ID17186
Feature activation-2.45e-03

Pos logit contributions
grand+0.008
 lived+0.007
 woke+0.007
L+0.006
oc+0.006
Neg logit contributions
 heights-0.011
used-0.011
 hesitation-0.010
 competitors-0.010
joy-0.010
Token with
Token ID351
Feature activation-2.84e-04

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.005
 competitors+0.004
joy+0.004
Neg logit contributions
grand-0.004
 lived-0.004
 woke-0.004
L-0.003
oc-0.003
Token lots
Token ID6041
Feature activation+3.27e-03

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 competitors+0.000
joy+0.000
Neg logit contributions
grand-0.000
 lived-0.000
 woke-0.000
L-0.000
oc-0.000
Token of
Token ID286
Feature activation-4.18e-03

Pos logit contributions
grand+0.004
 lived+0.003
 woke+0.003
L+0.003
oc+0.003
Neg logit contributions
 heights-0.009
used-0.008
 hesitation-0.008
joy-0.008
 competitors-0.008
Token gor
Token ID30344
Feature activation-6.22e-02

Pos logit contributions
 heights+0.008
 hesitation+0.008
used+0.008
 competitors+0.007
joy+0.007
Neg logit contributions
grand-0.007
 lived-0.007
L-0.007
 woke-0.007
tr-0.006
Tokenillas
Token ID25314
Feature activation-2.92e-02

Pos logit contributions
 heights+0.109
 hesitation+0.107
 Riley+0.101
 movements+0.099
 Majesty+0.097
Neg logit contributions
 lived-0.095
grand-0.093
ep-0.091
oc-0.090
 ruled-0.087
Token inside
Token ID2641
Feature activation+3.17e-04

Pos logit contributions
 heights+0.076
 hesitation+0.072
used+0.071
joy+0.069
 attempts+0.068
Neg logit contributions
grand-0.033
 lived-0.031
 woke-0.029
 was-0.026
oc-0.024
Token!"
Token ID2474
Feature activation-4.60e-03

Pos logit contributions
grand+0.000
 lived+0.000
 woke+0.000
oc+0.000
L+0.000
Neg logit contributions
 heights-0.001
 hesitation-0.001
used-0.001
 competitors-0.001
joy-0.001
Token
Token ID198
Feature activation-5.73e-04

Pos logit contributions
 heights+0.010
used+0.009
 hesitation+0.009
 competitors+0.008
joy+0.008
Neg logit contributions
grand-0.007
 lived-0.007
 woke-0.006
L-0.006
oc-0.006
Token
Token ID198
Feature activation-9.27e-03

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
joy+0.001
 competitors+0.001
Neg logit contributions
grand-0.001
 lived-0.001
tr-0.001
 woke-0.001
 hyd-0.001
Token lots
Token ID6041
Feature activation+1.74e-03

Pos logit contributions
grand+0.000
 lived+0.000
 woke+0.000
L+0.000
oc+0.000
Neg logit contributions
 heights-0.000
used-0.000
 hesitation-0.000
 competitors-0.000
joy-0.000
Token of
Token ID286
Feature activation-1.07e-02

Pos logit contributions
grand+0.002
 lived+0.002
 woke+0.001
L+0.001
oc+0.001
Neg logit contributions
 heights-0.005
used-0.005
 hesitation-0.004
joy-0.004
 competitors-0.004
Token trees
Token ID7150
Feature activation-1.55e-02

Pos logit contributions
 heights+0.021
used+0.020
 hesitation+0.020
 competitors+0.018
joy+0.018
Neg logit contributions
grand-0.019
 lived-0.017
 woke-0.017
L-0.016
oc-0.016
Token and
Token ID290
Feature activation-2.67e-03

Pos logit contributions
 heights+0.041
used+0.040
 hesitation+0.040
 competitors+0.038
 Majesty+0.038
Neg logit contributions
 lived-0.015
 woke-0.013
grand-0.013
 was-0.013
oc-0.011
Token wild
Token ID4295
Feature activation-1.44e-02

Pos logit contributions
used+0.005
 heights+0.005
 hesitation+0.005
joy+0.004
ipping+0.004
Neg logit contributions
 lived-0.005
grand-0.005
 woke-0.005
L-0.004
 was-0.004
Tokenfl
Token ID2704
Feature activation-6.20e-02

Pos logit contributions
used+0.031
 hesitation+0.030
 heights+0.028
 practicing+0.027
ner+0.026
Neg logit contributions
 lived-0.023
 woke-0.021
grand-0.020
 was-0.020
oc-0.019
Tokenowers
Token ID3618
Feature activation-2.60e-02

Pos logit contributions
 hesitation+0.118
 heights+0.107
used+0.104
joy+0.098
 attempts+0.096
Neg logit contributions
 lived-0.116
tr-0.107
grand-0.103
 woke-0.097
room-0.097
Token.
Token ID13
Feature activation-2.27e-03

Pos logit contributions
 heights+0.071
used+0.069
 hesitation+0.066
 Majesty+0.065
 competitors+0.065
Neg logit contributions
 lived-0.026
 was-0.022
grand-0.022
 woke-0.021
 belonged-0.019
Token As
Token ID1081
Feature activation-2.62e-03

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.005
 competitors+0.005
joy+0.005
Neg logit contributions
grand-0.003
 lived-0.003
 woke-0.003
L-0.003
oc-0.002
Token he
Token ID339
Feature activation+1.16e-02

Pos logit contributions
used+0.006
 heights+0.006
 hesitation+0.006
 competitors+0.005
iously+0.005
Neg logit contributions
 lived-0.004
grand-0.003
 woke-0.003
L-0.003
Tim-0.003
Token stepped
Token ID10764
Feature activation-3.72e-03

Pos logit contributions
grand+0.020
 lived+0.016
L+0.016
tr+0.015
 woke+0.015
Neg logit contributions
 heights-0.024
used-0.023
 hesitation-0.023
 competitors-0.021
 practicing-0.020
Token she
Token ID673
Feature activation-8.30e-03

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
L+0.001
 was+0.001
Neg logit contributions
 heights-0.001
used-0.001
 hesitation-0.001
 competitors-0.001
 practicing-0.001
Token says
Token ID1139
Feature activation-1.19e-02

Pos logit contributions
 heights+0.017
used+0.016
 hesitation+0.016
 competitors+0.015
joy+0.014
Neg logit contributions
grand-0.014
 lived-0.013
 woke-0.012
L-0.012
oc-0.012
Token.
Token ID13
Feature activation-2.88e-03

Pos logit contributions
 heights+0.034
used+0.031
 hesitation+0.030
 competitors+0.030
 Majesty+0.029
Neg logit contributions
grand-0.010
 woke-0.009
 lived-0.009
L-0.009
 was-0.008
Token "
Token ID366
Feature activation+9.06e-03

Pos logit contributions
 heights+0.008
used+0.008
 hesitation+0.007
 competitors+0.007
iously+0.007
Neg logit contributions
L-0.003
grand-0.003
 woke-0.003
 lived-0.003
 was-0.002
TokenSomeone
Token ID28211
Feature activation+7.93e-03

Pos logit contributions
 lived+0.017
 woke+0.017
grand+0.016
 belonged+0.014
oc+0.014
Neg logit contributions
 heights-0.015
used-0.014
 competitors-0.013
 runners-0.012
 hesitation-0.012
Token must
Token ID1276
Feature activation+2.88e-02

Pos logit contributions
grand+0.010
 lived+0.009
L+0.009
 woke+0.009
oc+0.008
Neg logit contributions
 heights-0.019
used-0.018
 hesitation-0.018
 competitors-0.016
 practicing-0.016
Token have
Token ID423
Feature activation+8.24e-03

Pos logit contributions
grand+0.047
oc+0.047
tr+0.044
 woke+0.043
L+0.041
Neg logit contributions
 heights-0.050
 hesitation-0.048
used-0.046
 competitors-0.042
 practicing-0.041
Token left
Token ID1364
Feature activation+2.22e-02

Pos logit contributions
grand+0.012
 lived+0.011
 woke+0.010
L+0.010
oc+0.009
Neg logit contributions
 heights-0.019
used-0.018
 hesitation-0.018
 competitors-0.016
joy-0.016
Token them
Token ID606
Feature activation-1.12e-02

Pos logit contributions
grand+0.035
 lived+0.033
 woke+0.031
L+0.030
oc+0.029
Neg logit contributions
 heights-0.047
used-0.045
 hesitation-0.044
 competitors-0.040
joy-0.039
Token here
Token ID994
Feature activation+3.13e-03

Pos logit contributions
 heights+0.026
used+0.025
 hesitation+0.024
 competitors+0.022
joy+0.022
Neg logit contributions
grand-0.016
 lived-0.014
 woke-0.014
L-0.013
oc-0.013
Token.
Token ID13
Feature activation+4.59e-03

Pos logit contributions
grand+0.004
 lived+0.004
 woke+0.004
L+0.004
oc+0.003
Neg logit contributions
 heights-0.007
used-0.007
 hesitation-0.007
 competitors-0.006
joy-0.006
Tokenuddle
Token ID24500
Feature activation+1.96e-02

Pos logit contributions
 heights+0.008
used+0.008
 hesitation+0.008
 competitors+0.007
joy+0.007
Neg logit contributions
grand-0.008
 lived-0.007
 woke-0.007
L-0.007
oc-0.007
Token,
Token ID11
Feature activation-3.30e-03

Pos logit contributions
grand+0.024
 lived+0.024
 woke+0.022
L+0.022
tr+0.020
Neg logit contributions
 heights-0.047
used-0.046
 hesitation-0.045
 competitors-0.041
joy-0.041
Token making
Token ID1642
Feature activation-1.61e-03

Pos logit contributions
 heights+0.007
used+0.007
 hesitation+0.007
 competitors+0.006
joy+0.006
Neg logit contributions
 lived-0.005
 woke-0.005
grand-0.005
L-0.004
 was-0.004
Token a
Token ID257
Feature activation+1.35e-02

Pos logit contributions
used+0.004
 hesitation+0.004
 heights+0.004
 competitors+0.003
joy+0.003
Neg logit contributions
 lived-0.002
grand-0.002
L-0.002
 woke-0.002
 was-0.002
Token big
Token ID1263
Feature activation+5.58e-03

Pos logit contributions
grand+0.018
 lived+0.017
 woke+0.016
L+0.015
oc+0.014
Neg logit contributions
 heights-0.032
used-0.031
 hesitation-0.031
 competitors-0.028
joy-0.028
Token mess
Token ID2085
Feature activation+2.41e-02

Pos logit contributions
grand+0.007
 lived+0.007
 woke+0.007
L+0.006
oc+0.006
Neg logit contributions
 heights-0.013
used-0.013
 hesitation-0.013
 competitors-0.012
joy-0.011
Token.
Token ID13
Feature activation+3.53e-03

Pos logit contributions
grand+0.030
 lived+0.029
 woke+0.027
L+0.027
oc+0.024
Neg logit contributions
 heights-0.059
used-0.056
 hesitation-0.056
joy-0.051
 competitors-0.051
Token They
Token ID1119
Feature activation+3.25e-03

Pos logit contributions
grand+0.005
 lived+0.005
 woke+0.004
L+0.004
oc+0.004
Neg logit contributions
 heights-0.008
used-0.008
 hesitation-0.008
 competitors-0.007
joy-0.007
Token did
Token ID750
Feature activation-2.13e-03

Pos logit contributions
grand+0.005
 lived+0.004
oc+0.004
L+0.004
 woke+0.004
Neg logit contributions
 heights-0.007
used-0.007
 hesitation-0.007
 competitors-0.006
 practicing-0.006
Token not
Token ID407
Feature activation+3.16e-03

Pos logit contributions
used+0.006
 heights+0.006
 hesitation+0.005
 competitors+0.005
joy+0.005
Neg logit contributions
 lived-0.002
grand-0.002
 woke-0.002
L-0.002
 was-0.002
Token see
Token ID766
Feature activation+4.98e-03

Pos logit contributions
grand+0.004
 woke+0.004
 lived+0.004
L+0.004
oc+0.003
Neg logit contributions
 heights-0.008
used-0.008
 hesitation-0.007
joy-0.007
iously-0.007
Token
Token ID198
Feature activation-2.66e-03

Pos logit contributions
grand+0.012
tr+0.011
 lived+0.011
 woke+0.011
oc+0.010
Neg logit contributions
 heights-0.013
used-0.013
 hesitation-0.013
joy-0.012
 practicing-0.012
TokenThe
Token ID464
Feature activation-6.31e-03

Pos logit contributions
used+0.005
 heights+0.005
 hesitation+0.005
 competitors+0.005
 practicing+0.005
Neg logit contributions
grand-0.004
 lived-0.004
 woke-0.004
tr-0.004
oc-0.003
Token family
Token ID1641
Feature activation+1.33e-02

Pos logit contributions
 heights+0.013
 hesitation+0.013
used+0.012
 competitors+0.011
joy+0.011
Neg logit contributions
grand-0.010
 lived-0.009
 woke-0.009
L-0.009
oc-0.008
Token spent
Token ID3377
Feature activation+4.59e-03

Pos logit contributions
grand+0.014
L+0.011
tr+0.011
 lived+0.010
oc+0.010
Neg logit contributions
 heights-0.034
used-0.034
 hesitation-0.033
 competitors-0.032
 practicing-0.031
Token the
Token ID262
Feature activation+2.21e-02

Pos logit contributions
grand+0.007
 lived+0.007
 woke+0.006
L+0.006
oc+0.006
Neg logit contributions
 heights-0.010
used-0.009
 hesitation-0.009
 competitors-0.008
joy-0.008
Token next
Token ID1306
Feature activation+2.47e-02

Pos logit contributions
grand+0.035
 lived+0.031
L+0.029
 woke+0.029
oc+0.029
Neg logit contributions
 heights-0.045
 hesitation-0.043
used-0.042
 competitors-0.041
 practicing-0.040
Token week
Token ID1285
Feature activation-4.46e-03

Pos logit contributions
grand+0.025
 lived+0.023
 woke+0.022
L+0.021
oc+0.020
Neg logit contributions
 heights-0.066
used-0.064
 hesitation-0.063
joy-0.057
 practicing-0.056
Token playing
Token ID2712
Feature activation+6.36e-03

Pos logit contributions
 heights+0.011
used+0.010
 hesitation+0.010
 breaths+0.009
 competitors+0.009
Neg logit contributions
grand-0.006
 lived-0.005
 woke-0.005
 was-0.005
L-0.004
Token games
Token ID1830
Feature activation-1.18e-04

Pos logit contributions
grand+0.010
 lived+0.009
 woke+0.009
L+0.009
tr+0.008
Neg logit contributions
 heights-0.014
used-0.013
 hesitation-0.013
 competitors-0.012
joy-0.012
Token,
Token ID11
Feature activation-2.52e-03

Pos logit contributions
 heights+0.000
used+0.000
 hesitation+0.000
 competitors+0.000
joy+0.000
Neg logit contributions
 lived-0.000
grand-0.000
 woke-0.000
L-0.000
 was-0.000
Token lou
Token ID21081
Feature activation+1.24e-03

Pos logit contributions
 heights+0.005
used+0.004
 hesitation+0.004
 competitors+0.004
joy+0.004
Neg logit contributions
grand-0.005
 lived-0.004
 woke-0.004
L-0.004
oc-0.004
Token.
Token ID13
Feature activation+4.95e-03

Pos logit contributions
grand+0.002
 lived+0.002
 woke+0.001
L+0.001
oc+0.001
Neg logit contributions
 heights-0.003
used-0.003
 hesitation-0.003
 competitors-0.002
joy-0.002
Token
Token ID220
Feature activation-5.18e-03

Pos logit contributions
grand+0.006
 lived+0.006
 woke+0.006
L+0.006
oc+0.005
Neg logit contributions
 heights-0.012
used-0.012
 hesitation-0.012
 competitors-0.011
iously-0.010
Token
Token ID198
Feature activation+1.31e-02

Pos logit contributions
 heights+0.011
used+0.011
 hesitation+0.010
joy+0.010
 competitors+0.010
Neg logit contributions
grand-0.008
 lived-0.007
 woke-0.007
oc-0.006
tr-0.006
Token
Token ID198
Feature activation-7.74e-04

Pos logit contributions
grand+0.018
 lived+0.017
 woke+0.015
tr+0.015
oc+0.015
Neg logit contributions
 heights-0.029
used-0.028
 hesitation-0.028
joy-0.026
 competitors-0.026
Token"
Token ID1
Feature activation+9.79e-03

Pos logit contributions
 heights+0.002
used+0.002
 hesitation+0.002
 practicing+0.002
 competitors+0.002
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
tr-0.001
oc-0.001
TokenL
Token ID43
Feature activation+2.05e-02

Pos logit contributions
grand+0.017
 lived+0.015
oc+0.015
ork+0.015
rost+0.013
Neg logit contributions
 heights-0.018
used-0.017
joy-0.017
 hesitation-0.017
 practicing-0.016
Tokenily
Token ID813
Feature activation+1.93e-04

Pos logit contributions
grand+0.027
oc+0.025
 lived+0.024
tr+0.023
 woke+0.022
Neg logit contributions
 heights-0.045
used-0.045
 hesitation-0.044
joy-0.042
 practicing-0.042
Token,
Token ID11
Feature activation-3.48e-03

Pos logit contributions
grand+0.000
 lived+0.000
 woke+0.000
oc+0.000
L+0.000
Neg logit contributions
 heights-0.000
 hesitation-0.000
used-0.000
 competitors-0.000
joy-0.000
Token can
Token ID460
Feature activation-1.16e-02

Pos logit contributions
 heights+0.007
used+0.007
 hesitation+0.007
 practicing+0.006
 competitors+0.006
Neg logit contributions
grand-0.006
 lived-0.005
oc-0.005
L-0.005
 woke-0.005
Token I
Token ID314
Feature activation+6.53e-03

Pos logit contributions
 heights+0.019
used+0.018
 hesitation+0.018
 competitors+0.016
joy+0.016
Neg logit contributions
grand-0.024
 lived-0.023
 woke-0.022
L-0.021
oc-0.021
Token play
Token ID711
Feature activation+1.04e-02

Pos logit contributions
grand+0.010
 lived+0.009
 woke+0.009
L+0.008
tr+0.008
Neg logit contributions
used-0.013
 heights-0.013
 hesitation-0.012
 competitors-0.011
 muscles-0.011
Token loved
Token ID6151
Feature activation+6.81e-04

Pos logit contributions
grand+0.013
 lived+0.010
L+0.010
 woke+0.009
tr+0.009
Neg logit contributions
 heights-0.036
used-0.035
 hesitation-0.034
 competitors-0.032
 practicing-0.032
Token his
Token ID465
Feature activation+1.21e-02

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
L+0.001
oc+0.001
Neg logit contributions
 heights-0.001
used-0.001
 hesitation-0.001
 competitors-0.001
joy-0.001
Token new
Token ID649
Feature activation+1.63e-02

Pos logit contributions
grand+0.019
 lived+0.018
 woke+0.017
L+0.017
oc+0.016
Neg logit contributions
 heights-0.026
used-0.025
 hesitation-0.024
 competitors-0.022
joy-0.022
Token toy
Token ID13373
Feature activation+4.87e-03

Pos logit contributions
grand+0.020
 lived+0.018
 woke+0.017
L+0.016
oc+0.015
Neg logit contributions
 heights-0.041
used-0.040
 hesitation-0.039
 competitors-0.036
joy-0.035
Token the
Token ID262
Feature activation+1.38e-02

Pos logit contributions
grand+0.006
 lived+0.006
 woke+0.005
oc+0.005
L+0.005
Neg logit contributions
 heights-0.012
used-0.012
 hesitation-0.011
joy-0.010
 competitors-0.010
Token most
Token ID749
Feature activation+2.14e-02

Pos logit contributions
grand+0.020
 lived+0.019
 woke+0.018
L+0.017
oc+0.016
Neg logit contributions
 heights-0.031
used-0.030
 hesitation-0.029
 competitors-0.027
joy-0.027
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
grand+0.030
 lived+0.028
 woke+0.026
L+0.025
oc+0.024
Neg logit contributions
 heights-0.050
used-0.048
 hesitation-0.047
 competitors-0.043
joy-0.042
Token He
Token ID679
Feature activation-2.61e-03

Pos logit contributions
grand+0.013
 lived+0.012
 woke+0.011
L+0.011
oc+0.010
Neg logit contributions
 heights-0.031
used-0.030
 hesitation-0.029
 competitors-0.027
 practicing-0.027
Token couldn
Token ID3521
Feature activation+2.44e-03

Pos logit contributions
 heights+0.006
used+0.006
 hesitation+0.006
 practicing+0.006
 competitors+0.006
Neg logit contributions
grand-0.003
L-0.002
tr-0.002
 lived-0.002
oc-0.002
Token't
Token ID470
Feature activation+8.93e-03

Pos logit contributions
grand+0.004
 lived+0.004
 woke+0.003
L+0.003
oc+0.003
Neg logit contributions
 heights-0.005
used-0.005
 hesitation-0.005
 competitors-0.004
joy-0.004
Token tell
Token ID1560
Feature activation+4.46e-04

Pos logit contributions
grand+0.019
 lived+0.017
 woke+0.017
tr+0.016
L+0.016
Neg logit contributions
 heights-0.014
 hesitation-0.014
used-0.013
 competitors-0.012
 practicing-0.011
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
 heights+0.002
used+0.002
 hesitation+0.002
 competitors+0.001
joy+0.001
Neg logit contributions
grand-0.001
 lived-0.001
L-0.001
 woke-0.001
oc-0.001
Token They
Token ID1119
Feature activation+2.54e-03

Pos logit contributions
 heights+0.027
used+0.026
 hesitation+0.025
joy+0.023
 competitors+0.023
Neg logit contributions
grand-0.018
 lived-0.016
 woke-0.015
L-0.015
oc-0.015
Token had
Token ID550
Feature activation+3.75e-03

Pos logit contributions
grand+0.003
oc+0.003
tr+0.002
ork+0.002
 woke+0.002
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
 competitors-0.006
 practicing-0.005
Token a
Token ID257
Feature activation-3.54e-05

Pos logit contributions
grand+0.005
 lived+0.005
 woke+0.005
L+0.005
tr+0.004
Neg logit contributions
 heights-0.008
used-0.008
 hesitation-0.008
 competitors-0.007
joy-0.007
Token big
Token ID1263
Feature activation-5.13e-03

Pos logit contributions
 heights+0.000
used+0.000
 hesitation+0.000
 competitors+0.000
joy+0.000
Neg logit contributions
grand-0.000
 lived-0.000
 woke-0.000
L-0.000
oc-0.000
Token drum
Token ID13026
Feature activation+1.10e-02

Pos logit contributions
 heights+0.011
used+0.010
 hesitation+0.010
 competitors+0.009
joy+0.009
Neg logit contributions
grand-0.008
 lived-0.008
 woke-0.007
L-0.007
oc-0.007
Token that
Token ID326
Feature activation-3.19e-03

Pos logit contributions
grand+0.015
 lived+0.014
 woke+0.013
L+0.012
tr+0.012
Neg logit contributions
 heights-0.026
used-0.025
 hesitation-0.025
joy-0.023
 competitors-0.023
Token made
Token ID925
Feature activation+1.09e-02

Pos logit contributions
 heights+0.007
used+0.007
 hesitation+0.007
 competitors+0.006
 practicing+0.006
Neg logit contributions
grand-0.004
oc-0.004
 lived-0.004
 woke-0.004
L-0.004
Token loud
Token ID7812
Feature activation+1.14e-03

Pos logit contributions
grand+0.017
 lived+0.016
 woke+0.015
L+0.015
oc+0.014
Neg logit contributions
 heights-0.023
used-0.022
 hesitation-0.022
 competitors-0.020
joy-0.020
Token noises
Token ID26782
Feature activation+1.60e-02

Pos logit contributions
grand+0.002
 lived+0.001
 woke+0.001
L+0.001
oc+0.001
Neg logit contributions
 heights-0.003
used-0.003
 hesitation-0.002
 competitors-0.002
joy-0.002
Token when
Token ID618
Feature activation-1.34e-02

Pos logit contributions
grand+0.018
 lived+0.017
 woke+0.016
L+0.015
oc+0.014
Neg logit contributions
 heights-0.041
used-0.040
 hesitation-0.039
joy-0.036
 competitors-0.036
Token passed
Token ID3804
Feature activation+4.33e-03

Pos logit contributions
grand+0.016
 lived+0.012
L+0.011
oc+0.011
 woke+0.011
Neg logit contributions
 heights-0.037
used-0.036
 hesitation-0.035
 competitors-0.033
 practicing-0.033
Token,
Token ID11
Feature activation-1.67e-03

Pos logit contributions
grand+0.007
 lived+0.006
 woke+0.006
L+0.006
tr+0.006
Neg logit contributions
 heights-0.009
used-0.009
 hesitation-0.009
 competitors-0.008
joy-0.008
Token and
Token ID290
Feature activation-7.16e-03

Pos logit contributions
 heights+0.004
 hesitation+0.003
used+0.003
 practicing+0.003
joy+0.003
Neg logit contributions
grand-0.003
 lived-0.002
L-0.002
 woke-0.002
tr-0.002
Token Benny
Token ID44275
Feature activation-4.43e-03

Pos logit contributions
 heights+0.016
used+0.015
 hesitation+0.014
 competitors+0.013
joy+0.013
Neg logit contributions
 lived-0.011
grand-0.011
 woke-0.010
L-0.010
oc-0.009
Token went
Token ID1816
Feature activation-6.18e-04

Pos logit contributions
 heights+0.012
used+0.012
 hesitation+0.011
 competitors+0.011
 practicing+0.011
Neg logit contributions
grand-0.004
 lived-0.004
 woke-0.003
L-0.003
tr-0.003
Token back
Token ID736
Feature activation+6.93e-03

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 competitors+0.001
 attempts+0.001
Neg logit contributions
 lived-0.001
grand-0.001
 woke-0.001
L-0.001
ork-0.001
Token to
Token ID284
Feature activation-6.25e-03

Pos logit contributions
grand+0.010
 lived+0.010
 woke+0.009
 was+0.008
L+0.008
Neg logit contributions
 heights-0.016
 hesitation-0.015
used-0.015
 competitors-0.014
 attempts-0.014
Token playing
Token ID2712
Feature activation+9.88e-03

Pos logit contributions
 heights+0.015
used+0.014
 hesitation+0.014
 competitors+0.013
joy+0.013
Neg logit contributions
grand-0.008
 lived-0.008
 woke-0.007
L-0.007
oc-0.007
Token in
Token ID287
Feature activation-1.19e-03

Pos logit contributions
grand+0.015
 lived+0.012
 woke+0.012
tr+0.012
L+0.011
Neg logit contributions
 heights-0.023
used-0.021
 hesitation-0.021
joy-0.020
 competitors-0.019
Token the
Token ID262
Feature activation+2.27e-03

Pos logit contributions
 heights+0.002
 hesitation+0.002
used+0.002
joy+0.002
 practicing+0.002
Neg logit contributions
grand-0.002
 lived-0.002
 woke-0.002
tr-0.002
L-0.002
Token sunshine
Token ID34488
Feature activation+3.44e-03

Pos logit contributions
grand+0.004
 lived+0.004
oc+0.004
 woke+0.004
rost+0.003
Neg logit contributions
 heights-0.004
used-0.004
 hesitation-0.004
 competitors-0.004
joy-0.003
Token the
Token ID262
Feature activation+2.33e-03

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 competitors+0.001
joy+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
oc-0.001
Token park
Token ID3952
Feature activation-4.62e-04

Pos logit contributions
grand+0.004
 lived+0.003
 woke+0.003
L+0.003
oc+0.003
Neg logit contributions
 heights-0.005
used-0.005
 hesitation-0.005
 competitors-0.004
joy-0.004
Token.
Token ID13
Feature activation-5.63e-04

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 competitors+0.001
joy+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
oc-0.000
Token They
Token ID1119
Feature activation-1.83e-03

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
joy+0.001
 competitors+0.001
Neg logit contributions
grand-0.001
 lived-0.001
oc-0.001
 woke-0.001
L-0.001
Token looked
Token ID3114
Feature activation-1.77e-03

Pos logit contributions
 heights+0.005
used+0.004
 hesitation+0.004
 competitors+0.004
joy+0.004
Neg logit contributions
grand-0.002
 lived-0.002
 woke-0.002
L-0.002
oc-0.002
Token up
Token ID510
Feature activation+7.82e-04

Pos logit contributions
 heights+0.004
used+0.004
 hesitation+0.003
 competitors+0.003
iously+0.003
Neg logit contributions
 lived-0.003
grand-0.003
 woke-0.003
L-0.002
oc-0.002
Token at
Token ID379
Feature activation+3.20e-03

Pos logit contributions
 lived+0.001
L+0.001
 woke+0.001
grand+0.001
 was+0.001
Neg logit contributions
 heights-0.002
used-0.002
 hesitation-0.002
 competitors-0.002
 Majesty-0.002
Token the
Token ID262
Feature activation-2.80e-03

Pos logit contributions
grand+0.005
 lived+0.005
 woke+0.005
L+0.005
oc+0.004
Neg logit contributions
 heights-0.007
used-0.006
 hesitation-0.006
 competitors-0.006
joy-0.006
Token sky
Token ID6766
Feature activation-4.59e-03

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.005
 competitors+0.005
joy+0.004
Neg logit contributions
grand-0.005
 lived-0.005
 woke-0.005
L-0.004
oc-0.004
Token and
Token ID290
Feature activation+1.16e-03

Pos logit contributions
used+0.013
 heights+0.013
 hesitation+0.013
 competitors+0.012
joy+0.012
Neg logit contributions
 lived-0.003
 woke-0.003
 was-0.003
tr-0.003
grand-0.002
Token saw
Token ID2497
Feature activation+5.97e-04

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
L+0.001
tr+0.001
Neg logit contributions
 heights-0.003
used-0.003
 hesitation-0.003
 competitors-0.003
joy-0.003
Token Lily
Token ID20037
Feature activation+9.11e-03

Pos logit contributions
grand+0.027
 lived+0.024
oc+0.023
 woke+0.022
tr+0.020
Neg logit contributions
 heights-0.050
used-0.049
 hesitation-0.048
joy-0.045
 practicing-0.044
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
grand+0.018
ork+0.017
rost+0.016
Ble+0.016
tr+0.016
Neg logit contributions
 heights-0.013
used-0.012
 practicing-0.011
 hesitation-0.011
joy-0.010
Token One
Token ID1881
Feature activation-6.03e-03

Pos logit contributions
 heights+0.019
used+0.018
 hesitation+0.017
joy+0.017
 muscles+0.016
Neg logit contributions
grand-0.024
 lived-0.021
oc-0.021
 woke-0.020
man-0.019
Token day
Token ID1110
Feature activation+1.19e-02

Pos logit contributions
 heights+0.007
 muscles+0.007
used+0.007
 hesitation+0.007
 practicing+0.007
Neg logit contributions
grand-0.011
Twe-0.010
oc-0.010
ork-0.010
rim-0.010
Token,
Token ID11
Feature activation+5.11e-03

Pos logit contributions
grand+0.022
 lived+0.021
 woke+0.020
L+0.019
oc+0.018
Neg logit contributions
 heights-0.023
used-0.021
 hesitation-0.021
 competitors-0.019
joy-0.019
Token Lily
Token ID20037
Feature activation+1.28e-02

Pos logit contributions
grand+0.011
oc+0.010
 lived+0.009
tr+0.009
 woke+0.009
Neg logit contributions
 heights-0.007
 hesitation-0.007
used-0.007
joy-0.006
 practicing-0.006
Token's
Token ID338
Feature activation+3.15e-03

Pos logit contributions
rost+0.019
grand+0.019
tr+0.018
ork+0.017
man+0.016
Neg logit contributions
 heights-0.021
 practicing-0.021
used-0.021
 muscles-0.020
 legs-0.019
Token mom
Token ID1995
Feature activation+1.27e-02

Pos logit contributions
grand+0.006
 lived+0.005
 woke+0.005
oc+0.005
L+0.005
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
 competitors-0.005
joy-0.005
Tokenmy
Token ID1820
Feature activation+3.46e-03

Pos logit contributions
ork+0.021
grand+0.021
rost+0.020
oc+0.020
tr+0.019
Neg logit contributions
 practicing-0.020
used-0.020
 enough-0.019
 hesitation-0.018
 heights-0.017
Token made
Token ID925
Feature activation-1.18e-03

Pos logit contributions
grand+0.005
tr+0.005
rost+0.005
ork+0.005
oc+0.005
Neg logit contributions
 practicing-0.006
used-0.006
 hesitation-0.006
 enough-0.006
 heights-0.006
Token some
Token ID617
Feature activation-2.03e-03

Pos logit contributions
 heights+0.002
 hesitation+0.002
used+0.002
 practicing+0.002
 muscles+0.002
Neg logit contributions
grand-0.002
 lived-0.002
oc-0.002
 woke-0.002
tr-0.002
Token she
Token ID673
Feature activation-6.07e-03

Pos logit contributions
 heights+0.009
used+0.008
 hesitation+0.008
 competitors+0.007
joy+0.007
Neg logit contributions
grand-0.006
 lived-0.006
 woke-0.006
L-0.005
 was-0.005
Token stumbled
Token ID24241
Feature activation-5.21e-03

Pos logit contributions
 heights+0.015
used+0.015
 hesitation+0.014
 competitors+0.013
joy+0.013
Neg logit contributions
grand-0.007
 lived-0.007
 woke-0.006
L-0.006
oc-0.006
Token around
Token ID1088
Feature activation-2.61e-03

Pos logit contributions
 heights+0.012
used+0.011
 hesitation+0.010
 competitors+0.010
 attempts+0.009
Neg logit contributions
 woke-0.008
L-0.008
grand-0.007
 lived-0.007
 was-0.007
Token the
Token ID262
Feature activation+4.95e-03

Pos logit contributions
 heights+0.006
used+0.005
 hesitation+0.005
 competitors+0.005
 runners+0.005
Neg logit contributions
 lived-0.004
grand-0.004
 woke-0.004
L-0.003
 was-0.003
Token park
Token ID3952
Feature activation-5.83e-03

Pos logit contributions
grand+0.008
 lived+0.007
 woke+0.007
L+0.006
oc+0.006
Neg logit contributions
 heights-0.011
used-0.010
 hesitation-0.010
 competitors-0.009
joy-0.009
Token until
Token ID1566
Feature activation+5.10e-04

Pos logit contributions
 heights+0.013
used+0.013
 hesitation+0.012
 competitors+0.012
 muscles+0.011
Neg logit contributions
 lived-0.008
grand-0.008
 woke-0.007
L-0.007
 was-0.007
Token the
Token ID262
Feature activation-1.96e-04

Pos logit contributions
 lived+0.001
 woke+0.001
grand+0.001
L+0.001
 was+0.001
Neg logit contributions
 heights-0.001
used-0.001
 hesitation-0.001
 competitors-0.001
 runners-0.001
Token sun
Token ID4252
Feature activation-2.14e-03

Pos logit contributions
 heights+0.000
used+0.000
 hesitation+0.000
iously+0.000
 practicing+0.000
Neg logit contributions
grand-0.000
 lived-0.000
 woke-0.000
L-0.000
tr-0.000
Token started
Token ID2067
Feature activation-1.08e-02

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.005
 competitors+0.005
joy+0.005
Neg logit contributions
grand-0.003
 lived-0.002
oc-0.002
L-0.002
 woke-0.002
Token to
Token ID284
Feature activation+5.76e-03

Pos logit contributions
 heights+0.023
used+0.022
 hesitation+0.022
 competitors+0.020
joy+0.019
Neg logit contributions
grand-0.016
 lived-0.016
 woke-0.016
L-0.015
tr-0.014
Token go
Token ID467
Feature activation+7.18e-03

Pos logit contributions
grand+0.009
 lived+0.008
oc+0.008
L+0.007
ork+0.007
Neg logit contributions
 heights-0.013
used-0.012
 hesitation-0.012
 practicing-0.011
joy-0.011
Token ran
Token ID4966
Feature activation-4.20e-03

Pos logit contributions
 heights+0.012
used+0.012
 hesitation+0.012
 competitors+0.011
joy+0.011
Neg logit contributions
grand-0.006
 lived-0.006
 woke-0.006
L-0.005
oc-0.005
Token around
Token ID1088
Feature activation-5.60e-03

Pos logit contributions
 heights+0.011
used+0.010
 hesitation+0.010
 competitors+0.009
 attempts+0.009
Neg logit contributions
 woke-0.005
 lived-0.005
grand-0.004
L-0.004
 was-0.004
Token the
Token ID262
Feature activation+2.90e-03

Pos logit contributions
 heights+0.013
used+0.013
 hesitation+0.012
 competitors+0.011
iously+0.011
Neg logit contributions
 lived-0.007
grand-0.007
 woke-0.007
L-0.006
oc-0.006
Token room
Token ID2119
Feature activation+1.71e-02

Pos logit contributions
grand+0.005
 lived+0.004
 woke+0.004
L+0.004
oc+0.004
Neg logit contributions
 heights-0.006
used-0.006
 hesitation-0.006
 competitors-0.005
joy-0.005
Token,
Token ID11
Feature activation-1.05e-02

Pos logit contributions
 lived+0.022
grand+0.022
 woke+0.021
L+0.019
 was+0.019
Neg logit contributions
 heights-0.040
used-0.039
 hesitation-0.037
 competitors-0.035
joy-0.033
Token laughing
Token ID14376
Feature activation-1.78e-02

Pos logit contributions
 heights+0.018
used+0.017
 hesitation+0.016
 competitors+0.014
joy+0.014
Neg logit contributions
grand-0.021
 lived-0.020
 woke-0.019
L-0.019
oc-0.018
Token and
Token ID290
Feature activation-5.73e-03

Pos logit contributions
 heights+0.049
used+0.049
 hesitation+0.047
 competitors+0.044
 practicing+0.042
Neg logit contributions
 lived-0.016
 woke-0.015
grand-0.014
L-0.013
 was-0.012
Token smiling
Token ID16755
Feature activation-9.29e-03

Pos logit contributions
 heights+0.011
used+0.010
 hesitation+0.009
 competitors+0.009
iously+0.009
Neg logit contributions
 lived-0.011
 woke-0.010
grand-0.010
 was-0.010
L-0.009
Token.
Token ID13
Feature activation-7.00e-03

Pos logit contributions
 heights+0.024
used+0.023
 hesitation+0.023
 competitors+0.021
 attempts+0.020
Neg logit contributions
 lived-0.011
 woke-0.010
 was-0.009
L-0.009
grand-0.009
Token They
Token ID1119
Feature activation-7.24e-03

Pos logit contributions
 heights+0.018
used+0.017
 hesitation+0.017
 competitors+0.015
 practicing+0.015
Neg logit contributions
grand-0.008
 lived-0.008
L-0.007
 woke-0.007
oc-0.007
Token had
Token ID550
Feature activation+6.14e-04

Pos logit contributions
 heights+0.018
used+0.017
 hesitation+0.017
 competitors+0.016
joy+0.015
Neg logit contributions
grand-0.009
 lived-0.008
 woke-0.008
L-0.007
oc-0.007
Token the
Token ID262
Feature activation+4.04e-03

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.005
 competitors+0.004
joy+0.004
Neg logit contributions
grand-0.004
 lived-0.003
 woke-0.003
L-0.003
oc-0.003
Token van
Token ID5719
Feature activation-6.01e-04

Pos logit contributions
grand+0.006
 lived+0.006
oc+0.005
L+0.005
 woke+0.005
Neg logit contributions
 heights-0.009
 hesitation-0.008
used-0.008
joy-0.008
 competitors-0.008
Token and
Token ID290
Feature activation+6.40e-03

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 practicing+0.001
 competitors+0.001
Neg logit contributions
grand-0.001
 lived-0.001
L-0.001
 woke-0.001
rost-0.001
Token wanted
Token ID2227
Feature activation-3.22e-03

Pos logit contributions
grand+0.007
 lived+0.006
 woke+0.006
L+0.006
oc+0.005
Neg logit contributions
 heights-0.017
used-0.016
 hesitation-0.016
 competitors-0.015
joy-0.015
Token to
Token ID284
Feature activation+6.98e-03

Pos logit contributions
 heights+0.007
used+0.007
 hesitation+0.006
 competitors+0.006
joy+0.006
Neg logit contributions
grand-0.005
 lived-0.005
 woke-0.005
L-0.004
tr-0.004
Token count
Token ID954
Feature activation-2.00e-02

Pos logit contributions
grand+0.011
 lived+0.010
tr+0.010
L+0.010
oc+0.010
Neg logit contributions
 heights-0.014
used-0.013
 hesitation-0.013
 competitors-0.012
 muscles-0.011
Token how
Token ID703
Feature activation+8.96e-04

Pos logit contributions
 hesitation+0.048
used+0.048
 heights+0.047
joy+0.046
 competitors+0.045
Neg logit contributions
 lived-0.021
grand-0.021
 ash-0.019
 was-0.019
 woke-0.018
Token many
Token ID867
Feature activation-1.71e-02

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
L+0.001
 was+0.001
Neg logit contributions
 heights-0.002
 hesitation-0.002
used-0.002
joy-0.002
 practicing-0.002
Token things
Token ID1243
Feature activation+2.92e-03

Pos logit contributions
 heights+0.035
used+0.034
 hesitation+0.033
joy+0.030
 competitors+0.030
Neg logit contributions
grand-0.029
 lived-0.027
 woke-0.025
L-0.025
oc-0.024
Token were
Token ID547
Feature activation-4.11e-03

Pos logit contributions
grand+0.004
 lived+0.004
 woke+0.004
L+0.003
 was+0.003
Neg logit contributions
 heights-0.007
used-0.007
 hesitation-0.006
joy-0.006
 competitors-0.006
Token inside
Token ID2641
Feature activation+4.23e-03

Pos logit contributions
used+0.010
 heights+0.010
 hesitation+0.010
joy+0.010
 competitors+0.009
Neg logit contributions
 lived-0.004
grand-0.004
 was-0.004
 woke-0.004
 ash-0.004
Token the
Token ID262
Feature activation-2.48e-03

Pos logit contributions
 heights+0.030
used+0.030
 hesitation+0.028
 competitors+0.027
joy+0.027
Neg logit contributions
grand-0.018
 lived-0.018
 woke-0.017
L-0.014
 was-0.014
Token birds
Token ID10087
Feature activation-1.60e-02

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.004
 competitors+0.004
joy+0.004
Neg logit contributions
grand-0.004
 lived-0.004
 woke-0.004
L-0.004
oc-0.004
Token would
Token ID561
Feature activation+1.31e-02

Pos logit contributions
 heights+0.043
used+0.042
 hesitation+0.041
joy+0.039
 competitors+0.038
Neg logit contributions
grand-0.016
 lived-0.015
 woke-0.014
L-0.012
oc-0.012
Token love
Token ID1842
Feature activation-6.35e-04

Pos logit contributions
grand+0.019
 lived+0.017
L+0.016
 woke+0.016
oc+0.015
Neg logit contributions
 heights-0.030
 hesitation-0.029
used-0.029
 competitors-0.026
 practicing-0.026
Token to
Token ID284
Feature activation+5.89e-03

Pos logit contributions
used+0.001
 heights+0.001
 hesitation+0.001
 competitors+0.001
joy+0.001
Neg logit contributions
 lived-0.001
grand-0.001
 woke-0.001
 was-0.001
L-0.001
Token build
Token ID1382
Feature activation-1.36e-02

Pos logit contributions
grand+0.008
 lived+0.007
 woke+0.007
L+0.006
oc+0.006
Neg logit contributions
 heights-0.014
used-0.014
 hesitation-0.013
 competitors-0.013
joy-0.012
Token a
Token ID257
Feature activation+8.82e-03

Pos logit contributions
used+0.030
 heights+0.030
 hesitation+0.029
 competitors+0.027
joy+0.026
Neg logit contributions
grand-0.019
 lived-0.019
 woke-0.017
L-0.017
oc-0.016
Token nest
Token ID16343
Feature activation-4.60e-03

Pos logit contributions
grand+0.013
 lived+0.012
 woke+0.011
L+0.011
oc+0.011
Neg logit contributions
 heights-0.020
used-0.019
 hesitation-0.019
 competitors-0.017
joy-0.017
Token with
Token ID351
Feature activation-4.20e-03

Pos logit contributions
 heights+0.010
used+0.010
 hesitation+0.010
 competitors+0.009
joy+0.009
Neg logit contributions
grand-0.007
 lived-0.006
 woke-0.006
L-0.006
oc-0.005
Token it
Token ID340
Feature activation-1.08e-02

Pos logit contributions
 heights+0.009
used+0.009
 hesitation+0.008
 competitors+0.008
joy+0.008
Neg logit contributions
grand-0.007
 lived-0.006
 woke-0.006
L-0.006
oc-0.005
Token!
Token ID0
Feature activation+2.03e-03

Pos logit contributions
 heights+0.027
used+0.026
 hesitation+0.025
joy+0.024
 competitors+0.024
Neg logit contributions
 lived-0.013
grand-0.012
 woke-0.011
 was-0.011
L-0.010
Token It
Token ID632
Feature activation+3.47e-03

Pos logit contributions
 heights+0.015
used+0.014
 hesitation+0.014
 practicing+0.013
 competitors+0.013
Neg logit contributions
grand-0.008
 lived-0.007
 woke-0.007
L-0.007
oc-0.006
Token saw
Token ID2497
Feature activation-4.77e-04

Pos logit contributions
grand+0.004
L+0.003
 lived+0.003
oc+0.003
tr+0.003
Neg logit contributions
 heights-0.009
used-0.009
 hesitation-0.009
 competitors-0.008
 practicing-0.008
Token green
Token ID4077
Feature activation-1.18e-02

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 competitors+0.001
joy+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
oc-0.001
Token trees
Token ID7150
Feature activation-1.40e-02

Pos logit contributions
 heights+0.022
used+0.020
 hesitation+0.020
 competitors+0.018
joy+0.018
Neg logit contributions
grand-0.023
 lived-0.021
 woke-0.020
L-0.020
oc-0.019
Token,
Token ID11
Feature activation-5.45e-03

Pos logit contributions
 heights+0.034
used+0.034
 hesitation+0.033
joy+0.031
 competitors+0.031
Neg logit contributions
 lived-0.016
grand-0.015
 woke-0.015
 was-0.014
oc-0.013
Token yellow
Token ID7872
Feature activation-1.68e-02

Pos logit contributions
 heights+0.010
used+0.009
 hesitation+0.009
 competitors+0.008
joy+0.008
Neg logit contributions
grand-0.011
 lived-0.010
 woke-0.010
L-0.009
oc-0.009
Token flowers
Token ID12734
Feature activation-1.84e-02

Pos logit contributions
used+0.033
 heights+0.033
 hesitation+0.032
 practicing+0.028
joy+0.028
Neg logit contributions
grand-0.028
 lived-0.027
 woke-0.026
L-0.024
oc-0.024
Token,
Token ID11
Feature activation-3.28e-03

Pos logit contributions
 heights+0.049
used+0.048
 hesitation+0.047
joy+0.045
 competitors+0.045
Neg logit contributions
 lived-0.019
 was-0.017
 woke-0.017
grand-0.015
 belonged-0.014
Token and
Token ID290
Feature activation-2.26e-03

Pos logit contributions
 heights+0.008
used+0.008
 hesitation+0.008
 competitors+0.007
joy+0.007
Neg logit contributions
grand-0.004
 lived-0.004
 woke-0.004
L-0.003
oc-0.003
Token blue
Token ID4171
Feature activation-1.82e-02

Pos logit contributions
used+0.005
 heights+0.004
 hesitation+0.004
joy+0.004
 competitors+0.004
Neg logit contributions
grand-0.004
 lived-0.004
 woke-0.003
L-0.003
 was-0.003
Token birds
Token ID10087
Feature activation-1.51e-02

Pos logit contributions
used+0.038
 heights+0.037
 hesitation+0.036
 practicing+0.033
joy+0.032
Neg logit contributions
grand-0.028
 lived-0.027
 woke-0.026
L-0.024
tr-0.024
Token!
Token ID0
Feature activation-1.72e-03

Pos logit contributions
 heights+0.018
used+0.017
 hesitation+0.017
 competitors+0.015
joy+0.015
Neg logit contributions
grand-0.011
 lived-0.011
 woke-0.010
L-0.010
oc-0.010
Token It
Token ID632
Feature activation+6.94e-03

Pos logit contributions
 heights+0.004
 hesitation+0.004
used+0.004
 competitors+0.003
joy+0.003
Neg logit contributions
grand-0.002
 lived-0.002
L-0.002
 woke-0.002
oc-0.002
Token was
Token ID373
Feature activation+1.60e-03

Pos logit contributions
grand+0.005
 lived+0.004
 woke+0.004
L+0.003
oc+0.003
Neg logit contributions
 heights-0.021
used-0.020
 hesitation-0.020
 competitors-0.019
joy-0.019
Token done
Token ID1760
Feature activation+4.37e-03

Pos logit contributions
grand+0.003
 lived+0.002
 woke+0.002
L+0.002
oc+0.002
Neg logit contributions
 heights-0.003
used-0.003
 hesitation-0.003
 competitors-0.003
joy-0.003
Token.
Token ID13
Feature activation-4.50e-05

Pos logit contributions
grand+0.007
 lived+0.006
 woke+0.006
L+0.006
oc+0.006
Neg logit contributions
 heights-0.010
used-0.009
 hesitation-0.009
 competitors-0.008
joy-0.008
Token
Token ID220
Feature activation-1.22e-02

Pos logit contributions
 heights+0.000
 hesitation+0.000
used+0.000
 competitors+0.000
 practicing+0.000
Neg logit contributions
grand-0.000
 lived-0.000
L-0.000
 woke-0.000
oc-0.000
Token
Token ID198
Feature activation+1.08e-02

Pos logit contributions
 heights+0.033
 hesitation+0.031
used+0.031
 competitors+0.029
joy+0.028
Neg logit contributions
grand-0.013
 lived-0.012
L-0.010
 woke-0.010
oc-0.009
Token
Token ID198
Feature activation+4.67e-04

Pos logit contributions
grand+0.013
 lived+0.012
 woke+0.011
L+0.011
oc+0.010
Neg logit contributions
 heights-0.026
used-0.025
 hesitation-0.025
 competitors-0.023
joy-0.023
TokenThe
Token ID464
Feature activation-4.19e-03

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
tr+0.000
oc+0.000
Neg logit contributions
 heights-0.001
used-0.001
 hesitation-0.001
 competitors-0.001
joy-0.001
Token spark
Token ID9009
Feature activation+7.24e-03

Pos logit contributions
 heights+0.009
used+0.009
 hesitation+0.008
 competitors+0.008
joy+0.008
Neg logit contributions
grand-0.007
 lived-0.006
 woke-0.006
L-0.006
oc-0.005
Tokenles
Token ID829
Feature activation-5.55e-03

Pos logit contributions
grand+0.010
 lived+0.008
L+0.008
tr+0.008
 woke+0.008
Neg logit contributions
 heights-0.017
used-0.016
 hesitation-0.016
 competitors-0.015
 practicing-0.015
Token She
Token ID1375
Feature activation-5.52e-04

Pos logit contributions
 heights+0.023
 hesitation+0.023
used+0.022
 competitors+0.020
joy+0.020
Neg logit contributions
grand-0.012
 lived-0.011
 woke-0.011
L-0.011
tr-0.010
Token saw
Token ID2497
Feature activation-1.83e-03

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 competitors+0.001
 practicing+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
oc-0.001
Token lots
Token ID6041
Feature activation+6.45e-03

Pos logit contributions
 heights+0.004
used+0.004
 hesitation+0.004
 competitors+0.004
joy+0.004
Neg logit contributions
grand-0.003
 lived-0.003
 woke-0.002
L-0.002
tr-0.002
Token of
Token ID286
Feature activation-6.65e-03

Pos logit contributions
grand+0.007
 lived+0.006
 woke+0.006
L+0.006
oc+0.005
Neg logit contributions
 heights-0.017
used-0.017
 hesitation-0.016
joy-0.015
 competitors-0.015
Token snow
Token ID6729
Feature activation+8.81e-03

Pos logit contributions
 heights+0.013
 hesitation+0.012
used+0.012
 competitors+0.011
joy+0.011
Neg logit contributions
grand-0.012
 lived-0.011
 woke-0.010
L-0.010
tr-0.010
Tokenfl
Token ID2704
Feature activation-5.35e-02

Pos logit contributions
 lived+0.011
 was+0.010
man+0.010
 woke+0.010
tr+0.010
Neg logit contributions
 hesitation-0.022
used-0.021
 heights-0.020
joy-0.019
 competitors-0.019
Tokenakes
Token ID1124
Feature activation+2.95e-03

Pos logit contributions
 hesitation+0.115
 heights+0.114
used+0.109
joy+0.101
 competitors+0.098
Neg logit contributions
 lived-0.081
grand-0.077
tr-0.075
 woke-0.073
L-0.073
Token,
Token ID11
Feature activation-5.47e-03

Pos logit contributions
 lived+0.003
 woke+0.003
 was+0.003
 ash+0.002
grand+0.002
Neg logit contributions
 heights-0.007
used-0.007
 hesitation-0.007
joy-0.007
 competitors-0.007
Token glitter
Token ID31133
Feature activation+6.11e-03

Pos logit contributions
 heights+0.012
 hesitation+0.012
used+0.011
 competitors+0.011
 practicing+0.010
Neg logit contributions
grand-0.008
 lived-0.007
 woke-0.007
L-0.007
oc-0.007
Tokening
Token ID278
Feature activation-7.53e-03

Pos logit contributions
 lived+0.007
 was+0.007
 woke+0.006
tr+0.006
man+0.006
Neg logit contributions
 heights-0.015
 hesitation-0.015
used-0.015
 Majesty-0.014
 competitors-0.014
Token in
Token ID287
Feature activation+4.52e-03

Pos logit contributions
used+0.018
 heights+0.018
 hesitation+0.018
 competitors+0.017
joy+0.016
Neg logit contributions
 lived-0.009
 woke-0.009
grand-0.008
 was-0.008
L-0.007
Token working
Token ID1762
Feature activation-1.22e-03

Pos logit contributions
 heights+0.005
used+0.004
 hesitation+0.004
 competitors+0.004
 runners+0.004
Neg logit contributions
 woke-0.004
 lived-0.004
grand-0.004
 ash-0.004
oc-0.004
Token and
Token ID290
Feature activation-5.11e-03

Pos logit contributions
 heights+0.003
used+0.003
 hesitation+0.002
 competitors+0.002
joy+0.002
Neg logit contributions
grand-0.002
 lived-0.002
 woke-0.002
L-0.002
oc-0.002
Token May
Token ID1737
Feature activation-1.66e-03

Pos logit contributions
 heights+0.011
used+0.010
 hesitation+0.010
 competitors+0.010
joy+0.009
Neg logit contributions
 lived-0.008
grand-0.007
 woke-0.007
 was-0.007
L-0.007
Token kept
Token ID4030
Feature activation-2.47e-03

Pos logit contributions
 heights+0.005
 hesitation+0.004
used+0.004
joy+0.004
 competitors+0.004
Neg logit contributions
 lived-0.002
grand-0.002
 woke-0.001
oc-0.001
L-0.001
Token on
Token ID319
Feature activation-9.42e-03

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.004
unn+0.004
 competitors+0.004
Neg logit contributions
 lived-0.004
 ash-0.004
 woke-0.003
grand-0.003
 was-0.003
Token pe
Token ID613
Feature activation-4.78e-02

Pos logit contributions
 heights+0.018
 hesitation+0.018
used+0.018
unn+0.017
 competitors+0.017
Neg logit contributions
 ash-0.013
grand-0.013
 repair-0.013
 was-0.013
 clean-0.012
Tokeneking
Token ID18754
Feature activation-2.26e-02

Pos logit contributions
 hesitation+0.091
used+0.090
 heights+0.090
joy+0.088
 Riley+0.082
Neg logit contributions
grand-0.073
ep-0.069
 lived-0.064
oc-0.063
aker-0.062
Token.
Token ID13
Feature activation+7.66e-04

Pos logit contributions
 heights+0.050
used+0.048
 hesitation+0.047
 competitors+0.044
 practicing+0.043
Neg logit contributions
 lived-0.031
 woke-0.030
grand-0.030
L-0.028
 was-0.027
Token Soon
Token ID15894
Feature activation-5.11e-03

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
L+0.001
oc+0.001
Neg logit contributions
 heights-0.002
 hesitation-0.002
used-0.002
 competitors-0.002
joy-0.002
Token the
Token ID262
Feature activation+3.62e-03

Pos logit contributions
used+0.013
 heights+0.013
 hesitation+0.012
 competitors+0.011
joy+0.011
Neg logit contributions
 lived-0.007
 woke-0.006
 was-0.006
L-0.005
grand-0.004
Token bird
Token ID6512
Feature activation-4.59e-03

Pos logit contributions
grand+0.005
 lived+0.005
 woke+0.005
tr+0.005
L+0.004
Neg logit contributions
used-0.008
 heights-0.008
 hesitation-0.008
 practicing-0.007
joy-0.007
Token,
Token ID11
Feature activation-3.00e-03

Pos logit contributions
 heights+0.024
used+0.023
joy+0.022
 hesitation+0.022
 competitors+0.022
Neg logit contributions
 lived-0.012
grand-0.010
 was-0.010
 woke-0.010
 belonged-0.009
Token like
Token ID588
Feature activation-1.03e-02

Pos logit contributions
 heights+0.006
used+0.006
 hesitation+0.005
 competitors+0.005
joy+0.005
Neg logit contributions
grand-0.005
 lived-0.005
 woke-0.005
L-0.005
oc-0.004
Token lions
Token ID31420
Feature activation-2.74e-02

Pos logit contributions
used+0.022
 heights+0.020
 hesitation+0.020
ipping+0.019
joy+0.019
Neg logit contributions
 lived-0.016
grand-0.016
 woke-0.015
L-0.014
 was-0.013
Token,
Token ID11
Feature activation+1.01e-03

Pos logit contributions
 heights+0.061
used+0.060
 hesitation+0.058
joy+0.055
 competitors+0.054
Neg logit contributions
 lived-0.040
grand-0.036
 woke-0.035
oc-0.033
 was-0.033
Token gir
Token ID37370
Feature activation-3.85e-02

Pos logit contributions
grand+0.002
 lived+0.002
 woke+0.002
L+0.002
oc+0.002
Neg logit contributions
used-0.002
 heights-0.002
 hesitation-0.002
joy-0.002
 competitors-0.002
Tokenaff
Token ID2001
Feature activation-4.57e-02

Pos logit contributions
used+0.088
 heights+0.085
 hesitation+0.084
joy+0.077
 practicing+0.076
Neg logit contributions
grand-0.054
 lived-0.049
oc-0.049
 woke-0.046
L-0.045
Tokenes
Token ID274
Feature activation-1.41e-02

Pos logit contributions
 heights+0.093
 hesitation+0.087
used+0.083
joy+0.080
 attempts+0.078
Neg logit contributions
 lived-0.082
grand-0.077
oc-0.072
 woke-0.072
 named-0.069
Token,
Token ID11
Feature activation-2.66e-03

Pos logit contributions
 heights+0.032
used+0.031
 hesitation+0.030
joy+0.028
 competitors+0.028
Neg logit contributions
 lived-0.020
grand-0.019
 woke-0.018
oc-0.016
L-0.016
Token and
Token ID290
Feature activation-2.13e-03

Pos logit contributions
used+0.008
 heights+0.007
 hesitation+0.007
joy+0.007
 competitors+0.007
Neg logit contributions
 lived-0.002
 woke-0.002
grand-0.002
 was-0.002
L-0.001
Token elephants
Token ID25368
Feature activation-3.63e-02

Pos logit contributions
used+0.004
 heights+0.004
ipping+0.004
 hesitation+0.004
joy+0.004
Neg logit contributions
 lived-0.004
 woke-0.004
grand-0.003
 named-0.003
 was-0.003
Token.
Token ID13
Feature activation-4.47e-03

Pos logit contributions
 heights+0.097
used+0.095
 hesitation+0.092
joy+0.089
 attempts+0.087
Neg logit contributions
 lived-0.036
grand-0.036
oc-0.030
 woke-0.030
 was-0.028
Token a
Token ID257
Feature activation+5.20e-03

Pos logit contributions
 heights+0.013
used+0.013
 hesitation+0.012
 competitors+0.012
joy+0.012
Neg logit contributions
 lived-0.008
grand-0.008
L-0.007
 woke-0.007
 was-0.007
Token stream
Token ID4269
Feature activation-5.83e-04

Pos logit contributions
grand+0.007
 lived+0.007
 woke+0.007
L+0.006
oc+0.006
Neg logit contributions
 heights-0.012
used-0.011
 hesitation-0.011
 competitors-0.010
iously-0.010
Token,
Token ID11
Feature activation-9.15e-03

Pos logit contributions
 heights+0.001
used+0.001
 hesitation+0.001
 competitors+0.001
joy+0.001
Neg logit contributions
grand-0.001
 lived-0.001
 woke-0.001
L-0.001
oc-0.001
Token some
Token ID617
Feature activation-7.27e-03

Pos logit contributions
 heights+0.020
used+0.019
 hesitation+0.019
 competitors+0.017
joy+0.017
Neg logit contributions
grand-0.014
 lived-0.013
 woke-0.012
L-0.012
oc-0.011
Token wild
Token ID4295
Feature activation-7.22e-03

Pos logit contributions
 heights+0.014
used+0.013
 hesitation+0.013
 competitors+0.012
joy+0.012
Neg logit contributions
grand-0.013
 lived-0.012
 woke-0.012
L-0.011
oc-0.011
Tokenfl
Token ID2704
Feature activation-4.79e-02

Pos logit contributions
used+0.013
 heights+0.013
 hesitation+0.013
 practicing+0.011
 competitors+0.011
Neg logit contributions
 lived-0.013
 woke-0.012
grand-0.012
 was-0.012
oc-0.011
Tokenowers
Token ID3618
Feature activation-2.58e-02

Pos logit contributions
 hesitation+0.095
 heights+0.093
used+0.083
 Majesty+0.080
 attempts+0.079
Neg logit contributions
 lived-0.080
tr-0.075
ase-0.071
room-0.070
grand-0.067
Token and
Token ID290
Feature activation-8.54e-04

Pos logit contributions
 heights+0.067
used+0.063
 hesitation+0.061
 competitors+0.060
 Majesty+0.060
Neg logit contributions
 lived-0.030
 was-0.026
grand-0.026
 woke-0.025
 belonged-0.022
Token a
Token ID257
Feature activation+7.05e-04

Pos logit contributions
 heights+0.002
used+0.002
 hesitation+0.002
 competitors+0.002
joy+0.002
Neg logit contributions
 lived-0.001
grand-0.001
 woke-0.001
L-0.001
 was-0.001
Token huge
Token ID3236
Feature activation-4.27e-03

Pos logit contributions
grand+0.001
 lived+0.001
 woke+0.001
L+0.001
oc+0.001
Neg logit contributions
 heights-0.002
used-0.002
 hesitation-0.001
 competitors-0.001
iously-0.001
Token old
Token ID1468
Feature activation-3.73e-03

Pos logit contributions
 heights+0.009
used+0.009
 hesitation+0.008
 competitors+0.008
 practicing+0.008
Neg logit contributions
 lived-0.006
 woke-0.006
grand-0.006
oc-0.006
L-0.005
Token house
Token ID2156
Feature activation+5.59e-03

Pos logit contributions
grand+0.029
 woke+0.027
 lived+0.026
L+0.026
tr+0.024
Neg logit contributions
 heights-0.052
used-0.051
 hesitation-0.050
 competitors-0.045
 practicing-0.045
Token.
Token ID13
Feature activation-2.15e-03

Pos logit contributions
grand+0.007
 lived+0.006
 woke+0.006
L+0.006
oc+0.005
Neg logit contributions
 heights-0.014
used-0.013
 hesitation-0.013
 competitors-0.012
 muscles-0.012
Token They
Token ID1119
Feature activation-4.25e-03

Pos logit contributions
 heights+0.005
used+0.005
 hesitation+0.005
 competitors+0.005
 practicing+0.005
Neg logit contributions
grand-0.003
 lived-0.003
 woke-0.002
L-0.002
tr-0.002
Token delivered
Token ID6793
Feature activation-8.44e-04

Pos logit contributions
 heights+0.010
used+0.010
 hesitation+0.010
 competitors+0.009
joy+0.009
Neg logit contributions
grand-0.006
 lived-0.005
oc-0.004
L-0.004
 woke-0.004
Token more
Token ID517
Feature activation-1.33e-02

Pos logit contributions
used+0.002
 heights+0.002
 hesitation+0.002
 competitors+0.002
 practicing+0.002
Neg logit contributions
grand-0.001
 lived-0.001
L-0.001
 woke-0.001
tr-0.001
Token rad
Token ID2511
Feature activation-4.91e-02

Pos logit contributions
 heights+0.027
used+0.026
 hesitation+0.025
 competitors+0.023
joy+0.023
Neg logit contributions
grand-0.022
 lived-0.021
 woke-0.020
L-0.019
oc-0.019
Tokenishes
Token ID5614
Feature activation-2.72e-02

Pos logit contributions
 hesitation+0.073
 heights+0.070
used+0.067
joy+0.060
iley+0.059
Neg logit contributions
grand-0.108
oc-0.107
 lived-0.100
 was-0.095
man-0.094
Token and
Token ID290
Feature activation-8.18e-03

Pos logit contributions
 heights+0.066
used+0.064
 hesitation+0.062
 competitors+0.058
joy+0.057
Neg logit contributions
grand-0.036
 lived-0.033
 woke-0.030
L-0.029
oc-0.028
Token made
Token ID925
Feature activation+2.97e-03

Pos logit contributions
 heights+0.019
used+0.018
 hesitation+0.018
 competitors+0.016
joy+0.016
Neg logit contributions
grand-0.011
 lived-0.010
 woke-0.010
L-0.009
oc-0.009
Token more
Token ID517
Feature activation-1.72e-02

Pos logit contributions
grand+0.004
 lived+0.004
 woke+0.004
L+0.004
oc+0.003
Neg logit contributions
used-0.007
 heights-0.007
 hesitation-0.006
 competitors-0.006
joy-0.006
Token people
Token ID661
Feature activation-4.17e-03

Pos logit contributions
used+0.036
 heights+0.036
 hesitation+0.034
 competitors+0.031
joy+0.031
Neg logit contributions
grand-0.027
 lived-0.026
 woke-0.024
oc-0.024
L-0.023