TOP ACTIVATIONS
MAX = 0.069

park filled with colourful or nam ents . Jimmy looked out
box full of beautiful or nam ents .
a box of shiny or nam ents that spark led in
time playing with her or nam ents and making her special
the rest of the or nam ents onto the tree .
to look at the or nam ents on the tree .
All of these lovely or nam ents reminded him of the
take one of the or nam ents outside for a walk
a box of pretty or nam ents that she wanted to
hill and put her or nam ents all around . She
more careful . The or nam ents are not toys .
he especially liked the or nam ents he kept around .
can we have some or nam ents for our snow man
she could play with the st a pler and the paper
dust and dirt . The st alks were hidden and the
a tree with shiny or nam ents hanging from its branches
went she searched for or nam ents , and the more
alone , trapped in the st inky air . One day
. They were called or nam ents and they were very
to me . The or nam ents were very important to

MIN ACTIVATIONS
MIN = -0.075

grew until it was as big as Amy .
. It is not as big or shiny as Dad 's
. It is not as big , but it is still
that it wasn 't as big as she thought . She
bigger until it was as big as Jack !
too high and too w obb ly , and it fell
grow until she became as big as the bubble .
bigger until it was as big as Tim my ! Tim
. It was now as big as a tree . She
. They are not as big or as bright as the
is too small to do big jobs ." Liz looked sad
finished it was almost as big as her bedroom !
Soon it was twice as big . Sarah was so excited
mushroom . It was as big as her head !
. It was not as impressive , but it was still
Tim grabbed the hook and anx iously put it in the
. It wasn 't as big as the one he admired
bigger . It becomes as big as Anna 's head ,
flowers were not growing as big and strong as they used
his magnet was not as big and colorful as Sarah 's

INTERVAL 0.033 - 0.069
CONTAINS 0.222%

Sue . Sue lived in a regular house with her mom
Lily . She lived in a house with her mom my
a check up and be extra clean !" The
's pretend we are mer maid s and swim in the
started to look for the source of the salt .

INTERVAL -0.003 - 0.033
CONTAINS 61.384%

shop , they saw that it was closed . " Oh
" That is an elephant . Ele ph ants live in
Jack 's mum told him , " Jack , the freezer
. Sarah was so happy to have had
up and down and around . They hear the sounds and

INTERVAL -0.039 - -0.003
CONTAINS 38.266%

but they could not find them . They asked the park
I am proud of you . And you , Jen ,
big sun with yellow and orange . Ben drew a green
The moral of the story is to never give
for Mom . They had flour , eggs , milk and

INTERVAL -0.075 - -0.039
CONTAINS 0.128%

his wings and soared so high . But try as he
to explain what the pistol was . She said it was
the crocod ile was too fast . It snapped its jaws
good ! She could not believe what she did . She
middle . It looked so tall and strong , like it
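
NOTE ON INTERVALS
The four INTERVAL ranges above look like an equal-width split of the range from MIN to MAX, and the four CONTAINS percentages appear to cover every token. The sketch below checks both readings against the displayed, rounded numbers. The equal-width binning is an assumption inferred from those numbers, not something the dump states, and the variable names are illustrative only.

```python
# Values copied from the dump above (rounded as displayed there).
# Assumption: the viewer splits [MIN, MAX] into equal-width bins.
lo, hi = -0.075, 0.069
n_bins = 4

width = (hi - lo) / n_bins  # expected to be about 0.036
edges = [round(lo + i * width, 3) for i in range(n_bins + 1)]

# CONTAINS percentages, listed from the lowest interval to the highest.
shares = [0.128, 38.266, 61.384, 0.222]
total = round(sum(shares), 3)

print("bin edges:", edges)
print("total share (%):", total)
```

If the assumption holds, the printed edges match the INTERVAL boundaries listed above and the shares sum to 100, which puts roughly 99.65% of tokens in the two middle bins around zero.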

Token park
Token ID3952
Feature activation+6.13e-03

Pos logit contributions
 queens+0.051
cious+0.050
ppy+0.047
mare+0.046
 chicks+0.046
Neg logit contributions
 him-0.044
 Finally-0.040
anked-0.039
 up-0.038
 When-0.037
Token filled
Token ID5901
Feature activation+1.12e-02

Pos logit contributions
 queens+0.012
cious+0.012
mare+0.011
 chicks+0.011
ppy+0.011
Neg logit contributions
 him-0.008
anked-0.007
 Finally-0.007
 When-0.006
 up-0.006
Token with
Token ID351
Feature activation+3.33e-03

Pos logit contributions
 queens+0.017
cious+0.016
mare+0.015
 seek+0.014
 cloud+0.014
Neg logit contributions
 Finally-0.018
anked-0.018
Years-0.016
 Slowly-0.016
 When-0.016
Token colourful
Token ID44799
Feature activation+4.75e-03

Pos logit contributions
 queens+0.006
cious+0.005
 chicks+0.005
mare+0.005
 soul+0.005
Neg logit contributions
 him-0.005
anked-0.005
 Finally-0.005
 When-0.005
 Up-0.005
Token or
Token ID393
Feature activation+5.41e-03

Pos logit contributions
 queens+0.008
cious+0.007
mare+0.007
 chicks+0.007
ppy+0.007
Neg logit contributions
 him-0.008
anked-0.008
 Finally-0.007
 When-0.007
 up-0.007
Tokennam
Token ID7402
Feature activation+6.93e-02

Pos logit contributions
mare+0.009
ppy+0.009
opy+0.009
cious+0.009
pillar+0.009
Neg logit contributions
 him-0.009
 up-0.008
 his-0.008
 Tim-0.007
 Finally-0.007
Tokenents
Token ID658
Feature activation+8.32e-03

Pos logit contributions
mare+0.101
cious+0.100
 queens+0.099
 soul+0.095
 chicks+0.093
Neg logit contributions
 him-0.120
anked-0.118
 Finally-0.108
 up-0.104
 When-0.103
Token.
Token ID13
Feature activation+1.52e-03

Pos logit contributions
 queens+0.017
cious+0.017
mare+0.016
 chicks+0.016
 soul+0.015
Neg logit contributions
 him-0.011
anked-0.009
 Finally-0.009
 up-0.008
 When-0.008
Token Jimmy
Token ID12963
Feature activation+8.53e-03

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.003
 soul+0.003
opy+0.003
Neg logit contributions
anked-0.002
 him-0.002
atches-0.002
ered-0.001
 up-0.001
Token looked
Token ID3114
Feature activation-2.10e-02

Pos logit contributions
 queens+0.015
cious+0.014
mare+0.013
lessness+0.013
 chicks+0.013
Neg logit contributions
 him-0.014
 Finally-0.012
anked-0.012
 up-0.012
 When-0.011
Token out
Token ID503
Feature activation-1.92e-03

Pos logit contributions
anked+0.024
 him+0.023
 Finally+0.021
 When+0.019
 Up+0.018
Neg logit contributions
 queens-0.044
cious-0.042
mare-0.040
 chicks-0.040
 soul-0.039
Token box
Token ID3091
Feature activation+2.49e-03

Pos logit contributions
 him+0.030
anked+0.028
 Finally+0.027
 up+0.026
 When+0.025
Neg logit contributions
 queens-0.029
cious-0.027
mare-0.025
ppy-0.025
 chicks-0.025
Token full
Token ID1336
Feature activation+1.76e-02

Pos logit contributions
 queens+0.004
cious+0.004
mare+0.004
lessness+0.004
ppy+0.004
Neg logit contributions
 him-0.004
 Finally-0.004
anked-0.004
 When-0.004
 After-0.003
Token of
Token ID286
Feature activation+6.73e-03

Pos logit contributions
 queens+0.025
cious+0.022
 chicks+0.022
 soul+0.021
lessness+0.021
Neg logit contributions
 him-0.033
anked-0.032
 Finally-0.032
 When-0.029
 After-0.029
Token beautiful
Token ID4950
Feature activation-4.42e-03

Pos logit contributions
 queens+0.010
cious+0.009
 chicks+0.009
 soul+0.009
 ghosts+0.008
Neg logit contributions
anked-0.012
 Finally-0.012
 him-0.011
 When-0.011
 Up-0.010
Token or
Token ID393
Feature activation+4.69e-03

Pos logit contributions
 him+0.007
anked+0.007
 Finally+0.006
 When+0.006
 up+0.006
Neg logit contributions
 queens-0.008
cious-0.007
mare-0.007
 chicks-0.007
ppy-0.007
Tokennam
Token ID7402
Feature activation+6.78e-02

Pos logit contributions
 queens+0.006
cious+0.006
mare+0.006
ppy+0.005
 chicks+0.005
Neg logit contributions
 him-0.009
anked-0.009
 Finally-0.009
 up-0.008
 When-0.008
Tokenents
Token ID658
Feature activation+1.02e-02

Pos logit contributions
cious+0.096
mare+0.096
 queens+0.094
 soul+0.091
 chicks+0.089
Neg logit contributions
 him-0.118
anked-0.117
 Finally-0.108
 up-0.104
 When-0.103
Token.
Token ID13
Feature activation+5.18e-03

Pos logit contributions
 queens+0.021
cious+0.021
mare+0.020
 soul+0.019
 chicks+0.019
Neg logit contributions
 him-0.013
anked-0.011
 up-0.010
 Finally-0.010
 When-0.010
Token (space)
Token ID220
Feature activation-3.75e-03

Pos logit contributions
 queens+0.012
cious+0.011
ppy+0.010
aky+0.010
cest+0.010
Neg logit contributions
 him-0.006
 Finally-0.006
anked-0.005
 When-0.005
 After-0.005
Token (newline)
Token ID198
Feature activation+3.66e-03

Pos logit contributions
anked+0.005
 him+0.005
 Finally+0.004
 drib+0.004
asted+0.004
Neg logit contributions
 queens-0.007
cious-0.007
mare-0.006
ppy-0.006
lessness-0.006
Token (newline)
Token ID198
Feature activation+1.04e-02

Pos logit contributions
 queens+0.008
cious+0.008
mare+0.008
 chicks+0.008
ppy+0.008
Neg logit contributions
 him-0.004
anked-0.004
 Finally-0.003
 When-0.003
 up-0.003
Token a
Token ID257
Feature activation-1.55e-02

Pos logit contributions
cious+0.003
 queens+0.003
ppy+0.003
opy+0.003
pillar+0.003
Neg logit contributions
 him-0.002
 Finally-0.002
anked-0.001
 When-0.001
 his-0.001
Token box
Token ID3091
Feature activation-2.70e-03

Pos logit contributions
 him+0.027
anked+0.027
 Finally+0.024
 When+0.024
 Up+0.023
Neg logit contributions
 queens-0.024
cious-0.023
mare-0.022
 chicks-0.022
 soul-0.021
Token of
Token ID286
Feature activation+9.06e-04

Pos logit contributions
 him+0.004
anked+0.004
 Finally+0.003
 up+0.003
 When+0.003
Neg logit contributions
 queens-0.005
cious-0.005
 chicks-0.005
mare-0.005
ppy-0.004
Token shiny
Token ID22441
Feature activation-2.11e-02

Pos logit contributions
 queens+0.001
cious+0.001
 chicks+0.001
 soul+0.001
mare+0.001
Neg logit contributions
anked-0.002
 him-0.002
 Finally-0.001
 When-0.001
 Up-0.001
Token or
Token ID393
Feature activation+1.08e-02

Pos logit contributions
 him+0.034
anked+0.033
 Finally+0.032
 When+0.030
 Up+0.029
Neg logit contributions
 queens-0.035
cious-0.033
mare-0.031
 chicks-0.031
 soul-0.030
Tokennam
Token ID7402
Feature activation+6.60e-02

Pos logit contributions
mare+0.019
cious+0.018
pillar+0.018
ppy+0.017
plets+0.017
Neg logit contributions
 him-0.018
 up-0.016
 his-0.015
 Finally-0.015
anked-0.015
Tokenents
Token ID658
Feature activation+1.06e-02

Pos logit contributions
mare+0.100
cious+0.099
 queens+0.095
 soul+0.094
 chicks+0.092
Neg logit contributions
anked-0.110
 him-0.109
 Finally-0.102
 up-0.097
 When-0.096
Token that
Token ID326
Feature activation+1.03e-02

Pos logit contributions
 queens+0.023
cious+0.022
mare+0.021
 chicks+0.021
 soul+0.021
Neg logit contributions
 him-0.013
anked-0.011
 up-0.011
 Finally-0.010
 When-0.010
Token spark
Token ID9009
Feature activation-9.79e-03

Pos logit contributions
 queens+0.018
cious+0.017
mare+0.016
 chicks+0.016
ppy+0.015
Neg logit contributions
 him-0.017
anked-0.016
 Finally-0.015
 When-0.014
 up-0.014
Tokenled
Token ID992
Feature activation-1.60e-02

Pos logit contributions
 him+0.016
anked+0.015
 up+0.014
 Finally+0.014
 When+0.013
Neg logit contributions
 queens-0.017
cious-0.016
mare-0.015
 chicks-0.015
 soul-0.014
Token in
Token ID287
Feature activation+4.67e-03

Pos logit contributions
 him+0.021
anked+0.020
 Finally+0.019
 When+0.017
 Up+0.017
Neg logit contributions
 queens-0.032
cious-0.031
mare-0.029
 chicks-0.029
ppy-0.028
Token time
Token ID640
Feature activation-1.90e-03

Pos logit contributions
anked+0.038
 him+0.038
 Finally+0.036
 When+0.035
atches+0.033
Neg logit contributions
 queens-0.034
 soul-0.033
cious-0.033
mare-0.032
 bee-0.032
Token playing
Token ID2712
Feature activation-3.67e-03

Pos logit contributions
 him+0.003
anked+0.002
 Finally+0.002
 When+0.002
 Up+0.002
Neg logit contributions
 queens-0.004
cious-0.004
mare-0.003
 chicks-0.003
ppy-0.003
Token with
Token ID351
Feature activation-1.11e-02

Pos logit contributions
anked+0.005
 Finally+0.005
 him+0.005
 When+0.004
 Slowly+0.004
Neg logit contributions
 queens-0.007
cious-0.006
mare-0.006
 bee-0.006
 chicks-0.006
Token her
Token ID607
Feature activation+9.88e-03

Pos logit contributions
 him+0.011
anked+0.011
 Finally+0.010
 When+0.009
 Up+0.008
Neg logit contributions
 queens-0.026
cious-0.024
 chicks-0.023
mare-0.023
 soul-0.023
Token or
Token ID393
Feature activation-1.09e-02

Pos logit contributions
 queens+0.016
cious+0.015
mare+0.014
 chicks+0.014
ppy+0.013
Neg logit contributions
 him-0.017
anked-0.016
 Finally-0.016
 When-0.015
 up-0.014
Tokennam
Token ID7402
Feature activation+6.58e-02

Pos logit contributions
 him+0.014
anked+0.012
 Finally+0.011
 up+0.011
 When+0.010
Neg logit contributions
cious-0.023
 queens-0.023
mare-0.021
ppy-0.021
opy-0.021
Tokenents
Token ID658
Feature activation+9.92e-03

Pos logit contributions
cious+0.092
 queens+0.091
mare+0.091
 chicks+0.087
 soul+0.086
Neg logit contributions
 him-0.118
anked-0.116
 Finally-0.107
 up-0.103
 When-0.103
Token and
Token ID290
Feature activation-7.91e-03

Pos logit contributions
 queens+0.021
cious+0.020
mare+0.020
 chicks+0.019
 soul+0.019
Neg logit contributions
 him-0.012
anked-0.011
 up-0.010
 Finally-0.010
 When-0.009
Token making
Token ID1642
Feature activation-3.47e-03

Pos logit contributions
 him+0.012
anked+0.012
 Finally+0.011
 When+0.011
 up+0.010
Neg logit contributions
 queens-0.014
cious-0.013
mare-0.013
 chicks-0.012
ppy-0.012
Token her
Token ID607
Feature activation+1.19e-02

Pos logit contributions
anked+0.005
 Finally+0.005
 him+0.004
 When+0.004
atches+0.004
Neg logit contributions
 queens-0.006
cious-0.006
 chicks-0.006
 soul-0.005
 bee-0.005
Token special
Token ID2041
Feature activation-3.80e-03

Pos logit contributions
 queens+0.019
cious+0.018
mare+0.017
 chicks+0.017
ppy+0.017
Neg logit contributions
 him-0.020
anked-0.019
 Finally-0.018
 When-0.017
 up-0.017
Token the
Token ID262
Feature activation-6.09e-03

Pos logit contributions
 queens+0.010
cious+0.010
 chicks+0.009
 soul+0.009
mare+0.009
Neg logit contributions
anked-0.007
 him-0.007
 Finally-0.007
 When-0.006
 Up-0.006
Token rest
Token ID1334
Feature activation+1.63e-02

Pos logit contributions
 him+0.011
anked+0.010
 Finally+0.010
 up+0.009
 When+0.009
Neg logit contributions
 queens-0.009
cious-0.009
mare-0.009
ppy-0.008
 chicks-0.008
Token of
Token ID286
Feature activation-9.37e-03

Pos logit contributions
 queens+0.033
cious+0.032
mare+0.030
 chicks+0.029
 soul+0.029
Neg logit contributions
 him-0.022
anked-0.020
 up-0.018
 Finally-0.018
 When-0.017
Token the
Token ID262
Feature activation-5.22e-03

Pos logit contributions
anked+0.014
 Finally+0.013
 him+0.013
 When+0.013
 Up+0.012
Neg logit contributions
 queens-0.016
cious-0.015
 chicks-0.015
 soul-0.014
mare-0.014
Token or
Token ID393
Feature activation-9.88e-03

Pos logit contributions
 him+0.009
anked+0.008
 Finally+0.008
 up+0.008
 When+0.008
Neg logit contributions
 queens-0.008
cious-0.008
mare-0.008
ppy-0.007
 chicks-0.007
Tokennam
Token ID7402
Feature activation+6.57e-02

Pos logit contributions
 him+0.015
 up+0.013
 Finally+0.013
anked+0.013
 his+0.012
Neg logit contributions
cious-0.018
 queens-0.018
mare-0.017
 chicks-0.017
opy-0.017
Tokenents
Token ID658
Feature activation+4.04e-03

Pos logit contributions
mare+0.102
cious+0.099
 queens+0.097
 soul+0.096
 chicks+0.093
Neg logit contributions
anked-0.109
 him-0.107
 Finally-0.099
 up-0.094
 After-0.094
Token onto
Token ID4291
Feature activation-3.01e-03

Pos logit contributions
 queens+0.008
cious+0.008
mare+0.008
 chicks+0.008
 soul+0.007
Neg logit contributions
 him-0.005
anked-0.005
 Finally-0.004
 up-0.004
 When-0.004
Token the
Token ID262
Feature activation+3.27e-03

Pos logit contributions
anked+0.004
 him+0.004
 Finally+0.004
 When+0.003
 Up+0.003
Neg logit contributions
 queens-0.006
cious-0.005
mare-0.005
 chicks-0.005
 soul-0.005
Token tree
Token ID5509
Feature activation-1.72e-03

Pos logit contributions
 queens+0.005
cious+0.005
mare+0.005
ppy+0.005
 chicks+0.005
Neg logit contributions
 him-0.005
 Finally-0.005
anked-0.005
 up-0.005
 When-0.005
Token.
Token ID13
Feature activation-6.97e-03

Pos logit contributions
 him+0.002
anked+0.002
 Finally+0.002
 When+0.002
 up+0.002
Neg logit contributions
 queens-0.003
cious-0.003
mare-0.003
 chicks-0.003
ppy-0.003
Token to
Token ID284
Feature activation+6.93e-03

Pos logit contributions
 Finally+0.028
anked+0.027
 When+0.025
 After+0.024
 Slowly+0.024
Neg logit contributions
 queens-0.029
 chicks-0.025
 bee-0.024
cious-0.024
 soul-0.023
Token look
Token ID804
Feature activation-2.89e-02

Pos logit contributions
 queens+0.012
cious+0.012
mare+0.011
 soul+0.011
 chicks+0.011
Neg logit contributions
anked-0.010
 Finally-0.010
 him-0.010
 When-0.009
 After-0.008
Token at
Token ID379
Feature activation-1.00e-02

Pos logit contributions
anked+0.043
 Finally+0.041
 Timothy+0.039
asted+0.037
Here+0.037
Neg logit contributions
 queens-0.046
ppy-0.045
aky-0.042
opy-0.041
 soul-0.041
Token the
Token ID262
Feature activation+1.58e-02

Pos logit contributions
 him+0.012
anked+0.012
 Finally+0.012
 When+0.011
 Up+0.010
Neg logit contributions
 queens-0.020
cious-0.019
 chicks-0.018
mare-0.018
 soul-0.018
Token or
Token ID393
Feature activation-1.44e-02

Pos logit contributions
 queens+0.022
cious+0.020
 chicks+0.019
mare+0.018
 soul+0.018
Neg logit contributions
 him-0.030
anked-0.030
 Finally-0.029
 When-0.027
 After-0.026
Tokennam
Token ID7402
Feature activation+6.45e-02

Pos logit contributions
 him+0.028
 up+0.024
anked+0.024
 Finally+0.024
 When+0.023
Neg logit contributions
 queens-0.022
cious-0.021
mare-0.020
 chicks-0.020
plets-0.019
Tokenents
Token ID658
Feature activation-3.93e-03

Pos logit contributions
 queens+0.087
cious+0.085
mare+0.081
 chicks+0.078
 soul+0.076
Neg logit contributions
 him-0.122
anked-0.119
 Finally-0.112
 When-0.108
 up-0.107
Token on
Token ID319
Feature activation+9.73e-03

Pos logit contributions
 him+0.005
anked+0.005
 Finally+0.004
 When+0.004
 up+0.004
Neg logit contributions
 queens-0.008
cious-0.008
mare-0.007
 chicks-0.007
ppy-0.007
Token the
Token ID262
Feature activation+5.13e-03

Pos logit contributions
 queens+0.019
cious+0.018
 chicks+0.017
mare+0.017
 soul+0.017
Neg logit contributions
anked-0.012
 him-0.012
 Finally-0.012
 When-0.011
 Up-0.010
Token tree
Token ID5509
Feature activation-9.34e-03

Pos logit contributions
 queens+0.008
cious+0.008
mare+0.008
ppy+0.007
 chicks+0.007
Neg logit contributions
 him-0.009
anked-0.008
 Finally-0.008
 up-0.008
 When-0.007
Token.
Token ID13
Feature activation+2.68e-03

Pos logit contributions
 him+0.013
anked+0.012
 Finally+0.012
 When+0.011
 Up+0.011
Neg logit contributions
 queens-0.018
cious-0.017
 chicks-0.016
mare-0.016
ppy-0.016
Token All
Token ID1439
Feature activation+1.06e-02

Pos logit contributions
anked+0.006
 him+0.005
 up+0.005
atches+0.004
 drib+0.004
Neg logit contributions
 queens-0.006
cious-0.006
 chicks-0.006
pillar-0.006
 soul-0.006
Token of
Token ID286
Feature activation+5.88e-03

Pos logit contributions
 queens+0.021
cious+0.021
mare+0.020
 chicks+0.019
ppy+0.019
Neg logit contributions
 him-0.014
anked-0.013
 Finally-0.012
 When-0.012
 up-0.011
Token these
Token ID777
Feature activation+2.22e-02

Pos logit contributions
 queens+0.013
cious+0.012
 chicks+0.012
mare+0.011
 soul+0.011
Neg logit contributions
anked-0.006
 him-0.006
 Finally-0.006
 When-0.005
atches-0.005
Token lovely
Token ID14081
Feature activation+1.63e-02

Pos logit contributions
cious+0.038
 queens+0.037
mare+0.037
ppy+0.034
plets+0.034
Neg logit contributions
 him-0.037
anked-0.033
 Finally-0.032
 up-0.030
 When-0.030
Token or
Token ID393
Feature activation+6.22e-03

Pos logit contributions
cious+0.030
 queens+0.030
ppy+0.029
mare+0.029
plets+0.028
Neg logit contributions
 him-0.025
anked-0.022
 Finally-0.020
 up-0.020
 When-0.019
Tokennam
Token ID7402
Feature activation+6.43e-02

Pos logit contributions
mare+0.010
cious+0.010
ppy+0.009
plets+0.009
pillar+0.009
Neg logit contributions
 him-0.011
 his-0.010
 up-0.010
 Finally-0.009
anked-0.009
Tokenents
Token ID658
Feature activation+9.91e-03

Pos logit contributions
mare+0.098
cious+0.096
 queens+0.093
 soul+0.092
 moth+0.089
Neg logit contributions
 him-0.109
anked-0.107
 Finally-0.099
 When-0.093
 After-0.093
Token reminded
Token ID14516
Feature activation-1.01e-03

Pos logit contributions
 queens+0.019
cious+0.018
mare+0.017
 chicks+0.017
 soul+0.016
Neg logit contributions
 him-0.015
anked-0.013
 up-0.012
 Finally-0.012
 When-0.012
Token him
Token ID683
Feature activation-2.47e-02

Pos logit contributions
anked+0.000
 Finally+0.000
atches+0.000
 When+0.000
 him+0.000
Neg logit contributions
 queens-0.003
cious-0.003
 chicks-0.003
ppy-0.003
 soul-0.003
Token of
Token ID286
Feature activation-1.68e-02

Pos logit contributions
 him+0.030
anked+0.027
 Finally+0.026
 When+0.024
 up+0.024
Neg logit contributions
 queens-0.052
cious-0.050
mare-0.048
 chicks-0.047
ppy-0.047
Token the
Token ID262
Feature activation-8.47e-03

Pos logit contributions
 him+0.018
anked+0.017
 Finally+0.016
 When+0.015
 Up+0.014
Neg logit contributions
 queens-0.037
cious-0.035
mare-0.034
 chicks-0.034
 soul-0.033
Token take
Token ID1011
Feature activation+9.01e-03

Pos logit contributions
 him+0.011
anked+0.011
 Finally+0.010
 When+0.009
 Up+0.009
Neg logit contributions
 queens-0.015
cious-0.014
mare-0.013
 chicks-0.013
 soul-0.013
Token one
Token ID530
Feature activation+1.26e-02

Pos logit contributions
 queens+0.017
cious+0.016
 chicks+0.015
 soul+0.015
mare+0.015
Neg logit contributions
anked-0.012
 Finally-0.011
 him-0.011
 When-0.010
atches-0.010
Token of
Token ID286
Feature activation+7.43e-03

Pos logit contributions
 queens+0.020
cious+0.018
 soul+0.018
 chicks+0.018
mare+0.018
Neg logit contributions
 him-0.021
anked-0.021
 Finally-0.020
 When-0.018
 Slowly-0.018
Token the
Token ID262
Feature activation+3.82e-03

Pos logit contributions
 queens+0.013
 soul+0.012
 chicks+0.012
mare+0.012
 bee+0.012
Neg logit contributions
anked-0.010
 Finally-0.010
rawled-0.009
 Up-0.009
 Slowly-0.009
Token or
Token ID393
Feature activation-1.99e-03

Pos logit contributions
cious+0.006
 queens+0.006
mare+0.006
ppy+0.006
plets+0.006
Neg logit contributions
 him-0.006
anked-0.006
 Finally-0.006
 up-0.005
 When-0.005
Tokennam
Token ID7402
Feature activation+6.40e-02

Pos logit contributions
 him+0.003
anked+0.003
 Finally+0.003
 When+0.003
 up+0.003
Neg logit contributions
 queens-0.003
cious-0.003
mare-0.003
 chicks-0.003
opy-0.003
Tokenents
Token ID658
Feature activation+6.17e-03

Pos logit contributions
cious+0.092
mare+0.092
 queens+0.091
 soul+0.087
 chicks+0.085
Neg logit contributions
 him-0.111
anked-0.110
 Finally-0.101
 up-0.097
 When-0.097
Token outside
Token ID2354
Feature activation+2.55e-03

Pos logit contributions
 queens+0.013
cious+0.012
mare+0.012
 chicks+0.011
opy+0.011
Neg logit contributions
 him-0.008
anked-0.007
 Finally-0.007
 up-0.006
 When-0.006
Token for
Token ID329
Feature activation-3.35e-03

Pos logit contributions
 queens+0.005
cious+0.005
mare+0.005
 chicks+0.005
ppy+0.005
Neg logit contributions
 him-0.003
anked-0.003
 Finally-0.003
 When-0.003
 up-0.002
Token a
Token ID257
Feature activation-1.07e-02

Pos logit contributions
anked+0.004
 him+0.004
 Finally+0.004
 When+0.003
 Up+0.003
Neg logit contributions
 queens-0.007
cious-0.007
mare-0.006
 chicks-0.006
 soul-0.006
Token walk
Token ID2513
Feature activation+6.63e-03

Pos logit contributions
 him+0.020
anked+0.019
 Finally+0.018
 When+0.017
 up+0.017
Neg logit contributions
 queens-0.016
cious-0.015
mare-0.014
 chicks-0.014
 soul-0.013
Token a
Token ID257
Feature activation-9.36e-03

Pos logit contributions
 queens+0.018
mare+0.018
mer+0.018
plets+0.018
ppy+0.018
Neg logit contributions
 him-0.006
 Finally-0.005
 his-0.004
 When-0.004
 up-0.004
Token box
Token ID3091
Feature activation+1.44e-04

Pos logit contributions
 him+0.018
anked+0.017
 Finally+0.016
 When+0.015
 Up+0.015
Neg logit contributions
 queens-0.013
cious-0.013
mare-0.012
 chicks-0.012
 soul-0.012
Token of
Token ID286
Feature activation+4.98e-03

Pos logit contributions
 queens+0.000
cious+0.000
mare+0.000
 chicks+0.000
ppy+0.000
Neg logit contributions
 him-0.000
anked-0.000
 Finally-0.000
 When-0.000
 up-0.000
Token pretty
Token ID2495
Feature activation+3.51e-03

Pos logit contributions
 queens+0.008
 chicks+0.007
cious+0.007
 soul+0.007
 ghosts+0.006
Neg logit contributions
anked-0.009
 Finally-0.008
 him-0.008
 When-0.008
 Up-0.008
Token or
Token ID393
Feature activation+8.27e-03

Pos logit contributions
 queens+0.006
cious+0.006
mare+0.006
 chicks+0.005
ppy+0.005
Neg logit contributions
 him-0.006
anked-0.005
 Finally-0.005
 When-0.005
 up-0.005
Tokennam
Token ID7402
Feature activation+6.39e-02

Pos logit contributions
 queens+0.014
mare+0.014
cious+0.014
plets+0.014
pillar+0.014
Neg logit contributions
 him-0.013
anked-0.011
 Finally-0.011
 up-0.011
 his-0.011
Tokenents
Token ID658
Feature activation+9.26e-03

Pos logit contributions
cious+0.095
mare+0.094
 queens+0.091
 soul+0.089
 chicks+0.088
Neg logit contributions
anked-0.108
 him-0.108
 Finally-0.099
 up-0.095
 When-0.094
Token that
Token ID326
Feature activation+9.14e-03

Pos logit contributions
 queens+0.020
cious+0.019
mare+0.019
 chicks+0.018
 soul+0.018
Neg logit contributions
 him-0.011
anked-0.010
 up-0.009
 Finally-0.009
 When-0.008
Token she
Token ID673
Feature activation+9.55e-03

Pos logit contributions
 queens+0.016
cious+0.015
mare+0.014
 chicks+0.014
ppy+0.014
Neg logit contributions
 him-0.015
anked-0.013
 Finally-0.013
 When-0.012
 up-0.012
Token wanted
Token ID2227
Feature activation+5.15e-03

Pos logit contributions
 queens+0.013
cious+0.012
mare+0.011
 chicks+0.011
plets+0.011
Neg logit contributions
 him-0.019
 Finally-0.017
anked-0.017
 When-0.016
 up-0.016
Token to
Token ID284
Feature activation-1.13e-03

Pos logit contributions
 queens+0.009
cious+0.009
 chicks+0.008
mare+0.008
 soul+0.008
Neg logit contributions
anked-0.007
 him-0.007
 Finally-0.007
 When-0.006
 Up-0.006
Token hill
Token ID12788
Feature activation-4.08e-03

Pos logit contributions
cious+0.001
 queens+0.001
ppy+0.001
lessness+0.001
mare+0.001
Neg logit contributions
 him-0.001
 Finally-0.001
 up-0.001
 drib-0.001
anked-0.001
Token and
Token ID290
Feature activation-3.32e-03

Pos logit contributions
 him+0.006
anked+0.005
 Finally+0.005
 When+0.005
 up+0.005
Neg logit contributions
 queens-0.008
cious-0.008
mare-0.007
 chicks-0.007
ppy-0.007
Token put
Token ID1234
Feature activation-8.33e-03

Pos logit contributions
 him+0.005
anked+0.005
 Finally+0.005
 When+0.005
 up+0.005
Neg logit contributions
 queens-0.006
cious-0.005
mare-0.005
 chicks-0.005
ppy-0.005
Token her
Token ID607
Feature activation+2.58e-02

Pos logit contributions
anked+0.013
 him+0.011
 Finally+0.011
 drib+0.010
 When+0.010
Neg logit contributions
 queens-0.015
cious-0.013
 chicks-0.013
mare-0.013
 soul-0.012
Token or
Token ID393
Feature activation-2.69e-04

Pos logit contributions
 queens+0.044
cious+0.044
mare+0.042
ppy+0.042
plets+0.041
Neg logit contributions
 him-0.040
 Finally-0.036
 up-0.036
anked-0.035
 When-0.033
Tokennam
Token ID7402
Feature activation+6.36e-02

Pos logit contributions
 him+0.000
 his+0.000
 up+0.000
 Finally+0.000
 Tim+0.000
Neg logit contributions
cious-0.000
pillar-0.000
plets-0.000
opy-0.000
umpy-0.000
Tokenents
Token ID658
Feature activation+3.42e-04

Pos logit contributions
cious+0.093
mare+0.092
 queens+0.092
 soul+0.087
 forgot+0.084
Neg logit contributions
anked-0.110
 him-0.109
 Finally-0.100
 up-0.097
 When-0.095
Token all
Token ID477
Feature activation-3.96e-04

Pos logit contributions
 queens+0.001
cious+0.001
mare+0.001
 chicks+0.001
ppy+0.001
Neg logit contributions
 him-0.000
anked-0.000
 Finally-0.000
 When-0.000
 Up-0.000
Token around
Token ID1088
Feature activation-1.52e-02

Pos logit contributions
anked+0.001
 him+0.001
 Finally+0.001
 When+0.000
 Up+0.000
Neg logit contributions
 queens-0.001
cious-0.001
mare-0.001
 soul-0.001
 chicks-0.001
Token.
Token ID13
Feature activation-8.49e-04

Pos logit contributions
 him+0.019
anked+0.018
 Finally+0.017
 When+0.016
 up+0.015
Neg logit contributions
 queens-0.031
cious-0.030
mare-0.029
 chicks-0.028
ppy-0.028
Token She
Token ID1375
Feature activation+3.37e-04

Pos logit contributions
 him+0.001
anked+0.001
 Timothy+0.001
 up+0.001
 drib+0.001
Neg logit contributions
 queens-0.001
cious-0.001
mare-0.001
 soul-0.001
 chicks-0.001
Token more
Token ID517
Feature activation+2.28e-02

Pos logit contributions
cious+0.014
 queens+0.014
plets+0.014
mare+0.014
pillar+0.013
Neg logit contributions
 him-0.017
 up-0.015
anked-0.015
 his-0.014
 Finally-0.014
Token careful
Token ID8161
Feature activation+1.65e-02

Pos logit contributions
 queens+0.030
cious+0.029
mare+0.028
plets+0.026
 chicks+0.026
Neg logit contributions
 him-0.045
anked-0.044
 Finally-0.041
 up-0.041
 When-0.040
Token.
Token ID13
Feature activation-8.44e-03

Pos logit contributions
 queens+0.036
cious+0.035
mare+0.034
 chicks+0.033
opy+0.032
Neg logit contributions
 him-0.020
anked-0.018
 up-0.017
 Finally-0.016
 When-0.015
Token The
Token ID383
Feature activation+1.17e-03

Pos logit contributions
 him+0.011
 Finally+0.010
 When+0.010
anked+0.010
 After+0.009
Neg logit contributions
 queens-0.018
cious-0.017
 forgot-0.016
mare-0.015
cest-0.015
Token or
Token ID393
Feature activation+1.73e-02

Pos logit contributions
 queens+0.002
cious+0.002
mare+0.002
ppy+0.002
 chicks+0.002
Neg logit contributions
 him-0.002
anked-0.002
 Finally-0.002
 When-0.002
 up-0.002
Tokennam
Token ID7402
Feature activation+6.34e-02

Pos logit contributions
cious+0.040
plets+0.039
mare+0.038
opy+0.037
pillar+0.037
Neg logit contributions
 him-0.022
 his-0.019
 up-0.018
 Tim-0.018
 math-0.013
Tokenents
Token ID658
Feature activation+1.45e-02

Pos logit contributions
cious+0.101
mare+0.100
 queens+0.097
 soul+0.095
 chicks+0.094
Neg logit contributions
anked-0.103
 him-0.101
 Finally-0.089
 up-0.086
atches-0.085
Token are
Token ID389
Feature activation+1.02e-02

Pos logit contributions
 queens+0.026
cious+0.026
mare+0.024
 chicks+0.023
 soul+0.022
Neg logit contributions
 him-0.024
anked-0.021
 up-0.020
 Finally-0.019
 When-0.018
Token not
Token ID407
Feature activation+1.27e-02

Pos logit contributions
cious+0.019
 queens+0.018
mare+0.018
pillar+0.018
plets+0.017
Neg logit contributions
 him-0.015
 up-0.013
anked-0.013
 Finally-0.012
 his-0.012
Token toys
Token ID14958
Feature activation+2.48e-03

Pos logit contributions
cious+0.024
 queens+0.023
mare+0.022
pillar+0.022
plets+0.022
Neg logit contributions
 him-0.019
anked-0.016
 up-0.016
 his-0.015
 Finally-0.015
Token.
Token ID13
Feature activation-5.94e-03

Pos logit contributions
cious+0.006
 queens+0.006
mare+0.006
 chicks+0.005
plets+0.005
Neg logit contributions
 him-0.003
 up-0.002
anked-0.002
 his-0.002
 Finally-0.002
Token he
Token ID339
Feature activation+7.68e-03

Pos logit contributions
cious+0.002
pillar+0.002
ppy+0.002
 queens+0.002
opy+0.002
Neg logit contributions
 him-0.002
 up-0.002
 Finally-0.002
 his-0.002
anked-0.001
Token especially
Token ID2592
Feature activation-6.43e-03

Pos logit contributions
 queens+0.014
cious+0.013
mare+0.012
plets+0.012
 chicks+0.012
Neg logit contributions
 him-0.012
 Finally-0.011
anked-0.011
 When-0.010
 up-0.010
Token liked
Token ID8288
Feature activation-9.16e-03

Pos logit contributions
 him+0.011
 up+0.009
anked+0.009
 Finally+0.009
 his+0.009
Neg logit contributions
 queens-0.012
cious-0.011
mare-0.010
 chicks-0.010
plets-0.010
Token the
Token ID262
Feature activation+6.34e-03

Pos logit contributions
anked+0.013
 him+0.012
 Finally+0.012
 When+0.011
 Up+0.011
Neg logit contributions
 queens-0.017
cious-0.016
 chicks-0.015
mare-0.015
 soul-0.015
Token or
Token ID393
Feature activation+3.73e-03

Pos logit contributions
 queens+0.010
cious+0.010
ppy+0.009
mare+0.009
opy+0.009
Neg logit contributions
 him-0.011
anked-0.010
 Finally-0.010
 up-0.009
 When-0.009
Tokennam
Token ID7402
Feature activation+6.32e-02

Pos logit contributions
cious+0.008
ppy+0.008
mare+0.008
opy+0.008
plets+0.007
Neg logit contributions
 him-0.005
 his-0.004
 up-0.004
 Finally-0.004
 When-0.004
Tokenents
Token ID658
Feature activation+6.71e-03

Pos logit contributions
cious+0.090
 queens+0.090
mare+0.090
 chicks+0.084
 soul+0.084
Neg logit contributions
 him-0.112
anked-0.111
 Finally-0.102
 up-0.098
 When-0.097
Token he
Token ID339
Feature activation+1.12e-03

Pos logit contributions
 queens+0.015
cious+0.014
mare+0.014
 chicks+0.013
 soul+0.013
Neg logit contributions
 him-0.008
anked-0.007
 up-0.006
 Finally-0.006
 When-0.006
Token kept
Token ID4030
Feature activation-1.30e-02

Pos logit contributions
 queens+0.002
cious+0.001
mare+0.001
 chicks+0.001
plets+0.001
Neg logit contributions
 him-0.002
 Finally-0.002
anked-0.002
 When-0.002
 up-0.002
Token around
Token ID1088
Feature activation-8.31e-03

Pos logit contributions
anked+0.018
 Finally+0.017
 Up+0.016
 When+0.016
 him+0.016
Neg logit contributions
 queens-0.023
cious-0.022
 soul-0.021
 chicks-0.021
lessness-0.020
Token.
Token ID13
Feature activation-3.68e-03

Pos logit contributions
anked+0.012
 Finally+0.011
 him+0.011
 When+0.010
 Up+0.010
Neg logit contributions
 queens-0.015
cious-0.014
 chicks-0.013
 soul-0.013
mare-0.013
Token can
Token ID460
Feature activation+1.31e-02

Pos logit contributions
 him+0.038
 Tim+0.031
 up+0.030
 When+0.030
 his+0.029
Neg logit contributions
cious-0.048
umpy-0.046
plets-0.046
 queens-0.046
aky-0.045
Token we
Token ID356
Feature activation+1.81e-03

Pos logit contributions
 queens+0.014
cious+0.014
aky+0.013
mare+0.012
 chicks+0.012
Neg logit contributions
 him-0.030
anked-0.027
 Finally-0.027
 When-0.027
 up-0.026
Token have
Token ID423
Feature activation-2.63e-03

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.003
 chicks+0.003
ppy+0.003
Neg logit contributions
 him-0.002
anked-0.002
 Finally-0.002
 When-0.002
 Up-0.002
Token some
Token ID617
Feature activation+2.92e-02

Pos logit contributions
 him+0.003
anked+0.003
 Finally+0.003
 When+0.003
 up+0.003
Neg logit contributions
 queens-0.005
cious-0.005
mare-0.005
ppy-0.005
 chicks-0.005
Token or
Token ID393
Feature activation+1.83e-03

Pos logit contributions
 queens+0.048
 chicks+0.042
cious+0.040
 soul+0.040
mare+0.039
Neg logit contributions
 him-0.049
anked-0.048
 Finally-0.046
 When-0.044
 Up-0.042
Tokennam
Token ID7402
Feature activation+6.30e-02

Pos logit contributions
cious+0.003
 queens+0.003
mare+0.003
opy+0.003
pillar+0.003
Neg logit contributions
 him-0.003
anked-0.003
 Finally-0.002
 up-0.002
 When-0.002
Tokenents
Token ID658
Feature activation+1.22e-02

Pos logit contributions
cious+0.095
mare+0.095
 queens+0.091
 soul+0.090
 chicks+0.088
Neg logit contributions
 him-0.106
anked-0.105
 Finally-0.096
 up-0.093
 When-0.091
Token for
Token ID329
Feature activation+4.58e-03

Pos logit contributions
cious+0.026
 queens+0.025
 chicks+0.024
 soul+0.023
mare+0.023
Neg logit contributions
 him-0.017
 up-0.014
anked-0.014
 Finally-0.012
 his-0.012
Token our
Token ID674
Feature activation+9.03e-04

Pos logit contributions
 queens+0.008
cious+0.008
mare+0.008
 chicks+0.008
ppy+0.007
Neg logit contributions
 him-0.007
anked-0.006
 Finally-0.006
 When-0.006
 Up-0.005
Token snow
Token ID6729
Feature activation-1.53e-02

Pos logit contributions
 queens+0.001
cious+0.001
mare+0.001
ppy+0.001
 chicks+0.001
Neg logit contributions
 him-0.002
anked-0.001
 Finally-0.001
 up-0.001
 When-0.001
Tokenman
Token ID805
Feature activation-5.22e-03

Pos logit contributions
 him+0.022
anked+0.020
 Finally+0.019
 up+0.019
 When+0.018
Neg logit contributions
 queens-0.028
cious-0.028
 soul-0.026
 chicks-0.025
mare-0.025
Token she
Token ID673
Feature activation-7.33e-03

Pos logit contributions
plets+0.038
cious+0.037
ppy+0.037
pillar+0.036
opy+0.036
Neg logit contributions
 him-0.019
 up-0.015
 his-0.015
 Tim-0.015
 Finally-0.013
Token could
Token ID714
Feature activation+6.54e-03

Pos logit contributions
 him+0.013
anked+0.012
 Finally+0.012
 When+0.012
 up+0.011
Neg logit contributions
 queens-0.011
cious-0.011
mare-0.010
 chicks-0.010
ppy-0.009
Token play
Token ID711
Feature activation+8.95e-03

Pos logit contributions
 queens+0.012
cious+0.011
mare+0.011
 soul+0.011
 chicks+0.010
Neg logit contributions
anked-0.009
 him-0.009
 Finally-0.008
 Up-0.008
 When-0.008
Token with
Token ID351
Feature activation-1.75e-02

Pos logit contributions
 queens+0.019
cious+0.018
mare+0.017
 chicks+0.016
ppy+0.016
Neg logit contributions
anked-0.011
 him-0.011
 Finally-0.010
 When-0.009
 Up-0.009
Token the
Token ID262
Feature activation+1.20e-02

Pos logit contributions
 him+0.017
anked+0.016
 Finally+0.015
 When+0.014
 Up+0.013
Neg logit contributions
 queens-0.040
cious-0.039
mare-0.037
 chicks-0.037
ppy-0.036
Token st
Token ID336
Feature activation+6.30e-02

Pos logit contributions
 queens+0.019
cious+0.018
mare+0.017
 chicks+0.017
 soul+0.016
Neg logit contributions
 him-0.020
anked-0.020
 Finally-0.019
 When-0.018
 Up-0.017
Tokena
Token ID64
Feature activation+3.20e-02

Pos logit contributions
 queens+0.115
cious+0.108
 chicks+0.104
 forgot+0.099
 soul+0.098
Neg logit contributions
 Finally-0.093
anked-0.091
 When-0.087
 him-0.087
 After-0.082
Tokenpler
Token ID20053
Feature activation+7.13e-04

Pos logit contributions
 queens+0.059
cious+0.057
mare+0.056
plets+0.054
 soul+0.053
Neg logit contributions
 him-0.045
 up-0.040
 Finally-0.038
anked-0.037
 When-0.037
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
 queens+0.001
cious+0.001
mare+0.001
 chicks+0.001
ppy+0.001
Neg logit contributions
 him-0.001
anked-0.001
 Finally-0.001
 When-0.001
 Up-0.001
Token the
Token ID262
Feature activation+6.69e-03

Pos logit contributions
 him+0.020
 Finally+0.017
anked+0.017
 When+0.017
 up+0.016
Neg logit contributions
 queens-0.029
cious-0.029
opy-0.027
ppy-0.027
aky-0.027
Token paper
Token ID3348
Feature activation+1.18e-02

Pos logit contributions
 queens+0.010
cious+0.010
mare+0.009
ppy+0.009
 chicks+0.009
Neg logit contributions
 him-0.011
anked-0.011
 Finally-0.010
 up-0.010
 When-0.010
Token dust
Token ID8977
Feature activation+6.24e-03

Pos logit contributions
 him+0.002
anked+0.002
 Finally+0.002
 When+0.002
 up+0.002
Neg logit contributions
 queens-0.002
cious-0.002
mare-0.002
ppy-0.002
 chicks-0.002
Token and
Token ID290
Feature activation+1.04e-04

Pos logit contributions
 queens+0.014
cious+0.013
mare+0.013
ppy+0.012
 soul+0.012
Neg logit contributions
 him-0.008
anked-0.007
 up-0.006
 Finally-0.006
 When-0.006
Token dirt
Token ID13647
Feature activation-1.96e-02

Pos logit contributions
cious+0.000
 queens+0.000
mare+0.000
umpy+0.000
ppy+0.000
Neg logit contributions
 him-0.000
 his-0.000
anked-0.000
 up-0.000
 Finally-0.000
Token.
Token ID13
Feature activation+3.60e-03

Pos logit contributions
 him+0.028
anked+0.027
 Finally+0.027
 When+0.025
 After+0.023
Neg logit contributions
 queens-0.035
cious-0.034
 chicks-0.032
mare-0.031
opy-0.031
Token The
Token ID383
Feature activation+9.76e-03

Pos logit contributions
 queens+0.006
cious+0.006
 chicks+0.006
mare+0.006
 soul+0.006
Neg logit contributions
 him-0.005
anked-0.005
 Finally-0.004
 up-0.004
atches-0.004
Token st
Token ID336
Feature activation+6.28e-02

Pos logit contributions
 queens+0.018
ppy+0.018
cious+0.018
mare+0.018
 outage+0.017
Neg logit contributions
 him-0.014
 up-0.013
 Finally-0.012
 his-0.011
 patiently-0.011
Tokenalks
Token ID23833
Feature activation+1.49e-02

Pos logit contributions
 queens+0.125
 ghosts+0.108
cious+0.107
 forgot+0.103
lessness+0.103
Neg logit contributions
 Finally-0.092
anked-0.091
 When-0.084
 After-0.083
 him-0.082
Token were
Token ID547
Feature activation+6.09e-04

Pos logit contributions
 queens+0.027
cious+0.026
mare+0.024
 chicks+0.023
ppy+0.022
Neg logit contributions
 him-0.024
anked-0.022
 Finally-0.021
 up-0.020
 When-0.020
Token hidden
Token ID7104
Feature activation-1.53e-02

Pos logit contributions
 queens+0.001
cious+0.001
mare+0.001
ppy+0.001
pillar+0.001
Neg logit contributions
 him-0.001
anked-0.001
 Finally-0.001
 up-0.001
 When-0.001
Token and
Token ID290
Feature activation+2.22e-02

Pos logit contributions
 him+0.019
anked+0.017
 up+0.016
 Finally+0.015
 his+0.014
Neg logit contributions
cious-0.032
 queens-0.032
mare-0.031
opy-0.030
 chicks-0.030
Token the
Token ID262
Feature activation+1.21e-02

Pos logit contributions
cious+0.048
pillar+0.048
mare+0.047
 queens+0.046
ppy+0.046
Neg logit contributions
 him-0.030
 up-0.026
 his-0.025
 Tim-0.024
 Finally-0.022
Token a
Token ID257
Feature activation+3.08e-03

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.003
ppy+0.003
 chicks+0.003
Neg logit contributions
 him-0.002
anked-0.001
 Finally-0.001
 When-0.001
 up-0.001
Token tree
Token ID5509
Feature activation+3.16e-03

Pos logit contributions
 queens+0.005
ppy+0.005
cious+0.005
lessness+0.005
 chicks+0.005
Neg logit contributions
 him-0.004
 up-0.004
anked-0.004
 Finally-0.004
 drib-0.004
Token with
Token ID351
Feature activation-2.66e-04

Pos logit contributions
 queens+0.007
cious+0.006
mare+0.006
 chicks+0.006
ppy+0.006
Neg logit contributions
 him-0.004
anked-0.004
 Finally-0.003
 When-0.003
 Up-0.003
Token shiny
Token ID22441
Feature activation-1.60e-02

Pos logit contributions
 him+0.000
anked+0.000
 Finally+0.000
 When+0.000
 up+0.000
Neg logit contributions
 queens-0.001
cious-0.001
mare-0.000
ppy-0.000
 chicks-0.000
Token or
Token ID393
Feature activation+5.04e-04

Pos logit contributions
 him+0.027
anked+0.026
 Finally+0.025
 When+0.023
 up+0.022
Neg logit contributions
 queens-0.026
cious-0.025
mare-0.023
 chicks-0.023
ppy-0.022
Tokennam
Token ID7402
Feature activation+6.27e-02

Pos logit contributions
mare+0.001
ppy+0.001
pillar+0.001
 outage+0.001
opy+0.001
Neg logit contributions
 him-0.001
 up-0.001
 his-0.001
 Finally-0.001
 Tim-0.001
Tokenents
Token ID658
Feature activation+4.37e-03

Pos logit contributions
 queens+0.086
cious+0.086
mare+0.085
 soul+0.080
 chicks+0.079
Neg logit contributions
 him-0.113
anked-0.112
 Finally-0.104
 When-0.100
 up-0.100
Token hanging
Token ID10938
Feature activation-4.74e-03

Pos logit contributions
 queens+0.009
cious+0.008
mare+0.008
 chicks+0.008
ppy+0.008
Neg logit contributions
 him-0.006
anked-0.005
 Finally-0.005
 When-0.005
 up-0.005
Token from
Token ID422
Feature activation-2.61e-02

Pos logit contributions
 him+0.007
anked+0.007
 Finally+0.006
 drib+0.006
 When+0.006
Neg logit contributions
 queens-0.009
 chicks-0.008
cious-0.008
opy-0.007
mare-0.007
Token its
Token ID663
Feature activation+6.22e-03

Pos logit contributions
anked+0.037
 him+0.036
 Finally+0.034
 When+0.032
 Up+0.031
Neg logit contributions
 queens-0.048
cious-0.046
 chicks-0.044
mare-0.042
 soul-0.042
Token branches
Token ID13737
Feature activation-1.15e-03

Pos logit contributions
 queens+0.010
cious+0.010
mare+0.009
ppy+0.009
opy+0.009
Neg logit contributions
 him-0.010
anked-0.010
 Finally-0.010
 up-0.009
 When-0.009
Token went
Token ID1816
Feature activation+4.85e-03

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.003
 chicks+0.003
ppy+0.003
Neg logit contributions
 him-0.003
anked-0.003
 Finally-0.003
 When-0.002
 up-0.002
Token she
Token ID673
Feature activation+1.38e-02

Pos logit contributions
 queens+0.010
cious+0.010
mare+0.010
ppy+0.009
 chicks+0.009
Neg logit contributions
 him-0.006
anked-0.005
 Finally-0.005
 up-0.005
 When-0.005
Token searched
Token ID16499
Feature activation-4.90e-03

Pos logit contributions
 queens+0.020
cious+0.019
mare+0.018
 chicks+0.018
ppy+0.017
Neg logit contributions
 him-0.026
anked-0.024
 Finally-0.024
 When-0.022
 up-0.022
Token for
Token ID329
Feature activation+7.61e-03

Pos logit contributions
anked+0.007
 him+0.006
 Finally+0.006
 When+0.006
 Up+0.005
Neg logit contributions
 queens-0.009
cious-0.009
 chicks-0.008
mare-0.008
 soul-0.008
Token or
Token ID393
Feature activation+2.03e-03

Pos logit contributions
cious+0.014
 queens+0.014
mare+0.014
ppy+0.013
opy+0.013
Neg logit contributions
 him-0.011
anked-0.010
 Finally-0.009
 When-0.009
 up-0.009
Tokennam
Token ID7402
Feature activation+6.23e-02

Pos logit contributions
 queens+0.002
cious+0.002
mare+0.002
 chicks+0.002
ppy+0.002
Neg logit contributions
 him-0.004
anked-0.004
 Finally-0.004
 When-0.004
 up-0.004
Tokenents
Token ID658
Feature activation+1.25e-02

Pos logit contributions
cious+0.086
 queens+0.085
mare+0.083
 soul+0.079
 chicks+0.078
Neg logit contributions
 him-0.114
anked-0.111
 Finally-0.105
 When-0.100
 up-0.099
Token,
Token ID11
Feature activation+3.57e-03

Pos logit contributions
 queens+0.026
cious+0.026
mare+0.024
 chicks+0.024
 soul+0.024
Neg logit contributions
 him-0.016
anked-0.013
 Finally-0.013
 up-0.013
 When-0.012
Token and
Token ID290
Feature activation+3.78e-03

Pos logit contributions
 queens+0.006
cious+0.006
mare+0.006
ppy+0.006
 chicks+0.006
Neg logit contributions
 him-0.006
anked-0.005
 Finally-0.005
 up-0.005
 When-0.005
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
 queens+0.007
cious+0.007
mare+0.007
ppy+0.007
 chicks+0.007
Neg logit contributions
 him-0.005
anked-0.005
 Finally-0.005
 When-0.004
 up-0.004
Token more
Token ID517
Feature activation+2.41e-02

Pos logit contributions
cious+0.018
mare+0.017
ppy+0.017
 outage+0.017
plets+0.017
Neg logit contributions
 him-0.018
 Finally-0.016
 up-0.015
 his-0.014
 When-0.014
Token alone
Token ID3436
Feature activation+5.82e-03

Pos logit contributions
 queens+0.010
cious+0.009
mare+0.008
 chicks+0.008
ppy+0.008
Neg logit contributions
 him-0.011
anked-0.011
 Finally-0.011
 When-0.010
 up-0.010
Token,
Token ID11
Feature activation+1.27e-02

Pos logit contributions
 queens+0.013
cious+0.013
 chicks+0.012
mare+0.012
 soul+0.012
Neg logit contributions
 him-0.007
anked-0.006
 Finally-0.006
 up-0.006
 When-0.005
Token trapped
Token ID13640
Feature activation-8.48e-03

Pos logit contributions
 queens+0.025
cious+0.024
mare+0.022
ppy+0.022
 chicks+0.022
Neg logit contributions
 him-0.018
anked-0.017
 Finally-0.016
 When-0.015
 up-0.015
Token in
Token ID287
Feature activation+6.17e-03

Pos logit contributions
 him+0.011
anked+0.011
 Finally+0.010
 When+0.010
 Up+0.009
Neg logit contributions
 queens-0.016
cious-0.016
mare-0.015
 chicks-0.015
ppy-0.015
Token the
Token ID262
Feature activation+9.33e-03

Pos logit contributions
cious+0.014
 queens+0.014
ppy+0.013
mare+0.013
opy+0.013
Neg logit contributions
 him-0.007
anked-0.006
 Finally-0.006
 his-0.005
 up-0.005
Token st
Token ID336
Feature activation+6.19e-02

Pos logit contributions
cious+0.017
ppy+0.017
 queens+0.017
mare+0.016
opy+0.015
Neg logit contributions
 him-0.014
 Finally-0.014
 up-0.013
anked-0.012
 When-0.012
Tokeninky
Token ID29246
Feature activation+3.58e-02

Pos logit contributions
 queens+0.114
cious+0.106
 chicks+0.101
mare+0.097
 forgot+0.097
Neg logit contributions
 Finally-0.092
anked-0.090
 When-0.085
 him-0.085
 After-0.082
Token air
Token ID1633
Feature activation+1.05e-03

Pos logit contributions
 queens+0.068
cious+0.064
mare+0.063
plets+0.062
lessness+0.059
Neg logit contributions
 him-0.052
 up-0.051
 Finally-0.042
 When-0.040
 his-0.040
Token.
Token ID13
Feature activation+7.05e-03

Pos logit contributions
 queens+0.002
cious+0.002
mare+0.002
 chicks+0.002
ppy+0.002
Neg logit contributions
 him-0.001
anked-0.001
 Finally-0.001
 up-0.001
 When-0.001
Token One
Token ID1881
Feature activation+1.96e-02

Pos logit contributions
 queens+0.014
cious+0.014
mare+0.013
 chicks+0.013
 soul+0.013
Neg logit contributions
 him-0.009
anked-0.009
 Finally-0.008
 up-0.007
 When-0.007
Token day
Token ID1110
Feature activation+1.78e-03

Pos logit contributions
 queens+0.050
cious+0.048
plets+0.046
opy+0.045
ppy+0.044
Neg logit contributions
 him-0.017
 up-0.014
 Finally-0.014
 When-0.014
 his-0.013
Token.
Token ID13
Feature activation+2.21e-03

Pos logit contributions
 him+0.017
anked+0.017
 Finally+0.016
 When+0.014
 Up+0.014
Neg logit contributions
 queens-0.022
cious-0.021
mare-0.019
ppy-0.019
 chicks-0.019
Token They
Token ID1119
Feature activation+1.50e-02

Pos logit contributions
 queens+0.004
cious+0.004
mare+0.004
 chicks+0.004
ppy+0.004
Neg logit contributions
anked-0.003
 him-0.003
ered-0.002
 up-0.002
Years-0.002
Token were
Token ID547
Feature activation+2.96e-03

Pos logit contributions
 queens+0.021
cious+0.020
mare+0.019
 chicks+0.018
plets+0.018
Neg logit contributions
 him-0.029
anked-0.027
 Finally-0.027
 When-0.025
 up-0.025
Token called
Token ID1444
Feature activation+2.33e-02

Pos logit contributions
 queens+0.005
cious+0.005
 chicks+0.004
mare+0.004
 soul+0.004
Neg logit contributions
anked-0.005
 him-0.005
 Finally-0.004
 When-0.004
 Up-0.004
Token or
Token ID393
Feature activation+7.19e-03

Pos logit contributions
 queens+0.044
cious+0.042
mare+0.040
 chicks+0.040
 soul+0.038
Neg logit contributions
 him-0.032
anked-0.032
 Finally-0.030
 When-0.027
 After-0.026
Tokennam
Token ID7402
Feature activation+6.11e-02

Pos logit contributions
pillar+0.014
plets+0.013
mare+0.013
umpy+0.013
cious+0.012
Neg logit contributions
 him-0.012
 his-0.011
 Tim-0.010
 up-0.010
 drib-0.009
Tokenents
Token ID658
Feature activation+7.99e-03

Pos logit contributions
mare+0.086
cious+0.086
 queens+0.086
 soul+0.081
 chicks+0.078
Neg logit contributions
 him-0.109
anked-0.107
 Finally-0.098
 up-0.095
 When-0.094
Token and
Token ID290
Feature activation+8.57e-03

Pos logit contributions
 queens+0.017
cious+0.016
mare+0.016
 chicks+0.015
 soul+0.015
Neg logit contributions
 him-0.010
anked-0.009
 Finally-0.008
 up-0.008
 When-0.008
Token they
Token ID484
Feature activation+1.46e-02

Pos logit contributions
cious+0.015
 queens+0.015
mare+0.015
pillar+0.014
ppy+0.014
Neg logit contributions
 him-0.013
anked-0.012
 Finally-0.012
 When-0.011
 his-0.011
Token were
Token ID547
Feature activation-2.11e-03

Pos logit contributions
 queens+0.022
cious+0.021
mare+0.019
 chicks+0.019
 soul+0.018
Neg logit contributions
 him-0.027
anked-0.025
 Finally-0.025
 When-0.023
 up-0.023
Token very
Token ID845
Feature activation+1.81e-02

Pos logit contributions
 him+0.003
anked+0.003
 Finally+0.003
 When+0.003
 Up+0.003
Neg logit contributions
 queens-0.004
cious-0.004
mare-0.003
 chicks-0.003
ppy-0.003
Token to
Token ID284
Feature activation-9.06e-03

Pos logit contributions
 queens+0.041
cious+0.040
mare+0.039
 chicks+0.038
plets+0.038
Neg logit contributions
 him-0.019
 up-0.016
anked-0.016
 Finally-0.015
 When-0.015
Token me
Token ID502
Feature activation-1.61e-03

Pos logit contributions
 him+0.013
anked+0.012
 Finally+0.011
 When+0.010
 up+0.010
Neg logit contributions
 queens-0.018
cious-0.017
mare-0.016
 chicks-0.016
opy-0.016
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
 him+0.002
 up+0.002
anked+0.002
 Finally+0.002
 When+0.002
Neg logit contributions
 queens-0.003
cious-0.003
mare-0.003
plets-0.003
 chicks-0.003
Token The
Token ID383
Feature activation-4.79e-03

Pos logit contributions
 him+0.014
 Finally+0.014
 When+0.014
anked+0.013
 After+0.012
Neg logit contributions
 queens-0.022
cious-0.021
 forgot-0.019
mare-0.019
opy-0.019
Token or
Token ID393
Feature activation+7.96e-03

Pos logit contributions
 him+0.008
anked+0.008
 Finally+0.007
 When+0.007
 up+0.007
Neg logit contributions
 queens-0.008
cious-0.007
mare-0.007
ppy-0.007
 chicks-0.007
Tokennam
Token ID7402
Feature activation+6.09e-02

Pos logit contributions
cious+0.020
plets+0.019
mare+0.019
opy+0.018
umpy+0.018
Neg logit contributions
 him-0.008
 his-0.007
 up-0.007
 Tim-0.006
anked-0.005
Tokenents
Token ID658
Feature activation+1.10e-02

Pos logit contributions
cious+0.098
mare+0.096
 chicks+0.092
 queens+0.090
 soul+0.089
Neg logit contributions
 him-0.098
anked-0.098
 Finally-0.087
 up-0.083
 When-0.083
Token were
Token ID547
Feature activation-7.61e-03

Pos logit contributions
 queens+0.019
cious+0.019
mare+0.017
 chicks+0.017
ppy+0.016
Neg logit contributions
 him-0.018
anked-0.016
 up-0.015
 Finally-0.015
 When-0.015
Token very
Token ID845
Feature activation+8.24e-03

Pos logit contributions
 him+0.012
anked+0.011
 Finally+0.010
 up+0.010
 When+0.010
Neg logit contributions
cious-0.013
 queens-0.013
mare-0.013
pillar-0.012
plets-0.012
Token important
Token ID1593
Feature activation-9.13e-03

Pos logit contributions
cious+0.014
 queens+0.013
mare+0.013
pillar+0.012
plets+0.012
Neg logit contributions
 him-0.013
 up-0.012
anked-0.012
 Finally-0.012
 int-0.011
Token to
Token ID284
Feature activation-1.06e-02

Pos logit contributions
 him+0.012
anked+0.011
 Finally+0.010
 When+0.010
 up+0.010
Neg logit contributions
 queens-0.018
cious-0.018
mare-0.017
 chicks-0.017
ppy-0.017
Token grew
Token ID6348
Feature activation+2.19e-02

Pos logit contributions
cious+0.015
umpy+0.014
aky+0.014
ppy+0.014
 queens+0.014
Neg logit contributions
 him-0.011
 up-0.010
 Finally-0.009
 Tim-0.009
anked-0.009
Token until
Token ID1566
Feature activation-9.63e-03

Pos logit contributions
 queens+0.040
cious+0.038
mare+0.036
 chicks+0.035
ppy+0.035
Neg logit contributions
 him-0.033
anked-0.031
 Finally-0.030
 When-0.028
 up-0.028
Token it
Token ID340
Feature activation-1.40e-03

Pos logit contributions
 him+0.012
anked+0.011
 Finally+0.011
 When+0.010
 up+0.010
Neg logit contributions
 queens-0.020
cious-0.020
ppy-0.019
mare-0.019
 chicks-0.018
Token was
Token ID373
Feature activation-1.97e-02

Pos logit contributions
 him+0.002
 up+0.002
 Finally+0.002
anked+0.002
 When+0.002
Neg logit contributions
 queens-0.003
cious-0.003
 outage-0.002
 chicks-0.002
lessness-0.002
Token as
Token ID355
Feature activation+1.08e-02

Pos logit contributions
 him+0.033
anked+0.032
 Finally+0.030
 When+0.028
 Up+0.028
Neg logit contributions
 queens-0.032
cious-0.028
mare-0.028
 chicks-0.027
ppy-0.027
Token big
Token ID1263
Feature activation-7.52e-02

Pos logit contributions
cious+0.020
mare+0.019
pillar+0.019
 queens+0.018
ppy+0.018
Neg logit contributions
 him-0.016
 up-0.015
anked-0.014
 Finally-0.014
 his-0.013
Token as
Token ID355
Feature activation-7.14e-03

Pos logit contributions
 Finally+0.095
anked+0.093
 him+0.089
Years+0.081
 When+0.079
Neg logit contributions
 queens-0.148
 chicks-0.132
cious-0.132
plets-0.127
 soul-0.126
Token Amy
Token ID14235
Feature activation-1.08e-02

Pos logit contributions
 him+0.009
 Finally+0.008
anked+0.008
 up+0.008
 When+0.007
Neg logit contributions
cious-0.015
 queens-0.014
mare-0.014
ppy-0.014
opy-0.014
Token.
Token ID13
Feature activation-4.84e-03

Pos logit contributions
 him+0.018
anked+0.017
 Finally+0.016
 When+0.015
 up+0.015
Neg logit contributions
 queens-0.018
mare-0.016
cious-0.016
 chicks-0.015
lessness-0.015
Token (space)
Token ID220
Feature activation-6.58e-03

Pos logit contributions
 him+0.006
anked+0.006
 Finally+0.005
 drib+0.004
 up+0.004
Neg logit contributions
 queens-0.010
cious-0.010
mare-0.009
 chicks-0.009
 soul-0.009
Token (newline)
Token ID198
Feature activation-1.06e-02

Pos logit contributions
 him+0.009
anked+0.009
 drib+0.008
 int+0.008
asted+0.008
Neg logit contributions
 queens-0.011
cious-0.011
mare-0.010
opy-0.010
ppy-0.010
Token.
Token ID13
Feature activation+6.48e-04

Pos logit contributions
 him+0.028
anked+0.026
 Finally+0.025
 up+0.023
 When+0.023
Neg logit contributions
 queens-0.042
cious-0.041
mare-0.039
 chicks-0.038
opy-0.037
Token It
Token ID632
Feature activation+1.12e-02

Pos logit contributions
 queens+0.001
cious+0.001
mare+0.001
 soul+0.001
 chicks+0.001
Neg logit contributions
 him-0.001
anked-0.001
 Finally-0.001
 up-0.001
 When-0.001
Token is
Token ID318
Feature activation-8.57e-04

Pos logit contributions
 queens+0.014
cious+0.014
ppy+0.013
 chicks+0.013
opy+0.013
Neg logit contributions
anked-0.022
 him-0.021
 Finally-0.020
 When-0.019
 Timothy-0.019
Token not
Token ID407
Feature activation+1.00e-02

Pos logit contributions
 him+0.002
anked+0.002
 Finally+0.001
 Up+0.001
 When+0.001
Neg logit contributions
 queens-0.001
cious-0.001
 chicks-0.001
ppy-0.001
aky-0.001
Token as
Token ID355
Feature activation-3.49e-03

Pos logit contributions
 queens+0.019
cious+0.018
mare+0.017
 chicks+0.017
ppy+0.017
Neg logit contributions
 him-0.014
anked-0.013
 Finally-0.013
 When-0.012
 up-0.012
Token big
Token ID1263
Feature activation-7.47e-02

Pos logit contributions
 him+0.005
anked+0.005
 Finally+0.005
 up+0.005
 his+0.005
Neg logit contributions
cious-0.006
 queens-0.006
mare-0.006
 chicks-0.005
plets-0.005
Token or
Token ID393
Feature activation+6.10e-03

Pos logit contributions
anked+0.095
 Finally+0.093
 him+0.093
 When+0.083
 Up+0.083
Neg logit contributions
 queens-0.147
cious-0.136
 chicks-0.130
 soul-0.130
ppy-0.127
Token shiny
Token ID22441
Feature activation-5.66e-02

Pos logit contributions
cious+0.011
mare+0.010
 queens+0.010
pillar+0.010
 chicks+0.010
Neg logit contributions
 him-0.010
 up-0.009
anked-0.009
 his-0.008
 Finally-0.008
Token as
Token ID355
Feature activation-7.87e-03

Pos logit contributions
 him+0.077
anked+0.075
 Finally+0.071
 When+0.066
 Up+0.064
Neg logit contributions
 queens-0.109
cious-0.103
 chicks-0.096
mare-0.096
ppy-0.095
Token Dad
Token ID17415
Feature activation-3.83e-03

Pos logit contributions
 him+0.012
anked+0.011
 Finally+0.010
 up+0.010
 When+0.010
Neg logit contributions
 queens-0.014
cious-0.014
mare-0.013
 chicks-0.013
opy-0.013
Token's
Token ID338
Feature activation+3.96e-03

Pos logit contributions
 him+0.007
anked+0.006
 Finally+0.006
 When+0.006
 up+0.006
Neg logit contributions
 queens-0.006
cious-0.006
mare-0.005
 chicks-0.005
ppy-0.005
Token.
Token ID13
Feature activation+2.11e-03

Pos logit contributions
 him+0.033
anked+0.033
 Finally+0.031
 When+0.029
 Up+0.028
Neg logit contributions
 queens-0.042
cious-0.039
 chicks-0.036
 soul-0.036
mare-0.036
Token It
Token ID632
Feature activation+3.54e-03

Pos logit contributions
 queens+0.004
cious+0.004
mare+0.004
 soul+0.004
 chicks+0.004
Neg logit contributions
 him-0.003
anked-0.003
 up-0.002
 Finally-0.002
 int-0.002
Token is
Token ID318
Feature activation-1.96e-05

Pos logit contributions
 queens+0.004
 forgot+0.004
 soul+0.004
cious+0.004
opy+0.003
Neg logit contributions
anked-0.007
 him-0.007
 Timothy-0.007
 Finally-0.007
 When-0.006
Token not
Token ID407
Feature activation+7.61e-03

Pos logit contributions
anked+0.000
 him+0.000
 Finally+0.000
 Up+0.000
 When+0.000
Neg logit contributions
 queens-0.000
cious-0.000
 chicks-0.000
 forgot-0.000
 soul-0.000
Token as
Token ID355
Feature activation-4.07e-03

Pos logit contributions
cious+0.014
 queens+0.014
mare+0.014
pillar+0.013
 chicks+0.013
Neg logit contributions
 him-0.011
anked-0.010
 Finally-0.009
 up-0.009
 When-0.009
Token big
Token ID1263
Feature activation-7.42e-02

Pos logit contributions
 him+0.006
anked+0.006
 up+0.005
 Finally+0.005
 his+0.005
Neg logit contributions
cious-0.007
mare-0.007
 queens-0.007
ppy-0.007
 chicks-0.007
Token,
Token ID11
Feature activation+9.69e-03

Pos logit contributions
 him+0.096
anked+0.096
 Finally+0.095
 When+0.086
 Up+0.082
Neg logit contributions
 queens-0.146
cious-0.131
 chicks-0.128
 soul-0.127
opy-0.124
Token but
Token ID475
Feature activation-1.86e-02

Pos logit contributions
 queens+0.018
cious+0.018
mare+0.018
ppy+0.017
 chicks+0.017
Neg logit contributions
 him-0.014
anked-0.012
 Finally-0.012
 up-0.012
 Tim-0.011
Token it
Token ID340
Feature activation+6.52e-04

Pos logit contributions
 him+0.025
anked+0.023
 Finally+0.022
 up+0.021
 When+0.020
Neg logit contributions
 queens-0.037
cious-0.036
mare-0.035
ppy-0.034
 chicks-0.033
Token is
Token ID318
Feature activation-1.17e-02

Pos logit contributions
 queens+0.001
cious+0.001
opy+0.001
ppy+0.001
 forgot+0.001
Neg logit contributions
anked-0.001
 him-0.001
 Finally-0.001
 Timothy-0.001
 When-0.001
Token still
Token ID991
Feature activation+8.46e-04

Pos logit contributions
 him+0.019
anked+0.019
 Finally+0.018
 When+0.017
 Up+0.017
Neg logit contributions
 queens-0.019
cious-0.018
 chicks-0.017
 soul-0.017
ppy-0.017
Token that
Token ID326
Feature activation+1.88e-02

Pos logit contributions
anked+0.015
 him+0.014
 Finally+0.014
 When+0.013
 After+0.012
Neg logit contributions
 queens-0.021
cious-0.020
 chicks-0.019
mare-0.019
 soul-0.019
Token it
Token ID340
Feature activation+4.53e-03

Pos logit contributions
cious+0.044
 queens+0.043
ppy+0.042
plets+0.042
mare+0.041
Neg logit contributions
 him-0.022
 up-0.018
 his-0.016
anked-0.016
 Tim-0.015
Token wasn
Token ID2492
Feature activation-2.50e-02

Pos logit contributions
 queens+0.010
cious+0.009
mare+0.009
 chicks+0.009
 soul+0.009
Neg logit contributions
 him-0.006
 up-0.005
anked-0.005
 Finally-0.004
 When-0.004
Token't
Token ID470
Feature activation+1.12e-02

Pos logit contributions
anked+0.047
 Finally+0.047
 him+0.044
 Slowly+0.042
 After+0.042
Neg logit contributions
 queens-0.032
opy-0.028
ppy-0.028
plets-0.028
cious-0.028
Token as
Token ID355
Feature activation+3.25e-03

Pos logit contributions
cious+0.020
 queens+0.020
mare+0.018
pillar+0.018
opy+0.018
Neg logit contributions
 him-0.017
anked-0.015
 up-0.015
 his-0.014
 Finally-0.014
Token big
Token ID1263
Feature activation-7.38e-02

Pos logit contributions
cious+0.006
 queens+0.005
mare+0.005
opy+0.005
ppy+0.005
Neg logit contributions
 him-0.005
anked-0.005
 up-0.005
 his-0.004
 Finally-0.004
Token as
Token ID355
Feature activation-5.29e-03

Pos logit contributions
 him+0.084
anked+0.083
 Finally+0.083
 When+0.073
 Up+0.070
Neg logit contributions
 queens-0.155
cious-0.147
 chicks-0.139
 soul-0.138
mare-0.137
Token she
Token ID673
Feature activation+6.67e-05

Pos logit contributions
 him+0.007
anked+0.006
 Finally+0.006
 up+0.006
 When+0.005
Neg logit contributions
 queens-0.011
cious-0.011
mare-0.010
 chicks-0.010
opy-0.010
Token thought
Token ID1807
Feature activation-5.70e-03

Pos logit contributions
 queens+0.000
cious+0.000
mare+0.000
 chicks+0.000
opy+0.000
Neg logit contributions
 him-0.000
anked-0.000
 Finally-0.000
 When-0.000
 Up-0.000
Token.
Token ID13
Feature activation-5.79e-03

Pos logit contributions
 him+0.007
anked+0.006
 Finally+0.006
 up+0.006
 When+0.006
Neg logit contributions
 queens-0.012
cious-0.012
mare-0.011
 chicks-0.011
 soul-0.011
Token She
Token ID1375
Feature activation-3.88e-03

Pos logit contributions
 him+0.009
anked+0.009
asted+0.008
 Finally+0.008
 Timothy+0.008
Neg logit contributions
cious-0.009
 queens-0.009
mare-0.009
 soul-0.008
opy-0.008
Token bigger
Token ID5749
Feature activation-3.14e-02

Pos logit contributions
 him+0.000
anked+0.000
 Finally+0.000
 up+0.000
 When+0.000
Neg logit contributions
cious-0.000
 queens-0.000
mare-0.000
ppy-0.000
 chicks-0.000
Token until
Token ID1566
Feature activation-1.06e-02

Pos logit contributions
 him+0.045
anked+0.042
 Finally+0.040
 When+0.038
 up+0.037
Neg logit contributions
 queens-0.059
cious-0.057
mare-0.054
 chicks-0.053
ppy-0.052
Token it
Token ID340
Feature activation+1.27e-04

Pos logit contributions
 him+0.014
anked+0.013
 Finally+0.012
 When+0.011
 up+0.011
Neg logit contributions
 queens-0.021
cious-0.021
mare-0.019
 chicks-0.019
ppy-0.019
Token was
Token ID373
Feature activation-2.11e-02

Pos logit contributions
 queens+0.000
cious+0.000
 chicks+0.000
mare+0.000
 soul+0.000
Neg logit contributions
 him-0.000
anked-0.000
 Finally-0.000
 When-0.000
 up-0.000
Token as
Token ID355
Feature activation+8.31e-03

Pos logit contributions
anked+0.035
 him+0.035
 Finally+0.033
 When+0.031
atches+0.031
Neg logit contributions
 queens-0.034
cious-0.029
 forgot-0.028
mare-0.028
 chicks-0.028
Token big
Token ID1263
Feature activation-7.34e-02

Pos logit contributions
cious+0.015
mare+0.014
 queens+0.014
ppy+0.014
 chicks+0.014
Neg logit contributions
 him-0.013
anked-0.011
 Finally-0.011
 up-0.011
 his-0.010
Token as
Token ID355
Feature activation-7.67e-03

Pos logit contributions
 Finally+0.103
anked+0.098
Years+0.091
 him+0.085
atches+0.084
Neg logit contributions
 queens-0.135
plets-0.115
 chicks-0.114
cious-0.112
 ghosts-0.111
Token Jack
Token ID3619
Feature activation-9.12e-03

Pos logit contributions
 him+0.010
anked+0.009
 Finally+0.009
 When+0.008
 up+0.008
Neg logit contributions
 queens-0.015
cious-0.015
mare-0.014
 chicks-0.014
ppy-0.014
Token!
Token ID0
Feature activation-7.63e-03

Pos logit contributions
 him+0.016
anked+0.015
 Finally+0.014
 When+0.014
 up+0.014
Neg logit contributions
 queens-0.015
cious-0.013
mare-0.013
 chicks-0.013
 soul-0.012
Token (space)
Token ID220
Feature activation-9.63e-03

Pos logit contributions
anked+0.011
 him+0.010
asted+0.010
Years+0.009
 up+0.009
Neg logit contributions
 queens-0.012
mare-0.011
pillar-0.011
 soul-0.011
cious-0.011
Token (newline)
Token ID198
Feature activation-4.39e-03

Pos logit contributions
anked+0.015
 him+0.014
asted+0.014
 drib+0.014
oted+0.013
Neg logit contributions
lessness-0.013
umpy-0.013
opy-0.013
 queens-0.013
ppy-0.012
Token too
Token ID1165
Feature activation+2.07e-02

Pos logit contributions
 him+0.023
anked+0.023
 Finally+0.022
 When+0.021
 Up+0.020
Neg logit contributions
 queens-0.021
cious-0.019
opy-0.018
mare-0.018
ppy-0.018
Token high
Token ID1029
Feature activation-6.38e-02

Pos logit contributions
cious+0.037
 queens+0.037
mare+0.035
 chicks+0.035
ppy+0.033
Neg logit contributions
 him-0.031
anked-0.030
 up-0.027
 Finally-0.027
 When-0.026
Token and
Token ID290
Feature activation+7.39e-03

Pos logit contributions
 Finally+0.097
anked+0.092
Years+0.087
 him+0.086
 When+0.086
Neg logit contributions
 queens-0.101
opy-0.094
cious-0.093
ppy-0.093
cest-0.089
Token too
Token ID1165
Feature activation+1.71e-02

Pos logit contributions
 queens+0.013
cious+0.013
mare+0.012
 chicks+0.012
ppy+0.012
Neg logit contributions
 him-0.012
anked-0.011
 Finally-0.010
 up-0.010
 When-0.010
Token w
Token ID266
Feature activation-2.98e-02

Pos logit contributions
cious+0.037
 queens+0.037
mare+0.035
 chicks+0.035
ppy+0.034
Neg logit contributions
 him-0.020
anked-0.019
 up-0.017
 Finally-0.017
 his-0.015
Tokenobb
Token ID21963
Feature activation-7.25e-02

Pos logit contributions
 him+0.046
 Finally+0.044
 When+0.043
 After+0.041
 Up+0.041
Neg logit contributions
opy-0.049
ppy-0.049
 queens-0.048
cious-0.047
aky-0.047
Tokenly
Token ID306
Feature activation-1.35e-02

Pos logit contributions
 him+0.158
 Finally+0.157
anked+0.150
 When+0.149
 Up+0.149
Neg logit contributions
opy-0.065
ppy-0.064
 queens-0.064
cious-0.061
plets-0.058
Token,
Token ID11
Feature activation+4.49e-03

Pos logit contributions
 him+0.019
anked+0.019
 Finally+0.018
 When+0.017
 Up+0.016
Neg logit contributions
 queens-0.024
cious-0.024
mare-0.022
ppy-0.022
 chicks-0.022
Token and
Token ID290
Feature activation+9.08e-03

Pos logit contributions
 queens+0.008
cious+0.007
mare+0.007
 chicks+0.007
 soul+0.007
Neg logit contributions
 him-0.007
anked-0.007
 Finally-0.006
 When-0.006
 Up-0.006
Token it
Token ID340
Feature activation+3.15e-03

Pos logit contributions
 queens+0.018
cious+0.017
mare+0.017
 chicks+0.016
ppy+0.016
Neg logit contributions
 him-0.012
anked-0.011
 Finally-0.011
 When-0.010
 up-0.010
Token fell
Token ID3214
Feature activation-8.54e-03

Pos logit contributions
 queens+0.005
cious+0.005
mare+0.005
 chicks+0.005
 soul+0.005
Neg logit contributions
 him-0.005
anked-0.005
 Finally-0.005
 up-0.005
 When-0.004
Token grow
Token ID1663
Feature activation+1.08e-02

Pos logit contributions
 him+0.030
anked+0.027
 Finally+0.026
 When+0.025
 up+0.025
Neg logit contributions
 queens-0.032
cious-0.031
mare-0.029
ppy-0.029
opy-0.029
Token until
Token ID1566
Feature activation-1.45e-02

Pos logit contributions
mare+0.021
 queens+0.021
cious+0.020
 soul+0.019
ppy+0.019
Neg logit contributions
 him-0.016
 up-0.015
anked-0.014
 Finally-0.012
 int-0.012
Token she
Token ID673
Feature activation-9.99e-03

Pos logit contributions
 him+0.021
anked+0.020
 Finally+0.019
 When+0.018
 up+0.017
Neg logit contributions
 queens-0.027
cious-0.026
mare-0.024
 chicks-0.024
ppy-0.024
Token became
Token ID2627
Feature activation-6.11e-03

Pos logit contributions
 him+0.016
anked+0.014
 Finally+0.014
 up+0.013
 When+0.013
Neg logit contributions
 queens-0.017
cious-0.016
 chicks-0.016
mare-0.015
lessness-0.015
Token as
Token ID355
Feature activation+5.94e-03

Pos logit contributions
 him+0.010
anked+0.009
 Finally+0.009
 up+0.009
 When+0.008
Neg logit contributions
 queens-0.011
cious-0.010
mare-0.010
ppy-0.009
 chicks-0.009
Token big
Token ID1263
Feature activation-7.20e-02

Pos logit contributions
cious+0.011
mare+0.010
 queens+0.010
ppy+0.010
pillar+0.009
Neg logit contributions
 him-0.009
 up-0.008
anked-0.008
 Finally-0.008
 his-0.008
Token as
Token ID355
Feature activation-6.78e-03

Pos logit contributions
 Finally+0.087
anked+0.086
 him+0.085
 When+0.077
 Up+0.073
Neg logit contributions
 queens-0.145
cious-0.134
 chicks-0.132
 soul-0.128
opy-0.126
Token the
Token ID262
Feature activation-1.12e-02

Pos logit contributions
 him+0.010
anked+0.009
 Finally+0.009
 up+0.008
 When+0.008
Neg logit contributions
 queens-0.013
cious-0.012
mare-0.012
ppy-0.012
 chicks-0.011
Token bubble
Token ID14310
Feature activation-2.20e-03

Pos logit contributions
 him+0.018
 up+0.017
anked+0.017
 drib+0.016
 Finally+0.016
Neg logit contributions
 queens-0.018
cious-0.017
mare-0.016
ppy-0.016
aky-0.015
Token.
Token ID13
Feature activation-2.90e-03

Pos logit contributions
 him+0.004
anked+0.003
 Finally+0.003
 up+0.003
 When+0.003
Neg logit contributions
 queens-0.004
cious-0.004
mare-0.003
 chicks-0.003
ppy-0.003
Token
Token ID220
Feature activation-7.25e-03

Pos logit contributions
 him+0.004
anked+0.003
 Finally+0.003
 up+0.003
 When+0.003
Neg logit contributions
 queens-0.006
cious-0.006
mare-0.005
 soul-0.005
 chicks-0.005
Token bigger
Token ID5749
Feature activation-3.27e-02

Pos logit contributions
 him+0.004
anked+0.004
 Finally+0.003
 up+0.003
 When+0.003
Neg logit contributions
cious-0.005
 queens-0.005
mare-0.005
 chicks-0.005
ppy-0.005
Token until
Token ID1566
Feature activation-1.39e-02

Pos logit contributions
 him+0.046
anked+0.045
 Finally+0.043
 When+0.040
 Up+0.039
Neg logit contributions
 queens-0.061
cious-0.058
 chicks-0.055
mare-0.053
 soul-0.053
Token it
Token ID340
Feature activation-4.01e-03

Pos logit contributions
anked+0.021
 Finally+0.019
 him+0.019
 When+0.018
 Up+0.018
Neg logit contributions
 queens-0.024
cious-0.021
 forgot-0.020
 chicks-0.020
plets-0.020
Token was
Token ID373
Feature activation-1.99e-02

Pos logit contributions
 him+0.006
anked+0.006
 Finally+0.006
 When+0.006
 up+0.005
Neg logit contributions
 queens-0.007
cious-0.007
mare-0.006
 chicks-0.006
 soul-0.006
Token as
Token ID355
Feature activation+5.31e-03

Pos logit contributions
 him+0.033
anked+0.033
 Finally+0.031
 When+0.030
 Up+0.029
Neg logit contributions
 queens-0.032
cious-0.027
 forgot-0.027
ppy-0.027
aky-0.026
Token big
Token ID1263
Feature activation-7.18e-02

Pos logit contributions
cious+0.009
mare+0.009
 queens+0.009
 chicks+0.009
ppy+0.009
Neg logit contributions
 him-0.008
anked-0.007
 Finally-0.007
 up-0.007
 his-0.007
Token as
Token ID355
Feature activation-9.92e-03

Pos logit contributions
 Finally+0.105
anked+0.099
Years+0.096
rawled+0.088
 Up+0.088
Neg logit contributions
 queens-0.128
plets-0.108
 chicks-0.107
 spe-0.105
 soul-0.102
Token Tim
Token ID5045
Feature activation-6.98e-03

Pos logit contributions
anked+0.013
 him+0.013
 Finally+0.012
 When+0.012
 After+0.011
Neg logit contributions
 queens-0.019
cious-0.018
 chicks-0.017
 soul-0.016
mare-0.016
Tokenmy
Token ID1820
Feature activation-1.32e-02

Pos logit contributions
 him+0.011
anked+0.011
 Finally+0.010
 When+0.010
 Up+0.009
Neg logit contributions
 queens-0.012
cious-0.011
ppy-0.010
mare-0.010
 chicks-0.010
Token!
Token ID0
Feature activation-1.14e-02

Pos logit contributions
 him+0.022
anked+0.020
 Finally+0.019
 up+0.018
 When+0.018
Neg logit contributions
 queens-0.023
mare-0.021
cious-0.020
 chicks-0.019
 soul-0.019
Token Tim
Token ID5045
Feature activation-8.35e-03

Pos logit contributions
anked+0.017
 him+0.016
Years+0.016
asted+0.016
oted+0.016
Neg logit contributions
 queens-0.015
plets-0.014
 I-0.014
umpy-0.014
lessness-0.014
Token.
Token ID13
Feature activation+1.51e-03

Pos logit contributions
 queens+0.053
cious+0.051
 chicks+0.050
mare+0.048
ppy+0.048
Neg logit contributions
 him-0.029
anked-0.026
 Finally-0.024
 up-0.023
 When-0.022
Token It
Token ID632
Feature activation+2.60e-03

Pos logit contributions
 queens+0.002
cious+0.002
mare+0.002
 soul+0.002
lessness+0.002
Neg logit contributions
 him-0.002
anked-0.002
 up-0.002
 drib-0.002
 Finally-0.002
Token was
Token ID373
Feature activation-1.06e-02

Pos logit contributions
 queens+0.005
cious+0.005
mare+0.005
 chicks+0.005
 soul+0.005
Neg logit contributions
 him-0.004
anked-0.003
 up-0.003
 Finally-0.003
 When-0.003
Token now
Token ID783
Feature activation-3.45e-03

Pos logit contributions
 him+0.018
anked+0.017
 Finally+0.017
 When+0.016
 Up+0.015
Neg logit contributions
 queens-0.017
cious-0.016
 chicks-0.015
mare-0.015
 soul-0.014
Token as
Token ID355
Feature activation+6.62e-03

Pos logit contributions
 him+0.005
anked+0.005
 Finally+0.005
 up+0.005
 When+0.004
Neg logit contributions
 queens-0.006
cious-0.006
mare-0.006
ppy-0.005
 chicks-0.005
Token big
Token ID1263
Feature activation-7.15e-02

Pos logit contributions
cious+0.012
mare+0.012
 queens+0.012
 chicks+0.011
ppy+0.011
Neg logit contributions
 him-0.010
anked-0.009
 up-0.009
 Finally-0.008
 his-0.008
Token as
Token ID355
Feature activation-6.30e-03

Pos logit contributions
 Finally+0.096
anked+0.092
 him+0.084
atches+0.079
 When+0.078
Neg logit contributions
 queens-0.136
 chicks-0.120
cious-0.120
 soul-0.119
 ghosts-0.117
Token a
Token ID257
Feature activation-2.54e-02

Pos logit contributions
 him+0.009
anked+0.008
 Finally+0.008
 When+0.007
 up+0.007
Neg logit contributions
 queens-0.012
cious-0.012
mare-0.011
 chicks-0.011
ppy-0.011
Token tree
Token ID5509
Feature activation-6.54e-03

Pos logit contributions
 him+0.048
anked+0.046
 Finally+0.045
 When+0.042
 up+0.041
Neg logit contributions
 queens-0.035
cious-0.034
mare-0.031
ppy-0.031
 chicks-0.030
Token.
Token ID13
Feature activation+4.49e-04

Pos logit contributions
 him+0.009
anked+0.008
 Finally+0.008
 up+0.007
 When+0.007
Neg logit contributions
 queens-0.013
cious-0.013
mare-0.012
 chicks-0.012
ppy-0.012
Token She
Token ID1375
Feature activation+4.52e-03

Pos logit contributions
 queens+0.001
cious+0.001
mare+0.001
 soul+0.001
lessness+0.001
Neg logit contributions
 him-0.001
anked-0.001
 up-0.001
 drib-0.001
 Timothy-0.001
Token.
Token ID13
Feature activation+8.04e-03

Pos logit contributions
 queens+0.014
cious+0.014
 chicks+0.013
ppy+0.013
mare+0.013
Neg logit contributions
 him-0.011
anked-0.011
 Finally-0.011
 When-0.010
 After-0.009
Token They
Token ID1119
Feature activation+6.38e-03

Pos logit contributions
 queens+0.015
cious+0.014
mare+0.013
 chicks+0.013
 soul+0.013
Neg logit contributions
anked-0.011
 him-0.011
 up-0.010
 drib-0.009
 Finally-0.009
Token are
Token ID389
Feature activation+4.84e-03

Pos logit contributions
 queens+0.013
cious+0.013
mare+0.012
pillar+0.011
 chicks+0.011
Neg logit contributions
 him-0.009
 up-0.008
 drib-0.007
anked-0.007
 Finally-0.007
Token not
Token ID407
Feature activation+8.56e-03

Pos logit contributions
 queens+0.008
cious+0.008
mare+0.008
pillar+0.008
ppy+0.008
Neg logit contributions
 him-0.008
 up-0.007
anked-0.007
 Finally-0.007
 When-0.006
Token as
Token ID355
Feature activation-1.20e-03

Pos logit contributions
pillar+0.015
cious+0.015
mare+0.015
 zone+0.014
ifying+0.014
Neg logit contributions
 him-0.013
 up-0.012
 his-0.012
 Tim-0.011
anked-0.011
Token big
Token ID1263
Feature activation-7.08e-02

Pos logit contributions
 him+0.002
 up+0.002
 Tim+0.002
 his+0.002
anked+0.001
Neg logit contributions
cious-0.002
mare-0.002
pillar-0.002
ppy-0.002
 zone-0.002
Token or
Token ID393
Feature activation+1.02e-02

Pos logit contributions
 him+0.086
anked+0.083
 Finally+0.082
 When+0.073
 Up+0.069
Neg logit contributions
 queens-0.146
cious-0.137
 chicks-0.133
 soul-0.129
mare-0.127
Token as
Token ID355
Feature activation-4.99e-03

Pos logit contributions
cious+0.020
pillar+0.020
mare+0.020
ppy+0.018
opy+0.018
Neg logit contributions
 him-0.015
 up-0.014
 his-0.013
 Tim-0.012
anked-0.011
Token bright
Token ID6016
Feature activation-5.21e-02

Pos logit contributions
 him+0.007
anked+0.005
 up+0.005
 Finally+0.005
 Tim+0.005
Neg logit contributions
cious-0.010
mare-0.010
 queens-0.010
pillar-0.009
ppy-0.009
Token as
Token ID355
Feature activation-5.42e-03

Pos logit contributions
 him+0.059
 Finally+0.059
anked+0.058
 When+0.051
 Up+0.049
Neg logit contributions
 queens-0.108
cious-0.100
 chicks-0.099
 soul-0.096
plets-0.095
Token the
Token ID262
Feature activation-7.73e-03

Pos logit contributions
 him+0.007
anked+0.007
 Finally+0.006
 up+0.006
 When+0.006
Neg logit contributions
 queens-0.011
cious-0.010
mare-0.010
 chicks-0.010
ppy-0.010
Token is
Token ID318
Feature activation-7.89e-03

Pos logit contributions
 him+0.003
anked+0.003
 up+0.003
 Finally+0.003
 When+0.003
Neg logit contributions
 queens-0.004
cious-0.004
mare-0.004
 chicks-0.003
 soul-0.003
Token too
Token ID1165
Feature activation+2.74e-02

Pos logit contributions
 him+0.012
anked+0.011
 Finally+0.011
 When+0.010
 up+0.010
Neg logit contributions
 queens-0.014
cious-0.014
mare-0.013
 chicks-0.012
ppy-0.012
Token small
Token ID1402
Feature activation-2.42e-02

Pos logit contributions
 queens+0.053
cious+0.052
mare+0.051
ppy+0.048
 soul+0.046
Neg logit contributions
 him-0.036
 up-0.035
anked-0.035
 Finally-0.034
 When-0.032
Token to
Token ID284
Feature activation-2.40e-03

Pos logit contributions
 him+0.027
 up+0.023
anked+0.022
 Finally+0.021
 When+0.020
Neg logit contributions
cious-0.055
 queens-0.054
mare-0.052
plets-0.050
 soul-0.050
Token do
Token ID466
Feature activation-2.23e-02

Pos logit contributions
 him+0.004
anked+0.003
 Finally+0.003
 When+0.003
 up+0.003
Neg logit contributions
 queens-0.004
cious-0.004
mare-0.004
 chicks-0.004
ppy-0.004
Token big
Token ID1263
Feature activation-7.07e-02

Pos logit contributions
 him+0.033
anked+0.030
 Finally+0.029
 up+0.028
 When+0.027
Neg logit contributions
 queens-0.042
cious-0.040
mare-0.040
 chicks-0.038
 soul-0.037
Token jobs
Token ID3946
Feature activation+1.03e-02

Pos logit contributions
 him+0.102
anked+0.096
 Finally+0.091
 When+0.085
 up+0.083
Neg logit contributions
 queens-0.132
cious-0.128
mare-0.121
 chicks-0.118
ppy-0.117
Token."
Token ID526
Feature activation-1.30e-02

Pos logit contributions
 queens+0.020
cious+0.020
mare+0.019
 chicks+0.019
 soul+0.018
Neg logit contributions
 him-0.014
anked-0.012
 up-0.012
 Finally-0.012
 When-0.011
Token Liz
Token ID28313
Feature activation-2.83e-03

Pos logit contributions
 him+0.021
anked+0.021
asted+0.019
oted+0.019
rew+0.018
Neg logit contributions
lessness-0.017
 queens-0.017
cious-0.017
 forgot-0.017
mare-0.016
Token looked
Token ID3114
Feature activation-3.18e-02

Pos logit contributions
 him+0.005
anked+0.005
 Finally+0.005
 When+0.004
 up+0.004
Neg logit contributions
 queens-0.004
cious-0.004
mare-0.004
 chicks-0.004
ppy-0.004
Token sad
Token ID6507
Feature activation+3.27e-03

Pos logit contributions
 him+0.038
anked+0.038
 Finally+0.034
 Timothy+0.031
 Up+0.030
Neg logit contributions
 queens-0.061
cious-0.059
 chicks-0.059
 forgot-0.057
 soul-0.057
Token finished
Token ID5201
Feature activation-3.50e-02

Pos logit contributions
 him+0.015
anked+0.014
 Finally+0.014
 When+0.013
 up+0.012
Neg logit contributions
 queens-0.017
cious-0.017
mare-0.016
 chicks-0.016
ppy-0.015
Token it
Token ID340
Feature activation-6.68e-03

Pos logit contributions
 him+0.049
anked+0.048
 Finally+0.047
 When+0.043
 Up+0.042
Neg logit contributions
 queens-0.064
cious-0.062
mare-0.057
 chicks-0.057
opy-0.056
Token was
Token ID373
Feature activation-1.68e-02

Pos logit contributions
 him+0.009
anked+0.009
 Finally+0.008
 When+0.008
 Up+0.007
Neg logit contributions
 queens-0.013
cious-0.012
mare-0.011
 chicks-0.011
ppy-0.011
Token almost
Token ID2048
Feature activation-1.10e-02

Pos logit contributions
 him+0.027
anked+0.026
 Finally+0.024
 When+0.023
 Up+0.023
Neg logit contributions
 queens-0.028
cious-0.026
mare-0.025
 forgot-0.024
ppy-0.024
Token as
Token ID355
Feature activation+2.09e-03

Pos logit contributions
anked+0.016
 him+0.016
 Finally+0.015
 Up+0.014
 Slowly+0.014
Neg logit contributions
 queens-0.018
 forgot-0.017
ppy-0.016
mare-0.016
lessness-0.016
Token big
Token ID1263
Feature activation-7.04e-02

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.003
 chicks+0.003
ppy+0.003
Neg logit contributions
 him-0.003
anked-0.003
 Finally-0.003
 When-0.003
 up-0.003
Token as
Token ID355
Feature activation-5.63e-03

Pos logit contributions
 Finally+0.092
anked+0.088
 him+0.083
 When+0.076
 After+0.076
Neg logit contributions
 queens-0.137
 chicks-0.120
cious-0.120
 soul-0.119
plets-0.116
Token her
Token ID607
Feature activation-1.03e-02

Pos logit contributions
 him+0.008
anked+0.008
 Finally+0.007
 When+0.007
 up+0.007
Neg logit contributions
 queens-0.010
cious-0.010
mare-0.010
ppy-0.009
 chicks-0.009
Token bedroom
Token ID14043
Feature activation+1.24e-03

Pos logit contributions
 him+0.018
anked+0.017
 Finally+0.016
 When+0.015
 Up+0.015
Neg logit contributions
 queens-0.016
cious-0.016
 chicks-0.015
mare-0.015
 soul-0.015
Token!
Token ID0
Feature activation-7.28e-03

Pos logit contributions
 queens+0.003
cious+0.002
 chicks+0.002
mare+0.002
ppy+0.002
Neg logit contributions
 him-0.002
anked-0.002
 Finally-0.001
 up-0.001
 When-0.001
Token
Token ID198
Feature activation+1.04e-03

Pos logit contributions
 him+0.011
anked+0.011
 up+0.010
 drib+0.010
Years+0.010
Neg logit contributions
 queens-0.011
cious-0.011
pillar-0.011
mare-0.011
 soul-0.010
Token Soon
Token ID15894
Feature activation-2.03e-02

Pos logit contributions
 queens+0.018
cious+0.017
 soul+0.016
mare+0.016
 chicks+0.016
Neg logit contributions
 him-0.015
anked-0.015
 up-0.012
 Timothy-0.012
atches-0.012
Token it
Token ID340
Feature activation+1.82e-03

Pos logit contributions
 him+0.025
anked+0.022
 Finally+0.022
 When+0.020
 up+0.020
Neg logit contributions
 queens-0.042
cious-0.042
mare-0.040
ppy-0.039
 chicks-0.039
Token was
Token ID373
Feature activation-1.32e-02

Pos logit contributions
 queens+0.004
cious+0.004
mare+0.004
 chicks+0.004
ppy+0.003
Neg logit contributions
 him-0.002
anked-0.002
 up-0.002
 Finally-0.002
 Up-0.002
Token twice
Token ID5403
Feature activation+1.46e-02

Pos logit contributions
anked+0.021
 him+0.021
 Finally+0.019
atches+0.019
 Up+0.018
Neg logit contributions
 queens-0.020
mare-0.018
 forgot-0.018
cious-0.018
aky-0.018
Token as
Token ID355
Feature activation+1.06e-02

Pos logit contributions
 queens+0.029
cious+0.027
ppy+0.027
 chicks+0.027
mare+0.027
Neg logit contributions
 him-0.017
anked-0.017
 Finally-0.017
 When-0.014
 After-0.014
Token big
Token ID1263
Feature activation-7.04e-02

Pos logit contributions
cious+0.020
 queens+0.020
mare+0.019
ppy+0.019
 chicks+0.018
Neg logit contributions
 him-0.015
anked-0.014
 Finally-0.013
 up-0.012
 When-0.012
Token.
Token ID13
Feature activation+7.92e-03

Pos logit contributions
 Finally+0.091
anked+0.085
 him+0.083
atches+0.074
 When+0.073
Neg logit contributions
 queens-0.137
 chicks-0.122
 soul-0.120
cious-0.118
 ghosts-0.115
Token Sarah
Token ID10490
Feature activation-7.37e-03

Pos logit contributions
 queens+0.015
cious+0.014
mare+0.014
 soul+0.013
 chicks+0.013
Neg logit contributions
 him-0.011
anked-0.010
 Finally-0.009
 up-0.009
 drib-0.008
Token was
Token ID373
Feature activation-3.09e-03

Pos logit contributions
 him+0.011
anked+0.010
 Finally+0.010
 up+0.010
 drib+0.009
Neg logit contributions
 queens-0.014
cious-0.012
mare-0.012
lessness-0.012
plets-0.012
Token so
Token ID523
Feature activation+1.49e-02

Pos logit contributions
 him+0.004
 up+0.004
anked+0.004
 Finally+0.004
 When+0.004
Neg logit contributions
 queens-0.006
cious-0.006
ppy-0.006
mare-0.006
plets-0.005
Token excited
Token ID6568
Feature activation-1.23e-02

Pos logit contributions
 queens+0.028
cious+0.027
mare+0.026
ppy+0.025
 chicks+0.025
Neg logit contributions
 him-0.022
anked-0.022
 up-0.021
 Finally-0.020
 When-0.020
Token mushroom
Token ID28520
Feature activation-6.34e-03

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.003
ppy+0.003
 chicks+0.003
Neg logit contributions
 him-0.004
anked-0.003
 Finally-0.003
 When-0.003
 up-0.003
Token.
Token ID13
Feature activation+1.62e-03

Pos logit contributions
 him+0.009
anked+0.009
 Finally+0.008
 When+0.008
 Up+0.007
Neg logit contributions
 queens-0.012
cious-0.011
mare-0.011
 chicks-0.011
ppy-0.010
Token It
Token ID632
Feature activation+4.17e-04

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.002
 soul+0.002
pillar+0.002
Neg logit contributions
anked-0.003
 him-0.003
 Finally-0.002
Years-0.002
 int-0.002
Token was
Token ID373
Feature activation-7.39e-03

Pos logit contributions
 queens+0.001
cious+0.001
mare+0.001
 chicks+0.001
ppy+0.001
Neg logit contributions
 him-0.001
anked-0.001
 Finally-0.001
 When-0.001
 up-0.001
Token as
Token ID355
Feature activation+1.54e-02

Pos logit contributions
anked+0.014
 him+0.013
 Finally+0.013
atches+0.012
 Up+0.012
Neg logit contributions
 queens-0.010
cious-0.009
 chicks-0.009
 bee-0.009
mare-0.008
Token big
Token ID1263
Feature activation-6.99e-02

Pos logit contributions
cious+0.030
mare+0.029
plets+0.027
pillar+0.027
ppy+0.027
Neg logit contributions
 him-0.021
 up-0.019
 his-0.019
anked-0.018
 Tim-0.017
Token as
Token ID355
Feature activation-3.23e-03

Pos logit contributions
 Finally+0.091
anked+0.091
 him+0.086
Years+0.078
 int+0.076
Neg logit contributions
 queens-0.138
 chicks-0.120
cious-0.120
 witches-0.115
 bee-0.114
Token her
Token ID607
Feature activation-8.61e-03

Pos logit contributions
 him+0.004
anked+0.004
 Finally+0.004
 up+0.003
 When+0.003
Neg logit contributions
cious-0.006
 queens-0.006
mare-0.006
 chicks-0.006
opy-0.006
Token head
Token ID1182
Feature activation+2.38e-03

Pos logit contributions
 him+0.015
anked+0.015
 Finally+0.014
atches+0.014
 When+0.013
Neg logit contributions
cious-0.012
 queens-0.012
 soul-0.012
 chicks-0.011
ppy-0.011
Token!
Token ID0
Feature activation-8.20e-03

Pos logit contributions
 queens+0.005
cious+0.004
mare+0.004
 chicks+0.004
ppy+0.004
Neg logit contributions
 him-0.003
anked-0.003
 Finally-0.003
 When-0.003
 up-0.003
Token
Token ID220
Feature activation-5.25e-03

Pos logit contributions
 him+0.012
anked+0.012
asted+0.010
Years+0.010
 int+0.010
Neg logit contributions
 queens-0.014
mare-0.013
cious-0.013
pillar-0.013
 soul-0.012
Token.
Token ID13
Feature activation+3.40e-03

Pos logit contributions
 him+0.001
anked+0.001
 Finally+0.001
 When+0.001
 Up+0.001
Neg logit contributions
 queens-0.002
cious-0.002
mare-0.002
 chicks-0.002
ppy-0.002
Token It
Token ID632
Feature activation-5.68e-04

Pos logit contributions
 queens+0.007
cious+0.007
mare+0.006
 chicks+0.006
ppy+0.006
Neg logit contributions
 him-0.004
anked-0.004
 Finally-0.004
 When-0.004
 Up-0.004
Token was
Token ID373
Feature activation-8.61e-03

Pos logit contributions
 him+0.001
anked+0.001
 Finally+0.001
 When+0.001
 up+0.001
Neg logit contributions
 queens-0.001
cious-0.001
mare-0.001
 chicks-0.001
ppy-0.001
Token not
Token ID407
Feature activation+1.45e-02

Pos logit contributions
 him+0.015
anked+0.015
 Finally+0.014
 Up+0.013
 When+0.013
Neg logit contributions
 queens-0.013
cious-0.012
 chicks-0.012
mare-0.011
 bee-0.011
Token as
Token ID355
Feature activation+4.40e-03

Pos logit contributions
 queens+0.028
cious+0.027
mare+0.025
pillar+0.025
opy+0.025
Neg logit contributions
 him-0.020
anked-0.018
 up-0.017
 Finally-0.017
 When-0.016
Token impressive
Token ID8036
Feature activation-6.88e-02

Pos logit contributions
cious+0.008
mare+0.008
pillar+0.008
plets+0.008
opy+0.008
Neg logit contributions
 him-0.006
 up-0.005
 his-0.005
 Tim-0.005
anked-0.005
Token,
Token ID11
Feature activation+4.17e-03

Pos logit contributions
 Finally+0.084
 him+0.083
anked+0.082
Years+0.072
 Up+0.072
Neg logit contributions
 queens-0.139
cious-0.128
 chicks-0.122
 witches-0.119
 soul-0.118
Token but
Token ID475
Feature activation-1.94e-02

Pos logit contributions
 queens+0.008
cious+0.007
mare+0.007
 chicks+0.007
ppy+0.007
Neg logit contributions
 him-0.007
anked-0.006
 Finally-0.006
 up-0.005
 When-0.005
Token it
Token ID340
Feature activation+5.52e-03

Pos logit contributions
 him+0.027
anked+0.024
 Finally+0.023
 up+0.023
 When+0.022
Neg logit contributions
 queens-0.039
cious-0.039
mare-0.037
 chicks-0.036
ppy-0.036
Token was
Token ID373
Feature activation-1.87e-02

Pos logit contributions
 queens+0.010
cious+0.010
mare+0.009
plets+0.009
 chicks+0.009
Neg logit contributions
 him-0.008
anked-0.008
 up-0.007
 Finally-0.007
 When-0.007
Token still
Token ID991
Feature activation+2.42e-03

Pos logit contributions
 him+0.030
anked+0.029
 Finally+0.027
 When+0.026
 up+0.025
Neg logit contributions
 queens-0.032
cious-0.031
mare-0.029
 chicks-0.028
ppy-0.028
TokenTim
Token ID14967
Feature activation-8.67e-03

Pos logit contributions
 him+0.002
 up+0.002
anked+0.002
 int+0.002
aled+0.002
Neg logit contributions
 queens-0.001
aky-0.001
cious-0.001
umpy-0.001
 forgot-0.001
Token grabbed
Token ID13176
Feature activation-1.41e-02

Pos logit contributions
 him+0.014
anked+0.013
 Finally+0.013
 When+0.012
 up+0.012
Neg logit contributions
 queens-0.015
cious-0.014
mare-0.013
 chicks-0.013
ppy-0.013
Token the
Token ID262
Feature activation+1.64e-03

Pos logit contributions
 Finally+0.021
anked+0.021
Years+0.019
rawled+0.018
atches+0.018
Neg logit contributions
 queens-0.020
 soul-0.020
 thumb-0.019
 lazy-0.019
mare-0.019
Token hook
Token ID8011
Feature activation-7.48e-03

Pos logit contributions
 queens+0.003
cious+0.002
mare+0.002
 chicks+0.002
ppy+0.002
Neg logit contributions
 him-0.003
anked-0.003
 Finally-0.003
 When-0.003
 up-0.002
Token and
Token ID290
Feature activation-4.08e-03

Pos logit contributions
 him+0.011
anked+0.011
 Finally+0.011
 When+0.010
 After+0.009
Neg logit contributions
 queens-0.013
cious-0.013
 chicks-0.012
ppy-0.012
mare-0.012
Token anx
Token ID7296
Feature activation-6.87e-02

Pos logit contributions
 him+0.006
anked+0.006
 Finally+0.006
 When+0.006
 Up+0.005
Neg logit contributions
 queens-0.007
cious-0.007
mare-0.006
 chicks-0.006
ppy-0.006
Tokeniously
Token ID6819
Feature activation-1.94e-02

Pos logit contributions
 Finally+0.126
 Up+0.124
Here+0.122
 him+0.117
ready+0.115
Neg logit contributions
cious-0.086
ppy-0.079
 soul-0.078
 throat-0.078
ious-0.078
Token put
Token ID1234
Feature activation-1.59e-02

Pos logit contributions
 him+0.028
anked+0.027
 Finally+0.026
 When+0.024
 Up+0.023
Neg logit contributions
 queens-0.035
cious-0.034
ppy-0.032
 forgot-0.032
mare-0.031
Token it
Token ID340
Feature activation-3.23e-04

Pos logit contributions
anked+0.030
 Finally+0.028
Years+0.027
 Slowly+0.026
 When+0.026
Neg logit contributions
 queens-0.017
lessness-0.016
 soul-0.016
cious-0.016
 cloud-0.016
Token in
Token ID287
Feature activation-3.21e-03

Pos logit contributions
anked+0.001
 Finally+0.001
 him+0.000
 When+0.000
 Slowly+0.000
Neg logit contributions
 queens-0.000
ppy-0.000
cious-0.000
aky-0.000
lessness-0.000
Token the
Token ID262
Feature activation+6.73e-03

Pos logit contributions
anked+0.004
 Finally+0.003
 Up+0.003
 Slowly+0.003
Years+0.003
Neg logit contributions
 queens-0.006
 soul-0.006
mare-0.006
cious-0.006
 cloud-0.006
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
anked+0.029
 him+0.028
 Finally+0.028
 When+0.025
 After+0.025
Neg logit contributions
 queens-0.034
cious-0.033
ppy-0.031
mare-0.031
 chicks-0.030
Token It
Token ID632
Feature activation+1.32e-03

Pos logit contributions
anked+0.021
 him+0.020
asted+0.020
Years+0.020
oted+0.020
Neg logit contributions
 I-0.015
 queens-0.015
cious-0.015
 chicks-0.014
mare-0.014
Token wasn
Token ID2492
Feature activation-1.99e-02

Pos logit contributions
 queens+0.002
cious+0.002
mare+0.002
 chicks+0.002
ppy+0.002
Neg logit contributions
 him-0.002
anked-0.002
 Finally-0.002
 When-0.002
 up-0.002
Token't
Token ID470
Feature activation+1.13e-02

Pos logit contributions
anked+0.039
 Finally+0.039
 him+0.036
 Slowly+0.036
 After+0.035
Neg logit contributions
 queens-0.023
opy-0.020
plets-0.019
ppy-0.019
chie-0.019
Token as
Token ID355
Feature activation-1.29e-03

Pos logit contributions
 queens+0.021
cious+0.019
mare+0.019
 chicks+0.018
ppy+0.018
Neg logit contributions
anked-0.016
 him-0.016
 Finally-0.015
 When-0.014
 Up-0.014
Token big
Token ID1263
Feature activation-6.86e-02

Pos logit contributions
 him+0.002
anked+0.002
 Finally+0.002
 When+0.002
 up+0.002
Neg logit contributions
 queens-0.002
cious-0.002
mare-0.002
 chicks-0.002
ppy-0.002
Token as
Token ID355
Feature activation-8.51e-03

Pos logit contributions
 Finally+0.093
anked+0.090
 him+0.082
Years+0.080
 Up+0.080
Neg logit contributions
 queens-0.130
 soul-0.113
cious-0.112
 chicks-0.112
cest-0.111
Token the
Token ID262
Feature activation-1.11e-02

Pos logit contributions
 him+0.010
anked+0.010
 Finally+0.009
 When+0.009
 up+0.008
Neg logit contributions
 queens-0.018
cious-0.017
mare-0.016
 chicks-0.016
ppy-0.016
Token one
Token ID530
Feature activation+1.35e-03

Pos logit contributions
anked+0.018
 him+0.017
 Finally+0.016
 When+0.015
 Up+0.015
Neg logit contributions
 queens-0.019
cious-0.017
 chicks-0.017
 soul-0.017
mare-0.016
Token he
Token ID339
Feature activation+5.09e-03

Pos logit contributions
 queens+0.002
cious+0.002
 soul+0.002
 chicks+0.002
mare+0.002
Neg logit contributions
anked-0.002
 him-0.002
 Finally-0.002
 When-0.002
 After-0.002
Token admired
Token ID29382
Feature activation-2.01e-02

Pos logit contributions
 queens+0.007
cious+0.006
mare+0.006
 chicks+0.006
ppy+0.005
Neg logit contributions
 him-0.010
anked-0.010
 Finally-0.010
 When-0.009
 up-0.009
Token bigger
Token ID5749
Feature activation-3.08e-02

Pos logit contributions
cious+0.004
mare+0.004
 queens+0.003
 chicks+0.003
ppy+0.003
Neg logit contributions
 him-0.002
 up-0.002
 Finally-0.002
anked-0.002
 When-0.002
Token.
Token ID13
Feature activation-1.73e-03

Pos logit contributions
 him+0.042
anked+0.040
 Finally+0.037
 When+0.035
 up+0.034
Neg logit contributions
 queens-0.060
cious-0.058
mare-0.055
 chicks-0.054
ppy-0.053
Token It
Token ID632
Feature activation+7.87e-03

Pos logit contributions
anked+0.002
 him+0.002
 up+0.002
 Finally+0.002
Years+0.002
Neg logit contributions
 queens-0.003
cious-0.003
mare-0.003
 chicks-0.003
 soul-0.003
Token becomes
Token ID4329
Feature activation+1.02e-02

Pos logit contributions
 queens+0.013
cious+0.012
mare+0.012
 chicks+0.011
ppy+0.011
Neg logit contributions
 him-0.013
anked-0.013
 Finally-0.012
 up-0.012
 When-0.012
Token as
Token ID355
Feature activation+7.64e-03

Pos logit contributions
 queens+0.017
cious+0.017
mare+0.016
 chicks+0.015
ppy+0.015
Neg logit contributions
 him-0.016
anked-0.015
 Finally-0.015
 When-0.014
 up-0.014
Token big
Token ID1263
Feature activation-6.75e-02

Pos logit contributions
cious+0.014
mare+0.014
 queens+0.014
ppy+0.013
pillar+0.013
Neg logit contributions
 him-0.011
anked-0.010
 up-0.010
 Finally-0.010
 his-0.010
Token as
Token ID355
Feature activation-1.84e-03

Pos logit contributions
anked+0.083
 him+0.082
 Finally+0.082
 When+0.072
 Up+0.069
Neg logit contributions
 queens-0.135
cious-0.123
 chicks-0.121
 soul-0.118
plets-0.117
Token Anna
Token ID11735
Feature activation+1.26e-02

Pos logit contributions
 him+0.002
anked+0.002
 Finally+0.002
 up+0.002
 When+0.002
Neg logit contributions
 queens-0.004
cious-0.004
mare-0.003
ppy-0.003
 chicks-0.003
Token's
Token ID338
Feature activation-1.03e-02

Pos logit contributions
 queens+0.024
cious+0.022
mare+0.022
 chicks+0.021
lessness+0.021
Neg logit contributions
 him-0.018
anked-0.017
 Finally-0.016
 When-0.015
 up-0.015
Token head
Token ID1182
Feature activation-5.97e-04

Pos logit contributions
 him+0.019
anked+0.019
 Finally+0.018
 When+0.017
 up+0.016
Neg logit contributions
 queens-0.014
cious-0.014
 chicks-0.013
mare-0.013
 soul-0.012
Token,
Token ID11
Feature activation+4.23e-03

Pos logit contributions
 him+0.001
anked+0.001
 Finally+0.001
 up+0.001
 When+0.001
Neg logit contributions
 queens-0.001
cious-0.001
mare-0.001
 chicks-0.001
ppy-0.001
Token flowers
Token ID12734
Feature activation-1.92e-03

Pos logit contributions
 queens+0.011
cious+0.010
mare+0.010
ppy+0.010
plets+0.010
Neg logit contributions
 him-0.013
anked-0.012
 Finally-0.011
 up-0.011
 When-0.011
Token were
Token ID547
Feature activation-1.46e-02

Pos logit contributions
 him+0.003
anked+0.002
 up+0.002
 Finally+0.002
 When+0.002
Neg logit contributions
 queens-0.004
cious-0.004
mare-0.004
ppy-0.003
 soul-0.003
Token not
Token ID407
Feature activation+9.80e-03

Pos logit contributions
 him+0.021
 up+0.019
anked+0.019
 his+0.017
 Finally+0.017
Neg logit contributions
cious-0.028
 queens-0.027
mare-0.026
pillar-0.026
ppy-0.025
Token growing
Token ID3957
Feature activation+2.20e-03

Pos logit contributions
cious+0.017
 queens+0.017
pillar+0.016
mare+0.016
 chicks+0.015
Neg logit contributions
 him-0.015
anked-0.014
 up-0.013
 Finally-0.013
 his-0.012
Token as
Token ID355
Feature activation+4.42e-03

Pos logit contributions
cious+0.005
 queens+0.004
mare+0.004
 soul+0.004
pillar+0.004
Neg logit contributions
 him-0.003
 up-0.003
anked-0.002
 Finally-0.002
 his-0.002
Token big
Token ID1263
Feature activation-6.68e-02

Pos logit contributions
cious+0.009
 queens+0.009
mare+0.008
pillar+0.008
ppy+0.008
Neg logit contributions
 him-0.006
anked-0.005
 up-0.005
 his-0.005
 Finally-0.005
Token and
Token ID290
Feature activation+1.84e-02

Pos logit contributions
 him+0.084
anked+0.080
 Finally+0.077
 When+0.070
 Up+0.067
Neg logit contributions
 queens-0.136
cious-0.130
 chicks-0.122
mare-0.122
ppy-0.120
Token strong
Token ID1913
Feature activation-2.70e-02

Pos logit contributions
cious+0.035
mare+0.034
ppy+0.033
 queens+0.032
ox+0.032
Neg logit contributions
 him-0.028
 up-0.027
 his-0.026
 Tim-0.026
 Finally-0.022
Token as
Token ID355
Feature activation-1.08e-02

Pos logit contributions
 him+0.035
anked+0.033
 Finally+0.031
 When+0.029
 up+0.028
Neg logit contributions
 queens-0.054
cious-0.053
mare-0.050
 chicks-0.049
ppy-0.049
Token they
Token ID484
Feature activation-4.01e-03

Pos logit contributions
 him+0.014
anked+0.013
 Finally+0.013
 When+0.012
 up+0.011
Neg logit contributions
 queens-0.021
cious-0.021
mare-0.020
 chicks-0.019
ppy-0.019
Token used
Token ID973
Feature activation+2.98e-03

Pos logit contributions
 him+0.007
 Finally+0.006
anked+0.006
 When+0.005
 Tim+0.005
Neg logit contributions
 queens-0.007
cious-0.007
mare-0.007
cest-0.006
 soul-0.006
Token his
Token ID465
Feature activation-1.72e-03

Pos logit contributions
 him+0.007
anked+0.007
 Finally+0.007
 When+0.006
 up+0.006
Neg logit contributions
 queens-0.013
cious-0.012
mare-0.012
 chicks-0.011
ppy-0.011
Token magnet
Token ID19972
Feature activation-2.32e-04

Pos logit contributions
 him+0.003
anked+0.003
 Finally+0.002
 When+0.002
 up+0.002
Neg logit contributions
 queens-0.003
cious-0.003
mare-0.003
 chicks-0.003
ppy-0.003
Token was
Token ID373
Feature activation-1.45e-02

Pos logit contributions
 him+0.000
anked+0.000
 Finally+0.000
 up+0.000
 When+0.000
Neg logit contributions
 queens-0.000
cious-0.000
mare-0.000
 chicks-0.000
ppy-0.000
Token not
Token ID407
Feature activation+9.78e-03

Pos logit contributions
 him+0.024
anked+0.023
 Finally+0.022
 When+0.021
 Up+0.020
Neg logit contributions
 queens-0.024
cious-0.023
mare-0.021
 chicks-0.021
ppy-0.021
Token as
Token ID355
Feature activation-3.08e-03

Pos logit contributions
 queens+0.019
cious+0.018
mare+0.017
 chicks+0.017
ppy+0.017
Neg logit contributions
 him-0.014
anked-0.013
 Finally-0.012
 When-0.011
 up-0.011
Token big
Token ID1263
Feature activation-6.67e-02

Pos logit contributions
 him+0.005
anked+0.005
 Finally+0.005
 When+0.004
 up+0.004
Neg logit contributions
 queens-0.005
cious-0.005
mare-0.004
 chicks-0.004
ppy-0.004
Token and
Token ID290
Feature activation+6.35e-03

Pos logit contributions
 Finally+0.093
anked+0.090
 him+0.085
 When+0.083
 Up+0.082
Neg logit contributions
 queens-0.122
 soul-0.107
 chicks-0.106
cious-0.105
opy-0.103
Token colorful
Token ID20239
Feature activation-2.40e-02

Pos logit contributions
cious+0.012
 queens+0.011
mare+0.011
 chicks+0.011
ppy+0.011
Neg logit contributions
 him-0.010
anked-0.009
 up-0.008
 Finally-0.008
 his-0.008
Token as
Token ID355
Feature activation-1.13e-02

Pos logit contributions
 Finally+0.037
anked+0.036
 him+0.035
 When+0.033
Years+0.032
Neg logit contributions
 queens-0.041
cious-0.036
 soul-0.035
 chicks-0.035
opy-0.034
Token Sarah
Token ID10490
Feature activation+5.39e-03

Pos logit contributions
 him+0.013
anked+0.013
 Finally+0.012
 When+0.011
 Up+0.011
Neg logit contributions
 queens-0.024
cious-0.022
mare-0.021
 chicks-0.021
 soul-0.021
Token's
Token ID338
Feature activation-3.64e-03

Pos logit contributions
 queens+0.008
cious+0.007
mare+0.007
 chicks+0.006
 soul+0.006
Neg logit contributions
 him-0.010
anked-0.010
 Finally-0.010
 When-0.009
 up-0.009
Token Sue
Token ID26089
Feature activation+1.58e-02

Pos logit contributions
 queens+0.003
cious+0.003
 chicks+0.003
mare+0.003
 soul+0.003
Neg logit contributions
 him-0.002
 Finally-0.002
anked-0.002
 When-0.001
 up-0.001
Token.
Token ID13
Feature activation-8.69e-03

Pos logit contributions
 queens+0.035
cious+0.033
mare+0.032
 chicks+0.031
plets+0.031
Neg logit contributions
 him-0.019
anked-0.016
 Finally-0.016
 up-0.015
 When-0.015
Token Sue
Token ID26089
Feature activation+3.53e-03

Pos logit contributions
 him+0.010
anked+0.009
 Finally+0.009
 When+0.008
 up+0.007
Neg logit contributions
 queens-0.019
cious-0.018
mare-0.017
 chicks-0.017
ppy-0.017
Token lived
Token ID5615
Feature activation+4.91e-03

Pos logit contributions
 queens+0.007
plets+0.006
mare+0.006
cious+0.006
 chicks+0.006
Neg logit contributions
 him-0.006
 up-0.005
 Finally-0.005
anked-0.005
 When-0.005
Token in
Token ID287
Feature activation+1.99e-02

Pos logit contributions
 queens+0.009
cious+0.009
mare+0.008
aky+0.008
ppy+0.008
Neg logit contributions
 him-0.007
anked-0.007
 Finally-0.006
 When-0.006
 Up-0.006
Token a
Token ID257
Feature activation+3.63e-02

Pos logit contributions
 queens+0.046
cious+0.045
ppy+0.043
mare+0.042
opy+0.042
Neg logit contributions
 him-0.021
anked-0.017
 Finally-0.016
 When-0.016
 up-0.015
Token regular
Token ID3218
Feature activation+2.87e-02

Pos logit contributions
 queens+0.066
plets+0.064
cious+0.063
ppy+0.063
lessness+0.062
Neg logit contributions
 him-0.052
 up-0.048
 Finally-0.047
 his-0.045
anked-0.044
Token house
Token ID2156
Feature activation-5.04e-04

Pos logit contributions
 queens+0.050
plets+0.050
cious+0.050
lessness+0.048
ppy+0.047
Neg logit contributions
 him-0.041
 up-0.039
 Finally-0.036
anked-0.036
 his-0.035
Token with
Token ID351
Feature activation+9.15e-04

Pos logit contributions
 him+0.001
anked+0.001
 Finally+0.001
 When+0.000
 Up+0.000
Neg logit contributions
 queens-0.001
cious-0.001
mare-0.001
 chicks-0.001
 soul-0.001
Token her
Token ID607
Feature activation+1.10e-02

Pos logit contributions
 queens+0.002
cious+0.002
mare+0.002
ppy+0.002
opy+0.002
Neg logit contributions
 him-0.001
anked-0.001
 Finally-0.001
 When-0.001
 up-0.001
Token mom
Token ID1995
Feature activation+9.24e-03

Pos logit contributions
 queens+0.019
cious+0.019
ppy+0.018
plets+0.018
mare+0.018
Neg logit contributions
 him-0.017
anked-0.015
 Finally-0.015
 up-0.015
 When-0.014
Token Lily
Token ID20037
Feature activation+1.35e-02

Pos logit contributions
 queens+0.005
cious+0.005
mare+0.005
 chicks+0.005
 soul+0.005
Neg logit contributions
 him-0.003
 Finally-0.003
anked-0.003
 When-0.002
 up-0.002
Token.
Token ID13
Feature activation-9.65e-03

Pos logit contributions
 queens+0.029
cious+0.027
mare+0.026
 chicks+0.026
plets+0.025
Neg logit contributions
 him-0.017
anked-0.015
 Finally-0.014
 When-0.014
 up-0.014
Token She
Token ID1375
Feature activation+1.10e-03

Pos logit contributions
 him+0.012
 Finally+0.011
anked+0.011
 When+0.010
 Up+0.009
Neg logit contributions
 queens-0.020
cious-0.020
 chicks-0.019
mare-0.019
 soul-0.018
Token lived
Token ID5615
Feature activation+5.30e-03

Pos logit contributions
 queens+0.002
cious+0.002
 chicks+0.002
plets+0.002
mare+0.002
Neg logit contributions
 him-0.002
 Finally-0.001
anked-0.001
 When-0.001
 up-0.001
Token in
Token ID287
Feature activation+2.18e-02

Pos logit contributions
 queens+0.010
cious+0.009
mare+0.009
 chicks+0.009
ppy+0.009
Neg logit contributions
 him-0.007
anked-0.007
 Finally-0.007
 When-0.006
 Up-0.006
Token a
Token ID257
Feature activation+3.69e-02

Pos logit contributions
 queens+0.049
cious+0.048
mare+0.046
ppy+0.046
 chicks+0.045
Neg logit contributions
 him-0.024
anked-0.020
 Finally-0.019
 When-0.018
 up-0.017
Token house
Token ID2156
Feature activation+3.66e-03

Pos logit contributions
 queens+0.066
cious+0.065
plets+0.064
ppy+0.064
lessness+0.063
Neg logit contributions
 him-0.053
 up-0.049
 Finally-0.049
 his-0.046
anked-0.045
Token with
Token ID351
Feature activation+3.39e-04

Pos logit contributions
 queens+0.008
cious+0.007
mare+0.007
 chicks+0.007
 soul+0.007
Neg logit contributions
 him-0.004
anked-0.004
 Finally-0.004
 When-0.003
 Up-0.003
Token her
Token ID607
Feature activation+8.58e-03

Pos logit contributions
 queens+0.001
cious+0.001
mare+0.001
 chicks+0.001
ppy+0.001
Neg logit contributions
 him-0.000
anked-0.000
 Finally-0.000
 When-0.000
 up-0.000
Token mom
Token ID1995
Feature activation+9.80e-03

Pos logit contributions
 queens+0.016
cious+0.016
ppy+0.015
plets+0.015
mare+0.015
Neg logit contributions
 him-0.012
anked-0.011
 Finally-0.011
 up-0.011
 Tim-0.010
Tokenmy
Token ID1820
Feature activation+3.92e-03

Pos logit contributions
 queens+0.020
cious+0.020
plets+0.019
mare+0.018
 soul+0.018
Neg logit contributions
 him-0.013
anked-0.011
 up-0.011
 Finally-0.011
 When-0.010
Token a
Token ID257
Feature activation-4.08e-03

Pos logit contributions
 queens+0.035
cious+0.033
mare+0.031
 chicks+0.031
ppy+0.030
Neg logit contributions
 him-0.030
anked-0.028
 Finally-0.027
 When-0.025
 up-0.025
Token check
Token ID2198
Feature activation+2.00e-02

Pos logit contributions
 him+0.007
anked+0.007
 Finally+0.006
 When+0.006
 After+0.006
Neg logit contributions
 queens-0.007
cious-0.006
mare-0.006
 soul-0.006
 chicks-0.006
Tokenup
Token ID929
Feature activation+1.26e-02

Pos logit contributions
 queens+0.042
cious+0.041
 chicks+0.039
 forgot+0.039
 swarm+0.039
Neg logit contributions
 him-0.026
 up-0.023
anked-0.020
 When-0.020
 his-0.019
Token and
Token ID290
Feature activation-9.83e-03

Pos logit contributions
 queens+0.024
cious+0.023
mare+0.021
 chicks+0.021
ppy+0.021
Neg logit contributions
 him-0.018
anked-0.016
 Finally-0.016
 When-0.015
 up-0.015
Token be
Token ID307
Feature activation+2.16e-02

Pos logit contributions
 him+0.014
 Finally+0.012
anked+0.012
 up+0.012
 When+0.011
Neg logit contributions
cious-0.019
 queens-0.019
opy-0.018
ppy-0.018
 chicks-0.017
Token extra
Token ID3131
Feature activation+3.32e-02

Pos logit contributions
 queens+0.038
cious+0.038
plets+0.035
pillar+0.035
opy+0.035
Neg logit contributions
 him-0.034
 up-0.030
anked-0.030
 Finally-0.029
 his-0.028
Token clean
Token ID3424
Feature activation+5.92e-03

Pos logit contributions
 queens+0.047
cious+0.044
mare+0.041
 chicks+0.040
ppy+0.040
Neg logit contributions
 him-0.063
anked-0.061
 Finally-0.058
 When-0.056
 up-0.054
Token!"
Token ID2474
Feature activation-1.02e-02

Pos logit contributions
cious+0.011
 queens+0.011
plets+0.010
 chicks+0.010
mare+0.010
Neg logit contributions
 him-0.009
 up-0.009
anked-0.007
 his-0.007
 Finally-0.007
Token
Token ID198
Feature activation-2.02e-03

Pos logit contributions
anked+0.015
 him+0.014
Years+0.013
 up+0.012
 drib+0.012
Neg logit contributions
cious-0.017
 queens-0.016
mare-0.016
 soul-0.016
pillar-0.016
Token
Token ID198
Feature activation+9.28e-03

Pos logit contributions
anked+0.003
 him+0.003
 Finally+0.003
 up+0.003
 int+0.003
Neg logit contributions
cious-0.003
pillar-0.003
 queens-0.003
 bee-0.003
mare-0.003
TokenThe
Token ID464
Feature activation+6.36e-03

Pos logit contributions
 queens+0.013
cious+0.013
mare+0.011
 chicks+0.011
ppy+0.011
Neg logit contributions
 him-0.018
anked-0.017
 Finally-0.016
 up-0.015
 When-0.015
Token's
Token ID338
Feature activation-1.66e-02

Pos logit contributions
 queens+0.035
cious+0.032
mare+0.029
 chicks+0.028
lessness+0.028
Neg logit contributions
 him-0.059
anked-0.055
 Finally-0.052
 up-0.051
 When-0.051
Token pretend
Token ID16614
Feature activation+1.79e-02

Pos logit contributions
 him+0.023
anked+0.023
 Finally+0.021
 When+0.019
 Up+0.019
Neg logit contributions
 queens-0.031
cious-0.030
mare-0.028
 chicks-0.028
ppy-0.028
Token we
Token ID356
Feature activation+1.96e-02

Pos logit contributions
aky+0.037
cious+0.037
ppy+0.037
 queens+0.037
umpy+0.035
Neg logit contributions
 him-0.024
 up-0.021
 Tim-0.018
 his-0.018
 Up-0.017
Token are
Token ID389
Feature activation+7.70e-03

Pos logit contributions
cious+0.037
 queens+0.035
mare+0.034
 soul+0.032
 moth+0.032
Neg logit contributions
 him-0.033
 up-0.026
 Up-0.025
 When-0.025
 Finally-0.025
Token mer
Token ID4017
Feature activation-1.72e-02

Pos logit contributions
cious+0.015
 queens+0.015
pillar+0.014
mare+0.014
 soul+0.014
Neg logit contributions
 him-0.011
 up-0.009
anked-0.009
 Finally-0.009
 When-0.008
Tokenmaid
Token ID23151
Feature activation+3.36e-02

Pos logit contributions
 him+0.040
anked+0.038
 Finally+0.037
 When+0.036
 Up+0.035
Neg logit contributions
 queens-0.017
cious-0.016
mare-0.014
 chicks-0.014
ppy-0.014
Tokens
Token ID82
Feature activation+2.81e-02

Pos logit contributions
 soul+0.055
lessness+0.054
 queens+0.053
cious+0.053
plets+0.051
Neg logit contributions
 him-0.052
 up-0.048
 Tim-0.039
 together-0.038
 When-0.037
Token and
Token ID290
Feature activation+1.31e-03

Pos logit contributions
 queens+0.047
cious+0.046
 soul+0.042
mare+0.042
ppy+0.042
Neg logit contributions
 him-0.047
 up-0.042
 Finally-0.041
anked-0.040
 When-0.040
Token swim
Token ID9422
Feature activation+2.87e-03

Pos logit contributions
umpy+0.002
opy+0.002
cious+0.002
aky+0.002
plets+0.002
Neg logit contributions
 him-0.002
 his-0.002
 Tim-0.002
 When-0.002
 up-0.002
Token in
Token ID287
Feature activation+2.10e-02

Pos logit contributions
 queens+0.006
cious+0.006
 chicks+0.006
 soul+0.006
mare+0.006
Neg logit contributions
 him-0.004
 up-0.003
 Finally-0.003
 When-0.003
anked-0.003
Token the
Token ID262
Feature activation+2.41e-02

Pos logit contributions
cious+0.047
 queens+0.046
mare+0.044
ppy+0.044
opy+0.043
Neg logit contributions
 him-0.025
 up-0.020
anked-0.020
 Finally-0.019
 his-0.018
Token started
Token ID2067
Feature activation+5.35e-03

Pos logit contributions
 him+0.008
anked+0.007
 Finally+0.007
 When+0.007
 up+0.007
Neg logit contributions
 queens-0.009
cious-0.008
mare-0.008
 chicks-0.008
opy-0.007
Token to
Token ID284
Feature activation-1.18e-02

Pos logit contributions
 queens+0.009
cious+0.009
mare+0.008
 chicks+0.008
 soul+0.008
Neg logit contributions
anked-0.008
 him-0.008
 Finally-0.007
 When-0.007
 Up-0.007
Token look
Token ID804
Feature activation-2.23e-02

Pos logit contributions
 him+0.022
anked+0.021
 Finally+0.021
 When+0.019
 Up+0.019
Neg logit contributions
 queens-0.016
cious-0.016
mare-0.015
 chicks-0.015
 soul-0.014
Token for
Token ID329
Feature activation-1.52e-02

Pos logit contributions
anked+0.026
 him+0.025
 Finally+0.024
 When+0.021
 Up+0.021
Neg logit contributions
 queens-0.046
cious-0.043
ppy-0.041
 soul-0.041
lessness-0.041
Token the
Token ID262
Feature activation+2.29e-03

Pos logit contributions
anked+0.017
 him+0.017
 Finally+0.016
 When+0.015
 Up+0.014
Neg logit contributions
 queens-0.032
cious-0.031
 chicks-0.029
mare-0.029
 soul-0.029
Token source
Token ID2723
Feature activation+3.61e-02

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.003
 chicks+0.003
 soul+0.003
Neg logit contributions
 him-0.004
anked-0.004
 Finally-0.004
 When-0.004
 Up-0.003
Token of
Token ID286
Feature activation-1.89e-02

Pos logit contributions
 queens+0.071
cious+0.069
mare+0.068
 chicks+0.064
ppy+0.064
Neg logit contributions
 him-0.052
anked-0.046
 Finally-0.042
 up-0.041
 When-0.041
Token the
Token ID262
Feature activation-5.15e-03

Pos logit contributions
anked+0.025
 Finally+0.024
 him+0.024
 When+0.022
 Up+0.021
Neg logit contributions
 queens-0.035
cious-0.033
 soul-0.032
 chicks-0.032
mare-0.031
Token salt
Token ID8268
Feature activation+4.64e-03

Pos logit contributions
 him+0.009
anked+0.009
 Finally+0.009
 When+0.008
 up+0.008
Neg logit contributions
 queens-0.008
cious-0.007
mare-0.007
 chicks-0.007
ppy-0.007
Token.
Token ID13
Feature activation-4.18e-03

Pos logit contributions
 queens+0.009
cious+0.009
mare+0.009
 chicks+0.008
 soul+0.008
Neg logit contributions
 him-0.006
anked-0.006
 Finally-0.005
 up-0.005
 When-0.005
Token
Token ID198
Feature activation+3.64e-03

Pos logit contributions
anked+0.006
 him+0.005
 up+0.005
asted+0.005
atches+0.005
Neg logit contributions
cious-0.007
 queens-0.007
mare-0.007
pillar-0.007
 soul-0.007
Token shop
Token ID6128
Feature activation+8.79e-03

Pos logit contributions
anked+0.003
 him+0.003
 Finally+0.003
 When+0.003
 Slowly+0.003
Neg logit contributions
 queens-0.004
mare-0.004
cious-0.004
opy-0.003
pillar-0.003
Token,
Token ID11
Feature activation-6.24e-03

Pos logit contributions
 queens+0.017
cious+0.017
mare+0.016
 chicks+0.015
ppy+0.015
Neg logit contributions
 him-0.012
anked-0.011
 Finally-0.011
 When-0.010
 Up-0.009
Token they
Token ID484
Feature activation+9.29e-03

Pos logit contributions
anked+0.009
 him+0.009
 Finally+0.008
 When+0.007
 int+0.007
Neg logit contributions
 queens-0.011
cious-0.011
 chicks-0.010
mare-0.010
 forgot-0.010
Token saw
Token ID2497
Feature activation-1.08e-02

Pos logit contributions
 queens+0.014
cious+0.013
mare+0.012
 chicks+0.012
 soul+0.012
Neg logit contributions
 him-0.018
 Finally-0.016
anked-0.016
 up-0.015
 When-0.015
Token that
Token ID326
Feature activation+1.85e-02

Pos logit contributions
anked+0.014
 him+0.014
 Finally+0.014
 When+0.013
 Up+0.013
Neg logit contributions
 queens-0.020
cious-0.019
 chicks-0.018
opy-0.017
mare-0.017
Token it
Token ID340
Feature activation+6.40e-03

Pos logit contributions
cious+0.041
 queens+0.041
ppy+0.040
mare+0.039
aky+0.038
Neg logit contributions
 him-0.022
 up-0.018
anked-0.017
 Finally-0.017
 his-0.017
Token was
Token ID373
Feature activation-1.09e-02

Pos logit contributions
 queens+0.013
cious+0.012
mare+0.012
 soul+0.011
 chicks+0.011
Neg logit contributions
 him-0.009
anked-0.008
 up-0.008
 Finally-0.008
 When-0.007
Token closed
Token ID4838
Feature activation+1.88e-02

Pos logit contributions
 him+0.019
anked+0.019
 Finally+0.018
 When+0.017
 Up+0.016
Neg logit contributions
 queens-0.017
cious-0.016
mare-0.015
 chicks-0.015
ppy-0.014
Token.
Token ID13
Feature activation-5.73e-03

Pos logit contributions
 queens+0.039
cious+0.037
mare+0.035
 chicks+0.035
ppy+0.034
Neg logit contributions
 him-0.025
anked-0.023
 Finally-0.021
 up-0.020
 When-0.020
Token "
Token ID366
Feature activation+6.38e-03

Pos logit contributions
 him+0.006
anked+0.006
 Finally+0.005
 up+0.004
 int+0.004
Neg logit contributions
cious-0.012
 queens-0.012
 chicks-0.012
mare-0.012
opy-0.011
TokenOh
Token ID5812
Feature activation+3.03e-03

Pos logit contributions
 queens+0.010
cious+0.010
mare+0.009
 chicks+0.009
ppy+0.009
Neg logit contributions
 him-0.011
anked-0.010
 Finally-0.010
 When-0.009
 Up-0.009
Token "
Token ID366
Feature activation-5.68e-03

Pos logit contributions
 him+0.021
anked+0.020
 Finally+0.019
 When+0.018
 up+0.017
Neg logit contributions
 queens-0.030
cious-0.028
mare-0.027
 chicks-0.027
ppy-0.026
TokenThat
Token ID2504
Feature activation+4.68e-03

Pos logit contributions
 him+0.009
anked+0.008
 Finally+0.008
 When+0.008
 Up+0.008
Neg logit contributions
 queens-0.010
cious-0.009
mare-0.009
 chicks-0.009
 soul-0.009
Token is
Token ID318
Feature activation-7.78e-03

Pos logit contributions
 queens+0.008
lessness+0.007
cious+0.006
ppy+0.006
aky+0.006
Neg logit contributions
 him-0.009
 up-0.008
anked-0.008
 When-0.008
 Finally-0.008
Token an
Token ID281
Feature activation-6.50e-03

Pos logit contributions
 him+0.011
anked+0.010
 Finally+0.010
 When+0.009
 Up+0.008
Neg logit contributions
 queens-0.015
cious-0.014
mare-0.014
 chicks-0.014
ppy-0.013
Token elephant
Token ID20950
Feature activation+2.74e-03

Pos logit contributions
 him+0.013
anked+0.013
 Finally+0.013
 When+0.012
 Up+0.011
Neg logit contributions
cious-0.008
 queens-0.008
 chicks-0.007
mare-0.007
 soul-0.007
Token.
Token ID13
Feature activation-6.60e-04

Pos logit contributions
 queens+0.006
cious+0.006
mare+0.006
plets+0.005
 chicks+0.005
Neg logit contributions
 him-0.003
anked-0.002
 When-0.002
 up-0.002
 Finally-0.002
Token Ele
Token ID15987
Feature activation+6.38e-03

Pos logit contributions
 him+0.001
anked+0.001
 Finally+0.001
 When+0.001
 up+0.001
Neg logit contributions
 queens-0.001
cious-0.001
mare-0.001
 chicks-0.001
ppy-0.001
Tokenph
Token ID746
Feature activation+1.52e-02

Pos logit contributions
 queens+0.009
cious+0.009
ppy+0.008
 chicks+0.008
mare+0.008
Neg logit contributions
anked-0.012
 him-0.012
 Finally-0.012
 When-0.011
 After-0.011
Tokenants
Token ID1187
Feature activation-1.46e-03

Pos logit contributions
 queens+0.029
cious+0.027
mare+0.026
 chicks+0.025
 soul+0.025
Neg logit contributions
 him-0.023
anked-0.021
 Finally-0.020
 When-0.019
 up-0.018
Token live
Token ID2107
Feature activation+1.84e-02

Pos logit contributions
 him+0.003
anked+0.002
 Finally+0.002
 up+0.002
 When+0.002
Neg logit contributions
 queens-0.002
cious-0.002
mare-0.002
ppy-0.002
 chicks-0.002
Token in
Token ID287
Feature activation+2.03e-02

Pos logit contributions
 queens+0.039
cious+0.038
mare+0.036
plets+0.036
opy+0.036
Neg logit contributions
 him-0.022
 up-0.018
anked-0.018
 Finally-0.018
 When-0.017
Token Jack
Token ID3619
Feature activation-6.84e-03

Pos logit contributions
anked+0.002
asted+0.002
oted+0.002
 him+0.002
aled+0.002
Neg logit contributions
mare-0.001
 queens-0.001
cious-0.001
 forgot-0.001
 soul-0.001
Token's
Token ID338
Feature activation-5.77e-03

Pos logit contributions
 him+0.011
anked+0.010
 Finally+0.010
 up+0.009
 When+0.009
Neg logit contributions
 queens-0.013
cious-0.011
mare-0.011
plets-0.010
 chicks-0.010
Token mum
Token ID25682
Feature activation-5.84e-03

Pos logit contributions
 him+0.010
anked+0.009
 Finally+0.009
 When+0.009
 Up+0.008
Neg logit contributions
 queens-0.009
cious-0.009
 chicks-0.008
mare-0.008
 soul-0.008
Token told
Token ID1297
Feature activation+9.21e-03

Pos logit contributions
 him+0.010
anked+0.009
 Finally+0.009
 When+0.008
 up+0.008
Neg logit contributions
 queens-0.010
cious-0.010
mare-0.009
 chicks-0.009
 soul-0.009
Token him
Token ID683
Feature activation-1.73e-02

Pos logit contributions
 queens+0.023
ppy+0.022
 bee+0.022
cious+0.022
 soul+0.022
Neg logit contributions
 Finally-0.005
anked-0.005
atches-0.004
 After-0.003
rew-0.003
Token,
Token ID11
Feature activation+7.39e-03

Pos logit contributions
 him+0.022
anked+0.019
 Finally+0.018
 When+0.017
 up+0.017
Neg logit contributions
 queens-0.037
cious-0.035
mare-0.034
 chicks-0.033
plets-0.033
Token "
Token ID366
Feature activation-3.37e-03

Pos logit contributions
 queens+0.013
cious+0.013
mare+0.012
 chicks+0.012
ppy+0.012
Neg logit contributions
 him-0.011
anked-0.011
 Finally-0.010
 When-0.009
 up-0.009
TokenJack
Token ID14295
Feature activation-1.86e-02

Pos logit contributions
 him+0.006
anked+0.005
 Finally+0.005
 When+0.005
 up+0.005
Neg logit contributions
 queens-0.006
cious-0.005
mare-0.005
 chicks-0.005
ppy-0.005
Token,
Token ID11
Feature activation-7.24e-06

Pos logit contributions
 him+0.020
anked+0.016
 up+0.016
 Finally+0.015
 When+0.014
Neg logit contributions
 queens-0.046
cious-0.042
mare-0.040
plets-0.040
 chicks-0.039
Token the
Token ID262
Feature activation+3.51e-03

Feature not active on prev token; no predictions were affected.
Token freezer
Token ID30967
Feature activation+1.23e-03

Pos logit contributions
 queens+0.007
cious+0.007
mare+0.006
 chicks+0.006
 soul+0.006
Neg logit contributions
 him-0.005
anked-0.005
 Finally-0.004
 When-0.004
 Up-0.004
Token.
Token ID13
Feature activation+1.39e-03

Pos logit contributions
 him+0.008
anked+0.008
 Finally+0.007
 When+0.007
 up+0.007
Neg logit contributions
 queens-0.013
cious-0.013
mare-0.012
 chicks-0.012
ppy-0.012
Token
Token ID220
Feature activation-3.08e-03

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.003
 soul+0.002
lessness+0.002
Neg logit contributions
 him-0.002
anked-0.002
 up-0.001
 Finally-0.001
 drib-0.001
Token
Token ID198
Feature activation+2.05e-03

Pos logit contributions
 him+0.004
anked+0.004
 int+0.004
 drib+0.004
Years+0.003
Neg logit contributions
cious-0.005
 queens-0.005
mare-0.005
opy-0.005
ppy-0.005
Token
Token ID198
Feature activation+1.12e-02

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.003
lessness+0.003
pillar+0.003
Neg logit contributions
anked-0.003
 him-0.003
 Finally-0.003
 up-0.003
rew-0.002
TokenSarah
Token ID29284
Feature activation-6.04e-03

Pos logit contributions
 queens+0.019
cious+0.018
mare+0.016
pillar+0.016
 forgot+0.016
Neg logit contributions
 him-0.018
anked-0.017
 Finally-0.016
 up-0.015
atches-0.014
Token was
Token ID373
Feature activation+2.92e-03

Pos logit contributions
 him+0.009
anked+0.008
 Finally+0.008
 up+0.008
 When+0.008
Neg logit contributions
 queens-0.011
cious-0.010
mare-0.010
 chicks-0.010
lessness-0.009
Token so
Token ID523
Feature activation+1.43e-02

Pos logit contributions
 queens+0.006
cious+0.006
 chicks+0.005
ppy+0.005
pillar+0.005
Neg logit contributions
 him-0.004
 up-0.004
 Finally-0.003
anked-0.003
 When-0.003
Token happy
Token ID3772
Feature activation-7.86e-03

Pos logit contributions
 queens+0.025
cious+0.024
 chicks+0.023
mare+0.023
ppy+0.022
Neg logit contributions
 him-0.023
anked-0.022
 Finally-0.021
 up-0.020
 When-0.020
Token to
Token ID284
Feature activation-3.92e-03

Pos logit contributions
 him+0.011
anked+0.010
 Finally+0.010
 When+0.009
 Up+0.009
Neg logit contributions
 queens-0.015
cious-0.014
mare-0.014
 chicks-0.013
 soul-0.013
Token have
Token ID423
Feature activation+8.04e-03

Pos logit contributions
 him+0.007
anked+0.007
 Finally+0.007
 When+0.006
 up+0.006
Neg logit contributions
 queens-0.006
cious-0.006
 chicks-0.005
mare-0.005
ppy-0.005
Token had
Token ID550
Feature activation-3.44e-03

Pos logit contributions
 queens+0.017
 outage+0.017
cious+0.017
plets+0.017
 chicks+0.017
Neg logit contributions
 him-0.010
 his-0.008
anked-0.007
 Finally-0.007
 up-0.007
Token up
Token ID510
Feature activation-3.84e-03

Pos logit contributions
anked+0.000
 him+0.000
 Finally+0.000
asted+0.000
 When+0.000
Neg logit contributions
 queens-0.000
cious-0.000
lessness-0.000
opy-0.000
plets-0.000
Token and
Token ID290
Feature activation+4.30e-03

Pos logit contributions
anked+0.006
 him+0.006
 Finally+0.006
 When+0.006
 Timothy+0.005
Neg logit contributions
 queens-0.006
cious-0.005
 cloud-0.005
opy-0.005
 chicks-0.005
Token down
Token ID866
Feature activation-6.18e-03

Pos logit contributions
 queens+0.010
cious+0.010
mare+0.009
ppy+0.009
 chicks+0.009
Neg logit contributions
 him-0.005
anked-0.004
 Finally-0.004
 up-0.004
 When-0.003
Token and
Token ID290
Feature activation-3.54e-04

Pos logit contributions
 him+0.009
anked+0.009
 Finally+0.008
 When+0.008
 Up+0.008
Neg logit contributions
 queens-0.011
cious-0.010
mare-0.009
 soul-0.009
opy-0.009
Token around
Token ID1088
Feature activation-1.03e-02

Pos logit contributions
 him+0.000
anked+0.000
 Finally+0.000
 up+0.000
 When+0.000
Neg logit contributions
 queens-0.001
cious-0.001
mare-0.001
 chicks-0.001
ppy-0.001
Token.
Token ID13
Feature activation+3.83e-03

Pos logit contributions
 him+0.014
anked+0.013
 Finally+0.013
 When+0.012
 Up+0.011
Neg logit contributions
 queens-0.020
cious-0.019
mare-0.018
 chicks-0.018
ppy-0.018
Token They
Token ID1119
Feature activation+8.44e-03

Pos logit contributions
 queens+0.007
cious+0.007
mare+0.007
 chicks+0.007
 soul+0.007
Neg logit contributions
 him-0.005
anked-0.005
 up-0.004
 Finally-0.004
 drib-0.004
Token hear
Token ID3285
Feature activation+8.67e-03

Pos logit contributions
 queens+0.014
cious+0.014
mare+0.013
 chicks+0.013
opy+0.012
Neg logit contributions
 him-0.015
anked-0.013
 Finally-0.013
 up-0.012
 When-0.012
Token the
Token ID262
Feature activation-1.69e-02

Pos logit contributions
 queens+0.018
cious+0.017
mare+0.016
 chicks+0.016
ppy+0.016
Neg logit contributions
 him-0.011
anked-0.011
 Finally-0.010
 When-0.009
 up-0.009
Token sounds
Token ID5238
Feature activation-1.41e-02

Pos logit contributions
 him+0.030
anked+0.028
 Finally+0.027
 When+0.026
 up+0.025
Neg logit contributions
 queens-0.026
cious-0.025
mare-0.023
 chicks-0.023
ppy-0.023
Token and
Token ID290
Feature activation-3.17e-03

Pos logit contributions
 Finally+0.023
anked+0.022
 him+0.021
 When+0.019
 After+0.019
Neg logit contributions
 queens-0.024
cious-0.022
lessness-0.020
 bee-0.020
 soul-0.020
Token but
Token ID475
Feature activation-8.05e-03

Pos logit contributions
 queens+0.014
cious+0.013
mare+0.012
 chicks+0.012
 soul+0.012
Neg logit contributions
 him-0.016
anked-0.015
 Finally-0.015
 When-0.014
 Up-0.014
Token they
Token ID484
Feature activation+6.93e-03

Pos logit contributions
 him+0.011
anked+0.009
 Finally+0.009
 up+0.009
 When+0.008
Neg logit contributions
 queens-0.016
cious-0.016
mare-0.015
opy-0.015
ppy-0.015
Token could
Token ID714
Feature activation+1.92e-03

Pos logit contributions
 queens+0.012
cious+0.011
mare+0.011
 chicks+0.011
plets+0.010
Neg logit contributions
 him-0.012
 Finally-0.010
anked-0.010
 When-0.010
 up-0.010
Token not
Token ID407
Feature activation-1.47e-03

Pos logit contributions
 queens+0.003
cious+0.003
mare+0.003
 forgot+0.003
 soul+0.003
Neg logit contributions
anked-0.003
 him-0.003
 Finally-0.003
 When-0.003
 Up-0.003
Token find
Token ID1064
Feature activation-2.24e-02

Pos logit contributions
 him+0.002
anked+0.002
 Finally+0.002
 When+0.002
 up+0.002
Neg logit contributions
 queens-0.003
cious-0.003
mare-0.002
 chicks-0.002
opy-0.002
Token them
Token ID606
Feature activation-9.59e-03

Pos logit contributions
anked+0.019
 Finally+0.017
 him+0.016
 When+0.015
 Up+0.015
Neg logit contributions
 queens-0.054
cious-0.052
 chicks-0.050
mare-0.049
ppy-0.048
Token.
Token ID13
Feature activation+4.16e-03

Pos logit contributions
anked+0.014
 him+0.014
 Finally+0.013
 When+0.012
 Up+0.012
Neg logit contributions
 queens-0.017
cious-0.017
 chicks-0.015
ppy-0.015
mare-0.015
Token They
Token ID1119
Feature activation+1.30e-02

Pos logit contributions
 queens+0.008
cious+0.008
mare+0.007
 chicks+0.007
 soul+0.007
Neg logit contributions
 him-0.006
anked-0.006
 up-0.005
 drib-0.005
 Finally-0.005
Token asked
Token ID1965
Feature activation+7.53e-03

Pos logit contributions
 queens+0.019
cious+0.018
mare+0.017
 chicks+0.017
ppy+0.017
Neg logit contributions
 him-0.023
anked-0.023
 Finally-0.022
 When-0.021
 up-0.020
Token the
Token ID262
Feature activation+1.29e-03

Pos logit contributions
 queens+0.017
cious+0.016
 chicks+0.016
ppy+0.015
mare+0.015
Neg logit contributions
anked-0.008
 him-0.007
 Finally-0.007
 When-0.006
 Up-0.006
Token park
Token ID3952
Feature activation+5.53e-06

Pos logit contributions
 queens+0.002
cious+0.002
mare+0.002
 chicks+0.002
ppy+0.002
Neg logit contributions
 him-0.002
anked-0.002
 Finally-0.002
 When-0.002
 up-0.002
Token I
Token ID314
Feature activation-2.97e-03

Pos logit contributions
 him+0.013
 Finally+0.013
 When+0.013
 After+0.012
 Up+0.012
Neg logit contributions
 queens-0.011
cious-0.010
aky-0.010
 chicks-0.009
opy-0.009
Token am
Token ID716
Feature activation+8.86e-03

Pos logit contributions
 him+0.005
 Finally+0.005
anked+0.004
 When+0.004
 After+0.004
Neg logit contributions
 queens-0.005
cious-0.005
 chicks-0.004
plets-0.004
mare-0.004
Token proud
Token ID6613
Feature activation+7.49e-03

Pos logit contributions
 queens+0.016
cious+0.016
pillar+0.015
mare+0.015
ppy+0.015
Neg logit contributions
 him-0.013
anked-0.012
 Finally-0.012
 up-0.012
 When-0.011
Token of
Token ID286
Feature activation-9.58e-03

Pos logit contributions
cious+0.013
 queens+0.013
mare+0.012
ppy+0.012
 chicks+0.012
Neg logit contributions
 him-0.012
anked-0.011
 Finally-0.010
 When-0.010
 up-0.010
Token you
Token ID345
Feature activation+2.81e-03

Pos logit contributions
anked+0.018
 Finally+0.018
 him+0.018
 When+0.017
 Up+0.017
Neg logit contributions
 queens-0.013
cious-0.012
 chicks-0.011
mare-0.010
ppy-0.010
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
 queens+0.006
cious+0.006
mare+0.006
plets+0.006
 chicks+0.006
Neg logit contributions
 him-0.003
 Finally-0.003
anked-0.003
 up-0.002
 When-0.002
Token And
Token ID843
Feature activation-1.15e-03

Pos logit contributions
 him+0.019
 Finally+0.018
 When+0.017
anked+0.017
 Up+0.016
Neg logit contributions
 queens-0.022
cious-0.021
 chicks-0.019
opy-0.019
aky-0.019
Token you
Token ID345
Feature activation-2.47e-03

Pos logit contributions
 him+0.002
 Finally+0.002
anked+0.002
 When+0.002
 up+0.002
Neg logit contributions
 queens-0.002
cious-0.002
opy-0.002
plets-0.001
mare-0.001
Token,
Token ID11
Feature activation-1.26e-02

Pos logit contributions
 him+0.003
 Finally+0.003
 Tim+0.003
 When+0.003
 his+0.002
Neg logit contributions
cious-0.005
 queens-0.005
mare-0.005
plets-0.005
 chicks-0.005
Token Jen
Token ID13364
Feature activation-1.44e-04

Pos logit contributions
 him+0.018
anked+0.017
 Finally+0.016
 When+0.015
 up+0.015
Neg logit contributions
 queens-0.024
cious-0.023
mare-0.022
 chicks-0.021
ppy-0.021
Token,
Token ID11
Feature activation-1.39e-02

Pos logit contributions
 him+0.000
 up+0.000
anked+0.000
 Finally+0.000
 When+0.000
Neg logit contributions
 queens-0.000
cious-0.000
mare-0.000
 chicks-0.000
plets-0.000
Token big
Token ID1263
Feature activation-2.15e-02

Pos logit contributions
 queens+0.001
cious+0.001
mare+0.001
ppy+0.001
 chicks+0.001
Neg logit contributions
 him-0.001
anked-0.001
 Finally-0.001
 When-0.001
 up-0.001
Token sun
Token ID4252
Feature activation-1.84e-02

Pos logit contributions
 him+0.037
anked+0.034
 Finally+0.033
 up+0.032
 When+0.031
Neg logit contributions
 queens-0.034
cious-0.033
ppy-0.031
 chicks-0.030
mare-0.030
Token with
Token ID351
Feature activation-2.29e-03

Pos logit contributions
 him+0.030
anked+0.030
 Finally+0.029
 When+0.027
 Slowly+0.026
Neg logit contributions
 queens-0.029
cious-0.028
mare-0.026
 chicks-0.026
ppy-0.026
Token yellow
Token ID7872
Feature activation-1.72e-02

Pos logit contributions
 him+0.003
anked+0.003
 Finally+0.003
 When+0.003
 Up+0.003
Neg logit contributions
 queens-0.004
cious-0.004
 chicks-0.003
 soul-0.003
mare-0.003
Token and
Token ID290
Feature activation+3.00e-03

Pos logit contributions
 him+0.029
anked+0.028
 Finally+0.027
 When+0.025
 up+0.024
Neg logit contributions
 queens-0.027
cious-0.026
mare-0.024
 chicks-0.024
 soul-0.024
Token orange
Token ID10912
Feature activation-3.03e-02

Pos logit contributions
mare+0.006
 queens+0.006
cious+0.006
ppy+0.006
lessness+0.006
Neg logit contributions
 him-0.004
 Finally-0.003
anked-0.003
 When-0.003
 up-0.003
Token.
Token ID13
Feature activation+3.35e-03

Pos logit contributions
 him+0.051
anked+0.049
 Finally+0.047
 When+0.044
 Up+0.043
Neg logit contributions
 queens-0.049
cious-0.047
 chicks-0.043
mare-0.043
ppy-0.042
Token Ben
Token ID3932
Feature activation-1.19e-03

Pos logit contributions
 queens+0.005
cious+0.005
mare+0.005
opy+0.005
pillar+0.005
Neg logit contributions
anked-0.005
 him-0.005
ered-0.004
Years-0.004
 Timothy-0.004
Token drew
Token ID9859
Feature activation+1.06e-02

Pos logit contributions
 him+0.002
anked+0.002
 Finally+0.002
 When+0.002
 up+0.002
Neg logit contributions
 queens-0.002
cious-0.002
mare-0.002
 chicks-0.001
lessness-0.001
Token a
Token ID257
Feature activation+1.10e-03

Pos logit contributions
 queens+0.021
cious+0.020
mare+0.018
 chicks+0.018
ppy+0.018
Neg logit contributions
 him-0.014
anked-0.014
 Finally-0.013
 When-0.012
 Up-0.012
Token green
Token ID4077
Feature activation-1.55e-02

Pos logit contributions
 queens+0.002
cious+0.002
mare+0.001
ppy+0.001
 chicks+0.001
Neg logit contributions
 him-0.002
anked-0.002
 Finally-0.002
 When-0.002
 up-0.002
Token\n
Token ID198
Feature activation-4.11e-03

Pos logit contributions
anked+0.019
 him+0.018
 drib+0.017
 patiently+0.017
 int+0.017
Neg logit contributions
 queens-0.021
cious-0.021
pillar-0.020
 soul-0.020
umpy-0.019
Token\n
Token ID198
Feature activation+3.50e-03

Pos logit contributions
anked+0.007
 him+0.006
 drib+0.006
 int+0.006
 patiently+0.006
Neg logit contributions
cious-0.005
pillar-0.005
umpy-0.005
 queens-0.005
aky-0.005
TokenThe
Token ID464
Feature activation-6.74e-03

Pos logit contributions
 queens+0.005
cious+0.005
pillar+0.005
umpy+0.005
aky+0.005
Neg logit contributions
 him-0.006
anked-0.006
 Finally-0.005
 up-0.005
 int-0.005
Token moral
Token ID6573
Feature activation+1.75e-03

Pos logit contributions
 him+0.012
anked+0.011
 Finally+0.011
 When+0.010
 After+0.010
Neg logit contributions
 queens-0.011
cious-0.010
 chicks-0.009
 soul-0.009
mare-0.009
Token of
Token ID286
Feature activation-2.05e-02

Pos logit contributions
 queens+0.003
cious+0.003
ppy+0.003
mare+0.003
 chicks+0.003
Neg logit contributions
 him-0.003
anked-0.002
 Finally-0.002
 When-0.002
 Up-0.002
Token the
Token ID262
Feature activation-1.94e-02

Pos logit contributions
anked+0.030
 Finally+0.029
 him+0.028
 When+0.026
 Slowly+0.026
Neg logit contributions
 queens-0.035
cious-0.034
mare-0.033
 chicks-0.032
 soul-0.031
Token story
Token ID1621
Feature activation-2.13e-02

Pos logit contributions
 him+0.037
anked+0.036
 Finally+0.034
 When+0.032
 up+0.031
Neg logit contributions
 queens-0.028
cious-0.026
mare-0.024
 chicks-0.024
 soul-0.023
Token is
Token ID318
Feature activation-2.04e-02

Pos logit contributions
 him+0.035
anked+0.034
 Finally+0.032
 When+0.031
 up+0.030
Neg logit contributions
 queens-0.035
cious-0.033
mare-0.031
 chicks-0.030
ppy-0.030
Token to
Token ID284
Feature activation-1.51e-02

Pos logit contributions
 him+0.023
anked+0.022
 Finally+0.020
 When+0.019
 up+0.018
Neg logit contributions
 queens-0.044
cious-0.043
mare-0.041
 chicks-0.040
ppy-0.040
Token never
Token ID1239
Feature activation-1.45e-02

Pos logit contributions
 him+0.024
anked+0.023
 Finally+0.022
 When+0.021
 Up+0.020
Neg logit contributions
 queens-0.026
cious-0.025
mare-0.023
ppy-0.023
 soul-0.023
Token give
Token ID1577
Feature activation-2.25e-02

Pos logit contributions
 him+0.023
anked+0.022
 Finally+0.020
 When+0.020
 up+0.019
Neg logit contributions
 queens-0.025
cious-0.024
mare-0.023
 chicks-0.023
opy-0.022
Token for
Token ID329
Feature activation+3.53e-03

Pos logit contributions
anked+0.012
 him+0.011
 Finally+0.011
 When+0.010
 Up+0.010
Neg logit contributions
 queens-0.016
cious-0.015
mare-0.015
 chicks-0.014
ppy-0.014
Token Mom
Token ID11254
Feature activation+6.88e-03

Pos logit contributions
 queens+0.007
cious+0.007
 chicks+0.006
mare+0.006
 soul+0.006
Neg logit contributions
anked-0.004
 Finally-0.004
 him-0.004
 Up-0.004
 When-0.004
Token.
Token ID13
Feature activation+1.82e-02

Pos logit contributions
 queens+0.013
cious+0.012
mare+0.011
 chicks+0.011
ppy+0.011
Neg logit contributions
 him-0.010
anked-0.010
 Finally-0.009
 When-0.009
 up-0.008
Token They
Token ID1119
Feature activation+1.77e-02

Pos logit contributions
cious+0.030
 queens+0.029
mare+0.029
 forgot+0.028
 soul+0.027
Neg logit contributions
 him-0.026
anked-0.026
Years-0.022
aled-0.022
 Timothy-0.021
Token had
Token ID550
Feature activation+6.73e-03

Pos logit contributions
 queens+0.025
cious+0.024
mare+0.022
ppy+0.022
 chicks+0.022
Neg logit contributions
 him-0.033
anked-0.032
 Finally-0.030
 When-0.029
 Up-0.029
Token flour
Token ID10601
Feature activation-5.56e-03

Pos logit contributions
 queens+0.014
cious+0.014
mare+0.013
ppy+0.013
opy+0.013
Neg logit contributions
 him-0.008
anked-0.007
 Finally-0.007
 When-0.006
 up-0.006
Token,
Token ID11
Feature activation+1.52e-02

Pos logit contributions
 him+0.007
anked+0.007
 Finally+0.006
 up+0.006
 When+0.006
Neg logit contributions
 queens-0.011
cious-0.011
mare-0.010
 chicks-0.010
 soul-0.010
Token eggs
Token ID9653
Feature activation-6.71e-05

Pos logit contributions
cious+0.031
mare+0.031
 queens+0.030
pillar+0.029
 soul+0.029
Neg logit contributions
 him-0.021
anked-0.017
 Finally-0.016
 his-0.016
 up-0.016
Token,
Token ID11
Feature activation+1.00e-02

Pos logit contributions
 him+0.000
anked+0.000
 Finally+0.000
 up+0.000
 When+0.000
Neg logit contributions
 queens-0.000
cious-0.000
mare-0.000
 soul-0.000
 chicks-0.000
Token milk
Token ID7545
Feature activation-1.57e-03

Pos logit contributions
cious+0.019
mare+0.018
 queens+0.018
pillar+0.018
 soul+0.017
Neg logit contributions
 him-0.015
anked-0.013
 Finally-0.012
 his-0.012
 up-0.012
Token and
Token ID290
Feature activation+6.15e-05

Pos logit contributions
 him+0.002
anked+0.002
 Finally+0.002
 When+0.002
 Up+0.002
Neg logit contributions
 queens-0.003
cious-0.003
mare-0.003
 chicks-0.003
ppy-0.003
Token his
Token ID465
Feature activation+1.48e-02

Pos logit contributions
anked+0.006
 Finally+0.006
 Timothy+0.005
 Up+0.005
 When+0.005
Neg logit contributions
 queens-0.017
 soul-0.017
ppy-0.016
aky-0.016
mare-0.016
Token wings
Token ID12098
Feature activation+4.68e-03

Pos logit contributions
 queens+0.018
cious+0.016
mare+0.015
ppy+0.014
 chicks+0.014
Neg logit contributions
 him-0.032
anked-0.031
 Finally-0.030
 When-0.028
 up-0.028
Token and
Token ID290
Feature activation-4.19e-03

Pos logit contributions
 queens+0.007
cious+0.007
ppy+0.007
opy+0.007
lessness+0.007
Neg logit contributions
 him-0.007
anked-0.007
 Finally-0.007
 When-0.006
atches-0.006
Token soared
Token ID35292
Feature activation+6.05e-03

Pos logit contributions
 him+0.007
anked+0.006
 Finally+0.006
 When+0.006
 up+0.006
Neg logit contributions
 queens-0.007
cious-0.007
mare-0.006
 chicks-0.006
ppy-0.006
Token so
Token ID523
Feature activation+1.30e-02

Pos logit contributions
 queens+0.011
cious+0.010
ppy+0.010
mare+0.010
aky+0.010
Neg logit contributions
anked-0.008
 him-0.008
 Finally-0.008
atches-0.007
 When-0.007
Token high
Token ID1029
Feature activation-4.24e-02

Pos logit contributions
 queens+0.026
cious+0.025
mare+0.024
 chicks+0.024
ppy+0.023
Neg logit contributions
 him-0.017
anked-0.016
 Finally-0.015
 When-0.014
 up-0.014
Token.
Token ID13
Feature activation+4.83e-03

Pos logit contributions
anked+0.060
 him+0.056
 Finally+0.056
 When+0.050
asted+0.050
Neg logit contributions
 queens-0.075
cious-0.072
opy-0.068
plets-0.066
ppy-0.066
Token But
Token ID887
Feature activation-6.84e-03

Pos logit contributions
 queens+0.008
mare+0.007
cious+0.007
 soul+0.007
plets+0.007
Neg logit contributions
anked-0.007
 him-0.007
asted-0.006
Years-0.006
ered-0.006
Token try
Token ID1949
Feature activation+6.18e-03

Pos logit contributions
 him+0.009
anked+0.008
 Finally+0.008
 When+0.007
 up+0.007
Neg logit contributions
 queens-0.014
cious-0.013
mare-0.013
 chicks-0.013
ppy-0.012
Token as
Token ID355
Feature activation-4.38e-03

Pos logit contributions
 queens+0.013
cious+0.012
mare+0.012
 chicks+0.011
ppy+0.011
Neg logit contributions
 him-0.008
anked-0.007
 Finally-0.007
 up-0.007
 When-0.007
Token he
Token ID339
Feature activation-3.05e-02

Pos logit contributions
 him+0.007
anked+0.007
 Finally+0.006
 When+0.006
 up+0.006
Neg logit contributions
 queens-0.008
cious-0.007
mare-0.007
 chicks-0.007
ppy-0.007
Token to
Token ID284
Feature activation-9.46e-03

Pos logit contributions
 queens+0.016
cious+0.015
 chicks+0.013
mare+0.013
 soul+0.013
Neg logit contributions
anked-0.016
 Finally-0.015
 him-0.014
 When-0.013
 Up-0.013
Token explain
Token ID4727
Feature activation-9.71e-03

Pos logit contributions
 him+0.014
anked+0.013
 Finally+0.013
 When+0.012
 Up+0.011
Neg logit contributions
 queens-0.017
cious-0.017
mare-0.016
 chicks-0.016
ppy-0.015
Token what
Token ID644
Feature activation+7.62e-03

Pos logit contributions
 him+0.012
anked+0.011
 Finally+0.011
 When+0.010
 up+0.009
Neg logit contributions
 queens-0.020
cious-0.019
mare-0.018
 chicks-0.018
ppy-0.018
Token the
Token ID262
Feature activation-5.87e-04

Pos logit contributions
 queens+0.017
ppy+0.016
plets+0.016
cious+0.016
mare+0.016
Neg logit contributions
 him-0.010
 his-0.008
 up-0.008
anked-0.007
 Tim-0.007
Token pistol
Token ID16790
Feature activation-3.40e-03

Pos logit contributions
 him+0.001
anked+0.001
 Finally+0.001
 When+0.001
 Slowly+0.001
Neg logit contributions
 queens-0.001
cious-0.001
mare-0.001
 chicks-0.001
 soul-0.001
Token was
Token ID373
Feature activation-4.59e-02

Pos logit contributions
 him+0.006
anked+0.005
 Finally+0.005
 When+0.005
 up+0.005
Neg logit contributions
 queens-0.006
cious-0.005
mare-0.005
 chicks-0.005
ppy-0.005
Token.
Token ID13
Feature activation+4.00e-03

Pos logit contributions
 him+0.062
anked+0.059
 Finally+0.055
 When+0.052
 up+0.050
Neg logit contributions
 queens-0.090
cious-0.086
mare-0.082
 chicks-0.081
ppy-0.080
Token She
Token ID1375
Feature activation-3.22e-03

Pos logit contributions
 queens+0.007
cious+0.007
mare+0.007
 chicks+0.007
ppy+0.007
Neg logit contributions
 him-0.006
anked-0.005
 Finally-0.005
 When-0.005
 up-0.005
Token said
Token ID531
Feature activation+1.99e-02

Pos logit contributions
 him+0.005
anked+0.005
 Finally+0.005
 When+0.004
 Up+0.004
Neg logit contributions
 queens-0.005
cious-0.005
mare-0.005
 chicks-0.005
ppy-0.005
Token it
Token ID340
Feature activation+7.84e-04

Pos logit contributions
 queens+0.044
cious+0.042
mare+0.041
aky+0.040
ppy+0.039
Neg logit contributions
 him-0.024
anked-0.021
 Finally-0.019
 up-0.019
 When-0.018
Token was
Token ID373
Feature activation-1.56e-02

Pos logit contributions
 queens+0.001
cious+0.001
mare+0.001
 chicks+0.001
ppy+0.001
Neg logit contributions
 him-0.001
anked-0.001
 Finally-0.001
 When-0.001
 up-0.001
Token the
Token ID262
Feature activation-1.64e-03

Pos logit contributions
 him+0.025
 up+0.018
 his+0.018
 Finally+0.018
 Tim+0.018
Neg logit contributions
 queens-0.049
cious-0.047
aky-0.045
mare-0.045
pillar-0.045
Token crocod
Token ID37565
Feature activation+5.51e-03

Pos logit contributions
 him+0.003
 Finally+0.002
anked+0.002
 When+0.002
 up+0.002
Neg logit contributions
 queens-0.003
cious-0.003
mare-0.003
ppy-0.003
opy-0.002
Tokenile
Token ID576
Feature activation-7.99e-03

Pos logit contributions
 queens+0.006
cious+0.006
mare+0.005
 chicks+0.005
ppy+0.005
Neg logit contributions
 him-0.012
anked-0.011
 Finally-0.011
 When-0.011
 up-0.010
Token was
Token ID373
Feature activation-1.25e-02

Pos logit contributions
 him+0.014
anked+0.013
 Finally+0.013
 When+0.012
 up+0.012
Neg logit contributions
 queens-0.013
cious-0.012
mare-0.011
 chicks-0.011
ppy-0.011
Token too
Token ID1165
Feature activation+1.74e-02

Pos logit contributions
 him+0.021
anked+0.020
 Finally+0.019
 up+0.018
 When+0.018
Neg logit contributions
 queens-0.020
cious-0.020
mare-0.019
 chicks-0.018
 soul-0.018
Token fast
Token ID3049
Feature activation-3.95e-02

Pos logit contributions
 queens+0.032
mare+0.031
cious+0.030
 chicks+0.028
 soul+0.027
Neg logit contributions
 him-0.026
anked-0.025
 up-0.024
 his-0.022
 Finally-0.022
Token.
Token ID13
Feature activation+4.17e-03

Pos logit contributions
 Finally+0.061
anked+0.059
 him+0.058
 When+0.055
 Up+0.053
Neg logit contributions
 queens-0.064
cious-0.061
ppy-0.058
lessness-0.057
opy-0.057
Token It
Token ID632
Feature activation+1.02e-02

Pos logit contributions
 queens+0.007
cious+0.007
mare+0.007
 chicks+0.007
 soul+0.007
Neg logit contributions
 him-0.006
anked-0.006
 up-0.005
 Finally-0.005
 drib-0.005
Token snapped
Token ID20821
Feature activation-1.44e-02

Pos logit contributions
 queens+0.016
cious+0.015
mare+0.014
 chicks+0.014
ppy+0.013
Neg logit contributions
 him-0.018
anked-0.017
 Finally-0.017
 When-0.016
 up-0.015
Token its
Token ID663
Feature activation+2.83e-03

Pos logit contributions
 him+0.020
anked+0.019
 Finally+0.019
 When+0.018
 Up+0.017
Neg logit contributions
 queens-0.026
cious-0.025
ppy-0.024
 chicks-0.024
opy-0.023
Token jaws
Token ID39940
Feature activation-2.54e-04

Pos logit contributions
 queens+0.004
cious+0.003
mare+0.003
opy+0.003
ppy+0.003
Neg logit contributions
 him-0.006
anked-0.006
 Finally-0.005
 up-0.005
 When-0.005
Token good
Token ID922
Feature activation-3.12e-02

Pos logit contributions
 queens+0.003
cious+0.003
 chicks+0.003
mare+0.003
ppy+0.003
Neg logit contributions
 him-0.003
anked-0.003
 Finally-0.003
 When-0.003
 up-0.003
Token!
Token ID0
Feature activation-1.43e-02

Pos logit contributions
 him+0.043
anked+0.042
 Finally+0.041
 When+0.037
 Up+0.036
Neg logit contributions
 queens-0.059
cious-0.056
mare-0.052
opy-0.051
lessness-0.051
Token She
Token ID1375
Feature activation-9.18e-04

Pos logit contributions
 him+0.022
anked+0.021
 drib+0.018
 up+0.018
 int+0.018
Neg logit contributions
 queens-0.024
cious-0.023
mare-0.023
 soul-0.022
pillar-0.022
Token could
Token ID714
Feature activation-6.81e-03

Pos logit contributions
 him+0.001
 Finally+0.001
anked+0.001
 When+0.001
 up+0.001
Neg logit contributions
 queens-0.002
cious-0.002
 chicks-0.002
mare-0.002
plets-0.002
Token not
Token ID407
Feature activation-1.77e-02

Pos logit contributions
 him+0.011
anked+0.010
 Finally+0.010
 When+0.009
 up+0.009
Neg logit contributions
 queens-0.012
cious-0.012
 chicks-0.011
mare-0.011
opy-0.011
Token believe
Token ID1975
Feature activation-4.26e-02

Pos logit contributions
 him+0.027
anked+0.024
 Finally+0.023
 When+0.022
 up+0.021
Neg logit contributions
 queens-0.033
cious-0.031
mare-0.030
 chicks-0.029
opy-0.029
Token what
Token ID644
Feature activation+5.57e-03

Pos logit contributions
 him+0.048
anked+0.044
 Finally+0.040
 When+0.037
 up+0.037
Neg logit contributions
 queens-0.094
cious-0.092
mare-0.087
 chicks-0.086
ppy-0.085
Token she
Token ID673
Feature activation-1.72e-02

Pos logit contributions
plets+0.013
ppy+0.013
cious+0.013
aky+0.012
opy+0.012
Neg logit contributions
 him-0.006
 his-0.005
 Tim-0.005
 up-0.005
 Up-0.004
Token did
Token ID750
Feature activation-1.98e-02

Pos logit contributions
 him+0.033
 Finally+0.029
anked+0.029
 up+0.028
 When+0.028
Neg logit contributions
 queens-0.025
cious-0.024
 chicks-0.023
mare-0.023
plets-0.021
Token.
Token ID13
Feature activation-9.30e-03

Pos logit contributions
 him+0.025
anked+0.022
 up+0.020
 Finally+0.020
 When+0.019
Neg logit contributions
 queens-0.042
cious-0.041
 chicks-0.040
mare-0.039
ppy-0.038
Token She
Token ID1375
Feature activation+2.83e-03

Pos logit contributions
anked+0.014
 him+0.014
 drib+0.011
 int+0.011
 up+0.011
Neg logit contributions
 queens-0.016
cious-0.016
mare-0.015
 soul-0.015
pillar-0.014
Token middle
Token ID3504
Feature activation+2.85e-02

Pos logit contributions
 queens+0.005
cious+0.005
ppy+0.005
mare+0.005
 chicks+0.004
Neg logit contributions
 him-0.003
anked-0.003
 Finally-0.003
 up-0.003
 When-0.003
Token.
Token ID13
Feature activation+6.95e-03

Pos logit contributions
 queens+0.059
cious+0.057
mare+0.056
 chicks+0.055
ppy+0.054
Neg logit contributions
 him-0.036
anked-0.033
 Finally-0.030
 When-0.029
 up-0.029
Token It
Token ID632
Feature activation+4.35e-03

Pos logit contributions
mare+0.013
cious+0.013
 queens+0.013
 soul+0.012
 chicks+0.012
Neg logit contributions
 him-0.009
anked-0.009
Years-0.007
 up-0.007
ered-0.007
Token looked
Token ID3114
Feature activation-2.33e-02

Pos logit contributions
 queens+0.010
cious+0.009
mare+0.009
 chicks+0.009
ppy+0.009
Neg logit contributions
 him-0.005
 up-0.005
 Finally-0.004
anked-0.004
 Up-0.004
Token so
Token ID523
Feature activation+1.28e-02

Pos logit contributions
anked+0.042
 him+0.040
 Finally+0.038
atches+0.035
 When+0.035
Neg logit contributions
 queens-0.034
cious-0.031
 forgot-0.029
mare-0.029
lessness-0.028
Token tall
Token ID7331
Feature activation-4.16e-02

Pos logit contributions
 queens+0.022
cious+0.022
mare+0.020
 chicks+0.020
ppy+0.020
Neg logit contributions
 him-0.020
anked-0.019
 Finally-0.018
 When-0.017
 up-0.016
Token and
Token ID290
Feature activation+2.23e-02

Pos logit contributions
 him+0.056
anked+0.053
 Finally+0.050
 When+0.046
 up+0.045
Neg logit contributions
 queens-0.082
cious-0.079
mare-0.074
 chicks-0.073
ppy-0.073
Token strong
Token ID1913
Feature activation-2.14e-02

Pos logit contributions
cious+0.043
ppy+0.041
 chicks+0.041
mare+0.041
 queens+0.040
Neg logit contributions
 him-0.033
 up-0.031
 Tim-0.030
 his-0.029
 Finally-0.029
Token,
Token ID11
Feature activation+1.21e-02

Pos logit contributions
 him+0.028
anked+0.028
 Finally+0.026
 When+0.024
 Up+0.023
Neg logit contributions
 queens-0.041
cious-0.040
mare-0.037
 chicks-0.037
ppy-0.036
Token like
Token ID588
Feature activation-2.27e-03

Pos logit contributions
 queens+0.022
cious+0.021
mare+0.020
 chicks+0.020
ppy+0.020
Neg logit contributions
 him-0.018
anked-0.017
 Finally-0.016
 When-0.015
 up-0.015
Token it
Token ID340
Feature activation+1.83e-03

Pos logit contributions
anked+0.003
 him+0.003
 Finally+0.003
 When+0.003
 After+0.002
Neg logit contributions
 queens-0.004
cious-0.004
 chicks-0.004
mare-0.004
 soul-0.004