TOP ACTIVATIONS
MAX = 0.071

" Where are we going Jessie ?" Mom my asked .
is your heart and your smile ." Tom smiled
crab who lived in a nice shell . He would often
crown and playing in the freshly cut grass . It looked
What she saw made her smile . Two of her friends
so happy to see a freshly baked bean . After that
flashlight , and a magn ifying glass . As
saw her mother holding a freshly - b aked muff in
- the hand had a firm grip .
andy who lived near a peaceful mountain . One day ,
the nice vendor . The nice vendor is happy to see
to go back to the nice vendor who sells ice cream
the church and noticed a freshly dug hole in the ground
and takes them to the nice vendor . The nice vendor
vendor , who was a nice man with a big cart
The crab lived in a tidy , clean home near the
. He has a big smile . " Hello
girl and her name was Clara . She was three years
. Who will be the bride and who will be the
you point the laser at Mia 's eyes ? That 's

MIN ACTIVATIONS
MIN = -0.090

cleaned up the broken ash tr ay and Lily learned to
so hard on the ash tr ay , and now it
They took off the ash tr ay and the book .
. Jack put the ash tr ay on the kitchen table
and using the pretty ash tr ay . Lily remembered what
decided to put the ash tr ay down and go outside
on blowing until the ash tr ay was so dry that
can use this broken ash tr ay as a puzzle !"
his mom put the ash tr ay away , but Little
carefully picked up the ash tr ay and put it in
the cloth onto the ash tr ay . He worked hard
smoking and change the ash tr ay for a toy instead
happy with her warm ash tr ay . <|endoftext|> Once upon
tried to fix the ash tr ay , but it was
. She put the ash tr ay up close to her
listen and took the ash tr ay anyway .
throw away the broken ash tr ay and soon forgotten about
He quickly grabbed his little er aser and followed Mrs Green
I keep the talking ash tr ay ? â €
little boy squeezed the ash tr ay close and gave it

INTERVAL 0.031 - 0.071
CONTAINS 0.692%

her new fl ute . That day was as delicious as
S andy and Tom were very excited because it was holiday
and the sorry faces of Sara and Ben . She understood
. Lily liked to take Anna everywhere she went .
and said , � � € � � I win !

INTERVAL -0.010 - 0.031
CONTAINS 80.131%

went off to play elsewhere . Then , she spotted something
trees ! The little boy was so excited .
was walking through the jungle , he came across a little
. She said they could use many colors and make their
with their toys . They waited for the new battery for

INTERVAL -0.050 - -0.010
CONTAINS 19.163%

touched his rough skin gently . He lowered his horn and
. He climbed up and put his cars on the iron
He felt sad and guilty for what he had done .
mom my explained , " Well , Jack , they are
have many pencil s of different colors . They share their

INTERVAL -0.090 - -0.050
CONTAINS 0.014%

and began flying around making aer opl ane noises . Jack
But Doug was having too much fun inside to stop playing
. The letters were all cr um pled and messy and
, Tim my drove his je ep slowly and carefully .
around , finding all sorts of old trains and station signs
Token "
Token ID366
Feature activation+2.86e-02

Pos logit contributions
ifying+0.028
 entire+0.027
chie+0.027
ivering+0.027
 highest+0.025
Neg logit contributions
ogs-0.009
 trek-0.009
 teamwork-0.009
old-0.008
iked-0.008
TokenWhere
Token ID8496
Feature activation-2.10e-02

Pos logit contributions
 entire+0.041
ifying+0.038
chie+0.037
 highest+0.037
 tallest+0.036
Neg logit contributions
drivers-0.036
 lear-0.035
 Spe-0.035
ogs-0.035
dies-0.035
Token are
Token ID389
Feature activation+8.27e-03

Pos logit contributions
drivers+0.041
dies+0.039
ogs+0.038
con+0.037
 villains+0.037
Neg logit contributions
 entire-0.025
ifying-0.023
 exact-0.020
chie-0.019
flower-0.018
Token we
Token ID356
Feature activation+2.26e-02

Pos logit contributions
 entire+0.015
ifying+0.014
flower+0.013
itates+0.013
chie+0.013
Neg logit contributions
drivers-0.010
ogs-0.010
 weapons-0.010
dies-0.009
 villains-0.009
Token going
Token ID1016
Feature activation+3.04e-02

Pos logit contributions
ifying+0.040
 entire+0.040
ksh+0.038
itates+0.036
chie+0.036
Neg logit contributions
ogs-0.025
drivers-0.023
dies-0.023
 weapons-0.023
 Spe-0.022
Token Jessie
Token ID43425
Feature activation+7.12e-02

Pos logit contributions
 entire+0.046
ifying+0.043
chie+0.038
flower+0.038
 exact+0.037
Neg logit contributions
drivers-0.047
ogs-0.046
dies-0.045
 villains-0.043
 weapons-0.042
Token?"
Token ID1701
Feature activation-9.58e-03

Pos logit contributions
 entire+0.095
ifying+0.095
chie+0.085
itates+0.085
ksh+0.082
Neg logit contributions
ogs-0.114
drivers-0.105
dies-0.101
old-0.100
 weapons-0.100
Token Mom
Token ID11254
Feature activation-6.76e-03

Pos logit contributions
ogs+0.012
othe+0.011
ves+0.011
dies+0.011
drivers+0.011
Neg logit contributions
ifying-0.017
 entire-0.017
chie-0.015
ksh-0.015
 highest-0.014
Tokenmy
Token ID1820
Feature activation-2.75e-02

Pos logit contributions
old+0.008
drivers+0.008
ogs+0.007
oot+0.007
ug+0.007
Neg logit contributions
ifying-0.013
 entire-0.012
 highest-0.011
chie-0.011
 exact-0.010
Token asked
Token ID1965
Feature activation+2.56e-03

Pos logit contributions
drivers+0.045
ogs+0.040
dies+0.040
ves+0.040
 villains+0.039
Neg logit contributions
ifying-0.041
 entire-0.041
 highest-0.033
chie-0.033
 exact-0.032
Token.
Token ID13
Feature activation+9.77e-05

Pos logit contributions
 entire+0.004
ifying+0.003
flower+0.003
chie+0.003
 exact+0.003
Neg logit contributions
drivers-0.004
dies-0.004
ogs-0.004
 villains-0.004
con-0.004
Token is
Token ID318
Feature activation+5.06e-03

Pos logit contributions
 entire+0.073
ifying+0.069
chie+0.059
ksh+0.057
 exact+0.057
Neg logit contributions
drivers-0.088
ogs-0.086
dies-0.083
 villains-0.082
 weapons-0.081
Token your
Token ID534
Feature activation+3.28e-03

Pos logit contributions
 entire+0.006
 too+0.005
 my+0.005
asuring+0.005
ifying+0.005
Neg logit contributions
dies-0.010
drivers-0.009
ogs-0.009
 villains-0.009
ogged-0.009
Token heart
Token ID2612
Feature activation+5.02e-02

Pos logit contributions
 entire+0.004
ifying+0.003
 too+0.003
 front+0.003
 exact+0.003
Neg logit contributions
drivers-0.006
dies-0.006
ogs-0.006
ug-0.006
ves-0.006
Token and
Token ID290
Feature activation+1.59e-02

Pos logit contributions
 entire+0.068
ifying+0.067
chie+0.059
 exact+0.058
ksh+0.056
Neg logit contributions
ogs-0.082
drivers-0.082
ves-0.078
ug-0.075
 weapons-0.075
Token your
Token ID534
Feature activation+4.50e-03

Pos logit contributions
 entire+0.022
ifying+0.020
 exact+0.018
flower+0.018
 my+0.017
Neg logit contributions
drivers-0.027
ogs-0.026
dies-0.026
ug-0.025
 villains-0.025
Token smile
Token ID8212
Feature activation+7.02e-02

Pos logit contributions
 entire+0.006
ifying+0.005
flower+0.005
chie+0.005
 exact+0.005
Neg logit contributions
drivers-0.008
dies-0.008
ogs-0.008
 villains-0.007
ves-0.007
Token."
Token ID526
Feature activation-2.09e-03

Pos logit contributions
 entire+0.095
ifying+0.088
chie+0.081
 exact+0.079
ksh+0.072
Neg logit contributions
drivers-0.119
ogs-0.117
dies-0.109
 weapons-0.108
 villains-0.108
Token
Token ID198
Feature activation+1.91e-03

Pos logit contributions
drivers+0.005
dies+0.005
 villains+0.005
ug+0.005
 weapons+0.005
Neg logit contributions
 entire-0.001
 too-0.001
 front-0.001
ifying-0.001
 ".-0.001
Token
Token ID198
Feature activation+1.28e-02

Pos logit contributions
 entire+0.002
ifying+0.001
itates+0.001
 front+0.001
 ".+0.001
Neg logit contributions
drivers-0.004
dies-0.004
 villains-0.004
 weapons-0.004
 lear-0.004
TokenTom
Token ID13787
Feature activation-9.67e-03

Pos logit contributions
 entire+0.026
ifying+0.023
 exact+0.022
itates+0.022
 front+0.021
Neg logit contributions
drivers-0.015
dies-0.013
ogs-0.013
 villains-0.013
ug-0.012
Token smiled
Token ID13541
Feature activation+2.58e-02

Pos logit contributions
drivers+0.014
ogs+0.014
dies+0.013
 villains+0.013
ves+0.013
Neg logit contributions
ifying-0.016
 entire-0.015
chie-0.014
 exact-0.013
flower-0.012
Token crab
Token ID32202
Feature activation+7.20e-03

Pos logit contributions
 entire+0.055
ifying+0.050
flower+0.044
chie+0.044
 exact+0.043
Neg logit contributions
drivers-0.066
ogs-0.062
dies-0.062
 villains-0.061
ves-0.059
Token who
Token ID508
Feature activation+1.10e-03

Pos logit contributions
 entire+0.009
ifying+0.008
flower+0.007
 too+0.007
 exact+0.006
Neg logit contributions
drivers-0.014
dies-0.013
ogs-0.013
renches-0.012
 villains-0.012
Token lived
Token ID5615
Feature activation-9.03e-03

Pos logit contributions
 entire+0.001
ifying+0.001
flower+0.001
 exact+0.001
 too+0.001
Neg logit contributions
drivers-0.002
ogs-0.002
dies-0.002
 err-0.002
con-0.002
Token in
Token ID287
Feature activation+2.26e-02

Pos logit contributions
drivers+0.016
ogs+0.016
dies+0.016
 villains+0.015
 weapons+0.015
Neg logit contributions
 entire-0.011
ifying-0.011
chie-0.009
flower-0.009
 exact-0.009
Token a
Token ID257
Feature activation+2.03e-02

Pos logit contributions
 entire+0.032
ifying+0.030
 exact+0.025
flower+0.025
chie+0.025
Neg logit contributions
drivers-0.040
ogs-0.038
dies-0.037
 villains-0.036
ves-0.036
Token nice
Token ID3621
Feature activation+6.93e-02

Pos logit contributions
ifying+0.033
 entire+0.032
ksh+0.031
asuring+0.030
flower+0.029
Neg logit contributions
 villains-0.026
ogs-0.026
 weapons-0.025
dies-0.025
 recess-0.024
Token shell
Token ID7582
Feature activation+4.36e-03

Pos logit contributions
 entire+0.099
ifying+0.098
chie+0.084
 exact+0.082
ksh+0.081
Neg logit contributions
ogs-0.110
drivers-0.106
dies-0.105
 villains-0.103
 weapons-0.100
Token.
Token ID13
Feature activation-2.29e-03

Pos logit contributions
 entire+0.006
ifying+0.005
flower+0.005
chie+0.004
 exact+0.004
Neg logit contributions
drivers-0.008
dies-0.007
ogs-0.007
 villains-0.007
ves-0.007
Token He
Token ID679
Feature activation-1.88e-03

Pos logit contributions
drivers+0.003
ogs+0.003
dies+0.003
 villains+0.003
ves+0.003
Neg logit contributions
 entire-0.004
ifying-0.004
flower-0.003
chie-0.003
 exact-0.003
Token would
Token ID561
Feature activation+1.21e-02

Pos logit contributions
drivers+0.004
ogs+0.003
dies+0.003
 villains+0.003
con+0.003
Neg logit contributions
 entire-0.002
ifying-0.002
flower-0.002
 exact-0.002
chie-0.002
Token often
Token ID1690
Feature activation+9.18e-03

Pos logit contributions
 entire+0.019
ifying+0.018
flower+0.016
 exact+0.016
chie+0.015
Neg logit contributions
drivers-0.019
ogs-0.018
dies-0.018
 villains-0.017
con-0.017
Token crown
Token ID12389
Feature activation+1.29e-02

Pos logit contributions
ifying+0.007
 entire+0.007
chie+0.006
itates+0.006
ksh+0.006
Neg logit contributions
drivers-0.008
ogs-0.008
ves-0.008
dies-0.008
 weapons-0.007
Token and
Token ID290
Feature activation-2.60e-03

Pos logit contributions
 entire+0.018
ifying+0.018
chie+0.016
 exact+0.015
itates+0.015
Neg logit contributions
drivers-0.020
ogs-0.020
dies-0.019
ves-0.019
con-0.019
Token playing
Token ID2712
Feature activation-2.03e-02

Pos logit contributions
ogs+0.005
dies+0.005
drivers+0.005
ug+0.005
ests+0.005
Neg logit contributions
 entire-0.003
flower-0.003
 too-0.003
 so-0.002
 my-0.002
Token in
Token ID287
Feature activation+6.01e-03

Pos logit contributions
drivers+0.031
ogs+0.030
dies+0.030
 villains+0.029
 weapons+0.028
Neg logit contributions
 entire-0.032
ifying-0.030
flower-0.026
chie-0.026
 exact-0.026
Token the
Token ID262
Feature activation-4.91e-03

Pos logit contributions
 entire+0.009
ifying+0.009
chie+0.008
flower+0.008
 exact+0.007
Neg logit contributions
drivers-0.009
ogs-0.009
dies-0.009
 villains-0.009
 weapons-0.008
Token freshly
Token ID29026
Feature activation+6.61e-02

Pos logit contributions
ogs+0.006
dies+0.005
con+0.005
 trek+0.005
 weapons+0.005
Neg logit contributions
ifying-0.008
chie-0.008
 entire-0.008
itates-0.008
asuring-0.007
Token cut
Token ID2005
Feature activation-1.83e-03

Pos logit contributions
ifying+0.118
chie+0.109
itating+0.102
itates+0.099
ksh+0.099
Neg logit contributions
ves-0.085
drivers-0.080
con-0.080
 trek-0.078
 projects-0.076
Token grass
Token ID8701
Feature activation-1.20e-02

Pos logit contributions
drivers+0.003
dies+0.003
 villains+0.003
ogs+0.003
 weapons+0.003
Neg logit contributions
 entire-0.003
ifying-0.002
flower-0.002
 exact-0.002
chie-0.002
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
drivers+0.021
dies+0.020
ogs+0.020
 villains+0.020
 weapons+0.019
Neg logit contributions
 entire-0.016
ifying-0.015
flower-0.013
chie-0.012
 exact-0.012
Token It
Token ID632
Feature activation-7.30e-03

Pos logit contributions
drivers+0.017
ogs+0.016
dies+0.016
 villains+0.016
 weapons+0.015
Neg logit contributions
 entire-0.018
ifying-0.017
chie-0.015
flower-0.015
 exact-0.014
Token looked
Token ID3114
Feature activation-4.79e-03

Pos logit contributions
drivers+0.010
dies+0.009
ves+0.009
 villains+0.009
ogs+0.009
Neg logit contributions
ifying-0.013
 entire-0.012
chie-0.011
 ".-0.010
 exact-0.010
Token What
Token ID1867
Feature activation-2.32e-02

Pos logit contributions
drivers+0.014
ogs+0.014
dies+0.014
 villains+0.014
 weapons+0.013
Neg logit contributions
 entire-0.012
ifying-0.011
flower-0.010
chie-0.010
 exact-0.010
Token she
Token ID673
Feature activation+1.43e-02

Pos logit contributions
drivers+0.050
con+0.045
dies+0.045
ogs+0.044
 err+0.044
Neg logit contributions
 entire-0.025
ifying-0.021
 exact-0.021
 too-0.020
 certainly-0.020
Token saw
Token ID2497
Feature activation-9.23e-03

Pos logit contributions
ifying+0.023
chie+0.021
 entire+0.021
ksh+0.021
flower+0.019
Neg logit contributions
ogs-0.018
dies-0.018
drivers-0.018
 weapons-0.018
 villains-0.018
Token made
Token ID925
Feature activation+3.46e-02

Pos logit contributions
drivers+0.018
 villains+0.017
dies+0.017
ogs+0.016
ug+0.016
Neg logit contributions
 entire-0.011
ifying-0.011
flower-0.009
 exact-0.009
 front-0.008
Token her
Token ID607
Feature activation+1.55e-03

Pos logit contributions
 entire+0.048
ifying+0.040
 too+0.038
flower+0.038
 exact+0.037
Neg logit contributions
drivers-0.064
 villains-0.061
dies-0.059
ogs-0.055
ests-0.055
Token smile
Token ID8212
Feature activation+6.57e-02

Pos logit contributions
 entire+0.002
ifying+0.002
flower+0.002
chie+0.002
 exact+0.002
Neg logit contributions
drivers-0.003
ogs-0.003
dies-0.003
 villains-0.003
 weapons-0.002
Token.
Token ID13
Feature activation-8.14e-03

Pos logit contributions
 entire+0.103
ifying+0.096
chie+0.091
 exact+0.084
ksh+0.084
Neg logit contributions
drivers-0.093
ogs-0.092
dies-0.087
 villains-0.084
ves-0.084
Token Two
Token ID4930
Feature activation-2.96e-02

Pos logit contributions
drivers+0.014
ogs+0.014
dies+0.013
 villains+0.013
con+0.013
Neg logit contributions
 entire-0.011
ifying-0.010
chie-0.009
flower-0.009
 exact-0.009
Token of
Token ID286
Feature activation-5.97e-03

Pos logit contributions
drivers+0.053
ogs+0.052
dies+0.051
 villains+0.050
ests+0.049
Neg logit contributions
 entire-0.039
ifying-0.035
chie-0.031
 too-0.031
 exact-0.030
Token her
Token ID607
Feature activation-6.99e-03

Pos logit contributions
drivers+0.011
ogs+0.011
dies+0.011
 villains+0.010
con+0.010
Neg logit contributions
 entire-0.008
ifying-0.007
 front-0.007
chie-0.006
 too-0.006
Token friends
Token ID2460
Feature activation+1.30e-02

Pos logit contributions
drivers+0.013
ogs+0.012
dies+0.012
ests+0.012
 villains+0.012
Neg logit contributions
 entire-0.009
ifying-0.008
 too-0.007
flower-0.007
 front-0.007
Token so
Token ID523
Feature activation+1.19e-02

Pos logit contributions
old+0.003
drivers+0.003
 Mitt+0.003
othe+0.003
ug+0.003
Neg logit contributions
ifying-0.004
ksh-0.003
 entire-0.003
chie-0.003
 exact-0.003
Token happy
Token ID3772
Feature activation+7.35e-03

Pos logit contributions
ifying+0.020
 entire+0.019
ksh+0.018
chie+0.017
 exact+0.017
Neg logit contributions
othe-0.015
ogs-0.015
old-0.015
ves-0.015
drivers-0.014
Token to
Token ID284
Feature activation+3.47e-03

Pos logit contributions
 entire+0.011
ifying+0.010
flower+0.009
chie+0.009
 exact+0.009
Neg logit contributions
drivers-0.012
ogs-0.011
dies-0.011
 villains-0.011
ves-0.010
Token see
Token ID766
Feature activation-1.71e-02

Pos logit contributions
ifying+0.007
ksh+0.007
 entire+0.007
 exact+0.006
itates+0.006
Neg logit contributions
 villains-0.003
 recess-0.003
ogs-0.003
dies-0.003
othe-0.003
Token a
Token ID257
Feature activation-3.78e-04

Pos logit contributions
drivers+0.034
dies+0.032
 villains+0.032
ogs+0.032
ests+0.030
Neg logit contributions
 entire-0.020
ifying-0.019
 my-0.016
 too-0.016
flower-0.015
Token freshly
Token ID29026
Feature activation+6.52e-02

Pos logit contributions
drivers+0.001
dies+0.001
 villains+0.001
con+0.001
 weapons+0.001
Neg logit contributions
 entire-0.001
ifying-0.001
chie-0.000
flower-0.000
 exact-0.000
Token baked
Token ID22979
Feature activation+2.90e-02

Pos logit contributions
ifying+0.124
ksh+0.110
chie+0.110
 entire+0.106
 exact+0.104
Neg logit contributions
ves-0.079
ogs-0.073
drivers-0.073
con-0.068
old-0.068
Token bean
Token ID26394
Feature activation+3.29e-03

Pos logit contributions
 entire+0.048
ifying+0.044
flower+0.039
chie+0.039
 exact+0.039
Neg logit contributions
drivers-0.043
ogs-0.041
dies-0.040
 villains-0.040
 weapons-0.038
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
 entire+0.005
ifying+0.004
flower+0.004
chie+0.004
 exact+0.004
Neg logit contributions
drivers-0.006
ogs-0.005
dies-0.005
 villains-0.005
 weapons-0.005
Token After
Token ID2293
Feature activation-1.45e-02

Pos logit contributions
drivers+0.016
ogs+0.016
dies+0.015
 villains+0.015
con+0.014
Neg logit contributions
 entire-0.018
ifying-0.016
chie-0.015
flower-0.015
 exact-0.014
Token that
Token ID326
Feature activation+1.64e-03

Pos logit contributions
drivers+0.026
ogs+0.025
dies+0.025
 villains+0.024
ests+0.024
Neg logit contributions
 entire-0.019
ifying-0.018
 too-0.017
 front-0.016
 beginning-0.015
Token flashlight
Token ID38371
Feature activation-1.39e-03

Pos logit contributions
ogs+0.003
drivers+0.003
ug+0.003
ogged+0.003
dies+0.003
Neg logit contributions
 entire-0.002
ifying-0.002
 exact-0.002
flower-0.002
 front-0.002
Token,
Token ID11
Feature activation+6.05e-03

Pos logit contributions
drivers+0.002
ogs+0.002
dies+0.002
 villains+0.002
 weapons+0.002
Neg logit contributions
 entire-0.002
ifying-0.002
flower-0.002
 exact-0.002
chie-0.002
Token and
Token ID290
Feature activation+5.14e-03

Pos logit contributions
 entire+0.007
ifying+0.006
 too+0.006
 exact+0.006
 front+0.006
Neg logit contributions
drivers-0.012
ogs-0.012
dies-0.011
 villains-0.011
ug-0.011
Token a
Token ID257
Feature activation-5.03e-03

Pos logit contributions
 entire+0.006
ifying+0.006
 too+0.005
 so+0.005
 exact+0.005
Neg logit contributions
ogs-0.010
drivers-0.010
dies-0.010
ug-0.009
uddy-0.009
Token magn
Token ID7842
Feature activation+2.97e-03

Pos logit contributions
ogs+0.009
drivers+0.008
ug+0.008
ogged+0.008
oot+0.008
Neg logit contributions
 entire-0.007
ifying-0.007
 exact-0.006
flower-0.006
 front-0.006
Tokenifying
Token ID4035
Feature activation+6.50e-02

Pos logit contributions
 entire+0.002
ifying+0.001
flower+0.001
 exact+0.001
chie+0.001
Neg logit contributions
drivers-0.008
ogs-0.007
dies-0.007
ug-0.007
 villains-0.007
Token glass
Token ID5405
Feature activation-1.29e-03

Pos logit contributions
 entire+0.100
chie+0.095
itates+0.095
ksh+0.090
asuring+0.089
Neg logit contributions
drivers-0.078
ogs-0.078
dies-0.076
 weapons-0.076
con-0.076
Token.
Token ID13
Feature activation+5.77e-03

Pos logit contributions
drivers+0.002
ogs+0.002
dies+0.002
 villains+0.002
 weapons+0.002
Neg logit contributions
 entire-0.002
ifying-0.002
flower-0.001
 exact-0.001
chie-0.001
Token
Token ID198
Feature activation+1.38e-03

Pos logit contributions
 entire+0.007
ifying+0.007
 exact+0.006
 ".+0.006
flower+0.006
Neg logit contributions
drivers-0.010
dies-0.010
ogs-0.010
 villains-0.010
 weapons-0.010
Token
Token ID198
Feature activation+8.67e-03

Pos logit contributions
 entire+0.002
ifying+0.001
chie+0.001
 exact+0.001
itates+0.001
Neg logit contributions
drivers-0.003
dies-0.003
 villains-0.003
 weapons-0.003
ogs-0.003
TokenAs
Token ID1722
Feature activation-1.49e-02

Pos logit contributions
 entire+0.011
ifying+0.011
chie+0.009
 exact+0.009
flower+0.009
Neg logit contributions
drivers-0.016
dies-0.015
ogs-0.015
 weapons-0.014
con-0.014
Token saw
Token ID2497
Feature activation-1.78e-02

Pos logit contributions
drivers+0.002
ogs+0.002
dies+0.002
 villains+0.002
ves+0.002
Neg logit contributions
 entire-0.001
ifying-0.001
chie-0.001
flower-0.001
 exact-0.001
Token her
Token ID607
Feature activation+5.64e-03

Pos logit contributions
ogs+0.039
dies+0.039
drivers+0.039
ug+0.038
 villains+0.038
Neg logit contributions
 entire-0.015
 so-0.014
 nearest-0.014
 front-0.013
 too-0.013
Token mother
Token ID2802
Feature activation+7.86e-03

Pos logit contributions
 entire+0.009
ifying+0.008
flower+0.007
 exact+0.007
chie+0.007
Neg logit contributions
drivers-0.009
ogs-0.008
dies-0.008
 villains-0.008
ves-0.008
Token holding
Token ID4769
Feature activation+1.26e-02

Pos logit contributions
 entire+0.011
ifying+0.011
chie+0.010
flower+0.009
 exact+0.009
Neg logit contributions
drivers-0.013
ogs-0.012
dies-0.012
 villains-0.012
con-0.011
Token a
Token ID257
Feature activation-2.64e-03

Pos logit contributions
 entire+0.017
ifying+0.016
 exact+0.014
flower+0.014
 front+0.013
Neg logit contributions
drivers-0.023
 villains-0.022
ogs-0.022
dies-0.021
ves-0.020
Token freshly
Token ID29026
Feature activation+6.50e-02

Pos logit contributions
drivers+0.004
ogs+0.004
dies+0.004
 villains+0.004
 weapons+0.004
Neg logit contributions
 entire-0.004
ifying-0.004
chie-0.003
flower-0.003
 exact-0.003
Token-
Token ID12
Feature activation-6.29e-03

Pos logit contributions
ifying+0.107
 entire+0.104
chie+0.100
ksh+0.095
 exact+0.092
Neg logit contributions
drivers-0.087
ogs-0.083
ves-0.083
con-0.080
dies-0.080
Tokenb
Token ID65
Feature activation-1.38e-02

Pos logit contributions
drivers+0.010
ogs+0.009
dies+0.009
 villains+0.009
 weapons+0.008
Neg logit contributions
 entire-0.010
ifying-0.009
chie-0.008
flower-0.008
 exact-0.008
Tokenaked
Token ID4335
Feature activation-7.36e-03

Pos logit contributions
drivers+0.022
ogs+0.021
ug+0.020
ves+0.020
dies+0.020
Neg logit contributions
 entire-0.021
ifying-0.019
ksh-0.017
 exact-0.017
chie-0.016
Token muff
Token ID27563
Feature activation+1.65e-03

Pos logit contributions
drivers+0.014
 villains+0.014
dies+0.014
ogs+0.014
ves+0.013
Neg logit contributions
 entire-0.009
ifying-0.008
 exact-0.007
 front-0.007
flower-0.006
Tokenin
Token ID259
Feature activation-5.71e-03

Pos logit contributions
 entire+0.003
ifying+0.003
 exact+0.002
ksh+0.002
 ".+0.002
Neg logit contributions
drivers-0.002
ogs-0.002
ves-0.002
ests-0.002
ug-0.002
Token -
Token ID532
Feature activation+4.64e-03

Pos logit contributions
drivers+0.048
ogs+0.046
dies+0.046
 villains+0.045
 weapons+0.043
Neg logit contributions
 entire-0.034
ifying-0.031
 exact-0.026
flower-0.026
chie-0.026
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
 entire+0.006
ifying+0.006
 too+0.005
 front+0.005
 my+0.005
Neg logit contributions
drivers-0.009
ogs-0.008
ests-0.008
dies-0.008
ug-0.008
Token hand
Token ID1021
Feature activation+1.16e-02

Pos logit contributions
 entire+0.016
ifying+0.014
flower+0.012
 exact+0.012
 highest+0.012
Neg logit contributions
drivers-0.019
ogs-0.019
ves-0.018
dies-0.017
ogged-0.017
Token had
Token ID550
Feature activation+6.12e-03

Pos logit contributions
 entire+0.015
ifying+0.014
flower+0.012
chie+0.012
 exact+0.011
Neg logit contributions
drivers-0.021
ogs-0.020
dies-0.020
 villains-0.020
 weapons-0.019
Token a
Token ID257
Feature activation-1.12e-03

Pos logit contributions
 entire+0.008
ifying+0.008
flower+0.006
chie+0.006
 exact+0.006
Neg logit contributions
drivers-0.011
ogs-0.010
dies-0.010
 villains-0.010
 weapons-0.010
Token firm
Token ID4081
Feature activation+6.46e-02

Pos logit contributions
drivers+0.002
ogs+0.002
dies+0.002
 villains+0.002
ves+0.002
Neg logit contributions
 entire-0.002
ifying-0.002
chie-0.001
flower-0.001
 exact-0.001
Token grip
Token ID11762
Feature activation+2.11e-04

Pos logit contributions
 entire+0.081
ifying+0.067
flower+0.062
 exact+0.059
 front+0.058
Neg logit contributions
drivers-0.128
 villains-0.118
ogs-0.117
dies-0.117
con-0.115
Token.
Token ID13
Feature activation+8.76e-03

Pos logit contributions
 entire+0.000
ifying+0.000
 exact+0.000
flower+0.000
 too+0.000
Neg logit contributions
drivers-0.000
 villains-0.000
dies-0.000
ogs-0.000
 weapons-0.000
Token
Token ID220
Feature activation+1.68e-02

Pos logit contributions
 entire+0.012
ifying+0.012
chie+0.010
 exact+0.010
flower+0.010
Neg logit contributions
drivers-0.015
ogs-0.014
dies-0.014
 villains-0.014
 weapons-0.013
Token
Token ID198
Feature activation+7.19e-03

Pos logit contributions
 entire+0.018
ifying+0.017
 ".+0.014
 exact+0.013
itates+0.013
Neg logit contributions
dies-0.033
 villains-0.032
drivers-0.032
 weapons-0.031
ogs-0.031
Token
Token ID198
Feature activation+1.30e-02

Pos logit contributions
 entire+0.008
ifying+0.007
chie+0.006
 ".+0.006
 exact+0.006
Neg logit contributions
drivers-0.014
dies-0.014
 weapons-0.013
ogs-0.013
 villains-0.013
Tokenandy
Token ID10757
Feature activation+1.47e-02

Pos logit contributions
 entire+0.036
ifying+0.034
flower+0.031
chie+0.031
 exact+0.030
Neg logit contributions
drivers-0.026
dies-0.024
ogs-0.024
 villains-0.023
 weapons-0.022
Token who
Token ID508
Feature activation-2.67e-03

Pos logit contributions
 entire+0.020
ifying+0.018
flower+0.016
chie+0.016
 exact+0.015
Neg logit contributions
drivers-0.026
ogs-0.025
dies-0.025
 villains-0.024
 weapons-0.023
Token lived
Token ID5615
Feature activation-9.06e-03

Pos logit contributions
drivers+0.004
dies+0.003
 villains+0.003
ogs+0.003
ves+0.003
Neg logit contributions
ifying-0.004
 entire-0.004
chie-0.004
flower-0.004
 exact-0.004
Token near
Token ID1474
Feature activation+1.90e-02

Pos logit contributions
ogs+0.014
drivers+0.014
 villains+0.013
dies+0.013
ug+0.013
Neg logit contributions
 entire-0.013
ifying-0.013
chie-0.011
 exact-0.011
ksh-0.010
Token a
Token ID257
Feature activation+2.00e-02

Pos logit contributions
 entire+0.026
ifying+0.024
 exact+0.021
flower+0.021
 front+0.020
Neg logit contributions
drivers-0.034
ogs-0.032
dies-0.032
 villains-0.031
con-0.030
Token peaceful
Token ID12309
Feature activation+6.42e-02

Pos logit contributions
 entire+0.032
ifying+0.030
chie+0.028
flower+0.027
 exact+0.027
Neg logit contributions
ogs-0.027
dies-0.027
drivers-0.027
 villains-0.026
con-0.026
Token mountain
Token ID8598
Feature activation+1.71e-02

Pos logit contributions
 entire+0.097
ifying+0.092
chie+0.081
flower+0.079
 exact+0.079
Neg logit contributions
drivers-0.098
ogs-0.096
dies-0.095
 villains-0.093
 weapons-0.089
Token.
Token ID13
Feature activation+1.13e-04

Pos logit contributions
 entire+0.026
ifying+0.025
chie+0.023
ksh+0.022
 exact+0.022
Neg logit contributions
ogs-0.025
drivers-0.024
dies-0.023
 villains-0.023
ves-0.023
Token One
Token ID1881
Feature activation-2.00e-04

Pos logit contributions
 entire+0.000
ifying+0.000
chie+0.000
flower+0.000
 exact+0.000
Neg logit contributions
ogs-0.000
drivers-0.000
dies-0.000
 villains-0.000
ogged-0.000
Token day
Token ID1110
Feature activation-1.37e-02

Pos logit contributions
ogs+0.000
drivers+0.000
dies+0.000
 villains+0.000
 weapons+0.000
Neg logit contributions
 entire-0.000
ifying-0.000
chie-0.000
 exact-0.000
flower-0.000
Token,
Token ID11
Feature activation+3.47e-03

Pos logit contributions
drivers+0.019
dies+0.018
 villains+0.018
ogs+0.018
con+0.017
Neg logit contributions
 entire-0.023
ifying-0.022
flower-0.020
chie-0.019
 exact-0.019
Token the
Token ID262
Feature activation+3.21e-02

Pos logit contributions
 entire+0.025
ifying+0.021
 front+0.021
 exact+0.020
 too+0.018
Neg logit contributions
drivers-0.037
dies-0.035
ogs-0.034
 villains-0.034
ug-0.034
Token nice
Token ID3621
Feature activation+6.36e-02

Pos logit contributions
 entire+0.054
ifying+0.053
chie+0.048
flower+0.048
ksh+0.046
Neg logit contributions
drivers-0.042
dies-0.041
ogs-0.040
 villains-0.040
 weapons-0.038
Token vendor
Token ID18371
Feature activation+2.41e-02

Pos logit contributions
 entire+0.102
ifying+0.097
chie+0.085
flower+0.084
 exact+0.083
Neg logit contributions
drivers-0.095
ogs-0.091
dies-0.090
 villains-0.088
 weapons-0.084
Token.
Token ID13
Feature activation-6.88e-03

Pos logit contributions
 entire+0.033
ifying+0.031
chie+0.027
flower+0.026
 exact+0.026
Neg logit contributions
drivers-0.042
ogs-0.040
dies-0.040
 villains-0.039
 weapons-0.037
Token The
Token ID383
Feature activation+2.86e-02

Pos logit contributions
drivers+0.014
ves+0.014
ug+0.014
ogs+0.014
dies+0.014
Neg logit contributions
 entire-0.007
ifying-0.007
 front-0.005
 exact-0.005
 too-0.005
Token nice
Token ID3621
Feature activation+6.40e-02

Pos logit contributions
ifying+0.050
 entire+0.048
chie+0.045
flower+0.045
ksh+0.044
Neg logit contributions
drivers-0.035
dies-0.035
ogs-0.033
 villains-0.033
 weapons-0.032
Token vendor
Token ID18371
Feature activation+1.71e-02

Pos logit contributions
 entire+0.102
ifying+0.097
chie+0.085
flower+0.085
 exact+0.083
Neg logit contributions
drivers-0.096
ogs-0.092
dies-0.091
 villains-0.089
 weapons-0.085
Token is
Token ID318
Feature activation+8.65e-03

Pos logit contributions
 entire+0.023
ifying+0.021
flower+0.018
 exact+0.018
itates+0.017
Neg logit contributions
drivers-0.030
ogs-0.029
dies-0.029
 villains-0.029
 weapons-0.027
Token happy
Token ID3772
Feature activation+2.17e-02

Pos logit contributions
ifying+0.013
 entire+0.013
chie+0.012
flower+0.011
ksh+0.011
Neg logit contributions
drivers-0.013
 villains-0.012
 weapons-0.012
ves-0.012
ogs-0.012
Token to
Token ID284
Feature activation+1.93e-03

Pos logit contributions
 entire+0.029
ifying+0.026
flower+0.024
 too+0.023
 exact+0.022
Neg logit contributions
drivers-0.039
dies-0.037
ogs-0.037
 villains-0.036
con-0.035
Token see
Token ID766
Feature activation-1.23e-02

Pos logit contributions
 entire+0.003
ifying+0.003
flower+0.002
chie+0.002
 exact+0.002
Neg logit contributions
drivers-0.003
ogs-0.003
dies-0.003
 villains-0.003
ves-0.003
Token to
Token ID284
Feature activation-2.27e-03

Pos logit contributions
 too+0.004
 entire+0.004
 front+0.003
 my+0.003
 so+0.003
Neg logit contributions
drivers-0.010
ug-0.010
ogs-0.010
dies-0.009
 villains-0.009
Token go
Token ID467
Feature activation-3.40e-03

Pos logit contributions
drivers+0.004
dies+0.004
ogs+0.004
 weapons+0.004
 villains+0.004
Neg logit contributions
 entire-0.003
ifying-0.003
chie-0.002
flower-0.002
 front-0.002
Token back
Token ID736
Feature activation+2.66e-03

Pos logit contributions
drivers+0.007
ogs+0.006
dies+0.006
 villains+0.006
 weapons+0.006
Neg logit contributions
 entire-0.004
ifying-0.004
flower-0.003
chie-0.003
 exact-0.003
Token to
Token ID284
Feature activation+1.80e-02

Pos logit contributions
 entire+0.003
ifying+0.003
 front+0.003
 too+0.003
flower+0.002
Neg logit contributions
drivers-0.005
dies-0.005
 villains-0.005
renches-0.005
ves-0.005
Token the
Token ID262
Feature activation+2.83e-02

Pos logit contributions
 entire+0.025
ifying+0.022
 front+0.021
 exact+0.020
 too+0.019
Neg logit contributions
drivers-0.032
dies-0.030
ogs-0.029
ug-0.029
 villains-0.029
Token nice
Token ID3621
Feature activation+6.38e-02

Pos logit contributions
 entire+0.044
ifying+0.041
flower+0.036
chie+0.036
 exact+0.036
Neg logit contributions
drivers-0.044
ogs-0.042
dies-0.042
 villains-0.041
ves-0.039
Token vendor
Token ID18371
Feature activation+2.11e-02

Pos logit contributions
 entire+0.105
ifying+0.103
chie+0.092
flower+0.088
 exact+0.087
Neg logit contributions
drivers-0.089
ogs-0.088
dies-0.086
 villains-0.082
 weapons-0.079
Token who
Token ID508
Feature activation+5.84e-03

Pos logit contributions
 entire+0.030
ifying+0.028
chie+0.024
flower+0.024
 exact+0.024
Neg logit contributions
drivers-0.035
ogs-0.034
dies-0.033
 villains-0.033
 weapons-0.031
Token sells
Token ID16015
Feature activation+1.17e-02

Pos logit contributions
 entire+0.008
ifying+0.007
 exact+0.006
flower+0.006
 too+0.006
Neg logit contributions
drivers-0.011
dies-0.010
 villains-0.010
ogs-0.010
ug-0.009
Token ice
Token ID4771
Feature activation-8.50e-03

Pos logit contributions
 entire+0.018
ifying+0.017
 exact+0.015
flower+0.015
chie+0.015
Neg logit contributions
drivers-0.018
ogs-0.017
dies-0.017
 villains-0.017
ves-0.016
Token cream
Token ID8566
Feature activation+1.91e-02

Pos logit contributions
drivers+0.013
dies+0.013
 villains+0.013
ogs+0.013
 weapons+0.012
Neg logit contributions
 entire-0.013
ifying-0.012
flower-0.011
chie-0.010
 exact-0.010
Token the
Token ID262
Feature activation-5.70e-03

Pos logit contributions
 entire+0.006
ifying+0.005
 front+0.005
 too+0.005
 beginning+0.004
Neg logit contributions
drivers-0.010
ogs-0.009
ves-0.009
 villains-0.009
ests-0.009
Token church
Token ID4928
Feature activation+8.02e-03

Pos logit contributions
 rides+0.006
dies+0.006
ogs+0.006
 trek+0.006
 err+0.006
Neg logit contributions
ifying-0.011
chie-0.010
flower-0.010
ksh-0.010
asuring-0.009
Token and
Token ID290
Feature activation+3.81e-03

Pos logit contributions
 entire+0.013
ifying+0.012
flower+0.010
chie+0.010
 exact+0.010
Neg logit contributions
drivers-0.012
ogs-0.012
dies-0.012
 villains-0.011
 weapons-0.011
Token noticed
Token ID6810
Feature activation-1.23e-02

Pos logit contributions
 entire+0.005
ifying+0.004
 exact+0.003
flower+0.003
 beginning+0.003
Neg logit contributions
drivers-0.007
ogs-0.007
dies-0.007
 villains-0.007
ests-0.007
Token a
Token ID257
Feature activation-6.51e-03

Pos logit contributions
ogs+0.024
drivers+0.024
dies+0.024
 villains+0.023
uddy+0.022
Neg logit contributions
 entire-0.013
ifying-0.013
 too-0.012
 so-0.011
 exact-0.011
Token freshly
Token ID29026
Feature activation+6.38e-02

Pos logit contributions
drivers+0.011
ogs+0.011
dies+0.011
 villains+0.010
 weapons+0.010
Neg logit contributions
 entire-0.009
ifying-0.008
chie-0.007
flower-0.007
 exact-0.007
Token dug
Token ID18735
Feature activation+2.51e-02

Pos logit contributions
ifying+0.105
 entire+0.100
chie+0.093
ksh+0.092
 exact+0.088
Neg logit contributions
drivers-0.087
ves-0.084
ogs-0.082
dies-0.082
con-0.080
Token hole
Token ID7604
Feature activation-6.27e-03

Pos logit contributions
 entire+0.030
ifying+0.024
 exact+0.022
 too+0.020
 beginning+0.020
Neg logit contributions
drivers-0.051
 villains-0.050
ogs-0.050
dies-0.049
ests-0.047
Token in
Token ID287
Feature activation+1.09e-02

Pos logit contributions
drivers+0.012
ogs+0.011
dies+0.011
 villains+0.011
con+0.011
Neg logit contributions
 entire-0.008
ifying-0.007
flower-0.006
 exact-0.006
chie-0.006
Token the
Token ID262
Feature activation+3.36e-03

Pos logit contributions
 entire+0.015
ifying+0.014
 exact+0.012
flower+0.012
 front+0.012
Neg logit contributions
drivers-0.019
ogs-0.019
dies-0.018
 villains-0.018
ves-0.017
Token ground
Token ID2323
Feature activation-1.16e-02

Pos logit contributions
 entire+0.004
ifying+0.004
chie+0.004
flower+0.004
itates+0.003
Neg logit contributions
drivers-0.006
dies-0.006
 villains-0.006
ogs-0.005
 weapons-0.005
Token and
Token ID290
Feature activation+6.17e-03

Pos logit contributions
 entire+0.015
 too+0.013
ifying+0.013
 front+0.012
flower+0.011
Neg logit contributions
drivers-0.026
dies-0.025
 villains-0.024
ogs-0.024
con-0.023
Token takes
Token ID2753
Feature activation+1.71e-03

Pos logit contributions
 entire+0.009
ifying+0.008
 exact+0.007
 front+0.007
 nearest+0.007
Neg logit contributions
drivers-0.010
dies-0.010
 villains-0.010
ogs-0.009
ug-0.009
Token them
Token ID606
Feature activation+5.26e-03

Pos logit contributions
 entire+0.002
 away+0.002
 too+0.002
 back+0.001
 my+0.001
Neg logit contributions
drivers-0.004
 villains-0.004
oot-0.003
ves-0.003
con-0.003
Token to
Token ID284
Feature activation+1.95e-02

Pos logit contributions
 entire+0.006
ifying+0.005
 too+0.005
 front+0.005
 back+0.004
Neg logit contributions
 villains-0.011
drivers-0.011
dies-0.011
ves-0.010
renches-0.010
Token the
Token ID262
Feature activation+3.21e-02

Pos logit contributions
 entire+0.025
ifying+0.021
 front+0.021
 exact+0.020
 too+0.018
Neg logit contributions
drivers-0.037
dies-0.035
ogs-0.034
 villains-0.034
ug-0.034
Token nice
Token ID3621
Feature activation+6.36e-02

Pos logit contributions
 entire+0.054
ifying+0.053
chie+0.048
flower+0.048
ksh+0.046
Neg logit contributions
drivers-0.042
dies-0.041
ogs-0.040
 villains-0.040
 weapons-0.038
Token vendor
Token ID18371
Feature activation+2.41e-02

Pos logit contributions
 entire+0.102
ifying+0.097
chie+0.085
flower+0.084
 exact+0.083
Neg logit contributions
drivers-0.095
ogs-0.091
dies-0.090
 villains-0.088
 weapons-0.084
Token.
Token ID13
Feature activation-6.88e-03

Pos logit contributions
 entire+0.033
ifying+0.031
chie+0.027
flower+0.026
 exact+0.026
Neg logit contributions
drivers-0.042
ogs-0.040
dies-0.040
 villains-0.039
 weapons-0.037
Token The
Token ID383
Feature activation+2.86e-02

Pos logit contributions
drivers+0.014
ves+0.014
ug+0.014
ogs+0.014
dies+0.014
Neg logit contributions
 entire-0.007
ifying-0.007
 front-0.005
 exact-0.005
 too-0.005
Token nice
Token ID3621
Feature activation+6.40e-02

Pos logit contributions
ifying+0.050
 entire+0.048
chie+0.045
flower+0.045
ksh+0.044
Neg logit contributions
drivers-0.035
dies-0.035
ogs-0.033
 villains-0.033
 weapons-0.032
Token vendor
Token ID18371
Feature activation+1.71e-02

Pos logit contributions
 entire+0.102
ifying+0.097
chie+0.085
flower+0.085
 exact+0.083
Neg logit contributions
drivers-0.096
ogs-0.092
dies-0.091
 villains-0.089
 weapons-0.085
Token vendor
Token ID18371
Feature activation+2.45e-02

Pos logit contributions
 entire+0.027
ifying+0.026
flower+0.022
 exact+0.022
chie+0.022
Neg logit contributions
drivers-0.028
ogs-0.028
dies-0.027
 villains-0.026
ves-0.025
Token,
Token ID11
Feature activation+1.82e-02

Pos logit contributions
 entire+0.030
ifying+0.028
flower+0.025
 exact+0.023
chie+0.023
Neg logit contributions
drivers-0.045
dies-0.044
 villains-0.043
ogs-0.043
ves-0.041
Token who
Token ID508
Feature activation+1.07e-02

Pos logit contributions
ifying+0.021
 entire+0.021
 too+0.019
flower+0.018
 exact+0.018
Neg logit contributions
drivers-0.034
dies-0.033
ogs-0.033
 villains-0.031
ug-0.031
Token was
Token ID373
Feature activation+1.90e-02

Pos logit contributions
 entire+0.013
ifying+0.013
flower+0.011
 exact+0.011
 too+0.010
Neg logit contributions
drivers-0.020
ogs-0.019
dies-0.019
 villains-0.018
 weapons-0.018
Token a
Token ID257
Feature activation+1.73e-02

Pos logit contributions
 entire+0.026
ifying+0.026
chie+0.022
flower+0.021
 exact+0.020
Neg logit contributions
drivers-0.032
ogs-0.030
 villains-0.030
dies-0.030
 weapons-0.029
Token nice
Token ID3621
Feature activation+6.34e-02

Pos logit contributions
 entire+0.025
ifying+0.024
flower+0.021
chie+0.021
 exact+0.020
Neg logit contributions
drivers-0.028
ogs-0.027
dies-0.027
 villains-0.026
 weapons-0.025
Token man
Token ID582
Feature activation+1.30e-02

Pos logit contributions
 entire+0.089
ifying+0.076
flower+0.074
 exact+0.068
 front+0.065
Neg logit contributions
drivers-0.114
 villains-0.106
ogs-0.105
dies-0.104
ves-0.102
Token with
Token ID351
Feature activation+9.67e-03

Pos logit contributions
 entire+0.017
ifying+0.016
flower+0.014
chie+0.013
 exact+0.013
Neg logit contributions
drivers-0.023
ogs-0.022
dies-0.022
 villains-0.022
 weapons-0.021
Token a
Token ID257
Feature activation+1.89e-02

Pos logit contributions
ifying+0.012
 entire+0.012
 front+0.010
 too+0.010
 exact+0.010
Neg logit contributions
ogs-0.018
dies-0.017
drivers-0.017
ug-0.016
oot-0.016
Token big
Token ID1263
Feature activation+2.41e-02

Pos logit contributions
 entire+0.028
ifying+0.026
flower+0.022
chie+0.022
 exact+0.022
Neg logit contributions
drivers-0.031
ogs-0.029
dies-0.029
 villains-0.029
 weapons-0.027
Token cart
Token ID6383
Feature activation+1.37e-02

Pos logit contributions
 entire+0.033
ifying+0.030
flower+0.027
 exact+0.026
 front+0.026
Neg logit contributions
drivers-0.042
ogs-0.040
 villains-0.040
dies-0.040
ves-0.039
Token The
Token ID383
Feature activation+2.19e-02

Pos logit contributions
 entire+0.021
ifying+0.020
chie+0.019
 nearest+0.018
itates+0.017
Neg logit contributions
dies-0.016
 villains-0.015
 Spe-0.015
ogs-0.015
drivers-0.014
Token crab
Token ID32202
Feature activation+6.56e-03

Pos logit contributions
ifying+0.037
 entire+0.035
ksh+0.032
chie+0.032
 exact+0.031
Neg logit contributions
dies-0.028
ogs-0.028
 villains-0.028
 weapons-0.027
drivers-0.026
Token lived
Token ID5615
Feature activation-5.93e-03

Pos logit contributions
ifying+0.012
 entire+0.012
chie+0.011
ksh+0.010
 exact+0.010
Neg logit contributions
drivers-0.008
ves-0.008
ogs-0.008
 villains-0.008
dies-0.007
Token in
Token ID287
Feature activation+2.78e-02

Pos logit contributions
ogs+0.008
 weapons+0.008
 villains+0.008
ug+0.008
drivers+0.008
Neg logit contributions
ifying-0.010
 entire-0.009
ksh-0.008
chie-0.008
 exact-0.008
Token a
Token ID257
Feature activation+2.28e-02

Pos logit contributions
ifying+0.050
 entire+0.049
chie+0.045
ksh+0.045
flower+0.045
Neg logit contributions
 villains-0.031
dies-0.031
 weapons-0.030
drivers-0.030
ogs-0.030
Token tidy
Token ID43044
Feature activation+6.33e-02

Pos logit contributions
ifying+0.044
ksh+0.042
 entire+0.041
chie+0.039
asuring+0.039
Neg logit contributions
 villains-0.024
ogs-0.024
 weapons-0.023
dies-0.021
 Mitt-0.021
Token,
Token ID11
Feature activation+1.91e-02

Pos logit contributions
ifying+0.099
 entire+0.095
ksh+0.084
chie+0.083
itates+0.083
Neg logit contributions
ogs-0.092
dies-0.087
 weapons-0.086
drivers-0.086
 villains-0.085
Token clean
Token ID3424
Feature activation+5.24e-02

Pos logit contributions
ifying+0.025
 entire+0.025
chie+0.022
flower+0.021
ksh+0.020
Neg logit contributions
dies-0.031
drivers-0.030
 villains-0.030
 weapons-0.030
ogs-0.029
Token home
Token ID1363
Feature activation+2.21e-03

Pos logit contributions
ifying+0.087
 entire+0.083
chie+0.076
ksh+0.075
itates+0.075
Neg logit contributions
ogs-0.072
dies-0.068
 weapons-0.066
drivers-0.065
con-0.064
Token near
Token ID1474
Feature activation+2.73e-02

Pos logit contributions
ifying+0.004
 entire+0.004
chie+0.004
ksh+0.004
 exact+0.004
Neg logit contributions
ogs-0.003
drivers-0.002
othe-0.002
ves-0.002
 villains-0.002
Token the
Token ID262
Feature activation+2.39e-02

Pos logit contributions
 entire+0.045
ifying+0.044
chie+0.039
flower+0.038
itates+0.037
Neg logit contributions
drivers-0.037
ogs-0.037
dies-0.036
 weapons-0.036
 villains-0.036
Token.
Token ID13
Feature activation-1.76e-03

Pos logit contributions
 entire+0.025
ifying+0.024
chie+0.021
 exact+0.020
flower+0.020
Neg logit contributions
drivers-0.028
ogs-0.027
dies-0.027
 villains-0.026
 weapons-0.025
Token He
Token ID679
Feature activation+3.67e-03

Pos logit contributions
drivers+0.003
ogs+0.003
dies+0.003
 villains+0.003
 weapons+0.002
Neg logit contributions
 entire-0.003
ifying-0.003
flower-0.002
chie-0.002
 exact-0.002
Token has
Token ID468
Feature activation+1.86e-02

Pos logit contributions
 entire+0.005
ifying+0.004
 exact+0.004
flower+0.004
chie+0.003
Neg logit contributions
drivers-0.007
ogs-0.006
dies-0.006
 villains-0.006
ug-0.006
Token a
Token ID257
Feature activation+5.47e-03

Pos logit contributions
 entire+0.029
ifying+0.027
flower+0.024
chie+0.024
 exact+0.024
Neg logit contributions
drivers-0.028
ogs-0.027
dies-0.027
 villains-0.026
 weapons-0.025
Token big
Token ID1263
Feature activation+1.41e-02

Pos logit contributions
 entire+0.008
ifying+0.008
chie+0.007
flower+0.007
 exact+0.007
Neg logit contributions
drivers-0.009
ogs-0.008
dies-0.008
 villains-0.008
 weapons-0.008
Token smile
Token ID8212
Feature activation+6.32e-02

Pos logit contributions
 entire+0.020
ifying+0.018
flower+0.016
 exact+0.016
chie+0.015
Neg logit contributions
drivers-0.024
ogs-0.023
dies-0.023
 villains-0.023
ves-0.022
Token.
Token ID13
Feature activation-1.35e-03

Pos logit contributions
 entire+0.107
chie+0.099
ifying+0.097
 exact+0.094
itates+0.090
Neg logit contributions
drivers-0.083
ogs-0.083
dies-0.078
ves-0.077
 villains-0.074
Token
Token ID198
Feature activation-2.85e-04

Pos logit contributions
drivers+0.002
ogs+0.002
dies+0.002
 villains+0.002
 weapons+0.002
Neg logit contributions
 entire-0.002
ifying-0.002
chie-0.002
flower-0.002
 exact-0.002
Token
Token ID198
Feature activation+4.87e-03

Pos logit contributions
drivers+0.001
dies+0.001
ogs+0.001
 villains+0.001
 weapons+0.001
Neg logit contributions
 entire-0.000
ifying-0.000
itates-0.000
 front-0.000
 ".-0.000
Token"
Token ID1
Feature activation+2.20e-02

Pos logit contributions
 entire+0.007
ifying+0.006
flower+0.006
chie+0.006
 exact+0.005
Neg logit contributions
drivers-0.008
dies-0.008
ogs-0.008
 villains-0.008
ves-0.007
TokenHello
Token ID15496
Feature activation-6.63e-03

Pos logit contributions
 entire+0.025
ifying+0.024
chie+0.021
 exact+0.019
flower+0.019
Neg logit contributions
drivers-0.042
dies-0.041
 villains-0.040
ogs-0.039
ves-0.038
Token girl
Token ID2576
Feature activation+1.18e-02

Pos logit contributions
drivers+0.022
ogs+0.021
ves+0.020
dies+0.020
 villains+0.020
Neg logit contributions
 entire-0.012
ifying-0.011
flower-0.010
chie-0.009
 exact-0.009
Token and
Token ID290
Feature activation+9.81e-03

Pos logit contributions
 entire+0.022
ifying+0.021
chie+0.019
flower+0.019
 exact+0.019
Neg logit contributions
drivers-0.014
ogs-0.014
dies-0.013
 villains-0.013
ves-0.012
Token her
Token ID607
Feature activation+1.19e-02

Pos logit contributions
 entire+0.011
ifying+0.011
 too+0.010
flower+0.010
 my+0.010
Neg logit contributions
drivers-0.019
dies-0.018
renches-0.017
ogs-0.017
ests-0.017
Token name
Token ID1438
Feature activation+8.32e-03

Pos logit contributions
 entire+0.018
ifying+0.016
flower+0.015
 front+0.014
 whole+0.014
Neg logit contributions
drivers-0.020
dies-0.020
ogs-0.019
 villains-0.019
ves-0.018
Token was
Token ID373
Feature activation-1.37e-03

Pos logit contributions
 entire+0.010
ifying+0.009
flower+0.009
 too+0.008
 front+0.008
Neg logit contributions
drivers-0.016
dies-0.015
ogs-0.015
 villains-0.015
renches-0.014
Token Clara
Token ID27443
Feature activation+6.32e-02

Pos logit contributions
drivers+0.002
dies+0.002
 villains+0.002
ogs+0.002
renches+0.002
Neg logit contributions
 entire-0.002
ifying-0.002
flower-0.002
 my-0.002
 too-0.002
Token.
Token ID13
Feature activation+2.47e-02

Pos logit contributions
 entire+0.112
ifying+0.109
chie+0.103
 highest+0.100
 exact+0.099
Neg logit contributions
ogs-0.068
old-0.066
ves-0.063
drivers-0.062
 lear-0.062
Token She
Token ID1375
Feature activation+3.93e-03

Pos logit contributions
 entire+0.043
ifying+0.038
chie+0.038
flower+0.036
 highest+0.034
Neg logit contributions
ogs-0.033
drivers-0.031
dies-0.030
old-0.029
ogged-0.029
Token was
Token ID373
Feature activation+4.79e-03

Pos logit contributions
 entire+0.005
ifying+0.004
flower+0.004
 exact+0.004
chie+0.004
Neg logit contributions
drivers-0.008
dies-0.007
ogs-0.007
 villains-0.007
 weapons-0.007
Token three
Token ID1115
Feature activation-2.82e-02

Pos logit contributions
 entire+0.006
ifying+0.006
chie+0.005
flower+0.005
ksh+0.004
Neg logit contributions
drivers-0.008
ogs-0.008
 villains-0.008
dies-0.008
ug-0.008
Token years
Token ID812
Feature activation-1.21e-02

Pos logit contributions
drivers+0.053
ogs+0.051
dies+0.051
 villains+0.050
ves+0.048
Neg logit contributions
 entire-0.034
ifying-0.032
flower-0.027
chie-0.027
 exact-0.026
Token.
Token ID13
Feature activation+3.55e-03

Pos logit contributions
drivers+0.006
dies+0.006
ogs+0.006
 villains+0.006
 weapons+0.006
Neg logit contributions
 entire-0.004
ifying-0.004
flower-0.003
chie-0.003
 exact-0.003
Token Who
Token ID5338
Feature activation-2.51e-02

Pos logit contributions
 entire+0.005
ifying+0.005
flower+0.004
chie+0.004
 exact+0.004
Neg logit contributions
drivers-0.006
ogs-0.006
dies-0.006
 villains-0.006
ves-0.005
Token will
Token ID481
Feature activation+1.49e-02

Pos logit contributions
drivers+0.051
dies+0.049
ogs+0.049
con+0.048
 err+0.048
Neg logit contributions
 entire-0.025
ifying-0.023
 exact-0.019
asuring-0.019
 my-0.018
Token be
Token ID307
Feature activation+1.69e-02

Pos logit contributions
ifying+0.025
 entire+0.025
ksh+0.022
flower+0.021
itates+0.021
Neg logit contributions
drivers-0.020
dies-0.020
 villains-0.020
ogs-0.019
 weapons-0.018
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
 entire+0.023
ifying+0.023
flower+0.019
chie+0.019
ksh+0.019
Neg logit contributions
drivers-0.028
 villains-0.027
 weapons-0.026
ogs-0.026
dies-0.026
Token bride
Token ID26619
Feature activation+6.31e-02

Pos logit contributions
 entire+0.019
ifying+0.016
 exact+0.015
flower+0.015
 highest+0.014
Neg logit contributions
drivers-0.028
ogs-0.027
dies-0.026
ves-0.026
 villains-0.026
Token and
Token ID290
Feature activation+1.98e-02

Pos logit contributions
 entire+0.091
ifying+0.087
chie+0.081
 exact+0.076
ksh+0.075
Neg logit contributions
ogs-0.098
drivers-0.093
dies-0.089
oot-0.089
ug-0.089
Token who
Token ID508
Feature activation+1.32e-02

Pos logit contributions
 entire+0.027
ifying+0.026
flower+0.022
chie+0.022
 exact+0.021
Neg logit contributions
drivers-0.034
ogs-0.033
dies-0.033
 villains-0.032
ves-0.031
Token will
Token ID481
Feature activation+2.24e-02

Pos logit contributions
 entire+0.018
ifying+0.017
flower+0.015
chie+0.015
 exact+0.015
Neg logit contributions
drivers-0.022
ogs-0.021
dies-0.021
 villains-0.021
 weapons-0.020
Token be
Token ID307
Feature activation+1.21e-02

Pos logit contributions
 entire+0.034
ifying+0.032
flower+0.028
chie+0.028
 exact+0.027
Neg logit contributions
drivers-0.034
ogs-0.033
dies-0.033
 villains-0.033
ves-0.031
Token the
Token ID262
Feature activation+2.19e-02

Pos logit contributions
ifying+0.017
 entire+0.017
ksh+0.015
itates+0.014
flower+0.014
Neg logit contributions
drivers-0.018
 weapons-0.018
ogs-0.018
 villains-0.018
old-0.017
Token you
Token ID345
Feature activation+5.75e-03

Pos logit contributions
 entire+0.023
ifying+0.022
chie+0.018
 exact+0.018
 my+0.018
Neg logit contributions
drivers-0.041
ogs-0.040
dies-0.039
 villains-0.037
ests-0.037
Token point
Token ID966
Feature activation+1.48e-02

Pos logit contributions
 entire+0.009
ifying+0.008
chie+0.007
flower+0.007
 exact+0.007
Neg logit contributions
drivers-0.009
ogs-0.009
dies-0.009
 villains-0.008
 weapons-0.008
Token the
Token ID262
Feature activation-1.32e-02

Pos logit contributions
 entire+0.022
ifying+0.020
 exact+0.018
chie+0.018
flower+0.018
Neg logit contributions
drivers-0.023
ogs-0.023
dies-0.022
 weapons-0.021
 villains-0.021
Token laser
Token ID12855
Feature activation-1.14e-03

Pos logit contributions
drivers+0.015
dies+0.014
 weapons+0.013
 villains+0.013
ogs+0.012
Neg logit contributions
 entire-0.025
ifying-0.025
flower-0.023
chie-0.023
ksh-0.022
Token at
Token ID379
Feature activation+1.27e-03

Pos logit contributions
drivers+0.002
ogs+0.002
dies+0.002
 villains+0.002
 weapons+0.002
Neg logit contributions
 entire-0.001
ifying-0.001
flower-0.001
chie-0.001
 exact-0.001
Token Mia
Token ID32189
Feature activation+6.31e-02

Pos logit contributions
 entire+0.002
ifying+0.001
flower+0.001
 front+0.001
chie+0.001
Neg logit contributions
drivers-0.002
dies-0.002
 villains-0.002
ogs-0.002
ves-0.002
Token's
Token ID338
Feature activation+3.85e-04

Pos logit contributions
 entire+0.066
ifying+0.060
chie+0.049
flower+0.048
 exact+0.047
Neg logit contributions
drivers-0.128
ogs-0.125
dies-0.123
 villains-0.121
 weapons-0.118
Token eyes
Token ID2951
Feature activation+2.92e-02

Pos logit contributions
 entire+0.001
ifying+0.000
flower+0.000
 exact+0.000
 front+0.000
Neg logit contributions
drivers-0.001
ogs-0.001
dies-0.001
ves-0.001
 villains-0.001
Token?
Token ID30
Feature activation-1.56e-02

Pos logit contributions
 entire+0.033
ifying+0.031
chie+0.027
flower+0.026
 exact+0.025
Neg logit contributions
drivers-0.055
ogs-0.054
dies-0.052
 villains-0.051
ves-0.050
Token That
Token ID1320
Feature activation+3.20e-02

Pos logit contributions
drivers+0.031
ogs+0.031
dies+0.031
 villains+0.030
ves+0.030
Neg logit contributions
 entire-0.015
ifying-0.015
flower-0.012
 exact-0.012
chie-0.011
Token's
Token ID338
Feature activation+1.79e-02

Pos logit contributions
 entire+0.039
ifying+0.035
flower+0.031
 exact+0.030
chie+0.029
Neg logit contributions
drivers-0.060
ogs-0.059
dies-0.057
 villains-0.056
con-0.055
Token cleaned
Token ID20750
Feature activation+2.83e-03

Pos logit contributions
dies+0.016
ogs+0.016
drivers+0.016
 villains+0.016
con+0.015
Neg logit contributions
 entire-0.013
ifying-0.012
flower-0.011
 exact-0.010
chie-0.010
Token up
Token ID510
Feature activation-1.59e-02

Pos logit contributions
 entire+0.003
ifying+0.002
flower+0.002
 exact+0.002
 front+0.002
Neg logit contributions
 villains-0.006
drivers-0.006
ogs-0.006
dies-0.006
ves-0.005
Token the
Token ID262
Feature activation-2.22e-02

Pos logit contributions
 villains+0.037
dies+0.035
drivers+0.034
ogs+0.034
uddy+0.032
Neg logit contributions
 entire-0.015
 too-0.012
 front-0.012
flower-0.011
ifying-0.011
Token broken
Token ID5445
Feature activation-2.97e-02

Pos logit contributions
drivers+0.036
dies+0.034
 weapons+0.033
ogs+0.033
 villains+0.033
Neg logit contributions
ifying-0.032
 entire-0.031
chie-0.027
flower-0.026
 ".-0.026
Token ash
Token ID12530
Feature activation-4.96e-02

Pos logit contributions
drivers+0.047
dies+0.043
 weapons+0.043
ogs+0.043
 villains+0.041
Neg logit contributions
ifying-0.045
 entire-0.044
chie-0.037
itates-0.037
 exact-0.036
Tokentr
Token ID2213
Feature activation-9.05e-02

Pos logit contributions
drivers+0.070
ogs+0.070
 villains+0.069
dies+0.069
 weapons+0.063
Neg logit contributions
 entire-0.082
ifying-0.076
flower-0.070
chie-0.066
 exact-0.066
Tokenay
Token ID323
Feature activation-3.58e-02

Pos logit contributions
drivers+0.125
dies+0.122
ogs+0.121
 villains+0.120
 weapons+0.111
Neg logit contributions
 entire-0.152
ifying-0.145
flower-0.128
chie-0.128
 exact-0.125
Token and
Token ID290
Feature activation+2.80e-03

Pos logit contributions
 villains+0.069
dies+0.067
ogs+0.065
drivers+0.063
 trek+0.063
Neg logit contributions
 entire-0.041
 too-0.035
ifying-0.035
flower-0.034
m-0.030
Token Lily
Token ID20037
Feature activation+1.68e-03

Pos logit contributions
 entire+0.003
 still+0.002
 too+0.002
 prett+0.002
 so+0.002
Neg logit contributions
ogs-0.006
drivers-0.006
dies-0.006
ests-0.006
 villains-0.006
Token learned
Token ID4499
Feature activation+1.11e-03

Pos logit contributions
 entire+0.002
ifying+0.002
flower+0.002
 too+0.001
 nearest+0.001
Neg logit contributions
ogs-0.003
dies-0.003
drivers-0.003
 villains-0.003
 weapons-0.003
Token to
Token ID284
Feature activation-2.68e-02

Pos logit contributions
 entire+0.002
ifying+0.001
 too+0.001
 exact+0.001
 front+0.001
Neg logit contributions
drivers-0.002
dies-0.002
ogs-0.002
 villains-0.002
ug-0.002
Token so
Token ID523
Feature activation-8.53e-03

Pos logit contributions
 villains+0.007
dies+0.007
ogs+0.007
drivers+0.007
con+0.007
Neg logit contributions
 entire-0.007
ifying-0.006
 too-0.006
 exact-0.005
chie-0.005
Token hard
Token ID1327
Feature activation-2.36e-02

Pos logit contributions
drivers+0.017
ests+0.017
ogs+0.016
dies+0.016
 Jah+0.015
Neg logit contributions
 entire-0.010
 too-0.009
 still-0.009
ifying-0.008
 whole-0.008
Token on
Token ID319
Feature activation-2.03e-03

Pos logit contributions
 villains+0.043
ests+0.041
dies+0.041
ogs+0.041
drivers+0.040
Neg logit contributions
 entire-0.028
 too-0.025
 so-0.022
ifying-0.021
".-0.020
Token the
Token ID262
Feature activation-1.66e-02

Pos logit contributions
ogs+0.004
drivers+0.004
dies+0.004
 villains+0.003
oot+0.003
Neg logit contributions
 entire-0.003
ifying-0.002
 front-0.002
 too-0.002
 exact-0.002
Token ash
Token ID12530
Feature activation-4.12e-02

Pos logit contributions
drivers+0.028
ogs+0.028
dies+0.027
 villains+0.027
ves+0.026
Neg logit contributions
 entire-0.024
ifying-0.021
flower-0.019
 exact-0.019
chie-0.019
Tokentr
Token ID2213
Feature activation-8.67e-02

Pos logit contributions
ogs+0.063
 villains+0.063
drivers+0.063
dies+0.062
ves+0.058
Neg logit contributions
 entire-0.064
ifying-0.059
flower-0.055
 highest-0.051
chie-0.050
Tokenay
Token ID323
Feature activation-2.96e-02

Pos logit contributions
drivers+0.123
dies+0.122
 villains+0.122
ogs+0.120
 weapons+0.112
Neg logit contributions
 entire-0.141
ifying-0.137
chie-0.120
flower-0.119
 exact-0.115
Token,
Token ID11
Feature activation+2.65e-04

Pos logit contributions
 villains+0.052
dies+0.051
drivers+0.049
ogs+0.048
renches+0.047
Neg logit contributions
 entire-0.039
ifying-0.035
flower-0.034
 too-0.033
".-0.030
Token and
Token ID290
Feature activation+9.49e-04

Pos logit contributions
 entire+0.000
ifying+0.000
 exact+0.000
flower+0.000
 too+0.000
Neg logit contributions
drivers-0.000
ogs-0.000
dies-0.000
 villains-0.000
ves-0.000
Token now
Token ID783
Feature activation-7.62e-03

Pos logit contributions
 entire+0.001
ifying+0.001
 too+0.001
 so+0.001
 beginning+0.001
Neg logit contributions
ogs-0.002
drivers-0.002
dies-0.002
ests-0.002
uddy-0.002
Token it
Token ID340
Feature activation+9.50e-03

Pos logit contributions
drivers+0.013
ogs+0.012
dies+0.011
ests+0.011
renches+0.011
Neg logit contributions
 entire-0.011
ifying-0.010
 too-0.010
 my-0.009
 beginning-0.009
Token They
Token ID1119
Feature activation+5.73e-03

Pos logit contributions
 entire+0.000
ifying+0.000
chie+0.000
flower+0.000
 exact+0.000
Neg logit contributions
drivers-0.000
ogs-0.000
dies-0.000
 villains-0.000
 Spe-0.000
Token took
Token ID1718
Feature activation+6.79e-03

Pos logit contributions
 entire+0.008
ifying+0.008
chie+0.007
flower+0.007
 exact+0.006
Neg logit contributions
drivers-0.010
ogs-0.009
dies-0.009
 villains-0.009
ves-0.009
Token off
Token ID572
Feature activation+8.24e-03

Pos logit contributions
 entire+0.008
ifying+0.008
 front+0.007
 too+0.007
 my+0.007
Neg logit contributions
drivers-0.013
 villains-0.013
dies-0.012
ogs-0.012
ves-0.012
Token the
Token ID262
Feature activation-2.25e-02

Pos logit contributions
 entire+0.011
ifying+0.010
 front+0.010
 too+0.009
flower+0.009
Neg logit contributions
drivers-0.015
dies-0.015
 villains-0.015
ogs-0.014
con-0.014
Token ash
Token ID12530
Feature activation-4.73e-02

Pos logit contributions
drivers+0.041
ogs+0.040
dies+0.040
 villains+0.039
ves+0.037
Neg logit contributions
 entire-0.029
ifying-0.026
flower-0.023
 exact-0.023
chie-0.022
Tokentr
Token ID2213
Feature activation-8.39e-02

Pos logit contributions
drivers+0.064
ogs+0.062
dies+0.061
 villains+0.060
ves+0.056
Neg logit contributions
 entire-0.081
ifying-0.077
flower-0.070
chie-0.068
 exact-0.067
Tokenay
Token ID323
Feature activation-2.22e-02

Pos logit contributions
drivers+0.118
dies+0.118
 villains+0.117
ogs+0.116
 weapons+0.107
Neg logit contributions
 entire-0.136
ifying-0.132
flower-0.115
chie-0.115
 exact-0.111
Token and
Token ID290
Feature activation+2.41e-03

Pos logit contributions
 villains+0.046
dies+0.045
drivers+0.043
ogs+0.042
 trek+0.041
Neg logit contributions
 entire-0.022
ifying-0.020
 too-0.019
flower-0.018
 front-0.017
Token the
Token ID262
Feature activation+1.49e-04

Pos logit contributions
 entire+0.002
ifying+0.002
 too+0.002
 front+0.002
flower+0.002
Neg logit contributions
ogs-0.005
drivers-0.005
dies-0.005
 villains-0.005
oot-0.005
Token book
Token ID1492
Feature activation-8.03e-03

Pos logit contributions
 entire+0.000
ifying+0.000
flower+0.000
 exact+0.000
 front+0.000
Neg logit contributions
ogs-0.000
drivers-0.000
dies-0.000
ug-0.000
 villains-0.000
Token.
Token ID13
Feature activation-2.26e-03

Pos logit contributions
ogs+0.017
dies+0.016
drivers+0.016
 villains+0.016
ug+0.015
Neg logit contributions
 entire-0.008
ifying-0.007
 too-0.007
 front-0.006
flower-0.006
Token.
Token ID13
Feature activation+1.12e-03

Pos logit contributions
 entire+0.023
ifying+0.021
flower+0.018
 exact+0.017
 too+0.016
Neg logit contributions
drivers-0.033
ogs-0.032
dies-0.032
 villains-0.031
ves-0.031
Token Jack
Token ID3619
Feature activation-3.94e-03

Pos logit contributions
 entire+0.002
ifying+0.002
flower+0.002
chie+0.002
 exact+0.002
Neg logit contributions
drivers-0.002
ogs-0.002
dies-0.002
 villains-0.002
 weapons-0.001
Token put
Token ID1234
Feature activation-8.38e-03

Pos logit contributions
drivers+0.007
ogs+0.006
dies+0.006
 villains+0.006
ves+0.006
Neg logit contributions
 entire-0.006
ifying-0.005
chie-0.005
flower-0.005
 exact-0.004
Token the
Token ID262
Feature activation-1.50e-02

Pos logit contributions
drivers+0.014
 villains+0.013
ogs+0.013
dies+0.013
ves+0.012
Neg logit contributions
 entire-0.013
ifying-0.012
 exact-0.010
flower-0.010
chie-0.010
Token ash
Token ID12530
Feature activation-3.75e-02

Pos logit contributions
drivers+0.021
 weapons+0.019
dies+0.019
con+0.018
 teamwork+0.018
Neg logit contributions
ifying-0.025
itates-0.022
 entire-0.022
ksh-0.021
 ".-0.021
Tokentr
Token ID2213
Feature activation-8.30e-02

Pos logit contributions
drivers+0.055
ogs+0.054
dies+0.053
 villains+0.053
ves+0.049
Neg logit contributions
 entire-0.061
ifying-0.058
flower-0.051
chie-0.050
 exact-0.049
Tokenay
Token ID323
Feature activation-2.57e-02

Pos logit contributions
drivers+0.120
 villains+0.119
dies+0.119
ogs+0.117
ves+0.109
Neg logit contributions
 entire-0.134
ifying-0.131
chie-0.114
flower-0.112
 exact-0.108
Token on
Token ID319
Feature activation+1.49e-02

Pos logit contributions
dies+0.050
 villains+0.049
drivers+0.048
ogs+0.048
renches+0.045
Neg logit contributions
 entire-0.031
ifying-0.029
chie-0.024
flower-0.023
 too-0.022
Token the
Token ID262
Feature activation-1.35e-03

Pos logit contributions
 entire+0.022
ifying+0.019
 front+0.018
 too+0.017
 exact+0.017
Neg logit contributions
drivers-0.028
ogs-0.027
 villains-0.026
dies-0.025
unn-0.025
Token kitchen
Token ID9592
Feature activation+5.91e-03

Pos logit contributions
drivers+0.002
ogs+0.002
dies+0.002
 villains+0.002
 weapons+0.002
Neg logit contributions
 entire-0.002
ifying-0.002
flower-0.001
chie-0.001
 exact-0.001
Token table
Token ID3084
Feature activation-1.93e-04

Pos logit contributions
 entire+0.008
ifying+0.008
 exact+0.007
flower+0.007
 front+0.007
Neg logit contributions
drivers-0.010
ogs-0.010
 villains-0.010
dies-0.010
ug-0.009
Token and
Token ID290
Feature activation-1.56e-02

Pos logit contributions
drivers+0.005
 villains+0.005
ogs+0.005
dies+0.005
ug+0.005
Neg logit contributions
 entire-0.004
ifying-0.004
flower-0.004
 exact-0.003
 too-0.003
Token using
Token ID1262
Feature activation+5.63e-03

Pos logit contributions
drivers+0.026
ogs+0.025
dies+0.025
 villains+0.024
 weapons+0.023
Neg logit contributions
 entire-0.022
ifying-0.021
flower-0.018
chie-0.018
 exact-0.018
Token the
Token ID262
Feature activation-2.11e-02

Pos logit contributions
 entire+0.008
ifying+0.008
 exact+0.007
chie+0.007
flower+0.007
Neg logit contributions
drivers-0.009
ogs-0.009
dies-0.009
 villains-0.009
ug-0.008
Token pretty
Token ID2495
Feature activation-4.87e-03

Pos logit contributions
drivers+0.033
dies+0.030
 weapons+0.029
ogs+0.029
con+0.029
Neg logit contributions
ifying-0.033
 entire-0.032
chie-0.028
itates-0.028
flower-0.027
Token ash
Token ID12530
Feature activation-4.52e-02

Pos logit contributions
drivers+0.007
dies+0.007
ogs+0.007
 weapons+0.007
con+0.007
Neg logit contributions
 entire-0.008
ifying-0.008
itates-0.007
chie-0.007
 exact-0.006
Tokentr
Token ID2213
Feature activation-8.28e-02

Pos logit contributions
drivers+0.063
ogs+0.063
dies+0.062
 villains+0.061
 weapons+0.056
Neg logit contributions
 entire-0.075
ifying-0.071
flower-0.065
chie-0.062
 exact-0.060
Tokenay
Token ID323
Feature activation-3.57e-02

Pos logit contributions
drivers+0.111
ogs+0.105
dies+0.104
 villains+0.102
 weapons+0.096
Neg logit contributions
 entire-0.145
ifying-0.138
flower-0.122
chie-0.122
 exact-0.121
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
 villains+0.052
dies+0.051
ogs+0.049
drivers+0.047
 trek+0.045
Neg logit contributions
 entire-0.057
ifying-0.056
flower-0.052
 too-0.049
chie-0.048
Token Lily
Token ID20037
Feature activation-1.20e-02

Pos logit contributions
drivers+0.027
ogs+0.026
dies+0.025
 villains+0.025
 weapons+0.024
Neg logit contributions
 entire-0.027
ifying-0.026
flower-0.022
chie-0.022
 exact-0.022
Token remembered
Token ID12086
Feature activation+3.81e-03

Pos logit contributions
drivers+0.015
 villains+0.015
ves+0.015
dies+0.014
 trek+0.014
Neg logit contributions
ifying-0.022
 entire-0.021
chie-0.019
 exact-0.019
 highest-0.018
Token what
Token ID644
Feature activation-8.00e-03

Pos logit contributions
 entire+0.004
ifying+0.004
 too+0.004
 my+0.004
 front+0.004
Neg logit contributions
drivers-0.007
dies-0.007
 villains-0.007
ogs-0.007
ests-0.007
Token decided
Token ID3066
Feature activation+2.02e-03

Pos logit contributions
drivers+0.033
dies+0.031
 villains+0.031
othe+0.030
 weapons+0.030
Neg logit contributions
 entire-0.019
 too-0.015
flower-0.015
 still-0.014
 absolutely-0.014
Token to
Token ID284
Feature activation-4.91e-03

Pos logit contributions
 entire+0.002
 too+0.002
 so+0.002
 my+0.002
 front+0.002
Neg logit contributions
ests-0.004
drivers-0.004
dies-0.004
renches-0.004
ogs-0.004
Token put
Token ID1234
Feature activation-7.15e-03

Pos logit contributions
drivers+0.009
dies+0.008
ogs+0.008
con+0.008
 weapons+0.008
Neg logit contributions
 entire-0.007
ifying-0.006
 front-0.005
 exact-0.005
flower-0.005
Token the
Token ID262
Feature activation-8.75e-03

Pos logit contributions
 villains+0.013
drivers+0.012
dies+0.012
ogs+0.012
ves+0.011
Neg logit contributions
 entire-0.010
ifying-0.009
 too-0.008
 front-0.008
 exact-0.008
Token ash
Token ID12530
Feature activation-3.61e-02

Pos logit contributions
drivers+0.013
 weapons+0.011
dies+0.011
con+0.011
 teamwork+0.011
Neg logit contributions
ifying-0.015
 entire-0.013
itates-0.012
 ".-0.012
ksh-0.012
Tokentr
Token ID2213
Feature activation-8.23e-02

Pos logit contributions
ogs+0.055
dies+0.054
drivers+0.054
 villains+0.054
ves+0.050
Neg logit contributions
 entire-0.057
ifying-0.053
flower-0.049
 highest-0.045
chie-0.045
Tokenay
Token ID323
Feature activation-2.52e-02

Pos logit contributions
drivers+0.114
dies+0.112
ogs+0.111
 villains+0.111
 weapons+0.102
Neg logit contributions
 entire-0.137
ifying-0.132
chie-0.116
flower-0.116
 exact-0.112
Token down
Token ID866
Feature activation+3.87e-03

Pos logit contributions
dies+0.046
 villains+0.045
ogs+0.044
drivers+0.043
renches+0.041
Neg logit contributions
 entire-0.032
ifying-0.030
chie-0.026
flower-0.026
 too-0.025
Token and
Token ID290
Feature activation+7.03e-03

Pos logit contributions
 entire+0.005
ifying+0.005
 too+0.004
 front+0.004
 exact+0.004
Neg logit contributions
drivers-0.007
dies-0.007
 villains-0.007
ogs-0.007
ves-0.007
Token go
Token ID467
Feature activation-9.34e-03

Pos logit contributions
 entire+0.009
ifying+0.008
 too+0.007
 nearest+0.007
 my+0.007
Neg logit contributions
drivers-0.013
ogs-0.012
dies-0.012
ests-0.012
ug-0.012
Token outside
Token ID2354
Feature activation-4.23e-03

Pos logit contributions
drivers+0.017
ogs+0.016
dies+0.016
 villains+0.016
 weapons+0.015
Neg logit contributions
 entire-0.012
ifying-0.011
chie-0.009
flower-0.009
 exact-0.009
Token on
Token ID319
Feature activation-5.96e-05

Pos logit contributions
drivers+0.011
ogs+0.011
dies+0.011
 villains+0.010
 weapons+0.010
Neg logit contributions
 entire-0.013
ifying-0.012
chie-0.011
flower-0.011
 exact-0.011
Token blowing
Token ID19280
Feature activation-1.18e-02

Pos logit contributions
drivers+0.000
ogs+0.000
dies+0.000
 villains+0.000
 weapons+0.000
Neg logit contributions
 entire-0.000
ifying-0.000
chie-0.000
flower-0.000
 exact-0.000
Token until
Token ID1566
Feature activation-4.80e-03

Pos logit contributions
 villains+0.024
drivers+0.023
dies+0.023
ogs+0.022
uddy+0.022
Neg logit contributions
 entire-0.015
 too-0.013
 so-0.011
ifying-0.011
 front-0.011
Token the
Token ID262
Feature activation-1.35e-02

Pos logit contributions
drivers+0.010
 villains+0.009
dies+0.009
ests+0.009
ogs+0.009
Neg logit contributions
 entire-0.006
ifying-0.006
 front-0.005
 beginning-0.005
 too-0.005
Token ash
Token ID12530
Feature activation-3.19e-02

Pos logit contributions
drivers+0.022
dies+0.021
ogs+0.021
 villains+0.021
 weapons+0.020
Neg logit contributions
 entire-0.019
ifying-0.018
chie-0.016
flower-0.015
 exact-0.015
Tokentr
Token ID2213
Feature activation-8.11e-02

Pos logit contributions
drivers+0.042
ogs+0.040
dies+0.039
 villains+0.038
 weapons+0.036
Neg logit contributions
 entire-0.057
ifying-0.054
flower-0.048
chie-0.048
 exact-0.047
Tokenay
Token ID323
Feature activation-1.69e-02

Pos logit contributions
drivers+0.112
dies+0.108
ogs+0.108
 villains+0.106
 weapons+0.100
Neg logit contributions
 entire-0.138
ifying-0.132
flower-0.116
chie-0.116
 exact-0.113
Token was
Token ID373
Feature activation-1.93e-03

Pos logit contributions
dies+0.030
 villains+0.030
drivers+0.030
ogs+0.029
 weapons+0.028
Neg logit contributions
 entire-0.022
ifying-0.020
flower-0.019
 too-0.017
chie-0.016
Token so
Token ID523
Feature activation+8.35e-03

Pos logit contributions
drivers+0.003
othe+0.003
 weapons+0.003
 Mitt+0.003
 chores+0.003
Neg logit contributions
ifying-0.003
 entire-0.003
chie-0.002
itates-0.002
ksh-0.002
Token dry
Token ID5894
Feature activation+5.44e-04

Pos logit contributions
ifying+0.012
 entire+0.012
flower+0.012
chie+0.011
ksh+0.011
Neg logit contributions
othe-0.012
drivers-0.011
ogs-0.011
ves-0.011
 teamwork-0.011
Token that
Token ID326
Feature activation+2.60e-03

Pos logit contributions
 entire+0.001
ifying+0.001
flower+0.001
 exact+0.001
chie+0.001
Neg logit contributions
drivers-0.001
ogs-0.001
 villains-0.001
dies-0.001
 weapons-0.001
Token can
Token ID460
Feature activation-2.28e-02

Pos logit contributions
drivers+0.008
ogs+0.007
 villains+0.007
dies+0.007
 weapons+0.007
Neg logit contributions
 entire-0.008
ifying-0.007
ksh-0.006
 exact-0.006
flower-0.006
Token use
Token ID779
Feature activation-1.64e-02

Pos logit contributions
drivers+0.030
ogs+0.028
dies+0.028
 villains+0.027
 weapons+0.026
Neg logit contributions
 entire-0.041
ifying-0.039
flower-0.035
chie-0.035
 exact-0.035
Token this
Token ID428
Feature activation-2.91e-02

Pos logit contributions
ogs+0.030
drivers+0.030
dies+0.030
 villains+0.029
ves+0.028
Neg logit contributions
 entire-0.021
ifying-0.019
 front-0.016
chie-0.016
 exact-0.016
Token broken
Token ID5445
Feature activation-1.94e-02

Pos logit contributions
drivers+0.051
ogs+0.049
dies+0.049
 villains+0.048
ves+0.046
Neg logit contributions
 entire-0.040
ifying-0.037
flower-0.031
chie-0.031
 exact-0.031
Token ash
Token ID12530
Feature activation-3.97e-02

Pos logit contributions
drivers+0.031
dies+0.028
ogs+0.028
 weapons+0.028
 villains+0.027
Neg logit contributions
 entire-0.030
ifying-0.029
chie-0.024
itates-0.024
flower-0.024
Tokentr
Token ID2213
Feature activation-8.11e-02

Pos logit contributions
ogs+0.057
drivers+0.057
dies+0.056
 villains+0.056
ves+0.052
Neg logit contributions
 entire-0.064
ifying-0.060
flower-0.054
chie-0.052
 highest-0.051
Tokenay
Token ID323
Feature activation-3.04e-02

Pos logit contributions
drivers+0.110
ogs+0.106
dies+0.105
 villains+0.104
 weapons+0.097
Neg logit contributions
 entire-0.139
ifying-0.133
chie-0.117
flower-0.117
 exact-0.115
Token as
Token ID355
Feature activation+1.59e-03

Pos logit contributions
dies+0.054
drivers+0.053
 villains+0.053
ogs+0.053
 trek+0.050
Neg logit contributions
 entire-0.039
ifying-0.036
flower-0.031
chie-0.030
 too-0.029
Token a
Token ID257
Feature activation-1.89e-02

Pos logit contributions
 entire+0.002
ifying+0.002
 too+0.002
 my+0.002
 mine+0.001
Neg logit contributions
drivers-0.003
ogs-0.003
dies-0.003
ogged-0.003
ests-0.003
Token puzzle
Token ID15027
Feature activation-1.90e-02

Pos logit contributions
drivers+0.032
ogs+0.031
dies+0.030
 villains+0.029
ves+0.029
Neg logit contributions
 entire-0.027
ifying-0.025
chie-0.022
flower-0.022
 exact-0.022
Token!"
Token ID2474
Feature activation-1.88e-02

Pos logit contributions
drivers+0.038
ogs+0.037
dies+0.037
 villains+0.036
ves+0.035
Neg logit contributions
 entire-0.021
ifying-0.019
chie-0.015
flower-0.015
 too-0.015
Token his
Token ID465
Feature activation-8.65e-03

Pos logit contributions
 entire+0.014
ifying+0.014
flower+0.012
 front+0.012
 exact+0.012
Neg logit contributions
drivers-0.009
 villains-0.008
dies-0.008
ogs-0.008
con-0.007
Token mom
Token ID1995
Feature activation+1.47e-03

Pos logit contributions
drivers+0.014
ogs+0.014
dies+0.014
 villains+0.013
ug+0.013
Neg logit contributions
 entire-0.014
ifying-0.011
 front-0.011
flower-0.011
 whole-0.010
Token put
Token ID1234
Feature activation-6.58e-03

Pos logit contributions
 entire+0.002
flower+0.001
ifying+0.001
 too+0.001
m+0.001
Neg logit contributions
ogs-0.003
 villains-0.003
dies-0.003
drivers-0.003
 weapons-0.003
Token the
Token ID262
Feature activation-1.46e-02

Pos logit contributions
 villains+0.012
drivers+0.012
ogs+0.012
dies+0.011
ves+0.011
Neg logit contributions
 entire-0.009
ifying-0.008
 too-0.007
 front-0.007
flower-0.007
Token ash
Token ID12530
Feature activation-3.08e-02

Pos logit contributions
drivers+0.024
dies+0.022
ogs+0.021
 weapons+0.021
con+0.021
Neg logit contributions
ifying-0.021
 entire-0.020
chie-0.018
itates-0.017
flower-0.017
Tokentr
Token ID2213
Feature activation-8.04e-02

Pos logit contributions
drivers+0.043
ogs+0.042
dies+0.041
 villains+0.041
ves+0.038
Neg logit contributions
 entire-0.053
ifying-0.050
flower-0.045
chie-0.043
 exact-0.043
Tokenay
Token ID323
Feature activation-2.27e-02

Pos logit contributions
drivers+0.115
dies+0.115
 villains+0.114
ogs+0.113
 weapons+0.105
Neg logit contributions
 entire-0.131
ifying-0.127
chie-0.110
flower-0.109
 exact-0.105
Token away
Token ID1497
Feature activation+5.33e-03

Pos logit contributions
dies+0.043
 villains+0.042
ogs+0.041
drivers+0.041
renches+0.038
Neg logit contributions
 entire-0.028
ifying-0.027
chie-0.022
flower-0.022
 too-0.021
Token,
Token ID11
Feature activation-5.55e-04

Pos logit contributions
 entire+0.007
ifying+0.007
flower+0.006
 exact+0.006
 too+0.006
Neg logit contributions
drivers-0.010
 villains-0.009
ogs-0.009
dies-0.009
con-0.009
Token but
Token ID475
Feature activation-1.76e-03

Pos logit contributions
drivers+0.001
ogs+0.001
dies+0.001
 villains+0.001
ves+0.001
Neg logit contributions
 entire-0.001
ifying-0.001
flower-0.001
 exact-0.001
chie-0.001
Token Little
Token ID7703
Feature activation-7.38e-03

Pos logit contributions
drivers+0.003
ogs+0.003
dies+0.003
 villains+0.003
ests+0.003
Neg logit contributions
 entire-0.003
ifying-0.003
flower-0.002
 too-0.002
 exact-0.002
Token carefully
Token ID7773
Feature activation+8.50e-04

Pos logit contributions
drivers+0.014
ogs+0.014
dies+0.014
 villains+0.013
othe+0.013
Neg logit contributions
 entire-0.009
ifying-0.007
flower-0.007
 too-0.007
 exact-0.006
Token picked
Token ID6497
Feature activation+7.83e-03

Pos logit contributions
 entire+0.001
ifying+0.001
 front+0.001
 exact+0.001
 nearest+0.001
Neg logit contributions
 villains-0.002
dies-0.002
ests-0.002
ogs-0.002
drivers-0.002
Token up
Token ID510
Feature activation-3.59e-03

Pos logit contributions
 entire+0.008
ifying+0.006
 exact+0.006
 too+0.006
 front+0.006
Neg logit contributions
 villains-0.017
drivers-0.016
ogs-0.016
dies-0.015
uddy-0.015
Token the
Token ID262
Feature activation-2.09e-03

Pos logit contributions
drivers+0.008
 villains+0.007
ogs+0.007
dies+0.007
ug+0.007
Neg logit contributions
 entire-0.004
ifying-0.003
 too-0.003
 front-0.003
 exact-0.003
Token ash
Token ID12530
Feature activation-3.09e-02

Pos logit contributions
drivers+0.004
ogs+0.003
dies+0.003
 villains+0.003
ug+0.003
Neg logit contributions
 entire-0.003
ifying-0.003
 exact-0.003
flower-0.002
chie-0.002
Tokentr
Token ID2213
Feature activation-8.00e-02

Pos logit contributions
ogs+0.050
drivers+0.049
 villains+0.049
dies+0.048
ves+0.045
Neg logit contributions
 entire-0.046
ifying-0.043
flower-0.040
 exact-0.036
 highest-0.036
Tokenay
Token ID323
Feature activation-3.64e-02

Pos logit contributions
drivers+0.111
dies+0.107
ogs+0.107
 villains+0.106
 weapons+0.098
Neg logit contributions
 entire-0.135
ifying-0.130
flower-0.114
chie-0.114
 exact-0.111
Token and
Token ID290
Feature activation-1.69e-03

Pos logit contributions
 villains+0.070
dies+0.066
ogs+0.066
 recess+0.061
ogged+0.060
Neg logit contributions
 entire-0.044
ifying-0.042
 too-0.039
flower-0.039
m-0.035
Token put
Token ID1234
Feature activation-1.41e-02

Pos logit contributions
ogs+0.004
 villains+0.003
drivers+0.003
dies+0.003
ests+0.003
Neg logit contributions
 entire-0.002
ifying-0.002
flower-0.001
 nearest-0.001
 too-0.001
Token it
Token ID340
Feature activation-3.21e-03

Pos logit contributions
ogs+0.026
 villains+0.026
drivers+0.026
dies+0.025
ug+0.023
Neg logit contributions
 entire-0.018
ifying-0.017
 exact-0.014
 front-0.014
 too-0.014
Token in
Token ID287
Feature activation+9.14e-03

Pos logit contributions
drivers+0.006
dies+0.006
ogs+0.006
 villains+0.006
ves+0.006
Neg logit contributions
 entire-0.004
ifying-0.003
flower-0.003
 exact-0.003
chie-0.003
Token the
Token ID262
Feature activation-1.58e-02

Pos logit contributions
 villains+0.024
ogs+0.023
dies+0.023
drivers+0.022
old+0.021
Neg logit contributions
 entire-0.016
ifying-0.015
 too-0.013
 exact-0.013
 front-0.012
Token cloth
Token ID16270
Feature activation-5.32e-03

Pos logit contributions
drivers+0.023
dies+0.021
 weapons+0.020
con+0.020
ogs+0.019
Neg logit contributions
ifying-0.026
 entire-0.024
itates-0.022
 ".-0.022
chie-0.022
Token onto
Token ID4291
Feature activation+6.02e-03

Pos logit contributions
drivers+0.009
 villains+0.009
ogs+0.009
dies+0.009
 weapons+0.009
Neg logit contributions
 entire-0.007
ifying-0.006
flower-0.006
 exact-0.005
chie-0.005
Token the
Token ID262
Feature activation-1.20e-02

Pos logit contributions
 entire+0.006
 front+0.005
ifying+0.005
 too+0.005
 so+0.005
Neg logit contributions
ogs-0.014
 villains-0.013
unn-0.013
ogged-0.013
drivers-0.013
Token ash
Token ID12530
Feature activation-4.60e-02

Pos logit contributions
drivers+0.018
dies+0.017
con+0.016
 weapons+0.016
ogs+0.016
Neg logit contributions
ifying-0.018
 entire-0.017
chie-0.016
itates-0.016
 ".-0.015
Tokentr
Token ID2213
Feature activation-7.99e-02

Pos logit contributions
ogs+0.066
drivers+0.065
 villains+0.065
dies+0.064
 weapons+0.060
Neg logit contributions
 entire-0.075
ifying-0.071
flower-0.064
 exact-0.061
chie-0.061
Tokenay
Token ID323
Feature activation-2.65e-02

Pos logit contributions
drivers+0.109
ogs+0.105
dies+0.105
 villains+0.103
 weapons+0.097
Neg logit contributions
 entire-0.136
ifying-0.131
flower-0.115
chie-0.114
 exact-0.113
Token.
Token ID13
Feature activation+3.76e-03

Pos logit contributions
 villains+0.047
dies+0.046
ogs+0.045
 weapons+0.043
 err+0.043
Neg logit contributions
 entire-0.034
ifying-0.032
flower-0.029
 too-0.029
 absolutely-0.027
Token He
Token ID679
Feature activation-2.05e-02

Pos logit contributions
 entire+0.004
ifying+0.004
 exact+0.004
 front+0.003
 highest+0.003
Neg logit contributions
ogs-0.007
 villains-0.007
drivers-0.007
dies-0.007
 weapons-0.007
Token worked
Token ID3111
Feature activation-7.72e-03

Pos logit contributions
ogs+0.040
drivers+0.039
dies+0.038
 villains+0.037
 weapons+0.035
Neg logit contributions
 entire-0.025
ifying-0.021
flower-0.019
 exact-0.019
 nearest-0.018
Token hard
Token ID1327
Feature activation-1.96e-02

Pos logit contributions
 villains+0.013
ogs+0.012
dies+0.012
drivers+0.012
con+0.011
Neg logit contributions
 entire-0.012
ifying-0.010
 too-0.010
 exact-0.010
chie-0.009
Token smoking
Token ID9216
Feature activation-1.74e-03

Pos logit contributions
drivers+0.014
dies+0.014
ogs+0.013
 villains+0.013
ests+0.013
Neg logit contributions
 entire-0.011
ifying-0.010
flower-0.009
 too-0.009
 front-0.009
Token and
Token ID290
Feature activation-1.33e-02

Pos logit contributions
drivers+0.003
 villains+0.003
dies+0.003
ogs+0.003
con+0.003
Neg logit contributions
 entire-0.002
ifying-0.002
flower-0.002
 too-0.002
 exact-0.002
Token change
Token ID1487
Feature activation-1.19e-03

Pos logit contributions
drivers+0.023
ogs+0.022
dies+0.022
 villains+0.021
ves+0.021
Neg logit contributions
 entire-0.018
ifying-0.017
flower-0.015
 exact-0.015
chie-0.014
Token the
Token ID262
Feature activation-1.72e-02

Pos logit contributions
drivers+0.002
dies+0.002
ogs+0.002
 villains+0.002
con+0.002
Neg logit contributions
 entire-0.002
ifying-0.002
flower-0.002
chie-0.002
 exact-0.002
Token ash
Token ID12530
Feature activation-3.46e-02

Pos logit contributions
drivers+0.027
dies+0.025
 weapons+0.025
ogs+0.024
con+0.024
Neg logit contributions
ifying-0.026
 entire-0.025
chie-0.022
itates-0.021
flower-0.021
Tokentr
Token ID2213
Feature activation-7.97e-02

Pos logit contributions
drivers+0.047
ogs+0.046
dies+0.045
 villains+0.045
 weapons+0.042
Neg logit contributions
 entire-0.060
ifying-0.056
flower-0.051
chie-0.049
 exact-0.049
Tokenay
Token ID323
Feature activation-2.87e-02

Pos logit contributions
drivers+0.108
ogs+0.104
dies+0.103
 villains+0.102
 weapons+0.095
Neg logit contributions
 entire-0.137
ifying-0.131
chie-0.116
flower-0.116
 exact-0.114
Token for
Token ID329
Feature activation-1.23e-02

Pos logit contributions
dies+0.053
 villains+0.052
ogs+0.051
drivers+0.050
 trek+0.049
Neg logit contributions
 entire-0.036
ifying-0.033
flower-0.029
 too-0.029
chie-0.027
Token a
Token ID257
Feature activation-7.02e-03

Pos logit contributions
drivers+0.022
dies+0.022
ogs+0.022
ests+0.021
 villains+0.021
Neg logit contributions
 entire-0.017
ifying-0.014
 too-0.014
 front-0.014
 my-0.013
Token toy
Token ID13373
Feature activation-2.56e-02

Pos logit contributions
drivers+0.012
ogs+0.011
dies+0.011
 villains+0.011
ug+0.010
Neg logit contributions
 entire-0.010
ifying-0.010
chie-0.008
flower-0.008
 exact-0.008
Token instead
Token ID2427
Feature activation+1.73e-02

Pos logit contributions
ogs+0.049
drivers+0.048
dies+0.047
 villains+0.046
 trek+0.045
Neg logit contributions
 entire-0.029
ifying-0.027
flower-0.024
 too-0.024
 front-0.022
Token happy
Token ID3772
Feature activation+1.57e-02

Pos logit contributions
drivers+0.010
dies+0.009
ogs+0.009
 villains+0.009
 weapons+0.009
Neg logit contributions
 entire-0.007
ifying-0.006
flower-0.005
 exact-0.005
chie-0.005
Token with
Token ID351
Feature activation-1.02e-02

Pos logit contributions
 entire+0.022
ifying+0.020
flower+0.017
 exact+0.017
chie+0.017
Neg logit contributions
drivers-0.027
ogs-0.026
dies-0.026
 villains-0.025
ves-0.024
Token her
Token ID607
Feature activation-2.15e-03

Pos logit contributions
drivers+0.018
dies+0.018
ogs+0.017
 villains+0.017
con+0.016
Neg logit contributions
 entire-0.014
ifying-0.013
 too-0.012
 front-0.012
 exact-0.012
Token warm
Token ID5814
Feature activation+2.76e-03

Pos logit contributions
drivers+0.003
ogs+0.003
dies+0.003
 villains+0.003
 weapons+0.003
Neg logit contributions
 entire-0.003
ifying-0.003
chie-0.003
flower-0.003
 exact-0.003
Token ash
Token ID12530
Feature activation-3.39e-02

Pos logit contributions
 entire+0.004
ifying+0.004
chie+0.003
flower+0.003
 exact+0.003
Neg logit contributions
drivers-0.005
ogs-0.004
dies-0.004
 villains-0.004
 weapons-0.004
Tokentr
Token ID2213
Feature activation-7.97e-02

Pos logit contributions
drivers+0.044
ogs+0.041
dies+0.041
 villains+0.040
 weapons+0.038
Neg logit contributions
 entire-0.061
ifying-0.058
chie-0.052
flower-0.052
 exact-0.051
Tokenay
Token ID323
Feature activation-2.12e-02

Pos logit contributions
drivers+0.107
ogs+0.102
dies+0.102
 villains+0.099
 weapons+0.094
Neg logit contributions
 entire-0.139
ifying-0.132
flower-0.117
chie-0.117
 exact-0.115
Token.
Token ID13
Feature activation-6.37e-03

Pos logit contributions
drivers+0.039
dies+0.039
ogs+0.038
 villains+0.038
 weapons+0.036
Neg logit contributions
 entire-0.026
ifying-0.024
flower-0.021
chie-0.019
 exact-0.019
Token<|endoftext|>
Token ID50256
Feature activation+1.32e-02

Pos logit contributions
drivers+0.010
ogs+0.010
dies+0.010
 villains+0.010
 weapons+0.009
Neg logit contributions
 entire-0.009
ifying-0.009
flower-0.008
chie-0.008
 exact-0.008
TokenOnce
Token ID7454
Feature activation+5.89e-03

Pos logit contributions
 entire+0.018
ifying+0.017
flower+0.015
chie+0.015
 exact+0.014
Neg logit contributions
drivers-0.022
ogs-0.021
dies-0.021
 villains-0.021
 weapons-0.020
Token upon
Token ID2402
Feature activation+6.21e-03

Pos logit contributions
 entire+0.006
ifying+0.005
flower+0.004
chie+0.004
 exact+0.004
Neg logit contributions
drivers-0.012
dies-0.012
ogs-0.012
 villains-0.012
con-0.011
Token tried
Token ID3088
Feature activation-1.31e-02

Pos logit contributions
 entire+0.002
ifying+0.002
flower+0.002
 exact+0.002
chie+0.002
Neg logit contributions
drivers-0.003
ogs-0.003
dies-0.003
 villains-0.003
ests-0.003
Token to
Token ID284
Feature activation-2.38e-02

Pos logit contributions
dies+0.026
ug+0.026
ogs+0.026
 villains+0.025
drivers+0.025
Neg logit contributions
 entire-0.013
 too-0.013
 front-0.011
ifying-0.011
 beginning-0.011
Token fix
Token ID4259
Feature activation-1.46e-02

Pos logit contributions
drivers+0.040
ogs+0.039
dies+0.038
 villains+0.037
ves+0.036
Neg logit contributions
 entire-0.036
ifying-0.031
 exact-0.028
chie-0.028
flower-0.028
Token the
Token ID262
Feature activation-1.80e-02

Pos logit contributions
drivers+0.026
ogs+0.026
 villains+0.026
dies+0.026
ogged+0.024
Neg logit contributions
 entire-0.018
ifying-0.016
 exact-0.015
 front-0.014
 too-0.014
Token ash
Token ID12530
Feature activation-4.19e-02

Pos logit contributions
drivers+0.029
dies+0.027
 villains+0.026
 weapons+0.026
ogs+0.026
Neg logit contributions
ifying-0.026
 entire-0.025
chie-0.022
flower-0.022
itates-0.021
Tokentr
Token ID2213
Feature activation-7.94e-02

Pos logit contributions
drivers+0.058
ogs+0.057
dies+0.056
 villains+0.056
ves+0.052
Neg logit contributions
 entire-0.071
ifying-0.066
flower-0.060
chie-0.059
 exact-0.058
Tokenay
Token ID323
Feature activation-2.34e-02

Pos logit contributions
drivers+0.105
ogs+0.098
dies+0.097
 villains+0.094
 weapons+0.090
Neg logit contributions
 entire-0.141
ifying-0.134
flower-0.119
chie-0.119
 exact-0.118
Token,
Token ID11
Feature activation-4.38e-03

Pos logit contributions
 villains+0.042
dies+0.041
ogs+0.041
drivers+0.038
 weapons+0.038
Neg logit contributions
 entire-0.030
ifying-0.028
flower-0.026
 too-0.025
chie-0.024
Token but
Token ID475
Feature activation-5.65e-03

Pos logit contributions
drivers+0.008
ogs+0.008
dies+0.007
 villains+0.007
ves+0.007
Neg logit contributions
 entire-0.006
ifying-0.005
 exact-0.005
flower-0.005
chie-0.005
Token it
Token ID340
Feature activation-8.06e-03

Pos logit contributions
drivers+0.010
ogs+0.010
dies+0.010
 villains+0.009
con+0.009
Neg logit contributions
 entire-0.008
ifying-0.007
 too-0.006
 beginning-0.006
 exact-0.006
Token was
Token ID373
Feature activation-6.17e-03

Pos logit contributions
ogs+0.015
drivers+0.015
dies+0.014
 villains+0.014
con+0.013
Neg logit contributions
 entire-0.010
ifying-0.009
flower-0.008
chie-0.008
 exact-0.008
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
 entire+0.004
ifying+0.003
flower+0.003
 exact+0.003
 too+0.003
Neg logit contributions
drivers-0.006
 villains-0.005
dies-0.005
ogs-0.005
ves-0.005
Token She
Token ID1375
Feature activation-6.05e-03

Pos logit contributions
drivers+0.019
dies+0.019
 villains+0.019
ogs+0.018
ves+0.018
Neg logit contributions
 entire-0.014
ifying-0.013
 exact-0.011
flower-0.011
chie-0.011
Token put
Token ID1234
Feature activation-1.52e-02

Pos logit contributions
drivers+0.011
ogs+0.011
dies+0.011
 villains+0.011
con+0.010
Neg logit contributions
 entire-0.008
ifying-0.007
flower-0.006
chie-0.006
 exact-0.006
Token the
Token ID262
Feature activation-1.22e-02

Pos logit contributions
drivers+0.025
 villains+0.025
ogs+0.025
dies+0.024
ves+0.022
Neg logit contributions
 entire-0.022
ifying-0.021
flower-0.018
 exact-0.018
chie-0.018
Token ash
Token ID12530
Feature activation-3.24e-02

Pos logit contributions
drivers+0.017
dies+0.016
 weapons+0.016
con+0.015
ogs+0.015
Neg logit contributions
ifying-0.020
 entire-0.018
itates-0.017
ksh-0.017
 ".-0.017
Tokentr
Token ID2213
Feature activation-7.92e-02

Pos logit contributions
drivers+0.043
ogs+0.040
dies+0.040
 villains+0.039
 weapons+0.037
Neg logit contributions
 entire-0.057
ifying-0.055
chie-0.049
flower-0.048
 exact-0.048
Tokenay
Token ID323
Feature activation-2.32e-02

Pos logit contributions
drivers+0.110
dies+0.107
ogs+0.107
 villains+0.106
 weapons+0.098
Neg logit contributions
 entire-0.133
ifying-0.128
chie-0.112
flower-0.112
 exact-0.109
Token up
Token ID510
Feature activation+5.13e-03

Pos logit contributions
dies+0.041
 villains+0.041
ogs+0.039
drivers+0.038
 lear+0.036
Neg logit contributions
 entire-0.031
ifying-0.029
chie-0.026
flower-0.025
 too-0.025
Token close
Token ID1969
Feature activation+1.47e-02

Pos logit contributions
 entire+0.007
ifying+0.006
flower+0.005
 exact+0.005
chie+0.005
Neg logit contributions
drivers-0.009
dies-0.009
 villains-0.009
ogs-0.009
ves-0.008
Token to
Token ID284
Feature activation+7.86e-03

Pos logit contributions
 entire+0.021
ifying+0.020
flower+0.018
 exact+0.017
chie+0.017
Neg logit contributions
drivers-0.025
 villains-0.024
dies-0.024
ogs-0.023
ves-0.022
Token her
Token ID607
Feature activation+1.67e-03

Pos logit contributions
 entire+0.011
ifying+0.011
 front+0.009
 exact+0.009
flower+0.009
Neg logit contributions
drivers-0.014
 villains-0.013
ogs-0.013
dies-0.013
ests-0.012
Token listen
Token ID6004
Feature activation+6.33e-03

Pos logit contributions
drivers+0.001
ogs+0.001
dies+0.001
 villains+0.001
 weapons+0.001
Neg logit contributions
 entire-0.001
ifying-0.001
flower-0.001
chie-0.001
 exact-0.001
Token and
Token ID290
Feature activation-4.69e-03

Pos logit contributions
 entire+0.009
ifying+0.008
 exact+0.007
flower+0.007
chie+0.007
Neg logit contributions
drivers-0.011
dies-0.010
ogs-0.010
 villains-0.010
ug-0.010
Token took
Token ID1718
Feature activation-5.78e-03

Pos logit contributions
drivers+0.010
ests+0.010
ogs+0.009
dies+0.009
ug+0.009
Neg logit contributions
 too-0.005
 still-0.004
 entire-0.004
 starting-0.004
 so-0.004
Token the
Token ID262
Feature activation-6.23e-03

Pos logit contributions
 villains+0.013
drivers+0.013
uddy+0.012
dies+0.012
iked+0.012
Neg logit contributions
 too-0.005
 entire-0.005
 away-0.005
 my-0.005
 front-0.004
Token ash
Token ID12530
Feature activation-3.10e-02

Pos logit contributions
ogs+0.012
drivers+0.012
dies+0.011
 villains+0.011
ves+0.011
Neg logit contributions
 entire-0.008
ifying-0.007
 exact-0.006
flower-0.006
chie-0.006
Tokentr
Token ID2213
Feature activation-7.81e-02

Pos logit contributions
drivers+0.044
ogs+0.044
dies+0.043
 villains+0.042
ves+0.040
Neg logit contributions
 entire-0.051
ifying-0.048
flower-0.044
chie-0.042
 exact-0.041
Tokenay
Token ID323
Feature activation-2.73e-02

Pos logit contributions
drivers+0.111
dies+0.111
 villains+0.111
ogs+0.109
 weapons+0.101
Neg logit contributions
 entire-0.126
ifying-0.122
chie-0.107
flower-0.106
 exact-0.102
Token anyway
Token ID6949
Feature activation+8.27e-04

Pos logit contributions
 villains+0.056
dies+0.055
drivers+0.051
renches+0.051
 trek+0.051
Neg logit contributions
 entire-0.027
ifying-0.024
 too-0.024
".-0.021
flower-0.020
Token.
Token ID13
Feature activation-4.01e-03

Pos logit contributions
 entire+0.001
 too+0.001
ifying+0.001
".+0.000
 front+0.000
Neg logit contributions
dies-0.002
 villains-0.002
drivers-0.002
ests-0.002
 weapons-0.002
Token
Token ID198
Feature activation-3.39e-03

Pos logit contributions
drivers+0.007
ogs+0.006
dies+0.006
 villains+0.006
ves+0.006
Neg logit contributions
 entire-0.006
ifying-0.005
flower-0.005
chie-0.005
 exact-0.005
Token
Token ID198
Feature activation+7.67e-03

Pos logit contributions
dies+0.007
drivers+0.007
 villains+0.007
 weapons+0.007
ogs+0.007
Neg logit contributions
 entire-0.003
ifying-0.003
itates-0.002
 front-0.002
 ".-0.002
Token throw
Token ID3714
Feature activation-1.10e-02

Pos logit contributions
drivers+0.024
con+0.023
dies+0.023
ogs+0.023
ests+0.022
Neg logit contributions
 entire-0.017
 exact-0.014
ifying-0.014
 front-0.013
chie-0.013
Token away
Token ID1497
Feature activation+4.14e-03

Pos logit contributions
drivers+0.024
 villains+0.024
ogs+0.023
dies+0.023
ves+0.023
Neg logit contributions
 entire-0.011
ifying-0.009
 too-0.009
 away-0.008
 exact-0.008
Token the
Token ID262
Feature activation-2.29e-02

Pos logit contributions
 entire+0.005
ifying+0.005
 too+0.005
 front+0.004
 so+0.004
Neg logit contributions
drivers-0.008
dies-0.007
 villains-0.007
ogs-0.007
con-0.007
Token broken
Token ID5445
Feature activation-1.68e-02

Pos logit contributions
drivers+0.038
dies+0.035
ogs+0.035
 weapons+0.034
 villains+0.034
Neg logit contributions
ifying-0.032
 entire-0.032
chie-0.027
flower-0.026
itates-0.026
Token ash
Token ID12530
Feature activation-3.91e-02

Pos logit contributions
drivers+0.027
dies+0.025
ogs+0.025
 weapons+0.024
 villains+0.024
Neg logit contributions
 entire-0.025
ifying-0.025
chie-0.021
itates-0.021
 exact-0.020
Tokentr
Token ID2213
Feature activation-7.79e-02

Pos logit contributions
ogs+0.059
 villains+0.059
dies+0.058
drivers+0.058
ves+0.054
Neg logit contributions
 entire-0.061
ifying-0.057
flower-0.053
chie-0.049
 highest-0.049
Tokenay
Token ID323
Feature activation-2.25e-02

Pos logit contributions
drivers+0.105
ogs+0.100
dies+0.100
 villains+0.097
 weapons+0.092
Neg logit contributions
 entire-0.135
ifying-0.129
flower-0.114
chie-0.114
 exact-0.112
Token and
Token ID290
Feature activation+4.06e-03

Pos logit contributions
 villains+0.041
dies+0.041
ogs+0.039
drivers+0.038
renches+0.038
Neg logit contributions
 entire-0.028
ifying-0.026
 too-0.024
flower-0.023
chie-0.021
Token soon
Token ID2582
Feature activation-8.32e-03

Pos logit contributions
 entire+0.005
ifying+0.004
 too+0.004
 still+0.004
 so+0.004
Neg logit contributions
drivers-0.008
ogs-0.008
ests-0.007
dies-0.007
renches-0.007
Token forgotten
Token ID11564
Feature activation+1.53e-02

Pos logit contributions
drivers+0.014
ogs+0.013
dies+0.013
 villains+0.013
ests+0.013
Neg logit contributions
 entire-0.012
ifying-0.011
 too-0.009
chie-0.009
 exact-0.009
Token about
Token ID546
Feature activation-8.00e-03

Pos logit contributions
 entire+0.020
ifying+0.020
 too+0.016
chie+0.016
 exact+0.016
Neg logit contributions
drivers-0.027
dies-0.026
ogs-0.026
 villains-0.025
con-0.024
Token He
Token ID679
Feature activation-1.09e-02

Pos logit contributions
drivers+0.012
ogs+0.011
dies+0.011
 villains+0.011
ves+0.011
Neg logit contributions
 entire-0.011
ifying-0.010
 exact-0.009
flower-0.009
chie-0.009
Token quickly
Token ID2952
Feature activation+2.91e-03

Pos logit contributions
drivers+0.019
ogs+0.018
dies+0.018
 villains+0.018
 weapons+0.017
Neg logit contributions
 entire-0.015
ifying-0.014
flower-0.012
chie-0.012
 exact-0.012
Token grabbed
Token ID13176
Feature activation-1.06e-02

Pos logit contributions
 entire+0.004
ifying+0.003
chie+0.003
 exact+0.003
flower+0.003
Neg logit contributions
dies-0.005
ogs-0.005
drivers-0.005
 villains-0.005
ests-0.005
Token his
Token ID465
Feature activation-3.34e-02

Pos logit contributions
ogs+0.020
 villains+0.019
drivers+0.019
ves+0.018
dies+0.018
Neg logit contributions
 entire-0.014
ifying-0.013
 front-0.012
 my-0.012
 too-0.012
Token little
Token ID1310
Feature activation-3.16e-02

Pos logit contributions
drivers+0.038
ogs+0.036
dies+0.036
 villains+0.035
 weapons+0.033
Neg logit contributions
 entire-0.065
ifying-0.062
flower-0.056
chie-0.056
 exact-0.055
Token er
Token ID1931
Feature activation-7.77e-02

Pos logit contributions
drivers+0.047
ogs+0.047
dies+0.045
ves+0.045
 villains+0.045
Neg logit contributions
 entire-0.052
ifying-0.047
chie-0.044
flower-0.043
 exact-0.043
Tokenaser
Token ID6005
Feature activation-3.20e-02

Pos logit contributions
 villains+0.131
dies+0.131
ogs+0.126
drivers+0.126
 trek+0.124
Neg logit contributions
 entire-0.105
ifying-0.098
chie-0.095
flower-0.084
 highest-0.082
Token and
Token ID290
Feature activation-6.38e-03

Pos logit contributions
dies+0.065
drivers+0.065
 villains+0.065
ogs+0.064
ves+0.063
Neg logit contributions
 entire-0.035
 too-0.030
ifying-0.029
 front-0.028
 so-0.026
Token followed
Token ID3940
Feature activation-4.21e-03

Pos logit contributions
ogs+0.014
dies+0.013
 villains+0.013
ests+0.013
renches+0.013
Neg logit contributions
 entire-0.005
 too-0.005
 so-0.005
 nearest-0.005
 my-0.005
Token Mrs
Token ID9074
Feature activation+1.68e-03

Pos logit contributions
drivers+0.007
ogs+0.007
ests+0.007
 villains+0.007
ug+0.006
Neg logit contributions
 entire-0.006
ifying-0.006
 front-0.006
 my-0.005
 too-0.005
Token Green
Token ID3469
Feature activation-5.47e-03

Pos logit contributions
 entire+0.002
ifying+0.002
flower+0.002
chie+0.002
 exact+0.002
Neg logit contributions
drivers-0.003
ogs-0.003
 villains-0.003
dies-0.003
ves-0.003
Token I
Token ID314
Feature activation-9.48e-04

Pos logit contributions
drivers+0.034
dies+0.032
ogs+0.032
 villains+0.032
 weapons+0.031
Neg logit contributions
 entire-0.016
ifying-0.015
chie-0.012
 exact-0.012
flower-0.012
Token keep
Token ID1394
Feature activation-4.75e-04

Pos logit contributions
drivers+0.001
ogs+0.001
 villains+0.001
dies+0.001
ves+0.001
Neg logit contributions
 entire-0.002
ifying-0.002
ksh-0.001
 exact-0.001
itates-0.001
Token the
Token ID262
Feature activation-1.87e-02

Pos logit contributions
drivers+0.001
dies+0.001
ogs+0.001
 villains+0.001
ves+0.001
Neg logit contributions
 entire-0.001
ifying-0.001
 exact-0.001
flower-0.001
chie-0.000
Token talking
Token ID3375
Feature activation+1.68e-02

Pos logit contributions
drivers+0.022
dies+0.020
 weapons+0.020
con+0.020
 teamwork+0.019
Neg logit contributions
ifying-0.036
ksh-0.030
 entire-0.029
chie-0.029
itates-0.029
Token ash
Token ID12530
Feature activation-1.24e-02

Pos logit contributions
 entire+0.025
ifying+0.025
chie+0.021
 exact+0.021
itates+0.021
Neg logit contributions
drivers-0.025
ogs-0.024
dies-0.024
 villains-0.023
ves-0.023
Tokentr
Token ID2213
Feature activation-7.75e-02

Pos logit contributions
drivers+0.016
ogs+0.015
dies+0.015
 villains+0.014
con+0.014
Neg logit contributions
 entire-0.022
ifying-0.021
chie-0.019
 exact-0.019
flower-0.018
Tokenay
Token ID323
Feature activation-1.06e-02

Pos logit contributions
drivers+0.099
ogs+0.093
dies+0.090
 villains+0.087
ves+0.084
Neg logit contributions
 entire-0.141
ifying-0.133
 exact-0.119
chie-0.119
flower-0.119
Token?
Token ID30
Feature activation-1.26e-02

Pos logit contributions
drivers+0.021
ogs+0.020
dies+0.020
 villains+0.020
 weapons+0.019
Neg logit contributions
 entire-0.012
ifying-0.011
chie-0.009
flower-0.009
 exact-0.009
Tokenâ
Token ID22940
Feature activation+1.82e-02

Pos logit contributions
drivers+0.022
ogs+0.021
dies+0.021
 villains+0.020
 weapons+0.019
Neg logit contributions
 entire-0.017
ifying-0.016
flower-0.014
chie-0.014
 exact-0.013
Token€
Token ID26391
Feature activation+1.18e-02

Pos logit contributions
ifying+0.027
 entire+0.024
flower+0.022
chie+0.021
 nearest+0.021
Neg logit contributions
ogs-0.029
drivers-0.028
dies-0.027
con-0.027
ogged-0.027
Token
Token ID198
Feature activation+6.19e-03

Pos logit contributions
 entire+0.017
ifying+0.016
chie+0.014
flower+0.013
 exact+0.013
Neg logit contributions
drivers-0.020
ogs-0.020
dies-0.019
 villains-0.019
con-0.018
Token little
Token ID1310
Feature activation-1.66e-03

Pos logit contributions
 entire+0.006
ifying+0.006
chie+0.005
flower+0.005
 exact+0.005
Neg logit contributions
drivers-0.008
ogs-0.007
dies-0.007
 villains-0.007
 weapons-0.007
Token boy
Token ID2933
Feature activation-2.39e-03

Pos logit contributions
drivers+0.002
dies+0.002
 weapons+0.002
ogs+0.002
 villains+0.002
Neg logit contributions
ifying-0.003
 entire-0.003
chie-0.003
 ".-0.002
itates-0.002
Token squeezed
Token ID29650
Feature activation-5.25e-03

Pos logit contributions
drivers+0.004
ogs+0.004
dies+0.004
 villains+0.004
 weapons+0.004
Neg logit contributions
 entire-0.003
ifying-0.003
flower-0.003
 exact-0.002
chie-0.002
Token the
Token ID262
Feature activation-1.68e-02

Pos logit contributions
drivers+0.012
 villains+0.012
ogs+0.012
ves+0.012
dies+0.011
Neg logit contributions
 entire-0.005
 too-0.004
ifying-0.004
 away-0.004
 front-0.004
Token ash
Token ID12530
Feature activation-3.98e-02

Pos logit contributions
drivers+0.024
dies+0.021
 weapons+0.021
con+0.021
ves+0.020
Neg logit contributions
ifying-0.028
 entire-0.025
 ".-0.024
ksh-0.023
chie-0.023
Tokentr
Token ID2213
Feature activation-7.72e-02

Pos logit contributions
ogs+0.062
 villains+0.061
drivers+0.061
dies+0.060
old+0.056
Neg logit contributions
 entire-0.062
ifying-0.058
flower-0.053
 highest-0.049
 front-0.049
Tokenay
Token ID323
Feature activation-1.73e-02

Pos logit contributions
drivers+0.109
dies+0.106
ogs+0.106
 villains+0.105
 weapons+0.098
Neg logit contributions
 entire-0.128
ifying-0.124
chie-0.108
flower-0.108
 exact-0.105
Token close
Token ID1969
Feature activation+9.73e-03

Pos logit contributions
 villains+0.031
dies+0.031
drivers+0.030
ogs+0.030
renches+0.029
Neg logit contributions
 entire-0.022
ifying-0.020
 too-0.019
flower-0.018
 so-0.017
Token and
Token ID290
Feature activation-7.06e-03

Pos logit contributions
 entire+0.015
ifying+0.014
flower+0.012
 exact+0.012
chie+0.012
Neg logit contributions
drivers-0.016
ogs-0.015
dies-0.015
 villains-0.015
ves-0.014
Token gave
Token ID2921
Feature activation+1.05e-02

Pos logit contributions
drivers+0.015
ogs+0.015
ests+0.014
 villains+0.014
dies+0.013
Neg logit contributions
 entire-0.008
ifying-0.007
 too-0.006
 still-0.006
 prett-0.006
Token it
Token ID340
Feature activation-5.39e-03

Pos logit contributions
 entire+0.013
ifying+0.011
 too+0.010
 front+0.010
 exact+0.009
Neg logit contributions
drivers-0.020
ogs-0.020
 villains-0.020
dies-0.019
con-0.018
Token her
Token ID607
Feature activation-4.63e-03

Pos logit contributions
 entire+0.001
ifying+0.001
 front+0.001
 too+0.001
 so+0.001
Neg logit contributions
drivers-0.002
dies-0.002
ogs-0.002
uddy-0.002
driver-0.002
Token new
Token ID649
Feature activation-1.88e-02

Pos logit contributions
drivers+0.007
ogs+0.007
dies+0.007
 villains+0.007
ves+0.006
Neg logit contributions
 entire-0.007
ifying-0.007
flower-0.006
 exact-0.006
chie-0.006
Token fl
Token ID781
Feature activation-1.48e-02

Pos logit contributions
drivers+0.028
ogs+0.027
dies+0.027
 villains+0.026
 weapons+0.025
Neg logit contributions
 entire-0.030
ifying-0.029
chie-0.025
flower-0.025
 exact-0.025
Tokenute
Token ID1133
Feature activation-1.19e-02

Pos logit contributions
drivers+0.018
ogs+0.017
dies+0.016
 villains+0.016
ves+0.015
Neg logit contributions
 entire-0.028
ifying-0.027
flower-0.024
chie-0.024
 exact-0.024
Token.
Token ID13
Feature activation+8.54e-04

Pos logit contributions
drivers+0.022
dies+0.021
ogs+0.021
 villains+0.020
 weapons+0.019
Neg logit contributions
 entire-0.015
ifying-0.014
flower-0.012
chie-0.012
 exact-0.012
Token That
Token ID1320
Feature activation+4.35e-02

Pos logit contributions
 entire+0.001
ifying+0.001
 ".+0.001
 exact+0.001
 front+0.001
Neg logit contributions
drivers-0.002
 weapons-0.002
dies-0.002
 villains-0.002
ug-0.002
Token day
Token ID1110
Feature activation+2.00e-02

Pos logit contributions
 entire+0.056
ifying+0.050
flower+0.043
 exact+0.043
chie+0.040
Neg logit contributions
drivers-0.083
ogs-0.077
dies-0.076
 villains-0.076
ug-0.073
Token was
Token ID373
Feature activation+1.60e-02

Pos logit contributions
 entire+0.029
ifying+0.028
 too+0.025
 already+0.023
 so+0.023
Neg logit contributions
drivers-0.035
dies-0.032
 villains-0.032
 weapons-0.031
renches-0.031
Token as
Token ID355
Feature activation+8.57e-03

Pos logit contributions
 entire+0.023
ifying+0.022
 exact+0.019
flower+0.018
chie+0.018
Neg logit contributions
drivers-0.027
dies-0.026
ogs-0.025
 villains-0.025
 weapons-0.024
Token delicious
Token ID12625
Feature activation+3.25e-04

Pos logit contributions
 entire+0.011
ifying+0.008
 too+0.008
 my+0.008
 whole+0.008
Neg logit contributions
drivers-0.018
dies-0.017
ogs-0.016
 villains-0.016
ogged-0.016
Token as
Token ID355
Feature activation-3.95e-04

Pos logit contributions
 entire+0.000
 too+0.000
 so+0.000
ifying+0.000
".+0.000
Neg logit contributions
drivers-0.001
 villains-0.001
renches-0.001
ogs-0.001
ests-0.001
TokenS
Token ID50
Feature activation+1.55e-02

Pos logit contributions
 entire+0.018
ifying+0.017
chie+0.015
flower+0.014
 exact+0.014
Neg logit contributions
drivers-0.023
ogs-0.022
dies-0.022
 villains-0.021
ves-0.020
Tokenandy
Token ID10757
Feature activation+7.26e-03

Pos logit contributions
 entire+0.027
ifying+0.025
chie+0.023
flower+0.022
 exact+0.022
Neg logit contributions
drivers-0.021
ogs-0.020
dies-0.020
 villains-0.019
con-0.018
Token and
Token ID290
Feature activation+1.56e-02

Pos logit contributions
 entire+0.009
ifying+0.008
flower+0.007
 exact+0.007
 ".+0.007
Neg logit contributions
drivers-0.014
dies-0.013
 villains-0.013
ogs-0.012
 weapons-0.012
Token Tom
Token ID4186
Feature activation+7.05e-03

Pos logit contributions
 entire+0.024
ifying+0.022
flower+0.020
 exact+0.020
 my+0.019
Neg logit contributions
drivers-0.027
ves-0.023
dies-0.023
ogs-0.023
con-0.023
Token were
Token ID547
Feature activation+1.08e-02

Pos logit contributions
 entire+0.011
ifying+0.010
flower+0.009
 exact+0.009
chie+0.008
Neg logit contributions
drivers-0.011
ogs-0.011
dies-0.011
 villains-0.011
ves-0.010
Token very
Token ID845
Feature activation+3.94e-02

Pos logit contributions
 entire+0.016
ifying+0.015
 exact+0.013
flower+0.013
 beginning+0.012
Neg logit contributions
drivers-0.019
dies-0.018
 villains-0.018
ves-0.017
con-0.017
Token excited
Token ID6568
Feature activation+1.41e-02

Pos logit contributions
 entire+0.047
ifying+0.043
flower+0.036
chie+0.036
 exact+0.035
Neg logit contributions
drivers-0.075
ogs-0.072
dies-0.072
 villains-0.070
 weapons-0.068
Token because
Token ID780
Feature activation+8.81e-03

Pos logit contributions
 entire+0.019
ifying+0.017
flower+0.015
 exact+0.015
chie+0.014
Neg logit contributions
drivers-0.025
ogs-0.024
dies-0.024
 villains-0.024
ves-0.023
Token it
Token ID340
Feature activation+2.66e-02

Pos logit contributions
 entire+0.017
ifying+0.016
flower+0.014
 exact+0.014
 front+0.014
Neg logit contributions
drivers-0.011
dies-0.010
ogs-0.010
 villains-0.010
con-0.010
Token was
Token ID373
Feature activation+3.55e-03

Pos logit contributions
 entire+0.039
ifying+0.037
chie+0.032
flower+0.032
 exact+0.031
Neg logit contributions
drivers-0.043
dies-0.041
ogs-0.041
 villains-0.040
ves-0.039
Token holiday
Token ID9912
Feature activation+2.13e-02

Pos logit contributions
 entire+0.005
ifying+0.005
flower+0.004
chie+0.004
 exact+0.004
Neg logit contributions
drivers-0.006
ogs-0.006
dies-0.006
 villains-0.005
ves-0.005
Token and
Token ID290
Feature activation+1.32e-03

Pos logit contributions
 entire+0.009
ifying+0.008
flower+0.007
 exact+0.007
 front+0.007
Neg logit contributions
drivers-0.013
dies-0.013
ogs-0.013
 villains-0.013
 weapons-0.012
Token the
Token ID262
Feature activation+7.09e-05

Pos logit contributions
 entire+0.001
 too+0.001
ifying+0.001
 front+0.001
 nearest+0.001
Neg logit contributions
ogs-0.003
dies-0.003
drivers-0.003
 villains-0.003
ug-0.003
Token sorry
Token ID7926
Feature activation+1.61e-02

Pos logit contributions
 entire+0.000
ifying+0.000
flower+0.000
 exact+0.000
chie+0.000
Neg logit contributions
drivers-0.000
ogs-0.000
dies-0.000
 villains-0.000
ug-0.000
Token faces
Token ID6698
Feature activation+1.49e-02

Pos logit contributions
 entire+0.021
ifying+0.019
flower+0.018
 too+0.017
chie+0.017
Neg logit contributions
drivers-0.029
 villains-0.027
dies-0.027
ogs-0.026
con-0.026
Token of
Token ID286
Feature activation+1.81e-04

Pos logit contributions
 entire+0.019
ifying+0.018
flower+0.015
 exact+0.015
chie+0.015
Neg logit contributions
drivers-0.027
ogs-0.026
dies-0.026
 villains-0.025
con-0.025
Token Sara
Token ID24799
Feature activation+3.24e-02

Pos logit contributions
 entire+0.000
ifying+0.000
 front+0.000
 too+0.000
chie+0.000
Neg logit contributions
drivers-0.000
ogs-0.000
dies-0.000
 villains-0.000
con-0.000
Token and
Token ID290
Feature activation+8.86e-03

Pos logit contributions
 entire+0.038
ifying+0.034
flower+0.029
chie+0.028
 exact+0.028
Neg logit contributions
drivers-0.063
ogs-0.061
dies-0.060
 villains-0.059
 weapons-0.057
Token Ben
Token ID3932
Feature activation+2.31e-02

Pos logit contributions
ifying+0.012
 entire+0.011
flower+0.011
 too+0.011
 front+0.011
Neg logit contributions
drivers-0.015
ogs-0.015
dies-0.015
ves-0.014
 villains-0.014
Token.
Token ID13
Feature activation-9.66e-03

Pos logit contributions
 entire+0.031
ifying+0.029
chie+0.025
flower+0.025
 exact+0.024
Neg logit contributions
drivers-0.040
ogs-0.039
dies-0.038
 villains-0.038
 weapons-0.036
Token She
Token ID1375
Feature activation-4.36e-03

Pos logit contributions
drivers+0.021
 villains+0.020
dies+0.020
ug+0.020
ves+0.019
Neg logit contributions
 entire-0.008
ifying-0.008
itates-0.007
 front-0.007
 too-0.007
Token understood
Token ID7247
Feature activation+1.45e-02

Pos logit contributions
ogs+0.008
drivers+0.008
dies+0.008
 villains+0.008
 weapons+0.007
Neg logit contributions
 entire-0.005
ifying-0.005
flower-0.004
chie-0.004
 exact-0.004
Token.
Token ID13
Feature activation+5.08e-03

Pos logit contributions
 entire+0.023
ifying+0.022
flower+0.019
chie+0.019
 exact+0.018
Neg logit contributions
drivers-0.028
ogs-0.027
dies-0.027
 villains-0.026
 weapons-0.025
Token Lily
Token ID20037
Feature activation-5.70e-03

Pos logit contributions
 entire+0.006
ifying+0.006
 ".+0.005
 exact+0.005
flower+0.005
Neg logit contributions
drivers-0.009
dies-0.009
ogs-0.009
ves-0.009
 villains-0.009
Token liked
Token ID8288
Feature activation-7.49e-03

Pos logit contributions
drivers+0.010
ogs+0.010
dies+0.010
 villains+0.009
 weapons+0.009
Neg logit contributions
 entire-0.008
ifying-0.007
chie-0.006
flower-0.006
 exact-0.006
Token to
Token ID284
Feature activation-1.24e-02

Pos logit contributions
drivers+0.014
dies+0.013
ogs+0.013
 villains+0.013
ug+0.012
Neg logit contributions
 entire-0.009
ifying-0.009
flower-0.008
 too-0.008
 front-0.008
Token take
Token ID1011
Feature activation-2.37e-03

Pos logit contributions
drivers+0.020
dies+0.019
ogs+0.019
 villains+0.018
 weapons+0.018
Neg logit contributions
 entire-0.019
ifying-0.017
 exact-0.015
flower-0.015
 front-0.015
Token Anna
Token ID11735
Feature activation+3.51e-02

Pos logit contributions
drivers+0.005
dies+0.004
 villains+0.004
ogs+0.004
uddy+0.004
Neg logit contributions
 entire-0.003
ifying-0.002
 too-0.002
 my-0.002
 front-0.002
Token everywhere
Token ID8347
Feature activation-1.41e-02

Pos logit contributions
 entire+0.034
ifying+0.031
flower+0.025
chie+0.024
 exact+0.024
Neg logit contributions
drivers-0.075
dies-0.072
ogs-0.072
 villains-0.071
ves-0.068
Token she
Token ID673
Feature activation+1.63e-02

Pos logit contributions
drivers+0.025
dies+0.024
ogs+0.023
 villains+0.023
con+0.023
Neg logit contributions
 entire-0.019
ifying-0.018
 too-0.017
 front-0.015
flower-0.015
Token went
Token ID1816
Feature activation-2.92e-03

Pos logit contributions
 entire+0.028
ifying+0.026
flower+0.023
 exact+0.023
chie+0.023
Neg logit contributions
drivers-0.023
ogs-0.022
dies-0.022
 villains-0.021
 weapons-0.021
Token.
Token ID13
Feature activation+2.19e-04

Pos logit contributions
drivers+0.006
dies+0.006
 villains+0.006
 teamwork+0.006
ogs+0.006
Neg logit contributions
 entire-0.003
ifying-0.002
 too-0.002
 front-0.002
".-0.002
Token
Token ID198
Feature activation-3.12e-03

Pos logit contributions
 entire+0.000
ifying+0.000
chie+0.000
flower+0.000
 exact+0.000
Neg logit contributions
drivers-0.000
ogs-0.000
dies-0.000
 villains-0.000
 weapons-0.000
Token and
Token ID290
Feature activation-1.22e-02

Pos logit contributions
 entire+0.005
ifying+0.005
flower+0.004
chie+0.004
 exact+0.004
Neg logit contributions
drivers-0.007
ogs-0.006
dies-0.006
 villains-0.006
ves-0.006
Token said
Token ID531
Feature activation-2.12e-03

Pos logit contributions
ogs+0.026
ests+0.025
dies+0.024
 villains+0.024
drivers+0.023
Neg logit contributions
 entire-0.012
ifying-0.012
 too-0.011
 prett-0.010
 still-0.010
Token,
Token ID11
Feature activation-2.87e-03

Pos logit contributions
ogs+0.004
dies+0.004
drivers+0.003
ests+0.003
 villains+0.003
Neg logit contributions
ifying-0.003
 entire-0.003
 too-0.003
chie-0.002
 so-0.002
Token �
Token ID6184
Feature activation-5.26e-03

Pos logit contributions
drivers+0.005
ogs+0.004
dies+0.004
 villains+0.004
 weapons+0.004
Neg logit contributions
 entire-0.004
ifying-0.004
chie-0.003
flower-0.003
 exact-0.003
Token�
Token ID95
Feature activation+7.96e-03

Pos logit contributions
drivers+0.008
ogs+0.007
dies+0.007
con+0.007
ves+0.007
Neg logit contributions
 entire-0.009
ifying-0.008
flower-0.008
chie-0.007
 exact-0.007
Token€
Token ID26391
Feature activation+3.22e-02

Pos logit contributions
ifying+0.012
flower+0.011
 entire+0.011
 prett+0.010
 brightest+0.009
Neg logit contributions
drivers-0.012
ogs-0.012
dies-0.012
con-0.011
 villains-0.011
Token�
Token ID129
Feature activation+1.13e-02

Pos logit contributions
 entire+0.052
ifying+0.049
chie+0.044
flower+0.044
 exact+0.044
Neg logit contributions
drivers-0.048
ogs-0.046
dies-0.044
ves-0.043
 villains-0.043
Token�
Token ID241
Feature activation+1.63e-02

Pos logit contributions
ifying+0.015
flower+0.014
 entire+0.013
m+0.012
 brightest+0.011
Neg logit contributions
ogs-0.020
ves-0.020
drivers-0.020
con-0.019
dies-0.019
TokenI
Token ID40
Feature activation-5.18e-03

Pos logit contributions
 entire+0.022
ifying+0.020
chie+0.017
flower+0.017
 exact+0.017
Neg logit contributions
drivers-0.029
ogs-0.028
dies-0.027
 villains-0.027
 weapons-0.026
Token win
Token ID1592
Feature activation-9.51e-04

Pos logit contributions
drivers+0.009
ogs+0.009
dies+0.009
 villains+0.009
 weapons+0.009
Neg logit contributions
 entire-0.007
ifying-0.006
flower-0.005
chie-0.005
 exact-0.005
Token!
Token ID0
Feature activation-2.69e-03

Pos logit contributions
drivers+0.002
dies+0.002
ogs+0.002
 villains+0.002
 weapons+0.002
Neg logit contributions
 entire-0.001
ifying-0.001
chie-0.001
 exact-0.001
flower-0.001
Token went
Token ID1816
Feature activation-3.21e-04

Pos logit contributions
ogs+0.007
dies+0.007
drivers+0.007
 villains+0.007
ests+0.007
Neg logit contributions
 entire-0.003
 too-0.003
 so-0.003
flower-0.003
ifying-0.003
Token off
Token ID572
Feature activation+7.40e-03

Pos logit contributions
drivers+0.001
ogs+0.000
dies+0.000
 villains+0.000
 weapons+0.000
Neg logit contributions
 entire-0.000
ifying-0.000
flower-0.000
chie-0.000
 exact-0.000
Token to
Token ID284
Feature activation+1.30e-02

Pos logit contributions
 entire+0.010
ifying+0.009
 front+0.008
 too+0.008
chie+0.007
Neg logit contributions
drivers-0.014
dies-0.013
 villains-0.013
ogs-0.013
ests-0.012
Token play
Token ID711
Feature activation-6.50e-03

Pos logit contributions
 entire+0.016
ifying+0.013
 front+0.013
 exact+0.013
 nearest+0.012
Neg logit contributions
drivers-0.026
dies-0.024
 villains-0.023
ests-0.023
ogs-0.023
Token elsewhere
Token ID8057
Feature activation+3.54e-04

Pos logit contributions
drivers+0.012
ogs+0.012
dies+0.012
 villains+0.011
con+0.011
Neg logit contributions
 entire-0.008
ifying-0.007
 too-0.006
 exact-0.006
 front-0.006
Token.
Token ID13
Feature activation-9.60e-04

Pos logit contributions
 entire+0.000
ifying+0.000
flower+0.000
chie+0.000
 exact+0.000
Neg logit contributions
drivers-0.001
ogs-0.001
dies-0.001
 villains-0.001
 weapons-0.001
Token Then
Token ID3244
Feature activation-1.00e-02

Pos logit contributions
drivers+0.001
ogs+0.001
dies+0.001
 villains+0.001
 weapons+0.001
Neg logit contributions
 entire-0.001
ifying-0.001
chie-0.001
flower-0.001
 exact-0.001
Token,
Token ID11
Feature activation+2.05e-02

Pos logit contributions
drivers+0.011
ogs+0.010
dies+0.010
 villains+0.010
con+0.009
Neg logit contributions
 entire-0.020
ifying-0.019
flower-0.017
 exact-0.017
chie-0.017
Token she
Token ID673
Feature activation+6.61e-03

Pos logit contributions
 entire+0.040
ifying+0.038
flower+0.034
 exact+0.034
chie+0.034
Neg logit contributions
drivers-0.025
dies-0.022
ogs-0.022
 villains-0.022
con-0.021
Token spotted
Token ID13489
Feature activation+7.63e-04

Pos logit contributions
 entire+0.007
ifying+0.006
flower+0.006
 too+0.005
 nearest+0.005
Neg logit contributions
drivers-0.014
ogs-0.013
dies-0.013
 villains-0.013
othe-0.013
Token something
Token ID1223
Feature activation-2.26e-02

Pos logit contributions
 entire+0.001
 front+0.001
ifying+0.001
 too+0.001
 so+0.001
Neg logit contributions
 villains-0.002
dies-0.002
ogs-0.002
drivers-0.002
uddy-0.002
Token trees
Token ID7150
Feature activation-1.98e-02

Pos logit contributions
drivers+0.015
ogs+0.015
dies+0.015
 villains+0.014
 weapons+0.014
Neg logit contributions
 entire-0.013
ifying-0.013
chie-0.011
flower-0.011
 exact-0.011
Token!
Token ID0
Feature activation-5.62e-03

Pos logit contributions
drivers+0.034
dies+0.033
ogs+0.033
 villains+0.032
 weapons+0.030
Neg logit contributions
 entire-0.027
ifying-0.026
flower-0.022
chie-0.021
 exact-0.021
Token
Token ID198
Feature activation-4.29e-03

Pos logit contributions
drivers+0.010
ogs+0.010
dies+0.010
 villains+0.010
ves+0.009
Neg logit contributions
 entire-0.007
ifying-0.007
flower-0.006
chie-0.006
 exact-0.006
Token
Token ID198
Feature activation+5.88e-03

Pos logit contributions
drivers+0.008
dies+0.008
ogs+0.008
 villains+0.008
 weapons+0.007
Neg logit contributions
 entire-0.005
ifying-0.005
chie-0.004
 exact-0.004
itates-0.004
TokenThe
Token ID464
Feature activation+9.67e-04

Pos logit contributions
 entire+0.007
ifying+0.006
chie+0.005
flower+0.005
 exact+0.005
Neg logit contributions
drivers-0.011
ogs-0.011
dies-0.011
 villains-0.011
 weapons-0.010
Token little
Token ID1310
Feature activation-6.80e-03

Pos logit contributions
ifying+0.001
 entire+0.001
chie+0.001
ksh+0.001
itates+0.001
Neg logit contributions
drivers-0.002
dies-0.001
 villains-0.001
ogs-0.001
 weapons-0.001
Token boy
Token ID2933
Feature activation-4.80e-03

Pos logit contributions
dies+0.009
drivers+0.009
 villains+0.008
 weapons+0.008
ogs+0.008
Neg logit contributions
ifying-0.012
 entire-0.011
chie-0.010
itates-0.010
ksh-0.010
Token was
Token ID373
Feature activation+4.67e-03

Pos logit contributions
drivers+0.008
ogs+0.008
dies+0.008
 villains+0.008
 weapons+0.007
Neg logit contributions
 entire-0.007
ifying-0.006
flower-0.005
chie-0.005
 exact-0.005
Token so
Token ID523
Feature activation+1.51e-02

Pos logit contributions
ifying+0.005
 entire+0.005
ksh+0.004
chie+0.004
flower+0.004
Neg logit contributions
drivers-0.008
ogs-0.008
 villains-0.008
old-0.008
ug-0.008
Token excited
Token ID6568
Feature activation+6.98e-03

Pos logit contributions
ifying+0.022
 entire+0.021
ksh+0.019
flower+0.018
chie+0.018
Neg logit contributions
ogs-0.023
old-0.022
drivers-0.022
 villains-0.022
othe-0.022
Token.
Token ID13
Feature activation-5.23e-03

Pos logit contributions
 entire+0.011
ifying+0.010
flower+0.009
chie+0.009
 exact+0.009
Neg logit contributions
drivers-0.011
ogs-0.010
dies-0.010
 villains-0.010
ves-0.010
Token was
Token ID373
Feature activation-3.61e-03

Pos logit contributions
drivers+0.012
ogs+0.012
dies+0.012
 villains+0.012
ves+0.011
Neg logit contributions
 entire-0.012
ifying-0.011
chie-0.010
 exact-0.010
flower-0.010
Token walking
Token ID6155
Feature activation-5.53e-03

Pos logit contributions
drivers+0.006
dies+0.005
ogs+0.005
 villains+0.005
con+0.005
Neg logit contributions
 entire-0.006
ifying-0.005
chie-0.004
flower-0.004
 exact-0.004
Token through
Token ID832
Feature activation-9.83e-03

Pos logit contributions
drivers+0.011
ves+0.010
 villains+0.010
dies+0.010
renches+0.010
Neg logit contributions
 entire-0.007
ifying-0.006
 too-0.006
flower-0.006
chie-0.005
Token the
Token ID262
Feature activation-1.47e-02

Pos logit contributions
drivers+0.024
ests+0.022
ogs+0.022
oot+0.022
renches+0.022
Neg logit contributions
 entire-0.008
 front-0.008
 too-0.007
 so-0.005
 nearest-0.005
Token jungle
Token ID20712
Feature activation-1.36e-02

Pos logit contributions
drivers+0.020
ogs+0.019
dies+0.019
 villains+0.019
 weapons+0.018
Neg logit contributions
 entire-0.025
ifying-0.024
chie-0.021
flower-0.021
 exact-0.021
Token,
Token ID11
Feature activation-2.98e-03

Pos logit contributions
drivers+0.024
ests+0.022
 villains+0.021
ogs+0.021
 lear+0.021
Neg logit contributions
 entire-0.018
 too-0.017
 so-0.016
 beginning-0.015
 nearest-0.015
Token he
Token ID339
Feature activation-4.05e-03

Pos logit contributions
ogs+0.005
drivers+0.005
ogged+0.005
oot+0.005
ests+0.004
Neg logit contributions
 entire-0.004
 too-0.004
 so-0.004
ifying-0.004
 most-0.004
Token came
Token ID1625
Feature activation+1.57e-02

Pos logit contributions
drivers+0.008
ogs+0.008
dies+0.008
 villains+0.008
con+0.008
Neg logit contributions
 entire-0.004
ifying-0.004
flower-0.003
chie-0.003
 exact-0.003
Token across
Token ID1973
Feature activation+4.70e-03

Pos logit contributions
 entire+0.018
ifying+0.017
chie+0.014
flower+0.014
 exact+0.014
Neg logit contributions
drivers-0.030
ogs-0.029
dies-0.029
 villains-0.028
 weapons-0.027
Token a
Token ID257
Feature activation-8.46e-03

Pos logit contributions
 entire+0.005
 front+0.004
ifying+0.004
 too+0.004
 nearest+0.004
Neg logit contributions
drivers-0.010
ogs-0.010
ug-0.010
oot-0.009
ves-0.009
Token little
Token ID1310
Feature activation-1.09e-02

Pos logit contributions
drivers+0.015
dies+0.014
ogs+0.014
 villains+0.014
 weapons+0.013
Neg logit contributions
 entire-0.011
ifying-0.011
flower-0.009
chie-0.009
 exact-0.009
Token.
Token ID13
Feature activation-2.62e-03

Pos logit contributions
drivers+0.058
dies+0.058
ogs+0.057
 villains+0.055
ug+0.054
Neg logit contributions
 entire-0.042
ifying-0.039
flower-0.035
 too-0.034
 front-0.032
Token She
Token ID1375
Feature activation-5.72e-03

Pos logit contributions
drivers+0.005
dies+0.005
ves+0.005
ogs+0.005
 weapons+0.005
Neg logit contributions
 entire-0.003
ifying-0.003
 ".-0.002
flower-0.002
 exact-0.002
Token said
Token ID531
Feature activation-2.40e-03

Pos logit contributions
drivers+0.012
ogs+0.012
dies+0.011
ug+0.011
 villains+0.011
Neg logit contributions
 entire-0.006
flower-0.005
ifying-0.005
 too-0.004
 exact-0.004
Token they
Token ID484
Feature activation+1.38e-02

Pos logit contributions
ogs+0.004
drivers+0.004
ests+0.004
dies+0.004
 trek+0.003
Neg logit contributions
 too-0.003
 so-0.003
 my-0.003
 mine-0.003
 entire-0.003
Token could
Token ID714
Feature activation-2.47e-04

Pos logit contributions
 entire+0.023
ifying+0.022
chie+0.020
flower+0.020
 exact+0.019
Neg logit contributions
drivers-0.019
dies-0.018
ogs-0.018
 villains-0.018
ves-0.017
Token use
Token ID779
Feature activation-8.02e-03

Pos logit contributions
drivers+0.000
ogs+0.000
dies+0.000
 villains+0.000
 weapons+0.000
Neg logit contributions
 entire-0.000
ifying-0.000
flower-0.000
 exact-0.000
chie-0.000
Token many
Token ID867
Feature activation-3.37e-02

Pos logit contributions
drivers+0.014
ogs+0.014
dies+0.014
 villains+0.014
ug+0.014
Neg logit contributions
 entire-0.010
ifying-0.010
 too-0.009
 front-0.009
 my-0.008
Token colors
Token ID7577
Feature activation-2.26e-02

Pos logit contributions
drivers+0.067
dies+0.064
ogs+0.063
 villains+0.061
ug+0.060
Neg logit contributions
 entire-0.040
ifying-0.033
 exact-0.030
 too-0.030
 front-0.030
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
dies+0.035
drivers+0.035
 villains+0.034
ogs+0.034
 trek+0.031
Neg logit contributions
 entire-0.034
ifying-0.033
flower-0.030
 too-0.029
 exact-0.028
Token make
Token ID787
Feature activation+1.31e-02

Pos logit contributions
ogs+0.020
drivers+0.019
ug+0.019
ogged+0.018
dies+0.018
Neg logit contributions
 entire-0.015
ifying-0.014
 too-0.012
 exact-0.012
 front-0.012
Token their
Token ID511
Feature activation+4.36e-03

Pos logit contributions
 entire+0.016
 prett+0.014
 too+0.013
 exact+0.013
ifying+0.013
Neg logit contributions
drivers-0.025
ogs-0.024
uddy-0.023
dies-0.023
iked-0.023
Token with
Token ID351
Feature activation-7.71e-03

Pos logit contributions
drivers+0.025
ogs+0.023
dies+0.023
 villains+0.022
con+0.022
Neg logit contributions
 entire-0.019
ifying-0.018
 exact-0.015
flower-0.015
chie-0.015
Token their
Token ID511
Feature activation-1.05e-02

Pos logit contributions
drivers+0.013
dies+0.013
ogs+0.013
 villains+0.012
con+0.012
Neg logit contributions
 entire-0.011
ifying-0.010
 front-0.009
 too-0.009
flower-0.009
Token toys
Token ID14958
Feature activation-1.57e-02

Pos logit contributions
ogs+0.015
drivers+0.015
dies+0.014
oot+0.014
 villains+0.014
Neg logit contributions
 entire-0.018
ifying-0.016
 front-0.015
flower-0.014
 exact-0.014
Token.
Token ID13
Feature activation-3.19e-03

Pos logit contributions
drivers+0.032
dies+0.032
ogs+0.031
 villains+0.030
ug+0.029
Neg logit contributions
 entire-0.016
ifying-0.015
 too-0.013
flower-0.012
 front-0.012
Token They
Token ID1119
Feature activation+2.28e-03

Pos logit contributions
drivers+0.005
ogs+0.005
dies+0.005
 villains+0.005
ves+0.005
Neg logit contributions
 entire-0.005
ifying-0.005
 exact-0.004
flower-0.004
itates-0.004
Token waited
Token ID13488
Feature activation+6.53e-04

Pos logit contributions
 entire+0.002
ifying+0.002
flower+0.002
 still+0.002
 exact+0.002
Neg logit contributions
ogs-0.005
drivers-0.005
dies-0.005
con-0.004
 weapons-0.004
Token for
Token ID329
Feature activation-1.83e-03

Pos logit contributions
 entire+0.001
ifying+0.001
flower+0.001
chie+0.001
 exact+0.001
Neg logit contributions
drivers-0.001
dies-0.001
ogs-0.001
 villains-0.001
con-0.001
Token the
Token ID262
Feature activation-3.39e-04

Pos logit contributions
drivers+0.003
ogs+0.003
dies+0.003
 villains+0.003
con+0.003
Neg logit contributions
 entire-0.002
ifying-0.002
 front-0.002
 too-0.002
 my-0.002
Token new
Token ID649
Feature activation-1.85e-02

Pos logit contributions
ogs+0.001
drivers+0.001
ogged+0.001
oot+0.001
old+0.001
Neg logit contributions
 entire-0.001
ifying-0.000
 front-0.000
 exact-0.000
 whole-0.000
Token battery
Token ID6555
Feature activation-2.00e-02

Pos logit contributions
ogs+0.029
othe+0.029
drivers+0.028
con+0.027
dies+0.027
Neg logit contributions
 entire-0.030
 front-0.024
 exact-0.023
 too-0.023
 whole-0.023
Token for
Token ID329
Feature activation-5.49e-03

Pos logit contributions
dies+0.036
ogs+0.036
drivers+0.036
 villains+0.035
renches+0.035
Neg logit contributions
 entire-0.023
ifying-0.021
 too-0.021
flower-0.019
chie-0.017
Token touched
Token ID12615
Feature activation+1.54e-04

Pos logit contributions
drivers+0.014
ogs+0.014
dies+0.014
 villains+0.013
ug+0.013
Neg logit contributions
 entire-0.010
ifying-0.009
flower-0.008
 front-0.008
 exact-0.008
Token his
Token ID465
Feature activation-1.60e-02

Pos logit contributions
ifying+0.000
 entire+0.000
flower+0.000
 too+0.000
 front+0.000
Neg logit contributions
 villains-0.000
drivers-0.000
dies-0.000
ogs-0.000
ves-0.000
Token rough
Token ID5210
Feature activation-6.48e-03

Pos logit contributions
drivers+0.026
dies+0.026
ogs+0.025
 villains+0.025
 weapons+0.025
Neg logit contributions
 entire-0.022
ifying-0.021
chie-0.018
 exact-0.018
itates-0.018
Token skin
Token ID4168
Feature activation-2.91e-03

Pos logit contributions
drivers+0.011
ogs+0.011
dies+0.011
 villains+0.011
ves+0.010
Neg logit contributions
 entire-0.009
ifying-0.008
flower-0.007
chie-0.007
 exact-0.007
Token gently
Token ID15165
Feature activation+2.16e-05

Pos logit contributions
drivers+0.005
ogs+0.005
dies+0.005
 villains+0.005
 weapons+0.005
Neg logit contributions
 entire-0.004
ifying-0.004
flower-0.003
 exact-0.003
chie-0.003
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
 entire+0.000
ifying+0.000
chie+0.000
flower+0.000
 exact+0.000
Neg logit contributions
drivers-0.000
ogs-0.000
dies-0.000
ves-0.000
con-0.000
Token He
Token ID679
Feature activation-7.07e-03

Pos logit contributions
ogs+0.024
dies+0.024
drivers+0.024
 villains+0.023
oot+0.022
Neg logit contributions
ifying-0.017
 entire-0.015
ksh-0.013
 ".-0.013
 too-0.013
Token lowered
Token ID17788
Feature activation-1.64e-02

Pos logit contributions
drivers+0.012
ogs+0.012
dies+0.012
 villains+0.012
ves+0.011
Neg logit contributions
 entire-0.009
ifying-0.009
chie-0.007
flower-0.007
 exact-0.007
Token his
Token ID465
Feature activation-1.46e-02

Pos logit contributions
drivers+0.025
ogs+0.025
 villains+0.024
dies+0.024
con+0.023
Neg logit contributions
 entire-0.025
ifying-0.025
 front-0.023
 too-0.023
flower-0.022
Token horn
Token ID12718
Feature activation-1.40e-02

Pos logit contributions
drivers+0.026
dies+0.026
ogs+0.025
 weapons+0.025
 villains+0.025
Neg logit contributions
 entire-0.017
ifying-0.016
flower-0.014
chie-0.014
 exact-0.014
Token and
Token ID290
Feature activation-8.27e-04

Pos logit contributions
drivers+0.023
ogs+0.022
dies+0.022
 villains+0.022
 weapons+0.021
Neg logit contributions
 entire-0.020
ifying-0.019
flower-0.017
chie-0.016
 exact-0.016
Token.
Token ID13
Feature activation-8.46e-03

Pos logit contributions
 entire+0.003
 too+0.003
ifying+0.003
 exact+0.002
 highest+0.002
Neg logit contributions
ogs-0.005
drivers-0.005
dies-0.005
 villains-0.005
renches-0.005
Token He
Token ID679
Feature activation-1.57e-02

Pos logit contributions
 villains+0.014
drivers+0.014
ogs+0.014
dies+0.014
ves+0.014
Neg logit contributions
 entire-0.011
ifying-0.010
 exact-0.009
ksh-0.009
flower-0.009
Token climbed
Token ID19952
Feature activation-1.05e-02

Pos logit contributions
ogs+0.032
drivers+0.031
dies+0.030
 villains+0.029
othe+0.029
Neg logit contributions
 entire-0.017
ifying-0.014
flower-0.013
 too-0.013
 nearest-0.012
Token up
Token ID510
Feature activation-9.15e-03

Pos logit contributions
 villains+0.021
dies+0.021
ogs+0.021
drivers+0.021
old+0.019
Neg logit contributions
 entire-0.011
ifying-0.010
 front-0.009
 highest-0.008
 too-0.008
Token and
Token ID290
Feature activation-2.84e-04

Pos logit contributions
 villains+0.022
 endings+0.021
ogs+0.021
dies+0.021
ves+0.021
Neg logit contributions
 too-0.007
 front-0.007
 entire-0.007
 so-0.006
 away-0.005
Token put
Token ID1234
Feature activation-1.93e-02

Pos logit contributions
drivers+0.001
ogs+0.001
dies+0.001
 villains+0.001
ves+0.000
Neg logit contributions
 entire-0.000
ifying-0.000
flower-0.000
 exact-0.000
chie-0.000
Token his
Token ID465
Feature activation-1.65e-02

Pos logit contributions
 villains+0.034
ogs+0.034
dies+0.033
drivers+0.033
con+0.031
Neg logit contributions
 entire-0.026
ifying-0.023
 too-0.021
 front-0.021
 exact-0.020
Token cars
Token ID5006
Feature activation-2.25e-02

Pos logit contributions
drivers+0.021
ogs+0.020
dies+0.019
 villains+0.019
ves+0.018
Neg logit contributions
 entire-0.031
ifying-0.029
chie-0.026
flower-0.026
 exact-0.026
Token on
Token ID319
Feature activation+1.10e-03

Pos logit contributions
dies+0.054
drivers+0.053
 villains+0.053
ogs+0.052
ves+0.051
Neg logit contributions
 entire-0.017
ifying-0.014
 too-0.011
chie-0.011
 highest-0.010
Token the
Token ID262
Feature activation+3.99e-03

Pos logit contributions
 entire+0.001
 front+0.001
ifying+0.001
 too+0.001
 so+0.001
Neg logit contributions
ogs-0.002
 villains-0.002
unn-0.002
drivers-0.002
dies-0.002
Token iron
Token ID6953
Feature activation-1.14e-02

Pos logit contributions
 entire+0.006
ifying+0.006
flower+0.005
chie+0.005
 exact+0.005
Neg logit contributions
drivers-0.006
dies-0.006
ogs-0.006
 villains-0.006
 weapons-0.005
Token He
Token ID679
Feature activation-6.58e-03

Pos logit contributions
drivers+0.013
ogs+0.013
dies+0.013
 villains+0.012
ves+0.012
Neg logit contributions
 entire-0.012
ifying-0.011
chie-0.010
flower-0.010
 exact-0.009
Token felt
Token ID2936
Feature activation-2.00e-03

Pos logit contributions
drivers+0.012
ogs+0.011
dies+0.011
 villains+0.011
 weapons+0.010
Neg logit contributions
 entire-0.009
ifying-0.008
flower-0.007
chie-0.007
 exact-0.007
Token sad
Token ID6507
Feature activation-4.33e-03

Pos logit contributions
drivers+0.004
ogs+0.003
dies+0.003
 villains+0.003
 weapons+0.003
Neg logit contributions
 entire-0.003
ifying-0.002
flower-0.002
chie-0.002
 exact-0.002
Token and
Token ID290
Feature activation+7.95e-03

Pos logit contributions
drivers+0.008
dies+0.008
ogs+0.008
ves+0.008
 villains+0.008
Neg logit contributions
 entire-0.005
 too-0.005
ifying-0.005
 so-0.004
 beginning-0.004
Token guilty
Token ID6717
Feature activation+1.07e-02

Pos logit contributions
 entire+0.011
ifying+0.011
flower+0.009
chie+0.009
 exact+0.009
Neg logit contributions
drivers-0.013
dies-0.013
 villains-0.013
ogs-0.013
 weapons-0.012
Token for
Token ID329
Feature activation-9.73e-03

Pos logit contributions
 entire+0.011
 too+0.010
ifying+0.010
 beginning+0.008
 so+0.008
Neg logit contributions
drivers-0.022
ogs-0.021
dies-0.021
con-0.020
 villains-0.020
Token what
Token ID644
Feature activation-4.36e-03

Pos logit contributions
ogs+0.019
drivers+0.019
ests+0.019
dies+0.019
ves+0.018
Neg logit contributions
 entire-0.011
 too-0.010
 my-0.009
 beginning-0.009
 front-0.009
Token he
Token ID339
Feature activation+7.26e-03

Pos logit contributions
drivers+0.006
ogs+0.006
ug+0.005
dies+0.005
con+0.005
Neg logit contributions
 entire-0.008
 too-0.006
ifying-0.006
 exact-0.006
 my-0.006
Token had
Token ID550
Feature activation+1.45e-02

Pos logit contributions
ifying+0.011
 entire+0.011
chie+0.010
flower+0.009
 exact+0.009
Neg logit contributions
drivers-0.011
dies-0.011
 villains-0.011
con-0.010
 weapons-0.010
Token done
Token ID1760
Feature activation+2.57e-02

Pos logit contributions
ifying+0.020
 entire+0.019
chie+0.016
flower+0.016
ksh+0.016
Neg logit contributions
ogs-0.025
drivers-0.025
 weapons-0.024
 villains-0.023
dies-0.023
Token.
Token ID13
Feature activation-6.93e-03

Pos logit contributions
 entire+0.030
ifying+0.025
 exact+0.022
 too+0.022
flower+0.021
Neg logit contributions
drivers-0.051
dies-0.049
ogs-0.048
 villains-0.048
con-0.047
Token mom
Token ID1995
Feature activation-2.26e-02

Pos logit contributions
drivers+0.024
ogs+0.023
dies+0.023
 villains+0.022
con+0.022
Neg logit contributions
 entire-0.020
 front-0.016
ifying-0.016
flower-0.016
 too-0.016
Tokenmy
Token ID1820
Feature activation-2.73e-02

Pos logit contributions
drivers+0.034
old+0.031
ogs+0.030
ug+0.029
ves+0.029
Neg logit contributions
 entire-0.036
ifying-0.036
 exact-0.029
chie-0.029
 highest-0.029
Token explained
Token ID4893
Feature activation-3.85e-03

Pos logit contributions
drivers+0.044
ogs+0.041
dies+0.041
 villains+0.040
ves+0.039
Neg logit contributions
 entire-0.041
ifying-0.039
chie-0.034
 exact-0.033
flower-0.033
Token,
Token ID11
Feature activation-1.25e-02

Pos logit contributions
drivers+0.007
 villains+0.007
ests+0.007
dies+0.006
ogs+0.006
Neg logit contributions
 too-0.005
 entire-0.004
ifying-0.004
 so-0.004
 my-0.004
Token "
Token ID366
Feature activation+8.22e-03

Pos logit contributions
drivers+0.017
ogs+0.015
dies+0.015
ves+0.015
 villains+0.015
Neg logit contributions
 entire-0.022
ifying-0.021
chie-0.019
flower-0.018
 exact-0.018
TokenWell
Token ID5779
Feature activation-1.56e-02

Pos logit contributions
 entire+0.010
ifying+0.009
chie+0.007
flower+0.007
 exact+0.007
Neg logit contributions
drivers-0.016
dies-0.015
ogs-0.015
 villains-0.015
ves-0.015
Token,
Token ID11
Feature activation+1.09e-02

Pos logit contributions
drivers+0.021
ogs+0.020
dies+0.020
 villains+0.019
con+0.018
Neg logit contributions
 entire-0.028
ifying-0.026
chie-0.023
 exact-0.023
flower-0.023
Token Jack
Token ID3619
Feature activation-9.66e-03

Pos logit contributions
 entire+0.015
ifying+0.014
 my+0.013
 exact+0.013
 too+0.013
Neg logit contributions
drivers-0.019
ogs-0.018
dies-0.017
 villains-0.017
con-0.017
Token,
Token ID11
Feature activation+4.17e-03

Pos logit contributions
drivers+0.013
ogs+0.012
dies+0.012
 villains+0.012
 weapons+0.012
Neg logit contributions
 entire-0.017
ifying-0.016
chie-0.014
flower-0.014
 exact-0.014
Token they
Token ID484
Feature activation+5.37e-03

Pos logit contributions
 entire+0.005
ifying+0.005
 exact+0.004
 my+0.004
 too+0.004
Neg logit contributions
drivers-0.007
ogs-0.007
dies-0.007
con-0.007
 villains-0.007
Token are
Token ID389
Feature activation+3.54e-03

Pos logit contributions
 entire+0.009
ifying+0.008
flower+0.007
ksh+0.007
chie+0.007
Neg logit contributions
drivers-0.008
dies-0.007
ogs-0.007
 villains-0.007
 weapons-0.007
Token have
Token ID423
Feature activation+4.12e-03

Pos logit contributions
 entire+0.003
ifying+0.002
flower+0.002
chie+0.002
 exact+0.002
Neg logit contributions
drivers-0.003
ogs-0.003
dies-0.003
 villains-0.002
ves-0.002
Token many
Token ID867
Feature activation-4.27e-02

Pos logit contributions
 entire+0.007
ifying+0.007
 exact+0.006
chie+0.006
 front+0.006
Neg logit contributions
ogs-0.006
drivers-0.005
dies-0.005
 villains-0.005
othe-0.005
Token pencil
Token ID21613
Feature activation-2.61e-02

Pos logit contributions
drivers+0.085
ogs+0.083
dies+0.082
 villains+0.080
ug+0.077
Neg logit contributions
 entire-0.049
ifying-0.041
 exact-0.036
chie-0.035
flower-0.034
Tokens
Token ID82
Feature activation-1.23e-02

Pos logit contributions
dies+0.050
 villains+0.049
ogs+0.049
drivers+0.048
 trek+0.047
Neg logit contributions
 entire-0.029
ifying-0.028
flower-0.024
 too-0.022
chie-0.022
Token of
Token ID286
Feature activation-1.99e-02

Pos logit contributions
drivers+0.021
dies+0.021
ogs+0.021
 villains+0.020
ves+0.019
Neg logit contributions
 entire-0.017
ifying-0.015
flower-0.013
chie-0.013
 exact-0.013
Token different
Token ID1180
Feature activation-1.60e-02

Pos logit contributions
ogs+0.041
drivers+0.041
dies+0.039
ug+0.039
 villains+0.038
Neg logit contributions
 entire-0.021
ifying-0.018
 exact-0.016
 front-0.015
 prett-0.015
Token colors
Token ID7577
Feature activation-1.47e-02

Pos logit contributions
drivers+0.033
ogs+0.031
dies+0.031
 villains+0.031
 weapons+0.030
Neg logit contributions
 entire-0.017
ifying-0.016
flower-0.013
chie-0.013
 exact-0.012
Token.
Token ID13
Feature activation+3.88e-03

Pos logit contributions
drivers+0.024
ogs+0.023
dies+0.022
 villains+0.022
ves+0.021
Neg logit contributions
 entire-0.022
ifying-0.020
chie-0.018
flower-0.017
 exact-0.017
Token They
Token ID1119
Feature activation+2.15e-03

Pos logit contributions
 entire+0.008
ifying+0.007
chie+0.007
flower+0.007
 exact+0.007
Neg logit contributions
ogs-0.004
drivers-0.004
ves-0.004
dies-0.004
 Spe-0.004
Token share
Token ID2648
Feature activation+1.56e-02

Pos logit contributions
 entire+0.003
ifying+0.003
chie+0.003
flower+0.003
 exact+0.003
Neg logit contributions
drivers-0.003
dies-0.003
ogs-0.003
 villains-0.003
 weapons-0.003
Token their
Token ID511
Feature activation-1.44e-02

Pos logit contributions
 entire+0.023
ifying+0.022
flower+0.019
 exact+0.019
 front+0.018
Neg logit contributions
drivers-0.025
ogs-0.024
dies-0.024
 villains-0.023
ug-0.022
Token and
Token ID290
Feature activation-1.17e-02

Pos logit contributions
drivers+0.010
ogs+0.010
dies+0.010
 villains+0.010
 weapons+0.009
Neg logit contributions
 entire-0.010
ifying-0.009
chie-0.008
flower-0.008
 exact-0.008
Token began
Token ID2540
Feature activation-2.01e-02

Pos logit contributions
ogs+0.020
dies+0.020
drivers+0.020
ests+0.020
ug+0.019
Neg logit contributions
 entire-0.015
ifying-0.014
 too-0.013
flower-0.013
 front-0.012
Token flying
Token ID7348
Feature activation-2.12e-02

Pos logit contributions
drivers+0.033
dies+0.032
ogs+0.032
 villains+0.031
ves+0.030
Neg logit contributions
 entire-0.029
ifying-0.027
flower-0.023
chie-0.023
 exact-0.023
Token around
Token ID1088
Feature activation-1.57e-03

Pos logit contributions
drivers+0.040
ogs+0.038
dies+0.038
 villains+0.037
ves+0.036
Neg logit contributions
 entire-0.026
ifying-0.024
chie-0.021
flower-0.020
 exact-0.020
Token making
Token ID1642
Feature activation+9.11e-03

Pos logit contributions
drivers+0.003
ogs+0.003
dies+0.003
 villains+0.003
ves+0.003
Neg logit contributions
 entire-0.002
ifying-0.002
flower-0.002
 front-0.001
chie-0.001
Token aer
Token ID9551
Feature activation-7.15e-02

Pos logit contributions
 entire+0.012
ifying+0.010
chie+0.009
 exact+0.009
 prett+0.009
Neg logit contributions
drivers-0.017
ogs-0.016
dies-0.016
 villains-0.015
ests-0.015
Tokenopl
Token ID20106
Feature activation-4.83e-02

Pos logit contributions
drivers+0.089
ogs+0.084
dies+0.083
 villains+0.081
 weapons+0.076
Neg logit contributions
 entire-0.133
ifying-0.126
chie-0.113
flower-0.113
 exact-0.111
Tokenane
Token ID1531
Feature activation-3.47e-02

Pos logit contributions
dies+0.071
 villains+0.070
drivers+0.069
 weapons+0.068
 teamwork+0.065
Neg logit contributions
 entire-0.071
ifying-0.070
chie-0.063
flower-0.061
 highest-0.058
Token noises
Token ID26782
Feature activation-2.34e-02

Pos logit contributions
drivers+0.054
ogs+0.052
dies+0.052
 villains+0.051
ves+0.048
Neg logit contributions
 entire-0.052
ifying-0.050
flower-0.043
chie-0.043
 exact-0.042
Token.
Token ID13
Feature activation-3.76e-03

Pos logit contributions
drivers+0.044
ogs+0.043
dies+0.042
 villains+0.041
 weapons+0.040
Neg logit contributions
 entire-0.029
ifying-0.026
flower-0.022
chie-0.022
 exact-0.021
Token Jack
Token ID3619
Feature activation-6.55e-03

Pos logit contributions
 villains+0.007
dies+0.007
drivers+0.007
 weapons+0.007
ves+0.007
Neg logit contributions
 entire-0.004
ifying-0.004
 ".-0.003
 exact-0.003
".-0.003
Token But
Token ID887
Feature activation+1.72e-04

Pos logit contributions
drivers+0.005
dies+0.005
ogs+0.005
 villains+0.005
 weapons+0.004
Neg logit contributions
 entire-0.009
ifying-0.008
flower-0.007
chie-0.007
 exact-0.007
Token Doug
Token ID15115
Feature activation-1.99e-02

Pos logit contributions
 entire+0.000
ifying+0.000
chie+0.000
flower+0.000
 exact+0.000
Neg logit contributions
drivers-0.000
ogs-0.000
dies-0.000
 villains-0.000
 weapons-0.000
Token was
Token ID373
Feature activation+7.69e-03

Pos logit contributions
drivers+0.036
ogs+0.035
dies+0.034
 villains+0.034
 weapons+0.032
Neg logit contributions
 entire-0.026
ifying-0.024
flower-0.021
chie-0.020
 exact-0.020
Token having
Token ID1719
Feature activation-1.42e-03

Pos logit contributions
ifying+0.012
 entire+0.011
ksh+0.010
flower+0.010
chie+0.009
Neg logit contributions
drivers-0.011
 villains-0.011
ogs-0.011
old-0.010
 weapons-0.010
Token too
Token ID1165
Feature activation-3.17e-04

Pos logit contributions
drivers+0.003
ogs+0.003
dies+0.003
ves+0.003
 villains+0.003
Neg logit contributions
 entire-0.002
ifying-0.001
 exact-0.001
chie-0.001
 too-0.001
Token much
Token ID881
Feature activation-5.31e-02

Pos logit contributions
drivers+0.000
dies+0.000
 villains+0.000
con+0.000
ogs+0.000
Neg logit contributions
 entire-0.001
ifying-0.000
chie-0.000
 too-0.000
 front-0.000
Token fun
Token ID1257
Feature activation-2.75e-02

Pos logit contributions
ug+0.090
ogs+0.089
dies+0.088
 Jah+0.088
or+0.088
Neg logit contributions
 entire-0.066
 too-0.063
 front-0.053
 beginning-0.050
 source-0.048
Token inside
Token ID2641
Feature activation-9.57e-03

Pos logit contributions
drivers+0.054
dies+0.053
ogs+0.051
 villains+0.051
ests+0.050
Neg logit contributions
 entire-0.029
 too-0.026
ifying-0.023
".-0.022
 so-0.021
Token to
Token ID284
Feature activation-6.41e-03

Pos logit contributions
drivers+0.019
dies+0.018
ogs+0.018
 villains+0.018
con+0.018
Neg logit contributions
 entire-0.011
ifying-0.010
 too-0.009
 front-0.008
 exact-0.008
Token stop
Token ID2245
Feature activation-2.00e-03

Pos logit contributions
drivers+0.012
dies+0.011
ogs+0.011
con+0.010
 weapons+0.010
Neg logit contributions
 entire-0.009
ifying-0.008
 front-0.007
chie-0.007
 exact-0.007
Token playing
Token ID2712
Feature activation-9.57e-03

Pos logit contributions
drivers+0.004
ogs+0.003
dies+0.003
 villains+0.003
ves+0.003
Neg logit contributions
 entire-0.003
ifying-0.002
flower-0.002
 too-0.002
 front-0.002
Token.
Token ID13
Feature activation-1.80e-03

Pos logit contributions
drivers+0.002
dies+0.002
 villains+0.002
ogs+0.002
ests+0.002
Neg logit contributions
 entire-0.002
ifying-0.002
 too-0.001
 exact-0.001
flower-0.001
Token The
Token ID383
Feature activation-8.26e-03

Pos logit contributions
drivers+0.003
dies+0.003
ves+0.003
 villains+0.003
ogs+0.003
Neg logit contributions
 entire-0.002
ifying-0.002
 exact-0.002
chie-0.002
 ".-0.002
Token letters
Token ID7475
Feature activation-3.18e-03

Pos logit contributions
drivers+0.013
ogs+0.013
dies+0.012
 villains+0.012
ug+0.012
Neg logit contributions
 entire-0.013
ifying-0.012
 exact-0.010
flower-0.010
chie-0.010
Token were
Token ID547
Feature activation+1.64e-03

Pos logit contributions
drivers+0.005
ogs+0.005
dies+0.005
 villains+0.005
con+0.005
Neg logit contributions
 entire-0.005
ifying-0.004
flower-0.004
chie-0.004
 exact-0.004
Token all
Token ID477
Feature activation+5.93e-03

Pos logit contributions
 entire+0.002
ifying+0.002
chie+0.001
flower+0.001
 exact+0.001
Neg logit contributions
drivers-0.003
ogs-0.003
dies-0.003
 villains-0.003
ves-0.003
Token cr
Token ID1067
Feature activation-5.41e-02

Pos logit contributions
 entire+0.008
ifying+0.007
flower+0.006
chie+0.006
 exact+0.006
Neg logit contributions
drivers-0.011
ogs-0.010
dies-0.010
 villains-0.010
 weapons-0.010
Tokenum
Token ID388
Feature activation+8.03e-03

Pos logit contributions
 villains+0.084
dies+0.083
drivers+0.081
ogs+0.080
 weapons+0.078
Neg logit contributions
 entire-0.084
ifying-0.082
chie-0.071
flower-0.067
 exact-0.065
Tokenpled
Token ID10137
Feature activation-1.29e-03

Pos logit contributions
 entire+0.012
ifying+0.012
flower+0.010
chie+0.010
 exact+0.010
Neg logit contributions
drivers-0.012
ogs-0.012
dies-0.012
 villains-0.012
ves-0.011
Token and
Token ID290
Feature activation-4.28e-03

Pos logit contributions
drivers+0.002
 villains+0.002
ogs+0.002
dies+0.002
con+0.002
Neg logit contributions
 entire-0.002
ifying-0.002
flower-0.001
 exact-0.001
 too-0.001
Token messy
Token ID24097
Feature activation-9.17e-03

Pos logit contributions
ogs+0.008
dies+0.008
drivers+0.008
ests+0.008
 villains+0.008
Neg logit contributions
 entire-0.005
ifying-0.005
 too-0.004
flower-0.004
chie-0.004
Token and
Token ID290
Feature activation+5.42e-04

Pos logit contributions
ogs+0.015
 villains+0.015
drivers+0.015
dies+0.014
uddy+0.014
Neg logit contributions
 entire-0.013
ifying-0.012
 exact-0.010
flower-0.010
 too-0.010
Token,
Token ID11
Feature activation+1.20e-02

Pos logit contributions
drivers+0.000
 villains+0.000
dies+0.000
renches+0.000
ests+0.000
Neg logit contributions
 my-0.000
 entire-0.000
 too-0.000
ifying-0.000
 mine-0.000
Token Tim
Token ID5045
Feature activation-1.16e-02

Pos logit contributions
 entire+0.018
 too+0.017
 my+0.017
ifying+0.017
flower+0.016
Neg logit contributions
drivers-0.019
renches-0.017
ogs-0.017
ves-0.016
dies-0.016
Tokenmy
Token ID1820
Feature activation-1.73e-02

Pos logit contributions
drivers+0.024
dies+0.023
ogs+0.022
 villains+0.022
ves+0.022
Neg logit contributions
 entire-0.013
ifying-0.011
flower-0.010
 exact-0.009
chie-0.009
Token drove
Token ID10357
Feature activation-2.62e-02

Pos logit contributions
drivers+0.032
dies+0.030
ogs+0.030
 villains+0.029
 weapons+0.029
Neg logit contributions
 entire-0.022
flower-0.018
ifying-0.018
 too-0.016
 exact-0.016
Token his
Token ID465
Feature activation-6.57e-03

Pos logit contributions
drivers+0.057
resp+0.053
ogs+0.053
ves+0.053
renches+0.052
Neg logit contributions
 too-0.025
 away-0.024
 entire-0.023
 so-0.022
 back-0.022
Token je
Token ID11223
Feature activation-5.09e-02

Pos logit contributions
drivers+0.011
ogs+0.011
dies+0.010
othe+0.010
oot+0.010
Neg logit contributions
 entire-0.010
 front-0.009
ifying-0.009
 highest-0.008
 whole-0.008
Tokenep
Token ID538
Feature activation-5.83e-03

Pos logit contributions
drivers+0.103
ogs+0.100
dies+0.100
 villains+0.098
con+0.094
Neg logit contributions
 entire-0.054
ifying-0.047
chie-0.041
flower-0.039
 exact-0.038
Token slowly
Token ID6364
Feature activation+6.39e-03

Pos logit contributions
drivers+0.011
dies+0.011
ogs+0.010
renches+0.010
othe+0.010
Neg logit contributions
 entire-0.007
 too-0.006
 whole-0.005
 back-0.005
 so-0.005
Token and
Token ID290
Feature activation-7.77e-03

Pos logit contributions
 entire+0.008
ifying+0.007
 too+0.007
 front+0.006
 beginning+0.006
Neg logit contributions
drivers-0.012
dies-0.012
 villains-0.011
ves-0.011
ogs-0.011
Token carefully
Token ID7773
Feature activation-7.74e-03

Pos logit contributions
ogs+0.016
ests+0.015
drivers+0.015
renches+0.015
dies+0.015
Neg logit contributions
 entire-0.008
 too-0.007
 still-0.007
 almost-0.007
 away-0.007
Token.
Token ID13
Feature activation-9.20e-03

Pos logit contributions
dies+0.013
ogs+0.013
 villains+0.012
ests+0.012
uddy+0.012
Neg logit contributions
 entire-0.010
ifying-0.009
 too-0.009
 front-0.009
 beginning-0.008
Token around
Token ID1088
Feature activation+2.77e-04

Pos logit contributions
drivers+0.012
dies+0.011
 villains+0.011
ogs+0.011
 weapons+0.011
Neg logit contributions
 entire-0.008
ifying-0.007
chie-0.006
 front-0.006
 exact-0.006
Token,
Token ID11
Feature activation-4.54e-04

Pos logit contributions
 entire+0.000
 too+0.000
 front+0.000
 so+0.000
 beginning+0.000
Neg logit contributions
drivers-0.001
ests-0.001
 villains-0.001
dies-0.001
renches-0.001
Token finding
Token ID4917
Feature activation-6.07e-03

Pos logit contributions
drivers+0.001
ogs+0.001
dies+0.001
 villains+0.001
 weapons+0.001
Neg logit contributions
 entire-0.001
ifying-0.001
flower-0.001
chie-0.001
 exact-0.001
Token all
Token ID477
Feature activation-3.68e-03

Pos logit contributions
drivers+0.011
 villains+0.011
dies+0.011
uddy+0.011
ogs+0.011
Neg logit contributions
 entire-0.008
ifying-0.006
 exact-0.006
 prett-0.006
 front-0.006
Token sorts
Token ID10524
Feature activation-3.21e-02

Pos logit contributions
drivers+0.006
dies+0.006
 villains+0.006
ogs+0.005
 weapons+0.005
Neg logit contributions
 entire-0.006
ifying-0.005
 exact-0.005
flower-0.005
chie-0.005
Token of
Token ID286
Feature activation-5.28e-02

Pos logit contributions
dies+0.054
drivers+0.053
 villains+0.050
ogs+0.048
 Dino+0.047
Neg logit contributions
 entire-0.046
 too-0.041
ifying-0.039
chie-0.038
 so-0.037
Token old
Token ID1468
Feature activation-6.25e-02

Pos logit contributions
drivers+0.095
ogs+0.094
dies+0.092
othe+0.092
oot+0.091
Neg logit contributions
 entire-0.067
 exact-0.054
 too-0.049
ifying-0.049
 prett-0.048
Token trains
Token ID13404
Feature activation-4.08e-02

Pos logit contributions
othe+0.097
ogs+0.089
oot+0.089
ogged+0.087
 villains+0.087
Neg logit contributions
 entire-0.095
 too-0.078
 front-0.076
 exact-0.076
ifying-0.074
Token and
Token ID290
Feature activation-1.23e-02

Pos logit contributions
dies+0.080
 villains+0.080
drivers+0.077
renches+0.076
ogs+0.075
Neg logit contributions
 entire-0.043
 too-0.038
ifying-0.037
 so-0.034
flower-0.031
Token station
Token ID4429
Feature activation-1.38e-02

Pos logit contributions
ogs+0.022
ogged+0.020
dies+0.019
ests+0.019
ug+0.019
Neg logit contributions
 entire-0.017
ifying-0.015
 too-0.014
 exact-0.014
 so-0.014
Token signs
Token ID5895
Feature activation-1.69e-02

Pos logit contributions
 villains+0.025
dies+0.024
ogs+0.024
drivers+0.023
ug+0.023
Neg logit contributions
 entire-0.017
ifying-0.015
 too-0.014
 exact-0.014
flower-0.013