TOP ACTIVATIONS
MAX = 0.050

happy to see our big cuc umber ." Lily says .
. You found a nice cuc umber . It will be
have stopped the rh in oc eros . He is sleeping
It is the rh in oc eros !" The
I think it is a cuc umber . Mama puts cuc
thanked the elderly rh in oc eros for his wonderful surprise
try to grab the av oc ados . But the rock
store to get more av oc ados . Gary looked at
paid him for his av oc ados . Tom
. And this is a cuc umber . It is green
The scientist asked the ot ter to swim around the
and picked the right av oc ados . Sadly
They forget about the av oc ados . They just want
will take the rh in oc eros back to the zoo
Sam hugged the rh in oc eros goodbye , then ran
had never seen the av oc ados . They wished they
thank you . We love cuc umbers . And we love
garden . They cut the cuc umber and put it in
're the best rh in oc eros ever !" Tim my
It was the rh in oc eros . It lifted its

MIN ACTIVATIONS
MIN = -0.043

a snake with me . Deal ?" L ily
you have to share . Deal ?" Lily said " Deal
fold the laundry first . Deal ?" L ily
put away your blocks . Deal ?" L ily
nice after each one . Deal ?" Max nodded
any more without asking . Deal ?" L ily
we get back home . Deal ?" S ara
me fix my taxi . Deal ?" he said .
to play after that . Deal ?" Her mom
. She had a red tr icycle with a big wheel
please and thank you . Deal ?" S ara
. She had a big tr icycle that she would ride
had a pet dog called Bos co . Bos co liked
write on paper only . Deal ?" L ily
her sc ooter . She tr und led along the sidewalk
a sc ooter and a tr icycle . She loved to
and eat your food . Deal ?" Anna and
to say thank you . Deal ?" Anna nodded
," she said . " Not today . But let â
have to say okay . Deal ?" L ily

INTERVAL 0.027 - 0.050
CONTAINS 0.015%

, I can play the fl ute like you ."
's house was built long ago . L ily
She thought the rh in oc eros might like the leaves
sw am away . The ot ter thought it was so
, something strange caught her eye . It was a very

INTERVAL 0.003 - 0.027
CONTAINS 29.317%

red je ep . <|endoftext|> Once upon a time , there
teasing him . The lion lived happily ever after and was
class a quiz . Tim my didn 't know any of
then , a friendly King appeared . He said , �
, as everyone learned and played together . <|endoftext|> Once upon

INTERVAL -0.020 - 0.003
CONTAINS 70.481%

. She peek ed around the corner and saw a pretty
!" Lily said . " Let 's show it to Mom
all his veggies and never had a conflict with his mom
continued exploring the market . She guessed that she would have
Mom ." Mom hugged him and said , "

INTERVAL -0.043 - -0.020
CONTAINS 0.187%

sticks poking out and leaves all over the place . Some
said . " Maybe it is bored ," Dad
the ice cream . They think the white ice cream and
be patient . H aley didn 't know what
you want some juice ?" Mom asked . "
Token happy
Token ID3772
Feature activation+2.13e-03

Pos logit contributions
per+0.000
ula+0.000
 with+0.000
 from+0.000
ster+0.000
Neg logit contributions
anne-0.000
lights-0.000
storm-0.000
storms-0.000
 Susan-0.000
Token to
Token ID284
Feature activation-6.62e-05

Pos logit contributions
 with+0.001
 from+0.001
 by+0.001
per+0.001
ula+0.001
Neg logit contributions
anne-0.006
lights-0.005
storm-0.005
storms-0.005
 Susan-0.005
Token see
Token ID766
Feature activation+7.97e-03

Pos logit contributions
anne+0.000
lights+0.000
storm+0.000
storms+0.000
 Susan+0.000
Neg logit contributions
per-0.000
ula-0.000
ster-0.000
 from-0.000
 with-0.000
Token our
Token ID674
Feature activation-7.20e-03

Pos logit contributions
 from+0.006
per+0.006
ula+0.006
 with+0.006
 by+0.005
Neg logit contributions
anne-0.021
lights-0.018
storm-0.018
storms-0.017
 Susan-0.016
Token big
Token ID1263
Feature activation+4.76e-03

Pos logit contributions
anne+0.013
lights+0.011
storm+0.010
storms+0.010
 marsh+0.010
Neg logit contributions
ula-0.011
per-0.011
 from-0.010
ster-0.010
 St-0.010
Token cuc
Token ID38421
Feature activation+4.86e-02

Pos logit contributions
ula+0.006
per+0.006
ster+0.006
 from+0.006
 St+0.005
Neg logit contributions
anne-0.010
lights-0.008
storm-0.008
storms-0.008
 Susan-0.007
Tokenumber
Token ID4494
Feature activation-6.73e-04

Pos logit contributions
 from+0.064
 St+0.064
 this+0.063
 liking+0.061
 My+0.061
Neg logit contributions
anne-0.072
acket-0.060
lights-0.060
 marsh-0.058
storm-0.057
Token."
Token ID526
Feature activation-5.13e-03

Pos logit contributions
anne+0.002
lights+0.001
storm+0.001
storms+0.001
 Susan+0.001
Neg logit contributions
ula-0.001
per-0.001
ster-0.001
 St-0.001
 from-0.001
Token Lily
Token ID20037
Feature activation-1.46e-02

Pos logit contributions
anne+0.012
storm+0.010
lights+0.010
storms+0.010
rated+0.009
Neg logit contributions
 St-0.006
 from-0.006
ula-0.006
 with-0.006
per-0.005
Token says
Token ID1139
Feature activation-8.20e-03

Pos logit contributions
anne+0.029
lights+0.025
storm+0.024
storms+0.022
 Susan+0.022
Neg logit contributions
per-0.020
 from-0.019
ula-0.019
 with-0.019
 St-0.019
Token.
Token ID13
Feature activation+2.19e-03

Pos logit contributions
anne+0.017
lights+0.014
storms+0.013
 Susan+0.013
storm+0.013
Neg logit contributions
ula-0.010
per-0.009
ster-0.009
 St-0.009
atching-0.009
Token.
Token ID13
Feature activation-6.87e-04

Pos logit contributions
 from+0.003
 with+0.002
per+0.002
'+0.002
 by+0.002
Neg logit contributions
anne-0.005
lights-0.005
storm-0.004
 Susan-0.004
storms-0.004
Token You
Token ID921
Feature activation+7.18e-03

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
storms+0.001
 marsh+0.001
Neg logit contributions
ula-0.001
 St-0.001
per-0.001
 with-0.001
 from-0.001
Token found
Token ID1043
Feature activation-2.18e-03

Pos logit contributions
per+0.006
 from+0.005
 solve+0.005
 with+0.005
'+0.005
Neg logit contributions
anne-0.019
lights-0.017
storm-0.016
storms-0.015
 Susan-0.015
Token a
Token ID257
Feature activation+2.71e-03

Pos logit contributions
anne+0.006
storm+0.006
lights+0.005
storms+0.005
rated+0.005
Neg logit contributions
 from-0.002
 with-0.001
 by-0.001
 this-0.001
 those-0.001
Token nice
Token ID3621
Feature activation-4.26e-03

Pos logit contributions
ula+0.003
ster+0.003
 with+0.003
per+0.003
 from+0.003
Neg logit contributions
anne-0.006
lights-0.005
storms-0.005
storm-0.005
 Susan-0.005
Token cuc
Token ID38421
Feature activation+4.77e-02

Pos logit contributions
anne+0.010
lights+0.008
storms+0.008
storm+0.008
 Susan+0.007
Neg logit contributions
per-0.005
 with-0.005
ula-0.005
 from-0.005
ster-0.005
Tokenumber
Token ID4494
Feature activation+6.42e-03

Pos logit contributions
 figure+0.065
 St+0.063
 from+0.063
 solve+0.062
 liking+0.061
Neg logit contributions
anne-0.063
acket-0.055
storm-0.054
akes-0.053
 marsh-0.052
Token.
Token ID13
Feature activation-4.14e-04

Pos logit contributions
ula+0.006
per+0.006
 St+0.006
ster+0.005
atching+0.005
Neg logit contributions
anne-0.015
storm-0.013
lights-0.012
storms-0.012
 Susan-0.012
Token It
Token ID632
Feature activation+9.41e-04

Pos logit contributions
anne+0.001
 Susan+0.001
 John+0.001
lights+0.001
 Jessie+0.001
Neg logit contributions
per-0.001
ula-0.000
ster-0.000
gg-0.000
or-0.000
Token will
Token ID481
Feature activation+3.09e-03

Pos logit contributions
per+0.001
ula+0.001
 from+0.001
 with+0.001
ster+0.001
Neg logit contributions
anne-0.002
lights-0.002
storm-0.002
storms-0.002
 Susan-0.002
Token be
Token ID307
Feature activation-8.16e-04

Pos logit contributions
ula+0.003
per+0.003
ster+0.003
atching+0.003
 St+0.003
Neg logit contributions
anne-0.007
storm-0.006
lights-0.006
 marsh-0.006
 Susan-0.006
Token have
Token ID423
Feature activation+1.46e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.005
storms+0.005
 Susan+0.005
Neg logit contributions
per-0.002
 from-0.002
ula-0.002
 with-0.002
ster-0.002
Token stopped
Token ID5025
Feature activation-1.15e-04

Pos logit contributions
 from+0.001
 with+0.001
per+0.001
 by+0.001
 this+0.001
Neg logit contributions
anne-0.004
lights-0.003
storm-0.003
rated-0.003
storms-0.003
Token the
Token ID262
Feature activation-4.90e-03

Pos logit contributions
anne+0.000
lights+0.000
storm+0.000
storms+0.000
 Susan+0.000
Neg logit contributions
ula-0.000
per-0.000
ster-0.000
atching-0.000
 St-0.000
Token rh
Token ID9529
Feature activation+3.07e-02

Pos logit contributions
anne+0.010
lights+0.008
storm+0.008
storms+0.008
 Susan+0.008
Neg logit contributions
per-0.006
ula-0.006
 from-0.006
ster-0.006
 with-0.006
Tokenin
Token ID259
Feature activation-6.19e-03

Pos logit contributions
ula+0.041
per+0.036
 St+0.036
 from+0.036
 solve+0.034
Neg logit contributions
anne-0.059
storm-0.052
lights-0.050
storms-0.046
 marsh-0.044
Tokenoc
Token ID420
Feature activation+4.55e-02

Pos logit contributions
anne+0.015
storm+0.013
lights+0.012
storms+0.012
 marsh+0.012
Neg logit contributions
ula-0.006
per-0.006
 St-0.005
atching-0.005
ster-0.005
Tokeneros
Token ID27498
Feature activation+4.81e-03

Pos logit contributions
ula+0.073
 solve+0.065
per+0.064
 St+0.063
 figure+0.063
Neg logit contributions
anne-0.076
storm-0.067
lights-0.063
storms-0.060
 marsh-0.057
Token.
Token ID13
Feature activation+4.12e-03

Pos logit contributions
per+0.004
ula+0.004
ster+0.004
 from+0.004
 St+0.004
Neg logit contributions
anne-0.012
lights-0.010
storm-0.010
storms-0.009
 Susan-0.009
Token He
Token ID679
Feature activation+7.44e-03

Pos logit contributions
per+0.006
ula+0.006
ster+0.006
 from+0.005
 St+0.005
Neg logit contributions
anne-0.008
lights-0.006
storm-0.006
storms-0.006
 Susan-0.006
Token is
Token ID318
Feature activation+6.25e-04

Pos logit contributions
per+0.007
 with+0.007
 from+0.007
ster+0.007
 St+0.007
Neg logit contributions
anne-0.018
lights-0.016
storm-0.015
 Susan-0.014
storms-0.014
Token sleeping
Token ID11029
Feature activation+1.39e-02

Pos logit contributions
per+0.001
ula+0.001
 from+0.001
 with+0.001
 St+0.001
Neg logit contributions
anne-0.002
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
TokenIt
Token ID1026
Feature activation-2.84e-05

Pos logit contributions
ula+0.015
per+0.015
ster+0.014
 from+0.014
 St+0.014
Neg logit contributions
anne-0.019
lights-0.016
storm-0.015
storms-0.015
 Susan-0.014
Token is
Token ID318
Feature activation-3.51e-04

Pos logit contributions
anne+0.000
lights+0.000
storm+0.000
 Susan+0.000
storms+0.000
Neg logit contributions
 with-0.000
 from-0.000
per-0.000
 by-0.000
ster-0.000
Token the
Token ID262
Feature activation+1.53e-02

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
storms+0.001
jo+0.001
Neg logit contributions
 from-0.000
 with-0.000
per-0.000
 by-0.000
 St-0.000
Token rh
Token ID9529
Feature activation+2.76e-02

Pos logit contributions
per+0.020
ula+0.019
ster+0.018
 from+0.018
 St+0.018
Neg logit contributions
anne-0.032
lights-0.026
storm-0.025
storms-0.024
 Susan-0.024
Tokenin
Token ID259
Feature activation-2.19e-03

Pos logit contributions
ula+0.037
 St+0.034
per+0.033
olding+0.031
 from+0.031
Neg logit contributions
anne-0.052
lights-0.046
storm-0.045
 marsh-0.041
storms-0.040
Tokenoc
Token ID420
Feature activation+4.45e-02

Pos logit contributions
anne+0.005
storm+0.004
lights+0.004
 marsh+0.004
storms+0.004
Neg logit contributions
ula-0.003
per-0.002
 St-0.002
atching-0.002
olding-0.002
Tokeneros
Token ID27498
Feature activation+2.79e-03

Pos logit contributions
ula+0.072
 solve+0.064
 St+0.064
per+0.063
etic+0.061
Neg logit contributions
anne-0.072
lights-0.063
storm-0.062
storms-0.057
 lava-0.057
Token!"
Token ID2474
Feature activation+6.72e-03

Pos logit contributions
ula+0.003
per+0.003
ster+0.003
 from+0.002
 St+0.002
Neg logit contributions
anne-0.007
lights-0.006
storm-0.005
storms-0.005
 Susan-0.005
Token
Token ID198
Feature activation+2.33e-03

Pos logit contributions
 with+0.006
ula+0.006
 from+0.006
per+0.006
 St+0.005
Neg logit contributions
anne-0.017
storm-0.015
lights-0.015
storms-0.014
rated-0.013
Token
Token ID198
Feature activation-1.02e-03

Pos logit contributions
per+0.002
ula+0.002
 with+0.002
 from+0.002
 St+0.002
Neg logit contributions
anne-0.006
storm-0.005
lights-0.005
storms-0.005
 Susan-0.004
TokenThe
Token ID464
Feature activation-2.33e-03

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
 Susan+0.002
 marsh+0.002
Neg logit contributions
ula-0.001
per-0.001
 St-0.001
atching-0.001
ster-0.001
TokenI
Token ID40
Feature activation-9.84e-03

Pos logit contributions
per+0.009
ula+0.009
 from+0.009
 solve+0.008
 by+0.008
Neg logit contributions
anne-0.010
lights-0.008
storm-0.008
storms-0.008
 marsh-0.007
Token think
Token ID892
Feature activation-2.56e-02

Pos logit contributions
anne+0.023
storm+0.018
 Susan+0.017
storms+0.017
 marsh+0.017
Neg logit contributions
ula-0.011
per-0.010
ster-0.010
atching-0.009
 from-0.009
Token it
Token ID340
Feature activation+3.36e-03

Pos logit contributions
anne+0.063
lights+0.053
storm+0.052
storms+0.050
 Susan+0.048
Neg logit contributions
ula-0.024
per-0.024
 from-0.023
 with-0.022
ster-0.022
Token is
Token ID318
Feature activation+4.57e-03

Pos logit contributions
 with+0.002
 from+0.002
per+0.002
 by+0.002
ster+0.002
Neg logit contributions
anne-0.009
lights-0.008
storm-0.008
 Susan-0.007
storms-0.007
Token a
Token ID257
Feature activation+8.82e-03

Pos logit contributions
 from+0.004
 with+0.004
per+0.004
 by+0.004
ula+0.004
Neg logit contributions
anne-0.011
lights-0.010
storm-0.009
storms-0.009
 Susan-0.009
Token cuc
Token ID38421
Feature activation+4.43e-02

Pos logit contributions
per+0.011
ula+0.011
ster+0.010
 from+0.010
 St+0.010
Neg logit contributions
anne-0.018
lights-0.015
storm-0.015
storms-0.014
 Susan-0.014
Tokenumber
Token ID4494
Feature activation+4.95e-03

Pos logit contributions
 this+0.059
 St+0.057
 from+0.057
 solve+0.056
 liking+0.056
Neg logit contributions
anne-0.064
storm-0.053
acket-0.053
 marsh-0.052
storms-0.050
Token.
Token ID13
Feature activation+5.47e-03

Pos logit contributions
ula+0.005
per+0.005
ster+0.004
 St+0.004
atching+0.004
Neg logit contributions
anne-0.012
lights-0.010
storm-0.009
storms-0.009
 Susan-0.009
Token Mama
Token ID40084
Feature activation+5.29e-03

Pos logit contributions
ula+0.008
per+0.008
ster+0.007
or+0.007
 solve+0.007
Neg logit contributions
anne-0.009
lights-0.008
 Susan-0.007
storm-0.007
storms-0.007
Token puts
Token ID7584
Feature activation+1.30e-03

Pos logit contributions
per+0.006
 from+0.006
 with+0.006
ster+0.005
 by+0.005
Neg logit contributions
anne-0.012
lights-0.011
storm-0.010
storms-0.009
 Susan-0.009
Token cuc
Token ID38421
Feature activation+3.64e-02

Pos logit contributions
per+0.001
ula+0.001
 from+0.001
 with+0.001
ster+0.001
Neg logit contributions
anne-0.003
lights-0.002
storm-0.002
storms-0.002
 Susan-0.002
Token thanked
Token ID26280
Feature activation-2.52e-03

Pos logit contributions
anne+0.007
lights+0.007
storm+0.006
storms+0.006
 marsh+0.006
Neg logit contributions
 from-0.006
 by-0.005
 with-0.005
per-0.005
'-0.005
Token the
Token ID262
Feature activation+4.18e-03

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
 Susan+0.004
storms+0.004
Neg logit contributions
ula-0.003
per-0.003
ster-0.003
atching-0.003
 from-0.003
Token elderly
Token ID13830
Feature activation+1.59e-02

Pos logit contributions
ula+0.006
per+0.005
ster+0.005
atching+0.005
 from+0.005
Neg logit contributions
anne-0.008
lights-0.007
storm-0.007
storms-0.007
 Susan-0.006
Token rh
Token ID9529
Feature activation+2.79e-02

Pos logit contributions
ula+0.022
per+0.021
atching+0.020
 St+0.020
ster+0.020
Neg logit contributions
anne-0.031
lights-0.025
storm-0.025
storms-0.024
 Susan-0.024
Tokenin
Token ID259
Feature activation+5.17e-03

Pos logit contributions
ula+0.040
 St+0.036
per+0.034
 from+0.034
 liking+0.033
Neg logit contributions
anne-0.052
storm-0.041
lights-0.041
storms-0.039
 Susan-0.037
Tokenoc
Token ID420
Feature activation+4.39e-02

Pos logit contributions
ula+0.006
per+0.005
atching+0.005
yle+0.005
 St+0.005
Neg logit contributions
anne-0.012
storm-0.009
 Susan-0.009
 marsh-0.009
lights-0.009
Tokeneros
Token ID27498
Feature activation+1.57e-02

Pos logit contributions
ula+0.070
 St+0.064
per+0.063
 figure+0.063
 solve+0.062
Neg logit contributions
anne-0.075
storm-0.060
lights-0.059
storms-0.057
 Susan-0.054
Token for
Token ID329
Feature activation+5.37e-03

Pos logit contributions
ula+0.009
per+0.009
ster+0.007
atching+0.007
 St+0.007
Neg logit contributions
anne-0.043
lights-0.038
storm-0.037
storms-0.036
 Susan-0.036
Token his
Token ID465
Feature activation+7.10e-03

Pos logit contributions
ula+0.006
per+0.006
atching+0.006
ster+0.006
olding+0.005
Neg logit contributions
anne-0.011
 Susan-0.009
lights-0.009
storm-0.009
 John-0.009
Token wonderful
Token ID7932
Feature activation-5.02e-03

Pos logit contributions
ula+0.008
per+0.008
 with+0.008
 from+0.008
 solve+0.008
Neg logit contributions
anne-0.016
lights-0.013
storm-0.013
storms-0.012
 Susan-0.012
Token surprise
Token ID5975
Feature activation+9.32e-03

Pos logit contributions
anne+0.010
lights+0.008
storms+0.008
storm+0.008
 Susan+0.007
Neg logit contributions
 with-0.008
 by-0.007
 from-0.007
 for-0.007
 solve-0.007
Token try
Token ID1949
Feature activation-8.39e-03

Pos logit contributions
per+0.006
 with+0.005
 from+0.005
 St+0.005
ula+0.005
Neg logit contributions
anne-0.012
lights-0.010
storm-0.010
storms-0.010
 Susan-0.010
Token to
Token ID284
Feature activation-2.26e-03

Pos logit contributions
anne+0.020
lights+0.017
storm+0.016
storms+0.016
 marsh+0.016
Neg logit contributions
per-0.008
ula-0.008
ster-0.008
atching-0.007
 St-0.007
Token grab
Token ID5552
Feature activation-9.45e-03

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
 marsh+0.004
 Susan+0.004
Neg logit contributions
ula-0.002
per-0.002
ster-0.002
atching-0.002
 St-0.002
Token the
Token ID262
Feature activation+7.48e-03

Pos logit contributions
anne+0.024
storm+0.020
lights+0.020
storms+0.019
 Susan+0.018
Neg logit contributions
per-0.009
ula-0.009
 from-0.008
 with-0.008
 St-0.008
Token av
Token ID1196
Feature activation-1.97e-03

Pos logit contributions
per+0.009
ula+0.009
ster+0.009
 from+0.009
 with+0.009
Neg logit contributions
anne-0.016
lights-0.013
storm-0.013
storms-0.012
 Susan-0.012
Tokenoc
Token ID420
Feature activation+4.39e-02

Pos logit contributions
anne+0.004
storm+0.003
storms+0.003
die+0.003
lights+0.003
Neg logit contributions
ula-0.002
per-0.002
 St-0.002
olding-0.002
 figure-0.002
Tokenados
Token ID22484
Feature activation-2.41e-03

Pos logit contributions
ula+0.069
per+0.064
 St+0.064
 from+0.062
 figure+0.061
Neg logit contributions
anne-0.077
lights-0.062
storm-0.060
storms-0.058
 Susan-0.057
Token.
Token ID13
Feature activation-8.67e-04

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
storms+0.004
 marsh+0.004
Neg logit contributions
ula-0.003
 St-0.002
ster-0.002
Is-0.002
atching-0.002
Token But
Token ID887
Feature activation-7.78e-04

Pos logit contributions
anne+0.002
lights+0.001
storm+0.001
storms+0.001
 Susan+0.001
Neg logit contributions
ula-0.001
per-0.001
 St-0.001
 from-0.001
 with-0.001
Token the
Token ID262
Feature activation+7.68e-03

Pos logit contributions
anne+0.002
lights+0.001
storm+0.001
 marsh+0.001
 Susan+0.001
Neg logit contributions
per-0.001
ula-0.001
ster-0.001
atching-0.001
olding-0.001
Token rock
Token ID3881
Feature activation-3.39e-03

Pos logit contributions
per+0.008
ster+0.008
 with+0.008
 from+0.008
 by+0.008
Neg logit contributions
anne-0.018
lights-0.015
storm-0.014
storms-0.014
 Susan-0.013
Token store
Token ID3650
Feature activation-4.56e-04

Pos logit contributions
ula+0.007
per+0.007
atching+0.006
gg+0.006
 from+0.006
Neg logit contributions
anne-0.009
storm-0.007
lights-0.007
 marsh-0.007
 Susan-0.007
Token to
Token ID284
Feature activation-1.38e-03

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
storms+0.001
 Susan+0.001
Neg logit contributions
ula-0.000
per-0.000
ster-0.000
 St-0.000
 from-0.000
Token get
Token ID651
Feature activation+4.82e-04

Pos logit contributions
anne+0.003
 Susan+0.003
storm+0.003
lights+0.003
 marsh+0.003
Neg logit contributions
ula-0.001
per-0.001
ster-0.001
atching-0.001
gg-0.001
Token more
Token ID517
Feature activation+3.64e-03

Pos logit contributions
 from+0.000
 with+0.000
 by+0.000
ula+0.000
 St+0.000
Neg logit contributions
anne-0.001
storm-0.001
lights-0.001
storms-0.001
jo-0.001
Token av
Token ID1196
Feature activation-2.94e-04

Pos logit contributions
 with+0.004
 from+0.004
per+0.004
 by+0.004
ula+0.004
Neg logit contributions
anne-0.008
storm-0.007
lights-0.007
storms-0.006
 Susan-0.006
Tokenoc
Token ID420
Feature activation+4.28e-02

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
storms+0.001
 marsh+0.001
Neg logit contributions
ula-0.000
per-0.000
 St-0.000
olding-0.000
ster-0.000
Tokenados
Token ID22484
Feature activation-5.88e-04

Pos logit contributions
ula+0.070
per+0.066
 St+0.065
 figure+0.065
 from+0.064
Neg logit contributions
anne-0.069
lights-0.055
storms-0.053
storm-0.052
 Susan-0.052
Token.
Token ID13
Feature activation+2.82e-04

Pos logit contributions
anne+0.002
lights+0.001
storm+0.001
storms+0.001
 marsh+0.001
Neg logit contributions
 from-0.000
per-0.000
 with-0.000
ula-0.000
 by-0.000
Token Gary
Token ID10936
Feature activation+1.07e-02

Pos logit contributions
 St+0.000
 with+0.000
per+0.000
ula+0.000
 from+0.000
Neg logit contributions
anne-0.001
lights-0.000
storm-0.000
rated-0.000
 marsh-0.000
Token looked
Token ID3114
Feature activation+6.22e-05

Pos logit contributions
per+0.013
 with+0.012
ula+0.012
 from+0.012
 St+0.012
Neg logit contributions
anne-0.023
lights-0.020
storm-0.019
storms-0.018
 Susan-0.018
Token at
Token ID379
Feature activation+1.40e-02

Pos logit contributions
per+0.000
ula+0.000
 from+0.000
 with+0.000
ster+0.000
Neg logit contributions
anne-0.000
lights-0.000
storm-0.000
storms-0.000
 Susan-0.000
Token paid
Token ID3432
Feature activation-3.02e-04

Pos logit contributions
anne+0.006
lights+0.005
storms+0.005
storm+0.005
 Susan+0.005
Neg logit contributions
 from-0.004
per-0.004
 with-0.004
 solve-0.004
 figure-0.004
Token him
Token ID683
Feature activation+3.40e-03

Pos logit contributions
anne+0.001
storm+0.001
lights+0.001
storms+0.001
 Susan+0.001
Neg logit contributions
per-0.000
ula-0.000
 from-0.000
 with-0.000
 by-0.000
Token for
Token ID329
Feature activation-5.48e-03

Pos logit contributions
per+0.003
ula+0.003
ster+0.003
 from+0.003
 St+0.003
Neg logit contributions
anne-0.008
lights-0.007
storm-0.007
storms-0.007
 Susan-0.007
Token his
Token ID465
Feature activation+3.36e-03

Pos logit contributions
anne+0.014
lights+0.011
storm+0.011
storms+0.011
rated+0.010
Neg logit contributions
per-0.005
ula-0.005
 with-0.005
 from-0.005
 solve-0.005
Token av
Token ID1196
Feature activation+3.37e-03

Pos logit contributions
 from+0.004
 with+0.004
 figure+0.003
 solve+0.003
ster+0.003
Neg logit contributions
anne-0.008
lights-0.007
storms-0.006
storm-0.006
 Susan-0.006
Tokenoc
Token ID420
Feature activation+4.26e-02

Pos logit contributions
ula+0.002
per+0.002
ster+0.002
 from+0.002
 St+0.002
Neg logit contributions
anne-0.009
lights-0.008
storm-0.007
storms-0.007
 Susan-0.007
Tokenados
Token ID22484
Feature activation+5.62e-03

Pos logit contributions
ula+0.054
per+0.053
 from+0.050
 St+0.049
ster+0.048
Neg logit contributions
anne-0.088
lights-0.073
storm-0.071
storms-0.068
 Susan-0.067
Token.
Token ID13
Feature activation-4.90e-04

Pos logit contributions
per+0.005
ula+0.004
 from+0.004
 with+0.004
ster+0.004
Neg logit contributions
anne-0.015
lights-0.013
storm-0.012
storms-0.012
 Susan-0.012
Token
Token ID198
Feature activation+1.62e-03

Pos logit contributions
anne+0.001
storm+0.001
lights+0.001
storms+0.001
 marsh+0.001
Neg logit contributions
ula-0.000
 St-0.000
per-0.000
 with-0.000
 from-0.000
Token
Token ID198
Feature activation-1.52e-04

Pos logit contributions
ula+0.002
per+0.002
 from+0.002
 with+0.002
ster+0.002
Neg logit contributions
anne-0.004
lights-0.003
storm-0.003
storms-0.003
 Susan-0.003
TokenTom
Token ID13787
Feature activation+5.95e-03

Pos logit contributions
anne+0.000
lights+0.000
 Susan+0.000
storm+0.000
 John+0.000
Neg logit contributions
ster-0.000
ula-0.000
per-0.000
 grant-0.000
aser-0.000
Token.
Token ID13
Feature activation+2.73e-04

Pos logit contributions
per+0.000
 from+0.000
 with+0.000
ula+0.000
ster+0.000
Neg logit contributions
anne-0.002
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
Token And
Token ID843
Feature activation-1.41e-02

Pos logit contributions
per+0.000
ula+0.000
ster+0.000
 from+0.000
atching+0.000
Neg logit contributions
anne-0.001
lights-0.001
storm-0.000
storms-0.000
 Susan-0.000
Token this
Token ID428
Feature activation+6.48e-03

Pos logit contributions
anne+0.031
lights+0.027
storm+0.026
 marsh+0.025
 Susan+0.025
Neg logit contributions
ula-0.015
per-0.015
ster-0.014
atching-0.013
 St-0.013
Token is
Token ID318
Feature activation-3.48e-03

Pos logit contributions
 with+0.006
ster+0.005
 from+0.005
per+0.005
'+0.005
Neg logit contributions
anne-0.016
lights-0.015
storm-0.014
storms-0.014
 Susan-0.013
Token a
Token ID257
Feature activation+6.44e-03

Pos logit contributions
anne+0.008
lights+0.007
storm+0.007
storms+0.007
 Susan+0.006
Neg logit contributions
per-0.004
ula-0.004
 from-0.003
 with-0.003
ster-0.003
Token cuc
Token ID38421
Feature activation+4.25e-02

Pos logit contributions
ula+0.008
per+0.007
ster+0.007
 with+0.007
 from+0.007
Neg logit contributions
anne-0.014
lights-0.012
storm-0.011
storms-0.011
 Susan-0.011
Tokenumber
Token ID4494
Feature activation+8.76e-03

Pos logit contributions
 from+0.056
 this+0.055
 solve+0.054
 St+0.054
 liking+0.053
Neg logit contributions
anne-0.068
storm-0.058
acket-0.057
 marsh-0.055
lights-0.055
Token.
Token ID13
Feature activation+2.24e-03

Pos logit contributions
ula+0.010
per+0.010
 St+0.009
ster+0.009
atching+0.009
Neg logit contributions
anne-0.019
lights-0.016
storm-0.016
storms-0.015
 Susan-0.015
Token It
Token ID632
Feature activation+5.54e-03

Pos logit contributions
per+0.003
ula+0.003
ster+0.003
atching+0.002
 from+0.002
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
storms-0.004
 marsh-0.004
Token is
Token ID318
Feature activation+2.34e-03

Pos logit contributions
per+0.006
ula+0.006
ster+0.005
 from+0.005
 St+0.005
Neg logit contributions
anne-0.013
lights-0.011
storm-0.010
storms-0.010
 Susan-0.010
Token green
Token ID4077
Feature activation+2.08e-02

Pos logit contributions
per+0.003
ula+0.003
ster+0.002
 from+0.002
atching+0.002
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
storms-0.004
 Susan-0.004
Token
Token ID198
Feature activation+3.13e-03

Pos logit contributions
per+0.003
ula+0.003
ster+0.003
 from+0.002
atching+0.002
Neg logit contributions
anne-0.004
lights-0.004
storm-0.003
storms-0.003
 Susan-0.003
TokenThe
Token ID464
Feature activation+1.23e-02

Pos logit contributions
ster+0.004
ula+0.004
per+0.004
aser+0.004
 permission+0.004
Neg logit contributions
anne-0.006
storm-0.005
Charlie-0.005
 Charlie-0.004
John-0.004
Token scientist
Token ID11444
Feature activation+1.25e-02

Pos logit contributions
ula+0.019
per+0.018
atching+0.017
 My+0.016
gg+0.016
Neg logit contributions
anne-0.018
 hurricane-0.017
storm-0.017
 marsh-0.015
 lava-0.015
Token asked
Token ID1965
Feature activation+8.57e-03

Pos logit contributions
per+0.018
 with+0.017
 from+0.017
ster+0.017
'+0.016
Neg logit contributions
anne-0.024
lights-0.021
storm-0.020
storms-0.019
 marsh-0.019
Token the
Token ID262
Feature activation+8.36e-03

Pos logit contributions
ula+0.012
atching+0.011
ster+0.010
band+0.010
per+0.010
Neg logit contributions
anne-0.016
 John-0.015
 Susan-0.014
 Charlie-0.014
 George-0.013
Token ot
Token ID30972
Feature activation+4.25e-02

Pos logit contributions
ula+0.012
per+0.012
atching+0.011
gg+0.011
 My+0.010
Neg logit contributions
anne-0.013
storm-0.012
 hurricane-0.011
 marsh-0.011
 imag-0.011
Tokenter
Token ID353
Feature activation+1.54e-02

Pos logit contributions
ula+0.051
 liking+0.044
 solve+0.044
per+0.043
 from+0.042
Neg logit contributions
anne-0.071
storms-0.064
storm-0.063
die-0.058
lights-0.057
Token to
Token ID284
Feature activation+5.96e-03

Pos logit contributions
ula+0.016
per+0.014
atching+0.013
gg+0.012
ster+0.012
Neg logit contributions
anne-0.036
lights-0.030
 Susan-0.029
 Jessie-0.028
storms-0.028
Token swim
Token ID9422
Feature activation-1.39e-03

Pos logit contributions
ula+0.006
per+0.006
ster+0.005
atching+0.005
 St+0.005
Neg logit contributions
anne-0.014
lights-0.012
storm-0.012
 Susan-0.011
 marsh-0.011
Token around
Token ID1088
Feature activation-8.88e-04

Pos logit contributions
anne+0.004
storm+0.003
lights+0.003
storms+0.003
 Susan+0.003
Neg logit contributions
per-0.001
 with-0.001
 from-0.001
 by-0.001
ula-0.001
Token the
Token ID262
Feature activation+7.96e-05

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.002
 Susan+0.002
Neg logit contributions
ula-0.001
per-0.001
ster-0.001
atching-0.001
gg-0.001
Token and
Token ID290
Feature activation+8.50e-04

Pos logit contributions
per+0.006
ula+0.005
 from+0.005
 with+0.005
ster+0.005
Neg logit contributions
anne-0.018
lights-0.016
storm-0.015
storms-0.015
 marsh-0.015
Token picked
Token ID6497
Feature activation-1.61e-03

Pos logit contributions
 from+0.001
 with+0.001
per+0.001
 solve+0.001
ster+0.001
Neg logit contributions
anne-0.002
lights-0.002
storm-0.002
storms-0.002
 marsh-0.001
Token the
Token ID262
Feature activation+7.61e-03

Pos logit contributions
anne+0.004
lights+0.003
storm+0.003
storms+0.003
 marsh+0.003
Neg logit contributions
 from-0.002
 with-0.002
per-0.001
 by-0.001
ula-0.001
Token right
Token ID826
Feature activation-6.59e-03

Pos logit contributions
per+0.010
ula+0.010
ster+0.009
 from+0.009
 with+0.009
Neg logit contributions
anne-0.016
lights-0.013
storm-0.013
storms-0.012
 Susan-0.012
Token av
Token ID1196
Feature activation+5.52e-03

Pos logit contributions
anne+0.015
lights+0.012
storm+0.012
storms+0.012
 Susan+0.011
Neg logit contributions
per-0.008
ula-0.007
 from-0.007
ster-0.007
 with-0.007
Tokenoc
Token ID420
Feature activation+4.22e-02

Pos logit contributions
ula+0.007
per+0.006
 St+0.006
 from+0.006
 figure+0.005
Neg logit contributions
anne-0.012
storm-0.009
lights-0.009
storms-0.009
die-0.009
Tokenados
Token ID22484
Feature activation+1.77e-03

Pos logit contributions
ula+0.063
per+0.060
 St+0.058
 from+0.056
 figure+0.055
Neg logit contributions
anne-0.078
lights-0.063
storm-0.062
 Susan-0.060
storms-0.059
Token.
Token ID13
Feature activation+1.60e-03

Pos logit contributions
ula+0.002
per+0.002
 St+0.002
ster+0.002
atching+0.002
Neg logit contributions
anne-0.004
lights-0.003
storm-0.003
storms-0.003
 Susan-0.003
Token
Token ID198
Feature activation+3.50e-03

Pos logit contributions
per+0.002
ula+0.002
 St+0.002
 with+0.002
 from+0.002
Neg logit contributions
anne-0.004
lights-0.003
storm-0.003
storms-0.003
 marsh-0.003
Token
Token ID198
Feature activation-6.51e-04

Pos logit contributions
per+0.004
ula+0.004
ster+0.004
 from+0.004
atching+0.004
Neg logit contributions
anne-0.007
lights-0.006
storm-0.006
storms-0.006
 Susan-0.006
TokenSadly
Token ID36725
Feature activation+1.59e-03

Pos logit contributions
anne+0.001
storm+0.001
 Susan+0.001
lights+0.001
Charlie+0.001
Neg logit contributions
ster-0.001
per-0.001
ula-0.001
 from-0.001
 St-0.001
Token They
Token ID1119
Feature activation+1.27e-03

Pos logit contributions
anne+0.002
lights+0.001
storm+0.001
 Susan+0.001
storms+0.001
Neg logit contributions
ula-0.001
per-0.001
 with-0.001
 St-0.001
 from-0.001
Token forget
Token ID6044
Feature activation-1.97e-04

Pos logit contributions
per+0.001
ula+0.001
 from+0.001
 with+0.001
 St+0.001
Neg logit contributions
anne-0.003
lights-0.002
storm-0.002
storms-0.002
 Susan-0.002
Token about
Token ID546
Feature activation-8.27e-04

Pos logit contributions
anne+0.000
lights+0.000
storm+0.000
storms+0.000
 Susan+0.000
Neg logit contributions
per-0.000
ula-0.000
ster-0.000
 from-0.000
 St-0.000
Token the
Token ID262
Feature activation+3.89e-03

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.002
 Susan+0.002
Neg logit contributions
ula-0.001
per-0.001
 from-0.001
 with-0.001
 St-0.001
Token av
Token ID1196
Feature activation-3.31e-03

Pos logit contributions
per+0.004
ster+0.004
 by+0.004
 with+0.004
 figure+0.004
Neg logit contributions
anne-0.009
lights-0.007
 Susan-0.007
storm-0.007
storms-0.007
Tokenoc
Token ID420
Feature activation+4.20e-02

Pos logit contributions
anne+0.008
storm+0.006
lights+0.006
storms+0.006
 marsh+0.006
Neg logit contributions
ula-0.004
per-0.003
 St-0.003
 from-0.003
olding-0.003
Tokenados
Token ID22484
Feature activation-1.20e-04

Pos logit contributions
ula+0.065
per+0.061
 St+0.060
 from+0.058
 figure+0.056
Neg logit contributions
anne-0.075
lights-0.062
storm-0.062
storms-0.058
 Susan-0.057
Token.
Token ID13
Feature activation-1.08e-03

Pos logit contributions
anne+0.000
lights+0.000
storm+0.000
storms+0.000
 marsh+0.000
Neg logit contributions
ula-0.000
per-0.000
 St-0.000
ster-0.000
atching-0.000
Token They
Token ID1119
Feature activation-2.72e-04

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
 Susan+0.001
storms+0.001
Neg logit contributions
ula-0.002
 St-0.002
 with-0.002
per-0.002
 from-0.002
Token just
Token ID655
Feature activation-1.06e-02

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
storms+0.001
 Susan+0.001
Neg logit contributions
per-0.000
 from-0.000
 solve-0.000
 with-0.000
 figure-0.000
Token want
Token ID765
Feature activation-1.05e-02

Pos logit contributions
anne+0.026
lights+0.025
storm+0.022
 Susan+0.022
storms+0.021
Neg logit contributions
per-0.010
 do-0.010
 from-0.010
 with-0.010
 give-0.010
Token will
Token ID481
Feature activation+1.30e-04

Pos logit contributions
per+0.006
'+0.006
 with+0.006
 solve+0.005
 do+0.005
Neg logit contributions
anne-0.018
lights-0.016
storm-0.015
 Susan-0.014
storms-0.014
Token take
Token ID1011
Feature activation+2.74e-03

Pos logit contributions
ula+0.000
per+0.000
ster+0.000
atching+0.000
 St+0.000
Neg logit contributions
anne-0.000
lights-0.000
storm-0.000
 Susan-0.000
 marsh-0.000
Token the
Token ID262
Feature activation-4.11e-04

Pos logit contributions
per+0.003
 with+0.003
 from+0.003
ula+0.003
 St+0.003
Neg logit contributions
anne-0.007
lights-0.006
storm-0.006
storms-0.005
 Susan-0.005
Token rh
Token ID9529
Feature activation+2.32e-02

Pos logit contributions
anne+0.001
storm+0.001
 marsh+0.001
lights+0.001
storms+0.001
Neg logit contributions
per-0.001
ula-0.001
atching-0.001
ster-0.001
 from-0.001
Tokenin
Token ID259
Feature activation-7.88e-03

Pos logit contributions
ula+0.033
per+0.030
 St+0.029
 from+0.028
 liking+0.027
Neg logit contributions
anne-0.043
lights-0.037
storm-0.035
 snowball-0.033
storms-0.033
Tokenoc
Token ID420
Feature activation+4.18e-02

Pos logit contributions
anne+0.016
storm+0.014
lights+0.013
 marsh+0.013
storms+0.013
Neg logit contributions
ula-0.010
per-0.009
 St-0.008
atching-0.008
 liking-0.008
Tokeneros
Token ID27498
Feature activation+2.51e-03

Pos logit contributions
ula+0.067
per+0.060
 solve+0.058
 from+0.058
 St+0.057
Neg logit contributions
anne-0.071
lights-0.059
storm-0.059
storms-0.055
 marsh-0.052
Token back
Token ID736
Feature activation-1.39e-02

Pos logit contributions
ula+0.002
per+0.002
ster+0.002
 St+0.002
atching+0.002
Neg logit contributions
anne-0.006
lights-0.005
storm-0.005
storms-0.005
 Susan-0.005
Token to
Token ID284
Feature activation+7.24e-04

Pos logit contributions
anne+0.038
storm+0.034
lights+0.033
storms+0.032
 marsh+0.031
Neg logit contributions
 from-0.010
 with-0.010
 by-0.009
per-0.009
ula-0.008
Token the
Token ID262
Feature activation-9.13e-03

Pos logit contributions
per+0.001
ula+0.001
 from+0.000
 St+0.000
 with+0.000
Neg logit contributions
anne-0.002
lights-0.002
storm-0.002
storms-0.002
 Susan-0.002
Token zoo
Token ID26626
Feature activation-7.39e-04

Pos logit contributions
anne+0.017
lights+0.014
storm+0.014
storms+0.013
 Susan+0.013
Neg logit contributions
per-0.013
ula-0.013
ster-0.013
 from-0.012
 St-0.012
Token Sam
Token ID3409
Feature activation-1.24e-03

Pos logit contributions
ula+0.009
per+0.009
atching+0.008
ster+0.008
olding+0.008
Neg logit contributions
anne-0.007
lights-0.006
 Susan-0.005
storm-0.005
storms-0.005
Token hugged
Token ID41130
Feature activation-2.31e-03

Pos logit contributions
anne+0.003
lights+0.002
storm+0.002
storms+0.002
 Susan+0.002
Neg logit contributions
per-0.002
 from-0.002
'-0.002
 with-0.002
 by-0.002
Token the
Token ID262
Feature activation+5.47e-03

Pos logit contributions
anne+0.006
storm+0.005
lights+0.005
storms+0.005
 marsh+0.004
Neg logit contributions
 from-0.002
 with-0.002
per-0.002
 by-0.002
ula-0.002
Token rh
Token ID9529
Feature activation+2.53e-02

Pos logit contributions
ula+0.008
per+0.008
atching+0.008
ster+0.007
 from+0.007
Neg logit contributions
anne-0.010
storm-0.008
lights-0.008
 marsh-0.008
 Susan-0.007
Tokenin
Token ID259
Feature activation-3.28e-03

Pos logit contributions
ula+0.036
per+0.033
 St+0.032
 from+0.031
 solve+0.030
Neg logit contributions
anne-0.046
lights-0.039
storm-0.037
 snowball-0.035
storms-0.035
Tokenoc
Token ID420
Feature activation+4.13e-02

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
 marsh+0.006
 Susan+0.005
Neg logit contributions
ula-0.004
per-0.004
atching-0.003
 St-0.003
etic-0.003
Tokeneros
Token ID27498
Feature activation+3.68e-03

Pos logit contributions
ula+0.067
 solve+0.060
per+0.060
 St+0.058
 figure+0.058
Neg logit contributions
anne-0.070
lights-0.058
storm-0.057
storms-0.054
 Susan-0.050
Token goodbye
Token ID24829
Feature activation+1.71e-02

Pos logit contributions
 from+0.002
per+0.002
 with+0.002
 by+0.002
ster+0.002
Neg logit contributions
anne-0.010
storm-0.009
lights-0.009
storms-0.009
 Susan-0.008
Token,
Token ID11
Feature activation+1.10e-03

Pos logit contributions
per+0.010
 with+0.010
 from+0.010
ula+0.009
 by+0.009
Neg logit contributions
anne-0.047
lights-0.042
storm-0.041
storms-0.040
 marsh-0.039
Token then
Token ID788
Feature activation-4.86e-03

Pos logit contributions
 from+0.001
 with+0.001
per+0.001
 by+0.001
ula+0.001
Neg logit contributions
anne-0.003
lights-0.002
storm-0.002
storms-0.002
 marsh-0.002
Token ran
Token ID4966
Feature activation-7.25e-03

Pos logit contributions
anne+0.010
lights+0.008
storm+0.008
storms+0.008
 marsh+0.007
Neg logit contributions
 from-0.007
 with-0.007
per-0.007
ster-0.006
 by-0.006
Token had
Token ID550
Feature activation-9.99e-03

Pos logit contributions
 from+0.003
 with+0.002
per+0.002
'+0.002
 by+0.002
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
 marsh-0.004
storms-0.004
Token never
Token ID1239
Feature activation-7.02e-03

Pos logit contributions
anne+0.026
lights+0.024
storm+0.022
storms+0.021
jo+0.020
Neg logit contributions
 with-0.010
 by-0.009
 solve-0.009
 St-0.009
 from-0.009
Token seen
Token ID1775
Feature activation+6.32e-03

Pos logit contributions
anne+0.013
lights+0.011
storm+0.010
storms+0.009
 marsh+0.009
Neg logit contributions
per-0.011
 from-0.011
 with-0.011
ula-0.011
 solve-0.011
Token the
Token ID262
Feature activation+7.53e-03

Pos logit contributions
 from+0.005
 with+0.005
 by+0.004
 those+0.004
 this+0.004
Neg logit contributions
anne-0.018
storm-0.016
lights-0.015
jo-0.015
rated-0.014
Token av
Token ID1196
Feature activation+2.08e-03

Pos logit contributions
per+0.008
ster+0.008
ula+0.008
 from+0.008
 with+0.008
Neg logit contributions
anne-0.017
lights-0.014
storm-0.013
storms-0.013
 Susan-0.013
Tokenoc
Token ID420
Feature activation+4.13e-02

Pos logit contributions
ula+0.002
per+0.002
 St+0.002
 from+0.002
olding+0.002
Neg logit contributions
anne-0.005
storm-0.004
lights-0.004
storms-0.004
 Susan-0.004
Tokenados
Token ID22484
Feature activation+2.68e-04

Pos logit contributions
ula+0.060
per+0.056
 St+0.053
 from+0.053
 with+0.051
Neg logit contributions
anne-0.079
storm-0.065
lights-0.064
storms-0.061
 Susan-0.061
Token.
Token ID13
Feature activation+4.02e-03

Pos logit contributions
ula+0.000
per+0.000
 St+0.000
atching+0.000
ster+0.000
Neg logit contributions
anne-0.001
storm-0.001
lights-0.000
storms-0.000
 marsh-0.000
Token They
Token ID1119
Feature activation+4.99e-04

Pos logit contributions
 St+0.005
 with+0.005
 from+0.005
per+0.005
ula+0.005
Neg logit contributions
anne-0.008
storm-0.007
lights-0.007
 marsh-0.007
rated-0.006
Token wished
Token ID16555
Feature activation-7.55e-03

Pos logit contributions
per+0.001
ula+0.001
 from+0.001
ster+0.001
 with+0.001
Neg logit contributions
anne-0.001
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
Token they
Token ID484
Feature activation+4.56e-03

Pos logit contributions
anne+0.016
storm+0.013
lights+0.013
storms+0.012
 marsh+0.012
Neg logit contributions
 from-0.009
ula-0.009
 with-0.009
per-0.009
 St-0.009
Token thank
Token ID5875
Feature activation+1.63e-03

Pos logit contributions
anne+0.007
 marsh+0.006
storm+0.006
lights+0.006
 Susan+0.006
Neg logit contributions
per-0.005
ula-0.005
atching-0.004
or-0.004
ster-0.004
Token you
Token ID345
Feature activation+1.89e-02

Pos logit contributions
ula+0.001
per+0.001
ster+0.001
olding+0.001
 figure+0.001
Neg logit contributions
anne-0.004
 Susan-0.003
 Jessie-0.003
 John-0.003
 marsh-0.003
Token.
Token ID13
Feature activation+7.07e-03

Pos logit contributions
ula+0.019
per+0.019
 from+0.017
ster+0.017
 with+0.017
Neg logit contributions
anne-0.045
lights-0.038
storm-0.037
storms-0.036
 Susan-0.035
Token We
Token ID775
Feature activation+2.51e-03

Pos logit contributions
per+0.008
ula+0.008
ster+0.007
 from+0.007
atching+0.007
Neg logit contributions
anne-0.016
lights-0.013
storm-0.013
storms-0.012
 Susan-0.012
Token love
Token ID1842
Feature activation-3.16e-03

Pos logit contributions
per+0.003
ula+0.003
 from+0.003
 with+0.003
ster+0.003
Neg logit contributions
anne-0.006
lights-0.005
storm-0.005
storms-0.004
 Susan-0.004
Token cuc
Token ID38421
Feature activation+4.12e-02

Pos logit contributions
anne+0.008
lights+0.007
storm+0.007
storms+0.007
 Susan+0.007
Neg logit contributions
per-0.002
ula-0.002
ster-0.002
 from-0.002
 St-0.002
Tokenumbers
Token ID17024
Feature activation+6.77e-03

Pos logit contributions
 from+0.060
 liking+0.059
 figure+0.058
 St+0.056
 this+0.055
Neg logit contributions
anne-0.056
acket-0.048
lights-0.046
akes-0.045
 marsh-0.045
Token.
Token ID13
Feature activation+2.56e-03

Pos logit contributions
per+0.006
ula+0.006
 from+0.005
ster+0.005
 with+0.005
Neg logit contributions
anne-0.017
lights-0.014
storm-0.014
storms-0.014
 Susan-0.013
Token And
Token ID843
Feature activation-1.44e-02

Pos logit contributions
per+0.003
ula+0.003
ster+0.003
 from+0.003
atching+0.003
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
storms-0.004
 Susan-0.004
Token we
Token ID356
Feature activation+1.12e-03

Pos logit contributions
anne+0.025
lights+0.022
 marsh+0.021
 Susan+0.021
 John+0.020
Neg logit contributions
ula-0.020
per-0.020
atching-0.018
or-0.018
olding-0.018
Token love
Token ID1842
Feature activation-2.46e-03

Pos logit contributions
per+0.001
ula+0.001
 from+0.001
 with+0.001
 solve+0.001
Neg logit contributions
anne-0.003
lights-0.002
storm-0.002
storms-0.002
 Susan-0.002
Token garden
Token ID11376
Feature activation+3.02e-03

Pos logit contributions
ula+0.002
per+0.002
 St+0.002
atching+0.002
gg+0.002
Neg logit contributions
anne-0.002
storm-0.002
lights-0.002
 marsh-0.002
 Susan-0.002
Token.
Token ID13
Feature activation-5.44e-03

Pos logit contributions
ula+0.003
per+0.003
 St+0.003
ster+0.003
atching+0.003
Neg logit contributions
anne-0.007
lights-0.006
storm-0.006
storms-0.006
 marsh-0.006
Token They
Token ID1119
Feature activation-7.22e-04

Pos logit contributions
anne+0.013
storm+0.011
lights+0.011
rated+0.010
storms+0.010
Neg logit contributions
 St-0.006
 with-0.006
ula-0.006
 from-0.006
 by-0.005
Token cut
Token ID2005
Feature activation+5.92e-03

Pos logit contributions
anne+0.002
lights+0.001
storm+0.001
 marsh+0.001
storms+0.001
Neg logit contributions
ula-0.001
per-0.001
ster-0.001
atching-0.001
 St-0.001
Token the
Token ID262
Feature activation+1.06e-02

Pos logit contributions
per+0.006
ula+0.006
ster+0.006
atching+0.006
 solve+0.006
Neg logit contributions
anne-0.013
lights-0.011
 marsh-0.011
storm-0.010
 Susan-0.010
Token cuc
Token ID38421
Feature activation+4.07e-02

Pos logit contributions
ula+0.019
aring+0.018
 St+0.017
per+0.017
ress+0.017
Neg logit contributions
anne-0.013
 marsh-0.012
 lava-0.011
lights-0.010
 Jessie-0.010
Tokenumber
Token ID4494
Feature activation-1.56e-03

Pos logit contributions
 liking+0.064
 St+0.061
 figure+0.060
 solve+0.059
 this+0.058
Neg logit contributions
anne-0.044
acket-0.043
 marsh-0.040
lights-0.040
seed-0.039
Token and
Token ID290
Feature activation+1.45e-02

Pos logit contributions
anne+0.003
lights+0.003
 marsh+0.003
 Susan+0.002
storm+0.002
Neg logit contributions
ula-0.002
 St-0.002
per-0.002
 liking-0.002
 My-0.002
Token put
Token ID1234
Feature activation+7.11e-04

Pos logit contributions
per+0.013
ula+0.013
 from+0.012
 with+0.012
ster+0.012
Neg logit contributions
anne-0.036
lights-0.031
storm-0.030
storms-0.029
 Susan-0.028
Token it
Token ID340
Feature activation-8.94e-03

Pos logit contributions
per+0.001
ula+0.001
 with+0.001
 from+0.001
 St+0.001
Neg logit contributions
anne-0.002
storm-0.001
lights-0.001
storms-0.001
 Susan-0.001
Token in
Token ID287
Feature activation-1.53e-02

Pos logit contributions
anne+0.023
lights+0.021
storm+0.020
 marsh+0.019
storms+0.019
Neg logit contributions
ula-0.006
per-0.006
 St-0.005
ster-0.005
atching-0.005
Token're
Token ID821
Feature activation-4.46e-03

Pos logit contributions
per+0.011
 from+0.011
 with+0.011
 solve+0.010
 figure+0.010
Neg logit contributions
anne-0.027
lights-0.023
storm-0.021
 Susan-0.021
storms-0.021
Token the
Token ID262
Feature activation+2.01e-02

Pos logit contributions
anne+0.010
lights+0.008
storm+0.008
 Susan+0.008
storms+0.008
Neg logit contributions
ula-0.005
per-0.005
atching-0.004
ster-0.004
olding-0.004
Token best
Token ID1266
Feature activation-1.85e-03

Pos logit contributions
ula+0.027
per+0.026
atching+0.024
ster+0.024
 from+0.024
Neg logit contributions
anne-0.040
storm-0.033
lights-0.033
storms-0.031
 marsh-0.031
Token rh
Token ID9529
Feature activation+2.33e-02

Pos logit contributions
anne+0.004
lights+0.004
storms+0.003
storm+0.003
 Susan+0.003
Neg logit contributions
 with-0.002
 from-0.002
ster-0.002
ula-0.002
 by-0.002
Tokenin
Token ID259
Feature activation+7.34e-03

Pos logit contributions
ula+0.026
per+0.024
 St+0.023
 from+0.022
atching+0.021
Neg logit contributions
anne-0.051
lights-0.043
storm-0.043
storms-0.040
 Susan-0.040
Tokenoc
Token ID420
Feature activation+4.02e-02

Pos logit contributions
per+0.009
ula+0.009
ster+0.008
 from+0.008
 with+0.008
Neg logit contributions
anne-0.016
lights-0.013
storm-0.012
storms-0.012
 Susan-0.012
Tokeneros
Token ID27498
Feature activation+1.03e-02

Pos logit contributions
ula+0.061
per+0.058
 solve+0.057
 St+0.056
 figure+0.055
Neg logit contributions
anne-0.065
storm-0.057
lights-0.054
storms-0.052
 John-0.048
Token ever
Token ID1683
Feature activation+3.70e-03

Pos logit contributions
 with+0.012
 from+0.012
ster+0.011
 by+0.011
per+0.011
Neg logit contributions
anne-0.023
lights-0.021
storm-0.019
 marsh-0.018
storms-0.018
Token!"
Token ID2474
Feature activation-2.32e-03

Pos logit contributions
 from+0.005
 with+0.005
 by+0.004
per+0.004
ster+0.004
Neg logit contributions
anne-0.008
lights-0.007
storm-0.006
 marsh-0.006
storms-0.006
Token Tim
Token ID5045
Feature activation-1.15e-02

Pos logit contributions
anne+0.004
lights+0.004
storm+0.004
 imag+0.003
rated+0.003
Neg logit contributions
 with-0.004
 from-0.004
 by-0.004
 St-0.003
 this-0.003
Tokenmy
Token ID1820
Feature activation+1.18e-03

Pos logit contributions
anne+0.022
lights+0.019
storm+0.018
 marsh+0.017
storms+0.017
Neg logit contributions
per-0.016
 from-0.016
 with-0.016
 by-0.016
ula-0.016
Token It
Token ID632
Feature activation-3.86e-03

Pos logit contributions
anne+0.011
lights+0.009
storm+0.009
storms+0.009
 Susan+0.008
Neg logit contributions
per-0.008
ula-0.008
ster-0.007
 from-0.007
atching-0.007
Token was
Token ID373
Feature activation+4.09e-03

Pos logit contributions
anne+0.010
lights+0.008
 Susan+0.008
storm+0.008
storms+0.008
Neg logit contributions
 with-0.004
 from-0.003
 by-0.003
 figure-0.003
per-0.003
Token the
Token ID262
Feature activation+1.83e-02

Pos logit contributions
 from+0.004
per+0.004
 with+0.004
ula+0.004
 by+0.004
Neg logit contributions
anne-0.010
lights-0.008
storm-0.008
storms-0.008
 Susan-0.008
Token rh
Token ID9529
Feature activation+2.25e-02

Pos logit contributions
per+0.023
ula+0.022
ster+0.022
 from+0.022
 with+0.021
Neg logit contributions
anne-0.040
lights-0.032
storm-0.031
storms-0.030
 Susan-0.030
Tokenin
Token ID259
Feature activation-2.36e-03

Pos logit contributions
ula+0.027
 St+0.025
per+0.024
 from+0.024
 solve+0.023
Neg logit contributions
anne-0.047
lights-0.041
storm-0.039
storms-0.037
 marsh-0.036
Tokenoc
Token ID420
Feature activation+4.02e-02

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
 marsh+0.004
storms+0.004
Neg logit contributions
ula-0.003
per-0.003
 St-0.003
atching-0.003
 from-0.002
Tokeneros
Token ID27498
Feature activation-6.54e-04

Pos logit contributions
ula+0.059
 St+0.053
per+0.052
 solve+0.052
'+0.050
Neg logit contributions
anne-0.073
lights-0.064
storm-0.062
storms-0.058
 marsh-0.057
Token.
Token ID13
Feature activation-4.64e-03

Pos logit contributions
anne+0.002
lights+0.001
storm+0.001
storms+0.001
 Susan+0.001
Neg logit contributions
per-0.001
ula-0.001
ster-0.000
 from-0.000
 St-0.000
Token It
Token ID632
Feature activation-3.95e-03

Pos logit contributions
anne+0.010
lights+0.008
storm+0.008
storms+0.007
 Susan+0.007
Neg logit contributions
ula-0.006
per-0.006
ster-0.006
 from-0.006
 St-0.005
Token lifted
Token ID13663
Feature activation-1.07e-03

Pos logit contributions
anne+0.009
lights+0.008
 Susan+0.007
storm+0.007
storms+0.007
Neg logit contributions
 with-0.005
 from-0.004
per-0.004
 by-0.004
 figure-0.004
Token its
Token ID663
Feature activation-1.01e-02

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.002
 marsh+0.002
Neg logit contributions
per-0.001
ula-0.001
 from-0.001
 with-0.001
 St-0.001
Token a
Token ID257
Feature activation+1.96e-03

Pos logit contributions
anne+0.003
lights+0.003
storm+0.003
storms+0.003
 Susan+0.003
Neg logit contributions
per-0.001
ula-0.001
ster-0.001
 from-0.001
 St-0.001
Token snake
Token ID17522
Feature activation-4.02e-03

Pos logit contributions
ula+0.002
per+0.002
ster+0.002
 from+0.002
 with+0.002
Neg logit contributions
anne-0.004
lights-0.004
storm-0.003
storms-0.003
 Susan-0.003
Token with
Token ID351
Feature activation+4.46e-03

Pos logit contributions
anne+0.009
lights+0.008
storm+0.008
storms+0.007
 marsh+0.007
Neg logit contributions
ula-0.004
per-0.004
ster-0.003
 St-0.003
atching-0.003
Token me
Token ID502
Feature activation-5.53e-05

Pos logit contributions
per+0.003
ula+0.003
 with+0.003
 from+0.003
 St+0.002
Neg logit contributions
anne-0.012
lights-0.011
storm-0.011
storms-0.010
 Susan-0.010
Token.
Token ID13
Feature activation+2.37e-03

Pos logit contributions
anne+0.000
lights+0.000
storm+0.000
storms+0.000
 marsh+0.000
Neg logit contributions
ula-0.000
per-0.000
ster-0.000
atching-0.000
 St-0.000
Token Deal
Token ID15138
Feature activation-4.32e-02

Pos logit contributions
per+0.003
ula+0.003
ster+0.003
 from+0.003
atching+0.002
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
storms-0.004
 Susan-0.004
Token?"
Token ID1701
Feature activation+2.36e-03

Pos logit contributions
anne+0.083
lights+0.067
storm+0.065
storms+0.062
 Susan+0.061
Neg logit contributions
per-0.062
ula-0.062
ster-0.058
 from-0.058
 St-0.057
Token
Token ID198
Feature activation+2.07e-03

Pos logit contributions
per+0.003
ula+0.003
ster+0.003
 from+0.003
 St+0.003
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
storms-0.004
 Susan-0.004
Token
Token ID198
Feature activation+7.72e-04

Pos logit contributions
per+0.002
ula+0.002
ster+0.002
 from+0.002
 St+0.002
Neg logit contributions
anne-0.004
lights-0.004
storm-0.004
storms-0.003
 Susan-0.003
TokenL
Token ID43
Feature activation+1.45e-02

Pos logit contributions
ula+0.001
per+0.001
ant+0.001
ster+0.001
 from+0.001
Neg logit contributions
anne-0.002
lights-0.001
die-0.001
 snowball-0.001
storm-0.001
Tokenily
Token ID813
Feature activation-1.20e-03

Pos logit contributions
 St+0.018
ster+0.017
 cash+0.017
gg+0.017
self+0.016
Neg logit contributions
anne-0.028
lights-0.021
oline-0.021
 Susan-0.020
storm-0.020
Token you
Token ID345
Feature activation+6.03e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
 Susan+0.006
 marsh+0.006
Neg logit contributions
ula-0.003
per-0.003
atching-0.003
ster-0.003
 St-0.002
Token have
Token ID423
Feature activation-5.54e-04

Pos logit contributions
per+0.007
ula+0.007
 from+0.006
ster+0.006
 with+0.006
Neg logit contributions
anne-0.013
lights-0.011
storm-0.011
storms-0.011
 Susan-0.010
Token to
Token ID284
Feature activation+9.38e-03

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.001
rated+0.001
Neg logit contributions
 with-0.000
 by-0.000
 St-0.000
 from-0.000
per-0.000
Token share
Token ID2648
Feature activation-7.60e-04

Pos logit contributions
ula+0.009
per+0.008
ster+0.008
atching+0.008
 St+0.007
Neg logit contributions
anne-0.022
storm-0.019
lights-0.018
 Susan-0.018
 marsh-0.018
Token.
Token ID13
Feature activation+8.76e-04

Pos logit contributions
anne+0.002
lights+0.001
 marsh+0.001
storm+0.001
storms+0.001
Neg logit contributions
ula-0.001
per-0.001
ster-0.001
atching-0.001
olding-0.001
Token Deal
Token ID15138
Feature activation-4.10e-02

Pos logit contributions
per+0.001
ula+0.001
ster+0.001
 from+0.001
 St+0.001
Neg logit contributions
anne-0.002
lights-0.002
storm-0.001
storms-0.001
 Susan-0.001
Token?"
Token ID1701
Feature activation-2.61e-03

Pos logit contributions
anne+0.076
lights+0.061
storm+0.059
storms+0.056
 Susan+0.055
Neg logit contributions
per-0.062
ula-0.061
ster-0.058
 from-0.057
 St-0.056
Token Lily
Token ID20037
Feature activation-9.27e-03

Pos logit contributions
anne+0.006
lights+0.005
storm+0.005
storms+0.005
 marsh+0.004
Neg logit contributions
ula-0.003
per-0.003
 with-0.003
 St-0.003
 from-0.003
Token said
Token ID531
Feature activation-7.81e-03

Pos logit contributions
anne+0.015
lights+0.012
storm+0.011
storms+0.011
 Susan+0.010
Neg logit contributions
per-0.016
ula-0.016
 with-0.016
 from-0.015
ster-0.015
Token "
Token ID366
Feature activation+7.09e-03

Pos logit contributions
anne+0.017
lights+0.014
storm+0.014
storms+0.013
 Susan+0.013
Neg logit contributions
per-0.008
ula-0.008
 figure-0.007
 solve-0.007
atching-0.007
TokenDeal
Token ID45776
Feature activation-1.64e-02

Pos logit contributions
ula+0.010
per+0.010
ster+0.010
 from+0.010
 St+0.010
Neg logit contributions
anne-0.013
lights-0.011
storm-0.010
storms-0.010
 Susan-0.010
Token fold
Token ID5591
Feature activation-7.55e-05

Pos logit contributions
anne+0.000
lights+0.000
storm+0.000
 marsh+0.000
 Susan+0.000
Neg logit contributions
ula-0.000
ster-0.000
atching-0.000
per-0.000
 St-0.000
Token the
Token ID262
Feature activation+8.83e-03

Pos logit contributions
anne+0.000
lights+0.000
 marsh+0.000
storm+0.000
storms+0.000
Neg logit contributions
per-0.000
ula-0.000
ster-0.000
atching-0.000
 solve-0.000
Token laundry
Token ID25724
Feature activation+3.41e-03

Pos logit contributions
atching+0.014
ula+0.014
per+0.014
 from+0.013
 solve+0.013
Neg logit contributions
anne-0.014
lights-0.012
 marsh-0.011
storm-0.011
 imag-0.010
Token first
Token ID717
Feature activation-4.24e-03

Pos logit contributions
ula+0.003
per+0.003
ster+0.003
atching+0.003
 St+0.003
Neg logit contributions
anne-0.008
lights-0.007
storm-0.007
storms-0.007
 marsh-0.007
Token.
Token ID13
Feature activation-3.80e-04

Pos logit contributions
anne+0.011
lights+0.009
storm+0.009
storms+0.008
 Susan+0.008
Neg logit contributions
per-0.004
ula-0.004
 from-0.003
 with-0.003
ster-0.003
Token Deal
Token ID15138
Feature activation-4.07e-02

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
 Susan+0.001
storms+0.001
Neg logit contributions
ula-0.000
per-0.000
ster-0.000
atching-0.000
 from-0.000
Token?"
Token ID1701
Feature activation-2.33e-03

Pos logit contributions
anne+0.072
lights+0.059
storm+0.055
 Susan+0.052
storms+0.052
Neg logit contributions
per-0.064
ula-0.063
ster-0.060
atching-0.058
 St-0.058
Token
Token ID198
Feature activation-6.38e-03

Pos logit contributions
anne+0.004
lights+0.004
 Susan+0.003
storms+0.003
storm+0.003
Neg logit contributions
per-0.003
ster-0.003
ula-0.003
atching-0.003
gg-0.003
Token
Token ID198
Feature activation-7.13e-03

Pos logit contributions
anne+0.013
lights+0.011
storm+0.010
 Susan+0.010
storms+0.010
Neg logit contributions
per-0.008
ula-0.008
ster-0.008
atching-0.007
 from-0.007
TokenL
Token ID43
Feature activation+1.22e-02

Pos logit contributions
anne+0.015
lights+0.012
die+0.012
 Susan+0.011
 snowball+0.011
Neg logit contributions
ula-0.009
ster-0.008
per-0.008
aser-0.008
atching-0.008
Tokenily
Token ID813
Feature activation+2.25e-03

Pos logit contributions
 St+0.013
ster+0.013
 solve+0.012
gg+0.012
 figure+0.012
Neg logit contributions
anne-0.026
lights-0.022
storm-0.020
 Susan-0.019
storms-0.019
Token put
Token ID1234
Feature activation-1.42e-03

Pos logit contributions
ula+0.013
per+0.013
ster+0.012
atching+0.012
 St+0.012
Neg logit contributions
anne-0.030
lights-0.025
storm-0.025
 Susan-0.024
 marsh-0.024
Token away
Token ID1497
Feature activation-8.37e-03

Pos logit contributions
anne+0.004
storm+0.003
lights+0.003
storms+0.003
 Susan+0.003
Neg logit contributions
 with-0.001
per-0.001
 from-0.001
ula-0.001
 by-0.001
Token your
Token ID534
Feature activation-6.13e-03

Pos logit contributions
anne+0.023
storm+0.019
lights+0.019
storms+0.019
 Susan+0.018
Neg logit contributions
per-0.006
 from-0.006
ula-0.006
 with-0.006
 by-0.005
Token blocks
Token ID7021
Feature activation-3.29e-03

Pos logit contributions
anne+0.013
storm+0.010
storms+0.010
lights+0.010
 Susan+0.010
Neg logit contributions
ula-0.008
per-0.008
 with-0.008
 from-0.008
ster-0.007
Token.
Token ID13
Feature activation+6.14e-04

Pos logit contributions
anne+0.008
lights+0.007
storm+0.007
storms+0.006
 Susan+0.006
Neg logit contributions
ula-0.003
per-0.003
ster-0.003
 St-0.003
atching-0.003
Token Deal
Token ID15138
Feature activation-4.04e-02

Pos logit contributions
per+0.001
ula+0.001
ster+0.001
atching+0.001
 from+0.001
Neg logit contributions
anne-0.001
lights-0.001
storm-0.001
 Susan-0.001
storms-0.001
Token?"
Token ID1701
Feature activation+6.40e-05

Pos logit contributions
anne+0.071
lights+0.058
storm+0.055
 Susan+0.052
storms+0.052
Neg logit contributions
per-0.063
ula-0.063
ster-0.059
atching-0.058
 St-0.057
Token
Token ID198
Feature activation-3.30e-03

Pos logit contributions
per+0.000
ula+0.000
ster+0.000
atching+0.000
 from+0.000
Neg logit contributions
anne-0.000
lights-0.000
storm-0.000
 Susan-0.000
storms-0.000
Token
Token ID198
Feature activation-5.59e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
storms+0.006
 Susan+0.006
Neg logit contributions
ula-0.004
per-0.004
 from-0.003
ster-0.003
 with-0.003
TokenL
Token ID43
Feature activation+1.09e-02

Pos logit contributions
anne+0.012
lights+0.009
 Susan+0.009
storm+0.009
 marsh+0.008
Neg logit contributions
per-0.007
ula-0.007
ster-0.007
 from-0.006
atching-0.006
Tokenily
Token ID813
Feature activation+3.38e-03

Pos logit contributions
 solve+0.013
ster+0.013
gg+0.013
 St+0.012
 figure+0.012
Neg logit contributions
anne-0.021
lights-0.018
oline-0.015
 Susan-0.015
storm-0.015
Token nice
Token ID3621
Feature activation-3.40e-03

Pos logit contributions
per+0.004
ula+0.004
ster+0.004
 St+0.003
atching+0.003
Neg logit contributions
anne-0.008
lights-0.007
storm-0.007
 marsh-0.006
storms-0.006
Token after
Token ID706
Feature activation-1.05e-02

Pos logit contributions
anne+0.009
lights+0.007
storm+0.007
storms+0.007
 Susan+0.007
Neg logit contributions
per-0.003
ula-0.003
 from-0.003
ster-0.003
 with-0.003
Token each
Token ID1123
Feature activation+1.63e-02

Pos logit contributions
anne+0.030
lights+0.025
storm+0.025
storms+0.025
 Susan+0.024
Neg logit contributions
per-0.007
ula-0.006
 from-0.006
 with-0.006
 by-0.005
Token one
Token ID530
Feature activation-2.71e-03

Pos logit contributions
per+0.020
ula+0.019
atching+0.018
ster+0.018
gg+0.017
Neg logit contributions
anne-0.035
storm-0.028
lights-0.027
 Susan-0.026
storms-0.026
Token.
Token ID13
Feature activation+2.34e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
storms+0.006
 Susan+0.006
Neg logit contributions
 from-0.002
 with-0.002
 by-0.002
per-0.002
ula-0.002
Token Deal
Token ID15138
Feature activation-4.03e-02

Pos logit contributions
per+0.003
ula+0.003
ster+0.002
 from+0.002
atching+0.002
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
storms-0.004
 Susan-0.004
Token?"
Token ID1701
Feature activation+5.01e-05

Pos logit contributions
anne+0.075
lights+0.061
storm+0.058
storms+0.055
 Susan+0.054
Neg logit contributions
per-0.060
ula-0.060
ster-0.056
 from-0.055
 St-0.055
Token
Token ID198
Feature activation+1.35e-03

Pos logit contributions
ula+0.000
per+0.000
 with+0.000
 St+0.000
 from+0.000
Neg logit contributions
anne-0.000
lights-0.000
storm-0.000
storms-0.000
 marsh-0.000
Token
Token ID198
Feature activation-1.88e-03

Pos logit contributions
ula+0.001
per+0.001
 with+0.001
 St+0.001
 from+0.001
Neg logit contributions
anne-0.003
lights-0.003
storm-0.003
storms-0.003
 marsh-0.003
TokenMax
Token ID11518
Feature activation-2.97e-03

Pos logit contributions
anne+0.004
lights+0.003
storm+0.003
 Susan+0.003
storms+0.003
Neg logit contributions
per-0.002
ula-0.002
ster-0.002
 from-0.002
atching-0.002
Token nodded
Token ID14464
Feature activation-1.19e-02

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
 marsh+0.004
 hurricane+0.004
Neg logit contributions
per-0.005
 with-0.005
 St-0.005
 from-0.005
ster-0.005
Token any
Token ID597
Feature activation-2.88e-03

Pos logit contributions
 with+0.000
 from+0.000
 by+0.000
 this+0.000
 those+0.000
Neg logit contributions
anne-0.000
lights-0.000
storm-0.000
storms-0.000
jo-0.000
Token more
Token ID517
Feature activation-1.25e-03

Pos logit contributions
anne+0.007
storm+0.006
lights+0.006
storms+0.006
 Susan+0.006
Neg logit contributions
 from-0.003
 with-0.003
 by-0.003
per-0.002
ster-0.002
Token without
Token ID1231
Feature activation+2.97e-04

Pos logit contributions
anne+0.003
storm+0.003
lights+0.002
storms+0.002
 Susan+0.002
Neg logit contributions
 from-0.001
 with-0.001
per-0.001
 by-0.001
'-0.001
Token asking
Token ID4737
Feature activation+1.63e-02

Pos logit contributions
per+0.000
ula+0.000
 from+0.000
 with+0.000
 St+0.000
Neg logit contributions
anne-0.001
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
Token.
Token ID13
Feature activation+1.93e-03

Pos logit contributions
per+0.015
ula+0.014
 from+0.013
 with+0.013
ster+0.013
Neg logit contributions
anne-0.040
lights-0.035
storm-0.034
storms-0.032
 Susan-0.032
Token Deal
Token ID15138
Feature activation-4.00e-02

Pos logit contributions
per+0.002
ula+0.002
ster+0.002
 from+0.002
 St+0.002
Neg logit contributions
anne-0.004
lights-0.004
storm-0.004
storms-0.003
 Susan-0.003
Token?"
Token ID1701
Feature activation-8.42e-04

Pos logit contributions
anne+0.076
lights+0.062
storm+0.060
storms+0.057
 Susan+0.055
Neg logit contributions
per-0.058
ula-0.058
ster-0.054
 from-0.054
 St-0.053
Token
Token ID198
Feature activation+7.96e-04

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.002
 imag+0.001
Neg logit contributions
 with-0.001
 from-0.001
 St-0.001
 by-0.001
 figure-0.001
Token
Token ID198
Feature activation-3.94e-03

Pos logit contributions
ula+0.001
 with+0.001
per+0.001
"+0.001
 from+0.001
Neg logit contributions
anne-0.002
lights-0.002
storm-0.002
storms-0.002
 imag-0.002
TokenL
Token ID43
Feature activation+3.79e-03

Pos logit contributions
anne+0.008
lights+0.007
storm+0.006
storms+0.006
 marsh+0.006
Neg logit contributions
per-0.005
ula-0.005
ster-0.005
 with-0.005
 from-0.005
Tokenily
Token ID813
Feature activation-2.33e-03

Pos logit contributions
per+0.004
ula+0.004
ster+0.004
 from+0.004
atching+0.004
Neg logit contributions
anne-0.008
lights-0.007
storm-0.007
storms-0.007
 marsh-0.006
Token we
Token ID356
Feature activation+3.75e-03

Pos logit contributions
anne+0.032
lights+0.028
storm+0.027
storms+0.026
 Susan+0.026
Neg logit contributions
per-0.005
 with-0.004
ula-0.004
 from-0.004
 St-0.004
Token get
Token ID651
Feature activation-5.56e-03

Pos logit contributions
ula+0.004
per+0.004
ster+0.004
atching+0.004
 St+0.004
Neg logit contributions
anne-0.008
storm-0.007
lights-0.007
storms-0.007
 marsh-0.007
Token back
Token ID736
Feature activation-1.36e-02

Pos logit contributions
anne+0.013
lights+0.011
storm+0.011
storms+0.010
 Susan+0.010
Neg logit contributions
per-0.006
ula-0.006
 from-0.005
 with-0.005
ster-0.005
Token home
Token ID1363
Feature activation+7.31e-03

Pos logit contributions
anne+0.036
storm+0.031
lights+0.030
storms+0.030
 Susan+0.029
Neg logit contributions
 from-0.010
 with-0.010
per-0.010
 by-0.010
ula-0.010
Token.
Token ID13
Feature activation+1.99e-03

Pos logit contributions
per+0.007
ula+0.007
 from+0.006
ster+0.006
 with+0.006
Neg logit contributions
anne-0.018
lights-0.015
storm-0.015
storms-0.014
 Susan-0.014
Token Deal
Token ID15138
Feature activation-3.99e-02

Pos logit contributions
ula+0.002
per+0.002
ster+0.002
 from+0.002
atching+0.002
Neg logit contributions
anne-0.004
lights-0.004
storm-0.003
storms-0.003
 Susan-0.003
Token?"
Token ID1701
Feature activation-5.31e-04

Pos logit contributions
anne+0.074
lights+0.060
storm+0.057
storms+0.054
 Susan+0.054
Neg logit contributions
per-0.060
ula-0.059
ster-0.056
 from-0.055
 St-0.055
Token
Token ID198
Feature activation-5.69e-03

Pos logit contributions
anne+0.001
 Susan+0.001
lights+0.001
 marsh+0.001
storms+0.001
Neg logit contributions
ster-0.001
per-0.001
ula-0.001
aser-0.001
 figure-0.001
Token
Token ID198
Feature activation-6.47e-03

Pos logit contributions
anne+0.011
lights+0.009
 marsh+0.008
 Susan+0.008
storm+0.008
Neg logit contributions
per-0.008
ula-0.008
ster-0.008
atching-0.007
 figure-0.007
TokenS
Token ID50
Feature activation+6.96e-03

Pos logit contributions
anne+0.010
 Susan+0.007
lights+0.007
 marsh+0.007
die+0.007
Neg logit contributions
ula-0.011
ster-0.010
aser-0.010
per-0.010
 figure-0.010
Tokenara
Token ID3301
Feature activation-6.19e-04

Pos logit contributions
ula+0.014
 figure+0.014
per+0.013
 from+0.013
 solve+0.013
Neg logit contributions
anne-0.010
lights-0.007
storm-0.006
 marsh-0.006
rene-0.006
Token me
Token ID502
Feature activation+5.84e-03

Pos logit contributions
ula+0.007
per+0.007
ster+0.006
atching+0.006
 from+0.006
Neg logit contributions
anne-0.015
lights-0.012
storm-0.012
storms-0.012
 Susan-0.011
Token fix
Token ID4259
Feature activation-8.30e-03

Pos logit contributions
ula+0.005
per+0.005
ster+0.005
 St+0.005
atching+0.004
Neg logit contributions
anne-0.014
lights-0.012
storm-0.012
 Susan-0.011
 marsh-0.011
Token my
Token ID616
Feature activation-1.68e-04

Pos logit contributions
anne+0.022
lights+0.019
storm+0.019
storms+0.018
 Susan+0.018
Neg logit contributions
per-0.006
ula-0.006
ster-0.005
 from-0.005
 St-0.005
Token taxi
Token ID17536
Feature activation-3.73e-03

Pos logit contributions
anne+0.000
lights+0.000
storm+0.000
storms+0.000
 Susan+0.000
Neg logit contributions
per-0.000
ula-0.000
ster-0.000
 from-0.000
 with-0.000
Token.
Token ID13
Feature activation+3.23e-03

Pos logit contributions
anne+0.009
lights+0.007
storm+0.007
storms+0.007
 Susan+0.007
Neg logit contributions
ula-0.004
per-0.004
ster-0.003
 St-0.003
atching-0.003
Token Deal
Token ID15138
Feature activation-3.98e-02

Pos logit contributions
per+0.004
ula+0.004
ster+0.004
 from+0.004
 solve+0.004
Neg logit contributions
anne-0.006
lights-0.005
 Susan-0.005
storm-0.005
storms-0.005
Token?"
Token ID1701
Feature activation+1.26e-03

Pos logit contributions
anne+0.068
lights+0.056
storm+0.052
 Susan+0.050
storms+0.049
Neg logit contributions
per-0.064
ula-0.064
ster-0.060
atching-0.058
 St-0.058
Token he
Token ID339
Feature activation-1.32e-03

Pos logit contributions
ula+0.002
per+0.002
ster+0.002
atching+0.002
ant+0.002
Neg logit contributions
anne-0.002
 Susan-0.002
 John-0.002
lights-0.002
 Jessie-0.002
Token said
Token ID531
Feature activation-4.23e-03

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.001
 Susan+0.001
Neg logit contributions
ula-0.002
per-0.002
ster-0.002
atching-0.002
 from-0.002
Token.
Token ID13
Feature activation+1.85e-03

Pos logit contributions
anne+0.009
lights+0.007
 Susan+0.007
storms+0.007
 John+0.007
Neg logit contributions
ula-0.005
per-0.005
ster-0.005
atching-0.005
gg-0.004
Token
Token ID198
Feature activation+9.65e-04

Pos logit contributions
per+0.002
ula+0.002
ster+0.002
 from+0.002
 St+0.002
Neg logit contributions
anne-0.004
lights-0.003
storm-0.003
storms-0.003
 Susan-0.003
Token to
Token ID284
Feature activation+2.20e-03

Pos logit contributions
per+0.008
ula+0.008
ster+0.008
 from+0.007
 St+0.007
Neg logit contributions
anne-0.025
lights-0.021
storm-0.021
storms-0.020
 Susan-0.020
Token play
Token ID711
Feature activation+1.04e-02

Pos logit contributions
per+0.002
ula+0.002
 from+0.002
 with+0.002
ster+0.002
Neg logit contributions
anne-0.005
lights-0.005
storm-0.004
storms-0.004
 Susan-0.004
Token after
Token ID706
Feature activation+8.00e-04

Pos logit contributions
ula+0.006
per+0.006
ster+0.006
atching+0.005
 St+0.005
Neg logit contributions
anne-0.027
lights-0.024
storm-0.024
 marsh-0.023
storms-0.023
Token that
Token ID326
Feature activation+8.17e-04

Pos logit contributions
per+0.001
ula+0.001
 with+0.001
 from+0.001
ster+0.000
Neg logit contributions
anne-0.002
lights-0.002
storm-0.002
storms-0.002
 Susan-0.002
Token.
Token ID13
Feature activation+3.22e-03

Pos logit contributions
 with+0.001
 from+0.001
per+0.001
'+0.001
 by+0.001
Neg logit contributions
anne-0.002
lights-0.002
 Susan-0.002
storm-0.002
storms-0.002
Token Deal
Token ID15138
Feature activation-3.95e-02

Pos logit contributions
ula+0.004
per+0.004
ster+0.004
atching+0.003
 from+0.003
Neg logit contributions
anne-0.007
lights-0.005
storm-0.005
 Susan-0.005
 marsh-0.005
Token?"
Token ID1701
Feature activation+4.26e-03

Pos logit contributions
anne+0.071
lights+0.058
storm+0.055
storms+0.052
 Susan+0.052
Neg logit contributions
per-0.061
ula-0.060
ster-0.057
 from-0.055
 St-0.055
Token
Token ID198
Feature activation+3.77e-03

Pos logit contributions
ula+0.005
per+0.005
 from+0.004
 with+0.004
 St+0.004
Neg logit contributions
anne-0.010
lights-0.008
storm-0.008
storms-0.008
 Susan-0.007
Token
Token ID198
Feature activation+2.82e-03

Pos logit contributions
ula+0.004
per+0.004
 with+0.004
 St+0.004
 from+0.004
Neg logit contributions
anne-0.009
lights-0.008
storm-0.007
storms-0.007
 Susan-0.007
TokenHer
Token ID9360
Feature activation+2.66e-03

Pos logit contributions
ula+0.003
per+0.003
ster+0.003
 from+0.003
atching+0.003
Neg logit contributions
anne-0.006
lights-0.005
storm-0.005
 Susan-0.005
 marsh-0.005
Token mom
Token ID1995
Feature activation-3.63e-03

Pos logit contributions
ula+0.003
 from+0.002
 with+0.002
ster+0.002
per+0.002
Neg logit contributions
anne-0.007
storm-0.006
storms-0.006
lights-0.005
 Susan-0.005
Token.
Token ID13
Feature activation-4.74e-03

Pos logit contributions
ula+0.001
per+0.001
ster+0.001
atching+0.001
 St+0.001
Neg logit contributions
anne-0.002
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
Token She
Token ID1375
Feature activation-8.96e-04

Pos logit contributions
anne+0.010
lights+0.008
storm+0.008
storms+0.008
 Susan+0.007
Neg logit contributions
per-0.006
ula-0.006
ster-0.006
 from-0.006
 St-0.005
Token had
Token ID550
Feature activation+7.52e-03

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
storms+0.001
 Susan+0.001
Neg logit contributions
per-0.002
ula-0.002
ster-0.001
 from-0.001
atching-0.001
Token a
Token ID257
Feature activation-1.59e-03

Pos logit contributions
 with+0.006
ula+0.005
 St+0.005
 from+0.005
 by+0.005
Neg logit contributions
anne-0.020
lights-0.018
storm-0.017
storms-0.017
rated-0.016
Token red
Token ID2266
Feature activation+2.22e-04

Pos logit contributions
anne+0.003
lights+0.003
storm+0.003
storms+0.003
 Susan+0.003
Neg logit contributions
ula-0.002
per-0.002
ster-0.002
 from-0.002
 St-0.002
Token tr
Token ID491
Feature activation-3.90e-02

Pos logit contributions
per+0.000
ula+0.000
atching+0.000
olding+0.000
etic+0.000
Neg logit contributions
anne-0.000
storm-0.000
lights-0.000
 marsh-0.000
 hurricane-0.000
Tokenicycle
Token ID35298
Feature activation-5.78e-03

Pos logit contributions
anne+0.075
lights+0.061
storm+0.058
storms+0.056
 Susan+0.055
Neg logit contributions
per-0.056
ula-0.056
ster-0.052
 from-0.052
 St-0.051
Token with
Token ID351
Feature activation+2.06e-02

Pos logit contributions
anne+0.013
storm+0.012
lights+0.012
storms+0.011
 John+0.010
Neg logit contributions
per-0.006
ula-0.005
ster-0.005
 cents-0.005
ant-0.005
Token a
Token ID257
Feature activation-1.00e-02

Pos logit contributions
per+0.026
ula+0.025
ster+0.024
atching+0.024
 from+0.023
Neg logit contributions
anne-0.043
lights-0.036
storm-0.034
 Susan-0.033
storms-0.033
Token big
Token ID1263
Feature activation-2.21e-02

Pos logit contributions
anne+0.022
lights+0.018
storm+0.017
storms+0.017
 Susan+0.016
Neg logit contributions
ula-0.012
per-0.012
 with-0.012
ster-0.012
 figure-0.011
Token wheel
Token ID7825
Feature activation-1.06e-02

Pos logit contributions
anne+0.049
lights+0.041
storm+0.039
storms+0.038
 Susan+0.037
Neg logit contributions
ula-0.025
per-0.025
 with-0.025
 from-0.024
ster-0.024
Token please
Token ID3387
Feature activation+4.08e-03

Pos logit contributions
per+0.000
ula+0.000
ster+0.000
 from+0.000
atching+0.000
Neg logit contributions
anne-0.001
lights-0.001
storm-0.001
 marsh-0.001
storms-0.001
Token and
Token ID290
Feature activation+3.71e-03

Pos logit contributions
ula+0.005
per+0.004
ster+0.004
atching+0.004
 St+0.004
Neg logit contributions
anne-0.009
lights-0.007
storm-0.007
 marsh-0.007
storms-0.007
Token thank
Token ID5875
Feature activation+7.81e-03

Pos logit contributions
per+0.003
ula+0.003
 from+0.003
 with+0.003
ster+0.003
Neg logit contributions
anne-0.009
lights-0.008
storm-0.008
storms-0.008
 Susan-0.007
Token you
Token ID345
Feature activation+1.32e-02

Pos logit contributions
ula+0.006
per+0.006
ster+0.005
atching+0.005
 from+0.005
Neg logit contributions
anne-0.019
lights-0.016
 Susan-0.016
storm-0.016
 marsh-0.016
Token.
Token ID13
Feature activation+1.43e-03

Pos logit contributions
ula+0.014
per+0.014
ster+0.013
atching+0.013
 St+0.012
Neg logit contributions
anne-0.030
lights-0.025
storm-0.024
storms-0.023
 marsh-0.023
Token Deal
Token ID15138
Feature activation-3.89e-02

Pos logit contributions
ula+0.002
per+0.002
ster+0.002
atching+0.002
 from+0.002
Neg logit contributions
anne-0.003
lights-0.003
storm-0.002
 Susan-0.002
storms-0.002
Token?"
Token ID1701
Feature activation-3.58e-03

Pos logit contributions
anne+0.068
lights+0.055
storm+0.052
 Susan+0.050
storms+0.049
Neg logit contributions
per-0.062
ula-0.061
ster-0.057
atching-0.056
 St-0.055
Token
Token ID198
Feature activation-4.64e-03

Pos logit contributions
anne+0.007
lights+0.006
 Susan+0.005
storm+0.005
 marsh+0.005
Neg logit contributions
per-0.005
ula-0.005
ster-0.005
atching-0.004
 solve-0.004
Token
Token ID198
Feature activation-6.77e-03

Pos logit contributions
anne+0.009
lights+0.008
storm+0.007
 marsh+0.007
 Susan+0.007
Neg logit contributions
per-0.006
ula-0.006
ster-0.006
atching-0.006
 solve-0.005
TokenS
Token ID50
Feature activation+3.70e-03

Pos logit contributions
anne+0.011
 marsh+0.009
lights+0.009
 Susan+0.008
 snowball+0.008
Neg logit contributions
ula-0.010
per-0.010
 figure-0.010
aser-0.010
ster-0.010
Tokenara
Token ID3301
Feature activation-1.77e-03

Pos logit contributions
ula+0.007
per+0.007
 figure+0.006
 from+0.006
 St+0.006
Neg logit contributions
anne-0.006
lights-0.004
storm-0.004
 marsh-0.004
 Susan-0.004
Token.
Token ID13
Feature activation-1.22e-03

Pos logit contributions
anne+0.014
storm+0.012
lights+0.012
storms+0.011
 marsh+0.011
Neg logit contributions
 with-0.004
 from-0.003
ula-0.003
 by-0.003
per-0.003
Token She
Token ID1375
Feature activation-3.37e-03

Pos logit contributions
anne+0.003
lights+0.002
storm+0.002
storms+0.002
 marsh+0.002
Neg logit contributions
ula-0.001
per-0.001
 from-0.001
 St-0.001
 with-0.001
Token had
Token ID550
Feature activation+5.51e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
 Susan+0.006
storms+0.006
Neg logit contributions
 with-0.004
per-0.004
'-0.004
 by-0.004
 figure-0.004
Token a
Token ID257
Feature activation-4.81e-03

Pos logit contributions
 with+0.003
 by+0.002
 from+0.002
 St+0.002
 right+0.002
Neg logit contributions
anne-0.017
lights-0.016
storm-0.015
rated-0.014
storms-0.014
Token big
Token ID1263
Feature activation-1.30e-02

Pos logit contributions
anne+0.011
lights+0.010
storms+0.009
storm+0.009
 Susan+0.009
Neg logit contributions
ula-0.005
ster-0.005
 with-0.005
 figure-0.005
 St-0.004
Token tr
Token ID491
Feature activation-3.86e-02

Pos logit contributions
anne+0.029
lights+0.025
storms+0.023
storm+0.023
 Susan+0.022
Neg logit contributions
 with-0.014
per-0.014
 figure-0.014
ster-0.014
 from-0.014
Tokenicycle
Token ID35298
Feature activation-5.61e-03

Pos logit contributions
anne+0.065
lights+0.051
storm+0.049
storms+0.047
 Susan+0.046
Neg logit contributions
ula-0.064
per-0.064
ster-0.060
 with-0.059
 from-0.059
Token that
Token ID326
Feature activation-5.58e-03

Pos logit contributions
anne+0.013
lights+0.012
storm+0.011
storms+0.011
 Susan+0.011
Neg logit contributions
ula-0.005
per-0.005
ster-0.004
atching-0.004
 St-0.004
Token she
Token ID673
Feature activation+5.53e-03

Pos logit contributions
anne+0.015
lights+0.013
storm+0.013
storms+0.012
acket+0.012
Neg logit contributions
 with-0.004
 by-0.004
 St-0.004
 from-0.004
ula-0.003
Token would
Token ID561
Feature activation+6.20e-03

Pos logit contributions
per+0.008
ula+0.008
 with+0.008
ster+0.008
 from+0.008
Neg logit contributions
anne-0.011
lights-0.009
storm-0.008
 Susan-0.008
storms-0.008
Token ride
Token ID6594
Feature activation-5.09e-03

Pos logit contributions
per+0.008
ula+0.007
 with+0.007
 from+0.007
 solve+0.007
Neg logit contributions
anne-0.013
lights-0.011
storm-0.010
storms-0.010
 Susan-0.010
Token had
Token ID550
Feature activation+1.06e-02

Pos logit contributions
 with+0.004
 by+0.004
per+0.004
 from+0.004
'+0.004
Neg logit contributions
anne-0.008
lights-0.008
storm-0.007
storms-0.007
 Susan-0.007
Token a
Token ID257
Feature activation+1.05e-03

Pos logit contributions
 with+0.007
 by+0.005
 St+0.005
 from+0.004
 this+0.004
Neg logit contributions
anne-0.032
lights-0.029
storm-0.028
storms-0.027
rated-0.027
Token pet
Token ID4273
Feature activation+1.36e-02

Pos logit contributions
ula+0.001
 with+0.001
ster+0.001
 figure+0.001
 St+0.001
Neg logit contributions
anne-0.003
lights-0.002
storms-0.002
storm-0.002
 Susan-0.002
Token dog
Token ID3290
Feature activation-3.01e-03

Pos logit contributions
per+0.017
ula+0.016
atching+0.015
 solve+0.015
ster+0.015
Neg logit contributions
anne-0.028
storm-0.023
lights-0.022
storms-0.022
 marsh-0.022
Token called
Token ID1444
Feature activation+3.74e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
storms+0.005
 Susan+0.005
Neg logit contributions
ula-0.003
per-0.003
ster-0.003
atching-0.003
 solve-0.003
Token Bos
Token ID14548
Feature activation-3.85e-02

Pos logit contributions
per+0.006
ula+0.006
ster+0.005
atching+0.005
 from+0.005
Neg logit contributions
anne-0.007
lights-0.005
storm-0.005
 Susan-0.005
storms-0.005
Tokenco
Token ID1073
Feature activation-2.52e-02

Pos logit contributions
anne+0.074
lights+0.064
storm+0.060
 marsh+0.059
 Susan+0.058
Neg logit contributions
ula-0.052
per-0.052
 St-0.047
ster-0.047
 by-0.047
Token.
Token ID13
Feature activation-4.67e-03

Pos logit contributions
anne+0.056
lights+0.045
storm+0.044
storms+0.043
 Susan+0.042
Neg logit contributions
per-0.030
ula-0.029
ster-0.028
atching-0.027
 from-0.027
Token Bos
Token ID14548
Feature activation-3.59e-02

Pos logit contributions
anne+0.010
storm+0.008
lights+0.008
storms+0.007
 Susan+0.007
Neg logit contributions
ula-0.006
 St-0.006
 with-0.006
per-0.006
 from-0.006
Tokenco
Token ID1073
Feature activation-2.26e-02

Pos logit contributions
anne+0.069
lights+0.060
storm+0.056
 marsh+0.055
 Susan+0.054
Neg logit contributions
ula-0.048
per-0.046
 by-0.044
 with-0.044
 St-0.043
Token liked
Token ID8288
Feature activation-9.22e-03

Pos logit contributions
anne+0.045
lights+0.042
storm+0.039
storms+0.037
 marsh+0.036
Neg logit contributions
 with-0.027
per-0.026
 by-0.026
'-0.026
 from-0.026
Token write
Token ID3551
Feature activation+9.88e-03

Pos logit contributions
ula+0.015
per+0.015
ster+0.014
 from+0.013
 St+0.013
Neg logit contributions
anne-0.026
lights-0.021
storm-0.021
storms-0.020
 Susan-0.020
Token on
Token ID319
Feature activation-9.68e-04

Pos logit contributions
per+0.008
ula+0.008
 from+0.008
 with+0.007
ster+0.007
Neg logit contributions
anne-0.025
lights-0.021
storm-0.021
storms-0.020
 Susan-0.020
Token paper
Token ID3348
Feature activation+1.15e-02

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
 Susan+0.002
storms+0.002
Neg logit contributions
ula-0.001
per-0.001
ster-0.001
atching-0.001
 from-0.001
Token only
Token ID691
Feature activation-3.81e-03

Pos logit contributions
ula+0.010
per+0.010
ster+0.009
atching+0.009
 St+0.008
Neg logit contributions
anne-0.028
lights-0.024
storm-0.023
storms-0.023
 marsh-0.022
Token.
Token ID13
Feature activation+1.43e-03

Pos logit contributions
anne+0.009
lights+0.008
storm+0.007
storms+0.007
 Susan+0.007
Neg logit contributions
ula-0.004
per-0.004
ster-0.004
 from-0.003
 St-0.003
Token Deal
Token ID15138
Feature activation-3.82e-02

Pos logit contributions
per+0.002
ula+0.002
ster+0.002
 from+0.002
atching+0.002
Neg logit contributions
anne-0.003
lights-0.002
 Susan-0.002
storm-0.002
storms-0.002
Token?"
Token ID1701
Feature activation-1.75e-03

Pos logit contributions
anne+0.067
lights+0.055
storm+0.052
 Susan+0.049
storms+0.049
Neg logit contributions
per-0.060
ula-0.059
ster-0.056
atching-0.054
 St-0.054
Token
Token ID198
Feature activation-3.70e-03

Pos logit contributions
anne+0.004
lights+0.004
storm+0.004
storms+0.003
 marsh+0.003
Neg logit contributions
ula-0.002
 with-0.002
 St-0.002
 from-0.002
per-0.002
Token
Token ID198
Feature activation-4.67e-03

Pos logit contributions
anne+0.009
storm+0.008
lights+0.008
storms+0.007
 marsh+0.007
Neg logit contributions
ula-0.003
 with-0.003
per-0.003
 from-0.003
 St-0.003
TokenL
Token ID43
Feature activation+8.86e-03

Pos logit contributions
anne+0.012
lights+0.010
storm+0.010
 Susan+0.010
 marsh+0.010
Neg logit contributions
per-0.003
ula-0.003
ster-0.003
 from-0.003
atching-0.003
Tokenily
Token ID813
Feature activation-4.16e-04

Pos logit contributions
ster+0.008
per+0.008
 St+0.008
 from+0.008
gg+0.008
Neg logit contributions
anne-0.020
lights-0.017
storm-0.016
 Susan-0.016
storms-0.015
Token her
Token ID607
Feature activation-1.08e-02

Pos logit contributions
 with+0.005
 from+0.005
 St+0.004
 do+0.004
 this+0.004
Neg logit contributions
anne-0.020
lights-0.019
storm-0.018
storms-0.018
acket-0.017
Token sc
Token ID629
Feature activation+4.73e-03

Pos logit contributions
anne+0.026
lights+0.022
storm+0.022
storms+0.021
rated+0.020
Neg logit contributions
 with-0.011
ula-0.011
 from-0.011
 figure-0.010
ster-0.010
Tokenooter
Token ID25141
Feature activation-2.13e-03

Pos logit contributions
per+0.005
ula+0.005
ster+0.005
 from+0.005
 St+0.005
Neg logit contributions
anne-0.011
lights-0.009
storm-0.008
 Susan-0.008
storms-0.008
Token.
Token ID13
Feature activation-1.33e-03

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
storms+0.004
 Susan+0.004
Neg logit contributions
per-0.002
ula-0.002
 from-0.002
 with-0.002
ster-0.002
Token She
Token ID1375
Feature activation-5.54e-04

Pos logit contributions
anne+0.003
lights+0.003
storm+0.003
 marsh+0.003
storms+0.003
Neg logit contributions
 with-0.001
per-0.001
 St-0.001
ula-0.001
 from-0.001
Token tr
Token ID491
Feature activation-3.75e-02

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
storms+0.001
 Susan+0.001
Neg logit contributions
 with-0.001
per-0.001
 from-0.001
 figure-0.001
 by-0.001
Tokenund
Token ID917
Feature activation-1.26e-02

Pos logit contributions
anne+0.069
storm+0.058
lights+0.057
 marsh+0.055
storms+0.054
Neg logit contributions
ula-0.052
per-0.051
atching-0.049
 with-0.048
yle-0.048
Tokenled
Token ID992
Feature activation-8.25e-03

Pos logit contributions
anne+0.021
storm+0.018
lights+0.017
storms+0.017
 marsh+0.016
Neg logit contributions
per-0.020
ula-0.020
 from-0.019
 with-0.019
ster-0.019
Token along
Token ID1863
Feature activation+6.19e-03

Pos logit contributions
anne+0.023
storm+0.021
lights+0.020
storms+0.019
 marsh+0.019
Neg logit contributions
 with-0.006
 from-0.005
 by-0.005
per-0.004
 St-0.004
Token the
Token ID262
Feature activation-1.59e-03

Pos logit contributions
per+0.004
ula+0.004
 with+0.004
 from+0.004
 St+0.004
Neg logit contributions
anne-0.017
lights-0.014
storm-0.014
storms-0.014
 marsh-0.013
Token sidewalk
Token ID24058
Feature activation+7.86e-03

Pos logit contributions
anne+0.003
lights+0.002
storm+0.002
storms+0.002
 Susan+0.002
Neg logit contributions
per-0.002
ula-0.002
 from-0.002
ster-0.002
 with-0.002
Token a
Token ID257
Feature activation+5.57e-04

Pos logit contributions
anne+0.018
lights+0.015
storm+0.015
storms+0.014
 Susan+0.014
Neg logit contributions
per-0.008
ula-0.008
ster-0.008
 from-0.007
 St-0.007
Token sc
Token ID629
Feature activation+1.43e-03

Pos logit contributions
ula+0.001
per+0.001
ster+0.001
 with+0.001
 St+0.001
Neg logit contributions
anne-0.001
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
Tokenooter
Token ID25141
Feature activation-8.55e-04

Pos logit contributions
 St+0.002
per+0.002
 from+0.002
ula+0.002
ster+0.002
Neg logit contributions
anne-0.003
 Susan-0.002
lights-0.002
jo-0.002
 John-0.002
Token and
Token ID290
Feature activation+2.46e-03

Pos logit contributions
anne+0.002
storm+0.002
lights+0.002
storms+0.002
 Susan+0.001
Neg logit contributions
per-0.001
ula-0.001
ster-0.001
atching-0.001
 St-0.001
Token a
Token ID257
Feature activation+1.58e-03

Pos logit contributions
 with+0.002
 from+0.002
 St+0.002
ula+0.001
 by+0.001
Neg logit contributions
anne-0.007
storm-0.006
lights-0.006
storms-0.006
acket-0.005
Token tr
Token ID491
Feature activation-3.75e-02

Pos logit contributions
ula+0.002
ster+0.002
 with+0.002
 figure+0.002
per+0.002
Neg logit contributions
anne-0.004
lights-0.003
storms-0.003
storm-0.003
 Susan-0.003
Tokenicycle
Token ID35298
Feature activation-7.33e-03

Pos logit contributions
anne+0.061
lights+0.048
storm+0.046
storms+0.043
 Susan+0.043
Neg logit contributions
per-0.064
ula-0.064
ster-0.061
atching-0.060
 with-0.060
Token.
Token ID13
Feature activation-6.68e-03

Pos logit contributions
anne+0.017
lights+0.014
storm+0.014
storms+0.013
 marsh+0.013
Neg logit contributions
per-0.008
ula-0.008
ster-0.007
 St-0.007
atching-0.007
Token She
Token ID1375
Feature activation-2.59e-03

Pos logit contributions
anne+0.018
lights+0.017
storm+0.017
rated+0.016
 hurricane+0.015
Neg logit contributions
 with-0.005
 My-0.004
 by-0.004
 St-0.003
 from-0.003
Token loved
Token ID6151
Feature activation-1.16e-02

Pos logit contributions
anne+0.006
lights+0.005
storm+0.005
storms+0.005
 Susan+0.005
Neg logit contributions
 with-0.003
per-0.003
 by-0.003
 St-0.003
 from-0.003
Token to
Token ID284
Feature activation+1.19e-02

Pos logit contributions
anne+0.030
lights+0.026
storm+0.025
storms+0.024
 marsh+0.023
Neg logit contributions
ula-0.009
per-0.009
 from-0.009
 with-0.009
 St-0.008
Token and
Token ID290
Feature activation-4.94e-03

Pos logit contributions
per+0.006
ula+0.006
 with+0.005
 from+0.005
ster+0.005
Neg logit contributions
anne-0.019
lights-0.016
storm-0.016
 Susan-0.016
storms-0.015
Token eat
Token ID4483
Feature activation-1.90e-03

Pos logit contributions
anne+0.013
lights+0.011
storm+0.011
storms+0.011
 Susan+0.010
Neg logit contributions
 with-0.004
 from-0.004
per-0.004
 by-0.004
 St-0.004
Token your
Token ID534
Feature activation-2.56e-03

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
storms+0.004
 Susan+0.004
Neg logit contributions
ula-0.002
per-0.002
 from-0.001
 with-0.001
ster-0.001
Token food
Token ID2057
Feature activation+1.03e-02

Pos logit contributions
anne+0.006
lights+0.005
storm+0.005
 Susan+0.005
storms+0.005
Neg logit contributions
 with-0.003
ula-0.003
per-0.003
ster-0.003
 by-0.003
Token.
Token ID13
Feature activation-4.06e-03

Pos logit contributions
per+0.009
ula+0.009
 from+0.008
 with+0.008
ster+0.008
Neg logit contributions
anne-0.026
lights-0.022
storm-0.022
 Susan-0.021
storms-0.021
Token Deal
Token ID15138
Feature activation-3.74e-02

Pos logit contributions
anne+0.009
lights+0.007
storm+0.007
storms+0.007
 marsh+0.007
Neg logit contributions
per-0.005
ula-0.005
ster-0.004
 from-0.004
atching-0.004
Token?"
Token ID1701
Feature activation-5.32e-03

Pos logit contributions
anne+0.068
lights+0.055
storm+0.052
 Susan+0.050
storms+0.050
Neg logit contributions
per-0.057
ula-0.057
ster-0.053
atching-0.052
 St-0.051
Token
Token ID198
Feature activation-7.64e-03

Pos logit contributions
anne+0.013
lights+0.011
storm+0.011
storms+0.010
 marsh+0.010
Neg logit contributions
ula-0.005
per-0.005
 with-0.005
 St-0.005
 from-0.005
Token
Token ID198
Feature activation-8.42e-03

Pos logit contributions
anne+0.019
lights+0.017
storm+0.016
storms+0.016
 imag+0.015
Neg logit contributions
ula-0.007
per-0.007
 with-0.007
 St-0.006
 from-0.006
TokenAnna
Token ID31160
Feature activation-9.64e-04

Pos logit contributions
anne+0.020
lights+0.017
storm+0.016
 Susan+0.016
 marsh+0.016
Neg logit contributions
per-0.008
ula-0.008
ster-0.007
 from-0.007
atching-0.007
Token and
Token ID290
Feature activation+7.74e-03

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
 Susan+0.002
storms+0.002
Neg logit contributions
per-0.001
ula-0.001
 with-0.001
 from-0.001
 St-0.001
Token to
Token ID284
Feature activation+1.26e-02

Pos logit contributions
 with+0.000
per+0.000
 from+0.000
 St+0.000
 by+0.000
Neg logit contributions
anne-0.003
lights-0.002
storm-0.002
storms-0.002
 Susan-0.002
Token say
Token ID910
Feature activation+3.66e-03

Pos logit contributions
ula+0.014
per+0.012
ster+0.012
atching+0.012
aser+0.011
Neg logit contributions
anne-0.028
storm-0.023
 marsh-0.022
lights-0.022
 Susan-0.022
Token thank
Token ID5875
Feature activation+1.02e-02

Pos logit contributions
per+0.005
ula+0.004
ster+0.004
atching+0.004
 figure+0.004
Neg logit contributions
anne-0.007
lights-0.006
storm-0.006
 marsh-0.006
storms-0.006
Token you
Token ID345
Feature activation+1.50e-02

Pos logit contributions
ula+0.009
ster+0.009
per+0.008
 figure+0.008
olding+0.008
Neg logit contributions
anne-0.023
lights-0.019
 marsh-0.019
 Susan-0.018
storm-0.018
Token.
Token ID13
Feature activation+4.35e-03

Pos logit contributions
ula+0.015
per+0.015
ster+0.014
atching+0.013
gg+0.013
Neg logit contributions
anne-0.034
lights-0.029
storm-0.028
storms-0.027
 marsh-0.027
Token Deal
Token ID15138
Feature activation-3.74e-02

Pos logit contributions
ula+0.005
per+0.005
ster+0.005
atching+0.005
 from+0.005
Neg logit contributions
anne-0.009
lights-0.008
storm-0.007
 Susan-0.007
 marsh-0.007
Token?"
Token ID1701
Feature activation+3.97e-04

Pos logit contributions
anne+0.064
lights+0.052
storm+0.049
 Susan+0.046
storms+0.046
Neg logit contributions
per-0.060
ula-0.059
ster-0.056
atching-0.055
 St-0.054
Token
Token ID198
Feature activation-4.50e-03

Pos logit contributions
per+0.001
ula+0.000
ster+0.000
atching+0.000
 from+0.000
Neg logit contributions
anne-0.001
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
Token
Token ID198
Feature activation-5.44e-03

Pos logit contributions
anne+0.010
lights+0.008
storm+0.008
storms+0.007
 Susan+0.007
Neg logit contributions
per-0.005
ula-0.005
ster-0.005
 from-0.005
atching-0.005
TokenAnna
Token ID31160
Feature activation-1.46e-03

Pos logit contributions
anne+0.010
lights+0.008
storm+0.007
 Susan+0.007
 marsh+0.007
Neg logit contributions
ula-0.008
per-0.008
ster-0.008
 figure-0.008
atching-0.008
Token nodded
Token ID14464
Feature activation-1.10e-02

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.002
 Susan+0.002
Neg logit contributions
per-0.003
 with-0.002
ster-0.002
 from-0.002
 St-0.002
Token,"
Token ID553
Feature activation-5.27e-03

Pos logit contributions
 with+0.001
 from+0.001
per+0.001
 by+0.001
'+0.001
Neg logit contributions
anne-0.004
lights-0.004
storm-0.003
storms-0.003
 imag-0.003
Token she
Token ID673
Feature activation-1.34e-02

Pos logit contributions
anne+0.011
storm+0.010
lights+0.010
storms+0.009
rated+0.008
Neg logit contributions
 with-0.007
 from-0.007
 St-0.006
 by-0.006
ula-0.006
Token said
Token ID531
Feature activation-1.20e-02

Pos logit contributions
lights+0.030
anne+0.030
storm+0.028
storms+0.026
 hurricane+0.025
Neg logit contributions
 from-0.013
 with-0.013
 St-0.013
 by-0.012
per-0.012
Token.
Token ID13
Feature activation-5.06e-03

Pos logit contributions
anne+0.031
lights+0.028
storm+0.027
storms+0.026
 imag+0.025
Neg logit contributions
ula-0.009
per-0.009
 with-0.009
 from-0.009
 by-0.008
Token "
Token ID366
Feature activation-2.35e-04

Pos logit contributions
anne+0.014
lights+0.013
storm+0.012
rated+0.012
 imag+0.011
Neg logit contributions
 with-0.003
 St-0.003
ula-0.003
 from-0.003
per-0.003
TokenNot
Token ID3673
Feature activation-3.72e-02

Pos logit contributions
anne+0.000
lights+0.000
storm+0.000
storms+0.000
 Susan+0.000
Neg logit contributions
per-0.000
ula-0.000
ster-0.000
 from-0.000
 St-0.000
Token today
Token ID1909
Feature activation-5.06e-04

Pos logit contributions
anne+0.103
lights+0.086
storms+0.083
storm+0.082
rones+0.081
Neg logit contributions
 with-0.029
 by-0.029
 from-0.029
 for-0.026
 this-0.026
Token.
Token ID13
Feature activation-4.65e-03

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
 imag+0.001
storms+0.001
Neg logit contributions
 with-0.000
 from-0.000
 by-0.000
'-0.000
 you-0.000
Token But
Token ID887
Feature activation-1.25e-02

Pos logit contributions
anne+0.011
lights+0.009
storm+0.009
storms+0.008
 Susan+0.008
Neg logit contributions
 St-0.005
ula-0.005
per-0.005
 with-0.005
 from-0.004
Token let
Token ID1309
Feature activation-4.85e-04

Pos logit contributions
anne+0.033
lights+0.028
storm+0.027
storms+0.027
 Susan+0.025
Neg logit contributions
 from-0.010
per-0.009
 with-0.009
 by-0.009
ula-0.009
Tokenâ
Token ID22940
Feature activation-1.64e-02

Pos logit contributions
anne+0.002
lights+0.001
storm+0.001
storms+0.001
 imag+0.001
Neg logit contributions
 from-0.000
'-0.000
per-0.000
ula-0.000
 with-0.000
Token have
Token ID423
Feature activation-2.81e-03

Pos logit contributions
per+0.004
 from+0.004
 with+0.004
ula+0.004
 by+0.004
Neg logit contributions
anne-0.009
lights-0.008
storm-0.008
storms-0.007
 Susan-0.007
Token to
Token ID284
Feature activation+5.67e-03

Pos logit contributions
anne+0.009
lights+0.008
storm+0.008
storms+0.007
jo+0.007
Neg logit contributions
 with-0.001
 by-0.001
 from-0.001
 St-0.001
 this-0.001
Token say
Token ID910
Feature activation-1.90e-03

Pos logit contributions
per+0.005
ula+0.005
 from+0.005
 with+0.004
ster+0.004
Neg logit contributions
anne-0.014
lights-0.012
storm-0.012
storms-0.011
 Susan-0.011
Token okay
Token ID8788
Feature activation-4.90e-03

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
storms+0.004
 Susan+0.004
Neg logit contributions
ula-0.002
per-0.002
 from-0.002
 with-0.002
ster-0.002
Token.
Token ID13
Feature activation-8.68e-04

Pos logit contributions
anne+0.012
lights+0.011
storm+0.010
storms+0.010
 marsh+0.009
Neg logit contributions
 with-0.005
 from-0.005
 by-0.005
per-0.004
 St-0.004
Token Deal
Token ID15138
Feature activation-3.66e-02

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.002
 Susan+0.002
Neg logit contributions
per-0.001
ula-0.001
 St-0.001
 from-0.001
ster-0.001
Token?"
Token ID1701
Feature activation-1.80e-03

Pos logit contributions
anne+0.063
lights+0.051
storm+0.048
 Susan+0.046
storms+0.046
Neg logit contributions
per-0.058
ula-0.058
ster-0.055
atching-0.053
 St-0.052
Token
Token ID198
Feature activation-2.52e-03

Pos logit contributions
anne+0.004
lights+0.003
storm+0.003
storms+0.003
 Susan+0.003
Neg logit contributions
ula-0.002
per-0.002
 from-0.002
ster-0.002
 St-0.002
Token
Token ID198
Feature activation-5.03e-03

Pos logit contributions
anne+0.006
lights+0.005
storm+0.005
storms+0.005
 marsh+0.005
Neg logit contributions
ula-0.003
per-0.003
 with-0.002
 from-0.002
 St-0.002
TokenL
Token ID43
Feature activation+1.07e-02

Pos logit contributions
anne+0.013
lights+0.010
 Susan+0.010
storm+0.010
storms+0.010
Neg logit contributions
per-0.004
ula-0.004
ster-0.004
atching-0.004
 from-0.004
Tokenily
Token ID813
Feature activation-3.66e-04

Pos logit contributions
 solve+0.012
ster+0.012
gg+0.012
 figure+0.012
per+0.011
Neg logit contributions
anne-0.022
lights-0.017
jo-0.016
 Susan-0.016
storm-0.015
Token,
Token ID11
Feature activation-4.36e-03

Pos logit contributions
 from+0.002
 with+0.002
ula+0.001
per+0.001
 by+0.001
Neg logit contributions
anne-0.006
lights-0.005
storm-0.005
storms-0.005
 Susan-0.005
Token I
Token ID314
Feature activation+4.09e-03

Pos logit contributions
anne+0.011
lights+0.009
storm+0.009
storms+0.009
 Susan+0.008
Neg logit contributions
per-0.004
ula-0.004
 from-0.004
ster-0.004
 with-0.003
Token can
Token ID460
Feature activation+7.89e-04

Pos logit contributions
per+0.005
ula+0.005
 from+0.005
 with+0.005
ster+0.005
Neg logit contributions
anne-0.009
lights-0.007
storm-0.007
storms-0.007
 Susan-0.007
Token play
Token ID711
Feature activation+1.05e-02

Pos logit contributions
per+0.001
ula+0.001
ster+0.001
 from+0.001
 St+0.001
Neg logit contributions
anne-0.002
lights-0.002
storm-0.002
storms-0.002
 Susan-0.002
Token the
Token ID262
Feature activation+8.11e-03

Pos logit contributions
ula+0.008
per+0.007
atching+0.007
olding+0.007
ster+0.007
Neg logit contributions
anne-0.025
lights-0.023
storm-0.022
 marsh-0.022
storms-0.021
Token fl
Token ID781
Feature activation+2.84e-02

Pos logit contributions
per+0.010
ula+0.010
ster+0.009
atching+0.009
 from+0.009
Neg logit contributions
anne-0.017
lights-0.014
storm-0.014
storms-0.013
 marsh-0.013
Tokenute
Token ID1133
Feature activation+4.58e-03

Pos logit contributions
 St+0.025
 from+0.024
per+0.023
ula+0.023
 solve+0.023
Neg logit contributions
anne-0.065
lights-0.057
storm-0.056
storms-0.050
 marsh-0.049
Token like
Token ID588
Feature activation+8.38e-03

Pos logit contributions
per+0.004
 with+0.004
 from+0.004
ula+0.004
 by+0.004
Neg logit contributions
anne-0.012
lights-0.010
storm-0.009
 Susan-0.009
storms-0.009
Token you
Token ID345
Feature activation+3.35e-03

Pos logit contributions
 with+0.005
 from+0.004
ula+0.004
 St+0.004
 by+0.004
Neg logit contributions
anne-0.025
lights-0.021
storm-0.021
rated-0.020
storms-0.020
Token."
Token ID526
Feature activation-2.19e-03

Pos logit contributions
per+0.003
 with+0.003
 from+0.003
'+0.003
 by+0.003
Neg logit contributions
anne-0.008
lights-0.007
storm-0.007
 Susan-0.007
storms-0.007
Token
Token ID198
Feature activation+1.96e-03

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
storms+0.004
 marsh+0.004
Neg logit contributions
per-0.002
ula-0.002
 with-0.002
 from-0.002
 St-0.002
Token's
Token ID338
Feature activation-8.15e-03

Pos logit contributions
anne+0.003
lights+0.003
storm+0.003
storms+0.002
 marsh+0.002
Neg logit contributions
per-0.003
 with-0.002
 from-0.002
ula-0.002
 by-0.002
Token house
Token ID2156
Feature activation+1.44e-03

Pos logit contributions
anne+0.019
lights+0.016
storm+0.015
storms+0.015
 marsh+0.014
Neg logit contributions
per-0.008
 from-0.008
 with-0.008
ula-0.008
 by-0.008
Token was
Token ID373
Feature activation-1.04e-03

Pos logit contributions
 with+0.002
 from+0.002
per+0.002
 by+0.002
ula+0.001
Neg logit contributions
anne-0.003
lights-0.003
storm-0.003
storms-0.003
 marsh-0.003
Token built
Token ID3170
Feature activation+6.99e-03

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.002
 marsh+0.002
Neg logit contributions
 with-0.001
 from-0.001
per-0.001
 by-0.001
ula-0.001
Token long
Token ID890
Feature activation-3.34e-04

Pos logit contributions
per+0.006
ula+0.006
 from+0.005
 with+0.005
 by+0.005
Neg logit contributions
anne-0.018
lights-0.015
storm-0.015
storms-0.014
 Susan-0.014
Token ago
Token ID2084
Feature activation+2.98e-02

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
 Susan+0.001
die+0.001
Neg logit contributions
 with-0.000
 from-0.000
per-0.000
 by-0.000
 figure-0.000
Token.
Token ID13
Feature activation-4.47e-03

Pos logit contributions
per+0.023
ula+0.022
 with+0.021
 from+0.021
ster+0.019
Neg logit contributions
anne-0.078
lights-0.067
storm-0.066
storms-0.063
 marsh-0.062
Token
Token ID198
Feature activation-6.85e-04

Pos logit contributions
anne+0.011
lights+0.009
storm+0.009
 imag+0.009
storms+0.009
Neg logit contributions
ula-0.004
 with-0.004
 St-0.004
 from-0.004
 by-0.004
Token
Token ID198
Feature activation+9.32e-04

Pos logit contributions
anne+0.002
lights+0.001
storm+0.001
storms+0.001
 marsh+0.001
Neg logit contributions
ula-0.001
 with-0.001
per-0.001
 from-0.001
 St-0.001
TokenL
Token ID43
Feature activation+5.46e-03

Pos logit contributions
ster+0.001
per+0.001
ula+0.001
 permission+0.001
aser+0.001
Neg logit contributions
anne-0.002
storm-0.002
lights-0.002
 Susan-0.002
 marsh-0.002
Tokenily
Token ID813
Feature activation+4.63e-03

Pos logit contributions
per+0.005
ster+0.005
ula+0.005
 St+0.005
gg+0.004
Neg logit contributions
anne-0.013
lights-0.011
storm-0.010
 Susan-0.010
storms-0.010
Token She
Token ID1375
Feature activation+1.33e-03

Pos logit contributions
ula+0.002
per+0.002
ster+0.002
 from+0.002
 St+0.002
Neg logit contributions
anne-0.003
lights-0.003
storm-0.003
storms-0.003
 Susan-0.003
Token thought
Token ID1807
Feature activation-1.79e-02

Pos logit contributions
per+0.002
 with+0.002
 St+0.002
 from+0.002
 figure+0.002
Neg logit contributions
anne-0.003
lights-0.003
storm-0.002
storms-0.002
 Susan-0.002
Token the
Token ID262
Feature activation+2.78e-03

Pos logit contributions
anne+0.043
lights+0.036
storm+0.035
storms+0.034
 Susan+0.033
Neg logit contributions
ula-0.018
per-0.018
 from-0.016
ster-0.016
 with-0.016
Token rh
Token ID9529
Feature activation+1.60e-02

Pos logit contributions
ula+0.005
per+0.005
atching+0.005
or+0.005
olding+0.004
Neg logit contributions
anne-0.004
storm-0.003
 marsh-0.003
storms-0.003
 hurricane-0.003
Tokenin
Token ID259
Feature activation-6.67e-03

Pos logit contributions
ula+0.022
per+0.020
 St+0.019
 from+0.018
 solve+0.018
Neg logit contributions
anne-0.030
lights-0.026
storm-0.026
storms-0.024
 snowball-0.023
Tokenoc
Token ID420
Feature activation+3.54e-02

Pos logit contributions
anne+0.018
lights+0.015
storm+0.015
storms+0.014
 Susan+0.014
Neg logit contributions
per-0.005
ula-0.005
ster-0.004
 from-0.004
 St-0.004
Tokeneros
Token ID27498
Feature activation+2.61e-04

Pos logit contributions
ula+0.058
per+0.053
 solve+0.051
 figure+0.050
'+0.050
Neg logit contributions
anne-0.059
storm-0.050
lights-0.048
storms-0.048
 marsh-0.043
Token might
Token ID1244
Feature activation-1.36e-03

Pos logit contributions
 with+0.000
 from+0.000
per+0.000
 by+0.000
ster+0.000
Neg logit contributions
anne-0.001
lights-0.001
storm-0.000
 marsh-0.000
 Susan-0.000
Token like
Token ID588
Feature activation-3.20e-04

Pos logit contributions
anne+0.003
lights+0.003
storm+0.003
storms+0.002
 Susan+0.002
Neg logit contributions
per-0.001
ula-0.001
 from-0.001
ster-0.001
 with-0.001
Token the
Token ID262
Feature activation-3.40e-03

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
storms+0.001
 Susan+0.001
Neg logit contributions
 from-0.000
per-0.000
ula-0.000
 with-0.000
 St-0.000
Token leaves
Token ID5667
Feature activation-1.67e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
storms+0.005
 Susan+0.005
Neg logit contributions
per-0.004
ula-0.004
ster-0.004
 from-0.004
 St-0.004
Token sw
Token ID1509
Feature activation+4.46e-03

Pos logit contributions
anne+0.006
lights+0.005
 Susan+0.005
storm+0.004
 Jessie+0.004
Neg logit contributions
 with-0.004
per-0.004
 from-0.003
 by-0.003
'-0.003
Tokenam
Token ID321
Feature activation+3.72e-03

Pos logit contributions
per+0.006
ster+0.005
ula+0.005
atching+0.005
 with+0.005
Neg logit contributions
anne-0.010
storm-0.008
lights-0.008
 Susan-0.008
storms-0.008
Token away
Token ID1497
Feature activation+1.89e-03

Pos logit contributions
 with+0.003
per+0.003
 from+0.003
 by+0.003
ula+0.003
Neg logit contributions
anne-0.010
lights-0.008
storm-0.008
storms-0.008
 Susan-0.008
Token.
Token ID13
Feature activation+4.06e-03

Pos logit contributions
per+0.001
ula+0.001
 from+0.001
 with+0.001
ster+0.001
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
storms-0.004
 Susan-0.004
Token The
Token ID383
Feature activation+9.90e-03

Pos logit contributions
 St+0.004
ula+0.004
 with+0.004
per+0.004
 from+0.004
Neg logit contributions
anne-0.010
lights-0.008
storm-0.008
storms-0.007
 imag-0.007
Token ot
Token ID30972
Feature activation+3.00e-02

Pos logit contributions
ula+0.011
per+0.011
ster+0.010
 St+0.010
 from+0.010
Neg logit contributions
anne-0.022
lights-0.019
storm-0.018
storms-0.017
 marsh-0.017
Tokenter
Token ID353
Feature activation+1.04e-02

Pos logit contributions
ula+0.033
 St+0.029
 solve+0.029
per+0.029
 from+0.027
Neg logit contributions
anne-0.061
storm-0.055
lights-0.052
storms-0.052
 Susan-0.049
Token thought
Token ID1807
Feature activation-4.94e-03

Pos logit contributions
 with+0.012
per+0.012
 by+0.011
 from+0.011
'+0.011
Neg logit contributions
anne-0.023
lights-0.021
storm-0.019
 Susan-0.019
storms-0.019
Token it
Token ID340
Feature activation+6.16e-05

Pos logit contributions
anne+0.012
lights+0.010
storm+0.009
storms+0.009
 Susan+0.009
Neg logit contributions
per-0.005
ula-0.005
ster-0.005
atching-0.004
 St-0.004
Token was
Token ID373
Feature activation+2.09e-03

Pos logit contributions
 with+0.000
 by+0.000
 from+0.000
 do+0.000
 figure+0.000
Neg logit contributions
anne-0.000
lights-0.000
storm-0.000
rones-0.000
 Susan-0.000
Token so
Token ID523
Feature activation+6.21e-03

Pos logit contributions
 from+0.002
 with+0.002
 by+0.002
 St+0.001
 those+0.001
Neg logit contributions
anne-0.006
lights-0.005
storm-0.005
storms-0.005
jo-0.004
Token,
Token ID11
Feature activation+1.40e-02

Pos logit contributions
ula+0.006
per+0.006
atching+0.005
ster+0.005
olding+0.005
Neg logit contributions
anne-0.015
lights-0.013
 Susan-0.013
storm-0.012
storms-0.012
Token something
Token ID1223
Feature activation-3.50e-03

Pos logit contributions
per+0.013
ula+0.013
 from+0.012
ster+0.012
 with+0.012
Neg logit contributions
anne-0.034
lights-0.029
storm-0.028
storms-0.027
 marsh-0.027
Token strange
Token ID6283
Feature activation-5.19e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
storms+0.005
 Susan+0.005
Neg logit contributions
ula-0.005
 from-0.005
 with-0.004
per-0.004
 by-0.004
Token caught
Token ID4978
Feature activation+4.95e-03

Pos logit contributions
anne+0.007
lights+0.005
die+0.005
 marsh+0.005
 Amen+0.005
Neg logit contributions
 from-0.011
 with-0.010
 by-0.010
 figure-0.009
 do-0.009
Token her
Token ID607
Feature activation+2.81e-03

Pos logit contributions
ula+0.005
per+0.005
ster+0.005
 St+0.005
atching+0.005
Neg logit contributions
anne-0.011
lights-0.010
storm-0.009
 Susan-0.009
storms-0.009
Token eye
Token ID4151
Feature activation+2.72e-02

Pos logit contributions
per+0.003
ula+0.003
atching+0.002
 St+0.002
 solve+0.002
Neg logit contributions
anne-0.006
lights-0.006
storm-0.005
 marsh-0.005
storms-0.005
Token.
Token ID13
Feature activation+8.91e-03

Pos logit contributions
per+0.028
ula+0.028
 St+0.027
 solve+0.025
atching+0.025
Neg logit contributions
anne-0.061
lights-0.052
storm-0.052
 marsh-0.048
storms-0.048
Token It
Token ID632
Feature activation-5.93e-03

Pos logit contributions
per+0.012
ula+0.012
ster+0.011
 from+0.011
 St+0.011
Neg logit contributions
anne-0.018
lights-0.015
storm-0.014
storms-0.014
 Susan-0.014
Token was
Token ID373
Feature activation+5.65e-03

Pos logit contributions
anne+0.016
lights+0.014
 Susan+0.013
storms+0.013
acket+0.012
Neg logit contributions
 from-0.005
 with-0.005
 by-0.004
 figure-0.004
 do-0.003
Token a
Token ID257
Feature activation+7.68e-03

Pos logit contributions
ula+0.006
per+0.006
 from+0.006
 with+0.005
ster+0.005
Neg logit contributions
anne-0.013
lights-0.011
storm-0.011
storms-0.010
 Susan-0.010
Token very
Token ID845
Feature activation+3.27e-03

Pos logit contributions
per+0.010
ula+0.010
ster+0.009
 from+0.009
 St+0.009
Neg logit contributions
anne-0.016
lights-0.013
storm-0.013
storms-0.012
 Susan-0.012
Token red
Token ID2266
Feature activation-2.89e-03

Pos logit contributions
anne+0.040
lights+0.033
storm+0.032
storms+0.031
 Susan+0.030
Neg logit contributions
per-0.018
 with-0.017
 from-0.017
 by-0.017
 figure-0.017
Token je
Token ID11223
Feature activation+5.72e-03

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
 marsh+0.004
 hurricane+0.004
Neg logit contributions
ula-0.004
per-0.004
etic-0.004
atching-0.004
gg-0.004
Tokenep
Token ID538
Feature activation-1.33e-03

Pos logit contributions
ula+0.013
 St+0.012
 from+0.012
per+0.012
 Cel+0.012
Neg logit contributions
anne-0.006
lights-0.004
storm-0.004
 snowball-0.003
storms-0.003
Token.
Token ID13
Feature activation-8.64e-03

Pos logit contributions
anne+0.004
lights+0.003
storm+0.003
 Susan+0.003
 marsh+0.003
Neg logit contributions
per-0.001
 from-0.001
 with-0.001
ula-0.001
 by-0.001
Token<|endoftext|>
Token ID50256
Feature activation-1.31e-02

Pos logit contributions
anne+0.022
lights+0.020
storm+0.019
storms+0.018
 imag+0.018
Neg logit contributions
 with-0.008
 from-0.007
 St-0.007
 by-0.006
ula-0.006
TokenOnce
Token ID7454
Feature activation+9.10e-03

Pos logit contributions
anne+0.034
lights+0.030
storm+0.029
storms+0.027
 Susan+0.027
Neg logit contributions
 with-0.011
 from-0.010
 by-0.010
ula-0.010
per-0.010
Token upon
Token ID2402
Feature activation-1.58e-03

Pos logit contributions
ula+0.007
per+0.007
 from+0.007
 with+0.006
ster+0.006
Neg logit contributions
anne-0.023
lights-0.020
storm-0.020
storms-0.019
 Susan-0.018
Token a
Token ID257
Feature activation-7.27e-03

Pos logit contributions
anne+0.004
lights+0.004
storm+0.003
storms+0.003
 marsh+0.003
Neg logit contributions
 with-0.001
 from-0.001
per-0.001
ula-0.001
 by-0.001
Token time
Token ID640
Feature activation-3.63e-03

Pos logit contributions
anne+0.013
lights+0.013
storms+0.012
storm+0.012
 Susan+0.011
Neg logit contributions
 with-0.010
 by-0.009
'-0.009
per-0.009
"-0.009
Token,
Token ID11
Feature activation+8.73e-04

Pos logit contributions
anne+0.009
lights+0.008
storm+0.008
 Susan+0.007
storms+0.007
Neg logit contributions
per-0.003
ula-0.003
ster-0.003
atching-0.002
 St-0.002
Token there
Token ID612
Feature activation-3.31e-03

Pos logit contributions
per+0.001
ula+0.001
ster+0.001
 St+0.001
atching+0.001
Neg logit contributions
anne-0.002
lights-0.002
storm-0.002
 Susan-0.002
storms-0.002
Token teasing
Token ID34417
Feature activation-6.35e-04

Pos logit contributions
per+0.002
ula+0.002
ster+0.002
 from+0.002
 with+0.002
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
storms-0.004
 Susan-0.004
Token him
Token ID683
Feature activation+4.50e-03

Pos logit contributions
anne+0.002
lights+0.001
storm+0.001
storms+0.001
 marsh+0.001
Neg logit contributions
per-0.001
 from-0.001
 with-0.001
ula-0.001
 by-0.001
Token.
Token ID13
Feature activation+7.65e-04

Pos logit contributions
ula+0.004
per+0.004
ster+0.004
 from+0.003
 St+0.003
Neg logit contributions
anne-0.011
lights-0.009
storm-0.009
storms-0.009
 Susan-0.009
Token The
Token ID383
Feature activation+3.52e-03

Pos logit contributions
 St+0.001
ula+0.001
per+0.001
 with+0.001
 from+0.001
Neg logit contributions
anne-0.002
lights-0.002
storm-0.002
storms-0.001
rated-0.001
Token lion
Token ID18744
Feature activation+8.37e-03

Pos logit contributions
ula+0.005
per+0.005
ster+0.005
 from+0.005
atching+0.005
Neg logit contributions
anne-0.007
lights-0.005
storm-0.005
storms-0.005
 Susan-0.005
Token lived
Token ID5615
Feature activation+1.14e-02

Pos logit contributions
per+0.010
 with+0.010
 from+0.010
ster+0.009
 by+0.009
Neg logit contributions
anne-0.018
lights-0.016
storm-0.015
storms-0.014
 Susan-0.014
Token happily
Token ID18177
Feature activation-2.94e-03

Pos logit contributions
per+0.018
ula+0.018
 with+0.017
 from+0.017
ster+0.017
Neg logit contributions
anne-0.020
lights-0.016
storm-0.016
storms-0.015
 Susan-0.014
Token ever
Token ID1683
Feature activation+1.09e-02

Pos logit contributions
anne+0.006
storm+0.005
lights+0.005
storms+0.004
 Susan+0.004
Neg logit contributions
ula-0.004
per-0.004
atching-0.004
ster-0.004
 St-0.004
Token after
Token ID706
Feature activation+1.46e-03

Pos logit contributions
ula+0.011
atching+0.009
per+0.009
ster+0.009
 St+0.009
Neg logit contributions
anne-0.026
storm-0.022
storms-0.021
 Susan-0.020
lights-0.020
Token and
Token ID290
Feature activation-1.88e-04

Pos logit contributions
per+0.001
ula+0.001
 with+0.001
 from+0.001
 by+0.001
Neg logit contributions
anne-0.004
lights-0.003
storm-0.003
storms-0.003
 Susan-0.003
Token was
Token ID373
Feature activation-4.71e-03

Pos logit contributions
anne+0.000
lights+0.000
storm+0.000
storms+0.000
 marsh+0.000
Neg logit contributions
 with-0.000
 from-0.000
 by-0.000
 St-0.000
per-0.000
Token class
Token ID1398
Feature activation-2.93e-03

Pos logit contributions
per+0.009
ula+0.009
ster+0.008
 from+0.008
atching+0.008
Neg logit contributions
anne-0.016
lights-0.014
storm-0.013
storms-0.013
 Susan-0.012
Token a
Token ID257
Feature activation+5.68e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
storms+0.006
 marsh+0.005
Neg logit contributions
ula-0.003
per-0.003
 from-0.003
 with-0.003
ster-0.003
Token quiz
Token ID38964
Feature activation+3.66e-03

Pos logit contributions
per+0.008
ula+0.007
ster+0.007
atching+0.007
 from+0.006
Neg logit contributions
anne-0.012
lights-0.010
storm-0.009
 marsh-0.009
 Susan-0.009
Token.
Token ID13
Feature activation-7.32e-03

Pos logit contributions
per+0.003
ula+0.003
ster+0.003
 St+0.003
 from+0.003
Neg logit contributions
anne-0.009
lights-0.008
storm-0.008
storms-0.007
 Susan-0.007
Token Tim
Token ID5045
Feature activation-7.88e-03

Pos logit contributions
anne+0.014
lights+0.011
storm+0.011
storms+0.010
 marsh+0.010
Neg logit contributions
ula-0.011
per-0.010
 with-0.010
 from-0.010
 St-0.010
Tokenmy
Token ID1820
Feature activation+9.42e-03

Pos logit contributions
anne+0.015
lights+0.012
storm+0.012
 marsh+0.011
storms+0.011
Neg logit contributions
per-0.012
ula-0.011
ster-0.011
 with-0.011
 from-0.011
Token didn
Token ID1422
Feature activation-7.32e-03

Pos logit contributions
per+0.012
ula+0.012
ster+0.011
 from+0.011
 with+0.011
Neg logit contributions
anne-0.020
lights-0.016
storm-0.016
storms-0.015
 Susan-0.015
Token't
Token ID470
Feature activation+2.82e-03

Pos logit contributions
anne+0.013
storm+0.011
lights+0.011
 John+0.011
 Susan+0.010
Neg logit contributions
ster-0.011
per-0.011
 cents-0.010
ater-0.009
olding-0.009
Token know
Token ID760
Feature activation-1.05e-02

Pos logit contributions
ula+0.003
per+0.003
ster+0.003
atching+0.003
 St+0.003
Neg logit contributions
anne-0.006
lights-0.005
storm-0.005
storms-0.005
 Susan-0.005
Token any
Token ID597
Feature activation-2.38e-03

Pos logit contributions
anne+0.023
lights+0.021
 Susan+0.020
 John+0.019
storms+0.019
Neg logit contributions
per-0.011
ula-0.011
ster-0.010
atching-0.010
gg-0.009
Token of
Token ID286
Feature activation-4.00e-03

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
storms+0.004
 Susan+0.004
Neg logit contributions
per-0.003
ula-0.003
ster-0.003
atching-0.003
 St-0.002
Token then
Token ID788
Feature activation+1.05e-02

Pos logit contributions
anne+0.014
lights+0.011
storm+0.011
storms+0.011
 Susan+0.010
Neg logit contributions
per-0.007
 from-0.006
ula-0.006
 with-0.006
ster-0.006
Token,
Token ID11
Feature activation+2.99e-04

Pos logit contributions
 with+0.005
 from+0.004
per+0.004
ula+0.004
 St+0.004
Neg logit contributions
anne-0.030
lights-0.027
storm-0.026
storms-0.025
 marsh-0.025
Token a
Token ID257
Feature activation-4.00e-03

Pos logit contributions
 with+0.000
 from+0.000
ula+0.000
 by+0.000
 solve+0.000
Neg logit contributions
anne-0.001
lights-0.001
storm-0.001
storms-0.001
 marsh-0.001
Token friendly
Token ID8030
Feature activation-9.59e-04

Pos logit contributions
anne+0.010
lights+0.009
storms+0.008
storm+0.008
 Susan+0.008
Neg logit contributions
ula-0.003
 figure-0.003
 with-0.003
ster-0.003
 from-0.003
Token King
Token ID2677
Feature activation+3.78e-03

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.002
 marsh+0.002
Neg logit contributions
per-0.001
ula-0.001
 from-0.001
 with-0.001
ster-0.001
Token appeared
Token ID4120
Feature activation+1.09e-02

Pos logit contributions
per+0.006
ula+0.006
ster+0.006
 from+0.006
 with+0.006
Neg logit contributions
anne-0.006
lights-0.005
storm-0.005
storms-0.005
 marsh-0.005
Token.
Token ID13
Feature activation-5.78e-04

Pos logit contributions
ula+0.011
per+0.010
atching+0.010
gg+0.010
ster+0.010
Neg logit contributions
anne-0.025
lights-0.021
storm-0.020
 Susan-0.020
storms-0.020
Token He
Token ID679
Feature activation+2.80e-03

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
 marsh+0.001
storms+0.001
Neg logit contributions
per-0.001
 from-0.001
 with-0.001
ula-0.001
 St-0.001
Token said
Token ID531
Feature activation-6.99e-03

Pos logit contributions
 with+0.004
 from+0.004
per+0.004
ster+0.003
 by+0.003
Neg logit contributions
anne-0.006
lights-0.005
storm-0.005
storms-0.004
 marsh-0.004
Token,
Token ID11
Feature activation+6.22e-03

Pos logit contributions
anne+0.017
 Susan+0.014
lights+0.014
storm+0.014
storms+0.013
Neg logit contributions
ula-0.007
per-0.007
atching-0.006
ster-0.005
gg-0.005
Token �
Token ID6184
Feature activation-1.46e-02

Pos logit contributions
per+0.007
ula+0.007
ster+0.006
atching+0.006
gg+0.006
Neg logit contributions
anne-0.014
storm-0.011
 Susan-0.011
lights-0.011
storms-0.011
Token,
Token ID11
Feature activation-2.49e-03

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
 Susan+0.001
 marsh+0.001
Neg logit contributions
 with-0.000
 from-0.000
per-0.000
 by-0.000
ster-0.000
Token as
Token ID355
Feature activation-6.74e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
storms+0.006
 marsh+0.006
Neg logit contributions
 with-0.001
 from-0.001
 by-0.001
per-0.001
ula-0.001
Token everyone
Token ID2506
Feature activation+4.43e-03

Pos logit contributions
anne+0.018
storm+0.015
lights+0.015
storms+0.014
rated+0.014
Neg logit contributions
 from-0.006
 with-0.006
 by-0.006
 St-0.006
ula-0.005
Token learned
Token ID4499
Feature activation+9.46e-03

Pos logit contributions
 with+0.006
 from+0.006
 by+0.006
'+0.005
per+0.005
Neg logit contributions
anne-0.009
lights-0.008
storm-0.008
 marsh-0.007
 Susan-0.007
Token and
Token ID290
Feature activation-1.15e-03

Pos logit contributions
per+0.009
ula+0.009
ster+0.008
 from+0.008
 St+0.008
Neg logit contributions
anne-0.023
lights-0.020
storm-0.019
storms-0.018
 Susan-0.018
Token played
Token ID2826
Feature activation+8.18e-03

Pos logit contributions
anne+0.003
lights+0.002
storm+0.002
storms+0.002
acket+0.002
Neg logit contributions
 from-0.002
 with-0.001
 by-0.001
 St-0.001
 solve-0.001
Token together
Token ID1978
Feature activation+8.85e-04

Pos logit contributions
per+0.007
ula+0.007
 with+0.007
 from+0.006
ster+0.006
Neg logit contributions
anne-0.021
lights-0.018
storm-0.017
storms-0.017
 Susan-0.016
Token.
Token ID13
Feature activation-3.69e-03

Pos logit contributions
ula+0.001
per+0.001
ster+0.001
atching+0.001
 St+0.001
Neg logit contributions
anne-0.002
lights-0.002
storm-0.002
storms-0.002
 marsh-0.002
Token<|endoftext|>
Token ID50256
Feature activation-1.59e-02

Pos logit contributions
anne+0.010
lights+0.009
storm+0.009
storms+0.008
 imag+0.008
Neg logit contributions
 with-0.003
 from-0.003
 St-0.002
 by-0.002
ula-0.002
TokenOnce
Token ID7454
Feature activation+1.04e-02

Pos logit contributions
anne+0.042
lights+0.037
storm+0.036
storms+0.034
 imag+0.033
Neg logit contributions
 with-0.013
 from-0.012
 by-0.012
ula-0.011
 St-0.011
Token upon
Token ID2402
Feature activation-3.86e-03

Pos logit contributions
ula+0.008
per+0.008
 from+0.007
ster+0.007
 with+0.007
Neg logit contributions
anne-0.027
lights-0.023
storm-0.023
storms-0.022
 Susan-0.021
Token.
Token ID13
Feature activation+2.16e-03

Pos logit contributions
anne+0.016
lights+0.014
storm+0.013
storms+0.013
 Susan+0.013
Neg logit contributions
ula-0.007
per-0.007
ster-0.006
 St-0.006
atching-0.006
Token She
Token ID1375
Feature activation-8.68e-04

Pos logit contributions
per+0.002
ula+0.002
 with+0.002
 from+0.002
 St+0.002
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
storms-0.004
 marsh-0.004
Token peek
Token ID27185
Feature activation+8.85e-04

Pos logit contributions
anne+0.002
lights+0.002
storm+0.001
storms+0.001
 marsh+0.001
Neg logit contributions
 with-0.001
 from-0.001
per-0.001
 figure-0.001
 by-0.001
Tokened
Token ID276
Feature activation-9.03e-03

Pos logit contributions
per+0.001
ula+0.001
ster+0.001
 from+0.001
 St+0.001
Neg logit contributions
anne-0.002
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
Token around
Token ID1088
Feature activation+4.07e-04

Pos logit contributions
anne+0.019
lights+0.016
storm+0.016
 Susan+0.015
storms+0.015
Neg logit contributions
ula-0.011
per-0.011
ster-0.010
atching-0.010
 St-0.010
Token the
Token ID262
Feature activation+2.77e-03

Pos logit contributions
ula+0.000
atching+0.000
 cents+0.000
per+0.000
ster+0.000
Neg logit contributions
anne-0.001
 Susan-0.001
lights-0.001
 John-0.001
 Jessie-0.001
Token corner
Token ID5228
Feature activation-6.26e-03

Pos logit contributions
per+0.005
ula+0.004
atching+0.004
ster+0.004
 St+0.004
Neg logit contributions
anne-0.005
lights-0.004
storm-0.004
 marsh-0.003
storms-0.003
Token and
Token ID290
Feature activation+4.95e-03

Pos logit contributions
anne+0.016
lights+0.014
storm+0.013
 Susan+0.013
storms+0.013
Neg logit contributions
ula-0.005
per-0.005
ster-0.004
 St-0.004
atching-0.004
Token saw
Token ID2497
Feature activation+9.44e-03

Pos logit contributions
per+0.008
ula+0.008
 from+0.008
ster+0.008
 with+0.008
Neg logit contributions
anne-0.008
lights-0.007
storm-0.006
storms-0.006
 marsh-0.006
Token a
Token ID257
Feature activation+6.35e-04

Pos logit contributions
 from+0.007
ula+0.007
 with+0.007
 St+0.007
per+0.006
Neg logit contributions
anne-0.025
storm-0.022
lights-0.022
storms-0.020
rated-0.020
Token pretty
Token ID2495
Feature activation-4.64e-03

Pos logit contributions
ula+0.001
per+0.001
ster+0.001
 from+0.001
 with+0.001
Neg logit contributions
anne-0.001
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
Token!"
Token ID2474
Feature activation+6.50e-03

Pos logit contributions
anne+0.004
storm+0.003
lights+0.003
 hurricane+0.003
acket+0.003
Neg logit contributions
 with-0.002
 from-0.002
 by-0.002
 for-0.001
"-0.001
Token Lily
Token ID20037
Feature activation-1.02e-02

Pos logit contributions
 with+0.007
 from+0.007
 St+0.007
ula+0.006
per+0.006
Neg logit contributions
anne-0.016
storm-0.014
lights-0.013
storms-0.013
rated-0.013
Token said
Token ID531
Feature activation-5.65e-03

Pos logit contributions
anne+0.021
lights+0.018
storm+0.017
storms+0.016
 marsh+0.016
Neg logit contributions
 with-0.013
per-0.013
 from-0.013
ula-0.013
 by-0.012
Token.
Token ID13
Feature activation+7.35e-03

Pos logit contributions
anne+0.012
lights+0.010
 Susan+0.010
storms+0.009
storm+0.009
Neg logit contributions
per-0.006
ula-0.006
ster-0.006
atching-0.006
 St-0.006
Token "
Token ID366
Feature activation+8.73e-03

Pos logit contributions
per+0.007
ula+0.006
ster+0.006
 from+0.006
 solve+0.006
Neg logit contributions
anne-0.018
lights-0.015
storm-0.014
storms-0.014
 Susan-0.014
TokenLet
Token ID5756
Feature activation-7.76e-03

Pos logit contributions
ula+0.012
per+0.012
ster+0.011
 from+0.011
atching+0.011
Neg logit contributions
anne-0.017
storm-0.014
lights-0.013
storms-0.013
 Susan-0.012
Token's
Token ID338
Feature activation+1.28e-04

Pos logit contributions
anne+0.024
lights+0.022
storm+0.021
storms+0.020
 marsh+0.020
Neg logit contributions
'-0.002
 from-0.002
per-0.002
 with-0.002
ula-0.002
Token show
Token ID905
Feature activation+2.42e-03

Pos logit contributions
per+0.000
 with+0.000
ula+0.000
 from+0.000
 solve+0.000
Neg logit contributions
anne-0.000
lights-0.000
storm-0.000
storms-0.000
 Susan-0.000
Token it
Token ID340
Feature activation-6.62e-03

Pos logit contributions
ula+0.002
per+0.002
 from+0.002
 with+0.002
 St+0.002
Neg logit contributions
anne-0.006
storm-0.005
lights-0.005
storms-0.005
 marsh-0.005
Token to
Token ID284
Feature activation+5.03e-03

Pos logit contributions
anne+0.018
lights+0.015
storm+0.015
storms+0.014
 Susan+0.014
Neg logit contributions
per-0.005
 from-0.005
ula-0.005
 with-0.005
 by-0.004
Token Mom
Token ID11254
Feature activation-7.55e-03

Pos logit contributions
per+0.005
ula+0.005
 from+0.005
 with+0.005
 solve+0.005
Neg logit contributions
anne-0.012
lights-0.010
storm-0.010
storms-0.010
 marsh-0.009
Token all
Token ID477
Feature activation-4.80e-03

Pos logit contributions
ula+0.003
per+0.003
ster+0.003
atching+0.003
 St+0.003
Neg logit contributions
anne-0.007
lights-0.006
storm-0.005
 marsh-0.005
storms-0.005
Token his
Token ID465
Feature activation+1.13e-04

Pos logit contributions
anne+0.012
storm+0.011
lights+0.011
storms+0.010
 Susan+0.010
Neg logit contributions
 from-0.004
 with-0.003
ula-0.003
 by-0.003
per-0.003
Token veggies
Token ID39425
Feature activation+1.39e-02

Pos logit contributions
 with+0.000
 by+0.000
per+0.000
ster+0.000
 from+0.000
Neg logit contributions
anne-0.000
storm-0.000
lights-0.000
storms-0.000
 Susan-0.000
Token and
Token ID290
Feature activation-2.46e-03

Pos logit contributions
ula+0.013
per+0.012
 St+0.011
atching+0.011
ster+0.010
Neg logit contributions
anne-0.033
lights-0.029
storm-0.028
 marsh-0.027
storms-0.027
Token never
Token ID1239
Feature activation-1.23e-02

Pos logit contributions
anne+0.006
lights+0.005
storm+0.005
storms+0.005
 Susan+0.004
Neg logit contributions
 with-0.003
 from-0.003
 St-0.003
 by-0.003
ster-0.002
Token had
Token ID550
Feature activation-1.25e-02

Pos logit contributions
anne+0.024
lights+0.021
storm+0.018
 marsh+0.017
 Susan+0.017
Neg logit contributions
 from-0.018
 with-0.018
 by-0.017
per-0.017
 St-0.017
Token a
Token ID257
Feature activation-1.01e-03

Pos logit contributions
anne+0.037
lights+0.034
storm+0.032
jo+0.031
storms+0.029
Neg logit contributions
 with-0.009
 from-0.008
 by-0.008
 this-0.008
 St-0.007
Token conflict
Token ID5358
Feature activation-1.55e-03

Pos logit contributions
anne+0.003
lights+0.002
storms+0.002
storm+0.002
 Susan+0.002
Neg logit contributions
 with-0.001
'-0.001
 solve-0.001
 by-0.001
ster-0.001
Token with
Token ID351
Feature activation+4.31e-03

Pos logit contributions
anne+0.004
lights+0.004
storm+0.003
storms+0.003
 Susan+0.003
Neg logit contributions
per-0.001
ula-0.001
ster-0.001
atching-0.001
 St-0.001
Token his
Token ID465
Feature activation-4.24e-03

Pos logit contributions
 St+0.003
 with+0.003
ula+0.003
per+0.003
 from+0.003
Neg logit contributions
anne-0.012
storm-0.010
lights-0.010
storms-0.009
rated-0.009
Token mom
Token ID1995
Feature activation-7.40e-03

Pos logit contributions
anne+0.011
lights+0.009
storm+0.009
storms+0.008
 Susan+0.008
Neg logit contributions
 with-0.004
 by-0.004
 from-0.004
 solve-0.004
ster-0.004
Token continued
Token ID3767
Feature activation-2.85e-03

Feature not active on prev token;
no predictions were affected.
Token exploring
Token ID13504
Feature activation+1.78e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
storms+0.005
 Susan+0.005
Neg logit contributions
per-0.003
ula-0.003
ster-0.003
 from-0.003
 St-0.002
Token the
Token ID262
Feature activation+5.32e-03

Pos logit contributions
per+0.001
 with+0.001
 from+0.001
ula+0.001
 by+0.001
Neg logit contributions
anne-0.005
storm-0.004
lights-0.004
storms-0.004
 Susan-0.004
Token market
Token ID1910
Feature activation+3.14e-03

Pos logit contributions
ula+0.009
per+0.008
 St+0.008
ster+0.008
atching+0.008
Neg logit contributions
anne-0.009
lights-0.008
storm-0.007
storms-0.007
 Susan-0.007
Token.
Token ID13
Feature activation-1.06e-03

Pos logit contributions
ula+0.003
per+0.003
atching+0.003
ster+0.003
 St+0.003
Neg logit contributions
anne-0.007
lights-0.006
 Susan-0.006
storm-0.006
storms-0.006
Token She
Token ID1375
Feature activation-6.14e-03

Pos logit contributions
anne+0.003
lights+0.003
storm+0.003
 imag+0.002
rated+0.002
Neg logit contributions
 with-0.001
 St-0.001
 from-0.001
 My-0.001
 by-0.001
Token guessed
Token ID25183
Feature activation-4.89e-03

Pos logit contributions
anne+0.013
lights+0.012
storm+0.011
storms+0.011
 marsh+0.010
Neg logit contributions
 with-0.008
per-0.008
 from-0.007
'-0.007
ster-0.007
Token that
Token ID326
Feature activation-5.91e-03

Pos logit contributions
anne+0.012
lights+0.011
storm+0.010
storms+0.010
 Susan+0.010
Neg logit contributions
per-0.004
ula-0.004
ster-0.004
 St-0.003
atching-0.003
Token she
Token ID673
Feature activation-2.86e-04

Pos logit contributions
anne+0.015
lights+0.013
storm+0.013
storms+0.012
 marsh+0.012
Neg logit contributions
 from-0.005
 with-0.005
per-0.005
ula-0.005
 by-0.005
Token would
Token ID561
Feature activation-5.65e-04

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
 marsh+0.001
storms+0.001
Neg logit contributions
 with-0.000
per-0.000
ster-0.000
'-0.000
 from-0.000
Token have
Token ID423
Feature activation-1.09e-02

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
storms+0.001
 Susan+0.001
Neg logit contributions
per-0.001
ula-0.001
 from-0.001
ster-0.001
 with-0.001
Token Mom
Token ID11254
Feature activation-6.10e-03

Pos logit contributions
per+0.001
ula+0.001
ster+0.001
atching+0.001
 from+0.001
Neg logit contributions
anne-0.002
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
Token."
Token ID526
Feature activation-1.70e-03

Pos logit contributions
anne+0.017
lights+0.015
storm+0.014
 marsh+0.013
 Susan+0.013
Neg logit contributions
 with-0.005
 by-0.004
 from-0.004
per-0.004
ster-0.004
Token
Token ID198
Feature activation+2.46e-03

Pos logit contributions
anne+0.004
lights+0.003
storm+0.003
storms+0.003
 marsh+0.003
Neg logit contributions
ula-0.002
per-0.002
 with-0.002
 from-0.002
 St-0.002
Token
Token ID198
Feature activation+8.23e-05

Pos logit contributions
ula+0.003
per+0.003
 from+0.002
 with+0.002
ster+0.002
Neg logit contributions
anne-0.006
lights-0.005
storm-0.005
storms-0.004
 Susan-0.004
TokenMom
Token ID29252
Feature activation+6.18e-04

Pos logit contributions
ula+0.000
per+0.000
 from+0.000
ster+0.000
atching+0.000
Neg logit contributions
anne-0.000
lights-0.000
 Susan-0.000
storm-0.000
 marsh-0.000
Token hugged
Token ID41130
Feature activation-7.85e-03

Pos logit contributions
per+0.001
ula+0.001
ster+0.001
 from+0.001
 with+0.001
Neg logit contributions
anne-0.001
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
Token him
Token ID683
Feature activation+3.82e-03

Pos logit contributions
anne+0.016
lights+0.013
storm+0.012
storms+0.012
 Susan+0.012
Neg logit contributions
per-0.010
ula-0.010
ster-0.010
atching-0.010
 from-0.009
Token and
Token ID290
Feature activation+4.01e-03

Pos logit contributions
ula+0.004
per+0.004
atching+0.004
ster+0.004
 solve+0.004
Neg logit contributions
anne-0.008
lights-0.007
 Susan-0.006
storms-0.006
 marsh-0.006
Token said
Token ID531
Feature activation-1.21e-02

Pos logit contributions
 from+0.005
 with+0.005
 St+0.005
per+0.005
 by+0.005
Neg logit contributions
anne-0.009
storm-0.007
lights-0.007
storms-0.007
rated-0.006
Token,
Token ID11
Feature activation-3.62e-03

Pos logit contributions
anne+0.028
lights+0.024
 marsh+0.023
storm+0.023
storms+0.022
Neg logit contributions
per-0.012
ula-0.011
 figure-0.011
 solve-0.010
ster-0.010
Token "
Token ID366
Feature activation-2.21e-03

Pos logit contributions
anne+0.007
storm+0.006
lights+0.006
 marsh+0.006
storms+0.006
Neg logit contributions
per-0.005
ula-0.004
or-0.004
 solve-0.004
gg-0.004
Token sticks
Token ID16461
Feature activation-1.38e-03

Pos logit contributions
anne+0.022
lights+0.019
storm+0.019
storms+0.018
 Susan+0.018
Neg logit contributions
per-0.007
ula-0.007
ster-0.006
 from-0.006
 St-0.006
Token poking
Token ID39048
Feature activation-2.71e-03

Pos logit contributions
anne+0.004
lights+0.003
storm+0.003
storms+0.003
 Susan+0.003
Neg logit contributions
per-0.001
ula-0.001
ster-0.001
 from-0.001
 St-0.001
Token out
Token ID503
Feature activation-5.51e-03

Pos logit contributions
anne+0.007
storm+0.006
lights+0.006
storms+0.005
 Susan+0.005
Neg logit contributions
 from-0.003
 with-0.002
 by-0.002
per-0.002
ula-0.002
Token and
Token ID290
Feature activation-5.60e-04

Pos logit contributions
anne+0.015
lights+0.013
storm+0.013
storms+0.013
 Susan+0.012
Neg logit contributions
per-0.003
ula-0.003
 from-0.003
 with-0.003
ster-0.003
Token leaves
Token ID5667
Feature activation-9.75e-03

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
jo+0.001
storms+0.001
Neg logit contributions
 with-0.001
 from-0.001
 by-0.001
per-0.000
 St-0.000
Token all
Token ID477
Feature activation-2.02e-02

Pos logit contributions
anne+0.024
lights+0.021
storm+0.020
 Susan+0.019
storms+0.019
Neg logit contributions
per-0.008
 from-0.008
ula-0.008
 with-0.008
 by-0.007
Token over
Token ID625
Feature activation-1.92e-02

Pos logit contributions
anne+0.050
storm+0.042
acket+0.042
lights+0.040
icol+0.040
Neg logit contributions
 by-0.023
 with-0.021
 from-0.021
 this-0.016
 those-0.016
Token the
Token ID262
Feature activation-2.46e-03

Pos logit contributions
anne+0.055
storm+0.048
lights+0.045
storms+0.045
rated+0.043
Neg logit contributions
 with-0.014
 from-0.013
 by-0.012
per-0.012
 figure-0.009
Token place
Token ID1295
Feature activation+2.10e-03

Pos logit contributions
anne+0.005
lights+0.005
storm+0.004
storms+0.004
 Susan+0.004
Neg logit contributions
per-0.003
 with-0.003
 from-0.003
 by-0.003
ster-0.003
Token.
Token ID13
Feature activation-1.98e-03

Pos logit contributions
 with+0.001
 from+0.001
 by+0.001
per+0.001
ula+0.001
Neg logit contributions
anne-0.006
storm-0.005
lights-0.005
storms-0.005
 marsh-0.005
Token Some
Token ID2773
Feature activation+5.23e-03

Pos logit contributions
anne+0.005
lights+0.004
storm+0.004
 marsh+0.004
rated+0.004
Neg logit contributions
per-0.002
 St-0.002
 with-0.002
 by-0.002
ula-0.002
Token said
Token ID531
Feature activation-1.30e-02

Pos logit contributions
anne+0.052
lights+0.048
storm+0.046
 marsh+0.044
 hurricane+0.043
Neg logit contributions
 from-0.033
 by-0.033
 with-0.032
per-0.032
'-0.031
Token.
Token ID13
Feature activation-6.16e-03

Pos logit contributions
anne+0.031
lights+0.026
storm+0.025
storms+0.024
 Susan+0.024
Neg logit contributions
ula-0.013
per-0.013
ster-0.011
 St-0.011
atching-0.011
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
anne+0.016
lights+0.014
storm+0.013
storms+0.013
 marsh+0.013
Neg logit contributions
per-0.005
ula-0.005
 from-0.005
 with-0.005
 St-0.004
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
anne+0.029
storm+0.025
lights+0.025
storms+0.024
 marsh+0.023
Neg logit contributions
ula-0.011
per-0.011
 with-0.010
 from-0.010
 St-0.010
Token"
Token ID1
Feature activation-2.99e-03

Pos logit contributions
anne+0.028
lights+0.023
storm+0.022
 Susan+0.021
storms+0.021
Neg logit contributions
ula-0.018
per-0.018
ster-0.017
 from-0.016
 St-0.016
TokenMaybe
Token ID13300
Feature activation-2.36e-02

Pos logit contributions
anne+0.006
lights+0.004
storm+0.004
storms+0.004
 Susan+0.004
Neg logit contributions
per-0.005
ula-0.004
ster-0.004
 from-0.004
 St-0.004
Token it
Token ID340
Feature activation-3.12e-04

Pos logit contributions
anne+0.061
storm+0.051
lights+0.050
storms+0.049
 marsh+0.045
Neg logit contributions
 from-0.022
 with-0.022
 by-0.020
per-0.020
ster-0.019
Token is
Token ID318
Feature activation-2.99e-03

Pos logit contributions
anne+0.001
lights+0.001
storm+0.001
 Susan+0.001
 hurricane+0.001
Neg logit contributions
 with-0.000
 from-0.000
 by-0.000
per-0.000
ster-0.000
Token bored
Token ID17533
Feature activation+4.03e-03

Pos logit contributions
anne+0.009
lights+0.007
storm+0.007
jo+0.007
storms+0.007
Neg logit contributions
 from-0.003
 with-0.003
 by-0.002
 for-0.002
 right-0.002
Token,"
Token ID553
Feature activation-5.43e-03

Pos logit contributions
 with+0.004
 from+0.004
 by+0.003
 for+0.003
 of+0.002
Neg logit contributions
anne-0.011
storm-0.009
acket-0.009
lights-0.009
icol-0.009
Token Dad
Token ID17415
Feature activation-1.73e-02

Pos logit contributions
anne+0.013
storm+0.012
lights+0.011
storms+0.011
rated+0.010
Neg logit contributions
 St-0.005
 with-0.005
 from-0.005
ula-0.005
per-0.005
Token the
Token ID262
Feature activation-3.35e-03

Pos logit contributions
 St+0.005
 from+0.005
per+0.005
 with+0.005
ula+0.005
Neg logit contributions
anne-0.018
storm-0.016
storms-0.015
lights-0.014
rated-0.014
Token ice
Token ID4771
Feature activation-1.28e-03

Pos logit contributions
anne+0.007
lights+0.006
storm+0.006
storms+0.006
 Susan+0.005
Neg logit contributions
ula-0.004
per-0.004
 from-0.004
ster-0.004
 with-0.004
Token cream
Token ID8566
Feature activation+9.57e-03

Pos logit contributions
anne+0.002
lights+0.002
storm+0.001
 marsh+0.001
storms+0.001
Neg logit contributions
per-0.002
ula-0.002
 St-0.002
 grant-0.002
 solve-0.002
Token.
Token ID13
Feature activation-2.91e-03

Pos logit contributions
ula+0.012
per+0.011
 solve+0.011
 St+0.011
 grant+0.010
Neg logit contributions
anne-0.018
lights-0.017
storm-0.016
 marsh-0.015
storms-0.014
Token They
Token ID1119
Feature activation-8.90e-04

Pos logit contributions
anne+0.006
lights+0.005
storm+0.005
storms+0.004
 Susan+0.004
Neg logit contributions
ula-0.004
per-0.004
 St-0.004
 with-0.004
 from-0.004
Token think
Token ID892
Feature activation-2.10e-02

Pos logit contributions
anne+0.002
lights+0.002
storm+0.002
storms+0.002
 Susan+0.002
Neg logit contributions
per-0.001
 from-0.001
ula-0.001
 with-0.001
 solve-0.001
Token the
Token ID262
Feature activation+1.16e-03

Pos logit contributions
anne+0.050
lights+0.042
storm+0.042
storms+0.040
 Susan+0.039
Neg logit contributions
ula-0.021
per-0.020
 from-0.019
 with-0.019
 St-0.018
Token white
Token ID2330
Feature activation-4.47e-03

Pos logit contributions
 with+0.001
ula+0.001
per+0.001
 from+0.001
 St+0.001
Neg logit contributions
anne-0.003
lights-0.002
storm-0.002
storms-0.002
 Susan-0.002
Token ice
Token ID4771
Feature activation-5.07e-03

Pos logit contributions
anne+0.010
lights+0.008
storm+0.008
storms+0.008
 Susan+0.008
Neg logit contributions
per-0.005
ula-0.005
 with-0.005
ster-0.005
 from-0.005
Token cream
Token ID8566
Feature activation+8.29e-03

Pos logit contributions
anne+0.011
lights+0.009
storm+0.009
storms+0.009
 Susan+0.008
Neg logit contributions
ula-0.006
per-0.006
 with-0.006
ster-0.006
 from-0.006
Token and
Token ID290
Feature activation-2.30e-03

Pos logit contributions
per+0.008
ula+0.008
 from+0.008
ster+0.008
 with+0.008
Neg logit contributions
anne-0.019
lights-0.016
storm-0.016
storms-0.015
 Susan-0.015
Token be
Token ID307
Feature activation+4.19e-03

Pos logit contributions
ula+0.011
per+0.011
ster+0.010
atching+0.010
 St+0.010
Neg logit contributions
anne-0.023
lights-0.019
storm-0.019
 Susan-0.018
 marsh-0.018
Token patient
Token ID5827
Feature activation+1.54e-02

Pos logit contributions
 with+0.004
per+0.004
 by+0.004
 from+0.004
ula+0.003
Neg logit contributions
anne-0.011
storm-0.010
lights-0.009
storms-0.009
 marsh-0.008
Token.
Token ID13
Feature activation-5.54e-03

Pos logit contributions
per+0.011
ula+0.011
 from+0.010
 with+0.010
ster+0.010
Neg logit contributions
anne-0.040
lights-0.035
storm-0.034
storms-0.033
 Susan-0.033
Token
Token ID198
Feature activation+5.46e-04

Pos logit contributions
anne+0.012
lights+0.011
storm+0.011
storms+0.010
 marsh+0.010
Neg logit contributions
ula-0.006
 St-0.006
per-0.006
 with-0.006
 from-0.006
Token
Token ID198
Feature activation-6.04e-03

Pos logit contributions
per+0.001
ula+0.001
ster+0.001
 from+0.001
 St+0.001
Neg logit contributions
anne-0.001
lights-0.001
storm-0.001
storms-0.001
 Susan-0.001
TokenH
Token ID39
Feature activation-2.08e-02

Pos logit contributions
anne+0.012
storm+0.009
lights+0.009
 marsh+0.009
 Susan+0.008
Neg logit contributions
ster-0.009
per-0.009
ula-0.008
atching-0.008
 from-0.008
Tokenaley
Token ID16730
Feature activation+7.89e-03

Pos logit contributions
anne+0.040
lights+0.031
storm+0.029
storms+0.028
 Susan+0.028
Neg logit contributions
per-0.030
ula-0.030
ster-0.029
 from-0.029
 St-0.029
Token didn
Token ID1422
Feature activation-7.29e-03

Pos logit contributions
per+0.009
 with+0.009
 from+0.009
'+0.008
 by+0.008
Neg logit contributions
anne-0.017
lights-0.016
storm-0.015
storms-0.014
 marsh-0.014
Token't
Token ID470
Feature activation+6.77e-03

Pos logit contributions
anne+0.016
lights+0.014
storm+0.014
 Susan+0.013
 marsh+0.013
Neg logit contributions
ster-0.008
per-0.008
 cents-0.007
atching-0.007
gg-0.007
Token know
Token ID760
Feature activation-6.75e-03

Pos logit contributions
per+0.008
ula+0.008
 from+0.007
 with+0.007
 figure+0.007
Neg logit contributions
anne-0.015
lights-0.013
storm-0.012
storms-0.012
 marsh-0.012
Token what
Token ID644
Feature activation+4.60e-03

Pos logit contributions
anne+0.016
lights+0.014
 Susan+0.013
storm+0.013
storms+0.013
Neg logit contributions
per-0.006
ula-0.006
ster-0.006
atching-0.005
 St-0.005
Token you
Token ID345
Feature activation-5.65e-03

Pos logit contributions
anne+0.020
lights+0.017
 Susan+0.017
storm+0.016
storms+0.016
Neg logit contributions
per-0.006
ula-0.006
ster-0.005
atching-0.005
 from-0.005
Token want
Token ID765
Feature activation-6.83e-03

Pos logit contributions
anne+0.014
lights+0.012
storm+0.011
storms+0.011
 Susan+0.011
Neg logit contributions
per-0.005
ula-0.005
 with-0.005
 from-0.005
 solve-0.005
Token some
Token ID617
Feature activation-3.09e-03

Pos logit contributions
anne+0.019
lights+0.017
storm+0.016
storms+0.016
 Susan+0.015
Neg logit contributions
 with-0.005
 from-0.005
ula-0.004
 by-0.004
 St-0.004
Token juice
Token ID13135
Feature activation+6.33e-03

Pos logit contributions
anne+0.007
lights+0.005
storm+0.005
storms+0.005
 Susan+0.005
Neg logit contributions
ula-0.004
 from-0.004
 with-0.004
per-0.004
 by-0.004
Token?"
Token ID1701
Feature activation-6.39e-03

Pos logit contributions
per+0.008
ula+0.008
ster+0.008
 from+0.008
 St+0.008
Neg logit contributions
anne-0.013
lights-0.011
storm-0.010
storms-0.010
 Susan-0.010
Token Mom
Token ID11254
Feature activation-2.31e-02

Pos logit contributions
anne+0.013
storm+0.011
lights+0.011
storms+0.010
 marsh+0.010
Neg logit contributions
 with-0.009
ula-0.009
 from-0.009
 St-0.009
 by-0.008
Token asked
Token ID1965
Feature activation-3.56e-03

Pos logit contributions
anne+0.037
lights+0.030
storm+0.028
storms+0.026
 marsh+0.026
Neg logit contributions
per-0.040
ula-0.040
 from-0.038
ster-0.038
 with-0.038
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
anne+0.008
lights+0.006
 Susan+0.006
 John+0.006
storms+0.006
Neg logit contributions
per-0.004
ster-0.004
ula-0.004
atching-0.004
gg-0.004
Token
Token ID198
Feature activation-1.15e-02

Pos logit contributions
anne+0.027
lights+0.023
storm+0.023
storms+0.021
rated+0.021
Neg logit contributions
ula-0.010
 with-0.009
 St-0.009
 from-0.009
per-0.008
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
anne+0.028
lights+0.024
storm+0.024
storms+0.023
 imag+0.022
Neg logit contributions
ula-0.011
per-0.010
 with-0.010
 from-0.010
 St-0.010
Token"
Token ID1
Feature activation+2.57e-03

Pos logit contributions
anne+0.036
lights+0.030
 Susan+0.029
storm+0.029
 marsh+0.029
Neg logit contributions
per-0.010
ster-0.010
ula-0.010
 from-0.009
gg-0.008