TOP ACTIVATIONS
MAX = 0.054

not yours ." Me gan tried to resist her
was deaf . Me gan saw John 's hand
now !" " Me too !" Lily said .
ducks couldn 't believe how yellow and soft the stack was
her back . Me gan 's mum got angry
very dangerous ." Me gan stopped and looked around
tell anyone . Me gan took the spoon and
says . " Me too ," Leo says .
bread . " Me too , me too !"
the bench !" Me gan smiled and said ,
man was called a " v endor ". Tim my wanted
both laughed until their tum m ies hurt .
," said Tim . " Me too !" said Sam .
. She said , " Me gan , you shouldn 't
help ?" asked Sus ie . "
They ate until their tum m ies felt heavy . Then
said . " Me too . Can we feed
fight . " Me first !" Jill says .
thing ?" asked a curious 3 year old girl , as
the apples and their tum m ies felt full and happy

MIN ACTIVATIONS
MIN = -0.056

wanted to learn more about veterin arians . She decided to
animals or use sticks as weapons . They said they loved
was able to finish his tour happily . <|endoftext|> Once there
everyone , she tucked her diary into her bed and made
toy . You are our treasure . We value you more
brother . And no more swords , OK ?"
fun . It ruined our castles . They were so pretty
sticks were no longer powerful weapons , but perfect stems and
them away from their natural habitats . We should be friendly
can help you achieve your dreams . <|endoftext|> Once upon a
. He puts on his mask . He thanks Sam .
can be more powerful than weapons . Kind ness can change
wrong . She sees their masks and their paints and their
can help us conquer our fears and make us feel better
little frog put on his sunglasses and hopped off to explore
was sad to leave her tent and puzzle behind . But
asleep dreaming about flying his spaceship and washing away all the
to hear Gr anny 's stories . <|endoftext|> Once upon a
alien friends and flew his spaceship back to Earth . He
played near the shark 's territory again . They remembered Nem

INTERVAL 0.027 - 0.054
CONTAINS 0.278%

her . The wolf asked her where she was going .
First , Sue mixed together a lot of glue
first . They count to three and put their balls in
Can we go on the slide , Mom ?" Jill asks
But Wh isk ers got all tangled up in the string

INTERVAL -0.001 - 0.027
CONTAINS 52.801%

but Lily didn 't want to . Her
to each other . They would try to guess what it
at the truck , which had stopped and was backing up
sound of a buzzing fly woke Peter up . He took
was glad that Max let her play with it . After

INTERVAL -0.028 - -0.001
CONTAINS 46.776%

. He couldn 't believe his eyes when he
One day , Annie was out walking by the shore ,
back later ." " Okay ," Leo said .
. She went to bed without delay and closed her eyes
So , he designed and built a special machine

INTERVAL -0.056 - -0.028
CONTAINS 0.145%

regular girl who had regular dreams . And she knew how
, who gave her a tour . Lily had so much
to take her long black cape and cover everything she saw
the bear always kept his honey safe in his arm chair
come ! I like your castle too ! You are a
Token not
Token ID407
Feature activation+2.24e-03

Pos logit contributions
ull+0.034
 shall+0.029
 finally+0.027
 Jane+0.027
'll+0.027
Neg logit contributions
assing-0.055
obe-0.054
ging-0.054
agging-0.051
ilda-0.051
Token yours
Token ID12431
Feature activation-3.82e-03

Pos logit contributions
ull+0.004
 Jane+0.003
 shall+0.003
'll+0.003
 finally+0.003
Neg logit contributions
assing-0.004
ging-0.004
obe-0.004
ica-0.003
agging-0.003
Token."
Token ID526
Feature activation+9.85e-03

Pos logit contributions
assing+0.008
obe+0.008
ging+0.007
 adoption+0.007
agging+0.007
Neg logit contributions
ull-0.005
 shall-0.004
'll-0.004
 Jane-0.004
 finally-0.004
Token
Token ID198
Feature activation+1.07e-02

Pos logit contributions
ull+0.014
 shall+0.009
'll+0.009
 snowball+0.009
 finally+0.008
Neg logit contributions
obe-0.019
assing-0.019
 adoption-0.018
annah-0.018
 ME-0.018
Token
Token ID198
Feature activation+7.99e-03

Pos logit contributions
ull+0.011
 shall+0.008
'll+0.008
 stirred+0.007
isel+0.007
Neg logit contributions
obe-0.023
assing-0.023
annah-0.022
 adoption-0.022
agging-0.022
TokenMe
Token ID5308
Feature activation+5.42e-02

Pos logit contributions
ull+0.013
 shall+0.011
'll+0.010
 finally+0.010
 marsh+0.010
Neg logit contributions
assing-0.013
obe-0.013
 adoption-0.012
ging-0.012
agging-0.012
Tokengan
Token ID1030
Feature activation+1.16e-02

Pos logit contributions
ull+0.087
 shall+0.067
 marsh+0.066
'll+0.063
 volcan+0.063
Neg logit contributions
obe-0.087
ging-0.086
eking-0.086
assing-0.085
agging-0.084
Token tried
Token ID3088
Feature activation+8.14e-03

Pos logit contributions
ull+0.013
 shall+0.010
 marsh+0.010
 Jane+0.009
 snowball+0.009
Neg logit contributions
assing-0.024
obe-0.024
ging-0.023
ilda-0.023
 adoption-0.023
Token to
Token ID284
Feature activation-1.86e-03

Pos logit contributions
ull+0.010
 shall+0.008
 Jane+0.007
 finally+0.007
'll+0.007
Neg logit contributions
assing-0.017
obe-0.016
ging-0.016
 adoption-0.016
ilda-0.016
Token resist
Token ID4180
Feature activation+2.11e-04

Pos logit contributions
assing+0.003
ging+0.003
obe+0.003
agging+0.003
ilda+0.003
Neg logit contributions
ull-0.003
 shall-0.003
 finally-0.002
 Jane-0.002
'll-0.002
Token her
Token ID607
Feature activation+2.50e-02

Pos logit contributions
 Jane+0.000
ull+0.000
 Mom+0.000
 finally+0.000
 +0.000
Neg logit contributions
assing-0.000
iva-0.000
obe-0.000
ilda-0.000
pper-0.000
Token was
Token ID373
Feature activation+1.20e-02

Pos logit contributions
ull+0.012
 shall+0.009
ores+0.009
 marsh+0.009
 volcan+0.009
Neg logit contributions
assing-0.022
ilda-0.021
 hurting-0.021
obe-0.020
ging-0.020
Token deaf
Token ID21815
Feature activation+2.42e-03

Pos logit contributions
ull+0.018
 shall+0.015
 Jane+0.014
 finally+0.014
'll+0.014
Neg logit contributions
assing-0.022
obe-0.021
ging-0.020
 adoption-0.020
ilda-0.020
Token.
Token ID13
Feature activation+1.90e-02

Pos logit contributions
ull+0.003
 shall+0.003
 Jane+0.003
 finally+0.003
'll+0.003
Neg logit contributions
assing-0.005
obe-0.004
ging-0.004
 adoption-0.004
ilda-0.004
Token
Token ID198
Feature activation+1.44e-02

Pos logit contributions
ull+0.028
 shall+0.019
isel+0.018
'll+0.018
 stirred+0.018
Neg logit contributions
assing-0.034
obe-0.034
annah-0.034
 adoption-0.033
hy-0.033
Token
Token ID198
Feature activation+1.50e-02

Pos logit contributions
ull+0.020
 stirred+0.016
isel+0.015
 cooled+0.015
 shall+0.014
Neg logit contributions
annah-0.025
ilda-0.025
assing-0.024
 yourself-0.024
iva-0.024
TokenMe
Token ID5308
Feature activation+5.22e-02

Pos logit contributions
ull+0.025
 shall+0.020
'll+0.019
 stirred+0.018
 marsh+0.018
Neg logit contributions
assing-0.024
obe-0.023
ging-0.023
 adoption-0.023
ilda-0.022
Tokengan
Token ID1030
Feature activation+1.79e-02

Pos logit contributions
ull+0.078
 shall+0.062
 marsh+0.058
 wonder+0.057
 +0.057
Neg logit contributions
assing-0.093
obe-0.091
ging-0.091
eking-0.089
agging-0.087
Token saw
Token ID2497
Feature activation+3.38e-03

Pos logit contributions
ull+0.021
 shall+0.017
 marsh+0.016
 Jane+0.015
ores+0.015
Neg logit contributions
assing-0.037
obe-0.036
ging-0.036
ilda-0.034
eking-0.034
Token John
Token ID1757
Feature activation+7.59e-04

Pos logit contributions
ull+0.006
 shall+0.005
'll+0.005
 finally+0.004
 stirred+0.004
Neg logit contributions
assing-0.005
ging-0.005
obe-0.005
ilda-0.005
 adoption-0.005
Token's
Token ID338
Feature activation+4.80e-03

Pos logit contributions
ull+0.001
 finally+0.001
 +0.001
 shall+0.001
 Jane+0.001
Neg logit contributions
assing-0.002
obe-0.002
ilda-0.001
iva-0.001
 adoption-0.001
Token hand
Token ID1021
Feature activation-4.58e-03

Pos logit contributions
ull+0.007
 shall+0.006
 Mom+0.006
 Jane+0.005
 finally+0.005
Neg logit contributions
assing-0.009
obe-0.009
ging-0.008
 adoption-0.008
ilda-0.008
Token now
Token ID783
Feature activation+2.00e-03

Pos logit contributions
ull+0.013
 shall+0.011
 Jane+0.010
 finally+0.010
'll+0.010
Neg logit contributions
assing-0.016
obe-0.016
ging-0.015
 adoption-0.015
ilda-0.015
Token!"
Token ID2474
Feature activation+5.44e-03

Pos logit contributions
ull+0.002
 shall+0.002
 finally+0.002
 Jane+0.002
'll+0.001
Neg logit contributions
assing-0.005
obe-0.004
ging-0.004
 adoption-0.004
ilda-0.004
Token
Token ID198
Feature activation+8.51e-03

Pos logit contributions
ull+0.007
 shall+0.005
 volcan+0.005
 volcano+0.005
 stirred+0.005
Neg logit contributions
annah-0.010
obe-0.010
ilda-0.010
 ME-0.009
 adoption-0.009
Token
Token ID198
Feature activation+7.65e-03

Pos logit contributions
ull+0.012
 stirred+0.010
 volcan+0.010
 lava+0.010
 volcano+0.010
Neg logit contributions
obe-0.014
annah-0.013
ilda-0.013
 yourself-0.013
 adoption-0.013
Token"
Token ID1
Feature activation+1.04e-03

Pos logit contributions
ull+0.013
 shall+0.011
 marsh+0.010
 Jane+0.010
'll+0.010
Neg logit contributions
assing-0.013
obe-0.012
ging-0.012
 adoption-0.012
ilda-0.011
TokenMe
Token ID5308
Feature activation+5.16e-02

Pos logit contributions
ull+0.002
 shall+0.002
'll+0.002
 Jane+0.002
 marsh+0.001
Neg logit contributions
assing-0.002
ging-0.002
obe-0.001
 adoption-0.001
 hurting-0.001
Token too
Token ID1165
Feature activation+1.13e-02

Pos logit contributions
ull+0.055
 shall+0.043
 Jane+0.039
 finally+0.038
'll+0.037
Neg logit contributions
assing-0.117
obe-0.113
ging-0.111
 adoption-0.108
ilda-0.107
Token!"
Token ID2474
Feature activation+1.59e-03

Pos logit contributions
ull+0.016
 shall+0.013
 Jane+0.012
 finally+0.012
'll+0.012
Neg logit contributions
assing-0.022
obe-0.021
ging-0.020
 adoption-0.020
ilda-0.019
Token Lily
Token ID20037
Feature activation+2.68e-05

Pos logit contributions
ull+0.003
 shall+0.002
'll+0.002
 volcano+0.002
 volcan+0.002
Neg logit contributions
obe-0.002
annah-0.002
ilda-0.002
hy-0.002
 adoption-0.002
Token said
Token ID531
Feature activation-1.33e-02

Pos logit contributions
ull+0.000
 shall+0.000
'll+0.000
 marsh+0.000
 Jane+0.000
Neg logit contributions
assing-0.000
obe-0.000
ging-0.000
ilda-0.000
 adoption-0.000
Token.
Token ID13
Feature activation-3.21e-03

Pos logit contributions
assing+0.028
obe+0.026
 adoption+0.025
ging+0.025
iva+0.024
Neg logit contributions
ull-0.017
 Jane-0.014
 Mom-0.014
 shall-0.014
 finally-0.014
Token ducks
Token ID39694
Feature activation+1.75e-02

Pos logit contributions
ull+0.004
 shall+0.004
 Jane+0.004
'll+0.003
 +0.003
Neg logit contributions
assing-0.006
ging-0.005
 adoption-0.005
obe-0.005
agging-0.005
Token couldn
Token ID3521
Feature activation+1.22e-02

Pos logit contributions
 marsh+0.020
ull+0.020
 shall+0.019
ores+0.019
 snowball+0.018
Neg logit contributions
ging-0.031
assing-0.030
obe-0.030
eking-0.030
ting-0.030
Token't
Token ID470
Feature activation-5.77e-03

Pos logit contributions
ull+0.018
 shall+0.015
 Jane+0.014
'll+0.014
 stirred+0.014
Neg logit contributions
assing-0.022
obe-0.021
ging-0.020
 adoption-0.020
ilda-0.019
Token believe
Token ID1975
Feature activation+1.78e-02

Pos logit contributions
assing+0.010
obe+0.009
ging+0.009
 adoption+0.009
 hurting+0.009
Neg logit contributions
ull-0.009
 shall-0.008
 Jane-0.007
'll-0.007
 -0.007
Token how
Token ID703
Feature activation+5.69e-03

Pos logit contributions
ull+0.020
 shall+0.016
'll+0.015
 finally+0.014
 stirred+0.013
Neg logit contributions
assing-0.039
ging-0.037
obe-0.037
eking-0.036
 adoption-0.036
Token yellow
Token ID7872
Feature activation+5.13e-02

Pos logit contributions
ull+0.008
 shall+0.007
 finally+0.006
'll+0.006
 Jane+0.006
Neg logit contributions
assing-0.010
ging-0.009
obe-0.009
 adoption-0.009
 hurting-0.009
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
ull+0.052
 shall+0.042
'll+0.038
 finally+0.036
 wonder+0.035
Neg logit contributions
ging-0.109
assing-0.108
ting-0.104
obe-0.104
 hurting-0.103
Token soft
Token ID2705
Feature activation+3.00e-02

Pos logit contributions
ull+0.026
 shall+0.022
'll+0.019
isel+0.019
 finally+0.019
Neg logit contributions
assing-0.028
 hurting-0.028
obe-0.026
 adoption-0.026
ging-0.026
Token the
Token ID262
Feature activation-1.14e-03

Pos logit contributions
ull+0.037
 shall+0.030
 Jane+0.027
'll+0.027
 finally+0.027
Neg logit contributions
assing-0.062
obe-0.060
ging-0.059
 adoption-0.057
ilda-0.056
Token stack
Token ID8931
Feature activation+2.49e-02

Pos logit contributions
assing+0.002
obe+0.002
ging+0.002
 adoption+0.002
agging+0.002
Neg logit contributions
ull-0.002
 shall-0.001
 Jane-0.001
 finally-0.001
'll-0.001
Token was
Token ID373
Feature activation+1.72e-02

Pos logit contributions
ull+0.023
 shall+0.019
'll+0.016
 Jane+0.015
 +0.015
Neg logit contributions
assing-0.055
ging-0.054
obe-0.054
 adoption-0.052
 hurting-0.052
Token her
Token ID607
Feature activation+1.36e-02

Pos logit contributions
assing+0.013
obe+0.012
ging+0.012
ilda+0.011
agging+0.011
Neg logit contributions
ull-0.010
 Jane-0.009
 shall-0.008
 finally-0.008
 -0.008
Token back
Token ID736
Feature activation+4.13e-03

Pos logit contributions
 Mom+0.013
ull+0.012
 Mum+0.012
 +0.011
 shall+0.011
Neg logit contributions
assing-0.031
obe-0.031
ging-0.029
 adoption-0.029
agging-0.028
Token.
Token ID13
Feature activation+4.59e-04

Pos logit contributions
ull+0.006
 shall+0.005
 Jane+0.005
 finally+0.005
'll+0.005
Neg logit contributions
assing-0.007
obe-0.007
ging-0.007
 adoption-0.007
ilda-0.007
Token
Token ID198
Feature activation+9.96e-04

Pos logit contributions
ull+0.001
 snowball+0.001
 shall+0.000
 stirred+0.000
isel+0.000
Neg logit contributions
annah-0.001
 ME-0.001
 adoption-0.001
obe-0.001
assing-0.001
Token
Token ID198
Feature activation+4.33e-03

Pos logit contributions
ull+0.001
 shall+0.001
 stirred+0.001
'll+0.001
 snowball+0.001
Neg logit contributions
assing-0.002
annah-0.002
ilda-0.002
obe-0.002
iva-0.002
TokenMe
Token ID5308
Feature activation+5.10e-02

Pos logit contributions
ull+0.007
 shall+0.006
'll+0.006
 snowball+0.006
 stirred+0.006
Neg logit contributions
assing-0.007
obe-0.007
 adoption-0.006
ging-0.006
agging-0.006
Tokengan
Token ID1030
Feature activation+1.24e-02

Pos logit contributions
ull+0.079
 shall+0.062
 marsh+0.060
'll+0.058
 +0.057
Neg logit contributions
obe-0.085
ging-0.084
eking-0.084
assing-0.084
agging-0.082
Token's
Token ID338
Feature activation+1.60e-02

Pos logit contributions
ull+0.014
 shall+0.011
 Jane+0.009
 marsh+0.009
'll+0.009
Neg logit contributions
assing-0.027
obe-0.026
ging-0.026
 adoption-0.025
ilda-0.025
Token mum
Token ID25682
Feature activation-6.77e-03

Pos logit contributions
ull+0.019
 shall+0.016
 finally+0.015
 Mom+0.015
 Jane+0.015
Neg logit contributions
assing-0.034
obe-0.033
ging-0.032
ilda-0.031
 adoption-0.031
Token got
Token ID1392
Feature activation-3.61e-03

Pos logit contributions
assing+0.016
obe+0.016
ging+0.016
 adoption+0.015
ilda+0.015
Neg logit contributions
ull-0.006
 shall-0.005
 Jane-0.004
 finally-0.004
'll-0.004
Token angry
Token ID7954
Feature activation-7.32e-03

Pos logit contributions
assing+0.007
obe+0.007
ging+0.007
 adoption+0.006
ilda+0.006
Neg logit contributions
ull-0.005
 shall-0.004
 Jane-0.004
 finally-0.004
'll-0.004
Token very
Token ID845
Feature activation+8.88e-03

Pos logit contributions
ull+0.010
 shall+0.008
 Jane+0.008
 finally+0.008
'll+0.008
Neg logit contributions
assing-0.012
obe-0.012
ging-0.012
 adoption-0.011
ilda-0.011
Token dangerous
Token ID4923
Feature activation-1.21e-02

Pos logit contributions
ull+0.014
 shall+0.012
 Jane+0.011
 finally+0.011
'll+0.011
Neg logit contributions
assing-0.015
obe-0.015
ging-0.014
 adoption-0.014
ilda-0.014
Token."
Token ID526
Feature activation+2.72e-02

Pos logit contributions
assing+0.024
obe+0.023
ging+0.023
 adoption+0.022
ilda+0.022
Neg logit contributions
ull-0.016
 shall-0.014
 Jane-0.013
'll-0.012
 finally-0.012
Token
Token ID198
Feature activation+1.56e-02

Pos logit contributions
ull+0.036
 shall+0.023
isel+0.023
'll+0.022
 stirred+0.021
Neg logit contributions
obe-0.052
annah-0.052
assing-0.051
 adoption-0.050
 ME-0.050
Token
Token ID198
Feature activation+1.64e-02

Pos logit contributions
ull+0.018
 shall+0.013
isel+0.012
'll+0.012
 stirred+0.012
Neg logit contributions
annah-0.030
obe-0.030
 adoption-0.030
assing-0.030
 yourself-0.029
TokenMe
Token ID5308
Feature activation+5.04e-02

Pos logit contributions
ull+0.027
 shall+0.021
'll+0.020
 finally+0.020
 volcano+0.019
Neg logit contributions
assing-0.026
obe-0.026
 adoption-0.025
ging-0.025
 yourself-0.024
Tokengan
Token ID1030
Feature activation+2.07e-02

Pos logit contributions
ull+0.081
 shall+0.063
 marsh+0.059
'll+0.058
 volcan+0.057
Neg logit contributions
assing-0.080
obe-0.080
eking-0.080
ging-0.080
agging-0.078
Token stopped
Token ID5025
Feature activation-8.43e-04

Pos logit contributions
ull+0.024
 shall+0.019
 marsh+0.018
ores+0.016
 volcano+0.016
Neg logit contributions
assing-0.042
obe-0.042
ging-0.041
eking-0.040
ilda-0.040
Token and
Token ID290
Feature activation+4.96e-03

Pos logit contributions
assing+0.002
obe+0.002
 adoption+0.001
ilda+0.001
iva+0.001
Neg logit contributions
ull-0.001
 shall-0.001
 Jane-0.001
 finally-0.001
 Mom-0.001
Token looked
Token ID3114
Feature activation+1.02e-02

Pos logit contributions
ull+0.006
 shall+0.004
'll+0.004
 marsh+0.004
 snowball+0.003
Neg logit contributions
assing-0.011
obe-0.011
ging-0.010
 adoption-0.010
ilda-0.010
Token around
Token ID1088
Feature activation+5.08e-03

Pos logit contributions
ull+0.014
 shall+0.012
 Jane+0.012
 finally+0.011
'll+0.011
Neg logit contributions
assing-0.019
obe-0.019
ging-0.018
 adoption-0.018
ilda-0.017
Token tell
Token ID1560
Feature activation-1.15e-03

Pos logit contributions
ull+0.020
 finally+0.019
 Mom+0.018
 shall+0.018
 Jane+0.018
Neg logit contributions
assing-0.030
ging-0.029
ilda-0.028
agging-0.028
iva-0.028
Token anyone
Token ID2687
Feature activation+6.53e-03

Pos logit contributions
assing+0.002
obe+0.002
ging+0.002
 adoption+0.002
agging+0.002
Neg logit contributions
ull-0.002
 shall-0.001
 Jane-0.001
 finally-0.001
'll-0.001
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
ull+0.009
 shall+0.007
 Jane+0.007
'll+0.007
 +0.007
Neg logit contributions
assing-0.013
obe-0.013
ging-0.012
 adoption-0.012
agging-0.012
Token
Token ID198
Feature activation+5.91e-03

Pos logit contributions
ull+0.021
 shall+0.014
 stirred+0.013
 snowball+0.013
 marsh+0.013
Neg logit contributions
assing-0.029
obe-0.029
annah-0.029
 ME-0.029
 adoption-0.028
Token
Token ID198
Feature activation+1.16e-02

Pos logit contributions
ull+0.006
 shall+0.004
 stirred+0.004
'll+0.004
 marsh+0.004
Neg logit contributions
assing-0.013
obe-0.013
ilda-0.013
iva-0.012
annah-0.012
TokenMe
Token ID5308
Feature activation+5.01e-02

Pos logit contributions
ull+0.019
 shall+0.016
'll+0.015
 Jane+0.014
 finally+0.014
Neg logit contributions
assing-0.020
obe-0.019
ging-0.018
 adoption-0.018
agging-0.018
Tokengan
Token ID1030
Feature activation+2.32e-02

Pos logit contributions
ull+0.073
 shall+0.058
 marsh+0.055
'll+0.053
 wonder+0.051
Neg logit contributions
assing-0.092
obe-0.091
ging-0.090
eking-0.088
agging-0.087
Token took
Token ID1718
Feature activation-6.24e-04

Pos logit contributions
ull+0.026
 shall+0.020
 marsh+0.018
 Jane+0.017
'll+0.017
Neg logit contributions
assing-0.050
obe-0.049
ging-0.048
 adoption-0.047
eking-0.047
Token the
Token ID262
Feature activation-8.50e-05

Pos logit contributions
assing+0.001
ging+0.001
obe+0.001
 adoption+0.001
ilda+0.001
Neg logit contributions
ull-0.001
 shall-0.001
'll-0.001
 stirred-0.001
 Jane-0.001
Token spoon
Token ID24556
Feature activation+2.93e-02

Pos logit contributions
assing+0.000
obe+0.000
ging+0.000
 adoption+0.000
ilda+0.000
Neg logit contributions
ull-0.000
 shall-0.000
 Jane-0.000
 finally-0.000
'll-0.000
Token and
Token ID290
Feature activation+9.89e-03

Pos logit contributions
ull+0.044
 shall+0.037
 Jane+0.035
 finally+0.034
'll+0.033
Neg logit contributions
assing-0.054
obe-0.052
ging-0.050
 adoption-0.049
agging-0.048
Token says
Token ID1139
Feature activation-1.21e-02

Pos logit contributions
assing+0.014
obe+0.014
ging+0.014
 adoption+0.013
ilda+0.013
Neg logit contributions
ull-0.013
 shall-0.011
 Jane-0.010
 finally-0.010
'll-0.010
Token.
Token ID13
Feature activation+2.58e-03

Pos logit contributions
assing+0.026
obe+0.024
 adoption+0.023
ging+0.023
 property+0.022
Neg logit contributions
ull-0.015
 Mom-0.012
 shall-0.012
 Jane-0.012
 -0.012
Token
Token ID198
Feature activation+9.28e-03

Pos logit contributions
ull+0.004
 shall+0.003
 cooled+0.003
'll+0.003
isel+0.003
Neg logit contributions
 ME-0.004
obe-0.004
assing-0.004
 herself-0.004
 lying-0.004
Token
Token ID198
Feature activation+1.09e-02

Pos logit contributions
ull+0.012
 stirred+0.010
 cooled+0.010
 shall+0.010
 measured+0.010
Neg logit contributions
obe-0.016
ilda-0.016
assing-0.016
agging-0.015
 yourself-0.015
Token"
Token ID1
Feature activation-2.36e-03

Pos logit contributions
ull+0.018
 shall+0.016
'll+0.015
 finally+0.015
 cooled+0.015
Neg logit contributions
assing-0.016
obe-0.016
ging-0.016
 yourself-0.015
 adoption-0.015
TokenMe
Token ID5308
Feature activation+4.99e-02

Pos logit contributions
assing+0.004
obe+0.003
ging+0.003
 adoption+0.003
iva+0.003
Neg logit contributions
ull-0.004
 shall-0.003
 Jane-0.003
'll-0.003
 Joe-0.003
Token too
Token ID1165
Feature activation+3.95e-03

Pos logit contributions
ull+0.055
 shall+0.042
 finally+0.037
 Jane+0.037
'll+0.036
Neg logit contributions
assing-0.112
obe-0.109
ging-0.106
 adoption-0.103
agging-0.103
Token,"
Token ID553
Feature activation+4.74e-05

Pos logit contributions
ull+0.005
 shall+0.004
 Jane+0.004
 finally+0.004
'll+0.004
Neg logit contributions
assing-0.008
obe-0.008
ging-0.007
 adoption-0.007
ilda-0.007
Token Leo
Token ID19632
Feature activation-1.27e-02

Pos logit contributions
ull+0.000
'll+0.000
 shall+0.000
ores+0.000
isel+0.000
Neg logit contributions
obe-0.000
 adoption-0.000
assing-0.000
ica-0.000
 hurting-0.000
Token says
Token ID1139
Feature activation-1.52e-02

Pos logit contributions
assing+0.022
obe+0.021
ging+0.021
 adoption+0.020
ilda+0.020
Neg logit contributions
ull-0.020
 shall-0.017
 Jane-0.016
 finally-0.016
'll-0.016
Token.
Token ID13
Feature activation+3.32e-03

Pos logit contributions
assing+0.029
obe+0.027
ging+0.027
 adoption+0.026
ilda+0.026
Neg logit contributions
ull-0.022
 shall-0.018
 Jane-0.018
 finally-0.017
 Mom-0.017
Token bread
Token ID8509
Feature activation+9.33e-03

Pos logit contributions
ull+0.020
 shall+0.016
 finally+0.015
 Jane+0.015
'll+0.015
Neg logit contributions
assing-0.030
ging-0.029
obe-0.029
 adoption-0.029
agging-0.028
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
ull+0.015
 shall+0.014
 finally+0.013
 Jane+0.013
'll+0.013
Neg logit contributions
assing-0.014
ging-0.014
obe-0.014
pper-0.013
 adoption-0.013
Token
Token ID198
Feature activation+1.24e-02

Pos logit contributions
ull+0.018
 shall+0.014
 wonder+0.014
 reef+0.013
 stirred+0.013
Neg logit contributions
 ME-0.013
obe-0.013
oa-0.012
ting-0.012
assing-0.012
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
ull+0.019
 stirred+0.018
 marsh+0.018
 lava+0.018
 cooled+0.017
Neg logit contributions
assing-0.014
obe-0.014
 yourself-0.014
ilda-0.014
ging-0.014
Token"
Token ID1
Feature activation+2.63e-03

Pos logit contributions
ull+0.021
 shall+0.018
 marsh+0.017
 Jane+0.017
'll+0.017
Neg logit contributions
assing-0.016
ging-0.016
obe-0.015
 adoption-0.013
pper-0.013
TokenMe
Token ID5308
Feature activation+4.99e-02

Pos logit contributions
ull+0.005
 Jane+0.004
 shall+0.004
 volcano+0.004
ores+0.004
Neg logit contributions
assing-0.004
ging-0.004
obe-0.003
ting-0.003
hy-0.003
Token too
Token ID1165
Feature activation+1.45e-02

Pos logit contributions
ull+0.061
 shall+0.047
 marsh+0.044
 Jane+0.044
 finally+0.044
Neg logit contributions
assing-0.104
obe-0.101
ging-0.098
 hurting-0.095
eking-0.095
Token,
Token ID11
Feature activation+9.53e-03

Pos logit contributions
ull+0.025
 shall+0.022
 finally+0.020
'll+0.020
 stirred+0.020
Neg logit contributions
assing-0.022
obe-0.021
ging-0.020
hy-0.020
 hurting-0.020
Token me
Token ID502
Feature activation+6.86e-03

Pos logit contributions
ull+0.013
 shall+0.010
'll+0.010
 stirred+0.010
 finally+0.009
Neg logit contributions
assing-0.018
obe-0.017
ging-0.017
pper-0.017
 hurting-0.016
Token too
Token ID1165
Feature activation+1.30e-02

Pos logit contributions
ull+0.010
 shall+0.009
 Jane+0.008
 finally+0.008
 stirred+0.008
Neg logit contributions
assing-0.012
obe-0.012
ging-0.012
 adoption-0.011
agging-0.011
Token!"
Token ID2474
Feature activation+6.82e-03

Pos logit contributions
ull+0.023
 shall+0.021
'll+0.019
 finally+0.019
 stirred+0.019
Neg logit contributions
assing-0.019
ging-0.017
obe-0.017
 hurting-0.016
pper-0.016
Token the
Token ID262
Feature activation-5.41e-03

Pos logit contributions
ull+0.012
 shall+0.010
 Jane+0.010
 finally+0.009
 Mom+0.009
Neg logit contributions
assing-0.020
obe-0.020
ging-0.020
agging-0.019
ilda-0.019
Token bench
Token ID7624
Feature activation+1.08e-02

Pos logit contributions
obe+0.013
ging+0.012
assing+0.011
iva+0.011
beh+0.011
Neg logit contributions
 it-0.005
 marsh-0.005
 Mom-0.005
 snowball-0.005
 finally-0.005
Token!"
Token ID2474
Feature activation+1.03e-02

Pos logit contributions
ull+0.015
 shall+0.013
 Jane+0.012
 finally+0.012
'll+0.012
Neg logit contributions
assing-0.020
obe-0.019
ging-0.019
 adoption-0.018
ilda-0.018
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
ull+0.011
 shall+0.008
'll+0.007
isel+0.007
 finally+0.007
Neg logit contributions
obe-0.021
annah-0.020
 adoption-0.020
assing-0.020
 ME-0.020
Token
Token ID198
Feature activation+9.09e-03

Pos logit contributions
ull+0.015
 shall+0.012
isel+0.012
'll+0.012
 measured+0.011
Neg logit contributions
agging-0.020
 adoption-0.020
obe-0.020
assing-0.020
annah-0.019
TokenMe
Token ID5308
Feature activation+4.98e-02

Pos logit contributions
ull+0.015
isel+0.012
 shall+0.012
'll+0.012
 Joe+0.012
Neg logit contributions
 adoption-0.013
assing-0.013
obe-0.013
agging-0.012
 yourself-0.012
Tokengan
Token ID1030
Feature activation+1.16e-02

Pos logit contributions
ull+0.076
 shall+0.059
 marsh+0.057
 volcan+0.056
'll+0.056
Neg logit contributions
obe-0.081
agging-0.081
ging-0.080
eking-0.080
assing-0.080
Token smiled
Token ID13541
Feature activation-2.41e-03

Pos logit contributions
ull+0.014
 marsh+0.010
 shall+0.010
ores+0.010
 snowball+0.010
Neg logit contributions
assing-0.023
obe-0.023
ging-0.022
pper-0.022
hy-0.022
Token and
Token ID290
Feature activation-5.05e-03

Pos logit contributions
assing+0.005
obe+0.005
ging+0.005
 adoption+0.005
ilda+0.005
Neg logit contributions
ull-0.003
 shall-0.002
 finally-0.002
 Jane-0.002
'll-0.002
Token said
Token ID531
Feature activation-1.38e-02

Pos logit contributions
assing+0.011
obe+0.010
ging+0.010
 adoption+0.010
ilda+0.010
Neg logit contributions
ull-0.006
 shall-0.005
 Jane-0.004
'll-0.004
 finally-0.004
Token,
Token ID11
Feature activation+1.86e-03

Pos logit contributions
assing+0.027
obe+0.027
ging+0.026
 adoption+0.025
ilda+0.025
Neg logit contributions
ull-0.018
 shall-0.014
 Jane-0.013
 finally-0.013
'll-0.013
Token man
Token ID582
Feature activation+1.70e-02

Pos logit contributions
assing+0.006
obe+0.006
ging+0.005
 adoption+0.005
ilda+0.005
Neg logit contributions
ull-0.006
 shall-0.005
 finally-0.005
 Jane-0.004
'll-0.004
Token was
Token ID373
Feature activation+8.60e-03

Pos logit contributions
ull+0.021
 shall+0.016
 marsh+0.015
ores+0.015
'll+0.014
Neg logit contributions
assing-0.034
obe-0.033
ging-0.032
 hurting-0.032
hy-0.032
Token called
Token ID1444
Feature activation-7.73e-03

Pos logit contributions
ull+0.014
 shall+0.012
'll+0.012
 Jane+0.011
 marsh+0.011
Neg logit contributions
assing-0.014
obe-0.014
ging-0.013
 adoption-0.013
 hurting-0.013
Token a
Token ID257
Feature activation+1.38e-02

Pos logit contributions
obe+0.017
assing+0.017
ging+0.016
eking+0.016
agging+0.016
Neg logit contributions
ull-0.009
 Jane-0.008
 Mom-0.008
 -0.007
 shall-0.007
Token "
Token ID366
Feature activation+1.24e-02

Pos logit contributions
ull+0.022
 shall+0.018
 Jane+0.017
 finally+0.017
 stirred+0.016
Neg logit contributions
assing-0.024
ging-0.022
obe-0.022
 adoption-0.022
hy-0.021
Tokenv
Token ID85
Feature activation+4.97e-02

Pos logit contributions
ull+0.020
 shall+0.017
 Jane+0.016
 finally+0.016
'll+0.015
Neg logit contributions
assing-0.021
obe-0.020
ging-0.020
 adoption-0.019
ilda-0.019
Tokenendor
Token ID18738
Feature activation+1.08e-02

Pos logit contributions
ull+0.068
 shall+0.064
 wonder+0.061
 marsh+0.061
 finally+0.060
Neg logit contributions
obe-0.083
assing-0.081
ging-0.080
agging-0.076
 adoption-0.075
Token".
Token ID1911
Feature activation+6.01e-03

Pos logit contributions
ull+0.015
 shall+0.013
 Jane+0.012
'll+0.012
 finally+0.011
Neg logit contributions
assing-0.021
obe-0.020
ging-0.020
agging-0.019
eking-0.019
Token Tim
Token ID5045
Feature activation+8.47e-03

Pos logit contributions
ull+0.006
 shall+0.005
'll+0.004
 stirred+0.004
 marsh+0.004
Neg logit contributions
assing-0.014
obe-0.013
ilda-0.013
 adoption-0.013
ging-0.013
Tokenmy
Token ID1820
Feature activation+8.76e-03

Pos logit contributions
ull+0.013
 shall+0.011
 Jane+0.010
 finally+0.010
'll+0.010
Neg logit contributions
assing-0.015
obe-0.014
ging-0.014
ilda-0.014
 adoption-0.013
Token wanted
Token ID2227
Feature activation+5.86e-03

Pos logit contributions
ull+0.010
 shall+0.008
 marsh+0.008
ores+0.008
 Jane+0.007
Neg logit contributions
assing-0.018
obe-0.018
ilda-0.017
ging-0.017
 hurting-0.017
Token both
Token ID1111
Feature activation+1.74e-02

Pos logit contributions
ull+0.017
 shall+0.013
 snowball+0.012
ores+0.012
'll+0.012
Neg logit contributions
assing-0.023
pper-0.021
ging-0.021
obe-0.021
ilda-0.021
Token laughed
Token ID13818
Feature activation+9.76e-03

Pos logit contributions
ull+0.023
 shall+0.018
'll+0.016
 snowball+0.016
ores+0.016
Neg logit contributions
assing-0.032
hy-0.031
ging-0.031
obe-0.031
 hurting-0.030
Token until
Token ID1566
Feature activation+1.06e-02

Pos logit contributions
ull+0.012
 shall+0.010
 Jane+0.009
 finally+0.009
 Mom+0.008
Neg logit contributions
assing-0.021
obe-0.020
ging-0.019
 adoption-0.019
ilda-0.019
Token their
Token ID511
Feature activation+1.95e-02

Pos logit contributions
ull+0.013
 shall+0.011
'll+0.010
 snowball+0.008
 stirred+0.008
Neg logit contributions
assing-0.021
ging-0.021
 adoption-0.020
obe-0.020
 hurting-0.020
Token tum
Token ID11814
Feature activation+6.97e-06

Pos logit contributions
ull+0.027
 shall+0.022
 Jane+0.020
'll+0.020
 +0.019
Neg logit contributions
assing-0.038
obe-0.037
ging-0.035
 adoption-0.035
agging-0.035
Tokenm
Token ID76
Feature activation+4.97e-02

Feature not active on prev token;
no predictions were affected.
Tokenies
Token ID444
Feature activation-9.55e-03

Pos logit contributions
ull+0.089
 shall+0.079
 Jane+0.074
 finally+0.074
'll+0.074
Neg logit contributions
assing-0.073
obe-0.071
ging-0.070
agging-0.065
 adoption-0.065
Token hurt
Token ID5938
Feature activation+1.23e-02

Pos logit contributions
assing+0.023
obe+0.022
ging+0.022
 adoption+0.021
ilda+0.021
Neg logit contributions
ull-0.009
 shall-0.006
 Jane-0.006
'll-0.005
 finally-0.005
Token.
Token ID13
Feature activation+6.19e-03

Pos logit contributions
ull+0.017
 shall+0.014
 Jane+0.013
 finally+0.013
 Mom+0.012
Neg logit contributions
assing-0.025
obe-0.024
ilda-0.023
ging-0.023
 adoption-0.022
Token
Token ID220
Feature activation+1.23e-02

Pos logit contributions
ull+0.007
 stirred+0.005
 shall+0.005
 marsh+0.004
 volcano+0.004
Neg logit contributions
assing-0.014
obe-0.013
ilda-0.013
 adoption-0.013
hy-0.013
Token
Token ID198
Feature activation-3.08e-04

Pos logit contributions
ull+0.011
 shall+0.008
 stirred+0.008
 wonder+0.007
 finally+0.007
Neg logit contributions
assing-0.028
obe-0.028
ilda-0.027
ging-0.026
 adoption-0.026
Token,"
Token ID553
Feature activation+1.38e-02

Pos logit contributions
assing+0.009
obe+0.008
ging+0.008
 adoption+0.008
ilda+0.008
Neg logit contributions
ull-0.007
 shall-0.006
 Jane-0.005
 finally-0.005
'll-0.005
Token said
Token ID531
Feature activation+2.28e-02

Pos logit contributions
ull+0.017
 shall+0.013
'll+0.013
 marsh+0.010
 finally+0.010
Neg logit contributions
obe-0.028
assing-0.028
ging-0.027
 hurting-0.026
 adoption-0.026
Token Tim
Token ID5045
Feature activation-6.77e-03

Pos logit contributions
ull+0.029
 shall+0.028
'll+0.028
 wonder+0.024
 stirred+0.022
Neg logit contributions
ging-0.040
obe-0.039
assing-0.038
hy-0.037
ting-0.037
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
assing+0.014
 adoption+0.013
obe+0.013
ging+0.013
agging+0.013
Neg logit contributions
ull-0.008
 -0.007
 Jane-0.007
 finally-0.007
 shall-0.006
Token "
Token ID366
Feature activation+8.21e-03

Pos logit contributions
ull+0.019
 shall+0.016
 wonder+0.015
'll+0.015
 marsh+0.015
Neg logit contributions
obe-0.017
assing-0.017
 ME-0.016
 hurting-0.015
ting-0.015
TokenMe
Token ID5308
Feature activation+4.96e-02

Pos logit contributions
ull+0.013
 Jane+0.012
 shall+0.011
 marsh+0.011
 Joe+0.011
Neg logit contributions
assing-0.014
ging-0.013
obe-0.013
iva-0.011
 yourself-0.011
Token too
Token ID1165
Feature activation+5.76e-03

Pos logit contributions
ull+0.067
 shall+0.053
 marsh+0.053
 Jane+0.052
 finally+0.051
Neg logit contributions
assing-0.097
obe-0.094
 hurting-0.090
ging-0.089
eking-0.087
Token!"
Token ID2474
Feature activation+7.02e-03

Pos logit contributions
ull+0.010
 shall+0.008
 Jane+0.007
 finally+0.007
'll+0.007
Neg logit contributions
assing-0.010
obe-0.009
ging-0.009
 hurting-0.009
agging-0.008
Token said
Token ID531
Feature activation+1.23e-02

Pos logit contributions
ull+0.011
 shall+0.010
'll+0.010
 marsh+0.009
 volcano+0.009
Neg logit contributions
obe-0.012
assing-0.010
ging-0.010
 ME-0.010
 hurting-0.010
Token Sam
Token ID3409
Feature activation-1.35e-02

Pos logit contributions
ull+0.015
 shall+0.014
'll+0.014
 wonder+0.012
 stirred+0.011
Neg logit contributions
obe-0.023
ging-0.023
assing-0.022
 hurting-0.022
ting-0.021
Token.
Token ID13
Feature activation+6.74e-03

Pos logit contributions
assing+0.027
obe+0.026
ging+0.025
 adoption+0.025
agging+0.025
Neg logit contributions
ull-0.017
 Jane-0.014
 shall-0.014
 finally-0.014
 -0.014
Token.
Token ID13
Feature activation+7.38e-03

Pos logit contributions
ull+0.017
 shall+0.014
 Jane+0.013
 finally+0.013
 stirred+0.013
Neg logit contributions
assing-0.027
obe-0.026
 adoption-0.024
iva-0.024
ilda-0.024
Token She
Token ID1375
Feature activation+3.71e-03

Pos logit contributions
ull+0.014
 shall+0.011
 snowball+0.011
'll+0.011
 finally+0.010
Neg logit contributions
assing-0.010
obe-0.010
 adoption-0.010
annah-0.009
 ME-0.009
Token said
Token ID531
Feature activation+6.56e-03

Pos logit contributions
ull+0.005
 shall+0.004
 marsh+0.004
'll+0.003
 Jane+0.003
Neg logit contributions
assing-0.007
obe-0.007
 adoption-0.007
ging-0.007
ilda-0.007
Token,
Token ID11
Feature activation+4.21e-03

Pos logit contributions
ull+0.009
 shall+0.007
 Jane+0.006
 finally+0.006
'll+0.006
Neg logit contributions
assing-0.013
obe-0.013
ging-0.012
 adoption-0.012
ilda-0.012
Token "
Token ID366
Feature activation+1.39e-02

Pos logit contributions
ull+0.004
 shall+0.003
 Jane+0.003
 finally+0.003
 +0.003
Neg logit contributions
assing-0.010
obe-0.010
ging-0.009
 adoption-0.009
agging-0.009
TokenMe
Token ID5308
Feature activation+4.95e-02

Pos logit contributions
ull+0.023
 shall+0.019
 finally+0.018
 Jane+0.018
 marsh+0.018
Neg logit contributions
assing-0.023
obe-0.022
ging-0.021
 adoption-0.021
eking-0.020
Tokengan
Token ID1030
Feature activation+9.55e-03

Pos logit contributions
ull+0.058
 shall+0.046
 Jane+0.042
 finally+0.041
'll+0.041
Neg logit contributions
assing-0.107
obe-0.104
ging-0.102
 adoption-0.099
ilda-0.098
Token,
Token ID11
Feature activation+2.84e-03

Pos logit contributions
ull+0.011
 shall+0.009
 Jane+0.009
'll+0.008
 finally+0.008
Neg logit contributions
assing-0.020
obe-0.020
ging-0.019
 adoption-0.019
agging-0.019
Token you
Token ID345
Feature activation-7.39e-03

Pos logit contributions
ull+0.004
 shall+0.003
 Jane+0.003
 finally+0.003
'll+0.003
Neg logit contributions
assing-0.005
obe-0.005
ging-0.005
 adoption-0.005
ilda-0.005
Token shouldn
Token ID6584
Feature activation-7.76e-03

Pos logit contributions
assing+0.014
 adoption+0.013
obe+0.013
 hurting+0.013
ilda+0.013
Neg logit contributions
ull-0.011
 marsh-0.008
 snowball-0.008
 Jane-0.008
 shall-0.008
Token't
Token ID470
Feature activation-6.65e-03

Pos logit contributions
assing+0.015
obe+0.014
ging+0.014
 adoption+0.013
ilda+0.013
Neg logit contributions
ull-0.011
 shall-0.009
 Jane-0.009
'll-0.008
 stirred-0.008
Token help
Token ID1037
Feature activation+2.00e-02

Pos logit contributions
ull+0.007
 Jane+0.006
 snowball+0.006
 volcan+0.006
 shall+0.005
Neg logit contributions
assing-0.005
obe-0.005
 hurting-0.005
ging-0.005
 adoption-0.005
Token?
Token ID30
Feature activation+1.97e-03

Pos logit contributions
ull+0.035
 shall+0.033
 Mom+0.032
 Jane+0.032
 Mum+0.031
Neg logit contributions
assing-0.031
obe-0.028
hy-0.028
agging-0.027
ging-0.027
Tokenâ
Token ID22940
Feature activation+1.20e-02

Pos logit contributions
ull+0.003
 shall+0.003
 Jane+0.002
'll+0.002
 finally+0.002
Neg logit contributions
assing-0.004
obe-0.003
ging-0.003
agging-0.003
eking-0.003
Token
Token ID26391
Feature activation+1.07e-02

Pos logit contributions
ull+0.019
 +0.018
 Jane+0.018
 shall+0.017
 finally+0.016
Neg logit contributions
assing-0.018
 thieves-0.018
ging-0.018
 adoption-0.018
 appearances-0.017
Token asked
Token ID1965
Feature activation+4.80e-03

Pos logit contributions
ull+0.018
 shall+0.014
 finally+0.013
 Jane+0.013
 stirred+0.013
Neg logit contributions
assing-0.019
obe-0.018
ilda-0.017
ging-0.017
 adoption-0.016
Token Sus
Token ID8932
Feature activation+4.95e-02

Pos logit contributions
ull+0.005
 shall+0.004
 Jane+0.004
 Mom+0.004
 finally+0.004
Neg logit contributions
assing-0.011
obe-0.010
ging-0.010
agging-0.010
 adoption-0.010
Tokenie
Token ID494
Feature activation+1.83e-02

Pos logit contributions
ull+0.078
 shall+0.066
 Jane+0.063
'll+0.061
 finally+0.060
Neg logit contributions
assing-0.088
obe-0.083
ging-0.081
 adoption-0.080
agging-0.077
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
ull+0.022
 shall+0.018
 Jane+0.017
'll+0.016
 finally+0.016
Neg logit contributions
assing-0.038
obe-0.037
ging-0.036
 adoption-0.035
ilda-0.035
Token
Token ID198
Feature activation+1.15e-02

Pos logit contributions
ull+0.014
 shall+0.009
 stirred+0.009
 snowball+0.009
 finally+0.009
Neg logit contributions
assing-0.025
obe-0.024
ging-0.024
ilda-0.024
 adoption-0.023
Token
Token ID198
Feature activation+1.06e-02

Pos logit contributions
ull+0.011
 shall+0.007
 stirred+0.007
 volcano+0.007
 snowball+0.006
Neg logit contributions
assing-0.026
obe-0.026
ging-0.025
ilda-0.025
 adoption-0.025
Tokenâ
Token ID22940
Feature activation+6.37e-03

Pos logit contributions
ull+0.016
 shall+0.013
 Jane+0.012
 finally+0.012
'll+0.012
Neg logit contributions
assing-0.019
obe-0.018
ging-0.018
 adoption-0.017
ilda-0.017
Token They
Token ID1119
Feature activation+9.78e-03

Pos logit contributions
ull+0.000
 shall+0.000
 marsh+0.000
 snowball+0.000
'll+0.000
Neg logit contributions
assing-0.000
obe-0.000
 adoption-0.000
ilda-0.000
 ME-0.000
Token ate
Token ID15063
Feature activation+2.17e-02

Pos logit contributions
ull+0.014
 shall+0.011
 snowball+0.011
 marsh+0.010
'll+0.010
Neg logit contributions
assing-0.017
obe-0.017
ilda-0.016
 adoption-0.016
pper-0.016
Token until
Token ID1566
Feature activation+8.63e-03

Pos logit contributions
ull+0.037
'll+0.030
 shall+0.028
isel+0.028
 Lots+0.027
Neg logit contributions
 hurting-0.032
ging-0.032
obe-0.031
assing-0.031
ting-0.030
Token their
Token ID511
Feature activation+1.96e-02

Pos logit contributions
ull+0.013
 shall+0.011
'll+0.011
 snowball+0.011
 Fisher+0.009
Neg logit contributions
 hurting-0.014
 adoption-0.013
assing-0.013
ging-0.013
 ME-0.013
Token tum
Token ID11814
Feature activation-2.04e-03

Pos logit contributions
ull+0.029
 shall+0.024
 Jane+0.022
'll+0.022
 finally+0.022
Neg logit contributions
assing-0.036
obe-0.035
ging-0.034
 adoption-0.033
agging-0.033
Tokenm
Token ID76
Feature activation+4.89e-02

Pos logit contributions
obe+0.003
assing+0.003
ilda+0.003
ging+0.003
eking+0.003
Neg logit contributions
ull-0.003
 shall-0.003
 wonder-0.003
'll-0.003
 marsh-0.003
Tokenies
Token ID444
Feature activation-8.70e-03

Pos logit contributions
ull+0.092
 shall+0.081
 Jane+0.077
 finally+0.076
'll+0.076
Neg logit contributions
assing-0.069
obe-0.066
ging-0.065
 adoption-0.062
ilda-0.061
Token felt
Token ID2936
Feature activation+4.55e-03

Pos logit contributions
assing+0.019
obe+0.019
ging+0.018
 adoption+0.018
ilda+0.018
Neg logit contributions
ull-0.009
 shall-0.007
 Jane-0.007
'll-0.006
 finally-0.006
Token heavy
Token ID4334
Feature activation-1.03e-02

Pos logit contributions
ull+0.007
 shall+0.006
 Jane+0.006
'll+0.006
 Joe+0.006
Neg logit contributions
 hurting-0.007
ging-0.007
assing-0.007
obe-0.007
agging-0.006
Token.
Token ID13
Feature activation+6.69e-04

Pos logit contributions
assing+0.023
obe+0.021
 adoption+0.021
iva+0.021
ilda+0.021
Neg logit contributions
ull-0.011
 finally-0.009
 shall-0.009
 Jane-0.008
 stirred-0.008
Token Then
Token ID3244
Feature activation+1.73e-03

Pos logit contributions
ull+0.001
 marsh+0.001
 snowball+0.001
 shall+0.001
 stirred+0.001
Neg logit contributions
assing-0.001
obe-0.001
ilda-0.001
iva-0.001
 adoption-0.001
Token said
Token ID531
Feature activation-1.21e-02

Pos logit contributions
ull+0.026
 volcano+0.021
 marsh+0.021
 shall+0.020
ores+0.020
Neg logit contributions
obe-0.024
ilda-0.022
pper-0.022
annah-0.021
ica-0.021
Token.
Token ID13
Feature activation-1.42e-03

Pos logit contributions
assing+0.021
ging+0.021
obe+0.020
agging+0.019
ilda+0.019
Neg logit contributions
ull-0.018
 shall-0.015
'll-0.014
 finally-0.014
 Jane-0.014
Token
Token ID198
Feature activation+7.14e-03

Pos logit contributions
 ME+0.002
assing+0.002
obe+0.002
ging+0.002
hy+0.002
Neg logit contributions
ull-0.002
 shall-0.002
isel-0.002
'll-0.002
 stirred-0.001
Token
Token ID198
Feature activation+5.59e-03

Pos logit contributions
ull+0.008
 shall+0.006
 stirred+0.006
'll+0.006
 cooled+0.006
Neg logit contributions
obe-0.014
assing-0.014
ilda-0.014
ging-0.014
agging-0.014
Token"
Token ID1
Feature activation-4.72e-03

Pos logit contributions
ull+0.007
 shall+0.006
 Jane+0.005
 finally+0.005
'll+0.005
Neg logit contributions
assing-0.012
obe-0.011
ging-0.011
 adoption-0.011
ilda-0.011
TokenMe
Token ID5308
Feature activation+4.89e-02

Pos logit contributions
assing+0.008
ging+0.008
obe+0.007
 adoption+0.007
ting+0.007
Neg logit contributions
ull-0.008
 shall-0.006
 Jane-0.006
'll-0.006
 finally-0.006
Token too
Token ID1165
Feature activation+7.80e-03

Pos logit contributions
ull+0.058
 shall+0.045
 finally+0.041
 Jane+0.040
'll+0.039
Neg logit contributions
assing-0.105
obe-0.102
ging-0.099
eking-0.096
 hurting-0.095
Token.
Token ID13
Feature activation+3.05e-03

Pos logit contributions
ull+0.012
 shall+0.010
'll+0.010
 finally+0.009
 Jane+0.009
Neg logit contributions
assing-0.013
obe-0.013
ging-0.013
 adoption-0.012
ilda-0.012
Token Can
Token ID1680
Feature activation+1.60e-02

Pos logit contributions
ull+0.005
 finally+0.004
 shall+0.004
 stirred+0.004
 cooled+0.004
Neg logit contributions
assing-0.005
hy-0.005
 adoption-0.005
ting-0.005
 yourself-0.005
Token we
Token ID356
Feature activation+1.76e-02

Pos logit contributions
ull+0.030
 shall+0.025
 Jane+0.024
 stirred+0.024
'll+0.023
Neg logit contributions
assing-0.023
obe-0.021
ging-0.021
 adoption-0.021
 hurting-0.020
Token feed
Token ID3745
Feature activation+1.67e-02

Pos logit contributions
ull+0.031
 Jane+0.024
 shall+0.023
 snowball+0.023
 stirred+0.023
Neg logit contributions
assing-0.028
pper-0.025
obe-0.025
 adoption-0.024
hy-0.024
Token fight
Token ID1907
Feature activation-7.94e-03

Pos logit contributions
ull+0.023
 shall+0.020
 Jane+0.019
 finally+0.019
'll+0.019
Neg logit contributions
assing-0.027
obe-0.026
ging-0.025
 adoption-0.024
ilda-0.024
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
assing+0.016
obe+0.015
ging+0.015
 adoption+0.014
ilda+0.014
Neg logit contributions
ull-0.011
 shall-0.009
 Jane-0.008
 finally-0.008
 Mom-0.008
Token
Token ID198
Feature activation+4.08e-03

Pos logit contributions
ull+0.019
 marsh+0.016
 snowball+0.015
 volcano+0.014
 cooled+0.013
Neg logit contributions
annah-0.022
 ME-0.022
obe-0.021
assing-0.020
iva-0.020
Token
Token ID198
Feature activation+9.48e-03

Pos logit contributions
ull+0.006
 stirred+0.005
 lava+0.005
 marsh+0.005
 volcano+0.005
Neg logit contributions
iva-0.007
ilda-0.006
assing-0.006
 ME-0.006
annah-0.006
Token"
Token ID1
Feature activation+7.18e-03

Pos logit contributions
ull+0.015
 shall+0.013
 marsh+0.013
 snowball+0.012
'll+0.012
Neg logit contributions
assing-0.015
ging-0.015
obe-0.015
 adoption-0.014
ilda-0.014
TokenMe
Token ID5308
Feature activation+4.87e-02

Pos logit contributions
ull+0.014
 shall+0.011
 Jane+0.011
 volcano+0.011
'll+0.011
Neg logit contributions
assing-0.011
ging-0.010
obe-0.010
 adoption-0.009
ting-0.009
Token first
Token ID717
Feature activation+3.46e-03

Pos logit contributions
ull+0.057
 shall+0.046
 Jane+0.042
 finally+0.041
'll+0.040
Neg logit contributions
assing-0.105
obe-0.101
ging-0.100
 adoption-0.097
ilda-0.096
Token!"
Token ID2474
Feature activation+8.23e-03

Pos logit contributions
ull+0.005
 shall+0.004
 Jane+0.004
'll+0.004
 finally+0.004
Neg logit contributions
assing-0.007
obe-0.007
ging-0.006
 adoption-0.006
ilda-0.006
Token Jill
Token ID22772
Feature activation-8.24e-03

Pos logit contributions
ull+0.009
 shall+0.006
'll+0.006
 snowball+0.006
 marsh+0.006
Neg logit contributions
obe-0.017
assing-0.017
ging-0.017
ilda-0.017
 ME-0.016
Token says
Token ID1139
Feature activation-1.16e-02

Pos logit contributions
assing+0.016
ging+0.016
obe+0.015
 adoption+0.015
agging+0.015
Neg logit contributions
ull-0.010
 shall-0.009
 Jane-0.009
 finally-0.009
 -0.009
Token.
Token ID13
Feature activation+9.75e-04

Pos logit contributions
assing+0.024
obe+0.023
 adoption+0.022
ging+0.021
ilda+0.021
Neg logit contributions
ull-0.014
 Jane-0.012
 Mom-0.012
 shall-0.012
 finally-0.012
Token thing
Token ID1517
Feature activation-3.14e-03

Pos logit contributions
ull+0.006
 shall+0.005
 Jane+0.005
'll+0.004
 finally+0.004
Neg logit contributions
assing-0.008
obe-0.008
ging-0.008
agging-0.007
eking-0.007
Token?"
Token ID1701
Feature activation+1.02e-02

Pos logit contributions
assing+0.006
obe+0.006
ging+0.006
 adoption+0.006
agging+0.005
Neg logit contributions
ull-0.004
 shall-0.004
 Jane-0.003
'll-0.003
 finally-0.003
Token asked
Token ID1965
Feature activation+3.04e-03

Pos logit contributions
ull+0.011
 shall+0.008
'll+0.007
 marsh+0.007
 stirred+0.006
Neg logit contributions
assing-0.022
obe-0.022
ging-0.022
 adoption-0.021
ilda-0.021
Token a
Token ID257
Feature activation+1.52e-02

Pos logit contributions
ull+0.004
 shall+0.003
'll+0.003
 stirred+0.003
 finally+0.003
Neg logit contributions
assing-0.006
obe-0.006
ging-0.006
ilda-0.005
 adoption-0.005
Token curious
Token ID11040
Feature activation+2.66e-02

Pos logit contributions
ull+0.023
 shall+0.020
 finally+0.018
'll+0.018
 Jane+0.018
Neg logit contributions
assing-0.025
ging-0.024
obe-0.024
 adoption-0.023
hy-0.023
Token 3
Token ID513
Feature activation+4.87e-02

Pos logit contributions
ull+0.039
 shall+0.033
 Jane+0.031
 finally+0.030
'll+0.030
Neg logit contributions
assing-0.049
obe-0.047
ging-0.046
 adoption-0.045
ilda-0.044
Token year
Token ID614
Feature activation+1.18e-02

Pos logit contributions
ull+0.063
 shall+0.062
'll+0.060
 marsh+0.056
 finally+0.053
Neg logit contributions
obe-0.085
ging-0.085
assing-0.084
ilda-0.080
 adoption-0.080
Token old
Token ID1468
Feature activation+2.33e-02

Pos logit contributions
ull+0.021
 Jane+0.018
 shall+0.017
 Mom+0.016
 finally+0.016
Neg logit contributions
assing-0.020
obe-0.018
ging-0.018
 adoption-0.017
 hurting-0.017
Token girl
Token ID2576
Feature activation+1.96e-02

Pos logit contributions
ull+0.033
 shall+0.027
 Jane+0.027
 finally+0.025
'll+0.025
Neg logit contributions
assing-0.045
obe-0.043
ging-0.042
 adoption-0.041
agging-0.041
Token,
Token ID11
Feature activation+8.65e-04

Pos logit contributions
ull+0.030
 shall+0.025
 marsh+0.023
'll+0.023
 snowball+0.023
Neg logit contributions
assing-0.034
ging-0.033
obe-0.032
ilda-0.031
 adoption-0.031
Token as
Token ID355
Feature activation+7.59e-03

Pos logit contributions
ull+0.001
 shall+0.001
 Jane+0.001
 finally+0.001
'll+0.001
Neg logit contributions
assing-0.002
obe-0.002
ging-0.002
 adoption-0.002
ilda-0.002
Token the
Token ID262
Feature activation+9.11e-03

Pos logit contributions
ull+0.039
 shall+0.030
'll+0.029
 Lots+0.027
isel+0.027
Neg logit contributions
ging-0.038
 hurting-0.037
assing-0.037
obe-0.036
ting-0.035
Token apples
Token ID22514
Feature activation+1.51e-02

Pos logit contributions
ull+0.014
 shall+0.012
 Jane+0.011
 finally+0.011
'll+0.011
Neg logit contributions
assing-0.016
obe-0.015
ging-0.015
 adoption-0.014
agging-0.014
Token and
Token ID290
Feature activation+1.87e-02

Pos logit contributions
ull+0.022
 shall+0.018
'll+0.017
 Jane+0.016
 finally+0.016
Neg logit contributions
assing-0.027
obe-0.027
ging-0.026
 hurting-0.025
 adoption-0.025
Token their
Token ID511
Feature activation+2.09e-02

Pos logit contributions
ull+0.026
 shall+0.022
'll+0.020
ores+0.018
isel+0.018
Neg logit contributions
assing-0.033
 hurting-0.033
obe-0.033
ging-0.032
 adoption-0.032
Token tum
Token ID11814
Feature activation-1.49e-03

Pos logit contributions
ull+0.034
 shall+0.028
 Jane+0.026
'll+0.025
 finally+0.025
Neg logit contributions
assing-0.036
obe-0.035
ging-0.034
 adoption-0.033
agging-0.032
Tokenm
Token ID76
Feature activation+4.84e-02

Pos logit contributions
obe+0.002
ging+0.002
assing+0.002
ilda+0.002
eking+0.002
Neg logit contributions
ull-0.002
 shall-0.002
 wonder-0.002
 marsh-0.002
'll-0.002
Tokenies
Token ID444
Feature activation-4.70e-03

Pos logit contributions
ull+0.084
 shall+0.072
 Jane+0.069
 finally+0.068
'll+0.067
Neg logit contributions
assing-0.077
obe-0.073
ging-0.071
 adoption-0.069
ilda-0.068
Token felt
Token ID2936
Feature activation+3.12e-03

Pos logit contributions
assing+0.009
obe+0.009
ging+0.008
 adoption+0.008
ilda+0.008
Neg logit contributions
ull-0.007
 shall-0.006
 Jane-0.005
 finally-0.005
'll-0.005
Token full
Token ID1336
Feature activation-2.78e-03

Pos logit contributions
ull+0.005
 shall+0.004
'll+0.004
 Jane+0.004
 Joe+0.004
Neg logit contributions
 hurting-0.005
assing-0.005
ging-0.005
obe-0.005
 adoption-0.004
Token and
Token ID290
Feature activation+1.14e-02

Pos logit contributions
assing+0.005
obe+0.005
ging+0.005
 adoption+0.004
ilda+0.004
Neg logit contributions
ull-0.004
 shall-0.004
 Jane-0.003
 finally-0.003
'll-0.003
Token happy
Token ID3772
Feature activation+6.16e-03

Pos logit contributions
ull+0.016
 shall+0.013
'll+0.012
isel+0.011
 snowball+0.011
Neg logit contributions
 hurting-0.021
ging-0.020
assing-0.019
 adoption-0.019
eking-0.019
Token wanted
Token ID2227
Feature activation+2.04e-03

Pos logit contributions
assing+0.017
obe+0.016
ilda+0.016
 adoption+0.015
 hurting+0.015
Neg logit contributions
ull-0.011
 shall-0.008
 Jane-0.008
 marsh-0.008
 snowball-0.008
Token to
Token ID284
Feature activation-4.38e-03

Pos logit contributions
ull+0.003
 shall+0.002
'll+0.002
 finally+0.002
 Jane+0.002
Neg logit contributions
assing-0.004
obe-0.004
 adoption-0.004
 hurting-0.004
ilda-0.004
Token learn
Token ID2193
Feature activation-1.26e-02

Pos logit contributions
assing+0.008
obe+0.007
ging+0.007
 adoption+0.007
ilda+0.007
Neg logit contributions
ull-0.007
 shall-0.006
 Jane-0.005
'll-0.005
 finally-0.005
Token more
Token ID517
Feature activation-1.16e-03

Pos logit contributions
assing+0.027
ging+0.026
obe+0.026
eking+0.024
agging+0.024
Neg logit contributions
ull-0.015
 shall-0.012
 Jane-0.011
 -0.011
 Mom-0.011
Token about
Token ID546
Feature activation-1.82e-02

Pos logit contributions
assing+0.002
ging+0.002
obe+0.002
ilda+0.002
eking+0.002
Neg logit contributions
ull-0.001
 shall-0.001
 Jane-0.001
'll-0.001
 marsh-0.001
Token veterin
Token ID33647
Feature activation-5.57e-02

Pos logit contributions
assing+0.034
ging+0.034
obe+0.034
agging+0.033
ilda+0.032
Neg logit contributions
ull-0.023
 Jane-0.020
 shall-0.020
 finally-0.020
 Mom-0.020
Tokenarians
Token ID13517
Feature activation-1.65e-02

Pos logit contributions
ging+0.057
 hurting+0.056
eking+0.056
ilda+0.055
assing+0.054
Neg logit contributions
ull-0.089
'll-0.089
 happily-0.085
 soon-0.084
 finally-0.084
Token.
Token ID13
Feature activation-3.48e-03

Pos logit contributions
assing+0.035
obe+0.034
ging+0.034
agging+0.033
ilda+0.032
Neg logit contributions
ull-0.017
 shall-0.016
 finally-0.015
'll-0.015
 Jane-0.014
Token She
Token ID1375
Feature activation-5.82e-03

Pos logit contributions
assing+0.007
ging+0.007
eking+0.007
obe+0.006
agging+0.006
Neg logit contributions
ull-0.004
 shall-0.004
 Mom-0.004
 Jane-0.004
 -0.004
Token decided
Token ID3066
Feature activation+3.17e-03

Pos logit contributions
assing+0.011
obe+0.011
ilda+0.011
 adoption+0.011
iva+0.010
Neg logit contributions
ull-0.008
 shall-0.006
 marsh-0.006
 Jane-0.006
ores-0.006
Token to
Token ID284
Feature activation-8.41e-03

Pos logit contributions
 finally+0.001
ull+0.001
 Jane+0.001
 it+0.001
 soon+0.001
Neg logit contributions
assing-0.009
agging-0.009
ging-0.009
obe-0.009
iva-0.008
Token animals
Token ID4695
Feature activation-2.11e-02

Pos logit contributions
assing+0.011
ilda+0.011
obe+0.010
ging+0.010
eking+0.010
Neg logit contributions
ull-0.013
 shall-0.012
 Jane-0.011
 snowball-0.011
'll-0.011
Token or
Token ID393
Feature activation-1.95e-02

Pos logit contributions
assing+0.042
obe+0.041
ging+0.040
ilda+0.040
 adoption+0.039
Neg logit contributions
ull-0.026
 shall-0.023
 finally-0.021
 Jane-0.020
'll-0.020
Token use
Token ID779
Feature activation-9.57e-04

Pos logit contributions
assing+0.032
obe+0.031
ging+0.030
 adoption+0.029
ilda+0.029
Neg logit contributions
ull-0.032
 shall-0.028
 Jane-0.026
 finally-0.026
'll-0.026
Token sticks
Token ID16461
Feature activation-8.31e-03

Pos logit contributions
assing+0.002
obe+0.002
ging+0.002
 adoption+0.002
 hurting+0.002
Neg logit contributions
ull-0.001
 shall-0.001
 finally-0.001
'll-0.001
 Jane-0.001
Token as
Token ID355
Feature activation+5.10e-03

Pos logit contributions
assing+0.016
obe+0.015
ging+0.015
 adoption+0.014
ilda+0.014
Neg logit contributions
ull-0.012
 shall-0.010
 Jane-0.009
 finally-0.009
'll-0.009
Token weapons
Token ID3777
Feature activation-5.21e-02

Pos logit contributions
ull+0.006
 shall+0.005
'll+0.005
 Jane+0.005
 finally+0.005
Neg logit contributions
assing-0.010
obe-0.010
ging-0.010
 adoption-0.010
agging-0.009
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
assing+0.101
obe+0.099
ging+0.096
ilda+0.094
 adoption+0.094
Neg logit contributions
ull-0.067
 shall-0.059
 finally-0.056
'll-0.055
 could-0.051
Token They
Token ID1119
Feature activation+8.55e-04

Pos logit contributions
ull+0.016
 shall+0.013
 stirred+0.012
 cooled+0.012
'll+0.012
Neg logit contributions
assing-0.023
obe-0.022
 adoption-0.022
 yourself-0.021
 yourselves-0.021
Token said
Token ID531
Feature activation+1.52e-02

Pos logit contributions
ull+0.001
 shall+0.001
 Jane+0.001
 marsh+0.001
 snowball+0.001
Neg logit contributions
assing-0.002
obe-0.002
 adoption-0.002
 hurting-0.002
hy-0.001
Token they
Token ID484
Feature activation+6.60e-03

Pos logit contributions
ull+0.018
 shall+0.014
'll+0.013
 Jane+0.012
 finally+0.012
Neg logit contributions
assing-0.032
obe-0.031
ging-0.031
 hurting-0.030
 adoption-0.030
Token loved
Token ID6151
Feature activation+1.58e-02

Pos logit contributions
ull+0.008
 Jane+0.006
 shall+0.006
 marsh+0.005
 snowball+0.005
Neg logit contributions
assing-0.014
obe-0.014
 hurting-0.013
 adoption-0.013
ging-0.013
Token was
Token ID373
Feature activation+1.11e-03

Pos logit contributions
assing+0.055
obe+0.053
ging+0.052
 adoption+0.051
ilda+0.051
Neg logit contributions
ull-0.027
 shall-0.021
 Jane-0.019
'll-0.018
 finally-0.018
Token able
Token ID1498
Feature activation+1.70e-03

Pos logit contributions
ull+0.002
 shall+0.002
 marsh+0.001
'll+0.001
 snowball+0.001
Neg logit contributions
assing-0.002
 hurting-0.002
ging-0.002
eking-0.002
agging-0.002
Token to
Token ID284
Feature activation-1.83e-02

Pos logit contributions
ull+0.002
 shall+0.002
 Jane+0.001
 marsh+0.001
'll+0.001
Neg logit contributions
assing-0.004
obe-0.003
ging-0.003
 adoption-0.003
 hurting-0.003
Token finish
Token ID5461
Feature activation-1.02e-03

Pos logit contributions
assing+0.030
obe+0.029
ging+0.028
 adoption+0.027
ilda+0.027
Neg logit contributions
ull-0.030
 shall-0.026
 Jane-0.025
'll-0.024
 finally-0.024
Token his
Token ID465
Feature activation-1.68e-02

Pos logit contributions
assing+0.002
obe+0.002
ging+0.002
ilda+0.002
 adoption+0.002
Neg logit contributions
ull-0.001
 shall-0.001
 Jane-0.001
 finally-0.001
 -0.001
Token tour
Token ID4205
Feature activation-5.11e-02

Pos logit contributions
assing+0.030
ging+0.029
obe+0.029
ilda+0.028
iva+0.027
Neg logit contributions
ull-0.025
 shall-0.022
 finally-0.020
 Jane-0.020
'll-0.020
Token happily
Token ID18177
Feature activation-2.94e-03

Pos logit contributions
assing+0.104
ging+0.101
obe+0.098
 adoption+0.095
eking+0.093
Neg logit contributions
ull-0.063
 shall-0.055
 finally-0.054
 Jane-0.052
 -0.049
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
assing+0.007
obe+0.006
ilda+0.006
ging+0.006
iva+0.006
Neg logit contributions
ull-0.003
 finally-0.003
 shall-0.002
 happily-0.002
 soon-0.002
Token<|endoftext|>
Token ID50256
Feature activation+3.68e-03

Pos logit contributions
assing+0.037
obe+0.036
ging+0.035
 adoption+0.034
ilda+0.034
Neg logit contributions
ull-0.015
 shall-0.012
 Jane-0.010
 finally-0.010
'll-0.010
TokenOnce
Token ID7454
Feature activation-9.80e-03

Pos logit contributions
ull+0.004
 shall+0.003
isel+0.003
 stirred+0.003
 measured+0.003
Neg logit contributions
assing-0.007
obe-0.007
iva-0.007
 adoption-0.007
ilda-0.007
Token there
Token ID612
Feature activation+1.07e-02

Pos logit contributions
ilda+0.014
agging+0.013
ging+0.013
annah+0.013
assing+0.013
Neg logit contributions
ull-0.016
 stirred-0.015
 shall-0.015
 marsh-0.014
 snowball-0.014
Token everyone
Token ID2506
Feature activation-1.36e-02

Pos logit contributions
ull+0.005
 shall+0.005
 Jane+0.004
 finally+0.004
'll+0.004
Neg logit contributions
assing-0.005
obe-0.004
ging-0.004
 adoption-0.004
ilda-0.004
Token,
Token ID11
Feature activation+2.04e-03

Pos logit contributions
assing+0.031
obe+0.029
ging+0.028
agging+0.028
 adoption+0.027
Neg logit contributions
ull-0.016
 shall-0.014
 Jane-0.013
 finally-0.013
'll-0.012
Token she
Token ID673
Feature activation-1.54e-02

Pos logit contributions
ull+0.003
 shall+0.002
'll+0.002
 Jane+0.002
 finally+0.002
Neg logit contributions
assing-0.004
obe-0.004
ging-0.004
 adoption-0.004
ilda-0.004
Token tucked
Token ID29779
Feature activation+2.55e-03

Pos logit contributions
assing+0.031
obe+0.030
ilda+0.029
ging+0.029
 adoption+0.029
Neg logit contributions
ull-0.019
 shall-0.015
 Jane-0.014
 snowball-0.013
 marsh-0.013
Token her
Token ID607
Feature activation-4.93e-03

Pos logit contributions
ull+0.005
 shall+0.004
 Jane+0.004
 finally+0.004
'll+0.004
Neg logit contributions
assing-0.004
obe-0.003
ging-0.003
 adoption-0.003
ilda-0.003
Token diary
Token ID26339
Feature activation-5.03e-02

Pos logit contributions
assing+0.010
obe+0.009
ging+0.009
 adoption+0.009
iva+0.009
Neg logit contributions
ull-0.006
 shall-0.006
 finally-0.006
 Mom-0.006
'll-0.006
Token into
Token ID656
Feature activation-4.20e-03

Pos logit contributions
assing+0.097
obe+0.092
ging+0.088
 adoption+0.087
iva+0.083
Neg logit contributions
ull-0.068
 shall-0.060
 finally-0.060
'll-0.056
 Jane-0.055
Token her
Token ID607
Feature activation-1.17e-02

Pos logit contributions
assing+0.007
obe+0.007
ging+0.006
agging+0.006
 adoption+0.006
Neg logit contributions
ull-0.007
 shall-0.006
 Jane-0.006
 finally-0.006
 Mom-0.006
Token bed
Token ID3996
Feature activation-3.59e-02

Pos logit contributions
assing+0.025
ging+0.024
obe+0.023
ting+0.022
pper+0.022
Neg logit contributions
 Mom-0.013
ull-0.013
 -0.013
 shall-0.012
 finally-0.012
Token and
Token ID290
Feature activation-1.74e-02

Pos logit contributions
assing+0.077
obe+0.073
ging+0.071
 adoption+0.069
eking+0.066
Neg logit contributions
ull-0.042
 finally-0.037
 shall-0.035
 Jane-0.034
 -0.034
Token made
Token ID925
Feature activation-1.33e-02

Pos logit contributions
assing+0.035
obe+0.033
ging+0.033
 adoption+0.032
agging+0.031
Neg logit contributions
ull-0.023
 shall-0.019
 Jane-0.018
 finally-0.017
'll-0.017
Token toy
Token ID13373
Feature activation-2.04e-02

Pos logit contributions
assing+0.030
obe+0.029
ging+0.029
 adoption+0.028
ilda+0.027
Neg logit contributions
ull-0.027
 shall-0.023
 Jane-0.022
 finally-0.021
'll-0.021
Token.
Token ID13
Feature activation-1.67e-03

Pos logit contributions
assing+0.041
obe+0.040
ging+0.039
eking+0.039
ilda+0.038
Neg logit contributions
ull-0.024
 shall-0.022
'll-0.020
 finally-0.019
 could-0.019
Token You
Token ID921
Feature activation-1.80e-02

Pos logit contributions
assing+0.002
obe+0.002
 adoption+0.002
ging+0.002
iva+0.002
Neg logit contributions
ull-0.003
 shall-0.003
 finally-0.002
 Jane-0.002
 stirred-0.002
Token are
Token ID389
Feature activation-4.52e-04

Pos logit contributions
assing+0.034
obe+0.033
 adoption+0.031
 hurting+0.031
ging+0.031
Neg logit contributions
ull-0.026
 Jane-0.020
 shall-0.020
 snowball-0.019
 marsh-0.018
Token our
Token ID674
Feature activation+2.12e-03

Pos logit contributions
assing+0.001
obe+0.001
ging+0.001
ilda+0.001
 adoption+0.001
Neg logit contributions
ull-0.001
 shall-0.000
 finally-0.000
 Jane-0.000
 Mom-0.000
Token treasure
Token ID14068
Feature activation-4.96e-02

Pos logit contributions
ull+0.003
 shall+0.002
 Jane+0.002
 finally+0.002
'll+0.002
Neg logit contributions
assing-0.004
obe-0.004
ging-0.004
 adoption-0.004
ilda-0.004
Token.
Token ID13
Feature activation-2.44e-03

Pos logit contributions
assing+0.102
obe+0.098
ging+0.097
 adoption+0.092
eking+0.092
Neg logit contributions
ull-0.062
 shall-0.053
'll-0.050
 finally-0.047
 Jane-0.046
Token We
Token ID775
Feature activation-1.00e-02

Pos logit contributions
assing+0.004
obe+0.003
 adoption+0.003
ging+0.003
iva+0.003
Neg logit contributions
ull-0.004
 shall-0.004
 finally-0.004
 Jane-0.004
 stirred-0.004
Token value
Token ID1988
Feature activation-1.94e-03

Pos logit contributions
assing+0.021
obe+0.020
ging+0.019
 adoption+0.019
ilda+0.019
Neg logit contributions
ull-0.013
 shall-0.010
 Jane-0.010
 finally-0.009
'll-0.009
Token you
Token ID345
Feature activation-9.27e-03

Pos logit contributions
assing+0.004
obe+0.004
ging+0.003
 adoption+0.003
agging+0.003
Neg logit contributions
ull-0.003
 shall-0.002
 Jane-0.002
'll-0.002
 finally-0.002
Token more
Token ID517
Feature activation-3.97e-03

Pos logit contributions
assing+0.019
obe+0.018
ging+0.018
 adoption+0.017
ilda+0.017
Neg logit contributions
ull-0.012
 shall-0.010
'll-0.009
 finally-0.009
 Jane-0.009
Token brother
Token ID3956
Feature activation-7.68e-03

Pos logit contributions
assing+0.001
obe+0.001
ging+0.001
 adoption+0.001
ilda+0.001
Neg logit contributions
ull-0.001
 shall-0.001
 Jane-0.001
 finally-0.001
'll-0.001
Token.
Token ID13
Feature activation+2.17e-03

Pos logit contributions
assing+0.015
obe+0.014
ging+0.014
 adoption+0.014
ilda+0.014
Neg logit contributions
ull-0.010
 shall-0.009
 Jane-0.008
 finally-0.008
'll-0.008
Token And
Token ID843
Feature activation-3.96e-03

Pos logit contributions
ull+0.003
 finally+0.002
 stirred+0.002
 shall+0.002
 cooled+0.002
Neg logit contributions
assing-0.004
obe-0.004
hy-0.004
 adoption-0.004
iva-0.004
Token no
Token ID645
Feature activation-1.77e-03

Pos logit contributions
assing+0.006
obe+0.006
ging+0.006
 adoption+0.006
ilda+0.006
Neg logit contributions
ull-0.007
 shall-0.006
 Jane-0.005
 finally-0.005
'll-0.005
Token more
Token ID517
Feature activation-4.65e-03

Pos logit contributions
assing+0.003
obe+0.003
 hurting+0.003
 adoption+0.003
ging+0.003
Neg logit contributions
ull-0.003
 shall-0.002
'll-0.002
 finally-0.002
 Jane-0.002
Token swords
Token ID19735
Feature activation-4.94e-02

Pos logit contributions
assing+0.008
obe+0.007
ging+0.007
 hurting+0.007
 adoption+0.007
Neg logit contributions
ull-0.008
 shall-0.007
'll-0.006
 Jane-0.006
 finally-0.006
Token,
Token ID11
Feature activation-5.69e-03

Pos logit contributions
assing+0.099
obe+0.097
ging+0.095
ilda+0.093
 adoption+0.092
Neg logit contributions
ull-0.061
 shall-0.054
'll-0.051
 finally-0.050
 could-0.046
Token OK
Token ID7477
Feature activation+1.05e-02

Pos logit contributions
assing+0.011
obe+0.010
ging+0.010
 adoption+0.010
 hurting+0.010
Neg logit contributions
ull-0.008
 shall-0.007
 Jane-0.006
 finally-0.006
'll-0.006
Token?"
Token ID1701
Feature activation-2.71e-03

Pos logit contributions
ull+0.016
 shall+0.014
 finally+0.013
 marsh+0.013
 Jane+0.013
Neg logit contributions
obe-0.018
assing-0.017
 hurting-0.017
 yourself-0.016
ging-0.016
Token\n
Token ID198
Feature activation-1.40e-03

Pos logit contributions
obe+0.005
assing+0.005
 yourself+0.005
ging+0.005
 hurting+0.005
Neg logit contributions
ull-0.003
 shall-0.003
'll-0.003
 cooled-0.003
 stirred-0.002
Token\n
Token ID198
Feature activation-5.72e-03

Pos logit contributions
assing+0.003
obe+0.003
ging+0.003
ilda+0.003
 adoption+0.003
Neg logit contributions
ull-0.001
 shall-0.001
'll-0.001
 stirred-0.001
 Jane-0.001
Token fun
Token ID1257
Feature activation-1.85e-03

Pos logit contributions
assing+0.006
obe+0.005
ging+0.005
 adoption+0.005
ilda+0.005
Neg logit contributions
ull-0.005
 shall-0.004
 Jane-0.004
 finally-0.004
'll-0.004
Token.
Token ID13
Feature activation+9.18e-03

Pos logit contributions
assing+0.003
obe+0.003
ging+0.003
 adoption+0.003
ilda+0.003
Neg logit contributions
ull-0.003
 shall-0.002
 finally-0.002
 Jane-0.002
'll-0.002
Token It
Token ID632
Feature activation-2.96e-03

Pos logit contributions
ull+0.017
 finally+0.013
 shall+0.013
 stirred+0.013
 measured+0.013
Neg logit contributions
assing-0.013
 adoption-0.013
obe-0.012
 yourself-0.012
hy-0.012
Token ruined
Token ID20484
Feature activation+5.12e-04

Pos logit contributions
assing+0.005
obe+0.005
ging+0.005
 adoption+0.005
 hurting+0.005
Neg logit contributions
ull-0.004
 shall-0.004
 Jane-0.003
 finally-0.003
 -0.003
Token our
Token ID674
Feature activation-6.42e-03

Pos logit contributions
ull+0.001
 shall+0.001
 finally+0.001
'll+0.001
 Jane+0.001
Neg logit contributions
assing-0.001
obe-0.001
ging-0.001
 adoption-0.001
ilda-0.001
Token castles
Token ID49203
Feature activation-4.92e-02

Pos logit contributions
assing+0.012
obe+0.011
ging+0.011
 adoption+0.011
agging+0.010
Neg logit contributions
ull-0.010
 shall-0.008
 Jane-0.007
 finally-0.007
'll-0.007
Token.
Token ID13
Feature activation+2.45e-03

Pos logit contributions
assing+0.098
obe+0.093
ging+0.093
 adoption+0.090
ilda+0.088
Neg logit contributions
ull-0.065
 shall-0.055
 finally-0.050
'll-0.050
 Jane-0.050
Token They
Token ID1119
Feature activation-7.70e-03

Pos logit contributions
ull+0.005
 finally+0.004
 shall+0.004
 stirred+0.004
 cooled+0.004
Neg logit contributions
assing-0.003
 adoption-0.003
 yourself-0.003
obe-0.003
iva-0.003
Token were
Token ID547
Feature activation+2.03e-03

Pos logit contributions
assing+0.015
obe+0.014
ging+0.014
 adoption+0.014
ilda+0.013
Neg logit contributions
ull-0.011
 shall-0.009
'll-0.008
 finally-0.008
 Jane-0.008
Token so
Token ID523
Feature activation-5.33e-03

Pos logit contributions
ull+0.003
 shall+0.003
'll+0.003
 Jane+0.003
 +0.003
Neg logit contributions
assing-0.003
 hurting-0.003
obe-0.003
eking-0.003
ging-0.003
Token pretty
Token ID2495
Feature activation+1.24e-02

Pos logit contributions
assing+0.009
obe+0.009
ging+0.009
eking+0.008
agging+0.008
Neg logit contributions
ull-0.008
 shall-0.007
 Jane-0.007
 finally-0.006
'll-0.006
Token sticks
Token ID16461
Feature activation-1.74e-02

Pos logit contributions
assing+0.004
obe+0.004
ging+0.003
 adoption+0.003
ilda+0.003
Neg logit contributions
ull-0.003
 shall-0.003
 Jane-0.003
 finally-0.003
'll-0.003
Token were
Token ID547
Feature activation+9.90e-03

Pos logit contributions
assing+0.043
obe+0.042
ging+0.041
 adoption+0.040
ilda+0.040
Neg logit contributions
ull-0.015
 shall-0.011
 finally-0.010
'll-0.010
 Jane-0.009
Token no
Token ID645
Feature activation+2.37e-03

Pos logit contributions
ull+0.016
 shall+0.013
 Jane+0.012
 finally+0.012
'll+0.012
Neg logit contributions
assing-0.017
obe-0.017
ging-0.016
 adoption-0.016
ilda-0.015
Token longer
Token ID2392
Feature activation-9.85e-03

Pos logit contributions
ull+0.003
 shall+0.002
 +0.002
 Jane+0.002
'll+0.002
Neg logit contributions
assing-0.005
obe-0.005
ging-0.005
 adoption-0.005
agging-0.005
Token powerful
Token ID3665
Feature activation-1.20e-03

Pos logit contributions
assing+0.019
obe+0.018
ilda+0.017
 adoption+0.017
ging+0.016
Neg logit contributions
ull-0.015
 stirred-0.012
 shall-0.012
 finally-0.012
 Jane-0.012
Token weapons
Token ID3777
Feature activation-4.91e-02

Pos logit contributions
assing+0.002
obe+0.002
ilda+0.002
ging+0.002
 adoption+0.002
Neg logit contributions
ull-0.002
 shall-0.001
 Jane-0.001
 finally-0.001
 stirred-0.001
Token,
Token ID11
Feature activation+9.77e-03

Pos logit contributions
assing+0.108
obe+0.107
ging+0.103
 adoption+0.101
agging+0.100
Neg logit contributions
ull-0.051
 shall-0.046
 finally-0.044
'll-0.042
 could-0.040
Token but
Token ID475
Feature activation+8.15e-04

Pos logit contributions
ull+0.013
 shall+0.011
 Jane+0.010
 finally+0.010
'll+0.010
Neg logit contributions
assing-0.019
obe-0.019
ging-0.018
 adoption-0.018
ilda-0.018
Token perfect
Token ID2818
Feature activation+1.00e-02

Pos logit contributions
ull+0.001
 snowball+0.001
 shall+0.001
 marsh+0.001
'll+0.001
Neg logit contributions
assing-0.001
obe-0.001
ging-0.001
 hurting-0.001
 adoption-0.001
Token stems
Token ID21552
Feature activation-5.94e-03

Pos logit contributions
ull+0.015
 shall+0.013
 Jane+0.012
 finally+0.012
'll+0.012
Neg logit contributions
assing-0.019
obe-0.018
ging-0.017
 adoption-0.017
ilda-0.016
Token and
Token ID290
Feature activation+6.21e-03

Pos logit contributions
assing+0.013
obe+0.013
ging+0.012
 adoption+0.012
ilda+0.012
Neg logit contributions
ull-0.007
 shall-0.005
'll-0.005
 finally-0.005
 Jane-0.005
Token them
Token ID606
Feature activation-1.50e-02

Pos logit contributions
assing+0.008
obe+0.008
ging+0.007
ilda+0.007
agging+0.007
Neg logit contributions
ull-0.006
 shall-0.006
'll-0.005
 Jane-0.005
 finally-0.005
Token away
Token ID1497
Feature activation-1.17e-02

Pos logit contributions
assing+0.032
obe+0.032
ging+0.030
iva+0.030
ilda+0.030
Neg logit contributions
ull-0.015
 shall-0.014
'll-0.014
 soon-0.013
 finally-0.013
Token from
Token ID422
Feature activation-3.53e-03

Pos logit contributions
assing+0.022
obe+0.021
ilda+0.020
ging+0.020
agging+0.020
Neg logit contributions
ull-0.017
 shall-0.015
 Jane-0.014
'll-0.014
 finally-0.013
Token their
Token ID511
Feature activation-4.80e-04

Pos logit contributions
assing+0.006
obe+0.006
ilda+0.006
ging+0.006
agging+0.006
Neg logit contributions
ull-0.005
 shall-0.004
 finally-0.004
 Jane-0.004
 Mom-0.004
Token natural
Token ID3288
Feature activation-4.95e-03

Pos logit contributions
assing+0.001
obe+0.001
ging+0.001
ilda+0.001
agging+0.001
Neg logit contributions
ull-0.001
 shall-0.001
'll-0.001
 finally-0.001
 wonder-0.001
Token habitats
Token ID35308
Feature activation-4.89e-02

Pos logit contributions
assing+0.008
obe+0.008
ging+0.008
ilda+0.008
 adoption+0.007
Neg logit contributions
ull-0.008
 shall-0.007
 Jane-0.007
'll-0.007
 finally-0.006
Token.
Token ID13
Feature activation-5.23e-03

Pos logit contributions
assing+0.095
obe+0.093
ging+0.089
 adoption+0.087
ilda+0.086
Neg logit contributions
ull-0.067
 shall-0.057
 Jane-0.053
 finally-0.051
'll-0.051
Token We
Token ID775
Feature activation-3.01e-03

Pos logit contributions
assing+0.011
obe+0.011
 adoption+0.010
iva+0.010
ilda+0.010
Neg logit contributions
ull-0.006
 stirred-0.005
 marsh-0.005
 shall-0.005
 snowball-0.004
Token should
Token ID815
Feature activation-1.29e-02

Pos logit contributions
assing+0.006
obe+0.005
 adoption+0.005
iva+0.005
ging+0.005
Neg logit contributions
ull-0.004
 Jane-0.003
 snowball-0.003
 marsh-0.003
 shall-0.003
Token be
Token ID307
Feature activation-2.46e-03

Pos logit contributions
assing+0.027
ging+0.026
obe+0.026
ilda+0.025
 adoption+0.025
Neg logit contributions
ull-0.015
 shall-0.013
 finally-0.012
'll-0.011
 Jane-0.011
Token friendly
Token ID8030
Feature activation+1.88e-03

Pos logit contributions
assing+0.004
obe+0.004
ging+0.004
 adoption+0.004
agging+0.004
Neg logit contributions
ull-0.004
 shall-0.003
 Jane-0.003
 finally-0.003
'll-0.003
Token can
Token ID460
Feature activation-1.57e-02

Pos logit contributions
assing+0.032
obe+0.031
ging+0.030
 adoption+0.030
eking+0.030
Neg logit contributions
ull-0.013
 shall-0.012
'll-0.010
 finally-0.010
 Jane-0.009
Token help
Token ID1037
Feature activation-1.76e-02

Pos logit contributions
assing+0.028
ging+0.028
obe+0.027
 adoption+0.026
agging+0.026
Neg logit contributions
ull-0.022
 finally-0.020
 shall-0.020
'll-0.019
 -0.018
Token you
Token ID345
Feature activation-1.46e-02

Pos logit contributions
assing+0.031
ging+0.029
hy+0.028
obe+0.027
agging+0.027
Neg logit contributions
ull-0.027
 shall-0.025
 Jane-0.024
 Mom-0.023
 finally-0.023
Token achieve
Token ID4620
Feature activation-2.32e-02

Pos logit contributions
assing+0.025
obe+0.024
ging+0.024
 adoption+0.024
agging+0.023
Neg logit contributions
ull-0.022
 shall-0.020
 finally-0.019
'll-0.019
 -0.018
Token your
Token ID534
Feature activation-1.72e-02

Pos logit contributions
assing+0.047
hy+0.045
ging+0.045
obe+0.044
 adoption+0.043
Neg logit contributions
ull-0.027
 shall-0.026
 finally-0.024
 Jane-0.022
 -0.022
Token dreams
Token ID10625
Feature activation-4.88e-02

Pos logit contributions
assing+0.032
ging+0.031
pper+0.030
eking+0.029
ilda+0.029
Neg logit contributions
ull-0.023
 shall-0.022
 finally-0.021
'll-0.020
 wonder-0.020
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
assing+0.098
ging+0.095
obe+0.094
 adoption+0.091
ilda+0.088
Neg logit contributions
ull-0.060
 shall-0.056
 finally-0.052
'll-0.049
 Jane-0.048
Token<|endoftext|>
Token ID50256
Feature activation+1.55e-03

Pos logit contributions
assing+0.029
obe+0.029
 adoption+0.028
ilda+0.028
iva+0.028
Neg logit contributions
ull-0.010
 shall-0.007
 marsh-0.007
 stirred-0.007
 finally-0.006
TokenOnce
Token ID7454
Feature activation-1.20e-02

Pos logit contributions
ull+0.002
 shall+0.001
 stirred+0.001
isel+0.001
 measured+0.001
Neg logit contributions
assing-0.003
obe-0.003
 adoption-0.003
ilda-0.003
iva-0.003
Token upon
Token ID2402
Feature activation-1.35e-02

Pos logit contributions
ilda+0.018
ging+0.016
agging+0.016
annah+0.016
assing+0.016
Neg logit contributions
ull-0.020
 shall-0.017
 stirred-0.017
 marsh-0.017
 snowball-0.017
Token a
Token ID257
Feature activation-4.45e-03

Pos logit contributions
assing+0.022
 adoption+0.021
obe+0.021
agging+0.020
ging+0.020
Neg logit contributions
ull-0.020
 shall-0.017
'll-0.016
 Jane-0.016
 marsh-0.016
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
assing+0.030
obe+0.028
ging+0.028
 adoption+0.027
ilda+0.027
Neg logit contributions
ull-0.023
 shall-0.020
 finally-0.018
 Jane-0.018
'll-0.018
Token He
Token ID679
Feature activation-2.88e-02

Pos logit contributions
obe+0.019
assing+0.018
hy+0.017
pering+0.017
 yourself+0.017
Neg logit contributions
ull-0.016
 shall-0.014
'll-0.014
 stirred-0.014
 cooled-0.014
Token puts
Token ID7584
Feature activation-7.40e-03

Pos logit contributions
assing+0.050
ging+0.048
obe+0.048
 adoption+0.046
ilda+0.045
Neg logit contributions
ull-0.045
 shall-0.039
'll-0.036
 finally-0.036
 Jane-0.036
Token on
Token ID319
Feature activation-1.83e-03

Pos logit contributions
assing+0.014
ging+0.013
obe+0.013
agging+0.012
 adoption+0.012
Neg logit contributions
ull-0.011
 shall-0.009
'll-0.008
 Jane-0.008
 stirred-0.008
Token his
Token ID465
Feature activation-2.54e-02

Pos logit contributions
assing+0.004
obe+0.004
ging+0.004
 adoption+0.004
ilda+0.004
Neg logit contributions
ull-0.002
 shall-0.002
'll-0.002
 Jane-0.002
 finally-0.002
Token mask
Token ID9335
Feature activation-4.87e-02

Pos logit contributions
assing+0.042
obe+0.041
ging+0.039
ilda+0.038
iva+0.037
Neg logit contributions
ull-0.040
 shall-0.037
 finally-0.036
 snowball-0.035
 Mom-0.035
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
assing+0.105
obe+0.101
ging+0.099
 adoption+0.097
eking+0.096
Neg logit contributions
ull-0.055
 shall-0.045
 finally-0.043
 -0.041
 Jane-0.040
Token He
Token ID679
Feature activation-2.20e-02

Pos logit contributions
obe+0.020
assing+0.019
hy+0.019
 yourself+0.018
 hurting+0.017
Neg logit contributions
ull-0.019
 shall-0.017
'll-0.016
 stirred-0.016
 cooled-0.015
Token thanks
Token ID5176
Feature activation+4.35e-03

Pos logit contributions
assing+0.036
obe+0.034
ging+0.033
 adoption+0.032
ilda+0.032
Neg logit contributions
ull-0.037
 shall-0.032
 Jane-0.030
 finally-0.030
'll-0.030
Token Sam
Token ID3409
Feature activation-1.74e-02

Pos logit contributions
ull+0.005
 shall+0.005
'll+0.004
 stirred+0.004
 finally+0.004
Neg logit contributions
assing-0.009
obe-0.008
ging-0.008
 adoption-0.008
 hurting-0.008
Token.
Token ID13
Feature activation-1.00e-02

Pos logit contributions
assing+0.043
obe+0.040
eking+0.040
ging+0.039
agging+0.038
Neg logit contributions
ull-0.015
 could-0.014
 happily-0.014
'll-0.014
 shall-0.013
Token can
Token ID460
Feature activation-8.70e-03

Pos logit contributions
assing+0.031
obe+0.030
ging+0.029
 adoption+0.028
eking+0.028
Neg logit contributions
ull-0.013
 shall-0.012
'll-0.010
 could-0.010
 Jane-0.009
Token be
Token ID307
Feature activation+1.37e-04

Pos logit contributions
assing+0.017
ging+0.016
obe+0.016
 adoption+0.015
ilda+0.015
Neg logit contributions
ull-0.012
 shall-0.010
 finally-0.010
'll-0.010
 wonder-0.009
Token more
Token ID517
Feature activation-8.40e-03

Pos logit contributions
ull+0.000
 shall+0.000
 Jane+0.000
 finally+0.000
'll+0.000
Neg logit contributions
assing-0.000
obe-0.000
ging-0.000
 adoption-0.000
eking-0.000
Token powerful
Token ID3665
Feature activation-2.12e-02

Pos logit contributions
assing+0.013
obe+0.013
ging+0.013
 adoption+0.012
ilda+0.012
Neg logit contributions
ull-0.014
 shall-0.012
 Jane-0.012
 finally-0.012
'll-0.012
Token than
Token ID621
Feature activation-9.96e-03

Pos logit contributions
assing+0.035
ilda+0.032
obe+0.032
 adoption+0.031
ging+0.030
Neg logit contributions
ull-0.035
 shall-0.032
 wonder-0.030
'll-0.029
 finally-0.029
Token weapons
Token ID3777
Feature activation-4.86e-02

Pos logit contributions
assing+0.017
obe+0.016
ging+0.016
 adoption+0.015
ilda+0.015
Neg logit contributions
ull-0.016
 shall-0.014
 Jane-0.013
 finally-0.013
'll-0.013
Token.
Token ID13
Feature activation-2.32e-03

Pos logit contributions
assing+0.106
obe+0.104
ging+0.101
 adoption+0.099
ilda+0.098
Neg logit contributions
ull-0.048
 shall-0.046
'll-0.043
 could-0.043
 finally-0.042
Token Kind
Token ID14927
Feature activation-1.18e-03

Pos logit contributions
assing+0.004
obe+0.004
 adoption+0.004
iva+0.004
ging+0.004
Neg logit contributions
ull-0.003
 shall-0.003
 stirred-0.003
 finally-0.003
 marsh-0.003
Tokenness
Token ID1108
Feature activation-1.73e-02

Pos logit contributions
assing+0.002
obe+0.002
ilda+0.002
eking+0.002
ging+0.002
Neg logit contributions
ull-0.002
 shall-0.002
'll-0.002
 finally-0.002
 wonder-0.002
Token can
Token ID460
Feature activation-2.21e-03

Pos logit contributions
assing+0.040
obe+0.039
ging+0.037
 adoption+0.037
ilda+0.036
Neg logit contributions
ull-0.018
 shall-0.015
'll-0.014
 finally-0.012
 Jane-0.012
Token change
Token ID1487
Feature activation+2.61e-03

Pos logit contributions
assing+0.004
obe+0.004
ging+0.004
 adoption+0.004
ilda+0.004
Neg logit contributions
ull-0.003
 shall-0.003
 finally-0.002
'll-0.002
 -0.002
Token wrong
Token ID2642
Feature activation-5.21e-03

Pos logit contributions
ull+0.024
 finally+0.022
 Jane+0.021
 excited+0.020
 almost+0.020
Neg logit contributions
assing-0.019
iva-0.019
 adoption-0.019
hy-0.019
obe-0.019
Token.
Token ID13
Feature activation+4.88e-03

Pos logit contributions
assing+0.011
obe+0.010
ilda+0.010
ging+0.010
 adoption+0.010
Neg logit contributions
ull-0.007
 shall-0.006
 Jane-0.005
 Mom-0.005
 finally-0.005
Token She
Token ID1375
Feature activation-1.36e-02

Pos logit contributions
ull+0.009
 shall+0.008
 finally+0.007
'll+0.007
 stirred+0.007
Neg logit contributions
assing-0.007
obe-0.007
ging-0.006
 adoption-0.006
 ME-0.006
Token sees
Token ID7224
Feature activation+7.32e-03

Pos logit contributions
assing+0.024
obe+0.023
ging+0.023
 adoption+0.022
ilda+0.022
Neg logit contributions
ull-0.020
 shall-0.018
'll-0.017
 finally-0.017
 Jane-0.017
Token their
Token ID511
Feature activation-6.94e-04

Pos logit contributions
ull+0.009
 shall+0.008
 Jane+0.007
'll+0.007
 finally+0.007
Neg logit contributions
assing-0.015
obe-0.014
ging-0.014
 adoption-0.014
ilda-0.014
Token masks
Token ID20680
Feature activation-4.85e-02

Pos logit contributions
assing+0.001
obe+0.001
ging+0.001
 adoption+0.001
ilda+0.001
Neg logit contributions
ull-0.001
 Mom-0.001
 shall-0.001
 finally-0.001
 snowball-0.001
Token and
Token ID290
Feature activation-1.82e-02

Pos logit contributions
assing+0.112
obe+0.110
ilda+0.109
 adoption+0.107
iva+0.106
Neg logit contributions
ull-0.043
 shall-0.039
 finally-0.037
 stirred-0.032
 -0.032
Token their
Token ID511
Feature activation-3.79e-03

Pos logit contributions
assing+0.033
obe+0.032
ging+0.031
 adoption+0.030
ilda+0.030
Neg logit contributions
ull-0.027
 shall-0.023
 Jane-0.022
 finally-0.021
'll-0.021
Token paints
Token ID32281
Feature activation-1.19e-02

Pos logit contributions
assing+0.006
obe+0.006
ging+0.006
ilda+0.006
 adoption+0.006
Neg logit contributions
ull-0.006
 shall-0.005
 Mom-0.005
 wonder-0.005
 snowball-0.005
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
assing+0.025
obe+0.024
 adoption+0.023
ilda+0.023
ging+0.023
Neg logit contributions
ull-0.015
 shall-0.013
 finally-0.012
 Mom-0.011
 Jane-0.011
Token their
Token ID511
Feature activation-4.10e-04

Pos logit contributions
assing+0.022
obe+0.021
ging+0.021
 adoption+0.020
ilda+0.020
Neg logit contributions
ull-0.022
 shall-0.019
 Jane-0.018
 finally-0.018
'll-0.018
Token can
Token ID460
Feature activation-3.60e-03

Pos logit contributions
assing+0.035
obe+0.034
ging+0.033
 adoption+0.032
eking+0.032
Neg logit contributions
ull-0.014
 shall-0.013
'll-0.012
 could-0.011
 finally-0.010
Token help
Token ID1037
Feature activation-4.52e-03

Pos logit contributions
assing+0.007
ging+0.007
obe+0.007
 adoption+0.006
ilda+0.006
Neg logit contributions
ull-0.005
 shall-0.004
 finally-0.004
'll-0.004
 Jane-0.004
Token us
Token ID514
Feature activation-6.22e-03

Pos logit contributions
assing+0.008
obe+0.007
ging+0.007
agging+0.007
 adoption+0.007
Neg logit contributions
ull-0.007
 shall-0.007
 Jane-0.006
'll-0.006
 finally-0.006
Token conquer
Token ID23875
Feature activation-1.45e-02

Pos logit contributions
assing+0.012
obe+0.012
ging+0.011
 adoption+0.011
ilda+0.011
Neg logit contributions
ull-0.009
 shall-0.007
'll-0.007
 finally-0.007
 Jane-0.006
Token our
Token ID674
Feature activation+8.58e-05

Pos logit contributions
assing+0.029
obe+0.029
agging+0.027
ging+0.027
ilda+0.027
Neg logit contributions
ull-0.018
 shall-0.016
 finally-0.015
 Jane-0.015
'll-0.014
Token fears
Token ID10251
Feature activation-4.85e-02

Pos logit contributions
ull+0.000
 shall+0.000
'll+0.000
 wonder+0.000
 finally+0.000
Neg logit contributions
assing-0.000
obe-0.000
ging-0.000
ilda-0.000
pper-0.000
Token and
Token ID290
Feature activation-8.96e-03

Pos logit contributions
assing+0.101
obe+0.099
ging+0.093
 adoption+0.093
ilda+0.091
Neg logit contributions
ull-0.058
 shall-0.053
'll-0.048
 finally-0.047
 Jane-0.046
Token make
Token ID787
Feature activation-1.20e-02

Pos logit contributions
assing+0.016
obe+0.015
ging+0.015
 adoption+0.014
ilda+0.014
Neg logit contributions
ull-0.014
 shall-0.012
 Jane-0.011
 finally-0.011
'll-0.011
Token us
Token ID514
Feature activation-8.25e-03

Pos logit contributions
assing+0.025
obe+0.024
ging+0.023
 adoption+0.023
ilda+0.022
Neg logit contributions
ull-0.015
 shall-0.012
 Jane-0.011
 finally-0.011
'll-0.011
Token feel
Token ID1254
Feature activation-2.35e-03

Pos logit contributions
assing+0.018
obe+0.017
ging+0.017
 adoption+0.016
ilda+0.016
Neg logit contributions
ull-0.010
 shall-0.008
'll-0.007
 finally-0.007
 Jane-0.007
Token better
Token ID1365
Feature activation+7.76e-04

Pos logit contributions
assing+0.004
ging+0.004
obe+0.004
 adoption+0.004
 hurting+0.004
Neg logit contributions
ull-0.003
 shall-0.003
 Jane-0.003
 finally-0.003
'll-0.003
Token little
Token ID1310
Feature activation+1.85e-03

Pos logit contributions
ull+0.006
 shall+0.005
'll+0.005
 Jane+0.005
 finally+0.005
Neg logit contributions
assing-0.009
ging-0.009
obe-0.009
 adoption-0.008
agging-0.008
Token frog
Token ID21264
Feature activation-6.49e-03

Pos logit contributions
ull+0.002
 shall+0.002
'll+0.002
 Jane+0.002
 finally+0.002
Neg logit contributions
assing-0.004
obe-0.003
ging-0.003
 hurting-0.003
agging-0.003
Token put
Token ID1234
Feature activation-1.60e-03

Pos logit contributions
assing+0.013
obe+0.013
ging+0.013
 adoption+0.013
ilda+0.012
Neg logit contributions
ull-0.008
 shall-0.006
 Jane-0.006
'll-0.005
 marsh-0.005
Token on
Token ID319
Feature activation-1.04e-03

Pos logit contributions
ging+0.003
assing+0.003
obe+0.003
 hurting+0.003
eking+0.003
Neg logit contributions
ull-0.002
 shall-0.002
'll-0.002
 Jane-0.002
 finally-0.002
Token his
Token ID465
Feature activation-9.61e-03

Pos logit contributions
assing+0.002
obe+0.002
ging+0.002
 adoption+0.002
ilda+0.002
Neg logit contributions
ull-0.001
 shall-0.001
 Jane-0.001
 finally-0.001
'll-0.001
Token sunglasses
Token ID36251
Feature activation-4.85e-02

Pos logit contributions
assing+0.017
obe+0.016
ging+0.016
ilda+0.015
 adoption+0.015
Neg logit contributions
ull-0.015
 shall-0.013
 finally-0.012
 Mom-0.012
 Jane-0.012
Token and
Token ID290
Feature activation-3.60e-03

Pos logit contributions
assing+0.107
obe+0.103
ging+0.099
 adoption+0.098
eking+0.094
Neg logit contributions
ull-0.052
 shall-0.046
 finally-0.045
 -0.042
'll-0.042
Token hopped
Token ID45230
Feature activation-1.20e-02

Pos logit contributions
assing+0.007
obe+0.007
ging+0.007
 adoption+0.006
ilda+0.006
Neg logit contributions
ull-0.005
 shall-0.004
 Jane-0.004
'll-0.004
 finally-0.003
Token off
Token ID572
Feature activation-1.03e-02

Pos logit contributions
assing+0.022
obe+0.021
ging+0.020
 adoption+0.020
ilda+0.020
Neg logit contributions
ull-0.018
 shall-0.015
 finally-0.015
 Jane-0.015
'll-0.014
Token to
Token ID284
Feature activation-4.57e-03

Pos logit contributions
assing+0.020
obe+0.019
ging+0.019
 adoption+0.018
ilda+0.018
Neg logit contributions
ull-0.014
 shall-0.012
 Jane-0.011
 finally-0.011
'll-0.011
Token explore
Token ID7301
Feature activation-2.27e-02

Pos logit contributions
assing+0.008
obe+0.008
ging+0.008
 adoption+0.008
ilda+0.008
Neg logit contributions
ull-0.007
 shall-0.006
 Jane-0.005
 finally-0.005
'll-0.005
Token was
Token ID373
Feature activation+3.92e-04

Pos logit contributions
assing+0.004
obe+0.004
ilda+0.004
 adoption+0.004
ging+0.004
Neg logit contributions
ull-0.003
 shall-0.003
 Jane-0.002
 marsh-0.002
ores-0.002
Token sad
Token ID6507
Feature activation-3.08e-03

Pos logit contributions
ull+0.001
 shall+0.001
 volcan+0.001
'll+0.001
 volcano+0.001
Neg logit contributions
 hurting-0.001
assing-0.001
agging-0.001
eking-0.001
obe-0.001
Token to
Token ID284
Feature activation-9.15e-03

Pos logit contributions
assing+0.006
obe+0.006
ging+0.006
 adoption+0.006
agging+0.005
Neg logit contributions
ull-0.004
 shall-0.004
 Jane-0.003
 finally-0.003
'll-0.003
Token leave
Token ID2666
Feature activation-6.68e-03

Pos logit contributions
assing+0.017
ging+0.016
obe+0.016
 adoption+0.016
agging+0.015
Neg logit contributions
ull-0.013
 shall-0.011
 finally-0.011
 Mom-0.011
 Jane-0.011
Token her
Token ID607
Feature activation-1.22e-02

Pos logit contributions
assing+0.012
obe+0.012
ging+0.011
 adoption+0.011
agging+0.011
Neg logit contributions
ull-0.010
 Jane-0.009
 shall-0.009
 finally-0.008
 Mom-0.008
Token tent
Token ID11105
Feature activation-4.84e-02

Pos logit contributions
assing+0.023
obe+0.023
ging+0.022
 adoption+0.021
ilda+0.021
Neg logit contributions
ull-0.017
 shall-0.014
 Jane-0.013
 finally-0.013
'll-0.013
Token and
Token ID290
Feature activation-4.84e-03

Pos logit contributions
assing+0.097
obe+0.092
ging+0.091
 adoption+0.088
ilda+0.086
Neg logit contributions
ull-0.063
 finally-0.054
 shall-0.054
 Jane-0.052
'll-0.049
Token puzzle
Token ID15027
Feature activation-6.11e-03

Pos logit contributions
assing+0.009
obe+0.009
 adoption+0.009
 hurting+0.009
ging+0.009
Neg logit contributions
ull-0.007
 shall-0.005
 Jane-0.005
 marsh-0.005
 snowball-0.005
Token behind
Token ID2157
Feature activation-2.20e-03

Pos logit contributions
assing+0.012
obe+0.011
ging+0.011
 adoption+0.011
ilda+0.010
Neg logit contributions
ull-0.008
 shall-0.007
 Jane-0.007
 finally-0.007
'll-0.007
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
assing+0.004
obe+0.003
ging+0.003
 adoption+0.003
ilda+0.003
Neg logit contributions
ull-0.004
 shall-0.003
 Jane-0.003
 finally-0.003
'll-0.003
Token But
Token ID887
Feature activation-9.94e-03

Pos logit contributions
assing+0.017
obe+0.017
ilda+0.016
 adoption+0.016
annah+0.016
Neg logit contributions
ull-0.020
 shall-0.015
 snowball-0.015
 stirred-0.015
 marsh-0.015
Token asleep
Token ID16039
Feature activation-2.69e-02

Pos logit contributions
assing+0.035
obe+0.033
ging+0.033
ilda+0.031
 adoption+0.031
Neg logit contributions
ull-0.040
 shall-0.035
 Jane-0.033
'll-0.032
 finally-0.032
Token dreaming
Token ID28368
Feature activation-1.82e-02

Pos logit contributions
assing+0.059
obe+0.055
 supervision+0.055
 adoption+0.054
 property+0.054
Neg logit contributions
 finally-0.028
ull-0.027
 happily-0.025
 -0.023
 Jane-0.022
Token about
Token ID546
Feature activation-1.45e-02

Pos logit contributions
assing+0.042
obe+0.040
ging+0.040
 adoption+0.039
ilda+0.038
Neg logit contributions
ull-0.019
 shall-0.014
 Jane-0.013
 finally-0.013
'll-0.012
Token flying
Token ID7348
Feature activation-1.43e-02

Pos logit contributions
assing+0.027
obe+0.026
ging+0.025
 adoption+0.025
ilda+0.024
Neg logit contributions
ull-0.021
 shall-0.018
'll-0.016
 Jane-0.016
 finally-0.016
Token his
Token ID465
Feature activation-1.74e-02

Pos logit contributions
assing+0.025
obe+0.024
ging+0.024
 adoption+0.023
 hurting+0.023
Neg logit contributions
ull-0.022
 shall-0.019
'll-0.017
 Jane-0.017
 stirred-0.017
Token spaceship
Token ID40663
Feature activation-4.83e-02

Pos logit contributions
assing+0.033
obe+0.031
ging+0.031
 adoption+0.030
ilda+0.029
Neg logit contributions
ull-0.025
 shall-0.021
 finally-0.020
 Jane-0.020
'll-0.019
Token and
Token ID290
Feature activation-1.48e-02

Pos logit contributions
assing+0.095
obe+0.091
ging+0.090
 adoption+0.087
ilda+0.086
Neg logit contributions
ull-0.065
 shall-0.054
 finally-0.050
 Jane-0.050
'll-0.049
Token washing
Token ID20518
Feature activation-6.69e-03

Pos logit contributions
assing+0.021
 hurting+0.020
obe+0.020
ging+0.019
eking+0.018
Neg logit contributions
ull-0.028
 shall-0.025
'll-0.023
 Jane-0.023
 snowball-0.022
Token away
Token ID1497
Feature activation-6.43e-03

Pos logit contributions
assing+0.015
obe+0.014
ging+0.014
 adoption+0.014
ilda+0.013
Neg logit contributions
ull-0.007
 shall-0.006
 Jane-0.005
 finally-0.005
'll-0.005
Token all
Token ID477
Feature activation+1.39e-02

Pos logit contributions
assing+0.014
obe+0.013
ging+0.013
agging+0.013
 adoption+0.013
Neg logit contributions
ull-0.008
 shall-0.006
 Jane-0.006
 Mom-0.006
 finally-0.006
Token the
Token ID262
Feature activation-1.81e-02

Pos logit contributions
ull+0.020
 shall+0.018
'll+0.017
 finally+0.016
 stirred+0.016
Neg logit contributions
obe-0.024
assing-0.023
ging-0.023
ilda-0.023
 hurting-0.023
Token to
Token ID284
Feature activation-1.86e-02

Pos logit contributions
assing+0.036
obe+0.035
ging+0.034
 adoption+0.034
ilda+0.033
Neg logit contributions
ull-0.031
 shall-0.026
 Jane-0.024
'll-0.023
 finally-0.023
Token hear
Token ID3285
Feature activation-2.00e-02

Pos logit contributions
assing+0.033
obe+0.032
ging+0.030
 adoption+0.030
ilda+0.029
Neg logit contributions
ull-0.029
 shall-0.025
 Jane-0.023
'll-0.023
 finally-0.022
Token Gr
Token ID1902
Feature activation-6.88e-04

Pos logit contributions
assing+0.041
obe+0.038
ging+0.038
pper+0.038
agging+0.037
Neg logit contributions
ull-0.025
 shall-0.022
 Jane-0.022
 Mom-0.021
 -0.020
Tokenanny
Token ID7737
Feature activation-1.34e-02

Pos logit contributions
assing+0.002
ging+0.002
obe+0.002
 hurting+0.002
 adoption+0.002
Neg logit contributions
ull-0.001
 shall-0.000
 Jane-0.000
'll-0.000
 stirred-0.000
Token's
Token ID338
Feature activation-7.30e-03

Pos logit contributions
assing+0.027
obe+0.025
ging+0.024
agging+0.024
pper+0.024
Neg logit contributions
ull-0.017
 shall-0.016
 Jane-0.015
 -0.015
 finally-0.014
Token stories
Token ID3923
Feature activation-4.82e-02

Pos logit contributions
assing+0.013
pper+0.012
annah+0.012
ging+0.012
iva+0.012
Neg logit contributions
ull-0.011
 wonder-0.010
 shall-0.009
 finally-0.009
 -0.009
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
assing+0.096
obe+0.091
ging+0.091
 adoption+0.087
eking+0.086
Neg logit contributions
ull-0.063
 shall-0.056
 finally-0.053
 Jane-0.051
'll-0.050
Token<|endoftext|>
Token ID50256
Feature activation+4.95e-03

Pos logit contributions
assing+0.040
obe+0.039
ging+0.038
ilda+0.038
 adoption+0.038
Neg logit contributions
ull-0.011
 shall-0.008
 Jane-0.006
 finally-0.006
 stirred-0.006
TokenOnce
Token ID7454
Feature activation-1.12e-02

Pos logit contributions
ull+0.005
isel+0.004
 shall+0.004
 stirred+0.004
 measured+0.004
Neg logit contributions
assing-0.010
obe-0.009
ilda-0.009
 adoption-0.009
iva-0.009
Token upon
Token ID2402
Feature activation-1.28e-02

Pos logit contributions
ilda+0.018
ging+0.016
assing+0.016
obe+0.016
annah+0.016
Neg logit contributions
ull-0.018
 shall-0.016
 stirred-0.015
 snowball-0.015
 marsh-0.015
Token a
Token ID257
Feature activation-1.06e-03

Pos logit contributions
assing+0.021
 adoption+0.020
obe+0.020
 hurting+0.019
ging+0.019
Neg logit contributions
ull-0.020
 shall-0.017
'll-0.016
isel-0.015
 Jane-0.015
Token alien
Token ID8756
Feature activation-2.38e-02

Pos logit contributions
assing+0.016
obe+0.015
 adoption+0.015
ging+0.015
agging+0.015
Neg logit contributions
ull-0.014
 shall-0.012
 Jane-0.012
 stirred-0.012
'll-0.011
Token friends
Token ID2460
Feature activation-1.63e-02

Pos logit contributions
assing+0.056
ging+0.053
obe+0.053
eking+0.051
agging+0.050
Neg logit contributions
ull-0.025
 finally-0.020
 shall-0.020
 Jane-0.019
 -0.018
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
assing+0.043
obe+0.041
eking+0.041
ging+0.040
 yourselves+0.040
Neg logit contributions
 finally-0.010
ull-0.010
 -0.010
 happily-0.009
-0.009
Token flew
Token ID13112
Feature activation-8.02e-03

Pos logit contributions
assing+0.021
obe+0.020
 adoption+0.020
ilda+0.020
ging+0.019
Neg logit contributions
ull-0.014
 shall-0.011
 Jane-0.010
 snowball-0.010
 marsh-0.010
Token his
Token ID465
Feature activation-1.76e-02

Pos logit contributions
assing+0.011
ging+0.011
obe+0.011
agging+0.011
 adoption+0.010
Neg logit contributions
ull-0.015
 shall-0.013
 stirred-0.012
'll-0.012
 marsh-0.012
Token spaceship
Token ID40663
Feature activation-4.77e-02

Pos logit contributions
assing+0.034
obe+0.033
ging+0.032
 adoption+0.031
ilda+0.031
Neg logit contributions
ull-0.024
 shall-0.020
 finally-0.019
'll-0.019
 Jane-0.018
Token back
Token ID736
Feature activation-1.72e-02

Pos logit contributions
assing+0.088
obe+0.084
ging+0.082
 adoption+0.080
ilda+0.078
Neg logit contributions
ull-0.071
 shall-0.059
 finally-0.057
 Jane-0.055
'll-0.055
Token to
Token ID284
Feature activation-5.06e-03

Pos logit contributions
assing+0.032
obe+0.031
ging+0.031
 adoption+0.030
ilda+0.030
Neg logit contributions
ull-0.024
 shall-0.020
 Jane-0.019
'll-0.018
 finally-0.018
Token Earth
Token ID3668
Feature activation-2.93e-02

Pos logit contributions
assing+0.010
obe+0.010
ging+0.010
 adoption+0.010
ilda+0.009
Neg logit contributions
ull-0.006
 shall-0.005
'll-0.005
 Jane-0.005
 stirred-0.005
Token.
Token ID13
Feature activation-4.45e-03

Pos logit contributions
assing+0.051
obe+0.049
ging+0.048
 adoption+0.046
ilda+0.046
Neg logit contributions
ull-0.046
 shall-0.039
 Jane-0.036
'll-0.036
 finally-0.036
Token He
Token ID679
Feature activation-1.03e-02

Pos logit contributions
assing+0.008
obe+0.008
ilda+0.008
hy+0.008
 adoption+0.008
Neg logit contributions
ull-0.006
 stirred-0.005
 shall-0.004
isel-0.004
 marsh-0.004
Token played
Token ID2826
Feature activation-1.81e-03

Pos logit contributions
assing+0.032
obe+0.031
ging+0.030
 adoption+0.030
ilda+0.029
Neg logit contributions
ull-0.017
 shall-0.014
 Jane-0.012
 finally-0.012
'll-0.012
Token near
Token ID1474
Feature activation-5.10e-03

Pos logit contributions
assing+0.003
obe+0.003
ging+0.003
 adoption+0.003
agging+0.003
Neg logit contributions
ull-0.003
 shall-0.002
 Jane-0.002
'll-0.002
 finally-0.002
Token the
Token ID262
Feature activation-9.16e-03

Pos logit contributions
assing+0.010
obe+0.009
ging+0.009
 adoption+0.009
 hurting+0.009
Neg logit contributions
ull-0.007
 shall-0.006
'll-0.005
 Jane-0.005
 finally-0.005
Token shark
Token ID21027
Feature activation-7.92e-03

Pos logit contributions
assing+0.020
obe+0.019
ging+0.019
 adoption+0.019
agging+0.018
Neg logit contributions
ull-0.010
 shall-0.008
 Jane-0.007
 finally-0.007
'll-0.007
Token's
Token ID338
Feature activation+2.24e-03

Pos logit contributions
assing+0.016
obe+0.015
 adoption+0.014
ging+0.014
 hurting+0.014
Neg logit contributions
ull-0.011
 shall-0.008
 Jane-0.008
 marsh-0.008
'll-0.008
Token territory
Token ID7674
Feature activation-4.74e-02

Pos logit contributions
ull+0.003
 shall+0.003
 Jane+0.003
 finally+0.003
'll+0.003
Neg logit contributions
assing-0.004
obe-0.004
ging-0.004
 adoption-0.004
agging-0.004
Token again
Token ID757
Feature activation-2.52e-03

Pos logit contributions
assing+0.099
obe+0.095
ging+0.093
 adoption+0.091
agging+0.090
Neg logit contributions
ull-0.058
 shall-0.046
 Jane-0.042
'll-0.042
 finally-0.040
Token.
Token ID13
Feature activation+4.47e-03

Pos logit contributions
assing+0.004
obe+0.004
ging+0.004
 adoption+0.004
agging+0.004
Neg logit contributions
ull-0.004
 shall-0.004
 Jane-0.003
'll-0.003
 finally-0.003
Token They
Token ID1119
Feature activation+6.69e-03

Pos logit contributions
ull+0.005
 shall+0.004
 stirred+0.004
 snowball+0.004
 measured+0.004
Neg logit contributions
assing-0.009
obe-0.009
 adoption-0.009
ilda-0.008
iva-0.008
Token remembered
Token ID12086
Feature activation+5.83e-03

Pos logit contributions
ull+0.009
 shall+0.008
 Jane+0.007
 snowball+0.007
 Lots+0.007
Neg logit contributions
assing-0.013
obe-0.012
 hurting-0.012
 adoption-0.011
pper-0.011
Token Nem
Token ID22547
Feature activation+9.61e-03

Pos logit contributions
ull+0.009
 shall+0.008
 Jane+0.008
 finally+0.007
'll+0.007
Neg logit contributions
assing-0.010
obe-0.010
ging-0.009
 adoption-0.009
ilda-0.009
Token her
Token ID607
Feature activation+3.88e-03

Pos logit contributions
ull+0.004
 Jane+0.004
 finally+0.004
 +0.004
 Mom+0.004
Neg logit contributions
assing-0.005
obe-0.005
agging-0.005
ging-0.005
eking-0.005
Token.
Token ID13
Feature activation-8.03e-03

Pos logit contributions
ull+0.004
 Mom+0.004
 shall+0.004
'll+0.003
 finally+0.003
Neg logit contributions
assing-0.008
obe-0.008
ging-0.008
eking-0.008
agging-0.008
Token The
Token ID383
Feature activation-5.10e-03

Pos logit contributions
eking+0.019
ging+0.019
assing+0.019
obe+0.018
agging+0.018
Neg logit contributions
 -0.007
 Mom-0.007
 finally-0.006
ull-0.006
 Jane-0.006
Token wolf
Token ID17481
Feature activation-3.72e-03

Pos logit contributions
assing+0.007
obe+0.007
ging+0.006
 adoption+0.006
ilda+0.006
Neg logit contributions
ull-0.010
 shall-0.009
 Jane-0.008
'll-0.008
 finally-0.008
Token asked
Token ID1965
Feature activation-1.07e-02

Pos logit contributions
assing+0.009
obe+0.008
ging+0.008
 adoption+0.008
 hurting+0.008
Neg logit contributions
ull-0.004
 shall-0.003
 Jane-0.002
'll-0.002
 marsh-0.002
Token her
Token ID607
Feature activation+2.81e-02

Pos logit contributions
assing+0.023
obe+0.022
agging+0.022
eking+0.022
ging+0.021
Neg logit contributions
 -0.012
ull-0.012
 Jane-0.012
 finally-0.012
 Mom-0.011
Token where
Token ID810
Feature activation-1.30e-02

Pos logit contributions
ull+0.042
 shall+0.036
 Jane+0.033
 finally+0.033
'll+0.033
Neg logit contributions
assing-0.052
obe-0.050
ging-0.049
 adoption-0.047
agging-0.046
Token she
Token ID673
Feature activation+1.92e-03

Pos logit contributions
assing+0.024
obe+0.023
ging+0.022
agging+0.022
eking+0.021
Neg logit contributions
ull-0.020
 shall-0.017
 Jane-0.017
 finally-0.016
 Mom-0.016
Token was
Token ID373
Feature activation+4.09e-03

Pos logit contributions
ull+0.001
 shall+0.001
 finally+0.001
'll+0.001
 Jane+0.001
Neg logit contributions
assing-0.005
obe-0.005
ging-0.005
agging-0.005
ilda-0.005
Token going
Token ID1016
Feature activation-5.90e-03

Pos logit contributions
ull+0.006
 shall+0.005
 finally+0.005
 Jane+0.005
 Joe+0.005
Neg logit contributions
assing-0.007
obe-0.007
ging-0.007
iva-0.007
ilda-0.007
Token.
Token ID13
Feature activation-1.74e-03

Pos logit contributions
assing+0.014
ging+0.014
iva+0.013
 supervision+0.013
obe+0.013
Neg logit contributions
ull-0.005
 finally-0.004
-0.004
 -0.004
 shall-0.004
Token
Token ID198
Feature activation+2.01e-03

Pos logit contributions
ull+0.005
 stirred+0.005
 marsh+0.004
 cooled+0.004
 snowball+0.004
Neg logit contributions
assing-0.009
ilda-0.009
obe-0.009
iva-0.009
ging-0.008
Token
Token ID198
Feature activation+4.37e-03

Pos logit contributions
ull+0.003
 stirred+0.002
 shall+0.002
 lava+0.002
 marsh+0.002
Neg logit contributions
assing-0.004
obe-0.004
ilda-0.004
annah-0.003
iva-0.003
TokenFirst
Token ID5962
Feature activation+8.31e-03

Pos logit contributions
ull+0.006
 shall+0.005
 Jane+0.005
 finally+0.005
'll+0.005
Neg logit contributions
assing-0.008
obe-0.008
ging-0.008
 adoption-0.008
ilda-0.007
Token,
Token ID11
Feature activation+2.25e-02

Pos logit contributions
ull+0.014
 shall+0.011
'll+0.010
ores+0.010
 stirred+0.010
Neg logit contributions
assing-0.012
obe-0.012
ging-0.012
 adoption-0.012
 hurting-0.012
Token Sue
Token ID26089
Feature activation-2.13e-03

Pos logit contributions
ull+0.040
 shall+0.031
'll+0.030
 Fisher+0.029
isel+0.028
Neg logit contributions
assing-0.032
 adoption-0.032
obe-0.032
 hurting-0.031
ging-0.031
Token mixed
Token ID7668
Feature activation+3.12e-02

Pos logit contributions
assing+0.004
obe+0.004
ilda+0.004
ging+0.004
 hurting+0.004
Neg logit contributions
ull-0.003
 shall-0.002
 Jane-0.002
 marsh-0.002
 snowball-0.002
Token together
Token ID1978
Feature activation+2.08e-02

Pos logit contributions
ull+0.051
 shall+0.043
'll+0.040
 Nobody+0.039
 Fisher+0.039
Neg logit contributions
ging-0.046
obe-0.046
 yourself-0.043
eking-0.042
assing-0.041
Token a
Token ID257
Feature activation+1.18e-02

Pos logit contributions
ull+0.033
 shall+0.027
'll+0.025
 snowball+0.025
 marsh+0.024
Neg logit contributions
ging-0.033
obe-0.033
assing-0.033
eking-0.031
 hurting-0.031
Token lot
Token ID1256
Feature activation+1.72e-02

Pos logit contributions
ull+0.018
 Jane+0.014
 finally+0.014
 shall+0.014
ores+0.013
Neg logit contributions
assing-0.020
ging-0.019
 adoption-0.019
obe-0.018
 hurting-0.018
Token of
Token ID286
Feature activation+1.15e-02

Pos logit contributions
ull+0.020
 shall+0.016
 finally+0.015
'll+0.015
 snowball+0.015
Neg logit contributions
obe-0.033
ging-0.033
assing-0.033
ilda-0.031
agging-0.031
Token glue
Token ID22749
Feature activation+4.83e-03

Pos logit contributions
ull+0.019
 shall+0.016
 finally+0.015
'll+0.015
 Fisher+0.014
Neg logit contributions
obe-0.018
assing-0.018
ging-0.018
 adoption-0.017
 hurting-0.017
Token first
Token ID717
Feature activation+7.06e-03

Pos logit contributions
assing+0.033
obe+0.032
ging+0.031
 adoption+0.030
ilda+0.030
Neg logit contributions
ull-0.030
 shall-0.026
 Jane-0.024
 finally-0.024
'll-0.024
Token.
Token ID13
Feature activation+1.91e-02

Pos logit contributions
ull+0.010
 shall+0.009
 Jane+0.008
 finally+0.008
'll+0.008
Neg logit contributions
assing-0.013
obe-0.013
ging-0.013
ilda-0.012
 adoption-0.012
Token They
Token ID1119
Feature activation+5.60e-03

Pos logit contributions
ull+0.029
 stirred+0.022
 cooled+0.022
 marsh+0.020
isel+0.020
Neg logit contributions
assing-0.034
 ME-0.032
obe-0.031
 adoption-0.031
hy-0.030
Token count
Token ID954
Feature activation+1.62e-02

Pos logit contributions
ull+0.008
 shall+0.007
 Jane+0.007
 finally+0.007
'll+0.007
Neg logit contributions
assing-0.010
obe-0.010
ging-0.010
 adoption-0.009
ilda-0.009
Token to
Token ID284
Feature activation+1.86e-02

Pos logit contributions
ull+0.023
 shall+0.019
'll+0.018
 finally+0.017
 Jane+0.017
Neg logit contributions
assing-0.030
ging-0.029
obe-0.029
agging-0.028
eking-0.027
Token three
Token ID1115
Feature activation+3.52e-02

Pos logit contributions
ull+0.026
 shall+0.022
'll+0.020
 Jane+0.019
 finally+0.019
Neg logit contributions
assing-0.036
obe-0.034
ging-0.033
 adoption-0.033
ilda-0.031
Token and
Token ID290
Feature activation+9.08e-03

Pos logit contributions
ull+0.038
 shall+0.032
 Jane+0.030
 finally+0.028
 +0.028
Neg logit contributions
assing-0.078
obe-0.075
ilda-0.072
ging-0.072
 adoption-0.072
Token put
Token ID1234
Feature activation+1.76e-02

Pos logit contributions
ull+0.014
 shall+0.011
'll+0.010
 Jane+0.010
 snowball+0.010
Neg logit contributions
assing-0.017
obe-0.016
ging-0.016
 adoption-0.015
 hurting-0.015
Token their
Token ID511
Feature activation+1.78e-02

Pos logit contributions
ull+0.025
 shall+0.021
'll+0.019
 finally+0.018
 Jane+0.018
Neg logit contributions
assing-0.033
ging-0.032
obe-0.031
 adoption-0.030
agging-0.029
Token balls
Token ID11333
Feature activation-9.18e-03

Pos logit contributions
ull+0.027
 shall+0.022
 Jane+0.021
'll+0.020
 finally+0.020
Neg logit contributions
assing-0.032
obe-0.031
ging-0.030
 adoption-0.029
agging-0.029
Token in
Token ID287
Feature activation+1.76e-02

Pos logit contributions
assing+0.015
obe+0.014
ging+0.014
agging+0.013
 adoption+0.013
Neg logit contributions
ull-0.015
 shall-0.013
 Jane-0.012
'll-0.012
 finally-0.012
TokenCan
Token ID6090
Feature activation+8.35e-03

Pos logit contributions
ull+0.009
 volcano+0.007
 shall+0.006
ores+0.006
isel+0.006
Neg logit contributions
assing-0.011
ging-0.010
obe-0.010
 adoption-0.010
ting-0.010
Token we
Token ID356
Feature activation+1.95e-02

Pos logit contributions
ull+0.014
 shall+0.012
 Jane+0.011
 finally+0.011
'll+0.011
Neg logit contributions
assing-0.014
obe-0.013
ging-0.013
 adoption-0.012
ilda-0.012
Token go
Token ID467
Feature activation+1.19e-02

Pos logit contributions
ull+0.041
 stirred+0.029
 volcan+0.029
ores+0.029
 marsh+0.029
Neg logit contributions
assing-0.024
obe-0.022
hy-0.021
ging-0.020
 yourself-0.020
Token on
Token ID319
Feature activation+1.70e-02

Pos logit contributions
ull+0.023
 shall+0.019
 stirred+0.018
 marsh+0.018
 Jane+0.018
Neg logit contributions
assing-0.017
ging-0.016
obe-0.015
agging-0.015
eking-0.015
Token the
Token ID262
Feature activation+2.29e-02

Pos logit contributions
ull+0.027
 shall+0.022
'll+0.020
 stirred+0.020
 Jane+0.020
Neg logit contributions
assing-0.030
obe-0.029
ging-0.028
 adoption-0.027
 hurting-0.027
Token slide
Token ID10649
Feature activation+2.98e-02

Pos logit contributions
ull+0.032
 shall+0.028
 Jane+0.026
 finally+0.026
'll+0.025
Neg logit contributions
assing-0.043
obe-0.042
ging-0.041
 adoption-0.040
ilda-0.039
Token,
Token ID11
Feature activation+1.64e-02

Pos logit contributions
ull+0.046
 shall+0.034
 wonder+0.034
 marsh+0.034
 stirred+0.033
Neg logit contributions
assing-0.053
obe-0.052
ging-0.048
ilda-0.047
 adoption-0.046
Token Mom
Token ID11254
Feature activation+2.46e-02

Pos logit contributions
ull+0.020
 snowball+0.014
 marsh+0.014
 cooled+0.013
 measured+0.013
Neg logit contributions
assing-0.030
 ME-0.029
 adoption-0.029
 yourself-0.028
ige-0.028
Token?"
Token ID1701
Feature activation+1.28e-02

Pos logit contributions
ull+0.029
 shall+0.023
 Jane+0.021
 finally+0.020
'll+0.020
Neg logit contributions
assing-0.053
obe-0.051
ging-0.050
 adoption-0.049
ilda-0.048
Token Jill
Token ID22772
Feature activation-8.79e-04

Pos logit contributions
ull+0.015
 marsh+0.013
 snowball+0.013
'll+0.013
 shall+0.012
Neg logit contributions
obe-0.024
ging-0.023
ilda-0.023
assing-0.023
 ME-0.023
Token asks
Token ID7893
Feature activation-9.59e-03

Pos logit contributions
assing+0.002
obe+0.002
ging+0.002
 adoption+0.002
ilda+0.002
Neg logit contributions
ull-0.001
 shall-0.001
 Jane-0.001
 finally-0.001
'll-0.001
Token But
Token ID887
Feature activation-7.07e-03

Pos logit contributions
assing+0.010
obe+0.009
 adoption+0.009
ging+0.009
ilda+0.009
Neg logit contributions
ull-0.007
 shall-0.006
'll-0.005
 stirred-0.005
 wonder-0.005
Token Wh
Token ID854
Feature activation+9.11e-04

Pos logit contributions
assing+0.014
obe+0.014
ging+0.013
agging+0.013
 adoption+0.013
Neg logit contributions
ull-0.009
 shall-0.007
 finally-0.007
 Jane-0.007
 Mom-0.007
Tokenisk
Token ID1984
Feature activation+7.17e-03

Pos logit contributions
ull+0.002
 shall+0.002
'll+0.002
 Jane+0.002
 finally+0.001
Neg logit contributions
ging-0.001
assing-0.001
 masse-0.001
ting-0.001
 supervision-0.001
Tokeners
Token ID364
Feature activation+1.20e-02

Pos logit contributions
ull+0.014
 shall+0.012
 Jane+0.012
 finally+0.012
'll+0.012
Neg logit contributions
assing-0.010
obe-0.009
ging-0.009
 adoption-0.009
ilda-0.008
Token got
Token ID1392
Feature activation+1.73e-02

Pos logit contributions
ull+0.013
 shall+0.009
 Jane+0.009
'll+0.008
 marsh+0.008
Neg logit contributions
assing-0.027
ging-0.026
obe-0.026
 hurting-0.025
 adoption-0.025
Token all
Token ID477
Feature activation+2.88e-02

Pos logit contributions
ull+0.030
 shall+0.025
 Jane+0.023
'll+0.023
 marsh+0.023
Neg logit contributions
assing-0.028
ging-0.027
 hurting-0.027
obe-0.026
eking-0.025
Token tangled
Token ID41431
Feature activation-9.10e-04

Pos logit contributions
ull+0.043
 shall+0.036
'll+0.032
 finally+0.031
 Jane+0.031
Neg logit contributions
assing-0.051
obe-0.051
ging-0.050
 hurting-0.049
 adoption-0.047
Token up
Token ID510
Feature activation+8.51e-03

Pos logit contributions
assing+0.002
obe+0.002
 adoption+0.002
iva+0.002
agging+0.002
Neg logit contributions
ull-0.001
 finally-0.001
 happily-0.001
 Jane-0.001
 shall-0.001
Token in
Token ID287
Feature activation+1.58e-02

Pos logit contributions
ull+0.011
 shall+0.009
 Jane+0.009
 finally+0.009
 Mom+0.008
Neg logit contributions
assing-0.017
obe-0.017
ging-0.016
 adoption-0.016
ilda-0.016
Token the
Token ID262
Feature activation-9.02e-03

Pos logit contributions
ull+0.021
 shall+0.017
 Jane+0.016
 finally+0.016
'll+0.016
Neg logit contributions
assing-0.031
obe-0.030
ging-0.030
 adoption-0.029
ilda-0.028
Token string
Token ID4731
Feature activation+1.34e-02

Pos logit contributions
assing+0.019
obe+0.019
ging+0.018
ilda+0.018
eking+0.017
Neg logit contributions
ull-0.011
 shall-0.010
 marsh-0.009
 snowball-0.009
 stirred-0.009
Token but
Token ID475
Feature activation+8.75e-04

Pos logit contributions
ull+0.006
 shall+0.005
 Jane+0.005
 finally+0.005
'll+0.004
Neg logit contributions
assing-0.008
obe-0.008
ging-0.008
 adoption-0.007
agging-0.007
Token Lily
Token ID20037
Feature activation+8.92e-03

Pos logit contributions
ull+0.001
 shall+0.001
'll+0.001
 Jane+0.001
 finally+0.001
Neg logit contributions
assing-0.002
obe-0.002
ging-0.002
 adoption-0.002
ilda-0.002
Token didn
Token ID1422
Feature activation+1.87e-02

Pos logit contributions
ull+0.011
 shall+0.008
 Jane+0.007
 marsh+0.007
 snowball+0.007
Neg logit contributions
assing-0.019
obe-0.019
ilda-0.018
ging-0.018
 adoption-0.018
Token't
Token ID470
Feature activation-3.23e-03

Pos logit contributions
ull+0.024
 shall+0.020
 finally+0.018
 Jane+0.018
'll+0.018
Neg logit contributions
assing-0.038
obe-0.036
ging-0.036
 adoption-0.035
agging-0.034
Token want
Token ID765
Feature activation+1.73e-03

Pos logit contributions
assing+0.006
ging+0.006
obe+0.006
 adoption+0.006
eking+0.006
Neg logit contributions
ull-0.004
 shall-0.004
 Jane-0.003
 finally-0.003
'll-0.003
Token to
Token ID284
Feature activation+1.50e-02

Pos logit contributions
ull+0.002
 shall+0.002
'll+0.002
 stirred+0.002
 Jane+0.001
Neg logit contributions
assing-0.003
obe-0.003
 adoption-0.003
ilda-0.003
 hurting-0.003
Token.
Token ID13
Feature activation+5.45e-03

Pos logit contributions
ull+0.016
 shall+0.015
 Mom+0.014
 finally+0.014
 Jane+0.014
Neg logit contributions
assing-0.032
ging-0.031
obe-0.030
 adoption-0.030
eking-0.029
Token
Token ID220
Feature activation+8.78e-03

Pos logit contributions
ull+0.009
 shall+0.006
 marsh+0.006
 snowball+0.006
 stirred+0.006
Neg logit contributions
obe-0.010
assing-0.009
ilda-0.009
 adoption-0.009
annah-0.009
Token
Token ID198
Feature activation+7.21e-03

Pos logit contributions
ull+0.010
 stirred+0.008
 marsh+0.008
 volcano+0.008
 snowball+0.007
Neg logit contributions
ilda-0.018
assing-0.017
obe-0.017
ging-0.016
iva-0.016
Token
Token ID198
Feature activation+7.12e-03

Pos logit contributions
ull+0.009
 stirred+0.008
 lava+0.007
 marsh+0.007
 shall+0.007
Neg logit contributions
ilda-0.014
obe-0.014
assing-0.013
annah-0.013
 adoption-0.013
TokenHer
Token ID9360
Feature activation+9.79e-03

Pos logit contributions
ull+0.010
 shall+0.009
 +0.009
 finally+0.009
 Mom+0.009
Neg logit contributions
assing-0.013
ging-0.012
obe-0.012
eking-0.012
 supervision-0.012
Token to
Token ID284
Feature activation-4.77e-03

Pos logit contributions
ull+0.017
 shall+0.014
 Jane+0.012
 finally+0.012
'll+0.012
Neg logit contributions
assing-0.032
obe-0.031
ging-0.030
 adoption-0.029
ilda-0.029
Token each
Token ID1123
Feature activation+8.89e-03

Pos logit contributions
assing+0.010
obe+0.009
ging+0.009
 adoption+0.009
ilda+0.009
Neg logit contributions
ull-0.006
 shall-0.005
 Jane-0.005
'll-0.005
 finally-0.005
Token other
Token ID584
Feature activation+2.65e-03

Pos logit contributions
ull+0.014
 shall+0.012
 marsh+0.012
 stirred+0.012
'll+0.012
Neg logit contributions
assing-0.014
 adoption-0.014
obe-0.013
ging-0.013
ilda-0.013
Token.
Token ID13
Feature activation+7.92e-03

Pos logit contributions
ull+0.003
 shall+0.002
 Jane+0.002
 Mom+0.002
 +0.002
Neg logit contributions
assing-0.006
obe-0.006
agging-0.005
ging-0.005
eking-0.005
Token They
Token ID1119
Feature activation+1.09e-03

Pos logit contributions
ull+0.010
 shall+0.007
 stirred+0.006
'll+0.006
 finally+0.006
Neg logit contributions
assing-0.017
obe-0.016
 adoption-0.016
ging-0.015
hy-0.015
Token would
Token ID561
Feature activation+3.83e-03

Pos logit contributions
ull+0.002
 shall+0.001
 snowball+0.001
 marsh+0.001
ores+0.001
Neg logit contributions
assing-0.002
obe-0.002
 adoption-0.002
hy-0.002
ging-0.002
Token try
Token ID1949
Feature activation+7.99e-03

Pos logit contributions
ull+0.006
 shall+0.005
 Jane+0.005
'll+0.004
 finally+0.004
Neg logit contributions
assing-0.007
obe-0.007
ging-0.006
 adoption-0.006
ilda-0.006
Token to
Token ID284
Feature activation+7.85e-04

Pos logit contributions
ull+0.010
 shall+0.008
 Jane+0.007
'll+0.007
 finally+0.007
Neg logit contributions
assing-0.016
obe-0.016
ging-0.016
 adoption-0.015
ilda-0.015
Token guess
Token ID4724
Feature activation+2.39e-03

Pos logit contributions
ull+0.001
 shall+0.001
 Jane+0.001
 finally+0.001
'll+0.001
Neg logit contributions
assing-0.001
obe-0.001
ging-0.001
 adoption-0.001
ilda-0.001
Token what
Token ID644
Feature activation-8.13e-03

Pos logit contributions
ull+0.004
 shall+0.003
 Jane+0.003
 finally+0.003
'll+0.003
Neg logit contributions
assing-0.004
obe-0.004
ging-0.004
 adoption-0.004
ilda-0.004
Token it
Token ID340
Feature activation+5.99e-03

Pos logit contributions
assing+0.016
 adoption+0.015
obe+0.015
 hurting+0.015
ilda+0.014
Neg logit contributions
ull-0.010
 shall-0.007
 marsh-0.007
 finally-0.007
 snowball-0.007
Token at
Token ID379
Feature activation+1.21e-02

Pos logit contributions
ull+0.020
 shall+0.017
 Jane+0.016
 finally+0.016
'll+0.016
Neg logit contributions
assing-0.016
obe-0.016
ging-0.016
agging-0.015
 hurting-0.015
Token the
Token ID262
Feature activation-3.43e-03

Pos logit contributions
ull+0.016
 shall+0.014
'll+0.013
 stirred+0.012
 finally+0.012
Neg logit contributions
assing-0.023
obe-0.022
ging-0.022
 adoption-0.021
 hurting-0.021
Token truck
Token ID7779
Feature activation-2.59e-03

Pos logit contributions
assing+0.006
obe+0.006
ging+0.006
ilda+0.006
eking+0.006
Neg logit contributions
ull-0.005
 shall-0.004
 Mom-0.004
 finally-0.004
'll-0.004
Token,
Token ID11
Feature activation-5.62e-03

Pos logit contributions
assing+0.006
obe+0.005
ging+0.005
 adoption+0.005
ilda+0.005
Neg logit contributions
ull-0.003
 shall-0.002
 finally-0.002
 Jane-0.002
'll-0.002
Token which
Token ID543
Feature activation+8.87e-03

Pos logit contributions
assing+0.010
obe+0.010
ging+0.010
 hurting+0.009
 adoption+0.009
Neg logit contributions
ull-0.008
 shall-0.007
'll-0.007
 Jane-0.006
 marsh-0.006
Token had
Token ID550
Feature activation+5.87e-03

Pos logit contributions
ull+0.008
 shall+0.006
 Jane+0.005
'll+0.005
 finally+0.005
Neg logit contributions
assing-0.020
obe-0.020
 hurting-0.019
ging-0.019
 adoption-0.019
Token stopped
Token ID5025
Feature activation+1.46e-02

Pos logit contributions
ull+0.008
 shall+0.007
 Jane+0.006
'll+0.006
 +0.006
Neg logit contributions
assing-0.011
obe-0.011
 adoption-0.010
ging-0.010
 hurting-0.010
Token and
Token ID290
Feature activation+1.14e-02

Pos logit contributions
ull+0.021
 shall+0.016
 Mom+0.016
 finally+0.016
 Jane+0.015
Neg logit contributions
assing-0.029
obe-0.027
 adoption-0.027
ilda-0.027
iva-0.027
Token was
Token ID373
Feature activation+9.92e-03

Pos logit contributions
ull+0.015
 shall+0.012
'll+0.011
 Jane+0.011
 snowball+0.010
Neg logit contributions
assing-0.023
obe-0.022
ging-0.021
 hurting-0.021
 adoption-0.021
Token backing
Token ID12285
Feature activation-1.34e-03

Pos logit contributions
ull+0.016
 shall+0.014
'll+0.013
 Jane+0.013
 +0.012
Neg logit contributions
assing-0.016
ging-0.015
obe-0.015
 hurting-0.015
eking-0.015
Token up
Token ID510
Feature activation+9.00e-04

Pos logit contributions
assing+0.003
obe+0.003
ilda+0.002
 adoption+0.002
annah+0.002
Neg logit contributions
ull-0.002
 Jane-0.001
 finally-0.001
 shall-0.001
'll-0.001
Token sound
Token ID2128
Feature activation+5.12e-03

Pos logit contributions
assing+0.021
obe+0.020
ging+0.019
ilda+0.019
agging+0.018
Neg logit contributions
ull-0.014
 shall-0.012
 finally-0.011
 Mom-0.011
 Jane-0.011
Token of
Token ID286
Feature activation+5.69e-03

Pos logit contributions
ull+0.005
 shall+0.004
'll+0.004
 +0.003
 Jane+0.003
Neg logit contributions
ging-0.011
assing-0.011
obe-0.011
 hurting-0.011
 adoption-0.011
Token a
Token ID257
Feature activation+2.60e-03

Pos logit contributions
ull+0.007
 shall+0.006
 Jane+0.006
'll+0.006
 finally+0.006
Neg logit contributions
assing-0.011
obe-0.011
ging-0.011
 adoption-0.010
ilda-0.010
Token buzzing
Token ID41719
Feature activation+5.89e-03

Pos logit contributions
ull+0.004
 shall+0.003
 Jane+0.003
'll+0.003
 stirred+0.003
Neg logit contributions
assing-0.005
ging-0.005
obe-0.004
 adoption-0.004
 hurting-0.004
Token fly
Token ID6129
Feature activation+7.09e-04

Pos logit contributions
ull+0.009
 Jane+0.007
 shall+0.007
 finally+0.007
 stirred+0.007
Neg logit contributions
assing-0.012
obe-0.011
ilda-0.010
ging-0.010
 supervision-0.010
Token woke
Token ID19092
Feature activation+5.18e-04

Pos logit contributions
ull+0.001
 shall+0.001
 finally+0.001
 Jane+0.001
'll+0.001
Neg logit contributions
assing-0.001
obe-0.001
 adoption-0.001
ging-0.001
ilda-0.001
Token Peter
Token ID5613
Feature activation+1.09e-03

Pos logit contributions
ull+0.001
 finally+0.001
 Jane+0.001
 Mom+0.000
 shall+0.000
Neg logit contributions
assing-0.001
obe-0.001
 supervision-0.001
 adoption-0.001
ging-0.001
Token up
Token ID510
Feature activation-4.05e-03

Pos logit contributions
ull+0.001
 finally+0.001
 shall+0.001
 happily+0.001
 +0.001
Neg logit contributions
assing-0.002
obe-0.002
agging-0.002
 adoption-0.002
 yourselves-0.002
Token.
Token ID13
Feature activation-6.58e-03

Pos logit contributions
assing+0.009
obe+0.008
ging+0.008
 adoption+0.008
ilda+0.008
Neg logit contributions
ull-0.005
 shall-0.004
 finally-0.004
 Jane-0.004
 -0.004
Token He
Token ID679
Feature activation-3.77e-04

Pos logit contributions
assing+0.011
 adoption+0.011
ilda+0.011
obe+0.011
iva+0.011
Neg logit contributions
ull-0.009
 shall-0.007
 cooled-0.007
 stirred-0.007
 snowball-0.007
Token took
Token ID1718
Feature activation-5.03e-03

Pos logit contributions
assing+0.001
 adoption+0.001
ilda+0.001
obe+0.001
ging+0.001
Neg logit contributions
ull-0.000
 shall-0.000
 Jane-0.000
ores-0.000
 marsh-0.000
Token was
Token ID373
Feature activation+2.58e-03

Pos logit contributions
obe+0.010
assing+0.010
 hurting+0.009
ilda+0.009
pper+0.009
Neg logit contributions
ull-0.009
 shall-0.007
 marsh-0.007
ores-0.007
 volcan-0.007
Token glad
Token ID9675
Feature activation+6.83e-03

Pos logit contributions
ull+0.004
 shall+0.003
'll+0.003
 marsh+0.003
 Jane+0.003
Neg logit contributions
assing-0.005
obe-0.005
ging-0.005
 hurting-0.005
 adoption-0.004
Token that
Token ID326
Feature activation-1.31e-03

Pos logit contributions
ull+0.010
 Jane+0.009
 shall+0.009
 finally+0.008
 Mom+0.008
Neg logit contributions
assing-0.013
ging-0.012
obe-0.012
 adoption-0.011
agging-0.011
Token Max
Token ID5436
Feature activation-1.09e-02

Pos logit contributions
assing+0.002
obe+0.002
ging+0.002
agging+0.002
eking+0.002
Neg logit contributions
ull-0.002
 shall-0.002
 finally-0.002
 Jane-0.002
 Mom-0.002
Token let
Token ID1309
Feature activation-5.55e-03

Pos logit contributions
assing+0.027
obe+0.026
ging+0.026
 adoption+0.025
agging+0.025
Neg logit contributions
ull-0.009
 shall-0.007
 finally-0.006
'll-0.006
 Jane-0.006
Token her
Token ID607
Feature activation+1.59e-02

Pos logit contributions
assing+0.010
ging+0.009
obe+0.009
eking+0.009
agging+0.009
Neg logit contributions
ull-0.009
 shall-0.008
 finally-0.007
 Jane-0.007
'll-0.007
Token play
Token ID711
Feature activation-1.02e-03

Pos logit contributions
ull+0.025
 marsh+0.020
 shall+0.020
 Jane+0.019
'll+0.019
Neg logit contributions
obe-0.026
assing-0.026
 hurting-0.024
 adoption-0.024
ging-0.024
Token with
Token ID351
Feature activation-6.67e-03

Pos logit contributions
assing+0.002
obe+0.002
 adoption+0.002
ging+0.002
agging+0.002
Neg logit contributions
ull-0.002
 shall-0.001
'll-0.001
 marsh-0.001
 stirred-0.001
Token it
Token ID340
Feature activation-5.32e-03

Pos logit contributions
assing+0.012
obe+0.011
ging+0.011
 adoption+0.011
ilda+0.011
Neg logit contributions
ull-0.011
 shall-0.009
 Jane-0.008
'll-0.008
 finally-0.008
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
assing+0.011
ging+0.011
obe+0.010
 adoption+0.010
eking+0.010
Neg logit contributions
ull-0.007
 shall-0.006
 finally-0.006
 Jane-0.005
'll-0.005
Token After
Token ID2293
Feature activation-1.72e-02

Pos logit contributions
assing+0.024
obe+0.023
ging+0.022
 adoption+0.022
ilda+0.022
Neg logit contributions
ull-0.015
 shall-0.012
 Jane-0.011
 finally-0.011
'll-0.011
Token.
Token ID13
Feature activation-1.79e-03

Pos logit contributions
ull+0.002
 shall+0.001
 Jane+0.001
 finally+0.001
'll+0.001
Neg logit contributions
assing-0.002
obe-0.002
ging-0.002
 adoption-0.002
ilda-0.002
Token
Token ID198
Feature activation+6.51e-04

Pos logit contributions
assing+0.003
obe+0.003
annah+0.003
iva+0.003
hy+0.003
Neg logit contributions
ull-0.003
 shall-0.002
 stirred-0.002
 cooled-0.002
 volcan-0.002
Token
Token ID198
Feature activation-2.54e-03

Pos logit contributions
ull+0.001
 stirred+0.001
 lava+0.001
isel+0.001
 marsh+0.001
Neg logit contributions
ilda-0.001
iva-0.001
assing-0.001
obe-0.001
annah-0.001
TokenHe
Token ID1544
Feature activation+1.81e-03

Pos logit contributions
assing+0.005
obe+0.004
ging+0.004
 adoption+0.004
iva+0.004
Neg logit contributions
ull-0.004
 shall-0.003
 Jane-0.003
 stirred-0.003
 Mom-0.003
Token couldn
Token ID3521
Feature activation+3.95e-03

Pos logit contributions
ull+0.002
 shall+0.002
 Jane+0.002
ores+0.002
 marsh+0.002
Neg logit contributions
assing-0.003
 adoption-0.003
per-0.003
ilda-0.003
obe-0.003
Token't
Token ID470
Feature activation-1.12e-02

Pos logit contributions
ull+0.005
 shall+0.004
 finally+0.004
 Jane+0.004
'll+0.004
Neg logit contributions
assing-0.008
obe-0.008
ging-0.008
 adoption-0.007
ilda-0.007
Token believe
Token ID1975
Feature activation+1.69e-02

Pos logit contributions
assing+0.019
obe+0.018
ging+0.018
 adoption+0.017
ilda+0.017
Neg logit contributions
ull-0.018
 shall-0.016
 Jane-0.015
'll-0.014
 finally-0.014
Token his
Token ID465
Feature activation+9.60e-03

Pos logit contributions
ull+0.020
 shall+0.016
 Jane+0.015
 finally+0.014
'll+0.014
Neg logit contributions
assing-0.036
obe-0.035
ging-0.034
 adoption-0.034
ilda-0.033
Token eyes
Token ID2951
Feature activation-9.29e-03

Pos logit contributions
ull+0.014
 shall+0.012
 Mom+0.011
 Jane+0.011
'll+0.011
Neg logit contributions
assing-0.018
obe-0.017
ging-0.017
 adoption-0.016
ilda-0.016
Token when
Token ID618
Feature activation+3.22e-03

Pos logit contributions
assing+0.020
obe+0.019
ging+0.018
 adoption+0.018
agging+0.018
Neg logit contributions
ull-0.010
 shall-0.010
 finally-0.009
 stirred-0.008
 -0.008
Token he
Token ID339
Feature activation-1.57e-03

Pos logit contributions
ull+0.004
 shall+0.003
'll+0.003
 stirred+0.003
 Jane+0.003
Neg logit contributions
assing-0.006
ging-0.006
obe-0.006
 adoption-0.006
 hurting-0.006
Token One
Token ID1881
Feature activation+1.17e-02

Pos logit contributions
 adoption+0.007
ilda+0.007
iva+0.007
annah+0.007
obe+0.007
Neg logit contributions
ull-0.007
 snowball-0.005
 shall-0.005
 cooled-0.005
 measured-0.005
Token day
Token ID1110
Feature activation-4.27e-03

Pos logit contributions
ull+0.016
 shall+0.015
'll+0.015
 finally+0.014
 marsh+0.014
Neg logit contributions
obe-0.018
ging-0.018
 adoption-0.018
assing-0.017
 property-0.017
Token,
Token ID11
Feature activation+8.97e-04

Pos logit contributions
assing+0.008
ging+0.008
obe+0.008
 adoption+0.007
ilda+0.007
Neg logit contributions
ull-0.006
 shall-0.005
'll-0.005
 finally-0.004
 snowball-0.004
Token Annie
Token ID25860
Feature activation+1.81e-02

Pos logit contributions
ull+0.002
 shall+0.001
'll+0.001
 snowball+0.001
 finally+0.001
Neg logit contributions
ilda-0.001
ging-0.001
assing-0.001
 adoption-0.001
obe-0.001
Token was
Token ID373
Feature activation-6.45e-03

Pos logit contributions
ull+0.020
 shall+0.016
 Jane+0.015
 marsh+0.014
 snowball+0.014
Neg logit contributions
assing-0.038
obe-0.037
ging-0.036
 adoption-0.036
ilda-0.035
Token out
Token ID503
Feature activation-8.31e-03

Pos logit contributions
assing+0.010
obe+0.009
ging+0.009
 adoption+0.009
agging+0.009
Neg logit contributions
ull-0.012
 shall-0.010
 Jane-0.010
'll-0.010
 finally-0.009
Token walking
Token ID6155
Feature activation-1.17e-02

Pos logit contributions
assing+0.018
pper+0.017
obe+0.017
ilda+0.016
iva+0.016
Neg logit contributions
ull-0.010
 finally-0.009
 happily-0.009
 stirred-0.009
 excited-0.008
Token by
Token ID416
Feature activation-3.80e-03

Pos logit contributions
assing+0.024
obe+0.022
ilda+0.021
ging+0.021
 adoption+0.021
Neg logit contributions
ull-0.016
 Jane-0.013
 shall-0.013
 finally-0.013
 Mom-0.012
Token the
Token ID262
Feature activation+2.95e-03

Pos logit contributions
assing+0.008
obe+0.008
ging+0.007
agging+0.007
ilda+0.007
Neg logit contributions
ull-0.005
 shall-0.004
 Jane-0.004
 finally-0.004
 Mom-0.004
Token shore
Token ID15191
Feature activation-2.38e-03

Pos logit contributions
ull+0.004
 shall+0.003
 Jane+0.003
 finally+0.003
'll+0.003
Neg logit contributions
assing-0.006
obe-0.005
ging-0.005
 adoption-0.005
ilda-0.005
Token,
Token ID11
Feature activation-2.24e-03

Pos logit contributions
assing+0.004
obe+0.004
ging+0.004
 adoption+0.004
ilda+0.004
Neg logit contributions
ull-0.004
 shall-0.003
 Jane-0.003
 finally-0.003
'll-0.003
Token back
Token ID736
Feature activation+2.45e-03

Pos logit contributions
assing+0.004
ging+0.003
obe+0.003
 adoption+0.003
agging+0.003
Neg logit contributions
ull-0.004
 shall-0.003
 Jane-0.003
 stirred-0.003
 marsh-0.003
Token later
Token ID1568
Feature activation-1.56e-02

Pos logit contributions
ull+0.003
 shall+0.002
 stirred+0.002
 marsh+0.002
isel+0.002
Neg logit contributions
assing-0.005
obe-0.005
ging-0.005
 adoption-0.005
 yourself-0.005
Token."
Token ID526
Feature activation+1.62e-03

Pos logit contributions
assing+0.029
obe+0.028
ging+0.027
 adoption+0.026
ilda+0.026
Neg logit contributions
ull-0.023
 shall-0.019
 Jane-0.017
'll-0.017
 finally-0.017
Token
Token ID198
Feature activation+3.36e-04

Pos logit contributions
ull+0.002
 shall+0.001
 stirred+0.001
'll+0.001
 cooled+0.001
Neg logit contributions
assing-0.004
obe-0.004
ging-0.003
 adoption-0.003
ilda-0.003
Token
Token ID198
Feature activation-1.57e-03

Pos logit contributions
ull+0.000
 shall+0.000
 stirred+0.000
'll+0.000
 cooled+0.000
Neg logit contributions
assing-0.001
obe-0.001
ging-0.001
ilda-0.001
agging-0.001
Token"
Token ID1
Feature activation-6.45e-03

Pos logit contributions
assing+0.003
obe+0.003
ging+0.003
 adoption+0.003
ilda+0.003
Neg logit contributions
ull-0.002
 shall-0.002
 Jane-0.002
 finally-0.002
'll-0.002
TokenOkay
Token ID16454
Feature activation+3.66e-03

Pos logit contributions
assing+0.011
ging+0.010
obe+0.010
 adoption+0.010
iva+0.009
Neg logit contributions
ull-0.011
 shall-0.009
 Jane-0.009
 marsh-0.008
 Joe-0.008
Token,"
Token ID553
Feature activation-3.98e-03

Pos logit contributions
ull+0.005
 shall+0.004
 Jane+0.004
 finally+0.004
'll+0.004
Neg logit contributions
assing-0.007
obe-0.007
ging-0.007
 adoption-0.006
ilda-0.006
Token Leo
Token ID19632
Feature activation-9.58e-03

Pos logit contributions
assing+0.008
obe+0.008
ging+0.008
 adoption+0.008
ilda+0.007
Neg logit contributions
ull-0.005
 shall-0.004
 Jane-0.004
 finally-0.004
'll-0.004
Token said
Token ID531
Feature activation-1.74e-02

Pos logit contributions
assing+0.018
obe+0.018
ging+0.017
ilda+0.017
 adoption+0.017
Neg logit contributions
ull-0.013
 shall-0.011
'll-0.010
 marsh-0.009
 Jane-0.009
Token.
Token ID13
Feature activation-3.81e-04

Pos logit contributions
assing+0.028
obe+0.027
ging+0.027
 adoption+0.025
ilda+0.025
Neg logit contributions
ull-0.029
 shall-0.025
'll-0.023
 Jane-0.023
 finally-0.023
Token.
Token ID13
Feature activation-1.34e-02

Pos logit contributions
ull+0.010
 Jane+0.009
 shall+0.008
 finally+0.008
 Mom+0.008
Neg logit contributions
assing-0.018
obe-0.018
ging-0.017
 adoption-0.017
agging-0.017
Token She
Token ID1375
Feature activation-6.18e-03

Pos logit contributions
assing+0.025
ging+0.024
obe+0.023
eking+0.023
agging+0.023
Neg logit contributions
ull-0.019
 shall-0.017
 Jane-0.017
 -0.016
 finally-0.016
Token went
Token ID1816
Feature activation-1.18e-02

Pos logit contributions
assing+0.011
obe+0.011
ilda+0.010
 adoption+0.010
 hurting+0.010
Neg logit contributions
ull-0.009
 shall-0.007
 Jane-0.007
 volcano-0.007
 marsh-0.007
Token to
Token ID284
Feature activation-1.44e-03

Pos logit contributions
assing+0.025
obe+0.024
ging+0.024
 adoption+0.023
eking+0.022
Neg logit contributions
ull-0.014
 shall-0.011
 finally-0.011
 Jane-0.011
'll-0.010
Token bed
Token ID3996
Feature activation-2.16e-02

Pos logit contributions
assing+0.003
ging+0.003
obe+0.002
agging+0.002
eking+0.002
Neg logit contributions
ull-0.002
 shall-0.002
 Jane-0.002
 finally-0.002
 Mom-0.002
Token without
Token ID1231
Feature activation-2.04e-02

Pos logit contributions
assing+0.049
obe+0.045
ging+0.044
 supervision+0.044
 adoption+0.044
Neg logit contributions
ull-0.022
 finally-0.021
 happily-0.018
 Jane-0.018
 -0.018
Token delay
Token ID5711
Feature activation-1.59e-02

Pos logit contributions
assing+0.035
obe+0.033
ging+0.032
agging+0.032
 adoption+0.031
Neg logit contributions
ull-0.032
 finally-0.028
 Jane-0.028
 shall-0.027
 Mom-0.027
Token and
Token ID290
Feature activation-5.62e-03

Pos logit contributions
assing+0.031
obe+0.029
pper+0.029
iva+0.028
ging+0.028
Neg logit contributions
ull-0.019
 finally-0.018
 Jane-0.018
 Mom-0.017
 shall-0.017
Token closed
Token ID4838
Feature activation+2.24e-03

Pos logit contributions
assing+0.011
obe+0.011
ging+0.011
 adoption+0.010
ilda+0.010
Neg logit contributions
ull-0.007
 shall-0.006
 Jane-0.005
'll-0.005
 finally-0.005
Token her
Token ID607
Feature activation-2.12e-05

Pos logit contributions
ull+0.003
 Jane+0.003
 finally+0.003
 Mom+0.003
 shall+0.003
Neg logit contributions
assing-0.004
obe-0.004
 adoption-0.004
ging-0.004
pper-0.004
Token eyes
Token ID2951
Feature activation-1.49e-02

Pos logit contributions
assing+0.000
obe+0.000
ging+0.000
 adoption+0.000
ilda+0.000
Neg logit contributions
ull-0.000
 shall-0.000
 Mom-0.000
 finally-0.000
 Jane-0.000
Token
Token ID198
Feature activation+3.08e-03

Pos logit contributions
ull+0.009
 stirred+0.006
 shall+0.006
 cooled+0.006
 marsh+0.006
Neg logit contributions
obe-0.012
annah-0.012
ilda-0.011
assing-0.011
iva-0.011
Token
Token ID198
Feature activation+1.01e-03

Pos logit contributions
ull+0.004
 stirred+0.004
 lava+0.004
 cooled+0.004
 marsh+0.003
Neg logit contributions
ilda-0.005
obe-0.005
iva-0.005
assing-0.005
annah-0.005
TokenSo
Token ID2396
Feature activation-6.40e-03

Pos logit contributions
ull+0.001
 shall+0.001
 Jane+0.001
 finally+0.001
'll+0.001
Neg logit contributions
assing-0.002
obe-0.002
ging-0.002
 adoption-0.002
ilda-0.002
Token,
Token ID11
Feature activation+1.64e-03

Pos logit contributions
assing+0.012
obe+0.012
ging+0.012
 adoption+0.011
ilda+0.011
Neg logit contributions
ull-0.009
 shall-0.007
'll-0.007
 stirred-0.006
 Jane-0.006
Token he
Token ID339
Feature activation-1.05e-03

Pos logit contributions
ull+0.002
 shall+0.002
'll+0.002
 Jane+0.002
 stirred+0.002
Neg logit contributions
assing-0.003
obe-0.003
ging-0.003
 adoption-0.003
 hurting-0.003
Token designed
Token ID3562
Feature activation-1.76e-03

Pos logit contributions
obe+0.002
assing+0.002
ilda+0.002
per+0.002
ging+0.002
Neg logit contributions
ull-0.001
 Jane-0.001
 marsh-0.001
 shall-0.001
ores-0.001
Token and
Token ID290
Feature activation-1.81e-03

Pos logit contributions
assing+0.003
ging+0.003
obe+0.003
ilda+0.003
 adoption+0.003
Neg logit contributions
ull-0.002
 shall-0.002
'll-0.002
 Jane-0.002
 finally-0.002
Token built
Token ID3170
Feature activation-1.20e-02

Pos logit contributions
assing+0.003
obe+0.003
ging+0.003
ilda+0.003
 adoption+0.003
Neg logit contributions
ull-0.002
 shall-0.002
'll-0.002
 snowball-0.002
 Jane-0.002
Token a
Token ID257
Feature activation+5.38e-04

Pos logit contributions
assing+0.022
obe+0.021
ging+0.021
 adoption+0.020
ilda+0.020
Neg logit contributions
ull-0.017
 shall-0.014
'll-0.013
 Jane-0.013
 stirred-0.012
Token special
Token ID2041
Feature activation-2.30e-03

Pos logit contributions
ull+0.001
 shall+0.001
 Jane+0.001
 finally+0.001
'll+0.001
Neg logit contributions
assing-0.001
obe-0.001
 adoption-0.001
ging-0.001
ilda-0.001
Token machine
Token ID4572
Feature activation-6.64e-03

Pos logit contributions
assing+0.004
obe+0.004
ging+0.004
 adoption+0.004
ilda+0.003
Neg logit contributions
ull-0.004
 shall-0.003
 Jane-0.003
 finally-0.003
'll-0.003
Token regular
Token ID3218
Feature activation-1.87e-02

Pos logit contributions
assing+0.018
obe+0.017
 hurting+0.017
 adoption+0.017
ilda+0.017
Neg logit contributions
ull-0.018
 shall-0.015
 Jane-0.015
 stirred-0.014
 Joe-0.014
Token girl
Token ID2576
Feature activation+2.32e-03

Pos logit contributions
assing+0.036
obe+0.034
ging+0.034
 adoption+0.033
ilda+0.032
Neg logit contributions
ull-0.026
 shall-0.022
 finally-0.020
 Jane-0.020
'll-0.020
Token who
Token ID508
Feature activation-9.44e-03

Pos logit contributions
ull+0.003
 shall+0.003
 Jane+0.002
 finally+0.002
'll+0.002
Neg logit contributions
assing-0.005
obe-0.005
ging-0.004
ilda-0.004
eking-0.004
Token had
Token ID550
Feature activation+1.97e-03

Pos logit contributions
assing+0.022
obe+0.021
ging+0.021
 adoption+0.020
ilda+0.020
Neg logit contributions
ull-0.009
 shall-0.007
 Jane-0.006
 finally-0.006
'll-0.006
Token regular
Token ID3218
Feature activation-1.55e-02

Pos logit contributions
ull+0.003
 shall+0.002
 Jane+0.002
 finally+0.002
'll+0.002
Neg logit contributions
assing-0.004
obe-0.004
ging-0.004
 adoption-0.004
ilda-0.004
Token dreams
Token ID10625
Feature activation-3.04e-02

Pos logit contributions
assing+0.028
obe+0.027
ging+0.027
 adoption+0.025
ilda+0.025
Neg logit contributions
ull-0.023
 shall-0.020
 Jane-0.018
 finally-0.018
'll-0.018
Token.
Token ID13
Feature activation-6.85e-03

Pos logit contributions
assing+0.061
obe+0.058
ging+0.057
 adoption+0.055
ilda+0.054
Neg logit contributions
ull-0.040
 shall-0.034
 finally-0.033
 Jane-0.031
'll-0.031
Token And
Token ID843
Feature activation-6.29e-03

Pos logit contributions
assing+0.011
obe+0.011
 adoption+0.010
iva+0.010
ilda+0.010
Neg logit contributions
ull-0.011
 shall-0.009
 stirred-0.009
 snowball-0.009
 finally-0.009
Token she
Token ID673
Feature activation-1.21e-02

Pos logit contributions
assing+0.010
obe+0.010
ging+0.009
 adoption+0.009
ilda+0.009
Neg logit contributions
ull-0.011
 shall-0.009
 Jane-0.009
 finally-0.009
'll-0.009
Token knew
Token ID2993
Feature activation+2.52e-03

Pos logit contributions
assing+0.025
obe+0.025
 hurting+0.024
ilda+0.024
 adoption+0.024
Neg logit contributions
ull-0.014
 Jane-0.011
 shall-0.010
 marsh-0.009
 snowball-0.009
Token how
Token ID703
Feature activation-5.22e-03

Pos logit contributions
ull+0.004
 shall+0.004
 Jane+0.004
 finally+0.003
'll+0.003
Neg logit contributions
assing-0.004
obe-0.004
ging-0.004
 adoption-0.004
agging-0.004
Token,
Token ID11
Feature activation-7.68e-03

Pos logit contributions
assing+0.003
obe+0.003
ging+0.003
 adoption+0.003
ilda+0.003
Neg logit contributions
ull-0.002
 shall-0.002
 Jane-0.002
 finally-0.001
'll-0.001
Token who
Token ID508
Feature activation-7.83e-03

Pos logit contributions
assing+0.015
obe+0.014
ging+0.014
 adoption+0.013
ilda+0.013
Neg logit contributions
ull-0.011
 shall-0.009
'll-0.008
 Jane-0.008
 finally-0.008
Token gave
Token ID2921
Feature activation-9.13e-05

Pos logit contributions
assing+0.016
ilda+0.015
obe+0.015
ging+0.015
 adoption+0.015
Neg logit contributions
ull-0.009
 shall-0.007
 snowball-0.007
 marsh-0.007
'll-0.007
Token her
Token ID607
Feature activation+1.21e-02

Pos logit contributions
assing+0.000
ging+0.000
 adoption+0.000
ting+0.000
obe+0.000
Neg logit contributions
ull-0.000
 shall-0.000
 snowball-0.000
'll-0.000
 stirred-0.000
Token a
Token ID257
Feature activation+7.72e-04

Pos logit contributions
ull+0.018
 shall+0.015
 finally+0.014
 Jane+0.013
 stirred+0.013
Neg logit contributions
assing-0.021
 adoption-0.021
obe-0.020
ging-0.020
ilda-0.020
Token tour
Token ID4205
Feature activation-2.97e-02

Pos logit contributions
ull+0.001
 Jane+0.001
 finally+0.001
 shall+0.001
 stirred+0.001
Neg logit contributions
hy-0.001
assing-0.001
 adoption-0.001
ilda-0.001
ging-0.001
Token.
Token ID13
Feature activation-7.52e-03

Pos logit contributions
assing+0.069
obe+0.066
ging+0.065
eking+0.063
 adoption+0.062
Neg logit contributions
ull-0.031
 shall-0.026
 Jane-0.026
 finally-0.023
 Mom-0.023
Token Lily
Token ID20037
Feature activation-3.13e-03

Pos logit contributions
assing+0.014
ilda+0.014
obe+0.014
 adoption+0.014
annah+0.013
Neg logit contributions
ull-0.010
 shall-0.008
 snowball-0.007
'll-0.007
 stirred-0.007
Token had
Token ID550
Feature activation+1.08e-03

Pos logit contributions
assing+0.007
obe+0.007
ilda+0.007
 adoption+0.006
ging+0.006
Neg logit contributions
ull-0.003
 shall-0.003
 snowball-0.002
 Jane-0.002
 marsh-0.002
Token so
Token ID523
Feature activation-7.63e-03

Pos logit contributions
ull+0.001
 shall+0.001
 snowball+0.001
'll+0.001
 Jane+0.001
Neg logit contributions
assing-0.002
obe-0.002
 adoption-0.002
ilda-0.002
 hurting-0.002
Token much
Token ID881
Feature activation-6.50e-03

Pos logit contributions
hy+0.013
obe+0.012
 adoption+0.012
agging+0.012
ging+0.012
Neg logit contributions
ull-0.010
 shall-0.009
'll-0.009
 Fisher-0.008
 finally-0.008
Token to
Token ID284
Feature activation-2.19e-02

Pos logit contributions
ull+0.007
 shall+0.006
'll+0.005
 finally+0.005
 Jane+0.005
Neg logit contributions
assing-0.011
obe-0.011
 hurting-0.010
 adoption-0.010
ging-0.010
Token take
Token ID1011
Feature activation-8.27e-03

Pos logit contributions
assing+0.038
obe+0.037
ging+0.036
 adoption+0.035
ilda+0.034
Neg logit contributions
ull-0.034
 shall-0.029
 Jane-0.027
 finally-0.027
'll-0.027
Token her
Token ID607
Feature activation+8.37e-03

Pos logit contributions
assing+0.014
obe+0.013
ging+0.013
 adoption+0.013
ilda+0.012
Neg logit contributions
ull-0.014
 shall-0.011
'll-0.011
 Jane-0.010
 finally-0.010
Token long
Token ID890
Feature activation-1.33e-02

Pos logit contributions
ull+0.012
 shall+0.010
 finally+0.010
'll+0.010
 Jane+0.010
Neg logit contributions
assing-0.016
obe-0.015
ging-0.015
ilda-0.014
 adoption-0.014
Token black
Token ID2042
Feature activation-4.24e-03

Pos logit contributions
assing+0.025
obe+0.024
ging+0.024
ilda+0.023
 adoption+0.022
Neg logit contributions
ull-0.019
 shall-0.016
 finally-0.015
 marsh-0.015
'll-0.015
Token cape
Token ID36023
Feature activation-3.07e-02

Pos logit contributions
assing+0.008
ging+0.007
obe+0.007
ilda+0.007
eking+0.007
Neg logit contributions
ull-0.007
 shall-0.006
'll-0.005
 finally-0.005
 Jane-0.005
Token and
Token ID290
Feature activation-1.38e-04

Pos logit contributions
assing+0.066
obe+0.063
ging+0.062
 adoption+0.061
eking+0.059
Neg logit contributions
ull-0.036
 shall-0.029
 finally-0.027
'll-0.026
 Jane-0.026
Token cover
Token ID3002
Feature activation-3.14e-03

Pos logit contributions
assing+0.000
obe+0.000
 adoption+0.000
ging+0.000
 hurting+0.000
Neg logit contributions
ull-0.000
 shall-0.000
 Jane-0.000
'll-0.000
 Fisher-0.000
Token everything
Token ID2279
Feature activation-1.03e-02

Pos logit contributions
assing+0.004
ging+0.004
obe+0.004
ilda+0.004
 adoption+0.004
Neg logit contributions
ull-0.006
 shall-0.005
'll-0.005
 stirred-0.005
 finally-0.005
Token she
Token ID673
Feature activation-1.16e-02

Pos logit contributions
assing+0.021
obe+0.020
ging+0.019
 adoption+0.019
agging+0.019
Neg logit contributions
ull-0.013
 shall-0.012
 Jane-0.011
 finally-0.011
 Mom-0.010
Token saw
Token ID2497
Feature activation-8.41e-03

Pos logit contributions
assing+0.028
obe+0.027
ging+0.026
 adoption+0.026
ilda+0.026
Neg logit contributions
ull-0.011
 shall-0.008
 Jane-0.007
 marsh-0.006
 snowball-0.006
Token the
Token ID262
Feature activation-1.75e-02

Pos logit contributions
assing+0.025
obe+0.024
ging+0.023
 adoption+0.023
ilda+0.022
Neg logit contributions
ull-0.016
 shall-0.013
 Jane-0.012
'll-0.012
 finally-0.012
Token bear
Token ID6842
Feature activation-1.23e-02

Pos logit contributions
assing+0.030
obe+0.029
ilda+0.028
ging+0.028
hy+0.027
Neg logit contributions
ull-0.028
 finally-0.024
 snowball-0.024
 shall-0.023
 wonder-0.023
Token always
Token ID1464
Feature activation-1.64e-02

Pos logit contributions
assing+0.025
obe+0.025
ging+0.024
 adoption+0.023
ilda+0.023
Neg logit contributions
ull-0.015
 shall-0.012
 Jane-0.012
 marsh-0.011
 Mom-0.011
Token kept
Token ID4030
Feature activation-1.89e-02

Pos logit contributions
assing+0.035
obe+0.034
ging+0.033
 adoption+0.032
ilda+0.032
Neg logit contributions
ull-0.019
 shall-0.016
 Jane-0.014
 finally-0.014
'll-0.014
Token his
Token ID465
Feature activation-2.36e-04

Pos logit contributions
assing+0.039
obe+0.037
ging+0.037
 adoption+0.036
ilda+0.035
Neg logit contributions
ull-0.024
 shall-0.020
 Jane-0.018
'll-0.018
 finally-0.017
Token honey
Token ID12498
Feature activation-3.54e-02

Pos logit contributions
assing+0.000
obe+0.000
ging+0.000
 adoption+0.000
ilda+0.000
Neg logit contributions
ull-0.000
 shall-0.000
 Jane-0.000
 finally-0.000
'll-0.000
Token safe
Token ID3338
Feature activation-1.01e-02

Pos logit contributions
assing+0.072
obe+0.067
 adoption+0.064
ging+0.063
 ME+0.063
Neg logit contributions
ull-0.045
 finally-0.040
 shall-0.040
'll-0.037
 happily-0.037
Token in
Token ID287
Feature activation+6.75e-04

Pos logit contributions
assing+0.021
obe+0.020
ging+0.019
 adoption+0.019
ilda+0.019
Neg logit contributions
ull-0.012
 shall-0.010
 finally-0.010
 Jane-0.009
'll-0.009
Token his
Token ID465
Feature activation-6.72e-03

Pos logit contributions
ull+0.001
 shall+0.001
'll+0.001
 Jane+0.001
 stirred+0.001
Neg logit contributions
assing-0.001
obe-0.001
ging-0.001
 adoption-0.001
 hurting-0.001
Token arm
Token ID3211
Feature activation-2.41e-02

Pos logit contributions
assing+0.014
ging+0.014
obe+0.013
ilda+0.013
iva+0.013
Neg logit contributions
ull-0.008
 shall-0.007
 Mom-0.007
 finally-0.006
 wonder-0.006
Tokenchair
Token ID16337
Feature activation-1.13e-02

Pos logit contributions
assing+0.049
ging+0.046
obe+0.045
 adoption+0.044
eking+0.043
Neg logit contributions
ull-0.030
 shall-0.026
 finally-0.025
 Jane-0.025
 -0.024
Token come
Token ID1282
Feature activation-5.71e-03

Pos logit contributions
assing+0.001
obe+0.001
ging+0.001
 adoption+0.001
ilda+0.000
Neg logit contributions
ull-0.000
 shall-0.000
 Jane-0.000
 finally-0.000
'll-0.000
Token!
Token ID0
Feature activation+1.58e-03

Pos logit contributions
assing+0.011
obe+0.010
ging+0.010
ilda+0.010
 adoption+0.010
Neg logit contributions
ull-0.009
 shall-0.007
'll-0.007
 finally-0.007
 Jane-0.007
Token I
Token ID314
Feature activation-1.29e-03

Pos logit contributions
ull+0.003
 shall+0.002
 finally+0.002
 Jane+0.002
'll+0.002
Neg logit contributions
assing-0.003
obe-0.002
ging-0.002
 adoption-0.002
ilda-0.002
Token like
Token ID588
Feature activation+2.04e-03

Pos logit contributions
assing+0.002
obe+0.002
 adoption+0.002
hy+0.002
 hurting+0.002
Neg logit contributions
ull-0.002
 Jane-0.001
 marsh-0.001
 volcan-0.001
 shall-0.001
Token your
Token ID534
Feature activation+3.96e-03

Pos logit contributions
ull+0.004
 shall+0.003
'll+0.003
 finally+0.003
 Jane+0.003
Neg logit contributions
assing-0.003
obe-0.003
ging-0.003
 adoption-0.003
 hurting-0.003
Token castle
Token ID16669
Feature activation-3.08e-02

Pos logit contributions
ull+0.006
 shall+0.005
'll+0.005
 Jane+0.005
 finally+0.005
Neg logit contributions
assing-0.007
obe-0.006
eking-0.006
ging-0.006
agging-0.006
Token too
Token ID1165
Feature activation+9.55e-03

Pos logit contributions
assing+0.056
obe+0.054
ging+0.052
 adoption+0.051
ilda+0.050
Neg logit contributions
ull-0.046
 shall-0.039
 Jane-0.036
 finally-0.036
'll-0.035
Token!
Token ID0
Feature activation-8.73e-04

Pos logit contributions
ull+0.016
 shall+0.014
 Jane+0.013
'll+0.013
 finally+0.013
Neg logit contributions
assing-0.015
obe-0.015
ging-0.014
 hurting-0.014
agging-0.014
Token You
Token ID921
Feature activation-9.35e-03

Pos logit contributions
assing+0.001
obe+0.001
 adoption+0.001
ging+0.001
hy+0.001
Neg logit contributions
ull-0.002
 shall-0.001
 finally-0.001
'll-0.001
 stirred-0.001
Token are
Token ID389
Feature activation+4.53e-04

Pos logit contributions
assing+0.016
obe+0.015
 hurting+0.015
 adoption+0.014
hy+0.014
Neg logit contributions
ull-0.015
 Jane-0.012
 snowball-0.011
 marsh-0.011
 shall-0.011
Token a
Token ID257
Feature activation+4.69e-03

Pos logit contributions
ull+0.001
 shall+0.001
 finally+0.001
 Jane+0.001
 Mom+0.000
Neg logit contributions
assing-0.001
obe-0.001
ging-0.001
ilda-0.001
 adoption-0.001