TOP ACTIVATIONS
MAX = 0.061

her bag so she unf ast ened the bag and opened
. The wild er be ast had a worried look on
selfish . <|endoftext|> S eb ast ian was a wealthy man
Maria . She was three - years - old and loved
there were two friends , Martha and Sarah . They were
saw another wild er be ast on the shore . The
alone . Cr ab saw Sara and Ben .
They hear Spot 's sn ores from the living room .
Lily replied , " To ast with jam ! It was
room now ." She takes Anna by the hand and leads
. I am a lamp - head !" Tom
and embarrassed . She took Anna and Ben by the hand
there were two friends , Jean and Jane . They always
and angry . She took Anna to the doctor right away
<|endoftext|> Tim was a three - year - old boy .
was a girl named K ait lyn . She was three
Maria . She was three - years - old and she
. He was a snow - st o pper . His
Lucy . She was three - years - old and loved
The wild wild er be ast had to think for a

MIN ACTIVATIONS
MIN = -0.067

how much I love my beans ? They are my only
" What are peanuts ?" Lily asked .
questioned . " Why do turtles have shells ?" Tom questioned
, " What kind of peanuts are these ?"
Why do you look so angry ?" "
" What makes pe b bles special ?" His mother replied
. " What are you burying ?" asked the bird .
. " Are those delicious berries for me ?"
to worry . What should he do ? He was scared
, " Is this your pants ?" The little
what we can learn about crabs ?" her mom replied .
you know what colour my castle is ?" El iza
. Her mom explained how crabs have hard shells to protect
Can I play with my spear ? â she asked
are you doing with my beans ?" It was Mrs .
you ruin your neck l aces and brace lets ?" Mom
I have some of your carrots ?" The rabbit
" Can we pick some beans ?" Ben says
questioned . " Why do frogs have legs ?" Lily questioned
Can I play in your swamp ?" Fred thought about it

INTERVAL 0.029 - 0.061
CONTAINS 0.309%

can be fun and that making friends can make you forget
perfect spot and she would come back every day to read
was a little girl named Mary . She loved playing in
gives him a new cr ay on and paper . Tom
. She saw the sun come up and it was very

INTERVAL -0.003 - 0.029
CONTAINS 62.172%

came and gave her a big , compassionate hug . She
and stronger . The fish was so happy ! He brought
the water . It had a flag with unknown colors and
don 't care . I like my red costume . I
the word . She tried to guess , but she was

INTERVAL -0.035 - -0.003
CONTAINS 37.419%

was right . Then , Lily saw a sweet
She loved Tom and wanted to be with him . They
One day , she met a wise old man who told
But Lucy was brave , so she stepped a little
money ?" Tom asked . " Sorry , sweet

INTERVAL -0.067 - -0.035
CONTAINS 0.100%

He asked the bird where he lived and the bird told
. " Can I touch it ?" she asked
can I do with this avocado ? â
. Can you find other triangles in the room ?"
had lots of different mar bles and he loved collecting them
Token her
Token ID607
Feature activation-1.40e-02

Pos logit contributions
ully+0.007
ined+0.006
 idea+0.006
agons+0.006
 accomplish+0.006
Neg logit contributions
 Anna-0.005
 Ann-0.004
 rats-0.004
 Megan-0.004
Anna-0.004
Token bag
Token ID6131
Feature activation-3.27e-02

Pos logit contributions
ined+0.025
ully+0.025
 idea+0.024
 accomplish+0.023
ble+0.023
Neg logit contributions
 Anna-0.018
 rats-0.015
 mats-0.014
Anna-0.013
 beds-0.013
Token so
Token ID523
Feature activation-1.32e-02

Pos logit contributions
agons+0.067
ully+0.065
opl+0.065
irl+0.064
ined+0.064
Neg logit contributions
 Anna-0.033
 to-0.024
 next-0.023
's-0.022
 Ann-0.021
Token she
Token ID673
Feature activation-9.76e-03

Pos logit contributions
ully+0.023
 idea+0.023
ined+0.022
 accomplish+0.021
ble+0.020
Neg logit contributions
 Anna-0.018
 Ann-0.015
 rats-0.015
Anna-0.014
 Megan-0.014
Token unf
Token ID3684
Feature activation+5.96e-03

Pos logit contributions
opl+0.018
 idea+0.016
ully+0.016
ined+0.016
 accomplish+0.016
Neg logit contributions
 Anna-0.014
Anna-0.011
 Ann-0.011
 Ana-0.010
 rats-0.010
Tokenast
Token ID459
Feature activation+6.12e-02

Pos logit contributions
 Anna+0.009
Anna+0.007
 Ann+0.007
 rats+0.007
 Megan+0.007
Neg logit contributions
 idea-0.010
ully-0.009
ined-0.009
 accomplish-0.008
ble-0.008
Tokenened
Token ID2945
Feature activation+4.24e-03

Pos logit contributions
 Anna+0.082
 Ann+0.062
Anna+0.059
 Pen+0.058
 rats+0.058
Neg logit contributions
 idea-0.108
ully-0.107
ined-0.107
 accomplish-0.103
 hobby-0.101
Token the
Token ID262
Feature activation-2.27e-03

Pos logit contributions
 Anna+0.006
 rats+0.006
Anna+0.005
 Ann+0.005
 mats+0.005
Neg logit contributions
 idea-0.006
ully-0.006
ined-0.006
ble-0.006
 accomplish-0.006
Token bag
Token ID6131
Feature activation-3.27e-03

Pos logit contributions
 idea+0.004
ully+0.003
ined+0.003
ble+0.003
agons+0.003
Neg logit contributions
 Anna-0.003
 Ann-0.003
 Megan-0.003
 Ana-0.003
 rats-0.003
Token and
Token ID290
Feature activation-9.14e-03

Pos logit contributions
ully+0.005
ined+0.005
 idea+0.005
agons+0.005
 accomplish+0.005
Neg logit contributions
 Anna-0.005
 Ann-0.004
Anna-0.004
 rats-0.004
 mats-0.004
Token opened
Token ID4721
Feature activation-3.14e-03

Pos logit contributions
ully+0.014
ined+0.014
 idea+0.013
ble+0.013
agons+0.012
Neg logit contributions
 Anna-0.015
 Ann-0.012
 rats-0.012
 mats-0.012
Anna-0.011
Token.
Token ID13
Feature activation-9.44e-04

Pos logit contributions
 idea+0.016
ully+0.016
ined+0.016
 accomplish+0.015
inkle+0.015
Neg logit contributions
 Anna-0.017
 rats-0.015
 Ann-0.015
 Megan-0.014
 mats-0.014
Token The
Token ID383
Feature activation-7.37e-03

Pos logit contributions
 idea+0.002
ined+0.002
ully+0.002
inkle+0.002
agons+0.002
Neg logit contributions
 Anna-0.001
 Ann-0.001
 Megan-0.001
 Jacob-0.001
 Pen-0.001
Token wild
Token ID4295
Feature activation-7.07e-03

Pos logit contributions
ully+0.012
ined+0.012
 idea+0.011
 accomplish+0.011
?'+0.011
Neg logit contributions
 Anna-0.010
 rats-0.009
Anna-0.008
 mats-0.008
 Pen-0.008
Tokener
Token ID263
Feature activation+4.38e-03

Pos logit contributions
 idea+0.010
ully+0.010
ined+0.010
 accomplish+0.009
ble+0.009
Neg logit contributions
 Anna-0.011
 rats-0.010
Anna-0.009
 Ann-0.009
 mats-0.009
Tokenbe
Token ID1350
Feature activation-5.90e-03

Pos logit contributions
 Anna+0.006
 Ann+0.005
 Megan+0.005
Anna+0.005
 Jacob+0.005
Neg logit contributions
 idea-0.007
agons-0.007
 accomplish-0.007
ully-0.006
 ease-0.006
Tokenast
Token ID459
Feature activation+5.68e-02

Pos logit contributions
 idea+0.008
ully+0.008
 accomplish+0.008
 shar+0.008
ined+0.008
Neg logit contributions
 Anna-0.009
 rats-0.008
Anna-0.008
 Ann-0.008
pter-0.007
Token had
Token ID550
Feature activation+5.80e-03

Pos logit contributions
 Anna+0.087
Anna+0.072
 Ann+0.070
 rats+0.070
 Pen+0.069
Neg logit contributions
 idea-0.089
ully-0.085
 hobby-0.080
ined-0.079
 accomplish-0.078
Token a
Token ID257
Feature activation-5.65e-03

Pos logit contributions
 Anna+0.008
 rats+0.007
 Ann+0.007
Anna+0.007
 mats+0.007
Neg logit contributions
 idea-0.009
ully-0.009
ined-0.009
 accomplish-0.008
ble-0.008
Token worried
Token ID7960
Feature activation-1.04e-02

Pos logit contributions
ully+0.010
ined+0.010
 accomplish+0.009
opl+0.009
agons+0.009
Neg logit contributions
 Anna-0.007
 rats-0.006
Anna-0.006
 Ann-0.006
 Ana-0.005
Token look
Token ID804
Feature activation-8.99e-03

Pos logit contributions
ully+0.017
ined+0.017
 accomplish+0.017
agons+0.016
 ease+0.016
Neg logit contributions
 Anna-0.015
 Ann-0.012
 rats-0.012
 Megan-0.011
 Pen-0.011
Token on
Token ID319
Feature activation-1.24e-02

Pos logit contributions
ully+0.018
 idea+0.017
ined+0.016
 accomplish+0.016
agons+0.016
Neg logit contributions
 Anna-0.011
 rats-0.008
 Ann-0.008
 downstairs-0.007
Anna-0.007
Token selfish
Token ID20363
Feature activation-1.57e-02

Pos logit contributions
ully+0.001
ined+0.001
 idea+0.001
inkle+0.001
agons+0.001
Neg logit contributions
 Anna-0.000
 rats-0.000
 Ann-0.000
Anna-0.000
 Pen-0.000
Token.
Token ID13
Feature activation-5.09e-03

Pos logit contributions
ully+0.026
ined+0.025
inkle+0.024
 idea+0.024
 accomplish+0.023
Neg logit contributions
 Anna-0.024
 rats-0.020
 mats-0.018
 Ann-0.018
Anna-0.018
Token<|endoftext|>
Token ID50256
Feature activation+4.26e-04

Pos logit contributions
 idea+0.008
ined+0.008
ully+0.008
ble+0.007
inkle+0.007
Neg logit contributions
 Anna-0.008
 Ann-0.006
Anna-0.006
 Jacob-0.006
 Megan-0.006
TokenS
Token ID50
Feature activation+2.04e-02

Pos logit contributions
 Anna+0.001
Anna+0.001
 Ann+0.001
 Pen+0.000
 Megan+0.000
Neg logit contributions
 idea-0.001
ully-0.001
ined-0.001
 accomplish-0.001
 shar-0.001
Tokeneb
Token ID1765
Feature activation+1.87e-02

Pos logit contributions
 Anna+0.032
Anna+0.028
 Ann+0.027
 rats+0.027
 Megan+0.026
Neg logit contributions
 idea-0.030
ined-0.028
ully-0.028
ble-0.027
 accomplish-0.026
Tokenast
Token ID459
Feature activation+5.56e-02

Pos logit contributions
 Anna+0.024
Anna+0.020
 Ann+0.019
 Megan+0.018
pter+0.018
Neg logit contributions
 idea-0.033
ined-0.030
 gol-0.030
ully-0.029
agons-0.029
Tokenian
Token ID666
Feature activation+1.79e-03

Pos logit contributions
 Anna+0.086
Anna+0.071
 Megan+0.069
 Jacob+0.068
 Ann+0.068
Neg logit contributions
 idea-0.085
ined-0.079
ully-0.077
 accomplish-0.075
 hobby-0.074
Token was
Token ID373
Feature activation+5.03e-03

Pos logit contributions
 Anna+0.002
Anna+0.002
 Ann+0.002
 Jacob+0.002
's+0.002
Neg logit contributions
 idea-0.003
agons-0.003
reath-0.003
ble-0.003
ute-0.003
Token a
Token ID257
Feature activation+8.05e-03

Pos logit contributions
 Anna+0.007
 Ann+0.005
 Megan+0.005
Anna+0.005
 Pen+0.005
Neg logit contributions
ully-0.009
 idea-0.008
agons-0.008
 accomplish-0.008
ined-0.008
Token wealthy
Token ID11574
Feature activation+5.52e-03

Pos logit contributions
 Anna+0.012
 rats+0.011
 Ann+0.010
Anna+0.010
 mats+0.010
Neg logit contributions
 idea-0.012
ully-0.012
ined-0.011
 accomplish-0.011
ble-0.011
Token man
Token ID582
Feature activation+8.28e-03

Pos logit contributions
 Anna+0.008
 rats+0.007
Anna+0.006
 Ann+0.006
 mats+0.006
Neg logit contributions
 idea-0.009
ully-0.009
ined-0.009
 accomplish-0.008
ble-0.008
Token Maria
Token ID14200
Feature activation+3.19e-02

Pos logit contributions
pter+0.027
 rats+0.024
Rex+0.024
Jacob+0.024
 mats+0.023
Neg logit contributions
 idea-0.023
?"-0.021
?".-0.020
asty-0.020
 replied-0.020
Token.
Token ID13
Feature activation+1.76e-02

Pos logit contributions
 Anna+0.052
 rats+0.045
Anna+0.045
 Ann+0.044
 Pen+0.042
Neg logit contributions
 idea-0.045
ully-0.044
ined-0.042
 accomplish-0.041
ble-0.040
Token She
Token ID1375
Feature activation+1.25e-02

Pos logit contributions
 Anna+0.026
 rats+0.023
Anna+0.022
 Ann+0.021
 mats+0.021
Neg logit contributions
 idea-0.028
ully-0.027
ined-0.026
 accomplish-0.024
ble-0.024
Token was
Token ID373
Feature activation+6.09e-03

Pos logit contributions
 Anna+0.024
 rats+0.021
Anna+0.021
 Ann+0.021
 mats+0.020
Neg logit contributions
 idea-0.014
ully-0.014
ined-0.013
 accomplish-0.012
ble-0.012
Token three
Token ID1115
Feature activation+4.64e-03

Pos logit contributions
Anna+0.011
 mats+0.011
pter+0.011
 Anna+0.011
rost+0.010
Neg logit contributions
 idea-0.007
ined-0.006
 encouraged-0.005
asty-0.005
ully-0.005
Token-
Token ID12
Feature activation+5.53e-02

Pos logit contributions
 Anna+0.007
Anna+0.006
 Ann+0.006
 Pen+0.006
 mats+0.006
Neg logit contributions
 idea-0.007
ined-0.007
ully-0.006
agons-0.006
ble-0.006
Tokenyears
Token ID19002
Feature activation-1.12e-02

Pos logit contributions
 Anna+0.098
 mats+0.088
 rats+0.085
 Jacob+0.085
 beds+0.084
Neg logit contributions
ully-0.066
ined-0.066
 encouraged-0.060
 idea-0.056
mod-0.055
Token-
Token ID12
Feature activation+4.00e-02

Pos logit contributions
 idea+0.019
ully+0.019
ined+0.018
 accomplish+0.018
ble+0.017
Neg logit contributions
 Anna-0.014
 rats-0.012
Anna-0.012
 Ann-0.012
 mats-0.011
Tokenold
Token ID727
Feature activation+1.67e-02

Pos logit contributions
 Anna+0.079
 mats+0.071
 rats+0.071
ksh+0.070
 beds+0.069
Neg logit contributions
ully-0.042
ined-0.039
 idea-0.036
opl-0.034
 encouraged-0.033
Token and
Token ID290
Feature activation+1.24e-02

Pos logit contributions
rost+0.036
Nap+0.036
 mats+0.034
Anna+0.034
Jacob+0.034
Neg logit contributions
 idea-0.010
ully-0.009
?-0.007
 encouraged-0.007
 replied-0.007
Token loved
Token ID6151
Feature activation+2.76e-03

Pos logit contributions
 Anna+0.024
 rats+0.021
Anna+0.021
 Ann+0.021
 Megan+0.020
Neg logit contributions
ully-0.014
 idea-0.014
ined-0.012
 accomplish-0.012
ble-0.011
Token there
Token ID612
Feature activation-1.99e-03

Pos logit contributions
 idea+0.017
ully+0.015
ined+0.015
ble+0.013
 accomplish+0.013
Neg logit contributions
 Anna-0.037
 rats-0.035
Anna-0.034
 mats-0.034
 Ann-0.033
Token were
Token ID547
Feature activation-1.09e-02

Pos logit contributions
ully+0.004
agons+0.003
 idea+0.003
ined+0.003
 accomplish+0.003
Neg logit contributions
 Anna-0.003
 Ann-0.002
 Megan-0.002
Anna-0.002
 Jacob-0.002
Token two
Token ID734
Feature activation+1.37e-03

Pos logit contributions
 idea+0.015
ully+0.014
 ease+0.013
ined+0.012
?",+0.012
Neg logit contributions
 Anna-0.017
 rats-0.016
 mats-0.016
 Ann-0.015
Anna-0.015
Token friends
Token ID2460
Feature activation+6.94e-03

Pos logit contributions
 Anna+0.003
Anna+0.002
 rats+0.002
 Ann+0.002
 Megan+0.002
Neg logit contributions
 idea-0.002
ully-0.001
ined-0.001
 accomplish-0.001
ble-0.001
Token,
Token ID11
Feature activation+7.41e-04

Pos logit contributions
 Anna+0.013
 rats+0.011
Anna+0.011
 Ann+0.011
 Megan+0.011
Neg logit contributions
 idea-0.008
ully-0.008
ined-0.008
 accomplish-0.007
ble-0.007
Token Martha
Token ID27243
Feature activation+5.53e-02

Pos logit contributions
 Anna+0.001
 rats+0.001
 Ann+0.001
Anna+0.001
 Megan+0.001
Neg logit contributions
 idea-0.001
ully-0.001
ined-0.001
 accomplish-0.001
ble-0.001
Token and
Token ID290
Feature activation+1.23e-02

Pos logit contributions
 Anna+0.108
 mats+0.104
 rats+0.102
Anna+0.100
head+0.096
Neg logit contributions
 idea-0.052
ully-0.051
ined-0.045
 replied-0.041
ble-0.039
Token Sarah
Token ID10490
Feature activation+2.25e-02

Pos logit contributions
 Anna+0.020
 rats+0.017
 Ann+0.017
Anna+0.016
 Megan+0.016
Neg logit contributions
 idea-0.018
ined-0.017
ully-0.017
ble-0.016
 accomplish-0.016
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
 Anna+0.043
 rats+0.040
Anna+0.039
 mats+0.038
 Ann+0.037
Neg logit contributions
 idea-0.024
ully-0.023
ined-0.022
ble-0.019
 accomplish-0.019
Token They
Token ID1119
Feature activation+6.12e-03

Pos logit contributions
 Anna+0.019
 rats+0.017
Anna+0.017
 mats+0.016
 Ann+0.015
Neg logit contributions
 idea-0.023
ully-0.021
ined-0.021
 accomplish-0.019
 ease-0.019
Token were
Token ID547
Feature activation-6.46e-03

Pos logit contributions
 Anna+0.007
 Ann+0.005
Anna+0.004
 Megan+0.004
's+0.004
Neg logit contributions
opl-0.013
ble-0.013
inkle-0.012
mmm-0.012
ully-0.012
Token saw
Token ID2497
Feature activation-1.86e-03

Pos logit contributions
 idea+0.008
ully+0.008
ined+0.008
 accomplish+0.008
ble+0.007
Neg logit contributions
 Anna-0.007
 rats-0.006
Anna-0.006
 Ann-0.006
 Megan-0.005
Token another
Token ID1194
Feature activation-1.32e-02

Pos logit contributions
 idea+0.003
ully+0.003
ined+0.003
 accomplish+0.003
inkle+0.003
Neg logit contributions
 Anna-0.002
 rats-0.002
 Ann-0.002
 Megan-0.002
 Jacob-0.002
Token wild
Token ID4295
Feature activation+2.27e-03

Pos logit contributions
ined+0.022
ully+0.022
 accomplish+0.022
asty+0.020
 ease+0.020
Neg logit contributions
 Anna-0.018
 rats-0.015
 minutes-0.014
Anna-0.014
 Ann-0.014
Tokener
Token ID263
Feature activation+4.11e-03

Pos logit contributions
 Anna+0.003
 rats+0.003
 Ann+0.003
 mats+0.003
Anna+0.003
Neg logit contributions
ully-0.003
 idea-0.003
ined-0.003
 accomplish-0.003
 ease-0.003
Tokenbe
Token ID1350
Feature activation-4.84e-03

Pos logit contributions
 Anna+0.006
 Ann+0.005
 rats+0.005
 Megan+0.005
pter+0.005
Neg logit contributions
 idea-0.006
ully-0.006
 accomplish-0.006
agons-0.006
ined-0.006
Tokenast
Token ID459
Feature activation+5.52e-02

Pos logit contributions
ully+0.007
 idea+0.007
 accomplish+0.007
ined+0.007
 shar+0.007
Neg logit contributions
 Anna-0.007
 rats-0.006
Anna-0.006
 Ann-0.006
pter-0.006
Token on
Token ID319
Feature activation+2.26e-03

Pos logit contributions
 Anna+0.081
 Ann+0.065
 rats+0.064
 Pen+0.063
Anna+0.063
Neg logit contributions
 idea-0.087
ully-0.086
ined-0.083
 hobby-0.082
 accomplish-0.079
Token the
Token ID262
Feature activation+3.21e-03

Pos logit contributions
 Anna+0.003
 rats+0.002
 Ann+0.002
 Pen+0.002
 mats+0.002
Neg logit contributions
ined-0.005
ully-0.005
 idea-0.004
 accomplish-0.004
inkle-0.004
Token shore
Token ID15191
Feature activation-1.12e-02

Pos logit contributions
 Anna+0.005
 rats+0.004
Anna+0.004
 Ann+0.004
 mats+0.004
Neg logit contributions
 idea-0.005
ully-0.005
ined-0.005
 accomplish-0.004
ble-0.004
Token.
Token ID13
Feature activation-9.44e-04

Pos logit contributions
 idea+0.016
ully+0.016
ined+0.016
 accomplish+0.015
inkle+0.015
Neg logit contributions
 Anna-0.017
 rats-0.015
 Ann-0.015
 Megan-0.014
 mats-0.014
Token The
Token ID383
Feature activation-7.37e-03

Pos logit contributions
 idea+0.002
ined+0.002
ully+0.002
inkle+0.002
agons+0.002
Neg logit contributions
 Anna-0.001
 Ann-0.001
 Megan-0.001
 Jacob-0.001
 Pen-0.001
Token alone
Token ID3436
Feature activation+2.13e-03

Pos logit contributions
 mats+0.013
pter+0.012
 Anna+0.012
Anna+0.012
rost+0.012
Neg logit contributions
 idea-0.007
ined-0.007
ble-0.006
 encouraged-0.006
inkle-0.005
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
 Anna+0.003
 rats+0.003
Anna+0.003
 Ann+0.003
 mats+0.003
Neg logit contributions
 idea-0.003
ully-0.003
ined-0.003
 accomplish-0.003
ble-0.003
Token
Token ID198
Feature activation-9.65e-03

Pos logit contributions
 rats+0.018
 mats+0.017
pter+0.017
heads+0.017
 Anna+0.017
Neg logit contributions
 idea-0.011
ully-0.009
 Sebastian-0.009
ble-0.008
 Dylan-0.008
Token
Token ID198
Feature activation-2.54e-03

Pos logit contributions
 idea+0.011
ully+0.011
ined+0.011
ble+0.010
 accomplish+0.010
Neg logit contributions
 Anna-0.017
 rats-0.016
 mats-0.015
Anna-0.015
 Ann-0.015
TokenCr
Token ID13916
Feature activation+9.91e-03

Pos logit contributions
ble+0.003
 idea+0.003
ined+0.003
 accomplish+0.003
ully+0.003
Neg logit contributions
 Anna-0.004
 rats-0.004
pter-0.004
 mats-0.004
 Ann-0.004
Tokenab
Token ID397
Feature activation+5.49e-02

Pos logit contributions
 Anna+0.016
Anna+0.015
 rats+0.014
 Ann+0.014
 mats+0.014
Neg logit contributions
 idea-0.013
ully-0.013
ined-0.012
 shar-0.012
 accomplish-0.011
Token saw
Token ID2497
Feature activation+3.96e-03

Pos logit contributions
 Anna+0.084
 rats+0.078
 mats+0.073
Anna+0.072
 Ann+0.069
Neg logit contributions
 idea-0.079
ully-0.079
ble-0.076
ined-0.075
inkle-0.070
Token Sara
Token ID24799
Feature activation+2.26e-02

Pos logit contributions
 Anna+0.004
 rats+0.004
Anna+0.004
pter+0.004
 mats+0.004
Neg logit contributions
 idea-0.007
ined-0.007
ully-0.007
ble-0.007
 accomplish-0.006
Token and
Token ID290
Feature activation+2.13e-02

Pos logit contributions
 Anna+0.035
Anna+0.031
 rats+0.031
 mats+0.030
 Megan+0.030
Neg logit contributions
 idea-0.032
ined-0.030
ully-0.030
ble-0.029
 accomplish-0.028
Token Ben
Token ID3932
Feature activation-1.16e-03

Pos logit contributions
 Anna+0.023
 rats+0.020
Anna+0.020
 Ann+0.019
 mats+0.019
Neg logit contributions
 idea-0.039
ully-0.039
ined-0.037
ble-0.036
 accomplish-0.035
Token.
Token ID13
Feature activation+6.12e-03

Pos logit contributions
 idea+0.002
ully+0.002
ined+0.002
 accomplish+0.002
ble+0.002
Neg logit contributions
 Anna-0.002
 rats-0.001
Anna-0.001
 Ann-0.001
 mats-0.001
Token They
Token ID1119
Feature activation+1.40e-03

Feature not active on prev token;
no predictions were affected.
Token hear
Token ID3285
Feature activation+1.70e-02

Pos logit contributions
 Anna+0.002
Anna+0.002
 rats+0.002
 Pen+0.002
 Ann+0.002
Neg logit contributions
ully-0.002
ined-0.002
ble-0.002
 idea-0.002
opl-0.002
Token Spot
Token ID15899
Feature activation+1.76e-02

Pos logit contributions
 Anna+0.022
 rats+0.019
Anna+0.018
 Ann+0.018
 mats+0.017
Neg logit contributions
 idea-0.029
ully-0.028
ined-0.028
 accomplish-0.026
ble-0.026
Token's
Token ID338
Feature activation+1.90e-02

Pos logit contributions
 Anna+0.028
 rats+0.025
Anna+0.025
 mats+0.024
 Megan+0.024
Neg logit contributions
 idea-0.024
ully-0.024
ined-0.022
 accomplish-0.022
ble-0.021
Token sn
Token ID3013
Feature activation+7.92e-03

Pos logit contributions
 Anna+0.029
Anna+0.029
 rats+0.028
Ann+0.027
Jacob+0.027
Neg logit contributions
 idea-0.031
ully-0.023
ble-0.022
 accomplish-0.022
 replied-0.021
Tokenores
Token ID2850
Feature activation+5.45e-02

Pos logit contributions
 Anna+0.011
 rats+0.010
Anna+0.009
 Ann+0.009
 mats+0.009
Neg logit contributions
 idea-0.013
ully-0.013
ined-0.013
ble-0.012
 accomplish-0.012
Token from
Token ID422
Feature activation-6.42e-03

Pos logit contributions
 Anna+0.086
 rats+0.071
 Ann+0.071
Anna+0.069
 mats+0.068
Neg logit contributions
ully-0.084
 idea-0.080
ined-0.078
 accomplish-0.074
 shar-0.073
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
ully+0.012
 idea+0.012
ined+0.011
 accomplish+0.011
ble+0.011
Neg logit contributions
 Anna-0.008
 rats-0.007
 Ann-0.006
Anna-0.006
 Pen-0.006
Token living
Token ID2877
Feature activation+2.58e-02

Pos logit contributions
 Anna+0.013
 rats+0.012
 Megan+0.012
 Ann+0.012
Anna+0.011
Neg logit contributions
 idea-0.018
ined-0.016
ully-0.016
ble-0.015
agons-0.015
Token room
Token ID2119
Feature activation+1.19e-02

Pos logit contributions
 rats+0.029
 Anna+0.028
Sky+0.026
Aud+0.026
 Ann+0.026
Neg logit contributions
 idea-0.043
ined-0.042
 hobby-0.038
?-0.037
ble-0.037
Token.
Token ID13
Feature activation-6.40e-04

Pos logit contributions
 Anna+0.021
 rats+0.019
Anna+0.018
 Ann+0.018
 Megan+0.018
Neg logit contributions
 idea-0.015
ully-0.014
ined-0.014
 accomplish-0.013
ble-0.013
Token Lily
Token ID20037
Feature activation-1.70e-02

Pos logit contributions
agons+0.026
ined+0.025
ully+0.025
 idea+0.025
ble+0.025
Neg logit contributions
 Anna-0.017
 Ann-0.012
Anna-0.011
 Luna-0.011
 Mia-0.011
Token replied
Token ID8712
Feature activation-9.32e-03

Pos logit contributions
agons+0.028
ully+0.027
opl+0.027
ute+0.026
ined+0.026
Neg logit contributions
 Anna-0.030
 Jacob-0.023
 Ann-0.022
Anna-0.022
 Ana-0.022
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
ully+0.012
 idea+0.012
ined+0.011
 accomplish+0.011
agons+0.011
Neg logit contributions
 Anna-0.017
 rats-0.014
 Ann-0.014
Anna-0.014
 Megan-0.014
Token "
Token ID366
Feature activation+3.29e-03

Pos logit contributions
ully+0.032
 idea+0.031
agons+0.031
opl+0.030
ined+0.030
Neg logit contributions
 Anna-0.022
 Ann-0.016
 Pen-0.015
 Jacob-0.015
 Ana-0.015
TokenTo
Token ID2514
Feature activation+2.60e-02

Pos logit contributions
 Anna+0.004
 rats+0.004
 Ann+0.004
 mats+0.004
 Megan+0.004
Neg logit contributions
 idea-0.005
ully-0.005
ined-0.005
 accomplish-0.005
ble-0.005
Tokenast
Token ID459
Feature activation+5.42e-02

Pos logit contributions
 Anna+0.036
 Ann+0.027
Anna+0.027
 rats+0.026
 Megan+0.025
Neg logit contributions
ully-0.049
 idea-0.047
ined-0.046
agons-0.042
opl-0.042
Token with
Token ID351
Feature activation+2.44e-03

Pos logit contributions
 Anna+0.083
 Ann+0.064
Anna+0.064
 Megan+0.061
 rats+0.061
Neg logit contributions
 idea-0.089
ully-0.088
ined-0.087
 hobby-0.082
 accomplish-0.081
Token jam
Token ID16853
Feature activation-6.59e-03

Pos logit contributions
 Anna+0.003
Anna+0.003
 rats+0.003
 Ann+0.003
 Megan+0.003
Neg logit contributions
 idea-0.004
ully-0.004
ined-0.004
 accomplish-0.003
ble-0.003
Token!
Token ID0
Feature activation+5.13e-03

Pos logit contributions
ully+0.011
ined+0.009
 ease+0.009
 gol+0.009
 idea+0.009
Neg logit contributions
 Anna-0.011
 rats-0.009
 Ann-0.009
Anna-0.009
 mats-0.009
Token It
Token ID632
Feature activation+2.15e-03

Pos logit contributions
 Anna+0.006
 rats+0.005
Anna+0.005
 Ann+0.005
 mats+0.005
Neg logit contributions
 idea-0.009
ully-0.009
ined-0.009
 accomplish-0.008
ble-0.008
Token was
Token ID373
Feature activation-4.12e-03

Pos logit contributions
 Anna+0.003
 rats+0.002
Anna+0.002
 mats+0.002
 Ann+0.002
Neg logit contributions
 idea-0.004
ined-0.003
ully-0.003
 accomplish-0.003
ble-0.003
Token room
Token ID2119
Feature activation+1.01e-03

Pos logit contributions
 Anna+0.016
 rats+0.013
Anna+0.013
 Ann+0.012
 Megan+0.012
Neg logit contributions
 idea-0.026
ully-0.024
ined-0.024
 accomplish-0.023
ble-0.023
Token now
Token ID783
Feature activation-8.70e-03

Pos logit contributions
 Anna+0.002
 rats+0.001
Anna+0.001
 Ann+0.001
 mats+0.001
Neg logit contributions
ully-0.001
 idea-0.001
ined-0.001
 accomplish-0.001
ble-0.001
Token."
Token ID526
Feature activation-9.26e-03

Pos logit contributions
ully+0.019
ined+0.017
 ease+0.016
imm+0.016
 gladly+0.016
Neg logit contributions
 Anna-0.012
Anna-0.009
 rats-0.008
 Ann-0.008
 mats-0.008
Token She
Token ID1375
Feature activation+5.04e-03

Pos logit contributions
 idea+0.008
 replied+0.007
?+0.007
?"+0.007
?'+0.006
Neg logit contributions
pter-0.018
 rats-0.016
co-0.016
rost-0.016
ifier-0.015
Token takes
Token ID2753
Feature activation+1.97e-02

Pos logit contributions
 Anna+0.006
pter+0.006
 rats+0.006
 Megan+0.006
head+0.006
Neg logit contributions
 idea-0.008
ined-0.007
 encouraged-0.007
 replied-0.007
 accomplish-0.007
Token Anna
Token ID11735
Feature activation+5.41e-02

Pos logit contributions
pter+0.023
heads+0.022
cious+0.022
 rats+0.021
ksh+0.021
Neg logit contributions
 idea-0.032
 ease-0.030
?"-0.027
?-0.026
?'-0.026
Token by
Token ID416
Feature activation+1.43e-02

Pos logit contributions
 Megan+0.079
Jacob+0.078
 Ana+0.076
 Anna+0.075
ksh+0.075
Neg logit contributions
 idea-0.069
ully-0.065
 encouraged-0.060
?-0.058
 explained-0.058
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
 Anna+0.019
 rats+0.016
Anna+0.015
 Ann+0.015
 Megan+0.015
Neg logit contributions
 idea-0.025
ully-0.025
ined-0.024
 accomplish-0.023
ble-0.022
Token hand
Token ID1021
Feature activation+4.41e-03

Pos logit contributions
 Anna+0.019
 rats+0.016
Anna+0.016
 Ann+0.016
 Megan+0.015
Neg logit contributions
 idea-0.025
ully-0.024
ined-0.023
ble-0.022
 accomplish-0.022
Token and
Token ID290
Feature activation+1.55e-02

Pos logit contributions
 Anna+0.008
 rats+0.007
Anna+0.007
 Megan+0.007
 Ann+0.007
Neg logit contributions
 idea-0.005
ully-0.005
ined-0.005
ble-0.004
 accomplish-0.004
Token leads
Token ID5983
Feature activation+3.49e-02

Pos logit contributions
pter+0.020
rost+0.019
ksh+0.019
Anna+0.019
head+0.019
Neg logit contributions
 idea-0.022
 explained-0.020
 shar-0.020
 replied-0.020
 encouraged-0.020
Token.
Token ID13
Feature activation+4.94e-03

Pos logit contributions
 Anna+0.011
Anna+0.008
 downstairs+0.008
 Ann+0.008
head+0.008
Neg logit contributions
 idea-0.015
ined-0.015
ully-0.014
 hobby-0.014
 gladly-0.013
Token I
Token ID314
Feature activation+2.95e-03

Pos logit contributions
 rats+0.007
 Anna+0.006
 mats+0.006
 beds+0.006
 Megan+0.006
Neg logit contributions
 idea-0.007
 shar-0.006
?'-0.006
asty-0.006
ully-0.006
Token am
Token ID716
Feature activation-2.12e-03

Pos logit contributions
 Anna+0.004
Anna+0.003
 Ann+0.003
 rats+0.003
 knocks+0.003
Neg logit contributions
ully-0.006
ined-0.005
ble-0.005
agons-0.005
opl-0.005
Token a
Token ID257
Feature activation+2.44e-04

Pos logit contributions
ully+0.004
ined+0.004
 idea+0.004
agons+0.004
 accomplish+0.003
Neg logit contributions
 Anna-0.003
 Ann-0.002
 rats-0.002
 Pen-0.002
 Ana-0.002
Token lamp
Token ID20450
Feature activation+1.30e-02

Pos logit contributions
 Anna+0.000
 rats+0.000
Anna+0.000
 Ann+0.000
 Megan+0.000
Neg logit contributions
ully-0.000
 idea-0.000
ined-0.000
 accomplish-0.000
ble-0.000
Token-
Token ID12
Feature activation+5.35e-02

Pos logit contributions
 Anna+0.023
 rats+0.020
Anna+0.020
 Ann+0.020
 mats+0.020
Neg logit contributions
 idea-0.016
ully-0.015
ined-0.015
ble-0.014
 accomplish-0.014
Tokenhead
Token ID2256
Feature activation+1.58e-02

Pos logit contributions
 Anna+0.084
 rats+0.074
 mats+0.073
 Megan+0.072
 beds+0.071
Neg logit contributions
ined-0.075
ully-0.074
 shar-0.068
ble-0.066
?",-0.065
Token!"
Token ID2474
Feature activation-7.58e-03

Pos logit contributions
 Anna+0.027
 rats+0.024
Anna+0.023
 Ann+0.023
 mats+0.023
Neg logit contributions
 idea-0.021
ully-0.020
ined-0.019
 accomplish-0.018
ble-0.018
Token
Token ID198
Feature activation-4.89e-03

Pos logit contributions
ined+0.014
ully+0.014
 idea+0.014
ble+0.013
inkle+0.012
Neg logit contributions
 Anna-0.012
 Ann-0.009
Anna-0.009
 Megan-0.008
 Pen-0.008
Token
Token ID198
Feature activation-9.54e-04

Pos logit contributions
 idea+0.009
ully+0.008
ined+0.008
 accomplish+0.008
 gem+0.008
Neg logit contributions
 Anna-0.007
Anna-0.006
 Ann-0.006
Ann-0.005
 Pen-0.005
TokenTom
Token ID13787
Feature activation+7.02e-03

Pos logit contributions
 idea+0.002
ully+0.001
ined+0.001
 accomplish+0.001
ble+0.001
Neg logit contributions
 Anna-0.001
 rats-0.001
Anna-0.001
 Ann-0.001
 mats-0.001
Token and
Token ID290
Feature activation+1.98e-02

Pos logit contributions
 idea+0.024
ined+0.020
?+0.019
ully+0.019
ble+0.019
Neg logit contributions
 Anna-0.052
 Pen-0.050
 rats-0.050
pter-0.050
Pen-0.050
Token embarrassed
Token ID21100
Feature activation-6.53e-03

Pos logit contributions
 Anna+0.031
 rats+0.027
Anna+0.026
 Ann+0.026
 Megan+0.025
Neg logit contributions
ully-0.029
 idea-0.029
ined-0.028
 accomplish-0.026
ble-0.026
Token.
Token ID13
Feature activation+3.00e-03

Pos logit contributions
 idea+0.007
ined+0.007
ully+0.006
ble+0.006
 accomplish+0.006
Neg logit contributions
 Anna-0.012
 rats-0.011
Anna-0.011
pter-0.011
 Pen-0.011
Token She
Token ID1375
Feature activation+1.29e-02

Pos logit contributions
 Anna+0.005
 rats+0.004
Anna+0.004
 Ann+0.004
 Megan+0.004
Neg logit contributions
 idea-0.004
ully-0.004
ined-0.004
 accomplish-0.004
ble-0.004
Token took
Token ID1718
Feature activation+1.42e-02

Pos logit contributions
 Anna+0.020
Anna+0.017
 Ann+0.016
 rats+0.016
 Megan+0.015
Neg logit contributions
ully-0.022
 idea-0.020
opl-0.020
ined-0.019
ble-0.019
Token Anna
Token ID11735
Feature activation+5.35e-02

Pos logit contributions
pter+0.018
 rats+0.018
heads+0.017
 Anna+0.017
Anna+0.017
Neg logit contributions
 idea-0.020
ully-0.019
 ease-0.019
 Sammy-0.018
 shar-0.018
Token and
Token ID290
Feature activation+2.01e-02

Pos logit contributions
 Megan+0.083
 Anna+0.083
Jacob+0.082
 Ana+0.081
Anna+0.081
Neg logit contributions
 idea-0.057
ully-0.057
 encouraged-0.052
 explained-0.050
?-0.048
Token Ben
Token ID3932
Feature activation+2.19e-02

Pos logit contributions
 Anna+0.021
 rats+0.020
Anna+0.019
pter+0.019
 mats+0.018
Neg logit contributions
 idea-0.034
ully-0.034
 shar-0.032
 replied-0.032
ined-0.031
Token by
Token ID416
Feature activation+1.01e-02

Pos logit contributions
 Anna+0.031
Anna+0.029
 Ana+0.028
 Megan+0.028
 rats+0.028
Neg logit contributions
 idea-0.032
ully-0.029
 encouraged-0.028
 explained-0.027
 ease-0.027
Token the
Token ID262
Feature activation+9.07e-03

Pos logit contributions
 Anna+0.014
 rats+0.012
Anna+0.011
 Ann+0.011
 mats+0.011
Neg logit contributions
 idea-0.017
ully-0.017
ined-0.016
 accomplish-0.015
ble-0.015
Token hand
Token ID1021
Feature activation+1.15e-03

Pos logit contributions
 Anna+0.011
Anna+0.009
 rats+0.009
 Ann+0.009
 Megan+0.009
Neg logit contributions
 idea-0.016
ully-0.015
ined-0.015
ble-0.015
 accomplish-0.015
Token there
Token ID612
Feature activation+5.13e-04

Pos logit contributions
 idea+0.005
ully+0.005
ined+0.005
ble+0.005
 accomplish+0.005
Neg logit contributions
 Anna-0.004
 rats-0.003
 mats-0.003
Anna-0.003
 Ann-0.003
Token were
Token ID547
Feature activation-7.20e-03

Pos logit contributions
 Anna+0.001
 Ann+0.001
 Megan+0.001
Anna+0.001
 Jacob+0.001
Neg logit contributions
ully-0.001
agons-0.001
 idea-0.001
ined-0.001
 accomplish-0.001
Token two
Token ID734
Feature activation+1.00e-03

Pos logit contributions
 idea+0.009
ully+0.009
 ease+0.008
ined+0.007
 Sammy+0.007
Neg logit contributions
 rats-0.011
 Anna-0.011
 mats-0.011
 knocks-0.011
pter-0.011
Token friends
Token ID2460
Feature activation+4.04e-03

Pos logit contributions
 Anna+0.002
Anna+0.002
 Ann+0.002
 rats+0.002
 Megan+0.002
Neg logit contributions
 idea-0.001
ully-0.001
ined-0.001
agons-0.001
 accomplish-0.001
Token,
Token ID11
Feature activation-4.68e-03

Pos logit contributions
 Anna+0.008
 rats+0.008
Anna+0.007
 Ann+0.007
 mats+0.007
Neg logit contributions
 idea-0.004
ully-0.004
ined-0.004
 accomplish-0.003
ble-0.003
Token Jean
Token ID11320
Feature activation+5.32e-02

Pos logit contributions
 idea+0.008
ined+0.008
ully+0.008
 accomplish+0.008
ble+0.008
Neg logit contributions
 Anna-0.006
 Ann-0.005
 rats-0.005
Anna-0.005
 Megan-0.005
Token and
Token ID290
Feature activation+9.08e-03

Pos logit contributions
 Anna+0.092
 rats+0.082
Anna+0.080
 Ann+0.078
 mats+0.077
Neg logit contributions
 idea-0.068
ully-0.067
ined-0.064
 accomplish-0.059
ble-0.059
Token Jane
Token ID12091
Feature activation+2.34e-02

Pos logit contributions
 Anna+0.015
 rats+0.013
Anna+0.013
 mats+0.012
 Ann+0.012
Neg logit contributions
 idea-0.013
ully-0.012
ined-0.012
 accomplish-0.011
ble-0.011
Token.
Token ID13
Feature activation+1.73e-02

Pos logit contributions
 Anna+0.042
 rats+0.038
Anna+0.037
 Ann+0.036
 mats+0.036
Neg logit contributions
 idea-0.029
ully-0.028
ined-0.027
 accomplish-0.024
ble-0.024
Token They
Token ID1119
Feature activation+9.24e-03

Pos logit contributions
 Anna+0.024
 rats+0.021
Anna+0.020
 mats+0.019
 Megan+0.019
Neg logit contributions
 idea-0.029
ully-0.028
ined-0.027
 accomplish-0.025
ble-0.024
Token always
Token ID1464
Feature activation-1.01e-02

Pos logit contributions
 Anna+0.010
 Ann+0.007
Anna+0.007
's+0.006
 Mia+0.006
Neg logit contributions
opl-0.020
ble-0.018
inkle-0.018
mmm-0.018
 accomplish-0.017
Token and
Token ID290
Feature activation+1.96e-02

Pos logit contributions
 idea+0.001
ully+0.001
ined+0.001
?+0.001
 encouraged+0.001
Neg logit contributions
 Anna-0.002
pter-0.002
Anna-0.002
 Pen-0.002
 mats-0.002
Token angry
Token ID7954
Feature activation-2.57e-02

Pos logit contributions
 Anna+0.031
 rats+0.027
 Ann+0.026
Anna+0.026
 Megan+0.025
Neg logit contributions
ully-0.029
 idea-0.029
ined-0.027
 accomplish-0.026
ble-0.025
Token.
Token ID13
Feature activation+8.16e-03

Pos logit contributions
 idea+0.021
ble+0.015
ully+0.015
 replied+0.014
ined+0.014
Neg logit contributions
 Anna-0.052
Anna-0.052
pter-0.052
Pen-0.052
 Pen-0.051
Token She
Token ID1375
Feature activation+8.58e-03

Pos logit contributions
pter+0.014
 rats+0.014
 mats+0.013
ksh+0.013
 Anna+0.013
Neg logit contributions
 idea-0.008
?'-0.007
 accomplish-0.007
 replied-0.007
?".-0.007
Token took
Token ID1718
Feature activation+1.56e-02

Pos logit contributions
 Anna+0.015
Anna+0.013
 rats+0.013
 Ann+0.012
 Megan+0.012
Neg logit contributions
ully-0.012
 idea-0.012
ined-0.011
 accomplish-0.011
opl-0.011
Token Anna
Token ID11735
Feature activation+5.32e-02

Pos logit contributions
pter+0.026
ksh+0.025
heads+0.023
rd+0.023
cious+0.023
Neg logit contributions
 ease-0.019
 idea-0.018
?"-0.015
asty-0.014
 spelled-0.014
Token to
Token ID284
Feature activation+1.29e-02

Pos logit contributions
Jacob+0.085
 Megan+0.085
pter+0.084
ksh+0.083
Ann+0.080
Neg logit contributions
ully-0.054
 idea-0.054
 encouraged-0.054
 explained-0.053
?-0.049
Token the
Token ID262
Feature activation+1.03e-02

Pos logit contributions
pter+0.022
ksh+0.020
 rats+0.020
 mats+0.019
head+0.019
Neg logit contributions
 idea-0.015
 ease-0.014
asty-0.014
 accomplish-0.013
 shar-0.013
Token doctor
Token ID6253
Feature activation+1.85e-02

Pos logit contributions
pter+0.013
ksh+0.012
 Anna+0.012
 rats+0.012
 Ann+0.012
Neg logit contributions
 idea-0.016
ble-0.015
ined-0.014
 gem-0.014
 shar-0.014
Token right
Token ID826
Feature activation-2.06e-03

Pos logit contributions
 Anna+0.034
 rats+0.034
Anna+0.033
 Megan+0.032
 mats+0.032
Neg logit contributions
 idea-0.018
ully-0.016
 replied-0.015
 explained-0.015
 shar-0.014
Token away
Token ID1497
Feature activation-9.47e-03

Pos logit contributions
 idea+0.003
ully+0.003
ined+0.003
ble+0.003
 replied+0.003
Neg logit contributions
 Anna-0.003
 rats-0.003
 mats-0.002
 Pen-0.002
Anna-0.002
Token<|endoftext|>
Token ID50256
Feature activation+1.56e-03

Pos logit contributions
 Anna+0.010
 Ann+0.008
 Megan+0.008
Anna+0.007
 Jacob+0.007
Neg logit contributions
ined-0.014
 idea-0.014
ully-0.013
ble-0.013
inkle-0.013
TokenTim
Token ID14967
Feature activation+7.17e-03

Pos logit contributions
 Anna+0.002
Anna+0.002
 Ann+0.002
 Pen+0.002
 Megan+0.002
Neg logit contributions
 idea-0.003
ully-0.003
ined-0.003
 accomplish-0.002
agons-0.002
Token was
Token ID373
Feature activation+4.90e-03

Pos logit contributions
 Anna+0.005
Anna+0.003
 Ann+0.002
's+0.002
 Jacob+0.002
Neg logit contributions
ble-0.017
 accomplish-0.017
 idea-0.017
agons-0.017
ute-0.017
Token a
Token ID257
Feature activation+1.57e-02

Pos logit contributions
 Anna+0.007
 rats+0.006
Anna+0.005
 Ann+0.005
 Pen+0.005
Neg logit contributions
ully-0.008
 idea-0.008
 accomplish-0.007
ined-0.007
 hobby-0.007
Token three
Token ID1115
Feature activation+2.40e-03

Pos logit contributions
 Anna+0.024
 rats+0.021
 Ann+0.020
Anna+0.020
 mats+0.020
Neg logit contributions
 idea-0.023
ully-0.022
ined-0.022
ble-0.021
 accomplish-0.020
Token-
Token ID12
Feature activation+5.29e-02

Pos logit contributions
 Anna+0.003
 rats+0.003
Anna+0.003
 Ann+0.003
 mats+0.003
Neg logit contributions
 idea-0.004
ined-0.004
ully-0.004
ble-0.003
 accomplish-0.003
Tokenyear
Token ID1941
Feature activation+2.74e-02

Pos logit contributions
 Anna+0.091
 rats+0.081
 mats+0.080
 Ann+0.077
 Pen+0.076
Neg logit contributions
ined-0.068
ully-0.067
 idea-0.067
ble-0.059
 shar-0.057
Token-
Token ID12
Feature activation+3.42e-02

Pos logit contributions
 Anna+0.042
Anna+0.037
 rats+0.035
 Ann+0.035
 mats+0.034
Neg logit contributions
 idea-0.044
ully-0.041
ined-0.040
ble-0.037
opl-0.035
Tokenold
Token ID727
Feature activation+1.83e-02

Pos logit contributions
 Anna+0.063
 rats+0.056
 mats+0.054
 Pen+0.053
 Ann+0.053
Neg logit contributions
ully-0.040
 idea-0.040
ined-0.040
 accomplish-0.034
ble-0.034
Token boy
Token ID2933
Feature activation-3.56e-03

Pos logit contributions
 Anna+0.022
 rats+0.019
Anna+0.018
 Ann+0.017
 mats+0.017
Neg logit contributions
 idea-0.033
ully-0.033
ined-0.032
 accomplish-0.030
ble-0.030
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
 idea+0.006
ully+0.006
ble+0.006
ined+0.006
 accomplish+0.006
Neg logit contributions
 Anna-0.005
 rats-0.004
 Ann-0.004
Anna-0.004
pter-0.004
Token was
Token ID373
Feature activation-1.10e-02

Pos logit contributions
ully+0.003
agons+0.003
 idea+0.002
ined+0.002
 accomplish+0.002
Neg logit contributions
 Anna-0.002
 Ann-0.002
Anna-0.002
 Megan-0.002
 Jacob-0.002
Token a
Token ID257
Feature activation-7.41e-03

Pos logit contributions
 idea+0.018
ully+0.018
ined+0.017
 accomplish+0.016
ble+0.016
Neg logit contributions
 Anna-0.015
 rats-0.013
Anna-0.013
 Ann-0.013
 Megan-0.012
Token girl
Token ID2576
Feature activation+1.18e-02

Pos logit contributions
 idea+0.011
ully+0.011
ined+0.011
ble+0.010
 accomplish+0.010
Neg logit contributions
 Anna-0.011
 rats-0.010
 Ann-0.009
Anna-0.009
 mats-0.009
Token named
Token ID3706
Feature activation+1.65e-02

Pos logit contributions
 Anna+0.028
 rats+0.027
 mats+0.026
Anna+0.026
 Megan+0.025
Neg logit contributions
 idea-0.006
ully-0.006
ined-0.006
 shar-0.004
ble-0.004
Token K
Token ID509
Feature activation+3.54e-02

Pos logit contributions
 rats+0.022
pter+0.021
 mats+0.019
Jacob+0.019
Rex+0.019
Neg logit contributions
 idea-0.025
ully-0.023
asty-0.022
?'-0.022
 replied-0.022
Tokenait
Token ID4548
Feature activation+5.29e-02

Pos logit contributions
 Anna+0.052
Anna+0.045
 Ann+0.044
 rats+0.044
 Megan+0.043
Neg logit contributions
 idea-0.056
ined-0.053
ully-0.052
 accomplish-0.051
ble-0.049
Tokenlyn
Token ID6213
Feature activation+2.00e-02

Pos logit contributions
 Anna+0.068
 rats+0.057
Anna+0.056
 Ann+0.055
 Megan+0.053
Neg logit contributions
 idea-0.092
ully-0.090
ined-0.088
 accomplish-0.084
ble-0.082
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
 Anna+0.034
 rats+0.029
Anna+0.029
 Ann+0.029
 Megan+0.028
Neg logit contributions
 idea-0.027
ully-0.027
ined-0.025
 accomplish-0.024
ble-0.023
Token She
Token ID1375
Feature activation+7.58e-03

Pos logit contributions
 Anna+0.022
 rats+0.020
Anna+0.019
 mats+0.018
 Jacob+0.018
Neg logit contributions
 idea-0.020
ully-0.019
ined-0.018
 accomplish-0.016
ble-0.016
Token was
Token ID373
Feature activation-3.81e-04

Pos logit contributions
 Anna+0.014
Anna+0.012
 rats+0.012
 Ann+0.012
 Pen+0.011
Neg logit contributions
ully-0.010
 idea-0.010
ined-0.009
 accomplish-0.009
ble-0.008
Token three
Token ID1115
Feature activation+1.55e-03

Pos logit contributions
 idea+0.000
ined+0.000
ully+0.000
ble+0.000
 encouraged+0.000
Neg logit contributions
 Anna-0.001
Anna-0.001
 rats-0.001
 mats-0.001
pter-0.001
Token Maria
Token ID14200
Feature activation+3.04e-02

Pos logit contributions
pter+0.018
 rats+0.018
Rex+0.016
 mats+0.016
Jacob+0.016
Neg logit contributions
 idea-0.022
asty-0.019
?'-0.019
?".-0.019
ully-0.019
Token.
Token ID13
Feature activation+1.68e-02

Pos logit contributions
 Anna+0.049
Anna+0.041
 rats+0.041
 Ann+0.040
 Pen+0.039
Neg logit contributions
 idea-0.044
ully-0.044
ined-0.042
 accomplish-0.041
ble-0.040
Token She
Token ID1375
Feature activation+1.05e-02

Pos logit contributions
 Anna+0.024
 rats+0.021
Anna+0.020
 Ann+0.020
 Megan+0.019
Neg logit contributions
 idea-0.027
ully-0.026
ined-0.026
ble-0.024
 accomplish-0.024
Token was
Token ID373
Feature activation+3.63e-03

Pos logit contributions
 Anna+0.019
Anna+0.017
 rats+0.016
 Ann+0.016
 Pen+0.015
Neg logit contributions
ully-0.013
 idea-0.013
ined-0.012
 accomplish-0.012
ble-0.012
Token three
Token ID1115
Feature activation+3.71e-03

Pos logit contributions
 Anna+0.006
Anna+0.006
 rats+0.006
 mats+0.006
pter+0.005
Neg logit contributions
 idea-0.005
ined-0.004
ully-0.004
 encouraged-0.004
asty-0.004
Token-
Token ID12
Feature activation+5.26e-02

Pos logit contributions
 Anna+0.005
Anna+0.004
 rats+0.004
 Ann+0.004
 Pen+0.004
Neg logit contributions
 idea-0.006
ined-0.006
ully-0.006
ble-0.005
agons-0.005
Tokenyears
Token ID19002
Feature activation-1.10e-02

Pos logit contributions
 Anna+0.087
 mats+0.076
 rats+0.076
Anna+0.072
 Jacob+0.071
Neg logit contributions
ully-0.071
ined-0.069
 idea-0.065
 encouraged-0.062
 shar-0.060
Token-
Token ID12
Feature activation+3.66e-02

Pos logit contributions
 idea+0.020
ully+0.019
ined+0.019
 accomplish+0.018
ble+0.018
Neg logit contributions
 Anna-0.013
 rats-0.012
Anna-0.011
 Ann-0.011
 mats-0.010
Tokenold
Token ID727
Feature activation+1.65e-02

Pos logit contributions
 Anna+0.068
 rats+0.062
 mats+0.060
Anna+0.058
 Pen+0.058
Neg logit contributions
ully-0.043
ined-0.040
 idea-0.039
asty-0.034
 shar-0.034
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
Nap+0.032
rost+0.032
Anna+0.031
 Anna+0.031
 mats+0.031
Neg logit contributions
 idea-0.013
ully-0.013
 encouraged-0.010
 replied-0.010
?-0.009
Token she
Token ID673
Feature activation+1.41e-02

Pos logit contributions
 Anna+0.019
 Ann+0.017
Anna+0.016
 rats+0.016
 Megan+0.016
Neg logit contributions
ully-0.013
 idea-0.012
ined-0.011
alk-0.011
 accomplish-0.011
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
 Anna+0.029
 rats+0.024
 Ann+0.023
 mats+0.022
 chairs+0.021
Neg logit contributions
ully-0.052
ined-0.050
 idea-0.049
 accomplish-0.048
ble-0.047
Token He
Token ID679
Feature activation+5.97e-03

Pos logit contributions
 Anna+0.020
 Ann+0.017
Anna+0.016
 rats+0.016
 Megan+0.016
Neg logit contributions
ully-0.021
 idea-0.021
ined-0.021
 accomplish-0.020
ble-0.020
Token was
Token ID373
Feature activation+2.92e-03

Pos logit contributions
 Anna+0.009
 rats+0.007
Anna+0.007
 Ann+0.007
 Pen+0.007
Neg logit contributions
ully-0.010
 idea-0.010
ble-0.009
opl-0.009
 ease-0.009
Token a
Token ID257
Feature activation-1.40e-03

Pos logit contributions
 Anna+0.005
 Ann+0.004
 rats+0.004
 Pen+0.003
Anna+0.003
Neg logit contributions
ully-0.005
 idea-0.004
ined-0.004
 accomplish-0.004
agons-0.004
Token snow
Token ID6729
Feature activation+2.19e-02

Pos logit contributions
ully+0.002
 idea+0.002
ined+0.002
 accomplish+0.002
ble+0.002
Neg logit contributions
 Anna-0.002
 rats-0.002
Anna-0.002
 Ann-0.002
 Pen-0.002
Token-
Token ID12
Feature activation+5.24e-02

Pos logit contributions
 Anna+0.035
 rats+0.030
Anna+0.030
 Ann+0.029
 Megan+0.029
Neg logit contributions
 idea-0.032
ully-0.031
ined-0.030
ble-0.029
inkle-0.026
Tokenst
Token ID301
Feature activation+1.92e-02

Pos logit contributions
 Anna+0.087
 rats+0.074
 mats+0.074
 Megan+0.070
 Eva+0.070
Neg logit contributions
ined-0.076
ully-0.074
ble-0.069
hing-0.067
orn-0.064
Tokeno
Token ID78
Feature activation+1.78e-02

Pos logit contributions
 Anna+0.027
 rats+0.023
 Ann+0.023
Anna+0.023
 Pen+0.022
Neg logit contributions
ully-0.029
 idea-0.028
 accomplish-0.027
ined-0.027
 encouraged-0.026
Tokenpper
Token ID2848
Feature activation+1.46e-02

Pos logit contributions
 Anna+0.028
 rats+0.024
Anna+0.023
 Ann+0.022
 Pen+0.022
Neg logit contributions
 idea-0.028
ully-0.027
ined-0.026
 accomplish-0.025
 ease-0.024
Token.
Token ID13
Feature activation+9.05e-03

Pos logit contributions
 Anna+0.023
 rats+0.020
Anna+0.019
 Ann+0.019
 Pen+0.018
Neg logit contributions
 idea-0.022
ined-0.021
ully-0.021
 accomplish-0.020
 ease-0.019
Token His
Token ID2399
Feature activation-1.45e-04

Pos logit contributions
 Anna+0.014
 Ann+0.011
Anna+0.010
 Jacob+0.010
 Pen+0.010
Neg logit contributions
ully-0.015
 idea-0.015
ined-0.015
 accomplish-0.015
ble-0.015
Token Lucy
Token ID22162
Feature activation+2.27e-02

Pos logit contributions
pter+0.022
 rats+0.021
Rex+0.020
 mats+0.020
Jacob+0.020
Neg logit contributions
 idea-0.020
?"-0.018
?".-0.017
 replied-0.017
asty-0.017
Token.
Token ID13
Feature activation+1.64e-02

Pos logit contributions
 Anna+0.043
 rats+0.039
 mats+0.038
Anna+0.037
 Ann+0.037
Neg logit contributions
 idea-0.025
ully-0.024
ined-0.024
ble-0.021
 accomplish-0.021
Token She
Token ID1375
Feature activation+1.18e-02

Pos logit contributions
 rats+0.028
pter+0.027
Rex+0.026
ups+0.026
heads+0.025
Neg logit contributions
 idea-0.020
ined-0.016
?'-0.014
 replied-0.014
 ease-0.014
Token was
Token ID373
Feature activation+1.72e-03

Pos logit contributions
 Anna+0.023
 rats+0.021
Anna+0.020
 Ann+0.020
 mats+0.020
Neg logit contributions
 idea-0.013
ully-0.013
ined-0.012
 accomplish-0.011
ble-0.011
Token three
Token ID1115
Feature activation+3.48e-03

Pos logit contributions
Anna+0.003
 Anna+0.003
pter+0.003
 mats+0.003
 rats+0.003
Neg logit contributions
 idea-0.002
ined-0.002
ully-0.002
 encouraged-0.002
asty-0.002
Token-
Token ID12
Feature activation+5.22e-02

Pos logit contributions
 Anna+0.005
Anna+0.004
 rats+0.004
 Ann+0.004
 Pen+0.004
Neg logit contributions
 idea-0.006
ined-0.005
ully-0.005
agons-0.005
ble-0.005
Tokenyears
Token ID19002
Feature activation-1.18e-02

Pos logit contributions
 Anna+0.090
 mats+0.082
 rats+0.080
 beds+0.077
 Jacob+0.076
Neg logit contributions
ully-0.065
ined-0.062
 encouraged-0.056
 idea-0.055
asty-0.053
Token-
Token ID12
Feature activation+3.61e-02

Pos logit contributions
 idea+0.020
ully+0.020
ined+0.020
 accomplish+0.019
ble+0.018
Neg logit contributions
 Anna-0.015
 rats-0.013
Anna-0.012
 Ann-0.012
 mats-0.012
Tokenold
Token ID727
Feature activation+1.62e-02

Pos logit contributions
 Anna+0.069
 rats+0.063
 mats+0.062
ksh+0.060
 beds+0.060
Neg logit contributions
ully-0.040
ined-0.036
 idea-0.035
opl-0.031
asty-0.031
Token and
Token ID290
Feature activation+1.00e-02

Pos logit contributions
rost+0.033
Nap+0.032
 knocks+0.031
 burns+0.031
 mats+0.031
Neg logit contributions
 idea-0.011
ully-0.010
?-0.009
 encouraged-0.009
 replied-0.008
Token loved
Token ID6151
Feature activation+4.88e-04

Pos logit contributions
 Anna+0.020
 rats+0.017
Anna+0.017
 Ann+0.017
 Megan+0.016
Neg logit contributions
ully-0.011
 idea-0.011
ined-0.010
 accomplish-0.010
ble-0.009
TokenThe
Token ID464
Feature activation-1.22e-02

Pos logit contributions
 idea+0.005
ined+0.005
ully+0.005
inkle+0.004
ble+0.004
Neg logit contributions
 Anna-0.003
Anna-0.003
 Ann-0.003
 rats-0.003
 Pen-0.002
Token wild
Token ID4295
Feature activation-7.17e-03

Pos logit contributions
ully+0.021
ined+0.020
 idea+0.019
imm+0.019
?'+0.019
Neg logit contributions
 Anna-0.017
 rats-0.014
Anna-0.014
 mats-0.013
 Pen-0.013
Token wild
Token ID4295
Feature activation-6.55e-03

Pos logit contributions
ully+0.011
 idea+0.011
ined+0.010
 accomplish+0.010
 ease+0.009
Neg logit contributions
 Anna-0.011
 rats-0.009
Anna-0.009
 Ann-0.009
 mats-0.009
Tokener
Token ID263
Feature activation-9.98e-04

Pos logit contributions
ully+0.010
 idea+0.010
ined+0.009
 accomplish+0.009
 ease+0.009
Neg logit contributions
 Anna-0.010
 rats-0.008
Anna-0.008
 mats-0.008
 Ann-0.008
Tokenbe
Token ID1350
Feature activation-1.04e-02

Pos logit contributions
agons+0.001
 accomplish+0.001
 ease+0.001
 idea+0.001
opl+0.001
Neg logit contributions
 Anna-0.002
 Ann-0.001
 Megan-0.001
 Jacob-0.001
 Pen-0.001
Tokenast
Token ID459
Feature activation+5.22e-02

Pos logit contributions
ully+0.016
 accomplish+0.015
 shar+0.015
 idea+0.014
 ease+0.014
Neg logit contributions
 Anna-0.015
Anna-0.013
pter-0.013
 Ann-0.012
 Megan-0.012
Token had
Token ID550
Feature activation-6.40e-04

Pos logit contributions
 Anna+0.082
Anna+0.067
 Ann+0.066
 Pen+0.066
 rats+0.063
Neg logit contributions
 idea-0.080
ully-0.076
 hobby-0.072
 accomplish-0.071
opl-0.068
Token to
Token ID284
Feature activation+1.06e-02

Pos logit contributions
ined+0.001
agons+0.001
ble+0.001
ute+0.001
ully+0.001
Neg logit contributions
 Anna-0.001
 Ann-0.001
 to-0.001
 eight-0.000
 rats-0.000
Token think
Token ID892
Feature activation+7.55e-03

Pos logit contributions
 Anna+0.014
 Ann+0.011
 Megan+0.010
 Pen+0.009
Anna+0.009
Neg logit contributions
ined-0.021
ully-0.020
 hobby-0.018
opl-0.018
mod-0.018
Token for
Token ID329
Feature activation-2.69e-02

Pos logit contributions
 Anna+0.010
 Ann+0.008
 rats+0.007
 minutes+0.007
 Megan+0.007
Neg logit contributions
ully-0.015
ined-0.014
 idea-0.014
agons-0.013
 accomplish-0.013
Token a
Token ID257
Feature activation-3.23e-03

Pos logit contributions
ully+0.049
 idea+0.048
ined+0.048
 accomplish+0.046
ble+0.046
Neg logit contributions
 Anna-0.035
 rats-0.028
 Ann-0.027
Anna-0.025
 Megan-0.025
Token how
Token ID703
Feature activation+1.94e-03

Pos logit contributions
 idea+0.035
ully+0.034
ined+0.032
 accomplish+0.030
inkle+0.030
Neg logit contributions
 Anna-0.024
 rats-0.020
 Ann-0.019
Anna-0.018
 Megan-0.017
Token much
Token ID881
Feature activation-2.43e-02

Pos logit contributions
 Anna+0.001
 rats+0.001
Anna+0.001
 Ann+0.001
 mats+0.001
Neg logit contributions
 idea-0.005
ully-0.005
ined-0.004
 accomplish-0.004
inkle-0.004
Token I
Token ID314
Feature activation-2.37e-02

Pos logit contributions
ully+0.048
ined+0.047
 accomplish+0.043
ble+0.042
opl+0.041
Neg logit contributions
 Anna-0.032
 rats-0.024
Anna-0.023
 mats-0.022
 Ann-0.022
Token love
Token ID1842
Feature activation+1.49e-02

Pos logit contributions
ully+0.052
ined+0.049
ute+0.049
agons+0.048
opl+0.048
Neg logit contributions
 Anna-0.020
Anna-0.016
 Frost-0.015
 bite-0.015
 burns-0.014
Token my
Token ID616
Feature activation-2.14e-02

Pos logit contributions
 Anna+0.018
 rats+0.014
Anna+0.013
 mats+0.013
 Ann+0.012
Neg logit contributions
 idea-0.030
ully-0.029
ined-0.028
 accomplish-0.028
ble-0.026
Token beans
Token ID16567
Feature activation-6.66e-02

Pos logit contributions
ined+0.039
ully+0.039
hing+0.038
opl+0.038
 accomplish+0.035
Neg logit contributions
 Anna-0.029
 rats-0.022
 plates-0.022
 mats-0.022
 chairs-0.020
Token?
Token ID30
Feature activation+6.50e-03

Pos logit contributions
ully+0.092
ined+0.087
 ease+0.080
mod+0.076
agons+0.072
Neg logit contributions
 Anna-0.122
Anna-0.099
 to-0.095
 next-0.095
's-0.093
Token They
Token ID1119
Feature activation+8.06e-03

Pos logit contributions
 Anna+0.007
 rats+0.005
Anna+0.005
 Ann+0.005
 Pen+0.005
Neg logit contributions
 idea-0.014
ully-0.013
ined-0.013
ble-0.012
 accomplish-0.012
Token are
Token ID389
Feature activation+7.15e-03

Pos logit contributions
 Anna+0.009
 rats+0.008
Anna+0.007
 Ann+0.007
 mats+0.007
Neg logit contributions
ully-0.015
ined-0.014
 idea-0.014
ble-0.014
opl-0.013
Token my
Token ID616
Feature activation+7.84e-03

Pos logit contributions
 Anna+0.010
 rats+0.009
Anna+0.008
 Ann+0.008
 mats+0.008
Neg logit contributions
ully-0.012
 idea-0.012
ined-0.011
 accomplish-0.011
ble-0.011
Token only
Token ID691
Feature activation+8.02e-03

Pos logit contributions
 Anna+0.012
 rats+0.011
 mats+0.010
 Ann+0.010
Anna+0.010
Neg logit contributions
ined-0.012
ully-0.012
ble-0.011
 accomplish-0.011
 idea-0.011
Token
Token ID198
Feature activation-2.36e-03

Pos logit contributions
 Anna+0.006
 Ann+0.005
Anna+0.005
 rats+0.005
 Megan+0.004
Neg logit contributions
ully-0.006
 idea-0.005
ined-0.005
 accomplish-0.005
ble-0.005
Token
Token ID198
Feature activation+6.27e-03

Pos logit contributions
 idea+0.004
ully+0.004
ined+0.004
 accomplish+0.003
ble+0.003
Neg logit contributions
 Anna-0.004
Anna-0.003
 Ann-0.003
 Megan-0.003
 Pen-0.003
Token"
Token ID1
Feature activation+1.30e-02

Pos logit contributions
 Anna+0.010
 rats+0.009
 Ann+0.009
 Megan+0.008
 mats+0.008
Neg logit contributions
 idea-0.009
ully-0.008
ined-0.008
 accomplish-0.008
ble-0.008
TokenWhat
Token ID2061
Feature activation+1.23e-03

Pos logit contributions
 Anna+0.017
 rats+0.014
Anna+0.014
 Ann+0.013
 mats+0.013
Neg logit contributions
 idea-0.023
ully-0.022
ined-0.022
 accomplish-0.021
ble-0.020
Token are
Token ID389
Feature activation+6.86e-04

Pos logit contributions
 Anna+0.002
 rats+0.001
Anna+0.001
 Ann+0.001
 Megan+0.001
Neg logit contributions
ully-0.002
ined-0.002
 idea-0.002
ble-0.002
 accomplish-0.002
Token peanuts
Token ID45612
Feature activation-6.49e-02

Pos logit contributions
 Anna+0.001
 rats+0.001
Anna+0.001
 Ann+0.001
 mats+0.001
Neg logit contributions
 idea-0.001
ully-0.001
ined-0.001
 accomplish-0.001
ble-0.001
Token?"
Token ID1701
Feature activation-1.11e-02

Pos logit contributions
 ease+0.077
ully+0.077
ined+0.067
 hobby+0.064
 gladly+0.061
Neg logit contributions
 Anna-0.113
 rats-0.102
head-0.097
 downstairs-0.096
Anna-0.093
Token Lily
Token ID20037
Feature activation-1.14e-02

Pos logit contributions
ully+0.023
ined+0.023
 idea+0.023
 accomplish+0.022
ble+0.022
Neg logit contributions
 Anna-0.012
 Ann-0.008
 rats-0.008
Anna-0.008
 Megan-0.008
Token asked
Token ID1965
Feature activation+1.27e-02

Pos logit contributions
ined+0.018
ully+0.016
 idea+0.016
 ease+0.016
asty+0.016
Neg logit contributions
 Anna-0.019
 Ann-0.015
Anna-0.015
 Jacob-0.015
 Pen-0.015
Token.
Token ID13
Feature activation+1.86e-02

Pos logit contributions
 Anna+0.022
 rats+0.019
 Ann+0.019
Anna+0.019
 Megan+0.018
Neg logit contributions
 idea-0.017
ully-0.017
ined-0.016
 accomplish-0.015
ble-0.015
Token
Token ID198
Feature activation+1.94e-03

Pos logit contributions
 Anna+0.028
 Ann+0.022
Anna+0.021
 Pen+0.020
 Jacob+0.019
Neg logit contributions
ined-0.033
ully-0.033
 idea-0.032
agons-0.031
inkle-0.030
Token questioned
Token ID11434
Feature activation-4.51e-03

Pos logit contributions
agons+0.044
opl+0.044
ined+0.044
 ease+0.043
asty+0.042
Neg logit contributions
 Anna-0.038
's-0.030
 Jacob-0.028
Anna-0.028
 Eva-0.027
Token.
Token ID13
Feature activation+1.86e-03

Pos logit contributions
ully+0.008
ined+0.007
 idea+0.007
 accomplish+0.007
opl+0.007
Neg logit contributions
 Anna-0.007
 Ann-0.006
 Megan-0.005
 rats-0.005
 Jacob-0.005
Token "
Token ID366
Feature activation+4.10e-03

Pos logit contributions
 Anna+0.003
 Ann+0.002
Anna+0.002
 Megan+0.002
 Jacob+0.002
Neg logit contributions
ully-0.003
ined-0.003
 idea-0.003
inkle-0.003
 accomplish-0.003
TokenWhy
Token ID5195
Feature activation+2.67e-03

Pos logit contributions
 Anna+0.005
Anna+0.005
 rats+0.004
 Ann+0.004
pter+0.004
Neg logit contributions
 idea-0.007
ully-0.007
ined-0.007
inkle-0.006
 accomplish-0.006
Token do
Token ID466
Feature activation+1.04e-02

Pos logit contributions
 Anna+0.004
 rats+0.004
 Ann+0.004
Anna+0.003
 Jacob+0.003
Neg logit contributions
ully-0.004
ined-0.004
 ease-0.004
 gol-0.004
opl-0.004
Token turtles
Token ID36288
Feature activation-6.19e-02

Pos logit contributions
 Anna+0.011
Anna+0.010
 rats+0.009
 Megan+0.009
 Ann+0.009
Neg logit contributions
 idea-0.019
ully-0.018
 accomplish-0.017
ined-0.017
ble-0.017
Token have
Token ID423
Feature activation-1.65e-02

Pos logit contributions
ully+0.114
ined+0.113
opl+0.104
 idea+0.103
mod+0.101
Neg logit contributions
 Anna-0.064
 rats-0.055
 we-0.052
 wake-0.052
 to-0.046
Token shells
Token ID19679
Feature activation-4.10e-02

Pos logit contributions
ully+0.029
ined+0.029
 idea+0.028
 accomplish+0.027
ble+0.027
Neg logit contributions
 Anna-0.022
 rats-0.020
 Ann-0.016
 mats-0.016
 minutes-0.016
Token?"
Token ID1701
Feature activation-1.92e-02

Pos logit contributions
ined+0.070
ully+0.068
 idea+0.067
opl+0.063
 ease+0.063
Neg logit contributions
 Anna-0.050
 to-0.040
 rats-0.039
 we-0.038
 mats-0.036
Token Tom
Token ID4186
Feature activation-2.80e-02

Pos logit contributions
ined+0.041
ully+0.040
 idea+0.040
ble+0.039
 accomplish+0.039
Neg logit contributions
 Anna-0.022
 Ann-0.017
 Megan-0.015
Anna-0.014
 Jacob-0.014
Token questioned
Token ID11434
Feature activation-5.43e-03

Pos logit contributions
ined+0.047
opl+0.047
agons+0.046
alk+0.045
 ease+0.045
Neg logit contributions
 Anna-0.044
's-0.035
 Jacob-0.033
 Ann-0.033
Anna-0.032
Token,
Token ID11
Feature activation-2.72e-02

Pos logit contributions
ully+0.014
 idea+0.014
ined+0.014
 accomplish+0.012
agons+0.012
Neg logit contributions
 Anna-0.015
 Ann-0.012
 rats-0.012
 Megan-0.012
Anna-0.012
Token "
Token ID366
Feature activation-4.75e-03

Pos logit contributions
ined+0.058
ully+0.057
agons+0.057
 idea+0.056
inkle+0.055
Neg logit contributions
 Anna-0.034
 Ann-0.026
 Pen-0.024
 Ana-0.023
 Megan-0.023
TokenWhat
Token ID2061
Feature activation-1.29e-03

Pos logit contributions
 idea+0.009
ully+0.009
ined+0.009
 accomplish+0.008
ble+0.008
Neg logit contributions
 Anna-0.006
 rats-0.005
Anna-0.005
 Ann-0.004
 Megan-0.004
Token kind
Token ID1611
Feature activation-3.47e-03

Pos logit contributions
ully+0.002
ined+0.002
 idea+0.002
ble+0.002
 accomplish+0.002
Neg logit contributions
 Anna-0.002
 rats-0.001
Anna-0.001
 Ann-0.001
 Megan-0.001
Token of
Token ID286
Feature activation-4.87e-03

Pos logit contributions
ined+0.007
 accomplish+0.007
 ease+0.007
ully+0.007
oving+0.006
Neg logit contributions
 Anna-0.004
head-0.003
Anna-0.003
 rats-0.003
 Ann-0.003
Token peanuts
Token ID45612
Feature activation-6.13e-02

Pos logit contributions
ined+0.008
ully+0.007
 accomplish+0.007
oving+0.007
inkle+0.007
Neg logit contributions
 Anna-0.007
 rats-0.006
 Ann-0.005
 Frost-0.005
 Megan-0.005
Token are
Token ID389
Feature activation-1.89e-02

Pos logit contributions
ully+0.088
ined+0.081
 ease+0.081
 hobby+0.078
oving+0.076
Neg logit contributions
 Anna-0.093
 rats-0.079
 we-0.074
head-0.073
Anna-0.070
Token these
Token ID777
Feature activation-2.04e-02

Pos logit contributions
ully+0.035
ined+0.035
 idea+0.034
hing+0.033
 ease+0.033
Neg logit contributions
 Anna-0.022
 rats-0.018
 plates-0.016
 we-0.016
 chairs-0.015
Token?"
Token ID1701
Feature activation-1.84e-02

Pos logit contributions
ully+0.031
ined+0.029
 ease+0.027
alk+0.027
hing+0.027
Neg logit contributions
 Anna-0.029
 plates-0.027
 rats-0.026
 next-0.026
 to-0.025
Token
Token ID198
Feature activation+1.18e-03

Pos logit contributions
agons+0.038
 idea+0.037
ully+0.037
ined+0.036
ble+0.036
Neg logit contributions
 Anna-0.025
 Ann-0.019
 Ana-0.017
 Pen-0.016
 Megan-0.016
Token
Token ID198
Feature activation+7.15e-03

Pos logit contributions
 Anna+0.002
Anna+0.001
 Ann+0.001
 Pen+0.001
Ann+0.001
Neg logit contributions
 idea-0.002
ully-0.002
ined-0.002
agons-0.002
 accomplish-0.002
TokenWhy
Token ID5195
Feature activation+5.68e-03

Pos logit contributions
 Anna+0.007
 rats+0.006
 Ann+0.006
 mats+0.006
 Megan+0.006
Neg logit contributions
ully-0.007
ined-0.007
 idea-0.006
 accomplish-0.006
ble-0.006
Token do
Token ID466
Feature activation+1.20e-02

Pos logit contributions
 Anna+0.008
 rats+0.006
 Ann+0.006
Anna+0.006
 Pen+0.005
Neg logit contributions
ully-0.010
ined-0.010
 idea-0.009
agons-0.009
opl-0.009
Token you
Token ID345
Feature activation-2.08e-02

Pos logit contributions
ksh+0.017
rost+0.017
pter+0.016
 beds+0.016
 downstairs+0.016
Neg logit contributions
ully-0.016
 accomplish-0.016
 idea-0.016
 replied-0.015
?".-0.014
Token look
Token ID804
Feature activation-2.36e-02

Pos logit contributions
ined+0.042
ble+0.039
ully+0.039
ently+0.038
ute+0.036
Neg logit contributions
 Anna-0.020
|-0.015
 wake-0.015
 bit-0.014
 sigh-0.014
Token so
Token ID523
Feature activation-1.07e-02

Pos logit contributions
ully+0.041
ined+0.041
 idea+0.038
 accomplish+0.038
ble+0.037
Neg logit contributions
 Anna-0.034
 rats-0.027
 Ann-0.026
Anna-0.024
 downstairs-0.024
Token angry
Token ID7954
Feature activation-6.08e-02

Pos logit contributions
 idea+0.016
ully+0.016
ined+0.015
 accomplish+0.014
ble+0.014
Neg logit contributions
 Anna-0.016
 rats-0.014
Anna-0.014
 Ann-0.014
 mats-0.013
Token?"
Token ID1701
Feature activation-2.00e-02

Pos logit contributions
ully+0.072
ined+0.068
 ease+0.066
 idea+0.065
 accomplish+0.064
Neg logit contributions
 Anna-0.122
 rats-0.105
 Ann-0.099
head-0.098
Anna-0.097
Token
Token ID220
Feature activation-1.66e-02

Pos logit contributions
 idea+0.029
ully+0.029
ined+0.028
 accomplish+0.026
ble+0.026
Neg logit contributions
 Anna-0.032
 rats-0.027
Anna-0.027
 Ann-0.026
 Megan-0.025
Token
Token ID198
Feature activation-7.14e-03

Pos logit contributions
ully+0.028
ined+0.028
 idea+0.027
agons+0.026
 hobby+0.026
Neg logit contributions
 Anna-0.024
Anna-0.019
 Ann-0.019
 Pen-0.018
 Megan-0.018
Token
Token ID198
Feature activation-3.60e-04

Pos logit contributions
 idea+0.010
ully+0.010
ined+0.010
 accomplish+0.009
ble+0.009
Neg logit contributions
 Anna-0.012
Anna-0.010
 rats-0.010
 Ann-0.010
 Megan-0.010
Token"
Token ID1
Feature activation+2.07e-03

Pos logit contributions
 idea+0.001
ully+0.001
ined+0.001
 accomplish+0.001
ble+0.001
Neg logit contributions
 Anna-0.000
 rats-0.000
 Ann-0.000
 Megan-0.000
pter-0.000
Token "
Token ID366
Feature activation-7.94e-03

Pos logit contributions
ined+0.044
ully+0.043
opl+0.043
agons+0.042
 idea+0.042
Neg logit contributions
 Anna-0.024
 Ann-0.017
 Pen-0.017
 Ana-0.016
 Jacob-0.015
TokenWhat
Token ID2061
Feature activation-1.36e-03

Pos logit contributions
 idea+0.015
ully+0.014
ined+0.014
 accomplish+0.013
ble+0.013
Neg logit contributions
 Anna-0.009
 rats-0.008
Anna-0.008
 Ann-0.007
 Megan-0.007
Token makes
Token ID1838
Feature activation+1.06e-02

Pos logit contributions
ully+0.003
ined+0.003
 idea+0.002
ble+0.002
inkle+0.002
Neg logit contributions
 Anna-0.002
 rats-0.001
Anna-0.001
 Ann-0.001
 Pen-0.001
Token pe
Token ID613
Feature activation-2.67e-02

Pos logit contributions
 Anna+0.012
 rats+0.009
 Pen+0.008
 Ann+0.008
 Jacob+0.008
Neg logit contributions
ully-0.021
ined-0.020
 idea-0.019
 hobby-0.019
agons-0.018
Tokenb
Token ID65
Feature activation-3.15e-02

Pos logit contributions
ully+0.031
 idea+0.029
 gol+0.028
ined+0.028
 hobby+0.027
Neg logit contributions
 Anna-0.041
 rats-0.041
pter-0.039
 Megan-0.037
 Ann-0.037
Tokenbles
Token ID7689
Feature activation-6.00e-02

Pos logit contributions
 gol+0.021
 ease+0.021
ully+0.021
ined+0.020
 hobby+0.020
Neg logit contributions
 Anna-0.061
 Ann-0.057
 rats-0.057
 Frost-0.057
pter-0.056
Token special
Token ID2041
Feature activation-1.75e-02

Pos logit contributions
ully+0.101
ined+0.100
opl+0.090
mod+0.089
agons+0.089
Neg logit contributions
 Anna-0.065
 we-0.060
 to-0.057
 rats-0.055
head-0.054
Token?"
Token ID1701
Feature activation-1.86e-02

Pos logit contributions
ully+0.028
ined+0.025
 ease+0.024
agons+0.023
opl+0.023
Neg logit contributions
 Anna-0.026
 to-0.022
 we-0.021
 Frost-0.021
 rats-0.021
Token His
Token ID2399
Feature activation-5.49e-03

Pos logit contributions
ined+0.031
agons+0.030
 idea+0.030
ble+0.029
ully+0.028
Neg logit contributions
 Anna-0.028
 Ann-0.024
 Jacob-0.023
 Ana-0.022
 Anne-0.022
Token mother
Token ID2802
Feature activation-1.99e-02

Pos logit contributions
ined+0.011
 accomplish+0.010
hing+0.010
ully+0.010
asty+0.010
Neg logit contributions
 Anna-0.006
 rats-0.005
 Ann-0.004
 Ana-0.004
 Pen-0.004
Token replied
Token ID8712
Feature activation-7.27e-03

Pos logit contributions
 idea+0.024
ully+0.024
agons+0.023
 accomplish+0.023
opl+0.023
Neg logit contributions
 Anna-0.036
 Ann-0.031
Anna-0.031
 Megan-0.030
 Ana-0.030
Token.
Token ID13
Feature activation+4.68e-03

Pos logit contributions
ully+0.012
ined+0.011
 idea+0.011
opl+0.010
 accomplish+0.010
Neg logit contributions
 Anna-0.008
 rats-0.007
 Megan-0.006
 Ann-0.006
Anna-0.006
Token "
Token ID366
Feature activation+4.59e-03

Pos logit contributions
 Anna+0.007
 rats+0.006
 Ann+0.006
Anna+0.006
 Megan+0.006
Neg logit contributions
 idea-0.007
ully-0.007
ined-0.007
 accomplish-0.007
ble-0.006
TokenWhat
Token ID2061
Feature activation+3.08e-03

Pos logit contributions
 Anna+0.006
 rats+0.005
 Ann+0.005
 mats+0.005
Anna+0.005
Neg logit contributions
 idea-0.008
ully-0.008
ined-0.008
 accomplish-0.007
ble-0.007
Token are
Token ID389
Feature activation+5.47e-03

Pos logit contributions
 Anna+0.005
 rats+0.004
Anna+0.004
 mats+0.004
 Ann+0.004
Neg logit contributions
 idea-0.005
ully-0.004
ined-0.004
 accomplish-0.004
ble-0.004
Token you
Token ID345
Feature activation-2.96e-02

Pos logit contributions
 Anna+0.008
Anna+0.007
 Ann+0.007
Ann+0.007
Jacob+0.007
Neg logit contributions
ully-0.008
 idea-0.008
ble-0.008
?",-0.008
 gem-0.008
Token burying
Token ID48360
Feature activation-5.91e-02

Pos logit contributions
ully+0.047
ined+0.044
 gladly+0.044
opl+0.042
 idea+0.041
Neg logit contributions
 Anna-0.039
 rats-0.034
 downstairs-0.030
 minutes-0.029
 Frost-0.029
Token?"
Token ID1701
Feature activation-2.37e-02

Pos logit contributions
ully+0.078
opl+0.071
ined+0.069
 idea+0.069
 accomplish+0.066
Neg logit contributions
 Anna-0.106
 rats-0.091
 downstairs-0.084
 plates-0.082
 Frost-0.082
Token asked
Token ID1965
Feature activation-3.01e-03

Pos logit contributions
 idea+0.029
ully+0.028
ined+0.028
 accomplish+0.025
ble+0.024
Neg logit contributions
 Anna-0.044
 rats-0.039
 Ann-0.038
Anna-0.037
 Pen-0.037
Token the
Token ID262
Feature activation-7.26e-03

Pos logit contributions
 idea+0.006
ully+0.006
ined+0.006
 accomplish+0.006
opl+0.006
Neg logit contributions
 Anna-0.004
 Pen-0.002
 Ann-0.002
 rats-0.002
 Frost-0.002
Token bird
Token ID6512
Feature activation-1.03e-02

Pos logit contributions
 idea+0.011
ully+0.010
ined+0.009
ble+0.009
 replied+0.008
Neg logit contributions
 Anna-0.012
Anna-0.010
 Ann-0.010
 rats-0.010
 mats-0.010
Token.
Token ID13
Feature activation+3.72e-03

Pos logit contributions
 idea+0.016
ully+0.016
ined+0.016
agons+0.015
 accomplish+0.015
Neg logit contributions
 Anna-0.015
 Ann-0.012
 rats-0.012
 Megan-0.012
 Ana-0.012
Token.
Token ID13
Feature activation-3.64e-03

Pos logit contributions
ully+0.024
 idea+0.024
ined+0.024
 accomplish+0.023
agons+0.022
Neg logit contributions
 Anna-0.027
 rats-0.023
 Ann-0.023
Anna-0.022
 Megan-0.022
Token "
Token ID366
Feature activation+6.98e-03

Pos logit contributions
ined+0.007
 idea+0.007
ully+0.007
agons+0.007
inkle+0.007
Neg logit contributions
 Anna-0.005
 Ann-0.004
 Pen-0.004
 Ana-0.004
 Megan-0.003
TokenAre
Token ID8491
Feature activation+1.02e-02

Pos logit contributions
 Anna+0.009
 rats+0.008
Anna+0.007
 Ann+0.007
 mats+0.007
Neg logit contributions
 idea-0.012
ully-0.012
ined-0.012
 accomplish-0.011
ble-0.011
Token those
Token ID883
Feature activation-6.06e-03

Pos logit contributions
 Anna+0.013
Anna+0.013
 mats+0.013
heads+0.012
pter+0.012
Neg logit contributions
 accomplish-0.014
ully-0.014
 idea-0.014
 Sammy-0.013
ble-0.013
Token delicious
Token ID12625
Feature activation-3.86e-02

Pos logit contributions
ully+0.011
ined+0.010
 idea+0.010
 accomplish+0.010
ble+0.009
Neg logit contributions
 Anna-0.009
 rats-0.007
 Ann-0.007
 Megan-0.007
 Pen-0.007
Token berries
Token ID36322
Feature activation-5.88e-02

Pos logit contributions
ully+0.069
ined+0.065
 accomplish+0.064
 ease+0.062
ble+0.061
Neg logit contributions
 Anna-0.052
 rats-0.046
 Ann-0.040
 Megan-0.039
 Jacob-0.038
Token for
Token ID329
Feature activation-6.53e-03

Pos logit contributions
ully+0.090
ined+0.080
 gol+0.079
 idea+0.076
 ease+0.075
Neg logit contributions
 Anna-0.090
 rats-0.077
 we-0.074
 to-0.071
 Megan-0.069
Token me
Token ID502
Feature activation-2.66e-02

Pos logit contributions
ully+0.014
 idea+0.014
ined+0.014
opl+0.013
 accomplish+0.013
Neg logit contributions
 Anna-0.007
 rats-0.005
 Ann-0.005
 Pen-0.004
 Megan-0.004
Token?"
Token ID1701
Feature activation-2.40e-02

Pos logit contributions
ully+0.029
ined+0.027
 idea+0.026
 ease+0.026
agons+0.025
Neg logit contributions
 Anna-0.053
 rats-0.045
 Ann-0.045
 to-0.044
 Megan-0.044
Token
Token ID198
Feature activation-2.52e-03

Pos logit contributions
agons+0.049
ined+0.048
 idea+0.047
ully+0.047
opl+0.046
Neg logit contributions
 Anna-0.032
 Pen-0.024
 Ana-0.023
 Ann-0.023
 Jacob-0.021
Token
Token ID198
Feature activation-1.11e-03

Pos logit contributions
 idea+0.005
ined+0.004
ully+0.004
agons+0.004
 accomplish+0.004
Neg logit contributions
 Anna-0.004
Anna-0.003
 Pen-0.003
 Ann-0.003
 Ana-0.003
Token to
Token ID284
Feature activation+1.12e-02

Pos logit contributions
 Anna+0.021
Anna+0.018
 rats+0.018
pter+0.017
 mats+0.016
Neg logit contributions
 idea-0.040
ully-0.039
ined-0.037
 shar-0.036
 accomplish-0.036
Token worry
Token ID5490
Feature activation-2.25e-03

Pos logit contributions
 Anna+0.014
 rats+0.012
 Ann+0.011
Anna+0.011
 Megan+0.011
Neg logit contributions
ined-0.020
ully-0.020
 idea-0.020
ble-0.019
inkle-0.019
Token.
Token ID13
Feature activation-1.30e-03

Pos logit contributions
ully+0.003
ined+0.003
 idea+0.003
 accomplish+0.003
 ease+0.003
Neg logit contributions
 Anna-0.004
 rats-0.003
 Ann-0.003
 Megan-0.003
 Jacob-0.003
Token What
Token ID1867
Feature activation+9.23e-03

Pos logit contributions
 idea+0.002
ully+0.002
ined+0.002
 accomplish+0.002
ble+0.002
Neg logit contributions
 Anna-0.002
 rats-0.002
Anna-0.002
 mats-0.002
 Ann-0.002
Token should
Token ID815
Feature activation-5.63e-03

Pos logit contributions
 Anna+0.016
 rats+0.014
Anna+0.014
 Ann+0.014
 Megan+0.013
Neg logit contributions
 idea-0.012
ully-0.012
ined-0.011
 accomplish-0.011
ble-0.011
Token he
Token ID339
Feature activation-5.84e-02

Pos logit contributions
 idea+0.007
ully+0.007
 accomplish+0.006
 replied+0.006
ble+0.006
Neg logit contributions
ksh-0.009
heads-0.009
pter-0.009
 mats-0.008
 minutes-0.008
Token do
Token ID466
Feature activation-2.82e-02

Pos logit contributions
ined+0.113
ully+0.112
ute+0.108
alk+0.107
 ease+0.107
Neg logit contributions
 Anna-0.047
 wake-0.044
 knocks-0.042
 burns-0.042
's-0.042
Token?
Token ID30
Feature activation-5.26e-03

Pos logit contributions
ully+0.027
ined+0.023
 ease+0.023
 idea+0.022
opl+0.021
Neg logit contributions
 Anna-0.060
Anna-0.054
 rats-0.051
 Ann-0.050
 mats-0.050
Token He
Token ID679
Feature activation-1.58e-02

Pos logit contributions
 idea+0.009
ully+0.009
ined+0.008
 accomplish+0.008
ble+0.008
Neg logit contributions
 Anna-0.007
 rats-0.006
Anna-0.006
 Ann-0.006
 mats-0.006
Token was
Token ID373
Feature activation+7.03e-04

Pos logit contributions
 idea+0.027
ined+0.026
opl+0.026
ully+0.026
 accomplish+0.026
Neg logit contributions
 Anna-0.022
 rats-0.019
Anna-0.018
 Ann-0.018
 Frost-0.016
Token scared
Token ID12008
Feature activation-1.14e-02

Pos logit contributions
 Anna+0.001
Anna+0.001
 rats+0.001
 mats+0.001
pter+0.001
Neg logit contributions
 idea-0.001
ined-0.001
ully-0.001
ble-0.001
 encouraged-0.001
Token,
Token ID11
Feature activation-1.44e-02

Pos logit contributions
 idea+0.001
ully+0.001
ined+0.001
 accomplish+0.001
ble+0.001
Neg logit contributions
 Anna-0.002
 rats-0.001
 Ann-0.001
Anna-0.001
 Megan-0.001
Token "
Token ID366
Feature activation-5.44e-03

Pos logit contributions
ully+0.030
ined+0.029
agons+0.029
 idea+0.029
opl+0.029
Neg logit contributions
 Anna-0.018
 Ann-0.013
 Pen-0.013
 Frost-0.012
 Ana-0.012
TokenIs
Token ID3792
Feature activation+2.52e-02

Pos logit contributions
ully+0.009
 idea+0.009
ined+0.009
 accomplish+0.008
ble+0.008
Neg logit contributions
 Anna-0.007
 rats-0.006
 Ann-0.006
 mats-0.006
 Megan-0.006
Token this
Token ID428
Feature activation-5.47e-03

Pos logit contributions
 Anna+0.034
Anna+0.030
 rats+0.030
 mats+0.029
pter+0.029
Neg logit contributions
ully-0.039
 idea-0.039
ined-0.036
ble-0.035
 accomplish-0.035
Token your
Token ID534
Feature activation-1.20e-03

Pos logit contributions
ined+0.010
ully+0.010
 accomplish+0.009
inkle+0.009
ble+0.009
Neg logit contributions
 Anna-0.008
 Ann-0.006
 mats-0.006
Anna-0.006
 rats-0.006
Token pants
Token ID12581
Feature activation-5.83e-02

Pos logit contributions
ined+0.002
hing+0.002
inkle+0.002
 accomplish+0.002
ully+0.002
Neg logit contributions
 Anna-0.001
 mats-0.001
 Ann-0.001
 chairs-0.001
 rats-0.001
Token?"
Token ID1701
Feature activation-1.95e-02

Pos logit contributions
ully+0.099
alk+0.086
ined+0.085
 gol+0.084
 idea+0.083
Neg logit contributions
 Anna-0.076
 back-0.061
 rats-0.059
head-0.058
 downstairs-0.058
Token
Token ID198
Feature activation-2.29e-03

Pos logit contributions
 idea+0.040
ully+0.039
ined+0.039
ble+0.037
agons+0.037
Neg logit contributions
 Anna-0.023
 Ann-0.017
Anna-0.017
 Pen-0.016
 rats-0.016
Token
Token ID198
Feature activation+2.00e-03

Pos logit contributions
 idea+0.003
ully+0.003
ined+0.003
ble+0.003
 accomplish+0.003
Neg logit contributions
 Anna-0.004
Anna-0.003
 Ann-0.003
 rats-0.003
 Megan-0.003
TokenThe
Token ID464
Feature activation-1.46e-02

Pos logit contributions
 Anna+0.003
 rats+0.003
 mats+0.003
 Ann+0.003
 Megan+0.003
Neg logit contributions
ully-0.003
 idea-0.003
ined-0.003
 accomplish-0.003
 shar-0.002
Token little
Token ID1310
Feature activation-2.14e-02

Pos logit contributions
 idea+0.025
ully+0.025
ined+0.025
 accomplish+0.023
ble+0.023
Neg logit contributions
 Anna-0.019
 rats-0.016
Anna-0.015
 Ann-0.015
 mats-0.014
Token what
Token ID644
Feature activation+8.26e-03

Pos logit contributions
ully+0.063
 idea+0.062
ined+0.061
 accomplish+0.057
agons+0.057
Neg logit contributions
 Anna-0.045
 rats-0.036
 Ann-0.035
Anna-0.033
 Jacob-0.032
Token we
Token ID356
Feature activation-3.88e-02

Pos logit contributions
 Anna+0.009
 rats+0.008
Anna+0.007
 Ann+0.007
 Pen+0.007
Neg logit contributions
ully-0.016
 idea-0.016
ined-0.016
 accomplish-0.015
ble-0.015
Token can
Token ID460
Feature activation-2.06e-02

Pos logit contributions
ined+0.078
ully+0.075
ble+0.074
ute+0.072
ently+0.071
Neg logit contributions
 Anna-0.040
 rats-0.033
 downstairs-0.029
Anna-0.029
 Ann-0.027
Token learn
Token ID2193
Feature activation-2.74e-02

Pos logit contributions
ined+0.038
ully+0.036
opl+0.032
 gol+0.032
imm+0.032
Neg logit contributions
 Anna-0.024
 downstairs-0.018
 rats-0.018
 wake-0.018
 Jacob-0.018
Token about
Token ID546
Feature activation-3.31e-02

Pos logit contributions
ully+0.034
ined+0.030
agons+0.029
 idea+0.028
 ease+0.027
Neg logit contributions
 Anna-0.055
 Ann-0.046
 rats-0.046
Anna-0.045
 mats-0.044
Token crabs
Token ID49172
Feature activation-5.83e-02

Pos logit contributions
 idea+0.043
ully+0.042
ined+0.039
agons+0.037
 accomplish+0.036
Neg logit contributions
 Anna-0.064
 rats-0.056
 Frost-0.052
 Ann-0.052
Anna-0.050
Token?"
Token ID1701
Feature activation-1.83e-02

Pos logit contributions
ully+0.064
ined+0.064
 ease+0.061
opl+0.055
asty+0.054
Neg logit contributions
 Anna-0.114
 rats-0.103
Anna-0.095
 mats-0.091
 we-0.091
Token her
Token ID607
Feature activation+6.21e-03

Pos logit contributions
ined+0.031
 idea+0.031
ully+0.030
 accomplish+0.029
ble+0.029
Neg logit contributions
 Anna-0.027
 Ann-0.022
Anna-0.021
 rats-0.020
 Pen-0.020
Token mom
Token ID1995
Feature activation-2.13e-02

Pos logit contributions
 Anna+0.006
 rats+0.005
 Ann+0.005
Anna+0.005
 Megan+0.004
Neg logit contributions
ully-0.013
 idea-0.013
ined-0.012
 accomplish-0.012
ble-0.012
Token replied
Token ID8712
Feature activation-8.22e-05

Pos logit contributions
 idea+0.024
ully+0.023
 accomplish+0.022
ined+0.022
agons+0.021
Neg logit contributions
 Anna-0.041
 Ann-0.036
Anna-0.035
 rats-0.034
 Megan-0.034
Token.
Token ID13
Feature activation-4.71e-04

Pos logit contributions
 idea+0.000
ined+0.000
ully+0.000
ble+0.000
 replied+0.000
Neg logit contributions
 Anna-0.000
 rats-0.000
Anna-0.000
 mats-0.000
Ann-0.000
Token you
Token ID345
Feature activation-2.19e-02

Pos logit contributions
 Anna+0.014
 rats+0.013
ksh+0.012
pter+0.012
Anna+0.012
Neg logit contributions
ully-0.019
 idea-0.019
 accomplish-0.018
agons-0.017
ble-0.017
Token know
Token ID760
Feature activation-2.14e-02

Pos logit contributions
ined+0.038
ully+0.036
ble+0.035
opl+0.033
agons+0.033
Neg logit contributions
 Anna-0.030
 rats-0.024
 Ann-0.024
Anna-0.023
 Megan-0.022
Token what
Token ID644
Feature activation+1.38e-03

Pos logit contributions
 idea+0.039
ully+0.039
ined+0.038
 accomplish+0.034
inkle+0.034
Neg logit contributions
 Anna-0.029
 rats-0.024
 Ann-0.023
Anna-0.021
 Pen-0.021
Token colour
Token ID9568
Feature activation-2.01e-02

Pos logit contributions
 Anna+0.002
 rats+0.001
 Ann+0.001
Anna+0.001
 Pen+0.001
Neg logit contributions
ully-0.003
ined-0.003
 idea-0.003
inkle-0.003
 accomplish-0.003
Token my
Token ID616
Feature activation-2.29e-02

Pos logit contributions
ully+0.034
 idea+0.032
ined+0.031
 gol+0.031
 ease+0.030
Neg logit contributions
 Anna-0.028
 rats-0.022
 Ann-0.022
Anna-0.021
 mats-0.020
Token castle
Token ID16669
Feature activation-5.83e-02

Pos logit contributions
ully+0.042
ined+0.039
umper+0.036
 encouraged+0.036
 accomplish+0.036
Neg logit contributions
 Anna-0.032
 rats-0.027
 chairs-0.025
 mats-0.025
 Ana-0.025
Token is
Token ID318
Feature activation-4.15e-02

Pos logit contributions
ully+0.088
 ease+0.079
ined+0.077
 gol+0.075
agons+0.074
Neg logit contributions
 Anna-0.092
 rats-0.076
 Ann-0.075
 knocks-0.073
Anna-0.070
Token?"
Token ID1701
Feature activation-2.31e-02

Pos logit contributions
ully+0.068
 ease+0.060
agons+0.059
 gol+0.058
ined+0.058
Neg logit contributions
 to-0.061
 Anna-0.058
 next-0.054
 downstairs-0.050
 living-0.048
Token
Token ID198
Feature activation-5.39e-03

Pos logit contributions
 idea+0.038
ully+0.038
ined+0.037
 accomplish+0.035
ble+0.035
Neg logit contributions
 Anna-0.032
 rats-0.027
Anna-0.026
 Ann-0.026
 Megan-0.025
TokenEl
Token ID9527
Feature activation+1.55e-04

Pos logit contributions
 idea+0.008
ully+0.008
ined+0.007
 accomplish+0.007
ble+0.007
Neg logit contributions
 Anna-0.009
Anna-0.008
 Ann-0.008
 rats-0.007
 Megan-0.007
Tokeniza
Token ID23638
Feature activation-8.92e-03

Pos logit contributions
 Anna+0.000
 rats+0.000
Anna+0.000
 Ann+0.000
 mats+0.000
Neg logit contributions
 idea-0.000
ully-0.000
ined-0.000
 accomplish-0.000
ble-0.000
Token.
Token ID13
Feature activation+1.79e-03

Pos logit contributions
 idea+0.042
ully+0.042
ined+0.041
 accomplish+0.038
ble+0.038
Neg logit contributions
 Anna-0.044
 rats-0.039
 Ann-0.036
 mats-0.036
Anna-0.035
Token Her
Token ID2332
Feature activation+2.10e-03

Pos logit contributions
 Anna+0.003
 rats+0.002
 Ann+0.002
Anna+0.002
 mats+0.002
Neg logit contributions
 idea-0.003
ully-0.003
ined-0.003
ble-0.003
 accomplish-0.003
Token mom
Token ID1995
Feature activation-7.17e-03

Pos logit contributions
 Anna+0.003
 rats+0.002
Anna+0.002
 Pen+0.002
 Megan+0.002
Neg logit contributions
 idea-0.004
ully-0.003
ined-0.003
ble-0.003
 hobby-0.003
Token explained
Token ID4893
Feature activation+6.16e-03

Pos logit contributions
 idea+0.011
agons+0.010
ully+0.010
alk+0.010
ble+0.009
Neg logit contributions
 Anna-0.012
 Ann-0.010
Anna-0.010
 rats-0.009
 Jacob-0.009
Token how
Token ID703
Feature activation+9.48e-03

Pos logit contributions
 Anna+0.008
 rats+0.007
Anna+0.007
 Ann+0.007
 Megan+0.006
Neg logit contributions
 idea-0.011
ully-0.010
ined-0.010
 accomplish-0.009
ble-0.009
Token crabs
Token ID49172
Feature activation-5.82e-02

Pos logit contributions
 Anna+0.013
 rats+0.011
Anna+0.011
 Ann+0.010
 mats+0.010
Neg logit contributions
 idea-0.016
ined-0.016
ully-0.015
inkle-0.015
ble-0.015
Token have
Token ID423
Feature activation-2.10e-02

Pos logit contributions
ully+0.105
opl+0.098
ined+0.097
 gol+0.094
?'+0.094
Neg logit contributions
 Anna-0.065
 rats-0.058
 mats-0.048
 smells-0.047
 Frost-0.047
Token hard
Token ID1327
Feature activation-1.13e-02

Pos logit contributions
ined+0.040
ble+0.038
ully+0.038
inkle+0.038
 idea+0.038
Neg logit contributions
 Anna-0.025
 rats-0.022
 mats-0.018
 Ann-0.018
 chairs-0.017
Token shells
Token ID19679
Feature activation-3.50e-02

Pos logit contributions
ully+0.023
ined+0.023
ble+0.023
 idea+0.022
 accomplish+0.022
Neg logit contributions
 Anna-0.009
 rats-0.009
 mats-0.008
 minutes-0.007
 chairs-0.007
Token to
Token ID284
Feature activation-2.94e-03

Pos logit contributions
ined+0.065
ully+0.063
 idea+0.063
oving+0.060
 accomplish+0.059
Neg logit contributions
 Anna-0.040
 rats-0.032
 mats-0.031
 to-0.029
 Ann-0.028
Token protect
Token ID1805
Feature activation+3.42e-03

Pos logit contributions
ined+0.006
ully+0.006
 idea+0.005
opl+0.005
ute+0.005
Neg logit contributions
 Anna-0.003
 rats-0.003
 mats-0.002
 Frost-0.002
 Ann-0.002
Token Can
Token ID1680
Feature activation+1.49e-02

Pos logit contributions
 Anna+0.000
 Ann+0.000
Anna+0.000
 rats+0.000
 Pen+0.000
Neg logit contributions
 idea-0.000
ined-0.000
ully-0.000
ble-0.000
 accomplish-0.000
Token I
Token ID314
Feature activation-3.38e-02

Pos logit contributions
 rats+0.018
 Anna+0.018
 mats+0.018
 beds+0.017
 minutes+0.017
Neg logit contributions
ully-0.022
?".-0.022
?",-0.022
 accomplish-0.022
 replied-0.022
Token play
Token ID711
Feature activation-2.74e-02

Pos logit contributions
ined+0.074
 idea+0.067
ully+0.067
opl+0.066
ently+0.065
Neg logit contributions
 Anna-0.030
 Frost-0.024
 wake-0.023
Anna-0.023
 we-0.023
Token with
Token ID351
Feature activation-1.12e-02

Pos logit contributions
ully+0.057
 idea+0.054
ined+0.053
 ease+0.051
opl+0.051
Neg logit contributions
 Anna-0.031
Anna-0.023
 rats-0.021
 Ann-0.021
 mats-0.020
Token my
Token ID616
Feature activation-2.10e-02

Pos logit contributions
 idea+0.025
ully+0.025
ined+0.024
 accomplish+0.023
ble+0.023
Neg logit contributions
 Anna-0.010
 rats-0.008
Anna-0.007
 Ann-0.007
 mats-0.006
Token spear
Token ID17514
Feature activation-5.81e-02

Pos logit contributions
ined+0.030
 accomplish+0.025
ully+0.025
asty+0.025
imm+0.023
Neg logit contributions
 Anna-0.036
 rats-0.033
 mats-0.030
 chairs-0.030
 plates-0.029
Token?
Token ID30
Feature activation-3.97e-03

Pos logit contributions
ully+0.073
 ease+0.069
agons+0.068
ined+0.068
leness+0.064
Neg logit contributions
 Anna-0.112
 back-0.090
 to-0.089
Anna-0.088
 rats-0.088
Tokenâ
Token ID22940
Feature activation-1.46e-03

Pos logit contributions
 idea+0.009
ined+0.009
ully+0.009
ble+0.008
inkle+0.008
Neg logit contributions
 Anna-0.004
 Ann-0.003
Anna-0.003
 Pen-0.002
 Ana-0.002
Token
Token ID26391
Feature activation-9.56e-03

Pos logit contributions
 idea+0.002
ully+0.002
ined+0.002
ble+0.002
 ease+0.002
Neg logit contributions
 Anna-0.002
Anna-0.002
 Ann-0.002
 rats-0.002
 Megan-0.002
Token she
Token ID673
Feature activation-2.42e-03

Pos logit contributions
 idea+0.010
ully+0.010
?!"+0.009
 accomplish+0.009
?",+0.009
Neg logit contributions
 Anna-0.018
 rats-0.017
 mats-0.016
 Megan-0.016
 Pen-0.016
Token asked
Token ID1965
Feature activation-2.46e-03

Pos logit contributions
ully+0.005
ined+0.004
alk+0.004
ble+0.004
agons+0.004
Neg logit contributions
 Anna-0.003
 Ann-0.002
 Pen-0.002
Anna-0.002
 Ana-0.002
Token are
Token ID389
Feature activation+5.39e-03

Pos logit contributions
 Anna+0.002
 rats+0.002
Anna+0.002
 Ann+0.002
 mats+0.002
Neg logit contributions
 idea-0.002
ully-0.002
ined-0.002
 accomplish-0.002
ble-0.002
Token you
Token ID345
Feature activation-1.85e-02

Pos logit contributions
ksh+0.007
 Anna+0.007
pter+0.007
Anna+0.007
heads+0.006
Neg logit contributions
ully-0.008
 idea-0.008
?-0.007
agons-0.007
 accomplish-0.007
Token doing
Token ID1804
Feature activation-2.58e-02

Pos logit contributions
ully+0.029
 gladly+0.028
 ease+0.027
ined+0.026
alk+0.026
Neg logit contributions
 Anna-0.028
 rats-0.022
 Ann-0.021
 Jacob-0.021
Anna-0.021
Token with
Token ID351
Feature activation-2.02e-03

Pos logit contributions
ully+0.022
 idea+0.021
ined+0.020
 ease+0.018
 accomplish+0.017
Neg logit contributions
 Anna-0.057
 rats-0.051
Anna-0.051
 Ann-0.050
 Megan-0.049
Token my
Token ID616
Feature activation-1.20e-02

Pos logit contributions
 idea+0.004
ully+0.004
ined+0.004
 accomplish+0.004
ble+0.004
Neg logit contributions
 Anna-0.002
 rats-0.002
 Ann-0.002
Anna-0.002
 mats-0.002
Token beans
Token ID16567
Feature activation-5.81e-02

Pos logit contributions
ined+0.022
ully+0.021
hing+0.020
 accomplish+0.019
opl+0.019
Neg logit contributions
 Anna-0.017
 mats-0.014
 rats-0.014
 plates-0.013
 chairs-0.013
Token?"
Token ID1701
Feature activation-2.21e-02

Pos logit contributions
ully+0.068
ined+0.064
 ease+0.063
mod+0.059
asty+0.054
Neg logit contributions
 Anna-0.110
 downstairs-0.093
Anna-0.092
 rats-0.090
 to-0.089
Token It
Token ID632
Feature activation+1.65e-02

Pos logit contributions
ined+0.041
ully+0.040
 idea+0.040
agons+0.037
ble+0.037
Neg logit contributions
 Anna-0.030
 rats-0.024
 Ann-0.023
Anna-0.022
 Ana-0.021
Token was
Token ID373
Feature activation+1.04e-02

Pos logit contributions
 Anna+0.027
 rats+0.024
Anna+0.023
 Ann+0.023
 mats+0.022
Neg logit contributions
 idea-0.023
ully-0.022
ined-0.022
ble-0.020
 accomplish-0.020
Token Mrs
Token ID9074
Feature activation+4.58e-03

Pos logit contributions
 Anna+0.014
 rats+0.011
 Ann+0.011
Anna+0.011
 Megan+0.010
Neg logit contributions
ully-0.018
 idea-0.018
ined-0.018
 accomplish-0.017
ble-0.016
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
 Anna+0.008
 mats+0.008
Anna+0.008
Nap+0.007
 rats+0.007
Neg logit contributions
ully-0.005
 idea-0.005
ble-0.004
?-0.004
ined-0.004
Token you
Token ID345
Feature activation-8.80e-03

Pos logit contributions
 Anna+0.032
pter+0.031
ksh+0.031
Anna+0.030
 Megan+0.030
Neg logit contributions
 idea-0.046
 accomplish-0.043
ully-0.043
ble-0.042
 shar-0.040
Token ruin
Token ID16866
Feature activation-3.39e-02

Pos logit contributions
ined+0.018
ully+0.017
agons+0.017
 idea+0.017
opl+0.016
Neg logit contributions
 Anna-0.010
Anna-0.007
 Ann-0.007
 rats-0.006
 Luna-0.006
Token your
Token ID534
Feature activation+4.01e-03

Pos logit contributions
 idea+0.070
ully+0.069
ined+0.069
 accomplish+0.065
agons+0.063
Neg logit contributions
 Anna-0.040
 rats-0.028
Anna-0.028
 Ann-0.028
 mats-0.024
Token neck
Token ID7393
Feature activation-3.33e-02

Pos logit contributions
 Anna+0.005
 rats+0.005
 mats+0.004
Anna+0.004
 beds+0.004
Neg logit contributions
ully-0.007
ined-0.007
hing-0.006
 accomplish-0.006
 gladly-0.006
Tokenl
Token ID75
Feature activation-1.96e-02

Pos logit contributions
ully+0.050
ined+0.046
 ease+0.045
 idea+0.045
agons+0.044
Neg logit contributions
 Anna-0.050
Anna-0.042
 rats-0.040
 mats-0.040
 Ann-0.038
Tokenaces
Token ID2114
Feature activation-5.81e-02

Pos logit contributions
 gladly+0.031
ully+0.031
 ease+0.031
 Sebastian+0.030
 Barry+0.030
Neg logit contributions
 Anna-0.025
 rats-0.022
Anna-0.019
 mats-0.019
pter-0.019
Token and
Token ID290
Feature activation-5.49e-03

Pos logit contributions
ully+0.095
ined+0.092
agons+0.089
 ease+0.085
reath+0.085
Neg logit contributions
 Anna-0.076
 to-0.066
 back-0.064
's-0.064
Anna-0.059
Token brace
Token ID22581
Feature activation-2.10e-02

Pos logit contributions
ully+0.010
 idea+0.010
ined+0.010
ble+0.009
 accomplish+0.009
Neg logit contributions
 Anna-0.007
 rats-0.006
 Ann-0.006
 mats-0.006
Anna-0.006
Tokenlets
Token ID5289
Feature activation-2.56e-02

Pos logit contributions
 idea+0.034
ully+0.032
ined+0.032
 accomplish+0.031
ble+0.030
Neg logit contributions
 Anna-0.029
Anna-0.025
 rats-0.024
 Ann-0.023
pter-0.023
Token?"
Token ID1701
Feature activation-2.13e-02

Pos logit contributions
ully+0.043
ined+0.039
agons+0.038
 idea+0.037
mod+0.036
Neg logit contributions
 Anna-0.040
Anna-0.032
 back-0.029
 to-0.029
 Ann-0.028
Token Mom
Token ID11254
Feature activation-1.26e-02

Pos logit contributions
 idea+0.041
ully+0.041
ined+0.040
 accomplish+0.038
ble+0.037
Neg logit contributions
 Anna-0.024
 rats-0.020
Anna-0.019
 Ann-0.019
 Megan-0.018
Token I
Token ID314
Feature activation-3.41e-02

Pos logit contributions
ksh+0.012
 minutes+0.012
 beds+0.012
pter+0.012
 Anna+0.011
Neg logit contributions
 shar-0.013
?"-0.013
 replied-0.013
ully-0.013
 accomplish-0.013
Token have
Token ID423
Feature activation-2.74e-02

Pos logit contributions
ully+0.058
ined+0.058
 idea+0.058
ble+0.053
 accomplish+0.052
Neg logit contributions
 Anna-0.046
 rats-0.039
Anna-0.037
 Ann-0.036
 Megan-0.035
Token some
Token ID617
Feature activation-8.89e-03

Pos logit contributions
 idea+0.037
?",+0.035
ully+0.035
 ease+0.035
?".+0.034
Neg logit contributions
 Anna-0.038
Anna-0.037
 Ann-0.036
 Megan-0.036
Ann-0.036
Token of
Token ID286
Feature activation+7.98e-03

Pos logit contributions
ully+0.018
ined+0.017
 accomplish+0.016
 gol+0.016
opl+0.015
Neg logit contributions
 Anna-0.011
 rats-0.009
 plates-0.008
 mats-0.007
 minutes-0.007
Token your
Token ID534
Feature activation-1.38e-02

Pos logit contributions
 Anna+0.008
 rats+0.006
Anna+0.005
 Ann+0.005
 Pen+0.005
Neg logit contributions
ully-0.017
ined-0.017
 idea-0.016
 accomplish-0.016
ble-0.015
Token carrots
Token ID34397
Feature activation-5.81e-02

Pos logit contributions
ully+0.023
ined+0.023
 idea+0.023
 accomplish+0.021
ble+0.021
Neg logit contributions
 Anna-0.019
 rats-0.016
Anna-0.015
 mats-0.015
 Ann-0.014
Token?"
Token ID1701
Feature activation-2.21e-02

Pos logit contributions
ully+0.077
ined+0.075
 ease+0.071
opl+0.069
agons+0.068
Neg logit contributions
 Anna-0.105
 to-0.084
 back-0.082
 rats-0.082
Anna-0.081
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
 idea+0.033
ully+0.032
ined+0.031
 accomplish+0.029
ble+0.029
Neg logit contributions
 Anna-0.034
 rats-0.030
Anna-0.029
 Ann-0.029
 Megan-0.028
Token
Token ID198
Feature activation-5.67e-03

Pos logit contributions
 idea+0.017
ully+0.017
ined+0.016
 accomplish+0.015
ble+0.015
Neg logit contributions
 Anna-0.018
Anna-0.015
 Ann-0.015
 rats-0.014
 Megan-0.014
TokenThe
Token ID464
Feature activation-1.64e-02

Pos logit contributions
 idea+0.009
ully+0.009
ined+0.008
 accomplish+0.008
 shar+0.008
Neg logit contributions
 Anna-0.008
 rats-0.007
pter-0.006
 Megan-0.006
 mats-0.006
Token rabbit
Token ID22746
Feature activation-2.73e-02

Pos logit contributions
ined+0.026
 accomplish+0.024
inkle+0.024
 idea+0.024
ully+0.024
Neg logit contributions
 Anna-0.025
 rats-0.022
Anna-0.021
 Luna-0.020
 Ann-0.019
Token "
Token ID366
Feature activation+1.95e-03

Pos logit contributions
ully+0.013
 idea+0.013
ined+0.012
 accomplish+0.012
inkle+0.012
Neg logit contributions
 Anna-0.011
 Ann-0.009
Anna-0.008
 rats-0.008
 Pen-0.008
TokenCan
Token ID6090
Feature activation+1.62e-02

Pos logit contributions
 Anna+0.003
 rats+0.003
 Ann+0.002
 Megan+0.002
 mats+0.002
Neg logit contributions
 idea-0.003
ully-0.003
ined-0.003
 accomplish-0.003
ble-0.003
Token we
Token ID356
Feature activation-3.27e-02

Pos logit contributions
ksh+0.019
 minutes+0.019
pter+0.019
 rats+0.019
heads+0.018
Neg logit contributions
?"-0.023
 accomplish-0.023
 replied-0.023
?".-0.023
?-0.022
Token pick
Token ID2298
Feature activation-2.00e-02

Pos logit contributions
ined+0.063
ully+0.061
opl+0.060
oving+0.056
 idea+0.054
Neg logit contributions
 Anna-0.040
 Ana-0.029
 Mia-0.029
 rats-0.028
 Ann-0.028
Token some
Token ID617
Feature activation-1.28e-02

Pos logit contributions
 idea+0.038
ully+0.038
ined+0.037
 accomplish+0.035
ble+0.034
Neg logit contributions
 Anna-0.023
 rats-0.019
Anna-0.018
 Ann-0.017
 mats-0.016
Token beans
Token ID16567
Feature activation-5.80e-02

Pos logit contributions
ully+0.025
 accomplish+0.023
ined+0.023
opl+0.023
oving+0.022
Neg logit contributions
 Anna-0.016
 rats-0.013
 mats-0.013
 plates-0.013
 chairs-0.013
Token?"
Token ID1701
Feature activation-1.80e-02

Pos logit contributions
ully+0.096
ined+0.093
opl+0.088
 ease+0.087
oving+0.085
Neg logit contributions
 Anna-0.092
Anna-0.071
 downstairs-0.067
 to-0.066
 mats-0.065
Token
Token ID198
Feature activation-6.31e-03

Pos logit contributions
 idea+0.028
ully+0.028
ined+0.027
 accomplish+0.025
ble+0.025
Neg logit contributions
 Anna-0.027
 rats-0.023
Anna-0.023
 Ann-0.022
 mats-0.022
Token
Token ID198
Feature activation-8.45e-04

Pos logit contributions
 idea+0.009
ully+0.009
ined+0.009
 accomplish+0.008
ble+0.008
Neg logit contributions
 Anna-0.011
Anna-0.009
 rats-0.009
 Ann-0.009
 Megan-0.009
TokenBen
Token ID11696
Feature activation-1.31e-02

Pos logit contributions
ully+0.001
 idea+0.001
 accomplish+0.001
ble+0.001
ined+0.001
Neg logit contributions
 Anna-0.001
 rats-0.001
 Ann-0.001
pter-0.001
 Megan-0.001
Token says
Token ID1139
Feature activation+2.33e-03

Pos logit contributions
ully+0.023
ined+0.022
ble+0.021
 idea+0.021
 accomplish+0.020
Neg logit contributions
 Anna-0.019
Anna-0.015
 Ann-0.015
 rats-0.015
 mats-0.014
Token questioned
Token ID11434
Feature activation-5.60e-03

Pos logit contributions
ined+0.038
opl+0.037
agons+0.037
alk+0.036
 ease+0.036
Neg logit contributions
 Anna-0.037
's-0.028
 Ann-0.028
 Jacob-0.028
Anna-0.027
Token.
Token ID13
Feature activation+5.68e-04

Pos logit contributions
ully+0.010
 accomplish+0.010
ined+0.010
opl+0.010
agons+0.010
Neg logit contributions
 Anna-0.008
 Ann-0.007
 Megan-0.006
 rats-0.006
Anna-0.006
Token "
Token ID366
Feature activation+3.78e-03

Pos logit contributions
 Anna+0.001
 Ann+0.001
Anna+0.001
 Megan+0.001
 rats+0.001
Neg logit contributions
ully-0.001
 idea-0.001
ined-0.001
 accomplish-0.001
inkle-0.001
TokenWhy
Token ID5195
Feature activation+3.30e-03

Pos logit contributions
 Anna+0.005
Anna+0.004
 rats+0.004
 Ann+0.004
pter+0.004
Neg logit contributions
 idea-0.007
ully-0.007
ined-0.006
inkle-0.006
 ease-0.006
Token do
Token ID466
Feature activation+1.09e-02

Pos logit contributions
 Anna+0.005
 rats+0.004
 Ann+0.004
Anna+0.004
 Jacob+0.004
Neg logit contributions
ully-0.005
ined-0.005
 ease-0.005
 gol-0.005
opl-0.005
Token frogs
Token ID37475
Feature activation-5.73e-02

Pos logit contributions
 Anna+0.012
Anna+0.010
 Megan+0.010
 rats+0.010
 Ann+0.010
Neg logit contributions
 idea-0.020
ully-0.019
 accomplish-0.018
ble-0.018
ined-0.018
Token have
Token ID423
Feature activation-1.78e-02

Pos logit contributions
ully+0.105
ined+0.098
 idea+0.092
opl+0.091
 ease+0.090
Neg logit contributions
 Anna-0.065
 rats-0.057
 wake-0.050
 we-0.050
 burns-0.047
Token legs
Token ID7405
Feature activation-2.11e-02

Pos logit contributions
ully+0.032
ined+0.031
 idea+0.030
 accomplish+0.029
ble+0.029
Neg logit contributions
 Anna-0.023
 rats-0.021
 Ann-0.018
 mats-0.018
 minutes-0.017
Token?"
Token ID1701
Feature activation-2.14e-02

Pos logit contributions
ully+0.032
 idea+0.031
ined+0.030
 ease+0.029
 accomplish+0.028
Neg logit contributions
 Anna-0.031
 rats-0.026
head-0.024
Anna-0.024
 mats-0.024
Token Lily
Token ID20037
Feature activation-2.53e-02

Pos logit contributions
ully+0.040
ined+0.040
 idea+0.039
 accomplish+0.038
ble+0.038
Neg logit contributions
 Anna-0.028
 Ann-0.022
Anna-0.021
 Megan-0.020
 rats-0.020
Token questioned
Token ID11434
Feature activation-4.51e-03

Pos logit contributions
agons+0.044
opl+0.044
ined+0.044
 ease+0.043
asty+0.042
Neg logit contributions
 Anna-0.038
's-0.030
 Jacob-0.028
Anna-0.028
 Eva-0.027
Token Can
Token ID1680
Feature activation+1.35e-02

Pos logit contributions
 Anna+0.002
 rats+0.002
 mats+0.002
Anna+0.002
 minutes+0.002
Neg logit contributions
ully-0.003
 idea-0.003
ined-0.003
 accomplish-0.003
?'-0.003
Token I
Token ID314
Feature activation-3.30e-02

Pos logit contributions
 minutes+0.019
ksh+0.019
 rats+0.018
 mats+0.018
pter+0.018
Neg logit contributions
?"-0.018
?".-0.018
 replied-0.018
?",-0.017
ully-0.017
Token play
Token ID711
Feature activation-3.62e-02

Pos logit contributions
ined+0.060
 idea+0.059
ully+0.059
opl+0.055
agons+0.054
Neg logit contributions
 Anna-0.041
 Ann-0.034
Anna-0.033
 rats-0.031
 Megan-0.031
Token in
Token ID287
Feature activation-2.48e-02

Pos logit contributions
ully+0.076
 idea+0.072
ined+0.070
opl+0.069
agons+0.068
Neg logit contributions
 Anna-0.040
Anna-0.030
 rats-0.028
 Ann-0.026
 mats-0.026
Token your
Token ID534
Feature activation-1.43e-02

Pos logit contributions
 idea+0.034
ully+0.033
ined+0.032
 accomplish+0.030
ble+0.029
Neg logit contributions
 Anna-0.042
 rats-0.037
Anna-0.036
 Ann-0.035
 mats-0.034
Token swamp
Token ID26837
Feature activation-5.71e-02

Pos logit contributions
 idea+0.020
ined+0.018
ully+0.017
ble+0.016
inkle+0.016
Neg logit contributions
 Anna-0.023
 rats-0.021
Anna-0.021
 Ann-0.020
 Pen-0.020
Token?"
Token ID1701
Feature activation-1.92e-02

Pos logit contributions
ully+0.059
ined+0.050
 ease+0.049
 idea+0.047
agons+0.046
Neg logit contributions
 Anna-0.126
Anna-0.109
 rats-0.109
 Ann-0.106
 mats-0.106
Token Fred
Token ID8559
Feature activation-2.07e-02

Pos logit contributions
 idea+0.034
ined+0.034
ully+0.034
ble+0.032
 accomplish+0.031
Neg logit contributions
 Anna-0.026
 rats-0.021
 Ann-0.021
Anna-0.020
 Megan-0.020
Token thought
Token ID1807
Feature activation-1.81e-03

Pos logit contributions
ully+0.032
ined+0.031
opl+0.031
 idea+0.030
agons+0.030
Neg logit contributions
 Anna-0.034
Anna-0.027
 Ann-0.027
 Ana-0.027
 Pen-0.027
Token about
Token ID546
Feature activation-2.71e-02

Pos logit contributions
ully+0.003
ined+0.003
 idea+0.003
 accomplish+0.003
ble+0.003
Neg logit contributions
 Anna-0.002
 rats-0.002
 Ann-0.002
Anna-0.002
 Megan-0.002
Token it
Token ID340
Feature activation-2.08e-02

Pos logit contributions
ully+0.049
 idea+0.049
ined+0.048
 accomplish+0.045
inkle+0.044
Neg logit contributions
 Anna-0.035
 rats-0.029
 Ann-0.027
Anna-0.027
 Megan-0.026
Token can
Token ID460
Feature activation-1.65e-03

Pos logit contributions
 Anna+0.007
 rats+0.005
Anna+0.005
 Ann+0.005
 mats+0.005
Neg logit contributions
ully-0.008
ined-0.007
 idea-0.007
 accomplish-0.007
agons-0.007
Token be
Token ID307
Feature activation-7.87e-03

Pos logit contributions
ully+0.003
ined+0.003
 idea+0.003
agons+0.003
opl+0.003
Neg logit contributions
 Anna-0.002
Anna-0.002
 rats-0.002
 Ann-0.001
 Ana-0.001
Token fun
Token ID1257
Feature activation+3.12e-03

Pos logit contributions
ully+0.015
agons+0.014
ute+0.014
ined+0.014
opl+0.014
Neg logit contributions
 Anna-0.011
 rats-0.008
 minutes-0.007
 Ann-0.007
Anna-0.007
Token and
Token ID290
Feature activation-8.01e-03

Pos logit contributions
 Anna+0.004
 rats+0.004
Anna+0.003
 mats+0.003
 Ann+0.003
Neg logit contributions
ully-0.005
ined-0.005
 idea-0.005
 accomplish-0.005
ble-0.005
Token that
Token ID326
Feature activation+3.39e-03

Pos logit contributions
ully+0.011
 idea+0.011
ined+0.011
ble+0.010
 accomplish+0.010
Neg logit contributions
 Anna-0.013
 rats-0.011
Anna-0.011
 Ann-0.011
 mats-0.011
Token making
Token ID1642
Feature activation+2.97e-02

Pos logit contributions
 Anna+0.005
 rats+0.004
Anna+0.004
 Ann+0.004
 mats+0.004
Neg logit contributions
ully-0.006
ined-0.006
 idea-0.005
ble-0.005
inkle-0.005
Token friends
Token ID2460
Feature activation-9.55e-03

Pos logit contributions
 Anna+0.042
 rats+0.034
Anna+0.032
 Ann+0.032
 Ana+0.030
Neg logit contributions
ully-0.052
ined-0.050
 idea-0.050
ble-0.046
 accomplish-0.045
Token can
Token ID460
Feature activation+4.31e-03

Pos logit contributions
ully+0.018
agons+0.016
ined+0.015
 idea+0.015
 shar+0.015
Neg logit contributions
 Anna-0.013
Anna-0.010
 rats-0.009
 Ann-0.009
 Ana-0.009
Token make
Token ID787
Feature activation+1.99e-02

Pos logit contributions
 Anna+0.005
Anna+0.004
 Ann+0.004
 rats+0.004
 Sky+0.004
Neg logit contributions
ully-0.008
ined-0.008
agons-0.007
 idea-0.007
opl-0.007
Token you
Token ID345
Feature activation+3.77e-03

Pos logit contributions
 Anna+0.025
 rats+0.018
 Ann+0.017
Anna+0.017
 Ana+0.015
Neg logit contributions
ully-0.040
ined-0.038
 idea-0.036
agons-0.035
ble-0.035
Token forget
Token ID6044
Feature activation-1.37e-02

Pos logit contributions
 Anna+0.006
 rats+0.005
 Ann+0.004
Anna+0.004
 Ana+0.004
Neg logit contributions
ully-0.006
ined-0.006
agons-0.005
 idea-0.005
opl-0.005
Token perfect
Token ID2818
Feature activation+1.24e-02

Pos logit contributions
 Anna+0.021
 rats+0.018
 Ann+0.018
 Ana+0.017
 Pen+0.017
Neg logit contributions
ined-0.022
ully-0.022
 idea-0.020
ble-0.020
agons-0.020
Token spot
Token ID4136
Feature activation+2.05e-02

Pos logit contributions
 Anna+0.018
 rats+0.015
 Ann+0.015
Anna+0.015
 Megan+0.014
Neg logit contributions
ully-0.020
ined-0.019
 idea-0.018
 accomplish-0.018
ble-0.018
Token and
Token ID290
Feature activation+3.89e-03

Pos logit contributions
 Anna+0.027
 rats+0.023
Anna+0.022
 Ann+0.022
 mats+0.021
Neg logit contributions
 idea-0.035
ully-0.034
ined-0.033
 accomplish-0.031
ble-0.031
Token she
Token ID673
Feature activation+9.98e-03

Pos logit contributions
 Anna+0.006
 rats+0.005
 Ann+0.005
Anna+0.005
 Megan+0.005
Neg logit contributions
ully-0.006
 idea-0.006
ined-0.006
 accomplish-0.006
agons-0.006
Token would
Token ID561
Feature activation+3.88e-03

Pos logit contributions
 Anna+0.013
 Ann+0.011
Anna+0.011
 Pen+0.010
 Ana+0.010
Neg logit contributions
ully-0.018
agons-0.018
 idea-0.017
ined-0.016
ble-0.016
Token come
Token ID1282
Feature activation+3.43e-02

Pos logit contributions
 Anna+0.005
 Ann+0.004
Anna+0.004
 Pen+0.004
 Ana+0.004
Neg logit contributions
ully-0.007
ined-0.007
 idea-0.006
agons-0.006
ble-0.006
Token back
Token ID736
Feature activation+3.27e-04

Pos logit contributions
 Anna+0.031
 rats+0.024
Anna+0.022
 Ann+0.022
 Megan+0.021
Neg logit contributions
 idea-0.073
ully-0.072
ined-0.070
 accomplish-0.067
ble-0.066
Token every
Token ID790
Feature activation+2.23e-02

Pos logit contributions
 Anna+0.000
 Ann+0.000
 rats+0.000
Anna+0.000
 downstairs+0.000
Neg logit contributions
ully-0.001
ined-0.001
 idea-0.001
agons-0.001
ble-0.001
Token day
Token ID1110
Feature activation+2.59e-02

Pos logit contributions
 Anna+0.041
 rats+0.039
 mats+0.038
Anna+0.037
pter+0.037
Neg logit contributions
 idea-0.026
ully-0.021
ined-0.020
?'-0.019
?",-0.019
Token to
Token ID284
Feature activation+7.67e-03

Pos logit contributions
 Anna+0.040
 rats+0.036
Anna+0.034
 Ann+0.034
 mats+0.033
Neg logit contributions
 idea-0.038
ully-0.037
ined-0.036
 accomplish-0.034
ble-0.033
Token read
Token ID1100
Feature activation+1.11e-02

Pos logit contributions
 Anna+0.011
 Ann+0.008
Anna+0.008
 Ana+0.007
 rats+0.007
Neg logit contributions
ined-0.014
ully-0.014
 idea-0.013
agons-0.013
ble-0.013
Token was
Token ID373
Feature activation-2.00e-02

Pos logit contributions
ully+0.006
 idea+0.005
ined+0.005
agons+0.005
 accomplish+0.005
Neg logit contributions
 Anna-0.005
 Ann-0.005
 Megan-0.004
Anna-0.004
 rats-0.004
Token a
Token ID257
Feature activation-9.33e-03

Pos logit contributions
 idea+0.031
ully+0.030
ined+0.029
ble+0.027
 accomplish+0.027
Neg logit contributions
 Anna-0.029
 rats-0.026
Anna-0.025
 Ann-0.024
 mats-0.024
Token little
Token ID1310
Feature activation-1.08e-03

Pos logit contributions
 idea+0.014
ully+0.014
ined+0.013
ble+0.012
 accomplish+0.012
Neg logit contributions
 Anna-0.014
 rats-0.012
 Ann-0.012
 mats-0.011
 Megan-0.011
Token girl
Token ID2576
Feature activation+7.59e-03

Pos logit contributions
 idea+0.002
ully+0.002
ined+0.002
ble+0.002
 gem+0.002
Neg logit contributions
 Anna-0.001
pter-0.001
 Ann-0.001
 Megan-0.001
 rats-0.001
Token named
Token ID3706
Feature activation+1.39e-02

Pos logit contributions
 Anna+0.019
 rats+0.018
 mats+0.017
 Megan+0.017
Anna+0.017
Neg logit contributions
 idea-0.003
ully-0.003
ined-0.003
 shar-0.002
?'-0.002
Token Mary
Token ID5335
Feature activation+3.38e-02

Pos logit contributions
pter+0.017
 rats+0.017
 mats+0.016
Jacob+0.016
 Anna+0.016
Neg logit contributions
 idea-0.022
ully-0.020
asty-0.019
ined-0.019
?'-0.019
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
 Anna+0.057
 rats+0.050
Anna+0.049
 Ann+0.048
 Megan+0.047
Neg logit contributions
 idea-0.046
ully-0.045
ined-0.043
 accomplish-0.042
ble-0.041
Token She
Token ID1375
Feature activation+8.63e-03

Pos logit contributions
 Anna+0.023
 rats+0.022
Anna+0.021
ifier+0.021
Rex+0.020
Neg logit contributions
 idea-0.024
ully-0.020
ined-0.020
 replied-0.019
?'-0.019
Token loved
Token ID6151
Feature activation+4.16e-03

Pos logit contributions
 Anna+0.017
 rats+0.016
Anna+0.015
 Ann+0.015
 mats+0.015
Neg logit contributions
 idea-0.009
ully-0.009
ined-0.008
 accomplish-0.007
ble-0.007
Token playing
Token ID2712
Feature activation-2.59e-02

Pos logit contributions
 Anna+0.004
Anna+0.004
 rats+0.003
 Megan+0.003
 Ann+0.003
Neg logit contributions
 idea-0.008
ully-0.008
ined-0.008
 accomplish-0.007
 shar-0.007
Token in
Token ID287
Feature activation-3.27e-03

Pos logit contributions
 idea+0.046
ully+0.045
ined+0.044
 accomplish+0.042
ble+0.041
Neg logit contributions
 Anna-0.033
 rats-0.027
Anna-0.027
 Ann-0.026
 mats-0.025
Token gives
Token ID3607
Feature activation+1.23e-02

Pos logit contributions
 Anna+0.010
 Ann+0.008
Anna+0.008
 rats+0.008
 Megan+0.008
Neg logit contributions
ully-0.010
 idea-0.009
ined-0.009
ble-0.009
 accomplish-0.009
Token him
Token ID683
Feature activation+2.12e-02

Pos logit contributions
 Anna+0.015
 rats+0.013
Anna+0.013
 mats+0.013
pter+0.012
Neg logit contributions
 idea-0.021
ully-0.020
ined-0.019
 ease-0.019
ble-0.018
Token a
Token ID257
Feature activation+1.15e-02

Pos logit contributions
 Anna+0.029
 rats+0.024
Anna+0.023
 Ann+0.023
 mats+0.022
Neg logit contributions
 idea-0.036
ully-0.035
ined-0.035
 accomplish-0.033
ble-0.033
Token new
Token ID649
Feature activation+5.83e-03

Pos logit contributions
 Anna+0.013
Anna+0.010
 Ann+0.009
 mats+0.009
 Ana+0.009
Neg logit contributions
ully-0.024
ined-0.023
oving-0.022
 accomplish-0.022
ble-0.022
Token cr
Token ID1067
Feature activation+8.78e-03

Pos logit contributions
 Anna+0.007
Anna+0.005
 Ann+0.005
 mats+0.005
 rats+0.005
Neg logit contributions
ully-0.011
 accomplish-0.011
ined-0.011
agons-0.010
oving-0.010
Tokenay
Token ID323
Feature activation+3.13e-02

Pos logit contributions
 Anna+0.012
Anna+0.011
 Ann+0.010
pter+0.010
 rats+0.010
Neg logit contributions
 idea-0.013
 accomplish-0.012
ully-0.012
ined-0.012
 gladly-0.012
Tokenon
Token ID261
Feature activation+5.13e-03

Pos logit contributions
 Anna+0.057
Anna+0.050
 Ann+0.049
pter+0.049
 rats+0.048
Neg logit contributions
 idea-0.037
 accomplish-0.035
ully-0.035
ined-0.033
ble-0.032
Token and
Token ID290
Feature activation-7.24e-04

Pos logit contributions
 Anna+0.008
 Ann+0.006
 rats+0.006
Anna+0.006
pter+0.006
Neg logit contributions
ully-0.008
ined-0.008
 idea-0.008
 accomplish-0.008
 ease-0.007
Token paper
Token ID3348
Feature activation+1.72e-02

Pos logit contributions
ully+0.001
ined+0.001
 idea+0.001
ble+0.001
agons+0.001
Neg logit contributions
 Anna-0.001
 Ann-0.001
 rats-0.001
Anna-0.001
 Jacob-0.001
Token.
Token ID13
Feature activation-1.30e-03

Pos logit contributions
 Anna+0.026
 Ann+0.020
pter+0.020
 mats+0.019
 rats+0.019
Neg logit contributions
ully-0.028
ined-0.027
 idea-0.027
 accomplish-0.027
oving-0.025
Token Tom
Token ID4186
Feature activation+2.37e-03

Pos logit contributions
 idea+0.002
ully+0.002
ined+0.002
ble+0.002
 accomplish+0.002
Neg logit contributions
 Anna-0.002
 Ann-0.001
Anna-0.001
 Megan-0.001
 Pen-0.001
Token.
Token ID13
Feature activation-1.05e-04

Pos logit contributions
 Anna+0.023
 rats+0.020
 Ann+0.019
Anna+0.019
 Megan+0.019
Neg logit contributions
ully-0.020
 idea-0.020
ined-0.019
 accomplish-0.017
ble-0.017
Token She
Token ID1375
Feature activation-2.17e-03

Pos logit contributions
 idea+0.000
ined+0.000
ully+0.000
ble+0.000
 accomplish+0.000
Neg logit contributions
 Anna-0.000
 Ann-0.000
Anna-0.000
 Megan-0.000
 rats-0.000
Token saw
Token ID2497
Feature activation+4.87e-04

Pos logit contributions
ully+0.004
 idea+0.004
ined+0.003
agons+0.003
ble+0.003
Neg logit contributions
 Anna-0.003
Anna-0.003
 Ann-0.003
 rats-0.003
 Megan-0.003
Token the
Token ID262
Feature activation+1.41e-03

Pos logit contributions
 Anna+0.001
 rats+0.001
 Ann+0.001
Anna+0.001
 Megan+0.001
Neg logit contributions
ully-0.001
 idea-0.001
ined-0.001
 accomplish-0.001
agons-0.001
Token sun
Token ID4252
Feature activation+2.66e-02

Pos logit contributions
 Anna+0.002
 rats+0.002
Anna+0.002
 mats+0.002
 Ann+0.002
Neg logit contributions
ully-0.002
ined-0.002
 accomplish-0.002
 idea-0.002
 shar-0.002
Token come
Token ID1282
Feature activation+3.20e-02

Pos logit contributions
 Anna+0.047
 rats+0.043
Anna+0.042
 Ann+0.041
 mats+0.040
Neg logit contributions
 idea-0.033
ully-0.032
ined-0.031
ble-0.029
 accomplish-0.028
Token up
Token ID510
Feature activation+1.92e-02

Pos logit contributions
 Anna+0.043
 rats+0.036
Anna+0.035
 Ann+0.035
 Megan+0.033
Neg logit contributions
 idea-0.054
ully-0.053
ined-0.051
 accomplish-0.049
ble-0.048
Token and
Token ID290
Feature activation-8.00e-03

Pos logit contributions
 Anna+0.025
Anna+0.020
 rats+0.020
 Ann+0.020
 Megan+0.019
Neg logit contributions
ully-0.033
 idea-0.032
ined-0.031
 accomplish-0.030
agons-0.030
Token it
Token ID340
Feature activation+1.07e-02

Pos logit contributions
 idea+0.012
ully+0.012
ined+0.011
 accomplish+0.011
ble+0.011
Neg logit contributions
 Anna-0.012
 rats-0.010
Anna-0.010
 mats-0.010
 Ann-0.010
Token was
Token ID373
Feature activation-2.44e-03

Pos logit contributions
 Anna+0.017
 rats+0.015
Anna+0.015
 Ann+0.015
 Megan+0.014
Neg logit contributions
 idea-0.015
ully-0.015
ined-0.014
 accomplish-0.013
ble-0.013
Token very
Token ID845
Feature activation+6.17e-03

Pos logit contributions
ully+0.005
 accomplish+0.004
agons+0.004
 shar+0.004
 gol+0.004
Neg logit contributions
 Anna-0.003
 Ann-0.003
 Megan-0.002
 Pen-0.002
 minutes-0.002
Token came
Token ID1625
Feature activation+1.80e-02

Pos logit contributions
 idea+0.014
ully+0.014
ined+0.014
agons+0.013
opl+0.013
Neg logit contributions
 Anna-0.021
 Ann-0.018
Anna-0.017
 rats-0.017
 Megan-0.017
Token and
Token ID290
Feature activation+8.17e-04

Pos logit contributions
 Anna+0.022
 rats+0.019
Anna+0.018
 Ann+0.018
 Megan+0.017
Neg logit contributions
 idea-0.032
ully-0.032
ined-0.031
 accomplish-0.030
ble-0.029
Token gave
Token ID2921
Feature activation-2.13e-03

Pos logit contributions
 Anna+0.001
 rats+0.001
 mats+0.001
Anna+0.001
 Ann+0.001
Neg logit contributions
ully-0.001
 idea-0.001
ined-0.001
 accomplish-0.001
ble-0.001
Token her
Token ID607
Feature activation+3.36e-04

Pos logit contributions
 idea+0.003
ully+0.003
ined+0.003
 ease+0.002
ble+0.002
Neg logit contributions
 Anna-0.003
 rats-0.003
Anna-0.003
pter-0.003
 mats-0.003
Token a
Token ID257
Feature activation+1.44e-03

Pos logit contributions
 Anna+0.000
 rats+0.000
Anna+0.000
 Ann+0.000
 Megan+0.000
Neg logit contributions
 idea-0.001
ully-0.001
ined-0.001
 accomplish-0.001
ble-0.001
Token big
Token ID1263
Feature activation-3.67e-04

Pos logit contributions
 Anna+0.002
Anna+0.001
 Ann+0.001
 Ana+0.001
 Pen+0.001
Neg logit contributions
ined-0.003
ully-0.003
opl-0.003
agons-0.003
 accomplish-0.003
Token,
Token ID11
Feature activation-1.76e-02

Pos logit contributions
ully+0.001
ined+0.001
 accomplish+0.001
 idea+0.001
ble+0.001
Neg logit contributions
 Anna-0.000
 rats-0.000
 Ann-0.000
Anna-0.000
 Megan-0.000
Token compassionate
Token ID32533
Feature activation+4.28e-03

Pos logit contributions
ully+0.031
ined+0.030
 accomplish+0.029
ble+0.029
 idea+0.029
Neg logit contributions
 Anna-0.024
 rats-0.019
Anna-0.018
 Ann-0.018
 mats-0.018
Token hug
Token ID16225
Feature activation+9.16e-04

Pos logit contributions
 Anna+0.006
 rats+0.006
Anna+0.005
 Ann+0.005
 Megan+0.005
Neg logit contributions
ined-0.006
ully-0.006
 idea-0.006
 accomplish-0.006
ble-0.006
Token.
Token ID13
Feature activation-8.34e-04

Pos logit contributions
 Anna+0.001
 Ann+0.001
 rats+0.001
Anna+0.001
 Megan+0.001
Neg logit contributions
 idea-0.001
ully-0.001
ined-0.001
 accomplish-0.001
inkle-0.001
Token She
Token ID1375
Feature activation-7.39e-03

Pos logit contributions
ully+0.001
 idea+0.001
ined+0.001
ble+0.001
 accomplish+0.001
Neg logit contributions
 Anna-0.001
 Ann-0.001
 Megan-0.001
Anna-0.001
 Ana-0.001
Token and
Token ID290
Feature activation-1.16e-02

Pos logit contributions
ully+0.011
 idea+0.010
ined+0.010
 accomplish+0.010
ble+0.009
Neg logit contributions
 Anna-0.011
 rats-0.009
 mats-0.009
Anna-0.009
 Ann-0.008
Token stronger
Token ID7387
Feature activation+1.18e-03

Pos logit contributions
 idea+0.017
ully+0.016
ined+0.016
 accomplish+0.016
ble+0.015
Neg logit contributions
 Anna-0.018
 rats-0.016
Anna-0.015
 Ann-0.015
 Megan-0.015
Token.
Token ID13
Feature activation-2.42e-03

Pos logit contributions
 Anna+0.002
 rats+0.002
Anna+0.002
 Ann+0.002
 mats+0.002
Neg logit contributions
ully-0.002
 idea-0.002
ined-0.002
 accomplish-0.001
ble-0.001
Token The
Token ID383
Feature activation-1.89e-02

Pos logit contributions
 idea+0.004
ully+0.004
ined+0.004
ble+0.004
 accomplish+0.004
Neg logit contributions
 Anna-0.003
 Ann-0.003
 rats-0.003
Anna-0.002
 Megan-0.002
Token fish
Token ID5916
Feature activation-2.20e-02

Pos logit contributions
ully+0.027
 idea+0.026
ined+0.026
 accomplish+0.024
ble+0.023
Neg logit contributions
 Anna-0.030
 rats-0.027
Anna-0.026
 mats-0.025
 Ann-0.024
Token was
Token ID373
Feature activation+4.66e-03

Pos logit contributions
ully+0.039
agons+0.037
asty+0.036
 idea+0.035
opl+0.035
Neg logit contributions
 Anna-0.029
Anna-0.023
 Frost-0.022
 Ann-0.022
 rats-0.022
Token so
Token ID523
Feature activation+6.97e-03

Pos logit contributions
 Anna+0.007
 Ann+0.006
 Megan+0.006
 rats+0.005
 Jacob+0.005
Neg logit contributions
ully-0.008
agons-0.007
 idea-0.007
ined-0.007
 accomplish-0.007
Token happy
Token ID3772
Feature activation+1.16e-02

Pos logit contributions
 Anna+0.011
Anna+0.009
 Ann+0.009
 Megan+0.009
 Frost+0.009
Neg logit contributions
ully-0.011
 idea-0.010
ined-0.010
agons-0.010
?'-0.009
Token!
Token ID0
Feature activation+4.58e-03

Pos logit contributions
 Anna+0.017
 rats+0.014
 Ann+0.013
Anna+0.013
 Megan+0.013
Neg logit contributions
 idea-0.019
ully-0.019
ined-0.018
 accomplish-0.017
ble-0.017
Token He
Token ID679
Feature activation-3.10e-03

Pos logit contributions
 Anna+0.006
 Ann+0.005
Anna+0.005
 Megan+0.005
 rats+0.005
Neg logit contributions
 idea-0.008
ined-0.008
ully-0.008
ble-0.008
 accomplish-0.007
Token brought
Token ID3181
Feature activation+1.86e-03

Pos logit contributions
ully+0.006
ined+0.006
 idea+0.006
agons+0.006
opl+0.005
Neg logit contributions
 Anna-0.004
 Frost-0.003
Anna-0.003
 Jacob-0.003
 Ann-0.003
Token the
Token ID262
Feature activation+6.51e-03

Pos logit contributions
 Anna+0.006
 rats+0.005
 Ann+0.004
Anna+0.004
 mats+0.004
Neg logit contributions
 idea-0.008
ully-0.008
ined-0.008
 accomplish-0.007
ble-0.007
Token water
Token ID1660
Feature activation+1.63e-03

Pos logit contributions
 Anna+0.012
 rats+0.011
 Ann+0.010
Anna+0.010
 Megan+0.010
Neg logit contributions
 idea-0.008
ully-0.008
ined-0.007
ble-0.007
 accomplish-0.007
Token.
Token ID13
Feature activation+5.88e-03

Pos logit contributions
 Anna+0.002
 rats+0.002
 Ann+0.002
 mats+0.002
Anna+0.002
Neg logit contributions
ully-0.003
 idea-0.003
 accomplish-0.003
ined-0.002
agons-0.002
Token It
Token ID632
Feature activation+1.84e-02

Pos logit contributions
 Anna+0.008
 rats+0.007
Anna+0.006
 Ann+0.006
 Megan+0.006
Neg logit contributions
 idea-0.010
ully-0.010
ined-0.010
 accomplish-0.009
ble-0.009
Token had
Token ID550
Feature activation+6.27e-03

Pos logit contributions
 Anna+0.026
 rats+0.022
Anna+0.022
 Ann+0.021
 Megan+0.020
Neg logit contributions
ully-0.030
 idea-0.030
ined-0.028
 accomplish-0.028
ble-0.026
Token a
Token ID257
Feature activation+1.55e-03

Pos logit contributions
 Anna+0.010
Anna+0.009
 rats+0.009
 Megan+0.008
 Ann+0.008
Neg logit contributions
 idea-0.009
ully-0.009
ined-0.008
ble-0.008
 ease-0.008
Token flag
Token ID6056
Feature activation+1.14e-02

Pos logit contributions
 Anna+0.002
 rats+0.002
Anna+0.002
 mats+0.002
 Ann+0.002
Neg logit contributions
ully-0.003
ined-0.002
 accomplish-0.002
 idea-0.002
asty-0.002
Token with
Token ID351
Feature activation+2.29e-02

Pos logit contributions
 Anna+0.016
 rats+0.014
Anna+0.013
 Ann+0.013
 mats+0.013
Neg logit contributions
 idea-0.018
ully-0.018
ined-0.017
 accomplish-0.017
ble-0.016
Token unknown
Token ID6439
Feature activation-2.97e-03

Pos logit contributions
 Anna+0.032
 rats+0.028
 Ann+0.026
Anna+0.026
 mats+0.026
Neg logit contributions
 idea-0.037
ully-0.036
ined-0.035
 accomplish-0.034
ble-0.033
Token colors
Token ID7577
Feature activation-3.98e-03

Pos logit contributions
ully+0.005
 accomplish+0.005
ined+0.005
 idea+0.005
 shar+0.005
Neg logit contributions
 Anna-0.003
 rats-0.003
pter-0.003
 mats-0.003
 chairs-0.003
Token and
Token ID290
Feature activation-1.10e-03

Pos logit contributions
 idea+0.006
ully+0.006
 accomplish+0.006
ined+0.006
 shar+0.006
Neg logit contributions
 Anna-0.006
 rats-0.005
 Ann-0.004
 mats-0.004
pter-0.004
Token don
Token ID836
Feature activation+1.46e-02

Pos logit contributions
ully+0.009
ined+0.008
agons+0.008
opl+0.008
ble+0.008
Neg logit contributions
 Anna-0.007
Anna-0.007
 Ann-0.006
 rats-0.006
rost-0.006
Token't
Token ID470
Feature activation-8.41e-03

Pos logit contributions
 Anna+0.018
 rats+0.015
Anna+0.014
 Ann+0.014
 mats+0.013
Neg logit contributions
 idea-0.027
ully-0.026
ined-0.025
 accomplish-0.024
ble-0.023
Token care
Token ID1337
Feature activation+7.69e-04

Pos logit contributions
ined+0.014
ully+0.014
 accomplish+0.013
agons+0.013
ble+0.013
Neg logit contributions
 Anna-0.012
Anna-0.010
 rats-0.009
 Ann-0.009
 minutes-0.009
Token.
Token ID13
Feature activation+1.54e-04

Pos logit contributions
 Anna+0.001
 Ann+0.001
 rats+0.001
Anna+0.001
 mats+0.001
Neg logit contributions
ully-0.001
 idea-0.001
ined-0.001
 gladly-0.001
 ease-0.001
Token I
Token ID314
Feature activation-2.97e-03

Pos logit contributions
 rats+0.000
 Anna+0.000
 mats+0.000
 minutes+0.000
heads+0.000
Neg logit contributions
 idea-0.000
ully-0.000
 shar-0.000
ble-0.000
?".-0.000
Token like
Token ID588
Feature activation+9.53e-03

Pos logit contributions
ully+0.005
ined+0.005
agons+0.005
opl+0.005
ble+0.005
Neg logit contributions
 Anna-0.004
Anna-0.003
 Ann-0.003
 rats-0.003
rost-0.003
Token my
Token ID616
Feature activation-1.02e-02

Pos logit contributions
 Anna+0.012
 rats+0.010
Anna+0.010
 Ann+0.009
 mats+0.009
Neg logit contributions
 idea-0.017
ully-0.017
ined-0.016
 accomplish-0.016
ble-0.015
Token red
Token ID2266
Feature activation-5.59e-03

Pos logit contributions
ined+0.018
ully+0.018
 accomplish+0.018
oving+0.017
ble+0.017
Neg logit contributions
 Anna-0.014
 mats-0.010
 rats-0.010
 beds-0.010
Anna-0.010
Token costume
Token ID16569
Feature activation-2.58e-02

Pos logit contributions
 accomplish+0.010
ined+0.010
oving+0.009
ully+0.009
 idea+0.009
Neg logit contributions
 Anna-0.007
 mats-0.006
 beds-0.005
Anna-0.005
 chairs-0.005
Token.
Token ID13
Feature activation+1.02e-03

Pos logit contributions
ully+0.049
ined+0.045
 accomplish+0.043
 ease+0.042
mod+0.041
Neg logit contributions
 Anna-0.038
 rats-0.028
Anna-0.028
 mats-0.027
 Ann-0.027
Token I
Token ID314
Feature activation-1.56e-03

Pos logit contributions
 Anna+0.001
 rats+0.001
 mats+0.001
Anna+0.001
 Megan+0.001
Neg logit contributions
 idea-0.002
ully-0.001
ined-0.001
ble-0.001
 shar-0.001
Token the
Token ID262
Feature activation-1.09e-02

Pos logit contributions
 Anna+0.001
 rats+0.001
 Ann+0.001
Anna+0.001
 Megan+0.001
Neg logit contributions
 idea-0.001
ully-0.001
ined-0.001
 accomplish-0.001
ble-0.001
Token word
Token ID1573
Feature activation+1.23e-02

Pos logit contributions
ully+0.020
ined+0.019
 accomplish+0.018
 ease+0.018
umper+0.017
Neg logit contributions
 Anna-0.014
 rats-0.011
 Ann-0.011
Anna-0.011
 mats-0.011
Token.
Token ID13
Feature activation+8.34e-04

Pos logit contributions
 Anna+0.019
 Ann+0.016
 rats+0.015
 Jacob+0.014
 Megan+0.014
Neg logit contributions
ully-0.021
ined-0.020
agons-0.020
 accomplish-0.019
 idea-0.019
Token She
Token ID1375
Feature activation-1.54e-03

Pos logit contributions
 Anna+0.001
 rats+0.001
Anna+0.001
 Ann+0.001
 Megan+0.001
Neg logit contributions
 idea-0.001
ully-0.001
ined-0.001
ble-0.001
 accomplish-0.001
Token tried
Token ID3088
Feature activation+1.25e-02

Pos logit contributions
opl+0.003
ully+0.003
alk+0.002
ined+0.002
agons+0.002
Neg logit contributions
 Anna-0.002
 Ann-0.002
Anna-0.002
 rats-0.002
 Megan-0.002
Token to
Token ID284
Feature activation+2.87e-04

Pos logit contributions
 Anna+0.011
 rats+0.008
 Ann+0.008
Anna+0.008
 mats+0.007
Neg logit contributions
 idea-0.028
ully-0.027
ined-0.027
 accomplish-0.026
ble-0.025
Token guess
Token ID4724
Feature activation-2.42e-02

Pos logit contributions
 Anna+0.000
Anna+0.000
 Ann+0.000
 rats+0.000
 Megan+0.000
Neg logit contributions
ined-0.000
ully-0.000
 idea-0.000
opl-0.000
agons-0.000
Token,
Token ID11
Feature activation-5.00e-03

Pos logit contributions
ully+0.046
ined+0.044
 idea+0.041
 gol+0.041
agons+0.039
Neg logit contributions
 Anna-0.038
 Ann-0.031
 rats-0.029
 Megan-0.028
 Ana-0.027
Token but
Token ID475
Feature activation-1.27e-02

Pos logit contributions
ully+0.008
ined+0.008
 idea+0.008
agons+0.007
inkle+0.007
Neg logit contributions
 Anna-0.007
 rats-0.006
 Ann-0.006
Anna-0.006
 Pen-0.006
Token she
Token ID673
Feature activation-1.41e-03

Pos logit contributions
 idea+0.018
ble+0.017
ully+0.017
 accomplish+0.016
ined+0.016
Neg logit contributions
 Anna-0.017
 rats-0.016
ksh-0.016
pter-0.016
Anna-0.016
Token was
Token ID373
Feature activation-1.91e-03

Pos logit contributions
ully+0.002
 idea+0.002
ined+0.002
agons+0.002
opl+0.002
Neg logit contributions
 Anna-0.002
 Ann-0.002
Anna-0.002
 rats-0.002
 Megan-0.002
Token was
Token ID373
Feature activation-2.18e-02

Pos logit contributions
ully+0.028
 idea+0.028
ined+0.027
 accomplish+0.026
ble+0.026
Neg logit contributions
 Anna-0.030
 rats-0.026
 Ann-0.025
Anna-0.025
 Megan-0.024
Token right
Token ID826
Feature activation-1.13e-02

Pos logit contributions
ully+0.038
 idea+0.038
ined+0.036
 accomplish+0.035
ble+0.034
Neg logit contributions
 Anna-0.028
 rats-0.024
Anna-0.023
 Ann-0.022
 Megan-0.022
Token.
Token ID13
Feature activation+3.34e-04

Pos logit contributions
 idea+0.014
ully+0.014
ined+0.013
 accomplish+0.012
ble+0.012
Neg logit contributions
 Anna-0.020
 rats-0.018
Anna-0.017
 Ann-0.017
 mats-0.017
Token
Token ID198
Feature activation-6.93e-03

Pos logit contributions
 Anna+0.001
 rats+0.001
pter+0.001
Anna+0.001
 mats+0.001
Neg logit contributions
 idea-0.000
ully-0.000
ined-0.000
 replied-0.000
 accomplish-0.000
Token
Token ID198
Feature activation+2.41e-03

Pos logit contributions
 idea+0.010
ully+0.010
ined+0.009
 accomplish+0.009
ble+0.009
Neg logit contributions
 Anna-0.011
Anna-0.010
 Ann-0.009
 rats-0.009
 Megan-0.009
TokenThen
Token ID6423
Feature activation-5.58e-03

Pos logit contributions
 Anna+0.004
 rats+0.004
 Ann+0.004
 Megan+0.003
pter+0.003
Neg logit contributions
 idea-0.003
ully-0.003
ined-0.003
 accomplish-0.003
ble-0.003
Token,
Token ID11
Feature activation+1.60e-03

Pos logit contributions
 idea+0.004
 replied+0.003
ully+0.003
ined+0.003
?",+0.003
Neg logit contributions
 Anna-0.012
 rats-0.012
Anna-0.012
ksh-0.012
rost-0.012
Token Lily
Token ID20037
Feature activation+5.51e-04

Pos logit contributions
 Anna+0.002
 rats+0.002
Anna+0.002
 Ann+0.002
 Megan+0.001
Neg logit contributions
 idea-0.003
ully-0.003
ined-0.003
 accomplish-0.003
ble-0.003
Token saw
Token ID2497
Feature activation+2.36e-03

Pos logit contributions
 Anna+0.001
 rats+0.000
Anna+0.000
 Ann+0.000
 mats+0.000
Neg logit contributions
 idea-0.001
ined-0.001
ully-0.001
ble-0.001
 accomplish-0.001
Token a
Token ID257
Feature activation-4.50e-03

Pos logit contributions
 Anna+0.003
 rats+0.003
 Ann+0.002
Anna+0.002
 Megan+0.002
Neg logit contributions
 idea-0.004
ully-0.004
ined-0.004
 accomplish-0.004
agons-0.004
Token sweet
Token ID6029
Feature activation+9.40e-03

Pos logit contributions
ully+0.008
ined+0.008
 idea+0.007
 accomplish+0.007
opl+0.007
Neg logit contributions
 Anna-0.006
 rats-0.005
Anna-0.005
 Ann-0.005
 mats-0.004
Token She
Token ID1375
Feature activation-7.10e-03

Pos logit contributions
 idea+0.005
ined+0.005
ully+0.005
 accomplish+0.004
ble+0.004
Neg logit contributions
 Anna-0.003
 Ann-0.002
Anna-0.002
 rats-0.002
 mats-0.002
Token loved
Token ID6151
Feature activation+3.14e-03

Pos logit contributions
ully+0.010
 idea+0.010
ined+0.010
 accomplish+0.010
opl+0.010
Neg logit contributions
 Anna-0.011
Anna-0.010
 rats-0.010
 Ann-0.010
 Megan-0.009
Token Tom
Token ID4186
Feature activation+1.67e-03

Pos logit contributions
 Anna+0.004
Anna+0.004
 rats+0.003
pter+0.003
 Megan+0.003
Neg logit contributions
 idea-0.005
ully-0.005
ined-0.005
 accomplish-0.005
ble-0.005
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
 Anna+0.003
 rats+0.002
Anna+0.002
 Ann+0.002
 mats+0.002
Neg logit contributions
 idea-0.002
ully-0.002
ined-0.002
 accomplish-0.002
ble-0.002
Token wanted
Token ID2227
Feature activation+2.02e-04

Pos logit contributions
 Anna+0.014
 rats+0.012
Anna+0.011
 Ann+0.011
 Megan+0.011
Neg logit contributions
 idea-0.020
ully-0.019
ined-0.019
 accomplish-0.018
ble-0.018
Token to
Token ID284
Feature activation-6.66e-03

Pos logit contributions
 Anna+0.000
Anna+0.000
pter+0.000
 rats+0.000
Jacob+0.000
Neg logit contributions
 idea-0.000
ully-0.000
 ease-0.000
 shar-0.000
 encouraged-0.000
Token be
Token ID307
Feature activation-8.79e-03

Pos logit contributions
ined+0.014
ully+0.013
agons+0.012
inkle+0.012
 idea+0.012
Neg logit contributions
 Anna-0.008
 Ann-0.007
 Megan-0.006
Anna-0.006
 rats-0.006
Token with
Token ID351
Feature activation+5.94e-03

Pos logit contributions
 idea+0.013
ully+0.013
ined+0.012
 accomplish+0.012
ble+0.011
Neg logit contributions
 Anna-0.014
 rats-0.012
 Ann-0.012
Anna-0.012
 Megan-0.011
Token him
Token ID683
Feature activation+1.24e-03

Pos logit contributions
 Anna+0.009
 rats+0.008
Anna+0.008
 Ann+0.008
 mats+0.008
Neg logit contributions
 idea-0.008
ully-0.008
ined-0.008
 accomplish-0.007
ble-0.007
Token.
Token ID13
Feature activation-2.50e-03

Pos logit contributions
 Anna+0.002
 rats+0.002
 Ann+0.002
 Pen+0.001
 mats+0.001
Neg logit contributions
ully-0.002
ined-0.002
 idea-0.002
 accomplish-0.002
inkle-0.002
Token They
Token ID1119
Feature activation-1.21e-03

Pos logit contributions
ined+0.005
 idea+0.005
ully+0.005
ble+0.004
inkle+0.004
Neg logit contributions
 Anna-0.003
 Ann-0.002
Anna-0.002
 rats-0.002
 mats-0.002
Token One
Token ID1881
Feature activation-1.79e-02

Pos logit contributions
 Anna+0.006
 rats+0.006
Anna+0.005
 Ann+0.005
 mats+0.005
Neg logit contributions
 idea-0.005
ully-0.004
ined-0.004
 accomplish-0.004
ble-0.004
Token day
Token ID1110
Feature activation+3.27e-03

Pos logit contributions
 idea+0.019
ully+0.018
ined+0.016
ble+0.014
 replied+0.014
Neg logit contributions
 Anna-0.035
 rats-0.033
pter-0.032
 mats-0.031
Anna-0.031
Token,
Token ID11
Feature activation+2.64e-03

Pos logit contributions
 Anna+0.007
 rats+0.007
pter+0.007
Anna+0.006
ksh+0.006
Neg logit contributions
 idea-0.002
ined-0.002
ully-0.002
 replied-0.002
ble-0.002
Token she
Token ID673
Feature activation+5.30e-03

Pos logit contributions
 Anna+0.004
 rats+0.003
Anna+0.003
pter+0.003
 Ann+0.003
Neg logit contributions
 idea-0.004
ully-0.004
ined-0.004
ble-0.003
 accomplish-0.003
Token met
Token ID1138
Feature activation+2.69e-03

Pos logit contributions
 Anna+0.008
Anna+0.006
 rats+0.006
 Ann+0.006
 mats+0.006
Neg logit contributions
 idea-0.009
ully-0.008
ined-0.008
ble-0.008
 accomplish-0.008
Token a
Token ID257
Feature activation-5.94e-03

Pos logit contributions
 Anna+0.003
Anna+0.003
 Ann+0.003
 rats+0.003
 Megan+0.002
Neg logit contributions
 idea-0.005
ully-0.005
 accomplish-0.005
ined-0.005
agons-0.005
Token wise
Token ID10787
Feature activation+7.12e-03

Pos logit contributions
ined+0.010
ully+0.010
 idea+0.010
opl+0.010
 accomplish+0.010
Neg logit contributions
 Anna-0.008
Anna-0.007
 rats-0.006
 Ann-0.006
 Ana-0.006
Token old
Token ID1468
Feature activation+3.55e-04

Pos logit contributions
 Anna+0.010
 rats+0.008
 Ann+0.007
Anna+0.007
 Pen+0.007
Neg logit contributions
ined-0.012
ully-0.012
 accomplish-0.012
 idea-0.011
 ease-0.011
Token man
Token ID582
Feature activation-4.82e-03

Pos logit contributions
 Anna+0.000
 rats+0.000
Anna+0.000
 Ann+0.000
 Pen+0.000
Neg logit contributions
 ease-0.001
ined-0.001
 accomplish-0.001
irl-0.001
agons-0.001
Token who
Token ID508
Feature activation+8.96e-04

Pos logit contributions
 idea+0.008
ully+0.007
ined+0.007
 accomplish+0.007
ble+0.007
Neg logit contributions
 Anna-0.007
 rats-0.006
Anna-0.006
 Ann-0.006
 mats-0.006
Token told
Token ID1297
Feature activation+1.36e-02

Pos logit contributions
 Anna+0.001
 Ann+0.001
 rats+0.001
 Frost+0.001
 Jacob+0.001
Neg logit contributions
agons-0.002
alk-0.002
 idea-0.002
ute-0.002
reath-0.001
Token
Token ID198
Feature activation+4.41e-03

Pos logit contributions
ully+0.001
 idea+0.001
ined+0.001
ble+0.001
agons+0.001
Neg logit contributions
 Anna-0.001
Anna-0.001
 Ann-0.001
 Megan-0.001
 Pen-0.000
TokenBut
Token ID1537
Feature activation-5.09e-03

Pos logit contributions
 Anna+0.005
 rats+0.004
Anna+0.003
 Ann+0.003
 mats+0.003
Neg logit contributions
 idea-0.009
ully-0.008
ined-0.008
 accomplish-0.008
ble-0.008
Token Lucy
Token ID22162
Feature activation+4.79e-03

Pos logit contributions
 idea+0.009
ully+0.009
ined+0.009
ble+0.008
 accomplish+0.008
Neg logit contributions
 Anna-0.006
 rats-0.005
Anna-0.005
 mats-0.005
pter-0.005
Token was
Token ID373
Feature activation-7.36e-03

Pos logit contributions
 Anna+0.007
 Ann+0.005
 Pen+0.005
 Megan+0.005
 Jacob+0.005
Neg logit contributions
ully-0.008
 idea-0.008
agons-0.008
ined-0.008
opl-0.008
Token brave
Token ID14802
Feature activation+6.33e-03

Pos logit contributions
 idea+0.011
ully+0.010
ined+0.010
ble+0.010
 accomplish+0.010
Neg logit contributions
 Anna-0.011
 rats-0.010
Anna-0.010
 Ann-0.010
 mats-0.010
Token,
Token ID11
Feature activation-1.33e-02

Pos logit contributions
 Anna+0.012
Anna+0.011
 rats+0.011
 mats+0.011
 Ann+0.011
Neg logit contributions
 idea-0.007
ully-0.006
 encouraged-0.005
ined-0.005
 replied-0.005
Token so
Token ID523
Feature activation-1.07e-02

Pos logit contributions
ined+0.023
ully+0.023
 idea+0.023
agons+0.022
ble+0.022
Neg logit contributions
 Anna-0.019
 rats-0.015
 Ann-0.015
 Pen-0.014
Anna-0.014
Token she
Token ID673
Feature activation-1.37e-02

Pos logit contributions
ully+0.019
 idea+0.019
ined+0.018
agons+0.017
 accomplish+0.017
Neg logit contributions
 Anna-0.014
 rats-0.012
 Ann-0.012
 Megan-0.011
Anna-0.011
Token stepped
Token ID10764
Feature activation-8.40e-04

Pos logit contributions
opl+0.022
 idea+0.021
ined+0.021
agons+0.020
ully+0.020
Neg logit contributions
 Anna-0.020
Anna-0.017
 Ann-0.016
 Pen-0.016
 rats-0.016
Token a
Token ID257
Feature activation+9.38e-03

Pos logit contributions
 idea+0.002
ully+0.002
ined+0.002
 accomplish+0.001
opl+0.001
Neg logit contributions
 Anna-0.001
 rats-0.001
Anna-0.001
 Ann-0.001
 Megan-0.001
Token little
Token ID1310
Feature activation+7.75e-03

Pos logit contributions
 Anna+0.011
 rats+0.009
Anna+0.009
 Ann+0.009
 mats+0.008
Neg logit contributions
ully-0.017
 idea-0.017
ined-0.017
 accomplish-0.016
ble-0.016
Token money
Token ID1637
Feature activation-8.59e-03

Pos logit contributions
 idea+0.009
ully+0.009
ined+0.009
 accomplish+0.008
ble+0.008
Neg logit contributions
 Anna-0.006
 rats-0.005
Anna-0.005
 Ann-0.005
 mats-0.005
Token?"
Token ID1701
Feature activation-9.69e-03

Pos logit contributions
ully+0.010
 idea+0.010
ined+0.010
 accomplish+0.010
 ease+0.009
Neg logit contributions
 Anna-0.016
Anna-0.014
 rats-0.014
 Ann-0.013
 mats-0.013
Token Tom
Token ID4186
Feature activation-1.04e-02

Pos logit contributions
 idea+0.020
ully+0.019
ined+0.019
 accomplish+0.018
ble+0.018
Neg logit contributions
 Anna-0.009
 rats-0.008
Anna-0.007
 Ann-0.007
 mats-0.007
Token asked
Token ID1965
Feature activation+1.72e-03

Pos logit contributions
 idea+0.011
ined+0.011
ully+0.011
 accomplish+0.010
ble+0.010
Neg logit contributions
 Anna-0.021
 rats-0.018
Anna-0.018
 Ann-0.018
 Pen-0.017
Token.
Token ID13
Feature activation+6.27e-03

Pos logit contributions
 Anna+0.003
 rats+0.003
Anna+0.003
 Ann+0.003
 Megan+0.003
Neg logit contributions
 idea-0.002
ully-0.002
ined-0.002
 accomplish-0.002
ble-0.002
Token
Token ID198
Feature activation-5.06e-03

Pos logit contributions
 Anna+0.010
Anna+0.009
 rats+0.008
 Ann+0.008
 Megan+0.008
Neg logit contributions
ully-0.009
 idea-0.009
ined-0.009
 accomplish-0.009
ble-0.008
Token
Token ID198
Feature activation+2.27e-03

Pos logit contributions
 idea+0.007
ully+0.007
ined+0.007
 accomplish+0.006
ble+0.006
Neg logit contributions
 Anna-0.008
Anna-0.007
 Ann-0.007
 rats-0.007
 Megan-0.007
Token"
Token ID1
Feature activation+2.08e-02

Pos logit contributions
 rats+0.004
 Anna+0.004
 mats+0.004
pter+0.004
 Pen+0.004
Neg logit contributions
 idea-0.002
ble-0.002
ully-0.002
ined-0.002
 accomplish-0.002
TokenSorry
Token ID14385
Feature activation-3.91e-03

Pos logit contributions
 Anna+0.031
 rats+0.028
 Ann+0.027
pter+0.026
 Megan+0.026
Neg logit contributions
 idea-0.030
ully-0.028
ined-0.027
 accomplish-0.027
ble-0.026
Token,
Token ID11
Feature activation+1.47e-02

Pos logit contributions
 idea+0.003
ully+0.003
?",+0.003
?+0.002
ble+0.002
Neg logit contributions
 Anna-0.008
Anna-0.008
 rats-0.008
pter-0.008
 mats-0.008
Token sweet
Token ID6029
Feature activation+4.83e-03

Pos logit contributions
 Anna+0.018
Anna+0.017
pter+0.017
 rats+0.016
 Pen+0.016
Neg logit contributions
 idea-0.025
ully-0.022
ble-0.021
 replied-0.021
ined-0.021
TokenHe
Token ID1544
Feature activation-1.05e-02

Pos logit contributions
 idea+0.001
ully+0.001
ined+0.001
inkle+0.001
ble+0.001
Neg logit contributions
 Anna-0.001
Anna-0.001
 rats-0.001
 Ann-0.001
 mats-0.001
Token asked
Token ID1965
Feature activation-5.50e-04

Pos logit contributions
 idea+0.015
ully+0.015
ined+0.015
 accomplish+0.014
ble+0.014
Neg logit contributions
 Anna-0.016
 rats-0.014
Anna-0.014
 Ann-0.014
 mats-0.013
Token the
Token ID262
Feature activation-9.81e-03

Pos logit contributions
ully+0.001
ined+0.001
 idea+0.001
 accomplish+0.001
inkle+0.001
Neg logit contributions
 Anna-0.001
 rats-0.001
 Ann-0.001
Anna-0.001
 Pen-0.001
Token bird
Token ID6512
Feature activation-8.46e-03

Pos logit contributions
 idea+0.016
ully+0.016
ined+0.015
ble+0.014
 accomplish+0.014
Neg logit contributions
 Anna-0.014
 rats-0.012
Anna-0.012
 Ann-0.011
 mats-0.011
Token where
Token ID810
Feature activation+1.90e-02

Pos logit contributions
 idea+0.011
ully+0.011
ined+0.011
 accomplish+0.010
ble+0.010
Neg logit contributions
 Anna-0.014
 rats-0.013
Anna-0.012
 Ann-0.012
 Megan-0.012
Token he
Token ID339
Feature activation-3.55e-02

Pos logit contributions
 Anna+0.030
Anna+0.027
 rats+0.027
 mats+0.026
 Ann+0.026
Neg logit contributions
 idea-0.027
ully-0.026
ined-0.025
 accomplish-0.023
ble-0.023
Token lived
Token ID5615
Feature activation-1.76e-02

Pos logit contributions
opl+0.069
ined+0.068
ute+0.067
agons+0.066
 idea+0.066
Neg logit contributions
 Anna-0.037
 rats-0.031
Anna-0.026
's-0.026
 knocks-0.026
Token and
Token ID290
Feature activation-4.93e-03

Pos logit contributions
ully+0.032
 idea+0.031
ined+0.030
agons+0.029
opl+0.029
Neg logit contributions
 Anna-0.022
 rats-0.018
Anna-0.017
 Ann-0.016
 downstairs-0.016
Token the
Token ID262
Feature activation-1.27e-02

Pos logit contributions
agons+0.010
opl+0.010
ined+0.010
 idea+0.010
inkle+0.010
Neg logit contributions
 Anna-0.006
 rats-0.004
 Ann-0.004
 Pen-0.004
 Frost-0.004
Token bird
Token ID6512
Feature activation-1.88e-02

Pos logit contributions
ully+0.023
 idea+0.023
ined+0.022
 accomplish+0.021
ble+0.021
Neg logit contributions
 Anna-0.016
 rats-0.014
Anna-0.013
 Ann-0.012
 Pen-0.012
Token told
Token ID1297
Feature activation+4.24e-03

Pos logit contributions
agons+0.034
opl+0.033
 gol+0.033
 accomplish+0.032
mmm+0.032
Neg logit contributions
 Anna-0.023
's-0.020
 rats-0.018
 Ann-0.017
 to-0.016
Token.
Token ID13
Feature activation-2.99e-03

Pos logit contributions
ully+0.017
ined+0.017
 idea+0.017
 accomplish+0.016
 ease+0.016
Neg logit contributions
 Anna-0.017
 rats-0.015
 Ann-0.014
Anna-0.013
 mats-0.013
Token
Token ID198
Feature activation-5.51e-03

Pos logit contributions
ully+0.005
 idea+0.005
ined+0.005
 accomplish+0.005
ble+0.005
Neg logit contributions
 Anna-0.004
 Ann-0.003
Anna-0.003
 Megan-0.003
 Ana-0.003
Token
Token ID198
Feature activation+1.88e-03

Pos logit contributions
 idea+0.009
ully+0.009
ined+0.009
 accomplish+0.008
ble+0.008
Neg logit contributions
 Anna-0.008
Anna-0.007
 Ann-0.007
 Megan-0.006
Ann-0.006
Token"
Token ID1
Feature activation+9.19e-03

Pos logit contributions
 Anna+0.003
 rats+0.003
 Ann+0.002
 mats+0.002
 Megan+0.002
Neg logit contributions
 idea-0.003
ully-0.003
ined-0.003
 accomplish-0.002
ble-0.002
TokenCan
Token ID6090
Feature activation+1.67e-02

Pos logit contributions
 Anna+0.010
Anna+0.009
 rats+0.008
 Ann+0.008
 Megan+0.007
Neg logit contributions
 idea-0.018
ully-0.017
ined-0.017
inkle-0.016
 accomplish-0.016
Token I
Token ID314
Feature activation-3.50e-02

Pos logit contributions
 rats+0.018
 mats+0.017
 minutes+0.017
 Anna+0.017
pter+0.017
Neg logit contributions
 replied-0.026
ully-0.026
 gem-0.026
 shar-0.026
ble-0.025
Token touch
Token ID3638
Feature activation-2.34e-02

Pos logit contributions
ined+0.076
opl+0.073
ully+0.073
 idea+0.070
oving+0.069
Neg logit contributions
 Anna-0.034
Anna-0.024
 we-0.022
 Ann-0.022
 to-0.022
Token it
Token ID340
Feature activation-2.84e-02

Pos logit contributions
 idea+0.045
ully+0.044
ined+0.043
 accomplish+0.042
agons+0.040
Neg logit contributions
 Anna-0.029
 rats-0.022
Anna-0.021
 Ann-0.020
 Megan-0.020
Token?"
Token ID1701
Feature activation-2.19e-02

Pos logit contributions
ully+0.044
agons+0.041
 gol+0.040
 ease+0.039
opl+0.039
Neg logit contributions
 Anna-0.049
 next-0.037
 to-0.037
 we-0.037
 back-0.036
Token she
Token ID673
Feature activation-2.11e-02

Pos logit contributions
ully+0.042
ined+0.041
 idea+0.041
ble+0.039
 accomplish+0.039
Neg logit contributions
 Anna-0.028
 Ann-0.021
Anna-0.021
 rats-0.020
 Megan-0.020
Token asked
Token ID1965
Feature activation+8.57e-04

Pos logit contributions
ined+0.029
ully+0.027
opl+0.027
 accomplish+0.026
agons+0.026
Neg logit contributions
 Anna-0.040
 Ann-0.032
Anna-0.032
 rats-0.031
 Pen-0.031
Token can
Token ID460
Feature activation+1.23e-03

Pos logit contributions
 Anna+0.005
 rats+0.004
Anna+0.004
 Ann+0.004
 Jacob+0.004
Neg logit contributions
ully-0.006
ined-0.006
 idea-0.005
inkle-0.005
ble-0.005
Token I
Token ID314
Feature activation-3.04e-02

Pos logit contributions
 Anna+0.002
 rats+0.001
 Ann+0.001
Anna+0.001
 Megan+0.001
Neg logit contributions
 idea-0.002
ined-0.002
ully-0.002
 accomplish-0.002
inkle-0.002
Token do
Token ID466
Feature activation-2.16e-02

Pos logit contributions
ully+0.069
ined+0.066
opl+0.065
agons+0.062
 ease+0.061
Neg logit contributions
 Anna-0.024
 wake-0.022
 to-0.020
's-0.017
 Mia-0.016
Token with
Token ID351
Feature activation-1.40e-03

Pos logit contributions
ully+0.039
 ease+0.036
ined+0.036
opl+0.034
 idea+0.033
Neg logit contributions
 Anna-0.030
Anna-0.024
 Ann-0.023
 rats-0.022
 Jacob-0.022
Token this
Token ID428
Feature activation-1.73e-02

Pos logit contributions
ined+0.003
 idea+0.003
ully+0.003
 accomplish+0.003
imm+0.003
Neg logit contributions
 Anna-0.001
 rats-0.001
 Ann-0.001
 chairs-0.001
 Frost-0.001
Token avocado
Token ID40377
Feature activation-5.04e-02

Pos logit contributions
agons+0.030
ully+0.030
hing+0.028
oving+0.028
ined+0.027
Neg logit contributions
 Anna-0.023
 half-0.019
 next-0.018
 to-0.018
 back-0.018
Token?
Token ID30
Feature activation-1.29e-03

Pos logit contributions
 ease+0.063
ully+0.062
ined+0.062
oving+0.055
hing+0.054
Neg logit contributions
 Anna-0.095
 rats-0.081
 to-0.078
 we-0.078
 next-0.075
Tokenâ
Token ID22940
Feature activation+6.35e-04

Pos logit contributions
ined+0.003
 idea+0.003
ully+0.003
inkle+0.003
ble+0.003
Neg logit contributions
 Anna-0.001
 Ann-0.001
Anna-0.001
 Hur-0.001
 Jacob-0.001
Token
Token ID26391
Feature activation-2.91e-03

Pos logit contributions
 Anna+0.001
Anna+0.001
 Ann+0.001
Ann+0.001
Jacob+0.001
Neg logit contributions
 idea-0.001
ully-0.001
 ease-0.001
ble-0.001
ined-0.001
Token
Token ID220
Feature activation-1.87e-02

Pos logit contributions
 idea+0.004
ully+0.004
ined+0.003
 accomplish+0.003
?!"+0.003
Neg logit contributions
 Anna-0.005
 rats-0.005
 mats-0.004
Anna-0.004
 Megan-0.004
Token
Token ID198
Feature activation-4.17e-03

Pos logit contributions
 idea+0.035
ined+0.034
ully+0.033
 gem+0.031
 hobby+0.030
Neg logit contributions
 Anna-0.027
 Ann-0.021
Anna-0.021
 Megan-0.019
 Pen-0.019
Token.
Token ID13
Feature activation+3.24e-03

Pos logit contributions
ully+0.016
ined+0.015
 idea+0.015
agons+0.013
 accomplish+0.013
Neg logit contributions
 Anna-0.013
 rats-0.010
Anna-0.010
 downstairs-0.010
 Ann-0.010
Token Can
Token ID1680
Feature activation+1.18e-02

Pos logit contributions
 Anna+0.004
 rats+0.003
Anna+0.003
 Ann+0.003
 Megan+0.003
Neg logit contributions
 idea-0.006
ully-0.006
ined-0.006
 accomplish-0.005
ble-0.005
Token you
Token ID345
Feature activation-3.57e-02

Pos logit contributions
 Anna+0.013
 rats+0.012
 minutes+0.012
ksh+0.012
 mats+0.012
Neg logit contributions
 idea-0.020
ully-0.020
ble-0.019
 accomplish-0.019
 shar-0.019
Token find
Token ID1064
Feature activation-3.18e-02

Pos logit contributions
ined+0.077
opl+0.074
ully+0.071
agons+0.067
ained+0.066
Neg logit contributions
 Anna-0.033
 wake-0.025
 to-0.023
ifier-0.022
 downstairs-0.022
Token other
Token ID584
Feature activation-2.16e-02

Pos logit contributions
 idea+0.059
ully+0.058
ined+0.057
 accomplish+0.055
ble+0.053
Neg logit contributions
 Anna-0.040
 rats-0.032
 Ann-0.030
Anna-0.030
 Ana-0.028
Token triangles
Token ID44360
Feature activation-4.48e-02

Pos logit contributions
ully+0.043
ined+0.043
 accomplish+0.039
inkle+0.039
 idea+0.039
Neg logit contributions
 Anna-0.024
 rats-0.023
 chairs-0.019
 mats-0.017
 Ann-0.017
Token in
Token ID287
Feature activation-1.60e-02

Pos logit contributions
ully+0.066
ined+0.062
 ease+0.060
oving+0.058
opl+0.058
Neg logit contributions
 Anna-0.074
Anna-0.059
 rats-0.058
 mats-0.056
 to-0.055
Token the
Token ID262
Feature activation+4.56e-03

Pos logit contributions
ully+0.031
ined+0.030
 idea+0.030
 accomplish+0.028
agons+0.028
Neg logit contributions
 Anna-0.020
Anna-0.015
 rats-0.015
 Ann-0.014
 Ana-0.013
Token room
Token ID2119
Feature activation-1.50e-02

Pos logit contributions
 Anna+0.006
 rats+0.005
Anna+0.005
 mats+0.004
 Ann+0.004
Neg logit contributions
ully-0.008
 idea-0.008
ined-0.008
ble-0.008
 accomplish-0.007
Token?"
Token ID1701
Feature activation-2.28e-02

Pos logit contributions
ully+0.019
ined+0.016
agons+0.016
 idea+0.016
 ease+0.016
Neg logit contributions
 Anna-0.030
Anna-0.025
 Ann-0.025
 Megan-0.024
 rats-0.024
Token
Token ID198
Feature activation-9.28e-03

Pos logit contributions
ined+0.043
ully+0.042
 idea+0.041
agons+0.040
ble+0.040
Neg logit contributions
 Anna-0.035
 Ann-0.027
Anna-0.026
 Megan-0.025
 Ana-0.024
Token had
Token ID550
Feature activation-1.73e-02

Pos logit contributions
ully+0.010
 idea+0.010
opl+0.009
ble+0.009
 accomplish+0.009
Neg logit contributions
 Anna-0.008
Anna-0.007
 Ann-0.006
 mats-0.006
 rats-0.006
Token lots
Token ID6041
Feature activation+3.86e-03

Pos logit contributions
 idea+0.024
ully+0.023
ined+0.022
 ease+0.021
 accomplish+0.021
Neg logit contributions
 Anna-0.027
 rats-0.025
Anna-0.024
 Ann-0.023
 Megan-0.023
Token of
Token ID286
Feature activation-9.77e-03

Pos logit contributions
 Anna+0.004
 mats+0.003
 rats+0.003
Anna+0.003
 Ann+0.003
Neg logit contributions
ully-0.008
 idea-0.008
 accomplish-0.007
ble-0.007
ined-0.007
Token different
Token ID1180
Feature activation-1.77e-03

Pos logit contributions
ully+0.016
ble+0.015
 accomplish+0.015
ined+0.015
?'+0.015
Neg logit contributions
 Anna-0.015
 mats-0.012
 rats-0.012
 chairs-0.012
Anna-0.011
Token mar
Token ID1667
Feature activation-3.45e-02

Pos logit contributions
ully+0.003
ble+0.003
 accomplish+0.003
ined+0.003
inkle+0.002
Neg logit contributions
 Anna-0.003
 mats-0.002
 chairs-0.002
 rats-0.002
 beds-0.002
Tokenbles
Token ID7689
Feature activation-3.64e-02

Pos logit contributions
 idea+0.036
ined+0.029
ully+0.029
?'+0.028
asty+0.027
Neg logit contributions
 Anna-0.067
 mats-0.062
Anna-0.061
 rats-0.060
pter-0.059
Token and
Token ID290
Feature activation-6.97e-03

Pos logit contributions
ined+0.069
 idea+0.068
ully+0.066
inkle+0.066
 accomplish+0.063
Neg logit contributions
 Anna-0.042
 mats-0.035
 rats-0.034
 Ann-0.030
 chairs-0.030
Token he
Token ID339
Feature activation+3.32e-03

Pos logit contributions
inkle+0.012
ined+0.012
ully+0.012
 idea+0.011
 accomplish+0.011
Neg logit contributions
 Anna-0.010
 mats-0.008
 Ann-0.008
 rats-0.008
 chairs-0.007
Token loved
Token ID6151
Feature activation-2.29e-04

Pos logit contributions
 Anna+0.005
Anna+0.004
 Ann+0.003
 mats+0.003
 Jacob+0.003
Neg logit contributions
ully-0.006
opl-0.006
 accomplish-0.005
 idea-0.005
asty-0.005
Token collecting
Token ID13157
Feature activation-2.69e-02

Pos logit contributions
inkle+0.000
 idea+0.000
ined+0.000
ully+0.000
 accomplish+0.000
Neg logit contributions
 Anna-0.000
 Ann-0.000
 rats-0.000
 mats-0.000
 chairs-0.000
Token them
Token ID606
Feature activation-1.42e-02

Pos logit contributions
ully+0.045
ined+0.042
 idea+0.042
 accomplish+0.041
?'+0.040
Neg logit contributions
 Anna-0.040
 mats-0.033
 chairs-0.030
Anna-0.030
 rats-0.029