TOP ACTIVATIONS
MAX = 8.876

the water boils , the kettle makes a whistle . It
Anna and Ben take the kettle and put it behind their
train . They put a kettle on the stove . They
the stove , and a kettle on the counter , but
steam . They mind the kettle . They have fun .
was an old and rusty kettle . Molly picked it up
He says , " Ble ch ! Medicine is y ucky
" Ch oo - ch oo , I am the
sound . Ch oo - ch oo ! It goes around
and make noises like " ch oo - ch oo "
like " ch oo - ch oo " and " be
be ep , and scree ch . One day
He said , " Ble ch ! What is this ?
noises like cho o - ch oo and be ep -
noise . Ch oo - ch oo ! The
ez es . " A ch oo !" "
. " Ch oo - ch oo !" Tim and Sue
it their " monster - ch aser ." That
they play hop sc ot ch . They draw squares on
Tim took the monster - ch aser to bed with him

MIN ACTIVATIONS
MIN = -8.635

day that hard work and persever ance pay off in the
do it with practice and persever ance . <|endoftext|> Once upon
they also require effort and persever ance . They learned that
and judge someone 's worth based on their age . <|endoftext|>
of her toys into groups based on what she wanted to
day - one that was based on trust . They went
: don 't judge someone based on how they admire the
not nice to judge people based on their smell . The
when they saw the little sho el ace and knew that
, she quickly tied her sho el aces and ran away
she shouldn 't judge things based on their appearance . Sometimes
, they saw a tiny sho el ace wrapped around the
game to play while they sho pped . He liked to
. Let 's tie your sho el aces together so they
owners quickly unt ied the sho el ace and the depend
! Can you tie my sho el aces for me ?"
friends helped her loosen her sho el aces so she could
not nice to judge things based on their looks . She
pink dress and tie her sho el aces tight , just
and how she tied her sho el aces all by herself

INTERVAL 4.498 - 8.876
CONTAINS 0.378%

not . " A ch oo !" she sne ez ed
ran away from the pick le . They ran and laughed
never forget the things Av a had taught him . <|endoftext|>
asked Sam nervously . L ola looked around but she could
friends again . <|endoftext|> Once upon a time , there was

INTERVAL 0.120 - 4.498
CONTAINS 42.919%

he realized that he could always come back to the corn
Okay , let 's bake some gem cookies together ."
their hands . Ben and Lily pet ted the goose gently
furry animals , who all looked very happy to see her
with a ride to the zoo . He is very nice

INTERVAL -4.257 - 0.120
CONTAINS 55.702%

they were made . Tim my was so happy to have
They hoped the ball would stop talking and leave them alone
calendar in her room that showed all the days of the
with his toy truck . He loaded up rocks and happily
silly jokes . One day , the fox was out walking

INTERVAL -8.635 - -4.257
CONTAINS 1.001%

!" Lily said . " Maybe it will talk to us
fruits inside . They say thank you and goodbye to the
over to give her a hug . She told M andy
his grip . Bobby plummeted back down to the bottom of
decoration ! Mum smiled with pride and the two of them
Token the
Token ID262
Feature activation-2.93e-01

Pos logit contributions
resses+0.092
ashes+0.090
heads+0.088
angers+0.088
 banks+0.087
Neg logit contributions
 course-0.078
 forgiveness-0.060
 motorcycle-0.058
 thanks-0.057
 thank-0.057
Token water
Token ID1660
Feature activation+1.74e+00

Pos logit contributions
 course+0.009
 motorcycle+0.007
 forgiveness+0.007
 end+0.006
 narrative+0.006
Neg logit contributions
resses-0.014
uggets-0.013
nes-0.013
heads-0.013
ashes-0.013
Token boils
Token ID40169
Feature activation+4.64e+00

Pos logit contributions
resses+0.088
ashes+0.086
heads+0.085
angers+0.085
 banks+0.085
Neg logit contributions
 course-0.043
 forgiveness-0.029
 motorcycle-0.028
 narrative-0.027
 understands-0.027
Token,
Token ID11
Feature activation+1.32e-01

Pos logit contributions
resses+0.241
ashes+0.234
heads+0.234
angers+0.232
uggets+0.231
Neg logit contributions
 course-0.111
 forgiveness-0.072
 motorcycle-0.069
 narrative-0.068
 understands-0.068
Token the
Token ID262
Feature activation-2.88e-01

Pos logit contributions
resses+0.006
heads+0.006
ashes+0.006
angers+0.006
mary+0.006
Neg logit contributions
 course-0.004
 forgiveness-0.003
 thanks-0.003
 motorcycle-0.003
 magnet-0.003
Token kettle
Token ID40231
Feature activation+8.88e+00

Pos logit contributions
 course+0.009
 motorcycle+0.007
 end+0.007
 forgiveness+0.007
 magnet+0.007
Neg logit contributions
uggets-0.013
resses-0.013
nes-0.012
heads-0.012
uring-0.012
Token makes
Token ID1838
Feature activation+1.94e+00

Pos logit contributions
resses+0.499
heads+0.483
uggets+0.481
angers+0.480
ashes+0.478
Neg logit contributions
 course-0.175
 forgiveness-0.101
 understands-0.098
 narrative-0.095
 motorcycle-0.093
Token a
Token ID257
Feature activation+1.95e-01

Pos logit contributions
resses+0.109
angers+0.106
nes+0.105
heads+0.102
uggets+0.102
Neg logit contributions
 course-0.040
 again-0.033
 thanks-0.031
 end-0.030
 is-0.028
Token whistle
Token ID16121
Feature activation-3.17e-02

Pos logit contributions
angers+0.009
resses+0.009
nes+0.008
uggets+0.008
ashes+0.008
Neg logit contributions
 course-0.006
 thank-0.005
 motorcycle-0.005
 forgiveness-0.005
 magnet-0.005
Token.
Token ID13
Feature activation+6.77e-01

Pos logit contributions
 course+0.001
 thanks+0.001
 thank+0.001
 motorcycle+0.001
 forgiveness+0.000
Neg logit contributions
resses-0.002
angers-0.002
uggets-0.002
 banks-0.002
nes-0.002
Token It
Token ID632
Feature activation+4.68e-01

Pos logit contributions
heads+0.030
resses+0.029
 banks+0.029
nes+0.029
angers+0.028
Neg logit contributions
 course-0.020
 thanks-0.017
 thank-0.017
 understands-0.016
 forgiveness-0.016
Token Anna
Token ID11735
Feature activation+3.32e+00

Pos logit contributions
heads+0.031
 banks+0.030
resses+0.030
nes+0.030
uggets+0.029
Neg logit contributions
 course-0.019
 thank-0.015
 thanks-0.015
 understands-0.015
 forgiveness-0.015
Token and
Token ID290
Feature activation+6.28e-01

Pos logit contributions
resses+0.182
uggets+0.181
angers+0.179
heads+0.178
nes+0.177
Neg logit contributions
 course-0.068
 understands-0.047
 thanks-0.044
 motorcycle-0.036
 knows-0.035
Token Ben
Token ID3932
Feature activation+5.73e-01

Pos logit contributions
resses+0.023
ashes+0.023
heads+0.022
angers+0.022
 banks+0.022
Neg logit contributions
 course-0.024
 forgiveness-0.019
 motorcycle-0.019
 narrative-0.018
 understands-0.018
Token take
Token ID1011
Feature activation-1.88e+00

Pos logit contributions
angers+0.027
resses+0.027
uggets+0.027
heads+0.026
mary+0.025
Neg logit contributions
 course-0.017
 thanks-0.012
 thank-0.012
 understands-0.012
 forgive-0.012
Token the
Token ID262
Feature activation-2.46e-01

Pos logit contributions
 course+0.052
 thanks+0.038
 forgiveness+0.035
 again+0.035
 understands+0.035
Neg logit contributions
resses-0.090
angers-0.088
heads-0.087
uggets-0.086
nes-0.085
Token kettle
Token ID40231
Feature activation+8.87e+00

Pos logit contributions
 course+0.008
 motorcycle+0.006
 forgiveness+0.006
 thanks+0.006
 box+0.006
Neg logit contributions
resses-0.011
uggets-0.011
nes-0.011
angers-0.010
uring-0.010
Token and
Token ID290
Feature activation+6.35e-01

Pos logit contributions
resses+0.424
angers+0.411
uggets+0.401
nes+0.399
 banks+0.390
Neg logit contributions
 course-0.216
 again-0.173
 thanks-0.168
 forgiveness-0.163
 motorcycle-0.162
Token put
Token ID1234
Feature activation+8.27e-01

Pos logit contributions
resses+0.025
angers+0.024
heads+0.024
uggets+0.024
nes+0.024
Neg logit contributions
 course-0.023
 thanks-0.018
 forgiveness-0.018
 motorcycle-0.018
 understands-0.017
Token it
Token ID340
Feature activation+9.95e-01

Pos logit contributions
resses+0.041
heads+0.040
nes+0.040
uring+0.039
angers+0.039
Neg logit contributions
 course-0.022
 thanks-0.014
 forgiveness-0.014
 magnet-0.013
 motorcycle-0.013
Token behind
Token ID2157
Feature activation-1.19e+00

Pos logit contributions
resses+0.053
angers+0.051
 banks+0.051
uggets+0.051
ashes+0.051
Neg logit contributions
 course-0.019
 motorcycle-0.012
 magnet-0.012
 again-0.012
orn-0.012
Token their
Token ID511
Feature activation+2.76e+00

Pos logit contributions
 course+0.035
 thanks+0.024
 forgiveness+0.024
 motorcycle+0.022
 again+0.022
Neg logit contributions
resses-0.057
heads-0.056
angers-0.056
ashes-0.055
uggets-0.054
Token train
Token ID4512
Feature activation+1.09e+00

Pos logit contributions
 course+0.072
 forgiveness+0.058
orn+0.057
 narrative+0.056
 understands+0.056
Neg logit contributions
resses-0.065
ashes-0.064
heads-0.063
 banks-0.063
angers-0.062
Token.
Token ID13
Feature activation+7.72e-01

Pos logit contributions
uggets+0.051
resses+0.050
mary+0.048
angers+0.048
heads+0.048
Neg logit contributions
 course-0.032
 motorcycle-0.024
 thanks-0.022
 thank-0.021
 forgiveness-0.021
Token They
Token ID1119
Feature activation+2.72e+00

Pos logit contributions
heads+0.031
 banks+0.029
nes+0.029
angers+0.029
resses+0.029
Neg logit contributions
 course-0.027
 thank-0.022
 thanks-0.022
 understands-0.021
 forgiveness-0.021
Token put
Token ID1234
Feature activation+7.60e-01

Pos logit contributions
resses+0.108
heads+0.104
ashes+0.104
uggets+0.104
angers+0.104
Neg logit contributions
 course-0.097
 forgiveness-0.074
 motorcycle-0.074
 narrative-0.072
 understands-0.072
Token a
Token ID257
Feature activation+1.58e-01

Pos logit contributions
resses+0.029
ashes+0.028
heads+0.028
angers+0.028
 banks+0.027
Neg logit contributions
 course-0.028
 forgiveness-0.022
 motorcycle-0.022
 narrative-0.021
 understands-0.021
Token kettle
Token ID40231
Feature activation+8.83e+00

Pos logit contributions
resses+0.006
uggets+0.006
ashes+0.006
angers+0.006
mary+0.006
Neg logit contributions
 course-0.005
 motorcycle-0.004
 forgiveness-0.004
 magnet-0.004
 narrative-0.004
Token on
Token ID319
Feature activation-1.66e+00

Pos logit contributions
resses+0.460
 banks+0.447
uggets+0.444
heads+0.443
ashes+0.440
Neg logit contributions
 course-0.194
 motorcycle-0.124
 forgiveness-0.116
 thanks-0.114
 magnet-0.111
Token the
Token ID262
Feature activation-2.51e-01

Pos logit contributions
 course+0.043
 motorcycle+0.027
 magnet+0.025
 thanks+0.024
 thank+0.023
Neg logit contributions
ashes-0.086
heads-0.085
uggets-0.085
resses-0.084
angers-0.084
Token stove
Token ID27635
Feature activation+3.03e+00

Pos logit contributions
 course+0.009
 motorcycle+0.007
 forgiveness+0.007
 magnet+0.007
 narrative+0.007
Neg logit contributions
uggets-0.010
resses-0.010
ashes-0.010
angers-0.009
heads-0.009
Token.
Token ID13
Feature activation+8.44e-01

Pos logit contributions
resses+0.150
ashes+0.146
uggets+0.146
heads+0.145
angers+0.145
Neg logit contributions
 course-0.078
 motorcycle-0.053
 forgiveness-0.050
 thanks-0.049
 magnet-0.049
Token They
Token ID1119
Feature activation+2.78e+00

Pos logit contributions
heads+0.035
nes+0.034
 banks+0.033
uring+0.032
resses+0.032
Neg logit contributions
 course-0.027
 thank-0.024
 thanks-0.023
 understands-0.023
 forgiveness-0.021
Token the
Token ID262
Feature activation-1.94e-01

Pos logit contributions
 course+0.044
 magnet+0.024
 forgiveness+0.024
 motorcycle+0.023
 thanks+0.023
Neg logit contributions
ashes-0.098
heads-0.096
angers-0.095
resses-0.095
nes-0.094
Token stove
Token ID27635
Feature activation+2.79e+00

Pos logit contributions
 course+0.009
 motorcycle+0.008
 magnet+0.008
 forgiveness+0.008
 narrative+0.007
Neg logit contributions
resses-0.005
ashes-0.005
uggets-0.005
angers-0.005
allows-0.005
Token,
Token ID11
Feature activation+2.32e-01

Pos logit contributions
resses+0.121
ashes+0.118
heads+0.117
 banks+0.116
mary+0.116
Neg logit contributions
 course-0.087
 forgiveness-0.066
 narrative-0.064
 motorcycle-0.063
 understands-0.063
Token and
Token ID290
Feature activation+4.68e-01

Pos logit contributions
resses+0.011
ashes+0.011
angers+0.011
nes+0.011
 banks+0.011
Neg logit contributions
 course-0.007
 forgiveness-0.005
 thanks-0.005
 motorcycle-0.004
 understands-0.004
Token a
Token ID257
Feature activation+1.07e-01

Pos logit contributions
resses+0.020
ashes+0.020
angers+0.020
 banks+0.020
heads+0.020
Neg logit contributions
 course-0.015
 forgiveness-0.011
 understands-0.011
 motorcycle-0.011
 thanks-0.011
Token kettle
Token ID40231
Feature activation+8.70e+00

Pos logit contributions
resses+0.004
ashes+0.004
heads+0.004
angers+0.004
 banks+0.004
Neg logit contributions
 course-0.004
 forgiveness-0.003
 motorcycle-0.003
 narrative-0.003
 understands-0.003
Token on
Token ID319
Feature activation-1.73e+00

Pos logit contributions
resses+0.411
heads+0.395
ashes+0.393
 banks+0.390
angers+0.388
Neg logit contributions
 course-0.247
 forgiveness-0.171
 motorcycle-0.166
 thanks-0.165
 narrative-0.161
Token the
Token ID262
Feature activation-1.59e-01

Pos logit contributions
 course+0.041
 magnet+0.021
 forgiveness+0.021
 motorcycle+0.021
 thanks+0.021
Neg logit contributions
ashes-0.097
heads-0.095
angers-0.094
resses-0.094
nes-0.093
Token counter
Token ID3753
Feature activation+4.29e+00

Pos logit contributions
 course+0.007
 motorcycle+0.006
 magnet+0.006
 forgiveness+0.006
 end+0.006
Neg logit contributions
resses-0.005
ashes-0.005
uggets-0.005
angers-0.005
heads-0.004
Token,
Token ID11
Feature activation+2.47e-01

Pos logit contributions
resses+0.197
ashes+0.191
heads+0.191
angers+0.189
uggets+0.188
Neg logit contributions
 course-0.126
 forgiveness-0.090
 motorcycle-0.087
 understands-0.086
 narrative-0.085
Token but
Token ID475
Feature activation-9.43e-02

Pos logit contributions
resses+0.012
ashes+0.012
nes+0.012
angers+0.012
 banks+0.012
Neg logit contributions
 course-0.006
 thanks-0.004
 forgiveness-0.004
 understands-0.004
 motorcycle-0.004
Token steam
Token ID13324
Feature activation+3.14e+00

Pos logit contributions
resses+0.032
ashes+0.032
 banks+0.031
mary+0.031
uggets+0.031
Neg logit contributions
 course-0.029
 narrative-0.023
orn-0.023
 forgiveness-0.023
 understands-0.022
Token.
Token ID13
Feature activation+3.91e-01

Pos logit contributions
resses+0.136
ashes+0.133
heads+0.131
angers+0.131
 banks+0.130
Neg logit contributions
 course-0.100
 forgiveness-0.074
 motorcycle-0.072
 narrative-0.072
 understands-0.071
Token They
Token ID1119
Feature activation+2.27e+00

Pos logit contributions
heads+0.014
resses+0.014
angers+0.014
 banks+0.014
nes+0.013
Neg logit contributions
 course-0.015
 thanks-0.013
 understands-0.013
 thank-0.012
 forgiveness-0.012
Token mind
Token ID2000
Feature activation-1.66e+00

Pos logit contributions
resses+0.093
heads+0.090
ashes+0.090
angers+0.090
uggets+0.089
Neg logit contributions
 course-0.078
 forgiveness-0.059
 motorcycle-0.058
 narrative-0.057
 understands-0.057
Token the
Token ID262
Feature activation-4.59e-01

Pos logit contributions
 course+0.051
 motorcycle+0.038
 forgiveness+0.038
 understands+0.037
 thanks+0.037
Neg logit contributions
resses-0.073
heads-0.071
uggets-0.071
ashes-0.070
 banks-0.070
Token kettle
Token ID40231
Feature activation+8.63e+00

Pos logit contributions
 course+0.018
 motorcycle+0.014
 forgiveness+0.014
 narrative+0.013
 thanks+0.013
Neg logit contributions
resses-0.018
nes-0.018
uggets-0.018
uring-0.017
heads-0.017
Token.
Token ID13
Feature activation+4.32e-01

Pos logit contributions
resses+0.418
heads+0.405
uggets+0.403
angers+0.402
ashes+0.399
Neg logit contributions
 course-0.232
 forgiveness-0.162
 thanks-0.157
 motorcycle-0.154
 understands-0.151
Token They
Token ID1119
Feature activation+2.27e+00

Pos logit contributions
heads+0.015
resses+0.015
 banks+0.015
angers+0.015
nes+0.014
Neg logit contributions
 course-0.017
 understands-0.014
 thanks-0.014
 forgiveness-0.014
 thank-0.014
Token have
Token ID423
Feature activation+7.43e-01

Pos logit contributions
resses+0.097
angers+0.096
uggets+0.096
heads+0.095
mary+0.091
Neg logit contributions
 course-0.075
 thanks-0.056
 thank-0.056
 forgiveness-0.055
 forgive-0.055
Token fun
Token ID1257
Feature activation-3.50e-01

Pos logit contributions
resses+0.030
angers+0.030
uring+0.030
heads+0.030
uggets+0.029
Neg logit contributions
 course-0.026
 thanks-0.020
 forgiveness-0.020
 understands-0.020
 friendship-0.019
Token.
Token ID13
Feature activation+4.56e-01

Pos logit contributions
 course+0.010
 forgiveness+0.007
 motorcycle+0.007
 thanks+0.007
 magnet+0.006
Neg logit contributions
resses-0.016
heads-0.016
mary-0.016
ashes-0.016
 banks-0.016
Token was
Token ID373
Feature activation-1.53e+00

Pos logit contributions
 course+0.020
 understands+0.010
orn+0.009
 forgiveness+0.009
 thanks+0.009
Neg logit contributions
resses-0.070
uggets-0.070
angers-0.069
ashes-0.068
heads-0.068
Token an
Token ID281
Feature activation-1.97e+00

Pos logit contributions
 course+0.059
 forgiveness+0.046
 motorcycle+0.045
 narrative+0.044
 understands+0.044
Neg logit contributions
resses-0.057
ashes-0.056
heads-0.055
angers-0.055
 banks-0.054
Token old
Token ID1468
Feature activation-5.74e-01

Pos logit contributions
 course+0.079
 forgiveness+0.063
 understands+0.062
 motorcycle+0.062
 narrative+0.062
Neg logit contributions
resses-0.069
ashes-0.066
heads-0.066
 banks-0.066
angers-0.065
Token and
Token ID290
Feature activation-4.84e-01

Pos logit contributions
 course+0.019
 forgiveness+0.014
 narrative+0.014
 understands+0.014
father+0.013
Neg logit contributions
resses-0.024
 banks-0.023
ashes-0.023
mary-0.023
heads-0.023
Token rusty
Token ID47855
Feature activation-1.19e+00

Pos logit contributions
 course+0.019
 forgiveness+0.015
 motorcycle+0.015
 magnet+0.014
 understands+0.014
Neg logit contributions
ashes-0.018
resses-0.018
angers-0.018
heads-0.017
uggets-0.017
Token kettle
Token ID40231
Feature activation+8.09e+00

Pos logit contributions
 course+0.041
 narrative+0.032
 understands+0.031
 forgiveness+0.031
father+0.031
Neg logit contributions
 banks-0.047
resses-0.047
mary-0.046
ashes-0.045
heads-0.045
Token.
Token ID13
Feature activation-7.73e-01

Pos logit contributions
resses+0.370
ashes+0.361
heads+0.360
angers+0.355
uggets+0.355
Neg logit contributions
 course-0.240
 forgiveness-0.172
 motorcycle-0.167
 narrative-0.163
 understands-0.162
Token Molly
Token ID30236
Feature activation+3.55e+00

Pos logit contributions
 course+0.025
 understands+0.022
 thanks+0.021
 thank+0.019
 again+0.019
Neg logit contributions
heads-0.034
resses-0.031
nes-0.031
allows-0.030
uggets-0.030
Token picked
Token ID6497
Feature activation-5.48e-01

Pos logit contributions
resses+0.176
ashes+0.175
angers+0.175
uggets+0.173
heads+0.170
Neg logit contributions
 course-0.092
 understands-0.065
 motorcycle-0.060
 forgiveness-0.060
 magnet-0.059
Token it
Token ID340
Feature activation+3.52e-01

Pos logit contributions
 course+0.017
 understands+0.012
 thanks+0.012
orn+0.012
 forgiveness+0.011
Neg logit contributions
heads-0.025
resses-0.025
nes-0.025
angers-0.024
ashes-0.024
Token up
Token ID510
Feature activation+1.62e-01

Pos logit contributions
resses+0.012
ashes+0.012
heads+0.012
angers+0.012
uggets+0.012
Neg logit contributions
 course-0.014
 forgiveness-0.011
 motorcycle-0.011
 understands-0.011
orn-0.011
Token He
Token ID679
Feature activation-1.90e+00

Pos logit contributions
resses+0.032
ashes+0.032
heads+0.031
angers+0.031
 banks+0.031
Neg logit contributions
 course-0.024
 forgiveness-0.017
 motorcycle-0.017
 narrative-0.017
 understands-0.017
Token says
Token ID1139
Feature activation+1.01e+00

Pos logit contributions
 course+0.057
 forgiveness+0.041
 motorcycle+0.040
 understands+0.040
 narrative+0.039
Neg logit contributions
resses-0.086
ashes-0.084
heads-0.084
angers-0.083
uggets-0.083
Token,
Token ID11
Feature activation+5.27e-01

Pos logit contributions
resses+0.052
ashes+0.051
heads+0.050
mary+0.050
uggets+0.050
Neg logit contributions
 course-0.024
 narrative-0.015
 forgiveness-0.015
 motorcycle-0.015
orn-0.015
Token "
Token ID366
Feature activation+1.31e+00

Pos logit contributions
resses+0.021
heads+0.020
mary+0.020
angers+0.020
ashes+0.020
Neg logit contributions
 course-0.019
 forgiveness-0.014
 narrative-0.014
orn-0.014
 motorcycle-0.014
TokenBle
Token ID43413
Feature activation-1.38e+00

Pos logit contributions
resses+0.056
ashes+0.055
heads+0.054
angers+0.054
 banks+0.054
Neg logit contributions
 course-0.043
 forgiveness-0.032
 motorcycle-0.031
 narrative-0.031
 understands-0.031
Tokench
Token ID354
Feature activation+7.75e+00

Pos logit contributions
 course+0.070
 forgiveness+0.058
 motorcycle+0.058
 narrative+0.057
 understands+0.057
Neg logit contributions
resses-0.035
ashes-0.033
heads-0.033
angers-0.032
 banks-0.032
Token!
Token ID0
Feature activation-1.14e-01

Pos logit contributions
resses+0.344
ashes+0.336
heads+0.334
angers+0.330
mary+0.330
Neg logit contributions
 course-0.235
 forgiveness-0.174
 motorcycle-0.170
 narrative-0.168
 understands-0.166
Token Medicine
Token ID11558
Feature activation+5.13e-01

Pos logit contributions
 course+0.004
 forgiveness+0.003
 understands+0.003
 motorcycle+0.003
 magnet+0.003
Neg logit contributions
resses-0.005
 banks-0.005
heads-0.005
mary-0.005
ashes-0.005
Token is
Token ID318
Feature activation-1.71e-01

Pos logit contributions
resses+0.030
heads+0.029
ashes+0.029
angers+0.029
uggets+0.029
Neg logit contributions
 course-0.009
 forgiveness-0.004
 motorcycle-0.004
 understands-0.004
orn-0.004
Token y
Token ID331
Feature activation-1.28e+00

Pos logit contributions
 course+0.006
 forgiveness+0.005
 thanks+0.005
 understands+0.004
 motorcycle+0.004
Neg logit contributions
resses-0.007
angers-0.007
ashes-0.007
heads-0.007
nes-0.007
Tokenucky
Token ID5309
Feature activation+3.57e+00

Pos logit contributions
 course+0.052
 forgiveness+0.042
 motorcycle+0.041
 understands+0.041
 narrative+0.041
Neg logit contributions
resses-0.044
ashes-0.043
heads-0.042
uggets-0.042
angers-0.041
Token
Token ID198
Feature activation+3.50e-01

Pos logit contributions
resses+0.018
angers+0.018
ashes+0.018
heads+0.018
uggets+0.018
Neg logit contributions
 course-0.010
 understands-0.007
 forgiveness-0.007
orn-0.006
 motorcycle-0.006
Token"
Token ID1
Feature activation+2.42e+00

Pos logit contributions
angers+0.017
resses+0.017
ashes+0.016
ksh+0.016
uggets+0.016
Neg logit contributions
 course-0.009
 forgive-0.006
 end-0.006
 narrative-0.006
Fore-0.006
TokenCh
Token ID1925
Feature activation+3.14e+00

Pos logit contributions
resses+0.096
ashes+0.094
heads+0.093
mary+0.092
 banks+0.092
Neg logit contributions
 course-0.085
 forgiveness-0.067
 motorcycle-0.065
 understands-0.063
orn-0.063
Tokenoo
Token ID2238
Feature activation+5.15e+00

Pos logit contributions
resses+0.093
ashes+0.090
heads+0.088
angers+0.088
 banks+0.087
Neg logit contributions
 course-0.143
 forgiveness-0.118
 motorcycle-0.116
 narrative-0.115
 understands-0.114
Token-
Token ID12
Feature activation+1.62e+00

Pos logit contributions
resses+0.256
uggets+0.254
angers+0.246
ashes+0.244
heads+0.242
Neg logit contributions
 course-0.143
 motorcycle-0.093
 forgiveness-0.089
 thanks-0.088
 understands-0.088
Tokench
Token ID354
Feature activation+7.75e+00

Pos logit contributions
 banks+0.057
 thirsty+0.055
uring+0.054
uggets+0.054
resses+0.054
Neg logit contributions
 course-0.066
 motorcycle-0.053
orn-0.048
in-0.048
father-0.047
Tokenoo
Token ID2238
Feature activation+5.12e+00

Pos logit contributions
resses+0.251
ashes+0.238
heads+0.235
mary+0.235
angers+0.234
Neg logit contributions
 course-0.340
 forgiveness-0.270
 motorcycle-0.265
orn-0.261
 understands-0.259
Token,
Token ID11
Feature activation+4.04e-01

Pos logit contributions
uggets+0.262
resses+0.258
angers+0.251
ashes+0.245
 banks+0.242
Neg logit contributions
 course-0.138
 motorcycle-0.088
 thank-0.086
 thanks-0.085
 hero-0.082
Token I
Token ID314
Feature activation-9.22e-01

Pos logit contributions
resses+0.016
uggets+0.016
angers+0.016
ashes+0.015
mary+0.015
Neg logit contributions
 course-0.015
 motorcycle-0.011
 thank-0.011
 forgiveness-0.011
 thanks-0.011
Token am
Token ID716
Feature activation+1.36e+00

Pos logit contributions
 course+0.029
 forgive+0.021
 understands+0.020
 thanks+0.020
 thank+0.019
Neg logit contributions
resses-0.043
angers-0.042
uggets-0.041
heads-0.040
uring-0.040
Token the
Token ID262
Feature activation-1.77e-01

Pos logit contributions
 banks+0.075
ashes+0.070
angers+0.070
resses+0.070
heads+0.069
Neg logit contributions
 thanks-0.028
 thank-0.027
 course-0.025
,-0.019
 his-0.019
Token sound
Token ID2128
Feature activation+1.13e+00

Pos logit contributions
uggets+0.008
resses+0.008
angers+0.008
mary+0.007
ashes+0.007
Neg logit contributions
 course-0.009
 motorcycle-0.007
 forgiveness-0.007
 thank-0.007
 thanks-0.007
Token.
Token ID13
Feature activation+6.34e-01

Pos logit contributions
resses+0.051
ashes+0.049
heads+0.049
angers+0.049
uggets+0.049
Neg logit contributions
 course-0.034
 forgiveness-0.025
 motorcycle-0.024
 understands-0.023
 narrative-0.023
Token Ch
Token ID609
Feature activation+3.20e+00

Pos logit contributions
heads+0.028
resses+0.027
nes+0.026
 banks+0.026
mary+0.025
Neg logit contributions
 course-0.020
 forgiveness-0.016
 thank-0.016
 thanks-0.015
 understands-0.015
Tokenoo
Token ID2238
Feature activation+5.11e+00

Pos logit contributions
resses+0.109
heads+0.107
 banks+0.104
uggets+0.104
mary+0.103
Neg logit contributions
 course-0.132
 forgiveness-0.104
 narrative-0.101
 understands-0.099
 motorcycle-0.099
Token-
Token ID12
Feature activation+1.52e+00

Pos logit contributions
uggets+0.270
resses+0.264
heads+0.254
 banks+0.251
angers+0.251
Neg logit contributions
 course-0.125
 understands-0.085
 thanks-0.082
 motorcycle-0.076
 thank-0.074
Tokench
Token ID354
Feature activation+7.74e+00

Pos logit contributions
 banks+0.055
uggets+0.052
resses+0.052
uring+0.052
 thirsty+0.051
Neg logit contributions
 course-0.061
 motorcycle-0.048
orn-0.047
in-0.046
 understands-0.046
Tokenoo
Token ID2238
Feature activation+5.08e+00

Pos logit contributions
resses+0.303
ashes+0.291
heads+0.289
uggets+0.287
mary+0.287
Neg logit contributions
 course-0.283
 forgiveness-0.217
 motorcycle-0.213
 understands-0.210
orn-0.208
Token!
Token ID0
Feature activation-3.14e-01

Pos logit contributions
uggets+0.270
resses+0.263
heads+0.253
 banks+0.251
angers+0.251
Neg logit contributions
 course-0.122
 understands-0.083
 thanks-0.081
 motorcycle-0.074
 thank-0.073
Token It
Token ID632
Feature activation+4.24e-01

Pos logit contributions
 course+0.007
 thank+0.006
 understands+0.006
 thanks+0.006
 forgive+0.005
Neg logit contributions
resses-0.016
heads-0.016
uring-0.015
 banks-0.015
nes-0.015
Token goes
Token ID2925
Feature activation-1.38e+00

Pos logit contributions
uggets+0.024
resses+0.023
angers+0.023
heads+0.022
uring+0.022
Neg logit contributions
 course-0.010
 understands-0.007
 thanks-0.007
 is-0.006
 knows-0.006
Token around
Token ID1088
Feature activation+2.01e+00

Pos logit contributions
 course+0.038
 again+0.029
 motorcycle+0.029
 hero+0.027
 thanks+0.027
Neg logit contributions
uggets-0.064
allows-0.062
ender-0.061
 banks-0.061
resses-0.060
Token and
Token ID290
Feature activation+3.46e-01

Pos logit contributions
nes+0.020
resses+0.020
angers+0.020
uggets+0.020
heads+0.020
Neg logit contributions
 course-0.014
 thanks-0.011
 motorcycle-0.011
 thank-0.010
 again-0.010
Token make
Token ID787
Feature activation+1.02e+00

Pos logit contributions
angers+0.015
resses+0.015
uggets+0.015
heads+0.015
mary+0.014
Neg logit contributions
 course-0.011
 thank-0.009
 thanks-0.008
 motorcycle-0.008
 forgiveness-0.008
Token noises
Token ID26782
Feature activation+4.53e+00

Pos logit contributions
uggets+0.047
mary+0.045
angers+0.045
resses+0.044
heads+0.044
Neg logit contributions
 course-0.031
 thanks-0.024
 motorcycle-0.024
 is-0.023
 understands-0.023
Token like
Token ID588
Feature activation+2.78e+00

Pos logit contributions
resses+0.204
uggets+0.203
angers+0.201
 banks+0.198
mary+0.197
Neg logit contributions
 course-0.136
 motorcycle-0.102
 thanks-0.099
 forgiveness-0.095
 magnet-0.094
Token "
Token ID366
Feature activation+1.20e+00

Pos logit contributions
resses+0.109
heads+0.109
uggets+0.108
ashes+0.108
 banks+0.106
Neg logit contributions
 course-0.101
 motorcycle-0.076
 forgiveness-0.075
 thanks-0.072
 magnet-0.071
Tokench
Token ID354
Feature activation+7.74e+00

Pos logit contributions
resses+0.040
 banks+0.039
uggets+0.039
mary+0.038
angers+0.038
Neg logit contributions
 course-0.050
 motorcycle-0.039
orn-0.039
 magnet-0.037
 forgiveness-0.037
Tokenoo
Token ID2238
Feature activation+5.24e+00

Pos logit contributions
resses+0.284
ashes+0.274
uggets+0.271
angers+0.270
mary+0.269
Neg logit contributions
 course-0.310
 motorcycle-0.227
 forgiveness-0.223
orn-0.221
 magnet-0.219
Token-
Token ID12
Feature activation+1.63e+00

Pos logit contributions
uggets+0.211
resses+0.209
angers+0.205
ashes+0.203
 thirsty+0.201
Neg logit contributions
 course-0.187
 motorcycle-0.139
 thank-0.133
 thanks-0.127
 forgiveness-0.125
Tokench
Token ID354
Feature activation+7.73e+00

Pos logit contributions
uggets+0.053
 thirsty+0.052
 banks+0.051
angers+0.051
resses+0.051
Neg logit contributions
 course-0.069
 motorcycle-0.056
orn-0.050
smart-0.049
in-0.049
Tokenoo
Token ID2238
Feature activation+5.21e+00

Pos logit contributions
resses+0.270
ashes+0.260
angers+0.257
uggets+0.255
mary+0.255
Neg logit contributions
 course-0.314
 motorcycle-0.240
orn-0.232
 forgiveness-0.232
 magnet-0.230
Token"
Token ID1
Feature activation+2.31e+00

Pos logit contributions
uggets+0.207
resses+0.207
angers+0.205
ashes+0.201
 thirsty+0.200
Neg logit contributions
 course-0.183
 motorcycle-0.138
 thank-0.132
 thanks-0.126
 hero-0.124
Token like
Token ID588
Feature activation+2.78e+00

Pos logit contributions
resses+0.204
uggets+0.203
angers+0.201
 banks+0.198
mary+0.197
Neg logit contributions
 course-0.136
 motorcycle-0.102
 thanks-0.099
 forgiveness-0.095
 magnet-0.094
Token "
Token ID366
Feature activation+1.20e+00

Pos logit contributions
resses+0.109
heads+0.109
uggets+0.108
ashes+0.108
 banks+0.106
Neg logit contributions
 course-0.101
 motorcycle-0.076
 forgiveness-0.075
 thanks-0.072
 magnet-0.071
Tokench
Token ID354
Feature activation+7.74e+00

Pos logit contributions
resses+0.040
 banks+0.039
uggets+0.039
mary+0.038
angers+0.038
Neg logit contributions
 course-0.050
 motorcycle-0.039
orn-0.039
 magnet-0.037
 forgiveness-0.037
Tokenoo
Token ID2238
Feature activation+5.24e+00

Pos logit contributions
resses+0.284
ashes+0.274
uggets+0.271
angers+0.270
mary+0.269
Neg logit contributions
 course-0.310
 motorcycle-0.227
 forgiveness-0.223
orn-0.221
 magnet-0.219
Token-
Token ID12
Feature activation+1.63e+00

Pos logit contributions
uggets+0.211
resses+0.209
angers+0.205
ashes+0.203
 thirsty+0.201
Neg logit contributions
 course-0.187
 motorcycle-0.139
 thank-0.133
 thanks-0.127
 forgiveness-0.125
Tokench
Token ID354
Feature activation+7.73e+00

Pos logit contributions
uggets+0.053
 thirsty+0.052
 banks+0.051
angers+0.051
resses+0.051
Neg logit contributions
 course-0.069
 motorcycle-0.056
orn-0.050
smart-0.049
in-0.049
Tokenoo
Token ID2238
Feature activation+5.21e+00

Pos logit contributions
resses+0.270
ashes+0.260
angers+0.257
uggets+0.255
mary+0.255
Neg logit contributions
 course-0.314
 motorcycle-0.240
orn-0.232
 forgiveness-0.232
 magnet-0.230
Token"
Token ID1
Feature activation+2.31e+00

Pos logit contributions
uggets+0.207
resses+0.207
angers+0.205
ashes+0.201
 thirsty+0.200
Neg logit contributions
 course-0.183
 motorcycle-0.138
 thank-0.132
 thanks-0.126
 hero-0.124
Token and
Token ID290
Feature activation+3.80e-01

Pos logit contributions
resses+0.108
 banks+0.107
heads+0.106
ashes+0.105
uggets+0.104
Neg logit contributions
 course-0.059
 motorcycle-0.043
 forgiveness-0.042
 thanks-0.040
orn-0.040
Token "
Token ID366
Feature activation+1.23e+00

Pos logit contributions
uggets+0.017
resses+0.016
heads+0.016
angers+0.016
ashes+0.016
Neg logit contributions
 course-0.012
 thank-0.009
 motorcycle-0.009
 understands-0.009
 thanks-0.009
Tokenbe
Token ID1350
Feature activation+6.31e+00

Pos logit contributions
resses+0.046
mary+0.045
 banks+0.045
angers+0.045
uggets+0.045
Neg logit contributions
 course-0.047
 motorcycle-0.035
orn-0.035
 narrative-0.032
 forgiveness-0.032
Token be
Token ID307
Feature activation+8.28e-01

Pos logit contributions
resses+0.016
uggets+0.016
mary+0.016
angers+0.016
heads+0.016
Neg logit contributions
 course-0.016
 forgiveness-0.012
 motorcycle-0.012
 narrative-0.012
 thanks-0.012
Tokenep
Token ID538
Feature activation+3.01e+00

Pos logit contributions
 banks+0.041
resses+0.041
ender+0.040
angers+0.040
uring+0.040
Neg logit contributions
 course-0.020
 thanks-0.013
 motorcycle-0.013
smart-0.012
ma-0.012
Token,
Token ID11
Feature activation+3.97e-01

Pos logit contributions
resses+0.151
ashes+0.146
heads+0.146
uggets+0.146
mary+0.145
Neg logit contributions
 course-0.075
 forgiveness-0.050
 motorcycle-0.049
 understands-0.048
 narrative-0.047
Token and
Token ID290
Feature activation+6.94e-01

Pos logit contributions
resses+0.016
ashes+0.015
heads+0.015
uggets+0.015
angers+0.015
Neg logit contributions
 course-0.014
 forgiveness-0.011
 motorcycle-0.011
 narrative-0.011
 understands-0.011
Token scree
Token ID43465
Feature activation-1.11e+00

Pos logit contributions
resses+0.027
ashes+0.026
angers+0.026
uggets+0.026
heads+0.025
Neg logit contributions
 course-0.026
 forgiveness-0.020
 motorcycle-0.020
 narrative-0.020
 understands-0.019
Tokench
Token ID354
Feature activation+7.70e+00

Pos logit contributions
 course+0.046
 thanks+0.036
smart+0.035
 understands+0.035
orn+0.035
Neg logit contributions
uggets-0.039
resses-0.036
 thirsty-0.035
mary-0.034
 banks-0.034
Token.
Token ID13
Feature activation+7.40e-01

Pos logit contributions
resses+0.353
uggets+0.349
mary+0.346
heads+0.342
 banks+0.342
Neg logit contributions
 course-0.222
 motorcycle-0.152
 forgiveness-0.147
orn-0.142
 narrative-0.141
Token
Token ID198
Feature activation+2.90e-01

Pos logit contributions
heads+0.031
 banks+0.031
resses+0.031
mary+0.030
uring+0.030
Neg logit contributions
 course-0.023
 forgiveness-0.018
 forgive-0.018
 understands-0.017
 narrative-0.017
Token
Token ID198
Feature activation+2.74e-01

Pos logit contributions
resses+0.013
ashes+0.013
heads+0.013
angers+0.013
 banks+0.013
Neg logit contributions
 course-0.009
 forgiveness-0.006
 understands-0.006
 motorcycle-0.006
 narrative-0.006
TokenOne
Token ID3198
Feature activation+2.63e+00

Pos logit contributions
resses+0.012
angers+0.012
ashes+0.012
 banks+0.012
mary+0.011
Neg logit contributions
 course-0.009
 narrative-0.007
 forgive-0.006
 forgiveness-0.006
 motorcycle-0.006
Token day
Token ID1110
Feature activation+5.84e-01

Pos logit contributions
resses+0.116
ashes+0.112
heads+0.112
angers+0.112
 banks+0.111
Neg logit contributions
 course-0.083
 forgiveness-0.062
 motorcycle-0.060
 narrative-0.059
 understands-0.059
Token He
Token ID679
Feature activation-1.91e+00

Pos logit contributions
resses+0.046
ashes+0.044
heads+0.044
angers+0.044
uggets+0.044
Neg logit contributions
 course-0.030
 forgiveness-0.021
 motorcycle-0.021
 narrative-0.021
 understands-0.020
Token said
Token ID531
Feature activation+1.29e+00

Pos logit contributions
 course+0.062
 forgiveness+0.045
 understands+0.045
 motorcycle+0.045
 narrative+0.044
Neg logit contributions
resses-0.083
angers-0.081
ashes-0.080
heads-0.080
uggets-0.080
Token,
Token ID11
Feature activation+4.49e-01

Pos logit contributions
resses+0.059
uggets+0.059
ashes+0.059
mary+0.059
allows+0.059
Neg logit contributions
 course-0.036
father-0.025
 narrative-0.025
 motorcycle-0.024
orn-0.024
Token "
Token ID366
Feature activation+1.31e+00

Pos logit contributions
resses+0.017
angers+0.017
mary+0.017
heads+0.017
ashes+0.017
Neg logit contributions
 course-0.017
 narrative-0.013
 forgiveness-0.012
orn-0.012
father-0.012
TokenBle
Token ID43413
Feature activation-1.47e+00

Pos logit contributions
resses+0.064
angers+0.063
uring+0.062
mary+0.061
 banks+0.061
Neg logit contributions
 course-0.036
 narrative-0.026
orn-0.025
 motorcycle-0.024
 forgiveness-0.024
Tokench
Token ID354
Feature activation+7.65e+00

Pos logit contributions
 course+0.073
 forgiveness+0.061
 motorcycle+0.060
orn+0.060
 understands+0.060
Neg logit contributions
resses-0.038
heads-0.036
 banks-0.036
uggets-0.036
ashes-0.035
Token!
Token ID0
Feature activation+2.08e-01

Pos logit contributions
resses+0.331
ashes+0.324
heads+0.321
angers+0.318
mary+0.318
Neg logit contributions
 course-0.237
 forgiveness-0.179
 motorcycle-0.174
 narrative-0.174
 understands-0.172
Token What
Token ID1867
Feature activation-2.79e+00

Pos logit contributions
 banks+0.009
uring+0.009
heads+0.009
resses+0.009
nes+0.009
Neg logit contributions
 course-0.006
 forgiveness-0.005
 understands-0.005
 magnet-0.004
 thank-0.004
Token is
Token ID318
Feature activation-2.20e-01

Pos logit contributions
 course+0.096
father+0.079
ously+0.079
 policeman+0.072
ement+0.072
Neg logit contributions
allows-0.102
resses-0.102
 thirsty-0.100
uggets-0.100
ashes-0.098
Token this
Token ID428
Feature activation+4.76e-02

Pos logit contributions
 course+0.007
 forgiveness+0.005
 motorcycle+0.005
 narrative+0.005
 magnet+0.005
Neg logit contributions
resses-0.009
heads-0.009
ashes-0.009
 banks-0.009
angers-0.009
Token?
Token ID30
Feature activation-3.41e-01

Pos logit contributions
resses+0.002
ashes+0.002
heads+0.002
angers+0.002
 banks+0.002
Neg logit contributions
 course-0.001
 forgiveness-0.001
 motorcycle-0.001
 understands-0.001
orn-0.001
Token noises
Token ID26782
Feature activation+4.67e+00

Pos logit contributions
uggets+0.044
mary+0.042
resses+0.042
angers+0.041
heads+0.041
Neg logit contributions
 course-0.031
 thanks-0.024
 motorcycle-0.024
 is-0.023
 understands-0.023
Token like
Token ID588
Feature activation+2.81e+00

Pos logit contributions
uggets+0.221
resses+0.220
angers+0.217
mary+0.215
 banks+0.214
Neg logit contributions
 course-0.130
 motorcycle-0.095
 thanks-0.090
 forgiveness-0.088
 magnet-0.086
Token cho
Token ID1727
Feature activation+2.52e+00

Pos logit contributions
uggets+0.108
resses+0.107
heads+0.107
ashes+0.106
mary+0.104
Neg logit contributions
 course-0.105
 motorcycle-0.081
 forgiveness-0.079
 thanks-0.076
 magnet-0.074
Tokeno
Token ID78
Feature activation+2.92e+00

Pos logit contributions
 banks+0.091
resses+0.091
uring+0.089
mary+0.088
ashes+0.087
Neg logit contributions
 course-0.094
 motorcycle-0.073
orn-0.072
father-0.071
 forgiveness-0.069
Token-
Token ID12
Feature activation+1.72e+00

Pos logit contributions
uggets+0.135
resses+0.133
angers+0.130
mary+0.128
 banks+0.128
Neg logit contributions
 course-0.084
 motorcycle-0.063
 thanks-0.055
 magnet-0.054
igh-0.053
Tokench
Token ID354
Feature activation+7.64e+00

Pos logit contributions
uggets+0.060
 thirsty+0.059
uring+0.058
 banks+0.057
resses+0.056
Neg logit contributions
 course-0.068
 motorcycle-0.056
smart-0.050
 fixes-0.049
orn-0.048
Tokenoo
Token ID2238
Feature activation+4.97e+00

Pos logit contributions
uggets+0.283
resses+0.280
mary+0.279
 banks+0.275
 thirsty+0.273
Neg logit contributions
 course-0.282
 motorcycle-0.211
orn-0.198
mate-0.195
 thank-0.194
Token and
Token ID290
Feature activation+3.78e-01

Pos logit contributions
uggets+0.230
resses+0.221
 thirsty+0.215
mary+0.214
angers+0.213
Neg logit contributions
 course-0.144
 motorcycle-0.109
 thanks-0.100
 hero-0.095
 thank-0.095
Token be
Token ID307
Feature activation+8.17e-01

Pos logit contributions
uggets+0.017
resses+0.017
angers+0.016
ashes+0.016
mary+0.016
Neg logit contributions
 course-0.012
 motorcycle-0.009
 thank-0.009
 thanks-0.008
 magnet-0.008
Tokenep
Token ID538
Feature activation+2.93e+00

Pos logit contributions
 banks+0.039
mary+0.038
resses+0.038
ender+0.038
angers+0.038
Neg logit contributions
 course-0.020
 motorcycle-0.016
 thanks-0.014
smart-0.014
 magnet-0.013
Token-
Token ID12
Feature activation+1.75e+00

Pos logit contributions
resses+0.133
uggets+0.130
mary+0.130
 banks+0.127
angers+0.127
Neg logit contributions
 course-0.083
 motorcycle-0.062
 thanks-0.059
 forgiveness-0.059
 understands-0.059
Token noise
Token ID7838
Feature activation+3.45e+00

Pos logit contributions
uggets+0.038
resses+0.037
angers+0.037
mary+0.036
heads+0.036
Neg logit contributions
 course-0.026
 thanks-0.021
 thank-0.020
 again-0.020
 friendship-0.019
Token.
Token ID13
Feature activation+5.15e-01

Pos logit contributions
resses+0.150
ashes+0.146
heads+0.145
angers+0.143
 banks+0.143
Neg logit contributions
 course-0.111
 forgiveness-0.083
 narrative-0.080
 motorcycle-0.080
orn-0.079
Token Ch
Token ID609
Feature activation+3.23e+00

Pos logit contributions
heads+0.021
resses+0.021
 banks+0.021
mary+0.020
uggets+0.020
Neg logit contributions
 course-0.017
 forgiveness-0.013
 understands-0.013
 narrative-0.012
 forgive-0.012
Tokenoo
Token ID2238
Feature activation+5.12e+00

Pos logit contributions
resses+0.107
heads+0.105
uggets+0.104
mary+0.103
 banks+0.103
Neg logit contributions
 course-0.136
 forgiveness-0.105
 motorcycle-0.105
 narrative-0.104
 understands-0.102
Token-
Token ID12
Feature activation+1.53e+00

Pos logit contributions
uggets+0.256
resses+0.253
heads+0.246
angers+0.242
mary+0.240
Neg logit contributions
 course-0.137
 understands-0.092
 thanks-0.089
 motorcycle-0.088
 forgiveness-0.086
Tokench
Token ID354
Feature activation+7.63e+00

Pos logit contributions
 banks+0.054
uggets+0.053
uring+0.052
resses+0.052
 thirsty+0.051
Neg logit contributions
 course-0.061
 motorcycle-0.048
orn-0.045
 understands-0.045
in-0.045
Tokenoo
Token ID2238
Feature activation+5.09e+00

Pos logit contributions
resses+0.299
ashes+0.289
heads+0.287
angers+0.285
mary+0.284
Neg logit contributions
 course-0.276
 forgiveness-0.212
 motorcycle-0.208
 narrative-0.205
 understands-0.205
Token!
Token ID0
Feature activation-3.90e-01

Pos logit contributions
uggets+0.260
resses+0.254
heads+0.247
angers+0.244
mary+0.241
Neg logit contributions
 course-0.131
 understands-0.087
 thanks-0.085
 motorcycle-0.083
 forgiveness-0.080
Token
Token ID198
Feature activation-9.10e-02

Pos logit contributions
 course+0.009
 understands+0.006
 forgiveness+0.006
 thanks+0.006
 forgive+0.006
Neg logit contributions
heads-0.020
resses-0.019
 banks-0.019
nes-0.019
uring-0.019
Token
Token ID198
Feature activation-9.57e-02

Pos logit contributions
 course+0.002
 understands+0.002
 forgiveness+0.002
 motorcycle+0.002
orn+0.002
Neg logit contributions
resses-0.004
ashes-0.004
heads-0.004
angers-0.004
uggets-0.004
TokenThe
Token ID464
Feature activation-7.85e-01

Pos logit contributions
 course+0.003
 narrative+0.002
 motorcycle+0.002
 forgive+0.002
 understands+0.002
Neg logit contributions
uggets-0.005
angers-0.004
ashes-0.004
resses-0.004
mary-0.004
Tokenez
Token ID8471
Feature activation+2.64e+00

Pos logit contributions
resses+0.158
ashes+0.151
angers+0.150
heads+0.150
 banks+0.150
Neg logit contributions
 course-0.167
 forgiveness-0.130
 understands-0.127
orn-0.127
 motorcycle-0.127
Tokenes
Token ID274
Feature activation+1.21e+00

Pos logit contributions
resses+0.121
 banks+0.120
uggets+0.120
heads+0.119
mary+0.118
Neg logit contributions
 course-0.074
 understands-0.054
orn-0.053
 forgiveness-0.050
 motorcycle-0.050
Token.
Token ID13
Feature activation+6.79e-01

Pos logit contributions
resses+0.054
ashes+0.052
heads+0.052
 banks+0.051
mary+0.051
Neg logit contributions
 course-0.037
 forgiveness-0.028
 motorcycle-0.027
 narrative-0.027
 understands-0.026
Token "
Token ID366
Feature activation+1.28e+00

Pos logit contributions
 banks+0.034
heads+0.034
resses+0.034
nes+0.034
mary+0.033
Neg logit contributions
 course-0.015
 understands-0.011
 magnet-0.011
 forgiveness-0.010
 thanks-0.010
TokenA
Token ID32
Feature activation+4.44e-01

Pos logit contributions
angers+0.065
resses+0.063
uring+0.062
uggets+0.062
 banks+0.061
Neg logit contributions
 course-0.034
orn-0.024
 narrative-0.023
 motorcycle-0.022
 understands-0.021
Tokench
Token ID354
Feature activation+7.63e+00

Pos logit contributions
resses+0.012
ashes+0.012
heads+0.012
angers+0.011
 banks+0.011
Neg logit contributions
 course-0.021
 forgiveness-0.018
 motorcycle-0.017
 understands-0.017
 narrative-0.017
Tokenoo
Token ID2238
Feature activation+5.02e+00

Pos logit contributions
resses+0.237
ashes+0.223
angers+0.223
heads+0.222
 banks+0.221
Neg logit contributions
 course-0.344
 forgiveness-0.275
 motorcycle-0.272
orn-0.271
 narrative-0.267
Token!"
Token ID2474
Feature activation+4.22e+00

Pos logit contributions
resses+0.208
angers+0.200
uggets+0.199
ashes+0.197
 banks+0.197
Neg logit contributions
 course-0.179
 motorcycle-0.131
orn-0.130
 forgiveness-0.129
 understands-0.126
Token
Token ID198
Feature activation+1.19e-01

Pos logit contributions
uggets+0.251
nes+0.251
heads+0.248
resses+0.245
angers+0.244
Neg logit contributions
 course-0.065
 thanks-0.049
 thank-0.047
 understands-0.046
 he-0.044
Token
Token ID198
Feature activation+9.91e-02

Pos logit contributions
resses+0.006
ashes+0.005
 banks+0.005
heads+0.005
angers+0.005
Neg logit contributions
 course-0.003
 forgiveness-0.002
 understands-0.002
 motorcycle-0.002
orn-0.002
Token"
Token ID1
Feature activation+2.25e+00

Pos logit contributions
uggets+0.005
angers+0.005
resses+0.005
 banks+0.005
ashes+0.005
Neg logit contributions
 course-0.003
 understands-0.002
 narrative-0.002
orn-0.002
 motorcycle-0.002
Token.
Token ID13
Feature activation+5.33e-01

Pos logit contributions
resses+0.049
ashes+0.047
 banks+0.047
angers+0.047
uggets+0.047
Neg logit contributions
 course-0.031
 forgiveness-0.022
 motorcycle-0.022
 thanks-0.021
 understands-0.021
Token "
Token ID366
Feature activation+1.16e+00

Pos logit contributions
heads+0.024
 banks+0.024
resses+0.023
mary+0.023
ashes+0.023
Neg logit contributions
 course-0.016
 understands-0.012
 forgiveness-0.012
 thanks-0.012
 thank-0.012
TokenCh
Token ID1925
Feature activation+2.97e+00

Pos logit contributions
angers+0.054
resses+0.053
uggets+0.053
 banks+0.052
mary+0.052
Neg logit contributions
 course-0.035
 narrative-0.025
 motorcycle-0.024
orn-0.024
 forgiveness-0.023
Tokenoo
Token ID2238
Feature activation+5.03e+00

Pos logit contributions
resses+0.089
ashes+0.086
angers+0.084
heads+0.084
 banks+0.083
Neg logit contributions
 course-0.134
 forgiveness-0.111
 motorcycle-0.110
 understands-0.108
 narrative-0.108
Token-
Token ID12
Feature activation+1.53e+00

Pos logit contributions
resses+0.219
uggets+0.216
angers+0.212
ashes+0.209
heads+0.208
Neg logit contributions
 course-0.170
 motorcycle-0.121
orn-0.116
 understands-0.115
 thanks-0.115
Tokench
Token ID354
Feature activation+7.63e+00

Pos logit contributions
 banks+0.056
resses+0.055
uggets+0.054
angers+0.054
uring+0.054
Neg logit contributions
 course-0.061
 motorcycle-0.047
orn-0.045
 understands-0.044
 forgiveness-0.043
Tokenoo
Token ID2238
Feature activation+5.01e+00

Pos logit contributions
resses+0.258
ashes+0.244
mary+0.244
uggets+0.244
angers+0.243
Neg logit contributions
 course-0.324
 forgiveness-0.252
 motorcycle-0.251
orn-0.248
 understands-0.245
Token!"
Token ID2474
Feature activation+4.11e+00

Pos logit contributions
uggets+0.228
resses+0.225
angers+0.222
 banks+0.214
ashes+0.213
Neg logit contributions
 course-0.162
 motorcycle-0.114
 thank-0.110
 thanks-0.109
orn-0.106
Token Tim
Token ID5045
Feature activation-7.52e-01

Pos logit contributions
uggets+0.232
ashes+0.228
heads+0.224
ksh+0.222
nes+0.222
Neg logit contributions
 course-0.082
 thank-0.059
 he-0.057
 his-0.056
 thanks-0.054
Token and
Token ID290
Feature activation+5.77e-01

Pos logit contributions
 course+0.019
 understands+0.013
 thanks+0.013
 motorcycle+0.013
orn+0.011
Neg logit contributions
uggets-0.038
resses-0.038
angers-0.038
mary-0.037
 banks-0.037
Token Sue
Token ID26089
Feature activation+6.40e-01

Pos logit contributions
resses+0.023
uggets+0.022
heads+0.022
angers+0.022
 banks+0.022
Neg logit contributions
 course-0.021
 forgiveness-0.016
 motorcycle-0.016
 understands-0.016
 magnet-0.015
Token it
Token ID340
Feature activation+7.58e-01

Pos logit contributions
resses+0.186
ashes+0.180
 banks+0.179
allows+0.177
heads+0.174
Neg logit contributions
 course-0.175
 narrative-0.144
orn-0.141
 understands-0.137
 forgiveness-0.136
Token their
Token ID511
Feature activation+2.49e+00

Pos logit contributions
resses+0.033
angers+0.032
heads+0.032
ashes+0.031
uggets+0.031
Neg logit contributions
 course-0.024
 forgiveness-0.018
 motorcycle-0.018
 understands-0.018
 magnet-0.018
Token "
Token ID366
Feature activation+1.09e+00

Pos logit contributions
resses+0.100
ashes+0.098
 banks+0.096
uggets+0.092
angers+0.091
Neg logit contributions
 course-0.093
orn-0.069
 narrative-0.069
 forgiveness-0.068
father-0.065
Tokenmonster
Token ID39050
Feature activation+5.68e-01

Pos logit contributions
resses+0.037
angers+0.036
ashes+0.036
 banks+0.036
mary+0.036
Neg logit contributions
 course-0.045
 motorcycle-0.035
orn-0.035
 forgiveness-0.035
 magnet-0.034
Token-
Token ID12
Feature activation+1.48e+00

Pos logit contributions
resses+0.022
ashes+0.022
heads+0.021
mary+0.021
 banks+0.021
Neg logit contributions
 course-0.020
 forgiveness-0.016
 narrative-0.016
 motorcycle-0.015
 understands-0.015
Tokench
Token ID354
Feature activation+7.62e+00

Pos logit contributions
resses+0.057
ashes+0.056
heads+0.055
angers+0.055
 banks+0.055
Neg logit contributions
 course-0.054
 forgiveness-0.042
 motorcycle-0.041
orn-0.040
 understands-0.040
Tokenaser
Token ID6005
Feature activation-1.67e+00

Pos logit contributions
resses+0.250
ashes+0.239
heads+0.238
angers+0.236
mary+0.235
Neg logit contributions
 course-0.325
 forgiveness-0.261
 motorcycle-0.257
orn-0.253
 understands-0.252
Token."
Token ID526
Feature activation+3.72e+00

Pos logit contributions
 course+0.056
 forgiveness+0.042
 motorcycle+0.041
 understands+0.040
 narrative+0.040
Neg logit contributions
resses-0.069
ashes-0.067
heads-0.067
angers-0.067
uggets-0.066
Token
Token ID198
Feature activation-3.48e-01

Pos logit contributions
heads+0.165
resses+0.164
angers+0.161
nes+0.159
uggets+0.157
Neg logit contributions
 course-0.119
 understands-0.082
 forgiveness-0.081
 thanks-0.081
 thank-0.077
Token
Token ID198
Feature activation-3.63e-01

Pos logit contributions
 course+0.010
 forgiveness+0.006
 understands+0.006
orn+0.006
 motorcycle+0.006
Neg logit contributions
resses-0.017
angers-0.016
ashes-0.016
heads-0.016
 banks-0.016
TokenThat
Token ID2504
Feature activation-3.41e+00

Pos logit contributions
 course+0.013
 narrative+0.009
 motorcycle+0.009
 forgiveness+0.009
orn+0.009
Neg logit contributions
angers-0.015
resses-0.015
uggets-0.015
ashes-0.015
heads-0.014
Token they
Token ID484
Feature activation+2.22e+00

Pos logit contributions
heads+0.004
resses+0.004
 banks+0.004
angers+0.003
ashes+0.003
Neg logit contributions
 course-0.002
 thanks-0.002
 forgiveness-0.002
 motorcycle-0.001
 understands-0.001
Token play
Token ID711
Feature activation+6.60e-02

Pos logit contributions
resses+0.078
ashes+0.077
 banks+0.075
heads+0.074
mary+0.074
Neg logit contributions
 course-0.088
 forgiveness-0.071
 motorcycle-0.069
orn-0.068
 understands-0.068
Token hop
Token ID1725
Feature activation+7.42e-01

Pos logit contributions
resses+0.003
heads+0.003
mary+0.003
angers+0.003
uggets+0.003
Neg logit contributions
 course-0.002
 forgiveness-0.002
 motorcycle-0.002
 magnet-0.002
 narrative-0.002
Tokensc
Token ID1416
Feature activation+3.56e+00

Pos logit contributions
uggets+0.031
heads+0.031
angers+0.030
resses+0.030
allows+0.030
Neg logit contributions
 course-0.026
 narrative-0.018
 thanks-0.018
 motorcycle-0.018
orn-0.018
Tokenot
Token ID313
Feature activation+1.77e+00

Pos logit contributions
mary+0.112
angers+0.111
resses+0.110
 banks+0.108
heads+0.108
Neg logit contributions
 course-0.147
 forgiveness-0.117
 motorcycle-0.115
 magnet-0.113
 forgive-0.113
Tokench
Token ID354
Feature activation+7.62e+00

Pos logit contributions
uggets+0.065
heads+0.062
resses+0.062
angers+0.060
 banks+0.060
Neg logit contributions
 course-0.074
 motorcycle-0.054
 thanks-0.054
orn-0.053
 hero-0.052
Token.
Token ID13
Feature activation+8.36e-02

Pos logit contributions
mary+0.368
uggets+0.367
resses+0.364
heads+0.363
ender+0.361
Neg logit contributions
 course-0.221
orn-0.129
mate-0.124
 forgiveness-0.123
 motorcycle-0.123
Token They
Token ID1119
Feature activation+2.14e+00

Pos logit contributions
heads+0.004
nes+0.003
angers+0.003
ender+0.003
resses+0.003
Neg logit contributions
 course-0.003
 thanks-0.002
 thank-0.002
 motorcycle-0.002
 understands-0.002
Token draw
Token ID3197
Feature activation+2.86e-01

Pos logit contributions
resses+0.082
heads+0.080
angers+0.079
ashes+0.079
uggets+0.079
Neg logit contributions
 course-0.080
 forgiveness-0.061
 motorcycle-0.060
 narrative-0.060
 understands-0.059
Token squares
Token ID24438
Feature activation+4.04e+00

Pos logit contributions
uggets+0.012
heads+0.012
resses+0.012
nes+0.012
ashes+0.012
Neg logit contributions
 course-0.010
 thanks-0.007
 motorcycle-0.007
 magnet-0.007
 narrative-0.007
Token on
Token ID319
Feature activation-1.57e+00

Pos logit contributions
resses+0.210
heads+0.208
uggets+0.208
angers+0.204
 banks+0.202
Neg logit contributions
 course-0.096
 thanks-0.055
 forgiveness-0.055
 magnet-0.054
 narrative-0.054
Token Tim
Token ID5045
Feature activation-6.68e-01

Pos logit contributions
resses+0.001
heads+0.001
angers+0.001
ashes+0.001
nes+0.001
Neg logit contributions
 course-0.001
 forgiveness-0.000
 thanks-0.000
 thank-0.000
 understands-0.000
Token took
Token ID1718
Feature activation-9.55e-01

Pos logit contributions
 course+0.020
 forgiveness+0.015
 narrative+0.014
 motorcycle+0.014
 understands+0.013
Neg logit contributions
resses-0.030
ashes-0.030
heads-0.030
mary-0.029
 banks-0.029
Token the
Token ID262
Feature activation-3.31e-01

Pos logit contributions
 course+0.031
 forgiveness+0.023
 motorcycle+0.023
 narrative+0.023
 understands+0.022
Neg logit contributions
resses-0.041
ashes-0.040
heads-0.039
 banks-0.039
angers-0.039
Token monster
Token ID9234
Feature activation-1.70e+00

Pos logit contributions
 course+0.010
 narrative+0.008
ously+0.008
 understands+0.007
smart+0.007
Neg logit contributions
resses-0.013
ashes-0.013
 thirsty-0.013
heads-0.013
 banks-0.013
Token-
Token ID12
Feature activation+1.50e+00

Pos logit contributions
 course+0.055
 motorcycle+0.041
 forgiveness+0.041
 magnet+0.039
orn+0.039
Neg logit contributions
resses-0.073
angers-0.071
uggets-0.070
ashes-0.070
heads-0.070
Tokench
Token ID354
Feature activation+7.62e+00

Pos logit contributions
resses+0.058
ashes+0.056
angers+0.056
heads+0.056
 banks+0.055
Neg logit contributions
 course-0.056
 forgiveness-0.043
 motorcycle-0.042
orn-0.042
 understands-0.041
Tokenaser
Token ID6005
Feature activation-1.64e+00

Pos logit contributions
resses+0.355
mary+0.349
 banks+0.345
angers+0.342
heads+0.340
Neg logit contributions
 course-0.226
 forgiveness-0.154
orn-0.150
 motorcycle-0.150
 thanks-0.140
Token to
Token ID284
Feature activation+7.23e-01

Pos logit contributions
 course+0.049
 forgiveness+0.035
 motorcycle+0.034
 narrative+0.034
 understands+0.033
Neg logit contributions
resses-0.075
ashes-0.073
heads-0.072
angers-0.072
mary-0.072
Token bed
Token ID3996
Feature activation+3.39e+00

Pos logit contributions
resses+0.029
ashes+0.029
heads+0.028
angers+0.028
 banks+0.028
Neg logit contributions
 course-0.025
 forgiveness-0.019
 motorcycle-0.019
 narrative-0.019
 understands-0.018
Token with
Token ID351
Feature activation+8.96e-01

Pos logit contributions
resses+0.157
ashes+0.156
heads+0.151
uggets+0.151
 banks+0.151
Neg logit contributions
 course-0.096
 forgiveness-0.070
 motorcycle-0.069
 thanks-0.064
 narrative-0.064
Token him
Token ID683
Feature activation-8.27e-02

Pos logit contributions
resses+0.042
ashes+0.041
 banks+0.041
uggets+0.041
allows+0.041
Neg logit contributions
 course-0.024
 narrative-0.018
 motorcycle-0.017
 forgiveness-0.017
orn-0.016
Token day
Token ID1110
Feature activation-4.33e-03

Pos logit contributions
 course+0.023
orn+0.019
 motorcycle+0.019
 understands+0.019
 hero+0.018
Neg logit contributions
resses-0.019
uring-0.018
 thirsty-0.018
 banks-0.018
mary-0.017
Token that
Token ID326
Feature activation-5.91e-01

Pos logit contributions
 course+0.000
 motorcycle+0.000
orn+0.000
 magnet+0.000
 bicycle+0.000
Neg logit contributions
resses-0.000
uring-0.000
angers-0.000
mary-0.000
nes-0.000
Token hard
Token ID1327
Feature activation-3.63e+00

Pos logit contributions
 course+0.023
 motorcycle+0.019
orn+0.019
ously+0.018
 understands+0.018
Neg logit contributions
resses-0.019
 thirsty-0.018
uring-0.018
 banks-0.018
mary-0.018
Token work
Token ID670
Feature activation+3.38e-02

Pos logit contributions
 course+0.126
 forgiveness+0.096
 motorcycle+0.095
 narrative+0.094
orn+0.093
Neg logit contributions
resses-0.146
ashes-0.142
angers-0.140
 banks-0.140
heads-0.139
Token and
Token ID290
Feature activation-4.11e-01

Pos logit contributions
resses+0.002
ashes+0.002
nes+0.002
mary+0.002
uring+0.002
Neg logit contributions
 course-0.001
 motorcycle-0.001
orn-0.001
father-0.001
 forgiveness-0.001
Token persever
Token ID45712
Feature activation-8.64e+00

Pos logit contributions
 course+0.014
 motorcycle+0.012
orn+0.012
 narrative+0.011
 magnet+0.011
Neg logit contributions
resses-0.016
ashes-0.015
uring-0.015
 banks-0.015
mary-0.015
Tokenance
Token ID590
Feature activation-3.12e+00

Pos logit contributions
 course+0.362
 motorcycle+0.347
 bicycle+0.322
 spear+0.314
 magnet+0.314
Neg logit contributions
resses-0.258
angers-0.252
uring-0.234
nes-0.233
ashes-0.221
Token pay
Token ID1414
Feature activation-8.43e-01

Pos logit contributions
 course+0.073
 motorcycle+0.056
 forgiveness+0.052
orn+0.052
 narrative+0.052
Neg logit contributions
resses-0.153
ashes-0.152
 banks-0.149
mary-0.148
angers-0.146
Token off
Token ID572
Feature activation-1.52e+00

Pos logit contributions
 course+0.030
 motorcycle+0.024
orn+0.024
 narrative+0.023
 magnet+0.023
Neg logit contributions
resses-0.033
ashes-0.031
heads-0.031
mary-0.031
 banks-0.031
Token in
Token ID287
Feature activation-3.74e-01

Pos logit contributions
 course+0.046
 forgiveness+0.034
 motorcycle+0.033
 understands+0.032
 narrative+0.032
Neg logit contributions
resses-0.067
ashes-0.066
heads-0.065
angers-0.065
 banks-0.065
Token the
Token ID262
Feature activation-7.84e-01

Pos logit contributions
 course+0.013
 motorcycle+0.011
orn+0.011
 understands+0.011
 narrative+0.011
Neg logit contributions
resses-0.014
uggets-0.013
heads-0.013
 banks-0.013
mary-0.013
Token do
Token ID466
Feature activation-2.19e-02

Pos logit contributions
resses+0.040
ashes+0.039
heads+0.038
angers+0.038
 banks+0.038
Neg logit contributions
 course-0.036
 forgiveness-0.028
 motorcycle-0.027
 narrative-0.027
 understands-0.027
Token it
Token ID340
Feature activation+5.75e-01

Pos logit contributions
 course+0.001
 forgiveness+0.001
 understands+0.001
 motorcycle+0.001
 narrative+0.001
Neg logit contributions
resses-0.001
ashes-0.001
heads-0.001
angers-0.001
uggets-0.001
Token with
Token ID351
Feature activation+6.83e-01

Pos logit contributions
resses+0.026
ashes+0.025
heads+0.025
angers+0.025
 banks+0.025
Neg logit contributions
 course-0.017
 forgiveness-0.013
 motorcycle-0.012
 narrative-0.012
 understands-0.012
Token practice
Token ID3357
Feature activation-2.97e-01

Pos logit contributions
resses+0.026
ashes+0.025
heads+0.024
 banks+0.024
uggets+0.024
Neg logit contributions
 course-0.025
 forgiveness-0.020
 motorcycle-0.020
 narrative-0.020
 understands-0.019
Token and
Token ID290
Feature activation-3.61e-01

Pos logit contributions
 course+0.010
 forgiveness+0.007
 motorcycle+0.007
 narrative+0.007
 understands+0.007
Neg logit contributions
resses-0.013
ashes-0.012
heads-0.012
angers-0.012
 banks-0.012
Token persever
Token ID45712
Feature activation-8.60e+00

Pos logit contributions
 course+0.013
 forgiveness+0.010
 motorcycle+0.010
orn+0.010
 narrative+0.010
Neg logit contributions
resses-0.014
ashes-0.014
heads-0.013
 banks-0.013
mary-0.013
Tokenance
Token ID590
Feature activation-3.19e+00

Pos logit contributions
 course+0.337
 motorcycle+0.295
 bicycle+0.279
 magnet+0.278
 policeman+0.277
Neg logit contributions
resses-0.308
nes-0.282
angers-0.280
uring-0.276
ashes-0.266
Token.
Token ID13
Feature activation-7.02e-01

Pos logit contributions
 course+0.105
 forgiveness+0.081
 motorcycle+0.080
 narrative+0.078
 understands+0.078
Neg logit contributions
resses-0.133
ashes-0.129
heads-0.127
 banks-0.126
mary-0.126
Token<|endoftext|>
Token ID50256
Feature activation+1.73e+00

Pos logit contributions
 course+0.023
 motorcycle+0.018
 forgiveness+0.018
 narrative+0.017
 understands+0.017
Neg logit contributions
resses-0.029
ashes-0.028
heads-0.028
uggets-0.028
angers-0.028
TokenOnce
Token ID7454
Feature activation+1.07e+00

Pos logit contributions
resses+0.058
ashes+0.056
mary+0.056
angers+0.056
uggets+0.055
Neg logit contributions
 course-0.067
 forgiveness-0.057
 motorcycle-0.056
 narrative-0.054
father-0.053
Token upon
Token ID2402
Feature activation+4.48e+00

Pos logit contributions
angers+0.019
allows+0.019
ashes+0.018
uggets+0.018
resses+0.018
Neg logit contributions
 course-0.059
 forgiveness-0.050
 motorcycle-0.050
 narrative-0.049
 beast-0.048
Token they
Token ID484
Feature activation+2.18e+00

Pos logit contributions
 course+0.008
orn+0.006
 motorcycle+0.006
father+0.006
ously+0.006
Neg logit contributions
resses-0.006
uggets-0.006
uring-0.005
 thirsty-0.005
ashes-0.005
Token also
Token ID635
Feature activation+2.46e+00

Pos logit contributions
resses+0.088
 banks+0.088
ashes+0.088
mary+0.085
nes+0.085
Neg logit contributions
 course-0.068
 motorcycle-0.055
 forgiveness-0.051
orn-0.051
 policy-0.050
Token require
Token ID2421
Feature activation+1.59e+00

Pos logit contributions
ashes+0.098
resses+0.094
 banks+0.092
mary+0.092
 thirsty+0.090
Neg logit contributions
 course-0.081
 motorcycle-0.067
orn-0.064
 bicycle-0.062
father-0.060
Token effort
Token ID3626
Feature activation-2.49e+00

Pos logit contributions
resses+0.065
ashes+0.064
heads+0.063
uggets+0.062
 banks+0.062
Neg logit contributions
 course-0.054
 motorcycle-0.041
 forgiveness-0.040
orn-0.040
 narrative-0.040
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
 course+0.089
orn+0.073
 narrative+0.068
 motorcycle+0.068
 magnet+0.066
Neg logit contributions
resses-0.099
ashes-0.095
mary-0.094
angers-0.092
uggets-0.091
Token persever
Token ID45712
Feature activation-8.49e+00

Pos logit contributions
resses+0.011
ashes+0.011
heads+0.010
 banks+0.010
uring+0.010
Neg logit contributions
 course-0.011
orn-0.009
 motorcycle-0.009
father-0.008
 magnet-0.008
Tokenance
Token ID590
Feature activation-3.05e+00

Pos logit contributions
 course+0.339
 motorcycle+0.302
 bicycle+0.295
 magnet+0.282
 policeman+0.282
Neg logit contributions
resses-0.291
angers-0.276
nes-0.257
ashes-0.257
allows-0.251
Token.
Token ID13
Feature activation+3.18e-01

Pos logit contributions
 course+0.100
 motorcycle+0.079
orn+0.077
 narrative+0.077
 forgiveness+0.077
Neg logit contributions
resses-0.126
ashes-0.125
mary-0.121
 banks-0.120
angers-0.119
Token They
Token ID1119
Feature activation+2.33e+00

Pos logit contributions
resses+0.013
heads+0.013
ashes+0.013
 banks+0.013
angers+0.013
Neg logit contributions
 course-0.011
 forgiveness-0.008
 understands-0.008
 narrative-0.008
 motorcycle-0.008
Token learned
Token ID4499
Feature activation+3.95e-02

Pos logit contributions
 banks+0.087
mary+0.086
nes+0.086
ashes+0.084
resses+0.082
Neg logit contributions
 course-0.079
 motorcycle-0.068
 policy-0.067
orn-0.064
 forgiveness-0.064
Token that
Token ID326
Feature activation-3.45e-01

Pos logit contributions
resses+0.002
mary+0.001
ashes+0.001
uggets+0.001
 banks+0.001
Neg logit contributions
 course-0.001
orn-0.001
father-0.001
 narrative-0.001
 magnet-0.001
Token and
Token ID290
Feature activation-7.77e-01

Pos logit contributions
resses+0.027
ashes+0.027
uring+0.026
angers+0.025
mary+0.025
Neg logit contributions
 course-0.028
orn-0.023
 narrative-0.023
 motorcycle-0.021
father-0.021
Token judge
Token ID5052
Feature activation-2.69e+00

Pos logit contributions
 course+0.028
 forgiveness+0.021
 motorcycle+0.021
orn+0.021
 narrative+0.021
Neg logit contributions
resses-0.031
ashes-0.030
heads-0.029
 banks-0.029
mary-0.029
Token someone
Token ID2130
Feature activation-2.34e+00

Pos logit contributions
 course+0.096
 narrative+0.076
orn+0.075
 motorcycle+0.073
 forgiveness+0.072
Neg logit contributions
resses-0.105
ashes-0.101
 banks-0.100
mary-0.099
heads-0.098
Token's
Token ID338
Feature activation+1.06e+00

Pos logit contributions
 course+0.084
orn+0.071
 narrative+0.071
 motorcycle+0.067
father+0.066
Neg logit contributions
resses-0.085
ashes-0.083
mary-0.081
 banks-0.080
uring-0.080
Token worth
Token ID2861
Feature activation-2.60e+00

Pos logit contributions
resses+0.043
ashes+0.041
 banks+0.041
heads+0.041
angers+0.040
Neg logit contributions
 course-0.037
orn-0.028
 motorcycle-0.028
 narrative-0.027
 forgiveness-0.027
Token based
Token ID1912
Feature activation-8.03e+00

Pos logit contributions
 course+0.096
orn+0.083
 motorcycle+0.082
 bicycle+0.081
 magnet+0.078
Neg logit contributions
resses-0.089
uring-0.089
 banks-0.085
heads-0.085
uggets-0.084
Token on
Token ID319
Feature activation-2.03e+00

Pos logit contributions
 course+0.272
 forgiveness+0.220
 understands+0.219
 motorcycle+0.215
 narrative+0.211
Neg logit contributions
resses-0.298
ashes-0.294
uring-0.293
nes-0.293
heads-0.289
Token their
Token ID511
Feature activation+1.98e+00

Pos logit contributions
 course+0.070
orn+0.054
 narrative+0.054
 motorcycle+0.053
 forgiveness+0.052
Neg logit contributions
resses-0.081
 banks-0.077
ashes-0.077
uggets-0.077
heads-0.076
Token age
Token ID2479
Feature activation-1.32e+00

Pos logit contributions
resses+0.079
heads+0.076
ashes+0.076
uggets+0.076
angers+0.076
Neg logit contributions
 course-0.070
 forgiveness-0.055
 narrative-0.053
 motorcycle-0.053
 understands-0.052
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
 course+0.044
 narrative+0.034
orn+0.034
 forgiveness+0.033
 motorcycle+0.033
Neg logit contributions
resses-0.055
ashes-0.053
 banks-0.052
angers-0.052
heads-0.052
Token<|endoftext|>
Token ID50256
Feature activation+1.66e+00

Pos logit contributions
 course+0.046
 narrative+0.039
 motorcycle+0.038
 forgiveness+0.037
ously+0.036
Neg logit contributions
resses-0.036
uggets-0.035
allows-0.033
angers-0.032
mary-0.032
Token of
Token ID286
Feature activation+1.77e+00

Pos logit contributions
 course+0.044
 motorcycle+0.036
 forgiveness+0.036
 narrative+0.035
 understands+0.034
Neg logit contributions
resses-0.051
ashes-0.049
mary-0.049
angers-0.049
 banks-0.048
Token her
Token ID607
Feature activation+2.34e-01

Pos logit contributions
resses+0.069
ashes+0.068
heads+0.067
angers+0.067
 banks+0.066
Neg logit contributions
 course-0.064
 forgiveness-0.049
 motorcycle-0.048
 understands-0.047
orn-0.047
Token toys
Token ID14958
Feature activation-8.63e-01

Pos logit contributions
resses+0.008
ashes+0.007
angers+0.007
nes+0.007
heads+0.007
Neg logit contributions
 course-0.010
 understands-0.008
 motorcycle-0.008
 forgiveness-0.008
 magnet-0.008
Token into
Token ID656
Feature activation+2.86e+00

Pos logit contributions
 course+0.027
 forgiveness+0.020
 narrative+0.020
 motorcycle+0.020
 understands+0.020
Neg logit contributions
resses-0.038
ashes-0.037
heads-0.036
angers-0.036
 banks-0.036
Token groups
Token ID2628
Feature activation+3.46e+00

Pos logit contributions
ashes+0.114
resses+0.111
heads+0.110
angers+0.108
mary+0.108
Neg logit contributions
 course-0.104
 forgiveness-0.080
 motorcycle-0.076
 magnet-0.075
 understands-0.075
Token based
Token ID1912
Feature activation-8.02e+00

Pos logit contributions
resses+0.166
heads+0.163
ashes+0.163
uggets+0.162
angers+0.162
Neg logit contributions
 course-0.093
 forgiveness-0.062
 motorcycle-0.061
 understands-0.058
 magnet-0.058
Token on
Token ID319
Feature activation-2.18e+00

Pos logit contributions
 course+0.247
 forgiveness+0.208
 motorcycle+0.195
 understands+0.195
 narrative+0.192
Neg logit contributions
resses-0.337
nes-0.330
ashes-0.326
angers-0.326
mary-0.323
Token what
Token ID644
Feature activation-3.16e+00

Pos logit contributions
 course+0.072
 forgiveness+0.054
 motorcycle+0.051
 understands+0.050
 narrative+0.050
Neg logit contributions
resses-0.092
ashes-0.092
heads-0.091
angers-0.089
uggets-0.089
Token she
Token ID673
Feature activation+5.81e-01

Pos logit contributions
 course+0.118
 forgiveness+0.092
 motorcycle+0.090
 narrative+0.089
orn+0.089
Neg logit contributions
resses-0.120
ashes-0.116
heads-0.114
angers-0.114
mary-0.114
Token wanted
Token ID2227
Feature activation-7.93e-01

Pos logit contributions
resses+0.024
ashes+0.024
mary+0.023
 banks+0.023
nes+0.023
Neg logit contributions
 course-0.019
 forgiveness-0.015
 motorcycle-0.014
 narrative-0.014
orn-0.014
Token to
Token ID284
Feature activation+2.31e-01

Pos logit contributions
 course+0.026
 narrative+0.020
 motorcycle+0.020
orn+0.020
 forgiveness+0.020
Neg logit contributions
resses-0.033
ashes-0.031
angers-0.031
mary-0.031
uggets-0.031
Token day
Token ID1110
Feature activation-6.52e-03

Pos logit contributions
 course+0.025
 forgiveness+0.019
 motorcycle+0.019
 narrative+0.018
orn+0.018
Neg logit contributions
resses-0.031
mary-0.031
ashes-0.031
 banks-0.030
allows-0.030
Token -
Token ID532
Feature activation+1.36e+00

Pos logit contributions
 course+0.000
 forgiveness+0.000
 motorcycle+0.000
 narrative+0.000
 understands+0.000
Neg logit contributions
resses-0.000
ashes-0.000
heads-0.000
angers-0.000
 banks-0.000
Token one
Token ID530
Feature activation-3.57e-01

Pos logit contributions
resses+0.052
ashes+0.050
mary+0.050
 banks+0.050
uggets+0.049
Neg logit contributions
 course-0.048
 narrative-0.038
 forgiveness-0.038
orn-0.038
 motorcycle-0.037
Token that
Token ID326
Feature activation-7.86e-01

Pos logit contributions
 course+0.012
 narrative+0.011
 forgiveness+0.010
 motorcycle+0.010
orn+0.010
Neg logit contributions
ashes-0.013
mary-0.013
resses-0.013
allows-0.012
heads-0.012
Token was
Token ID373
Feature activation-1.59e+00

Pos logit contributions
 course+0.024
 forgiveness+0.018
 motorcycle+0.018
 narrative+0.017
orn+0.017
Neg logit contributions
resses-0.034
ashes-0.033
mary-0.033
 banks-0.033
heads-0.033
Token based
Token ID1912
Feature activation-7.98e+00

Pos logit contributions
 course+0.058
 forgiveness+0.045
 motorcycle+0.043
 understands+0.043
 narrative+0.043
Neg logit contributions
resses-0.062
ashes-0.061
heads-0.060
angers-0.060
 banks-0.059
Token on
Token ID319
Feature activation-2.05e+00

Pos logit contributions
 course+0.254
 forgiveness+0.220
 narrative+0.207
 understands+0.203
 motorcycle+0.199
Neg logit contributions
resses-0.314
nes-0.312
ashes-0.310
heads-0.307
mary-0.302
Token trust
Token ID3774
Feature activation-3.18e+00

Pos logit contributions
 course+0.071
 forgiveness+0.054
 motorcycle+0.053
 understands+0.052
 narrative+0.052
Neg logit contributions
resses-0.084
ashes-0.081
heads-0.081
angers-0.080
uggets-0.080
Token.
Token ID13
Feature activation-9.20e-01

Pos logit contributions
 course+0.100
 forgiveness+0.074
 motorcycle+0.073
 narrative+0.072
 understands+0.071
Neg logit contributions
resses-0.138
ashes-0.134
heads-0.133
angers-0.132
mary-0.132
Token They
Token ID1119
Feature activation+1.44e+00

Pos logit contributions
 course+0.033
 forgiveness+0.025
 motorcycle+0.025
 understands+0.025
 narrative+0.025
Neg logit contributions
resses-0.036
ashes-0.035
heads-0.035
angers-0.035
 banks-0.034
Token went
Token ID1816
Feature activation-2.09e+00

Pos logit contributions
mary+0.055
 banks+0.055
nes+0.054
resses+0.054
allows+0.052
Neg logit contributions
 course-0.046
 forgiveness-0.040
 policy-0.039
 motorcycle-0.039
 narrative-0.038
Token:
Token ID25
Feature activation+8.02e-01

Pos logit contributions
 course+0.021
orn+0.017
 motorcycle+0.017
 magnet+0.016
 narrative+0.016
Neg logit contributions
resses-0.025
uring-0.023
ashes-0.023
angers-0.023
mary-0.023
Token don
Token ID836
Feature activation-1.26e+00

Pos logit contributions
resses+0.033
ashes+0.032
heads+0.032
angers+0.032
 banks+0.032
Neg logit contributions
 course-0.027
 forgiveness-0.021
 motorcycle-0.020
 narrative-0.020
 understands-0.020
Token't
Token ID470
Feature activation+6.31e-01

Pos logit contributions
 course+0.054
 forgiveness+0.043
 motorcycle+0.043
 narrative+0.042
 understands+0.042
Neg logit contributions
resses-0.041
ashes-0.040
heads-0.040
uggets-0.040
angers-0.039
Token judge
Token ID5052
Feature activation-2.66e+00

Pos logit contributions
resses+0.027
angers+0.026
ashes+0.026
heads+0.026
uggets+0.025
Neg logit contributions
 course-0.021
 forgiveness-0.016
 motorcycle-0.015
 narrative-0.015
 magnet-0.015
Token someone
Token ID2130
Feature activation-2.24e+00

Pos logit contributions
 course+0.093
 narrative+0.072
 motorcycle+0.071
 forgiveness+0.071
orn+0.071
Neg logit contributions
resses-0.105
ashes-0.102
mary-0.100
heads-0.100
 banks-0.100
Token based
Token ID1912
Feature activation-7.98e+00

Pos logit contributions
 course+0.075
 narrative+0.064
orn+0.062
 motorcycle+0.061
 magnet+0.059
Neg logit contributions
resses-0.087
ashes-0.085
mary-0.084
uring-0.081
 banks-0.081
Token on
Token ID319
Feature activation-1.91e+00

Pos logit contributions
 course+0.261
 motorcycle+0.214
 forgiveness+0.212
 magnet+0.205
 understands+0.204
Neg logit contributions
ashes-0.299
resses-0.299
heads-0.296
nes-0.295
uggets-0.293
Token how
Token ID703
Feature activation-3.53e+00

Pos logit contributions
 course+0.068
 forgiveness+0.052
 motorcycle+0.051
 narrative+0.051
 understands+0.050
Neg logit contributions
resses-0.075
ashes-0.073
heads-0.072
 banks-0.072
angers-0.072
Token they
Token ID484
Feature activation+1.77e+00

Pos logit contributions
 course+0.123
 forgiveness+0.096
 understands+0.093
 motorcycle+0.092
 narrative+0.091
Neg logit contributions
resses-0.141
heads-0.138
ashes-0.138
angers-0.137
uggets-0.136
Token admire
Token ID21099
Feature activation-2.18e+00

Pos logit contributions
resses+0.071
ashes+0.068
heads+0.068
angers+0.068
uggets+0.067
Neg logit contributions
 course-0.063
 forgiveness-0.048
 motorcycle-0.047
 narrative-0.047
 understands-0.047
Token the
Token ID262
Feature activation-8.19e-01

Pos logit contributions
 course+0.089
orn+0.072
 motorcycle+0.072
 narrative+0.071
 forgiveness+0.071
Neg logit contributions
resses-0.072
ashes-0.069
 banks-0.068
mary-0.068
uggets-0.068
Token not
Token ID407
Feature activation-7.09e-01

Pos logit contributions
resses+0.058
ashes+0.058
angers+0.057
heads+0.055
 banks+0.054
Neg logit contributions
 course-0.045
 forgiveness-0.037
 thanks-0.035
 narrative-0.034
 forgive-0.033
Token nice
Token ID3621
Feature activation-6.50e-01

Pos logit contributions
 course+0.022
 forgiveness+0.018
 thanks+0.018
 motorcycle+0.017
 magnet+0.016
Neg logit contributions
angers-0.031
resses-0.031
ashes-0.030
 banks-0.030
heads-0.029
Token to
Token ID284
Feature activation+3.51e-01

Pos logit contributions
 course+0.024
 motorcycle+0.018
 narrative+0.018
 forgiveness+0.018
orn+0.018
Neg logit contributions
resses-0.025
ashes-0.025
mary-0.024
heads-0.024
uggets-0.024
Token judge
Token ID5052
Feature activation-2.36e+00

Pos logit contributions
resses+0.015
angers+0.015
ashes+0.015
uggets+0.015
heads+0.014
Neg logit contributions
 course-0.012
 forgiveness-0.009
 forgive-0.009
 thank-0.008
 return-0.008
Token people
Token ID661
Feature activation-7.94e-01

Pos logit contributions
 course+0.081
 forgiveness+0.062
 motorcycle+0.059
 understands+0.059
 narrative+0.057
Neg logit contributions
resses-0.098
ashes-0.095
heads-0.094
angers-0.094
 banks-0.093
Token based
Token ID1912
Feature activation-7.93e+00

Pos logit contributions
 course+0.025
 forgiveness+0.019
 motorcycle+0.018
 narrative+0.018
orn+0.018
Neg logit contributions
resses-0.034
ashes-0.033
heads-0.033
mary-0.033
 banks-0.033
Token on
Token ID319
Feature activation-2.11e+00

Pos logit contributions
 course+0.227
 forgiveness+0.162
 motorcycle+0.158
 narrative+0.155
 understands+0.154
Neg logit contributions
resses-0.370
ashes-0.361
heads-0.359
angers-0.356
 banks-0.355
Token their
Token ID511
Feature activation+2.12e+00

Pos logit contributions
 course+0.071
 forgiveness+0.054
 motorcycle+0.051
 understands+0.050
 narrative+0.050
Neg logit contributions
resses-0.088
ashes-0.087
heads-0.087
angers-0.086
uggets-0.085
Token smell
Token ID8508
Feature activation+4.57e+00

Pos logit contributions
resses+0.079
angers+0.077
heads+0.076
ashes+0.076
uggets+0.075
Neg logit contributions
 course-0.081
 forgiveness-0.064
 motorcycle-0.063
 narrative-0.062
 understands-0.061
Token.
Token ID13
Feature activation-8.15e-01

Pos logit contributions
resses+0.185
ashes+0.183
mary+0.181
 banks+0.179
heads+0.178
Neg logit contributions
 course-0.149
 motorcycle-0.115
 forgiveness-0.114
 narrative-0.114
orn-0.113
Token The
Token ID383
Feature activation-5.37e-01

Pos logit contributions
 course+0.024
 forgiveness+0.017
 understands+0.017
 thanks+0.017
 forgive+0.016
Neg logit contributions
ashes-0.038
heads-0.037
 banks-0.036
resses-0.036
allows-0.036
Token when
Token ID618
Feature activation+1.15e+00

Pos logit contributions
 course+0.041
 forgiveness+0.029
 motorcycle+0.028
 understands+0.027
 narrative+0.026
Neg logit contributions
resses-0.076
ashes-0.074
heads-0.074
angers-0.073
uggets-0.073
Token they
Token ID484
Feature activation+1.50e+00

Pos logit contributions
resses+0.043
ashes+0.042
heads+0.042
angers+0.041
 banks+0.041
Neg logit contributions
 course-0.043
 forgiveness-0.034
 motorcycle-0.033
 narrative-0.033
 understands-0.033
Token saw
Token ID2497
Feature activation+6.69e-01

Pos logit contributions
 banks+0.047
 thirsty+0.046
mary+0.044
nes+0.042
allows+0.041
Neg logit contributions
 course-0.055
 policy-0.054
 motorcycle-0.053
 forgiveness-0.053
 narrative-0.052
Token the
Token ID262
Feature activation-1.15e+00

Pos logit contributions
resses+0.026
 banks+0.025
mary+0.025
uring+0.024
allows+0.024
Neg logit contributions
 course-0.023
 narrative-0.019
orn-0.019
 motorcycle-0.019
 forgiveness-0.018
Token little
Token ID1310
Feature activation-1.24e-01

Pos logit contributions
 course+0.037
smart+0.029
 understands+0.029
orn+0.028
 narrative+0.028
Neg logit contributions
resses-0.046
 banks-0.046
heads-0.043
allows-0.042
ashes-0.042
Token sho
Token ID7354
Feature activation-7.92e+00

Pos logit contributions
 course+0.004
 narrative+0.003
 forgiveness+0.003
 understands+0.003
orn+0.003
Neg logit contributions
resses-0.005
 banks-0.005
ashes-0.005
heads-0.005
angers-0.005
Tokenel
Token ID417
Feature activation+1.10e+00

Pos logit contributions
 course+0.274
 forgiveness+0.239
 motorcycle+0.238
 narrative+0.237
 thank+0.233
Neg logit contributions
mary-0.294
resses-0.288
angers-0.282
 banks-0.281
nes-0.277
Tokenace
Token ID558
Feature activation+5.17e-02

Pos logit contributions
resses+0.041
 banks+0.039
heads+0.039
ashes+0.039
allows+0.038
Neg logit contributions
 course-0.042
 narrative-0.034
 forgiveness-0.034
 understands-0.032
 motorcycle-0.032
Token and
Token ID290
Feature activation-7.14e-01

Pos logit contributions
resses+0.002
ashes+0.002
angers+0.002
heads+0.002
uggets+0.002
Neg logit contributions
 course-0.002
 forgiveness-0.001
 motorcycle-0.001
 understands-0.001
orn-0.001
Token knew
Token ID2993
Feature activation-3.51e+00

Pos logit contributions
 course+0.026
 narrative+0.020
orn+0.019
 forgiveness+0.019
 motorcycle+0.019
Neg logit contributions
resses-0.028
 banks-0.027
ashes-0.027
heads-0.027
mary-0.027
Token that
Token ID326
Feature activation-8.47e-01

Pos logit contributions
 course+0.128
orn+0.109
 narrative+0.107
 motorcycle+0.105
 magnet+0.101
Neg logit contributions
resses-0.125
uring-0.123
mary-0.121
 banks-0.118
nes-0.118
Token,
Token ID11
Feature activation-7.24e-01

Pos logit contributions
 course+0.061
 motorcycle+0.042
 forgiveness+0.042
 narrative+0.041
orn+0.040
Neg logit contributions
resses-0.111
ashes-0.108
mary-0.108
heads-0.108
angers-0.108
Token she
Token ID673
Feature activation+6.28e-01

Pos logit contributions
 course+0.023
 forgiveness+0.017
 thanks+0.016
 thank+0.015
 again+0.015
Neg logit contributions
heads-0.034
ashes-0.034
resses-0.034
allows-0.033
angers-0.033
Token quickly
Token ID2952
Feature activation-1.70e-01

Pos logit contributions
resses+0.025
ashes+0.024
heads+0.024
mary+0.024
angers+0.024
Neg logit contributions
 course-0.022
 forgiveness-0.017
 motorcycle-0.017
 narrative-0.017
 understands-0.016
Token tied
Token ID8165
Feature activation-6.99e-01

Pos logit contributions
 course+0.006
 forgiveness+0.005
 narrative+0.005
 motorcycle+0.005
father+0.005
Neg logit contributions
resses-0.006
ashes-0.006
mary-0.006
 banks-0.006
heads-0.006
Token her
Token ID607
Feature activation+3.30e-01

Pos logit contributions
 course+0.024
 forgiveness+0.018
 motorcycle+0.017
 understands+0.017
orn+0.017
Neg logit contributions
resses-0.029
ashes-0.028
heads-0.028
angers-0.028
 banks-0.028
Token sho
Token ID7354
Feature activation-7.85e+00

Pos logit contributions
resses+0.012
 banks+0.011
ashes+0.011
mary+0.011
heads+0.011
Neg logit contributions
 course-0.013
 forgiveness-0.011
 narrative-0.011
father-0.011
 understands-0.010
Tokenel
Token ID417
Feature activation+1.19e+00

Pos logit contributions
 course+0.284
 narrative+0.256
 thank+0.255
 motorcycle+0.246
 forgiveness+0.245
Neg logit contributions
resses-0.285
angers-0.271
uggets-0.269
mary-0.259
ashes-0.258
Tokenaces
Token ID2114
Feature activation-2.38e+00

Pos logit contributions
resses+0.033
ashes+0.030
heads+0.030
 banks+0.029
angers+0.029
Neg logit contributions
 course-0.057
 forgiveness-0.050
 narrative-0.048
 understands-0.047
 motorcycle-0.047
Token and
Token ID290
Feature activation-5.44e-01

Pos logit contributions
 course+0.081
 forgiveness+0.067
 narrative+0.065
 motorcycle+0.062
 understands+0.062
Neg logit contributions
resses-0.099
ashes-0.093
angers-0.092
nes-0.092
mary-0.092
Token ran
Token ID4966
Feature activation-8.28e-01

Pos logit contributions
 course+0.020
 forgiveness+0.016
 narrative+0.016
 motorcycle+0.016
orn+0.016
Neg logit contributions
resses-0.020
heads-0.019
mary-0.019
 banks-0.019
ashes-0.019
Token away
Token ID1497
Feature activation-2.23e+00

Pos logit contributions
 course+0.028
 motorcycle+0.020
 thanks+0.020
 magnet+0.019
 again+0.019
Neg logit contributions
uggets-0.034
allows-0.034
resses-0.033
angers-0.033
mary-0.033
Token she
Token ID673
Feature activation+8.21e-01

Pos logit contributions
 course+0.022
 motorcycle+0.018
 understands+0.017
orn+0.017
 narrative+0.017
Neg logit contributions
resses-0.022
mary-0.021
uggets-0.021
ashes-0.021
 banks-0.021
Token shouldn
Token ID6584
Feature activation-4.53e+00

Pos logit contributions
resses+0.038
ashes+0.037
mary+0.037
 banks+0.036
heads+0.036
Neg logit contributions
 course-0.024
 motorcycle-0.017
 forgiveness-0.017
 narrative-0.016
orn-0.016
Token't
Token ID470
Feature activation+7.93e-01

Pos logit contributions
 course+0.196
 forgiveness+0.159
 motorcycle+0.157
 narrative+0.155
 understands+0.154
Neg logit contributions
resses-0.145
ashes-0.139
heads-0.138
angers-0.137
 banks-0.136
Token judge
Token ID5052
Feature activation-2.44e+00

Pos logit contributions
resses+0.031
ashes+0.030
heads+0.030
 banks+0.030
angers+0.030
Neg logit contributions
 course-0.029
 forgiveness-0.022
 motorcycle-0.022
 narrative-0.021
 understands-0.021
Token things
Token ID1243
Feature activation-1.50e+00

Pos logit contributions
 course+0.088
 narrative+0.071
 motorcycle+0.069
 forgiveness+0.068
orn+0.068
Neg logit contributions
resses-0.093
ashes-0.090
mary-0.089
uggets-0.088
 banks-0.087
Token based
Token ID1912
Feature activation-7.84e+00

Pos logit contributions
 course+0.051
 motorcycle+0.045
father+0.044
 narrative+0.044
orn+0.043
Neg logit contributions
resses-0.059
ashes-0.055
mary-0.055
uring-0.054
angers-0.052
Token on
Token ID319
Feature activation-1.99e+00

Pos logit contributions
 course+0.259
 forgiveness+0.221
 motorcycle+0.215
 understands+0.211
 narrative+0.205
Neg logit contributions
resses-0.297
ashes-0.296
heads-0.291
mary-0.291
uggets-0.290
Token their
Token ID511
Feature activation+2.09e+00

Pos logit contributions
 course+0.070
 forgiveness+0.054
 motorcycle+0.053
 narrative+0.053
 understands+0.052
Neg logit contributions
resses-0.079
ashes-0.076
heads-0.076
 banks-0.075
angers-0.075
Token appearance
Token ID5585
Feature activation-3.11e+00

Pos logit contributions
 banks+0.065
resses+0.064
ashes+0.063
uggets+0.060
heads+0.060
Neg logit contributions
 course-0.093
orn-0.078
 forgiveness-0.074
 narrative-0.073
ously-0.072
Token.
Token ID13
Feature activation-9.06e-01

Pos logit contributions
 course+0.111
 narrative+0.089
 motorcycle+0.087
 understands+0.085
 forgiveness+0.085
Neg logit contributions
resses-0.121
 banks-0.114
angers-0.112
mary-0.112
uggets-0.112
Token Sometimes
Token ID8975
Feature activation-2.11e-01

Pos logit contributions
 course+0.042
 motorcycle+0.036
 forgiveness+0.035
 narrative+0.035
 magnet+0.034
Neg logit contributions
resses-0.025
angers-0.024
mary-0.024
ashes-0.024
uggets-0.023
Token,
Token ID11
Feature activation-9.67e-01

Pos logit contributions
 course+0.009
 forgiveness+0.007
 motorcycle+0.007
 understands+0.007
 narrative+0.006
Neg logit contributions
resses-0.013
ashes-0.013
heads-0.013
angers-0.012
uggets-0.012
Token they
Token ID484
Feature activation+1.44e+00

Pos logit contributions
 course+0.033
 forgiveness+0.025
 motorcycle+0.024
 thanks+0.023
 magnet+0.023
Neg logit contributions
resses-0.041
ashes-0.041
heads-0.040
angers-0.039
uggets-0.039
Token saw
Token ID2497
Feature activation+6.91e-01

Pos logit contributions
 banks+0.052
mary+0.052
resses+0.051
heads+0.050
nes+0.049
Neg logit contributions
 course-0.054
 forgiveness-0.044
 policy-0.043
orn-0.042
 motorcycle-0.042
Token a
Token ID257
Feature activation-5.71e-01

Pos logit contributions
resses+0.024
 banks+0.023
mary+0.023
ashes+0.023
angers+0.023
Neg logit contributions
 course-0.027
 narrative-0.022
orn-0.021
 motorcycle-0.021
 forgiveness-0.021
Token tiny
Token ID7009
Feature activation-2.33e-01

Pos logit contributions
 course+0.022
 narrative+0.017
 forgiveness+0.017
orn+0.017
father+0.017
Neg logit contributions
resses-0.021
heads-0.020
mary-0.020
 banks-0.020
ashes-0.020
Token sho
Token ID7354
Feature activation-7.84e+00

Pos logit contributions
 course+0.008
 narrative+0.006
 forgiveness+0.006
 understands+0.006
orn+0.006
Neg logit contributions
resses-0.009
 banks-0.009
ashes-0.009
heads-0.009
mary-0.009
Tokenel
Token ID417
Feature activation+1.10e+00

Pos logit contributions
 course+0.281
 forgiveness+0.249
 motorcycle+0.237
 understands+0.235
 narrative+0.232
Neg logit contributions
resses-0.285
mary-0.280
 banks-0.279
ashes-0.272
angers-0.271
Tokenace
Token ID558
Feature activation+9.17e-02

Pos logit contributions
resses+0.042
 banks+0.041
heads+0.041
ashes+0.040
allows+0.039
Neg logit contributions
 course-0.040
 forgiveness-0.033
 narrative-0.032
 understands-0.031
 motorcycle-0.031
Token wrapped
Token ID12908
Feature activation+8.02e-01

Pos logit contributions
resses+0.004
ashes+0.004
angers+0.004
uggets+0.004
heads+0.004
Neg logit contributions
 course-0.003
 motorcycle-0.002
 forgiveness-0.002
 magnet-0.002
orn-0.002
Token around
Token ID1088
Feature activation+1.50e+00

Pos logit contributions
resses+0.028
mary+0.027
 banks+0.027
ashes+0.027
uggets+0.027
Neg logit contributions
 course-0.032
 forgiveness-0.026
 narrative-0.025
 understands-0.025
 motorcycle-0.025
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
resses+0.059
mary+0.057
 banks+0.057
uggets+0.057
ashes+0.056
Neg logit contributions
 course-0.051
orn-0.040
 narrative-0.040
 understands-0.040
 motorcycle-0.040
Token game
Token ID983
Feature activation-1.97e+00

Pos logit contributions
 course+0.020
 understands+0.016
 narrative+0.016
 forgiveness+0.016
father+0.016
Neg logit contributions
resses-0.021
 banks-0.020
ashes-0.020
heads-0.020
uggets-0.019
Token to
Token ID284
Feature activation+1.89e-01

Pos logit contributions
 course+0.065
 forgiveness+0.049
 narrative+0.049
 understands+0.049
 motorcycle+0.048
Neg logit contributions
resses-0.083
ashes-0.079
angers-0.079
heads-0.078
 banks-0.078
Token play
Token ID711
Feature activation-1.99e-01

Pos logit contributions
resses+0.007
ashes+0.007
 banks+0.007
heads+0.007
mary+0.007
Neg logit contributions
 course-0.007
 forgiveness-0.006
 narrative-0.006
 understands-0.006
 motorcycle-0.005
Token while
Token ID981
Feature activation+2.76e+00

Pos logit contributions
 course+0.007
 forgiveness+0.005
 motorcycle+0.005
 magnet+0.005
 narrative+0.005
Neg logit contributions
resses-0.008
ashes-0.008
heads-0.008
mary-0.008
angers-0.008
Token they
Token ID484
Feature activation+1.76e+00

Pos logit contributions
resses+0.123
ashes+0.121
heads+0.120
mary+0.118
 banks+0.118
Neg logit contributions
 course-0.085
 forgiveness-0.063
 motorcycle-0.061
 narrative-0.059
 magnet-0.059
Token sho
Token ID7354
Feature activation-7.83e+00

Pos logit contributions
resses+0.071
ashes+0.070
 banks+0.069
mary+0.068
heads+0.068
Neg logit contributions
 course-0.060
 forgiveness-0.047
 motorcycle-0.045
 narrative-0.045
 understands-0.045
Tokenpped
Token ID1496
Feature activation-1.36e+00

Pos logit contributions
 course+0.266
 narrative+0.216
 forgiveness+0.215
 motorcycle+0.212
 understands+0.212
Neg logit contributions
resses-0.311
angers-0.301
ashes-0.301
uggets-0.301
 banks-0.298
Token.
Token ID13
Feature activation-6.48e-01

Pos logit contributions
 course+0.045
 forgiveness+0.038
 understands+0.037
 narrative+0.036
father+0.035
Neg logit contributions
resses-0.055
 banks-0.053
ashes-0.053
mary-0.052
heads-0.051
Token He
Token ID679
Feature activation-2.85e+00

Pos logit contributions
 course+0.021
 understands+0.015
 forgiveness+0.015
 motorcycle+0.015
 magnet+0.015
Neg logit contributions
heads-0.028
resses-0.027
mary-0.027
angers-0.027
ashes-0.027
Token liked
Token ID8288
Feature activation+4.72e-01

Pos logit contributions
 course+0.087
 forgiveness+0.066
 narrative+0.063
 motorcycle+0.062
orn+0.061
Neg logit contributions
resses-0.126
ashes-0.122
 banks-0.122
mary-0.122
heads-0.121
Token to
Token ID284
Feature activation+3.73e-01

Pos logit contributions
resses+0.020
ashes+0.020
heads+0.019
angers+0.019
 banks+0.019
Neg logit contributions
 course-0.015
 forgiveness-0.012
 motorcycle-0.011
 understands-0.011
 narrative-0.011
Token.
Token ID13
Feature activation-6.12e-01

Pos logit contributions
 course+0.089
 narrative+0.074
 motorcycle+0.072
 forgiveness+0.071
orn+0.071
Neg logit contributions
resses-0.078
heads-0.074
ashes-0.074
uggets-0.073
nes-0.072
Token Let
Token ID3914
Feature activation-3.59e+00

Pos logit contributions
 course+0.019
 forgiveness+0.014
 motorcycle+0.014
 understands+0.014
 magnet+0.014
Neg logit contributions
resses-0.027
ashes-0.026
heads-0.026
angers-0.026
 banks-0.026
Token's
Token ID338
Feature activation+1.15e+00

Pos logit contributions
 course+0.133
 forgiveness+0.108
 motorcycle+0.107
 narrative+0.106
 understands+0.103
Neg logit contributions
resses-0.129
 banks-0.127
ashes-0.127
mary-0.126
heads-0.125
Token tie
Token ID9839
Feature activation-1.29e+00

Pos logit contributions
resses+0.047
ashes+0.045
angers+0.045
heads+0.045
uggets+0.044
Neg logit contributions
 course-0.040
 forgiveness-0.031
 motorcycle-0.030
 magnet-0.029
 narrative-0.029
Token your
Token ID534
Feature activation+1.50e+00

Pos logit contributions
 course+0.045
 forgiveness+0.032
 motorcycle+0.032
 magnet+0.031
 thanks+0.030
Neg logit contributions
heads-0.054
angers-0.053
resses-0.053
ashes-0.053
mary-0.052
Token sho
Token ID7354
Feature activation-7.83e+00

Pos logit contributions
resses+0.052
 banks+0.049
heads+0.049
ashes+0.049
mary+0.047
Neg logit contributions
 course-0.060
 forgiveness-0.050
 narrative-0.049
 understands-0.047
father-0.047
Tokenel
Token ID417
Feature activation+1.05e+00

Pos logit contributions
 course+0.258
 narrative+0.240
 thank+0.236
 motorcycle+0.229
 policeman+0.228
Neg logit contributions
uggets-0.290
resses-0.289
angers-0.273
heads-0.269
mary-0.267
Tokenaces
Token ID2114
Feature activation-2.43e+00

Pos logit contributions
resses+0.032
ashes+0.031
heads+0.031
angers+0.030
mary+0.030
Neg logit contributions
 course-0.047
 forgiveness-0.038
 motorcycle-0.038
orn-0.038
 understands-0.037
Token together
Token ID1978
Feature activation+6.84e-01

Pos logit contributions
 course+0.074
 motorcycle+0.053
 forgiveness+0.052
orn+0.051
 magnet+0.051
Neg logit contributions
resses-0.107
heads-0.105
ashes-0.105
mary-0.104
angers-0.104
Token so
Token ID523
Feature activation+2.96e+00

Pos logit contributions
resses+0.025
ashes+0.025
heads+0.024
angers+0.024
 banks+0.024
Neg logit contributions
 course-0.026
 forgiveness-0.020
 motorcycle-0.020
 understands-0.019
orn-0.019
Token they
Token ID484
Feature activation+1.76e+00

Pos logit contributions
resses+0.101
nes+0.089
ashes+0.089
uring+0.088
mary+0.087
Neg logit contributions
 course-0.125
father-0.106
 understands-0.103
 narrative-0.103
orn-0.101
Token owners
Token ID4393
Feature activation-2.53e+00

Pos logit contributions
 course+0.044
 understands+0.034
orn+0.034
 forgiveness+0.034
ously+0.033
Neg logit contributions
 banks-0.051
resses-0.050
heads-0.050
mary-0.049
uring-0.048
Token quickly
Token ID2952
Feature activation-1.95e-01

Pos logit contributions
 course+0.078
 motorcycle+0.066
 narrative+0.065
 policy+0.064
orn+0.064
Neg logit contributions
 banks-0.099
nes-0.098
mary-0.097
heads-0.097
resses-0.096
Token unt
Token ID1418
Feature activation-5.08e+00

Pos logit contributions
 course+0.007
 narrative+0.007
 motorcycle+0.006
 forgiveness+0.006
orn+0.006
Neg logit contributions
mary-0.007
 banks-0.007
resses-0.006
uring-0.006
ashes-0.006
Tokenied
Token ID798
Feature activation+5.72e-01

Pos logit contributions
 course+0.187
 forgiveness+0.149
 motorcycle+0.149
 magnet+0.143
 narrative+0.142
Neg logit contributions
resses-0.190
ashes-0.189
heads-0.184
uggets-0.183
angers-0.183
Token the
Token ID262
Feature activation-9.68e-01

Pos logit contributions
resses+0.024
ashes+0.024
 banks+0.023
mary+0.023
heads+0.023
Neg logit contributions
 course-0.018
 forgiveness-0.014
 narrative-0.014
 motorcycle-0.014
orn-0.013
Token sho
Token ID7354
Feature activation-7.77e+00

Pos logit contributions
 course+0.034
 narrative+0.027
smart+0.027
 forgiveness+0.026
ously+0.026
Neg logit contributions
resses-0.037
 banks-0.036
heads-0.034
nes-0.033
uring-0.032
Tokenel
Token ID417
Feature activation+1.19e+00

Pos logit contributions
 course+0.272
 narrative+0.253
 motorcycle+0.249
 thank+0.248
 forgiveness+0.246
Neg logit contributions
mary-0.264
resses-0.261
 banks-0.256
angers-0.256
nes-0.250
Tokenace
Token ID558
Feature activation+8.38e-02

Pos logit contributions
resses+0.042
 banks+0.040
heads+0.039
ashes+0.038
allows+0.038
Neg logit contributions
 course-0.048
 narrative-0.040
 forgiveness-0.039
 understands-0.037
 motorcycle-0.037
Token and
Token ID290
Feature activation-6.20e-01

Pos logit contributions
resses+0.004
ashes+0.004
heads+0.004
angers+0.004
uggets+0.004
Neg logit contributions
 course-0.003
 forgiveness-0.002
 motorcycle-0.002
 narrative-0.002
 understands-0.002
Token the
Token ID262
Feature activation-9.15e-01

Pos logit contributions
 course+0.023
 narrative+0.019
orn+0.018
 motorcycle+0.018
 understands+0.017
Neg logit contributions
resses-0.023
mary-0.022
 banks-0.022
heads-0.022
ashes-0.021
Token depend
Token ID4745
Feature activation-4.31e+00

Pos logit contributions
 course+0.031
orn+0.025
 narrative+0.024
 understands+0.024
 forgiveness+0.024
Neg logit contributions
resses-0.035
 banks-0.035
heads-0.033
nes-0.032
allows-0.032
Token!
Token ID0
Feature activation-8.51e-01

Pos logit contributions
 course+0.039
 forgiveness+0.029
 motorcycle+0.028
 understands+0.027
 narrative+0.027
Neg logit contributions
resses-0.052
ashes-0.051
heads-0.050
angers-0.050
 banks-0.050
Token Can
Token ID1680
Feature activation-4.55e-01

Pos logit contributions
 course+0.033
 forgiveness+0.027
 motorcycle+0.027
 narrative+0.026
ously+0.026
Neg logit contributions
resses-0.030
ashes-0.029
angers-0.028
uggets-0.028
mary-0.028
Token you
Token ID345
Feature activation+2.43e-01

Pos logit contributions
 course+0.018
 understands+0.014
orn+0.014
 narrative+0.014
 motorcycle+0.014
Neg logit contributions
resses-0.016
heads-0.015
uggets-0.015
 banks-0.015
nes-0.015
Token tie
Token ID9839
Feature activation-1.10e+00

Pos logit contributions
resses+0.011
angers+0.011
ashes+0.011
heads+0.011
mary+0.011
Neg logit contributions
 course-0.007
 forgiveness-0.006
 motorcycle-0.005
 forgive-0.005
 magnet-0.005
Token my
Token ID616
Feature activation+1.74e+00

Pos logit contributions
 course+0.039
 forgiveness+0.029
 motorcycle+0.029
 understands+0.028
orn+0.028
Neg logit contributions
resses-0.045
ashes-0.044
heads-0.043
angers-0.043
 banks-0.043
Token sho
Token ID7354
Feature activation-7.77e+00

Pos logit contributions
resses+0.064
heads+0.060
ashes+0.060
 banks+0.060
angers+0.058
Neg logit contributions
 course-0.066
 forgiveness-0.053
 narrative-0.053
 understands-0.052
father-0.052
Tokenel
Token ID417
Feature activation+1.06e+00

Pos logit contributions
 course+0.239
 narrative+0.219
 thank+0.219
 policeman+0.215
 understands+0.215
Neg logit contributions
uggets-0.293
resses-0.289
angers-0.281
ashes-0.280
nes-0.275
Tokenaces
Token ID2114
Feature activation-2.16e+00

Pos logit contributions
resses+0.043
heads+0.041
ashes+0.041
angers+0.040
 banks+0.040
Neg logit contributions
 course-0.037
 forgiveness-0.029
 narrative-0.028
 understands-0.028
 motorcycle-0.027
Token for
Token ID329
Feature activation+1.11e+00

Pos logit contributions
 course+0.072
 forgiveness+0.057
 understands+0.054
 narrative+0.054
 motorcycle+0.054
Neg logit contributions
resses-0.090
ashes-0.087
heads-0.085
angers-0.085
uggets-0.085
Token me
Token ID502
Feature activation+1.60e+00

Pos logit contributions
resses+0.041
ashes+0.041
mary+0.041
angers+0.041
heads+0.040
Neg logit contributions
 course-0.043
 forgiveness-0.033
 motorcycle-0.032
orn-0.031
 magnet-0.031
Token?"
Token ID1701
Feature activation+3.45e+00

Pos logit contributions
resses+0.073
ashes+0.071
 banks+0.071
angers+0.070
heads+0.070
Neg logit contributions
 course-0.047
 forgiveness-0.034
 motorcycle-0.034
orn-0.033
 understands-0.032
Token friends
Token ID2460
Feature activation-9.09e-01

Pos logit contributions
resses+0.021
ashes+0.020
heads+0.020
mary+0.020
 banks+0.020
Neg logit contributions
 course-0.017
 forgiveness-0.013
 motorcycle-0.013
 narrative-0.013
 understands-0.013
Token helped
Token ID4193
Feature activation-2.31e+00

Pos logit contributions
 course+0.032
 narrative+0.027
orn+0.026
 forgiveness+0.026
father+0.026
Neg logit contributions
nes-0.033
resses-0.033
 banks-0.033
mary-0.032
ashes-0.032
Token her
Token ID607
Feature activation+4.86e-01

Pos logit contributions
 course+0.074
 forgiveness+0.057
 forgive+0.053
 thank+0.050
 motorcycle+0.050
Neg logit contributions
resses-0.101
angers-0.099
ashes-0.099
 banks-0.098
uggets-0.097
Token loosen
Token ID40953
Feature activation-2.75e+00

Pos logit contributions
resses+0.019
angers+0.019
ashes+0.019
heads+0.019
uggets+0.019
Neg logit contributions
 course-0.017
 forgiveness-0.013
 motorcycle-0.013
 magnet-0.013
 understands-0.013
Token her
Token ID607
Feature activation+5.02e-01

Pos logit contributions
 course+0.097
 forgiveness+0.073
 motorcycle+0.071
orn+0.070
 narrative+0.069
Neg logit contributions
resses-0.111
ashes-0.108
heads-0.108
angers-0.106
uggets-0.106
Token sho
Token ID7354
Feature activation-7.76e+00

Pos logit contributions
resses+0.019
angers+0.018
uggets+0.018
allows+0.018
nes+0.018
Neg logit contributions
 course-0.018
 forgiveness-0.015
 thanks-0.014
 understands-0.014
 thank-0.014
Tokenel
Token ID417
Feature activation+1.29e+00

Pos logit contributions
 course+0.261
 motorcycle+0.232
 thank+0.230
 narrative+0.230
 understands+0.228
Neg logit contributions
resses-0.287
uggets-0.285
angers-0.283
mary-0.282
nes-0.276
Tokenaces
Token ID2114
Feature activation-2.29e+00

Pos logit contributions
resses+0.041
ashes+0.041
uggets+0.040
angers+0.040
uring+0.040
Neg logit contributions
 course-0.054
orn-0.043
 forgiveness-0.042
 magnet-0.041
 motorcycle-0.041
Token so
Token ID523
Feature activation+3.06e+00

Pos logit contributions
 course+0.078
 forgiveness+0.061
 understands+0.059
 narrative+0.059
 motorcycle+0.057
Neg logit contributions
resses-0.096
ashes-0.092
heads-0.091
 banks-0.091
angers-0.091
Token she
Token ID673
Feature activation+8.79e-01

Pos logit contributions
resses+0.102
mary+0.097
ashes+0.095
nes+0.094
uring+0.093
Neg logit contributions
 course-0.128
father-0.108
 motorcycle-0.106
 narrative-0.104
 forgiveness-0.103
Token could
Token ID714
Feature activation-2.15e+00

Pos logit contributions
resses+0.039
ashes+0.038
mary+0.037
angers+0.037
nes+0.037
Neg logit contributions
 course-0.027
 forgiveness-0.020
 narrative-0.020
 motorcycle-0.020
 understands-0.019
Token not
Token ID407
Feature activation-8.94e-01

Pos logit contributions
resses+0.059
ashes+0.059
 banks+0.058
angers+0.057
nes+0.056
Neg logit contributions
 course-0.051
 forgiveness-0.044
 narrative-0.041
 policy-0.041
 thanks-0.040
Token nice
Token ID3621
Feature activation-6.94e-01

Pos logit contributions
 course+0.026
 forgiveness+0.022
 thanks+0.021
 forgive+0.020
 policy+0.019
Neg logit contributions
angers-0.041
resses-0.040
 banks-0.040
ashes-0.039
mary-0.039
Token to
Token ID284
Feature activation+5.19e-01

Pos logit contributions
 course+0.025
 forgiveness+0.019
 motorcycle+0.018
 narrative+0.018
 understands+0.018
Neg logit contributions
resses-0.028
ashes-0.027
heads-0.027
angers-0.027
 banks-0.027
Token judge
Token ID5052
Feature activation-2.46e+00

Pos logit contributions
angers+0.024
resses+0.024
uggets+0.023
allows+0.022
mary+0.022
Neg logit contributions
 course-0.017
 forgiveness-0.013
 forgive-0.012
 return-0.012
 thank-0.011
Token things
Token ID1243
Feature activation-1.41e+00

Pos logit contributions
 course+0.088
 forgiveness+0.068
 understands+0.065
 motorcycle+0.065
 thanks+0.063
Neg logit contributions
resses-0.099
heads-0.096
ashes-0.096
angers-0.095
uggets-0.094
Token based
Token ID1912
Feature activation-7.75e+00

Pos logit contributions
 course+0.048
 narrative+0.038
orn+0.037
father+0.037
 understands+0.037
Neg logit contributions
resses-0.058
ashes-0.056
mary-0.055
uring-0.054
 banks-0.054
Token on
Token ID319
Feature activation-1.84e+00

Pos logit contributions
 course+0.251
 forgiveness+0.200
 understands+0.195
 narrative+0.193
 motorcycle+0.192
Neg logit contributions
ashes-0.318
resses-0.315
heads-0.308
uggets-0.302
mary-0.301
Token their
Token ID511
Feature activation+2.27e+00

Pos logit contributions
 course+0.062
 forgiveness+0.047
 motorcycle+0.044
 understands+0.043
 narrative+0.043
Neg logit contributions
resses-0.078
heads-0.077
ashes-0.076
angers-0.076
uggets-0.075
Token looks
Token ID3073
Feature activation+1.90e-01

Pos logit contributions
resses+0.093
ashes+0.090
heads+0.089
angers+0.089
uggets+0.089
Neg logit contributions
 course-0.079
 forgiveness-0.060
 motorcycle-0.059
 narrative-0.058
 understands-0.058
Token.
Token ID13
Feature activation-4.88e-01

Pos logit contributions
resses+0.009
heads+0.009
ashes+0.009
angers+0.009
uggets+0.009
Neg logit contributions
 course-0.005
 forgiveness-0.004
 motorcycle-0.004
 understands-0.004
 narrative-0.004
Token She
Token ID1375
Feature activation-1.69e-01

Pos logit contributions
 course+0.014
 understands+0.011
 thanks+0.010
 again+0.010
 thank+0.010
Neg logit contributions
uggets-0.023
allows-0.023
heads-0.022
nes-0.021
resses-0.021
Token pink
Token ID11398
Feature activation+4.74e-01

Pos logit contributions
resses+0.018
 banks+0.016
angers+0.016
ashes+0.015
heads+0.015
Neg logit contributions
 course-0.021
ously-0.019
 narrative-0.018
 forgiveness-0.018
 policy-0.018
Token dress
Token ID6576
Feature activation-1.38e+00

Pos logit contributions
resses+0.017
 banks+0.016
angers+0.016
ashes+0.016
heads+0.015
Neg logit contributions
 course-0.018
 narrative-0.015
 understands-0.015
 forgiveness-0.015
ously-0.014
Token and
Token ID290
Feature activation-3.79e-01

Pos logit contributions
 course+0.053
 narrative+0.044
 understands+0.043
 forgiveness+0.043
 motorcycle+0.041
Neg logit contributions
resses-0.051
angers-0.048
mary-0.048
heads-0.047
nes-0.047
Token tie
Token ID9839
Feature activation-1.17e+00

Pos logit contributions
 course+0.015
 narrative+0.012
orn+0.011
 motorcycle+0.011
 forgiveness+0.011
Neg logit contributions
resses-0.014
heads-0.013
 banks-0.013
mary-0.013
ashes-0.013
Token her
Token ID607
Feature activation+5.81e-01

Pos logit contributions
 course+0.044
 forgiveness+0.034
 motorcycle+0.033
 understands+0.033
orn+0.033
Neg logit contributions
resses-0.044
heads-0.043
ashes-0.043
 banks-0.042
uggets-0.042
Token sho
Token ID7354
Feature activation-7.75e+00

Pos logit contributions
resses+0.020
 banks+0.018
heads+0.018
angers+0.017
mary+0.017
Neg logit contributions
 course-0.023
 narrative-0.020
 forgiveness-0.020
father-0.019
ously-0.019
Tokenel
Token ID417
Feature activation+1.33e+00

Pos logit contributions
 course+0.271
 narrative+0.245
 thank+0.239
 motorcycle+0.237
 forgiveness+0.228
Neg logit contributions
resses-0.289
angers-0.279
uggets-0.278
nes-0.271
mary-0.268
Tokenaces
Token ID2114
Feature activation-2.26e+00

Pos logit contributions
resses+0.036
heads+0.034
ashes+0.033
angers+0.033
 banks+0.032
Neg logit contributions
 course-0.065
 forgiveness-0.055
 narrative-0.054
 understands-0.053
 motorcycle-0.053
Token tight
Token ID5381
Feature activation-1.49e+00

Pos logit contributions
 course+0.084
 forgiveness+0.067
 narrative+0.066
 understands+0.066
 motorcycle+0.065
Neg logit contributions
resses-0.088
angers-0.083
ashes-0.082
nes-0.082
heads-0.082
Token,
Token ID11
Feature activation-6.36e-01

Pos logit contributions
 course+0.056
 forgiveness+0.044
 narrative+0.044
 understands+0.043
 motorcycle+0.043
Neg logit contributions
resses-0.058
 banks-0.054
ashes-0.054
angers-0.054
heads-0.053
Token just
Token ID655
Feature activation+1.61e+00

Pos logit contributions
 course+0.024
 narrative+0.019
 forgiveness+0.019
orn+0.019
 motorcycle+0.019
Neg logit contributions
resses-0.023
ashes-0.022
mary-0.022
heads-0.022
angers-0.022
Token and
Token ID290
Feature activation-3.89e-01

Pos logit contributions
 course+0.055
 narrative+0.042
 forgiveness+0.042
 motorcycle+0.040
orn+0.040
Neg logit contributions
resses-0.080
mary-0.077
heads-0.077
ashes-0.077
 banks-0.076
Token how
Token ID703
Feature activation-3.31e+00

Pos logit contributions
 course+0.014
 narrative+0.011
orn+0.011
 motorcycle+0.010
father+0.010
Neg logit contributions
resses-0.015
mary-0.015
heads-0.014
 banks-0.014
uggets-0.014
Token she
Token ID673
Feature activation+8.46e-01

Pos logit contributions
 course+0.112
 understands+0.087
 forgiveness+0.085
 motorcycle+0.082
orn+0.082
Neg logit contributions
resses-0.136
uggets-0.135
 banks-0.135
angers-0.134
heads-0.134
Token tied
Token ID8165
Feature activation-5.07e-01

Pos logit contributions
resses+0.030
mary+0.029
nes+0.027
ender+0.026
 banks+0.026
Neg logit contributions
 course-0.030
 narrative-0.028
 policy-0.027
 forgiveness-0.027
 motorcycle-0.026
Token her
Token ID607
Feature activation+5.17e-01

Pos logit contributions
 course+0.018
 forgiveness+0.014
 motorcycle+0.014
 narrative+0.014
 understands+0.014
Neg logit contributions
resses-0.020
ashes-0.019
heads-0.019
angers-0.019
mary-0.019
Token sho
Token ID7354
Feature activation-7.75e+00

Pos logit contributions
resses+0.019
 banks+0.017
ashes+0.017
angers+0.017
mary+0.017
Neg logit contributions
 course-0.020
 forgiveness-0.017
father-0.016
 narrative-0.016
 understands-0.016
Tokenel
Token ID417
Feature activation+1.35e+00

Pos logit contributions
 course+0.275
 thank+0.247
 narrative+0.245
 motorcycle+0.242
 forgiveness+0.236
Neg logit contributions
resses-0.283
angers-0.276
uggets-0.275
nes-0.267
mary-0.260
Tokenaces
Token ID2114
Feature activation-2.21e+00

Pos logit contributions
resses+0.036
ashes+0.034
heads+0.033
 banks+0.033
angers+0.033
Neg logit contributions
 course-0.066
 forgiveness-0.057
 narrative-0.055
 understands-0.055
 motorcycle-0.054
Token all
Token ID477
Feature activation-1.06e+00

Pos logit contributions
 course+0.077
 forgiveness+0.062
 narrative+0.060
 understands+0.060
 motorcycle+0.059
Neg logit contributions
resses-0.090
ashes-0.085
nes-0.084
angers-0.084
heads-0.084
Token by
Token ID416
Feature activation-2.56e-01

Pos logit contributions
 course+0.045
 forgiveness+0.043
 motorcycle+0.042
 narrative+0.041
 understands+0.041
Neg logit contributions
ashes-0.028
resses-0.028
mary-0.028
nes-0.026
 banks-0.026
Token herself
Token ID5223
Feature activation-5.46e-01

Pos logit contributions
 course+0.013
 motorcycle+0.011
 understands+0.011
 narrative+0.011
 forgiveness+0.011
Neg logit contributions
resses-0.006
 banks-0.006
heads-0.006
uring-0.006
mary-0.006
Token not
Token ID407
Feature activation-8.54e-01

Pos logit contributions
 course+0.080
 forgiveness+0.065
 motorcycle+0.063
 understands+0.063
 thanks+0.062
Neg logit contributions
resses-0.068
uggets-0.066
angers-0.066
ashes-0.066
heads-0.066
Token.
Token ID13
Feature activation-2.70e-01

Pos logit contributions
 course+0.031
 forgiveness+0.024
 narrative+0.024
 understands+0.023
 motorcycle+0.023
Neg logit contributions
resses-0.033
ashes-0.033
heads-0.032
 banks-0.032
mary-0.032
Token "
Token ID366
Feature activation+8.28e-01

Pos logit contributions
 course+0.008
 understands+0.006
 forgiveness+0.006
 thanks+0.006
 magnet+0.006
Neg logit contributions
ashes-0.012
heads-0.012
resses-0.012
angers-0.012
uggets-0.012
TokenA
Token ID32
Feature activation+1.77e-01

Pos logit contributions
angers+0.041
resses+0.040
 banks+0.040
uggets+0.040
ashes+0.039
Neg logit contributions
 course-0.023
 narrative-0.015
orn-0.015
 forgiveness-0.014
 motorcycle-0.014
Tokench
Token ID354
Feature activation+7.21e+00

Pos logit contributions
heads+0.006
resses+0.006
 kettle+0.006
allows+0.006
mary+0.006
Neg logit contributions
 course-0.006
 narrative-0.005
ously-0.005
 forgiveness-0.005
father-0.005
Tokenoo
Token ID2238
Feature activation+4.63e+00

Pos logit contributions
resses+0.254
heads+0.249
ashes+0.249
mary+0.243
angers+0.243
Neg logit contributions
 course-0.276
 forgiveness-0.222
 narrative-0.219
 motorcycle-0.219
 understands-0.217
Token!"
Token ID2474
Feature activation+3.46e+00

Pos logit contributions
resses+0.185
ashes+0.179
heads+0.178
angers+0.177
 banks+0.176
Neg logit contributions
 course-0.163
 forgiveness-0.125
 motorcycle-0.122
 narrative-0.121
 understands-0.120
Token she
Token ID673
Feature activation+8.96e-01

Pos logit contributions
uggets+0.195
angers+0.190
ashes+0.186
resses+0.177
ans+0.177
Neg logit contributions
 course-0.081
 thank-0.062
 understands-0.058
,-0.057
 thanks-0.057
Token sne
Token ID10505
Feature activation+4.15e+00

Pos logit contributions
angers+0.051
uggets+0.050
ashes+0.050
resses+0.049
uring+0.046
Neg logit contributions
 course-0.018
 knew-0.014
 smiled-0.013
 understood-0.013
 was-0.013
Tokenez
Token ID8471
Feature activation+2.17e+00

Pos logit contributions
resses+0.166
ashes+0.163
heads+0.162
angers+0.159
mary+0.159
Neg logit contributions
 course-0.144
 forgiveness-0.111
 motorcycle-0.111
 narrative-0.108
 understands-0.107
Tokened
Token ID276
Feature activation-2.13e-01

Pos logit contributions
resses+0.093
 banks+0.091
ashes+0.091
uggets+0.090
heads+0.090
Neg logit contributions
 course-0.069
 forgiveness-0.052
 understands-0.051
orn-0.051
 motorcycle-0.050
Token ran
Token ID4966
Feature activation-1.19e+00

Pos logit contributions
resses+0.037
heads+0.036
ashes+0.036
angers+0.035
uggets+0.035
Neg logit contributions
 course-0.025
 forgiveness-0.018
 motorcycle-0.018
 understands-0.018
 narrative-0.018
Token away
Token ID1497
Feature activation-2.32e+00

Pos logit contributions
 course+0.036
 again+0.027
 thanks+0.026
 motorcycle+0.025
 hero+0.025
Neg logit contributions
uggets-0.052
allows-0.052
angers-0.052
ender-0.051
resses-0.051
Token from
Token ID422
Feature activation-2.20e+00

Pos logit contributions
 course+0.067
 forgiveness+0.050
 thanks+0.047
 understands+0.047
 motorcycle+0.047
Neg logit contributions
resses-0.106
angers-0.103
heads-0.103
ashes-0.102
uggets-0.102
Token the
Token ID262
Feature activation-1.37e+00

Pos logit contributions
 course+0.076
 narrative+0.060
 understands+0.060
orn+0.059
 forgiveness+0.059
Neg logit contributions
resses-0.085
ashes-0.084
 banks-0.083
mary-0.082
uggets-0.081
Token pick
Token ID2298
Feature activation-1.36e+00

Pos logit contributions
 course+0.051
ously+0.041
 understands+0.040
orn+0.040
 forgiveness+0.040
Neg logit contributions
resses-0.049
 banks-0.049
ashes-0.048
heads-0.047
mary-0.046
Tokenle
Token ID293
Feature activation+4.86e+00

Pos logit contributions
 course+0.042
 forgiveness+0.033
 motorcycle+0.032
 understands+0.032
 narrative+0.032
Neg logit contributions
resses-0.058
ashes-0.057
mary-0.056
 banks-0.056
angers-0.056
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
resses+0.223
heads+0.218
angers+0.216
ashes+0.215
uggets+0.213
Neg logit contributions
 course-0.147
 forgiveness-0.105
 motorcycle-0.100
 understands-0.099
 thanks-0.098
Token They
Token ID1119
Feature activation+1.47e+00

Pos logit contributions
 course+0.041
 forgiveness+0.032
 understands+0.031
 thanks+0.031
 magnet+0.030
Neg logit contributions
heads-0.043
resses-0.043
ashes-0.042
 banks-0.042
angers-0.042
Token ran
Token ID4966
Feature activation-1.07e+00

Pos logit contributions
resses+0.061
mary+0.059
 banks+0.059
ashes+0.059
heads+0.058
Neg logit contributions
 course-0.048
 forgiveness-0.037
 motorcycle-0.037
 narrative-0.036
orn-0.036
Token and
Token ID290
Feature activation-5.76e-01

Pos logit contributions
 course+0.035
 thanks+0.026
 motorcycle+0.025
 hero+0.025
 understands+0.024
Neg logit contributions
uggets-0.044
resses-0.044
heads-0.044
allows-0.044
angers-0.044
Token laughed
Token ID13818
Feature activation+6.85e-01

Pos logit contributions
 course+0.018
 thanks+0.012
 forgiveness+0.012
 understands+0.012
 motorcycle+0.012
Neg logit contributions
resses-0.026
angers-0.025
heads-0.025
uggets-0.025
ashes-0.025
Token never
Token ID1239
Feature activation-3.96e+00

Pos logit contributions
 course+0.073
 motorcycle+0.058
 forgiveness+0.057
 understands+0.056
orn+0.056
Neg logit contributions
resses-0.080
 banks-0.079
ashes-0.078
mary-0.077
heads-0.077
Token forget
Token ID6044
Feature activation-1.75e+00

Pos logit contributions
 course+0.155
 motorcycle+0.129
orn+0.124
father+0.124
 narrative+0.122
Neg logit contributions
ashes-0.137
 banks-0.132
heads-0.130
uring-0.129
nes-0.129
Token the
Token ID262
Feature activation-7.00e-01

Pos logit contributions
 course+0.058
orn+0.047
 narrative+0.046
 motorcycle+0.045
 magnet+0.044
Neg logit contributions
resses-0.069
mary-0.066
 banks-0.066
uring-0.065
uggets-0.065
Token things
Token ID1243
Feature activation-1.52e+00

Pos logit contributions
 course+0.026
 motorcycle+0.020
 forgiveness+0.020
orn+0.020
 understands+0.020
Neg logit contributions
resses-0.026
ashes-0.026
heads-0.025
angers-0.025
 banks-0.025
Token Av
Token ID5184
Feature activation+1.94e+00

Pos logit contributions
 course+0.050
 narrative+0.041
orn+0.040
father+0.039
 understands+0.038
Neg logit contributions
resses-0.059
uring-0.057
nes-0.056
 thirsty-0.056
ashes-0.055
Tokena
Token ID64
Feature activation+4.95e+00

Pos logit contributions
 banks+0.071
resses+0.065
 gag+0.063
heads+0.063
uggets+0.060
Neg logit contributions
 course-0.071
 understands-0.062
 thanks-0.060
in-0.059
 knows-0.059
Token had
Token ID550
Feature activation-1.12e+00

Pos logit contributions
resses+0.219
heads+0.212
ashes+0.212
angers+0.211
 banks+0.209
Neg logit contributions
 course-0.155
 forgiveness-0.114
 understands-0.110
 motorcycle-0.110
 narrative-0.108
Token taught
Token ID7817
Feature activation-2.23e-02

Pos logit contributions
 course+0.041
 motorcycle+0.032
 forgiveness+0.031
 narrative+0.031
orn+0.031
Neg logit contributions
resses-0.043
ashes-0.042
heads-0.041
uggets-0.041
mary-0.041
Token him
Token ID683
Feature activation-3.35e-01

Pos logit contributions
 course+0.001
 forgiveness+0.001
 motorcycle+0.001
 narrative+0.001
 understands+0.001
Neg logit contributions
resses-0.001
ashes-0.001
heads-0.001
angers-0.001
 banks-0.001
Token.
Token ID13
Feature activation-5.65e-01

Pos logit contributions
 course+0.011
 narrative+0.008
 motorcycle+0.008
 forgiveness+0.008
orn+0.008
Neg logit contributions
resses-0.014
ashes-0.014
mary-0.014
heads-0.013
 banks-0.013
Token<|endoftext|>
Token ID50256
Feature activation+1.91e+00

Pos logit contributions
 course+0.018
 forgiveness+0.014
 motorcycle+0.013
 narrative+0.013
 understands+0.013
Neg logit contributions
resses-0.024
ashes-0.024
heads-0.023
angers-0.023
 banks-0.023
Token asked
Token ID1965
Feature activation+1.79e+00

Pos logit contributions
ashes+0.111
resses+0.111
angers+0.110
heads+0.109
uggets+0.108
Neg logit contributions
 course-0.060
 forgiveness-0.043
 motorcycle-0.040
 understands-0.039
 magnet-0.039
Token Sam
Token ID3409
Feature activation-1.83e+00

Pos logit contributions
resses+0.069
mary+0.067
uggets+0.067
angers+0.066
heads+0.066
Neg logit contributions
 course-0.064
 narrative-0.049
 motorcycle-0.049
 understands-0.049
 forgiveness-0.048
Token nervously
Token ID43324
Feature activation+1.02e+00

Pos logit contributions
 course+0.054
 forgiveness+0.040
 motorcycle+0.039
 understands+0.038
 magnet+0.037
Neg logit contributions
resses-0.083
ashes-0.082
angers-0.081
heads-0.080
 banks-0.080
Token.
Token ID13
Feature activation-6.96e-01

Pos logit contributions
resses+0.048
ashes+0.047
angers+0.047
uggets+0.046
 banks+0.046
Neg logit contributions
 course-0.029
 forgiveness-0.021
 motorcycle-0.020
 understands-0.019
 magnet-0.019
Token L
Token ID406
Feature activation+3.46e+00

Pos logit contributions
 course+0.020
 forgiveness+0.014
 understands+0.014
 motorcycle+0.014
 magnet+0.014
Neg logit contributions
resses-0.032
ashes-0.032
heads-0.031
angers-0.031
uggets-0.031
Tokenola
Token ID5708
Feature activation+5.60e+00

Pos logit contributions
resses+0.101
ashes+0.100
angers+0.099
nes+0.099
mary+0.097
Neg logit contributions
 course-0.149
 motorcycle-0.130
 forgiveness-0.130
 magnet-0.126
 narrative-0.124
Token looked
Token ID3114
Feature activation+5.66e-01

Pos logit contributions
mary+0.228
resses+0.225
 banks+0.223
heads+0.220
nes+0.216
Neg logit contributions
 course-0.183
 forgiveness-0.145
 narrative-0.141
 policy-0.138
 motorcycle-0.138
Token around
Token ID1088
Feature activation+1.84e+00

Pos logit contributions
resses+0.021
angers+0.020
ashes+0.020
uggets+0.020
heads+0.020
Neg logit contributions
 course-0.022
 forgiveness-0.017
 understands-0.017
 motorcycle-0.017
 thanks-0.017
Token but
Token ID475
Feature activation-4.02e-01

Pos logit contributions
resses+0.075
 banks+0.073
heads+0.073
ashes+0.073
mary+0.073
Neg logit contributions
 course-0.060
orn-0.047
 narrative-0.046
 motorcycle-0.045
 forgiveness-0.045
Token she
Token ID673
Feature activation+7.83e-01

Pos logit contributions
 course+0.016
 narrative+0.012
 forgiveness+0.012
 motorcycle+0.012
orn+0.012
Neg logit contributions
resses-0.014
mary-0.014
uring-0.014
 banks-0.014
angers-0.014
Token could
Token ID714
Feature activation-2.02e+00

Pos logit contributions
resses+0.030
mary+0.030
heads+0.029
 banks+0.029
allows+0.029
Neg logit contributions
 course-0.028
 forgiveness-0.022
 narrative-0.022
 motorcycle-0.021
 policy-0.020
Token friends
Token ID2460
Feature activation-7.62e-01

Pos logit contributions
resses+0.023
ashes+0.023
heads+0.023
angers+0.023
 banks+0.022
Neg logit contributions
 course-0.007
 forgiveness-0.003
 motorcycle-0.003
 narrative-0.003
 understands-0.003
Token again
Token ID757
Feature activation-4.09e+00

Pos logit contributions
 course+0.014
 narrative+0.009
 understands+0.009
 forgiveness+0.008
orn+0.008
Neg logit contributions
resses-0.042
ashes-0.041
heads-0.041
 banks-0.041
mary-0.041
Token.
Token ID13
Feature activation-5.95e-01

Pos logit contributions
 course+0.135
 forgiveness+0.101
 motorcycle+0.100
 narrative+0.099
 understands+0.098
Neg logit contributions
resses-0.172
ashes-0.168
heads-0.167
 banks-0.166
mary-0.165
Token<|endoftext|>
Token ID50256
Feature activation+1.80e+00

Pos logit contributions
 course+0.020
 forgiveness+0.015
 motorcycle+0.014
 narrative+0.014
 understands+0.014
Neg logit contributions
resses-0.025
ashes-0.024
heads-0.024
angers-0.024
 banks-0.024
TokenOnce
Token ID7454
Feature activation+8.94e-01

Pos logit contributions
resses+0.068
ashes+0.065
mary+0.065
 banks+0.065
angers+0.064
Neg logit contributions
 course-0.065
 forgiveness-0.053
 motorcycle-0.052
 narrative-0.051
 understands-0.049
Token upon
Token ID2402
Feature activation+4.58e+00

Pos logit contributions
resses+0.020
allows+0.020
angers+0.020
mary+0.020
uggets+0.020
Neg logit contributions
 course-0.046
 forgiveness-0.039
 motorcycle-0.038
 narrative-0.037
 magnet-0.036
Token a
Token ID257
Feature activation-2.23e-01

Pos logit contributions
angers+0.217
ashes+0.212
mary+0.211
resses+0.210
 banks+0.208
Neg logit contributions
 course-0.137
 forgiveness-0.100
 motorcycle-0.097
 forgive-0.096
 thanks-0.095
Token time
Token ID640
Feature activation+1.43e+00

Pos logit contributions
 course+0.008
 forgiveness+0.007
 motorcycle+0.007
 narrative+0.006
orn+0.006
Neg logit contributions
resses-0.008
ashes-0.008
heads-0.008
mary-0.008
allows-0.008
Token,
Token ID11
Feature activation+4.54e-02

Pos logit contributions
resses+0.063
allows+0.061
uggets+0.060
angers+0.060
mary+0.060
Neg logit contributions
 course-0.040
 motorcycle-0.032
 forgiveness-0.032
 narrative-0.031
father-0.030
Token there
Token ID612
Feature activation+1.18e+00

Pos logit contributions
resses+0.002
heads+0.002
ashes+0.002
mary+0.002
uggets+0.002
Neg logit contributions
 course-0.002
 forgiveness-0.001
 motorcycle-0.001
 narrative-0.001
 understands-0.001
Token was
Token ID373
Feature activation-1.06e+00

Pos logit contributions
 banks+0.040
allows+0.040
mary+0.039
nes+0.038
 thirsty+0.038
Neg logit contributions
 course-0.041
 narrative-0.038
 forgiveness-0.037
 motorcycle-0.037
 mercy-0.035
Token he
Token ID339
Feature activation-1.09e+00

Pos logit contributions
 course+0.022
 forgiveness+0.015
 motorcycle+0.015
 narrative+0.015
 understands+0.015
Neg logit contributions
resses-0.037
 banks-0.036
ashes-0.036
angers-0.036
mary-0.036
Token realized
Token ID6939
Feature activation-1.77e-01

Pos logit contributions
 course+0.035
 forgiveness+0.026
 motorcycle+0.025
 narrative+0.025
 understands+0.025
Neg logit contributions
resses-0.048
ashes-0.046
heads-0.046
angers-0.046
 banks-0.046
Token that
Token ID326
Feature activation-7.44e-01

Pos logit contributions
 course+0.005
 narrative+0.005
 understands+0.004
 motorcycle+0.004
orn+0.004
Neg logit contributions
resses-0.007
 banks-0.007
mary-0.007
angers-0.007
uring-0.007
Token he
Token ID339
Feature activation-1.13e+00

Pos logit contributions
 course+0.023
 understands+0.017
 narrative+0.017
orn+0.017
 motorcycle+0.017
Neg logit contributions
resses-0.032
 banks-0.031
uggets-0.031
mary-0.031
ashes-0.030
Token could
Token ID714
Feature activation-2.30e+00

Pos logit contributions
 course+0.032
 forgiveness+0.023
 narrative+0.023
 motorcycle+0.023
orn+0.021
Neg logit contributions
resses-0.051
mary-0.051
 banks-0.050
ashes-0.050
heads-0.049
Token always
Token ID1464
Feature activation+1.77e+00

Pos logit contributions
 course+0.079
 forgiveness+0.061
 forgive+0.060
 magnet+0.057
 motorcycle+0.057
Neg logit contributions
resses-0.098
angers-0.095
heads-0.094
uggets-0.093
ashes-0.092
Token come
Token ID1282
Feature activation-2.35e-01

Pos logit contributions
 banks+0.060
ashes+0.058
heads+0.057
uring+0.057
resses+0.056
Neg logit contributions
 course-0.068
orn-0.057
 narrative-0.057
father-0.056
 motorcycle-0.056
Token back
Token ID736
Feature activation-4.50e+00

Pos logit contributions
 course+0.005
 forgiveness+0.003
 motorcycle+0.003
 magnet+0.003
 forgive+0.003
Neg logit contributions
resses-0.013
angers-0.012
ashes-0.012
heads-0.012
 banks-0.012
Token to
Token ID284
Feature activation+2.04e-01

Pos logit contributions
 course+0.151
 forgiveness+0.114
 motorcycle+0.112
 narrative+0.111
 understands+0.110
Neg logit contributions
resses-0.187
ashes-0.182
heads-0.181
angers-0.180
 banks-0.179
Token the
Token ID262
Feature activation-1.02e+00

Pos logit contributions
resses+0.009
ashes+0.009
 banks+0.009
uggets+0.009
heads+0.009
Neg logit contributions
 course-0.006
 narrative-0.004
 understands-0.004
 motorcycle-0.004
orn-0.004
Token corn
Token ID11676
Feature activation+1.39e+00

Pos logit contributions
 course+0.041
 forgiveness+0.032
 motorcycle+0.032
 narrative+0.032
 magnet+0.031
Neg logit contributions
resses-0.037
angers-0.036
heads-0.035
uggets-0.035
ashes-0.035
TokenOkay
Token ID16454
Feature activation-4.79e+00

Pos logit contributions
 banks+0.022
angers+0.022
resses+0.021
allows+0.021
ashes+0.021
Neg logit contributions
 course-0.012
 forgiveness-0.008
 narrative-0.007
 understands-0.007
 motorcycle-0.007
Token,
Token ID11
Feature activation-4.83e-01

Pos logit contributions
 course+0.080
 forgiveness+0.041
 thanks+0.036
 motorcycle+0.035
 understands+0.034
Neg logit contributions
resses-0.283
ashes-0.279
angers-0.278
 banks-0.275
uggets-0.274
Token let
Token ID1309
Feature activation-4.02e+00

Pos logit contributions
 course+0.016
 forgiveness+0.012
 motorcycle+0.012
 thanks+0.012
 understands+0.012
Neg logit contributions
resses-0.020
ashes-0.020
angers-0.020
 banks-0.019
heads-0.019
Token's
Token ID338
Feature activation+1.39e+00

Pos logit contributions
 course+0.109
 understands+0.071
 thanks+0.071
 forgiveness+0.070
 return+0.065
Neg logit contributions
resses-0.207
angers-0.203
uggets-0.200
ashes-0.199
heads-0.192
Token bake
Token ID28450
Feature activation+1.96e+00

Pos logit contributions
resses+0.062
angers+0.061
ashes+0.061
heads+0.060
mary+0.059
Neg logit contributions
 course-0.044
 forgiveness-0.032
 magnet-0.032
 motorcycle-0.031
 forgive-0.030
Token some
Token ID617
Feature activation+3.41e+00

Pos logit contributions
ashes+0.086
angers+0.085
resses+0.085
heads+0.085
 banks+0.084
Neg logit contributions
 course-0.066
 again-0.047
 thanks-0.045
 forgiveness-0.044
 is-0.043
Token gem
Token ID16840
Feature activation-4.96e+00

Pos logit contributions
resses+0.124
heads+0.122
ashes+0.121
angers+0.120
nes+0.119
Neg logit contributions
 course-0.136
 forgiveness-0.106
 motorcycle-0.103
 magnet-0.101
 thanks-0.099
Token cookies
Token ID14746
Feature activation+9.89e-01

Pos logit contributions
 course+0.165
 magnet+0.120
orn+0.118
 motorcycle+0.117
 again+0.115
Neg logit contributions
heads-0.204
allows-0.201
ashes-0.201
 gag-0.201
resses-0.198
Token together
Token ID1978
Feature activation+8.01e-01

Pos logit contributions
resses+0.044
heads+0.044
ashes+0.044
 banks+0.044
angers+0.044
Neg logit contributions
 course-0.030
 thanks-0.020
 forgiveness-0.019
 motorcycle-0.019
 again-0.019
Token."
Token ID526
Feature activation+3.26e+00

Pos logit contributions
angers+0.032
resses+0.032
ashes+0.032
heads+0.032
 banks+0.031
Neg logit contributions
 course-0.028
 thanks-0.020
 forgiveness-0.020
 motorcycle-0.020
 magnet-0.019
Token
Token ID220
Feature activation+2.51e-01

Pos logit contributions
heads+0.145
resses+0.145
ashes+0.143
angers+0.142
uring+0.142
Neg logit contributions
 course-0.103
 thanks-0.072
 forgiveness-0.072
 understands-0.069
 magnet-0.067
Token their
Token ID511
Feature activation+2.41e+00

Pos logit contributions
resses+0.116
angers+0.113
heads+0.111
ashes+0.111
 banks+0.110
Neg logit contributions
 course-0.077
 forgiveness-0.057
 motorcycle-0.054
 understands-0.054
 narrative-0.053
Token hands
Token ID2832
Feature activation-1.25e+00

Pos logit contributions
resses+0.067
angers+0.065
heads+0.062
ashes+0.062
nes+0.061
Neg logit contributions
 course-0.115
 forgiveness-0.096
 motorcycle-0.095
 understands-0.093
 narrative-0.093
Token.
Token ID13
Feature activation+4.84e-01

Pos logit contributions
 course+0.032
 forgiveness+0.022
 motorcycle+0.021
 thanks+0.021
 magnet+0.020
Neg logit contributions
resses-0.062
angers-0.061
ashes-0.060
heads-0.059
uggets-0.059
Token Ben
Token ID3932
Feature activation+4.12e-01

Pos logit contributions
resses+0.022
heads+0.022
 banks+0.021
ashes+0.021
angers+0.021
Neg logit contributions
 course-0.015
 understands-0.011
 forgiveness-0.011
 thanks-0.010
 motorcycle-0.010
Token and
Token ID290
Feature activation+3.54e-01

Pos logit contributions
resses+0.018
mary+0.018
ashes+0.018
 banks+0.018
heads+0.017
Neg logit contributions
 course-0.012
 forgiveness-0.010
 narrative-0.010
 motorcycle-0.009
father-0.009
Token Lily
Token ID20037
Feature activation+1.78e+00

Pos logit contributions
resses+0.013
heads+0.012
ashes+0.012
angers+0.012
 banks+0.012
Neg logit contributions
 course-0.014
 forgiveness-0.011
 motorcycle-0.011
 understands-0.011
 thanks-0.011
Token pet
Token ID4273
Feature activation-8.51e-01

Pos logit contributions
ashes+0.071
mary+0.071
 banks+0.070
resses+0.070
heads+0.067
Neg logit contributions
 course-0.061
 forgiveness-0.049
 narrative-0.048
 policy-0.047
 motorcycle-0.047
Tokented
Token ID1513
Feature activation-2.89e-01

Pos logit contributions
 course+0.031
 narrative+0.029
 policy+0.029
father+0.029
 forgiveness+0.028
Neg logit contributions
ashes-0.030
mary-0.029
 banks-0.028
resses-0.027
 thirsty-0.026
Token the
Token ID262
Feature activation-4.36e-01

Pos logit contributions
 course+0.010
 forgiveness+0.007
 motorcycle+0.007
 narrative+0.007
 understands+0.007
Neg logit contributions
resses-0.012
ashes-0.012
mary-0.011
 banks-0.011
heads-0.011
Token goose
Token ID37246
Feature activation+1.79e+00

Pos logit contributions
 course+0.015
 understands+0.011
 motorcycle+0.011
 forgiveness+0.011
 thanks+0.010
Neg logit contributions
resses-0.019
angers-0.019
heads-0.018
uggets-0.018
nes-0.018
Token gently
Token ID15165
Feature activation-1.74e+00

Pos logit contributions
resses+0.088
angers+0.086
heads+0.084
nes+0.084
uggets+0.084
Neg logit contributions
 course-0.048
 forgiveness-0.032
 thanks-0.032
 motorcycle-0.032
 understands-0.031
Token furry
Token ID40857
Feature activation+3.30e-01

Pos logit contributions
resses+0.070
ashes+0.068
angers+0.068
heads+0.067
uggets+0.067
Neg logit contributions
 course-0.073
 forgiveness-0.058
 motorcycle-0.056
 understands-0.056
 magnet-0.055
Token animals
Token ID4695
Feature activation+1.32e+00

Pos logit contributions
nes+0.012
resses+0.012
uggets+0.011
heads+0.011
ashes+0.011
Neg logit contributions
 course-0.013
 understands-0.010
 forgiveness-0.010
 motorcycle-0.010
orn-0.010
Token,
Token ID11
Feature activation-5.54e-01

Pos logit contributions
resses+0.062
heads+0.062
angers+0.061
ashes+0.061
uggets+0.061
Neg logit contributions
 course-0.037
 forgiveness-0.025
 thanks-0.025
 motorcycle-0.025
 understands-0.024
Token who
Token ID508
Feature activation-8.10e-01

Pos logit contributions
 course+0.019
 thanks+0.015
 motorcycle+0.015
 forgiveness+0.014
 magnet+0.014
Neg logit contributions
nes-0.023
ashes-0.023
angers-0.023
resses-0.022
heads-0.022
Token all
Token ID477
Feature activation-1.18e+00

Pos logit contributions
 course+0.023
 forgiveness+0.017
 narrative+0.016
 motorcycle+0.016
orn+0.015
Neg logit contributions
resses-0.037
mary-0.037
 banks-0.036
ashes-0.036
angers-0.036
Token looked
Token ID3114
Feature activation+3.67e-01

Pos logit contributions
 course+0.036
 forgiveness+0.026
 understands+0.025
 motorcycle+0.025
 thanks+0.025
Neg logit contributions
resses-0.054
heads-0.053
ashes-0.052
angers-0.052
uggets-0.052
Token very
Token ID845
Feature activation+2.17e+00

Pos logit contributions
resses+0.014
angers+0.013
heads+0.013
nes+0.013
ashes+0.013
Neg logit contributions
 course-0.014
 thanks-0.011
 forgiveness-0.011
 motorcycle-0.011
 understands-0.011
Token happy
Token ID3772
Feature activation+1.20e+00

Pos logit contributions
resses+0.089
angers+0.088
heads+0.085
nes+0.085
uggets+0.083
Neg logit contributions
 course-0.070
 thanks-0.063
 understands-0.061
 thank-0.059
 forgiveness-0.058
Token to
Token ID284
Feature activation+3.01e-01

Pos logit contributions
resses+0.056
heads+0.055
ashes+0.054
angers+0.054
nes+0.054
Neg logit contributions
 course-0.034
 forgiveness-0.024
 motorcycle-0.023
 thanks-0.023
 understands-0.023
Token see
Token ID766
Feature activation+6.86e-01

Pos logit contributions
resses+0.011
ashes+0.010
heads+0.010
angers+0.010
mary+0.010
Neg logit contributions
 course-0.012
 forgiveness-0.010
 motorcycle-0.009
 narrative-0.009
 understands-0.009
Token her
Token ID607
Feature activation+4.09e-01

Pos logit contributions
resses+0.025
ashes+0.024
mary+0.024
heads+0.024
 banks+0.023
Neg logit contributions
 course-0.026
 forgiveness-0.021
 motorcycle-0.021
 narrative-0.021
 understands-0.020
Token with
Token ID351
Feature activation+5.18e-01

Pos logit contributions
angers+0.146
resses+0.140
ashes+0.140
mary+0.136
heads+0.136
Neg logit contributions
 course-0.111
 forgiveness-0.089
 motorcycle-0.087
 magnet-0.083
 forgive-0.083
Token a
Token ID257
Feature activation-2.86e-01

Pos logit contributions
resses+0.020
ashes+0.019
heads+0.019
angers+0.019
 banks+0.019
Neg logit contributions
 course-0.019
 forgiveness-0.015
 motorcycle-0.015
 narrative-0.015
 understands-0.014
Token ride
Token ID6594
Feature activation-3.78e+00

Pos logit contributions
 course+0.010
 forgiveness+0.008
 narrative+0.008
orn+0.008
 understands+0.008
Neg logit contributions
resses-0.011
heads-0.011
ashes-0.011
 banks-0.011
mary-0.011
Token to
Token ID284
Feature activation+3.14e-01

Pos logit contributions
 course+0.122
 forgiveness+0.090
 motorcycle+0.089
 narrative+0.086
orn+0.085
Neg logit contributions
resses-0.162
ashes-0.159
heads-0.157
angers-0.157
mary-0.156
Token the
Token ID262
Feature activation-9.50e-01

Pos logit contributions
resses+0.013
ashes+0.012
heads+0.012
angers+0.012
mary+0.012
Neg logit contributions
 course-0.011
 forgiveness-0.008
 motorcycle-0.008
 narrative-0.008
 understands-0.008
Token zoo
Token ID26626
Feature activation+2.16e+00

Pos logit contributions
 course+0.034
 motorcycle+0.026
 forgiveness+0.026
 narrative+0.025
 understands+0.025
Neg logit contributions
resses-0.038
ashes-0.037
angers-0.037
uggets-0.037
heads-0.036
Token.
Token ID13
Feature activation-1.25e-01

Pos logit contributions
resses+0.097
ashes+0.095
heads+0.095
angers+0.094
uggets+0.094
Neg logit contributions
 course-0.065
 motorcycle-0.047
 forgiveness-0.046
 narrative-0.044
 thanks-0.043
Token He
Token ID679
Feature activation-2.66e+00

Pos logit contributions
 course+0.004
 motorcycle+0.003
 thanks+0.003
 forgiveness+0.003
 magnet+0.003
Neg logit contributions
ashes-0.006
mary-0.006
heads-0.006
resses-0.006
 banks-0.005
Token is
Token ID318
Feature activation-4.54e-01

Pos logit contributions
 course+0.070
 forgiveness+0.048
 motorcycle+0.046
 narrative+0.045
 understands+0.045
Neg logit contributions
resses-0.130
ashes-0.127
heads-0.126
angers-0.126
uggets-0.125
Token very
Token ID845
Feature activation+2.62e+00

Pos logit contributions
 course+0.014
 thanks+0.012
 motorcycle+0.011
 thank+0.011
 understands+0.011
Neg logit contributions
ashes-0.020
resses-0.020
angers-0.019
 banks-0.019
heads-0.019
Token nice
Token ID3621
Feature activation-5.16e-01

Pos logit contributions
angers+0.116
resses+0.116
 banks+0.115
uggets+0.114
ashes+0.112
Neg logit contributions
 course-0.072
 thank-0.065
 thanks-0.065
 understands-0.063
 knows-0.060
Token they
Token ID484
Feature activation+1.84e+00

Pos logit contributions
 course+0.096
 forgiveness+0.071
 motorcycle+0.069
 understands+0.069
 magnet+0.067
Neg logit contributions
heads-0.160
resses-0.159
ashes-0.156
angers-0.155
mary-0.152
Token were
Token ID547
Feature activation-1.28e+00

Pos logit contributions
resses+0.073
 banks+0.072
ashes+0.071
mary+0.070
uggets+0.068
Neg logit contributions
 course-0.064
 forgiveness-0.050
 motorcycle-0.050
 narrative-0.049
 understands-0.048
Token made
Token ID925
Feature activation+1.48e+00

Pos logit contributions
 course+0.043
 forgiveness+0.033
 thanks+0.033
 understands+0.032
 magnet+0.032
Neg logit contributions
heads-0.053
resses-0.053
ashes-0.051
angers-0.051
nes-0.050
Token.
Token ID13
Feature activation-6.88e-01

Pos logit contributions
resses+0.063
ashes+0.061
heads+0.061
angers+0.061
 banks+0.060
Neg logit contributions
 course-0.048
 forgiveness-0.036
 motorcycle-0.035
 narrative-0.034
 understands-0.034
Token Tim
Token ID5045
Feature activation-1.49e+00

Pos logit contributions
 course+0.026
 forgiveness+0.020
 motorcycle+0.020
 understands+0.019
 narrative+0.019
Neg logit contributions
resses-0.026
heads-0.026
ashes-0.026
angers-0.025
mary-0.025
Tokenmy
Token ID1820
Feature activation-1.44e+00

Pos logit contributions
 course+0.045
 forgiveness+0.037
 policy+0.036
 narrative+0.036
 thank+0.035
Neg logit contributions
mary-0.062
 banks-0.060
heads-0.060
resses-0.058
allows-0.058
Token was
Token ID373
Feature activation-1.29e+00

Pos logit contributions
 course+0.044
 forgiveness+0.033
 narrative+0.032
 motorcycle+0.031
orn+0.031
Neg logit contributions
resses-0.063
 banks-0.062
mary-0.062
heads-0.061
ashes-0.061
Token so
Token ID523
Feature activation+3.05e+00

Pos logit contributions
 course+0.040
 thanks+0.036
 understands+0.032
 thank+0.031
 forgive+0.031
Neg logit contributions
ashes-0.058
resses-0.057
angers-0.057
heads-0.054
 banks-0.053
Token happy
Token ID3772
Feature activation+1.23e+00

Pos logit contributions
resses+0.114
angers+0.113
heads+0.111
 banks+0.110
ashes+0.108
Neg logit contributions
 course-0.113
 understands-0.096
 thanks-0.094
 thank-0.092
 forgiveness-0.092
Token to
Token ID284
Feature activation+3.91e-01

Pos logit contributions
resses+0.055
ashes+0.053
heads+0.053
angers+0.053
uggets+0.052
Neg logit contributions
 course-0.038
 forgiveness-0.028
 narrative-0.027
 motorcycle-0.027
 understands-0.026
Token have
Token ID423
Feature activation+6.00e-01

Pos logit contributions
 banks+0.013
resses+0.013
ashes+0.013
heads+0.013
uring+0.013
Neg logit contributions
 course-0.015
 motorcycle-0.012
 forgiveness-0.012
 understands-0.012
 narrative-0.012
Token They
Token ID1119
Feature activation+2.56e+00

Pos logit contributions
heads+0.020
allows+0.018
resses+0.018
 banks+0.018
nes+0.018
Neg logit contributions
 course-0.021
 forgiveness-0.017
 understands-0.017
 thanks-0.017
 magnet-0.016
Token hoped
Token ID10719
Feature activation-4.23e+00

Pos logit contributions
resses+0.103
 banks+0.100
heads+0.099
ashes+0.099
mary+0.098
Neg logit contributions
 course-0.089
 forgiveness-0.069
orn-0.067
 motorcycle-0.067
 narrative-0.066
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
 course+0.166
orn+0.136
 narrative+0.133
 understands+0.129
 motorcycle+0.127
Neg logit contributions
resses-0.150
angers-0.143
uring-0.142
 banks-0.141
uggets-0.141
Token ball
Token ID2613
Feature activation-3.49e+00

Pos logit contributions
 course+0.008
smart+0.008
father+0.008
igh+0.007
ously+0.007
Neg logit contributions
resses-0.008
 thirsty-0.007
 banks-0.007
ashes-0.007
heads-0.007
Token would
Token ID561
Feature activation-1.81e+00

Pos logit contributions
 course+0.092
 forgiveness+0.071
 motorcycle+0.066
father+0.065
 narrative+0.064
Neg logit contributions
 banks-0.163
mary-0.163
resses-0.162
ashes-0.161
nes-0.158
Token stop
Token ID2245
Feature activation-1.77e+00

Pos logit contributions
 course+0.075
 motorcycle+0.059
 understands+0.059
orn+0.059
 forgiveness+0.059
Neg logit contributions
resses-0.060
heads-0.059
 banks-0.058
ashes-0.058
mary-0.057
Token talking
Token ID3375
Feature activation+1.98e+00

Pos logit contributions
 course+0.058
 forgiveness+0.043
 motorcycle+0.042
 narrative+0.042
orn+0.042
Neg logit contributions
resses-0.075
ashes-0.073
heads-0.073
 banks-0.072
angers-0.072
Token and
Token ID290
Feature activation+5.58e-01

Pos logit contributions
resses+0.086
ashes+0.083
heads+0.083
mary+0.082
angers+0.082
Neg logit contributions
 course-0.063
 forgiveness-0.046
 narrative-0.046
orn-0.045
 understands-0.044
Token leave
Token ID2666
Feature activation-1.26e+00

Pos logit contributions
resses+0.020
heads+0.019
ashes+0.019
 banks+0.019
uggets+0.018
Neg logit contributions
 course-0.022
orn-0.018
 narrative-0.017
father-0.017
 understands-0.017
Token them
Token ID606
Feature activation+3.47e+00

Pos logit contributions
 course+0.048
orn+0.040
 narrative+0.039
 motorcycle+0.039
 understands+0.038
Neg logit contributions
 banks-0.044
resses-0.044
mary-0.043
ashes-0.043
uggets-0.042
Token alone
Token ID3436
Feature activation+6.69e-01

Pos logit contributions
resses+0.139
ashes+0.135
heads+0.134
angers+0.134
uggets+0.133
Neg logit contributions
 course-0.121
 forgiveness-0.093
 motorcycle-0.091
 narrative-0.090
 understands-0.089
Token calendar
Token ID11845
Feature activation-9.17e-01

Pos logit contributions
 course+0.077
ously+0.064
 forgiveness+0.063
smart+0.063
lad+0.061
Neg logit contributions
 thirsty-0.081
 banks-0.080
resses-0.078
mary-0.077
ashes-0.077
Token in
Token ID287
Feature activation-1.42e-01

Pos logit contributions
 course+0.033
 understands+0.028
 forgiveness+0.028
 narrative+0.027
ously+0.026
Neg logit contributions
resses-0.035
mary-0.032
angers-0.032
heads-0.031
ashes-0.031
Token her
Token ID607
Feature activation+7.01e-01

Pos logit contributions
 course+0.005
 forgiveness+0.004
 thanks+0.004
 motorcycle+0.003
 understands+0.003
Neg logit contributions
ashes-0.006
resses-0.006
angers-0.006
heads-0.006
 banks-0.006
Token room
Token ID2119
Feature activation+3.42e+00

Pos logit contributions
resses+0.024
ashes+0.023
heads+0.023
angers+0.023
 banks+0.023
Neg logit contributions
 course-0.029
 forgiveness-0.023
 motorcycle-0.023
 understands-0.022
 narrative-0.022
Token that
Token ID326
Feature activation-1.34e-01

Pos logit contributions
resses+0.148
ashes+0.144
heads+0.143
angers+0.142
 banks+0.142
Neg logit contributions
 course-0.109
 forgiveness-0.081
 motorcycle-0.079
 narrative-0.078
 understands-0.078
Token showed
Token ID3751
Feature activation-7.24e-01

Pos logit contributions
 course+0.005
 motorcycle+0.004
 forgiveness+0.004
 narrative+0.004
orn+0.004
Neg logit contributions
resses-0.005
mary-0.005
angers-0.005
ashes-0.005
heads-0.005
Token all
Token ID477
Feature activation-8.26e-01

Pos logit contributions
 course+0.023
 forgiveness+0.018
 understands+0.017
 thanks+0.017
 thank+0.016
Neg logit contributions
resses-0.031
ashes-0.031
 banks-0.031
angers-0.031
heads-0.031
Token the
Token ID262
Feature activation-3.94e-01

Pos logit contributions
 course+0.026
 forgiveness+0.018
 thanks+0.018
 understands+0.018
 motorcycle+0.018
Neg logit contributions
resses-0.038
ashes-0.037
heads-0.037
angers-0.036
 banks-0.036
Token days
Token ID1528
Feature activation-1.85e+00

Pos logit contributions
 course+0.013
 narrative+0.010
 understands+0.010
 forgiveness+0.010
 thanks+0.010
Neg logit contributions
resses-0.016
uggets-0.016
ashes-0.016
angers-0.016
heads-0.016
Token of
Token ID286
Feature activation+2.11e+00

Pos logit contributions
 course+0.064
 forgiveness+0.049
 motorcycle+0.048
 understands+0.048
 narrative+0.047
Neg logit contributions
resses-0.073
ashes-0.073
heads-0.072
 banks-0.071
uggets-0.071
Token the
Token ID262
Feature activation-3.88e-01

Pos logit contributions
ashes+0.084
resses+0.083
heads+0.082
 banks+0.081
angers+0.081
Neg logit contributions
 course-0.077
 forgiveness-0.060
 motorcycle-0.057
 narrative-0.056
 understands-0.056
Token with
Token ID351
Feature activation+5.21e-01

Pos logit contributions
ashes+0.066
resses+0.066
 banks+0.064
uring+0.062
angers+0.062
Neg logit contributions
 course-0.064
orn-0.053
father-0.051
 understands-0.051
 narrative-0.051
Token his
Token ID465
Feature activation-9.63e-01

Pos logit contributions
resses+0.020
ashes+0.019
 banks+0.019
uggets+0.019
allows+0.019
Neg logit contributions
 course-0.018
orn-0.015
 narrative-0.015
 understands-0.014
 motorcycle-0.014
Token toy
Token ID13373
Feature activation-3.92e+00

Pos logit contributions
 course+0.036
orn+0.029
 narrative+0.028
 forgiveness+0.027
 understands+0.027
Neg logit contributions
resses-0.036
ashes-0.035
 banks-0.035
allows-0.034
heads-0.034
Token truck
Token ID7779
Feature activation-1.91e+00

Pos logit contributions
 course+0.131
 understands+0.104
 narrative+0.103
orn+0.102
 forgiveness+0.100
Neg logit contributions
resses-0.162
 banks-0.158
ashes-0.158
angers-0.154
mary-0.148
Token.
Token ID13
Feature activation-3.51e-01

Pos logit contributions
 course+0.068
 narrative+0.054
 understands+0.054
orn+0.054
 forgiveness+0.053
Neg logit contributions
resses-0.075
ashes-0.073
 banks-0.072
heads-0.071
angers-0.070
Token He
Token ID679
Feature activation-2.60e+00

Pos logit contributions
 course+0.014
 forgiveness+0.011
 narrative+0.011
orn+0.010
 motorcycle+0.010
Neg logit contributions
resses-0.013
ashes-0.012
angers-0.012
uggets-0.012
 banks-0.012
Token loaded
Token ID9639
Feature activation+3.07e+00

Pos logit contributions
 course+0.087
orn+0.070
 forgiveness+0.069
 narrative+0.068
 policy+0.067
Neg logit contributions
mary-0.103
resses-0.102
 banks-0.101
ashes-0.100
nes-0.098
Token up
Token ID510
Feature activation+4.77e-01

Pos logit contributions
resses+0.127
ashes+0.123
heads+0.122
 banks+0.121
angers+0.121
Neg logit contributions
 course-0.104
 forgiveness-0.079
 narrative-0.077
 motorcycle-0.077
 understands-0.076
Token rocks
Token ID12586
Feature activation-1.19e+00

Pos logit contributions
resses+0.021
ashes+0.020
 banks+0.020
uggets+0.020
angers+0.020
Neg logit contributions
 course-0.014
 narrative-0.011
 understands-0.011
orn-0.011
 forgiveness-0.011
Token and
Token ID290
Feature activation-1.67e-01

Pos logit contributions
 course+0.040
 forgiveness+0.033
 narrative+0.032
 understands+0.032
orn+0.030
Neg logit contributions
resses-0.049
ashes-0.048
 banks-0.047
angers-0.046
heads-0.046
Token happily
Token ID18177
Feature activation-1.78e+00

Pos logit contributions
 course+0.006
 forgiveness+0.005
 narrative+0.005
orn+0.005
 understands+0.005
Neg logit contributions
resses-0.006
 banks-0.006
ashes-0.006
heads-0.006
mary-0.006
Token silly
Token ID14397
Feature activation+8.51e-01

Pos logit contributions
angers+0.036
mary+0.036
resses+0.036
heads+0.036
nes+0.035
Neg logit contributions
 course-0.024
 thanks-0.018
 forgiveness-0.017
 motorcycle-0.016
 his-0.016
Token jokes
Token ID14532
Feature activation-1.14e+00

Pos logit contributions
resses+0.034
nes+0.034
mary+0.034
angers+0.033
uggets+0.032
Neg logit contributions
 course-0.030
 thanks-0.025
 again-0.024
 forgiveness-0.024
 motorcycle-0.024
Token.
Token ID13
Feature activation-4.05e-01

Pos logit contributions
 course+0.028
 motorcycle+0.021
 forgiveness+0.019
 thanks+0.019
 magnet+0.019
Neg logit contributions
resses-0.056
heads-0.055
 banks-0.055
angers-0.054
mary-0.054
Token One
Token ID1881
Feature activation+1.86e+00

Pos logit contributions
 course+0.016
 forgiveness+0.012
 motorcycle+0.012
 narrative+0.012
 understands+0.012
Neg logit contributions
resses-0.015
ashes-0.014
heads-0.014
angers-0.014
 banks-0.014
Token day
Token ID1110
Feature activation+1.74e-01

Pos logit contributions
resses+0.074
ashes+0.073
heads+0.072
mary+0.072
 banks+0.071
Neg logit contributions
 course-0.065
 forgiveness-0.050
 narrative-0.049
 understands-0.049
 motorcycle-0.048
Token,
Token ID11
Feature activation-3.82e-01

Pos logit contributions
resses+0.007
allows+0.007
ashes+0.007
 banks+0.007
 thirsty+0.007
Neg logit contributions
 course-0.006
 narrative-0.005
 forgiveness-0.004
father-0.004
 understands-0.004
Token the
Token ID262
Feature activation-4.21e-01

Pos logit contributions
 course+0.010
 forgiveness+0.007
 motorcycle+0.007
 narrative+0.007
 understands+0.007
Neg logit contributions
resses-0.019
ashes-0.018
heads-0.018
angers-0.018
 banks-0.018
Token fox
Token ID21831
Feature activation+9.72e-01

Pos logit contributions
 course+0.011
orn+0.008
 hero+0.008
 motorcycle+0.008
 beast+0.007
Neg logit contributions
uggets-0.021
resses-0.021
ashes-0.021
nes-0.021
angers-0.021
Token was
Token ID373
Feature activation-1.25e+00

Pos logit contributions
resses+0.044
heads+0.042
ashes+0.042
angers+0.042
uggets+0.042
Neg logit contributions
 course-0.029
 forgiveness-0.021
 motorcycle-0.021
 understands-0.021
 narrative-0.020
Token out
Token ID503
Feature activation+5.73e-01

Pos logit contributions
 course+0.033
 thanks+0.027
 motorcycle+0.025
 again+0.024
 forgiveness+0.024
Neg logit contributions
resses-0.059
 banks-0.057
heads-0.056
nes-0.055
ashes-0.055
Token walking
Token ID6155
Feature activation+1.52e-01

Pos logit contributions
resses+0.020
ashes+0.020
angers+0.020
allows+0.020
uggets+0.020
Neg logit contributions
 course-0.022
 narrative-0.018
 forgiveness-0.017
 understands-0.017
 motorcycle-0.017
Token!"
Token ID2474
Feature activation+4.07e+00

Pos logit contributions
 course+0.000
 again+0.000
 understands+0.000
 thanks+0.000
 motorcycle+0.000
Neg logit contributions
angers-0.000
uggets-0.000
resses-0.000
mary-0.000
ashes-0.000
Token Lily
Token ID20037
Feature activation+1.77e+00

Pos logit contributions
resses+0.177
 banks+0.172
ashes+0.171
ender+0.170
mary+0.170
Neg logit contributions
 course-0.128
 thank-0.106
 forgiveness-0.105
 thanks-0.101
 motorcycle-0.101
Token said
Token ID531
Feature activation+9.40e-01

Pos logit contributions
angers+0.086
uggets+0.084
ashes+0.083
resses+0.082
 banks+0.079
Neg logit contributions
 course-0.050
 understands-0.037
 motorcycle-0.035
 thanks-0.034
 forgiveness-0.034
Token.
Token ID13
Feature activation+1.41e-01

Pos logit contributions
resses+0.039
heads+0.038
ashes+0.038
angers+0.037
uggets+0.037
Neg logit contributions
 course-0.033
 narrative-0.025
orn-0.024
 forgiveness-0.024
 motorcycle-0.024
Token "
Token ID366
Feature activation+1.15e+00

Pos logit contributions
resses+0.006
ashes+0.005
angers+0.005
heads+0.005
mary+0.005
Neg logit contributions
 course-0.005
 forgiveness-0.004
 motorcycle-0.004
 narrative-0.004
orn-0.004
TokenMaybe
Token ID13300
Feature activation-5.65e+00

Pos logit contributions
angers+0.054
resses+0.053
 banks+0.053
mary+0.052
uggets+0.052
Neg logit contributions
 course-0.032
 motorcycle-0.023
 understands-0.022
 forgiveness-0.022
 narrative-0.022
Token it
Token ID340
Feature activation+8.76e-01

Pos logit contributions
 course+0.208
 narrative+0.165
 forgiveness+0.161
orn+0.160
 understands+0.160
Neg logit contributions
resses-0.212
 banks-0.203
heads-0.202
ashes-0.201
uggets-0.200
Token will
Token ID481
Feature activation-1.59e+00

Pos logit contributions
resses+0.049
uggets+0.048
angers+0.048
heads+0.047
ashes+0.047
Neg logit contributions
 course-0.018
 understands-0.011
 motorcycle-0.010
 thanks-0.010
 forgiveness-0.010
Token talk
Token ID1561
Feature activation+5.36e-01

Pos logit contributions
 course+0.049
 thank+0.037
 forgive+0.036
 forgiveness+0.036
 thanks+0.035
Neg logit contributions
angers-0.074
resses-0.074
uggets-0.071
ashes-0.068
heads-0.068
Token to
Token ID284
Feature activation+7.58e-01

Pos logit contributions
angers+0.023
resses+0.023
uggets+0.023
 banks+0.022
ashes+0.022
Neg logit contributions
 course-0.017
 motorcycle-0.013
 thanks-0.013
 understands-0.012
 forgiveness-0.012
Token us
Token ID514
Feature activation+3.56e+00

Pos logit contributions
resses+0.032
angers+0.032
ashes+0.031
heads+0.031
mary+0.030
Neg logit contributions
 course-0.027
 forgiveness-0.021
 motorcycle-0.020
 thanks-0.019
 thank-0.019
Token fruits
Token ID15921
Feature activation-1.42e+00

Pos logit contributions
resses+0.030
 banks+0.029
ashes+0.028
allows+0.028
uggets+0.027
Neg logit contributions
 course-0.034
 narrative-0.029
 forgiveness-0.028
orn-0.028
father-0.027
Token inside
Token ID2641
Feature activation-1.05e-01

Pos logit contributions
 course+0.050
 forgiveness+0.039
 narrative+0.039
father+0.037
 understands+0.037
Neg logit contributions
resses-0.057
ashes-0.056
 banks-0.055
mary-0.055
angers-0.054
Token.
Token ID13
Feature activation+4.12e-01

Pos logit contributions
 course+0.004
 forgiveness+0.003
 narrative+0.003
 motorcycle+0.003
orn+0.003
Neg logit contributions
resses-0.004
 banks-0.004
ashes-0.004
mary-0.004
uggets-0.004
Token They
Token ID1119
Feature activation+2.44e+00

Pos logit contributions
resses+0.018
heads+0.018
ashes+0.018
uggets+0.018
 banks+0.018
Neg logit contributions
 course-0.013
 motorcycle-0.009
 forgiveness-0.009
 understands-0.009
 thanks-0.009
Token say
Token ID910
Feature activation-3.55e-01

Pos logit contributions
resses+0.095
 banks+0.095
ashes+0.094
mary+0.091
angers+0.087
Neg logit contributions
 course-0.084
 forgiveness-0.068
orn-0.067
 policy-0.065
 motorcycle-0.064
Token thank
Token ID5875
Feature activation-5.76e+00

Pos logit contributions
 course+0.010
orn+0.008
father+0.008
 narrative+0.007
 motorcycle+0.007
Neg logit contributions
resses-0.016
allows-0.015
mary-0.015
ashes-0.015
uggets-0.015
Token you
Token ID345
Feature activation+4.41e-01

Pos logit contributions
 course+0.207
 forgiveness+0.155
 motorcycle+0.145
 narrative+0.142
 thanks+0.140
Neg logit contributions
resses-0.238
angers-0.235
ashes-0.234
 banks-0.232
mary-0.232
Token and
Token ID290
Feature activation+5.17e-01

Pos logit contributions
resses+0.020
ashes+0.020
angers+0.020
heads+0.019
uggets+0.019
Neg logit contributions
 course-0.013
 forgiveness-0.009
 motorcycle-0.009
 understands-0.009
 thanks-0.009
Token goodbye
Token ID24829
Feature activation-2.61e+00

Pos logit contributions
ashes+0.022
resses+0.022
 banks+0.021
uring+0.021
mary+0.020
Neg logit contributions
 course-0.015
orn-0.015
father-0.013
 policy-0.012
 motorcycle-0.012
Token to
Token ID284
Feature activation+6.39e-01

Pos logit contributions
 course+0.052
 again+0.039
 motorcycle+0.039
 forgiveness+0.038
 thanks+0.037
Neg logit contributions
angers-0.134
mary-0.130
resses-0.130
allows-0.130
uggets-0.129
Token the
Token ID262
Feature activation-3.37e-01

Pos logit contributions
resses+0.025
ashes+0.025
 banks+0.024
uggets+0.024
mary+0.024
Neg logit contributions
 course-0.022
orn-0.018
 narrative-0.018
 forgiveness-0.017
 understands-0.017
Token over
Token ID625
Feature activation-1.29e+00

Pos logit contributions
resses+0.092
ashes+0.091
angers+0.089
 banks+0.089
mary+0.087
Neg logit contributions
 course-0.062
 forgiveness-0.048
 motorcycle-0.046
 thanks-0.045
 understands-0.045
Token to
Token ID284
Feature activation+2.11e-01

Pos logit contributions
 course+0.049
orn+0.040
 motorcycle+0.040
 narrative+0.040
 magnet+0.038
Neg logit contributions
resses-0.047
heads-0.045
ashes-0.045
uggets-0.045
mary-0.045
Token give
Token ID1577
Feature activation-1.82e+00

Pos logit contributions
heads+0.007
ashes+0.007
uring+0.007
mary+0.007
resses+0.007
Neg logit contributions
 course-0.008
 narrative-0.007
 motorcycle-0.007
orn-0.007
 magnet-0.007
Token her
Token ID607
Feature activation+3.34e-01

Pos logit contributions
 course+0.055
 forgiveness+0.042
 thanks+0.040
 again+0.040
 understands+0.039
Neg logit contributions
resses-0.087
heads-0.083
 banks-0.081
angers-0.081
ashes-0.080
Token a
Token ID257
Feature activation-4.50e-01

Pos logit contributions
resses+0.012
ashes+0.012
heads+0.012
angers+0.012
 banks+0.012
Neg logit contributions
 course-0.013
 forgiveness-0.010
 motorcycle-0.010
 understands-0.010
 narrative-0.010
Token hug
Token ID16225
Feature activation-4.58e+00

Pos logit contributions
 course+0.017
 narrative+0.014
father+0.014
orn+0.013
pa+0.012
Neg logit contributions
ashes-0.016
resses-0.016
heads-0.016
mary-0.016
 thirsty-0.016
Token.
Token ID13
Feature activation-8.79e-01

Pos logit contributions
 course+0.156
 forgiveness+0.118
 narrative+0.117
 motorcycle+0.116
 understands+0.116
Neg logit contributions
resses-0.186
ashes-0.181
mary-0.180
heads-0.179
angers-0.178
Token She
Token ID1375
Feature activation-5.11e-01

Pos logit contributions
 course+0.025
 understands+0.022
 thanks+0.021
ously+0.020
 thank+0.020
Neg logit contributions
heads-0.039
nes-0.038
resses-0.038
uggets-0.037
ashes-0.036
Token told
Token ID1297
Feature activation+7.87e-01

Pos logit contributions
 course+0.020
 narrative+0.017
 policy+0.017
orn+0.016
 forgiveness+0.016
Neg logit contributions
mary-0.019
heads-0.017
resses-0.016
 thirsty-0.016
nes-0.016
Token M
Token ID337
Feature activation+1.92e+00

Pos logit contributions
resses+0.037
ashes+0.036
heads+0.036
 banks+0.035
nes+0.035
Neg logit contributions
 course-0.025
 forgiveness-0.019
 thanks-0.017
 thank-0.017
father-0.016
Tokenandy
Token ID10757
Feature activation+3.33e+00

Pos logit contributions
resses+0.068
heads+0.065
ashes+0.065
 banks+0.065
angers+0.064
Neg logit contributions
 course-0.077
 forgiveness-0.061
orn-0.059
 narrative-0.059
 understands-0.059
Token his
Token ID465
Feature activation-1.22e+00

Pos logit contributions
resses+0.029
ashes+0.028
 banks+0.028
heads+0.028
uggets+0.028
Neg logit contributions
 course-0.016
 narrative-0.011
 forgiveness-0.011
 understands-0.011
orn-0.011
Token grip
Token ID11762
Feature activation-3.65e+00

Pos logit contributions
 course+0.046
 narrative+0.036
 understands+0.035
orn+0.035
father+0.035
Neg logit contributions
resses-0.045
 banks-0.044
ashes-0.043
heads-0.042
mary-0.041
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
 course+0.125
 forgiveness+0.100
 narrative+0.100
 understands+0.096
father+0.095
Neg logit contributions
resses-0.146
ashes-0.140
 banks-0.139
heads-0.138
mary-0.138
Token Bobby
Token ID17155
Feature activation+7.96e-01

Pos logit contributions
 course+0.031
 magnet+0.022
 motorcycle+0.022
 forgiveness+0.022
 thanks+0.020
Neg logit contributions
ashes-0.054
heads-0.053
mary-0.052
resses-0.051
uggets-0.051
Token plummeted
Token ID44282
Feature activation+5.19e-01

Pos logit contributions
resses+0.038
ashes+0.038
uggets+0.038
angers+0.037
heads+0.037
Neg logit contributions
 course-0.022
 motorcycle-0.015
 understands-0.015
 forgiveness-0.015
orn-0.014
Token back
Token ID736
Feature activation-4.32e+00

Pos logit contributions
resses+0.019
heads+0.019
ashes+0.019
 banks+0.018
angers+0.018
Neg logit contributions
 course-0.020
 narrative-0.016
 forgiveness-0.016
 motorcycle-0.015
orn-0.015
Token down
Token ID866
Feature activation-1.95e+00

Pos logit contributions
 course+0.179
 forgiveness+0.145
 narrative+0.144
 motorcycle+0.140
orn+0.138
Neg logit contributions
resses-0.147
ashes-0.143
 banks-0.140
heads-0.140
angers-0.139
Token to
Token ID284
Feature activation+3.03e-01

Pos logit contributions
 course+0.069
 narrative+0.057
 motorcycle+0.056
 forgiveness+0.056
 understands+0.055
Neg logit contributions
resses-0.074
ashes-0.072
 banks-0.072
uring-0.071
mary-0.071
Token the
Token ID262
Feature activation-1.24e+00

Pos logit contributions
 banks+0.012
resses+0.012
uring+0.012
ashes+0.012
heads+0.011
Neg logit contributions
 course-0.010
 narrative-0.009
 understands-0.008
orn-0.008
 motorcycle-0.008
Token bottom
Token ID4220
Feature activation-1.63e+00

Pos logit contributions
 course+0.049
 forgiveness+0.039
 motorcycle+0.038
 narrative+0.038
 understands+0.038
Neg logit contributions
resses-0.044
ashes-0.043
heads-0.042
angers-0.042
 banks-0.042
Token of
Token ID286
Feature activation+1.86e+00

Pos logit contributions
 course+0.045
 forgiveness+0.034
 narrative+0.033
 understands+0.032
 motorcycle+0.032
Neg logit contributions
resses-0.076
ashes-0.074
heads-0.074
 banks-0.073
angers-0.073
Token decoration
Token ID42050
Feature activation+2.17e-01

Pos logit contributions
 course+0.020
 motorcycle+0.015
 forgiveness+0.015
 magnet+0.015
 thanks+0.014
Neg logit contributions
resses-0.023
ashes-0.023
angers-0.023
uggets-0.023
heads-0.022
Token!
Token ID0
Feature activation-1.28e+00

Pos logit contributions
resses+0.010
ashes+0.010
heads+0.010
uggets+0.010
mary+0.010
Neg logit contributions
 course-0.006
 forgiveness-0.005
 motorcycle-0.004
 magnet-0.004
 thanks-0.004
Token Mum
Token ID38367
Feature activation-5.14e-01

Pos logit contributions
 course+0.053
 narrative+0.042
 forgiveness+0.042
orn+0.041
 motorcycle+0.041
Neg logit contributions
resses-0.042
uggets-0.041
ashes-0.040
 banks-0.040
angers-0.040
Token smiled
Token ID13541
Feature activation-1.58e+00

Pos logit contributions
 course+0.015
 narrative+0.012
 forgiveness+0.011
 policy+0.011
 motorcycle+0.010
Neg logit contributions
resses-0.023
mary-0.023
heads-0.022
allows-0.022
nes-0.022
Token with
Token ID351
Feature activation+5.43e-01

Pos logit contributions
 course+0.056
 narrative+0.043
orn+0.043
 forgiveness+0.041
father+0.040
Neg logit contributions
resses-0.062
mary-0.062
uring-0.061
ashes-0.060
heads-0.060
Token pride
Token ID11293
Feature activation-5.89e+00

Pos logit contributions
resses+0.016
 banks+0.015
allows+0.015
uring+0.015
 thirsty+0.015
Neg logit contributions
 policy-0.018
orn-0.018
 motorcycle-0.018
 course-0.017
father-0.017
Token and
Token ID290
Feature activation-6.86e-01

Pos logit contributions
 course+0.203
 narrative+0.169
orn+0.167
 forgiveness+0.154
 magnet+0.152
Neg logit contributions
resses-0.233
mary-0.223
angers-0.223
allows-0.222
uring-0.220
Token the
Token ID262
Feature activation-8.04e-01

Pos logit contributions
 course+0.027
orn+0.025
 narrative+0.025
 motorcycle+0.024
 policy+0.023
Neg logit contributions
mary-0.020
 thirsty-0.020
uring-0.020
resses-0.019
nes-0.018
Token two
Token ID734
Feature activation-7.92e-01

Pos logit contributions
 course+0.030
 forgiveness+0.023
orn+0.023
 narrative+0.023
 motorcycle+0.022
Neg logit contributions
resses-0.031
ashes-0.029
 banks-0.029
heads-0.029
mary-0.029
Token of
Token ID286
Feature activation+1.98e+00

Pos logit contributions
 course+0.026
orn+0.020
 forgiveness+0.020
 narrative+0.020
 motorcycle+0.019
Neg logit contributions
resses-0.033
allows-0.032
ashes-0.032
 banks-0.032
angers-0.031
Token them
Token ID606
Feature activation+3.02e+00

Pos logit contributions
uggets+0.040
allows+0.039
resses+0.038
mary+0.037
ashes+0.037
Neg logit contributions
 course-0.097
 motorcycle-0.092
 understands-0.090
 bicycle-0.090
 magnet-0.089