TOP ACTIVATIONS
MAX = 0.142

I can make a p uddle too !" so he picked
dirt next to the p uddle . Next ,
be seen in the p uddle , telling jokes and making
fear and conquered the p uddle ! <|endoftext|> Once , a
was happy and the p uddle was sunny again . And
got out of the p uddle , his fur was soaked
for falling into that p uddle ! ” Jack was
both jumped in the p uddle . Max was no longer
you can cross the p uddle safely ." Joe
ashed around in the p uddle , making even bigger waves
never forgot the ancient p uddle . <|endoftext|> Once upon a
the rabbit across the p uddle and she made it to
to step in a p uddle and get her feet wet
get across the wet p uddle without getting wet .
to jump in the p uddle because he would get all
and laughed as the p uddle spl ashed around her .
saw a big dirty p uddle and mom warned Sally to
landed in a big p uddle . Lily and Tim laughed
better and loves to c uddle and get lots of pets
ashed around in the p uddle for a bit and was

MIN ACTIVATIONS
MIN = -0.178

make it whole again . Amen ." Then ,
Thank you , God . Amen ." He opened
us safe and happy . Amen ." " A
kind . Thank you . Amen ." Then she
house and our family . Amen ." Mom smiled
them and never threw a tant rum again . <|endoftext|> Once
family and our friends . Amen ." Anna said
his feet and threw a tant rum . "
clown s juggling and ac rob ats doing flips . But
saw clown s and ac rob ats and lions and elephants
time playing with her or nam ents and making her special
stubborn . She throws a tant rum . She kicks and
and play with your or nam ents . Maybe you can
thought they looked like ac rob ats . She enjoyed the
elephants , lions and ac rob ats . They are amazed
listen . She threw a tant rum in the store .
She cried and threw a tant rum . Her mom tried
her feet and threw a tant rum . She grabbed the
kept crying and throwing a tant rum . Finally
the animals , the ac rob ats and the clown s

INTERVAL 0.062 - 0.142
CONTAINS 0.841%

you , Ben . You both learned something today . Sometimes
a plan in case something like this happened . She came
warm and comfortable , just perfect for an old pigeon like
rocked even harder , and water started flooding the boat .
Max said . " I 'll make the tail , you

INTERVAL -0.018 - 0.062
CONTAINS 84.789%

made a zip with a fork on the top . She
, Tom and Mia see a big piece of cheese on
She gave her a big hug and said , " You
the plane . I push and pull the levers and turn
she did not see the sack on the other side of

INTERVAL -0.098 - -0.018
CONTAINS 14.345%

leaves of the trees . Suddenly , the mouse noticed a
came to visit them . He brought a big box with
But Ben wanted to be kind and support Max . He
Max saw the veterinarian he felt a bit scared . But
shiny and had a star on it . She reached for

INTERVAL -0.178 - -0.098
CONTAINS 0.025%

, falling is not fun ," their parent says . "
was a boy who was 3 years old . He had
l le . She was 3 years old and loved to
the kitchen . Em ma 's mom put some
George was a 3 - year - old boy who was

TOKEN DETAILS (TOP ACTIVATIONS)

TokenI
Token ID40
Feature activation+3.78e-02

Pos logit contributions
ll+0.033
!+0.028
uddle+0.027
 upon+0.027
castle+0.026
Neg logit contributions
 Some-0.025
 pain-0.024
 Many-0.023
 Amen-0.022
 novels-0.021
Token can
Token ID460
Feature activation+2.08e-02

Pos logit contributions
ll+0.075
uddle+0.066
castle+0.064
!+0.061
cast+0.060
Neg logit contributions
 Some-0.063
 pain-0.059
 Many-0.057
 Amen-0.054
oscope-0.051
Token make
Token ID787
Feature activation-7.01e-03

Pos logit contributions
ll+0.045
uddle+0.039
 upon+0.037
ounce+0.036
!+0.035
Neg logit contributions
 Some-0.032
 Many-0.030
 pain-0.029
 Amen-0.028
oscope-0.026
Token a
Token ID257
Feature activation+1.89e-02

Pos logit contributions
 Some+0.011
 pain+0.011
 Many+0.011
 Amen+0.009
 aids+0.009
Neg logit contributions
ll-0.015
uddle-0.012
 upon-0.012
castle-0.012
ounce-0.012
Token p
Token ID279
Feature activation+4.00e-02

Pos logit contributions
ll+0.042
 upon+0.032
!+0.031
ounce+0.030
uddle+0.030
Neg logit contributions
 pain-0.029
 Some-0.029
 Many-0.027
 Amen-0.025
 x-0.024
Tokenuddle
Token ID24500
Feature activation+1.20e-01

Pos logit contributions
ll+0.037
!+0.023
 upon+0.023
castle+0.018
 ashore+0.017
Neg logit contributions
 Some-0.104
 Many-0.099
 pain-0.095
 Amen-0.090
 aids-0.089
Token too
Token ID1165
Feature activation+1.66e-02

Pos logit contributions
ll+0.202
uddle+0.154
 upon+0.145
ounce+0.140
ounced+0.136
Neg logit contributions
 Some-0.234
 Many-0.221
 pain-0.221
 Amen-0.206
oscope-0.203
Token!"
Token ID2474
Feature activation+4.98e-03

Pos logit contributions
ll+0.024
uddle+0.017
ounce+0.016
castle+0.016
pit+0.016
Neg logit contributions
 pain-0.036
 Some-0.036
 Many-0.034
 curly-0.032
 blonde-0.031
Token so
Token ID523
Feature activation+2.01e-02

Pos logit contributions
ll+0.011
uddle+0.010
!+0.009
castle+0.009
 upon+0.009
Neg logit contributions
 Some-0.008
 Many-0.007
 pain-0.007
 Amen-0.006
oscope-0.006
Token he
Token ID339
Feature activation+1.23e-02

Pos logit contributions
ll+0.035
uddle+0.029
!+0.028
 upon+0.027
castle+0.026
Neg logit contributions
 Some-0.040
 Many-0.037
 pain-0.037
 Amen-0.035
oscope-0.033
Token picked
Token ID6497
Feature activation+3.89e-02

Pos logit contributions
ll+0.024
uddle+0.020
 upon+0.019
castle+0.019
ounce+0.019
Neg logit contributions
 Some-0.020
 Many-0.019
 pain-0.018
ez-0.017
oscope-0.017
Token dirt
Token ID13647
Feature activation+3.32e-02

Pos logit contributions
ll+0.036
ounce+0.027
 upon+0.026
!+0.026
cast+0.026
Neg logit contributions
 Some-0.032
 pain-0.032
 Many-0.029
oscope-0.027
 blonde-0.026
Token next
Token ID1306
Feature activation-3.51e-02

Pos logit contributions
ll+0.064
uddle+0.051
ounce+0.051
 upon+0.050
lly+0.046
Neg logit contributions
 Some-0.055
 pain-0.054
oscope-0.048
 Many-0.047
 emotion-0.043
Token to
Token ID284
Feature activation+4.67e-03

Pos logit contributions
 Some+0.069
 pain+0.067
 Many+0.062
 Amen+0.059
 novels+0.059
Neg logit contributions
ll-0.063
uddle-0.047
ounce-0.045
cast-0.043
 upon-0.043
Token the
Token ID262
Feature activation+1.76e-02

Pos logit contributions
ll+0.009
uddle+0.008
!+0.007
 upon+0.007
ounce+0.007
Neg logit contributions
 Some-0.008
 Many-0.008
 pain-0.008
 Amen-0.007
oscope-0.007
Token p
Token ID279
Feature activation+4.06e-02

Pos logit contributions
ll+0.036
ounce+0.027
uddle+0.026
!+0.026
 upon+0.026
Neg logit contributions
 pain-0.031
 Some-0.030
 Many-0.028
 x-0.025
 Amen-0.025
Tokenuddle
Token ID24500
Feature activation+1.19e-01

Pos logit contributions
ll+0.037
!+0.028
 ashore+0.027
elcome+0.025
 upon+0.023
Neg logit contributions
 Some-0.097
 Many-0.084
pler-0.077
 curly-0.075
 pain-0.074
Token.
Token ID13
Feature activation+7.05e-03

Pos logit contributions
ll+0.231
uddle+0.182
 upon+0.168
ounce+0.168
 destination+0.157
Neg logit contributions
 Some-0.213
 pain-0.193
 Many-0.187
oscope-0.176
 aids-0.175
Token
Token ID198
Feature activation+9.25e-03

Pos logit contributions
ll+0.017
uddle+0.014
!+0.014
castle+0.014
 upon+0.013
Neg logit contributions
 Some-0.010
 Many-0.009
 pain-0.009
 Amen-0.008
 blond-0.007
Token
Token ID198
Feature activation+8.54e-03

Pos logit contributions
ll+0.019
uddle+0.016
!+0.016
castle+0.015
 upon+0.015
Neg logit contributions
 Some-0.016
 Many-0.015
 pain-0.014
 Amen-0.013
oscope-0.012
TokenNext
Token ID10019
Feature activation-3.44e-02

Pos logit contributions
ll+0.018
uddle+0.016
!+0.016
castle+0.015
 upon+0.015
Neg logit contributions
 Some-0.013
 Many-0.013
 pain-0.011
 Amen-0.011
 aids-0.010
Token,
Token ID11
Feature activation+2.96e-03

Pos logit contributions
 Some+0.074
 Many+0.069
 pain+0.069
 Amen+0.063
oscope+0.060
Neg logit contributions
ll-0.056
uddle-0.046
castle-0.043
!-0.041
 upon-0.040
Token be
Token ID307
Feature activation-2.11e-02

Pos logit contributions
ll+0.097
uddle+0.085
 upon+0.076
ounce+0.075
!+0.074
Neg logit contributions
 Some-0.089
 pain-0.088
 Many-0.086
 Amen-0.083
oscope-0.078
Token seen
Token ID1775
Feature activation+2.11e-02

Pos logit contributions
 Some+0.034
 pain+0.032
 Many+0.031
 Amen+0.030
oscope+0.027
Neg logit contributions
ll-0.044
uddle-0.038
!-0.037
castle-0.035
 upon-0.035
Token in
Token ID287
Feature activation-1.18e-02

Pos logit contributions
ll+0.038
uddle+0.032
ounce+0.029
castle+0.028
cast+0.026
Neg logit contributions
 Some-0.042
 pain-0.040
 Many-0.039
 Amen-0.037
 novels-0.036
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
 Some+0.022
 Many+0.020
 Amen+0.019
 pain+0.019
oscope+0.018
Neg logit contributions
ll-0.021
!-0.019
 upon-0.018
uddle-0.017
castle-0.016
Token p
Token ID279
Feature activation+3.59e-02

Pos logit contributions
ll+0.019
uddle+0.015
!+0.014
ounce+0.014
 upon+0.013
Neg logit contributions
 Some-0.020
 pain-0.020
 Many-0.019
 Amen-0.018
oscope-0.018
Tokenuddle
Token ID24500
Feature activation+1.19e-01

Pos logit contributions
ll+0.029
!+0.023
 ashore+0.017
 upon+0.017
elcome+0.015
Neg logit contributions
 Some-0.094
 Many-0.085
 blonde-0.078
oscope-0.078
 blond-0.078
Token,
Token ID11
Feature activation-7.11e-03

Pos logit contributions
ll+0.211
uddle+0.173
ounce+0.161
 upon+0.153
 destination+0.148
Neg logit contributions
 Some-0.223
 pain-0.202
 Many-0.198
oscope-0.191
 Amen-0.185
Token telling
Token ID5149
Feature activation-9.14e-03

Pos logit contributions
 Some+0.014
 pain+0.013
 Many+0.013
 Amen+0.012
oscope+0.011
Neg logit contributions
ll-0.013
uddle-0.011
!-0.010
castle-0.010
ounce-0.009
Token jokes
Token ID14532
Feature activation+2.54e-02

Pos logit contributions
 Some+0.019
 Many+0.018
 pain+0.017
 Amen+0.016
oscope+0.016
Neg logit contributions
ll-0.016
uddle-0.013
!-0.012
castle-0.012
 upon-0.012
Token and
Token ID290
Feature activation-1.11e-02

Pos logit contributions
ll+0.044
uddle+0.042
castle+0.038
ounce+0.037
 destination+0.037
Neg logit contributions
 Some-0.044
 Many-0.039
 pain-0.037
 aids-0.036
 novels-0.036
Token making
Token ID1642
Feature activation-2.65e-02

Pos logit contributions
 Some+0.020
 pain+0.019
 Many+0.019
 Amen+0.018
oscope+0.017
Neg logit contributions
ll-0.021
uddle-0.018
!-0.017
castle-0.016
ounce-0.016
Token fear
Token ID3252
Feature activation+1.57e-02

Pos logit contributions
 pain+0.018
 Some+0.018
 Many+0.016
oscope+0.015
 blonde+0.015
Neg logit contributions
ll-0.023
uddle-0.018
castle-0.018
!-0.018
ounce-0.018
Token and
Token ID290
Feature activation+5.59e-03

Pos logit contributions
ll+0.027
uddle+0.024
castle+0.021
ounce+0.020
cast+0.019
Neg logit contributions
 pain-0.028
 Some-0.028
 Many-0.026
oscope-0.025
 blonde-0.025
Token conquered
Token ID29346
Feature activation+8.24e-02

Pos logit contributions
ll+0.010
uddle+0.009
!+0.008
 upon+0.008
castle+0.008
Neg logit contributions
 Some-0.010
 pain-0.010
 Many-0.010
 Amen-0.009
oscope-0.009
Token the
Token ID262
Feature activation+1.20e-02

Pos logit contributions
ll+0.194
uddle+0.165
castle+0.154
ounce+0.150
 upon+0.148
Neg logit contributions
 Some-0.116
 pain-0.114
 Many-0.106
 Amen-0.097
 blonde-0.096
Token p
Token ID279
Feature activation+4.09e-02

Pos logit contributions
ll+0.023
!+0.019
uddle+0.019
castle+0.018
 upon+0.018
Neg logit contributions
 Some-0.021
 pain-0.020
 Many-0.020
 Amen-0.018
oscope-0.017
Tokenuddle
Token ID24500
Feature activation+1.19e-01

Pos logit contributions
ll+0.037
!+0.031
 ashore+0.029
!".+0.027
 upon+0.026
Neg logit contributions
 Some-0.096
 Many-0.087
 pain-0.077
pler-0.076
 curly-0.075
Token!
Token ID0
Feature activation+3.97e-02

Pos logit contributions
ll+0.212
uddle+0.168
 upon+0.155
ounce+0.147
castle+0.146
Neg logit contributions
 Some-0.225
 pain-0.219
 Many-0.208
oscope-0.192
 aids-0.189
Token<|endoftext|>
Token ID50256
Feature activation+5.08e-02

Pos logit contributions
ll+0.075
!+0.064
uddle+0.061
 upon+0.059
ounce+0.057
Neg logit contributions
 Some-0.071
 pain-0.067
 Many-0.066
 Amen-0.064
oscope-0.061
TokenOnce
Token ID7454
Feature activation+1.08e-02

Pos logit contributions
ll+0.095
uddle+0.080
castle+0.077
!+0.075
ounce+0.072
Neg logit contributions
 Some-0.094
 Many-0.087
 pain-0.085
 Amen-0.081
oscope-0.075
Token,
Token ID11
Feature activation+2.60e-03

Pos logit contributions
ll+0.006
uddle+0.003
!+0.002
castle+0.002
ounce+0.001
Neg logit contributions
 Some-0.034
 Many-0.032
 pain-0.031
 Amen-0.031
 novels-0.029
Token a
Token ID257
Feature activation+6.90e-03

Pos logit contributions
ll+0.003
 upon+0.003
!+0.003
uddle+0.002
castle+0.002
Neg logit contributions
 Some-0.006
 Many-0.006
 pain-0.006
 Amen-0.006
oscope-0.006
Token was
Token ID373
Feature activation+2.85e-03

Pos logit contributions
ll+0.094
uddle+0.076
!+0.071
 upon+0.071
ounce+0.070
Neg logit contributions
 Some-0.098
 pain-0.092
 Many-0.092
 Amen-0.085
 aids-0.084
Token happy
Token ID3772
Feature activation-2.58e-02

Pos logit contributions
ll+0.006
uddle+0.005
castle+0.004
!+0.004
ounce+0.004
Neg logit contributions
 Some-0.005
 pain-0.005
 Many-0.005
 Amen-0.004
 blonde-0.004
Token and
Token ID290
Feature activation+3.47e-03

Pos logit contributions
 Some+0.040
 pain+0.038
 Many+0.036
 nightmares+0.034
 emotion+0.033
Neg logit contributions
ll-0.052
uddle-0.048
ounce-0.042
castle-0.040
 upon-0.039
Token the
Token ID262
Feature activation+1.28e-02

Pos logit contributions
ll+0.007
uddle+0.006
!+0.005
castle+0.005
 upon+0.005
Neg logit contributions
 Some-0.006
 Many-0.006
 pain-0.006
 Amen-0.005
 emotion-0.005
Token p
Token ID279
Feature activation+4.08e-02

Pos logit contributions
ll+0.023
!+0.020
uddle+0.020
 upon+0.018
castle+0.018
Neg logit contributions
 Some-0.024
 Many-0.023
 pain-0.022
 Amen-0.021
oscope-0.020
Tokenuddle
Token ID24500
Feature activation+1.18e-01

Pos logit contributions
ll+0.043
!+0.039
 upon+0.035
 ashore+0.031
castle+0.031
Neg logit contributions
 Some-0.094
 Many-0.090
 pain-0.080
 blonde-0.080
 blond-0.080
Token was
Token ID373
Feature activation+3.66e-03

Pos logit contributions
ll+0.205
uddle+0.169
ounce+0.157
 upon+0.154
castle+0.146
Neg logit contributions
 Some-0.219
 pain-0.212
 Many-0.204
oscope-0.190
 emotion-0.190
Token sunny
Token ID27737
Feature activation-7.83e-03

Pos logit contributions
ll+0.008
uddle+0.007
ounce+0.006
castle+0.006
cast+0.006
Neg logit contributions
 Some-0.006
 pain-0.006
 Amen-0.005
 Many-0.005
 blonde-0.005
Token again
Token ID757
Feature activation-3.34e-02

Pos logit contributions
 Some+0.013
 pain+0.013
 Amen+0.012
 Many+0.012
 novels+0.011
Neg logit contributions
ll-0.014
uddle-0.012
ounce-0.012
ounced-0.011
 upon-0.011
Token.
Token ID13
Feature activation+2.11e-03

Pos logit contributions
 pain+0.068
 Some+0.067
 Many+0.064
 Amen+0.061
oscope+0.059
Neg logit contributions
ll-0.050
uddle-0.045
castle-0.038
ounce-0.037
 upon-0.036
Token And
Token ID843
Feature activation+1.20e-02

Pos logit contributions
ll+0.005
!+0.004
uddle+0.004
 upon+0.004
castle+0.004
Neg logit contributions
 Some-0.003
 Many-0.003
 pain-0.003
 Amen-0.003
oscope-0.003
Token got
Token ID1392
Feature activation-5.85e-03

Pos logit contributions
ll+0.013
uddle+0.011
castle+0.010
!+0.010
ounce+0.010
Neg logit contributions
 pain-0.013
 Some-0.013
 Many-0.012
oscope-0.012
ez-0.012
Token out
Token ID503
Feature activation+3.59e-03

Pos logit contributions
 Some+0.013
 pain+0.012
 Many+0.012
 Amen+0.011
 blond+0.011
Neg logit contributions
ll-0.009
uddle-0.007
!-0.006
castle-0.006
cast-0.006
Token of
Token ID286
Feature activation-2.60e-02

Pos logit contributions
ll+0.006
!+0.005
uddle+0.005
 upon+0.005
castle+0.004
Neg logit contributions
 Some-0.007
 Many-0.007
 pain-0.007
 Amen-0.007
oscope-0.006
Token the
Token ID262
Feature activation+1.21e-02

Pos logit contributions
 Some+0.049
 Many+0.046
 pain+0.045
 Amen+0.043
oscope+0.040
Neg logit contributions
ll-0.048
!-0.040
uddle-0.040
 upon-0.037
castle-0.037
Token p
Token ID279
Feature activation+4.30e-02

Pos logit contributions
ll+0.022
!+0.018
uddle+0.017
ounce+0.016
 upon+0.016
Neg logit contributions
 Some-0.023
 pain-0.022
 Many-0.022
 Amen-0.020
oscope-0.019
Tokenuddle
Token ID24500
Feature activation+1.18e-01

Pos logit contributions
ll+0.034
!+0.028
 ashore+0.023
!".+0.021
castle+0.021
Neg logit contributions
 Some-0.114
 Many-0.102
 pain-0.095
 blonde-0.093
 blond-0.092
Token,
Token ID11
Feature activation+5.42e-04

Pos logit contributions
ll+0.221
uddle+0.175
 upon+0.166
ounce+0.164
castle+0.155
Neg logit contributions
 Some-0.215
 pain-0.205
 Many-0.195
oscope-0.170
 blonde-0.166
Token his
Token ID465
Feature activation-5.07e-03

Pos logit contributions
ll+0.001
uddle+0.001
!+0.001
castle+0.001
 upon+0.001
Neg logit contributions
 Some-0.001
 Many-0.001
 pain-0.001
 Amen-0.001
 novels-0.001
Token fur
Token ID9230
Feature activation+1.53e-03

Pos logit contributions
 Some+0.009
 pain+0.009
 Many+0.008
 blonde+0.007
 blond+0.007
Neg logit contributions
ll-0.010
!-0.008
 upon-0.008
ounce-0.008
uddle-0.008
Token was
Token ID373
Feature activation+1.86e-03

Pos logit contributions
ll+0.003
uddle+0.002
 upon+0.002
ounce+0.002
rying+0.002
Neg logit contributions
 pain-0.003
 Some-0.003
 blonde-0.002
 blond-0.002
oscope-0.002
Token soaked
Token ID33657
Feature activation+9.67e-02

Pos logit contributions
ll+0.004
uddle+0.003
ounce+0.003
cast+0.003
castle+0.003
Neg logit contributions
 pain-0.003
 Some-0.003
 Many-0.003
 blonde-0.003
 blond-0.003
Token for
Token ID329
Feature activation-7.36e-03

Pos logit contributions
ll+0.040
uddle+0.032
castle+0.029
ounce+0.027
 upon+0.026
Neg logit contributions
 Some-0.071
 pain-0.071
 Many-0.066
 Amen-0.065
 novels-0.063
Token falling
Token ID7463
Feature activation+6.63e-03

Pos logit contributions
 Some+0.013
 pain+0.013
 Many+0.012
 Amen+0.011
oscope+0.011
Neg logit contributions
ll-0.014
uddle-0.012
castle-0.011
 upon-0.011
ounce-0.011
Token into
Token ID656
Feature activation+4.14e-03

Pos logit contributions
ll+0.010
uddle+0.008
castle+0.008
lly+0.007
ory+0.007
Neg logit contributions
 Some-0.013
 pain-0.013
 Many-0.012
oscope-0.011
 nightmares-0.011
Token that
Token ID326
Feature activation+2.11e-02

Pos logit contributions
ll+0.008
uddle+0.007
!+0.007
 upon+0.006
castle+0.006
Neg logit contributions
 Some-0.007
 Many-0.007
 pain-0.007
 Amen-0.006
oscope-0.006
Token p
Token ID279
Feature activation+3.98e-02

Pos logit contributions
ll+0.039
uddle+0.030
ounce+0.028
castle+0.028
 upon+0.028
Neg logit contributions
 pain-0.040
 Some-0.038
 Many-0.037
 Amen-0.034
 emotion-0.033
Tokenuddle
Token ID24500
Feature activation+1.18e-01

Pos logit contributions
ll+0.039
!+0.032
castle+0.028
 ashore+0.027
 upon+0.026
Neg logit contributions
 Some-0.097
 Many-0.088
 pain-0.082
ez-0.081
 aids-0.080
Token!
Token ID0
Feature activation+4.03e-02

Pos logit contributions
ll+0.181
uddle+0.133
 upon+0.129
ounce+0.128
castle+0.123
Neg logit contributions
 Some-0.244
 pain-0.231
 Many-0.222
 aids-0.210
oscope-0.209
Tokenâ
Token ID22940
Feature activation+6.50e-02

Pos logit contributions
ll+0.087
!+0.076
uddle+0.074
 upon+0.072
ounce+0.069
Neg logit contributions
 Some-0.058
 pain-0.056
 Many-0.054
 Amen-0.051
oscope-0.049
Token
Token ID26391
Feature activation-3.49e-02

Pos logit contributions
ll+0.140
!+0.120
uddle+0.117
 upon+0.110
castle+0.110
Neg logit contributions
 Some-0.105
 Many-0.096
 pain-0.093
 Amen-0.088
 novels-0.082
Token Jack
Token ID3619
Feature activation+4.58e-02

Pos logit contributions
 Many+0.057
 Some+0.056
 pain+0.050
 Amen+0.049
 blonde+0.048
Neg logit contributions
ll-0.068
castle-0.056
uddle-0.056
rying-0.055
 upon-0.054
Token was
Token ID373
Feature activation+3.36e-03

Pos logit contributions
ll+0.076
uddle+0.065
castle+0.064
ounce+0.059
 upon+0.058
Neg logit contributions
 pain-0.084
ez-0.078
 Some-0.078
oscope-0.076
 Many-0.074
Token both
Token ID1111
Feature activation+8.64e-02

Pos logit contributions
ll+0.031
uddle+0.026
castle+0.025
!+0.024
ounce+0.023
Neg logit contributions
 Some-0.035
 Many-0.034
 pain-0.033
 Amen-0.031
 blonde-0.030
Token jumped
Token ID11687
Feature activation+4.12e-02

Pos logit contributions
ll+0.156
uddle+0.133
castle+0.126
!+0.122
ounce+0.120
Neg logit contributions
 Some-0.158
 Many-0.151
 pain-0.150
 Amen-0.135
 blond-0.134
Token in
Token ID287
Feature activation+6.05e-03

Pos logit contributions
ll+0.091
castle+0.068
field+0.065
lly+0.063
ounce+0.063
Neg logit contributions
 Some-0.065
 pain-0.063
 Many-0.056
 sorry-0.047
 emotion-0.047
Token the
Token ID262
Feature activation+1.40e-02

Pos logit contributions
ll+0.012
uddle+0.010
!+0.009
castle+0.009
 upon+0.009
Neg logit contributions
 Some-0.011
 pain-0.010
 Many-0.010
 Amen-0.010
oscope-0.009
Token p
Token ID279
Feature activation+3.70e-02

Pos logit contributions
ll+0.022
uddle+0.017
!+0.017
 upon+0.016
castle+0.016
Neg logit contributions
 Some-0.030
 Many-0.029
 pain-0.028
 Amen-0.027
oscope-0.026
Tokenuddle
Token ID24500
Feature activation+1.17e-01

Pos logit contributions
ll+0.032
!+0.028
 ashore+0.024
castle+0.022
!".+0.022
Neg logit contributions
 Some-0.093
 Many-0.083
 pain-0.073
pler-0.072
 blonde-0.072
Token.
Token ID13
Feature activation+6.46e-03

Pos logit contributions
ll+0.208
uddle+0.160
 upon+0.158
ounce+0.152
ory+0.148
Neg logit contributions
 Some-0.226
 pain-0.209
 Many-0.200
oscope-0.177
 emotion-0.177
Token Max
Token ID5436
Feature activation-1.18e-02

Pos logit contributions
ll+0.013
!+0.012
uddle+0.011
 upon+0.011
ounce+0.011
Neg logit contributions
 pain-0.010
 Some-0.009
 Amen-0.009
 Many-0.009
oscope-0.009
Token was
Token ID373
Feature activation+5.26e-04

Pos logit contributions
 pain+0.016
 Some+0.016
 Many+0.016
ez+0.015
 blonde+0.014
Neg logit contributions
ll-0.023
castle-0.023
 destination-0.020
uddle-0.020
 ashore-0.020
Token no
Token ID645
Feature activation-1.62e-02

Pos logit contributions
ll+0.001
uddle+0.001
!+0.001
castle+0.001
 upon+0.001
Neg logit contributions
 Some-0.001
 pain-0.001
 Many-0.001
 Amen-0.001
 blond-0.001
Token longer
Token ID2392
Feature activation+4.27e-03

Pos logit contributions
 Some+0.033
 Many+0.031
 Amen+0.031
oscope+0.029
 pain+0.029
Neg logit contributions
ll-0.027
!-0.023
uddle-0.023
 upon-0.021
castle-0.020
Token you
Token ID345
Feature activation-1.96e-02

Pos logit contributions
ll+0.032
uddle+0.027
castle+0.025
ounce+0.024
!+0.023
Neg logit contributions
 Some-0.036
 Many-0.035
 pain-0.034
 Amen-0.033
 blonde-0.030
Token can
Token ID460
Feature activation+2.02e-02

Pos logit contributions
 Some+0.033
 pain+0.031
 Many+0.031
 Amen+0.029
oscope+0.028
Neg logit contributions
ll-0.038
uddle-0.033
castle-0.032
ounce-0.031
!-0.031
Token cross
Token ID3272
Feature activation+2.27e-02

Pos logit contributions
ll+0.042
uddle+0.036
 upon+0.035
castle+0.034
!+0.034
Neg logit contributions
 Some-0.031
 Many-0.029
 pain-0.029
 Amen-0.028
oscope-0.025
Token the
Token ID262
Feature activation+1.25e-02

Pos logit contributions
ll+0.047
uddle+0.040
castle+0.037
ounce+0.036
 upon+0.036
Neg logit contributions
 Some-0.037
 pain-0.036
 Many-0.035
 Amen-0.031
oscope-0.031
Token p
Token ID279
Feature activation+3.45e-02

Pos logit contributions
ll+0.022
!+0.018
uddle+0.018
 upon+0.017
castle+0.016
Neg logit contributions
 Some-0.024
 pain-0.023
 Many-0.023
 Amen-0.022
oscope-0.021
Tokenuddle
Token ID24500
Feature activation+1.17e-01

Pos logit contributions
ll+0.027
!+0.020
 upon+0.017
 ashore+0.015
castle+0.013
Neg logit contributions
 Some-0.093
 Many-0.087
 pain-0.081
oscope-0.079
pler-0.078
Token safely
Token ID11512
Feature activation+2.45e-02

Pos logit contributions
ll+0.194
uddle+0.162
ounce+0.147
 upon+0.143
castle+0.135
Neg logit contributions
 Some-0.226
 Many-0.207
 pain-0.203
oscope-0.196
 Amen-0.195
Token."
Token ID526
Feature activation+8.14e-03

Pos logit contributions
ll+0.035
uddle+0.032
castle+0.030
ounce+0.027
 raft+0.026
Neg logit contributions
 Some-0.047
 Many-0.045
 pain-0.044
oscope-0.043
 Amen-0.041
Token
Token ID198
Feature activation+8.92e-03

Pos logit contributions
ll+0.018
uddle+0.016
castle+0.015
ounce+0.014
 destination+0.014
Neg logit contributions
 Some-0.013
 Many-0.012
 Amen-0.010
 pain-0.010
growth-0.009
Token
Token ID198
Feature activation+8.05e-03

Pos logit contributions
ll+0.019
uddle+0.017
castle+0.016
!+0.016
 upon+0.015
Neg logit contributions
 Some-0.014
 Many-0.013
 Amen-0.012
 pain-0.011
ez-0.010
TokenJoe
Token ID19585
Feature activation+6.65e-02

Pos logit contributions
ll+0.012
uddle+0.012
castle+0.010
 destination+0.009
!+0.009
Neg logit contributions
 Some-0.017
 Many-0.017
 Amen-0.015
 pain-0.014
growth-0.014
Tokenashed
Token ID5263
Feature activation+5.32e-02

Pos logit contributions
ll+0.085
!+0.078
 destination+0.073
castle+0.071
 upon+0.069
Neg logit contributions
 Some-0.075
 Many-0.070
ez-0.064
 pain-0.060
oscope-0.057
Token around
Token ID1088
Feature activation-7.41e-03

Pos logit contributions
ll+0.109
uddle+0.090
castle+0.089
 destination+0.085
ounce+0.084
Neg logit contributions
 Some-0.090
 pain-0.087
 Many-0.083
ez-0.074
 emotion-0.072
Token in
Token ID287
Feature activation-2.27e-03

Pos logit contributions
 Some+0.013
 pain+0.013
 Many+0.012
 Amen+0.011
ez+0.011
Neg logit contributions
ll-0.014
uddle-0.012
castle-0.011
ounce-0.011
 destination-0.010
Token the
Token ID262
Feature activation+6.56e-03

Pos logit contributions
 Some+0.004
 pain+0.004
 Many+0.004
 Amen+0.003
 blond+0.003
Neg logit contributions
ll-0.005
uddle-0.004
!-0.004
castle-0.004
ounce-0.004
Token p
Token ID279
Feature activation+3.87e-02

Pos logit contributions
ll+0.013
ounce+0.010
 upon+0.010
!+0.010
castle+0.010
Neg logit contributions
 Some-0.012
 pain-0.012
 Many-0.011
oscope-0.009
 blonde-0.009
Tokenuddle
Token ID24500
Feature activation+1.17e-01

Pos logit contributions
ll+0.039
!+0.036
 ashore+0.031
 aboard+0.029
!".+0.029
Neg logit contributions
 Some-0.089
 Many-0.079
 blonde-0.069
 blond-0.069
pler-0.068
Token,
Token ID11
Feature activation-2.84e-03

Pos logit contributions
ll+0.222
uddle+0.183
ounce+0.173
 upon+0.169
 destination+0.164
Neg logit contributions
 Some-0.207
 pain-0.189
 Many-0.183
oscope-0.167
 aids-0.160
Token making
Token ID1642
Feature activation-2.02e-02

Pos logit contributions
 Some+0.005
 pain+0.005
 Many+0.004
 blonde+0.004
 blond+0.004
Neg logit contributions
ll-0.005
uddle-0.005
ounce-0.004
 destination-0.004
castle-0.004
Token even
Token ID772
Feature activation+4.03e-02

Pos logit contributions
 Some+0.034
 pain+0.033
 Many+0.032
 emotion+0.027
ez+0.027
Neg logit contributions
ll-0.042
uddle-0.034
castle-0.033
!-0.032
 upon-0.032
Token bigger
Token ID5749
Feature activation-3.86e-02

Pos logit contributions
ll+0.069
!+0.061
 upon+0.058
uddle+0.058
castle+0.054
Neg logit contributions
 Some-0.076
 Many-0.072
 Amen-0.072
 pain-0.068
oscope-0.066
Token waves
Token ID9813
Feature activation+4.15e-02

Pos logit contributions
 pain+0.074
 Some+0.073
 Many+0.066
 emotion+0.061
 blond+0.060
Neg logit contributions
ll-0.071
 upon-0.052
uddle-0.052
ounce-0.050
castle-0.048
Token never
Token ID1239
Feature activation+2.44e-02

Pos logit contributions
ll+0.007
uddle+0.006
 upon+0.006
!+0.006
castle+0.005
Neg logit contributions
 Some-0.008
 pain-0.008
 Many-0.008
 Amen-0.007
 blonde-0.007
Token forgot
Token ID16453
Feature activation+7.27e-02

Pos logit contributions
ll+0.042
!+0.035
uddle+0.034
 upon+0.032
castle+0.032
Neg logit contributions
 Some-0.049
 Many-0.046
 pain-0.045
 Amen-0.043
oscope-0.041
Token the
Token ID262
Feature activation+1.40e-02

Pos logit contributions
ll+0.156
uddle+0.133
ounce+0.121
castle+0.120
 upon+0.112
Neg logit contributions
 Some-0.123
 Many-0.112
 pain-0.109
 novels-0.103
 aids-0.100
Token ancient
Token ID6156
Feature activation-1.35e-02

Pos logit contributions
ll+0.029
!+0.024
uddle+0.024
 upon+0.023
castle+0.023
Neg logit contributions
 Some-0.023
 Many-0.022
 pain-0.021
 Amen-0.021
oscope-0.019
Token p
Token ID279
Feature activation+3.46e-02

Pos logit contributions
 Some+0.024
 pain+0.023
 Many+0.022
 Amen+0.020
 blond+0.020
Neg logit contributions
ll-0.028
uddle-0.022
!-0.021
ounce-0.021
 upon-0.020
Tokenuddle
Token ID24500
Feature activation+1.16e-01

Pos logit contributions
ll+0.039
!+0.030
 upon+0.029
 ashore+0.027
castle+0.026
Neg logit contributions
 Some-0.081
 Many-0.074
 pain-0.069
 blonde-0.068
 blond-0.068
Token.
Token ID13
Feature activation+4.53e-03

Pos logit contributions
ll+0.202
uddle+0.165
 upon+0.152
ounce+0.151
 destination+0.135
Neg logit contributions
 Some-0.220
 pain-0.211
 Many-0.201
 aids-0.189
oscope-0.188
Token<|endoftext|>
Token ID50256
Feature activation+4.42e-02

Pos logit contributions
ll+0.008
!+0.007
uddle+0.007
 upon+0.007
ounce+0.006
Neg logit contributions
 Some-0.008
 pain-0.008
 Many-0.008
 Amen-0.008
oscope-0.007
TokenOnce
Token ID7454
Feature activation+8.17e-03

Pos logit contributions
ll+0.088
uddle+0.074
!+0.072
castle+0.070
 upon+0.068
Neg logit contributions
 Some-0.077
 Many-0.071
 pain-0.069
 Amen-0.066
oscope-0.061
Token upon
Token ID2402
Feature activation+5.13e-02

Pos logit contributions
ll+0.005
uddle+0.003
!+0.002
castle+0.002
ounce+0.001
Neg logit contributions
 Some-0.025
 Many-0.024
 pain-0.023
 Amen-0.022
 novels-0.022
Token a
Token ID257
Feature activation+8.20e-03

Pos logit contributions
ll+0.082
!+0.081
 upon+0.077
!"+0.065
uddle+0.063
Neg logit contributions
 Some-0.098
 Amen-0.089
 Many-0.088
oscope-0.087
 pain-0.087
Token the
Token ID262
Feature activation+4.49e-03

Pos logit contributions
 Some+0.003
 Many+0.003
 pain+0.003
 Amen+0.003
oscope+0.002
Neg logit contributions
ll-0.003
!-0.003
uddle-0.002
 upon-0.002
ounce-0.002
Token rabbit
Token ID22746
Feature activation+3.46e-02

Pos logit contributions
ll+0.009
!+0.008
uddle+0.008
 upon+0.007
castle+0.007
Neg logit contributions
 Some-0.008
 Many-0.007
 pain-0.007
 Amen-0.007
oscope-0.006
Token across
Token ID1973
Feature activation-1.91e-03

Pos logit contributions
ll+0.061
uddle+0.053
castle+0.050
cast+0.044
ounce+0.044
Neg logit contributions
 Some-0.063
 Many-0.060
 pain-0.058
oscope-0.058
 Amen-0.054
Token the
Token ID262
Feature activation+7.79e-03

Pos logit contributions
 Some+0.004
 Many+0.003
 pain+0.003
 Amen+0.003
oscope+0.003
Neg logit contributions
ll-0.003
!-0.003
uddle-0.003
 upon-0.003
castle-0.003
Token p
Token ID279
Feature activation+3.66e-02

Pos logit contributions
ll+0.014
!+0.011
uddle+0.010
 upon+0.010
castle+0.010
Neg logit contributions
 Some-0.015
 pain-0.015
 Many-0.014
 Amen-0.014
oscope-0.013
Tokenuddle
Token ID24500
Feature activation+1.16e-01

Pos logit contributions
ll+0.032
!+0.031
!".+0.026
!"+0.023
 ashore+0.023
Neg logit contributions
 Some-0.087
 Many-0.077
pler-0.070
itt-0.069
 blonde-0.069
Token and
Token ID290
Feature activation-3.36e-03

Pos logit contributions
ll+0.208
uddle+0.170
 upon+0.152
ounce+0.149
castle+0.146
Neg logit contributions
 Some-0.222
 pain-0.207
 Many-0.204
oscope-0.192
 blonde-0.181
Token she
Token ID673
Feature activation+1.34e-02

Pos logit contributions
 Some+0.006
 pain+0.005
 Many+0.005
oscope+0.005
 Amen+0.005
Neg logit contributions
ll-0.006
uddle-0.006
castle-0.005
!-0.005
ounce-0.005
Token made
Token ID925
Feature activation+8.91e-03

Pos logit contributions
ll+0.025
uddle+0.022
castle+0.021
!+0.020
ounce+0.019
Neg logit contributions
 Some-0.019
 pain-0.019
 Many-0.019
oscope-0.017
ez-0.016
Token it
Token ID340
Feature activation-2.55e-03

Pos logit contributions
ll+0.018
uddle+0.015
!+0.015
castle+0.014
 upon+0.014
Neg logit contributions
 Some-0.015
 Many-0.014
 pain-0.014
 Amen-0.013
oscope-0.012
Token to
Token ID284
Feature activation-1.48e-03

Pos logit contributions
 pain+0.004
 Some+0.004
 Many+0.004
 nightmares+0.004
 sleeps+0.003
Neg logit contributions
ll-0.005
uddle-0.004
ory-0.004
castle-0.004
ounce-0.003
Token to
Token ID284
Feature activation+3.59e-03

Pos logit contributions
ll+0.043
uddle+0.041
cast+0.034
castle+0.033
pit+0.033
Neg logit contributions
 pain-0.041
 Some-0.038
 Many-0.034
 nightmares-0.032
 emotion-0.031
Token step
Token ID2239
Feature activation-1.18e-02

Pos logit contributions
ll+0.008
uddle+0.007
 upon+0.006
ounce+0.006
castle+0.006
Neg logit contributions
 pain-0.006
 Some-0.005
 Many-0.005
oscope-0.004
 Amen-0.004
Token in
Token ID287
Feature activation+9.87e-03

Pos logit contributions
 Some+0.022
 pain+0.021
 Many+0.020
 blonde+0.018
 blond+0.018
Neg logit contributions
ll-0.022
uddle-0.018
castle-0.016
 upon-0.015
ounce-0.015
Token a
Token ID257
Feature activation+1.87e-02

Pos logit contributions
ll+0.017
!+0.016
uddle+0.014
 upon+0.014
castle+0.013
Neg logit contributions
 Some-0.019
 Many-0.018
 Amen-0.018
 pain-0.017
oscope-0.016
Token p
Token ID279
Feature activation+3.84e-02

Pos logit contributions
ll+0.037
 upon+0.027
!+0.027
cast+0.027
icking+0.027
Neg logit contributions
 Some-0.034
 pain-0.034
 Many-0.032
 Amen-0.029
 emotion-0.027
Tokenuddle
Token ID24500
Feature activation+1.16e-01

Pos logit contributions
ll+0.029
!+0.019
 upon+0.013
castle+0.013
elcome+0.012
Neg logit contributions
 Some-0.105
 Many-0.097
 pain-0.091
 blonde-0.088
 blond-0.088
Token and
Token ID290
Feature activation+4.73e-03

Pos logit contributions
ll+0.206
uddle+0.166
ounce+0.148
 upon+0.146
castle+0.141
Neg logit contributions
 Some-0.226
 pain-0.217
 Many-0.207
oscope-0.193
 aids-0.185
Token get
Token ID651
Feature activation-7.40e-03

Pos logit contributions
ll+0.009
uddle+0.007
!+0.007
 upon+0.007
castle+0.007
Neg logit contributions
 Some-0.009
 Many-0.008
 pain-0.008
 Amen-0.008
oscope-0.007
Token her
Token ID607
Feature activation-8.24e-03

Pos logit contributions
 Some+0.014
 pain+0.013
 Many+0.012
 blond+0.011
 Amen+0.011
Neg logit contributions
ll-0.014
uddle-0.012
castle-0.011
!-0.011
ounce-0.010
Token feet
Token ID3625
Feature activation+2.48e-03

Pos logit contributions
 pain+0.014
 Some+0.013
 Many+0.012
 blonde+0.012
 blond+0.012
Neg logit contributions
ll-0.016
uddle-0.013
ounce-0.013
!-0.013
castle-0.012
Token wet
Token ID9583
Feature activation+4.56e-02

Pos logit contributions
ll+0.005
uddle+0.004
castle+0.003
ounce+0.003
lly+0.003
Neg logit contributions
 pain-0.004
 Some-0.004
 Many-0.004
oscope-0.004
 curly-0.004
Token get
Token ID651
Feature activation-2.86e-02

Pos logit contributions
 Some+0.006
 pain+0.006
 Many+0.006
 Amen+0.005
oscope+0.005
Neg logit contributions
ll-0.004
!-0.004
 upon-0.003
uddle-0.003
castle-0.003
Token across
Token ID1973
Feature activation-1.01e-02

Pos logit contributions
 pain+0.050
 Some+0.050
 Many+0.044
 Amen+0.041
oscope+0.040
Neg logit contributions
ll-0.056
uddle-0.048
ounce-0.044
castle-0.044
!-0.041
Token the
Token ID262
Feature activation+1.26e-02

Pos logit contributions
 pain+0.015
 Some+0.015
 Many+0.013
 Amen+0.012
 novels+0.012
Neg logit contributions
ll-0.024
uddle-0.020
ounce-0.019
castle-0.017
icking-0.017
Token wet
Token ID9583
Feature activation+2.86e-02

Pos logit contributions
ll+0.025
uddle+0.019
ounce+0.019
!+0.018
 upon+0.018
Neg logit contributions
 Some-0.022
 pain-0.022
 Many-0.020
oscope-0.019
 x-0.019
Token p
Token ID279
Feature activation+3.25e-02

Pos logit contributions
ll+0.060
ounce+0.043
uddle+0.042
 upon+0.042
 exactly+0.041
Neg logit contributions
 pain-0.051
 Some-0.048
 Many-0.044
oscope-0.043
 emotion-0.040
Tokenuddle
Token ID24500
Feature activation+1.16e-01

Pos logit contributions
ll+0.029
!+0.019
 ashore+0.017
elcome+0.017
 upon+0.017
Neg logit contributions
 Some-0.082
 Many-0.073
 pain-0.071
 curly-0.067
pler-0.067
Token without
Token ID1231
Feature activation-5.64e-02

Pos logit contributions
ll+0.219
uddle+0.176
ounce+0.166
 upon+0.165
 destination+0.156
Neg logit contributions
 Some-0.193
 pain-0.192
oscope-0.167
 Many-0.166
 aids-0.153
Token getting
Token ID1972
Feature activation-4.91e-02

Pos logit contributions
 pain+0.095
 Some+0.092
 Many+0.084
 blond+0.076
oscope+0.076
Neg logit contributions
ll-0.118
uddle-0.103
castle-0.094
ounce-0.092
 upon-0.091
Token wet
Token ID9583
Feature activation+2.32e-02

Pos logit contributions
 pain+0.078
 Some+0.077
 Many+0.060
 curly+0.058
 nightmares+0.057
Neg logit contributions
ll-0.106
uddle-0.089
castle-0.088
pit-0.082
ounce-0.080
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
ll+0.044
uddle+0.034
ounce+0.033
 upon+0.032
 destination+0.030
Neg logit contributions
 pain-0.044
 Some-0.040
 Many-0.035
oscope-0.035
 nightmares-0.034
Token\n
Token ID198
Feature activation-8.36e-03

Pos logit contributions
 Some+0.022
 Many+0.020
 pain+0.019
 Amen+0.018
oscope+0.017
Neg logit contributions
ll-0.035
uddle-0.030
!-0.030
castle-0.028
 upon-0.028
Token to
Token ID284
Feature activation+2.89e-03

Pos logit contributions
ll+0.042
uddle+0.039
castle+0.033
cast+0.031
!+0.030
Neg logit contributions
 Some-0.050
 pain-0.050
 Many-0.047
 novels-0.043
 nightmares-0.043
Token jump
Token ID4391
Feature activation+4.28e-02

Pos logit contributions
ll+0.006
uddle+0.005
castle+0.005
cast+0.005
 upon+0.005
Neg logit contributions
 pain-0.005
 Some-0.005
 Many-0.004
oscope-0.004
 Amen-0.004
Token in
Token ID287
Feature activation+7.48e-03

Pos logit contributions
ll+0.083
uddle+0.066
castle+0.064
ounce+0.058
 destination+0.057
Neg logit contributions
 Some-0.076
 pain-0.072
 Many-0.069
oscope-0.061
 novels-0.060
Token the
Token ID262
Feature activation+1.42e-02

Pos logit contributions
ll+0.015
uddle+0.012
!+0.012
castle+0.011
 upon+0.011
Neg logit contributions
 Some-0.014
 pain-0.013
 Many-0.013
 Amen-0.012
oscope-0.011
Token p
Token ID279
Feature activation+3.55e-02

Pos logit contributions
ll+0.027
!+0.020
castle+0.020
ounce+0.020
uddle+0.019
Neg logit contributions
 Some-0.028
 pain-0.027
 Many-0.026
 Amen-0.023
oscope-0.023
Tokenuddle
Token ID24500
Feature activation+1.16e-01

Pos logit contributions
ll+0.031
!+0.024
 ashore+0.021
castle+0.021
elcome+0.021
Neg logit contributions
 Some-0.090
 Many-0.080
 pain-0.073
 blond-0.071
 blonde-0.071
Token because
Token ID780
Feature activation+4.78e-02

Pos logit contributions
ll+0.206
uddle+0.162
 upon+0.150
ounce+0.149
castle+0.140
Neg logit contributions
 Some-0.227
 pain-0.217
 Many-0.207
oscope-0.191
 novels-0.182
Token he
Token ID339
Feature activation+1.08e-02

Pos logit contributions
ll+0.072
!+0.062
 upon+0.057
uddle+0.056
castle+0.053
Neg logit contributions
 Some-0.100
 Many-0.096
 Amen-0.094
 pain-0.093
oscope-0.090
Token would
Token ID561
Feature activation+2.27e-02

Pos logit contributions
ll+0.018
uddle+0.015
castle+0.015
!+0.015
 upon+0.014
Neg logit contributions
 Some-0.021
 pain-0.020
 Many-0.020
 Amen-0.018
oscope-0.018
Token get
Token ID651
Feature activation-7.77e-03

Pos logit contributions
ll+0.046
uddle+0.041
castle+0.038
!+0.037
 upon+0.036
Neg logit contributions
 pain-0.037
 Some-0.036
 Many-0.035
 Amen-0.031
oscope-0.031
Token all
Token ID477
Feature activation+6.76e-02

Pos logit contributions
 Some+0.014
 pain+0.013
 Many+0.012
 Amen+0.011
oscope+0.011
Neg logit contributions
ll-0.015
uddle-0.013
castle-0.013
!-0.013
ounce-0.012
Token and
Token ID290
Feature activation+3.54e-03

Pos logit contributions
ll+0.164
uddle+0.137
ounce+0.127
 upon+0.124
castle+0.120
Neg logit contributions
 Some-0.134
 pain-0.128
oscope-0.117
 Many-0.114
 blonde-0.114
Token laughed
Token ID13818
Feature activation+5.55e-02

Pos logit contributions
ll+0.006
uddle+0.005
!+0.005
 upon+0.005
castle+0.005
Neg logit contributions
 Some-0.007
 pain-0.006
 Many-0.006
 blonde-0.006
 blond-0.006
Token as
Token ID355
Feature activation+3.20e-02

Pos logit contributions
ll+0.115
uddle+0.092
ounce+0.083
castle+0.080
 upon+0.077
Neg logit contributions
 Some-0.098
 pain-0.091
 Many-0.090
 blonde-0.086
 Amen-0.084
Token the
Token ID262
Feature activation+1.47e-02

Pos logit contributions
ll+0.059
!+0.053
 upon+0.048
uddle+0.048
castle+0.046
Neg logit contributions
 Some-0.059
 Many-0.055
 pain-0.054
 Amen-0.052
oscope-0.052
Token p
Token ID279
Feature activation+3.72e-02

Pos logit contributions
ll+0.032
uddle+0.026
!+0.026
ounce+0.025
 upon+0.025
Neg logit contributions
 Some-0.024
 pain-0.023
 Many-0.022
 Amen-0.020
 blonde-0.019
Tokenuddle
Token ID24500
Feature activation+1.16e-01

Pos logit contributions
ll+0.039
!+0.034
 upon+0.030
castle+0.028
 ashore+0.028
Neg logit contributions
 Some-0.087
 Many-0.078
 blonde-0.074
 blond-0.074
 pain-0.073
Token spl
Token ID4328
Feature activation+4.57e-02

Pos logit contributions
ll+0.211
uddle+0.165
 upon+0.159
ounce+0.154
castle+0.151
Neg logit contributions
 Some-0.218
 pain-0.211
 Many-0.199
oscope-0.184
 blonde-0.183
Tokenashed
Token ID5263
Feature activation+5.97e-02

Pos logit contributions
ll+0.083
!+0.069
 upon+0.067
castle+0.066
 destination+0.065
Neg logit contributions
 Some-0.082
 Many-0.076
 pain-0.071
ez-0.067
oscope-0.067
Token around
Token ID1088
Feature activation+1.80e-03

Pos logit contributions
ll+0.119
uddle+0.094
castle+0.091
ounce+0.088
 upon+0.084
Neg logit contributions
 Some-0.109
 pain-0.106
 Many-0.098
 emotion-0.090
 blonde-0.090
Token her
Token ID607
Feature activation-7.58e-03

Pos logit contributions
ll+0.003
uddle+0.003
!+0.002
castle+0.002
ounce+0.002
Neg logit contributions
 Some-0.004
 pain-0.003
 Many-0.003
 Amen-0.003
 blonde-0.003
Token.
Token ID13
Feature activation+3.00e-03

Pos logit contributions
 pain+0.012
 blonde+0.011
 blond+0.010
 curly+0.010
 Some+0.010
Neg logit contributions
ll-0.016
ounce-0.012
castle-0.012
uddle-0.011
ounced-0.011
Token saw
Token ID2497
Feature activation+5.67e-03

Pos logit contributions
ll+0.033
uddle+0.031
castle+0.027
ounce+0.026
idents+0.026
Neg logit contributions
 Some-0.023
 Many-0.021
 pain-0.020
 blonde-0.019
 blond-0.019
Token a
Token ID257
Feature activation+1.87e-02

Pos logit contributions
ll+0.012
!+0.010
uddle+0.010
 upon+0.009
castle+0.009
Neg logit contributions
 Some-0.009
 Many-0.009
 pain-0.009
 Amen-0.008
oscope-0.008
Token big
Token ID1263
Feature activation+4.40e-03

Pos logit contributions
ll+0.044
icking+0.033
cast+0.033
 upon+0.033
!+0.033
Neg logit contributions
 Some-0.026
 pain-0.025
 Many-0.024
 Amen-0.023
 curly-0.021
Token dirty
Token ID11841
Feature activation+3.27e-02

Pos logit contributions
ll+0.009
uddle+0.006
 upon+0.006
ounce+0.006
cast+0.006
Neg logit contributions
 pain-0.008
 Some-0.008
 Many-0.007
 Amen-0.007
 emotion-0.007
Token p
Token ID279
Feature activation+3.63e-02

Pos logit contributions
ll+0.070
 upon+0.050
ounce+0.047
cast+0.047
rying+0.045
Neg logit contributions
 pain-0.057
 Some-0.052
 Many-0.050
 x-0.046
oscope-0.046
Tokenuddle
Token ID24500
Feature activation+1.16e-01

Pos logit contributions
ll+0.034
 upon+0.025
!+0.022
!".+0.020
elcome+0.018
Neg logit contributions
 Some-0.084
 Many-0.077
hy-0.074
 pain-0.074
 aids-0.073
Token and
Token ID290
Feature activation+5.68e-03

Pos logit contributions
ll+0.233
uddle+0.184
 upon+0.173
ounce+0.168
 destination+0.167
Neg logit contributions
 Some-0.186
 pain-0.183
oscope-0.162
 Many-0.161
 aids-0.153
Token mom
Token ID1995
Feature activation-1.31e-03

Pos logit contributions
ll+0.012
uddle+0.010
ounce+0.009
castle+0.009
!+0.008
Neg logit contributions
 Some-0.010
 Many-0.009
 pain-0.009
 curly-0.008
 blonde-0.008
Token warned
Token ID7728
Feature activation+3.18e-02

Pos logit contributions
 Some+0.002
 pain+0.002
oscope+0.002
 Many+0.002
 blonde+0.002
Neg logit contributions
ll-0.002
uddle-0.002
castle-0.002
cast-0.002
 destination-0.002
Token Sally
Token ID25737
Feature activation+7.20e-02

Pos logit contributions
ll+0.058
uddle+0.048
!+0.047
 upon+0.044
castle+0.044
Neg logit contributions
 Some-0.061
 Many-0.057
 pain-0.056
 Amen-0.053
oscope-0.050
Token to
Token ID284
Feature activation+2.75e-03

Pos logit contributions
ll+0.117
uddle+0.104
castle+0.095
cast+0.088
!+0.084
Neg logit contributions
 Some-0.148
 Many-0.135
 pain-0.130
oscope-0.122
 aids-0.121
Token landed
Token ID11406
Feature activation+1.15e-02

Pos logit contributions
ll+0.001
uddle+0.000
!+0.000
 upon+0.000
ounce+0.000
Neg logit contributions
 Some-0.000
 pain-0.000
 Many-0.000
 Amen-0.000
oscope-0.000
Token in
Token ID287
Feature activation+6.67e-03

Pos logit contributions
ll+0.024
uddle+0.020
castle+0.019
 upon+0.018
ounce+0.018
Neg logit contributions
 Some-0.019
 pain-0.018
 Many-0.017
 Amen-0.015
ez-0.015
Token a
Token ID257
Feature activation+1.71e-02

Pos logit contributions
ll+0.013
!+0.011
uddle+0.011
 upon+0.011
castle+0.011
Neg logit contributions
 Some-0.011
 Many-0.011
 pain-0.010
 Amen-0.010
oscope-0.009
Token big
Token ID1263
Feature activation+2.23e-03

Pos logit contributions
ll+0.034
!+0.027
uddle+0.027
 upon+0.026
castle+0.026
Neg logit contributions
 Some-0.030
 pain-0.029
 Many-0.028
 Amen-0.027
oscope-0.025
Token p
Token ID279
Feature activation+3.39e-02

Pos logit contributions
ll+0.004
uddle+0.003
!+0.003
 upon+0.003
castle+0.003
Neg logit contributions
 Some-0.004
 pain-0.004
 Many-0.004
 Amen-0.004
oscope-0.004
Tokenuddle
Token ID24500
Feature activation+1.16e-01

Pos logit contributions
ll+0.027
!+0.016
castle+0.014
 ashore+0.013
 upon+0.013
Neg logit contributions
 Some-0.092
 Many-0.084
 pain-0.081
 blonde-0.077
 blond-0.077
Token.
Token ID13
Feature activation+4.14e-03

Pos logit contributions
ll+0.219
uddle+0.169
 upon+0.157
ounce+0.153
 destination+0.149
Neg logit contributions
 Some-0.218
 pain-0.215
 Many-0.198
oscope-0.191
 Amen-0.183
Token Lily
Token ID20037
Feature activation+3.75e-03

Pos logit contributions
ll+0.009
uddle+0.007
!+0.007
castle+0.007
 upon+0.007
Neg logit contributions
 Some-0.007
 Many-0.007
 pain-0.006
 Amen-0.006
 blond-0.006
Token and
Token ID290
Feature activation-1.55e-04

Pos logit contributions
ll+0.008
castle+0.007
 destination+0.007
uddle+0.007
 ashore+0.006
Neg logit contributions
 pain-0.006
 Some-0.005
oscope-0.005
ez-0.005
 Many-0.005
Token Tim
Token ID5045
Feature activation+2.91e-02

Pos logit contributions
 Some+0.000
 pain+0.000
 Many+0.000
 Amen+0.000
oscope+0.000
Neg logit contributions
ll-0.000
uddle-0.000
!-0.000
 upon-0.000
castle-0.000
Token laughed
Token ID13818
Feature activation+5.36e-02

Pos logit contributions
ll+0.055
castle+0.053
uddle+0.050
cast+0.047
 upon+0.045
Neg logit contributions
 pain-0.046
 Some-0.045
 Many-0.043
oscope-0.042
ez-0.041
Token better
Token ID1365
Feature activation-1.20e-02

Pos logit contributions
 Some+0.029
 Many+0.027
 Amen+0.025
 pain+0.024
oscope+0.024
Neg logit contributions
ll-0.030
!-0.027
uddle-0.025
 upon-0.024
castle-0.023
Token and
Token ID290
Feature activation+4.74e-03

Pos logit contributions
 pain+0.023
 Some+0.021
 Amen+0.020
 Many+0.019
 nightmares+0.019
Neg logit contributions
ll-0.019
uddle-0.018
ounce-0.015
castle-0.015
 upon-0.015
Token loves
Token ID10408
Feature activation+5.94e-03

Pos logit contributions
ll+0.009
!+0.007
uddle+0.007
 upon+0.007
castle+0.007
Neg logit contributions
 Some-0.009
 Many-0.008
 pain-0.008
 Amen-0.008
oscope-0.007
Token to
Token ID284
Feature activation+2.52e-03

Pos logit contributions
ll+0.009
uddle+0.007
!+0.007
castle+0.007
 upon+0.006
Neg logit contributions
 Some-0.013
 pain-0.012
 Many-0.012
 Amen-0.012
oscope-0.011
Token c
Token ID269
Feature activation-1.48e-02

Pos logit contributions
ll+0.005
!+0.004
uddle+0.004
 upon+0.004
castle+0.004
Neg logit contributions
 Some-0.005
 Many-0.004
 pain-0.004
 Amen-0.004
 novels-0.004
Tokenuddle
Token ID24500
Feature activation+1.16e-01

Pos logit contributions
 Some+0.043
 Many+0.042
 pain+0.041
 Amen+0.039
 novels+0.038
Neg logit contributions
ll-0.011
!-0.007
castle-0.006
 upon-0.006
uddle-0.005
Token and
Token ID290
Feature activation+3.66e-03

Pos logit contributions
ll+0.180
uddle+0.141
!+0.137
 upon+0.131
castle+0.130
Neg logit contributions
 Some-0.249
 Many-0.235
 pain-0.234
 Amen-0.223
oscope-0.214
Token get
Token ID651
Feature activation-1.12e-02

Pos logit contributions
ll+0.007
!+0.006
uddle+0.006
 upon+0.006
castle+0.005
Neg logit contributions
 Some-0.007
 Many-0.006
 pain-0.006
 Amen-0.006
oscope-0.005
Token lots
Token ID6041
Feature activation+3.27e-02

Pos logit contributions
 Some+0.019
 Many+0.018
 pain+0.017
 Amen+0.016
oscope+0.015
Neg logit contributions
ll-0.023
uddle-0.019
!-0.019
castle-0.018
 upon-0.018
Token of
Token ID286
Feature activation-2.10e-02

Pos logit contributions
ll+0.064
!+0.055
uddle+0.054
 upon+0.051
castle+0.050
Neg logit contributions
 Some-0.057
 Many-0.053
 pain-0.052
 Amen-0.051
oscope-0.047
Token pets
Token ID17252
Feature activation+4.93e-02

Pos logit contributions
 Some+0.037
 Many+0.034
 Amen+0.033
 pain+0.032
oscope+0.030
Neg logit contributions
ll-0.041
!-0.036
uddle-0.034
 upon-0.033
castle-0.030
Tokenashed
Token ID5263
Feature activation+6.08e-02

Pos logit contributions
ll+0.081
!+0.073
castle+0.067
 destination+0.066
 upon+0.064
Neg logit contributions
 Some-0.078
 Many-0.072
ez-0.065
 pain-0.062
 blond-0.061
Token around
Token ID1088
Feature activation+9.43e-04

Pos logit contributions
ll+0.127
uddle+0.103
castle+0.100
ounce+0.097
 destination+0.095
Neg logit contributions
 Some-0.104
 pain-0.102
 Many-0.094
ez-0.084
 emotion-0.084
Token in
Token ID287
Feature activation+8.70e-03

Pos logit contributions
ll+0.002
uddle+0.001
ounce+0.001
castle+0.001
 upon+0.001
Neg logit contributions
 Some-0.002
 pain-0.002
 Many-0.002
 Amen-0.002
oscope-0.002
Token the
Token ID262
Feature activation+1.30e-02

Pos logit contributions
ll+0.018
uddle+0.015
!+0.014
castle+0.014
ounce+0.014
Neg logit contributions
 Some-0.015
 pain-0.014
 Many-0.014
 Amen-0.013
oscope-0.012
Token p
Token ID279
Feature activation+3.56e-02

Pos logit contributions
ll+0.026
ounce+0.019
castle+0.019
!+0.019
 upon+0.019
Neg logit contributions
 pain-0.023
 Some-0.023
 Many-0.022
 blonde-0.019
oscope-0.019
Tokenuddle
Token ID24500
Feature activation+1.16e-01

Pos logit contributions
ll+0.032
!+0.027
 ashore+0.024
!".+0.023
!"+0.021
Neg logit contributions
 Some-0.088
 Many-0.079
 blonde-0.069
 blond-0.068
 pain-0.068
Token for
Token ID329
Feature activation-6.64e-03

Pos logit contributions
ll+0.213
uddle+0.171
ounce+0.159
 upon+0.156
 destination+0.145
Neg logit contributions
 Some-0.215
 pain-0.204
 Many-0.195
oscope-0.179
 aids-0.173
Token a
Token ID257
Feature activation+1.90e-02

Pos logit contributions
 Some+0.012
 Amen+0.012
 Many+0.011
 pain+0.010
oscope+0.010
Neg logit contributions
ll-0.011
!-0.011
 upon-0.009
uddle-0.009
!"-0.009
Token bit
Token ID1643
Feature activation+3.57e-02

Pos logit contributions
ll+0.038
uddle+0.032
!+0.031
castle+0.031
cast+0.030
Neg logit contributions
 Some-0.031
 Many-0.030
 pain-0.030
 Amen-0.027
 blonde-0.025
Token and
Token ID290
Feature activation+3.33e-03

Pos logit contributions
ll+0.063
uddle+0.051
castle+0.050
!+0.048
 upon+0.048
Neg logit contributions
 Some-0.070
 pain-0.066
 Many-0.065
 Amen-0.057
 emotion-0.056
Token was
Token ID373
Feature activation+1.30e-03

Pos logit contributions
ll+0.006
uddle+0.005
!+0.005
castle+0.005
 upon+0.005
Neg logit contributions
 Some-0.006
 pain-0.006
 Many-0.006
 Amen-0.005
oscope-0.005
Token make
Token ID787
Feature activation-2.55e-02

Pos logit contributions
 Some+0.034
 pain+0.032
 Many+0.032
 Amen+0.030
oscope+0.028
Neg logit contributions
ll-0.037
uddle-0.031
!-0.029
 upon-0.028
castle-0.028
Token it
Token ID340
Feature activation-1.48e-02

Pos logit contributions
 Some+0.042
 pain+0.039
 Many+0.039
 x+0.034
 Amen+0.034
Neg logit contributions
ll-0.055
uddle-0.045
 upon-0.044
castle-0.044
ounce-0.043
Token whole
Token ID2187
Feature activation+2.87e-02

Pos logit contributions
 Some+0.023
 pain+0.022
 Many+0.021
 Amen+0.019
oscope+0.019
Neg logit contributions
ll-0.031
uddle-0.026
ounce-0.025
!-0.023
 upon-0.023
Token again
Token ID757
Feature activation-4.85e-02

Pos logit contributions
ll+0.046
uddle+0.036
ounce+0.034
 upon+0.032
castle+0.030
Neg logit contributions
 pain-0.063
 Some-0.061
 Many-0.056
 Amen-0.055
 novels-0.053
Token.
Token ID13
Feature activation-6.20e-03

Pos logit contributions
 Some+0.095
 pain+0.093
 Many+0.089
 Amen+0.087
oscope+0.081
Neg logit contributions
ll-0.079
uddle-0.068
ounce-0.061
castle-0.060
 upon-0.058
Token Amen
Token ID34717
Feature activation-1.78e-01

Pos logit contributions
 pain+0.010
 emotion+0.009
oscope+0.009
 Some+0.009
 chapters+0.009
Neg logit contributions
ll-0.011
!-0.010
!"-0.009
uddle-0.009
 upon-0.009
Token."
Token ID526
Feature activation-1.89e-02

Pos logit contributions
 Some+0.375
 Many+0.354
 pain+0.349
 Amen+0.336
oscope+0.319
Neg logit contributions
ll-0.284
!-0.229
uddle-0.227
 upon-0.211
castle-0.208
Token\n
Token ID198
Feature activation-5.20e-03

Pos logit contributions
 Some+0.031
 Many+0.029
 pain+0.028
 Amen+0.027
oscope+0.024
Neg logit contributions
ll-0.040
uddle-0.034
!-0.033
castle-0.032
 upon-0.031
Token\n
Token ID198
Feature activation-5.24e-03

Pos logit contributions
 Some+0.010
 pain+0.009
 Many+0.009
 Amen+0.009
oscope+0.008
Neg logit contributions
ll-0.010
!-0.008
uddle-0.008
 upon-0.007
ounce-0.007
TokenThen
Token ID6423
Feature activation-3.00e-02

Pos logit contributions
 Some+0.009
 Many+0.009
 pain+0.008
 Amen+0.008
oscope+0.008
Neg logit contributions
ll-0.010
!-0.009
uddle-0.009
 upon-0.008
castle-0.008
Token,
Token ID11
Feature activation-8.53e-03

Pos logit contributions
 Some+0.054
 pain+0.051
 Many+0.049
 Amen+0.047
 novels+0.044
Neg logit contributions
ll-0.061
uddle-0.050
ounce-0.046
!-0.044
castle-0.044
Token Thank
Token ID6952
Feature activation-2.17e-02

Pos logit contributions
ll+0.003
!+0.002
!"+0.002
!".+0.002
 upon+0.002
Neg logit contributions
 pain-0.003
 blonde-0.003
 emotion-0.003
ez-0.003
 blond-0.003
Token you
Token ID345
Feature activation-3.04e-02

Pos logit contributions
 Some+0.057
 Many+0.053
 pain+0.053
 Amen+0.050
 blonde+0.050
Neg logit contributions
ll-0.025
!-0.019
uddle-0.016
 upon-0.015
castle-0.013
Token,
Token ID11
Feature activation-1.88e-03

Pos logit contributions
 Some+0.066
 Many+0.061
 Amen+0.059
 pain+0.058
 novels+0.054
Neg logit contributions
ll-0.048
uddle-0.042
castle-0.040
ounce-0.037
cast-0.036
Token God
Token ID1793
Feature activation-8.28e-02

Pos logit contributions
 Some+0.003
 Many+0.003
 pain+0.003
 Amen+0.003
oscope+0.003
Neg logit contributions
ll-0.004
!-0.003
uddle-0.003
 upon-0.003
castle-0.003
Token.
Token ID13
Feature activation+3.58e-03

Pos logit contributions
 Some+0.167
 Amen+0.158
 pain+0.156
 Many+0.155
oscope+0.148
Neg logit contributions
ll-0.131
uddle-0.120
castle-0.108
ounce-0.107
cast-0.104
Token Amen
Token ID34717
Feature activation-1.61e-01

Pos logit contributions
ll+0.007
!+0.006
!"+0.006
 upon+0.006
!".+0.006
Neg logit contributions
 pain-0.005
 Some-0.005
 blonde-0.005
 chapters-0.005
 emotion-0.005
Token."
Token ID526
Feature activation-3.19e-02

Pos logit contributions
 Some+0.370
 pain+0.353
 Many+0.349
 blonde+0.330
 blond+0.328
Neg logit contributions
ll-0.229
!-0.180
uddle-0.166
 upon-0.159
castle-0.148
Token\n
Token ID198
Feature activation-2.67e-03

Pos logit contributions
 Some+0.062
 pain+0.061
 Many+0.058
oscope+0.058
ez+0.056
Neg logit contributions
ll-0.052
!-0.044
 upon-0.042
uddle-0.038
castle-0.036
Token\n
Token ID198
Feature activation-3.09e-03

Pos logit contributions
 Some+0.006
 pain+0.005
 Many+0.005
 blonde+0.005
oscope+0.005
Neg logit contributions
ll-0.004
!-0.004
uddle-0.003
 upon-0.003
ounce-0.003
TokenHe
Token ID1544
Feature activation+8.05e-03

Pos logit contributions
 Some+0.005
 pain+0.005
 Many+0.005
 emotion+0.004
 blonde+0.004
Neg logit contributions
ll-0.006
!-0.005
uddle-0.005
 upon-0.005
ounce-0.005
Token opened
Token ID4721
Feature activation+2.27e-02

Pos logit contributions
ll+0.015
uddle+0.012
!+0.012
 upon+0.012
castle+0.011
Neg logit contributions
 Some-0.015
 Many-0.014
 pain-0.014
 Amen-0.013
oscope-0.013
Token us
Token ID514
Feature activation-9.99e-04

Pos logit contributions
ll+0.001
uddle+0.001
!+0.001
castle+0.001
 upon+0.001
Neg logit contributions
 Some-0.002
 Many-0.002
 pain-0.002
 Amen-0.002
oscope-0.002
Token safe
Token ID3338
Feature activation+3.43e-02

Pos logit contributions
 pain+0.002
 Amen+0.002
 Some+0.002
oscope+0.001
 Many+0.001
Neg logit contributions
ll-0.002
uddle-0.002
castle-0.002
ounce-0.002
cast-0.001
Token and
Token ID290
Feature activation-6.13e-03

Pos logit contributions
ll+0.058
uddle+0.051
ounce+0.046
castle+0.045
 upon+0.043
Neg logit contributions
 Some-0.066
 pain-0.064
 Amen-0.063
 Many-0.061
oscope-0.058
Token happy
Token ID3772
Feature activation-3.72e-02

Pos logit contributions
 Some+0.011
 Many+0.010
 pain+0.010
 Amen+0.009
oscope+0.009
Neg logit contributions
ll-0.012
!-0.010
uddle-0.010
 upon-0.010
castle-0.009
Token.
Token ID13
Feature activation+5.90e-03

Pos logit contributions
 Some+0.077
 pain+0.073
 Many+0.073
 Amen+0.071
 novels+0.067
Neg logit contributions
ll-0.062
uddle-0.050
ounce-0.047
castle-0.045
!-0.045
Token Amen
Token ID34717
Feature activation-1.54e-01

Pos logit contributions
ll+0.010
!+0.009
!"+0.008
 upon+0.008
!".+0.008
Neg logit contributions
 pain-0.010
 emotion-0.009
 Some-0.009
ez-0.009
growth-0.009
Token."
Token ID526
Feature activation-1.22e-02

Pos logit contributions
 Some+0.361
 pain+0.342
 Many+0.340
 Amen+0.323
 blonde+0.315
Neg logit contributions
ll-0.210
!-0.166
uddle-0.157
 upon-0.147
castle-0.138
Token\n
Token ID198
Feature activation-1.42e-03

Pos logit contributions
 pain+0.025
 Some+0.024
oscope+0.024
 Many+0.023
 emotion+0.023
Neg logit contributions
ll-0.018
!-0.015
 upon-0.015
uddle-0.013
ounce-0.012
Token\n
Token ID198
Feature activation-2.08e-03

Pos logit contributions
 Some+0.003
 pain+0.003
oscope+0.003
 blonde+0.003
 blond+0.003
Neg logit contributions
ll-0.002
!-0.002
 upon-0.002
uddle-0.002
!"-0.001
Token"
Token ID1
Feature activation-2.82e-03

Pos logit contributions
 Some+0.003
 pain+0.003
 Many+0.003
oscope+0.003
 Amen+0.003
Neg logit contributions
ll-0.004
!-0.004
uddle-0.004
 upon-0.004
ounce-0.003
TokenA
Token ID32
Feature activation-1.58e-02

Pos logit contributions
 Some+0.006
 Many+0.005
 pain+0.005
oscope+0.005
 aids+0.005
Neg logit contributions
ll-0.005
!-0.004
 upon-0.003
uddle-0.003
castle-0.003
Token kind
Token ID1611
Feature activation-4.15e-02

Pos logit contributions
 Some+0.033
 Many+0.031
 pain+0.029
 Amen+0.028
oscope+0.027
Neg logit contributions
ll-0.039
!-0.035
uddle-0.032
 upon-0.032
castle-0.031
Token.
Token ID13
Feature activation+7.39e-03

Pos logit contributions
 Some+0.084
 pain+0.077
 Many+0.076
 novels+0.072
 Amen+0.071
Neg logit contributions
ll-0.072
uddle-0.061
castle-0.057
ounce-0.054
cast-0.054
Token Thank
Token ID6952
Feature activation-2.48e-02

Pos logit contributions
ll+0.013
!+0.011
 upon+0.010
!"+0.010
!".+0.010
Neg logit contributions
 pain-0.012
 Some-0.011
 emotion-0.011
 chapters-0.011
 Many-0.011
Token you
Token ID345
Feature activation-3.58e-02

Pos logit contributions
 Some+0.066
 Many+0.062
 pain+0.062
 Amen+0.059
oscope+0.058
Neg logit contributions
ll-0.027
!-0.021
uddle-0.017
 upon-0.015
!"-0.014
Token.
Token ID13
Feature activation+7.02e-03

Pos logit contributions
 Some+0.075
 Many+0.069
 pain+0.064
 Amen+0.063
 novels+0.060
Neg logit contributions
ll-0.060
uddle-0.052
castle-0.051
cast-0.048
ounce-0.046
Token Amen
Token ID34717
Feature activation-1.52e-01

Pos logit contributions
ll+0.011
!+0.010
 upon+0.009
!"+0.009
!".+0.008
Neg logit contributions
 pain-0.013
 Some-0.012
 chapters-0.012
 Many-0.012
 emotion-0.011
Token."
Token ID526
Feature activation-7.74e-04

Pos logit contributions
 Some+0.350
 Many+0.332
 pain+0.327
 Amen+0.317
oscope+0.301
Neg logit contributions
ll-0.215
!-0.167
uddle-0.167
 upon-0.153
castle-0.150
Token\n
Token ID198
Feature activation-1.93e-04

Pos logit contributions
 pain+0.002
 Some+0.002
oscope+0.001
 Amen+0.001
 Many+0.001
Neg logit contributions
ll-0.001
!-0.001
 upon-0.001
uddle-0.001
ounce-0.001
Token\n
Token ID198
Feature activation-1.03e-03

Pos logit contributions
 Some+0.000
 pain+0.000
 Many+0.000
 Amen+0.000
oscope+0.000
Neg logit contributions
ll-0.000
!-0.000
uddle-0.000
 upon-0.000
ounce-0.000
TokenThen
Token ID6423
Feature activation-3.36e-02

Pos logit contributions
 Some+0.002
 Many+0.002
 pain+0.002
 Amen+0.001
oscope+0.001
Neg logit contributions
ll-0.002
!-0.002
uddle-0.002
 upon-0.002
castle-0.002
Token she
Token ID673
Feature activation+9.79e-03

Pos logit contributions
 Some+0.062
 Many+0.058
 pain+0.058
 Amen+0.054
 novels+0.050
Neg logit contributions
ll-0.064
uddle-0.053
!-0.052
castle-0.049
 upon-0.049
Token house
Token ID2156
Feature activation-2.14e-05

Pos logit contributions
ll+0.051
!+0.043
uddle+0.043
 upon+0.041
castle+0.040
Neg logit contributions
 Some-0.044
 Many-0.041
 pain-0.040
 Amen-0.038
oscope-0.035
Token and
Token ID290
Feature activation-8.68e-03

Pos logit contributions
 Some+0.000
 pain+0.000
 Many+0.000
 Amen+0.000
oscope+0.000
Neg logit contributions
ll-0.000
uddle-0.000
ounce-0.000
 upon-0.000
castle-0.000
Token our
Token ID674
Feature activation+2.54e-02

Pos logit contributions
 Some+0.016
 pain+0.016
 Many+0.015
 Amen+0.015
 novels+0.014
Neg logit contributions
ll-0.016
uddle-0.014
!-0.012
ounce-0.012
 upon-0.012
Token family
Token ID1641
Feature activation-1.10e-03

Pos logit contributions
ll+0.048
!+0.040
uddle+0.040
ounce+0.039
 upon+0.038
Neg logit contributions
 Some-0.045
 pain-0.043
 Many-0.042
 Amen-0.039
 novels-0.038
Token.
Token ID13
Feature activation+2.99e-03

Pos logit contributions
 Some+0.002
 pain+0.002
 Many+0.002
 Amen+0.002
oscope+0.002
Neg logit contributions
ll-0.002
uddle-0.002
ounce-0.002
castle-0.002
cast-0.002
Token Amen
Token ID34717
Feature activation-1.52e-01

Pos logit contributions
ll+0.005
!+0.005
!"+0.005
!".+0.004
 upon+0.004
Neg logit contributions
 pain-0.005
 Some-0.004
 Many-0.004
 chapters-0.004
 blonde-0.004
Token."
Token ID526
Feature activation-3.70e-03

Pos logit contributions
 Some+0.357
 Many+0.337
 pain+0.337
 Amen+0.321
 blonde+0.313
Neg logit contributions
ll-0.207
!-0.161
uddle-0.154
 upon-0.143
castle-0.134
Token\n
Token ID198
Feature activation-4.23e-03

Pos logit contributions
 Some+0.007
 pain+0.006
 Many+0.006
 Amen+0.006
oscope+0.006
Neg logit contributions
ll-0.007
!-0.006
uddle-0.006
 upon-0.005
ounce-0.005
Token\n
Token ID198
Feature activation-4.83e-03

Pos logit contributions
 Some+0.009
 pain+0.008
 Many+0.008
oscope+0.008
 blonde+0.008
Neg logit contributions
ll-0.007
!-0.006
uddle-0.005
 upon-0.005
ounce-0.005
TokenMom
Token ID29252
Feature activation-1.82e-02

Pos logit contributions
 Some+0.009
 pain+0.009
 Many+0.009
 Amen+0.008
oscope+0.008
Neg logit contributions
ll-0.009
!-0.007
uddle-0.007
 upon-0.007
ounce-0.006
Token smiled
Token ID13541
Feature activation+1.09e-02

Pos logit contributions
 Some+0.036
 Many+0.033
 pain+0.033
 Amen+0.032
oscope+0.031
Neg logit contributions
ll-0.030
uddle-0.026
!-0.025
castle-0.024
 upon-0.023
Token them
Token ID606
Feature activation+1.60e-02

Pos logit contributions
 Some+0.042
 Many+0.039
 pain+0.037
 Amen+0.037
oscope+0.034
Neg logit contributions
ll-0.044
!-0.038
uddle-0.036
 upon-0.035
castle-0.034
Token and
Token ID290
Feature activation-9.33e-03

Pos logit contributions
ll+0.027
uddle+0.025
castle+0.023
ounce+0.022
cast+0.021
Neg logit contributions
 Some-0.026
 pain-0.026
oscope-0.025
 novels-0.024
 nightmares-0.024
Token never
Token ID1239
Feature activation+1.14e-02

Pos logit contributions
 Some+0.018
 Many+0.017
 pain+0.017
 Amen+0.016
oscope+0.015
Neg logit contributions
ll-0.017
uddle-0.014
!-0.014
 upon-0.013
castle-0.013
Token threw
Token ID9617
Feature activation+2.86e-02

Pos logit contributions
ll+0.021
!+0.018
uddle+0.017
 upon+0.016
castle+0.016
Neg logit contributions
 Some-0.021
 Many-0.020
 pain-0.019
 Amen-0.019
oscope-0.018
Token a
Token ID257
Feature activation+7.36e-03

Pos logit contributions
ll+0.049
uddle+0.040
!+0.040
 upon+0.037
castle+0.037
Neg logit contributions
 Some-0.057
 Many-0.054
 pain-0.053
 Amen-0.051
oscope-0.048
Token tant
Token ID24246
Feature activation-1.51e-01

Pos logit contributions
ll+0.016
!+0.013
uddle+0.013
castle+0.013
 upon+0.012
Neg logit contributions
 Some-0.011
 pain-0.011
 Many-0.010
 Amen-0.009
oscope-0.009
Tokenrum
Token ID6582
Feature activation-2.60e-03

Pos logit contributions
 Some+0.244
 Many+0.222
 pain+0.210
 Amen+0.194
 blonde+0.194
Neg logit contributions
ll-0.336
!-0.286
castle-0.259
uddle-0.257
ies-0.256
Token again
Token ID757
Feature activation-3.78e-02

Pos logit contributions
 Some+0.006
 pain+0.005
 Many+0.005
 Amen+0.005
oscope+0.005
Neg logit contributions
ll-0.004
uddle-0.003
castle-0.003
!-0.003
 upon-0.003
Token.
Token ID13
Feature activation-4.37e-03

Pos logit contributions
 Some+0.078
 pain+0.077
 Many+0.073
 Amen+0.069
 blonde+0.068
Neg logit contributions
ll-0.060
uddle-0.049
castle-0.045
 upon-0.043
ounce-0.042
Token<|endoftext|>
Token ID50256
Feature activation+3.09e-02

Pos logit contributions
 pain+0.008
 Some+0.007
oscope+0.007
 Many+0.007
 Amen+0.007
Neg logit contributions
ll-0.008
!-0.007
 upon-0.007
uddle-0.006
ounce-0.006
TokenOnce
Token ID7454
Feature activation+1.22e-03

Pos logit contributions
ll+0.058
!+0.049
uddle+0.048
 upon+0.045
castle+0.044
Neg logit contributions
 Some-0.057
 Many-0.053
 pain-0.052
 Amen-0.050
oscope-0.047
Token family
Token ID1641
Feature activation-5.17e-04

Pos logit contributions
ll+0.052
!+0.042
uddle+0.042
castle+0.040
ounce+0.040
Neg logit contributions
 Some-0.044
 pain-0.041
 Many-0.041
 Amen-0.039
 novels-0.036
Token and
Token ID290
Feature activation-5.76e-03

Pos logit contributions
 Some+0.001
 Many+0.001
 pain+0.001
 Amen+0.001
oscope+0.001
Neg logit contributions
ll-0.001
uddle-0.001
ounce-0.001
castle-0.001
!-0.001
Token our
Token ID674
Feature activation+2.55e-02

Pos logit contributions
 Some+0.010
 pain+0.010
 Many+0.010
 Amen+0.009
oscope+0.009
Neg logit contributions
ll-0.011
uddle-0.009
!-0.009
castle-0.009
ounce-0.008
Token friends
Token ID2460
Feature activation+1.35e-02

Pos logit contributions
ll+0.052
uddle+0.043
!+0.043
ounce+0.042
castle+0.042
Neg logit contributions
 Some-0.042
 pain-0.040
 Many-0.039
 Amen-0.038
 novels-0.036
Token.
Token ID13
Feature activation+3.70e-03

Pos logit contributions
ll+0.023
uddle+0.020
ounce+0.019
castle+0.018
cast+0.017
Neg logit contributions
 Some-0.026
 Many-0.025
 Amen-0.025
 pain-0.025
oscope-0.023
Token Amen
Token ID34717
Feature activation-1.51e-01

Pos logit contributions
ll+0.007
!+0.006
!"+0.006
 upon+0.006
!".+0.006
Neg logit contributions
 pain-0.006
 emotion-0.005
 Some-0.005
ez-0.005
 chapters-0.005
Token."
Token ID526
Feature activation-2.74e-03

Pos logit contributions
 Some+0.355
 Many+0.336
 pain+0.334
 Amen+0.320
oscope+0.308
Neg logit contributions
ll-0.206
!-0.161
uddle-0.155
 upon-0.143
castle-0.137
Token\n
Token ID198
Feature activation-1.32e-03

Pos logit contributions
 pain+0.007
oscope+0.006
 emotion+0.006
ez+0.006
 bolt+0.006
Neg logit contributions
ll-0.003
 upon-0.003
!-0.003
uddle-0.002
ounce-0.002
Token\n
Token ID198
Feature activation-2.05e-03

Pos logit contributions
 pain+0.003
 Some+0.003
oscope+0.003
 emotion+0.003
 blonde+0.003
Neg logit contributions
ll-0.002
!-0.001
 upon-0.001
uddle-0.001
!"-0.001
TokenAnna
Token ID31160
Feature activation-3.14e-02

Pos logit contributions
 Some+0.004
 pain+0.003
 Many+0.003
oscope+0.003
 emotion+0.003
Neg logit contributions
ll-0.004
!-0.003
 upon-0.003
uddle-0.003
lly-0.003
Token said
Token ID531
Feature activation+2.19e-02

Pos logit contributions
 Some+0.057
 Many+0.054
 pain+0.054
 Amen+0.051
oscope+0.049
Neg logit contributions
ll-0.058
uddle-0.048
!-0.048
castle-0.046
 upon-0.045
Token his
Token ID465
Feature activation-1.56e-02

Pos logit contributions
ll+0.024
!+0.023
 upon+0.021
uddle+0.019
ounce+0.018
Neg logit contributions
 Some-0.038
 Amen-0.037
 Many-0.036
 pain-0.035
oscope-0.034
Token feet
Token ID3625
Feature activation-7.48e-03

Pos logit contributions
 Some+0.027
 pain+0.026
 Many+0.025
 Amen+0.023
oscope+0.023
Neg logit contributions
ll-0.031
uddle-0.026
!-0.026
 upon-0.024
castle-0.024
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
 Some+0.014
 pain+0.014
 Many+0.013
oscope+0.013
 Amen+0.012
Neg logit contributions
ll-0.013
uddle-0.011
castle-0.010
cast-0.009
 upon-0.009
Token threw
Token ID9617
Feature activation+2.50e-02

Pos logit contributions
 Some+0.026
 pain+0.026
 Many+0.024
 Amen+0.023
 blond+0.023
Neg logit contributions
ll-0.029
uddle-0.024
!-0.022
castle-0.022
 upon-0.022
Token a
Token ID257
Feature activation+9.08e-03

Pos logit contributions
ll+0.044
!+0.037
uddle+0.036
 upon+0.034
castle+0.033
Neg logit contributions
 Some-0.048
 Many-0.046
 pain-0.044
 Amen-0.043
oscope-0.041
Token tant
Token ID24246
Feature activation-1.51e-01

Pos logit contributions
ll+0.018
!+0.015
uddle+0.015
 upon+0.014
castle+0.014
Neg logit contributions
 Some-0.015
 Many-0.014
 pain-0.014
 Amen-0.013
oscope-0.013
Tokenrum
Token ID6582
Feature activation-4.97e-03

Pos logit contributions
 Some+0.246
 Many+0.224
 pain+0.207
 blonde+0.196
 Amen+0.195
Neg logit contributions
ll-0.334
!-0.285
uddle-0.256
castle-0.256
ounce-0.250
Token.
Token ID13
Feature activation-6.69e-03

Pos logit contributions
 Some+0.009
 pain+0.009
 Many+0.008
 Amen+0.008
oscope+0.008
Neg logit contributions
ll-0.009
uddle-0.007
ounce-0.007
 upon-0.006
castle-0.006
Token\n
Token ID198
Feature activation-2.05e-03

Pos logit contributions
 Some+0.011
 pain+0.011
 Many+0.011
 Amen+0.011
oscope+0.010
Neg logit contributions
ll-0.013
!-0.011
uddle-0.011
 upon-0.011
ounce-0.010
Token\n
Token ID198
Feature activation-1.56e-03

Pos logit contributions
 Some+0.004
 pain+0.004
 Many+0.004
 Amen+0.003
oscope+0.003
Neg logit contributions
ll-0.004
!-0.003
uddle-0.003
 upon-0.003
ounce-0.003
Token"
Token ID1
Feature activation-1.26e-03

Pos logit contributions
 Some+0.003
 Many+0.003
 pain+0.003
 Amen+0.003
oscope+0.002
Neg logit contributions
ll-0.003
!-0.002
uddle-0.002
 upon-0.002
castle-0.002
Token clown
Token ID25573
Feature activation+9.72e-03

Pos logit contributions
 Some+0.005
 Amen+0.005
 pain+0.005
 Many+0.005
oscope+0.005
Neg logit contributions
ll-0.005
!-0.005
uddle-0.004
 upon-0.004
castle-0.004
Tokens
Token ID82
Feature activation-4.35e-03

Pos logit contributions
ll+0.012
uddle+0.011
castle+0.010
!+0.009
!".+0.009
Neg logit contributions
 Some-0.020
 Many-0.020
ez-0.017
oscope-0.017
 novels-0.017
Token juggling
Token ID48097
Feature activation-1.08e-02

Pos logit contributions
 Some+0.007
 Many+0.007
 pain+0.007
 blonde+0.006
 aids+0.006
Neg logit contributions
ll-0.008
uddle-0.007
rying-0.006
ounce-0.006
castle-0.006
Token and
Token ID290
Feature activation-5.70e-03

Pos logit contributions
 Some+0.019
 Many+0.019
 pain+0.018
 novels+0.016
 blond+0.016
Neg logit contributions
ll-0.022
uddle-0.018
castle-0.016
 upon-0.016
ounce-0.016
Token ac
Token ID936
Feature activation-1.51e-02

Pos logit contributions
 Some+0.010
 Many+0.010
 pain+0.009
 Amen+0.009
 blond+0.009
Neg logit contributions
ll-0.011
uddle-0.010
 upon-0.008
ounce-0.008
!-0.008
Tokenrob
Token ID22609
Feature activation-1.48e-01

Pos logit contributions
 Some+0.018
 Many+0.017
 pain+0.017
ez+0.016
oscope+0.015
Neg logit contributions
ll-0.034
uddle-0.031
 upon-0.031
castle-0.030
!-0.030
Tokenats
Token ID1381
Feature activation+8.29e-02

Pos logit contributions
 Some+0.340
 Many+0.325
 pain+0.316
ez+0.298
 Amen+0.297
Neg logit contributions
ll-0.196
!-0.158
uddle-0.153
 upon-0.148
castle-0.141
Token doing
Token ID1804
Feature activation-2.82e-02

Pos logit contributions
ll+0.152
uddle+0.132
castle+0.112
 upon+0.111
ounce+0.109
Neg logit contributions
 Some-0.157
 Many-0.145
 pain-0.141
oscope-0.131
ez-0.129
Token flips
Token ID45971
Feature activation+7.74e-03

Pos logit contributions
 Some+0.046
 Many+0.043
 pain+0.042
 Amen+0.038
 novels+0.037
Neg logit contributions
ll-0.059
uddle-0.049
!-0.047
 upon-0.046
castle-0.045
Token.
Token ID13
Feature activation-6.26e-03

Pos logit contributions
ll+0.015
uddle+0.012
castle+0.011
ounce+0.010
 upon+0.010
Neg logit contributions
 Some-0.015
 Many-0.014
 pain-0.013
 novels-0.012
 blonde-0.012
Token But
Token ID887
Feature activation+3.33e-04

Pos logit contributions
 pain+0.009
 Some+0.009
 Amen+0.009
 Many+0.008
oscope+0.008
Neg logit contributions
ll-0.013
!-0.011
uddle-0.011
 upon-0.011
ounce-0.010
Token saw
Token ID2497
Feature activation-1.09e-02

Pos logit contributions
 Some+0.037
 Many+0.035
 pain+0.035
 Amen+0.031
 blond+0.031
Neg logit contributions
ll-0.034
uddle-0.028
!-0.027
castle-0.025
ounce-0.025
Token clown
Token ID25573
Feature activation+4.21e-03

Pos logit contributions
 Amen+0.020
 Some+0.020
 pain+0.019
 Many+0.019
oscope+0.018
Neg logit contributions
ll-0.018
!-0.016
uddle-0.016
 upon-0.015
castle-0.015
Tokens
Token ID82
Feature activation+1.46e-03

Pos logit contributions
ll+0.006
uddle+0.005
castle+0.004
 destination+0.004
ert+0.004
Neg logit contributions
 Some-0.009
 Many-0.008
 novels-0.008
oscope-0.007
 pain-0.007
Token and
Token ID290
Feature activation-1.09e-02

Pos logit contributions
ll+0.003
uddle+0.002
ounce+0.002
castle+0.002
ert+0.002
Neg logit contributions
 Some-0.003
 pain-0.002
 Many-0.002
 novels-0.002
 blond-0.002
Token ac
Token ID936
Feature activation-2.25e-02

Pos logit contributions
 Some+0.020
 Many+0.019
 pain+0.018
 Amen+0.018
oscope+0.016
Neg logit contributions
ll-0.020
!-0.017
uddle-0.017
 upon-0.016
castle-0.016
Tokenrob
Token ID22609
Feature activation-1.48e-01

Pos logit contributions
 Some+0.030
 pain+0.027
 Many+0.027
 Amen+0.025
oscope+0.025
Neg logit contributions
ll-0.050
uddle-0.044
 upon-0.044
!-0.044
castle-0.043
Tokenats
Token ID1381
Feature activation+7.58e-02

Pos logit contributions
 Some+0.342
 Many+0.325
 pain+0.319
ez+0.303
 Amen+0.297
Neg logit contributions
ll-0.189
!-0.153
uddle-0.148
 upon-0.145
castle-0.139
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
ll+0.144
uddle+0.123
castle+0.110
 upon+0.104
ounce+0.102
Neg logit contributions
 Some-0.141
 Many-0.130
 pain-0.127
oscope-0.118
ez-0.113
Token lions
Token ID31420
Feature activation-2.28e-02

Pos logit contributions
 Some+0.019
 Many+0.018
 Amen+0.018
 pain+0.017
oscope+0.016
Neg logit contributions
ll-0.018
!-0.016
uddle-0.015
 upon-0.015
castle-0.014
Token and
Token ID290
Feature activation-1.08e-02

Pos logit contributions
 Some+0.044
 pain+0.044
 Many+0.042
oscope+0.039
 novels+0.038
Neg logit contributions
ll-0.039
uddle-0.032
 upon-0.029
castle-0.028
ounce-0.028
Token elephants
Token ID25368
Feature activation-2.33e-03

Pos logit contributions
 Some+0.020
 Many+0.018
 Amen+0.018
 pain+0.018
oscope+0.017
Neg logit contributions
ll-0.019
!-0.017
uddle-0.015
 upon-0.015
castle-0.015
Token time
Token ID640
Feature activation+7.82e-03

Pos logit contributions
ll+0.050
uddle+0.042
!+0.042
 upon+0.040
ounce+0.039
Neg logit contributions
 Some-0.038
 Many-0.035
 pain-0.035
 Amen-0.033
oscope-0.031
Token playing
Token ID2712
Feature activation-1.14e-02

Pos logit contributions
ll+0.014
uddle+0.012
ounce+0.010
castle+0.010
cast+0.010
Neg logit contributions
 Some-0.015
 Many-0.014
 pain-0.014
 Amen-0.013
 blonde-0.013
Token with
Token ID351
Feature activation-3.35e-02

Pos logit contributions
 Some+0.021
 Many+0.020
 pain+0.020
 Amen+0.018
 novels+0.018
Neg logit contributions
ll-0.023
uddle-0.018
ounce-0.016
 upon-0.016
castle-0.015
Token her
Token ID607
Feature activation-2.36e-02

Pos logit contributions
 Some+0.049
 Many+0.045
 pain+0.044
 Amen+0.042
oscope+0.038
Neg logit contributions
ll-0.075
uddle-0.065
!-0.064
 upon-0.061
castle-0.061
Token or
Token ID393
Feature activation-5.22e-03

Pos logit contributions
 Some+0.040
 pain+0.039
 Many+0.037
 novels+0.035
 blonde+0.034
Neg logit contributions
ll-0.046
uddle-0.036
ounce-0.036
!-0.034
castle-0.034
Tokennam
Token ID7402
Feature activation-1.45e-01

Pos logit contributions
 Some+0.009
 pain+0.008
 Many+0.008
 Amen+0.007
 novels+0.007
Neg logit contributions
ll-0.011
uddle-0.009
!-0.009
ounce-0.008
castle-0.008
Tokenents
Token ID658
Feature activation-1.93e-02

Pos logit contributions
 Some+0.289
 Many+0.275
 pain+0.252
 Amen+0.251
 novels+0.241
Neg logit contributions
ll-0.244
uddle-0.206
!-0.198
castle-0.185
 upon-0.178
Token and
Token ID290
Feature activation-8.73e-03

Pos logit contributions
 Some+0.037
 pain+0.034
 Many+0.034
oscope+0.033
 Amen+0.031
Neg logit contributions
ll-0.032
uddle-0.028
ounce-0.024
castle-0.023
 upon-0.023
Token making
Token ID1642
Feature activation-2.94e-02

Pos logit contributions
 Some+0.017
 Many+0.015
 pain+0.015
 Amen+0.015
oscope+0.014
Neg logit contributions
ll-0.016
uddle-0.013
!-0.013
 upon-0.012
castle-0.012
Token her
Token ID607
Feature activation-2.37e-02

Pos logit contributions
 Some+0.052
 Many+0.049
 pain+0.048
 Amen+0.047
oscope+0.043
Neg logit contributions
ll-0.055
!-0.048
uddle-0.047
 upon-0.044
castle-0.043
Token special
Token ID2041
Feature activation+2.25e-04

Pos logit contributions
 Some+0.042
 Many+0.039
 pain+0.038
 Amen+0.036
oscope+0.034
Neg logit contributions
ll-0.045
!-0.040
uddle-0.038
 upon-0.037
castle-0.036
Token stubborn
Token ID24444
Feature activation+7.88e-03

Pos logit contributions
 Some+0.001
 Many+0.001
 Amen+0.001
 pain+0.001
oscope+0.001
Neg logit contributions
ll-0.002
!-0.002
 upon-0.001
uddle-0.001
castle-0.001
Token.
Token ID13
Feature activation-2.87e-03

Pos logit contributions
ll+0.012
uddle+0.010
castle+0.009
!+0.009
 upon+0.009
Neg logit contributions
 Some-0.017
 pain-0.016
 Many-0.016
 Amen-0.015
oscope-0.015
Token She
Token ID1375
Feature activation-6.92e-03

Pos logit contributions
oscope+0.005
 Amen+0.005
ez+0.005
 pain+0.005
 bolt+0.005
Neg logit contributions
!-0.004
ll-0.004
 upon-0.004
!"-0.004
uddle-0.003
Token throws
Token ID12542
Feature activation+1.33e-02

Pos logit contributions
 Some+0.011
 Many+0.011
 Amen+0.010
 pain+0.010
oscope+0.009
Neg logit contributions
ll-0.014
!-0.012
uddle-0.012
 upon-0.011
castle-0.011
Token a
Token ID257
Feature activation+1.32e-02

Pos logit contributions
ll+0.020
!+0.020
 upon+0.017
uddle+0.017
!"+0.016
Neg logit contributions
 Amen-0.025
 Some-0.025
 Many-0.025
 pain-0.023
oscope-0.022
Token tant
Token ID24246
Feature activation-1.45e-01

Pos logit contributions
ll+0.033
!+0.028
uddle+0.028
 upon+0.027
castle+0.027
Neg logit contributions
 Some-0.016
 Many-0.015
 pain-0.015
 Amen-0.013
oscope-0.012
Tokenrum
Token ID6582
Feature activation-2.31e-03

Pos logit contributions
 Some+0.251
 Many+0.227
 pain+0.216
 blonde+0.210
 Amen+0.203
Neg logit contributions
ll-0.303
!-0.262
ies-0.233
!"-0.226
castle-0.225
Token.
Token ID13
Feature activation-2.37e-03

Pos logit contributions
 Some+0.005
 pain+0.005
 Many+0.005
 Amen+0.004
oscope+0.004
Neg logit contributions
ll-0.004
uddle-0.003
!-0.003
castle-0.003
 upon-0.003
Token She
Token ID1375
Feature activation-5.89e-03

Pos logit contributions
 Amen+0.004
oscope+0.004
 pain+0.004
ez+0.004
 emotion+0.004
Neg logit contributions
!-0.004
ll-0.004
 upon-0.004
uddle-0.003
!"-0.003
Token kicks
Token ID17364
Feature activation-2.60e-03

Pos logit contributions
 Some+0.009
 Many+0.008
 pain+0.008
 Amen+0.008
oscope+0.007
Neg logit contributions
ll-0.013
!-0.011
uddle-0.011
 upon-0.011
castle-0.010
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
 Amen+0.005
 Some+0.005
 Many+0.005
 pain+0.005
oscope+0.005
Neg logit contributions
ll-0.004
!-0.004
 upon-0.003
uddle-0.003
!"-0.003
Token and
Token ID290
Feature activation-1.67e-02

Pos logit contributions
ll+0.074
uddle+0.069
castle+0.063
ounce+0.062
ounced+0.059
Neg logit contributions
 pain-0.068
 Some-0.067
 Many-0.062
 Amen-0.062
oscope-0.060
Token play
Token ID711
Feature activation-9.44e-03

Pos logit contributions
 Some+0.029
 pain+0.028
 Many+0.027
 Amen+0.027
oscope+0.024
Neg logit contributions
ll-0.032
uddle-0.028
!-0.026
castle-0.025
ounce-0.025
Token with
Token ID351
Feature activation-3.68e-02

Pos logit contributions
 Some+0.019
 Many+0.018
 pain+0.018
 Amen+0.017
 novels+0.016
Neg logit contributions
ll-0.017
uddle-0.013
ounce-0.012
 upon-0.012
castle-0.012
Token your
Token ID534
Feature activation-2.38e-03

Pos logit contributions
 Some+0.061
 pain+0.057
 Many+0.056
 Amen+0.052
 novels+0.050
Neg logit contributions
ll-0.079
uddle-0.065
!-0.062
ounce-0.062
 upon-0.061
Token or
Token ID393
Feature activation-1.34e-02

Pos logit contributions
 Some+0.004
 pain+0.004
 Many+0.004
 Amen+0.004
 novels+0.004
Neg logit contributions
ll-0.004
!-0.004
uddle-0.003
ounce-0.003
 upon-0.003
Tokennam
Token ID7402
Feature activation-1.44e-01

Pos logit contributions
 Some+0.024
 pain+0.024
 Many+0.023
 Amen+0.021
 novels+0.021
Neg logit contributions
ll-0.026
uddle-0.022
!-0.021
ounce-0.019
castle-0.019
Tokenents
Token ID658
Feature activation-1.70e-02

Pos logit contributions
 Some+0.288
 Many+0.273
 pain+0.253
 Amen+0.250
 novels+0.239
Neg logit contributions
ll-0.241
uddle-0.202
!-0.198
castle-0.183
 upon-0.179
Token.
Token ID13
Feature activation-8.67e-03

Pos logit contributions
 Some+0.030
 pain+0.028
 Many+0.027
oscope+0.027
 novels+0.026
Neg logit contributions
ll-0.029
uddle-0.027
ounce-0.023
castle-0.022
 upon-0.021
Token Maybe
Token ID6674
Feature activation-4.06e-02

Pos logit contributions
 pain+0.014
 Some+0.012
 emotion+0.012
oscope+0.012
 Many+0.011
Neg logit contributions
ll-0.017
!-0.015
 upon-0.015
uddle-0.013
!"-0.013
Token you
Token ID345
Feature activation-2.60e-02

Pos logit contributions
 Some+0.085
 Many+0.080
 pain+0.079
 Amen+0.076
oscope+0.072
Neg logit contributions
ll-0.066
uddle-0.053
!-0.053
 upon-0.049
castle-0.048
Token can
Token ID460
Feature activation+3.29e-03

Pos logit contributions
 Some+0.041
 Many+0.038
 pain+0.038
 Amen+0.036
oscope+0.035
Neg logit contributions
ll-0.053
uddle-0.047
castle-0.044
ounce-0.042
!-0.042
Token thought
Token ID1807
Feature activation+4.16e-03

Pos logit contributions
 Some+0.001
 Many+0.001
 pain+0.001
oscope+0.001
 blonde+0.001
Neg logit contributions
ll-0.001
uddle-0.001
castle-0.001
!-0.001
 upon-0.001
Token they
Token ID484
Feature activation+9.37e-03

Pos logit contributions
ll+0.006
!+0.005
 upon+0.005
uddle+0.005
castle+0.004
Neg logit contributions
 Amen-0.009
 Some-0.009
 Many-0.009
 pain-0.008
oscope-0.008
Token looked
Token ID3114
Feature activation-3.17e-04

Pos logit contributions
ll+0.014
!+0.011
uddle+0.011
 upon+0.010
castle+0.010
Neg logit contributions
 Some-0.021
 Many-0.020
 pain-0.019
 Amen-0.019
oscope-0.018
Token like
Token ID588
Feature activation+6.48e-02

Pos logit contributions
 Some+0.000
 pain+0.000
 Many+0.000
 blonde+0.000
 novels+0.000
Neg logit contributions
ll-0.001
castle-0.001
uddle-0.001
ounce-0.001
ert-0.001
Token ac
Token ID936
Feature activation-1.63e-02

Pos logit contributions
ll+0.139
uddle+0.118
!+0.115
 upon+0.110
castle+0.110
Neg logit contributions
 Some-0.105
 Many-0.097
 pain-0.095
 Amen-0.087
 novels-0.083
Tokenrob
Token ID22609
Feature activation-1.44e-01

Pos logit contributions
 Some+0.028
 Many+0.026
 pain+0.026
 Amen+0.025
oscope+0.022
Neg logit contributions
ll-0.034
!-0.029
uddle-0.028
 upon-0.026
castle-0.026
Tokenats
Token ID1381
Feature activation+8.29e-02

Pos logit contributions
 Some+0.334
 Many+0.318
 pain+0.311
 Amen+0.296
oscope+0.286
Neg logit contributions
ll-0.194
!-0.153
uddle-0.149
 upon-0.141
castle-0.137
Token.
Token ID13
Feature activation-3.32e-03

Pos logit contributions
ll+0.145
uddle+0.122
castle+0.111
 upon+0.108
ounce+0.105
Neg logit contributions
 Some-0.161
 pain-0.149
 Many-0.148
oscope-0.137
 Amen-0.136
Token She
Token ID1375
Feature activation+8.40e-04

Pos logit contributions
 pain+0.004
 Some+0.004
 Amen+0.004
 Many+0.004
oscope+0.004
Neg logit contributions
ll-0.008
!-0.007
uddle-0.007
 upon-0.007
ounce-0.006
Token enjoyed
Token ID8359
Feature activation+3.06e-02

Pos logit contributions
ll+0.002
uddle+0.001
!+0.001
castle+0.001
ounce+0.001
Neg logit contributions
 Some-0.001
 Many-0.001
 pain-0.001
 Amen-0.001
 blonde-0.001
Token the
Token ID262
Feature activation+7.10e-03

Pos logit contributions
ll+0.047
!+0.040
 upon+0.035
uddle+0.035
castle+0.034
Neg logit contributions
 Some-0.067
 Many-0.063
 pain-0.062
 Amen-0.060
oscope-0.058
Token elephants
Token ID25368
Feature activation+7.75e-03

Pos logit contributions
 Some+0.000
 Many+0.000
 pain+0.000
 Amen+0.000
oscope+0.000
Neg logit contributions
ll-0.000
uddle-0.000
!-0.000
castle-0.000
 upon-0.000
Token,
Token ID11
Feature activation-2.86e-04

Pos logit contributions
ll+0.014
uddle+0.012
castle+0.011
 upon+0.010
ounce+0.010
Neg logit contributions
 Some-0.015
 Many-0.014
 pain-0.014
oscope-0.012
 novels-0.012
Token lions
Token ID31420
Feature activation-1.58e-02

Pos logit contributions
 Some+0.001
 Amen+0.001
 Many+0.001
 pain+0.001
 aids+0.001
Neg logit contributions
ll-0.000
!-0.000
 upon-0.000
uddle-0.000
castle-0.000
Token and
Token ID290
Feature activation-7.15e-03

Pos logit contributions
 Some+0.029
 pain+0.028
 Many+0.028
oscope+0.025
 blond+0.025
Neg logit contributions
ll-0.028
uddle-0.024
 upon-0.022
castle-0.021
ounce-0.021
Token ac
Token ID936
Feature activation-1.83e-02

Pos logit contributions
 Some+0.013
 Amen+0.013
 pain+0.013
 Many+0.012
oscope+0.012
Neg logit contributions
ll-0.011
!-0.010
 upon-0.009
uddle-0.009
castle-0.009
Tokenrob
Token ID22609
Feature activation-1.44e-01

Pos logit contributions
 Some+0.024
 Many+0.022
 pain+0.021
 Amen+0.019
oscope+0.019
Neg logit contributions
ll-0.041
 upon-0.037
uddle-0.037
castle-0.036
!-0.036
Tokenats
Token ID1381
Feature activation+8.16e-02

Pos logit contributions
 Some+0.331
 Many+0.317
 pain+0.307
ez+0.292
 Amen+0.288
Neg logit contributions
ll-0.186
!-0.149
uddle-0.147
 upon-0.142
castle-0.138
Token.
Token ID13
Feature activation-1.18e-03

Pos logit contributions
ll+0.164
uddle+0.147
castle+0.127
 upon+0.123
ounce+0.121
Neg logit contributions
 Some-0.140
 Many-0.129
 pain-0.117
oscope-0.117
 aids-0.110
Token They
Token ID1119
Feature activation-1.27e-03

Pos logit contributions
 Amen+0.002
 pain+0.002
oscope+0.002
ez+0.002
 Some+0.002
Neg logit contributions
ll-0.002
!-0.002
uddle-0.002
 upon-0.002
ounce-0.002
Token are
Token ID389
Feature activation+4.40e-03

Pos logit contributions
 Some+0.002
 Many+0.002
 pain+0.002
 Amen+0.002
 blond+0.001
Neg logit contributions
ll-0.003
uddle-0.003
ounce-0.002
!-0.002
castle-0.002
Token amazed
Token ID24562
Feature activation+1.81e-02

Pos logit contributions
ll+0.009
uddle+0.008
!+0.008
ounce+0.007
castle+0.007
Neg logit contributions
 Some-0.007
 pain-0.007
 Many-0.006
 Amen-0.006
 blond-0.006
Token listen
Token ID6004
Feature activation+1.07e-02

Pos logit contributions
ll+0.028
!+0.026
 upon+0.023
ounce+0.022
uddle+0.022
Neg logit contributions
 Some-0.025
 Many-0.023
 Amen-0.022
 pain-0.021
 novels-0.021
Token.
Token ID13
Feature activation+2.58e-04

Pos logit contributions
ll+0.014
uddle+0.011
!+0.010
castle+0.010
 upon+0.009
Neg logit contributions
 Some-0.026
 Many-0.024
 pain-0.024
 Amen-0.023
oscope-0.022
Token She
Token ID1375
Feature activation-4.47e-03

Pos logit contributions
ll+0.000
!+0.000
uddle+0.000
 upon+0.000
ounce+0.000
Neg logit contributions
 Amen-0.000
oscope-0.000
 pain-0.000
ez-0.000
 Some-0.000
Token threw
Token ID9617
Feature activation+2.74e-02

Pos logit contributions
 Some+0.008
 pain+0.008
 Many+0.007
 Amen+0.007
oscope+0.007
Neg logit contributions
ll-0.008
uddle-0.007
!-0.007
 upon-0.007
castle-0.007
Token a
Token ID257
Feature activation+1.12e-02

Pos logit contributions
!+0.036
ll+0.033
 upon+0.031
uddle+0.029
 home+0.028
Neg logit contributions
 Amen-0.061
 Some-0.057
 Many-0.056
oscope-0.051
 blonde-0.050
Token tant
Token ID24246
Feature activation-1.43e-01

Pos logit contributions
ll+0.029
!+0.025
uddle+0.025
 upon+0.024
castle+0.024
Neg logit contributions
 Some-0.013
 Many-0.012
 pain-0.012
 Amen-0.011
oscope-0.010
Tokenrum
Token ID6582
Feature activation-1.20e-03

Pos logit contributions
 Some+0.256
 Many+0.238
 blonde+0.226
 pain+0.226
 blond+0.215
Neg logit contributions
ll-0.271
!-0.242
ies-0.220
!"-0.207
ounce-0.197
Token in
Token ID287
Feature activation+1.83e-03

Pos logit contributions
 Some+0.003
 Many+0.003
 pain+0.003
 Amen+0.002
oscope+0.002
Neg logit contributions
ll-0.002
uddle-0.001
!-0.001
 upon-0.001
castle-0.001
Token the
Token ID262
Feature activation+5.93e-03

Pos logit contributions
!+0.003
ll+0.002
 upon+0.002
!"+0.002
 home+0.002
Neg logit contributions
 Amen-0.004
 Some-0.004
 Many-0.004
oscope-0.003
 chapters-0.003
Token store
Token ID3650
Feature activation-2.41e-02

Pos logit contributions
ll+0.011
!+0.008
uddle+0.008
castle+0.008
 upon+0.008
Neg logit contributions
 Some-0.012
 pain-0.011
 Many-0.011
 Amen-0.010
oscope-0.010
Token.
Token ID13
Feature activation-2.18e-04

Pos logit contributions
 Some+0.048
 pain+0.045
 Many+0.044
oscope+0.041
 novels+0.039
Neg logit contributions
ll-0.042
uddle-0.034
 upon-0.031
castle-0.030
ounce-0.030
Token She
Token ID1375
Feature activation-4.86e-03

Pos logit contributions
 pain+0.005
 Some+0.005
 Amen+0.005
 Many+0.005
oscope+0.005
Neg logit contributions
ll-0.010
uddle-0.009
!-0.009
 upon-0.008
ounce-0.008
Token cried
Token ID16896
Feature activation-1.78e-02

Pos logit contributions
 Some+0.010
 Many+0.009
 pain+0.009
 Amen+0.009
oscope+0.008
Neg logit contributions
ll-0.008
uddle-0.007
!-0.007
 upon-0.006
castle-0.006
Token and
Token ID290
Feature activation-6.10e-03

Pos logit contributions
 Amen+0.049
 Some+0.047
 Many+0.046
 piping+0.045
growth+0.044
Neg logit contributions
!-0.014
ll-0.013
 upon-0.009
uddle-0.008
 home-0.007
Token threw
Token ID9617
Feature activation+2.42e-02

Pos logit contributions
 Some+0.013
 Many+0.013
 Amen+0.012
 pain+0.012
oscope+0.011
Neg logit contributions
ll-0.009
!-0.008
uddle-0.007
 upon-0.007
castle-0.006
Token a
Token ID257
Feature activation+5.03e-03

Pos logit contributions
!+0.036
 home+0.029
ll+0.029
 upon+0.029
uddle+0.027
Neg logit contributions
 Amen-0.049
 Some-0.047
 Many-0.046
 chapters-0.041
oscope-0.040
Token tant
Token ID24246
Feature activation-1.43e-01

Pos logit contributions
ll+0.012
uddle+0.011
!+0.011
 upon+0.010
castle+0.010
Neg logit contributions
 Some-0.007
 Many-0.006
 pain-0.006
 Amen-0.006
oscope-0.005
Tokenrum
Token ID6582
Feature activation-6.90e-03

Pos logit contributions
 Some+0.254
 Many+0.233
 pain+0.225
 blonde+0.221
 blond+0.213
Neg logit contributions
ll-0.273
!-0.246
ies-0.219
!"-0.208
ounce-0.200
Token.
Token ID13
Feature activation-6.32e-03

Pos logit contributions
 Some+0.016
 Many+0.015
 pain+0.015
 Amen+0.014
oscope+0.013
Neg logit contributions
ll-0.010
uddle-0.008
!-0.008
 upon-0.007
castle-0.007
Token Her
Token ID2332
Feature activation-4.63e-02

Pos logit contributions
 pain+0.009
 Amen+0.009
oscope+0.008
ez+0.008
 emotion+0.008
Neg logit contributions
ll-0.013
uddle-0.012
!-0.012
 upon-0.012
ounce-0.011
Token mom
Token ID1995
Feature activation-1.55e-02

Pos logit contributions
 Some+0.100
 Many+0.095
 Amen+0.092
 pain+0.087
oscope+0.086
Neg logit contributions
ll-0.072
uddle-0.062
!-0.059
castle-0.056
 upon-0.055
Token tried
Token ID3088
Feature activation-3.57e-03

Pos logit contributions
 Some+0.032
 Many+0.030
 pain+0.030
 Amen+0.028
oscope+0.027
Neg logit contributions
ll-0.025
uddle-0.021
!-0.020
castle-0.019
 upon-0.019
Token her
Token ID607
Feature activation-1.52e-02

Pos logit contributions
 Some+0.034
 Many+0.032
 Amen+0.031
 pain+0.030
oscope+0.028
Neg logit contributions
ll-0.034
!-0.031
uddle-0.029
 upon-0.028
ounce-0.027
Token feet
Token ID3625
Feature activation-7.74e-03

Pos logit contributions
 Some+0.026
 pain+0.025
 Many+0.024
 Amen+0.022
oscope+0.022
Neg logit contributions
ll-0.031
!-0.025
uddle-0.025
castle-0.024
 upon-0.024
Token and
Token ID290
Feature activation-3.77e-03

Pos logit contributions
 Some+0.016
 pain+0.015
 Many+0.015
oscope+0.014
 Amen+0.014
Neg logit contributions
ll-0.013
uddle-0.010
castle-0.009
!-0.009
 upon-0.009
Token threw
Token ID9617
Feature activation+2.83e-02

Pos logit contributions
 Some+0.007
 pain+0.007
 Many+0.007
 Amen+0.006
oscope+0.006
Neg logit contributions
ll-0.007
uddle-0.006
!-0.005
castle-0.005
 upon-0.005
Token a
Token ID257
Feature activation+1.07e-02

Pos logit contributions
ll+0.044
!+0.042
 upon+0.037
uddle+0.037
ounce+0.034
Neg logit contributions
 Some-0.054
 Amen-0.053
 Many-0.053
 pain-0.049
oscope-0.047
Token tant
Token ID24246
Feature activation-1.42e-01

Pos logit contributions
ll+0.030
!+0.026
uddle+0.026
 upon+0.025
castle+0.025
Neg logit contributions
 Some-0.010
 Many-0.009
 pain-0.008
 Amen-0.007
oscope-0.006
Tokenrum
Token ID6582
Feature activation-1.99e-03

Pos logit contributions
 Some+0.259
 Many+0.238
 pain+0.222
 blonde+0.215
 blond+0.208
Neg logit contributions
ll-0.275
!-0.244
ies-0.215
!"-0.206
ys-0.199
Token.
Token ID13
Feature activation-2.00e-03

Pos logit contributions
 Some+0.004
 pain+0.004
 Many+0.004
 Amen+0.004
 novels+0.004
Neg logit contributions
ll-0.003
uddle-0.003
castle-0.002
 upon-0.002
ounce-0.002
Token She
Token ID1375
Feature activation-3.91e-03

Pos logit contributions
 Amen+0.003
 pain+0.002
oscope+0.002
 Some+0.002
 Many+0.002
Neg logit contributions
ll-0.005
uddle-0.004
!-0.004
 upon-0.004
ounce-0.004
Token grabbed
Token ID13176
Feature activation+2.76e-02

Pos logit contributions
 Some+0.007
 pain+0.007
 Many+0.007
 Amen+0.006
oscope+0.006
Neg logit contributions
ll-0.007
uddle-0.006
!-0.006
castle-0.006
 upon-0.006
Token the
Token ID262
Feature activation+5.25e-03

Pos logit contributions
!+0.035
ll+0.029
 upon+0.029
 exactly+0.027
 home+0.027
Neg logit contributions
 Amen-0.065
 Many-0.056
 Some-0.055
oscope-0.053
 sleeps-0.053
Token kept
Token ID4030
Feature activation+3.20e-02

Pos logit contributions
ll+0.002
!+0.002
uddle+0.002
 upon+0.001
castle+0.001
Neg logit contributions
 Some-0.003
 Many-0.002
 pain-0.002
 Amen-0.002
oscope-0.002
Token crying
Token ID13774
Feature activation-5.15e-02

Pos logit contributions
ll+0.052
!+0.047
 upon+0.040
uddle+0.039
ounce+0.038
Neg logit contributions
 Some-0.066
 Many-0.065
 Amen-0.058
 pain-0.055
oscope-0.054
Token and
Token ID290
Feature activation-1.98e-03

Pos logit contributions
 Some+0.130
 Many+0.127
 Amen+0.125
 pain+0.118
oscope+0.116
Neg logit contributions
ll-0.059
!-0.053
uddle-0.044
 upon-0.041
ounce-0.035
Token throwing
Token ID9644
Feature activation+3.15e-04

Pos logit contributions
 Some+0.004
 Many+0.004
 pain+0.003
 Amen+0.003
oscope+0.003
Neg logit contributions
ll-0.004
!-0.003
uddle-0.003
 upon-0.003
castle-0.003
Token a
Token ID257
Feature activation+9.29e-03

Pos logit contributions
!+0.000
ll+0.000
uddle+0.000
 upon+0.000
!"+0.000
Neg logit contributions
 Amen-0.001
 Many-0.001
 Some-0.001
 chapters-0.001
oscope-0.001
Token tant
Token ID24246
Feature activation-1.41e-01

Pos logit contributions
ll+0.022
!+0.019
uddle+0.019
 upon+0.019
castle+0.018
Neg logit contributions
 Some-0.012
 Many-0.011
 pain-0.011
 Amen-0.010
oscope-0.009
Tokenrum
Token ID6582
Feature activation-2.43e-03

Pos logit contributions
 Some+0.259
 Many+0.237
 pain+0.217
 blonde+0.217
 blond+0.211
Neg logit contributions
ll-0.272
!-0.242
ies-0.210
!"-0.203
ounce-0.197
Token.
Token ID13
Feature activation-3.22e-03

Pos logit contributions
 Some+0.006
 Many+0.005
 pain+0.005
 Amen+0.005
oscope+0.005
Neg logit contributions
ll-0.003
!-0.003
uddle-0.003
 upon-0.002
castle-0.002
Token\n
Token ID198
Feature activation+3.65e-04

Pos logit contributions
 Amen+0.006
 pain+0.006
oscope+0.006
 Some+0.005
ez+0.005
Neg logit contributions
ll-0.006
 upon-0.005
!-0.005
uddle-0.005
ounce-0.004
Token\n
Token ID198
Feature activation+2.19e-04

Pos logit contributions
ll+0.001
!+0.001
 upon+0.000
uddle+0.000
ounce+0.000
Neg logit contributions
 Some-0.001
 pain-0.001
 Amen-0.001
 Many-0.001
oscope-0.001
TokenFinally
Token ID11158
Feature activation+1.52e-02

Pos logit contributions
ll+0.000
!+0.000
uddle+0.000
 upon+0.000
ounce+0.000
Neg logit contributions
 Some-0.000
 Amen-0.000
 pain-0.000
 Many-0.000
oscope-0.000
Token the
Token ID262
Feature activation-1.49e-03

Pos logit contributions
ll+0.037
uddle+0.031
!+0.029
castle+0.029
ounce+0.028
Neg logit contributions
 Some-0.032
 Many-0.030
 pain-0.028
 Amen-0.027
 novels-0.026
Token animals
Token ID4695
Feature activation-2.10e-03

Pos logit contributions
 Some+0.002
 Many+0.002
 novels+0.002
 pain+0.002
 Amen+0.002
Neg logit contributions
ll-0.003
uddle-0.003
ounce-0.003
icking-0.003
castle-0.003
Token,
Token ID11
Feature activation-7.18e-04

Pos logit contributions
 Some+0.004
 Many+0.004
 pain+0.004
 Amen+0.004
oscope+0.003
Neg logit contributions
ll-0.004
uddle-0.003
ounce-0.003
!-0.003
 upon-0.003
Token the
Token ID262
Feature activation-1.29e-03

Pos logit contributions
 Some+0.001
 pain+0.001
 Many+0.001
 Amen+0.001
oscope+0.001
Neg logit contributions
ll-0.001
!-0.001
 upon-0.001
uddle-0.001
castle-0.001
Token ac
Token ID936
Feature activation-2.03e-02

Pos logit contributions
 Some+0.002
 Many+0.002
 pain+0.002
 novels+0.002
 Amen+0.002
Neg logit contributions
ll-0.003
uddle-0.002
!-0.002
ounce-0.002
castle-0.002
Tokenrob
Token ID22609
Feature activation-1.41e-01

Pos logit contributions
 Some+0.028
 Many+0.026
 pain+0.025
 Amen+0.024
oscope+0.023
Neg logit contributions
ll-0.045
!-0.039
uddle-0.039
 upon-0.038
castle-0.037
Tokenats
Token ID1381
Feature activation+7.87e-02

Pos logit contributions
 Some+0.322
 Many+0.308
 pain+0.297
 Amen+0.280
ez+0.280
Neg logit contributions
ll-0.177
!-0.150
 upon-0.142
uddle-0.141
castle-0.130
Token and
Token ID290
Feature activation-7.25e-03

Pos logit contributions
ll+0.131
uddle+0.111
 upon+0.099
castle+0.099
ounce+0.098
Neg logit contributions
 Some-0.160
 Many-0.152
 pain-0.141
oscope-0.135
 Amen-0.131
Token the
Token ID262
Feature activation-4.16e-04

Pos logit contributions
 Some+0.014
 Many+0.013
 pain+0.013
 Amen+0.012
oscope+0.012
Neg logit contributions
ll-0.013
!-0.011
uddle-0.010
 upon-0.010
castle-0.010
Token clown
Token ID25573
Feature activation+7.27e-03

Pos logit contributions
 Some+0.001
 Many+0.001
 pain+0.001
 Amen+0.001
oscope+0.001
Neg logit contributions
ll-0.001
uddle-0.001
!-0.001
ounce-0.001
castle-0.001
Tokens
Token ID82
Feature activation+3.57e-03

Pos logit contributions
ll+0.010
uddle+0.008
 upon+0.007
castle+0.007
ounce+0.007
Neg logit contributions
 Some-0.016
 Many-0.015
 novels-0.014
oscope-0.014
 pain-0.013
Token you
Token ID345
Feature activation-2.67e-02

Pos logit contributions
 Some+0.059
 Many+0.056
 pain+0.055
 Amen+0.054
oscope+0.051
Neg logit contributions
ll-0.032
!-0.025
uddle-0.024
 upon-0.022
castle-0.021
Token,
Token ID11
Feature activation+2.70e-03

Pos logit contributions
 Some+0.047
 Many+0.044
 pain+0.044
 Amen+0.038
oscope+0.037
Neg logit contributions
ll-0.046
castle-0.044
uddle-0.043
cast-0.041
ounce-0.038
Token Ben
Token ID3932
Feature activation+6.67e-03

Pos logit contributions
ll+0.005
uddle+0.004
castle+0.004
!+0.004
ounce+0.004
Neg logit contributions
 Some-0.005
 pain-0.005
 Many-0.005
 Amen-0.004
 novels-0.004
Token.
Token ID13
Feature activation+9.60e-03

Pos logit contributions
ll+0.013
uddle+0.012
castle+0.011
ounce+0.010
cast+0.010
Neg logit contributions
 pain-0.011
 Some-0.011
 Amen-0.010
oscope-0.010
 Many-0.009
Token You
Token ID921
Feature activation-3.04e-02

Pos logit contributions
ll+0.018
!+0.016
 upon+0.014
uddle+0.013
!"+0.013
Neg logit contributions
 pain-0.016
 Some-0.015
 Many-0.015
oscope-0.014
 chapters-0.014
Token both
Token ID1111
Feature activation+8.28e-02

Pos logit contributions
 Some+0.055
 Many+0.051
 pain+0.050
 Amen+0.049
oscope+0.045
Neg logit contributions
ll-0.059
!-0.049
uddle-0.048
 upon-0.046
castle-0.045
Token learned
Token ID4499
Feature activation+1.81e-02

Pos logit contributions
ll+0.168
!+0.144
uddle+0.140
 upon+0.134
castle+0.130
Neg logit contributions
 Some-0.140
 Many-0.129
 pain-0.126
 Amen-0.122
oscope-0.112
Token something
Token ID1223
Feature activation-5.73e-03

Pos logit contributions
ll+0.039
uddle+0.033
castle+0.033
ounce+0.031
 upon+0.030
Neg logit contributions
 Some-0.029
 pain-0.027
 Many-0.026
 novels-0.025
 Amen-0.023
Token today
Token ID1909
Feature activation-2.43e-02

Pos logit contributions
 Some+0.008
 pain+0.007
 Many+0.007
 novels+0.007
 x+0.007
Neg logit contributions
ll-0.012
castle-0.011
 destination-0.010
ounce-0.010
pit-0.010
Token.
Token ID13
Feature activation+7.36e-03

Pos logit contributions
 Some+0.044
 pain+0.044
 Many+0.041
oscope+0.039
 Amen+0.038
Neg logit contributions
ll-0.042
uddle-0.036
ounce-0.034
castle-0.034
 upon-0.034
Token Sometimes
Token ID8975
Feature activation-5.47e-02

Pos logit contributions
ll+0.013
!+0.011
 upon+0.010
!"+0.010
!".+0.010
Neg logit contributions
 pain-0.012
 Some-0.011
 gag-0.011
 Many-0.011
 chapters-0.011
Token a
Token ID257
Feature activation+1.49e-02

Pos logit contributions
 Some+0.020
 pain+0.019
 Many+0.018
 Amen+0.017
 novels+0.016
Neg logit contributions
ll-0.025
uddle-0.021
castle-0.019
!-0.019
ounce-0.019
Token plan
Token ID1410
Feature activation-1.09e-02

Pos logit contributions
ll+0.033
castle+0.027
uddle+0.027
!+0.026
cast+0.026
Neg logit contributions
 Some-0.021
 pain-0.021
 Many-0.020
 Amen-0.019
 x-0.018
Token in
Token ID287
Feature activation+7.81e-03

Pos logit contributions
 Some+0.018
 pain+0.018
oscope+0.017
 Many+0.017
 novels+0.016
Neg logit contributions
ll-0.020
uddle-0.019
castle-0.016
cast-0.016
ounce-0.015
Token case
Token ID1339
Feature activation-7.25e-03

Pos logit contributions
ll+0.015
!+0.013
uddle+0.012
 upon+0.012
castle+0.011
Neg logit contributions
 Some-0.014
 Many-0.013
 pain-0.013
 Amen-0.012
oscope-0.012
Token something
Token ID1223
Feature activation-1.78e-03

Pos logit contributions
 Some+0.013
 pain+0.012
 Many+0.012
oscope+0.010
 novels+0.010
Neg logit contributions
ll-0.014
uddle-0.012
castle-0.011
ounce-0.010
cast-0.010
Token like
Token ID588
Feature activation+6.87e-02

Pos logit contributions
 pain+0.003
 Some+0.003
 Many+0.002
 blonde+0.002
 nightmares+0.002
Neg logit contributions
ll-0.004
uddle-0.003
castle-0.003
ounce-0.003
 upon-0.003
Token this
Token ID428
Feature activation-8.51e-03

Pos logit contributions
ll+0.151
uddle+0.128
!+0.120
castle+0.119
ounce+0.118
Neg logit contributions
 Some-0.109
 pain-0.100
 Many-0.098
 Amen-0.088
 novels-0.086
Token happened
Token ID3022
Feature activation-1.77e-03

Pos logit contributions
 Some+0.016
 pain+0.016
 Many+0.015
 Amen+0.014
oscope+0.014
Neg logit contributions
ll-0.016
uddle-0.013
castle-0.012
ounce-0.012
 upon-0.011
Token.
Token ID13
Feature activation+3.16e-03

Pos logit contributions
 pain+0.003
 Some+0.003
oscope+0.003
 Amen+0.003
ez+0.003
Neg logit contributions
ll-0.003
uddle-0.003
castle-0.003
ounce-0.002
cast-0.002
Token She
Token ID1375
Feature activation+4.14e-03

Pos logit contributions
ll+0.007
!+0.006
uddle+0.006
 upon+0.005
castle+0.005
Neg logit contributions
 Some-0.005
 Many-0.005
 pain-0.005
 Amen-0.004
oscope-0.004
Token came
Token ID1625
Feature activation-2.81e-04

Pos logit contributions
ll+0.008
uddle+0.007
castle+0.007
!+0.006
ounce+0.006
Neg logit contributions
 Some-0.007
 pain-0.006
 Many-0.006
 Amen-0.006
oscope-0.006
Token warm
Token ID5814
Feature activation+2.22e-02

Pos logit contributions
 Some+0.002
 pain+0.002
 Many+0.002
 Amen+0.002
 blonde+0.002
Neg logit contributions
ll-0.003
uddle-0.003
ounce-0.002
castle-0.002
!-0.002
Token and
Token ID290
Feature activation+5.95e-04

Pos logit contributions
ll+0.037
uddle+0.030
ounce+0.028
 upon+0.028
!+0.027
Neg logit contributions
 Some-0.044
 pain-0.042
 Many-0.041
 Amen-0.040
 novels-0.038
Token comfortable
Token ID6792
Feature activation+2.49e-03

Pos logit contributions
ll+0.001
uddle+0.001
!+0.001
 upon+0.001
castle+0.001
Neg logit contributions
 Some-0.001
 Many-0.001
 pain-0.001
 Amen-0.001
oscope-0.001
Token,
Token ID11
Feature activation+4.88e-03

Pos logit contributions
ll+0.004
uddle+0.004
ounce+0.004
 upon+0.003
castle+0.003
Neg logit contributions
 pain-0.004
 Some-0.004
oscope-0.004
 novels-0.004
 Many-0.004
Token just
Token ID655
Feature activation+8.42e-02

Pos logit contributions
ll+0.009
uddle+0.007
ounce+0.006
!+0.006
castle+0.006
Neg logit contributions
 Some-0.010
 pain-0.009
 Many-0.009
 Amen-0.008
oscope-0.008
Token perfect
Token ID2818
Feature activation+7.45e-02

Pos logit contributions
ll+0.152
!+0.129
uddle+0.124
 upon+0.119
castle+0.116
Neg logit contributions
 Some-0.161
 Many-0.151
 pain-0.146
 Amen-0.142
oscope-0.133
Token for
Token ID329
Feature activation-8.58e-03

Pos logit contributions
ll+0.140
uddle+0.116
ounce+0.113
 upon+0.102
cast+0.095
Neg logit contributions
 pain-0.139
 Some-0.124
 novels-0.118
 nightmares-0.117
 emotion-0.114
Token an
Token ID281
Feature activation+4.81e-02

Pos logit contributions
 Some+0.016
 Many+0.015
 pain+0.015
 Amen+0.014
oscope+0.013
Neg logit contributions
ll-0.016
uddle-0.014
!-0.013
 upon-0.013
castle-0.012
Token old
Token ID1468
Feature activation-7.27e-02

Pos logit contributions
ll+0.074
!+0.066
uddle+0.058
castle+0.058
 upon+0.056
Neg logit contributions
 Some-0.102
 Many-0.093
 pain-0.092
 Amen-0.089
ez-0.085
Token pigeon
Token ID46387
Feature activation+3.18e-02

Pos logit contributions
 pain+0.113
 Some+0.113
 Many+0.106
 Amen+0.099
 blonde+0.094
Neg logit contributions
ll-0.160
uddle-0.131
ounce-0.123
 upon-0.122
cast-0.119
Token like
Token ID588
Feature activation+6.98e-02

Pos logit contributions
ll+0.057
uddle+0.052
castle+0.045
cast+0.045
ounce+0.044
Neg logit contributions
 Some-0.055
 pain-0.051
oscope-0.051
 Many-0.051
 nightmares-0.048
Token rocked
Token ID36872
Feature activation-1.57e-02

Pos logit contributions
 Some+0.014
 pain+0.014
 Many+0.013
oscope+0.013
 Amen+0.012
Neg logit contributions
ll-0.013
!-0.011
uddle-0.011
ounce-0.010
 upon-0.010
Token even
Token ID772
Feature activation+4.07e-02

Pos logit contributions
 Some+0.033
 Many+0.031
 pain+0.031
 Amen+0.030
oscope+0.028
Neg logit contributions
ll-0.025
uddle-0.020
!-0.020
 upon-0.019
castle-0.019
Token harder
Token ID7069
Feature activation-6.20e-02

Pos logit contributions
ll+0.064
!+0.052
uddle+0.050
 upon+0.049
castle+0.046
Neg logit contributions
 Some-0.088
 Many-0.085
 pain-0.080
 Amen-0.080
 blonde-0.075
Token,
Token ID11
Feature activation-2.37e-03

Pos logit contributions
 Some+0.139
 Many+0.132
 pain+0.131
 Amen+0.125
oscope+0.120
Neg logit contributions
ll-0.091
uddle-0.072
!-0.071
 upon-0.065
castle-0.065
Token and
Token ID290
Feature activation-6.72e-03

Pos logit contributions
 Some+0.005
 pain+0.005
 Many+0.005
 Amen+0.004
oscope+0.004
Neg logit contributions
ll-0.004
uddle-0.003
!-0.003
castle-0.003
 upon-0.003
Token water
Token ID1660
Feature activation+7.83e-02

Pos logit contributions
 Some+0.013
 pain+0.013
 Many+0.012
 Amen+0.012
oscope+0.011
Neg logit contributions
ll-0.012
uddle-0.010
!-0.009
 upon-0.009
castle-0.009
Token started
Token ID2067
Feature activation+2.79e-02

Pos logit contributions
ll+0.153
uddle+0.128
castle+0.118
ounce+0.115
 destination+0.113
Neg logit contributions
 pain-0.135
 Some-0.124
oscope-0.115
 Many-0.111
ez-0.107
Token flooding
Token ID17448
Feature activation+4.95e-02

Pos logit contributions
ll+0.045
uddle+0.037
castle+0.032
ounce+0.030
!+0.030
Neg logit contributions
 Some-0.059
 pain-0.058
 Many-0.053
oscope-0.050
 blond-0.050
Token the
Token ID262
Feature activation+7.83e-03

Pos logit contributions
ll+0.100
uddle+0.080
castle+0.075
ounce+0.073
 upon+0.071
Neg logit contributions
 Some-0.091
 pain-0.089
 Many-0.081
oscope-0.074
 emotion-0.073
Token boat
Token ID8848
Feature activation-7.31e-03

Pos logit contributions
ll+0.014
ounce+0.011
 upon+0.011
!+0.011
uddle+0.010
Neg logit contributions
 pain-0.015
 Some-0.015
 Many-0.014
oscope-0.013
 Amen-0.013
Token.
Token ID13
Feature activation-4.99e-03

Pos logit contributions
 pain+0.012
oscope+0.011
 Some+0.011
 Many+0.010
 bolt+0.009
Neg logit contributions
ll-0.014
uddle-0.012
ory-0.011
andy-0.011
 upon-0.010
Token Max
Token ID5436
Feature activation-1.86e-02

Pos logit contributions
 Some+0.042
 Many+0.039
 pain+0.038
 Amen+0.037
oscope+0.034
Neg logit contributions
ll-0.048
uddle-0.041
!-0.040
castle-0.038
 upon-0.038
Token said
Token ID531
Feature activation+3.09e-02

Pos logit contributions
 Some+0.038
 Many+0.036
 pain+0.034
 Amen+0.032
oscope+0.032
Neg logit contributions
ll-0.032
uddle-0.027
castle-0.025
 upon-0.025
cast-0.024
Token.
Token ID13
Feature activation+3.30e-03

Pos logit contributions
ll+0.037
!+0.030
uddle+0.026
 upon+0.024
castle+0.023
Neg logit contributions
 Some-0.078
 Many-0.075
 pain-0.073
 Amen-0.071
oscope-0.068
Token "
Token ID366
Feature activation+4.69e-03

Pos logit contributions
ll+0.006
!+0.005
uddle+0.005
 upon+0.005
ounce+0.004
Neg logit contributions
 Some-0.006
 pain-0.006
 Many-0.006
 Amen-0.006
oscope-0.005
TokenI
Token ID40
Feature activation+2.39e-02

Pos logit contributions
ll+0.007
!+0.005
castle+0.005
 upon+0.005
uddle+0.005
Neg logit contributions
 pain-0.010
 Some-0.010
 nightmares-0.009
 Many-0.009
 blonde-0.009
Token'll
Token ID1183
Feature activation+6.48e-02

Pos logit contributions
ll+0.045
uddle+0.039
castle+0.037
!+0.037
ounce+0.035
Neg logit contributions
 Some-0.042
 pain-0.039
 Many-0.039
 Amen-0.037
oscope-0.035
Token make
Token ID787
Feature activation-1.22e-02

Pos logit contributions
ll+0.150
uddle+0.135
 upon+0.126
ounce+0.125
cast+0.119
Neg logit contributions
 Some-0.091
 pain-0.088
 Many-0.086
 Amen-0.080
oscope-0.072
Token the
Token ID262
Feature activation+6.16e-03

Pos logit contributions
 Some+0.021
 Many+0.019
 pain+0.019
 Amen+0.018
 novels+0.016
Neg logit contributions
ll-0.025
uddle-0.021
 upon-0.020
!-0.020
ounce-0.020
Token tail
Token ID7894
Feature activation+2.89e-03

Pos logit contributions
ll+0.012
uddle+0.010
ounce+0.010
!+0.009
 upon+0.009
Neg logit contributions
 Some-0.011
 pain-0.011
 Many-0.010
 Amen-0.010
 emotion-0.009
Token,
Token ID11
Feature activation+7.81e-04

Pos logit contributions
ll+0.006
uddle+0.005
ounce+0.004
 upon+0.004
ory+0.004
Neg logit contributions
 Some-0.005
 pain-0.005
 Many-0.005
 Amen-0.004
oscope-0.004
Token you
Token ID345
Feature activation-3.09e-02

Pos logit contributions
ll+0.001
uddle+0.001
!+0.001
 upon+0.001
ounce+0.001
Neg logit contributions
 Some-0.002
 Many-0.001
 pain-0.001
 Amen-0.001
oscope-0.001
Token made
Token ID925
Feature activation+1.44e-02

Pos logit contributions
 Some+0.001
 Many+0.001
 pain+0.001
 Amen+0.001
oscope+0.001
Neg logit contributions
ll-0.001
uddle-0.001
!-0.001
 upon-0.001
ounce-0.001
Token a
Token ID257
Feature activation+1.51e-02

Pos logit contributions
ll+0.028
!+0.025
uddle+0.024
 upon+0.023
castle+0.023
Neg logit contributions
 Some-0.024
 Amen-0.022
 Many-0.022
 pain-0.022
oscope-0.020
Token zip
Token ID19974
Feature activation-1.56e-02

Pos logit contributions
ll+0.035
!+0.028
uddle+0.027
ounce+0.027
 upon+0.027
Neg logit contributions
 Some-0.022
 pain-0.021
 Many-0.021
 Amen-0.018
 novels-0.017
Token with
Token ID351
Feature activation-1.48e-02

Pos logit contributions
 Some+0.030
 pain+0.028
 Many+0.027
oscope+0.025
 aids+0.025
Neg logit contributions
ll-0.030
uddle-0.022
 upon-0.021
ounce-0.020
!-0.019
Token a
Token ID257
Feature activation+1.52e-02

Pos logit contributions
 Amen+0.027
 Some+0.026
 Many+0.025
oscope+0.024
 pain+0.024
Neg logit contributions
ll-0.024
!-0.024
 upon-0.021
uddle-0.021
castle-0.020
Token fork
Token ID15563
Feature activation-6.74e-03

Pos logit contributions
ll+0.033
uddle+0.027
!+0.027
 upon+0.026
ounce+0.026
Neg logit contributions
 Some-0.024
 pain-0.022
 Many-0.022
 Amen-0.020
 blonde-0.019
Token on
Token ID319
Feature activation-2.32e-02

Pos logit contributions
 Some+0.013
 pain+0.012
 Many+0.011
oscope+0.011
 Amen+0.011
Neg logit contributions
ll-0.012
uddle-0.010
 upon-0.009
ounce-0.009
cast-0.008
Token the
Token ID262
Feature activation+1.13e-02

Pos logit contributions
 Amen+0.050
 Some+0.049
 Many+0.048
 pain+0.046
oscope+0.046
Neg logit contributions
!-0.031
ll-0.028
 upon-0.024
castle-0.024
!"-0.023
Token top
Token ID1353
Feature activation+4.84e-02

Pos logit contributions
ll+0.025
ounce+0.020
uddle+0.020
 upon+0.020
!+0.019
Neg logit contributions
 Some-0.018
 pain-0.017
 Many-0.017
oscope-0.016
 novels-0.015
Token.
Token ID13
Feature activation+1.17e-03

Pos logit contributions
ll+0.084
uddle+0.066
 upon+0.063
!+0.061
ounce+0.061
Neg logit contributions
 Some-0.098
 Many-0.092
 pain-0.091
oscope-0.083
 Amen-0.083
Token She
Token ID1375
Feature activation+5.44e-04

Pos logit contributions
ll+0.002
!+0.002
uddle+0.002
 upon+0.001
ounce+0.001
Neg logit contributions
 Amen-0.002
 pain-0.002
oscope-0.002
 emotion-0.002
ez-0.002
Token,
Token ID11
Feature activation-3.69e-03

Pos logit contributions
 Some+0.001
 Many+0.001
 pain+0.001
 blond+0.000
 blonde+0.000
Neg logit contributions
ll-0.001
uddle-0.001
ounce-0.001
castle-0.001
rying-0.001
Token Tom
Token ID4186
Feature activation+6.89e-03

Pos logit contributions
 Some+0.006
 Many+0.005
 pain+0.004
 novels+0.004
 blonde+0.004
Neg logit contributions
ll-0.009
uddle-0.008
ounce-0.007
castle-0.007
rying-0.007
Token and
Token ID290
Feature activation-6.99e-03

Pos logit contributions
ll+0.013
uddle+0.012
castle+0.011
 destination+0.011
 exactly+0.011
Neg logit contributions
 Some-0.011
 pain-0.011
 Many-0.010
 sleeps-0.010
oscope-0.009
Token Mia
Token ID32189
Feature activation+3.62e-02

Pos logit contributions
 Some+0.012
 Many+0.011
 pain+0.011
 Amen+0.011
oscope+0.010
Neg logit contributions
ll-0.013
!-0.012
 upon-0.011
uddle-0.011
castle-0.010
Token see
Token ID766
Feature activation+1.85e-02

Pos logit contributions
ll+0.079
uddle+0.070
castle+0.065
!+0.064
ounce+0.063
Neg logit contributions
 Some-0.054
 pain-0.053
 Many-0.053
 Amen-0.045
oscope-0.043
Token a
Token ID257
Feature activation+5.96e-03

Pos logit contributions
ll+0.032
!+0.031
uddle+0.028
 upon+0.027
ounce+0.026
Neg logit contributions
 Some-0.032
 Many-0.031
 Amen-0.030
 pain-0.029
oscope-0.029
Token big
Token ID1263
Feature activation-7.75e-03

Pos logit contributions
ll+0.013
!+0.011
uddle+0.011
 upon+0.010
castle+0.010
Neg logit contributions
 Some-0.009
 pain-0.009
 Many-0.009
 Amen-0.008
 novels-0.007
Token piece
Token ID3704
Feature activation-4.70e-02

Pos logit contributions
 Some+0.014
 Many+0.013
 pain+0.012
 Amen+0.012
oscope+0.011
Neg logit contributions
ll-0.015
!-0.013
uddle-0.013
castle-0.012
 upon-0.012
Token of
Token ID286
Feature activation-2.87e-02

Pos logit contributions
 Some+0.076
 Many+0.070
 pain+0.069
 Amen+0.064
oscope+0.061
Neg logit contributions
ll-0.100
uddle-0.084
!-0.082
 upon-0.079
castle-0.078
Token cheese
Token ID9891
Feature activation-1.46e-02

Pos logit contributions
 Some+0.042
 Many+0.039
 Amen+0.039
 pain+0.036
oscope+0.035
Neg logit contributions
ll-0.060
!-0.056
uddle-0.053
 upon-0.051
castle-0.049
Token on
Token ID319
Feature activation-2.69e-02

Pos logit contributions
 pain+0.023
 Some+0.023
oscope+0.020
 novels+0.020
 Many+0.020
Neg logit contributions
ll-0.030
uddle-0.025
 upon-0.024
ounce-0.024
 destination-0.022
Token She
Token ID1375
Feature activation+4.05e-03

Pos logit contributions
 Some+0.001
 pain+0.001
 Many+0.001
 Amen+0.001
oscope+0.001
Neg logit contributions
ll-0.003
!-0.002
uddle-0.002
 upon-0.002
castle-0.002
Token gave
Token ID2921
Feature activation+1.72e-02

Pos logit contributions
ll+0.007
uddle+0.006
!+0.006
 upon+0.006
castle+0.006
Neg logit contributions
 Some-0.007
 Many-0.007
 pain-0.006
 Amen-0.006
oscope-0.006
Token her
Token ID607
Feature activation-1.15e-02

Pos logit contributions
ll+0.031
!+0.028
 upon+0.027
uddle+0.025
ounce+0.023
Neg logit contributions
 Some-0.030
 Amen-0.030
 Many-0.029
 pain-0.028
 blonde-0.027
Token a
Token ID257
Feature activation+1.47e-02

Pos logit contributions
 Some+0.019
 Many+0.019
 Amen+0.019
 pain+0.018
oscope+0.017
Neg logit contributions
ll-0.021
!-0.020
 upon-0.019
uddle-0.018
castle-0.017
Token big
Token ID1263
Feature activation-3.67e-04

Pos logit contributions
ll+0.031
!+0.026
uddle+0.026
castle+0.025
 upon+0.024
Neg logit contributions
 Some-0.024
 Many-0.023
 pain-0.022
 Amen-0.021
oscope-0.019
Token hug
Token ID16225
Feature activation+2.25e-02

Pos logit contributions
 Some+0.001
 Many+0.001
 pain+0.001
 Amen+0.001
oscope+0.001
Neg logit contributions
ll-0.001
!-0.001
uddle-0.001
 upon-0.001
castle-0.001
Token and
Token ID290
Feature activation+1.58e-03

Pos logit contributions
ll+0.036
uddle+0.030
 upon+0.027
!+0.027
castle+0.027
Neg logit contributions
 Some-0.047
 pain-0.044
 Many-0.044
 Amen-0.042
oscope-0.041
Token said
Token ID531
Feature activation+3.16e-02

Pos logit contributions
ll+0.002
!+0.002
uddle+0.002
 upon+0.002
castle+0.002
Neg logit contributions
 Some-0.004
 Many-0.003
 pain-0.003
 Amen-0.003
oscope-0.003
Token,
Token ID11
Feature activation+3.61e-03

Pos logit contributions
ll+0.036
!+0.033
uddle+0.025
 upon+0.024
!"+0.023
Neg logit contributions
 Some-0.077
oscope-0.075
 pain-0.074
 Amen-0.073
 Many-0.073
Token "
Token ID366
Feature activation+9.91e-03

Pos logit contributions
ll+0.006
!+0.005
 upon+0.004
uddle+0.004
castle+0.004
Neg logit contributions
 Some-0.007
 pain-0.007
 Many-0.007
 Amen-0.007
oscope-0.007
TokenYou
Token ID1639
Feature activation+1.15e-03

Pos logit contributions
ll+0.019
!+0.015
 upon+0.015
uddle+0.014
ounce+0.014
Neg logit contributions
 Some-0.018
 pain-0.017
 Many-0.017
 Amen-0.016
oscope-0.015
Token the
Token ID262
Feature activation+9.47e-03

Pos logit contributions
 Some+0.024
 pain+0.023
 Many+0.022
 novels+0.021
oscope+0.020
Neg logit contributions
ll-0.028
uddle-0.024
castle-0.021
ounce-0.020
icking-0.020
Token plane
Token ID6614
Feature activation-2.77e-02

Pos logit contributions
ll+0.020
uddle+0.017
!+0.016
ounce+0.016
 upon+0.016
Neg logit contributions
 Some-0.015
 pain-0.015
 Many-0.014
oscope-0.014
 Amen-0.013
Token.
Token ID13
Feature activation+2.57e-03

Pos logit contributions
 Some+0.050
 Many+0.048
 pain+0.048
oscope+0.047
 Amen+0.044
Neg logit contributions
ll-0.047
uddle-0.041
ounce-0.037
 upon-0.035
ory-0.035
Token I
Token ID314
Feature activation+6.85e-03

Pos logit contributions
!+0.005
ll+0.005
!"+0.004
 upon+0.004
!".+0.004
Neg logit contributions
 pain-0.004
 Some-0.003
ez-0.003
oscope-0.003
 emotion-0.003
Token push
Token ID4574
Feature activation-2.69e-02

Pos logit contributions
ll+0.013
!+0.011
uddle+0.011
 upon+0.010
castle+0.010
Neg logit contributions
 Some-0.012
 Many-0.012
 pain-0.011
 Amen-0.011
oscope-0.010
Token and
Token ID290
Feature activation-7.91e-03

Pos logit contributions
 Some+0.051
 Many+0.048
 pain+0.047
 Amen+0.043
oscope+0.042
Neg logit contributions
ll-0.050
uddle-0.041
castle-0.039
!-0.038
 upon-0.037
Token pull
Token ID2834
Feature activation-1.99e-02

Pos logit contributions
 Some+0.014
 Many+0.013
 pain+0.013
 Amen+0.012
oscope+0.011
Neg logit contributions
ll-0.015
!-0.013
uddle-0.013
 upon-0.012
castle-0.012
Token the
Token ID262
Feature activation+6.39e-03

Pos logit contributions
 Some+0.038
 Many+0.035
 pain+0.035
 novels+0.031
 Amen+0.031
Neg logit contributions
ll-0.037
uddle-0.031
castle-0.029
 upon-0.027
ounce-0.027
Token levers
Token ID48872
Feature activation-2.88e-02

Pos logit contributions
ll+0.015
uddle+0.012
!+0.012
ounce+0.012
 upon+0.012
Neg logit contributions
 Some-0.009
 pain-0.009
 Many-0.008
 Amen-0.008
oscope-0.008
Token and
Token ID290
Feature activation-9.83e-03

Pos logit contributions
 Some+0.056
 pain+0.053
 Many+0.052
oscope+0.049
 Amen+0.048
Neg logit contributions
ll-0.050
uddle-0.042
ounce-0.038
 upon-0.038
castle-0.037
Token turn
Token ID1210
Feature activation-3.98e-03

Pos logit contributions
 Some+0.017
 Many+0.016
 pain+0.016
 Amen+0.015
oscope+0.014
Neg logit contributions
ll-0.019
!-0.016
uddle-0.016
 upon-0.015
castle-0.015
Token she
Token ID673
Feature activation+1.04e-02

Pos logit contributions
 Some+0.002
 Amen+0.002
 Many+0.002
 pain+0.002
oscope+0.002
Neg logit contributions
ll-0.002
!-0.002
 upon-0.001
uddle-0.001
ounce-0.001
Token did
Token ID750
Feature activation-1.19e-02

Pos logit contributions
ll+0.019
castle+0.017
uddle+0.017
cast+0.016
 destination+0.016
Neg logit contributions
 pain-0.017
 Some-0.015
 Many-0.014
ez-0.014
oscope-0.013
Token not
Token ID407
Feature activation+1.37e-02

Pos logit contributions
 Some+0.021
 pain+0.020
 Many+0.020
 blond+0.017
 blonde+0.017
Neg logit contributions
ll-0.024
uddle-0.020
castle-0.019
 upon-0.017
ounce-0.017
Token see
Token ID766
Feature activation+2.00e-02

Pos logit contributions
ll+0.031
uddle+0.029
castle+0.026
 upon+0.026
cast+0.025
Neg logit contributions
 pain-0.020
 Many-0.018
 Some-0.018
oscope-0.015
 Amen-0.015
Token the
Token ID262
Feature activation+4.94e-03

Pos logit contributions
!+0.028
ll+0.026
 upon+0.021
ounce+0.020
!"+0.020
Neg logit contributions
 Amen-0.044
 Some-0.043
 Many-0.041
oscope-0.038
 chapters-0.038
Token sack
Token ID23704
Feature activation+1.80e-02

Pos logit contributions
ll+0.010
!+0.010
uddle+0.009
 upon+0.009
ounce+0.009
Neg logit contributions
 Some-0.007
 Amen-0.007
 Many-0.007
ez-0.006
oscope-0.006
Token on
Token ID319
Feature activation-2.58e-02

Pos logit contributions
ll+0.033
uddle+0.027
castle+0.024
 upon+0.024
cast+0.023
Neg logit contributions
 pain-0.033
 Some-0.033
 Many-0.031
oscope-0.028
 x-0.028
Token the
Token ID262
Feature activation+5.87e-03

Pos logit contributions
 Some+0.051
 Amen+0.048
 Many+0.048
 pain+0.044
oscope+0.043
Neg logit contributions
!-0.040
ll-0.040
 upon-0.034
uddle-0.034
castle-0.032
Token other
Token ID584
Feature activation-1.23e-02

Pos logit contributions
ll+0.011
uddle+0.009
 upon+0.009
!+0.009
castle+0.009
Neg logit contributions
 Some-0.010
 pain-0.010
 Many-0.010
oscope-0.009
 Amen-0.009
Token side
Token ID1735
Feature activation-1.41e-02

Pos logit contributions
 Some+0.025
 Many+0.023
 pain+0.023
 Amen+0.022
oscope+0.021
Neg logit contributions
ll-0.021
uddle-0.017
!-0.017
 upon-0.016
castle-0.016
Token of
Token ID286
Feature activation-2.51e-02

Pos logit contributions
 Some+0.022
 Many+0.020
 pain+0.020
oscope+0.018
 Amen+0.017
Neg logit contributions
ll-0.030
uddle-0.026
cast-0.024
castle-0.023
 upon-0.023
Token leaves
Token ID5667
Feature activation+3.64e-02

Pos logit contributions
ll+0.004
uddle+0.004
!+0.004
 upon+0.003
ounce+0.003
Neg logit contributions
 Some-0.003
 pain-0.003
 Many-0.003
 Amen-0.003
oscope-0.003
Token of
Token ID286
Feature activation-3.25e-02

Pos logit contributions
ll+0.062
uddle+0.051
castle+0.046
 upon+0.046
!+0.046
Neg logit contributions
 Some-0.072
 pain-0.069
 Many-0.068
oscope-0.063
 Amen-0.062
Token the
Token ID262
Feature activation+4.15e-03

Pos logit contributions
 Some+0.063
 Many+0.059
 Amen+0.059
 pain+0.055
oscope+0.055
Neg logit contributions
ll-0.050
!-0.049
 upon-0.043
uddle-0.041
castle-0.040
Token trees
Token ID7150
Feature activation+7.12e-03

Pos logit contributions
ll+0.008
uddle+0.007
!+0.007
ounce+0.007
 upon+0.007
Neg logit contributions
 Some-0.007
 pain-0.007
 Many-0.007
 Amen-0.006
oscope-0.006
Token.
Token ID13
Feature activation-5.66e-03

Pos logit contributions
ll+0.012
uddle+0.010
castle+0.009
 upon+0.009
ounce+0.009
Neg logit contributions
 Some-0.014
 pain-0.013
 Many-0.013
oscope-0.012
 Amen-0.012
Token Suddenly
Token ID24975
Feature activation-3.47e-02

Pos logit contributions
 pain+0.008
 Some+0.007
 Amen+0.007
 Many+0.007
oscope+0.006
Neg logit contributions
ll-0.013
!-0.011
uddle-0.011
 upon-0.011
ounce-0.010
Token,
Token ID11
Feature activation-4.98e-03

Pos logit contributions
 Some+0.077
 Many+0.073
 pain+0.072
 Amen+0.068
 blonde+0.065
Neg logit contributions
ll-0.054
uddle-0.042
!-0.040
castle-0.039
 upon-0.037
Token the
Token ID262
Feature activation+3.91e-03

Pos logit contributions
 Some+0.009
 Many+0.009
 pain+0.009
 Amen+0.008
oscope+0.008
Neg logit contributions
ll-0.009
!-0.008
uddle-0.007
 upon-0.007
castle-0.007
Token mouse
Token ID10211
Feature activation-3.29e-02

Pos logit contributions
ll+0.008
!+0.007
uddle+0.007
 upon+0.007
castle+0.006
Neg logit contributions
 Some-0.006
 Many-0.006
 pain-0.006
 Amen-0.006
oscope-0.005
Token noticed
Token ID6810
Feature activation-1.31e-03

Pos logit contributions
 pain+0.058
 Some+0.057
 Many+0.054
oscope+0.051
ez+0.049
Neg logit contributions
ll-0.063
uddle-0.051
castle-0.049
cast-0.046
 upon-0.046
Token a
Token ID257
Feature activation+7.49e-03

Pos logit contributions
 Some+0.002
 Many+0.002
 pain+0.002
 Amen+0.002
oscope+0.002
Neg logit contributions
ll-0.003
uddle-0.002
!-0.002
castle-0.002
 upon-0.002
Token came
Token ID1625
Feature activation-1.69e-02

Pos logit contributions
ll+0.009
uddle+0.008
castle+0.007
 upon+0.007
cast+0.007
Neg logit contributions
 Some-0.008
 Many-0.007
 pain-0.007
oscope-0.006
 blond-0.006
Token to
Token ID284
Feature activation-1.16e-02

Pos logit contributions
 Some+0.033
 Many+0.030
 pain+0.030
 x+0.028
 Amen+0.027
Neg logit contributions
ll-0.029
uddle-0.028
castle-0.025
cast-0.023
lly-0.021
Token visit
Token ID3187
Feature activation-3.48e-02

Pos logit contributions
 Some+0.022
 Many+0.020
 pain+0.020
 Amen+0.019
oscope+0.019
Neg logit contributions
ll-0.020
!-0.018
 upon-0.017
uddle-0.016
castle-0.016
Token them
Token ID606
Feature activation-2.41e-03

Pos logit contributions
 Some+0.063
 Many+0.059
 Amen+0.056
 pain+0.056
 novels+0.053
Neg logit contributions
ll-0.068
uddle-0.059
castle-0.054
ounce-0.051
cast-0.048
Token.
Token ID13
Feature activation-8.50e-03

Pos logit contributions
 Some+0.004
 Many+0.004
 novels+0.004
 Amen+0.004
 pain+0.004
Neg logit contributions
ll-0.004
uddle-0.004
castle-0.004
ounce-0.004
ounced-0.004
Token He
Token ID679
Feature activation-2.13e-02

Pos logit contributions
 Some+0.012
 Many+0.011
 pain+0.010
 Amen+0.009
 blond+0.009
Neg logit contributions
ll-0.020
uddle-0.018
!-0.017
castle-0.017
 upon-0.016
Token brought
Token ID3181
Feature activation+2.19e-03

Pos logit contributions
 Some+0.038
 Many+0.035
 pain+0.033
 Amen+0.032
oscope+0.031
Neg logit contributions
ll-0.040
uddle-0.036
castle-0.034
ounce-0.032
!-0.032
Token a
Token ID257
Feature activation+1.54e-04

Pos logit contributions
ll+0.004
uddle+0.003
!+0.003
castle+0.003
 upon+0.003
Neg logit contributions
 Some-0.004
 Many-0.004
 pain-0.004
 Amen-0.004
oscope-0.003
Token big
Token ID1263
Feature activation-1.32e-02

Pos logit contributions
ll+0.000
!+0.000
uddle+0.000
castle+0.000
ounce+0.000
Neg logit contributions
 Some-0.000
 Many-0.000
 pain-0.000
 Amen-0.000
 x-0.000
Token box
Token ID3091
Feature activation-1.67e-02

Pos logit contributions
 Some+0.027
 Many+0.025
 pain+0.025
 Amen+0.024
oscope+0.023
Neg logit contributions
ll-0.023
uddle-0.018
!-0.018
 upon-0.017
ounce-0.017
Token with
Token ID351
Feature activation-2.25e-02

Pos logit contributions
 Some+0.028
 Many+0.025
 pain+0.024
 Amen+0.024
oscope+0.023
Neg logit contributions
ll-0.034
uddle-0.030
ounce-0.026
cast-0.025
 upon-0.025
TokenBut
Token ID1537
Feature activation+2.29e-02

Pos logit contributions
 Some+0.004
 Many+0.003
 pain+0.003
 Amen+0.003
oscope+0.003
Neg logit contributions
ll-0.004
uddle-0.003
!-0.003
castle-0.003
 upon-0.003
Token Ben
Token ID3932
Feature activation+4.57e-03

Pos logit contributions
ll+0.053
uddle+0.046
!+0.044
castle+0.042
 upon+0.042
Neg logit contributions
 Some-0.034
 pain-0.031
 Many-0.030
 Amen-0.028
 novels-0.026
Token wanted
Token ID2227
Feature activation-1.63e-02

Pos logit contributions
ll+0.010
castle+0.008
uddle+0.008
 upon+0.008
 ashore+0.008
Neg logit contributions
 pain-0.007
 Some-0.006
 Many-0.006
oscope-0.005
 blond-0.005
Token to
Token ID284
Feature activation-5.22e-03

Pos logit contributions
 Some+0.041
 Many+0.039
 pain+0.038
 Amen+0.037
oscope+0.035
Neg logit contributions
ll-0.021
uddle-0.015
!-0.015
 upon-0.014
castle-0.013
Token be
Token ID307
Feature activation-1.89e-02

Pos logit contributions
 Some+0.008
 Many+0.008
 pain+0.008
 Amen+0.007
 blond+0.006
Neg logit contributions
ll-0.011
uddle-0.010
 upon-0.009
!-0.009
castle-0.009
Token kind
Token ID1611
Feature activation-4.01e-02

Pos logit contributions
 Some+0.031
 pain+0.029
 Many+0.029
 Amen+0.027
 blond+0.025
Neg logit contributions
ll-0.040
uddle-0.034
!-0.033
 upon-0.032
castle-0.031
Token and
Token ID290
Feature activation-8.17e-03

Pos logit contributions
 Some+0.085
 pain+0.079
 Many+0.078
 Amen+0.071
oscope+0.071
Neg logit contributions
ll-0.066
uddle-0.054
castle-0.050
 upon-0.049
!-0.047
Token support
Token ID1104
Feature activation+1.45e-02

Pos logit contributions
 Some+0.015
 Many+0.014
 Amen+0.013
growth+0.013
 pain+0.013
Neg logit contributions
ll-0.015
!-0.013
 upon-0.012
uddle-0.012
castle-0.012
Token Max
Token ID5436
Feature activation-2.04e-02

Pos logit contributions
ll+0.023
!+0.019
uddle+0.018
 upon+0.017
castle+0.016
Neg logit contributions
 Some-0.031
 Many-0.029
 pain-0.029
 Amen-0.028
oscope-0.026
Token.
Token ID13
Feature activation+7.50e-04

Pos logit contributions
 Some+0.044
 pain+0.042
 Many+0.042
 Amen+0.039
oscope+0.039
Neg logit contributions
ll-0.031
uddle-0.025
!-0.023
castle-0.023
 upon-0.022
Token He
Token ID679
Feature activation-1.07e-02

Pos logit contributions
ll+0.002
!+0.001
uddle+0.001
 upon+0.001
ounce+0.001
Neg logit contributions
 Some-0.001
 pain-0.001
 Amen-0.001
 Many-0.001
oscope-0.001
Token Max
Token ID5436
Feature activation-2.24e-02

Pos logit contributions
ll+0.026
uddle+0.021
!+0.021
castle+0.020
 upon+0.020
Neg logit contributions
 Some-0.024
 Many-0.023
 pain-0.023
 Amen-0.021
 novels-0.020
Token saw
Token ID2497
Feature activation-4.05e-03

Pos logit contributions
 Some+0.043
 Many+0.042
 pain+0.041
oscope+0.037
 Amen+0.037
Neg logit contributions
ll-0.038
castle-0.032
uddle-0.032
!-0.029
cast-0.029
Token the
Token ID262
Feature activation+4.34e-03

Pos logit contributions
 Some+0.008
 Amen+0.007
 Many+0.007
 pain+0.007
oscope+0.007
Neg logit contributions
ll-0.007
!-0.006
 upon-0.005
uddle-0.005
ounce-0.005
Token veterinarian
Token ID41404
Feature activation+2.54e-03

Pos logit contributions
ll+0.010
!+0.009
uddle+0.009
 upon+0.008
 destination+0.008
Neg logit contributions
 Some-0.006
 Many-0.005
 Amen-0.005
 pain-0.005
ez-0.005
Token he
Token ID339
Feature activation+3.33e-05

Pos logit contributions
ll+0.004
uddle+0.003
!+0.003
castle+0.003
 upon+0.003
Neg logit contributions
 Some-0.006
 Many-0.005
 pain-0.005
 Amen-0.005
oscope-0.005
Token felt
Token ID2936
Feature activation-2.44e-02

Pos logit contributions
ll+0.000
uddle+0.000
castle+0.000
ounce+0.000
cast+0.000
Neg logit contributions
 Some-0.000
 Many-0.000
 pain-0.000
oscope-0.000
 Amen-0.000
Token a
Token ID257
Feature activation+7.44e-03

Pos logit contributions
 Some+0.041
 Many+0.038
 Amen+0.036
 pain+0.035
oscope+0.034
Neg logit contributions
ll-0.046
!-0.043
uddle-0.040
 upon-0.039
castle-0.036
Token bit
Token ID1643
Feature activation+2.85e-02

Pos logit contributions
ll+0.013
!+0.011
uddle+0.011
 upon+0.010
castle+0.010
Neg logit contributions
 Some-0.014
 Many-0.013
 Amen-0.013
 pain-0.013
oscope-0.012
Token scared
Token ID12008
Feature activation+1.69e-02

Pos logit contributions
!+0.051
ll+0.050
uddle+0.048
 upon+0.042
ounce+0.041
Neg logit contributions
 Amen-0.048
growth-0.048
 Some-0.047
 Many-0.046
 novels-0.046
Token.
Token ID13
Feature activation-4.20e-03

Pos logit contributions
ll+0.024
uddle+0.019
!+0.019
 upon+0.017
castle+0.017
Neg logit contributions
 Some-0.039
 Many-0.037
 pain-0.036
 Amen-0.035
oscope-0.033
Token But
Token ID887
Feature activation-1.12e-04

Pos logit contributions
 Some+0.006
 pain+0.006
 Many+0.006
 Amen+0.006
oscope+0.005
Neg logit contributions
ll-0.009
!-0.008
uddle-0.008
 upon-0.007
ounce-0.007
Token shiny
Token ID22441
Feature activation-2.40e-02

Pos logit contributions
 Some+0.010
 Many+0.009
 pain+0.009
 Amen+0.008
oscope+0.008
Neg logit contributions
ll-0.011
!-0.009
uddle-0.009
 upon-0.008
castle-0.008
Token and
Token ID290
Feature activation-5.35e-03

Pos logit contributions
 Some+0.054
 Many+0.051
 pain+0.050
 Amen+0.049
oscope+0.046
Neg logit contributions
ll-0.035
uddle-0.027
!-0.027
 upon-0.025
castle-0.025
Token had
Token ID550
Feature activation-2.76e-03

Pos logit contributions
 Some+0.010
 Many+0.010
 pain+0.009
 Amen+0.009
oscope+0.009
Neg logit contributions
ll-0.009
!-0.008
uddle-0.008
 upon-0.007
castle-0.007
Token a
Token ID257
Feature activation+1.47e-02

Pos logit contributions
 Some+0.004
 Many+0.004
 pain+0.004
 Amen+0.004
oscope+0.003
Neg logit contributions
ll-0.006
!-0.005
uddle-0.005
 upon-0.005
castle-0.005
Token star
Token ID3491
Feature activation-2.39e-02

Pos logit contributions
ll+0.030
!+0.025
uddle+0.025
 upon+0.024
castle+0.024
Neg logit contributions
 Some-0.024
 Many-0.023
 pain-0.022
 Amen-0.021
oscope-0.020
Token on
Token ID319
Feature activation-2.51e-02

Pos logit contributions
 Some+0.044
 pain+0.041
 Many+0.041
 Amen+0.038
oscope+0.037
Neg logit contributions
ll-0.045
uddle-0.036
!-0.035
 upon-0.034
ounce-0.033
Token it
Token ID340
Feature activation-1.11e-02

Pos logit contributions
 Some+0.054
 Many+0.052
 Amen+0.051
 pain+0.050
oscope+0.048
Neg logit contributions
ll-0.033
!-0.033
 upon-0.028
uddle-0.027
castle-0.026
Token.
Token ID13
Feature activation+2.62e-03

Pos logit contributions
 Some+0.023
 pain+0.021
 Many+0.021
oscope+0.019
 blonde+0.019
Neg logit contributions
ll-0.019
uddle-0.015
castle-0.014
ounce-0.014
 upon-0.013
Token She
Token ID1375
Feature activation-2.82e-04

Pos logit contributions
ll+0.005
!+0.004
uddle+0.004
 upon+0.004
ounce+0.004
Neg logit contributions
 pain-0.005
 Some-0.005
 Amen-0.004
 Many-0.004
oscope-0.004
Token reached
Token ID4251
Feature activation+2.67e-02

Pos logit contributions
 Some+0.000
 Many+0.000
 pain+0.000
 blonde+0.000
 blond+0.000
Neg logit contributions
ll-0.001
uddle-0.000
!-0.000
castle-0.000
ounce-0.000
Token for
Token ID329
Feature activation-1.17e-02

Pos logit contributions
ll+0.045
uddle+0.038
castle+0.035
ounce+0.032
!+0.030
Neg logit contributions
 Some-0.056
 pain-0.052
 Many-0.051
 x-0.047
 Amen-0.047
Token,
Token ID11
Feature activation-7.25e-03

Pos logit contributions
ll+0.005
uddle+0.004
castle+0.004
ounce+0.004
 upon+0.004
Neg logit contributions
 pain-0.005
 Amen-0.005
 Some-0.004
oscope-0.004
 Many-0.004
Token falling
Token ID7463
Feature activation-6.30e-03

Pos logit contributions
 Some+0.013
 pain+0.012
 Many+0.012
 Amen+0.011
oscope+0.011
Neg logit contributions
ll-0.014
uddle-0.012
!-0.011
ounce-0.011
castle-0.011
Token is
Token ID318
Feature activation-4.16e-03

Pos logit contributions
 Some+0.013
 pain+0.013
 Many+0.012
 Amen+0.012
oscope+0.011
Neg logit contributions
ll-0.010
uddle-0.008
castle-0.007
ounce-0.007
 upon-0.007
Token not
Token ID407
Feature activation+1.02e-02

Pos logit contributions
 pain+0.007
 Some+0.006
 Many+0.006
 nightmares+0.005
 Amen+0.005
Neg logit contributions
ll-0.009
uddle-0.008
ounce-0.007
castle-0.007
 upon-0.007
Token fun
Token ID1257
Feature activation+1.37e-02

Pos logit contributions
ll+0.022
uddle+0.020
ounce+0.018
castle+0.017
 destination+0.017
Neg logit contributions
 pain-0.017
 Some-0.014
 nightmares-0.013
 Many-0.013
 novels-0.013
Token,"
Token ID553
Feature activation-9.89e-02

Pos logit contributions
ll+0.022
uddle+0.017
 upon+0.016
castle+0.015
ounce+0.015
Neg logit contributions
 Some-0.029
 pain-0.028
 Many-0.027
 Amen-0.025
oscope-0.025
Token their
Token ID511
Feature activation-2.14e-02

Pos logit contributions
 Some+0.168
 Many+0.159
 pain+0.149
 Amen+0.148
oscope+0.146
Neg logit contributions
ll-0.176
!-0.168
 upon-0.153
uddle-0.144
!"-0.141
Token parent
Token ID2560
Feature activation-4.20e-02

Pos logit contributions
 Some+0.042
 pain+0.039
 Many+0.039
 Amen+0.037
oscope+0.035
Neg logit contributions
ll-0.038
!-0.031
uddle-0.030
 upon-0.029
castle-0.028
Token says
Token ID1139
Feature activation+1.26e-02

Pos logit contributions
 Some+0.072
 Many+0.066
 pain+0.066
 Amen+0.063
oscope+0.059
Neg logit contributions
ll-0.084
uddle-0.071
!-0.068
castle-0.067
 upon-0.066
Token.
Token ID13
Feature activation-7.19e-03

Pos logit contributions
ll+0.015
!+0.012
uddle+0.011
 upon+0.010
castle+0.009
Neg logit contributions
 Some-0.032
 Many-0.031
 pain-0.030
 Amen-0.029
 novels-0.028
Token "
Token ID366
Feature activation+2.12e-03

Pos logit contributions
 Some+0.014
 pain+0.014
 Many+0.014
 Amen+0.013
oscope+0.012
Neg logit contributions
ll-0.012
!-0.010
uddle-0.010
 upon-0.009
ounce-0.009
Token was
Token ID373
Feature activation-1.09e-02

Pos logit contributions
 Some+0.009
 Many+0.008
 pain+0.008
 Amen+0.007
 blonde+0.007
Neg logit contributions
ll-0.007
uddle-0.006
castle-0.006
ounce-0.006
cast-0.005
Token a
Token ID257
Feature activation+2.05e-03

Pos logit contributions
oscope+0.022
 Some+0.021
 aids+0.021
 Amen+0.021
 pain+0.021
Neg logit contributions
!-0.015
ll-0.014
 upon-0.014
!"-0.011
 home-0.011
Token boy
Token ID2933
Feature activation-3.75e-02

Pos logit contributions
ll+0.004
!+0.003
uddle+0.003
 upon+0.003
castle+0.003
Neg logit contributions
 Some-0.004
 Many-0.003
 pain-0.003
 Amen-0.003
oscope-0.003
Token who
Token ID508
Feature activation-1.77e-02

Pos logit contributions
 Some+0.072
 Many+0.068
 pain+0.068
 Amen+0.064
 emotion+0.063
Neg logit contributions
ll-0.065
!-0.062
uddle-0.051
 upon-0.050
ounce-0.049
Token was
Token ID373
Feature activation-1.04e-02

Pos logit contributions
 Some+0.042
 Many+0.039
 chapters+0.038
 aids+0.037
growth+0.037
Neg logit contributions
ll-0.021
!-0.021
 upon-0.017
lly-0.014
uddle-0.014
Token 3
Token ID513
Feature activation-1.13e-01

Pos logit contributions
growth+0.024
 chapters+0.023
 Some+0.023
 Many+0.023
 aids+0.023
Neg logit contributions
ll-0.012
 upon-0.011
!-0.010
 home-0.009
 swimming-0.008
Token years
Token ID812
Feature activation+2.48e-03

Pos logit contributions
 Some+0.201
 Many+0.186
 pain+0.182
 Amen+0.179
oscope+0.164
Neg logit contributions
ll-0.217
!-0.194
uddle-0.183
 upon-0.170
castle-0.170
Token old
Token ID1468
Feature activation-8.33e-02

Pos logit contributions
ll+0.005
!+0.005
uddle+0.005
 upon+0.004
castle+0.004
Neg logit contributions
 Some-0.004
 Many-0.003
 pain-0.003
 Amen-0.003
oscope-0.003
Token.
Token ID13
Feature activation-1.38e-03

Pos logit contributions
 Some+0.184
 Many+0.173
 pain+0.172
 Amen+0.165
oscope+0.157
Neg logit contributions
ll-0.130
uddle-0.101
!-0.094
 upon-0.093
castle-0.092
Token He
Token ID679
Feature activation-1.73e-02

Pos logit contributions
 pain+0.002
 Amen+0.002
 Some+0.002
oscope+0.002
ez+0.002
Neg logit contributions
ll-0.003
!-0.002
 upon-0.002
uddle-0.002
castle-0.002
Token had
Token ID550
Feature activation-7.14e-03

Pos logit contributions
 Some+0.031
 Many+0.029
 pain+0.028
 Amen+0.027
oscope+0.025
Neg logit contributions
ll-0.034
uddle-0.028
!-0.028
 upon-0.026
castle-0.026
Tokenl
Token ID75
Feature activation-7.58e-02

Pos logit contributions
 Some+0.031
 Many+0.029
 pain+0.029
 Amen+0.028
oscope+0.026
Neg logit contributions
ll-0.017
!-0.014
uddle-0.013
 upon-0.012
castle-0.012
Tokenle
Token ID293
Feature activation+7.32e-03

Pos logit contributions
 Some+0.186
 Many+0.176
 Amen+0.173
 pain+0.171
 blonde+0.158
Neg logit contributions
ll-0.098
!-0.082
uddle-0.070
castle-0.067
 upon-0.064
Token.
Token ID13
Feature activation-6.67e-03

Pos logit contributions
ll+0.012
!+0.010
uddle+0.009
 upon+0.009
castle+0.009
Neg logit contributions
 Some-0.015
 Many-0.015
 pain-0.014
 Amen-0.014
oscope-0.013
Token She
Token ID1375
Feature activation-3.30e-03

Pos logit contributions
 pain+0.012
 Amen+0.011
oscope+0.011
ez+0.011
 emotion+0.010
Neg logit contributions
!-0.011
 upon-0.011
ll-0.011
ounce-0.009
uddle-0.009
Token was
Token ID373
Feature activation-9.24e-03

Pos logit contributions
 Some+0.007
 Many+0.006
 pain+0.006
 Amen+0.006
 emotion+0.006
Neg logit contributions
ll-0.006
!-0.005
uddle-0.004
 upon-0.004
castle-0.004
Token 3
Token ID513
Feature activation-1.14e-01

Pos logit contributions
growth+0.023
 Some+0.022
 Many+0.022
 Amen+0.021
 pain+0.021
Neg logit contributions
ll-0.008
 upon-0.007
!-0.007
 swimming-0.006
 exactly-0.006
Token years
Token ID812
Feature activation-1.92e-03

Pos logit contributions
 Some+0.216
 Many+0.198
 Amen+0.196
 pain+0.195
oscope+0.174
Neg logit contributions
ll-0.205
!-0.185
uddle-0.173
 upon-0.157
castle-0.156
Token old
Token ID1468
Feature activation-8.28e-02

Pos logit contributions
 Some+0.003
 Many+0.002
 pain+0.002
 Amen+0.002
oscope+0.002
Neg logit contributions
ll-0.004
!-0.004
uddle-0.004
 upon-0.004
castle-0.004
Token and
Token ID290
Feature activation-4.37e-03

Pos logit contributions
 Some+0.179
 Many+0.168
 pain+0.168
 Amen+0.159
oscope+0.153
Neg logit contributions
ll-0.133
uddle-0.104
castle-0.097
 upon-0.096
!-0.095
Token loved
Token ID6151
Feature activation+2.14e-02

Pos logit contributions
 Some+0.010
 Many+0.009
 pain+0.009
oscope+0.008
 Amen+0.008
Neg logit contributions
ll-0.006
!-0.006
 upon-0.005
lly-0.005
uddle-0.005
Token to
Token ID284
Feature activation-5.12e-03

Pos logit contributions
ll+0.029
uddle+0.022
!+0.021
castle+0.020
 upon+0.019
Neg logit contributions
 Some-0.052
 Many-0.049
 pain-0.048
 Amen-0.047
 novels-0.045
Token the
Token ID262
Feature activation+4.62e-03

Pos logit contributions
 Some+0.012
 Many+0.012
 pain+0.012
 Amen+0.011
oscope+0.010
Neg logit contributions
ll-0.014
uddle-0.012
!-0.011
castle-0.011
ounce-0.011
Token kitchen
Token ID9592
Feature activation-1.49e-02

Pos logit contributions
ll+0.009
ounce+0.008
!+0.008
uddle+0.007
 upon+0.007
Neg logit contributions
 Some-0.008
 pain-0.007
 Many-0.007
 Amen-0.007
 x-0.006
Token.
Token ID13
Feature activation-5.80e-03

Pos logit contributions
 Some+0.027
 pain+0.022
 Many+0.022
oscope+0.021
 Amen+0.020
Neg logit contributions
ll-0.028
uddle-0.023
ounce-0.023
ogo-0.021
castle-0.021
Token
Token ID198
Feature activation-6.72e-04

Pos logit contributions
 Some+0.009
 pain+0.009
 Many+0.009
oscope+0.009
 Amen+0.008
Neg logit contributions
ll-0.011
!-0.010
 upon-0.010
uddle-0.009
ounce-0.008
Token
Token ID198
Feature activation-1.11e-03

Pos logit contributions
 Some+0.001
 Many+0.001
 pain+0.001
 Amen+0.001
oscope+0.001
Neg logit contributions
ll-0.001
uddle-0.001
castle-0.001
!-0.001
 upon-0.001
TokenEm
Token ID10161
Feature activation-1.00e-01

Pos logit contributions
 Some+0.002
 Many+0.002
 pain+0.002
 Amen+0.002
oscope+0.002
Neg logit contributions
ll-0.002
uddle-0.002
!-0.002
castle-0.002
 upon-0.002
Tokenma
Token ID2611
Feature activation-2.88e-02

Pos logit contributions
 Some+0.239
 Many+0.229
 pain+0.221
 Amen+0.210
 blond+0.202
Neg logit contributions
ll-0.145
!-0.112
uddle-0.105
 upon-0.100
castle-0.092
Token's
Token ID338
Feature activation-7.58e-03

Pos logit contributions
 Some+0.056
 Many+0.052
 pain+0.051
 Amen+0.048
oscope+0.048
Neg logit contributions
ll-0.049
uddle-0.041
!-0.040
 upon-0.039
castle-0.038
Token mom
Token ID1995
Feature activation-6.63e-03

Pos logit contributions
 Some+0.014
 Many+0.013
 pain+0.013
 Amen+0.012
oscope+0.012
Neg logit contributions
ll-0.014
!-0.012
uddle-0.012
 upon-0.011
castle-0.011
Token put
Token ID1234
Feature activation+4.18e-03

Pos logit contributions
 Some+0.013
 Many+0.012
 pain+0.011
 Amen+0.011
oscope+0.011
Neg logit contributions
ll-0.011
uddle-0.010
castle-0.009
!-0.009
 upon-0.009
Token some
Token ID617
Feature activation-2.59e-02

Pos logit contributions
ll+0.007
!+0.007
uddle+0.006
 upon+0.006
castle+0.005
Neg logit contributions
 Some-0.008
 Many-0.008
 Amen-0.007
 pain-0.007
 aids-0.007
TokenGeorge
Token ID20191
Feature activation-1.97e-02

Pos logit contributions
ll+0.099
uddle+0.084
castle+0.080
!+0.079
ounce+0.075
Neg logit contributions
 Some-0.093
 Many-0.086
 pain-0.083
 Amen-0.080
oscope-0.073
Token was
Token ID373
Feature activation-4.02e-03

Pos logit contributions
 Some+0.035
 pain+0.033
 Many+0.033
oscope+0.031
 Amen+0.031
Neg logit contributions
ll-0.036
uddle-0.032
castle-0.029
ounce-0.028
!-0.027
Token a
Token ID257
Feature activation+9.72e-03

Pos logit contributions
 Some+0.008
 Many+0.008
 pain+0.007
 Amen+0.007
oscope+0.007
Neg logit contributions
ll-0.007
!-0.006
uddle-0.005
 upon-0.005
castle-0.005
Token 3
Token ID513
Feature activation-1.07e-01

Pos logit contributions
ll+0.019
!+0.017
uddle+0.017
 upon+0.016
castle+0.016
Neg logit contributions
 Some-0.016
 Many-0.015
 pain-0.015
 Amen-0.014
oscope-0.014
Token-
Token ID12
Feature activation-4.19e-02

Pos logit contributions
 Some+0.204
 Many+0.187
 pain+0.180
 Amen+0.174
 blond+0.164
Neg logit contributions
ll-0.189
!-0.179
uddle-0.158
castle-0.150
 upon-0.148
Tokenyear
Token ID1941
Feature activation-1.02e-01

Pos logit contributions
 pain+0.061
 Some+0.055
 Many+0.051
growth+0.050
 blonde+0.048
Neg logit contributions
ll-0.100
 upon-0.080
!-0.079
castle-0.074
rying-0.074
Token-
Token ID12
Feature activation-4.14e-02

Pos logit contributions
 Some+0.252
 Many+0.236
 pain+0.233
 Amen+0.229
 blonde+0.217
Neg logit contributions
ll-0.126
!-0.105
uddle-0.097
castle-0.087
 upon-0.081
Tokenold
Token ID727
Feature activation+1.03e-03

Pos logit contributions
 pain+0.073
 Some+0.070
 Many+0.063
 Amen+0.061
growth+0.061
Neg logit contributions
ll-0.085
!-0.067
 upon-0.064
uddle-0.061
castle-0.061
Token boy
Token ID2933
Feature activation-2.69e-02

Pos logit contributions
ll+0.002
uddle+0.002
castle+0.002
!+0.002
ounce+0.002
Neg logit contributions
 Some-0.002
 Many-0.002
 pain-0.002
 Amen-0.002
oscope-0.001
Token who
Token ID508
Feature activation-7.95e-03

Pos logit contributions
 Some+0.052
 Many+0.047
 pain+0.047
 Amen+0.046
oscope+0.044
Neg logit contributions
ll-0.049
uddle-0.041
castle-0.038
 upon-0.037
cast-0.036
Token was
Token ID373
Feature activation-1.79e-03

Pos logit contributions
 Some+0.014
 Many+0.013
 pain+0.013
 Amen+0.012
oscope+0.012
Neg logit contributions
ll-0.015
uddle-0.013
!-0.013
 upon-0.012
castle-0.012