TOP ACTIVATIONS
MAX = 8.471

to put on her fan ci est sunglasses .
she always wore her fan ci est dress to the museum
dressed up in her fan ci est clothes and asked her
they didn 't get it unst uck . Just
We 'll help you get unst uck ." But
and put on her sh ini est shoes .
Lily , you did not clean up . You did not
couch and gave them a stern look . "
?" Mom said in a stern but gentle voice .
over . She had a stern face and a loud voice
?" she asked in a stern voice . " Why are
an old woman with a stern expression .
didn 't understand what " mod est " meant , but
re not exactly what we intended . <|endoftext|> Once upon a
". He gave Jack a stern look and walked away .
Tom and Lily with a stern face . Tom
her mother said in a stern voice .
will buy you the fan ci est dress in the store
didn 't know what " mod est " meant , but
looked at them with a stern face . "

MIN ACTIVATIONS
MIN = -12.215

happy and warm . <|endoftext|> Once upon a time , there
fun time outside . <|endoftext|> Once upon a time , there
Lily 's health . <|endoftext|> Once upon a time , there
his mom my . <|endoftext|> Once upon a time , there
and her dad . <|endoftext|> Once upon a time , there
Ben 's help . <|endoftext|> Once upon a time , there
t eddy bear . <|endoftext|> Once upon a time , there
. The end . <|endoftext|> Once upon a time , there
kind and helpful . <|endoftext|> Once upon a time , there
her mom my . <|endoftext|> Once upon a time , there
her animal friends . <|endoftext|> Once upon a time , there
few days later . <|endoftext|> Once upon a time , there
were happy friends . <|endoftext|> Once upon a time , there
her baby brother . <|endoftext|> Once upon a time , there
in the park . <|endoftext|> Once upon a time , there
in the future . <|endoftext|> Once upon a time , there
his helpful friend . <|endoftext|> Once upon a time , there
with her mom . <|endoftext|> Once upon a time , there
responsible with money . <|endoftext|> Once upon a time , there
it gave her . <|endoftext|> Once upon a time , there

INTERVAL 3.300 - 8.471
CONTAINS 4.705%

mom and dad ran to Tom and Jen and cried .
It made a loud sound . " Hon k ! Pop
and reached for the top . They found a big bag
Sam cough and run away . " Hey ,
another jar . We â € ™ re not lucky enough

INTERVAL -1.872 - 3.300
CONTAINS 75.978%

mum said , " Those are a great choice Mill ie
store and bought some fun beach toys . Lib by was
spoiled . <|endoftext|> Once upon a time , there was a
, Jack was walking in the park . He stopped to
fall . It is cold and wet . It makes their

INTERVAL -7.043 - -1.872
CONTAINS 18.173%

could make honey from the trees and it tasted sweet !
Once there was a boy named Joe . He was three
They said goodbye to the ostr ich and walked home .
. It was special , because the iron had purple and
write or think . She opened the window and

INTERVAL -12.215 - -7.043
CONTAINS 1.144%

with him . While they were walking to the
and follow advice . <|endoftext|> Once upon a time , there
said , " L ily , you need to clean up
's cane weighed the same as the bag of apples .
. But then , something bad happened . Tom
Token to
Token ID284
Feature activation+2.55e+00

Pos logit contributions
stick+0.033
 itself+0.033
iped+0.032
ez+0.032
ze+0.031
Neg logit contributions
 upon-0.026
ancer-0.023
 there-0.021
 they-0.020
ulu-0.019
Token put
Token ID1234
Feature activation+7.37e-01

Pos logit contributions
stick+0.060
 itself+0.059
iped+0.056
ze+0.055
bons+0.054
Neg logit contributions
 upon-0.061
ancer-0.056
ulu-0.048
ver-0.047
 there-0.047
Token on
Token ID319
Feature activation+1.38e+00

Pos logit contributions
 itself+0.017
stick+0.017
iped+0.016
ez+0.016
bons+0.016
Neg logit contributions
 upon-0.017
ancer-0.016
ulu-0.014
ver-0.014
as-0.013
Token her
Token ID607
Feature activation+2.85e+00

Pos logit contributions
stick+0.037
 itself+0.036
iped+0.036
ez+0.035
age+0.035
Neg logit contributions
 upon-0.027
 there-0.024
 they-0.023
ancer-0.022
 don-0.020
Token fan
Token ID4336
Feature activation+2.03e+00

Pos logit contributions
 itself+0.065
stick+0.064
iped+0.064
ze+0.062
ez+0.060
Neg logit contributions
 upon-0.065
ancer-0.061
ulu-0.053
ver-0.053
 they-0.051
Tokenci
Token ID979
Feature activation+8.47e+00

Pos logit contributions
 itself+0.033
stick+0.032
iped+0.031
ez+0.029
ze+0.029
Neg logit contributions
 upon-0.062
ancer-0.058
 there-0.053
 they-0.053
ulu-0.052
Tokenest
Token ID395
Feature activation+1.12e+00

Pos logit contributions
 itself+0.141
iped+0.126
stick+0.121
ez+0.119
izer+0.117
Neg logit contributions
 upon-0.250
ancer-0.218
 there-0.207
 they-0.206
 while-0.201
Token sunglasses
Token ID36251
Feature activation-1.01e+00

Pos logit contributions
 itself+0.023
stick+0.023
iped+0.022
ez+0.021
ze+0.020
Neg logit contributions
 upon-0.030
ancer-0.027
 there-0.025
 they-0.024
ulu-0.024
Token.
Token ID13
Feature activation+2.51e+00

Pos logit contributions
 upon+0.026
ancer+0.023
ulu+0.022
 don+0.020
 however+0.019
Neg logit contributions
stick-0.022
 itself-0.020
age-0.019
nob-0.019
iped-0.019
Token
Token ID198
Feature activation-2.17e+00

Pos logit contributions
 itself+0.059
stick+0.058
iped+0.056
nob+0.053
ez+0.053
Neg logit contributions
 upon-0.057
ancer-0.054
ulu-0.046
 there-0.046
ver-0.045
Token
Token ID198
Feature activation-2.22e+00

Pos logit contributions
ancer+0.049
 upon+0.049
 don+0.040
ulu+0.039
ver+0.039
Neg logit contributions
 itself-0.049
stick-0.046
iped-0.045
nob-0.044
ez-0.044
Token she
Token ID673
Feature activation+3.00e-01

Pos logit contributions
ancer+0.134
 upon+0.133
as+0.122
ver+0.116
ulu+0.114
Neg logit contributions
 itself-0.138
stick-0.134
 mistakes-0.128
er-0.122
ze-0.118
Token always
Token ID1464
Feature activation-8.64e-01

Pos logit contributions
stick+0.008
 itself+0.008
iped+0.007
ez+0.007
ze+0.007
Neg logit contributions
 upon-0.006
ancer-0.006
 there-0.005
ulu-0.005
 they-0.005
Token wore
Token ID12408
Feature activation+6.95e-01

Pos logit contributions
 upon+0.019
ancer+0.018
ulu+0.016
 there+0.015
ver+0.015
Neg logit contributions
 itself-0.021
stick-0.021
iped-0.020
ez-0.020
ze-0.019
Token her
Token ID607
Feature activation+2.71e+00

Pos logit contributions
 itself+0.014
stick+0.014
iped+0.013
ze+0.013
ez+0.013
Neg logit contributions
 upon-0.018
ancer-0.017
ulu-0.015
ver-0.015
 there-0.015
Token fan
Token ID4336
Feature activation+1.70e+00

Pos logit contributions
 itself+0.058
stick+0.056
iped+0.052
bons+0.052
ze+0.052
Neg logit contributions
 upon-0.065
ancer-0.064
ulu-0.058
ver-0.055
 they-0.053
Tokenci
Token ID979
Feature activation+8.43e+00

Pos logit contributions
stick+0.041
 itself+0.040
iped+0.039
ez+0.038
ze+0.037
Neg logit contributions
 upon-0.039
ancer-0.035
 there-0.031
 they-0.030
ulu-0.030
Tokenest
Token ID395
Feature activation+3.35e-01

Pos logit contributions
 itself+0.123
iped+0.118
stick+0.114
izer+0.113
ez+0.107
Neg logit contributions
 upon-0.261
ancer-0.233
 there-0.215
 they-0.213
 don-0.213
Token dress
Token ID6576
Feature activation-1.05e-01

Pos logit contributions
stick+0.007
 itself+0.007
iped+0.006
ez+0.006
ze+0.006
Neg logit contributions
 upon-0.009
ancer-0.008
ulu-0.007
 there-0.007
 they-0.007
Token to
Token ID284
Feature activation+1.01e+00

Pos logit contributions
 upon+0.002
ancer+0.002
ulu+0.002
 there+0.002
 don+0.002
Neg logit contributions
stick-0.002
 itself-0.002
age-0.002
ze-0.002
iped-0.002
Token the
Token ID262
Feature activation+7.56e-01

Pos logit contributions
 itself+0.025
stick+0.025
iped+0.023
ze+0.023
ez+0.023
Neg logit contributions
 upon-0.022
ancer-0.020
ulu-0.018
 there-0.018
 they-0.017
Token museum
Token ID13257
Feature activation-3.14e+00

Pos logit contributions
 itself+0.017
stick+0.016
ze+0.015
iped+0.015
bons+0.015
Neg logit contributions
 upon-0.018
ancer-0.017
ulu-0.015
ver-0.015
 there-0.015
Token dressed
Token ID12049
Feature activation+1.62e+00

Pos logit contributions
 upon+0.009
ancer+0.009
ulu+0.007
ver+0.007
 don+0.007
Neg logit contributions
 itself-0.010
stick-0.010
iped-0.009
ez-0.009
ze-0.009
Token up
Token ID510
Feature activation+2.16e-01

Pos logit contributions
stick+0.039
 itself+0.039
iped+0.037
ez+0.037
ze+0.035
Neg logit contributions
 upon-0.037
ancer-0.033
 there-0.031
 they-0.029
ulu-0.028
Token in
Token ID287
Feature activation-7.47e-01

Pos logit contributions
 itself+0.005
stick+0.004
iped+0.004
nob+0.004
ze+0.004
Neg logit contributions
ancer-0.005
 upon-0.005
ulu-0.005
 don-0.004
ver-0.004
Token her
Token ID607
Feature activation+2.22e+00

Pos logit contributions
 upon+0.017
ancer+0.016
ulu+0.014
 there+0.014
ver+0.014
Neg logit contributions
 itself-0.018
stick-0.017
iped-0.017
ez-0.016
ze-0.016
Token fan
Token ID4336
Feature activation+1.19e+00

Pos logit contributions
stick+0.053
 itself+0.052
iped+0.050
ze+0.049
bons+0.048
Neg logit contributions
 upon-0.050
ancer-0.047
ulu-0.041
ver-0.040
 there-0.040
Tokenci
Token ID979
Feature activation+7.25e+00

Pos logit contributions
 itself+0.026
iped+0.025
stick+0.025
ez+0.025
ze+0.024
Neg logit contributions
 upon-0.029
ancer-0.026
 there-0.024
 they-0.024
ulu-0.022
Tokenest
Token ID395
Feature activation+7.08e-01

Pos logit contributions
 itself+0.124
izer+0.112
iped+0.112
ez+0.109
 media+0.108
Neg logit contributions
 upon-0.203
ancer-0.171
 they-0.163
 there-0.160
're-0.159
Token clothes
Token ID8242
Feature activation-1.75e+00

Pos logit contributions
stick+0.014
 itself+0.014
iped+0.014
ez+0.013
ze+0.013
Neg logit contributions
 upon-0.019
ancer-0.017
 there-0.016
 they-0.015
ulu-0.015
Token and
Token ID290
Feature activation-4.86e-01

Pos logit contributions
 upon+0.046
ancer+0.043
ulu+0.040
ver+0.037
 don+0.037
Neg logit contributions
stick-0.035
 itself-0.034
iped-0.031
ze-0.030
age-0.030
Token asked
Token ID1965
Feature activation-6.38e-01

Pos logit contributions
 upon+0.012
ancer+0.012
ver+0.010
ulu+0.010
as+0.009
Neg logit contributions
stick-0.011
 itself-0.011
iped-0.010
bons-0.010
ze-0.010
Token her
Token ID607
Feature activation+2.00e+00

Pos logit contributions
 upon+0.015
ancer+0.014
ulu+0.013
as+0.012
ver+0.012
Neg logit contributions
 itself-0.015
stick-0.014
iped-0.013
ze-0.013
nob-0.013
Token they
Token ID484
Feature activation-1.74e+00

Pos logit contributions
 upon+0.052
ancer+0.047
ulu+0.038
ver+0.036
as+0.036
Neg logit contributions
 itself-0.096
stick-0.090
iped-0.088
ze-0.085
 mistakes-0.084
Token didn
Token ID1422
Feature activation+2.12e+00

Pos logit contributions
 upon+0.036
ancer+0.035
ulu+0.028
 however+0.027
 there+0.027
Neg logit contributions
 itself-0.044
stick-0.044
iped-0.042
ez-0.041
nob-0.040
Token't
Token ID470
Feature activation-3.84e-01

Pos logit contributions
iped+0.073
stick+0.072
 itself+0.071
ez+0.069
bons+0.069
Neg logit contributions
 upon-0.024
 they-0.019
ancer-0.019
 there-0.018
ver-0.015
Token get
Token ID651
Feature activation+1.29e+00

Pos logit contributions
 upon+0.008
ancer+0.007
 there+0.006
 they+0.006
ulu+0.006
Neg logit contributions
 itself-0.010
stick-0.010
iped-0.010
ez-0.009
ze-0.009
Token it
Token ID340
Feature activation+1.39e+00

Pos logit contributions
 itself+0.031
stick+0.031
iped+0.029
ez+0.028
ze+0.028
Neg logit contributions
 upon-0.030
ancer-0.027
 there-0.024
 they-0.024
ulu-0.024
Token unst
Token ID15014
Feature activation+6.99e+00

Pos logit contributions
 itself+0.028
stick+0.027
iped+0.026
ze+0.024
nob+0.024
Neg logit contributions
 upon-0.037
ancer-0.035
ulu-0.032
 there-0.030
 they-0.030
Tokenuck
Token ID1347
Feature activation-4.33e-01

Pos logit contributions
 itself+0.150
 melody+0.149
nob+0.147
ez+0.145
 aids+0.142
Neg logit contributions
 upon-0.146
ancer-0.137
 don-0.135
 there-0.132
 they-0.128
Token.
Token ID13
Feature activation+2.67e+00

Pos logit contributions
 upon+0.013
ancer+0.012
ulu+0.011
ver+0.011
 there+0.010
Neg logit contributions
 itself-0.008
stick-0.007
iped-0.007
ze-0.006
ez-0.006
Token
Token ID198
Feature activation-2.48e+00

Pos logit contributions
stick+0.077
 itself+0.076
iped+0.074
ze+0.073
ez+0.073
Neg logit contributions
 upon-0.050
ancer-0.041
 they-0.039
 there-0.038
ulu-0.035
Token
Token ID198
Feature activation-2.46e+00

Pos logit contributions
 upon+0.049
ancer+0.043
 there+0.038
 they+0.037
ulu+0.037
Neg logit contributions
stick-0.067
 itself-0.067
iped-0.065
ez-0.063
ze-0.062
TokenJust
Token ID5703
Feature activation-2.92e+00

Pos logit contributions
 upon+0.052
 there+0.042
 they+0.042
ancer+0.041
ulu+0.040
Neg logit contributions
stick-0.066
 itself-0.063
iped-0.063
ez-0.060
bons-0.060
TokenWe
Token ID1135
Feature activation-5.53e-01

Pos logit contributions
 itself+0.052
stick+0.052
iped+0.050
ez+0.048
ze+0.048
Neg logit contributions
 upon-0.054
ancer-0.049
 there-0.044
 they-0.044
ulu-0.043
Token'll
Token ID1183
Feature activation+2.22e+00

Pos logit contributions
 upon+0.009
ancer+0.009
ulu+0.008
 however+0.007
 they+0.007
Neg logit contributions
stick-0.015
 itself-0.015
age-0.014
ez-0.014
ze-0.014
Token help
Token ID1037
Feature activation+1.24e+00

Pos logit contributions
 itself+0.056
stick+0.055
iped+0.055
ez+0.053
bons+0.051
Neg logit contributions
 upon-0.047
ancer-0.041
 there-0.038
 they-0.038
 don-0.037
Token you
Token ID345
Feature activation+1.02e+00

Pos logit contributions
iped+0.037
stick+0.037
ez+0.036
 itself+0.035
age+0.035
Neg logit contributions
 upon-0.021
ancer-0.017
 there-0.017
 they-0.017
 don-0.016
Token get
Token ID651
Feature activation+1.41e-02

Pos logit contributions
stick+0.023
 itself+0.023
ze+0.022
iped+0.021
age+0.021
Neg logit contributions
 upon-0.025
ancer-0.023
ulu-0.020
 they-0.020
 there-0.019
Token unst
Token ID15014
Feature activation+6.76e+00

Pos logit contributions
stick+0.000
 itself+0.000
iped+0.000
ez+0.000
ze+0.000
Neg logit contributions
 upon-0.000
ancer-0.000
 there-0.000
 they-0.000
ulu-0.000
Tokenuck
Token ID1347
Feature activation-2.27e-01

Pos logit contributions
 itself+0.162
nob+0.149
 melody+0.148
 aids+0.146
 media+0.142
Neg logit contributions
 there-0.116
 don-0.116
 upon-0.115
ancer-0.112
 neither-0.106
Token."
Token ID526
Feature activation+2.52e+00

Pos logit contributions
 upon+0.006
ancer+0.005
ulu+0.005
ver+0.005
 thirsty+0.005
Neg logit contributions
 itself-0.005
stick-0.005
iped-0.004
 mistakes-0.004
ze-0.004
Token
Token ID198
Feature activation-1.80e+00

Pos logit contributions
stick+0.086
nob+0.085
ze+0.085
bons+0.085
iped+0.084
Neg logit contributions
 upon-0.034
 they-0.031
 there-0.025
 however-0.023
ancer-0.021
Token
Token ID198
Feature activation-2.05e+00

Pos logit contributions
 upon+0.026
 they+0.019
 there+0.018
ulu+0.017
ancer+0.016
Neg logit contributions
stick-0.062
iped-0.061
eness-0.059
bons-0.058
ez-0.058
TokenBut
Token ID1537
Feature activation-3.72e+00

Pos logit contributions
 upon+0.037
 they+0.032
 there+0.030
 however+0.028
ulu+0.027
Neg logit contributions
stick-0.060
iped-0.058
ez-0.057
nob-0.056
bons-0.056
Token and
Token ID290
Feature activation-2.36e-01

Pos logit contributions
 upon+0.004
ancer+0.003
ulu+0.003
 there+0.003
 they+0.003
Neg logit contributions
stick-0.003
 itself-0.003
iped-0.003
ze-0.003
nob-0.003
Token put
Token ID1234
Feature activation+3.67e-01

Pos logit contributions
 upon+0.006
ancer+0.005
ulu+0.005
ver+0.005
as+0.004
Neg logit contributions
stick-0.005
 itself-0.005
iped-0.005
bons-0.005
ze-0.005
Token on
Token ID319
Feature activation+9.81e-01

Pos logit contributions
stick+0.009
 itself+0.009
iped+0.008
ez+0.008
ze+0.008
Neg logit contributions
 upon-0.008
ancer-0.008
 there-0.007
 they-0.007
ulu-0.007
Token her
Token ID607
Feature activation+2.75e+00

Pos logit contributions
iped+0.027
stick+0.026
age+0.026
eness+0.026
 itself+0.026
Neg logit contributions
 upon-0.019
 there-0.017
 they-0.016
ancer-0.014
 don-0.013
Token sh
Token ID427
Feature activation+1.76e+00

Pos logit contributions
 itself+0.063
stick+0.062
iped+0.060
ez+0.058
ze+0.057
Neg logit contributions
 upon-0.065
ancer-0.060
 there-0.053
ulu-0.052
 they-0.052
Tokenini
Token ID5362
Feature activation+6.72e+00

Pos logit contributions
 itself+0.029
iped+0.029
 mistakes+0.028
stick+0.028
 target+0.028
Neg logit contributions
 upon-0.047
ulu-0.041
ancer-0.040
 there-0.039
as-0.038
Tokenest
Token ID395
Feature activation+1.13e+00

Pos logit contributions
 itself+0.145
iped+0.142
 media+0.138
ez+0.136
rums+0.136
Neg logit contributions
 upon-0.168
ancer-0.139
 while-0.135
 there-0.132
 they-0.129
Token shoes
Token ID10012
Feature activation-1.05e+00

Pos logit contributions
 itself+0.023
stick+0.022
iped+0.022
ez+0.021
nob+0.020
Neg logit contributions
 upon-0.031
ancer-0.027
 there-0.026
 they-0.025
 however-0.024
Token.
Token ID13
Feature activation+2.85e+00

Pos logit contributions
 upon+0.027
ancer+0.026
ulu+0.024
 don+0.022
ver+0.022
Neg logit contributions
stick-0.021
 itself-0.020
iped-0.018
age-0.018
nob-0.018
Token
Token ID198
Feature activation-2.29e+00

Pos logit contributions
 itself+0.069
stick+0.069
iped+0.067
ez+0.066
ze+0.066
Neg logit contributions
 upon-0.066
ancer-0.058
 they-0.053
 there-0.053
ulu-0.049
Token
Token ID198
Feature activation-2.27e+00

Pos logit contributions
 upon+0.049
ancer+0.047
 there+0.039
ulu+0.039
 they+0.038
Neg logit contributions
 itself-0.056
stick-0.055
iped-0.053
ez-0.051
ze-0.051
Token Lily
Token ID20037
Feature activation+3.09e+00

Pos logit contributions
 upon+0.013
ancer+0.013
ver+0.012
ulu+0.011
as+0.011
Neg logit contributions
 itself-0.013
stick-0.013
iped-0.012
ez-0.011
 mistakes-0.011
Token,
Token ID11
Feature activation-6.49e+00

Pos logit contributions
 itself+0.078
stick+0.078
iped+0.077
 mistakes+0.074
ze+0.073
Neg logit contributions
 upon-0.061
ancer-0.061
ulu-0.052
ver-0.046
 however-0.045
Token you
Token ID345
Feature activation-2.25e-01

Pos logit contributions
ancer+0.161
 upon+0.153
ver+0.144
as+0.130
 steep+0.129
Neg logit contributions
 itself-0.137
ze-0.121
zer-0.115
stick-0.112
 wishes-0.109
Token did
Token ID750
Feature activation+4.37e+00

Pos logit contributions
ancer+0.004
 upon+0.004
ulu+0.004
 weekend+0.003
 particular+0.003
Neg logit contributions
stick-0.006
ez-0.006
ze-0.006
iped-0.006
 wishes-0.005
Token not
Token ID407
Feature activation+9.13e-01

Pos logit contributions
stick+0.149
iped+0.143
 itself+0.141
ez+0.138
eness+0.138
Neg logit contributions
 upon-0.062
 there-0.050
 they-0.048
 don-0.045
ancer-0.043
Token clean
Token ID3424
Feature activation+6.62e+00

Pos logit contributions
 itself+0.025
stick+0.025
iped+0.024
ez+0.023
ze+0.023
Neg logit contributions
 upon-0.018
ancer-0.016
 they-0.014
 there-0.014
ulu-0.013
Token up
Token ID510
Feature activation-2.66e-01

Pos logit contributions
stick+0.147
iped+0.143
age+0.142
 itself+0.141
ez+0.139
Neg logit contributions
 upon-0.162
 there-0.144
ancer-0.139
 they-0.137
 don-0.128
Token.
Token ID13
Feature activation+3.83e+00

Pos logit contributions
 upon+0.008
ancer+0.008
ulu+0.007
ver+0.007
as+0.006
Neg logit contributions
 itself-0.005
stick-0.004
 mistakes-0.004
ze-0.004
iped-0.004
Token You
Token ID921
Feature activation-2.57e-01

Pos logit contributions
stick+0.072
 itself+0.070
iped+0.069
ze+0.066
nob+0.065
Neg logit contributions
 upon-0.103
ancer-0.099
 there-0.091
 they-0.090
 don-0.088
Token did
Token ID750
Feature activation+3.84e+00

Pos logit contributions
 upon+0.006
ancer+0.005
ulu+0.004
 particular+0.004
 however+0.004
Neg logit contributions
stick-0.006
ez-0.006
ze-0.006
age-0.006
nob-0.006
Token not
Token ID407
Feature activation+7.78e-01

Pos logit contributions
stick+0.119
 itself+0.116
iped+0.115
ez+0.112
age+0.110
Neg logit contributions
 upon-0.063
ancer-0.051
 there-0.049
 they-0.047
 don-0.044
Token couch
Token ID18507
Feature activation+8.45e-01

Pos logit contributions
stick+0.050
ez+0.050
iped+0.050
 itself+0.049
eness+0.048
Neg logit contributions
 upon-0.033
ancer-0.027
 there-0.026
 they-0.026
ulu-0.024
Token and
Token ID290
Feature activation+1.38e+00

Pos logit contributions
 itself+0.019
stick+0.019
iped+0.018
ez+0.017
ze+0.017
Neg logit contributions
 upon-0.021
ancer-0.019
 there-0.017
 they-0.017
ulu-0.017
Token gave
Token ID2921
Feature activation-5.43e-01

Pos logit contributions
stick+0.032
 itself+0.032
iped+0.031
ze+0.030
ez+0.030
Neg logit contributions
 upon-0.033
ancer-0.031
ulu-0.027
ver-0.026
 there-0.025
Token them
Token ID606
Feature activation+3.60e-01

Pos logit contributions
 upon+0.012
ancer+0.011
 there+0.010
 they+0.010
ulu+0.010
Neg logit contributions
stick-0.013
 itself-0.013
iped-0.013
ez-0.012
age-0.012
Token a
Token ID257
Feature activation+1.74e+00

Pos logit contributions
 itself+0.009
stick+0.009
iped+0.008
ez+0.008
ze+0.008
Neg logit contributions
 upon-0.008
ancer-0.007
 there-0.006
 they-0.006
ulu-0.006
Token stern
Token ID26370
Feature activation+6.62e+00

Pos logit contributions
stick+0.041
 itself+0.041
iped+0.040
ez+0.039
ze+0.038
Neg logit contributions
 upon-0.039
ancer-0.036
 there-0.031
 they-0.031
ulu-0.031
Token look
Token ID804
Feature activation+3.14e-01

Pos logit contributions
stick+0.171
iped+0.165
bons+0.165
 itself+0.163
 media+0.161
Neg logit contributions
 upon-0.129
 don-0.112
 there-0.105
 while-0.104
 they-0.100
Token.
Token ID13
Feature activation+4.07e+00

Pos logit contributions
stick+0.007
 itself+0.007
iped+0.006
ez+0.006
ze+0.006
Neg logit contributions
 upon-0.008
ancer-0.007
 there-0.007
 they-0.007
ulu-0.006
Token
Token ID198
Feature activation-2.23e+00

Pos logit contributions
 itself+0.091
stick+0.089
iped+0.085
ez+0.080
ze+0.080
Neg logit contributions
 upon-0.098
ancer-0.092
ulu-0.080
 there-0.079
ver-0.078
Token
Token ID198
Feature activation-2.21e+00

Pos logit contributions
 upon+0.048
ancer+0.046
 there+0.038
ulu+0.038
ver+0.037
Neg logit contributions
 itself-0.055
stick-0.052
iped-0.051
ez-0.049
nob-0.048
Token"
Token ID1
Feature activation-2.04e-01

Pos logit contributions
ancer+0.054
 upon+0.053
ver+0.044
ulu+0.043
 usual+0.042
Neg logit contributions
 itself-0.048
stick-0.045
iped-0.043
ez-0.043
age-0.043
Token?"
Token ID1701
Feature activation+1.15e+00

Pos logit contributions
 itself+0.053
stick+0.052
iped+0.049
ez+0.049
ze+0.048
Neg logit contributions
 upon-0.035
ancer-0.032
ver-0.027
ulu-0.027
 they-0.025
Token Mom
Token ID11254
Feature activation-2.23e-01

Pos logit contributions
 itself+0.030
stick+0.030
iped+0.029
ez+0.028
ze+0.028
Neg logit contributions
 upon-0.023
ancer-0.021
 there-0.018
ulu-0.018
 they-0.017
Token said
Token ID531
Feature activation+7.16e-01

Pos logit contributions
 upon+0.005
ancer+0.004
 there+0.004
 they+0.004
ulu+0.004
Neg logit contributions
 itself-0.006
stick-0.006
iped-0.006
ze-0.005
ez-0.005
Token in
Token ID287
Feature activation-1.37e+00

Pos logit contributions
bons+0.019
stick+0.018
iped+0.018
izer+0.018
 itself+0.018
Neg logit contributions
 upon-0.015
 they-0.014
 there-0.014
ancer-0.013
 don-0.012
Token a
Token ID257
Feature activation+9.14e-01

Pos logit contributions
 upon+0.032
ancer+0.029
ulu+0.025
 there+0.025
 they+0.025
Neg logit contributions
 itself-0.033
stick-0.032
iped-0.031
ez-0.030
ze-0.030
Token stern
Token ID26370
Feature activation+6.45e+00

Pos logit contributions
 itself+0.022
stick+0.022
iped+0.021
ez+0.020
ze+0.020
Neg logit contributions
 upon-0.021
ancer-0.018
 there-0.017
 they-0.016
ulu-0.016
Token but
Token ID475
Feature activation-2.48e+00

Pos logit contributions
bons+0.130
stick+0.125
iped+0.123
ez+0.122
 itself+0.122
Neg logit contributions
 upon-0.163
 there-0.140
 don-0.134
 they-0.134
 though-0.129
Token gentle
Token ID10296
Feature activation+3.87e+00

Pos logit contributions
 upon+0.035
ancer+0.032
ulu+0.024
as+0.023
ver+0.022
Neg logit contributions
 itself-0.081
stick-0.076
iped-0.076
 mistakes-0.072
ez-0.072
Token voice
Token ID3809
Feature activation-6.02e-01

Pos logit contributions
iped+0.071
 itself+0.069
bons+0.067
stick+0.066
ez+0.065
Neg logit contributions
 upon-0.104
 there-0.093
ancer-0.089
 they-0.088
ulu-0.085
Token.
Token ID13
Feature activation+3.49e+00

Pos logit contributions
 upon+0.014
 there+0.012
ancer+0.012
 they+0.012
ver+0.011
Neg logit contributions
stick-0.014
 itself-0.013
bons-0.013
iped-0.013
izer-0.013
Token
Token ID198
Feature activation-2.01e+00

Pos logit contributions
 itself+0.079
stick+0.075
iped+0.072
ez+0.069
nob+0.068
Neg logit contributions
 upon-0.083
ancer-0.078
ulu-0.068
ver-0.066
 there-0.065
Token over
Token ID625
Feature activation-1.92e+00

Pos logit contributions
stick+0.009
 itself+0.009
iped+0.009
ez+0.009
ze+0.008
Neg logit contributions
 upon-0.008
ancer-0.007
 there-0.007
 they-0.007
ulu-0.006
Token.
Token ID13
Feature activation+3.84e+00

Pos logit contributions
 upon+0.049
ancer+0.044
 there+0.040
 they+0.039
ulu+0.039
Neg logit contributions
 itself-0.041
stick-0.041
iped-0.039
ez-0.038
ze-0.037
Token She
Token ID1375
Feature activation+1.56e+00

Pos logit contributions
 itself+0.059
stick+0.057
iped+0.054
 Its+0.053
ez+0.049
Neg logit contributions
ancer-0.112
 upon-0.112
ulu-0.103
ver-0.099
as-0.098
Token had
Token ID550
Feature activation+1.74e+00

Pos logit contributions
stick+0.034
iped+0.032
 itself+0.031
ez+0.030
nob+0.030
Neg logit contributions
ancer-0.038
 upon-0.036
ulu-0.030
ver-0.029
 however-0.028
Token a
Token ID257
Feature activation+1.41e+00

Pos logit contributions
 itself+0.038
stick+0.037
iped+0.035
ze+0.034
ez+0.033
Neg logit contributions
ancer-0.044
 upon-0.043
ulu-0.037
ver-0.036
 don-0.034
Token stern
Token ID26370
Feature activation+6.44e+00

Pos logit contributions
stick+0.035
 itself+0.034
iped+0.034
ze+0.033
izer+0.033
Neg logit contributions
 upon-0.028
ancer-0.028
ulu-0.023
ver-0.022
 there-0.022
Token face
Token ID1986
Feature activation+1.33e+00

Pos logit contributions
stick+0.149
 itself+0.145
bons+0.140
iped+0.139
esh+0.139
Neg logit contributions
 upon-0.143
 there-0.122
 don-0.119
ancer-0.118
 they-0.116
Token and
Token ID290
Feature activation+6.02e-01

Pos logit contributions
stick+0.031
 itself+0.030
iped+0.029
ez+0.029
ze+0.029
Neg logit contributions
 upon-0.031
ancer-0.029
ulu-0.025
ver-0.024
 they-0.024
Token a
Token ID257
Feature activation+7.72e-01

Pos logit contributions
 itself+0.014
stick+0.013
iped+0.013
ze+0.013
ez+0.013
Neg logit contributions
 upon-0.014
ancer-0.014
ulu-0.012
ver-0.011
as-0.011
Token loud
Token ID7812
Feature activation+5.15e+00

Pos logit contributions
stick+0.019
 itself+0.018
iped+0.018
ze+0.018
age+0.018
Neg logit contributions
 upon-0.016
ancer-0.016
ulu-0.013
ver-0.013
 there-0.013
Token voice
Token ID3809
Feature activation-1.50e-01

Pos logit contributions
 itself+0.104
iped+0.102
bons+0.101
stick+0.100
ulously+0.096
Neg logit contributions
 upon-0.122
 there-0.109
 they-0.108
 don-0.108
ancer-0.100
Token?"
Token ID1701
Feature activation-3.79e-02

Pos logit contributions
 itself+0.067
stick+0.067
ze+0.065
iped+0.062
izer+0.061
Neg logit contributions
 upon-0.049
ancer-0.046
ulu-0.038
ver-0.036
as-0.035
Token she
Token ID673
Feature activation+1.37e+00

Pos logit contributions
ancer+0.001
 upon+0.001
ver+0.001
as+0.001
ulu+0.001
Neg logit contributions
 itself-0.001
stick-0.001
iped-0.001
 mistakes-0.001
ez-0.001
Token asked
Token ID1965
Feature activation+4.52e-01

Pos logit contributions
stick+0.034
 itself+0.034
iped+0.033
ze+0.032
ez+0.032
Neg logit contributions
 upon-0.029
ancer-0.028
ulu-0.024
ver-0.023
 there-0.022
Token in
Token ID287
Feature activation-6.49e-01

Pos logit contributions
stick+0.010
 itself+0.010
iped+0.009
ez+0.009
age+0.009
Neg logit contributions
 upon-0.011
ancer-0.010
 there-0.010
 they-0.010
ulu-0.009
Token a
Token ID257
Feature activation+1.08e+00

Pos logit contributions
 upon+0.015
ancer+0.014
ulu+0.012
ver+0.012
 they+0.012
Neg logit contributions
 itself-0.015
stick-0.015
iped-0.014
ze-0.014
ez-0.014
Token stern
Token ID26370
Feature activation+6.42e+00

Pos logit contributions
 itself+0.027
stick+0.027
iped+0.026
ez+0.025
ze+0.025
Neg logit contributions
 upon-0.023
ancer-0.020
 there-0.019
 they-0.018
ulu-0.018
Token voice
Token ID3809
Feature activation+2.29e-01

Pos logit contributions
bons+0.127
ez+0.124
stick+0.123
iped+0.123
 itself+0.120
Neg logit contributions
 upon-0.164
 there-0.141
 they-0.138
ancer-0.133
 don-0.131
Token.
Token ID13
Feature activation+3.84e+00

Pos logit contributions
stick+0.005
 itself+0.005
bons+0.005
iped+0.005
izer+0.005
Neg logit contributions
 upon-0.005
 there-0.005
 they-0.005
ancer-0.004
 though-0.004
Token "
Token ID366
Feature activation+2.69e+00

Pos logit contributions
stick+0.103
 itself+0.103
iped+0.100
ez+0.097
ze+0.096
Neg logit contributions
 upon-0.076
ancer-0.067
 there-0.059
 they-0.058
ulu-0.057
TokenWhy
Token ID5195
Feature activation-4.48e+00

Pos logit contributions
 itself+0.059
stick+0.058
ez+0.055
iped+0.054
ze+0.054
Neg logit contributions
 upon-0.066
ancer-0.061
ver-0.053
ulu-0.053
 there-0.052
Token are
Token ID389
Feature activation-2.28e+00

Pos logit contributions
ancer+0.083
 upon+0.078
ulu+0.077
ait+0.072
 thirsty+0.072
Neg logit contributions
 itself-0.103
ze-0.098
stick-0.095
 wishes-0.095
 mistakes-0.095
Token an
Token ID281
Feature activation+7.36e-01

Pos logit contributions
 itself+0.043
stick+0.042
iped+0.040
ze+0.039
ez+0.039
Neg logit contributions
 upon-0.040
ancer-0.037
 there-0.031
ulu-0.031
 they-0.031
Token old
Token ID1468
Feature activation+3.63e+00

Pos logit contributions
stick+0.018
iped+0.018
ez+0.017
 itself+0.017
age+0.017
Neg logit contributions
 upon-0.015
 there-0.013
ancer-0.013
ulu-0.012
 thirsty-0.012
Token woman
Token ID2415
Feature activation+1.33e+00

Pos logit contributions
 itself+0.080
stick+0.080
iped+0.077
ez+0.074
ze+0.073
Neg logit contributions
 upon-0.089
ancer-0.081
 there-0.072
 they-0.071
ulu-0.071
Token with
Token ID351
Feature activation-2.38e+00

Pos logit contributions
iped+0.031
 itself+0.031
stick+0.030
bons+0.029
izer+0.029
Neg logit contributions
 upon-0.031
 there-0.027
ancer-0.026
 they-0.025
ulu-0.024
Token a
Token ID257
Feature activation+1.35e+00

Pos logit contributions
 upon+0.055
ancer+0.051
ulu+0.044
 there+0.044
 they+0.043
Neg logit contributions
 itself-0.057
stick-0.056
iped-0.054
ez-0.052
ze-0.052
Token stern
Token ID26370
Feature activation+6.41e+00

Pos logit contributions
stick+0.034
 itself+0.033
iped+0.032
ze+0.031
ez+0.031
Neg logit contributions
 upon-0.029
ancer-0.027
ulu-0.023
 there-0.023
 they-0.023
Token expression
Token ID5408
Feature activation+7.54e-01

Pos logit contributions
bons+0.122
stick+0.120
iped+0.120
ez+0.118
 itself+0.118
Neg logit contributions
 upon-0.168
ancer-0.141
 don-0.140
 there-0.139
 they-0.136
Token.
Token ID13
Feature activation+3.30e+00

Pos logit contributions
 itself+0.018
stick+0.018
iped+0.017
ez+0.017
ze+0.016
Neg logit contributions
 upon-0.017
ancer-0.015
 there-0.014
 they-0.014
ulu-0.013
Token
Token ID220
Feature activation-1.99e+00

Pos logit contributions
 itself+0.079
stick+0.077
iped+0.074
ez+0.072
ze+0.070
Neg logit contributions
 upon-0.074
ancer-0.069
 there-0.058
ulu-0.058
ver-0.058
Token
Token ID198
Feature activation-2.10e+00

Pos logit contributions
 upon+0.043
ancer+0.041
ulu+0.035
 there+0.035
 they+0.034
Neg logit contributions
 itself-0.048
stick-0.046
iped-0.045
ez-0.043
ze-0.042
Token
Token ID198
Feature activation-2.32e+00

Pos logit contributions
 upon+0.044
ancer+0.040
 there+0.034
 they+0.034
ulu+0.033
Neg logit contributions
 itself-0.054
stick-0.053
iped-0.051
ez-0.050
ze-0.050
Token didn
Token ID1422
Feature activation+2.11e+00

Pos logit contributions
stick+0.042
 itself+0.041
iped+0.040
izer+0.039
ez+0.039
Neg logit contributions
 upon-0.035
 they-0.028
ancer-0.028
 there-0.028
ver-0.026
Token't
Token ID470
Feature activation+5.87e-01

Pos logit contributions
stick+0.069
iped+0.069
 itself+0.068
ez+0.067
bons+0.065
Neg logit contributions
 upon-0.029
ancer-0.024
 they-0.021
 there-0.020
ver-0.020
Token understand
Token ID1833
Feature activation+2.97e+00

Pos logit contributions
 itself+0.015
stick+0.015
ez+0.014
age+0.014
ze+0.014
Neg logit contributions
 upon-0.013
ancer-0.011
 there-0.010
ulu-0.009
ver-0.009
Token what
Token ID644
Feature activation-2.59e+00

Pos logit contributions
 itself+0.078
stick+0.076
iped+0.072
ze+0.071
ez+0.071
Neg logit contributions
 upon-0.062
ancer-0.060
ulu-0.050
ver-0.048
as-0.045
Token "
Token ID366
Feature activation+2.58e+00

Pos logit contributions
 upon+0.058
ancer+0.053
 there+0.047
 they+0.046
ulu+0.046
Neg logit contributions
stick-0.063
 itself-0.063
iped-0.061
ez-0.058
ze-0.058
Tokenmod
Token ID4666
Feature activation+6.38e+00

Pos logit contributions
stick+0.050
 itself+0.049
ez+0.046
iped+0.045
age+0.045
Neg logit contributions
 upon-0.071
ancer-0.064
 they-0.060
ulu-0.059
 there-0.059
Tokenest
Token ID395
Feature activation+4.30e-01

Pos logit contributions
ez+0.095
stick+0.092
iped+0.091
izer+0.091
nob+0.090
Neg logit contributions
 upon-0.199
ancer-0.182
 there-0.163
ver-0.161
 don-0.159
Token"
Token ID1
Feature activation-2.23e-01

Pos logit contributions
 itself+0.009
stick+0.009
iped+0.008
ez+0.008
ze+0.008
Neg logit contributions
 upon-0.011
ancer-0.010
 there-0.009
 they-0.009
ulu-0.009
Token meant
Token ID4001
Feature activation+7.06e-01

Pos logit contributions
 upon+0.005
ancer+0.005
ulu+0.004
 there+0.004
 they+0.004
Neg logit contributions
 itself-0.005
stick-0.005
iped-0.005
age-0.005
ez-0.005
Token,
Token ID11
Feature activation-3.84e+00

Pos logit contributions
 itself+0.016
stick+0.016
iped+0.015
ze+0.014
ez+0.014
Neg logit contributions
 upon-0.017
ancer-0.016
ulu-0.014
ver-0.013
 there-0.013
Token but
Token ID475
Feature activation-3.38e+00

Pos logit contributions
 upon+0.071
ancer+0.064
ulu+0.053
ver+0.051
 there+0.049
Neg logit contributions
 itself-0.113
stick-0.110
iped-0.105
ze-0.102
ez-0.101
Tokenre
Token ID260
Feature activation-2.39e-01

Pos logit contributions
 upon+0.083
ancer+0.079
 thirsty+0.071
 there+0.070
 hotels+0.069
Neg logit contributions
stick-0.128
age-0.125
 itself-0.124
iped-0.120
ez-0.119
Token not
Token ID407
Feature activation-4.67e-01

Pos logit contributions
ancer+0.005
 upon+0.005
ulu+0.005
 however+0.004
ver+0.004
Neg logit contributions
 itself-0.006
stick-0.005
ze-0.005
iped-0.005
age-0.005
Token exactly
Token ID3446
Feature activation-2.01e+00

Pos logit contributions
ancer+0.011
 upon+0.011
ulu+0.010
ver+0.009
ait+0.008
Neg logit contributions
 itself-0.010
stick-0.010
ez-0.009
 mistakes-0.009
ze-0.009
Token what
Token ID644
Feature activation-4.03e+00

Pos logit contributions
ancer+0.046
 upon+0.046
ulu+0.040
ver+0.038
as+0.035
Neg logit contributions
 itself-0.047
stick-0.043
ze-0.041
iped-0.040
ez-0.040
Token we
Token ID356
Feature activation-1.41e+00

Pos logit contributions
 upon+0.075
ancer+0.072
ulu+0.062
ver+0.056
ait+0.052
Neg logit contributions
 itself-0.112
 mistakes-0.103
stick-0.102
ze-0.100
iped-0.098
Token intended
Token ID5292
Feature activation+6.38e+00

Pos logit contributions
 upon+0.028
ancer+0.028
ulu+0.025
 however+0.022
 there+0.022
Neg logit contributions
stick-0.034
 itself-0.034
ze-0.032
ez-0.032
iped-0.032
Token.
Token ID13
Feature activation+1.85e+00

Pos logit contributions
 itself+0.131
stick+0.128
iped+0.122
ze+0.116
ez+0.116
Neg logit contributions
 upon-0.168
ancer-0.157
ulu-0.140
 there-0.138
 they-0.136
Token<|endoftext|>
Token ID50256
Feature activation-2.84e+00

Pos logit contributions
 itself+0.041
stick+0.040
iped+0.038
ez+0.037
age+0.036
Neg logit contributions
 upon-0.045
ancer-0.041
 there-0.036
ulu-0.036
ver-0.035
TokenOnce
Token ID7454
Feature activation-1.17e+01

Pos logit contributions
 upon+0.054
ancer+0.052
 they+0.043
ver+0.043
ulu+0.043
Neg logit contributions
 itself-0.073
stick-0.070
ez-0.069
nob-0.068
iped-0.066
Token upon
Token ID2402
Feature activation-1.85e+00

Pos logit contributions
 marsh+0.105
ancer+0.102
 queens+0.096
First+0.083
 cam+0.083
Neg logit contributions
 itself-0.361
 mistakes-0.324
 His-0.322
 its-0.322
 From-0.320
Token a
Token ID257
Feature activation-2.14e+00

Pos logit contributions
 upon+0.042
ancer+0.038
 there+0.033
ulu+0.033
 they+0.033
Neg logit contributions
 itself-0.044
stick-0.044
iped-0.042
ez-0.041
ze-0.041
Token".
Token ID1911
Feature activation+2.91e+00

Pos logit contributions
 upon+0.009
ancer+0.009
ulu+0.009
ver+0.007
 particular+0.007
Neg logit contributions
stick-0.006
 itself-0.006
ze-0.005
age-0.005
ez-0.005
Token He
Token ID679
Feature activation+1.27e+00

Pos logit contributions
stick+0.085
ze+0.084
ez+0.083
age+0.083
iped+0.082
Neg logit contributions
 upon-0.055
 they-0.045
 there-0.043
ancer-0.040
 however-0.039
Token gave
Token ID2921
Feature activation-4.61e-01

Pos logit contributions
 itself+0.029
stick+0.029
iped+0.028
ze+0.027
ez+0.027
Neg logit contributions
 upon-0.030
ancer-0.028
ulu-0.024
 there-0.023
 they-0.023
Token Jack
Token ID3619
Feature activation+3.64e+00

Pos logit contributions
 upon+0.011
 there+0.009
ancer+0.009
 they+0.009
 however+0.008
Neg logit contributions
stick-0.011
age-0.011
iped-0.011
 itself-0.011
ze-0.010
Token a
Token ID257
Feature activation+1.59e+00

Pos logit contributions
stick+0.097
 itself+0.094
iped+0.093
izer+0.090
ez+0.089
Neg logit contributions
 upon-0.076
 there-0.061
ancer-0.059
 they-0.058
 however-0.053
Token stern
Token ID26370
Feature activation+6.37e+00

Pos logit contributions
stick+0.041
 itself+0.040
iped+0.039
ze+0.038
ez+0.038
Neg logit contributions
 upon-0.031
ancer-0.030
ulu-0.025
 there-0.025
 they-0.024
Token look
Token ID804
Feature activation-6.83e-04

Pos logit contributions
stick+0.147
bons+0.144
 itself+0.137
ez+0.135
iped+0.135
Neg logit contributions
 upon-0.148
 there-0.126
 don-0.124
 while-0.119
ancer-0.118
Token and
Token ID290
Feature activation-3.11e-02

Pos logit contributions
 upon+0.000
ancer+0.000
 there+0.000
 they+0.000
ulu+0.000
Neg logit contributions
stick-0.000
 itself-0.000
iped-0.000
ez-0.000
ze-0.000
Token walked
Token ID6807
Feature activation+2.13e+00

Pos logit contributions
 upon+0.001
ancer+0.001
ulu+0.001
ver+0.000
as+0.000
Neg logit contributions
 itself-0.001
stick-0.001
iped-0.001
ez-0.001
ze-0.001
Token away
Token ID1497
Feature activation-4.07e-01

Pos logit contributions
ez+0.047
 itself+0.046
stick+0.046
iped+0.045
bons+0.045
Neg logit contributions
 upon-0.055
 there-0.049
 they-0.045
 while-0.041
 though-0.041
Token.
Token ID13
Feature activation+2.78e+00

Pos logit contributions
 upon+0.011
ancer+0.010
ulu+0.009
 there+0.009
ver+0.008
Neg logit contributions
 itself-0.008
stick-0.008
iped-0.008
ze-0.008
ez-0.007
Token Tom
Token ID4186
Feature activation+3.60e+00

Pos logit contributions
 upon+0.012
ancer+0.011
 there+0.009
ulu+0.009
ver+0.009
Neg logit contributions
 itself-0.018
stick-0.017
iped-0.017
ez-0.016
ze-0.016
Token and
Token ID290
Feature activation+2.95e-01

Pos logit contributions
 itself+0.088
stick+0.088
iped+0.085
ez+0.082
bons+0.081
Neg logit contributions
 upon-0.080
ancer-0.069
 there-0.067
 they-0.064
 don-0.061
Token Lily
Token ID20037
Feature activation+3.76e+00

Pos logit contributions
stick+0.007
 itself+0.007
iped+0.007
ez+0.007
ze+0.006
Neg logit contributions
 upon-0.007
ancer-0.006
ver-0.005
ulu-0.005
 there-0.005
Token with
Token ID351
Feature activation-3.14e+00

Pos logit contributions
 itself+0.085
stick+0.085
iped+0.082
izer+0.080
bons+0.078
Neg logit contributions
 upon-0.092
 there-0.078
ancer-0.078
 they-0.076
 don-0.072
Token a
Token ID257
Feature activation+1.25e+00

Pos logit contributions
 upon+0.070
ancer+0.063
 there+0.056
ulu+0.055
 they+0.054
Neg logit contributions
 itself-0.077
stick-0.077
iped-0.074
ez-0.072
ze-0.071
Token stern
Token ID26370
Feature activation+6.35e+00

Pos logit contributions
stick+0.028
 itself+0.027
iped+0.027
ze+0.026
ez+0.026
Neg logit contributions
 upon-0.030
ancer-0.029
ulu-0.025
 there-0.024
 they-0.024
Token face
Token ID1986
Feature activation+1.05e+00

Pos logit contributions
stick+0.158
 itself+0.158
bons+0.157
iped+0.153
 media+0.152
Neg logit contributions
 upon-0.127
 don-0.106
 there-0.106
ancer-0.098
 they-0.097
Token.
Token ID13
Feature activation+3.95e+00

Pos logit contributions
 itself+0.020
stick+0.020
iped+0.019
ez+0.018
ze+0.018
Neg logit contributions
 upon-0.029
ancer-0.027
 there-0.024
ulu-0.024
 they-0.024
Token
Token ID198
Feature activation-2.39e+00

Pos logit contributions
 itself+0.092
stick+0.089
iped+0.086
ez+0.082
nob+0.081
Neg logit contributions
 upon-0.091
ancer-0.086
ulu-0.075
ver-0.072
 there-0.071
Token
Token ID198
Feature activation-2.33e+00

Pos logit contributions
 upon+0.042
ancer+0.033
 there+0.032
 they+0.032
ulu+0.030
Neg logit contributions
stick-0.072
iped-0.070
 itself-0.069
ez-0.067
ze-0.067
TokenTom
Token ID13787
Feature activation+2.53e+00

Pos logit contributions
 upon+0.047
ancer+0.042
 there+0.037
ulu+0.036
 they+0.036
Neg logit contributions
 itself-0.062
stick-0.062
iped-0.060
ez-0.058
ze-0.057
Token her
Token ID607
Feature activation+1.88e+00

Pos logit contributions
 itself+0.005
stick+0.005
iped+0.005
ez+0.005
ze+0.005
Neg logit contributions
 upon-0.005
ancer-0.004
 there-0.004
 they-0.004
ulu-0.004
Token mother
Token ID2802
Feature activation-9.76e-02

Pos logit contributions
 itself+0.048
stick+0.048
iped+0.046
ze+0.045
ez+0.045
Neg logit contributions
 upon-0.040
ancer-0.036
 there-0.031
 they-0.031
ulu-0.030
Token said
Token ID531
Feature activation+1.31e+00

Pos logit contributions
 upon+0.002
ancer+0.002
 there+0.001
 they+0.001
ulu+0.001
Neg logit contributions
 itself-0.003
stick-0.003
iped-0.003
ez-0.003
ze-0.003
Token in
Token ID287
Feature activation-1.04e+00

Pos logit contributions
bons+0.034
iped+0.034
stick+0.033
izer+0.033
 itself+0.033
Neg logit contributions
 upon-0.027
 they-0.024
 there-0.023
ancer-0.022
 while-0.021
Token a
Token ID257
Feature activation+9.02e-01

Pos logit contributions
 upon+0.025
ancer+0.024
ulu+0.020
ver+0.020
 they+0.020
Neg logit contributions
 itself-0.024
stick-0.024
iped-0.023
ez-0.022
ze-0.022
Token stern
Token ID26370
Feature activation+6.33e+00

Pos logit contributions
 itself+0.024
stick+0.024
iped+0.023
ez+0.022
bons+0.022
Neg logit contributions
 upon-0.019
ancer-0.016
 there-0.015
 they-0.014
ulu-0.014
Token voice
Token ID3809
Feature activation+4.23e-01

Pos logit contributions
bons+0.139
iped+0.129
stick+0.126
 itself+0.126
ez+0.126
Neg logit contributions
 upon-0.146
 there-0.126
 don-0.122
 they-0.121
 though-0.114
Token.
Token ID13
Feature activation+2.93e+00

Pos logit contributions
 itself+0.010
iped+0.010
stick+0.010
bons+0.009
ez+0.009
Neg logit contributions
 upon-0.010
ancer-0.009
 there-0.009
 they-0.008
ulu-0.008
Token
Token ID220
Feature activation-1.89e+00

Pos logit contributions
 itself+0.068
stick+0.067
iped+0.064
nob+0.062
ze+0.061
Neg logit contributions
 upon-0.068
ancer-0.065
ulu-0.056
ver-0.054
as-0.054
Token
Token ID198
Feature activation-2.17e+00

Pos logit contributions
 upon+0.043
ancer+0.041
ulu+0.036
 there+0.034
 they+0.033
Neg logit contributions
 itself-0.044
stick-0.042
iped-0.041
nob-0.039
age-0.039
Token
Token ID198
Feature activation-2.40e+00

Pos logit contributions
 upon+0.046
ancer+0.042
ulu+0.035
 there+0.035
 they+0.035
Neg logit contributions
 itself-0.056
stick-0.055
iped-0.053
ze-0.051
ez-0.051
Token will
Token ID481
Feature activation+2.42e+00

Pos logit contributions
ancer+0.010
 upon+0.010
 however+0.008
ulu+0.008
 they+0.008
Neg logit contributions
age-0.012
ez-0.012
stick-0.011
 itself-0.011
ze-0.011
Token buy
Token ID2822
Feature activation+1.84e+00

Pos logit contributions
stick+0.056
 itself+0.055
ze+0.051
iped+0.051
age+0.050
Neg logit contributions
 upon-0.057
ancer-0.054
ulu-0.047
ver-0.045
 there-0.045
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 itself+0.044
stick+0.041
iped+0.039
bons+0.039
ez+0.038
Neg logit contributions
 upon-0.044
ancer-0.040
ulu-0.035
ver-0.035
 there-0.033
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
 itself+0.020
stick+0.019
bons+0.018
ze+0.017
nob+0.017
Neg logit contributions
ancer-0.026
 upon-0.025
ver-0.023
ulu-0.023
 don-0.021
Token fan
Token ID4336
Feature activation-2.48e-01

Pos logit contributions
 itself+0.024
stick+0.024
iped+0.022
ze+0.022
bons+0.021
Neg logit contributions
 upon-0.033
ancer-0.031
ver-0.028
ulu-0.028
 they-0.028
Tokenci
Token ID979
Feature activation+6.32e+00

Pos logit contributions
 upon+0.006
ancer+0.006
ulu+0.005
ver+0.005
 there+0.005
Neg logit contributions
stick-0.006
 itself-0.005
age-0.005
ze-0.005
nob-0.005
Tokenest
Token ID395
Feature activation-1.99e-01

Pos logit contributions
 itself+0.087
iped+0.081
stick+0.080
ez+0.077
izer+0.073
Neg logit contributions
 upon-0.207
ancer-0.191
 there-0.177
 they-0.177
 don-0.172
Token dress
Token ID6576
Feature activation-2.88e-01

Pos logit contributions
 upon+0.005
ancer+0.005
ver+0.004
ulu+0.004
 don+0.004
Neg logit contributions
stick-0.004
 itself-0.004
age-0.004
ze-0.004
bons-0.004
Token in
Token ID287
Feature activation-1.57e+00

Pos logit contributions
 upon+0.007
ancer+0.007
ulu+0.006
 thirsty+0.006
ver+0.006
Neg logit contributions
stick-0.006
 itself-0.006
age-0.005
er-0.005
ze-0.005
Token the
Token ID262
Feature activation+1.29e-01

Pos logit contributions
ancer+0.042
 upon+0.041
ver+0.038
ulu+0.035
as+0.035
Neg logit contributions
 itself-0.034
stick-0.030
bons-0.028
 mistakes-0.026
 its-0.026
Token store
Token ID3650
Feature activation-5.90e-01

Pos logit contributions
stick+0.003
 itself+0.003
ze+0.003
iped+0.003
bons+0.003
Neg logit contributions
 upon-0.003
ancer-0.003
ver-0.003
ulu-0.002
 they-0.002
Token didn
Token ID1422
Feature activation+2.25e+00

Pos logit contributions
stick+0.045
 itself+0.045
bons+0.044
izer+0.044
eness+0.042
Neg logit contributions
 upon-0.031
 they-0.026
 there-0.024
 however-0.023
 don-0.022
Token't
Token ID470
Feature activation+4.54e-01

Pos logit contributions
stick+0.077
 itself+0.076
iped+0.076
ez+0.073
bons+0.072
Neg logit contributions
 upon-0.028
ancer-0.022
 they-0.021
 there-0.019
ver-0.018
Token know
Token ID760
Feature activation-9.31e-01

Pos logit contributions
 itself+0.012
stick+0.012
ez+0.011
age+0.011
iped+0.011
Neg logit contributions
 upon-0.009
ancer-0.008
 there-0.007
ulu-0.007
 they-0.007
Token what
Token ID644
Feature activation-2.51e+00

Pos logit contributions
 upon+0.021
ancer+0.021
ulu+0.018
 thirsty+0.016
ver+0.016
Neg logit contributions
 itself-0.022
stick-0.020
iped-0.019
ez-0.019
ze-0.019
Token "
Token ID366
Feature activation+2.26e+00

Pos logit contributions
 upon+0.057
ancer+0.051
 there+0.048
 they+0.047
ulu+0.045
Neg logit contributions
stick-0.060
 itself-0.060
iped-0.058
ez-0.056
ze-0.055
Tokenmod
Token ID4666
Feature activation+6.31e+00

Pos logit contributions
stick+0.043
 itself+0.043
iped+0.040
ez+0.040
age+0.039
Neg logit contributions
 upon-0.062
ancer-0.056
 they-0.052
 there-0.052
ulu-0.052
Tokenest
Token ID395
Feature activation+3.71e-01

Pos logit contributions
 media+0.100
ez+0.100
nob+0.098
zer+0.097
ze+0.097
Neg logit contributions
 upon-0.185
ancer-0.170
 there-0.151
 if-0.150
 don-0.149
Token"
Token ID1
Feature activation-3.35e-01

Pos logit contributions
stick+0.008
 itself+0.007
iped+0.007
ez+0.007
nob+0.007
Neg logit contributions
 upon-0.010
ancer-0.009
 there-0.008
 they-0.008
 don-0.008
Token meant
Token ID4001
Feature activation+8.84e-01

Pos logit contributions
 upon+0.007
ancer+0.007
 there+0.006
ulu+0.006
 they+0.006
Neg logit contributions
 itself-0.008
stick-0.008
iped-0.008
ez-0.008
ze-0.008
Token,
Token ID11
Feature activation-4.13e+00

Pos logit contributions
stick+0.023
 itself+0.023
iped+0.023
ez+0.022
ze+0.022
Neg logit contributions
 upon-0.018
ancer-0.016
 there-0.014
 they-0.014
ulu-0.013
Token but
Token ID475
Feature activation-3.24e+00

Pos logit contributions
 upon+0.068
ancer+0.060
ulu+0.048
 there+0.047
ver+0.046
Neg logit contributions
 itself-0.127
stick-0.126
iped-0.122
ze-0.118
ez-0.117
Token looked
Token ID3114
Feature activation+3.00e-01

Pos logit contributions
 itself+0.036
stick+0.035
iped+0.034
ez+0.033
ze+0.033
Neg logit contributions
 upon-0.034
ancer-0.032
ulu-0.027
 there-0.027
ver-0.026
Token at
Token ID379
Feature activation-4.01e-01

Pos logit contributions
stick+0.008
 itself+0.008
iped+0.008
ez+0.007
ze+0.007
Neg logit contributions
 upon-0.006
ancer-0.006
ulu-0.005
 there-0.005
 they-0.005
Token them
Token ID606
Feature activation+1.26e+00

Pos logit contributions
 upon+0.008
ancer+0.007
ver+0.006
 there+0.006
ulu+0.006
Neg logit contributions
 itself-0.011
stick-0.010
iped-0.010
ez-0.010
izer-0.009
Token with
Token ID351
Feature activation-2.44e+00

Pos logit contributions
stick+0.033
iped+0.033
 itself+0.033
izer+0.033
ez+0.033
Neg logit contributions
 upon-0.025
 there-0.022
 they-0.022
ancer-0.020
 don-0.019
Token a
Token ID257
Feature activation+1.17e+00

Pos logit contributions
 upon+0.061
ancer+0.056
 there+0.050
 they+0.050
ulu+0.050
Neg logit contributions
 itself-0.053
stick-0.052
iped-0.050
ez-0.048
ze-0.048
Token stern
Token ID26370
Feature activation+6.31e+00

Pos logit contributions
stick+0.026
 itself+0.025
iped+0.025
ze+0.024
ez+0.024
Neg logit contributions
 upon-0.028
ancer-0.026
ulu-0.023
 there-0.022
 they-0.022
Token face
Token ID1986
Feature activation+1.11e+00

Pos logit contributions
stick+0.155
bons+0.153
 itself+0.152
iped+0.149
 media+0.147
Neg logit contributions
 upon-0.130
 there-0.109
 don-0.108
 they-0.101
ancer-0.100
Token.
Token ID13
Feature activation+4.05e+00

Pos logit contributions
stick+0.023
 itself+0.023
iped+0.022
ez+0.021
izer+0.021
Neg logit contributions
 upon-0.028
ancer-0.025
 there-0.024
 they-0.024
ulu-0.023
Token
Token ID198
Feature activation-2.10e+00

Pos logit contributions
 itself+0.085
stick+0.080
iped+0.075
ze+0.073
ez+0.072
Neg logit contributions
 upon-0.100
ancer-0.097
ulu-0.085
ver-0.083
as-0.081
Token
Token ID198
Feature activation-2.18e+00

Pos logit contributions
 upon+0.043
ancer+0.039
 there+0.034
ulu+0.033
 they+0.033
Neg logit contributions
 itself-0.055
stick-0.054
iped-0.052
ez-0.050
ze-0.050
Token"
Token ID1
Feature activation-2.85e-02

Pos logit contributions
 upon+0.049
ancer+0.048
ver+0.041
ulu+0.040
 there+0.038
Neg logit contributions
 itself-0.051
stick-0.048
iped-0.046
ze-0.045
ez-0.045
Token happy
Token ID3772
Feature activation+1.60e+00

Pos logit contributions
ze+0.019
stick+0.019
 itself+0.018
iped+0.017
ez+0.017
Neg logit contributions
 upon-0.019
ancer-0.018
 they-0.015
ulu-0.015
 don-0.015
Token and
Token ID290
Feature activation+2.76e-01

Pos logit contributions
 itself+0.037
stick+0.037
iped+0.034
ez+0.034
ze+0.034
Neg logit contributions
 upon-0.037
ancer-0.036
ulu-0.031
ver-0.031
 there-0.029
Token warm
Token ID5814
Feature activation+2.25e+00

Pos logit contributions
 itself+0.007
stick+0.007
iped+0.007
ez+0.007
ze+0.007
Neg logit contributions
 upon-0.006
ancer-0.005
 there-0.005
ulu-0.004
 they-0.004
Token.
Token ID13
Feature activation+2.83e+00

Pos logit contributions
 itself+0.053
stick+0.053
iped+0.051
ez+0.049
ze+0.049
Neg logit contributions
 upon-0.052
ancer-0.047
 there-0.042
 they-0.041
ulu-0.041
Token<|endoftext|>
Token ID50256
Feature activation-2.85e+00

Pos logit contributions
stick+0.067
 itself+0.067
iped+0.064
ez+0.062
ze+0.062
Neg logit contributions
 upon-0.066
ancer-0.059
 there-0.053
 they-0.052
ulu-0.051
TokenOnce
Token ID7454
Feature activation-1.22e+01

Pos logit contributions
 upon+0.056
ancer+0.050
 there+0.044
 they+0.043
ulu+0.042
Neg logit contributions
stick-0.076
 itself-0.076
iped-0.074
ez-0.072
ze-0.071
Token upon
Token ID2402
Feature activation-1.79e+00

Pos logit contributions
ancer+0.079
 marsh+0.059
Fre+0.047
ait+0.041
ver+0.040
Neg logit contributions
 itself-0.424
 mistakes-0.389
ez-0.377
esh-0.377
itched-0.377
Token a
Token ID257
Feature activation-2.26e+00

Pos logit contributions
 upon+0.037
ancer+0.031
 there+0.030
 they+0.028
ulu+0.026
Neg logit contributions
stick-0.049
iped-0.048
 itself-0.047
ez-0.045
eness-0.045
Token time
Token ID640
Feature activation-5.70e+00

Pos logit contributions
ancer+0.038
 upon+0.038
 they+0.030
 there+0.030
ulu+0.029
Neg logit contributions
 itself-0.062
iped-0.061
ez-0.061
ze-0.060
izer-0.060
Token,
Token ID11
Feature activation-8.99e+00

Pos logit contributions
ancer+0.143
ait+0.123
 upon+0.120
as+0.119
 marsh+0.117
Neg logit contributions
 itself-0.106
esh-0.097
stick-0.088
ez-0.088
nob-0.087
Token there
Token ID612
Feature activation-4.11e+00

Pos logit contributions
ancer+0.111
ait+0.094
 marsh+0.086
 hotels+0.080
 upon+0.076
Neg logit contributions
 itself-0.288
iped-0.255
izer-0.252
ze-0.246
nob-0.246
Token fun
Token ID1257
Feature activation+2.98e-01

Pos logit contributions
 itself+0.036
stick+0.036
iped+0.035
ez+0.033
ze+0.033
Neg logit contributions
 upon-0.032
ancer-0.028
 there-0.025
 they-0.025
ulu-0.024
Token time
Token ID640
Feature activation-3.55e+00

Pos logit contributions
 itself+0.009
stick+0.008
iped+0.008
ez+0.008
ze+0.008
Neg logit contributions
 upon-0.005
ancer-0.004
 there-0.004
 they-0.004
ulu-0.004
Token outside
Token ID2354
Feature activation-2.65e+00

Pos logit contributions
 upon+0.087
ancer+0.083
ulu+0.075
ver+0.072
 don+0.068
Neg logit contributions
 itself-0.076
stick-0.076
ez-0.070
ze-0.069
iped-0.068
Token.
Token ID13
Feature activation+2.11e+00

Pos logit contributions
ancer+0.070
 upon+0.070
ver+0.063
ulu+0.062
as+0.060
Neg logit contributions
stick-0.050
 itself-0.049
ze-0.044
ez-0.043
iped-0.042
Token<|endoftext|>
Token ID50256
Feature activation-3.11e+00

Pos logit contributions
 itself+0.046
stick+0.046
iped+0.044
ez+0.042
ze+0.041
Neg logit contributions
 upon-0.052
ancer-0.048
 there-0.043
ulu-0.042
 they-0.042
TokenOnce
Token ID7454
Feature activation-1.22e+01

Pos logit contributions
 upon+0.065
ancer+0.059
 they+0.050
ulu+0.050
 there+0.050
Neg logit contributions
 itself-0.079
stick-0.078
iped-0.075
ez-0.074
nob-0.072
Token upon
Token ID2402
Feature activation-1.90e+00

Pos logit contributions
ancer+0.086
 marsh+0.071
First+0.051
Fre+0.050
 hotels+0.047
Neg logit contributions
 itself-0.418
 mistakes-0.380
 its-0.371
itched-0.368
 From-0.365
Token a
Token ID257
Feature activation-2.34e+00

Pos logit contributions
 upon+0.040
ancer+0.035
 there+0.032
 they+0.031
ulu+0.030
Neg logit contributions
stick-0.049
 itself-0.048
iped-0.048
ez-0.045
ze-0.045
Token time
Token ID640
Feature activation-5.85e+00

Pos logit contributions
ancer+0.040
 upon+0.040
 they+0.031
 there+0.031
ulu+0.031
Neg logit contributions
 itself-0.063
iped-0.063
ze-0.063
ez-0.063
izer-0.062
Token,
Token ID11
Feature activation-9.01e+00

Pos logit contributions
ancer+0.155
ait+0.136
as+0.130
 marsh+0.128
 upon+0.126
Neg logit contributions
 itself-0.101
esh-0.090
nob-0.080
itched-0.079
stick-0.078
Token there
Token ID612
Feature activation-4.18e+00

Pos logit contributions
ancer+0.119
ait+0.100
 marsh+0.092
 hotels+0.089
 restaurants+0.086
Neg logit contributions
 itself-0.279
iped-0.244
izer-0.242
ze-0.239
nob-0.239
Token Lily
Token ID20037
Feature activation+2.86e+00

Pos logit contributions
 upon+0.086
ancer+0.082
ulu+0.071
ver+0.070
as+0.066
Neg logit contributions
 itself-0.069
stick-0.065
iped-0.062
ez-0.059
 mistakes-0.058
Token's
Token ID338
Feature activation+1.06e+00

Pos logit contributions
 itself+0.057
stick+0.057
iped+0.054
ez+0.052
ze+0.052
Neg logit contributions
 upon-0.075
ancer-0.070
ulu-0.063
 there-0.062
 they-0.061
Token health
Token ID1535
Feature activation-1.40e+00

Pos logit contributions
stick+0.022
 itself+0.022
iped+0.021
ez+0.020
ze+0.019
Neg logit contributions
 upon-0.028
ancer-0.025
 there-0.023
 they-0.023
 don-0.022
Token.
Token ID13
Feature activation+2.41e+00

Pos logit contributions
 upon+0.039
ancer+0.036
ulu+0.033
 there+0.032
 they+0.031
Neg logit contributions
 itself-0.027
stick-0.026
ez-0.024
iped-0.024
age-0.023
Token<|endoftext|>
Token ID50256
Feature activation-2.90e+00

Pos logit contributions
stick+0.055
 itself+0.053
iped+0.053
ze+0.052
ez+0.052
Neg logit contributions
 upon-0.059
ancer-0.050
 they-0.049
 there-0.049
ulu-0.046
TokenOnce
Token ID7454
Feature activation-1.21e+01

Pos logit contributions
 upon+0.060
ancer+0.055
ulu+0.047
 they+0.047
 there+0.047
Neg logit contributions
 itself-0.074
stick-0.072
iped-0.070
ez-0.068
nob-0.067
Token upon
Token ID2402
Feature activation-1.93e+00

Pos logit contributions
ancer+0.100
 marsh+0.087
Fre+0.071
 queens+0.068
 cam+0.062
Neg logit contributions
 itself-0.391
 mistakes-0.362
 From-0.355
 His-0.352
 its-0.349
Token a
Token ID257
Feature activation-2.29e+00

Pos logit contributions
 upon+0.041
ancer+0.036
 there+0.033
 they+0.032
ulu+0.031
Neg logit contributions
stick-0.050
 itself-0.049
iped-0.048
ez-0.046
ze-0.045
Token time
Token ID640
Feature activation-5.75e+00

Pos logit contributions
ancer+0.043
 upon+0.041
 they+0.033
ulu+0.033
 there+0.032
Neg logit contributions
ez-0.059
ze-0.059
iped-0.058
 itself-0.058
izer-0.057
Token,
Token ID11
Feature activation-9.07e+00

Pos logit contributions
ancer+0.159
ait+0.139
as+0.136
 usual+0.130
 marsh+0.130
Neg logit contributions
 itself-0.094
esh-0.084
nob-0.073
stick-0.071
 Its-0.071
Token there
Token ID612
Feature activation-3.89e+00

Pos logit contributions
ancer+0.120
ait+0.098
 marsh+0.085
as+0.081
 hotels+0.080
Neg logit contributions
 itself-0.292
iped-0.253
nob-0.251
 its-0.249
ez-0.247
Token his
Token ID465
Feature activation+1.04e+00

Pos logit contributions
 upon+0.031
ancer+0.031
ver+0.027
ulu+0.027
as+0.026
Neg logit contributions
stick-0.026
 itself-0.024
iped-0.023
ez-0.023
ze-0.022
Token mom
Token ID1995
Feature activation-2.02e+00

Pos logit contributions
stick+0.026
 itself+0.026
iped+0.025
ze+0.025
ez+0.024
Neg logit contributions
 upon-0.022
ancer-0.020
ulu-0.017
 there-0.017
ver-0.017
Tokenmy
Token ID1820
Feature activation-3.43e-01

Pos logit contributions
ancer+0.050
 upon+0.050
ulu+0.045
ver+0.042
 there+0.040
Neg logit contributions
stick-0.042
iped-0.038
ez-0.038
age-0.037
 itself-0.037
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
 upon+0.008
ancer+0.008
ulu+0.007
 there+0.007
ver+0.007
Neg logit contributions
stick-0.008
 itself-0.007
iped-0.007
ze-0.007
ez-0.007
Token<|endoftext|>
Token ID50256
Feature activation-2.68e+00

Pos logit contributions
 itself+0.031
stick+0.031
iped+0.029
ez+0.028
ze+0.027
Neg logit contributions
 upon-0.039
ancer-0.036
ulu-0.032
 there-0.032
ver-0.031
TokenOnce
Token ID7454
Feature activation-1.21e+01

Pos logit contributions
 upon+0.055
ancer+0.050
ulu+0.042
 there+0.042
 they+0.042
Neg logit contributions
 itself-0.069
stick-0.069
iped-0.066
ez-0.065
ze-0.064
Token upon
Token ID2402
Feature activation-1.53e+00

Pos logit contributions
ancer+0.098
 marsh+0.081
Fre+0.079
 queens+0.073
drops+0.072
Neg logit contributions
 itself-0.377
 mistakes-0.343
 its-0.341
 His-0.341
itched-0.340
Token a
Token ID257
Feature activation-1.94e+00

Pos logit contributions
 upon+0.032
ancer+0.028
 there+0.026
 they+0.025
ulu+0.024
Neg logit contributions
stick-0.040
 itself-0.039
iped-0.038
ez-0.037
ze-0.036
Token time
Token ID640
Feature activation-5.46e+00

Pos logit contributions
 upon+0.034
ancer+0.034
ulu+0.027
 they+0.027
 there+0.026
Neg logit contributions
ze-0.052
ez-0.051
 itself-0.051
iped-0.051
izer-0.051
Token,
Token ID11
Feature activation-8.88e+00

Pos logit contributions
ancer+0.145
ait+0.124
 upon+0.124
 usual+0.121
as+0.118
Neg logit contributions
 itself-0.096
esh-0.087
stick-0.078
ez-0.078
itched-0.078
Token there
Token ID612
Feature activation-3.87e+00

Pos logit contributions
ancer+0.119
ait+0.097
 hotels+0.086
 restaurants+0.084
 marsh+0.083
Neg logit contributions
 itself-0.284
izer-0.250
iped-0.248
ze-0.243
esh-0.241
Token and
Token ID290
Feature activation-8.60e-01

Pos logit contributions
stick+0.053
 itself+0.052
iped+0.048
ez+0.048
ze+0.047
Neg logit contributions
 upon-0.074
ancer-0.070
ulu-0.064
ver-0.062
 they-0.061
Token her
Token ID607
Feature activation+1.54e+00

Pos logit contributions
 upon+0.017
ancer+0.016
ulu+0.013
ver+0.013
as+0.013
Neg logit contributions
stick-0.023
 itself-0.022
iped-0.021
ze-0.020
ez-0.020
Token dad
Token ID9955
Feature activation-2.10e-01

Pos logit contributions
 itself+0.037
stick+0.037
ze+0.035
iped+0.034
age+0.033
Neg logit contributions
 upon-0.033
ancer-0.031
ulu-0.026
ver-0.026
 there-0.025
Token.
Token ID13
Feature activation+2.02e+00

Pos logit contributions
 upon+0.005
ancer+0.005
ulu+0.004
ver+0.004
 there+0.004
Neg logit contributions
stick-0.004
 itself-0.004
age-0.004
iped-0.004
ez-0.004
Token<|endoftext|>
Token ID50256
Feature activation-2.96e+00

Pos logit contributions
 itself+0.044
stick+0.044
iped+0.041
ez+0.040
age+0.039
Neg logit contributions
 upon-0.049
ancer-0.045
 there-0.040
ulu-0.040
ver-0.039
TokenOnce
Token ID7454
Feature activation-1.21e+01

Pos logit contributions
 upon+0.062
ancer+0.058
 they+0.049
ulu+0.049
 there+0.048
Neg logit contributions
 itself-0.074
stick-0.072
iped-0.069
ez-0.069
nob-0.068
Token upon
Token ID2402
Feature activation-1.77e+00

Pos logit contributions
ancer+0.089
 marsh+0.080
Fre+0.067
First+0.062
 cam+0.059
Neg logit contributions
 itself-0.396
 mistakes-0.355
itched-0.352
 its-0.350
esh-0.349
Token a
Token ID257
Feature activation-2.18e+00

Pos logit contributions
 upon+0.038
ancer+0.033
 there+0.030
 they+0.029
ulu+0.028
Neg logit contributions
stick-0.046
 itself-0.045
iped-0.044
ez-0.042
ze-0.042
Token time
Token ID640
Feature activation-5.67e+00

Pos logit contributions
ancer+0.038
 upon+0.038
 they+0.030
 there+0.029
ulu+0.029
Neg logit contributions
 itself-0.059
ze-0.058
iped-0.058
ez-0.058
izer-0.058
Token,
Token ID11
Feature activation-8.87e+00

Pos logit contributions
ancer+0.152
ait+0.131
as+0.126
 marsh+0.124
 usual+0.124
Neg logit contributions
 itself-0.098
esh-0.089
nob-0.078
itched-0.078
stick-0.078
Token there
Token ID612
Feature activation-3.92e+00

Pos logit contributions
ancer+0.112
ait+0.090
 marsh+0.083
 upon+0.079
 hotels+0.079
Neg logit contributions
 itself-0.290
iped-0.254
izer-0.251
ze-0.247
stick-0.247
Token Ben
Token ID3932
Feature activation+2.56e+00

Pos logit contributions
 upon+0.083
ancer+0.078
ver+0.067
ulu+0.067
 there+0.065
Neg logit contributions
 itself-0.076
stick-0.073
iped-0.068
bons-0.067
ez-0.067
Token's
Token ID338
Feature activation+1.44e+00

Pos logit contributions
 itself+0.049
stick+0.049
iped+0.046
ez+0.045
ze+0.044
Neg logit contributions
 upon-0.070
ancer-0.065
ulu-0.058
 there-0.058
 they-0.057
Token help
Token ID1037
Feature activation+9.62e-01

Pos logit contributions
stick+0.032
 itself+0.032
iped+0.031
ez+0.029
ze+0.029
Neg logit contributions
 upon-0.035
ancer-0.032
 there-0.029
 they-0.028
ulu-0.028
Token.
Token ID13
Feature activation+3.05e+00

Pos logit contributions
iped+0.021
stick+0.021
 itself+0.020
ez+0.020
izer+0.019
Neg logit contributions
 upon-0.024
ancer-0.021
 there-0.020
 they-0.020
 don-0.019
Token<|endoftext|>
Token ID50256
Feature activation-2.88e+00

Pos logit contributions
stick+0.072
 itself+0.071
iped+0.070
ze+0.067
ez+0.067
Neg logit contributions
 upon-0.072
ancer-0.064
 they-0.058
 there-0.057
ulu-0.056
TokenOnce
Token ID7454
Feature activation-1.21e+01

Pos logit contributions
 upon+0.058
ancer+0.051
 there+0.045
 they+0.044
ulu+0.044
Neg logit contributions
 itself-0.076
stick-0.076
iped-0.073
ez-0.071
ze-0.070
Token upon
Token ID2402
Feature activation-1.86e+00

Pos logit contributions
ancer+0.088
 marsh+0.070
Fre+0.054
 hotels+0.049
First+0.048
Neg logit contributions
 itself-0.412
 mistakes-0.374
 its-0.369
 From-0.362
 His-0.362
Token a
Token ID257
Feature activation-2.27e+00

Pos logit contributions
 upon+0.039
ancer+0.033
 there+0.031
 they+0.030
ulu+0.029
Neg logit contributions
stick-0.049
 itself-0.048
iped-0.048
ez-0.045
izer-0.044
Token time
Token ID640
Feature activation-5.68e+00

Pos logit contributions
 upon+0.038
ancer+0.037
 they+0.029
ulu+0.029
 there+0.029
Neg logit contributions
 itself-0.063
iped-0.062
ez-0.062
ze-0.062
izer-0.061
Token,
Token ID11
Feature activation-8.99e+00

Pos logit contributions
ancer+0.147
ait+0.127
as+0.122
 marsh+0.121
 upon+0.120
Neg logit contributions
 itself-0.104
esh-0.093
ez-0.084
iped-0.083
stick-0.083
Token there
Token ID612
Feature activation-3.96e+00

Pos logit contributions
ancer+0.115
ait+0.097
 marsh+0.088
 hotels+0.084
 restaurants+0.077
Neg logit contributions
 itself-0.288
iped-0.256
izer-0.252
ez-0.248
ze-0.247
Token t
Token ID256
Feature activation-1.92e+00

Pos logit contributions
 upon+0.029
ancer+0.027
ulu+0.023
ver+0.021
as+0.021
Neg logit contributions
stick-0.041
 itself-0.040
iped-0.038
ze-0.037
ez-0.037
Tokeneddy
Token ID21874
Feature activation+3.55e+00

Pos logit contributions
 upon+0.067
ancer+0.063
 there+0.057
 they+0.057
ulu+0.056
Neg logit contributions
 itself-0.024
stick-0.022
iped-0.022
nob-0.022
ze-0.022
Token bear
Token ID6842
Feature activation-8.61e-01

Pos logit contributions
iped+0.084
 itself+0.082
stick+0.081
nob+0.080
izer+0.079
Neg logit contributions
 upon-0.083
ancer-0.071
 they-0.069
 there-0.068
 don-0.064
Token.
Token ID13
Feature activation+2.69e+00

Pos logit contributions
 upon+0.021
ancer+0.019
 they+0.018
 there+0.018
ver+0.016
Neg logit contributions
 itself-0.019
iped-0.019
stick-0.018
izer-0.018
ez-0.018
Token<|endoftext|>
Token ID50256
Feature activation-2.88e+00

Pos logit contributions
stick+0.061
 itself+0.060
iped+0.060
ze+0.058
ez+0.058
Neg logit contributions
 upon-0.065
ancer-0.057
 they-0.055
 there-0.053
ulu-0.050
TokenOnce
Token ID7454
Feature activation-1.21e+01

Pos logit contributions
 upon+0.057
ancer+0.050
 there+0.044
 they+0.043
ulu+0.043
Neg logit contributions
stick-0.077
 itself-0.077
iped-0.075
ez-0.072
ze-0.072
Token upon
Token ID2402
Feature activation-1.58e+00

Pos logit contributions
ancer+0.090
 marsh+0.070
Fre+0.063
 queens+0.057
First+0.054
Neg logit contributions
 itself-0.396
 mistakes-0.364
 its-0.359
 His-0.357
itched-0.357
Token a
Token ID257
Feature activation-2.07e+00

Pos logit contributions
 upon+0.032
ancer+0.027
 there+0.026
 they+0.024
ulu+0.023
Neg logit contributions
stick-0.043
iped-0.042
 itself-0.042
ez-0.040
eness-0.039
Token time
Token ID640
Feature activation-5.51e+00

Pos logit contributions
ancer+0.036
 upon+0.036
 they+0.028
ulu+0.028
 there+0.028
Neg logit contributions
 itself-0.055
ze-0.055
ez-0.055
iped-0.055
izer-0.054
Token,
Token ID11
Feature activation-8.82e+00

Pos logit contributions
ancer+0.145
ait+0.123
 upon+0.120
as+0.120
 usual+0.118
Neg logit contributions
 itself-0.098
esh-0.092
ez-0.082
nob-0.081
stick-0.080
Token there
Token ID612
Feature activation-3.71e+00

Pos logit contributions
ancer+0.112
ait+0.091
 marsh+0.076
 upon+0.076
 hotels+0.076
Neg logit contributions
 itself-0.287
iped-0.256
izer-0.252
ze-0.249
ez-0.247
Token.
Token ID13
Feature activation+2.83e+00

Pos logit contributions
 upon+0.001
ancer+0.001
ulu+0.001
ver+0.001
 don+0.001
Neg logit contributions
stick-0.000
age-0.000
 itself-0.000
ez-0.000
ze-0.000
Token The
Token ID383
Feature activation+1.19e-01

Pos logit contributions
 itself+0.070
stick+0.070
iped+0.067
ez+0.065
ze+0.064
Neg logit contributions
 upon-0.062
ancer-0.056
 there-0.049
ulu-0.049
 they-0.048
Token end
Token ID886
Feature activation-3.27e+00

Pos logit contributions
 itself+0.003
stick+0.003
iped+0.003
ze+0.003
ez+0.003
Neg logit contributions
 upon-0.003
ancer-0.002
 there-0.002
ulu-0.002
 they-0.002
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
 upon+0.083
ancer+0.073
 there+0.070
 they+0.069
ulu+0.065
Neg logit contributions
stick-0.073
iped-0.072
 itself-0.071
ez-0.065
ze-0.065
Token<|endoftext|>
Token ID50256
Feature activation-3.03e+00

Pos logit contributions
stick+0.053
 itself+0.053
iped+0.051
ez+0.049
ze+0.049
Neg logit contributions
 upon-0.039
ancer-0.034
 there-0.030
 they-0.029
ulu-0.029
TokenOnce
Token ID7454
Feature activation-1.21e+01

Pos logit contributions
 upon+0.059
ancer+0.052
 there+0.046
 they+0.045
ulu+0.044
Neg logit contributions
stick-0.082
 itself-0.082
iped-0.079
ez-0.076
ze-0.076
Token upon
Token ID2402
Feature activation-1.80e+00

Pos logit contributions
ancer+0.096
 marsh+0.074
Fre+0.072
 queens+0.065
First+0.065
Neg logit contributions
 itself-0.388
 mistakes-0.361
 its-0.356
itched-0.350
 From-0.348
Token a
Token ID257
Feature activation-2.15e+00

Pos logit contributions
 upon+0.039
ancer+0.034
 there+0.031
 they+0.030
ulu+0.029
Neg logit contributions
stick-0.046
 itself-0.045
iped-0.045
ez-0.043
ze-0.042
Token time
Token ID640
Feature activation-5.58e+00

Pos logit contributions
ancer+0.037
 upon+0.037
ulu+0.029
 they+0.028
 there+0.028
Neg logit contributions
 itself-0.058
ze-0.057
ez-0.057
iped-0.057
izer-0.057
Token,
Token ID11
Feature activation-8.90e+00

Pos logit contributions
ancer+0.147
ait+0.126
 upon+0.125
as+0.122
ulu+0.120
Neg logit contributions
 itself-0.099
esh-0.090
nob-0.083
ez-0.083
iped-0.081
Token there
Token ID612
Feature activation-3.76e+00

Pos logit contributions
ancer+0.123
ait+0.101
 marsh+0.083
 hotels+0.083
Fre+0.082
Neg logit contributions
 itself-0.282
iped-0.247
nob-0.246
izer-0.244
ez-0.242
Token kind
Token ID1611
Feature activation+3.41e+00

Pos logit contributions
 itself+0.026
stick+0.026
iped+0.026
ez+0.024
nob+0.024
Neg logit contributions
 upon-0.022
ancer-0.019
 there-0.019
 they-0.018
ulu-0.017
Token and
Token ID290
Feature activation-2.32e-01

Pos logit contributions
 itself+0.075
stick+0.074
iped+0.072
ez+0.069
ze+0.068
Neg logit contributions
 upon-0.084
ancer-0.076
 there-0.070
 they-0.068
ulu-0.067
Token helpful
Token ID7613
Feature activation+2.31e+00

Pos logit contributions
 upon+0.006
ancer+0.005
 there+0.005
 they+0.005
ulu+0.004
Neg logit contributions
stick-0.005
 itself-0.005
iped-0.005
ez-0.005
ze-0.005
Token.
Token ID13
Feature activation+2.53e+00

Pos logit contributions
 itself+0.048
stick+0.048
iped+0.047
ez+0.045
ze+0.044
Neg logit contributions
 upon-0.060
ancer-0.054
 there-0.050
 they-0.049
ulu-0.047
Token<|endoftext|>
Token ID50256
Feature activation-2.89e+00

Pos logit contributions
stick+0.065
iped+0.063
ze+0.063
 itself+0.063
eness+0.062
Neg logit contributions
 upon-0.056
 they-0.047
ancer-0.046
 there-0.044
ulu-0.042
TokenOnce
Token ID7454
Feature activation-1.21e+01

Pos logit contributions
 upon+0.059
ancer+0.054
 there+0.046
ulu+0.046
 they+0.046
Neg logit contributions
 itself-0.074
stick-0.073
iped-0.071
ez-0.069
nob-0.068
Token upon
Token ID2402
Feature activation-1.88e+00

Pos logit contributions
ancer+0.093
 marsh+0.078
Fre+0.068
 queens+0.064
 cam+0.060
Neg logit contributions
 itself-0.393
 mistakes-0.363
 His-0.356
 its-0.356
 From-0.354
Token a
Token ID257
Feature activation-2.32e+00

Pos logit contributions
 upon+0.039
ancer+0.033
 there+0.032
 they+0.030
ulu+0.029
Neg logit contributions
stick-0.050
iped-0.049
 itself-0.049
ez-0.046
izer-0.045
Token time
Token ID640
Feature activation-5.72e+00

Pos logit contributions
ancer+0.041
 upon+0.040
 they+0.032
ulu+0.032
 there+0.031
Neg logit contributions
 itself-0.062
ez-0.062
iped-0.062
ze-0.062
izer-0.061
Token,
Token ID11
Feature activation-8.82e+00

Pos logit contributions
ancer+0.156
ait+0.135
as+0.133
ulu+0.127
 marsh+0.127
Neg logit contributions
 itself-0.098
esh-0.087
nob-0.078
 Its-0.076
stick-0.075
Token there
Token ID612
Feature activation-3.83e+00

Pos logit contributions
ancer+0.119
ait+0.097
as+0.083
 marsh+0.081
 hotels+0.077
Neg logit contributions
 itself-0.283
nob-0.246
iped-0.246
 its-0.243
ez-0.241
Token her
Token ID607
Feature activation+2.07e+00

Pos logit contributions
 upon+0.021
ancer+0.020
as+0.017
ulu+0.017
ver+0.017
Neg logit contributions
 itself-0.023
stick-0.023
ze-0.021
iped-0.021
ez-0.021
Token mom
Token ID1995
Feature activation-4.08e-01

Pos logit contributions
 itself+0.053
stick+0.053
ze+0.051
 mistakes+0.049
iped+0.049
Neg logit contributions
 upon-0.042
ancer-0.038
ulu-0.033
ver-0.032
 there-0.032
Tokenmy
Token ID1820
Feature activation+4.48e-01

Pos logit contributions
 upon+0.011
ancer+0.011
ulu+0.010
ver+0.009
 don+0.009
Neg logit contributions
stick-0.008
age-0.007
 itself-0.007
ez-0.006
ze-0.006
Token.
Token ID13
Feature activation+2.25e+00

Pos logit contributions
stick+0.009
 itself+0.009
age+0.008
iped+0.008
ze+0.008
Neg logit contributions
 upon-0.012
ancer-0.011
ulu-0.010
 there-0.009
ver-0.009
Token<|endoftext|>
Token ID50256
Feature activation-2.77e+00

Pos logit contributions
 itself+0.054
stick+0.054
iped+0.051
ez+0.050
ze+0.049
Neg logit contributions
 upon-0.050
ancer-0.046
 there-0.040
ulu-0.040
ver-0.039
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.058
ancer+0.054
ulu+0.046
 they+0.045
 there+0.045
Neg logit contributions
 itself-0.070
stick-0.069
iped-0.066
ez-0.065
nob-0.064
Token upon
Token ID2402
Feature activation-1.68e+00

Pos logit contributions
ancer+0.095
 marsh+0.080
Fre+0.074
 queens+0.068
 cam+0.065
Neg logit contributions
 itself-0.386
 mistakes-0.360
 its-0.353
 From-0.346
itched-0.345
Token a
Token ID257
Feature activation-2.13e+00

Pos logit contributions
 upon+0.035
ancer+0.030
 there+0.028
 they+0.027
ulu+0.026
Neg logit contributions
stick-0.044
 itself-0.043
iped-0.043
ez-0.041
ze-0.040
Token time
Token ID640
Feature activation-5.60e+00

Pos logit contributions
ancer+0.037
 upon+0.037
ulu+0.029
 they+0.029
 there+0.028
Neg logit contributions
 itself-0.057
ze-0.057
iped-0.057
ez-0.057
izer-0.056
Token,
Token ID11
Feature activation-8.95e+00

Pos logit contributions
ancer+0.154
ait+0.133
as+0.127
 usual+0.126
 upon+0.126
Neg logit contributions
 itself-0.093
esh-0.084
ez-0.076
nob-0.075
itched-0.074
Token there
Token ID612
Feature activation-3.80e+00

Pos logit contributions
ancer+0.115
ait+0.092
 marsh+0.079
 hotels+0.078
 upon+0.078
Neg logit contributions
 itself-0.292
iped-0.257
izer-0.253
nob-0.252
ez-0.249
Token her
Token ID607
Feature activation+1.73e+00

Pos logit contributions
 itself+0.006
stick+0.006
iped+0.005
ze+0.005
ez+0.005
Neg logit contributions
 upon-0.006
ancer-0.006
ulu-0.005
ver-0.005
 there-0.005
Token animal
Token ID5044
Feature activation-2.82e-01

Pos logit contributions
stick+0.037
 itself+0.037
iped+0.036
ez+0.034
ze+0.034
Neg logit contributions
 upon-0.043
ancer-0.040
 there-0.036
 they-0.035
ulu-0.035
Token friends
Token ID2460
Feature activation-1.97e+00

Pos logit contributions
 upon+0.007
ancer+0.006
 there+0.006
 they+0.006
ulu+0.005
Neg logit contributions
 itself-0.006
stick-0.006
iped-0.006
ez-0.006
izer-0.006
Token.
Token ID13
Feature activation+2.97e+00

Pos logit contributions
 upon+0.053
ancer+0.049
 there+0.045
 they+0.044
ulu+0.044
Neg logit contributions
 itself-0.038
stick-0.038
iped-0.037
ez-0.035
ze-0.035
Token<|endoftext|>
Token ID50256
Feature activation-2.95e+00

Pos logit contributions
stick+0.081
 itself+0.079
iped+0.079
ze+0.078
ez+0.077
Neg logit contributions
 upon-0.060
ancer-0.050
 they-0.048
 there-0.047
 don-0.043
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.061
ancer+0.056
ulu+0.047
 they+0.047
 there+0.047
Neg logit contributions
 itself-0.076
stick-0.074
iped-0.072
ez-0.070
ze-0.069
Token upon
Token ID2402
Feature activation-1.62e+00

Pos logit contributions
ancer+0.094
 marsh+0.068
Fre+0.059
 queens+0.054
First+0.051
Neg logit contributions
 itself-0.397
 mistakes-0.363
 its-0.360
 His-0.357
itched-0.355
Token a
Token ID257
Feature activation-2.06e+00

Pos logit contributions
 upon+0.033
ancer+0.027
 there+0.027
 they+0.025
ulu+0.024
Neg logit contributions
stick-0.045
iped-0.043
 itself-0.043
ez-0.041
eness-0.041
Token time
Token ID640
Feature activation-5.47e+00

Pos logit contributions
ancer+0.037
 upon+0.036
ulu+0.028
 they+0.028
 there+0.027
Neg logit contributions
 itself-0.055
iped-0.055
ze-0.054
ez-0.054
izer-0.054
Token,
Token ID11
Feature activation-8.90e+00

Pos logit contributions
ancer+0.143
ait+0.121
 upon+0.120
as+0.115
ulu+0.112
Neg logit contributions
 itself-0.103
esh-0.094
nob-0.087
ez-0.086
iped-0.086
Token there
Token ID612
Feature activation-3.80e+00

Pos logit contributions
ancer+0.124
ait+0.099
 upon+0.087
 marsh+0.084
 hotels+0.083
Neg logit contributions
 itself-0.284
iped-0.254
izer-0.249
nob-0.246
ez-0.245
Token few
Token ID1178
Feature activation-3.75e+00

Pos logit contributions
stick+0.011
 itself+0.011
ze+0.011
age+0.011
nob+0.010
Neg logit contributions
 upon-0.009
ancer-0.009
ulu-0.008
ver-0.008
 there-0.008
Token days
Token ID1528
Feature activation-5.70e+00

Pos logit contributions
 upon+0.083
ancer+0.077
ulu+0.065
ver+0.064
 there+0.064
Neg logit contributions
 itself-0.093
stick-0.092
iped-0.089
ez-0.088
ze-0.087
Token later
Token ID1568
Feature activation-2.15e+00

Pos logit contributions
 upon+0.145
ancer+0.141
ulu+0.132
 don+0.115
 marsh+0.115
Neg logit contributions
stick-0.117
 itself-0.115
age-0.104
ez-0.104
iped-0.103
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
 upon+0.052
ancer+0.049
ulu+0.045
 there+0.041
ver+0.040
Neg logit contributions
 itself-0.048
stick-0.048
iped-0.045
ze-0.043
age-0.043
Token<|endoftext|>
Token ID50256
Feature activation-3.11e+00

Pos logit contributions
stick+0.045
iped+0.044
 itself+0.044
ze+0.043
izer+0.043
Neg logit contributions
 upon-0.038
ancer-0.033
 there-0.030
 they-0.029
ulu-0.028
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.061
ancer+0.053
 there+0.047
 they+0.046
ulu+0.045
Neg logit contributions
stick-0.084
 itself-0.084
iped-0.082
ez-0.079
ze-0.078
Token upon
Token ID2402
Feature activation-1.89e+00

Pos logit contributions
ancer+0.088
 marsh+0.065
Fre+0.057
First+0.046
 hotels+0.044
Neg logit contributions
 itself-0.406
 mistakes-0.374
 From-0.366
 its-0.365
 His-0.363
Token a
Token ID257
Feature activation-2.17e+00

Pos logit contributions
 upon+0.039
ancer+0.034
 there+0.032
 they+0.030
ulu+0.029
Neg logit contributions
stick-0.050
iped-0.049
 itself-0.049
ez-0.046
izer-0.045
Token time
Token ID640
Feature activation-5.67e+00

Pos logit contributions
ancer+0.039
 upon+0.038
ulu+0.030
 they+0.030
 there+0.029
Neg logit contributions
ze-0.057
 itself-0.057
ez-0.057
iped-0.057
izer-0.056
Token,
Token ID11
Feature activation-8.80e+00

Pos logit contributions
ancer+0.151
ait+0.129
as+0.127
 upon+0.125
ulu+0.124
Neg logit contributions
 itself-0.098
esh-0.089
ez-0.080
nob-0.080
itched-0.079
Token there
Token ID612
Feature activation-3.83e+00

Pos logit contributions
ancer+0.120
ait+0.095
 upon+0.085
 marsh+0.082
as+0.082
Neg logit contributions
 itself-0.275
iped-0.246
ez-0.241
nob-0.241
izer-0.241
Token were
Token ID547
Feature activation-5.58e-02

Pos logit contributions
stick+0.065
 itself+0.065
iped+0.063
ez+0.061
ze+0.061
Neg logit contributions
 upon-0.037
ancer-0.032
 there-0.027
ulu-0.026
 they-0.026
Token happy
Token ID3772
Feature activation+1.46e+00

Pos logit contributions
 upon+0.001
ancer+0.001
ulu+0.001
 there+0.001
 they+0.001
Neg logit contributions
 itself-0.002
stick-0.002
iped-0.001
ze-0.001
ez-0.001
Token friends
Token ID2460
Feature activation-2.56e+00

Pos logit contributions
iped+0.034
stick+0.034
 itself+0.034
izer+0.033
nob+0.032
Neg logit contributions
 upon-0.034
ancer-0.030
 they-0.029
 there-0.029
 however-0.026
Token.
Token ID13
Feature activation+2.12e+00

Pos logit contributions
 upon+0.058
 there+0.052
ancer+0.052
 they+0.051
ver+0.047
Neg logit contributions
iped-0.063
stick-0.061
izer-0.060
 itself-0.058
bons-0.057
Token<|endoftext|>
Token ID50256
Feature activation-3.01e+00

Pos logit contributions
iped+0.063
stick+0.062
ze+0.060
nob+0.059
 itself+0.059
Neg logit contributions
 upon-0.040
 they-0.033
ancer-0.032
 there-0.029
ulu-0.027
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.061
ancer+0.055
 there+0.048
 they+0.047
ulu+0.047
Neg logit contributions
 itself-0.078
stick-0.077
iped-0.075
ez-0.073
ze-0.072
Token upon
Token ID2402
Feature activation-1.96e+00

Pos logit contributions
ancer+0.079
 marsh+0.058
 queens+0.049
Fre+0.047
 hotels+0.043
Neg logit contributions
 itself-0.410
 mistakes-0.376
 its-0.365
itched-0.365
esh-0.364
Token a
Token ID257
Feature activation-2.22e+00

Pos logit contributions
 upon+0.041
ancer+0.035
 there+0.033
 they+0.032
ulu+0.030
Neg logit contributions
stick-0.052
 itself-0.050
iped-0.050
ez-0.048
izer-0.047
Token time
Token ID640
Feature activation-5.63e+00

Pos logit contributions
 upon+0.037
ancer+0.037
 they+0.028
ulu+0.028
 there+0.028
Neg logit contributions
 itself-0.062
iped-0.061
ze-0.060
ez-0.060
izer-0.060
Token,
Token ID11
Feature activation-8.87e+00

Pos logit contributions
ancer+0.150
ait+0.131
as+0.124
 upon+0.124
 marsh+0.120
Neg logit contributions
 itself-0.101
esh-0.088
ez-0.080
nob-0.079
itched-0.079
Token there
Token ID612
Feature activation-3.88e+00

Pos logit contributions
ancer+0.117
ait+0.099
 upon+0.080
 hotels+0.079
as+0.078
Neg logit contributions
 itself-0.287
iped-0.251
izer-0.247
nob-0.247
ze-0.247
Token her
Token ID607
Feature activation+1.88e+00

Pos logit contributions
 upon+0.014
ancer+0.013
ulu+0.012
ver+0.012
as+0.011
Neg logit contributions
 itself-0.011
stick-0.011
ze-0.010
iped-0.010
ez-0.010
Token baby
Token ID5156
Feature activation+5.83e-01

Pos logit contributions
stick+0.039
 itself+0.039
iped+0.037
ze+0.035
ez+0.035
Neg logit contributions
 upon-0.049
ancer-0.045
ulu-0.040
 there-0.040
 they-0.040
Token brother
Token ID3956
Feature activation+1.15e+00

Pos logit contributions
stick+0.011
 itself+0.010
ze+0.010
ez+0.010
iped+0.010
Neg logit contributions
 upon-0.016
ancer-0.015
ulu-0.014
ver-0.014
 they-0.013
Token.
Token ID13
Feature activation+2.62e+00

Pos logit contributions
stick+0.021
 itself+0.021
iped+0.020
ze+0.019
ez+0.019
Neg logit contributions
 upon-0.032
ancer-0.030
ulu-0.028
 there-0.027
 they-0.027
Token<|endoftext|>
Token ID50256
Feature activation-2.74e+00

Pos logit contributions
stick+0.067
 itself+0.066
iped+0.065
ze+0.063
ez+0.062
Neg logit contributions
 upon-0.056
ancer-0.049
 there-0.044
 they-0.044
ulu-0.042
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.057
ancer+0.053
ulu+0.045
 they+0.045
 there+0.044
Neg logit contributions
 itself-0.069
stick-0.068
iped-0.065
ez-0.064
nob-0.063
Token upon
Token ID2402
Feature activation-1.88e+00

Pos logit contributions
ancer+0.089
 marsh+0.081
Fre+0.067
 cam+0.061
 queens+0.058
Neg logit contributions
 itself-0.394
 mistakes-0.371
 From-0.354
 His-0.353
 its-0.352
Token a
Token ID257
Feature activation-2.34e+00

Pos logit contributions
 upon+0.039
ancer+0.033
 there+0.032
 they+0.030
ulu+0.029
Neg logit contributions
stick-0.050
iped-0.049
 itself-0.049
ez-0.046
izer-0.045
Token time
Token ID640
Feature activation-5.76e+00

Pos logit contributions
ancer+0.042
 upon+0.041
 they+0.032
ulu+0.032
 there+0.032
Neg logit contributions
ez-0.062
ze-0.061
 itself-0.061
iped-0.061
izer-0.060
Token,
Token ID11
Feature activation-8.95e+00

Pos logit contributions
ancer+0.157
ait+0.136
as+0.134
ulu+0.128
 thirsty+0.128
Neg logit contributions
 itself-0.098
esh-0.087
nob-0.077
ez-0.076
 Its-0.075
Token there
Token ID612
Feature activation-3.90e+00

Pos logit contributions
ancer+0.108
ait+0.086
as+0.075
 marsh+0.074
 hotels+0.071
Neg logit contributions
 itself-0.294
iped-0.260
ez-0.257
nob-0.256
izer-0.255
Token in
Token ID287
Feature activation-8.22e-01

Pos logit contributions
stick+0.070
 itself+0.070
iped+0.066
ez+0.064
ze+0.064
Neg logit contributions
 upon-0.069
ancer-0.063
ulu-0.055
 there-0.055
 they-0.054
Token the
Token ID262
Feature activation+4.70e-01

Pos logit contributions
 upon+0.019
ancer+0.018
ver+0.015
ulu+0.015
 they+0.015
Neg logit contributions
 itself-0.020
stick-0.019
iped-0.018
ez-0.018
ze-0.018
Token park
Token ID3952
Feature activation-4.48e-01

Pos logit contributions
 itself+0.013
stick+0.013
iped+0.013
ez+0.013
rums+0.012
Neg logit contributions
 upon-0.009
 there-0.007
ancer-0.007
 they-0.006
 thirsty-0.006
Token.
Token ID13
Feature activation+2.27e+00

Pos logit contributions
 upon+0.011
ancer+0.010
ulu+0.009
ver+0.009
 there+0.009
Neg logit contributions
stick-0.010
 itself-0.010
iped-0.009
ze-0.009
age-0.009
Token<|endoftext|>
Token ID50256
Feature activation-2.94e+00

Pos logit contributions
 itself+0.058
stick+0.057
iped+0.055
ez+0.054
ze+0.053
Neg logit contributions
 upon-0.048
ancer-0.043
 there-0.038
ulu-0.037
 they-0.037
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.057
ancer+0.050
 there+0.044
 they+0.043
ulu+0.042
Neg logit contributions
stick-0.081
 itself-0.080
iped-0.078
ez-0.075
ze-0.075
Token upon
Token ID2402
Feature activation-1.84e+00

Pos logit contributions
ancer+0.079
 marsh+0.059
Fre+0.049
 queens+0.040
 hotels+0.040
Neg logit contributions
 itself-0.412
 mistakes-0.384
 its-0.368
itched-0.368
esh-0.367
Token a
Token ID257
Feature activation-2.14e+00

Pos logit contributions
 upon+0.039
ancer+0.033
 there+0.031
 they+0.030
ulu+0.029
Neg logit contributions
stick-0.048
 itself-0.047
iped-0.047
ez-0.045
izer-0.044
Token time
Token ID640
Feature activation-5.66e+00

Pos logit contributions
 upon+0.036
ancer+0.036
 they+0.028
 there+0.028
ulu+0.028
Neg logit contributions
 itself-0.059
iped-0.058
ez-0.058
ze-0.058
izer-0.057
Token,
Token ID11
Feature activation-8.87e+00

Pos logit contributions
ancer+0.149
ait+0.126
as+0.124
 marsh+0.121
ulu+0.121
Neg logit contributions
 itself-0.100
esh-0.091
stick-0.081
nob-0.081
ez-0.080
Token there
Token ID612
Feature activation-3.87e+00

Pos logit contributions
ancer+0.112
ait+0.088
 marsh+0.080
 upon+0.078
 hotels+0.076
Neg logit contributions
 itself-0.284
iped-0.255
izer-0.252
nob-0.251
ze-0.248
Token in
Token ID287
Feature activation-1.30e+00

Pos logit contributions
 itself+0.047
stick+0.046
iped+0.043
ze+0.043
ez+0.042
Neg logit contributions
 upon-0.044
ancer-0.041
ulu-0.037
 there-0.034
 they-0.034
Token the
Token ID262
Feature activation+3.35e-01

Pos logit contributions
 upon+0.032
ancer+0.030
ver+0.026
ulu+0.026
 they+0.025
Neg logit contributions
 itself-0.030
stick-0.029
iped-0.027
ez-0.027
ze-0.027
Token future
Token ID2003
Feature activation-1.38e+00

Pos logit contributions
 itself+0.008
stick+0.008
iped+0.008
ez+0.008
eness+0.007
Neg logit contributions
 upon-0.008
 there-0.007
ancer-0.006
 tired-0.006
 don-0.006
Token.
Token ID13
Feature activation+2.36e+00

Pos logit contributions
 upon+0.043
ancer+0.041
ulu+0.040
 thirsty+0.036
 marsh+0.035
Neg logit contributions
 itself-0.022
stick-0.021
 mistakes-0.018
age-0.018
iped-0.018
Token<|endoftext|>
Token ID50256
Feature activation-2.95e+00

Pos logit contributions
stick+0.073
iped+0.071
ze+0.070
eness+0.069
 itself+0.069
Neg logit contributions
 upon-0.042
ancer-0.033
 they-0.032
 there-0.031
ulu-0.027
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.060
ancer+0.055
 there+0.047
ulu+0.046
 they+0.046
Neg logit contributions
 itself-0.076
stick-0.076
iped-0.073
ez-0.071
ze-0.070
Token upon
Token ID2402
Feature activation-1.89e+00

Pos logit contributions
ancer+0.087
 marsh+0.069
Fre+0.062
 queens+0.052
 hotels+0.051
Neg logit contributions
 itself-0.403
 mistakes-0.368
 His-0.367
 its-0.359
itched-0.358
Token a
Token ID257
Feature activation-2.11e+00

Pos logit contributions
 upon+0.039
ancer+0.033
 there+0.032
 they+0.030
ulu+0.029
Neg logit contributions
stick-0.051
iped-0.049
 itself-0.049
ez-0.047
eness-0.046
Token time
Token ID640
Feature activation-5.60e+00

Pos logit contributions
ancer+0.036
 upon+0.036
ulu+0.028
 they+0.028
 there+0.028
Neg logit contributions
 itself-0.057
iped-0.056
ez-0.056
ze-0.056
izer-0.056
Token,
Token ID11
Feature activation-8.91e+00

Pos logit contributions
ancer+0.153
ait+0.133
as+0.128
 upon+0.125
 marsh+0.124
Neg logit contributions
 itself-0.094
esh-0.083
itched-0.073
iped-0.073
nob-0.073
Token there
Token ID612
Feature activation-3.74e+00

Pos logit contributions
ancer+0.124
ait+0.105
 hotels+0.092
 marsh+0.089
 restaurants+0.086
Neg logit contributions
 itself-0.277
iped-0.243
izer-0.242
 its-0.239
ez-0.235
Token his
Token ID465
Feature activation+9.86e-01

Pos logit contributions
 upon+0.001
ancer+0.001
 there+0.001
ulu+0.001
 they+0.001
Neg logit contributions
 itself-0.001
stick-0.001
iped-0.001
ez-0.001
ze-0.001
Token helpful
Token ID7613
Feature activation+3.08e+00

Pos logit contributions
stick+0.027
 itself+0.026
iped+0.026
ez+0.025
age+0.024
Neg logit contributions
 upon-0.021
ancer-0.017
 there-0.017
 they-0.016
 don-0.016
Token friend
Token ID1545
Feature activation-7.89e-01

Pos logit contributions
iped+0.075
stick+0.074
 itself+0.073
ez+0.072
izer+0.070
Neg logit contributions
 upon-0.071
ancer-0.059
 there-0.055
 don-0.053
 they-0.053
Token.
Token ID13
Feature activation+2.01e+00

Pos logit contributions
 upon+0.019
ancer+0.017
 there+0.016
 they+0.015
ulu+0.015
Neg logit contributions
 itself-0.018
stick-0.018
iped-0.018
ez-0.017
izer-0.017
Token<|endoftext|>
Token ID50256
Feature activation-2.88e+00

Pos logit contributions
stick+0.050
 itself+0.049
iped+0.048
ze+0.047
ez+0.047
Neg logit contributions
 upon-0.045
ancer-0.039
 they-0.036
 there-0.036
ulu-0.034
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.059
ancer+0.053
 there+0.045
ulu+0.045
 they+0.045
Neg logit contributions
 itself-0.075
stick-0.074
iped-0.071
ez-0.070
ze-0.068
Token upon
Token ID2402
Feature activation-1.91e+00

Pos logit contributions
ancer+0.095
 marsh+0.079
Fre+0.065
 queens+0.063
 hotels+0.057
Neg logit contributions
 itself-0.395
 mistakes-0.361
 His-0.360
 From-0.358
 its-0.356
Token a
Token ID257
Feature activation-2.11e+00

Pos logit contributions
 upon+0.040
ancer+0.034
 there+0.032
 they+0.031
ulu+0.029
Neg logit contributions
stick-0.051
 itself-0.049
iped-0.049
ez-0.047
izer-0.046
Token time
Token ID640
Feature activation-5.55e+00

Pos logit contributions
 upon+0.036
ancer+0.036
 they+0.028
ulu+0.028
 there+0.028
Neg logit contributions
 itself-0.058
ez-0.057
iped-0.057
ze-0.057
izer-0.056
Token,
Token ID11
Feature activation-8.75e+00

Pos logit contributions
ancer+0.155
ait+0.134
 usual+0.128
as+0.127
 marsh+0.126
Neg logit contributions
 itself-0.092
esh-0.082
ez-0.073
itched-0.072
iped-0.071
Token there
Token ID612
Feature activation-3.70e+00

Pos logit contributions
ancer+0.119
ait+0.097
 hotels+0.082
 marsh+0.080
 restaurants+0.079
Neg logit contributions
 itself-0.284
iped-0.249
nob-0.244
ez-0.243
 its-0.243
Token with
Token ID351
Feature activation-3.27e+00

Pos logit contributions
stick+0.004
 itself+0.004
iped+0.004
ze+0.004
age+0.004
Neg logit contributions
 upon-0.006
ancer-0.006
ulu-0.005
ver-0.005
 there-0.005
Token her
Token ID607
Feature activation+1.27e+00

Pos logit contributions
 upon+0.083
ancer+0.076
 there+0.067
ulu+0.067
 they+0.066
Neg logit contributions
 itself-0.071
stick-0.070
iped-0.066
ez-0.064
ze-0.064
Token mom
Token ID1995
Feature activation+2.54e-01

Pos logit contributions
 itself+0.029
stick+0.029
 mistakes+0.027
ze+0.027
iped+0.026
Neg logit contributions
 upon-0.029
ancer-0.028
ulu-0.025
ver-0.024
 there-0.023
Token.
Token ID13
Feature activation+2.24e+00

Pos logit contributions
stick+0.005
 itself+0.005
ez+0.004
age+0.004
ze+0.004
Neg logit contributions
 upon-0.007
ancer-0.007
ulu-0.006
ver-0.006
 they-0.006
Token<|endoftext|>
Token ID50256
Feature activation-2.95e+00

Pos logit contributions
iped+0.078
stick+0.075
ze+0.073
eness+0.073
ez+0.071
Neg logit contributions
 upon-0.033
 they-0.027
ancer-0.023
 there-0.022
ulu-0.020
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.055
ancer+0.047
 there+0.043
 they+0.041
ulu+0.040
Neg logit contributions
stick-0.084
 itself-0.082
iped-0.081
ze-0.078
ez-0.078
Token upon
Token ID2402
Feature activation-1.97e+00

Pos logit contributions
ancer+0.078
 marsh+0.052
Fre+0.039
 hotels+0.032
 queens+0.031
Neg logit contributions
 itself-0.424
 mistakes-0.398
esh-0.380
ez-0.377
an-0.377
Token a
Token ID257
Feature activation-2.33e+00

Pos logit contributions
 upon+0.041
ancer+0.034
 there+0.033
 they+0.031
 don+0.029
Neg logit contributions
stick-0.054
iped-0.052
 itself-0.052
ez-0.049
eness-0.049
Token time
Token ID640
Feature activation-5.67e+00

Pos logit contributions
 upon+0.038
ancer+0.038
ulu+0.029
 they+0.029
 there+0.029
Neg logit contributions
 itself-0.066
iped-0.064
ez-0.064
ze-0.064
izer-0.063
Token,
Token ID11
Feature activation-8.99e+00

Pos logit contributions
ancer+0.148
ait+0.126
as+0.123
ulu+0.121
 upon+0.121
Neg logit contributions
 itself-0.103
esh-0.091
ez-0.082
iped-0.081
nob-0.081
Token there
Token ID612
Feature activation-3.83e+00

Pos logit contributions
ancer+0.111
ait+0.090
 marsh+0.078
 hotels+0.074
as+0.072
Neg logit contributions
 itself-0.294
iped-0.263
izer-0.257
nob-0.256
ez-0.256
Token responsible
Token ID4497
Feature activation+3.66e+00

Pos logit contributions
stick+0.017
 itself+0.017
iped+0.016
ze+0.016
ez+0.016
Neg logit contributions
 upon-0.020
ancer-0.019
ulu-0.016
 they-0.015
ver-0.015
Token with
Token ID351
Feature activation-3.34e+00

Pos logit contributions
 itself+0.076
stick+0.074
iped+0.068
ze+0.066
 mistakes+0.066
Neg logit contributions
 upon-0.095
ancer-0.090
ulu-0.083
ver-0.077
 thirsty-0.076
Token money
Token ID1637
Feature activation-2.71e+00

Pos logit contributions
 upon+0.084
ancer+0.081
ver+0.070
ulu+0.070
 they+0.067
Neg logit contributions
 itself-0.075
stick-0.070
iped-0.066
bons-0.065
 mistakes-0.065
Token.
Token ID13
Feature activation+2.12e+00

Pos logit contributions
 upon+0.076
ancer+0.072
ulu+0.070
 thirsty+0.062
ver+0.061
Neg logit contributions
 itself-0.050
stick-0.046
er-0.042
eness-0.042
bons-0.042
Token<|endoftext|>
Token ID50256
Feature activation-2.84e+00

Pos logit contributions
stick+0.062
iped+0.061
 itself+0.060
ze+0.059
izer+0.058
Neg logit contributions
 upon-0.040
ancer-0.032
 there-0.030
 they-0.029
ulu-0.027
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.057
ancer+0.051
 there+0.044
 they+0.044
ulu+0.043
Neg logit contributions
 itself-0.075
stick-0.075
iped-0.072
ez-0.070
ze-0.069
Token upon
Token ID2402
Feature activation-2.04e+00

Pos logit contributions
ancer+0.085
 marsh+0.075
Fre+0.062
 queens+0.052
 cam+0.052
Neg logit contributions
 itself-0.397
 mistakes-0.374
 From-0.359
 its-0.357
 His-0.355
Token a
Token ID257
Feature activation-2.27e+00

Pos logit contributions
 upon+0.044
ancer+0.039
 there+0.035
 they+0.034
ulu+0.033
Neg logit contributions
stick-0.052
 itself-0.051
iped-0.050
ez-0.048
ze-0.047
Token time
Token ID640
Feature activation-5.75e+00

Pos logit contributions
ancer+0.040
 upon+0.040
 they+0.031
ulu+0.031
 there+0.031
Neg logit contributions
ez-0.060
 itself-0.059
iped-0.059
ze-0.059
izer-0.059
Token,
Token ID11
Feature activation-8.97e+00

Pos logit contributions
ancer+0.158
ait+0.136
as+0.134
ulu+0.130
 marsh+0.129
Neg logit contributions
 itself-0.096
esh-0.083
nob-0.075
 Its-0.075
ez-0.074
Token there
Token ID612
Feature activation-3.80e+00

Pos logit contributions
ancer+0.120
ait+0.100
 marsh+0.084
as+0.083
 hotels+0.081
Neg logit contributions
 itself-0.282
nob-0.244
 its-0.244
izer-0.242
iped-0.241
Token it
Token ID340
Feature activation-1.29e+00

Pos logit contributions
 upon+0.090
ancer+0.079
ulu+0.066
ver+0.061
 don+0.061
Neg logit contributions
 itself-0.156
stick-0.154
age-0.142
er-0.141
iped-0.140
Token gave
Token ID2921
Feature activation-4.85e-01

Pos logit contributions
 upon+0.030
ancer+0.028
ulu+0.024
ver+0.023
 however+0.022
Neg logit contributions
 itself-0.030
stick-0.030
iped-0.028
nob-0.028
ze-0.028
Token her
Token ID607
Feature activation+2.22e+00

Pos logit contributions
 upon+0.012
ancer+0.011
 there+0.010
 they+0.010
ulu+0.010
Neg logit contributions
stick-0.010
 itself-0.010
iped-0.010
ez-0.010
izer-0.010
Token.
Token ID13
Feature activation+2.46e+00

Pos logit contributions
 itself+0.050
stick+0.050
iped+0.047
ez+0.046
ze+0.045
Neg logit contributions
 upon-0.053
ancer-0.049
ulu-0.043
 there-0.043
 they-0.043
Token<|endoftext|>
Token ID50256
Feature activation-2.93e+00

Pos logit contributions
stick+0.062
iped+0.061
 itself+0.060
ze+0.059
ez+0.058
Neg logit contributions
 upon-0.054
ancer-0.046
 they-0.044
 there-0.043
ulu-0.041
TokenOnce
Token ID7454
Feature activation-1.20e+01

Pos logit contributions
 upon+0.060
ancer+0.055
ulu+0.047
 they+0.047
 there+0.047
Neg logit contributions
 itself-0.075
stick-0.074
iped-0.072
ez-0.070
nob-0.069
Token upon
Token ID2402
Feature activation-1.82e+00

Pos logit contributions
ancer+0.095
 marsh+0.088
Fre+0.069
First+0.064
 pigs+0.063
Neg logit contributions
 itself-0.389
 mistakes-0.358
 its-0.348
 From-0.345
 His-0.342
Token a
Token ID257
Feature activation-2.22e+00

Pos logit contributions
 upon+0.039
ancer+0.033
 there+0.031
 they+0.030
ulu+0.029
Neg logit contributions
stick-0.048
 itself-0.046
iped-0.046
ez-0.044
ze-0.043
Token time
Token ID640
Feature activation-5.60e+00

Pos logit contributions
ancer+0.038
 upon+0.038
 they+0.030
ulu+0.030
 there+0.029
Neg logit contributions
 itself-0.060
ze-0.059
iped-0.059
ez-0.059
izer-0.058
Token,
Token ID11
Feature activation-8.78e+00

Pos logit contributions
ancer+0.151
ait+0.130
as+0.127
 upon+0.125
ulu+0.123
Neg logit contributions
 itself-0.097
esh-0.088
ez-0.077
nob-0.077
stick-0.076
Token there
Token ID612
Feature activation-3.81e+00

Pos logit contributions
ancer+0.114
ait+0.097
 marsh+0.081
 hotels+0.079
as+0.076
Neg logit contributions
 itself-0.280
izer-0.244
iped-0.243
nob-0.242
ze-0.240
Token mom
Token ID1995
Feature activation-1.42e+00

Pos logit contributions
 itself+0.003
stick+0.003
ze+0.003
iped+0.003
 mistakes+0.003
Neg logit contributions
 upon-0.004
ancer-0.003
ver-0.003
ulu-0.003
 there-0.003
Token and
Token ID290
Feature activation-2.18e+00

Pos logit contributions
 upon+0.040
ancer+0.039
ver+0.034
ulu+0.033
 usual+0.033
Neg logit contributions
stick-0.025
 itself-0.024
ez-0.023
iped-0.023
age-0.022
Token dad
Token ID9955
Feature activation-5.57e-01

Pos logit contributions
ancer+0.046
 upon+0.043
ver+0.042
ulu+0.036
 marsh+0.033
Neg logit contributions
 itself-0.054
stick-0.050
iped-0.048
 fur-0.047
ze-0.046
Token ran
Token ID4966
Feature activation+2.34e+00

Pos logit contributions
 upon+0.011
ancer+0.011
ulu+0.009
ver+0.008
 however+0.008
Neg logit contributions
stick-0.014
 itself-0.014
ez-0.013
ze-0.013
iped-0.013
Token to
Token ID284
Feature activation+2.78e+00

Pos logit contributions
stick+0.053
 itself+0.052
iped+0.050
ez+0.048
ze+0.048
Neg logit contributions
 upon-0.056
ancer-0.053
ulu-0.046
 there-0.045
ver-0.045
Token Tom
Token ID4186
Feature activation+4.51e+00

Pos logit contributions
 itself+0.075
stick+0.075
iped+0.073
ez+0.070
ze+0.070
Neg logit contributions
 upon-0.055
ancer-0.049
 there-0.042
 they-0.041
ulu-0.041
Token and
Token ID290
Feature activation+5.71e-01

Pos logit contributions
 itself+0.106
stick+0.105
iped+0.103
ez+0.098
ze+0.097
Neg logit contributions
 upon-0.104
ancer-0.092
 there-0.086
 they-0.084
ver-0.080
Token Jen
Token ID13364
Feature activation+2.78e+00

Pos logit contributions
stick+0.013
 itself+0.012
iped+0.012
ez+0.011
nob+0.011
Neg logit contributions
 upon-0.014
ancer-0.014
ver-0.011
ulu-0.011
as-0.010
Token and
Token ID290
Feature activation-3.89e-01

Pos logit contributions
stick+0.055
 itself+0.054
iped+0.052
ez+0.050
ze+0.049
Neg logit contributions
 upon-0.075
ancer-0.069
ulu-0.062
 there-0.062
 they-0.061
Token cried
Token ID16896
Feature activation-8.44e-01

Pos logit contributions
 upon+0.009
ancer+0.009
ver+0.007
ulu+0.007
as+0.007
Neg logit contributions
stick-0.009
 itself-0.009
iped-0.009
ez-0.008
age-0.008
Token.
Token ID13
Feature activation+3.67e+00

Pos logit contributions
 upon+0.021
ancer+0.018
 they+0.017
 there+0.017
 don+0.016
Neg logit contributions
stick-0.020
iped-0.020
 itself-0.018
izer-0.018
ez-0.018
Token It
Token ID632
Feature activation+1.32e+00

Pos logit contributions
 itself+0.077
stick+0.076
iped+0.073
ez+0.070
ze+0.068
Neg logit contributions
 upon-0.086
ancer-0.079
 there-0.069
ulu-0.069
ver-0.068
Token made
Token ID925
Feature activation+1.03e-01

Pos logit contributions
stick+0.029
 itself+0.028
ze+0.028
iped+0.027
ez+0.027
Neg logit contributions
 upon-0.034
ancer-0.033
ulu-0.027
ver-0.027
ait-0.027
Token a
Token ID257
Feature activation+1.94e+00

Pos logit contributions
stick+0.003
 itself+0.003
iped+0.003
ez+0.002
nob+0.002
Neg logit contributions
 upon-0.002
ancer-0.002
 there-0.002
 they-0.002
ulu-0.002
Token loud
Token ID7812
Feature activation+5.05e+00

Pos logit contributions
stick+0.052
 itself+0.051
iped+0.051
ez+0.049
nob+0.049
Neg logit contributions
 upon-0.040
ancer-0.032
 they-0.031
 there-0.030
 don-0.029
Token sound
Token ID2128
Feature activation+7.36e-01

Pos logit contributions
iped+0.105
bons+0.098
stick+0.098
elled+0.095
nob+0.094
Neg logit contributions
 upon-0.121
 they-0.115
 don-0.112
 while-0.108
 there-0.105
Token.
Token ID13
Feature activation+3.47e+00

Pos logit contributions
 itself+0.017
stick+0.017
iped+0.016
ze+0.016
ez+0.016
Neg logit contributions
 upon-0.017
ancer-0.016
ulu-0.014
 there-0.014
 they-0.013
Token "
Token ID366
Feature activation+2.39e+00

Pos logit contributions
 itself+0.078
stick+0.077
iped+0.074
ez+0.071
ze+0.069
Neg logit contributions
 upon-0.084
ancer-0.077
ulu-0.068
 there-0.067
ver-0.066
TokenHon
Token ID29478
Feature activation+1.47e+00

Pos logit contributions
stick+0.051
 itself+0.051
iped+0.049
ez+0.046
ze+0.046
Neg logit contributions
 upon-0.060
ancer-0.055
 there-0.050
 they-0.049
ulu-0.049
Tokenk
Token ID74
Feature activation-2.66e-01

Pos logit contributions
 itself+0.028
stick+0.027
 mistakes+0.026
nob+0.026
esh+0.025
Neg logit contributions
 upon-0.036
ancer-0.035
 don-0.033
as-0.033
ver-0.031
Token!
Token ID0
Feature activation+2.92e+00

Pos logit contributions
 upon+0.006
ancer+0.006
ulu+0.005
 there+0.005
 they+0.005
Neg logit contributions
 itself-0.006
stick-0.006
ze-0.006
iped-0.006
ez-0.006
Token Pop
Token ID8099
Feature activation+4.25e-01

Pos logit contributions
 itself+0.056
stick+0.056
iped+0.053
ez+0.051
ze+0.051
Neg logit contributions
 upon-0.081
ancer-0.073
 there-0.067
ulu-0.067
 they-0.066
Token and
Token ID290
Feature activation+6.45e-01

Pos logit contributions
 upon+0.040
ancer+0.039
ulu+0.035
 don+0.032
 marsh+0.032
Neg logit contributions
 itself-0.031
stick-0.031
iped-0.029
nob-0.027
er-0.026
Token reached
Token ID4251
Feature activation-1.99e+00

Pos logit contributions
iped+0.013
stick+0.013
nob+0.013
 itself+0.013
ze+0.012
Neg logit contributions
ancer-0.017
 upon-0.017
ulu-0.014
ver-0.013
as-0.013
Token for
Token ID329
Feature activation-4.68e-01

Pos logit contributions
ancer+0.051
 upon+0.047
ulu+0.044
ver+0.043
 however+0.042
Neg logit contributions
 itself-0.041
stick-0.034
nob-0.034
iped-0.033
 mistakes-0.032
Token the
Token ID262
Feature activation+1.03e+00

Pos logit contributions
 upon+0.012
ancer+0.011
ver+0.010
ulu+0.009
as+0.009
Neg logit contributions
 itself-0.011
stick-0.009
 fur-0.009
bons-0.009
ze-0.009
Token top
Token ID1353
Feature activation-3.77e+00

Pos logit contributions
stick+0.020
ze+0.020
 itself+0.020
nob+0.020
 mistakes+0.019
Neg logit contributions
ancer-0.024
 upon-0.024
ver-0.023
 there-0.020
ulu-0.020
Token.
Token ID13
Feature activation+3.78e+00

Pos logit contributions
 upon+0.109
ancer+0.108
ver+0.093
 don+0.092
ulu+0.090
Neg logit contributions
 itself-0.064
stick-0.063
er-0.060
nob-0.058
age-0.054
Token They
Token ID1119
Feature activation-2.59e-01

Pos logit contributions
 itself+0.078
iped+0.074
stick+0.072
ze+0.066
izer+0.065
Neg logit contributions
 upon-0.096
ancer-0.093
ulu-0.080
ver-0.079
 there-0.078
Token found
Token ID1043
Feature activation+2.86e-01

Pos logit contributions
ancer+0.007
 upon+0.006
ulu+0.005
 weekend+0.005
 there+0.004
Neg logit contributions
stick-0.006
iped-0.006
 itself-0.005
ze-0.005
nob-0.005
Token a
Token ID257
Feature activation+5.95e-01

Pos logit contributions
 itself+0.007
stick+0.006
iped+0.006
ze+0.006
ez+0.006
Neg logit contributions
 upon-0.007
ancer-0.007
ulu-0.006
ver-0.005
 don-0.005
Token big
Token ID1263
Feature activation+1.95e+00

Pos logit contributions
stick+0.013
ze+0.013
 itself+0.012
izer+0.012
age+0.012
Neg logit contributions
ancer-0.013
 upon-0.013
ver-0.012
ulu-0.011
 they-0.010
Token bag
Token ID6131
Feature activation-3.53e-01

Pos logit contributions
stick+0.044
 itself+0.043
iped+0.042
ze+0.040
ez+0.040
Neg logit contributions
 upon-0.046
ancer-0.044
ver-0.038
ulu-0.038
 they-0.037
Token Sam
Token ID3409
Feature activation+2.89e+00

Pos logit contributions
 upon+0.019
ancer+0.017
 there+0.015
ulu+0.015
ver+0.015
Neg logit contributions
stick-0.017
 itself-0.017
iped-0.017
ez-0.016
ze-0.015
Token cough
Token ID22094
Feature activation-4.09e-01

Pos logit contributions
stick+0.074
 itself+0.073
ze+0.071
iped+0.071
ez+0.070
Neg logit contributions
 upon-0.060
ancer-0.056
ulu-0.048
 there-0.046
ver-0.045
Token and
Token ID290
Feature activation+5.74e-01

Pos logit contributions
 upon+0.010
ancer+0.009
ulu+0.008
 there+0.008
ver+0.007
Neg logit contributions
 itself-0.010
stick-0.009
iped-0.009
ze-0.009
ez-0.009
Token run
Token ID1057
Feature activation+1.80e+00

Pos logit contributions
 itself+0.012
stick+0.012
iped+0.011
ez+0.011
ze+0.011
Neg logit contributions
 upon-0.015
ancer-0.014
ulu-0.012
 there-0.012
ver-0.012
Token away
Token ID1497
Feature activation-4.59e-01

Pos logit contributions
 itself+0.035
iped+0.034
stick+0.034
bons+0.032
izer+0.032
Neg logit contributions
 upon-0.049
 there-0.043
ancer-0.042
 they-0.042
 don-0.040
Token.
Token ID13
Feature activation+4.05e+00

Pos logit contributions
 upon+0.013
ancer+0.012
ulu+0.011
 there+0.011
ver+0.011
Neg logit contributions
 itself-0.009
stick-0.008
iped-0.008
ze-0.007
ez-0.007
Token
Token ID198
Feature activation-2.28e+00

Pos logit contributions
 itself+0.091
stick+0.090
iped+0.087
ez+0.083
ze+0.082
Neg logit contributions
 upon-0.099
ancer-0.091
 there-0.080
ulu-0.080
ver-0.078
Token
Token ID198
Feature activation-2.22e+00

Pos logit contributions
 upon+0.045
ancer+0.039
 there+0.035
 they+0.034
ulu+0.034
Neg logit contributions
stick-0.062
 itself-0.062
iped-0.060
ez-0.058
ze-0.057
Token"
Token ID1
Feature activation-3.29e-01

Pos logit contributions
 upon+0.051
ancer+0.047
 there+0.041
ulu+0.040
 they+0.040
Neg logit contributions
 itself-0.053
stick-0.052
iped-0.050
ez-0.049
ze-0.048
TokenHey
Token ID10814
Feature activation-1.51e+00

Pos logit contributions
 upon+0.008
ancer+0.007
 there+0.007
 they+0.007
ulu+0.007
Neg logit contributions
 itself-0.007
stick-0.007
iped-0.007
ez-0.007
ze-0.007
Token,
Token ID11
Feature activation-7.07e+00

Pos logit contributions
ancer+0.033
 upon+0.033
ulu+0.028
ver+0.026
as+0.026
Neg logit contributions
 itself-0.035
ze-0.033
iped-0.033
stick-0.033
ASH-0.032
Token another
Token ID1194
Feature activation-1.23e+00

Pos logit contributions
 itself+0.027
stick+0.027
iped+0.026
ez+0.025
ze+0.025
Neg logit contributions
 upon-0.026
ancer-0.023
ulu-0.020
 there-0.020
 they-0.020
Token jar
Token ID17379
Feature activation-1.74e+00

Pos logit contributions
 upon+0.032
ancer+0.031
ulu+0.028
ver+0.028
as+0.027
Neg logit contributions
stick-0.023
 itself-0.022
age-0.022
iped-0.021
 mistakes-0.021
Token.
Token ID13
Feature activation+2.59e+00

Pos logit contributions
ancer+0.047
 upon+0.046
ulu+0.040
ver+0.039
 don+0.038
Neg logit contributions
 itself-0.033
stick-0.033
age-0.030
nob-0.029
er-0.028
Token We
Token ID775
Feature activation-4.09e-01

Pos logit contributions
 itself+0.047
stick+0.046
iped+0.043
ez+0.041
bons+0.040
Neg logit contributions
 upon-0.074
ancer-0.067
ulu-0.062
 there-0.061
 they-0.060
Tokenâ
Token ID22940
Feature activation+3.47e+00

Pos logit contributions
ancer+0.008
 upon+0.008
 however+0.006
ulu+0.006
 they+0.006
Neg logit contributions
stick-0.011
 itself-0.010
age-0.010
ez-0.010
ze-0.010
Token€
Token ID26391
Feature activation+4.02e+00

Pos logit contributions
 itself+0.067
stick+0.067
 mistakes+0.062
iped+0.062
ze+0.062
Neg logit contributions
 upon-0.087
ancer-0.084
 don-0.077
ver-0.076
 there-0.075
Tokenâ„¢
Token ID8151
Feature activation-4.40e+00

Pos logit contributions
stick+0.083
age+0.079
ze+0.078
 itself+0.077
iped+0.075
Neg logit contributions
 upon-0.101
ancer-0.097
ulu-0.092
 though-0.081
 usual-0.080
Tokenre
Token ID260
Feature activation+9.91e-02

Pos logit contributions
ancer+0.070
 upon+0.067
 thirsty+0.058
 hotels+0.058
 don+0.056
Neg logit contributions
stick-0.118
age-0.115
 itself-0.114
iped-0.112
ez-0.109
Token not
Token ID407
Feature activation-9.53e-01

Pos logit contributions
 itself+0.002
stick+0.002
iped+0.002
ze+0.002
age+0.002
Neg logit contributions
ancer-0.002
 upon-0.002
ulu-0.002
 however-0.002
ver-0.002
Token lucky
Token ID9670
Feature activation+7.90e-01

Pos logit contributions
ancer+0.022
 upon+0.021
ulu+0.019
ver+0.018
 marsh+0.018
Neg logit contributions
 itself-0.021
stick-0.020
ez-0.019
age-0.019
ze-0.019
Token enough
Token ID1576
Feature activation-4.59e+00

Pos logit contributions
 itself+0.015
stick+0.015
 mistakes+0.014
iped+0.013
ze+0.013
Neg logit contributions
 upon-0.020
ancer-0.020
ulu-0.018
ver-0.018
 thirsty-0.017
Token mum
Token ID25682
Feature activation-1.60e+00

Pos logit contributions
 itself+0.038
stick+0.037
iped+0.037
ze+0.036
ez+0.035
Neg logit contributions
 upon-0.027
ancer-0.025
ver-0.021
 there-0.021
 they-0.021
Token said
Token ID531
Feature activation+2.74e-01

Pos logit contributions
 upon+0.039
ancer+0.037
ulu+0.032
 there+0.031
 don+0.030
Neg logit contributions
stick-0.035
 itself-0.034
iped-0.033
ez-0.032
ze-0.031
Token,
Token ID11
Feature activation-5.34e+00

Pos logit contributions
 itself+0.008
stick+0.007
ze+0.007
iped+0.007
ez+0.007
Neg logit contributions
 upon-0.005
ancer-0.005
ulu-0.004
as-0.004
 there-0.004
Token "
Token ID366
Feature activation+1.51e+00

Pos logit contributions
 upon+0.120
ancer+0.109
ulu+0.096
 there+0.095
ver+0.092
Neg logit contributions
 itself-0.129
stick-0.128
iped-0.121
ze-0.118
ez-0.117
TokenThose
Token ID9627
Feature activation-3.01e+00

Pos logit contributions
stick+0.034
 itself+0.034
iped+0.033
ze+0.032
ez+0.032
Neg logit contributions
 upon-0.036
ancer-0.033
 there-0.029
 they-0.029
ulu-0.029
Token are
Token ID389
Feature activation-8.52e-01

Pos logit contributions
 upon+0.060
ancer+0.056
ulu+0.053
ver+0.052
ait+0.042
Neg logit contributions
bons-0.073
stick-0.073
 itself-0.072
 mistakes-0.070
iped-0.069
Token a
Token ID257
Feature activation+1.17e+00

Pos logit contributions
 upon+0.021
ancer+0.021
ver+0.018
ulu+0.017
 however+0.016
Neg logit contributions
 itself-0.019
stick-0.017
bons-0.017
iped-0.016
er-0.015
Token great
Token ID1049
Feature activation+1.48e+00

Pos logit contributions
stick+0.028
 itself+0.028
iped+0.027
ze+0.026
ez+0.026
Neg logit contributions
 upon-0.026
ancer-0.024
ulu-0.020
ver-0.020
 there-0.020
Token choice
Token ID3572
Feature activation-2.13e+00

Pos logit contributions
stick+0.032
 itself+0.032
iped+0.031
ez+0.029
ze+0.029
Neg logit contributions
 upon-0.037
ancer-0.034
 there-0.030
ulu-0.030
 they-0.030
Token Mill
Token ID9212
Feature activation-1.78e+00

Pos logit contributions
 upon+0.052
ancer+0.048
ulu+0.043
 thirsty+0.041
ver+0.039
Neg logit contributions
 itself-0.049
stick-0.046
er-0.043
eness-0.041
age-0.041
Tokenie
Token ID494
Feature activation-2.74e-01

Pos logit contributions
 upon+0.049
ancer+0.047
ver+0.042
 don+0.041
 they+0.041
Neg logit contributions
 itself-0.033
stick-0.031
iped-0.029
esh-0.028
ez-0.028
Token store
Token ID3650
Feature activation-7.28e-01

Pos logit contributions
 itself+0.045
stick+0.045
iped+0.043
ze+0.042
izer+0.041
Neg logit contributions
 upon-0.039
ancer-0.037
ver-0.032
ulu-0.031
 don-0.030
Token and
Token ID290
Feature activation+3.85e-01

Pos logit contributions
 upon+0.018
ancer+0.017
ulu+0.015
 don+0.015
ver+0.015
Neg logit contributions
 itself-0.015
stick-0.015
iped-0.015
nob-0.013
ze-0.013
Token bought
Token ID5839
Feature activation+4.97e-01

Pos logit contributions
stick+0.006
iped+0.006
faced+0.006
person+0.006
nob+0.005
Neg logit contributions
ancer-0.010
ulu-0.009
 upon-0.009
as-0.009
 steep-0.008
Token some
Token ID617
Feature activation-2.96e+00

Pos logit contributions
 itself+0.012
iped+0.011
stick+0.011
bons+0.010
esh+0.010
Neg logit contributions
 upon-0.012
ancer-0.011
ulu-0.010
ver-0.009
as-0.009
Token fun
Token ID1257
Feature activation-1.46e+00

Pos logit contributions
 upon+0.072
ancer+0.070
ver+0.065
ulu+0.062
 don+0.059
Neg logit contributions
 itself-0.061
bons-0.058
 fur-0.055
iped-0.055
 mistakes-0.055
Token beach
Token ID10481
Feature activation-1.58e+00

Pos logit contributions
 upon+0.036
ancer+0.034
ver+0.031
ulu+0.030
 don+0.029
Neg logit contributions
 itself-0.031
stick-0.030
iped-0.029
bons-0.028
esh-0.027
Token toys
Token ID14958
Feature activation-2.14e+00

Pos logit contributions
 upon+0.042
ancer+0.039
ulu+0.035
 there+0.034
ver+0.034
Neg logit contributions
 itself-0.031
stick-0.031
iped-0.030
ze-0.028
ez-0.028
Token.
Token ID13
Feature activation+2.60e+00

Pos logit contributions
 upon+0.065
ancer+0.063
ulu+0.057
ver+0.056
 don+0.055
Neg logit contributions
 itself-0.033
stick-0.032
er-0.027
bons-0.027
iped-0.027
Token Lib
Token ID7980
Feature activation-3.91e-01

Pos logit contributions
 itself+0.061
iped+0.060
stick+0.060
bons+0.054
age+0.054
Neg logit contributions
 upon-0.061
ancer-0.057
ulu-0.052
 there-0.048
as-0.047
Tokenby
Token ID1525
Feature activation-2.75e-01

Pos logit contributions
 upon+0.011
ancer+0.011
 there+0.010
ulu+0.010
 they+0.010
Neg logit contributions
 itself-0.006
stick-0.006
iped-0.006
ez-0.006
ze-0.006
Token was
Token ID373
Feature activation+1.17e+00

Pos logit contributions
ancer+0.007
 upon+0.007
ulu+0.006
 don+0.006
 there+0.006
Neg logit contributions
stick-0.005
 itself-0.005
iped-0.005
ze-0.005
ez-0.005
Token spoiled
Token ID34781
Feature activation+1.72e+00

Pos logit contributions
 itself+0.020
stick+0.020
iped+0.019
ez+0.018
ze+0.018
Neg logit contributions
 upon-0.018
ancer-0.017
ulu-0.015
 they-0.014
 there-0.014
Token.
Token ID13
Feature activation+2.75e+00

Pos logit contributions
 itself+0.033
stick+0.033
iped+0.031
ez+0.030
ze+0.029
Neg logit contributions
 upon-0.047
ancer-0.043
ulu-0.039
 there-0.038
 they-0.038
Token<|endoftext|>
Token ID50256
Feature activation-2.53e+00

Pos logit contributions
stick+0.069
 itself+0.069
iped+0.067
ez+0.064
ze+0.064
Neg logit contributions
 upon-0.060
ancer-0.054
 there-0.048
 they-0.047
ulu-0.046
TokenOnce
Token ID7454
Feature activation-1.10e+01

Pos logit contributions
 upon+0.047
ancer+0.044
ulu+0.037
 they+0.036
ver+0.036
Neg logit contributions
 itself-0.068
stick-0.066
iped-0.064
ez-0.064
nob-0.063
Token upon
Token ID2402
Feature activation-1.49e+00

Pos logit contributions
ancer+0.091
 marsh+0.081
 queens+0.080
 pigs+0.068
Fre+0.066
Neg logit contributions
 itself-0.348
 His-0.327
 its-0.316
esh-0.310
 mistakes-0.310
Token a
Token ID257
Feature activation-1.61e+00

Pos logit contributions
 upon+0.032
ancer+0.028
 there+0.025
 they+0.025
ulu+0.024
Neg logit contributions
stick-0.039
 itself-0.038
iped-0.037
ez-0.036
ze-0.035
Token time
Token ID640
Feature activation-4.96e+00

Pos logit contributions
ancer+0.030
 upon+0.029
ulu+0.023
 they+0.023
 there+0.022
Neg logit contributions
ez-0.042
 itself-0.042
ze-0.042
izer-0.042
iped-0.042
Token,
Token ID11
Feature activation-8.20e+00

Pos logit contributions
ancer+0.140
ait+0.124
as+0.121
 thirsty+0.114
 marsh+0.114
Neg logit contributions
 itself-0.083
esh-0.070
nob-0.061
gh-0.061
 Its-0.060
Token there
Token ID612
Feature activation-3.12e+00

Pos logit contributions
ancer+0.112
ait+0.093
as+0.077
 upon+0.073
 dads+0.067
Neg logit contributions
 itself-0.273
nob-0.236
ez-0.236
iped-0.232
 its-0.232
Token was
Token ID373
Feature activation-5.62e-01

Pos logit contributions
 upon+0.087
ancer+0.086
ulu+0.073
ver+0.072
as+0.072
Neg logit contributions
 itself-0.057
stick-0.053
iped-0.052
ez-0.050
ze-0.049
Token a
Token ID257
Feature activation+1.10e+00

Pos logit contributions
 upon+0.011
ancer+0.010
 there+0.009
 they+0.009
ulu+0.008
Neg logit contributions
stick-0.015
iped-0.015
 itself-0.015
age-0.014
eness-0.014
Token,
Token ID11
Feature activation-7.58e+00

Pos logit contributions
ancer+0.127
 upon+0.114
as+0.109
ulu+0.108
ait+0.105
Neg logit contributions
 itself-0.103
ez-0.099
ze-0.099
iped-0.098
esh-0.096
Token Jack
Token ID3619
Feature activation+1.92e+00

Pos logit contributions
ancer+0.182
 upon+0.178
as+0.157
ulu+0.155
ver+0.151
Neg logit contributions
 itself-0.165
stick-0.150
iped-0.146
ez-0.142
ze-0.140
Token was
Token ID373
Feature activation+2.59e-01

Pos logit contributions
stick+0.054
 itself+0.054
iped+0.052
eness+0.052
ze+0.050
Neg logit contributions
 upon-0.038
 there-0.031
 they-0.029
ancer-0.028
 don-0.027
Token walking
Token ID6155
Feature activation+2.03e+00

Pos logit contributions
 itself+0.007
stick+0.007
iped+0.006
ez+0.006
ze+0.006
Neg logit contributions
 upon-0.005
ancer-0.005
 there-0.004
 they-0.004
ulu-0.004
Token in
Token ID287
Feature activation-3.64e-01

Pos logit contributions
iped+0.053
 itself+0.052
stick+0.052
ze+0.051
ez+0.051
Neg logit contributions
 upon-0.045
 there-0.037
ancer-0.036
 they-0.035
 while-0.032
Token the
Token ID262
Feature activation+1.54e+00

Pos logit contributions
 upon+0.007
ancer+0.006
 there+0.005
 they+0.005
ulu+0.005
Neg logit contributions
 itself-0.010
stick-0.010
iped-0.010
ez-0.009
ze-0.009
Token park
Token ID3952
Feature activation-5.91e-01

Pos logit contributions
 itself+0.044
stick+0.044
iped+0.044
eness+0.043
ez+0.042
Neg logit contributions
 upon-0.030
 there-0.024
ancer-0.023
 they-0.022
ulu-0.020
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
 upon+0.013
 there+0.011
 they+0.011
ancer+0.010
 while+0.010
Neg logit contributions
stick-0.015
 itself-0.015
iped-0.015
ez-0.015
eness-0.014
Token He
Token ID679
Feature activation+1.03e+00

Pos logit contributions
stick+0.031
 itself+0.031
iped+0.029
ez+0.028
ze+0.028
Neg logit contributions
 upon-0.052
ancer-0.047
 there-0.044
 they-0.043
ulu-0.042
Token stopped
Token ID5025
Feature activation-1.25e+00

Pos logit contributions
 itself+0.027
iped+0.026
stick+0.026
ze+0.026
ez+0.025
Neg logit contributions
ancer-0.019
 upon-0.017
ver-0.016
ulu-0.015
ait-0.015
Token to
Token ID284
Feature activation+2.98e+00

Pos logit contributions
 upon+0.028
ancer+0.026
ulu+0.023
ver+0.023
 they+0.022
Neg logit contributions
 itself-0.029
stick-0.029
iped-0.027
ez-0.027
ze-0.026
Token fall
Token ID2121
Feature activation+5.92e-01

Pos logit contributions
 itself+0.072
stick+0.068
nob+0.068
esh+0.063
ze+0.062
Neg logit contributions
ancer-0.074
 upon-0.073
as-0.063
ver-0.060
ulu-0.057
Token.
Token ID13
Feature activation+3.91e+00

Pos logit contributions
 itself+0.012
stick+0.012
iped+0.011
ze+0.010
ez+0.010
Neg logit contributions
 upon-0.016
ancer-0.015
ulu-0.013
 there-0.013
 they-0.013
Token It
Token ID632
Feature activation+1.74e+00

Pos logit contributions
 itself+0.070
stick+0.067
iped+0.062
ez+0.061
izer+0.060
Neg logit contributions
 upon-0.113
ancer-0.104
ulu-0.094
 there-0.093
ver-0.092
Token is
Token ID318
Feature activation+1.78e+00

Pos logit contributions
stick+0.035
 itself+0.035
iped+0.034
ez+0.032
ze+0.032
Neg logit contributions
 upon-0.046
ancer-0.042
 there-0.038
ulu-0.038
 they-0.037
Token cold
Token ID4692
Feature activation+5.09e-01

Pos logit contributions
 itself+0.040
stick+0.037
ze+0.035
iped+0.035
 fur+0.034
Neg logit contributions
ancer-0.045
 upon-0.041
ver-0.037
ulu-0.037
as-0.035
Token and
Token ID290
Feature activation-2.04e-03

Pos logit contributions
 itself+0.010
stick+0.010
iped+0.009
nob+0.009
ez+0.009
Neg logit contributions
 upon-0.014
ancer-0.013
ulu-0.011
ver-0.011
 there-0.011
Token wet
Token ID9583
Feature activation+2.34e+00

Pos logit contributions
ancer+0.000
ver+0.000
 upon+0.000
as+0.000
ait+0.000
Neg logit contributions
 itself-0.000
stick-0.000
 its-0.000
 fur-0.000
ze-0.000
Token.
Token ID13
Feature activation+3.91e+00

Pos logit contributions
 itself+0.046
stick+0.046
iped+0.044
ez+0.042
ze+0.042
Neg logit contributions
 upon-0.063
ancer-0.058
 there-0.052
ulu-0.051
 they-0.051
Token It
Token ID632
Feature activation+1.64e+00

Pos logit contributions
 itself+0.089
stick+0.083
iped+0.079
ez+0.078
izer+0.077
Neg logit contributions
 upon-0.095
ancer-0.086
ulu-0.077
ver-0.075
 there-0.075
Token makes
Token ID1838
Feature activation+4.39e-01

Pos logit contributions
stick+0.035
 itself+0.035
iped+0.033
ez+0.032
ze+0.032
Neg logit contributions
 upon-0.041
ancer-0.039
ver-0.034
ulu-0.034
 there-0.033
Token their
Token ID511
Feature activation+3.08e+00

Pos logit contributions
stick+0.012
iped+0.012
ez+0.011
izer+0.011
age+0.011
Neg logit contributions
 upon-0.008
 there-0.007
 they-0.007
ancer-0.007
 don-0.006
Token could
Token ID714
Feature activation+4.59e-01

Pos logit contributions
stick+0.027
 itself+0.026
iped+0.026
izer+0.025
eness+0.025
Neg logit contributions
 upon-0.017
 there-0.014
ancer-0.013
 they-0.013
 don-0.012
Token make
Token ID787
Feature activation+1.25e+00

Pos logit contributions
stick+0.011
 itself+0.011
ze+0.010
iped+0.010
age+0.010
Neg logit contributions
 upon-0.010
ancer-0.010
ulu-0.008
 there-0.008
 they-0.008
Token honey
Token ID12498
Feature activation-1.14e-01

Pos logit contributions
stick+0.032
iped+0.031
ez+0.030
age+0.030
 itself+0.030
Neg logit contributions
 upon-0.026
 there-0.022
ancer-0.022
 they-0.021
 don-0.019
Token from
Token ID422
Feature activation-1.03e+00

Pos logit contributions
 upon+0.003
ancer+0.003
ulu+0.002
ver+0.002
 however+0.002
Neg logit contributions
stick-0.003
 itself-0.003
ze-0.003
age-0.002
nob-0.002
Token the
Token ID262
Feature activation+1.42e+00

Pos logit contributions
 upon+0.026
ancer+0.025
ver+0.022
ulu+0.022
 they+0.021
Neg logit contributions
 itself-0.022
stick-0.022
iped-0.020
ez-0.020
ze-0.020
Token trees
Token ID7150
Feature activation-2.69e+00

Pos logit contributions
 itself+0.031
stick+0.031
ze+0.029
iped+0.029
age+0.028
Neg logit contributions
 upon-0.034
ancer-0.032
ver-0.029
 they-0.028
ulu-0.028
Token and
Token ID290
Feature activation-8.80e-01

Pos logit contributions
 upon+0.069
ancer+0.068
ulu+0.061
ver+0.055
 don+0.055
Neg logit contributions
stick-0.058
 itself-0.054
age-0.050
nob-0.048
ze-0.048
Token it
Token ID340
Feature activation-3.65e-01

Pos logit contributions
ancer+0.023
 upon+0.022
ver+0.021
ulu+0.020
as+0.020
Neg logit contributions
 itself-0.019
stick-0.019
ze-0.017
nob-0.016
 fur-0.016
Token tasted
Token ID29187
Feature activation+1.40e+00

Pos logit contributions
 upon+0.007
ancer+0.007
ulu+0.006
ver+0.006
ait+0.006
Neg logit contributions
stick-0.010
 itself-0.010
nob-0.009
ze-0.009
iped-0.009
Token sweet
Token ID6029
Feature activation+2.32e+00

Pos logit contributions
 itself+0.035
stick+0.033
ze+0.032
iped+0.032
ez+0.031
Neg logit contributions
 upon-0.030
ancer-0.030
ver-0.025
ulu-0.024
 they-0.023
Token!
Token ID0
Feature activation+2.68e+00

Pos logit contributions
stick+0.048
 itself+0.048
iped+0.047
ez+0.044
ze+0.044
Neg logit contributions
 upon-0.060
ancer-0.054
 there-0.050
 they-0.049
ulu-0.048
TokenOnce
Token ID7454
Feature activation-1.09e+01

Pos logit contributions
 upon+0.039
ancer+0.034
 they+0.029
ulu+0.028
 there+0.028
Neg logit contributions
 itself-0.068
stick-0.067
iped-0.065
ez-0.064
nob-0.063
Token there
Token ID612
Feature activation-3.61e+00

Pos logit contributions
ancer+0.074
 marsh+0.059
 queens+0.054
 thirsty+0.049
 pigs+0.042
Neg logit contributions
 itself-0.385
esh-0.341
 mistakes-0.339
 its-0.338
iped-0.334
Token was
Token ID373
Feature activation-1.47e+00

Pos logit contributions
 upon+0.097
ancer+0.096
ulu+0.082
as+0.081
 there+0.080
Neg logit contributions
 itself-0.068
stick-0.065
iped-0.063
ez-0.060
nob-0.059
Token a
Token ID257
Feature activation+5.11e-01

Pos logit contributions
 upon+0.032
ancer+0.028
 there+0.025
 they+0.025
ulu+0.024
Neg logit contributions
stick-0.037
 itself-0.037
iped-0.036
ez-0.035
ze-0.034
Token boy
Token ID2933
Feature activation+1.03e+00

Pos logit contributions
 itself+0.014
stick+0.013
ze+0.013
iped+0.013
izer+0.013
Neg logit contributions
 upon-0.009
ancer-0.008
ver-0.007
ulu-0.007
 there-0.007
Token named
Token ID3706
Feature activation-3.17e+00

Pos logit contributions
 itself+0.031
iped+0.030
stick+0.030
eness+0.029
bons+0.028
Neg logit contributions
 upon-0.018
 there-0.016
 they-0.016
 while-0.014
ancer-0.014
Token Joe
Token ID5689
Feature activation+1.00e+00

Pos logit contributions
 upon+0.090
ancer+0.089
ulu+0.082
as+0.081
ver+0.079
Neg logit contributions
 itself-0.051
stick-0.047
nob-0.045
esh-0.043
iped-0.043
Token.
Token ID13
Feature activation+7.48e-01

Pos logit contributions
stick+0.025
iped+0.025
 itself+0.025
bons+0.024
eness+0.023
Neg logit contributions
 upon-0.023
 there-0.020
 they-0.019
ancer-0.018
 don-0.017
Token He
Token ID679
Feature activation+1.07e+00

Pos logit contributions
stick+0.017
 itself+0.017
iped+0.016
ez+0.016
ze+0.016
Neg logit contributions
 upon-0.018
ancer-0.016
 there-0.015
 they-0.014
ulu-0.014
Token was
Token ID373
Feature activation+7.93e-01

Pos logit contributions
stick+0.036
eness+0.036
 itself+0.035
iped+0.035
 mistakes+0.034
Neg logit contributions
 upon-0.017
 there-0.014
 they-0.013
 don-0.012
ancer-0.011
Token three
Token ID1115
Feature activation+7.08e-01

Pos logit contributions
stick+0.018
 itself+0.018
iped+0.018
ze+0.017
ez+0.017
Neg logit contributions
 upon-0.018
ancer-0.016
ulu-0.014
ver-0.014
 they-0.014
Token They
Token ID1119
Feature activation-2.80e-01

Pos logit contributions
 itself+0.063
stick+0.063
iped+0.060
ez+0.058
ze+0.057
Neg logit contributions
 upon-0.072
ancer-0.065
 there-0.059
ulu-0.058
 they-0.058
Token said
Token ID531
Feature activation+2.49e-01

Pos logit contributions
 upon+0.006
ancer+0.006
ulu+0.005
 there+0.005
ver+0.005
Neg logit contributions
stick-0.007
 itself-0.007
iped-0.007
ez-0.007
ze-0.007
Token goodbye
Token ID24829
Feature activation+1.75e-01

Pos logit contributions
 itself+0.006
stick+0.006
iped+0.006
ze+0.006
ez+0.006
Neg logit contributions
 upon-0.005
ancer-0.005
ulu-0.004
 there-0.004
ver-0.004
Token to
Token ID284
Feature activation+1.08e+00

Pos logit contributions
stick+0.004
 itself+0.004
iped+0.004
ez+0.004
ze+0.004
Neg logit contributions
 upon-0.004
ancer-0.003
 there-0.003
 they-0.003
ulu-0.003
Token the
Token ID262
Feature activation+5.66e-01

Pos logit contributions
 itself+0.028
stick+0.028
iped+0.026
ez+0.026
ze+0.026
Neg logit contributions
 upon-0.022
ancer-0.020
 there-0.017
ulu-0.017
ver-0.017
Token ostr
Token ID42387
Feature activation-3.69e+00

Pos logit contributions
stick+0.014
 itself+0.014
iped+0.014
ez+0.014
ze+0.013
Neg logit contributions
 upon-0.012
ancer-0.011
 there-0.010
 they-0.009
ulu-0.009
Tokenich
Token ID488
Feature activation+7.17e-01

Pos logit contributions
 upon+0.085
ancer+0.076
 there+0.067
 they+0.067
ulu+0.066
Neg logit contributions
 itself-0.087
stick-0.085
iped-0.083
nob-0.081
ez-0.081
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
 itself+0.017
stick+0.017
iped+0.016
ez+0.016
ze+0.015
Neg logit contributions
 upon-0.017
ancer-0.015
 there-0.013
 they-0.013
ulu-0.013
Token walked
Token ID6807
Feature activation+1.74e+00

Pos logit contributions
stick+0.002
 itself+0.002
iped+0.002
ez+0.002
ze+0.002
Neg logit contributions
 upon-0.002
ancer-0.002
ulu-0.002
ver-0.002
 there-0.002
Token home
Token ID1363
Feature activation-2.25e+00

Pos logit contributions
stick+0.035
iped+0.035
ez+0.035
 itself+0.034
ze+0.034
Neg logit contributions
 upon-0.047
 there-0.041
 they-0.040
ancer-0.038
ulu-0.036
Token.
Token ID13
Feature activation+3.00e+00

Pos logit contributions
 upon+0.061
ancer+0.060
ulu+0.053
ver+0.050
 don+0.049
Neg logit contributions
 itself-0.043
stick-0.043
iped-0.038
ze-0.037
age-0.037
Token.
Token ID13
Feature activation+2.28e+00

Pos logit contributions
 itself+0.029
iped+0.028
stick+0.028
ez+0.027
nob+0.026
Neg logit contributions
 upon-0.026
ancer-0.024
 there-0.022
 they-0.021
ulu-0.020
Token It
Token ID632
Feature activation+1.22e+00

Pos logit contributions
 itself+0.041
stick+0.039
ez+0.035
iped+0.035
bons+0.034
Neg logit contributions
 upon-0.065
ancer-0.061
ver-0.055
ulu-0.054
as-0.054
Token was
Token ID373
Feature activation+1.50e+00

Pos logit contributions
stick+0.024
 itself+0.023
iped+0.022
ze+0.022
nob+0.022
Neg logit contributions
 upon-0.033
ancer-0.032
ver-0.027
ulu-0.027
 there-0.027
Token special
Token ID2041
Feature activation+1.80e+00

Pos logit contributions
 itself+0.035
stick+0.033
iped+0.031
bons+0.031
ze+0.030
Neg logit contributions
 upon-0.035
ancer-0.035
ver-0.030
ulu-0.028
 don-0.028
Token,
Token ID11
Feature activation-4.50e+00

Pos logit contributions
 itself+0.041
stick+0.041
iped+0.039
ze+0.037
ez+0.037
Neg logit contributions
 upon-0.043
ancer-0.039
ulu-0.035
ver-0.034
 there-0.034
Token because
Token ID780
Feature activation-5.49e+00

Pos logit contributions
 upon+0.102
ancer+0.100
ver+0.086
ulu+0.085
as+0.080
Neg logit contributions
 itself-0.109
stick-0.105
ze-0.097
iped-0.097
bons-0.095
Token the
Token ID262
Feature activation-2.23e-01

Pos logit contributions
 upon+0.154
ancer+0.154
suits+0.140
ver+0.139
as+0.137
Neg logit contributions
 itself-0.103
 its-0.082
 mistakes-0.079
stick-0.078
er-0.078
Token iron
Token ID6953
Feature activation-1.06e+00

Pos logit contributions
 upon+0.005
ancer+0.005
ver+0.004
 they+0.004
ulu+0.004
Neg logit contributions
stick-0.005
 itself-0.005
iped-0.005
ze-0.005
age-0.005
Token had
Token ID550
Feature activation+2.17e+00

Pos logit contributions
 upon+0.026
ancer+0.023
 there+0.021
ulu+0.020
 they+0.020
Neg logit contributions
stick-0.024
 itself-0.024
iped-0.023
ez-0.022
ze-0.022
Token purple
Token ID14032
Feature activation+2.17e+00

Pos logit contributions
 itself+0.053
stick+0.052
iped+0.050
ez+0.048
ze+0.048
Neg logit contributions
 upon-0.049
ancer-0.045
ulu-0.039
 there-0.039
ver-0.038
Token and
Token ID290
Feature activation-1.79e-01

Pos logit contributions
 itself+0.048
stick+0.047
iped+0.047
ez+0.045
izer+0.044
Neg logit contributions
 upon-0.053
ancer-0.047
 there-0.044
 they-0.043
ulu-0.041
Token write
Token ID3551
Feature activation+2.43e+00

Pos logit contributions
 upon+0.003
ancer+0.003
 there+0.002
ulu+0.002
 they+0.002
Neg logit contributions
 itself-0.003
stick-0.003
iped-0.003
ze-0.003
ez-0.003
Token or
Token ID393
Feature activation-7.18e-02

Pos logit contributions
 itself+0.054
stick+0.052
ze+0.049
iped+0.049
nob+0.047
Neg logit contributions
 upon-0.060
ancer-0.059
ulu-0.052
 they-0.048
ver-0.048
Token think
Token ID892
Feature activation-2.67e+00

Pos logit contributions
 upon+0.002
ancer+0.002
ver+0.002
ulu+0.002
as+0.001
Neg logit contributions
stick-0.002
 itself-0.002
ze-0.001
iped-0.001
ez-0.001
Token.
Token ID13
Feature activation+3.67e+00

Pos logit contributions
 upon+0.066
ancer+0.064
ulu+0.053
 thirsty+0.052
ll+0.051
Neg logit contributions
 itself-0.061
 mistakes-0.053
ze-0.052
stick-0.052
nob-0.051
Token
Token ID198
Feature activation-2.58e+00

Pos logit contributions
 itself+0.079
stick+0.074
iped+0.071
ez+0.068
 mistakes+0.067
Neg logit contributions
 upon-0.091
ancer-0.086
ulu-0.076
 they-0.074
ver-0.074
Token
Token ID198
Feature activation-2.47e+00

Pos logit contributions
 upon+0.054
ancer+0.050
 there+0.043
ulu+0.042
 they+0.042
Neg logit contributions
 itself-0.066
stick-0.064
iped-0.062
ez-0.060
ze-0.059
TokenShe
Token ID3347
Feature activation+6.50e-01

Pos logit contributions
ancer+0.061
 upon+0.061
ver+0.051
ulu+0.049
 they+0.049
Neg logit contributions
 itself-0.052
stick-0.048
ez-0.047
iped-0.047
age-0.047
Token opened
Token ID4721
Feature activation+3.41e+00

Pos logit contributions
stick+0.017
 itself+0.017
iped+0.016
ez+0.016
ze+0.016
Neg logit contributions
 upon-0.014
ancer-0.012
 there-0.011
ulu-0.011
 they-0.010
Token the
Token ID262
Feature activation-5.58e-02

Pos logit contributions
 itself+0.084
stick+0.083
iped+0.080
ez+0.077
ze+0.076
Neg logit contributions
 upon-0.076
ancer-0.069
 there-0.060
ulu-0.060
 they-0.059
Token window
Token ID4324
Feature activation-1.14e+00

Pos logit contributions
 upon+0.001
ancer+0.001
 there+0.001
 they+0.001
ulu+0.001
Neg logit contributions
 itself-0.001
stick-0.001
iped-0.001
ez-0.001
ze-0.001
Token and
Token ID290
Feature activation-2.50e-01

Pos logit contributions
 upon+0.028
ancer+0.026
ulu+0.023
 there+0.023
 they+0.023
Neg logit contributions
 itself-0.025
stick-0.025
iped-0.023
ez-0.022
ze-0.022
Token with
Token ID351
Feature activation-2.30e+00

Pos logit contributions
 itself+0.008
stick+0.008
iped+0.008
nob+0.008
age+0.008
Neg logit contributions
 upon-0.009
ancer-0.008
ulu-0.007
 they-0.007
 don-0.007
Token him
Token ID683
Feature activation+1.26e+00

Pos logit contributions
 upon+0.062
ancer+0.057
 there+0.051
ulu+0.051
 they+0.050
Neg logit contributions
 itself-0.046
stick-0.045
iped-0.043
ez-0.042
ze-0.041
Token.
Token ID13
Feature activation+2.64e+00

Pos logit contributions
 itself+0.022
stick+0.022
iped+0.020
nob+0.019
ez+0.019
Neg logit contributions
 upon-0.035
ancer-0.034
ulu-0.032
 they-0.029
ver-0.029
Token
Token ID198
Feature activation-2.49e+00

Pos logit contributions
 itself+0.066
stick+0.065
iped+0.064
ez+0.061
age+0.060
Neg logit contributions
 upon-0.057
ancer-0.052
ulu-0.045
 there-0.045
 they-0.044
Token
Token ID198
Feature activation-2.53e+00

Pos logit contributions
 upon+0.055
ancer+0.054
ulu+0.045
 there+0.044
 don+0.043
Neg logit contributions
 itself-0.058
stick-0.055
iped-0.054
ez-0.053
age-0.052
TokenWhile
Token ID3633
Feature activation-7.34e+00

Pos logit contributions
 upon+0.053
ancer+0.048
 there+0.042
ulu+0.041
 they+0.041
Neg logit contributions
 itself-0.065
stick-0.065
iped-0.063
ez-0.061
ze-0.060
Token they
Token ID484
Feature activation-2.56e+00

Pos logit contributions
 upon+0.136
ancer+0.134
ulu+0.118
as+0.112
 don+0.107
Neg logit contributions
 itself-0.188
iped-0.180
stick-0.174
ez-0.173
bones-0.167
Token were
Token ID547
Feature activation-7.59e-01

Pos logit contributions
 upon+0.037
ancer+0.037
ulu+0.030
as+0.026
 steep+0.024
Neg logit contributions
 itself-0.075
iped-0.074
stick-0.074
ez-0.072
bons-0.069
Token walking
Token ID6155
Feature activation+1.20e+00

Pos logit contributions
ancer+0.019
 upon+0.018
ulu+0.018
ver+0.015
 don+0.015
Neg logit contributions
iped-0.014
ze-0.014
stick-0.014
 itself-0.013
 mistakes-0.013
Token to
Token ID284
Feature activation+1.80e+00

Pos logit contributions
 itself+0.024
stick+0.023
age+0.022
iped+0.021
er+0.021
Neg logit contributions
ancer-0.029
 upon-0.028
ulu-0.026
ait-0.025
as-0.025
Token the
Token ID262
Feature activation+7.33e-01

Pos logit contributions
 itself+0.042
stick+0.040
izer+0.038
iped+0.038
nob+0.038
Neg logit contributions
 upon-0.040
ancer-0.039
as-0.036
ulu-0.035
ver-0.033
Token and
Token ID290
Feature activation-1.40e-01

Pos logit contributions
 itself+0.046
stick+0.044
iped+0.042
ze+0.041
age+0.041
Neg logit contributions
 upon-0.043
ancer-0.041
ulu-0.036
 there-0.034
 they-0.033
Token follow
Token ID1061
Feature activation-2.08e-01

Pos logit contributions
 upon+0.003
ancer+0.003
ulu+0.003
ver+0.003
ait+0.002
Neg logit contributions
 itself-0.003
stick-0.003
iped-0.003
age-0.003
ze-0.003
Token advice
Token ID5608
Feature activation-1.61e+00

Pos logit contributions
 upon+0.005
ancer+0.005
ulu+0.004
 there+0.004
ver+0.004
Neg logit contributions
 itself-0.005
stick-0.005
iped-0.004
ze-0.004
ez-0.004
Token.
Token ID13
Feature activation+2.39e+00

Pos logit contributions
 upon+0.039
ancer+0.037
ulu+0.034
 thirsty+0.032
 there+0.032
Neg logit contributions
 itself-0.037
stick-0.035
iped-0.032
age-0.032
ze-0.032
Token<|endoftext|>
Token ID50256
Feature activation-2.52e+00

Pos logit contributions
stick+0.063
iped+0.062
 itself+0.062
ze+0.059
ez+0.058
Neg logit contributions
 upon-0.048
ancer-0.043
 there-0.038
 they-0.038
ulu-0.037
TokenOnce
Token ID7454
Feature activation-1.14e+01

Pos logit contributions
 upon+0.049
ancer+0.045
ulu+0.038
 they+0.038
ver+0.037
Neg logit contributions
 itself-0.066
stick-0.064
ez-0.062
iped-0.062
nob-0.061
Token upon
Token ID2402
Feature activation-1.76e+00

Pos logit contributions
ancer+0.084
 marsh+0.084
 queens+0.068
 cam+0.065
Fre+0.063
Neg logit contributions
 itself-0.382
 its-0.342
 mistakes-0.336
 His-0.334
 Its-0.334
Token a
Token ID257
Feature activation-1.75e+00

Pos logit contributions
 upon+0.038
ancer+0.034
 there+0.030
 they+0.030
ulu+0.029
Neg logit contributions
stick-0.044
 itself-0.044
iped-0.043
ez-0.041
ze-0.040
Token time
Token ID640
Feature activation-5.18e+00

Pos logit contributions
ancer+0.032
 upon+0.031
 they+0.025
ulu+0.025
 there+0.024
Neg logit contributions
ez-0.046
 itself-0.046
ze-0.046
izer-0.045
iped-0.045
Token,
Token ID11
Feature activation-8.35e+00

Pos logit contributions
ancer+0.146
ait+0.128
as+0.125
 marsh+0.122
 thirsty+0.121
Neg logit contributions
 itself-0.080
esh-0.070
 His-0.061
 Its-0.059
stick-0.059
Token there
Token ID612
Feature activation-3.31e+00

Pos logit contributions
ancer+0.110
ait+0.085
 restaurants+0.071
 upon+0.071
 marsh+0.070
Neg logit contributions
 itself-0.279
iped-0.246
nob-0.244
stick-0.239
ez-0.237
Token said
Token ID531
Feature activation+1.24e-02

Pos logit contributions
 upon+0.027
ancer+0.025
ulu+0.021
 there+0.021
ver+0.021
Neg logit contributions
stick-0.032
 itself-0.031
iped-0.031
ez-0.030
ze-0.029
Token,
Token ID11
Feature activation-5.74e+00

Pos logit contributions
 itself+0.000
stick+0.000
ze+0.000
iped+0.000
ez+0.000
Neg logit contributions
 upon-0.000
ancer-0.000
ulu-0.000
as-0.000
ver-0.000
Token "
Token ID366
Feature activation+1.54e+00

Pos logit contributions
 upon+0.130
ancer+0.119
ulu+0.105
 there+0.102
ver+0.101
Neg logit contributions
 itself-0.137
stick-0.135
iped-0.128
ze-0.125
ez-0.124
TokenL
Token ID43
Feature activation-1.67e+00

Pos logit contributions
 itself+0.038
stick+0.038
iped+0.036
ez+0.035
ze+0.034
Neg logit contributions
 upon-0.035
ancer-0.031
 there-0.028
 they-0.027
ulu-0.027
Tokenily
Token ID813
Feature activation+2.20e+00

Pos logit contributions
 upon+0.038
ancer+0.033
ulu+0.031
 there+0.030
 they+0.029
Neg logit contributions
stick-0.042
 itself-0.041
iped-0.040
ze-0.038
ez-0.038
Token,
Token ID11
Feature activation-7.22e+00

Pos logit contributions
stick+0.063
 itself+0.063
iped+0.061
ez+0.060
ze+0.059
Neg logit contributions
 upon-0.040
ancer-0.035
ulu-0.029
 there-0.029
 they-0.028
Token you
Token ID345
Feature activation-4.92e-01

Pos logit contributions
 upon+0.163
ancer+0.153
ver+0.132
ulu+0.131
as+0.125
Neg logit contributions
 itself-0.174
stick-0.167
ze-0.162
iped-0.156
ez-0.152
Token need
Token ID761
Feature activation+8.35e-01

Pos logit contributions
 upon+0.009
ancer+0.008
ulu+0.007
 they+0.007
 however+0.006
Neg logit contributions
stick-0.014
 itself-0.014
ez-0.013
ze-0.013
iped-0.013
Token to
Token ID284
Feature activation+3.00e+00

Pos logit contributions
stick+0.022
 itself+0.022
iped+0.021
ez+0.021
ze+0.020
Neg logit contributions
 upon-0.017
ancer-0.015
 there-0.014
 they-0.013
ulu-0.013
Token clean
Token ID3424
Feature activation+5.06e+00

Pos logit contributions
stick+0.076
 itself+0.076
iped+0.076
ez+0.073
izer+0.069
Neg logit contributions
 upon-0.063
ancer-0.053
 there-0.052
 don-0.050
 they-0.050
Token up
Token ID510
Feature activation-8.17e-01

Pos logit contributions
stick+0.115
 itself+0.112
iped+0.111
age+0.110
ez+0.108
Neg logit contributions
 upon-0.119
 there-0.106
ancer-0.102
 they-0.099
 don-0.094
Token's
Token ID338
Feature activation+2.47e+00

Pos logit contributions
 itself+0.071
stick+0.068
iped+0.066
ez+0.063
nob+0.062
Neg logit contributions
 upon-0.103
ancer-0.097
ulu-0.089
 there-0.086
 they-0.085
Token cane
Token ID33009
Feature activation+2.36e+00

Pos logit contributions
 itself+0.057
stick+0.056
iped+0.053
ze+0.053
 mistakes+0.051
Neg logit contributions
 upon-0.057
ancer-0.056
ver-0.048
ulu-0.048
 they-0.045
Token weighed
Token ID20261
Feature activation+3.00e-01

Pos logit contributions
stick+0.051
 itself+0.050
iped+0.047
ze+0.046
age+0.046
Neg logit contributions
 upon-0.059
ancer-0.055
ulu-0.051
 there-0.047
ver-0.047
Token the
Token ID262
Feature activation+5.54e-01

Pos logit contributions
 itself+0.007
stick+0.006
iped+0.006
ez+0.006
ze+0.006
Neg logit contributions
 upon-0.007
ancer-0.007
ulu-0.006
 they-0.006
ver-0.006
Token same
Token ID976
Feature activation+5.56e-01

Pos logit contributions
stick+0.011
 itself+0.011
ze+0.010
iped+0.010
 mistakes+0.010
Neg logit contributions
 upon-0.014
ancer-0.013
ulu-0.012
ver-0.012
 they-0.012
Token as
Token ID355
Feature activation-7.11e+00

Pos logit contributions
 itself+0.012
stick+0.012
iped+0.011
ze+0.011
ez+0.011
Neg logit contributions
 upon-0.014
ancer-0.013
ulu-0.012
 they-0.012
 there-0.011
Token the
Token ID262
Feature activation-8.88e-01

Pos logit contributions
 upon+0.160
ancer+0.153
ulu+0.134
ver+0.124
as+0.118
Neg logit contributions
 itself-0.174
stick-0.152
iped-0.148
 mistakes-0.146
ze-0.144
Token bag
Token ID6131
Feature activation-2.50e+00

Pos logit contributions
 upon+0.023
ancer+0.021
ulu+0.019
ver+0.019
 there+0.018
Neg logit contributions
stick-0.018
 itself-0.018
iped-0.017
ze-0.017
ez-0.017
Token of
Token ID286
Feature activation-2.16e+00

Pos logit contributions
 upon+0.053
ancer+0.049
ulu+0.043
 there+0.040
 however+0.039
Neg logit contributions
stick-0.063
 itself-0.062
iped-0.057
age-0.057
ze-0.056
Token apples
Token ID22514
Feature activation-2.78e+00

Pos logit contributions
 upon+0.056
ancer+0.053
ulu+0.047
ver+0.046
 there+0.044
Neg logit contributions
 itself-0.046
stick-0.044
iped-0.042
ez-0.041
bons-0.041
Token.
Token ID13
Feature activation+2.47e+00

Pos logit contributions
 upon+0.069
ancer+0.068
ulu+0.061
as+0.055
 hotels+0.055
Neg logit contributions
 itself-0.061
stick-0.058
iped-0.050
age-0.050
er-0.049
Token.
Token ID13
Feature activation+3.85e+00

Pos logit contributions
 itself+0.053
stick+0.052
iped+0.050
ez+0.048
ze+0.047
Neg logit contributions
 upon-0.071
ancer-0.065
 there-0.059
ulu-0.058
 they-0.058
Token
Token ID198
Feature activation-2.64e+00

Pos logit contributions
 itself+0.090
stick+0.085
iped+0.084
ez+0.080
izer+0.079
Neg logit contributions
 upon-0.091
ancer-0.084
ulu-0.074
 there-0.071
ver-0.070
Token
Token ID198
Feature activation-2.55e+00

Pos logit contributions
 upon+0.057
ancer+0.053
 there+0.044
ulu+0.044
 they+0.043
Neg logit contributions
 itself-0.067
stick-0.064
iped-0.062
ez-0.061
ze-0.060
TokenBut
Token ID1537
Feature activation-3.98e+00

Pos logit contributions
ancer+0.064
 upon+0.063
ulu+0.053
ver+0.050
 there+0.049
Neg logit contributions
 itself-0.055
stick-0.051
iped-0.051
age-0.050
ze-0.050
Token then
Token ID788
Feature activation-3.82e+00

Pos logit contributions
ancer+0.091
 upon+0.089
as+0.076
ulu+0.074
ait+0.072
Neg logit contributions
 itself-0.097
stick-0.084
iped-0.084
ze-0.080
 fur-0.078
Token,
Token ID11
Feature activation-7.28e+00

Pos logit contributions
 upon+0.078
ancer+0.078
ulu+0.065
as+0.061
ait+0.059
Neg logit contributions
 itself-0.099
iped-0.092
stick-0.089
ze-0.087
esh-0.085
Token something
Token ID1223
Feature activation-4.92e+00

Pos logit contributions
ancer+0.147
 upon+0.136
ulu+0.121
as+0.118
ait+0.116
Neg logit contributions
 itself-0.193
iped-0.174
stick-0.171
ze-0.168
 fur-0.164
Token bad
Token ID2089
Feature activation-3.78e-01

Pos logit contributions
 upon+0.100
ancer+0.085
 there+0.081
 they+0.080
ulu+0.077
Neg logit contributions
stick-0.134
 itself-0.129
iped-0.127
ez-0.123
bons-0.121
Token happened
Token ID3022
Feature activation+3.52e+00

Pos logit contributions
 upon+0.008
ancer+0.007
 there+0.006
 they+0.006
ulu+0.006
Neg logit contributions
stick-0.010
 itself-0.010
iped-0.009
ez-0.009
ze-0.009
Token.
Token ID13
Feature activation+3.16e+00

Pos logit contributions
stick+0.090
iped+0.086
bons+0.085
izer+0.082
ez+0.081
Neg logit contributions
 upon-0.084
 there-0.072
 they-0.070
ancer-0.064
 don-0.063
Token Tom
Token ID4186
Feature activation+2.49e+00

Pos logit contributions
stick+0.068
 itself+0.067
iped+0.065
ze+0.062
ez+0.062
Neg logit contributions
 upon-0.080
ancer-0.073
 there-0.067
 they-0.066
ulu-0.064