TOP ACTIVATIONS
MAX = 11.686

my old hoop and I 'd love to share it ."
Bob a treat , he 'd give a little shrug .
faced tough times , they 'd stick together and always be
was angry , but I should not have done it .
need it , and we should always be willing to help
" Thank you . I 'd like that !"
new things , but we should always be careful , okay
strawberries and I thought I 'd surprise you !" The bear
She looked better than they 'd ever seen before . Daisy
his heart , knowing he 'd never see him again .
He couldn 't believe he 'd been so adventurous .
, " I knew you 'd like it !" Emily was
us freeze , but we should always be careful and not
She asked Joe , " Should we take a break ?"
things happen , but we should still keep trying and working
world he promised himself he 'd come back soon . <|endoftext|>
at you , but I should not have pushed your blocks
get hurt , but we should always ask for help when
, little worm . Friends should never hurt each other ."
Sam spoke up : " Should n 't we share the

MIN ACTIVATIONS
MIN = -15.769

She said to Mr . Sp ud , " I want
, he noticed that the Sm elly King was cruel and
, Lily and Mr . Sp ud went to the park
aters . They lifted the swe aters and found a small
Tom thanked the wise owl and went home feeling
Annie wanted to come with Sp ide y and explore the
" I like your smile Sp ud . You always make
L ily asked the paramed ic , " Is the
Dad then looked at the bald man on the TV .
wanted to play with the wise old cat using the string
girl walked up to the bald man and asked , "
a big bird named O str ich wanted to swim in
so happy and thanked the gr umpy fisherman .
happy . They thanked the wise old man and played with
getting very annoyed with his rud eness and complained that he
then you can get a namesake bowl !"
very excited and hugged the bald man . Every
cute pink rabbit called Mrs Fl uffy . She was small
wants to talk to the bald man . He runs to
and carefully picked up the hairy cater pillar and put it

INTERVAL 4.823 - 11.686
CONTAINS 6.714%

They decide to forgive him and build a new dragon .
The boy was happy to have his t eddy bear to
special and delicious . She wonders if it tastes like gold
keep my promises . I will always be an honest and
the onion , thanked Mum and went back to his dinner

INTERVAL -2.041 - 4.823
CONTAINS 68.520%

drawings ," Tim my said sadly . S ally
said , " I 'm glad you like it , Tim
put her arms around her knees and rolled down the hill
As they were walking to the car , Tim my
in the snow . They are happy and fair . <|endoftext|>

INTERVAL -8.905 - -2.041
CONTAINS 21.891%

Fred won each game . One day ,
the park . Sam had a snack in his bag .
who was a ha ird ress er and asked for help
. <|endoftext|> Once upon a time there was a little girl
, it shot up in the sky and created the bright

INTERVAL -15.769 - -8.905
CONTAINS 2.874%

. He decided to measure the warmth of the sun .
ily finally gave her mom the teaspoon and her mom said
wanted to slide down with the gem , to feel the
, Jen , I found the sound ! We can still
to the store to get the diamond . But when they
Token my
Token ID616
Feature activation-4.82e+00

Pos logit contributions
 managed+0.058
ged+0.049
pped+0.049
 wouldn+0.048
 able+0.047
Neg logit contributions
est-0.070
 sky-0.067
andel-0.067
 world-0.066
umm-0.065
Token old
Token ID1468
Feature activation-6.09e+00

Pos logit contributions
andel+0.073
aff+0.071
umm+0.069
umpy+0.068
est+0.066
Neg logit contributions
 managed-0.094
 never-0.093
 wouldn-0.091
 not-0.089
 going-0.089
Token hoop
Token ID44792
Feature activation+7.43e-01

Pos logit contributions
andel+0.095
umm+0.091
aff+0.088
umpy+0.083
est+0.083
Neg logit contributions
 managed-0.120
 wouldn-0.113
 never-0.111
 him-0.109
 it-0.108
Token and
Token ID290
Feature activation+5.74e+00

Pos logit contributions
 managed+0.012
 wouldn+0.011
 never+0.011
 able+0.011
 him+0.011
Neg logit contributions
andel-0.014
umm-0.013
est-0.013
aff-0.013
 sky-0.013
Token I
Token ID314
Feature activation+7.56e+00

Pos logit contributions
 managed+0.083
pped+0.077
ged+0.071
 able+0.070
 him+0.069
Neg logit contributions
 sky-0.108
est-0.105
 sun-0.100
 world-0.100
ay-0.096
Token'd
Token ID1549
Feature activation+1.03e+01

Pos logit contributions
 managed+0.103
 able+0.092
 him+0.090
 wouldn+0.087
 paying+0.087
Neg logit contributions
est-0.152
andel-0.147
umpy-0.145
aff-0.144
 sky-0.144
Token love
Token ID1842
Feature activation+3.06e+00

Pos logit contributions
 managed+0.156
 wouldn+0.140
 able+0.137
 him+0.135
pped+0.134
Neg logit contributions
est-0.185
 sky-0.177
 world-0.174
andel-0.172
aff-0.169
Token to
Token ID284
Feature activation+5.23e+00

Pos logit contributions
 managed+0.033
 wouldn+0.025
 remembered+0.023
 able+0.023
 freed+0.022
Neg logit contributions
est-0.075
 sky-0.074
 world-0.073
andel-0.073
umm-0.072
Token share
Token ID2648
Feature activation+3.19e+00

Pos logit contributions
 managed+0.072
 wouldn+0.062
 never+0.061
 him+0.060
 able+0.059
Neg logit contributions
andel-0.112
umm-0.111
est-0.111
aff-0.108
 sky-0.105
Token it
Token ID340
Feature activation+1.22e+00

Pos logit contributions
 managed+0.029
 wouldn+0.023
 never+0.023
 him+0.023
 able+0.022
Neg logit contributions
andel-0.082
umm-0.082
est-0.081
aff-0.079
 sky-0.078
Token."
Token ID526
Feature activation-9.72e-01

Pos logit contributions
 managed+0.016
 never+0.015
 not+0.015
 able+0.015
 it+0.015
Neg logit contributions
andel-0.027
umm-0.026
aff-0.025
 sky-0.025
est-0.024
Token Bob
Token ID5811
Feature activation-1.85e+00

Pos logit contributions
 managed+0.039
 wouldn+0.029
 never+0.028
 able+0.028
 remembered+0.027
Neg logit contributions
est-0.102
andel-0.100
umm-0.098
aff-0.098
 sky-0.097
Token a
Token ID257
Feature activation-3.93e+00

Pos logit contributions
andel+0.042
umm+0.042
est+0.040
aff+0.040
 sky+0.039
Neg logit contributions
 managed-0.023
 him-0.020
 wouldn-0.020
 able-0.019
 never-0.019
Token treat
Token ID2190
Feature activation+1.70e+00

Pos logit contributions
umm+0.066
andel+0.064
aff+0.063
est+0.062
umpy+0.059
Neg logit contributions
 managed-0.074
 wouldn-0.066
 never-0.065
 it-0.064
 able-0.064
Token,
Token ID11
Feature activation+4.41e+00

Pos logit contributions
 managed+0.030
 wouldn+0.026
 never+0.026
 able+0.026
 remembered+0.025
Neg logit contributions
est-0.029
andel-0.028
umm-0.028
aff-0.027
 sky-0.027
Token he
Token ID339
Feature activation+5.42e+00

Pos logit contributions
 managed+0.075
 wouldn+0.066
pped+0.064
 never+0.064
 able+0.064
Neg logit contributions
est-0.076
andel-0.074
 sky-0.072
umm-0.072
aff-0.071
Token'd
Token ID1549
Feature activation+1.02e+01

Pos logit contributions
 managed+0.058
 able+0.051
pped+0.050
 him+0.049
 preventing+0.048
Neg logit contributions
est-0.121
 sky-0.115
aff-0.114
andel-0.112
umm-0.112
Token give
Token ID1577
Feature activation+3.58e+00

Pos logit contributions
 managed+0.128
 wouldn+0.117
 able+0.114
pped+0.113
plug+0.112
Neg logit contributions
est-0.205
 sky-0.187
 world-0.184
umpy-0.182
aff-0.181
Token a
Token ID257
Feature activation-2.72e+00

Pos logit contributions
 managed+0.030
 wouldn+0.022
 able+0.021
 remembered+0.019
 never+0.019
Neg logit contributions
est-0.094
andel-0.092
umm-0.091
aff-0.091
 sky-0.090
Token little
Token ID1310
Feature activation-3.93e+00

Pos logit contributions
umm+0.050
andel+0.048
aff+0.047
est+0.046
umpy+0.045
Neg logit contributions
 managed-0.048
 it-0.042
 wouldn-0.042
 able-0.042
 never-0.041
Token shrug
Token ID32545
Feature activation+1.13e-01

Pos logit contributions
andel+0.073
aff+0.072
umm+0.069
umpy+0.067
 sky+0.065
Neg logit contributions
 managed-0.066
 it-0.058
 wouldn-0.058
 able-0.058
 him-0.057
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
 managed+0.002
 wouldn+0.002
 never+0.002
 him+0.002
 able+0.002
Neg logit contributions
andel-0.002
umm-0.002
est-0.002
aff-0.002
 sky-0.002
Token faced
Token ID7452
Feature activation+1.52e+00

Pos logit contributions
 managed+0.073
 able+0.070
 preventing+0.068
pped+0.067
 paying+0.067
Neg logit contributions
est-0.120
 sky-0.115
aff-0.111
umpy-0.108
andel-0.107
Token tough
Token ID5802
Feature activation-2.59e+00

Pos logit contributions
 managed+0.014
 him+0.011
 never+0.011
 wouldn+0.011
 able+0.010
Neg logit contributions
andel-0.039
umm-0.039
est-0.039
aff-0.038
 sky-0.037
Token times
Token ID1661
Feature activation+2.27e+00

Pos logit contributions
est+0.036
andel+0.035
umm+0.034
 sky+0.033
aff+0.033
Neg logit contributions
 managed-0.055
 wouldn-0.051
 never-0.049
 able-0.049
 him-0.047
Token,
Token ID11
Feature activation+4.13e+00

Pos logit contributions
 managed+0.044
 wouldn+0.039
 able+0.038
 remembered+0.038
 never+0.037
Neg logit contributions
est-0.034
andel-0.034
 sky-0.033
aff-0.032
umm-0.032
Token they
Token ID484
Feature activation+6.55e+00

Pos logit contributions
 managed+0.059
pped+0.052
 wouldn+0.050
 able+0.048
 him+0.047
Neg logit contributions
 sky-0.080
est-0.080
andel-0.079
umm-0.077
 world-0.076
Token'd
Token ID1549
Feature activation+9.94e+00

Pos logit contributions
 managed+0.065
pped+0.059
 able+0.057
 paying+0.056
 preventing+0.056
Neg logit contributions
est-0.146
 sky-0.145
aff-0.140
andel-0.138
umpy-0.137
Token stick
Token ID4859
Feature activation+3.19e+00

Pos logit contributions
 managed+0.114
pped+0.110
ged+0.107
 able+0.105
 wouldn+0.102
Neg logit contributions
est-0.207
 sky-0.203
 world-0.198
umpy-0.189
aff-0.189
Token together
Token ID1978
Feature activation+4.85e+00

Pos logit contributions
 managed+0.041
 wouldn+0.034
 able+0.033
 never+0.033
 remembered+0.032
Neg logit contributions
est-0.069
andel-0.067
 sky-0.066
aff-0.066
umm-0.064
Token and
Token ID290
Feature activation+5.53e+00

Pos logit contributions
 managed+0.085
 wouldn+0.077
pped+0.075
 him+0.073
 remembered+0.073
Neg logit contributions
est-0.079
 sky-0.075
andel-0.073
 world-0.072
aff-0.072
Token always
Token ID1464
Feature activation+8.31e+00

Pos logit contributions
pped+0.062
 managed+0.060
 able+0.053
 wouldn+0.051
 gonna+0.051
Neg logit contributions
 sky-0.112
est-0.112
umm-0.103
 world-0.103
bows-0.100
Token be
Token ID307
Feature activation+4.04e+00

Pos logit contributions
pped+0.103
 him+0.087
 managed+0.085
plug+0.085
check+0.082
Neg logit contributions
est-0.174
 sky-0.155
 world-0.152
umm-0.143
bows-0.141
Token was
Token ID373
Feature activation+4.96e+00

Pos logit contributions
 managed+0.084
 able+0.075
 him+0.072
 paying+0.069
 wouldn+0.069
Neg logit contributions
est-0.157
umpy-0.151
andel-0.149
aff-0.148
 sky-0.147
Token angry
Token ID7954
Feature activation-1.19e+00

Pos logit contributions
 managed+0.080
 wouldn+0.071
 him+0.068
pped+0.064
 never+0.064
Neg logit contributions
andel-0.086
est-0.086
 sky-0.086
aff-0.083
umpy-0.082
Token,
Token ID11
Feature activation+2.65e+00

Pos logit contributions
andel+0.024
umm+0.023
aff+0.022
est+0.022
 sky+0.022
Neg logit contributions
 managed-0.018
 him-0.017
 it-0.017
 never-0.017
 not-0.017
Token but
Token ID475
Feature activation+7.28e+00

Pos logit contributions
 managed+0.041
 wouldn+0.036
 never+0.035
 able+0.034
 him+0.034
Neg logit contributions
andel-0.052
est-0.052
umm-0.051
aff-0.050
 sky-0.049
Token I
Token ID314
Feature activation+7.29e+00

Pos logit contributions
 managed+0.126
pped+0.119
ged+0.115
 able+0.115
 retrieved+0.112
Neg logit contributions
 girl-0.107
 sky-0.105
 sun-0.104
 angel-0.104
est-0.101
Token should
Token ID815
Feature activation+9.86e+00

Pos logit contributions
 managed+0.075
 able+0.066
 him+0.063
 wouldn+0.059
 going+0.058
Neg logit contributions
est-0.168
umpy-0.165
aff-0.164
andel-0.163
 sky-0.162
Token not
Token ID407
Feature activation+5.77e+00

Pos logit contributions
 managed+0.116
 able+0.095
 wouldn+0.092
ged+0.089
pped+0.084
Neg logit contributions
est-0.208
aff-0.206
 sky-0.204
umm-0.198
umpy-0.197
Token have
Token ID423
Feature activation+5.61e+00

Pos logit contributions
 managed+0.077
 wouldn+0.066
 able+0.063
 never+0.062
 him+0.062
Neg logit contributions
est-0.122
andel-0.122
umm-0.121
aff-0.119
 sky-0.117
Token done
Token ID1760
Feature activation+4.44e+00

Pos logit contributions
 wouldn+0.075
 managed+0.074
ged+0.063
 gonna+0.063
bled+0.063
Neg logit contributions
 sky-0.094
 sun-0.090
est-0.089
aff-0.086
 world-0.085
Token it
Token ID340
Feature activation+1.97e+00

Pos logit contributions
 managed+0.057
 wouldn+0.046
 able+0.042
 remembered+0.041
 freed+0.041
Neg logit contributions
est-0.092
aff-0.091
umpy-0.090
 sky-0.090
andel-0.088
Token.
Token ID13
Feature activation+6.25e-01

Pos logit contributions
 managed+0.040
 wouldn+0.036
 never+0.035
 him+0.035
 able+0.035
Neg logit contributions
est-0.029
andel-0.027
umm-0.027
aff-0.026
umpy-0.026
Token need
Token ID761
Feature activation+6.57e+00

Pos logit contributions
 managed+0.091
 able+0.081
pped+0.076
 preventing+0.076
 paying+0.076
Neg logit contributions
est-0.116
 sky-0.113
andel-0.110
umm-0.109
aff-0.108
Token it
Token ID340
Feature activation+2.60e+00

Pos logit contributions
 managed+0.091
 wouldn+0.075
 remembered+0.069
 able+0.069
ged+0.063
Neg logit contributions
 sky-0.137
est-0.131
andel-0.130
 world-0.126
 sun-0.125
Token,
Token ID11
Feature activation+5.06e+00

Pos logit contributions
 managed+0.052
 wouldn+0.045
pped+0.044
 never+0.044
 him+0.044
Neg logit contributions
est-0.039
 sky-0.035
andel-0.034
umm-0.034
aff-0.034
Token and
Token ID290
Feature activation+5.65e+00

Pos logit contributions
 managed+0.072
 wouldn+0.060
 able+0.059
 him+0.058
 never+0.057
Neg logit contributions
est-0.103
andel-0.102
umm-0.100
 sky-0.099
aff-0.098
Token we
Token ID356
Feature activation+6.89e+00

Pos logit contributions
 managed+0.068
pped+0.060
ged+0.053
 wouldn+0.053
 able+0.053
Neg logit contributions
 sky-0.118
est-0.117
umm-0.113
 world-0.111
andel-0.111
Token should
Token ID815
Feature activation+9.84e+00

Pos logit contributions
 managed+0.097
 able+0.089
 paying+0.088
 preventing+0.088
pped+0.088
Neg logit contributions
est-0.126
 sky-0.123
umpy-0.119
aff-0.117
andel-0.117
Token always
Token ID1464
Feature activation+7.97e+00

Pos logit contributions
 managed+0.127
 able+0.107
ged+0.102
 wouldn+0.102
pped+0.098
Neg logit contributions
est-0.205
 sky-0.201
umm-0.198
aff-0.197
 world-0.194
Token be
Token ID307
Feature activation+3.96e+00

Pos logit contributions
 managed+0.122
pped+0.110
 wouldn+0.109
ged+0.109
 able+0.107
Neg logit contributions
est-0.137
 world-0.132
 sky-0.129
umm-0.127
 difference-0.123
Token willing
Token ID4684
Feature activation+4.32e+00

Pos logit contributions
 wouldn+0.068
 managed+0.066
pped+0.065
 him+0.064
ged+0.061
Neg logit contributions
est-0.059
umpy-0.054
 world-0.053
 sky-0.049
 girl-0.047
Token to
Token ID284
Feature activation+6.23e+00

Pos logit contributions
 managed+0.055
 wouldn+0.046
pped+0.044
 him+0.042
 able+0.042
Neg logit contributions
est-0.094
andel-0.092
aff-0.090
 sky-0.090
umpy-0.090
Token help
Token ID1037
Feature activation+2.70e+00

Pos logit contributions
 managed+0.101
 wouldn+0.094
pped+0.085
ged+0.085
 retrieved+0.080
Neg logit contributions
est-0.102
 world-0.100
aff-0.099
 sky-0.097
andel-0.096
Token "
Token ID366
Feature activation+3.69e+00

Pos logit contributions
 managed+0.062
 wouldn+0.054
 able+0.053
 never+0.052
pped+0.052
Neg logit contributions
andel-0.056
est-0.056
 sky-0.055
aff-0.055
umm-0.053
TokenThank
Token ID10449
Feature activation+5.19e-01

Pos logit contributions
 managed+0.058
 never+0.051
 wouldn+0.049
 him+0.049
 listened+0.047
Neg logit contributions
andel-0.074
est-0.071
umm-0.070
aff-0.070
umpy-0.066
Token you
Token ID345
Feature activation+4.74e-01

Pos logit contributions
 managed+0.005
 wouldn+0.004
 never+0.004
 able+0.004
 him+0.003
Neg logit contributions
andel-0.013
est-0.013
umm-0.013
aff-0.013
 sky-0.013
Token.
Token ID13
Feature activation-8.43e-02

Pos logit contributions
 managed+0.006
 never+0.006
 him+0.006
 not+0.006
 able+0.006
Neg logit contributions
andel-0.010
umm-0.010
aff-0.010
est-0.010
 sky-0.009
Token I
Token ID314
Feature activation+7.09e+00

Pos logit contributions
andel+0.002
est+0.002
aff+0.002
umm+0.002
umpy+0.002
Neg logit contributions
 managed-0.001
 never-0.001
 it-0.001
 wouldn-0.001
 him-0.001
Token'd
Token ID1549
Feature activation+9.82e+00

Pos logit contributions
 managed+0.104
 able+0.093
 him+0.091
 wouldn+0.090
pped+0.086
Neg logit contributions
est-0.140
andel-0.132
aff-0.131
umpy-0.130
umm-0.130
Token like
Token ID588
Feature activation+5.31e+00

Pos logit contributions
 managed+0.164
 wouldn+0.149
pped+0.147
 able+0.147
 him+0.144
Neg logit contributions
est-0.169
 sky-0.156
 world-0.156
umpy-0.151
aff-0.149
Token that
Token ID326
Feature activation+3.06e-01

Pos logit contributions
 managed+0.068
 wouldn+0.055
pped+0.051
 remembered+0.049
 able+0.049
Neg logit contributions
 sky-0.118
est-0.116
 world-0.114
andel-0.113
umm-0.112
Token!"
Token ID2474
Feature activation+9.21e-01

Pos logit contributions
 managed+0.006
 wouldn+0.005
 never+0.005
 it+0.005
 him+0.005
Neg logit contributions
andel-0.005
aff-0.005
umm-0.005
est-0.005
umpy-0.005
Token
Token ID198
Feature activation-2.83e+00

Pos logit contributions
 managed+0.016
 him+0.015
 wouldn+0.014
 never+0.014
 able+0.014
Neg logit contributions
andel-0.016
umm-0.016
est-0.016
aff-0.015
umpy-0.015
Token
Token ID198
Feature activation-2.82e+00

Pos logit contributions
umm+0.057
est+0.055
andel+0.054
aff+0.049
 sky+0.049
Neg logit contributions
 managed-0.045
 him-0.043
 never-0.041
 it-0.041
 not-0.039
Token new
Token ID649
Feature activation-1.64e+00

Pos logit contributions
 managed+0.065
 wouldn+0.054
 able+0.051
 never+0.050
 him+0.049
Neg logit contributions
est-0.111
andel-0.110
aff-0.107
 sky-0.107
umm-0.106
Token things
Token ID1243
Feature activation+2.77e+00

Pos logit contributions
andel+0.030
umm+0.029
aff+0.028
est+0.028
umpy+0.027
Neg logit contributions
 managed-0.028
 him-0.025
 wouldn-0.025
 it-0.025
 never-0.025
Token,
Token ID11
Feature activation+4.26e+00

Pos logit contributions
 managed+0.050
 wouldn+0.043
 never+0.042
 able+0.042
 remembered+0.042
Neg logit contributions
est-0.047
andel-0.045
umm-0.045
 sky-0.045
aff-0.045
Token but
Token ID475
Feature activation+7.21e+00

Pos logit contributions
 managed+0.069
 wouldn+0.060
 able+0.058
 never+0.058
 him+0.058
Neg logit contributions
andel-0.080
est-0.079
umm-0.079
aff-0.076
 sky-0.075
Token we
Token ID356
Feature activation+6.58e+00

Pos logit contributions
 managed+0.096
pped+0.084
 able+0.083
ged+0.077
plug+0.075
Neg logit contributions
 sky-0.147
umm-0.146
est-0.144
 world-0.141
 sun-0.138
Token should
Token ID815
Feature activation+9.79e+00

Pos logit contributions
 managed+0.088
 able+0.079
 paying+0.075
pped+0.075
 saving+0.074
Neg logit contributions
 sky-0.129
est-0.128
andel-0.126
aff-0.125
umpy-0.123
Token always
Token ID1464
Feature activation+8.06e+00

Pos logit contributions
 managed+0.114
 able+0.094
 wouldn+0.091
pped+0.084
ged+0.084
Neg logit contributions
est-0.220
 sky-0.215
aff-0.214
umm-0.214
andel-0.211
Token be
Token ID307
Feature activation+4.04e+00

Pos logit contributions
 managed+0.121
 wouldn+0.108
pped+0.108
 able+0.106
ged+0.105
Neg logit contributions
est-0.145
 world-0.138
 sky-0.135
umm-0.133
aff-0.131
Token careful
Token ID8161
Feature activation-5.61e-03

Pos logit contributions
 wouldn+0.072
 him+0.070
 managed+0.068
pped+0.068
ged+0.065
Neg logit contributions
est-0.058
umpy-0.054
 world-0.052
 sky-0.049
 sun-0.048
Token,
Token ID11
Feature activation+3.86e+00

Pos logit contributions
andel+0.000
est+0.000
umm+0.000
aff+0.000
 sky+0.000
Neg logit contributions
 managed-0.000
 wouldn-0.000
 never-0.000
 able-0.000
 him-0.000
Token okay
Token ID8788
Feature activation+1.70e+00

Pos logit contributions
 managed+0.059
 able+0.049
pped+0.049
 him+0.048
 wouldn+0.048
Neg logit contributions
est-0.073
 sky-0.072
andel-0.072
umm-0.071
 world-0.069
Token strawberries
Token ID36973
Feature activation-6.04e-01

Pos logit contributions
umm+0.034
andel+0.033
aff+0.033
est+0.031
umpy+0.031
Neg logit contributions
 managed-0.037
 him-0.036
 it-0.035
 wouldn-0.034
 going-0.034
Token and
Token ID290
Feature activation+5.13e+00

Pos logit contributions
est+0.011
umm+0.010
andel+0.010
 sky+0.010
aff+0.010
Neg logit contributions
 managed-0.010
 wouldn-0.009
 remembered-0.009
 never-0.009
 him-0.008
Token I
Token ID314
Feature activation+6.63e+00

Pos logit contributions
 managed+0.062
 wouldn+0.052
pped+0.052
 able+0.051
 him+0.048
Neg logit contributions
est-0.110
 sky-0.109
umm-0.107
aff-0.105
andel-0.104
Token thought
Token ID1807
Feature activation+4.72e+00

Pos logit contributions
 managed+0.089
 able+0.080
 him+0.079
 wouldn+0.077
 saving+0.076
Neg logit contributions
est-0.134
andel-0.129
umm-0.128
aff-0.127
umpy-0.127
Token I
Token ID314
Feature activation+6.98e+00

Pos logit contributions
 managed+0.044
 wouldn+0.033
 able+0.031
 remembered+0.031
 never+0.030
Neg logit contributions
est-0.120
andel-0.118
aff-0.117
 sky-0.115
umm-0.115
Token'd
Token ID1549
Feature activation+9.78e+00

Pos logit contributions
 able+0.082
 managed+0.080
 preventing+0.077
 saving+0.077
 paying+0.076
Neg logit contributions
est-0.140
umpy-0.131
 sky-0.130
aff-0.130
 world-0.126
Token surprise
Token ID5975
Feature activation+2.98e+00

Pos logit contributions
 managed+0.157
 wouldn+0.147
pped+0.144
 able+0.140
 him+0.140
Neg logit contributions
est-0.172
 world-0.159
 sky-0.157
aff-0.156
andel-0.152
Token you
Token ID345
Feature activation+2.47e+00

Pos logit contributions
 managed+0.033
 wouldn+0.026
 able+0.025
 never+0.025
 remembered+0.024
Neg logit contributions
est-0.071
andel-0.071
umm-0.070
aff-0.069
 sky-0.067
Token!"
Token ID2474
Feature activation+4.01e-01

Pos logit contributions
 managed+0.047
 wouldn+0.041
 able+0.041
 never+0.040
 him+0.040
Neg logit contributions
est-0.040
andel-0.037
aff-0.037
umm-0.037
 sky-0.036
Token The
Token ID383
Feature activation-8.25e+00

Pos logit contributions
 managed+0.007
 him+0.006
 never+0.006
 it+0.006
 wouldn+0.006
Neg logit contributions
andel-0.007
umm-0.007
est-0.007
aff-0.007
umpy-0.007
Token bear
Token ID6842
Feature activation+2.62e+00

Pos logit contributions
andel+0.118
umm+0.117
aff+0.108
est+0.108
umpy+0.103
Neg logit contributions
 managed-0.170
 remembered-0.159
 never-0.157
 tried-0.156
 it-0.156
Token She
Token ID1375
Feature activation+5.42e+00

Pos logit contributions
 managed+0.002
 him+0.001
 it+0.001
 never+0.001
 wouldn+0.001
Neg logit contributions
andel-0.002
est-0.002
umm-0.002
aff-0.002
umpy-0.002
Token looked
Token ID3114
Feature activation+5.32e+00

Pos logit contributions
 managed+0.062
 him+0.056
 able+0.056
 saving+0.055
pped+0.054
Neg logit contributions
 sky-0.116
est-0.115
aff-0.112
andel-0.110
 world-0.107
Token better
Token ID1365
Feature activation+7.27e-01

Pos logit contributions
 managed+0.091
 wouldn+0.080
 never+0.079
 able+0.077
 him+0.077
Neg logit contributions
andel-0.093
est-0.093
 sky-0.089
umm-0.089
aff-0.088
Token than
Token ID621
Feature activation+6.02e+00

Pos logit contributions
 managed+0.014
 wouldn+0.012
pped+0.012
 able+0.012
 retrieved+0.012
Neg logit contributions
 sky-0.011
est-0.011
 world-0.010
andel-0.010
aff-0.010
Token they
Token ID484
Feature activation+4.98e+00

Pos logit contributions
 managed+0.078
 wouldn+0.064
 able+0.062
pped+0.060
 remembered+0.059
Neg logit contributions
andel-0.127
est-0.126
 sky-0.126
aff-0.126
umm-0.124
Token'd
Token ID1549
Feature activation+9.78e+00

Pos logit contributions
 managed+0.059
 able+0.055
plug+0.054
pped+0.054
 preventing+0.052
Neg logit contributions
 sky-0.101
est-0.100
 world-0.097
aff-0.095
andel-0.092
Token ever
Token ID1683
Feature activation+4.81e+00

Pos logit contributions
 managed+0.108
 wouldn+0.099
plug+0.097
 able+0.096
pped+0.095
Neg logit contributions
est-0.211
 sky-0.204
 world-0.200
aff-0.199
umpy-0.194
Token seen
Token ID1775
Feature activation+4.68e+00

Pos logit contributions
 managed+0.053
 wouldn+0.047
 him+0.044
pped+0.043
 able+0.042
Neg logit contributions
est-0.110
 sky-0.105
aff-0.105
andel-0.104
umm-0.103
Token before
Token ID878
Feature activation+4.37e+00

Pos logit contributions
 managed+0.080
 wouldn+0.072
 him+0.071
 never+0.071
 able+0.070
Neg logit contributions
andel-0.084
umm-0.082
est-0.081
aff-0.079
umpy-0.077
Token.
Token ID13
Feature activation+3.87e-01

Pos logit contributions
 managed+0.089
 wouldn+0.079
 able+0.076
 never+0.075
 remembered+0.074
Neg logit contributions
 sky-0.064
andel-0.062
est-0.062
umm-0.061
aff-0.059
Token Daisy
Token ID40355
Feature activation+3.17e+00

Pos logit contributions
 managed+0.006
 him+0.006
 it+0.006
 never+0.006
 wouldn+0.005
Neg logit contributions
andel-0.007
est-0.007
umm-0.007
aff-0.007
umpy-0.007
Token his
Token ID465
Feature activation-2.89e+00

Pos logit contributions
 managed+0.045
 wouldn+0.038
 never+0.036
 able+0.036
 him+0.035
Neg logit contributions
andel-0.084
est-0.084
umm-0.082
aff-0.081
 sky-0.081
Token heart
Token ID2612
Feature activation+1.15e-01

Pos logit contributions
andel+0.044
aff+0.043
umm+0.043
est+0.040
umpy+0.040
Neg logit contributions
 managed-0.059
 never-0.055
 remembered-0.053
 him-0.053
 it-0.053
Token,
Token ID11
Feature activation+4.11e+00

Pos logit contributions
 managed+0.002
 wouldn+0.002
 never+0.002
 able+0.002
 him+0.002
Neg logit contributions
andel-0.002
umm-0.002
est-0.002
aff-0.002
 sky-0.002
Token knowing
Token ID6970
Feature activation+4.73e+00

Pos logit contributions
 managed+0.074
pped+0.065
 wouldn+0.065
 him+0.062
 able+0.061
Neg logit contributions
 sky-0.064
est-0.061
andel-0.059
 world-0.058
aff-0.058
Token he
Token ID339
Feature activation+4.90e+00

Pos logit contributions
 managed+0.072
pped+0.062
plug+0.059
 retrieved+0.058
 wouldn+0.058
Neg logit contributions
 sky-0.086
est-0.084
 world-0.080
andel-0.078
 sun-0.078
Token'd
Token ID1549
Feature activation+9.74e+00

Pos logit contributions
 paying+0.064
 able+0.063
 spilling+0.062
 preventing+0.061
 managed+0.061
Neg logit contributions
est-0.096
 world-0.088
 sky-0.088
umpy-0.077
 girl-0.074
Token never
Token ID1239
Feature activation+6.52e+00

Pos logit contributions
ged+0.110
 managed+0.106
plug+0.105
pped+0.105
bled+0.103
Neg logit contributions
 sky-0.199
est-0.198
 world-0.197
 universe-0.176
 difference-0.175
Token see
Token ID766
Feature activation+4.32e+00

Pos logit contributions
 managed+0.074
 wouldn+0.067
 him+0.065
pped+0.063
plug+0.061
Neg logit contributions
est-0.145
 sky-0.138
umm-0.134
andel-0.133
 world-0.132
Token him
Token ID683
Feature activation+4.06e-01

Pos logit contributions
 managed+0.048
 wouldn+0.039
 never+0.037
 able+0.037
 remembered+0.035
Neg logit contributions
est-0.103
andel-0.102
umm-0.101
 sky-0.099
aff-0.098
Token again
Token ID757
Feature activation+2.71e+00

Pos logit contributions
 managed+0.006
 wouldn+0.005
 never+0.005
 able+0.005
 him+0.005
Neg logit contributions
andel-0.008
umm-0.008
est-0.008
aff-0.008
 sky-0.008
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
 managed+0.054
 wouldn+0.049
 never+0.047
 able+0.047
 him+0.046
Neg logit contributions
est-0.040
andel-0.040
umm-0.039
aff-0.038
 sky-0.038
Token He
Token ID679
Feature activation+6.71e+00

Pos logit contributions
 managed+0.006
 him+0.006
 never+0.006
 wouldn+0.005
 it+0.005
Neg logit contributions
andel-0.006
umm-0.006
est-0.006
aff-0.006
umpy-0.006
Token couldn
Token ID3521
Feature activation+6.10e+00

Pos logit contributions
 managed+0.076
 able+0.068
 paying+0.064
pped+0.064
 him+0.063
Neg logit contributions
est-0.146
 sky-0.143
aff-0.139
andel-0.136
umpy-0.133
Token't
Token ID470
Feature activation+4.61e+00

Pos logit contributions
 managed+0.085
 wouldn+0.072
 able+0.072
 remembered+0.069
 never+0.066
Neg logit contributions
est-0.126
andel-0.124
 sky-0.123
umm-0.121
aff-0.121
Token believe
Token ID1975
Feature activation+2.55e+00

Pos logit contributions
 managed+0.064
 wouldn+0.054
 never+0.054
 him+0.053
 able+0.052
Neg logit contributions
andel-0.099
umm-0.097
est-0.097
aff-0.095
 sky-0.093
Token he
Token ID339
Feature activation+4.19e+00

Pos logit contributions
 managed+0.025
 freed+0.017
 wouldn+0.017
 swallowed+0.016
 retrieved+0.016
Neg logit contributions
est-0.065
aff-0.063
andel-0.063
 sky-0.061
umm-0.061
Token'd
Token ID1549
Feature activation+9.73e+00

Pos logit contributions
 managed+0.048
 able+0.042
 saving+0.042
 paying+0.041
 preventing+0.040
Neg logit contributions
est-0.092
 sky-0.087
 world-0.086
aff-0.082
umm-0.081
Token been
Token ID587
Feature activation+5.14e+00

Pos logit contributions
 managed+0.097
plug+0.096
 wouldn+0.094
 able+0.092
 saving+0.092
Neg logit contributions
est-0.212
 sky-0.202
 world-0.199
umpy-0.191
andel-0.185
Token so
Token ID523
Feature activation+3.19e+00

Pos logit contributions
 managed+0.063
 wouldn+0.061
 him+0.056
pped+0.052
check+0.052
Neg logit contributions
est-0.107
 sky-0.101
andel-0.100
 world-0.099
umpy-0.098
Token adventurous
Token ID39770
Feature activation-1.53e+00

Pos logit contributions
 managed+0.058
 wouldn+0.052
 never+0.050
 him+0.050
 able+0.050
Neg logit contributions
andel-0.053
est-0.052
umm-0.052
aff-0.050
 sky-0.049
Token.
Token ID13
Feature activation-9.08e-02

Pos logit contributions
andel+0.030
umm+0.029
est+0.029
aff+0.028
 sky+0.027
Neg logit contributions
 managed-0.024
 wouldn-0.021
 never-0.021
 him-0.021
 able-0.021
Token
Token ID220
Feature activation-1.39e+00

Pos logit contributions
andel+0.002
est+0.002
umm+0.002
aff+0.002
umpy+0.002
Neg logit contributions
 managed-0.001
 him-0.001
 never-0.001
 it-0.001
 wouldn-0.001
Token,
Token ID11
Feature activation+2.17e+00

Pos logit contributions
andel+0.028
umm+0.027
aff+0.026
est+0.025
 sky+0.025
Neg logit contributions
 managed-0.017
 him-0.016
 wouldn-0.016
 able-0.015
 never-0.015
Token "
Token ID366
Feature activation+3.28e+00

Pos logit contributions
 managed+0.031
 wouldn+0.026
 able+0.025
 never+0.025
 him+0.024
Neg logit contributions
est-0.045
andel-0.045
umm-0.044
 sky-0.043
aff-0.043
TokenI
Token ID40
Feature activation+5.55e+00

Pos logit contributions
 managed+0.052
 never+0.045
 wouldn+0.045
 him+0.043
 listened+0.042
Neg logit contributions
andel-0.065
est-0.063
umm-0.063
aff-0.062
 sky-0.059
Token knew
Token ID2993
Feature activation+6.80e+00

Pos logit contributions
 managed+0.078
 able+0.070
 him+0.068
 wouldn+0.067
pped+0.066
Neg logit contributions
est-0.111
umpy-0.104
andel-0.104
 sky-0.101
umm-0.101
Token you
Token ID345
Feature activation+3.55e+00

Pos logit contributions
 managed+0.078
 wouldn+0.059
 able+0.058
pped+0.058
 retrieved+0.057
Neg logit contributions
est-0.157
 sky-0.150
andel-0.149
umm-0.148
aff-0.145
Token'd
Token ID1549
Feature activation+9.71e+00

Pos logit contributions
 managed+0.042
pped+0.036
 able+0.035
 preventing+0.035
plug+0.035
Neg logit contributions
est-0.078
 sky-0.073
aff-0.070
umpy-0.070
 world-0.070
Token like
Token ID588
Feature activation+5.95e+00

Pos logit contributions
 managed+0.141
 wouldn+0.129
pped+0.127
 able+0.124
ged+0.124
Neg logit contributions
est-0.187
 world-0.178
 sky-0.175
umpy-0.168
aff-0.165
Token it
Token ID340
Feature activation+2.39e+00

Pos logit contributions
 managed+0.079
 wouldn+0.064
pped+0.059
 remembered+0.058
 able+0.058
Neg logit contributions
 sky-0.130
est-0.128
 world-0.126
andel-0.125
aff-0.123
Token!"
Token ID2474
Feature activation+1.08e+00

Pos logit contributions
 managed+0.041
 wouldn+0.037
 never+0.036
 him+0.036
 able+0.036
Neg logit contributions
andel-0.043
umm-0.042
est-0.041
aff-0.041
 sky-0.039
Token Emily
Token ID17608
Feature activation+3.25e+00

Pos logit contributions
 managed+0.018
 him+0.016
 wouldn+0.016
 never+0.016
 able+0.015
Neg logit contributions
andel-0.020
umm-0.020
est-0.020
aff-0.019
umpy-0.019
Token was
Token ID373
Feature activation+3.90e+00

Pos logit contributions
 managed+0.043
 wouldn+0.036
 able+0.035
 him+0.035
 never+0.035
Neg logit contributions
est-0.070
andel-0.068
aff-0.067
 sky-0.066
umm-0.066
Token us
Token ID514
Feature activation+1.23e+00

Pos logit contributions
 managed+0.059
 wouldn+0.050
 able+0.049
 never+0.048
 remembered+0.047
Neg logit contributions
andel-0.076
est-0.076
umm-0.073
aff-0.073
 sky-0.072
Token freeze
Token ID16611
Feature activation+1.31e+00

Pos logit contributions
 managed+0.020
 wouldn+0.018
 never+0.017
 able+0.017
 him+0.017
Neg logit contributions
est-0.022
andel-0.022
umm-0.022
aff-0.021
 sky-0.021
Token,
Token ID11
Feature activation+5.01e+00

Pos logit contributions
 managed+0.024
 wouldn+0.021
 never+0.021
 able+0.021
 him+0.020
Neg logit contributions
est-0.022
andel-0.022
umm-0.021
aff-0.021
 sky-0.020
Token but
Token ID475
Feature activation+6.69e+00

Pos logit contributions
 managed+0.089
 able+0.076
pped+0.076
 wouldn+0.074
 retrieved+0.074
Neg logit contributions
est-0.083
andel-0.081
 sky-0.080
umm-0.077
 world-0.076
Token we
Token ID356
Feature activation+6.79e+00

Pos logit contributions
 managed+0.081
pped+0.071
 able+0.067
 freed+0.063
 retrieved+0.062
Neg logit contributions
 sky-0.147
umm-0.140
 world-0.139
est-0.139
 sun-0.137
Token should
Token ID815
Feature activation+9.71e+00

Pos logit contributions
 managed+0.097
 able+0.087
pped+0.084
 paying+0.081
 preventing+0.080
Neg logit contributions
 sky-0.127
est-0.126
aff-0.125
andel-0.124
umpy-0.123
Token always
Token ID1464
Feature activation+8.17e+00

Pos logit contributions
 managed+0.118
 able+0.098
 wouldn+0.094
pped+0.086
 freed+0.085
Neg logit contributions
est-0.216
aff-0.211
andel-0.211
 sky-0.211
umm-0.208
Token be
Token ID307
Feature activation+3.74e+00

Pos logit contributions
 managed+0.123
 wouldn+0.109
 able+0.107
pped+0.105
ged+0.101
Neg logit contributions
est-0.149
 sky-0.141
 world-0.141
aff-0.138
andel-0.136
Token careful
Token ID8161
Feature activation+3.50e-02

Pos logit contributions
 wouldn+0.064
 managed+0.062
 him+0.059
pped+0.058
ged+0.055
Neg logit contributions
est-0.057
umpy-0.056
 world-0.051
 sky-0.049
 sun-0.049
Token and
Token ID290
Feature activation+3.92e+00

Pos logit contributions
 managed+0.001
 wouldn+0.001
 never+0.001
 able+0.001
 him+0.001
Neg logit contributions
est-0.001
andel-0.001
umm-0.001
aff-0.001
 sky-0.001
Token not
Token ID407
Feature activation+5.50e+00

Pos logit contributions
pped+0.062
 managed+0.061
 him+0.052
ged+0.052
 wouldn+0.051
Neg logit contributions
umpy-0.061
 world-0.058
 sun-0.057
 sky-0.057
est-0.055
Token She
Token ID1375
Feature activation+4.84e+00

Pos logit contributions
est+0.004
andel+0.004
umm+0.004
aff+0.004
umpy+0.004
Neg logit contributions
 managed-0.005
 wouldn-0.004
 him-0.004
 never-0.004
 it-0.004
Token asked
Token ID1965
Feature activation+5.12e+00

Pos logit contributions
 managed+0.055
 saving+0.050
 able+0.050
 him+0.049
 preventing+0.048
Neg logit contributions
 sky-0.103
est-0.102
andel-0.100
aff-0.099
 world-0.094
Token Joe
Token ID5689
Feature activation+1.23e+00

Pos logit contributions
 managed+0.054
 wouldn+0.040
 able+0.040
 never+0.039
 remembered+0.036
Neg logit contributions
est-0.127
andel-0.126
aff-0.121
 sky-0.119
umm-0.119
Token,
Token ID11
Feature activation+4.03e+00

Pos logit contributions
 managed+0.015
 wouldn+0.012
 never+0.012
 able+0.012
 him+0.012
Neg logit contributions
est-0.028
andel-0.028
aff-0.027
umm-0.027
 sky-0.026
Token "
Token ID366
Feature activation+3.56e+00

Pos logit contributions
 managed+0.074
 wouldn+0.065
 never+0.063
 able+0.063
pped+0.061
Neg logit contributions
andel-0.065
est-0.064
 sky-0.063
aff-0.061
umm-0.060
TokenShould
Token ID19926
Feature activation+9.70e+00

Pos logit contributions
 managed+0.050
 wouldn+0.043
 never+0.042
 him+0.042
 able+0.041
Neg logit contributions
andel-0.075
est-0.074
umm-0.074
aff-0.072
 sky-0.070
Token we
Token ID356
Feature activation+5.95e+00

Pos logit contributions
 managed+0.139
 able+0.121
pped+0.115
 paying+0.111
ged+0.111
Neg logit contributions
est-0.190
 sky-0.186
andel-0.178
umm-0.177
aff-0.175
Token take
Token ID1011
Feature activation+2.93e+00

Pos logit contributions
 managed+0.078
 wouldn+0.066
 able+0.065
 him+0.063
 never+0.062
Neg logit contributions
andel-0.127
est-0.125
umm-0.123
aff-0.122
 sky-0.122
Token a
Token ID257
Feature activation-4.70e+00

Pos logit contributions
 managed+0.039
 wouldn+0.033
 able+0.030
 remembered+0.030
 never+0.029
Neg logit contributions
andel-0.064
est-0.064
 sky-0.062
aff-0.060
umm-0.059
Token break
Token ID2270
Feature activation-8.76e-01

Pos logit contributions
umm+0.101
est+0.095
aff+0.094
andel+0.092
umpy+0.092
Neg logit contributions
 managed-0.069
 it-0.061
 wouldn-0.060
 able-0.059
 never-0.059
Token?"
Token ID1701
Feature activation-1.38e-01

Pos logit contributions
andel+0.018
umm+0.018
est+0.017
aff+0.017
 sky+0.016
Neg logit contributions
 managed-0.013
 wouldn-0.011
 never-0.011
 him-0.011
 able-0.011
Token things
Token ID1243
Feature activation+4.26e+00

Pos logit contributions
est+0.010
umm+0.010
 sky+0.009
andel+0.009
aff+0.009
Neg logit contributions
 managed-0.010
 able-0.008
 wouldn-0.008
 never-0.008
pped-0.008
Token happen
Token ID1645
Feature activation+3.03e+00

Pos logit contributions
 managed+0.064
 able+0.056
pped+0.053
 remembered+0.052
 paying+0.051
Neg logit contributions
est-0.079
 sky-0.079
aff-0.076
umpy-0.075
 world-0.074
Token,
Token ID11
Feature activation+4.89e+00

Pos logit contributions
 managed+0.058
 able+0.050
 wouldn+0.050
pped+0.049
 paying+0.048
Neg logit contributions
est-0.049
 sky-0.045
 world-0.044
umm-0.044
umpy-0.044
Token but
Token ID475
Feature activation+6.64e+00

Pos logit contributions
 managed+0.079
 wouldn+0.068
 able+0.067
 never+0.065
 him+0.065
Neg logit contributions
est-0.090
andel-0.089
umm-0.088
 sky-0.087
aff-0.085
Token we
Token ID356
Feature activation+6.66e+00

Pos logit contributions
 managed+0.078
pped+0.068
 able+0.063
 retrieved+0.060
ged+0.059
Neg logit contributions
 sky-0.148
est-0.142
umm-0.142
 world-0.140
 sun-0.137
Token should
Token ID815
Feature activation+9.69e+00

Pos logit contributions
 managed+0.093
 able+0.082
pped+0.078
 wouldn+0.077
 paying+0.076
Neg logit contributions
est-0.129
 sky-0.129
andel-0.127
aff-0.126
umpy-0.124
Token still
Token ID991
Feature activation+8.05e+00

Pos logit contributions
 managed+0.114
 able+0.093
 wouldn+0.090
pped+0.083
ged+0.081
Neg logit contributions
est-0.217
aff-0.213
 sky-0.213
andel-0.211
umm-0.210
Token keep
Token ID1394
Feature activation+4.32e+00

Pos logit contributions
 managed+0.110
 wouldn+0.094
 able+0.092
 him+0.087
pped+0.085
Neg logit contributions
est-0.167
andel-0.164
umm-0.163
aff-0.162
 sky-0.161
Token trying
Token ID2111
Feature activation+4.59e+00

Pos logit contributions
 managed+0.059
 wouldn+0.047
 retrieved+0.042
 remembered+0.042
 able+0.040
Neg logit contributions
 sky-0.087
est-0.087
 world-0.086
 sun-0.083
andel-0.082
Token and
Token ID290
Feature activation+5.18e+00

Pos logit contributions
 managed+0.080
 wouldn+0.069
 able+0.066
 remembered+0.065
 never+0.065
Neg logit contributions
est-0.080
andel-0.077
 sky-0.075
aff-0.075
umm-0.073
Token working
Token ID1762
Feature activation+2.65e+00

Pos logit contributions
 managed+0.067
pped+0.060
 wouldn+0.057
ged+0.056
plug+0.055
Neg logit contributions
 sky-0.094
 world-0.090
est-0.087
 difference-0.086
 sun-0.086
Token world
Token ID995
Feature activation+1.01e+00

Pos logit contributions
andel+0.125
est+0.122
aff+0.119
umm+0.119
umpy+0.111
Neg logit contributions
 managed-0.168
 never-0.167
 going-0.164
 him-0.161
 it-0.161
Token he
Token ID339
Feature activation+3.80e+00

Pos logit contributions
 managed+0.017
 wouldn+0.015
 able+0.015
 never+0.015
 remembered+0.015
Neg logit contributions
est-0.018
 sky-0.017
andel-0.017
umm-0.017
aff-0.017
Token promised
Token ID8072
Feature activation+6.01e+00

Pos logit contributions
 managed+0.042
 able+0.042
 preventing+0.040
 saving+0.039
 paying+0.039
Neg logit contributions
 sky-0.082
est-0.079
 world-0.077
umpy-0.071
aff-0.069
Token himself
Token ID2241
Feature activation+2.73e+00

Pos logit contributions
 managed+0.090
 wouldn+0.074
 able+0.074
 remembered+0.073
pped+0.070
Neg logit contributions
est-0.119
 sky-0.113
andel-0.113
aff-0.113
 world-0.111
Token he
Token ID339
Feature activation+5.35e+00

Pos logit contributions
 managed+0.040
 freed+0.033
pped+0.033
plug+0.033
 swept+0.032
Neg logit contributions
est-0.055
 sky-0.050
 world-0.049
aff-0.049
andel-0.049
Token'd
Token ID1549
Feature activation+9.69e+00

Pos logit contributions
 able+0.061
 managed+0.061
pped+0.061
 preventing+0.061
plug+0.060
Neg logit contributions
est-0.114
 world-0.101
 sky-0.100
umpy-0.099
affe-0.090
Token come
Token ID1282
Feature activation+3.73e+00

Pos logit contributions
 managed+0.110
ged+0.109
plug+0.106
 wouldn+0.106
 able+0.104
Neg logit contributions
est-0.195
 sky-0.193
 world-0.191
umpy-0.174
 sun-0.171
Token back
Token ID736
Feature activation+1.87e+00

Pos logit contributions
 managed+0.060
 wouldn+0.054
 never+0.049
 able+0.048
 listened+0.048
Neg logit contributions
est-0.070
andel-0.068
 sky-0.067
aff-0.066
 world-0.065
Token soon
Token ID2582
Feature activation+3.49e+00

Pos logit contributions
 managed+0.027
 wouldn+0.024
 never+0.023
 able+0.023
 him+0.022
Neg logit contributions
andel-0.038
est-0.038
umm-0.037
aff-0.036
 sky-0.036
Token.
Token ID13
Feature activation-5.97e-01

Pos logit contributions
 managed+0.073
 wouldn+0.066
 never+0.064
 able+0.064
 him+0.063
Neg logit contributions
est-0.048
andel-0.046
umm-0.046
 sky-0.045
aff-0.045
Token<|endoftext|>
Token ID50256
Feature activation-5.64e+00

Pos logit contributions
andel+0.011
est+0.011
umm+0.011
aff+0.010
umpy+0.010
Neg logit contributions
 managed-0.010
 him-0.009
 never-0.009
 wouldn-0.009
 it-0.009
Token at
Token ID379
Feature activation+3.08e+00

Pos logit contributions
andel+0.023
umm+0.022
est+0.022
aff+0.021
 sky+0.021
Neg logit contributions
 managed-0.013
 wouldn-0.011
 him-0.011
 never-0.011
 able-0.011
Token you
Token ID345
Feature activation+1.80e+00

Pos logit contributions
 managed+0.044
 wouldn+0.035
 able+0.033
 remembered+0.032
 freed+0.030
Neg logit contributions
aff-0.066
andel-0.066
 sky-0.064
umm-0.064
est-0.063
Token,
Token ID11
Feature activation+2.56e+00

Pos logit contributions
 managed+0.029
 never+0.028
 him+0.027
 wouldn+0.027
 not+0.027
Neg logit contributions
umm-0.034
andel-0.034
est-0.032
aff-0.031
 sky-0.031
Token but
Token ID475
Feature activation+6.67e+00

Pos logit contributions
 managed+0.038
 wouldn+0.033
 never+0.032
 able+0.032
 him+0.032
Neg logit contributions
andel-0.052
est-0.051
umm-0.051
aff-0.050
 sky-0.049
Token I
Token ID314
Feature activation+7.10e+00

Pos logit contributions
 managed+0.105
pped+0.094
 able+0.094
 retrieved+0.090
ged+0.090
Neg logit contributions
 sky-0.111
est-0.110
 sun-0.109
 girl-0.109
umm-0.109
Token should
Token ID815
Feature activation+9.68e+00

Pos logit contributions
 managed+0.076
 able+0.069
 him+0.065
pped+0.062
 going+0.061
Neg logit contributions
est-0.157
umpy-0.157
aff-0.154
 sky-0.151
andel-0.151
Token not
Token ID407
Feature activation+5.60e+00

Pos logit contributions
 managed+0.095
 able+0.074
 wouldn+0.073
ged+0.069
pped+0.063
Neg logit contributions
est-0.225
aff-0.225
 sky-0.220
umm-0.216
umpy-0.214
Token have
Token ID423
Feature activation+5.35e+00

Pos logit contributions
 managed+0.073
 wouldn+0.063
 able+0.060
 never+0.058
 him+0.058
Neg logit contributions
est-0.121
andel-0.120
umm-0.119
aff-0.118
 sky-0.115
Token pushed
Token ID7121
Feature activation+2.46e+00

Pos logit contributions
 wouldn+0.072
 managed+0.072
ged+0.062
pped+0.061
bled+0.060
Neg logit contributions
 sky-0.090
 sun-0.087
est-0.086
aff-0.083
 world-0.081
Token your
Token ID534
Feature activation-5.87e+00

Pos logit contributions
 managed+0.024
 wouldn+0.019
 never+0.018
 able+0.018
 him+0.017
Neg logit contributions
andel-0.062
est-0.062
umm-0.061
aff-0.061
 sky-0.060
Token blocks
Token ID7021
Feature activation-1.23e+00

Pos logit contributions
andel+0.093
aff+0.090
umm+0.087
est+0.082
umpy+0.080
Neg logit contributions
 managed-0.116
 not-0.110
 never-0.109
 wouldn-0.106
 it-0.106
Token get
Token ID651
Feature activation+3.07e+00

Pos logit contributions
 managed+0.066
pped+0.064
ged+0.057
 retrieved+0.053
plug+0.052
Neg logit contributions
 sky-0.091
 world-0.089
est-0.086
 sun-0.084
umm-0.084
Token hurt
Token ID5938
Feature activation+5.08e-01

Pos logit contributions
 managed+0.047
 wouldn+0.039
zed+0.038
pped+0.037
 never+0.037
Neg logit contributions
andel-0.053
est-0.053
 sky-0.052
umpy-0.052
 world-0.050
Token,
Token ID11
Feature activation+4.43e+00

Pos logit contributions
 managed+0.009
 wouldn+0.008
 never+0.008
 able+0.008
 him+0.008
Neg logit contributions
andel-0.008
est-0.008
umm-0.008
aff-0.008
 sky-0.008
Token but
Token ID475
Feature activation+5.72e+00

Pos logit contributions
 managed+0.085
pped+0.075
 able+0.073
 wouldn+0.072
 remembered+0.072
Neg logit contributions
est-0.067
 sky-0.065
andel-0.065
umm-0.064
 world-0.063
Token we
Token ID356
Feature activation+5.72e+00

Pos logit contributions
 managed+0.061
pped+0.050
 able+0.047
 retrieved+0.043
 freed+0.043
Neg logit contributions
 sky-0.136
umm-0.134
 world-0.133
est-0.133
andel-0.127
Token should
Token ID815
Feature activation+9.68e+00

Pos logit contributions
 managed+0.084
 able+0.075
pped+0.075
 paying+0.074
 preventing+0.072
Neg logit contributions
 sky-0.105
est-0.101
umpy-0.100
aff-0.099
 world-0.098
Token always
Token ID1464
Feature activation+7.50e+00

Pos logit contributions
 managed+0.125
 able+0.106
 wouldn+0.100
ged+0.096
pped+0.095
Neg logit contributions
est-0.206
 sky-0.200
aff-0.198
umm-0.198
andel-0.196
Token ask
Token ID1265
Feature activation+3.79e+00

Pos logit contributions
 managed+0.115
 wouldn+0.101
 able+0.101
pped+0.100
ged+0.098
Neg logit contributions
est-0.135
 world-0.129
 sky-0.126
umm-0.124
 difference-0.121
Token for
Token ID329
Feature activation+5.26e+00

Pos logit contributions
 managed+0.055
 wouldn+0.044
 able+0.042
pped+0.041
 remembered+0.041
Neg logit contributions
andel-0.079
est-0.076
aff-0.075
umm-0.074
 sky-0.074
Token help
Token ID1037
Feature activation+9.48e-01

Pos logit contributions
 managed+0.097
 wouldn+0.090
 retrieved+0.082
pped+0.079
ged+0.076
Neg logit contributions
 sky-0.075
 world-0.074
 sun-0.073
andel-0.071
est-0.071
Token when
Token ID618
Feature activation+2.95e+00

Pos logit contributions
 managed+0.019
 wouldn+0.017
 remembered+0.016
 able+0.016
 retrieved+0.016
Neg logit contributions
est-0.014
andel-0.014
aff-0.013
 sky-0.013
umm-0.013
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
 managed+0.028
 never+0.026
 wouldn+0.026
 him+0.025
 able+0.025
Neg logit contributions
andel-0.029
umm-0.029
est-0.027
aff-0.027
 sky-0.026
Token little
Token ID1310
Feature activation-3.92e+00

Pos logit contributions
 managed+0.056
 wouldn+0.049
 never+0.049
 him+0.048
 able+0.047
Neg logit contributions
andel-0.072
est-0.071
umm-0.071
aff-0.070
 sky-0.067
Token worm
Token ID23849
Feature activation-1.42e+00

Pos logit contributions
andel+0.049
aff+0.047
umm+0.045
est+0.043
umpy+0.043
Neg logit contributions
 managed-0.089
 him-0.083
 it-0.082
 wouldn-0.082
 able-0.080
Token.
Token ID13
Feature activation+6.21e-01

Pos logit contributions
andel+0.024
umm+0.023
est+0.022
aff+0.022
 sky+0.022
Neg logit contributions
 managed-0.027
 wouldn-0.024
 never-0.024
 able-0.024
 him-0.023
Token Friends
Token ID14213
Feature activation+5.31e+00

Pos logit contributions
 managed+0.010
 never+0.009
 wouldn+0.009
 him+0.008
 it+0.008
Neg logit contributions
andel-0.012
est-0.012
umm-0.012
aff-0.012
 sky-0.011
Token should
Token ID815
Feature activation+9.66e+00

Pos logit contributions
 managed+0.079
 able+0.070
 wouldn+0.065
 paying+0.063
pped+0.063
Neg logit contributions
est-0.101
aff-0.097
 sky-0.095
andel-0.094
 world-0.094
Token never
Token ID1239
Feature activation+5.39e+00

Pos logit contributions
 managed+0.131
 able+0.112
 wouldn+0.108
pped+0.101
ged+0.099
Neg logit contributions
est-0.201
aff-0.197
 sky-0.193
umm-0.193
andel-0.192
Token hurt
Token ID5938
Feature activation+1.61e+00

Pos logit contributions
 managed+0.071
 wouldn+0.060
 never+0.059
 him+0.058
 able+0.057
Neg logit contributions
andel-0.120
umm-0.118
est-0.117
aff-0.114
 sky-0.112
Token each
Token ID1123
Feature activation-1.91e+00

Pos logit contributions
 managed+0.017
 wouldn+0.014
 never+0.013
 able+0.013
 him+0.013
Neg logit contributions
andel-0.039
est-0.039
umm-0.038
aff-0.038
 sky-0.037
Token other
Token ID584
Feature activation-1.98e+00

Pos logit contributions
umm+0.033
andel+0.031
aff+0.030
 sky+0.028
bows+0.027
Neg logit contributions
 managed-0.036
 him-0.034
 it-0.033
 never-0.033
 able-0.031
Token."
Token ID526
Feature activation-1.62e+00

Pos logit contributions
andel+0.035
umm+0.034
est+0.034
aff+0.033
 sky+0.032
Neg logit contributions
 managed-0.035
 wouldn-0.031
 never-0.031
 able-0.030
 him-0.030
Token Sam
Token ID3409
Feature activation+2.16e+00

Pos logit contributions
 managed+0.040
 wouldn+0.035
 able+0.034
 never+0.034
 him+0.033
Neg logit contributions
est-0.045
andel-0.044
 sky-0.043
umm-0.042
aff-0.042
Token spoke
Token ID5158
Feature activation+1.26e+00

Pos logit contributions
 managed+0.022
 able+0.020
 saving+0.019
 preventing+0.019
 him+0.019
Neg logit contributions
est-0.048
 sky-0.048
aff-0.047
umpy-0.046
andel-0.046
Token up
Token ID510
Feature activation+1.26e+00

Pos logit contributions
 managed+0.017
 wouldn+0.014
 never+0.014
 able+0.013
 him+0.013
Neg logit contributions
est-0.027
andel-0.027
umm-0.027
aff-0.026
 sky-0.026
Token:
Token ID25
Feature activation+4.76e+00

Pos logit contributions
 managed+0.021
 wouldn+0.019
 never+0.019
 able+0.018
 him+0.018
Neg logit contributions
andel-0.023
est-0.022
umm-0.022
aff-0.022
 sky-0.021
Token "
Token ID366
Feature activation+3.82e+00

Pos logit contributions
 managed+0.080
 wouldn+0.071
 able+0.069
 never+0.069
 him+0.067
Neg logit contributions
andel-0.085
est-0.085
umm-0.084
aff-0.082
 sky-0.081
TokenShould
Token ID19926
Feature activation+9.66e+00

Pos logit contributions
 managed+0.058
 wouldn+0.049
 never+0.049
 him+0.047
 able+0.046
Neg logit contributions
andel-0.078
est-0.076
umm-0.076
aff-0.075
 sky-0.072
Tokenn
Token ID77
Feature activation+2.74e+00

Pos logit contributions
 managed+0.135
 able+0.118
pped+0.110
 paying+0.107
 wouldn+0.107
Neg logit contributions
est-0.191
 sky-0.185
umm-0.183
andel-0.182
aff-0.182
Token't
Token ID470
Feature activation+4.14e+00

Pos logit contributions
 managed+0.038
 wouldn+0.032
 never+0.030
 able+0.030
 remembered+0.030
Neg logit contributions
est-0.057
andel-0.057
aff-0.055
 sky-0.055
umpy-0.054
Token we
Token ID356
Feature activation+5.65e+00

Pos logit contributions
 managed+0.050
 wouldn+0.042
 never+0.041
 able+0.040
 him+0.040
Neg logit contributions
andel-0.095
est-0.094
umm-0.094
aff-0.092
 sky-0.090
Token share
Token ID2648
Feature activation+2.99e+00

Pos logit contributions
 managed+0.072
 wouldn+0.061
 able+0.060
 never+0.058
 him+0.057
Neg logit contributions
andel-0.122
est-0.121
umm-0.121
aff-0.119
 sky-0.117
Token the
Token ID262
Feature activation-9.07e+00

Pos logit contributions
 managed+0.031
 wouldn+0.025
 never+0.023
 able+0.023
 him+0.022
Neg logit contributions
andel-0.074
est-0.074
umm-0.073
aff-0.072
 sky-0.071
Token She
Token ID1375
Feature activation+3.58e+00

Pos logit contributions
andel+0.024
est+0.024
umm+0.023
aff+0.022
 sky+0.021
Neg logit contributions
 managed-0.030
 wouldn-0.027
 him-0.027
 never-0.027
 able-0.026
Token said
Token ID531
Feature activation+1.16e+00

Pos logit contributions
 saving+0.049
 preventing+0.049
 managed+0.048
 able+0.048
 paying+0.047
Neg logit contributions
 sky-0.066
est-0.062
 girl-0.059
 sun-0.058
umpy-0.057
Token to
Token ID284
Feature activation+3.54e+00

Pos logit contributions
 managed+0.021
 wouldn+0.018
 never+0.018
 able+0.017
pped+0.017
Neg logit contributions
est-0.019
andel-0.018
 sky-0.018
aff-0.018
umm-0.017
Token Mr
Token ID1770
Feature activation-1.09e+01

Pos logit contributions
 managed+0.067
 wouldn+0.057
 able+0.056
 remembered+0.052
 never+0.052
Neg logit contributions
andel-0.059
est-0.057
aff-0.057
umm-0.057
 sky-0.056
Token.
Token ID13
Feature activation-8.51e+00

Pos logit contributions
andel+0.191
umm+0.181
est+0.178
aff+0.173
 sky+0.169
Neg logit contributions
 managed-0.195
 him-0.182
 wouldn-0.178
 never-0.174
 it-0.170
Token Sp
Token ID1338
Feature activation-1.58e+01

Pos logit contributions
andel+0.169
est+0.169
umm+0.166
aff+0.165
umpy+0.157
Neg logit contributions
 managed-0.129
 wouldn-0.112
 him-0.109
 never-0.108
 able-0.106
Tokenud
Token ID463
Feature activation-5.77e-01

Pos logit contributions
andel+0.251
aff+0.244
 sky+0.243
est+0.243
umm+0.239
Neg logit contributions
 managed-0.285
 wouldn-0.266
 never-0.259
 him-0.254
 not-0.251
Token,
Token ID11
Feature activation+3.63e+00

Pos logit contributions
est+0.008
aff+0.008
andel+0.008
 sky+0.007
umpy+0.007
Neg logit contributions
 managed-0.012
 able-0.010
 wouldn-0.010
 never-0.010
 preventing-0.010
Token "
Token ID366
Feature activation+1.77e+00

Pos logit contributions
 managed+0.081
pped+0.072
check+0.070
eks+0.070
eling+0.069
Neg logit contributions
 sky-0.047
est-0.038
 sun-0.038
 Today-0.036
 girl-0.035
TokenI
Token ID40
Feature activation+3.40e+00

Pos logit contributions
 managed+0.027
 wouldn+0.024
 able+0.024
 never+0.023
 him+0.023
Neg logit contributions
est-0.034
umm-0.034
andel-0.034
 sky-0.033
aff-0.033
Token want
Token ID765
Feature activation+3.54e+00

Pos logit contributions
 managed+0.054
 able+0.050
pped+0.049
 him+0.048
 wouldn+0.047
Neg logit contributions
est-0.060
umpy-0.058
 sky-0.056
andel-0.055
 world-0.055
Token,
Token ID11
Feature activation+1.35e+00

Pos logit contributions
 managed+0.016
 wouldn+0.014
 able+0.013
 never+0.013
pped+0.013
Neg logit contributions
est-0.021
andel-0.021
 sky-0.020
aff-0.020
umm-0.020
Token he
Token ID339
Feature activation+3.33e+00

Pos logit contributions
 managed+0.014
 wouldn+0.011
 never+0.011
 able+0.011
 remembered+0.010
Neg logit contributions
andel-0.033
est-0.032
umm-0.032
 sky-0.032
aff-0.032
Token noticed
Token ID6810
Feature activation+2.44e+00

Pos logit contributions
 managed+0.035
 able+0.034
 him+0.033
 paying+0.032
pped+0.032
Neg logit contributions
 sky-0.073
est-0.070
 world-0.067
andel-0.064
aff-0.064
Token that
Token ID326
Feature activation-9.71e-03

Pos logit contributions
 managed+0.028
 wouldn+0.023
 never+0.022
 able+0.022
 him+0.021
Neg logit contributions
andel-0.058
est-0.057
umm-0.057
aff-0.056
 sky-0.055
Token the
Token ID262
Feature activation-1.05e+01

Pos logit contributions
andel+0.000
est+0.000
umm+0.000
aff+0.000
 sky+0.000
Neg logit contributions
 managed-0.000
 wouldn-0.000
 never-0.000
 able-0.000
 him-0.000
Token Sm
Token ID2439
Feature activation-1.53e+01

Pos logit contributions
andel+0.163
umm+0.160
aff+0.155
est+0.153
ay+0.145
Neg logit contributions
 managed-0.206
 never-0.200
 not-0.189
 going-0.183
 it-0.182
Tokenelly
Token ID6148
Feature activation-8.70e+00

Pos logit contributions
est+0.289
andel+0.282
 sky+0.278
umm+0.273
aff+0.270
Neg logit contributions
 managed-0.234
 never-0.203
 able-0.200
 him-0.196
 wouldn-0.195
Token King
Token ID2677
Feature activation-1.90e+00

Pos logit contributions
andel+0.117
umm+0.113
aff+0.107
est+0.107
umpy+0.099
Neg logit contributions
 managed-0.189
 never-0.173
 wouldn-0.173
 it-0.170
 him-0.169
Token was
Token ID373
Feature activation+1.85e+00

Pos logit contributions
est+0.043
andel+0.042
aff+0.042
 sky+0.041
umm+0.041
Neg logit contributions
 managed-0.022
 wouldn-0.018
 able-0.018
 never-0.018
 him-0.018
Token cruel
Token ID12177
Feature activation-3.98e+00

Pos logit contributions
 managed+0.024
 wouldn+0.021
 him+0.019
 able+0.019
 never+0.018
Neg logit contributions
andel-0.039
est-0.039
 sky-0.039
umm-0.038
aff-0.037
Token and
Token ID290
Feature activation+3.62e+00

Pos logit contributions
est+0.065
andel+0.061
umm+0.060
 sky+0.060
aff+0.059
Neg logit contributions
 managed-0.076
 wouldn-0.066
 able-0.065
 never-0.065
 him-0.063
Token,
Token ID11
Feature activation-4.40e-01

Pos logit contributions
andel+0.026
est+0.026
umm+0.026
aff+0.025
 sky+0.025
Neg logit contributions
 managed-0.027
 wouldn-0.024
 never-0.024
 able-0.023
 him-0.023
Token Lily
Token ID20037
Feature activation+2.62e-01

Pos logit contributions
andel+0.008
est+0.008
umm+0.007
 sky+0.007
aff+0.007
Neg logit contributions
 managed-0.008
 wouldn-0.007
 able-0.007
 never-0.007
 him-0.006
Token and
Token ID290
Feature activation+2.10e-02

Pos logit contributions
 managed+0.004
 able+0.004
 preventing+0.004
 wouldn+0.004
 him+0.004
Neg logit contributions
est-0.005
 sky-0.005
umpy-0.004
 world-0.004
aff-0.004
Token Mr
Token ID1770
Feature activation-1.02e+01

Pos logit contributions
 managed+0.000
 wouldn+0.000
 able+0.000
 never+0.000
 remembered+0.000
Neg logit contributions
est-0.000
andel-0.000
umm-0.000
aff-0.000
 sky-0.000
Token.
Token ID13
Feature activation-8.83e+00

Pos logit contributions
andel+0.183
umm+0.175
est+0.173
aff+0.163
 sky+0.162
Neg logit contributions
 managed-0.177
 him-0.167
 wouldn-0.162
 never-0.155
 remembered-0.153
Token Sp
Token ID1338
Feature activation-1.52e+01

Pos logit contributions
andel+0.165
est+0.164
umm+0.162
aff+0.160
 sky+0.154
Neg logit contributions
 managed-0.143
 wouldn-0.126
 him-0.123
 never-0.123
 able-0.121
Tokenud
Token ID463
Feature activation+6.08e-01

Pos logit contributions
andel+0.248
 sky+0.242
est+0.240
aff+0.239
umm+0.232
Neg logit contributions
 managed-0.267
 wouldn-0.252
 never-0.244
 him-0.239
 listened-0.235
Token went
Token ID1816
Feature activation+1.16e+00

Pos logit contributions
 managed+0.010
 able+0.009
 never+0.009
 preventing+0.009
 wouldn+0.009
Neg logit contributions
est-0.011
aff-0.010
andel-0.010
 sky-0.010
umpy-0.010
Token to
Token ID284
Feature activation+1.80e+00

Pos logit contributions
 managed+0.018
 wouldn+0.015
pped+0.014
 never+0.014
 remembered+0.014
Neg logit contributions
 sky-0.023
andel-0.023
est-0.022
aff-0.022
 world-0.022
Token the
Token ID262
Feature activation-9.10e+00

Pos logit contributions
 managed+0.027
 wouldn+0.023
 able+0.022
pped+0.022
 never+0.021
Neg logit contributions
andel-0.036
 sky-0.035
est-0.035
aff-0.034
 world-0.034
Token park
Token ID3952
Feature activation-6.31e-01

Pos logit contributions
andel+0.104
est+0.102
umm+0.101
aff+0.097
 sky+0.094
Neg logit contributions
 managed-0.215
 wouldn-0.196
 never-0.193
 able-0.192
 him-0.191
Tokenaters
Token ID8605
Feature activation-1.44e+00

Pos logit contributions
andel+0.105
est+0.104
umm+0.098
aff+0.097
 sky+0.092
Neg logit contributions
 managed-0.317
 wouldn-0.292
 him-0.290
 never-0.289
 able-0.288
Token.
Token ID13
Feature activation-4.52e-02

Pos logit contributions
est+0.023
andel+0.023
 sky+0.022
umm+0.022
aff+0.022
Neg logit contributions
 managed-0.027
 wouldn-0.024
 able-0.024
 never-0.024
 remembered-0.023
Token They
Token ID1119
Feature activation+4.62e+00

Pos logit contributions
est+0.001
andel+0.001
umm+0.001
aff+0.001
umpy+0.001
Neg logit contributions
 managed-0.001
 wouldn-0.001
 him-0.001
 it-0.001
 never-0.001
Token lifted
Token ID13663
Feature activation+2.05e+00

Pos logit contributions
 managed+0.055
 preventing+0.054
 able+0.052
 saving+0.052
 wouldn+0.049
Neg logit contributions
 sky-0.092
est-0.089
aff-0.087
andel-0.086
umpy-0.080
Token the
Token ID262
Feature activation-9.40e+00

Pos logit contributions
 managed+0.018
 wouldn+0.014
 able+0.013
 never+0.013
 him+0.012
Neg logit contributions
andel-0.054
est-0.054
umm-0.052
aff-0.052
 sky-0.052
Token swe
Token ID3490
Feature activation-1.52e+01

Pos logit contributions
umm+0.158
andel+0.157
aff+0.154
umpy+0.145
est+0.140
Neg logit contributions
 managed-0.166
 wouldn-0.162
 it-0.152
 him-0.150
 going-0.149
Tokenaters
Token ID8605
Feature activation-3.78e-01

Pos logit contributions
andel+0.123
est+0.117
aff+0.115
umpy+0.110
umm+0.110
Neg logit contributions
 managed-0.391
 wouldn-0.368
 him-0.364
 never-0.358
 able-0.356
Token and
Token ID290
Feature activation+4.69e+00

Pos logit contributions
est+0.006
 sky+0.006
andel+0.006
aff+0.005
umm+0.005
Neg logit contributions
 managed-0.007
 remembered-0.006
 listened-0.006
 able-0.006
 wouldn-0.006
Token found
Token ID1043
Feature activation+2.73e+00

Pos logit contributions
 preventing+0.051
 managed+0.050
 saving+0.047
 gonna+0.047
 never+0.046
Neg logit contributions
 sky-0.095
est-0.094
andel-0.089
 girl-0.088
 sun-0.088
Token a
Token ID257
Feature activation-5.35e+00

Pos logit contributions
 managed+0.035
 wouldn+0.029
 never+0.027
 able+0.027
 listened+0.027
Neg logit contributions
andel-0.061
est-0.061
 sky-0.058
aff-0.058
umm-0.058
Token small
Token ID1402
Feature activation-8.10e+00

Pos logit contributions
andel+0.084
umm+0.083
est+0.080
aff+0.080
umpy+0.076
Neg logit contributions
 managed-0.105
 wouldn-0.096
 never-0.094
 it-0.093
 remembered-0.092
Token
Token ID198
Feature activation-5.02e+00

Pos logit contributions
andel+0.038
umm+0.038
est+0.038
umpy+0.034
aff+0.034
Neg logit contributions
 managed-0.031
 him-0.030
 it-0.028
 never-0.027
 wouldn-0.027
Token
Token ID198
Feature activation-4.98e+00

Pos logit contributions
umm+0.091
andel+0.090
est+0.089
aff+0.082
 sky+0.081
Neg logit contributions
 managed-0.089
 him-0.080
 never-0.080
 it-0.077
 wouldn-0.077
TokenTom
Token ID13787
Feature activation+1.66e+00

Pos logit contributions
andel+0.085
est+0.085
umm+0.084
aff+0.082
 sky+0.080
Neg logit contributions
 managed-0.089
 wouldn-0.079
 able-0.077
 never-0.077
 him-0.076
Token thanked
Token ID26280
Feature activation+2.44e+00

Pos logit contributions
 managed+0.021
 able+0.018
 wouldn+0.018
 him+0.018
 going+0.017
Neg logit contributions
est-0.036
andel-0.034
 sky-0.034
aff-0.034
umm-0.034
Token the
Token ID262
Feature activation-1.10e+01

Pos logit contributions
 managed+0.026
 wouldn+0.021
 able+0.019
 never+0.019
 remembered+0.018
Neg logit contributions
andel-0.060
est-0.059
aff-0.059
umm-0.058
 sky-0.057
Token wise
Token ID10787
Feature activation-1.52e+01

Pos logit contributions
umm+0.194
andel+0.188
aff+0.185
umpy+0.167
elly+0.163
Neg logit contributions
 never-0.193
 managed-0.191
 going-0.182
 it-0.178
 him-0.176
Token owl
Token ID39610
Feature activation-2.05e+00

Pos logit contributions
umm+0.243
andel+0.232
umpy+0.209
aff+0.206
umbs+0.204
Neg logit contributions
 never-0.266
 managed-0.259
 it-0.252
 him-0.249
 going-0.246
Token and
Token ID290
Feature activation+4.19e+00

Pos logit contributions
umm+0.046
andel+0.045
aff+0.043
umpy+0.043
est+0.042
Neg logit contributions
 managed-0.026
 him-0.024
 never-0.023
 wouldn-0.023
 able-0.023
Token went
Token ID1816
Feature activation+1.92e+00

Pos logit contributions
pped+0.054
 gonna+0.049
 managed+0.047
 ruining+0.046
 preventing+0.046
Neg logit contributions
est-0.081
 sky-0.077
 girl-0.074
 sun-0.072
andel-0.069
Token home
Token ID1363
Feature activation+4.82e-02

Pos logit contributions
 managed+0.030
 wouldn+0.027
 never+0.024
 remembered+0.024
 able+0.023
Neg logit contributions
andel-0.037
est-0.036
 sky-0.036
aff-0.036
umm-0.034
Token feeling
Token ID4203
Feature activation+2.33e+00

Pos logit contributions
 managed+0.001
 wouldn+0.001
 him+0.001
pped+0.001
 never+0.001
Neg logit contributions
andel-0.001
est-0.001
aff-0.001
umm-0.001
 sky-0.001
Token Annie
Token ID25860
Feature activation+1.06e-01

Pos logit contributions
andel+0.014
umm+0.013
est+0.013
aff+0.013
umpy+0.012
Neg logit contributions
 managed-0.015
 wouldn-0.014
 him-0.014
 never-0.014
 it-0.013
Token wanted
Token ID2227
Feature activation+3.91e+00

Pos logit contributions
 managed+0.001
 able+0.001
 paying+0.001
 preventing+0.001
pped+0.001
Neg logit contributions
est-0.002
 sky-0.002
aff-0.002
andel-0.002
umpy-0.002
Token to
Token ID284
Feature activation+4.12e+00

Pos logit contributions
 managed+0.057
 wouldn+0.049
pped+0.047
 able+0.047
 remembered+0.044
Neg logit contributions
 sky-0.078
andel-0.076
est-0.076
aff-0.075
 world-0.072
Token come
Token ID1282
Feature activation+1.06e+00

Pos logit contributions
 managed+0.055
 wouldn+0.046
 never+0.045
 able+0.044
 him+0.044
Neg logit contributions
andel-0.090
est-0.089
umm-0.088
aff-0.087
 sky-0.085
Token with
Token ID351
Feature activation+1.06e+00

Pos logit contributions
 managed+0.016
 wouldn+0.014
 able+0.013
 never+0.013
 him+0.013
Neg logit contributions
est-0.021
andel-0.021
 sky-0.020
aff-0.020
umm-0.020
Token Sp
Token ID1338
Feature activation-1.51e+01

Pos logit contributions
 managed+0.011
 wouldn+0.009
 able+0.008
 remembered+0.008
 never+0.008
Neg logit contributions
est-0.026
andel-0.026
 sky-0.025
umm-0.025
aff-0.025
Tokenide
Token ID485
Feature activation-5.94e+00

Pos logit contributions
 sky+0.208
est+0.194
andel+0.194
umm+0.190
 sun+0.179
Neg logit contributions
 managed-0.298
 never-0.277
pped-0.274
 wouldn-0.271
 him-0.268
Tokeny
Token ID88
Feature activation-2.10e+00

Pos logit contributions
umm+0.097
andel+0.096
est+0.090
aff+0.089
 sky+0.088
Neg logit contributions
 managed-0.111
 him-0.104
 wouldn-0.103
 never-0.100
 able-0.099
Token and
Token ID290
Feature activation+3.73e+00

Pos logit contributions
andel+0.044
umm+0.044
aff+0.042
est+0.042
umpy+0.041
Neg logit contributions
 managed-0.030
 him-0.027
 wouldn-0.027
 never-0.027
 able-0.026
Token explore
Token ID7301
Feature activation+5.16e-01

Pos logit contributions
pped+0.056
 managed+0.056
ged+0.049
bled+0.049
plug+0.047
Neg logit contributions
est-0.065
 sky-0.064
andel-0.059
 world-0.059
 sun-0.059
Token the
Token ID262
Feature activation-1.04e+01

Pos logit contributions
 managed+0.007
 wouldn+0.006
 him+0.006
 never+0.006
 able+0.005
Neg logit contributions
andel-0.011
umm-0.011
est-0.011
aff-0.011
 sky-0.011
Token "
Token ID366
Feature activation+1.44e+00

Pos logit contributions
 managed+0.048
 wouldn+0.042
 him+0.041
 able+0.040
 never+0.040
Neg logit contributions
 sky-0.039
andel-0.038
aff-0.038
est-0.038
umm-0.037
TokenI
Token ID40
Feature activation+4.11e+00

Pos logit contributions
 managed+0.021
 never+0.019
 wouldn+0.018
 him+0.017
 it+0.017
Neg logit contributions
andel-0.030
est-0.029
aff-0.029
umm-0.028
umpy-0.027
Token like
Token ID588
Feature activation+1.71e+00

Pos logit contributions
 managed+0.066
 able+0.059
 him+0.058
 wouldn+0.058
pped+0.056
Neg logit contributions
est-0.075
andel-0.073
aff-0.071
umm-0.071
umpy-0.070
Token your
Token ID534
Feature activation-6.78e+00

Pos logit contributions
 managed+0.025
 wouldn+0.020
 retrieved+0.019
 remembered+0.019
 able+0.019
Neg logit contributions
 sky-0.037
 sun-0.035
andel-0.035
 world-0.035
est-0.034
Token smile
Token ID8212
Feature activation-9.46e-01

Pos logit contributions
andel+0.111
aff+0.111
umm+0.106
umpy+0.103
est+0.100
Neg logit contributions
 managed-0.125
 wouldn-0.119
 him-0.116
 going-0.116
 it-0.115
Token Sp
Token ID1338
Feature activation-1.51e+01

Pos logit contributions
andel+0.017
umm+0.017
est+0.017
aff+0.016
 sky+0.016
Neg logit contributions
 managed-0.016
 wouldn-0.014
 never-0.014
 able-0.014
 him-0.014
Tokenud
Token ID463
Feature activation-3.36e+00

Pos logit contributions
 sky+0.268
andel+0.267
aff+0.260
est+0.257
umm+0.254
Neg logit contributions
 managed-0.240
 wouldn-0.218
 him-0.208
pped-0.207
 never-0.207
Token.
Token ID13
Feature activation-2.85e-01

Pos logit contributions
andel+0.052
est+0.051
umm+0.051
aff+0.049
 sky+0.048
Neg logit contributions
 managed-0.066
 wouldn-0.059
 never-0.058
 able-0.057
 him-0.057
Token You
Token ID921
Feature activation+3.86e+00

Pos logit contributions
andel+0.006
aff+0.006
est+0.006
umm+0.005
umpy+0.005
Neg logit contributions
 managed-0.004
 wouldn-0.004
 never-0.004
 it-0.004
 him-0.004
Token always
Token ID1464
Feature activation+6.40e+00

Pos logit contributions
 managed+0.055
 wouldn+0.048
 able+0.047
 him+0.047
 never+0.046
Neg logit contributions
est-0.078
andel-0.078
umm-0.077
aff-0.075
 sky-0.075
Token make
Token ID787
Feature activation+5.72e-01

Pos logit contributions
 managed+0.098
 able+0.086
 him+0.085
pped+0.085
 wouldn+0.084
Neg logit contributions
est-0.123
 sky-0.114
 world-0.111
andel-0.111
umm-0.110
Token
Token ID198
Feature activation-4.50e+00

Pos logit contributions
umm+0.094
andel+0.093
est+0.092
aff+0.084
 sky+0.083
Neg logit contributions
 managed-0.084
 never-0.075
 it-0.074
 him-0.073
 not-0.072
TokenL
Token ID43
Feature activation-7.17e+00

Pos logit contributions
andel+0.072
est+0.071
umm+0.070
aff+0.069
umpy+0.067
Neg logit contributions
 managed-0.085
 wouldn-0.077
 him-0.075
 able-0.074
 never-0.074
Tokenily
Token ID813
Feature activation+1.77e+00

Pos logit contributions
andel+0.108
umm+0.106
est+0.106
aff+0.102
 sky+0.100
Neg logit contributions
 managed-0.144
 wouldn-0.129
 never-0.127
 able-0.127
 him-0.126
Token asked
Token ID1965
Feature activation+3.58e+00

Pos logit contributions
 managed+0.029
 him+0.026
 able+0.026
 wouldn+0.026
 going+0.025
Neg logit contributions
est-0.030
 sky-0.029
aff-0.029
umpy-0.028
andel-0.028
Token the
Token ID262
Feature activation-1.01e+01

Pos logit contributions
 managed+0.051
 wouldn+0.040
 able+0.040
 never+0.039
 remembered+0.037
Neg logit contributions
est-0.076
andel-0.072
aff-0.072
umm-0.070
 sky-0.069
Token paramed
Token ID34581
Feature activation-1.51e+01

Pos logit contributions
umm+0.158
andel+0.154
aff+0.145
est+0.144
igsaw+0.136
Neg logit contributions
 managed-0.199
 never-0.190
 going-0.189
 not-0.186
 him-0.182
Tokenic
Token ID291
Feature activation-6.02e+00

Pos logit contributions
andel+0.219
est+0.199
 sky+0.198
umm+0.198
aff+0.191
Neg logit contributions
 managed-0.314
 wouldn-0.289
 never-0.283
 listened-0.276
 able-0.276
Token,
Token ID11
Feature activation+2.61e+00

Pos logit contributions
est+0.085
andel+0.079
aff+0.077
umm+0.073
umpy+0.072
Neg logit contributions
 managed-0.129
 wouldn-0.115
 able-0.115
 never-0.113
 him-0.111
Token "
Token ID366
Feature activation+2.42e+00

Pos logit contributions
 managed+0.053
 wouldn+0.047
pped+0.046
 him+0.045
 able+0.045
Neg logit contributions
 sky-0.036
andel-0.035
est-0.034
aff-0.033
umpy-0.032
TokenIs
Token ID3792
Feature activation+5.31e+00

Pos logit contributions
 managed+0.040
 able+0.035
 wouldn+0.035
 never+0.033
 going+0.032
Neg logit contributions
est-0.042
andel-0.042
 sky-0.042
umm-0.042
aff-0.040
Token the
Token ID262
Feature activation-8.38e+00

Pos logit contributions
 managed+0.065
 freed+0.054
 able+0.052
 retrieved+0.051
 paying+0.051
Neg logit contributions
 sky-0.110
est-0.106
andel-0.103
aff-0.102
 sun-0.098
Token Dad
Token ID17415
Feature activation+2.30e+00

Pos logit contributions
 managed+0.018
 able+0.017
 wouldn+0.016
 never+0.016
 saving+0.016
Neg logit contributions
 sky-0.014
andel-0.014
umm-0.013
est-0.013
aff-0.013
Token then
Token ID788
Feature activation+3.23e+00

Pos logit contributions
 managed+0.025
 able+0.023
 preventing+0.022
 saving+0.022
 never+0.021
Neg logit contributions
est-0.050
 sky-0.049
aff-0.048
andel-0.048
umm-0.047
Token looked
Token ID3114
Feature activation+2.82e+00

Pos logit contributions
 managed+0.036
 wouldn+0.031
 never+0.031
 able+0.031
 him+0.029
Neg logit contributions
est-0.071
 sky-0.070
andel-0.070
aff-0.069
umm-0.068
Token at
Token ID379
Feature activation+2.25e+00

Pos logit contributions
 managed+0.040
 wouldn+0.034
 never+0.033
 able+0.033
 him+0.033
Neg logit contributions
andel-0.059
est-0.058
umm-0.058
aff-0.056
 sky-0.056
Token the
Token ID262
Feature activation-9.51e+00

Pos logit contributions
 managed+0.021
 wouldn+0.017
 him+0.016
 never+0.016
 able+0.016
Neg logit contributions
andel-0.058
est-0.057
umm-0.057
aff-0.056
 sky-0.055
Token bald
Token ID28273
Feature activation-1.50e+01

Pos logit contributions
andel+0.171
aff+0.170
umm+0.164
est+0.156
affe+0.154
Neg logit contributions
 it-0.153
 managed-0.151
 not-0.148
 wouldn-0.145
 going-0.144
Token man
Token ID582
Feature activation-1.18e+00

Pos logit contributions
umm+0.220
andel+0.213
aff+0.204
est+0.193
umpy+0.188
Neg logit contributions
 managed-0.284
 it-0.280
 wouldn-0.273
 not-0.269
 him-0.264
Token on
Token ID319
Feature activation+7.54e-01

Pos logit contributions
andel+0.020
est+0.020
umm+0.019
aff+0.019
 sky+0.019
Neg logit contributions
 managed-0.021
 wouldn-0.019
 never-0.019
 able-0.018
 him-0.018
Token the
Token ID262
Feature activation-9.33e+00

Pos logit contributions
 managed+0.007
 him+0.006
 never+0.005
 wouldn+0.005
 it+0.005
Neg logit contributions
andel-0.019
est-0.019
umm-0.019
aff-0.019
umpy-0.019
Token TV
Token ID3195
Feature activation-3.61e+00

Pos logit contributions
umm+0.160
aff+0.159
andel+0.154
est+0.153
umpy+0.145
Neg logit contributions
 managed-0.155
 not-0.153
 it-0.150
 never-0.150
 going-0.149
Token.
Token ID13
Feature activation-2.64e-01

Pos logit contributions
andel+0.058
umm+0.055
est+0.055
aff+0.054
umpy+0.052
Neg logit contributions
 managed-0.069
 wouldn-0.063
 him-0.062
 never-0.062
 able-0.061
Token wanted
Token ID2227
Feature activation+3.49e+00

Pos logit contributions
 able+0.042
 him+0.041
 preventing+0.041
 managed+0.040
 saving+0.040
Neg logit contributions
 sky-0.059
est-0.057
aff-0.056
 world-0.052
umpy-0.052
Token to
Token ID284
Feature activation+4.04e+00

Pos logit contributions
 managed+0.049
 wouldn+0.042
 able+0.039
pped+0.038
 remembered+0.038
Neg logit contributions
 sky-0.073
andel-0.071
aff-0.070
est-0.070
 world-0.068
Token play
Token ID711
Feature activation-1.74e+00

Pos logit contributions
 managed+0.059
 wouldn+0.052
 able+0.049
 never+0.048
 him+0.047
Neg logit contributions
andel-0.081
est-0.081
aff-0.079
umm-0.079
 sky-0.078
Token with
Token ID351
Feature activation-9.54e-02

Pos logit contributions
andel+0.034
umm+0.033
est+0.033
aff+0.032
 sky+0.031
Neg logit contributions
 managed-0.028
 wouldn-0.024
 never-0.024
 him-0.024
 able-0.024
Token the
Token ID262
Feature activation-9.95e+00

Pos logit contributions
andel+0.003
est+0.003
aff+0.002
 sky+0.002
umm+0.002
Neg logit contributions
 managed-0.001
 wouldn-0.001
 able-0.001
 never-0.001
 remembered-0.001
Token wise
Token ID10787
Feature activation-1.50e+01

Pos logit contributions
andel+0.170
umm+0.168
aff+0.163
est+0.153
umpy+0.147
Neg logit contributions
 managed-0.184
 it-0.172
 never-0.168
 wouldn-0.166
 not-0.165
Token old
Token ID1468
Feature activation-1.20e+01

Pos logit contributions
umm+0.231
andel+0.216
aff+0.194
lights+0.179
ement+0.178
Neg logit contributions
 it-0.283
 not-0.263
 managed-0.262
 never-0.259
 tried-0.257
Token cat
Token ID3797
Feature activation-2.30e+00

Pos logit contributions
umm+0.160
andel+0.160
aff+0.144
est+0.137
umpy+0.133
Neg logit contributions
 managed-0.268
 it-0.261
 not-0.251
 never-0.249
 wouldn-0.248
Token using
Token ID1262
Feature activation+2.52e+00

Pos logit contributions
est+0.034
andel+0.032
aff+0.031
umm+0.031
 sky+0.031
Neg logit contributions
 managed-0.046
 wouldn-0.041
 never-0.041
 able-0.040
 him-0.040
Token the
Token ID262
Feature activation-1.00e+01

Pos logit contributions
 managed+0.029
 wouldn+0.024
 listened+0.023
pped+0.022
 remembered+0.022
Neg logit contributions
est-0.057
 sky-0.057
andel-0.056
 world-0.054
aff-0.053
Token string
Token ID4731
Feature activation-1.09e+00

Pos logit contributions
andel+0.165
umm+0.165
aff+0.161
est+0.156
umpy+0.150
Neg logit contributions
 managed-0.189
 wouldn-0.170
 it-0.168
 never-0.166
 him-0.162
Token girl
Token ID2576
Feature activation+1.46e+00

Pos logit contributions
aff+0.101
andel+0.098
umm+0.097
umpy+0.088
ay+0.080
Neg logit contributions
 managed-0.211
 remembered-0.201
 tried-0.200
 it-0.199
 wouldn-0.190
Token walked
Token ID6807
Feature activation+9.41e-01

Pos logit contributions
 managed+0.016
 able+0.014
 him+0.014
pped+0.014
 preventing+0.014
Neg logit contributions
est-0.032
 sky-0.031
aff-0.030
andel-0.030
umpy-0.029
Token up
Token ID510
Feature activation+1.16e+00

Pos logit contributions
 managed+0.012
 wouldn+0.011
 never+0.010
 able+0.010
 him+0.010
Neg logit contributions
andel-0.021
est-0.020
umm-0.020
aff-0.020
 sky-0.020
Token to
Token ID284
Feature activation+2.12e+00

Pos logit contributions
 managed+0.013
 wouldn+0.011
 able+0.010
 never+0.010
 him+0.009
Neg logit contributions
andel-0.028
est-0.027
umm-0.027
aff-0.027
 sky-0.027
Token the
Token ID262
Feature activation-9.51e+00

Pos logit contributions
 managed+0.026
 wouldn+0.022
 able+0.021
 never+0.019
 remembered+0.019
Neg logit contributions
andel-0.049
est-0.048
 sky-0.047
aff-0.047
umm-0.046
Token bald
Token ID28273
Feature activation-1.49e+01

Pos logit contributions
umm+0.159
andel+0.154
aff+0.148
est+0.145
umpy+0.136
Neg logit contributions
 managed-0.179
 it-0.174
 never-0.169
 him-0.166
 not-0.164
Token man
Token ID582
Feature activation-2.37e-01

Pos logit contributions
umm+0.228
andel+0.214
est+0.194
aff+0.191
umpy+0.190
Neg logit contributions
 managed-0.290
 it-0.278
 him-0.277
 never-0.275
 wouldn-0.272
Token and
Token ID290
Feature activation+3.44e+00

Pos logit contributions
andel+0.004
est+0.004
umm+0.004
aff+0.004
 sky+0.004
Neg logit contributions
 managed-0.004
 wouldn-0.004
 never-0.004
 able-0.003
 him-0.003
Token asked
Token ID1965
Feature activation+3.87e+00

Pos logit contributions
pped+0.049
 managed+0.048
 preventing+0.044
 wouldn+0.044
 gonna+0.043
Neg logit contributions
 sky-0.056
 girl-0.056
est-0.054
aff-0.052
andel-0.052
Token,
Token ID11
Feature activation+2.57e+00

Pos logit contributions
 managed+0.044
 wouldn+0.035
 never+0.034
 able+0.034
 remembered+0.031
Neg logit contributions
est-0.092
andel-0.091
aff-0.089
umm-0.088
 sky-0.087
Token "
Token ID366
Feature activation+2.31e+00

Pos logit contributions
 managed+0.046
 wouldn+0.040
 never+0.039
 able+0.039
 him+0.038
Neg logit contributions
andel-0.043
est-0.042
 sky-0.042
umm-0.041
aff-0.041
Token a
Token ID257
Feature activation-8.71e+00

Pos logit contributions
est+0.079
 sky+0.076
andel+0.074
umpy+0.073
 world+0.072
Neg logit contributions
 managed-0.052
 wouldn-0.045
 able-0.044
 never-0.041
check-0.041
Token big
Token ID1263
Feature activation-9.99e+00

Pos logit contributions
umm+0.135
est+0.130
andel+0.126
aff+0.114
bows+0.105
Neg logit contributions
 managed-0.176
 it-0.169
 remembered-0.169
 not-0.165
 never-0.164
Token bird
Token ID6512
Feature activation-3.20e+00

Pos logit contributions
umm+0.148
andel+0.144
aff+0.140
est+0.138
umpy+0.136
Neg logit contributions
 managed-0.191
 it-0.190
 him-0.183
 going-0.182
 remembered-0.181
Token named
Token ID3706
Feature activation-9.19e-01

Pos logit contributions
est+0.057
andel+0.056
umm+0.054
aff+0.054
 sky+0.054
Neg logit contributions
 managed-0.055
 wouldn-0.048
 able-0.047
 never-0.046
 him-0.046
Token O
Token ID440
Feature activation-9.01e+00

Pos logit contributions
umm+0.023
andel+0.023
est+0.023
umpy+0.022
aff+0.021
Neg logit contributions
 managed-0.010
 him-0.008
 it-0.008
 never-0.008
 wouldn-0.008
Tokenstr
Token ID2536
Feature activation-1.49e+01

Pos logit contributions
aff+0.189
andel+0.188
est+0.187
 sky+0.186
umpy+0.185
Neg logit contributions
 managed-0.114
 wouldn-0.101
 able-0.099
 him-0.094
 it-0.092
Tokenich
Token ID488
Feature activation-1.78e+00

Pos logit contributions
andel+0.265
est+0.263
umm+0.258
aff+0.242
 sky+0.239
Neg logit contributions
 managed-0.258
 never-0.229
 wouldn-0.224
 able-0.223
 him-0.222
Token wanted
Token ID2227
Feature activation+1.60e+00

Pos logit contributions
umm+0.038
andel+0.038
est+0.036
aff+0.035
umpy+0.034
Neg logit contributions
 managed-0.026
 never-0.023
 wouldn-0.023
 him-0.022
 it-0.022
Token to
Token ID284
Feature activation+2.95e+00

Pos logit contributions
 managed+0.021
 wouldn+0.019
 able+0.017
pped+0.016
 remembered+0.016
Neg logit contributions
 sky-0.034
est-0.034
andel-0.034
aff-0.033
umm-0.033
Token swim
Token ID9422
Feature activation-2.32e+00

Pos logit contributions
 managed+0.044
 wouldn+0.039
 able+0.037
 never+0.037
 him+0.036
Neg logit contributions
andel-0.059
est-0.058
umm-0.057
aff-0.056
 sky-0.056
Token in
Token ID287
Feature activation+2.42e-01

Pos logit contributions
umm+0.056
andel+0.055
aff+0.053
est+0.053
 sky+0.051
Neg logit contributions
 managed-0.027
 him-0.024
 wouldn-0.023
 never-0.023
 it-0.022
Token so
Token ID523
Feature activation+8.11e-01

Pos logit contributions
 managed+0.045
 wouldn+0.043
 him+0.041
pped+0.040
ged+0.038
Neg logit contributions
est-0.045
 sky-0.043
andel-0.042
umpy-0.042
aff-0.040
Token happy
Token ID3772
Feature activation-6.32e-01

Pos logit contributions
 managed+0.015
 wouldn+0.014
 him+0.013
 able+0.013
 never+0.013
Neg logit contributions
est-0.013
andel-0.013
umm-0.013
aff-0.013
umpy-0.013
Token and
Token ID290
Feature activation+3.24e+00

Pos logit contributions
est+0.010
 sky+0.010
aff+0.010
 world+0.009
umm+0.009
Neg logit contributions
 managed-0.012
 wouldn-0.010
pped-0.010
 freed-0.010
 remembered-0.010
Token thanked
Token ID26280
Feature activation+2.55e+00

Pos logit contributions
pped+0.048
eks+0.043
 managed+0.043
plug+0.042
 him+0.041
Neg logit contributions
 sky-0.058
est-0.057
 sun-0.055
aff-0.054
umm-0.051
Token the
Token ID262
Feature activation-1.11e+01

Pos logit contributions
 managed+0.027
 wouldn+0.022
 able+0.020
 never+0.020
 remembered+0.019
Neg logit contributions
andel-0.063
est-0.062
aff-0.062
umm-0.061
 sky-0.060
Token gr
Token ID1036
Feature activation-1.49e+01

Pos logit contributions
andel+0.179
umm+0.176
est+0.172
aff+0.171
bows+0.153
Neg logit contributions
 managed-0.217
 never-0.194
 not-0.189
 him-0.189
 wouldn-0.189
Tokenumpy
Token ID32152
Feature activation-1.15e+01

Pos logit contributions
 sky+0.071
umm+0.068
est+0.068
andel+0.067
bows+0.046
Neg logit contributions
 managed-0.427
 him-0.401
 able-0.395
 never-0.394
 wouldn-0.389
Token fisherman
Token ID48750
Feature activation-1.44e+00

Pos logit contributions
andel+0.167
umm+0.164
est+0.160
aff+0.155
 sky+0.146
Neg logit contributions
 managed-0.234
 wouldn-0.213
 him-0.212
 never-0.208
 it-0.208
Token.
Token ID13
Feature activation-2.12e+00

Pos logit contributions
andel+0.033
umm+0.033
est+0.031
aff+0.031
 sky+0.030
Neg logit contributions
 managed-0.018
 him-0.016
 wouldn-0.016
 never-0.016
 able-0.016
Token
Token ID220
Feature activation-3.58e+00

Pos logit contributions
andel+0.039
est+0.038
umm+0.037
aff+0.036
umpy+0.035
Neg logit contributions
 managed-0.036
 him-0.033
 never-0.033
 wouldn-0.032
 it-0.032
Token
Token ID198
Feature activation-5.13e+00

Pos logit contributions
andel+0.057
est+0.056
umm+0.056
aff+0.053
 sky+0.052
Neg logit contributions
 managed-0.069
 wouldn-0.061
 never-0.061
 him-0.061
 able-0.060
Token happy
Token ID3772
Feature activation-9.72e-01

Pos logit contributions
 managed+0.048
 wouldn+0.044
 him+0.042
pped+0.041
 never+0.040
Neg logit contributions
est-0.048
andel-0.045
 sky-0.045
umm-0.044
aff-0.043
Token.
Token ID13
Feature activation-1.88e+00

Pos logit contributions
 sky+0.018
est+0.017
umm+0.017
aff+0.017
andel+0.017
Neg logit contributions
 managed-0.016
 wouldn-0.014
pped-0.014
 able-0.013
 never-0.013
Token They
Token ID1119
Feature activation+5.03e+00

Pos logit contributions
andel+0.033
est+0.032
umm+0.031
aff+0.030
umpy+0.029
Neg logit contributions
 managed-0.033
 him-0.030
 never-0.030
 wouldn-0.030
 able-0.029
Token thanked
Token ID26280
Feature activation+2.92e+00

Pos logit contributions
 preventing+0.053
 saving+0.051
 managed+0.050
 risking+0.049
 him+0.049
Neg logit contributions
aff-0.104
 sky-0.104
est-0.100
andel-0.099
ay-0.095
Token the
Token ID262
Feature activation-1.08e+01

Pos logit contributions
 managed+0.038
 wouldn+0.032
 able+0.029
pped+0.028
 never+0.028
Neg logit contributions
 sky-0.065
andel-0.065
est-0.064
aff-0.064
umm-0.062
Token wise
Token ID10787
Feature activation-1.49e+01

Pos logit contributions
andel+0.171
umm+0.167
est+0.155
aff+0.152
ay+0.135
Neg logit contributions
 managed-0.215
 going-0.199
 never-0.198
 not-0.198
 listened-0.197
Token old
Token ID1468
Feature activation-1.22e+01

Pos logit contributions
umm+0.203
andel+0.198
est+0.165
ay+0.165
aff+0.158
Neg logit contributions
 managed-0.306
 it-0.295
 him-0.290
 never-0.282
 wouldn-0.280
Token man
Token ID582
Feature activation-1.45e+00

Pos logit contributions
andel+0.151
umm+0.147
est+0.141
aff+0.135
umpy+0.128
Neg logit contributions
 managed-0.279
 wouldn-0.254
 him-0.253
 never-0.253
 it-0.249
Token and
Token ID290
Feature activation+4.05e+00

Pos logit contributions
andel+0.033
umm+0.032
est+0.031
aff+0.030
umpy+0.030
Neg logit contributions
 managed-0.019
 him-0.016
 wouldn-0.016
 never-0.016
 able-0.016
Token played
Token ID2826
Feature activation-1.28e+00

Pos logit contributions
pped+0.058
eling+0.052
 gonna+0.051
 risking+0.050
 managed+0.048
Neg logit contributions
 sky-0.073
est-0.072
 sun-0.071
 girl-0.070
aff-0.066
Token with
Token ID351
Feature activation+5.20e-01

Pos logit contributions
est+0.027
andel+0.027
umm+0.027
aff+0.026
 sky+0.026
Neg logit contributions
 managed-0.018
 wouldn-0.015
 never-0.015
 able-0.014
 him-0.014
Token getting
Token ID1972
Feature activation+3.21e-01

Pos logit contributions
 managed+0.017
 wouldn+0.015
 him+0.014
 never+0.014
 able+0.014
Neg logit contributions
est-0.019
andel-0.019
umm-0.019
aff-0.019
 sky-0.019
Token very
Token ID845
Feature activation-1.17e+00

Pos logit contributions
 managed+0.005
 wouldn+0.005
 never+0.005
pped+0.004
 able+0.004
Neg logit contributions
andel-0.005
 sky-0.005
est-0.005
aff-0.005
umm-0.005
Token annoyed
Token ID25602
Feature activation-8.84e-01

Pos logit contributions
andel+0.020
est+0.019
aff+0.019
umm+0.019
umpy+0.019
Neg logit contributions
 managed-0.021
 wouldn-0.019
 him-0.018
 never-0.018
 able-0.018
Token with
Token ID351
Feature activation+1.34e+00

Pos logit contributions
andel+0.018
umm+0.017
est+0.017
aff+0.017
 sky+0.016
Neg logit contributions
 managed-0.013
 him-0.013
 wouldn-0.012
 never-0.012
 it-0.012
Token his
Token ID465
Feature activation-6.75e+00

Pos logit contributions
 managed+0.013
 wouldn+0.010
 remembered+0.009
 able+0.009
 never+0.008
Neg logit contributions
est-0.034
andel-0.034
 sky-0.033
aff-0.032
umm-0.032
Token rud
Token ID28906
Feature activation-1.48e+01

Pos logit contributions
andel+0.116
umm+0.115
aff+0.110
est+0.106
umpy+0.104
Neg logit contributions
 managed-0.121
 him-0.115
 not-0.115
 never-0.113
 wouldn-0.111
Tokeneness
Token ID9449
Feature activation-2.22e+00

Pos logit contributions
andel+0.191
 sky+0.177
aff+0.174
umm+0.173
 world+0.159
Neg logit contributions
 managed-0.302
 him-0.285
 wouldn-0.281
 never-0.273
 able-0.271
Token and
Token ID290
Feature activation+2.79e+00

Pos logit contributions
andel+0.036
est+0.036
umm+0.035
aff+0.034
 sky+0.034
Neg logit contributions
 managed-0.041
 wouldn-0.037
 never-0.036
 able-0.036
 him-0.036
Token complained
Token ID13832
Feature activation+8.67e-01

Pos logit contributions
pped+0.032
 managed+0.032
 able+0.026
 retrieved+0.026
plug+0.025
Neg logit contributions
est-0.059
 sky-0.057
aff-0.053
 sun-0.053
 world-0.052
Token that
Token ID326
Feature activation+2.68e+00

Pos logit contributions
 managed+0.014
 wouldn+0.012
 never+0.012
 able+0.011
 him+0.011
Neg logit contributions
est-0.017
andel-0.016
umm-0.016
aff-0.016
 sky-0.016
Token he
Token ID339
Feature activation+3.90e+00

Pos logit contributions
 managed+0.029
pped+0.024
 able+0.023
 wouldn+0.022
 retrieved+0.022
Neg logit contributions
 sky-0.063
est-0.062
 world-0.059
umm-0.059
andel-0.059
Token then
Token ID788
Feature activation+2.34e+00

Pos logit contributions
 managed+0.039
 retrieved+0.034
pped+0.032
ged+0.031
 wouldn+0.031
Neg logit contributions
est-0.073
 sky-0.071
 world-0.069
umm-0.069
andel-0.066
Token you
Token ID345
Feature activation+3.09e+00

Pos logit contributions
 managed+0.030
 able+0.026
pped+0.025
 wouldn+0.025
 retrieved+0.024
Neg logit contributions
est-0.050
 sky-0.047
umm-0.046
andel-0.045
 world-0.045
Token can
Token ID460
Feature activation+3.47e+00

Pos logit contributions
 managed+0.040
 wouldn+0.033
 able+0.033
 never+0.032
 him+0.032
Neg logit contributions
andel-0.067
est-0.067
umm-0.066
aff-0.065
 sky-0.064
Token get
Token ID651
Feature activation+1.07e+00

Pos logit contributions
 managed+0.045
 wouldn+0.038
 never+0.037
 able+0.036
 him+0.036
Neg logit contributions
andel-0.077
est-0.076
umm-0.076
aff-0.074
 sky-0.073
Token a
Token ID257
Feature activation-6.63e+00

Pos logit contributions
 managed+0.014
 wouldn+0.012
 never+0.011
 remembered+0.011
 listened+0.011
Neg logit contributions
 sky-0.021
andel-0.021
est-0.021
 sun-0.020
umm-0.020
Token namesake
Token ID49829
Feature activation-1.48e+01

Pos logit contributions
umm+0.112
andel+0.111
est+0.108
aff+0.108
umpy+0.104
Neg logit contributions
 managed-0.121
 wouldn-0.108
 it-0.107
 never-0.104
 able-0.103
Token bowl
Token ID9396
Feature activation-2.95e+00

Pos logit contributions
andel+0.242
aff+0.241
umpy+0.239
 languages+0.221
umm+0.218
Neg logit contributions
 managed-0.253
 it-0.250
 him-0.235
 wouldn-0.232
 never-0.220
Token!"
Token ID2474
Feature activation-2.42e+00

Pos logit contributions
andel+0.055
est+0.054
umm+0.053
aff+0.052
 sky+0.051
Neg logit contributions
 managed-0.048
 wouldn-0.043
 never-0.042
 able-0.041
 him-0.041
Token
Token ID220
Feature activation-4.17e+00

Pos logit contributions
andel+0.039
est+0.039
umm+0.038
aff+0.037
 sky+0.036
Neg logit contributions
 managed-0.045
 wouldn-0.040
 never-0.040
 him-0.039
 able-0.039
Token
Token ID198
Feature activation-5.63e+00

Pos logit contributions
andel+0.067
est+0.066
umm+0.066
aff+0.063
 sky+0.061
Neg logit contributions
 managed-0.079
 wouldn-0.071
 him-0.070
 never-0.070
 able-0.069
Token
Token ID198
Feature activation-5.51e+00

Pos logit contributions
andel+0.093
est+0.092
umm+0.092
aff+0.087
 sky+0.084
Neg logit contributions
 managed-0.105
 never-0.094
 him-0.093
 wouldn-0.092
 able-0.091
Token very
Token ID845
Feature activation+9.65e-01

Pos logit contributions
 managed+0.049
 wouldn+0.044
 him+0.041
 never+0.040
pped+0.040
Neg logit contributions
est-0.058
andel-0.057
 sky-0.056
umm-0.056
aff-0.055
Token excited
Token ID6568
Feature activation+6.65e-02

Pos logit contributions
 managed+0.017
 wouldn+0.015
 never+0.015
 him+0.015
 able+0.015
Neg logit contributions
andel-0.017
est-0.016
umm-0.016
aff-0.016
 sky-0.015
Token and
Token ID290
Feature activation+4.06e+00

Pos logit contributions
 managed+0.001
 wouldn+0.001
 able+0.001
 never+0.001
pped+0.001
Neg logit contributions
est-0.001
umm-0.001
andel-0.001
 sky-0.001
aff-0.001
Token hugged
Token ID41130
Feature activation+2.29e+00

Pos logit contributions
 managed+0.047
pped+0.046
 him+0.040
plug+0.040
 preventing+0.040
Neg logit contributions
est-0.082
 sky-0.082
aff-0.079
andel-0.075
umm-0.075
Token the
Token ID262
Feature activation-1.06e+01

Pos logit contributions
 managed+0.013
 him+0.012
 it+0.010
 never+0.009
 wouldn+0.009
Neg logit contributions
andel-0.067
umm-0.067
est-0.066
umpy-0.065
aff-0.064
Token bald
Token ID28273
Feature activation-1.47e+01

Pos logit contributions
umm+0.148
andel+0.146
aff+0.142
est+0.139
umpy+0.126
Neg logit contributions
 managed-0.233
 never-0.207
 wouldn-0.206
 going-0.206
 remembered-0.206
Token man
Token ID582
Feature activation-7.60e-01

Pos logit contributions
umm+0.210
andel+0.208
est+0.186
aff+0.183
 sky+0.173
Neg logit contributions
 managed-0.309
 wouldn-0.284
 remembered-0.279
 never-0.277
 it-0.277
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
andel+0.015
umm+0.015
aff+0.014
umpy+0.014
est+0.013
Neg logit contributions
 managed-0.012
 him-0.011
 never-0.011
 able-0.011
 wouldn-0.011
Token
Token ID198
Feature activation-4.62e+00

Pos logit contributions
est+0.023
andel+0.023
umm+0.022
umpy+0.021
aff+0.021
Neg logit contributions
 managed-0.020
 him-0.019
 it-0.018
 never-0.017
 able-0.017
Token
Token ID198
Feature activation-4.56e+00

Pos logit contributions
umm+0.081
est+0.080
andel+0.079
aff+0.074
 sky+0.072
Neg logit contributions
 managed-0.083
 him-0.075
 never-0.075
 it-0.072
 wouldn-0.072
TokenEvery
Token ID6109
Feature activation-3.87e+00

Pos logit contributions
andel+0.075
est+0.074
umm+0.073
aff+0.072
 sky+0.070
Neg logit contributions
 managed-0.084
 wouldn-0.076
 able-0.074
 never-0.074
 him-0.073
Token cute
Token ID13779
Feature activation-9.85e+00

Pos logit contributions
est+0.112
umm+0.111
andel+0.107
aff+0.103
bows+0.094
Neg logit contributions
 managed-0.149
 it-0.144
 remembered-0.143
 never-0.141
 not-0.141
Token pink
Token ID11398
Feature activation-1.01e+01

Pos logit contributions
umm+0.154
andel+0.145
umpy+0.140
est+0.135
lights+0.130
Neg logit contributions
 it-0.186
 him-0.174
 going-0.165
 accidentally-0.162
 not-0.159
Token rabbit
Token ID22746
Feature activation-3.00e+00

Pos logit contributions
umpy+0.170
umm+0.167
andel+0.162
aff+0.147
est+0.143
Neg logit contributions
 it-0.177
 him-0.158
 going-0.155
 never-0.153
 wouldn-0.152
Token called
Token ID1444
Feature activation+1.60e+00

Pos logit contributions
umm+0.074
andel+0.072
est+0.067
aff+0.066
umpy+0.064
Neg logit contributions
 never-0.035
 managed-0.034
 it-0.033
 him-0.032
 wouldn-0.031
Token Mrs
Token ID9074
Feature activation-1.01e+01

Pos logit contributions
 managed+0.017
 him+0.017
 it+0.016
 never+0.015
 not+0.014
Neg logit contributions
umm-0.038
andel-0.038
est-0.038
umpy-0.037
 sky-0.036
Token Fl
Token ID1610
Feature activation-1.46e+01

Pos logit contributions
andel+0.181
umm+0.177
est+0.176
umpy+0.166
 sky+0.159
Neg logit contributions
 managed-0.173
 him-0.167
 wouldn-0.157
 never-0.156
 it-0.155
Tokenuffy
Token ID15352
Feature activation-3.90e+00

Pos logit contributions
est+0.265
andel+0.252
umm+0.248
 sky+0.237
aff+0.235
Neg logit contributions
 managed-0.248
 wouldn-0.223
 him-0.222
 never-0.220
 remembered-0.216
Token.
Token ID13
Feature activation-5.38e+00

Pos logit contributions
umm+0.081
andel+0.081
est+0.075
umpy+0.074
aff+0.072
Neg logit contributions
 him-0.057
 managed-0.057
 never-0.055
 wouldn-0.054
 it-0.052
Token She
Token ID1375
Feature activation+1.95e+00

Pos logit contributions
andel+0.111
est+0.109
umm+0.107
aff+0.104
umpy+0.103
Neg logit contributions
 managed-0.075
 him-0.072
 never-0.067
 it-0.066
 going-0.065
Token was
Token ID373
Feature activation+8.93e-01

Pos logit contributions
 managed+0.030
 able+0.027
 wouldn+0.026
 preventing+0.025
 paying+0.025
Neg logit contributions
est-0.036
 sky-0.035
aff-0.034
umpy-0.033
 world-0.033
Token small
Token ID1402
Feature activation-3.99e+00

Pos logit contributions
 managed+0.013
 wouldn+0.011
 never+0.011
 able+0.010
 him+0.010
Neg logit contributions
andel-0.019
est-0.019
umm-0.018
aff-0.018
 sky-0.018
Token wants
Token ID3382
Feature activation+3.29e+00

Pos logit contributions
 managed+0.072
 able+0.070
 paying+0.062
pped+0.062
 saving+0.061
Neg logit contributions
 sky-0.083
est-0.081
aff-0.079
andel-0.077
umpy-0.076
Token to
Token ID284
Feature activation+3.86e+00

Pos logit contributions
 managed+0.034
 wouldn+0.028
 able+0.026
 never+0.026
 him+0.025
Neg logit contributions
andel-0.081
est-0.081
umm-0.080
aff-0.079
 sky-0.078
Token talk
Token ID1561
Feature activation-1.44e-01

Pos logit contributions
 managed+0.044
 never+0.037
 him+0.037
 wouldn+0.035
 it+0.035
Neg logit contributions
umm-0.094
andel-0.093
est-0.091
aff-0.088
umpy-0.086
Token to
Token ID284
Feature activation+2.26e+00

Pos logit contributions
andel+0.003
est+0.003
umm+0.003
aff+0.003
 sky+0.003
Neg logit contributions
 managed-0.002
 wouldn-0.001
 never-0.001
 able-0.001
 him-0.001
Token the
Token ID262
Feature activation-9.57e+00

Pos logit contributions
 managed+0.020
 him+0.016
 wouldn+0.016
 never+0.016
 able+0.015
Neg logit contributions
andel-0.059
umm-0.058
est-0.058
aff-0.057
 sky-0.056
Token bald
Token ID28273
Feature activation-1.46e+01

Pos logit contributions
andel+0.152
umm+0.150
est+0.142
aff+0.136
ay+0.129
Neg logit contributions
 managed-0.186
 going-0.178
 him-0.176
 not-0.174
 it-0.174
Token man
Token ID582
Feature activation-5.79e-01

Pos logit contributions
umm+0.231
andel+0.230
est+0.201
aff+0.196
ay+0.195
Neg logit contributions
 managed-0.276
 it-0.265
 wouldn-0.264
 him-0.262
 not-0.262
Token.
Token ID13
Feature activation-4.54e-01

Pos logit contributions
umm+0.011
andel+0.011
est+0.010
aff+0.010
 sky+0.010
Neg logit contributions
 managed-0.010
 him-0.009
 wouldn-0.009
 never-0.009
 able-0.009
Token He
Token ID679
Feature activation+4.42e+00

Pos logit contributions
andel+0.009
umm+0.009
est+0.008
aff+0.008
umpy+0.008
Neg logit contributions
 managed-0.007
 him-0.007
 it-0.007
 never-0.007
 wouldn-0.007
Token runs
Token ID4539
Feature activation+3.31e-02

Pos logit contributions
 managed+0.070
 able+0.065
pped+0.058
 wouldn+0.058
 paying+0.057
Neg logit contributions
est-0.081
 sky-0.080
aff-0.080
andel-0.078
umm-0.074
Token to
Token ID284
Feature activation+2.34e+00

Pos logit contributions
 managed+0.000
 wouldn+0.000
 able+0.000
 never+0.000
 him+0.000
Neg logit contributions
andel-0.001
est-0.001
aff-0.001
 sky-0.001
umm-0.001
Token and
Token ID290
Feature activation+3.56e+00

Pos logit contributions
 managed+0.020
 wouldn+0.018
 able+0.017
 never+0.017
 remembered+0.017
Neg logit contributions
andel-0.020
est-0.020
 sky-0.020
umm-0.019
aff-0.019
Token carefully
Token ID7773
Feature activation+3.26e+00

Pos logit contributions
pped+0.043
 risking+0.043
 never+0.043
 preventing+0.042
 not+0.040
Neg logit contributions
 sky-0.066
 girl-0.064
est-0.063
 sun-0.062
andel-0.061
Token picked
Token ID6497
Feature activation+1.72e+00

Pos logit contributions
 never+0.035
 managed+0.034
 wouldn+0.033
 not+0.030
 gonna+0.030
Neg logit contributions
andel-0.071
est-0.069
 sky-0.068
aff-0.065
umm-0.061
Token up
Token ID510
Feature activation+1.35e-01

Pos logit contributions
 managed+0.023
 wouldn+0.019
 never+0.019
 able+0.019
 him+0.018
Neg logit contributions
andel-0.037
est-0.037
umm-0.037
aff-0.036
 sky-0.035
Token the
Token ID262
Feature activation-1.12e+01

Pos logit contributions
 managed+0.001
 him+0.001
 it+0.001
 wouldn+0.001
 never+0.001
Neg logit contributions
umm-0.004
andel-0.004
est-0.004
aff-0.004
umpy-0.004
Token hairy
Token ID43873
Feature activation-1.46e+01

Pos logit contributions
aff+0.197
umm+0.188
andel+0.182
lights+0.173
est+0.166
Neg logit contributions
 managed-0.185
 it-0.178
 wouldn-0.176
 tried-0.176
 going-0.174
Token cater
Token ID20825
Feature activation-1.22e+01

Pos logit contributions
umm+0.226
aff+0.225
andel+0.224
umpy+0.209
ari+0.200
Neg logit contributions
 managed-0.257
 it-0.251
 wouldn-0.247
 tried-0.245
 remembered-0.240
Tokenpillar
Token ID41643
Feature activation-2.97e+00

Pos logit contributions
est+0.120
umm+0.118
andel+0.110
aff+0.099
 sky+0.094
Neg logit contributions
 managed-0.284
 it-0.276
 wouldn-0.267
 him-0.263
pped-0.258
Token and
Token ID290
Feature activation+3.89e+00

Pos logit contributions
andel+0.051
umm+0.050
est+0.049
aff+0.048
 sky+0.046
Neg logit contributions
 managed-0.054
 wouldn-0.048
 never-0.048
 able-0.047
 him-0.047
Token put
Token ID1234
Feature activation+2.04e+00

Pos logit contributions
pped+0.044
 managed+0.041
 risking+0.041
 him+0.040
 preventing+0.040
Neg logit contributions
 sky-0.072
est-0.071
 sun-0.070
 girl-0.069
aff-0.065
Token it
Token ID340
Feature activation+3.96e-01

Pos logit contributions
 managed+0.017
 listened+0.013
 remembered+0.013
 wouldn+0.013
 able+0.013
Neg logit contributions
 sky-0.051
est-0.051
andel-0.049
ay-0.049
 sun-0.048
Token They
Token ID1119
Feature activation+6.17e+00

Pos logit contributions
 managed+0.002
 him+0.002
 never+0.002
 wouldn+0.002
 not+0.002
Neg logit contributions
andel-0.002
est-0.002
umm-0.002
aff-0.002
 sky-0.002
Token decide
Token ID5409
Feature activation+5.48e+00

Pos logit contributions
 managed+0.089
 wouldn+0.077
 able+0.076
 never+0.074
 him+0.073
Neg logit contributions
andel-0.124
est-0.124
umm-0.121
aff-0.121
 sky-0.119
Token to
Token ID284
Feature activation+5.58e+00

Pos logit contributions
 managed+0.090
eling+0.079
 gonna+0.077
 wouldn+0.077
pped+0.077
Neg logit contributions
aff-0.089
 sky-0.086
est-0.081
 sun-0.080
 girl-0.079
Token forgive
Token ID20927
Feature activation+1.29e+00

Pos logit contributions
 managed+0.076
 wouldn+0.065
 able+0.063
 him+0.060
 never+0.060
Neg logit contributions
est-0.118
andel-0.117
umm-0.116
aff-0.114
 sky-0.112
Token him
Token ID683
Feature activation+5.56e-02

Pos logit contributions
 managed+0.012
 able+0.010
 wouldn+0.008
 retrieved+0.007
 never+0.007
Neg logit contributions
est-0.033
aff-0.033
 sky-0.032
andel-0.032
umm-0.031
Token and
Token ID290
Feature activation+5.02e+00

Pos logit contributions
 managed+0.001
 him+0.001
 never+0.001
 wouldn+0.001
 able+0.001
Neg logit contributions
umm-0.001
andel-0.001
est-0.001
aff-0.001
 sky-0.001
Token build
Token ID1382
Feature activation+2.51e+00

Pos logit contributions
 managed+0.074
pped+0.071
 wouldn+0.064
ged+0.063
 able+0.062
Neg logit contributions
est-0.091
 sky-0.090
aff-0.088
 world-0.084
umm-0.084
Token a
Token ID257
Feature activation-4.95e+00

Pos logit contributions
 managed+0.028
 wouldn+0.023
 never+0.022
 able+0.022
 him+0.021
Neg logit contributions
andel-0.060
est-0.060
umm-0.059
aff-0.058
 sky-0.057
Token new
Token ID649
Feature activation-7.94e+00

Pos logit contributions
andel+0.078
est+0.077
umm+0.073
aff+0.072
umpy+0.069
Neg logit contributions
 managed-0.096
 never-0.090
 it-0.088
 not-0.088
 him-0.087
Token dragon
Token ID10441
Feature activation-3.00e+00

Pos logit contributions
andel+0.135
aff+0.123
umm+0.121
umpy+0.116
est+0.105
Neg logit contributions
 never-0.147
 managed-0.146
 it-0.146
 not-0.143
 him-0.141
Token.
Token ID13
Feature activation+1.79e-01

Pos logit contributions
andel+0.054
umm+0.052
est+0.051
aff+0.049
 sky+0.048
Neg logit contributions
 managed-0.054
 never-0.048
 wouldn-0.047
 able-0.047
 him-0.047
Token The
Token ID383
Feature activation-8.13e+00

Pos logit contributions
 managed+0.001
 him+0.001
 it+0.001
 never+0.001
 not+0.001
Neg logit contributions
est-0.001
andel-0.001
umm-0.001
aff-0.001
umpy-0.001
Token boy
Token ID2933
Feature activation+3.15e+00

Pos logit contributions
umm+0.124
aff+0.116
andel+0.116
est+0.113
umpy+0.111
Neg logit contributions
 it-0.152
 never-0.151
 going-0.150
 managed-0.149
 him-0.147
Token was
Token ID373
Feature activation+2.60e+00

Pos logit contributions
 managed+0.038
 able+0.033
 wouldn+0.031
 him+0.031
 preventing+0.030
Neg logit contributions
est-0.069
 sky-0.068
aff-0.066
andel-0.066
umpy-0.064
Token happy
Token ID3772
Feature activation+6.16e-01

Pos logit contributions
 managed+0.041
 wouldn+0.036
 him+0.034
 never+0.034
 able+0.034
Neg logit contributions
andel-0.049
est-0.049
umm-0.048
 sky-0.047
aff-0.047
Token to
Token ID284
Feature activation+4.51e+00

Pos logit contributions
 managed+0.011
 wouldn+0.010
 able+0.009
 never+0.009
pped+0.009
Neg logit contributions
est-0.011
umm-0.010
andel-0.010
 sky-0.010
aff-0.010
Token have
Token ID423
Feature activation+4.91e+00

Pos logit contributions
 managed+0.060
 wouldn+0.053
 able+0.050
 never+0.046
pped+0.045
Neg logit contributions
est-0.098
andel-0.095
 sky-0.093
aff-0.092
umm-0.091
Token his
Token ID465
Feature activation-4.29e+00

Pos logit contributions
 wouldn+0.065
 managed+0.064
 preventing+0.053
plug+0.051
 able+0.051
Neg logit contributions
est-0.101
 sky-0.098
 sun-0.091
 world-0.090
aff-0.090
Token t
Token ID256
Feature activation-6.67e+00

Pos logit contributions
andel+0.071
umm+0.071
aff+0.069
est+0.064
umpy+0.064
Neg logit contributions
 managed-0.079
 him-0.076
 going-0.075
 never-0.075
 not-0.075
Tokeneddy
Token ID21874
Feature activation-3.79e+00

Pos logit contributions
andel+0.079
umm+0.072
 sky+0.072
aff+0.070
est+0.070
Neg logit contributions
 him-0.147
 managed-0.143
 never-0.139
 wouldn-0.137
 able-0.136
Token bear
Token ID6842
Feature activation-4.35e-01

Pos logit contributions
andel+0.066
umm+0.064
aff+0.060
umpy+0.058
 sky+0.054
Neg logit contributions
 managed-0.070
 him-0.069
 wouldn-0.067
 never-0.066
 it-0.065
Token to
Token ID284
Feature activation+4.96e+00

Pos logit contributions
andel+0.009
umm+0.008
aff+0.008
est+0.008
 sky+0.008
Neg logit contributions
 managed-0.007
 never-0.006
 wouldn-0.006
 able-0.006
 him-0.006
Token special
Token ID2041
Feature activation-1.41e+00

Pos logit contributions
 managed+0.012
 able+0.011
 wouldn+0.011
 him+0.011
pped+0.011
Neg logit contributions
andel-0.008
umm-0.008
est-0.008
umpy-0.008
aff-0.008
Token and
Token ID290
Feature activation+4.53e+00

Pos logit contributions
andel+0.025
umm+0.024
est+0.023
aff+0.023
 sky+0.022
Neg logit contributions
 managed-0.025
 wouldn-0.023
 him-0.023
 never-0.023
 it-0.022
Token delicious
Token ID12625
Feature activation-1.97e+00

Pos logit contributions
pped+0.083
 managed+0.083
ged+0.074
 able+0.073
 retrieved+0.072
Neg logit contributions
est-0.065
 sky-0.063
 sun-0.060
umm-0.059
umpy-0.059
Token.
Token ID13
Feature activation+4.93e-01

Pos logit contributions
andel+0.034
est+0.032
umm+0.032
aff+0.031
umpy+0.031
Neg logit contributions
 managed-0.035
 him-0.033
 wouldn-0.032
 it-0.032
 never-0.031
Token She
Token ID1375
Feature activation+5.37e+00

Pos logit contributions
 managed+0.008
 him+0.008
 never+0.008
 it+0.008
 wouldn+0.007
Neg logit contributions
andel-0.009
est-0.009
umm-0.009
aff-0.009
umpy-0.008
Token wonders
Token ID19294
Feature activation+5.59e+00

Pos logit contributions
 managed+0.083
 able+0.075
 paying+0.071
 him+0.071
 wouldn+0.070
Neg logit contributions
est-0.098
 sky-0.097
aff-0.096
andel-0.094
umm-0.093
Token if
Token ID611
Feature activation+4.07e+00

Pos logit contributions
 managed+0.059
 wouldn+0.041
 able+0.040
 retrieved+0.040
 remembered+0.040
Neg logit contributions
andel-0.135
est-0.135
aff-0.131
 sky-0.130
umm-0.126
Token it
Token ID340
Feature activation+3.89e+00

Pos logit contributions
 managed+0.054
 able+0.046
pped+0.045
 retrieved+0.044
 freed+0.043
Neg logit contributions
 sky-0.086
est-0.081
umm-0.080
 world-0.079
andel-0.078
Token tastes
Token ID18221
Feature activation+1.63e+00

Pos logit contributions
 managed+0.065
 able+0.060
pped+0.056
 never+0.054
 paying+0.054
Neg logit contributions
est-0.070
 sky-0.067
umm-0.067
aff-0.066
andel-0.064
Token like
Token ID588
Feature activation+2.79e+00

Pos logit contributions
 managed+0.029
 wouldn+0.025
 never+0.024
 able+0.024
 him+0.024
Neg logit contributions
est-0.028
andel-0.028
umm-0.027
aff-0.027
 sky-0.026
Token gold
Token ID3869
Feature activation-2.97e+00

Pos logit contributions
 managed+0.042
 wouldn+0.035
 able+0.035
 never+0.034
 remembered+0.033
Neg logit contributions
andel-0.056
est-0.055
umm-0.054
 sky-0.054
aff-0.054
Token keep
Token ID1394
Feature activation+3.07e+00

Pos logit contributions
 managed+0.105
 wouldn+0.092
 able+0.088
pped+0.088
ged+0.088
Neg logit contributions
est-0.142
 world-0.131
 sky-0.128
umm-0.127
andel-0.124
Token my
Token ID616
Feature activation-6.04e+00

Pos logit contributions
 managed+0.045
 wouldn+0.038
 remembered+0.033
 freed+0.032
 retrieved+0.032
Neg logit contributions
 world-0.061
est-0.060
 sky-0.060
 sun-0.059
andel-0.058
Token promises
Token ID10497
Feature activation+2.32e+00

Pos logit contributions
aff+0.119
andel+0.113
umm+0.108
umpy+0.103
est+0.094
Neg logit contributions
 managed-0.098
 it-0.093
 never-0.093
 wouldn-0.092
 not-0.091
Token.
Token ID13
Feature activation-2.68e-01

Pos logit contributions
 managed+0.045
 wouldn+0.040
 never+0.040
 able+0.039
 him+0.039
Neg logit contributions
andel-0.036
est-0.036
umm-0.035
aff-0.034
 sky-0.033
Token I
Token ID314
Feature activation+6.09e+00

Pos logit contributions
andel+0.005
est+0.005
umm+0.005
aff+0.005
 sky+0.004
Neg logit contributions
 managed-0.005
 wouldn-0.004
 never-0.004
 him-0.004
 able-0.004
Token will
Token ID481
Feature activation+6.79e+00

Pos logit contributions
 managed+0.088
 able+0.079
 him+0.078
pped+0.077
 wouldn+0.075
Neg logit contributions
est-0.119
umpy-0.112
andel-0.112
 sky-0.111
aff-0.110
Token always
Token ID1464
Feature activation+7.29e+00

Pos logit contributions
 managed+0.092
 wouldn+0.077
 able+0.076
 him+0.074
pped+0.072
Neg logit contributions
est-0.140
andel-0.134
 sky-0.133
aff-0.132
umm-0.131
Token be
Token ID307
Feature activation+1.33e+00

Pos logit contributions
 managed+0.107
 wouldn+0.092
pped+0.092
ged+0.091
 able+0.089
Neg logit contributions
est-0.142
 world-0.132
 sky-0.129
umm-0.126
andel-0.126
Token an
Token ID281
Feature activation-7.41e+00

Pos logit contributions
 managed+0.020
 wouldn+0.019
 him+0.017
pped+0.017
 never+0.017
Neg logit contributions
est-0.025
umpy-0.023
andel-0.023
 world-0.023
 sky-0.023
Token honest
Token ID5508
Feature activation-7.78e+00

Pos logit contributions
aff+0.128
 sky+0.122
umm+0.120
bows+0.119
andel+0.119
Neg logit contributions
 managed-0.126
 never-0.123
 able-0.121
 going-0.116
 sure-0.112
Token and
Token ID290
Feature activation+3.33e+00

Pos logit contributions
andel+0.104
umm+0.099
aff+0.095
 sky+0.084
 Birds+0.081
Neg logit contributions
 managed-0.170
 never-0.162
 remembered-0.160
 it-0.160
 not-0.153
Token the
Token ID262
Feature activation-7.74e+00

Pos logit contributions
 managed+0.034
 wouldn+0.028
 never+0.027
 able+0.026
 him+0.025
Neg logit contributions
andel-0.077
est-0.077
umm-0.076
aff-0.075
 sky-0.073
Token onion
Token ID21670
Feature activation-1.00e+00

Pos logit contributions
andel+0.126
aff+0.122
umm+0.120
est+0.116
umpy+0.112
Neg logit contributions
 managed-0.149
 wouldn-0.135
 him-0.132
 never-0.130
 able-0.129
Token,
Token ID11
Feature activation+4.94e+00

Pos logit contributions
andel+0.020
est+0.020
umm+0.020
aff+0.019
 sky+0.019
Neg logit contributions
 managed-0.015
 wouldn-0.013
 never-0.013
 able-0.013
 him-0.013
Token thanked
Token ID26280
Feature activation+3.87e+00

Pos logit contributions
 managed+0.079
 wouldn+0.072
pped+0.072
 him+0.072
check+0.070
Neg logit contributions
 sky-0.082
andel-0.078
est-0.077
aff-0.073
umm-0.072
Token Mum
Token ID38367
Feature activation-1.27e+00

Pos logit contributions
 managed+0.062
 wouldn+0.054
 able+0.052
 never+0.051
 remembered+0.049
Neg logit contributions
andel-0.074
est-0.074
umm-0.073
aff-0.071
 sky-0.070
Token and
Token ID290
Feature activation+5.16e+00

Pos logit contributions
andel+0.024
umm+0.023
est+0.023
aff+0.023
 sky+0.022
Neg logit contributions
 managed-0.021
 wouldn-0.018
 never-0.018
 him-0.018
 able-0.018
Token went
Token ID1816
Feature activation+3.17e+00

Pos logit contributions
 risking+0.063
pped+0.062
assed+0.058
 preventing+0.058
 managed+0.057
Neg logit contributions
umm-0.092
 girl-0.089
 sky-0.088
est-0.087
 sun-0.087
Token back
Token ID736
Feature activation+1.52e+00

Pos logit contributions
 managed+0.049
 wouldn+0.047
 remembered+0.041
 never+0.040
 listened+0.038
Neg logit contributions
andel-0.059
 sky-0.057
aff-0.056
est-0.054
 world-0.052
Token to
Token ID284
Feature activation+3.93e+00

Pos logit contributions
 managed+0.022
 wouldn+0.021
 never+0.018
pped+0.018
 remembered+0.018
Neg logit contributions
andel-0.030
 sky-0.030
est-0.029
 world-0.028
aff-0.028
Token his
Token ID465
Feature activation-3.73e+00

Pos logit contributions
 wouldn+0.063
 managed+0.063
pped+0.057
w+0.051
bled+0.050
Neg logit contributions
 world-0.063
 sky-0.061
 sun-0.061
 difference-0.058
 little-0.058
Token dinner
Token ID8073
Feature activation-2.88e+00

Pos logit contributions
andel+0.047
est+0.047
umm+0.046
aff+0.044
 sky+0.043
Neg logit contributions
 managed-0.083
 wouldn-0.076
 never-0.075
 able-0.074
 him-0.074
Token drawings
Token ID23388
Feature activation-9.59e-01

Pos logit contributions
andel+0.114
umm+0.112
aff+0.111
est+0.105
umpy+0.104
Neg logit contributions
 managed-0.138
 wouldn-0.130
 never-0.129
 not-0.126
 going-0.125
Token,"
Token ID553
Feature activation+7.18e-01

Pos logit contributions
andel+0.018
umm+0.017
est+0.017
aff+0.017
 sky+0.016
Neg logit contributions
 managed-0.016
 wouldn-0.014
 never-0.014
 able-0.014
 him-0.014
Token Tim
Token ID5045
Feature activation+2.16e-01

Pos logit contributions
 managed+0.009
 able+0.008
 wouldn+0.008
 never+0.008
pped+0.008
Neg logit contributions
est-0.015
andel-0.015
 sky-0.015
umm-0.015
aff-0.014
Tokenmy
Token ID1820
Feature activation+3.57e+00

Pos logit contributions
 managed+0.004
 wouldn+0.003
 never+0.003
 able+0.003
 him+0.003
Neg logit contributions
andel-0.004
umm-0.004
est-0.004
aff-0.004
 sky-0.004
Token said
Token ID531
Feature activation+1.75e+00

Pos logit contributions
 managed+0.043
 wouldn+0.037
 him+0.036
 never+0.035
 able+0.035
Neg logit contributions
est-0.080
andel-0.077
aff-0.075
umm-0.075
 sky-0.075
Token sadly
Token ID21098
Feature activation+2.67e+00

Pos logit contributions
 managed+0.034
 wouldn+0.031
 never+0.030
 him+0.030
 able+0.029
Neg logit contributions
andel-0.027
est-0.027
umm-0.026
aff-0.025
 sky-0.025
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
 managed+0.049
 wouldn+0.044
 able+0.043
 him+0.043
 never+0.043
Neg logit contributions
umm-0.046
andel-0.046
est-0.044
aff-0.043
 sky-0.042
Token
Token ID198
Feature activation-4.26e+00

Pos logit contributions
andel+0.005
est+0.005
umm+0.005
aff+0.005
 sky+0.005
Neg logit contributions
 managed-0.006
 wouldn-0.005
 never-0.005
 able-0.005
 him-0.005
Token
Token ID198
Feature activation-4.01e+00

Pos logit contributions
andel+0.066
est+0.066
umm+0.065
aff+0.063
 sky+0.061
Neg logit contributions
 managed-0.083
 wouldn-0.075
 never-0.073
 able-0.073
 him-0.072
TokenS
Token ID50
Feature activation-7.25e+00

Pos logit contributions
est+0.066
umpy+0.064
andel+0.064
umm+0.063
aff+0.062
Neg logit contributions
 managed-0.071
 wouldn-0.066
 able-0.063
 him-0.062
 going-0.061
Tokenally
Token ID453
Feature activation+3.45e+00

Pos logit contributions
andel+0.113
est+0.110
umm+0.108
aff+0.105
 sky+0.104
Neg logit contributions
 managed-0.142
 wouldn-0.128
 him-0.125
 never-0.124
 able-0.121
Token said
Token ID531
Feature activation+3.30e+00

Pos logit contributions
 managed+0.088
pped+0.088
 preventing+0.085
 risking+0.085
 gonna+0.084
Neg logit contributions
aff-0.089
 sky-0.087
est-0.087
 sun-0.085
 girl-0.084
Token,
Token ID11
Feature activation+3.51e+00

Pos logit contributions
 managed+0.067
 wouldn+0.060
 never+0.060
 able+0.058
 remembered+0.057
Neg logit contributions
est-0.049
andel-0.046
aff-0.045
umm-0.045
 sky-0.042
Token "
Token ID366
Feature activation+2.88e+00

Pos logit contributions
 managed+0.069
 wouldn+0.061
 him+0.059
 able+0.059
pped+0.058
Neg logit contributions
 sky-0.051
aff-0.050
andel-0.050
est-0.050
umm-0.049
TokenI
Token ID40
Feature activation+5.42e+00

Pos logit contributions
 managed+0.045
 never+0.039
 wouldn+0.039
 him+0.038
 able+0.037
Neg logit contributions
andel-0.057
est-0.055
umm-0.054
aff-0.054
 sky-0.052
Token'm
Token ID1101
Feature activation+2.99e+00

Pos logit contributions
 managed+0.084
 able+0.080
pped+0.079
 him+0.079
 saving+0.077
Neg logit contributions
est-0.096
umpy-0.089
aff-0.088
andel-0.088
 sky-0.087
Token glad
Token ID9675
Feature activation+2.92e+00

Pos logit contributions
 managed+0.055
 wouldn+0.052
ged+0.049
 aimed+0.048
 remembered+0.048
Neg logit contributions
andel-0.044
est-0.043
 sky-0.043
 sun-0.041
umpy-0.041
Token you
Token ID345
Feature activation+2.49e+00

Pos logit contributions
 managed+0.038
 wouldn+0.030
 able+0.029
pped+0.028
 remembered+0.028
Neg logit contributions
umm-0.064
est-0.064
 sky-0.062
andel-0.062
aff-0.062
Token like
Token ID588
Feature activation+2.55e+00

Pos logit contributions
 managed+0.035
 able+0.032
 preventing+0.032
 paying+0.031
 him+0.031
Neg logit contributions
est-0.048
aff-0.045
 sky-0.044
umpy-0.043
andel-0.043
Token it
Token ID340
Feature activation+1.70e+00

Pos logit contributions
 managed+0.028
 wouldn+0.021
 remembered+0.020
 retrieved+0.020
ged+0.020
Neg logit contributions
 sky-0.061
 sun-0.060
 world-0.059
est-0.059
umm-0.058
Token,
Token ID11
Feature activation+2.81e+00

Pos logit contributions
 managed+0.031
 wouldn+0.027
 never+0.027
 able+0.027
 him+0.026
Neg logit contributions
est-0.029
andel-0.028
umm-0.028
aff-0.027
 sky-0.027
Token Tim
Token ID5045
Feature activation-1.70e+00

Pos logit contributions
 managed+0.034
 wouldn+0.028
 never+0.027
 able+0.027
 him+0.027
Neg logit contributions
andel-0.065
est-0.064
umm-0.064
aff-0.063
 sky-0.062
Token put
Token ID1234
Feature activation+1.39e+00

Pos logit contributions
 saving+0.037
 able+0.037
 paying+0.035
 preventing+0.035
 managed+0.035
Neg logit contributions
 sky-0.069
est-0.067
andel-0.062
umpy-0.062
 world-0.061
Token her
Token ID607
Feature activation-4.70e+00

Pos logit contributions
 managed+0.018
 able+0.015
 never+0.015
 wouldn+0.014
 paying+0.014
Neg logit contributions
est-0.031
 sky-0.030
andel-0.030
umm-0.028
ay-0.027
Token arms
Token ID5101
Feature activation-1.40e+00

Pos logit contributions
aff+0.080
umm+0.079
andel+0.078
est+0.074
umpy+0.070
Neg logit contributions
 managed-0.088
 wouldn-0.082
 never-0.080
 not-0.079
 it-0.078
Token around
Token ID1088
Feature activation+1.28e+00

Pos logit contributions
est+0.026
andel+0.025
 sky+0.025
umpy+0.023
umm+0.023
Neg logit contributions
 managed-0.023
 wouldn-0.020
 him-0.019
 able-0.019
 remembered-0.019
Token her
Token ID607
Feature activation-5.98e+00

Pos logit contributions
 managed+0.015
 able+0.012
 wouldn+0.012
 never+0.011
 remembered+0.011
Neg logit contributions
andel-0.030
est-0.030
 sky-0.029
umm-0.028
aff-0.028
Token knees
Token ID14475
Feature activation+5.55e-01

Pos logit contributions
andel+0.104
umm+0.104
aff+0.100
est+0.094
umpy+0.091
Neg logit contributions
 managed-0.110
 wouldn-0.104
 never-0.103
 not-0.100
 him-0.100
Token and
Token ID290
Feature activation+3.26e+00

Pos logit contributions
 managed+0.010
 wouldn+0.009
 able+0.009
 never+0.009
 him+0.009
Neg logit contributions
est-0.009
andel-0.009
 sky-0.009
umm-0.009
aff-0.009
Token rolled
Token ID11686
Feature activation+1.44e-02

Pos logit contributions
 managed+0.045
pped+0.044
 resolved+0.042
 able+0.042
 gonna+0.041
Neg logit contributions
 sky-0.059
est-0.053
 sun-0.052
 girl-0.050
 voice-0.048
Token down
Token ID866
Feature activation+7.22e-01

Pos logit contributions
 managed+0.000
 wouldn+0.000
 able+0.000
 remembered+0.000
 never+0.000
Neg logit contributions
est-0.000
andel-0.000
 sky-0.000
umm-0.000
aff-0.000
Token the
Token ID262
Feature activation-8.74e+00

Pos logit contributions
 managed+0.010
 wouldn+0.009
 able+0.008
 remembered+0.008
 never+0.008
Neg logit contributions
andel-0.015
est-0.015
 sky-0.015
umm-0.014
aff-0.014
Token hill
Token ID12788
Feature activation-8.22e-01

Pos logit contributions
andel+0.133
aff+0.131
umm+0.122
est+0.117
umpy+0.103
Neg logit contributions
 managed-0.178
 it-0.170
 never-0.170
 wouldn-0.164
 going-0.161
Token
Token ID198
Feature activation-4.35e+00

Pos logit contributions
umm+0.093
andel+0.092
est+0.090
aff+0.084
 sky+0.081
Neg logit contributions
 managed-0.078
 never-0.072
 him-0.072
 it-0.070
 not-0.069
TokenAs
Token ID1722
Feature activation+9.42e-01

Pos logit contributions
est+0.070
andel+0.070
umm+0.068
umpy+0.067
aff+0.067
Neg logit contributions
 managed-0.080
 wouldn-0.072
 able-0.071
 him-0.069
 never-0.067
Token they
Token ID484
Feature activation+2.91e+00

Pos logit contributions
 managed+0.012
 wouldn+0.010
 able+0.010
pped+0.009
check+0.009
Neg logit contributions
 sky-0.021
 sun-0.020
andel-0.019
est-0.019
umm-0.019
Token were
Token ID547
Feature activation+1.30e+00

Pos logit contributions
 managed+0.032
 able+0.027
 wouldn+0.026
 not+0.025
 preventing+0.025
Neg logit contributions
est-0.067
andel-0.067
 sky-0.065
aff-0.064
umm-0.063
Token walking
Token ID6155
Feature activation+6.43e-01

Pos logit contributions
 managed+0.020
 wouldn+0.018
check+0.015
 remembered+0.015
 him+0.015
Neg logit contributions
andel-0.025
 sky-0.024
aff-0.022
est-0.022
umpy-0.021
Token to
Token ID284
Feature activation+2.65e+00

Pos logit contributions
 managed+0.010
 wouldn+0.008
 him+0.008
 never+0.008
 able+0.008
Neg logit contributions
andel-0.013
umm-0.013
est-0.013
aff-0.013
 sky-0.012
Token the
Token ID262
Feature activation-9.04e+00

Pos logit contributions
 managed+0.038
 wouldn+0.033
 able+0.030
pped+0.029
 remembered+0.028
Neg logit contributions
andel-0.055
est-0.054
 sky-0.054
 world-0.053
umm-0.052
Token car
Token ID1097
Feature activation-1.46e+00

Pos logit contributions
andel+0.138
est+0.138
umm+0.135
aff+0.133
umpy+0.124
Neg logit contributions
 never-0.175
 managed-0.173
 going-0.168
 him-0.167
 not-0.166
Token,
Token ID11
Feature activation+1.44e+00

Pos logit contributions
andel+0.025
est+0.025
umm+0.025
aff+0.024
 sky+0.024
Neg logit contributions
 managed-0.026
 wouldn-0.023
 never-0.022
 able-0.022
 him-0.022
Token Tim
Token ID5045
Feature activation-1.18e+00

Pos logit contributions
 managed+0.021
 able+0.018
 wouldn+0.017
pped+0.016
check+0.016
Neg logit contributions
 sky-0.028
est-0.028
andel-0.026
umm-0.026
 sun-0.026
Tokenmy
Token ID1820
Feature activation+2.10e+00

Pos logit contributions
est+0.018
 sky+0.016
andel+0.016
umpy+0.016
umm+0.016
Neg logit contributions
 managed-0.023
 able-0.021
 him-0.021
 wouldn-0.021
 it-0.020
Token in
Token ID287
Feature activation+9.64e-01

Pos logit contributions
umm+0.087
andel+0.085
est+0.079
aff+0.079
umpy+0.075
Neg logit contributions
 managed-0.058
 it-0.056
 never-0.054
 him-0.053
 not-0.052
Token the
Token ID262
Feature activation-8.19e+00

Pos logit contributions
 managed+0.010
 wouldn+0.009
 able+0.008
 never+0.008
 him+0.007
Neg logit contributions
andel-0.023
est-0.023
umm-0.023
 sky-0.023
aff-0.022
Token snow
Token ID6729
Feature activation-4.26e+00

Pos logit contributions
andel+0.125
aff+0.124
umm+0.120
est+0.117
umpy+0.109
Neg logit contributions
 managed-0.159
 never-0.152
 him-0.148
 going-0.148
 it-0.145
Token.
Token ID13
Feature activation-3.87e-01

Pos logit contributions
andel+0.078
umm+0.072
est+0.071
aff+0.070
umpy+0.069
Neg logit contributions
 managed-0.073
 never-0.067
 him-0.067
 wouldn-0.067
 it-0.064
Token They
Token ID1119
Feature activation+5.09e+00

Pos logit contributions
est+0.008
andel+0.008
umm+0.008
aff+0.007
umpy+0.007
Neg logit contributions
 managed-0.005
 it-0.005
 him-0.005
 not-0.005
 never-0.005
Token are
Token ID389
Feature activation+1.51e+00

Pos logit contributions
 managed+0.075
 able+0.067
 wouldn+0.065
 him+0.062
pped+0.061
Neg logit contributions
est-0.097
andel-0.096
 sky-0.096
aff-0.094
umm-0.093
Token happy
Token ID3772
Feature activation-1.09e+00

Pos logit contributions
 managed+0.026
 wouldn+0.024
 him+0.023
pped+0.022
 retrieved+0.021
Neg logit contributions
est-0.025
 sky-0.025
andel-0.024
umpy-0.023
 world-0.023
Token and
Token ID290
Feature activation+2.42e+00

Pos logit contributions
andel+0.019
est+0.019
umm+0.019
aff+0.018
umpy+0.017
Neg logit contributions
 managed-0.020
 never-0.017
 him-0.017
 wouldn-0.017
 able-0.017
Token fair
Token ID3148
Feature activation-1.53e+00

Pos logit contributions
 managed+0.049
pped+0.047
 wouldn+0.045
 him+0.044
 able+0.042
Neg logit contributions
 sky-0.032
est-0.031
 sun-0.030
umm-0.030
umpy-0.030
Token.
Token ID13
Feature activation-5.27e-01

Pos logit contributions
andel+0.024
umm+0.024
est+0.024
aff+0.023
 sky+0.022
Neg logit contributions
 managed-0.029
 wouldn-0.026
 never-0.026
 him-0.026
 able-0.026
Token<|endoftext|>
Token ID50256
Feature activation-8.32e+00

Pos logit contributions
est+0.012
andel+0.012
umm+0.011
aff+0.011
umpy+0.010
Neg logit contributions
 it-0.007
 managed-0.007
 not-0.007
 never-0.007
 him-0.006
Token Fred
Token ID8559
Feature activation+1.14e+00

Pos logit contributions
 managed+0.046
 wouldn+0.040
pped+0.038
 able+0.037
plug+0.037
Neg logit contributions
 sky-0.090
 world-0.086
est-0.086
umm-0.083
andel-0.082
Token won
Token ID1839
Feature activation+6.89e+00

Pos logit contributions
 managed+0.013
 able+0.011
 wouldn+0.010
 preventing+0.010
pped+0.010
Neg logit contributions
est-0.026
aff-0.025
 sky-0.025
andel-0.024
umm-0.024
Token each
Token ID1123
Feature activation-1.10e+00

Pos logit contributions
 managed+0.091
 wouldn+0.076
 able+0.074
 remembered+0.072
 never+0.071
Neg logit contributions
est-0.147
andel-0.147
 sky-0.143
aff-0.142
umm-0.141
Token game
Token ID983
Feature activation+1.11e+00

Pos logit contributions
andel+0.018
est+0.018
umm+0.018
aff+0.017
 sky+0.017
Neg logit contributions
 managed-0.021
 wouldn-0.018
 never-0.018
 able-0.018
 him-0.018
Token.
Token ID13
Feature activation-7.27e-01

Pos logit contributions
 managed+0.022
 wouldn+0.020
 able+0.020
 never+0.020
 remembered+0.020
Neg logit contributions
est-0.016
andel-0.015
umm-0.015
aff-0.015
 sky-0.015
Token
Token ID220
Feature activation-2.33e+00

Pos logit contributions
andel+0.014
est+0.014
umm+0.013
aff+0.013
umpy+0.013
Neg logit contributions
 managed-0.012
 him-0.011
 never-0.010
 it-0.010
 wouldn-0.010
Token
Token ID198
Feature activation-3.87e+00

Pos logit contributions
andel+0.040
umm+0.039
est+0.039
aff+0.037
 sky+0.036
Neg logit contributions
 managed-0.042
 him-0.038
 wouldn-0.037
 never-0.037
 able-0.037
Token
Token ID198
Feature activation-3.77e+00

Pos logit contributions
umm+0.070
andel+0.070
est+0.069
aff+0.064
 sky+0.061
Neg logit contributions
 managed-0.068
 never-0.061
 him-0.061
 it-0.059
 not-0.058
TokenOne
Token ID3198
Feature activation-4.26e+00

Pos logit contributions
andel+0.051
est+0.050
umm+0.050
aff+0.048
 sky+0.047
Neg logit contributions
 managed-0.081
 wouldn-0.074
 able-0.072
 never-0.072
 him-0.071
Token day
Token ID1110
Feature activation-1.07e-02

Pos logit contributions
aff+0.069
umm+0.068
andel+0.065
est+0.062
 sky+0.057
Neg logit contributions
 managed-0.081
 listened-0.076
 it-0.075
 never-0.075
 not-0.073
Token,
Token ID11
Feature activation+1.23e+00

Pos logit contributions
est+0.000
andel+0.000
umm+0.000
 sky+0.000
aff+0.000
Neg logit contributions
 managed-0.000
 wouldn-0.000
 never-0.000
 able-0.000
 him-0.000
Token the
Token ID262
Feature activation-9.93e+00

Pos logit contributions
 managed+0.014
 wouldn+0.012
 able+0.011
pped+0.011
 never+0.010
Neg logit contributions
 sky-0.022
andel-0.022
est-0.022
aff-0.021
 world-0.021
Token park
Token ID3952
Feature activation-9.98e-01

Pos logit contributions
andel+0.134
est+0.133
umm+0.127
aff+0.125
umpy+0.116
Neg logit contributions
 managed-0.213
 never-0.203
 him-0.199
 not-0.197
 going-0.197
Token.
Token ID13
Feature activation-1.85e+00

Pos logit contributions
andel+0.023
umm+0.023
est+0.022
aff+0.022
 sky+0.021
Neg logit contributions
 managed-0.013
 never-0.011
 wouldn-0.011
 him-0.010
 able-0.010
Token Sam
Token ID3409
Feature activation+1.76e+00

Pos logit contributions
andel+0.033
est+0.032
umm+0.032
aff+0.031
umpy+0.030
Neg logit contributions
 managed-0.032
 him-0.029
 wouldn-0.028
 never-0.028
 not-0.028
Token had
Token ID550
Feature activation+2.78e+00

Pos logit contributions
 able+0.024
 managed+0.024
 preventing+0.022
pped+0.022
 him+0.021
Neg logit contributions
 sky-0.033
est-0.032
 world-0.031
umpy-0.030
aff-0.030
Token a
Token ID257
Feature activation-5.38e+00

Pos logit contributions
 managed+0.045
 wouldn+0.045
ged+0.041
pped+0.041
eling+0.041
Neg logit contributions
 sky-0.049
est-0.044
 sun-0.043
 world-0.043
umpy-0.039
Token snack
Token ID26906
Feature activation-3.06e+00

Pos logit contributions
andel+0.078
est+0.077
umm+0.076
aff+0.074
 sky+0.069
Neg logit contributions
 managed-0.112
 wouldn-0.101
 never-0.101
 able-0.098
 him-0.098
Token in
Token ID287
Feature activation+5.20e-01

Pos logit contributions
est+0.055
andel+0.055
umm+0.054
aff+0.053
 sky+0.053
Neg logit contributions
 managed-0.052
 wouldn-0.045
 never-0.044
 able-0.044
 him-0.043
Token his
Token ID465
Feature activation-6.43e+00

Pos logit contributions
 managed+0.005
 wouldn+0.004
 able+0.004
 remembered+0.003
 never+0.003
Neg logit contributions
est-0.013
 sky-0.013
andel-0.013
umm-0.012
aff-0.012
Token bag
Token ID6131
Feature activation-2.07e+00

Pos logit contributions
andel+0.085
umm+0.082
est+0.081
aff+0.080
umpy+0.073
Neg logit contributions
 managed-0.142
 never-0.130
 wouldn-0.129
 him-0.128
 able-0.127
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
andel+0.033
umm+0.032
est+0.032
aff+0.031
 sky+0.030
Neg logit contributions
 managed-0.040
 wouldn-0.036
 never-0.036
 able-0.035
 him-0.035
Token who
Token ID508
Feature activation+4.46e+00

Pos logit contributions
andel+0.001
umm+0.001
est+0.001
aff+0.001
 sky+0.001
Neg logit contributions
 managed-0.001
 wouldn-0.001
 never-0.001
 him-0.001
 able-0.001
Token was
Token ID373
Feature activation+1.14e+00

Pos logit contributions
 managed+0.057
 able+0.054
 gonna+0.049
pped+0.048
 paying+0.046
Neg logit contributions
est-0.087
 sky-0.087
 world-0.082
aff-0.082
bird-0.081
Token a
Token ID257
Feature activation-5.57e+00

Pos logit contributions
 managed+0.015
 wouldn+0.013
 him+0.012
 never+0.011
 remembered+0.011
Neg logit contributions
est-0.024
andel-0.023
 sky-0.023
 world-0.023
umpy-0.023
Token ha
Token ID387
Feature activation-4.22e+00

Pos logit contributions
andel+0.079
umm+0.078
est+0.076
aff+0.075
 sky+0.069
Neg logit contributions
 managed-0.120
 wouldn-0.108
 never-0.108
 able-0.105
 remembered-0.105
Tokenird
Token ID1447
Feature activation-4.89e+00

Pos logit contributions
est+0.063
andel+0.063
umm+0.061
aff+0.060
 sky+0.058
Neg logit contributions
 managed-0.083
 wouldn-0.075
 never-0.074
 him-0.073
 able-0.073
Tokenress
Token ID601
Feature activation-5.39e+00

Pos logit contributions
andel+0.072
est+0.071
umm+0.070
aff+0.068
 sky+0.066
Neg logit contributions
 managed-0.099
 wouldn-0.089
 never-0.088
 him-0.087
 able-0.087
Tokener
Token ID263
Feature activation-6.53e-01

Pos logit contributions
andel+0.073
umm+0.071
aff+0.066
 sky+0.066
est+0.064
Neg logit contributions
 managed-0.119
 wouldn-0.109
 able-0.109
 never-0.108
 him-0.107
Token and
Token ID290
Feature activation+4.30e+00

Pos logit contributions
andel+0.012
umm+0.011
est+0.011
aff+0.011
 sky+0.011
Neg logit contributions
 managed-0.011
 wouldn-0.010
 never-0.010
 able-0.010
 him-0.010
Token asked
Token ID1965
Feature activation+4.34e+00

Pos logit contributions
pped+0.049
 managed+0.047
assed+0.045
eling+0.044
 him+0.044
Neg logit contributions
 girl-0.081
est-0.078
 sky-0.077
 little-0.074
 sun-0.072
Token for
Token ID329
Feature activation+4.16e+00

Pos logit contributions
 managed+0.052
 able+0.040
 wouldn+0.040
 never+0.040
 listened+0.037
Neg logit contributions
est-0.100
andel-0.099
aff-0.096
umm-0.094
 sky-0.093
Token help
Token ID1037
Feature activation+1.36e+00

Pos logit contributions
 managed+0.070
 wouldn+0.065
 remembered+0.057
 retrieved+0.056
 able+0.055
Neg logit contributions
andel-0.068
est-0.064
umm-0.061
 sun-0.061
 sky-0.061
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
andel+0.036
umm+0.035
est+0.034
aff+0.034
umpy+0.032
Neg logit contributions
 managed-0.037
 wouldn-0.033
 never-0.033
 able-0.032
 him-0.032
Token<|endoftext|>
Token ID50256
Feature activation-6.94e+00

Pos logit contributions
andel+0.022
est+0.021
umm+0.020
aff+0.020
umpy+0.019
Neg logit contributions
 managed-0.017
 never-0.016
 it-0.016
 him-0.015
 wouldn-0.015
TokenOnce
Token ID7454
Feature activation-6.79e+00

Pos logit contributions
andel+0.141
umm+0.137
est+0.130
aff+0.129
umpy+0.121
Neg logit contributions
 managed-0.108
 him-0.104
 it-0.101
 never-0.094
 wouldn-0.090
Token upon
Token ID2402
Feature activation-2.88e-01

Pos logit contributions
umm+0.133
andel+0.133
est+0.130
aff+0.127
 sky+0.122
Neg logit contributions
 managed-0.105
 wouldn-0.093
 him-0.091
 never-0.091
 it-0.089
Token a
Token ID257
Feature activation-8.20e+00

Pos logit contributions
est+0.006
andel+0.006
 sky+0.006
umm+0.006
aff+0.006
Neg logit contributions
 managed-0.004
 wouldn-0.003
 able-0.003
 never-0.003
 him-0.003
Token time
Token ID640
Feature activation-3.34e+00

Pos logit contributions
umm+0.140
andel+0.138
aff+0.135
est+0.131
 sky+0.123
Neg logit contributions
 managed-0.153
 never-0.151
 not-0.143
 listened-0.136
 him-0.136
Token there
Token ID612
Feature activation+7.92e-01

Pos logit contributions
andel+0.061
umm+0.060
est+0.059
aff+0.058
 sky+0.056
Neg logit contributions
 managed-0.056
 wouldn-0.050
 never-0.049
 him-0.049
 able-0.048
Token was
Token ID373
Feature activation-2.74e+00

Pos logit contributions
 managed+0.011
 able+0.010
 wouldn+0.009
 paying+0.009
 never+0.009
Neg logit contributions
est-0.017
 sky-0.016
umm-0.015
andel-0.015
 world-0.015
Token a
Token ID257
Feature activation-7.41e+00

Pos logit contributions
umm+0.073
andel+0.073
aff+0.072
est+0.071
umpy+0.069
Neg logit contributions
 managed-0.024
 him-0.020
 never-0.019
 it-0.019
 going-0.018
Token little
Token ID1310
Feature activation-8.33e+00

Pos logit contributions
est+0.117
umm+0.117
andel+0.111
aff+0.106
bows+0.098
Neg logit contributions
 managed-0.143
 it-0.141
 not-0.138
 never-0.138
 remembered-0.138
Token girl
Token ID2576
Feature activation-2.26e+00

Pos logit contributions
umm+0.109
andel+0.109
est+0.109
aff+0.108
umpy+0.098
Neg logit contributions
 it-0.169
 managed-0.155
 tried-0.153
 him-0.151
 they-0.151
Token,
Token ID11
Feature activation+2.67e+00

Pos logit contributions
est+0.011
andel+0.011
umm+0.011
aff+0.011
 sky+0.011
Neg logit contributions
 managed-0.009
 wouldn-0.008
 never-0.008
 able-0.008
 him-0.007
Token it
Token ID340
Feature activation+3.65e+00

Pos logit contributions
 managed+0.026
pped+0.020
 able+0.020
 retrieved+0.020
 wouldn+0.019
Neg logit contributions
 sky-0.067
est-0.066
umm-0.063
 world-0.063
 sun-0.061
Token shot
Token ID2823
Feature activation+6.27e-01

Pos logit contributions
pped+0.048
ged+0.048
 managed+0.047
 paying+0.046
 able+0.046
Neg logit contributions
est-0.074
 sky-0.073
 world-0.068
umm-0.065
umpy-0.063
Token up
Token ID510
Feature activation+1.66e-01

Pos logit contributions
 managed+0.009
 able+0.007
 wouldn+0.007
 never+0.007
 paying+0.007
Neg logit contributions
est-0.013
andel-0.013
 sky-0.013
umm-0.013
aff-0.012
Token in
Token ID287
Feature activation+2.30e+00

Pos logit contributions
 managed+0.003
 wouldn+0.002
 able+0.002
 remembered+0.002
 never+0.002
Neg logit contributions
est-0.003
 sky-0.003
andel-0.003
umm-0.003
aff-0.003
Token the
Token ID262
Feature activation-7.65e+00

Pos logit contributions
 managed+0.027
 wouldn+0.022
 able+0.021
 never+0.021
 him+0.020
Neg logit contributions
andel-0.054
est-0.054
umm-0.053
 sky-0.052
aff-0.052
Token sky
Token ID6766
Feature activation-1.15e+00

Pos logit contributions
aff+0.125
andel+0.118
umm+0.105
umpy+0.100
affe+0.098
Neg logit contributions
 never-0.142
 managed-0.140
 wouldn-0.138
 him-0.137
 listened-0.134
Token and
Token ID290
Feature activation+3.86e+00

Pos logit contributions
andel+0.018
est+0.018
umm+0.018
aff+0.017
 sky+0.017
Neg logit contributions
 managed-0.022
 wouldn-0.020
 never-0.020
 able-0.019
 him-0.019
Token created
Token ID2727
Feature activation+2.25e+00

Pos logit contributions
 managed+0.042
pped+0.037
 able+0.035
ged+0.031
 retrieved+0.031
Neg logit contributions
 sky-0.089
est-0.083
 sun-0.080
umm-0.079
 world-0.078
Token the
Token ID262
Feature activation-8.28e+00

Pos logit contributions
 managed+0.030
 wouldn+0.026
 able+0.024
 never+0.024
 remembered+0.024
Neg logit contributions
est-0.049
andel-0.048
umm-0.047
 sky-0.047
aff-0.046
Token bright
Token ID6016
Feature activation-8.56e+00

Pos logit contributions
andel+0.141
aff+0.139
umpy+0.125
umm+0.121
igsaw+0.118
Neg logit contributions
 never-0.150
 going-0.144
 managed-0.143
 not-0.142
 wouldn-0.141
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
 managed+0.032
 wouldn+0.029
 never+0.029
 remembered+0.029
 able+0.028
Neg logit contributions
est-0.014
andel-0.012
aff-0.012
 sky-0.012
umm-0.012
Token He
Token ID679
Feature activation+3.52e+00

Pos logit contributions
andel+0.024
est+0.023
umm+0.022
aff+0.022
umpy+0.021
Neg logit contributions
 managed-0.018
 him-0.017
 it-0.016
 wouldn-0.016
 not-0.015
Token decided
Token ID3066
Feature activation+3.15e+00

Pos logit contributions
 saving+0.039
 him+0.038
 preventing+0.038
 risking+0.037
 paying+0.037
Neg logit contributions
est-0.074
 sky-0.071
aff-0.068
umpy-0.067
andel-0.065
Token to
Token ID284
Feature activation+4.44e+00

Pos logit contributions
 managed+0.054
eling+0.051
pped+0.050
eks+0.048
 freed+0.048
Neg logit contributions
 sky-0.046
est-0.042
 world-0.042
aff-0.041
 sun-0.040
Token measure
Token ID3953
Feature activation+1.57e+00

Pos logit contributions
 managed+0.073
 wouldn+0.065
 able+0.063
 remembered+0.060
pped+0.060
Neg logit contributions
est-0.080
umm-0.078
andel-0.076
 sky-0.076
aff-0.074
Token the
Token ID262
Feature activation-9.99e+00

Pos logit contributions
 managed+0.018
 wouldn+0.015
 able+0.014
 never+0.014
 remembered+0.014
Neg logit contributions
est-0.037
andel-0.036
 sky-0.035
umm-0.035
aff-0.034
Token warmth
Token ID23125
Feature activation-2.85e+00

Pos logit contributions
andel+0.117
aff+0.113
umm+0.103
umpy+0.102
est+0.095
Neg logit contributions
 managed-0.230
 going-0.221
 never-0.221
 it-0.217
 wouldn-0.217
Token of
Token ID286
Feature activation-1.40e+00

Pos logit contributions
andel+0.045
est+0.044
umm+0.044
aff+0.043
 sky+0.041
Neg logit contributions
 managed-0.054
 wouldn-0.049
 never-0.048
 him-0.048
 able-0.048
Token the
Token ID262
Feature activation-1.09e+01

Pos logit contributions
est+0.030
andel+0.030
 sky+0.030
umm+0.030
aff+0.028
Neg logit contributions
 managed-0.019
 wouldn-0.016
 able-0.015
 never-0.015
 remembered-0.014
Token sun
Token ID4252
Feature activation-1.35e+00

Pos logit contributions
andel+0.123
aff+0.123
umm+0.109
umpy+0.107
est+0.099
Neg logit contributions
 managed-0.252
 going-0.243
 never-0.243
 it-0.240
 wouldn-0.239
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
est+0.022
umm+0.022
andel+0.022
 sky+0.021
aff+0.021
Neg logit contributions
 managed-0.025
 wouldn-0.022
 never-0.022
 able-0.022
 him-0.021
Tokenily
Token ID813
Feature activation+2.10e+00

Pos logit contributions
andel+0.115
umm+0.115
est+0.111
 sky+0.108
aff+0.108
Neg logit contributions
 managed-0.134
 wouldn-0.120
 never-0.117
 able-0.117
 him-0.116
Token finally
Token ID3443
Feature activation+3.57e+00

Pos logit contributions
 managed+0.026
 able+0.023
 him+0.022
 wouldn+0.022
 going+0.022
Neg logit contributions
est-0.045
 sky-0.043
aff-0.043
andel-0.043
umm-0.042
Token gave
Token ID2921
Feature activation-3.03e-02

Pos logit contributions
 managed+0.042
 wouldn+0.039
 him+0.037
 saving+0.036
 never+0.036
Neg logit contributions
est-0.077
andel-0.076
aff-0.075
 sky-0.073
umm-0.073
Token her
Token ID607
Feature activation-5.40e+00

Pos logit contributions
est+0.001
andel+0.001
umm+0.001
aff+0.001
 sky+0.001
Neg logit contributions
 managed-0.000
 wouldn-0.000
 never-0.000
 able-0.000
 remembered-0.000
Token mom
Token ID1995
Feature activation-2.01e+00

Pos logit contributions
umm+0.102
andel+0.099
aff+0.099
umpy+0.093
est+0.092
Neg logit contributions
 managed-0.092
 it-0.085
 wouldn-0.083
 able-0.082
 not-0.082
Token the
Token ID262
Feature activation-1.06e+01

Pos logit contributions
umm+0.058
andel+0.054
umpy+0.053
aff+0.052
est+0.051
Neg logit contributions
 managed-0.017
 him-0.016
 never-0.015
 able-0.014
 it-0.014
Token teaspoon
Token ID22326
Feature activation-3.74e+00

Pos logit contributions
aff+0.184
andel+0.180
umm+0.172
est+0.167
umpy+0.164
Neg logit contributions
 managed-0.194
 wouldn-0.178
 it-0.177
 never-0.175
 not-0.175
Token and
Token ID290
Feature activation+3.93e+00

Pos logit contributions
andel+0.074
umm+0.074
est+0.072
aff+0.072
 sky+0.069
Neg logit contributions
 managed-0.057
 wouldn-0.050
 never-0.049
 able-0.049
 him-0.049
Token her
Token ID607
Feature activation-4.89e+00

Pos logit contributions
pped+0.050
 gonna+0.049
 risking+0.047
 managed+0.046
pping+0.045
Neg logit contributions
 girl-0.072
est-0.068
 sky-0.065
 sun-0.064
 little-0.063
Token mom
Token ID1995
Feature activation+1.46e+00

Pos logit contributions
andel+0.108
aff+0.104
umm+0.101
umpy+0.100
est+0.096
Neg logit contributions
 managed-0.072
 it-0.069
 not-0.068
 wouldn-0.068
 never-0.066
Token said
Token ID531
Feature activation+1.84e+00

Pos logit contributions
 managed+0.016
 gonna+0.014
 preventing+0.013
 wouldn+0.013
 never+0.013
Neg logit contributions
est-0.033
andel-0.030
aff-0.030
umm-0.030
 sky-0.030
Token wanted
Token ID2227
Feature activation+3.80e+00

Pos logit contributions
 managed+0.043
 able+0.043
 him+0.043
 saving+0.042
 preventing+0.041
Neg logit contributions
 sky-0.091
est-0.087
aff-0.085
umpy-0.083
andel-0.081
Token to
Token ID284
Feature activation+4.09e+00

Pos logit contributions
 managed+0.047
 wouldn+0.040
 able+0.037
 remembered+0.035
pped+0.035
Neg logit contributions
 sky-0.084
andel-0.084
est-0.083
aff-0.083
umm-0.081
Token slide
Token ID10649
Feature activation-7.53e-01

Pos logit contributions
 managed+0.055
 wouldn+0.047
 never+0.046
 him+0.045
 able+0.045
Neg logit contributions
andel-0.089
est-0.088
umm-0.088
aff-0.086
 sky-0.084
Token down
Token ID866
Feature activation+9.06e-01

Pos logit contributions
andel+0.017
umm+0.017
est+0.017
aff+0.017
 sky+0.016
Neg logit contributions
 managed-0.009
 wouldn-0.008
 him-0.008
 never-0.008
 able-0.007
Token with
Token ID351
Feature activation+1.86e+00

Pos logit contributions
 managed+0.011
 wouldn+0.009
 able+0.008
 never+0.008
 him+0.008
Neg logit contributions
andel-0.021
est-0.021
umm-0.021
aff-0.020
 sky-0.020
Token the
Token ID262
Feature activation-9.87e+00

Pos logit contributions
 managed+0.017
 wouldn+0.013
 able+0.012
 never+0.012
 remembered+0.011
Neg logit contributions
andel-0.048
est-0.048
umm-0.048
 sky-0.047
aff-0.047
Token gem
Token ID16840
Feature activation-2.39e+00

Pos logit contributions
andel+0.174
umm+0.159
est+0.155
aff+0.154
umpy+0.153
Neg logit contributions
 managed-0.179
 it-0.169
 never-0.168
 not-0.164
 him-0.163
Token,
Token ID11
Feature activation+3.61e+00

Pos logit contributions
andel+0.044
umm+0.042
est+0.041
aff+0.041
umpy+0.040
Neg logit contributions
 managed-0.041
 wouldn-0.037
 never-0.036
 able-0.036
 him-0.035
Token to
Token ID284
Feature activation+3.71e+00

Pos logit contributions
 managed+0.073
pped+0.065
plug+0.063
 able+0.063
 wouldn+0.063
Neg logit contributions
 sky-0.051
est-0.048
 world-0.044
andel-0.043
 sun-0.041
Token feel
Token ID1254
Feature activation+1.83e+00

Pos logit contributions
 managed+0.053
 wouldn+0.046
 able+0.044
 remembered+0.042
pped+0.040
Neg logit contributions
est-0.074
 sky-0.073
andel-0.073
aff-0.070
umm-0.069
Token the
Token ID262
Feature activation-7.88e+00

Pos logit contributions
 managed+0.022
 wouldn+0.019
 listened+0.017
 remembered+0.017
 never+0.017
Neg logit contributions
andel-0.041
est-0.041
 sky-0.040
umpy-0.038
 world-0.038
Token,
Token ID11
Feature activation+2.05e+00

Pos logit contributions
 managed+0.027
 wouldn+0.024
 never+0.024
 able+0.023
 him+0.022
Neg logit contributions
est-0.027
andel-0.027
umm-0.026
aff-0.026
 sky-0.025
Token Jen
Token ID13364
Feature activation-1.84e+00

Pos logit contributions
 managed+0.029
 wouldn+0.025
 able+0.025
 never+0.024
 him+0.023
Neg logit contributions
est-0.042
andel-0.042
umm-0.041
 sky-0.040
aff-0.040
Token,
Token ID11
Feature activation+4.14e+00

Pos logit contributions
est+0.030
andel+0.030
umm+0.029
aff+0.029
 sky+0.028
Neg logit contributions
 managed-0.034
 wouldn-0.030
 able-0.029
 never-0.029
 him-0.029
Token I
Token ID314
Feature activation+6.25e+00

Pos logit contributions
 managed+0.060
 wouldn+0.052
 able+0.049
 retrieved+0.047
 remembered+0.046
Neg logit contributions
 sky-0.081
est-0.080
andel-0.078
 sun-0.077
umm-0.076
Token found
Token ID1043
Feature activation+3.08e+00

Pos logit contributions
 him+0.065
 able+0.064
 managed+0.062
 preventing+0.061
 saving+0.060
Neg logit contributions
est-0.133
umpy-0.131
 sky-0.128
 world-0.124
ay-0.123
Token the
Token ID262
Feature activation-9.50e+00

Pos logit contributions
 managed+0.033
 wouldn+0.027
 never+0.024
 able+0.024
 remembered+0.023
Neg logit contributions
est-0.076
andel-0.075
 sky-0.074
umm-0.073
aff-0.072
Token sound
Token ID2128
Feature activation-1.38e+00

Pos logit contributions
andel+0.150
aff+0.144
umm+0.142
umpy+0.131
est+0.130
Neg logit contributions
 managed-0.186
 never-0.183
 wouldn-0.178
 it-0.177
 him-0.176
Token!
Token ID0
Feature activation-5.61e-01

Pos logit contributions
est+0.020
andel+0.020
umm+0.019
aff+0.019
 sky+0.019
Neg logit contributions
 managed-0.028
 wouldn-0.025
 never-0.025
 able-0.025
 him-0.025
Token We
Token ID775
Feature activation+6.10e+00

Pos logit contributions
andel+0.012
umm+0.011
est+0.011
aff+0.011
 sky+0.010
Neg logit contributions
 managed-0.008
 him-0.007
 it-0.007
 never-0.007
 wouldn-0.007
Token can
Token ID460
Feature activation+4.39e+00

Pos logit contributions
 managed+0.070
 preventing+0.067
 able+0.066
 paying+0.065
 saving+0.065
Neg logit contributions
est-0.123
 sky-0.122
umpy-0.118
andel-0.117
aff-0.117
Token still
Token ID991
Feature activation+5.61e+00

Pos logit contributions
 managed+0.060
 wouldn+0.052
 able+0.050
 never+0.049
 him+0.048
Neg logit contributions
andel-0.093
est-0.092
umm-0.091
aff-0.090
 sky-0.088
Token to
Token ID284
Feature activation+1.93e+00

Pos logit contributions
 managed+0.023
 wouldn+0.021
pped+0.019
 remembered+0.018
 never+0.018
Neg logit contributions
 sky-0.025
andel-0.023
aff-0.023
est-0.023
 world-0.023
Token the
Token ID262
Feature activation-9.21e+00

Pos logit contributions
 managed+0.027
 wouldn+0.025
pped+0.022
 able+0.022
 never+0.021
Neg logit contributions
 sky-0.039
andel-0.039
est-0.038
aff-0.038
 world-0.037
Token store
Token ID3650
Feature activation-1.57e+00

Pos logit contributions
andel+0.134
est+0.130
umm+0.127
aff+0.126
umpy+0.122
Neg logit contributions
 managed-0.189
 never-0.176
 him-0.173
 wouldn-0.171
 going-0.170
Token to
Token ID284
Feature activation+2.00e+00

Pos logit contributions
est+0.031
andel+0.031
umm+0.031
 sky+0.031
aff+0.030
Neg logit contributions
 managed-0.023
 wouldn-0.020
 able-0.019
 him-0.019
 never-0.019
Token get
Token ID651
Feature activation+6.92e-01

Pos logit contributions
 managed+0.036
 wouldn+0.034
 able+0.031
pped+0.030
ged+0.030
Neg logit contributions
 sky-0.032
andel-0.032
est-0.031
 world-0.030
umm-0.030
Token the
Token ID262
Feature activation-9.49e+00

Pos logit contributions
 managed+0.011
 wouldn+0.009
 never+0.009
pped+0.009
 remembered+0.008
Neg logit contributions
est-0.013
 sky-0.013
andel-0.013
 sun-0.012
umm-0.012
Token diamond
Token ID15291
Feature activation-2.56e+00

Pos logit contributions
andel+0.163
aff+0.158
umpy+0.153
umm+0.153
est+0.147
Neg logit contributions
 managed-0.169
 him-0.161
 it-0.161
 wouldn-0.160
 never-0.158
Token.
Token ID13
Feature activation-1.65e+00

Pos logit contributions
andel+0.044
umm+0.043
est+0.043
aff+0.042
 sky+0.040
Neg logit contributions
 managed-0.046
 wouldn-0.041
 never-0.040
 able-0.040
 him-0.040
Token But
Token ID887
Feature activation+1.85e+00

Pos logit contributions
andel+0.026
est+0.026
umm+0.026
aff+0.025
 sky+0.024
Neg logit contributions
 managed-0.031
 wouldn-0.028
 never-0.028
 able-0.027
 him-0.027
Token when
Token ID618
Feature activation+1.72e+00

Pos logit contributions
 managed+0.029
pped+0.027
check+0.027
 able+0.026
 freed+0.025
Neg logit contributions
est-0.032
 sky-0.031
 Today-0.028
andel-0.027
umm-0.027
Token they
Token ID484
Feature activation+1.94e+00

Pos logit contributions
 managed+0.020
pped+0.018
 able+0.017
 wouldn+0.017
 freed+0.016
Neg logit contributions
 sky-0.040
est-0.038
 world-0.037
umm-0.036
 Today-0.036