TOP ACTIVATIONS
MAX = 10.303

She asked Joe , " Should we take a break ?"
get hurt , but we should always ask for help when
Sam spoke up : " Should n 't we share the
box . " Should we keep it forever ?",
, little worm . Friends should never hurt each other ."
a tip , so dad tried to find some extra money
new things , but we should always be careful and not
rude and mean . He should not have done that .
we may seem , we should believe in ourselves and never
us freeze , but we should always be careful and not
day . " Should we go outside ?" Bri
got hurt , but you should have listened to me .
not ours , and we should not take something we don
could hurt you . We should stay away from them ."
bath tub , and we should share with others who don
â Should we try the jelly ?
our environment clean . We should always pick up after ourselves
is very rude . He should not do that . He
important to remember that we should still be proud of the
money , and that we should always save some for later

MIN ACTIVATIONS
MIN = -17.013

're welcome , Mr . W agon ! We will play
" Hello , Mr . W agon ! Why are you
She said to Mr . Sp ud , " I want
, Lily and Mr . Sp ud went to the park
girl walked up to the bald man and asked , "
use . The bald man went to find another
so en vious she wanted Tr ina 's dress .
. The gr umpy bear was so mad
tree and playing with the bald man . <|endoftext|> Once upon
joy . The bald monkey stretched his arms and
and carefully picked up the hairy cater pillar and put it
very excited and hugged the bald man . Every
. The bald man was feeling very sad
home . The oct opus sw am and sw
Tom to show her the sw an . Tom
to learn more about the sw an . They spent the
to spend time with the bald man . The
Dad then looked at the bald man on the TV .
says . The bald man looks at Tom and
a big bird named O str ich wanted to swim in

INTERVAL 3.474 - 10.303
CONTAINS 16.277%

But then , Tammy remembered how exciting counting can be
in a hurry to get to his friend 's house to
from the new bottle and realized that it tasted just as
When they got to the beach , Lily ran
The pilot in the helicopter saw the little boy and asked

INTERVAL -3.355 - 3.474
CONTAINS 66.563%

Jill . They liked to do many things together .
" Mom my , can I mix too ?" Lily asked
?" Tim my asked . " I can 't , I
sn uck out of her house and walked to the ocean
box . Lily laughed and said , " This juice box

INTERVAL -10.184 - -3.355
CONTAINS 14.603%

kid and his friends loved their new toy . <|endoftext|> Once
ila and Ben thanked their mom and hugged her . They
was happy they listened to her mom my 's feeling and
it 's so pretty !" They started to argue
made his way back to the children to keep his promise

INTERVAL -17.013 - -10.184
CONTAINS 2.558%

L ily liked the idea and started numbering all
had so much fun with the basketball ! They threw it
. She showed Lily a new song that she had never
and her mom played with the k ite and had fun
and dirt . The girl smiled and put her
Token She
Token ID1375
Feature activation+2.19e+00

Pos logit contributions
 him+0.012
 it+0.012
 going+0.012
 gonna+0.011
 insisting+0.011
Neg logit contributions
est-0.015
aff-0.014
affe-0.014
ari-0.013
umpy-0.013
Token asked
Token ID1965
Feature activation+5.86e+00

Pos logit contributions
 him+0.030
 going+0.029
 it+0.027
 gonna+0.026
 insisting+0.026
Neg logit contributions
est-0.050
aff-0.050
affe-0.049
bird-0.045
 sky-0.045
Token Joe
Token ID5689
Feature activation-1.18e+00

Pos logit contributions
 going+0.042
 him+0.041
 gonna+0.038
 it+0.038
 insisting+0.034
Neg logit contributions
est-0.169
aff-0.165
affe-0.163
bird-0.153
ess-0.152
Token,
Token ID11
Feature activation+3.89e+00

Pos logit contributions
est+0.030
aff+0.030
affe+0.030
bird+0.028
ari+0.028
Neg logit contributions
 him-0.013
 going-0.012
 it-0.012
 gonna-0.011
 insisting-0.011
Token "
Token ID366
Feature activation+3.46e+00

Pos logit contributions
 him+0.066
 going+0.064
 gonna+0.062
 it+0.061
 paying+0.059
Neg logit contributions
est-0.068
aff-0.067
affe-0.066
bird-0.065
 sky-0.062
TokenShould
Token ID19926
Feature activation+9.94e+00

Pos logit contributions
 him+0.041
 going+0.039
 it+0.039
 gonna+0.034
 insisting+0.034
Neg logit contributions
est-0.087
aff-0.086
affe-0.085
umpy-0.079
ari-0.079
Token we
Token ID356
Feature activation+4.53e+00

Pos logit contributions
 him+0.123
 going+0.119
 insisting+0.113
 paying+0.111
 gonna+0.108
Neg logit contributions
est-0.231
affe-0.227
aff-0.222
bird-0.211
 sky-0.208
Token take
Token ID1011
Feature activation+3.91e+00

Pos logit contributions
 him+0.057
 going+0.054
 it+0.053
 gonna+0.048
 insisting+0.047
Neg logit contributions
est-0.110
aff-0.109
affe-0.108
ari-0.099
umpy-0.099
Token a
Token ID257
Feature activation-5.08e+00

Pos logit contributions
 going+0.037
 paying+0.035
 insisting+0.034
 him+0.033
 figuring+0.032
Neg logit contributions
est-0.099
aff-0.094
affe-0.093
 sky-0.091
bird-0.088
Token break
Token ID2270
Feature activation+7.54e-01

Pos logit contributions
aff+0.113
est+0.113
affe+0.109
umm+0.103
umpy+0.102
Neg logit contributions
 it-0.075
 him-0.074
 going-0.074
 gonna-0.065
 paying-0.064
Token?"
Token ID1701
Feature activation+1.07e+00

Pos logit contributions
 him+0.010
 going+0.010
 it+0.010
 gonna+0.009
 insisting+0.009
Neg logit contributions
est-0.018
aff-0.017
affe-0.017
umpy-0.016
ari-0.016
Token get
Token ID651
Feature activation+4.48e+00

Pos logit contributions
 him+0.053
pped+0.050
 gonna+0.049
 insisting+0.048
ged+0.047
Neg logit contributions
est-0.095
 girl-0.094
affe-0.092
 sky-0.091
aff-0.091
Token hurt
Token ID5938
Feature activation+2.26e+00

Pos logit contributions
 gonna+0.061
 him+0.060
 going+0.059
 figuring+0.058
 managing+0.057
Neg logit contributions
est-0.084
 girl-0.083
aff-0.080
umpy-0.079
 sky-0.077
Token,
Token ID11
Feature activation+4.80e+00

Pos logit contributions
 him+0.037
 going+0.036
 it+0.036
 gonna+0.033
 insisting+0.032
Neg logit contributions
est-0.046
aff-0.046
affe-0.045
umpy-0.041
ari-0.041
Token but
Token ID475
Feature activation+5.40e+00

Pos logit contributions
 him+0.089
 going+0.086
 insisting+0.080
 gonna+0.080
 it+0.079
Neg logit contributions
est-0.083
affe-0.078
aff-0.078
 sky-0.073
 girl-0.071
Token we
Token ID356
Feature activation+4.25e+00

Pos logit contributions
 him+0.039
 gonna+0.034
 going+0.033
pped+0.031
 able+0.030
Neg logit contributions
est-0.146
 girl-0.143
 sky-0.139
 angel-0.138
aff-0.138
Token should
Token ID815
Feature activation+9.76e+00

Pos logit contributions
 him+0.066
 going+0.064
 it+0.062
 insisting+0.060
 paying+0.059
Neg logit contributions
est-0.086
aff-0.086
affe-0.086
umpy-0.078
bird-0.078
Token always
Token ID1464
Feature activation+6.85e+00

Pos logit contributions
 him+0.131
 going+0.126
 it+0.123
 gonna+0.118
 insisting+0.116
Neg logit contributions
est-0.219
aff-0.216
affe-0.213
ess-0.197
umpy-0.195
Token ask
Token ID1265
Feature activation+5.27e+00

Pos logit contributions
 him+0.101
 going+0.096
 gonna+0.095
 it+0.095
 insisting+0.092
Neg logit contributions
est-0.137
affe-0.128
aff-0.128
ess-0.120
bird-0.120
Token for
Token ID329
Feature activation+6.01e+00

Pos logit contributions
 him+0.059
 going+0.059
 gonna+0.056
 it+0.053
 insisting+0.052
Neg logit contributions
est-0.123
aff-0.123
affe-0.120
ess-0.113
 girl-0.111
Token help
Token ID1037
Feature activation+2.73e+00

Pos logit contributions
 gonna+0.096
 going+0.087
 him+0.084
 insisting+0.081
ged+0.079
Neg logit contributions
est-0.097
 girl-0.094
 sky-0.092
affe-0.090
 angel-0.089
Token when
Token ID618
Feature activation+3.86e+00

Pos logit contributions
 him+0.046
 going+0.046
 gonna+0.044
 it+0.044
 insisting+0.042
Neg logit contributions
est-0.051
aff-0.049
affe-0.048
ess-0.044
umpy-0.044
Token Sam
Token ID3409
Feature activation-9.76e-01

Pos logit contributions
 him+0.035
 going+0.035
 gonna+0.033
 insisting+0.032
 paying+0.031
Neg logit contributions
est-0.053
aff-0.050
affe-0.050
 girl-0.048
bird-0.046
Token spoke
Token ID5158
Feature activation+2.41e+00

Pos logit contributions
est+0.022
affe+0.022
aff+0.022
bird+0.020
umpy+0.020
Neg logit contributions
 him-0.013
 going-0.013
 it-0.013
 gonna-0.012
 insisting-0.011
Token up
Token ID510
Feature activation+1.71e+00

Pos logit contributions
 going+0.026
 him+0.026
 gonna+0.025
 it+0.025
 paying+0.024
Neg logit contributions
est-0.061
aff-0.058
affe-0.057
bird-0.054
 sky-0.054
Token:
Token ID25
Feature activation+4.70e+00

Pos logit contributions
 him+0.029
 going+0.029
 gonna+0.028
 it+0.027
 paying+0.026
Neg logit contributions
est-0.032
aff-0.031
affe-0.031
bird-0.028
 sky-0.028
Token "
Token ID366
Feature activation+3.43e+00

Pos logit contributions
 him+0.077
 going+0.074
 gonna+0.071
 it+0.070
 insisting+0.069
Neg logit contributions
aff-0.090
affe-0.087
est-0.087
bird-0.083
 sky-0.082
TokenShould
Token ID19926
Feature activation+9.76e+00

Pos logit contributions
 him+0.049
 it+0.047
 going+0.046
 insisting+0.041
 gonna+0.041
Neg logit contributions
aff-0.077
est-0.077
affe-0.075
ari-0.071
ess-0.068
Tokenn
Token ID77
Feature activation+2.87e+00

Pos logit contributions
 him+0.121
 going+0.117
 insisting+0.107
 gonna+0.106
 it+0.105
Neg logit contributions
est-0.230
affe-0.227
aff-0.225
bird-0.210
 sky-0.205
Token't
Token ID470
Feature activation+4.35e+00

Pos logit contributions
 him+0.035
 going+0.032
 it+0.031
 insisting+0.031
 gonna+0.029
Neg logit contributions
est-0.068
aff-0.068
affe-0.068
bird-0.064
umpy-0.062
Token we
Token ID356
Feature activation+4.19e+00

Pos logit contributions
 him+0.052
 going+0.049
 it+0.048
 gonna+0.044
 insisting+0.044
Neg logit contributions
est-0.107
aff-0.106
affe-0.105
bird-0.097
umpy-0.096
Token share
Token ID2648
Feature activation+4.11e+00

Pos logit contributions
 him+0.052
 going+0.049
 it+0.049
 gonna+0.044
 insisting+0.042
Neg logit contributions
est-0.103
aff-0.102
affe-0.100
ari-0.093
umpy-0.092
Token the
Token ID262
Feature activation-1.05e+01

Pos logit contributions
 going+0.032
 him+0.032
 gonna+0.028
 insisting+0.027
 it+0.026
Neg logit contributions
est-0.115
aff-0.115
affe-0.112
bird-0.105
 sky-0.105
Token box
Token ID3091
Feature activation+1.18e+00

Pos logit contributions
aff+0.071
est+0.070
affe+0.069
ari+0.063
umpy+0.062
Neg logit contributions
 him-0.068
 it-0.067
 going-0.065
 gonna-0.060
 paying-0.059
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
 him+0.021
 going+0.020
 it+0.020
 gonna+0.019
 insisting+0.019
Neg logit contributions
est-0.022
aff-0.021
affe-0.021
ari-0.019
 sky-0.019
Token
Token ID198
Feature activation-2.31e+00

Pos logit contributions
 him+0.019
 going+0.018
 it+0.018
 gonna+0.017
 insisting+0.017
Neg logit contributions
aff-0.020
est-0.020
affe-0.019
 sky-0.018
ari-0.017
Token
Token ID198
Feature activation-2.56e+00

Pos logit contributions
est+0.047
aff+0.047
affe+0.046
umm+0.041
ari+0.041
Neg logit contributions
 him-0.038
 it-0.037
 going-0.037
 gonna-0.034
 insisting-0.033
Token"
Token ID1
Feature activation+4.22e+00

Pos logit contributions
est+0.050
affe+0.049
aff+0.049
umpy+0.045
ari+0.045
Neg logit contributions
 him-0.045
 going-0.044
 it-0.041
 gonna-0.040
 paying-0.039
TokenShould
Token ID19926
Feature activation+9.75e+00

Pos logit contributions
 him+0.072
 going+0.069
 it+0.067
 gonna+0.064
 insisting+0.061
Neg logit contributions
est-0.083
aff-0.082
affe-0.080
 sky-0.075
bird-0.074
Token we
Token ID356
Feature activation+4.21e+00

Pos logit contributions
 him+0.113
 going+0.112
 insisting+0.109
 paying+0.105
 gonna+0.099
Neg logit contributions
est-0.225
affe-0.222
aff-0.216
bird-0.209
 sky-0.207
Token keep
Token ID1394
Feature activation+3.74e+00

Pos logit contributions
 him+0.054
 going+0.052
 it+0.051
 insisting+0.047
 gonna+0.046
Neg logit contributions
est-0.099
aff-0.097
affe-0.096
bird-0.089
ess-0.088
Token it
Token ID340
Feature activation+7.97e-01

Pos logit contributions
 managed+0.029
Half+0.027
ged+0.025
 paying+0.025
to+0.023
Neg logit contributions
 sky-0.088
 angel-0.084
est-0.083
 world-0.082
 sun-0.081
Token forever
Token ID8097
Feature activation+2.34e+00

Pos logit contributions
 him+0.012
 insisting+0.012
 going+0.011
 paying+0.011
 it+0.011
Neg logit contributions
est-0.017
 sky-0.014
affe-0.014
aff-0.014
 girl-0.014
Token?",
Token ID35379
Feature activation+8.57e-01

Pos logit contributions
 him+0.039
 going+0.035
 insisting+0.033
 it+0.033
 paying+0.033
Neg logit contributions
est-0.048
aff-0.044
affe-0.043
 sky-0.042
 girl-0.041
Token,
Token ID11
Feature activation+3.89e+00

Pos logit contributions
 him+0.017
 it+0.017
 going+0.017
 gonna+0.015
 paying+0.015
Neg logit contributions
aff-0.021
est-0.021
affe-0.020
umm-0.019
ari-0.019
Token little
Token ID1310
Feature activation-4.75e+00

Pos logit contributions
 it+0.059
 him+0.058
 going+0.056
 gonna+0.051
 what+0.048
Neg logit contributions
aff-0.089
est-0.087
affe-0.085
ari-0.080
umpy-0.079
Token worm
Token ID23849
Feature activation-2.07e+00

Pos logit contributions
aff+0.060
affe+0.055
est+0.054
ari+0.051
ess+0.049
Neg logit contributions
 him-0.120
 it-0.119
 going-0.116
 gonna-0.108
 his-0.107
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
aff+0.043
est+0.041
affe+0.040
umm+0.040
andel+0.039
Neg logit contributions
 him-0.034
 it-0.033
 going-0.032
 your-0.029
 his-0.028
Token Friends
Token ID14213
Feature activation+4.72e+00

Pos logit contributions
 him+0.020
 it+0.020
 going+0.018
 gonna+0.018
 your+0.017
Neg logit contributions
aff-0.032
est-0.031
affe-0.030
umpy-0.028
andel-0.028
Token should
Token ID815
Feature activation+9.73e+00

Pos logit contributions
 him+0.069
 going+0.068
 it+0.066
 gonna+0.063
 insisting+0.062
Neg logit contributions
est-0.102
aff-0.100
affe-0.100
bird-0.090
umpy-0.090
Token never
Token ID1239
Feature activation+4.84e+00

Pos logit contributions
 him+0.127
 going+0.123
 it+0.120
 gonna+0.114
 insisting+0.111
Neg logit contributions
est-0.225
aff-0.224
affe-0.221
ess-0.201
bird-0.200
Token hurt
Token ID5938
Feature activation+2.78e+00

Pos logit contributions
 him+0.066
 going+0.063
 it+0.062
 paying+0.055
 gonna+0.054
Neg logit contributions
est-0.114
aff-0.113
affe-0.112
ari-0.104
umm-0.103
Token each
Token ID1123
Feature activation-2.88e+00

Pos logit contributions
 him+0.026
 it+0.024
 going+0.024
 gonna+0.020
 insisting+0.019
Neg logit contributions
est-0.077
aff-0.077
affe-0.076
ari-0.071
umm-0.070
Token other
Token ID584
Feature activation-2.88e+00

Pos logit contributions
aff+0.050
umm+0.049
ari+0.047
bows+0.045
ess+0.043
Neg logit contributions
 him-0.062
 it-0.062
 going-0.057
 his-0.056
 your-0.054
Token."
Token ID526
Feature activation-4.34e-01

Pos logit contributions
aff+0.058
est+0.057
affe+0.056
ari+0.052
umm+0.052
Neg logit contributions
 him-0.049
 it-0.048
 going-0.047
 gonna-0.042
 insisting-0.042
Token a
Token ID257
Feature activation-3.04e+00

Pos logit contributions
 going+0.022
 gonna+0.019
 insisting+0.017
 paying+0.016
 him+0.015
Neg logit contributions
est-0.115
aff-0.112
affe-0.110
 girl-0.105
bird-0.104
Token tip
Token ID8171
Feature activation+2.04e-01

Pos logit contributions
est+0.056
aff+0.056
affe+0.054
umm+0.049
umpy+0.049
Neg logit contributions
 him-0.056
 it-0.055
 going-0.055
 gonna-0.050
 paying-0.050
Token,
Token ID11
Feature activation+4.88e+00

Pos logit contributions
 him+0.003
 going+0.003
 it+0.003
 gonna+0.002
 insisting+0.002
Neg logit contributions
est-0.005
aff-0.005
affe-0.005
ari-0.004
umpy-0.004
Token so
Token ID523
Feature activation+3.57e+00

Pos logit contributions
 him+0.095
 going+0.090
 it+0.086
 insisting+0.084
 gonna+0.084
Neg logit contributions
est-0.077
affe-0.075
aff-0.072
 sky-0.068
 girl-0.066
Token dad
Token ID9955
Feature activation+6.29e-01

Pos logit contributions
 him+0.027
 going+0.026
 gonna+0.022
 it+0.022
 insisting+0.021
Neg logit contributions
est-0.101
affe-0.100
aff-0.100
 girl-0.093
ess-0.092
Token tried
Token ID3088
Feature activation+9.53e+00

Pos logit contributions
 him+0.009
 going+0.008
 gonna+0.008
 it+0.008
 insisting+0.008
Neg logit contributions
est-0.014
affe-0.014
aff-0.014
ari-0.013
 girl-0.012
Token to
Token ID284
Feature activation+7.52e+00

Pos logit contributions
 gonna+0.098
 going+0.081
 him+0.075
 insisting+0.073
 able+0.073
Neg logit contributions
aff-0.219
est-0.217
bird-0.203
 sky-0.202
affe-0.201
Token find
Token ID1064
Feature activation+6.17e+00

Pos logit contributions
 him+0.093
 going+0.093
 it+0.090
 gonna+0.088
 insisting+0.082
Neg logit contributions
est-0.174
aff-0.170
affe-0.167
bird-0.156
 girl-0.154
Token some
Token ID617
Feature activation-1.91e+00

Pos logit contributions
 going+0.051
 insisting+0.046
 gonna+0.045
 paying+0.041
 him+0.041
Neg logit contributions
est-0.171
aff-0.162
affe-0.162
bird-0.151
 girl-0.151
Token extra
Token ID3131
Feature activation-3.98e+00

Pos logit contributions
est+0.036
aff+0.036
affe+0.035
umpy+0.032
ari+0.031
Neg logit contributions
 him-0.034
 going-0.033
 it-0.032
 gonna-0.030
 insisting-0.030
Token money
Token ID1637
Feature activation+3.31e+00

Pos logit contributions
est+0.072
aff+0.069
affe+0.068
ess+0.060
 sky+0.060
Neg logit contributions
 him-0.075
 going-0.073
 it-0.070
 gonna-0.068
 insisting-0.068
Token new
Token ID649
Feature activation-1.52e+00

Pos logit contributions
 him+0.074
 gonna+0.072
 going+0.070
 insisting+0.067
 paying+0.065
Neg logit contributions
est-0.155
aff-0.153
 sky-0.141
affe-0.140
ess-0.138
Token things
Token ID1243
Feature activation+1.80e+00

Pos logit contributions
est+0.028
aff+0.026
affe+0.025
 sky+0.023
bird+0.023
Neg logit contributions
 him-0.028
 going-0.028
 it-0.026
 gonna-0.026
 insisting-0.025
Token,
Token ID11
Feature activation+5.12e+00

Pos logit contributions
 gonna+0.029
 insisting+0.029
 going+0.028
 him+0.028
 paying+0.027
Neg logit contributions
est-0.032
aff-0.032
affe-0.030
ess-0.028
 sky-0.028
Token but
Token ID475
Feature activation+5.90e+00

Pos logit contributions
 him+0.096
 going+0.092
 it+0.088
 insisting+0.087
 gonna+0.086
Neg logit contributions
est-0.087
aff-0.084
affe-0.082
 sky-0.076
 girl-0.074
Token we
Token ID356
Feature activation+4.43e+00

Pos logit contributions
 him+0.049
 gonna+0.042
pped+0.042
 going+0.042
 able+0.040
Neg logit contributions
est-0.148
 girl-0.146
 sky-0.144
aff-0.143
 angel-0.141
Token should
Token ID815
Feature activation+9.52e+00

Pos logit contributions
 him+0.069
 going+0.067
 it+0.064
 insisting+0.063
 paying+0.061
Neg logit contributions
affe-0.089
aff-0.089
est-0.088
bird-0.081
umpy-0.081
Token always
Token ID1464
Feature activation+6.64e+00

Pos logit contributions
 him+0.116
 going+0.111
 it+0.107
 gonna+0.102
 insisting+0.102
Neg logit contributions
est-0.224
aff-0.223
affe-0.220
ess-0.204
 sky-0.202
Token be
Token ID307
Feature activation+2.88e+00

Pos logit contributions
 him+0.098
 going+0.092
 it+0.091
 gonna+0.090
 insisting+0.088
Neg logit contributions
est-0.133
aff-0.126
affe-0.124
bird-0.117
 girl-0.115
Token careful
Token ID8161
Feature activation+1.03e+00

Pos logit contributions
 him+0.056
 gonna+0.047
 going+0.047
 it+0.047
 insisting+0.045
Neg logit contributions
est-0.047
 girl-0.044
umpy-0.042
 sky-0.037
affe-0.037
Token and
Token ID290
Feature activation+3.21e+00

Pos logit contributions
 him+0.016
 going+0.015
 it+0.015
 gonna+0.015
 insisting+0.014
Neg logit contributions
est-0.022
aff-0.020
affe-0.020
umpy-0.019
ess-0.019
Token not
Token ID407
Feature activation+4.95e+00

Pos logit contributions
 him+0.049
pped+0.044
 gonna+0.040
 insisting+0.040
ged+0.039
Neg logit contributions
 girl-0.056
est-0.054
umpy-0.053
affe-0.052
ess-0.051
Token rude
Token ID22066
Feature activation-1.07e+00

Pos logit contributions
 him+0.014
 going+0.014
 it+0.013
 gonna+0.013
 insisting+0.013
Neg logit contributions
est-0.016
aff-0.016
affe-0.015
umpy-0.014
ess-0.014
Token and
Token ID290
Feature activation+3.66e+00

Pos logit contributions
aff+0.022
est+0.022
affe+0.022
umm+0.020
andel+0.020
Neg logit contributions
 him-0.018
 it-0.017
 going-0.017
 gonna-0.015
 his-0.015
Token mean
Token ID1612
Feature activation-4.60e-01

Pos logit contributions
 him+0.073
 going+0.067
 insisting+0.067
 gonna+0.066
 it+0.065
Neg logit contributions
affe-0.053
 girl-0.052
umpy-0.052
aff-0.051
est-0.049
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
aff+0.009
est+0.009
affe+0.009
ari+0.008
umm+0.008
Neg logit contributions
 him-0.008
 it-0.008
 going-0.008
 gonna-0.007
 insisting-0.007
Token He
Token ID679
Feature activation+3.48e+00

Pos logit contributions
 him+0.021
 going+0.020
 it+0.020
 gonna+0.019
 insisting+0.019
Neg logit contributions
est-0.024
aff-0.024
affe-0.023
umpy-0.021
ari-0.021
Token should
Token ID815
Feature activation+9.50e+00

Pos logit contributions
 him+0.050
 going+0.047
 it+0.047
 gonna+0.044
 insisting+0.044
Neg logit contributions
est-0.077
affe-0.076
aff-0.075
umpy-0.069
ari-0.069
Token not
Token ID407
Feature activation+5.34e+00

Pos logit contributions
 him+0.084
 going+0.083
 it+0.079
 gonna+0.078
 insisting+0.075
Neg logit contributions
aff-0.248
est-0.246
affe-0.245
 sky-0.225
umpy-0.225
Token have
Token ID423
Feature activation+4.33e+00

Pos logit contributions
 him+0.073
 going+0.070
 it+0.069
 gonna+0.065
 insisting+0.063
Neg logit contributions
aff-0.121
est-0.121
affe-0.120
umpy-0.109
bird-0.109
Token done
Token ID1760
Feature activation+4.11e+00

Pos logit contributions
 gonna+0.052
 him+0.051
 insisting+0.047
 going+0.045
 it+0.042
Neg logit contributions
aff-0.091
est-0.087
affe-0.085
 girl-0.084
 sky-0.083
Token that
Token ID326
Feature activation+8.83e-01

Pos logit contributions
 gonna+0.046
 him+0.044
 insisting+0.041
 going+0.041
 paying+0.040
Neg logit contributions
aff-0.089
est-0.089
affe-0.085
umpy-0.084
 girl-0.082
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
 him+0.011
 going+0.010
 it+0.010
 gonna+0.010
 insisting+0.009
Neg logit contributions
est-0.021
aff-0.021
affe-0.021
 sky-0.019
umpy-0.019
Token we
Token ID356
Feature activation+4.40e+00

Pos logit contributions
 him+0.016
 going+0.015
 it+0.014
 insisting+0.014
 gonna+0.014
Neg logit contributions
est-0.037
aff-0.036
affe-0.036
 sky-0.033
 girl-0.033
Token may
Token ID743
Feature activation+6.52e+00

Pos logit contributions
 him+0.065
 insisting+0.061
 going+0.061
 it+0.061
 gonna+0.058
Neg logit contributions
est-0.089
affe-0.088
aff-0.088
bird-0.082
 sky-0.082
Token seem
Token ID1283
Feature activation+3.23e+00

Pos logit contributions
 him+0.088
 going+0.081
 it+0.080
 insisting+0.078
 gonna+0.077
Neg logit contributions
est-0.146
aff-0.146
affe-0.141
 sky-0.133
umpy-0.133
Token,
Token ID11
Feature activation+5.93e+00

Pos logit contributions
 him+0.054
 it+0.048
 insisting+0.048
 going+0.047
 gonna+0.046
Neg logit contributions
est-0.064
aff-0.062
affe-0.058
umpy-0.057
 sky-0.056
Token we
Token ID356
Feature activation+5.16e+00

Pos logit contributions
 him+0.062
 going+0.057
 it+0.053
 insisting+0.050
 gonna+0.050
Neg logit contributions
est-0.153
aff-0.151
affe-0.150
 sky-0.140
umpy-0.138
Token should
Token ID815
Feature activation+9.50e+00

Pos logit contributions
 him+0.081
 going+0.078
 it+0.075
 insisting+0.073
 gonna+0.072
Neg logit contributions
est-0.105
affe-0.104
aff-0.104
bird-0.094
 sky-0.094
Token believe
Token ID1975
Feature activation+2.65e+00

Pos logit contributions
 him+0.128
 going+0.121
 it+0.118
 insisting+0.114
 gonna+0.114
Neg logit contributions
est-0.210
aff-0.209
affe-0.205
 sky-0.190
ess-0.189
Token in
Token ID287
Feature activation+4.97e+00

Pos logit contributions
 going+0.030
 him+0.029
 paying+0.028
 insisting+0.027
 gonna+0.027
Neg logit contributions
est-0.064
aff-0.061
affe-0.060
 girl-0.057
 angel-0.057
Token ourselves
Token ID6731
Feature activation+8.10e-01

Pos logit contributions
 him+0.064
 going+0.059
 gonna+0.056
 it+0.055
 insisting+0.054
Neg logit contributions
est-0.115
aff-0.110
affe-0.108
 sky-0.103
umpy-0.101
Token and
Token ID290
Feature activation+3.81e+00

Pos logit contributions
 him+0.014
 going+0.013
 it+0.013
 gonna+0.012
 insisting+0.012
Neg logit contributions
est-0.016
aff-0.016
affe-0.015
umpy-0.014
 sky-0.014
Token never
Token ID1239
Feature activation+5.28e+00

Pos logit contributions
 him+0.055
 gonna+0.050
 going+0.049
pped+0.049
 insisting+0.049
Neg logit contributions
est-0.076
affe-0.071
 sky-0.071
aff-0.070
 girl-0.070
Token us
Token ID514
Feature activation+2.07e-01

Pos logit contributions
 him+0.063
 going+0.062
 it+0.059
 gonna+0.058
 insisting+0.056
Neg logit contributions
est-0.111
aff-0.109
affe-0.106
umpy-0.098
 sky-0.098
Token freeze
Token ID16611
Feature activation+2.41e+00

Pos logit contributions
 him+0.003
 going+0.003
 it+0.003
 gonna+0.003
 insisting+0.003
Neg logit contributions
est-0.004
aff-0.004
affe-0.004
umpy-0.004
bird-0.004
Token,
Token ID11
Feature activation+4.24e+00

Pos logit contributions
 him+0.040
 going+0.039
 it+0.038
 gonna+0.037
 insisting+0.036
Neg logit contributions
est-0.048
aff-0.047
affe-0.045
umpy-0.042
 sky-0.041
Token but
Token ID475
Feature activation+5.48e+00

Pos logit contributions
 him+0.071
 going+0.068
 gonna+0.065
 it+0.064
 insisting+0.064
Neg logit contributions
est-0.080
aff-0.076
affe-0.075
 sky-0.070
umpy-0.069
Token we
Token ID356
Feature activation+4.71e+00

Pos logit contributions
 him+0.050
 gonna+0.044
 going+0.043
 able+0.043
pped+0.043
Neg logit contributions
est-0.133
 girl-0.130
aff-0.130
 sky-0.129
affe-0.128
Token should
Token ID815
Feature activation+9.50e+00

Pos logit contributions
 him+0.068
 going+0.066
 it+0.064
 insisting+0.061
 gonna+0.060
Neg logit contributions
est-0.102
aff-0.101
affe-0.100
umpy-0.091
bird-0.091
Token always
Token ID1464
Feature activation+6.98e+00

Pos logit contributions
 him+0.111
 going+0.106
 it+0.103
 gonna+0.097
 insisting+0.095
Neg logit contributions
est-0.231
aff-0.230
affe-0.226
ess-0.209
 sky-0.208
Token be
Token ID307
Feature activation+3.28e+00

Pos logit contributions
 him+0.101
 going+0.096
 it+0.095
 gonna+0.094
 insisting+0.092
Neg logit contributions
est-0.141
aff-0.135
affe-0.133
bird-0.124
ess-0.124
Token careful
Token ID8161
Feature activation+1.03e+00

Pos logit contributions
 him+0.061
 going+0.053
 gonna+0.053
 it+0.053
 insisting+0.051
Neg logit contributions
est-0.056
umpy-0.051
 girl-0.051
aff-0.046
affe-0.045
Token and
Token ID290
Feature activation+2.90e+00

Pos logit contributions
 him+0.016
 going+0.016
 it+0.016
 gonna+0.015
 insisting+0.014
Neg logit contributions
est-0.021
aff-0.021
affe-0.020
umpy-0.019
ess-0.018
Token not
Token ID407
Feature activation+5.09e+00

Pos logit contributions
 him+0.047
pped+0.043
 gonna+0.040
 insisting+0.040
 it+0.038
Neg logit contributions
 girl-0.047
umpy-0.047
est-0.046
affe-0.044
 song-0.043
Token day
Token ID1110
Feature activation+2.22e+00

Pos logit contributions
aff+0.064
affe+0.063
est+0.060
umpy+0.056
ari+0.055
Neg logit contributions
 him-0.059
 going-0.059
 it-0.058
 gonna-0.051
 insisting-0.050
Token.
Token ID13
Feature activation+5.06e-01

Pos logit contributions
 him+0.041
 going+0.039
 it+0.038
 gonna+0.037
 insisting+0.036
Neg logit contributions
est-0.041
aff-0.040
affe-0.039
 sky-0.035
umpy-0.035
Token
Token ID198
Feature activation-1.29e+00

Pos logit contributions
 him+0.008
 it+0.008
 going+0.007
 gonna+0.007
 his+0.007
Neg logit contributions
est-0.012
aff-0.011
affe-0.011
umm-0.010
andel-0.010
Token
Token ID198
Feature activation-1.40e+00

Pos logit contributions
est+0.029
aff+0.029
affe+0.028
umm+0.027
ari+0.025
Neg logit contributions
 him-0.020
 it-0.019
 going-0.018
 his-0.016
 gonna-0.016
Token"
Token ID1
Feature activation+3.60e+00

Pos logit contributions
est+0.029
aff+0.029
affe+0.028
umm+0.025
ari+0.025
Neg logit contributions
 him-0.023
 going-0.022
 it-0.022
 gonna-0.020
 insisting-0.020
TokenShould
Token ID19926
Feature activation+9.46e+00

Pos logit contributions
 him+0.051
 it+0.050
 going+0.049
 insisting+0.044
 gonna+0.043
Neg logit contributions
aff-0.083
est-0.082
affe-0.081
ari-0.075
umpy-0.074
Token we
Token ID356
Feature activation+4.14e+00

Pos logit contributions
 him+0.110
 going+0.107
 insisting+0.100
 paying+0.098
 gonna+0.096
Neg logit contributions
est-0.227
affe-0.223
aff-0.217
bird-0.207
 girl-0.204
Token go
Token ID467
Feature activation+3.66e+00

Pos logit contributions
 him+0.053
 going+0.051
 it+0.050
 gonna+0.045
 insisting+0.045
Neg logit contributions
est-0.098
aff-0.097
affe-0.096
bird-0.088
ari-0.088
Token outside
Token ID2354
Feature activation+2.83e+00

Pos logit contributions
 paying+0.045
 him+0.044
 insisting+0.043
 going+0.043
 figuring+0.042
Neg logit contributions
est-0.080
aff-0.075
ari-0.074
 sky-0.074
bird-0.073
Token?"
Token ID1701
Feature activation+1.36e+00

Pos logit contributions
 him+0.043
 going+0.042
 it+0.040
 gonna+0.038
 insisting+0.037
Neg logit contributions
est-0.060
aff-0.059
affe-0.058
bird-0.053
ari-0.053
Token Bri
Token ID25866
Feature activation-5.38e+00

Pos logit contributions
 him+0.018
 it+0.017
 going+0.017
 gonna+0.015
 insisting+0.014
Neg logit contributions
est-0.033
aff-0.033
affe-0.032
ari-0.030
umpy-0.030
Token got
Token ID1392
Feature activation+4.77e+00

Pos logit contributions
 him+0.017
 going+0.016
 it+0.016
 gonna+0.014
 insisting+0.014
Neg logit contributions
est-0.027
aff-0.027
affe-0.026
umpy-0.024
ari-0.024
Token hurt
Token ID5938
Feature activation+1.84e+00

Pos logit contributions
 him+0.061
 gonna+0.061
 going+0.060
 managing+0.057
 paying+0.057
Neg logit contributions
est-0.099
aff-0.093
 girl-0.091
affe-0.090
umpy-0.090
Token,
Token ID11
Feature activation+3.76e+00

Pos logit contributions
 him+0.031
 it+0.030
 going+0.029
 gonna+0.027
 insisting+0.026
Neg logit contributions
aff-0.038
est-0.037
affe-0.037
ari-0.034
umm-0.033
Token but
Token ID475
Feature activation+6.05e+00

Pos logit contributions
 him+0.065
 it+0.064
 going+0.062
 gonna+0.057
 his+0.056
Neg logit contributions
aff-0.077
est-0.075
affe-0.073
ari-0.068
umm-0.067
Token you
Token ID345
Feature activation+3.15e+00

Pos logit contributions
 able+0.067
 him+0.067
 gonna+0.067
pped+0.065
Half+0.064
Neg logit contributions
est-0.130
 girl-0.130
 angel-0.124
affe-0.122
 sky-0.121
Token should
Token ID815
Feature activation+9.40e+00

Pos logit contributions
 him+0.042
 it+0.041
 going+0.040
 gonna+0.036
 paying+0.034
Neg logit contributions
est-0.076
aff-0.075
affe-0.073
umm-0.069
andel-0.067
Token have
Token ID423
Feature activation+6.17e+00

Pos logit contributions
 him+0.097
 going+0.092
 it+0.089
 gonna+0.086
 insisting+0.083
Neg logit contributions
est-0.239
aff-0.235
affe-0.234
ess-0.217
umpy-0.216
Token listened
Token ID16399
Feature activation+2.57e+00

Pos logit contributions
 gonna+0.087
 him+0.078
 figuring+0.070
ged+0.068
 going+0.067
Neg logit contributions
 girl-0.113
est-0.106
 sky-0.102
 owner-0.102
 driver-0.101
Token to
Token ID284
Feature activation+5.08e+00

Pos logit contributions
 him+0.025
 going+0.024
 gonna+0.023
 paying+0.022
 insisting+0.021
Neg logit contributions
est-0.067
affe-0.063
aff-0.063
umpy-0.062
ess-0.061
Token me
Token ID502
Feature activation+1.56e-01

Pos logit contributions
 going+0.045
 gonna+0.043
 it+0.040
 him+0.040
 insisting+0.039
Neg logit contributions
aff-0.130
est-0.129
affe-0.128
 sky-0.118
bird-0.118
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
 him+0.003
 going+0.003
 it+0.003
 gonna+0.003
 insisting+0.002
Neg logit contributions
est-0.003
aff-0.003
affe-0.003
umpy-0.003
ari-0.003
Token not
Token ID407
Feature activation+3.99e+00

Pos logit contributions
 him+0.050
ged+0.045
pped+0.043
 insisting+0.043
 gonna+0.043
Neg logit contributions
est-0.071
aff-0.065
 sky-0.063
affe-0.063
umpy-0.061
Token ours
Token ID16903
Feature activation+1.70e+00

Pos logit contributions
 him+0.058
 insisting+0.052
 gonna+0.050
 it+0.050
 going+0.049
Neg logit contributions
est-0.086
aff-0.081
affe-0.080
 girl-0.075
ess-0.074
Token,
Token ID11
Feature activation+4.42e+00

Pos logit contributions
 him+0.029
 going+0.028
 it+0.027
 gonna+0.025
 insisting+0.025
Neg logit contributions
est-0.034
aff-0.033
affe-0.033
ari-0.029
umpy-0.029
Token and
Token ID290
Feature activation+5.17e+00

Pos logit contributions
 him+0.067
 going+0.065
 insisting+0.059
 it+0.059
 paying+0.058
Neg logit contributions
est-0.093
affe-0.088
aff-0.087
umpy-0.082
 girl-0.081
Token we
Token ID356
Feature activation+4.90e+00

Pos logit contributions
pped+0.049
 him+0.048
 going+0.045
 insisting+0.044
ged+0.044
Neg logit contributions
est-0.119
 girl-0.116
affe-0.115
 sky-0.110
aff-0.109
Token should
Token ID815
Feature activation+9.38e+00

Pos logit contributions
 him+0.074
 going+0.071
 it+0.069
 insisting+0.065
 gonna+0.065
Neg logit contributions
est-0.104
aff-0.104
affe-0.103
umpy-0.093
ari-0.093
Token not
Token ID407
Feature activation+5.60e+00

Pos logit contributions
 him+0.112
 going+0.107
 it+0.104
 gonna+0.100
 insisting+0.100
Neg logit contributions
est-0.220
aff-0.219
affe-0.216
ess-0.201
 sky-0.198
Token take
Token ID1011
Feature activation+3.12e+00

Pos logit contributions
 him+0.074
 going+0.070
 it+0.069
 gonna+0.063
 insisting+0.062
Neg logit contributions
est-0.132
aff-0.131
affe-0.129
umpy-0.119
ari-0.118
Token something
Token ID1223
Feature activation-9.47e-01

Pos logit contributions
 going+0.031
 gonna+0.028
 insisting+0.028
 him+0.027
 suggesting+0.026
Neg logit contributions
est-0.078
aff-0.075
affe-0.074
 girl-0.070
 sky-0.069
Token we
Token ID356
Feature activation+3.29e+00

Pos logit contributions
aff+0.022
est+0.022
affe+0.022
umpy+0.020
ess+0.020
Neg logit contributions
 him-0.012
 going-0.011
 it-0.011
 gonna-0.010
 insisting-0.010
Token don
Token ID836
Feature activation+4.57e+00

Pos logit contributions
 it+0.049
 him+0.048
 insisting+0.046
 going+0.046
 paying+0.044
Neg logit contributions
aff-0.065
affe-0.064
est-0.061
umpy-0.059
ess-0.057
Token could
Token ID714
Feature activation+6.82e+00

Pos logit contributions
 him+0.071
 going+0.068
 it+0.067
 insisting+0.065
 gonna+0.063
Neg logit contributions
est-0.093
affe-0.091
aff-0.090
umpy-0.083
ess-0.081
Token hurt
Token ID5938
Feature activation+3.03e+00

Pos logit contributions
 him+0.090
 going+0.088
 it+0.087
 gonna+0.086
 insisting+0.085
Neg logit contributions
est-0.143
aff-0.140
affe-0.140
bird-0.131
 sky-0.129
Token you
Token ID345
Feature activation+8.85e-01

Pos logit contributions
 him+0.027
 it+0.024
 going+0.024
 gonna+0.020
 insisting+0.019
Neg logit contributions
est-0.086
aff-0.086
affe-0.085
ari-0.079
umpy-0.079
Token.
Token ID13
Feature activation+2.70e+00

Pos logit contributions
 him+0.017
 going+0.016
 it+0.016
 gonna+0.015
 insisting+0.015
Neg logit contributions
est-0.016
aff-0.015
affe-0.015
umpy-0.014
ess-0.013
Token We
Token ID775
Feature activation+5.09e+00

Pos logit contributions
 him+0.040
 it+0.039
 going+0.037
 gonna+0.037
 insisting+0.033
Neg logit contributions
est-0.061
aff-0.061
affe-0.058
andel-0.054
umm-0.053
Token should
Token ID815
Feature activation+9.38e+00

Pos logit contributions
 him+0.071
 going+0.068
 it+0.068
 gonna+0.062
 insisting+0.060
Neg logit contributions
est-0.116
aff-0.115
affe-0.113
ari-0.104
umpy-0.104
Token stay
Token ID2652
Feature activation+2.45e+00

Pos logit contributions
 him+0.118
 going+0.114
 it+0.111
 gonna+0.104
 insisting+0.103
Neg logit contributions
est-0.219
aff-0.216
affe-0.214
ess-0.197
 sky-0.196
Token away
Token ID1497
Feature activation+1.64e+00

Pos logit contributions
 him+0.034
 paying+0.032
 going+0.032
 gonna+0.031
 figuring+0.030
Neg logit contributions
est-0.049
 girl-0.045
an-0.043
 sky-0.043
umpy-0.042
Token from
Token ID422
Feature activation+3.16e+00

Pos logit contributions
 him+0.024
 going+0.023
 it+0.023
 gonna+0.021
 insisting+0.021
Neg logit contributions
est-0.036
aff-0.036
affe-0.035
umpy-0.032
ari-0.032
Token them
Token ID606
Feature activation+1.16e-01

Pos logit contributions
 gonna+0.026
 going+0.026
 him+0.026
 paying+0.024
 insisting+0.024
Neg logit contributions
est-0.077
aff-0.075
 sky-0.075
affe-0.073
 angel-0.073
Token."
Token ID526
Feature activation+5.54e-01

Pos logit contributions
 him+0.002
 it+0.002
 going+0.002
 gonna+0.002
 paying+0.002
Neg logit contributions
aff-0.002
est-0.002
affe-0.002
umm-0.002
andel-0.002
Token bath
Token ID7837
Feature activation-3.00e+00

Pos logit contributions
affe+0.132
aff+0.130
est+0.118
umm+0.114
umpy+0.111
Neg logit contributions
 it-0.136
 him-0.133
 going-0.129
 his-0.121
 your-0.117
Tokentub
Token ID37995
Feature activation-3.65e-01

Pos logit contributions
aff+0.061
affe+0.060
est+0.060
bird+0.053
umpy+0.053
Neg logit contributions
 him-0.050
 going-0.048
 it-0.048
 gonna-0.043
 insisting-0.042
Token,
Token ID11
Feature activation+4.55e+00

Pos logit contributions
est+0.008
aff+0.008
affe+0.008
ari+0.008
umpy+0.007
Neg logit contributions
 him-0.005
 going-0.005
 it-0.005
 gonna-0.004
 insisting-0.004
Token and
Token ID290
Feature activation+4.96e+00

Pos logit contributions
 him+0.070
 going+0.067
 it+0.067
 gonna+0.061
 insisting+0.059
Neg logit contributions
est-0.098
aff-0.098
affe-0.096
umpy-0.088
ari-0.088
Token we
Token ID356
Feature activation+4.64e+00

Pos logit contributions
 him+0.055
ged+0.052
 insisting+0.052
pped+0.051
 paying+0.047
Neg logit contributions
 girl-0.106
est-0.106
 sky-0.103
 angel-0.099
affe-0.099
Token should
Token ID815
Feature activation+9.36e+00

Pos logit contributions
 him+0.070
 going+0.067
 it+0.066
 gonna+0.061
 insisting+0.061
Neg logit contributions
est-0.100
aff-0.099
affe-0.098
umpy-0.089
ari-0.089
Token share
Token ID2648
Feature activation+3.82e+00

Pos logit contributions
 him+0.123
 going+0.116
 it+0.115
 insisting+0.112
 gonna+0.110
Neg logit contributions
est-0.210
aff-0.206
affe-0.202
ess-0.189
 sky-0.188
Token with
Token ID351
Feature activation+2.44e+00

Pos logit contributions
 him+0.033
 going+0.033
 gonna+0.029
 insisting+0.028
 it+0.028
Neg logit contributions
est-0.105
aff-0.105
affe-0.102
 sky-0.096
ess-0.095
Token others
Token ID1854
Feature activation-1.68e+00

Pos logit contributions
 him+0.023
 going+0.022
 it+0.021
 gonna+0.019
 insisting+0.018
Neg logit contributions
est-0.067
aff-0.066
affe-0.065
ari-0.061
umpy-0.061
Token who
Token ID508
Feature activation+3.86e+00

Pos logit contributions
aff+0.033
est+0.033
affe+0.032
umm+0.030
andel+0.029
Neg logit contributions
 him-0.029
 it-0.029
 going-0.028
 gonna-0.025
 his-0.025
Token don
Token ID836
Feature activation+5.06e+00

Pos logit contributions
 insisting+0.059
 him+0.057
 gonna+0.057
 going+0.056
 paying+0.054
Neg logit contributions
est-0.074
aff-0.071
affe-0.070
 sky-0.068
 girl-0.067
Token
Token ID198
Feature activation-1.79e+00

Pos logit contributions
est+0.037
aff+0.037
affe+0.036
umpy+0.032
ari+0.032
Neg logit contributions
 him-0.031
 going-0.030
 it-0.029
 gonna-0.027
 insisting-0.027
Tokenâ
Token ID22940
Feature activation-6.71e+00

Pos logit contributions
est+0.036
aff+0.036
affe+0.036
umpy+0.033
ari+0.032
Neg logit contributions
 him-0.029
 going-0.029
 it-0.028
 gonna-0.026
 insisting-0.025
Token
Token ID26391
Feature activation-7.71e+00

Pos logit contributions
est+0.114
aff+0.113
affe+0.111
ari+0.098
umpy+0.098
Neg logit contributions
 him-0.133
 going-0.129
 it-0.128
 gonna-0.120
 insisting-0.119
Token
Token ID129
Feature activation-6.00e+00

Pos logit contributions
aff+0.136
affe+0.135
est+0.134
ari+0.128
ess+0.126
Neg logit contributions
 him-0.141
 going-0.134
 it-0.130
 insisting-0.127
 gonna-0.125
Token
Token ID241
Feature activation+3.69e-01

Pos logit contributions
aff+0.108
affe+0.103
est+0.101
umpy+0.096
ess+0.096
Neg logit contributions
 him-0.106
 it-0.104
 going-0.103
 paying-0.099
 able-0.098
TokenShould
Token ID19926
Feature activation+9.35e+00

Pos logit contributions
 him+0.006
 going+0.006
 it+0.005
 gonna+0.005
 able+0.005
Neg logit contributions
est-0.007
aff-0.007
affe-0.007
umpy-0.007
ess-0.007
Token we
Token ID356
Feature activation+4.05e+00

Pos logit contributions
 him+0.112
 going+0.110
 insisting+0.107
 paying+0.103
 gonna+0.099
Neg logit contributions
est-0.214
affe-0.208
aff-0.205
 sky-0.192
 girl-0.192
Token try
Token ID1949
Feature activation+6.05e+00

Pos logit contributions
 him+0.052
 going+0.050
 it+0.049
 insisting+0.045
 paying+0.044
Neg logit contributions
est-0.095
aff-0.093
affe-0.092
bird-0.085
umpy-0.085
Token the
Token ID262
Feature activation-1.00e+01

Pos logit contributions
 paying+0.052
 him+0.052
 insisting+0.049
 going+0.049
 figuring+0.048
Neg logit contributions
est-0.144
 sky-0.141
aff-0.139
bird-0.135
berry-0.135
Token jelly
Token ID27644
Feature activation-5.41e+00

Pos logit contributions
est+0.184
aff+0.177
affe+0.173
 sky+0.160
umm+0.157
Neg logit contributions
 him-0.187
 going-0.180
 it-0.173
 gonna-0.169
 insisting-0.167
Token?
Token ID30
Feature activation+2.22e+00

Pos logit contributions
est+0.116
aff+0.108
affe+0.108
 sky+0.099
 girl+0.099
Neg logit contributions
 him-0.082
 going-0.080
 insisting-0.077
 paying-0.075
 gonna-0.073
Token our
Token ID674
Feature activation-8.07e+00

Pos logit contributions
Half+0.031
 gonna+0.030
 managed+0.030
 insisting+0.028
plug+0.027
Neg logit contributions
est-0.088
 sky-0.082
 girl-0.082
aff-0.082
 world-0.078
Token environment
Token ID2858
Feature activation-9.45e-01

Pos logit contributions
aff+0.146
affe+0.141
est+0.137
ari+0.125
umpy+0.122
Neg logit contributions
 him-0.161
 it-0.158
 going-0.153
 his-0.140
 insisting-0.140
Token clean
Token ID3424
Feature activation+1.68e+00

Pos logit contributions
aff+0.021
affe+0.021
est+0.021
bird+0.019
ari+0.019
Neg logit contributions
 him-0.014
 it-0.014
 going-0.013
 paying-0.012
 your-0.012
Token.
Token ID13
Feature activation+2.15e+00

Pos logit contributions
 him+0.028
 going+0.027
 it+0.027
 gonna+0.025
 insisting+0.024
Neg logit contributions
est-0.034
aff-0.033
affe-0.033
umpy-0.030
ari-0.030
Token We
Token ID775
Feature activation+4.95e+00

Pos logit contributions
 it+0.031
 him+0.031
 gonna+0.029
 going+0.029
 your+0.026
Neg logit contributions
est-0.049
aff-0.049
affe-0.047
andel-0.043
ari-0.042
Token should
Token ID815
Feature activation+9.34e+00

Pos logit contributions
 him+0.071
 going+0.068
 it+0.068
 gonna+0.062
 insisting+0.060
Neg logit contributions
est-0.112
aff-0.111
affe-0.109
ari-0.100
umpy-0.099
Token always
Token ID1464
Feature activation+6.45e+00

Pos logit contributions
 him+0.118
 going+0.111
 it+0.108
 insisting+0.104
 gonna+0.103
Neg logit contributions
est-0.216
aff-0.214
affe-0.210
ess-0.197
 sky-0.194
Token pick
Token ID2298
Feature activation+3.17e+00

Pos logit contributions
 him+0.095
 it+0.088
 going+0.087
 gonna+0.087
 insisting+0.086
Neg logit contributions
est-0.127
aff-0.119
affe-0.118
 girl-0.114
ess-0.113
Token up
Token ID510
Feature activation+2.26e+00

Pos logit contributions
 him+0.037
 going+0.037
 gonna+0.034
 it+0.034
 insisting+0.033
Neg logit contributions
est-0.078
aff-0.076
affe-0.074
 sky-0.069
 girl-0.068
Token after
Token ID706
Feature activation+3.63e+00

Pos logit contributions
 him+0.026
 going+0.026
 it+0.024
 gonna+0.023
 insisting+0.023
Neg logit contributions
est-0.055
aff-0.055
affe-0.054
 sky-0.049
umpy-0.049
Token ourselves
Token ID6731
Feature activation+1.25e-02

Pos logit contributions
 insisting+0.036
 him+0.035
 gonna+0.034
 going+0.034
 paying+0.032
Neg logit contributions
aff-0.087
est-0.083
affe-0.082
 sky-0.080
 girl-0.077
Token is
Token ID318
Feature activation+4.16e+00

Pos logit contributions
aff+0.008
est+0.008
affe+0.008
ari+0.007
umm+0.007
Neg logit contributions
 him-0.007
 it-0.006
 going-0.006
 gonna-0.006
 insisting-0.006
Token very
Token ID845
Feature activation+6.74e-01

Pos logit contributions
 him+0.044
ged+0.041
 insisting+0.039
 it+0.036
zed+0.036
Neg logit contributions
affe-0.092
aff-0.091
est-0.089
 girl-0.087
 angel-0.086
Token rude
Token ID22066
Feature activation-1.07e+00

Pos logit contributions
 him+0.012
 going+0.011
 it+0.011
 insisting+0.011
 gonna+0.011
Neg logit contributions
aff-0.013
affe-0.013
est-0.013
umpy-0.012
ess-0.011
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
aff+0.022
est+0.021
affe+0.021
umm+0.019
ari+0.019
Neg logit contributions
 him-0.018
 it-0.018
 going-0.017
 gonna-0.016
 insisting-0.015
Token He
Token ID679
Feature activation+3.06e+00

Pos logit contributions
 him+0.018
 it+0.017
 going+0.016
 gonna+0.015
 your+0.015
Neg logit contributions
est-0.025
aff-0.025
affe-0.024
ari-0.022
 sky-0.022
Token should
Token ID815
Feature activation+9.34e+00

Pos logit contributions
 him+0.047
 it+0.045
 going+0.045
 insisting+0.043
 paying+0.042
Neg logit contributions
affe-0.062
est-0.062
aff-0.061
umpy-0.057
 sky-0.054
Token not
Token ID407
Feature activation+5.09e+00

Pos logit contributions
 him+0.079
 going+0.076
 it+0.073
 gonna+0.069
 insisting+0.066
Neg logit contributions
aff-0.253
est-0.253
affe-0.252
ess-0.231
umpy-0.231
Token do
Token ID466
Feature activation+5.02e+00

Pos logit contributions
 him+0.069
 going+0.066
 it+0.065
 gonna+0.061
 insisting+0.060
Neg logit contributions
aff-0.116
est-0.116
affe-0.115
umpy-0.105
bird-0.104
Token that
Token ID326
Feature activation+6.69e-02

Pos logit contributions
 gonna+0.050
 insisting+0.047
 him+0.043
 going+0.043
 managed+0.042
Neg logit contributions
aff-0.116
est-0.113
affe-0.112
 sky-0.104
 girl-0.104
Token.
Token ID13
Feature activation+1.48e+00

Pos logit contributions
 him+0.001
 going+0.001
 gonna+0.001
 it+0.001
 insisting+0.001
Neg logit contributions
est-0.001
affe-0.001
aff-0.001
 sky-0.001
umpy-0.001
Token He
Token ID679
Feature activation+3.00e+00

Pos logit contributions
 him+0.024
 it+0.023
 going+0.022
 gonna+0.020
 his+0.020
Neg logit contributions
aff-0.032
est-0.031
affe-0.030
ari-0.029
andel-0.028
Token important
Token ID1593
Feature activation-5.32e-01

Pos logit contributions
aff+0.000
est+0.000
affe+0.000
ari+0.000
umm+0.000
Neg logit contributions
 him-0.000
 it-0.000
 going-0.000
 gonna-0.000
 insisting-0.000
Token to
Token ID284
Feature activation+6.04e+00

Pos logit contributions
est+0.012
aff+0.012
affe+0.012
umpy+0.011
ess+0.011
Neg logit contributions
 him-0.008
 going-0.007
 it-0.007
 gonna-0.007
 insisting-0.007
Token remember
Token ID3505
Feature activation+6.75e+00

Pos logit contributions
 him+0.081
 going+0.079
 it+0.077
 gonna+0.073
 insisting+0.072
Neg logit contributions
est-0.134
aff-0.131
affe-0.129
bird-0.120
 sky-0.120
Token that
Token ID326
Feature activation+2.76e+00

Pos logit contributions
 him+0.068
 going+0.065
 gonna+0.062
 insisting+0.060
 paying+0.058
Neg logit contributions
est-0.168
aff-0.160
affe-0.156
 sky-0.153
bird-0.147
Token we
Token ID356
Feature activation+4.84e+00

Pos logit contributions
 him+0.028
 going+0.026
 gonna+0.025
 it+0.024
 insisting+0.023
Neg logit contributions
est-0.072
affe-0.068
aff-0.068
 sky-0.065
 girl-0.064
Token should
Token ID815
Feature activation+9.34e+00

Pos logit contributions
 him+0.073
 going+0.070
 it+0.069
 gonna+0.065
 insisting+0.065
Neg logit contributions
est-0.101
aff-0.100
affe-0.100
bird-0.091
ari-0.090
Token still
Token ID991
Feature activation+6.97e+00

Pos logit contributions
 him+0.117
 going+0.112
 it+0.110
 gonna+0.104
 insisting+0.102
Neg logit contributions
est-0.218
aff-0.216
affe-0.214
ess-0.196
bird-0.196
Token be
Token ID307
Feature activation+3.30e+00

Pos logit contributions
 him+0.095
 going+0.091
 it+0.090
 gonna+0.082
 insisting+0.080
Neg logit contributions
est-0.161
aff-0.159
affe-0.157
ari-0.144
umpy-0.144
Token proud
Token ID6613
Feature activation+5.39e-01

Pos logit contributions
 him+0.057
 going+0.050
 it+0.050
 gonna+0.049
 insisting+0.047
Neg logit contributions
est-0.062
 girl-0.055
aff-0.054
umpy-0.054
affe-0.054
Token of
Token ID286
Feature activation+3.52e+00

Pos logit contributions
 him+0.012
 it+0.012
 going+0.011
 gonna+0.011
 insisting+0.010
Neg logit contributions
aff-0.008
affe-0.008
est-0.008
ari-0.007
ess-0.007
Token the
Token ID262
Feature activation-9.98e+00

Pos logit contributions
 going+0.027
 gonna+0.027
 him+0.026
 insisting+0.024
 paying+0.024
Neg logit contributions
aff-0.091
est-0.090
affe-0.088
 angel-0.083
bird-0.083
Token money
Token ID1637
Feature activation+2.05e+00

Pos logit contributions
est+0.096
aff+0.096
affe+0.094
ari+0.083
umpy+0.083
Neg logit contributions
 him-0.101
 going-0.098
 it-0.097
 gonna-0.091
 insisting-0.089
Token,
Token ID11
Feature activation+4.68e+00

Pos logit contributions
 him+0.036
 going+0.035
 it+0.034
 gonna+0.034
 insisting+0.033
Neg logit contributions
est-0.037
aff-0.036
affe-0.035
ari-0.033
ess-0.032
Token and
Token ID290
Feature activation+4.94e+00

Pos logit contributions
 him+0.075
 going+0.072
 it+0.070
 gonna+0.067
 insisting+0.066
Neg logit contributions
est-0.095
aff-0.092
affe-0.091
 sky-0.083
ess-0.082
Token that
Token ID326
Feature activation+3.01e+00

Pos logit contributions
 him+0.050
pped+0.046
 gonna+0.045
 going+0.045
 insisting+0.043
Neg logit contributions
est-0.115
 girl-0.111
affe-0.107
aff-0.107
 sky-0.105
Token we
Token ID356
Feature activation+4.81e+00

Pos logit contributions
 him+0.034
 gonna+0.032
 going+0.030
 insisting+0.028
 able+0.028
Neg logit contributions
est-0.074
affe-0.067
 girl-0.067
 sky-0.067
aff-0.066
Token should
Token ID815
Feature activation+9.34e+00

Pos logit contributions
 him+0.076
 going+0.075
 insisting+0.072
 it+0.071
 paying+0.070
Neg logit contributions
affe-0.093
est-0.093
aff-0.092
umpy-0.084
ari-0.083
Token always
Token ID1464
Feature activation+6.70e+00

Pos logit contributions
 him+0.118
 going+0.113
 it+0.110
 gonna+0.108
 insisting+0.108
Neg logit contributions
est-0.209
aff-0.208
affe-0.206
ess-0.192
 sky-0.188
Token save
Token ID3613
Feature activation+3.84e+00

Pos logit contributions
 him+0.095
 it+0.091
 gonna+0.090
 going+0.089
 insisting+0.086
Neg logit contributions
est-0.133
aff-0.126
affe-0.125
ess-0.118
 girl-0.117
Token some
Token ID617
Feature activation-1.64e+00

Pos logit contributions
 going+0.043
 him+0.042
 gonna+0.041
 insisting+0.039
 it+0.038
Neg logit contributions
est-0.093
aff-0.092
affe-0.088
ess-0.083
 girl-0.082
Token for
Token ID329
Feature activation+3.93e+00

Pos logit contributions
est+0.032
aff+0.031
affe+0.031
umpy+0.028
ari+0.027
Neg logit contributions
 him-0.029
 going-0.028
 it-0.027
 gonna-0.026
 insisting-0.025
Token later
Token ID1568
Feature activation+1.90e+00

Pos logit contributions
 gonna+0.043
 him+0.041
 insisting+0.039
 going+0.038
 managed+0.038
Neg logit contributions
aff-0.080
est-0.077
affe-0.076
 girl-0.073
ess-0.073
Token're
Token ID821
Feature activation+2.09e+00

Pos logit contributions
 him+0.046
 going+0.045
 it+0.045
 gonna+0.041
 paying+0.039
Neg logit contributions
aff-0.051
est-0.050
affe-0.049
umm-0.045
andel-0.044
Token welcome
Token ID7062
Feature activation+5.41e-02

Pos logit contributions
 him+0.044
 it+0.042
 going+0.041
 insisting+0.041
 paying+0.040
Neg logit contributions
est-0.031
aff-0.029
affe-0.028
 sky-0.027
umpy-0.026
Token,
Token ID11
Feature activation+1.19e+00

Pos logit contributions
 him+0.001
 it+0.001
 going+0.001
 gonna+0.001
 your+0.001
Neg logit contributions
aff-0.001
est-0.001
affe-0.001
umpy-0.001
andel-0.001
Token Mr
Token ID1770
Feature activation-9.48e+00

Pos logit contributions
 it+0.017
 him+0.016
 going+0.016
 gonna+0.014
 your+0.014
Neg logit contributions
aff-0.029
est-0.028
affe-0.027
ess-0.026
ari-0.026
Token.
Token ID13
Feature activation-9.60e+00

Pos logit contributions
est+0.210
aff+0.203
affe+0.200
andel+0.200
umpy+0.188
Neg logit contributions
 him-0.145
 it-0.138
 going-0.128
 what-0.115
 his-0.114
Token W
Token ID370
Feature activation-1.69e+01

Pos logit contributions
aff+0.183
est+0.182
affe+0.177
umpy+0.157
ess+0.157
Neg logit contributions
 him-0.173
 going-0.168
 it-0.166
 gonna-0.156
 insisting-0.151
Tokenagon
Token ID1840
Feature activation-2.14e+00

Pos logit contributions
est+0.299
aff+0.297
affe+0.281
bird+0.270
 sky+0.269
Neg logit contributions
 him-0.309
 going-0.307
 it-0.296
 gonna-0.292
 insisting-0.271
Token!
Token ID0
Feature activation+1.13e-01

Pos logit contributions
aff+0.047
affe+0.045
est+0.044
andel+0.043
umm+0.043
Neg logit contributions
 him-0.032
 it-0.032
 going-0.031
 his-0.027
 gonna-0.027
Token We
Token ID775
Feature activation+4.59e+00

Pos logit contributions
 him+0.002
 it+0.002
 going+0.001
 gonna+0.001
 your+0.001
Neg logit contributions
aff-0.003
est-0.003
affe-0.003
andel-0.002
ari-0.002
Token will
Token ID481
Feature activation+6.40e+00

Pos logit contributions
 him+0.067
 going+0.064
 it+0.064
 gonna+0.057
 paying+0.054
Neg logit contributions
aff-0.104
est-0.104
affe-0.101
umm-0.095
ari-0.093
Token play
Token ID711
Feature activation+1.26e+00

Pos logit contributions
 him+0.087
 it+0.082
 going+0.082
 insisting+0.078
 gonna+0.077
Neg logit contributions
est-0.144
aff-0.142
affe-0.138
ess-0.129
 sky-0.128
Token "
Token ID366
Feature activation+2.49e+00

Pos logit contributions
 him+0.060
 gonna+0.055
 going+0.055
 able+0.053
 insisting+0.053
Neg logit contributions
aff-0.049
affe-0.047
 sky-0.045
est-0.045
ess-0.043
TokenHello
Token ID15496
Feature activation-3.91e+00

Pos logit contributions
 him+0.040
 it+0.038
 going+0.037
 insisting+0.033
 gonna+0.032
Neg logit contributions
aff-0.055
est-0.054
affe-0.053
ari-0.049
ess-0.048
Token,
Token ID11
Feature activation+2.73e-01

Pos logit contributions
aff+0.088
affe+0.085
est+0.085
umpy+0.080
umm+0.077
Neg logit contributions
 it-0.058
 him-0.057
 going-0.057
 gonna-0.051
 what-0.050
Token Mr
Token ID1770
Feature activation-9.82e+00

Pos logit contributions
 him+0.004
 it+0.004
 going+0.004
 gonna+0.004
 what+0.004
Neg logit contributions
aff-0.006
est-0.006
affe-0.006
umpy-0.005
umm-0.005
Token.
Token ID13
Feature activation-1.03e+01

Pos logit contributions
est+0.217
aff+0.209
affe+0.208
andel+0.197
umm+0.192
Neg logit contributions
 him-0.151
 it-0.146
 going-0.138
 what-0.123
 your-0.121
Token W
Token ID370
Feature activation-1.67e+01

Pos logit contributions
aff+0.188
est+0.185
affe+0.181
umpy+0.162
ari+0.158
Neg logit contributions
 him-0.193
 going-0.189
 it-0.188
 gonna-0.176
 insisting-0.169
Tokenagon
Token ID1840
Feature activation-3.46e+00

Pos logit contributions
est+0.284
aff+0.281
affe+0.266
bird+0.256
 sky+0.251
Neg logit contributions
 him-0.321
 going-0.317
 it-0.306
 gonna-0.305
 insisting-0.281
Token!
Token ID0
Feature activation-2.90e-01

Pos logit contributions
aff+0.078
affe+0.075
est+0.075
andel+0.070
umm+0.070
Neg logit contributions
 him-0.050
 it-0.050
 going-0.049
 gonna-0.043
 his-0.041
Token Why
Token ID4162
Feature activation+4.41e+00

Pos logit contributions
est+0.007
aff+0.007
affe+0.007
ess+0.006
umm+0.006
Neg logit contributions
 him-0.004
 it-0.004
 going-0.004
 gonna-0.003
 insisting-0.003
Token are
Token ID389
Feature activation+1.15e+00

Pos logit contributions
 him+0.076
 going+0.074
 it+0.072
 insisting+0.070
 gonna+0.070
Neg logit contributions
est-0.085
aff-0.083
affe-0.082
umpy-0.074
ari-0.074
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 insisting+0.013
 him+0.012
 going+0.012
 gonna+0.011
 suggesting+0.011
Neg logit contributions
aff-0.027
affe-0.026
est-0.025
 girl-0.025
 sky-0.024
Token She
Token ID1375
Feature activation+1.45e+00

Pos logit contributions
est+0.009
aff+0.009
affe+0.009
umpy+0.008
 sky+0.008
Neg logit contributions
 him-0.011
 going-0.010
 it-0.010
 gonna-0.010
 insisting-0.009
Token said
Token ID531
Feature activation+2.37e+00

Pos logit contributions
 him+0.022
 going+0.021
 it+0.020
 gonna+0.020
 paying+0.020
Neg logit contributions
est-0.029
affe-0.029
aff-0.028
 girl-0.027
 sky-0.027
Token to
Token ID284
Feature activation+4.45e+00

Pos logit contributions
 gonna+0.037
 going+0.037
 him+0.036
 insisting+0.035
 paying+0.035
Neg logit contributions
est-0.046
affe-0.045
aff-0.043
 girl-0.040
bird-0.039
Token Mr
Token ID1770
Feature activation-9.73e+00

Pos logit contributions
 going+0.068
 gonna+0.068
 it+0.067
 insisting+0.065
 him+0.065
Neg logit contributions
aff-0.084
est-0.083
affe-0.080
 girl-0.074
 sky-0.074
Token.
Token ID13
Feature activation-7.70e+00

Pos logit contributions
est+0.202
aff+0.202
affe+0.199
andel+0.184
ari+0.182
Neg logit contributions
 him-0.164
 it-0.153
 going-0.148
 his-0.134
 what-0.131
Token Sp
Token ID1338
Feature activation-1.65e+01

Pos logit contributions
est+0.156
aff+0.155
affe+0.152
umpy+0.138
ari+0.137
Neg logit contributions
 him-0.126
 going-0.122
 it-0.121
 gonna-0.112
 insisting-0.110
Tokenud
Token ID463
Feature activation-1.09e+00

Pos logit contributions
aff+0.286
est+0.276
affe+0.270
 sky+0.252
ari+0.249
Neg logit contributions
 him-0.319
 going-0.308
 it-0.296
 insisting-0.283
 gonna-0.278
Token,
Token ID11
Feature activation+3.55e+00

Pos logit contributions
est+0.021
aff+0.020
affe+0.020
bird+0.018
ess+0.018
Neg logit contributions
 him-0.019
 going-0.019
 it-0.018
 gonna-0.018
 insisting-0.017
Token "
Token ID366
Feature activation+2.24e+00

Pos logit contributions
 him+0.066
 gonna+0.064
 paying+0.062
pped+0.062
 going+0.061
Neg logit contributions
 sky-0.051
 girl-0.050
affe-0.050
est-0.048
aff-0.046
TokenI
Token ID40
Feature activation+2.43e+00

Pos logit contributions
 him+0.032
 it+0.031
 going+0.030
 insisting+0.027
 gonna+0.027
Neg logit contributions
aff-0.052
est-0.051
affe-0.050
ari-0.046
ess-0.045
Token want
Token ID765
Feature activation+5.24e+00

Pos logit contributions
 him+0.041
 going+0.039
 it+0.038
 insisting+0.037
 gonna+0.037
Neg logit contributions
est-0.048
affe-0.045
aff-0.044
umpy-0.043
 girl-0.042
Token,
Token ID11
Feature activation+5.86e-01

Pos logit contributions
 him+0.000
 going+0.000
 it+0.000
 gonna+0.000
 insisting+0.000
Neg logit contributions
est-0.000
aff-0.000
affe-0.000
 sky-0.000
bird-0.000
Token Lily
Token ID20037
Feature activation-2.31e+00

Pos logit contributions
 him+0.010
 going+0.010
 gonna+0.010
 insisting+0.010
 it+0.009
Neg logit contributions
est-0.011
aff-0.011
affe-0.010
 girl-0.010
 sky-0.010
Token and
Token ID290
Feature activation-2.23e-01

Pos logit contributions
est+0.046
aff+0.045
affe+0.044
umpy+0.040
 sky+0.040
Neg logit contributions
 him-0.039
 going-0.037
 it-0.036
 gonna-0.035
 insisting-0.034
Token Mr
Token ID1770
Feature activation-1.01e+01

Pos logit contributions
est+0.005
aff+0.005
affe+0.005
 girl+0.005
 sky+0.005
Neg logit contributions
 gonna-0.003
 going-0.003
 him-0.003
 it-0.003
 insisting-0.002
Token.
Token ID13
Feature activation-8.63e+00

Pos logit contributions
est+0.219
affe+0.214
aff+0.212
andel+0.199
ari+0.197
Neg logit contributions
 him-0.165
 it-0.147
 going-0.145
 his-0.129
 what-0.128
Token Sp
Token ID1338
Feature activation-1.65e+01

Pos logit contributions
est+0.164
aff+0.160
affe+0.154
 sky+0.144
bird+0.144
Neg logit contributions
 him-0.152
 going-0.147
 it-0.146
 gonna-0.136
 insisting-0.134
Tokenud
Token ID463
Feature activation-9.14e-01

Pos logit contributions
aff+0.294
est+0.285
affe+0.283
 sky+0.266
ari+0.257
Neg logit contributions
 him-0.305
 going-0.291
 it-0.276
 insisting-0.267
 gonna-0.259
Token went
Token ID1816
Feature activation+2.32e+00

Pos logit contributions
est+0.018
aff+0.017
affe+0.017
umpy+0.015
ari+0.015
Neg logit contributions
 him-0.016
 going-0.016
 it-0.016
 gonna-0.015
 insisting-0.014
Token to
Token ID284
Feature activation+3.02e+00

Pos logit contributions
 him+0.027
 gonna+0.027
 it+0.026
 insisting+0.025
 paying+0.025
Neg logit contributions
ari-0.051
aff-0.051
 sky-0.050
est-0.050
 girl-0.049
Token the
Token ID262
Feature activation-8.50e+00

Pos logit contributions
 gonna+0.036
pped+0.035
 insisting+0.034
 able+0.034
 going+0.033
Neg logit contributions
 girl-0.064
aff-0.064
est-0.063
bird-0.062
 sky-0.061
Token park
Token ID3952
Feature activation+7.24e-01

Pos logit contributions
est+0.099
aff+0.097
affe+0.092
 sky+0.089
ari+0.082
Neg logit contributions
 him-0.204
 it-0.200
 going-0.198
 gonna-0.194
 insisting-0.193
Token girl
Token ID2576
Feature activation-1.20e+00

Pos logit contributions
aff+0.113
affe+0.101
ickets+0.088
ess+0.085
umpy+0.082
Neg logit contributions
 it-0.281
 him-0.270
 going-0.264
 his-0.259
 paying-0.247
Token walked
Token ID6807
Feature activation+1.99e+00

Pos logit contributions
est+0.029
aff+0.028
affe+0.028
ari+0.026
bird+0.026
Neg logit contributions
 him-0.015
 going-0.015
 it-0.014
 gonna-0.013
 insisting-0.013
Token up
Token ID510
Feature activation+2.26e+00

Pos logit contributions
 going+0.023
 him+0.023
 it+0.022
 gonna+0.021
 paying+0.020
Neg logit contributions
est-0.049
aff-0.048
affe-0.046
 girl-0.045
 sky-0.044
Token to
Token ID284
Feature activation+3.39e+00

Pos logit contributions
 him+0.020
 going+0.019
 gonna+0.019
 it+0.017
 paying+0.016
Neg logit contributions
aff-0.060
est-0.060
affe-0.057
 sky-0.057
bird-0.057
Token the
Token ID262
Feature activation-1.07e+01

Pos logit contributions
 gonna+0.027
 going+0.027
 insisting+0.024
 able+0.023
 paying+0.022
Neg logit contributions
est-0.091
aff-0.090
affe-0.088
bird-0.086
 girl-0.086
Token bald
Token ID28273
Feature activation-1.64e+01

Pos logit contributions
aff+0.192
est+0.185
affe+0.180
umm+0.173
bird+0.163
Neg logit contributions
 it-0.226
 him-0.216
 going-0.216
 his-0.198
 gonna-0.193
Token man
Token ID582
Feature activation-1.33e+00

Pos logit contributions
umm+0.256
aff+0.246
est+0.240
bird+0.231
umpy+0.225
Neg logit contributions
 it-0.351
 him-0.344
 going-0.321
 his-0.315
 paying-0.290
Token and
Token ID290
Feature activation+2.73e+00

Pos logit contributions
est+0.024
aff+0.024
affe+0.024
bird+0.022
ari+0.021
Neg logit contributions
 him-0.024
 going-0.023
 it-0.023
 gonna-0.022
 insisting-0.021
Token asked
Token ID1965
Feature activation+5.01e+00

Pos logit contributions
 gonna+0.043
 going+0.043
 him+0.043
 insisting+0.039
 preventing+0.038
Neg logit contributions
 girl-0.050
affe-0.048
est-0.048
aff-0.048
bird-0.047
Token,
Token ID11
Feature activation+3.21e+00

Pos logit contributions
 gonna+0.046
 going+0.045
 paying+0.039
 insisting+0.038
 him+0.038
Neg logit contributions
est-0.133
aff-0.130
affe-0.127
bird-0.120
 girl-0.119
Token "
Token ID366
Feature activation+2.79e+00

Pos logit contributions
 him+0.058
 going+0.054
 gonna+0.053
 it+0.052
 paying+0.050
Neg logit contributions
aff-0.053
est-0.053
affe-0.051
bird-0.051
 girl-0.050
Token use
Token ID779
Feature activation+2.36e+00

Pos logit contributions
est+0.033
aff+0.032
affe+0.031
umpy+0.028
 girl+0.028
Neg logit contributions
 him-0.024
 going-0.023
 it-0.023
 gonna-0.022
 insisting-0.022
Token.
Token ID13
Feature activation-4.93e-01

Pos logit contributions
 him+0.044
 going+0.042
 gonna+0.042
 insisting+0.042
 it+0.041
Neg logit contributions
est-0.039
aff-0.036
affe-0.036
 sky-0.033
umpy-0.032
Token
Token ID198
Feature activation-3.87e+00

Pos logit contributions
aff+0.012
affe+0.012
est+0.012
andel+0.012
umpy+0.011
Neg logit contributions
 him-0.007
 it-0.007
 his-0.006
 going-0.005
 an-0.005
Token
Token ID198
Feature activation-3.55e+00

Pos logit contributions
est+0.089
aff+0.087
affe+0.086
umm+0.082
andel+0.078
Neg logit contributions
 him-0.058
 it-0.057
 going-0.052
 his-0.048
 gonna-0.046
TokenThe
Token ID464
Feature activation-1.26e+01

Pos logit contributions
aff+0.085
est+0.083
umm+0.080
affe+0.078
andel+0.075
Neg logit contributions
 him-0.050
 it-0.048
 going-0.046
 his-0.046
 not-0.040
Token bald
Token ID28273
Feature activation-1.64e+01

Pos logit contributions
aff+0.195
est+0.194
affe+0.186
ess+0.167
bird+0.165
Neg logit contributions
 him-0.273
 going-0.272
 it-0.266
 gonna-0.247
 paying-0.246
Token man
Token ID582
Feature activation-1.25e+00

Pos logit contributions
aff+0.237
est+0.227
umm+0.220
bird+0.209
affe+0.209
Neg logit contributions
 it-0.367
 him-0.358
 going-0.350
 his-0.335
 paying-0.324
Token went
Token ID1816
Feature activation+2.34e+00

Pos logit contributions
est+0.030
aff+0.030
affe+0.029
ari+0.027
umpy+0.027
Neg logit contributions
 him-0.016
 going-0.015
 it-0.015
 gonna-0.014
 insisting-0.013
Token to
Token ID284
Feature activation+3.33e+00

Pos logit contributions
 him+0.025
 gonna+0.023
 it+0.023
pped+0.021
 going+0.021
Neg logit contributions
aff-0.055
est-0.053
 sky-0.053
 girl-0.052
ari-0.051
Token find
Token ID1064
Feature activation+2.96e+00

Pos logit contributions
 gonna+0.032
 insisting+0.030
 going+0.030
 able+0.029
ged+0.029
Neg logit contributions
est-0.077
aff-0.076
 girl-0.075
 sky-0.074
bird-0.073
Token another
Token ID1194
Feature activation-9.88e+00

Pos logit contributions
 going+0.023
 gonna+0.021
 him+0.020
 insisting+0.020
 it+0.018
Neg logit contributions
est-0.085
affe-0.081
aff-0.081
 sky-0.077
 girl-0.076
Token so
Token ID523
Feature activation+6.00e-01

Pos logit contributions
 him+0.023
 going+0.021
 it+0.021
 gonna+0.020
 insisting+0.019
Neg logit contributions
est-0.030
aff-0.029
affe-0.029
umpy-0.026
bird-0.026
Token en
Token ID551
Feature activation-3.21e+00

Pos logit contributions
 him+0.011
 going+0.010
 it+0.010
 gonna+0.010
 insisting+0.009
Neg logit contributions
est-0.011
aff-0.011
affe-0.011
umpy-0.010
 sky-0.010
Tokenvious
Token ID1442
Feature activation-1.59e+00

Pos logit contributions
affe+0.051
aff+0.049
est+0.048
ess+0.044
umm+0.043
Neg logit contributions
 him-0.066
 it-0.064
 going-0.063
 gonna-0.059
 his-0.057
Token she
Token ID673
Feature activation+1.35e+00

Pos logit contributions
aff+0.034
est+0.034
affe+0.034
umm+0.031
ari+0.030
Neg logit contributions
 it-0.025
 him-0.025
 going-0.024
 gonna-0.021
 what-0.020
Token wanted
Token ID2227
Feature activation+5.39e+00

Pos logit contributions
 him+0.017
 going+0.016
 it+0.016
 gonna+0.015
 insisting+0.014
Neg logit contributions
est-0.032
aff-0.032
affe-0.032
umpy-0.029
ess-0.029
Token Tr
Token ID833
Feature activation-1.62e+01

Pos logit contributions
 him+0.038
 going+0.036
 it+0.032
 gonna+0.032
 insisting+0.030
Neg logit contributions
est-0.156
aff-0.156
affe-0.152
 sky-0.143
bird-0.143
Tokenina
Token ID1437
Feature activation-3.90e+00

Pos logit contributions
affe+0.290
est+0.271
aff+0.270
 sky+0.242
umm+0.240
Neg logit contributions
 going-0.300
 him-0.298
 it-0.290
 gonna-0.277
 insisting-0.271
Token's
Token ID338
Feature activation-7.36e+00

Pos logit contributions
est+0.095
aff+0.095
affe+0.093
ari+0.086
umpy+0.086
Neg logit contributions
 him-0.048
 going-0.046
 it-0.046
 gonna-0.041
 insisting-0.040
Token dress
Token ID6576
Feature activation-6.58e-01

Pos logit contributions
affe+0.139
aff+0.136
umpy+0.119
est+0.119
umm+0.119
Neg logit contributions
 it-0.152
 going-0.146
 him-0.142
 your-0.129
 what-0.128
Token.
Token ID13
Feature activation-3.67e-01

Pos logit contributions
aff+0.015
affe+0.015
est+0.014
umm+0.014
andel+0.014
Neg logit contributions
 him-0.009
 it-0.009
 going-0.009
 his-0.008
 not-0.008
Token
Token ID198
Feature activation-3.02e+00

Pos logit contributions
est+0.008
affe+0.008
aff+0.007
umm+0.007
umpy+0.007
Neg logit contributions
 it-0.006
 him-0.006
 going-0.006
 gonna-0.005
 his-0.005
Token.
Token ID13
Feature activation-4.11e-01

Pos logit contributions
aff+0.000
est+0.000
affe+0.000
ari+0.000
umm+0.000
Neg logit contributions
 him-0.000
 going-0.000
 it-0.000
 gonna-0.000
 paying-0.000
Token
Token ID220
Feature activation-2.93e+00

Pos logit contributions
est+0.010
affe+0.009
aff+0.009
umm+0.009
andel+0.009
Neg logit contributions
 him-0.007
 it-0.006
 his-0.006
 going-0.005
 gonna-0.005
Token
Token ID198
Feature activation-3.44e+00

Pos logit contributions
est+0.063
aff+0.061
affe+0.061
umm+0.056
ari+0.055
Neg logit contributions
 him-0.047
 it-0.045
 going-0.044
 gonna-0.040
 insisting-0.039
Token
Token ID198
Feature activation-3.15e+00

Pos logit contributions
est+0.072
aff+0.071
affe+0.070
umm+0.064
ari+0.063
Neg logit contributions
 him-0.056
 it-0.054
 going-0.053
 gonna-0.048
 insisting-0.047
TokenThe
Token ID464
Feature activation-1.31e+01

Pos logit contributions
est+0.065
aff+0.065
affe+0.063
umm+0.058
ari+0.058
Neg logit contributions
 him-0.052
 it-0.050
 going-0.049
 gonna-0.045
 insisting-0.044
Token gr
Token ID1036
Feature activation-1.62e+01

Pos logit contributions
aff+0.237
affe+0.234
est+0.222
ess+0.217
umm+0.216
Neg logit contributions
 going-0.272
 him-0.264
 his-0.237
 it-0.235
 not-0.231
Tokenumpy
Token ID32152
Feature activation-1.34e+01

Pos logit contributions
est+0.135
 girl+0.123
 sky+0.121
umm+0.119
bird+0.115
Neg logit contributions
 him-0.406
 it-0.369
 paying-0.368
 his-0.366
 going-0.365
Token bear
Token ID6842
Feature activation-2.39e+00

Pos logit contributions
aff+0.193
umm+0.191
affe+0.189
pillar+0.176
bird+0.175
Neg logit contributions
 him-0.301
 going-0.293
 his-0.279
 it-0.276
 paying-0.253
Token was
Token ID373
Feature activation+1.59e+00

Pos logit contributions
est+0.055
aff+0.055
affe+0.054
umm+0.050
ari+0.050
Neg logit contributions
 him-0.033
 going-0.031
 it-0.031
 gonna-0.028
 insisting-0.027
Token so
Token ID523
Feature activation-1.89e-01

Pos logit contributions
 him+0.028
 it+0.026
 going+0.026
 gonna+0.025
 insisting+0.024
Neg logit contributions
est-0.029
aff-0.029
affe-0.028
umpy-0.027
 sky-0.026
Token mad
Token ID8805
Feature activation-1.16e+00

Pos logit contributions
est+0.003
aff+0.003
affe+0.003
umpy+0.003
ari+0.003
Neg logit contributions
 him-0.004
 going-0.004
 it-0.003
 gonna-0.003
 insisting-0.003
Token tree
Token ID5509
Feature activation-5.23e-01

Pos logit contributions
aff+0.187
affe+0.181
est+0.176
ess+0.157
aron+0.156
Neg logit contributions
 going-0.240
 it-0.231
 him-0.208
 gonna-0.199
 insisting-0.194
Token and
Token ID290
Feature activation+3.69e+00

Pos logit contributions
aff+0.011
est+0.011
affe+0.010
umm+0.009
umpy+0.009
Neg logit contributions
 him-0.009
 going-0.008
 it-0.008
 gonna-0.008
 paying-0.007
Token playing
Token ID2712
Feature activation+1.69e+00

Pos logit contributions
 gonna+0.039
 him+0.039
 insisting+0.036
 going+0.034
 paying+0.034
Neg logit contributions
 sky-0.081
est-0.081
 girl-0.076
aff-0.073
bird-0.072
Token with
Token ID351
Feature activation+1.30e+00

Pos logit contributions
 him+0.024
 going+0.023
 it+0.022
 gonna+0.021
 insisting+0.021
Neg logit contributions
est-0.038
aff-0.037
affe-0.037
bird-0.033
umpy-0.033
Token the
Token ID262
Feature activation-1.11e+01

Pos logit contributions
 going+0.008
 him+0.007
 gonna+0.006
 insisting+0.006
 it+0.006
Neg logit contributions
est-0.039
aff-0.038
affe-0.038
 sky-0.036
bird-0.036
Token bald
Token ID28273
Feature activation-1.62e+01

Pos logit contributions
aff+0.231
affe+0.211
est+0.201
ess+0.196
ari+0.195
Neg logit contributions
 going-0.220
 it-0.206
 him-0.196
 insisting-0.180
 gonna-0.174
Token man
Token ID582
Feature activation-2.51e+00

Pos logit contributions
aff+0.229
est+0.220
umm+0.220
affe+0.206
ari+0.205
Neg logit contributions
 it-0.371
 him-0.357
 going-0.355
 paying-0.320
 his-0.319
Token.
Token ID13
Feature activation+4.43e-01

Pos logit contributions
est+0.049
aff+0.049
affe+0.048
ari+0.043
umpy+0.043
Neg logit contributions
 him-0.043
 going-0.041
 it-0.041
 gonna-0.038
 insisting-0.038
Token<|endoftext|>
Token ID50256
Feature activation-5.34e+00

Pos logit contributions
 him+0.007
 it+0.007
 going+0.007
 gonna+0.007
 his+0.006
Neg logit contributions
est-0.009
aff-0.009
affe-0.009
umpy-0.008
umm-0.008
TokenOnce
Token ID7454
Feature activation-5.71e+00

Pos logit contributions
est+0.121
affe+0.121
aff+0.120
umm+0.110
andel+0.109
Neg logit contributions
 him-0.081
 it-0.078
 going-0.074
 his-0.065
 gonna-0.064
Token upon
Token ID2402
Feature activation-4.34e-01

Pos logit contributions
est+0.123
aff+0.122
affe+0.120
ari+0.109
umpy+0.109
Neg logit contributions
 him-0.087
 going-0.084
 it-0.083
 gonna-0.076
 insisting-0.075
Token joy
Token ID8716
Feature activation+5.83e-01

Pos logit contributions
 him+0.053
 going+0.053
 gonna+0.050
 insisting+0.049
 it+0.049
Neg logit contributions
est-0.100
aff-0.096
affe-0.095
 sky-0.088
bird-0.088
Token.
Token ID13
Feature activation-4.53e-01

Pos logit contributions
 him+0.010
 going+0.010
 it+0.010
 gonna+0.010
 insisting+0.010
Neg logit contributions
est-0.011
aff-0.011
affe-0.010
ess-0.009
bird-0.009
Token
Token ID198
Feature activation-3.40e+00

Pos logit contributions
est+0.010
aff+0.009
affe+0.009
umpy+0.009
umm+0.009
Neg logit contributions
 him-0.007
 it-0.007
 going-0.007
 his-0.006
 gonna-0.006
Token
Token ID198
Feature activation-3.34e+00

Pos logit contributions
aff+0.064
est+0.063
affe+0.063
umpy+0.056
ari+0.056
Neg logit contributions
 him-0.060
 going-0.059
 it-0.057
 gonna-0.055
 insisting-0.054
TokenThe
Token ID464
Feature activation-1.23e+01

Pos logit contributions
est+0.067
aff+0.067
affe+0.066
umpy+0.061
bird+0.059
Neg logit contributions
 him-0.056
 going-0.054
 it-0.053
 gonna-0.050
 insisting-0.049
Token bald
Token ID28273
Feature activation-1.62e+01

Pos logit contributions
aff+0.178
est+0.172
affe+0.162
ess+0.159
umm+0.146
Neg logit contributions
 going-0.286
 him-0.281
 it-0.274
 paying-0.258
 gonna-0.258
Token monkey
Token ID21657
Feature activation-3.14e+00

Pos logit contributions
umm+0.192
andel+0.189
est+0.178
lights+0.174
aff+0.174
Neg logit contributions
 him-0.377
 it-0.371
 going-0.360
 his-0.356
 not-0.345
Token stretched
Token ID19110
Feature activation+4.19e+00

Pos logit contributions
aff+0.078
est+0.076
affe+0.074
umm+0.071
umpy+0.071
Neg logit contributions
 him-0.038
 it-0.037
 going-0.036
 gonna-0.031
 paying-0.031
Token his
Token ID465
Feature activation-7.12e+00

Pos logit contributions
 going+0.032
 gonna+0.032
 him+0.029
 insisting+0.028
 paying+0.027
Neg logit contributions
est-0.119
affe-0.109
aff-0.109
 sky-0.107
bird-0.106
Token arms
Token ID5101
Feature activation-3.24e-01

Pos logit contributions
aff+0.113
est+0.107
affe+0.104
umpy+0.096
ari+0.096
Neg logit contributions
 him-0.154
 going-0.151
 it-0.149
 gonna-0.140
 his-0.139
Token and
Token ID290
Feature activation+1.32e+00

Pos logit contributions
est+0.007
aff+0.006
affe+0.006
 sky+0.006
bird+0.006
Neg logit contributions
 him-0.005
 going-0.005
 it-0.005
 gonna-0.005
 insisting-0.005
Token and
Token ID290
Feature activation+2.78e+00

Pos logit contributions
 gonna+0.045
 him+0.045
 going+0.044
 it+0.042
 insisting+0.041
Neg logit contributions
est-0.047
aff-0.045
 sky-0.044
affe-0.043
bird-0.042
Token carefully
Token ID7773
Feature activation+2.69e+00

Pos logit contributions
 him+0.042
 gonna+0.041
 going+0.040
 insisting+0.040
 paying+0.038
Neg logit contributions
 girl-0.053
est-0.051
bird-0.049
affe-0.048
aff-0.048
Token picked
Token ID6497
Feature activation+3.64e+00

Pos logit contributions
 gonna+0.039
 your+0.038
 him+0.038
 our+0.036
 going+0.035
Neg logit contributions
est-0.055
aff-0.052
bird-0.051
 sky-0.049
 girl-0.047
Token up
Token ID510
Feature activation+1.71e+00

Pos logit contributions
 going+0.038
 him+0.037
 gonna+0.036
 insisting+0.035
 paying+0.034
Neg logit contributions
est-0.094
aff-0.091
affe-0.090
 sky-0.083
bird-0.083
Token the
Token ID262
Feature activation-1.32e+01

Pos logit contributions
 him+0.011
 it+0.010
 going+0.009
 gonna+0.007
 his+0.006
Neg logit contributions
est-0.053
aff-0.053
affe-0.052
umpy-0.049
ari-0.049
Token hairy
Token ID43873
Feature activation-1.61e+01

Pos logit contributions
aff+0.248
affe+0.231
ari+0.223
est+0.205
ess+0.204
Neg logit contributions
 it-0.260
 going-0.258
 him-0.235
 insisting-0.225
 trying-0.224
Token cater
Token ID20825
Feature activation-1.34e+01

Pos logit contributions
aff+0.271
ari+0.260
affe+0.259
case+0.230
umpy+0.229
Neg logit contributions
 it-0.336
 him-0.317
 going-0.314
 finding-0.285
 trying-0.285
Tokenpillar
Token ID41643
Feature activation-3.70e+00

Pos logit contributions
est+0.118
aff+0.099
 youngest+0.090
ari+0.088
ess+0.085
Neg logit contributions
 it-0.377
 him-0.361
 going-0.345
 gonna-0.331
 this-0.328
Token and
Token ID290
Feature activation+3.18e+00

Pos logit contributions
aff+0.073
est+0.073
affe+0.072
ari+0.065
umpy+0.065
Neg logit contributions
 him-0.064
 it-0.061
 going-0.061
 gonna-0.055
 insisting-0.055
Token put
Token ID1234
Feature activation+3.33e+00

Pos logit contributions
 him+0.046
 going+0.045
 insisting+0.043
 gonna+0.043
 paying+0.040
Neg logit contributions
est-0.062
 girl-0.061
affe-0.060
aff-0.060
 sky-0.057
Token it
Token ID340
Feature activation+2.58e-01

Pos logit contributions
 going+0.021
 paying+0.021
 insisting+0.020
 gonna+0.020
 managing+0.020
Neg logit contributions
est-0.094
 sky-0.089
affe-0.087
aff-0.085
 girl-0.085
Token very
Token ID845
Feature activation+9.43e-01

Pos logit contributions
 him+0.045
 it+0.042
 going+0.042
 gonna+0.039
 insisting+0.038
Neg logit contributions
est-0.067
aff-0.065
affe-0.064
bird-0.060
 sky-0.059
Token excited
Token ID6568
Feature activation+1.42e+00

Pos logit contributions
 him+0.016
 going+0.016
 it+0.015
 gonna+0.015
 insisting+0.014
Neg logit contributions
est-0.018
aff-0.018
affe-0.018
umpy-0.016
ari-0.016
Token and
Token ID290
Feature activation+3.63e+00

Pos logit contributions
 him+0.022
 going+0.021
 gonna+0.021
 it+0.020
 insisting+0.020
Neg logit contributions
est-0.029
aff-0.028
affe-0.027
 sky-0.025
ess-0.025
Token hugged
Token ID41130
Feature activation+3.96e+00

Pos logit contributions
 him+0.050
 going+0.045
 insisting+0.044
 gonna+0.043
 it+0.042
Neg logit contributions
est-0.079
aff-0.077
affe-0.076
 girl-0.073
 sky-0.071
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
 him+0.022
 it+0.018
 going+0.013
 his+0.009
 insisting+0.006
Neg logit contributions
est-0.132
affe-0.130
aff-0.130
umpy-0.127
ari-0.126
Token bald
Token ID28273
Feature activation-1.61e+01

Pos logit contributions
aff+0.187
est+0.180
affe+0.176
ess+0.161
umm+0.160
Neg logit contributions
 going-0.253
 it-0.249
 him-0.246
 gonna-0.225
 paying-0.224
Token man
Token ID582
Feature activation-2.03e+00

Pos logit contributions
aff+0.228
est+0.223
umm+0.220
bird+0.205
andel+0.203
Neg logit contributions
 it-0.352
 him-0.347
 going-0.330
 his-0.318
 paying-0.308
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
aff+0.046
affe+0.043
est+0.043
umm+0.042
umpy+0.041
Neg logit contributions
 him-0.031
 it-0.030
 going-0.029
 his-0.027
 what-0.025
Token
Token ID198
Feature activation-2.62e+00

Pos logit contributions
 him+0.002
 it+0.002
 going+0.002
 gonna+0.002
 his+0.002
Neg logit contributions
est-0.003
affe-0.003
aff-0.003
umpy-0.003
umm-0.002
Token
Token ID198
Feature activation-2.64e+00

Pos logit contributions
est+0.052
aff+0.051
affe+0.050
umpy+0.045
ari+0.045
Neg logit contributions
 him-0.045
 going-0.043
 it-0.043
 gonna-0.040
 insisting-0.039
TokenEvery
Token ID6109
Feature activation-3.27e+00

Pos logit contributions
aff+0.050
est+0.050
affe+0.049
umpy+0.045
bird+0.044
Neg logit contributions
 him-0.047
 going-0.046
 it-0.045
 gonna-0.042
 insisting-0.041
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
 him+0.015
 going+0.015
 it+0.014
 gonna+0.013
 insisting+0.013
Neg logit contributions
est-0.016
aff-0.016
affe-0.016
umpy-0.014
ari-0.014
Token
Token ID220
Feature activation-2.74e+00

Pos logit contributions
est+0.030
aff+0.030
affe+0.029
 angel+0.028
andel+0.027
Neg logit contributions
 him-0.016
 it-0.015
 his-0.013
 going-0.012
 what-0.011
Token
Token ID198
Feature activation-3.72e+00

Pos logit contributions
est+0.060
aff+0.059
affe+0.058
umm+0.053
ari+0.052
Neg logit contributions
 him-0.043
 it-0.041
 going-0.040
 gonna-0.036
 his-0.036
Token
Token ID198
Feature activation-3.39e+00

Pos logit contributions
est+0.086
aff+0.085
affe+0.084
umm+0.080
andel+0.076
Neg logit contributions
 him-0.055
 it-0.054
 going-0.049
 his-0.046
 insisting-0.044
TokenThe
Token ID464
Feature activation-1.20e+01

Pos logit contributions
est+0.070
aff+0.069
affe+0.067
umm+0.062
ari+0.061
Neg logit contributions
 him-0.057
 it-0.055
 going-0.055
 gonna-0.049
 paying-0.049
Token bald
Token ID28273
Feature activation-1.60e+01

Pos logit contributions
aff+0.186
est+0.182
affe+0.176
bird+0.159
ess+0.159
Neg logit contributions
 going-0.273
 him-0.266
 it-0.261
 paying-0.243
 insisting-0.243
Token man
Token ID582
Feature activation-1.01e+00

Pos logit contributions
aff+0.229
est+0.228
umm+0.222
bird+0.213
andel+0.207
Neg logit contributions
 it-0.349
 him-0.346
 going-0.337
 his-0.320
 paying-0.311
Token was
Token ID373
Feature activation+1.94e+00

Pos logit contributions
est+0.023
aff+0.023
affe+0.023
ari+0.021
bird+0.021
Neg logit contributions
 him-0.014
 going-0.013
 it-0.013
 gonna-0.012
 insisting-0.012
Token feeling
Token ID4203
Feature activation+2.53e+00

Pos logit contributions
 him+0.037
 it+0.034
 gonna+0.033
 going+0.033
 insisting+0.031
Neg logit contributions
est-0.030
 sky-0.028
umpy-0.028
aff-0.028
 girl-0.027
Token very
Token ID845
Feature activation-3.10e-01

Pos logit contributions
 him+0.046
 going+0.044
 it+0.044
 gonna+0.044
 insisting+0.042
Neg logit contributions
est-0.044
umpy-0.040
 sky-0.039
affe-0.039
aff-0.039
Token sad
Token ID6507
Feature activation-1.30e-02

Pos logit contributions
est+0.004
umpy+0.004
aff+0.004
affe+0.004
 girl+0.004
Neg logit contributions
 him-0.007
 it-0.006
 going-0.006
 gonna-0.006
 insisting-0.006
Token home
Token ID1363
Feature activation+1.61e+00

Pos logit contributions
 going+0.034
 him+0.033
 it+0.031
 gonna+0.031
 insisting+0.031
Neg logit contributions
est-0.048
aff-0.047
affe-0.045
 sky-0.042
ess-0.041
Token.
Token ID13
Feature activation-1.58e-01

Pos logit contributions
 him+0.030
 going+0.029
 it+0.028
 gonna+0.028
 insisting+0.027
Neg logit contributions
est-0.029
aff-0.028
affe-0.027
ess-0.025
umpy-0.025
Token
Token ID198
Feature activation-3.60e+00

Pos logit contributions
est+0.004
aff+0.004
affe+0.003
andel+0.003
umm+0.003
Neg logit contributions
 him-0.002
 it-0.002
 going-0.002
 his-0.002
 gonna-0.002
Token
Token ID198
Feature activation-3.18e+00

Pos logit contributions
est+0.077
aff+0.076
affe+0.074
umm+0.070
ari+0.067
Neg logit contributions
 him-0.058
 it-0.056
 going-0.054
 gonna-0.048
 insisting-0.048
TokenThe
Token ID464
Feature activation-1.24e+01

Pos logit contributions
est+0.071
aff+0.070
affe+0.068
umm+0.063
ari+0.062
Neg logit contributions
 him-0.048
 it-0.046
 going-0.045
 gonna-0.041
 insisting-0.040
Token oct
Token ID19318
Feature activation-1.60e+01

Pos logit contributions
aff+0.232
est+0.227
affe+0.227
ess+0.215
bows+0.206
Neg logit contributions
 him-0.237
 going-0.236
 it-0.236
 his-0.212
 what-0.200
Tokenopus
Token ID25790
Feature activation-2.26e+00

Pos logit contributions
est+0.236
aff+0.194
affe+0.193
 girl+0.178
andel+0.177
Neg logit contributions
 him-0.367
 it-0.355
 going-0.347
 paying-0.329
 gonna-0.325
Token sw
Token ID1509
Feature activation-7.74e+00

Pos logit contributions
aff+0.057
est+0.056
affe+0.055
umm+0.052
umpy+0.051
Neg logit contributions
 him-0.027
 it-0.026
 going-0.025
 gonna-0.022
 insisting-0.021
Tokenam
Token ID321
Feature activation+1.46e+00

Pos logit contributions
est+0.157
aff+0.156
affe+0.151
ess+0.138
umpy+0.137
Neg logit contributions
 going-0.132
 him-0.129
 it-0.129
 insisting-0.120
 paying-0.118
Token and
Token ID290
Feature activation+1.68e+00

Pos logit contributions
 going+0.021
 him+0.021
 it+0.019
 gonna+0.019
 insisting+0.019
Neg logit contributions
est-0.032
aff-0.031
affe-0.031
umpy-0.029
ess-0.029
Token sw
Token ID1509
Feature activation-4.77e+00

Pos logit contributions
 gonna+0.028
 going+0.026
 insisting+0.025
 him+0.025
 your+0.025
Neg logit contributions
 sky-0.027
est-0.027
 girl-0.027
aff-0.025
hog-0.024
Token Tom
Token ID4186
Feature activation-2.12e+00

Pos logit contributions
 going+0.061
 gonna+0.060
 insisting+0.058
 paying+0.057
 him+0.056
Neg logit contributions
est-0.088
aff-0.086
affe-0.084
ess-0.077
 girl-0.076
Token to
Token ID284
Feature activation+5.23e+00

Pos logit contributions
est+0.045
aff+0.043
affe+0.042
bird+0.040
ari+0.040
Neg logit contributions
 gonna-0.030
 going-0.030
 him-0.029
 insisting-0.029
 preventing-0.028
Token show
Token ID905
Feature activation+3.56e+00

Pos logit contributions
 him+0.069
 going+0.068
 insisting+0.068
 it+0.067
 gonna+0.065
Neg logit contributions
est-0.110
aff-0.108
affe-0.104
bird-0.102
 sky-0.098
Token her
Token ID607
Feature activation-4.24e+00

Pos logit contributions
 going+0.027
 gonna+0.024
 insisting+0.024
 paying+0.023
 him+0.021
Neg logit contributions
est-0.100
aff-0.098
affe-0.095
bird-0.089
 girl-0.089
Token the
Token ID262
Feature activation-1.16e+01

Pos logit contributions
est+0.109
aff+0.107
affe+0.105
bird+0.097
ari+0.097
Neg logit contributions
 him-0.047
 going-0.045
 it-0.043
 gonna-0.040
 insisting-0.039
Token sw
Token ID1509
Feature activation-1.60e+01

Pos logit contributions
aff+0.260
est+0.259
affe+0.259
ari+0.236
umm+0.236
Neg logit contributions
 him-0.168
 going-0.165
 it-0.164
 gonna-0.144
 insisting-0.140
Tokenan
Token ID272
Feature activation-2.44e+00

Pos logit contributions
est+0.120
affe+0.119
aff+0.112
chair+0.105
bird+0.099
Neg logit contributions
 him-0.449
 going-0.430
 it-0.430
 gonna-0.419
 his-0.408
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
est+0.050
aff+0.050
affe+0.049
umpy+0.044
ari+0.044
Neg logit contributions
 him-0.040
 going-0.039
 it-0.038
 gonna-0.035
 insisting-0.034
Token
Token ID198
Feature activation-3.25e+00

Pos logit contributions
aff+0.023
est+0.022
affe+0.022
 sky+0.020
ari+0.020
Neg logit contributions
 him-0.016
 going-0.016
 it-0.015
 insisting-0.015
 paying-0.015
Token
Token ID198
Feature activation-3.11e+00

Pos logit contributions
est+0.064
aff+0.064
affe+0.062
ari+0.056
umpy+0.056
Neg logit contributions
 him-0.055
 going-0.053
 it-0.053
 gonna-0.049
 insisting-0.048
TokenTom
Token ID13787
Feature activation-1.22e+00

Pos logit contributions
aff+0.060
est+0.059
affe+0.059
umpy+0.053
ari+0.053
Neg logit contributions
 him-0.054
 going-0.052
 it-0.051
 insisting-0.048
 gonna-0.048
Token to
Token ID284
Feature activation+4.88e+00

Pos logit contributions
 him+0.045
 going+0.043
 gonna+0.039
 it+0.038
 insisting+0.036
Neg logit contributions
aff-0.171
est-0.171
affe-0.165
 sky-0.157
bird-0.155
Token learn
Token ID2193
Feature activation+4.15e+00

Pos logit contributions
 him+0.067
 going+0.064
 it+0.063
 gonna+0.059
 insisting+0.058
Neg logit contributions
est-0.112
aff-0.110
affe-0.108
ari-0.099
bird-0.099
Token more
Token ID517
Feature activation-2.22e+00

Pos logit contributions
 him+0.055
 it+0.053
 going+0.052
 gonna+0.045
 his+0.044
Neg logit contributions
est-0.099
aff-0.098
affe-0.098
umm-0.091
umpy-0.090
Token about
Token ID546
Feature activation+3.01e+00

Pos logit contributions
affe+0.047
aff+0.047
est+0.046
umm+0.045
ari+0.043
Neg logit contributions
 it-0.035
 going-0.035
 him-0.034
 what-0.031
 his-0.030
Token the
Token ID262
Feature activation-1.28e+01

Pos logit contributions
 him+0.028
 gonna+0.027
 going+0.026
 insisting+0.026
 paying+0.024
Neg logit contributions
aff-0.077
est-0.076
affe-0.073
 sky-0.071
bird-0.070
Token sw
Token ID1509
Feature activation-1.60e+01

Pos logit contributions
affe+0.297
aff+0.294
est+0.288
umm+0.278
chair+0.271
Neg logit contributions
 going-0.181
 it-0.179
 him-0.176
 gonna-0.144
 insisting-0.143
Tokenan
Token ID272
Feature activation-3.52e+00

Pos logit contributions
affe+0.140
est+0.138
chair+0.137
umm+0.126
aff+0.120
Neg logit contributions
 him-0.421
 going-0.408
 it-0.399
 gonna-0.391
 his-0.375
Token.
Token ID13
Feature activation+9.07e-01

Pos logit contributions
aff+0.076
est+0.074
affe+0.074
umm+0.071
umpy+0.069
Neg logit contributions
 him-0.055
 going-0.054
 it-0.053
 his-0.045
 what-0.045
Token They
Token ID1119
Feature activation+4.24e+00

Pos logit contributions
 it+0.014
 him+0.014
 going+0.013
 gonna+0.012
 his+0.012
Neg logit contributions
est-0.020
affe-0.019
aff-0.019
bird-0.018
 angel-0.018
Token spent
Token ID3377
Feature activation+1.48e+00

Pos logit contributions
 him+0.054
 going+0.052
 it+0.051
 gonna+0.045
 insisting+0.044
Neg logit contributions
est-0.103
aff-0.102
affe-0.100
umpy-0.093
ari-0.092
Token the
Token ID262
Feature activation-7.72e+00

Pos logit contributions
 him+0.015
 going+0.014
 gonna+0.013
 it+0.013
 insisting+0.012
Neg logit contributions
est-0.039
aff-0.039
affe-0.038
 sky-0.036
ari-0.035
Token to
Token ID284
Feature activation+4.23e+00

Pos logit contributions
 him+0.039
 gonna+0.035
 going+0.033
 it+0.032
 insisting+0.032
Neg logit contributions
est-0.103
aff-0.100
affe-0.098
ess-0.095
 sky-0.094
Token spend
Token ID4341
Feature activation+1.55e+00

Pos logit contributions
 him+0.062
 insisting+0.058
 going+0.058
 gonna+0.058
 it+0.057
Neg logit contributions
aff-0.085
est-0.084
affe-0.081
bird-0.078
 sky-0.075
Token time
Token ID640
Feature activation+2.36e+00

Pos logit contributions
 him+0.028
 going+0.028
 it+0.026
 gonna+0.026
 insisting+0.026
Neg logit contributions
est-0.028
aff-0.027
affe-0.027
 sky-0.024
umpy-0.024
Token with
Token ID351
Feature activation+2.83e+00

Pos logit contributions
 him+0.032
 going+0.031
 it+0.030
 gonna+0.029
 insisting+0.028
Neg logit contributions
est-0.053
aff-0.053
affe-0.051
ari-0.048
ess-0.047
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
 him+0.018
 going+0.017
 it+0.016
 gonna+0.014
 insisting+0.013
Neg logit contributions
est-0.085
aff-0.084
affe-0.083
bird-0.078
ari-0.078
Token bald
Token ID28273
Feature activation-1.59e+01

Pos logit contributions
aff+0.226
est+0.212
affe+0.212
ess+0.202
umm+0.198
Neg logit contributions
 going-0.225
 him-0.213
 it-0.213
 paying-0.190
 insisting-0.187
Token man
Token ID582
Feature activation-2.25e+00

Pos logit contributions
aff+0.222
umm+0.216
est+0.214
bird+0.199
affe+0.192
Neg logit contributions
 it-0.355
 him-0.344
 going-0.339
 paying-0.313
 his-0.311
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
est+0.046
aff+0.046
affe+0.045
umpy+0.041
ari+0.040
Neg logit contributions
 him-0.037
 going-0.036
 it-0.035
 gonna-0.032
 insisting-0.032
Token
Token ID198
Feature activation-3.37e+00

Pos logit contributions
est+0.003
aff+0.003
affe+0.003
umpy+0.003
andel+0.003
Neg logit contributions
 him-0.002
 it-0.002
 going-0.002
 his-0.002
 gonna-0.002
Token
Token ID198
Feature activation-3.27e+00

Pos logit contributions
est+0.070
aff+0.069
affe+0.067
ari+0.061
umpy+0.061
Neg logit contributions
 him-0.056
 going-0.053
 it-0.053
 gonna-0.048
 insisting-0.048
TokenThe
Token ID464
Feature activation-1.12e+01

Pos logit contributions
est+0.063
aff+0.063
affe+0.062
umpy+0.056
ari+0.055
Neg logit contributions
 him-0.057
 going-0.055
 it-0.055
 gonna-0.051
 insisting-0.050
Token Dad
Token ID17415
Feature activation+4.26e-03

Pos logit contributions
 going+0.004
 saving+0.003
 him+0.003
 able+0.003
 gonna+0.003
Neg logit contributions
est-0.004
aff-0.004
 girl-0.003
 sky-0.003
affe-0.003
Token then
Token ID788
Feature activation+3.16e+00

Pos logit contributions
 him+0.000
 going+0.000
 it+0.000
 gonna+0.000
 insisting+0.000
Neg logit contributions
est-0.000
aff-0.000
affe-0.000
bird-0.000
 sky-0.000
Token looked
Token ID3114
Feature activation+3.86e+00

Pos logit contributions
 him+0.043
 going+0.042
 gonna+0.040
 it+0.039
 insisting+0.037
Neg logit contributions
est-0.071
aff-0.069
affe-0.068
bird-0.064
 girl-0.064
Token at
Token ID379
Feature activation+3.73e+00

Pos logit contributions
 him+0.050
 going+0.047
 it+0.046
 paying+0.044
 gonna+0.043
Neg logit contributions
est-0.090
aff-0.088
affe-0.085
 girl-0.081
 sky-0.081
Token the
Token ID262
Feature activation-1.03e+01

Pos logit contributions
 him+0.028
 going+0.026
 it+0.026
 gonna+0.021
 insisting+0.020
Neg logit contributions
est-0.109
aff-0.108
affe-0.107
ari-0.100
umpy-0.100
Token bald
Token ID28273
Feature activation-1.59e+01

Pos logit contributions
affe+0.214
aff+0.205
est+0.187
ari+0.183
ess+0.178
Neg logit contributions
 it-0.193
 going-0.186
 him-0.179
 insisting-0.162
 what-0.160
Token man
Token ID582
Feature activation-1.94e+00

Pos logit contributions
affe+0.248
aff+0.248
ari+0.231
est+0.225
umm+0.220
Neg logit contributions
 it-0.346
 him-0.323
 going-0.314
 his-0.292
 paying-0.288
Token on
Token ID319
Feature activation+1.94e+00

Pos logit contributions
est+0.036
aff+0.035
affe+0.035
bird+0.032
ari+0.031
Neg logit contributions
 him-0.034
 going-0.033
 it-0.032
 gonna-0.032
 insisting-0.030
Token the
Token ID262
Feature activation-1.02e+01

Pos logit contributions
 him+0.015
 going+0.015
 it+0.013
 gonna+0.013
 insisting+0.012
Neg logit contributions
est-0.054
aff-0.054
affe-0.053
 sky-0.050
bird-0.050
Token TV
Token ID3195
Feature activation-2.48e+00

Pos logit contributions
affe+0.196
aff+0.195
est+0.186
ari+0.171
bird+0.170
Neg logit contributions
 going-0.191
 it-0.190
 him-0.186
 insisting-0.173
 his-0.165
Token.
Token ID13
Feature activation+8.15e-01

Pos logit contributions
aff+0.047
est+0.047
affe+0.046
umpy+0.042
andel+0.041
Neg logit contributions
 him-0.045
 it-0.044
 going-0.043
 paying-0.039
 insisting-0.038
Token says
Token ID1139
Feature activation+3.91e-01

Pos logit contributions
aff+0.050
est+0.049
affe+0.049
umm+0.047
ari+0.046
Neg logit contributions
 him-0.030
 going-0.029
 it-0.028
 his-0.025
 insisting-0.025
Token.
Token ID13
Feature activation-4.18e-01

Pos logit contributions
 him+0.007
 going+0.007
 it+0.007
 gonna+0.007
 insisting+0.006
Neg logit contributions
est-0.007
aff-0.007
affe-0.007
bird-0.006
umpy-0.006
Token
Token ID198
Feature activation-3.00e+00

Pos logit contributions
est+0.009
aff+0.009
affe+0.009
umm+0.008
ari+0.008
Neg logit contributions
 him-0.007
 it-0.007
 going-0.006
 his-0.006
 gonna-0.006
Token
Token ID198
Feature activation-3.00e+00

Pos logit contributions
est+0.059
aff+0.059
affe+0.058
umpy+0.052
ari+0.052
Neg logit contributions
 him-0.051
 going-0.050
 it-0.049
 gonna-0.046
 insisting-0.045
TokenThe
Token ID464
Feature activation-1.21e+01

Pos logit contributions
est+0.063
aff+0.062
affe+0.061
ari+0.055
umpy+0.055
Neg logit contributions
 him-0.049
 it-0.047
 going-0.047
 gonna-0.043
 insisting-0.042
Token bald
Token ID28273
Feature activation-1.59e+01

Pos logit contributions
est+0.182
aff+0.180
affe+0.175
ess+0.153
ari+0.152
Neg logit contributions
 him-0.269
 going-0.264
 it-0.261
 gonna-0.244
 insisting-0.243
Token man
Token ID582
Feature activation-1.58e+00

Pos logit contributions
aff+0.225
est+0.223
umm+0.220
andel+0.205
affe+0.197
Neg logit contributions
 it-0.352
 him-0.352
 going-0.333
 his-0.324
 paying-0.310
Token looks
Token ID3073
Feature activation+2.80e+00

Pos logit contributions
est+0.034
aff+0.034
affe+0.033
umm+0.031
umpy+0.031
Neg logit contributions
 him-0.024
 it-0.023
 going-0.023
 gonna-0.020
 insisting-0.020
Token at
Token ID379
Feature activation+2.59e+00

Pos logit contributions
 him+0.031
 going+0.029
 it+0.029
 gonna+0.025
 insisting+0.024
Neg logit contributions
aff-0.073
est-0.073
affe-0.072
umm-0.067
ari-0.067
Token Tom
Token ID4186
Feature activation-3.10e+00

Pos logit contributions
 going+0.027
 gonna+0.024
 him+0.024
 insisting+0.024
 paying+0.024
Neg logit contributions
est-0.067
aff-0.066
affe-0.063
 sky-0.060
bird-0.059
Token and
Token ID290
Feature activation+2.37e+00

Pos logit contributions
est+0.060
aff+0.060
affe+0.059
umm+0.053
ari+0.053
Neg logit contributions
 him-0.055
 going-0.053
 it-0.052
 gonna-0.048
 insisting-0.047
Token a
Token ID257
Feature activation-8.27e+00

Pos logit contributions
est+0.078
 girl+0.078
bird+0.071
 sky+0.071
affe+0.069
Neg logit contributions
 him-0.038
 paying-0.037
 gonna-0.037
 insisting-0.036
 going-0.036
Token big
Token ID1263
Feature activation-9.37e+00

Pos logit contributions
est+0.141
aff+0.129
umm+0.125
bows+0.114
affe+0.114
Neg logit contributions
 it-0.181
 going-0.172
 him-0.167
 his-0.163
 their-0.158
Token bird
Token ID6512
Feature activation-4.68e+00

Pos logit contributions
aff+0.143
est+0.136
ess+0.128
umpy+0.126
umm+0.124
Neg logit contributions
 it-0.216
 going-0.209
 him-0.208
 his-0.198
 your-0.193
Token named
Token ID3706
Feature activation-1.34e+00

Pos logit contributions
est+0.080
aff+0.079
affe+0.077
ari+0.068
umpy+0.068
Neg logit contributions
 him-0.093
 going-0.090
 it-0.089
 gonna-0.084
 insisting-0.082
Token O
Token ID440
Feature activation-1.07e+01

Pos logit contributions
est+0.034
aff+0.033
affe+0.033
umpy+0.031
ari+0.031
Neg logit contributions
 him-0.016
 it-0.016
 going-0.015
 gonna-0.013
 paying-0.013
Tokenstr
Token ID2536
Feature activation-1.59e+01

Pos logit contributions
aff+0.215
affe+0.212
est+0.210
ari+0.202
umpy+0.194
Neg logit contributions
 him-0.174
 it-0.170
 going-0.166
 insisting-0.160
 gonna-0.158
Tokenich
Token ID488
Feature activation-3.40e+00

Pos logit contributions
est+0.281
affe+0.263
aff+0.259
umpy+0.234
umm+0.234
Neg logit contributions
 him-0.310
 going-0.303
 it-0.297
 paying-0.275
 gonna-0.270
Token wanted
Token ID2227
Feature activation+3.20e+00

Pos logit contributions
est+0.070
aff+0.070
umm+0.070
affe+0.066
andel+0.065
Neg logit contributions
 it-0.058
 him-0.058
 going-0.053
 his-0.051
 what-0.047
Token to
Token ID284
Feature activation+3.46e+00

Pos logit contributions
 gonna+0.033
 insisting+0.032
 him+0.031
ged+0.030
 going+0.030
Neg logit contributions
est-0.076
 sky-0.074
aff-0.074
affe-0.072
 girl-0.069
Token swim
Token ID9422
Feature activation-2.23e-01

Pos logit contributions
 him+0.051
 going+0.049
 it+0.048
 gonna+0.045
 insisting+0.045
Neg logit contributions
est-0.075
aff-0.074
affe-0.073
bird-0.067
 sky-0.066
Token in
Token ID287
Feature activation+8.88e-01

Pos logit contributions
est+0.005
aff+0.005
affe+0.005
umpy+0.005
ari+0.005
Neg logit contributions
 him-0.003
 going-0.003
 it-0.003
 gonna-0.002
 insisting-0.002
Token
Token ID198
Feature activation-3.17e+00

Pos logit contributions
aff+0.057
affe+0.057
est+0.057
ess+0.052
umpy+0.051
Neg logit contributions
 going-0.060
 him-0.060
 it-0.057
 gonna-0.056
 paying-0.054
TokenBut
Token ID1537
Feature activation+2.04e+00

Pos logit contributions
est+0.063
affe+0.063
aff+0.062
umpy+0.057
ess+0.057
Neg logit contributions
 him-0.053
 going-0.050
 it-0.049
 gonna-0.047
 paying-0.045
Token then
Token ID788
Feature activation+1.67e+00

Pos logit contributions
 him+0.025
 going+0.022
 gonna+0.022
 insisting+0.021
 paying+0.021
Neg logit contributions
est-0.047
affe-0.046
 sky-0.044
bird-0.044
aff-0.043
Token,
Token ID11
Feature activation+1.86e+00

Pos logit contributions
 him+0.030
 going+0.029
 gonna+0.029
 insisting+0.028
 paying+0.028
Neg logit contributions
est-0.029
affe-0.027
 girl-0.026
bird-0.025
aff-0.025
Token Tammy
Token ID50070
Feature activation-1.95e+00

Pos logit contributions
 him+0.019
 going+0.017
 gonna+0.017
 insisting+0.016
 paying+0.016
Neg logit contributions
est-0.047
affe-0.045
 girl-0.043
aff-0.043
 sky-0.043
Token remembered
Token ID12086
Feature activation+5.18e+00

Pos logit contributions
est+0.044
aff+0.043
affe+0.043
ari+0.039
bird+0.039
Neg logit contributions
 him-0.028
 going-0.027
 it-0.026
 gonna-0.024
 insisting-0.024
Token how
Token ID703
Feature activation+3.54e+00

Pos logit contributions
 gonna+0.049
 him+0.044
plug+0.043
 insisting+0.043
ged+0.039
Neg logit contributions
est-0.118
 sky-0.117
aff-0.115
 girl-0.114
bird-0.114
Token exciting
Token ID7895
Feature activation+2.55e-01

Pos logit contributions
 him+0.051
 gonna+0.047
 insisting+0.046
 paying+0.046
 going+0.046
Neg logit contributions
est-0.069
umpy-0.069
affe-0.066
 girl-0.065
 sky-0.064
Token counting
Token ID14143
Feature activation+2.42e+00

Pos logit contributions
 him+0.002
 going+0.002
 insisting+0.002
 paying+0.002
 gonna+0.001
Neg logit contributions
est-0.007
aff-0.007
affe-0.007
 sky-0.007
 girl-0.007
Token can
Token ID460
Feature activation+4.37e+00

Pos logit contributions
 him+0.032
 going+0.031
 it+0.029
 gonna+0.029
 insisting+0.028
Neg logit contributions
est-0.056
aff-0.054
affe-0.054
 sky-0.050
bird-0.049
Token be
Token ID307
Feature activation+1.40e+00

Pos logit contributions
 him+0.065
 going+0.064
 it+0.062
 gonna+0.062
 insisting+0.061
Neg logit contributions
est-0.094
affe-0.088
aff-0.088
bird-0.083
 sky-0.081
Token in
Token ID287
Feature activation+8.62e-01

Pos logit contributions
 him+0.032
 it+0.029
 going+0.027
 insisting+0.027
 gonna+0.026
Neg logit contributions
est-0.048
aff-0.045
 sky-0.043
affe-0.043
bird-0.043
Token a
Token ID257
Feature activation-5.17e+00

Pos logit contributions
 him+0.007
 gonna+0.007
 going+0.007
 it+0.007
 able+0.006
Neg logit contributions
est-0.023
 sky-0.021
 girl-0.021
affe-0.021
aff-0.021
Token hurry
Token ID23290
Feature activation+2.49e-01

Pos logit contributions
est+0.090
aff+0.089
affe+0.087
ari+0.077
umpy+0.077
Neg logit contributions
 him-0.101
 going-0.098
 it-0.098
 gonna-0.091
 insisting-0.090
Token to
Token ID284
Feature activation+3.92e+00

Pos logit contributions
 him+0.004
 it+0.004
 going+0.004
 gonna+0.003
 insisting+0.003
Neg logit contributions
est-0.005
aff-0.005
 sky-0.005
affe-0.005
ess-0.005
Token get
Token ID651
Feature activation+3.08e+00

Pos logit contributions
 him+0.047
 insisting+0.047
ged+0.045
 gonna+0.045
 going+0.044
Neg logit contributions
est-0.085
aff-0.080
 sky-0.078
 girl-0.076
affe-0.075
Token to
Token ID284
Feature activation+4.37e+00

Pos logit contributions
 figuring+0.030
 gonna+0.030
 preventing+0.028
 managing+0.028
 paying+0.028
Neg logit contributions
est-0.070
 girl-0.067
 sky-0.066
aff-0.066
bird-0.061
Token his
Token ID465
Feature activation-6.02e+00

Pos logit contributions
ged+0.043
 insisting+0.043
 him+0.042
 gonna+0.040
 managed+0.040
Neg logit contributions
est-0.101
aff-0.097
 sky-0.095
 girl-0.093
affe-0.091
Token friend
Token ID1545
Feature activation-1.20e+00

Pos logit contributions
aff+0.106
affe+0.104
est+0.098
umpy+0.089
pillar+0.086
Neg logit contributions
 going-0.126
 him-0.125
 it-0.121
 his-0.113
 insisting-0.112
Token's
Token ID338
Feature activation-5.48e+00

Pos logit contributions
aff+0.026
est+0.026
affe+0.026
umm+0.024
umpy+0.024
Neg logit contributions
 him-0.019
 going-0.018
 it-0.018
 gonna-0.016
 his-0.015
Token house
Token ID2156
Feature activation+1.84e+00

Pos logit contributions
aff+0.091
affe+0.089
est+0.080
umpy+0.074
pillar+0.072
Neg logit contributions
 going-0.126
 him-0.125
 it-0.118
 his-0.113
 insisting-0.110
Token to
Token ID284
Feature activation+3.18e+00

Pos logit contributions
 him+0.033
 it+0.031
 going+0.031
 gonna+0.030
 insisting+0.029
Neg logit contributions
est-0.035
aff-0.034
affe-0.033
ari-0.030
ess-0.030
Token from
Token ID422
Feature activation+3.01e+00

Pos logit contributions
 him+0.015
 it+0.015
 going+0.015
 gonna+0.013
 insisting+0.013
Neg logit contributions
aff-0.016
est-0.016
affe-0.016
ari-0.014
umpy-0.014
Token the
Token ID262
Feature activation-9.89e+00

Pos logit contributions
 him+0.016
 going+0.015
 it+0.013
 gonna+0.012
 paying+0.011
Neg logit contributions
est-0.093
aff-0.093
affe-0.091
 sky-0.087
bird-0.086
Token new
Token ID649
Feature activation-1.21e+01

Pos logit contributions
affe+0.202
aff+0.199
ari+0.184
ess+0.183
est+0.183
Neg logit contributions
 it-0.187
 going-0.169
 him-0.166
 trying-0.156
 what-0.150
Token bottle
Token ID9294
Feature activation-1.20e+00

Pos logit contributions
affe+0.229
aff+0.223
est+0.206
umpy+0.205
umm+0.203
Neg logit contributions
 it-0.242
 him-0.216
 going-0.213
 your-0.197
 what-0.196
Token and
Token ID290
Feature activation+3.37e+00

Pos logit contributions
aff+0.026
affe+0.025
est+0.025
ari+0.023
umpy+0.023
Neg logit contributions
 him-0.019
 it-0.019
 going-0.018
 his-0.016
 paying-0.015
Token realized
Token ID6939
Feature activation+4.33e+00

Pos logit contributions
 him+0.044
 gonna+0.042
 going+0.041
 insisting+0.039
 paying+0.038
Neg logit contributions
est-0.075
 girl-0.072
aff-0.070
 sky-0.069
affe-0.068
Token that
Token ID326
Feature activation+3.02e+00

Pos logit contributions
 gonna+0.037
 him+0.037
 insisting+0.034
 going+0.034
 paying+0.033
Neg logit contributions
est-0.107
aff-0.106
affe-0.102
 angel-0.101
 girl-0.100
Token it
Token ID340
Feature activation+3.37e+00

Pos logit contributions
 him+0.021
 gonna+0.020
 going+0.017
 insisting+0.015
 paying+0.015
Neg logit contributions
est-0.086
 sky-0.081
aff-0.081
 girl-0.080
bird-0.079
Token tasted
Token ID29187
Feature activation+3.68e+00

Pos logit contributions
 him+0.055
 going+0.052
 it+0.050
 gonna+0.049
 insisting+0.048
Neg logit contributions
est-0.070
aff-0.067
affe-0.066
bird-0.061
umpy-0.061
Token just
Token ID655
Feature activation+2.18e+00

Pos logit contributions
 him+0.063
 insisting+0.061
 going+0.060
 paying+0.059
 gonna+0.059
Neg logit contributions
est-0.070
aff-0.061
umpy-0.057
 sky-0.057
affe-0.056
Token as
Token ID355
Feature activation+2.64e+00

Pos logit contributions
 him+0.029
 gonna+0.027
 paying+0.026
 going+0.026
 insisting+0.026
Neg logit contributions
est-0.050
aff-0.045
 sky-0.043
affe-0.042
 girl-0.042
Token
Token ID198
Feature activation-2.44e+00

Pos logit contributions
est+0.022
aff+0.022
affe+0.022
umm+0.020
ari+0.020
Neg logit contributions
 him-0.017
 it-0.016
 going-0.016
 gonna-0.015
 insisting-0.015
Token
Token ID198
Feature activation-2.61e+00

Pos logit contributions
est+0.053
aff+0.051
affe+0.051
umm+0.047
ari+0.046
Neg logit contributions
 him-0.039
 it-0.038
 going-0.037
 gonna-0.033
 insisting-0.033
TokenWhen
Token ID2215
Feature activation+3.88e+00

Pos logit contributions
est+0.055
aff+0.055
affe+0.054
umpy+0.049
ari+0.049
Neg logit contributions
 him-0.041
 going-0.040
 it-0.039
 gonna-0.037
 insisting-0.036
Token they
Token ID484
Feature activation+8.75e-01

Pos logit contributions
 him+0.040
 gonna+0.039
 insisting+0.039
cks+0.039
render+0.039
Neg logit contributions
 girl-0.091
 sky-0.090
est-0.084
 sun-0.084
 ceiling-0.081
Token got
Token ID1392
Feature activation+1.62e+00

Pos logit contributions
 him+0.013
 going+0.013
 gonna+0.012
 insisting+0.012
 preventing+0.012
Neg logit contributions
aff-0.018
est-0.018
affe-0.017
ari-0.016
 sky-0.016
Token to
Token ID284
Feature activation+3.67e+00

Pos logit contributions
 him+0.024
 figuring+0.023
 gonna+0.023
 paying+0.022
 preventing+0.022
Neg logit contributions
 sky-0.031
aff-0.030
est-0.030
 girl-0.030
ari-0.028
Token the
Token ID262
Feature activation-8.79e+00

Pos logit contributions
 him+0.032
 gonna+0.030
 going+0.029
 insisting+0.028
 it+0.028
Neg logit contributions
aff-0.094
est-0.093
 girl-0.089
 sky-0.088
affe-0.088
Token beach
Token ID10481
Feature activation+5.37e-01

Pos logit contributions
est+0.142
aff+0.142
affe+0.142
umpy+0.123
ess+0.122
Neg logit contributions
 him-0.186
 going-0.182
 it-0.181
 gonna-0.166
 his-0.164
Token,
Token ID11
Feature activation+2.79e+00

Pos logit contributions
 him+0.010
 going+0.009
 gonna+0.009
 insisting+0.009
 it+0.009
Neg logit contributions
est-0.010
aff-0.009
affe-0.009
 sky-0.009
 girl-0.008
Token Lily
Token ID20037
Feature activation-5.87e-01

Pos logit contributions
 him+0.045
 going+0.044
 gonna+0.042
 insisting+0.041
 paying+0.041
Neg logit contributions
est-0.055
aff-0.053
affe-0.051
 girl-0.050
 sky-0.049
Token ran
Token ID4966
Feature activation+2.12e+00

Pos logit contributions
est+0.012
aff+0.011
affe+0.011
 sky+0.011
ari+0.011
Neg logit contributions
 him-0.010
 going-0.009
 it-0.009
 gonna-0.009
 insisting-0.008
TokenThe
Token ID464
Feature activation-1.09e+01

Pos logit contributions
est+0.036
aff+0.036
affe+0.036
umpy+0.033
ari+0.032
Neg logit contributions
 him-0.029
 going-0.028
 it-0.028
 gonna-0.026
 insisting-0.025
Token pilot
Token ID8022
Feature activation-5.40e-01

Pos logit contributions
aff+0.202
ess+0.175
est+0.167
ari+0.162
affe+0.160
Neg logit contributions
 it-0.239
 going-0.236
 him-0.228
 his-0.206
 trying-0.206
Token in
Token ID287
Feature activation+1.62e+00

Pos logit contributions
aff+0.013
est+0.013
affe+0.012
umm+0.011
umpy+0.011
Neg logit contributions
 him-0.007
 it-0.007
 going-0.007
 gonna-0.006
 insisting-0.006
Token the
Token ID262
Feature activation-8.19e+00

Pos logit contributions
 him+0.011
 going+0.010
 it+0.010
 gonna+0.008
 insisting+0.007
Neg logit contributions
est-0.049
aff-0.049
affe-0.048
ari-0.045
umpy-0.045
Token helicopter
Token ID15392
Feature activation+1.65e-01

Pos logit contributions
aff+0.131
affe+0.110
ess+0.108
umpy+0.100
est+0.099
Neg logit contributions
 him-0.198
 it-0.193
 his-0.190
 going-0.186
 your-0.174
Token saw
Token ID2497
Feature activation+5.62e+00

Pos logit contributions
 him+0.002
 going+0.002
 it+0.002
 gonna+0.002
 insisting+0.002
Neg logit contributions
est-0.004
aff-0.004
affe-0.004
umpy-0.003
umm-0.003
Token the
Token ID262
Feature activation-9.15e+00

Pos logit contributions
 going+0.033
 him+0.029
 gonna+0.029
 insisting+0.026
 it+0.026
Neg logit contributions
est-0.170
aff-0.167
affe-0.165
bird-0.156
 sky-0.155
Token little
Token ID1310
Feature activation-8.29e+00

Pos logit contributions
aff+0.169
ari+0.148
affe+0.147
ess+0.145
andel+0.143
Neg logit contributions
 going-0.193
 it-0.186
 him-0.182
 trying-0.167
 insisting-0.159
Token boy
Token ID2933
Feature activation-1.03e+00

Pos logit contributions
aff+0.107
affe+0.092
ari+0.089
umpy+0.087
ess+0.085
Neg logit contributions
 it-0.221
 him-0.211
 going-0.199
 his-0.197
 trying-0.185
Token and
Token ID290
Feature activation+3.42e+00

Pos logit contributions
est+0.020
aff+0.020
affe+0.019
bird+0.018
ari+0.017
Neg logit contributions
 him-0.017
 going-0.017
 it-0.017
 gonna-0.016
 insisting-0.015
Token asked
Token ID1965
Feature activation+5.37e+00

Pos logit contributions
 him+0.043
 going+0.041
 gonna+0.040
 insisting+0.037
 it+0.036
Neg logit contributions
est-0.080
affe-0.078
aff-0.078
 girl-0.073
bird-0.073
Token Jill
Token ID22772
Feature activation-5.31e+00

Pos logit contributions
 him+0.002
 going+0.002
 it+0.002
 gonna+0.002
 insisting+0.002
Neg logit contributions
est-0.004
aff-0.004
affe-0.004
 girl-0.003
bird-0.003
Token.
Token ID13
Feature activation-2.74e+00

Pos logit contributions
umm+0.124
est+0.122
aff+0.121
affe+0.121
andel+0.112
Neg logit contributions
 him-0.081
 going-0.076
 it-0.074
 his-0.067
 their-0.067
Token They
Token ID1119
Feature activation+7.65e-01

Pos logit contributions
est+0.049
aff+0.049
affe+0.047
 sky+0.042
ari+0.042
Neg logit contributions
 him-0.051
 going-0.050
 it-0.050
 gonna-0.047
 insisting-0.046
Token liked
Token ID8288
Feature activation+2.24e+00

Pos logit contributions
 him+0.012
 going+0.011
 it+0.011
 gonna+0.010
 his+0.009
Neg logit contributions
est-0.017
aff-0.017
affe-0.017
umm-0.016
andel-0.015
Token to
Token ID284
Feature activation+2.18e+00

Pos logit contributions
pped+0.027
 insisting+0.025
 him+0.024
shadow+0.024
ged+0.024
Neg logit contributions
aff-0.049
 sky-0.047
est-0.045
affe-0.045
 cat-0.045
Token do
Token ID466
Feature activation+3.40e+00

Pos logit contributions
 him+0.028
 going+0.026
 it+0.026
 his+0.022
 gonna+0.022
Neg logit contributions
aff-0.054
est-0.054
affe-0.053
umm-0.050
umpy-0.050
Token many
Token ID867
Feature activation-4.21e+00

Pos logit contributions
 insisting+0.062
 him+0.061
 gonna+0.057
 paying+0.057
pped+0.056
Neg logit contributions
est-0.058
aff-0.052
affe-0.049
 girl-0.048
 sky-0.048
Token things
Token ID1243
Feature activation+6.95e-01

Pos logit contributions
aff+0.083
affe+0.083
est+0.081
umm+0.076
ari+0.076
Neg logit contributions
 him-0.074
 going-0.073
 it-0.073
 his-0.065
 paying-0.063
Token together
Token ID1978
Feature activation+2.63e+00

Pos logit contributions
 insisting+0.009
 gonna+0.009
 him+0.009
 it+0.009
 going+0.009
Neg logit contributions
est-0.015
aff-0.014
affe-0.013
bows-0.013
ess-0.013
Token.
Token ID13
Feature activation-9.27e-01

Pos logit contributions
 him+0.051
 going+0.050
 gonna+0.049
 it+0.049
 insisting+0.048
Neg logit contributions
est-0.043
aff-0.042
affe-0.041
bird-0.037
 sky-0.037
Token
Token ID198
Feature activation-4.14e+00

Pos logit contributions
est+0.019
affe+0.019
aff+0.018
umpy+0.017
andel+0.017
Neg logit contributions
 him-0.016
 it-0.016
 going-0.015
 gonna-0.014
 his-0.014
Token"
Token ID1
Feature activation+2.85e+00

Pos logit contributions
est+0.057
aff+0.057
affe+0.056
umm+0.052
ari+0.050
Neg logit contributions
 it-0.051
 him-0.049
 going-0.049
 gonna-0.044
 your-0.044
TokenMom
Token ID29252
Feature activation-1.14e+00

Pos logit contributions
 him+0.047
 it+0.045
 going+0.045
 insisting+0.039
 paying+0.039
Neg logit contributions
affe-0.060
aff-0.060
est-0.059
umpy-0.055
ari-0.055
Tokenmy
Token ID1820
Feature activation+1.88e-01

Pos logit contributions
est+0.019
aff+0.019
affe+0.018
ari+0.016
umpy+0.016
Neg logit contributions
 him-0.023
 going-0.022
 it-0.022
 gonna-0.021
 insisting-0.020
Token,
Token ID11
Feature activation+2.52e+00

Pos logit contributions
 him+0.003
 going+0.003
 it+0.003
 gonna+0.003
 insisting+0.003
Neg logit contributions
est-0.004
aff-0.004
affe-0.004
bird-0.003
ari-0.003
Token can
Token ID460
Feature activation+5.24e+00

Pos logit contributions
 him+0.038
 insisting+0.036
 going+0.035
 paying+0.035
 gonna+0.034
Neg logit contributions
est-0.052
 girl-0.048
affe-0.048
aff-0.046
 sky-0.045
Token I
Token ID314
Feature activation+3.15e+00

Pos logit contributions
 going+0.063
 him+0.062
 gonna+0.058
 insisting+0.058
 paying+0.056
Neg logit contributions
est-0.119
affe-0.117
aff-0.116
bird-0.111
 girl-0.109
Token mix
Token ID5022
Feature activation+1.53e+00

Pos logit contributions
 him+0.045
 going+0.043
 it+0.041
 insisting+0.041
 paying+0.039
Neg logit contributions
est-0.068
aff-0.066
affe-0.065
bird-0.063
 sky-0.060
Token too
Token ID1165
Feature activation+3.80e-01

Pos logit contributions
 going+0.016
 insisting+0.014
 him+0.014
 gonna+0.014
 paying+0.014
Neg logit contributions
est-0.040
aff-0.038
affe-0.036
 sky-0.035
 girl-0.034
Token?"
Token ID1701
Feature activation+1.32e-01

Pos logit contributions
 him+0.006
 going+0.005
 it+0.005
 gonna+0.005
 insisting+0.005
Neg logit contributions
est-0.008
aff-0.008
affe-0.008
umpy-0.007
 sky-0.007
Token Lily
Token ID20037
Feature activation-2.49e+00

Pos logit contributions
 him+0.002
 going+0.002
 it+0.002
 gonna+0.002
 insisting+0.002
Neg logit contributions
est-0.003
aff-0.003
affe-0.003
umpy-0.002
ari-0.002
Token asked
Token ID1965
Feature activation+3.01e+00

Pos logit contributions
est+0.062
aff+0.061
affe+0.060
umpy+0.056
ari+0.056
Neg logit contributions
 him-0.030
 going-0.028
 it-0.028
 gonna-0.025
 insisting-0.024
Token?"
Token ID1701
Feature activation+9.53e-01

Pos logit contributions
est+0.030
aff+0.030
affe+0.029
umm+0.027
ari+0.027
Neg logit contributions
 him-0.024
 going-0.023
 it-0.023
 gonna-0.021
 insisting-0.021
Token Tim
Token ID5045
Feature activation-2.92e+00

Pos logit contributions
 him+0.015
 it+0.014
 going+0.014
 his+0.012
 gonna+0.012
Neg logit contributions
est-0.021
aff-0.021
affe-0.021
umm-0.019
ari-0.019
Tokenmy
Token ID1820
Feature activation-4.11e-01

Pos logit contributions
aff+0.062
est+0.061
affe+0.060
umm+0.057
andel+0.056
Neg logit contributions
 him-0.047
 going-0.045
 it-0.045
 gonna-0.040
 insisting-0.040
Token asked
Token ID1965
Feature activation+3.30e+00

Pos logit contributions
est+0.009
aff+0.009
affe+0.009
bird+0.008
umpy+0.008
Neg logit contributions
 him-0.006
 going-0.006
 it-0.006
 gonna-0.005
 insisting-0.005
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
 going+0.060
 him+0.059
 gonna+0.057
 it+0.057
 paying+0.054
Neg logit contributions
est-0.061
aff-0.058
affe-0.057
bird-0.052
ess-0.051
Token "
Token ID366
Feature activation+2.47e+00

Pos logit contributions
est+0.025
aff+0.024
affe+0.024
umm+0.022
umpy+0.022
Neg logit contributions
 him-0.017
 it-0.016
 going-0.015
 his-0.014
 gonna-0.014
TokenI
Token ID40
Feature activation+2.41e+00

Pos logit contributions
 him+0.038
 it+0.036
 going+0.036
 gonna+0.033
 insisting+0.032
Neg logit contributions
est-0.054
aff-0.054
affe-0.052
ari-0.048
umm-0.048
Token can
Token ID460
Feature activation+5.40e+00

Pos logit contributions
 him+0.037
 going+0.036
 it+0.033
 insisting+0.033
 gonna+0.032
Neg logit contributions
est-0.051
affe-0.049
aff-0.048
umpy-0.047
ess-0.045
Token't
Token ID470
Feature activation+3.12e+00

Pos logit contributions
 him+0.071
 going+0.071
 insisting+0.067
 paying+0.066
 it+0.066
Neg logit contributions
affe-0.114
est-0.113
aff-0.112
bird-0.105
umpy-0.101
Token,
Token ID11
Feature activation+4.06e+00

Pos logit contributions
 him+0.042
 going+0.041
 it+0.040
 insisting+0.038
 gonna+0.038
Neg logit contributions
est-0.070
affe-0.069
aff-0.068
umpy-0.063
bird-0.062
Token I
Token ID314
Feature activation+3.81e+00

Pos logit contributions
 it+0.062
 him+0.061
 going+0.057
 his+0.052
 gonna+0.052
Neg logit contributions
aff-0.094
est-0.092
affe-0.088
ari-0.085
umm-0.085
Token sn
Token ID3013
Feature activation-3.00e+00

Pos logit contributions
est+0.050
affe+0.048
aff+0.048
 sky+0.045
umpy+0.045
Neg logit contributions
 him-0.036
 going-0.034
 it-0.033
 gonna-0.032
 paying-0.032
Tokenuck
Token ID1347
Feature activation+1.76e+00

Pos logit contributions
est+0.057
aff+0.057
affe+0.055
ari+0.054
ase+0.053
Neg logit contributions
 going-0.052
 him-0.051
 it-0.049
 paying-0.047
 gonna-0.046
Token out
Token ID503
Feature activation+2.11e+00

Pos logit contributions
 him+0.022
 going+0.021
 gonna+0.021
 it+0.020
 paying+0.020
Neg logit contributions
est-0.043
aff-0.038
 sky-0.038
affe-0.038
 girl-0.037
Token of
Token ID286
Feature activation+9.09e-01

Pos logit contributions
 him+0.039
 gonna+0.037
 going+0.037
 it+0.037
 paying+0.035
Neg logit contributions
est-0.037
aff-0.035
affe-0.034
bird-0.033
 sky-0.032
Token her
Token ID607
Feature activation-8.30e+00

Pos logit contributions
 him+0.014
 gonna+0.014
 going+0.013
 it+0.013
 able+0.013
Neg logit contributions
est-0.017
 sky-0.016
aff-0.016
 girl-0.016
andel-0.015
Token house
Token ID2156
Feature activation+1.15e+00

Pos logit contributions
aff+0.142
affe+0.139
est+0.138
umpy+0.122
ess+0.119
Neg logit contributions
 him-0.171
 going-0.167
 it-0.164
 his-0.153
 gonna-0.151
Token and
Token ID290
Feature activation+2.66e+00

Pos logit contributions
 him+0.019
 going+0.018
 it+0.018
 gonna+0.017
 insisting+0.016
Neg logit contributions
est-0.023
aff-0.023
affe-0.022
ari-0.020
 sky-0.020
Token walked
Token ID6807
Feature activation+2.16e+00

Pos logit contributions
 him+0.035
 gonna+0.035
 going+0.035
 insisting+0.035
 paying+0.034
Neg logit contributions
 girl-0.051
 sky-0.049
est-0.049
ess-0.045
chair-0.045
Token to
Token ID284
Feature activation+3.43e+00

Pos logit contributions
 going+0.025
 him+0.024
 gonna+0.024
 paying+0.023
 it+0.023
Neg logit contributions
est-0.052
aff-0.051
 sky-0.048
affe-0.048
 girl-0.048
Token the
Token ID262
Feature activation-9.40e+00

Pos logit contributions
 gonna+0.033
 going+0.033
 insisting+0.033
 paying+0.031
 able+0.031
Neg logit contributions
est-0.081
 girl-0.080
bird-0.079
aff-0.078
 sky-0.078
Token ocean
Token ID9151
Feature activation-1.29e+00

Pos logit contributions
est+0.155
affe+0.152
aff+0.152
umpy+0.129
ess+0.128
Neg logit contributions
 him-0.200
 going-0.195
 it-0.192
 his-0.177
 gonna-0.177
Token box
Token ID3091
Feature activation+5.47e-01

Pos logit contributions
est+0.032
aff+0.032
affe+0.031
umpy+0.028
ari+0.028
Neg logit contributions
 him-0.024
 going-0.023
 it-0.023
 gonna-0.021
 insisting-0.021
Token.
Token ID13
Feature activation+3.47e-01

Pos logit contributions
 him+0.009
 going+0.009
 it+0.009
 gonna+0.008
 insisting+0.008
Neg logit contributions
est-0.011
aff-0.011
affe-0.011
umpy-0.010
ari-0.010
Token Lily
Token ID20037
Feature activation-1.08e+00

Pos logit contributions
 him+0.006
 it+0.005
 going+0.005
 gonna+0.005
 insisting+0.005
Neg logit contributions
est-0.007
aff-0.007
affe-0.007
ari-0.006
umpy-0.006
Token laughed
Token ID13818
Feature activation+1.81e+00

Pos logit contributions
est+0.021
aff+0.021
affe+0.021
 girl+0.019
ess+0.019
Neg logit contributions
 him-0.018
 going-0.017
 gonna-0.016
 it-0.016
 insisting-0.016
Token and
Token ID290
Feature activation+2.83e+00

Pos logit contributions
 gonna+0.035
 insisting+0.035
 paying+0.035
 going+0.034
 him+0.034
Neg logit contributions
est-0.029
aff-0.027
affe-0.026
 sky-0.024
 girl-0.024
Token said
Token ID531
Feature activation+3.40e+00

Pos logit contributions
 him+0.048
 gonna+0.043
 insisting+0.042
 going+0.042
 saving+0.041
Neg logit contributions
aff-0.050
est-0.049
 girl-0.049
affe-0.047
 sky-0.046
Token,
Token ID11
Feature activation+2.96e+00

Pos logit contributions
 him+0.063
 going+0.062
 insisting+0.061
 gonna+0.060
 paying+0.058
Neg logit contributions
est-0.059
aff-0.059
affe-0.057
bird-0.050
 sky-0.049
Token "
Token ID366
Feature activation+2.52e+00

Pos logit contributions
 him+0.053
 going+0.050
 gonna+0.049
 insisting+0.048
 paying+0.047
Neg logit contributions
aff-0.052
affe-0.050
est-0.049
 sky-0.047
bird-0.045
TokenThis
Token ID1212
Feature activation-5.07e+00

Pos logit contributions
 him+0.036
 it+0.035
 going+0.032
 gonna+0.028
 insisting+0.028
Neg logit contributions
aff-0.058
affe-0.056
ari-0.056
est-0.055
anda-0.053
Token juice
Token ID13135
Feature activation-6.03e-01

Pos logit contributions
aff+0.111
est+0.109
affe+0.109
umpy+0.100
ari+0.099
Neg logit contributions
 him-0.076
 it-0.075
 going-0.074
 gonna-0.066
 insisting-0.064
Token box
Token ID3091
Feature activation+8.62e-01

Pos logit contributions
est+0.012
aff+0.012
affe+0.012
ari+0.011
umpy+0.011
Neg logit contributions
 him-0.010
 going-0.009
 it-0.009
 gonna-0.009
 insisting-0.009
Token kid
Token ID5141
Feature activation-2.86e+00

Pos logit contributions
aff+0.118
umm+0.110
affe+0.106
bows+0.106
umbs+0.105
Neg logit contributions
 it-0.210
 him-0.199
 his-0.192
 going-0.182
 what-0.179
Token and
Token ID290
Feature activation+6.05e-02

Pos logit contributions
est+0.066
aff+0.065
affe+0.064
ari+0.059
umpy+0.059
Neg logit contributions
 him-0.039
 going-0.038
 it-0.037
 gonna-0.034
 insisting-0.033
Token his
Token ID465
Feature activation-8.23e+00

Pos logit contributions
 him+0.000
 going+0.000
 it+0.000
 gonna+0.000
 insisting+0.000
Neg logit contributions
est-0.002
aff-0.002
affe-0.002
 sky-0.002
 girl-0.002
Token friends
Token ID2460
Feature activation+2.86e-01

Pos logit contributions
aff+0.156
affe+0.145
est+0.141
ess+0.135
umm+0.134
Neg logit contributions
 him-0.164
 going-0.158
 it-0.155
 his-0.151
 what-0.138
Token loved
Token ID6151
Feature activation+3.29e+00

Pos logit contributions
 him+0.004
 going+0.004
 it+0.004
 gonna+0.003
 insisting+0.003
Neg logit contributions
est-0.007
aff-0.007
affe-0.006
ari-0.006
umpy-0.006
Token their
Token ID511
Feature activation-8.14e+00

Pos logit contributions
pped+0.037
shadow+0.035
 insisting+0.032
 gonna+0.031
eling+0.030
Neg logit contributions
aff-0.074
 cat-0.069
 elephant-0.069
affe-0.069
 girl-0.068
Token new
Token ID649
Feature activation-1.01e+01

Pos logit contributions
aff+0.147
est+0.144
affe+0.143
umpy+0.127
ari+0.127
Neg logit contributions
 him-0.157
 going-0.152
 it-0.151
 gonna-0.139
 insisting-0.136
Token toy
Token ID13373
Feature activation-4.46e+00

Pos logit contributions
aff+0.162
affe+0.154
est+0.146
umpy+0.136
ari+0.136
Neg logit contributions
 him-0.222
 it-0.219
 going-0.217
 his-0.202
 gonna-0.196
Token.
Token ID13
Feature activation-2.34e-01

Pos logit contributions
aff+0.087
est+0.085
affe+0.083
umpy+0.076
andel+0.075
Neg logit contributions
 him-0.081
 going-0.077
 it-0.077
 gonna-0.069
 his-0.068
Token<|endoftext|>
Token ID50256
Feature activation-5.96e+00

Pos logit contributions
est+0.005
aff+0.005
affe+0.005
andel+0.005
umm+0.005
Neg logit contributions
 him-0.004
 it-0.004
 going-0.003
 his-0.003
 their-0.003
TokenOnce
Token ID7454
Feature activation-6.18e+00

Pos logit contributions
est+0.131
aff+0.131
affe+0.130
umm+0.120
andel+0.117
Neg logit contributions
 him-0.093
 it-0.090
 going-0.086
 his-0.076
 gonna-0.075
Tokenila
Token ID10102
Feature activation-1.89e+00

Pos logit contributions
est+0.107
umm+0.105
aff+0.104
affe+0.104
 sky+0.097
Neg logit contributions
 him-0.126
 it-0.121
 going-0.118
 what-0.110
 your-0.110
Token and
Token ID290
Feature activation-1.08e-01

Pos logit contributions
aff+0.042
est+0.041
affe+0.040
umm+0.039
andel+0.039
Neg logit contributions
 it-0.029
 him-0.028
 going-0.027
 his-0.024
 insisting-0.023
Token Ben
Token ID3932
Feature activation-8.88e-01

Pos logit contributions
aff+0.003
est+0.003
affe+0.003
bird+0.002
ess+0.002
Neg logit contributions
 him-0.001
 going-0.001
 it-0.001
 gonna-0.001
 insisting-0.001
Token thanked
Token ID26280
Feature activation+3.80e+00

Pos logit contributions
aff+0.019
est+0.018
affe+0.018
umm+0.017
andel+0.017
Neg logit contributions
 him-0.015
 it-0.014
 going-0.014
 paying-0.012
 his-0.012
Token their
Token ID511
Feature activation-7.85e+00

Pos logit contributions
 him+0.025
 going+0.024
 it+0.022
 gonna+0.020
 insisting+0.018
Neg logit contributions
aff-0.113
est-0.113
affe-0.111
bird-0.104
ari-0.104
Token mom
Token ID1995
Feature activation-3.60e+00

Pos logit contributions
est+0.166
affe+0.162
aff+0.159
pillar+0.148
umpy+0.145
Neg logit contributions
 him-0.136
 it-0.134
 going-0.131
 his-0.122
 their-0.122
Token and
Token ID290
Feature activation+2.85e+00

Pos logit contributions
umm+0.097
aff+0.095
andel+0.093
affe+0.092
est+0.092
Neg logit contributions
 him-0.044
 it-0.039
 going-0.037
 what-0.036
 his-0.034
Token hugged
Token ID41130
Feature activation+3.61e+00

Pos logit contributions
 him+0.044
 gonna+0.043
 it+0.041
 going+0.041
 insisting+0.039
Neg logit contributions
affe-0.055
est-0.055
 girl-0.053
aff-0.053
ari-0.050
Token her
Token ID607
Feature activation-5.91e+00

Pos logit contributions
 him+0.026
 it+0.023
 going+0.020
 his+0.016
 their+0.014
Neg logit contributions
est-0.112
aff-0.112
affe-0.111
umm-0.106
ari-0.106
Token.
Token ID13
Feature activation-2.03e-01

Pos logit contributions
aff+0.160
umm+0.157
pillar+0.151
affe+0.150
est+0.148
Neg logit contributions
 it-0.073
 him-0.071
 his-0.069
 their-0.065
 going-0.064
Token They
Token ID1119
Feature activation+3.44e+00

Pos logit contributions
est+0.005
affe+0.005
aff+0.005
pillar+0.004
bird+0.004
Neg logit contributions
 it-0.003
 him-0.003
 going-0.003
 their-0.003
 his-0.002
Token was
Token ID373
Feature activation+2.34e+00

Pos logit contributions
est+0.022
aff+0.022
affe+0.022
umm+0.020
ari+0.020
Neg logit contributions
 him-0.015
 going-0.015
 it-0.015
 gonna-0.013
 insisting-0.013
Token happy
Token ID3772
Feature activation+4.34e-01

Pos logit contributions
 him+0.039
 it+0.037
 going+0.037
 gonna+0.034
 insisting+0.034
Neg logit contributions
est-0.047
aff-0.045
affe-0.044
umpy-0.041
bird-0.041
Token they
Token ID484
Feature activation+2.65e+00

Pos logit contributions
 gonna+0.005
 him+0.005
 going+0.005
 insisting+0.005
 paying+0.005
Neg logit contributions
est-0.010
aff-0.009
 girl-0.009
affe-0.009
 sky-0.009
Token listened
Token ID16399
Feature activation+2.21e+00

Pos logit contributions
 him+0.041
 insisting+0.041
 paying+0.039
 going+0.038
 gonna+0.037
Neg logit contributions
est-0.051
affe-0.048
ess-0.047
 girl-0.047
aff-0.046
Token to
Token ID284
Feature activation+4.54e+00

Pos logit contributions
 him+0.027
 gonna+0.026
 going+0.025
 insisting+0.024
 paying+0.023
Neg logit contributions
est-0.051
aff-0.048
ess-0.048
 sky-0.047
affe-0.047
Token her
Token ID607
Feature activation-4.39e+00

Pos logit contributions
 gonna+0.034
 him+0.033
 going+0.032
 paying+0.030
 able+0.029
Neg logit contributions
est-0.122
aff-0.118
affe-0.115
 girl-0.113
 sky-0.113
Token mom
Token ID1995
Feature activation-1.98e+00

Pos logit contributions
aff+0.100
affe+0.095
est+0.093
pillar+0.090
umpy+0.090
Neg logit contributions
 it-0.076
 going-0.075
 him-0.071
 what-0.067
 not-0.066
Tokenmy
Token ID1820
Feature activation-1.21e+00

Pos logit contributions
aff+0.034
affe+0.033
est+0.033
umm+0.030
umpy+0.029
Neg logit contributions
 him-0.041
 going-0.040
 it-0.039
 insisting-0.036
 paying-0.036
Token's
Token ID338
Feature activation-4.03e+00

Pos logit contributions
aff+0.025
est+0.024
affe+0.024
umpy+0.022
umm+0.022
Neg logit contributions
 him-0.020
 going-0.020
 it-0.019
 insisting-0.018
 gonna-0.017
Token feeling
Token ID4203
Feature activation+2.67e+00

Pos logit contributions
aff+0.087
affe+0.083
est+0.081
umpy+0.076
pillar+0.075
Neg logit contributions
 him-0.069
 going-0.068
 it-0.066
 insisting-0.062
 what-0.060
Token and
Token ID290
Feature activation+3.75e+00

Pos logit contributions
 him+0.045
 going+0.042
 it+0.041
 gonna+0.041
 insisting+0.040
Neg logit contributions
est-0.054
affe-0.049
aff-0.049
umpy-0.047
 sky-0.047
Token it
Token ID340
Feature activation+1.92e+00

Pos logit contributions
 it+0.070
 him+0.069
 going+0.066
 gonna+0.062
 what+0.058
Neg logit contributions
aff-0.085
est-0.083
affe-0.081
ari-0.077
ess-0.075
Token's
Token ID338
Feature activation+1.00e+00

Pos logit contributions
 him+0.033
 going+0.031
 it+0.031
 insisting+0.030
 gonna+0.029
Neg logit contributions
est-0.039
affe-0.037
aff-0.036
ess-0.033
bird-0.033
Token so
Token ID523
Feature activation+1.60e+00

Pos logit contributions
 him+0.017
ged+0.015
 insisting+0.015
to+0.015
 suggesting+0.015
Neg logit contributions
est-0.017
aff-0.015
 girl-0.015
affe-0.015
 angel-0.015
Token pretty
Token ID2495
Feature activation-6.58e-01

Pos logit contributions
 him+0.029
 going+0.028
 insisting+0.027
 gonna+0.026
 it+0.026
Neg logit contributions
est-0.029
aff-0.029
affe-0.029
umpy-0.026
ess-0.025
Token!"
Token ID2474
Feature activation+5.91e-02

Pos logit contributions
est+0.012
aff+0.012
affe+0.012
umpy+0.011
ess+0.010
Neg logit contributions
 him-0.012
 going-0.012
 it-0.011
 gonna-0.011
 insisting-0.011
Token
Token ID198
Feature activation-3.36e+00

Pos logit contributions
 him+0.001
 it+0.001
 going+0.001
 his+0.001
 their+0.001
Neg logit contributions
est-0.001
affe-0.001
aff-0.001
umpy-0.001
umm-0.001
Token
Token ID198
Feature activation-3.52e+00

Pos logit contributions
est+0.075
aff+0.072
affe+0.072
umm+0.068
ari+0.065
Neg logit contributions
 him-0.053
 it-0.050
 going-0.048
 his-0.043
 gonna-0.043
TokenThey
Token ID2990
Feature activation+2.64e+00

Pos logit contributions
est+0.082
aff+0.079
affe+0.078
umm+0.075
ari+0.072
Neg logit contributions
 him-0.051
 it-0.049
 going-0.046
 his-0.043
 your-0.042
Token started
Token ID2067
Feature activation+5.39e+00

Pos logit contributions
 him+0.032
 going+0.031
 it+0.030
 gonna+0.029
 insisting+0.028
Neg logit contributions
est-0.063
aff-0.063
affe-0.062
bird-0.057
ari-0.057
Token to
Token ID284
Feature activation+3.87e+00

Pos logit contributions
 gonna+0.058
 him+0.055
pped+0.049
 going+0.046
 it+0.046
Neg logit contributions
aff-0.120
bird-0.118
est-0.116
 sky-0.115
 girl-0.113
Token argue
Token ID7267
Feature activation+3.05e+00

Pos logit contributions
 him+0.054
 going+0.052
 it+0.052
 gonna+0.049
 insisting+0.047
Neg logit contributions
est-0.086
aff-0.085
affe-0.083
bird-0.077
 sky-0.076
Token made
Token ID925
Feature activation+3.21e+00

Pos logit contributions
 him+0.039
 it+0.037
 going+0.037
 gonna+0.036
 insisting+0.036
Neg logit contributions
est-0.053
affe-0.051
aff-0.051
ari-0.048
 sky-0.048
Token his
Token ID465
Feature activation-3.66e+00

Pos logit contributions
 going+0.024
 gonna+0.023
 insisting+0.023
 him+0.022
 paying+0.021
Neg logit contributions
est-0.093
aff-0.085
affe-0.084
 sky-0.081
 girl-0.080
Token way
Token ID835
Feature activation+1.94e+00

Pos logit contributions
aff+0.068
affe+0.059
est+0.056
umpy+0.055
andel+0.054
Neg logit contributions
 going-0.078
 it-0.077
 him-0.077
 his-0.072
 insisting-0.069
Token back
Token ID736
Feature activation+1.44e+00

Pos logit contributions
 him+0.028
 going+0.027
 it+0.026
 gonna+0.026
 insisting+0.025
Neg logit contributions
est-0.042
aff-0.040
affe-0.040
 sky-0.038
ess-0.038
Token to
Token ID284
Feature activation+3.72e+00

Pos logit contributions
 him+0.023
 going+0.022
 it+0.022
 gonna+0.022
 insisting+0.021
Neg logit contributions
est-0.029
aff-0.028
affe-0.027
 sky-0.026
ess-0.025
Token the
Token ID262
Feature activation-9.38e+00

Pos logit contributions
 gonna+0.033
 him+0.030
 insisting+0.029
ged+0.029
 going+0.028
Neg logit contributions
est-0.094
 sky-0.089
aff-0.088
affe-0.087
bird-0.086
Token children
Token ID1751
Feature activation-1.42e+00

Pos logit contributions
aff+0.166
est+0.160
affe+0.158
umpy+0.142
ari+0.138
Neg logit contributions
 him-0.195
 going-0.195
 it-0.185
 his-0.170
 insisting-0.166
Token to
Token ID284
Feature activation+3.47e+00

Pos logit contributions
est+0.029
aff+0.029
affe+0.028
umpy+0.025
umm+0.025
Neg logit contributions
 him-0.024
 going-0.023
 it-0.022
 gonna-0.021
 insisting-0.020
Token keep
Token ID1394
Feature activation+3.63e+00

Pos logit contributions
 gonna+0.036
 him+0.035
 going+0.035
 insisting+0.034
 it+0.033
Neg logit contributions
est-0.082
aff-0.080
affe-0.079
 sky-0.077
bird-0.077
Token his
Token ID465
Feature activation-5.69e+00

Pos logit contributions
Half+0.035
 gonna+0.034
 managed+0.033
plug+0.033
pped+0.032
Neg logit contributions
est-0.086
 sky-0.079
 world-0.077
 girl-0.075
 angel-0.074
Token promise
Token ID6991
Feature activation+2.49e+00

Pos logit contributions
aff+0.110
affe+0.102
est+0.098
umpy+0.092
ari+0.090
Neg logit contributions
 him-0.110
 going-0.110
 it-0.108
 his-0.097
 insisting-0.097
Token
Token ID198
Feature activation-2.88e+00

Pos logit contributions
est+0.027
aff+0.026
affe+0.026
ari+0.023
umpy+0.023
Neg logit contributions
 him-0.022
 going-0.021
 it-0.021
 gonna-0.019
 insisting-0.019
Token
Token ID198
Feature activation-2.92e+00

Pos logit contributions
est+0.060
aff+0.058
affe+0.058
umm+0.052
ari+0.052
Neg logit contributions
 him-0.047
 it-0.046
 going-0.045
 gonna-0.041
 insisting-0.041
TokenL
Token ID43
Feature activation-7.56e+00

Pos logit contributions
est+0.045
aff+0.044
affe+0.043
ari+0.037
umpy+0.037
Neg logit contributions
 him-0.063
 going-0.062
 it-0.061
 gonna-0.058
 insisting-0.057
Tokenily
Token ID813
Feature activation-6.97e-01

Pos logit contributions
aff+0.126
affe+0.125
est+0.124
umm+0.116
 sky+0.113
Neg logit contributions
 him-0.148
 it-0.146
 going-0.143
 what-0.132
 gonna-0.132
Token liked
Token ID8288
Feature activation+3.80e+00

Pos logit contributions
est+0.016
aff+0.016
affe+0.015
umpy+0.014
ari+0.014
Neg logit contributions
 him-0.010
 going-0.009
 it-0.009
 gonna-0.008
 insisting-0.008
Token the
Token ID262
Feature activation-1.09e+01

Pos logit contributions
 gonna+0.032
 him+0.031
pped+0.029
 managed+0.029
 able+0.028
Neg logit contributions
aff-0.093
est-0.089
bird-0.086
 sky-0.086
 angel-0.085
Token idea
Token ID2126
Feature activation+2.30e+00

Pos logit contributions
aff+0.236
affe+0.224
ari+0.207
est+0.204
umpy+0.203
Neg logit contributions
 going-0.185
 him-0.183
 it-0.177
 insisting-0.161
 his-0.159
Token and
Token ID290
Feature activation+3.19e+00

Pos logit contributions
 him+0.046
 going+0.044
 it+0.043
 gonna+0.042
 insisting+0.041
Neg logit contributions
est-0.039
affe-0.037
aff-0.037
 sky-0.033
ari-0.033
Token started
Token ID2067
Feature activation+5.23e+00

Pos logit contributions
 him+0.049
 insisting+0.045
 gonna+0.044
 going+0.044
 risking+0.043
Neg logit contributions
est-0.061
 girl-0.061
affe-0.058
aff-0.058
bird-0.054
Token numbering
Token ID47622
Feature activation+2.75e+00

Pos logit contributions
 gonna+0.058
 him+0.055
 it+0.049
pped+0.047
 your+0.046
Neg logit contributions
est-0.118
aff-0.116
bird-0.115
 sky-0.112
 girl-0.107
Token all
Token ID477
Feature activation-9.14e-02

Pos logit contributions
 him+0.025
 going+0.024
 it+0.022
 gonna+0.022
 insisting+0.020
Neg logit contributions
est-0.075
aff-0.074
affe-0.072
 sky-0.068
bird-0.068
Token had
Token ID550
Feature activation+4.90e+00

Pos logit contributions
 him+0.044
 going+0.043
 it+0.042
 gonna+0.040
 insisting+0.039
Neg logit contributions
aff-0.077
est-0.077
affe-0.076
 sky-0.070
bird-0.070
Token so
Token ID523
Feature activation+2.18e+00

Pos logit contributions
 him+0.067
 insisting+0.062
 paying+0.059
 going+0.058
 it+0.058
Neg logit contributions
est-0.102
 sky-0.100
aff-0.098
affe-0.092
 girl-0.089
Token much
Token ID881
Feature activation-1.35e+00

Pos logit contributions
 him+0.030
 going+0.029
 it+0.029
 gonna+0.026
 insisting+0.026
Neg logit contributions
est-0.050
aff-0.050
affe-0.049
umpy-0.045
ari-0.045
Token fun
Token ID1257
Feature activation+1.56e+00

Pos logit contributions
est+0.026
aff+0.026
affe+0.025
 sky+0.024
umpy+0.023
Neg logit contributions
 him-0.022
 going-0.021
 it-0.021
 gonna-0.021
 insisting-0.020
Token with
Token ID351
Feature activation+2.77e+00

Pos logit contributions
 him+0.022
 gonna+0.020
 insisting+0.019
 going+0.019
 paying+0.018
Neg logit contributions
est-0.034
aff-0.032
affe-0.031
 sky-0.030
bird-0.029
Token the
Token ID262
Feature activation-1.13e+01

Pos logit contributions
 going+0.020
 him+0.020
 gonna+0.019
 insisting+0.017
 it+0.017
Neg logit contributions
est-0.077
aff-0.076
affe-0.075
 sky-0.072
bird-0.071
Token basketball
Token ID9669
Feature activation-1.92e+00

Pos logit contributions
aff+0.248
affe+0.237
ess+0.237
est+0.229
ari+0.226
Neg logit contributions
 him-0.186
 going-0.185
 it-0.184
 his-0.164
 insisting-0.153
Token!
Token ID0
Feature activation+9.56e-01

Pos logit contributions
est+0.039
aff+0.039
affe+0.038
umpy+0.035
umm+0.034
Neg logit contributions
 him-0.032
 going-0.031
 it-0.030
 gonna-0.027
 paying-0.027
Token They
Token ID1119
Feature activation+3.52e+00

Pos logit contributions
 him+0.015
 it+0.014
 going+0.012
 gonna+0.012
 his+0.012
Neg logit contributions
est-0.023
aff-0.021
affe-0.021
 angel-0.020
umm-0.020
Token threw
Token ID9617
Feature activation+4.14e+00

Pos logit contributions
 him+0.048
 going+0.046
 it+0.045
 gonna+0.041
 insisting+0.041
Neg logit contributions
est-0.081
aff-0.081
affe-0.079
ari-0.073
bird-0.072
Token it
Token ID340
Feature activation+9.88e-01

Pos logit contributions
 going+0.024
 him+0.023
 gonna+0.019
 paying+0.018
 insisting+0.018
Neg logit contributions
est-0.126
aff-0.124
affe-0.122
 sky-0.116
 girl-0.114
Token.
Token ID13
Feature activation+1.88e-01

Pos logit contributions
 gonna+0.026
 going+0.025
 him+0.023
 paying+0.023
 insisting+0.022
Neg logit contributions
aff-0.063
est-0.063
affe-0.059
 girl-0.059
ess-0.058
Token She
Token ID1375
Feature activation+2.21e+00

Pos logit contributions
 it+0.003
 him+0.003
 going+0.003
 gonna+0.003
 his+0.003
Neg logit contributions
est-0.004
affe-0.004
aff-0.004
umm-0.004
bird-0.004
Token showed
Token ID3751
Feature activation+2.62e+00

Pos logit contributions
 him+0.035
 gonna+0.031
 going+0.031
 it+0.030
 insisting+0.029
Neg logit contributions
affe-0.044
aff-0.044
est-0.044
 girl-0.043
ari-0.043
Token Lily
Token ID20037
Feature activation-1.84e+00

Pos logit contributions
 him+0.018
 it+0.016
 going+0.013
 his+0.010
 what+0.009
Neg logit contributions
est-0.082
affe-0.081
aff-0.081
umm-0.077
ari-0.077
Token a
Token ID257
Feature activation-4.71e+00

Pos logit contributions
est+0.049
aff+0.046
affe+0.045
bird+0.045
 girl+0.044
Neg logit contributions
 gonna-0.018
 him-0.016
 going-0.015
 insisting-0.014
 paying-0.013
Token new
Token ID649
Feature activation-1.06e+01

Pos logit contributions
aff+0.082
est+0.080
affe+0.079
umm+0.072
umpy+0.070
Neg logit contributions
 it-0.092
 going-0.092
 him-0.090
 gonna-0.084
 paying-0.083
Token song
Token ID3496
Feature activation-2.79e-01

Pos logit contributions
aff+0.182
affe+0.165
umm+0.158
est+0.153
umpy+0.153
Neg logit contributions
 it-0.231
 going-0.226
 him-0.212
 your-0.200
 insisting-0.200
Token that
Token ID326
Feature activation+1.75e+00

Pos logit contributions
est+0.007
aff+0.007
affe+0.007
ari+0.006
ess+0.006
Neg logit contributions
 him-0.003
 going-0.003
 it-0.003
 gonna-0.003
 insisting-0.003
Token she
Token ID673
Feature activation+1.30e+00

Pos logit contributions
 him+0.020
 going+0.019
 it+0.019
 gonna+0.017
 insisting+0.017
Neg logit contributions
est-0.044
aff-0.044
affe-0.043
ari-0.040
umpy-0.040
Token had
Token ID550
Feature activation+5.10e+00

Pos logit contributions
 him+0.020
 going+0.019
 it+0.019
 gonna+0.017
 insisting+0.017
Neg logit contributions
est-0.028
aff-0.028
affe-0.027
umpy-0.025
ari-0.025
Token never
Token ID1239
Feature activation+2.81e+00

Pos logit contributions
 him+0.058
 going+0.052
 it+0.051
 gonna+0.048
 insisting+0.048
Neg logit contributions
est-0.128
aff-0.128
affe-0.124
 sky-0.118
ess-0.116
Token and
Token ID290
Feature activation-9.01e-01

Pos logit contributions
est+0.021
aff+0.021
affe+0.021
 sky+0.019
ari+0.019
Neg logit contributions
 him-0.019
 going-0.018
 it-0.018
 gonna-0.017
 insisting-0.017
Token her
Token ID607
Feature activation-7.38e+00

Pos logit contributions
est+0.026
affe+0.025
aff+0.025
 girl+0.024
bird+0.024
Neg logit contributions
 him-0.006
 going-0.006
 gonna-0.006
 it-0.006
 paying-0.005
Token mom
Token ID1995
Feature activation-3.72e-01

Pos logit contributions
aff+0.187
est+0.176
affe+0.172
umm+0.166
umpy+0.164
Neg logit contributions
 it-0.113
 going-0.108
 him-0.097
 not-0.096
 their-0.092
Token played
Token ID2826
Feature activation+1.44e-01

Pos logit contributions
est+0.009
aff+0.009
affe+0.009
umpy+0.008
umm+0.008
Neg logit contributions
 him-0.005
 going-0.004
 it-0.004
 gonna-0.004
 insisting-0.004
Token with
Token ID351
Feature activation+5.02e-01

Pos logit contributions
 him+0.001
 going+0.001
 gonna+0.001
 it+0.001
 paying+0.001
Neg logit contributions
est-0.004
aff-0.004
affe-0.004
 girl-0.003
bird-0.003
Token the
Token ID262
Feature activation-1.13e+01

Pos logit contributions
 him+0.003
 it+0.003
 going+0.003
 his+0.002
 what+0.002
Neg logit contributions
est-0.016
aff-0.016
affe-0.016
umm-0.015
ari-0.015
Token k
Token ID479
Feature activation-1.11e+01

Pos logit contributions
aff+0.185
affe+0.167
ari+0.166
est+0.164
ess+0.153
Neg logit contributions
 it-0.261
 going-0.256
 him-0.240
 your-0.224
 trying-0.218
Tokenite
Token ID578
Feature activation-1.18e+00

Pos logit contributions
est+0.150
aff+0.138
affe+0.132
umm+0.116
ess+0.109
Neg logit contributions
 it-0.269
 him-0.265
 going-0.250
 your-0.236
 gonna-0.235
Token and
Token ID290
Feature activation+2.72e+00

Pos logit contributions
aff+0.035
affe+0.034
est+0.034
umpy+0.033
umm+0.033
Neg logit contributions
 it-0.010
 him-0.010
 going-0.009
 their-0.007
 his-0.007
Token had
Token ID550
Feature activation+4.30e+00

Pos logit contributions
 him+0.033
 going+0.031
 it+0.029
 gonna+0.029
 insisting+0.028
Neg logit contributions
est-0.066
aff-0.066
affe-0.065
 sky-0.060
bird-0.059
Token fun
Token ID1257
Feature activation+9.40e-01

Pos logit contributions
 him+0.056
 going+0.053
 it+0.051
 insisting+0.050
 gonna+0.049
Neg logit contributions
est-0.100
aff-0.099
affe-0.096
 sky-0.091
ess-0.088
Token and
Token ID290
Feature activation+3.20e+00

Pos logit contributions
aff+0.014
affe+0.014
est+0.014
umpy+0.013
ari+0.013
Neg logit contributions
 him-0.011
 going-0.010
 it-0.010
 gonna-0.009
 paying-0.009
Token dirt
Token ID13647
Feature activation+1.53e+00

Pos logit contributions
 him+0.039
 going+0.035
 gonna+0.035
 it+0.033
 insisting+0.032
Neg logit contributions
est-0.077
aff-0.073
affe-0.072
 sky-0.071
 girl-0.069
Token.
Token ID13
Feature activation+5.75e-01

Pos logit contributions
 him+0.027
 going+0.026
 it+0.026
 gonna+0.024
 insisting+0.024
Neg logit contributions
est-0.029
aff-0.029
affe-0.028
umpy-0.025
ari-0.025
Token
Token ID198
Feature activation-2.72e+00

Pos logit contributions
 it+0.008
 him+0.008
 going+0.007
 gonna+0.006
 his+0.006
Neg logit contributions
est-0.014
affe-0.014
aff-0.014
an-0.013
ari-0.012
Token
Token ID198
Feature activation-2.60e+00

Pos logit contributions
est+0.052
aff+0.052
affe+0.051
umpy+0.046
ari+0.046
Neg logit contributions
 him-0.047
 going-0.046
 it-0.045
 gonna-0.042
 insisting-0.041
TokenThe
Token ID464
Feature activation-1.06e+01

Pos logit contributions
est+0.053
aff+0.053
affe+0.052
umpy+0.047
ari+0.047
Neg logit contributions
 him-0.043
 going-0.042
 it-0.041
 gonna-0.038
 insisting-0.037
Token girl
Token ID2576
Feature activation-1.04e+00

Pos logit contributions
aff+0.162
affe+0.139
ess+0.123
umbs+0.122
ari+0.117
Neg logit contributions
 going-0.270
 it-0.268
 what-0.246
 him-0.243
 insisting-0.239
Token smiled
Token ID13541
Feature activation+9.90e-01

Pos logit contributions
est+0.024
aff+0.024
affe+0.023
umpy+0.021
umm+0.021
Neg logit contributions
 him-0.015
 it-0.014
 going-0.014
 gonna-0.012
 insisting-0.012
Token and
Token ID290
Feature activation+2.93e+00

Pos logit contributions
 him+0.016
 going+0.016
 it+0.016
 gonna+0.015
 insisting+0.014
Neg logit contributions
est-0.020
aff-0.020
affe-0.020
ari-0.018
umpy-0.018
Token put
Token ID1234
Feature activation+3.46e+00

Pos logit contributions
 him+0.048
 going+0.045
 gonna+0.044
 insisting+0.042
 paying+0.041
Neg logit contributions
est-0.058
affe-0.054
aff-0.054
 girl-0.053
 sky-0.052
Token her
Token ID607
Feature activation-6.11e+00

Pos logit contributions
 going+0.029
 him+0.028
 gonna+0.027
 paying+0.025
 insisting+0.023
Neg logit contributions
est-0.097
aff-0.090
affe-0.089
 sky-0.088
 girl-0.087