TOP ACTIVATIONS
MAX = 0.102

beauty around her without always needing to touch it . <|endoftext|>
thought she was ignorant and deserved it . He
to help you when you need it the most . <|endoftext|>
need to dry off and have lunch !" she said .
to things getting broken and needing to be replaced . It
't we go inside and have some cookies and milk ?
and demand help when she needs it . The park was
, Tim my . You deserved to win ." <|endoftext|> Once
Maybe he is lonely and needs a friend . Maybe he
can help us when we need them . <|endoftext|> Once upon
to be careless . They needed to ask for help when
, why don 't you have a twist ?" Baby smiled
't you swim together and have fun ? You can take
ask for help when you need it . <|endoftext|> Once upon
would learn to dance and have fun . They shared their
can make new friends and have fun . Sharing is caring
too far away and always have a friend with you just
is lonely . Maybe he needs a friend ." Lily says
always be space explorers and have fun adventures . But they
they all agreed that it deserved its special name - Flag

MIN ACTIVATIONS
MIN = -0.191

One day Aless io was walking down the street
he wanted . <|endoftext|> Ak io was a 3 year old
!" R aff y said , " That
K ait lyn . K ait lyn went to school every
was a girl named K ait lyn . She was three
So K ait lyn put on her shoes
morning j ogs . <|endoftext|> Lu cy wanted to be an
. The chip m unk told the boy
are ." The hipp o was confused and asked
with us ?" Le o nodded , and Sara
, a little girl named Ab bie decided that she would
it in the trash ." Nem o and his friends worked
a little girl named K ait lyn . K ait lyn
ers and some juice . Ab ig ail said , "
It might be spicy ." Nem o thought for a moment
for behaving . <|endoftext|> S oph ie was a very successful
qualify ." D ennis was so sad and angry
healthy ." D ennis was disappointed , but then
floor . The chip mun ks were very curious
! â € Baby : � � € �

INTERVAL 0.029 - 0.102
CONTAINS 25.095%

branch . She could not move . She could only j
But , his mom said that it was too cold outside
Max saw a little bird laying on the ground . "
the cake carefully . She put a big , red bow
, my loves . That is the moon . It is

INTERVAL -0.045 - 0.029
CONTAINS 63.234%

glitter and was very spark ly . He was so excited
From that day on , the bugs rocked the blue rock
her happy . She knew she would build another ig loo
Dan wanted to find something special in the woods . He
with excitement . The elephant listened carefully . His

INTERVAL -0.118 - -0.045
CONTAINS 10.704%

around us . <|endoftext|> Once there was a little girl named
singing birds . The happy frog and bird decided that this
was getting so fast that he started to blur . He
world and express himself . <|endoftext|> Once upon a time ,
the olive was ready and Jack took his first bite .

INTERVAL -0.191 - -0.118
CONTAINS 0.967%

were gone . <|endoftext|> L ily and Ben were best friends
fish ! The fish pushed him to the side
did a good job ," Ben said , adm iring their
pa ." L ila and Sam smile . They
to go home , the hipp o felt sad . He
Token beauty
Token ID8737
Feature activation-2.07e-02

Pos logit contributions
 yours+0.011
 electricity+0.011
 our+0.010
 theirs+0.010
 such+0.010
Neg logit contributions
op-0.019
anda-0.019
ppy-0.019
oise-0.019
ony-0.019
Token around
Token ID1088
Feature activation+1.83e-02

Pos logit contributions
op+0.038
ppy+0.038
anda+0.037
ony+0.036
int+0.036
Neg logit contributions
 electricity-0.040
 yours-0.040
 our-0.036
 such-0.036
 its-0.035
Token her
Token ID607
Feature activation-7.16e-03

Pos logit contributions
 electricity+0.022
 yours+0.021
 awesome+0.018
 our+0.017
 theirs+0.017
Neg logit contributions
ppy-0.047
y-0.046
ie-0.046
op-0.046
i-0.045
Token without
Token ID1231
Feature activation+6.16e-02

Pos logit contributions
op+0.015
ppy+0.015
anda+0.014
ony+0.014
int+0.014
Neg logit contributions
 electricity-0.012
 yours-0.012
 our-0.011
 such-0.011
 its-0.011
Token always
Token ID1464
Feature activation+6.31e-02

Pos logit contributions
 electricity+0.088
 awesome+0.087
 yours+0.086
 our+0.080
 yourselves+0.074
Neg logit contributions
i-0.134
ppy-0.130
y-0.122
ony-0.122
ie-0.119
Token needing
Token ID18139
Feature activation+1.02e-01

Pos logit contributions
 electricity+0.116
 yours+0.113
 our+0.102
 theirs+0.099
 awesome+0.099
Neg logit contributions
ppy-0.120
owed-0.117
y-0.116
i-0.114
adow-0.113
Token to
Token ID284
Feature activation+7.16e-02

Pos logit contributions
 electricity+0.125
 awesome+0.124
 yours+0.119
 theirs+0.111
 yourselves+0.109
Neg logit contributions
ony-0.213
ppy-0.209
i-0.208
owed-0.204
s-0.200
Token touch
Token ID3638
Feature activation+5.19e-02

Pos logit contributions
 electricity+0.134
 yours+0.131
 our+0.120
 theirs+0.118
 such+0.117
Neg logit contributions
ppy-0.136
op-0.134
ony-0.132
int-0.129
owed-0.127
Token it
Token ID340
Feature activation+1.10e-02

Pos logit contributions
 electricity+0.059
 yours+0.057
 awesome+0.050
 relief+0.047
 theirs+0.046
Neg logit contributions
y-0.127
ie-0.127
i-0.125
ppy-0.123
ony-0.122
Token.
Token ID13
Feature activation-5.47e-03

Pos logit contributions
 electricity+0.020
 yours+0.019
 our+0.018
 such+0.017
 theirs+0.017
Neg logit contributions
ppy-0.022
op-0.022
ony-0.021
anda-0.021
int-0.021
Token<|endoftext|>
Token ID50256
Feature activation-4.51e-02

Pos logit contributions
op+0.010
ppy+0.010
anda+0.010
ony+0.010
uffy+0.010
Neg logit contributions
 electricity-0.010
 yours-0.010
 our-0.009
 such-0.009
 theirs-0.009
Token thought
Token ID1807
Feature activation+6.54e-02

Pos logit contributions
op+0.042
pp+0.039
oise+0.039
uffy+0.038
ops+0.038
Neg logit contributions
 its-0.043
 electricity-0.041
 such-0.040
 a-0.039
 magic-0.039
Token she
Token ID673
Feature activation-2.94e-02

Pos logit contributions
 electricity+0.110
 yours+0.110
 our+0.098
 such+0.097
 awesome+0.095
Neg logit contributions
op-0.136
ppy-0.134
anda-0.130
ony-0.128
uffy-0.126
Token was
Token ID373
Feature activation+5.54e-02

Pos logit contributions
op+0.052
ony+0.050
int+0.050
oise+0.049
ppy+0.049
Neg logit contributions
 electricity-0.057
 such-0.055
 fun-0.053
 yours-0.053
 its-0.053
Token ignorant
Token ID20109
Feature activation+1.52e-02

Pos logit contributions
 electricity+0.087
 yours+0.077
 outer+0.076
 purpose+0.076
 theirs+0.074
Neg logit contributions
ppy-0.115
bug-0.114
's-0.107
anda-0.106
ish-0.105
Token and
Token ID290
Feature activation+2.04e-02

Pos logit contributions
 electricity+0.029
 yours+0.029
 our+0.027
 such+0.026
 its+0.026
Neg logit contributions
ppy-0.028
op-0.028
anda-0.027
ony-0.027
int-0.026
Token deserved
Token ID20143
Feature activation+1.01e-01

Pos logit contributions
 yours+0.036
 electricity+0.036
 purpose+0.034
 such+0.032
 outer+0.032
Neg logit contributions
ppy-0.041
anda-0.039
op-0.038
ony-0.038
int-0.038
Token it
Token ID340
Feature activation+2.09e-02

Pos logit contributions
 electricity+0.115
 yours+0.109
 theirs+0.105
 awesome+0.102
 outer+0.101
Neg logit contributions
ppy-0.236
ony-0.229
i-0.226
op-0.224
ches-0.223
Token.
Token ID13
Feature activation+1.48e-02

Pos logit contributions
 electricity+0.039
 yours+0.037
 our+0.034
 such+0.034
 its+0.034
Neg logit contributions
ppy-0.041
op-0.040
anda-0.038
ony-0.038
y-0.038
Token
Token ID198
Feature activation-5.98e-03

Pos logit contributions
 yours+0.029
 electricity+0.029
 our+0.026
 such+0.026
 awesome+0.026
Neg logit contributions
op-0.028
ppy-0.027
anda-0.026
ony-0.026
int-0.026
Token
Token ID198
Feature activation-1.27e-02

Pos logit contributions
ppy+0.013
op+0.013
anda+0.012
int+0.012
ony+0.012
Neg logit contributions
 electricity-0.010
 yours-0.010
 our-0.009
 its-0.009
 such-0.009
TokenHe
Token ID1544
Feature activation-2.87e-02

Pos logit contributions
ppy+0.028
int+0.027
oise+0.027
owed+0.026
anda+0.026
Neg logit contributions
 electricity-0.020
 yours-0.019
 our-0.019
 your-0.019
 its-0.018
Token to
Token ID284
Feature activation+7.85e-02

Pos logit contributions
 electricity+0.106
 yours+0.099
 awesome+0.095
 outer+0.094
 relief+0.094
Neg logit contributions
s-0.131
op-0.130
y-0.130
anda-0.124
ppy-0.124
Token help
Token ID1037
Feature activation+5.10e-02

Pos logit contributions
 electricity+0.125
 yours+0.112
 awesome+0.106
 outer+0.104
 sight+0.103
Neg logit contributions
ll-0.162
ppy-0.160
ie-0.156
oise-0.155
ony-0.154
Token you
Token ID345
Feature activation+2.52e-02

Pos logit contributions
 electricity+0.062
 yours+0.061
 our+0.052
 such+0.051
 theirs+0.051
Neg logit contributions
ppy-0.129
op-0.129
ony-0.125
anda-0.124
oise-0.123
Token when
Token ID618
Feature activation+5.39e-02

Pos logit contributions
 electricity+0.047
 yours+0.046
 such+0.041
 our+0.041
 awesome+0.041
Neg logit contributions
ppy-0.048
op-0.048
ony-0.046
anda-0.046
y-0.045
Token you
Token ID345
Feature activation+2.53e-02

Pos logit contributions
 electricity+0.089
 yours+0.087
 awesome+0.079
 theirs+0.079
 such+0.078
Neg logit contributions
op-0.103
s-0.102
ll-0.101
mun-0.100
ie-0.099
Token need
Token ID761
Feature activation+9.85e-02

Pos logit contributions
 electricity+0.040
 yours+0.040
 our+0.036
 such+0.036
 peace+0.035
Neg logit contributions
op-0.056
anda-0.055
ppy-0.055
ony-0.054
uffy-0.053
Token it
Token ID340
Feature activation+3.12e-02

Pos logit contributions
 electricity+0.118
 awesome+0.118
 theirs+0.105
 yours+0.102
 relief+0.101
Neg logit contributions
s-0.208
's-0.199
ie-0.198
i-0.197
ony-0.196
Token the
Token ID262
Feature activation+2.23e-02

Pos logit contributions
 electricity+0.056
 yours+0.054
 our+0.049
 such+0.049
 its+0.048
Neg logit contributions
ppy-0.062
op-0.060
ony-0.059
anda-0.058
int-0.058
Token most
Token ID749
Feature activation+3.43e-02

Pos logit contributions
 yours+0.036
 its+0.034
 electricity+0.034
 such+0.033
 theirs+0.032
Neg logit contributions
oise-0.044
op-0.043
ppy-0.043
ish-0.042
int-0.042
Token.
Token ID13
Feature activation+8.59e-03

Pos logit contributions
 electricity+0.040
 yours+0.038
 theirs+0.033
 its+0.033
 our+0.033
Neg logit contributions
op-0.083
anda-0.081
ish-0.081
y-0.081
oise-0.081
Token<|endoftext|>
Token ID50256
Feature activation-4.01e-02

Pos logit contributions
 electricity+0.015
 yours+0.015
 our+0.013
 such+0.013
 its+0.013
Neg logit contributions
op-0.018
ppy-0.018
anda-0.017
ony-0.017
int-0.017
Token need
Token ID761
Feature activation+8.38e-02

Pos logit contributions
 electricity+0.011
 yours+0.011
 our+0.011
 such+0.010
 peace+0.010
Neg logit contributions
int-0.015
ony-0.015
op-0.015
anda-0.015
ppy-0.015
Token to
Token ID284
Feature activation+7.32e-02

Pos logit contributions
 comparison+0.140
 awesome+0.138
 theirs+0.136
 outer+0.135
 relief+0.128
Neg logit contributions
's-0.135
ches-0.133
ppy-0.131
s-0.126
op-0.124
Token dry
Token ID5894
Feature activation+4.52e-02

Pos logit contributions
 electricity+0.147
 yours+0.145
 such+0.130
 theirs+0.130
 its+0.129
Neg logit contributions
ppy-0.128
op-0.121
anda-0.115
ony-0.115
oise-0.114
Token off
Token ID572
Feature activation+3.37e-02

Pos logit contributions
 electricity+0.051
 yours+0.047
 history+0.043
 awesome+0.043
 outer+0.042
Neg logit contributions
ppy-0.114
i-0.113
ie-0.112
anda-0.111
op-0.111
Token and
Token ID290
Feature activation+5.70e-02

Pos logit contributions
 electricity+0.048
 yours+0.043
 theirs+0.039
 power+0.038
 awesome+0.038
Neg logit contributions
ppy-0.078
op-0.078
anda-0.075
y-0.074
uffy-0.074
Token have
Token ID423
Feature activation+9.78e-02

Pos logit contributions
 electricity+0.099
 yours+0.097
 such+0.088
 theirs+0.086
 its+0.085
Neg logit contributions
ppy-0.113
op-0.110
anda-0.107
int-0.105
ony-0.105
Token lunch
Token ID9965
Feature activation+1.49e-02

Pos logit contributions
 yours+0.115
 outer+0.113
 electricity+0.112
 comparison+0.106
 theirs+0.104
Neg logit contributions
's-0.219
ppy-0.205
ll-0.200
o-0.200
s-0.197
Token!"
Token ID2474
Feature activation-1.38e-02

Pos logit contributions
 yours+0.024
 electricity+0.024
 our+0.022
 such+0.021
 its+0.021
Neg logit contributions
op-0.032
ppy-0.032
anda-0.031
ony-0.031
int-0.031
Token she
Token ID673
Feature activation-4.20e-02

Pos logit contributions
op+0.029
ppy+0.029
ony+0.028
anda+0.028
int+0.028
Neg logit contributions
 yours-0.023
 electricity-0.023
 such-0.021
 our-0.020
 awesome-0.020
Token said
Token ID531
Feature activation+7.61e-04

Pos logit contributions
oise+0.086
ig+0.083
ish+0.082
ops+0.081
op+0.081
Neg logit contributions
 its-0.078
 their-0.075
 a-0.074
 electricity-0.073
 our-0.073
Token.
Token ID13
Feature activation+8.30e-03

Pos logit contributions
 electricity+0.001
 yours+0.001
 its+0.001
 our+0.001
 such+0.001
Neg logit contributions
ppy-0.002
op-0.002
anda-0.002
oise-0.002
ony-0.002
Token to
Token ID284
Feature activation+7.46e-02

Pos logit contributions
 electricity+0.053
 yours+0.053
 awesome+0.046
 such+0.046
 theirs+0.045
Neg logit contributions
ppy-0.086
op-0.085
ony-0.083
y-0.083
anda-0.083
Token things
Token ID1243
Feature activation+4.36e-03

Pos logit contributions
 yours+0.084
 electricity+0.077
 awesome+0.072
 relief+0.065
 satisfaction+0.065
Neg logit contributions
ie-0.161
ppy-0.158
y-0.155
's-0.155
i-0.152
Token getting
Token ID1972
Feature activation+6.77e-02

Pos logit contributions
 electricity+0.008
 yours+0.008
 our+0.007
 such+0.007
 its+0.007
Neg logit contributions
op-0.009
ppy-0.009
anda-0.008
ony-0.008
ll-0.008
Token broken
Token ID5445
Feature activation+3.28e-02

Pos logit contributions
 electricity+0.106
 yours+0.106
 such+0.096
 relief+0.094
 awesome+0.093
Neg logit contributions
ppy-0.141
y-0.138
s-0.137
ie-0.137
anda-0.137
Token and
Token ID290
Feature activation+4.45e-02

Pos logit contributions
 electricity+0.068
 yours+0.067
 our+0.062
 such+0.061
 its+0.061
Neg logit contributions
ppy-0.056
op-0.055
anda-0.053
ony-0.053
int-0.052
Token needing
Token ID18139
Feature activation+9.78e-02

Pos logit contributions
 electricity+0.063
 yours+0.062
 such+0.055
 awesome+0.054
 outer+0.054
Neg logit contributions
ll-0.093
ie-0.092
s-0.091
op-0.091
y-0.091
Token to
Token ID284
Feature activation+7.30e-02

Pos logit contributions
 awesome+0.123
 outer+0.118
 comparison+0.118
 yours+0.117
 relief+0.115
Neg logit contributions
s-0.194
op-0.191
's-0.183
ony-0.181
ppy-0.176
Token be
Token ID307
Feature activation+7.05e-02

Pos logit contributions
 electricity+0.133
 yours+0.131
 awesome+0.119
 sight+0.117
 outer+0.117
Neg logit contributions
op-0.129
ppy-0.127
ie-0.124
ony-0.122
ll-0.121
Token replaced
Token ID6928
Feature activation+3.87e-02

Pos logit contributions
 electricity+0.118
 yours+0.114
 outer+0.110
 comparison+0.100
 sight+0.100
Neg logit contributions
ppy-0.135
y-0.128
oise-0.128
bug-0.127
ish-0.127
Token.
Token ID13
Feature activation+7.55e-03

Pos logit contributions
 electricity+0.075
 yours+0.074
 our+0.066
 awesome+0.066
 such+0.065
Neg logit contributions
ppy-0.070
op-0.069
y-0.068
ony-0.067
anda-0.067
Token It
Token ID632
Feature activation+3.41e-03

Pos logit contributions
 yours+0.012
 electricity+0.012
 such+0.011
 our+0.011
 its+0.011
Neg logit contributions
ppy-0.017
op-0.016
anda-0.016
ony-0.016
int-0.016
Token't
Token ID470
Feature activation+4.36e-02

Pos logit contributions
 electricity+0.068
 yours+0.067
 our+0.062
 such+0.061
 its+0.060
Neg logit contributions
ppy-0.101
op-0.099
anda-0.098
int-0.097
ony-0.097
Token we
Token ID356
Feature activation+3.39e-02

Pos logit contributions
 yours+0.048
 electricity+0.046
 our+0.043
 such+0.040
 your+0.039
Neg logit contributions
op-0.120
ppy-0.119
int-0.118
anda-0.117
ony-0.117
Token go
Token ID467
Feature activation+4.63e-02

Pos logit contributions
 yours+0.053
 electricity+0.053
 our+0.048
 such+0.047
 its+0.046
Neg logit contributions
ppy-0.075
op-0.075
anda-0.074
ony-0.073
int-0.073
Token inside
Token ID2641
Feature activation+3.97e-02

Pos logit contributions
 electricity+0.083
 yours+0.076
 relief+0.070
 its+0.069
 satisfaction+0.068
Neg logit contributions
op-0.092
anda-0.091
ppy-0.091
ll-0.089
s-0.088
Token and
Token ID290
Feature activation+6.09e-02

Pos logit contributions
 electricity+0.088
 yours+0.081
 relief+0.077
 awesome+0.076
 theirs+0.075
Neg logit contributions
y-0.058
ppy-0.058
op-0.058
ie-0.058
s-0.057
Token have
Token ID423
Feature activation+9.77e-02

Pos logit contributions
 electricity+0.115
 yours+0.111
 theirs+0.104
 awesome+0.101
 such+0.101
Neg logit contributions
ppy-0.107
op-0.105
bug-0.104
anda-0.101
ll-0.100
Token some
Token ID617
Feature activation+4.26e-02

Pos logit contributions
 comparison+0.114
 electricity+0.111
 outer+0.105
wered+0.101
 yours+0.100
Neg logit contributions
's-0.219
s-0.184
bug-0.175
mie-0.175
 pads-0.173
Token cookies
Token ID14746
Feature activation-1.60e-02

Pos logit contributions
 electricity+0.055
 yours+0.055
 its+0.051
 comparison+0.050
 theirs+0.050
Neg logit contributions
op-0.094
int-0.092
anda-0.090
ie-0.089
ish-0.087
Token and
Token ID290
Feature activation+5.45e-02

Pos logit contributions
ppy+0.031
op+0.031
anda+0.029
ony+0.029
int+0.029
Neg logit contributions
 electricity-0.030
 yours-0.030
 our-0.027
 such-0.027
 its-0.027
Token milk
Token ID7545
Feature activation+1.08e-03

Pos logit contributions
 yours+0.085
 electricity+0.085
 its+0.075
 theirs+0.075
 such+0.074
Neg logit contributions
ppy-0.112
op-0.109
anda-0.107
bug-0.106
ony-0.106
Token?
Token ID30
Feature activation+1.96e-02

Pos logit contributions
 yours+0.002
 electricity+0.002
 our+0.002
 such+0.002
 its+0.002
Neg logit contributions
op-0.002
ppy-0.002
anda-0.002
ony-0.002
int-0.002
Token and
Token ID290
Feature activation+6.63e-02

Pos logit contributions
 yours+0.080
 electricity+0.080
 theirs+0.073
 awesome+0.072
 such+0.070
Neg logit contributions
ppy-0.072
op-0.068
ony-0.067
y-0.067
s-0.066
Token demand
Token ID3512
Feature activation+7.64e-02

Pos logit contributions
 electricity+0.110
 yours+0.107
 theirs+0.097
 awesome+0.095
 such+0.095
Neg logit contributions
op-0.131
ppy-0.131
ll-0.130
ony-0.127
s-0.126
Token help
Token ID1037
Feature activation+4.85e-02

Pos logit contributions
 yours+0.098
 outer+0.087
 electricity+0.087
 awesome+0.086
 theirs+0.084
Neg logit contributions
s-0.172
op-0.172
ony-0.165
ppy-0.165
anda-0.163
Token when
Token ID618
Feature activation+4.86e-02

Pos logit contributions
 electricity+0.074
 yours+0.074
 our+0.065
 such+0.065
 theirs+0.064
Neg logit contributions
ppy-0.108
op-0.107
ony-0.104
anda-0.103
y-0.102
Token she
Token ID673
Feature activation-5.58e-03

Pos logit contributions
 yours+0.071
 electricity+0.069
 theirs+0.064
 such+0.063
 awesome+0.063
Neg logit contributions
op-0.105
s-0.100
ie-0.100
ppy-0.100
ony-0.100
Token needs
Token ID2476
Feature activation+9.77e-02

Pos logit contributions
op+0.011
ppy+0.010
bi+0.010
anda+0.010
oise+0.010
Neg logit contributions
 electricity-0.011
 yours-0.010
 our-0.010
 peace-0.010
 such-0.009
Token it
Token ID340
Feature activation+3.20e-02

Pos logit contributions
 awesome+0.131
 yours+0.113
 electricity+0.112
 theirs+0.111
 outer+0.111
Neg logit contributions
s-0.194
's-0.194
ches-0.185
op-0.185
i-0.185
Token.
Token ID13
Feature activation+9.48e-03

Pos logit contributions
 electricity+0.059
 yours+0.057
 such+0.052
 our+0.051
 awesome+0.051
Neg logit contributions
ppy-0.062
op-0.059
ony-0.058
owed-0.057
ll-0.057
Token The
Token ID383
Feature activation-5.28e-03

Pos logit contributions
 electricity+0.016
 yours+0.016
 our+0.015
 such+0.015
 its+0.014
Neg logit contributions
op-0.020
ppy-0.019
anda-0.019
ony-0.019
int-0.019
Token park
Token ID3952
Feature activation-3.86e-02

Pos logit contributions
op+0.010
ppy+0.010
anda+0.010
int+0.010
ony+0.010
Neg logit contributions
 yours-0.009
 electricity-0.009
 our-0.009
 such-0.008
 its-0.008
Token was
Token ID373
Feature activation+6.33e-02

Pos logit contributions
ppy+0.074
op+0.071
int+0.071
ony+0.068
oise+0.065
Neg logit contributions
 electricity-0.078
 yours-0.073
 our-0.072
 peace-0.071
 such-0.071
Token,
Token ID11
Feature activation+4.21e-02

Pos logit contributions
 electricity+0.050
 yours+0.049
 its+0.045
 such+0.045
 theirs+0.045
Neg logit contributions
op-0.039
ppy-0.038
anda-0.037
uffy-0.036
ony-0.035
Token Tim
Token ID5045
Feature activation-8.21e-02

Pos logit contributions
 electricity+0.067
 yours+0.060
 outer+0.055
 such+0.055
 its+0.054
Neg logit contributions
op-0.087
anda-0.087
ppy-0.087
ll-0.085
uffy-0.085
Tokenmy
Token ID1820
Feature activation-3.85e-02

Pos logit contributions
op+0.095
ony+0.088
int+0.086
anda+0.083
owed+0.079
Neg logit contributions
 yours-0.230
 our-0.216
 electricity-0.213
 such-0.212
 your-0.212
Token.
Token ID13
Feature activation+1.68e-02

Pos logit contributions
op+0.074
ppy+0.074
anda+0.072
ony+0.070
int+0.070
Neg logit contributions
 electricity-0.072
 yours-0.070
 our-0.063
 such-0.063
 theirs-0.063
Token You
Token ID921
Feature activation+1.72e-02

Pos logit contributions
 electricity+0.024
 yours+0.024
 our+0.021
 such+0.021
 its+0.021
Neg logit contributions
op-0.040
ppy-0.039
anda-0.038
int-0.038
ony-0.038
Token deserved
Token ID20143
Feature activation+9.76e-02

Pos logit contributions
 electricity+0.036
 yours+0.036
 our+0.034
 such+0.034
 its+0.033
Neg logit contributions
op-0.029
ppy-0.028
anda-0.028
ony-0.028
int-0.027
Token to
Token ID284
Feature activation+7.52e-02

Pos logit contributions
 electricity+0.120
 relief+0.120
 theirs+0.106
 yourselves+0.106
 comparison+0.104
Neg logit contributions
's-0.214
s-0.206
op-0.190
ne-0.189
ches-0.188
Token win
Token ID1592
Feature activation+5.65e-02

Pos logit contributions
 electricity+0.151
 yours+0.141
 theirs+0.132
 its+0.129
 sight+0.127
Neg logit contributions
op-0.119
ppy-0.117
oise-0.114
ll-0.113
ops-0.112
Token."
Token ID526
Feature activation-7.87e-03

Pos logit contributions
 electricity+0.097
 yours+0.091
 relief+0.090
 theirs+0.086
 hers+0.083
Neg logit contributions
ppy-0.108
op-0.104
s-0.104
y-0.103
ll-0.100
Token<|endoftext|>
Token ID50256
Feature activation-3.69e-02

Pos logit contributions
ppy+0.017
op+0.016
anda+0.016
ony+0.016
int+0.016
Neg logit contributions
 electricity-0.013
 yours-0.013
 our-0.012
 such-0.012
 its-0.012
TokenOnce
Token ID7454
Feature activation-2.07e-02

Pos logit contributions
ppy+0.078
op+0.077
anda+0.076
ony+0.076
int+0.075
Neg logit contributions
 electricity-0.061
 yours-0.061
 such-0.056
 our-0.056
 its-0.055
Token Maybe
Token ID6674
Feature activation+5.79e-02

Pos logit contributions
 electricity+0.051
 yours+0.050
 such+0.046
 sight+0.046
 awesome+0.046
Neg logit contributions
ppy-0.043
op-0.043
anda-0.041
int-0.040
ony-0.040
Token he
Token ID339
Feature activation-1.73e-02

Pos logit contributions
 electricity+0.105
 yours+0.097
 relief+0.092
 such+0.090
 purpose+0.089
Neg logit contributions
ppy-0.105
op-0.104
s-0.102
ie-0.102
i-0.100
Token is
Token ID318
Feature activation+5.59e-02

Pos logit contributions
op+0.039
int+0.038
ppy+0.038
ail+0.037
bi+0.037
Neg logit contributions
 yours-0.026
 electricity-0.024
 our-0.023
 respect-0.022
 your-0.022
Token lonely
Token ID21757
Feature activation+3.33e-02

Pos logit contributions
 electricity+0.088
 outer+0.070
 purpose+0.069
 relief+0.066
 respect+0.065
Neg logit contributions
bug-0.122
ppy-0.117
anda-0.115
bi-0.113
oise-0.110
Token and
Token ID290
Feature activation+3.04e-02

Pos logit contributions
 electricity+0.065
 yours+0.061
 outer+0.056
 its+0.055
 such+0.055
Neg logit contributions
ppy-0.062
y-0.060
op-0.060
anda-0.058
oise-0.056
Token needs
Token ID2476
Feature activation+9.75e-02

Pos logit contributions
 electricity+0.055
 yours+0.052
 purpose+0.050
 such+0.049
 outer+0.047
Neg logit contributions
ppy-0.058
anda-0.058
bug-0.056
op-0.055
s-0.055
Token a
Token ID257
Feature activation+6.69e-02

Pos logit contributions
 comparison+0.133
 outer+0.125
 sacred+0.118
 electricity+0.118
 awesome+0.118
Neg logit contributions
s-0.178
's-0.177
of-0.156
ie-0.155
ches-0.154
Token friend
Token ID1545
Feature activation-4.70e-03

Pos logit contributions
 electricity+0.131
 yours+0.124
 such+0.122
 theirs+0.114
 its+0.113
Neg logit contributions
op-0.124
anda-0.121
ppy-0.118
uffy-0.117
ll-0.115
Token.
Token ID13
Feature activation+2.21e-02

Pos logit contributions
ppy+0.011
op+0.011
anda+0.011
ony+0.010
int+0.010
Neg logit contributions
 electricity-0.007
 yours-0.007
 our-0.006
 its-0.006
 such-0.006
Token Maybe
Token ID6674
Feature activation+5.57e-02

Pos logit contributions
 electricity+0.046
 yours+0.045
 such+0.042
 our+0.041
 its+0.041
Neg logit contributions
ppy-0.038
op-0.037
anda-0.036
ony-0.036
int-0.036
Token he
Token ID339
Feature activation-2.20e-02

Pos logit contributions
 electricity+0.101
 yours+0.095
 such+0.087
 awesome+0.086
 relief+0.085
Neg logit contributions
ppy-0.106
op-0.102
ie-0.101
anda-0.100
ony-0.099
Token can
Token ID460
Feature activation+6.41e-02

Pos logit contributions
ppy+0.061
op+0.061
anda+0.059
ony+0.059
int+0.058
Neg logit contributions
 electricity-0.058
 yours-0.058
 our-0.053
 such-0.052
 its-0.052
Token help
Token ID1037
Feature activation+4.99e-02

Pos logit contributions
 electricity+0.099
 yours+0.098
 our+0.087
 such+0.087
 its+0.086
Neg logit contributions
op-0.144
ppy-0.144
anda-0.140
ony-0.139
int-0.138
Token us
Token ID514
Feature activation+2.67e-03

Pos logit contributions
 electricity+0.051
 yours+0.051
 our+0.043
 such+0.042
 its+0.041
Neg logit contributions
op-0.138
ppy-0.138
anda-0.135
ony-0.134
int-0.134
Token when
Token ID618
Feature activation+4.76e-02

Pos logit contributions
 electricity+0.005
 yours+0.005
 our+0.004
 such+0.004
 its+0.004
Neg logit contributions
op-0.005
ppy-0.005
anda-0.005
ony-0.005
int-0.005
Token we
Token ID356
Feature activation+3.12e-02

Pos logit contributions
 electricity+0.094
 yours+0.093
 relief+0.090
 such+0.087
 awesome+0.087
Neg logit contributions
op-0.075
s-0.073
ll-0.072
i-0.071
ppy-0.070
Token need
Token ID761
Feature activation+9.70e-02

Pos logit contributions
 electricity+0.051
 yours+0.051
 our+0.047
 such+0.046
 its+0.045
Neg logit contributions
op-0.066
ppy-0.066
anda-0.065
ony-0.064
int-0.064
Token them
Token ID606
Feature activation+9.07e-03

Pos logit contributions
 relief+0.148
 electricity+0.139
 theirs+0.131
 awesome+0.130
 sacred+0.129
Neg logit contributions
s-0.197
op-0.182
i-0.176
ll-0.173
re-0.172
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
 electricity+0.017
 yours+0.017
 its+0.015
 such+0.015
 our+0.015
Neg logit contributions
ppy-0.017
op-0.017
anda-0.016
ony-0.016
ll-0.016
Token<|endoftext|>
Token ID50256
Feature activation-3.26e-02

Pos logit contributions
 yours+0.025
 electricity+0.025
 our+0.023
 such+0.023
 its+0.023
Neg logit contributions
ppy-0.031
op-0.031
anda-0.030
ony-0.030
int-0.030
TokenOnce
Token ID7454
Feature activation-1.48e-02

Pos logit contributions
ppy+0.071
op+0.071
anda+0.070
ony+0.069
int+0.069
Neg logit contributions
 electricity-0.051
 yours-0.051
 such-0.048
 our-0.047
 its-0.047
Token upon
Token ID2402
Feature activation+1.45e-02

Pos logit contributions
int+0.026
ppy+0.026
op+0.025
oise+0.025
ony+0.025
Neg logit contributions
 yours-0.030
 our-0.029
 its-0.028
 your-0.028
 theirs-0.027
Token to
Token ID284
Feature activation+7.86e-02

Pos logit contributions
 electricity+0.109
 yours+0.107
 such+0.098
 our+0.096
 awesome+0.095
Neg logit contributions
ppy-0.129
op-0.127
anda-0.123
ony-0.121
y-0.121
Token be
Token ID307
Feature activation+7.23e-02

Pos logit contributions
 electricity+0.143
 yours+0.141
 such+0.129
 theirs+0.128
 awesome+0.126
Neg logit contributions
ppy-0.155
op-0.151
y-0.147
anda-0.145
ie-0.145
Token careless
Token ID36138
Feature activation+3.13e-02

Pos logit contributions
 electricity+0.129
 yours+0.122
 outer+0.117
 such+0.109
 purpose+0.109
Neg logit contributions
ppy-0.147
y-0.138
anda-0.137
bug-0.137
oise-0.133
Token.
Token ID13
Feature activation+2.89e-02

Pos logit contributions
 electricity+0.061
 yours+0.061
 our+0.055
 such+0.054
 awesome+0.054
Neg logit contributions
ppy-0.056
op-0.055
y-0.054
anda-0.053
oise-0.051
Token They
Token ID1119
Feature activation-9.61e-03

Pos logit contributions
 yours+0.058
 electricity+0.057
 such+0.052
 awesome+0.052
 outer+0.051
Neg logit contributions
op-0.049
uffy-0.049
ppy-0.049
anda-0.047
s-0.045
Token needed
Token ID2622
Feature activation+9.68e-02

Pos logit contributions
op+0.020
ppy+0.019
int+0.019
anda+0.019
oise+0.019
Neg logit contributions
 electricity-0.017
 yours-0.016
 such-0.016
 its-0.015
 our-0.015
Token to
Token ID284
Feature activation+8.11e-02

Pos logit contributions
 awesome+0.167
 yours+0.164
 comparison+0.160
 outer+0.159
 ours+0.148
Neg logit contributions
op-0.139
's-0.135
s-0.130
mun-0.120
ie-0.118
Token ask
Token ID1265
Feature activation+4.83e-02

Pos logit contributions
 electricity+0.150
 yours+0.150
 theirs+0.136
 such+0.136
 awesome+0.135
Neg logit contributions
ppy-0.151
op-0.145
ony-0.140
anda-0.138
ie-0.137
Token for
Token ID329
Feature activation+7.45e-02

Pos logit contributions
 electricity+0.079
 yours+0.077
 awesome+0.070
 power+0.067
 outer+0.066
Neg logit contributions
ppy-0.104
ie-0.096
i-0.095
op-0.094
owed-0.093
Token help
Token ID1037
Feature activation+3.22e-02

Pos logit contributions
 yours+0.121
 comparison+0.112
 awesome+0.109
 theirs+0.104
 sight+0.104
Neg logit contributions
ppy-0.123
's-0.114
op-0.112
ll-0.108
ie-0.108
Token when
Token ID618
Feature activation+4.41e-02

Pos logit contributions
 electricity+0.057
 yours+0.056
 our+0.051
 such+0.051
 its+0.050
Neg logit contributions
ppy-0.066
op-0.065
anda-0.063
ony-0.063
int-0.062
Token,
Token ID11
Feature activation+3.30e-02

Pos logit contributions
 yours+0.019
 electricity+0.019
 our+0.017
 such+0.017
 its+0.016
Neg logit contributions
op-0.033
ppy-0.032
anda-0.032
ony-0.032
int-0.032
Token why
Token ID1521
Feature activation+6.31e-02

Pos logit contributions
 yours+0.060
 electricity+0.060
 our+0.056
 such+0.055
 its+0.054
Neg logit contributions
op-0.065
ppy-0.065
anda-0.063
ony-0.063
int-0.063
Token don
Token ID836
Feature activation+4.83e-02

Pos logit contributions
 electricity+0.111
 yours+0.110
 our+0.100
 such+0.100
 its+0.099
Neg logit contributions
op-0.127
ppy-0.127
anda-0.123
ony-0.122
int-0.121
Token't
Token ID470
Feature activation+4.15e-02

Pos logit contributions
 electricity+0.059
 yours+0.057
 such+0.055
 our+0.055
 your+0.052
Neg logit contributions
ppy-0.127
anda-0.126
int-0.124
bi-0.123
oise-0.123
Token you
Token ID345
Feature activation+2.08e-02

Pos logit contributions
 yours+0.038
 our+0.035
 your+0.033
 such+0.032
 electricity+0.031
Neg logit contributions
int-0.127
anda-0.124
op-0.123
oise-0.123
owed-0.123
Token have
Token ID423
Feature activation+9.67e-02

Pos logit contributions
 yours+0.029
 such+0.026
 our+0.026
 your+0.026
 respect+0.025
Neg logit contributions
anda-0.049
op-0.049
oise-0.048
mun-0.048
uffy-0.047
Token a
Token ID257
Feature activation+6.13e-02

Pos logit contributions
 electricity+0.117
 theirs+0.096
 relief+0.095
 yours+0.094
 respect+0.088
Neg logit contributions
's-0.226
ppy-0.225
i-0.223
ony-0.218
ie-0.217
Token twist
Token ID14528
Feature activation-6.04e-03

Pos logit contributions
 electricity+0.118
 yours+0.115
 such+0.106
 theirs+0.105
 our+0.104
Neg logit contributions
op-0.115
ppy-0.114
anda-0.113
int-0.110
ony-0.110
Token?"
Token ID1701
Feature activation-1.81e-02

Pos logit contributions
ppy+0.011
op+0.011
ony+0.011
int+0.010
anda+0.010
Neg logit contributions
 yours-0.012
 electricity-0.012
 our-0.011
 such-0.011
 its-0.011
Token Baby
Token ID14801
Feature activation-1.30e-01

Pos logit contributions
anda+0.044
ppy+0.044
oise+0.042
ony+0.042
op+0.041
Neg logit contributions
 electricity-0.025
 its-0.024
 such-0.024
 yours-0.023
 our-0.023
Token smiled
Token ID13541
Feature activation+2.55e-03

Pos logit contributions
oise+0.228
op+0.219
mun+0.212
ops+0.210
pp+0.209
Neg logit contributions
 its-0.247
 our-0.225
 your-0.222
 the-0.222
 my-0.222
Token't
Token ID470
Feature activation+4.09e-02

Pos logit contributions
 electricity+0.068
 yours+0.067
 our+0.061
 such+0.060
 its+0.059
Neg logit contributions
ppy-0.091
op-0.090
anda-0.088
ony-0.088
int-0.087
Token you
Token ID345
Feature activation+3.32e-02

Pos logit contributions
 yours+0.047
 electricity+0.046
 our+0.043
 such+0.040
 its+0.039
Neg logit contributions
op-0.110
ppy-0.109
int-0.108
anda-0.108
ony-0.108
Token swim
Token ID9422
Feature activation+1.16e-02

Pos logit contributions
 yours+0.053
 electricity+0.053
 our+0.049
 such+0.048
 its+0.047
Neg logit contributions
op-0.073
ppy-0.072
anda-0.071
ony-0.070
int-0.070
Token together
Token ID1978
Feature activation+2.00e-02

Pos logit contributions
 electricity+0.022
 yours+0.021
 such+0.019
 our+0.019
 its+0.019
Neg logit contributions
ppy-0.022
op-0.022
anda-0.022
ony-0.021
int-0.021
Token and
Token ID290
Feature activation+5.60e-02

Pos logit contributions
 electricity+0.040
 yours+0.039
 such+0.035
 its+0.035
 theirs+0.035
Neg logit contributions
ppy-0.036
op-0.036
anda-0.034
y-0.034
ony-0.033
Token have
Token ID423
Feature activation+9.66e-02

Pos logit contributions
 electricity+0.102
 yours+0.095
 such+0.087
 theirs+0.086
 awesome+0.084
Neg logit contributions
ppy-0.107
op-0.102
anda-0.101
bug-0.101
oise-0.100
Token fun
Token ID1257
Feature activation+3.54e-02

Pos logit contributions
 electricity+0.105
 outer+0.091
 comparison+0.083
 yours+0.082
 theirs+0.076
Neg logit contributions
's-0.238
s-0.218
bug-0.218
ll-0.214
ppy-0.212
Token?
Token ID30
Feature activation+1.55e-02

Pos logit contributions
 electricity+0.077
 yours+0.070
 theirs+0.065
 relief+0.064
 its+0.064
Neg logit contributions
op-0.057
ppy-0.057
y-0.057
anda-0.055
ll-0.054
Token You
Token ID921
Feature activation+8.64e-03

Pos logit contributions
 electricity+0.025
 yours+0.025
 such+0.022
 theirs+0.022
 awesome+0.022
Neg logit contributions
op-0.034
ppy-0.034
anda-0.032
int-0.032
ony-0.032
Token can
Token ID460
Feature activation+6.29e-02

Pos logit contributions
 electricity+0.015
 yours+0.015
 our+0.014
 such+0.014
 its+0.014
Neg logit contributions
op-0.018
ppy-0.017
anda-0.017
ony-0.017
int-0.017
Token take
Token ID1011
Feature activation+6.07e-02

Pos logit contributions
 yours+0.096
 electricity+0.094
 our+0.089
 such+0.086
 its+0.084
Neg logit contributions
ppy-0.142
op-0.142
ony-0.138
int-0.138
anda-0.138
Token ask
Token ID1265
Feature activation+6.25e-02

Pos logit contributions
 electricity+0.136
 yours+0.123
 theirs+0.118
 awesome+0.118
 outer+0.116
Neg logit contributions
ppy-0.129
ll-0.124
ie-0.121
i-0.121
ony-0.119
Token for
Token ID329
Feature activation+7.73e-02

Pos logit contributions
 electricity+0.095
 yours+0.093
 awesome+0.089
 sight+0.084
 outer+0.084
Neg logit contributions
ppy-0.133
ony-0.122
op-0.120
i-0.119
owed-0.119
Token help
Token ID1037
Feature activation+4.12e-02

Pos logit contributions
 yours+0.109
 theirs+0.104
 hers+0.099
 awesome+0.098
 electricity+0.097
Neg logit contributions
mun-0.131
ony-0.131
's-0.130
i-0.129
ppy-0.128
Token when
Token ID618
Feature activation+4.92e-02

Pos logit contributions
 electricity+0.073
 yours+0.073
 theirs+0.066
 awesome+0.066
 such+0.065
Neg logit contributions
ppy-0.082
op-0.081
ony-0.078
y-0.077
ie-0.076
Token you
Token ID345
Feature activation+2.36e-02

Pos logit contributions
 yours+0.080
 electricity+0.078
 theirs+0.076
 awesome+0.074
 such+0.074
Neg logit contributions
op-0.093
ll-0.089
i-0.088
s-0.087
ie-0.087
Token need
Token ID761
Feature activation+9.65e-02

Pos logit contributions
 electricity+0.035
 yours+0.034
 such+0.031
 our+0.031
 peace+0.031
Neg logit contributions
anda-0.055
op-0.054
ppy-0.053
uffy-0.052
ony-0.052
Token it
Token ID340
Feature activation+2.65e-02

Pos logit contributions
 theirs+0.125
 awesome+0.122
 yours+0.111
 outer+0.104
 electricity+0.103
Neg logit contributions
's-0.194
op-0.193
s-0.190
ony-0.183
'-0.182
Token.
Token ID13
Feature activation+6.70e-03

Pos logit contributions
 electricity+0.048
 yours+0.047
 our+0.043
 such+0.042
 its+0.042
Neg logit contributions
ppy-0.052
op-0.052
ony-0.050
anda-0.050
int-0.049
Token<|endoftext|>
Token ID50256
Feature activation-3.88e-02

Pos logit contributions
 electricity+0.013
 yours+0.012
 our+0.011
 awesome+0.011
 theirs+0.011
Neg logit contributions
op-0.013
ppy-0.013
anda-0.012
ony-0.012
int-0.012
TokenOnce
Token ID7454
Feature activation-2.28e-02

Pos logit contributions
ppy+0.079
op+0.079
anda+0.077
ony+0.077
int+0.076
Neg logit contributions
 electricity-0.067
 yours-0.067
 our-0.061
 such-0.061
 its-0.060
Token upon
Token ID2402
Feature activation+9.45e-03

Pos logit contributions
ppy+0.038
int+0.037
op+0.036
oise+0.036
ony+0.036
Neg logit contributions
 yours-0.049
 our-0.046
 electricity-0.046
 its-0.045
 your-0.045
Token would
Token ID561
Feature activation+7.02e-02

Pos logit contributions
op+0.026
int+0.026
ppy+0.025
ony+0.025
oise+0.024
Neg logit contributions
 yours-0.032
 electricity-0.031
 such-0.030
 its-0.030
 our-0.030
Token learn
Token ID2193
Feature activation+7.37e-02

Pos logit contributions
 electricity+0.127
 yours+0.117
 outer+0.106
 such+0.105
 awesome+0.104
Neg logit contributions
ppy-0.138
ll-0.135
op-0.128
anda-0.128
ony-0.126
Token to
Token ID284
Feature activation+7.81e-02

Pos logit contributions
 electricity+0.113
 relief+0.106
 yours+0.106
 comparison+0.105
 satisfaction+0.101
Neg logit contributions
s-0.152
's-0.139
y-0.137
bug-0.132
ie-0.131
Token dance
Token ID9280
Feature activation+3.68e-02

Pos logit contributions
 electricity+0.146
 yours+0.138
 theirs+0.131
 such+0.126
 awesome+0.124
Neg logit contributions
ppy-0.144
ll-0.137
ie-0.137
i-0.135
op-0.135
Token and
Token ID290
Feature activation+5.79e-02

Pos logit contributions
 electricity+0.077
 yours+0.075
 our+0.066
 awesome+0.065
 such+0.065
Neg logit contributions
ppy-0.062
y-0.060
s-0.059
op-0.058
owed-0.056
Token have
Token ID423
Feature activation+9.64e-02

Pos logit contributions
 electricity+0.104
 yours+0.100
 such+0.089
 theirs+0.089
 awesome+0.088
Neg logit contributions
ppy-0.111
op-0.107
ll-0.105
bug-0.105
anda-0.105
Token fun
Token ID1257
Feature activation+4.07e-02

Pos logit contributions
 electricity+0.130
 yours+0.119
 outer+0.115
 sacred+0.108
 comparison+0.107
Neg logit contributions
's-0.203
ll-0.182
s-0.181
 pads-0.179
ppy-0.174
Token.
Token ID13
Feature activation+2.12e-02

Pos logit contributions
 electricity+0.084
 yours+0.081
 theirs+0.073
 our+0.072
 awesome+0.072
Neg logit contributions
op-0.069
ppy-0.068
y-0.068
anda-0.065
s-0.065
Token They
Token ID1119
Feature activation-1.37e-02

Pos logit contributions
 yours+0.039
 electricity+0.037
 such+0.034
 our+0.033
 awesome+0.033
Neg logit contributions
op-0.043
ppy-0.042
anda-0.042
uffy-0.041
ony-0.041
Token shared
Token ID4888
Feature activation+5.79e-02

Pos logit contributions
op+0.027
int+0.026
anda+0.025
ony+0.025
ppy+0.025
Neg logit contributions
 electricity-0.025
 yours-0.024
 its-0.024
 such-0.024
 our-0.023
Token their
Token ID511
Feature activation+3.13e-02

Pos logit contributions
 electricity+0.074
 yours+0.068
 outer+0.054
 awesome+0.054
 power+0.052
Neg logit contributions
s-0.149
y-0.141
ppy-0.139
's-0.137
bug-0.136
Token can
Token ID460
Feature activation+7.05e-02

Pos logit contributions
 electricity+0.019
 yours+0.019
 our+0.018
 such+0.017
 its+0.017
Neg logit contributions
op-0.024
ppy-0.024
int-0.024
anda-0.024
ony-0.024
Token make
Token ID787
Feature activation+6.70e-02

Pos logit contributions
 yours+0.099
 electricity+0.095
 our+0.093
 respect+0.090
 such+0.090
Neg logit contributions
int-0.164
ppy-0.163
op-0.162
anda-0.161
oise-0.158
Token new
Token ID649
Feature activation+3.24e-02

Pos logit contributions
 electricity+0.070
 yours+0.064
 theirs+0.056
 outer+0.054
 power+0.052
Neg logit contributions
ppy-0.173
ony-0.170
ie-0.168
anda-0.168
op-0.167
Token friends
Token ID2460
Feature activation+1.55e-03

Pos logit contributions
 electricity+0.050
 yours+0.050
 our+0.045
 such+0.044
 its+0.044
Neg logit contributions
op-0.072
ppy-0.072
anda-0.070
ony-0.070
int-0.069
Token and
Token ID290
Feature activation+5.39e-02

Pos logit contributions
 electricity+0.003
 yours+0.003
 our+0.003
 such+0.003
 its+0.002
Neg logit contributions
ppy-0.003
op-0.003
anda-0.003
ony-0.003
int-0.003
Token have
Token ID423
Feature activation+9.64e-02

Pos logit contributions
 electricity+0.082
 yours+0.079
 such+0.071
 theirs+0.068
 outer+0.067
Neg logit contributions
ppy-0.111
bug-0.106
mun-0.106
ie-0.106
anda-0.106
Token fun
Token ID1257
Feature activation+3.71e-02

Pos logit contributions
 electricity+0.113
 outer+0.097
 yours+0.096
 comparison+0.093
 yourselves+0.087
Neg logit contributions
's-0.222
s-0.209
mun-0.199
bugs-0.199
bug-0.199
Token.
Token ID13
Feature activation+3.86e-02

Pos logit contributions
 electricity+0.072
 yours+0.070
 theirs+0.063
 such+0.063
 its+0.062
Neg logit contributions
op-0.069
ppy-0.068
y-0.066
anda-0.065
oise-0.064
Token Sharing
Token ID36644
Feature activation+3.23e-02

Pos logit contributions
 electricity+0.071
 yours+0.068
 awesome+0.063
 theirs+0.063
 sight+0.061
Neg logit contributions
ppy-0.074
op-0.072
ony-0.068
bug-0.068
anda-0.067
Token is
Token ID318
Feature activation+7.46e-02

Pos logit contributions
 electricity+0.065
 yours+0.064
 such+0.058
 theirs+0.058
 our+0.057
Neg logit contributions
ppy-0.057
op-0.056
anda-0.054
ony-0.054
y-0.053
Token caring
Token ID18088
Feature activation+1.29e-02

Pos logit contributions
 electricity+0.102
 outer+0.100
 comparison+0.093
 sight+0.089
 purpose+0.086
Neg logit contributions
's-0.150
ppy-0.149
bug-0.147
ll-0.142
a-0.142
Token too
Token ID1165
Feature activation+4.60e-02

Pos logit contributions
 yours+0.067
 electricity+0.065
 theirs+0.059
 our+0.057
 awesome+0.057
Neg logit contributions
ppy-0.070
op-0.068
y-0.068
anda-0.068
ches-0.067
Token far
Token ID1290
Feature activation+1.84e-02

Pos logit contributions
 yours+0.064
 electricity+0.063
 such+0.056
 theirs+0.055
 our+0.054
Neg logit contributions
op-0.108
ppy-0.107
ony-0.105
y-0.105
anda-0.104
Token away
Token ID1497
Feature activation+3.20e-02

Pos logit contributions
 electricity+0.038
 yours+0.038
 such+0.034
 awesome+0.034
 theirs+0.034
Neg logit contributions
ppy-0.031
y-0.030
op-0.030
ony-0.029
anda-0.029
Token and
Token ID290
Feature activation+5.23e-02

Pos logit contributions
 electricity+0.062
 yours+0.062
 our+0.055
 such+0.055
 theirs+0.054
Neg logit contributions
ppy-0.058
op-0.057
anda-0.055
y-0.055
ony-0.055
Token always
Token ID1464
Feature activation+6.81e-02

Pos logit contributions
 electricity+0.083
 yours+0.081
 theirs+0.073
 such+0.071
 its+0.070
Neg logit contributions
ppy-0.108
op-0.107
ll-0.105
anda-0.104
ony-0.104
Token have
Token ID423
Feature activation+9.63e-02

Pos logit contributions
 electricity+0.124
 yours+0.121
 theirs+0.110
 such+0.109
 our+0.107
Neg logit contributions
ppy-0.132
op-0.127
ony-0.125
int-0.125
anda-0.125
Token a
Token ID257
Feature activation+7.31e-02

Pos logit contributions
 yours+0.104
 electricity+0.103
 outer+0.097
 theirs+0.090
 comparison+0.086
Neg logit contributions
's-0.225
ppy-0.224
s-0.218
ll-0.217
y-0.215
Token friend
Token ID1545
Feature activation-4.75e-03

Pos logit contributions
 electricity+0.090
 yours+0.090
 such+0.078
 theirs+0.076
 its+0.075
Neg logit contributions
op-0.187
ppy-0.183
anda-0.183
int-0.179
ony-0.179
Token with
Token ID351
Feature activation+4.32e-02

Pos logit contributions
op+0.010
ppy+0.010
anda+0.010
ony+0.010
int+0.010
Neg logit contributions
 electricity-0.008
 yours-0.008
 our-0.007
 such-0.007
 its-0.007
Token you
Token ID345
Feature activation+1.02e-02

Pos logit contributions
 electricity+0.047
 yours+0.045
 theirs+0.039
 awesome+0.038
 outer+0.037
Neg logit contributions
ppy-0.112
ony-0.108
op-0.106
anda-0.106
ie-0.106
Token just
Token ID655
Feature activation+3.73e-02

Pos logit contributions
 electricity+0.018
 yours+0.018
 our+0.016
 such+0.016
 its+0.016
Neg logit contributions
ppy-0.021
op-0.020
anda-0.020
ony-0.020
int-0.019
Token is
Token ID318
Feature activation+5.42e-02

Pos logit contributions
op+0.050
ppy+0.048
int+0.048
ail+0.048
enny+0.047
Neg logit contributions
 electricity-0.033
 yours-0.033
 our-0.031
 respect-0.030
 such-0.030
Token lonely
Token ID21757
Feature activation+3.13e-02

Pos logit contributions
 electricity+0.087
 yours+0.075
 outer+0.071
 respect+0.070
 satisfaction+0.070
Neg logit contributions
ppy-0.111
bug-0.107
anda-0.107
y-0.106
i-0.103
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
 electricity+0.060
 yours+0.059
 such+0.054
 our+0.053
 its+0.053
Neg logit contributions
ppy-0.058
op-0.058
anda-0.056
y-0.055
ony-0.055
Token Maybe
Token ID6674
Feature activation+5.24e-02

Pos logit contributions
 electricity+0.042
 yours+0.042
 such+0.039
 theirs+0.038
 our+0.038
Neg logit contributions
ppy-0.036
op-0.036
anda-0.035
int-0.034
ony-0.034
Token he
Token ID339
Feature activation-2.38e-02

Pos logit contributions
 electricity+0.094
 yours+0.091
 such+0.082
 theirs+0.081
 awesome+0.081
Neg logit contributions
ie-0.095
ppy-0.094
op-0.093
anda-0.092
s-0.091
Token needs
Token ID2476
Feature activation+9.61e-02

Pos logit contributions
op+0.056
ppy+0.053
int+0.052
ail+0.052
bi+0.052
Neg logit contributions
 electricity-0.034
 yours-0.034
 our-0.032
 respect-0.031
 your-0.031
Token a
Token ID257
Feature activation+6.40e-02

Pos logit contributions
 comparison+0.111
 awesome+0.110
 electricity+0.104
 theirs+0.102
 relief+0.097
Neg logit contributions
's-0.190
ie-0.187
ony-0.185
s-0.184
ppy-0.181
Token friend
Token ID1545
Feature activation-6.88e-03

Pos logit contributions
 electricity+0.122
 yours+0.118
 such+0.114
 theirs+0.108
 its+0.107
Neg logit contributions
op-0.121
ppy-0.117
anda-0.116
uffy-0.114
int-0.113
Token."
Token ID526
Feature activation-9.27e-03

Pos logit contributions
ppy+0.015
op+0.015
anda+0.014
ony+0.014
int+0.014
Neg logit contributions
 electricity-0.011
 yours-0.011
 our-0.010
 such-0.010
 its-0.010
Token Lily
Token ID20037
Feature activation-1.07e-01

Pos logit contributions
ppy+0.020
op+0.020
anda+0.019
uffy+0.019
s+0.019
Neg logit contributions
 yours-0.015
 electricity-0.015
 such-0.013
 awesome-0.012
 theirs-0.012
Token says
Token ID1139
Feature activation-1.47e-02

Pos logit contributions
op+0.270
oise+0.259
pp+0.243
ail+0.243
uffy+0.242
Neg logit contributions
 fun-0.152
 trouble-0.151
 our-0.150
 electricity-0.149
 your-0.149
Token always
Token ID1464
Feature activation+7.50e-02

Pos logit contributions
 electricity+0.140
 yours+0.133
 theirs+0.119
 such+0.118
 our+0.117
Neg logit contributions
ppy-0.150
op-0.149
anda-0.147
ony-0.145
ll-0.144
Token be
Token ID307
Feature activation+7.97e-02

Pos logit contributions
 electricity+0.147
 yours+0.139
 theirs+0.128
 such+0.124
 our+0.121
Neg logit contributions
ppy-0.136
int-0.131
ony-0.130
owed-0.130
anda-0.128
Token space
Token ID2272
Feature activation+1.27e-02

Pos logit contributions
 electricity+0.132
 yours+0.110
 purpose+0.104
 comparison+0.098
 relief+0.097
Neg logit contributions
bug-0.170
ppy-0.170
y-0.165
adow-0.165
anda-0.163
Token explorers
Token ID45531
Feature activation-1.29e-02

Pos logit contributions
 electricity+0.024
 yours+0.023
 theirs+0.021
 such+0.020
 its+0.020
Neg logit contributions
ppy-0.024
anda-0.023
op-0.023
y-0.023
ony-0.023
Token and
Token ID290
Feature activation+6.28e-02

Pos logit contributions
ppy+0.024
op+0.024
anda+0.023
ony+0.023
y+0.023
Neg logit contributions
 electricity-0.025
 yours-0.024
 our-0.022
 such-0.022
 its-0.021
Token have
Token ID423
Feature activation+9.61e-02

Pos logit contributions
 electricity+0.112
 yours+0.103
 purpose+0.092
 theirs+0.091
 such+0.091
Neg logit contributions
bug-0.115
ppy-0.112
anda-0.106
ll-0.105
s-0.105
Token fun
Token ID1257
Feature activation+3.85e-02

Pos logit contributions
 electricity+0.110
 yours+0.090
 yourselves+0.078
 comparison+0.078
 relief+0.075
Neg logit contributions
's-0.234
bug-0.229
ppy-0.227
i-0.224
bugs-0.221
Token adventures
Token ID17545
Feature activation+1.47e-02

Pos logit contributions
 electricity+0.080
 yours+0.077
 theirs+0.069
 relief+0.068
 awesome+0.067
Neg logit contributions
op-0.065
y-0.064
ppy-0.062
anda-0.062
ie-0.062
Token.
Token ID13
Feature activation+2.39e-02

Pos logit contributions
 electricity+0.028
 yours+0.027
 our+0.024
 such+0.024
 its+0.024
Neg logit contributions
op-0.028
ppy-0.028
anda-0.027
y-0.026
ony-0.026
Token But
Token ID887
Feature activation+2.48e-02

Pos logit contributions
 yours+0.054
 electricity+0.053
 such+0.048
 awesome+0.047
 theirs+0.047
Neg logit contributions
ppy-0.035
uffy-0.034
s-0.034
op-0.033
anda-0.031
Token they
Token ID484
Feature activation-2.05e-02

Pos logit contributions
 electricity+0.048
 yours+0.045
 awesome+0.041
 purpose+0.041
 yourselves+0.040
Neg logit contributions
i-0.041
ll-0.041
ppy-0.041
y-0.040
op-0.040
Token they
Token ID484
Feature activation-3.30e-03

Pos logit contributions
 electricity+0.075
 yours+0.070
 yourselves+0.061
 purpose+0.060
 respect+0.059
Neg logit contributions
y-0.129
s-0.128
op-0.128
ie-0.127
ppy-0.126
Token all
Token ID477
Feature activation+1.32e-02

Pos logit contributions
op+0.005
ppy+0.005
anda+0.005
int+0.005
ony+0.005
Neg logit contributions
 yours-0.007
 electricity-0.007
 our-0.007
 such-0.007
 its-0.007
Token agreed
Token ID4987
Feature activation+4.69e-02

Pos logit contributions
 electricity+0.029
 yours+0.029
 our+0.027
 such+0.027
 its+0.027
Neg logit contributions
op-0.021
ppy-0.021
anda-0.020
ony-0.020
int-0.020
Token that
Token ID326
Feature activation+6.12e-02

Pos logit contributions
 electricity+0.082
 yours+0.076
 yourselves+0.071
 outer+0.070
 awesome+0.067
Neg logit contributions
s-0.093
op-0.093
y-0.091
ppy-0.089
's-0.088
Token it
Token ID340
Feature activation+1.12e-02

Pos logit contributions
 electricity+0.076
 yours+0.070
 yourselves+0.063
 awesome+0.062
 purpose+0.059
Neg logit contributions
op-0.143
y-0.143
ie-0.142
s-0.141
i-0.141
Token deserved
Token ID20143
Feature activation+9.61e-02

Pos logit contributions
 electricity+0.024
 yours+0.023
 such+0.023
 its+0.023
 our+0.022
Neg logit contributions
op-0.019
int-0.019
anda-0.018
ony-0.018
ppy-0.018
Token its
Token ID663
Feature activation+4.74e-02

Pos logit contributions
 electricity+0.128
 yours+0.125
 yourselves+0.119
 awesome+0.112
 theirs+0.109
Neg logit contributions
ppy-0.209
y-0.206
s-0.202
ony-0.200
ches-0.200
Token special
Token ID2041
Feature activation+2.31e-02

Pos logit contributions
 electricity+0.069
 yours+0.066
 such+0.062
 yourselves+0.062
 theirs+0.059
Neg logit contributions
anda-0.104
ony-0.101
ppy-0.101
op-0.100
int-0.099
Token name
Token ID1438
Feature activation+2.23e-03

Pos logit contributions
 electricity+0.036
 yours+0.034
 such+0.032
 theirs+0.031
 yourselves+0.031
Neg logit contributions
op-0.050
int-0.049
anda-0.048
y-0.048
ony-0.048
Token -
Token ID532
Feature activation+6.35e-02

Pos logit contributions
 electricity+0.004
 yours+0.004
 our+0.004
 awesome+0.004
 such+0.004
Neg logit contributions
ppy-0.004
op-0.004
y-0.004
ony-0.004
anda-0.004
Token Flag
Token ID19762
Feature activation-3.62e-02

Pos logit contributions
 electricity+0.084
 yours+0.076
 purpose+0.071
 yourselves+0.068
 awesome+0.068
Neg logit contributions
op-0.146
ppy-0.141
bug-0.138
s-0.138
y-0.138
Token
Token ID198
Feature activation-1.95e-02

Pos logit contributions
ppy+0.044
op+0.044
anda+0.043
ony+0.043
int+0.043
Neg logit contributions
 yours-0.028
 electricity-0.028
 our-0.025
 such-0.025
 its-0.025
Token
Token ID198
Feature activation-2.19e-02

Pos logit contributions
ppy+0.042
op+0.041
anda+0.040
int+0.040
ony+0.040
Neg logit contributions
 electricity-0.032
 yours-0.032
 our-0.029
 its-0.029
 such-0.029
TokenOne
Token ID3198
Feature activation-2.06e-02

Pos logit contributions
ppy+0.045
op+0.044
anda+0.044
int+0.043
ony+0.043
Neg logit contributions
 yours-0.038
 electricity-0.038
 our-0.035
 its-0.034
 such-0.034
Token day
Token ID1110
Feature activation-1.24e-03

Pos logit contributions
ppy+0.050
op+0.049
ony+0.048
anda+0.047
int+0.047
Neg logit contributions
 electricity-0.029
 yours-0.029
 our-0.027
 its-0.026
 peace-0.026
Token Aless
Token ID47319
Feature activation-1.09e-01

Pos logit contributions
op+0.002
ppy+0.002
anda+0.002
ony+0.002
int+0.002
Neg logit contributions
 electricity-0.003
 yours-0.003
 our-0.003
 such-0.003
 theirs-0.003
Tokenio
Token ID952
Feature activation-1.81e-01

Pos logit contributions
op+0.163
mun+0.159
oise+0.156
ppy+0.155
int+0.148
Neg logit contributions
 yours-0.251
 electricity-0.243
 our-0.229
 its-0.225
 theirs-0.223
Token was
Token ID373
Feature activation+1.48e-02

Pos logit contributions
mun+0.289
op+0.281
oise+0.275
sie+0.275
ppy+0.271
Neg logit contributions
 respect-0.333
 yours-0.323
 a-0.319
 an-0.319
 history-0.317
Token walking
Token ID6155
Feature activation-1.91e-03

Pos logit contributions
 electricity+0.024
 yours+0.024
 our+0.022
 such+0.021
 its+0.021
Neg logit contributions
op-0.032
ppy-0.032
anda-0.031
ony-0.031
int-0.031
Token down
Token ID866
Feature activation+2.53e-02

Pos logit contributions
op+0.004
ppy+0.004
anda+0.004
ony+0.004
int+0.004
Neg logit contributions
 yours-0.003
 electricity-0.003
 its-0.003
 our-0.003
 such-0.003
Token the
Token ID262
Feature activation-2.01e-02

Pos logit contributions
 yours+0.032
 electricity+0.029
 awesome+0.025
 ours+0.023
 respect+0.023
Neg logit contributions
y-0.063
ie-0.061
op-0.061
bug-0.061
oise-0.060
Token street
Token ID4675
Feature activation-4.65e-02

Pos logit contributions
ppy+0.044
op+0.043
int+0.043
owed+0.043
ony+0.041
Neg logit contributions
 electricity-0.034
 yours-0.033
 our-0.031
 your-0.030
 peace-0.029
Token he
Token ID339
Feature activation-2.49e-02

Pos logit contributions
 electricity+0.051
 yours+0.051
 our+0.047
 such+0.047
 its+0.046
Neg logit contributions
op-0.051
ppy-0.051
anda-0.049
ony-0.049
int-0.049
Token wanted
Token ID2227
Feature activation+5.43e-02

Pos logit contributions
op+0.043
int+0.041
ppy+0.040
ony+0.040
anda+0.040
Neg logit contributions
 yours-0.050
 electricity-0.049
 such-0.048
 fun-0.047
 its-0.046
Token.
Token ID13
Feature activation-7.74e-03

Pos logit contributions
 electricity+0.097
 yours+0.091
 our+0.086
 awesome+0.085
 outer+0.084
Neg logit contributions
ppy-0.102
op-0.101
y-0.099
ie-0.099
ony-0.096
Token<|endoftext|>
Token ID50256
Feature activation-5.10e-02

Pos logit contributions
ppy+0.017
op+0.017
anda+0.017
ony+0.017
oise+0.017
Neg logit contributions
 electricity-0.012
 yours-0.012
 such-0.011
 its-0.011
 our-0.011
TokenAk
Token ID33901
Feature activation-1.30e-01

Pos logit contributions
ppy+0.116
anda+0.112
int+0.112
op+0.112
ony+0.112
Neg logit contributions
 electricity-0.077
 yours-0.076
 such-0.074
 its-0.072
 peace-0.071
Tokenio
Token ID952
Feature activation-1.75e-01

Pos logit contributions
op+0.159
mun+0.143
int+0.137
ppy+0.137
uffy+0.131
Neg logit contributions
 yours-0.337
 your-0.308
 our-0.307
 such-0.306
 theirs-0.305
Token was
Token ID373
Feature activation+1.25e-02

Pos logit contributions
mun+0.300
int+0.292
oise+0.290
op+0.290
anda+0.279
Neg logit contributions
 its-0.304
 your-0.296
 yours-0.292
 an-0.288
 such-0.286
Token a
Token ID257
Feature activation+1.10e-02

Pos logit contributions
 electricity+0.019
 yours+0.019
 our+0.017
 such+0.016
 theirs+0.016
Neg logit contributions
op-0.028
ppy-0.028
int-0.028
anda-0.028
ony-0.027
Token 3
Token ID513
Feature activation-5.29e-02

Pos logit contributions
 yours+0.015
 electricity+0.015
 our+0.014
 its+0.014
 awesome+0.014
Neg logit contributions
owed-0.026
ppy-0.025
ony-0.025
op-0.025
ops-0.025
Token year
Token ID614
Feature activation+1.46e-03

Pos logit contributions
ppy+0.116
pp+0.113
ll+0.113
uffy+0.113
int+0.105
Neg logit contributions
 its-0.085
 electricity-0.079
 fun-0.079
 their-0.078
 our-0.078
Token old
Token ID1468
Feature activation-4.08e-02

Pos logit contributions
 electricity+0.001
 yours+0.001
 our+0.001
 such+0.001
 its+0.001
Neg logit contributions
op-0.004
ppy-0.004
anda-0.004
ony-0.004
int-0.004
Token!"
Token ID2474
Feature activation-5.13e-02

Pos logit contributions
op+0.030
ppy+0.030
anda+0.029
ony+0.029
int+0.028
Neg logit contributions
 electricity-0.026
 yours-0.026
 our-0.024
 such-0.024
 its-0.023
Token
Token ID220
Feature activation-3.15e-02

Pos logit contributions
ppy+0.120
anda+0.120
int+0.118
oise+0.117
ony+0.117
Neg logit contributions
 its-0.073
 yours-0.070
 our-0.068
 electricity-0.068
 your-0.068
Token
Token ID198
Feature activation-3.30e-02

Pos logit contributions
ppy+0.077
oise+0.075
int+0.074
anda+0.074
ony+0.073
Neg logit contributions
 yours-0.043
 electricity-0.041
 its-0.040
 our-0.039
 such-0.038
Token
Token ID198
Feature activation-3.06e-02

Pos logit contributions
ppy+0.077
int+0.076
op+0.076
anda+0.076
ish+0.075
Neg logit contributions
 yours-0.046
 its-0.044
 electricity-0.044
 our-0.043
 your-0.042
TokenR
Token ID49
Feature activation-1.06e-01

Pos logit contributions
anda+0.063
ppy+0.063
int+0.062
op+0.062
ony+0.062
Neg logit contributions
 yours-0.051
 electricity-0.050
 our-0.049
 its-0.048
 your-0.047
Tokenaff
Token ID2001
Feature activation-1.70e-01

Pos logit contributions
oise+0.147
op+0.147
mun+0.142
pp+0.142
ppy+0.140
Neg logit contributions
 electricity-0.231
 yours-0.228
 purpose-0.217
 our-0.214
 such-0.211
Tokeny
Token ID88
Feature activation-9.64e-02

Pos logit contributions
uddle+0.221
uffy+0.212
op+0.212
int+0.204
rig+0.201
Neg logit contributions
 its-0.399
 such-0.364
 yours-0.361
 their-0.354
 a-0.348
Token said
Token ID531
Feature activation-1.04e-02

Pos logit contributions
op+0.167
ppy+0.158
oise+0.157
anda+0.153
uffy+0.152
Neg logit contributions
 its-0.189
 electricity-0.183
 a-0.178
 such-0.178
 our-0.178
Token,
Token ID11
Feature activation-2.06e-02

Pos logit contributions
op+0.017
ppy+0.017
anda+0.016
ony+0.016
int+0.016
Neg logit contributions
 electricity-0.022
 yours-0.022
 our-0.020
 such-0.020
 its-0.020
Token "
Token ID366
Feature activation-7.22e-03

Pos logit contributions
ppy+0.053
op+0.053
anda+0.052
ony+0.052
int+0.052
Neg logit contributions
 yours-0.024
 electricity-0.024
 its-0.022
 our-0.022
 such-0.021
TokenThat
Token ID2504
Feature activation+9.65e-03

Pos logit contributions
anda+0.017
op+0.017
ppy+0.017
ony+0.017
uffy+0.017
Neg logit contributions
 yours-0.010
 electricity-0.010
 such-0.009
 our-0.009
 its-0.008
Token K
Token ID509
Feature activation-1.36e-01

Pos logit contributions
op+0.019
int+0.018
ppy+0.018
oise+0.018
ony+0.018
Neg logit contributions
 yours-0.007
 electricity-0.007
 our-0.006
 its-0.006
 your-0.006
Tokenait
Token ID4548
Feature activation-1.63e-01

Pos logit contributions
mun+0.153
op+0.141
oise+0.123
ppy+0.122
ops+0.112
Neg logit contributions
 yours-0.366
 electricity-0.338
 theirs-0.332
 its-0.331
 our-0.329
Tokenlyn
Token ID6213
Feature activation-1.00e-01

Pos logit contributions
op+0.250
int+0.227
anda+0.219
ony+0.218
ppy+0.217
Neg logit contributions
 yours-0.381
 its-0.374
 electricity-0.367
 such-0.360
 our-0.357
Token.
Token ID13
Feature activation-2.65e-02

Pos logit contributions
ppy+0.247
int+0.244
oise+0.236
op+0.235
ony+0.235
Neg logit contributions
 yours-0.124
 electricity-0.123
 its-0.122
 our-0.120
 your-0.114
Token K
Token ID509
Feature activation-1.37e-01

Pos logit contributions
ppy+0.054
op+0.054
int+0.052
ony+0.052
anda+0.052
Neg logit contributions
 electricity-0.046
 yours-0.045
 its-0.042
 our-0.042
 such-0.042
Tokenait
Token ID4548
Feature activation-1.69e-01

Pos logit contributions
mun+0.193
ppy+0.187
ouse+0.186
oise+0.181
op+0.178
Neg logit contributions
 yours-0.299
 electricity-0.298
 our-0.272
 such-0.266
 your-0.266
Tokenlyn
Token ID6213
Feature activation-9.91e-02

Pos logit contributions
op+0.258
int+0.231
ppy+0.227
ony+0.222
anda+0.220
Neg logit contributions
 yours-0.393
 its-0.383
 electricity-0.381
 such-0.373
 our-0.366
Token went
Token ID1816
Feature activation+5.21e-03

Pos logit contributions
ppy+0.220
int+0.218
oise+0.211
op+0.205
ony+0.204
Neg logit contributions
 electricity-0.146
 our-0.145
 its-0.145
 your-0.144
 yours-0.142
Token to
Token ID284
Feature activation+3.13e-02

Pos logit contributions
 electricity+0.007
 yours+0.007
 such+0.006
 our+0.006
 theirs+0.006
Neg logit contributions
op-0.012
ppy-0.012
anda-0.012
ony-0.012
int-0.012
Token school
Token ID1524
Feature activation-2.09e-02

Pos logit contributions
 electricity+0.042
 yours+0.038
 awesome+0.033
 theirs+0.033
 consequences+0.031
Neg logit contributions
bug-0.071
ie-0.071
ppy-0.070
op-0.068
i-0.068
Token every
Token ID790
Feature activation+1.96e-02

Pos logit contributions
ppy+0.043
op+0.042
anda+0.041
ony+0.041
int+0.041
Neg logit contributions
 electricity-0.036
 yours-0.036
 our-0.033
 such-0.033
 its-0.032
Token was
Token ID373
Feature activation+1.76e-02

Pos logit contributions
ppy+0.086
int+0.085
ony+0.081
oise+0.080
op+0.080
Neg logit contributions
 its-0.095
 electricity-0.094
 such-0.093
 yours-0.092
 our-0.090
Token a
Token ID257
Feature activation-5.71e-03

Pos logit contributions
 electricity+0.012
 yours+0.012
 our+0.008
 such+0.008
 theirs+0.008
Neg logit contributions
ppy-0.054
op-0.054
int-0.053
ony-0.053
anda-0.053
Token girl
Token ID2576
Feature activation-9.79e-02

Pos logit contributions
owed+0.015
ony+0.014
ppy+0.014
int+0.014
ops+0.014
Neg logit contributions
 our-0.007
 fun-0.007
 magic-0.007
 electricity-0.007
 yours-0.007
Token named
Token ID3706
Feature activation-8.97e-03

Pos logit contributions
int+0.184
ppy+0.178
ll+0.177
ops+0.177
anda+0.172
Neg logit contributions
 its-0.182
 yours-0.176
 your-0.174
 electricity-0.174
 our-0.173
Token K
Token ID509
Feature activation-1.39e-01

Pos logit contributions
op+0.025
int+0.025
ppy+0.025
ony+0.024
oise+0.024
Neg logit contributions
 yours-0.010
 electricity-0.009
 our-0.008
 your-0.008
 its-0.008
Tokenait
Token ID4548
Feature activation-1.65e-01

Pos logit contributions
mun+0.153
op+0.143
ppy+0.126
oise+0.125
s+0.123
Neg logit contributions
 yours-0.371
 electricity-0.350
 theirs-0.334
 our-0.331
 its-0.330
Tokenlyn
Token ID6213
Feature activation-1.04e-01

Pos logit contributions
op+0.246
int+0.225
anda+0.219
ony+0.217
ppy+0.216
Neg logit contributions
 yours-0.386
 its-0.379
 electricity-0.377
 such-0.367
 our-0.364
Token.
Token ID13
Feature activation-2.56e-02

Pos logit contributions
ppy+0.268
int+0.267
oise+0.255
ches+0.254
ony+0.252
Neg logit contributions
 its-0.117
 electricity-0.115
 our-0.114
 yours-0.112
 your-0.107
Token She
Token ID1375
Feature activation-5.19e-02

Pos logit contributions
ppy+0.050
op+0.049
anda+0.048
ony+0.048
int+0.047
Neg logit contributions
 electricity-0.047
 yours-0.047
 our-0.043
 such-0.043
 its-0.043
Token was
Token ID373
Feature activation+2.22e-02

Pos logit contributions
ppy+0.114
int+0.113
oise+0.113
op+0.113
ony+0.111
Neg logit contributions
 electricity-0.080
 its-0.078
 yours-0.075
 fun-0.075
 your-0.074
Token three
Token ID1115
Feature activation-3.94e-02

Pos logit contributions
 electricity+0.029
 yours+0.028
 our+0.025
 such+0.024
 its+0.024
Neg logit contributions
op-0.055
ppy-0.055
anda-0.054
ony-0.054
int-0.053
Token
Token ID220
Feature activation-6.01e-03

Pos logit contributions
ppy+0.041
oise+0.039
op+0.038
int+0.038
ony+0.037
Neg logit contributions
 electricity-0.027
 yours-0.025
 its-0.025
 our-0.024
 such-0.024
Token
Token ID198
Feature activation-1.68e-02

Pos logit contributions
ppy+0.015
oise+0.014
int+0.014
ony+0.014
anda+0.014
Neg logit contributions
 yours-0.008
 electricity-0.008
 its-0.008
 our-0.007
 such-0.007
Token
Token ID198
Feature activation-2.41e-02

Pos logit contributions
ppy+0.041
int+0.040
ish+0.039
oise+0.039
anda+0.038
Neg logit contributions
 electricity-0.023
 yours-0.022
 its-0.022
 our-0.021
 such-0.021
TokenSo
Token ID2396
Feature activation+3.40e-03

Pos logit contributions
ppy+0.043
int+0.041
op+0.041
anda+0.040
ony+0.040
Neg logit contributions
 electricity-0.048
 yours-0.047
 our-0.046
 its-0.045
 such-0.045
Token K
Token ID509
Feature activation-1.27e-01

Pos logit contributions
 electricity+0.006
 yours+0.006
 our+0.006
 its+0.006
 such+0.006
Neg logit contributions
ppy-0.007
op-0.006
int-0.006
anda-0.006
oise-0.006
Tokenait
Token ID4548
Feature activation-1.64e-01

Pos logit contributions
mun+0.194
ppy+0.184
ouse+0.183
oise+0.182
bug+0.178
Neg logit contributions
 electricity-0.262
 yours-0.260
 our-0.239
 your-0.236
 theirs-0.230
Tokenlyn
Token ID6213
Feature activation-9.24e-02

Pos logit contributions
op+0.294
uddle+0.266
uffy+0.253
mun+0.248
s+0.245
Neg logit contributions
 its-0.341
 yours-0.337
 such-0.322
 electricity-0.321
 our-0.318
Token put
Token ID1234
Feature activation+2.74e-02

Pos logit contributions
ppy+0.186
ony+0.171
pp+0.169
oise+0.168
uffy+0.167
Neg logit contributions
 a-0.145
 the-0.142
 an-0.141
 fun-0.138
 their-0.137
Token on
Token ID319
Feature activation+3.62e-02

Pos logit contributions
 electricity+0.044
 yours+0.044
 our+0.039
 such+0.039
 its+0.039
Neg logit contributions
ppy-0.060
op-0.060
anda-0.058
ony-0.058
int-0.058
Token her
Token ID607
Feature activation+1.05e-03

Pos logit contributions
 yours+0.045
 electricity+0.045
 theirs+0.037
 awesome+0.037
 such+0.035
Neg logit contributions
op-0.088
ony-0.087
ppy-0.084
anda-0.084
i-0.084
Token shoes
Token ID10012
Feature activation-4.48e-02

Pos logit contributions
 electricity+0.002
 yours+0.002
 such+0.001
 our+0.001
 its+0.001
Neg logit contributions
op-0.002
ppy-0.002
anda-0.002
int-0.002
ony-0.002
Token morning
Token ID3329
Feature activation+5.33e-04

Pos logit contributions
op+0.024
anda+0.024
ppy+0.024
ony+0.023
int+0.023
Neg logit contributions
 electricity-0.017
 yours-0.017
 such-0.015
 its-0.015
 our-0.015
Token j
Token ID474
Feature activation-4.09e-02

Pos logit contributions
 electricity+0.001
 yours+0.001
 our+0.001
 such+0.001
 theirs+0.001
Neg logit contributions
op-0.001
ppy-0.001
anda-0.001
ony-0.001
int-0.001
Tokenogs
Token ID18463
Feature activation-3.49e-02

Pos logit contributions
ppy+0.033
ony+0.028
op+0.027
anda+0.027
owed+0.026
Neg logit contributions
 yours-0.122
 electricity-0.118
 our-0.113
 your-0.112
 such-0.112
Token.
Token ID13
Feature activation-2.43e-02

Pos logit contributions
op+0.073
ppy+0.073
anda+0.071
ony+0.071
int+0.071
Neg logit contributions
 yours-0.058
 electricity-0.058
 our-0.053
 such-0.053
 its-0.052
Token<|endoftext|>
Token ID50256
Feature activation-5.49e-02

Pos logit contributions
ppy+0.055
op+0.054
ony+0.054
anda+0.054
int+0.053
Neg logit contributions
 yours-0.037
 electricity-0.036
 such-0.034
 our-0.034
 its-0.034
TokenLu
Token ID25596
Feature activation-1.64e-01

Pos logit contributions
ppy+0.119
anda+0.116
op+0.116
ony+0.116
int+0.115
Neg logit contributions
 electricity-0.087
 yours-0.087
 such-0.083
 its-0.081
 our-0.080
Tokency
Token ID948
Feature activation-1.32e-01

Pos logit contributions
ppy+0.165
anda+0.152
op+0.151
ony+0.142
uddle+0.135
Neg logit contributions
 its-0.422
 such-0.419
 yours-0.415
 our-0.408
 your-0.406
Token wanted
Token ID2227
Feature activation+3.90e-02

Pos logit contributions
int+0.263
ppy+0.260
anda+0.252
oise+0.251
ony+0.250
Neg logit contributions
 your-0.225
 its-0.225
 our-0.220
 such-0.214
 the-0.212
Token to
Token ID284
Feature activation+3.75e-02

Pos logit contributions
 electricity+0.061
 yours+0.060
 relief+0.053
 awesome+0.052
 theirs+0.052
Neg logit contributions
op-0.081
y-0.078
ppy-0.076
s-0.076
ie-0.076
Token be
Token ID307
Feature activation+3.95e-02

Pos logit contributions
 electricity+0.067
 yours+0.066
 our+0.060
 such+0.060
 its+0.059
Neg logit contributions
op-0.075
ppy-0.074
anda-0.072
ony-0.072
int-0.071
Token an
Token ID281
Feature activation+4.08e-03

Pos logit contributions
 electricity+0.060
 yours+0.055
 our+0.047
 theirs+0.047
 its+0.047
Neg logit contributions
ppy-0.090
oise-0.088
op-0.087
int-0.087
bug-0.084
Token.
Token ID13
Feature activation-6.65e-03

Pos logit contributions
 yours+0.029
 electricity+0.028
 awesome+0.025
 our+0.025
 outer+0.024
Neg logit contributions
op-0.037
ppy-0.037
ony-0.035
y-0.035
owed-0.035
Token
Token ID220
Feature activation+1.72e-03

Pos logit contributions
ppy+0.014
op+0.014
anda+0.013
ony+0.013
int+0.013
Neg logit contributions
 electricity-0.012
 yours-0.011
 our-0.010
 such-0.010
 its-0.010
Token
Token ID198
Feature activation-5.66e-03

Pos logit contributions
 yours+0.003
 electricity+0.003
 our+0.002
 such+0.002
 its+0.002
Neg logit contributions
ppy-0.004
anda-0.004
oise-0.004
int-0.004
ony-0.004
Token
Token ID198
Feature activation-1.02e-02

Pos logit contributions
ppy+0.012
anda+0.012
op+0.012
ony+0.012
int+0.012
Neg logit contributions
 electricity-0.009
 yours-0.009
 our-0.008
 such-0.008
 its-0.008
TokenThe
Token ID464
Feature activation-2.75e-02

Pos logit contributions
op+0.022
ppy+0.022
anda+0.022
ony+0.021
int+0.021
Neg logit contributions
 electricity-0.017
 yours-0.016
 our-0.015
 such-0.015
 its-0.014
Token chip
Token ID11594
Feature activation-1.64e-01

Pos logit contributions
ppy+0.052
ony+0.049
op+0.049
int+0.049
anda+0.048
Neg logit contributions
 electricity-0.055
 yours-0.052
 our-0.050
 peace-0.049
 such-0.049
Tokenm
Token ID76
Feature activation-4.62e-02

Pos logit contributions
ppy+0.262
owed+0.242
ony+0.242
adow+0.239
anda+0.233
Neg logit contributions
 good-0.328
 the-0.315
 peace-0.314
 their-0.309
 fun-0.307
Tokenunk
Token ID2954
Feature activation-1.07e-01

Pos logit contributions
ppy+0.077
pp+0.073
oise+0.073
bug+0.070
ony+0.068
Neg logit contributions
 electricity-0.096
 purpose-0.087
 power-0.086
 yours-0.084
 trouble-0.084
Token told
Token ID1297
Feature activation+1.75e-02

Pos logit contributions
ppy+0.176
op+0.172
ony+0.169
uffy+0.168
int+0.167
Neg logit contributions
 its-0.208
 such-0.206
 electricity-0.203
 our-0.194
 peace-0.193
Token the
Token ID262
Feature activation-1.74e-02

Pos logit contributions
 yours+0.019
 electricity+0.017
 awesome+0.015
 outer+0.014
 theirs+0.013
Neg logit contributions
op-0.047
ppy-0.047
ony-0.046
i-0.046
ie-0.046
Token boy
Token ID2933
Feature activation-8.80e-02

Pos logit contributions
ppy+0.033
op+0.031
anda+0.031
ony+0.031
int+0.031
Neg logit contributions
 electricity-0.034
 yours-0.033
 such-0.031
 our-0.031
 peace-0.030
Token are
Token ID389
Feature activation+1.28e-02

Pos logit contributions
ppy+0.106
op+0.103
int+0.103
anda+0.101
ony+0.101
Neg logit contributions
 electricity-0.078
 yours-0.075
 its-0.072
 such-0.071
 our-0.069
Token."
Token ID526
Feature activation-4.40e-02

Pos logit contributions
 electricity+0.019
 outer+0.017
 purpose+0.016
 yours+0.015
 such+0.015
Neg logit contributions
bug-0.026
anda-0.026
ll-0.026
ppy-0.026
ie-0.025
Token
Token ID198
Feature activation-3.76e-02

Pos logit contributions
ppy+0.107
op+0.105
int+0.104
oise+0.104
anda+0.104
Neg logit contributions
 yours-0.059
 its-0.058
 our-0.058
 electricity-0.057
 your-0.056
Token
Token ID198
Feature activation-3.44e-02

Pos logit contributions
ppy+0.080
op+0.080
anda+0.078
int+0.078
ony+0.077
Neg logit contributions
 yours-0.061
 electricity-0.061
 our-0.057
 its-0.056
 such-0.055
TokenThe
Token ID464
Feature activation-3.18e-02

Pos logit contributions
ppy+0.069
anda+0.069
int+0.068
op+0.068
ony+0.067
Neg logit contributions
 yours-0.059
 electricity-0.059
 our-0.057
 its-0.055
 your-0.055
Token hipp
Token ID18568
Feature activation-1.64e-01

Pos logit contributions
op+0.059
ppy+0.059
anda+0.058
ony+0.057
int+0.056
Neg logit contributions
 yours-0.061
 electricity-0.061
 such-0.055
 our-0.055
 its-0.054
Tokeno
Token ID78
Feature activation-1.28e-01

Pos logit contributions
ppy+0.170
ches+0.148
int+0.140
anna+0.139
ony+0.138
Neg logit contributions
 our-0.411
 its-0.410
 electricity-0.404
 respect-0.403
 such-0.402
Token was
Token ID373
Feature activation+1.78e-02

Pos logit contributions
op+0.238
ppy+0.226
uffy+0.222
oise+0.220
erry+0.217
Neg logit contributions
 our-0.228
 its-0.227
 electricity-0.226
 their-0.213
 peace-0.211
Token confused
Token ID10416
Feature activation+2.60e-04

Pos logit contributions
 electricity+0.033
 yours+0.032
 outer+0.029
 our+0.028
 respect+0.028
Neg logit contributions
anda-0.033
ppy-0.032
bug-0.032
op-0.032
ony-0.031
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
 electricity+0.000
 yours+0.000
 its+0.000
 our+0.000
 such+0.000
Neg logit contributions
op-0.001
ppy-0.001
anda-0.001
ony-0.001
oise-0.001
Token asked
Token ID1965
Feature activation-6.37e-03

Pos logit contributions
ppy+0.022
op+0.022
anda+0.022
ony+0.021
int+0.021
Neg logit contributions
 electricity-0.029
 yours-0.029
 such-0.027
 our-0.026
 its-0.026
Token with
Token ID351
Feature activation+2.71e-02

Pos logit contributions
 electricity+0.049
 yours+0.046
 theirs+0.042
 awesome+0.041
 respect+0.041
Neg logit contributions
op-0.044
ppy-0.042
y-0.041
ie-0.041
oise-0.040
Token us
Token ID514
Feature activation-4.41e-02

Pos logit contributions
 electricity+0.028
 yours+0.026
 theirs+0.022
 awesome+0.022
 relief+0.022
Neg logit contributions
op-0.072
ie-0.071
ppy-0.070
anda-0.070
uffy-0.069
Token?"
Token ID1701
Feature activation-4.12e-02

Pos logit contributions
op+0.099
ppy+0.098
int+0.097
ony+0.096
anda+0.095
Neg logit contributions
 yours-0.069
 electricity-0.066
 our-0.063
 such-0.062
 its-0.061
Token
Token ID198
Feature activation-3.14e-02

Pos logit contributions
ppy+0.095
op+0.094
anda+0.093
ony+0.092
int+0.092
Neg logit contributions
 yours-0.060
 electricity-0.059
 its-0.057
 our-0.056
 such-0.055
Token
Token ID198
Feature activation-2.98e-02

Pos logit contributions
ppy+0.069
op+0.068
anda+0.067
int+0.067
ony+0.067
Neg logit contributions
 yours-0.049
 electricity-0.048
 its-0.046
 our-0.046
 your-0.044
TokenLe
Token ID3123
Feature activation-1.63e-01

Pos logit contributions
ppy+0.059
anda+0.059
int+0.059
op+0.058
ony+0.058
Neg logit contributions
 yours-0.051
 electricity-0.050
 our-0.049
 its-0.049
 your-0.048
Tokeno
Token ID78
Feature activation-1.36e-01

Pos logit contributions
cases+0.173
op+0.166
mun+0.156
ops+0.150
uddle+0.143
Neg logit contributions
 its-0.418
 yours-0.395
 our-0.391
 respect-0.390
 your-0.384
Token nodded
Token ID14464
Feature activation-9.25e-03

Pos logit contributions
op+0.265
erry+0.250
ppy+0.248
oise+0.246
unn+0.239
Neg logit contributions
 its-0.249
 our-0.227
 a-0.224
 their-0.223
 an-0.223
Token,
Token ID11
Feature activation-2.59e-02

Pos logit contributions
op+0.017
ppy+0.017
anda+0.017
ony+0.016
int+0.016
Neg logit contributions
 yours-0.018
 electricity-0.018
 our-0.017
 its-0.017
 such-0.016
Token and
Token ID290
Feature activation-7.43e-03

Pos logit contributions
op+0.052
ppy+0.052
anda+0.051
ony+0.051
oise+0.050
Neg logit contributions
 yours-0.045
 electricity-0.045
 its-0.044
 our-0.042
 such-0.042
Token Sara
Token ID24799
Feature activation-1.30e-01

Pos logit contributions
op+0.014
ppy+0.014
anda+0.014
ony+0.014
int+0.013
Neg logit contributions
 its-0.014
 yours-0.014
 electricity-0.014
 our-0.013
 such-0.013
Token,
Token ID11
Feature activation-3.39e-03

Pos logit contributions
ppy+0.001
op+0.001
anda+0.001
ony+0.001
int+0.001
Neg logit contributions
 electricity-0.001
 yours-0.001
 our-0.001
 such-0.001
 its-0.001
Token a
Token ID257
Feature activation-4.83e-03

Pos logit contributions
op+0.008
ppy+0.008
anda+0.008
int+0.008
ony+0.008
Neg logit contributions
 yours-0.004
 electricity-0.004
 our-0.004
 such-0.004
 its-0.004
Token little
Token ID1310
Feature activation-4.10e-02

Pos logit contributions
ppy+0.011
op+0.010
anda+0.010
ony+0.010
int+0.010
Neg logit contributions
 electricity-0.008
 yours-0.008
 our-0.007
 such-0.007
 its-0.007
Token girl
Token ID2576
Feature activation-1.03e-01

Pos logit contributions
ppy+0.069
ony+0.068
op+0.067
ops+0.067
owed+0.067
Neg logit contributions
 electricity-0.086
 yours-0.085
 our-0.081
 its-0.080
 peace-0.079
Token named
Token ID3706
Feature activation-1.34e-02

Pos logit contributions
ppy+0.200
ops+0.199
ll+0.197
op+0.193
int+0.193
Neg logit contributions
 its-0.180
 your-0.168
 our-0.167
 such-0.166
 electricity-0.165
Token Ab
Token ID2275
Feature activation-1.63e-01

Pos logit contributions
op+0.034
int+0.033
ppy+0.033
ony+0.033
anda+0.032
Neg logit contributions
 yours-0.018
 electricity-0.017
 our-0.016
 such-0.015
 its-0.015
Tokenbie
Token ID12590
Feature activation-1.47e-01

Pos logit contributions
cases+0.153
op+0.148
ops+0.140
mun+0.130
ables+0.130
Neg logit contributions
 yours-0.410
 our-0.399
 your-0.396
 such-0.395
 my-0.375
Token decided
Token ID3066
Feature activation+8.41e-03

Pos logit contributions
ppy+0.258
op+0.258
int+0.257
ony+0.250
anda+0.246
Neg logit contributions
 its-0.264
 a-0.254
 our-0.248
 such-0.247
 an-0.246
Token that
Token ID326
Feature activation+3.09e-02

Pos logit contributions
 electricity+0.013
 yours+0.013
 our+0.011
 theirs+0.011
 awesome+0.011
Neg logit contributions
op-0.018
ppy-0.018
ony-0.018
anda-0.018
y-0.018
Token she
Token ID673
Feature activation-4.24e-02

Pos logit contributions
 electricity+0.060
 yours+0.060
 our+0.054
 such+0.053
 theirs+0.053
Neg logit contributions
op-0.056
ppy-0.055
anda-0.054
ony-0.053
int-0.052
Token would
Token ID561
Feature activation+2.46e-02

Pos logit contributions
op+0.077
ppy+0.076
int+0.075
ony+0.073
uffy+0.073
Neg logit contributions
 yours-0.081
 electricity-0.080
 such-0.077
 its-0.076
 fun-0.074
Token it
Token ID340
Feature activation-8.91e-03

Pos logit contributions
 electricity+0.027
 yours+0.025
 awesome+0.021
 theirs+0.021
 our+0.021
Neg logit contributions
ppy-0.073
op-0.073
ony-0.072
y-0.072
anda-0.071
Token in
Token ID287
Feature activation+3.66e-02

Pos logit contributions
ppy+0.017
op+0.017
anda+0.017
ony+0.017
int+0.016
Neg logit contributions
 electricity-0.016
 yours-0.016
 our-0.015
 such-0.015
 its-0.014
Token the
Token ID262
Feature activation+2.89e-03

Pos logit contributions
 electricity+0.042
 yours+0.040
 awesome+0.038
 sacred+0.032
 theirs+0.031
Neg logit contributions
y-0.082
ppy-0.077
op-0.077
bug-0.075
i-0.075
Token trash
Token ID13913
Feature activation-4.37e-02

Pos logit contributions
 yours+0.005
 theirs+0.005
 electricity+0.005
 such+0.005
 our+0.005
Neg logit contributions
anda-0.005
op-0.005
ish-0.005
ppy-0.005
ony-0.005
Token."
Token ID526
Feature activation-4.46e-02

Pos logit contributions
ppy+0.084
op+0.083
y+0.080
anda+0.079
ony+0.079
Neg logit contributions
 electricity-0.081
 yours-0.081
 our-0.073
 awesome-0.073
 theirs-0.073
Token Nem
Token ID22547
Feature activation-1.63e-01

Pos logit contributions
ppy+0.107
anda+0.105
int+0.105
ony+0.104
oise+0.104
Neg logit contributions
 yours-0.061
 its-0.060
 our-0.058
 electricity-0.058
 your-0.058
Tokeno
Token ID78
Feature activation-1.26e-01

Pos logit contributions
ig+0.251
int+0.229
ops+0.227
sie+0.222
ede+0.222
Neg logit contributions
 its-0.326
 his-0.308
 our-0.307
 their-0.306
 your-0.303
Token and
Token ID290
Feature activation-2.77e-02

Pos logit contributions
op+0.241
uffy+0.229
oise+0.219
ppy+0.216
pp+0.215
Neg logit contributions
 its-0.249
 their-0.235
 our-0.234
 your-0.229
 the-0.223
Token his
Token ID465
Feature activation-4.68e-02

Pos logit contributions
op+0.076
int+0.074
ppy+0.074
ony+0.073
anda+0.072
Neg logit contributions
 yours-0.031
 electricity-0.030
 our-0.028
 its-0.028
 your-0.027
Token friends
Token ID2460
Feature activation-8.34e-02

Pos logit contributions
ppy+0.102
op+0.102
ony+0.099
anda+0.099
int+0.099
Neg logit contributions
 electricity-0.075
 yours-0.074
 our-0.068
 such-0.066
 its-0.066
Token worked
Token ID3111
Feature activation+2.51e-02

Pos logit contributions
op+0.152
anda+0.152
uffy+0.150
ony+0.146
ppy+0.141
Neg logit contributions
 its-0.156
 their-0.154
 such-0.151
 yours-0.149
 respect-0.148
Token a
Token ID257
Feature activation-5.70e-03

Pos logit contributions
 electricity+0.012
 yours+0.012
 our+0.009
 such+0.009
 its+0.008
Neg logit contributions
op-0.058
ppy-0.058
anda-0.057
ony-0.057
int-0.057
Token little
Token ID1310
Feature activation-3.54e-02

Pos logit contributions
owed+0.015
ony+0.014
ppy+0.014
ops+0.014
int+0.014
Neg logit contributions
 fun-0.007
 our-0.007
 good-0.007
 magic-0.007
 luxury-0.007
Token girl
Token ID2576
Feature activation-9.53e-02

Pos logit contributions
owed+0.070
ppy+0.065
int+0.065
ops+0.064
ony+0.064
Neg logit contributions
 our-0.066
 yours-0.065
 electricity-0.064
 your-0.064
 its-0.062
Token named
Token ID3706
Feature activation-6.68e-03

Pos logit contributions
int+0.171
ppy+0.166
ops+0.165
op+0.163
ony+0.161
Neg logit contributions
 its-0.183
 yours-0.182
 our-0.178
 your-0.177
 electricity-0.177
Token K
Token ID509
Feature activation-1.36e-01

Pos logit contributions
op+0.019
int+0.018
ppy+0.018
oise+0.018
ony+0.018
Neg logit contributions
 yours-0.007
 electricity-0.007
 our-0.006
 its-0.006
 your-0.006
Tokenait
Token ID4548
Feature activation-1.63e-01

Pos logit contributions
mun+0.153
op+0.141
oise+0.123
ppy+0.122
ops+0.112
Neg logit contributions
 yours-0.366
 electricity-0.338
 theirs-0.332
 its-0.331
 our-0.329
Tokenlyn
Token ID6213
Feature activation-1.00e-01

Pos logit contributions
op+0.250
int+0.227
anda+0.219
ony+0.218
ppy+0.217
Neg logit contributions
 yours-0.381
 its-0.374
 electricity-0.367
 such-0.360
 our-0.357
Token.
Token ID13
Feature activation-2.65e-02

Pos logit contributions
ppy+0.247
int+0.244
oise+0.236
op+0.235
ony+0.235
Neg logit contributions
 yours-0.124
 electricity-0.123
 its-0.122
 our-0.120
 your-0.114
Token K
Token ID509
Feature activation-1.37e-01

Pos logit contributions
ppy+0.054
op+0.054
int+0.052
ony+0.052
anda+0.052
Neg logit contributions
 electricity-0.046
 yours-0.045
 its-0.042
 our-0.042
 such-0.042
Tokenait
Token ID4548
Feature activation-1.69e-01

Pos logit contributions
mun+0.193
ppy+0.187
ouse+0.186
oise+0.181
op+0.178
Neg logit contributions
 yours-0.299
 electricity-0.298
 our-0.272
 such-0.266
 your-0.266
Tokenlyn
Token ID6213
Feature activation-9.91e-02

Pos logit contributions
op+0.258
int+0.231
ppy+0.227
ony+0.222
anda+0.220
Neg logit contributions
 yours-0.393
 its-0.383
 electricity-0.381
 such-0.373
 our-0.366
Tokeners
Token ID364
Feature activation-8.45e-02

Pos logit contributions
op+0.083
ppy+0.082
int+0.081
anda+0.080
ony+0.080
Neg logit contributions
 electricity-0.090
 yours-0.088
 such-0.081
 its-0.081
 our-0.081
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
op+0.166
ppy+0.165
int+0.164
anda+0.162
ony+0.161
Neg logit contributions
 yours-0.153
 electricity-0.152
 such-0.140
 our-0.140
 its-0.137
Token some
Token ID617
Feature activation-1.43e-02

Pos logit contributions
ppy+0.021
op+0.021
anda+0.021
ony+0.020
int+0.020
Neg logit contributions
 electricity-0.019
 yours-0.019
 our-0.017
 such-0.017
 its-0.017
Token juice
Token ID13135
Feature activation-5.09e-02

Pos logit contributions
op+0.033
ppy+0.032
anda+0.032
ony+0.032
int+0.031
Neg logit contributions
 yours-0.021
 electricity-0.020
 our-0.019
 such-0.018
 theirs-0.018
Token.
Token ID13
Feature activation-2.20e-02

Pos logit contributions
int+0.112
op+0.111
ppy+0.110
ony+0.110
oise+0.110
Neg logit contributions
 yours-0.081
 electricity-0.079
 our-0.072
 its-0.071
 such-0.071
Token Ab
Token ID2275
Feature activation-1.63e-01

Pos logit contributions
ppy+0.049
op+0.049
anda+0.048
ony+0.048
int+0.048
Neg logit contributions
 yours-0.034
 electricity-0.034
 our-0.031
 its-0.030
 such-0.030
Tokenig
Token ID328
Feature activation-1.18e-01

Pos logit contributions
op+0.153
ppy+0.135
ouse+0.133
cases+0.132
uddy+0.129
Neg logit contributions
 yours-0.419
 your-0.409
 such-0.402
 our-0.399
 my-0.388
Tokenail
Token ID603
Feature activation-1.27e-01

Pos logit contributions
int+0.089
ppy+0.082
oise+0.081
mun+0.080
op+0.080
Neg logit contributions
 your-0.324
 our-0.322
 its-0.319
 yours-0.319
 electricity-0.314
Token said
Token ID531
Feature activation-2.55e-03

Pos logit contributions
op+0.236
ppy+0.228
int+0.227
oise+0.223
erry+0.215
Neg logit contributions
 electricity-0.225
 your-0.223
 its-0.222
 our-0.220
 such-0.216
Token,
Token ID11
Feature activation-1.09e-02

Pos logit contributions
op+0.004
ppy+0.004
anda+0.004
ony+0.004
int+0.003
Neg logit contributions
 electricity-0.006
 yours-0.006
 our-0.005
 such-0.005
 theirs-0.005
Token "
Token ID366
Feature activation-1.11e-02

Pos logit contributions
op+0.026
ppy+0.026
anda+0.026
ony+0.026
int+0.026
Neg logit contributions
 electricity-0.015
 yours-0.015
 our-0.013
 such-0.013
 its-0.012
Token It
Token ID632
Feature activation-2.31e-02

Pos logit contributions
op+0.061
ppy+0.061
anda+0.059
ony+0.059
int+0.058
Neg logit contributions
 yours-0.045
 electricity-0.045
 our-0.041
 such-0.040
 its-0.040
Token might
Token ID1244
Feature activation+2.67e-02

Pos logit contributions
op+0.046
ppy+0.043
int+0.043
ony+0.042
oise+0.041
Neg logit contributions
 electricity-0.045
 yours-0.044
 our-0.044
 its-0.042
 such-0.041
Token be
Token ID307
Feature activation+3.21e-02

Pos logit contributions
 electricity+0.047
 yours+0.046
 such+0.043
 theirs+0.042
 yourselves+0.042
Neg logit contributions
ppy-0.050
ony-0.047
anda-0.046
op-0.046
y-0.046
Token spicy
Token ID26880
Feature activation-3.21e-02

Pos logit contributions
 electricity+0.041
 yours+0.039
 purpose+0.034
 yourselves+0.034
 outer+0.034
Neg logit contributions
ppy-0.079
ches-0.073
anda-0.073
y-0.072
ony-0.072
Token."
Token ID526
Feature activation-4.32e-02

Pos logit contributions
op+0.065
ppy+0.065
anda+0.064
int+0.064
ony+0.064
Neg logit contributions
 yours-0.057
 electricity-0.057
 our-0.052
 such-0.051
 its-0.050
Token Nem
Token ID22547
Feature activation-1.62e-01

Pos logit contributions
op+0.101
ppy+0.101
int+0.098
ony+0.098
anda+0.098
Neg logit contributions
 yours-0.063
 electricity-0.060
 its-0.060
 our-0.060
 your-0.058
Tokeno
Token ID78
Feature activation-1.24e-01

Pos logit contributions
sie+0.252
ig+0.241
cases+0.238
ird+0.227
ops+0.225
Neg logit contributions
 its-0.305
 our-0.296
 your-0.286
 yours-0.284
 his-0.280
Token thought
Token ID1807
Feature activation+4.90e-03

Pos logit contributions
op+0.255
uffy+0.235
oise+0.230
itten+0.228
pp+0.225
Neg logit contributions
 its-0.230
 our-0.214
 your-0.212
 their-0.210
 the-0.208
Token for
Token ID329
Feature activation+1.62e-02

Pos logit contributions
 electricity+0.007
 yours+0.007
 our+0.007
 such+0.006
 its+0.006
Neg logit contributions
ppy-0.011
op-0.011
anda-0.011
ony-0.011
int-0.011
Token a
Token ID257
Feature activation+8.22e-03

Pos logit contributions
 yours+0.024
 electricity+0.021
 ours+0.020
 comparison+0.019
 theirs+0.019
Neg logit contributions
ppy-0.036
ie-0.033
i-0.032
ches-0.031
owed-0.031
Token moment
Token ID2589
Feature activation-3.81e-02

Pos logit contributions
 electricity+0.015
 yours+0.015
 our+0.014
 such+0.014
 its+0.013
Neg logit contributions
op-0.016
ppy-0.016
anda-0.015
ony-0.015
int-0.015
Token for
Token ID329
Feature activation+3.20e-02

Pos logit contributions
ppy+0.114
op+0.113
int+0.112
anda+0.111
ony+0.110
Neg logit contributions
 yours-0.070
 electricity-0.069
 such-0.064
 our-0.064
 its-0.063
Token behaving
Token ID37722
Feature activation+1.80e-02

Pos logit contributions
 electricity+0.052
 yours+0.048
 purpose+0.041
 outer+0.041
 ours+0.041
Neg logit contributions
i-0.063
ppy-0.062
ll-0.060
ie-0.060
mun-0.060
Token.
Token ID13
Feature activation-2.85e-02

Pos logit contributions
 electricity+0.033
 yours+0.032
 our+0.029
 its+0.029
 theirs+0.028
Neg logit contributions
op-0.035
ppy-0.035
anda-0.033
ony-0.033
y-0.033
Token<|endoftext|>
Token ID50256
Feature activation-5.90e-02

Pos logit contributions
ppy+0.062
op+0.062
int+0.060
anda+0.060
ony+0.060
Neg logit contributions
 yours-0.046
 electricity-0.045
 its-0.042
 our-0.042
 such-0.042
TokenS
Token ID50
Feature activation-1.24e-01

Pos logit contributions
ppy+0.131
op+0.128
anda+0.127
int+0.127
ony+0.126
Neg logit contributions
 electricity-0.091
 yours-0.091
 such-0.087
 its-0.086
 our-0.084
Tokenoph
Token ID2522
Feature activation-1.62e-01

Pos logit contributions
op+0.161
ppy+0.146
oise+0.142
mun+0.138
int+0.137
Neg logit contributions
 yours-0.314
 our-0.294
 your-0.293
 such-0.290
 theirs-0.284
Tokenie
Token ID494
Feature activation-1.30e-01

Pos logit contributions
op+0.198
int+0.169
oise+0.165
oping+0.156
oped+0.156
Neg logit contributions
 yours-0.440
 its-0.432
 your-0.422
 our-0.420
 such-0.413
Token was
Token ID373
Feature activation+1.50e-03

Pos logit contributions
ppy+0.238
int+0.237
op+0.234
oise+0.225
anda+0.225
Neg logit contributions
 its-0.248
 your-0.245
 our-0.241
 electricity-0.236
 yours-0.234
Token a
Token ID257
Feature activation-1.76e-03

Pos logit contributions
 yours+0.002
 electricity+0.002
 our+0.002
 such+0.002
 its+0.002
Neg logit contributions
ppy-0.004
op-0.004
anda-0.004
ony-0.004
int-0.003
Token very
Token ID845
Feature activation+1.81e-02

Pos logit contributions
owed+0.004
ops+0.004
ppy+0.004
ony+0.004
op+0.004
Neg logit contributions
 yours-0.002
 our-0.002
 electricity-0.002
 magic-0.002
 luxury-0.002
Token successful
Token ID4388
Feature activation-2.20e-02

Pos logit contributions
 yours+0.023
 electricity+0.022
 our+0.020
 its+0.019
 awesome+0.019
Neg logit contributions
ppy-0.046
op-0.045
ony-0.044
anda-0.044
int-0.044
Token qualify
Token ID12780
Feature activation+2.25e-02

Pos logit contributions
 electricity+0.023
 yours+0.022
 our+0.020
 such+0.020
 awesome+0.020
Neg logit contributions
ppy-0.027
op-0.026
y-0.025
ony-0.025
oise-0.025
Token."
Token ID526
Feature activation-3.23e-02

Pos logit contributions
 electricity+0.043
 yours+0.041
 relief+0.037
 such+0.037
 its+0.037
Neg logit contributions
ppy-0.041
op-0.041
y-0.040
anda-0.039
ony-0.038
Token
Token ID198
Feature activation-2.09e-02

Pos logit contributions
ppy+0.077
anda+0.075
op+0.075
int+0.075
oise+0.075
Neg logit contributions
 electricity-0.044
 yours-0.044
 its-0.043
 our-0.042
 such-0.042
Token
Token ID198
Feature activation-2.30e-02

Pos logit contributions
ppy+0.046
op+0.045
int+0.045
anda+0.045
ony+0.044
Neg logit contributions
 electricity-0.032
 yours-0.032
 our-0.030
 its-0.030
 such-0.029
TokenD
Token ID35
Feature activation-1.03e-01

Pos logit contributions
ppy+0.053
int+0.052
op+0.052
anda+0.052
ony+0.051
Neg logit contributions
 electricity-0.034
 yours-0.033
 our-0.031
 its-0.031
 such-0.030
Tokenennis
Token ID10679
Feature activation-1.62e-01

Pos logit contributions
mun+0.117
ppy+0.108
oise+0.108
op+0.107
bug+0.106
Neg logit contributions
 yours-0.267
 electricity-0.253
 theirs-0.241
 your-0.238
 our-0.236
Token was
Token ID373
Feature activation+2.16e-02

Pos logit contributions
op+0.293
oise+0.273
uffy+0.273
erry+0.272
anda+0.271
Neg logit contributions
 its-0.305
 electricity-0.290
 such-0.290
 your-0.282
 our-0.281
Token so
Token ID523
Feature activation+2.97e-02

Pos logit contributions
 electricity+0.043
 yours+0.042
 our+0.037
 theirs+0.037
 outer+0.036
Neg logit contributions
ppy-0.038
anda-0.038
bug-0.038
op-0.037
int-0.037
Token sad
Token ID6507
Feature activation+3.69e-03

Pos logit contributions
 electricity+0.047
 yours+0.046
 our+0.042
 such+0.042
 its+0.042
Neg logit contributions
ppy-0.066
op-0.066
anda-0.064
ony-0.064
int-0.063
Token and
Token ID290
Feature activation-1.09e-02

Pos logit contributions
 electricity+0.006
 yours+0.006
 our+0.006
 such+0.006
 its+0.006
Neg logit contributions
ppy-0.008
op-0.007
anda-0.007
ony-0.007
int-0.007
Token angry
Token ID7954
Feature activation-1.34e-02

Pos logit contributions
op+0.022
ppy+0.021
anda+0.020
ony+0.020
int+0.020
Neg logit contributions
 electricity-0.020
 yours-0.020
 our-0.019
 its-0.018
 such-0.018
Token healthy
Token ID5448
Feature activation-4.30e-03

Pos logit contributions
 electricity+0.036
 outer+0.032
 relief+0.028
 comparison+0.027
 sight+0.027
Neg logit contributions
ppy-0.095
y-0.091
ie-0.090
i-0.089
bi-0.088
Token."
Token ID526
Feature activation-3.40e-02

Pos logit contributions
ppy+0.008
op+0.008
anda+0.008
ony+0.008
int+0.008
Neg logit contributions
 electricity-0.008
 yours-0.008
 our-0.007
 such-0.007
 its-0.007
Token
Token ID198
Feature activation-2.43e-02

Pos logit contributions
ppy+0.078
op+0.077
ony+0.076
anda+0.076
oise+0.076
Neg logit contributions
 yours-0.051
 its-0.048
 electricity-0.048
 our-0.048
 such-0.046
Token
Token ID198
Feature activation-2.56e-02

Pos logit contributions
ppy+0.055
op+0.054
int+0.053
oise+0.053
anda+0.053
Neg logit contributions
 yours-0.037
 electricity-0.036
 its-0.034
 our-0.034
 such-0.033
TokenD
Token ID35
Feature activation-9.84e-02

Pos logit contributions
ppy+0.057
int+0.056
anda+0.056
ony+0.055
owed+0.055
Neg logit contributions
 yours-0.039
 electricity-0.039
 our-0.037
 its-0.036
 your-0.036
Tokenennis
Token ID10679
Feature activation-1.62e-01

Pos logit contributions
op+0.113
ppy+0.112
mun+0.110
oise+0.107
ony+0.105
Neg logit contributions
 yours-0.256
 electricity-0.250
 our-0.233
 theirs-0.232
 its-0.231
Token was
Token ID373
Feature activation+1.85e-02

Pos logit contributions
oise+0.284
op+0.284
anda+0.280
unn+0.279
uffy+0.278
Neg logit contributions
 its-0.297
 a-0.283
 such-0.269
 an-0.269
 electricity-0.266
Token disappointed
Token ID11679
Feature activation+1.87e-02

Pos logit contributions
 electricity+0.033
 yours+0.032
 our+0.029
 such+0.028
 theirs+0.028
Neg logit contributions
ppy-0.037
op-0.037
anda-0.036
int-0.036
ony-0.036
Token,
Token ID11
Feature activation-8.61e-03

Pos logit contributions
 electricity+0.036
 yours+0.036
 our+0.033
 such+0.033
 its+0.033
Neg logit contributions
ppy-0.035
op-0.035
anda-0.033
ony-0.033
int-0.033
Token but
Token ID475
Feature activation-2.63e-02

Pos logit contributions
op+0.015
ppy+0.015
anda+0.015
ony+0.015
int+0.014
Neg logit contributions
 electricity-0.017
 yours-0.017
 our-0.016
 such-0.016
 its-0.016
Token then
Token ID788
Feature activation-9.43e-03

Pos logit contributions
op+0.066
ppy+0.064
anda+0.063
ony+0.063
int+0.063
Neg logit contributions
 yours-0.036
 electricity-0.035
 its-0.032
 our-0.032
 such-0.032
Token floor
Token ID4314
Feature activation-4.06e-02

Pos logit contributions
ppy+0.011
op+0.010
anda+0.010
ony+0.010
int+0.010
Neg logit contributions
 yours-0.009
 electricity-0.009
 our-0.009
 such-0.009
 theirs-0.008
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
ppy+0.086
op+0.086
anda+0.084
ony+0.084
int+0.084
Neg logit contributions
 electricity-0.067
 yours-0.066
 such-0.060
 our-0.060
 its-0.059
Token
Token ID198
Feature activation-1.75e-02

Pos logit contributions
ppy+0.047
op+0.046
ony+0.046
int+0.046
anda+0.046
Neg logit contributions
 electricity-0.028
 yours-0.028
 our-0.025
 such-0.025
 its-0.025
Token
Token ID198
Feature activation-2.22e-02

Pos logit contributions
ppy+0.040
int+0.039
op+0.038
anda+0.038
ony+0.038
Neg logit contributions
 electricity-0.026
 yours-0.026
 our-0.024
 its-0.024
 such-0.024
TokenThe
Token ID464
Feature activation-2.72e-02

Pos logit contributions
ppy+0.044
int+0.043
anda+0.043
ony+0.043
op+0.043
Neg logit contributions
 electricity-0.039
 yours-0.039
 our-0.037
 its-0.036
 such-0.035
Token chip
Token ID11594
Feature activation-1.62e-01

Pos logit contributions
op+0.055
ppy+0.055
int+0.053
ony+0.053
owed+0.053
Neg logit contributions
 electricity-0.049
 yours-0.047
 our-0.043
 peace-0.043
 power-0.043
Tokenmun
Token ID6199
Feature activation-7.65e-02

Pos logit contributions
ppy+0.249
owed+0.240
int+0.240
anda+0.232
ony+0.230
Neg logit contributions
 peace-0.327
 good-0.321
 the-0.319
 such-0.311
 love-0.310
Tokenks
Token ID591
Feature activation-4.88e-02

Pos logit contributions
uffy+0.177
ll+0.175
ail+0.168
ops+0.167
op+0.164
Neg logit contributions
 electricity-0.110
 its-0.106
 yours-0.103
 theirs-0.098
 such-0.098
Token were
Token ID547
Feature activation+3.19e-02

Pos logit contributions
op+0.072
ppy+0.071
anda+0.071
ony+0.070
uffy+0.069
Neg logit contributions
 electricity-0.110
 yours-0.106
 its-0.104
 such-0.104
 our-0.102
Token very
Token ID845
Feature activation+3.69e-02

Pos logit contributions
 electricity+0.059
 yours+0.057
 outer+0.050
 our+0.050
 respect+0.049
Neg logit contributions
anda-0.059
ppy-0.058
y-0.058
op-0.058
bug-0.058
Token curious
Token ID11040
Feature activation-1.20e-03

Pos logit contributions
 electricity+0.057
 yours+0.057
 such+0.050
 our+0.049
 its+0.048
Neg logit contributions
op-0.084
ppy-0.083
anda-0.082
ony-0.081
int-0.080
Token!
Token ID0
Feature activation-2.27e-02

Pos logit contributions
 electricity+0.038
 yours+0.038
 theirs+0.035
 relief+0.034
 its+0.034
Neg logit contributions
op-0.053
ppy-0.051
y-0.051
anda-0.049
uffy-0.049
Tokenâ
Token ID22940
Feature activation-9.34e-02

Pos logit contributions
ppy+0.053
op+0.052
anda+0.051
ony+0.051
int+0.051
Neg logit contributions
 yours-0.033
 electricity-0.033
 our-0.029
 such-0.029
 its-0.029
Token€
Token ID26391
Feature activation-8.39e-02

Pos logit contributions
op+0.167
ppy+0.164
anda+0.162
ony+0.158
uffy+0.155
Neg logit contributions
 yours-0.186
 electricity-0.186
 such-0.171
 our-0.168
 your-0.166
Token
Token ID198
Feature activation-1.71e-02

Pos logit contributions
ppy+0.167
ie+0.162
oise+0.161
int+0.161
ony+0.158
Neg logit contributions
 yours-0.143
 electricity-0.141
 our-0.140
 your-0.137
 such-0.136
Token
Token ID198
Feature activation-2.24e-02

Pos logit contributions
op+0.034
ppy+0.034
anda+0.033
ony+0.033
int+0.033
Neg logit contributions
 electricity-0.030
 yours-0.030
 our-0.027
 such-0.027
 its-0.027
TokenBaby
Token ID36534
Feature activation-1.61e-01

Pos logit contributions
ppy+0.043
op+0.042
anda+0.042
ony+0.042
int+0.042
Neg logit contributions
 electricity-0.041
 yours-0.041
 our-0.038
 such-0.038
 its-0.037
Token:
Token ID25
Feature activation+8.51e-03

Pos logit contributions
op+0.254
ppy+0.252
ony+0.247
int+0.242
pp+0.241
Neg logit contributions
 your-0.300
 our-0.296
 its-0.291
 the-0.290
 my-0.290
Token �
Token ID6184
Feature activation-4.27e-02

Pos logit contributions
 electricity+0.012
 yours+0.012
 our+0.011
 such+0.011
 its+0.011
Neg logit contributions
op-0.020
ppy-0.020
anda-0.019
ony-0.019
int-0.019
Token�
Token ID95
Feature activation-5.93e-02

Pos logit contributions
ppy+0.070
op+0.068
anda+0.067
ony+0.067
int+0.066
Neg logit contributions
 electricity-0.091
 yours-0.091
 our-0.084
 such-0.084
 its-0.082
Token€
Token ID26391
Feature activation-8.05e-02

Pos logit contributions
op+0.106
ppy+0.105
int+0.101
anda+0.101
ony+0.100
Neg logit contributions
 yours-0.120
 electricity-0.119
 our-0.110
 its-0.109
 theirs-0.109
Token�
Token ID129
Feature activation-3.45e-02

Pos logit contributions
ppy+0.133
oise+0.131
ie+0.130
ony+0.127
int+0.126
Neg logit contributions
 our-0.154
 electricity-0.151
 your-0.151
 yours-0.151
 such-0.149
Token branch
Token ID8478
Feature activation-2.90e-02

Pos logit contributions
 electricity+0.057
 yours+0.057
 our+0.053
 its+0.052
 such+0.052
Neg logit contributions
ppy-0.053
op-0.052
int-0.051
anda-0.051
ony-0.051
Token.
Token ID13
Feature activation+8.48e-03

Pos logit contributions
ppy+0.063
op+0.063
anda+0.062
int+0.062
ony+0.062
Neg logit contributions
 electricity-0.046
 yours-0.046
 its-0.041
 such-0.041
 our-0.041
Token She
Token ID1375
Feature activation-2.87e-02

Pos logit contributions
 yours+0.014
 electricity+0.014
 awesome+0.012
 our+0.012
 theirs+0.012
Neg logit contributions
op-0.019
ppy-0.018
uffy-0.018
anda-0.017
ony-0.017
Token could
Token ID714
Feature activation+5.52e-02

Pos logit contributions
int+0.045
op+0.045
oise+0.044
ppy+0.043
anda+0.043
Neg logit contributions
 electricity-0.064
 its-0.063
 yours-0.061
 our-0.060
 such-0.060
Token not
Token ID407
Feature activation+3.82e-02

Pos logit contributions
 electricity+0.065
 yours+0.064
 our+0.055
 such+0.055
 its+0.054
Neg logit contributions
op-0.145
ppy-0.144
anda-0.141
ony-0.141
int-0.140
Token move
Token ID1445
Feature activation+3.13e-02

Pos logit contributions
 yours+0.054
 electricity+0.054
 its+0.052
 our+0.050
 such+0.048
Neg logit contributions
anda-0.091
op-0.087
uffy-0.087
ony-0.087
mun-0.087
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
 electricity+0.056
 yours+0.054
 awesome+0.050
 such+0.048
 outer+0.048
Neg logit contributions
ppy-0.062
y-0.061
op-0.060
anda-0.059
ony-0.058
Token She
Token ID1375
Feature activation-2.70e-02

Pos logit contributions
 yours+0.026
 electricity+0.026
 our+0.023
 awesome+0.023
 theirs+0.023
Neg logit contributions
op-0.029
ppy-0.028
uffy-0.027
ony-0.027
anda-0.027
Token could
Token ID714
Feature activation+5.93e-02

Pos logit contributions
int+0.046
oise+0.046
op+0.046
pp+0.044
ppy+0.043
Neg logit contributions
 electricity-0.057
 its-0.056
 yours-0.053
 such-0.053
 our-0.053
Token only
Token ID691
Feature activation+3.98e-02

Pos logit contributions
 electricity+0.069
 yours+0.069
 our+0.060
 such+0.059
 its+0.058
Neg logit contributions
op-0.156
ppy-0.155
anda-0.152
ony-0.151
int-0.151
Token j
Token ID474
Feature activation-1.23e-02

Pos logit contributions
 yours+0.066
 electricity+0.066
 its+0.063
 such+0.060
 our+0.060
Neg logit contributions
op-0.085
anda-0.084
ppy-0.084
ony-0.082
uffy-0.082
Token But
Token ID887
Feature activation+4.92e-03

Pos logit contributions
 electricity+0.005
 yours+0.005
 our+0.005
 such+0.005
 its+0.004
Neg logit contributions
op-0.006
ppy-0.006
anda-0.006
int-0.006
ony-0.006
Token,
Token ID11
Feature activation+1.46e-02

Pos logit contributions
 electricity+0.006
 yours+0.006
 awesome+0.005
 our+0.005
 such+0.005
Neg logit contributions
op-0.013
ppy-0.013
y-0.012
ie-0.012
anda-0.012
Token his
Token ID465
Feature activation-5.07e-03

Pos logit contributions
 electricity+0.021
 yours+0.020
 awesome+0.017
 our+0.017
 outer+0.017
Neg logit contributions
op-0.034
ppy-0.034
ie-0.033
y-0.033
int-0.032
Token mom
Token ID1995
Feature activation-8.28e-02

Pos logit contributions
ppy+0.012
op+0.012
anda+0.011
ony+0.011
int+0.011
Neg logit contributions
 electricity-0.008
 yours-0.007
 our-0.007
 such-0.007
 its-0.007
Token said
Token ID531
Feature activation+3.05e-02

Pos logit contributions
ppy+0.170
oise+0.167
unn+0.164
ony+0.162
adow+0.162
Neg logit contributions
 it-0.121
 fun-0.119
 a-0.118
 such-0.114
 an-0.114
Token that
Token ID326
Feature activation+5.07e-02

Pos logit contributions
 electricity+0.047
 yours+0.047
 outer+0.041
 our+0.041
 peace+0.040
Neg logit contributions
op-0.070
ppy-0.068
ony-0.065
anda-0.065
i-0.064
Token it
Token ID340
Feature activation-4.24e-04

Pos logit contributions
 electricity+0.069
 yours+0.068
 our+0.060
 such+0.059
 its+0.058
Neg logit contributions
op-0.123
ppy-0.122
anda-0.119
ony-0.118
int-0.117
Token was
Token ID373
Feature activation+5.93e-02

Pos logit contributions
int+0.001
ppy+0.001
oise+0.001
ony+0.001
op+0.001
Neg logit contributions
 electricity-0.001
 yours-0.001
 its-0.001
 our-0.001
 such-0.001
Token too
Token ID1165
Feature activation+5.50e-02

Pos logit contributions
 electricity+0.090
 yours+0.083
 outer+0.081
 yourselves+0.081
 respect+0.075
Neg logit contributions
ppy-0.125
i-0.119
y-0.118
ches-0.117
bi-0.116
Token cold
Token ID4692
Feature activation+2.03e-03

Pos logit contributions
 yours+0.068
 electricity+0.067
 such+0.063
 our+0.056
 theirs+0.056
Neg logit contributions
op-0.142
anda-0.136
ony-0.136
ppy-0.135
y-0.135
Token outside
Token ID2354
Feature activation+1.20e-02

Pos logit contributions
 electricity+0.003
 yours+0.003
 our+0.003
 such+0.003
 its+0.003
Neg logit contributions
op-0.004
ppy-0.004
anda-0.004
ony-0.004
int-0.004
Token Max
Token ID5436
Feature activation-9.98e-02

Pos logit contributions
 yours+0.041
 electricity+0.041
 yourselves+0.036
 theirs+0.036
 respect+0.035
Neg logit contributions
op-0.038
s-0.037
ppy-0.037
y-0.035
uffy-0.035
Token saw
Token ID2497
Feature activation+5.21e-02

Pos logit contributions
int+0.225
owed+0.219
ony+0.217
op+0.216
anda+0.214
Neg logit contributions
 a-0.160
 our-0.159
 your-0.157
 its-0.156
 their-0.152
Token a
Token ID257
Feature activation+1.75e-02

Pos logit contributions
 yours+0.069
 electricity+0.069
 satisfaction+0.063
 comparison+0.062
 relief+0.061
Neg logit contributions
o-0.114
ie-0.109
y-0.109
's-0.108
ppy-0.106
Token little
Token ID1310
Feature activation-1.52e-02

Pos logit contributions
 electricity+0.023
 yours+0.023
 our+0.021
 such+0.020
 its+0.020
Neg logit contributions
ppy-0.043
op-0.042
ony-0.041
anda-0.041
int-0.041
Token bird
Token ID6512
Feature activation-6.63e-02

Pos logit contributions
ppy+0.029
int+0.027
owed+0.027
ll+0.027
ony+0.027
Neg logit contributions
 electricity-0.030
 yours-0.029
 our-0.028
 its-0.028
 peace-0.027
Token laying
Token ID16299
Feature activation+3.57e-02

Pos logit contributions
ppy+0.159
int+0.154
anda+0.153
op+0.153
ony+0.152
Neg logit contributions
 its-0.094
 yours-0.092
 your-0.086
 such-0.086
 our-0.086
Token on
Token ID319
Feature activation+5.50e-02

Pos logit contributions
 electricity+0.073
 yours+0.070
 awesome+0.068
 purpose+0.067
 respect+0.065
Neg logit contributions
y-0.061
ppy-0.059
op-0.059
bug-0.058
int-0.057
Token the
Token ID262
Feature activation+1.27e-02

Pos logit contributions
 yours+0.071
 awesome+0.066
 electricity+0.062
 theirs+0.060
 pleasure+0.058
Neg logit contributions
y-0.123
ie-0.119
's-0.118
op-0.117
bug-0.117
Token ground
Token ID2323
Feature activation-2.09e-02

Pos logit contributions
 electricity+0.022
 yours+0.022
 our+0.020
 its+0.020
 such+0.020
Neg logit contributions
ppy-0.026
op-0.026
int-0.026
anda-0.026
ony-0.025
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
ppy+0.046
op+0.045
anda+0.045
ony+0.044
int+0.044
Neg logit contributions
 electricity-0.033
 yours-0.033
 its-0.030
 such-0.030
 our-0.030
Token "
Token ID366
Feature activation+1.14e-02

Pos logit contributions
ppy+0.033
op+0.033
anda+0.032
ony+0.032
int+0.032
Neg logit contributions
 electricity-0.019
 yours-0.019
 our-0.017
 such-0.017
 its-0.016
Token the
Token ID262
Feature activation-2.21e-03

Pos logit contributions
 yours+0.026
 electricity+0.025
 awesome+0.023
 emergencies+0.022
 our+0.022
Neg logit contributions
y-0.061
ppy-0.059
ony-0.059
ie-0.059
i-0.058
Token cake
Token ID12187
Feature activation-6.02e-02

Pos logit contributions
ppy+0.005
op+0.005
ony+0.005
y+0.005
anda+0.005
Neg logit contributions
 yours-0.003
 electricity-0.003
 theirs-0.003
 such-0.003
 our-0.003
Token carefully
Token ID7773
Feature activation+8.49e-03

Pos logit contributions
ppy+0.133
int+0.132
op+0.132
oise+0.131
mun+0.131
Neg logit contributions
 yours-0.094
 electricity-0.091
 its-0.086
 such-0.085
 our-0.085
Token.
Token ID13
Feature activation-6.00e-03

Pos logit contributions
 electricity+0.016
 yours+0.016
 our+0.014
 such+0.014
 awesome+0.014
Neg logit contributions
op-0.016
ppy-0.016
ony-0.016
anda-0.015
y-0.015
Token She
Token ID1375
Feature activation-3.65e-02

Pos logit contributions
ppy+0.012
op+0.012
anda+0.012
ony+0.012
int+0.012
Neg logit contributions
 electricity-0.011
 yours-0.011
 such-0.010
 our-0.010
 its-0.009
Token put
Token ID1234
Feature activation+3.30e-02

Pos logit contributions
uffy+0.071
oise+0.071
op+0.071
anda+0.070
ppy+0.070
Neg logit contributions
 its-0.061
 electricity-0.059
 such-0.056
 magic-0.056
 fun-0.056
Token a
Token ID257
Feature activation+2.84e-02

Pos logit contributions
 electricity+0.038
 yours+0.035
 purpose+0.031
 awesome+0.030
 theirs+0.030
Neg logit contributions
ppy-0.082
op-0.082
ip-0.081
ie-0.081
y-0.081
Token big
Token ID1263
Feature activation+7.76e-03

Pos logit contributions
 yours+0.046
 electricity+0.046
 theirs+0.042
 such+0.041
 our+0.039
Neg logit contributions
op-0.063
ppy-0.060
anda-0.060
ony-0.059
int-0.058
Token,
Token ID11
Feature activation+1.85e-02

Pos logit contributions
 electricity+0.016
 yours+0.016
 theirs+0.015
 such+0.015
 our+0.014
Neg logit contributions
op-0.013
ppy-0.013
anda-0.012
int-0.012
ony-0.012
Token red
Token ID2266
Feature activation-8.58e-03

Pos logit contributions
 yours+0.026
 electricity+0.024
 theirs+0.022
 purpose+0.021
 such+0.021
Neg logit contributions
ppy-0.042
int-0.042
op-0.041
ony-0.041
bug-0.040
Token bow
Token ID9563
Feature activation-3.53e-02

Pos logit contributions
op+0.016
ppy+0.015
y+0.015
anda+0.015
int+0.015
Neg logit contributions
 electricity-0.016
 yours-0.016
 awesome-0.015
 theirs-0.015
 such-0.015
Token,
Token ID11
Feature activation+1.50e-02

Pos logit contributions
ppy+0.107
op+0.105
int+0.103
anda+0.102
ony+0.102
Neg logit contributions
 yours-0.104
 electricity-0.103
 our-0.095
 such-0.093
 its-0.093
Token my
Token ID616
Feature activation-7.27e-03

Pos logit contributions
 electricity+0.018
 yours+0.017
 outer+0.017
 peace+0.016
 such+0.015
Neg logit contributions
ppy-0.036
op-0.036
anda-0.036
ony-0.034
y-0.034
Token loves
Token ID10408
Feature activation+2.81e-02

Pos logit contributions
op+0.017
anda+0.016
ony+0.016
ppy+0.016
int+0.016
Neg logit contributions
 yours-0.011
 electricity-0.011
 theirs-0.010
 such-0.010
 our-0.009
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
 electricity+0.055
 yours+0.054
 such+0.050
 theirs+0.050
 our+0.050
Neg logit contributions
op-0.050
ppy-0.049
anda-0.048
ony-0.048
y-0.048
Token That
Token ID1320
Feature activation+1.23e-02

Pos logit contributions
 yours+0.021
 electricity+0.020
 awesome+0.020
 fun+0.020
 sight+0.019
Neg logit contributions
op-0.032
ppy-0.031
anda-0.030
ony-0.030
bug-0.030
Token is
Token ID318
Feature activation+5.29e-02

Pos logit contributions
 electricity+0.026
 yours+0.026
 our+0.024
 such+0.024
 theirs+0.024
Neg logit contributions
op-0.020
ppy-0.020
anda-0.019
ony-0.019
int-0.019
Token the
Token ID262
Feature activation+1.59e-02

Pos logit contributions
 electricity+0.061
 purpose+0.060
 relief+0.058
 yours+0.056
 satisfaction+0.056
Neg logit contributions
ppy-0.121
bug-0.119
anda-0.117
i-0.115
ony-0.115
Token moon
Token ID8824
Feature activation-7.69e-02

Pos logit contributions
 theirs+0.024
 yours+0.024
 electricity+0.024
 its+0.022
 our+0.022
Neg logit contributions
anda-0.032
ony-0.031
ppy-0.031
op-0.031
y-0.030
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
ppy+0.152
op+0.151
int+0.148
ony+0.145
anda+0.145
Neg logit contributions
 yours-0.140
 electricity-0.139
 our-0.129
 its-0.126
 such-0.126
Token It
Token ID632
Feature activation+2.44e-03

Pos logit contributions
 yours+0.025
 electricity+0.024
 awesome+0.023
 sight+0.023
 theirs+0.023
Neg logit contributions
op-0.037
ppy-0.037
int-0.036
ony-0.035
anda-0.035
Token is
Token ID318
Feature activation+5.96e-02

Pos logit contributions
 electricity+0.004
 yours+0.004
 our+0.004
 its+0.004
 such+0.004
Neg logit contributions
op-0.005
ppy-0.005
int-0.005
oise-0.005
anda-0.005
Token glitter
Token ID31133
Feature activation-2.91e-02

Pos logit contributions
 yours+0.034
 electricity+0.033
 our+0.030
 theirs+0.029
 such+0.029
Neg logit contributions
op-0.049
ony-0.048
anda-0.048
ppy-0.048
y-0.047
Token and
Token ID290
Feature activation+3.77e-02

Pos logit contributions
ppy+0.049
op+0.049
anda+0.048
int+0.048
ony+0.047
Neg logit contributions
 yours-0.060
 electricity-0.060
 such-0.057
 our-0.055
 its-0.055
Token was
Token ID373
Feature activation+5.44e-02

Pos logit contributions
 yours+0.055
 electricity+0.054
 theirs+0.048
 yourselves+0.046
 our+0.046
Neg logit contributions
s-0.078
y-0.077
i-0.077
ony-0.077
ppy-0.077
Token very
Token ID845
Feature activation+5.99e-02

Pos logit contributions
 electricity+0.085
 yours+0.081
 yourselves+0.075
 our+0.075
 relief+0.073
Neg logit contributions
y-0.111
anda-0.110
i-0.110
bug-0.110
ppy-0.108
Token spark
Token ID9009
Feature activation-2.72e-02

Pos logit contributions
 electricity+0.085
 yours+0.080
 such+0.077
 yourselves+0.071
 anything+0.068
Neg logit contributions
anda-0.140
ony-0.132
s-0.132
y-0.129
i-0.128
Tokenly
Token ID306
Feature activation+1.35e-02

Pos logit contributions
op+0.034
ppy+0.033
anda+0.033
ony+0.032
oise+0.032
Neg logit contributions
 yours-0.069
 electricity-0.068
 such-0.065
 our-0.063
 its-0.063
Token.
Token ID13
Feature activation-9.21e-03

Pos logit contributions
 electricity+0.026
 yours+0.026
 our+0.024
 theirs+0.023
 its+0.023
Neg logit contributions
op-0.024
ppy-0.024
y-0.024
anda-0.024
ony-0.023
Token He
Token ID679
Feature activation-2.79e-02

Pos logit contributions
ppy+0.022
oise+0.022
anda+0.021
ony+0.021
op+0.021
Neg logit contributions
 electricity-0.013
 yours-0.013
 its-0.012
 our-0.012
 such-0.012
Token was
Token ID373
Feature activation+4.20e-02

Pos logit contributions
op+0.053
uffy+0.051
ppy+0.051
int+0.050
erry+0.050
Neg logit contributions
 its-0.052
 electricity-0.050
 yours-0.049
 our-0.049
 such-0.049
Token so
Token ID523
Feature activation+5.03e-02

Pos logit contributions
 electricity+0.093
 yours+0.085
 purpose+0.080
 yourselves+0.077
 our+0.077
Neg logit contributions
bug-0.068
anda-0.065
ppy-0.063
ony-0.062
op-0.062
Token excited
Token ID6568
Feature activation+4.04e-03

Pos logit contributions
 electricity+0.102
 yours+0.098
 such+0.086
 theirs+0.086
 our+0.085
Neg logit contributions
op-0.091
anda-0.089
ony-0.087
int-0.084
y-0.084
Token From
Token ID3574
Feature activation+1.73e-03

Pos logit contributions
ppy+0.032
op+0.031
int+0.031
ony+0.031
anda+0.031
Neg logit contributions
 yours-0.021
 electricity-0.020
 its-0.019
 our-0.019
 such-0.019
Token that
Token ID326
Feature activation-5.37e-03

Pos logit contributions
 electricity+0.003
 yours+0.003
 theirs+0.002
 such+0.002
 anything+0.002
Neg logit contributions
anda-0.004
i-0.004
ie-0.004
ppy-0.004
ony-0.004
Token day
Token ID1110
Feature activation-9.51e-03

Pos logit contributions
ppy+0.012
op+0.012
anda+0.012
int+0.012
ony+0.012
Neg logit contributions
 yours-0.008
 electricity-0.008
 our-0.008
 its-0.008
 such-0.008
Token on
Token ID319
Feature activation-6.52e-03

Pos logit contributions
ppy+0.021
op+0.021
anda+0.020
ony+0.020
int+0.020
Neg logit contributions
 yours-0.015
 electricity-0.015
 our-0.014
 such-0.014
 its-0.014
Token,
Token ID11
Feature activation+1.95e-02

Pos logit contributions
ppy+0.011
op+0.011
int+0.011
ony+0.011
anda+0.011
Neg logit contributions
 electricity-0.014
 yours-0.013
 our-0.013
 its-0.013
 such-0.013
Token the
Token ID262
Feature activation-1.76e-02

Pos logit contributions
 yours+0.024
 electricity+0.023
 our+0.021
 such+0.021
 its+0.021
Neg logit contributions
ppy-0.051
op-0.050
int-0.050
ony-0.050
anda-0.049
Token bugs
Token ID11316
Feature activation-1.10e-01

Pos logit contributions
ppy+0.041
op+0.041
anda+0.040
ony+0.040
int+0.040
Neg logit contributions
 electricity-0.025
 yours-0.025
 our-0.022
 such-0.022
 its-0.022
Token rocked
Token ID36872
Feature activation+1.93e-02

Pos logit contributions
int+0.202
anda+0.196
ony+0.195
ppy+0.195
op+0.193
Neg logit contributions
 its-0.205
 yours-0.201
 such-0.199
 electricity-0.199
 our-0.188
Token the
Token ID262
Feature activation-1.02e-02

Pos logit contributions
 yours+0.035
 electricity+0.035
 our+0.031
 such+0.030
 awesome+0.030
Neg logit contributions
ppy-0.038
op-0.037
y-0.036
anda-0.036
ony-0.036
Token blue
Token ID4171
Feature activation-3.69e-02

Pos logit contributions
op+0.020
ppy+0.020
anda+0.019
ony+0.019
int+0.019
Neg logit contributions
 electricity-0.019
 yours-0.019
 our-0.017
 such-0.017
 its-0.017
Token rock
Token ID3881
Feature activation-6.05e-02

Pos logit contributions
op+0.068
ppy+0.068
anda+0.066
ony+0.065
int+0.065
Neg logit contributions
 electricity-0.072
 yours-0.071
 our-0.065
 such-0.065
 its-0.064
Token her
Token ID607
Feature activation+6.20e-03

Pos logit contributions
 electricity+0.057
 yours+0.056
 our+0.047
 theirs+0.045
 awesome+0.044
Neg logit contributions
ppy-0.130
op-0.129
ie-0.127
ony-0.125
anda-0.125
Token happy
Token ID3772
Feature activation+1.99e-02

Pos logit contributions
 yours+0.010
 electricity+0.010
 such+0.010
 our+0.010
 its+0.010
Neg logit contributions
ppy-0.013
op-0.013
int-0.013
ony-0.013
oise-0.013
Token.
Token ID13
Feature activation+2.55e-03

Pos logit contributions
 electricity+0.037
 yours+0.037
 our+0.033
 such+0.032
 theirs+0.032
Neg logit contributions
op-0.038
ppy-0.037
anda-0.037
y-0.036
ony-0.036
Token She
Token ID1375
Feature activation-2.74e-02

Pos logit contributions
 yours+0.004
 electricity+0.004
 such+0.004
 our+0.004
 its+0.004
Neg logit contributions
ppy-0.005
op-0.005
anda-0.005
int-0.005
ony-0.005
Token knew
Token ID2993
Feature activation+4.55e-02

Pos logit contributions
op+0.059
uffy+0.055
int+0.054
oise+0.053
ony+0.053
Neg logit contributions
 its-0.046
 fun-0.045
 such-0.044
 our-0.044
 yours-0.043
Token she
Token ID673
Feature activation-2.62e-02

Pos logit contributions
 electricity+0.065
 yours+0.061
 our+0.053
 awesome+0.053
 theirs+0.052
Neg logit contributions
op-0.105
ppy-0.103
anda-0.103
ony-0.102
i-0.102
Token would
Token ID561
Feature activation+4.89e-02

Pos logit contributions
op+0.044
int+0.043
ony+0.042
ppy+0.042
anda+0.042
Neg logit contributions
 yours-0.054
 electricity-0.053
 such-0.052
 its-0.051
 power-0.049
Token build
Token ID1382
Feature activation+5.34e-02

Pos logit contributions
 electricity+0.091
 yours+0.087
 theirs+0.079
 our+0.078
 awesome+0.077
Neg logit contributions
ppy-0.094
op-0.091
ony-0.089
ll-0.089
anda-0.088
Token another
Token ID1194
Feature activation+1.59e-02

Pos logit contributions
 yours+0.053
 electricity+0.051
 theirs+0.041
 respect+0.038
 our+0.038
Neg logit contributions
ie-0.136
ig-0.134
anda-0.133
ony-0.132
ppy-0.132
Token ig
Token ID45329
Feature activation-5.98e-02

Pos logit contributions
 electricity+0.029
 yours+0.029
 our+0.026
 such+0.026
 its+0.026
Neg logit contributions
op-0.031
ppy-0.031
anda-0.030
ony-0.030
int-0.030
Tokenloo
Token ID29680
Feature activation-3.32e-02

Pos logit contributions
ppy+0.126
ony+0.118
op+0.117
oise+0.114
int+0.114
Neg logit contributions
 electricity-0.104
 yours-0.101
 such-0.093
 our-0.092
 awesome-0.091
TokenDan
Token ID21174
Feature activation-1.47e-01

Pos logit contributions
ppy+0.053
op+0.053
anda+0.052
ony+0.052
int+0.052
Neg logit contributions
 yours-0.047
 electricity-0.047
 our-0.044
 such-0.043
 its-0.043
Token wanted
Token ID2227
Feature activation+4.35e-02

Pos logit contributions
op+0.273
uffy+0.249
int+0.243
ppy+0.241
erry+0.241
Neg logit contributions
 its-0.263
 such-0.257
 our-0.254
 your-0.253
 the-0.252
Token to
Token ID284
Feature activation+3.87e-02

Pos logit contributions
 yours+0.065
 electricity+0.064
 outer+0.055
 awesome+0.055
 theirs+0.055
Neg logit contributions
op-0.095
ony-0.093
ppy-0.092
anda-0.091
i-0.090
Token find
Token ID1064
Feature activation+5.42e-02

Pos logit contributions
 electricity+0.062
 yours+0.061
 our+0.055
 such+0.054
 its+0.054
Neg logit contributions
op-0.085
ppy-0.085
anda-0.082
ony-0.082
int-0.082
Token something
Token ID1223
Feature activation+2.32e-03

Pos logit contributions
 electricity+0.060
 yours+0.056
 comparison+0.052
 yourselves+0.049
 purpose+0.048
Neg logit contributions
ony-0.132
ie-0.132
ish-0.130
ppy-0.130
int-0.128
Token special
Token ID2041
Feature activation-1.07e-02

Pos logit contributions
 electricity+0.003
 yours+0.003
 our+0.003
 such+0.003
 yourselves+0.003
Neg logit contributions
op-0.005
anda-0.005
ppy-0.005
ony-0.005
y-0.005
Token in
Token ID287
Feature activation+2.86e-02

Pos logit contributions
ppy+0.020
op+0.020
anda+0.019
ony+0.019
int+0.019
Neg logit contributions
 electricity-0.020
 yours-0.020
 our-0.018
 such-0.018
 its-0.018
Token the
Token ID262
Feature activation-4.56e-03

Pos logit contributions
 electricity+0.037
 yours+0.031
 awesome+0.030
 consequences+0.027
 respect+0.027
Neg logit contributions
y-0.064
ie-0.064
bug-0.062
bi-0.062
oise-0.062
Token woods
Token ID16479
Feature activation-3.52e-02

Pos logit contributions
op+0.008
anda+0.007
ony+0.007
ppy+0.007
oise+0.007
Neg logit contributions
 electricity-0.009
 yours-0.009
 theirs-0.009
 such-0.009
 our-0.008
Token.
Token ID13
Feature activation-1.48e-02

Pos logit contributions
ppy+0.074
op+0.074
int+0.072
anda+0.072
ony+0.071
Neg logit contributions
 yours-0.059
 electricity-0.059
 such-0.054
 our-0.054
 its-0.053
Token He
Token ID679
Feature activation-5.02e-02

Pos logit contributions
ppy+0.036
oise+0.035
anda+0.035
op+0.035
ie+0.034
Neg logit contributions
 electricity-0.020
 yours-0.020
 its-0.019
 such-0.019
 our-0.018
Token with
Token ID351
Feature activation+5.33e-02

Pos logit contributions
 electricity+0.030
 yours+0.030
 such+0.027
 our+0.027
 its+0.027
Neg logit contributions
ppy-0.039
op-0.039
anda-0.038
int-0.038
ony-0.037
Token excitement
Token ID14067
Feature activation-2.46e-04

Pos logit contributions
 yours+0.077
 electricity+0.063
 yourselves+0.059
 our+0.058
 your+0.057
Neg logit contributions
y-0.119
op-0.114
bug-0.113
mun-0.113
ony-0.113
Token.
Token ID13
Feature activation-5.55e-03

Pos logit contributions
op+0.000
ppy+0.000
anda+0.000
ony+0.000
int+0.000
Neg logit contributions
 electricity-0.001
 yours-0.001
 our-0.000
 such-0.000
 its-0.000
Token
Token ID198
Feature activation-4.73e-03

Pos logit contributions
ppy+0.013
op+0.012
anda+0.012
int+0.012
ony+0.012
Neg logit contributions
 electricity-0.008
 yours-0.008
 its-0.008
 our-0.008
 such-0.008
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
ppy+0.011
int+0.010
anda+0.010
op+0.010
ony+0.010
Neg logit contributions
 electricity-0.007
 yours-0.007
 our-0.007
 its-0.007
 such-0.006
TokenThe
Token ID464
Feature activation-2.72e-02

Pos logit contributions
ppy+0.027
anda+0.027
int+0.026
op+0.026
ony+0.026
Neg logit contributions
 electricity-0.018
 yours-0.017
 our-0.016
 its-0.016
 such-0.016
Token elephant
Token ID20950
Feature activation-1.18e-01

Pos logit contributions
anda+0.045
op+0.044
ppy+0.044
ony+0.043
oise+0.043
Neg logit contributions
 yours-0.058
 electricity-0.057
 such-0.053
 our-0.052
 its-0.052
Token listened
Token ID16399
Feature activation+1.75e-02

Pos logit contributions
uffy+0.220
ppy+0.217
op+0.212
pp+0.207
erry+0.204
Neg logit contributions
 its-0.217
 electricity-0.203
 a-0.196
 fun-0.195
 peace-0.195
Token carefully
Token ID7773
Feature activation+1.07e-02

Pos logit contributions
 yours+0.034
 electricity+0.033
 such+0.030
 our+0.029
 yourselves+0.029
Neg logit contributions
op-0.033
ppy-0.033
y-0.032
s-0.032
ony-0.032
Token.
Token ID13
Feature activation+5.43e-03

Pos logit contributions
 yours+0.026
 electricity+0.026
 our+0.024
 awesome+0.024
 such+0.023
Neg logit contributions
op-0.015
ppy-0.014
ony-0.014
y-0.013
int-0.013
Token His
Token ID2399
Feature activation-8.27e-03

Pos logit contributions
 yours+0.009
 electricity+0.009
 our+0.008
 such+0.008
 awesome+0.008
Neg logit contributions
op-0.011
ppy-0.011
int-0.011
anda-0.011
ony-0.011
Token around
Token ID1088
Feature activation+1.54e-02

Pos logit contributions
op+0.004
ppy+0.004
anda+0.004
ony+0.004
ll+0.004
Neg logit contributions
 yours-0.003
 electricity-0.003
 our-0.003
 theirs-0.003
 such-0.003
Token us
Token ID514
Feature activation-3.14e-02

Pos logit contributions
 electricity+0.018
 yours+0.018
 awesome+0.016
 theirs+0.016
 such+0.015
Neg logit contributions
ppy-0.039
op-0.038
y-0.038
ie-0.037
ony-0.037
Token.
Token ID13
Feature activation-3.26e-03

Pos logit contributions
op+0.064
ppy+0.064
anda+0.062
int+0.062
ony+0.062
Neg logit contributions
 electricity-0.054
 yours-0.054
 our-0.049
 such-0.049
 its-0.048
Token<|endoftext|>
Token ID50256
Feature activation-5.29e-02

Pos logit contributions
ppy+0.007
op+0.007
anda+0.007
int+0.007
ony+0.007
Neg logit contributions
 electricity-0.005
 yours-0.005
 such-0.005
 our-0.005
 its-0.005
TokenOnce
Token ID7454
Feature activation-2.75e-02

Pos logit contributions
ppy+0.116
anda+0.114
int+0.114
ony+0.114
op+0.113
Neg logit contributions
 electricity-0.081
 yours-0.080
 such-0.080
 its-0.077
 peace-0.076
Token there
Token ID612
Feature activation-4.80e-02

Pos logit contributions
int+0.045
ppy+0.045
op+0.044
ony+0.043
oise+0.043
Neg logit contributions
 yours-0.059
 our-0.056
 electricity-0.055
 its-0.055
 your-0.054
Token was
Token ID373
Feature activation+3.27e-02

Pos logit contributions
ppy+0.074
int+0.071
op+0.070
ony+0.070
oise+0.068
Neg logit contributions
 yours-0.109
 electricity-0.108
 such-0.105
 its-0.105
 our-0.103
Token a
Token ID257
Feature activation+1.07e-03

Pos logit contributions
 electricity+0.026
 yours+0.025
 our+0.019
 such+0.018
 theirs+0.017
Neg logit contributions
op-0.096
ppy-0.095
int-0.093
ony-0.092
y-0.092
Token little
Token ID1310
Feature activation-2.78e-02

Pos logit contributions
 fun+0.001
 our+0.001
 electricity+0.001
 yours+0.001
 magic+0.001
Neg logit contributions
owed-0.003
ony-0.003
ppy-0.003
int-0.002
ops-0.002
Token girl
Token ID2576
Feature activation-9.34e-02

Pos logit contributions
owed+0.050
ppy+0.048
int+0.048
ony+0.048
ll+0.046
Neg logit contributions
 electricity-0.055
 yours-0.055
 our-0.054
 your-0.052
 its-0.052
Token named
Token ID3706
Feature activation+7.81e-04

Pos logit contributions
int+0.152
ppy+0.152
op+0.149
anda+0.147
ony+0.147
Neg logit contributions
 electricity-0.193
 yours-0.193
 its-0.189
 our-0.185
 your-0.182
Token singing
Token ID13777
Feature activation+3.99e-02

Pos logit contributions
 yours+0.047
 electricity+0.046
 theirs+0.040
 our+0.039
 such+0.038
Neg logit contributions
ppy-0.077
op-0.076
ony-0.076
anda-0.076
mun-0.075
Token birds
Token ID10087
Feature activation-5.81e-02

Pos logit contributions
 yours+0.076
 electricity+0.075
 yourselves+0.065
 theirs+0.065
 our+0.065
Neg logit contributions
y-0.073
ppy-0.072
owed-0.071
op-0.070
s-0.069
Token.
Token ID13
Feature activation-1.21e-03

Pos logit contributions
op+0.130
ppy+0.130
int+0.129
anda+0.127
ony+0.126
Neg logit contributions
 yours-0.087
 electricity-0.085
 its-0.083
 such-0.081
 our-0.080
Token The
Token ID383
Feature activation-2.21e-02

Pos logit contributions
ppy+0.003
op+0.003
anda+0.003
int+0.003
ony+0.003
Neg logit contributions
 electricity-0.002
 yours-0.002
 its-0.002
 our-0.002
 such-0.002
Token happy
Token ID3772
Feature activation-9.19e-03

Pos logit contributions
ppy+0.038
op+0.037
ony+0.036
anda+0.036
int+0.035
Neg logit contributions
 electricity-0.047
 yours-0.046
 our-0.043
 such-0.043
 its-0.043
Token frog
Token ID21264
Feature activation-1.07e-01

Pos logit contributions
ppy+0.019
op+0.019
ony+0.018
int+0.018
anda+0.018
Neg logit contributions
 electricity-0.016
 yours-0.016
 its-0.015
 our-0.015
 such-0.015
Token and
Token ID290
Feature activation+1.57e-02

Pos logit contributions
op+0.196
uffy+0.191
ppy+0.185
int+0.183
anda+0.181
Neg logit contributions
 its-0.209
 such-0.198
 our-0.192
 peace-0.190
 electricity-0.189
Token bird
Token ID6512
Feature activation-8.44e-02

Pos logit contributions
 electricity+0.012
 yours+0.011
 our+0.009
 such+0.009
 its+0.009
Neg logit contributions
op-0.048
ppy-0.048
anda-0.047
ony-0.047
int-0.047
Token decided
Token ID3066
Feature activation+3.16e-02

Pos logit contributions
op+0.137
int+0.131
ppy+0.126
uffy+0.126
ony+0.124
Neg logit contributions
 its-0.182
 electricity-0.174
 yours-0.173
 our-0.172
 such-0.172
Token that
Token ID326
Feature activation+4.59e-02

Pos logit contributions
 electricity+0.057
 yours+0.052
 yourselves+0.047
 awesome+0.047
 emergencies+0.047
Neg logit contributions
s-0.060
ll-0.057
's-0.057
i-0.056
y-0.055
Token this
Token ID428
Feature activation-7.41e-04

Pos logit contributions
 electricity+0.063
 yours+0.060
 awesome+0.052
 theirs+0.051
 yourselves+0.049
Neg logit contributions
op-0.105
ppy-0.105
ie-0.104
y-0.104
i-0.103
Token was
Token ID373
Feature activation+2.33e-02

Pos logit contributions
op+0.215
int+0.207
ppy+0.206
ony+0.197
oise+0.196
Neg logit contributions
 its-0.234
 our-0.227
 electricity-0.226
 such-0.223
 your-0.221
Token getting
Token ID1972
Feature activation+5.52e-02

Pos logit contributions
 electricity+0.041
 yours+0.039
 our+0.035
 theirs+0.034
 such+0.033
Neg logit contributions
ppy-0.048
anda-0.047
op-0.047
ony-0.046
bug-0.045
Token so
Token ID523
Feature activation+4.04e-02

Pos logit contributions
 yours+0.106
 electricity+0.105
 purpose+0.092
 our+0.092
 yourselves+0.090
Neg logit contributions
ppy-0.100
op-0.097
anda-0.096
ony-0.096
y-0.095
Token fast
Token ID3049
Feature activation-1.84e-02

Pos logit contributions
 yours+0.071
 electricity+0.070
 such+0.063
 our+0.062
 its+0.060
Neg logit contributions
op-0.083
ppy-0.081
anda-0.081
ony-0.080
int-0.079
Token that
Token ID326
Feature activation+2.25e-02

Pos logit contributions
op+0.040
ppy+0.039
y+0.038
anda+0.038
ony+0.038
Neg logit contributions
 yours-0.029
 electricity-0.029
 our-0.026
 theirs-0.025
 such-0.025
Token he
Token ID339
Feature activation-4.71e-02

Pos logit contributions
 electricity+0.036
 yours+0.036
 our+0.032
 theirs+0.032
 awesome+0.031
Neg logit contributions
op-0.047
ppy-0.046
y-0.045
anda-0.045
ie-0.044
Token started
Token ID2067
Feature activation+3.13e-02

Pos logit contributions
op+0.076
int+0.075
ony+0.073
ppy+0.073
uffy+0.072
Neg logit contributions
 electricity-0.100
 yours-0.097
 its-0.096
 such-0.094
 our-0.093
Token to
Token ID284
Feature activation+3.75e-02

Pos logit contributions
 yours+0.049
 electricity+0.048
 our+0.042
 awesome+0.042
 theirs+0.042
Neg logit contributions
op-0.070
y-0.066
ppy-0.066
anda-0.065
ony-0.065
Token blur
Token ID23671
Feature activation-1.22e-02

Pos logit contributions
 yours+0.077
 electricity+0.076
 our+0.070
 such+0.069
 theirs+0.068
Neg logit contributions
ppy-0.065
op-0.064
anda-0.062
ony-0.062
int-0.061
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
op+0.025
ppy+0.025
anda+0.024
ony+0.024
int+0.024
Neg logit contributions
 electricity-0.021
 yours-0.021
 our-0.019
 such-0.019
 its-0.019
Token He
Token ID679
Feature activation-4.09e-02

Pos logit contributions
op+0.029
ppy+0.029
anda+0.029
ony+0.028
int+0.028
Neg logit contributions
 electricity-0.021
 yours-0.021
 our-0.019
 such-0.019
 its-0.018
Token world
Token ID995
Feature activation-1.88e-02

Pos logit contributions
 yours+0.001
 electricity+0.001
 theirs+0.001
 its+0.001
 such+0.001
Neg logit contributions
anda-0.001
ppy-0.001
op-0.001
ony-0.001
oise-0.001
Token and
Token ID290
Feature activation+3.80e-02

Pos logit contributions
op+0.039
ppy+0.039
int+0.039
anda+0.038
ony+0.038
Neg logit contributions
 yours-0.032
 electricity-0.031
 such-0.029
 our-0.029
 its-0.028
Token express
Token ID4911
Feature activation+4.76e-02

Pos logit contributions
 electricity+0.064
 yours+0.061
 theirs+0.053
 awesome+0.052
 our+0.052
Neg logit contributions
ppy-0.075
op-0.074
ll-0.074
ony-0.073
anda-0.073
Token himself
Token ID2241
Feature activation-6.97e-03

Pos logit contributions
 electricity+0.060
 yours+0.051
 awesome+0.049
 trouble+0.046
 ours+0.044
Neg logit contributions
ie-0.117
i-0.116
y-0.115
ppy-0.113
ony-0.110
Token.
Token ID13
Feature activation-7.17e-03

Pos logit contributions
op+0.015
ppy+0.015
anda+0.015
int+0.014
ony+0.014
Neg logit contributions
 electricity-0.011
 yours-0.011
 our-0.010
 such-0.010
 its-0.010
Token<|endoftext|>
Token ID50256
Feature activation-5.33e-02

Pos logit contributions
ppy+0.016
op+0.016
int+0.016
anda+0.016
ony+0.016
Neg logit contributions
 yours-0.011
 electricity-0.011
 its-0.010
 such-0.010
 our-0.010
TokenOnce
Token ID7454
Feature activation-3.27e-02

Pos logit contributions
ppy+0.115
anda+0.112
op+0.112
int+0.111
ony+0.111
Neg logit contributions
 electricity-0.085
 yours-0.085
 such-0.083
 its-0.081
 peace-0.079
Token upon
Token ID2402
Feature activation-6.15e-03

Pos logit contributions
int+0.056
ppy+0.056
op+0.055
ony+0.054
oise+0.053
Neg logit contributions
 yours-0.068
 its-0.065
 our-0.064
 your-0.063
 electricity-0.062
Token a
Token ID257
Feature activation-2.82e-02

Pos logit contributions
ppy+0.017
op+0.017
int+0.016
ony+0.016
y+0.016
Neg logit contributions
 electricity-0.006
 yours-0.006
 our-0.005
 theirs-0.004
 purpose-0.004
Token time
Token ID640
Feature activation-1.58e-02

Pos logit contributions
op+0.085
pp+0.085
ony+0.085
oise+0.085
anda+0.085
Neg logit contributions
 fun-0.022
 awesome-0.017
 yours-0.017
 our-0.016
 peace-0.015
Token,
Token ID11
Feature activation-1.40e-02

Pos logit contributions
int+0.034
ppy+0.033
oise+0.033
ony+0.032
op+0.031
Neg logit contributions
 yours-0.027
 our-0.026
 your-0.026
 its-0.026
 electricity-0.025
Token the
Token ID262
Feature activation-4.45e-03

Pos logit contributions
 electricity+0.044
 yours+0.042
 theirs+0.038
 outer+0.038
 awesome+0.038
Neg logit contributions
ppy-0.020
op-0.020
ie-0.020
y-0.018
int-0.018
Token olive
Token ID19450
Feature activation-8.62e-02

Pos logit contributions
ppy+0.010
op+0.010
anda+0.010
ony+0.010
int+0.010
Neg logit contributions
 electricity-0.007
 yours-0.007
 our-0.006
 such-0.006
 its-0.006
Token was
Token ID373
Feature activation+5.35e-02

Pos logit contributions
ppy+0.163
ony+0.160
oise+0.157
pp+0.156
int+0.154
Neg logit contributions
 electricity-0.167
 fun-0.156
 magic-0.156
 power-0.155
 its-0.154
Token ready
Token ID3492
Feature activation+2.61e-02

Pos logit contributions
 electricity+0.110
 yours+0.101
 respect+0.094
 yourselves+0.093
 purpose+0.093
Neg logit contributions
int-0.091
ppy-0.091
bi-0.085
bug-0.084
op-0.084
Token and
Token ID290
Feature activation+3.40e-02

Pos logit contributions
 electricity+0.050
 yours+0.048
 our+0.044
 awesome+0.043
 outer+0.043
Neg logit contributions
op-0.048
ppy-0.048
y-0.047
ie-0.045
owed-0.045
Token Jack
Token ID3619
Feature activation-9.11e-02

Pos logit contributions
 electricity+0.056
 yours+0.054
 our+0.047
 such+0.047
 theirs+0.047
Neg logit contributions
ppy-0.072
op-0.071
int-0.069
anda-0.068
ony-0.068
Token took
Token ID1718
Feature activation+5.08e-02

Pos logit contributions
mun+0.168
oise+0.168
uffy+0.168
op+0.166
pp+0.163
Neg logit contributions
 its-0.166
 such-0.159
 a-0.153
 our-0.152
 electricity-0.150
Token his
Token ID465
Feature activation+2.71e-03

Pos logit contributions
 electricity+0.065
 yours+0.064
 theirs+0.062
 diversity+0.059
 yourselves+0.057
Neg logit contributions
int-0.117
op-0.112
ie-0.112
ig-0.112
ppy-0.111
Token first
Token ID717
Feature activation+8.99e-03

Pos logit contributions
 electricity+0.004
 yours+0.004
 our+0.004
 such+0.004
 its+0.004
Neg logit contributions
ppy-0.006
op-0.006
anda-0.006
int-0.006
ony-0.006
Token bite
Token ID13197
Feature activation+1.31e-02

Pos logit contributions
 yours+0.015
 electricity+0.015
 our+0.014
 its+0.014
 such+0.013
Neg logit contributions
ppy-0.019
op-0.018
anda-0.018
int-0.018
ony-0.018
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
 electricity+0.023
 yours+0.023
 such+0.021
 our+0.021
 its+0.020
Neg logit contributions
ppy-0.027
op-0.027
anda-0.026
ony-0.026
int-0.026
Token were
Token ID547
Feature activation+4.43e-02

Pos logit contributions
op+0.045
anda+0.044
int+0.044
oise+0.043
uffy+0.043
Neg logit contributions
 electricity-0.048
 its-0.046
 such-0.045
 yours-0.044
 respect-0.044
Token gone
Token ID3750
Feature activation+2.79e-02

Pos logit contributions
 electricity+0.087
 yours+0.084
 purpose+0.082
 outer+0.082
 such+0.077
Neg logit contributions
ppy-0.076
anda-0.074
bug-0.072
bi-0.068
y-0.068
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
 electricity+0.056
 yours+0.055
 such+0.049
 awesome+0.049
 our+0.048
Neg logit contributions
ppy-0.049
y-0.048
op-0.048
anda-0.047
ll-0.046
Token<|endoftext|>
Token ID50256
Feature activation-5.03e-02

Pos logit contributions
 yours+0.039
 electricity+0.036
 awesome+0.034
 outer+0.033
 such+0.033
Neg logit contributions
op-0.026
uffy-0.024
s-0.024
ppy-0.023
anda-0.023
TokenL
Token ID43
Feature activation-1.00e-01

Pos logit contributions
ppy+0.103
anda+0.100
ony+0.100
op+0.099
int+0.099
Neg logit contributions
 electricity-0.087
 yours-0.086
 such-0.083
 our-0.081
 its-0.081
Tokenily
Token ID813
Feature activation-1.25e-01

Pos logit contributions
op+0.202
ppy+0.201
int+0.200
anda+0.197
oise+0.196
Neg logit contributions
 yours-0.182
 electricity-0.174
 our-0.169
 its-0.164
 theirs-0.162
Token and
Token ID290
Feature activation-2.86e-02

Pos logit contributions
op+0.230
int+0.222
ppy+0.220
ony+0.218
anda+0.217
Neg logit contributions
 electricity-0.238
 its-0.234
 yours-0.231
 our-0.223
 such-0.222
Token Ben
Token ID3932
Feature activation-1.13e-01

Pos logit contributions
ppy+0.076
op+0.076
anda+0.074
ony+0.074
int+0.074
Neg logit contributions
 electricity-0.032
 yours-0.032
 our-0.027
 such-0.027
 its-0.026
Token were
Token ID547
Feature activation+1.06e-03

Pos logit contributions
op+0.191
anda+0.179
ppy+0.178
int+0.177
ony+0.177
Neg logit contributions
 electricity-0.244
 yours-0.242
 our-0.230
 its-0.230
 such-0.227
Token best
Token ID1266
Feature activation+2.08e-02

Pos logit contributions
 yours+0.002
 electricity+0.002
 our+0.001
 such+0.001
 its+0.001
Neg logit contributions
ppy-0.002
op-0.002
ony-0.002
anda-0.002
int-0.002
Token friends
Token ID2460
Feature activation-3.75e-02

Pos logit contributions
 electricity+0.039
 yours+0.039
 such+0.035
 our+0.035
 its+0.035
Neg logit contributions
op-0.041
ppy-0.039
anda-0.039
ony-0.038
int-0.038
Token fish
Token ID5916
Feature activation-9.66e-02

Pos logit contributions
op+0.018
ppy+0.018
anda+0.017
y+0.017
int+0.017
Neg logit contributions
 yours-0.020
 electricity-0.020
 such-0.018
 theirs-0.018
 our-0.018
Token!
Token ID0
Feature activation-1.95e-02

Pos logit contributions
int+0.218
ppy+0.214
ony+0.211
anda+0.210
owed+0.208
Neg logit contributions
 yours-0.147
 electricity-0.142
 such-0.135
 its-0.135
 our-0.135
Token
Token ID198
Feature activation-1.09e-02

Pos logit contributions
ppy+0.046
op+0.045
anda+0.045
ony+0.045
oise+0.045
Neg logit contributions
 electricity-0.027
 yours-0.027
 its-0.025
 such-0.025
 our-0.024
Token
Token ID198
Feature activation-1.83e-02

Pos logit contributions
ppy+0.024
anda+0.023
op+0.023
int+0.023
ony+0.023
Neg logit contributions
 electricity-0.017
 yours-0.017
 its-0.015
 our-0.015
 such-0.015
TokenThe
Token ID464
Feature activation-2.93e-02

Pos logit contributions
ppy+0.039
anda+0.039
owed+0.038
ony+0.038
int+0.038
Neg logit contributions
 electricity-0.029
 yours-0.028
 our-0.027
 its-0.026
 such-0.026
Token fish
Token ID5916
Feature activation-1.34e-01

Pos logit contributions
ppy+0.060
op+0.059
anda+0.058
ony+0.057
int+0.057
Neg logit contributions
 electricity-0.051
 yours-0.051
 such-0.046
 our-0.046
 its-0.045
Token pushed
Token ID7121
Feature activation+1.80e-02

Pos logit contributions
uffy+0.250
ppy+0.247
op+0.235
pp+0.233
ony+0.232
Neg logit contributions
 its-0.256
 electricity-0.234
 our-0.228
 their-0.226
 peace-0.225
Token him
Token ID683
Feature activation-5.33e-02

Pos logit contributions
 yours+0.018
 electricity+0.018
 theirs+0.014
 such+0.014
 our+0.014
Neg logit contributions
ppy-0.050
op-0.049
ony-0.048
y-0.048
int-0.048
Token to
Token ID284
Feature activation+4.59e-02

Pos logit contributions
ppy+0.117
int+0.116
anda+0.115
op+0.115
oise+0.114
Neg logit contributions
 electricity-0.084
 yours-0.084
 its-0.078
 our-0.077
 such-0.076
Token the
Token ID262
Feature activation-4.20e-03

Pos logit contributions
 yours+0.060
 electricity+0.052
 awesome+0.048
 diversity+0.044
 yourselves+0.043
Neg logit contributions
ppy-0.101
bug-0.096
i-0.095
ie-0.094
ony-0.094
Token side
Token ID1735
Feature activation-3.29e-02

Pos logit contributions
anda+0.007
ppy+0.007
op+0.007
ony+0.007
oise+0.007
Neg logit contributions
 yours-0.008
 electricity-0.008
 theirs-0.007
 such-0.007
 its-0.007
Token did
Token ID750
Feature activation+5.26e-02

Pos logit contributions
ppy+0.016
op+0.016
int+0.015
ony+0.015
anda+0.015
Neg logit contributions
 electricity-0.014
 yours-0.014
 our-0.013
 its-0.013
 such-0.013
Token a
Token ID257
Feature activation+3.55e-02

Pos logit contributions
 electricity+0.075
 yours+0.064
 relief+0.062
 emergencies+0.061
 theirs+0.059
Neg logit contributions
op-0.112
s-0.112
ony-0.111
anda-0.110
's-0.109
Token good
Token ID922
Feature activation+9.81e-03

Pos logit contributions
 electricity+0.038
 yours+0.036
 such+0.033
 theirs+0.031
 its+0.030
Neg logit contributions
anda-0.094
ppy-0.094
op-0.093
ony-0.091
uffy-0.090
Token job
Token ID1693
Feature activation+1.39e-02

Pos logit contributions
 electricity+0.012
 yours+0.011
 our+0.010
 such+0.010
 its+0.010
Neg logit contributions
op-0.026
ppy-0.026
anda-0.025
ony-0.025
int-0.025
Token,"
Token ID553
Feature activation-2.64e-02

Pos logit contributions
 electricity+0.024
 yours+0.024
 our+0.022
 such+0.022
 its+0.021
Neg logit contributions
ppy-0.029
op-0.028
anda-0.028
ony-0.027
int-0.027
Token Ben
Token ID3932
Feature activation-1.25e-01

Pos logit contributions
ppy+0.060
op+0.060
anda+0.059
ony+0.058
int+0.057
Neg logit contributions
 electricity-0.040
 yours-0.039
 such-0.035
 our-0.034
 its-0.033
Token said
Token ID531
Feature activation-1.21e-02

Pos logit contributions
oise+0.284
uffy+0.280
ig+0.271
op+0.271
anda+0.264
Neg logit contributions
 a-0.196
 its-0.195
 an-0.182
 fun-0.180
 your-0.176
Token,
Token ID11
Feature activation-1.89e-02

Pos logit contributions
ppy+0.027
op+0.027
anda+0.026
ony+0.026
oise+0.026
Neg logit contributions
 electricity-0.019
 yours-0.019
 its-0.018
 our-0.017
 such-0.017
Token adm
Token ID6178
Feature activation-1.54e-03

Pos logit contributions
op+0.042
ppy+0.042
anda+0.041
ony+0.041
int+0.041
Neg logit contributions
 electricity-0.029
 yours-0.029
 our-0.026
 its-0.026
 such-0.026
Tokeniring
Token ID3428
Feature activation+2.67e-02

Pos logit contributions
ppy+0.003
op+0.003
anda+0.003
ony+0.003
int+0.003
Neg logit contributions
 electricity-0.003
 yours-0.003
 our-0.003
 such-0.003
 its-0.002
Token their
Token ID511
Feature activation+4.77e-03

Pos logit contributions
 electricity+0.028
 yours+0.025
 awesome+0.022
 power+0.020
 such+0.019
Neg logit contributions
i-0.067
ie-0.067
y-0.066
anda-0.065
ppy-0.065
Tokenpa
Token ID8957
Feature activation-9.19e-02

Pos logit contributions
ppy+0.112
op+0.110
ony+0.107
anda+0.107
int+0.104
Neg logit contributions
 electricity-0.247
 yours-0.242
 its-0.228
 our-0.227
 such-0.226
Token."
Token ID526
Feature activation-3.23e-02

Pos logit contributions
op+0.188
ppy+0.188
anda+0.182
ony+0.181
int+0.181
Neg logit contributions
 electricity-0.158
 yours-0.158
 our-0.143
 such-0.142
 its-0.140
Token
Token ID198
Feature activation-2.15e-02

Pos logit contributions
op+0.065
ppy+0.065
anda+0.063
ony+0.063
int+0.062
Neg logit contributions
 electricity-0.057
 yours-0.057
 our-0.051
 such-0.051
 its-0.050
Token
Token ID198
Feature activation-2.49e-02

Pos logit contributions
op+0.040
ppy+0.039
anda+0.039
ony+0.038
uffy+0.037
Neg logit contributions
 yours-0.042
 electricity-0.042
 such-0.038
 our-0.037
 theirs-0.037
TokenL
Token ID43
Feature activation-9.18e-02

Pos logit contributions
op+0.040
ppy+0.040
anda+0.038
ony+0.038
oise+0.036
Neg logit contributions
 yours-0.056
 electricity-0.055
 such-0.051
 theirs-0.050
 our-0.050
Tokenila
Token ID10102
Feature activation-1.28e-01

Pos logit contributions
op+0.159
ppy+0.151
int+0.150
oise+0.149
ony+0.148
Neg logit contributions
 yours-0.191
 electricity-0.189
 our-0.174
 such-0.173
 its-0.171
Token and
Token ID290
Feature activation-2.61e-02

Pos logit contributions
op+0.292
oise+0.280
ppy+0.277
erry+0.272
uffy+0.271
Neg logit contributions
 its-0.199
 your-0.194
 our-0.194
 a-0.187
 peace-0.187
Token Sam
Token ID3409
Feature activation-1.12e-01

Pos logit contributions
op+0.065
ppy+0.065
int+0.064
anda+0.064
ony+0.063
Neg logit contributions
 electricity-0.034
 yours-0.034
 our-0.030
 such-0.030
 its-0.029
Token smile
Token ID8212
Feature activation-3.36e-02

Pos logit contributions
op+0.258
anda+0.247
uffy+0.242
ppy+0.238
ony+0.238
Neg logit contributions
 yours-0.167
 electricity-0.167
 respect-0.162
 our-0.159
 such-0.158
Token.
Token ID13
Feature activation-1.83e-03

Pos logit contributions
ppy+0.050
op+0.048
y+0.047
int+0.046
owed+0.045
Neg logit contributions
 yours-0.079
 electricity-0.079
 our-0.071
 such-0.070
 theirs-0.070
Token They
Token ID1119
Feature activation-2.83e-02

Pos logit contributions
op+0.002
s+0.002
ppy+0.002
y+0.002
ella+0.002
Neg logit contributions
 yours-0.004
 electricity-0.004
 awesome-0.004
 such-0.004
 purpose-0.004
Token to
Token ID284
Feature activation+4.84e-02

Pos logit contributions
 electricity+0.028
 yours+0.027
 awesome+0.024
 yourselves+0.024
 our+0.024
Neg logit contributions
op-0.040
ppy-0.040
ll-0.039
y-0.039
ony-0.039
Token go
Token ID467
Feature activation+2.23e-02

Pos logit contributions
 electricity+0.089
 yours+0.088
 our+0.080
 such+0.080
 its+0.079
Neg logit contributions
op-0.094
ppy-0.094
anda-0.092
ony-0.091
int-0.090
Token home
Token ID1363
Feature activation-7.09e-03

Pos logit contributions
 electricity+0.033
 yours+0.033
 such+0.028
 our+0.028
 theirs+0.028
Neg logit contributions
op-0.051
ppy-0.050
anda-0.050
ony-0.049
int-0.049
Token,
Token ID11
Feature activation+1.74e-02

Pos logit contributions
ppy+0.011
op+0.011
anda+0.011
ony+0.011
int+0.011
Neg logit contributions
 electricity-0.015
 yours-0.015
 our-0.014
 such-0.014
 its-0.014
Token the
Token ID262
Feature activation-1.34e-02

Pos logit contributions
 electricity+0.023
 yours+0.023
 our+0.019
 theirs+0.018
 awesome+0.018
Neg logit contributions
op-0.042
ppy-0.042
anda-0.041
uffy-0.041
ony-0.040
Token hipp
Token ID18568
Feature activation-1.43e-01

Pos logit contributions
op+0.031
ppy+0.031
anda+0.030
int+0.030
ony+0.030
Neg logit contributions
 yours-0.020
 electricity-0.019
 our-0.017
 such-0.017
 theirs-0.017
Tokeno
Token ID78
Feature activation-1.04e-01

Pos logit contributions
ppy+0.173
ches+0.156
int+0.151
ig+0.150
ony+0.150
Neg logit contributions
 such-0.316
 its-0.312
 peace-0.299
 the-0.286
 their-0.285
Token felt
Token ID2936
Feature activation+5.29e-02

Pos logit contributions
op+0.176
oise+0.169
mun+0.168
int+0.159
ony+0.159
Neg logit contributions
 its-0.202
 electricity-0.195
 our-0.192
 such-0.186
 peace-0.186
Token sad
Token ID6507
Feature activation+2.11e-02

Pos logit contributions
 electricity+0.097
 yours+0.094
 purpose+0.082
 yourselves+0.082
 theirs+0.081
Neg logit contributions
ppy-0.095
int-0.094
adow-0.092
ackers-0.089
ie-0.089
Token.
Token ID13
Feature activation-4.79e-03

Pos logit contributions
 electricity+0.045
 yours+0.045
 our+0.042
 theirs+0.040
 such+0.040
Neg logit contributions
ppy-0.034
op-0.034
y-0.033
anda-0.032
int-0.032
Token He
Token ID679
Feature activation-3.06e-02

Pos logit contributions
op+0.010
ppy+0.009
anda+0.009
int+0.009
ony+0.009
Neg logit contributions
 yours-0.009
 electricity-0.009
 our-0.008
 theirs-0.008
 such-0.008