TOP ACTIVATIONS
MAX = 9.371

time , there was a toddler named Jack . Jack 's
, there was a naughty toddler named Tim my . One
<|endoftext|> Once there was a toddler named Jack who wanted to
a time there was a Stub born Old Woman . She
time , there was a boy . He had an original
, there was a shy boy named Tim . He was
time , there was a boy named Tom . Tom liked
time , there was a boy named Tim . Tim loved
time , there was a boy named Tim my . Tim
time , there was a boy named Jack . He went
time , there was a boy named Paul . He wanted
time , there was a boy named Tim my who loved
time , there was a boy named Tim my . He
time , there was a boy named Joey . One day
time , there was a boy called Bob . Bob was
time , there was a boy named Tim my . Tim
time , there was a boy named Tim my . Tim
time , there was a boy named Alex . He was
time , there was a boy named Joe . He was
time , there was a boy who lived in the forest

MIN ACTIVATIONS
MIN = -8.607

p . It fl apped its wings . It could fly
welcome !" It fl apped its wings and flew away .
ped louder and fl apped its wing . "
him . It fl apped its wings and made loud noises
branch . He fl apped its wings and tried to help
. Then it fl apped its wings and chased Lily .
The fox smiled and waved its tail . It watched them
was happy . It waved its antenna e .
his sed and fl apped its wings . It spl ashed
disappointment . M arl on 's mum hugged him
. The bird fl apped its wings one more time and
. Blue bird fl apped its wings hard and flew higher
The cat smiled its biggest smile , with its
. The bird fl apped its wings and flew a little
to grow l and sn arl to try and scare her
sw am away , waving its tail goodbye .
time the toy wink ed it made her smile . The
and the little mouse waved its tiny paws . Joy smiled
ped happily and fl apped its wings . Ben and Lily
car . It fl apped its wings and flew fast through

INTERVAL 4.877 - 9.371
CONTAINS 2.019%

world , but there was a big , dark fence around
, there was a little girl named Lily . She loved
, there was a little boy named Tim . He saw
monkey . The monkey loved to play all day long .
One day , a little girl was walking in a park

INTERVAL 0.382 - 4.877
CONTAINS 40.597%

Lily and her friends were scared . They didn 't know
They smile . They are friends again . They play with
are you doing ?" A loud voice shouted . It was
Tom . " It 's okay ," she said , "
scared anymore . They played with Gig i all day long

INTERVAL -4.112 - 0.382
CONTAINS 54.889%

was a little girl . Every day they played in the
she forgot all about her cell . However Lisa
. Sara picked it up and said , " Look ,
outside to play . They skipped and hopped , just like
she asked her mom . Her mom explained that it was

INTERVAL -8.607 - -4.112
CONTAINS 2.495%

She was very happy and said , " Let 's go
tower out of them . It was a fun and delicious
, just like you . It breat hes and feels and
then , one lady said she had some clean pieces of
run away . They hope the dog won 't catch them
Token time
Token ID640
Feature activation+1.49e+00

Pos logit contributions
e+0.106
 ha+0.081
 read+0.074
nam+0.074
ñ+0.073
Neg logit contributions
lights-0.122
udd-0.114
orting-0.110
ires-0.107
s-0.106
Token,
Token ID11
Feature activation+2.91e+00

Pos logit contributions
e+0.039
 ha+0.030
 read+0.028
 weep+0.026
 named+0.026
Neg logit contributions
lights-0.043
orting-0.038
udd-0.038
ires-0.037
angles-0.037
Token there
Token ID612
Feature activation-1.70e+00

Pos logit contributions
e+0.076
 ha+0.059
ñ+0.058
 read+0.055
 weep+0.053
Neg logit contributions
lights-0.082
s-0.073
ires-0.073
udd-0.072
orting-0.072
Token was
Token ID373
Feature activation+5.44e+00

Pos logit contributions
lights+0.048
orting+0.043
udd+0.042
ires+0.041
angles+0.041
Neg logit contributions
e-0.046
 ha-0.035
 read-0.033
 weep-0.032
ñ-0.031
Token a
Token ID257
Feature activation+6.65e+00

Pos logit contributions
e+0.136
 ha+0.134
ñ+0.126
 weep+0.125
 disob+0.112
Neg logit contributions
ires-0.116
lights-0.112
udd-0.107
 They-0.104
s-0.102
Token toddler
Token ID30773
Feature activation+9.37e+00

Pos logit contributions
e+0.247
ez+0.190
ñ+0.180
years+0.178
tail+0.178
Neg logit contributions
udd-0.112
uddle-0.111
s-0.105
lights-0.098
 new-0.089
Token named
Token ID3706
Feature activation+5.93e+00

Pos logit contributions
e+0.146
ñ+0.119
 ha+0.111
nam+0.105
ZE+0.103
Neg logit contributions
s-0.313
lights-0.299
udd-0.274
orting-0.270
ails-0.263
Token Jack
Token ID3619
Feature activation+4.14e+00

Pos logit contributions
e+0.160
 ha+0.120
ñ+0.110
 weep+0.109
 read+0.109
Neg logit contributions
lights-0.164
orting-0.149
udd-0.149
s-0.146
ires-0.145
Token.
Token ID13
Feature activation+2.47e+00

Pos logit contributions
e+0.132
 ha+0.113
ñ+0.103
nam+0.101
 weep+0.100
Neg logit contributions
lights-0.087
udd-0.080
s-0.079
orting-0.076
ires-0.073
Token Jack
Token ID3619
Feature activation+2.36e+00

Pos logit contributions
e+0.059
 ha+0.041
 read+0.041
 weep+0.038
 named+0.037
Neg logit contributions
lights-0.079
orting-0.070
udd-0.070
angles-0.068
ires-0.068
Token's
Token ID338
Feature activation+1.09e+00

Pos logit contributions
e+0.065
 ha+0.056
ñ+0.052
nam+0.051
 fright+0.048
Neg logit contributions
lights-0.055
s-0.054
udd-0.053
orting-0.048
ires-0.047
Token,
Token ID11
Feature activation+3.04e+00

Pos logit contributions
e+0.063
 ha+0.048
 read+0.046
 weep+0.044
ñ+0.044
Neg logit contributions
lights-0.061
orting-0.054
udd-0.053
ires-0.052
s-0.051
Token there
Token ID612
Feature activation-1.54e+00

Pos logit contributions
e+0.089
ñ+0.075
 ha+0.072
 read+0.068
 comb+0.068
Neg logit contributions
lights-0.074
s-0.066
ires-0.063
orting-0.061
udd-0.060
Token was
Token ID373
Feature activation+5.34e+00

Pos logit contributions
lights+0.041
udd+0.036
orting+0.036
ires+0.035
s+0.034
Neg logit contributions
e-0.044
 ha-0.034
 read-0.032
ñ-0.032
 weep-0.031
Token a
Token ID257
Feature activation+6.89e+00

Pos logit contributions
 ha+0.139
e+0.139
 weep+0.131
ñ+0.126
 disob+0.120
Neg logit contributions
ires-0.107
lights-0.103
 They-0.100
udd-0.094
s-0.090
Token naughty
Token ID45128
Feature activation+3.38e+00

Pos logit contributions
e+0.255
ez+0.195
ñ+0.181
tail+0.180
years+0.180
Neg logit contributions
udd-0.116
uddle-0.116
s-0.110
lights-0.108
 new-0.097
Token toddler
Token ID30773
Feature activation+9.20e+00

Pos logit contributions
e+0.105
nam+0.076
years+0.071
tail+0.069
 Amen+0.068
Neg logit contributions
udd-0.080
lights-0.079
uddle-0.075
orting-0.074
der-0.074
Token named
Token ID3706
Feature activation+5.68e+00

Pos logit contributions
e+0.152
 ha+0.109
ñ+0.108
nam+0.101
 weep+0.097
Neg logit contributions
lights-0.309
s-0.302
orting-0.285
udd-0.276
ails-0.268
Token Tim
Token ID5045
Feature activation+4.61e+00

Pos logit contributions
e+0.146
 ha+0.106
 read+0.098
 weep+0.096
ñ+0.095
Neg logit contributions
lights-0.169
udd-0.153
orting-0.153
ires-0.148
angles-0.146
Tokenmy
Token ID1820
Feature activation+4.78e+00

Pos logit contributions
e+0.134
 ha+0.112
ñ+0.101
 read+0.098
nam+0.098
Neg logit contributions
lights-0.112
udd-0.104
orting-0.100
s-0.097
angles-0.095
Token.
Token ID13
Feature activation+2.39e+00

Pos logit contributions
e+0.161
 ha+0.142
ñ+0.128
nam+0.127
 weep+0.126
Neg logit contributions
lights-0.088
s-0.080
udd-0.079
orting-0.076
ires-0.071
Token One
Token ID1881
Feature activation+4.95e-01

Pos logit contributions
e+0.054
 ha+0.038
 read+0.037
 weep+0.035
 named+0.035
Neg logit contributions
lights-0.078
orting-0.069
udd-0.069
angles-0.068
ires-0.068
Token<|endoftext|>
Token ID50256
Feature activation+1.86e+00

Pos logit contributions
lights+0.046
udd+0.043
angles+0.041
orting+0.041
falls+0.040
Neg logit contributions
e-0.038
 named-0.026
 ha-0.025
 read-0.025
 3-0.025
TokenOnce
Token ID7454
Feature activation-2.30e+00

Pos logit contributions
e+0.062
 ha+0.051
 read+0.048
 weep+0.048
ñ+0.047
Neg logit contributions
lights-0.039
orting-0.034
udd-0.033
s-0.033
falls-0.032
Token there
Token ID612
Feature activation-2.56e+00

Pos logit contributions
lights+0.065
orting+0.062
udd+0.059
falls+0.059
rows+0.058
Neg logit contributions
e-0.056
 named-0.042
 ha-0.041
 weep-0.039
 read-0.039
Token was
Token ID373
Feature activation+5.03e+00

Pos logit contributions
lights+0.074
orting+0.066
udd+0.065
ires+0.064
angles+0.063
Neg logit contributions
e-0.069
 ha-0.051
 read-0.048
 weep-0.046
 named-0.046
Token a
Token ID257
Feature activation+6.52e+00

Pos logit contributions
e+0.111
 ha+0.106
ñ+0.098
 weep+0.094
 comb+0.086
Neg logit contributions
lights-0.130
ires-0.126
udd-0.122
s-0.119
orting-0.115
Token toddler
Token ID30773
Feature activation+9.14e+00

Pos logit contributions
e+0.232
ez+0.170
years+0.170
ñ+0.165
nam+0.165
Neg logit contributions
udd-0.118
s-0.117
uddle-0.116
lights-0.106
orting-0.094
Token named
Token ID3706
Feature activation+5.41e+00

Pos logit contributions
e+0.150
ñ+0.113
 ha+0.112
nam+0.101
ZE+0.098
Neg logit contributions
s-0.306
lights-0.300
udd-0.276
orting-0.275
ails-0.262
Token Jack
Token ID3619
Feature activation+4.09e+00

Pos logit contributions
e+0.141
 ha+0.105
 read+0.096
ñ+0.096
 weep+0.095
Neg logit contributions
lights-0.156
orting-0.142
udd-0.141
ires-0.138
s-0.136
Token who
Token ID508
Feature activation+3.17e+00

Pos logit contributions
e+0.127
 ha+0.106
ñ+0.096
 read+0.096
 weep+0.094
Neg logit contributions
lights-0.093
udd-0.083
s-0.081
orting-0.080
ires-0.077
Token wanted
Token ID2227
Feature activation+1.77e+00

Pos logit contributions
e+0.091
 ha+0.074
nam+0.070
bug+0.068
ñ+0.068
Neg logit contributions
udd-0.071
s-0.069
lights-0.067
orting-0.064
falls-0.060
Token to
Token ID284
Feature activation+4.95e+00

Pos logit contributions
e+0.063
 ha+0.049
 weep+0.047
 Amen+0.046
bug+0.046
Neg logit contributions
lights-0.033
orting-0.029
s-0.029
udd-0.028
ires-0.027
Token a
Token ID257
Feature activation+4.30e+00

Pos logit contributions
e+0.005
 ha+0.004
 weep+0.004
ñ+0.004
 comb+0.004
Neg logit contributions
lights-0.004
udd-0.004
ires-0.004
s-0.004
der-0.004
Token time
Token ID640
Feature activation+1.54e+00

Pos logit contributions
e+0.111
 ha+0.087
nam+0.083
 weep+0.080
 read+0.080
Neg logit contributions
lights-0.118
udd-0.113
orting-0.106
ires-0.104
s-0.103
Token there
Token ID612
Feature activation-1.27e+00

Pos logit contributions
e+0.042
 ha+0.032
 read+0.030
 weep+0.029
ñ+0.028
Neg logit contributions
lights-0.044
orting-0.039
udd-0.038
ires-0.037
angles-0.037
Token was
Token ID373
Feature activation+5.52e+00

Pos logit contributions
lights+0.036
orting+0.032
udd+0.032
ires+0.031
angles+0.031
Neg logit contributions
e-0.035
 ha-0.026
 read-0.025
 weep-0.024
ñ-0.023
Token a
Token ID257
Feature activation+6.68e+00

Pos logit contributions
e+0.123
 ha+0.118
 weep+0.110
ñ+0.108
 fright+0.100
Neg logit contributions
ires-0.134
lights-0.133
udd-0.132
s-0.123
orting-0.122
Token Stub
Token ID41135
Feature activation+8.98e+00

Pos logit contributions
e+0.244
ez+0.188
ñ+0.180
years+0.178
 fright+0.177
Neg logit contributions
udd-0.116
uddle-0.112
s-0.111
lights-0.101
ires-0.091
Tokenborn
Token ID6286
Feature activation+4.09e+00

Pos logit contributions
e+0.284
 ha+0.228
 read+0.216
 weep+0.210
ñ+0.208
Neg logit contributions
lights-0.207
orting-0.184
udd-0.183
ires-0.175
angles-0.172
Token Old
Token ID5706
Feature activation+4.96e+00

Pos logit contributions
e+0.112
 ha+0.082
 weep+0.080
 read+0.077
nam+0.076
Neg logit contributions
lights-0.109
udd-0.102
orting-0.100
s-0.098
falls-0.095
Token Woman
Token ID14381
Feature activation+6.16e+00

Pos logit contributions
e+0.124
 ha+0.091
 read+0.085
 weep+0.081
ñ+0.080
Neg logit contributions
lights-0.151
orting-0.135
udd-0.135
ires-0.131
angles-0.130
Token.
Token ID13
Feature activation+2.40e+00

Pos logit contributions
e+0.190
 ha+0.157
 weep+0.147
 read+0.142
ñ+0.141
Neg logit contributions
lights-0.141
udd-0.124
s-0.123
orting-0.123
ires-0.119
Token She
Token ID1375
Feature activation-5.68e-01

Pos logit contributions
e+0.056
 ha+0.042
 read+0.040
 named+0.037
tail+0.035
Neg logit contributions
lights-0.079
orting-0.067
udd-0.067
angles-0.066
ires-0.065
Token time
Token ID640
Feature activation+1.50e+00

Pos logit contributions
e+0.116
 ha+0.092
nam+0.087
ñ+0.084
 read+0.084
Neg logit contributions
lights-0.125
udd-0.121
orting-0.113
ires-0.111
s-0.111
Token,
Token ID11
Feature activation+3.45e+00

Pos logit contributions
e+0.039
 ha+0.030
 read+0.028
 weep+0.026
 named+0.026
Neg logit contributions
lights-0.044
orting-0.039
udd-0.039
angles-0.038
ires-0.038
Token there
Token ID612
Feature activation-1.10e+00

Pos logit contributions
e+0.091
 ha+0.069
ñ+0.068
 read+0.065
 weep+0.064
Neg logit contributions
lights-0.097
ires-0.087
udd-0.086
s-0.086
orting-0.085
Token was
Token ID373
Feature activation+5.76e+00

Pos logit contributions
lights+0.034
orting+0.030
udd+0.030
ires+0.029
angles+0.029
Neg logit contributions
e-0.028
 ha-0.021
 read-0.019
 named-0.018
 weep-0.018
Token a
Token ID257
Feature activation+7.06e+00

Pos logit contributions
e+0.129
 ha+0.125
 weep+0.121
ñ+0.114
 fright+0.107
Neg logit contributions
ires-0.140
udd-0.133
lights-0.132
s-0.127
orting-0.121
Token boy
Token ID2933
Feature activation+8.97e+00

Pos logit contributions
e+0.260
ez+0.204
ñ+0.195
 fright+0.193
tail+0.190
Neg logit contributions
udd-0.115
uddle-0.113
s-0.111
lights-0.099
ires-0.095
Token.
Token ID13
Feature activation+3.18e+00

Pos logit contributions
ñ+0.128
Rober+0.128
nam+0.127
 ha+0.124
e+0.122
Neg logit contributions
s-0.284
lights-0.255
udd-0.255
orting-0.244
ails-0.237
Token He
Token ID679
Feature activation+4.31e-02

Pos logit contributions
e+0.122
 ha+0.107
ñ+0.101
 comb+0.099
 weep+0.098
Neg logit contributions
lights-0.049
falls-0.037
orting-0.036
ires-0.036
s-0.035
Token had
Token ID550
Feature activation+7.72e-01

Pos logit contributions
e+0.001
nam+0.001
 ha+0.001
ñ+0.001
 fright+0.001
Neg logit contributions
s-0.001
udd-0.001
lights-0.001
 They-0.000
 they-0.000
Token an
Token ID281
Feature activation+3.48e+00

Pos logit contributions
e+0.016
 weep+0.012
ñ+0.011
 ha+0.011
bug+0.011
Neg logit contributions
lights-0.022
udd-0.021
s-0.021
ires-0.020
angles-0.019
Token original
Token ID2656
Feature activation+6.25e-01

Pos logit contributions
e+0.118
ñ+0.096
 ha+0.095
 read+0.088
 weep+0.088
Neg logit contributions
ires-0.064
lights-0.063
udd-0.057
angles-0.057
orting-0.056
Token,
Token ID11
Feature activation+3.48e+00

Pos logit contributions
e+0.075
 ha+0.057
 read+0.053
 weep+0.051
ñ+0.051
Neg logit contributions
lights-0.076
orting-0.067
udd-0.067
ires-0.065
angles-0.064
Token there
Token ID612
Feature activation-1.22e+00

Pos logit contributions
e+0.084
 ha+0.063
ñ+0.063
 read+0.058
 weep+0.057
Neg logit contributions
lights-0.104
ires-0.095
s-0.094
orting-0.093
udd-0.093
Token was
Token ID373
Feature activation+5.77e+00

Pos logit contributions
lights+0.034
orting+0.030
udd+0.030
ires+0.029
angles+0.029
Neg logit contributions
e-0.034
 ha-0.026
 read-0.024
 weep-0.023
ñ-0.023
Token a
Token ID257
Feature activation+7.24e+00

Pos logit contributions
 ha+0.150
e+0.143
ñ+0.136
 weep+0.136
 fright+0.122
Neg logit contributions
ires-0.132
 They-0.114
lights-0.114
udd-0.112
s-0.109
Token shy
Token ID15800
Feature activation+3.20e+00

Pos logit contributions
e+0.264
ez+0.208
ñ+0.202
years+0.196
 fright+0.191
Neg logit contributions
udd-0.120
uddle-0.119
s-0.116
ires-0.110
lights-0.107
Token boy
Token ID2933
Feature activation+8.96e+00

Pos logit contributions
e+0.097
nam+0.070
 ha+0.066
years+0.066
 read+0.064
Neg logit contributions
lights-0.079
udd-0.077
der-0.073
orting-0.072
uddle-0.072
Token named
Token ID3706
Feature activation+6.14e+00

Pos logit contributions
e+0.118
 ha+0.112
ñ+0.108
nam+0.104
Rober+0.097
Neg logit contributions
s-0.293
lights-0.286
orting-0.279
udd-0.273
ails-0.263
Token Tim
Token ID5045
Feature activation+4.85e+00

Pos logit contributions
e+0.161
 ha+0.120
 weep+0.111
 read+0.109
 comb+0.109
Neg logit contributions
lights-0.177
orting-0.159
udd-0.157
ires-0.154
falls-0.153
Token.
Token ID13
Feature activation+2.49e+00

Pos logit contributions
e+0.157
 ha+0.140
ñ+0.129
nam+0.126
 weep+0.123
Neg logit contributions
lights-0.092
udd-0.086
s-0.085
orting-0.082
ires-0.077
Token He
Token ID679
Feature activation-4.46e-01

Pos logit contributions
e+0.064
 ha+0.048
 read+0.044
 weep+0.043
ñ+0.042
Neg logit contributions
lights-0.074
orting-0.066
udd-0.066
ires-0.064
angles-0.064
Token was
Token ID373
Feature activation+3.53e+00

Pos logit contributions
s+0.008
udd+0.007
lights+0.007
falls+0.006
orting+0.006
Neg logit contributions
e-0.014
 ha-0.012
nam-0.012
bug-0.012
 fright-0.012
Token time
Token ID640
Feature activation+2.43e+00

Pos logit contributions
e+0.125
 ha+0.099
nam+0.091
ñ+0.089
 weep+0.088
Neg logit contributions
lights-0.137
udd-0.130
orting-0.124
s-0.120
ires-0.120
Token,
Token ID11
Feature activation+3.16e+00

Pos logit contributions
e+0.067
 ha+0.051
 read+0.049
 weep+0.047
ñ+0.046
Neg logit contributions
lights-0.067
orting-0.059
udd-0.059
ires-0.057
angles-0.057
Token there
Token ID612
Feature activation-1.45e+00

Pos logit contributions
e+0.084
ñ+0.068
 ha+0.067
 read+0.062
 comb+0.061
Neg logit contributions
lights-0.086
s-0.078
ires-0.076
orting-0.074
udd-0.074
Token was
Token ID373
Feature activation+5.62e+00

Pos logit contributions
lights+0.039
udd+0.034
orting+0.034
ires+0.033
angles+0.033
Neg logit contributions
e-0.041
 ha-0.032
 read-0.030
ñ-0.029
 weep-0.029
Token a
Token ID257
Feature activation+7.02e+00

Pos logit contributions
 ha+0.148
e+0.144
 weep+0.138
ñ+0.135
 disob+0.126
Neg logit contributions
ires-0.114
lights-0.110
 They-0.107
udd-0.103
s-0.100
Token boy
Token ID2933
Feature activation+8.95e+00

Pos logit contributions
e+0.259
ez+0.202
ñ+0.191
years+0.189
tail+0.184
Neg logit contributions
uddle-0.118
udd-0.117
s-0.109
lights-0.106
 new-0.098
Token named
Token ID3706
Feature activation+5.89e+00

Pos logit contributions
ñ+0.129
nam+0.129
e+0.125
Rober+0.125
 ha+0.125
Neg logit contributions
s-0.272
lights-0.258
udd-0.249
orting-0.249
ails-0.240
Token Tom
Token ID4186
Feature activation+4.85e+00

Pos logit contributions
e+0.152
 ha+0.114
 read+0.104
 weep+0.104
ñ+0.103
Neg logit contributions
lights-0.173
orting-0.155
udd-0.154
ires-0.151
s-0.150
Token.
Token ID13
Feature activation+2.42e+00

Pos logit contributions
e+0.154
 ha+0.133
ñ+0.121
nam+0.118
 weep+0.117
Neg logit contributions
lights-0.103
udd-0.092
s-0.091
orting-0.090
ires-0.086
Token Tom
Token ID4186
Feature activation+2.88e+00

Pos logit contributions
e+0.056
 read+0.038
 ha+0.038
 named+0.034
ñ+0.032
Neg logit contributions
lights-0.081
udd-0.072
orting-0.072
ires-0.070
angles-0.069
Token liked
Token ID8288
Feature activation+3.25e+00

Pos logit contributions
e+0.080
 ha+0.069
ñ+0.063
nam+0.062
 fright+0.061
Neg logit contributions
lights-0.067
s-0.064
udd-0.063
orting-0.058
ires-0.057
Token time
Token ID640
Feature activation+2.41e+00

Pos logit contributions
e+0.121
 ha+0.095
nam+0.087
 read+0.085
ñ+0.085
Neg logit contributions
lights-0.136
udd-0.128
orting-0.123
s-0.119
ires-0.119
Token,
Token ID11
Feature activation+3.27e+00

Pos logit contributions
e+0.068
 ha+0.053
 read+0.050
ñ+0.049
 weep+0.048
Neg logit contributions
lights-0.064
orting-0.057
udd-0.056
s-0.055
ires-0.055
Token there
Token ID612
Feature activation-1.36e+00

Pos logit contributions
e+0.087
ñ+0.072
 ha+0.069
 read+0.064
 weep+0.063
Neg logit contributions
lights-0.087
s-0.080
ires-0.079
orting-0.076
udd-0.075
Token was
Token ID373
Feature activation+5.56e+00

Pos logit contributions
lights+0.036
udd+0.031
orting+0.031
ires+0.030
s+0.030
Neg logit contributions
e-0.039
 ha-0.030
 read-0.028
ñ-0.028
 weep-0.028
Token a
Token ID257
Feature activation+7.03e+00

Pos logit contributions
 ha+0.149
e+0.143
 weep+0.138
ñ+0.136
 disob+0.126
Neg logit contributions
ires-0.114
 They-0.106
lights-0.103
s-0.097
udd-0.096
Token boy
Token ID2933
Feature activation+8.95e+00

Pos logit contributions
e+0.262
ez+0.205
ñ+0.192
years+0.192
tail+0.186
Neg logit contributions
uddle-0.116
udd-0.115
s-0.108
lights-0.103
 new-0.099
Token named
Token ID3706
Feature activation+6.02e+00

Pos logit contributions
ñ+0.129
nam+0.129
e+0.126
Rober+0.125
 ha+0.123
Neg logit contributions
s-0.271
lights-0.254
udd-0.249
orting-0.249
ails-0.238
Token Tim
Token ID5045
Feature activation+4.82e+00

Pos logit contributions
e+0.163
 ha+0.125
 weep+0.117
ñ+0.114
 comb+0.113
Neg logit contributions
lights-0.166
orting-0.148
udd-0.145
s-0.144
ires-0.144
Token.
Token ID13
Feature activation+2.59e+00

Pos logit contributions
e+0.154
 ha+0.139
ñ+0.129
nam+0.125
tail+0.120
Neg logit contributions
lights-0.091
udd-0.086
s-0.085
orting-0.081
angles-0.077
Token Tim
Token ID5045
Feature activation+3.11e+00

Pos logit contributions
e+0.064
 ha+0.046
 read+0.044
 weep+0.041
 named+0.041
Neg logit contributions
lights-0.080
udd-0.072
orting-0.072
ires-0.070
angles-0.070
Token loved
Token ID6151
Feature activation+3.21e+00

Pos logit contributions
e+0.085
 ha+0.079
ñ+0.074
nam+0.072
 fright+0.069
Neg logit contributions
udd-0.065
s-0.065
lights-0.065
orting-0.057
angles-0.056
Token time
Token ID640
Feature activation+2.28e+00

Pos logit contributions
e+0.116
 ha+0.086
 read+0.078
 weep+0.076
ñ+0.076
Neg logit contributions
lights-0.148
udd-0.135
orting-0.133
ires-0.130
angles-0.128
Token,
Token ID11
Feature activation+3.08e+00

Pos logit contributions
e+0.063
 ha+0.047
 read+0.045
 weep+0.043
ñ+0.043
Neg logit contributions
lights-0.064
orting-0.056
udd-0.056
ires-0.054
angles-0.054
Token there
Token ID612
Feature activation-1.41e+00

Pos logit contributions
e+0.085
ñ+0.070
 ha+0.068
 read+0.063
 comb+0.062
Neg logit contributions
lights-0.079
s-0.074
ires-0.072
orting-0.070
udd-0.069
Token was
Token ID373
Feature activation+5.60e+00

Pos logit contributions
lights+0.038
udd+0.033
orting+0.033
ires+0.032
s+0.032
Neg logit contributions
e-0.040
 ha-0.031
 read-0.029
ñ-0.028
 weep-0.028
Token a
Token ID257
Feature activation+6.97e+00

Pos logit contributions
 ha+0.141
e+0.137
 weep+0.130
ñ+0.124
 disob+0.113
Neg logit contributions
ires-0.130
lights-0.121
udd-0.120
 They-0.113
s-0.112
Token boy
Token ID2933
Feature activation+8.93e+00

Pos logit contributions
e+0.259
ez+0.202
years+0.193
ñ+0.193
 weep+0.186
Neg logit contributions
uddle-0.115
udd-0.115
s-0.107
lights-0.101
 new-0.095
Token named
Token ID3706
Feature activation+5.88e+00

Pos logit contributions
nam+0.130
 ha+0.129
Rober+0.128
ñ+0.126
e+0.121
Neg logit contributions
s-0.275
lights-0.253
udd-0.248
orting-0.247
ails-0.239
Token Tim
Token ID5045
Feature activation+4.67e+00

Pos logit contributions
e+0.154
 ha+0.117
 weep+0.106
 read+0.105
ñ+0.105
Neg logit contributions
lights-0.170
orting-0.153
udd-0.150
ires-0.148
s-0.147
Tokenmy
Token ID1820
Feature activation+4.89e+00

Pos logit contributions
e+0.134
 ha+0.115
ñ+0.104
nam+0.101
 read+0.099
Neg logit contributions
lights-0.111
udd-0.104
s-0.100
orting-0.100
angles-0.096
Token.
Token ID13
Feature activation+2.44e+00

Pos logit contributions
e+0.165
 ha+0.151
ñ+0.135
nam+0.135
 weep+0.131
Neg logit contributions
s-0.081
lights-0.080
udd-0.075
orting-0.069
ires-0.066
Token Tim
Token ID5045
Feature activation+2.55e+00

Pos logit contributions
e+0.060
 ha+0.043
 read+0.041
 weep+0.039
 named+0.039
Neg logit contributions
lights-0.076
udd-0.068
orting-0.068
ires-0.066
angles-0.066
Token time
Token ID640
Feature activation+1.49e+00

Pos logit contributions
e+0.119
 ha+0.091
nam+0.087
 read+0.085
 weep+0.084
Neg logit contributions
lights-0.131
udd-0.125
orting-0.119
ires-0.116
s-0.114
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
e+0.043
 ha+0.033
 read+0.032
ñ+0.031
 weep+0.031
Neg logit contributions
lights-0.038
orting-0.034
udd-0.034
ires-0.033
s-0.033
Token there
Token ID612
Feature activation-9.42e-01

Pos logit contributions
e+0.102
ñ+0.080
 ha+0.079
 read+0.076
 weep+0.075
Neg logit contributions
lights-0.095
udd-0.085
ires-0.085
orting-0.083
s-0.083
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
lights+0.028
orting+0.026
udd+0.025
ires+0.024
angles+0.024
Neg logit contributions
e-0.025
 ha-0.018
 read-0.017
 named-0.016
 weep-0.016
Token a
Token ID257
Feature activation+7.09e+00

Pos logit contributions
e+0.132
 ha+0.129
 weep+0.126
ñ+0.116
 comb+0.113
Neg logit contributions
lights-0.120
ires-0.119
udd-0.117
s-0.110
 They-0.107
Token boy
Token ID2933
Feature activation+8.93e+00

Pos logit contributions
e+0.266
ez+0.212
ñ+0.198
tail+0.193
years+0.193
Neg logit contributions
uddle-0.111
udd-0.110
s-0.101
lights-0.098
 new-0.094
Token named
Token ID3706
Feature activation+6.32e+00

Pos logit contributions
nam+0.132
 ha+0.130
Rober+0.125
e+0.125
ñ+0.122
Neg logit contributions
s-0.283
lights-0.260
udd-0.256
orting-0.241
ails-0.236
Token Jack
Token ID3619
Feature activation+4.20e+00

Pos logit contributions
e+0.159
 ha+0.117
 read+0.108
 weep+0.105
ñ+0.104
Neg logit contributions
lights-0.193
orting-0.173
udd-0.172
ires-0.168
angles-0.168
Token.
Token ID13
Feature activation+3.25e+00

Pos logit contributions
e+0.134
 ha+0.112
ñ+0.102
 weep+0.101
 read+0.101
Neg logit contributions
lights-0.092
udd-0.083
s-0.081
orting-0.079
ires-0.076
Token He
Token ID679
Feature activation-2.76e-01

Pos logit contributions
e+0.072
 read+0.051
 named+0.046
 ha+0.044
 weep+0.043
Neg logit contributions
lights-0.107
udd-0.098
orting-0.097
rows-0.096
uddle-0.093
Token went
Token ID1816
Feature activation+2.03e+00

Pos logit contributions
s+0.004
udd+0.004
lights+0.003
 They+0.003
falls+0.003
Neg logit contributions
e-0.009
nam-0.008
 ha-0.008
 fright-0.008
bug-0.008
Token time
Token ID640
Feature activation+1.49e+00

Pos logit contributions
e+0.119
 ha+0.091
nam+0.087
 read+0.085
 weep+0.084
Neg logit contributions
lights-0.131
udd-0.125
orting-0.119
ires-0.116
s-0.114
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
e+0.043
 ha+0.033
 read+0.032
ñ+0.031
 weep+0.031
Neg logit contributions
lights-0.038
orting-0.034
udd-0.034
ires-0.033
s-0.033
Token there
Token ID612
Feature activation-9.42e-01

Pos logit contributions
e+0.102
ñ+0.080
 ha+0.079
 read+0.076
 weep+0.075
Neg logit contributions
lights-0.095
udd-0.085
ires-0.085
orting-0.083
s-0.083
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
lights+0.028
orting+0.026
udd+0.025
ires+0.024
angles+0.024
Neg logit contributions
e-0.025
 ha-0.018
 read-0.017
 named-0.016
 weep-0.016
Token a
Token ID257
Feature activation+7.09e+00

Pos logit contributions
e+0.132
 ha+0.129
 weep+0.126
ñ+0.116
 comb+0.113
Neg logit contributions
lights-0.120
ires-0.119
udd-0.117
s-0.110
 They-0.107
Token boy
Token ID2933
Feature activation+8.93e+00

Pos logit contributions
e+0.266
ez+0.212
ñ+0.198
tail+0.193
years+0.193
Neg logit contributions
uddle-0.111
udd-0.110
s-0.101
lights-0.098
 new-0.094
Token named
Token ID3706
Feature activation+6.32e+00

Pos logit contributions
nam+0.132
 ha+0.130
Rober+0.125
e+0.125
ñ+0.122
Neg logit contributions
s-0.283
lights-0.260
udd-0.256
orting-0.241
ails-0.236
Token Paul
Token ID3362
Feature activation+5.66e+00

Pos logit contributions
e+0.159
 ha+0.117
 read+0.108
 weep+0.105
ñ+0.104
Neg logit contributions
lights-0.193
orting-0.173
udd-0.172
ires-0.168
angles-0.168
Token.
Token ID13
Feature activation+3.17e+00

Pos logit contributions
e+0.166
 ha+0.133
 read+0.122
 weep+0.121
ñ+0.119
Neg logit contributions
lights-0.144
udd-0.128
orting-0.126
s-0.124
angles-0.122
Token He
Token ID679
Feature activation-3.31e-01

Pos logit contributions
e+0.075
 read+0.063
 named+0.058
 Amen+0.058
 ha+0.056
Neg logit contributions
lights-0.086
uddle-0.083
udd-0.083
orting-0.082
rows-0.081
Token wanted
Token ID2227
Feature activation+9.74e-01

Pos logit contributions
s+0.005
udd+0.005
lights+0.004
 They+0.004
falls+0.004
Neg logit contributions
e-0.011
nam-0.010
 ha-0.010
 fright-0.009
bug-0.009
Token time
Token ID640
Feature activation+1.49e+00

Pos logit contributions
e+0.119
 ha+0.091
nam+0.087
 read+0.085
 weep+0.084
Neg logit contributions
lights-0.131
udd-0.125
orting-0.119
ires-0.116
s-0.114
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
e+0.043
 ha+0.033
 read+0.032
ñ+0.031
 weep+0.031
Neg logit contributions
lights-0.038
orting-0.034
udd-0.034
ires-0.033
s-0.033
Token there
Token ID612
Feature activation-9.42e-01

Pos logit contributions
e+0.102
ñ+0.080
 ha+0.079
 read+0.076
 weep+0.075
Neg logit contributions
lights-0.095
udd-0.085
ires-0.085
orting-0.083
s-0.083
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
lights+0.028
orting+0.026
udd+0.025
ires+0.024
angles+0.024
Neg logit contributions
e-0.025
 ha-0.018
 read-0.017
 named-0.016
 weep-0.016
Token a
Token ID257
Feature activation+7.09e+00

Pos logit contributions
e+0.132
 ha+0.129
 weep+0.126
ñ+0.116
 comb+0.113
Neg logit contributions
lights-0.120
ires-0.119
udd-0.117
s-0.110
 They-0.107
Token boy
Token ID2933
Feature activation+8.93e+00

Pos logit contributions
e+0.266
ez+0.212
ñ+0.198
tail+0.193
years+0.193
Neg logit contributions
uddle-0.111
udd-0.110
s-0.101
lights-0.098
 new-0.094
Token named
Token ID3706
Feature activation+6.32e+00

Pos logit contributions
nam+0.132
 ha+0.130
Rober+0.125
e+0.125
ñ+0.122
Neg logit contributions
s-0.283
lights-0.260
udd-0.256
orting-0.241
ails-0.236
Token Tim
Token ID5045
Feature activation+4.72e+00

Pos logit contributions
e+0.159
 ha+0.117
 read+0.108
 weep+0.105
ñ+0.104
Neg logit contributions
lights-0.193
orting-0.173
udd-0.172
ires-0.168
angles-0.168
Tokenmy
Token ID1820
Feature activation+4.81e+00

Pos logit contributions
e+0.134
 ha+0.109
ñ+0.099
 read+0.097
 weep+0.096
Neg logit contributions
lights-0.120
udd-0.111
orting-0.108
s-0.106
angles-0.104
Token who
Token ID508
Feature activation+3.32e+00

Pos logit contributions
e+0.158
 ha+0.137
ñ+0.124
nam+0.123
 weep+0.122
Neg logit contributions
lights-0.094
udd-0.088
s-0.088
orting-0.081
ires-0.076
Token loved
Token ID6151
Feature activation+3.07e+00

Pos logit contributions
e+0.092
 ha+0.084
nam+0.076
ñ+0.075
bug+0.074
Neg logit contributions
udd-0.071
s-0.070
lights-0.066
ires-0.059
orting-0.059
Token time
Token ID640
Feature activation+1.49e+00

Pos logit contributions
e+0.119
 ha+0.091
nam+0.087
 read+0.085
 weep+0.084
Neg logit contributions
lights-0.131
udd-0.125
orting-0.119
ires-0.116
s-0.114
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
e+0.043
 ha+0.033
 read+0.032
ñ+0.031
 weep+0.031
Neg logit contributions
lights-0.038
orting-0.034
udd-0.034
ires-0.033
s-0.033
Token there
Token ID612
Feature activation-9.42e-01

Pos logit contributions
e+0.102
ñ+0.080
 ha+0.079
 read+0.076
 weep+0.075
Neg logit contributions
lights-0.095
udd-0.085
ires-0.085
orting-0.083
s-0.083
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
lights+0.028
orting+0.026
udd+0.025
ires+0.024
angles+0.024
Neg logit contributions
e-0.025
 ha-0.018
 read-0.017
 named-0.016
 weep-0.016
Token a
Token ID257
Feature activation+7.09e+00

Pos logit contributions
e+0.132
 ha+0.129
 weep+0.126
ñ+0.116
 comb+0.113
Neg logit contributions
lights-0.120
ires-0.119
udd-0.117
s-0.110
 They-0.107
Token boy
Token ID2933
Feature activation+8.93e+00

Pos logit contributions
e+0.266
ez+0.212
ñ+0.198
tail+0.193
years+0.193
Neg logit contributions
uddle-0.111
udd-0.110
s-0.101
lights-0.098
 new-0.094
Token named
Token ID3706
Feature activation+6.32e+00

Pos logit contributions
nam+0.132
 ha+0.130
Rober+0.125
e+0.125
ñ+0.122
Neg logit contributions
s-0.283
lights-0.260
udd-0.256
orting-0.241
ails-0.236
Token Tim
Token ID5045
Feature activation+4.72e+00

Pos logit contributions
e+0.159
 ha+0.117
 read+0.108
 weep+0.105
ñ+0.104
Neg logit contributions
lights-0.193
orting-0.173
udd-0.172
ires-0.168
angles-0.168
Tokenmy
Token ID1820
Feature activation+4.81e+00

Pos logit contributions
e+0.134
 ha+0.109
ñ+0.099
 read+0.097
 weep+0.096
Neg logit contributions
lights-0.120
udd-0.111
orting-0.108
s-0.106
angles-0.104
Token.
Token ID13
Feature activation+3.31e+00

Pos logit contributions
e+0.158
 ha+0.137
ñ+0.124
nam+0.123
 weep+0.122
Neg logit contributions
lights-0.094
udd-0.088
s-0.088
orting-0.081
ires-0.076
Token He
Token ID679
Feature activation-3.09e-01

Pos logit contributions
e+0.065
 read+0.042
 named+0.041
 ha+0.040
 weep+0.037
Neg logit contributions
lights-0.116
udd-0.106
orting-0.106
ires-0.105
angles-0.104
Token time
Token ID640
Feature activation+1.49e+00

Pos logit contributions
e+0.119
 ha+0.091
nam+0.087
 read+0.085
 weep+0.084
Neg logit contributions
lights-0.131
udd-0.125
orting-0.119
ires-0.116
s-0.114
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
e+0.043
 ha+0.033
 read+0.032
ñ+0.031
 weep+0.031
Neg logit contributions
lights-0.038
orting-0.034
udd-0.034
ires-0.033
s-0.033
Token there
Token ID612
Feature activation-9.42e-01

Pos logit contributions
e+0.102
ñ+0.080
 ha+0.079
 read+0.076
 weep+0.075
Neg logit contributions
lights-0.095
udd-0.085
ires-0.085
orting-0.083
s-0.083
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
lights+0.028
orting+0.026
udd+0.025
ires+0.024
angles+0.024
Neg logit contributions
e-0.025
 ha-0.018
 read-0.017
 named-0.016
 weep-0.016
Token a
Token ID257
Feature activation+7.09e+00

Pos logit contributions
e+0.132
 ha+0.129
 weep+0.126
ñ+0.116
 comb+0.113
Neg logit contributions
lights-0.120
ires-0.119
udd-0.117
s-0.110
 They-0.107
Token boy
Token ID2933
Feature activation+8.93e+00

Pos logit contributions
e+0.266
ez+0.212
ñ+0.198
tail+0.193
years+0.193
Neg logit contributions
uddle-0.111
udd-0.110
s-0.101
lights-0.098
 new-0.094
Token named
Token ID3706
Feature activation+6.32e+00

Pos logit contributions
nam+0.132
 ha+0.130
Rober+0.125
e+0.125
ñ+0.122
Neg logit contributions
s-0.283
lights-0.260
udd-0.256
orting-0.241
ails-0.236
Token Joey
Token ID26154
Feature activation+4.50e+00

Pos logit contributions
e+0.159
 ha+0.117
 read+0.108
 weep+0.105
ñ+0.104
Neg logit contributions
lights-0.193
orting-0.173
udd-0.172
ires-0.168
angles-0.168
Token.
Token ID13
Feature activation+3.17e+00

Pos logit contributions
e+0.139
 ha+0.119
ñ+0.110
 read+0.107
 weep+0.106
Neg logit contributions
lights-0.096
s-0.090
udd-0.090
orting-0.085
ires-0.083
Token One
Token ID1881
Feature activation+5.94e-01

Pos logit contributions
e+0.056
 named+0.044
 3+0.037
 Amen+0.036
 read+0.034
Neg logit contributions
lights-0.111
ups-0.104
angles-0.102
cases-0.102
udd-0.100
Token day
Token ID1110
Feature activation+1.15e+00

Pos logit contributions
e+0.019
 ha+0.017
ñ+0.016
nam+0.015
 comb+0.015
Neg logit contributions
lights-0.012
s-0.011
udd-0.011
ires-0.010
orting-0.010
Token time
Token ID640
Feature activation+1.49e+00

Pos logit contributions
e+0.119
 ha+0.091
nam+0.087
 read+0.085
 weep+0.084
Neg logit contributions
lights-0.131
udd-0.125
orting-0.119
ires-0.116
s-0.114
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
e+0.043
 ha+0.033
 read+0.032
ñ+0.031
 weep+0.031
Neg logit contributions
lights-0.038
orting-0.034
udd-0.034
ires-0.033
s-0.033
Token there
Token ID612
Feature activation-9.42e-01

Pos logit contributions
e+0.102
ñ+0.080
 ha+0.079
 read+0.076
 weep+0.075
Neg logit contributions
lights-0.095
udd-0.085
ires-0.085
orting-0.083
s-0.083
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
lights+0.028
orting+0.026
udd+0.025
ires+0.024
angles+0.024
Neg logit contributions
e-0.025
 ha-0.018
 read-0.017
 named-0.016
 weep-0.016
Token a
Token ID257
Feature activation+7.09e+00

Pos logit contributions
e+0.132
 ha+0.129
 weep+0.126
ñ+0.116
 comb+0.113
Neg logit contributions
lights-0.120
ires-0.119
udd-0.117
s-0.110
 They-0.107
Token boy
Token ID2933
Feature activation+8.93e+00

Pos logit contributions
e+0.266
ez+0.212
ñ+0.198
tail+0.193
years+0.193
Neg logit contributions
uddle-0.111
udd-0.110
s-0.101
lights-0.098
 new-0.094
Token called
Token ID1444
Feature activation+6.29e+00

Pos logit contributions
nam+0.132
 ha+0.130
Rober+0.125
e+0.125
ñ+0.122
Neg logit contributions
s-0.283
lights-0.260
udd-0.256
orting-0.241
ails-0.236
Token Bob
Token ID5811
Feature activation+5.07e+00

Pos logit contributions
e+0.179
 weep+0.137
 comb+0.136
 ha+0.135
 read+0.133
Neg logit contributions
lights-0.148
orting-0.137
falls-0.133
s-0.130
ires-0.129
Token.
Token ID13
Feature activation+2.97e+00

Pos logit contributions
e+0.158
 ha+0.138
 weep+0.126
nam+0.125
ñ+0.125
Neg logit contributions
lights-0.107
s-0.101
udd-0.098
orting-0.094
angles-0.089
Token Bob
Token ID5811
Feature activation+3.66e+00

Pos logit contributions
e+0.057
 read+0.037
 ha+0.036
 named+0.036
 Amen+0.033
Neg logit contributions
lights-0.107
udd-0.099
orting-0.098
angles-0.096
ires-0.094
Token was
Token ID373
Feature activation+3.84e+00

Pos logit contributions
e+0.098
 ha+0.087
nam+0.081
ñ+0.081
 weep+0.078
Neg logit contributions
lights-0.085
s-0.084
udd-0.082
orting-0.075
ires-0.074
Token time
Token ID640
Feature activation+1.49e+00

Pos logit contributions
e+0.119
 ha+0.091
nam+0.087
 read+0.085
 weep+0.084
Neg logit contributions
lights-0.131
udd-0.125
orting-0.119
ires-0.116
s-0.114
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
e+0.043
 ha+0.033
 read+0.032
ñ+0.031
 weep+0.031
Neg logit contributions
lights-0.038
orting-0.034
udd-0.034
ires-0.033
s-0.033
Token there
Token ID612
Feature activation-9.42e-01

Pos logit contributions
e+0.102
ñ+0.080
 ha+0.079
 read+0.076
 weep+0.075
Neg logit contributions
lights-0.095
udd-0.085
ires-0.085
orting-0.083
s-0.083
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
lights+0.028
orting+0.026
udd+0.025
ires+0.024
angles+0.024
Neg logit contributions
e-0.025
 ha-0.018
 read-0.017
 named-0.016
 weep-0.016
Token a
Token ID257
Feature activation+7.09e+00

Pos logit contributions
e+0.132
 ha+0.129
 weep+0.126
ñ+0.116
 comb+0.113
Neg logit contributions
lights-0.120
ires-0.119
udd-0.117
s-0.110
 They-0.107
Token boy
Token ID2933
Feature activation+8.93e+00

Pos logit contributions
e+0.266
ez+0.212
ñ+0.198
tail+0.193
years+0.193
Neg logit contributions
uddle-0.111
udd-0.110
s-0.101
lights-0.098
 new-0.094
Token named
Token ID3706
Feature activation+6.32e+00

Pos logit contributions
nam+0.132
 ha+0.130
Rober+0.125
e+0.125
ñ+0.122
Neg logit contributions
s-0.283
lights-0.260
udd-0.256
orting-0.241
ails-0.236
Token Tim
Token ID5045
Feature activation+4.72e+00

Pos logit contributions
e+0.159
 ha+0.117
 read+0.108
 weep+0.105
ñ+0.104
Neg logit contributions
lights-0.193
orting-0.173
udd-0.172
ires-0.168
angles-0.168
Tokenmy
Token ID1820
Feature activation+4.81e+00

Pos logit contributions
e+0.134
 ha+0.109
ñ+0.099
 read+0.097
 weep+0.096
Neg logit contributions
lights-0.120
udd-0.111
orting-0.108
s-0.106
angles-0.104
Token.
Token ID13
Feature activation+3.31e+00

Pos logit contributions
e+0.158
 ha+0.137
ñ+0.124
nam+0.123
 weep+0.122
Neg logit contributions
lights-0.094
udd-0.088
s-0.088
orting-0.081
ires-0.076
Token Tim
Token ID5045
Feature activation+2.87e+00

Pos logit contributions
e+0.065
 read+0.042
 named+0.041
 ha+0.040
 weep+0.037
Neg logit contributions
lights-0.116
udd-0.106
orting-0.106
ires-0.105
angles-0.104
Token time
Token ID640
Feature activation+1.49e+00

Pos logit contributions
e+0.119
 ha+0.091
nam+0.087
 read+0.085
 weep+0.084
Neg logit contributions
lights-0.131
udd-0.125
orting-0.119
ires-0.116
s-0.114
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
e+0.043
 ha+0.033
 read+0.032
ñ+0.031
 weep+0.031
Neg logit contributions
lights-0.038
orting-0.034
udd-0.034
ires-0.033
s-0.033
Token there
Token ID612
Feature activation-9.42e-01

Pos logit contributions
e+0.102
ñ+0.080
 ha+0.079
 read+0.076
 weep+0.075
Neg logit contributions
lights-0.095
udd-0.085
ires-0.085
orting-0.083
s-0.083
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
lights+0.028
orting+0.026
udd+0.025
ires+0.024
angles+0.024
Neg logit contributions
e-0.025
 ha-0.018
 read-0.017
 named-0.016
 weep-0.016
Token a
Token ID257
Feature activation+7.09e+00

Pos logit contributions
e+0.132
 ha+0.129
 weep+0.126
ñ+0.116
 comb+0.113
Neg logit contributions
lights-0.120
ires-0.119
udd-0.117
s-0.110
 They-0.107
Token boy
Token ID2933
Feature activation+8.93e+00

Pos logit contributions
e+0.266
ez+0.212
ñ+0.198
tail+0.193
years+0.193
Neg logit contributions
uddle-0.111
udd-0.110
s-0.101
lights-0.098
 new-0.094
Token named
Token ID3706
Feature activation+6.32e+00

Pos logit contributions
nam+0.132
 ha+0.130
Rober+0.125
e+0.125
ñ+0.122
Neg logit contributions
s-0.283
lights-0.260
udd-0.256
orting-0.241
ails-0.236
Token Tim
Token ID5045
Feature activation+4.72e+00

Pos logit contributions
e+0.159
 ha+0.117
 read+0.108
 weep+0.105
ñ+0.104
Neg logit contributions
lights-0.193
orting-0.173
udd-0.172
ires-0.168
angles-0.168
Tokenmy
Token ID1820
Feature activation+4.81e+00

Pos logit contributions
e+0.134
 ha+0.109
ñ+0.099
 read+0.097
 weep+0.096
Neg logit contributions
lights-0.120
udd-0.111
orting-0.108
s-0.106
angles-0.104
Token.
Token ID13
Feature activation+3.31e+00

Pos logit contributions
e+0.158
 ha+0.137
ñ+0.124
nam+0.123
 weep+0.122
Neg logit contributions
lights-0.094
udd-0.088
s-0.088
orting-0.081
ires-0.076
Token Tim
Token ID5045
Feature activation+2.87e+00

Pos logit contributions
e+0.065
 read+0.042
 named+0.041
 ha+0.040
 weep+0.037
Neg logit contributions
lights-0.116
udd-0.106
orting-0.106
ires-0.105
angles-0.104
Token time
Token ID640
Feature activation+1.49e+00

Pos logit contributions
e+0.119
 ha+0.091
nam+0.087
 read+0.085
 weep+0.084
Neg logit contributions
lights-0.131
udd-0.125
orting-0.119
ires-0.116
s-0.114
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
e+0.043
 ha+0.033
 read+0.032
ñ+0.031
 weep+0.031
Neg logit contributions
lights-0.038
orting-0.034
udd-0.034
ires-0.033
s-0.033
Token there
Token ID612
Feature activation-9.42e-01

Pos logit contributions
e+0.102
ñ+0.080
 ha+0.079
 read+0.076
 weep+0.075
Neg logit contributions
lights-0.095
udd-0.085
ires-0.085
orting-0.083
s-0.083
Token was
Token ID373
Feature activation+5.57e+00

Pos logit contributions
lights+0.028
orting+0.026
udd+0.025
ires+0.024
angles+0.024
Neg logit contributions
e-0.025
 ha-0.018
 read-0.017
 named-0.016
 weep-0.016
Token a
Token ID257
Feature activation+7.09e+00

Pos logit contributions
e+0.132
 ha+0.129
 weep+0.126
ñ+0.116
 comb+0.113
Neg logit contributions
lights-0.120
ires-0.119
udd-0.117
s-0.110
 They-0.107
Token boy
Token ID2933
Feature activation+8.93e+00

Pos logit contributions
e+0.266
ez+0.212
ñ+0.198
tail+0.193
years+0.193
Neg logit contributions
uddle-0.111
udd-0.110
s-0.101
lights-0.098
 new-0.094
Token named
Token ID3706
Feature activation+6.32e+00

Pos logit contributions
nam+0.132
 ha+0.130
Rober+0.125
e+0.125
ñ+0.122
Neg logit contributions
s-0.283
lights-0.260
udd-0.256
orting-0.241
ails-0.236
Token Alex
Token ID4422
Feature activation+4.96e+00

Pos logit contributions
e+0.159
 ha+0.117
 read+0.108
 weep+0.105
ñ+0.104
Neg logit contributions
lights-0.193
orting-0.173
udd-0.172
ires-0.168
angles-0.168
Token.
Token ID13
Feature activation+3.17e+00

Pos logit contributions
e+0.162
 ha+0.144
ñ+0.133
nam+0.132
 weep+0.130
Neg logit contributions
lights-0.094
s-0.089
udd-0.083
orting-0.077
ires-0.075
Token He
Token ID679
Feature activation-3.19e-01

Pos logit contributions
e+0.090
 read+0.069
 ha+0.066
 named+0.062
 Amen+0.061
Neg logit contributions
lights-0.082
orting-0.078
udd-0.076
rows-0.073
ires-0.070
Token was
Token ID373
Feature activation+3.41e+00

Pos logit contributions
s+0.005
udd+0.005
lights+0.004
 They+0.004
falls+0.004
Neg logit contributions
e-0.010
nam-0.010
 ha-0.009
 fright-0.009
bug-0.009
Token time
Token ID640
Feature activation+2.30e+00

Pos logit contributions
e+0.126
 ha+0.101
nam+0.095
 read+0.090
ñ+0.090
Neg logit contributions
lights-0.133
udd-0.126
orting-0.120
s-0.117
ires-0.116
Token,
Token ID11
Feature activation+3.11e+00

Pos logit contributions
e+0.065
 ha+0.049
 read+0.047
 weep+0.045
ñ+0.045
Neg logit contributions
lights-0.062
orting-0.055
udd-0.055
ires-0.053
s-0.053
Token there
Token ID612
Feature activation-1.34e+00

Pos logit contributions
e+0.088
ñ+0.074
 ha+0.072
 comb+0.068
 read+0.068
Neg logit contributions
lights-0.078
s-0.070
ires-0.067
orting-0.066
udd-0.064
Token was
Token ID373
Feature activation+5.65e+00

Pos logit contributions
lights+0.035
udd+0.030
orting+0.030
s+0.030
ires+0.029
Neg logit contributions
e-0.039
 ha-0.030
 read-0.028
ñ-0.028
 weep-0.028
Token a
Token ID257
Feature activation+7.05e+00

Pos logit contributions
 ha+0.152
e+0.147
 weep+0.140
ñ+0.135
 disob+0.132
Neg logit contributions
ires-0.107
lights-0.107
 They-0.104
udd-0.099
s-0.094
Token boy
Token ID2933
Feature activation+8.92e+00

Pos logit contributions
e+0.261
ez+0.200
years+0.186
ñ+0.185
tail+0.184
Neg logit contributions
uddle-0.118
udd-0.117
s-0.109
lights-0.108
 new-0.097
Token named
Token ID3706
Feature activation+5.80e+00

Pos logit contributions
e+0.126
ñ+0.123
 ha+0.120
nam+0.119
Rober+0.117
Neg logit contributions
s-0.278
lights-0.264
udd-0.259
orting-0.254
ails-0.244
Token Joe
Token ID5689
Feature activation+5.32e+00

Pos logit contributions
e+0.148
 ha+0.110
 read+0.101
 weep+0.099
ñ+0.098
Neg logit contributions
lights-0.173
orting-0.156
udd-0.155
ires-0.151
angles-0.150
Token.
Token ID13
Feature activation+2.51e+00

Pos logit contributions
e+0.176
 ha+0.156
ñ+0.144
nam+0.138
 read+0.138
Neg logit contributions
lights-0.101
s-0.091
udd-0.091
orting-0.086
ires-0.081
Token He
Token ID679
Feature activation-6.07e-01

Pos logit contributions
e+0.054
 read+0.036
 named+0.034
 3+0.030
 ha+0.029
Neg logit contributions
lights-0.084
orting-0.077
udd-0.077
rows-0.073
uddle-0.073
Token was
Token ID373
Feature activation+3.54e+00

Pos logit contributions
s+0.010
udd+0.009
lights+0.008
 They+0.007
orting+0.006
Neg logit contributions
e-0.019
nam-0.018
 ha-0.018
 fright-0.017
bug-0.017
Token time
Token ID640
Feature activation+3.10e+00

Pos logit contributions
e+0.145
 ha+0.121
nam+0.118
ñ+0.110
 comb+0.110
Neg logit contributions
lights-0.130
udd-0.128
s-0.117
orting-0.117
ires-0.116
Token,
Token ID11
Feature activation+3.46e+00

Pos logit contributions
e+0.089
 ha+0.069
 read+0.066
ñ+0.064
 comb+0.064
Neg logit contributions
lights-0.081
orting-0.071
udd-0.071
s-0.069
ires-0.069
Token there
Token ID612
Feature activation-1.04e+00

Pos logit contributions
e+0.095
ñ+0.082
 ha+0.079
 read+0.073
 comb+0.072
Neg logit contributions
lights-0.089
s-0.078
ires-0.078
orting-0.074
udd-0.073
Token was
Token ID373
Feature activation+5.59e+00

Pos logit contributions
lights+0.027
udd+0.024
orting+0.024
s+0.023
ires+0.023
Neg logit contributions
e-0.030
 ha-0.023
 read-0.022
ñ-0.022
 weep-0.021
Token a
Token ID257
Feature activation+7.37e+00

Pos logit contributions
 ha+0.157
e+0.152
 weep+0.146
ñ+0.143
 disob+0.138
Neg logit contributions
ires-0.098
lights-0.096
 They-0.095
udd-0.086
 new-0.083
Token boy
Token ID2933
Feature activation+8.91e+00

Pos logit contributions
e+0.276
ez+0.215
ñ+0.197
years+0.196
tail+0.196
Neg logit contributions
uddle-0.121
udd-0.118
lights-0.108
s-0.107
 new-0.102
Token who
Token ID508
Feature activation+3.42e+00

Pos logit contributions
e+0.126
ñ+0.124
nam+0.121
 ha+0.119
Rober+0.118
Neg logit contributions
s-0.274
lights-0.261
udd-0.253
orting-0.249
ails-0.238
Token lived
Token ID5615
Feature activation+3.79e+00

Pos logit contributions
e+0.094
 ha+0.082
ñ+0.075
bug+0.073
nam+0.073
Neg logit contributions
lights-0.075
s-0.073
udd-0.073
orting-0.065
ires-0.063
Token in
Token ID287
Feature activation+6.81e+00

Pos logit contributions
e+0.099
ñ+0.090
 ha+0.082
tail+0.079
 weep+0.078
Neg logit contributions
ires-0.086
udd-0.084
s-0.083
lights-0.079
orting-0.079
Token the
Token ID262
Feature activation+2.66e+00

Pos logit contributions
e+0.143
 ha+0.128
ñ+0.124
bug+0.119
 stub+0.113
Neg logit contributions
lights-0.174
ires-0.159
 They-0.146
s-0.144
orting-0.143
Token forest
Token ID8222
Feature activation+4.07e+00

Pos logit contributions
e+0.079
 ha+0.061
 read+0.057
ñ+0.057
 weep+0.056
Neg logit contributions
lights-0.067
udd-0.060
orting-0.059
ires-0.057
falls-0.056
Tokenp
Token ID79
Feature activation-1.48e+00

Pos logit contributions
lights+0.100
udd+0.091
angles+0.089
uddle+0.088
orting+0.088
Neg logit contributions
e-0.067
 read-0.040
 weep-0.040
 ha-0.039
 named-0.038
Token.
Token ID13
Feature activation-8.48e-01

Pos logit contributions
lights+0.041
udd+0.037
orting+0.037
ires+0.036
angles+0.036
Neg logit contributions
e-0.041
 ha-0.030
 read-0.029
 weep-0.028
 named-0.028
Token It
Token ID632
Feature activation-4.78e+00

Pos logit contributions
lights+0.025
angles+0.022
udd+0.021
orting+0.021
cases+0.021
Neg logit contributions
e-0.019
 ha-0.016
 named-0.016
 raven-0.015
 read-0.015
Token fl
Token ID781
Feature activation-3.16e+00

Pos logit contributions
lights+0.152
orting+0.138
ires+0.135
udd+0.134
angles+0.133
Neg logit contributions
e-0.115
 read-0.083
 ha-0.083
 named-0.076
 comb-0.074
Tokenapped
Token ID6320
Feature activation-5.37e+00

Pos logit contributions
lights+0.085
falls+0.076
orting+0.072
angles+0.071
cases+0.070
Neg logit contributions
e-0.091
bug-0.064
 named-0.060
 ha-0.059
years-0.056
Token its
Token ID663
Feature activation-7.73e+00

Pos logit contributions
lights+0.189
udd+0.178
orting+0.178
falls+0.171
angles+0.168
Neg logit contributions
e-0.111
 read-0.066
 ha-0.065
 named-0.063
 weep-0.062
Token wings
Token ID12098
Feature activation-5.10e+00

Pos logit contributions
lights+0.240
orting+0.215
udd+0.212
ires+0.209
angles+0.208
Neg logit contributions
e-0.197
 ha-0.140
 read-0.134
 weep-0.129
 comb-0.125
Token.
Token ID13
Feature activation-6.99e-01

Pos logit contributions
lights+0.165
ires+0.159
angles+0.156
orting+0.148
udd+0.145
Neg logit contributions
e-0.122
 read-0.083
 comb-0.074
 named-0.073
 weep-0.072
Token It
Token ID632
Feature activation-4.71e+00

Pos logit contributions
lights+0.023
angles+0.020
udd+0.020
orting+0.020
s+0.019
Neg logit contributions
e-0.015
 ha-0.012
 named-0.011
 read-0.010
Ste-0.010
Token could
Token ID714
Feature activation+3.08e+00

Pos logit contributions
lights+0.149
orting+0.136
udd+0.133
ires+0.132
angles+0.130
Neg logit contributions
e-0.113
 ha-0.081
 read-0.079
 weep-0.072
 named-0.072
Token fly
Token ID6129
Feature activation-4.60e-01

Pos logit contributions
e+0.067
 ha+0.047
 read+0.042
ñ+0.041
 weep+0.040
Neg logit contributions
lights-0.103
orting-0.093
udd-0.092
ires-0.091
angles-0.090
Token welcome
Token ID7062
Feature activation-1.40e-01

Pos logit contributions
e+0.018
 ha+0.014
ñ+0.013
 read+0.013
 weep+0.012
Neg logit contributions
lights-0.020
orting-0.018
udd-0.018
ires-0.018
falls-0.018
Token!"
Token ID2474
Feature activation-7.87e-01

Pos logit contributions
lights+0.003
orting+0.003
s+0.003
udd+0.003
ires+0.003
Neg logit contributions
e-0.004
 ha-0.003
 weep-0.003
 read-0.003
nam-0.003
Token It
Token ID632
Feature activation-4.17e+00

Pos logit contributions
lights+0.021
ires+0.019
angles+0.019
udd+0.018
falls+0.018
Neg logit contributions
e-0.022
 named-0.017
 read-0.017
 ha-0.017
bug-0.016
Token fl
Token ID781
Feature activation-2.69e+00

Pos logit contributions
lights+0.136
orting+0.124
ires+0.122
angles+0.120
udd+0.120
Neg logit contributions
e-0.100
 read-0.069
 ha-0.069
 named-0.066
 weep-0.062
Tokenapped
Token ID6320
Feature activation-5.41e+00

Pos logit contributions
lights+0.077
falls+0.066
der+0.065
angles+0.065
cases+0.064
Neg logit contributions
e-0.076
 named-0.051
bug-0.050
 ha-0.045
years-0.043
Token its
Token ID663
Feature activation-7.60e+00

Pos logit contributions
lights+0.188
udd+0.177
orting+0.177
falls+0.174
cases+0.174
Neg logit contributions
e-0.115
 read-0.069
 named-0.066
 weep-0.065
 ha-0.061
Token wings
Token ID12098
Feature activation-5.15e+00

Pos logit contributions
lights+0.242
orting+0.219
udd+0.215
angles+0.212
ires+0.212
Neg logit contributions
e-0.189
 ha-0.130
 read-0.125
 weep-0.123
 comb-0.116
Token and
Token ID290
Feature activation-1.18e+00

Pos logit contributions
lights+0.166
ires+0.162
angles+0.158
orting+0.149
udd+0.146
Neg logit contributions
e-0.123
 read-0.082
 named-0.075
 yawn-0.073
 comb-0.073
Token flew
Token ID13112
Feature activation-3.24e+00

Pos logit contributions
lights+0.039
orting+0.036
ires+0.036
udd+0.036
angles+0.035
Neg logit contributions
e-0.026
 ha-0.019
 read-0.018
 named-0.016
 weep-0.016
Token away
Token ID1497
Feature activation-2.87e+00

Pos logit contributions
lights+0.106
ires+0.098
udd+0.097
angles+0.096
orting+0.095
Neg logit contributions
e-0.074
 ha-0.048
 named-0.046
 read-0.045
 fright-0.044
Token.
Token ID13
Feature activation-9.39e-01

Pos logit contributions
lights+0.085
angles+0.078
orting+0.078
udd+0.076
ires+0.075
Neg logit contributions
e-0.074
 ha-0.055
 read-0.053
 weep-0.051
 named-0.050
Tokenped
Token ID9124
Feature activation-3.18e+00

Pos logit contributions
lights+0.139
orting+0.123
udd+0.122
angles+0.119
ires+0.119
Neg logit contributions
e-0.112
 ha-0.076
 read-0.071
ñ-0.068
 named-0.068
Token louder
Token ID27089
Feature activation-3.03e+00

Pos logit contributions
lights+0.082
orting+0.072
udd+0.072
s+0.070
ires+0.069
Neg logit contributions
e-0.093
 ha-0.074
 read-0.068
 weep-0.067
ñ-0.066
Token and
Token ID290
Feature activation+1.60e-01

Pos logit contributions
lights+0.092
angles+0.085
udd+0.084
orting+0.084
uddle+0.082
Neg logit contributions
e-0.080
 ha-0.055
 read-0.055
 named-0.051
 weep-0.048
Token fl
Token ID781
Feature activation-3.14e+00

Pos logit contributions
e+0.004
 ha+0.003
 read+0.003
 comb+0.003
 named+0.003
Neg logit contributions
lights-0.005
ires-0.004
udd-0.004
angles-0.004
orting-0.004
Tokenapped
Token ID6320
Feature activation-5.43e+00

Pos logit contributions
lights+0.083
falls+0.072
angles+0.069
orting+0.068
udd+0.065
Neg logit contributions
e-0.095
 ha-0.066
bug-0.066
 named-0.062
years-0.059
Token its
Token ID663
Feature activation-7.40e+00

Pos logit contributions
lights+0.181
udd+0.171
orting+0.170
falls+0.165
angles+0.161
Neg logit contributions
e-0.123
 read-0.076
 ha-0.075
 named-0.073
 weep-0.070
Token wing
Token ID8539
Feature activation-4.70e+00

Pos logit contributions
lights+0.209
udd+0.184
orting+0.184
ires+0.180
angles+0.177
Neg logit contributions
e-0.200
 ha-0.152
 read-0.143
ñ-0.137
 weep-0.137
Token.
Token ID13
Feature activation-7.30e-01

Pos logit contributions
lights+0.136
ires+0.123
angles+0.122
orting+0.122
udd+0.118
Neg logit contributions
e-0.129
 ha-0.092
 read-0.091
 comb-0.086
 weep-0.085
Token
Token ID198
Feature activation-3.19e+00

Pos logit contributions
lights+0.022
orting+0.019
udd+0.019
angles+0.018
falls+0.018
Neg logit contributions
e-0.019
 ha-0.014
 named-0.014
 read-0.014
 3-0.012
Token
Token ID198
Feature activation-2.28e+00

Pos logit contributions
lights+0.089
orting+0.077
angles+0.074
udd+0.074
falls+0.074
Neg logit contributions
e-0.092
 ha-0.069
 read-0.068
 named-0.066
 weep-0.062
Token"
Token ID1
Feature activation+2.75e+00

Pos logit contributions
lights+0.072
orting+0.064
udd+0.063
ires+0.062
angles+0.062
Neg logit contributions
e-0.057
 ha-0.042
 read-0.039
 named-0.036
ñ-0.036
Token him
Token ID683
Feature activation+7.19e-02

Pos logit contributions
lights+0.001
orting+0.001
s+0.001
udd+0.001
ires+0.001
Neg logit contributions
e-0.001
 ha-0.001
 read-0.001
nam-0.001
ñ-0.001
Token.
Token ID13
Feature activation-2.79e-01

Pos logit contributions
e+0.002
 ha+0.002
 read+0.001
 weep+0.001
ñ+0.001
Neg logit contributions
lights-0.002
orting-0.002
udd-0.002
ires-0.002
angles-0.002
Token It
Token ID632
Feature activation-4.26e+00

Pos logit contributions
lights+0.006
orting+0.005
udd+0.005
ires+0.005
angles+0.005
Neg logit contributions
e-0.009
 ha-0.007
 read-0.007
 weep-0.007
ñ-0.007
Token fl
Token ID781
Feature activation-2.81e+00

Pos logit contributions
lights+0.120
s+0.107
udd+0.103
falls+0.100
orting+0.099
Neg logit contributions
e-0.114
 ha-0.087
ñ-0.080
nam-0.077
 read-0.076
Tokenapped
Token ID6320
Feature activation-5.31e+00

Pos logit contributions
lights+0.067
falls+0.057
orting+0.057
angles+0.056
udd+0.055
Neg logit contributions
e-0.089
bug-0.066
 named-0.063
 ha-0.062
years-0.062
Token its
Token ID663
Feature activation-7.39e+00

Pos logit contributions
lights+0.178
udd+0.167
orting+0.166
ires+0.158
angles+0.158
Neg logit contributions
e-0.120
 read-0.074
 ha-0.073
 named-0.072
 weep-0.070
Token wings
Token ID12098
Feature activation-4.96e+00

Pos logit contributions
lights+0.218
orting+0.194
udd+0.193
ires+0.188
angles+0.186
Neg logit contributions
e-0.194
 ha-0.145
 read-0.136
 weep-0.130
ñ-0.129
Token and
Token ID290
Feature activation-8.25e-01

Pos logit contributions
lights+0.160
ires+0.150
angles+0.148
orting+0.142
udd+0.139
Neg logit contributions
e-0.122
 read-0.081
 named-0.075
 weep-0.074
 comb-0.073
Token made
Token ID925
Feature activation-2.70e+00

Pos logit contributions
lights+0.027
orting+0.024
udd+0.024
ires+0.023
angles+0.023
Neg logit contributions
e-0.019
 ha-0.014
 read-0.013
 weep-0.012
ñ-0.012
Token loud
Token ID7812
Feature activation-6.28e-01

Pos logit contributions
lights+0.088
orting+0.080
udd+0.079
ires+0.077
angles+0.077
Neg logit contributions
e-0.062
 ha-0.044
 read-0.041
 weep-0.038
ñ-0.038
Token noises
Token ID26782
Feature activation+3.06e-01

Pos logit contributions
lights+0.012
orting+0.010
udd+0.010
ires+0.009
angles+0.009
Neg logit contributions
e-0.023
 ha-0.019
 read-0.018
 weep-0.018
ñ-0.018
Token branch
Token ID8478
Feature activation+2.57e-01

Pos logit contributions
e+0.043
 ha+0.032
ñ+0.028
 read+0.028
bug+0.028
Neg logit contributions
lights-0.043
s-0.039
udd-0.038
orting-0.038
uddle-0.037
Token.
Token ID13
Feature activation-6.21e-01

Pos logit contributions
e+0.007
 read+0.005
 named+0.005
 ha+0.005
bug+0.004
Neg logit contributions
lights-0.008
ires-0.007
udd-0.007
angles-0.007
orting-0.007
Token He
Token ID679
Feature activation-2.59e+00

Pos logit contributions
lights+0.017
udd+0.015
angles+0.014
orting+0.014
ires+0.014
Neg logit contributions
e-0.018
 ha-0.014
 read-0.013
 named-0.013
bug-0.013
Token fl
Token ID781
Feature activation-3.65e+00

Pos logit contributions
lights+0.080
orting+0.072
udd+0.072
ires+0.070
angles+0.069
Neg logit contributions
e-0.064
 ha-0.046
 read-0.043
 weep-0.041
ñ-0.041
Tokenapped
Token ID6320
Feature activation-5.28e+00

Pos logit contributions
lights+0.091
falls+0.080
orting+0.078
udd+0.077
angles+0.073
Neg logit contributions
e-0.116
 ha-0.084
bug-0.082
 named-0.079
 weep-0.074
Token its
Token ID663
Feature activation-7.35e+00

Pos logit contributions
lights+0.176
udd+0.168
orting+0.166
ires+0.158
falls+0.156
Neg logit contributions
e-0.120
 read-0.076
 ha-0.075
 named-0.069
 weep-0.069
Token wings
Token ID12098
Feature activation-5.02e+00

Pos logit contributions
lights+0.235
orting+0.212
udd+0.210
angles+0.205
ires+0.204
Neg logit contributions
e-0.184
 ha-0.129
 read-0.122
 weep-0.117
 comb-0.116
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
lights+0.156
ires+0.146
angles+0.142
orting+0.140
udd+0.139
Neg logit contributions
e-0.127
 read-0.087
 comb-0.078
 named-0.078
 ha-0.078
Token tried
Token ID3088
Feature activation-2.79e+00

Pos logit contributions
lights+0.038
orting+0.034
udd+0.034
s+0.033
angles+0.032
Neg logit contributions
e-0.032
 ha-0.023
 read-0.021
 weep-0.021
ñ-0.021
Token to
Token ID284
Feature activation+1.41e+00

Pos logit contributions
lights+0.084
udd+0.074
orting+0.073
ires+0.072
angles+0.071
Neg logit contributions
e-0.074
 ha-0.055
 read-0.054
 comb-0.050
 weep-0.049
Token help
Token ID1037
Feature activation-1.25e+00

Pos logit contributions
e+0.033
 ha+0.025
ñ+0.025
years+0.024
nam+0.023
Neg logit contributions
lights-0.037
orting-0.034
angles-0.033
ires-0.033
falls-0.033
Token.
Token ID13
Feature activation-7.40e-01

Pos logit contributions
e+0.005
 ha+0.004
 read+0.004
 weep+0.003
 named+0.003
Neg logit contributions
lights-0.005
orting-0.004
udd-0.004
ires-0.004
angles-0.004
Token Then
Token ID3244
Feature activation-5.88e-02

Pos logit contributions
lights+0.021
udd+0.018
angles+0.018
orting+0.017
ires+0.017
Neg logit contributions
e-0.021
 ha-0.016
 named-0.015
 read-0.015
 weep-0.015
Token it
Token ID340
Feature activation-4.68e+00

Pos logit contributions
lights+0.002
orting+0.001
udd+0.001
ires+0.001
angles+0.001
Neg logit contributions
e-0.002
 ha-0.001
 read-0.001
 weep-0.001
ñ-0.001
Token fl
Token ID781
Feature activation-3.16e+00

Pos logit contributions
lights+0.149
orting+0.133
udd+0.133
ires+0.129
angles+0.129
Neg logit contributions
e-0.111
 ha-0.079
 read-0.074
 weep-0.070
ñ-0.070
Tokenapped
Token ID6320
Feature activation-5.38e+00

Pos logit contributions
lights+0.087
falls+0.079
orting+0.074
udd+0.071
angles+0.071
Neg logit contributions
e-0.091
bug-0.059
 ha-0.057
 named-0.057
years-0.054
Token its
Token ID663
Feature activation-7.33e+00

Pos logit contributions
lights+0.177
udd+0.164
orting+0.164
falls+0.158
ires+0.158
Neg logit contributions
e-0.125
 ha-0.079
 read-0.079
 weep-0.076
 named-0.075
Token wings
Token ID12098
Feature activation-4.93e+00

Pos logit contributions
lights+0.229
orting+0.205
udd+0.202
ires+0.197
angles+0.197
Neg logit contributions
e-0.187
 ha-0.135
 read-0.125
 weep-0.122
ñ-0.118
Token and
Token ID290
Feature activation-9.00e-01

Pos logit contributions
lights+0.163
ires+0.154
angles+0.151
orting+0.144
udd+0.141
Neg logit contributions
e-0.118
 read-0.078
 weep-0.070
 named-0.070
 comb-0.069
Token chased
Token ID26172
Feature activation-1.61e+00

Pos logit contributions
lights+0.029
orting+0.026
udd+0.026
ires+0.025
angles+0.025
Neg logit contributions
e-0.021
 ha-0.015
 read-0.014
 weep-0.013
ñ-0.013
Token Lily
Token ID20037
Feature activation-6.15e-01

Pos logit contributions
lights+0.034
s+0.034
orting+0.034
ires+0.031
angles+0.031
Neg logit contributions
e-0.046
 Amen-0.039
 ha-0.037
ñ-0.037
 read-0.036
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
lights+0.016
orting+0.014
udd+0.014
ires+0.014
angles+0.014
Neg logit contributions
e-0.018
 ha-0.014
 read-0.013
 weep-0.012
ñ-0.012
TokenThe
Token ID464
Feature activation-5.04e+00

Pos logit contributions
lights+0.065
falls+0.054
ires+0.053
angles+0.053
orting+0.052
Neg logit contributions
e-0.068
 ha-0.053
 read-0.050
 named-0.048
ñ-0.045
Token fox
Token ID21831
Feature activation+1.96e+00

Pos logit contributions
lights+0.171
orting+0.158
udd+0.157
ires+0.155
falls+0.152
Neg logit contributions
e-0.107
 ha-0.070
 read-0.064
 weep-0.063
ñ-0.063
Token smiled
Token ID13541
Feature activation-3.78e+00

Pos logit contributions
e+0.051
 ha+0.038
 read+0.035
 weep+0.034
ñ+0.034
Neg logit contributions
lights-0.057
orting-0.051
udd-0.051
ires-0.049
angles-0.049
Token and
Token ID290
Feature activation-1.78e+00

Pos logit contributions
lights+0.126
angles+0.124
falls+0.121
orting+0.117
udd+0.114
Neg logit contributions
e-0.074
 read-0.051
 a-0.048
 named-0.048
 yawn-0.047
Token waved
Token ID26834
Feature activation-3.40e+00

Pos logit contributions
lights+0.059
orting+0.052
angles+0.052
udd+0.051
ires+0.051
Neg logit contributions
e-0.042
 ha-0.031
 read-0.031
 named-0.028
 comb-0.028
Token its
Token ID663
Feature activation-7.30e+00

Pos logit contributions
lights+0.107
angles+0.098
falls+0.098
udd+0.096
orting+0.096
Neg logit contributions
e-0.081
 ha-0.059
 named-0.054
 read-0.053
 comb-0.051
Token tail
Token ID7894
Feature activation-3.12e+00

Pos logit contributions
lights+0.211
orting+0.185
udd+0.184
ires+0.181
angles+0.178
Neg logit contributions
e-0.195
 ha-0.147
 read-0.138
ñ-0.132
 weep-0.132
Token.
Token ID13
Feature activation-2.01e-01

Pos logit contributions
lights+0.094
udd+0.086
angles+0.085
orting+0.085
ires+0.084
Neg logit contributions
e-0.081
 read-0.057
 ha-0.056
 comb-0.053
 named-0.053
Token It
Token ID632
Feature activation-4.80e+00

Pos logit contributions
lights+0.006
angles+0.005
orting+0.005
udd+0.005
cases+0.005
Neg logit contributions
e-0.004
 named-0.004
 ha-0.004
 a-0.003
 3-0.003
Token watched
Token ID7342
Feature activation-1.85e+00

Pos logit contributions
lights+0.153
orting+0.138
udd+0.136
ires+0.134
angles+0.133
Neg logit contributions
e-0.115
 ha-0.083
 read-0.080
 named-0.073
 weep-0.073
Token them
Token ID606
Feature activation-2.37e+00

Pos logit contributions
lights+0.044
udd+0.038
orting+0.037
angles+0.036
ires+0.035
Neg logit contributions
e-0.060
 ha-0.047
 read-0.045
 named-0.044
 weep-0.043
Token was
Token ID373
Feature activation+8.12e-01

Pos logit contributions
lights+0.138
orting+0.125
udd+0.122
ires+0.121
angles+0.120
Neg logit contributions
e-0.107
 ha-0.077
 read-0.074
 weep-0.068
 named-0.068
Token happy
Token ID3772
Feature activation-2.09e+00

Pos logit contributions
e+0.019
 ha+0.014
 read+0.013
ñ+0.012
 weep+0.012
Neg logit contributions
lights-0.026
udd-0.023
orting-0.023
ires-0.023
angles-0.022
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
lights+0.067
angles+0.064
udd+0.063
orting+0.062
cases+0.060
Neg logit contributions
e-0.047
 read-0.035
 ha-0.033
 named-0.032
bug-0.030
Token It
Token ID632
Feature activation-4.50e+00

Pos logit contributions
e+0.006
 named+0.004
 read+0.004
 ha+0.004
 Amen+0.004
Neg logit contributions
lights-0.007
udd-0.007
orting-0.006
uddle-0.006
angles-0.006
Token waved
Token ID26834
Feature activation-3.01e+00

Pos logit contributions
lights+0.145
orting+0.131
udd+0.129
ires+0.128
angles+0.127
Neg logit contributions
e-0.106
 ha-0.076
 read-0.073
 named-0.067
 weep-0.067
Token its
Token ID663
Feature activation-7.27e+00

Pos logit contributions
lights+0.097
udd+0.088
orting+0.087
falls+0.087
angles+0.086
Neg logit contributions
e-0.073
 ha-0.051
 named-0.047
 read-0.047
 weep-0.044
Token antenna
Token ID20509
Feature activation-1.50e+00

Pos logit contributions
lights+0.238
orting+0.216
udd+0.215
angles+0.208
ires+0.207
Neg logit contributions
e-0.171
 ha-0.120
 read-0.112
 weep-0.109
 comb-0.106
Tokene
Token ID68
Feature activation-2.24e+00

Pos logit contributions
lights+0.074
s+0.070
orting+0.070
udd+0.069
ires+0.068
Neg logit contributions
e-0.003
 ha+0.001
 read+0.004
ñ+0.004
 weep+0.004
Token.
Token ID13
Feature activation-2.64e-01

Pos logit contributions
lights+0.058
orting+0.051
udd+0.051
ires+0.049
s+0.049
Neg logit contributions
e-0.065
 ha-0.051
 read-0.048
 weep-0.046
ñ-0.046
Token
Token ID198
Feature activation-2.85e+00

Pos logit contributions
lights+0.008
udd+0.007
orting+0.007
angles+0.007
rows+0.007
Neg logit contributions
e-0.007
 read-0.005
 ha-0.005
 named-0.005
 a-0.004
Token
Token ID198
Feature activation-2.18e+00

Pos logit contributions
lights+0.088
orting+0.076
angles+0.075
falls+0.075
udd+0.074
Neg logit contributions
e-0.073
 read-0.052
 named-0.051
 ha-0.050
 weep-0.044
Token his
Token ID465
Feature activation-2.83e+00

Pos logit contributions
lights+0.131
orting+0.119
ires+0.116
udd+0.115
angles+0.112
Neg logit contributions
e-0.122
 ha-0.093
 read-0.089
 weep-0.085
 comb-0.083
Tokensed
Token ID36622
Feature activation-2.74e+00

Pos logit contributions
lights+0.076
udd+0.067
orting+0.066
ires+0.066
angles+0.064
Neg logit contributions
e-0.079
 ha-0.060
 read-0.057
ñ-0.056
 weep-0.055
Token and
Token ID290
Feature activation-5.84e-01

Pos logit contributions
lights+0.076
udd+0.066
orting+0.066
ires+0.065
angles+0.065
Neg logit contributions
e-0.079
 ha-0.059
 read-0.056
 weep-0.054
 named-0.053
Token fl
Token ID781
Feature activation-3.54e+00

Pos logit contributions
lights+0.018
orting+0.017
udd+0.017
ires+0.016
angles+0.016
Neg logit contributions
e-0.014
 ha-0.010
 read-0.009
 weep-0.009
ñ-0.009
Tokenapped
Token ID6320
Feature activation-5.57e+00

Pos logit contributions
lights+0.091
orting+0.084
falls+0.082
udd+0.077
angles+0.076
Neg logit contributions
e-0.109
 ha-0.074
bug-0.072
 named-0.068
 weep-0.067
Token its
Token ID663
Feature activation-7.26e+00

Pos logit contributions
lights+0.182
udd+0.174
orting+0.173
ires+0.166
angles+0.165
Neg logit contributions
e-0.129
 ha-0.082
 read-0.080
 weep-0.078
 named-0.076
Token wings
Token ID12098
Feature activation-4.96e+00

Pos logit contributions
lights+0.218
udd+0.193
orting+0.193
ires+0.188
angles+0.186
Neg logit contributions
e-0.183
 ha-0.136
 read-0.127
ñ-0.121
 weep-0.121
Token.
Token ID13
Feature activation+6.12e-05

Pos logit contributions
lights+0.153
ires+0.143
angles+0.141
orting+0.137
udd+0.136
Neg logit contributions
e-0.128
 read-0.085
 ha-0.081
 weep-0.081
 comb-0.079
Token It
Token ID632
Feature activation-4.17e+00

Pos logit contributions
e+0.000
 ha+0.000
 read+0.000
 named+0.000
 weep+0.000
Neg logit contributions
lights-0.000
udd-0.000
orting-0.000
angles-0.000
ires-0.000
Token spl
Token ID4328
Feature activation-2.90e+00

Pos logit contributions
lights+0.121
s+0.108
udd+0.107
falls+0.103
orting+0.102
Neg logit contributions
e-0.105
 ha-0.076
nam-0.071
ñ-0.071
 lawyer-0.068
Tokenashed
Token ID5263
Feature activation-4.06e+00

Pos logit contributions
lights+0.058
ires+0.045
orting+0.043
angles+0.043
falls+0.043
Neg logit contributions
e-0.106
bug-0.080
 ha-0.079
 named-0.078
ñ-0.077
Token disappointment
Token ID18641
Feature activation-3.27e+00

Pos logit contributions
e+0.008
 ha+0.006
 read+0.006
ñ+0.006
 weep+0.005
Neg logit contributions
lights-0.011
orting-0.009
udd-0.009
ires-0.009
falls-0.009
Token.
Token ID13
Feature activation-1.40e+00

Pos logit contributions
lights+0.099
angles+0.094
udd+0.092
orting+0.091
ires+0.090
Neg logit contributions
e-0.081
 ha-0.057
 read-0.056
 comb-0.052
 named-0.051
Token
Token ID198
Feature activation-3.29e+00

Pos logit contributions
lights+0.038
udd+0.033
angles+0.032
orting+0.032
falls+0.031
Neg logit contributions
e-0.041
 ha-0.032
 read-0.030
 named-0.030
 weep-0.029
Token
Token ID198
Feature activation-2.33e+00

Pos logit contributions
lights+0.084
orting+0.074
udd+0.073
ires+0.071
angles+0.070
Neg logit contributions
e-0.098
 ha-0.076
 read-0.072
 weep-0.070
ñ-0.070
TokenM
Token ID44
Feature activation-1.26e+00

Pos logit contributions
lights+0.061
orting+0.055
udd+0.054
ires+0.053
s+0.052
Neg logit contributions
e-0.065
 ha-0.050
 read-0.047
 weep-0.047
ñ-0.046
Tokenarl
Token ID7063
Feature activation-7.25e+00

Pos logit contributions
lights+0.023
orting+0.019
udd+0.018
falls+0.018
angles+0.018
Neg logit contributions
e-0.047
 ha-0.039
 read-0.037
 named-0.036
 weep-0.036
Tokenon
Token ID261
Feature activation-2.86e+00

Pos logit contributions
lights+0.182
udd+0.173
orting+0.162
angles+0.151
ires+0.151
Neg logit contributions
e-0.216
 ha-0.172
 comb-0.156
 read-0.151
te-0.143
Token's
Token ID338
Feature activation-1.38e+00

Pos logit contributions
lights+0.085
orting+0.076
udd+0.076
ires+0.074
angles+0.074
Neg logit contributions
e-0.074
 ha-0.054
 read-0.051
 weep-0.049
ñ-0.048
Token mum
Token ID25682
Feature activation+1.40e+00

Pos logit contributions
lights+0.036
udd+0.031
orting+0.031
ires+0.031
angles+0.030
Neg logit contributions
e-0.041
 ha-0.031
 read-0.030
ñ-0.029
 weep-0.029
Token hugged
Token ID41130
Feature activation-3.72e+00

Pos logit contributions
e+0.035
 ha+0.026
ñ+0.024
 read+0.024
nam+0.024
Neg logit contributions
lights-0.040
udd-0.037
orting-0.036
s-0.036
angles-0.034
Token him
Token ID683
Feature activation-1.94e+00

Pos logit contributions
lights+0.121
orting+0.109
udd+0.109
ires+0.106
angles+0.106
Neg logit contributions
e-0.085
 ha-0.060
 read-0.056
 weep-0.053
ñ-0.052
Token.
Token ID13
Feature activation-6.25e-01

Pos logit contributions
lights+0.093
orting+0.082
udd+0.081
ires+0.079
angles+0.078
Neg logit contributions
e-0.094
 ha-0.072
 read-0.068
 weep-0.065
ñ-0.065
Token The
Token ID383
Feature activation-5.13e+00

Pos logit contributions
lights+0.019
angles+0.017
udd+0.016
orting+0.016
falls+0.016
Neg logit contributions
e-0.015
 read-0.012
 ha-0.011
 named-0.011
 3-0.011
Token bird
Token ID6512
Feature activation+4.95e-01

Pos logit contributions
lights+0.134
udd+0.116
orting+0.113
ires+0.110
uddle+0.110
Neg logit contributions
e-0.152
ñ-0.109
 ha-0.105
nam-0.101
 read-0.098
Token fl
Token ID781
Feature activation-3.11e+00

Pos logit contributions
e+0.012
 ha+0.009
 read+0.009
 named+0.008
 weep+0.008
Neg logit contributions
lights-0.016
orting-0.014
ires-0.014
udd-0.014
angles-0.014
Tokenapped
Token ID6320
Feature activation-4.48e+00

Pos logit contributions
lights+0.071
falls+0.061
orting+0.060
angles+0.058
udd+0.058
Neg logit contributions
e-0.105
 ha-0.078
bug-0.078
 named-0.074
bird-0.071
Token its
Token ID663
Feature activation-7.25e+00

Pos logit contributions
lights+0.172
udd+0.166
orting+0.162
uddle+0.160
falls+0.158
Neg logit contributions
e-0.081
 read-0.041
 weep-0.040
 ha-0.038
 named-0.037
Token wings
Token ID12098
Feature activation-4.97e+00

Pos logit contributions
lights+0.228
orting+0.207
udd+0.204
angles+0.202
ires+0.200
Neg logit contributions
e-0.183
 ha-0.127
 read-0.123
 weep-0.119
 comb-0.114
Token one
Token ID530
Feature activation-5.55e+00

Pos logit contributions
lights+0.157
ires+0.155
angles+0.150
orting+0.142
udd+0.141
Neg logit contributions
e-0.121
 read-0.081
 comb-0.075
 weep-0.074
 named-0.073
Token more
Token ID517
Feature activation-2.76e+00

Pos logit contributions
lights+0.150
falls+0.137
udd+0.136
angles+0.133
ires+0.133
Neg logit contributions
e-0.163
 read-0.117
 ha-0.117
bug-0.113
 comb-0.111
Token time
Token ID640
Feature activation-1.61e+00

Pos logit contributions
lights+0.093
angles+0.089
orting+0.088
udd+0.086
falls+0.086
Neg logit contributions
e-0.062
 read-0.038
 weep-0.037
 ha-0.036
 named-0.035
Token and
Token ID290
Feature activation-4.07e-01

Pos logit contributions
lights+0.050
angles+0.045
udd+0.044
orting+0.044
ires+0.043
Neg logit contributions
e-0.042
 read-0.029
 ha-0.029
 named-0.027
 comb-0.026
Token.
Token ID13
Feature activation+1.06e-01

Pos logit contributions
e+0.026
 ha+0.021
 read+0.020
 weep+0.019
ñ+0.019
Neg logit contributions
lights-0.022
orting-0.019
udd-0.019
ires-0.019
falls-0.018
Token Blue
Token ID4518
Feature activation-8.02e-01

Pos logit contributions
e+0.003
 ha+0.003
 read+0.003
 named+0.002
 weep+0.002
Neg logit contributions
lights-0.003
udd-0.002
orting-0.002
angles-0.002
ires-0.002
Tokenbird
Token ID16944
Feature activation+1.92e-01

Pos logit contributions
lights+0.025
orting+0.022
ires+0.022
udd+0.021
angles+0.021
Neg logit contributions
e-0.020
 ha-0.014
 read-0.014
 named-0.013
 weep-0.013
Token fl
Token ID781
Feature activation-4.09e+00

Pos logit contributions
e+0.005
 ha+0.003
 read+0.003
 named+0.003
bug+0.003
Neg logit contributions
lights-0.006
orting-0.006
ires-0.006
udd-0.005
angles-0.005
Tokenapped
Token ID6320
Feature activation-5.61e+00

Pos logit contributions
lights+0.100
orting+0.091
falls+0.086
udd+0.085
angles+0.084
Neg logit contributions
e-0.120
bug-0.088
 ha-0.085
years-0.084
 named-0.080
Token its
Token ID663
Feature activation-7.24e+00

Pos logit contributions
lights+0.194
udd+0.188
orting+0.184
uddle+0.181
angles+0.178
Neg logit contributions
e-0.117
 read-0.072
 named-0.066
 ha-0.065
 weep-0.064
Token wings
Token ID12098
Feature activation-5.33e+00

Pos logit contributions
lights+0.230
orting+0.210
udd+0.206
angles+0.205
ires+0.202
Neg logit contributions
e-0.178
 ha-0.124
 read-0.118
 weep-0.115
 comb-0.111
Token hard
Token ID1327
Feature activation-3.59e+00

Pos logit contributions
lights+0.172
ires+0.164
angles+0.163
orting+0.154
udd+0.152
Neg logit contributions
e-0.129
 read-0.085
 comb-0.077
 named-0.076
 yawn-0.075
Token and
Token ID290
Feature activation+3.13e-01

Pos logit contributions
lights+0.120
angles+0.113
udd+0.109
orting+0.106
ires+0.105
Neg logit contributions
e-0.075
 read-0.050
 comb-0.046
 ha-0.045
 yawn-0.044
Token flew
Token ID13112
Feature activation-3.23e+00

Pos logit contributions
e+0.007
 ha+0.005
 read+0.005
 weep+0.005
 named+0.005
Neg logit contributions
lights-0.010
orting-0.009
ires-0.009
udd-0.009
angles-0.009
Token higher
Token ID2440
Feature activation-2.25e+00

Pos logit contributions
lights+0.100
udd+0.092
angles+0.090
ires+0.089
orting+0.089
Neg logit contributions
e-0.079
 ha-0.054
 read-0.052
 fright-0.050
 named-0.050
Token
Token ID198
Feature activation-3.22e+00

Pos logit contributions
lights+0.029
angles+0.025
udd+0.025
ires+0.025
orting+0.024
Neg logit contributions
e-0.032
 ha-0.024
 read-0.024
 named-0.023
 weep-0.022
Token
Token ID198
Feature activation-2.47e+00

Pos logit contributions
lights+0.076
orting+0.065
udd+0.065
ires+0.063
angles+0.062
Neg logit contributions
e-0.103
 ha-0.082
 read-0.078
 weep-0.075
ñ-0.075
TokenThe
Token ID464
Feature activation-5.06e+00

Pos logit contributions
lights+0.064
orting+0.058
udd+0.058
ires+0.055
s+0.055
Neg logit contributions
e-0.071
 ha-0.055
 weep-0.051
 read-0.051
ñ-0.050
Token cat
Token ID3797
Feature activation+3.14e+00

Pos logit contributions
lights+0.124
udd+0.117
orting+0.116
s+0.111
ires+0.110
Neg logit contributions
e-0.144
ñ-0.100
nam-0.099
 ha-0.095
 weep-0.094
Token smiled
Token ID13541
Feature activation-3.73e+00

Pos logit contributions
e+0.083
 ha+0.063
 read+0.057
 weep+0.057
ñ+0.056
Neg logit contributions
lights-0.089
udd-0.080
orting-0.079
s-0.076
angles-0.075
Token its
Token ID663
Feature activation-7.21e+00

Pos logit contributions
lights+0.120
angles+0.117
orting+0.112
falls+0.112
udd+0.108
Neg logit contributions
e-0.080
 read-0.058
 3-0.056
 named-0.054
 three-0.052
Token biggest
Token ID4094
Feature activation-1.89e+00

Pos logit contributions
lights+0.242
orting+0.216
udd+0.213
angles+0.211
falls+0.205
Neg logit contributions
e-0.167
 read-0.117
 ha-0.116
 comb-0.113
 weep-0.112
Token smile
Token ID8212
Feature activation-3.24e+00

Pos logit contributions
lights+0.059
orting+0.052
ires+0.052
angles+0.052
udd+0.051
Neg logit contributions
e-0.048
 ha-0.036
 read-0.035
 weep-0.033
 comb-0.033
Token,
Token ID11
Feature activation-9.16e-01

Pos logit contributions
lights+0.097
udd+0.089
orting+0.087
ires+0.085
angles+0.085
Neg logit contributions
e-0.086
 ha-0.061
 read-0.061
 comb-0.057
 named-0.056
Token with
Token ID351
Feature activation-9.83e-01

Pos logit contributions
lights+0.030
angles+0.028
ires+0.027
uddle+0.027
orting+0.027
Neg logit contributions
e-0.020
 read-0.016
 named-0.014
 comb-0.014
 yawn-0.013
Token its
Token ID663
Feature activation-5.47e+00

Pos logit contributions
lights+0.033
udd+0.030
orting+0.030
angles+0.029
ires+0.029
Neg logit contributions
e-0.022
 ha-0.015
 read-0.014
 weep-0.013
 named-0.013
Token.
Token ID13
Feature activation-8.74e-01

Pos logit contributions
lights+0.054
udd+0.049
orting+0.048
ires+0.048
angles+0.047
Neg logit contributions
e-0.050
 ha-0.036
 read-0.035
 weep-0.033
 named-0.033
Token The
Token ID383
Feature activation-4.61e+00

Pos logit contributions
lights+0.025
udd+0.021
orting+0.021
rows+0.021
falls+0.020
Neg logit contributions
e-0.023
 read-0.018
 ha-0.018
 named-0.016
 lawyer-0.016
Token bird
Token ID6512
Feature activation+3.10e-01

Pos logit contributions
lights+0.117
udd+0.106
orting+0.105
s+0.102
uddle+0.101
Neg logit contributions
e-0.131
ñ-0.094
 read-0.087
years-0.085
nam-0.085
Token fl
Token ID781
Feature activation-2.99e+00

Pos logit contributions
e+0.008
 ha+0.005
 read+0.005
 weep+0.005
 named+0.005
Neg logit contributions
lights-0.010
orting-0.009
udd-0.009
ires-0.008
angles-0.008
Tokenapped
Token ID6320
Feature activation-5.03e+00

Pos logit contributions
lights+0.072
falls+0.062
orting+0.060
udd+0.058
angles+0.057
Neg logit contributions
e-0.100
 ha-0.074
bug-0.071
 named-0.070
 weep-0.067
Token its
Token ID663
Feature activation-7.21e+00

Pos logit contributions
lights+0.167
udd+0.158
orting+0.156
ires+0.149
falls+0.149
Neg logit contributions
e-0.116
 read-0.072
 ha-0.072
 named-0.068
 weep-0.068
Token wings
Token ID12098
Feature activation-5.25e+00

Pos logit contributions
lights+0.240
orting+0.216
udd+0.213
ires+0.212
angles+0.211
Neg logit contributions
e-0.172
 ha-0.118
 read-0.111
 weep-0.109
 comb-0.107
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
lights+0.173
ires+0.167
angles+0.163
orting+0.154
udd+0.152
Neg logit contributions
e-0.123
 read-0.078
 named-0.073
 comb-0.072
 weep-0.069
Token flew
Token ID13112
Feature activation-3.19e+00

Pos logit contributions
lights+0.032
orting+0.029
udd+0.029
ires+0.028
angles+0.028
Neg logit contributions
e-0.025
 ha-0.019
 read-0.017
 weep-0.017
ñ-0.016
Token a
Token ID257
Feature activation+6.43e-01

Pos logit contributions
lights+0.101
udd+0.092
ires+0.091
orting+0.090
angles+0.090
Neg logit contributions
e-0.076
 ha-0.052
 read-0.050
 named-0.048
 fright-0.048
Token little
Token ID1310
Feature activation+1.29e+00

Pos logit contributions
e+0.018
 ha+0.013
 read+0.012
ñ+0.012
 weep+0.012
Neg logit contributions
lights-0.017
udd-0.015
s-0.015
orting-0.015
uddle-0.014
Token to
Token ID284
Feature activation+2.75e+00

Pos logit contributions
e+0.017
 ha+0.013
 read+0.012
 weep+0.012
ñ+0.012
Neg logit contributions
lights-0.015
orting-0.013
udd-0.013
ires-0.012
angles-0.012
Token grow
Token ID1663
Feature activation-2.07e+00

Pos logit contributions
e+0.061
 ha+0.045
ñ+0.041
years+0.040
nam+0.039
Neg logit contributions
lights-0.079
falls-0.075
orting-0.074
udd-0.074
s-0.072
Tokenl
Token ID75
Feature activation+7.18e-01

Pos logit contributions
lights+0.051
angles+0.049
orting+0.048
ires+0.047
udd+0.046
Neg logit contributions
e-0.058
 named-0.040
 ha-0.040
bug-0.040
 read-0.040
Token and
Token ID290
Feature activation+1.53e+00

Pos logit contributions
e+0.021
 ha+0.016
 read+0.015
ñ+0.015
 weep+0.015
Neg logit contributions
lights-0.019
orting-0.016
udd-0.016
ires-0.016
angles-0.016
Token sn
Token ID3013
Feature activation-4.52e-01

Pos logit contributions
e+0.034
 ha+0.024
 read+0.021
ñ+0.021
 weep+0.020
Neg logit contributions
lights-0.050
orting-0.045
udd-0.044
falls-0.044
ires-0.043
Tokenarl
Token ID7063
Feature activation-7.20e+00

Pos logit contributions
lights+0.010
angles+0.008
ires+0.008
udd+0.007
falls+0.007
Neg logit contributions
e-0.015
 named-0.012
 ha-0.012
bug-0.012
 read-0.011
Token to
Token ID284
Feature activation+5.35e-01

Pos logit contributions
lights+0.125
udd+0.110
orting+0.108
angles+0.104
ires+0.095
Neg logit contributions
e-0.229
 ha-0.183
te-0.174
ggie-0.172
 comb-0.172
Token try
Token ID1949
Feature activation-2.09e+00

Pos logit contributions
e+0.013
 ha+0.010
ñ+0.009
 read+0.008
bug+0.008
Neg logit contributions
lights-0.016
orting-0.014
udd-0.014
ires-0.014
falls-0.014
Token and
Token ID290
Feature activation+1.85e+00

Pos logit contributions
lights+0.057
udd+0.050
orting+0.050
ires+0.049
angles+0.049
Neg logit contributions
e-0.059
 ha-0.045
 read-0.044
 weep-0.041
 comb-0.041
Token scare
Token ID19437
Feature activation-1.54e+00

Pos logit contributions
e+0.043
 ha+0.030
 read+0.028
ñ+0.026
 weep+0.026
Neg logit contributions
lights-0.060
orting-0.054
udd-0.054
ires-0.052
angles-0.052
Token her
Token ID607
Feature activation-1.79e+00

Pos logit contributions
lights+0.042
orting+0.037
udd+0.037
ires+0.036
angles+0.035
Neg logit contributions
e-0.043
 ha-0.033
 read-0.031
 weep-0.030
ñ-0.030
Token sw
Token ID1509
Feature activation-3.90e+00

Pos logit contributions
e+0.011
 read+0.008
 named+0.008
 ha+0.008
 weep+0.007
Neg logit contributions
lights-0.013
orting-0.012
angles-0.012
ires-0.012
udd-0.012
Tokenam
Token ID321
Feature activation-2.51e+00

Pos logit contributions
lights+0.078
orting+0.060
falls+0.058
ires+0.054
udd+0.054
Neg logit contributions
e-0.146
 ha-0.109
 named-0.106
ggie-0.103
bug-0.101
Token away
Token ID1497
Feature activation-2.75e+00

Pos logit contributions
lights+0.084
ires+0.077
angles+0.076
udd+0.075
orting+0.074
Neg logit contributions
e-0.058
 ha-0.038
 named-0.037
 read-0.036
 fright-0.035
Token,
Token ID11
Feature activation-2.26e-01

Pos logit contributions
lights+0.084
angles+0.079
orting+0.078
udd+0.076
ires+0.075
Neg logit contributions
e-0.067
 ha-0.050
 read-0.048
 weep-0.046
 named-0.046
Token waving
Token ID25849
Feature activation-3.84e+00

Pos logit contributions
lights+0.007
angles+0.007
ires+0.006
udd+0.006
orting+0.006
Neg logit contributions
e-0.005
 read-0.004
 named-0.004
 weep-0.004
 ha-0.004
Token its
Token ID663
Feature activation-7.20e+00

Pos logit contributions
lights+0.111
udd+0.102
orting+0.100
angles+0.099
ires+0.097
Neg logit contributions
e-0.103
 ha-0.076
 read-0.071
 named-0.070
 weep-0.068
Token tail
Token ID7894
Feature activation-2.79e+00

Pos logit contributions
lights+0.221
orting+0.203
udd+0.202
ires+0.196
angles+0.196
Neg logit contributions
e-0.180
 ha-0.131
 read-0.123
 weep-0.120
 comb-0.116
Token goodbye
Token ID24829
Feature activation-3.05e+00

Pos logit contributions
lights+0.087
angles+0.079
orting+0.079
ires+0.079
udd+0.078
Neg logit contributions
e-0.070
 read-0.049
 ha-0.048
 comb-0.046
 weep-0.045
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
lights+0.092
angles+0.083
ires+0.081
orting+0.081
udd+0.080
Neg logit contributions
e-0.079
 ha-0.058
 read-0.058
 named-0.055
 weep-0.053
Token
Token ID220
Feature activation-3.37e+00

Pos logit contributions
lights+0.061
angles+0.057
rows+0.055
orting+0.054
udd+0.053
Neg logit contributions
e-0.029
 named-0.026
 read-0.021
 ha-0.021
Ste-0.019
Token
Token ID198
Feature activation-3.99e+00

Pos logit contributions
lights+0.107
angles+0.089
orting+0.088
falls+0.087
udd+0.087
Neg logit contributions
e-0.088
 ha-0.060
 named-0.059
 read-0.058
bug-0.055
Token time
Token ID640
Feature activation-5.18e-01

Pos logit contributions
lights+0.012
s+0.010
udd+0.010
orting+0.009
angles+0.009
Neg logit contributions
e-0.016
 ha-0.013
ñ-0.013
nam-0.013
Mic-0.012
Token the
Token ID262
Feature activation-3.28e+00

Pos logit contributions
lights+0.014
orting+0.013
udd+0.013
ires+0.012
angles+0.012
Neg logit contributions
e-0.015
 ha-0.011
 read-0.011
 comb-0.010
ñ-0.010
Token toy
Token ID13373
Feature activation-2.15e-01

Pos logit contributions
lights+0.083
ires+0.070
udd+0.069
orting+0.068
uddle+0.068
Neg logit contributions
e-0.100
ñ-0.070
 named-0.069
 read-0.068
 weep-0.068
Token wink
Token ID41955
Feature activation-5.07e+00

Pos logit contributions
lights+0.006
orting+0.006
udd+0.006
ires+0.006
angles+0.005
Neg logit contributions
e-0.006
 ha-0.004
 read-0.004
 comb-0.004
 named-0.004
Tokened
Token ID276
Feature activation-4.61e+00

Pos logit contributions
lights+0.134
orting+0.123
ires+0.121
angles+0.120
udd+0.119
Neg logit contributions
e-0.147
 ha-0.112
 read-0.104
 comb-0.103
bug-0.100
Token it
Token ID340
Feature activation-7.19e+00

Pos logit contributions
lights+0.149
angles+0.140
orting+0.139
udd+0.139
ires+0.138
Neg logit contributions
e-0.103
 ha-0.070
 read-0.066
 comb-0.064
 named-0.064
Token made
Token ID925
Feature activation-2.20e+00

Pos logit contributions
lights+0.232
ires+0.217
orting+0.217
udd+0.216
angles+0.215
Neg logit contributions
e-0.164
 read-0.117
 comb-0.110
 ha-0.108
 named-0.103
Token her
Token ID607
Feature activation-7.56e-01

Pos logit contributions
lights+0.080
orting+0.073
udd+0.073
ires+0.071
angles+0.071
Neg logit contributions
e-0.043
 ha-0.027
 read-0.025
 comb-0.023
 weep-0.023
Token smile
Token ID8212
Feature activation-3.81e+00

Pos logit contributions
lights+0.024
udd+0.021
orting+0.021
ires+0.020
angles+0.020
Neg logit contributions
e-0.018
 ha-0.013
 read-0.012
ñ-0.012
 weep-0.012
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
lights+0.110
orting+0.099
udd+0.099
ires+0.096
angles+0.095
Neg logit contributions
e-0.103
 ha-0.075
 read-0.074
 comb-0.068
 weep-0.068
Token The
Token ID383
Feature activation-5.47e+00

Pos logit contributions
lights+0.039
udd+0.033
orting+0.033
angles+0.032
falls+0.032
Neg logit contributions
e-0.046
 ha-0.036
 read-0.034
 named-0.033
 weep-0.032
Token and
Token ID290
Feature activation-9.66e-01

Pos logit contributions
lights+0.051
udd+0.045
angles+0.044
orting+0.044
ires+0.044
Neg logit contributions
e-0.043
 read-0.032
 ha-0.031
 weep-0.030
 comb-0.030
Token the
Token ID262
Feature activation-5.29e+00

Pos logit contributions
lights+0.029
udd+0.026
orting+0.026
ires+0.026
angles+0.025
Neg logit contributions
e-0.025
 ha-0.018
 read-0.017
 weep-0.016
 named-0.016
Token little
Token ID1310
Feature activation-9.93e-01

Pos logit contributions
lights+0.162
ires+0.143
s+0.142
orting+0.141
udd+0.140
Neg logit contributions
e-0.127
 ha-0.088
ñ-0.084
nam-0.075
 read-0.075
Token mouse
Token ID10211
Feature activation+1.13e+00

Pos logit contributions
lights+0.033
s+0.030
orting+0.029
ires+0.029
udd+0.029
Neg logit contributions
e-0.021
 ha-0.015
ñ-0.014
 read-0.013
nam-0.012
Token waved
Token ID26834
Feature activation-3.57e+00

Pos logit contributions
e+0.028
 ha+0.020
 read+0.019
 named+0.018
 weep+0.017
Neg logit contributions
lights-0.036
orting-0.033
ires-0.033
udd-0.033
angles-0.032
Token its
Token ID663
Feature activation-7.17e+00

Pos logit contributions
lights+0.107
angles+0.098
udd+0.097
orting+0.095
falls+0.095
Neg logit contributions
e-0.092
 ha-0.068
 read-0.065
 weep-0.063
 named-0.062
Token tiny
Token ID7009
Feature activation-3.73e+00

Pos logit contributions
lights+0.223
orting+0.207
udd+0.207
angles+0.203
ires+0.198
Neg logit contributions
e-0.179
 ha-0.127
 weep-0.120
 read-0.120
 comb-0.116
Token paws
Token ID49193
Feature activation-1.91e+00

Pos logit contributions
lights+0.104
orting+0.092
udd+0.092
ires+0.090
angles+0.089
Neg logit contributions
e-0.103
 ha-0.078
 read-0.074
 weep-0.071
ñ-0.070
Token.
Token ID13
Feature activation-1.98e+00

Pos logit contributions
lights+0.059
ires+0.055
angles+0.054
udd+0.053
orting+0.052
Neg logit contributions
e-0.048
 read-0.033
 ha-0.032
 named-0.031
bug-0.031
Token Joy
Token ID14087
Feature activation-1.57e+00

Pos logit contributions
lights+0.068
udd+0.062
orting+0.059
angles+0.059
falls+0.056
Neg logit contributions
e-0.044
 ha-0.030
 read-0.030
bug-0.028
 named-0.027
Token smiled
Token ID13541
Feature activation-4.08e+00

Pos logit contributions
lights+0.044
udd+0.040
s+0.039
orting+0.039
angles+0.038
Neg logit contributions
e-0.040
 ha-0.030
ñ-0.028
 read-0.028
nam-0.028
Tokenped
Token ID9124
Feature activation-2.88e+00

Pos logit contributions
lights+0.121
udd+0.107
orting+0.106
uddle+0.106
angles+0.105
Neg logit contributions
e-0.083
 ha-0.050
 read-0.047
 named-0.046
bug-0.046
Token happily
Token ID18177
Feature activation-1.15e+00

Pos logit contributions
lights+0.078
orting+0.069
udd+0.068
ires+0.066
s+0.066
Neg logit contributions
e-0.081
 ha-0.062
 read-0.058
 weep-0.056
ñ-0.056
Token and
Token ID290
Feature activation-7.18e-01

Pos logit contributions
lights+0.033
udd+0.029
orting+0.029
angles+0.028
ires+0.028
Neg logit contributions
e-0.032
 ha-0.023
 read-0.023
 weep-0.022
 named-0.021
Token fl
Token ID781
Feature activation-3.47e+00

Pos logit contributions
lights+0.023
orting+0.021
udd+0.021
ires+0.021
angles+0.021
Neg logit contributions
e-0.017
 ha-0.012
 read-0.012
 named-0.011
 weep-0.011
Tokenapped
Token ID6320
Feature activation-5.28e+00

Pos logit contributions
lights+0.089
falls+0.081
angles+0.078
orting+0.077
 horizon+0.076
Neg logit contributions
e-0.098
bug-0.070
 named-0.068
years-0.067
 ha-0.064
Token its
Token ID663
Feature activation-7.17e+00

Pos logit contributions
lights+0.166
udd+0.159
orting+0.157
angles+0.151
ires+0.151
Neg logit contributions
e-0.127
 ha-0.084
 read-0.083
 named-0.081
 weep-0.080
Token wings
Token ID12098
Feature activation-4.72e+00

Pos logit contributions
lights+0.210
orting+0.186
udd+0.186
ires+0.181
angles+0.179
Neg logit contributions
e-0.189
 ha-0.141
 read-0.132
 weep-0.127
ñ-0.126
Token.
Token ID13
Feature activation-5.19e-01

Pos logit contributions
lights+0.144
ires+0.135
angles+0.134
orting+0.130
udd+0.128
Neg logit contributions
e-0.122
 read-0.083
 ha-0.077
 named-0.077
 weep-0.077
Token Ben
Token ID3932
Feature activation+1.33e-01

Pos logit contributions
lights+0.014
udd+0.012
angles+0.012
orting+0.012
ires+0.011
Neg logit contributions
e-0.015
 ha-0.012
 named-0.011
 read-0.011
bug-0.010
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
e+0.004
 ha+0.003
nam+0.003
ñ+0.003
 weep+0.003
Neg logit contributions
lights-0.003
udd-0.003
s-0.003
orting-0.003
falls-0.003
Token Lily
Token ID20037
Feature activation-6.14e-01

Pos logit contributions
lights+0.026
falls+0.021
s+0.021
orting+0.021
ires+0.020
Neg logit contributions
e-0.032
 ha-0.023
ñ-0.023
 Amen-0.022
 weep-0.022
Token car
Token ID1097
Feature activation+1.08e+00

Pos logit contributions
lights+0.048
udd+0.042
orting+0.041
falls+0.041
angles+0.040
Neg logit contributions
e-0.046
 ha-0.033
 read-0.032
 weep-0.031
ñ-0.031
Token.
Token ID13
Feature activation-1.87e-01

Pos logit contributions
e+0.028
 ha+0.020
 read+0.020
 named+0.018
 weep+0.018
Neg logit contributions
lights-0.033
ires-0.030
orting-0.029
udd-0.029
angles-0.029
Token It
Token ID632
Feature activation-4.26e+00

Pos logit contributions
lights+0.004
udd+0.004
orting+0.004
angles+0.004
ires+0.004
Neg logit contributions
e-0.006
 ha-0.005
 read-0.005
 named-0.004
 weep-0.004
Token fl
Token ID781
Feature activation-2.72e+00

Pos logit contributions
lights+0.137
orting+0.127
ires+0.125
udd+0.123
angles+0.121
Neg logit contributions
e-0.100
 ha-0.073
 read-0.070
 weep-0.064
 named-0.064
Tokenapped
Token ID6320
Feature activation-5.21e+00

Pos logit contributions
lights+0.069
falls+0.065
orting+0.063
der+0.058
ups+0.057
Neg logit contributions
e-0.080
bug-0.052
 named-0.051
 ha-0.049
years-0.048
Token its
Token ID663
Feature activation-7.17e+00

Pos logit contributions
lights+0.172
udd+0.166
orting+0.166
uddle+0.158
cases+0.158
Neg logit contributions
e-0.117
 read-0.071
 named-0.070
 ha-0.065
 weep-0.063
Token wings
Token ID12098
Feature activation-4.65e+00

Pos logit contributions
lights+0.212
orting+0.189
udd+0.188
ires+0.183
angles+0.182
Neg logit contributions
e-0.187
 ha-0.139
 read-0.131
 weep-0.125
ñ-0.124
Token and
Token ID290
Feature activation-6.69e-01

Pos logit contributions
lights+0.152
ires+0.146
angles+0.141
orting+0.138
udd+0.135
Neg logit contributions
e-0.111
 read-0.073
 named-0.067
 comb-0.065
 weep-0.062
Token flew
Token ID13112
Feature activation-3.18e+00

Pos logit contributions
lights+0.022
orting+0.020
udd+0.020
ires+0.020
angles+0.019
Neg logit contributions
e-0.015
 ha-0.011
 read-0.010
 weep-0.009
ñ-0.009
Token fast
Token ID3049
Feature activation-1.31e+00

Pos logit contributions
lights+0.097
udd+0.089
ires+0.088
orting+0.088
angles+0.086
Neg logit contributions
e-0.080
 ha-0.055
 read-0.052
 named-0.051
 fright-0.049
Token through
Token ID832
Feature activation+1.89e-01

Pos logit contributions
lights+0.035
orting+0.031
udd+0.031
ires+0.030
angles+0.030
Neg logit contributions
e-0.038
 ha-0.029
 read-0.027
 weep-0.026
ñ-0.026
Token world
Token ID995
Feature activation+1.11e+00

Pos logit contributions
lights+0.007
udd+0.006
orting+0.006
ires+0.006
falls+0.005
Neg logit contributions
e-0.010
 ha-0.007
 read-0.007
ñ-0.007
 weep-0.007
Token,
Token ID11
Feature activation+1.79e+00

Pos logit contributions
e+0.031
 ha+0.023
 read+0.023
 weep+0.021
 named+0.021
Neg logit contributions
lights-0.031
udd-0.027
orting-0.027
ires-0.027
angles-0.026
Token but
Token ID475
Feature activation+9.10e-01

Pos logit contributions
e+0.048
 ha+0.036
 read+0.034
 weep+0.033
ñ+0.032
Neg logit contributions
lights-0.052
orting-0.046
udd-0.046
ires-0.045
angles-0.044
Token there
Token ID612
Feature activation-2.66e+00

Pos logit contributions
e+0.026
 ha+0.019
 read+0.018
ñ+0.018
 weep+0.018
Neg logit contributions
lights-0.025
orting-0.022
udd-0.022
ires-0.021
angles-0.021
Token was
Token ID373
Feature activation+3.75e+00

Pos logit contributions
lights+0.068
udd+0.059
orting+0.059
s+0.059
ires+0.057
Neg logit contributions
e-0.078
 ha-0.060
 read-0.057
ñ-0.057
 weep-0.055
Token a
Token ID257
Feature activation+5.16e+00

Pos logit contributions
e+0.091
ñ+0.060
 ha+0.060
 Amen+0.058
 weep+0.058
Neg logit contributions
lights-0.116
udd-0.104
orting-0.100
s-0.100
ires-0.100
Token big
Token ID1263
Feature activation+3.90e+00

Pos logit contributions
e+0.195
ez+0.136
years+0.134
ñ+0.131
bug+0.130
Neg logit contributions
uddle-0.100
lights-0.089
udd-0.089
s-0.085
falls-0.073
Token,
Token ID11
Feature activation+3.85e+00

Pos logit contributions
e+0.128
years+0.093
nam+0.092
ñ+0.091
 ha+0.090
Neg logit contributions
lights-0.086
orting-0.075
udd-0.074
uddle-0.073
falls-0.072
Token dark
Token ID3223
Feature activation+3.64e+00

Pos logit contributions
e+0.131
 ha+0.101
ñ+0.097
 Amen+0.091
 read+0.090
Neg logit contributions
lights-0.084
udd-0.070
orting-0.066
s-0.066
falls-0.065
Token fence
Token ID13990
Feature activation+3.51e+00

Pos logit contributions
e+0.109
nam+0.082
 ha+0.082
years+0.077
ñ+0.077
Neg logit contributions
lights-0.096
udd-0.079
s-0.077
orting-0.076
falls-0.076
Token around
Token ID1088
Feature activation+2.88e+00

Pos logit contributions
e+0.098
 ha+0.074
 read+0.071
 named+0.068
 weep+0.067
Neg logit contributions
lights-0.097
udd-0.086
orting-0.086
angles-0.085
ires-0.084
Token,
Token ID11
Feature activation+2.85e+00

Pos logit contributions
e+0.051
 ha+0.039
 read+0.037
 weep+0.035
ñ+0.035
Neg logit contributions
lights-0.050
orting-0.045
udd-0.044
ires-0.043
angles-0.043
Token there
Token ID612
Feature activation-1.64e+00

Pos logit contributions
e+0.079
ñ+0.065
 ha+0.064
 read+0.060
 comb+0.059
Neg logit contributions
lights-0.075
s-0.066
ires-0.065
udd-0.063
orting-0.063
Token was
Token ID373
Feature activation+5.42e+00

Pos logit contributions
lights+0.043
udd+0.038
orting+0.038
ires+0.037
s+0.037
Neg logit contributions
e-0.047
 ha-0.036
 read-0.034
ñ-0.034
 weep-0.033
Token a
Token ID257
Feature activation+6.66e+00

Pos logit contributions
e+0.138
 ha+0.137
 weep+0.130
ñ+0.122
 disob+0.118
Neg logit contributions
lights-0.111
ires-0.109
 They-0.104
udd-0.102
s-0.098
Token little
Token ID1310
Feature activation+5.23e+00

Pos logit contributions
e+0.247
ez+0.190
years+0.178
ñ+0.176
tail+0.175
Neg logit contributions
uddle-0.113
udd-0.112
s-0.105
lights-0.101
 new-0.091
Token girl
Token ID2576
Feature activation+7.21e+00

Pos logit contributions
e+0.169
 read+0.122
ñ+0.120
 ha+0.118
nam+0.115
Neg logit contributions
lights-0.116
s-0.108
udd-0.108
uddle-0.102
orting-0.101
Token named
Token ID3706
Feature activation+5.49e+00

Pos logit contributions
e+0.112
ñ+0.103
nam+0.095
 ha+0.094
Rober+0.087
Neg logit contributions
s-0.223
lights-0.219
udd-0.216
orting-0.201
ires-0.194
Token Lily
Token ID20037
Feature activation+3.81e+00

Pos logit contributions
e+0.175
 ha+0.134
ñ+0.132
 Amen+0.126
 weep+0.121
Neg logit contributions
lights-0.122
udd-0.113
s-0.109
orting-0.108
ires-0.106
Token.
Token ID13
Feature activation+2.33e+00

Pos logit contributions
e+0.124
 ha+0.110
ñ+0.100
nam+0.096
 weep+0.095
Neg logit contributions
lights-0.074
udd-0.069
s-0.068
orting-0.064
ires-0.061
Token She
Token ID1375
Feature activation-9.49e-01

Pos logit contributions
e+0.075
 ha+0.059
 read+0.056
 weep+0.055
ñ+0.054
Neg logit contributions
lights-0.055
orting-0.047
udd-0.047
ires-0.046
angles-0.045
Token loved
Token ID6151
Feature activation+2.83e+00

Pos logit contributions
s+0.016
udd+0.016
lights+0.013
ires+0.011
orting+0.011
Neg logit contributions
e-0.030
nam-0.027
 ha-0.027
bug-0.026
 fright-0.026
Token,
Token ID11
Feature activation+1.35e+00

Pos logit contributions
lights+0.074
udd+0.067
orting+0.065
ires+0.065
angles+0.064
Neg logit contributions
e-0.062
 ha-0.048
 read-0.044
 named-0.043
 weep-0.041
Token there
Token ID612
Feature activation-2.35e+00

Pos logit contributions
e+0.037
 ha+0.027
ñ+0.026
 read+0.026
 weep+0.026
Neg logit contributions
lights-0.036
orting-0.032
ires-0.031
udd-0.031
angles-0.031
Token was
Token ID373
Feature activation+3.71e+00

Pos logit contributions
lights+0.073
orting+0.064
udd+0.063
ires+0.062
falls+0.061
Neg logit contributions
e-0.061
 ha-0.044
 read-0.042
 named-0.040
 weep-0.039
Token a
Token ID257
Feature activation+5.52e+00

Pos logit contributions
e+0.078
 ha+0.058
 weep+0.054
ñ+0.052
 comb+0.050
Neg logit contributions
lights-0.114
udd-0.102
ires-0.099
orting-0.099
der-0.098
Token little
Token ID1310
Feature activation+4.20e+00

Pos logit contributions
e+0.205
ez+0.152
years+0.139
osc+0.134
ñ+0.134
Neg logit contributions
uddle-0.102
lights-0.092
udd-0.088
s-0.086
 new-0.083
Token boy
Token ID2933
Feature activation+7.55e+00

Pos logit contributions
e+0.144
 read+0.103
ñ+0.102
years+0.098
ggie+0.097
Neg logit contributions
lights-0.086
uddle-0.075
udd-0.074
s-0.073
orting-0.070
Token named
Token ID3706
Feature activation+5.11e+00

Pos logit contributions
e+0.114
 ha+0.089
ñ+0.085
nam+0.085
Pen+0.077
Neg logit contributions
s-0.256
lights-0.245
udd-0.237
orting-0.232
ails-0.221
Token Tim
Token ID5045
Feature activation+4.03e+00

Pos logit contributions
e+0.135
 ha+0.099
 read+0.095
ñ+0.092
 weep+0.091
Neg logit contributions
lights-0.145
orting-0.132
udd-0.130
ires-0.128
angles-0.127
Token.
Token ID13
Feature activation+2.44e+00

Pos logit contributions
e+0.119
 ha+0.096
ñ+0.088
 read+0.086
nam+0.085
Neg logit contributions
lights-0.098
udd-0.092
orting-0.089
s-0.088
angles-0.084
Token He
Token ID679
Feature activation-1.04e+00

Pos logit contributions
e+0.061
 ha+0.044
 read+0.043
 named+0.042
 weep+0.040
Neg logit contributions
lights-0.074
udd-0.066
orting-0.065
angles-0.065
ires-0.064
Token saw
Token ID2497
Feature activation-5.86e-01

Pos logit contributions
s+0.020
udd+0.018
lights+0.018
falls+0.015
orting+0.015
Neg logit contributions
e-0.032
 ha-0.027
nam-0.027
ñ-0.025
 weep-0.025
Token monkey
Token ID21657
Feature activation+7.73e+00

Pos logit contributions
e+0.088
nam+0.065
years+0.064
tail+0.059
 Amen+0.057
Neg logit contributions
udd-0.062
uddle-0.061
lights-0.061
der-0.056
orting-0.056
Token.
Token ID13
Feature activation+2.63e+00

Pos logit contributions
e+0.131
 ha+0.111
nam+0.110
 weep+0.099
ñ+0.097
Neg logit contributions
s-0.249
lights-0.237
udd-0.232
orting-0.226
ires-0.216
Token The
Token ID383
Feature activation-1.89e+00

Pos logit contributions
e+0.114
 ha+0.100
ñ+0.095
 comb+0.093
 weep+0.092
Neg logit contributions
lights-0.025
ires-0.017
falls-0.017
orting-0.016
s-0.016
Token monkey
Token ID21657
Feature activation+5.53e+00

Pos logit contributions
udd+0.048
lights+0.048
orting+0.048
ires+0.047
falls+0.047
Neg logit contributions
e-0.048
nam-0.035
ñ-0.034
 weep-0.033
 Amen-0.033
Token loved
Token ID6151
Feature activation+2.76e+00

Pos logit contributions
e+0.153
 ha+0.123
ñ+0.116
nam+0.114
 weep+0.114
Neg logit contributions
lights-0.136
s-0.130
udd-0.128
orting-0.119
angles-0.117
Token to
Token ID284
Feature activation+5.49e+00

Pos logit contributions
e+0.098
 ha+0.078
 weep+0.074
 Amen+0.073
ñ+0.073
Neg logit contributions
lights-0.052
falls-0.045
udd-0.045
s-0.043
orting-0.043
Token play
Token ID711
Feature activation+2.71e+00

Pos logit contributions
e+0.139
years+0.112
nam+0.110
 ha+0.107
Ste+0.104
Neg logit contributions
lights-0.130
falls-0.123
orting-0.118
udd-0.116
angles-0.115
Token all
Token ID477
Feature activation+5.45e-04

Pos logit contributions
e+0.073
 ha+0.059
 weep+0.058
nam+0.058
ñ+0.054
Neg logit contributions
lights-0.066
orting-0.066
udd-0.062
ires-0.061
s-0.061
Token day
Token ID1110
Feature activation+6.26e-01

Pos logit contributions
e+0.000
 ha+0.000
 read+0.000
 named+0.000
 weep+0.000
Neg logit contributions
lights-0.000
udd-0.000
orting-0.000
ires-0.000
angles-0.000
Token long
Token ID890
Feature activation-7.14e-02

Pos logit contributions
e+0.018
 ha+0.014
 read+0.013
 weep+0.012
ñ+0.012
Neg logit contributions
lights-0.017
orting-0.015
udd-0.015
ires-0.014
s-0.014
Token.
Token ID13
Feature activation+1.14e+00

Pos logit contributions
lights+0.002
s+0.001
orting+0.001
ires+0.001
udd+0.001
Neg logit contributions
e-0.002
 ha-0.002
 weep-0.002
 read-0.002
ñ-0.002
TokenOne
Token ID3198
Feature activation+5.37e-01

Pos logit contributions
e+0.049
 ha+0.040
 read+0.038
 weep+0.037
ñ+0.037
Neg logit contributions
lights-0.032
orting-0.028
udd-0.027
s-0.027
ires-0.026
Token day
Token ID1110
Feature activation+6.07e-01

Pos logit contributions
e+0.017
 ha+0.014
ñ+0.014
nam+0.013
 comb+0.013
Neg logit contributions
lights-0.012
udd-0.010
s-0.010
orting-0.010
ires-0.009
Token,
Token ID11
Feature activation+2.20e+00

Pos logit contributions
e+0.016
 ha+0.012
 read+0.012
 weep+0.011
ñ+0.011
Neg logit contributions
lights-0.017
orting-0.015
udd-0.015
ires-0.015
angles-0.015
Token a
Token ID257
Feature activation+4.97e+00

Pos logit contributions
e+0.048
 ha+0.038
ñ+0.038
 comb+0.035
 weep+0.034
Neg logit contributions
s-0.064
lights-0.063
ires-0.059
orting-0.058
falls-0.056
Token little
Token ID1310
Feature activation+4.81e+00

Pos logit contributions
e+0.171
 weep+0.125
ez+0.122
tail+0.120
nam+0.120
Neg logit contributions
udd-0.094
uddle-0.090
s-0.090
lights-0.090
orting-0.084
Token girl
Token ID2576
Feature activation+6.31e+00

Pos logit contributions
e+0.151
 read+0.113
ñ+0.111
 ha+0.108
nam+0.107
Neg logit contributions
lights-0.104
s-0.099
udd-0.097
orting-0.097
uddle-0.089
Token was
Token ID373
Feature activation+5.01e+00

Pos logit contributions
nam+0.101
ñ+0.088
e+0.088
chard+0.076
Ste+0.075
Neg logit contributions
s-0.197
lights-0.185
udd-0.184
orting-0.172
ires-0.165
Token walking
Token ID6155
Feature activation+3.78e+00

Pos logit contributions
e+0.118
ñ+0.109
 read+0.088
 fright+0.087
 ha+0.085
Neg logit contributions
udd-0.120
ires-0.118
lights-0.112
orting-0.108
s-0.107
Token in
Token ID287
Feature activation+4.41e+00

Pos logit contributions
e+0.105
ñ+0.092
 ha+0.088
ggie+0.083
 tresp+0.080
Neg logit contributions
lights-0.084
s-0.083
orting-0.077
udd-0.073
ires-0.072
Token a
Token ID257
Feature activation+6.39e+00

Pos logit contributions
e+0.127
ñ+0.109
 ha+0.103
 liar+0.097
 Amen+0.097
Neg logit contributions
lights-0.092
s-0.080
ires-0.074
falls-0.074
orting-0.074
Token park
Token ID3952
Feature activation+4.27e+00

Pos logit contributions
e+0.192
 ha+0.145
 read+0.137
ñ+0.136
 weep+0.134
Neg logit contributions
lights-0.161
udd-0.145
orting-0.141
s-0.138
ires-0.136
Token Lily
Token ID20037
Feature activation+7.44e-02

Pos logit contributions
e+0.004
 ha+0.003
 named+0.003
 read+0.003
 3+0.003
Neg logit contributions
lights-0.006
angles-0.006
ires-0.006
falls-0.005
orting-0.005
Token and
Token ID290
Feature activation-3.46e-01

Pos logit contributions
e+0.002
 Amen+0.002
nam+0.002
 ha+0.002
ñ+0.002
Neg logit contributions
s-0.002
udd-0.002
lights-0.002
orting-0.001
uddle-0.001
Token her
Token ID607
Feature activation-2.40e+00

Pos logit contributions
lights+0.008
s+0.008
orting+0.008
angles+0.007
ires+0.007
Neg logit contributions
e-0.010
ñ-0.007
 ha-0.007
 stub-0.007
bug-0.006
Token friends
Token ID2460
Feature activation+1.04e+00

Pos logit contributions
lights+0.052
udd+0.045
orting+0.043
ires+0.043
s+0.042
Neg logit contributions
e-0.081
 ha-0.062
ñ-0.062
 read-0.060
nam-0.059
Token were
Token ID547
Feature activation+1.11e+00

Pos logit contributions
e+0.028
 ha+0.021
ñ+0.020
 Amen+0.020
 read+0.019
Neg logit contributions
lights-0.027
s-0.026
udd-0.025
orting-0.025
angles-0.023
Token scared
Token ID12008
Feature activation+5.63e-01

Pos logit contributions
e+0.024
 ha+0.017
ñ+0.017
 read+0.015
 Amen+0.015
Neg logit contributions
lights-0.034
udd-0.032
orting-0.032
s-0.031
ires-0.031
Token.
Token ID13
Feature activation-7.25e-03

Pos logit contributions
e+0.012
 named+0.009
 ha+0.008
 weep+0.008
 read+0.008
Neg logit contributions
lights-0.018
angles-0.018
falls-0.017
ires-0.017
udd-0.017
Token They
Token ID1119
Feature activation-2.14e+00

Pos logit contributions
lights+0.000
angles+0.000
ires+0.000
falls+0.000
udd+0.000
Neg logit contributions
e-0.000
 named-0.000
 ha-0.000
 read-0.000
 3-0.000
Token didn
Token ID1422
Feature activation+1.54e+00

Pos logit contributions
lights+0.058
s+0.054
orting+0.054
udd+0.053
angles+0.051
Neg logit contributions
e-0.057
 ha-0.041
ñ-0.039
bug-0.035
 Amen-0.035
Token't
Token ID470
Feature activation+1.05e-01

Pos logit contributions
e+0.039
 ha+0.030
nam+0.028
 weep+0.027
ñ+0.027
Neg logit contributions
lights-0.044
s-0.040
orting-0.040
ires-0.039
udd-0.038
Token know
Token ID760
Feature activation-1.18e+00

Pos logit contributions
e+0.003
 ha+0.002
nam+0.002
ñ+0.002
 read+0.002
Neg logit contributions
lights-0.003
orting-0.003
udd-0.003
s-0.003
falls-0.003
TokenThey
Token ID2990
Feature activation-2.52e+00

Pos logit contributions
lights+0.074
falls+0.067
orting+0.065
udd+0.065
angles+0.064
Neg logit contributions
e-0.059
 ha-0.043
 read-0.041
 named-0.040
bug-0.038
Token smile
Token ID8212
Feature activation-4.17e+00

Pos logit contributions
lights+0.065
udd+0.062
orting+0.062
angles+0.060
falls+0.058
Neg logit contributions
e-0.067
 ha-0.049
ñ-0.047
 Amen-0.047
 lawyer-0.047
Token.
Token ID13
Feature activation+8.96e-02

Pos logit contributions
lights+0.129
angles+0.113
falls+0.112
ires+0.111
orting+0.109
Neg logit contributions
e-0.105
 read-0.080
 ha-0.075
 weep-0.072
 named-0.070
Token They
Token ID1119
Feature activation-3.16e+00

Pos logit contributions
e+0.003
 named+0.003
 read+0.003
 ha+0.002
 3+0.002
Neg logit contributions
lights-0.002
udd-0.002
angles-0.002
orting-0.002
rows-0.001
Token are
Token ID389
Feature activation-2.49e-01

Pos logit contributions
lights+0.087
udd+0.079
orting+0.079
angles+0.075
ires+0.075
Neg logit contributions
e-0.086
 ha-0.065
 read-0.060
ñ-0.059
 weep-0.059
Token friends
Token ID2460
Feature activation+2.50e+00

Pos logit contributions
lights+0.008
orting+0.007
udd+0.007
angles+0.007
ires+0.007
Neg logit contributions
e-0.006
 ha-0.004
 read-0.004
 named-0.004
 weep-0.004
Token again
Token ID757
Feature activation-1.43e+00

Pos logit contributions
e+0.071
 ha+0.052
 read+0.049
 named+0.048
 weep+0.048
Neg logit contributions
lights-0.071
angles-0.063
ires-0.062
falls-0.062
udd-0.061
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
lights+0.035
orting+0.031
udd+0.031
ires+0.030
s+0.030
Neg logit contributions
e-0.044
 ha-0.034
 read-0.032
ñ-0.031
 weep-0.031
Token They
Token ID1119
Feature activation-2.86e+00

Pos logit contributions
e+0.005
 named+0.004
 read+0.004
 ha+0.004
bug+0.004
Neg logit contributions
lights-0.003
angles-0.003
udd-0.003
orting-0.003
rows-0.003
Token play
Token ID711
Feature activation+1.82e-01

Pos logit contributions
lights+0.076
orting+0.071
udd+0.071
angles+0.068
falls+0.067
Neg logit contributions
e-0.076
 ha-0.058
ñ-0.054
 weep-0.052
 read-0.051
Token with
Token ID351
Feature activation+1.06e+00

Pos logit contributions
e+0.005
 ha+0.004
 weep+0.003
 read+0.003
ñ+0.003
Neg logit contributions
lights-0.005
orting-0.005
udd-0.005
ires-0.004
angles-0.004
Token are
Token ID389
Feature activation+3.46e+00

Pos logit contributions
e+0.036
 Amen+0.033
years+0.033
nam+0.032
tail+0.032
Neg logit contributions
s-0.014
udd-0.012
angles-0.012
lights-0.011
ires-0.011
Token you
Token ID345
Feature activation+7.78e-01

Pos logit contributions
e+0.098
 ha+0.075
 read+0.071
 weep+0.068
ñ+0.068
Neg logit contributions
lights-0.094
orting-0.083
udd-0.083
ires-0.081
angles-0.080
Token doing
Token ID1804
Feature activation+7.77e-01

Pos logit contributions
e+0.021
ñ+0.017
 Amen+0.017
tail+0.017
 weep+0.016
Neg logit contributions
udd-0.018
orting-0.017
ires-0.017
angles-0.016
falls-0.016
Token?"
Token ID1701
Feature activation+3.12e+00

Pos logit contributions
e+0.020
years+0.016
nam+0.016
 weep+0.015
ñ+0.015
Neg logit contributions
lights-0.020
orting-0.019
ires-0.019
falls-0.018
angles-0.018
Token A
Token ID317
Feature activation+2.55e+00

Pos logit contributions
e+0.085
 ha+0.064
ñ+0.060
 read+0.060
 weep+0.058
Neg logit contributions
lights-0.088
ires-0.076
orting-0.076
udd-0.075
s-0.075
Token loud
Token ID7812
Feature activation+2.26e+00

Pos logit contributions
e+0.081
nam+0.059
ñ+0.058
 read+0.058
 weep+0.057
Neg logit contributions
lights-0.061
s-0.053
udd-0.049
uddle-0.049
falls-0.048
Token voice
Token ID3809
Feature activation+1.85e+00

Pos logit contributions
e+0.060
nam+0.048
 ha+0.045
 read+0.042
years+0.042
Neg logit contributions
lights-0.061
s-0.054
orting-0.054
falls-0.051
udd-0.051
Token shouted
Token ID17293
Feature activation-1.89e+00

Pos logit contributions
e+0.056
 ha+0.044
 read+0.041
ñ+0.041
 weep+0.040
Neg logit contributions
lights-0.046
orting-0.040
udd-0.039
ires-0.039
s-0.038
Token.
Token ID13
Feature activation+2.68e-01

Pos logit contributions
lights+0.046
orting+0.040
udd+0.039
s+0.039
ires+0.038
Neg logit contributions
e-0.059
 ha-0.046
 read-0.045
ñ-0.043
 weep-0.043
Token It
Token ID632
Feature activation-3.95e+00

Pos logit contributions
e+0.008
 ha+0.007
 read+0.006
 named+0.006
 weep+0.006
Neg logit contributions
lights-0.006
udd-0.006
orting-0.006
angles-0.005
ires-0.005
Token was
Token ID373
Feature activation+1.98e+00

Pos logit contributions
lights+0.102
s+0.087
udd+0.087
falls+0.086
orting+0.085
Neg logit contributions
e-0.116
 ha-0.090
ñ-0.083
nam-0.082
 weep-0.082
Token Tom
Token ID4186
Feature activation+3.12e-01

Pos logit contributions
lights+0.018
orting+0.016
udd+0.016
ires+0.016
angles+0.016
Neg logit contributions
e-0.014
 ha-0.010
 read-0.009
 weep-0.009
ñ-0.009
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
e+0.009
 ha+0.007
 read+0.006
 weep+0.006
 named+0.006
Neg logit contributions
lights-0.009
udd-0.008
orting-0.007
ires-0.007
angles-0.007
Token "
Token ID366
Feature activation+1.77e+00

Pos logit contributions
lights+0.033
udd+0.028
angles+0.028
orting+0.028
rows+0.027
Neg logit contributions
e-0.030
 read-0.024
 ha-0.024
 named-0.023
bug-0.022
TokenIt
Token ID1026
Feature activation-5.11e+00

Pos logit contributions
e+0.047
 ha+0.035
 read+0.033
 weep+0.032
ñ+0.031
Neg logit contributions
lights-0.052
orting-0.046
udd-0.045
ires-0.044
angles-0.044
Token's
Token ID338
Feature activation-3.70e-01

Pos logit contributions
lights+0.153
orting+0.134
ires+0.132
udd+0.130
angles+0.129
Neg logit contributions
e-0.136
 ha-0.102
 read-0.099
 weep-0.095
 comb-0.092
Token okay
Token ID8788
Feature activation+6.70e-01

Pos logit contributions
lights+0.009
orting+0.008
udd+0.008
ires+0.008
angles+0.008
Neg logit contributions
e-0.011
 ha-0.009
ñ-0.008
 read-0.008
 weep-0.008
Token,"
Token ID553
Feature activation-3.45e-01

Pos logit contributions
e+0.021
 ha+0.016
 read+0.015
nam+0.015
 weep+0.015
Neg logit contributions
lights-0.016
s-0.014
orting-0.014
udd-0.014
ires-0.013
Token she
Token ID673
Feature activation-2.86e+00

Pos logit contributions
lights+0.009
udd+0.008
orting+0.008
ires+0.007
angles+0.007
Neg logit contributions
e-0.010
 ha-0.008
 read-0.008
 weep-0.008
ñ-0.007
Token said
Token ID531
Feature activation-3.66e+00

Pos logit contributions
lights+0.068
udd+0.060
orting+0.059
s+0.057
angles+0.056
Neg logit contributions
e-0.090
 ha-0.071
 read-0.066
 weep-0.065
ñ-0.064
Token,
Token ID11
Feature activation-4.80e-01

Pos logit contributions
lights+0.100
udd+0.088
orting+0.087
angles+0.086
ires+0.086
Neg logit contributions
e-0.104
 ha-0.079
 read-0.075
 weep-0.073
ñ-0.071
Token "
Token ID366
Feature activation+1.23e+00

Pos logit contributions
lights+0.011
orting+0.010
udd+0.010
s+0.010
falls+0.009
Neg logit contributions
e-0.015
 ha-0.012
ñ-0.011
 read-0.010
 weep-0.010
Token scared
Token ID12008
Feature activation+1.40e+00

Pos logit contributions
e+0.035
 ha+0.026
tail+0.025
ñ+0.025
 Amen+0.023
Neg logit contributions
lights-0.029
udd-0.028
falls-0.027
orting-0.027
s-0.026
Token anymore
Token ID7471
Feature activation+7.66e-01

Pos logit contributions
e+0.040
 ha+0.031
 read+0.029
 weep+0.029
 named+0.028
Neg logit contributions
lights-0.038
udd-0.033
angles-0.032
orting-0.032
falls-0.032
Token.
Token ID13
Feature activation-1.87e-01

Pos logit contributions
e+0.023
 ha+0.018
 read+0.017
 weep+0.017
ñ+0.016
Neg logit contributions
lights-0.019
orting-0.017
udd-0.017
ires-0.016
s-0.016
Token They
Token ID1119
Feature activation-2.57e+00

Pos logit contributions
lights+0.004
udd+0.003
angles+0.003
orting+0.003
ires+0.003
Neg logit contributions
e-0.006
 named-0.005
 read-0.005
 ha-0.005
 lawyer-0.005
Token played
Token ID2826
Feature activation+1.13e-02

Pos logit contributions
lights+0.063
udd+0.058
orting+0.058
s+0.056
angles+0.055
Neg logit contributions
e-0.076
 ha-0.058
ñ-0.054
 weep-0.052
 read-0.051
Token with
Token ID351
Feature activation+8.94e-01

Pos logit contributions
e+0.000
 ha+0.000
 weep+0.000
 read+0.000
ñ+0.000
Neg logit contributions
lights-0.000
orting-0.000
udd-0.000
ires-0.000
angles-0.000
Token Gig
Token ID19525
Feature activation+4.20e-01

Pos logit contributions
e+0.019
 ha+0.013
 read+0.012
 weep+0.012
ñ+0.011
Neg logit contributions
lights-0.030
orting-0.027
udd-0.027
ires-0.027
angles-0.027
Tokeni
Token ID72
Feature activation+1.73e-01

Pos logit contributions
e+0.011
 ha+0.008
 read+0.007
ñ+0.007
 comb+0.007
Neg logit contributions
lights-0.013
orting-0.011
ires-0.011
angles-0.010
falls-0.010
Token all
Token ID477
Feature activation-2.41e+00

Pos logit contributions
e+0.005
 ha+0.004
 read+0.004
 weep+0.004
ñ+0.004
Neg logit contributions
lights-0.005
orting-0.004
udd-0.004
ires-0.004
angles-0.004
Token day
Token ID1110
Feature activation-6.57e-01

Pos logit contributions
lights+0.066
orting+0.058
udd+0.058
ires+0.056
angles+0.056
Neg logit contributions
e-0.069
 ha-0.052
 read-0.049
 weep-0.047
ñ-0.047
Token long
Token ID890
Feature activation-1.57e+00

Pos logit contributions
lights+0.019
orting+0.017
udd+0.017
angles+0.016
ires+0.016
Neg logit contributions
e-0.018
 ha-0.013
 read-0.013
 weep-0.012
 named-0.012
Token was
Token ID373
Feature activation+3.18e+00

Pos logit contributions
e+0.054
 ha+0.041
 read+0.039
 weep+0.037
 named+0.037
Neg logit contributions
lights-0.057
orting-0.050
udd-0.049
ires-0.049
angles-0.048
Token a
Token ID257
Feature activation+4.92e+00

Pos logit contributions
e+0.072
ñ+0.070
 fright+0.059
 ha+0.059
 weep+0.057
Neg logit contributions
udd-0.067
lights-0.065
ires-0.064
 they-0.062
s-0.061
Token little
Token ID1310
Feature activation+4.35e+00

Pos logit contributions
e+0.159
ez+0.111
ñ+0.110
nam+0.108
 weep+0.107
Neg logit contributions
s-0.096
udd-0.092
lights-0.088
 new-0.080
 they-0.080
Token girl
Token ID2576
Feature activation+6.00e+00

Pos logit contributions
e+0.122
 ha+0.088
 read+0.083
ñ+0.083
 comb+0.077
Neg logit contributions
lights-0.119
udd-0.110
s-0.109
orting-0.105
ires-0.099
Token.
Token ID13
Feature activation+9.04e-01

Pos logit contributions
e+0.169
 ha+0.144
nam+0.133
ñ+0.133
 weep+0.131
Neg logit contributions
lights-0.138
s-0.135
udd-0.123
orting-0.122
der-0.109
Token Every
Token ID3887
Feature activation+2.69e-01

Pos logit contributions
e+0.023
 read+0.022
 named+0.019
Anne+0.018
 3+0.017
Neg logit contributions
lights-0.022
uddle-0.021
ires-0.021
rows-0.021
orting-0.020
Token day
Token ID1110
Feature activation+2.49e-01

Pos logit contributions
e+0.008
 ha+0.006
 read+0.005
 weep+0.005
ñ+0.005
Neg logit contributions
lights-0.007
orting-0.007
udd-0.007
ires-0.006
angles-0.006
Token they
Token ID484
Feature activation-8.36e-01

Pos logit contributions
e+0.006
 ha+0.005
 read+0.005
 named+0.005
ñ+0.004
Neg logit contributions
lights-0.007
udd-0.007
ires-0.007
orting-0.006
falls-0.006
Token played
Token ID2826
Feature activation+9.43e-01

Pos logit contributions
lights+0.016
udd+0.015
s+0.014
angles+0.014
orting+0.014
Neg logit contributions
e-0.029
 ha-0.024
ñ-0.022
 weep-0.022
 x-0.022
Token in
Token ID287
Feature activation+3.22e+00

Pos logit contributions
e+0.036
 ha+0.031
 weep+0.030
nam+0.029
bug+0.029
Neg logit contributions
lights-0.014
orting-0.013
udd-0.012
s-0.011
ires-0.011
Token the
Token ID262
Feature activation-3.97e-01

Pos logit contributions
e+0.092
 ha+0.081
ñ+0.077
 Amen+0.073
 comb+0.070
Neg logit contributions
lights-0.075
s-0.064
angles-0.062
udd-0.061
orting-0.059
Token she
Token ID673
Feature activation-2.67e+00

Pos logit contributions
lights+0.031
orting+0.027
udd+0.027
ires+0.026
angles+0.026
Neg logit contributions
e-0.036
 ha-0.028
 read-0.027
 weep-0.026
 comb-0.025
Token forgot
Token ID16453
Feature activation-3.37e+00

Pos logit contributions
lights+0.069
udd+0.065
s+0.063
orting+0.062
falls+0.059
Neg logit contributions
e-0.072
 ha-0.054
ñ-0.051
nam-0.051
 Amen-0.050
Token all
Token ID477
Feature activation-2.36e+00

Pos logit contributions
lights+0.097
orting+0.085
angles+0.084
udd+0.084
ires+0.083
Neg logit contributions
e-0.089
 ha-0.068
 read-0.068
 named-0.062
 weep-0.062
Token about
Token ID546
Feature activation+3.32e-01

Pos logit contributions
lights+0.050
orting+0.042
udd+0.042
ires+0.040
angles+0.040
Neg logit contributions
e-0.082
 ha-0.066
 read-0.064
 weep-0.062
ñ-0.061
Token her
Token ID607
Feature activation-1.53e+00

Pos logit contributions
e+0.007
 ha+0.005
 read+0.005
ñ+0.005
 weep+0.004
Neg logit contributions
lights-0.011
orting-0.010
udd-0.010
ires-0.010
falls-0.009
Token cell
Token ID2685
Feature activation-1.03e+00

Pos logit contributions
lights+0.045
orting+0.040
udd+0.040
ires+0.039
angles+0.038
Neg logit contributions
e-0.040
 ha-0.030
 read-0.028
 weep-0.027
ñ-0.026
Token.
Token ID13
Feature activation-1.63e+00

Pos logit contributions
lights+0.029
ires+0.027
angles+0.026
orting+0.026
udd+0.026
Neg logit contributions
e-0.027
 ha-0.020
 read-0.020
 comb-0.019
 weep-0.018
Token
Token ID198
Feature activation-3.71e+00

Pos logit contributions
lights+0.047
orting+0.040
udd+0.040
falls+0.039
angles+0.038
Neg logit contributions
e-0.046
 read-0.035
 ha-0.034
 named-0.031
 weep-0.030
Token
Token ID198
Feature activation-2.81e+00

Pos logit contributions
lights+0.117
orting+0.102
udd+0.102
falls+0.100
angles+0.099
Neg logit contributions
e-0.093
 ha-0.066
 read-0.065
 named-0.059
 weep-0.059
TokenHowever
Token ID4864
Feature activation-1.94e+00

Pos logit contributions
lights+0.074
orting+0.065
udd+0.064
ires+0.062
angles+0.062
Neg logit contributions
e-0.082
 ha-0.064
 read-0.060
 weep-0.058
ñ-0.057
Token Lisa
Token ID15378
Feature activation-1.51e+00

Pos logit contributions
lights+0.055
orting+0.048
udd+0.048
ires+0.047
angles+0.046
Neg logit contributions
e-0.053
 ha-0.040
 read-0.038
 weep-0.036
ñ-0.036
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
lights+0.067
orting+0.061
udd+0.061
angles+0.059
ires+0.059
Neg logit contributions
e-0.065
 ha-0.049
 read-0.046
 comb-0.044
 named-0.044
Token Sara
Token ID24799
Feature activation+1.68e+00

Pos logit contributions
e+0.047
 ha+0.037
 read+0.035
 weep+0.033
 named+0.033
Neg logit contributions
lights-0.037
udd-0.033
orting-0.033
angles-0.031
ires-0.031
Token picked
Token ID6497
Feature activation-2.85e+00

Pos logit contributions
e+0.045
 ha+0.035
ñ+0.033
nam+0.032
 Amen+0.032
Neg logit contributions
lights-0.043
udd-0.042
s-0.040
orting-0.039
angles-0.038
Token it
Token ID340
Feature activation-4.37e+00

Pos logit contributions
lights+0.079
orting+0.071
udd+0.070
ires+0.069
angles+0.068
Neg logit contributions
e-0.078
 ha-0.059
 read-0.056
 weep-0.054
ñ-0.053
Token up
Token ID510
Feature activation-1.34e+00

Pos logit contributions
lights+0.115
udd+0.100
orting+0.098
ires+0.094
angles+0.094
Neg logit contributions
e-0.130
 ha-0.098
 read-0.098
 comb-0.092
 named-0.091
Token and
Token ID290
Feature activation-1.27e-01

Pos logit contributions
lights+0.033
orting+0.028
udd+0.028
ires+0.027
s+0.027
Neg logit contributions
e-0.042
 ha-0.033
 read-0.031
 weep-0.030
ñ-0.030
Token said
Token ID531
Feature activation-4.86e+00

Pos logit contributions
lights+0.004
orting+0.003
udd+0.003
ires+0.003
angles+0.003
Neg logit contributions
e-0.003
 ha-0.002
 read-0.002
 weep-0.002
ñ-0.002
Token,
Token ID11
Feature activation-1.04e+00

Pos logit contributions
lights+0.150
udd+0.132
orting+0.131
angles+0.128
ires+0.128
Neg logit contributions
e-0.123
 ha-0.094
 read-0.085
 named-0.081
 weep-0.080
Token "
Token ID366
Feature activation+2.51e+00

Pos logit contributions
lights+0.027
udd+0.023
angles+0.023
orting+0.023
ires+0.022
Neg logit contributions
e-0.031
 ha-0.025
 read-0.024
 named-0.023
 comb-0.023
TokenLook
Token ID8567
Feature activation-3.01e+00

Pos logit contributions
e+0.068
 ha+0.053
 weep+0.051
 read+0.049
ñ+0.049
Neg logit contributions
lights-0.064
orting-0.060
udd-0.059
s-0.058
ires-0.057
Token,
Token ID11
Feature activation+1.33e+00

Pos logit contributions
lights+0.065
udd+0.063
s+0.062
orting+0.061
angles+0.060
Neg logit contributions
e-0.092
 weep-0.072
 Amen-0.072
 read-0.070
 ha-0.070
Token outside
Token ID2354
Feature activation+1.23e+00

Pos logit contributions
e+0.025
 read+0.019
 ha+0.018
 named+0.018
 weep+0.016
Neg logit contributions
lights-0.033
ires-0.029
angles-0.029
udd-0.028
orting-0.028
Token to
Token ID284
Feature activation+2.75e+00

Pos logit contributions
e+0.038
 ha+0.030
 read+0.028
ñ+0.028
 weep+0.027
Neg logit contributions
lights-0.030
udd-0.026
orting-0.026
s-0.025
ires-0.025
Token play
Token ID711
Feature activation+1.75e+00

Pos logit contributions
e+0.074
ñ+0.060
 ha+0.059
nam+0.058
Mic+0.057
Neg logit contributions
lights-0.060
orting-0.058
udd-0.056
falls-0.056
s-0.054
Token.
Token ID13
Feature activation-6.32e-01

Pos logit contributions
e+0.055
 ha+0.045
nam+0.043
 weep+0.043
ñ+0.042
Neg logit contributions
lights-0.039
orting-0.035
udd-0.034
s-0.033
ires-0.030
Token They
Token ID1119
Feature activation-2.60e+00

Pos logit contributions
lights+0.012
orting+0.010
udd+0.010
angles+0.010
ires+0.009
Neg logit contributions
e-0.023
 ha-0.019
 read-0.019
 weep-0.018
 named-0.018
Token skipped
Token ID26684
Feature activation-7.07e-01

Pos logit contributions
lights+0.071
udd+0.068
s+0.065
angles+0.064
orting+0.063
Neg logit contributions
e-0.068
 ha-0.051
ñ-0.049
 Amen-0.047
 weep-0.046
Token and
Token ID290
Feature activation-1.33e-02

Pos logit contributions
lights+0.020
orting+0.017
ires+0.017
udd+0.017
angles+0.017
Neg logit contributions
e-0.020
 ha-0.015
 read-0.014
 named-0.014
 weep-0.013
Token hopped
Token ID45230
Feature activation-1.41e+00

Pos logit contributions
lights+0.000
s+0.000
udd+0.000
falls+0.000
orting+0.000
Neg logit contributions
e-0.000
 x-0.000
 ha-0.000
 lawyer-0.000
 Amen-0.000
Token,
Token ID11
Feature activation-2.86e-02

Pos logit contributions
lights+0.041
udd+0.036
ires+0.036
angles+0.036
orting+0.036
Neg logit contributions
e-0.038
 ha-0.028
 read-0.027
 named-0.026
 weep-0.024
Token just
Token ID655
Feature activation-1.95e+00

Pos logit contributions
lights+0.001
orting+0.001
ires+0.001
udd+0.001
angles+0.001
Neg logit contributions
e-0.001
 ha-0.000
 read-0.000
 named-0.000
 weep-0.000
Token like
Token ID588
Feature activation+2.48e+00

Pos logit contributions
lights+0.062
orting+0.056
udd+0.055
ires+0.054
angles+0.053
Neg logit contributions
e-0.046
 ha-0.033
 read-0.031
ñ-0.029
 weep-0.029
Token she
Token ID673
Feature activation-1.54e+00

Pos logit contributions
e+0.070
 ha+0.054
 read+0.052
 weep+0.049
 named+0.049
Neg logit contributions
lights-0.067
orting-0.060
udd-0.059
ires-0.058
angles-0.057
Token asked
Token ID1965
Feature activation-3.03e+00

Pos logit contributions
s+0.035
lights+0.035
udd+0.035
orting+0.032
angles+0.031
Neg logit contributions
e-0.044
 Amen-0.031
 ha-0.030
 weep-0.030
nam-0.030
Token her
Token ID607
Feature activation-1.74e+00

Pos logit contributions
lights+0.087
orting+0.084
ires+0.080
udd+0.079
s+0.078
Neg logit contributions
e-0.076
 ha-0.055
ñ-0.052
 weep-0.050
 Amen-0.049
Token mom
Token ID1995
Feature activation+3.53e+00

Pos logit contributions
lights+0.048
udd+0.042
orting+0.042
ires+0.041
angles+0.040
Neg logit contributions
e-0.050
 ha-0.037
 read-0.035
 weep-0.034
ñ-0.034
Token.
Token ID13
Feature activation-8.23e-01

Pos logit contributions
e+0.087
 read+0.070
 ha+0.066
 named+0.064
bug+0.062
Neg logit contributions
lights-0.110
udd-0.092
ires-0.091
falls-0.090
uddle-0.090
Token Her
Token ID2332
Feature activation-2.32e+00

Pos logit contributions
lights+0.025
udd+0.022
orting+0.022
angles+0.021
falls+0.021
Neg logit contributions
e-0.021
 ha-0.016
 read-0.016
 named-0.014
 weep-0.014
Token mom
Token ID1995
Feature activation+2.96e+00

Pos logit contributions
lights+0.058
udd+0.052
orting+0.049
ires+0.049
s+0.049
Neg logit contributions
e-0.073
 ha-0.053
ñ-0.051
 read-0.051
 weep-0.050
Token explained
Token ID4893
Feature activation-4.74e+00

Pos logit contributions
e+0.088
 ha+0.067
 weep+0.062
 read+0.062
ñ+0.062
Neg logit contributions
lights-0.074
udd-0.067
orting-0.067
s-0.064
angles-0.063
Token that
Token ID326
Feature activation-3.39e+00

Pos logit contributions
lights+0.097
orting+0.083
udd+0.082
ires+0.078
angles+0.077
Neg logit contributions
e-0.166
 ha-0.134
 read-0.128
 weep-0.125
ñ-0.124
Token it
Token ID340
Feature activation-5.65e+00

Pos logit contributions
lights+0.088
orting+0.077
udd+0.077
ires+0.074
angles+0.074
Neg logit contributions
e-0.100
 ha-0.077
 read-0.073
 weep-0.070
ñ-0.070
Token was
Token ID373
Feature activation+1.62e+00

Pos logit contributions
lights+0.165
orting+0.147
udd+0.146
ires+0.143
angles+0.140
Neg logit contributions
e-0.150
 ha-0.113
 read-0.109
 weep-0.103
 comb-0.101
Token She
Token ID1375
Feature activation-3.60e+00

Pos logit contributions
lights+0.051
orting+0.050
udd+0.050
angles+0.048
uddle+0.046
Neg logit contributions
e-0.042
 Amen-0.034
 read-0.033
 3-0.031
 named-0.030
Token was
Token ID373
Feature activation+7.91e-01

Pos logit contributions
lights+0.108
orting+0.096
udd+0.094
ires+0.093
angles+0.092
Neg logit contributions
e-0.094
 ha-0.069
 read-0.066
 weep-0.062
 named-0.061
Token very
Token ID845
Feature activation+2.86e+00

Pos logit contributions
e+0.021
ñ+0.016
 ha+0.016
 read+0.015
 weep+0.015
Neg logit contributions
lights-0.021
udd-0.019
orting-0.018
ires-0.018
s-0.017
Token happy
Token ID3772
Feature activation-1.36e+00

Pos logit contributions
e+0.073
 ha+0.054
 read+0.050
 weep+0.048
ñ+0.047
Neg logit contributions
lights-0.086
orting-0.077
udd-0.077
ires-0.075
angles-0.075
Token and
Token ID290
Feature activation+3.32e-01

Pos logit contributions
lights+0.040
angles+0.037
udd+0.037
orting+0.037
ires+0.035
Neg logit contributions
e-0.035
 read-0.026
 ha-0.026
 named-0.024
 comb-0.024
Token said
Token ID531
Feature activation-4.30e+00

Pos logit contributions
e+0.008
 read+0.006
 ha+0.006
 named+0.005
 weep+0.005
Neg logit contributions
lights-0.011
orting-0.010
angles-0.010
ires-0.009
udd-0.009
Token,
Token ID11
Feature activation-9.64e-01

Pos logit contributions
lights+0.130
udd+0.117
orting+0.115
angles+0.113
ires+0.112
Neg logit contributions
e-0.111
 ha-0.085
 read-0.078
 weep-0.073
 named-0.072
Token "
Token ID366
Feature activation+9.01e-01

Pos logit contributions
lights+0.027
udd+0.024
angles+0.024
orting+0.023
uddle+0.023
Neg logit contributions
e-0.027
 read-0.022
 ha-0.021
 named-0.020
 comb-0.020
TokenLet
Token ID5756
Feature activation+1.55e+00

Pos logit contributions
e+0.024
 ha+0.019
 weep+0.018
 read+0.017
ñ+0.017
Neg logit contributions
lights-0.025
udd-0.022
orting-0.022
ires-0.021
angles-0.021
Token's
Token ID338
Feature activation+6.33e-01

Pos logit contributions
e+0.045
 ha+0.034
 read+0.033
 weep+0.031
ñ+0.031
Neg logit contributions
lights-0.042
orting-0.037
udd-0.037
ires-0.036
angles-0.035
Token go
Token ID467
Feature activation+2.17e+00

Pos logit contributions
e+0.017
ñ+0.014
nam+0.013
 ha+0.013
years+0.013
Neg logit contributions
lights-0.015
udd-0.013
orting-0.013
angles-0.013
falls-0.013
Token tower
Token ID10580
Feature activation+1.38e+00

Pos logit contributions
e+0.050
 ha+0.037
 read+0.035
ñ+0.033
 weep+0.033
Neg logit contributions
lights-0.052
udd-0.045
orting-0.045
ires-0.044
s-0.044
Token out
Token ID503
Feature activation+5.70e-02

Pos logit contributions
e+0.037
 read+0.026
 ha+0.025
bug+0.023
 weep+0.023
Neg logit contributions
lights-0.039
orting-0.037
ires-0.036
udd-0.036
angles-0.035
Token of
Token ID286
Feature activation+3.91e-02

Pos logit contributions
e+0.002
 read+0.001
 ha+0.001
 named+0.001
 weep+0.001
Neg logit contributions
lights-0.001
orting-0.001
udd-0.001
angles-0.001
ires-0.001
Token them
Token ID606
Feature activation-2.97e+00

Pos logit contributions
e+0.001
 ha+0.001
 read+0.001
 weep+0.001
ñ+0.001
Neg logit contributions
lights-0.001
orting-0.001
udd-0.001
ires-0.001
angles-0.001
Token.
Token ID13
Feature activation-9.56e-01

Pos logit contributions
lights+0.090
orting+0.080
udd+0.079
angles+0.079
ires+0.078
Neg logit contributions
e-0.078
 read-0.056
 ha-0.055
 weep-0.051
 named-0.049
Token It
Token ID632
Feature activation-4.93e+00

Pos logit contributions
lights+0.023
udd+0.020
orting+0.019
angles+0.019
ires+0.018
Neg logit contributions
e-0.031
 ha-0.024
 read-0.023
 named-0.023
 weep-0.022
Token was
Token ID373
Feature activation+1.14e+00

Pos logit contributions
lights+0.155
orting+0.144
ires+0.140
udd+0.139
angles+0.135
Neg logit contributions
e-0.122
 ha-0.088
 read-0.086
 weep-0.079
 named-0.077
Token a
Token ID257
Feature activation+3.23e+00

Pos logit contributions
e+0.025
 ha+0.018
 read+0.016
 weep+0.015
ñ+0.015
Neg logit contributions
lights-0.038
orting-0.034
udd-0.034
ires-0.033
angles-0.033
Token fun
Token ID1257
Feature activation+9.53e-01

Pos logit contributions
e+0.097
 ha+0.070
ñ+0.067
 read+0.067
 weep+0.066
Neg logit contributions
lights-0.079
udd-0.070
orting-0.066
s-0.066
uddle-0.065
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
e+0.027
 ha+0.020
 read+0.019
 weep+0.018
ñ+0.018
Neg logit contributions
lights-0.026
orting-0.023
udd-0.023
ires-0.023
angles-0.022
Token delicious
Token ID12625
Feature activation-2.16e-01

Pos logit contributions
e+0.042
 ha+0.032
 read+0.031
 named+0.029
 weep+0.028
Neg logit contributions
lights-0.048
orting-0.045
udd-0.044
ires-0.043
angles-0.043
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
e+0.030
 ha+0.022
 read+0.021
 named+0.020
 weep+0.020
Neg logit contributions
lights-0.034
orting-0.030
udd-0.030
angles-0.030
ires-0.029
Token just
Token ID655
Feature activation-7.86e-01

Pos logit contributions
e+0.027
 ha+0.020
 read+0.019
 weep+0.018
 named+0.017
Neg logit contributions
lights-0.035
udd-0.031
orting-0.031
angles-0.030
ires-0.030
Token like
Token ID588
Feature activation+2.53e+00

Pos logit contributions
lights+0.025
orting+0.022
udd+0.022
ires+0.022
angles+0.022
Neg logit contributions
e-0.019
 ha-0.013
 read-0.012
 weep-0.012
ñ-0.012
Token you
Token ID345
Feature activation+7.21e-01

Pos logit contributions
e+0.064
 ha+0.047
 read+0.044
 weep+0.042
ñ+0.041
Neg logit contributions
lights-0.077
udd-0.068
orting-0.068
ires-0.066
angles-0.066
Token.
Token ID13
Feature activation+7.84e-01

Pos logit contributions
e+0.021
 ha+0.016
 read+0.015
 weep+0.014
 named+0.014
Neg logit contributions
lights-0.020
orting-0.017
udd-0.017
ires-0.017
angles-0.017
Token It
Token ID632
Feature activation-4.19e+00

Pos logit contributions
e+0.024
 ha+0.019
 read+0.018
ñ+0.018
 weep+0.017
Neg logit contributions
lights-0.018
orting-0.017
ires-0.016
udd-0.016
angles-0.015
Token breat
Token ID4882
Feature activation+9.22e-01

Pos logit contributions
lights+0.094
orting+0.079
udd+0.077
angles+0.075
ires+0.074
Neg logit contributions
e-0.145
 ha-0.115
 read-0.111
 weep-0.108
 named-0.106
Tokenhes
Token ID956
Feature activation-1.84e+00

Pos logit contributions
e+0.031
 ha+0.026
 weep+0.026
 read+0.025
nam+0.025
Neg logit contributions
lights-0.018
ires-0.017
orting-0.017
udd-0.016
angles-0.015
Token and
Token ID290
Feature activation+1.80e+00

Pos logit contributions
lights+0.055
udd+0.049
ires+0.049
orting+0.049
angles+0.047
Neg logit contributions
e-0.050
 ha-0.034
 read-0.032
 weep-0.032
 named-0.031
Token feels
Token ID5300
Feature activation-8.84e-01

Pos logit contributions
e+0.045
 ha+0.032
 read+0.031
 weep+0.029
 comb+0.028
Neg logit contributions
lights-0.056
orting-0.051
udd-0.050
angles-0.049
ires-0.049
Token and
Token ID290
Feature activation+5.57e-01

Pos logit contributions
lights+0.029
orting+0.026
udd+0.026
angles+0.026
ires+0.025
Neg logit contributions
e-0.020
 ha-0.014
 read-0.013
 weep-0.013
 named-0.012
Token then
Token ID788
Feature activation-2.73e-01

Pos logit contributions
lights+0.016
udd+0.015
orting+0.014
angles+0.014
ires+0.014
Neg logit contributions
e-0.013
 ha-0.010
 read-0.009
 weep-0.009
 named-0.008
Token,
Token ID11
Feature activation+1.54e-01

Pos logit contributions
lights+0.008
udd+0.007
orting+0.007
ires+0.007
angles+0.007
Neg logit contributions
e-0.008
 ha-0.006
 read-0.005
 weep-0.005
ñ-0.005
Token one
Token ID530
Feature activation-3.14e+00

Pos logit contributions
e+0.004
 ha+0.003
 read+0.003
 weep+0.003
 named+0.003
Neg logit contributions
lights-0.005
udd-0.004
orting-0.004
angles-0.004
ires-0.004
Token lady
Token ID10846
Feature activation+2.54e+00

Pos logit contributions
lights+0.064
s+0.053
orting+0.052
udd+0.052
ires+0.051
Neg logit contributions
e-0.107
 ha-0.086
ñ-0.084
 read-0.082
 weep-0.081
Token said
Token ID531
Feature activation-4.09e+00

Pos logit contributions
e+0.065
 ha+0.047
 weep+0.044
 read+0.044
ñ+0.043
Neg logit contributions
lights-0.074
udd-0.067
orting-0.066
s-0.065
angles-0.063
Token she
Token ID673
Feature activation-4.59e+00

Pos logit contributions
lights+0.097
orting+0.084
udd+0.082
ires+0.080
falls+0.079
Neg logit contributions
e-0.130
 ha-0.101
 read-0.097
 weep-0.095
ñ-0.093
Token had
Token ID550
Feature activation-1.82e+00

Pos logit contributions
lights+0.129
udd+0.114
orting+0.114
ires+0.110
angles+0.110
Neg logit contributions
e-0.125
 ha-0.095
 read-0.089
 weep-0.086
ñ-0.085
Token some
Token ID617
Feature activation-1.35e+00

Pos logit contributions
lights+0.055
orting+0.049
udd+0.049
ires+0.047
angles+0.047
Neg logit contributions
e-0.046
 ha-0.034
 read-0.032
 weep-0.031
ñ-0.030
Token clean
Token ID3424
Feature activation-2.10e+00

Pos logit contributions
lights+0.039
orting+0.035
udd+0.034
ires+0.033
angles+0.033
Neg logit contributions
e-0.036
 ha-0.027
 read-0.025
 weep-0.024
ñ-0.024
Token pieces
Token ID5207
Feature activation-9.58e-01

Pos logit contributions
lights+0.062
orting+0.056
udd+0.056
ires+0.054
angles+0.053
Neg logit contributions
e-0.055
 ha-0.041
 read-0.038
 weep-0.036
ñ-0.036
Token of
Token ID286
Feature activation+1.24e+00

Pos logit contributions
lights+0.030
ires+0.027
udd+0.027
angles+0.027
orting+0.026
Neg logit contributions
e-0.023
 read-0.015
bug-0.014
 named-0.014
 comb-0.014
Token run
Token ID1057
Feature activation-9.73e-01

Pos logit contributions
lights+0.002
s+0.002
orting+0.002
angles+0.002
falls+0.002
Neg logit contributions
e-0.002
 ha-0.001
ñ-0.001
years-0.001
 lawyer-0.001
Token away
Token ID1497
Feature activation-2.43e+00

Pos logit contributions
lights+0.028
udd+0.025
orting+0.025
ires+0.024
angles+0.024
Neg logit contributions
e-0.026
 ha-0.020
 read-0.019
 weep-0.018
 named-0.018
Token.
Token ID13
Feature activation+3.26e-02

Pos logit contributions
lights+0.067
udd+0.060
orting+0.060
angles+0.059
ires+0.057
Neg logit contributions
e-0.068
 ha-0.052
 read-0.049
 weep-0.047
 named-0.046
Token They
Token ID1119
Feature activation-2.94e+00

Pos logit contributions
e+0.001
 ha+0.001
 named+0.001
 read+0.001
 weep+0.001
Neg logit contributions
lights-0.001
angles-0.001
udd-0.001
orting-0.001
falls-0.001
Token hope
Token ID2911
Feature activation-2.10e+00

Pos logit contributions
lights+0.082
udd+0.077
s+0.077
orting+0.077
falls+0.075
Neg logit contributions
e-0.070
 ha-0.050
ñ-0.050
nam-0.046
 lawyer-0.046
Token the
Token ID262
Feature activation-4.30e+00

Pos logit contributions
lights+0.070
udd+0.068
angles+0.067
ires+0.066
orting+0.064
Neg logit contributions
e-0.043
 ha-0.032
 read-0.032
 named-0.031
 weep-0.027
Token dog
Token ID3290
Feature activation+2.90e+00

Pos logit contributions
lights+0.145
orting+0.131
udd+0.131
ires+0.128
angles+0.127
Neg logit contributions
e-0.094
 ha-0.066
 read-0.061
 weep-0.057
ñ-0.056
Token won
Token ID1839
Feature activation-2.86e-01

Pos logit contributions
e+0.093
 ha+0.074
 read+0.072
 weep+0.068
 named+0.067
Neg logit contributions
lights-0.075
orting-0.059
udd-0.059
ires-0.059
angles-0.058
Token't
Token ID470
Feature activation+2.45e-01

Pos logit contributions
lights+0.007
s+0.007
orting+0.006
ires+0.006
udd+0.006
Neg logit contributions
e-0.008
nam-0.007
 ha-0.007
ñ-0.006
bug-0.006
Token catch
Token ID4929
Feature activation+1.64e+00

Pos logit contributions
e+0.006
 ha+0.004
ñ+0.003
nam+0.003
 read+0.003
Neg logit contributions
lights-0.008
orting-0.007
udd-0.007
s-0.007
falls-0.007
Token them
Token ID606
Feature activation-1.20e+00

Pos logit contributions
e+0.055
 ha+0.045
 read+0.043
ñ+0.042
 comb+0.041
Neg logit contributions
lights-0.034
orting-0.030
udd-0.029
s-0.029
ires-0.029