TOP ACTIVATIONS
MAX = 10.394

fun that he didn 't realize he had gone too far
But what he didn 't realize was that his mess was
fun that they didn 't realize how dangerous it was to
fun that they didn 't realize it was getting dark .
it . I didn 't realize it was under the bench
things that he didn 't realize how tired he was getting
. Sometimes we don 't realize how much nicer things are
. But Sally didn 't realize how costly the items were
careless that he didn 't realize it until it was too
that he didn 't even realize that he was getting closer
fun that she didn 't realize just how wet she was
all that she didn 't realize that the chamber was starting
after it and didn 't realize she was getting far from
, but she didn 't realize how far she had gone
brave and he didn 't realize how fierce the animals in
busy that he didn 't realize how hot it was inside
noises that he didn 't realize he was getting very wet
mistake . He didn 't realize there was something else in
ducks that he did not realize he had left the drain
, but she didn 't realize that it would make her

MIN ACTIVATIONS
MIN = -12.601

But the ost ric hes did not like that
cried . " The ost ric hes are mean ! They
. They were called ost ric hes . Lily and Ben
shouted . " The ost ric hes are so funny !"
ostr ich pen . Ost ric hes are big birds with
ostr ich pen . Ost ric hes are big birds with
a big box of ch ocol ates ! <|endoftext|> Once upon
and looked at the ost ric hes . They saw one
L ily , these ch ocol ates are bitter ," he
too close . The ost ric hes might pe ck you
a big box of ch ocol ates !
find a box of ch ocol ates . "
. One of the ost ric hes looked different from the
but be careful . Ost ric hes can be gr umpy
noticed something . The ch ocol ates were bitter . They
wanted to touch the ost ric hes . They reached out
was a box of ch ocol ates . Lily loved ch
bled up all the ch ocol ates , and she was
was a box of ch ocol ates ! Alice and her
. It sees other ost ric hes . It makes new

INTERVAL 4.645 - 10.394
CONTAINS 1.517%

said , " Thank you ! I promise I 'll only
his mom could trust the reliable veterinarian who could replace Dave
, " Okay , Tommy . We can play together ."
sky and make pretty colors . We can watch them tonight
I want you to be alert and watch out for cars

INTERVAL -1.104 - 4.645
CONTAINS 71.623%

The big boy does not listen . He laughs .
from flower to flower , collecting n ectar . It was
L ily and Ben thanked the fairy and hugged her .
is : We should be careful and nice with other people
Lily drank some water and lay down on the couch .

INTERVAL -6.853 - -1.104
CONTAINS 22.613%

grown up ! She ordered pizza and ice cream , her
okay , Billy . I 'm not hurt too bad .
little cub for the nice time he had and the cub
allows us to measure your temperature and make sure you 're
underneath , happy that their blanket was now resisting its wild

INTERVAL -12.601 - -6.853
CONTAINS 4.246%

give it to you ?" John looked down to
," Ben says . They finish their toast and
insisted they stayed outside . But the mom was
coming towards them . The bear and mouse were
made him happy . <|endoftext|> Once upon a time there was
Token fun
Token ID1257
Feature activation+1.54e-01

Pos logit contributions
+0.050
uch+0.047
irl+0.047
St+0.046
P+0.045
Neg logit contributions
 address-0.066
der-0.060
 Acc-0.060
 Tim-0.060
ly-0.060
Token that
Token ID326
Feature activation+4.40e+00

Pos logit contributions
 address+0.003
 Acc+0.003
person+0.002
 son+0.002
 apology+0.002
Neg logit contributions
-0.006
irl-0.006
uch-0.005
St-0.005
ink-0.005
Token he
Token ID339
Feature activation+6.73e-01

Pos logit contributions
 address+0.091
 apology+0.080
person+0.079
der+0.077
 Acc+0.075
Neg logit contributions
 time-0.154
irl-0.154
Once-0.152
uch-0.150
St-0.147
Token didn
Token ID1422
Feature activation-1.19e+00

Pos logit contributions
 Tim+0.014
 address+0.013
 arrow+0.013
ly+0.013
 cries+0.013
Neg logit contributions
-0.024
ink-0.022
icer-0.022
uch-0.022
 total-0.022
Token't
Token ID470
Feature activation-8.87e-01

Pos logit contributions
+0.020
uch+0.020
P+0.020
St+0.019
irl+0.019
Neg logit contributions
 address-0.043
 Tim-0.042
der-0.041
 cries-0.041
 arrow-0.040
Token realize
Token ID6537
Feature activation+8.84e+00

Pos logit contributions
+0.026
St+0.025
uch+0.025
irl+0.025
P+0.024
Neg logit contributions
 address-0.023
 Acc-0.021
der-0.020
 Tim-0.020
 cries-0.019
Token he
Token ID339
Feature activation+8.96e-01

Pos logit contributions
 address+0.167
person+0.161
 apology+0.143
 writes+0.142
ly+0.142
Neg logit contributions
irl-0.302
St-0.295
-0.295
if-0.295
 total-0.280
Token had
Token ID550
Feature activation-5.36e-02

Pos logit contributions
 address+0.018
 Tim+0.018
 cries+0.017
ly+0.017
 arrow+0.017
Neg logit contributions
-0.031
P-0.029
St-0.029
uch-0.029
irl-0.029
Token gone
Token ID3750
Feature activation+1.90e+00

Pos logit contributions
+0.002
ink+0.002
uch+0.002
icer+0.002
P+0.002
Neg logit contributions
 address-0.001
 Tim-0.001
 arrow-0.001
 that-0.001
 cries-0.001
Token too
Token ID1165
Feature activation-2.63e+00

Pos logit contributions
 address+0.034
 Tim+0.031
der+0.029
 cries+0.029
ly+0.029
Neg logit contributions
-0.071
uch-0.069
St-0.068
irl-0.068
P-0.067
Token far
Token ID1290
Feature activation+5.59e-01

Pos logit contributions
+0.086
uch+0.083
St+0.081
irl+0.081
ink+0.080
Neg logit contributions
 Tim-0.059
 address-0.057
 cries-0.055
 arrow-0.054
 inside-0.053
Token But
Token ID887
Feature activation+8.01e-01

Pos logit contributions
 address+0.062
 Tim+0.054
der+0.053
ly+0.053
 arrow+0.052
Neg logit contributions
-0.140
uch-0.136
St-0.135
irl-0.134
P-0.132
Token what
Token ID644
Feature activation+1.67e+00

Pos logit contributions
 address+0.014
ly+0.012
der+0.012
 cries+0.012
person+0.012
Neg logit contributions
-0.030
uch-0.029
irl-0.029
St-0.029
P-0.028
Token he
Token ID339
Feature activation+4.16e-01

Pos logit contributions
 address+0.034
 Tim+0.031
der+0.030
ly+0.029
 arrow+0.029
Neg logit contributions
-0.058
uch-0.057
St-0.056
irl-0.056
P-0.055
Token didn
Token ID1422
Feature activation-1.16e+00

Pos logit contributions
 address+0.009
 Tim+0.009
 arrow+0.008
 cries+0.008
ly+0.008
Neg logit contributions
-0.014
St-0.013
uch-0.013
irl-0.013
P-0.013
Token't
Token ID470
Feature activation-1.00e+00

Pos logit contributions
+0.020
uch+0.019
St+0.019
P+0.019
irl+0.018
Neg logit contributions
 address-0.042
 Tim-0.041
der-0.040
 cries-0.040
 arrow-0.039
Token realize
Token ID6537
Feature activation+8.81e+00

Pos logit contributions
uch+0.027
irl+0.027
 time+0.027
St+0.027
Once+0.027
Neg logit contributions
 address-0.024
der-0.023
 cries-0.023
 Acc-0.023
person-0.022
Token was
Token ID373
Feature activation-2.68e+00

Pos logit contributions
 address+0.125
ly+0.104
person+0.104
 cries+0.102
 arrow+0.100
Neg logit contributions
-0.352
uch-0.345
irl-0.344
St-0.342
if-0.336
Token that
Token ID326
Feature activation+4.02e+00

Pos logit contributions
uch+0.086
+0.085
irl+0.084
St+0.082
P+0.082
Neg logit contributions
 address-0.058
person-0.053
der-0.052
ly-0.052
 cries-0.051
Token his
Token ID465
Feature activation+8.44e-01

Pos logit contributions
 address+0.074
ly+0.068
person+0.067
der+0.066
 Acc+0.063
Neg logit contributions
uch-0.140
-0.139
irl-0.137
St-0.135
P-0.132
Token mess
Token ID2085
Feature activation-1.08e-01

Pos logit contributions
 address+0.016
der+0.015
ly+0.015
person+0.015
 Tim+0.015
Neg logit contributions
-0.029
uch-0.028
irl-0.028
St-0.028
 total-0.027
Token was
Token ID373
Feature activation-3.31e+00

Pos logit contributions
+0.004
uch+0.004
irl+0.004
St+0.004
ink+0.003
Neg logit contributions
 address-0.002
 Tim-0.002
der-0.002
 arrow-0.002
 cries-0.002
Token fun
Token ID1257
Feature activation+2.04e-01

Pos logit contributions
+0.057
uch+0.055
St+0.054
irl+0.054
P+0.053
Neg logit contributions
 address-0.065
 Tim-0.062
der-0.060
ly-0.059
 arrow-0.059
Token that
Token ID326
Feature activation+4.46e+00

Pos logit contributions
 address+0.004
 Acc+0.003
person+0.003
ly+0.003
der+0.003
Neg logit contributions
-0.008
irl-0.007
uch-0.007
P-0.007
St-0.007
Token they
Token ID484
Feature activation+1.04e+00

Pos logit contributions
 address+0.096
ly+0.087
person+0.086
 apology+0.084
der+0.084
Neg logit contributions
-0.151
irl-0.149
uch-0.149
St-0.145
 total-0.144
Token didn
Token ID1422
Feature activation-1.13e+00

Pos logit contributions
 address+0.022
 Tim+0.022
 arrow+0.020
ly+0.020
der+0.020
Neg logit contributions
-0.035
uch-0.033
St-0.033
ink-0.033
irl-0.033
Token't
Token ID470
Feature activation-9.52e-01

Pos logit contributions
+0.018
uch+0.017
St+0.017
P+0.017
irl+0.017
Neg logit contributions
 address-0.042
 Tim-0.042
der-0.040
 cries-0.040
 arrow-0.040
Token realize
Token ID6537
Feature activation+8.79e+00

Pos logit contributions
+0.028
St+0.027
uch+0.027
irl+0.027
P+0.027
Neg logit contributions
 address-0.024
der-0.022
 Acc-0.022
 cries-0.021
 Tim-0.021
Token how
Token ID703
Feature activation-1.38e+00

Pos logit contributions
person+0.177
 address+0.173
ly+0.160
 apology+0.158
 writes+0.158
Neg logit contributions
irl-0.287
-0.280
St-0.279
if-0.276
 total-0.267
Token dangerous
Token ID4923
Feature activation+5.87e+00

Pos logit contributions
+0.038
uch+0.037
irl+0.037
St+0.037
P+0.036
Neg logit contributions
 address-0.039
der-0.035
person-0.034
ly-0.034
 arrow-0.034
Token it
Token ID340
Feature activation+6.95e-01

Pos logit contributions
 address+0.112
 Acc+0.105
person+0.102
der+0.097
old+0.097
Neg logit contributions
-0.203
uch-0.202
irl-0.198
St-0.194
P-0.189
Token was
Token ID373
Feature activation-2.46e+00

Pos logit contributions
 Tim+0.013
 address+0.013
 cries+0.012
der+0.012
ly+0.012
Neg logit contributions
-0.025
St-0.024
P-0.023
irl-0.023
uch-0.023
Token to
Token ID284
Feature activation+9.55e-01

Pos logit contributions
+0.104
uch+0.102
St+0.101
irl+0.101
P+0.099
Neg logit contributions
 address-0.031
 Tim-0.026
der-0.025
ly-0.024
 arrow-0.024
Token fun
Token ID1257
Feature activation+1.82e-01

Pos logit contributions
+0.056
uch+0.054
St+0.053
irl+0.053
P+0.052
Neg logit contributions
 address-0.064
 Tim-0.060
der-0.059
ly-0.058
 arrow-0.058
Token that
Token ID326
Feature activation+4.46e+00

Pos logit contributions
 address+0.004
 Acc+0.003
person+0.003
ly+0.003
der+0.003
Neg logit contributions
-0.007
irl-0.006
uch-0.006
ink-0.006
P-0.006
Token they
Token ID484
Feature activation+1.05e+00

Pos logit contributions
 address+0.090
ly+0.078
der+0.078
person+0.078
 apology+0.075
Neg logit contributions
-0.160
uch-0.156
irl-0.155
St-0.153
P-0.149
Token didn
Token ID1422
Feature activation-1.12e+00

Pos logit contributions
 Tim+0.023
 address+0.022
 arrow+0.020
ly+0.020
 decide+0.020
Neg logit contributions
-0.036
uch-0.034
ink-0.034
St-0.034
irl-0.033
Token't
Token ID470
Feature activation-9.23e-01

Pos logit contributions
+0.018
uch+0.018
St+0.017
P+0.017
irl+0.017
Neg logit contributions
 address-0.041
 Tim-0.041
der-0.040
 cries-0.039
 arrow-0.038
Token realize
Token ID6537
Feature activation+8.74e+00

Pos logit contributions
+0.028
uch+0.028
St+0.027
irl+0.027
P+0.027
Neg logit contributions
 address-0.022
der-0.020
 Acc-0.020
 Tim-0.020
 cries-0.019
Token it
Token ID340
Feature activation+4.39e-01

Pos logit contributions
person+0.175
 address+0.169
ly+0.159
 cries+0.154
 writes+0.154
Neg logit contributions
irl-0.289
-0.281
St-0.280
if-0.276
 time-0.268
Token was
Token ID373
Feature activation-3.31e+00

Pos logit contributions
 address+0.009
 Tim+0.009
 cries+0.008
der+0.008
ly+0.008
Neg logit contributions
-0.015
St-0.015
uch-0.015
P-0.014
irl-0.014
Token getting
Token ID1972
Feature activation-2.05e+00

Pos logit contributions
+0.121
ink+0.113
uch+0.111
P+0.110
St+0.107
Neg logit contributions
 Tim-0.079
 for-0.073
 earlier-0.068
 inside-0.068
 his-0.067
Token dark
Token ID3223
Feature activation+2.62e+00

Pos logit contributions
+0.068
uch+0.064
ink+0.063
irl+0.062
St+0.061
Neg logit contributions
 Tim-0.047
 address-0.046
 cries-0.042
 arrow-0.042
 inside-0.042
Token.
Token ID13
Feature activation+3.77e+00

Pos logit contributions
 address+0.050
 apology+0.045
 Acc+0.044
person+0.043
old+0.041
Neg logit contributions
-0.094
uch-0.091
ink-0.090
irl-0.089
if-0.088
Token it
Token ID340
Feature activation-3.02e-01

Pos logit contributions
 address+0.028
der+0.028
 Acc+0.027
person+0.026
ly+0.025
Neg logit contributions
P-0.046
uch-0.046
St-0.046
-0.045
irl-0.045
Token.
Token ID13
Feature activation+5.04e+00

Pos logit contributions
+0.011
St+0.011
irl+0.011
uch+0.011
P+0.011
Neg logit contributions
 address-0.005
 Tim-0.005
ly-0.005
der-0.005
 cries-0.005
Token I
Token ID314
Feature activation-7.02e-02

Pos logit contributions
 address+0.075
ly+0.067
der+0.065
 cries+0.064
 arrow+0.064
Neg logit contributions
-0.203
St-0.195
uch-0.193
irl-0.190
ink-0.188
Token didn
Token ID1422
Feature activation-1.18e+00

Pos logit contributions
+0.002
St+0.002
uch+0.002
P+0.002
irl+0.002
Neg logit contributions
 address-0.002
 Tim-0.002
 fix-0.001
 Acc-0.001
 say-0.001
Token't
Token ID470
Feature activation-1.03e+00

Pos logit contributions
+0.017
uch+0.016
irl+0.015
St+0.015
ink+0.014
Neg logit contributions
 address-0.049
 Tim-0.046
der-0.046
ly-0.046
 Acc-0.045
Token realize
Token ID6537
Feature activation+8.70e+00

Pos logit contributions
+0.034
uch+0.034
St+0.033
irl+0.033
P+0.033
Neg logit contributions
 address-0.022
 Tim-0.020
der-0.020
ly-0.019
 cries-0.019
Token it
Token ID340
Feature activation+2.71e-01

Pos logit contributions
 address+0.148
person+0.136
ly+0.130
 cries+0.125
 officer+0.122
Neg logit contributions
irl-0.314
-0.308
St-0.308
uch-0.307
if-0.302
Token was
Token ID373
Feature activation-3.28e+00

Pos logit contributions
 address+0.005
 Tim+0.005
ly+0.004
der+0.004
 cries+0.004
Neg logit contributions
-0.010
St-0.010
irl-0.010
P-0.010
uch-0.010
Token under
Token ID739
Feature activation+1.06e+00

Pos logit contributions
+0.125
uch+0.114
St+0.113
ink+0.113
P+0.113
Neg logit contributions
 Tim-0.068
 address-0.066
 for-0.063
 arrow-0.061
 inside-0.061
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
 address+0.017
 Tim+0.016
der+0.015
 cries+0.015
 arrow+0.015
Neg logit contributions
-0.041
uch-0.041
St-0.040
irl-0.040
P-0.039
Token bench
Token ID7624
Feature activation-1.63e+00

Pos logit contributions
 Acc+0.045
 address+0.042
 Tim+0.041
der+0.040
fully+0.040
Neg logit contributions
-0.035
St-0.033
uch-0.033
irl-0.032
 total-0.032
Token things
Token ID1243
Feature activation-2.83e+00

Pos logit contributions
 address+0.069
 Tim+0.065
der+0.063
 Acc+0.062
 cries+0.061
Neg logit contributions
-0.091
St-0.089
irl-0.089
uch-0.089
P-0.087
Token that
Token ID326
Feature activation+4.24e+00

Pos logit contributions
+0.110
uch+0.106
irl+0.105
St+0.105
ink+0.105
Neg logit contributions
 address-0.044
 Acc-0.039
 cries-0.036
 Tim-0.036
 son-0.035
Token he
Token ID339
Feature activation+3.66e-01

Pos logit contributions
 Tim+0.058
 address+0.053
 arrow+0.045
 him+0.043
 fix+0.042
Neg logit contributions
-0.183
uch-0.175
St-0.175
irl-0.175
ink-0.174
Token didn
Token ID1422
Feature activation-1.23e+00

Pos logit contributions
 Tim+0.007
 address+0.006
 arrow+0.006
 inside+0.006
ly+0.006
Neg logit contributions
-0.014
uch-0.013
St-0.013
icer-0.013
irl-0.013
Token't
Token ID470
Feature activation-1.13e+00

Pos logit contributions
+0.021
uch+0.020
St+0.020
P+0.020
irl+0.019
Neg logit contributions
 address-0.045
 Tim-0.044
der-0.043
 cries-0.042
ly-0.042
Token realize
Token ID6537
Feature activation+8.69e+00

Pos logit contributions
+0.038
uch+0.037
St+0.036
irl+0.036
P+0.035
Neg logit contributions
 address-0.025
 Tim-0.023
der-0.022
ly-0.022
 arrow-0.022
Token how
Token ID703
Feature activation-1.73e+00

Pos logit contributions
 address+0.130
person+0.111
ly+0.108
 Acc+0.108
 cries+0.107
Neg logit contributions
-0.334
irl-0.326
St-0.325
if-0.323
uch-0.319
Token tired
Token ID10032
Feature activation+3.42e+00

Pos logit contributions
+0.051
uch+0.049
St+0.049
irl+0.049
ink+0.048
Neg logit contributions
 address-0.044
der-0.040
 Tim-0.039
 arrow-0.039
ly-0.039
Token he
Token ID339
Feature activation+8.89e-01

Pos logit contributions
 address+0.050
 Acc+0.047
person+0.040
 cries+0.040
 apology+0.039
Neg logit contributions
-0.131
uch-0.127
St-0.126
irl-0.125
ink-0.125
Token was
Token ID373
Feature activation-2.39e+00

Pos logit contributions
 address+0.017
 Tim+0.017
ly+0.016
 arrow+0.016
 cries+0.016
Neg logit contributions
-0.032
irl-0.030
St-0.030
P-0.030
uch-0.030
Token getting
Token ID1972
Feature activation-1.18e+00

Pos logit contributions
+0.095
uch+0.094
irl+0.093
ink+0.092
St+0.092
Neg logit contributions
 address-0.033
 Tim-0.030
der-0.029
 Acc-0.028
ly-0.027
Token.
Token ID13
Feature activation+4.62e+00

Pos logit contributions
 address+0.026
 arrow+0.024
 Tim+0.023
 son+0.023
 says+0.022
Neg logit contributions
-0.047
irl-0.045
St-0.045
uch-0.044
ink-0.044
Token Sometimes
Token ID8975
Feature activation+1.68e+00

Pos logit contributions
 address+0.071
 Tim+0.065
der+0.060
 arrow+0.058
 Acc+0.058
Neg logit contributions
-0.182
uch-0.180
irl-0.178
St-0.176
P-0.175
Token we
Token ID356
Feature activation+9.44e-01

Pos logit contributions
 address+0.024
 Tim+0.023
 arrow+0.022
 that+0.021
ly+0.020
Neg logit contributions
-0.072
St-0.068
uch-0.068
P-0.068
irl-0.066
Token don
Token ID836
Feature activation-2.60e+00

Pos logit contributions
 address+0.020
 fix+0.019
 Tim+0.019
 arrow+0.019
 say+0.019
Neg logit contributions
-0.032
P-0.031
St-0.031
uch-0.031
irl-0.030
Token't
Token ID470
Feature activation-1.19e+00

Pos logit contributions
 total+0.053
St+0.053
icer+0.052
P+0.052
uch+0.051
Neg logit contributions
 Tim-0.081
 address-0.081
 that-0.080
 cries-0.079
der-0.078
Token realize
Token ID6537
Feature activation+8.68e+00

Pos logit contributions
+0.049
uch+0.046
irl+0.046
icer+0.046
ink+0.046
Neg logit contributions
 address-0.018
 Tim-0.017
 say-0.017
 arrow-0.017
 fix-0.016
Token how
Token ID703
Feature activation-1.44e+00

Pos logit contributions
 address+0.118
 Tim+0.102
der+0.098
ly+0.096
 arrow+0.095
Neg logit contributions
-0.354
uch-0.348
St-0.344
irl-0.344
ink-0.337
Token much
Token ID881
Feature activation-1.18e+00

Pos logit contributions
+0.047
uch+0.046
St+0.046
irl+0.045
P+0.044
Neg logit contributions
 address-0.032
 Tim-0.029
der-0.028
ly-0.028
 arrow-0.028
Token nicer
Token ID36597
Feature activation+4.28e+00

Pos logit contributions
+0.044
St+0.043
uch+0.042
ink+0.042
P+0.042
Neg logit contributions
 address-0.023
 Tim-0.022
 arrow-0.021
 cries-0.020
der-0.020
Token things
Token ID1243
Feature activation-1.89e+00

Pos logit contributions
 address+0.072
 Tim+0.065
der+0.063
ly+0.061
 arrow+0.061
Neg logit contributions
-0.166
uch-0.161
St-0.159
irl-0.158
P-0.156
Token are
Token ID389
Feature activation-3.90e+00

Pos logit contributions
+0.066
P+0.066
St+0.064
icer+0.063
uch+0.062
Neg logit contributions
 address-0.039
 Tim-0.036
ly-0.036
der-0.036
 that-0.035
Token.
Token ID13
Feature activation+4.16e+00

Pos logit contributions
+0.081
uch+0.078
ink+0.078
St+0.077
irl+0.077
Neg logit contributions
 address-0.027
 Acc-0.025
der-0.024
 Tim-0.024
 cries-0.023
Token But
Token ID887
Feature activation+1.10e+00

Pos logit contributions
 address+0.063
 alien+0.058
der+0.056
 writes+0.054
 arrow+0.054
Neg logit contributions
-0.176
St-0.157
P-0.154
uch-0.151
 total-0.150
Token Sally
Token ID25737
Feature activation+1.55e+00

Pos logit contributions
 address+0.019
 Tim+0.017
der+0.016
ly+0.016
 cries+0.016
Neg logit contributions
-0.042
uch-0.041
St-0.040
irl-0.040
P-0.040
Token didn
Token ID1422
Feature activation-1.34e+00

Pos logit contributions
 address+0.034
 Tim+0.032
der+0.030
ly+0.030
 cries+0.030
Neg logit contributions
-0.051
uch-0.050
St-0.049
irl-0.049
P-0.048
Token't
Token ID470
Feature activation-1.15e+00

Pos logit contributions
P+0.027
uch+0.026
St+0.026
if+0.025
irl+0.025
Neg logit contributions
 Tim-0.044
 address-0.043
 cries-0.042
 that-0.042
 fix-0.041
Token realize
Token ID6537
Feature activation+8.67e+00

Pos logit contributions
+0.044
uch+0.043
irl+0.042
St+0.042
ink+0.042
Neg logit contributions
 address-0.020
 Tim-0.018
ly-0.017
 arrow-0.017
 fix-0.017
Token how
Token ID703
Feature activation-1.51e+00

Pos logit contributions
 address+0.124
person+0.110
ly+0.105
 Acc+0.104
 cries+0.104
Neg logit contributions
-0.341
St-0.332
irl-0.332
uch-0.324
if-0.324
Token costly
Token ID16378
Feature activation+5.41e+00

Pos logit contributions
+0.050
uch+0.048
St+0.047
irl+0.047
P+0.046
Neg logit contributions
 address-0.034
 Tim-0.032
der-0.030
ly-0.030
 cries-0.030
Token the
Token ID262
Feature activation+2.05e+00

Pos logit contributions
 address+0.080
 Tim+0.073
ly+0.067
der+0.067
 arrow+0.066
Neg logit contributions
-0.218
uch-0.214
St-0.210
irl-0.210
P-0.207
Token items
Token ID3709
Feature activation-1.44e+00

Pos logit contributions
 address+0.046
ly+0.041
 arrow+0.040
 Tim+0.040
 cries+0.040
Neg logit contributions
-0.071
uch-0.068
ink-0.065
irl-0.065
St-0.065
Token were
Token ID547
Feature activation-4.22e+00

Pos logit contributions
+0.045
uch+0.044
St+0.043
irl+0.043
P+0.043
Neg logit contributions
 address-0.034
 Tim-0.032
der-0.030
ly-0.030
 arrow-0.030
Token careless
Token ID36138
Feature activation+5.69e+00

Pos logit contributions
+0.116
icer+0.108
P+0.108
uch+0.106
irl+0.104
Neg logit contributions
 Tim-0.046
 address-0.043
 cries-0.042
 old-0.041
 fix-0.039
Token that
Token ID326
Feature activation+4.54e+00

Pos logit contributions
 address+0.096
 Acc+0.082
 arrow+0.076
 apology+0.076
person+0.075
Neg logit contributions
-0.217
uch-0.207
ink-0.207
irl-0.207
if-0.206
Token he
Token ID339
Feature activation+5.16e-01

Pos logit contributions
 address+0.064
ly+0.056
der+0.055
person+0.054
 Acc+0.052
Neg logit contributions
uch-0.178
-0.177
St-0.177
irl-0.176
 total-0.173
Token didn
Token ID1422
Feature activation-1.22e+00

Pos logit contributions
 address+0.011
 Tim+0.011
 cries+0.010
ly+0.010
 arrow+0.010
Neg logit contributions
-0.018
uch-0.017
St-0.017
P-0.016
irl-0.016
Token't
Token ID470
Feature activation-1.11e+00

Pos logit contributions
+0.021
uch+0.020
St+0.020
P+0.020
irl+0.020
Neg logit contributions
 address-0.044
 Tim-0.044
der-0.042
 cries-0.042
 fix-0.041
Token realize
Token ID6537
Feature activation+8.65e+00

Pos logit contributions
+0.037
uch+0.036
St+0.036
irl+0.036
P+0.035
Neg logit contributions
 address-0.024
der-0.021
 Tim-0.021
 Acc-0.021
 cries-0.020
Token it
Token ID340
Feature activation+4.64e-01

Pos logit contributions
 address+0.141
person+0.130
ly+0.122
 writes+0.119
 Acc+0.115
Neg logit contributions
-0.314
St-0.313
if-0.311
irl-0.309
 total-0.301
Token until
Token ID1566
Feature activation+1.44e+00

Pos logit contributions
 address+0.010
 Tim+0.009
der+0.009
 arrow+0.009
 Acc+0.009
Neg logit contributions
-0.015
uch-0.015
St-0.015
irl-0.015
ink-0.014
Token it
Token ID340
Feature activation-1.86e-01

Pos logit contributions
 address+0.023
ly+0.020
der+0.020
person+0.020
 Acc+0.020
Neg logit contributions
-0.056
uch-0.054
irl-0.054
St-0.053
 total-0.052
Token was
Token ID373
Feature activation-4.07e+00

Pos logit contributions
+0.008
P+0.007
irl+0.007
St+0.007
if+0.007
Neg logit contributions
 Tim-0.003
der-0.003
 cries-0.003
 inside-0.003
 address-0.002
Token too
Token ID1165
Feature activation-3.92e+00

Pos logit contributions
+0.133
uch+0.127
ink+0.124
P+0.121
St+0.118
Neg logit contributions
 Tim-0.112
 inside-0.106
 for-0.105
 his-0.100
 about-0.099
Token that
Token ID326
Feature activation+4.46e+00

Pos logit contributions
 Acc+0.070
 address+0.069
 apology+0.065
 decide+0.060
 fixes+0.058
Neg logit contributions
if-0.116
-0.115
uch-0.114
 time-0.112
ink-0.112
Token he
Token ID339
Feature activation+2.94e-01

Pos logit contributions
 address+0.057
der+0.055
ly+0.051
 Acc+0.051
 apology+0.049
Neg logit contributions
irl-0.180
St-0.180
uch-0.178
-0.176
 total-0.174
Token didn
Token ID1422
Feature activation-1.35e+00

Pos logit contributions
 Tim+0.005
 arrow+0.004
ly+0.004
 address+0.004
 cries+0.004
Neg logit contributions
-0.012
icer-0.011
uch-0.011
ink-0.011
P-0.011
Token't
Token ID470
Feature activation-1.03e+00

Pos logit contributions
+0.022
uch+0.021
St+0.021
P+0.021
irl+0.021
Neg logit contributions
 address-0.050
 Tim-0.049
der-0.048
 cries-0.047
 arrow-0.047
Token even
Token ID772
Feature activation+1.39e+00

Pos logit contributions
+0.032
uch+0.031
St+0.031
irl+0.030
P+0.030
Neg logit contributions
 address-0.025
 Acc-0.022
der-0.022
 Tim-0.021
 cries-0.021
Token realize
Token ID6537
Feature activation+8.64e+00

Pos logit contributions
 address+0.035
 Tim+0.032
 Acc+0.032
der+0.032
person+0.031
Neg logit contributions
-0.040
St-0.039
irl-0.039
uch-0.039
P-0.038
Token that
Token ID326
Feature activation+4.55e+00

Pos logit contributions
 address+0.142
person+0.135
 Acc+0.127
ly+0.127
 apology+0.124
Neg logit contributions
irl-0.305
-0.303
if-0.302
St-0.302
 total-0.293
Token he
Token ID339
Feature activation+6.90e-01

Pos logit contributions
 address+0.068
der+0.064
ly+0.063
 Acc+0.063
person+0.060
Neg logit contributions
irl-0.167
St-0.167
-0.166
uch-0.165
 total-0.165
Token was
Token ID373
Feature activation-3.32e+00

Pos logit contributions
 Tim+0.013
 address+0.012
ly+0.012
 cries+0.012
 arrow+0.012
Neg logit contributions
-0.026
P-0.024
St-0.024
uch-0.024
icer-0.024
Token getting
Token ID1972
Feature activation-2.02e+00

Pos logit contributions
+0.127
icer+0.121
uch+0.119
ink+0.117
P+0.116
Neg logit contributions
 inside-0.066
 Tim-0.065
 for-0.063
 about-0.061
 earlier-0.060
Token closer
Token ID5699
Feature activation+4.56e+00

Pos logit contributions
+0.067
uch+0.064
ink+0.061
irl+0.060
icer+0.060
Neg logit contributions
 Tim-0.048
 address-0.047
 inside-0.045
 up-0.043
 fix-0.043
Token fun
Token ID1257
Feature activation+6.81e-02

Pos logit contributions
+0.063
uch+0.061
St+0.060
ink+0.059
icer+0.058
Neg logit contributions
 address-0.057
 Tim-0.056
der-0.054
 arrow-0.054
 cries-0.053
Token that
Token ID326
Feature activation+4.36e+00

Pos logit contributions
 address+0.001
 Tim+0.001
der+0.001
ly+0.001
 arrow+0.001
Neg logit contributions
-0.003
uch-0.003
irl-0.003
St-0.003
P-0.003
Token she
Token ID673
Feature activation+1.30e+00

Pos logit contributions
 address+0.057
 Tim+0.051
der+0.048
ly+0.046
 arrow+0.046
Neg logit contributions
-0.183
uch-0.179
St-0.177
irl-0.176
P-0.174
Token didn
Token ID1422
Feature activation-1.20e+00

Pos logit contributions
 Tim+0.024
 address+0.023
ly+0.023
 arrow+0.023
 cries+0.022
Neg logit contributions
-0.048
icer-0.045
 total-0.044
uch-0.044
St-0.043
Token't
Token ID470
Feature activation-8.79e-01

Pos logit contributions
P+0.022
uch+0.022
+0.022
St+0.022
 total+0.021
Neg logit contributions
 Tim-0.041
der-0.040
 address-0.040
 cries-0.039
 that-0.038
Token realize
Token ID6537
Feature activation+8.63e+00

Pos logit contributions
+0.026
irl+0.026
St+0.026
uch+0.025
P+0.025
Neg logit contributions
 address-0.022
 Acc-0.019
person-0.018
der-0.018
 fixes-0.018
Token just
Token ID655
Feature activation+1.66e+00

Pos logit contributions
 address+0.117
person+0.098
ly+0.092
 arrow+0.090
 writes+0.088
Neg logit contributions
-0.353
irl-0.346
St-0.345
uch-0.339
if-0.335
Token how
Token ID703
Feature activation-1.70e+00

Pos logit contributions
 address+0.036
person+0.034
 Acc+0.031
der+0.031
 writes+0.031
Neg logit contributions
-0.051
irl-0.051
St-0.050
 total-0.049
uch-0.049
Token wet
Token ID9583
Feature activation+4.27e+00

Pos logit contributions
+0.054
uch+0.052
St+0.051
irl+0.051
P+0.050
Neg logit contributions
 address-0.039
 Tim-0.038
der-0.036
 cries-0.036
ly-0.036
Token she
Token ID673
Feature activation+1.58e+00

Pos logit contributions
 address+0.065
 Tim+0.055
der+0.053
 arrow+0.052
ly+0.052
Neg logit contributions
-0.170
uch-0.167
irl-0.164
St-0.164
P-0.162
Token was
Token ID373
Feature activation-2.51e+00

Pos logit contributions
 Tim+0.028
ly+0.027
 address+0.027
 cries+0.027
der+0.026
Neg logit contributions
-0.058
St-0.055
P-0.054
uch-0.054
ink-0.054
Token all
Token ID477
Feature activation-7.25e-01

Pos logit contributions
+0.014
uch+0.013
St+0.013
irl+0.013
P+0.013
Neg logit contributions
 address-0.005
 Tim-0.004
der-0.004
ly-0.004
 arrow-0.004
Token that
Token ID326
Feature activation+3.84e+00

Pos logit contributions
+0.025
irl+0.025
uch+0.025
St+0.024
if+0.024
Neg logit contributions
 address-0.014
der-0.013
 Acc-0.012
person-0.012
 arrow-0.012
Token she
Token ID673
Feature activation+6.07e-01

Pos logit contributions
 address+0.049
 Tim+0.048
 cries+0.041
ly+0.040
 arrow+0.040
Neg logit contributions
-0.162
uch-0.158
St-0.158
irl-0.156
P-0.156
Token didn
Token ID1422
Feature activation-1.32e+00

Pos logit contributions
 address+0.011
 Tim+0.011
ly+0.010
 arrow+0.010
 cries+0.010
Neg logit contributions
-0.023
uch-0.022
St-0.021
icer-0.021
P-0.021
Token't
Token ID470
Feature activation-1.13e+00

Pos logit contributions
+0.022
uch+0.021
St+0.021
irl+0.020
P+0.020
Neg logit contributions
 address-0.050
 Tim-0.048
der-0.047
 cries-0.047
ly-0.046
Token realize
Token ID6537
Feature activation+8.59e+00

Pos logit contributions
+0.037
uch+0.036
St+0.035
irl+0.035
P+0.035
Neg logit contributions
 address-0.024
 Acc-0.022
der-0.022
 Tim-0.022
 cries-0.021
Token that
Token ID326
Feature activation+4.38e+00

Pos logit contributions
 address+0.123
 Tim+0.101
ly+0.101
person+0.100
 arrow+0.100
Neg logit contributions
-0.343
irl-0.333
St-0.332
uch-0.330
ink-0.325
Token the
Token ID262
Feature activation+1.60e+00

Pos logit contributions
 address+0.076
der+0.067
 Tim+0.066
ly+0.066
 Acc+0.065
Neg logit contributions
-0.160
uch-0.156
irl-0.155
St-0.154
P-0.151
Token chamber
Token ID11847
Feature activation-1.53e+00

Pos logit contributions
 address+0.037
 Tim+0.034
der+0.033
ly+0.033
 arrow+0.033
Neg logit contributions
-0.052
uch-0.051
St-0.049
irl-0.049
ink-0.049
Token was
Token ID373
Feature activation-3.87e+00

Pos logit contributions
+0.049
uch+0.048
St+0.047
irl+0.047
P+0.046
Neg logit contributions
 address-0.035
 Tim-0.032
der-0.031
ly-0.031
 arrow-0.031
Token starting
Token ID3599
Feature activation+5.35e-01

Pos logit contributions
+0.135
uch+0.131
P+0.128
St+0.126
irl+0.125
Neg logit contributions
 address-0.086
 Tim-0.083
 inside-0.081
 for-0.079
 earlier-0.078
Token after
Token ID706
Feature activation+1.60e+00

Pos logit contributions
 address+0.049
 Acc+0.046
der+0.043
 fixes+0.041
 writes+0.041
Neg logit contributions
-0.097
St-0.093
P-0.092
irl-0.091
 total-0.091
Token it
Token ID340
Feature activation-4.01e-01

Pos logit contributions
 address+0.025
der+0.021
 Acc+0.021
ly+0.020
person+0.020
Neg logit contributions
-0.063
St-0.061
irl-0.061
uch-0.060
P-0.059
Token and
Token ID290
Feature activation+1.58e+00

Pos logit contributions
+0.016
uch+0.016
St+0.016
irl+0.016
P+0.016
Neg logit contributions
 address-0.006
 Tim-0.005
der-0.005
ly-0.005
 cries-0.005
Token didn
Token ID1422
Feature activation-1.49e+00

Pos logit contributions
 address+0.037
der+0.034
 Acc+0.033
ly+0.033
person+0.032
Neg logit contributions
-0.048
St-0.048
irl-0.047
uch-0.047
P-0.045
Token't
Token ID470
Feature activation-1.82e+00

Pos logit contributions
+0.025
uch+0.024
St+0.023
irl+0.023
P+0.023
Neg logit contributions
 address-0.056
 Tim-0.055
der-0.053
 cries-0.053
 arrow-0.052
Token realize
Token ID6537
Feature activation+8.58e+00

Pos logit contributions
+0.067
uch+0.065
St+0.064
irl+0.064
ink+0.063
Neg logit contributions
 address-0.034
 Tim-0.031
der-0.030
ly-0.030
 arrow-0.029
Token she
Token ID673
Feature activation+1.27e+00

Pos logit contributions
 address+0.130
person+0.114
 Acc+0.108
ly+0.107
 writes+0.105
Neg logit contributions
-0.325
irl-0.318
St-0.317
 total-0.307
if-0.306
Token was
Token ID373
Feature activation-3.26e+00

Pos logit contributions
 address+0.029
 Tim+0.027
der+0.027
ly+0.026
 arrow+0.026
Neg logit contributions
-0.041
uch-0.039
St-0.039
irl-0.039
P-0.038
Token getting
Token ID1972
Feature activation-2.09e+00

Pos logit contributions
+0.117
uch+0.111
St+0.109
P+0.108
ink+0.108
Neg logit contributions
 Tim-0.069
 address-0.066
 inside-0.063
 earlier-0.061
 cries-0.061
Token far
Token ID1290
Feature activation+1.81e-01

Pos logit contributions
+0.067
uch+0.065
St+0.064
irl+0.064
P+0.063
Neg logit contributions
 address-0.048
 Tim-0.044
der-0.043
ly-0.042
 arrow-0.042
Token from
Token ID422
Feature activation-7.88e-01

Pos logit contributions
 address+0.004
 Acc+0.004
person+0.004
 Tim+0.004
 arrow+0.004
Neg logit contributions
-0.005
P-0.005
St-0.005
if-0.005
ink-0.005
Token,
Token ID11
Feature activation-7.95e-01

Pos logit contributions
 address+0.020
 Tim+0.018
der+0.017
ly+0.016
 arrow+0.016
Neg logit contributions
-0.077
uch-0.076
St-0.075
irl-0.075
P-0.074
Token but
Token ID475
Feature activation+8.86e-01

Pos logit contributions
+0.026
uch+0.025
St+0.025
irl+0.025
P+0.024
Neg logit contributions
 address-0.018
der-0.016
 Tim-0.016
ly-0.015
 arrow-0.015
Token she
Token ID673
Feature activation+2.94e-01

Pos logit contributions
 address+0.014
der+0.012
 Tim+0.012
ly+0.012
 cries+0.012
Neg logit contributions
-0.034
uch-0.034
irl-0.033
St-0.033
P-0.033
Token didn
Token ID1422
Feature activation-1.50e+00

Pos logit contributions
 Tim+0.004
 inside+0.004
 that+0.004
 cries+0.004
 he+0.004
Neg logit contributions
-0.012
 total-0.012
St-0.012
br-0.011
P-0.011
Token't
Token ID470
Feature activation-1.42e+00

Pos logit contributions
P+0.029
uch+0.029
St+0.028
+0.028
irl+0.028
Neg logit contributions
 Tim-0.049
 address-0.049
der-0.048
 cries-0.047
 that-0.047
Token realize
Token ID6537
Feature activation+8.57e+00

Pos logit contributions
+0.057
uch+0.055
St+0.054
ink+0.054
P+0.053
Neg logit contributions
 address-0.022
 Tim-0.021
 say-0.020
 fix-0.020
 decide-0.020
Token how
Token ID703
Feature activation-1.75e+00

Pos logit contributions
 address+0.128
person+0.116
ly+0.111
 Acc+0.106
 cries+0.106
Neg logit contributions
-0.329
St-0.326
irl-0.325
if-0.316
uch-0.316
Token far
Token ID1290
Feature activation+6.74e-01

Pos logit contributions
+0.056
uch+0.054
St+0.052
irl+0.052
ink+0.051
Neg logit contributions
 address-0.040
 Tim-0.039
 cries-0.037
ly-0.037
der-0.037
Token she
Token ID673
Feature activation+1.45e+00

Pos logit contributions
 address+0.012
 Acc+0.011
person+0.010
der+0.010
 son+0.010
Neg logit contributions
-0.024
uch-0.024
irl-0.024
St-0.024
P-0.023
Token had
Token ID550
Feature activation+7.67e-02

Pos logit contributions
 Tim+0.025
 address+0.024
ly+0.024
 cries+0.023
 arrow+0.022
Neg logit contributions
-0.055
P-0.052
St-0.052
irl-0.051
uch-0.051
Token gone
Token ID3750
Feature activation+2.26e+00

Pos logit contributions
 address+0.002
 Tim+0.002
 inside+0.002
 that+0.002
 earlier+0.002
Neg logit contributions
-0.003
icer-0.002
uch-0.002
St-0.002
irl-0.002
Token brave
Token ID14802
Feature activation+3.73e+00

Pos logit contributions
+0.149
uch+0.145
St+0.142
irl+0.142
P+0.139
Neg logit contributions
 address-0.108
 Tim-0.101
der-0.097
ly-0.096
 cries-0.096
Token and
Token ID290
Feature activation+6.13e-01

Pos logit contributions
 address+0.076
 Acc+0.073
 apology+0.066
person+0.064
 decide+0.062
Neg logit contributions
-0.122
if-0.120
St-0.119
ink-0.118
 total-0.117
Token he
Token ID339
Feature activation-2.28e-01

Pos logit contributions
 address+0.014
der+0.013
 Acc+0.012
 Tim+0.012
 cries+0.012
Neg logit contributions
-0.019
uch-0.019
St-0.019
irl-0.019
P-0.018
Token didn
Token ID1422
Feature activation-1.49e+00

Pos logit contributions
+0.009
uch+0.008
irl+0.008
St+0.008
P+0.008
Neg logit contributions
 address-0.004
 Tim-0.004
 arrow-0.004
 cries-0.004
ly-0.004
Token't
Token ID470
Feature activation-1.57e+00

Pos logit contributions
+0.026
uch+0.025
P+0.024
St+0.024
irl+0.024
Neg logit contributions
 address-0.054
 Tim-0.053
 cries-0.051
der-0.051
 arrow-0.050
Token realize
Token ID6537
Feature activation+8.53e+00

Pos logit contributions
+0.062
uch+0.060
St+0.059
irl+0.059
ink+0.059
Neg logit contributions
 address-0.025
 Tim-0.023
ly-0.022
 arrow-0.022
 cries-0.021
Token how
Token ID703
Feature activation-1.65e+00

Pos logit contributions
 address+0.121
person+0.114
 Acc+0.103
ly+0.102
 apology+0.100
Neg logit contributions
St-0.327
irl-0.325
-0.324
if-0.323
 total-0.312
Token fierce
Token ID14800
Feature activation+4.84e+00

Pos logit contributions
+0.049
uch+0.049
St+0.048
irl+0.048
P+0.047
Neg logit contributions
 address-0.042
der-0.037
 Tim-0.037
 arrow-0.036
person-0.036
Token the
Token ID262
Feature activation+1.84e+00

Pos logit contributions
 address+0.057
 Tim+0.051
der+0.047
ly+0.046
 arrow+0.046
Neg logit contributions
-0.210
uch-0.206
irl-0.203
St-0.202
P-0.200
Token animals
Token ID4695
Feature activation-1.81e+00

Pos logit contributions
 address+0.033
 cries+0.029
ly+0.029
 arrow+0.028
 son+0.028
Neg logit contributions
-0.073
irl-0.068
uch-0.068
ink-0.068
St-0.066
Token in
Token ID287
Feature activation-1.30e+00

Pos logit contributions
+0.050
uch+0.049
irl+0.048
St+0.048
ink+0.047
Neg logit contributions
 address-0.048
 Tim-0.044
der-0.044
 Acc-0.043
ly-0.043
Token busy
Token ID8179
Feature activation+2.68e+00

Pos logit contributions
+0.097
uch+0.094
St+0.092
irl+0.091
P+0.091
Neg logit contributions
 address-0.062
 Tim-0.059
der-0.055
 cries-0.054
 Acc-0.054
Token that
Token ID326
Feature activation+3.97e+00

Pos logit contributions
 address+0.053
 Acc+0.048
 arrow+0.046
 apology+0.046
 Tim+0.046
Neg logit contributions
-0.092
irl-0.090
ink-0.088
if-0.087
uch-0.087
Token he
Token ID339
Feature activation+1.69e-01

Pos logit contributions
 address+0.038
der+0.030
ly+0.029
 Tim+0.028
 cries+0.028
Neg logit contributions
-0.178
uch-0.176
irl-0.174
St-0.174
P-0.171
Token didn
Token ID1422
Feature activation-1.47e+00

Pos logit contributions
 address+0.003
 Tim+0.003
ly+0.003
 cries+0.003
der+0.003
Neg logit contributions
-0.006
uch-0.006
St-0.006
P-0.006
icer-0.006
Token't
Token ID470
Feature activation-1.16e+00

Pos logit contributions
+0.026
uch+0.025
P+0.025
St+0.025
irl+0.024
Neg logit contributions
 address-0.053
 Tim-0.052
der-0.051
 cries-0.050
 fix-0.049
Token realize
Token ID6537
Feature activation+8.52e+00

Pos logit contributions
+0.038
uch+0.037
St+0.036
irl+0.036
P+0.036
Neg logit contributions
 address-0.026
 Tim-0.023
der-0.023
ly-0.022
 Acc-0.022
Token how
Token ID703
Feature activation-1.74e+00

Pos logit contributions
 address+0.127
person+0.114
ly+0.109
 cries+0.107
 Acc+0.107
Neg logit contributions
-0.323
irl-0.319
St-0.318
if-0.314
 total-0.309
Token hot
Token ID3024
Feature activation+1.87e+00

Pos logit contributions
+0.047
uch+0.045
St+0.045
irl+0.045
ink+0.044
Neg logit contributions
 address-0.049
der-0.044
 Tim-0.044
 arrow-0.044
ly-0.044
Token it
Token ID340
Feature activation+2.27e-01

Pos logit contributions
 address+0.029
 Tim+0.025
der+0.025
 cries+0.024
 Acc+0.024
Neg logit contributions
-0.073
uch-0.072
irl-0.070
St-0.070
P-0.069
Token was
Token ID373
Feature activation-2.73e+00

Pos logit contributions
 Tim+0.004
 address+0.004
ly+0.004
 cries+0.004
der+0.004
Neg logit contributions
-0.009
P-0.008
St-0.008
irl-0.008
uch-0.008
Token inside
Token ID2641
Feature activation+1.98e+00

Pos logit contributions
+0.117
uch+0.114
St+0.114
irl+0.113
P+0.112
Neg logit contributions
 address-0.033
 Tim-0.029
der-0.027
ly-0.026
 arrow-0.026
Token noises
Token ID26782
Feature activation-2.31e+00

Pos logit contributions
 address+0.054
person+0.044
 Tim+0.043
 arrow+0.043
 Acc+0.042
Neg logit contributions
-0.148
irl-0.147
uch-0.145
St-0.144
ink-0.143
Token that
Token ID326
Feature activation+4.02e+00

Pos logit contributions
+0.085
St+0.083
irl+0.082
uch+0.081
ink+0.081
Neg logit contributions
 address-0.041
 Acc-0.036
person-0.033
 apology-0.033
 arrow-0.032
Token he
Token ID339
Feature activation+1.00e-01

Pos logit contributions
 address+0.068
 Tim+0.060
der+0.059
ly+0.058
 arrow+0.057
Neg logit contributions
-0.152
uch-0.149
St-0.147
irl-0.147
P-0.144
Token didn
Token ID1422
Feature activation-1.59e+00

Pos logit contributions
 Tim+0.002
 address+0.002
 cries+0.002
der+0.002
ly+0.002
Neg logit contributions
-0.004
P-0.003
uch-0.003
St-0.003
icer-0.003
Token't
Token ID470
Feature activation-1.48e+00

Pos logit contributions
+0.028
uch+0.027
P+0.027
St+0.027
irl+0.026
Neg logit contributions
 Tim-0.056
 address-0.055
der-0.054
 cries-0.054
 fix-0.053
Token realize
Token ID6537
Feature activation+8.52e+00

Pos logit contributions
+0.051
uch+0.049
St+0.049
irl+0.049
P+0.048
Neg logit contributions
 address-0.031
 Tim-0.028
der-0.027
ly-0.027
 arrow-0.027
Token he
Token ID339
Feature activation+5.28e-01

Pos logit contributions
 address+0.128
person+0.110
ly+0.105
 Acc+0.099
 writes+0.099
Neg logit contributions
-0.330
irl-0.327
St-0.324
if-0.319
uch-0.317
Token was
Token ID373
Feature activation-3.33e+00

Pos logit contributions
 address+0.011
 Tim+0.010
ly+0.010
 cries+0.010
der+0.010
Neg logit contributions
-0.019
P-0.018
St-0.018
irl-0.017
uch-0.017
Token getting
Token ID1972
Feature activation-2.12e+00

Pos logit contributions
+0.122
uch+0.116
ink+0.114
icer+0.113
P+0.113
Neg logit contributions
 Tim-0.066
 address-0.063
 inside-0.062
 for-0.060
 that-0.059
Token very
Token ID845
Feature activation-2.92e+00

Pos logit contributions
+0.069
uch+0.066
ink+0.063
irl+0.063
St+0.063
Neg logit contributions
 address-0.048
 Tim-0.048
 cries-0.045
der-0.044
ly-0.044
Token wet
Token ID9583
Feature activation+4.38e+00

Pos logit contributions
+0.092
uch+0.088
St+0.086
P+0.085
irl+0.085
Neg logit contributions
 address-0.069
 Tim-0.066
der-0.064
 cries-0.064
ly-0.063
Token mistake
Token ID7457
Feature activation+3.23e+00

Pos logit contributions
+0.040
uch+0.039
irl+0.038
St+0.038
P+0.037
Neg logit contributions
 address-0.029
 Tim-0.027
der-0.027
 cries-0.026
ly-0.026
Token.
Token ID13
Feature activation+4.47e+00

Pos logit contributions
 address+0.081
 Acc+0.073
 arrow+0.070
 apology+0.067
 bury+0.066
Neg logit contributions
-0.096
ink-0.091
if-0.090
re-0.087
h-0.086
Token He
Token ID679
Feature activation+5.75e-01

Pos logit contributions
 address+0.043
ly+0.038
 decide+0.034
 arrow+0.034
der+0.034
Neg logit contributions
-0.202
St-0.191
uch-0.186
 total-0.186
ink-0.184
Token didn
Token ID1422
Feature activation-1.60e+00

Pos logit contributions
 address+0.014
 Tim+0.012
der+0.012
 Acc+0.012
ly+0.012
Neg logit contributions
-0.018
uch-0.017
St-0.017
irl-0.017
ink-0.017
Token't
Token ID470
Feature activation-1.19e+00

Pos logit contributions
+0.027
uch+0.027
St+0.026
P+0.026
irl+0.026
Neg logit contributions
 address-0.058
 Tim-0.058
der-0.056
 cries-0.055
 arrow-0.054
Token realize
Token ID6537
Feature activation+8.51e+00

Pos logit contributions
+0.042
uch+0.041
St+0.041
irl+0.040
P+0.040
Neg logit contributions
 address-0.023
 Tim-0.021
der-0.021
ly-0.020
 arrow-0.020
Token there
Token ID612
Feature activation-1.17e-01

Pos logit contributions
 address+0.131
person+0.119
 cries+0.115
ly+0.114
 apology+0.112
Neg logit contributions
-0.317
St-0.312
if-0.312
irl-0.304
 total-0.300
Token was
Token ID373
Feature activation-3.29e+00

Pos logit contributions
+0.004
uch+0.004
St+0.003
irl+0.003
P+0.003
Neg logit contributions
 address-0.003
 Tim-0.003
der-0.003
ly-0.003
 cries-0.003
Token something
Token ID1223
Feature activation-1.00e+00

Pos logit contributions
+0.118
uch+0.114
ink+0.113
irl+0.113
P+0.112
Neg logit contributions
 Tim-0.066
 address-0.066
der-0.060
 arrow-0.057
 son-0.057
Token else
Token ID2073
Feature activation-1.07e+00

Pos logit contributions
+0.033
uch+0.033
St+0.032
irl+0.032
P+0.032
Neg logit contributions
 address-0.022
person-0.019
 Acc-0.018
der-0.018
 arrow-0.018
Token in
Token ID287
Feature activation-1.16e+00

Pos logit contributions
+0.040
uch+0.039
irl+0.038
St+0.038
if+0.038
Neg logit contributions
 address-0.018
person-0.016
der-0.015
 Acc-0.015
 arrow-0.015
Token ducks
Token ID39694
Feature activation-3.94e+00

Pos logit contributions
 old+0.020
 Tim+0.019
 ot+0.018
 inside+0.018
 cries+0.018
Neg logit contributions
-0.031
uch-0.030
St-0.029
ink-0.028
P-0.028
Token that
Token ID326
Feature activation+3.76e+00

Pos logit contributions
+0.153
uch+0.148
ink+0.146
irl+0.145
St+0.144
Neg logit contributions
 address-0.064
 Tim-0.054
der-0.054
 Acc-0.053
 cries-0.052
Token he
Token ID339
Feature activation-2.40e-01

Pos logit contributions
 Tim+0.068
 address+0.063
 arrow+0.056
 him+0.054
 that+0.054
Neg logit contributions
-0.142
uch-0.139
irl-0.138
St-0.136
P-0.134
Token did
Token ID750
Feature activation-7.23e-01

Pos logit contributions
+0.009
uch+0.008
St+0.008
icer+0.008
irl+0.008
Neg logit contributions
 address-0.005
 Tim-0.004
 arrow-0.004
ly-0.004
 inside-0.004
Token not
Token ID407
Feature activation-4.19e-01

Pos logit contributions
+0.028
uch+0.028
St+0.028
P+0.027
irl+0.027
Neg logit contributions
 address-0.011
 Tim-0.010
der-0.010
ly-0.009
 arrow-0.009
Token realize
Token ID6537
Feature activation+8.50e+00

Pos logit contributions
+0.015
uch+0.014
St+0.014
irl+0.014
ink+0.014
Neg logit contributions
 address-0.008
 Tim-0.008
 arrow-0.007
ly-0.007
der-0.007
Token he
Token ID339
Feature activation+2.85e-01

Pos logit contributions
 address+0.116
ly+0.096
 cries+0.095
person+0.094
 arrow+0.093
Neg logit contributions
-0.343
St-0.335
irl-0.335
uch-0.330
if-0.328
Token had
Token ID550
Feature activation-5.53e-01

Pos logit contributions
 address+0.006
 Tim+0.005
ly+0.005
 arrow+0.005
 cries+0.005
Neg logit contributions
-0.010
St-0.010
P-0.009
irl-0.009
uch-0.009
Token left
Token ID1364
Feature activation+1.80e+00

Pos logit contributions
+0.020
uch+0.019
icer+0.019
ink+0.018
irl+0.018
Neg logit contributions
 address-0.012
 Tim-0.011
 that-0.011
 inside-0.011
 for-0.011
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
 address+0.026
 Tim+0.023
der+0.022
ly+0.022
 arrow+0.022
Neg logit contributions
-0.072
uch-0.071
St-0.070
irl-0.070
P-0.069
Token drain
Token ID14782
Feature activation-1.48e+00

Pos logit contributions
 old+0.029
 inside+0.029
 address+0.028
 Tim+0.028
 hospital+0.028
Neg logit contributions
uch-0.043
-0.043
St-0.042
ink-0.042
P-0.042
Token,
Token ID11
Feature activation-8.49e-01

Pos logit contributions
+0.052
uch+0.050
ink+0.049
St+0.049
irl+0.049
Neg logit contributions
 address-0.020
 Tim-0.017
 Acc-0.017
 arrow-0.017
 cries-0.016
Token but
Token ID475
Feature activation+9.47e-01

Pos logit contributions
+0.026
St+0.025
uch+0.025
irl+0.025
P+0.024
Neg logit contributions
 address-0.020
person-0.018
der-0.018
 Tim-0.018
 Acc-0.018
Token she
Token ID673
Feature activation+4.90e-01

Pos logit contributions
 address+0.015
person+0.015
der+0.014
 Acc+0.014
ly+0.014
Neg logit contributions
uch-0.034
St-0.034
-0.034
irl-0.034
 total-0.033
Token didn
Token ID1422
Feature activation-1.63e+00

Pos logit contributions
 address+0.009
 Tim+0.009
der+0.008
 cries+0.008
ly+0.008
Neg logit contributions
-0.018
St-0.017
irl-0.017
uch-0.016
 total-0.016
Token't
Token ID470
Feature activation-1.44e+00

Pos logit contributions
P+0.031
St+0.031
uch+0.030
+0.030
irl+0.030
Neg logit contributions
 Tim-0.054
 address-0.053
der-0.052
 cries-0.051
 that-0.051
Token realize
Token ID6537
Feature activation+8.49e+00

Pos logit contributions
+0.056
uch+0.054
St+0.053
irl+0.053
ink+0.053
Neg logit contributions
 address-0.024
 Tim-0.022
der-0.021
ly-0.020
 arrow-0.020
Token that
Token ID326
Feature activation+4.42e+00

Pos logit contributions
 address+0.134
person+0.133
ly+0.122
 alien+0.119
 arrow+0.117
Neg logit contributions
-0.313
St-0.310
irl-0.306
if-0.303
 total-0.293
Token it
Token ID340
Feature activation+1.40e-01

Pos logit contributions
person+0.079
 address+0.078
ly+0.077
der+0.071
old+0.070
Neg logit contributions
-0.152
St-0.151
uch-0.148
irl-0.148
 total-0.147
Token would
Token ID561
Feature activation+3.89e-01

Pos logit contributions
 Tim+0.003
 address+0.002
 cries+0.002
ly+0.002
der+0.002
Neg logit contributions
-0.005
irl-0.005
P-0.005
St-0.005
uch-0.005
Token make
Token ID787
Feature activation+2.00e+00

Pos logit contributions
 Tim+0.007
 address+0.007
 say+0.007
 fix+0.007
 cries+0.006
Neg logit contributions
P-0.014
-0.014
irl-0.014
ink-0.014
St-0.014
Token her
Token ID607
Feature activation+1.08e+00

Pos logit contributions
 address+0.020
 Tim+0.019
 cries+0.016
der+0.015
ly+0.015
Neg logit contributions
-0.092
uch-0.089
St-0.087
P-0.086
irl-0.086
Token
Token ID198
Feature activation-8.75e+00

Pos logit contributions
 address+0.049
 Tim+0.045
der+0.040
 cries+0.038
 Acc+0.038
Neg logit contributions
-0.152
uch-0.151
irl-0.149
St-0.147
ink-0.146
Token
Token ID198
Feature activation-8.20e+00

Pos logit contributions
+0.243
uch+0.234
St+0.234
irl+0.230
P+0.227
Neg logit contributions
 address-0.236
der-0.221
ly-0.220
 Tim-0.217
 arrow-0.214
TokenBut
Token ID1537
Feature activation+8.59e-01

Pos logit contributions
+0.212
uch+0.202
irl+0.199
St+0.194
ink+0.193
Neg logit contributions
 address-0.243
 Tim-0.230
 arrow-0.222
der-0.217
 cries-0.217
Token the
Token ID262
Feature activation+6.62e-01

Pos logit contributions
 address+0.012
 Tim+0.010
der+0.010
ly+0.010
 cries+0.009
Neg logit contributions
-0.036
uch-0.035
St-0.034
irl-0.034
P-0.034
Token ost
Token ID23619
Feature activation-7.16e+00

Pos logit contributions
 address+0.012
 cries+0.010
 say+0.010
 says+0.010
 hospital+0.010
Neg logit contributions
-0.027
irl-0.025
uch-0.025
P-0.024
St-0.024
Tokenric
Token ID1173
Feature activation-1.26e+01

Pos logit contributions
P+0.187
+0.175
St+0.175
irl+0.170
br+0.165
Neg logit contributions
der-0.202
 address-0.198
 Acc-0.194
 Tim-0.192
 cries-0.188
Tokenhes
Token ID956
Feature activation-4.07e-01

Pos logit contributions
+0.400
 total+0.344
St+0.342
irl+0.341
uch+0.335
Neg logit contributions
 address-0.314
der-0.308
 about-0.296
 Tim-0.294
 that-0.294
Token did
Token ID750
Feature activation-1.39e+00

Pos logit contributions
+0.014
uch+0.013
St+0.013
irl+0.013
P+0.013
Neg logit contributions
 address-0.009
 Tim-0.008
der-0.008
ly-0.008
 cries-0.008
Token not
Token ID407
Feature activation-9.42e-01

Pos logit contributions
+0.046
P+0.045
uch+0.045
ink+0.044
icer+0.043
Neg logit contributions
 address-0.031
 Tim-0.030
der-0.028
 cries-0.028
 that-0.028
Token like
Token ID588
Feature activation+1.67e+00

Pos logit contributions
+0.039
ink+0.037
uch+0.037
icer+0.036
P+0.036
Neg logit contributions
 address-0.015
 say-0.014
 Tim-0.014
 fix-0.014
ly-0.013
Token that
Token ID326
Feature activation+4.24e+00

Pos logit contributions
 address+0.024
 Tim+0.021
der+0.020
ly+0.020
 arrow+0.019
Neg logit contributions
-0.068
uch-0.066
St-0.066
irl-0.065
P-0.065
Token cried
Token ID16896
Feature activation-3.17e-01

Pos logit contributions
 address+0.015
 Tim+0.014
der+0.013
 Acc+0.013
 arrow+0.013
Neg logit contributions
-0.030
uch-0.029
St-0.029
irl-0.029
P-0.028
Token.
Token ID13
Feature activation+4.12e+00

Pos logit contributions
St+0.010
+0.009
 time+0.009
P+0.009
Once+0.009
Neg logit contributions
 address-0.008
person-0.007
 disl-0.007
grand-0.007
 Acc-0.007
Token "
Token ID366
Feature activation-1.43e+00

Pos logit contributions
 address+0.055
 Tim+0.049
der+0.046
ly+0.044
 cries+0.044
Neg logit contributions
-0.172
uch-0.170
irl-0.167
St-0.167
P-0.164
TokenThe
Token ID464
Feature activation+1.16e+00

Pos logit contributions
+0.040
irl+0.038
ink+0.037
uch+0.037
 total+0.036
Neg logit contributions
 Tim-0.037
 address-0.037
 cries-0.036
 arrow-0.035
der-0.035
Token ost
Token ID23619
Feature activation-6.45e+00

Pos logit contributions
 address+0.023
 Tim+0.021
der+0.020
 arrow+0.020
ly+0.020
Neg logit contributions
-0.041
uch-0.040
St-0.039
irl-0.039
ink-0.039
Tokenric
Token ID1173
Feature activation-1.25e+01

Pos logit contributions
+0.144
P+0.142
St+0.140
Once+0.132
irl+0.130
Neg logit contributions
 address-0.198
der-0.198
 cries-0.197
 Tim-0.195
 says-0.190
Tokenhes
Token ID956
Feature activation-1.07e-01

Pos logit contributions
+0.397
 total+0.340
St+0.332
uch+0.331
irl+0.331
Neg logit contributions
 address-0.313
der-0.308
 cries-0.300
 says-0.297
 that-0.296
Token are
Token ID389
Feature activation-4.83e+00

Pos logit contributions
+0.004
St+0.003
irl+0.003
uch+0.003
P+0.003
Neg logit contributions
 address-0.002
 Tim-0.002
 cries-0.002
ly-0.002
der-0.002
Token mean
Token ID1612
Feature activation+4.87e+00

Pos logit contributions
+0.196
icer+0.174
P+0.173
ink+0.170
uch+0.168
Neg logit contributions
 for-0.097
 that-0.091
 Tim-0.091
 cries-0.091
 address-0.089
Token!
Token ID0
Feature activation+4.90e+00

Pos logit contributions
 address+0.114
 Acc+0.103
 Tim+0.101
person+0.099
 arrow+0.097
Neg logit contributions
uch-0.149
-0.145
P-0.144
St-0.144
ink-0.142
Token They
Token ID1119
Feature activation+5.78e-01

Pos logit contributions
 address+0.068
 arrow+0.059
ly+0.059
der+0.054
person+0.053
Neg logit contributions
-0.201
St-0.195
uch-0.194
ink-0.192
P-0.187
Token.
Token ID13
Feature activation+4.26e+00

Pos logit contributions
+0.081
uch+0.079
St+0.078
irl+0.078
ink+0.077
Neg logit contributions
 address-0.023
 Tim-0.020
der-0.018
 Acc-0.018
 arrow-0.018
Token They
Token ID1119
Feature activation-6.04e-02

Pos logit contributions
 address+0.048
ly+0.040
 arrow+0.039
der+0.038
 cries+0.037
Neg logit contributions
-0.187
St-0.179
irl-0.174
P-0.173
ink-0.172
Token were
Token ID547
Feature activation-6.09e+00

Pos logit contributions
+0.002
uch+0.002
P+0.002
if+0.002
icer+0.002
Neg logit contributions
 address-0.001
 Tim-0.001
 arrow-0.001
 say-0.001
 says-0.001
Token called
Token ID1444
Feature activation-1.45e+00

Pos logit contributions
+0.224
uch+0.212
P+0.204
St+0.198
icer+0.195
Neg logit contributions
 address-0.132
 Tim-0.132
 for-0.121
 inside-0.119
 about-0.118
Token ost
Token ID23619
Feature activation-6.64e+00

Pos logit contributions
+0.055
uch+0.054
irl+0.053
St+0.053
P+0.053
Neg logit contributions
 address-0.025
 Tim-0.023
der-0.021
 arrow-0.021
ly-0.021
Tokenric
Token ID1173
Feature activation-1.25e+01

Pos logit contributions
P+0.158
+0.153
irl+0.146
St+0.143
uch+0.132
Neg logit contributions
 address-0.210
der-0.202
 Tim-0.202
 cries-0.199
ly-0.194
Tokenhes
Token ID956
Feature activation-8.18e-01

Pos logit contributions
+0.410
uch+0.349
irl+0.341
St+0.340
P+0.334
Neg logit contributions
 address-0.306
 inside-0.295
 about-0.293
der-0.293
 that-0.288
Token.
Token ID13
Feature activation+3.76e+00

Pos logit contributions
+0.033
uch+0.032
irl+0.031
St+0.031
P+0.031
Neg logit contributions
 address-0.012
 Tim-0.011
der-0.011
ly-0.011
 cries-0.010
Token Lily
Token ID20037
Feature activation+1.26e+00

Pos logit contributions
 Tim+0.042
 address+0.039
der+0.029
 son+0.028
 He+0.028
Neg logit contributions
uch-0.173
irl-0.168
-0.166
P-0.163
ink-0.162
Token and
Token ID290
Feature activation+1.22e-01

Pos logit contributions
 address+0.018
 Tim+0.017
 cries+0.016
 arrow+0.015
ly+0.015
Neg logit contributions
-0.052
uch-0.050
irl-0.049
St-0.049
P-0.049
Token Ben
Token ID3932
Feature activation+9.50e-01

Pos logit contributions
 address+0.002
ly+0.002
der+0.002
 Acc+0.001
person+0.001
Neg logit contributions
-0.005
St-0.005
uch-0.005
irl-0.005
P-0.005
Token shouted
Token ID17293
Feature activation-4.20e-01

Pos logit contributions
 address+0.008
 Tim+0.007
der+0.007
 Acc+0.007
ly+0.007
Neg logit contributions
-0.021
uch-0.020
St-0.020
irl-0.020
P-0.020
Token.
Token ID13
Feature activation+4.02e+00

Pos logit contributions
+0.012
 time+0.011
St+0.011
re+0.010
ink+0.010
Neg logit contributions
 address-0.011
 writes-0.010
 Acc-0.010
 arrow-0.010
grand-0.009
Token "
Token ID366
Feature activation-1.52e+00

Pos logit contributions
 address+0.072
ly+0.063
der+0.063
 Tim+0.063
 arrow+0.062
Neg logit contributions
-0.149
St-0.144
uch-0.140
P-0.140
irl-0.139
TokenThe
Token ID464
Feature activation+1.06e+00

Pos logit contributions
+0.044
uch+0.043
irl+0.042
ink+0.042
icer+0.039
Neg logit contributions
 address-0.040
 Tim-0.039
 arrow-0.037
der-0.036
 Let-0.036
Token ost
Token ID23619
Feature activation-6.73e+00

Pos logit contributions
 address+0.019
 Tim+0.018
der+0.017
ly+0.017
 cries+0.016
Neg logit contributions
-0.039
uch-0.038
St-0.038
irl-0.038
P-0.037
Tokenric
Token ID1173
Feature activation-1.24e+01

Pos logit contributions
P+0.174
St+0.163
+0.160
irl+0.158
br+0.146
Neg logit contributions
 address-0.191
 Tim-0.188
der-0.186
 elephant-0.182
 Acc-0.179
Tokenhes
Token ID956
Feature activation-1.55e-01

Pos logit contributions
+0.395
St+0.335
 total+0.333
uch+0.330
irl+0.328
Neg logit contributions
 address-0.314
 says-0.298
 about-0.294
der-0.294
 cries-0.292
Token are
Token ID389
Feature activation-4.93e+00

Pos logit contributions
+0.005
St+0.005
P+0.005
uch+0.005
irl+0.005
Neg logit contributions
 address-0.003
 Tim-0.003
 cries-0.003
ly-0.003
 says-0.003
Token so
Token ID523
Feature activation-2.59e+00

Pos logit contributions
+0.193
P+0.178
uch+0.174
icer+0.170
ink+0.165
Neg logit contributions
 address-0.105
 for-0.104
 Tim-0.101
 that-0.096
 arrow-0.096
Token funny
Token ID8258
Feature activation+3.23e+00

Pos logit contributions
+0.094
icer+0.083
uch+0.083
P+0.081
St+0.078
Neg logit contributions
 Tim-0.058
 that-0.056
 cries-0.054
 old-0.053
 address-0.053
Token!"
Token ID2474
Feature activation+1.70e+00

Pos logit contributions
 address+0.066
 Tim+0.062
der+0.060
ly+0.059
 arrow+0.058
Neg logit contributions
-0.112
uch-0.109
St-0.107
irl-0.106
P-0.105
Token ostr
Token ID42387
Feature activation-4.71e+00

Pos logit contributions
 address+0.042
 Tim+0.040
der+0.040
ly+0.039
 Acc+0.038
Neg logit contributions
-0.049
uch-0.047
St-0.047
P-0.046
irl-0.046
Tokenich
Token ID488
Feature activation+4.37e-01

Pos logit contributions
St+0.122
irl+0.122
+0.120
P+0.116
 total+0.112
Neg logit contributions
 address-0.129
der-0.125
 Tim-0.124
 cries-0.121
 arrow-0.118
Token pen
Token ID3112
Feature activation+3.92e-01

Pos logit contributions
 address+0.008
der+0.008
person+0.007
 arrow+0.007
 Acc+0.007
Neg logit contributions
-0.016
uch-0.015
ink-0.015
St-0.014
irl-0.014
Token.
Token ID13
Feature activation+4.27e+00

Pos logit contributions
 address+0.005
 Tim+0.005
 Acc+0.004
der+0.004
 cries+0.004
Neg logit contributions
-0.016
uch-0.016
St-0.015
irl-0.015
P-0.015
Token Ost
Token ID38919
Feature activation-3.11e+00

Pos logit contributions
ly+0.056
 address+0.053
der+0.051
fully+0.049
 arrow+0.048
Neg logit contributions
-0.176
St-0.168
 total-0.163
ink-0.161
irl-0.158
Tokenric
Token ID1173
Feature activation-1.23e+01

Pos logit contributions
irl+0.066
br+0.065
+0.064
P+0.062
uch+0.060
Neg logit contributions
 address-0.105
 Tim-0.101
 earlier-0.097
 him-0.096
 Sharing-0.095
Tokenhes
Token ID956
Feature activation-9.73e-01

Pos logit contributions
+0.399
St+0.339
uch+0.337
irl+0.337
 total+0.332
Neg logit contributions
 address-0.303
 about-0.296
 Tim-0.290
 that-0.286
der-0.286
Token are
Token ID389
Feature activation-6.08e+00

Pos logit contributions
+0.032
uch+0.031
St+0.031
irl+0.031
P+0.030
Neg logit contributions
 address-0.021
 Tim-0.020
der-0.019
ly-0.019
 cries-0.019
Token big
Token ID1263
Feature activation+1.46e+00

Pos logit contributions
+0.217
uch+0.201
P+0.196
irl+0.193
St+0.188
Neg logit contributions
 address-0.143
 Tim-0.135
 for-0.128
 arrow-0.127
 inside-0.123
Token birds
Token ID10087
Feature activation-3.85e+00

Pos logit contributions
 address+0.030
person+0.029
 Acc+0.028
fully+0.027
 apology+0.026
Neg logit contributions
ink-0.047
-0.046
St-0.046
icer-0.046
 total-0.045
Token with
Token ID351
Feature activation-1.47e+00

Pos logit contributions
+0.166
uch+0.162
St+0.161
irl+0.160
P+0.159
Neg logit contributions
 address-0.045
 Tim-0.041
der-0.037
 cries-0.037
ly-0.036
Token ostr
Token ID42387
Feature activation-4.67e+00

Pos logit contributions
 address+0.040
der+0.038
ly+0.038
 Tim+0.038
 Acc+0.037
Neg logit contributions
-0.045
St-0.044
uch-0.044
P-0.042
irl-0.042
Tokenich
Token ID488
Feature activation+4.58e-01

Pos logit contributions
St+0.119
irl+0.119
+0.118
P+0.114
 total+0.110
Neg logit contributions
 address-0.129
der-0.125
 Tim-0.125
 cries-0.121
 arrow-0.118
Token pen
Token ID3112
Feature activation+2.84e-01

Pos logit contributions
 address+0.009
 Acc+0.008
der+0.008
person+0.008
 arrow+0.008
Neg logit contributions
-0.016
ink-0.015
uch-0.015
St-0.015
irl-0.015
Token.
Token ID13
Feature activation+4.14e+00

Pos logit contributions
 address+0.004
 Acc+0.004
 Tim+0.004
 cries+0.004
der+0.003
Neg logit contributions
-0.011
uch-0.011
ink-0.011
irl-0.011
P-0.011
Token Ost
Token ID38919
Feature activation-3.08e+00

Pos logit contributions
ly+0.050
 address+0.047
der+0.047
 cries+0.042
fully+0.042
Neg logit contributions
-0.174
St-0.169
 total-0.163
ink-0.161
irl-0.161
Tokenric
Token ID1173
Feature activation-1.23e+01

Pos logit contributions
irl+0.065
br+0.063
+0.063
P+0.061
St+0.058
Neg logit contributions
 address-0.104
 Tim-0.099
 him-0.095
 earlier-0.095
 say-0.094
Tokenhes
Token ID956
Feature activation-9.49e-01

Pos logit contributions
+0.403
uch+0.336
St+0.336
 total+0.333
irl+0.333
Neg logit contributions
 address-0.301
 about-0.293
 says-0.288
 Tim-0.285
 that-0.285
Token are
Token ID389
Feature activation-5.94e+00

Pos logit contributions
+0.033
uch+0.032
St+0.031
P+0.031
irl+0.031
Neg logit contributions
 address-0.019
 Tim-0.018
 cries-0.017
der-0.017
ly-0.017
Token big
Token ID1263
Feature activation+1.45e+00

Pos logit contributions
+0.217
uch+0.197
P+0.193
irl+0.189
St+0.183
Neg logit contributions
 address-0.140
 Tim-0.130
 for-0.128
 arrow-0.127
 inside-0.122
Token birds
Token ID10087
Feature activation-3.87e+00

Pos logit contributions
 address+0.029
person+0.029
 Acc+0.028
fully+0.026
 Tim+0.026
Neg logit contributions
ink-0.046
St-0.046
-0.045
icer-0.045
 total-0.045
Token with
Token ID351
Feature activation-1.42e+00

Pos logit contributions
+0.176
uch+0.171
St+0.170
irl+0.169
P+0.168
Neg logit contributions
 address-0.037
 Tim-0.032
 cries-0.029
der-0.029
ly-0.028
Token a
Token ID257
Feature activation-1.83e+00

Pos logit contributions
uch+0.031
+0.031
irl+0.030
St+0.030
ink+0.030
Neg logit contributions
 address-0.013
 Tim-0.012
 arrow-0.011
 cries-0.010
 son-0.010
Token big
Token ID1263
Feature activation+1.34e+00

Pos logit contributions
uch+0.061
irl+0.060
ink+0.060
+0.059
P+0.058
Neg logit contributions
 address-0.041
 arrow-0.039
 son-0.038
 fix-0.037
 say-0.036
Token box
Token ID3091
Feature activation-3.46e+00

Pos logit contributions
 address+0.028
 Tim+0.026
 arrow+0.025
ly+0.025
 cries+0.025
Neg logit contributions
-0.045
uch-0.045
St-0.044
irl-0.044
P-0.043
Token of
Token ID286
Feature activation-1.02e+00

Pos logit contributions
+0.116
ink+0.111
uch+0.111
St+0.108
P+0.107
Neg logit contributions
 address-0.072
 Acc-0.069
 Tim-0.065
 cries-0.064
der-0.062
Token ch
Token ID442
Feature activation-3.22e+00

Pos logit contributions
+0.030
St+0.029
irl+0.029
 total+0.029
P+0.029
Neg logit contributions
der-0.023
 address-0.022
person-0.021
 Acc-0.021
fully-0.021
Tokenocol
Token ID4668
Feature activation-1.23e+01

Pos logit contributions
+0.079
uch+0.077
St+0.074
 total+0.068
if+0.068
Neg logit contributions
ly-0.101
 address-0.098
der-0.097
 cries-0.096
old-0.095
Tokenates
Token ID689
Feature activation-3.75e+00

Pos logit contributions
+0.271
irl+0.270
St+0.268
 total+0.253
Uh+0.239
Neg logit contributions
 address-0.368
fully-0.358
 inside-0.351
der-0.350
ly-0.341
Token!
Token ID0
Feature activation+3.42e+00

Pos logit contributions
+0.141
uch+0.137
ink+0.136
irl+0.132
P+0.132
Neg logit contributions
 address-0.062
 Acc-0.059
 Tim-0.055
 cries-0.053
 arrow-0.052
Token<|endoftext|>
Token ID50256
Feature activation-5.26e+00

Pos logit contributions
 address+0.079
 Tim+0.073
der+0.071
ly+0.070
 cries+0.069
Neg logit contributions
-0.109
uch-0.107
irl-0.105
St-0.105
P-0.103
TokenOnce
Token ID7454
Feature activation-6.70e+00

Pos logit contributions
uch+0.155
irl+0.153
St+0.141
+0.138
 total+0.138
Neg logit contributions
 Tim-0.143
 He-0.137
 inside-0.128
 The-0.126
 Let-0.126
Token upon
Token ID2402
Feature activation-4.57e+00

Pos logit contributions
+0.081
uch+0.075
irl+0.072
St+0.071
ink+0.067
Neg logit contributions
 address-0.286
 Tim-0.271
der-0.271
 arrow-0.268
ly-0.268
Token and
Token ID290
Feature activation+1.38e+00

Pos logit contributions
+0.095
irl+0.093
St+0.092
ink+0.088
uch+0.087
Neg logit contributions
 address-0.064
person-0.060
ly-0.057
der-0.054
 arrow-0.053
Token looked
Token ID3114
Feature activation-8.16e-01

Pos logit contributions
 address+0.036
 Tim+0.034
der+0.034
 arrow+0.033
ly+0.033
Neg logit contributions
-0.039
St-0.038
uch-0.038
irl-0.037
P-0.036
Token at
Token ID379
Feature activation+5.71e-01

Pos logit contributions
irl+0.023
St+0.022
ink+0.022
P+0.021
 time+0.021
Neg logit contributions
 address-0.020
person-0.020
 Acc-0.018
grand-0.017
 writes-0.017
Token the
Token ID262
Feature activation+1.87e+00

Pos logit contributions
 address+0.009
ly+0.009
person+0.009
der+0.008
 writes+0.008
Neg logit contributions
-0.022
St-0.021
irl-0.021
 total-0.020
uch-0.019
Token ost
Token ID23619
Feature activation-6.01e+00

Pos logit contributions
 address+0.038
 cries+0.034
 says+0.034
 old+0.033
 say+0.033
Neg logit contributions
-0.072
irl-0.068
P-0.067
uch-0.067
St-0.063
Tokenric
Token ID1173
Feature activation-1.21e+01

Pos logit contributions
P+0.146
+0.138
St+0.134
irl+0.134
br+0.125
Neg logit contributions
 address-0.182
der-0.181
 cries-0.176
 Tim-0.172
 Acc-0.169
Tokenhes
Token ID956
Feature activation+2.86e-02

Pos logit contributions
+0.356
uch+0.312
St+0.311
 total+0.309
irl+0.305
Neg logit contributions
 address-0.325
der-0.305
 Tim-0.302
 cries-0.300
 says-0.297
Token.
Token ID13
Feature activation+4.28e+00

Pos logit contributions
 address+0.000
 Tim+0.000
der+0.000
 Acc+0.000
 arrow+0.000
Neg logit contributions
-0.001
uch-0.001
St-0.001
irl-0.001
ink-0.001
Token They
Token ID1119
Feature activation-1.45e-01

Pos logit contributions
 arrow+0.062
ly+0.061
fully+0.058
old+0.058
person+0.058
Neg logit contributions
-0.173
St-0.160
 total-0.153
ink-0.147
icer-0.146
Token saw
Token ID2497
Feature activation+2.11e+00

Pos logit contributions
+0.006
uch+0.006
if+0.005
irl+0.005
P+0.005
Neg logit contributions
 Tim-0.003
 address-0.003
 say-0.002
 arrow-0.002
 says-0.002
Token one
Token ID530
Feature activation-1.65e+00

Pos logit contributions
 address+0.045
person+0.044
ly+0.042
 Acc+0.042
 writes+0.040
Neg logit contributions
St-0.067
-0.065
 total-0.062
uch-0.062
irl-0.062
TokenL
Token ID43
Feature activation-3.55e+00

Pos logit contributions
+0.019
ink+0.018
uch+0.018
irl+0.018
icer+0.017
Neg logit contributions
 address-0.018
 Tim-0.017
der-0.017
 writes-0.017
 says-0.016
Tokenily
Token ID813
Feature activation+2.52e+00

Pos logit contributions
+0.152
uch+0.148
St+0.147
irl+0.146
P+0.145
Neg logit contributions
 address-0.044
 Tim-0.038
der-0.036
ly-0.035
 arrow-0.034
Token,
Token ID11
Feature activation-5.40e-01

Pos logit contributions
 address+0.044
 Tim+0.040
der+0.038
ly+0.037
 cries+0.037
Neg logit contributions
-0.094
uch-0.092
St-0.091
irl-0.091
P-0.090
Token these
Token ID777
Feature activation-8.01e-01

Pos logit contributions
+0.021
uch+0.020
St+0.020
irl+0.020
ink+0.020
Neg logit contributions
 address-0.009
 Tim-0.009
der-0.008
 arrow-0.008
 cries-0.008
Token ch
Token ID442
Feature activation-4.00e+00

Pos logit contributions
+0.028
uch+0.026
P+0.026
irl+0.026
St+0.026
Neg logit contributions
 address-0.017
 Tim-0.016
 cries-0.016
ly-0.016
 says-0.015
Tokenocol
Token ID4668
Feature activation-1.21e+01

Pos logit contributions
+0.109
Once+0.107
St+0.106
uch+0.106
 total+0.102
Neg logit contributions
der-0.110
ly-0.109
 address-0.103
 cries-0.103
 apology-0.099
Tokenates
Token ID689
Feature activation-4.34e+00

Pos logit contributions
+0.287
irl+0.262
 sudden+0.256
Once+0.244
St+0.240
Neg logit contributions
 address-0.368
 writes-0.356
 him-0.351
fully-0.350
der-0.343
Token are
Token ID389
Feature activation-5.60e+00

Pos logit contributions
+0.152
St+0.149
P+0.145
irl+0.145
uch+0.144
Neg logit contributions
 address-0.085
 Tim-0.081
ly-0.077
 cries-0.077
der-0.077
Token bitter
Token ID12922
Feature activation+3.22e+00

Pos logit contributions
+0.205
P+0.187
uch+0.187
irl+0.182
St+0.181
Neg logit contributions
 Tim-0.122
 address-0.121
 for-0.118
 that-0.112
 cries-0.107
Token,"
Token ID553
Feature activation-8.50e-01

Pos logit contributions
 address+0.063
 Acc+0.058
person+0.055
 arrow+0.054
 Tim+0.053
Neg logit contributions
-0.111
uch-0.110
ink-0.107
P-0.106
if-0.104
Token he
Token ID339
Feature activation-4.97e-01

Pos logit contributions
P+0.026
St+0.025
Once+0.025
+0.025
irl+0.025
Neg logit contributions
person-0.020
 address-0.019
 automobile-0.018
fully-0.017
itor-0.017
Token too
Token ID1165
Feature activation-2.74e+00

Pos logit contributions
+0.076
St+0.074
irl+0.074
uch+0.073
ink+0.072
Neg logit contributions
 Tim-0.037
 inside-0.035
 that-0.034
 for-0.034
ly-0.034
Token close
Token ID1969
Feature activation+3.82e+00

Pos logit contributions
+0.082
uch+0.074
ink+0.074
St+0.074
irl+0.073
Neg logit contributions
 Tim-0.074
 address-0.070
 arrow-0.069
 that-0.068
 cries-0.068
Token.
Token ID13
Feature activation+4.99e+00

Pos logit contributions
 address+0.052
 Tim+0.046
der+0.044
ly+0.042
 arrow+0.042
Neg logit contributions
-0.158
uch-0.154
St-0.152
irl-0.152
P-0.150
Token The
Token ID383
Feature activation+1.95e+00

Pos logit contributions
 address+0.074
 Tim+0.065
der+0.064
ly+0.062
 arrow+0.062
Neg logit contributions
-0.199
uch-0.194
St-0.192
irl-0.191
P-0.188
Token ost
Token ID23619
Feature activation-6.17e+00

Pos logit contributions
 address+0.035
 old+0.029
 arrow+0.029
 hospital+0.029
 for+0.029
Neg logit contributions
-0.077
irl-0.076
uch-0.074
P-0.074
St-0.073
Tokenric
Token ID1173
Feature activation-1.20e+01

Pos logit contributions
P+0.159
+0.147
St+0.146
irl+0.142
Once+0.133
Neg logit contributions
 address-0.187
der-0.182
 cries-0.177
 for-0.175
 says-0.175
Tokenhes
Token ID956
Feature activation-1.61e-02

Pos logit contributions
+0.399
St+0.333
irl+0.331
 total+0.328
uch+0.326
Neg logit contributions
 address-0.291
 about-0.280
 says-0.276
 that-0.274
 for-0.272
Token might
Token ID1244
Feature activation-7.72e-01

Pos logit contributions
+0.001
St+0.001
P+0.001
irl+0.001
uch+0.001
Neg logit contributions
 address-0.000
 Tim-0.000
ly-0.000
 cries-0.000
 says-0.000
Token pe
Token ID613
Feature activation-6.14e+00

Pos logit contributions
+0.032
P+0.031
icer+0.030
St+0.029
ink+0.029
Neg logit contributions
 say-0.014
 fix-0.013
 for-0.013
 that-0.012
 address-0.011
Tokenck
Token ID694
Feature activation+1.08e+00

Pos logit contributions
+0.211
P+0.177
Once+0.175
 total+0.169
St+0.167
Neg logit contributions
 address-0.153
 cries-0.148
 up-0.148
 for-0.148
 say-0.145
Token you
Token ID345
Feature activation-1.33e+00

Pos logit contributions
 address+0.017
ly+0.016
 Tim+0.016
der+0.016
 cries+0.015
Neg logit contributions
-0.044
uch-0.041
irl-0.041
St-0.041
P-0.040
Token a
Token ID257
Feature activation-1.36e+00

Pos logit contributions
 address+0.037
ly+0.035
der+0.035
 cries+0.035
 Acc+0.035
Neg logit contributions
-0.051
St-0.049
uch-0.048
irl-0.047
 total-0.047
Token big
Token ID1263
Feature activation+1.75e+00

Pos logit contributions
+0.043
uch+0.043
irl+0.042
ink+0.042
St+0.041
Neg logit contributions
 address-0.032
 arrow-0.029
 Tim-0.029
 son-0.028
 fix-0.027
Token box
Token ID3091
Feature activation-2.85e+00

Pos logit contributions
 address+0.041
der+0.039
 Acc+0.038
 Tim+0.037
ly+0.037
Neg logit contributions
-0.054
uch-0.051
 total-0.050
irl-0.050
St-0.049
Token of
Token ID286
Feature activation-8.32e-01

Pos logit contributions
+0.091
ink+0.086
uch+0.086
irl+0.083
 total+0.082
Neg logit contributions
 address-0.066
 Acc-0.062
der-0.061
 cries-0.060
 apology-0.059
Token ch
Token ID442
Feature activation-2.90e+00

Pos logit contributions
 total+0.024
St+0.024
P+0.024
+0.024
irl+0.023
Neg logit contributions
der-0.019
 address-0.018
 Acc-0.018
person-0.017
ly-0.017
Tokenocol
Token ID4668
Feature activation-1.20e+01

Pos logit contributions
+0.072
uch+0.069
St+0.067
if+0.063
irl+0.063
Neg logit contributions
 address-0.088
ly-0.088
 arrow-0.085
der-0.084
 Tim-0.083
Tokenates
Token ID689
Feature activation-3.50e+00

Pos logit contributions
+0.261
irl+0.258
St+0.256
 total+0.234
 sudden+0.233
Neg logit contributions
 address-0.369
 inside-0.346
fully-0.345
 arrow-0.333
iece-0.333
Token!
Token ID0
Feature activation+3.93e+00

Pos logit contributions
+0.115
uch+0.112
ink+0.111
irl+0.108
St+0.106
Neg logit contributions
 address-0.076
 Acc-0.072
 Tim-0.069
 cries-0.067
der-0.066
Token
Token ID220
Feature activation-8.38e+00

Pos logit contributions
 address+0.065
 alien+0.064
 arrow+0.063
der+0.063
 decide+0.061
Neg logit contributions
-0.152
St-0.139
Once-0.131
 total-0.127
P-0.127
Token
Token ID198
Feature activation-8.12e+00

Pos logit contributions
uch+0.265
+0.265
irl+0.257
St+0.256
P+0.254
Neg logit contributions
 address-0.192
 Tim-0.187
der-0.171
 cries-0.169
 Acc-0.166
Token
Token ID198
Feature activation-7.61e+00

Pos logit contributions
uch+0.286
+0.282
irl+0.273
ink+0.267
St+0.267
Neg logit contributions
 Tim-0.164
 address-0.157
 cries-0.143
old-0.142
 He-0.140
Token find
Token ID1064
Feature activation+1.31e+00

Pos logit contributions
 address+0.000
 Tim+0.000
der+0.000
 arrow+0.000
 cries+0.000
Neg logit contributions
-0.001
uch-0.001
St-0.001
irl-0.001
P-0.001
Token a
Token ID257
Feature activation-1.13e+00

Pos logit contributions
 Acc+0.028
 address+0.028
person+0.028
der+0.027
ly+0.026
Neg logit contributions
St-0.041
 total-0.039
irl-0.038
-0.038
uch-0.037
Token box
Token ID3091
Feature activation-2.85e+00

Pos logit contributions
+0.039
uch+0.038
ink+0.037
irl+0.037
P+0.036
Neg logit contributions
 address-0.025
 arrow-0.023
 Tim-0.023
 fix-0.021
 inside-0.021
Token of
Token ID286
Feature activation-8.40e-01

Pos logit contributions
+0.081
ink+0.081
 total+0.078
irl+0.076
uch+0.076
Neg logit contributions
 Acc-0.072
 address-0.066
 disl-0.066
 cries-0.065
der-0.065
Token ch
Token ID442
Feature activation-2.96e+00

Pos logit contributions
St+0.026
irl+0.025
+0.025
 total+0.025
P+0.024
Neg logit contributions
 address-0.019
der-0.018
person-0.017
 Acc-0.017
ly-0.017
Tokenocol
Token ID4668
Feature activation-1.20e+01

Pos logit contributions
+0.085
uch+0.083
St+0.078
P+0.076
if+0.076
Neg logit contributions
 address-0.080
ly-0.079
der-0.076
 arrow-0.076
 cries-0.076
Tokenates
Token ID689
Feature activation-3.44e+00

Pos logit contributions
+0.268
irl+0.251
St+0.242
 sudden+0.230
 total+0.221
Neg logit contributions
 address-0.385
 inside-0.358
 writes-0.355
fully-0.351
iece-0.345
Token.
Token ID13
Feature activation+3.81e+00

Pos logit contributions
+0.128
ink+0.125
uch+0.124
irl+0.120
St+0.120
Neg logit contributions
 Acc-0.057
 address-0.057
 Tim-0.050
 cries-0.050
person-0.050
Token
Token ID198
Feature activation-8.22e+00

Pos logit contributions
 address+0.075
ly+0.071
der+0.068
 cries+0.068
 arrow+0.068
Neg logit contributions
-0.137
St-0.131
irl-0.124
P-0.123
 total-0.122
Token
Token ID198
Feature activation-7.79e+00

Pos logit contributions
+0.235
uch+0.226
St+0.225
irl+0.224
P+0.219
Neg logit contributions
 address-0.216
der-0.199
 Tim-0.199
ly-0.197
 arrow-0.194
Token"
Token ID1
Feature activation-3.80e-01

Pos logit contributions
+0.177
uch+0.171
St+0.170
irl+0.166
P+0.166
Neg logit contributions
 address-0.252
 Tim-0.237
der-0.235
ly-0.232
 cries-0.231
Token.
Token ID13
Feature activation+4.70e+00

Pos logit contributions
+0.049
uch+0.048
irl+0.048
St+0.048
P+0.046
Neg logit contributions
 address-0.041
 Tim-0.038
der-0.037
 Acc-0.037
person-0.037
Token One
Token ID1881
Feature activation-2.68e+00

Pos logit contributions
 address+0.066
 Tim+0.058
der+0.056
ly+0.055
 arrow+0.054
Neg logit contributions
-0.192
uch-0.187
St-0.185
irl-0.184
P-0.182
Token of
Token ID286
Feature activation+8.47e-03

Pos logit contributions
+0.101
irl+0.099
P+0.097
St+0.096
uch+0.096
Neg logit contributions
 address-0.049
 Tim-0.046
 that-0.045
 for-0.044
 says-0.043
Token the
Token ID262
Feature activation+1.03e+00

Pos logit contributions
 Tim+0.000
 him+0.000
 that+0.000
 his+0.000
 address+0.000
Neg logit contributions
uch-0.000
-0.000
ink-0.000
if-0.000
irl-0.000
Token ost
Token ID23619
Feature activation-6.45e+00

Pos logit contributions
 address+0.016
 old+0.015
 say+0.015
 elephant+0.015
 says+0.014
Neg logit contributions
-0.047
irl-0.044
uch-0.042
P-0.040
ink-0.039
Tokenric
Token ID1173
Feature activation-1.20e+01

Pos logit contributions
P+0.171
+0.169
St+0.162
irl+0.161
br+0.149
Neg logit contributions
 address-0.173
der-0.165
 for-0.165
 Acc-0.165
 elephant-0.163
Tokenhes
Token ID956
Feature activation+2.84e-01

Pos logit contributions
+0.355
St+0.311
uch+0.309
irl+0.306
 total+0.302
Neg logit contributions
 address-0.328
der-0.307
 Tim-0.303
 says-0.299
 cries-0.298
Token looked
Token ID3114
Feature activation-6.13e-01

Pos logit contributions
 address+0.006
 Tim+0.006
ly+0.006
der+0.006
 cries+0.006
Neg logit contributions
-0.009
St-0.009
uch-0.009
irl-0.009
P-0.009
Token different
Token ID1180
Feature activation+3.71e+00

Pos logit contributions
irl+0.019
St+0.019
+0.018
P+0.018
uch+0.018
Neg logit contributions
 address-0.014
person-0.014
 Acc-0.012
 Tim-0.012
 alien-0.012
Token from
Token ID422
Feature activation+3.56e-02

Pos logit contributions
 address+0.084
ly+0.077
 Tim+0.077
der+0.076
 cries+0.075
Neg logit contributions
-0.127
uch-0.122
P-0.118
irl-0.118
St-0.117
Token the
Token ID262
Feature activation+1.53e+00

Pos logit contributions
 address+0.001
 Tim+0.001
der+0.001
ly+0.001
 cries+0.001
Neg logit contributions
-0.001
uch-0.001
St-0.001
irl-0.001
P-0.001
Token but
Token ID475
Feature activation+1.44e+00

Pos logit contributions
+0.016
uch+0.016
irl+0.016
St+0.015
ink+0.015
Neg logit contributions
 address-0.010
 Tim-0.010
ly-0.009
 arrow-0.009
der-0.009
Token be
Token ID307
Feature activation-5.42e+00

Pos logit contributions
 address+0.033
 Tim+0.032
 arrow+0.030
 son+0.028
 say+0.028
Neg logit contributions
-0.048
uch-0.046
ink-0.046
St-0.046
irl-0.046
Token careful
Token ID8161
Feature activation+3.42e+00

Pos logit contributions
+0.204
uch+0.180
irl+0.171
ink+0.165
 total+0.162
Neg logit contributions
 address-0.126
 for-0.118
 Tim-0.114
 inside-0.112
 old-0.111
Token.
Token ID13
Feature activation+5.07e+00

Pos logit contributions
 address+0.053
 Tim+0.046
der+0.045
 arrow+0.044
ly+0.043
Neg logit contributions
-0.134
uch-0.132
irl-0.130
St-0.129
ink-0.129
Token Ost
Token ID38919
Feature activation-2.88e+00

Pos logit contributions
 address+0.076
 Tim+0.069
der+0.063
 Acc+0.062
 cries+0.061
Neg logit contributions
-0.202
uch-0.199
irl-0.197
St-0.196
P-0.195
Tokenric
Token ID1173
Feature activation-1.20e+01

Pos logit contributions
br+0.063
irl+0.062
P+0.060
+0.059
 sudden+0.056
Neg logit contributions
 address-0.095
 say-0.088
 Tim-0.088
 The-0.086
 that-0.085
Tokenhes
Token ID956
Feature activation-5.94e-01

Pos logit contributions
+0.393
 total+0.327
uch+0.326
St+0.325
irl+0.324
Neg logit contributions
 address-0.304
 says-0.284
 about-0.283
 that-0.280
 for-0.279
Token can
Token ID460
Feature activation-1.39e+00

Pos logit contributions
+0.022
P+0.020
St+0.020
uch+0.020
irl+0.020
Neg logit contributions
 address-0.011
 Tim-0.010
 cries-0.010
 that-0.010
 says-0.010
Token be
Token ID307
Feature activation-5.14e+00

Pos logit contributions
P+0.048
+0.048
if+0.046
icer+0.046
irl+0.045
Neg logit contributions
 say-0.028
 fix-0.026
 address-0.026
 that-0.026
 Tim-0.026
Token gr
Token ID1036
Feature activation-2.05e+00

Pos logit contributions
+0.200
irl+0.170
uch+0.168
icer+0.165
P+0.163
Neg logit contributions
 for-0.110
 address-0.110
 Tim-0.109
 inside-0.106
 that-0.104
Tokenumpy
Token ID32152
Feature activation+4.39e+00

Pos logit contributions
+0.063
 total+0.059
irl+0.059
St+0.059
P+0.057
Neg logit contributions
old-0.051
 cries-0.048
ly-0.048
 address-0.047
der-0.047
Token noticed
Token ID6810
Feature activation+4.25e+00

Pos logit contributions
 address+0.033
 Tim+0.030
 cries+0.029
der+0.029
ly+0.029
Neg logit contributions
-0.046
uch-0.044
irl-0.044
St-0.044
P-0.043
Token something
Token ID1223
Feature activation-1.49e+00

Pos logit contributions
 address+0.090
person+0.086
 Acc+0.082
 apology+0.079
 Let+0.079
Neg logit contributions
-0.138
St-0.135
P-0.134
irl-0.133
uch-0.131
Token.
Token ID13
Feature activation+4.48e+00

Pos logit contributions
+0.040
irl+0.040
St+0.040
uch+0.039
P+0.039
Neg logit contributions
 address-0.040
 Acc-0.037
person-0.037
 Tim-0.036
 son-0.035
Token The
Token ID383
Feature activation+1.33e+00

Pos logit contributions
 address+0.058
ly+0.052
der+0.048
 arrow+0.047
 cries+0.046
Neg logit contributions
-0.191
St-0.182
P-0.178
uch-0.178
irl-0.176
Token ch
Token ID442
Feature activation-4.17e+00

Pos logit contributions
 address+0.024
 Tim+0.021
der+0.021
ly+0.020
 Acc+0.020
Neg logit contributions
-0.049
uch-0.047
St-0.047
irl-0.046
P-0.046
Tokenocol
Token ID4668
Feature activation-1.20e+01

Pos logit contributions
+0.086
St+0.084
uch+0.084
P+0.080
irl+0.080
Neg logit contributions
ly-0.142
 address-0.140
der-0.140
 alien-0.135
 cries-0.135
Tokenates
Token ID689
Feature activation-4.02e+00

Pos logit contributions
+0.266
irl+0.248
 sudden+0.232
St+0.229
Once+0.223
Neg logit contributions
 address-0.376
fully-0.370
der-0.367
 writes-0.364
 inside-0.352
Token were
Token ID547
Feature activation-5.97e+00

Pos logit contributions
+0.124
St+0.122
P+0.121
uch+0.119
irl+0.119
Neg logit contributions
 address-0.092
 Tim-0.091
der-0.088
ly-0.087
 cries-0.086
Token bitter
Token ID12922
Feature activation+3.48e+00

Pos logit contributions
+0.206
uch+0.199
P+0.192
irl+0.190
St+0.187
Neg logit contributions
 address-0.135
 Tim-0.126
der-0.121
 cries-0.117
 arrow-0.117
Token.
Token ID13
Feature activation+4.19e+00

Pos logit contributions
 address+0.063
 Acc+0.058
person+0.057
 arrow+0.053
old+0.051
Neg logit contributions
-0.128
uch-0.124
ink-0.121
St-0.119
irl-0.118
Token They
Token ID1119
Feature activation+1.42e-01

Pos logit contributions
 address+0.044
 Tim+0.036
ly+0.034
der+0.034
 arrow+0.034
Neg logit contributions
-0.187
uch-0.181
St-0.180
irl-0.178
P-0.177
Token wanted
Token ID2227
Feature activation+1.53e+00

Pos logit contributions
 address+0.001
 Acc+0.001
person+0.001
 Tim+0.001
der+0.001
Neg logit contributions
St-0.002
-0.002
uch-0.002
irl-0.002
P-0.002
Token to
Token ID284
Feature activation+1.29e+00

Pos logit contributions
 address+0.023
person+0.022
der+0.022
ly+0.022
 Acc+0.021
Neg logit contributions
St-0.057
-0.055
 total-0.054
uch-0.054
irl-0.053
Token touch
Token ID3638
Feature activation+1.94e+00

Pos logit contributions
 address+0.027
der+0.025
ly+0.024
 Tim+0.024
 Acc+0.023
Neg logit contributions
-0.044
uch-0.043
irl-0.043
St-0.043
P-0.041
Token the
Token ID262
Feature activation+1.83e+00

Pos logit contributions
 address+0.027
 Tim+0.024
der+0.022
 arrow+0.022
 cries+0.022
Neg logit contributions
-0.080
uch-0.079
irl-0.077
St-0.077
ink-0.076
Token ost
Token ID23619
Feature activation-5.86e+00

Pos logit contributions
 address+0.031
 cries+0.027
 arrow+0.027
 Tim+0.026
 say+0.026
Neg logit contributions
-0.073
irl-0.071
uch-0.068
P-0.068
St-0.068
Tokenric
Token ID1173
Feature activation-1.19e+01

Pos logit contributions
P+0.144
St+0.134
irl+0.132
+0.131
br+0.126
Neg logit contributions
 address-0.170
 Tim-0.169
der-0.168
 Sharing-0.164
 cries-0.164
Tokenhes
Token ID956
Feature activation-1.03e-02

Pos logit contributions
+0.331
St+0.295
uch+0.292
 total+0.290
irl+0.286
Neg logit contributions
 address-0.336
 Tim-0.321
der-0.314
 cries-0.311
 says-0.307
Token.
Token ID13
Feature activation+3.79e+00

Pos logit contributions
+0.000
uch+0.000
St+0.000
irl+0.000
ink+0.000
Neg logit contributions
 address-0.000
 Acc-0.000
 Tim-0.000
der-0.000
 arrow-0.000
Token They
Token ID1119
Feature activation-1.13e-01

Pos logit contributions
 address+0.051
 Tim+0.043
der+0.042
ly+0.042
 arrow+0.041
Neg logit contributions
-0.159
uch-0.154
St-0.154
irl-0.152
P-0.151
Token reached
Token ID4251
Feature activation-2.03e+00

Pos logit contributions
+0.004
uch+0.004
St+0.004
irl+0.004
P+0.003
Neg logit contributions
 address-0.003
 Tim-0.002
der-0.002
ly-0.002
 arrow-0.002
Token out
Token ID503
Feature activation-1.01e+00

Pos logit contributions
+0.075
irl+0.074
St+0.073
uch+0.072
 total+0.070
Neg logit contributions
 address-0.035
der-0.033
ly-0.032
person-0.032
 Acc-0.032
Token was
Token ID373
Feature activation-4.42e+00

Pos logit contributions
+0.040
uch+0.039
St+0.039
irl+0.039
P+0.038
Neg logit contributions
 address-0.030
 Tim-0.028
der-0.027
ly-0.027
 arrow-0.027
Token a
Token ID257
Feature activation-2.25e+00

Pos logit contributions
+0.146
uch+0.143
St+0.142
irl+0.141
ink+0.138
Neg logit contributions
 address-0.093
der-0.086
 Tim-0.086
ly-0.084
 cries-0.083
Token box
Token ID3091
Feature activation-3.04e+00

Pos logit contributions
+0.074
uch+0.073
irl+0.071
St+0.071
ink+0.070
Neg logit contributions
 address-0.051
 Tim-0.047
 arrow-0.046
der-0.043
 cries-0.043
Token of
Token ID286
Feature activation-1.06e+00

Pos logit contributions
+0.097
ink+0.096
uch+0.094
St+0.091
 total+0.090
Neg logit contributions
 address-0.065
 Acc-0.065
der-0.061
 cries-0.061
 apology-0.061
Token ch
Token ID442
Feature activation-3.17e+00

Pos logit contributions
St+0.031
P+0.030
 total+0.030
irl+0.030
+0.030
Neg logit contributions
der-0.024
 address-0.023
person-0.023
 Acc-0.023
fully-0.022
Tokenocol
Token ID4668
Feature activation-1.19e+01

Pos logit contributions
+0.080
uch+0.077
St+0.072
irl+0.072
if+0.070
Neg logit contributions
ly-0.096
 address-0.094
 Tim-0.092
 arrow-0.092
der-0.091
Tokenates
Token ID689
Feature activation-3.46e+00

Pos logit contributions
+0.256
irl+0.245
St+0.243
 total+0.223
Once+0.213
Neg logit contributions
 address-0.383
 inside-0.362
fully-0.360
der-0.354
 him-0.354
Token.
Token ID13
Feature activation+4.13e+00

Pos logit contributions
+0.129
uch+0.126
ink+0.125
irl+0.121
St+0.121
Neg logit contributions
 address-0.059
 Acc-0.057
 Tim-0.053
 cries-0.052
 apology-0.051
Token Lily
Token ID20037
Feature activation+1.38e+00

Pos logit contributions
 address+0.053
ly+0.050
 arrow+0.047
person+0.045
 cries+0.045
Neg logit contributions
-0.175
St-0.168
P-0.160
irl-0.158
 total-0.157
Token loved
Token ID6151
Feature activation+1.24e+00

Pos logit contributions
 address+0.031
 Tim+0.029
der+0.029
ly+0.028
 Acc+0.028
Neg logit contributions
-0.044
St-0.043
uch-0.043
irl-0.042
ink-0.042
Token ch
Token ID442
Feature activation-2.35e+00

Pos logit contributions
 address+0.021
 Tim+0.019
der+0.017
 arrow+0.017
 cries+0.017
Neg logit contributions
-0.049
uch-0.049
irl-0.047
ink-0.047
P-0.046
Tokenbled
Token ID9342
Feature activation+2.81e+00

Pos logit contributions
ink+0.035
irl+0.035
+0.035
uch+0.033
St+0.033
Neg logit contributions
 address-0.046
 officer-0.044
 Acc-0.044
 arrow-0.044
 disl-0.044
Token up
Token ID510
Feature activation+1.67e+00

Pos logit contributions
 address+0.027
 Tim+0.023
ly+0.021
der+0.020
 cries+0.020
Neg logit contributions
-0.128
uch-0.126
irl-0.123
St-0.122
P-0.122
Token all
Token ID477
Feature activation-4.41e-01

Pos logit contributions
 address+0.024
 Tim+0.021
der+0.021
ly+0.020
 arrow+0.020
Neg logit contributions
-0.067
uch-0.066
St-0.065
irl-0.065
P-0.064
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
St+0.016
P+0.015
+0.015
irl+0.015
 total+0.015
Neg logit contributions
 address-0.009
der-0.008
 Acc-0.007
 arrow-0.007
person-0.007
Token ch
Token ID442
Feature activation-3.27e+00

Pos logit contributions
 address+0.020
ly+0.020
 for+0.019
 inside+0.019
 says+0.018
Neg logit contributions
-0.050
uch-0.046
irl-0.045
ink-0.044
P-0.043
Tokenocol
Token ID4668
Feature activation-1.19e+01

Pos logit contributions
+0.083
uch+0.082
St+0.081
irl+0.075
P+0.075
Neg logit contributions
 address-0.098
ly-0.096
der-0.093
old-0.092
 cries-0.089
Tokenates
Token ID689
Feature activation-3.87e+00

Pos logit contributions
+0.264
irl+0.248
St+0.239
 sudden+0.234
 total+0.229
Neg logit contributions
 address-0.376
fully-0.369
der-0.351
 inside-0.347
ly-0.344
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
+0.147
uch+0.142
ink+0.140
irl+0.138
St+0.137
Neg logit contributions
 address-0.068
 Acc-0.060
 Tim-0.057
 cries-0.055
 arrow-0.055
Token and
Token ID290
Feature activation+6.37e-01

Pos logit contributions
+0.042
St+0.041
uch+0.040
irl+0.040
ink+0.039
Neg logit contributions
 address-0.024
der-0.022
 Tim-0.021
ly-0.021
 cries-0.021
Token she
Token ID673
Feature activation+3.47e-01

Pos logit contributions
 address+0.013
der+0.012
 Acc+0.012
ly+0.012
 apology+0.012
Neg logit contributions
-0.021
St-0.020
irl-0.020
uch-0.020
ink-0.020
Token was
Token ID373
Feature activation-4.13e+00

Pos logit contributions
 address+0.007
 Tim+0.007
der+0.006
 cries+0.006
 arrow+0.006
Neg logit contributions
-0.012
uch-0.012
St-0.011
irl-0.011
P-0.011
Token was
Token ID373
Feature activation-3.93e+00

Pos logit contributions
 address+0.031
 Tim+0.029
der+0.027
ly+0.027
 cries+0.026
Neg logit contributions
-0.071
uch-0.070
St-0.069
irl-0.069
P-0.068
Token a
Token ID257
Feature activation-1.67e+00

Pos logit contributions
+0.133
uch+0.130
St+0.128
irl+0.128
P+0.126
Neg logit contributions
 address-0.082
 Tim-0.076
der-0.074
ly-0.072
 arrow-0.072
Token box
Token ID3091
Feature activation-3.13e+00

Pos logit contributions
uch+0.049
+0.049
irl+0.048
St+0.047
ink+0.047
Neg logit contributions
 address-0.043
 Tim-0.040
 arrow-0.039
 son-0.037
 cries-0.037
Token of
Token ID286
Feature activation-9.54e-01

Pos logit contributions
+0.103
ink+0.101
uch+0.094
irl+0.093
if+0.093
Neg logit contributions
 Acc-0.069
 address-0.063
 disl-0.062
der-0.061
 apology-0.061
Token ch
Token ID442
Feature activation-3.00e+00

Pos logit contributions
 total+0.024
St+0.024
P+0.023
+0.023
irl+0.022
Neg logit contributions
der-0.026
fully-0.024
person-0.023
 Acc-0.023
 address-0.023
Tokenocol
Token ID4668
Feature activation-1.18e+01

Pos logit contributions
+0.066
uch+0.064
St+0.061
irl+0.059
P+0.058
Neg logit contributions
 address-0.101
ly-0.097
 Tim-0.095
der-0.095
 cries-0.094
Tokenates
Token ID689
Feature activation-3.59e+00

Pos logit contributions
+0.260
irl+0.255
St+0.246
 total+0.231
 sudden+0.215
Neg logit contributions
 address-0.385
 inside-0.352
fully-0.350
 arrow-0.345
der-0.343
Token!
Token ID0
Feature activation+4.29e+00

Pos logit contributions
+0.127
ink+0.127
uch+0.121
if+0.119
irl+0.116
Neg logit contributions
 Acc-0.072
 address-0.062
 apology-0.061
 Tim-0.060
 fixes-0.059
Token Alice
Token ID14862
Feature activation+9.79e-01

Pos logit contributions
 address+0.073
 Tim+0.065
der+0.061
 cries+0.060
ly+0.059
Neg logit contributions
uch-0.165
-0.164
irl-0.162
P-0.160
St-0.159
Token and
Token ID290
Feature activation+8.17e-01

Pos logit contributions
 address+0.019
 Tim+0.018
 cries+0.017
ly+0.016
 arrow+0.016
Neg logit contributions
-0.035
uch-0.035
irl-0.034
P-0.034
St-0.033
Token her
Token ID607
Feature activation+6.38e-01

Pos logit contributions
der+0.012
 Acc+0.012
 address+0.011
ly+0.010
 alien+0.010
Neg logit contributions
St-0.031
-0.030
irl-0.030
 total-0.030
uch-0.029
Token.
Token ID13
Feature activation+4.20e+00

Pos logit contributions
 address+0.007
 Tim+0.006
der+0.006
ly+0.005
 cries+0.005
Neg logit contributions
-0.031
uch-0.030
St-0.030
irl-0.030
P-0.029
Token It
Token ID632
Feature activation-1.27e+00

Pos logit contributions
ly+0.060
 airplane+0.054
 address+0.053
fully+0.053
 cries+0.050
Neg logit contributions
-0.169
St-0.160
 total-0.149
ink-0.147
Once-0.144
Token sees
Token ID7224
Feature activation+2.68e+00

Pos logit contributions
+0.044
uch+0.043
St+0.043
irl+0.043
ink+0.042
Neg logit contributions
 address-0.025
 Tim-0.023
der-0.022
ly-0.022
 arrow-0.021
Token other
Token ID584
Feature activation+1.85e+00

Pos logit contributions
 address+0.049
 Acc+0.043
 Tim+0.043
ly+0.042
person+0.042
Neg logit contributions
-0.095
St-0.094
uch-0.093
irl-0.092
P-0.091
Token ost
Token ID23619
Feature activation-5.83e+00

Pos logit contributions
 address+0.032
 Tim+0.029
ly+0.026
 arrow+0.026
 son+0.026
Neg logit contributions
-0.075
uch-0.072
irl-0.071
St-0.069
P-0.069
Tokenric
Token ID1173
Feature activation-1.18e+01

Pos logit contributions
P+0.158
St+0.149
irl+0.148
+0.145
br+0.143
Neg logit contributions
 address-0.156
 Tim-0.149
 Acc-0.147
 earlier-0.146
 elephant-0.145
Tokenhes
Token ID956
Feature activation+1.70e-01

Pos logit contributions
+0.330
St+0.295
uch+0.294
irl+0.289
 total+0.284
Neg logit contributions
 address-0.340
 Tim-0.318
der-0.314
 cries-0.312
 says-0.305
Token.
Token ID13
Feature activation+3.85e+00

Pos logit contributions
 address+0.002
 Tim+0.002
der+0.002
 Acc+0.002
ly+0.002
Neg logit contributions
-0.007
uch-0.007
St-0.007
irl-0.007
ink-0.007
Token It
Token ID632
Feature activation-1.40e+00

Pos logit contributions
 address+0.041
ly+0.036
 cries+0.033
 arrow+0.033
der+0.033
Neg logit contributions
-0.170
St-0.163
uch-0.159
irl-0.159
ink-0.159
Token makes
Token ID1838
Feature activation+1.63e+00

Pos logit contributions
+0.049
uch+0.048
St+0.048
irl+0.048
ink+0.047
Neg logit contributions
 address-0.027
 Tim-0.025
der-0.024
ly-0.024
 arrow-0.023
Token new
Token ID649
Feature activation+2.54e+00

Pos logit contributions
 address+0.031
der+0.028
 Acc+0.028
 Tim+0.027
ly+0.027
Neg logit contributions
-0.056
uch-0.055
St-0.055
irl-0.055
ink-0.054
Token said
Token ID531
Feature activation+4.72e-01

Pos logit contributions
 address+0.021
 Acc+0.018
 Tim+0.018
der+0.018
person+0.018
Neg logit contributions
-0.032
St-0.032
uch-0.032
irl-0.032
P-0.032
Token,
Token ID11
Feature activation-1.26e+00

Pos logit contributions
 address+0.010
 Tim+0.009
 cries+0.009
 arrow+0.009
 writes+0.009
Neg logit contributions
-0.016
ink-0.015
St-0.015
uch-0.015
irl-0.015
Token "
Token ID366
Feature activation-1.53e+00

Pos logit contributions
+0.056
uch+0.056
irl+0.055
St+0.055
P+0.055
Neg logit contributions
 address-0.012
 Tim-0.011
der-0.010
ly-0.009
 arrow-0.009
TokenThank
Token ID10449
Feature activation-2.20e+00

Pos logit contributions
irl+0.053
uch+0.053
+0.051
icer+0.051
ink+0.050
Neg logit contributions
 Let-0.029
person-0.029
 address-0.028
ly-0.027
 He-0.027
Token you
Token ID345
Feature activation-2.06e+00

Pos logit contributions
+0.080
irl+0.080
 total+0.079
uch+0.078
icer+0.078
Neg logit contributions
 that-0.040
 him-0.040
 Tim-0.038
 for-0.037
ly-0.036
Token!
Token ID0
Feature activation+4.76e+00

Pos logit contributions
+0.079
uch+0.077
St+0.076
irl+0.075
P+0.074
Neg logit contributions
 address-0.035
 Tim-0.031
der-0.030
 arrow-0.029
 cries-0.029
Token I
Token ID314
Feature activation-3.18e-01

Pos logit contributions
 Tim+0.068
 address+0.066
 Let+0.061
 Acc+0.061
 He+0.058
Neg logit contributions
uch-0.193
irl-0.193
P-0.188
-0.186
St-0.185
Token promise
Token ID6991
Feature activation+2.99e+00

Pos logit contributions
+0.010
icer+0.010
St+0.010
P+0.010
uch+0.010
Neg logit contributions
 address-0.007
 say-0.007
 Tim-0.007
 says-0.007
 decide-0.007
Token I
Token ID314
Feature activation+1.09e-01

Pos logit contributions
 address+0.055
 Tim+0.050
der+0.048
ly+0.047
 arrow+0.047
Neg logit contributions
-0.109
uch-0.107
St-0.105
irl-0.104
ink-0.103
Token'll
Token ID1183
Feature activation+6.82e-02

Pos logit contributions
 address+0.003
 Tim+0.003
 say+0.002
 says+0.002
 fix+0.002
Neg logit contributions
-0.003
St-0.003
P-0.003
irl-0.003
uch-0.003
Token only
Token ID691
Feature activation+1.41e+00

Pos logit contributions
 address+0.001
 say+0.001
 fix+0.001
 Tim+0.001
 arrow+0.001
Neg logit contributions
-0.002
icer-0.002
uch-0.002
St-0.002
P-0.002
Token his
Token ID465
Feature activation+6.99e-01

Pos logit contributions
ly+0.015
der+0.015
 911+0.015
 address+0.015
 apology+0.014
Neg logit contributions
-0.023
ink-0.023
St-0.023
irl-0.023
if-0.022
Token mom
Token ID1995
Feature activation+1.94e+00

Pos logit contributions
 address+0.010
der+0.009
 Tim+0.009
ly+0.009
 Acc+0.008
Neg logit contributions
-0.028
uch-0.027
St-0.027
irl-0.027
ink-0.026
Token could
Token ID714
Feature activation-2.11e+00

Pos logit contributions
 address+0.042
 Tim+0.039
der+0.038
ly+0.037
 arrow+0.037
Neg logit contributions
-0.064
uch-0.063
St-0.062
irl-0.061
ink-0.061
Token trust
Token ID3774
Feature activation+1.73e+00

Pos logit contributions
+0.077
uch+0.077
St+0.076
P+0.075
irl+0.075
Neg logit contributions
 address-0.038
 Tim-0.035
 fix-0.032
der-0.032
 arrow-0.031
Token the
Token ID262
Feature activation+2.17e+00

Pos logit contributions
 address+0.017
 Tim+0.013
der+0.013
ly+0.012
 arrow+0.012
Neg logit contributions
-0.078
uch-0.077
St-0.076
irl-0.076
P-0.075
Token reliable
Token ID9314
Feature activation+4.94e+00

Pos logit contributions
 address+0.037
 Tim+0.033
der+0.032
ly+0.031
 cries+0.031
Neg logit contributions
-0.083
uch-0.082
irl-0.080
St-0.080
ink-0.079
Token veterinarian
Token ID41404
Feature activation+2.81e+00

Pos logit contributions
 address+0.097
 Acc+0.095
 Tim+0.093
der+0.090
 arrow+0.086
Neg logit contributions
-0.169
St-0.162
uch-0.157
 total-0.155
ink-0.155
Token who
Token ID508
Feature activation+2.64e+00

Pos logit contributions
 address+0.045
 Acc+0.041
der+0.041
 Tim+0.041
 arrow+0.039
Neg logit contributions
-0.105
uch-0.101
ink-0.099
St-0.099
if-0.098
Token could
Token ID714
Feature activation-1.79e+00

Pos logit contributions
 address+0.059
 Tim+0.055
der+0.054
ly+0.053
 arrow+0.053
Neg logit contributions
-0.084
uch-0.082
St-0.081
irl-0.080
ink-0.079
Token replace
Token ID6330
Feature activation+3.99e+00

Pos logit contributions
+0.066
uch+0.065
St+0.064
irl+0.064
P+0.064
Neg logit contributions
 address-0.032
 Tim-0.029
der-0.028
 cries-0.027
ly-0.027
Token Dave
Token ID9935
Feature activation+2.57e-01

Pos logit contributions
 address+0.053
der+0.045
 Acc+0.043
ly+0.043
 arrow+0.041
Neg logit contributions
-0.163
St-0.161
uch-0.159
irl-0.158
P-0.157
Token,
Token ID11
Feature activation-9.98e-01

Pos logit contributions
 address+0.023
person+0.021
 pavement+0.020
 arrow+0.020
 cries+0.020
Neg logit contributions
-0.027
irl-0.026
St-0.025
ink-0.025
P-0.024
Token "
Token ID366
Feature activation-1.11e+00

Pos logit contributions
+0.037
St+0.036
irl+0.034
ink+0.033
if+0.033
Neg logit contributions
 address-0.017
ly-0.015
old-0.015
 cries-0.015
 arrow-0.014
TokenOkay
Token ID16454
Feature activation+1.44e+00

Pos logit contributions
ink+0.036
irl+0.036
uch+0.036
+0.036
icer+0.034
Neg logit contributions
 address-0.023
 Tim-0.022
 Let-0.022
person-0.022
 Sharing-0.022
Token,
Token ID11
Feature activation-7.28e-01

Pos logit contributions
 address+0.028
der+0.026
 cries+0.025
 arrow+0.025
 Acc+0.025
Neg logit contributions
-0.051
uch-0.049
irl-0.048
St-0.048
ink-0.047
Token Tommy
Token ID19919
Feature activation+9.87e-01

Pos logit contributions
+0.028
St+0.027
uch+0.027
irl+0.027
 total+0.026
Neg logit contributions
 address-0.011
der-0.011
 cries-0.010
ly-0.010
 Acc-0.010
Token.
Token ID13
Feature activation+5.14e+00

Pos logit contributions
der+0.026
 address+0.025
 footsteps+0.024
 Acc+0.023
 apology+0.023
Neg logit contributions
uch-0.027
ink-0.027
-0.026
P-0.025
St-0.025
Token We
Token ID775
Feature activation+3.66e-01

Pos logit contributions
 address+0.065
 Tim+0.058
der+0.053
ly+0.051
 arrow+0.051
Neg logit contributions
-0.216
uch-0.213
irl-0.210
St-0.210
P-0.208
Token can
Token ID460
Feature activation-7.16e-01

Pos logit contributions
 address+0.009
 Tim+0.009
 arrow+0.008
 fix+0.008
 decide+0.008
Neg logit contributions
-0.011
uch-0.011
St-0.011
if-0.010
irl-0.010
Token play
Token ID711
Feature activation+2.25e-01

Pos logit contributions
+0.027
uch+0.026
St+0.026
irl+0.026
P+0.026
Neg logit contributions
 address-0.012
 Tim-0.011
der-0.011
 arrow-0.010
ly-0.010
Token together
Token ID1978
Feature activation+3.80e-01

Pos logit contributions
 address+0.005
 Tim+0.004
der+0.004
ly+0.004
 cries+0.004
Neg logit contributions
-0.008
uch-0.007
irl-0.007
St-0.007
P-0.007
Token."
Token ID526
Feature activation+2.25e+00

Pos logit contributions
 for+0.004
 Tim+0.004
 address+0.004
der+0.004
 inside+0.004
Neg logit contributions
-0.017
uch-0.016
St-0.016
P-0.016
ink-0.015
Token sky
Token ID6766
Feature activation-3.91e+00

Pos logit contributions
 address+0.029
 Tim+0.026
 cries+0.025
ly+0.025
 arrow+0.025
Neg logit contributions
-0.054
irl-0.053
uch-0.053
St-0.052
ink-0.051
Token and
Token ID290
Feature activation+1.76e+00

Pos logit contributions
+0.169
St+0.163
P+0.162
uch+0.160
irl+0.160
Neg logit contributions
 address-0.051
 Tim-0.045
der-0.042
 for-0.041
ly-0.041
Token make
Token ID787
Feature activation+2.15e+00

Pos logit contributions
 address+0.031
 Tim+0.028
der+0.028
ly+0.027
 cries+0.027
Neg logit contributions
-0.064
uch-0.063
St-0.062
irl-0.062
ink-0.060
Token pretty
Token ID2495
Feature activation+1.68e+00

Pos logit contributions
 address+0.042
 Tim+0.038
der+0.036
 cries+0.036
 arrow+0.036
Neg logit contributions
-0.078
uch-0.075
St-0.074
irl-0.073
P-0.073
Token colors
Token ID7577
Feature activation-2.15e+00

Pos logit contributions
 address+0.031
 Tim+0.028
der+0.027
 Acc+0.027
 arrow+0.027
Neg logit contributions
-0.059
irl-0.058
uch-0.058
St-0.058
ink-0.057
Token.
Token ID13
Feature activation+4.73e+00

Pos logit contributions
+0.077
uch+0.076
irl+0.076
ink+0.076
St+0.075
Neg logit contributions
 address-0.038
 Tim-0.035
 Acc-0.034
 cries-0.033
der-0.033
Token We
Token ID775
Feature activation+6.21e-01

Pos logit contributions
 address+0.065
 Tim+0.057
der+0.055
ly+0.054
 arrow+0.053
Neg logit contributions
-0.193
uch-0.189
St-0.187
irl-0.186
P-0.184
Token can
Token ID460
Feature activation-6.43e-01

Pos logit contributions
 address+0.013
 fix+0.012
 Tim+0.012
 say+0.012
 arrow+0.012
Neg logit contributions
-0.022
P-0.021
uch-0.021
St-0.021
irl-0.020
Token watch
Token ID2342
Feature activation+3.11e+00

Pos logit contributions
+0.024
P+0.024
uch+0.024
St+0.023
irl+0.023
Neg logit contributions
 address-0.012
 Tim-0.011
 fix-0.011
 say-0.010
 arrow-0.010
Token them
Token ID606
Feature activation-1.38e+00

Pos logit contributions
 address+0.049
 Tim+0.044
der+0.041
ly+0.040
 cries+0.040
Neg logit contributions
-0.124
uch-0.121
P-0.118
St-0.118
irl-0.118
Token tonight
Token ID9975
Feature activation+8.62e-02

Pos logit contributions
+0.054
P+0.052
uch+0.052
St+0.051
irl+0.050
Neg logit contributions
 address-0.023
 Tim-0.022
der-0.020
 arrow-0.020
 fix-0.020
Token I
Token ID314
Feature activation+7.26e-02

Pos logit contributions
 address+0.072
 Tim+0.063
der+0.061
ly+0.060
 arrow+0.059
Neg logit contributions
-0.212
uch-0.207
St-0.205
irl-0.204
P-0.201
Token want
Token ID765
Feature activation+2.00e+00

Pos logit contributions
 say+0.001
 Tim+0.001
 address+0.001
 bury+0.001
 decide+0.001
Neg logit contributions
P-0.003
-0.002
St-0.002
if-0.002
uch-0.002
Token you
Token ID345
Feature activation-3.46e-01

Pos logit contributions
 address+0.021
 Tim+0.018
der+0.016
ly+0.015
 cries+0.015
Neg logit contributions
-0.090
uch-0.088
irl-0.087
St-0.087
ink-0.086
Token to
Token ID284
Feature activation+1.35e+00

Pos logit contributions
+0.017
St+0.016
irl+0.016
ink+0.016
uch+0.016
Neg logit contributions
 address-0.003
 Tim-0.002
 for-0.002
ly-0.002
 inside-0.002
Token be
Token ID307
Feature activation-4.13e+00

Pos logit contributions
 address+0.019
 Tim+0.018
 say+0.017
 fix+0.016
 decide+0.016
Neg logit contributions
-0.055
St-0.054
P-0.054
uch-0.054
if-0.053
Token alert
Token ID7995
Feature activation+5.94e+00

Pos logit contributions
+0.149
uch+0.138
St+0.133
ink+0.133
irl+0.131
Neg logit contributions
 address-0.088
 Tim-0.088
 inside-0.079
 for-0.079
 arrow-0.078
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
 address+0.099
 Acc+0.090
person+0.088
 arrow+0.084
 fixes+0.082
Neg logit contributions
if-0.207
irl-0.203
uch-0.201
ink-0.201
St-0.201
Token watch
Token ID2342
Feature activation+2.54e+00

Pos logit contributions
 address+0.024
 Tim+0.022
der+0.022
ly+0.021
 cries+0.021
Neg logit contributions
-0.035
uch-0.034
St-0.034
irl-0.034
P-0.033
Token out
Token ID503
Feature activation-1.12e+00

Pos logit contributions
 address+0.041
 Tim+0.037
der+0.035
ly+0.034
 arrow+0.034
Neg logit contributions
-0.098
uch-0.096
St-0.095
irl-0.095
P-0.093
Token for
Token ID329
Feature activation+3.74e-01

Pos logit contributions
irl+0.039
St+0.039
uch+0.039
ink+0.039
+0.038
Neg logit contributions
 address-0.020
person-0.018
 Acc-0.017
der-0.017
 arrow-0.017
Token cars
Token ID5006
Feature activation-2.32e+00

Pos logit contributions
 Tim+0.006
 address+0.005
 him+0.005
 cries+0.005
 that+0.005
Neg logit contributions
-0.016
ink-0.015
uch-0.015
P-0.015
St-0.015
Token
Token ID198
Feature activation-8.00e+00

Pos logit contributions
+0.266
uch+0.261
irl+0.253
St+0.252
ink+0.249
Neg logit contributions
 address-0.208
 Tim-0.198
der-0.187
 cries-0.187
 arrow-0.184
TokenThe
Token ID464
Feature activation+8.03e-01

Pos logit contributions
+0.239
uch+0.231
ink+0.225
irl+0.219
icer+0.215
Neg logit contributions
 Tim-0.197
 address-0.195
 fix-0.191
 arrow-0.189
 decide-0.188
Token big
Token ID1263
Feature activation+9.85e-01

Pos logit contributions
 address+0.016
der+0.015
 Acc+0.014
ly+0.014
person+0.014
Neg logit contributions
-0.028
St-0.027
uch-0.027
irl-0.027
P-0.027
Token boy
Token ID2933
Feature activation-1.07e+00

Pos logit contributions
der+0.026
person+0.026
fully+0.025
 Acc+0.024
 address+0.024
Neg logit contributions
 time-0.028
 total-0.025
Once-0.025
icer-0.025
 year-0.025
Token does
Token ID857
Feature activation+9.07e-01

Pos logit contributions
irl+0.036
+0.036
St+0.036
uch+0.036
ink+0.035
Neg logit contributions
 address-0.020
person-0.019
 Tim-0.018
 Acc-0.018
der-0.018
Token not
Token ID407
Feature activation-1.07e+00

Pos logit contributions
 address+0.023
 Tim+0.022
der+0.021
 cries+0.021
 arrow+0.021
Neg logit contributions
-0.027
uch-0.026
St-0.025
P-0.025
ink-0.025
Token listen
Token ID6004
Feature activation+4.12e+00

Pos logit contributions
+0.041
uch+0.041
St+0.041
irl+0.041
P+0.040
Neg logit contributions
 address-0.017
 Tim-0.015
der-0.015
 Acc-0.014
 cries-0.014
Token.
Token ID13
Feature activation+4.13e+00

Pos logit contributions
 address+0.080
 Acc+0.078
 alien+0.077
 arrow+0.076
person+0.071
Neg logit contributions
St-0.132
 time-0.130
if-0.130
-0.129
 total-0.127
Token He
Token ID679
Feature activation+2.14e-01

Pos logit contributions
 address+0.059
 alien+0.055
 airplane+0.055
 arrow+0.055
person+0.052
Neg logit contributions
-0.169
St-0.160
P-0.148
irl-0.147
ink-0.145
Token laughs
Token ID22051
Feature activation+1.43e+00

Pos logit contributions
 address+0.003
 Tim+0.003
der+0.002
ly+0.002
 arrow+0.002
Neg logit contributions
-0.009
uch-0.009
St-0.008
irl-0.008
P-0.008
Token.
Token ID13
Feature activation+4.02e+00

Pos logit contributions
 address+0.034
person+0.034
 Acc+0.032
grand+0.030
 disl+0.030
Neg logit contributions
St-0.043
P-0.041
Once-0.040
 time-0.040
-0.039
Token from
Token ID422
Feature activation-4.52e-01

Pos logit contributions
+0.032
uch+0.031
St+0.031
irl+0.031
P+0.031
Neg logit contributions
 address-0.015
 Tim-0.014
der-0.013
ly-0.013
 cries-0.013
Token flower
Token ID15061
Feature activation-7.79e-01

Pos logit contributions
+0.015
irl+0.015
St+0.014
 total+0.014
uch+0.014
Neg logit contributions
 address-0.010
person-0.009
 Acc-0.009
der-0.009
old-0.009
Token to
Token ID284
Feature activation+5.94e-01

Pos logit contributions
+0.031
uch+0.030
St+0.030
irl+0.030
P+0.029
Neg logit contributions
 address-0.012
 Tim-0.011
der-0.010
ly-0.010
 arrow-0.010
Token flower
Token ID15061
Feature activation-3.48e-01

Pos logit contributions
 address+0.010
 Tim+0.009
der+0.009
 arrow+0.009
 cries+0.009
Neg logit contributions
-0.022
uch-0.022
St-0.022
irl-0.022
ink-0.021
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
+0.013
uch+0.013
St+0.013
irl+0.013
P+0.013
Neg logit contributions
 address-0.006
 Tim-0.005
der-0.005
 arrow-0.005
 cries-0.005
Token collecting
Token ID13157
Feature activation+2.09e+00

Pos logit contributions
+0.039
uch+0.038
St+0.038
irl+0.038
P+0.037
Neg logit contributions
 address-0.020
 Tim-0.018
der-0.018
ly-0.017
 arrow-0.017
Token n
Token ID299
Feature activation-4.44e+00

Pos logit contributions
 address+0.043
 for+0.042
ly+0.041
 inside+0.041
 that+0.041
Neg logit contributions
-0.077
uch-0.075
irl-0.067
icer-0.067
ink-0.066
Tokenectar
Token ID44504
Feature activation-4.51e+00

Pos logit contributions
+0.116
St+0.116
 total+0.115
uch+0.115
br+0.107
Neg logit contributions
 inside-0.114
der-0.112
fully-0.111
 Acc-0.111
old-0.110
Token.
Token ID13
Feature activation+4.07e+00

Pos logit contributions
+0.201
uch+0.197
St+0.195
irl+0.194
P+0.192
Neg logit contributions
 address-0.047
 Tim-0.040
der-0.037
ly-0.036
 arrow-0.035
Token It
Token ID632
Feature activation-1.47e+00

Pos logit contributions
 alien+0.071
 arrow+0.069
 address+0.069
 officer+0.066
person+0.066
Neg logit contributions
-0.154
St-0.142
ink-0.127
irl-0.127
Once-0.126
Token was
Token ID373
Feature activation-4.35e+00

Pos logit contributions
+0.055
uch+0.051
icer+0.050
P+0.049
irl+0.048
Neg logit contributions
 address-0.028
der-0.028
 cries-0.027
 Tim-0.027
 says-0.026
TokenL
Token ID43
Feature activation-4.25e+00

Pos logit contributions
+0.222
uch+0.216
irl+0.203
ink+0.202
St+0.195
Neg logit contributions
 address-0.247
 Tim-0.237
der-0.229
 arrow-0.229
 fix-0.227
Tokenily
Token ID813
Feature activation+1.78e+00

Pos logit contributions
+0.184
uch+0.180
St+0.178
irl+0.178
P+0.176
Neg logit contributions
 address-0.051
 Tim-0.044
der-0.041
ly-0.040
 arrow-0.039
Token and
Token ID290
Feature activation+4.12e-02

Pos logit contributions
 address+0.025
 Tim+0.022
der+0.021
ly+0.020
 cries+0.020
Neg logit contributions
-0.073
uch-0.071
St-0.070
irl-0.070
P-0.069
Token Ben
Token ID3932
Feature activation+4.70e-01

Pos logit contributions
 address+0.001
 Acc+0.001
person+0.001
der+0.001
ly+0.001
Neg logit contributions
-0.002
St-0.002
irl-0.002
uch-0.001
P-0.001
Token thanked
Token ID26280
Feature activation-4.64e-01

Pos logit contributions
 address+0.009
 Tim+0.008
ly+0.008
der+0.008
 arrow+0.008
Neg logit contributions
-0.017
uch-0.016
irl-0.016
St-0.016
P-0.016
Token the
Token ID262
Feature activation+1.75e+00

Pos logit contributions
+0.015
St+0.015
irl+0.015
 total+0.014
P+0.014
Neg logit contributions
person-0.009
der-0.009
 address-0.009
ly-0.009
 Acc-0.009
Token fairy
Token ID25607
Feature activation+3.44e-01

Pos logit contributions
 address+0.032
 Tim+0.028
 arrow+0.027
 cries+0.027
der+0.027
Neg logit contributions
-0.066
uch-0.065
irl-0.063
St-0.062
ink-0.062
Token and
Token ID290
Feature activation+9.84e-01

Pos logit contributions
 address+0.005
 Acc+0.005
 Tim+0.005
 disl+0.004
 pavement+0.004
Neg logit contributions
-0.013
St-0.012
ink-0.012
if-0.012
 total-0.012
Token hugged
Token ID41130
Feature activation+1.11e+00

Pos logit contributions
 Acc+0.022
der+0.021
ly+0.021
 address+0.021
person+0.021
Neg logit contributions
St-0.029
P-0.028
-0.028
irl-0.028
uch-0.027
Token her
Token ID607
Feature activation-4.17e-02

Pos logit contributions
 address+0.023
 Acc+0.023
person+0.022
 fixes+0.021
der+0.021
Neg logit contributions
St-0.035
irl-0.033
-0.032
P-0.032
 time-0.032
Token.
Token ID13
Feature activation+4.12e+00

Pos logit contributions
+0.002
St+0.002
irl+0.002
uch+0.002
ink+0.002
Neg logit contributions
 address-0.001
 Tim-0.001
der-0.001
 Acc-0.000
ly-0.000
Token is
Token ID318
Feature activation-1.78e+00

Pos logit contributions
uch+0.069
+0.065
ink+0.065
irl+0.064
 total+0.063
Neg logit contributions
 address-0.080
 son-0.079
 arrow-0.077
 alien-0.077
person-0.075
Token:
Token ID25
Feature activation+1.13e-01

Pos logit contributions
+0.068
uch+0.067
St+0.067
P+0.066
irl+0.066
Neg logit contributions
 address-0.029
ly-0.027
person-0.025
der-0.025
 son-0.024
Token We
Token ID775
Feature activation+5.43e-01

Pos logit contributions
 address+0.002
ly+0.002
 Tim+0.002
der+0.002
 arrow+0.002
Neg logit contributions
-0.004
St-0.004
uch-0.004
irl-0.004
ink-0.004
Token should
Token ID815
Feature activation+3.38e+00

Pos logit contributions
 address+0.013
 Tim+0.012
der+0.012
ly+0.012
 Acc+0.011
Neg logit contributions
-0.017
uch-0.016
St-0.016
irl-0.016
P-0.016
Token be
Token ID307
Feature activation-4.33e+00

Pos logit contributions
 address+0.074
der+0.070
ly+0.069
 Tim+0.068
 cries+0.066
Neg logit contributions
-0.109
uch-0.108
irl-0.108
St-0.105
P-0.104
Token careful
Token ID8161
Feature activation+4.10e+00

Pos logit contributions
+0.125
uch+0.121
St+0.119
irl+0.118
P+0.116
Neg logit contributions
 address-0.114
 Tim-0.107
der-0.104
ly-0.103
 arrow-0.102
Token and
Token ID290
Feature activation+3.72e-01

Pos logit contributions
 address+0.085
 Tim+0.077
der+0.075
 arrow+0.073
person+0.072
Neg logit contributions
uch-0.136
-0.134
irl-0.134
St-0.131
if-0.131
Token nice
Token ID3621
Feature activation+2.21e+00

Pos logit contributions
 address+0.008
 Tim+0.007
der+0.007
person+0.007
ly+0.007
Neg logit contributions
uch-0.012
St-0.012
irl-0.012
-0.012
P-0.012
Token with
Token ID351
Feature activation-5.96e-01

Pos logit contributions
 address+0.045
person+0.041
 Acc+0.040
 arrow+0.040
 Tim+0.040
Neg logit contributions
 time-0.065
St-0.065
if-0.065
P-0.065
uch-0.064
Token other
Token ID584
Feature activation+1.11e+00

Pos logit contributions
+0.023
uch+0.022
St+0.022
irl+0.022
ink+0.022
Neg logit contributions
 address-0.010
 Tim-0.009
der-0.009
 arrow-0.008
ly-0.008
Token people
Token ID661
Feature activation-2.59e+00

Pos logit contributions
 address+0.023
 Tim+0.021
der+0.021
 arrow+0.020
 Acc+0.020
Neg logit contributions
-0.037
uch-0.036
St-0.036
irl-0.035
P-0.035
Token Lily
Token ID20037
Feature activation+1.39e+00

Pos logit contributions
 address+0.067
ly+0.064
 arrow+0.064
person+0.063
 alien+0.059
Neg logit contributions
-0.169
St-0.158
Once-0.149
irl-0.147
P-0.145
Token drank
Token ID24070
Feature activation+2.35e+00

Pos logit contributions
 address+0.029
 Tim+0.027
 cries+0.026
der+0.026
ly+0.025
Neg logit contributions
-0.048
uch-0.046
irl-0.045
St-0.045
P-0.045
Token some
Token ID617
Feature activation+8.86e-02

Pos logit contributions
 address+0.043
person+0.039
 alien+0.037
 arrow+0.036
old+0.036
Neg logit contributions
-0.087
St-0.086
uch-0.083
irl-0.083
P-0.083
Token water
Token ID1660
Feature activation-1.53e+00

Pos logit contributions
 address+0.002
 Tim+0.001
ly+0.001
der+0.001
 cries+0.001
Neg logit contributions
-0.003
uch-0.003
irl-0.003
St-0.003
ink-0.003
Token and
Token ID290
Feature activation+9.67e-01

Pos logit contributions
+0.061
uch+0.058
St+0.058
ink+0.057
irl+0.057
Neg logit contributions
 address-0.024
 Tim-0.020
 Acc-0.020
 arrow-0.020
person-0.020
Token lay
Token ID3830
Feature activation-8.78e-01

Pos logit contributions
 address+0.024
person+0.024
 Acc+0.023
 alien+0.023
old+0.022
Neg logit contributions
irl-0.026
St-0.026
uch-0.026
-0.026
P-0.026
Token down
Token ID866
Feature activation+2.79e-01

Pos logit contributions
+0.032
uch+0.031
St+0.031
irl+0.030
P+0.030
Neg logit contributions
 address-0.017
 Tim-0.015
 arrow-0.015
der-0.015
 cries-0.014
Token on
Token ID319
Feature activation-4.01e-01

Pos logit contributions
 address+0.006
 arrow+0.005
person+0.005
old+0.005
 Acc+0.005
Neg logit contributions
-0.010
St-0.009
ink-0.009
P-0.009
irl-0.009
Token the
Token ID262
Feature activation+2.20e+00

Pos logit contributions
+0.016
St+0.015
irl+0.014
uch+0.014
P+0.014
Neg logit contributions
 address-0.007
person-0.006
old-0.006
ly-0.006
 arrow-0.006
Token couch
Token ID18507
Feature activation-1.18e+00

Pos logit contributions
old+0.054
 address+0.052
 Acc+0.050
 Tim+0.049
person+0.048
Neg logit contributions
-0.064
St-0.064
uch-0.062
icer-0.061
if-0.060
Token.
Token ID13
Feature activation+4.17e+00

Pos logit contributions
+0.044
St+0.042
uch+0.041
ink+0.041
irl+0.040
Neg logit contributions
 address-0.020
 Acc-0.019
 arrow-0.018
 Tim-0.018
 alien-0.018
Token grown
Token ID7334
Feature activation-9.19e-01

Pos logit contributions
+0.085
uch+0.082
St+0.081
irl+0.081
P+0.079
Neg logit contributions
 address-0.059
 Tim-0.054
der-0.053
ly-0.052
 arrow-0.052
Token up
Token ID510
Feature activation+8.68e-01

Pos logit contributions
St+0.035
+0.035
uch+0.035
P+0.034
irl+0.034
Neg logit contributions
 address-0.015
 Tim-0.013
der-0.013
 Acc-0.013
 arrow-0.012
Token!
Token ID0
Feature activation+3.97e+00

Pos logit contributions
 address+0.011
 Tim+0.011
 earlier+0.010
der+0.009
 for+0.009
Neg logit contributions
uch-0.036
-0.036
St-0.035
irl-0.035
P-0.035
Token She
Token ID1375
Feature activation+3.59e-01

Pos logit contributions
 address+0.059
 arrow+0.055
 alien+0.053
ly+0.050
 airplane+0.049
Neg logit contributions
-0.170
St-0.153
uch-0.145
P-0.145
irl-0.144
Token ordered
Token ID6149
Feature activation+2.50e+00

Pos logit contributions
 address+0.008
 Tim+0.007
der+0.007
ly+0.007
 arrow+0.007
Neg logit contributions
-0.012
uch-0.011
St-0.011
irl-0.011
P-0.011
Token pizza
Token ID14256
Feature activation-1.94e+00

Pos logit contributions
 address+0.042
 Tim+0.038
der+0.035
ly+0.035
 cries+0.034
Neg logit contributions
-0.097
uch-0.095
irl-0.093
St-0.093
P-0.092
Token and
Token ID290
Feature activation+6.51e-01

Pos logit contributions
+0.083
uch+0.081
St+0.080
irl+0.080
P+0.079
Neg logit contributions
 address-0.023
 Tim-0.020
der-0.019
ly-0.018
 arrow-0.018
Token ice
Token ID4771
Feature activation-5.04e+00

Pos logit contributions
 address+0.014
der+0.013
 Acc+0.013
ly+0.013
 cries+0.012
Neg logit contributions
-0.021
uch-0.020
St-0.020
irl-0.020
P-0.020
Token cream
Token ID8566
Feature activation-1.45e+00

Pos logit contributions
+0.168
uch+0.163
St+0.163
irl+0.162
P+0.159
Neg logit contributions
 address-0.111
 Tim-0.103
ly-0.101
der-0.099
 son-0.094
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
+0.058
uch+0.055
St+0.054
ink+0.054
P+0.053
Neg logit contributions
 address-0.022
 Acc-0.020
 arrow-0.019
 cries-0.019
 Tim-0.018
Token her
Token ID607
Feature activation+8.10e-01

Pos logit contributions
+0.046
uch+0.045
irl+0.044
St+0.044
P+0.044
Neg logit contributions
 address-0.020
 Tim-0.019
der-0.018
ly-0.017
 arrow-0.017
Token okay
Token ID8788
Feature activation+1.07e+00

Pos logit contributions
 Tim+0.007
 for+0.006
 address+0.006
 Billy+0.006
 arrow+0.005
Neg logit contributions
-0.008
ink-0.008
St-0.007
P-0.007
irl-0.007
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
 address+0.027
person+0.024
 writes+0.023
der+0.023
 arrow+0.022
Neg logit contributions
uch-0.032
-0.031
 total-0.030
irl-0.030
St-0.029
Token Billy
Token ID15890
Feature activation+2.42e+00

Pos logit contributions
+0.005
Once+0.004
St+0.004
uch+0.004
 total+0.004
Neg logit contributions
 address-0.001
old-0.001
 writes-0.001
person-0.001
minded-0.001
Token.
Token ID13
Feature activation+5.42e+00

Pos logit contributions
 address+0.058
person+0.053
 footsteps+0.052
der+0.051
 pavement+0.050
Neg logit contributions
if-0.071
uch-0.071
St-0.071
ink-0.069
 time-0.067
Token I
Token ID314
Feature activation+1.47e-01

Pos logit contributions
 address+0.113
ly+0.102
 decide+0.102
der+0.099
 arrow+0.096
Neg logit contributions
-0.190
uch-0.177
St-0.177
ink-0.166
irl-0.166
Token'm
Token ID1101
Feature activation-3.45e+00

Pos logit contributions
 address+0.004
 Tim+0.004
der+0.004
 cries+0.004
 Acc+0.004
Neg logit contributions
-0.004
uch-0.004
St-0.004
irl-0.004
P-0.004
Token not
Token ID407
Feature activation-1.39e+00

Pos logit contributions
+0.109
uch+0.106
St+0.104
irl+0.103
P+0.102
Neg logit contributions
 address-0.081
 Tim-0.078
der-0.074
 cries-0.073
 arrow-0.073
Token hurt
Token ID5938
Feature activation+3.37e+00

Pos logit contributions
+0.047
St+0.044
uch+0.044
irl+0.044
ink+0.043
Neg logit contributions
 Tim-0.031
 address-0.031
 arrow-0.029
 cries-0.028
der-0.028
Token too
Token ID1165
Feature activation-3.19e+00

Pos logit contributions
 address+0.075
person+0.060
 writes+0.060
 arrow+0.059
 son+0.058
Neg logit contributions
uch-0.113
-0.112
St-0.111
P-0.109
if-0.109
Token bad
Token ID2089
Feature activation+5.59e+00

Pos logit contributions
+0.105
uch+0.103
St+0.101
irl+0.101
P+0.099
Neg logit contributions
 address-0.070
 Tim-0.065
der-0.063
ly-0.062
 arrow-0.062
Token.
Token ID13
Feature activation+4.97e+00

Pos logit contributions
 address+0.133
person+0.120
 Acc+0.119
 arrow+0.117
old+0.113
Neg logit contributions
uch-0.161
-0.156
if-0.152
 time-0.151
re-0.150
Token little
Token ID1310
Feature activation+2.92e+00

Pos logit contributions
 address+0.040
 son+0.037
 old+0.037
 cries+0.037
 Tim+0.036
Neg logit contributions
irl-0.079
uch-0.079
P-0.077
-0.077
ink-0.073
Token cub
Token ID13617
Feature activation-3.94e+00

Pos logit contributions
 address+0.057
 Tim+0.051
der+0.050
 Acc+0.049
ly+0.049
Neg logit contributions
-0.103
uch-0.101
St-0.100
irl-0.098
P-0.097
Token for
Token ID329
Feature activation+7.68e-02

Pos logit contributions
ink+0.133
+0.132
St+0.131
if+0.130
 total+0.126
Neg logit contributions
 address-0.072
 Acc-0.069
 writes-0.062
 disl-0.062
person-0.061
Token the
Token ID262
Feature activation+1.91e+00

Pos logit contributions
 address+0.001
 Tim+0.001
der+0.001
 arrow+0.001
ly+0.001
Neg logit contributions
-0.003
uch-0.003
St-0.003
irl-0.003
P-0.003
Token nice
Token ID3621
Feature activation+2.87e+00

Pos logit contributions
 address+0.045
 son+0.041
 cries+0.040
 Tim+0.040
 arrow+0.040
Neg logit contributions
uch-0.063
-0.063
P-0.063
irl-0.063
ink-0.062
Token time
Token ID640
Feature activation-3.72e+00

Pos logit contributions
 address+0.073
 Tim+0.069
der+0.068
 Acc+0.067
ly+0.066
Neg logit contributions
-0.082
uch-0.079
St-0.079
irl-0.079
ink-0.077
Token he
Token ID339
Feature activation-5.07e-01

Pos logit contributions
+0.141
ink+0.138
uch+0.136
if+0.134
St+0.134
Neg logit contributions
 address-0.057
 Acc-0.052
 arrow-0.047
 apology-0.047
der-0.046
Token had
Token ID550
Feature activation-9.99e-01

Pos logit contributions
+0.019
uch+0.018
irl+0.018
St+0.018
icer+0.017
Neg logit contributions
 address-0.010
 Tim-0.009
ly-0.009
 arrow-0.008
 says-0.008
Token and
Token ID290
Feature activation+9.29e-01

Pos logit contributions
+0.037
icer+0.034
uch+0.034
St+0.034
irl+0.033
Neg logit contributions
 address-0.021
 Tim-0.020
 inside-0.019
 for-0.019
 arrow-0.018
Token the
Token ID262
Feature activation+1.34e+00

Pos logit contributions
 address+0.020
 Acc+0.019
der+0.019
person+0.018
ly+0.018
Neg logit contributions
-0.027
uch-0.027
irl-0.027
P-0.027
St-0.027
Token cub
Token ID13617
Feature activation-4.53e+00

Pos logit contributions
 address+0.026
 son+0.025
 cries+0.024
 old+0.024
 Tim+0.023
Neg logit contributions
irl-0.051
uch-0.050
P-0.050
ink-0.048
-0.048
Token allows
Token ID3578
Feature activation+2.33e+00

Pos logit contributions
 address+0.030
 cries+0.026
der+0.026
 Tim+0.025
 fixes+0.025
Neg logit contributions
-0.046
irl-0.045
P-0.045
uch-0.044
St-0.043
Token us
Token ID514
Feature activation-2.26e+00

Pos logit contributions
 Tim+0.049
 address+0.049
 that+0.047
 him+0.046
ly+0.046
Neg logit contributions
-0.084
uch-0.078
St-0.077
P-0.076
irl-0.076
Token to
Token ID284
Feature activation+7.94e-01

Pos logit contributions
+0.107
St+0.103
uch+0.102
irl+0.102
P+0.102
Neg logit contributions
 address-0.019
 Tim-0.016
ly-0.014
 arrow-0.014
 for-0.014
Token measure
Token ID3953
Feature activation+2.01e+00

Pos logit contributions
 address+0.015
 Tim+0.013
 fix+0.013
 say+0.013
 arrow+0.013
Neg logit contributions
-0.030
P-0.029
uch-0.029
St-0.029
icer-0.028
Token your
Token ID534
Feature activation+2.01e+00

Pos logit contributions
 address+0.029
 Tim+0.028
 for+0.026
 that+0.024
 him+0.024
Neg logit contributions
-0.087
uch-0.082
P-0.080
St-0.078
ink-0.078
Token temperature
Token ID5951
Feature activation-1.34e+00

Pos logit contributions
 address+0.043
 cries+0.039
 Tim+0.038
 son+0.037
der+0.037
Neg logit contributions
-0.069
uch-0.068
irl-0.067
P-0.067
St-0.066
Token and
Token ID290
Feature activation+1.32e+00

Pos logit contributions
+0.054
St+0.053
irl+0.053
P+0.053
uch+0.053
Neg logit contributions
 address-0.019
 Tim-0.017
der-0.016
ly-0.016
 cries-0.015
Token make
Token ID787
Feature activation+1.84e+00

Pos logit contributions
 address+0.026
 Tim+0.024
der+0.023
ly+0.023
 arrow+0.023
Neg logit contributions
-0.046
uch-0.045
St-0.044
irl-0.044
P-0.043
Token sure
Token ID1654
Feature activation+3.24e+00

Pos logit contributions
 address+0.035
 Tim+0.032
 cries+0.030
der+0.030
 arrow+0.030
Neg logit contributions
-0.068
uch-0.065
St-0.064
irl-0.063
P-0.063
Token you
Token ID345
Feature activation-6.29e-01

Pos logit contributions
 address+0.066
 Tim+0.061
der+0.058
ly+0.057
 cries+0.057
Neg logit contributions
-0.113
uch-0.110
St-0.108
irl-0.108
P-0.107
Token're
Token ID821
Feature activation-4.74e+00

Pos logit contributions
+0.021
P+0.021
St+0.020
irl+0.020
icer+0.019
Neg logit contributions
 address-0.014
 fix-0.013
 Tim-0.013
 say-0.013
 decide-0.012
Token underneath
Token ID14638
Feature activation+2.55e+00

Pos logit contributions
 address+0.027
der+0.024
 Tim+0.024
 arrow+0.023
 Acc+0.023
Neg logit contributions
-0.059
irl-0.058
St-0.058
uch-0.058
ink-0.057
Token,
Token ID11
Feature activation-6.21e-01

Pos logit contributions
 address+0.037
der+0.031
 Tim+0.031
 arrow+0.030
 Acc+0.030
Neg logit contributions
-0.104
St-0.100
irl-0.099
uch-0.099
ink-0.098
Token happy
Token ID3772
Feature activation+2.64e+00

Pos logit contributions
+0.021
uch+0.020
St+0.020
irl+0.020
P+0.019
Neg logit contributions
 address-0.014
 Tim-0.013
der-0.012
ly-0.012
 cries-0.012
Token that
Token ID326
Feature activation+4.13e+00

Pos logit contributions
 address+0.047
 Tim+0.042
 Acc+0.040
 arrow+0.040
person+0.040
Neg logit contributions
-0.094
uch-0.091
irl-0.091
ink-0.091
St-0.090
Token their
Token ID511
Feature activation+7.90e-01

Pos logit contributions
 address+0.051
 Tim+0.049
 arrow+0.039
der+0.039
 cries+0.039
Neg logit contributions
-0.179
uch-0.176
St-0.171
P-0.171
irl-0.170
Token blanket
Token ID18447
Feature activation-3.00e+00

Pos logit contributions
 address+0.017
 Tim+0.016
der+0.015
ly+0.015
 arrow+0.015
Neg logit contributions
-0.026
uch-0.025
St-0.025
irl-0.025
P-0.025
Token was
Token ID373
Feature activation-4.27e+00

Pos logit contributions
+0.100
uch+0.098
St+0.097
irl+0.096
P+0.096
Neg logit contributions
 address-0.064
 Tim-0.060
der-0.058
ly-0.057
 cries-0.056
Token now
Token ID783
Feature activation+5.59e-01

Pos logit contributions
+0.153
uch+0.146
St+0.144
P+0.143
icer+0.141
Neg logit contributions
 Tim-0.088
 inside-0.087
 address-0.086
 for-0.086
 that-0.083
Token resisting
Token ID28943
Feature activation+3.76e+00

Pos logit contributions
 address+0.012
 Tim+0.011
der+0.011
ly+0.011
 arrow+0.011
Neg logit contributions
-0.019
uch-0.018
St-0.018
irl-0.018
P-0.018
Token its
Token ID663
Feature activation+2.30e+00

Pos logit contributions
 address+0.043
 Tim+0.041
ly+0.037
der+0.036
 cries+0.036
Neg logit contributions
-0.164
uch-0.162
irl-0.158
St-0.157
P-0.157
Token wild
Token ID4295
Feature activation+3.84e+00

Pos logit contributions
 address+0.051
 cries+0.047
 arrow+0.046
 Tim+0.046
ly+0.045
Neg logit contributions
-0.078
P-0.076
uch-0.075
St-0.074
ink-0.073
Token give
Token ID1577
Feature activation+7.56e-01

Pos logit contributions
 address+0.016
 Tim+0.015
 arrow+0.014
 fix+0.014
 say+0.014
Neg logit contributions
-0.034
uch-0.033
St-0.033
P-0.033
icer-0.032
Token it
Token ID340
Feature activation-3.29e-01

Pos logit contributions
 address+0.016
 Tim+0.015
ly+0.014
 arrow+0.014
 cries+0.014
Neg logit contributions
-0.026
uch-0.025
irl-0.025
St-0.025
ink-0.024
Token to
Token ID284
Feature activation+1.01e+00

Pos logit contributions
+0.013
uch+0.013
St+0.013
irl+0.013
P+0.013
Neg logit contributions
 address-0.005
 Tim-0.004
der-0.004
ly-0.004
 arrow-0.004
Token you
Token ID345
Feature activation-9.62e-01

Pos logit contributions
 address+0.024
 Tim+0.022
 arrow+0.021
der+0.021
ly+0.021
Neg logit contributions
-0.033
uch-0.032
St-0.031
irl-0.031
P-0.030
Token?"
Token ID1701
Feature activation+3.08e-01

Pos logit contributions
+0.032
uch+0.031
St+0.030
irl+0.030
ink+0.030
Neg logit contributions
 address-0.022
 Tim-0.020
 arrow-0.019
der-0.019
ly-0.019
Token
Token ID198
Feature activation-8.14e+00

Pos logit contributions
der+0.005
 address+0.005
ly+0.005
 apology+0.004
 Acc+0.004
Neg logit contributions
-0.011
St-0.011
ink-0.010
irl-0.010
 total-0.010
Token
Token ID198
Feature activation-7.59e+00

Pos logit contributions
+0.241
uch+0.238
irl+0.232
St+0.230
ink+0.226
Neg logit contributions
 address-0.207
 Tim-0.199
 arrow-0.184
 cries-0.183
ly-0.182
TokenJohn
Token ID7554
Feature activation-1.31e-01

Pos logit contributions
+0.200
uch+0.195
irl+0.192
ink+0.186
St+0.185
Neg logit contributions
 address-0.222
 Tim-0.214
person-0.203
 arrow-0.202
ly-0.201
Token looked
Token ID3114
Feature activation-8.77e-01

Pos logit contributions
+0.005
uch+0.005
irl+0.005
P+0.005
St+0.004
Neg logit contributions
 address-0.003
ly-0.002
 Tim-0.002
 cries-0.002
 arrow-0.002
Token down
Token ID866
Feature activation+2.62e-01

Pos logit contributions
irl+0.030
St+0.029
P+0.028
 total+0.028
ink+0.028
Neg logit contributions
 address-0.017
person-0.015
 Acc-0.015
der-0.014
 writes-0.013
Token to
Token ID284
Feature activation+8.31e-01

Pos logit contributions
 address+0.006
 Acc+0.005
 disl+0.005
person+0.005
 writes+0.005
Neg logit contributions
-0.009
St-0.008
P-0.008
ink-0.008
 total-0.008
Token,"
Token ID553
Feature activation-5.09e-01

Pos logit contributions
+0.110
uch+0.109
irl+0.106
St+0.105
ink+0.105
Neg logit contributions
 address-0.045
 Tim-0.040
der-0.038
 Acc-0.038
 arrow-0.038
Token Ben
Token ID3932
Feature activation+9.81e-01

Pos logit contributions
+0.016
irl+0.016
Once+0.016
St+0.016
P+0.015
Neg logit contributions
person-0.013
ly-0.012
fully-0.011
 address-0.011
der-0.011
Token says
Token ID1139
Feature activation+4.81e-01

Pos logit contributions
person+0.012
 Acc+0.011
fully+0.011
ly+0.010
 address+0.010
Neg logit contributions
St-0.040
irl-0.039
-0.038
P-0.038
ink-0.037
Token.
Token ID13
Feature activation+3.92e+00

Pos logit contributions
 arrow+0.012
 address+0.012
 alien+0.011
 disl+0.011
 automobile+0.011
Neg logit contributions
-0.013
irl-0.012
ink-0.012
oe-0.012
St-0.012
Token
Token ID198
Feature activation-8.05e+00

Pos logit contributions
 address+0.113
ly+0.106
 arrow+0.103
 cries+0.101
der+0.101
Neg logit contributions
-0.106
St-0.100
P-0.097
irl-0.094
uch-0.091
Token
Token ID198
Feature activation-7.60e+00

Pos logit contributions
+0.195
St+0.194
irl+0.190
P+0.187
ink+0.183
Neg logit contributions
 address-0.244
ly-0.228
der-0.224
person-0.223
 arrow-0.223
TokenThey
Token ID2990
Feature activation+2.98e-02

Pos logit contributions
P+0.157
St+0.157
+0.149
irl+0.147
uch+0.143
Neg logit contributions
 address-0.266
 cries-0.247
 Tim-0.247
der-0.243
ly-0.243
Token finish
Token ID5461
Feature activation+1.60e+00

Pos logit contributions
 address+0.001
 Tim+0.001
der+0.001
ly+0.001
 arrow+0.001
Neg logit contributions
-0.001
uch-0.001
St-0.001
irl-0.001
ink-0.001
Token their
Token ID511
Feature activation+2.14e-01

Pos logit contributions
 address+0.023
ly+0.021
 arrow+0.020
der+0.020
person+0.020
Neg logit contributions
-0.065
irl-0.063
St-0.063
uch-0.062
ink-0.061
Token toast
Token ID27805
Feature activation-1.64e+00

Pos logit contributions
 address+0.005
 cries+0.004
 Tim+0.004
 says+0.004
ly+0.004
Neg logit contributions
-0.007
St-0.007
ink-0.007
uch-0.007
irl-0.007
Token and
Token ID290
Feature activation+2.30e-01

Pos logit contributions
+0.063
uch+0.061
irl+0.060
P+0.060
St+0.060
Neg logit contributions
 address-0.027
 Acc-0.024
 Tim-0.024
der-0.024
 cries-0.023
Token insisted
Token ID11189
Feature activation+2.15e+00

Pos logit contributions
 address+0.004
 Tim+0.004
 cries+0.004
ly+0.004
 arrow+0.004
Neg logit contributions
-0.008
uch-0.008
St-0.008
irl-0.008
P-0.008
Token they
Token ID484
Feature activation+7.90e-02

Pos logit contributions
 address+0.038
 alien+0.031
 arrow+0.031
 Acc+0.031
 writes+0.030
Neg logit contributions
-0.084
uch-0.079
irl-0.078
ink-0.078
St-0.078
Token stayed
Token ID9658
Feature activation-3.58e+00

Pos logit contributions
 address+0.002
der+0.002
 Acc+0.002
 writes+0.002
 arrow+0.002
Neg logit contributions
uch-0.003
-0.002
irl-0.002
St-0.002
 total-0.002
Token outside
Token ID2354
Feature activation-1.98e-01

Pos logit contributions
+0.117
irl+0.117
uch+0.114
St+0.114
P+0.113
Neg logit contributions
 address-0.077
der-0.072
 Tim-0.070
ly-0.069
 Acc-0.068
Token.
Token ID13
Feature activation+3.37e+00

Pos logit contributions
+0.007
 total+0.007
irl+0.006
ink+0.006
St+0.006
Neg logit contributions
 address-0.004
 arrow-0.003
person-0.003
 Acc-0.003
 writes-0.003
Token
Token ID198
Feature activation-8.19e+00

Pos logit contributions
 address+0.076
 arrow+0.071
der+0.070
 alien+0.069
ly+0.069
Neg logit contributions
-0.115
St-0.106
P-0.100
Once-0.098
 total-0.098
Token
Token ID198
Feature activation-7.60e+00

Pos logit contributions
uch+0.316
+0.305
ink+0.301
irl+0.298
 total+0.292
Neg logit contributions
 Tim-0.148
 He-0.126
old-0.120
 address-0.117
 that-0.117
TokenBut
Token ID1537
Feature activation+9.06e-01

Pos logit contributions
+0.223
uch+0.221
ink+0.211
irl+0.207
icer+0.199
Neg logit contributions
 Tim-0.199
 address-0.185
 He-0.183
 fix-0.180
 arrow-0.178
Token the
Token ID262
Feature activation+8.55e-01

Pos logit contributions
 address+0.016
 cries+0.015
der+0.015
 Acc+0.015
old+0.014
Neg logit contributions
-0.033
St-0.032
 total-0.031
irl-0.031
P-0.030
Token mom
Token ID1995
Feature activation+1.96e+00

Pos logit contributions
 address+0.018
 Tim+0.017
 Acc+0.017
der+0.017
 writes+0.017
Neg logit contributions
-0.028
uch-0.027
irl-0.026
St-0.026
P-0.026
Token was
Token ID373
Feature activation-4.32e+00

Pos logit contributions
 address+0.040
 Acc+0.036
der+0.036
 Tim+0.035
 cries+0.034
Neg logit contributions
-0.066
uch-0.066
St-0.064
ink-0.064
irl-0.063
Token coming
Token ID2406
Feature activation-1.09e+00

Pos logit contributions
+0.139
uch+0.134
P+0.133
irl+0.131
St+0.130
Neg logit contributions
 address-0.085
 Tim-0.082
 inside-0.078
 earlier-0.075
 that-0.075
Token towards
Token ID3371
Feature activation+3.15e+00

Pos logit contributions
+0.037
 total+0.037
St+0.037
uch+0.037
ink+0.036
Neg logit contributions
 address-0.022
 Acc-0.019
 writes-0.019
der-0.019
 fixes-0.017
Token them
Token ID606
Feature activation-1.17e+00

Pos logit contributions
 address+0.062
 Tim+0.060
ly+0.054
der+0.054
 cries+0.054
Neg logit contributions
-0.112
uch-0.111
St-0.107
irl-0.107
P-0.106
Token.
Token ID13
Feature activation+3.54e+00

Pos logit contributions
+0.046
uch+0.045
St+0.044
irl+0.044
ink+0.044
Neg logit contributions
 address-0.018
 Tim-0.016
der-0.015
ly-0.015
 arrow-0.015
Token
Token ID198
Feature activation-8.47e+00

Pos logit contributions
 Tim+0.065
 address+0.062
 cries+0.054
der+0.054
 He+0.054
Neg logit contributions
uch-0.133
irl-0.131
-0.127
P-0.127
St-0.126
Token
Token ID198
Feature activation-7.93e+00

Pos logit contributions
uch+0.310
+0.297
irl+0.290
ink+0.286
icer+0.283
Neg logit contributions
 Tim-0.168
 address-0.151
 He-0.142
 cries-0.139
old-0.137
TokenThe
Token ID464
Feature activation+6.21e-01

Pos logit contributions
uch+0.248
+0.241
icer+0.233
irl+0.231
ink+0.231
Neg logit contributions
 Tim-0.190
 fix-0.170
 address-0.167
 Let-0.167
 He-0.167
Token bear
Token ID6842
Feature activation-2.00e+00

Pos logit contributions
 address+0.009
 Tim+0.009
 arrow+0.009
 cries+0.008
 alien+0.008
Neg logit contributions
-0.025
uch-0.025
ink-0.024
irl-0.024
P-0.024
Token and
Token ID290
Feature activation+5.30e-01

Pos logit contributions
+0.080
irl+0.078
uch+0.078
St+0.077
P+0.077
Neg logit contributions
 address-0.030
 Tim-0.029
 arrow-0.027
 cries-0.027
der-0.026
Token mouse
Token ID10211
Feature activation-6.79e-01

Pos logit contributions
 address+0.008
der+0.007
ly+0.007
 Acc+0.007
person+0.006
Neg logit contributions
-0.021
St-0.020
irl-0.020
uch-0.020
P-0.020
Token were
Token ID547
Feature activation-5.92e+00

Pos logit contributions
+0.020
uch+0.019
St+0.019
irl+0.019
ink+0.018
Neg logit contributions
 address-0.017
 Tim-0.016
der-0.016
ly-0.016
 Acc-0.015
Token made
Token ID925
Feature activation+1.34e+00

Pos logit contributions
 address+0.009
 Tim+0.009
 arrow+0.008
 cries+0.008
ly+0.008
Neg logit contributions
-0.017
uch-0.016
irl-0.016
St-0.016
P-0.016
Token him
Token ID683
Feature activation-6.03e-01

Pos logit contributions
 Tim+0.006
 him+0.004
 address+0.002
 son+0.001
 Max+0.001
Neg logit contributions
-0.073
uch-0.071
P-0.069
irl-0.068
St-0.068
Token happy
Token ID3772
Feature activation+2.96e+00

Pos logit contributions
+0.020
uch+0.020
St+0.019
P+0.019
irl+0.019
Neg logit contributions
 address-0.013
 Tim-0.012
der-0.011
 cries-0.011
 arrow-0.011
Token.
Token ID13
Feature activation+3.56e+00

Pos logit contributions
 address+0.048
 Acc+0.041
person+0.040
 arrow+0.040
 Tim+0.040
Neg logit contributions
-0.111
irl-0.106
ink-0.106
uch-0.105
St-0.104
Token<|endoftext|>
Token ID50256
Feature activation-5.32e+00

Pos logit contributions
 address+0.074
 Tim+0.073
der+0.066
 cries+0.065
 Acc+0.065
Neg logit contributions
uch-0.121
irl-0.119
-0.119
St-0.116
P-0.115
TokenOnce
Token ID7454
Feature activation-6.88e+00

Pos logit contributions
uch+0.159
irl+0.156
St+0.146
+0.142
if+0.142
Neg logit contributions
 Tim-0.141
 He-0.137
 inside-0.128
 him-0.125
 The-0.124
Token upon
Token ID2402
Feature activation-4.65e+00

Pos logit contributions
+0.090
uch+0.084
St+0.080
irl+0.079
P+0.076
Neg logit contributions
 address-0.287
 Tim-0.277
der-0.272
ly-0.270
 arrow-0.269
Token a
Token ID257
Feature activation-6.53e-01

Pos logit contributions
+0.181
uch+0.176
ink+0.173
irl+0.170
St+0.170
Neg logit contributions
 address-0.078
 Tim-0.076
 arrow-0.067
 cries-0.065
ly-0.065
Token time
Token ID640
Feature activation-3.32e+00

Pos logit contributions
ink+0.015
uch+0.014
+0.014
if+0.014
unks+0.013
Neg logit contributions
 Tim-0.021
 about-0.020
 for-0.020
 Max-0.018
 that-0.018
Token there
Token ID612
Feature activation-1.01e+00

Pos logit contributions
St+0.152
uch+0.149
+0.147
irl+0.147
P+0.146
Neg logit contributions
 Tim-0.044
 that-0.028
 for-0.028
 Max-0.026
 inside-0.026
Token was
Token ID373
Feature activation-4.36e+00

Pos logit contributions
+0.036
uch+0.035
St+0.035
P+0.035
irl+0.034
Neg logit contributions
 Tim-0.020
 address-0.019
 cries-0.017
 arrow-0.017
 Acc-0.016