TOP ACTIVATIONS
MAX = 0.298

so he stepped forward courage ously . When he
cat grinned mis chie v ously . " Never try to
boy smiled mis chie v ously and said , " Well
question , and she courage ously answered it .
's wonderful , my d arl ings !" Mom my says
you go , my d arl ings . Enjoy your lunch
let out a big sn arl and the mouse knew he
you okay , my d arl ings ?" she asked .
disappointment . M arl on 's mum hugged him
" Hello , my d arl ings . I 'm so
" Hello , my d arl ings . I 'm so
perfect fish , my d arl ings ," mom says .
to grow l and sn arl to try and scare her
you go , my d arl ings ," Mom said .
What happened , my d arl ings ? Why are you
's good , my d arl ings . I 'm glad
" Hello , my d arl ings . I 'm so
have fun , my d arl ings ?" she asks .
you too , my d arl ings . But you have
Not now , my d arl ings , it is too

MIN ACTIVATIONS
MIN = -0.392

" dragon ", and " tre asure ". They think the
Rich the S aus age Man ! <|endoftext|> Once
proudly said , " s aus ages !" Her dad smiled
man replied , " S aus ages ! Lots and lots
along to the hy m ns , said prayers and listened
a tub of strawberry - ban ana yogurt . She tore
new words , like " ban ana " and " flower
trying to keep the he ns in line . Billy watched
little shelter for the he ns . Billy couldn 't believe
was playing with some cor ns he found in his garden
- a beautiful cheese o me lette . John was so
the tree if they were av oc ados , and they
The o me lette tasted even better than
. They ate the o me lette together and it was
. Mom was making o me lets for the family .
It sounded like a " me ow ". The boy stopped
" I will buy more av oc ados tomorrow . You
so happy and brought the av oc ados home to share
always survive . <|endoftext|> Es me was hungry . She wanted
the hot oven . Es me was so disappointed . She

INTERVAL 0.125 - 0.298
CONTAINS 2.511%

oven . He saw the towel inside the oven . He
find all sorts of different ways to make your dreams come
time , there was a little boy named Tim my .
his things . He learned that some things belonged to others
dog ch ases Tom and bites his leg . Tom screams

INTERVAL -0.047 - 0.125
CONTAINS 82.297%

's really good !" M olly and Jack kept
. They put their bags in the car and
will share and be nice ." Mom smiled and gave them
had gone out . J ill thought : "
. The ducks get scared . They fly away .

INTERVAL -0.220 - -0.047
CONTAINS 15.113%

inside and play ?" The cat d arted inside , and
a time there was a boy . The boy 's name
When the party day finally came , everyone arrived with gifts
the hospital . L ily was okay , but
, but it 's good for you . How about this

INTERVAL -0.392 - -0.220
CONTAINS 0.079%

Wh isk ers . Wh isk ers liked to sit by
panic . She loved that v ase very much and now
. Can I play dress - up with something else ?"
't be a scared y - cat . It will be
his Mom my say that cel ery was an incredible vegetable
Token so
Token ID523
Feature activation+5.29e-03

Pos logit contributions
 ado+0.021
ously+0.021
 overboard+0.020
arl+0.020
 fault+0.018
Neg logit contributions
aus-0.029
tre-0.028
ns-0.027
roc-0.026
ban-0.026
Token he
Token ID339
Feature activation+2.90e-02

Pos logit contributions
ously+0.004
 ado+0.004
 overboard+0.004
arl+0.004
 unconscious+0.003
Neg logit contributions
aus-0.009
tre-0.009
roc-0.008
me-0.008
ns-0.008
Token stepped
Token ID10764
Feature activation-1.39e-02

Pos logit contributions
ously+0.026
 overboard+0.024
 ado+0.023
 off+0.022
ching+0.022
Neg logit contributions
tre-0.047
aus-0.046
roc-0.044
ocol-0.044
ban-0.043
Token forward
Token ID2651
Feature activation-6.48e-02

Pos logit contributions
aus+0.024
tre+0.024
roc+0.023
ns+0.023
 spe+0.022
Neg logit contributions
ously-0.011
 overboard-0.011
 ado-0.011
arl-0.011
ching-0.009
Token courage
Token ID11917
Feature activation+1.92e-01

Pos logit contributions
aus+0.081
tre+0.081
ns+0.076
ban+0.074
me+0.072
Neg logit contributions
 ado-0.079
arl-0.077
ously-0.077
 overboard-0.076
 fault-0.070
Tokenously
Token ID3481
Feature activation+2.98e-01

Pos logit contributions
arl+0.269
 ado+0.264
 overboard+0.249
ously+0.246
 humor+0.244
Neg logit contributions
tre-0.205
aus-0.200
ns-0.195
ban-0.186
oc-0.184
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
arl+0.417
 punk+0.397
 overboard+0.393
 ado+0.393
 humor+0.392
Neg logit contributions
tre-0.308
aus-0.295
ns-0.290
oc-0.281
ban-0.271
Token
Token ID198
Feature activation+2.35e-02

Pos logit contributions
 ado+0.013
ously+0.013
 overboard+0.012
arl+0.012
osity+0.011
Neg logit contributions
aus-0.022
tre-0.021
ns-0.021
 spe-0.020
roc-0.020
Token
Token ID198
Feature activation+2.37e-02

Pos logit contributions
ously+0.022
 ado+0.022
arl+0.022
 overboard+0.021
ching+0.020
Neg logit contributions
aus-0.036
tre-0.035
ns-0.034
roc-0.033
 spe-0.033
TokenWhen
Token ID2215
Feature activation+3.80e-02

Pos logit contributions
 ado+0.019
ously+0.019
arl+0.019
 overboard+0.019
ching+0.017
Neg logit contributions
aus-0.040
tre-0.039
ns-0.037
 spe-0.037
roc-0.036
Token he
Token ID339
Feature activation+2.95e-02

Pos logit contributions
 off+0.021
ously+0.020
 way+0.020
 ado+0.019
 overboard+0.019
Neg logit contributions
aus-0.074
roc-0.072
tre-0.071
ns-0.071
me-0.069
Token cat
Token ID3797
Feature activation-8.52e-02

Pos logit contributions
tre+0.038
aus+0.037
ban+0.034
roc+0.033
me+0.033
Neg logit contributions
arl-0.025
 ado-0.025
ching-0.024
 appearances-0.024
 overboard-0.024
Token grinned
Token ID36268
Feature activation-2.73e-02

Pos logit contributions
aus+0.129
tre+0.129
ban+0.118
roc+0.117
library+0.117
Neg logit contributions
ously-0.088
 ado-0.081
 overboard-0.079
ching-0.075
 unconscious-0.073
Token mis
Token ID2984
Feature activation-9.12e-02

Pos logit contributions
aus+0.031
tre+0.030
ns+0.028
roc+0.028
oc+0.028
Neg logit contributions
 ado-0.037
arl-0.036
ously-0.035
 overboard-0.035
 humor-0.033
Tokenchie
Token ID3043
Feature activation-1.18e-01

Pos logit contributions
aus+0.137
tre+0.130
 mac+0.130
 ruled+0.129
 reception+0.126
Neg logit contributions
ously-0.099
ching-0.092
 overboard-0.091
 ado-0.089
arl-0.086
Tokenv
Token ID85
Feature activation-8.24e-02

Pos logit contributions
aus+0.167
tre+0.165
ns+0.155
ban+0.153
 spe+0.151
Neg logit contributions
ously-0.126
 ado-0.123
 overboard-0.118
arl-0.116
ching-0.112
Tokenously
Token ID3481
Feature activation+2.94e-01

Pos logit contributions
aus+0.114
oc+0.110
tre+0.110
roc+0.109
me+0.106
Neg logit contributions
 ado-0.090
arl-0.088
 overboard-0.087
 punk-0.082
osity-0.082
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
arl+0.424
 overboard+0.407
 ado+0.405
 punk+0.391
osity+0.389
Neg logit contributions
aus-0.291
oc-0.290
tre-0.288
ns-0.280
ban-0.260
Token "
Token ID366
Feature activation-2.49e-02

Pos logit contributions
 ado+0.017
ously+0.016
 overboard+0.016
arl+0.016
 humor+0.015
Neg logit contributions
aus-0.013
tre-0.013
ns-0.012
oc-0.012
roc-0.011
TokenNever
Token ID12295
Feature activation+2.39e-02

Pos logit contributions
tre+0.035
aus+0.034
ns+0.033
me+0.032
oc+0.032
Neg logit contributions
 ado-0.028
arl-0.027
ously-0.027
 overboard-0.027
 fault-0.025
Token try
Token ID1949
Feature activation+1.38e-01

Pos logit contributions
ously+0.030
arl+0.030
 ado+0.029
 overboard+0.027
 unconscious+0.026
Neg logit contributions
aus-0.028
tre-0.028
ns-0.028
ban-0.027
roc-0.027
Token to
Token ID284
Feature activation+1.58e-03

Pos logit contributions
arl+0.235
ously+0.226
 ado+0.208
osity+0.198
idget+0.197
Neg logit contributions
tre-0.118
ns-0.113
 av-0.113
ban-0.107
 spe-0.104
Token boy
Token ID2933
Feature activation-6.10e-02

Pos logit contributions
 ado+0.112
ously+0.111
 overboard+0.105
arl+0.105
ching+0.095
Neg logit contributions
aus-0.190
tre-0.188
ns-0.179
ban-0.175
roc-0.173
Token smiled
Token ID13541
Feature activation-1.14e-02

Pos logit contributions
aus+0.094
tre+0.094
ns+0.088
roc+0.087
ban+0.086
Neg logit contributions
ously-0.059
 ado-0.056
 overboard-0.052
arl-0.048
ching-0.048
Token mis
Token ID2984
Feature activation-9.72e-02

Pos logit contributions
aus+0.014
tre+0.014
ns+0.013
ban+0.013
roc+0.013
Neg logit contributions
 ado-0.014
ously-0.014
arl-0.014
 overboard-0.013
 fault-0.012
Tokenchie
Token ID3043
Feature activation-1.21e-01

Pos logit contributions
aus+0.144
tre+0.135
 reception+0.135
 ruled+0.133
ns+0.132
Neg logit contributions
ously-0.106
 overboard-0.100
ching-0.096
 ado-0.096
arl-0.094
Tokenv
Token ID85
Feature activation-8.33e-02

Pos logit contributions
aus+0.167
tre+0.166
ns+0.156
ban+0.152
roc+0.151
Neg logit contributions
ously-0.133
 ado-0.132
arl-0.127
 overboard-0.126
ching-0.116
Tokenously
Token ID3481
Feature activation+2.94e-01

Pos logit contributions
aus+0.124
tre+0.120
me+0.117
roc+0.117
oc+0.115
Neg logit contributions
arl-0.080
 ado-0.079
 overboard-0.079
 punk-0.078
 risking-0.077
Token and
Token ID290
Feature activation-1.32e-02

Pos logit contributions
arl+0.423
 overboard+0.402
 ado+0.401
 fault+0.389
 appearances+0.382
Neg logit contributions
aus-0.309
tre-0.304
oc-0.292
ns-0.284
me-0.276
Token said
Token ID531
Feature activation-1.83e-02

Pos logit contributions
aus+0.017
tre+0.017
ns+0.016
me+0.015
ban+0.015
Neg logit contributions
arl-0.016
 ado-0.016
 overboard-0.015
ously-0.015
 fault-0.015
Token,
Token ID11
Feature activation+1.41e-02

Pos logit contributions
aus+0.027
tre+0.026
ns+0.025
roc+0.024
me+0.024
Neg logit contributions
ously-0.019
 ado-0.019
arl-0.018
 overboard-0.018
ching-0.016
Token "
Token ID366
Feature activation-2.98e-02

Pos logit contributions
ously+0.016
 ado+0.015
arl+0.015
 overboard+0.015
 shattering+0.014
Neg logit contributions
aus-0.020
tre-0.019
ns-0.019
roc-0.019
ban-0.018
TokenWell
Token ID5779
Feature activation+7.17e-02

Pos logit contributions
aus+0.046
tre+0.045
ns+0.043
roc+0.042
ban+0.042
Neg logit contributions
 ado-0.028
ously-0.028
arl-0.028
 overboard-0.027
 fault-0.024
Token question
Token ID1808
Feature activation-1.81e-03

Pos logit contributions
 ado+0.004
ously+0.004
arl+0.003
 overboard+0.003
 fault+0.003
Neg logit contributions
aus-0.005
tre-0.005
ns-0.004
roc-0.004
ban-0.004
Token,
Token ID11
Feature activation+8.38e-03

Pos logit contributions
aus+0.003
tre+0.003
ns+0.003
roc+0.003
 lin+0.003
Neg logit contributions
ously-0.002
 ado-0.002
arl-0.002
 overboard-0.002
 fault-0.001
Token and
Token ID290
Feature activation-1.66e-02

Pos logit contributions
ously+0.009
 ado+0.009
arl+0.008
 overboard+0.008
ching+0.008
Neg logit contributions
aus-0.012
tre-0.012
roc-0.012
ns-0.011
ban-0.011
Token she
Token ID673
Feature activation+1.01e-02

Pos logit contributions
aus+0.024
tre+0.024
ns+0.022
roc+0.022
ban+0.022
Neg logit contributions
ously-0.017
 ado-0.017
 overboard-0.016
arl-0.016
 shattering-0.015
Token courage
Token ID11917
Feature activation+1.92e-01

Pos logit contributions
ously+0.008
 joins+0.007
 ado+0.006
 shrug+0.006
 kicks+0.006
Neg logit contributions
tre-0.017
aus-0.017
roc-0.017
ban-0.016
ocol-0.016
Tokenously
Token ID3481
Feature activation+2.93e-01

Pos logit contributions
 ado+0.229
arl+0.228
 overboard+0.219
ously+0.218
 fault+0.203
Neg logit contributions
aus-0.248
tre-0.244
ns-0.231
ban-0.224
me-0.221
Token answered
Token ID9373
Feature activation+3.28e-02

Pos logit contributions
arl+0.364
 ado+0.348
 overboard+0.341
ously+0.328
 fault+0.324
Neg logit contributions
aus-0.376
tre-0.372
ns-0.356
oc-0.339
ban-0.338
Token it
Token ID340
Feature activation-2.92e-02

Pos logit contributions
ously+0.033
 ado+0.031
 overboard+0.030
arl+0.029
ching+0.027
Neg logit contributions
aus-0.051
tre-0.050
ns-0.048
roc-0.047
me-0.046
Token.
Token ID13
Feature activation-3.42e-03

Pos logit contributions
aus+0.044
tre+0.043
ns+0.041
roc+0.040
ban+0.039
Neg logit contributions
ously-0.030
 ado-0.029
 overboard-0.027
arl-0.027
ching-0.026
Token
Token ID220
Feature activation-1.57e-02

Pos logit contributions
aus+0.005
tre+0.005
ns+0.005
roc+0.005
ban+0.005
Neg logit contributions
ously-0.004
 ado-0.003
arl-0.003
 overboard-0.003
osity-0.003
Token
Token ID198
Feature activation+1.81e-02

Pos logit contributions
aus+0.021
tre+0.021
ns+0.019
me+0.019
ban+0.019
Neg logit contributions
 ado-0.018
ously-0.018
arl-0.018
 overboard-0.017
 fault-0.016
Token's
Token ID338
Feature activation-3.36e-02

Pos logit contributions
 ado+0.219
ously+0.217
arl+0.217
 overboard+0.207
osity+0.191
Neg logit contributions
aus-0.235
tre-0.230
ns-0.216
me-0.213
ban-0.210
Token wonderful
Token ID7932
Feature activation+4.42e-02

Pos logit contributions
aus+0.040
tre+0.036
me+0.033
ban+0.033
roc+0.033
Neg logit contributions
ously-0.049
 ado-0.047
 overboard-0.044
osity-0.043
arl-0.043
Token,
Token ID11
Feature activation+3.10e-02

Pos logit contributions
 ado+0.056
arl+0.056
ously+0.056
 overboard+0.055
ching+0.051
Neg logit contributions
aus-0.053
tre-0.053
ns-0.048
roc-0.048
ban-0.048
Token my
Token ID616
Feature activation+1.36e-01

Pos logit contributions
 ado+0.028
ously+0.028
arl+0.028
 overboard+0.026
 humor+0.025
Neg logit contributions
tre-0.048
aus-0.047
ns-0.046
ban-0.044
roc-0.044
Token d
Token ID288
Feature activation+1.10e-02

Pos logit contributions
 ado+0.149
ously+0.147
arl+0.142
 overboard+0.141
ching+0.126
Neg logit contributions
aus-0.191
tre-0.188
ns-0.179
ban-0.175
roc-0.174
Tokenarl
Token ID7063
Feature activation+2.91e-01

Pos logit contributions
 ado+0.007
ously+0.007
 overboard+0.007
arl+0.006
 punk+0.005
Neg logit contributions
aus-0.021
tre-0.020
ns-0.019
ban-0.019
roc-0.019
Tokenings
Token ID654
Feature activation-8.30e-02

Pos logit contributions
 overboard+0.447
 ado+0.445
ously+0.439
 unconscious+0.433
 punk+0.432
Neg logit contributions
aus-0.262
tre-0.248
me-0.234
roc-0.231
oc-0.228
Token!"
Token ID2474
Feature activation+3.56e-02

Pos logit contributions
aus+0.110
tre+0.108
ns+0.102
ban+0.099
me+0.099
Neg logit contributions
 ado-0.097
ously-0.097
 overboard-0.093
arl-0.093
 fault-0.087
Token Mom
Token ID11254
Feature activation-1.09e-02

Pos logit contributions
 ado+0.035
ously+0.035
arl+0.033
 overboard+0.032
 humor+0.030
Neg logit contributions
aus-0.054
tre-0.053
ns-0.051
roc-0.050
ban-0.049
Tokenmy
Token ID1820
Feature activation+4.19e-02

Pos logit contributions
aus+0.015
tre+0.015
ns+0.014
me+0.014
ban+0.013
Neg logit contributions
 ado-0.012
arl-0.012
ously-0.012
 overboard-0.012
 fault-0.011
Token says
Token ID1139
Feature activation+2.39e-02

Pos logit contributions
 ado+0.049
ously+0.048
arl+0.048
 overboard+0.047
 fault+0.043
Neg logit contributions
aus-0.056
tre-0.055
ns-0.052
me-0.050
ban-0.050
Token you
Token ID345
Feature activation-5.44e-02

Pos logit contributions
 ado+0.011
ously+0.011
 overboard+0.011
arl+0.011
 fault+0.010
Neg logit contributions
aus-0.013
tre-0.013
ns-0.012
roc-0.012
ban-0.012
Token go
Token ID467
Feature activation-4.43e-02

Pos logit contributions
aus+0.069
tre+0.068
ns+0.065
ban+0.063
me+0.063
Neg logit contributions
 ado-0.066
ously-0.065
arl-0.065
 overboard-0.063
 shattering-0.059
Token,
Token ID11
Feature activation+1.25e-02

Pos logit contributions
aus+0.059
tre+0.059
ns+0.056
me+0.053
ban+0.053
Neg logit contributions
ously-0.052
 ado-0.050
arl-0.050
 overboard-0.048
osity-0.045
Token my
Token ID616
Feature activation+1.27e-01

Pos logit contributions
 ado+0.012
arl+0.011
ously+0.011
 overboard+0.011
 way+0.010
Neg logit contributions
tre-0.019
aus-0.019
roc-0.018
ns-0.018
ban-0.018
Token d
Token ID288
Feature activation+4.97e-03

Pos logit contributions
 ado+0.129
ously+0.128
arl+0.128
 overboard+0.124
 fault+0.116
Neg logit contributions
aus-0.185
tre-0.183
ns-0.173
ban-0.167
roc-0.167
Tokenarl
Token ID7063
Feature activation+2.83e-01

Pos logit contributions
 ado+0.002
ously+0.002
arl+0.002
 overboard+0.002
 fault+0.001
Neg logit contributions
aus-0.010
tre-0.010
ns-0.010
roc-0.010
ban-0.010
Tokenings
Token ID654
Feature activation-8.85e-02

Pos logit contributions
 ado+0.395
 overboard+0.392
ously+0.391
 punk+0.381
 unconscious+0.381
Neg logit contributions
aus-0.291
tre-0.274
oc-0.261
me-0.258
roc-0.257
Token.
Token ID13
Feature activation-1.11e-03

Pos logit contributions
aus+0.120
tre+0.118
ns+0.112
me+0.108
ban+0.108
Neg logit contributions
 ado-0.100
ously-0.099
arl-0.097
 overboard-0.096
 fault-0.088
Token Enjoy
Token ID18179
Feature activation-3.80e-03

Pos logit contributions
aus+0.002
tre+0.002
ns+0.002
roc+0.002
ban+0.001
Neg logit contributions
 ado-0.001
arl-0.001
ously-0.001
 overboard-0.001
 humor-0.001
Token your
Token ID534
Feature activation+9.88e-02

Pos logit contributions
aus+0.006
tre+0.006
ns+0.006
roc+0.006
ban+0.006
Neg logit contributions
 ado-0.003
arl-0.003
ously-0.003
 overboard-0.003
 humor-0.003
Token lunch
Token ID9965
Feature activation-4.77e-02

Pos logit contributions
 ado+0.110
arl+0.109
 overboard+0.107
ously+0.105
 humor+0.101
Neg logit contributions
aus-0.135
tre-0.133
ns-0.124
roc-0.122
ban-0.121
Token let
Token ID1309
Feature activation+1.17e-01

Pos logit contributions
aus+0.010
tre+0.010
ns+0.009
roc+0.009
ban+0.009
Neg logit contributions
ously-0.007
 ado-0.007
 overboard-0.007
ching-0.006
arl-0.006
Token out
Token ID503
Feature activation+7.61e-02

Pos logit contributions
 off+0.098
ching+0.092
 way+0.090
 over+0.079
 overboard+0.074
Neg logit contributions
ban-0.217
 spe-0.204
roc-0.201
library-0.201
bel-0.196
Token a
Token ID257
Feature activation+8.21e-03

Pos logit contributions
 ado+0.093
arl+0.092
ously+0.091
 overboard+0.089
 fault+0.083
Neg logit contributions
aus-0.097
tre-0.096
ns-0.090
roc-0.087
ban-0.086
Token big
Token ID1263
Feature activation+1.89e-02

Pos logit contributions
 ado+0.008
ously+0.008
arl+0.008
 overboard+0.008
ching+0.007
Neg logit contributions
aus-0.012
tre-0.012
ns-0.012
roc-0.011
ban-0.011
Token sn
Token ID3013
Feature activation-1.24e-01

Pos logit contributions
 ado+0.018
ously+0.018
arl+0.017
 overboard+0.017
ching+0.015
Neg logit contributions
tre-0.030
aus-0.029
ns-0.028
me-0.027
ban-0.027
Tokenarl
Token ID7063
Feature activation+2.80e-01

Pos logit contributions
aus+0.172
tre+0.169
ns+0.159
roc+0.155
ban+0.154
Neg logit contributions
 ado-0.140
ously-0.139
arl-0.135
 overboard-0.134
 fault-0.122
Token and
Token ID290
Feature activation-9.52e-03

Pos logit contributions
 ado+0.348
ously+0.341
 overboard+0.337
arl+0.335
 fault+0.309
Neg logit contributions
aus-0.349
tre-0.342
ns-0.320
ban-0.312
me-0.311
Token the
Token ID262
Feature activation-3.28e-02

Pos logit contributions
aus+0.014
tre+0.014
roc+0.013
ban+0.013
ns+0.012
Neg logit contributions
 ado-0.010
ously-0.010
 overboard-0.010
 joins-0.009
arl-0.009
Token mouse
Token ID10211
Feature activation-6.59e-02

Pos logit contributions
aus+0.036
ns+0.036
oc+0.035
 whis+0.035
tre+0.034
Neg logit contributions
arl-0.048
ously-0.047
 ado-0.046
 overboard-0.045
osity-0.044
Token knew
Token ID2993
Feature activation+6.13e-02

Pos logit contributions
tre+0.075
ns+0.073
aus+0.071
 spe+0.070
oc+0.068
Neg logit contributions
arl-0.099
 ado-0.092
 fault-0.084
 overboard-0.082
osity-0.081
Token he
Token ID339
Feature activation+2.73e-02

Pos logit contributions
 ado+0.054
arl+0.053
ously+0.053
 overboard+0.051
 fault+0.045
Neg logit contributions
aus-0.100
tre-0.098
ns-0.094
ban-0.091
roc-0.091
Token you
Token ID345
Feature activation-7.10e-02

Pos logit contributions
 ado+0.088
arl+0.087
ously+0.086
 overboard+0.085
 appearances+0.077
Neg logit contributions
aus-0.096
tre-0.095
ns-0.091
oc-0.088
ban-0.087
Token okay
Token ID8788
Feature activation-5.35e-02

Pos logit contributions
aus+0.072
tre+0.066
me+0.062
ns+0.062
ban+0.055
Neg logit contributions
ously-0.112
 ado-0.109
arl-0.106
osity-0.103
 appearances-0.103
Token,
Token ID11
Feature activation+5.40e-03

Pos logit contributions
aus+0.068
tre+0.066
me+0.061
roc+0.061
ns+0.061
Neg logit contributions
ously-0.066
 ado-0.066
arl-0.063
 overboard-0.063
 fault-0.058
Token my
Token ID616
Feature activation+1.15e-01

Pos logit contributions
 ado+0.006
ously+0.006
arl+0.006
 overboard+0.006
 fault+0.005
Neg logit contributions
aus-0.008
tre-0.007
ns-0.007
roc-0.007
me-0.007
Token d
Token ID288
Feature activation-3.73e-03

Pos logit contributions
 ado+0.127
ously+0.120
 overboard+0.117
osity+0.108
arl+0.107
Neg logit contributions
aus-0.168
tre-0.159
me-0.153
oc-0.152
ns-0.151
Tokenarl
Token ID7063
Feature activation+2.78e-01

Pos logit contributions
aus+0.008
tre+0.008
roc+0.008
ns+0.008
me+0.008
Neg logit contributions
 ado-0.001
ously-0.001
 overboard-0.001
 way-0.001
 joins-0.001
Tokenings
Token ID654
Feature activation-8.46e-02

Pos logit contributions
 overboard+0.460
 ado+0.456
ously+0.453
 unconscious+0.444
 punk+0.437
Neg logit contributions
aus-0.213
me-0.198
tre-0.197
roc-0.191
oc-0.187
Token?"
Token ID1701
Feature activation+4.08e-03

Pos logit contributions
tre+0.127
aus+0.125
ns+0.120
roc+0.117
ban+0.117
Neg logit contributions
arl-0.086
 ado-0.084
ously-0.082
 overboard-0.079
ching-0.073
Token she
Token ID673
Feature activation+7.28e-03

Pos logit contributions
 ado+0.004
ously+0.004
arl+0.004
 overboard+0.004
 humor+0.004
Neg logit contributions
aus-0.006
tre-0.006
ns-0.006
roc-0.006
ban-0.005
Token asked
Token ID1965
Feature activation+1.32e-02

Pos logit contributions
 ado+0.006
ously+0.006
 overboard+0.005
arl+0.005
ching+0.005
Neg logit contributions
tre-0.012
aus-0.012
roc-0.012
ns-0.011
oc-0.011
Token.
Token ID13
Feature activation-4.04e-03

Pos logit contributions
ously+0.011
 ado+0.010
arl+0.009
 overboard+0.009
 unconscious+0.009
Neg logit contributions
aus-0.024
tre-0.023
oc-0.023
roc-0.023
ns-0.023
Token disappointment
Token ID18641
Feature activation+5.73e-02

Pos logit contributions
tre+0.034
 spe+0.033
aus+0.031
oc+0.031
ban+0.030
Neg logit contributions
arl-0.030
 ado-0.029
 overboard-0.029
osity-0.029
ously-0.029
Token.
Token ID13
Feature activation+5.19e-03

Pos logit contributions
 ado+0.071
arl+0.070
ously+0.068
 overboard+0.068
 fault+0.063
Neg logit contributions
aus-0.071
tre-0.070
ns-0.066
ban-0.064
me-0.064
Token
Token ID198
Feature activation+1.71e-02

Pos logit contributions
 ado+0.007
arl+0.006
ously+0.006
 overboard+0.006
ching+0.006
Neg logit contributions
aus-0.006
tre-0.006
ns-0.006
oc-0.006
me-0.006
Token
Token ID198
Feature activation+1.76e-02

Pos logit contributions
 ado+0.021
ously+0.021
arl+0.021
 overboard+0.021
 fault+0.019
Neg logit contributions
aus-0.021
tre-0.021
ns-0.020
me-0.019
ban-0.019
TokenM
Token ID44
Feature activation-7.05e-02

Pos logit contributions
 ado+0.021
ously+0.021
arl+0.021
 overboard+0.021
 fault+0.019
Neg logit contributions
aus-0.023
tre-0.022
ns-0.021
me-0.020
ban-0.020
Tokenarl
Token ID7063
Feature activation+2.78e-01

Pos logit contributions
tre+0.088
ns+0.084
aus+0.082
 spe+0.079
roc+0.077
Neg logit contributions
ously-0.093
arl-0.092
 ado-0.088
 humor-0.081
 overboard-0.081
Tokenon
Token ID261
Feature activation-8.81e-02

Pos logit contributions
 ado+0.291
arl+0.283
ously+0.282
 overboard+0.281
 fault+0.255
Neg logit contributions
aus-0.398
tre-0.393
ns-0.371
me-0.367
ban-0.362
Token's
Token ID338
Feature activation-5.49e-02

Pos logit contributions
aus+0.119
tre+0.116
ns+0.111
me+0.108
ban+0.107
Neg logit contributions
 ado-0.102
arl-0.100
ously-0.099
 overboard-0.096
 fault-0.089
Token mum
Token ID25682
Feature activation+3.02e-02

Pos logit contributions
aus+0.065
ns+0.063
 spe+0.062
 reception+0.061
tre+0.060
Neg logit contributions
 ado-0.072
arl-0.071
 overboard-0.067
ously-0.065
osity-0.061
Token hugged
Token ID41130
Feature activation+1.01e-01

Pos logit contributions
ously+0.031
 ado+0.029
 overboard+0.027
arl+0.026
ching+0.026
Neg logit contributions
aus-0.045
tre-0.045
roc-0.043
ns-0.041
library-0.041
Token him
Token ID683
Feature activation-1.12e-01

Pos logit contributions
ously+0.119
 ado+0.111
arl+0.107
 overboard+0.104
 off+0.097
Neg logit contributions
aus-0.143
tre-0.139
ns-0.137
library-0.130
ban-0.130
Token"
Token ID1
Feature activation-6.49e-02

Pos logit contributions
 ado+0.022
arl+0.022
ously+0.022
 overboard+0.021
ching+0.019
Neg logit contributions
aus-0.023
tre-0.022
ns-0.021
roc-0.020
ban-0.020
TokenHello
Token ID15496
Feature activation+3.02e-02

Pos logit contributions
aus+0.099
tre+0.098
ns+0.094
roc+0.093
oc+0.091
Neg logit contributions
 ado-0.062
ously-0.062
arl-0.061
 overboard-0.058
 punk-0.053
Token,
Token ID11
Feature activation+9.06e-03

Pos logit contributions
 ado+0.025
arl+0.024
ously+0.023
 overboard+0.022
 punk+0.022
Neg logit contributions
aus-0.050
ns-0.049
tre-0.049
roc-0.046
ban-0.046
Token my
Token ID616
Feature activation+1.23e-01

Pos logit contributions
arl+0.011
 ado+0.010
ously+0.010
 overboard+0.010
 punk+0.009
Neg logit contributions
aus-0.012
tre-0.012
ns-0.012
roc-0.011
ban-0.011
Token d
Token ID288
Feature activation-6.13e-03

Pos logit contributions
arl+0.102
ously+0.096
 ado+0.095
 fault+0.090
 overboard+0.087
Neg logit contributions
aus-0.209
ns-0.203
tre-0.200
ban-0.193
 lin-0.189
Tokenarl
Token ID7063
Feature activation+2.78e-01

Pos logit contributions
aus+0.014
tre+0.014
ns+0.014
roc+0.013
oc+0.013
Neg logit contributions
arl-0.002
ously-0.001
 ado-0.001
 overboard-0.001
osity-0.001
Tokenings
Token ID654
Feature activation-8.82e-02

Pos logit contributions
 ado+0.429
 overboard+0.425
ously+0.425
arl+0.402
 unconscious+0.401
Neg logit contributions
aus-0.251
tre-0.247
me-0.229
roc-0.224
oc-0.217
Token.
Token ID13
Feature activation-7.24e-04

Pos logit contributions
tre+0.138
aus+0.138
roc+0.132
ns+0.131
Tre+0.130
Neg logit contributions
arl-0.082
 ado-0.080
ously-0.077
 overboard-0.072
ching-0.070
Token I
Token ID314
Feature activation+8.27e-02

Pos logit contributions
aus+0.001
tre+0.001
ns+0.001
roc+0.001
me+0.001
Neg logit contributions
 ado-0.001
arl-0.001
ously-0.001
 overboard-0.001
osity-0.001
Token'm
Token ID1101
Feature activation+1.55e-02

Pos logit contributions
 ado+0.084
ously+0.083
arl+0.082
 overboard+0.081
ching+0.075
Neg logit contributions
aus-0.122
tre-0.117
ns-0.112
roc-0.110
ban-0.109
Token so
Token ID523
Feature activation+1.05e-03

Pos logit contributions
arl+0.011
 ado+0.011
 overboard+0.011
 punk+0.010
 fault+0.010
Neg logit contributions
aus-0.027
oc-0.027
tre-0.026
roc-0.026
ns-0.026
Token"
Token ID1
Feature activation-5.90e-02

Pos logit contributions
 ado+0.034
ously+0.034
arl+0.033
 overboard+0.033
 fault+0.030
Neg logit contributions
aus-0.027
tre-0.027
ns-0.025
roc-0.024
ban-0.024
TokenHello
Token ID15496
Feature activation+3.08e-02

Pos logit contributions
aus+0.083
tre+0.083
ban+0.077
me+0.077
ns+0.076
Neg logit contributions
 ado-0.067
ously-0.066
 overboard-0.065
arl-0.063
 fault-0.062
Token,
Token ID11
Feature activation+1.38e-02

Pos logit contributions
 ado+0.025
arl+0.024
ously+0.023
 overboard+0.021
 punk+0.021
Neg logit contributions
aus-0.051
ns-0.050
tre-0.049
roc-0.048
Tre-0.047
Token my
Token ID616
Feature activation+1.21e-01

Pos logit contributions
arl+0.015
 ado+0.015
ously+0.015
 overboard+0.014
 punk+0.013
Neg logit contributions
aus-0.019
tre-0.019
ns-0.019
roc-0.018
ban-0.018
Token d
Token ID288
Feature activation-3.47e-03

Pos logit contributions
arl+0.107
 ado+0.101
ously+0.100
 fault+0.098
 overboard+0.095
Neg logit contributions
aus-0.194
tre-0.190
ns-0.187
ban-0.178
roc-0.177
Tokenarl
Token ID7063
Feature activation+2.77e-01

Pos logit contributions
tre+0.009
aus+0.009
ns+0.009
 spe+0.009
roc+0.009
Neg logit contributions
arl-0.000
ously+0.000
 ado+0.000
 overboard+0.000
osity+0.001
Tokenings
Token ID654
Feature activation-8.70e-02

Pos logit contributions
 ado+0.417
ously+0.414
 overboard+0.413
arl+0.395
 unconscious+0.382
Neg logit contributions
aus-0.267
tre-0.262
me-0.243
roc-0.234
ns-0.233
Token.
Token ID13
Feature activation+7.04e-03

Pos logit contributions
tre+0.127
aus+0.126
roc+0.123
ns+0.120
Tre+0.119
Neg logit contributions
arl-0.090
 ado-0.088
ously-0.085
 overboard-0.080
ching-0.079
Token I
Token ID314
Feature activation+8.40e-02

Pos logit contributions
arl+0.006
 ado+0.006
ously+0.005
 overboard+0.005
osity+0.005
Neg logit contributions
aus-0.012
ns-0.011
tre-0.011
roc-0.011
 lin-0.011
Token'm
Token ID1101
Feature activation+1.69e-02

Pos logit contributions
 ado+0.075
ously+0.073
arl+0.072
 overboard+0.072
ching+0.068
Neg logit contributions
aus-0.134
tre-0.127
roc-0.123
ns-0.120
ban-0.120
Token so
Token ID523
Feature activation-3.37e-04

Pos logit contributions
arl+0.011
 ado+0.011
 overboard+0.010
 punk+0.010
 waving+0.010
Neg logit contributions
oc-0.031
aus-0.031
roc-0.030
tre-0.029
ns-0.029
Token perfect
Token ID2818
Feature activation+1.04e-02

Pos logit contributions
 overboard+0.007
 ado+0.007
 punk+0.007
arl+0.007
 way+0.007
Neg logit contributions
oc-0.014
aus-0.014
tre-0.014
me-0.014
ns-0.014
Token fish
Token ID5916
Feature activation+2.18e-02

Pos logit contributions
 ado+0.011
ously+0.011
arl+0.011
 overboard+0.011
 fault+0.010
Neg logit contributions
aus-0.015
tre-0.015
ns-0.014
roc-0.014
me-0.013
Token,
Token ID11
Feature activation-1.75e-04

Pos logit contributions
 ado+0.025
ously+0.025
arl+0.025
 overboard+0.024
 fault+0.022
Neg logit contributions
aus-0.029
tre-0.029
ns-0.028
ban-0.027
roc-0.026
Token my
Token ID616
Feature activation+1.17e-01

Pos logit contributions
tre+0.000
ns+0.000
aus+0.000
ban+0.000
oc+0.000
Neg logit contributions
arl-0.000
 ado-0.000
ously-0.000
 humor-0.000
 overboard-0.000
Token d
Token ID288
Feature activation-8.33e-03

Pos logit contributions
 ado+0.133
ously+0.132
arl+0.128
 overboard+0.126
 shattering+0.114
Neg logit contributions
aus-0.161
tre-0.159
ns-0.152
roc-0.148
ban-0.147
Tokenarl
Token ID7063
Feature activation+2.77e-01

Pos logit contributions
aus+0.014
tre+0.014
ns+0.013
roc+0.013
ban+0.013
Neg logit contributions
 ado-0.007
ously-0.007
 overboard-0.006
arl-0.006
 shattering-0.006
Tokenings
Token ID654
Feature activation-8.69e-02

Pos logit contributions
 ado+0.453
 unconscious+0.448
 overboard+0.447
ously+0.447
Charl+0.442
Neg logit contributions
aus-0.215
tre-0.202
me-0.195
roc-0.190
oc-0.186
Token,"
Token ID553
Feature activation-8.11e-02

Pos logit contributions
tre+0.128
aus+0.127
ns+0.120
roc+0.120
Tre+0.118
Neg logit contributions
arl-0.089
 ado-0.088
ously-0.087
 overboard-0.083
ching-0.078
Token mom
Token ID1995
Feature activation-4.64e-02

Pos logit contributions
aus+0.119
tre+0.118
ns+0.113
oc+0.108
roc+0.108
Neg logit contributions
 ado-0.085
ously-0.084
arl-0.081
 overboard-0.078
 appearances-0.075
Token says
Token ID1139
Feature activation+1.47e-02

Pos logit contributions
aus+0.067
tre+0.065
ns+0.061
ban+0.060
me+0.060
Neg logit contributions
 ado-0.050
ously-0.050
 overboard-0.049
arl-0.048
 fault-0.044
Token.
Token ID13
Feature activation-6.03e-03

Pos logit contributions
ously+0.014
 ado+0.014
arl+0.014
 overboard+0.013
 fault+0.012
Neg logit contributions
aus-0.023
tre-0.023
roc-0.022
ns-0.022
oc-0.021
Token to
Token ID284
Feature activation-6.35e-03

Pos logit contributions
arl+0.047
 ado+0.045
ously+0.044
 overboard+0.043
 punk+0.041
Neg logit contributions
tre-0.042
aus-0.040
ns-0.039
ban-0.038
 spe-0.038
Token grow
Token ID1663
Feature activation-1.34e-01

Pos logit contributions
aus+0.009
tre+0.009
ns+0.008
roc+0.008
ban+0.008
Neg logit contributions
 ado-0.007
ously-0.007
arl-0.007
 overboard-0.006
 fault-0.006
Tokenl
Token ID75
Feature activation-2.08e-01

Pos logit contributions
aus+0.222
tre+0.215
roc+0.207
ban+0.204
 lin+0.204
Neg logit contributions
ching-0.117
 overboard-0.117
ously-0.116
 ado-0.114
 off-0.110
Token and
Token ID290
Feature activation-2.03e-02

Pos logit contributions
tre+0.311
aus+0.306
roc+0.290
Tre+0.282
ns+0.282
Neg logit contributions
ously-0.214
arl-0.207
 ado-0.206
 overboard-0.205
ching-0.188
Token sn
Token ID3013
Feature activation-1.31e-01

Pos logit contributions
tre+0.026
aus+0.026
ns+0.025
ban+0.024
 spe+0.024
Neg logit contributions
arl-0.024
ously-0.024
 ado-0.024
 overboard-0.023
 fault-0.021
Tokenarl
Token ID7063
Feature activation+2.76e-01

Pos logit contributions
aus+0.201
tre+0.199
ns+0.189
ban+0.183
roc+0.183
Neg logit contributions
 ado-0.129
ously-0.128
arl-0.126
 overboard-0.123
 fault-0.109
Token to
Token ID284
Feature activation-6.40e-03

Pos logit contributions
 ado+0.335
 overboard+0.328
ously+0.326
arl+0.324
 punk+0.303
Neg logit contributions
aus-0.343
tre-0.336
ns-0.319
ban-0.310
roc-0.309
Token try
Token ID1949
Feature activation+1.27e-01

Pos logit contributions
aus+0.011
tre+0.010
me+0.010
ns+0.009
oc+0.009
Neg logit contributions
 ado-0.006
ously-0.006
arl-0.005
 overboard-0.005
 humor-0.005
Token and
Token ID290
Feature activation-2.00e-02

Pos logit contributions
 ado+0.133
ously+0.132
 overboard+0.127
arl+0.124
 fault+0.117
Neg logit contributions
aus-0.190
tre-0.183
ns-0.175
roc-0.171
me-0.170
Token scare
Token ID19437
Feature activation+6.08e-02

Pos logit contributions
tre+0.024
ns+0.024
aus+0.024
ban+0.023
 spe+0.023
Neg logit contributions
arl-0.026
ously-0.025
 ado-0.025
 overboard-0.024
 fault-0.023
Token her
Token ID607
Feature activation-2.63e-03

Pos logit contributions
 ado+0.063
ously+0.063
arl+0.060
 overboard+0.060
ching+0.055
Neg logit contributions
aus-0.090
tre-0.087
ns-0.082
me-0.081
ban-0.080
Token you
Token ID345
Feature activation-7.41e-02

Pos logit contributions
library+0.002
aus+0.002
tre+0.002
Tre+0.002
 lin+0.002
Neg logit contributions
 that-0.001
 hurts-0.001
 way-0.001
 kicks-0.001
 ado-0.001
Token go
Token ID467
Feature activation-4.40e-02

Pos logit contributions
tre+0.105
aus+0.104
ns+0.102
 spe+0.099
ban+0.096
Neg logit contributions
arl-0.079
 ado-0.077
ously-0.076
 overboard-0.075
ching-0.070
Token,
Token ID11
Feature activation+2.16e-03

Pos logit contributions
aus+0.067
tre+0.065
ns+0.063
ban+0.061
me+0.060
Neg logit contributions
 ado-0.045
arl-0.043
 overboard-0.042
ously-0.042
ching-0.037
Token my
Token ID616
Feature activation+1.16e-01

Pos logit contributions
arl+0.002
 ado+0.002
ously+0.002
 overboard+0.002
 humor+0.002
Neg logit contributions
tre-0.003
aus-0.003
ns-0.003
ban-0.003
roc-0.003
Token d
Token ID288
Feature activation-7.68e-03

Pos logit contributions
 ado+0.140
ously+0.137
 overboard+0.132
arl+0.126
osity+0.119
Neg logit contributions
aus-0.153
tre-0.149
ns-0.143
oc-0.140
me-0.140
Tokenarl
Token ID7063
Feature activation+2.76e-01

Pos logit contributions
aus+0.019
tre+0.019
ns+0.018
roc+0.018
ban+0.018
Neg logit contributions
 ado-0.001
ously-0.000
arl-0.000
 overboard-0.000
 fault+0.001
Tokenings
Token ID654
Feature activation-8.84e-02

Pos logit contributions
 ado+0.455
 overboard+0.451
ously+0.449
 unconscious+0.432
 punk+0.431
Neg logit contributions
aus-0.211
tre-0.198
me-0.194
roc-0.190
oc-0.185
Token,"
Token ID553
Feature activation-7.84e-02

Pos logit contributions
tre+0.139
aus+0.133
Tre+0.133
ns+0.131
 spe+0.131
Neg logit contributions
arl-0.086
 ado-0.079
ously-0.079
 overboard-0.075
ching-0.074
Token Mom
Token ID11254
Feature activation-1.92e-02

Pos logit contributions
ns+0.144
oc+0.140
tre+0.140
aus+0.137
roc+0.134
Neg logit contributions
 ado-0.054
ously-0.049
 humor-0.048
arl-0.048
 kisses-0.046
Token said
Token ID531
Feature activation-2.37e-02

Pos logit contributions
tre+0.033
roc+0.032
aus+0.031
 spe+0.031
ns+0.031
Neg logit contributions
ously-0.015
 ado-0.013
 joins-0.012
 kisses-0.012
 unconscious-0.012
Token.
Token ID13
Feature activation-7.82e-03

Pos logit contributions
aus+0.036
tre+0.035
ns+0.034
roc+0.033
oc+0.033
Neg logit contributions
ously-0.024
 ado-0.024
arl-0.023
 overboard-0.023
 fault-0.020
TokenWhat
Token ID2061
Feature activation+8.63e-02

Pos logit contributions
aus+0.048
tre+0.047
ns+0.045
ban+0.044
roc+0.044
Neg logit contributions
 ado-0.030
ously-0.030
arl-0.030
 overboard-0.029
 fault-0.026
Token happened
Token ID3022
Feature activation+8.66e-02

Pos logit contributions
 ado+0.098
ously+0.096
arl+0.096
 overboard+0.093
osity+0.084
Neg logit contributions
aus-0.120
tre-0.118
ns-0.111
me-0.108
ban-0.108
Token,
Token ID11
Feature activation+6.11e-03

Pos logit contributions
 ado+0.068
arl+0.066
 overboard+0.064
ously+0.063
 kicks+0.055
Neg logit contributions
aus-0.152
ns-0.146
tre-0.145
ban-0.138
roc-0.137
Token my
Token ID616
Feature activation+1.19e-01

Pos logit contributions
ously+0.009
 overboard+0.009
 ado+0.009
inated+0.008
osity+0.008
Neg logit contributions
aus-0.007
me-0.006
 reception-0.006
 pizz-0.006
 Dr-0.006
Token d
Token ID288
Feature activation-6.94e-03

Pos logit contributions
 ado+0.120
ously+0.118
arl+0.114
 overboard+0.114
osity+0.101
Neg logit contributions
aus-0.179
tre-0.176
ns-0.168
roc-0.163
me-0.163
Tokenarl
Token ID7063
Feature activation+2.76e-01

Pos logit contributions
aus+0.017
tre+0.017
ns+0.016
roc+0.016
ban+0.016
Neg logit contributions
 ado-0.000
ously-0.000
arl-0.000
 overboard-0.000
 fault+0.001
Tokenings
Token ID654
Feature activation-8.77e-02

Pos logit contributions
 overboard+0.443
 ado+0.441
ously+0.436
 unconscious+0.424
 punk+0.419
Neg logit contributions
aus-0.226
tre-0.214
me-0.207
roc-0.201
oc-0.200
Token?
Token ID30
Feature activation+4.49e-02

Pos logit contributions
aus+0.128
tre+0.125
ns+0.119
me+0.115
roc+0.115
Neg logit contributions
 ado-0.091
ously-0.091
arl-0.088
 overboard-0.087
 fault-0.079
Token Why
Token ID4162
Feature activation+4.85e-02

Pos logit contributions
 ado+0.048
ously+0.048
arl+0.046
 overboard+0.046
 fault+0.042
Neg logit contributions
tre-0.064
aus-0.063
ns-0.060
ban-0.058
me-0.058
Token are
Token ID389
Feature activation+1.33e-02

Pos logit contributions
 ado+0.064
ously+0.063
arl+0.063
 overboard+0.061
osity+0.056
Neg logit contributions
aus-0.059
tre-0.057
ns-0.054
me-0.053
roc-0.052
Token you
Token ID345
Feature activation-7.08e-02

Pos logit contributions
 ado+0.016
arl+0.015
ously+0.015
 overboard+0.015
 appearances+0.014
Neg logit contributions
aus-0.017
tre-0.017
ns-0.016
oc-0.016
roc-0.016
Token's
Token ID338
Feature activation-5.48e-02

Pos logit contributions
 ado+0.156
ously+0.156
 overboard+0.152
 fault+0.146
 way+0.145
Neg logit contributions
tre-0.218
aus-0.217
ns-0.206
roc-0.199
ban-0.198
Token good
Token ID922
Feature activation+4.78e-02

Pos logit contributions
tre+0.091
ns+0.088
ban+0.088
aus+0.087
oc+0.086
Neg logit contributions
arl-0.046
 ado-0.041
 way-0.041
 appearances-0.041
 overboard-0.040
Token,
Token ID11
Feature activation+4.10e-03

Pos logit contributions
 ado+0.049
ously+0.048
arl+0.048
 overboard+0.045
 fault+0.043
Neg logit contributions
aus-0.072
tre-0.071
ns-0.069
roc-0.065
Tre-0.065
Token my
Token ID616
Feature activation+1.17e-01

Pos logit contributions
 ado+0.004
ously+0.004
 overboard+0.004
arl+0.004
ching+0.004
Neg logit contributions
aus-0.006
tre-0.006
ns-0.005
roc-0.005
me-0.005
Token d
Token ID288
Feature activation-7.41e-03

Pos logit contributions
arl+0.105
ously+0.103
 ado+0.100
 fault+0.099
 overboard+0.093
Neg logit contributions
aus-0.190
tre-0.188
ns-0.179
ban-0.173
library-0.171
Tokenarl
Token ID7063
Feature activation+2.76e-01

Pos logit contributions
tre+0.015
aus+0.015
ns+0.015
oc+0.014
ban+0.014
Neg logit contributions
arl-0.004
ously-0.003
 ado-0.003
 overboard-0.003
 fault-0.003
Tokenings
Token ID654
Feature activation-8.86e-02

Pos logit contributions
 ado+0.422
ously+0.416
 overboard+0.416
arl+0.399
 unconscious+0.389
Neg logit contributions
aus-0.255
tre-0.249
me-0.230
roc-0.227
ns-0.224
Token.
Token ID13
Feature activation-4.41e-03

Pos logit contributions
tre+0.132
aus+0.130
ns+0.123
roc+0.122
Tre+0.120
Neg logit contributions
arl-0.090
 ado-0.089
ously-0.088
 overboard-0.083
ching-0.078
Token I
Token ID314
Feature activation+7.93e-02

Pos logit contributions
aus+0.006
tre+0.006
ns+0.006
roc+0.006
ban+0.006
Neg logit contributions
arl-0.005
 ado-0.005
ously-0.005
 overboard-0.004
 fault-0.004
Token'm
Token ID1101
Feature activation+1.39e-02

Pos logit contributions
 ado+0.069
ously+0.069
arl+0.066
 overboard+0.065
ching+0.062
Neg logit contributions
aus-0.127
tre-0.124
ns-0.116
library-0.115
roc-0.115
Token glad
Token ID9675
Feature activation+1.43e-01

Pos logit contributions
 ado+0.011
arl+0.011
ously+0.010
 overboard+0.010
 fault+0.009
Neg logit contributions
aus-0.024
tre-0.024
ns-0.022
roc-0.022
oc-0.022
Token "
Token ID366
Feature activation-2.93e-02

Pos logit contributions
arl+0.006
ously+0.005
 ado+0.005
 overboard+0.005
 shattering+0.005
Neg logit contributions
aus-0.009
tre-0.008
roc-0.008
ns-0.008
library-0.008
TokenHello
Token ID15496
Feature activation+2.91e-02

Pos logit contributions
aus+0.043
tre+0.043
ns+0.041
roc+0.040
ban+0.039
Neg logit contributions
 ado-0.030
ously-0.030
arl-0.029
 overboard-0.028
ching-0.025
Token,
Token ID11
Feature activation+6.79e-03

Pos logit contributions
arl+0.023
 ado+0.023
ously+0.022
 overboard+0.020
 punk+0.020
Neg logit contributions
aus-0.048
tre-0.047
ns-0.047
roc-0.045
ban-0.045
Token my
Token ID616
Feature activation+1.22e-01

Pos logit contributions
 ado+0.007
ously+0.007
 overboard+0.007
arl+0.007
 fault+0.006
Neg logit contributions
aus-0.010
tre-0.010
ns-0.009
me-0.009
roc-0.009
Token d
Token ID288
Feature activation-6.23e-03

Pos logit contributions
arl+0.081
ously+0.079
 ado+0.079
 overboard+0.072
 fault+0.072
Neg logit contributions
aus-0.224
tre-0.219
ns-0.215
ban-0.208
roc-0.205
Tokenarl
Token ID7063
Feature activation+2.76e-01

Pos logit contributions
aus+0.015
tre+0.015
ns+0.015
roc+0.014
ban+0.014
Neg logit contributions
arl-0.001
ously-0.001
 ado-0.000
 overboard-0.000
osity+0.000
Tokenings
Token ID654
Feature activation-8.92e-02

Pos logit contributions
 overboard+0.437
 ado+0.433
ously+0.429
 unconscious+0.412
 shattering+0.404
Neg logit contributions
aus-0.239
tre-0.230
me-0.222
roc-0.213
oc-0.212
Token.
Token ID13
Feature activation-3.49e-03

Pos logit contributions
aus+0.124
tre+0.123
ns+0.116
roc+0.112
ban+0.112
Neg logit contributions
 ado-0.098
ously-0.098
arl-0.096
 overboard-0.094
 fault-0.085
Token I
Token ID314
Feature activation+8.27e-02

Pos logit contributions
aus+0.005
tre+0.005
ns+0.005
ban+0.005
roc+0.005
Neg logit contributions
 ado-0.004
ously-0.004
arl-0.003
 overboard-0.003
 fault-0.003
Token'm
Token ID1101
Feature activation+1.65e-02

Pos logit contributions
 ado+0.091
ously+0.090
arl+0.089
 overboard+0.087
 fault+0.079
Neg logit contributions
aus-0.115
tre-0.113
ns-0.107
roc-0.104
ban-0.104
Token so
Token ID523
Feature activation+4.72e-04

Pos logit contributions
arl+0.010
 ado+0.010
 overboard+0.010
 punk+0.009
 unconscious+0.009
Neg logit contributions
aus-0.031
oc-0.030
roc-0.030
tre-0.030
ns-0.029
Token have
Token ID423
Feature activation-1.36e-03

Pos logit contributions
aus+0.109
tre+0.109
ns+0.102
roc+0.100
ban+0.099
Neg logit contributions
 ado-0.077
ously-0.076
arl-0.075
 overboard-0.074
ching-0.067
Token fun
Token ID1257
Feature activation+6.37e-03

Pos logit contributions
aus+0.002
tre+0.002
ns+0.002
ban+0.002
roc+0.002
Neg logit contributions
ously-0.002
arl-0.002
 ado-0.002
 overboard-0.001
osity-0.001
Token,
Token ID11
Feature activation-1.30e-03

Pos logit contributions
 ado+0.008
ously+0.008
arl+0.007
 overboard+0.007
 fault+0.007
Neg logit contributions
aus-0.008
tre-0.008
ns-0.008
ban-0.008
roc-0.007
Token my
Token ID616
Feature activation+1.15e-01

Pos logit contributions
ns+0.002
tre+0.002
ban+0.002
aus+0.002
 lin+0.002
Neg logit contributions
arl-0.001
 ado-0.001
ously-0.001
 punk-0.001
 appearances-0.001
Token d
Token ID288
Feature activation-9.27e-03

Pos logit contributions
 ado+0.127
ously+0.126
arl+0.124
 overboard+0.122
 fault+0.110
Neg logit contributions
aus-0.160
tre-0.158
ns-0.150
roc-0.145
ban-0.145
Tokenarl
Token ID7063
Feature activation+2.75e-01

Pos logit contributions
aus+0.020
tre+0.020
ns+0.019
roc+0.019
ban+0.019
Neg logit contributions
 ado-0.003
ously-0.003
 overboard-0.003
arl-0.002
 fault-0.002
Tokenings
Token ID654
Feature activation-8.91e-02

Pos logit contributions
 overboard+0.431
 ado+0.425
ously+0.421
 unconscious+0.413
 shattering+0.405
Neg logit contributions
aus-0.241
tre-0.227
me-0.216
roc-0.216
oc-0.214
Token?"
Token ID1701
Feature activation-2.28e-03

Pos logit contributions
tre+0.141
aus+0.134
 spe+0.133
ns+0.132
Tre+0.131
Neg logit contributions
arl-0.085
 ado-0.081
ously-0.077
 overboard-0.075
ching-0.073
Token she
Token ID673
Feature activation+3.12e-03

Pos logit contributions
aus+0.004
tre+0.004
ns+0.004
me+0.003
oc+0.003
Neg logit contributions
 ado-0.002
ously-0.002
arl-0.002
 overboard-0.002
 humor-0.002
Token asks
Token ID7893
Feature activation+1.05e-01

Pos logit contributions
 ado+0.001
ously+0.001
 kicks+0.001
arl+0.001
 overboard+0.001
Neg logit contributions
tre-0.006
aus-0.006
roc-0.006
ns-0.006
oc-0.006
Token.
Token ID13
Feature activation-5.69e-03

Pos logit contributions
ously+0.080
 ado+0.066
arl+0.058
 off+0.057
ges+0.056
Neg logit contributions
aus-0.201
oc-0.198
ns-0.195
tre-0.191
 lin-0.186
Token you
Token ID345
Feature activation-7.10e-02

Pos logit contributions
 ado+0.131
ously+0.129
arl+0.128
 overboard+0.125
 humor+0.117
Neg logit contributions
aus-0.133
tre-0.131
ns-0.125
oc-0.121
roc-0.119
Token too
Token ID1165
Feature activation-9.81e-02

Pos logit contributions
aus+0.077
tre+0.074
ns+0.069
ban+0.068
me+0.067
Neg logit contributions
 ado-0.102
ously-0.101
arl-0.099
 overboard-0.098
osity-0.092
Token,
Token ID11
Feature activation+6.30e-03

Pos logit contributions
aus+0.078
ban+0.078
oc+0.076
tre+0.075
 spe+0.074
Neg logit contributions
ously-0.156
 overboard-0.154
 ado-0.153
osity-0.148
 shattering-0.147
Token my
Token ID616
Feature activation+1.19e-01

Pos logit contributions
 ado+0.007
ously+0.006
arl+0.006
 overboard+0.006
 fault+0.006
Neg logit contributions
aus-0.009
tre-0.009
ns-0.009
roc-0.008
ban-0.008
Token d
Token ID288
Feature activation-5.79e-03

Pos logit contributions
 ado+0.111
ously+0.110
arl+0.108
 overboard+0.106
 fault+0.093
Neg logit contributions
aus-0.187
tre-0.185
ns-0.177
roc-0.172
ban-0.171
Tokenarl
Token ID7063
Feature activation+2.75e-01

Pos logit contributions
aus+0.014
tre+0.014
ns+0.013
roc+0.013
ban+0.013
Neg logit contributions
 ado-0.001
ously-0.001
 overboard-0.000
arl-0.000
 fault+0.000
Tokenings
Token ID654
Feature activation-8.86e-02

Pos logit contributions
 overboard+0.459
 ado+0.447
 unconscious+0.443
ously+0.439
 punk+0.437
Neg logit contributions
aus-0.208
me-0.194
tre-0.193
roc-0.185
oc-0.184
Token.
Token ID13
Feature activation-1.94e-03

Pos logit contributions
aus+0.114
tre+0.110
ns+0.104
me+0.102
ban+0.102
Neg logit contributions
 ado-0.108
ously-0.106
 overboard-0.104
arl-0.102
 fault-0.096
Token But
Token ID887
Feature activation+3.67e-02

Pos logit contributions
tre+0.003
aus+0.003
oc+0.002
ns+0.002
me+0.002
Neg logit contributions
ously-0.002
 ado-0.002
 overboard-0.002
arl-0.002
ching-0.002
Token you
Token ID345
Feature activation-7.10e-02

Pos logit contributions
 ado+0.045
ously+0.045
arl+0.044
 overboard+0.043
 fault+0.040
Neg logit contributions
aus-0.046
tre-0.046
ns-0.043
ban-0.042
roc-0.042
Token have
Token ID423
Feature activation+5.30e-03

Pos logit contributions
tre+0.131
 lin+0.125
aus+0.123
Tre+0.123
library+0.122
Neg logit contributions
arl-0.041
 overboard-0.040
 fault-0.040
 ado-0.039
ching-0.039
TokenNot
Token ID3673
Feature activation+1.40e-01

Pos logit contributions
tre+0.088
aus+0.086
Tre+0.085
ban+0.083
me+0.082
Neg logit contributions
 ado-0.095
 overboard-0.091
 fault-0.090
ously-0.090
upid-0.086
Token now
Token ID783
Feature activation+6.94e-02

Pos logit contributions
 ado+0.153
ously+0.150
arl+0.149
 overboard+0.145
 fault+0.134
Neg logit contributions
aus-0.201
tre-0.197
ns-0.189
roc-0.181
me-0.180
Token,
Token ID11
Feature activation+1.81e-03

Pos logit contributions
ously+0.083
 ado+0.083
arl+0.081
 overboard+0.080
osity+0.074
Neg logit contributions
aus-0.089
tre-0.088
ns-0.083
ban-0.081
me-0.080
Token my
Token ID616
Feature activation+1.17e-01

Pos logit contributions
 overboard+0.002
 ado+0.002
ously+0.002
arl+0.002
ching+0.002
Neg logit contributions
aus-0.002
tre-0.002
me-0.002
roc-0.002
 spe-0.002
Token d
Token ID288
Feature activation-7.07e-03

Pos logit contributions
 ado+0.113
ously+0.111
 overboard+0.106
arl+0.105
osity+0.096
Neg logit contributions
aus-0.185
tre-0.178
ns-0.172
me-0.170
oc-0.167
Tokenarl
Token ID7063
Feature activation+2.75e-01

Pos logit contributions
tre+0.018
aus+0.018
ns+0.017
roc+0.017
ban+0.017
Neg logit contributions
arl-0.000
ously+0.000
 ado+0.000
 overboard+0.000
 fault+0.001
Tokenings
Token ID654
Feature activation-8.78e-02

Pos logit contributions
 ado+0.428
 overboard+0.426
ously+0.425
 unconscious+0.402
arl+0.400
Neg logit contributions
aus-0.246
tre-0.235
me-0.224
roc-0.218
oc-0.213
Token,
Token ID11
Feature activation+1.65e-03

Pos logit contributions
tre+0.130
aus+0.125
roc+0.123
Tre+0.122
ns+0.121
Neg logit contributions
arl-0.097
 ado-0.090
ously-0.088
 overboard-0.084
ching-0.082
Token it
Token ID340
Feature activation-3.23e-02

Pos logit contributions
arl+0.001
 ado+0.001
ously+0.001
 overboard+0.001
 shattering+0.001
Neg logit contributions
tre-0.003
ns-0.003
ban-0.003
aus-0.003
library-0.003
Token is
Token ID318
Feature activation+2.01e-02

Pos logit contributions
aus+0.045
tre+0.044
roc+0.041
ns+0.041
ban+0.041
Neg logit contributions
 ado-0.035
ously-0.035
 overboard-0.034
arl-0.034
ching-0.032
Token too
Token ID1165
Feature activation-1.01e-01

Pos logit contributions
 overboard+0.020
 ado+0.020
arl+0.020
 fault+0.019
 shattering+0.018
Neg logit contributions
tre-0.029
aus-0.029
roc-0.028
ban-0.028
oc-0.028
Token "
Token ID366
Feature activation-3.30e-02

Pos logit contributions
aus+0.077
tre+0.076
ns+0.071
roc+0.071
oc+0.069
Neg logit contributions
ously-0.053
 ado-0.052
arl-0.049
 overboard-0.049
 shattering-0.045
Tokendragon
Token ID14844
Feature activation-1.21e-01

Pos logit contributions
aus+0.050
 spe+0.048
tre+0.047
ns+0.047
 reception+0.046
Neg logit contributions
arl-0.030
ously-0.029
 ado-0.029
ching-0.028
 overboard-0.028
Token",
Token ID1600
Feature activation-5.53e-02

Pos logit contributions
aus+0.170
tre+0.167
ns+0.155
 spe+0.151
library+0.150
Neg logit contributions
 ado-0.128
arl-0.127
ously-0.127
 overboard-0.123
ching-0.116
Token and
Token ID290
Feature activation-2.27e-02

Pos logit contributions
aus+0.081
tre+0.080
ns+0.075
roc+0.075
library+0.072
Neg logit contributions
ously-0.057
 ado-0.056
arl-0.054
 overboard-0.053
 humor-0.048
Token "
Token ID366
Feature activation-3.54e-02

Pos logit contributions
aus+0.033
tre+0.033
ns+0.030
roc+0.030
me+0.030
Neg logit contributions
 ado-0.024
ously-0.023
arl-0.023
 overboard-0.022
 humor-0.020
Tokentre
Token ID33945
Feature activation-3.92e-01

Pos logit contributions
aus+0.051
 spe+0.049
tre+0.048
 reception+0.048
ns+0.047
Neg logit contributions
arl-0.035
ously-0.033
 ado-0.033
 overboard-0.033
ching-0.032
Tokenasure
Token ID5015
Feature activation-4.43e-02

Pos logit contributions
 spe+0.547
tre+0.539
ns+0.511
aus+0.503
oc+0.496
Neg logit contributions
arl-0.459
ously-0.418
 overboard-0.415
upid-0.412
 fault-0.406
Token".
Token ID1911
Feature activation+2.46e-02

Pos logit contributions
aus+0.065
tre+0.064
 spe+0.059
 lin+0.059
ns+0.058
Neg logit contributions
arl-0.045
ching-0.043
 overboard-0.042
 off-0.041
upid-0.041
Token They
Token ID1119
Feature activation+1.62e-02

Pos logit contributions
ously+0.022
 ado+0.022
arl+0.021
 overboard+0.020
ching+0.019
Neg logit contributions
aus-0.038
tre-0.038
 spe-0.036
ns-0.036
roc-0.036
Token think
Token ID892
Feature activation+4.86e-02

Pos logit contributions
 ado+0.018
ously+0.017
arl+0.017
 overboard+0.017
 fault+0.015
Neg logit contributions
aus-0.023
tre-0.022
ns-0.021
ban-0.021
roc-0.021
Token the
Token ID262
Feature activation-4.84e-02

Pos logit contributions
ously+0.045
 ado+0.044
arl+0.041
 overboard+0.041
 fault+0.040
Neg logit contributions
tre-0.076
aus-0.076
roc-0.072
ns-0.070
oc-0.070
Token
Token ID447
Feature activation-8.91e-02

Pos logit contributions
aus+0.147
roc+0.132
tre+0.127
ocol+0.126
 dom+0.123
Neg logit contributions
ously-0.108
Charl-0.107
 ado-0.102
ching-0.102
 overboard-0.101
Token
Token ID250
Feature activation-9.67e-02

Pos logit contributions
 reception+0.163
roc+0.161
tre+0.160
ns+0.160
 omn+0.157
Neg logit contributions
arl-0.048
Charl-0.043
 overboard-0.034
 punk-0.033
Again-0.033
Token Rich
Token ID3998
Feature activation-5.42e-02

Pos logit contributions
aus+0.147
roc+0.142
tre+0.138
ns+0.136
 reception+0.133
Neg logit contributions
arl-0.087
ously-0.084
 ado-0.082
ching-0.080
 humor-0.079
Token the
Token ID262
Feature activation-3.33e-02

Pos logit contributions
aus+0.079
tre+0.079
ns+0.075
 lin+0.074
library+0.073
Neg logit contributions
ously-0.055
 ado-0.052
arl-0.051
 overboard-0.049
ching-0.048
Token S
Token ID311
Feature activation-2.54e-01

Pos logit contributions
tre+0.050
aus+0.049
 lin+0.046
me+0.046
ns+0.046
Neg logit contributions
 ado-0.031
arl-0.031
ously-0.030
 overboard-0.030
 way-0.030
Tokenaus
Token ID8717
Feature activation-3.82e-01

Pos logit contributions
 spe+0.283
 lin+0.253
tre+0.240
Tre+0.239
roc+0.230
Neg logit contributions
arl-0.400
ously-0.374
 ado-0.356
ching-0.350
osity-0.334
Tokenage
Token ID496
Feature activation-5.25e-02

Pos logit contributions
aus+0.518
tre+0.515
library+0.493
 lin+0.484
 av+0.471
Neg logit contributions
ously-0.445
arl-0.420
ching-0.408
 overboard-0.397
 ado-0.395
Token Man
Token ID1869
Feature activation-1.11e-01

Pos logit contributions
aus+0.078
tre+0.077
ns+0.073
roc+0.072
ban+0.071
Neg logit contributions
ously-0.053
 ado-0.051
arl-0.051
 overboard-0.050
ching-0.046
Token!
Token ID0
Feature activation+1.99e-02

Pos logit contributions
aus+0.168
tre+0.165
roc+0.157
ns+0.155
 lin+0.153
Neg logit contributions
ously-0.107
 ado-0.102
arl-0.099
 overboard-0.098
ching-0.094
Token<|endoftext|>
Token ID50256
Feature activation-5.01e-02

Pos logit contributions
 ado+0.025
ously+0.025
 overboard+0.024
arl+0.024
 fault+0.022
Neg logit contributions
aus-0.025
tre-0.024
ns-0.022
ban-0.022
me-0.022
TokenOnce
Token ID7454
Feature activation+3.33e-02

Pos logit contributions
aus+0.074
tre+0.070
roc+0.070
ns+0.068
ban+0.066
Neg logit contributions
 ado-0.050
ously-0.049
arl-0.049
 overboard-0.049
ching-0.046
Token proudly
Token ID23017
Feature activation+2.72e-03

Pos logit contributions
tre+0.001
aus+0.001
roc+0.001
Tre+0.001
ban+0.001
Neg logit contributions
ously-0.000
 ado-0.000
 overboard-0.000
 joins-0.000
 kicks-0.000
Token said
Token ID531
Feature activation-1.74e-02

Pos logit contributions
arl+0.003
 ado+0.003
 overboard+0.003
ously+0.003
 fault+0.003
Neg logit contributions
aus-0.003
tre-0.003
ns-0.003
me-0.003
ban-0.003
Token,
Token ID11
Feature activation+1.01e-02

Pos logit contributions
aus+0.023
tre+0.023
ns+0.022
ban+0.021
me+0.021
Neg logit contributions
 ado-0.020
arl-0.020
ously-0.020
 overboard-0.019
 fault-0.018
Token "
Token ID366
Feature activation-2.98e-02

Pos logit contributions
ously+0.011
arl+0.010
 ado+0.010
 overboard+0.010
 shattering+0.009
Neg logit contributions
aus-0.015
tre-0.014
roc-0.014
ns-0.014
library-0.013
Tokens
Token ID82
Feature activation+8.13e-03

Pos logit contributions
aus+0.049
roc+0.047
ns+0.047
 spe+0.047
tre+0.047
Neg logit contributions
ously-0.023
arl-0.022
 ado-0.021
 overboard-0.020
ching-0.020
Tokenaus
Token ID8717
Feature activation-3.81e-01

Pos logit contributions
arl+0.006
 ado+0.006
ously+0.006
 overboard+0.006
 fault+0.005
Neg logit contributions
tre-0.014
aus-0.014
ns-0.013
ban-0.013
roc-0.013
Tokenages
Token ID1095
Feature activation-1.14e-01

Pos logit contributions
aus+0.508
tre+0.497
library+0.476
 av+0.468
 spe+0.468
Neg logit contributions
ously-0.456
arl-0.424
 ado-0.412
 overboard-0.411
ching-0.410
Token!"
Token ID2474
Feature activation+3.12e-02

Pos logit contributions
aus+0.146
tre+0.141
ns+0.133
me+0.133
ban+0.128
Neg logit contributions
 ado-0.141
ously-0.139
arl-0.133
 overboard-0.132
 appearances-0.123
Token Her
Token ID2332
Feature activation+2.52e-02

Pos logit contributions
 ado+0.032
ously+0.032
arl+0.031
 overboard+0.031
 fault+0.028
Neg logit contributions
aus-0.046
tre-0.045
ns-0.043
roc-0.042
ban-0.042
Token dad
Token ID9955
Feature activation+1.30e-02

Pos logit contributions
 ado+0.022
ously+0.022
arl+0.021
 fault+0.020
 overboard+0.020
Neg logit contributions
tre-0.042
aus-0.041
ns-0.040
roc-0.038
ban-0.038
Token smiled
Token ID13541
Feature activation-1.31e-02

Pos logit contributions
ously+0.010
ching+0.008
 joins+0.008
 ado+0.008
 overboard+0.008
Neg logit contributions
tre-0.022
roc-0.021
aus-0.021
Tre-0.021
library-0.020
Token man
Token ID582
Feature activation-1.54e-01

Pos logit contributions
tre+0.062
aus+0.062
ns+0.057
ban+0.054
roc+0.054
Neg logit contributions
 ado-0.108
arl-0.106
ously-0.106
 overboard-0.103
 punk-0.097
Token replied
Token ID8712
Feature activation-1.95e-02

Pos logit contributions
tre+0.253
aus+0.249
ns+0.240
roc+0.235
ban+0.232
Neg logit contributions
ously-0.127
 ado-0.121
 overboard-0.114
 kicks-0.109
 punk-0.108
Token,
Token ID11
Feature activation+1.71e-02

Pos logit contributions
aus+0.032
tre+0.031
ns+0.031
library+0.029
roc+0.029
Neg logit contributions
ously-0.018
 overboard-0.017
 ado-0.017
arl-0.016
 shattering-0.015
Token "
Token ID366
Feature activation-2.28e-02

Pos logit contributions
ously+0.019
 ado+0.019
arl+0.019
 overboard+0.019
 shattering+0.017
Neg logit contributions
aus-0.023
tre-0.023
ns-0.022
roc-0.021
ban-0.021
TokenS
Token ID50
Feature activation-9.64e-02

Pos logit contributions
aus+0.039
ns+0.037
tre+0.037
roc+0.036
 spe+0.036
Neg logit contributions
arl-0.016
 overboard-0.016
 ado-0.016
ously-0.015
itating-0.014
Tokenaus
Token ID8717
Feature activation-3.76e-01

Pos logit contributions
 spe+0.152
ns+0.148
 reception+0.134
ban+0.132
 ranc+0.129
Neg logit contributions
arl-0.116
ously-0.094
 ado-0.091
 fault-0.087
 overboard-0.087
Tokenages
Token ID1095
Feature activation-1.11e-01

Pos logit contributions
aus+0.500
tre+0.489
library+0.463
 av+0.460
 lin+0.455
Neg logit contributions
ously-0.450
arl-0.426
 ado-0.412
 overboard-0.411
ching-0.403
Token!
Token ID0
Feature activation+2.24e-02

Pos logit contributions
aus+0.132
tre+0.127
ns+0.118
me+0.117
ban+0.113
Neg logit contributions
ously-0.149
 ado-0.148
arl-0.139
 overboard-0.138
 appearances-0.130
Token Lots
Token ID25024
Feature activation-5.17e-02

Pos logit contributions
arl+0.021
 ado+0.019
 overboard+0.019
osity+0.018
ously+0.018
Neg logit contributions
tre-0.034
aus-0.034
ns-0.034
 lin-0.032
roc-0.032
Token and
Token ID290
Feature activation-8.22e-03

Pos logit contributions
aus+0.059
tre+0.057
oc+0.052
me+0.051
ns+0.051
Neg logit contributions
ously-0.073
 ado-0.072
arl-0.070
 overboard-0.069
 fault-0.066
Token lots
Token ID6041
Feature activation-1.84e-02

Pos logit contributions
roc+0.012
tre+0.012
aus+0.012
ban+0.011
me+0.011
Neg logit contributions
 ado-0.008
 overboard-0.007
arl-0.007
 way-0.007
 off-0.007
Token along
Token ID1863
Feature activation+1.03e-01

Pos logit contributions
aus+0.001
tre+0.001
roc+0.001
ns+0.001
oc+0.001
Neg logit contributions
ously-0.001
 ado-0.000
 off-0.000
 overboard-0.000
arl-0.000
Token to
Token ID284
Feature activation-4.61e-03

Pos logit contributions
 off+0.079
 that+0.074
ously+0.074
 way+0.067
 ado+0.067
Neg logit contributions
aus-0.199
roc-0.177
tre-0.176
ns-0.174
icer-0.166
Token the
Token ID262
Feature activation-4.20e-02

Pos logit contributions
aus+0.007
tre+0.006
ns+0.006
roc+0.006
me+0.006
Neg logit contributions
 ado-0.005
ously-0.005
arl-0.005
 overboard-0.005
 humor-0.004
Token hy
Token ID2537
Feature activation-1.46e-01

Pos logit contributions
aus+0.063
tre+0.060
roc+0.057
oc+0.056
ns+0.056
Neg logit contributions
 ado-0.043
 way-0.041
ously-0.041
 overboard-0.040
arl-0.040
Tokenm
Token ID76
Feature activation-9.05e-03

Pos logit contributions
aus+0.239
tre+0.236
ns+0.226
ban+0.219
roc+0.219
Neg logit contributions
 ado-0.127
ously-0.126
arl-0.123
 overboard-0.120
 fault-0.106
Tokenns
Token ID5907
Feature activation-3.62e-01

Pos logit contributions
aus+0.010
tre+0.010
ns+0.009
ban+0.009
roc+0.009
Neg logit contributions
ously-0.013
 ado-0.013
arl-0.013
 overboard-0.012
 fault-0.012
Token,
Token ID11
Feature activation+6.87e-03

Pos logit contributions
aus+0.518
tre+0.511
roc+0.488
ns+0.471
ban+0.462
Neg logit contributions
ously-0.385
 ado-0.383
arl-0.368
 overboard-0.368
ching-0.341
Token said
Token ID531
Feature activation-2.47e-02

Pos logit contributions
 ado+0.007
ously+0.007
arl+0.007
 overboard+0.006
 humor+0.006
Neg logit contributions
aus-0.010
tre-0.010
roc-0.010
ban-0.009
ns-0.009
Token prayers
Token ID15707
Feature activation-1.05e-01

Pos logit contributions
aus+0.038
tre+0.037
ns+0.036
roc+0.035
oc+0.034
Neg logit contributions
ously-0.025
 ado-0.024
arl-0.024
 overboard-0.023
osity-0.021
Token and
Token ID290
Feature activation-1.85e-02

Pos logit contributions
aus+0.128
tre+0.127
ns+0.119
me+0.118
ban+0.117
Neg logit contributions
 ado-0.132
 overboard-0.128
arl-0.128
ously-0.127
 humor-0.118
Token listened
Token ID16399
Feature activation+8.63e-02

Pos logit contributions
aus+0.024
tre+0.023
ns+0.023
 spe+0.022
ban+0.022
Neg logit contributions
arl-0.022
 ado-0.022
 overboard-0.022
ously-0.021
 fault-0.020
Token a
Token ID257
Feature activation+2.65e-03

Pos logit contributions
 ado+0.079
ously+0.078
arl+0.076
 overboard+0.074
 humor+0.069
Neg logit contributions
aus-0.110
tre-0.108
ns-0.104
roc-0.103
ban-0.103
Token tub
Token ID12202
Feature activation+1.23e-01

Pos logit contributions
 ado+0.003
ously+0.003
arl+0.003
 overboard+0.003
 appearances+0.003
Neg logit contributions
aus-0.004
tre-0.004
ns-0.003
me-0.003
ban-0.003
Token of
Token ID286
Feature activation+3.02e-02

Pos logit contributions
ously+0.127
 ado+0.126
arl+0.124
 overboard+0.121
 fault+0.109
Neg logit contributions
aus-0.179
tre-0.176
ns-0.169
roc-0.164
ban-0.162
Token strawberry
Token ID41236
Feature activation+2.23e-02

Pos logit contributions
ously+0.037
 ado+0.036
arl+0.036
 overboard+0.035
osity+0.032
Neg logit contributions
aus-0.040
tre-0.039
ns-0.037
me-0.036
ban-0.035
Token-
Token ID12
Feature activation-2.24e-01

Pos logit contributions
ously+0.024
 ado+0.024
 overboard+0.024
arl+0.023
 fault+0.021
Neg logit contributions
aus-0.031
tre-0.030
ns-0.029
roc-0.029
ban-0.028
Tokenban
Token ID3820
Feature activation-3.58e-01

Pos logit contributions
aus+0.308
tre+0.298
ns+0.288
roc+0.288
 spe+0.280
Neg logit contributions
ously-0.229
 overboard-0.227
 ado-0.227
arl-0.226
ching-0.208
Tokenana
Token ID2271
Feature activation-9.36e-02

Pos logit contributions
aus+0.464
tre+0.457
ns+0.427
roc+0.420
me+0.409
Neg logit contributions
ously-0.432
 ado-0.429
arl-0.421
 overboard-0.411
 fault-0.386
Token yogurt
Token ID32132
Feature activation-2.18e-02

Pos logit contributions
aus+0.123
tre+0.122
ns+0.115
me+0.111
ban+0.111
Neg logit contributions
 ado-0.110
ously-0.108
arl-0.107
 overboard-0.105
 fault-0.096
Token.
Token ID13
Feature activation+2.53e-03

Pos logit contributions
aus+0.030
tre+0.030
ns+0.028
roc+0.027
ban+0.027
Neg logit contributions
 ado-0.024
ously-0.024
arl-0.024
 overboard-0.023
 fault-0.021
Token She
Token ID1375
Feature activation-3.74e-03

Pos logit contributions
 ado+0.003
ously+0.003
arl+0.003
 overboard+0.003
ching+0.003
Neg logit contributions
aus-0.003
tre-0.003
ns-0.003
ban-0.003
me-0.003
Token tore
Token ID26197
Feature activation+1.03e-01

Pos logit contributions
aus+0.005
tre+0.005
roc+0.005
ns+0.005
ban+0.005
Neg logit contributions
ously-0.004
 ado-0.004
 overboard-0.004
arl-0.003
 joins-0.003
Token new
Token ID649
Feature activation+2.46e-02

Pos logit contributions
aus+0.064
tre+0.064
ns+0.060
roc+0.058
ban+0.058
Neg logit contributions
 ado-0.048
ously-0.047
arl-0.046
 overboard-0.045
 fault-0.041
Token words
Token ID2456
Feature activation-5.26e-02

Pos logit contributions
 ado+0.026
ously+0.026
arl+0.025
 overboard+0.025
 fault+0.023
Neg logit contributions
aus-0.036
tre-0.035
ns-0.033
ban-0.032
roc-0.032
Token,
Token ID11
Feature activation+1.35e-03

Pos logit contributions
aus+0.052
tre+0.051
ban+0.047
ns+0.047
me+0.045
Neg logit contributions
arl-0.078
 ado-0.078
 overboard-0.074
ously-0.072
 shattering-0.071
Token like
Token ID588
Feature activation-2.03e-02

Pos logit contributions
 ado+0.001
ously+0.001
arl+0.001
 overboard+0.001
 fault+0.001
Neg logit contributions
aus-0.002
tre-0.002
ns-0.002
ban-0.002
me-0.002
Token "
Token ID366
Feature activation-3.18e-02

Pos logit contributions
aus+0.026
tre+0.025
ns+0.025
ban+0.024
 spe+0.023
Neg logit contributions
ously-0.025
 ado-0.025
arl-0.024
 overboard-0.024
 shattering-0.022
Tokenban
Token ID3820
Feature activation-3.58e-01

Pos logit contributions
aus+0.044
 reception+0.042
roc+0.041
tre+0.040
 spe+0.040
Neg logit contributions
ching-0.033
arl-0.033
 ado-0.032
 overboard-0.032
ously-0.031
Tokenana
Token ID2271
Feature activation-9.88e-02

Pos logit contributions
aus+0.479
tre+0.477
ns+0.444
roc+0.427
Tre+0.425
Neg logit contributions
arl-0.421
 ado-0.412
ously-0.410
 overboard-0.391
 fault-0.371
Token"
Token ID1
Feature activation-6.99e-02

Pos logit contributions
aus+0.145
tre+0.144
ns+0.135
roc+0.133
Tre+0.133
Neg logit contributions
arl-0.098
ously-0.097
 ado-0.097
 overboard-0.092
ching-0.089
Token and
Token ID290
Feature activation-2.38e-02

Pos logit contributions
ocol+0.116
 spe+0.114
roc+0.114
tre+0.113
oc+0.113
Neg logit contributions
 ado-0.057
ously-0.055
 off-0.054
arl-0.053
ching-0.051
Token "
Token ID366
Feature activation-3.25e-02

Pos logit contributions
aus+0.023
tre+0.022
ns+0.022
 spe+0.021
ban+0.021
Neg logit contributions
 ado-0.036
ously-0.036
 overboard-0.036
arl-0.034
 fault-0.032
Tokenflower
Token ID25547
Feature activation+2.99e-02

Pos logit contributions
aus+0.044
tre+0.042
 reception+0.041
roc+0.041
 spe+0.041
Neg logit contributions
arl-0.036
ously-0.035
 ado-0.034
ching-0.034
 overboard-0.033
Token trying
Token ID2111
Feature activation+9.33e-02

Pos logit contributions
aus+0.010
tre+0.010
ns+0.009
 spe+0.009
me+0.009
Neg logit contributions
ously-0.009
 ado-0.009
arl-0.009
 overboard-0.009
osity-0.008
Token to
Token ID284
Feature activation-1.34e-03

Pos logit contributions
arl+0.144
 ado+0.133
ously+0.128
osity+0.122
 humor+0.121
Neg logit contributions
tre-0.096
aus-0.093
ns-0.087
ban-0.083
roc-0.082
Token keep
Token ID1394
Feature activation+4.07e-02

Pos logit contributions
aus+0.002
tre+0.002
ns+0.002
me+0.002
ban+0.002
Neg logit contributions
 ado-0.002
ously-0.002
arl-0.002
 overboard-0.001
 fault-0.001
Token the
Token ID262
Feature activation-4.08e-02

Pos logit contributions
 ado+0.043
 overboard+0.042
ously+0.041
arl+0.040
ching+0.037
Neg logit contributions
aus-0.059
tre-0.058
roc-0.054
ban-0.054
ns-0.054
Token he
Token ID339
Feature activation+2.59e-02

Pos logit contributions
aus+0.058
tre+0.057
ns+0.054
roc+0.052
ban+0.052
Neg logit contributions
 ado-0.045
ously-0.044
arl-0.043
 overboard-0.043
 fault-0.039
Tokenns
Token ID5907
Feature activation-3.57e-01

Pos logit contributions
 ado+0.026
 overboard+0.025
ously+0.025
ching+0.024
 kicks+0.024
Neg logit contributions
aus-0.037
tre-0.037
roc-0.036
ban-0.034
library-0.034
Token in
Token ID287
Feature activation-2.26e-02

Pos logit contributions
aus+0.474
tre+0.468
ns+0.446
ban+0.425
me+0.423
Neg logit contributions
 ado-0.415
ously-0.411
arl-0.407
 overboard-0.392
osity-0.363
Token line
Token ID1627
Feature activation-1.18e-01

Pos logit contributions
aus+0.028
tre+0.028
ns+0.026
ban+0.025
roc+0.025
Neg logit contributions
 ado-0.028
ously-0.028
arl-0.028
 overboard-0.027
osity-0.025
Token.
Token ID13
Feature activation+7.60e-03

Pos logit contributions
aus+0.152
tre+0.150
ns+0.141
ban+0.135
me+0.135
Neg logit contributions
 ado-0.142
arl-0.139
ously-0.139
 overboard-0.134
 punk-0.126
Token Billy
Token ID15890
Feature activation+2.25e-02

Pos logit contributions
 ado+0.009
ously+0.009
arl+0.008
 overboard+0.008
 fault+0.008
Neg logit contributions
aus-0.010
tre-0.010
ns-0.010
roc-0.009
ban-0.009
Token watched
Token ID7342
Feature activation+9.89e-02

Pos logit contributions
 ado+0.025
ously+0.025
arl+0.024
 overboard+0.024
 fault+0.022
Neg logit contributions
aus-0.031
tre-0.031
ns-0.029
roc-0.028
ban-0.028
Token little
Token ID1310
Feature activation+1.23e-01

Pos logit contributions
 overboard+0.003
 way+0.003
arl+0.003
ching+0.003
 ado+0.003
Neg logit contributions
aus-0.006
tre-0.006
roc-0.005
me-0.005
ban-0.005
Token shelter
Token ID11772
Feature activation-1.84e-02

Pos logit contributions
 ado+0.118
 overboard+0.117
ously+0.116
arl+0.110
ching+0.107
Neg logit contributions
aus-0.190
tre-0.185
ns-0.172
me-0.169
roc-0.169
Token for
Token ID329
Feature activation-1.67e-01

Pos logit contributions
aus+0.027
tre+0.026
ns+0.025
roc+0.025
ban+0.024
Neg logit contributions
ously-0.019
 ado-0.019
 overboard-0.019
arl-0.019
ching-0.017
Token the
Token ID262
Feature activation-3.72e-02

Pos logit contributions
aus+0.237
tre+0.235
ns+0.220
me+0.218
roc+0.217
Neg logit contributions
 ado-0.179
ously-0.178
arl-0.172
 overboard-0.172
 humor-0.157
Token he
Token ID339
Feature activation+2.64e-02

Pos logit contributions
tre+0.053
aus+0.052
roc+0.049
ns+0.048
ban+0.047
Neg logit contributions
 ado-0.039
ously-0.039
arl-0.039
 overboard-0.038
ching-0.036
Tokenns
Token ID5907
Feature activation-3.57e-01

Pos logit contributions
ously+0.024
 ado+0.024
 way+0.023
 overboard+0.023
ching+0.023
Neg logit contributions
aus-0.039
tre-0.039
roc-0.038
ban-0.037
library-0.036
Token.
Token ID13
Feature activation+9.48e-03

Pos logit contributions
tre+0.425
aus+0.418
ns+0.417
ban+0.383
oc+0.375
Neg logit contributions
arl-0.456
 ado-0.456
ously-0.436
 overboard-0.424
 appearances-0.413
Token Billy
Token ID15890
Feature activation+2.33e-02

Pos logit contributions
 ado+0.012
ously+0.012
arl+0.012
 overboard+0.012
 fault+0.011
Neg logit contributions
aus-0.012
tre-0.011
oc-0.011
ns-0.010
roc-0.010
Token couldn
Token ID3521
Feature activation-5.66e-02

Pos logit contributions
 ado+0.027
ously+0.027
arl+0.027
 overboard+0.026
 fault+0.024
Neg logit contributions
aus-0.031
tre-0.030
ns-0.029
roc-0.028
ban-0.028
Token't
Token ID470
Feature activation+4.45e-03

Pos logit contributions
roc+0.100
ns+0.099
tre+0.098
aus+0.097
 spe+0.093
Neg logit contributions
 overboard-0.039
ching-0.038
arl-0.036
 ado-0.035
ously-0.034
Token believe
Token ID1975
Feature activation+1.40e-02

Pos logit contributions
 ado+0.005
ously+0.005
arl+0.004
 overboard+0.004
 fault+0.004
Neg logit contributions
aus-0.007
tre-0.006
ns-0.006
ban-0.006
roc-0.006
Token was
Token ID373
Feature activation-2.39e-03

Pos logit contributions
ously+0.027
 overboard+0.026
 ado+0.026
 kicks+0.025
 joins+0.024
Neg logit contributions
aus-0.047
tre-0.046
roc-0.045
ban-0.045
ns-0.043
Token playing
Token ID2712
Feature activation+8.85e-02

Pos logit contributions
aus+0.004
tre+0.004
roc+0.004
oc+0.004
ns+0.004
Neg logit contributions
 ado-0.002
 overboard-0.002
ously-0.002
arl-0.002
 fault-0.002
Token with
Token ID351
Feature activation-4.97e-02

Pos logit contributions
 overboard+0.080
ously+0.074
ching+0.074
 shattering+0.071
 ado+0.071
Neg logit contributions
aus-0.146
tre-0.136
ns-0.133
ban-0.133
 spe-0.132
Token some
Token ID617
Feature activation-3.96e-02

Pos logit contributions
tre+0.061
aus+0.060
ns+0.057
 dom+0.056
 av+0.054
Neg logit contributions
arl-0.067
ously-0.066
 ado-0.064
osity-0.061
 overboard-0.059
Token cor
Token ID1162
Feature activation-7.26e-02

Pos logit contributions
aus+0.053
tre+0.052
ns+0.050
me+0.048
ban+0.047
Neg logit contributions
ously-0.047
 ado-0.047
arl-0.045
 overboard-0.044
osity-0.042
Tokenns
Token ID5907
Feature activation-3.56e-01

Pos logit contributions
aus+0.085
tre+0.081
 ruled+0.080
library+0.077
Tre+0.076
Neg logit contributions
 overboard-0.087
 joins-0.087
 ado-0.087
arl-0.086
ching-0.085
Token he
Token ID339
Feature activation+3.05e-02

Pos logit contributions
ns+0.421
tre+0.411
aus+0.407
oc+0.378
me+0.362
Neg logit contributions
arl-0.484
 ado-0.465
ously-0.449
osity-0.435
 overboard-0.432
Token found
Token ID1043
Feature activation-6.53e-02

Pos logit contributions
 ado+0.025
 kicks+0.025
ously+0.025
 overboard+0.024
ching+0.024
Neg logit contributions
aus-0.047
tre-0.046
roc-0.046
ban-0.046
 reception-0.043
Token in
Token ID287
Feature activation-1.86e-02

Pos logit contributions
aus+0.101
tre+0.097
roc+0.095
ban+0.092
library+0.092
Neg logit contributions
ously-0.063
 overboard-0.060
 ado-0.059
arl-0.055
ching-0.055
Token his
Token ID465
Feature activation+6.33e-02

Pos logit contributions
aus+0.029
tre+0.028
ns+0.027
me+0.026
roc+0.026
Neg logit contributions
 ado-0.018
 overboard-0.017
ously-0.017
arl-0.016
 appearances-0.016
Token garden
Token ID11376
Feature activation+8.45e-02

Pos logit contributions
ously+0.089
arl+0.085
 ado+0.081
osity+0.079
urbed+0.077
Neg logit contributions
tre-0.078
ns-0.075
 lin-0.074
 reception-0.073
aus-0.073
Token -
Token ID532
Feature activation-1.32e-02

Pos logit contributions
ously+0.050
 ado+0.049
arl+0.048
 overboard+0.047
 fault+0.043
Neg logit contributions
aus-0.070
tre-0.069
ns-0.066
roc-0.065
ban-0.064
Token a
Token ID257
Feature activation+3.03e-03

Pos logit contributions
aus+0.021
roc+0.020
tre+0.020
ns+0.020
ban+0.019
Neg logit contributions
ously-0.011
 ado-0.011
 overboard-0.011
arl-0.011
 shattering-0.010
Token beautiful
Token ID4950
Feature activation-2.48e-02

Pos logit contributions
 way+0.003
 punk+0.003
arl+0.003
 ado+0.003
ously+0.003
Neg logit contributions
aus-0.005
tre-0.005
roc-0.004
ns-0.004
ban-0.004
Token cheese
Token ID9891
Feature activation-1.67e-01

Pos logit contributions
aus+0.038
tre+0.037
ns+0.036
roc+0.035
ban+0.035
Neg logit contributions
 ado-0.023
ously-0.023
 overboard-0.022
arl-0.022
 way-0.022
Token o
Token ID267
Feature activation-2.01e-01

Pos logit contributions
aus+0.233
tre+0.230
ns+0.217
roc+0.214
ban+0.211
Neg logit contributions
 ado-0.182
ously-0.182
arl-0.177
 overboard-0.175
 fault-0.158
Tokenme
Token ID1326
Feature activation-3.44e-01

Pos logit contributions
roc+0.250
Tre+0.247
 spe+0.240
tre+0.237
aus+0.233
Neg logit contributions
arl-0.252
ching-0.206
ously-0.201
 ado-0.200
osity-0.198
Tokenlette
Token ID21348
Feature activation-1.05e-01

Pos logit contributions
aus+0.448
tre+0.431
roc+0.412
library+0.409
Tre+0.397
Neg logit contributions
ously-0.403
arl-0.385
 ado-0.376
 overboard-0.369
 fault-0.362
Token.
Token ID13
Feature activation+6.14e-03

Pos logit contributions
aus+0.141
tre+0.140
ns+0.133
me+0.129
ban+0.128
Neg logit contributions
 ado-0.121
arl-0.119
ously-0.118
 overboard-0.115
 fault-0.105
Token John
Token ID1757
Feature activation+2.35e-02

Pos logit contributions
 ado+0.007
ously+0.007
arl+0.007
 overboard+0.006
osity+0.006
Neg logit contributions
aus-0.009
tre-0.008
ns-0.008
roc-0.008
ban-0.008
Token was
Token ID373
Feature activation-7.83e-03

Pos logit contributions
ously+0.027
 ado+0.026
 overboard+0.025
arl+0.025
ching+0.023
Neg logit contributions
aus-0.032
tre-0.031
ns-0.029
roc-0.029
ban-0.029
Token so
Token ID523
Feature activation+8.26e-04

Pos logit contributions
aus+0.011
tre+0.011
ns+0.010
roc+0.010
ban+0.010
Neg logit contributions
 ado-0.008
ously-0.008
 overboard-0.008
arl-0.008
 fault-0.007
Token the
Token ID262
Feature activation-3.97e-02

Pos logit contributions
ously+0.005
 off+0.005
 how+0.004
 ado+0.004
 way+0.004
Neg logit contributions
tre-0.013
ns-0.013
 lin-0.012
aus-0.012
me-0.012
Token tree
Token ID5509
Feature activation-5.84e-02

Pos logit contributions
tre+0.064
aus+0.063
 lin+0.060
me+0.060
oc+0.059
Neg logit contributions
arl-0.035
 way-0.033
 living-0.031
 ado-0.031
 two-0.030
Token if
Token ID611
Feature activation+2.58e-02

Pos logit contributions
aus+0.102
tre+0.099
ns+0.098
ban+0.096
me+0.095
Neg logit contributions
ously-0.048
 ado-0.045
 overboard-0.041
arl-0.040
 way-0.039
Token they
Token ID484
Feature activation+2.30e-02

Pos logit contributions
 that+0.013
 way+0.009
ously+0.009
 risking+0.009
 forgetting+0.009
Neg logit contributions
oc-0.056
aus-0.052
icer-0.052
ns-0.052
roc-0.051
Token were
Token ID547
Feature activation-3.71e-02

Pos logit contributions
ously+0.025
 ado+0.023
 overboard+0.023
arl+0.021
 kicks+0.020
Neg logit contributions
aus-0.033
tre-0.032
roc-0.031
me-0.030
 lin-0.030
Token av
Token ID1196
Feature activation-3.44e-01

Pos logit contributions
aus+0.052
tre+0.051
ns+0.049
ban+0.047
roc+0.047
Neg logit contributions
 ado-0.041
ously-0.041
arl-0.040
 overboard-0.039
 fault-0.035
Tokenoc
Token ID420
Feature activation-3.37e-01

Pos logit contributions
 reception+0.183
aus+0.178
tre+0.161
Tre+0.141
 ranc+0.139
Neg logit contributions
 ado-0.664
ously-0.661
 overboard-0.656
ching-0.644
arl-0.626
Tokenados
Token ID22484
Feature activation-1.06e-01

Pos logit contributions
tre+0.504
aus+0.486
me+0.468
 lin+0.468
ns+0.467
Neg logit contributions
ously-0.357
 ado-0.331
 overboard-0.323
arl-0.321
 humor-0.306
Token,
Token ID11
Feature activation+6.56e-03

Pos logit contributions
aus+0.145
tre+0.144
ns+0.136
roc+0.133
ban+0.131
Neg logit contributions
 ado-0.118
ously-0.117
arl-0.114
 overboard-0.113
 fault-0.103
Token and
Token ID290
Feature activation-1.72e-02

Pos logit contributions
ously+0.005
 ado+0.005
 overboard+0.005
arl+0.005
 shattering+0.004
Neg logit contributions
aus-0.011
tre-0.011
roc-0.011
ns-0.011
ban-0.011
Token they
Token ID484
Feature activation+2.15e-02

Pos logit contributions
aus+0.026
tre+0.026
ns+0.024
roc+0.024
ban+0.024
Neg logit contributions
ously-0.017
 ado-0.017
 overboard-0.016
arl-0.015
 humor-0.014
Token
Token ID220
Feature activation-1.28e-02

Pos logit contributions
 ado+0.013
arl+0.013
ously+0.013
 overboard+0.012
ching+0.012
Neg logit contributions
tre-0.011
aus-0.011
oc-0.010
ns-0.010
ban-0.010
Token
Token ID198
Feature activation+1.95e-02

Pos logit contributions
aus+0.018
tre+0.017
ns+0.016
roc+0.016
ban+0.016
Neg logit contributions
 ado-0.014
ously-0.014
arl-0.014
 overboard-0.014
 fault-0.012
Token
Token ID198
Feature activation+2.03e-02

Pos logit contributions
 ado+0.023
ously+0.023
arl+0.022
 overboard+0.022
 fault+0.020
Neg logit contributions
aus-0.026
tre-0.026
ns-0.024
ban-0.023
me-0.023
TokenThe
Token ID464
Feature activation-2.41e-02

Pos logit contributions
 ado+0.024
ously+0.023
arl+0.023
 overboard+0.023
 fault+0.021
Neg logit contributions
aus-0.027
tre-0.027
ns-0.025
ban-0.025
me-0.025
Token o
Token ID267
Feature activation-2.01e-01

Pos logit contributions
tre+0.036
aus+0.034
roc+0.033
ban+0.033
oc+0.032
Neg logit contributions
 way-0.024
 courage-0.023
arl-0.022
 living-0.022
 humor-0.022
Tokenme
Token ID1326
Feature activation-3.44e-01

Pos logit contributions
 spe+0.229
aus+0.215
tre+0.213
roc+0.213
ban+0.212
Neg logit contributions
arl-0.244
ously-0.236
mill-0.231
ching-0.226
 living-0.221
Tokenlette
Token ID21348
Feature activation-1.05e-01

Pos logit contributions
aus+0.445
tre+0.444
ns+0.406
roc+0.401
ban+0.398
Neg logit contributions
ously-0.420
 ado-0.397
arl-0.395
 overboard-0.384
osity-0.364
Token tasted
Token ID29187
Feature activation-9.37e-02

Pos logit contributions
aus+0.132
tre+0.130
ns+0.124
me+0.121
ban+0.118
Neg logit contributions
arl-0.132
 ado-0.131
ously-0.127
 overboard-0.125
 fault-0.116
Token even
Token ID772
Feature activation-1.18e-01

Pos logit contributions
aus+0.128
tre+0.127
ns+0.121
roc+0.117
ban+0.117
Neg logit contributions
ously-0.104
 ado-0.104
 overboard-0.101
arl-0.099
ching-0.092
Token better
Token ID1365
Feature activation-1.07e-03

Pos logit contributions
aus+0.170
tre+0.168
ns+0.159
ban+0.156
roc+0.155
Neg logit contributions
 ado-0.125
ously-0.122
arl-0.119
 overboard-0.118
ching-0.106
Token than
Token ID621
Feature activation+7.79e-03

Pos logit contributions
tre+0.001
aus+0.001
ns+0.001
me+0.001
ban+0.001
Neg logit contributions
arl-0.001
 ado-0.001
 overboard-0.001
ously-0.001
 punk-0.001
Token.
Token ID13
Feature activation-6.07e-04

Pos logit contributions
 ado+0.006
arl+0.006
 overboard+0.006
ously+0.006
 appearances+0.006
Neg logit contributions
tre-0.004
aus-0.004
 spe-0.003
me-0.003
ban-0.003
Token They
Token ID1119
Feature activation+2.02e-02

Pos logit contributions
aus+0.001
tre+0.001
ns+0.001
ban+0.001
roc+0.001
Neg logit contributions
 ado-0.001
ously-0.001
arl-0.001
 overboard-0.001
 fault-0.001
Token ate
Token ID15063
Feature activation-1.74e-02

Pos logit contributions
ously+0.020
 ado+0.020
 overboard+0.019
arl+0.018
 joins+0.017
Neg logit contributions
aus-0.030
tre-0.030
roc-0.028
ns-0.028
ban-0.028
Token the
Token ID262
Feature activation-4.09e-02

Pos logit contributions
aus+0.027
tre+0.026
ns+0.025
roc+0.025
library+0.025
Neg logit contributions
ously-0.017
 overboard-0.016
 ado-0.016
arl-0.015
 off-0.015
Token o
Token ID267
Feature activation-2.03e-01

Pos logit contributions
aus+0.055
tre+0.054
roc+0.052
ns+0.051
ban+0.051
Neg logit contributions
 ado-0.044
ously-0.044
 overboard-0.043
arl-0.043
 way-0.041
Tokenme
Token ID1326
Feature activation-3.44e-01

Pos logit contributions
roc+0.232
library+0.232
aus+0.224
Tre+0.221
tre+0.219
Neg logit contributions
arl-0.241
 ado-0.218
ously-0.213
mill-0.207
ching-0.207
Tokenlette
Token ID21348
Feature activation-1.04e-01

Pos logit contributions
aus+0.426
tre+0.411
library+0.405
roc+0.380
Tre+0.376
Neg logit contributions
ously-0.417
 ado-0.395
 overboard-0.392
arl-0.389
ching-0.382
Token together
Token ID1978
Feature activation+2.87e-02

Pos logit contributions
aus+0.144
tre+0.142
ns+0.134
roc+0.133
ban+0.131
Neg logit contributions
ously-0.115
 ado-0.115
 overboard-0.112
arl-0.109
 fault-0.101
Token and
Token ID290
Feature activation-2.29e-02

Pos logit contributions
 ado+0.035
ously+0.035
arl+0.034
 overboard+0.033
 fault+0.031
Neg logit contributions
aus-0.037
tre-0.036
ns-0.034
ban-0.033
roc-0.033
Token it
Token ID340
Feature activation-3.65e-02

Pos logit contributions
aus+0.033
tre+0.033
ns+0.031
roc+0.031
ban+0.030
Neg logit contributions
ously-0.024
 ado-0.023
 overboard-0.022
arl-0.022
 humor-0.020
Token was
Token ID373
Feature activation-1.47e-02

Pos logit contributions
aus+0.052
tre+0.051
roc+0.050
ban+0.048
ns+0.048
Neg logit contributions
ously-0.038
 ado-0.036
 overboard-0.036
ching-0.034
 joins-0.034
Token.
Token ID13
Feature activation-3.01e-04

Pos logit contributions
aus+0.141
tre+0.140
ns+0.132
roc+0.132
ban+0.128
Neg logit contributions
ously-0.101
 ado-0.100
arl-0.097
 overboard-0.096
 fault-0.089
Token Mom
Token ID11254
Feature activation-1.56e-02

Pos logit contributions
aus+0.001
tre+0.000
ns+0.000
ban+0.000
oc+0.000
Neg logit contributions
 ado-0.000
arl-0.000
ously-0.000
 overboard-0.000
 shattering-0.000
Token was
Token ID373
Feature activation-1.14e-02

Pos logit contributions
tre+0.024
aus+0.023
roc+0.023
ban+0.022
Tre+0.022
Neg logit contributions
ously-0.016
 ado-0.014
 overboard-0.013
 unconscious-0.013
ching-0.013
Token making
Token ID1642
Feature activation+4.48e-02

Pos logit contributions
aus+0.018
tre+0.018
oc+0.018
roc+0.017
ban+0.017
Neg logit contributions
arl-0.011
 ado-0.010
 overboard-0.010
 unconscious-0.010
 shattering-0.010
Token o
Token ID267
Feature activation-2.04e-01

Pos logit contributions
 off+0.039
 two+0.036
 that+0.035
 soap+0.035
 fault+0.035
Neg logit contributions
oc-0.078
tre-0.076
ns-0.073
Tre-0.072
aus-0.070
Tokenme
Token ID1326
Feature activation-3.43e-01

Pos logit contributions
 spe+0.244
Tre+0.235
 ranc+0.224
 ruled+0.223
library+0.221
Neg logit contributions
arl-0.245
ching-0.237
itating-0.233
 ado-0.222
pping-0.215
Tokenlets
Token ID5289
Feature activation-6.87e-02

Pos logit contributions
tre+0.518
 spe+0.509
Tre+0.498
aus+0.494
library+0.492
Neg logit contributions
ching-0.329
arl-0.327
ously-0.327
ps-0.318
 ado-0.307
Token for
Token ID329
Feature activation-1.73e-01

Pos logit contributions
aus+0.106
tre+0.103
roc+0.100
ns+0.098
oc+0.097
Neg logit contributions
ously-0.065
 ado-0.064
 overboard-0.064
arl-0.061
 off-0.058
Token the
Token ID262
Feature activation-4.14e-02

Pos logit contributions
oc+0.298
tre+0.280
aus+0.277
ns+0.270
roc+0.268
Neg logit contributions
 ado-0.134
arl-0.133
 appearances-0.128
ously-0.128
 spilling-0.125
Token family
Token ID1641
Feature activation-3.52e-02

Pos logit contributions
tre+0.061
aus+0.060
roc+0.059
oc+0.058
 Si+0.053
Neg logit contributions
 way-0.046
 living-0.041
 toy-0.040
 fault-0.039
arl-0.037
Token.
Token ID13
Feature activation+1.60e-04

Pos logit contributions
aus+0.047
tre+0.046
ns+0.044
ban+0.042
me+0.042
Neg logit contributions
 ado-0.041
ously-0.041
arl-0.041
 overboard-0.039
osity-0.036
Token It
Token ID632
Feature activation-4.84e-02

Pos logit contributions
aus+0.007
tre+0.007
ns+0.007
me+0.006
 spe+0.006
Neg logit contributions
ously-0.004
 ado-0.004
 overboard-0.004
arl-0.004
ching-0.004
Token sounded
Token ID14846
Feature activation-8.86e-03

Pos logit contributions
aus+0.061
tre+0.060
ns+0.056
roc+0.054
ban+0.054
Neg logit contributions
 ado-0.060
ously-0.060
arl-0.059
 overboard-0.058
 fault-0.053
Token like
Token ID588
Feature activation-2.09e-02

Pos logit contributions
aus+0.010
tre+0.010
ns+0.010
ban+0.009
 spe+0.009
Neg logit contributions
 ado-0.012
arl-0.012
ously-0.012
 overboard-0.011
osity-0.011
Token a
Token ID257
Feature activation-1.75e-03

Pos logit contributions
aus+0.028
tre+0.028
ns+0.026
roc+0.026
ban+0.026
Neg logit contributions
 ado-0.024
ously-0.023
arl-0.023
 overboard-0.023
 fault-0.021
Token "
Token ID366
Feature activation-3.71e-02

Pos logit contributions
aus+0.002
tre+0.002
ns+0.002
ban+0.002
roc+0.002
Neg logit contributions
 ado-0.002
ously-0.002
arl-0.002
 overboard-0.002
 fault-0.002
Tokenme
Token ID1326
Feature activation-3.43e-01

Pos logit contributions
aus+0.052
 pizz+0.049
 dom+0.047
 reception+0.047
roc+0.047
Neg logit contributions
ching-0.039
 overboard-0.037
arl-0.037
 ado-0.036
ously-0.035
Tokenow
Token ID322
Feature activation-1.24e-01

Pos logit contributions
aus+0.379
tre+0.371
ns+0.345
roc+0.331
ban+0.331
Neg logit contributions
 ado-0.479
ously-0.477
arl-0.474
 overboard-0.464
 fault-0.430
Token".
Token ID1911
Feature activation+2.80e-02

Pos logit contributions
aus+0.195
tre+0.194
ban+0.180
roc+0.180
Tre+0.180
Neg logit contributions
arl-0.114
 ado-0.111
ously-0.110
 overboard-0.107
ching-0.105
Token The
Token ID383
Feature activation-8.53e-02

Pos logit contributions
ously+0.027
 ado+0.027
 overboard+0.026
arl+0.025
 unconscious+0.023
Neg logit contributions
aus-0.043
tre-0.041
ns-0.040
 spe-0.039
roc-0.039
Token boy
Token ID2933
Feature activation-6.39e-02

Pos logit contributions
aus+0.111
tre+0.109
ns+0.105
roc+0.101
ban+0.101
Neg logit contributions
 ado-0.102
ously-0.102
arl-0.099
 overboard-0.098
 fault-0.090
Token stopped
Token ID5025
Feature activation+7.34e-02

Pos logit contributions
aus+0.092
tre+0.090
ns+0.083
roc+0.083
ban+0.082
Neg logit contributions
ously-0.070
 ado-0.068
 overboard-0.065
arl-0.062
ching-0.059
Token "
Token ID366
Feature activation-3.01e-02

Pos logit contributions
 ado+0.013
ously+0.013
arl+0.013
 overboard+0.013
 fault+0.012
Neg logit contributions
aus-0.012
tre-0.011
ns-0.011
ban-0.010
me-0.010
TokenI
Token ID40
Feature activation+4.09e-02

Pos logit contributions
aus+0.047
tre+0.045
ns+0.044
roc+0.043
ban+0.042
Neg logit contributions
ously-0.028
 ado-0.028
arl-0.028
 overboard-0.027
ching-0.024
Token will
Token ID481
Feature activation-8.15e-03

Pos logit contributions
 ado+0.037
ously+0.036
arl+0.036
 overboard+0.035
ching+0.034
Neg logit contributions
aus-0.064
tre-0.062
ns-0.060
roc-0.059
ban-0.059
Token buy
Token ID2822
Feature activation-7.92e-02

Pos logit contributions
tre+0.008
ban+0.008
me+0.008
aus+0.008
 spe+0.008
Neg logit contributions
ously-0.012
 ado-0.012
 overboard-0.011
 unconscious-0.011
arl-0.011
Token more
Token ID517
Feature activation-5.27e-03

Pos logit contributions
aus+0.118
tre+0.116
ns+0.113
oc+0.110
roc+0.110
Neg logit contributions
 ado-0.075
 overboard-0.072
ously-0.069
arl-0.069
 humor-0.068
Token av
Token ID1196
Feature activation-3.43e-01

Pos logit contributions
aus+0.007
tre+0.007
ns+0.006
roc+0.006
ban+0.006
Neg logit contributions
 ado-0.006
ously-0.006
arl-0.006
 overboard-0.006
 fault-0.006
Tokenoc
Token ID420
Feature activation-3.31e-01

Pos logit contributions
 reception+0.266
 ranc+0.224
Tre+0.214
 symp+0.211
aus+0.208
Neg logit contributions
 ado-0.580
ching-0.575
 overboard-0.563
ously-0.557
arl-0.548
Tokenados
Token ID22484
Feature activation-1.04e-01

Pos logit contributions
tre+0.526
Tre+0.496
aus+0.493
 spe+0.491
library+0.489
Neg logit contributions
ously-0.306
 ado-0.286
arl-0.282
 humor-0.280
 overboard-0.274
Token tomorrow
Token ID9439
Feature activation-1.07e-01

Pos logit contributions
aus+0.140
tre+0.138
ns+0.131
roc+0.130
ban+0.126
Neg logit contributions
 ado-0.118
ously-0.118
arl-0.116
 overboard-0.115
 fault-0.106
Token.
Token ID13
Feature activation-2.87e-04

Pos logit contributions
aus+0.168
tre+0.166
roc+0.159
ns+0.158
ban+0.153
Neg logit contributions
 ado-0.097
arl-0.094
ously-0.094
 overboard-0.090
 fault-0.086
Token You
Token ID921
Feature activation-4.83e-02

Pos logit contributions
aus+0.000
tre+0.000
ns+0.000
roc+0.000
me+0.000
Neg logit contributions
arl-0.000
 ado-0.000
ously-0.000
 overboard-0.000
 humor-0.000
Token so
Token ID523
Feature activation-4.70e-03

Pos logit contributions
aus+0.015
tre+0.015
ns+0.014
roc+0.013
ban+0.013
Neg logit contributions
 ado-0.012
ously-0.012
 overboard-0.012
arl-0.011
 fault-0.011
Token happy
Token ID3772
Feature activation-6.19e-02

Pos logit contributions
aus+0.006
tre+0.005
ns+0.005
roc+0.005
ban+0.005
Neg logit contributions
 ado-0.006
ously-0.006
arl-0.006
 overboard-0.006
 fault-0.006
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
tre+0.080
aus+0.078
ns+0.073
ban+0.072
me+0.071
Neg logit contributions
arl-0.076
 ado-0.074
 overboard-0.071
ously-0.071
 fault-0.065
Token brought
Token ID3181
Feature activation-6.64e-03

Pos logit contributions
aus+0.021
tre+0.020
ns+0.019
roc+0.019
ban+0.019
Neg logit contributions
 ado-0.016
ously-0.016
arl-0.016
 overboard-0.015
 fault-0.014
Token the
Token ID262
Feature activation-3.88e-02

Pos logit contributions
aus+0.010
tre+0.010
ban+0.010
oc+0.010
me+0.010
Neg logit contributions
 ado-0.006
ously-0.006
 overboard-0.006
 off-0.006
arl-0.005
Token av
Token ID1196
Feature activation-3.42e-01

Pos logit contributions
aus+0.042
tre+0.040
ban+0.040
me+0.039
 lin+0.039
Neg logit contributions
 ado-0.054
 way-0.052
arl-0.051
ously-0.051
 overboard-0.051
Tokenoc
Token ID420
Feature activation-3.35e-01

Pos logit contributions
aus+0.178
 reception+0.176
tre+0.161
ban+0.144
 ruled+0.143
Neg logit contributions
 ado-0.635
ously-0.633
 overboard-0.631
ching-0.623
arl-0.596
Tokenados
Token ID22484
Feature activation-1.05e-01

Pos logit contributions
tre+0.531
aus+0.520
ns+0.498
 lin+0.490
me+0.490
Neg logit contributions
ously-0.306
arl-0.290
 ado-0.289
 overboard-0.284
 fault-0.254
Token home
Token ID1363
Feature activation-2.66e-02

Pos logit contributions
aus+0.156
tre+0.153
ns+0.146
roc+0.143
ban+0.142
Neg logit contributions
ously-0.105
 ado-0.105
 overboard-0.102
arl-0.102
 fault-0.091
Token to
Token ID284
Feature activation-6.89e-03

Pos logit contributions
aus+0.047
roc+0.045
tre+0.043
ns+0.042
oc+0.042
Neg logit contributions
ously-0.022
 overboard-0.020
 ado-0.019
 off-0.018
 that-0.017
Token share
Token ID2648
Feature activation-2.48e-02

Pos logit contributions
tre+0.010
aus+0.010
ns+0.010
ban+0.010
roc+0.010
Neg logit contributions
 ado-0.006
arl-0.006
ously-0.006
 overboard-0.006
 humor-0.006
Token always
Token ID1464
Feature activation+9.79e-03

Pos logit contributions
aus+0.066
tre+0.065
ns+0.061
ban+0.059
roc+0.059
Neg logit contributions
 ado-0.056
ously-0.056
arl-0.055
 overboard-0.054
 fault-0.049
Token survive
Token ID7866
Feature activation-1.24e-01

Pos logit contributions
arl+0.014
ously+0.014
 ado+0.014
 overboard+0.013
 unconscious+0.012
Neg logit contributions
tre-0.010
aus-0.010
ns-0.010
 spe-0.009
roc-0.009
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
aus+0.162
tre+0.161
ns+0.153
me+0.146
roc+0.146
Neg logit contributions
 ado-0.146
ously-0.145
arl-0.143
 overboard-0.140
 fault-0.128
Token<|endoftext|>
Token ID50256
Feature activation-4.68e-02

Pos logit contributions
 ado+0.014
ously+0.013
arl+0.013
 overboard+0.013
 fault+0.012
Neg logit contributions
aus-0.018
tre-0.017
ns-0.017
roc-0.016
ban-0.016
TokenEs
Token ID23041
Feature activation-1.77e-01

Pos logit contributions
aus+0.067
tre+0.065
roc+0.063
ns+0.063
ban+0.061
Neg logit contributions
 ado-0.049
ously-0.049
arl-0.048
 overboard-0.047
ching-0.044
Tokenme
Token ID1326
Feature activation-3.42e-01

Pos logit contributions
 spe+0.243
aus+0.217
 av+0.217
ban+0.215
ns+0.215
Neg logit contributions
 ado-0.205
ously-0.197
arl-0.186
ching-0.179
 overboard-0.174
Token was
Token ID373
Feature activation-3.93e-03

Pos logit contributions
aus+0.495
tre+0.493
ns+0.467
ban+0.453
 spe+0.451
Neg logit contributions
ously-0.358
 ado-0.350
arl-0.342
 overboard-0.337
 fault-0.305
Token hungry
Token ID14720
Feature activation-1.15e-01

Pos logit contributions
aus+0.005
tre+0.005
ns+0.005
me+0.005
library+0.004
Neg logit contributions
ously-0.005
 ado-0.005
arl-0.005
 overboard-0.004
 appearances-0.004
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
aus+0.160
tre+0.159
ns+0.151
roc+0.148
ban+0.145
Neg logit contributions
ously-0.125
 ado-0.125
arl-0.122
 overboard-0.121
 fault-0.109
Token She
Token ID1375
Feature activation+5.88e-03

Pos logit contributions
 ado+0.011
arl+0.010
ously+0.010
 overboard+0.009
 fault+0.009
Neg logit contributions
ns-0.020
aus-0.019
oc-0.019
tre-0.019
roc-0.019
Token wanted
Token ID2227
Feature activation+5.13e-02

Pos logit contributions
ously+0.004
 kicks+0.004
 ado+0.003
 kicked+0.003
 way+0.003
Neg logit contributions
aus-0.010
roc-0.010
tre-0.010
ban-0.010
Tre-0.009
Token the
Token ID262
Feature activation-2.49e-02

Pos logit contributions
aus+0.026
tre+0.025
ns+0.024
roc+0.024
me+0.024
Neg logit contributions
 ado-0.017
ously-0.017
arl-0.016
 overboard-0.016
 way-0.016
Token hot
Token ID3024
Feature activation-4.65e-02

Pos logit contributions
tre+0.034
aus+0.034
roc+0.031
ban+0.031
ns+0.031
Neg logit contributions
arl-0.027
 ado-0.027
 overboard-0.027
ously-0.026
 way-0.025
Token oven
Token ID14361
Feature activation-1.07e-01

Pos logit contributions
aus+0.059
tre+0.056
ns+0.054
oc+0.053
me+0.052
Neg logit contributions
ously-0.060
 ado-0.059
arl-0.057
 overboard-0.057
osity-0.054
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
aus+0.165
tre+0.163
roc+0.157
ns+0.152
ban+0.150
Neg logit contributions
ously-0.101
 ado-0.101
arl-0.097
 overboard-0.097
 fault-0.089
Token Es
Token ID8678
Feature activation-2.29e-01

Pos logit contributions
 ado+0.012
ously+0.012
arl+0.012
 overboard+0.011
 shattering+0.010
Neg logit contributions
ns-0.019
aus-0.019
tre-0.018
oc-0.018
roc-0.018
Tokenme
Token ID1326
Feature activation-3.41e-01

Pos logit contributions
 spe+0.183
roc+0.161
 reception+0.153
ban+0.152
 ranc+0.148
Neg logit contributions
ously-0.366
ching-0.365
 ado-0.358
mill-0.353
 blows-0.344
Token was
Token ID373
Feature activation-2.84e-03

Pos logit contributions
aus+0.490
tre+0.490
ns+0.461
ban+0.449
roc+0.445
Neg logit contributions
ously-0.364
 ado-0.354
arl-0.347
 overboard-0.342
ching-0.310
Token so
Token ID523
Feature activation+4.90e-03

Pos logit contributions
aus+0.004
tre+0.004
oc+0.004
roc+0.004
ns+0.004
Neg logit contributions
 ado-0.003
ously-0.003
 overboard-0.003
arl-0.003
 unconscious-0.003
Token disappointed
Token ID11679
Feature activation-5.39e-03

Pos logit contributions
arl+0.004
 ado+0.004
 overboard+0.004
ously+0.004
 kicks+0.004
Neg logit contributions
tre-0.008
aus-0.008
oc-0.008
ban-0.007
ns-0.007
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
tre+0.005
aus+0.005
me+0.004
 reception+0.004
ban+0.004
Neg logit contributions
 overboard-0.008
arl-0.008
 ado-0.008
 punk-0.008
 humor-0.008
Token She
Token ID1375
Feature activation+7.10e-03

Pos logit contributions
 ado+0.013
ously+0.012
arl+0.012
 overboard+0.012
 fault+0.011
Neg logit contributions
aus-0.019
ns-0.018
tre-0.018
oc-0.017
roc-0.017
Token oven
Token ID14361
Feature activation-1.10e-01

Pos logit contributions
aus+0.054
tre+0.053
ns+0.051
roc+0.049
ban+0.049
Neg logit contributions
 ado-0.042
ously-0.042
arl-0.041
 overboard-0.040
 fault-0.037
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
aus+0.185
tre+0.181
roc+0.170
ns+0.169
Tre+0.165
Neg logit contributions
ously-0.099
 ado-0.093
arl-0.090
 overboard-0.087
 way-0.085
Token He
Token ID679
Feature activation-1.75e-02

Pos logit contributions
aus+0.016
tre+0.015
ns+0.015
me+0.015
ban+0.015
Neg logit contributions
arl-0.010
 ado-0.009
ously-0.009
 overboard-0.009
osity-0.008
Token saw
Token ID2497
Feature activation-2.85e-02

Pos logit contributions
aus+0.032
roc+0.031
tre+0.031
ban+0.030
ocol+0.030
Neg logit contributions
ously-0.012
 sprayed-0.012
 joins-0.011
 kicked-0.011
 kicks-0.011
Token the
Token ID262
Feature activation-3.70e-02

Pos logit contributions
aus+0.028
ns+0.027
tre+0.026
 dom+0.025
 av+0.025
Neg logit contributions
 ado-0.043
ously-0.043
arl-0.043
 overboard-0.041
osity-0.041
Token towel
Token ID24808
Feature activation+1.72e-01

Pos logit contributions
aus+0.053
tre+0.053
ns+0.049
roc+0.049
ban+0.048
Neg logit contributions
arl-0.039
 ado-0.039
 overboard-0.038
ously-0.038
 fault-0.036
Token inside
Token ID2641
Feature activation+8.96e-02

Pos logit contributions
 ado+0.195
ously+0.194
arl+0.190
 overboard+0.188
 fault+0.170
Neg logit contributions
aus-0.233
tre-0.230
ns-0.218
roc-0.211
ban-0.210
Token the
Token ID262
Feature activation-3.71e-02

Pos logit contributions
ously+0.092
 ado+0.089
arl+0.083
 overboard+0.083
 unconscious+0.077
Neg logit contributions
aus-0.134
tre-0.131
ns-0.128
roc-0.126
ban-0.124
Token oven
Token ID14361
Feature activation-1.10e-01

Pos logit contributions
aus+0.054
tre+0.052
roc+0.049
ns+0.048
ban+0.047
Neg logit contributions
arl-0.040
 ado-0.039
 overboard-0.038
ously-0.037
 shattering-0.036
Token.
Token ID13
Feature activation-9.93e-03

Pos logit contributions
aus+0.176
tre+0.172
roc+0.164
ns+0.162
Tre+0.156
Neg logit contributions
ously-0.102
 ado-0.101
arl-0.098
 overboard-0.095
 shattering-0.087
Token He
Token ID679
Feature activation-1.71e-02

Pos logit contributions
aus+0.016
tre+0.016
ns+0.016
ban+0.016
roc+0.016
Neg logit contributions
arl-0.009
 ado-0.008
ously-0.007
 overboard-0.007
 wipes-0.007
Token find
Token ID1064
Feature activation-4.61e-02

Pos logit contributions
 ado+0.003
ously+0.003
arl+0.003
 overboard+0.002
 fault+0.002
Neg logit contributions
aus-0.003
tre-0.003
ns-0.003
ban-0.003
me-0.003
Token all
Token ID477
Feature activation+2.14e-02

Pos logit contributions
aus+0.061
tre+0.061
ns+0.058
ban+0.055
roc+0.055
Neg logit contributions
 ado-0.053
ously-0.053
arl-0.053
 overboard-0.051
osity-0.047
Token sorts
Token ID10524
Feature activation-3.26e-02

Pos logit contributions
arl+0.028
ously+0.028
 ado+0.028
 overboard+0.027
 punk+0.025
Neg logit contributions
tre-0.026
aus-0.026
ns-0.023
me-0.023
roc-0.023
Token of
Token ID286
Feature activation+2.87e-02

Pos logit contributions
tre+0.036
aus+0.035
ban+0.032
ns+0.032
me+0.031
Neg logit contributions
 ado-0.045
 overboard-0.044
arl-0.044
ously-0.043
 shattering-0.042
Token different
Token ID1180
Feature activation+4.53e-02

Pos logit contributions
 ado+0.032
ously+0.031
arl+0.030
 overboard+0.029
 humor+0.028
Neg logit contributions
aus-0.040
tre-0.039
ns-0.037
me-0.036
roc-0.036
Token ways
Token ID2842
Feature activation+1.58e-01

Pos logit contributions
 ado+0.055
ously+0.055
arl+0.054
 overboard+0.053
 fault+0.049
Neg logit contributions
aus-0.058
tre-0.056
ns-0.053
ban-0.051
roc-0.051
Token to
Token ID284
Feature activation-2.58e-03

Pos logit contributions
arl+0.224
 ado+0.222
ously+0.214
 overboard+0.210
 shattering+0.207
Neg logit contributions
tre-0.171
aus-0.154
me-0.147
ban-0.147
ns-0.147
Token make
Token ID787
Feature activation-1.22e-03

Pos logit contributions
aus+0.004
tre+0.003
ns+0.003
roc+0.003
me+0.003
Neg logit contributions
 ado-0.003
ously-0.003
arl-0.003
 overboard-0.003
 humor-0.003
Token your
Token ID534
Feature activation+9.62e-02

Pos logit contributions
aus+0.002
tre+0.002
ns+0.002
 spe+0.002
roc+0.002
Neg logit contributions
 ado-0.001
 overboard-0.001
 off-0.001
 way-0.001
 fault-0.001
Token dreams
Token ID10625
Feature activation-8.08e-02

Pos logit contributions
 ado+0.093
 way+0.092
 fault+0.090
 overboard+0.088
arl+0.087
Neg logit contributions
aus-0.151
tre-0.142
ban-0.137
me-0.133
ns-0.133
Token come
Token ID1282
Feature activation-8.43e-02

Pos logit contributions
roc+0.125
aus+0.125
tre+0.121
oc+0.119
ns+0.117
Neg logit contributions
 ado-0.074
arl-0.073
 overboard-0.070
 off-0.069
ously-0.068
Token time
Token ID640
Feature activation+3.36e-02

Pos logit contributions
 ado+0.010
ously+0.009
 overboard+0.009
arl+0.009
osity+0.009
Neg logit contributions
aus-0.009
tre-0.008
ns-0.008
me-0.008
oc-0.008
Token,
Token ID11
Feature activation+1.51e-02

Pos logit contributions
 ado+0.042
ously+0.042
arl+0.041
 overboard+0.040
 humor+0.038
Neg logit contributions
tre-0.042
aus-0.041
ns-0.037
me-0.037
roc-0.037
Token there
Token ID612
Feature activation+4.71e-02

Pos logit contributions
ously+0.022
 ado+0.021
 humor+0.021
arl+0.020
 fault+0.020
Neg logit contributions
tre-0.017
aus-0.015
me-0.015
 spe-0.015
 ruled-0.014
Token was
Token ID373
Feature activation-6.20e-03

Pos logit contributions
arl+0.066
 ado+0.066
ously+0.064
 overboard+0.063
osity+0.059
Neg logit contributions
tre-0.053
aus-0.052
ns-0.048
me-0.048
oc-0.047
Token a
Token ID257
Feature activation+7.56e-03

Pos logit contributions
aus+0.010
tre+0.010
roc+0.009
ns+0.009
ban+0.009
Neg logit contributions
 overboard-0.005
 ado-0.005
ously-0.005
arl-0.005
 punk-0.005
Token little
Token ID1310
Feature activation+1.29e-01

Pos logit contributions
 punk+0.003
arl+0.003
 toy+0.002
 living+0.002
 way+0.002
Neg logit contributions
tre-0.016
aus-0.016
ban-0.015
ns-0.015
roc-0.015
Token boy
Token ID2933
Feature activation-5.31e-02

Pos logit contributions
arl+0.107
 ado+0.100
 overboard+0.097
 punk+0.096
ously+0.093
Neg logit contributions
tre-0.215
aus-0.211
ns-0.205
ban-0.201
roc-0.197
Token named
Token ID3706
Feature activation-5.42e-02

Pos logit contributions
tre+0.054
aus+0.053
ns+0.050
oc+0.049
me+0.048
Neg logit contributions
 ado-0.078
ously-0.078
arl-0.078
 overboard-0.076
 fault-0.072
Token Tim
Token ID5045
Feature activation-4.59e-02

Pos logit contributions
aus+0.055
tre+0.052
ns+0.047
me+0.045
roc+0.044
Neg logit contributions
ously-0.088
 ado-0.086
arl-0.081
 overboard-0.080
ching-0.078
Tokenmy
Token ID1820
Feature activation+5.26e-02

Pos logit contributions
tre+0.042
aus+0.041
oc+0.040
 ruled+0.038
ns+0.036
Neg logit contributions
arl-0.073
osity-0.072
 ado-0.071
 overboard-0.070
 humor-0.069
Token.
Token ID13
Feature activation+6.98e-03

Pos logit contributions
 ado+0.089
arl+0.089
ously+0.083
 overboard+0.082
osity+0.082
Neg logit contributions
oc-0.052
tre-0.044
 ruled-0.043
ns-0.040
me-0.039
Token his
Token ID465
Feature activation+5.46e-02

Pos logit contributions
aus+0.007
tre+0.007
ns+0.007
me+0.006
ban+0.006
Neg logit contributions
 ado-0.006
ously-0.006
arl-0.005
 overboard-0.005
 fault-0.005
Token things
Token ID1243
Feature activation-6.33e-02

Pos logit contributions
 ado+0.056
ously+0.055
arl+0.055
 overboard+0.054
 fault+0.050
Neg logit contributions
aus-0.080
tre-0.079
ns-0.075
ban-0.073
me-0.073
Token.
Token ID13
Feature activation-3.97e-03

Pos logit contributions
tre+0.072
aus+0.072
ns+0.067
ban+0.064
me+0.064
Neg logit contributions
 ado-0.084
arl-0.083
ously-0.083
 overboard-0.077
 humor-0.076
Token He
Token ID679
Feature activation-9.30e-03

Pos logit contributions
aus+0.005
tre+0.005
ns+0.005
roc+0.005
ban+0.005
Neg logit contributions
 ado-0.005
ously-0.005
arl-0.005
 overboard-0.005
ching-0.004
Token learned
Token ID4499
Feature activation-3.78e-02

Pos logit contributions
tre+0.014
aus+0.014
ban+0.013
ns+0.013
roc+0.013
Neg logit contributions
ously-0.009
 ado-0.009
 overboard-0.008
 kicks-0.008
 joins-0.008
Token that
Token ID326
Feature activation+1.86e-01

Pos logit contributions
aus+0.053
tre+0.050
roc+0.050
 av+0.049
 dom+0.048
Neg logit contributions
 ado-0.040
ously-0.038
arl-0.038
 overboard-0.036
osity-0.036
Token some
Token ID617
Feature activation-4.57e-02

Pos logit contributions
ously+0.183
 ado+0.183
 overboard+0.180
arl+0.171
 fault+0.166
Neg logit contributions
aus-0.279
tre-0.275
ns-0.267
me-0.256
roc-0.256
Token things
Token ID1243
Feature activation-6.49e-02

Pos logit contributions
aus+0.048
tre+0.047
ns+0.044
roc+0.042
me+0.041
Neg logit contributions
 ado-0.066
arl-0.066
ously-0.065
 overboard-0.064
inated-0.059
Token belonged
Token ID19611
Feature activation-1.51e-01

Pos logit contributions
aus+0.070
tre+0.070
ns+0.064
 ruled+0.061
me+0.061
Neg logit contributions
 ado-0.090
arl-0.090
ously-0.089
 overboard-0.083
 punk-0.080
Token to
Token ID284
Feature activation-7.27e-03

Pos logit contributions
aus+0.202
tre+0.201
ns+0.189
roc+0.183
ban+0.182
Neg logit contributions
 ado-0.174
ously-0.172
arl-0.170
 overboard-0.166
 shattering-0.150
Token others
Token ID1854
Feature activation+4.55e-02

Pos logit contributions
aus+0.008
tre+0.008
ns+0.008
roc+0.008
ban+0.007
Neg logit contributions
ously-0.010
 ado-0.010
arl-0.010
 overboard-0.010
ching-0.009
Token dog
Token ID3290
Feature activation+1.02e-02

Pos logit contributions
tre+0.110
aus+0.110
ns+0.102
me+0.100
ban+0.100
Neg logit contributions
 ado-0.073
arl-0.073
ously-0.072
 overboard-0.070
 fault-0.064
Token ch
Token ID442
Feature activation-9.78e-03

Pos logit contributions
 ado+0.010
ously+0.010
arl+0.010
 overboard+0.010
 fault+0.009
Neg logit contributions
aus-0.015
tre-0.015
ns-0.014
ban-0.014
roc-0.014
Tokenases
Token ID1386
Feature activation+9.63e-02

Pos logit contributions
aus+0.015
roc+0.015
tre+0.015
ns+0.015
oc+0.014
Neg logit contributions
 ado-0.009
ously-0.009
 overboard-0.008
 shattering-0.008
arl-0.008
Token Tom
Token ID4186
Feature activation-2.63e-02

Pos logit contributions
ously+0.097
 ado+0.094
arl+0.092
 overboard+0.091
 off+0.090
Neg logit contributions
aus-0.143
tre-0.141
ban-0.135
 spe-0.134
me-0.133
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
tre+0.024
ns+0.024
aus+0.023
oc+0.023
me+0.021
Neg logit contributions
arl-0.040
 ado-0.039
osity-0.038
ously-0.038
 humor-0.038
Token bites
Token ID26081
Feature activation+1.28e-01

Pos logit contributions
ns+0.035
aus+0.034
tre+0.034
 spe+0.033
oc+0.032
Neg logit contributions
ously-0.028
 ado-0.027
arl-0.027
 overboard-0.026
osity-0.025
Token his
Token ID465
Feature activation+5.51e-02

Pos logit contributions
 ado+0.159
ously+0.159
arl+0.156
 overboard+0.154
 fault+0.141
Neg logit contributions
aus-0.161
tre-0.159
ns-0.150
roc-0.144
ban-0.144
Token leg
Token ID1232
Feature activation+5.39e-02

Pos logit contributions
 ado+0.056
ously+0.056
arl+0.054
 overboard+0.052
osity+0.049
Neg logit contributions
aus-0.081
tre-0.080
ns-0.077
roc-0.074
ban-0.074
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
 ado+0.077
arl+0.075
osity+0.074
ously+0.074
 humor+0.071
Neg logit contributions
tre-0.054
ns-0.054
aus-0.053
ban-0.051
oc-0.051
Token Tom
Token ID4186
Feature activation-3.01e-02

Pos logit contributions
aus+0.016
tre+0.016
ns+0.015
me+0.015
ban+0.015
Neg logit contributions
arl-0.012
 ado-0.012
 overboard-0.011
ously-0.011
 kicks-0.010
Token screams
Token ID26557
Feature activation+1.44e-01

Pos logit contributions
ns+0.042
aus+0.040
oc+0.040
tre+0.039
me+0.038
Neg logit contributions
 ado-0.034
arl-0.034
ously-0.034
osity-0.032
 overboard-0.031
Token's
Token ID338
Feature activation-4.83e-02

Pos logit contributions
aus+0.028
me+0.027
tre+0.026
ns+0.026
oc+0.023
Neg logit contributions
arl-0.038
 ado-0.038
ously-0.037
 overboard-0.035
 humor-0.033
Token really
Token ID1107
Feature activation-5.34e-02

Pos logit contributions
aus+0.057
tre+0.054
ns+0.052
me+0.051
 spe+0.050
Neg logit contributions
ously-0.069
 ado-0.067
arl-0.065
 overboard-0.061
osity-0.059
Token good
Token ID922
Feature activation+5.14e-02

Pos logit contributions
aus+0.064
tre+0.061
ns+0.059
me+0.056
roc+0.056
Neg logit contributions
ously-0.075
 ado-0.073
arl-0.070
 overboard-0.069
 unconscious-0.065
Token!"
Token ID2474
Feature activation+3.60e-02

Pos logit contributions
 ado+0.078
arl+0.073
ously+0.072
 appearances+0.072
 overboard+0.072
Neg logit contributions
aus-0.050
tre-0.046
me-0.046
ban-0.044
 mac-0.043
Token
Token ID198
Feature activation+2.08e-02

Pos logit contributions
 ado+0.053
 overboard+0.051
arl+0.051
 appearances+0.051
ously+0.049
Neg logit contributions
aus-0.038
tre-0.037
me-0.033
oc-0.033
ns-0.031
Token
Token ID198
Feature activation+2.08e-02

Pos logit contributions
 ado+0.024
ously+0.024
arl+0.023
 overboard+0.023
 fault+0.021
Neg logit contributions
aus-0.028
tre-0.028
ns-0.026
roc-0.025
ban-0.025
TokenM
Token ID44
Feature activation-6.73e-02

Pos logit contributions
 ado+0.026
ously+0.026
arl+0.025
 overboard+0.025
 fault+0.023
Neg logit contributions
aus-0.026
tre-0.026
ns-0.024
me-0.024
ban-0.023
Tokenolly
Token ID5098
Feature activation+1.67e-02

Pos logit contributions
tre+0.117
ns+0.113
aus+0.110
 spe+0.109
oc+0.108
Neg logit contributions
arl-0.056
ously-0.055
 ado-0.051
 overboard-0.047
 fault-0.045
Token and
Token ID290
Feature activation-8.94e-03

Pos logit contributions
ously+0.017
 ado+0.016
 overboard+0.016
arl+0.015
 joins+0.015
Neg logit contributions
aus-0.024
tre-0.024
ns-0.023
roc-0.023
ban-0.022
Token Jack
Token ID3619
Feature activation+8.04e-02

Pos logit contributions
aus+0.013
tre+0.013
ns+0.012
ban+0.012
roc+0.012
Neg logit contributions
ously-0.009
 ado-0.009
arl-0.009
 overboard-0.009
 humor-0.008
Token kept
Token ID4030
Feature activation+7.53e-02

Pos logit contributions
ously+0.083
 ado+0.078
 overboard+0.077
arl+0.071
 joins+0.070
Neg logit contributions
tre-0.118
aus-0.116
roc-0.109
ns-0.108
ban-0.107
Token.
Token ID13
Feature activation+1.51e-03

Pos logit contributions
tre+0.021
aus+0.020
ban+0.020
me+0.019
ns+0.019
Neg logit contributions
 ado-0.024
arl-0.023
 overboard-0.022
ously-0.022
 fault-0.021
Token
Token ID198
Feature activation+2.17e-02

Pos logit contributions
 ado+0.002
 overboard+0.002
ously+0.002
 appearances+0.002
arl+0.002
Neg logit contributions
tre-0.002
aus-0.001
me-0.001
ban-0.001
library-0.001
Token
Token ID198
Feature activation+2.18e-02

Pos logit contributions
 ado+0.026
ously+0.026
 overboard+0.025
arl+0.025
 fault+0.023
Neg logit contributions
aus-0.028
tre-0.028
ns-0.026
me-0.025
ban-0.025
TokenThey
Token ID2990
Feature activation+7.62e-02

Pos logit contributions
 ado+0.025
ously+0.025
arl+0.025
 overboard+0.024
ching+0.022
Neg logit contributions
aus-0.029
tre-0.029
ns-0.027
roc-0.026
ban-0.026
Token put
Token ID1234
Feature activation-1.28e-02

Pos logit contributions
ously+0.076
 ado+0.072
 overboard+0.070
arl+0.068
 joins+0.064
Neg logit contributions
aus-0.112
tre-0.112
roc-0.111
ns-0.104
ban-0.104
Token their
Token ID511
Feature activation+5.61e-02

Pos logit contributions
aus+0.022
ban+0.021
roc+0.021
tre+0.021
ns+0.021
Neg logit contributions
 ado-0.010
arl-0.010
 overboard-0.010
ously-0.009
 off-0.009
Token bags
Token ID11668
Feature activation-1.35e-01

Pos logit contributions
 ado+0.058
arl+0.057
ously+0.056
 overboard+0.056
 fault+0.050
Neg logit contributions
aus-0.083
tre-0.080
ns-0.077
ban-0.076
roc-0.075
Token in
Token ID287
Feature activation-2.52e-02

Pos logit contributions
aus+0.218
roc+0.209
tre+0.208
ns+0.206
ban+0.200
Neg logit contributions
 overboard-0.122
ously-0.122
 ado-0.122
arl-0.121
 off-0.105
Token the
Token ID262
Feature activation-3.92e-02

Pos logit contributions
aus+0.041
ns+0.039
roc+0.038
tre+0.038
ban+0.038
Neg logit contributions
 ado-0.023
ously-0.021
 overboard-0.021
arl-0.021
 way-0.020
Token car
Token ID1097
Feature activation-1.71e-02

Pos logit contributions
tre+0.050
aus+0.050
ns+0.047
 reception+0.045
me+0.045
Neg logit contributions
ously-0.049
 ado-0.048
 overboard-0.046
arl-0.045
 fault-0.043
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
aus+0.024
tre+0.024
roc+0.023
ns+0.022
ban+0.022
Neg logit contributions
ously-0.019
 ado-0.019
 overboard-0.018
arl-0.018
 fault-0.016
Token will
Token ID481
Feature activation-2.52e-03

Pos logit contributions
aus+0.053
tre+0.052
ns+0.049
me+0.048
ban+0.048
Neg logit contributions
 ado-0.048
ously-0.047
arl-0.047
 overboard-0.046
osity-0.042
Token share
Token ID2648
Feature activation-1.57e-02

Pos logit contributions
me+0.003
tre+0.003
aus+0.002
ban+0.002
roc+0.002
Neg logit contributions
ously-0.004
 ado-0.004
 overboard-0.004
 unconscious-0.003
arl-0.003
Token and
Token ID290
Feature activation-2.10e-02

Pos logit contributions
aus+0.026
tre+0.025
library+0.024
ns+0.024
 spe+0.024
Neg logit contributions
 ado-0.012
 overboard-0.012
arl-0.012
ously-0.012
 joins-0.012
Token be
Token ID307
Feature activation-1.02e-02

Pos logit contributions
aus+0.026
tre+0.026
ns+0.024
ban+0.024
me+0.024
Neg logit contributions
 ado-0.026
ously-0.026
arl-0.025
 overboard-0.025
ching-0.023
Token nice
Token ID3621
Feature activation+5.61e-02

Pos logit contributions
aus+0.015
tre+0.015
ns+0.014
roc+0.013
me+0.013
Neg logit contributions
ously-0.011
 ado-0.011
arl-0.010
 overboard-0.010
 fault-0.009
Token."
Token ID526
Feature activation+3.16e-02

Pos logit contributions
 ado+0.071
ously+0.070
arl+0.068
 overboard+0.067
 shattering+0.063
Neg logit contributions
tre-0.069
aus-0.067
ns-0.064
ban-0.064
me-0.062
Token Mom
Token ID11254
Feature activation-1.79e-02

Pos logit contributions
 ado+0.037
ously+0.036
arl+0.036
 overboard+0.036
ching+0.033
Neg logit contributions
tre-0.042
aus-0.041
ns-0.039
ban-0.038
roc-0.038
Token smiled
Token ID13541
Feature activation-1.21e-02

Pos logit contributions
tre+0.027
aus+0.027
ns+0.025
roc+0.025
ban+0.024
Neg logit contributions
ously-0.018
 ado-0.017
arl-0.017
 overboard-0.016
 joins-0.015
Token and
Token ID290
Feature activation-2.09e-02

Pos logit contributions
aus+0.012
tre+0.011
ns+0.010
me+0.010
oc+0.010
Neg logit contributions
 ado-0.018
 overboard-0.018
arl-0.017
ously-0.017
 fault-0.017
Token gave
Token ID2921
Feature activation-2.44e-02

Pos logit contributions
aus+0.025
tre+0.023
ns+0.022
me+0.021
 lin+0.021
Neg logit contributions
 ado-0.029
 overboard-0.028
arl-0.028
ously-0.028
 fault-0.027
Token them
Token ID606
Feature activation-3.92e-02

Pos logit contributions
aus+0.036
oc+0.035
tre+0.035
ns+0.034
 spe+0.033
Neg logit contributions
 ado-0.024
ously-0.024
arl-0.024
 overboard-0.022
 off-0.021
Token had
Token ID550
Feature activation-7.54e-03

Pos logit contributions
arl+0.115
 ado+0.112
ously+0.109
 overboard+0.106
 humor+0.102
Neg logit contributions
tre-0.109
aus-0.106
ns-0.104
me-0.101
 spe-0.099
Token gone
Token ID3750
Feature activation+1.40e-01

Pos logit contributions
tre+0.011
aus+0.011
ns+0.010
 spe+0.010
ban+0.010
Neg logit contributions
 ado-0.008
arl-0.008
ously-0.008
 overboard-0.008
 humor-0.007
Token out
Token ID503
Feature activation+7.31e-02

Pos logit contributions
 ado+0.167
arl+0.167
ously+0.165
 overboard+0.157
 humor+0.154
Neg logit contributions
tre-0.176
aus-0.171
ns-0.162
 spe-0.157
ban-0.154
Token.
Token ID13
Feature activation+7.78e-03

Pos logit contributions
arl+0.097
 ado+0.097
ously+0.093
 overboard+0.092
 humor+0.087
Neg logit contributions
tre-0.085
ns-0.079
 spe-0.077
aus-0.077
 ruled-0.074
Token
Token ID198
Feature activation+1.98e-02

Pos logit contributions
ously+0.007
 ado+0.007
arl+0.007
 overboard+0.007
osity+0.006
Neg logit contributions
aus-0.012
ns-0.012
tre-0.011
 spe-0.011
me-0.011
Token
Token ID198
Feature activation+2.06e-02

Pos logit contributions
ously+0.020
 ado+0.019
arl+0.019
 overboard+0.018
ching+0.017
Neg logit contributions
aus-0.030
ns-0.029
tre-0.029
 spe-0.028
roc-0.027
TokenJ
Token ID41
Feature activation-1.28e-01

Pos logit contributions
ously+0.026
 ado+0.025
arl+0.024
 overboard+0.024
ching+0.022
Neg logit contributions
aus-0.027
ns-0.025
 spe-0.024
tre-0.024
 dom-0.022
Tokenill
Token ID359
Feature activation-2.61e-02

Pos logit contributions
tre+0.144
aus+0.144
roc+0.130
ns+0.127
me+0.127
Neg logit contributions
 ado-0.177
 overboard-0.171
arl-0.170
ously-0.168
 risking-0.160
Token thought
Token ID1807
Feature activation+6.97e-02

Pos logit contributions
aus+0.034
tre+0.033
ns+0.032
me+0.031
ban+0.030
Neg logit contributions
arl-0.032
 ado-0.031
ously-0.031
 overboard-0.030
 fault-0.028
Token:
Token ID25
Feature activation+6.95e-02

Pos logit contributions
ously+0.062
 ado+0.056
 fault+0.051
 overboard+0.051
 off+0.050
Neg logit contributions
roc-0.118
tre-0.117
aus-0.116
ns-0.115
oc-0.114
Token "
Token ID366
Feature activation-2.62e-02

Pos logit contributions
 overboard+0.052
ously+0.052
 ado+0.050
ching+0.048
inated+0.048
Neg logit contributions
aus-0.120
ns-0.112
tre-0.112
roc-0.109
 lin-0.109
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
 ado+0.135
ously+0.133
arl+0.131
 overboard+0.128
 fault+0.120
Neg logit contributions
aus-0.147
tre-0.145
ns-0.138
ban-0.132
me-0.132
Token The
Token ID383
Feature activation-8.41e-02

Pos logit contributions
tre+0.016
aus+0.016
ns+0.015
roc+0.015
me+0.015
Neg logit contributions
arl-0.011
 ado-0.010
ously-0.010
 overboard-0.009
ching-0.009
Token ducks
Token ID39694
Feature activation-1.69e-02

Pos logit contributions
aus+0.124
tre+0.120
ns+0.117
roc+0.115
ban+0.114
Neg logit contributions
 ado-0.086
ously-0.086
arl-0.081
 overboard-0.079
osity-0.075
Token get
Token ID651
Feature activation+4.83e-02

Pos logit contributions
ns+0.019
tre+0.018
me+0.018
aus+0.017
 ruled+0.017
Neg logit contributions
 ado-0.023
ously-0.022
Charl-0.022
arl-0.022
 fault-0.022
Token scared
Token ID12008
Feature activation+1.67e-02

Pos logit contributions
 overboard+0.040
 ado+0.038
arl+0.037
ously+0.037
 off+0.034
Neg logit contributions
aus-0.082
tre-0.080
ns-0.077
oc-0.077
me-0.075
Token.
Token ID13
Feature activation-8.62e-03

Pos logit contributions
 ado+0.019
ously+0.019
arl+0.018
 overboard+0.018
 fault+0.016
Neg logit contributions
aus-0.023
tre-0.022
ns-0.021
roc-0.021
ban-0.020
Token They
Token ID1119
Feature activation+2.56e-02

Pos logit contributions
tre+0.014
aus+0.014
ns+0.013
me+0.013
roc+0.013
Neg logit contributions
arl-0.008
 ado-0.007
 overboard-0.007
ously-0.007
 kicks-0.006
Token fly
Token ID6129
Feature activation-9.94e-02

Pos logit contributions
ously+0.030
 ado+0.030
arl+0.029
 overboard+0.028
osity+0.028
Neg logit contributions
aus-0.033
ns-0.033
tre-0.032
 spe-0.030
ban-0.030
Token away
Token ID1497
Feature activation+1.11e-01

Pos logit contributions
tre+0.161
aus+0.161
ns+0.154
roc+0.151
ban+0.149
Neg logit contributions
 ado-0.086
ously-0.085
arl-0.083
 overboard-0.079
osity-0.071
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
 ado+0.183
ously+0.178
Charl+0.176
arl+0.175
 humor+0.173
Neg logit contributions
 spe-0.086
tre-0.080
oc-0.076
 ruled-0.075
 dom-0.073
Token
Token ID198
Feature activation+5.28e-03

Pos logit contributions
aus+0.018
tre+0.018
ns+0.017
roc+0.016
me+0.016
Neg logit contributions
 ado-0.013
arl-0.013
ously-0.013
 overboard-0.013
 fault-0.012
Token inside
Token ID2641
Feature activation+9.33e-02

Pos logit contributions
aus+0.140
tre+0.131
ns+0.130
roc+0.127
ban+0.126
Neg logit contributions
 overboard-0.062
arl-0.060
 ado-0.060
ously-0.055
 off-0.052
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
 ado+0.069
 overboard+0.068
arl+0.068
 off+0.065
ously+0.061
Neg logit contributions
aus-0.158
tre-0.156
ns-0.155
roc-0.153
ban-0.153
Token play
Token ID711
Feature activation+3.77e-02

Pos logit contributions
tre+0.017
aus+0.017
ns+0.016
me+0.016
ban+0.016
Neg logit contributions
ously-0.015
 ado-0.015
arl-0.015
 overboard-0.014
 fault-0.013
Token?"
Token ID1701
Feature activation+4.87e-03

Pos logit contributions
 overboard+0.026
 off+0.024
 that+0.024
arl+0.023
 joins+0.023
Neg logit contributions
aus-0.068
roc-0.065
tre-0.064
ns-0.063
Tre-0.062
Token The
Token ID383
Feature activation-7.96e-02

Pos logit contributions
 ado+0.005
ously+0.005
arl+0.005
 overboard+0.005
 fault+0.005
Neg logit contributions
aus-0.007
tre-0.007
ns-0.006
roc-0.006
ban-0.006
Token cat
Token ID3797
Feature activation-8.84e-02

Pos logit contributions
aus+0.102
tre+0.099
ns+0.096
me+0.093
library+0.091
Neg logit contributions
ously-0.100
 ado-0.098
 overboard-0.095
arl-0.093
 fault-0.086
Token d
Token ID288
Feature activation-2.34e-03

Pos logit contributions
aus+0.129
tre+0.127
ns+0.120
roc+0.118
ban+0.117
Neg logit contributions
ously-0.092
 ado-0.090
 overboard-0.087
arl-0.086
ching-0.078
Tokenarted
Token ID19112
Feature activation+1.59e-01

Pos logit contributions
aus+0.004
tre+0.004
ns+0.004
roc+0.004
me+0.004
Neg logit contributions
 ado-0.002
ously-0.002
 overboard-0.002
arl-0.002
 fault-0.001
Token inside
Token ID2641
Feature activation+9.42e-02

Pos logit contributions
ously+0.100
 off+0.087
 overboard+0.087
 ado+0.083
 unconscious+0.076
Neg logit contributions
aus-0.316
roc-0.306
tre-0.303
ns-0.299
oc-0.298
Token,
Token ID11
Feature activation+1.27e-02

Pos logit contributions
ously+0.102
 ado+0.096
 overboard+0.091
arl+0.090
 unconscious+0.084
Neg logit contributions
aus-0.141
tre-0.135
ns-0.134
roc-0.132
ban-0.128
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
arl+0.017
 ado+0.017
ously+0.016
 overboard+0.016
 fault+0.016
Neg logit contributions
tre-0.014
aus-0.014
ns-0.013
 spe-0.013
me-0.013
Token a
Token ID257
Feature activation+6.50e-03

Pos logit contributions
aus+0.109
tre+0.107
ns+0.102
roc+0.099
ban+0.098
Neg logit contributions
 ado-0.084
ously-0.084
arl-0.082
 overboard-0.081
ching-0.073
Token time
Token ID640
Feature activation+3.02e-02

Pos logit contributions
 ado+0.008
ously+0.008
 overboard+0.008
arl+0.008
osity+0.007
Neg logit contributions
aus-0.008
tre-0.008
ns-0.008
me-0.008
oc-0.007
Token there
Token ID612
Feature activation+4.49e-02

Pos logit contributions
 ado+0.038
ously+0.037
arl+0.037
 overboard+0.036
 humor+0.034
Neg logit contributions
tre-0.038
aus-0.037
ns-0.034
me-0.033
roc-0.033
Token was
Token ID373
Feature activation-6.59e-03

Pos logit contributions
 ado+0.053
ously+0.053
arl+0.051
 overboard+0.051
 fault+0.046
Neg logit contributions
aus-0.059
tre-0.058
ns-0.055
ban-0.053
roc-0.053
Token a
Token ID257
Feature activation+6.71e-03

Pos logit contributions
aus+0.009
tre+0.009
ns+0.009
roc+0.009
ban+0.009
Neg logit contributions
 ado-0.007
ously-0.007
 overboard-0.007
arl-0.007
 fault-0.006
Token boy
Token ID2933
Feature activation-5.44e-02

Pos logit contributions
arl+0.003
 punk+0.003
 way+0.002
 toy+0.002
 ado+0.002
Neg logit contributions
tre-0.014
aus-0.013
ban-0.013
ns-0.013
roc-0.013
Token.
Token ID13
Feature activation+8.62e-03

Pos logit contributions
tre+0.055
aus+0.055
ns+0.051
oc+0.051
me+0.049
Neg logit contributions
 ado-0.081
ously-0.080
arl-0.080
 overboard-0.078
 fault-0.074
Token The
Token ID383
Feature activation-7.35e-02

Pos logit contributions
 ado+0.011
ously+0.011
arl+0.010
 overboard+0.010
 fault+0.009
Neg logit contributions
aus-0.011
tre-0.011
ns-0.010
roc-0.010
ban-0.010
Token boy
Token ID2933
Feature activation-5.42e-02

Pos logit contributions
aus+0.090
tre+0.085
ns+0.078
roc+0.077
me+0.077
Neg logit contributions
ously-0.100
 ado-0.099
 overboard-0.093
arl-0.092
osity-0.088
Token's
Token ID338
Feature activation-4.90e-02

Pos logit contributions
aus+0.090
tre+0.086
roc+0.083
ban+0.083
ns+0.082
Neg logit contributions
 ado-0.043
ously-0.043
 overboard-0.041
 kicks-0.039
 punk-0.038
Token name
Token ID1438
Feature activation+5.94e-03

Pos logit contributions
tre+0.076
aus+0.075
ns+0.072
ban+0.070
me+0.070
Neg logit contributions
arl-0.047
 ado-0.045
 overboard-0.044
ously-0.044
 fault-0.043
TokenWhen
Token ID2215
Feature activation+4.49e-02

Pos logit contributions
ously+0.024
 ado+0.023
arl+0.023
 overboard+0.022
ching+0.021
Neg logit contributions
aus-0.037
tre-0.036
ns-0.035
 spe-0.035
roc-0.034
Token the
Token ID262
Feature activation-2.33e-02

Pos logit contributions
ously+0.037
 ado+0.036
 off+0.036
 overboard+0.035
 way+0.034
Neg logit contributions
aus-0.074
ns-0.074
tre-0.072
oc-0.071
roc-0.070
Token party
Token ID2151
Feature activation-6.79e-02

Pos logit contributions
aus+0.027
tre+0.027
ns+0.025
me+0.024
ban+0.024
Neg logit contributions
ously-0.031
 ado-0.031
 overboard-0.030
arl-0.030
 fault-0.028
Token day
Token ID1110
Feature activation+4.73e-02

Pos logit contributions
aus+0.104
ns+0.102
tre+0.101
ban+0.097
roc+0.097
Neg logit contributions
ously-0.065
 ado-0.064
 way-0.059
 overboard-0.058
arl-0.058
Token finally
Token ID3443
Feature activation-5.77e-02

Pos logit contributions
ously+0.049
 ado+0.047
arl+0.045
 overboard+0.045
 off+0.041
Neg logit contributions
aus-0.070
tre-0.068
ns-0.068
roc-0.066
oc-0.065
Token came
Token ID1625
Feature activation-7.09e-02

Pos logit contributions
aus+0.069
tre+0.068
me+0.061
 reception+0.059
 ruled+0.059
Neg logit contributions
arl-0.076
 ado-0.075
ously-0.073
 overboard-0.073
 appearances-0.069
Token,
Token ID11
Feature activation+2.35e-02

Pos logit contributions
tre+0.086
aus+0.081
me+0.076
roc+0.075
library+0.075
Neg logit contributions
 ado-0.091
arl-0.089
ously-0.088
 overboard-0.084
 humor-0.083
Token everyone
Token ID2506
Feature activation-1.87e-03

Pos logit contributions
ously+0.023
 ado+0.023
 overboard+0.021
arl+0.021
 off+0.020
Neg logit contributions
aus-0.036
ns-0.035
tre-0.035
roc-0.033
oc-0.033
Token arrived
Token ID5284
Feature activation+2.27e-02

Pos logit contributions
tre+0.002
aus+0.002
 ruled+0.002
me+0.002
 reception+0.002
Neg logit contributions
arl-0.003
 ado-0.003
 fault-0.003
 overboard-0.003
 appearances-0.003
Token with
Token ID351
Feature activation-4.57e-02

Pos logit contributions
 ado+0.029
arl+0.028
ously+0.028
 overboard+0.027
ching+0.025
Neg logit contributions
tre-0.028
aus-0.027
ns-0.025
me-0.025
roc-0.024
Token gifts
Token ID13201
Feature activation+9.23e-03

Pos logit contributions
aus+0.062
tre+0.061
ns+0.058
ban+0.056
 spe+0.056
Neg logit contributions
 ado-0.053
ously-0.052
arl-0.052
 overboard-0.050
osity-0.046
Token the
Token ID262
Feature activation-2.89e-02

Pos logit contributions
ously+0.010
 ado+0.010
arl+0.010
 overboard+0.010
osity+0.009
Neg logit contributions
aus-0.009
tre-0.009
ns-0.009
ban-0.008
roc-0.008
Token hospital
Token ID4436
Feature activation-3.46e-02

Pos logit contributions
aus+0.042
tre+0.042
ns+0.039
ban+0.038
me+0.038
Neg logit contributions
 ado-0.030
arl-0.029
ously-0.029
 overboard-0.029
 fault-0.026
Token.
Token ID13
Feature activation+5.57e-03

Pos logit contributions
aus+0.051
tre+0.049
roc+0.047
ns+0.047
ban+0.045
Neg logit contributions
 ado-0.036
ously-0.036
arl-0.035
 overboard-0.035
 fault-0.032
Token
Token ID198
Feature activation+1.64e-02

Pos logit contributions
 ado+0.006
arl+0.006
ously+0.006
 overboard+0.005
osity+0.005
Neg logit contributions
aus-0.008
tre-0.008
ns-0.008
roc-0.007
ban-0.007
Token
Token ID198
Feature activation+1.74e-02

Pos logit contributions
arl+0.015
ously+0.014
 ado+0.014
 overboard+0.014
ching+0.013
Neg logit contributions
aus-0.026
tre-0.025
ns-0.024
 spe-0.024
roc-0.024
TokenL
Token ID43
Feature activation-1.32e-01

Pos logit contributions
arl+0.019
ously+0.018
 ado+0.018
 overboard+0.018
ching+0.017
Neg logit contributions
aus-0.025
tre-0.024
ns-0.023
 spe-0.022
roc-0.022
Tokenily
Token ID813
Feature activation-3.24e-02

Pos logit contributions
 spe+0.197
tre+0.195
aus+0.194
ns+0.194
ban+0.186
Neg logit contributions
arl-0.135
ously-0.132
 ado-0.127
 fault-0.112
 overboard-0.112
Token was
Token ID373
Feature activation-9.46e-03

Pos logit contributions
tre+0.049
aus+0.048
ban+0.045
roc+0.045
Tre+0.044
Neg logit contributions
ously-0.033
 ado-0.032
 overboard-0.030
 joins-0.029
 kicks-0.029
Token okay
Token ID8788
Feature activation-5.61e-02

Pos logit contributions
aus+0.012
tre+0.012
ns+0.011
me+0.011
ban+0.011
Neg logit contributions
ously-0.012
 ado-0.012
arl-0.011
 overboard-0.011
osity-0.010
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
aus+0.070
tre+0.070
ns+0.065
ban+0.065
me+0.064
Neg logit contributions
 ado-0.069
arl-0.067
ously-0.067
 overboard-0.066
 humor-0.060
Token but
Token ID475
Feature activation+1.60e-02

Pos logit contributions
ously+0.014
 ado+0.014
arl+0.013
 overboard+0.013
 shattering+0.012
Neg logit contributions
aus-0.022
tre-0.022
ns-0.021
roc-0.021
ban-0.020
Token,
Token ID11
Feature activation+1.54e-03

Pos logit contributions
aus+0.079
tre+0.078
ns+0.073
roc+0.072
ban+0.071
Neg logit contributions
 ado-0.052
ously-0.052
arl-0.051
 overboard-0.049
 fault-0.046
Token but
Token ID475
Feature activation+8.07e-03

Pos logit contributions
arl+0.001
 that+0.001
 shattering+0.001
 overboard+0.001
 fault+0.001
Neg logit contributions
roc-0.003
tre-0.003
library-0.003
aus-0.003
ns-0.002
Token it
Token ID340
Feature activation-3.19e-02

Pos logit contributions
 ado+0.009
ously+0.009
arl+0.009
 overboard+0.009
 fault+0.008
Neg logit contributions
aus-0.011
tre-0.011
ns-0.011
ban-0.010
roc-0.010
Token's
Token ID338
Feature activation-5.63e-02

Pos logit contributions
aus+0.044
tre+0.044
ns+0.040
ban+0.040
roc+0.040
Neg logit contributions
 ado-0.034
ously-0.034
 overboard-0.034
arl-0.033
ching-0.031
Token good
Token ID922
Feature activation+4.48e-02

Pos logit contributions
aus+0.073
tre+0.072
ns+0.068
ban+0.066
roc+0.066
Neg logit contributions
 ado-0.068
ously-0.067
arl-0.066
 overboard-0.065
 fault-0.060
Token for
Token ID329
Feature activation-1.70e-01

Pos logit contributions
Charl+0.083
itating+0.083
 ado+0.081
arl+0.080
 punk+0.079
Neg logit contributions
aus-0.025
tre-0.023
 mac-0.022
ns-0.021
 dom-0.021
Token you
Token ID345
Feature activation-7.55e-02

Pos logit contributions
aus+0.217
tre+0.214
ns+0.202
ban+0.195
roc+0.194
Neg logit contributions
 ado-0.208
ously-0.207
arl-0.203
 overboard-0.200
 fault-0.183
Token.
Token ID13
Feature activation-6.23e-03

Pos logit contributions
aus+0.093
tre+0.092
ns+0.089
ban+0.085
me+0.084
Neg logit contributions
 ado-0.094
ously-0.094
arl-0.091
 overboard-0.089
osity-0.083
Token How
Token ID1374
Feature activation+1.04e-02

Pos logit contributions
aus+0.009
tre+0.009
ns+0.008
ban+0.008
roc+0.008
Neg logit contributions
arl-0.007
 ado-0.007
ously-0.006
 overboard-0.006
 fault-0.006
Token about
Token ID546
Feature activation+4.65e-02

Pos logit contributions
 fault+0.008
arl+0.007
ously+0.007
 ado+0.007
upid+0.006
Neg logit contributions
tre-0.019
ns-0.018
 reception-0.018
aus-0.018
 spe-0.018
Token this
Token ID428
Feature activation+1.20e-02

Pos logit contributions
 ado+0.044
 overboard+0.042
arl+0.042
 appearances+0.041
 joins+0.040
Neg logit contributions
aus-0.071
tre-0.070
roc-0.067
ban-0.066
Tre-0.065
Token Wh
Token ID854
Feature activation-1.80e-01

Pos logit contributions
tre+0.092
aus+0.090
ban+0.089
oc+0.089
me+0.087
Neg logit contributions
arl-0.038
 ado-0.035
 punk-0.033
 overboard-0.032
 way-0.032
Tokenisk
Token ID1984
Feature activation-2.41e-01

Pos logit contributions
tre+0.143
 spe+0.138
aus+0.137
ban+0.127
 lin+0.122
Neg logit contributions
arl-0.307
 ado-0.303
ously-0.302
 overboard-0.286
 fault-0.286
Tokeners
Token ID364
Feature activation-1.65e-01

Pos logit contributions
aus+0.256
 spe+0.254
tre+0.241
 symp+0.237
ban+0.236
Neg logit contributions
ously-0.333
ching-0.328
 ado-0.321
 overboard-0.317
arl-0.310
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
aus+0.268
tre+0.260
ns+0.245
roc+0.240
library+0.240
Neg logit contributions
ously-0.147
 ado-0.143
arl-0.137
 overboard-0.136
 appearances-0.124
Token Wh
Token ID854
Feature activation-1.80e-01

Pos logit contributions
arl+0.013
 ado+0.012
 kicks+0.012
 way+0.011
 overboard+0.010
Neg logit contributions
tre-0.015
me-0.015
 lin-0.015
ban-0.015
aus-0.015
Tokenisk
Token ID1984
Feature activation-2.41e-01

Pos logit contributions
 spe+0.169
ban+0.164
tre+0.155
 reception+0.147
aus+0.145
Neg logit contributions
arl-0.293
 ado-0.286
 fault-0.274
ously-0.272
ching-0.270
Tokeners
Token ID364
Feature activation-1.66e-01

Pos logit contributions
aus+0.240
tre+0.227
 spe+0.223
ban+0.217
ns+0.206
Neg logit contributions
ously-0.354
 ado-0.347
 overboard-0.344
ching-0.343
arl-0.334
Token liked
Token ID8288
Feature activation+4.88e-02

Pos logit contributions
aus+0.273
tre+0.265
ns+0.251
ban+0.250
 spe+0.249
Neg logit contributions
ously-0.133
 ado-0.130
 overboard-0.123
 living-0.121
 punk-0.117
Token to
Token ID284
Feature activation+3.40e-03

Pos logit contributions
 overboard+0.039
 ado+0.039
ously+0.038
arl+0.038
 fault+0.035
Neg logit contributions
tre-0.081
aus-0.081
me-0.078
ns-0.077
oc-0.076
Token sit
Token ID1650
Feature activation-7.72e-02

Pos logit contributions
arl+0.002
 ado+0.002
 overboard+0.002
 humor+0.002
 kicks+0.002
Neg logit contributions
tre-0.006
aus-0.006
roc-0.006
Tre-0.006
oc-0.005
Token by
Token ID416
Feature activation-9.50e-02

Pos logit contributions
ban+0.181
engu+0.180
aus+0.172
 pizz+0.171
 reception+0.164
Neg logit contributions
 down-0.029
arl-0.024
 overboard-0.023
 off-0.023
 over-0.022
Token panic
Token ID13619
Feature activation+2.24e-02

Pos logit contributions
ously+0.001
arl+0.001
 ado+0.001
 overboard+0.001
osity+0.001
Neg logit contributions
tre-0.002
aus-0.002
roc-0.002
me-0.002
ban-0.002
Token.
Token ID13
Feature activation+9.86e-03

Pos logit contributions
 ado+0.026
ously+0.026
arl+0.025
 overboard+0.025
 fault+0.023
Neg logit contributions
aus-0.030
tre-0.030
ns-0.028
ban-0.027
roc-0.027
Token She
Token ID1375
Feature activation+1.55e-03

Pos logit contributions
ously+0.013
 ado+0.013
arl+0.013
 overboard+0.013
 fault+0.012
Neg logit contributions
tre-0.011
aus-0.011
me-0.010
roc-0.010
ns-0.010
Token loved
Token ID6151
Feature activation+6.84e-02

Pos logit contributions
 ado+0.002
arl+0.002
ously+0.002
 overboard+0.002
 fault+0.002
Neg logit contributions
aus-0.002
tre-0.002
ns-0.002
roc-0.002
me-0.002
Token that
Token ID326
Feature activation+1.93e-01

Pos logit contributions
ously+0.095
 ado+0.093
arl+0.093
 overboard+0.092
ching+0.086
Neg logit contributions
aus-0.074
tre-0.074
 av-0.068
roc-0.068
ns-0.066
Token v
Token ID410
Feature activation-2.57e-01

Pos logit contributions
 fault+0.166
 way+0.164
 ado+0.158
 overboard+0.155
ously+0.150
Neg logit contributions
ns-0.337
aus-0.334
tre-0.315
ban-0.313
roc-0.308
Tokenase
Token ID589
Feature activation-1.23e-01

Pos logit contributions
aus+0.319
 spe+0.318
ns+0.309
tre+0.307
ban+0.289
Neg logit contributions
arl-0.339
ously-0.313
 overboard-0.310
 ado-0.304
 fault-0.300
Token very
Token ID845
Feature activation-8.63e-02

Pos logit contributions
aus+0.164
tre+0.162
ns+0.153
me+0.148
roc+0.147
Neg logit contributions
 ado-0.142
ously-0.141
arl-0.140
 overboard-0.136
osity-0.123
Token much
Token ID881
Feature activation-3.12e-02

Pos logit contributions
aus+0.120
tre+0.119
ns+0.112
roc+0.108
me+0.108
Neg logit contributions
 ado-0.096
ously-0.095
arl-0.093
 overboard-0.091
osity-0.083
Token and
Token ID290
Feature activation-1.28e-02

Pos logit contributions
aus+0.031
tre+0.029
oc+0.028
ban+0.026
me+0.026
Neg logit contributions
 ado-0.046
 overboard-0.046
ously-0.045
arl-0.044
osity-0.044
Token now
Token ID783
Feature activation+7.55e-02

Pos logit contributions
aus+0.012
tre+0.012
ns+0.011
me+0.011
 ruled+0.011
Neg logit contributions
arl-0.021
 ado-0.019
 overboard-0.019
ously-0.019
ching-0.018
Token.
Token ID13
Feature activation-6.34e-03

Pos logit contributions
aus+0.111
tre+0.111
ns+0.106
roc+0.103
ban+0.100
Neg logit contributions
 ado-0.074
arl-0.073
ously-0.073
 overboard-0.071
 fault-0.064
Token Can
Token ID1680
Feature activation+5.60e-03

Pos logit contributions
aus+0.009
tre+0.009
ns+0.008
ban+0.008
me+0.008
Neg logit contributions
 ado-0.007
ously-0.007
arl-0.007
 overboard-0.007
 fault-0.006
Token I
Token ID314
Feature activation+7.90e-02

Pos logit contributions
arl+0.004
 ado+0.004
 overboard+0.004
ously+0.004
 appearances+0.003
Neg logit contributions
aus-0.010
ns-0.010
tre-0.009
roc-0.009
oc-0.009
Token play
Token ID711
Feature activation+3.45e-02

Pos logit contributions
ously+0.097
 ado+0.096
 overboard+0.092
arl+0.089
 appearances+0.086
Neg logit contributions
tre-0.103
aus-0.103
me-0.098
ns-0.094
ban-0.094
Token dress
Token ID6576
Feature activation+1.06e-02

Pos logit contributions
 overboard+0.029
arl+0.029
 ado+0.028
 joins+0.028
 punk+0.028
Neg logit contributions
aus-0.056
ns-0.053
tre-0.053
roc-0.052
 spe-0.052
Token-
Token ID12
Feature activation-2.24e-01

Pos logit contributions
 off+0.010
 that+0.009
arl+0.008
 ado+0.008
 punk+0.008
Neg logit contributions
 spe-0.018
ns-0.018
aus-0.018
roc-0.017
Tre-0.017
Tokenup
Token ID929
Feature activation-2.61e-02

Pos logit contributions
aus+0.366
 spe+0.346
ns+0.335
tre+0.334
 symp+0.325
Neg logit contributions
 ado-0.184
 overboard-0.181
arl-0.179
ching-0.169
ously-0.168
Token with
Token ID351
Feature activation-5.82e-02

Pos logit contributions
aus+0.042
roc+0.041
ns+0.041
 spe+0.040
tre+0.040
Neg logit contributions
arl-0.023
 ado-0.022
 off-0.022
 overboard-0.021
 joins-0.021
Token something
Token ID1223
Feature activation-1.73e-02

Pos logit contributions
aus+0.090
tre+0.088
ns+0.085
roc+0.083
oc+0.083
Neg logit contributions
 ado-0.054
arl-0.053
 overboard-0.052
ously-0.051
 humor-0.047
Token else
Token ID2073
Feature activation+9.08e-02

Pos logit contributions
aus+0.023
tre+0.023
me+0.022
ban+0.021
ns+0.021
Neg logit contributions
ously-0.022
 ado-0.021
 overboard-0.020
arl-0.020
osity-0.019
Token?"
Token ID1701
Feature activation-4.15e-03

Pos logit contributions
ously+0.116
 ado+0.113
arl+0.110
 overboard+0.109
 shattering+0.100
Neg logit contributions
aus-0.113
tre-0.112
ns-0.105
me-0.103
ban-0.101
Token't
Token ID470
Feature activation-5.09e-04

Pos logit contributions
ns+0.119
tre+0.117
aus+0.116
roc+0.113
 spe+0.112
Neg logit contributions
arl-0.054
 overboard-0.050
ching-0.049
 ado-0.049
ously-0.047
Token be
Token ID307
Feature activation-1.43e-02

Pos logit contributions
roc+0.000
tre+0.000
aus+0.000
ban+0.000
me+0.000
Neg logit contributions
ously-0.001
arl-0.001
 ado-0.001
 fault-0.001
 overboard-0.001
Token a
Token ID257
Feature activation-2.54e-03

Pos logit contributions
ns+0.014
aus+0.013
tre+0.013
oc+0.013
roc+0.013
Neg logit contributions
ously-0.022
 ado-0.020
arl-0.020
osity-0.020
 humor-0.018
Token scared
Token ID12008
Feature activation+1.31e-02

Pos logit contributions
oc+0.003
tre+0.003
aus+0.003
 spe+0.003
roc+0.003
Neg logit contributions
 ado-0.003
ously-0.003
arl-0.003
 overboard-0.003
osity-0.003
Tokeny
Token ID88
Feature activation+1.52e-01

Pos logit contributions
 ado+0.016
 overboard+0.016
ously+0.016
osity+0.016
arl+0.016
Neg logit contributions
aus-0.015
oc-0.015
tre-0.015
 spe-0.014
roc-0.014
Token-
Token ID12
Feature activation-2.25e-01

Pos logit contributions
ously+0.145
arl+0.140
 ado+0.139
 overboard+0.133
ching+0.126
Neg logit contributions
aus-0.235
tre-0.234
ns-0.231
ban-0.220
library-0.216
Tokencat
Token ID9246
Feature activation-1.72e-01

Pos logit contributions
aus+0.283
tre+0.275
ns+0.261
 spe+0.257
 lin+0.253
Neg logit contributions
arl-0.270
 ado-0.266
 overboard-0.266
ously-0.265
ching-0.258
Token.
Token ID13
Feature activation-2.32e-03

Pos logit contributions
aus+0.254
tre+0.251
ns+0.236
roc+0.233
library+0.229
Neg logit contributions
ously-0.174
 ado-0.172
arl-0.168
 overboard-0.167
ching-0.155
Token It
Token ID632
Feature activation-4.60e-02

Pos logit contributions
aus+0.003
tre+0.003
ns+0.003
roc+0.003
ban+0.003
Neg logit contributions
 ado-0.002
ously-0.002
arl-0.002
 overboard-0.002
 fault-0.002
Token will
Token ID481
Feature activation-1.12e-02

Pos logit contributions
aus+0.050
tre+0.050
ns+0.048
me+0.047
oc+0.045
Neg logit contributions
arl-0.067
 ado-0.066
ously-0.066
 overboard-0.061
 humor-0.060
Token be
Token ID307
Feature activation-1.62e-02

Pos logit contributions
aus+0.017
tre+0.017
ns+0.016
roc+0.016
oc+0.015
Neg logit contributions
arl-0.011
 ado-0.011
 overboard-0.011
ously-0.011
ching-0.010
Token his
Token ID465
Feature activation+6.06e-02

Pos logit contributions
tre+0.048
aus+0.048
ns+0.044
roc+0.044
me+0.044
Neg logit contributions
ously-0.033
 ado-0.033
 overboard-0.032
arl-0.031
ching-0.030
Token Mom
Token ID11254
Feature activation-8.30e-03

Pos logit contributions
 ado+0.067
ously+0.066
arl+0.065
 overboard+0.064
 fault+0.058
Neg logit contributions
aus-0.085
tre-0.084
ns-0.079
roc-0.077
ban-0.077
Tokenmy
Token ID1820
Feature activation+5.05e-02

Pos logit contributions
aus+0.013
tre+0.013
ns+0.012
roc+0.012
ban+0.012
Neg logit contributions
 ado-0.008
ously-0.008
arl-0.007
 overboard-0.007
ching-0.006
Token say
Token ID910
Feature activation-2.00e-02

Pos logit contributions
 ado+0.056
ously+0.055
 overboard+0.053
arl+0.053
ching+0.049
Neg logit contributions
aus-0.070
tre-0.069
ns-0.064
roc-0.064
ban-0.063
Token that
Token ID326
Feature activation+1.92e-01

Pos logit contributions
aus+0.027
tre+0.027
ns+0.025
roc+0.025
ban+0.024
Neg logit contributions
ously-0.023
 ado-0.023
arl-0.022
 overboard-0.022
 fault-0.020
Token cel
Token ID18725
Feature activation-2.29e-01

Pos logit contributions
ously+0.148
 way+0.148
 overboard+0.143
 ado+0.143
ching+0.138
Neg logit contributions
roc-0.306
tre-0.301
aus-0.296
ban-0.289
ns-0.286
Tokenery
Token ID1924
Feature activation-2.69e-02

Pos logit contributions
tre+0.345
aus+0.334
roc+0.322
library+0.322
 reception+0.321
Neg logit contributions
arl-0.224
ously-0.221
ching-0.220
 overboard-0.217
 ado-0.209
Token was
Token ID373
Feature activation-5.89e-03

Pos logit contributions
aus+0.036
tre+0.036
roc+0.033
ns+0.033
ban+0.033
Neg logit contributions
ously-0.031
 ado-0.030
 overboard-0.029
arl-0.029
ching-0.027
Token an
Token ID281
Feature activation-9.45e-02

Pos logit contributions
tre+0.008
aus+0.008
roc+0.008
oc+0.008
ban+0.008
Neg logit contributions
 ado-0.006
 overboard-0.006
ously-0.006
arl-0.006
 fault-0.006
Token incredible
Token ID8082
Feature activation-5.47e-02

Pos logit contributions
tre+0.140
ns+0.132
ban+0.132
aus+0.129
Tre+0.129
Neg logit contributions
 ado-0.092
 overboard-0.088
arl-0.086
ously-0.085
 way-0.083
Token vegetable
Token ID20236
Feature activation+4.86e-02

Pos logit contributions
tre+0.073
aus+0.072
ns+0.070
ban+0.068
roc+0.068
Neg logit contributions
 way-0.060
 overboard-0.059
ously-0.059
 ado-0.059
 fault-0.057