TOP ACTIVATIONS
MAX = 0.321

, came a loud BO OM ! Sarah jumped in surprise
Kate heard a loud BO OM . She yelled " What
that they spelled " M OM ". He said that it
then suddenly it went BO OM ! Dad yelled " What
found a special pink pops icle inside the freezer ! She
one ... BO OM ! The hut
not bigger than a fing ern ail . They also found
M ummy used her fing ern ails to gently pull the
the cooler , her pops icle stayed nice and cool -
my gave her a pops icle to taste . It was
a delicious ice cream pops icle ! She was so happy
make it into a pops icle . <|endoftext|> Once upon a
mom gave him a pops icle to cool him down .
a barn out of pops icle sticks .
the man for a pops icle . " Can I have
him down . The pops icle was very cold and made
the dove with her fing ern ails . The dove was
smooth feeling of the pops icle on his tongue . Tim
his knee and a pops icle to make him feel better
as he ate his pops icle , feeling cooler and more

MIN ACTIVATIONS
MIN = -0.343

run and play and stay active . Pet ey on the
fast . It 's really active !" Mary smiled
happy to see him so active and healthy . And that
being lazy to being an active and happy pet . He
reminder to be wise and active even when having fun .
lazy robot had become so active . <|endoftext|> Once upon a
. The squirrel was very active and didn 't want to
't be lazy , being active and doing exercise makes us
would remind her to stay active and have fun , but
to have fun and be active . The lazy human then
Jack was a happy and active child who loved to play
. Spot was always very active and full of energy .
old . Jack was very active and loved to run around
you also need to be responsible and careful . Dirt is
. She was a very active little girl . She liked
It teaches you to be responsible and organized . It also
rules . Be careful and responsible . <|endoftext|> L ily and
. You are smart and responsible . You can wear your
You should be careful and responsible ." L ily
should have been kind and loving . We love you ,

INTERVAL 0.155 - 0.321
CONTAINS 5.744%

girl called Lucy . She was only three , with curly
beautiful sunset . They all stopped playing to watch it .
upon a time there there was a fierce big cat called
They use them to make a robot . The robot has
is modest , but it is my favorite ." <|endoftext|> Once

INTERVAL -0.011 - 0.155
CONTAINS 50.983%

snack . Then , he packed his toys back into the
snapping in the wind . <|endoftext|> Once upon a time there
join in with . Finally , they agreed
two friends , Lucy and Toy a , were together working
was magical ! She put it on and was filled with

INTERVAL -0.177 - -0.011
CONTAINS 40.790%

followed their plan . Whenever they did , they were always
a big t eddy bear and a book . They hug
pretend to be animals , or pirates , or superheroes .
! She could now run and play without worrying about going
explore the island , but he was fearful .

INTERVAL -0.343 - -0.177
CONTAINS 2.483%

They also said they were sorry for the lion and hoped
table . The table was strong and could protect them from
, a small bird felt helpless . It could not fly
, Sara . We are sorry , Tom ." They hugged
so proud to have the flexible statue . He showed all
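
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX, and the four CONTAINS percentages appear to partition all sampled tokens. That reading is inferred from the printed numbers, not stated by the dashboard. A minimal sketch that checks it, where the helper name `equal_width_edges` and the choice of four bins are assumptions made for illustration:

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 bin edges evenly spaced from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


# MIN and MAX as printed in the dashboard; 4 bins is an assumption.
edges = equal_width_edges(-0.343, 0.321, 4)

# CONTAINS percentages as printed, from the highest interval to the lowest.
shares = [5.744, 50.983, 40.790, 2.483]
total_share = sum(shares)

print(edges)
print(round(total_share, 3))
```

If the bins are equal-width, the printed edges should match the interval boundaries listed above, and the shares should sum to roughly 100%.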

PER-TOKEN DETAILS

Token,
Token ID11
Feature activation-6.48e-02

Pos logit contributions
OM+0.262
ena+0.213
 been+0.208
 happened+0.208
emon+0.204
Neg logit contributions
leness-0.302
lest-0.296
mer-0.291
 active-0.291
 alike-0.286
Token came
Token ID1625
Feature activation+1.28e-01

Pos logit contributions
leness+0.108
lest+0.107
mer+0.103
 active+0.102
ling+0.099
Neg logit contributions
OM-0.098
ena-0.083
 been-0.083
 happened-0.080
 a-0.077
Token a
Token ID257
Feature activation+1.82e-01

Pos logit contributions
OM+0.223
ena+0.186
emon+0.182
 CA+0.182
 happened+0.181
Neg logit contributions
leness-0.179
lest-0.178
 active-0.177
mer-0.176
 alike-0.173
Token loud
Token ID7812
Feature activation-1.59e-01

Pos logit contributions
OM+0.282
 happened+0.229
ena+0.228
 been+0.227
 CA+0.225
Neg logit contributions
leness-0.292
lest-0.286
 active-0.275
mer-0.275
 alike-0.270
Token BO
Token ID16494
Feature activation+2.34e-02

Pos logit contributions
leness+0.269
lest+0.267
 active+0.260
 alike+0.255
mer+0.255
Neg logit contributions
OM-0.231
 happened-0.190
 CA-0.188
ena-0.185
 been-0.185
TokenOM
Token ID2662
Feature activation+3.21e-01

Pos logit contributions
 happened+0.012
 truth+0.010
emon+0.008
 blood+0.007
 CA+0.007
Neg logit contributions
mer-0.059
bing-0.057
 forth-0.056
ishly-0.055
lest-0.055
Token!
Token ID0
Feature activation+4.90e-02

Pos logit contributions
OM+0.543
 CA+0.476
emon+0.472
 been+0.461
ena+0.454
Neg logit contributions
ling-0.459
mer-0.458
lest-0.442
leness-0.434
 active-0.422
Token Sarah
Token ID10490
Feature activation-8.14e-03

Pos logit contributions
OM+0.090
emon+0.078
 happened+0.077
ena+0.076
 been+0.075
Neg logit contributions
leness-0.055
 active-0.054
 alike-0.054
 content-0.054
mer-0.054
Token jumped
Token ID11687
Feature activation+1.01e-01

Pos logit contributions
leness+0.016
lest+0.016
 active+0.015
mer+0.015
 alike+0.015
Neg logit contributions
OM-0.010
ena-0.008
 been-0.007
 happened-0.007
 CA-0.007
Token in
Token ID287
Feature activation-6.52e-02

Pos logit contributions
OM+0.157
 a+0.129
ena+0.128
 been+0.127
 happened+0.126
Neg logit contributions
leness-0.168
lest-0.162
mer-0.152
 active-0.152
 alike-0.149
Token surprise
Token ID5975
Feature activation+2.22e-02

Pos logit contributions
lest+0.115
leness+0.114
mer+0.110
 active+0.108
ling+0.106
Neg logit contributions
OM-0.092
 been-0.079
ena-0.078
 a-0.077
 happened-0.075
Token Kate
Token ID16693
Feature activation+3.98e-02

Pos logit contributions
lest+0.078
leness+0.077
 active+0.077
 alike+0.076
mer+0.075
Neg logit contributions
OM-0.109
 CA-0.092
emon-0.090
ena-0.090
 been-0.087
Token heard
Token ID2982
Feature activation+1.37e-01

Pos logit contributions
OM+0.056
emon+0.043
ena+0.042
 CA+0.042
 truth+0.041
Neg logit contributions
lest-0.070
ling-0.069
mer-0.069
leness-0.069
 forth-0.067
Token a
Token ID257
Feature activation+1.87e-01

Pos logit contributions
OM+0.165
emon+0.118
 CA+0.113
 fault+0.103
OU+0.101
Neg logit contributions
 active-0.250
 content-0.249
 alike-0.246
ling-0.244
 energetic-0.242
Token loud
Token ID7812
Feature activation-1.60e-01

Pos logit contributions
OM+0.258
 CA+0.220
 truth+0.214
 happened+0.210
 a+0.205
Neg logit contributions
leness-0.336
lest-0.322
mer-0.319
 alike-0.306
 active-0.305
Token BO
Token ID16494
Feature activation+2.50e-02

Pos logit contributions
leness+0.319
lest+0.318
mer+0.313
 active+0.310
 alike+0.307
Neg logit contributions
OM-0.183
 CA-0.145
 happened-0.142
ena-0.140
 been-0.139
TokenOM
Token ID2662
Feature activation+3.20e-01

Pos logit contributions
 happened+0.018
 truth+0.014
 blood+0.014
 CA+0.013
 a+0.013
Neg logit contributions
mer-0.057
lest-0.055
bing-0.054
leness-0.054
ishly-0.054
Token.
Token ID13
Feature activation+3.96e-02

Pos logit contributions
OM+0.566
emon+0.514
 CA+0.504
 been+0.500
 happened+0.489
Neg logit contributions
ling-0.437
mer-0.413
lest-0.402
bing-0.401
 active-0.389
Token She
Token ID1375
Feature activation-1.21e-01

Pos logit contributions
OM+0.085
emon+0.077
ena+0.073
 CA+0.073
 dentist+0.073
Neg logit contributions
 active-0.034
 alike-0.032
 content-0.031
 energetic-0.031
lest-0.030
Token yelled
Token ID22760
Feature activation+6.66e-02

Pos logit contributions
ling+0.213
 forth+0.211
 active+0.207
mer+0.205
lest+0.205
Neg logit contributions
OM-0.164
emon-0.139
 dentist-0.122
ena-0.119
 fault-0.118
Token "
Token ID366
Feature activation-8.84e-02

Pos logit contributions
OM+0.101
ena+0.086
 happened+0.084
 a+0.082
 been+0.081
Neg logit contributions
leness-0.113
lest-0.109
mer-0.106
 active-0.103
 forth-0.101
TokenWhat
Token ID2061
Feature activation+5.71e-02

Pos logit contributions
 forth+0.134
mer+0.124
lest+0.119
ling+0.117
 active+0.117
Neg logit contributions
 fault-0.127
emon-0.125
OM-0.125
 truth-0.122
 happened-0.117
Token that
Token ID326
Feature activation-3.02e-02

Pos logit contributions
OM+0.094
ena+0.078
 been+0.077
 happened+0.076
emon+0.076
Neg logit contributions
leness-0.081
lest-0.079
mer-0.078
 active-0.077
 alike-0.076
Token they
Token ID484
Feature activation-1.63e-01

Pos logit contributions
leness+0.049
lest+0.047
mer+0.047
 active+0.047
 alike+0.046
Neg logit contributions
OM-0.046
ena-0.038
 happened-0.038
 been-0.038
emon-0.037
Token spelled
Token ID32213
Feature activation+8.26e-02

Pos logit contributions
leness+0.311
lest+0.298
 active+0.296
mer+0.293
 forth+0.289
Neg logit contributions
OM-0.196
 happened-0.178
 been-0.177
ena-0.164
 a-0.155
Token "
Token ID366
Feature activation-8.56e-02

Pos logit contributions
OM+0.119
ena+0.100
 happened+0.097
 been+0.095
emon+0.093
Neg logit contributions
leness-0.141
lest-0.138
mer-0.135
 active-0.135
 forth-0.130
TokenM
Token ID44
Feature activation+4.04e-02

Pos logit contributions
mer+0.133
lest+0.132
leness+0.130
 active+0.126
ling+0.125
Neg logit contributions
OM-0.134
ena-0.111
 happened-0.109
 CA-0.108
 dentist-0.108
TokenOM
Token ID2662
Feature activation+3.19e-01

Pos logit contributions
OM+0.058
 CA+0.050
 happened+0.049
 been+0.048
ena+0.047
Neg logit contributions
mer-0.065
lest-0.065
leness-0.065
 active-0.062
 alike-0.061
Token".
Token ID1911
Feature activation+3.90e-02

Pos logit contributions
OM+0.489
 been+0.428
 happened+0.419
 CA+0.413
 dentist+0.410
Neg logit contributions
mer-0.482
ling-0.480
leness-0.473
lest-0.470
 alike-0.466
Token He
Token ID679
Feature activation-1.19e-01

Pos logit contributions
OM+0.061
ena+0.051
 been+0.049
 happened+0.049
emon+0.049
Neg logit contributions
leness-0.062
lest-0.060
 active-0.059
mer-0.059
 alike-0.058
Token said
Token ID531
Feature activation+3.26e-02

Pos logit contributions
leness+0.198
lest+0.198
 active+0.195
mer+0.195
 alike+0.192
Neg logit contributions
OM-0.176
ena-0.135
 CA-0.135
emon-0.134
 been-0.132
Token that
Token ID326
Feature activation-4.33e-02

Pos logit contributions
OM+0.050
 happened+0.043
ena+0.042
 been+0.042
emon+0.041
Neg logit contributions
leness-0.053
 forth-0.051
lest-0.051
 active-0.050
mer-0.050
Token it
Token ID340
Feature activation+7.57e-02

Pos logit contributions
leness+0.069
lest+0.067
mer+0.067
 active+0.066
 forth+0.065
Neg logit contributions
OM-0.067
ena-0.057
 happened-0.056
 been-0.056
emon-0.055
Token then
Token ID788
Feature activation+9.98e-03

Pos logit contributions
leness+0.044
mer+0.042
lest+0.042
 active+0.041
ling+0.041
Neg logit contributions
OM-0.035
 been-0.031
 happened-0.030
 a-0.029
ena-0.028
Token suddenly
Token ID6451
Feature activation+6.72e-02

Pos logit contributions
OM+0.014
 happened+0.013
 been+0.013
 a+0.012
ena+0.012
Neg logit contributions
leness-0.017
lest-0.017
mer-0.017
 active-0.016
ling-0.016
Token it
Token ID340
Feature activation+7.89e-02

Pos logit contributions
OM+0.098
ena+0.079
 happened+0.079
 been+0.078
 CA+0.076
Neg logit contributions
leness-0.114
lest-0.112
mer-0.111
 active-0.110
 alike-0.108
Token went
Token ID1816
Feature activation+7.40e-02

Pos logit contributions
OM+0.109
ena+0.089
emon+0.086
 CA+0.083
anya+0.082
Neg logit contributions
ling-0.139
mer-0.139
lest-0.138
 active-0.133
leness-0.132
Token BO
Token ID16494
Feature activation+1.96e-02

Pos logit contributions
OM+0.123
 happened+0.100
ena+0.100
 been+0.100
 CA+0.099
Neg logit contributions
leness-0.113
lest-0.112
 active-0.107
mer-0.107
 alike-0.105
TokenOM
Token ID2662
Feature activation+3.17e-01

Pos logit contributions
 happened+0.015
 blood+0.015
 truth+0.013
 dentist+0.011
 a+0.011
Neg logit contributions
bing-0.042
mer-0.041
iting-0.040
izing-0.040
acking-0.039
Token!
Token ID0
Feature activation+5.14e-02

Pos logit contributions
OM+0.547
emon+0.530
 been+0.501
 CA+0.498
 happened+0.495
Neg logit contributions
ling-0.429
mer-0.403
bing-0.387
 active-0.382
lest-0.368
Token Dad
Token ID17415
Feature activation-4.86e-02

Pos logit contributions
OM+0.092
ena+0.080
 happened+0.077
emon+0.077
 been+0.076
Neg logit contributions
leness-0.065
 active-0.065
lest-0.065
mer-0.064
 alike-0.062
Token yelled
Token ID22760
Feature activation+6.26e-02

Pos logit contributions
leness+0.105
 active+0.099
lest+0.098
mer+0.097
 alike+0.097
Neg logit contributions
OM-0.051
 been-0.041
ena-0.040
 happened-0.040
 a-0.034
Token "
Token ID366
Feature activation-9.21e-02

Pos logit contributions
OM+0.090
 a+0.079
 happened+0.077
 been+0.075
ena+0.075
Neg logit contributions
leness-0.112
lest-0.107
mer-0.101
 forth-0.101
 active-0.099
TokenWhat
Token ID2061
Feature activation+5.16e-02

Pos logit contributions
mer+0.156
lest+0.153
leness+0.145
 forth+0.145
 active+0.145
Neg logit contributions
OM-0.126
 dentist-0.109
ena-0.109
 happened-0.108
 CA-0.106
Token found
Token ID1043
Feature activation+9.30e-02

Pos logit contributions
lest+0.216
leness+0.215
 active+0.213
mer+0.211
 alike+0.209
Neg logit contributions
OM-0.136
ena-0.100
 CA-0.097
emon-0.097
 happened-0.096
Token a
Token ID257
Feature activation+1.84e-01

Pos logit contributions
OM+0.118
 CA+0.084
ena+0.082
emon+0.078
 happened+0.076
Neg logit contributions
lest-0.179
 alike-0.178
leness-0.178
 active-0.175
ling-0.174
Token special
Token ID2041
Feature activation-1.68e-01

Pos logit contributions
OM+0.275
 been+0.224
 happened+0.223
ena+0.219
 dentist+0.218
Neg logit contributions
leness-0.311
lest-0.298
mer-0.294
 active-0.287
ling-0.284
Token pink
Token ID11398
Feature activation-6.60e-02

Pos logit contributions
leness+0.279
lest+0.275
 active+0.272
mer+0.272
 alike+0.267
Neg logit contributions
OM-0.248
ena-0.202
 CA-0.192
 been-0.192
 happened-0.192
Token pops
Token ID26384
Feature activation+4.58e-02

Pos logit contributions
leness+0.109
lest+0.108
mer+0.106
 active+0.106
 alike+0.104
Neg logit contributions
OM-0.098
ena-0.080
 been-0.077
 happened-0.077
emon-0.077
Tokenicle
Token ID1548
Feature activation+3.04e-01

Pos logit contributions
OM+0.056
ena+0.048
 CA+0.045
 happened+0.045
 been+0.044
Neg logit contributions
mer-0.082
ling-0.080
lest-0.078
ishly-0.078
 grate-0.076
Token inside
Token ID2641
Feature activation-2.78e-02

Pos logit contributions
OM+0.472
ena+0.386
 happened+0.381
 been+0.380
emon+0.374
Neg logit contributions
leness-0.500
lest-0.477
 active-0.477
mer-0.469
 alike-0.460
Token the
Token ID262
Feature activation+3.76e-02

Pos logit contributions
leness+0.048
lest+0.047
 active+0.046
mer+0.046
 alike+0.045
Neg logit contributions
OM-0.040
 been-0.033
ena-0.033
 happened-0.032
emon-0.031
Token freezer
Token ID30967
Feature activation+1.22e-01

Pos logit contributions
OM+0.047
ena+0.037
 happened+0.035
emon+0.035
 CA+0.034
Neg logit contributions
leness-0.071
lest-0.070
mer-0.070
 active-0.070
 alike-0.069
Token!
Token ID0
Feature activation+5.55e-02

Pos logit contributions
OM+0.174
ena+0.140
 been+0.137
 happened+0.134
emon+0.131
Neg logit contributions
leness-0.220
lest-0.212
 active-0.211
mer-0.204
 alike-0.204
Token She
Token ID1375
Feature activation-1.12e-01

Pos logit contributions
OM+0.105
ena+0.089
 happened+0.087
emon+0.086
 been+0.086
Neg logit contributions
leness-0.069
lest-0.067
 active-0.067
mer-0.066
 alike-0.066
Token one
Token ID530
Feature activation+1.35e-01

Pos logit contributions
leness+0.196
lest+0.193
mer+0.191
ling+0.189
 active+0.189
Neg logit contributions
OM-0.093
ena-0.073
 been-0.069
emon-0.067
 happened-0.063
Token...
Token ID986
Feature activation+1.10e-02

Pos logit contributions
OM+0.160
 happened+0.151
 a+0.147
ena+0.146
 been+0.145
Neg logit contributions
leness-0.261
 active-0.252
ishly-0.245
 energetic-0.245
mer-0.238
Token
Token ID198
Feature activation+4.95e-02

Pos logit contributions
OM+0.016
ena+0.013
 been+0.013
 happened+0.013
emon+0.013
Neg logit contributions
leness-0.018
lest-0.018
 active-0.018
mer-0.018
 alike-0.017
Token
Token ID198
Feature activation+5.01e-02

Pos logit contributions
OM+0.085
 happened+0.071
 been+0.068
OU+0.067
emon+0.067
Neg logit contributions
oc-0.065
leness-0.063
 active-0.061
lest-0.061
 alike-0.061
TokenBO
Token ID8202
Feature activation+4.56e-03

Pos logit contributions
OM+0.094
 happened+0.082
 been+0.078
 CA+0.078
ena+0.077
Neg logit contributions
leness-0.062
lest-0.061
mer-0.060
 active-0.060
 alike-0.056
TokenOM
Token ID2662
Feature activation+3.04e-01

Pos logit contributions
 happened+0.003
 been+0.002
emon+0.002
 truth+0.002
una+0.002
Neg logit contributions
 grate-0.010
 forward-0.010
ishly-0.010
mer-0.010
 forth-0.010
Token!
Token ID0
Feature activation+3.27e-02

Pos logit contributions
OM+0.518
 been+0.448
 CA+0.445
emon+0.444
 happened+0.441
Neg logit contributions
ling-0.427
mer-0.419
lest-0.414
leness-0.404
 active-0.401
Token
Token ID198
Feature activation+5.01e-02

Pos logit contributions
OM+0.058
 happened+0.053
 been+0.050
 dentist+0.049
ena+0.049
Neg logit contributions
oc-0.034
 We-0.034
mer-0.033
 grate-0.032
lest-0.032
Token
Token ID198
Feature activation+5.01e-02

Pos logit contributions
OM+0.084
 happened+0.072
 been+0.068
ena+0.065
 fault+0.065
Neg logit contributions
oc-0.067
 active-0.062
leness-0.061
lest-0.059
mer-0.059
TokenThe
Token ID464
Feature activation-4.09e-02

Pos logit contributions
OM+0.078
 happened+0.069
 been+0.064
 dentist+0.063
 truth+0.063
Neg logit contributions
leness-0.073
lest-0.073
 active-0.072
mer-0.071
oc-0.069
Token hut
Token ID40812
Feature activation-1.18e-01

Pos logit contributions
leness+0.064
lest+0.063
 active+0.062
mer+0.062
 alike+0.061
Neg logit contributions
OM-0.065
ena-0.054
emon-0.052
 been-0.052
 CA-0.051
Token not
Token ID407
Feature activation+3.99e-02

Pos logit contributions
OM+0.342
ena+0.281
 been+0.276
 happened+0.274
emon+0.270
Neg logit contributions
leness-0.381
lest-0.374
mer-0.368
 active-0.363
 alike-0.356
Token bigger
Token ID5749
Feature activation-3.73e-02

Pos logit contributions
OM+0.061
 been+0.053
 happened+0.052
ena+0.051
emon+0.049
Neg logit contributions
leness-0.064
lest-0.062
mer-0.061
 active-0.059
 forth-0.059
Token than
Token ID621
Feature activation-1.47e-01

Pos logit contributions
leness+0.060
lest+0.059
 active+0.056
ishly+0.055
mer+0.055
Neg logit contributions
OM-0.054
 happened-0.052
 been-0.051
 truth-0.048
ena-0.048
Token a
Token ID257
Feature activation+1.83e-01

Pos logit contributions
leness+0.264
lest+0.260
 active+0.257
mer+0.256
ling+0.250
Neg logit contributions
OM-0.198
 been-0.163
 happened-0.160
ena-0.160
emon-0.152
Token fing
Token ID8038
Feature activation+3.63e-02

Pos logit contributions
OM+0.249
 happened+0.201
 been+0.199
ena+0.199
 truth+0.199
Neg logit contributions
leness-0.325
lest-0.318
ling-0.306
 active-0.306
 forth-0.304
Tokenern
Token ID1142
Feature activation+2.99e-01

Pos logit contributions
 happened+0.046
OM+0.045
 CA+0.041
 was+0.037
 Ow+0.035
Neg logit contributions
leness-0.058
ishly-0.056
mer-0.055
edient-0.055
 grate-0.055
Tokenail
Token ID603
Feature activation+1.26e-01

Pos logit contributions
OM+0.532
 CA+0.492
 Ow+0.455
 truth+0.451
O+0.448
Neg logit contributions
mer-0.417
lest-0.402
ling-0.378
leness-0.376
 active-0.365
Token.
Token ID13
Feature activation+3.83e-02

Pos logit contributions
OM+0.184
ena+0.152
 been+0.151
 happened+0.150
emon+0.146
Neg logit contributions
leness-0.213
lest-0.207
 active-0.205
 alike-0.200
mer-0.200
Token They
Token ID1119
Feature activation-1.47e-01

Pos logit contributions
OM+0.070
ena+0.059
 been+0.059
 CA+0.058
emon+0.058
Neg logit contributions
leness-0.054
lest-0.052
 forth-0.051
ling-0.050
 active-0.049
Token also
Token ID635
Feature activation+2.76e-04

Pos logit contributions
leness+0.288
 active+0.278
lest+0.278
mer+0.274
ling+0.266
Neg logit contributions
OM-0.164
 been-0.148
 happened-0.144
ena-0.141
 a-0.134
Token found
Token ID1043
Feature activation+9.08e-02

Pos logit contributions
OM+0.000
ena+0.000
emon+0.000
 CA+0.000
 happened+0.000
Neg logit contributions
leness-0.000
lest-0.000
mer-0.000
 active-0.000
 alike-0.000
Token M
Token ID337
Feature activation+4.87e-02

Pos logit contributions
leness+0.039
 active+0.038
lest+0.038
mer+0.037
ling+0.036
Neg logit contributions
OM-0.031
 happened-0.029
 a-0.029
 been-0.027
ena-0.026
Tokenummy
Token ID13513
Feature activation-1.13e-01

Pos logit contributions
OM+0.125
 CA+0.120
ena+0.115
 dentist+0.114
 been+0.113
Neg logit contributions
lest-0.019
mer-0.018
 active-0.017
ling-0.016
 alike-0.015
Token used
Token ID973
Feature activation-4.36e-02

Pos logit contributions
lest+0.180
mer+0.175
ling+0.174
 active+0.173
leness+0.173
Neg logit contributions
OM-0.180
 CA-0.140
ena-0.135
 Ow-0.135
 been-0.132
Token her
Token ID607
Feature activation+2.57e-02

Pos logit contributions
leness+0.078
lest+0.077
mer+0.076
 active+0.076
 alike+0.074
Neg logit contributions
OM-0.059
ena-0.047
 been-0.046
 happened-0.046
emon-0.045
Token fing
Token ID8038
Feature activation+3.43e-02

Pos logit contributions
OM+0.044
ena+0.037
emon+0.035
 CA+0.035
 dentist+0.035
Neg logit contributions
lest-0.038
leness-0.038
 active-0.036
mer-0.036
 alike-0.035
Tokenern
Token ID1142
Feature activation+2.93e-01

Pos logit contributions
OM+0.037
 CA+0.034
 happened+0.033
tr+0.028
 dentist+0.027
Neg logit contributions
leness-0.069
 active-0.064
 grate-0.063
ishly-0.063
edient-0.063
Tokenails
Token ID1768
Feature activation-8.69e-03

Pos logit contributions
OM+0.518
 CA+0.433
ena+0.423
 dentist+0.418
 happened+0.418
Neg logit contributions
leness-0.409
lest-0.408
mer-0.392
 active-0.386
 alike-0.385
Token to
Token ID284
Feature activation-2.07e-02

Pos logit contributions
leness+0.014
 active+0.014
lest+0.014
mer+0.014
 energetic+0.013
Neg logit contributions
OM-0.013
 happened-0.011
 been-0.011
ena-0.010
 a-0.010
Token gently
Token ID15165
Feature activation-7.84e-02

Pos logit contributions
leness+0.033
lest+0.033
mer+0.033
 active+0.032
ling+0.031
Neg logit contributions
OM-0.032
 been-0.027
ena-0.026
 happened-0.026
emon-0.025
Token pull
Token ID2834
Feature activation+8.13e-03

Pos logit contributions
mer+0.134
leness+0.133
 active+0.131
lest+0.129
 energetic+0.125
Neg logit contributions
OM-0.105
 a-0.097
 been-0.094
 happened-0.092
emon-0.087
Token the
Token ID262
Feature activation+2.87e-02

Pos logit contributions
OM+0.010
 a+0.009
ena+0.009
 been+0.008
 happened+0.008
Neg logit contributions
leness-0.015
lest-0.015
 active-0.014
mer-0.014
 energetic-0.014
Token the
Token ID262
Feature activation+7.35e-02

Pos logit contributions
leness+0.294
lest+0.275
bing+0.256
ling+0.255
ized+0.255
Neg logit contributions
OM-0.155
 dentist-0.141
 a-0.133
ena-0.130
 happened-0.124
Token cooler
Token ID19346
Feature activation+9.60e-02

Pos logit contributions
OM+0.112
ena+0.092
 been+0.089
 happened+0.089
emon+0.088
Neg logit contributions
leness-0.119
lest-0.117
mer-0.115
 active-0.115
 alike-0.113
Token,
Token ID11
Feature activation-6.43e-02

Pos logit contributions
OM+0.145
 been+0.119
ena+0.117
 happened+0.116
emon+0.113
Neg logit contributions
leness-0.161
lest-0.153
 active-0.151
 alike-0.149
mer-0.146
Token her
Token ID607
Feature activation+3.11e-02

Pos logit contributions
leness+0.115
mer+0.110
lest+0.109
 alike+0.102
 active+0.102
Neg logit contributions
OM-0.088
 been-0.080
 a-0.079
ena-0.075
 happened-0.074
Token pops
Token ID26384
Feature activation+4.06e-02

Pos logit contributions
OM+0.047
ena+0.038
 been+0.038
 happened+0.038
emon+0.037
Neg logit contributions
leness-0.051
lest-0.050
mer-0.049
 active-0.049
 alike-0.048
Tokenicle
Token ID1548
Feature activation+2.92e-01

Pos logit contributions
OM+0.049
 dentist+0.046
ena+0.044
 CA+0.043
orman+0.042
Neg logit contributions
ling-0.070
 forth-0.069
bing-0.068
lest-0.067
mer-0.066
Token stayed
Token ID9658
Feature activation+7.56e-02

Pos logit contributions
OM+0.480
ena+0.395
 been+0.389
 happened+0.387
emon+0.382
Neg logit contributions
leness-0.446
lest-0.432
 active-0.427
mer-0.426
 alike-0.418
Token nice
Token ID3621
Feature activation-1.50e-01

Pos logit contributions
 a+0.107
OM+0.097
 been+0.097
 time+0.096
 an+0.090
Neg logit contributions
leness-0.130
lest-0.127
mer-0.116
cient-0.116
ishly-0.112
Token and
Token ID290
Feature activation-2.77e-02

Pos logit contributions
leness+0.264
lest+0.252
 active+0.248
mer+0.244
 alike+0.243
Neg logit contributions
OM-0.210
 been-0.174
 happened-0.172
ena-0.171
emon-0.165
Token cool
Token ID3608
Feature activation-8.04e-02

Pos logit contributions
mer+0.042
leness+0.042
cient+0.039
 alike+0.039
ishly+0.038
Neg logit contributions
 been-0.041
 a-0.040
OM-0.039
 was-0.036
 happened-0.035
Token -
Token ID532
Feature activation+1.66e-04

Pos logit contributions
leness+0.168
lest+0.158
 active+0.155
oc+0.152
 knowledgeable+0.152
Neg logit contributions
OM-0.081
 happened-0.072
 been-0.072
 a-0.068
 time-0.061
Tokenmy
Token ID1820
Feature activation+5.69e-02

Pos logit contributions
leness+0.056
lest+0.055
 active+0.055
mer+0.055
 alike+0.054
Neg logit contributions
OM-0.036
ena-0.027
 happened-0.026
 been-0.026
 CA-0.026
Token gave
Token ID2921
Feature activation-4.55e-02

Pos logit contributions
OM+0.081
ena+0.066
 been+0.064
 happened+0.064
emon+0.063
Neg logit contributions
leness-0.099
lest-0.096
mer-0.095
 active-0.095
 alike-0.093
Token her
Token ID607
Feature activation+2.39e-02

Pos logit contributions
 forth+0.059
ling+0.059
 alike+0.058
 active+0.055
leness+0.055
Neg logit contributions
OM-0.093
ena-0.072
 CA-0.072
emon-0.071
OO-0.067
Token a
Token ID257
Feature activation+1.80e-01

Pos logit contributions
OM+0.029
 CA+0.018
 happened+0.017
ena+0.017
 dentist+0.015
Neg logit contributions
 active-0.051
lest-0.050
 positive-0.049
ling-0.048
 energetic-0.048
Token pops
Token ID26384
Feature activation+3.90e-02

Pos logit contributions
OM+0.316
ena+0.275
 been+0.255
emon+0.252
 happened+0.247
Neg logit contributions
lest-0.256
 active-0.253
 alike-0.246
leness-0.245
mer-0.242
Tokenicle
Token ID1548
Feature activation+2.92e-01

Pos logit contributions
OM+0.047
ena+0.043
 dentist+0.041
abella+0.041
aila+0.040
Neg logit contributions
mer-0.071
 grate-0.068
lest-0.068
ling-0.067
oc-0.065
Token to
Token ID284
Feature activation-2.16e-02

Pos logit contributions
OM+0.490
ena+0.407
 happened+0.395
 been+0.394
 CA+0.393
Neg logit contributions
lest-0.427
leness-0.426
mer-0.419
 active-0.417
 alike-0.409
Token taste
Token ID6938
Feature activation+1.00e-01

Pos logit contributions
leness+0.033
lest+0.032
mer+0.032
 active+0.031
 alike+0.031
Neg logit contributions
OM-0.035
 been-0.029
ena-0.029
 happened-0.028
emon-0.028
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
OM+0.141
 happened+0.114
ena+0.113
 been+0.113
emon+0.112
Neg logit contributions
leness-0.173
lest-0.170
mer-0.169
 active-0.167
 alike-0.163
Token It
Token ID632
Feature activation+1.72e-02

Pos logit contributions
OM+0.038
ena+0.029
 CA+0.029
 happened+0.029
 been+0.029
Neg logit contributions
leness-0.047
lest-0.046
mer-0.045
 active-0.044
ling-0.044
Token was
Token ID373
Feature activation+2.25e-01

Pos logit contributions
OM+0.016
ena+0.011
emon+0.010
 CA+0.010
 been+0.010
Neg logit contributions
lest-0.038
leness-0.038
mer-0.038
 active-0.038
ling-0.037
Token a
Token ID257
Feature activation+1.76e-01

Pos logit contributions
leness+0.050
 active+0.049
lest+0.049
mer+0.049
 alike+0.048
Neg logit contributions
OM-0.019
ena-0.012
 CA-0.011
emon-0.011
 happened-0.011
Token delicious
Token ID12625
Feature activation-1.59e-01

Pos logit contributions
OM+0.248
 dentist+0.203
 happened+0.201
 been+0.199
 truth+0.198
Neg logit contributions
leness-0.310
lest-0.301
mer-0.291
 active-0.289
ishly-0.283
Token ice
Token ID4771
Feature activation+1.50e-01

Pos logit contributions
leness+0.304
lest+0.299
mer+0.294
 active+0.294
 alike+0.290
Neg logit contributions
OM-0.198
ena-0.151
 been-0.148
 happened-0.148
emon-0.145
Token cream
Token ID8566
Feature activation+9.66e-02

Pos logit contributions
OM+0.196
ena+0.155
 CA+0.150
 happened+0.147
 been+0.147
Neg logit contributions
lest-0.274
leness-0.273
mer-0.272
 active-0.267
ling-0.266
Token pops
Token ID26384
Feature activation+3.67e-02

Pos logit contributions
OM+0.144
ena+0.117
 been+0.114
 happened+0.114
emon+0.112
Neg logit contributions
leness-0.161
lest-0.158
 active-0.156
mer-0.155
 alike-0.152
Tokenicle
Token ID1548
Feature activation+2.92e-01

Pos logit contributions
OM+0.063
 CA+0.057
ena+0.057
 dentist+0.056
emon+0.055
Neg logit contributions
ling-0.047
mer-0.046
 forth-0.045
 forward-0.043
lest-0.043
Token!
Token ID0
Feature activation+5.11e-02

Pos logit contributions
OM+0.431
ena+0.343
 happened+0.342
 been+0.341
emon+0.331
Neg logit contributions
leness-0.498
lest-0.479
 active-0.478
mer-0.471
 alike-0.465
Token She
Token ID1375
Feature activation-1.18e-01

Pos logit contributions
OM+0.083
ena+0.068
 happened+0.066
emon+0.066
 been+0.065
Neg logit contributions
leness-0.077
 active-0.075
lest-0.075
mer-0.074
 alike-0.074
Token was
Token ID373
Feature activation+2.24e-01

Pos logit contributions
lest+0.221
leness+0.221
 active+0.217
mer+0.216
 alike+0.213
Neg logit contributions
OM-0.152
ena-0.115
emon-0.111
 CA-0.110
 happened-0.110
Token so
Token ID523
Feature activation+3.46e-02

Pos logit contributions
OM+0.371
ena+0.297
 CA+0.294
 happened+0.294
emon+0.290
Neg logit contributions
 active-0.334
leness-0.334
lest-0.331
 alike-0.327
mer-0.324
Token happy
Token ID3772
Feature activation-2.27e-01

Pos logit contributions
OM+0.066
 been+0.062
 happened+0.060
 a+0.060
ena+0.059
Neg logit contributions
leness-0.043
mer-0.041
lest-0.039
 forth-0.038
ishly-0.037
Token make
Token ID787
Feature activation-3.84e-02

Pos logit contributions
leness+0.037
lest+0.037
mer+0.036
 alike+0.035
ling+0.034
Neg logit contributions
OM-0.036
 been-0.031
 a-0.029
 dentist-0.029
ena-0.029
Token it
Token ID340
Feature activation+8.21e-02

Pos logit contributions
 active+0.061
ling+0.060
leness+0.060
lest+0.060
 alike+0.059
Neg logit contributions
OM-0.061
 CA-0.048
ena-0.047
emon-0.046
 happened-0.045
Token into
Token ID656
Feature activation-1.91e-02

Pos logit contributions
OM+0.122
 been+0.102
 happened+0.099
ena+0.097
emon+0.096
Neg logit contributions
leness-0.139
mer-0.134
lest-0.132
 alike-0.128
 active-0.127
Token a
Token ID257
Feature activation+1.77e-01

Pos logit contributions
leness+0.038
lest+0.038
mer+0.037
 active+0.037
 alike+0.037
Neg logit contributions
OM-0.022
ena-0.017
 been-0.016
 happened-0.016
emon-0.016
Token pops
Token ID26384
Feature activation+3.78e-02

Pos logit contributions
OM+0.229
 dentist+0.213
 time+0.199
 truth+0.193
 a+0.191
Neg logit contributions
leness-0.331
ishly-0.315
lest-0.314
bing-0.300
mer-0.293
Tokenicle
Token ID1548
Feature activation+2.91e-01

Pos logit contributions
OM+0.040
ena+0.037
 happened+0.034
 CA+0.034
abella+0.032
Neg logit contributions
mer-0.071
ling-0.070
lest-0.068
oc-0.068
 grate-0.067
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
OM+0.403
 been+0.312
 dentist+0.312
 happened+0.309
ena+0.309
Neg logit contributions
leness-0.543
 active-0.505
lest-0.503
mer-0.496
 alike-0.488
Token<|endoftext|>
Token ID50256
Feature activation+8.41e-02

Pos logit contributions
OM+0.027
ena+0.023
emon+0.022
 happened+0.022
 been+0.021
Neg logit contributions
 content-0.020
oc-0.020
leness-0.020
 active-0.019
 energetic-0.019
TokenOnce
Token ID7454
Feature activation+4.74e-02

Pos logit contributions
OM+0.195
emon+0.175
OU+0.166
OO+0.165
ena+0.162
Neg logit contributions
 together-0.065
oc-0.063
mer-0.063
 around-0.061
 freely-0.053
Token upon
Token ID2402
Feature activation+1.20e-01

Pos logit contributions
OM+0.068
emon+0.054
 CA+0.053
ena+0.053
 been+0.051
Neg logit contributions
lest-0.082
leness-0.080
 alike-0.080
ling-0.080
 active-0.079
Token a
Token ID257
Feature activation+1.93e-01

Pos logit contributions
OM+0.231
anger+0.206
omm+0.198
abella+0.189
wen+0.185
Neg logit contributions
 active-0.164
 content-0.159
 happy-0.159
 healthy-0.157
 together-0.153
Token mom
Token ID1995
Feature activation-2.32e-02

Pos logit contributions
 active+0.017
lest+0.015
 energetic+0.015
 forth+0.015
mer+0.015
Neg logit contributions
OM-0.022
 CA-0.020
emon-0.020
ena-0.019
 been-0.018
Token gave
Token ID2921
Feature activation-4.63e-02

Pos logit contributions
lest+0.036
 active+0.035
ling+0.035
 forth+0.035
leness+0.035
Neg logit contributions
OM-0.037
 CA-0.030
ena-0.030
emon-0.029
 truth-0.028
Token him
Token ID683
Feature activation+3.82e-02

Pos logit contributions
 forth+0.049
ling+0.046
 active+0.046
 alike+0.045
leness+0.045
Neg logit contributions
OM-0.100
ena-0.085
emon-0.083
 CA-0.083
 dentist-0.078
Token a
Token ID257
Feature activation+1.82e-01

Pos logit contributions
OM+0.042
 CA+0.028
 happened+0.026
ena+0.025
 truth+0.023
Neg logit contributions
 active-0.083
lest-0.079
 alike-0.079
 energetic-0.078
ling-0.078
Token pops
Token ID26384
Feature activation+3.96e-02

Pos logit contributions
OM+0.324
ena+0.289
 been+0.264
emon+0.260
anya+0.250
Neg logit contributions
 active-0.249
lest-0.239
 alike-0.239
 positive-0.236
 energetic-0.228
Tokenicle
Token ID1548
Feature activation+2.91e-01

Pos logit contributions
abella+0.050
OM+0.049
ena+0.045
 flown+0.045
Fore+0.044
Neg logit contributions
oc-0.064
 grate-0.061
free-0.057
mer-0.055
ishly-0.055
Token to
Token ID284
Feature activation-2.18e-02

Pos logit contributions
OM+0.512
ena+0.435
 been+0.421
 CA+0.421
emon+0.418
Neg logit contributions
lest-0.390
mer-0.382
 active-0.379
ling-0.378
 forth-0.378
Token cool
Token ID3608
Feature activation-7.87e-02

Pos logit contributions
leness+0.033
lest+0.032
 active+0.032
mer+0.031
 alike+0.031
Neg logit contributions
OM-0.036
ena-0.030
 happened-0.029
emon-0.029
 been-0.029
Token him
Token ID683
Feature activation+3.86e-02

Pos logit contributions
leness+0.101
lest+0.099
 active+0.095
mer+0.095
 alike+0.093
Neg logit contributions
OM-0.150
ena-0.126
 happened-0.126
 been-0.125
 CA-0.123
Token down
Token ID866
Feature activation-7.71e-02

Pos logit contributions
OM+0.063
ena+0.052
 been+0.051
 happened+0.051
emon+0.050
Neg logit contributions
leness-0.059
lest-0.058
mer-0.057
 active-0.057
 alike-0.056
Token.
Token ID13
Feature activation+3.18e-02

Pos logit contributions
mer+0.108
 active+0.107
leness+0.106
lest+0.105
 alike+0.104
Neg logit contributions
OM-0.134
ena-0.111
emon-0.110
 CA-0.109
 happened-0.107
Token a
Token ID257
Feature activation+1.71e-01

Pos logit contributions
leness+0.051
mer+0.050
lest+0.049
ling+0.048
 active+0.048
Neg logit contributions
OM-0.031
 been-0.026
ena-0.026
 a-0.025
 happened-0.025
Token barn
Token ID25203
Feature activation-2.28e-02

Pos logit contributions
OM+0.232
 been+0.193
 dentist+0.193
 a+0.193
 truth+0.192
Neg logit contributions
leness-0.298
lest-0.285
mer-0.281
ling-0.272
 active-0.270
Token out
Token ID503
Feature activation+5.81e-03

Pos logit contributions
leness+0.044
 active+0.042
lest+0.042
 energetic+0.040
 forth+0.039
Neg logit contributions
OM-0.029
 been-0.024
 happened-0.024
ena-0.023
emon-0.023
Token of
Token ID286
Feature activation-1.24e-01

Pos logit contributions
OM+0.007
 a+0.006
 been+0.006
 happened+0.006
ena+0.006
Neg logit contributions
leness-0.011
 active-0.011
lest-0.011
ishly-0.011
 energetic-0.010
Token pops
Token ID26384
Feature activation+3.69e-02

Pos logit contributions
leness+0.207
lest+0.204
mer+0.200
 active+0.200
 alike+0.195
Neg logit contributions
OM-0.185
ena-0.151
 been-0.150
 happened-0.148
emon-0.145
Tokenicle
Token ID1548
Feature activation+2.91e-01

Pos logit contributions
OM+0.048
ena+0.040
 CA+0.037
 dentist+0.037
 happened+0.036
Neg logit contributions
mer-0.066
lest-0.065
ling-0.065
 alike-0.064
leness-0.064
Token sticks
Token ID16461
Feature activation+3.98e-02

Pos logit contributions
OM+0.402
 happened+0.324
 been+0.317
ena+0.311
emon+0.306
Neg logit contributions
leness-0.527
 active-0.502
lest-0.499
mer-0.490
 alike-0.482
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
OM+0.047
 been+0.042
 a+0.042
 happened+0.041
emon+0.038
Neg logit contributions
leness-0.076
 active-0.073
lest-0.072
 energetic-0.069
 efficient-0.068
Token
Token ID198
Feature activation+6.62e-02

Pos logit contributions
OM+0.024
ena+0.019
 happened+0.018
 been+0.018
emon+0.018
Neg logit contributions
leness-0.030
lest-0.029
mer-0.029
 active-0.029
 alike-0.029
Token
Token ID198
Feature activation+6.58e-02

Pos logit contributions
OM+0.102
ena+0.081
 happened+0.080
 CA+0.079
 dentist+0.079
Neg logit contributions
leness-0.105
 active-0.103
lest-0.103
 alike-0.102
mer-0.102
Tokenâ
Token ID22940
Feature activation-1.97e-02

Pos logit contributions
OM+0.101
ena+0.083
 happened+0.082
 CA+0.080
 been+0.080
Neg logit contributions
leness-0.107
lest-0.105
 active-0.104
mer-0.103
 alike-0.101
Token the
Token ID262
Feature activation+3.05e-02

Pos logit contributions
leness+0.060
 active+0.059
lest+0.059
 alike+0.059
 forth+0.058
Neg logit contributions
OM-0.071
ena-0.057
 CA-0.056
emon-0.056
 been-0.055
Token man
Token ID582
Feature activation+7.98e-03

Pos logit contributions
OM+0.052
ena+0.047
emon+0.043
abella+0.042
 happened+0.042
Neg logit contributions
 active-0.041
 alike-0.040
 energetic-0.039
mer-0.039
lest-0.039
Token for
Token ID329
Feature activation-1.15e-01

Pos logit contributions
OM+0.010
 a+0.009
ena+0.009
emon+0.009
 been+0.009
Neg logit contributions
leness-0.015
lest-0.014
mer-0.014
 active-0.014
 forth-0.013
Token a
Token ID257
Feature activation+1.83e-01

Pos logit contributions
leness+0.177
 active+0.174
lest+0.174
 forth+0.169
 alike+0.167
Neg logit contributions
OM-0.185
ena-0.152
 CA-0.148
 happened-0.146
emon-0.142
Token pops
Token ID26384
Feature activation+3.97e-02

Pos logit contributions
OM+0.298
ena+0.262
 been+0.235
anya+0.233
emon+0.230
Neg logit contributions
lest-0.275
 active-0.274
 alike-0.264
leness-0.261
mer-0.260
Tokenicle
Token ID1548
Feature activation+2.90e-01

Pos logit contributions
abella+0.054
ena+0.053
OM+0.052
 happened+0.051
 been+0.048
Neg logit contributions
oc-0.057
 grate-0.057
mer-0.054
ishly-0.051
 tightly-0.050
Token.
Token ID13
Feature activation+3.39e-02

Pos logit contributions
OM+0.477
ena+0.394
 CA+0.389
 happened+0.384
 been+0.383
Neg logit contributions
lest-0.427
leness-0.423
mer-0.423
 active-0.417
 alike-0.412
Token "
Token ID366
Feature activation-8.67e-02

Pos logit contributions
OM+0.047
 a+0.041
 dentist+0.039
 CA+0.039
ena+0.038
Neg logit contributions
leness-0.065
lest-0.062
ling-0.061
 forth-0.059
mer-0.059
TokenCan
Token ID6090
Feature activation-2.81e-02

Pos logit contributions
leness+0.149
lest+0.147
 active+0.145
mer+0.144
 alike+0.141
Neg logit contributions
OM-0.125
ena-0.101
 happened-0.098
 been-0.097
emon-0.096
Token I
Token ID314
Feature activation-2.74e-02

Pos logit contributions
leness+0.037
lest+0.036
mer+0.036
 active+0.035
 alike+0.035
Neg logit contributions
OM-0.052
ena-0.044
 been-0.043
 happened-0.043
 CA-0.043
Token have
Token ID423
Feature activation-6.72e-03

Pos logit contributions
leness+0.048
lest+0.045
mer+0.045
ling+0.044
 alike+0.044
Neg logit contributions
OM-0.040
 been-0.034
ena-0.032
emon-0.032
 CA-0.032
Token him
Token ID683
Feature activation+3.86e-02

Pos logit contributions
leness+0.101
lest+0.099
 active+0.095
mer+0.095
 alike+0.093
Neg logit contributions
OM-0.150
ena-0.126
 happened-0.126
 been-0.125
 CA-0.123
Token down
Token ID866
Feature activation-7.71e-02

Pos logit contributions
OM+0.063
ena+0.052
 been+0.051
 happened+0.051
emon+0.050
Neg logit contributions
leness-0.059
lest-0.058
mer-0.057
 active-0.057
 alike-0.056
Token.
Token ID13
Feature activation+3.18e-02

Pos logit contributions
mer+0.108
 active+0.107
leness+0.106
lest+0.105
 alike+0.104
Neg logit contributions
OM-0.134
ena-0.111
emon-0.110
 CA-0.109
 happened-0.107
Token The
Token ID383
Feature activation-5.97e-02

Pos logit contributions
OM+0.054
ena+0.046
emon+0.044
 been+0.044
 happened+0.044
Neg logit contributions
leness-0.045
lest-0.045
 active-0.044
mer-0.044
 alike-0.043
Token pops
Token ID26384
Feature activation+3.94e-02

Pos logit contributions
 active+0.072
ling+0.072
 forth+0.072
lest+0.072
 energetic+0.071
Neg logit contributions
abella-0.093
OM-0.093
ena-0.088
aila-0.086
rett-0.085
Tokenicle
Token ID1548
Feature activation+2.90e-01

Pos logit contributions
OM+0.042
abella+0.042
Fore+0.041
 truth+0.040
ena+0.039
Neg logit contributions
ling-0.066
lest-0.064
oc-0.063
mer-0.063
free-0.061
Token was
Token ID373
Feature activation+2.25e-01

Pos logit contributions
OM+0.338
ena+0.266
emon+0.252
 CA+0.239
 been+0.235
Neg logit contributions
lest-0.576
mer-0.561
leness-0.559
ling-0.558
 forth-0.555
Token very
Token ID845
Feature activation-7.32e-02

Pos logit contributions
OM+0.374
ena+0.308
 CA+0.298
emon+0.297
 happened+0.296
Neg logit contributions
leness-0.334
lest-0.330
 active-0.329
mer-0.322
 alike-0.322
Token cold
Token ID4692
Feature activation-1.09e-01

Pos logit contributions
leness+0.139
ishly+0.136
mer+0.128
 forth+0.126
lest+0.126
Neg logit contributions
OM-0.094
 a-0.082
 happened-0.081
 been-0.079
O-0.075
Token and
Token ID290
Feature activation-2.20e-02

Pos logit contributions
leness+0.171
lest+0.167
 active+0.163
mer+0.163
 alike+0.160
Neg logit contributions
OM-0.176
ena-0.144
 been-0.142
 happened-0.142
emon-0.138
Token made
Token ID925
Feature activation+1.36e-02

Pos logit contributions
leness+0.043
mer+0.042
lest+0.040
 alike+0.040
ling+0.038
Neg logit contributions
OM-0.027
 happened-0.025
 a-0.024
 been-0.024
 CA-0.021
Token the
Token ID262
Feature activation+2.54e-02

Pos logit contributions
OM+0.055
 dentist+0.047
emon+0.046
 CA+0.044
ena+0.044
Neg logit contributions
 forth-0.023
ling-0.022
 forward-0.022
ging-0.021
bing-0.021
Token dove
Token ID38862
Feature activation+1.55e-01

Pos logit contributions
OM+0.043
ena+0.036
abella+0.036
 dentist+0.035
emon+0.034
Neg logit contributions
ling-0.038
 forth-0.037
lest-0.036
mer-0.036
leness-0.035
Token with
Token ID351
Feature activation-9.75e-02

Pos logit contributions
OM+0.244
 been+0.203
ena+0.200
 a+0.198
 happened+0.196
Neg logit contributions
leness-0.250
mer-0.232
lest-0.231
 active-0.231
 alike-0.230
Token her
Token ID607
Feature activation+2.24e-02

Pos logit contributions
leness+0.148
lest+0.146
 forth+0.145
 content+0.141
 alike+0.141
Neg logit contributions
OM-0.164
ena-0.132
emon-0.122
orman-0.119
abella-0.119
Token fing
Token ID8038
Feature activation+2.87e-02

Pos logit contributions
OM+0.040
ena+0.033
 dentist+0.033
emon+0.033
 CA+0.032
Neg logit contributions
lest-0.031
leness-0.030
mer-0.029
 active-0.029
 forth-0.029
Tokenern
Token ID1142
Feature activation+2.89e-01

Pos logit contributions
 happened+0.032
OM+0.032
 CA+0.029
 was+0.024
 a+0.024
Neg logit contributions
leness-0.054
 grate-0.050
ishly-0.050
edient-0.050
 active-0.050
Tokenails
Token ID1768
Feature activation-1.12e-02

Pos logit contributions
OM+0.547
 dentist+0.520
 CA+0.514
 truth+0.499
 happened+0.462
Neg logit contributions
lest-0.352
mer-0.320
ling-0.319
bs-0.301
leness-0.300
Token.
Token ID13
Feature activation+2.62e-02

Pos logit contributions
leness+0.017
lest+0.016
mer+0.016
ling+0.016
 alike+0.016
Neg logit contributions
OM-0.019
ena-0.016
emon-0.015
 CA-0.015
 dentist-0.015
Token The
Token ID383
Feature activation-6.09e-02

Pos logit contributions
OM+0.047
ena+0.040
 happened+0.039
 been+0.039
 CA+0.039
Neg logit contributions
leness-0.036
lest-0.035
mer-0.035
 active-0.034
 alike-0.034
Token dove
Token ID38862
Feature activation+1.54e-01

Pos logit contributions
ling+0.081
lest+0.080
 forth+0.076
leness+0.074
 content+0.073
Neg logit contributions
OM-0.115
omm-0.095
nob-0.095
abella-0.095
tr-0.095
Token was
Token ID373
Feature activation+2.22e-01

Pos logit contributions
OM+0.164
 happened+0.126
 been+0.123
ena+0.122
 CA+0.119
Neg logit contributions
leness-0.322
 active-0.311
lest-0.309
mer-0.308
 alike-0.307
Token smooth
Token ID7209
Feature activation-1.31e-01

Pos logit contributions
OM+0.048
abella+0.047
emon+0.047
aila+0.047
ena+0.046
Neg logit contributions
 energetic-0.031
 active-0.031
ling-0.031
 content-0.031
 forth-0.029
Token feeling
Token ID4203
Feature activation+5.37e-02

Pos logit contributions
ling+0.195
 forth+0.185
lest+0.178
oc+0.171
mer+0.169
Neg logit contributions
OM-0.203
aila-0.200
abella-0.196
ena-0.193
Fore-0.190
Token of
Token ID286
Feature activation-1.23e-01

Pos logit contributions
 happened+0.056
 a+0.056
 been+0.056
OM+0.053
ily+0.053
Neg logit contributions
leness-0.104
 forth-0.102
ishly-0.102
cient-0.099
 positive-0.098
Token the
Token ID262
Feature activation+2.85e-02

Pos logit contributions
leness+0.202
lest+0.200
mer+0.200
 active+0.195
 alike+0.192
Neg logit contributions
OM-0.188
ena-0.153
 been-0.153
 happened-0.151
emon-0.147
Token pops
Token ID26384
Feature activation+3.87e-02

Pos logit contributions
OM+0.050
ena+0.044
emon+0.043
 CA+0.043
 happened+0.041
Neg logit contributions
 active-0.039
lest-0.039
ling-0.039
leness-0.038
 forth-0.038
Tokenicle
Token ID1548
Feature activation+2.89e-01

Pos logit contributions
OM+0.039
 CA+0.037
abella+0.036
 truth+0.035
emon+0.034
Neg logit contributions
ling-0.071
lest-0.070
 forth-0.070
mer-0.069
oc-0.067
Token on
Token ID319
Feature activation-1.06e-01

Pos logit contributions
OM+0.380
 a+0.311
 been+0.309
 happened+0.307
ena+0.283
Neg logit contributions
leness-0.557
mer-0.520
 active-0.505
lest-0.505
 alike-0.498
Token his
Token ID465
Feature activation+2.47e-03

Pos logit contributions
leness+0.149
lest+0.145
 active+0.144
mer+0.143
 alike+0.140
Neg logit contributions
OM-0.184
ena-0.154
emon-0.151
 happened-0.150
 been-0.149
Token tongue
Token ID11880
Feature activation+6.68e-02

Pos logit contributions
OM+0.004
ena+0.003
 been+0.003
 happened+0.003
 CA+0.003
Neg logit contributions
leness-0.004
lest-0.004
mer-0.004
 alike-0.004
 active-0.004
Token.
Token ID13
Feature activation+3.01e-02

Pos logit contributions
OM+0.094
 been+0.075
ena+0.075
 happened+0.074
 CA+0.070
Neg logit contributions
leness-0.119
lest-0.115
mer-0.114
 active-0.113
 alike-0.110
Token Tim
Token ID5045
Feature activation+3.55e-02

Pos logit contributions
OM+0.046
ena+0.038
emon+0.037
 happened+0.036
 been+0.036
Neg logit contributions
leness-0.048
 active-0.047
lest-0.047
 alike-0.046
mer-0.046
Token his
Token ID465
Feature activation-8.91e-04

Pos logit contributions
lest+0.218
mer+0.214
leness+0.209
ling+0.199
 active+0.199
Neg logit contributions
OM-0.165
 been-0.149
 a-0.148
ena-0.137
 happened-0.137
Token knee
Token ID10329
Feature activation+1.03e-01

Pos logit contributions
leness+0.001
lest+0.001
 active+0.001
mer+0.001
 alike+0.001
Neg logit contributions
OM-0.001
ena-0.001
emon-0.001
 happened-0.001
 CA-0.001
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
OM+0.150
 been+0.122
ena+0.121
 happened+0.119
 CA+0.114
Neg logit contributions
leness-0.176
lest-0.171
mer-0.169
 active-0.168
 alike-0.166
Token a
Token ID257
Feature activation+1.74e-01

Pos logit contributions
leness+0.042
mer+0.041
lest+0.040
 alike+0.039
 active+0.039
Neg logit contributions
OM-0.037
 happened-0.032
 been-0.031
ena-0.031
 dentist-0.029
Token pops
Token ID26384
Feature activation+3.36e-02

Pos logit contributions
OM+0.272
ena+0.223
 been+0.218
 happened+0.217
emon+0.215
Neg logit contributions
leness-0.275
lest-0.270
mer-0.266
 active-0.264
 alike-0.260
Tokenicle
Token ID1548
Feature activation+2.89e-01

Pos logit contributions
OM+0.039
ena+0.036
abella+0.035
emon+0.034
Fore+0.033
Neg logit contributions
free-0.055
 grate-0.055
ishly-0.055
ling-0.054
lest-0.054
Token to
Token ID284
Feature activation-2.43e-02

Pos logit contributions
OM+0.465
ena+0.385
 been+0.371
emon+0.371
 happened+0.371
Neg logit contributions
leness-0.444
lest-0.440
mer-0.432
 active-0.429
 alike-0.423
Token make
Token ID787
Feature activation-4.11e-02

Pos logit contributions
leness+0.036
lest+0.036
mer+0.036
 active+0.035
 alike+0.035
Neg logit contributions
OM-0.040
ena-0.033
 been-0.033
 happened-0.033
 CA-0.032
Token him
Token ID683
Feature activation+3.70e-02

Pos logit contributions
leness+0.072
lest+0.070
 active+0.069
mer+0.069
 alike+0.068
Neg logit contributions
OM-0.058
ena-0.046
 been-0.045
 happened-0.045
emon-0.044
Token feel
Token ID1254
Feature activation+6.48e-02

Pos logit contributions
OM+0.061
ena+0.051
 been+0.050
 happened+0.050
emon+0.049
Neg logit contributions
leness-0.055
lest-0.054
mer-0.054
 active-0.053
 alike-0.052
Token better
Token ID1365
Feature activation-1.01e-01

Pos logit contributions
OM+0.084
 been+0.074
 happened+0.073
 a+0.073
 truth+0.071
Neg logit contributions
mer-0.114
lest-0.112
leness-0.112
 forth-0.107
 alike-0.106
Token as
Token ID355
Feature activation+6.20e-02

Pos logit contributions
 a+0.032
OM+0.030
 been+0.028
 happened+0.027
ena+0.026
Neg logit contributions
leness-0.064
lest-0.061
mer-0.058
fy-0.056
 active-0.055
Token he
Token ID339
Feature activation-1.34e-01

Pos logit contributions
OM+0.092
 happened+0.086
 been+0.084
 a+0.083
ena+0.081
Neg logit contributions
leness-0.099
mer-0.096
lest-0.095
 forth-0.093
ling-0.090
Token ate
Token ID15063
Feature activation-7.44e-03

Pos logit contributions
leness+0.205
mer+0.205
lest+0.202
 active+0.199
ling+0.198
Neg logit contributions
OM-0.218
ena-0.175
emon-0.174
 CA-0.169
 truth-0.167
Token his
Token ID465
Feature activation+5.73e-04

Pos logit contributions
lest+0.014
leness+0.014
mer+0.014
 active+0.013
ling+0.013
Neg logit contributions
OM-0.009
 a-0.008
 happened-0.008
 been-0.008
 truth-0.007
Token pops
Token ID26384
Feature activation+3.47e-02

Pos logit contributions
OM+0.001
ena+0.001
 CA+0.001
emon+0.001
 been+0.001
Neg logit contributions
leness-0.001
lest-0.001
 active-0.001
mer-0.001
 alike-0.001
Tokenicle
Token ID1548
Feature activation+2.89e-01

Pos logit contributions
OM+0.043
 CA+0.035
ena+0.035
 happened+0.034
 been+0.033
Neg logit contributions
ling-0.063
mer-0.062
lest-0.062
 forth-0.061
 grate-0.060
Token,
Token ID11
Feature activation-7.00e-02

Pos logit contributions
OM+0.394
 been+0.308
 happened+0.306
ena+0.304
 a+0.298
Neg logit contributions
leness-0.530
 active-0.498
lest-0.496
mer-0.487
 alike-0.482
Token feeling
Token ID4203
Feature activation+5.53e-02

Pos logit contributions
mer+0.127
leness+0.122
lest+0.119
 active+0.113
edient+0.112
Neg logit contributions
OM-0.095
 been-0.086
 a-0.086
ena-0.078
 happened-0.078
Token cooler
Token ID19346
Feature activation+7.62e-02

Pos logit contributions
 a+0.082
 been+0.078
 happened+0.071
 something+0.069
OM+0.069
Neg logit contributions
 forth-0.087
mer-0.087
lest-0.086
leness-0.084
ishly-0.080
Token and
Token ID290
Feature activation-2.91e-02

Pos logit contributions
OM+0.108
 been+0.095
 a+0.094
 happened+0.089
ena+0.082
Neg logit contributions
leness-0.140
lest-0.124
 active-0.120
 forth-0.118
reci-0.117
Token more
Token ID517
Feature activation-2.08e-02

Pos logit contributions
mer+0.046
leness+0.046
lest+0.045
 active+0.043
 alike+0.043
Neg logit contributions
OM-0.045
 been-0.039
 happened-0.038
ena-0.037
 CA-0.036
Token run
Token ID1057
Feature activation-2.68e-02

Pos logit contributions
mer+0.006
leness+0.005
lest+0.005
ling+0.005
 alike+0.005
Neg logit contributions
OM-0.006
 been-0.005
ena-0.005
 a-0.005
 happened-0.005
Token and
Token ID290
Feature activation-2.40e-02

Pos logit contributions
leness+0.033
lest+0.032
mer+0.032
 active+0.032
 alike+0.031
Neg logit contributions
OM-0.052
ena-0.044
 been-0.043
 happened-0.043
emon-0.043
Token play
Token ID711
Feature activation-2.41e-02

Pos logit contributions
leness+0.041
mer+0.041
lest+0.041
ling+0.039
 alike+0.039
Neg logit contributions
OM-0.034
 been-0.030
 a-0.029
 happened-0.027
ena-0.026
Token and
Token ID290
Feature activation-3.10e-02

Pos logit contributions
leness+0.034
lest+0.034
mer+0.032
 active+0.032
 alike+0.032
Neg logit contributions
OM-0.042
ena-0.036
 been-0.035
 happened-0.035
emon-0.034
Token stay
Token ID2652
Feature activation+1.94e-02

Pos logit contributions
leness+0.053
mer+0.053
lest+0.053
ling+0.051
 alike+0.049
Neg logit contributions
OM-0.043
 been-0.039
 a-0.037
 happened-0.035
 CA-0.034
Token active
Token ID4075
Feature activation-3.43e-01

Pos logit contributions
 a+0.028
OM+0.026
 been+0.025
 time+0.024
 an+0.024
Neg logit contributions
lest-0.034
leness-0.033
ishly-0.030
cient-0.030
mer-0.029
Token.
Token ID13
Feature activation+1.67e-02

Pos logit contributions
leness+0.587
lest+0.580
mer+0.555
 active+0.554
 alike+0.549
Neg logit contributions
OM-0.489
ena-0.401
 been-0.396
 happened-0.387
emon-0.375
Token Pet
Token ID4767
Feature activation+7.03e-02

Pos logit contributions
OM+0.027
ena+0.022
 happened+0.022
 been+0.022
emon+0.021
Neg logit contributions
leness-0.026
lest-0.025
 active-0.025
mer-0.025
 alike-0.024
Tokeney
Token ID2959
Feature activation-2.47e-02

Pos logit contributions
OM+0.142
 dentist+0.117
ena+0.117
emon+0.116
 CA+0.115
Neg logit contributions
mer-0.087
leness-0.085
lest-0.083
ling-0.082
 alike-0.078
Token on
Token ID319
Feature activation-1.09e-01

Pos logit contributions
leness+0.049
lest+0.047
mer+0.046
 active+0.046
 forth+0.046
Neg logit contributions
OM-0.028
ena-0.024
 been-0.023
 happened-0.022
emon-0.021
Token the
Token ID262
Feature activation+2.75e-02

Pos logit contributions
leness+0.185
lest+0.185
mer+0.182
 alike+0.178
 active+0.177
Neg logit contributions
OM-0.146
 been-0.125
 a-0.124
ena-0.123
 happened-0.122
Token fast
Token ID3049
Feature activation-1.51e-01

Pos logit contributions
OM+0.055
ena+0.042
 happened+0.041
 been+0.040
 dentist+0.040
Neg logit contributions
leness-0.082
 active-0.082
lest-0.081
mer-0.081
 alike-0.081
Token.
Token ID13
Feature activation+2.36e-02

Pos logit contributions
leness+0.257
lest+0.252
mer+0.242
 active+0.238
 alike+0.237
Neg logit contributions
OM-0.203
 a-0.176
 happened-0.174
 been-0.173
ena-0.172
Token It
Token ID632
Feature activation+1.51e-02

Pos logit contributions
OM+0.037
ena+0.031
 CA+0.029
emon+0.029
 been+0.029
Neg logit contributions
leness-0.040
ling-0.040
lest-0.039
 forth-0.038
mer-0.038
Token's
Token ID338
Feature activation-1.24e-02

Pos logit contributions
OM+0.018
 been+0.015
 happened+0.015
ena+0.015
 CA+0.014
Neg logit contributions
leness-0.029
mer-0.028
 active-0.028
lest-0.028
 alike-0.028
Token really
Token ID1107
Feature activation-7.08e-03

Pos logit contributions
 active+0.024
 alike+0.023
leness+0.023
 forth+0.023
 positive+0.023
Neg logit contributions
OM-0.016
 CA-0.012
emon-0.011
ena-0.011
 happened-0.011
Token active
Token ID4075
Feature activation-3.38e-01

Pos logit contributions
leness+0.011
lest+0.011
mer+0.011
 forth+0.011
ishly+0.010
Neg logit contributions
OM-0.011
 been-0.009
 happened-0.009
ena-0.009
emon-0.009
Token!"
Token ID2474
Feature activation+9.39e-02

Pos logit contributions
leness+0.586
lest+0.579
 forth+0.550
mer+0.549
 alike+0.548
Neg logit contributions
OM-0.460
 been-0.400
 happened-0.389
ena-0.386
 a-0.372
Token
Token ID198
Feature activation+6.97e-02

Pos logit contributions
OM+0.151
 CA+0.121
emon+0.120
 dentist+0.119
ena+0.113
Neg logit contributions
leness-0.141
lest-0.138
 alike-0.137
 active-0.132
 content-0.131
Token
Token ID198
Feature activation+6.94e-02

Pos logit contributions
OM+0.131
emon+0.104
 CA+0.101
 fault+0.100
 dentist+0.100
Neg logit contributions
oc-0.081
 content-0.079
leness-0.078
 together-0.075
 active-0.075
TokenMary
Token ID24119
Feature activation+1.05e-01

Pos logit contributions
OM+0.103
 happened+0.082
 CA+0.081
 dentist+0.079
 been+0.079
Neg logit contributions
leness-0.110
 active-0.109
lest-0.106
ling-0.104
 alike-0.104
Token smiled
Token ID13541
Feature activation+3.64e-02

Pos logit contributions
OM+0.157
 CA+0.124
emon+0.121
ena+0.120
 happened+0.119
Neg logit contributions
lest-0.173
leness-0.172
 active-0.172
mer-0.169
ling-0.168
Token happy
Token ID3772
Feature activation-2.32e-01

Pos logit contributions
OM+0.374
 been+0.329
ena+0.328
 happened+0.311
emon+0.309
Neg logit contributions
leness-0.320
lest-0.313
mer-0.305
 active-0.288
 forth-0.288
Token to
Token ID284
Feature activation-2.43e-02

Pos logit contributions
leness+0.478
lest+0.467
mer+0.442
ling+0.427
 active+0.426
Neg logit contributions
OM-0.255
 been-0.250
 a-0.247
 happened-0.242
 truth-0.226
Token see
Token ID766
Feature activation-3.16e-02

Pos logit contributions
lest+0.042
leness+0.042
mer+0.042
ling+0.041
 active+0.041
Neg logit contributions
OM-0.034
 been-0.029
ena-0.028
 happened-0.027
 CA-0.026
Token him
Token ID683
Feature activation+3.85e-02

Pos logit contributions
leness+0.054
lest+0.053
mer+0.053
 active+0.052
 forth+0.050
Neg logit contributions
OM-0.044
ena-0.037
 been-0.037
 happened-0.037
 dentist-0.035
Token so
Token ID523
Feature activation+3.70e-02

Pos logit contributions
OM+0.056
 been+0.047
ena+0.046
 happened+0.045
emon+0.044
Neg logit contributions
leness-0.064
lest-0.063
mer-0.062
 active-0.060
 alike-0.060
Token active
Token ID4075
Feature activation-3.38e-01

Pos logit contributions
 a+0.057
 been+0.053
OM+0.053
 happened+0.049
 was+0.048
Neg logit contributions
leness-0.056
mer-0.055
 forth-0.054
ishly-0.053
oc-0.052
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
lest+0.644
leness+0.632
 active+0.604
mer+0.586
 alike+0.586
Neg logit contributions
OM-0.398
 been-0.355
 happened-0.347
 a-0.339
ena-0.334
Token healthy
Token ID5448
Feature activation-2.89e-01

Pos logit contributions
mer+0.038
leness+0.035
oc+0.035
lest+0.035
ling+0.033
Neg logit contributions
 been-0.041
 a-0.041
OM-0.036
 happened-0.036
 was-0.035
Token.
Token ID13
Feature activation+1.77e-02

Pos logit contributions
lest+0.531
leness+0.529
 active+0.501
 alike+0.496
mer+0.494
Neg logit contributions
OM-0.358
 been-0.313
 happened-0.308
ena-0.294
 truth-0.289
Token And
Token ID843
Feature activation-1.86e-01

Pos logit contributions
OM+0.029
ena+0.024
 been+0.024
 CA+0.023
 happened+0.023
Neg logit contributions
leness-0.027
lest-0.027
mer-0.026
 active-0.026
ling-0.026
Token that
Token ID326
Feature activation-3.67e-02

Pos logit contributions
leness+0.295
mer+0.279
ling+0.272
lest+0.269
 active+0.266
Neg logit contributions
 a-0.278
 been-0.277
OM-0.275
 happened-0.257
 was-0.254
Token being
Token ID852
Feature activation+5.36e-03

Pos logit contributions
leness+0.077
lest+0.077
mer+0.075
 active+0.073
 alike+0.073
Neg logit contributions
OM-0.068
ena-0.057
 been-0.055
 happened-0.055
 CA-0.054
Token lazy
Token ID16931
Feature activation-2.06e-01

Pos logit contributions
 been+0.008
 a+0.008
OM+0.008
ena+0.008
 happened+0.007
Neg logit contributions
leness-0.009
lest-0.008
mer-0.008
ishly-0.008
 alike-0.008
Token to
Token ID284
Feature activation-2.28e-02

Pos logit contributions
leness+0.381
lest+0.376
 active+0.348
mer+0.347
 alike+0.343
Neg logit contributions
OM-0.262
 happened-0.231
 been-0.229
ena-0.224
 a-0.216
Token being
Token ID852
Feature activation+5.32e-03

Pos logit contributions
mer+0.038
lest+0.038
leness+0.038
ling+0.036
 active+0.036
Neg logit contributions
OM-0.034
 been-0.029
ena-0.028
 happened-0.027
 truth-0.027
Token an
Token ID281
Feature activation+1.21e-01

Pos logit contributions
 a+0.008
 been+0.008
 happened+0.007
ena+0.007
 an+0.007
Neg logit contributions
leness-0.009
lest-0.009
mer-0.008
ishly-0.008
 alike-0.008
Token active
Token ID4075
Feature activation-3.38e-01

Pos logit contributions
OM+0.256
ena+0.219
 happened+0.217
 CA+0.215
 been+0.215
Neg logit contributions
leness-0.127
lest-0.124
 active-0.123
mer-0.121
 alike-0.117
Token and
Token ID290
Feature activation-2.35e-02

Pos logit contributions
leness+0.575
lest+0.559
 active+0.534
 alike+0.533
ling+0.530
Neg logit contributions
OM-0.488
ena-0.413
 been-0.408
 happened-0.399
 dentist-0.392
Token happy
Token ID3772
Feature activation-2.30e-01

Pos logit contributions
mer+0.040
leness+0.039
ling+0.038
lest+0.038
ishly+0.037
Neg logit contributions
 been-0.035
 a-0.035
OM-0.032
 happened-0.032
 an-0.029
Token pet
Token ID4273
Feature activation+4.17e-02

Pos logit contributions
leness+0.421
lest+0.412
mer+0.393
fy+0.385
ishly+0.383
Neg logit contributions
OM-0.303
 happened-0.280
 truth-0.264
 been-0.262
 dentist-0.256
Token.
Token ID13
Feature activation+1.45e-02

Pos logit contributions
OM+0.052
ena+0.046
 happened+0.046
 a+0.044
 been+0.044
Neg logit contributions
leness-0.077
lest-0.074
 active-0.072
 forth-0.070
 energetic-0.069
Token He
Token ID679
Feature activation-1.19e-01

Pos logit contributions
OM+0.031
ena+0.026
 happened+0.026
emon+0.026
 been+0.026
Neg logit contributions
leness-0.015
lest-0.014
 active-0.014
mer-0.014
 alike-0.014
Token reminder
Token ID15438
Feature activation-6.43e-02

Pos logit contributions
 active+0.287
leness+0.284
lest+0.278
mer+0.278
 alike+0.272
Neg logit contributions
OM-0.271
ena-0.222
 CA-0.215
emon-0.212
 happened-0.208
Token to
Token ID284
Feature activation-2.08e-02

Pos logit contributions
leness+0.096
lest+0.093
mer+0.091
 active+0.091
 alike+0.090
Neg logit contributions
OM-0.108
ena-0.091
 been-0.090
 happened-0.089
emon-0.088
Token be
Token ID307
Feature activation+1.12e-01

Pos logit contributions
leness+0.034
lest+0.033
mer+0.033
 active+0.032
ling+0.032
Neg logit contributions
OM-0.032
ena-0.027
 been-0.026
 happened-0.026
emon-0.025
Token wise
Token ID10787
Feature activation-2.16e-01

Pos logit contributions
OM+0.178
 been+0.173
 a+0.168
ena+0.167
emon+0.159
Neg logit contributions
leness-0.166
lest-0.163
mer-0.153
 forth-0.150
ling-0.150
Token and
Token ID290
Feature activation-2.22e-02

Pos logit contributions
leness+0.381
lest+0.371
mer+0.350
 forth+0.349
ling+0.347
Neg logit contributions
OM-0.294
 been-0.258
ena-0.256
 happened-0.249
 a-0.240
Token active
Token ID4075
Feature activation-3.38e-01

Pos logit contributions
mer+0.040
ling+0.039
leness+0.039
lest+0.037
oc+0.036
Neg logit contributions
 been-0.031
 a-0.030
OM-0.028
 happened-0.027
 an-0.025
Token even
Token ID772
Feature activation-4.79e-02

Pos logit contributions
lest+0.642
leness+0.635
 forth+0.567
ling+0.567
cient+0.566
Neg logit contributions
OM-0.380
 a-0.377
 been-0.369
 happened-0.351
ena-0.340
Token when
Token ID618
Feature activation-3.40e-02

Pos logit contributions
leness+0.079
ishly+0.079
mer+0.074
lest+0.072
 alike+0.071
Neg logit contributions
 a-0.064
 been-0.060
OM-0.059
 happened-0.058
ena-0.058
Token having
Token ID1719
Feature activation-4.96e-02

Pos logit contributions
leness+0.050
lest+0.049
mer+0.049
 active+0.048
 alike+0.047
Neg logit contributions
OM-0.057
ena-0.047
 happened-0.047
 been-0.047
emon-0.046
Token fun
Token ID1257
Feature activation-9.84e-02

Pos logit contributions
leness+0.088
lest+0.088
mer+0.086
 active+0.084
 alike+0.084
Neg logit contributions
OM-0.067
 been-0.054
 happened-0.054
ena-0.053
emon-0.051
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
lest+0.166
mer+0.165
leness+0.165
cient+0.163
oc+0.159
Neg logit contributions
 happened-0.123
OM-0.118
 a-0.116
 been-0.114
 time-0.107
Token lazy
Token ID16931
Feature activation-2.05e-01

Pos logit contributions
OM+0.033
 happened+0.028
ena+0.028
 been+0.027
 truth+0.027
Neg logit contributions
leness-0.033
lest-0.032
mer-0.031
ling-0.030
 alike-0.030
Token robot
Token ID9379
Feature activation+1.35e-02

Pos logit contributions
leness+0.336
lest+0.331
mer+0.319
 alike+0.316
 active+0.315
Neg logit contributions
OM-0.307
 happened-0.259
 been-0.257
ena-0.252
 truth-0.245
Token had
Token ID550
Feature activation+7.29e-02

Pos logit contributions
OM+0.011
 been+0.008
 happened+0.008
ena+0.008
 CA+0.007
Neg logit contributions
leness-0.031
lest-0.031
mer-0.030
 active-0.030
 alike-0.030
Token become
Token ID1716
Feature activation+1.22e-01

Pos logit contributions
OM+0.095
ena+0.072
emon+0.070
 CA+0.069
 dentist+0.067
Neg logit contributions
leness-0.137
 active-0.135
lest-0.132
 alike-0.132
mer-0.130
Token so
Token ID523
Feature activation+3.60e-02

Pos logit contributions
OM+0.141
 been+0.124
ena+0.122
 happened+0.121
 a+0.119
Neg logit contributions
leness-0.235
lest-0.233
mer-0.223
 forth-0.217
ling-0.215
Token active
Token ID4075
Feature activation-3.37e-01

Pos logit contributions
 a+0.051
 been+0.051
 happened+0.048
 truth+0.045
 an+0.044
Neg logit contributions
 forth-0.060
oc-0.058
leness-0.057
mer-0.057
ishly-0.056
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
lest+0.667
leness+0.661
 active+0.612
cient+0.608
mer+0.599
Neg logit contributions
OM-0.374
 happened-0.347
 been-0.344
 a-0.335
 truth-0.312
Token<|endoftext|>
Token ID50256
Feature activation+8.04e-02

Pos logit contributions
OM+0.030
ena+0.025
 happened+0.024
 been+0.024
emon+0.024
Neg logit contributions
leness-0.030
lest-0.030
mer-0.029
 active-0.029
 alike-0.029
TokenOnce
Token ID7454
Feature activation+4.48e-02

Pos logit contributions
OM+0.150
emon+0.127
ena+0.123
 dentist+0.120
OU+0.120
Neg logit contributions
mer-0.108
 together-0.096
ling-0.095
 alike-0.092
oc-0.092
Token upon
Token ID2402
Feature activation+1.20e-01

Pos logit contributions
OM+0.061
ena+0.047
emon+0.047
 CA+0.046
 been+0.045
Neg logit contributions
lest-0.080
leness-0.080
 alike-0.079
mer-0.078
 active-0.078
Token a
Token ID257
Feature activation+1.94e-01

Pos logit contributions
OM+0.217
omm+0.181
anger+0.180
emon+0.170
ACK+0.162
Neg logit contributions
 active-0.180
 content-0.174
 energetic-0.169
 together-0.168
 happy-0.166
Token.
Token ID13
Feature activation+3.09e-02

Pos logit contributions
lest+0.282
leness+0.282
 active+0.269
mer+0.261
fy+0.256
Neg logit contributions
OM-0.173
 been-0.148
 happened-0.148
 a-0.146
 CA-0.137
Token The
Token ID383
Feature activation-5.83e-02

Pos logit contributions
OM+0.054
 CA+0.045
 been+0.045
 happened+0.045
ena+0.044
Neg logit contributions
leness-0.043
lest-0.043
mer-0.042
 active-0.041
 alike-0.040
Token squirrel
Token ID33039
Feature activation+1.03e-01

Pos logit contributions
 active+0.084
 forth+0.082
leness+0.081
lest+0.080
ling+0.079
Neg logit contributions
OM-0.104
ena-0.090
emon-0.087
abella-0.086
 happened-0.081
Token was
Token ID373
Feature activation+2.23e-01

Pos logit contributions
OM+0.081
 happened+0.067
 been+0.065
 CA+0.061
 was+0.061
Neg logit contributions
leness-0.236
 active-0.225
 positive-0.222
lest-0.222
mer-0.221
Token very
Token ID845
Feature activation-7.44e-02

Pos logit contributions
OM+0.424
ena+0.349
emon+0.344
 happened+0.342
 CA+0.342
Neg logit contributions
 active-0.288
leness-0.283
 alike-0.277
lest-0.276
mer-0.274
Token active
Token ID4075
Feature activation-3.37e-01

Pos logit contributions
leness+0.127
ishly+0.127
lest+0.117
 forth+0.113
ience+0.112
Neg logit contributions
 a-0.098
OM-0.096
ila-0.090
 been-0.089
 happened-0.089
Token and
Token ID290
Feature activation-2.10e-02

Pos logit contributions
leness+0.560
lest+0.555
mer+0.533
 active+0.533
 alike+0.524
Neg logit contributions
OM-0.503
ena-0.415
 been-0.410
 happened-0.408
 CA-0.393
Token didn
Token ID1422
Feature activation+9.48e-02

Pos logit contributions
leness+0.041
lest+0.040
mer+0.039
ling+0.037
 alike+0.037
Neg logit contributions
OM-0.023
 been-0.022
 happened-0.022
 a-0.021
 was-0.019
Token't
Token ID470
Feature activation+5.96e-02

Pos logit contributions
OM+0.173
ena+0.159
anya+0.149
 happened+0.148
emon+0.146
Neg logit contributions
 active-0.105
 forward-0.104
lest-0.102
othe-0.100
bing-0.099
Token want
Token ID765
Feature activation-9.39e-02

Pos logit contributions
OM+0.099
 been+0.084
ena+0.084
 happened+0.083
 CA+0.081
Neg logit contributions
leness-0.087
lest-0.085
mer-0.084
 active-0.083
 forth-0.081
Token to
Token ID284
Feature activation-2.26e-02

Pos logit contributions
 alike+0.105
leness+0.104
 active+0.101
lest+0.100
mer+0.095
Neg logit contributions
OM-0.194
ena-0.174
emon-0.155
 dentist-0.151
anya-0.150
Token't
Token ID470
Feature activation+6.07e-02

Pos logit contributions
lest+0.132
leness+0.131
mer+0.130
 active+0.129
ling+0.127
Neg logit contributions
OM-0.111
ena-0.096
emon-0.091
 happened-0.089
 dentist-0.087
Token be
Token ID307
Feature activation+1.14e-01

Pos logit contributions
OM+0.107
 been+0.094
 happened+0.086
 a+0.086
 truth+0.085
Neg logit contributions
leness-0.088
mer-0.087
 alike-0.084
lest-0.083
ling-0.080
Token lazy
Token ID16931
Feature activation-2.02e-01

Pos logit contributions
OM+0.193
ena+0.169
 been+0.169
 happened+0.164
 a+0.161
Neg logit contributions
leness-0.161
lest-0.160
mer-0.157
 alike-0.148
 forth-0.145
Token,
Token ID11
Feature activation-5.95e-02

Pos logit contributions
leness+0.365
lest+0.350
mer+0.334
oc+0.330
 alike+0.325
Neg logit contributions
OM-0.255
 happened-0.236
 been-0.235
 a-0.228
 truth-0.225
Token being
Token ID852
Feature activation+6.39e-03

Pos logit contributions
leness+0.128
mer+0.122
ging+0.112
lest+0.111
 alike+0.111
Neg logit contributions
 a-0.065
OM-0.062
 truth-0.058
 been-0.054
 something-0.053
Token active
Token ID4075
Feature activation-3.36e-01

Pos logit contributions
OM+0.012
 been+0.011
ena+0.010
 happened+0.010
 a+0.010
Neg logit contributions
leness-0.008
lest-0.008
mer-0.008
 alike-0.007
ling-0.007
Token and
Token ID290
Feature activation-1.88e-02

Pos logit contributions
leness+0.587
lest+0.576
 active+0.547
mer+0.547
 alike+0.543
Neg logit contributions
OM-0.462
 been-0.393
 happened-0.383
ena-0.383
 truth-0.367
Token doing
Token ID1804
Feature activation-1.05e-01

Pos logit contributions
mer+0.033
oc+0.031
lest+0.031
leness+0.031
ling+0.029
Neg logit contributions
 been-0.027
OM-0.026
 a-0.025
 happened-0.021
 truth-0.021
Token exercise
Token ID5517
Feature activation-4.85e-02

Pos logit contributions
leness+0.181
lest+0.180
mer+0.179
 alike+0.172
 active+0.172
Neg logit contributions
OM-0.150
ena-0.121
 been-0.120
 happened-0.118
emon-0.115
Token makes
Token ID1838
Feature activation-3.87e-02

Pos logit contributions
leness+0.082
lest+0.082
 forth+0.075
mer+0.075
 alike+0.075
Neg logit contributions
OM-0.069
 been-0.059
 a-0.058
ena-0.057
 happened-0.056
Token us
Token ID514
Feature activation+6.05e-02

Pos logit contributions
leness+0.053
lest+0.052
mer+0.051
 active+0.050
 alike+0.049
Neg logit contributions
OM-0.069
ena-0.058
 been-0.057
 happened-0.057
emon-0.057
Token would
Token ID561
Feature activation+4.63e-02

Pos logit contributions
leness+0.032
mer+0.030
lest+0.030
 active+0.030
 energetic+0.029
Neg logit contributions
OM-0.019
ena-0.017
 been-0.016
 happened-0.016
 dentist-0.015
Token remind
Token ID7101
Feature activation-1.30e-01

Pos logit contributions
OM+0.059
 been+0.051
 a+0.048
 truth+0.047
ena+0.047
Neg logit contributions
mer-0.089
lest-0.087
leness-0.086
ling-0.082
 active-0.081
Token her
Token ID607
Feature activation+2.52e-02

Pos logit contributions
 alike+0.189
leness+0.180
lest+0.174
 forth+0.173
 grate+0.172
Neg logit contributions
OM-0.221
 CA-0.190
emon-0.184
 dentist-0.177
 happened-0.173
Token to
Token ID284
Feature activation-1.90e-02

Pos logit contributions
OM+0.040
ena+0.033
 been+0.033
 happened+0.033
emon+0.032
Neg logit contributions
leness-0.040
lest-0.039
mer-0.039
 active-0.038
 alike-0.037
Token stay
Token ID2652
Feature activation+3.48e-02

Pos logit contributions
leness+0.032
lest+0.032
mer+0.031
 active+0.031
ling+0.030
Neg logit contributions
OM-0.028
 been-0.023
ena-0.023
 happened-0.022
emon-0.022
Token active
Token ID4075
Feature activation-3.35e-01

Pos logit contributions
 a+0.053
 something+0.047
 an+0.046
 been+0.044
OM+0.042
Neg logit contributions
lest-0.061
leness-0.059
cient-0.055
mer-0.055
ishly-0.052
Token and
Token ID290
Feature activation-1.98e-02

Pos logit contributions
lest+0.663
leness+0.660
 alike+0.609
mer+0.601
 active+0.592
Neg logit contributions
OM-0.360
 a-0.341
 been-0.324
ena-0.309
 truth-0.296
Token have
Token ID423
Feature activation-6.14e-03

Pos logit contributions
mer+0.037
leness+0.036
ling+0.035
lest+0.035
oc+0.034
Neg logit contributions
 been-0.026
 a-0.025
OM-0.025
 something-0.021
 happened-0.021
Token fun
Token ID1257
Feature activation-9.69e-02

Pos logit contributions
 active+0.010
leness+0.010
 alike+0.010
 content+0.010
 energetic+0.010
Neg logit contributions
OM-0.009
ena-0.007
 CA-0.007
emon-0.007
 happened-0.007
Token,
Token ID11
Feature activation-6.57e-02

Pos logit contributions
leness+0.193
lest+0.192
cient+0.190
mer+0.187
ishly+0.185
Neg logit contributions
 a-0.092
OM-0.091
 been-0.090
 happened-0.087
 time-0.077
Token but
Token ID475
Feature activation-6.41e-02

Pos logit contributions
leness+0.116
mer+0.114
lest+0.112
ling+0.111
 active+0.106
Neg logit contributions
OM-0.090
 been-0.082
 a-0.077
ena-0.076
 truth-0.073
Token to
Token ID284
Feature activation-1.99e-02

Pos logit contributions
OM+0.066
 been+0.057
 happened+0.055
 a+0.054
ena+0.054
Neg logit contributions
leness-0.096
lest-0.091
 active-0.087
ling-0.085
mer-0.085
Token have
Token ID423
Feature activation-7.33e-03

Pos logit contributions
leness+0.027
lest+0.026
 active+0.026
mer+0.026
 alike+0.025
Neg logit contributions
OM-0.036
ena-0.031
 happened-0.030
 been-0.030
emon-0.030
Token fun
Token ID1257
Feature activation-9.79e-02

Pos logit contributions
leness+0.013
 active+0.013
 alike+0.013
mer+0.013
lest+0.012
Neg logit contributions
OM-0.010
ena-0.008
 CA-0.008
emon-0.007
 Ow-0.007
Token and
Token ID290
Feature activation-1.91e-02

Pos logit contributions
leness+0.175
lest+0.174
mer+0.166
ishly+0.161
 alike+0.161
Neg logit contributions
OM-0.129
 been-0.113
 happened-0.113
 a-0.110
ena-0.108
Token be
Token ID307
Feature activation+1.10e-01

Pos logit contributions
mer+0.031
lest+0.030
leness+0.030
ling+0.029
 alike+0.029
Neg logit contributions
OM-0.029
 been-0.027
 happened-0.025
 a-0.025
ena-0.024
Token active
Token ID4075
Feature activation-3.34e-01

Pos logit contributions
OM+0.148
 been+0.140
 a+0.136
ena+0.135
 happened+0.124
Neg logit contributions
leness-0.196
lest-0.194
ishly-0.179
mer-0.179
 alike-0.175
Token.
Token ID13
Feature activation+3.13e-02

Pos logit contributions
lest+0.645
leness+0.644
mer+0.581
ling+0.572
 forth+0.571
Neg logit contributions
OM-0.401
 a-0.351
 been-0.350
ena-0.345
 happened-0.326
Token The
Token ID383
Feature activation-5.96e-02

Pos logit contributions
OM+0.063
emon+0.055
ena+0.055
 happened+0.053
 been+0.051
Neg logit contributions
leness-0.033
 active-0.033
 content-0.033
oc-0.032
 energetic-0.032
Token lazy
Token ID16931
Feature activation-2.05e-01

Pos logit contributions
leness+0.122
lest+0.120
mer+0.118
 alike+0.117
 active+0.116
Neg logit contributions
OM-0.068
ena-0.053
 been-0.050
 happened-0.049
emon-0.048
Token human
Token ID1692
Feature activation-4.41e-02

Pos logit contributions
leness+0.336
 alike+0.332
lest+0.325
mer+0.320
 active+0.315
Neg logit contributions
OM-0.306
ena-0.259
 happened-0.255
 been-0.248
 truth-0.241
Token then
Token ID788
Feature activation+1.33e-02

Pos logit contributions
leness+0.095
lest+0.092
mer+0.089
 alike+0.089
 active+0.088
Neg logit contributions
OM-0.045
 been-0.036
ena-0.036
 happened-0.034
emon-0.031
Token Jack
Token ID3619
Feature activation-2.33e-02

Pos logit contributions
OM+0.061
 a+0.052
 happened+0.051
 been+0.050
 truth+0.049
Neg logit contributions
lest-0.068
leness-0.066
mer-0.060
ling-0.059
edient-0.056
Token was
Token ID373
Feature activation+2.32e-01

Pos logit contributions
leness+0.039
lest+0.038
 active+0.038
mer+0.037
 alike+0.037
Neg logit contributions
OM-0.035
ena-0.028
emon-0.027
 happened-0.027
 been-0.027
Token a
Token ID257
Feature activation+1.88e-01

Pos logit contributions
OM+0.303
ena+0.235
 happened+0.227
emon+0.225
 been+0.225
Neg logit contributions
leness-0.431
lest-0.426
 active-0.423
mer-0.420
 alike-0.415
Token happy
Token ID3772
Feature activation-2.23e-01

Pos logit contributions
OM+0.244
 a+0.233
 dentist+0.224
 CA+0.211
 happened+0.206
Neg logit contributions
leness-0.374
lest-0.323
ishly-0.322
 forth-0.320
ling-0.312
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
 active+0.340
lest+0.337
mer+0.336
leness+0.336
 alike+0.329
Neg logit contributions
OM-0.356
ena-0.292
emon-0.286
 been-0.284
 CA-0.280
Token active
Token ID4075
Feature activation-3.31e-01

Pos logit contributions
leness+0.025
mer+0.025
lest+0.024
ishly+0.023
ling+0.023
Neg logit contributions
OM-0.021
 been-0.020
 a-0.020
 happened-0.019
ena-0.017
Token child
Token ID1200
Feature activation+8.71e-02

Pos logit contributions
leness+0.539
lest+0.530
mer+0.525
 active+0.524
 alike+0.513
Neg logit contributions
OM-0.506
ena-0.412
 been-0.400
 happened-0.400
emon-0.398
Token who
Token ID508
Feature activation-6.59e-02

Pos logit contributions
OM+0.079
 been+0.078
ena+0.073
 a+0.072
 happened+0.067
Neg logit contributions
leness-0.182
lest-0.177
 forth-0.172
mer-0.172
 active-0.169
Token loved
Token ID6151
Feature activation-7.73e-02

Pos logit contributions
leness+0.136
mer+0.134
lest+0.134
 forth+0.132
ishly+0.129
Neg logit contributions
 been-0.069
 happened-0.064
OM-0.061
 a-0.060
 was-0.059
Token to
Token ID284
Feature activation-1.63e-02

Pos logit contributions
leness+0.150
lest+0.148
mer+0.147
ling+0.139
ishly+0.137
Neg logit contributions
OM-0.095
 been-0.086
 happened-0.083
 a-0.081
 dentist-0.078
Token play
Token ID711
Feature activation-1.84e-02

Pos logit contributions
mer+0.024
leness+0.023
lest+0.023
ling+0.022
 alike+0.022
Neg logit contributions
OM-0.028
 been-0.024
ena-0.024
 happened-0.024
 CA-0.023
Token.
Token ID13
Feature activation+3.95e-02

Pos logit contributions
OM+0.022
emon+0.018
 CA+0.017
ena+0.017
 happened+0.017
Neg logit contributions
 active-0.017
mer-0.017
leness-0.016
 alike-0.016
ling-0.016
Token Spot
Token ID15899
Feature activation+1.15e-02

Pos logit contributions
OM+0.062
ena+0.051
 been+0.050
 happened+0.050
 CA+0.049
Neg logit contributions
leness-0.063
lest-0.062
mer-0.061
 active-0.060
 alike-0.060
Token was
Token ID373
Feature activation+2.28e-01

Pos logit contributions
OM+0.020
emon+0.016
ena+0.016
 CA+0.016
 dentist+0.015
Neg logit contributions
 active-0.017
mer-0.016
leness-0.016
lest-0.016
ling-0.016
Token always
Token ID1464
Feature activation-6.50e-02

Pos logit contributions
OM+0.304
ena+0.237
 happened+0.229
 been+0.228
emon+0.228
Neg logit contributions
leness-0.419
lest-0.413
 active-0.409
mer-0.407
 alike-0.401
Token very
Token ID845
Feature activation-7.20e-02

Pos logit contributions
leness+0.111
lest+0.109
mer+0.100
ishly+0.098
oc+0.096
Neg logit contributions
OM-0.095
 been-0.089
 a-0.087
ena-0.083
 happened-0.080
Token active
Token ID4075
Feature activation-3.31e-01

Pos logit contributions
ishly+0.128
leness+0.127
 forth+0.123
lest+0.119
ized+0.113
Neg logit contributions
 a-0.100
 been-0.092
ila-0.087
OM-0.086
ily-0.084
Token and
Token ID290
Feature activation-1.61e-02

Pos logit contributions
leness+0.567
lest+0.561
mer+0.530
 active+0.529
 alike+0.525
Neg logit contributions
OM-0.476
ena-0.400
 been-0.396
 happened-0.387
emon-0.369
Token full
Token ID1336
Feature activation-1.70e-01

Pos logit contributions
leness+0.032
lest+0.032
mer+0.032
ized+0.031
ling+0.030
Neg logit contributions
 been-0.017
 a-0.016
OM-0.016
 happened-0.015
 was-0.014
Token of
Token ID286
Feature activation-1.19e-01

Pos logit contributions
leness+0.244
lest+0.240
mer+0.236
 active+0.232
 alike+0.228
Neg logit contributions
OM-0.294
ena-0.246
 been-0.244
 happened-0.243
emon-0.239
Token energy
Token ID2568
Feature activation+7.08e-03

Pos logit contributions
lest+0.189
mer+0.187
leness+0.186
 alike+0.176
ling+0.175
Neg logit contributions
OM-0.182
 been-0.156
ena-0.153
 happened-0.152
 truth-0.150
Token.
Token ID13
Feature activation+3.86e-02

Pos logit contributions
 a+0.006
 happened+0.006
 been+0.006
OM+0.006
 truth+0.005
Neg logit contributions
leness-0.015
lest-0.015
 forth-0.015
mer-0.014
 active-0.014
Token old
Token ID1468
Feature activation-7.54e-02

Pos logit contributions
leness+0.113
ishly+0.108
 forth+0.106
mer+0.104
lest+0.104
Neg logit contributions
 dentist-0.048
OM-0.046
 truth-0.045
ily-0.043
 a-0.042
Token.
Token ID13
Feature activation+4.14e-02

Pos logit contributions
leness+0.133
lest+0.130
 active+0.127
mer+0.126
 alike+0.123
Neg logit contributions
OM-0.105
ena-0.087
 been-0.085
 happened-0.084
 CA-0.081
Token Jack
Token ID3619
Feature activation-2.25e-02

Pos logit contributions
OM+0.080
ena+0.068
 happened+0.068
 been+0.068
 truth+0.066
Neg logit contributions
lest-0.055
leness-0.054
mer-0.051
ling-0.049
 forth-0.048
Token was
Token ID373
Feature activation+2.30e-01

Pos logit contributions
leness+0.039
lest+0.039
mer+0.038
 active+0.038
 alike+0.037
Neg logit contributions
OM-0.031
ena-0.026
 been-0.025
 happened-0.025
 CA-0.024
Token very
Token ID845
Feature activation-7.11e-02

Pos logit contributions
OM+0.270
 been+0.230
ena+0.227
 a+0.220
 happened+0.212
Neg logit contributions
leness-0.468
lest-0.450
mer-0.446
ling-0.425
 forth-0.418
Token active
Token ID4075
Feature activation-3.29e-01

Pos logit contributions
leness+0.125
ishly+0.124
 forth+0.119
ience+0.109
mer+0.109
Neg logit contributions
 a-0.104
 been-0.093
OM-0.091
ila-0.090
ena-0.089
Token and
Token ID290
Feature activation-1.46e-02

Pos logit contributions
leness+0.624
lest+0.616
mer+0.564
 active+0.557
 alike+0.555
Neg logit contributions
OM-0.427
 been-0.365
ena-0.365
 a-0.352
 happened-0.351
Token loved
Token ID6151
Feature activation-7.90e-02

Pos logit contributions
mer+0.031
leness+0.031
lest+0.029
ishly+0.029
ishment+0.029
Neg logit contributions
 been-0.014
 a-0.014
OM-0.013
 happened-0.013
 was-0.012
Token to
Token ID284
Feature activation-1.78e-02

Pos logit contributions
leness+0.139
lest+0.137
mer+0.137
ling+0.129
 alike+0.128
Neg logit contributions
OM-0.111
 been-0.094
 happened-0.094
ena-0.091
 a-0.087
Token run
Token ID1057
Feature activation-2.91e-02

Pos logit contributions
leness+0.025
lest+0.025
mer+0.025
 alike+0.024
 active+0.024
Neg logit contributions
OM-0.031
ena-0.026
 been-0.025
 happened-0.025
 CA-0.025
Token around
Token ID1088
Feature activation-1.39e-01

Pos logit contributions
lest+0.044
leness+0.043
mer+0.038
 forth+0.038
 alike+0.038
Neg logit contributions
OM-0.046
 a-0.043
 been-0.041
 happened-0.040
ena-0.039
Token you
Token ID345
Feature activation-6.28e-02

Pos logit contributions
leness+0.177
mer+0.174
lest+0.170
ling+0.165
 active+0.164
Neg logit contributions
OM-0.147
 been-0.127
 happened-0.120
ena-0.119
 truth-0.119
Token also
Token ID635
Feature activation-3.08e-02

Pos logit contributions
leness+0.102
lest+0.101
mer+0.098
 active+0.098
ling+0.096
Neg logit contributions
OM-0.096
ena-0.080
 been-0.080
 happened-0.077
emon-0.076
Token need
Token ID761
Feature activation-1.02e-01

Pos logit contributions
lest+0.056
mer+0.055
 forth+0.052
ling+0.052
leness+0.052
Neg logit contributions
 been-0.040
OM-0.039
 a-0.037
 truth-0.036
ena-0.035
Token to
Token ID284
Feature activation-4.65e-02

Pos logit contributions
 active+0.100
 positive+0.098
 alike+0.095
 content+0.093
leness+0.092
Neg logit contributions
OM-0.217
ena-0.187
 CA-0.181
anya-0.177
emon-0.175
Token be
Token ID307
Feature activation+8.93e-02

Pos logit contributions
leness+0.078
lest+0.078
mer+0.078
 active+0.075
 alike+0.075
Neg logit contributions
OM-0.068
ena-0.055
 been-0.055
emon-0.053
 happened-0.053
Token responsible
Token ID4497
Feature activation-3.27e-01

Pos logit contributions
 been+0.140
 a+0.139
OM+0.132
ena+0.129
 something+0.121
Neg logit contributions
lest-0.138
mer-0.135
leness-0.135
oc-0.124
ishly-0.123
Token and
Token ID290
Feature activation-5.91e-02

Pos logit contributions
leness+0.617
lest+0.603
mer+0.576
 active+0.568
 alike+0.567
Neg logit contributions
OM-0.408
 been-0.348
ena-0.343
 truth-0.324
 a-0.322
Token careful
Token ID8161
Feature activation-2.62e-01

Pos logit contributions
mer+0.100
leness+0.095
lest+0.093
oc+0.093
ishly+0.090
Neg logit contributions
 been-0.093
 a-0.089
OM-0.085
 something-0.078
 happened-0.075
Token.
Token ID13
Feature activation-2.19e-02

Pos logit contributions
leness+0.460
lest+0.444
mer+0.441
 alike+0.432
 active+0.426
Neg logit contributions
OM-0.374
 been-0.308
ena-0.306
 happened-0.292
 truth-0.287
Token Dirt
Token ID49124
Feature activation+5.41e-03

Pos logit contributions
leness+0.026
 active+0.025
lest+0.024
mer+0.024
 alike+0.023
Neg logit contributions
OM-0.043
 happened-0.037
ena-0.037
 been-0.036
emon-0.036
Token is
Token ID318
Feature activation+1.44e-01

Pos logit contributions
OM+0.005
 happened+0.003
 been+0.003
ena+0.003
 CA+0.003
Neg logit contributions
leness-0.012
 active-0.012
lest-0.012
mer-0.012
 alike-0.012
Token.
Token ID13
Feature activation+4.54e-02

Pos logit contributions
OM+0.214
emon+0.171
 CA+0.169
ena+0.162
 dentist+0.161
Neg logit contributions
lest-0.150
ling-0.150
 active-0.148
mer-0.145
 alike-0.145
Token She
Token ID1375
Feature activation-1.03e-01

Pos logit contributions
OM+0.086
ena+0.073
 CA+0.072
emon+0.071
 happened+0.071
Neg logit contributions
leness-0.055
 active-0.055
 alike-0.054
mer-0.053
lest-0.053
Token was
Token ID373
Feature activation+2.35e-01

Pos logit contributions
 active+0.114
 energetic+0.097
ling+0.096
mer+0.089
lest+0.089
Neg logit contributions
OM-0.233
emon-0.196
 fault-0.195
 Ow-0.187
ena-0.186
Token a
Token ID257
Feature activation+1.92e-01

Pos logit contributions
OM+0.350
 CA+0.267
 happened+0.266
ena+0.265
emon+0.263
Neg logit contributions
 active-0.406
 alike-0.389
 energetic-0.385
lest-0.385
leness-0.379
Token very
Token ID845
Feature activation-6.53e-02

Pos logit contributions
OM+0.242
 a+0.230
 dentist+0.214
ena+0.206
 CA+0.201
Neg logit contributions
leness-0.388
lest-0.338
ishly-0.329
 forth-0.329
ling-0.327
Token active
Token ID4075
Feature activation-3.26e-01

Pos logit contributions
leness+0.119
ishly+0.113
 forth+0.112
bing+0.108
lest+0.103
Neg logit contributions
 a-0.083
OM-0.082
ily-0.081
ila-0.080
 happened-0.078
Token little
Token ID1310
Feature activation+2.78e-02

Pos logit contributions
leness+0.514
lest+0.504
mer+0.490
 active+0.489
 alike+0.482
Neg logit contributions
OM-0.517
ena-0.427
 been-0.415
 happened-0.414
 CA-0.409
Token girl
Token ID2576
Feature activation+4.06e-02

Pos logit contributions
 a+0.024
OM+0.022
 truth+0.020
 been+0.019
 dentist+0.019
Neg logit contributions
leness-0.065
lest-0.058
ized-0.056
bing-0.055
reci-0.055
Token.
Token ID13
Feature activation+4.46e-02

Pos logit contributions
OM+0.046
ena+0.045
 been+0.043
 a+0.042
 happened+0.039
Neg logit contributions
leness-0.080
lest-0.077
ling-0.072
 forth-0.072
 active-0.071
Token She
Token ID1375
Feature activation-1.04e-01

Pos logit contributions
OM+0.083
ena+0.069
 CA+0.069
emon+0.068
 happened+0.067
Neg logit contributions
 alike-0.055
 active-0.055
leness-0.055
mer-0.053
 energetic-0.052
Token liked
Token ID8288
Feature activation-4.20e-02

Pos logit contributions
 active+0.126
 energetic+0.112
 alike+0.112
mer+0.111
lest+0.110
Neg logit contributions
OM-0.217
emon-0.179
 CA-0.176
 Ow-0.173
ena-0.172
Token It
Token ID632
Feature activation-2.74e-03

Pos logit contributions
leness+0.016
ling+0.016
lest+0.016
mer+0.015
 forth+0.015
Neg logit contributions
OM-0.019
ena-0.017
 been-0.016
 CA-0.016
emon-0.016
Token teaches
Token ID17324
Feature activation-8.59e-02

Pos logit contributions
leness+0.006
lest+0.006
mer+0.006
 active+0.006
 alike+0.006
Neg logit contributions
OM-0.003
ena-0.002
 been-0.002
 happened-0.002
 CA-0.002
Token you
Token ID345
Feature activation-6.18e-02

Pos logit contributions
leness+0.110
 alike+0.104
 active+0.103
lest+0.102
 forth+0.101
Neg logit contributions
OM-0.166
emon-0.134
 CA-0.134
ena-0.133
 happened-0.131
Token to
Token ID284
Feature activation-4.40e-02

Pos logit contributions
leness+0.104
mer+0.103
lest+0.103
 active+0.100
 forth+0.099
Neg logit contributions
OM-0.089
ena-0.074
 been-0.073
 happened-0.072
emon-0.070
Token be
Token ID307
Feature activation+9.14e-02

Pos logit contributions
lest+0.075
mer+0.075
leness+0.072
ling+0.070
 active+0.069
Neg logit contributions
OM-0.066
 been-0.056
ena-0.056
 truth-0.052
 happened-0.051
Token responsible
Token ID4497
Feature activation-3.26e-01

Pos logit contributions
OM+0.137
 a+0.137
 been+0.135
ena+0.134
 happened+0.121
Neg logit contributions
lest-0.143
mer-0.139
leness-0.138
ishly-0.128
 alike-0.123
Token and
Token ID290
Feature activation-5.13e-02

Pos logit contributions
leness+0.628
lest+0.613
 active+0.578
 forth+0.577
mer+0.577
Neg logit contributions
OM-0.387
ena-0.334
 been-0.332
 truth-0.318
 a-0.316
Token organized
Token ID8389
Feature activation-2.56e-01

Pos logit contributions
mer+0.089
ishly+0.085
leness+0.082
lest+0.081
rily+0.079
Neg logit contributions
 a-0.077
 been-0.074
 something-0.066
OM-0.066
 not-0.065
Token.
Token ID13
Feature activation-1.25e-02

Pos logit contributions
lest+0.483
leness+0.483
 active+0.456
mer+0.455
 forth+0.451
Neg logit contributions
OM-0.293
ena-0.263
 a-0.263
 been-0.256
 truth-0.252
Token It
Token ID632
Feature activation-4.99e-03

Pos logit contributions
leness+0.017
lest+0.017
mer+0.016
ling+0.016
 active+0.016
Neg logit contributions
OM-0.023
ena-0.020
 been-0.019
 CA-0.019
emon-0.019
Token also
Token ID635
Feature activation-2.67e-02

Pos logit contributions
leness+0.011
lest+0.011
mer+0.011
 active+0.011
 alike+0.010
Neg logit contributions
OM-0.005
 been-0.003
ena-0.003
 happened-0.003
 CA-0.003
Token rules
Token ID3173
Feature activation+8.48e-03

Pos logit contributions
OM+0.004
 truth+0.003
 been+0.003
 dentist+0.003
 happened+0.003
Neg logit contributions
leness-0.007
lest-0.007
mer-0.007
 active-0.007
 alike-0.007
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
OM+0.009
ena+0.008
 been+0.008
 a+0.008
 happened+0.007
Neg logit contributions
leness-0.017
 active-0.017
lest-0.017
 forth-0.017
 energetic-0.016
Token Be
Token ID1355
Feature activation+1.67e-02

Pos logit contributions
leness+0.020
lest+0.020
mer+0.019
 active+0.019
 alike+0.019
Neg logit contributions
OM-0.025
ena-0.021
 been-0.021
 happened-0.021
 CA-0.021
Token careful
Token ID8161
Feature activation-2.60e-01

Pos logit contributions
OM+0.028
ena+0.027
 been+0.027
 a+0.026
 happened+0.024
Neg logit contributions
leness-0.023
lest-0.023
ishly-0.022
mer-0.022
 alike-0.021
Token and
Token ID290
Feature activation-5.45e-02

Pos logit contributions
leness+0.491
mer+0.471
lest+0.463
 alike+0.460
 active+0.456
Neg logit contributions
OM-0.344
 been-0.290
ena-0.287
 a-0.271
 happened-0.259
Token responsible
Token ID4497
Feature activation-3.26e-01

Pos logit contributions
mer+0.105
umm+0.102
ishly+0.101
rily+0.098
ized+0.096
Neg logit contributions
 a-0.073
 been-0.070
 not-0.062
 something-0.060
 blood-0.058
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
leness+0.635
lest+0.621
 active+0.620
 energetic+0.615
mer+0.603
Neg logit contributions
OM-0.361
 been-0.317
ena-0.310
 happened-0.304
 a-0.303
Token<|endoftext|>
Token ID50256
Feature activation+6.33e-02

Pos logit contributions
leness+0.023
lest+0.022
mer+0.022
 active+0.022
 alike+0.021
Neg logit contributions
OM-0.025
ena-0.020
 been-0.020
 happened-0.020
emon-0.020
TokenL
Token ID43
Feature activation+1.01e-01

Pos logit contributions
OM+0.103
anya+0.084
ena+0.082
abella+0.076
emon+0.075
Neg logit contributions
mer-0.102
ling-0.092
 together-0.091
oc-0.088
 forward-0.088
Tokenily
Token ID813
Feature activation+1.36e-01

Pos logit contributions
OM+0.172
tr+0.163
 CA+0.135
 happened+0.126
Whe+0.117
Neg logit contributions
overs-0.135
ling-0.132
 together-0.126
s-0.120
 alike-0.118
Token and
Token ID290
Feature activation-3.44e-02

Pos logit contributions
OM+0.220
ena+0.175
emon+0.173
 CA+0.171
 been+0.170
Neg logit contributions
leness-0.210
mer-0.206
lest-0.206
 active-0.206
 alike-0.201
Token.
Token ID13
Feature activation-2.86e-03

Pos logit contributions
lest+0.103
leness+0.102
mer+0.098
 active+0.098
 efficient+0.094
Neg logit contributions
OM-0.072
ena-0.061
 been-0.061
 a-0.060
 happened-0.057
Token You
Token ID921
Feature activation-6.93e-02

Pos logit contributions
ling+0.005
lest+0.005
leness+0.005
mer+0.004
 forth+0.004
Neg logit contributions
OM-0.005
ena-0.004
 CA-0.004
emon-0.004
 been-0.004
Token are
Token ID389
Feature activation+4.76e-02

Pos logit contributions
leness+0.125
lest+0.123
mer+0.121
 active+0.121
 alike+0.120
Neg logit contributions
OM-0.094
ena-0.074
 happened-0.072
 been-0.071
emon-0.071
Token smart
Token ID4451
Feature activation-2.61e-01

Pos logit contributions
OM+0.056
ena+0.049
 been+0.048
 a+0.045
emon+0.044
Neg logit contributions
leness-0.093
lest-0.093
mer-0.091
ling-0.088
 forth-0.086
Token and
Token ID290
Feature activation-4.49e-02

Pos logit contributions
leness+0.425
lest+0.418
mer+0.408
 active+0.407
 alike+0.399
Neg logit contributions
OM-0.398
ena-0.329
 been-0.321
 happened-0.320
emon-0.315
Token responsible
Token ID4497
Feature activation-3.25e-01

Pos logit contributions
ling+0.077
leness+0.073
bing+0.072
lest+0.072
 forth+0.071
Neg logit contributions
 a-0.068
 been-0.062
ena-0.059
OM-0.057
 happened-0.056
Token.
Token ID13
Feature activation-8.24e-03

Pos logit contributions
leness+0.596
lest+0.588
 active+0.565
mer+0.556
 forth+0.551
Neg logit contributions
OM-0.416
ena-0.354
 been-0.352
 truth-0.338
 happened-0.338
Token You
Token ID921
Feature activation-7.44e-02

Pos logit contributions
ling+0.012
lest+0.012
leness+0.012
mer+0.011
 forth+0.011
Neg logit contributions
OM-0.014
ena-0.013
 CA-0.012
 been-0.012
emon-0.012
Token can
Token ID460
Feature activation-1.78e-02

Pos logit contributions
leness+0.137
lest+0.134
mer+0.133
 active+0.132
ling+0.130
Neg logit contributions
OM-0.097
 been-0.079
ena-0.079
 happened-0.077
emon-0.074
Token wear
Token ID5806
Feature activation-6.08e-02

Pos logit contributions
leness+0.030
lest+0.029
mer+0.029
 active+0.028
ling+0.027
Neg logit contributions
OM-0.027
ena-0.022
 been-0.022
 truth-0.022
 dentist-0.021
Token your
Token ID534
Feature activation-8.07e-02

Pos logit contributions
leness+0.100
lest+0.100
mer+0.098
 active+0.097
 alike+0.095
Neg logit contributions
OM-0.091
ena-0.074
 been-0.072
 happened-0.072
 CA-0.071
Token You
Token ID921
Feature activation-6.77e-02

Pos logit contributions
leness+0.021
lest+0.020
mer+0.020
 active+0.020
 alike+0.019
Neg logit contributions
OM-0.033
ena-0.028
 been-0.027
 happened-0.027
emon-0.027
Token should
Token ID815
Feature activation-7.18e-02

Pos logit contributions
leness+0.103
lest+0.102
mer+0.100
 active+0.099
 alike+0.097
Neg logit contributions
OM-0.111
ena-0.091
 happened-0.088
 been-0.088
emon-0.088
Token be
Token ID307
Feature activation+9.23e-02

Pos logit contributions
leness+0.127
mer+0.127
lest+0.127
 active+0.122
 alike+0.121
Neg logit contributions
OM-0.100
 been-0.080
ena-0.080
 happened-0.077
emon-0.076
Token careful
Token ID8161
Feature activation-2.58e-01

Pos logit contributions
 been+0.150
OM+0.149
 a+0.145
 happened+0.136
ena+0.135
Neg logit contributions
mer-0.130
lest-0.128
leness-0.125
 alike-0.116
ishly-0.113
Token and
Token ID290
Feature activation-5.12e-02

Pos logit contributions
leness+0.479
lest+0.453
mer+0.450
 alike+0.447
 active+0.430
Neg logit contributions
OM-0.343
 been-0.291
ena-0.286
 truth-0.276
 a-0.268
Token responsible
Token ID4497
Feature activation-3.24e-01

Pos logit contributions
mer+0.081
leness+0.080
lest+0.078
ishly+0.077
oc+0.077
Neg logit contributions
 been-0.076
 a-0.072
OM-0.070
 blood-0.063
ena-0.061
Token."
Token ID526
Feature activation+1.52e-02

Pos logit contributions
leness+0.633
lest+0.617
 active+0.593
 energetic+0.588
 efficient+0.582
Neg logit contributions
OM-0.377
ena-0.322
 been-0.322
 truth-0.313
 a-0.310
Token\n
Token ID198
Feature activation+3.85e-02

Pos logit contributions
OM+0.018
ena+0.014
 happened+0.013
 been+0.013
 CA+0.013
Neg logit contributions
leness-0.031
lest-0.030
 active-0.029
mer-0.029
 alike-0.029
Token\n
Token ID198
Feature activation+3.94e-02

Pos logit contributions
OM+0.062
emon+0.049
ena+0.048
 been+0.047
 happened+0.046
Neg logit contributions
leness-0.061
 active-0.059
ling-0.059
lest-0.059
mer-0.059
TokenL
Token ID43
Feature activation+8.50e-02

Pos logit contributions
OM+0.033
ena+0.025
 CA+0.023
 been+0.023
O+0.023
Neg logit contributions
lest-0.089
leness-0.089
 alike-0.088
mer-0.087
 forth-0.086
Tokenily
Token ID813
Feature activation+1.24e-01

Pos logit contributions
OM+0.124
tr+0.106
abella+0.097
 happened+0.095
rett+0.095
Neg logit contributions
ling-0.126
mer-0.123
overs-0.108
ubby-0.108
ging-0.106
Token should
Token ID815
Feature activation-7.69e-02

Pos logit contributions
leness+0.282
lest+0.276
 active+0.271
mer+0.271
 alike+0.270
Neg logit contributions
OM-0.301
ena-0.248
emon-0.239
 CA-0.239
 happened-0.238
Token have
Token ID423
Feature activation-2.89e-02

Pos logit contributions
mer+0.162
lest+0.154
leness+0.150
ling+0.148
 active+0.147
Neg logit contributions
OM-0.091
 been-0.080
 truth-0.071
ena-0.070
 happened-0.069
Token been
Token ID587
Feature activation+1.72e-01

Pos logit contributions
 active+0.034
 forth+0.034
leness+0.034
 alike+0.033
lest+0.032
Neg logit contributions
OM-0.056
ena-0.046
 CA-0.046
emon-0.045
 dentist-0.044
Token kind
Token ID1611
Feature activation-2.44e-01

Pos logit contributions
OM+0.289
ena+0.239
 happened+0.233
 been+0.232
emon+0.231
Neg logit contributions
leness-0.256
lest-0.251
 active-0.247
mer-0.246
 alike-0.242
Token and
Token ID290
Feature activation-6.02e-02

Pos logit contributions
leness+0.439
lest+0.430
 active+0.416
mer+0.408
 alike+0.408
Neg logit contributions
OM-0.329
ena-0.278
 been-0.276
 happened-0.263
 truth-0.262
Token loving
Token ID14442
Feature activation-3.24e-01

Pos logit contributions
leness+0.096
oc+0.096
ling+0.095
mer+0.092
rily+0.090
Neg logit contributions
 a-0.088
 been-0.087
OM-0.086
ena-0.081
 truth-0.075
Token.
Token ID13
Feature activation-2.93e-02

Pos logit contributions
leness+0.540
lest+0.532
 active+0.513
mer+0.503
ling+0.497
Neg logit contributions
OM-0.478
 been-0.410
ena-0.407
 happened-0.390
 truth-0.388
Token We
Token ID775
Feature activation-1.84e-01

Pos logit contributions
ling+0.031
leness+0.031
 forth+0.029
lest+0.029
mer+0.026
Neg logit contributions
OM-0.063
ena-0.058
 CA-0.057
 been-0.056
 happened-0.056
Token love
Token ID1842
Feature activation-1.35e-01

Pos logit contributions
leness+0.290
lest+0.284
mer+0.280
 active+0.279
 alike+0.273
Neg logit contributions
OM-0.292
ena-0.240
 been-0.235
 happened-0.234
emon-0.232
Token you
Token ID345
Feature activation-6.34e-02

Pos logit contributions
leness+0.205
lest+0.202
mer+0.201
 active+0.193
ling+0.193
Neg logit contributions
OM-0.220
 been-0.194
ena-0.193
 happened-0.184
 a-0.183
Token,
Token ID11
Feature activation-1.03e-01

Pos logit contributions
leness+0.123
lest+0.123
mer+0.122
 efficient+0.117
 active+0.117
Neg logit contributions
OM-0.068
 a-0.065
 been-0.065
 truth-0.060
ena-0.057
Token girl
Token ID2576
Feature activation+4.06e-02

Pos logit contributions
OM+0.027
 a+0.026
ara+0.025
 dentist+0.023
 CA+0.023
Neg logit contributions
leness-0.060
lest-0.054
ishly-0.052
bing-0.052
edient-0.052
Token called
Token ID1444
Feature activation+5.49e-02

Pos logit contributions
OM+0.054
ena+0.050
 been+0.047
 happened+0.045
 a+0.045
Neg logit contributions
leness-0.074
lest-0.070
mer-0.068
 alike-0.067
 active-0.066
Token Lucy
Token ID22162
Feature activation+7.27e-02

Pos logit contributions
OM+0.092
 CA+0.074
 happened+0.072
 dentist+0.072
ena+0.071
Neg logit contributions
 alike-0.077
 active-0.076
mer-0.076
lest-0.075
leness-0.074
Token.
Token ID13
Feature activation+4.42e-02

Pos logit contributions
OM+0.119
ena+0.095
emon+0.094
 CA+0.093
 happened+0.093
Neg logit contributions
leness-0.112
lest-0.111
 active-0.110
mer-0.109
 alike-0.108
Token She
Token ID1375
Feature activation-1.04e-01

Pos logit contributions
OM+0.085
ena+0.072
 happened+0.071
emon+0.070
 been+0.070
Neg logit contributions
leness-0.054
lest-0.053
 active-0.053
mer-0.052
 alike-0.051
Token was
Token ID373
Feature activation+2.34e-01

Pos logit contributions
 active+0.126
ling+0.108
 energetic+0.108
mer+0.103
lest+0.101
Neg logit contributions
OM-0.221
emon-0.186
 fault-0.183
ena-0.179
 dentist-0.176
Token only
Token ID691
Feature activation+1.99e-02

Pos logit contributions
OM+0.352
ena+0.276
 happened+0.271
 CA+0.271
emon+0.269
Neg logit contributions
 active-0.389
leness-0.383
lest-0.382
 alike-0.378
mer-0.376
Token three
Token ID1115
Feature activation+8.96e-02

Pos logit contributions
OM+0.031
 CA+0.025
 happened+0.024
ena+0.024
emon+0.024
Neg logit contributions
 active-0.032
lest-0.032
 alike-0.032
mer-0.031
leness-0.031
Token,
Token ID11
Feature activation-5.26e-02

Pos logit contributions
ena+0.119
OM+0.114
 been+0.113
 a+0.108
 happened+0.105
Neg logit contributions
leness-0.144
lest-0.140
mer-0.134
ishly-0.134
 forth-0.133
Token with
Token ID351
Feature activation-8.33e-02

Pos logit contributions
leness+0.064
lest+0.063
mer+0.062
 active+0.062
 alike+0.060
Neg logit contributions
OM-0.102
ena-0.087
 been-0.085
 happened-0.085
emon-0.085
Token curly
Token ID45731
Feature activation-1.28e-01

Pos logit contributions
leness+0.149
mer+0.148
lest+0.147
ling+0.142
 active+0.142
Neg logit contributions
OM-0.113
ena-0.093
 been-0.092
 happened-0.090
emon-0.088
Token beautiful
Token ID4950
Feature activation-1.34e-01

Pos logit contributions
OM+0.306
ena+0.262
 been+0.254
emon+0.252
 happened+0.241
Neg logit contributions
 active-0.273
mer-0.272
lest-0.269
leness-0.268
 alike-0.267
Token sunset
Token ID26428
Feature activation+3.98e-02

Pos logit contributions
mer+0.188
 active+0.187
ling+0.184
 forth+0.183
leness+0.182
Neg logit contributions
OM-0.228
ena-0.198
 been-0.190
emon-0.189
abella-0.179
Token.
Token ID13
Feature activation+3.61e-02

Pos logit contributions
OM+0.064
ena+0.053
 been+0.051
 CA+0.051
emon+0.051
Neg logit contributions
leness-0.061
lest-0.061
mer-0.060
 active-0.059
ling-0.058
Token They
Token ID1119
Feature activation-1.45e-01

Pos logit contributions
OM+0.060
ena+0.050
 been+0.049
emon+0.049
 happened+0.049
Neg logit contributions
leness-0.054
lest-0.053
mer-0.052
 active-0.052
 alike-0.051
Token all
Token ID477
Feature activation-9.01e-04

Pos logit contributions
lest+0.223
leness+0.222
 alike+0.221
 active+0.218
 forth+0.216
Neg logit contributions
OM-0.238
ena-0.191
emon-0.190
 CA-0.185
 dentist-0.183
Token stopped
Token ID5025
Feature activation+1.92e-01

Pos logit contributions
leness+0.001
lest+0.001
 active+0.001
mer+0.001
 alike+0.001
Neg logit contributions
OM-0.001
ena-0.001
emon-0.001
 CA-0.001
 happened-0.001
Token playing
Token ID2712
Feature activation-1.07e-01

Pos logit contributions
OM+0.243
 happened+0.227
 a+0.223
 been+0.222
ena+0.211
Neg logit contributions
leness-0.373
lest-0.354
fy-0.348
mer-0.339
 active-0.326
Token to
Token ID284
Feature activation-1.61e-02

Pos logit contributions
lest+0.189
leness+0.184
mer+0.175
 active+0.174
ling+0.169
Neg logit contributions
OM-0.159
ena-0.134
 happened-0.133
 a-0.133
 been-0.132
Token watch
Token ID2342
Feature activation-8.71e-04

Pos logit contributions
leness+0.021
lest+0.020
 active+0.020
 alike+0.020
 forth+0.020
Neg logit contributions
OM-0.030
ena-0.025
 happened-0.025
emon-0.025
 CA-0.024
Token it
Token ID340
Feature activation+8.48e-02

Pos logit contributions
leness+0.001
lest+0.001
 active+0.001
 alike+0.001
mer+0.001
Neg logit contributions
OM-0.001
ena-0.001
emon-0.001
 been-0.001
 happened-0.001
Token.
Token ID13
Feature activation+3.50e-02

Pos logit contributions
OM+0.133
ena+0.109
 been+0.105
 happened+0.105
emon+0.105
Neg logit contributions
leness-0.135
lest-0.133
 active-0.131
mer-0.131
 alike-0.129
Token upon
Token ID2402
Feature activation+1.16e-01

Pos logit contributions
OM+0.057
ena+0.044
emon+0.044
 CA+0.044
 been+0.042
Neg logit contributions
lest-0.075
leness-0.075
ling-0.074
 alike-0.073
mer-0.073
Token a
Token ID257
Feature activation+1.94e-01

Pos logit contributions
OM+0.236
omm+0.205
anger+0.203
emon+0.191
abella+0.187
Neg logit contributions
 active-0.151
 content-0.144
 energetic-0.141
 happy-0.140
 healthy-0.137
Token time
Token ID640
Feature activation+1.50e-01

Pos logit contributions
OM+0.173
 CA+0.120
emon+0.112
ena+0.109
 happened+0.103
Neg logit contributions
 active-0.446
ling-0.430
leness-0.427
mer-0.426
 energetic-0.425
Token there
Token ID612
Feature activation+1.54e-03

Pos logit contributions
OM+0.318
ena+0.280
emon+0.274
 dentist+0.274
 CA+0.272
Neg logit contributions
lest-0.144
 alike-0.142
mer-0.140
 active-0.140
leness-0.138
Token there
Token ID612
Feature activation+2.60e-03

Pos logit contributions
OM+0.002
emon+0.002
OU+0.002
 dentist+0.002
anya+0.002
Neg logit contributions
mer-0.002
oc-0.002
ling-0.002
 together-0.002
lest-0.002
Token was
Token ID373
Feature activation+2.42e-01

Pos logit contributions
OM+0.004
emon+0.004
OU+0.004
anya+0.003
 dentist+0.003
Neg logit contributions
oc-0.004
mer-0.004
ling-0.003
 together-0.003
lest-0.003
Token a
Token ID257
Feature activation+1.95e-01

Pos logit contributions
OM+0.360
anger+0.241
 CA+0.239
 arose+0.238
abella+0.236
Neg logit contributions
 active-0.469
 energetic-0.443
 content-0.437
 together-0.430
 healthy-0.412
Token fierce
Token ID14800
Feature activation-1.67e-01

Pos logit contributions
OM+0.213
ena+0.168
 dentist+0.165
 CA+0.163
 happened+0.158
Neg logit contributions
leness-0.409
lest-0.388
ishly-0.375
mer-0.373
 forth-0.369
Token big
Token ID1263
Feature activation-8.27e-02

Pos logit contributions
leness+0.273
lest+0.268
mer+0.264
 active+0.264
 alike+0.259
Neg logit contributions
OM-0.253
ena-0.205
 been-0.200
 happened-0.199
emon-0.198
Token cat
Token ID3797
Feature activation+6.43e-02

Pos logit contributions
leness+0.154
lest+0.152
mer+0.143
 alike+0.138
ling+0.136
Neg logit contributions
OM-0.108
 dentist-0.094
 a-0.089
 CA-0.088
ena-0.088
Token called
Token ID1444
Feature activation+5.95e-02

Pos logit contributions
OM+0.096
ena+0.083
 been+0.080
 happened+0.080
 CA+0.077
Neg logit contributions
leness-0.106
lest-0.103
mer-0.100
 active-0.099
 alike-0.099
Token They
Token ID1119
Feature activation-1.73e-01

Pos logit contributions
leness+0.006
lest+0.006
ling+0.005
 active+0.005
 forth+0.005
Neg logit contributions
OM-0.010
ena-0.009
emon-0.008
 happened-0.008
 been-0.008
Token use
Token ID779
Feature activation-1.11e-01

Pos logit contributions
leness+0.336
mer+0.334
lest+0.323
ling+0.322
bing+0.320
Neg logit contributions
OM-0.207
ena-0.202
 a-0.190
 been-0.187
 dentist-0.173
Token them
Token ID606
Feature activation+7.44e-02

Pos logit contributions
leness+0.200
lest+0.197
mer+0.195
 active+0.194
ling+0.190
Neg logit contributions
OM-0.150
ena-0.123
 been-0.120
 happened-0.119
emon-0.116
Token to
Token ID284
Feature activation-4.31e-02

Pos logit contributions
OM+0.090
 a+0.081
 been+0.081
ena+0.080
 happened+0.079
Neg logit contributions
leness-0.146
lest-0.138
mer-0.137
 active-0.135
 energetic-0.132
Token make
Token ID787
Feature activation-5.20e-02

Pos logit contributions
leness+0.072
mer+0.071
lest+0.070
ling+0.070
 active+0.067
Neg logit contributions
OM-0.067
ena-0.059
 been-0.057
 a-0.057
 dentist-0.052
Token a
Token ID257
Feature activation+1.59e-01

Pos logit contributions
leness+0.095
lest+0.094
 active+0.093
mer+0.092
 alike+0.091
Neg logit contributions
OM-0.069
ena-0.054
 been-0.052
 happened-0.052
 CA-0.052
Token robot
Token ID9379
Feature activation-1.25e-02

Pos logit contributions
OM+0.203
 dentist+0.169
 truth+0.159
tr+0.154
 CA+0.153
Neg logit contributions
leness-0.311
lest-0.307
mer-0.301
ishly-0.293
 active-0.289
Token.
Token ID13
Feature activation-5.62e-03

Pos logit contributions
leness+0.024
mer+0.023
 active+0.022
lest+0.022
 energetic+0.021
Neg logit contributions
OM-0.015
 a-0.015
ena-0.014
 dentist-0.013
tr-0.013
Token The
Token ID383
Feature activation-7.86e-02

Pos logit contributions
leness+0.007
lest+0.007
ling+0.007
 forth+0.007
 active+0.007
Neg logit contributions
OM-0.011
ena-0.010
emon-0.009
 CA-0.009
 happened-0.009
Token robot
Token ID9379
Feature activation-1.12e-02

Pos logit contributions
leness+0.120
lest+0.118
mer+0.116
 active+0.116
 alike+0.113
Neg logit contributions
OM-0.129
ena-0.107
 been-0.104
 happened-0.104
emon-0.103
Token has
Token ID468
Feature activation-1.90e-02

Pos logit contributions
leness+0.023
mer+0.023
 active+0.023
lest+0.022
 energetic+0.022
Neg logit contributions
OM-0.011
ena-0.010
 happened-0.010
 been-0.010
 dentist-0.009
Token is
Token ID318
Feature activation+1.61e-01

Pos logit contributions
OM+0.020
ena+0.015
 been+0.014
 happened+0.014
emon+0.014
Neg logit contributions
leness-0.040
lest-0.039
 active-0.039
mer-0.039
 alike-0.038
Token modest
Token ID12949
Feature activation-2.27e-01

Pos logit contributions
OM+0.222
 been+0.187
ena+0.181
 happened+0.181
emon+0.176
Neg logit contributions
leness-0.286
lest-0.278
mer-0.276
 active-0.265
 alike-0.265
Token,
Token ID11
Feature activation-7.97e-02

Pos logit contributions
leness+0.411
lest+0.390
mer+0.375
 alike+0.372
 active+0.371
Neg logit contributions
OM-0.311
 been-0.259
ena-0.252
 happened-0.249
 truth-0.246
Token but
Token ID475
Feature activation-8.31e-02

Pos logit contributions
leness+0.143
lest+0.139
mer+0.139
ling+0.131
 alike+0.129
Neg logit contributions
OM-0.114
 been-0.094
ena-0.093
 a-0.092
 happened-0.091
Token it
Token ID340
Feature activation+7.25e-02

Pos logit contributions
leness+0.160
mer+0.157
lest+0.150
ling+0.150
 active+0.144
Neg logit contributions
OM-0.111
 been-0.100
 a-0.095
 happened-0.095
 truth-0.091
Token is
Token ID318
Feature activation+1.62e-01

Pos logit contributions
OM+0.086
 been+0.067
 happened+0.066
ena+0.066
emon+0.064
Neg logit contributions
leness-0.142
lest-0.138
mer-0.137
 active-0.137
 alike-0.135
Token my
Token ID616
Feature activation+1.29e-02

Pos logit contributions
OM+0.234
ena+0.193
 been+0.192
 happened+0.187
emon+0.186
Neg logit contributions
leness-0.276
lest-0.271
mer-0.268
 active-0.261
 alike-0.258
Token favorite
Token ID4004
Feature activation-4.66e-02

Pos logit contributions
OM+0.020
ena+0.017
 been+0.016
 happened+0.016
emon+0.016
Neg logit contributions
leness-0.021
lest-0.020
mer-0.020
 active-0.020
 alike-0.020
Token."
Token ID526
Feature activation+3.04e-02

Pos logit contributions
lest+0.070
 active+0.070
mer+0.069
leness+0.068
ling+0.067
Neg logit contributions
OM-0.078
ena-0.065
 CA-0.063
emon-0.061
 happened-0.060
Token<|endoftext|>
Token ID50256
Feature activation+7.81e-02

Pos logit contributions
OM+0.044
ena+0.036
 CA+0.035
 been+0.035
 happened+0.035
Neg logit contributions
leness-0.052
mer-0.052
lest-0.052
 active-0.051
ling-0.050
TokenOnce
Token ID7454
Feature activation+3.98e-02

Pos logit contributions
OM+0.161
emon+0.136
ena+0.133
anya+0.129
OU+0.129
Neg logit contributions
mer-0.084
oc-0.080
 together-0.078
ling-0.076
 around-0.070
Token snack
Token ID26906
Feature activation+9.46e-02

Pos logit contributions
OM+0.277
ena+0.240
 been+0.224
emon+0.223
 happened+0.221
Neg logit contributions
lest-0.269
leness-0.269
 active-0.268
mer-0.261
 alike-0.260
Token.
Token ID13
Feature activation+2.47e-02

Pos logit contributions
OM+0.136
ena+0.109
 been+0.107
 happened+0.107
emon+0.105
Neg logit contributions
leness-0.167
lest-0.162
 active-0.158
mer-0.156
 alike-0.154
Token Then
Token ID3244
Feature activation-8.10e-02

Pos logit contributions
OM+0.051
ena+0.046
emon+0.044
 happened+0.043
 been+0.043
Neg logit contributions
 active-0.025
leness-0.024
 energetic-0.023
lest-0.023
 alike-0.023
Token,
Token ID11
Feature activation-6.70e-02

Pos logit contributions
leness+0.128
mer+0.126
lest+0.124
 active+0.121
 alike+0.119
Neg logit contributions
OM-0.133
 happened-0.114
 been-0.111
 a-0.108
ena-0.106
Token he
Token ID339
Feature activation-1.28e-01

Pos logit contributions
 forth+0.054
lest+0.051
 active+0.051
 alike+0.046
leness+0.046
Neg logit contributions
OM-0.155
ena-0.140
emon-0.138
 CA-0.136
 truth-0.133
Token packed
Token ID11856
Feature activation+1.04e-02

Pos logit contributions
ling+0.170
 active+0.167
 forth+0.164
 energetic+0.155
 grate+0.155
Neg logit contributions
OM-0.245
emon-0.197
 fault-0.197
 CA-0.192
 truth-0.190
Token his
Token ID465
Feature activation-8.66e-04

Pos logit contributions
OM+0.023
ena+0.020
 CA+0.019
emon+0.019
OO+0.018
Neg logit contributions
 active-0.010
ling-0.010
 grate-0.010
 content-0.010
 forth-0.010
Token toys
Token ID14958
Feature activation+1.32e-02

Pos logit contributions
leness+0.001
lest+0.001
 active+0.001
mer+0.001
 alike+0.001
Neg logit contributions
OM-0.002
ena-0.001
emon-0.001
 happened-0.001
 been-0.001
Token back
Token ID736
Feature activation-3.28e-02

Pos logit contributions
OM+0.019
 been+0.016
ena+0.016
 happened+0.015
 a+0.015
Neg logit contributions
leness-0.023
lest-0.022
 active-0.021
mer-0.021
 alike-0.021
Token into
Token ID656
Feature activation-1.73e-02

Pos logit contributions
leness+0.048
lest+0.047
mer+0.046
 active+0.046
 alike+0.045
Neg logit contributions
OM-0.056
ena-0.047
 been-0.046
 happened-0.045
emon-0.045
Token the
Token ID262
Feature activation+2.18e-02

Pos logit contributions
leness+0.027
lest+0.026
 active+0.026
mer+0.025
 alike+0.025
Neg logit contributions
OM-0.028
ena-0.023
 happened-0.023
emon-0.022
 CA-0.022
Token snapping
Token ID35733
Feature activation-6.26e-02

Pos logit contributions
leness+0.003
lest+0.003
 active+0.003
mer+0.003
 alike+0.003
Neg logit contributions
OM-0.002
 happened-0.002
 been-0.002
ena-0.002
 truth-0.002
Token in
Token ID287
Feature activation-6.32e-02

Pos logit contributions
leness+0.104
lest+0.103
 active+0.098
mer+0.098
 forth+0.095
Neg logit contributions
OM-0.087
 happened-0.078
 a-0.076
 been-0.076
ena-0.076
Token the
Token ID262
Feature activation+2.77e-02

Pos logit contributions
lest+0.116
leness+0.115
mer+0.112
 active+0.110
ling+0.110
Neg logit contributions
OM-0.082
ena-0.068
 been-0.068
 happened-0.066
 truth-0.065
Token wind
Token ID2344
Feature activation+2.15e-02

Pos logit contributions
OM+0.033
 been+0.026
 truth+0.025
ena+0.025
 CA+0.025
Neg logit contributions
lest-0.054
leness-0.053
mer-0.052
 alike-0.051
 active-0.051
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
OM+0.027
 been+0.022
 happened+0.022
ena+0.021
 a+0.020
Neg logit contributions
leness-0.041
 active-0.040
lest-0.039
 efficient-0.038
mer-0.038
Token<|endoftext|>
Token ID50256
Feature activation+8.01e-02

Pos logit contributions
OM+0.022
ena+0.018
 happened+0.018
emon+0.017
 been+0.017
Neg logit contributions
leness-0.019
lest-0.018
 alike-0.018
 active-0.018
mer-0.018
TokenOnce
Token ID7454
Feature activation+4.81e-02

Pos logit contributions
OM+0.137
emon+0.115
ena+0.115
anya+0.111
 dentist+0.111
Neg logit contributions
mer-0.118
ling-0.107
 together-0.105
 forward-0.102
 alike-0.101
Token upon
Token ID2402
Feature activation+1.21e-01

Pos logit contributions
OM+0.068
emon+0.053
ena+0.053
 CA+0.052
 been+0.050
Neg logit contributions
lest-0.084
leness-0.083
 alike-0.082
mer-0.082
ling-0.082
Token a
Token ID257
Feature activation+1.95e-01

Pos logit contributions
OM+0.222
omm+0.198
anger+0.197
emon+0.182
abella+0.180
Neg logit contributions
 active-0.168
 content-0.165
 happy-0.163
 energetic-0.160
 healthy-0.158
Token time
Token ID640
Feature activation+1.49e-01

Pos logit contributions
OM+0.197
 CA+0.152
emon+0.141
ena+0.136
anya+0.135
Neg logit contributions
 active-0.424
mer-0.403
ling-0.401
 energetic-0.401
 alike-0.397
Token there
Token ID612
Feature activation+8.18e-03

Pos logit contributions
OM+0.344
 dentist+0.306
ena+0.305
anya+0.302
 CA+0.298
Neg logit contributions
 alike-0.115
lest-0.109
mer-0.108
 active-0.104
leness-0.100
Token join
Token ID4654
Feature activation-3.69e-02

Pos logit contributions
lest+0.033
mer+0.032
leness+0.032
ling+0.031
 active+0.030
Neg logit contributions
OM-0.028
ena-0.024
 been-0.024
 a-0.023
 truth-0.022
Token in
Token ID287
Feature activation-6.18e-02

Pos logit contributions
lest+0.070
leness+0.070
fy+0.064
 active+0.062
mer+0.062
Neg logit contributions
 a-0.045
 been-0.041
OM-0.041
ena-0.040
 happened-0.040
Token with
Token ID351
Feature activation-9.29e-02

Pos logit contributions
lest+0.134
leness+0.127
ling+0.120
 active+0.118
mer+0.117
Neg logit contributions
 a-0.070
 been-0.061
 time-0.058
OM-0.057
ena-0.057
Token.
Token ID13
Feature activation+1.84e-02

Pos logit contributions
lest+0.172
leness+0.169
 active+0.163
 forth+0.162
mer+0.162
Neg logit contributions
OM-0.110
 a-0.110
ena-0.105
 been-0.104
 happened-0.101
Token␣
Token ID220
Feature activation+4.94e-02

Pos logit contributions
OM+0.023
ena+0.019
 been+0.019
 a+0.018
 CA+0.018
Neg logit contributions
lest-0.035
 active-0.034
leness-0.034
ling-0.034
mer-0.033
Token\n
Token ID198
Feature activation+6.60e-02

Pos logit contributions
OM+0.073
emon+0.058
 happened+0.058
ena+0.057
 CA+0.057
Neg logit contributions
leness-0.078
 alike-0.078
lest-0.077
 active-0.076
mer-0.076
Token\n
Token ID198
Feature activation+6.58e-02

Pos logit contributions
OM+0.090
ena+0.069
 happened+0.069
emon+0.068
 CA+0.067
Neg logit contributions
leness-0.119
lest-0.117
 active-0.115
mer-0.115
 alike-0.114
TokenFinally
Token ID11158
Feature activation-1.52e-02

Pos logit contributions
OM+0.099
ena+0.081
 been+0.079
emon+0.078
 CA+0.077
Neg logit contributions
leness-0.109
lest-0.107
mer-0.106
 alike-0.104
 active-0.104
Token,
Token ID11
Feature activation-6.47e-02

Pos logit contributions
leness+0.026
lest+0.026
 active+0.025
mer+0.025
ling+0.024
Neg logit contributions
OM-0.022
 happened-0.020
 a-0.020
 been-0.019
ena-0.018
Token they
Token ID484
Feature activation-1.58e-01

Pos logit contributions
leness+0.110
lest+0.101
 active+0.100
ling+0.097
 alike+0.096
Neg logit contributions
OM-0.104
 a-0.092
 been-0.087
 happened-0.086
ena-0.083
Token agreed
Token ID4987
Feature activation-1.63e-02

Pos logit contributions
leness+0.268
lest+0.264
mer+0.261
 active+0.260
 alike+0.257
Neg logit contributions
OM-0.231
ena-0.184
 happened-0.178
emon-0.178
 been-0.178
Token two
Token ID734
Feature activation+1.22e-01

Pos logit contributions
OM+0.006
ena+0.005
 CA+0.005
emon+0.005
 dentist+0.005
Neg logit contributions
lest-0.003
leness-0.003
 active-0.003
mer-0.003
 alike-0.003
Token friends
Token ID2460
Feature activation-4.54e-02

Pos logit contributions
OM+0.146
 been+0.134
 happened+0.131
ena+0.125
 a+0.124
Neg logit contributions
leness-0.229
mer-0.221
lest-0.216
 active-0.211
 alike-0.206
Token,
Token ID11
Feature activation-5.78e-02

Pos logit contributions
leness+0.076
lest+0.074
mer+0.073
 active+0.073
 alike+0.071
Neg logit contributions
OM-0.067
ena-0.055
 happened-0.055
 been-0.055
emon-0.053
Token Lucy
Token ID22162
Feature activation+6.98e-02

Pos logit contributions
mer+0.100
 active+0.100
 alike+0.099
lest+0.098
 energetic+0.094
Neg logit contributions
OM-0.085
 dentist-0.064
 CA-0.063
 truth-0.061
emon-0.061
Token and
Token ID290
Feature activation-1.40e-02

Pos logit contributions
OM+0.106
ena+0.090
 been+0.089
 happened+0.088
emon+0.085
Neg logit contributions
leness-0.113
lest-0.110
 active-0.108
mer-0.108
 alike-0.106
Token Toy
Token ID10977
Feature activation+4.66e-02

Pos logit contributions
leness+0.019
lest+0.018
 active+0.018
mer+0.018
 alike+0.017
Neg logit contributions
OM-0.026
ena-0.022
 been-0.021
 happened-0.021
emon-0.021
Tokena
Token ID64
Feature activation+1.23e-01

Pos logit contributions
OM+0.077
ena+0.064
 been+0.062
 happened+0.062
emon+0.062
Neg logit contributions
leness-0.070
lest-0.069
mer-0.068
 active-0.067
 alike-0.066
Token,
Token ID11
Feature activation-5.96e-02

Pos logit contributions
OM+0.213
ena+0.175
 CA+0.171
emon+0.170
 happened+0.169
Neg logit contributions
leness-0.176
lest-0.172
mer-0.170
 active-0.169
 alike-0.168
Token were
Token ID547
Feature activation+1.17e-01

Pos logit contributions
 alike+0.087
 active+0.083
lest+0.082
mer+0.082
leness+0.081
Neg logit contributions
OM-0.103
emon-0.082
 CA-0.079
ena-0.078
 dentist-0.078
Token together
Token ID1978
Feature activation-2.03e-01

Pos logit contributions
OM+0.181
 been+0.158
ena+0.158
 happened+0.152
emon+0.148
Neg logit contributions
leness-0.188
lest-0.183
mer-0.179
 active-0.170
ling-0.167
Token working
Token ID1762
Feature activation-1.42e-01

Pos logit contributions
leness+0.365
lest+0.353
mer+0.337
 active+0.335
 forth+0.326
Neg logit contributions
OM-0.275
 been-0.240
ena-0.239
 happened-0.237
 a-0.233
Token was
Token ID373
Feature activation+2.21e-01

Pos logit contributions
OM+0.042
ena+0.028
emon+0.026
 CA+0.025
 been+0.024
Neg logit contributions
leness-0.115
lest-0.114
mer-0.114
 active-0.113
 alike-0.112
Token magical
Token ID10883
Feature activation-1.02e-01

Pos logit contributions
OM+0.324
ena+0.254
 CA+0.246
emon+0.245
 happened+0.237
Neg logit contributions
leness-0.378
 active-0.376
lest-0.372
 alike-0.367
mer-0.366
Token!
Token ID0
Feature activation+4.78e-02

Pos logit contributions
leness+0.176
lest+0.171
 active+0.165
mer+0.164
 alike+0.163
Neg logit contributions
OM-0.141
 happened-0.122
 been-0.120
ena-0.118
 truth-0.114
Token She
Token ID1375
Feature activation-1.22e-01

Pos logit contributions
OM+0.088
ena+0.075
 been+0.073
 happened+0.073
emon+0.073
Neg logit contributions
leness-0.063
lest-0.061
 active-0.060
mer-0.060
 alike-0.059
Token put
Token ID1234
Feature activation+3.90e-02

Pos logit contributions
leness+0.218
lest+0.215
mer+0.211
 active+0.211
 alike+0.208
Neg logit contributions
OM-0.169
ena-0.134
 been-0.129
 happened-0.129
emon-0.129
Token it
Token ID340
Feature activation+8.04e-02

Pos logit contributions
OM+0.055
ena+0.045
 been+0.044
 happened+0.044
 CA+0.043
Neg logit contributions
leness-0.068
lest-0.067
mer-0.066
 active-0.066
 alike-0.064
Token on
Token ID319
Feature activation-1.05e-01

Pos logit contributions
OM+0.120
 been+0.106
 a+0.105
 happened+0.100
ena+0.098
Neg logit contributions
leness-0.139
lest-0.128
mer-0.127
oc-0.120
 alike-0.120
Token and
Token ID290
Feature activation-2.29e-02

Pos logit contributions
leness+0.184
lest+0.183
mer+0.178
 active+0.176
 alike+0.173
Neg logit contributions
OM-0.149
ena-0.121
 been-0.118
 happened-0.116
 CA-0.112
Token was
Token ID373
Feature activation+2.20e-01

Pos logit contributions
leness+0.045
mer+0.043
lest+0.042
 active+0.042
 alike+0.042
Neg logit contributions
OM-0.028
 happened-0.024
 been-0.023
ena-0.023
 a-0.023
Token filled
Token ID5901
Feature activation-8.74e-02

Pos logit contributions
OM+0.359
ena+0.305
 been+0.300
 happened+0.295
emon+0.289
Neg logit contributions
leness-0.336
lest-0.329
mer-0.325
 active-0.316
 alike-0.312
Token with
Token ID351
Feature activation-9.75e-02

Pos logit contributions
leness+0.121
lest+0.119
mer+0.111
 alike+0.110
 active+0.110
Neg logit contributions
OM-0.157
ena-0.136
 been-0.134
 happened-0.131
 a-0.129
Token followed
Token ID3940
Feature activation-3.35e-03

Pos logit contributions
leness+0.321
lest+0.310
 active+0.301
mer+0.297
 alike+0.289
Neg logit contributions
OM-0.196
 been-0.162
ena-0.159
 happened-0.154
emon-0.143
Token their
Token ID511
Feature activation-1.02e-01

Pos logit contributions
leness+0.007
lest+0.007
mer+0.006
 fortunate+0.006
 knowledgeable+0.006
Neg logit contributions
 a-0.004
OM-0.003
 an-0.003
 something-0.003
ena-0.003
Token plan
Token ID1410
Feature activation-2.50e-02

Pos logit contributions
leness+0.198
lest+0.184
mer+0.174
 alike+0.168
ling+0.167
Neg logit contributions
OM-0.129
 been-0.114
 truth-0.108
 a-0.104
 time-0.103
Token.
Token ID13
Feature activation+2.02e-02

Pos logit contributions
lest+0.044
leness+0.043
mer+0.042
 active+0.041
ling+0.040
Neg logit contributions
OM-0.034
 been-0.031
 a-0.030
ena-0.030
 happened-0.029
Token Whenever
Token ID21326
Feature activation-3.69e-02

Pos logit contributions
OM+0.041
ena+0.035
 been+0.035
 CA+0.035
emon+0.035
Neg logit contributions
leness-0.024
lest-0.023
ling-0.023
 forth-0.022
mer-0.022
Token they
Token ID484
Feature activation-1.62e-01

Pos logit contributions
leness+0.050
lest+0.048
 active+0.047
ling+0.046
mer+0.046
Neg logit contributions
OM-0.066
 happened-0.061
 been-0.059
 a-0.057
ena-0.056
Token did
Token ID750
Feature activation+9.07e-02

Pos logit contributions
leness+0.311
lest+0.304
 active+0.299
mer+0.294
 alike+0.291
Neg logit contributions
OM-0.201
 been-0.160
 happened-0.160
ena-0.158
 CA-0.151
Token,
Token ID11
Feature activation-6.51e-02

Pos logit contributions
OM+0.128
 happened+0.119
 been+0.118
 a+0.116
 something+0.106
Neg logit contributions
mer-0.154
lest-0.149
leness-0.146
 active-0.140
 forth-0.139
Token they
Token ID484
Feature activation-1.63e-01

Pos logit contributions
leness+0.105
lest+0.096
fy+0.095
mer+0.094
ling+0.092
Neg logit contributions
 a-0.101
OM-0.100
 been-0.094
 happened-0.094
 something-0.091
Token were
Token ID547
Feature activation+1.14e-01

Pos logit contributions
leness+0.338
lest+0.325
 active+0.314
mer+0.311
bing+0.308
Neg logit contributions
OM-0.173
 been-0.153
ena-0.148
 happened-0.145
 a-0.132
Token always
Token ID1464
Feature activation-6.95e-02

Pos logit contributions
OM+0.162
 been+0.152
ena+0.151
 a+0.145
 happened+0.141
Neg logit contributions
leness-0.189
lest-0.186
mer-0.179
oc-0.170
 forth-0.167
Token a
Token ID257
Feature activation+1.61e-01

Pos logit contributions
OM+0.023
emon+0.014
 CA+0.014
ena+0.014
 been+0.014
Neg logit contributions
lest-0.043
 alike-0.043
leness-0.042
 active-0.042
 forth-0.042
Token big
Token ID1263
Feature activation-1.15e-01

Pos logit contributions
OM+0.227
ena+0.182
 been+0.175
 happened+0.175
emon+0.174
Neg logit contributions
leness-0.280
lest-0.276
 active-0.271
mer-0.271
 alike-0.267
Token t
Token ID256
Feature activation+3.23e-02

Pos logit contributions
leness+0.206
lest+0.203
mer+0.199
 active+0.199
 alike+0.195
Neg logit contributions
OM-0.159
ena-0.126
 happened-0.123
 been-0.123
 CA-0.122
Tokeneddy
Token ID21874
Feature activation-4.65e-02

Pos logit contributions
OM+0.063
 happened+0.055
emon+0.055
 been+0.055
 CA+0.054
Neg logit contributions
mer-0.037
lest-0.037
ling-0.035
leness-0.034
 alike-0.034
Token bear
Token ID6842
Feature activation+5.25e-02

Pos logit contributions
leness+0.077
lest+0.075
 active+0.074
mer+0.073
 alike+0.072
Neg logit contributions
OM-0.070
ena-0.059
 happened-0.057
 been-0.057
 CA-0.056
Token and
Token ID290
Feature activation-3.93e-02

Pos logit contributions
OM+0.068
ena+0.060
 been+0.058
 happened+0.057
emon+0.056
Neg logit contributions
leness-0.100
lest-0.095
 active-0.095
 energetic-0.091
mer-0.090
Token a
Token ID257
Feature activation+1.61e-01

Pos logit contributions
leness+0.102
lest+0.100
mer+0.100
 active+0.099
 alike+0.098
Neg logit contributions
OM-0.023
ena-0.012
 been-0.010
 happened-0.010
emon-0.010
Token book
Token ID1492
Feature activation+5.55e-02

Pos logit contributions
OM+0.203
 happened+0.158
ena+0.157
 CA+0.157
 been+0.156
Neg logit contributions
leness-0.306
mer-0.297
lest-0.297
 active-0.293
ling-0.288
Token.
Token ID13
Feature activation+3.70e-04

Pos logit contributions
OM+0.081
ena+0.068
 been+0.065
 happened+0.065
emon+0.064
Neg logit contributions
leness-0.095
lest-0.093
 active-0.091
mer-0.090
 alike-0.088
Token They
Token ID1119
Feature activation-1.68e-01

Pos logit contributions
OM+0.001
ena+0.001
 CA+0.001
 happened+0.001
 a+0.001
Neg logit contributions
leness-0.001
lest-0.001
ling-0.000
 forth-0.000
 active-0.000
Token hug
Token ID16225
Feature activation-2.49e-02

Pos logit contributions
leness+0.326
mer+0.325
lest+0.316
 active+0.313
ling+0.309
Neg logit contributions
OM-0.202
ena-0.180
 been-0.172
 a-0.169
 dentist-0.156
Token pretend
Token ID16614
Feature activation-5.83e-02

Pos logit contributions
leness+0.330
izing+0.316
mer+0.316
ized+0.315
ling+0.313
Neg logit contributions
 been-0.176
ena-0.173
OM-0.162
 a-0.162
 happened-0.156
Token to
Token ID284
Feature activation-3.40e-02

Pos logit contributions
leness+0.086
lest+0.084
mer+0.082
 active+0.082
 alike+0.080
Neg logit contributions
OM-0.098
ena-0.083
 happened-0.083
 been-0.082
 CA-0.080
Token be
Token ID307
Feature activation+9.78e-02

Pos logit contributions
 active+0.053
leness+0.051
lest+0.051
mer+0.050
 forth+0.050
Neg logit contributions
OM-0.052
 happened-0.043
 CA-0.040
OU-0.040
emon-0.040
Token animals
Token ID4695
Feature activation+4.10e-02

Pos logit contributions
OM+0.113
ena+0.085
 been+0.082
 happened+0.081
emon+0.080
Neg logit contributions
leness-0.197
lest-0.194
mer-0.192
 active-0.191
 alike-0.189
Token,
Token ID11
Feature activation-7.80e-02

Pos logit contributions
 a+0.039
ena+0.034
 been+0.033
OM+0.032
 happened+0.032
Neg logit contributions
leness-0.091
lest-0.088
 active-0.082
 forth-0.082
 efficient-0.080
Token or
Token ID393
Feature activation-3.26e-02

Pos logit contributions
 active+0.112
 alike+0.110
lest+0.109
leness+0.107
 forth+0.105
Neg logit contributions
OM-0.132
ena-0.103
 CA-0.102
emon-0.102
 happened-0.101
Token pirates
Token ID27516
Feature activation+9.59e-03

Pos logit contributions
leness+0.059
mer+0.058
lest+0.057
ling+0.056
 active+0.055
Neg logit contributions
OM-0.042
 been-0.037
 happened-0.037
ena-0.036
 a-0.035
Token,
Token ID11
Feature activation-7.92e-02

Pos logit contributions
OM+0.011
ena+0.010
 happened+0.010
 been+0.009
 a+0.009
Neg logit contributions
leness-0.019
lest-0.018
 active-0.018
 energetic-0.017
 efficient-0.017
Token or
Token ID393
Feature activation-3.35e-02

Pos logit contributions
leness+0.134
lest+0.131
mer+0.129
 active+0.128
 alike+0.126
Neg logit contributions
OM-0.117
ena-0.095
 been-0.093
 happened-0.092
emon-0.091
Token superheroes
Token ID40896
Feature activation-9.36e-02

Pos logit contributions
leness+0.061
mer+0.060
lest+0.058
ling+0.057
 active+0.056
Neg logit contributions
OM-0.044
 been-0.039
 happened-0.038
 a-0.038
ena-0.037
Token.
Token ID13
Feature activation+2.12e-02

Pos logit contributions
lest+0.183
 active+0.181
leness+0.181
 energetic+0.176
 forth+0.176
Neg logit contributions
OM-0.102
ena-0.094
 been-0.088
 happened-0.087
 a-0.085
Token!
Token ID0
Feature activation+6.10e-02

Pos logit contributions
OM+0.072
 been+0.060
 happened+0.059
 a+0.057
ena+0.056
Neg logit contributions
leness-0.100
lest-0.095
 active-0.093
mer-0.091
 alike-0.090
Token She
Token ID1375
Feature activation-1.14e-01

Pos logit contributions
OM+0.107
ena+0.089
 CA+0.088
 happened+0.087
 been+0.087
Neg logit contributions
leness-0.089
lest-0.087
mer-0.085
ling-0.082
 active-0.082
Token could
Token ID714
Feature activation+7.09e-03

Pos logit contributions
lest+0.174
 active+0.173
leness+0.173
mer+0.168
 alike+0.167
Neg logit contributions
OM-0.188
ena-0.151
emon-0.148
 CA-0.146
 happened-0.145
Token now
Token ID783
Feature activation+3.71e-02

Pos logit contributions
OM+0.009
 a+0.007
 been+0.007
ena+0.007
 happened+0.007
Neg logit contributions
leness-0.015
mer-0.014
lest-0.013
ling-0.013
 alike-0.013
Token run
Token ID1057
Feature activation-3.22e-02

Pos logit contributions
OM+0.049
 been+0.044
 a+0.042
 happened+0.041
ena+0.040
Neg logit contributions
leness-0.068
lest-0.067
mer-0.065
 alike-0.063
ling-0.063
Token and
Token ID290
Feature activation-1.97e-02

Pos logit contributions
leness+0.038
lest+0.037
mer+0.036
 active+0.036
 alike+0.035
Neg logit contributions
OM-0.064
ena-0.055
 been-0.054
 happened-0.054
emon-0.053
Token play
Token ID711
Feature activation-2.39e-02

Pos logit contributions
leness+0.041
mer+0.039
ling+0.039
lest+0.038
 alike+0.037
Neg logit contributions
OM-0.022
 a-0.021
 been-0.021
 happened-0.017
 was-0.017
Token without
Token ID1231
Feature activation-3.25e-02

Pos logit contributions
lest+0.037
leness+0.037
 alike+0.033
mer+0.033
 active+0.032
Neg logit contributions
OM-0.040
 a-0.034
 been-0.034
ena-0.034
 happened-0.033
Token worrying
Token ID18916
Feature activation-1.34e-01

Pos logit contributions
leness+0.051
lest+0.050
 active+0.050
mer+0.049
 alike+0.049
Neg logit contributions
OM-0.052
ena-0.043
emon-0.042
 CA-0.041
 happened-0.041
Token about
Token ID546
Feature activation-1.84e-02

Pos logit contributions
leness+0.216
 active+0.213
ling+0.212
lest+0.211
 alike+0.211
Neg logit contributions
OM-0.206
 CA-0.164
ena-0.164
emon-0.163
 been-0.156
Token going
Token ID1016
Feature activation-8.12e-02

Pos logit contributions
leness+0.029
lest+0.028
 active+0.028
mer+0.028
 alike+0.028
Neg logit contributions
OM-0.029
ena-0.024
emon-0.023
 CA-0.023
 happened-0.023
Token explore
Token ID7301
Feature activation-6.87e-02

Pos logit contributions
leness+0.024
lest+0.024
 active+0.024
mer+0.023
 alike+0.023
Neg logit contributions
OM-0.033
ena-0.028
 happened-0.027
emon-0.027
 been-0.027
Token the
Token ID262
Feature activation+3.39e-02

Pos logit contributions
leness+0.116
lest+0.115
mer+0.113
 active+0.111
 alike+0.109
Neg logit contributions
OM-0.100
ena-0.081
 been-0.081
 happened-0.080
 CA-0.078
Token island
Token ID7022
Feature activation+1.31e-01

Pos logit contributions
OM+0.048
 been+0.040
 truth+0.039
 happened+0.038
 CA+0.038
Neg logit contributions
leness-0.060
lest-0.058
ishly-0.056
 active-0.055
 forth-0.054
Token,
Token ID11
Feature activation-5.78e-02

Pos logit contributions
 a+0.137
OM+0.134
 been+0.125
 something+0.116
 was+0.115
Neg logit contributions
leness-0.278
lest-0.267
 active-0.243
reci-0.240
 alike-0.239
Token but
Token ID475
Feature activation-5.88e-02

Pos logit contributions
leness+0.100
mer+0.099
lest+0.098
ling+0.092
 alike+0.089
Neg logit contributions
OM-0.084
 a-0.075
 been-0.075
 truth-0.068
ena-0.068
Token he
Token ID339
Feature activation-1.22e-01

Pos logit contributions
leness+0.090
lest+0.085
mer+0.083
 active+0.081
 alike+0.081
Neg logit contributions
OM-0.101
 been-0.087
 happened-0.086
ena-0.084
 a-0.083
Token was
Token ID373
Feature activation+2.26e-01

Pos logit contributions
mer+0.175
ling+0.168
 forth+0.164
 active+0.164
lest+0.158
Neg logit contributions
OM-0.211
emon-0.179
 CA-0.179
OU-0.169
 Ow-0.168
Token fearful
Token ID26844
Feature activation-1.99e-01

Pos logit contributions
OM+0.409
ena+0.340
emon+0.332
 CA+0.331
 happened+0.331
Neg logit contributions
leness-0.308
 active-0.304
lest-0.302
mer-0.299
 alike-0.294
Token.
Token ID13
Feature activation+3.37e-02

Pos logit contributions
leness+0.429
lest+0.401
mer+0.389
ishly+0.389
cient+0.376
Neg logit contributions
OM-0.198
 a-0.194
 happened-0.194
 been-0.184
 truth-0.174
Token\n
Token ID198
Feature activation+7.79e-02

Pos logit contributions
OM+0.062
emon+0.055
ena+0.054
 CA+0.053
 happened+0.052
Neg logit contributions
 active-0.040
mer-0.040
 alike-0.039
lest-0.038
leness-0.038
Token\n
Token ID198
Feature activation+7.74e-02

Pos logit contributions
OM+0.140
emon+0.126
 CA+0.121
ena+0.114
 happened+0.114
Neg logit contributions
 active-0.094
oc-0.094
mer-0.092
ling-0.091
leness-0.090
Token They
Token ID1119
Feature activation-1.75e-01

Pos logit contributions
leness+0.007
lest+0.007
mer+0.007
 active+0.007
ling+0.006
Neg logit contributions
OM-0.015
ena-0.013
 been-0.013
 happened-0.013
 CA-0.013
Token also
Token ID635
Feature activation-2.22e-02

Pos logit contributions
leness+0.255
lest+0.252
 alike+0.249
 active+0.249
mer+0.246
Neg logit contributions
OM-0.297
ena-0.236
 CA-0.236
emon-0.234
 happened-0.231
Token said
Token ID531
Feature activation+2.04e-02

Pos logit contributions
leness+0.033
lest+0.032
 active+0.032
 alike+0.032
mer+0.032
Neg logit contributions
OM-0.038
ena-0.030
 CA-0.029
emon-0.029
 happened-0.029
Token they
Token ID484
Feature activation-1.82e-01

Pos logit contributions
 a+0.035
 happened+0.034
OM+0.033
 truth+0.032
emon+0.031
Neg logit contributions
mer-0.031
leness-0.029
lest-0.028
 forth-0.028
 active-0.027
Token were
Token ID547
Feature activation+1.00e-01

Pos logit contributions
leness+0.310
lest+0.305
mer+0.300
 active+0.300
 alike+0.295
Neg logit contributions
OM-0.267
ena-0.215
 been-0.209
 happened-0.209
emon-0.207
Token sorry
Token ID7926
Feature activation-2.75e-01

Pos logit contributions
 a+0.170
 been+0.163
ena+0.161
ily+0.156
OM+0.156
Neg logit contributions
lest-0.148
leness-0.145
mer-0.144
oc-0.137
 forth-0.125
Token for
Token ID329
Feature activation-1.32e-01

Pos logit contributions
leness+0.552
mer+0.540
lest+0.517
 active+0.494
cient+0.481
Neg logit contributions
OM-0.331
 a-0.314
 happened-0.301
 been-0.297
 truth-0.295
Token the
Token ID262
Feature activation+8.96e-03

Pos logit contributions
lest+0.254
mer+0.251
leness+0.247
 active+0.237
ling+0.236
Neg logit contributions
OM-0.172
 been-0.156
 a-0.153
ena-0.150
 happened-0.145
Token lion
Token ID18744
Feature activation+5.66e-02

Pos logit contributions
OM+0.014
ena+0.012
emon+0.011
 CA+0.011
 been+0.011
Neg logit contributions
leness-0.014
lest-0.014
 active-0.014
mer-0.014
 alike-0.013
Token and
Token ID290
Feature activation-4.69e-02

Pos logit contributions
OM+0.064
 been+0.061
 happened+0.061
ena+0.058
 a+0.056
Neg logit contributions
leness-0.109
 active-0.103
lest-0.102
 forth-0.100
mer-0.099
Token hoped
Token ID10719
Feature activation-4.23e-02

Pos logit contributions
leness+0.085
mer+0.084
ling+0.083
lest+0.081
bing+0.080
Neg logit contributions
 been-0.061
 a-0.060
 happened-0.058
OM-0.058
ena-0.056
Token table
Token ID3084
Feature activation+3.40e-02

Pos logit contributions
OM+0.009
 truth+0.007
 been+0.007
ena+0.007
 dentist+0.007
Neg logit contributions
leness-0.014
mer-0.014
lest-0.014
 active-0.013
 alike-0.013
Token.
Token ID13
Feature activation-2.74e-03

Pos logit contributions
OM+0.047
ena+0.038
 been+0.037
 happened+0.036
emon+0.036
Neg logit contributions
leness-0.061
lest-0.060
 active-0.059
mer-0.058
 forth-0.057
Token The
Token ID383
Feature activation-8.48e-02

Pos logit contributions
leness+0.005
lest+0.005
mer+0.005
 forth+0.005
ling+0.004
Neg logit contributions
OM-0.004
ena-0.003
 happened-0.003
 CA-0.003
 been-0.003
Token table
Token ID3084
Feature activation+3.41e-02

Pos logit contributions
leness+0.147
lest+0.142
mer+0.140
 active+0.136
 alike+0.135
Neg logit contributions
OM-0.127
 been-0.103
 happened-0.102
 truth-0.100
ena-0.099
Token was
Token ID373
Feature activation+2.06e-01

Pos logit contributions
OM+0.032
ena+0.022
 CA+0.022
 been+0.021
emon+0.021
Neg logit contributions
lest-0.074
leness-0.074
mer-0.073
 active-0.073
ling-0.072
Token strong
Token ID1913
Feature activation-2.41e-01

Pos logit contributions
OM+0.329
ena+0.275
 been+0.274
 happened+0.269
emon+0.260
Neg logit contributions
leness-0.332
lest-0.324
mer-0.317
 active-0.302
 alike-0.302
Token and
Token ID290
Feature activation-4.21e-02

Pos logit contributions
leness+0.546
lest+0.534
cient+0.513
mer+0.502
 active+0.499
Neg logit contributions
OM-0.206
 a-0.205
ena-0.183
 been-0.183
 accident-0.173
Token could
Token ID714
Feature activation-6.14e-03

Pos logit contributions
leness+0.082
lest+0.077
mer+0.076
ishly+0.075
oc+0.075
Neg logit contributions
 a-0.053
 been-0.052
OM-0.051
 happened-0.050
 was-0.045
Token protect
Token ID1805
Feature activation-1.27e-01

Pos logit contributions
leness+0.013
lest+0.013
mer+0.012
 active+0.012
 alike+0.012
Neg logit contributions
OM-0.007
ena-0.005
 been-0.005
 happened-0.005
 CA-0.005
Token them
Token ID606
Feature activation+7.68e-02

Pos logit contributions
leness+0.182
lest+0.179
mer+0.178
 active+0.177
 alike+0.175
Neg logit contributions
OM-0.217
ena-0.177
 CA-0.175
 happened-0.174
emon-0.174
Token from
Token ID422
Feature activation-6.12e-02

Pos logit contributions
OM+0.069
 a+0.069
 been+0.061
ena+0.059
 truth+0.058
Neg logit contributions
leness-0.170
mer-0.160
lest-0.156
 active-0.153
 alike-0.151
Token,
Token ID11
Feature activation-6.20e-02

Pos logit contributions
lest+0.002
leness+0.002
mer+0.002
 active+0.002
 alike+0.002
Neg logit contributions
OM-0.004
ena-0.004
 dentist-0.004
 CA-0.004
emon-0.004
Token a
Token ID257
Feature activation+1.85e-01

Pos logit contributions
leness+0.139
lest+0.137
mer+0.136
 active+0.135
 alike+0.134
Neg logit contributions
OM-0.058
ena-0.040
 been-0.038
 happened-0.038
emon-0.037
Token small
Token ID1402
Feature activation-1.41e-01

Pos logit contributions
OM+0.234
 happened+0.188
 CA+0.183
ena+0.181
 dentist+0.179
Neg logit contributions
leness-0.351
lest-0.337
 forth-0.326
mer-0.324
 active-0.322
Token bird
Token ID6512
Feature activation+9.37e-02

Pos logit contributions
leness+0.279
lest+0.261
 active+0.251
 alike+0.251
 forth+0.249
Neg logit contributions
OM-0.158
 happened-0.140
 a-0.135
ena-0.132
 dentist-0.132
Token felt
Token ID2936
Feature activation+1.39e-01

Pos logit contributions
OM+0.129
ena+0.101
emon+0.097
 been+0.094
 CA+0.094
Neg logit contributions
leness-0.169
lest-0.168
mer-0.165
ling-0.164
 active-0.164
Token helpless
Token ID21144
Feature activation-1.97e-01

Pos logit contributions
OM+0.226
ena+0.189
 been+0.189
 happened+0.189
 CA+0.185
Neg logit contributions
leness-0.210
lest-0.206
mer-0.203
 active-0.198
 alike-0.196
Token.
Token ID13
Feature activation+3.27e-02

Pos logit contributions
ling+0.261
 active+0.255
mer+0.246
lest+0.245
 alike+0.242
Neg logit contributions
OM-0.374
ena-0.314
emon-0.309
 CA-0.306
 dentist-0.296
Token It
Token ID632
Feature activation+2.18e-02

Pos logit contributions
OM+0.058
ena+0.049
 CA+0.049
 been+0.049
 happened+0.049
Neg logit contributions
leness-0.046
lest-0.045
mer-0.043
 active-0.043
 forth-0.042
Token could
Token ID714
Feature activation+8.82e-03

Pos logit contributions
OM+0.031
ena+0.024
emon+0.024
 been+0.023
 CA+0.023
Neg logit contributions
leness-0.038
lest-0.038
mer-0.038
 active-0.038
ling-0.037
Token not
Token ID407
Feature activation+3.76e-02

Pos logit contributions
OM+0.012
ena+0.010
emon+0.009
 happened+0.009
 been+0.009
Neg logit contributions
leness-0.016
lest-0.016
 active-0.015
mer-0.015
 alike-0.015
Token fly
Token ID6129
Feature activation+1.24e-02

Pos logit contributions
OM+0.060
ena+0.049
 been+0.048
 happened+0.048
emon+0.048
Neg logit contributions
leness-0.059
lest-0.058
mer-0.057
 active-0.056
 alike-0.055
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
leness+0.483
mer+0.467
lest+0.465
 active+0.450
 alike+0.435
Neg logit contributions
OM-0.434
ena-0.361
 happened-0.358
 been-0.358
 truth-0.348
Token Sara
Token ID24799
Feature activation+1.30e-01

Pos logit contributions
leness+0.214
lest+0.207
mer+0.204
 active+0.202
ling+0.200
Neg logit contributions
OM-0.125
ena-0.101
 been-0.094
emon-0.093
 CA-0.092
Token.
Token ID13
Feature activation-2.73e-02

Pos logit contributions
OM+0.114
 been+0.108
ena+0.108
 a+0.105
 happened+0.104
Neg logit contributions
leness-0.280
lest-0.274
 active-0.262
mer-0.261
 efficient-0.259
Token We
Token ID775
Feature activation-1.86e-01

Pos logit contributions
ling+0.039
leness+0.037
lest+0.033
 forth+0.033
 active+0.032
Neg logit contributions
OM-0.052
 CA-0.048
 a-0.046
 Ow-0.046
ena-0.045
Token are
Token ID389
Feature activation+3.95e-02

Pos logit contributions
leness+0.254
 alike+0.253
lest+0.245
 active+0.243
ling+0.237
Neg logit contributions
OM-0.334
ena-0.277
 CA-0.274
 dentist-0.271
emon-0.268
Token sorry
Token ID7926
Feature activation-2.86e-01

Pos logit contributions
OM+0.077
ena+0.073
 been+0.073
 a+0.072
 happened+0.071
Neg logit contributions
lest-0.048
leness-0.047
mer-0.047
oc-0.040
fy-0.040
Token,
Token ID11
Feature activation-1.06e-01

Pos logit contributions
leness+0.483
mer+0.468
lest+0.465
 active+0.452
 alike+0.438
Neg logit contributions
OM-0.436
ena-0.362
 happened-0.359
 been-0.358
 truth-0.347
Token Tom
Token ID4186
Feature activation+3.00e-02

Pos logit contributions
leness+0.210
lest+0.203
mer+0.200
 active+0.197
ling+0.195
Neg logit contributions
OM-0.134
ena-0.110
 been-0.103
emon-0.101
 CA-0.101
Token."
Token ID526
Feature activation+5.43e-03

Pos logit contributions
OM+0.037
 been+0.032
 happened+0.032
ena+0.031
 CA+0.030
Neg logit contributions
leness-0.055
lest-0.055
mer-0.053
 active-0.053
 alike-0.052
Token They
Token ID1119
Feature activation-1.90e-01

Pos logit contributions
OM+0.008
ena+0.006
 been+0.006
 happened+0.006
emon+0.006
Neg logit contributions
leness-0.009
lest-0.009
mer-0.009
 active-0.009
 alike-0.009
Token hugged
Token ID41130
Feature activation-8.91e-02

Pos logit contributions
 alike+0.235
leness+0.234
 active+0.233
 forth+0.229
lest+0.229
Neg logit contributions
OM-0.363
emon-0.295
ena-0.295
 CA-0.294
 happened-0.292
Token so
Token ID523
Feature activation+3.69e-02

Pos logit contributions
OM+0.338
ena+0.285
 been+0.280
 happened+0.272
emon+0.269
Neg logit contributions
leness-0.370
lest-0.364
mer-0.355
 active-0.347
ling-0.342
Token proud
Token ID6613
Feature activation-2.38e-01

Pos logit contributions
OM+0.062
 been+0.061
 a+0.061
 happened+0.059
ena+0.059
Neg logit contributions
leness-0.051
 forth-0.049
mer-0.048
lest-0.047
ishly-0.047
Token to
Token ID284
Feature activation-2.14e-02

Pos logit contributions
leness+0.450
lest+0.425
 active+0.422
mer+0.417
ishly+0.415
Neg logit contributions
OM-0.284
 happened-0.274
 been-0.261
 a-0.260
ena-0.259
Token have
Token ID423
Feature activation-6.09e-03

Pos logit contributions
mer+0.041
lest+0.040
leness+0.040
ling+0.040
fy+0.039
Neg logit contributions
OM-0.027
 been-0.025
ena-0.024
 a-0.023
 truth-0.021
Token the
Token ID262
Feature activation+2.57e-02

Pos logit contributions
leness+0.011
 active+0.011
lest+0.011
 alike+0.011
mer+0.011
Neg logit contributions
OM-0.008
ena-0.006
 CA-0.006
emon-0.006
 dentist-0.006
Token flexible
Token ID12846
Feature activation-1.87e-01

Pos logit contributions
OM+0.042
ena+0.034
emon+0.034
 CA+0.034
 happened+0.033
Neg logit contributions
leness-0.038
lest-0.038
mer-0.038
 alike-0.038
 active-0.038
Token statue
Token ID15207
Feature activation+1.48e-01

Pos logit contributions
leness+0.338
lest+0.332
 active+0.327
mer+0.324
 alike+0.319
Neg logit contributions
OM-0.253
ena-0.203
 been-0.199
 happened-0.198
emon-0.191
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
OM+0.188
 been+0.171
ena+0.167
 a+0.166
 happened+0.161
Neg logit contributions
leness-0.275
lest-0.259
 active-0.249
mer-0.247
ling-0.245
Token He
Token ID679
Feature activation-1.11e-01

Pos logit contributions
OM+0.036
ena+0.030
 been+0.030
 CA+0.029
 truth+0.029
Neg logit contributions
lest-0.036
leness-0.035
ling-0.035
 active-0.035
 forth-0.034
Token showed
Token ID3751
Feature activation-4.75e-02

Pos logit contributions
leness+0.190
lest+0.186
mer+0.183
 active+0.182
 alike+0.179
Neg logit contributions
OM-0.161
ena-0.133
 been-0.130
 happened-0.129
 CA-0.126
Token all
Token ID477
Feature activation-4.63e-03

Pos logit contributions
leness+0.086
lest+0.084
mer+0.084
 active+0.083
 alike+0.081
Neg logit contributions
OM-0.062
ena-0.052
 been-0.051
 happened-0.050
 a-0.048