TOP ACTIVATIONS
MAX = 16.286

ran around the park , letting his new k ite fly
tree and thanked it for letting them enjoy its beauty .
thank Bob and Rex for letting them play . They say
thank you to them for letting him see them . He
soon . Thank you for letting us play with it ."
home . " Thanks for letting me wear your tool belt
, " Thank you for letting me play on the equipment
Mom and thanked her for letting them go out . They
, " Thank you for letting me fly your k ite
" Thanks for letting me borrow your toy ,
for introducing herself and for letting her borrow the gray book
over and over again for letting him join in .
. They thanked Mom for letting them use them .
to his mom my for letting him go outside .
fixing the dress and not letting the delay stop her from
. " Thank you for letting me use it . You
the plastic ball , never letting it get in the way
you , Amy , for letting me play with your toys
for soaking the crack and letting me out . You are
. " Thank you for letting me touch your feather ,"

MIN ACTIVATIONS
MIN = -19.835

. They take their umb rell as . "
too slow . The umb rell as go far away .
try to catch their umb rell as . But they are
Exc use me squ ir rell y , I was wondering
them with towels and umb rell as . They hug them
the kids collected their umb rell as and went out to
. They have no umb rell as . They
. We have our umb rell as to keep us dry
. It takes their umb rell as . "
some people running with umb rell as and some cars spl
became heavy and their umb rell as began to dro op
too slow . The umb rell as go far away .
try to catch their umb rell as . But they are
had no jackets or umb rell as , so now we
. They opened their umb rell as and ran around in
jackets , grabbed their umb rell as , and slowly walked
. He had no umb rell as or hats to keep
. It takes their umb rell as . "
about rain bows and umb rell as . They were very
. They take their umb rell as . "

INTERVAL 7.255 - 16.286
CONTAINS 6.889%

! Can I play with you ?" Tim asked . The
never scared . He remembered that the rat had filled the
wants to be our friend ." " Let 's
their toys . <|endoftext|> Tom likes to watch videos on his
there was a boy called Tom . Tom liked to play

INTERVAL -1.775 - 7.255
CONTAINS 59.771%

stack them up . They find a hat , a scarf
Lightning !" Her mom held her hand tightly and
, mom ," they said . " We will not go
come help you ." His mom came over and
and then nodded in understanding . The next morning

INTERVAL -10.805 - -1.775
CONTAINS 30.689%

walk in the forest . He looked high and low ,
named Tim my . Tim my loved to play at the
stubborn and didn 't mean any harm . They continued their
. She had a big tub full of water and bubbles
Sam and asked them what happened . Tom and Sam told

INTERVAL -19.835 - -10.805
CONTAINS 2.652%

carried it back home . Mom my was
with tiny white balls . The rabbit said
to rescue the puppy . The puppy was
't know how to get home . Finally ,
is nice . He offers his toy train to Lily and
Token ran
Token ID4966
Feature activation-5.84e-02

Pos logit contributions
by+0.385
rell+0.379
lying+0.371
 awoke+0.367
 destination+0.362
Neg logit contributions
 we-0.362
 I-0.318
 you-0.295
 who-0.263
 let-0.254
Token around
Token ID1088
Feature activation-2.41e+00

Pos logit contributions
rell+0.003
ench+0.003
lying+0.003
by+0.003
 awoke+0.003
Neg logit contributions
 we-0.003
 I-0.003
 you-0.003
 who-0.003
 that-0.003
Token the
Token ID262
Feature activation+3.58e+00

Pos logit contributions
rell+0.156
by+0.155
lying+0.151
 destination+0.150
ench+0.149
Neg logit contributions
 we-0.100
 I-0.086
 you-0.075
 let-0.066
 who-0.066
Token park
Token ID3952
Feature activation-3.72e+00

Pos logit contributions
 we+0.237
 I+0.215
 you+0.202
 who+0.190
 let+0.187
Neg logit contributions
rell-0.151
by-0.148
lying-0.143
ench-0.142
 destination-0.139
Token,
Token ID11
Feature activation-4.35e+00

Pos logit contributions
rell+0.238
by+0.231
lying+0.231
ench+0.229
 awoke+0.220
Neg logit contributions
 we-0.171
 I-0.147
 you-0.133
 who-0.129
 let-0.120
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
rell+0.235
by+0.229
lying+0.226
 destination+0.220
ench+0.220
Neg logit contributions
 we-0.230
 I-0.206
 you-0.191
 who-0.169
 let-0.165
Token his
Token ID465
Feature activation-1.10e+01

Pos logit contributions
 we+0.829
 I+0.748
 you+0.719
 let+0.665
 who+0.644
Neg logit contributions
rell-0.883
lying-0.882
ench-0.861
by-0.834
 awoke-0.809
Token new
Token ID649
Feature activation-1.74e+00

Pos logit contributions
rell+0.660
by+0.651
lying+0.638
ench+0.631
 destination+0.624
Neg logit contributions
 we-0.529
 I-0.465
 you-0.423
 who-0.391
 let-0.377
Token k
Token ID479
Feature activation+3.96e-01

Pos logit contributions
rell+0.101
lying+0.098
ench+0.096
by+0.096
 awoke+0.092
Neg logit contributions
 we-0.090
 I-0.079
 you-0.074
 who-0.068
 let-0.066
Tokenite
Token ID578
Feature activation+4.52e-01

Pos logit contributions
 we+0.023
 I+0.021
 you+0.019
 let+0.019
 who+0.018
Neg logit contributions
by-0.020
rell-0.020
ench-0.019
lying-0.018
 destination-0.018
Token fly
Token ID6129
Feature activation+3.98e+00

Pos logit contributions
 we+0.024
 I+0.021
 you+0.019
 who+0.018
 let+0.017
Neg logit contributions
rell-0.026
lying-0.025
by-0.025
ench-0.025
 destination-0.024
Token tree
Token ID5509
Feature activation-1.45e+00

Pos logit contributions
 we+0.164
 I+0.144
 you+0.135
 who+0.126
 let+0.125
Neg logit contributions
rell-0.149
by-0.145
lying-0.143
 destination-0.140
ench-0.140
Token and
Token ID290
Feature activation+4.01e+00

Pos logit contributions
rell+0.082
by+0.080
lying+0.079
ench+0.078
 destination+0.077
Neg logit contributions
 we-0.074
 I-0.065
 you-0.060
 who-0.055
 let-0.054
Token thanked
Token ID26280
Feature activation+2.91e+00

Pos logit contributions
 we+0.184
 I+0.152
 you+0.142
 who+0.131
 let+0.126
Neg logit contributions
rell-0.231
by-0.226
 awoke-0.221
 doorway-0.218
lying-0.217
Token it
Token ID340
Feature activation+6.23e+00

Pos logit contributions
 we+0.146
 I+0.126
 you+0.112
 who+0.107
 let+0.106
Neg logit contributions
rell-0.161
by-0.161
lying-0.155
 destination-0.154
zie-0.153
Token for
Token ID329
Feature activation+8.89e-01

Pos logit contributions
 we+0.311
 I+0.273
 you+0.253
 who+0.236
 let+0.226
Neg logit contributions
rell-0.372
by-0.359
lying-0.357
ench-0.352
 destination-0.341
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.048
 I+0.042
 you+0.037
 who+0.036
 let+0.035
Neg logit contributions
rell-0.046
 destination-0.044
by-0.044
 doorway-0.043
lying-0.043
Token them
Token ID606
Feature activation-4.50e+00

Pos logit contributions
 we+0.716
 I+0.626
 you+0.587
 who+0.510
 let+0.504
Neg logit contributions
rell-1.027
lying-0.996
by-0.994
ench-0.982
 destination-0.949
Token enjoy
Token ID2883
Feature activation+3.46e+00

Pos logit contributions
rell+0.260
lying+0.257
by+0.253
ench+0.246
 destination+0.242
Neg logit contributions
 we-0.234
 I-0.207
 you-0.193
 who-0.178
 let-0.174
Token its
Token ID663
Feature activation-3.86e+00

Pos logit contributions
 we+0.151
 I+0.133
 you+0.112
 let+0.111
 who+0.108
Neg logit contributions
by-0.203
 destination-0.202
rell-0.202
ench-0.193
lying-0.192
Token beauty
Token ID8737
Feature activation-2.86e+00

Pos logit contributions
rell+0.190
lying+0.186
by+0.178
ench+0.176
zie+0.169
Neg logit contributions
 we-0.235
 I-0.211
 you-0.201
 who-0.185
 let-0.180
Token.
Token ID13
Feature activation-6.61e-02

Pos logit contributions
rell+0.165
by+0.160
lying+0.159
ench+0.157
 destination+0.153
Neg logit contributions
 we-0.147
 I-0.130
 you-0.120
 who-0.111
 let-0.107
Token thank
Token ID5875
Feature activation+7.18e+00

Pos logit contributions
rell+0.001
by+0.001
lying+0.001
ench+0.001
 awoke+0.001
Neg logit contributions
 we-0.001
 I-0.001
 you-0.001
 who-0.001
 let-0.001
Token Bob
Token ID5811
Feature activation-3.19e+00

Pos logit contributions
 we+0.320
 I+0.292
 let+0.238
 who+0.235
 you+0.227
Neg logit contributions
by-0.414
rell-0.396
lying-0.380
 destination-0.372
rolled-0.372
Token and
Token ID290
Feature activation+4.06e+00

Pos logit contributions
rell+0.196
lying+0.190
by+0.189
ench+0.188
 destination+0.184
Neg logit contributions
 we-0.151
 I-0.132
 you-0.121
 who-0.113
 let-0.108
Token Rex
Token ID17853
Feature activation+5.16e+00

Pos logit contributions
 we+0.175
 I+0.152
 you+0.137
 who+0.125
 let+0.115
Neg logit contributions
by-0.256
rell-0.253
lying-0.247
 destination-0.241
ench-0.240
Token for
Token ID329
Feature activation+8.89e-01

Pos logit contributions
 we+0.268
 I+0.235
 you+0.218
 who+0.204
 let+0.198
Neg logit contributions
rell-0.297
lying-0.286
by-0.285
ench-0.283
 destination-0.276
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.043
 I+0.038
 you+0.032
 let+0.029
 who+0.028
Neg logit contributions
rell-0.050
by-0.049
 ease-0.047
 destination-0.047
ench-0.047
Token them
Token ID606
Feature activation-4.50e+00

Pos logit contributions
 we+0.834
 I+0.735
 you+0.723
 who+0.651
 let+0.650
Neg logit contributions
rell-0.905
lying-0.878
ench-0.864
by-0.852
 doorway-0.841
Token play
Token ID711
Feature activation-4.12e-01

Pos logit contributions
rell+0.259
lying+0.253
by+0.251
ench+0.245
 destination+0.241
Neg logit contributions
 we-0.234
 I-0.205
 you-0.192
 who-0.177
 let-0.174
Token.
Token ID13
Feature activation-5.73e-02

Pos logit contributions
rell+0.022
by+0.022
ench+0.022
lying+0.022
 destination+0.021
Neg logit contributions
 we-0.021
 I-0.018
 you-0.016
 let-0.015
 who-0.015
Token They
Token ID1119
Feature activation+1.16e-01

Pos logit contributions
rell+0.003
lying+0.003
 Eventually+0.003
by+0.002
ench+0.002
Neg logit contributions
 we-0.003
 I-0.003
 you-0.003
 who-0.003
 let-0.003
Token say
Token ID910
Feature activation+3.29e+00

Pos logit contributions
 we+0.006
 I+0.005
 you+0.005
 who+0.004
 oh+0.004
Neg logit contributions
rell-0.007
by-0.007
lying-0.006
 destination-0.006
ench-0.006
Token thank
Token ID5875
Feature activation+7.31e+00

Pos logit contributions
 we+0.144
 I+0.128
 you+0.120
 who+0.116
 let+0.111
Neg logit contributions
rell-0.121
by-0.119
lying-0.114
 awoke-0.113
ench-0.110
Token you
Token ID345
Feature activation+7.48e+00

Pos logit contributions
 we+0.131
 I+0.093
 who+0.059
 spies+0.056
 chefs+0.055
Neg logit contributions
by-0.581
rell-0.564
lying-0.558
zie-0.539
rolled-0.533
Token to
Token ID284
Feature activation-3.56e+00

Pos logit contributions
 we+0.366
 I+0.319
 you+0.297
 who+0.275
 let+0.260
Neg logit contributions
rell-0.467
lying-0.447
ench-0.446
by-0.436
 destination-0.429
Token them
Token ID606
Feature activation-4.49e+00

Pos logit contributions
rell+0.184
by+0.181
zie+0.177
 destination+0.172
 ease+0.169
Neg logit contributions
 we-0.188
 I-0.164
 you-0.144
We-0.143
 who-0.139
Token for
Token ID329
Feature activation+8.70e-01

Pos logit contributions
rell+0.284
by+0.277
lying+0.274
ench+0.271
 destination+0.265
Neg logit contributions
 we-0.205
 I-0.178
 you-0.162
 who-0.149
 let-0.143
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.043
 I+0.038
We+0.030
 you+0.030
 who+0.030
Neg logit contributions
rell-0.046
 ease-0.044
 destination-0.044
 doorway-0.043
 home-0.043
Token him
Token ID683
Feature activation-1.10e+01

Pos logit contributions
 we+0.944
 I+0.849
 you+0.816
 who+0.742
 let+0.734
Neg logit contributions
rell-0.800
lying-0.771
ench-0.762
by-0.761
 destination-0.727
Token see
Token ID766
Feature activation+5.44e+00

Pos logit contributions
rell+0.626
by+0.609
lying+0.607
ench+0.596
 destination+0.584
Neg logit contributions
 we-0.573
 I-0.508
 you-0.469
 who-0.435
 let-0.423
Token them
Token ID606
Feature activation-4.48e+00

Pos logit contributions
 we+0.298
 I+0.264
 you+0.228
 let+0.224
 who+0.218
Neg logit contributions
by-0.269
rell-0.266
 destination-0.250
lying-0.245
ench-0.245
Token.
Token ID13
Feature activation-7.04e-02

Pos logit contributions
rell+0.273
lying+0.266
by+0.266
ench+0.260
zie+0.253
Neg logit contributions
 we-0.218
 I-0.190
 you-0.176
 who-0.163
 let-0.157
Token He
Token ID679
Feature activation-6.88e+00

Pos logit contributions
rell+0.002
lying+0.002
 Eventually+0.002
by+0.002
rolled+0.002
Neg logit contributions
 we-0.005
 you-0.004
 I-0.004
 who-0.004
 let-0.004
Token soon
Token ID2582
Feature activation+3.74e+00

Pos logit contributions
by+0.880
 destination+0.878
ench+0.856
rell+0.836
lying+0.821
Neg logit contributions
 we-0.725
 I-0.674
 you-0.583
Bad-0.552
You-0.548
Token.
Token ID13
Feature activation-6.20e-02

Pos logit contributions
 we+0.159
 I+0.139
 you+0.122
 who+0.117
Bad+0.113
Neg logit contributions
rell-0.230
lying-0.224
ench-0.224
by-0.224
 destination-0.224
Token Thank
Token ID6952
Feature activation+7.60e+00

Pos logit contributions
 Eventually+0.004
 Today+0.004
 Along+0.003
 Later+0.003
lying+0.003
Neg logit contributions
 we-0.002
 that-0.002
ms-0.002
ts-0.002
 you-0.002
Token you
Token ID345
Feature activation+7.49e+00

Pos logit contributions
 we+0.134
 annoying+0.120
 wasting+0.107
 exc+0.104
 I+0.101
Neg logit contributions
by-0.549
day-0.521
rell-0.516
lying-0.512
am-0.505
Token for
Token ID329
Feature activation+8.83e-01

Pos logit contributions
 we+0.395
 I+0.354
 you+0.322
 who+0.307
 let+0.297
Neg logit contributions
rell-0.401
by-0.399
lying-0.385
ench-0.383
 destination-0.379
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.041
 I+0.037
 you+0.029
We+0.028
 don+0.028
Neg logit contributions
rell-0.048
by-0.047
 destination-0.047
 home-0.046
 doorway-0.046
Token us
Token ID514
Feature activation-6.45e+00

Pos logit contributions
 we+0.858
 I+0.764
 you+0.706
 who+0.656
 let+0.638
Neg logit contributions
rell-0.875
by-0.851
lying-0.843
ench-0.831
 destination-0.810
Token play
Token ID711
Feature activation-3.96e-01

Pos logit contributions
rell+0.339
by+0.334
ench+0.325
lying+0.320
 destination+0.316
Neg logit contributions
 we-0.350
 I-0.315
 you-0.285
 who-0.271
 let-0.260
Token with
Token ID351
Feature activation+1.02e+01

Pos logit contributions
 destination+0.022
ench+0.022
rell+0.021
 home+0.021
date+0.021
Neg logit contributions
 we-0.017
 I-0.015
 let-0.013
 wasting-0.013
Bad-0.012
Token it
Token ID340
Feature activation+6.24e+00

Pos logit contributions
 we+0.450
 I+0.404
 let+0.354
You+0.335
 who+0.330
Neg logit contributions
by-0.587
 destination-0.566
rell-0.555
ench-0.532
zie-0.525
Token."
Token ID526
Feature activation+8.43e+00

Pos logit contributions
 we+0.270
 I+0.233
 you+0.209
 who+0.191
 let+0.184
Neg logit contributions
rell-0.407
by-0.397
lying-0.393
ench-0.389
 destination-0.382
Token home
Token ID1363
Feature activation-1.59e+01

Pos logit contributions
rell+0.272
lying+0.271
ench+0.256
aced+0.250
by+0.250
Neg logit contributions
 we-0.361
 I-0.321
 you-0.304
 who-0.298
 that-0.296
Token.
Token ID13
Feature activation-6.67e-02

Pos logit contributions
rell+0.978
by+0.942
lying+0.940
ench+0.920
zie+0.890
Neg logit contributions
 we-0.784
 I-0.682
 you-0.626
 who-0.592
 let-0.561
Token "
Token ID366
Feature activation+3.58e+00

Pos logit contributions
by+0.003
rell+0.003
rol+0.003
ench+0.003
lying+0.003
Neg logit contributions
 we-0.004
 I-0.004
 you-0.003
 who-0.003
 let-0.003
TokenThanks
Token ID9690
Feature activation+1.94e+00

Pos logit contributions
 we+0.168
 I+0.144
 you+0.130
 who+0.116
 that+0.112
Neg logit contributions
rell-0.218
by-0.215
lying-0.215
 awoke-0.211
ench-0.207
Token for
Token ID329
Feature activation+8.66e-01

Pos logit contributions
 we+0.101
 I+0.091
 you+0.083
 who+0.075
 let+0.074
Neg logit contributions
rell-0.113
lying-0.109
by-0.108
ench-0.107
 awoke-0.103
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.042
 I+0.036
 who+0.030
 you+0.030
We+0.029
Neg logit contributions
rell-0.048
 destination-0.047
by-0.046
 doorway-0.046
zie-0.045
Token me
Token ID502
Feature activation-7.87e+00

Pos logit contributions
 we+0.853
 I+0.767
 you+0.737
 let+0.650
 who+0.649
Neg logit contributions
rell-0.894
lying-0.879
by-0.857
ench-0.839
 awoke-0.810
Token wear
Token ID5806
Feature activation+5.31e+00

Pos logit contributions
rell+0.429
by+0.416
ench+0.413
 destination+0.404
lying+0.402
Neg logit contributions
 we-0.408
 I-0.363
 you-0.330
 who-0.314
 let-0.301
Token your
Token ID534
Feature activation+4.40e-01

Pos logit contributions
 we+0.183
 I+0.151
 you+0.126
 who+0.118
 let+0.113
Neg logit contributions
rell-0.390
by-0.378
lying-0.372
ench-0.371
 destination-0.371
Token tool
Token ID2891
Feature activation-8.45e-01

Pos logit contributions
 we+0.024
 I+0.022
 you+0.020
 who+0.019
 let+0.018
Neg logit contributions
rell-0.024
lying-0.023
by-0.023
ench-0.023
 awoke-0.022
Token belt
Token ID10999
Feature activation-9.82e-01

Pos logit contributions
rell+0.051
lying+0.049
by+0.049
ench+0.048
 awoke+0.047
Neg logit contributions
 we-0.042
 I-0.037
 you-0.034
 who-0.031
 let-0.030
Token,
Token ID11
Feature activation-4.33e+00

Pos logit contributions
 we+0.113
 I+0.102
 you+0.094
 who+0.087
 let+0.085
Neg logit contributions
rell-0.106
by-0.102
lying-0.101
ench-0.100
 destination-0.098
Token "
Token ID366
Feature activation+3.59e+00

Pos logit contributions
rell+0.256
lying+0.249
by+0.244
ench+0.239
 awoke+0.238
Neg logit contributions
 we-0.200
 I-0.172
 you-0.163
 who-0.142
 let-0.137
TokenThank
Token ID10449
Feature activation+1.10e+01

Pos logit contributions
 we+0.170
 I+0.139
 you+0.129
 who+0.118
 that+0.114
Neg logit contributions
lying-0.215
rell-0.213
by-0.210
 awoke-0.209
zie-0.203
Token you
Token ID345
Feature activation+7.48e+00

Pos logit contributions
 we+0.201
 who+0.133
 let+0.126
 exc+0.119
 I+0.115
Neg logit contributions
rell-0.849
by-0.829
ippy-0.820
lying-0.815
zie-0.808
Token for
Token ID329
Feature activation+8.79e-01

Pos logit contributions
 we+0.394
 I+0.350
 you+0.323
 who+0.298
 let+0.290
Neg logit contributions
rell-0.426
by-0.414
lying-0.411
ench-0.404
 destination-0.394
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.044
 I+0.037
We+0.031
ms+0.031
 don+0.030
Neg logit contributions
rell-0.046
 home-0.043
 doorway-0.043
 ease-0.042
zie-0.042
Token me
Token ID502
Feature activation-7.88e+00

Pos logit contributions
 we+0.931
 I+0.834
 you+0.768
 who+0.729
 let+0.705
Neg logit contributions
rell-0.799
by-0.775
lying-0.761
ench-0.755
 destination-0.737
Token play
Token ID711
Feature activation-4.11e-01

Pos logit contributions
rell+0.409
by+0.398
ench+0.394
lying+0.379
 destination+0.373
Neg logit contributions
 we-0.425
 I-0.379
 you-0.341
 who-0.331
Bad-0.316
Token on
Token ID319
Feature activation+6.51e+00

Pos logit contributions
ench+0.024
rell+0.024
 destination+0.023
date+0.023
 home+0.022
Neg logit contributions
 we-0.017
 I-0.015
 let-0.012
 wasting-0.012
 you-0.012
Token the
Token ID262
Feature activation+3.65e+00

Pos logit contributions
 we+0.234
 I+0.193
 who+0.165
 you+0.158
 let+0.157
Neg logit contributions
rell-0.460
 destination-0.438
by-0.437
ench-0.430
zie-0.423
Token equipment
Token ID5112
Feature activation-3.81e+00

Pos logit contributions
 we+0.202
 I+0.182
 you+0.164
 who+0.159
 let+0.154
Neg logit contributions
rell-0.191
by-0.183
 destination-0.179
ench-0.176
lying-0.174
Token Mom
Token ID11254
Feature activation+4.34e+00

Pos logit contributions
 we+0.138
 I+0.121
 you+0.105
 who+0.091
 let+0.090
Neg logit contributions
by-0.210
rell-0.208
lying-0.203
ench-0.200
 destination-0.200
Token and
Token ID290
Feature activation+4.09e+00

Pos logit contributions
 we+0.191
 I+0.163
 you+0.150
 who+0.147
 let+0.137
Neg logit contributions
rell-0.282
lying-0.275
ench-0.274
by-0.270
 destination-0.266
Token thanked
Token ID26280
Feature activation+2.93e+00

Pos logit contributions
 we+0.195
 I+0.174
 you+0.164
 who+0.145
 spies+0.138
Neg logit contributions
by-0.231
rell-0.221
 awoke-0.219
lying-0.219
zie-0.211
Token her
Token ID607
Feature activation-8.42e+00

Pos logit contributions
 we+0.169
 I+0.154
 you+0.136
 who+0.131
 let+0.127
Neg logit contributions
by-0.139
rell-0.134
lying-0.132
 destination-0.131
zie-0.129
Token for
Token ID329
Feature activation+8.84e-01

Pos logit contributions
rell+0.472
by+0.468
lying+0.457
ench+0.453
 destination+0.447
Neg logit contributions
 we-0.435
 I-0.388
 you-0.356
 who-0.326
 let-0.318
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.043
 I+0.039
 you+0.032
 let+0.028
 who+0.028
Neg logit contributions
rell-0.049
 destination-0.048
 doorway-0.048
by-0.047
 ease-0.047
Token them
Token ID606
Feature activation-4.48e+00

Pos logit contributions
 we+0.792
 I+0.691
 you+0.682
 let+0.620
 who+0.603
Neg logit contributions
rell-0.962
lying-0.919
ench-0.898
by-0.893
 awoke-0.860
Token go
Token ID467
Feature activation-5.61e+00

Pos logit contributions
rell+0.250
lying+0.245
by+0.243
ench+0.236
 destination+0.232
Neg logit contributions
 we-0.241
 I-0.213
 you-0.199
 who-0.185
 let-0.182
Token out
Token ID503
Feature activation-9.40e+00

Pos logit contributions
rell+0.268
lying+0.264
by+0.261
ench+0.255
 awoke+0.248
Neg logit contributions
 we-0.357
 I-0.316
 you-0.302
 who-0.282
 let-0.274
Token.
Token ID13
Feature activation-6.76e-02

Pos logit contributions
rell+0.570
by+0.557
lying+0.552
ench+0.544
 destination+0.531
Neg logit contributions
 we-0.457
 I-0.400
 you-0.366
 who-0.338
 let-0.326
Token They
Token ID1119
Feature activation+1.19e-01

Pos logit contributions
rell+0.003
lying+0.003
by+0.003
ench+0.003
 destination+0.003
Neg logit contributions
 we-0.004
 I-0.003
 you-0.003
 who-0.003
 let-0.003
Token,
Token ID11
Feature activation-4.34e+00

Pos logit contributions
 we+0.144
 I+0.133
 you+0.125
 who+0.119
 let+0.117
Neg logit contributions
rell-0.072
by-0.069
lying-0.067
ench-0.066
 destination-0.064
Token "
Token ID366
Feature activation+3.58e+00

Pos logit contributions
rell+0.239
lying+0.236
by+0.230
 awoke+0.227
ench+0.222
Neg logit contributions
 we-0.212
 I-0.184
 you-0.176
 who-0.154
 let-0.148
TokenThank
Token ID10449
Feature activation+1.10e+01

Pos logit contributions
 we+0.163
 I+0.138
 you+0.120
 who+0.115
 that+0.110
Neg logit contributions
by-0.219
lying-0.218
rell-0.216
 awoke-0.213
rol-0.205
Token you
Token ID345
Feature activation+7.48e+00

Pos logit contributions
 we+0.193
 who+0.112
 I+0.110
 let+0.109
 exc+0.104
Neg logit contributions
by-0.857
rell-0.855
lying-0.832
ippy-0.822
zie-0.817
Token for
Token ID329
Feature activation+8.85e-01

Pos logit contributions
 we+0.405
 I+0.361
 you+0.332
 who+0.312
 let+0.302
Neg logit contributions
rell-0.403
by-0.396
lying-0.388
ench-0.384
 destination-0.377
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.042
 I+0.036
 you+0.029
We+0.029
 don+0.029
Neg logit contributions
rell-0.049
 destination-0.047
 awoke-0.046
 doorway-0.046
 home-0.045
Token me
Token ID502
Feature activation-7.87e+00

Pos logit contributions
 we+0.920
 I+0.822
 you+0.749
 who+0.720
 let+0.692
Neg logit contributions
rell-0.800
by-0.779
lying-0.762
ench-0.759
 destination-0.745
Token fly
Token ID6129
Feature activation+4.00e+00

Pos logit contributions
rell+0.417
by+0.408
ench+0.407
lying+0.391
 destination+0.390
Neg logit contributions
 we-0.411
 I-0.365
 you-0.327
 who-0.316
Bad-0.306
Token your
Token ID534
Feature activation+4.55e-01

Pos logit contributions
 we+0.203
 I+0.179
 you+0.162
 who+0.153
 let+0.148
Neg logit contributions
rell-0.228
by-0.224
lying-0.218
ench-0.218
 destination-0.214
Token k
Token ID479
Feature activation+4.05e-01

Pos logit contributions
 we+0.028
 I+0.025
 you+0.024
 who+0.022
 let+0.022
Neg logit contributions
rell-0.021
by-0.021
lying-0.021
ench-0.020
 destination-0.019
Tokenite
Token ID578
Feature activation+4.59e-01

Pos logit contributions
 we+0.024
 I+0.022
 you+0.020
 who+0.019
 let+0.019
Neg logit contributions
rell-0.020
by-0.020
ench-0.019
lying-0.019
 destination-0.018
Token
Token ID198
Feature activation+1.44e-02

Pos logit contributions
rell+0.702
by+0.689
lying+0.685
ench+0.670
 destination+0.670
Neg logit contributions
 we-0.526
 I-0.467
 you-0.423
 who-0.389
 let-0.365
Token
Token ID198
Feature activation+2.90e-02

Pos logit contributions
 we+0.001
 I+0.001
 you+0.001
 who+0.001
 let+0.001
Neg logit contributions
rell-0.001
by-0.001
lying-0.001
ench-0.001
 destination-0.001
Token"
Token ID1
Feature activation+1.40e+00

Pos logit contributions
 we+0.001
 I+0.001
 you+0.001
 who+0.001
 let+0.001
Neg logit contributions
rell-0.002
lying-0.002
by-0.002
ench-0.002
rolled-0.002
TokenThanks
Token ID9690
Feature activation+1.93e+00

Pos logit contributions
 we+0.071
 I+0.058
 you+0.054
 spies+0.052
 that+0.051
Neg logit contributions
lying-0.073
rell-0.071
side-0.071
rol-0.068
 awoke-0.068
Token for
Token ID329
Feature activation+8.71e-01

Pos logit contributions
 we+0.100
 I+0.089
 you+0.082
 who+0.075
 let+0.073
Neg logit contributions
rell-0.110
by-0.107
lying-0.106
ench-0.105
 destination-0.102
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.042
 I+0.035
We+0.030
ms+0.029
 who+0.029
Neg logit contributions
rell-0.046
 destination-0.044
 doorway-0.044
zie-0.044
 ease-0.043
Token me
Token ID502
Feature activation-7.87e+00

Pos logit contributions
 we+0.885
 I+0.793
 you+0.748
 who+0.685
 let+0.670
Neg logit contributions
rell-0.855
lying-0.830
by-0.828
ench-0.811
 destination-0.788
Token borrow
Token ID8804
Feature activation-4.02e-01

Pos logit contributions
rell+0.404
by+0.392
 destination+0.388
ench+0.382
rolled+0.375
Neg logit contributions
 we-0.402
 I-0.362
 you-0.321
You-0.317
Bad-0.316
Token your
Token ID534
Feature activation+4.47e-01

Pos logit contributions
rell+0.029
 destination+0.028
by+0.028
lying+0.027
ench+0.026
Neg logit contributions
 we-0.013
 I-0.011
 who-0.009
 let-0.008
 you-0.008
Token toy
Token ID13373
Feature activation-1.91e+00

Pos logit contributions
 we+0.025
 I+0.023
 you+0.021
 who+0.020
 let+0.020
Neg logit contributions
rell-0.022
by-0.022
 destination-0.021
lying-0.021
ench-0.020
Token,
Token ID11
Feature activation-4.32e+00

Pos logit contributions
rell+0.102
by+0.100
lying+0.098
 destination+0.097
ench+0.095
Neg logit contributions
 we-0.102
 I-0.092
 you-0.084
 who-0.080
 let-0.079
Token for
Token ID329
Feature activation+8.68e-01

Pos logit contributions
rell+0.016
ench+0.015
lying+0.015
by+0.015
 destination+0.014
Neg logit contributions
 we-0.010
 I-0.008
 you-0.007
 who-0.007
 let-0.007
Token introducing
Token ID16118
Feature activation+5.51e+00

Pos logit contributions
 we+0.045
 I+0.040
 you+0.034
 who+0.032
 let+0.031
Neg logit contributions
rell-0.048
by-0.045
 ease-0.044
 doorway-0.044
lying-0.044
Token herself
Token ID5223
Feature activation-2.79e+00

Pos logit contributions
 we+0.328
 I+0.295
 you+0.273
 who+0.257
 let+0.251
Neg logit contributions
rell-0.272
by-0.265
lying-0.261
ench-0.257
 destination-0.250
Token and
Token ID290
Feature activation+4.08e+00

Pos logit contributions
rell+0.174
lying+0.169
ench+0.167
by+0.165
 destination+0.159
Neg logit contributions
 we-0.134
 I-0.117
 you-0.109
 who-0.103
 that-0.096
Token for
Token ID329
Feature activation+8.73e-01

Pos logit contributions
 we+0.198
 I+0.175
 you+0.159
 who+0.146
 let+0.140
Neg logit contributions
rell-0.236
by-0.235
lying-0.229
 destination-0.224
ench-0.221
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.043
 I+0.037
 you+0.031
 who+0.029
 let+0.028
Neg logit contributions
by-0.050
rell-0.050
 doorway-0.048
lying-0.048
 destination-0.048
Token her
Token ID607
Feature activation-8.42e+00

Pos logit contributions
 we+0.820
 I+0.728
 you+0.706
 let+0.625
 who+0.624
Neg logit contributions
rell-0.919
lying-0.884
ench-0.874
by-0.867
 awoke-0.835
Token borrow
Token ID8804
Feature activation-4.12e-01

Pos logit contributions
rell+0.476
by+0.472
lying+0.460
ench+0.455
 destination+0.450
Neg logit contributions
 we-0.430
 I-0.381
 you-0.346
 who-0.322
 let-0.312
Token the
Token ID262
Feature activation+3.64e+00

Pos logit contributions
rell+0.029
by+0.029
lying+0.028
ench+0.028
 destination+0.028
Neg logit contributions
 we-0.016
 I-0.013
 you-0.011
 who-0.010
 let-0.010
Token gray
Token ID12768
Feature activation+2.53e+00

Pos logit contributions
 we+0.239
 I+0.217
 you+0.195
 who+0.192
 let+0.190
Neg logit contributions
by-0.149
rell-0.147
 destination-0.141
lying-0.137
 doorway-0.135
Token book
Token ID1492
Feature activation-3.78e+00

Pos logit contributions
 we+0.155
 I+0.140
 you+0.130
 who+0.123
 let+0.121
Neg logit contributions
rell-0.119
by-0.116
lying-0.113
ench-0.111
 destination-0.109
Token over
Token ID625
Feature activation-4.07e+00

Pos logit contributions
rell+0.277
by+0.269
lying+0.267
ench+0.264
 destination+0.258
Neg logit contributions
 we-0.213
 I-0.186
 you-0.170
 who-0.157
 let-0.151
Token and
Token ID290
Feature activation+4.07e+00

Pos logit contributions
by+0.237
rell+0.235
lying+0.231
 destination+0.226
ench+0.226
Neg logit contributions
 we-0.201
 I-0.177
 you-0.160
 who-0.146
 let-0.144
Token over
Token ID625
Feature activation-4.07e+00

Pos logit contributions
 we+0.197
 I+0.176
 you+0.161
 who+0.147
 let+0.138
Neg logit contributions
by-0.238
rell-0.237
lying-0.231
ench-0.225
 destination-0.224
Token again
Token ID757
Feature activation-3.83e+00

Pos logit contributions
rell+0.216
by+0.209
lying+0.207
ench+0.204
 destination+0.199
Neg logit contributions
 we-0.229
 I-0.205
 you-0.190
 who-0.178
 let-0.173
Token for
Token ID329
Feature activation+8.66e-01

Pos logit contributions
by+0.191
rell+0.185
lying+0.184
 destination+0.180
ench+0.178
Neg logit contributions
 we-0.215
 I-0.194
 you-0.179
 who-0.162
 let-0.160
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.042
 I+0.037
 you+0.031
 who+0.028
 let+0.027
Neg logit contributions
rell-0.050
by-0.049
 doorway-0.048
 ease-0.047
 destination-0.047
Token him
Token ID683
Feature activation-1.10e+01

Pos logit contributions
 we+0.923
 I+0.821
 you+0.806
 who+0.730
 let+0.728
Neg logit contributions
rell-0.815
lying-0.795
ench-0.775
by-0.773
 awoke-0.761
Token join
Token ID4654
Feature activation+3.38e+00

Pos logit contributions
rell+0.624
lying+0.605
by+0.604
ench+0.594
 destination+0.584
Neg logit contributions
 we-0.580
 I-0.515
 you-0.477
 who-0.439
 let-0.432
Token in
Token ID287
Feature activation+3.33e+00

Pos logit contributions
 we+0.184
 I+0.164
 you+0.151
 who+0.140
 let+0.137
Neg logit contributions
rell-0.184
by-0.179
lying-0.177
ench-0.175
 destination-0.171
Token.
Token ID13
Feature activation-8.71e-02

Pos logit contributions
 we+0.158
 I+0.139
 you+0.124
 who+0.115
 let+0.112
Neg logit contributions
rell-0.201
by-0.197
lying-0.194
ench-0.192
 destination-0.190
Token
Token ID198
Feature activation+2.14e-02

Pos logit contributions
by+0.005
rell+0.005
lying+0.005
ench+0.005
 destination+0.005
Neg logit contributions
 we-0.005
 I-0.004
 you-0.004
 who-0.004
 let-0.003
Token.
Token ID13
Feature activation-7.04e-02

Pos logit contributions
rell+0.202
by+0.195
lying+0.195
ench+0.191
 awoke+0.185
Neg logit contributions
 we-0.165
 I-0.142
 you-0.130
 who-0.126
 let-0.120
Token They
Token ID1119
Feature activation+1.15e-01

Pos logit contributions
rell+0.003
by+0.003
lying+0.003
ench+0.003
 destination+0.003
Neg logit contributions
 we-0.004
 I-0.004
 you-0.003
 who-0.003
 let-0.003
Token thanked
Token ID26280
Feature activation+2.92e+00

Pos logit contributions
 we+0.006
 I+0.006
 you+0.005
 who+0.005
 let+0.005
Neg logit contributions
rell-0.006
by-0.006
lying-0.006
ench-0.006
 destination-0.006
Token Mom
Token ID11254
Feature activation+4.32e+00

Pos logit contributions
 we+0.126
 I+0.115
 you+0.096
 let+0.086
 who+0.086
Neg logit contributions
by-0.180
 destination-0.174
rell-0.173
zie-0.167
lying-0.166
Token for
Token ID329
Feature activation+8.66e-01

Pos logit contributions
 we+0.220
 I+0.191
 you+0.179
 who+0.177
 let+0.165
Neg logit contributions
rell-0.254
ench-0.248
lying-0.245
by-0.237
rolled-0.234
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.041
 I+0.036
 you+0.031
 who+0.027
 let+0.027
Neg logit contributions
rell-0.052
by-0.050
 destination-0.049
 doorway-0.049
lying-0.049
Token them
Token ID606
Feature activation-4.49e+00

Pos logit contributions
 we+0.782
 I+0.671
 you+0.656
 let+0.600
 who+0.585
Neg logit contributions
rell-0.982
lying-0.927
ench-0.918
by-0.913
 awoke-0.877
Token use
Token ID779
Feature activation+8.60e+00

Pos logit contributions
rell+0.264
lying+0.262
by+0.256
ench+0.251
zie+0.246
Neg logit contributions
 we-0.233
 I-0.202
 you-0.189
 who-0.177
 let-0.174
Token them
Token ID606
Feature activation-4.50e+00

Pos logit contributions
 we+0.321
 I+0.267
 you+0.241
 who+0.215
 let+0.202
Neg logit contributions
rell-0.618
by-0.606
lying-0.605
ench-0.597
 awoke-0.580
Token.
Token ID13
Feature activation-8.71e-02

Pos logit contributions
rell+0.275
lying+0.270
ench+0.263
by+0.256
zie+0.254
Neg logit contributions
 we-0.232
 I-0.195
 you-0.189
 who-0.180
 that-0.170
Token
Token ID198
Feature activation+2.14e-02

Pos logit contributions
rell+0.005
by+0.005
lying+0.005
ench+0.005
 destination+0.004
Neg logit contributions
 we-0.005
 I-0.004
 you-0.004
 who-0.004
 let-0.004
Token to
Token ID284
Feature activation-3.56e+00

Pos logit contributions
 we+0.238
 I+0.211
 you+0.192
 who+0.177
 let+0.173
Neg logit contributions
rell-0.266
by-0.265
lying-0.258
ench-0.254
 destination-0.250
Token his
Token ID465
Feature activation-1.10e+01

Pos logit contributions
rell+0.155
by+0.149
lying+0.147
ench+0.145
 destination+0.140
Neg logit contributions
 we-0.234
 I-0.213
 you-0.199
 who-0.189
 let-0.184
Token mom
Token ID1995
Feature activation+5.31e+00

Pos logit contributions
rell+0.779
by+0.766
lying+0.756
ench+0.750
 destination+0.741
Neg logit contributions
 we-0.420
 I-0.356
 you-0.315
 who-0.280
 let-0.269
Tokenmy
Token ID1820
Feature activation-7.82e+00

Pos logit contributions
 we+0.398
 I+0.368
 who+0.349
 you+0.348
 let+0.337
Neg logit contributions
rell-0.185
ench-0.174
lying-0.173
by-0.173
 destination-0.169
Token for
Token ID329
Feature activation+8.66e-01

Pos logit contributions
rell+0.558
by+0.537
ench+0.535
lying+0.535
rol+0.521
Neg logit contributions
 we-0.308
 I-0.257
 who-0.232
 you-0.231
 let-0.211
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.044
 I+0.039
 you+0.033
 pige+0.030
We+0.030
Neg logit contributions
rell-0.046
 ease-0.044
 doorway-0.043
rolled-0.043
 destination-0.043
Token him
Token ID683
Feature activation-1.10e+01

Pos logit contributions
 we+1.009
 I+0.911
 you+0.905
 let+0.828
 who+0.811
Neg logit contributions
rell-0.739
by-0.715
ench-0.702
lying-0.701
zie-0.668
Token go
Token ID467
Feature activation-5.63e+00

Pos logit contributions
lying+0.649
rell+0.646
by+0.624
 destination+0.609
zie+0.609
Neg logit contributions
 we-0.577
 I-0.506
 you-0.476
 who-0.440
 let-0.437
Token outside
Token ID2354
Feature activation-5.15e+00

Pos logit contributions
lying+0.308
rell+0.306
by+0.305
ench+0.296
aced+0.293
Neg logit contributions
 we-0.324
 I-0.286
 you-0.270
 who-0.246
 that-0.236
Token.
Token ID13
Feature activation-8.71e-02

Pos logit contributions
rell+0.319
by+0.310
lying+0.309
ench+0.303
 destination+0.294
Neg logit contributions
 we-0.251
 I-0.219
 you-0.200
 who-0.188
 let-0.177
Token
Token ID198
Feature activation+2.14e-02

Pos logit contributions
rell+0.005
by+0.005
ench+0.005
aced+0.005
lying+0.005
Neg logit contributions
 we-0.004
 I-0.004
 you-0.003
 who-0.003
 let-0.003
Token fixing
Token ID18682
Feature activation+6.66e+00

Pos logit contributions
 we+0.046
 I+0.040
 you+0.034
We+0.031
 pige+0.031
Neg logit contributions
rell-0.045
 ease-0.045
zie-0.045
 destination-0.044
lying-0.044
Token the
Token ID262
Feature activation+3.63e+00

Pos logit contributions
 we+0.324
 I+0.285
 you+0.255
 who+0.237
 let+0.233
Neg logit contributions
rell-0.398
by-0.391
lying-0.383
ench-0.377
 destination-0.374
Token dress
Token ID6576
Feature activation+1.45e+00

Pos logit contributions
 we+0.205
 I+0.183
 you+0.168
 who+0.158
 let+0.156
Neg logit contributions
rell-0.185
by-0.183
lying-0.178
 destination-0.176
ench-0.176
Token and
Token ID290
Feature activation+4.08e+00

Pos logit contributions
 we+0.070
 I+0.062
 you+0.057
 who+0.053
 let+0.050
Neg logit contributions
rell-0.088
by-0.086
lying-0.086
ench-0.085
 awoke-0.082
Token not
Token ID407
Feature activation+8.48e+00

Pos logit contributions
 we+0.199
 I+0.177
 you+0.161
 who+0.150
Bad+0.141
Neg logit contributions
rell-0.225
by-0.224
 destination-0.214
 doorway-0.213
lying-0.212
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.416
 I+0.362
 you+0.333
 who+0.304
 let+0.287
Neg logit contributions
rell-0.494
by-0.488
lying-0.478
ench-0.464
 destination-0.464
Token the
Token ID262
Feature activation+3.65e+00

Pos logit contributions
 we+0.848
 I+0.756
 you+0.705
 who+0.648
 let+0.635
Neg logit contributions
rell-0.882
by-0.854
lying-0.851
ench-0.840
 destination-0.814
Token delay
Token ID5711
Feature activation-1.23e+00

Pos logit contributions
 we+0.185
 I+0.166
 who+0.144
 you+0.143
 let+0.136
Neg logit contributions
by-0.199
rell-0.194
 destination-0.193
 doorway-0.186
lying-0.186
Token stop
Token ID2245
Feature activation-1.26e+00

Pos logit contributions
rell+0.071
by+0.069
lying+0.068
ench+0.067
 destination+0.065
Neg logit contributions
 we-0.063
 I-0.056
 you-0.052
 who-0.048
 let-0.046
Token her
Token ID607
Feature activation-8.43e+00

Pos logit contributions
rell+0.070
by+0.068
lying+0.067
 destination+0.066
ench+0.065
Neg logit contributions
 we-0.064
 I-0.057
 you-0.051
 who-0.049
 let-0.047
Token from
Token ID422
Feature activation+1.50e+00

Pos logit contributions
rell+0.449
by+0.446
lying+0.433
 destination+0.430
ench+0.424
Neg logit contributions
 we-0.453
 I-0.409
 you-0.371
 who-0.346
 let-0.337
Token.
Token ID13
Feature activation-7.09e-02

Pos logit contributions
 we+0.134
 I+0.119
 you+0.110
 who+0.102
 let+0.099
Neg logit contributions
rell-0.139
by-0.136
lying-0.134
ench-0.132
 destination-0.129
Token "
Token ID366
Feature activation+3.57e+00

Pos logit contributions
rell+0.004
lying+0.004
 Eventually+0.003
ench+0.003
by+0.003
Neg logit contributions
 we-0.004
 I-0.003
 you-0.003
 who-0.003
 let-0.003
TokenThank
Token ID10449
Feature activation+1.10e+01

Pos logit contributions
 we+0.165
 I+0.135
 you+0.122
 wasting+0.115
 not+0.112
Neg logit contributions
lying-0.194
side-0.191
rell-0.183
acked-0.180
by-0.179
Token you
Token ID345
Feature activation+7.47e+00

Pos logit contributions
 we+0.223
 exc+0.190
 let+0.187
 wasting+0.181
 who+0.161
Neg logit contributions
by-0.744
rell-0.731
lying-0.702
day-0.696
ippy-0.689
Token for
Token ID329
Feature activation+8.72e-01

Pos logit contributions
 we+0.412
 I+0.369
 you+0.338
 who+0.321
 let+0.312
Neg logit contributions
rell-0.388
by-0.376
lying-0.370
ench-0.365
 destination-0.361
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.043
 I+0.036
We+0.031
ms+0.029
 We+0.027
Neg logit contributions
rell-0.044
 home-0.043
 ease-0.042
zie-0.040
 doorway-0.040
Token me
Token ID502
Feature activation-7.89e+00

Pos logit contributions
 we+0.882
 I+0.786
 you+0.720
 who+0.680
 let+0.658
Neg logit contributions
rell-0.845
by-0.823
lying-0.810
ench-0.799
 destination-0.783
Token use
Token ID779
Feature activation+8.61e+00

Pos logit contributions
rell+0.446
by+0.438
ench+0.423
lying+0.422
 destination+0.412
Neg logit contributions
 we-0.398
 I-0.352
 you-0.315
 who-0.294
 let-0.283
Token it
Token ID340
Feature activation+6.21e+00

Pos logit contributions
 we+0.293
 I+0.240
 you+0.187
 who+0.182
 let+0.175
Neg logit contributions
rell-0.621
by-0.605
 destination-0.597
lying-0.585
ench-0.573
Token.
Token ID13
Feature activation-7.59e-02

Pos logit contributions
 we+0.265
 I+0.229
 you+0.205
 who+0.187
 let+0.179
Neg logit contributions
rell-0.410
by-0.400
lying-0.397
ench-0.392
 destination-0.385
Token You
Token ID921
Feature activation+9.35e+00

Pos logit contributions
 Eventually+0.004
 Today+0.004
rell+0.004
lying+0.004
 Later+0.004
Neg logit contributions
 we-0.004
 pige-0.003
 wasting-0.003
 mosquitoes-0.003
ts-0.003
Token the
Token ID262
Feature activation+3.64e+00

Pos logit contributions
 we+0.445
 I+0.394
 you+0.340
 let+0.301
 who+0.296
Neg logit contributions
by-0.654
rell-0.646
 destination-0.624
ench-0.620
lying-0.617
Token plastic
Token ID7309
Feature activation-3.01e-01

Pos logit contributions
 we+0.200
 I+0.178
 you+0.161
 who+0.157
 let+0.152
Neg logit contributions
rell-0.187
by-0.186
 destination-0.182
ench-0.179
lying-0.176
Token ball
Token ID2613
Feature activation+2.84e+00

Pos logit contributions
rell+0.018
by+0.017
lying+0.017
ench+0.017
 destination+0.017
Neg logit contributions
 we-0.015
 I-0.013
 you-0.012
 who-0.011
 let-0.011
Token,
Token ID11
Feature activation-4.34e+00

Pos logit contributions
 we+0.133
 I+0.115
 you+0.106
 who+0.102
 that+0.096
Neg logit contributions
rell-0.177
lying-0.175
by-0.173
ench-0.172
 awoke-0.167
Token never
Token ID1239
Feature activation+5.45e+00

Pos logit contributions
rell+0.244
by+0.236
lying+0.225
 ease+0.225
zie+0.224
Neg logit contributions
 we-0.209
 I-0.191
 you-0.173
 yourselves-0.147
 who-0.141
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.276
 I+0.244
 you+0.223
 who+0.207
 let+0.201
Neg logit contributions
rell-0.318
by-0.309
lying-0.307
ench-0.303
 destination-0.296
Token it
Token ID340
Feature activation+6.21e+00

Pos logit contributions
 we+0.747
 I+0.654
 you+0.590
 who+0.546
 let+0.524
Neg logit contributions
rell-0.971
by-0.954
lying-0.938
ench-0.928
 destination-0.912
Token get
Token ID651
Feature activation-6.76e-01

Pos logit contributions
 we+0.323
 I+0.285
 you+0.266
 let+0.242
 who+0.240
Neg logit contributions
rell-0.356
lying-0.353
by-0.352
ench-0.341
 doorway-0.340
Token in
Token ID287
Feature activation+3.32e+00

Pos logit contributions
rell+0.036
by+0.036
lying+0.035
ench+0.035
 destination+0.034
Neg logit contributions
 we-0.037
 I-0.033
 you-0.030
 who-0.028
 let-0.027
Token the
Token ID262
Feature activation+3.64e+00

Pos logit contributions
 we+0.156
 I+0.137
 you+0.122
 who+0.113
 let+0.109
Neg logit contributions
rell-0.202
by-0.199
lying-0.196
 destination-0.194
ench-0.194
Token way
Token ID835
Feature activation-6.86e+00

Pos logit contributions
 we+0.226
 I+0.209
 you+0.195
 who+0.180
 let+0.175
Neg logit contributions
rell-0.173
lying-0.165
by-0.164
ench-0.158
rolled-0.155
Token you
Token ID345
Feature activation+7.49e+00

Pos logit contributions
 we+0.195
 who+0.130
 wasting+0.115
 let+0.114
 exc+0.109
Neg logit contributions
rell-0.839
by-0.832
ippy-0.821
lying-0.821
day-0.801
Token,
Token ID11
Feature activation-4.33e+00

Pos logit contributions
 we+0.412
 I+0.368
 you+0.340
 who+0.315
 let+0.307
Neg logit contributions
rell-0.412
by-0.398
lying-0.396
ench-0.389
 destination-0.378
Token Amy
Token ID14235
Feature activation+1.55e+00

Pos logit contributions
 destination+0.239
rell+0.231
zie+0.228
by+0.227
 awoke+0.225
Neg logit contributions
 we-0.180
ms-0.156
We-0.153
 yourselves-0.144
 you-0.139
Token,
Token ID11
Feature activation-4.33e+00

Pos logit contributions
 we+0.085
 I+0.076
 you+0.070
 who+0.065
 let+0.063
Neg logit contributions
rell-0.082
by-0.081
lying-0.079
ench-0.078
 destination-0.076
Token for
Token ID329
Feature activation+8.65e-01

Pos logit contributions
 destination+0.243
by+0.243
rell+0.243
 awoke+0.240
zie+0.235
Neg logit contributions
 we-0.165
Bad-0.132
 I-0.130
ms-0.129
 yourselves-0.122
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.041
 I+0.034
 who+0.028
 don+0.028
 you+0.027
Neg logit contributions
rell-0.047
 doorway-0.046
 destination-0.045
 home-0.044
by-0.044
Token me
Token ID502
Feature activation-7.87e+00

Pos logit contributions
 we+0.870
 I+0.781
 you+0.739
 who+0.670
 let+0.664
Neg logit contributions
rell-0.866
lying-0.845
by-0.840
ench-0.818
 destination-0.792
Token play
Token ID711
Feature activation-4.03e-01

Pos logit contributions
rell+0.390
by+0.383
ench+0.380
lying+0.366
 destination+0.364
Neg logit contributions
 we-0.433
 I-0.386
 you-0.347
 who-0.343
Bad-0.335
Token with
Token ID351
Feature activation+1.02e+01

Pos logit contributions
ench+0.024
 destination+0.023
rell+0.023
 home+0.022
date+0.022
Neg logit contributions
 we-0.017
 I-0.014
 wasting-0.012
 let-0.012
 exc-0.012
Token your
Token ID534
Feature activation+4.38e-01

Pos logit contributions
 we+0.382
 I+0.303
 let+0.273
 who+0.272
We+0.266
Neg logit contributions
rell-0.653
by-0.640
 destination-0.630
 ease-0.614
zie-0.608
Token toys
Token ID14958
Feature activation+1.34e+00

Pos logit contributions
 we+0.022
 I+0.020
 you+0.017
 who+0.017
 let+0.017
Neg logit contributions
by-0.024
rell-0.024
 destination-0.024
lying-0.022
ench-0.022
Token for
Token ID329
Feature activation+8.51e-01

Pos logit contributions
 we+0.395
 I+0.352
 you+0.321
 who+0.304
 let+0.295
Neg logit contributions
rell-0.397
by-0.395
lying-0.384
ench-0.379
 destination-0.372
Token soaking
Token ID41714
Feature activation+6.06e-01

Pos logit contributions
 we+0.041
 I+0.035
 you+0.028
We+0.027
 don+0.027
Neg logit contributions
rell-0.047
by-0.045
zie-0.044
 doorway-0.044
ench-0.044
Token the
Token ID262
Feature activation+3.66e+00

Pos logit contributions
 we+0.026
 I+0.022
 you+0.019
 who+0.018
 let+0.018
Neg logit contributions
rell-0.039
by-0.038
ench-0.038
lying-0.037
zie-0.037
Token crack
Token ID8469
Feature activation+2.47e+00

Pos logit contributions
 we+0.206
 I+0.186
 you+0.169
 who+0.161
 let+0.157
Neg logit contributions
rell-0.189
by-0.185
ench-0.180
lying-0.178
 destination-0.176
Token and
Token ID290
Feature activation+4.06e+00

Pos logit contributions
 we+0.125
 I+0.111
 you+0.101
 who+0.094
 let+0.091
Neg logit contributions
rell-0.143
by-0.139
lying-0.138
ench-0.136
 destination-0.133
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.184
 I+0.161
 you+0.143
 who+0.143
 let+0.129
Neg logit contributions
rell-0.226
by-0.223
ench-0.217
 awoke-0.216
 destination-0.214
Token me
Token ID502
Feature activation-7.86e+00

Pos logit contributions
 we+0.777
 I+0.683
 you+0.625
 who+0.575
 let+0.558
Neg logit contributions
rell-0.952
by-0.927
lying-0.920
ench-0.907
 destination-0.887
Token out
Token ID503
Feature activation-9.40e+00

Pos logit contributions
rell+0.435
by+0.427
ench+0.424
lying+0.411
 destination+0.403
Neg logit contributions
 we-0.405
 I-0.360
 you-0.324
 who-0.305
 let-0.293
Token.
Token ID13
Feature activation-6.61e-02

Pos logit contributions
rell+0.558
by+0.542
lying+0.532
ench+0.531
 destination+0.517
Neg logit contributions
 we-0.448
 I-0.394
 you-0.356
 who-0.330
 let-0.321
Token You
Token ID921
Feature activation+9.38e+00

Pos logit contributions
rell+0.004
 Eventually+0.004
lying+0.004
 awoke+0.003
 Today+0.003
Neg logit contributions
 we-0.003
 you-0.002
 who-0.002
 wasting-0.002
 let-0.002
Token are
Token ID389
Feature activation-3.17e+00

Pos logit contributions
 we+0.521
 I+0.467
 you+0.421
 who+0.397
Bad+0.371
Neg logit contributions
rell-0.488
ench-0.475
 awoke-0.463
by-0.463
lying-0.449
Token.
Token ID13
Feature activation-8.53e-02

Pos logit contributions
 we+0.020
 I+0.017
 you+0.015
 let+0.014
 who+0.014
Neg logit contributions
rell-0.025
by-0.024
lying-0.024
ench-0.023
 awoke-0.023
Token "
Token ID366
Feature activation+3.57e+00

Pos logit contributions
rell+0.004
lying+0.004
 Eventually+0.004
zie+0.004
 awoke+0.004
Neg logit contributions
 we-0.005
 I-0.004
 you-0.004
 let-0.003
 who-0.003
TokenThank
Token ID10449
Feature activation+1.10e+01

Pos logit contributions
 we+0.169
 I+0.137
 you+0.128
 that+0.116
 not+0.110
Neg logit contributions
lying-0.214
rell-0.210
by-0.209
 awoke-0.206
ench-0.203
Token you
Token ID345
Feature activation+7.49e+00

Pos logit contributions
 we+0.187
 I+0.109
 let+0.107
 who+0.095
 exc+0.084
Neg logit contributions
rell-0.884
by-0.882
lying-0.855
zie-0.842
ench-0.837
Token for
Token ID329
Feature activation+8.54e-01

Pos logit contributions
 we+0.386
 I+0.342
 you+0.315
 who+0.291
 let+0.282
Neg logit contributions
rell-0.434
by-0.421
lying-0.418
ench-0.412
 destination-0.401
Token letting
Token ID9616
Feature activation+1.59e+01

Pos logit contributions
 we+0.042
 I+0.035
We+0.031
ms+0.028
 you+0.027
Neg logit contributions
rell-0.044
zie-0.043
 doorway-0.042
 home-0.042
 ease-0.041
Token me
Token ID502
Feature activation-7.86e+00

Pos logit contributions
 we+0.915
 I+0.818
 you+0.751
 who+0.715
 let+0.691
Neg logit contributions
rell-0.809
by-0.787
lying-0.771
ench-0.765
 destination-0.747
Token touch
Token ID3638
Feature activation+1.45e+00

Pos logit contributions
rell+0.433
ench+0.429
by+0.424
lying+0.406
 destination+0.400
Neg logit contributions
 we-0.397
 I-0.349
 you-0.311
 who-0.298
Bad-0.290
Token your
Token ID534
Feature activation+4.42e-01

Pos logit contributions
 we+0.053
 I+0.042
 let+0.034
 who+0.034
 spies+0.033
Neg logit contributions
rell-0.099
by-0.095
ench-0.092
 destination-0.092
 awoke-0.091
Token feather
Token ID17972
Feature activation-6.07e-01

Pos logit contributions
 we+0.025
 I+0.023
 you+0.021
 who+0.020
 let+0.019
Neg logit contributions
rell-0.023
by-0.022
lying-0.022
ench-0.022
 destination-0.021
Token,"
Token ID553
Feature activation+2.20e+00

Pos logit contributions
rell+0.041
by+0.040
lying+0.040
ench+0.039
 destination+0.038
Neg logit contributions
 we-0.025
 I-0.022
 you-0.019
 who-0.017
 let-0.017
Token.
Token ID13
Feature activation-9.06e-01

Pos logit contributions
rell+0.592
by+0.591
ench+0.585
lying+0.582
 awoke+0.564
Neg logit contributions
 we-0.416
 I-0.353
 you-0.319
 who-0.315
 let-0.305
Token They
Token ID1119
Feature activation-7.36e-01

Pos logit contributions
rell+0.056
by+0.054
lying+0.054
ench+0.053
 destination+0.052
Neg logit contributions
 we-0.043
 I-0.037
 you-0.034
 who-0.031
 let-0.030
Token take
Token ID1011
Feature activation+4.21e+00

Pos logit contributions
rell+0.040
by+0.039
zie+0.038
lying+0.038
 destination+0.037
Neg logit contributions
 we-0.037
 I-0.033
 you-0.030
 who-0.028
Bad-0.027
Token their
Token ID511
Feature activation-7.30e+00

Pos logit contributions
 we+0.196
 I+0.167
 you+0.158
 who+0.144
 let+0.142
Neg logit contributions
lying-0.262
rell-0.259
ench-0.255
by-0.254
 destination-0.251
Token umb
Token ID20810
Feature activation-7.38e+00

Pos logit contributions
ench+0.380
lying+0.380
rell+0.368
 awoke+0.358
by+0.358
Neg logit contributions
 we-0.435
 I-0.370
 you-0.360
 who-0.329
 let-0.328
Tokenrell
Token ID11252
Feature activation-1.98e+01

Pos logit contributions
by+0.123
 awoke+0.109
ench+0.091
lying+0.087
 doorway+0.081
Neg logit contributions
 we-0.676
 I-0.623
 you-0.612
 that-0.588
 let-0.588
Tokenas
Token ID292
Feature activation+2.47e+00

Pos logit contributions
rell+1.196
by+1.172
ench+1.142
lying+1.141
 destination+1.098
Neg logit contributions
 we-0.960
 I-0.854
 you-0.769
 who-0.708
 let-0.689
Token.
Token ID13
Feature activation-9.67e-01

Pos logit contributions
 we+0.139
 I+0.120
 you+0.115
 who+0.109
 let+0.104
Neg logit contributions
rell-0.132
lying-0.130
by-0.129
ench-0.128
 awoke-0.124
Token
Token ID198
Feature activation-8.94e-01

Pos logit contributions
rell+0.053
lying+0.052
by+0.052
ench+0.051
 destination+0.049
Neg logit contributions
 we-0.051
 I-0.044
 you-0.042
 who-0.039
 let-0.037
Token
Token ID198
Feature activation-9.01e-01

Pos logit contributions
rell+0.048
by+0.047
lying+0.047
ench+0.046
 destination+0.045
Neg logit contributions
 we-0.048
 I-0.043
 you-0.039
 who-0.037
 let-0.035
Token"
Token ID1
Feature activation+4.55e-01

Pos logit contributions
lying+0.055
ench+0.055
by+0.054
rell+0.054
side+0.053
Neg logit contributions
 we-0.038
 I-0.033
 you-0.029
 who-0.027
 don-0.025
Token too
Token ID1165
Feature activation-3.67e+00

Pos logit contributions
rell+0.215
ench+0.205
by+0.199
aced+0.199
 destination+0.195
Neg logit contributions
 we-0.202
 I-0.176
 you-0.163
We-0.136
You-0.136
Token slow
Token ID3105
Feature activation+2.84e+00

Pos logit contributions
by+0.208
ench+0.200
rell+0.200
lying+0.187
aced+0.187
Neg logit contributions
 we-0.176
 I-0.161
 you-0.144
 who-0.130
 let-0.125
Token.
Token ID13
Feature activation-9.22e-01

Pos logit contributions
 we+0.132
 I+0.113
 you+0.106
 who+0.099
 let+0.092
Neg logit contributions
rell-0.180
ench-0.177
lying-0.175
 awoke-0.172
by-0.172
Token The
Token ID383
Feature activation+6.30e-01

Pos logit contributions
lying+0.038
 Eventually+0.038
rell+0.037
ench+0.037
 Today+0.035
Neg logit contributions
 we-0.055
 you-0.049
 I-0.049
 who-0.045
 let-0.044
Token umb
Token ID20810
Feature activation-7.37e+00

Pos logit contributions
 we+0.036
 I+0.034
 you+0.031
 who+0.030
 let+0.028
Neg logit contributions
rell-0.032
 destination-0.029
ench-0.029
by-0.028
 doorway-0.027
Tokenrell
Token ID11252
Feature activation-1.98e+01

Pos logit contributions
by+0.062
lying+0.042
ench+0.039
rell+0.035
 destination+0.030
Neg logit contributions
 we-0.743
 I-0.698
 you-0.675
 who-0.641
 let-0.640
Tokenas
Token ID292
Feature activation+2.49e+00

Pos logit contributions
rell+1.025
by+1.000
ench+0.987
rol+0.962
lying+0.904
Neg logit contributions
 we-1.090
 I-1.016
 you-0.913
 let-0.859
 who-0.857
Token go
Token ID467
Feature activation-6.52e+00

Pos logit contributions
 we+0.127
 I+0.114
 you+0.102
 who+0.097
 let+0.091
Neg logit contributions
rell-0.139
by-0.134
lying-0.131
ench-0.131
 destination-0.130
Token far
Token ID1290
Feature activation+9.99e-01

Pos logit contributions
rell+0.352
by+0.343
lying+0.337
ench+0.332
 destination+0.325
Neg logit contributions
 we-0.357
 I-0.320
 you-0.294
 who-0.276
 let-0.267
Token away
Token ID1497
Feature activation-1.44e+01

Pos logit contributions
 we+0.071
 I+0.065
 you+0.061
 who+0.058
 let+0.057
Neg logit contributions
rell-0.038
by-0.037
lying-0.036
ench-0.035
 destination-0.034
Token.
Token ID13
Feature activation-9.98e-01

Pos logit contributions
by+0.641
 destination+0.610
ench+0.600
rell+0.589
lying+0.581
Neg logit contributions
 we-0.848
 I-0.790
 you-0.726
 who-0.662
 let-0.658
Token try
Token ID1949
Feature activation+7.50e-04

Pos logit contributions
 we+0.143
 I+0.126
 you+0.117
 who+0.106
 let+0.099
Neg logit contributions
rell-0.141
by-0.140
ench-0.137
lying-0.136
 destination-0.135
Token to
Token ID284
Feature activation-4.32e+00

Pos logit contributions
 we+0.000
 I+0.000
 you+0.000
 who+0.000
 let+0.000
Neg logit contributions
rell-0.000
lying-0.000
by-0.000
ench-0.000
 destination-0.000
Token catch
Token ID4929
Feature activation+6.65e+00

Pos logit contributions
rell+0.211
ench+0.206
 destination+0.205
rolled+0.199
 ease+0.198
Neg logit contributions
 we-0.233
 I-0.220
 you-0.192
 who-0.174
You-0.171
Token their
Token ID511
Feature activation-7.20e+00

Pos logit contributions
 we+0.239
 I+0.203
 you+0.157
 who+0.156
 let+0.140
Neg logit contributions
by-0.465
rell-0.464
 destination-0.449
ench-0.448
lying-0.442
Token umb
Token ID20810
Feature activation-7.28e+00

Pos logit contributions
rell+0.273
by+0.262
lying+0.258
ench+0.253
 destination+0.244
Neg logit contributions
 we-0.511
 I-0.469
 you-0.442
 who-0.420
 let-0.411
Tokenrell
Token ID11252
Feature activation-1.97e+01

Pos logit contributions
by+0.100
 awoke+0.079
lying+0.062
 doorway+0.056
ench+0.056
Neg logit contributions
 we-0.705
 I-0.659
 you-0.645
 that-0.610
 let-0.609
Tokenas
Token ID292
Feature activation+2.58e+00

Pos logit contributions
rell+1.157
by+1.128
ench+1.101
lying+1.080
rol+1.050
Neg logit contributions
 we-0.987
 I-0.889
 you-0.795
 who-0.743
 let-0.726
Token.
Token ID13
Feature activation-8.71e-01

Pos logit contributions
 we+0.136
 I+0.120
 you+0.111
 who+0.103
 let+0.100
Neg logit contributions
rell-0.146
by-0.143
lying-0.141
ench-0.139
 destination-0.136
Token But
Token ID887
Feature activation+3.90e+00

Pos logit contributions
lying+0.035
 Eventually+0.035
ench+0.034
rell+0.033
 destination+0.033
Neg logit contributions
 we-0.053
 you-0.047
 I-0.047
 who-0.044
 wasting-0.043
Token they
Token ID484
Feature activation+5.99e+00

Pos logit contributions
 we+0.107
ms+0.097
 I+0.095
 pige+0.094
 who+0.091
Neg logit contributions
 destination-0.248
zie-0.237
by-0.237
rell-0.235
 home-0.234
Token are
Token ID389
Feature activation-4.00e+00

Pos logit contributions
 we+0.298
 I+0.265
 you+0.241
 oh+0.218
 who+0.216
Neg logit contributions
by-0.317
ench-0.313
 destination-0.312
rell-0.311
lying-0.296
TokenExc
Token ID40127
Feature activation-1.80e+00

Pos logit contributions
 we+0.149
 I+0.128
 you+0.114
 who+0.109
 not+0.109
Neg logit contributions
lying-0.155
by-0.153
rell-0.152
 awoke-0.150
ench-0.142
Tokenuse
Token ID1904
Feature activation-3.57e+00

Pos logit contributions
rell+0.062
by+0.058
ench+0.057
lying+0.057
 destination+0.054
Neg logit contributions
 we-0.135
 I-0.124
 you-0.118
 who-0.112
 let-0.110
Token me
Token ID502
Feature activation-8.65e+00

Pos logit contributions
rell+0.202
lying+0.192
ench+0.188
by+0.186
 doorway+0.181
Neg logit contributions
 we-0.193
 I-0.172
 you-0.166
 who-0.149
 let-0.147
Token squ
Token ID2809
Feature activation+1.60e-01

Pos logit contributions
by+0.372
rell+0.367
ench+0.357
lying+0.355
 destination+0.353
Neg logit contributions
 we-0.543
 I-0.484
 you-0.455
 who-0.426
 let-0.416
Tokenir
Token ID343
Feature activation-3.17e+00

Pos logit contributions
 we+0.011
 I+0.010
 you+0.010
 who+0.009
 let+0.009
Neg logit contributions
rell-0.006
 awoke-0.006
 doorway-0.006
side-0.006
 destination-0.006
Tokenrell
Token ID11252
Feature activation-1.97e+01

Pos logit contributions
rell+0.121
by+0.120
lying+0.119
ench+0.116
 destination+0.114
Neg logit contributions
 we-0.219
 I-0.204
 you-0.191
 who-0.181
 let-0.178
Tokeny
Token ID88
Feature activation-9.20e+00

Pos logit contributions
rell+1.139
by+1.128
ench+1.087
lying+1.085
rol+1.047
Neg logit contributions
 we-0.994
 I-0.874
 you-0.799
 who-0.737
 let-0.735
Token,
Token ID11
Feature activation-5.14e+00

Pos logit contributions
rell+0.508
by+0.500
lying+0.488
ench+0.483
 destination+0.472
Neg logit contributions
 we-0.486
 I-0.424
 you-0.395
 who-0.362
 let-0.353
Token I
Token ID314
Feature activation+1.21e+01

Pos logit contributions
rell+0.290
 awoke+0.284
 destination+0.274
zie+0.272
 doorway+0.260
Neg logit contributions
 we-0.177
ms-0.164
 yourselves-0.144
We-0.143
 spies-0.139
Token was
Token ID373
Feature activation-2.71e+00

Pos logit contributions
 we+0.661
 I+0.596
 you+0.549
 who+0.509
 yourselves+0.486
Neg logit contributions
by-0.628
lying-0.608
ench-0.605
rell-0.596
 awoke-0.586
Token wondering
Token ID11263
Feature activation-2.89e-01

Pos logit contributions
rell+0.138
ench+0.135
by+0.130
 destination+0.129
lying+0.127
Neg logit contributions
 we-0.141
 I-0.119
 you-0.106
 let-0.102
ms-0.101
Token them
Token ID606
Feature activation-5.10e+00

Pos logit contributions
 we+0.015
 I+0.014
 you+0.012
 who+0.012
 let+0.012
Neg logit contributions
by-0.012
rell-0.012
 destination-0.011
lying-0.011
ench-0.011
Token with
Token ID351
Feature activation+9.60e+00

Pos logit contributions
rell+0.295
by+0.287
lying+0.285
ench+0.281
 destination+0.273
Neg logit contributions
 we-0.263
 I-0.231
 you-0.214
 who-0.201
 let-0.194
Token towels
Token ID38494
Feature activation-5.70e+00

Pos logit contributions
 we+0.424
 I+0.369
 you+0.326
 who+0.298
 let+0.289
Neg logit contributions
rell-0.619
by-0.604
lying-0.593
ench-0.587
 destination-0.585
Token and
Token ID290
Feature activation+3.43e+00

Pos logit contributions
rell+0.340
by+0.332
ench+0.329
lying+0.328
 destination+0.316
Neg logit contributions
 we-0.283
 I-0.246
 you-0.227
 who-0.214
 let-0.205
Token umb
Token ID20810
Feature activation-7.13e+00

Pos logit contributions
 we+0.196
 I+0.179
 you+0.167
 who+0.154
 let+0.147
Neg logit contributions
rell-0.162
by-0.159
lying-0.150
 awoke-0.149
 doorway-0.149
Tokenrell
Token ID11252
Feature activation-1.96e+01

Pos logit contributions
by+0.115
 awoke+0.098
ench+0.098
 destination+0.093
 doorway+0.088
Neg logit contributions
 we-0.636
 you-0.587
 I-0.587
 that-0.573
 let-0.562
Tokenas
Token ID292
Feature activation+2.71e+00

Pos logit contributions
rell+1.240
lying+1.204
by+1.204
ench+1.184
 destination+1.171
Neg logit contributions
 we-0.892
 I-0.771
 you-0.704
 who-0.648
 let-0.620
Token.
Token ID13
Feature activation-7.31e-01

Pos logit contributions
 we+0.137
 I+0.120
 you+0.111
 who+0.105
 let+0.100
Neg logit contributions
rell-0.159
by-0.155
lying-0.154
ench-0.153
 destination-0.149
Token They
Token ID1119
Feature activation-5.57e-01

Pos logit contributions
rell+0.035
by+0.035
lying+0.034
ench+0.034
 destination+0.032
Neg logit contributions
 we-0.043
 I-0.038
 you-0.036
 who-0.033
 let-0.033
Token hug
Token ID16225
Feature activation-2.42e+00

Pos logit contributions
rell+0.028
 destination+0.028
by+0.027
lying+0.027
ench+0.027
Neg logit contributions
 we-0.030
 I-0.027
 you-0.025
 oh-0.022
Bad-0.022
Token them
Token ID606
Feature activation-5.17e+00

Pos logit contributions
by+0.121
rell+0.113
zie+0.111
lying+0.110
 awoke+0.107
Neg logit contributions
 we-0.135
 I-0.123
 you-0.109
 let-0.098
 who-0.097
Token the
Token ID262
Feature activation+3.12e+00

Pos logit contributions
by+0.287
 destination+0.279
rell+0.278
lying+0.276
ench+0.270
Neg logit contributions
 we-0.232
 I-0.211
 you-0.188
 who-0.177
 let-0.165
Token kids
Token ID3988
Feature activation+4.88e+00

Pos logit contributions
 we+0.130
 I+0.112
 you+0.099
 who+0.092
 let+0.087
Neg logit contributions
rell-0.209
by-0.202
lying-0.198
ench-0.197
 destination-0.197
Token collected
Token ID7723
Feature activation-3.89e+00

Pos logit contributions
 we+0.261
 I+0.225
 who+0.216
 let+0.212
 you+0.209
Neg logit contributions
rell-0.280
lying-0.269
ench-0.265
by-0.261
 destination-0.255
Token their
Token ID511
Feature activation-6.98e+00

Pos logit contributions
rell+0.258
lying+0.254
by+0.253
ench+0.247
 destination+0.242
Neg logit contributions
 we-0.166
 I-0.141
 you-0.128
 who-0.118
 let-0.113
Token umb
Token ID20810
Feature activation-7.07e+00

Pos logit contributions
rell+0.423
lying+0.413
rol+0.402
jo+0.397
zie+0.397
Neg logit contributions
 we-0.354
 I-0.309
 you-0.293
 who-0.266
 let-0.256
Tokenrell
Token ID11252
Feature activation-1.95e+01

Pos logit contributions
by+0.143
 awoke+0.120
lying+0.101
ench+0.101
 destination+0.094
Neg logit contributions
 we-0.617
 I-0.576
 you-0.559
 that-0.543
 let-0.517
Tokenas
Token ID292
Feature activation+2.81e+00

Pos logit contributions
rell+1.225
lying+1.203
by+1.192
 destination+1.173
ench+1.173
Neg logit contributions
 we-0.891
 I-0.768
 you-0.705
 who-0.647
 let-0.615
Token and
Token ID290
Feature activation+3.50e+00

Pos logit contributions
 we+0.119
 I+0.098
 you+0.090
 who+0.087
 let+0.079
Neg logit contributions
rell-0.191
lying-0.186
by-0.186
ench-0.185
 awoke-0.178
Token went
Token ID1816
Feature activation-6.14e+00

Pos logit contributions
 we+0.196
 I+0.180
 you+0.166
 who+0.153
 let+0.144
Neg logit contributions
rell-0.178
by-0.177
lying-0.167
ench-0.166
 destination-0.165
Token out
Token ID503
Feature activation-9.99e+00

Pos logit contributions
lying+0.359
rell+0.358
ench+0.343
by+0.340
 awoke+0.332
Neg logit contributions
 we-0.326
 I-0.281
 you-0.263
 that-0.255
 who-0.252
Token to
Token ID284
Feature activation-4.16e+00

Pos logit contributions
rell+0.638
lying+0.622
ench+0.607
by+0.605
zie+0.578
Neg logit contributions
 we-0.487
 I-0.410
 you-0.382
 who-0.378
 that-0.354
Token.
Token ID13
Feature activation-5.62e-01

Pos logit contributions
by+0.029
rell+0.028
 destination+0.026
ench+0.026
lying+0.025
Neg logit contributions
 we-0.039
 I-0.036
 you-0.033
Bad-0.031
 let-0.031
Token They
Token ID1119
Feature activation-3.74e-01

Pos logit contributions
lying+0.025
 Eventually+0.024
ench+0.024
rell+0.024
by+0.023
Neg logit contributions
 we-0.033
 I-0.029
 you-0.029
 let-0.026
 who-0.026
Token have
Token ID423
Feature activation+6.82e+00

Pos logit contributions
rell+0.022
by+0.022
lying+0.021
ench+0.021
 destination+0.021
Neg logit contributions
 we-0.018
 I-0.015
 you-0.014
 who-0.013
 let-0.012
Token no
Token ID645
Feature activation+5.55e+00

Pos logit contributions
 we+0.272
 I+0.229
 you+0.204
 who+0.182
 let+0.179
Neg logit contributions
rell-0.481
lying-0.472
by-0.465
ench-0.461
zie-0.450
Token umb
Token ID20810
Feature activation-7.01e+00

Pos logit contributions
 we+0.344
 I+0.309
 you+0.290
 who+0.273
 let+0.268
Neg logit contributions
rell-0.260
by-0.253
lying-0.253
ench-0.246
 awoke-0.237
Tokenrell
Token ID11252
Feature activation-1.95e+01

Pos logit contributions
by+0.076
 awoke+0.050
ench+0.045
 destination+0.041
 doorway+0.041
Neg logit contributions
 we-0.690
 I-0.648
 you-0.631
 let-0.602
 who-0.591
Tokenas
Token ID292
Feature activation+2.84e+00

Pos logit contributions
rell+1.074
by+1.057
ench+1.013
rol+0.991
lying+0.988
Neg logit contributions
 we-1.046
 I-0.955
 you-0.855
 who-0.792
 let-0.790
Token.
Token ID13
Feature activation-6.12e-01

Pos logit contributions
 we+0.150
 I+0.134
 you+0.122
 who+0.114
 let+0.111
Neg logit contributions
rell-0.155
by-0.150
lying-0.149
ench-0.146
 destination-0.144
Token
Token ID198
Feature activation-5.21e-01

Pos logit contributions
lying+0.025
 Eventually+0.025
ench+0.024
rell+0.024
by+0.023
Neg logit contributions
 we-0.037
 I-0.032
 you-0.032
 who-0.030
 let-0.030
Token
Token ID198
Feature activation-5.34e-01

Pos logit contributions
by+0.023
rell+0.022
ench+0.022
rolled+0.021
lying+0.021
Neg logit contributions
 we-0.031
 I-0.029
 you-0.026
 let-0.024
 who-0.024
TokenThey
Token ID2990
Feature activation+2.84e+00

Pos logit contributions
by+0.026
side+0.024
ench+0.024
 Eventually+0.024
 Later+0.024
Neg logit contributions
 we-0.025
 I-0.023
 pige-0.020
 letting-0.020
 you-0.020
Token.
Token ID13
Feature activation-5.10e-01

Pos logit contributions
 we+0.451
 I+0.408
 you+0.377
 who+0.346
 let+0.337
Neg logit contributions
by-0.379
rell-0.374
lying-0.362
ench-0.351
zie-0.342
Token We
Token ID775
Feature activation+9.96e+00

Pos logit contributions
 Eventually+0.030
rell+0.028
zie+0.028
 Later+0.027
 Today+0.027
Neg logit contributions
 we-0.020
 pige-0.018
 who-0.017
 spies-0.017
ms-0.017
Token have
Token ID423
Feature activation+6.87e+00

Pos logit contributions
 we+0.468
 I+0.412
 you+0.372
 who+0.343
 let+0.327
Neg logit contributions
rell-0.613
by-0.600
lying-0.589
ench-0.587
 destination-0.573
Token our
Token ID674
Feature activation-2.81e-01

Pos logit contributions
 we+0.399
 I+0.363
 you+0.333
 who+0.318
 let+0.309
Neg logit contributions
rell-0.335
by-0.331
ench-0.319
lying-0.314
 destination-0.313
Token umb
Token ID20810
Feature activation-6.97e+00

Pos logit contributions
by+0.013
rell+0.013
 destination+0.013
ench+0.013
lying+0.013
Neg logit contributions
 we-0.016
 I-0.015
 you-0.014
 who-0.013
 let-0.013
Tokenrell
Token ID11252
Feature activation-1.94e+01

Pos logit contributions
by+0.090
 awoke+0.078
lying+0.062
 doorway+0.057
ench+0.055
Neg logit contributions
 we-0.668
 I-0.620
 you-0.609
 let-0.574
 that-0.571
Tokenas
Token ID292
Feature activation+2.91e+00

Pos logit contributions
by+1.031
rell+1.024
ench+0.994
rol+0.974
lying+0.926
Neg logit contributions
 we-1.058
 I-0.980
 you-0.869
 let-0.824
 who-0.811
Token to
Token ID284
Feature activation-4.03e+00

Pos logit contributions
 we+0.140
 I+0.126
 you+0.112
 let+0.101
 who+0.100
Neg logit contributions
rell-0.170
by-0.164
lying-0.159
ench-0.158
 destination-0.157
Token keep
Token ID1394
Feature activation+2.26e+00

Pos logit contributions
rell+0.206
ench+0.196
 destination+0.194
by+0.190
lying+0.184
Neg logit contributions
 we-0.225
 I-0.202
 you-0.179
 who-0.171
 spies-0.164
Token us
Token ID514
Feature activation-6.96e+00

Pos logit contributions
 we+0.108
 I+0.102
 who+0.083
 let+0.082
 you+0.081
Neg logit contributions
rell-0.130
by-0.117
 destination-0.115
ench-0.114
zie-0.113
Token dry
Token ID5894
Feature activation+1.34e+00

Pos logit contributions
rell+0.363
ench+0.348
by+0.346
 destination+0.336
lying+0.332
Neg logit contributions
 we-0.379
 I-0.351
 you-0.312
 who-0.294
 let-0.284
Token.
Token ID13
Feature activation-4.75e-01

Pos logit contributions
 we+0.345
 I+0.312
 you+0.287
 who+0.258
 let+0.258
Neg logit contributions
rell-0.291
by-0.277
lying-0.273
ench-0.263
 destination-0.262
Token It
Token ID632
Feature activation-3.92e-01

Pos logit contributions
 Eventually+0.017
ench+0.017
rell+0.017
 awoke+0.016
 Today+0.016
Neg logit contributions
 we-0.030
 you-0.026
 I-0.026
 who-0.025
 let-0.025
Token takes
Token ID2753
Feature activation+3.55e+00

Pos logit contributions
rell+0.022
by+0.021
 awoke+0.020
 destination+0.020
ench+0.020
Neg logit contributions
 we-0.019
 I-0.017
 you-0.016
 who-0.015
 oh-0.014
Token their
Token ID511
Feature activation-6.85e+00

Pos logit contributions
 we+0.133
 I+0.112
 you+0.101
 who+0.087
 let+0.086
Neg logit contributions
rell-0.255
lying-0.249
by-0.248
ench-0.246
 destination-0.240
Token umb
Token ID20810
Feature activation-6.94e+00

Pos logit contributions
by+0.459
rell+0.454
ench+0.440
lying+0.437
 destination+0.436
Neg logit contributions
 we-0.275
 I-0.246
 you-0.210
 who-0.199
 let-0.183
Tokenrell
Token ID11252
Feature activation-1.94e+01

Pos logit contributions
by+0.113
 awoke+0.095
 doorway+0.080
ench+0.075
 destination+0.073
Neg logit contributions
 we-0.636
 I-0.594
 you-0.586
 that-0.554
 let-0.553
Tokenas
Token ID292
Feature activation+2.93e+00

Pos logit contributions
rell+1.130
by+1.124
ench+1.088
lying+1.072
rol+1.025
Neg logit contributions
 we-0.968
 I-0.869
 you-0.775
 who-0.724
 let-0.705
Token.
Token ID13
Feature activation-5.32e-01

Pos logit contributions
 we+0.164
 I+0.146
 you+0.137
 who+0.127
 let+0.124
Neg logit contributions
rell-0.156
by-0.151
lying-0.150
ench-0.148
 destination-0.144
Token
Token ID198
Feature activation-4.42e-01

Pos logit contributions
 Eventually+0.020
rell+0.018
ench+0.018
 Today+0.018
lying+0.017
Neg logit contributions
 we-0.033
 you-0.029
 who-0.028
 I-0.028
 oh-0.028
Token
Token ID198
Feature activation-4.46e-01

Pos logit contributions
by+0.018
rell+0.018
ench+0.018
 Eventually+0.017
lying+0.017
Neg logit contributions
 we-0.027
 I-0.024
 you-0.023
 who-0.022
 let-0.022
Token"
Token ID1
Feature activation+9.22e-01

Pos logit contributions
ench+0.020
by+0.020
side+0.019
lying+0.019
 awoke+0.019
Neg logit contributions
 we-0.021
 I-0.019
 oh-0.018
 letting-0.017
 who-0.017
Token some
Token ID617
Feature activation+1.16e+00

Pos logit contributions
 we+0.024
 I+0.022
 you+0.019
 let+0.017
 who+0.016
Neg logit contributions
rell-0.032
by-0.031
 doorway-0.030
 destination-0.030
 awoke-0.030
Token people
Token ID661
Feature activation+4.68e+00

Pos logit contributions
 we+0.058
 I+0.051
 you+0.046
 who+0.042
 let+0.041
Neg logit contributions
rell-0.069
by-0.066
ench-0.065
lying-0.064
 destination-0.064
Token running
Token ID2491
Feature activation+9.00e-01

Pos logit contributions
 we+0.228
 I+0.205
 you+0.186
 who+0.163
 let+0.163
Neg logit contributions
rell-0.276
by-0.267
lying-0.263
ench-0.260
 destination-0.256
Token with
Token ID351
Feature activation+9.81e+00

Pos logit contributions
 we+0.043
 I+0.036
 who+0.034
 you+0.034
 that+0.032
Neg logit contributions
lying-0.055
ench-0.055
rell-0.055
by-0.054
zie-0.052
Token umb
Token ID20810
Feature activation-6.94e+00

Pos logit contributions
 we+0.425
 I+0.365
 you+0.331
 who+0.304
 let+0.291
Neg logit contributions
rell-0.643
by-0.631
lying-0.626
ench-0.618
 destination-0.602
Tokenrell
Token ID11252
Feature activation-1.94e+01

Pos logit contributions
by+0.141
ench+0.123
 awoke+0.114
 destination+0.111
lying+0.093
Neg logit contributions
 we-0.592
 I-0.547
 you-0.536
 that-0.527
 let-0.513
Tokenas
Token ID292
Feature activation+2.93e+00

Pos logit contributions
rell+1.251
lying+1.229
by+1.216
ench+1.200
 destination+1.200
Neg logit contributions
 we-0.850
 I-0.723
 you-0.662
 who-0.614
 let-0.576
Token and
Token ID290
Feature activation+3.61e+00

Pos logit contributions
 we+0.134
 I+0.116
 you+0.105
 who+0.100
 let+0.094
Neg logit contributions
rell-0.185
by-0.182
lying-0.180
ench-0.179
 destination-0.173
Token some
Token ID617
Feature activation+1.10e+00

Pos logit contributions
 we+0.185
 I+0.169
 you+0.157
 who+0.137
 let+0.131
Neg logit contributions
rell-0.196
by-0.185
 awoke-0.181
 doorway-0.178
lying-0.175
Token cars
Token ID5006
Feature activation+1.38e+00

Pos logit contributions
 we+0.056
 I+0.050
 you+0.046
 who+0.042
 let+0.041
Neg logit contributions
rell-0.063
by-0.061
lying-0.061
ench-0.060
 destination-0.059
Token spl
Token ID4328
Feature activation-9.19e-01

Pos logit contributions
 we+0.066
 I+0.057
 you+0.052
 who+0.050
 let+0.047
Neg logit contributions
rell-0.084
by-0.084
lying-0.082
ench-0.082
 destination-0.080
Token became
Token ID2627
Feature activation-1.29e+01

Pos logit contributions
rell+0.283
by+0.280
lying+0.276
ench+0.274
 destination+0.262
Neg logit contributions
 we-0.321
 I-0.285
 you-0.264
 who-0.249
 let-0.245
Token heavy
Token ID4334
Feature activation+1.87e+00

Pos logit contributions
rell+0.785
lying+0.767
by+0.764
ench+0.742
 awoke+0.725
Neg logit contributions
 we-0.624
 I-0.544
 you-0.500
 let-0.467
 who-0.466
Token and
Token ID290
Feature activation+3.66e+00

Pos logit contributions
 we+0.091
 I+0.080
 you+0.073
 who+0.068
 let+0.065
Neg logit contributions
rell-0.113
by-0.110
lying-0.110
ench-0.108
 destination-0.105
Token their
Token ID511
Feature activation-6.85e+00

Pos logit contributions
 we+0.144
 I+0.126
 you+0.113
 who+0.108
 let+0.095
Neg logit contributions
rell-0.234
by-0.231
 destination-0.225
zie-0.223
lying-0.221
Token umb
Token ID20810
Feature activation-6.94e+00

Pos logit contributions
rell+0.373
lying+0.361
by+0.360
ench+0.354
 destination+0.342
Neg logit contributions
 we-0.375
 I-0.332
 you-0.309
 who-0.287
 let-0.279
Tokenrell
Token ID11252
Feature activation-1.94e+01

Pos logit contributions
by+0.120
 awoke+0.096
lying+0.087
 destination+0.085
ench+0.084
Neg logit contributions
 we-0.628
 I-0.588
 you-0.571
 that-0.547
 let-0.546
Tokenas
Token ID292
Feature activation+2.93e+00

Pos logit contributions
rell+1.226
lying+1.216
 awoke+1.190
by+1.190
 destination+1.185
Neg logit contributions
 we-0.870
 I-0.742
 you-0.680
 who-0.628
 let-0.597
Token began
Token ID2540
Feature activation-1.19e+01

Pos logit contributions
 we+0.166
 I+0.147
 you+0.137
 who+0.131
 let+0.128
Neg logit contributions
rell-0.153
lying-0.150
by-0.149
ench-0.147
 destination-0.143
Token to
Token ID284
Feature activation-4.03e+00

Pos logit contributions
rell+0.709
lying+0.688
by+0.681
ench+0.680
zie+0.664
Neg logit contributions
 we-0.611
 I-0.521
 you-0.486
 who-0.461
 let-0.454
Token dro
Token ID3102
Feature activation-6.34e+00

Pos logit contributions
rell+0.228
by+0.220
lying+0.218
ench+0.216
 destination+0.211
Neg logit contributions
 we-0.210
 I-0.187
 you-0.172
 who-0.160
 let-0.153
Tokenop
Token ID404
Feature activation+2.24e-01

Pos logit contributions
by+0.312
rell+0.309
lying+0.301
 destination+0.296
zie+0.293
Neg logit contributions
 we-0.371
 I-0.333
 you-0.307
 who-0.289
 let-0.282
Token too
Token ID1165
Feature activation-3.19e+00

Pos logit contributions
rell+0.189
ench+0.180
by+0.175
aced+0.175
 destination+0.172
Neg logit contributions
 we-0.179
 I-0.156
 you-0.145
We-0.120
You-0.120
Token slow
Token ID3105
Feature activation+3.32e+00

Pos logit contributions
by+0.179
ench+0.173
rell+0.173
lying+0.162
aced+0.161
Neg logit contributions
 we-0.156
 I-0.142
 you-0.128
 who-0.115
 let-0.110
Token.
Token ID13
Feature activation-4.38e-01

Pos logit contributions
 we+0.156
 I+0.134
 you+0.125
 who+0.117
 let+0.110
Neg logit contributions
rell-0.209
ench-0.205
lying-0.202
by-0.200
 awoke-0.199
Token The
Token ID383
Feature activation+1.14e+00

Pos logit contributions
lying+0.018
 Eventually+0.018
ench+0.018
rell+0.018
 destination+0.017
Neg logit contributions
 we-0.026
 you-0.023
 I-0.023
 who-0.021
 let-0.021
Token umb
Token ID20810
Feature activation-6.88e+00

Pos logit contributions
 we+0.067
 I+0.061
 you+0.056
 who+0.055
 let+0.051
Neg logit contributions
rell-0.057
 destination-0.052
ench-0.052
by-0.051
lying-0.049
Tokenrell
Token ID11252
Feature activation-1.93e+01

Pos logit contributions
by+0.055
lying+0.034
ench+0.032
rell+0.026
 awoke+0.024
Neg logit contributions
 we-0.697
 I-0.656
 you-0.634
 who-0.602
 let-0.602
Tokenas
Token ID292
Feature activation+2.99e+00

Pos logit contributions
rell+1.022
by+0.992
ench+0.975
rol+0.951
lying+0.908
Neg logit contributions
 we-1.054
 I-0.978
 you-0.881
 let-0.824
 who-0.823
Token go
Token ID467
Feature activation-6.01e+00

Pos logit contributions
 we+0.153
 I+0.138
 you+0.123
 who+0.117
 let+0.109
Neg logit contributions
rell-0.167
by-0.161
lying-0.158
ench-0.157
 destination-0.156
Token far
Token ID1290
Feature activation+1.49e+00

Pos logit contributions
rell+0.318
by+0.310
lying+0.305
ench+0.300
 destination+0.294
Neg logit contributions
 we-0.336
 I-0.301
 you-0.278
 who-0.260
 let-0.252
Token away
Token ID1497
Feature activation-1.39e+01

Pos logit contributions
 we+0.106
 I+0.097
 you+0.091
 who+0.086
 let+0.085
Neg logit contributions
rell-0.057
by-0.055
lying-0.054
ench-0.052
 destination-0.051
Token.
Token ID13
Feature activation-4.85e-01

Pos logit contributions
by+0.611
 destination+0.588
ench+0.574
rell+0.565
lying+0.556
Neg logit contributions
 we-0.820
 I-0.765
 you-0.703
 who-0.641
 let-0.638
Token try
Token ID1949
Feature activation+4.64e-01

Pos logit contributions
 we+0.168
 I+0.149
 you+0.138
 who+0.125
 let+0.117
Neg logit contributions
rell-0.165
by-0.164
ench-0.159
lying-0.159
 destination-0.157
Token to
Token ID284
Feature activation-3.84e+00

Pos logit contributions
 we+0.025
 I+0.022
 you+0.021
 who+0.020
 let+0.019
Neg logit contributions
rell-0.026
lying-0.025
by-0.025
ench-0.024
 destination-0.024
Token catch
Token ID4929
Feature activation+7.13e+00

Pos logit contributions
rell+0.188
ench+0.183
 destination+0.181
 ease+0.178
rolled+0.178
Neg logit contributions
 we-0.208
 I-0.196
 you-0.171
 greedy-0.154
 who-0.154
Token their
Token ID511
Feature activation-6.74e+00

Pos logit contributions
 we+0.256
 I+0.218
 you+0.168
 who+0.167
 let+0.150
Neg logit contributions
by-0.501
rell-0.498
 destination-0.482
ench-0.481
lying-0.476
Token umb
Token ID20810
Feature activation-6.81e+00

Pos logit contributions
rell+0.255
by+0.246
lying+0.242
ench+0.237
 destination+0.228
Neg logit contributions
 we-0.479
 I-0.438
 you-0.414
 who-0.393
 let-0.385
Tokenrell
Token ID11252
Feature activation-1.93e+01

Pos logit contributions
by+0.097
 awoke+0.079
lying+0.060
 doorway+0.056
ench+0.056
Neg logit contributions
 we-0.655
 I-0.613
 you-0.599
 that-0.567
 let-0.565
Tokenas
Token ID292
Feature activation+3.05e+00

Pos logit contributions
rell+1.142
by+1.112
ench+1.086
lying+1.072
rol+1.033
Neg logit contributions
 we-0.955
 I-0.857
 you-0.769
 who-0.715
 let-0.698
Token.
Token ID13
Feature activation-4.02e-01

Pos logit contributions
 we+0.160
 I+0.141
 you+0.130
 who+0.121
 let+0.117
Neg logit contributions
rell-0.173
by-0.169
lying-0.167
ench-0.165
 destination-0.161
Token But
Token ID887
Feature activation+4.40e+00

Pos logit contributions
lying+0.016
 Eventually+0.016
ench+0.015
rell+0.015
 destination+0.015
Neg logit contributions
 we-0.025
 you-0.022
 I-0.022
 who-0.020
 wasting-0.020
Token they
Token ID484
Feature activation+6.48e+00

Pos logit contributions
 we+0.123
 I+0.108
ms+0.107
 pige+0.105
 who+0.103
Neg logit contributions
 destination-0.280
by-0.267
zie-0.267
rell-0.265
 home-0.263
Token are
Token ID389
Feature activation-3.53e+00

Pos logit contributions
 we+0.326
 I+0.289
 you+0.264
 oh+0.240
 who+0.236
Neg logit contributions
by-0.339
ench-0.334
 destination-0.333
rell-0.333
lying-0.316
Token had
Token ID550
Feature activation+1.65e+00

Pos logit contributions
 we+0.557
 I+0.498
 you+0.459
 who+0.428
 let+0.414
Neg logit contributions
rell-0.546
by-0.532
lying-0.523
ench-0.518
 destination-0.505
Token no
Token ID645
Feature activation+5.83e+00

Pos logit contributions
 we+0.093
 I+0.084
 you+0.077
 who+0.073
 let+0.070
Neg logit contributions
rell-0.085
by-0.083
ench-0.080
lying-0.080
 destination-0.080
Token jackets
Token ID35129
Feature activation-5.38e+00

Pos logit contributions
 we+0.352
 I+0.314
 you+0.299
 let+0.278
 who+0.277
Neg logit contributions
rell-0.284
lying-0.277
by-0.269
ench-0.265
 awoke-0.260
Token or
Token ID393
Feature activation+5.56e+00

Pos logit contributions
rell+0.301
by+0.294
lying+0.290
 destination+0.288
ench+0.285
Neg logit contributions
 we-0.262
 I-0.234
 you-0.215
 who-0.198
 let-0.195
Token umb
Token ID20810
Feature activation-6.74e+00

Pos logit contributions
 we+0.252
 I+0.221
 you+0.202
 who+0.194
You+0.190
Neg logit contributions
rell-0.308
by-0.293
 destination-0.289
 awoke-0.274
ench-0.273
Tokenrell
Token ID11252
Feature activation-1.92e+01

Pos logit contributions
by+0.107
 doorway+0.080
ench+0.079
 awoke+0.078
 destination+0.070
Neg logit contributions
 we-0.616
 I-0.582
 you-0.562
 let-0.539
 that-0.537
Tokenas
Token ID292
Feature activation+3.11e+00

Pos logit contributions
rell+1.162
by+1.141
lying+1.113
ench+1.109
 destination+1.071
Neg logit contributions
 we-0.928
 I-0.821
 you-0.743
 who-0.684
 let-0.667
Token,
Token ID11
Feature activation-4.60e+00

Pos logit contributions
 we+0.138
 I+0.120
 you+0.107
 who+0.100
 let+0.097
Neg logit contributions
rell-0.195
by-0.188
lying-0.187
ench-0.184
 destination-0.182
Token so
Token ID523
Feature activation-3.57e+00

Pos logit contributions
 destination+0.271
rell+0.270
 awoke+0.255
ench+0.251
rol+0.249
Neg logit contributions
 we-0.148
 I-0.128
Bad-0.114
ms-0.113
 cooperative-0.111
Token now
Token ID783
Feature activation+7.49e+00

Pos logit contributions
by+0.248
 destination+0.241
 home+0.225
rolled+0.225
 ease+0.222
Neg logit contributions
ms-0.072
Bad-0.057
You-0.055
 wasting-0.053
ila-0.053
Token we
Token ID356
Feature activation+1.30e+01

Pos logit contributions
ms+0.150
 forgiving+0.147
Bad+0.144
 wasting+0.142
 exc+0.140
Neg logit contributions
 destination-0.524
by-0.507
 awoke-0.497
 home-0.477
rell-0.465
Token.
Token ID13
Feature activation-2.72e-01

Pos logit contributions
rell+0.173
by+0.173
lying+0.172
ench+0.170
 destination+0.165
Neg logit contributions
 we-0.129
 I-0.113
 you-0.101
 who-0.098
 let-0.092
Token They
Token ID1119
Feature activation-8.85e-02

Pos logit contributions
rell+0.015
lying+0.014
by+0.014
ench+0.014
 destination+0.014
Neg logit contributions
 we-0.015
 I-0.013
 you-0.012
 who-0.011
 let-0.011
Token opened
Token ID4721
Feature activation-4.92e+00

Pos logit contributions
rell+0.004
by+0.004
 awoke+0.004
lying+0.004
ench+0.004
Neg logit contributions
 we-0.005
 I-0.005
 you-0.004
 who-0.004
 let-0.004
Token their
Token ID511
Feature activation-6.65e+00

Pos logit contributions
rell+0.371
lying+0.365
ench+0.357
by+0.357
zie+0.343
Neg logit contributions
 we-0.174
 I-0.141
 you-0.125
 who-0.114
 let-0.107
Token umb
Token ID20810
Feature activation-6.73e+00

Pos logit contributions
lying+0.340
rell+0.338
ench+0.320
 awoke+0.315
zie+0.306
Neg logit contributions
 we-0.403
 I-0.345
 you-0.334
 who-0.308
 let-0.298
Tokenrell
Token ID11252
Feature activation-1.92e+01

Pos logit contributions
by+0.157
ench+0.124
 awoke+0.116
lying+0.103
cluded+0.101
Neg logit contributions
 we-0.559
 I-0.520
 that-0.511
 you-0.500
 let-0.481
Tokenas
Token ID292
Feature activation+3.14e+00

Pos logit contributions
rell+1.238
lying+1.220
by+1.198
 awoke+1.186
 destination+1.183
Neg logit contributions
 we-0.843
 I-0.711
 you-0.657
 who-0.610
 let-0.572
Token and
Token ID290
Feature activation+3.83e+00

Pos logit contributions
 we+0.128
 I+0.104
 you+0.095
 who+0.093
 let+0.084
Neg logit contributions
rell-0.218
by-0.215
ench-0.214
lying-0.214
 awoke-0.203
Token ran
Token ID4966
Feature activation-1.62e-01

Pos logit contributions
 we+0.191
 I+0.181
 you+0.168
 yourselves+0.148
Bad+0.144
Neg logit contributions
by-0.203
 awoke-0.192
rell-0.191
rolled-0.184
 doorway-0.182
Token around
Token ID1088
Feature activation-2.54e+00

Pos logit contributions
rell+0.008
lying+0.008
ench+0.008
by+0.008
zie+0.008
Neg logit contributions
 we-0.010
 I-0.009
 you-0.008
 who-0.008
 let-0.008
Token in
Token ID287
Feature activation+3.08e+00

Pos logit contributions
rell+0.164
by+0.163
lying+0.158
ench+0.157
 destination+0.157
Neg logit contributions
 we-0.107
 I-0.094
 you-0.083
 who-0.073
 let-0.072
Token jackets
Token ID35129
Feature activation-5.28e+00

Pos logit contributions
ench+0.336
lying+0.335
by+0.331
 awoke+0.322
rell+0.318
Neg logit contributions
 we-0.370
 I-0.318
 who-0.309
 you-0.300
 let-0.294
Token,
Token ID11
Feature activation-4.47e+00

Pos logit contributions
ench+0.383
rell+0.382
by+0.377
lying+0.376
 awoke+0.369
Neg logit contributions
 we-0.192
 I-0.150
 who-0.146
 that-0.137
 you-0.134
Token grabbed
Token ID13176
Feature activation+2.34e-01

Pos logit contributions
rell+0.255
by+0.245
lying+0.239
zie+0.237
 awoke+0.236
Neg logit contributions
 we-0.220
 I-0.202
 you-0.185
 who-0.150
 yourselves-0.147
Token their
Token ID511
Feature activation-6.58e+00

Pos logit contributions
 we+0.010
 I+0.009
 you+0.008
 who+0.007
 let+0.007
Neg logit contributions
rell-0.015
by-0.015
lying-0.014
ench-0.014
 destination-0.014
Token umb
Token ID20810
Feature activation-6.65e+00

Pos logit contributions
rell+0.332
by+0.326
lying+0.318
 destination+0.312
ench+0.311
Neg logit contributions
 we-0.378
 I-0.343
 you-0.314
 who-0.297
 let-0.289
Tokenrell
Token ID11252
Feature activation-1.91e+01

Pos logit contributions
by+0.122
 awoke+0.106
ench+0.088
 doorway+0.075
lying+0.071
Neg logit contributions
 we-0.604
 I-0.566
 you-0.548
 that-0.530
 let-0.517
Tokenas
Token ID292
Feature activation+3.20e+00

Pos logit contributions
rell+1.211
lying+1.179
by+1.179
ench+1.159
 destination+1.147
Neg logit contributions
 we-0.864
 I-0.746
 you-0.680
 who-0.626
 let-0.598
Token,
Token ID11
Feature activation-4.52e+00

Pos logit contributions
 we+0.135
 I+0.111
 you+0.103
 who+0.099
 let+0.091
Neg logit contributions
rell-0.214
by-0.212
ench-0.211
lying-0.210
 awoke-0.204
Token and
Token ID290
Feature activation+3.90e+00

Pos logit contributions
rell+0.254
lying+0.233
 awoke+0.232
by+0.231
 doorway+0.229
Neg logit contributions
 we-0.224
 I-0.206
 you-0.191
 who-0.151
Bad-0.150
Token slowly
Token ID6364
Feature activation+1.47e-01

Pos logit contributions
 we+0.209
 I+0.192
 you+0.176
 who+0.158
 let+0.148
Neg logit contributions
rell-0.204
by-0.203
lying-0.193
 destination-0.189
 awoke-0.189
Token walked
Token ID6807
Feature activation-5.73e+00

Pos logit contributions
 we+0.008
 I+0.007
 who+0.007
 let+0.007
 you+0.007
Neg logit contributions
ench-0.008
rell-0.008
by-0.008
lying-0.008
zie-0.007
Token.
Token ID13
Feature activation-1.50e-01

Pos logit contributions
by+0.015
rell+0.015
lying+0.014
ench+0.014
 destination+0.014
Neg logit contributions
 we-0.014
 I-0.012
 you-0.011
 let-0.010
 who-0.010
Token He
Token ID679
Feature activation-6.96e+00

Pos logit contributions
by+0.007
rell+0.007
ench+0.006
lying+0.006
 destination+0.006
Neg logit contributions
 we-0.010
 I-0.009
 you-0.008
 who-0.008
 let-0.008
Token had
Token ID550
Feature activation+1.80e+00

Pos logit contributions
rell+0.398
by+0.391
lying+0.385
ench+0.378
 destination+0.372
Neg logit contributions
 we-0.355
 I-0.314
 you-0.288
 who-0.265
 let-0.255
Token no
Token ID645
Feature activation+5.97e+00

Pos logit contributions
 we+0.084
 I+0.073
 you+0.067
 who+0.061
 let+0.060
Neg logit contributions
rell-0.113
lying-0.110
by-0.110
ench-0.109
 destination-0.105
Token umb
Token ID20810
Feature activation-6.59e+00

Pos logit contributions
 we+0.307
 I+0.275
 you+0.255
 who+0.235
 let+0.231
Neg logit contributions
rell-0.341
lying-0.336
by-0.326
ench-0.324
 awoke-0.318
Tokenrell
Token ID11252
Feature activation-1.91e+01

Pos logit contributions
by+0.124
 awoke+0.103
 destination+0.095
 doorway+0.092
ench+0.092
Neg logit contributions
 we-0.579
 I-0.548
 you-0.531
 let-0.511
 that-0.508
Tokenas
Token ID292
Feature activation+3.27e+00

Pos logit contributions
rell+1.182
by+1.155
lying+1.142
ench+1.129
 destination+1.104
Neg logit contributions
 we-0.892
 I-0.780
 you-0.708
 who-0.650
 let-0.628
Token or
Token ID393
Feature activation+5.69e+00

Pos logit contributions
 we+0.165
 I+0.145
 you+0.134
 who+0.124
 let+0.120
Neg logit contributions
rell-0.191
by-0.187
lying-0.185
ench-0.183
 destination-0.178
Token hats
Token ID23910
Feature activation-1.55e+00

Pos logit contributions
 we+0.276
 I+0.242
 you+0.220
 who+0.210
 let+0.200
Neg logit contributions
rell-0.319
by-0.298
 destination-0.285
zie-0.283
 awoke-0.281
Token to
Token ID284
Feature activation-3.67e+00

Pos logit contributions
rell+0.093
by+0.091
ench+0.090
lying+0.089
zie+0.086
Neg logit contributions
 we-0.077
 I-0.068
 you-0.062
 who-0.058
 let-0.056
Token keep
Token ID1394
Feature activation+2.63e+00

Pos logit contributions
rell+0.202
by+0.193
ench+0.187
lying+0.187
 destination+0.185
Neg logit contributions
 we-0.195
 I-0.172
 you-0.155
 who-0.145
 let-0.137
Token.
Token ID13
Feature activation-1.35e-01

Pos logit contributions
 we+0.362
 I+0.327
 you+0.301
 who+0.270
 let+0.269
Neg logit contributions
rell-0.313
by-0.298
lying-0.294
 destination-0.283
ench-0.283
Token It
Token ID632
Feature activation-5.10e-02

Pos logit contributions
 Eventually+0.005
rell+0.005
ench+0.005
lying+0.005
 Today+0.005
Neg logit contributions
 we-0.008
 you-0.007
 I-0.007
 who-0.007
 let-0.007
Token takes
Token ID2753
Feature activation+3.90e+00

Pos logit contributions
rell+0.003
by+0.003
 awoke+0.003
 destination+0.003
ench+0.003
Neg logit contributions
 we-0.002
 I-0.002
 you-0.002
 who-0.002
 oh-0.002
Token their
Token ID511
Feature activation-6.51e+00

Pos logit contributions
 we+0.145
 I+0.122
 you+0.111
 who+0.095
 let+0.094
Neg logit contributions
rell-0.281
lying-0.275
by-0.273
ench-0.271
 destination-0.265
Token umb
Token ID20810
Feature activation-6.58e+00

Pos logit contributions
rell+0.347
by+0.344
lying+0.330
ench+0.329
 destination+0.327
Neg logit contributions
 we-0.354
 I-0.321
 you-0.292
 who-0.278
 let-0.267
Tokenrell
Token ID11252
Feature activation-1.90e+01

Pos logit contributions
by+0.108
 awoke+0.088
 doorway+0.072
ench+0.070
 destination+0.067
Neg logit contributions
 we-0.607
 I-0.568
 you-0.557
 that-0.528
 let-0.526
Tokenas
Token ID292
Feature activation+3.29e+00

Pos logit contributions
rell+1.127
by+1.115
ench+1.081
lying+1.073
 destination+1.027
Neg logit contributions
 we-0.938
 I-0.837
 you-0.750
 who-0.696
 let-0.678
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
 we+0.177
 I+0.157
 you+0.146
 who+0.136
 let+0.132
Neg logit contributions
rell-0.183
by-0.178
lying-0.176
ench-0.174
 destination-0.169
Token
Token ID198
Feature activation-6.11e-02

Pos logit contributions
 Eventually+0.006
rell+0.005
ench+0.005
 Today+0.005
lying+0.005
Neg logit contributions
 we-0.010
 you-0.008
 who-0.008
 I-0.008
 oh-0.008
Token
Token ID198
Feature activation-6.31e-02

Pos logit contributions
by+0.002
rell+0.002
ench+0.002
 Eventually+0.002
lying+0.002
Neg logit contributions
 we-0.004
 I-0.003
 you-0.003
 who-0.003
 let-0.003
Token"
Token ID1
Feature activation+1.27e+00

Pos logit contributions
ench+0.003
by+0.003
side+0.003
lying+0.003
 Eventually+0.003
Neg logit contributions
 we-0.003
 I-0.003
 letting-0.003
 oh-0.003
 maybe-0.003
Token about
Token ID546
Feature activation-9.21e-01

Pos logit contributions
 we+0.231
 I+0.202
 you+0.184
 who+0.170
 let+0.164
Neg logit contributions
rell-0.292
by-0.285
lying-0.282
ench-0.278
 destination-0.273
Token rain
Token ID6290
Feature activation-6.49e+00

Pos logit contributions
by+0.051
rell+0.050
 destination+0.048
 ease+0.048
rolled+0.046
Neg logit contributions
 we-0.043
 I-0.039
 you-0.033
 let-0.030
We-0.029
Tokenbows
Token ID25435
Feature activation+1.98e+00

Pos logit contributions
rell+0.291
by+0.270
lying+0.263
ench+0.257
 destination+0.253
Neg logit contributions
 we-0.420
 I-0.388
 you-0.361
 who-0.334
 let-0.329
Token and
Token ID290
Feature activation+4.02e+00

Pos logit contributions
 we+0.107
 I+0.102
 you+0.091
Bad+0.086
 yourselves+0.082
Neg logit contributions
rell-0.097
by-0.092
lying-0.088
 destination-0.087
 awoke-0.087
Token umb
Token ID20810
Feature activation-6.55e+00

Pos logit contributions
 we+0.207
 I+0.190
 you+0.177
 who+0.155
 don+0.153
Neg logit contributions
rell-0.206
by-0.190
 awoke-0.180
 ease-0.177
zie-0.173
Tokenrell
Token ID11252
Feature activation-1.90e+01

Pos logit contributions
by+0.075
ench+0.067
 destination+0.063
 awoke+0.057
lying+0.054
Neg logit contributions
 we-0.618
 I-0.575
 you-0.563
 let-0.541
 that-0.538
Tokenas
Token ID292
Feature activation+3.31e+00

Pos logit contributions
rell+1.165
by+1.141
lying+1.119
ench+1.110
 destination+1.079
Neg logit contributions
 we-0.907
 I-0.799
 you-0.724
 who-0.663
 let-0.644
Token.
Token ID13
Feature activation-1.30e-01

Pos logit contributions
 we+0.178
 I+0.159
 you+0.146
 who+0.134
 let+0.131
Neg logit contributions
rell-0.181
by-0.175
lying-0.173
ench-0.169
 destination-0.166
Token They
Token ID1119
Feature activation+4.02e-02

Pos logit contributions
 Eventually+0.006
rell+0.006
lying+0.005
ench+0.005
 Later+0.005
Neg logit contributions
 we-0.008
 you-0.007
 I-0.007
 who-0.006
 let-0.006
Token were
Token ID547
Feature activation-5.97e+00

Pos logit contributions
 we+0.002
 I+0.002
 you+0.002
 yourselves+0.002
 spies+0.002
Neg logit contributions
by-0.002
rell-0.002
 awoke-0.002
 ease-0.002
lying-0.002
Token very
Token ID845
Feature activation-9.59e+00

Pos logit contributions
rell+0.307
by+0.287
 destination+0.282
zie+0.280
ench+0.279
Neg logit contributions
 we-0.326
 I-0.293
 you-0.263
 who-0.230
Bad-0.229
Token.
Token ID13
Feature activation-9.86e-02

Pos logit contributions
rell+0.541
by+0.540
lying+0.532
ench+0.529
 awoke+0.510
Neg logit contributions
 we-0.382
 I-0.326
 you-0.295
 who-0.283
 let-0.274
Token They
Token ID1119
Feature activation+7.69e-02

Pos logit contributions
rell+0.005
lying+0.005
by+0.005
ench+0.005
 destination+0.005
Neg logit contributions
 we-0.005
 I-0.004
 you-0.004
 who-0.004
 let-0.004
Token take
Token ID1011
Feature activation+5.04e+00

Pos logit contributions
 we+0.004
 I+0.004
 you+0.003
 who+0.003
 letting+0.003
Neg logit contributions
rell-0.004
by-0.004
lying-0.004
 destination-0.004
zie-0.004
Token their
Token ID511
Feature activation-6.46e+00

Pos logit contributions
 we+0.238
 I+0.204
 you+0.192
 who+0.174
 let+0.172
Neg logit contributions
lying-0.311
rell-0.310
ench-0.304
by-0.303
 destination-0.297
Token umb
Token ID20810
Feature activation-6.55e+00

Pos logit contributions
lying+0.316
rell+0.315
ench+0.314
by+0.304
 awoke+0.297
Neg logit contributions
 we-0.396
 I-0.346
 you-0.331
 who-0.305
 let-0.304
Tokenrell
Token ID11252
Feature activation-1.90e+01

Pos logit contributions
by+0.117
 awoke+0.100
ench+0.084
lying+0.080
 destination+0.074
Neg logit contributions
 we-0.597
 I-0.553
 you-0.540
 that-0.518
 let-0.517
Tokenas
Token ID292
Feature activation+3.33e+00

Pos logit contributions
rell+1.167
by+1.141
lying+1.121
ench+1.114
 destination+1.082
Neg logit contributions
 we-0.900
 I-0.792
 you-0.717
 who-0.658
 let-0.639
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
 we+0.159
 I+0.135
 you+0.126
 who+0.118
 let+0.112
Neg logit contributions
rell-0.206
by-0.202
lying-0.202
ench-0.200
 awoke-0.194
Token
Token ID198
Feature activation-2.03e-02

Pos logit contributions
rell+0.006
lying+0.006
by+0.006
ench+0.006
 Eventually+0.006
Neg logit contributions
 we-0.007
 I-0.006
 you-0.006
 who-0.005
 let-0.005
Token
Token ID198
Feature activation-3.77e-02

Pos logit contributions
rell+0.001
by+0.001
lying+0.001
ench+0.001
 destination+0.001
Neg logit contributions
 we-0.001
 I-0.001
 you-0.001
 who-0.001
 let-0.001
Token"
Token ID1
Feature activation+1.33e+00

Pos logit contributions
lying+0.002
by+0.002
ench+0.002
rell+0.002
side+0.002
Neg logit contributions
 we-0.002
 I-0.001
 you-0.001
 who-0.001
 that-0.001
Token!
Token ID0
Feature activation-4.71e-01

Pos logit contributions
by+0.297
rell+0.284
lying+0.274
 destination+0.273
ench+0.268
Neg logit contributions
 we-0.257
 I-0.226
 you-0.201
 let-0.198
 who-0.192
Token Can
Token ID1680
Feature activation+1.11e+01

Pos logit contributions
 awoke+0.026
 Eventually+0.026
rell+0.026
lying+0.026
 Today+0.026
Neg logit contributions
 we-0.018
 not-0.016
 wasting-0.016
 you-0.015
 annoying-0.015
Token I
Token ID314
Feature activation+1.29e+01

Pos logit contributions
 chefs+0.157
 cooperative+0.157
 wasting+0.148
ms+0.133
 spies+0.131
Neg logit contributions
rell-0.822
rolled-0.792
 His-0.785
by-0.780
zie-0.779
Token play
Token ID711
Feature activation-4.15e-01

Pos logit contributions
 we+0.691
 I+0.609
 you+0.574
 who+0.548
 wasting+0.523
Neg logit contributions
rell-0.656
by-0.648
 destination-0.627
ench-0.621
lying-0.616
Token with
Token ID351
Feature activation+1.02e+01

Pos logit contributions
date+0.022
ench+0.022
 destination+0.022
 home+0.022
 awoke+0.022
Neg logit contributions
 we-0.017
 I-0.015
 wasting-0.014
 exc-0.013
keys-0.013
Token you
Token ID345
Feature activation+7.47e+00

Pos logit contributions
 we+0.246
 I+0.176
 exc+0.175
 let+0.163
 don+0.150
Neg logit contributions
by-0.752
rell-0.749
 destination-0.711
 home-0.703
 awoke-0.700
Token?"
Token ID1701
Feature activation+6.90e+00

Pos logit contributions
 we+0.272
 I+0.226
 you+0.196
 who+0.173
 let+0.166
Neg logit contributions
rell-0.524
by-0.521
ench-0.512
 destination-0.511
lying-0.509
Token Tim
Token ID5045
Feature activation+3.42e+00

Pos logit contributions
 we+0.292
 I+0.244
 you+0.221
 who+0.208
 let+0.202
Neg logit contributions
by-0.424
rell-0.423
lying-0.419
 awoke-0.402
zie-0.398
Token asked
Token ID1965
Feature activation+2.63e+00

Pos logit contributions
 we+0.153
 I+0.143
 let+0.133
 who+0.131
 you+0.128
Neg logit contributions
rell-0.215
ench-0.213
rol-0.205
 destination-0.204
side-0.200
Token.
Token ID13
Feature activation-6.08e-02

Pos logit contributions
 we+0.115
 I+0.103
 you+0.094
 who+0.091
 let+0.087
Neg logit contributions
rell-0.176
ench-0.170
 destination-0.167
lying-0.165
by-0.162
Token The
Token ID383
Feature activation+1.50e+00

Pos logit contributions
rell+0.003
by+0.003
lying+0.003
ench+0.003
 destination+0.003
Neg logit contributions
 we-0.003
 I-0.003
 you-0.003
 who-0.003
 let-0.002
Token never
Token ID1239
Feature activation+5.06e+00

Pos logit contributions
 we+0.190
 I+0.168
 you+0.157
 who+0.142
 yourselves+0.134
Neg logit contributions
by-0.194
rell-0.184
 awoke-0.180
ench-0.177
lying-0.176
Token scared
Token ID12008
Feature activation+3.27e+00

Pos logit contributions
 we+0.253
 I+0.226
 you+0.203
 let+0.200
 who+0.194
Neg logit contributions
rell-0.304
lying-0.295
ench-0.293
 destination-0.290
by-0.284
Token.
Token ID13
Feature activation-4.75e-01

Pos logit contributions
 we+0.186
 I+0.165
 you+0.152
 who+0.144
 let+0.140
Neg logit contributions
rell-0.166
by-0.163
lying-0.158
ench-0.156
 destination-0.152
Token He
Token ID679
Feature activation-7.28e+00

Pos logit contributions
rell+0.020
by+0.020
lying+0.020
ench+0.020
 destination+0.019
Neg logit contributions
 we-0.032
 I-0.030
 you-0.027
 who-0.025
 let-0.025
Token remembered
Token ID12086
Feature activation-1.46e+00

Pos logit contributions
rell+0.441
lying+0.426
by+0.426
ench+0.421
 destination+0.410
Neg logit contributions
 we-0.354
 I-0.311
 you-0.283
 who-0.266
 let-0.259
Token that
Token ID326
Feature activation+1.16e+01

Pos logit contributions
by+0.098
rell+0.096
ench+0.094
lying+0.093
 destination+0.093
Neg logit contributions
 we-0.059
 I-0.050
 you-0.044
 who-0.038
 let-0.037
Token the
Token ID262
Feature activation+3.20e+00

Pos logit contributions
 we+0.554
 I+0.484
 you+0.443
 who+0.411
 let+0.391
Neg logit contributions
by-0.631
rell-0.616
 awoke-0.612
 destination-0.595
ench-0.593
Token rat
Token ID4227
Feature activation+3.12e+00

Pos logit contributions
 we+0.170
 I+0.146
 you+0.131
 let+0.125
 who+0.120
Neg logit contributions
 doorway-0.172
by-0.167
 destination-0.162
ench-0.161
rell-0.160
Token had
Token ID550
Feature activation+1.44e+00

Pos logit contributions
 we+0.169
 I+0.152
 you+0.139
 who+0.126
 let+0.125
Neg logit contributions
by-0.164
rell-0.163
lying-0.157
ench-0.156
 awoke-0.154
Token filled
Token ID5901
Feature activation+9.36e-01

Pos logit contributions
 we+0.075
 I+0.066
 you+0.062
 who+0.057
 let+0.056
Neg logit contributions
rell-0.083
lying-0.081
by-0.080
ench-0.079
 destination-0.077
Token the
Token ID262
Feature activation+3.17e+00

Pos logit contributions
 we+0.048
 I+0.043
 you+0.035
 let+0.034
 who+0.033
Neg logit contributions
by-0.053
ench-0.050
rell-0.049
lying-0.048
 destination-0.048
Token wants
Token ID3382
Feature activation+8.74e+00

Pos logit contributions
 we+0.150
 I+0.120
 you+0.112
 who+0.096
 let+0.093
Neg logit contributions
by-0.206
rell-0.205
lying-0.198
 awoke-0.196
ench-0.189
Token to
Token ID284
Feature activation-3.74e+00

Pos logit contributions
 we+0.498
 I+0.446
 who+0.401
 you+0.396
 let+0.385
Neg logit contributions
by-0.422
rell-0.408
lying-0.385
 destination-0.378
ench-0.371
Token be
Token ID307
Feature activation-3.97e+00

Pos logit contributions
rell+0.190
by+0.189
ench+0.180
lying+0.178
 destination+0.178
Neg logit contributions
 we-0.208
 I-0.184
 you-0.166
 who-0.164
We-0.148
Token our
Token ID674
Feature activation-2.50e-02

Pos logit contributions
by+0.230
rell+0.216
ench+0.209
lying+0.204
 home+0.204
Neg logit contributions
 we-0.189
 I-0.164
 you-0.136
 don-0.125
 let-0.123
Token friend
Token ID1545
Feature activation-7.36e-02

Pos logit contributions
by+0.001
 destination+0.001
rell+0.001
ench+0.001
lying+0.001
Neg logit contributions
 we-0.001
 I-0.001
 you-0.001
 who-0.001
 let-0.001
Token."
Token ID526
Feature activation+8.21e+00

Pos logit contributions
rell+0.004
by+0.004
lying+0.004
ench+0.004
 destination+0.004
Neg logit contributions
 we-0.003
 I-0.003
 you-0.002
 let-0.002
 wasting-0.002
Token
Token ID198
Feature activation-1.85e-01

Pos logit contributions
 we+0.457
 I+0.402
 you+0.374
 let+0.355
 who+0.351
Neg logit contributions
rell-0.410
lying-0.404
by-0.401
ench-0.388
 destination-0.376
Token
Token ID198
Feature activation-1.95e-01

Pos logit contributions
rell+0.009
by+0.009
lying+0.009
ench+0.009
 destination+0.008
Neg logit contributions
 we-0.011
 I-0.009
 you-0.009
 let-0.008
 who-0.008
Token"
Token ID1
Feature activation+1.16e+00

Pos logit contributions
by+0.012
lying+0.012
ench+0.011
rell+0.011
side+0.011
Neg logit contributions
 we-0.008
 I-0.007
 you-0.006
 let-0.006
 don-0.005
TokenLet
Token ID5756
Feature activation+9.85e+00

Pos logit contributions
 we+0.058
 I+0.050
 you+0.047
 wasting+0.044
 that+0.043
Neg logit contributions
lying-0.057
side-0.054
by-0.052
 awoke-0.049
rell-0.048
Token's
Token ID338
Feature activation-1.06e+01

Pos logit contributions
 we+0.580
 I+0.513
 you+0.510
 let+0.452
 that+0.423
Neg logit contributions
rell-0.528
rol-0.517
lying-0.511
 doorway-0.511
rolled-0.507
Token their
Token ID511
Feature activation-6.43e+00

Pos logit contributions
 we+0.484
 I+0.438
 you+0.379
 let+0.351
 who+0.339
Neg logit contributions
by-0.612
rell-0.599
 destination-0.574
ench-0.572
lying-0.563
Token toys
Token ID14958
Feature activation+1.35e+00

Pos logit contributions
by+0.375
 destination+0.360
rell+0.359
 doorway+0.341
lying+0.340
Neg logit contributions
 we-0.309
 I-0.284
 you-0.246
 let-0.235
 who-0.231
Token.
Token ID13
Feature activation-6.20e-02

Pos logit contributions
 we+0.072
 I+0.065
 you+0.059
 who+0.055
 let+0.054
Neg logit contributions
rell-0.073
by-0.071
lying-0.070
ench-0.069
 destination-0.068
Token<|endoftext|>
Token ID50256
Feature activation-7.10e+00

Pos logit contributions
rell+0.003
lying+0.003
 Eventually+0.003
by+0.003
ench+0.003
Neg logit contributions
 we-0.004
 I-0.003
 you-0.003
 let-0.003
 who-0.003
TokenTom
Token ID13787
Feature activation+7.64e+00

Pos logit contributions
rell+0.385
by+0.377
lying+0.374
ench+0.367
 destination+0.364
Neg logit contributions
 we-0.391
 I-0.356
 you-0.329
 who-0.305
 let-0.291
Token likes
Token ID7832
Feature activation+1.06e+01

Pos logit contributions
 we+0.508
 I+0.486
 who+0.456
 you+0.456
 let+0.447
Neg logit contributions
rell-0.307
ench-0.305
rol-0.289
 destination-0.288
lying-0.288
Token to
Token ID284
Feature activation-3.55e+00

Pos logit contributions
 we+0.649
 I+0.588
 you+0.538
 who+0.522
 let+0.516
Neg logit contributions
by-0.471
rell-0.467
lying-0.426
ench-0.419
 destination-0.413
Token watch
Token ID2342
Feature activation+5.89e+00

Pos logit contributions
rell+0.186
by+0.179
lying+0.175
ench+0.170
zie+0.168
Neg logit contributions
 we-0.192
 I-0.168
 you-0.157
 who-0.150
Bad-0.141
Token videos
Token ID5861
Feature activation+7.27e+00

Pos logit contributions
 we+0.307
 I+0.271
 you+0.243
 who+0.227
 let+0.223
Neg logit contributions
rell-0.330
by-0.318
lying-0.311
ench-0.309
 destination-0.306
Token on
Token ID319
Feature activation+6.51e+00

Pos logit contributions
 we+0.386
 I+0.342
 you+0.307
 let+0.286
 who+0.283
Neg logit contributions
rell-0.394
lying-0.369
by-0.360
ench-0.355
 awoke-0.351
Token his
Token ID465
Feature activation-1.10e+01

Pos logit contributions
 we+0.347
 I+0.302
 you+0.266
 who+0.259
 let+0.255
Neg logit contributions
rell-0.347
by-0.328
lying-0.320
 destination-0.317
ench-0.308
Token there
Token ID612
Feature activation+7.46e+00

Pos logit contributions
rell+0.291
lying+0.283
by+0.282
ench+0.281
 destination+0.273
Neg logit contributions
 we-0.307
 I-0.275
 you-0.254
 who-0.234
 let-0.225
Token was
Token ID373
Feature activation-2.13e+00

Pos logit contributions
 we+0.342
 I+0.304
 you+0.272
 who+0.256
 let+0.252
Neg logit contributions
rell-0.486
lying-0.475
ench-0.464
by-0.461
 destination-0.452
Token a
Token ID257
Feature activation+7.01e-01

Pos logit contributions
rell+0.150
by+0.147
ench+0.144
lying+0.144
 awoke+0.139
Neg logit contributions
 we-0.081
 I-0.068
 you-0.059
 who-0.052
 let-0.051
Token boy
Token ID2933
Feature activation+4.38e+00

Pos logit contributions
 we+0.036
 I+0.033
 yourselves+0.032
 you+0.032
ms+0.031
Neg logit contributions
by-0.036
ench-0.031
 doorway-0.030
 den-0.030
 awoke-0.029
Token called
Token ID1444
Feature activation+4.53e+00

Pos logit contributions
 we+0.214
 I+0.188
 you+0.167
 don+0.144
 let+0.141
Neg logit contributions
by-0.254
rell-0.238
lying-0.231
ench-0.228
jo-0.226
Token Tom
Token ID4186
Feature activation+7.54e+00

Pos logit contributions
 we+0.197
 I+0.163
 you+0.152
 who+0.143
 let+0.135
Neg logit contributions
rell-0.286
by-0.275
lying-0.271
zie-0.265
ench-0.260
Token.
Token ID13
Feature activation-3.46e-01

Pos logit contributions
 we+0.351
 I+0.310
 who+0.293
 you+0.284
 let+0.265
Neg logit contributions
rell-0.482
lying-0.464
ench-0.462
 destination-0.459
by-0.447
Token Tom
Token ID4186
Feature activation+7.53e+00

Pos logit contributions
by+0.031
ench+0.031
rell+0.030
lying+0.030
 destination+0.030
Neg logit contributions
 we-0.007
 I-0.005
 you-0.003
 let-0.003
 who-0.002
Token liked
Token ID8288
Feature activation+5.87e+00

Pos logit contributions
 we+0.367
 I+0.323
 you+0.294
 who+0.272
 let+0.263
Neg logit contributions
rell-0.454
by-0.442
lying-0.438
ench-0.433
 destination-0.424
Token to
Token ID284
Feature activation-3.85e+00

Pos logit contributions
 we+0.357
 I+0.323
 you+0.298
 who+0.282
 let+0.278
Neg logit contributions
rell-0.271
by-0.266
lying-0.251
ench-0.249
 destination-0.247
Token play
Token ID711
Feature activation-7.17e-01

Pos logit contributions
rell+0.212
by+0.206
lying+0.204
ench+0.201
 destination+0.196
Neg logit contributions
 we-0.208
 I-0.185
 you-0.171
 who-0.159
 let-0.155
Token stack
Token ID8931
Feature activation+3.45e+00

Pos logit contributions
 we+0.179
 I+0.162
 you+0.149
 who+0.139
Bad+0.130
Neg logit contributions
rell-0.191
by-0.188
 awoke-0.177
lying-0.176
zie-0.172
Token them
Token ID606
Feature activation-5.01e+00

Pos logit contributions
 we+0.180
 I+0.160
 you+0.147
 who+0.136
 let+0.132
Neg logit contributions
rell-0.195
by-0.190
lying-0.187
ench-0.184
 destination-0.180
Token up
Token ID510
Feature activation-6.40e+00

Pos logit contributions
lying+0.311
ench+0.310
rell+0.308
by+0.304
 awoke+0.302
Neg logit contributions
 we-0.238
 I-0.198
 you-0.184
 let-0.177
 who-0.177
Token.
Token ID13
Feature activation-6.12e-01

Pos logit contributions
rell+0.393
by+0.380
lying+0.379
ench+0.374
 destination+0.364
Neg logit contributions
 we-0.306
 I-0.267
 you-0.245
 who-0.228
 let-0.220
Token They
Token ID1119
Feature activation-4.32e-01

Pos logit contributions
rell+0.033
lying+0.031
by+0.031
ench+0.030
 destination+0.030
Neg logit contributions
 we-0.033
 I-0.030
 you-0.028
 who-0.026
 let-0.025
Token find
Token ID1064
Feature activation+8.16e-02

Pos logit contributions
rell+0.025
by+0.024
lying+0.023
 destination+0.023
 awoke+0.023
Neg logit contributions
 we-0.021
 I-0.018
 you-0.017
 who-0.015
Bad-0.014
Token a
Token ID257
Feature activation+4.15e-01

Pos logit contributions
 we+0.003
 I+0.003
 you+0.002
 who+0.002
 let+0.002
Neg logit contributions
rell-0.006
lying-0.006
by-0.006
ench-0.006
 destination-0.006
Token hat
Token ID6877
Feature activation-1.74e+00

Pos logit contributions
 we+0.021
 I+0.019
 you+0.017
 who+0.016
 let+0.016
Neg logit contributions
rell-0.025
lying-0.024
by-0.023
ench-0.023
zie-0.022
Token,
Token ID11
Feature activation-4.91e+00

Pos logit contributions
rell+0.105
lying+0.100
by+0.100
 destination+0.098
ench+0.098
Neg logit contributions
 we-0.082
 I-0.071
 you-0.066
 who-0.059
 let-0.057
Token a
Token ID257
Feature activation+3.72e-01

Pos logit contributions
rell+0.250
 awoke+0.227
 ease+0.222
 trek+0.219
zie+0.217
Neg logit contributions
 we-0.248
 I-0.212
 you-0.206
ms-0.188
Bad-0.180
Token scarf
Token ID36953
Feature activation-2.86e+00

Pos logit contributions
 we+0.020
 I+0.018
 you+0.017
 who+0.016
 let+0.015
Neg logit contributions
rell-0.020
by-0.019
lying-0.019
ench-0.019
 destination-0.018
Token Lightning
Token ID12469
Feature activation-2.69e+00

Pos logit contributions
rell+0.027
 awoke+0.027
rolled+0.026
 Eventually+0.025
wood+0.025
Neg logit contributions
 we-0.026
 you-0.023
 who-0.022
ms-0.022
 wasting-0.021
Token!"
Token ID2474
Feature activation+5.72e+00

Pos logit contributions
rell+0.163
 awoke+0.146
lying+0.142
ench+0.142
 destination+0.141
Neg logit contributions
 we-0.128
 I-0.112
 you-0.104
 who-0.103
 spies-0.097
Token
Token ID198
Feature activation-5.30e-02

Pos logit contributions
 we+0.285
 I+0.252
 you+0.224
 who+0.215
 let+0.205
Neg logit contributions
rell-0.353
by-0.347
 destination-0.343
ench-0.340
lying-0.338
Token
Token ID198
Feature activation-5.06e-02

Pos logit contributions
rell+0.003
lying+0.003
by+0.003
ench+0.003
 destination+0.003
Neg logit contributions
 we-0.003
 I-0.002
 you-0.002
 who-0.002
 let-0.002
TokenHer
Token ID9360
Feature activation-8.29e+00

Pos logit contributions
by+0.002
lying+0.002
rell+0.002
ench+0.002
rolled+0.002
Neg logit contributions
 we-0.003
 I-0.003
 you-0.003
 who-0.003
 let-0.002
Token mom
Token ID1995
Feature activation+5.22e+00

Pos logit contributions
by+0.564
rell+0.561
lying+0.542
 destination+0.542
ench+0.538
Neg logit contributions
 we-0.334
 I-0.290
 you-0.259
 who-0.227
 let-0.216
Token held
Token ID2714
Feature activation+3.71e+00

Pos logit contributions
 we+0.290
 I+0.260
 you+0.241
 who+0.220
 let+0.213
Neg logit contributions
rell-0.278
by-0.272
lying-0.267
ench-0.261
 destination-0.257
Token her
Token ID607
Feature activation-8.56e+00

Pos logit contributions
 we+0.217
 I+0.196
 you+0.184
 who+0.174
 let+0.167
Neg logit contributions
rell-0.189
by-0.182
lying-0.180
ench-0.179
 destination-0.174
Token hand
Token ID1021
Feature activation-5.97e-01

Pos logit contributions
rell+0.425
by+0.415
lying+0.408
ench+0.403
 destination+0.393
Neg logit contributions
 we-0.508
 I-0.458
 you-0.425
 who-0.397
 let-0.388
Token tightly
Token ID17707
Feature activation-1.69e+00

Pos logit contributions
rell+0.044
lying+0.042
ench+0.042
by+0.041
zie+0.041
Neg logit contributions
 we-0.022
 I-0.018
 who-0.018
 you-0.016
 let-0.016
Token and
Token ID290
Feature activation+3.96e+00

Pos logit contributions
rell+0.124
by+0.120
lying+0.119
ench+0.118
zie+0.116
Neg logit contributions
 we-0.063
 I-0.052
 who-0.048
 you-0.046
 that-0.043
Token,
Token ID11
Feature activation-4.36e+00

Pos logit contributions
 we+0.414
 I+0.380
 who+0.363
You+0.362
 you+0.361
Neg logit contributions
 destination-0.228
by-0.225
rell-0.216
lying-0.214
ench-0.202
Token mom
Token ID1995
Feature activation+5.29e+00

Pos logit contributions
 destination+0.180
zie+0.166
 home+0.160
 Today+0.147
 back+0.146
Neg logit contributions
ms-0.188
we-0.183
Bad-0.182
 talented-0.182
We-0.180
Token,"
Token ID553
Feature activation+2.16e+00

Pos logit contributions
 we+0.299
 I+0.269
 you+0.246
 who+0.230
Bad+0.226
Neg logit contributions
by-0.266
rell-0.260
lying-0.250
 destination-0.239
zie-0.238
Token they
Token ID484
Feature activation+6.80e+00

Pos logit contributions
 we+0.086
 I+0.075
keys+0.072
 talented+0.069
 cooperative+0.066
Neg logit contributions
lying-0.110
rell-0.101
by-0.101
 His-0.098
 Eventually-0.096
Token said
Token ID531
Feature activation+1.94e+00

Pos logit contributions
 we+0.261
 I+0.219
 you+0.196
 who+0.179
 let+0.176
Neg logit contributions
rell-0.483
by-0.472
ench-0.467
lying-0.466
 destination-0.458
Token.
Token ID13
Feature activation-9.86e-02

Pos logit contributions
 we+0.106
 I+0.096
 you+0.088
 who+0.081
 let+0.079
Neg logit contributions
rell-0.102
by-0.100
lying-0.099
ench-0.097
 destination-0.094
Token "
Token ID366
Feature activation+3.53e+00

Pos logit contributions
lying+0.005
rell+0.005
 Eventually+0.005
by+0.005
 doorway+0.005
Neg logit contributions
 we-0.005
 I-0.004
 you-0.004
 let-0.004
 who-0.004
TokenWe
Token ID1135
Feature activation+1.13e+01

Pos logit contributions
 we+0.143
 I+0.129
 you+0.104
 that+0.100
 dear+0.092
Neg logit contributions
side-0.223
 Eventually-0.201
lying-0.196
by-0.196
 Along-0.194
Token will
Token ID481
Feature activation+1.20e+00

Pos logit contributions
 we+0.533
 I+0.467
 you+0.422
 who+0.386
 let+0.366
Neg logit contributions
rell-0.693
by-0.685
lying-0.672
ench-0.666
 destination-0.654
Token not
Token ID407
Feature activation+8.44e+00

Pos logit contributions
 we+0.060
 I+0.052
 you+0.047
 who+0.044
 pige+0.044
Neg logit contributions
 destination-0.065
by-0.063
rell-0.063
ench-0.063
lying-0.062
Token go
Token ID467
Feature activation-5.67e+00

Pos logit contributions
 we+0.428
 I+0.375
 you+0.334
 who+0.312
Bad+0.302
Neg logit contributions
by-0.468
ench-0.453
lying-0.450
rell-0.450
 destination-0.435
Token come
Token ID1282
Feature activation-4.35e+00

Pos logit contributions
rell+0.241
ench+0.237
 destination+0.237
by+0.229
 ease+0.225
Neg logit contributions
 we-0.244
 I-0.211
 pige-0.202
 who-0.192
 you-0.192
Token help
Token ID1037
Feature activation+4.02e+00

Pos logit contributions
by+0.128
rell+0.123
 destination+0.120
ench+0.120
lying+0.112
Neg logit contributions
 we-0.328
 I-0.304
 you-0.290
 who-0.274
 let-0.267
Token you
Token ID345
Feature activation+6.96e+00

Pos logit contributions
 we+0.088
 I+0.061
 pige+0.061
Bad+0.059
 greedy+0.052
Neg logit contributions
by-0.296
rell-0.296
ench-0.283
 home-0.279
zie-0.278
Token."
Token ID526
Feature activation+7.91e+00

Pos logit contributions
 we+0.340
 I+0.299
 you+0.272
 who+0.250
 let+0.239
Neg logit contributions
rell-0.405
by-0.397
ench-0.396
lying-0.384
 destination-0.382
Token
Token ID198
Feature activation-4.97e-01

Pos logit contributions
 we+0.376
 I+0.331
 you+0.297
 who+0.280
 let+0.262
Neg logit contributions
by-0.501
rell-0.499
 destination-0.491
lying-0.486
ench-0.483
Token
Token ID198
Feature activation-5.10e-01

Pos logit contributions
rell+0.030
by+0.029
lying+0.029
ench+0.028
 destination+0.028
Neg logit contributions
 we-0.025
 I-0.022
 you-0.020
 who-0.019
 let-0.018
TokenHis
Token ID6653
Feature activation-1.14e+01

Pos logit contributions
rell+0.034
ench+0.033
lying+0.032
by+0.032
zie+0.031
Neg logit contributions
 we-0.020
 I-0.018
 you-0.015
 who-0.013
 let-0.013
Token mom
Token ID1995
Feature activation+4.76e+00

Pos logit contributions
rell+0.838
by+0.821
lying+0.815
ench+0.806
 destination+0.792
Neg logit contributions
 we-0.415
 I-0.347
 you-0.304
 who-0.269
 let-0.256
Token came
Token ID1625
Feature activation-5.00e+00

Pos logit contributions
 we+0.268
 I+0.240
 you+0.222
 who+0.203
 let+0.196
Neg logit contributions
rell-0.250
by-0.246
lying-0.241
ench-0.237
 destination-0.231
Token over
Token ID625
Feature activation-4.62e+00

Pos logit contributions
rell+0.304
lying+0.293
ench+0.282
by+0.278
 destination+0.274
Neg logit contributions
 we-0.253
 I-0.220
 who-0.217
 you-0.211
 let-0.202
Token and
Token ID290
Feature activation+3.51e+00

Pos logit contributions
by+0.273
lying+0.269
rell+0.268
ench+0.260
 destination+0.258
Neg logit contributions
 we-0.219
 I-0.194
 you-0.172
 who-0.154
 let-0.152
Token and
Token ID290
Feature activation+3.70e+00

Pos logit contributions
 we+0.149
 I+0.132
 you+0.119
Bad+0.116
 who+0.113
Neg logit contributions
by-0.130
lying-0.127
 destination-0.125
rell-0.124
 awoke-0.122
Token then
Token ID788
Feature activation+8.69e+00

Pos logit contributions
 we+0.147
 I+0.127
 you+0.112
 who+0.101
Bad+0.095
Neg logit contributions
by-0.239
 awoke-0.231
lying-0.230
rell-0.227
 destination-0.227
Token nodded
Token ID14464
Feature activation-2.59e+00

Pos logit contributions
 we+0.442
 I+0.392
 you+0.361
 who+0.339
 let+0.315
Neg logit contributions
by-0.485
rell-0.481
lying-0.472
 destination-0.459
ench-0.458
Token in
Token ID287
Feature activation+2.96e+00

Pos logit contributions
rell+0.137
by+0.134
lying+0.133
ench+0.130
 destination+0.128
Neg logit contributions
 we-0.143
 I-0.128
 you-0.118
 who-0.110
 let-0.107
Token understanding
Token ID4547
Feature activation+4.24e+00

Pos logit contributions
 we+0.237
 I+0.215
 you+0.200
 who+0.194
 let+0.186
Neg logit contributions
lying-0.077
rell-0.076
by-0.073
ench-0.073
 destination-0.071
Token.
Token ID13
Feature activation-4.68e-01

Pos logit contributions
 we+0.216
 I+0.190
 you+0.174
 who+0.161
 let+0.156
Neg logit contributions
rell-0.244
by-0.240
lying-0.237
ench-0.233
 destination-0.228
Token
Token ID198
Feature activation-3.78e-01

Pos logit contributions
rell+0.029
by+0.028
ench+0.028
lying+0.028
 destination+0.028
Neg logit contributions
 we-0.023
 I-0.021
 you-0.018
 who-0.017
 let-0.016
Token
Token ID198
Feature activation-3.73e-01

Pos logit contributions
rell+0.023
by+0.023
lying+0.023
ench+0.022
 destination+0.022
Neg logit contributions
 we-0.018
 I-0.016
 you-0.015
 who-0.013
 let-0.013
TokenThe
Token ID464
Feature activation+3.14e+00

Pos logit contributions
rell+0.021
by+0.021
lying+0.021
ench+0.021
zie+0.020
Neg logit contributions
 we-0.019
 I-0.016
 you-0.015
 who-0.014
 let-0.013
Token next
Token ID1306
Feature activation+8.54e+00

Pos logit contributions
 we+0.155
 I+0.138
 you+0.121
 who+0.119
 let+0.113
Neg logit contributions
by-0.171
 doorway-0.170
rell-0.168
 awoke-0.167
 destination-0.165
Token morning
Token ID3329
Feature activation-1.11e+01

Pos logit contributions
 we+0.655
 I+0.604
 you+0.572
 who+0.546
 let+0.536
Neg logit contributions
rell-0.282
by-0.267
lying-0.264
ench-0.258
 destination-0.247
Token walk
Token ID2513
Feature activation-6.73e+00

Pos logit contributions
 we+0.028
 I+0.025
 you+0.024
 who+0.022
 let+0.022
Neg logit contributions
rell-0.025
lying-0.024
ench-0.023
by-0.023
zie-0.023
Token in
Token ID287
Feature activation+2.83e+00

Pos logit contributions
rell+0.472
lying+0.456
by+0.449
ench+0.448
 destination+0.438
Neg logit contributions
 we-0.276
 I-0.236
 you-0.216
 who-0.203
 let-0.183
Token the
Token ID262
Feature activation+3.13e+00

Pos logit contributions
 we+0.093
 I+0.077
 you+0.067
 who+0.059
 let+0.054
Neg logit contributions
rell-0.219
by-0.214
lying-0.213
ench-0.211
 destination-0.204
Token forest
Token ID8222
Feature activation-8.26e+00

Pos logit contributions
 we+0.193
 I+0.189
 you+0.181
 who+0.166
 that+0.162
Neg logit contributions
lying-0.152
aced-0.145
rell-0.143
ously-0.141
by-0.139
Token.
Token ID13
Feature activation-6.08e-01

Pos logit contributions
rell+0.525
lying+0.507
by+0.502
ench+0.491
zie+0.480
Neg logit contributions
 we-0.392
 I-0.340
 you-0.313
 who-0.308
 let-0.281
Token He
Token ID679
Feature activation-7.42e+00

Pos logit contributions
by+0.029
rell+0.029
lying+0.028
ench+0.028
 destination+0.027
Neg logit contributions
 we-0.038
 I-0.036
 you-0.031
 who-0.031
 let-0.031
Token looked
Token ID3114
Feature activation-6.60e+00

Pos logit contributions
rell+0.426
lying+0.412
ench+0.405
by+0.402
 destination+0.389
Neg logit contributions
 we-0.390
 I-0.344
 you-0.315
 who-0.305
 let-0.298
Token high
Token ID1029
Feature activation-2.43e+00

Pos logit contributions
rell+0.358
lying+0.348
ench+0.334
by+0.331
 destination+0.323
Neg logit contributions
 we-0.375
 I-0.333
 you-0.312
 who-0.307
 let-0.286
Token and
Token ID290
Feature activation+3.52e+00

Pos logit contributions
rell+0.152
by+0.150
lying+0.148
ench+0.146
 destination+0.144
Neg logit contributions
 we-0.110
 I-0.096
 you-0.086
 who-0.079
 let-0.077
Token low
Token ID1877
Feature activation-3.98e+00

Pos logit contributions
 we+0.240
 I+0.219
 you+0.206
 who+0.196
 let+0.190
Neg logit contributions
rell-0.141
by-0.137
lying-0.133
ench-0.130
 destination-0.127
Token,
Token ID11
Feature activation-4.91e+00

Pos logit contributions
rell+0.225
by+0.218
ench+0.217
lying+0.216
 destination+0.207
Neg logit contributions
 we-0.212
 I-0.189
 you-0.173
 who-0.165
 let-0.158
Token named
Token ID3706
Feature activation+9.11e+00

Pos logit contributions
 we+0.220
 I+0.196
 you+0.173
 don+0.155
 yourselves+0.151
Neg logit contributions
by-0.245
rell-0.228
jo-0.218
ench-0.216
lying-0.216
Token Tim
Token ID5045
Feature activation+3.13e+00

Pos logit contributions
 we+0.388
 I+0.330
 you+0.299
 who+0.278
 let+0.260
Neg logit contributions
rell-0.595
by-0.578
lying-0.565
ench-0.552
zie-0.551
Tokenmy
Token ID1820
Feature activation-8.11e+00

Pos logit contributions
 we+0.225
 I+0.209
 who+0.209
 you+0.201
 let+0.199
Neg logit contributions
rell-0.113
lying-0.110
 destination-0.109
ench-0.108
rolled-0.103
Token.
Token ID13
Feature activation-3.66e-01

Pos logit contributions
by+0.428
rell+0.417
lying+0.395
ench+0.392
 awoke+0.379
Neg logit contributions
 we-0.452
 I-0.405
 you-0.373
 let-0.322
 don-0.315
Token Tim
Token ID5045
Feature activation+3.10e+00

Pos logit contributions
by+0.029
ench+0.028
 destination+0.028
lying+0.027
 awoke+0.027
Neg logit contributions
 we-0.012
 I-0.011
 you-0.008
 let-0.007
 who-0.007
Tokenmy
Token ID1820
Feature activation-8.13e+00

Pos logit contributions
 we+0.219
 I+0.205
 who+0.200
 you+0.196
 let+0.192
Neg logit contributions
lying-0.112
rell-0.111
 destination-0.111
ench-0.107
rol-0.103
Token loved
Token ID6151
Feature activation+7.97e-01

Pos logit contributions
rell+0.462
by+0.454
lying+0.444
ench+0.438
 destination+0.425
Neg logit contributions
 we-0.420
 I-0.371
 you-0.342
 who-0.312
 let-0.305
Token to
Token ID284
Feature activation-3.88e+00

Pos logit contributions
 we+0.046
 I+0.041
 you+0.038
 who+0.036
 let+0.035
Neg logit contributions
rell-0.041
by-0.040
lying-0.039
ench-0.038
 destination-0.037
Token play
Token ID711
Feature activation-7.45e-01

Pos logit contributions
rell+0.206
by+0.200
lying+0.197
ench+0.194
 destination+0.189
Neg logit contributions
 we-0.216
 I-0.193
 you-0.179
 who-0.167
 let-0.161
Token at
Token ID379
Feature activation+7.57e-01

Pos logit contributions
rell+0.039
ench+0.039
by+0.038
lying+0.037
 destination+0.036
Neg logit contributions
 we-0.039
 I-0.036
 you-0.031
 let-0.028
 who-0.028
Token the
Token ID262
Feature activation+3.30e+00

Pos logit contributions
 we+0.028
 I+0.024
 you+0.021
 who+0.018
 let+0.017
Neg logit contributions
rell-0.054
lying-0.054
by-0.053
ench-0.053
 destination-0.051
Token stubborn
Token ID24444
Feature activation+4.70e+00

Pos logit contributions
rell+0.035
ench+0.034
by+0.034
lying+0.034
 destination+0.032
Neg logit contributions
 we-0.026
 I-0.022
 you-0.019
 let-0.017
 don-0.016
Token and
Token ID290
Feature activation+3.18e+00

Pos logit contributions
 we+0.221
 I+0.194
 you+0.177
 who+0.165
 let+0.158
Neg logit contributions
rell-0.292
by-0.284
ench-0.283
lying-0.282
 destination-0.276
Token didn
Token ID1422
Feature activation-2.76e+00

Pos logit contributions
 we+0.119
 I+0.102
 you+0.086
 pige+0.079
 spies+0.079
Neg logit contributions
by-0.194
rell-0.186
lying-0.181
wood-0.179
bank-0.178
Token't
Token ID470
Feature activation+4.42e+00

Pos logit contributions
rell+0.177
lying+0.166
ench+0.165
by+0.164
 destination+0.162
Neg logit contributions
 we-0.119
 I-0.102
 who-0.091
 you-0.090
 let-0.084
Token mean
Token ID1612
Feature activation+4.39e+00

Pos logit contributions
 we+0.200
 I+0.173
 you+0.156
 who+0.143
 let+0.135
Neg logit contributions
rell-0.281
by-0.275
lying-0.272
ench-0.268
 destination-0.261
Token any
Token ID597
Feature activation-4.91e+00

Pos logit contributions
 we+0.232
 I+0.206
 you+0.193
 who+0.177
 let+0.171
Neg logit contributions
rell-0.253
lying-0.242
ench-0.241
by-0.239
 destination-0.235
Token harm
Token ID4419
Feature activation+5.02e+00

Pos logit contributions
rell+0.274
by+0.274
lying+0.263
ench+0.259
 destination+0.254
Neg logit contributions
 we-0.250
 I-0.220
 you-0.198
 who-0.182
 let-0.178
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 we+0.263
 I+0.233
 you+0.213
 who+0.198
 let+0.193
Neg logit contributions
rell-0.280
by-0.274
lying-0.269
ench-0.265
 destination-0.258
Token They
Token ID1119
Feature activation-8.37e-01

Pos logit contributions
rell+0.069
by+0.068
ench+0.067
lying+0.066
 destination+0.066
Neg logit contributions
 we-0.042
 I-0.036
 you-0.031
 who-0.029
 let-0.027
Token continued
Token ID3767
Feature activation-8.51e+00

Pos logit contributions
rell+0.051
by+0.050
lying+0.049
ench+0.049
 destination+0.048
Neg logit contributions
 we-0.040
 I-0.035
 you-0.032
 who-0.029
 let-0.028
Token their
Token ID511
Feature activation-7.40e+00

Pos logit contributions
rell+0.552
ench+0.542
lying+0.531
zie+0.522
rol+0.521
Neg logit contributions
 we-0.404
 I-0.331
 you-0.312
 who-0.308
 that-0.295
Token.
Token ID13
Feature activation-7.93e-01

Pos logit contributions
rell+0.454
lying+0.440
by+0.437
ench+0.419
 awoke+0.416
Neg logit contributions
 we-0.542
 I-0.482
 you-0.453
 who-0.430
 let-0.413
Token She
Token ID1375
Feature activation-6.32e+00

Pos logit contributions
by+0.034
rell+0.032
 destination+0.032
ench+0.031
lying+0.031
Neg logit contributions
 we-0.054
 I-0.052
 you-0.046
 who-0.045
 let-0.044
Token had
Token ID550
Feature activation+1.14e+00

Pos logit contributions
by+0.300
 awoke+0.270
rell+0.265
 trek+0.255
zie+0.253
Neg logit contributions
 we-0.363
 I-0.331
 you-0.323
 yourselves-0.298
Bad-0.279
Token a
Token ID257
Feature activation+2.24e-01

Pos logit contributions
 we+0.049
 I+0.041
 you+0.037
 who+0.032
 let+0.031
Neg logit contributions
by-0.073
rell-0.073
ench-0.070
lying-0.069
 doorway-0.069
Token big
Token ID1263
Feature activation+2.28e+00

Pos logit contributions
 we+0.011
 I+0.010
 you+0.009
 who+0.009
 let+0.008
Neg logit contributions
by-0.013
rell-0.012
ench-0.012
lying-0.012
 destination-0.012
Token tub
Token ID12202
Feature activation-7.10e+00

Pos logit contributions
 we+0.128
 I+0.115
 you+0.105
 who+0.098
 let+0.096
Neg logit contributions
rell-0.121
by-0.117
lying-0.117
ench-0.115
 destination-0.111
Token full
Token ID1336
Feature activation+4.10e+00

Pos logit contributions
by+0.419
rell+0.417
ench+0.407
lying+0.404
 doorway+0.399
Neg logit contributions
 we-0.325
 I-0.272
 you-0.260
Bad-0.226
 let-0.223
Token of
Token ID286
Feature activation+4.02e+00

Pos logit contributions
 we+0.177
 I+0.153
 you+0.137
 who+0.125
 let+0.121
Neg logit contributions
rell-0.270
by-0.265
lying-0.262
ench-0.258
 destination-0.254
Token water
Token ID1660
Feature activation-2.43e+00

Pos logit contributions
 we+0.202
 I+0.178
 you+0.161
 who+0.151
 let+0.146
Neg logit contributions
rell-0.235
by-0.228
lying-0.224
ench-0.223
 destination-0.217
Token and
Token ID290
Feature activation+3.28e+00

Pos logit contributions
rell+0.139
by+0.134
lying+0.132
ench+0.129
 destination+0.128
Neg logit contributions
 we-0.120
 I-0.106
 you-0.098
 who-0.088
 let-0.086
Token bubbles
Token ID25037
Feature activation+4.00e+00

Pos logit contributions
 we+0.168
 I+0.148
 you+0.136
 who+0.129
Bad+0.120
Neg logit contributions
rell-0.177
by-0.175
zie-0.163
ench-0.163
 doorway-0.163
Token Sam
Token ID3409
Feature activation+7.68e+00

Pos logit contributions
 we+0.149
 I+0.127
 you+0.122
 don+0.096
keys+0.096
Neg logit contributions
by-0.224
rell-0.219
 doorway-0.216
lying-0.216
zie-0.215
Token and
Token ID290
Feature activation+3.63e+00

Pos logit contributions
 we+0.300
 who+0.249
 I+0.244
 you+0.233
 let+0.212
Neg logit contributions
rell-0.533
lying-0.531
 destination-0.522
by-0.521
ench-0.516
Token asked
Token ID1965
Feature activation+2.17e+00

Pos logit contributions
 we+0.154
 I+0.133
 you+0.123
 who+0.103
 let+0.096
Neg logit contributions
rell-0.234
by-0.233
lying-0.227
zie-0.221
 awoke-0.219
Token them
Token ID606
Feature activation-4.96e+00

Pos logit contributions
 we+0.114
 I+0.100
 you+0.094
 who+0.089
 let+0.085
Neg logit contributions
rell-0.127
lying-0.121
ench-0.121
by-0.120
 destination-0.116
Token what
Token ID644
Feature activation+3.50e+00

Pos logit contributions
rell+0.281
by+0.277
lying+0.271
ench+0.269
 destination+0.263
Neg logit contributions
 we-0.252
 I-0.225
 you-0.206
 who-0.188
 let-0.183
Token happened
Token ID3022
Feature activation-3.31e+00

Pos logit contributions
 we+0.201
 I+0.188
 you+0.176
 yourselves+0.171
 who+0.163
Neg logit contributions
by-0.152
 awoke-0.137
rell-0.128
jo-0.123
 doorway-0.121
Token.
Token ID13
Feature activation-5.41e-01

Pos logit contributions
by+0.145
lying+0.142
rell+0.139
ench+0.136
 destination+0.135
Neg logit contributions
 we-0.197
 I-0.180
 you-0.163
 let-0.152
 who-0.149
Token Tom
Token ID4186
Feature activation+7.32e+00

Pos logit contributions
rell+0.029
lying+0.028
by+0.028
 destination+0.027
ench+0.027
Neg logit contributions
 we-0.029
 I-0.025
 you-0.024
 who-0.022
 let-0.021
Token and
Token ID290
Feature activation+3.58e+00

Pos logit contributions
 we+0.348
 I+0.310
 you+0.282
 who+0.234
 let+0.227
Neg logit contributions
rell-0.440
by-0.436
lying-0.429
 awoke-0.419
ench-0.416
Token Sam
Token ID3409
Feature activation+7.62e+00

Pos logit contributions
 we+0.147
 I+0.126
 you+0.118
 who+0.101
 don+0.089
Neg logit contributions
rell-0.234
by-0.233
zie-0.229
lying-0.225
 doorway-0.217
Token told
Token ID1297
Feature activation+5.05e+00

Pos logit contributions
 we+0.332
 I+0.282
 you+0.256
 who+0.248
 let+0.241
Neg logit contributions
rell-0.502
by-0.487
lying-0.484
ench-0.479
 destination-0.470
Token carried
Token ID5281
Feature activation+2.53e+00

Pos logit contributions
 we+0.192
 I+0.171
 you+0.164
 who+0.149
Bad+0.144
Neg logit contributions
by-0.158
rell-0.158
rolled-0.151
 awoke-0.149
lying-0.149
Token it
Token ID340
Feature activation+5.51e+00

Pos logit contributions
 we+0.091
 I+0.076
 you+0.066
 who+0.059
 let+0.055
Neg logit contributions
rell-0.186
by-0.182
lying-0.180
ench-0.178
 destination-0.175
Token back
Token ID736
Feature activation-1.50e+01

Pos logit contributions
 we+0.262
 I+0.230
 that+0.215
 you+0.215
 not+0.207
Neg logit contributions
 awoke-0.342
lying-0.342
rell-0.337
ench-0.330
jo-0.325
Token home
Token ID1363
Feature activation-1.67e+01

Pos logit contributions
rell+0.938
lying+0.905
ench+0.880
 awoke+0.879
by+0.848
Neg logit contributions
 we-0.755
 I-0.667
 you-0.615
 who-0.599
 that-0.561
Token.
Token ID13
Feature activation-8.13e-01

Pos logit contributions
rell+0.867
by+0.858
lying+0.841
ench+0.831
 destination+0.821
Neg logit contributions
 we-0.910
 I-0.813
 you-0.750
 who-0.689
 let-0.675
Token
Token ID220
Feature activation-1.20e+01

Pos logit contributions
by+0.036
rell+0.036
ench+0.034
lying+0.034
 destination+0.033
Neg logit contributions
 we-0.054
 I-0.051
 you-0.046
 who-0.043
 let-0.043
Token
Token ID198
Feature activation-7.50e-01

Pos logit contributions
rell+0.688
by+0.672
lying+0.664
ench+0.655
 destination+0.641
Neg logit contributions
 we-0.620
 I-0.549
 you-0.504
 who-0.468
 let-0.453
Token
Token ID198
Feature activation-7.37e-01

Pos logit contributions
rell+0.045
by+0.043
lying+0.043
ench+0.042
 destination+0.042
Neg logit contributions
 we-0.038
 I-0.033
 you-0.030
 who-0.028
 let-0.027
TokenMom
Token ID29252
Feature activation+2.82e+00

Pos logit contributions
rell+0.042
lying+0.041
by+0.041
ench+0.041
 awoke+0.039
Neg logit contributions
 we-0.037
 I-0.033
 you-0.030
 who-0.027
 let-0.026
Tokenmy
Token ID1820
Feature activation-8.60e+00

Pos logit contributions
 we+0.219
 I+0.204
 you+0.193
 who+0.189
 let+0.187
Neg logit contributions
rell-0.086
ench-0.083
lying-0.082
by-0.080
 destination-0.078
Token was
Token ID373
Feature activation-2.67e+00

Pos logit contributions
by+0.462
rell+0.455
 awoke+0.441
lying+0.437
rolled+0.431
Neg logit contributions
 we-0.447
 I-0.406
 you-0.379
 yourselves-0.333
Bad-0.317
Token with
Token ID351
Feature activation+9.40e+00

Pos logit contributions
 we+0.021
 I+0.018
 you+0.014
 who+0.013
 let+0.013
Neg logit contributions
by-0.040
ench-0.039
rell-0.038
lying-0.038
 destination-0.037
Token tiny
Token ID7009
Feature activation+1.65e+00

Pos logit contributions
 we+0.514
 I+0.460
 you+0.429
 who+0.400
 let+0.387
Neg logit contributions
rell-0.510
by-0.494
lying-0.493
ench-0.481
 destination-0.469
Token white
Token ID2330
Feature activation+2.32e+00

Pos logit contributions
 we+0.088
 I+0.079
 you+0.072
 who+0.068
 let+0.065
Neg logit contributions
rell-0.091
lying-0.088
by-0.087
ench-0.087
zie-0.086
Token balls
Token ID11333
Feature activation+2.90e+00

Pos logit contributions
 we+0.116
 I+0.105
 you+0.095
 who+0.090
 let+0.085
Neg logit contributions
rell-0.133
lying-0.132
 destination-0.130
zie-0.130
ench-0.129
Token.
Token ID13
Feature activation-9.42e-01

Pos logit contributions
 we+0.146
 I+0.130
 you+0.118
 who+0.111
 let+0.107
Neg logit contributions
rell-0.170
by-0.169
lying-0.165
ench-0.164
 destination-0.160
Token
Token ID220
Feature activation-1.21e+01

Pos logit contributions
rell+0.048
by+0.046
lying+0.046
ench+0.045
 destination+0.044
Neg logit contributions
 we-0.055
 I-0.050
 you-0.046
 who-0.043
 let-0.042
Token
Token ID198
Feature activation-8.49e-01

Pos logit contributions
rell+0.724
by+0.710
lying+0.702
ench+0.687
 destination+0.681
Neg logit contributions
 we-0.601
 I-0.532
 you-0.485
 who-0.451
 let-0.431
Token
Token ID198
Feature activation-8.60e-01

Pos logit contributions
rell+0.044
by+0.043
lying+0.042
ench+0.042
 awoke+0.040
Neg logit contributions
 we-0.047
 I-0.042
 you-0.039
 who-0.036
 let-0.035
TokenThe
Token ID464
Feature activation+2.65e+00

Pos logit contributions
rell+0.049
lying+0.048
ench+0.048
by+0.047
side+0.046
Neg logit contributions
 we-0.042
 I-0.035
 you-0.032
 spies-0.028
 don-0.028
Token rabbit
Token ID22746
Feature activation+3.29e+00

Pos logit contributions
 we+0.153
 I+0.134
 you+0.126
 yourselves+0.126
ms+0.116
Neg logit contributions
by-0.119
rell-0.116
 den-0.114
 doorway-0.113
 destination-0.112
Token said
Token ID531
Feature activation+1.04e+00

Pos logit contributions
 we+0.172
 I+0.152
 you+0.140
 who+0.129
 let+0.125
Neg logit contributions
rell-0.186
by-0.181
lying-0.179
ench-0.176
 destination-0.172
Token to
Token ID284
Feature activation-4.39e+00

Pos logit contributions
lying+0.177
by+0.176
rell+0.174
ench+0.171
 awoke+0.169
Neg logit contributions
 we-0.057
 who-0.047
 that-0.042
 I-0.040
 you-0.031
Token rescue
Token ID9992
Feature activation+4.81e-01

Pos logit contributions
rell+0.236
by+0.230
lying+0.225
zie+0.225
ench+0.224
Neg logit contributions
 we-0.229
 I-0.203
 you-0.182
 who-0.166
 pige-0.156
Token the
Token ID262
Feature activation+2.77e+00

Pos logit contributions
 we+0.020
 I+0.017
 you+0.015
 who+0.014
 let+0.013
Neg logit contributions
rell-0.032
by-0.031
lying-0.031
ench-0.030
 destination-0.029
Token puppy
Token ID26188
Feature activation-1.10e+00

Pos logit contributions
 we+0.156
 I+0.140
 you+0.129
 who+0.122
 let+0.119
Neg logit contributions
rell-0.139
by-0.134
ench-0.132
 destination-0.132
lying-0.131
Token.
Token ID13
Feature activation-9.46e-01

Pos logit contributions
rell+0.070
by+0.069
lying+0.068
ench+0.067
 destination+0.065
Neg logit contributions
 we-0.052
 I-0.045
 who-0.042
 you-0.041
 let-0.039
Token
Token ID220
Feature activation-1.22e+01

Pos logit contributions
by+0.045
rell+0.044
ench+0.042
lying+0.042
 destination+0.041
Neg logit contributions
 we-0.061
 I-0.057
 you-0.051
 who-0.048
 let-0.048
Token
Token ID198
Feature activation-8.76e-01

Pos logit contributions
rell+0.721
by+0.709
lying+0.700
ench+0.686
 destination+0.681
Neg logit contributions
 we-0.604
 I-0.534
 you-0.488
 who-0.454
 let-0.435
Token
Token ID198
Feature activation-8.70e-01

Pos logit contributions
rell+0.047
by+0.046
lying+0.045
ench+0.045
 destination+0.043
Neg logit contributions
 we-0.047
 I-0.042
 you-0.039
 who-0.036
 let-0.035
TokenThe
Token ID464
Feature activation+2.63e+00

Pos logit contributions
rell+0.048
lying+0.047
ench+0.047
side+0.046
by+0.045
Neg logit contributions
 we-0.043
 I-0.037
 you-0.034
 who-0.031
 spies-0.030
Token puppy
Token ID26188
Feature activation-1.16e+00

Pos logit contributions
 we+0.138
 I+0.123
 you+0.113
 who+0.107
 let+0.102
Neg logit contributions
rell-0.145
by-0.140
lying-0.137
 destination-0.135
ench-0.135
Token was
Token ID373
Feature activation-2.81e+00

Pos logit contributions
rell+0.064
by+0.062
lying+0.062
ench+0.061
 destination+0.059
Neg logit contributions
 we-0.063
 I-0.056
 you-0.052
 who-0.049
 let-0.048
Token't
Token ID470
Feature activation+5.32e+00

Pos logit contributions
rell+0.134
by+0.133
lying+0.131
ench+0.129
 destination+0.127
Neg logit contributions
 we-0.069
 I-0.058
 you-0.052
 who-0.044
 let-0.043
Token know
Token ID760
Feature activation-1.91e+00

Pos logit contributions
 we+0.276
 I+0.246
 you+0.227
 who+0.211
 let+0.208
Neg logit contributions
rell-0.304
by-0.295
lying-0.293
ench-0.289
 destination-0.285
Token how
Token ID703
Feature activation+2.81e+00

Pos logit contributions
by+0.109
rell+0.108
zie+0.108
 destination+0.106
 awoke+0.104
Neg logit contributions
 we-0.080
 I-0.069
 you-0.059
We-0.057
Bad-0.055
Token to
Token ID284
Feature activation-3.56e+00

Pos logit contributions
 we+0.152
 I+0.135
 you+0.126
 who+0.113
 let+0.110
Neg logit contributions
rell-0.165
lying-0.160
ench-0.154
by-0.154
 destination-0.154
Token get
Token ID651
Feature activation-6.56e-01

Pos logit contributions
rell+0.195
by+0.192
ench+0.187
lying+0.187
 destination+0.184
Neg logit contributions
 we-0.190
 I-0.169
 you-0.154
 who-0.144
 let-0.136
Token home
Token ID1363
Feature activation-1.59e+01

Pos logit contributions
rell+0.025
lying+0.024
by+0.024
ench+0.024
 destination+0.023
Neg logit contributions
 we-0.047
 I-0.043
 you-0.041
 who-0.038
 let-0.038
Token.
Token ID13
Feature activation-7.09e-02

Pos logit contributions
rell+0.973
lying+0.950
by+0.944
ench+0.919
 awoke+0.893
Neg logit contributions
 we-0.796
 I-0.693
 you-0.642
 who-0.597
 let-0.568
Token
Token ID198
Feature activation+2.92e-02

Pos logit contributions
rell+0.004
by+0.004
ench+0.004
lying+0.004
 destination+0.004
Neg logit contributions
 we-0.004
 I-0.004
 you-0.003
 who-0.003
 let-0.003
Token
Token ID198
Feature activation+2.69e-02

Pos logit contributions
 we+0.001
 I+0.001
 you+0.001
 who+0.001
 let+0.001
Neg logit contributions
rell-0.002
by-0.002
lying-0.002
ench-0.002
 destination-0.002
TokenFinally
Token ID11158
Feature activation-3.43e+00

Pos logit contributions
 we+0.001
 I+0.001
 you+0.001
 who+0.001
 let+0.001
Neg logit contributions
rell-0.002
by-0.002
lying-0.002
ench-0.002
zie-0.002
Token,
Token ID11
Feature activation-4.34e+00

Pos logit contributions
rell+0.166
by+0.162
lying+0.159
ench+0.157
 destination+0.154
Neg logit contributions
 we-0.206
 I-0.186
 you-0.173
 who-0.163
 let-0.159
Token is
Token ID318
Feature activation+2.08e-01

Pos logit contributions
rell+0.430
by+0.423
lying+0.405
 destination+0.400
 awoke+0.396
Neg logit contributions
 we-0.366
 I-0.321
 you-0.302
 who-0.269
 oh-0.266
Token nice
Token ID3621
Feature activation-2.47e+00

Pos logit contributions
 we+0.011
 I+0.009
 you+0.008
 don+0.007
 who+0.007
Neg logit contributions
rell-0.011
by-0.011
lying-0.011
 destination-0.011
rol-0.010
Token.
Token ID13
Feature activation-8.58e-01

Pos logit contributions
by+0.117
rell+0.113
lying+0.111
 destination+0.108
ench+0.104
Neg logit contributions
 we-0.144
 I-0.129
 you-0.120
 let-0.109
Bad-0.108
Token He
Token ID679
Feature activation-7.68e+00

Pos logit contributions
rell+0.049
lying+0.047
 Eventually+0.044
by+0.044
ench+0.044
Neg logit contributions
 we-0.043
 I-0.036
 you-0.035
 who-0.033
 let-0.031
Token offers
Token ID4394
Feature activation+3.00e+00

Pos logit contributions
rell+0.415
by+0.406
lying+0.385
 destination+0.383
 awoke+0.378
Neg logit contributions
 we-0.375
 I-0.331
 you-0.312
 oh-0.294
Bad-0.282
Token his
Token ID465
Feature activation-1.18e+01

Pos logit contributions
 we+0.156
 I+0.137
 you+0.124
 who+0.116
 let+0.111
Neg logit contributions
by-0.166
rell-0.165
lying-0.159
 destination-0.153
ench-0.152
Token toy
Token ID13373
Feature activation-2.73e+00

Pos logit contributions
rell+0.654
by+0.630
 destination+0.618
lying+0.602
rolled+0.595
Neg logit contributions
 we-0.623
 I-0.563
 you-0.523
 who-0.474
 let-0.454
Token train
Token ID4512
Feature activation+1.25e+00

Pos logit contributions
rell+0.146
by+0.142
lying+0.138
 destination+0.136
ench+0.133
Neg logit contributions
 we-0.149
 I-0.133
 you-0.123
 who-0.114
 let-0.112
Token to
Token ID284
Feature activation-4.39e+00

Pos logit contributions
 we+0.067
 I+0.060
 you+0.055
 who+0.052
 let+0.050
Neg logit contributions
rell-0.069
by-0.067
lying-0.066
ench-0.066
 destination-0.064
Token Lily
Token ID20037
Feature activation+7.35e+00

Pos logit contributions
rell+0.240
by+0.235
 destination+0.231
lying+0.229
zie+0.227
Neg logit contributions
 we-0.214
 I-0.185
 you-0.161
 who-0.155
We-0.147
Token and
Token ID290
Feature activation+3.20e+00

Pos logit contributions
 we+0.332
 I+0.289
 you+0.268
 who+0.267
 let+0.245
Neg logit contributions
rell-0.471
by-0.455
ench-0.453
lying-0.450
 doorway-0.449