TOP ACTIVATIONS
MAX = 19.066

. They take their umb rell as . "
too slow . The umb rell as go far away .
to Fin ny and Crab by fixed the fin .
try to catch their umb rell as . But they are
Exc use me squ ir rell y , I was wondering
's broken fin . Crab by said , " I can
. D olly told Crab by about Fin ny 's broken
. They said their good by es and then each went
. We have our umb rell as to keep us dry
and decided to take Crab by 's advice . T una
broke . Ab by was very sad . She
by the crab . Crab by was good at fixing things
orange and pick . Lib by shouted out the words excited
a little girl named Gab by who liked to roll around
sh ru gging . Rob by thought this was funny and
the kids collected their umb rell as and went out to
her tools and fixed Rob by 's arms and legs .
away . Shell by carried with her the warmth
go very far , Rob by was happy to lead the
find her friend , Crab by the crab . Crab by

MIN ACTIVATIONS
MIN = -16.705

Thank you Mom my for letting me help ," said Lily
for soaking the crack and letting me out . You are
tree and thanked it for letting them enjoy its beauty .
thanked the star fish for letting them touch it and left
up . Thank you for letting me play with you ."
, mighty bird , for letting me see the world from
. They thanked mom for letting them use the mixer and
" Thank you for letting us pet him ," Ben
thank Bob and Rex for letting them play . They say
soon . Thank you for letting us play with it ."
says , " OK , let 's play soccer !"
you , Tom , for letting me try your fl ute
. " Thank you for letting me touch your feather ,"
for introducing herself and for letting her borrow the gray book
thank you to them for letting him see them . He
fixing the dress and not letting the delay stop her from
, " Thank you for letting me fly your k ite
Mom and thanked her for letting them go out . They
said , " Okay , let 's pack up our cones
ran around the park , letting his new k ite fly

INTERVAL 10.123 - 19.066
CONTAINS 3.114%

As Tim my 's mom my put on her
sister kept trying to take away the plastic pieces .
and started playing it . Suddenly , a
In the morning when he woke up ,
hears him and runs to him . She sees his knee

INTERVAL 1.181 - 10.123
CONTAINS 33.303%

it was too hard for her to manage . She asked
friends arrived at the cave , they saw that it was
goodbye to the gard ener , thanked her and came back
was very happy playing with her new baby brother . <|endoftext|>
lot of clothes , shoes , hats and scar ves .

INTERVAL -7.762 - 1.181
CONTAINS 58.488%

" Me neither ," Tim says . " But
also learned that solving problems is more important than playing with
wolf was very obedient he could teach him to be clever
See , I told you it would make you happy !"
enjoyed playing in the garden . <|endoftext|> Once upon a time

INTERVAL -16.705 - -7.762
CONTAINS 5.095%

wearing pretty dresses and suits that fit them nicely . They
. She also told him that it 's important to cl
and sing very loud . We make up a story with
wanted to keep reading , but he didn 't want to
not to eat me , I can show you where the
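
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX. The sketch below is a quick consistency check under that assumption; the helper name `equal_width_edges` and the four-bin split are illustrative and are not taken from the tool that produced this export. The reported edges are rounded to three decimals, so the comparison uses a small tolerance.

```python
# Consistency check for the INTERVAL / CONTAINS summary above.
# Assumption: the dashboard splits [MIN, MAX] into 4 equal-width bins.

def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 boundaries of equal-width bins on [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

MIN_ACT = -16.705   # from "MIN = -16.705"
MAX_ACT = 19.066    # from "MAX = 19.066"

# Boundaries as printed in the INTERVAL lines, lowest to highest.
reported_edges = [-16.705, -7.762, 1.181, 10.123, 19.066]

# CONTAINS percentages, ordered from the lowest interval to the highest.
reported_shares = [5.095, 58.488, 33.303, 3.114]

edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Largest gap between a computed edge and the printed (rounded) edge.
max_edge_error = max(abs(e - r) for e, r in zip(edges, reported_edges))

# The four shares should account for all sampled tokens.
total_share = sum(reported_shares)

print("computed edges:", [round(e, 3) for e in edges])
print("max edge error:", max_edge_error)
print("total share (%):", total_share)
```

If the equal-width assumption holds, the computed edges agree with the printed ones to within rounding, and the four shares add up to 100%.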
Token.
Token ID13
Feature activation+5.44e-01

Pos logit contributions
by+0.526
 awoke+0.512
rell+0.500
ench+0.497
zie+0.492
Neg logit contributions
 we-0.343
 I-0.297
 who-0.284
 you-0.261
 let-0.253
Token They
Token ID1119
Feature activation-7.55e-01

Pos logit contributions
by+0.033
 awoke+0.032
rell+0.032
ench+0.031
lying+0.031
Neg logit contributions
 we-0.024
 I-0.021
 who-0.019
 you-0.019
 yourselves-0.017
Token take
Token ID1011
Feature activation-4.69e+00

Pos logit contributions
 we+0.036
 I+0.033
 who+0.030
 yourselves+0.030
 you+0.030
Neg logit contributions
by-0.041
 awoke-0.039
rell-0.039
zie-0.038
 destination-0.038
Token their
Token ID511
Feature activation+6.37e+00

Pos logit contributions
 we+0.206
 I+0.178
 you+0.165
 who+0.164
 let+0.150
Neg logit contributions
by-0.288
 awoke-0.282
lying-0.280
rell-0.278
zie-0.278
Token umb
Token ID20810
Feature activation+7.22e+00

Pos logit contributions
 awoke+0.348
ench+0.342
by+0.338
lying+0.336
rell+0.326
Neg logit contributions
 we-0.344
 I-0.291
 you-0.279
 who-0.273
 let-0.256
Tokenrell
Token ID11252
Feature activation+1.91e+01

Pos logit contributions
by+0.132
 awoke+0.125
ench+0.082
 destination+0.081
 doorway+0.079
Neg logit contributions
 we-0.636
 I-0.593
 you-0.575
 who-0.564
 let-0.555
Tokenas
Token ID292
Feature activation-2.25e+00

Pos logit contributions
by+1.166
rell+1.121
ench+1.078
 awoke+1.069
lying+1.056
Neg logit contributions
 we-0.855
 I-0.776
 who-0.679
 you-0.676
 let-0.613
Token.
Token ID13
Feature activation+5.76e-01

Pos logit contributions
 we+0.121
 I+0.107
 who+0.102
 you+0.100
 let+0.092
Neg logit contributions
by-0.118
 awoke-0.116
rell-0.114
ench-0.111
lying-0.111
Token\n
Token ID198
Feature activation+1.22e+00

Pos logit contributions
by+0.031
 awoke+0.030
rell+0.030
ench+0.029
lying+0.029
Neg logit contributions
 we-0.028
 I-0.025
 who-0.024
 you-0.023
 yourselves-0.022
Token\n
Token ID198
Feature activation+1.22e+00

Pos logit contributions
by+0.065
rell+0.063
 awoke+0.063
ench+0.060
 destination+0.060
Neg logit contributions
 we-0.063
 I-0.057
 who-0.051
 you-0.051
 yourselves-0.047
Token"
Token ID1
Feature activation-8.51e-01

Pos logit contributions
by+0.074
 awoke+0.073
ench+0.071
lying+0.071
rell+0.070
Neg logit contributions
 we-0.049
 I-0.043
 who-0.038
 you-0.037
 yourselves-0.035
Token too
Token ID1165
Feature activation+2.69e+00

Pos logit contributions
rell+0.182
by+0.180
 home+0.178
ench+0.174
 destination+0.171
Neg logit contributions
 we-0.172
 I-0.152
 you-0.139
You-0.126
 yourselves-0.126
Token slow
Token ID3105
Feature activation-3.75e+00

Pos logit contributions
by+0.156
ench+0.143
rell+0.141
bi+0.134
 home+0.134
Neg logit contributions
 we-0.121
 I-0.112
 you-0.099
 who-0.097
 singers-0.091
Token.
Token ID13
Feature activation+4.42e-01

Pos logit contributions
 we+0.163
 I+0.142
 who+0.133
 you+0.130
 let+0.114
Neg logit contributions
 awoke-0.237
by-0.232
rell-0.230
ench-0.228
zie-0.224
Token The
Token ID383
Feature activation-1.88e+00

Pos logit contributions
 Eventually+0.018
lying+0.018
ench+0.017
 awoke+0.017
rell+0.017
Neg logit contributions
 we-0.025
 I-0.022
 you-0.022
 who-0.021
ms-0.021
Token umb
Token ID20810
Feature activation+6.96e+00

Pos logit contributions
 we+0.102
 I+0.096
 who+0.091
 you+0.085
 yourselves+0.080
Neg logit contributions
rell-0.092
by-0.087
 destination-0.087
ench-0.083
 awoke-0.081
Tokenrell
Token ID11252
Feature activation+1.90e+01

Pos logit contributions
by+0.066
 awoke+0.046
rell+0.032
 destination+0.031
ench+0.029
Neg logit contributions
 we-0.677
 I-0.642
 you-0.614
 who-0.610
 let-0.587
Tokenas
Token ID292
Feature activation-2.22e+00

Pos logit contributions
by+0.994
rell+0.953
ench+0.927
rol+0.913
rolled+0.859
Neg logit contributions
 we-0.972
 I-0.924
 who-0.818
 you-0.810
Bad-0.780
Token go
Token ID467
Feature activation+6.03e+00

Pos logit contributions
 we+0.109
 I+0.100
 who+0.090
 you+0.087
 yourselves+0.084
Neg logit contributions
by-0.120
rell-0.117
 awoke-0.115
 destination-0.113
ench-0.111
Token far
Token ID1290
Feature activation-1.57e+00

Pos logit contributions
by+0.326
rell+0.314
 awoke+0.309
 destination+0.301
ench+0.298
Neg logit contributions
 we-0.311
 I-0.283
 who-0.258
 you-0.255
 let-0.233
Token away
Token ID1497
Feature activation+1.48e+01

Pos logit contributions
 we+0.107
 I+0.100
 who+0.093
 you+0.092
 let+0.087
Neg logit contributions
by-0.059
rell-0.055
 awoke-0.054
 destination-0.052
lying-0.051
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
by+0.663
 destination+0.611
 awoke+0.600
ench+0.576
 home+0.568
Neg logit contributions
 we-0.833
 I-0.791
 you-0.713
 who-0.695
Bad-0.694
Token to
Token ID284
Feature activation+4.69e+00

Pos logit contributions
 awoke+0.328
by+0.323
rell+0.320
lying+0.312
zie+0.312
Neg logit contributions
 we-0.342
 I-0.298
 who-0.286
 you-0.276
 let-0.258
Token Fin
Token ID4463
Feature activation+1.02e+00

Pos logit contributions
by+0.305
rell+0.292
 awoke+0.291
 destination+0.287
ench+0.285
Neg logit contributions
 we-0.189
 I-0.167
 who-0.146
 you-0.145
 yourselves-0.130
Tokenny
Token ID3281
Feature activation+1.30e+01

Pos logit contributions
rell+0.039
 awoke+0.038
 doorway+0.037
rol+0.035
 destination+0.034
Neg logit contributions
 we-0.070
 who-0.065
 I-0.064
 let-0.061
 you-0.061
Token and
Token ID290
Feature activation-3.95e+00

Pos logit contributions
rell+0.665
by+0.660
 awoke+0.649
zie+0.637
rol+0.625
Neg logit contributions
 we-0.718
 who-0.649
 I-0.631
 you-0.580
 let-0.568
Token Crab
Token ID32379
Feature activation+7.68e-01

Pos logit contributions
 we+0.185
 I+0.167
 who+0.150
 you+0.150
 yourselves+0.139
Neg logit contributions
by-0.231
 awoke-0.220
rell-0.219
 destination-0.212
lying-0.211
Tokenby
Token ID1525
Feature activation+1.89e+01

Pos logit contributions
 doorway+0.018
rell+0.017
 awoke+0.015
zie+0.014
rol+0.012
Neg logit contributions
 we-0.067
 you-0.062
 I-0.060
 who-0.060
 let-0.060
Token fixed
Token ID5969
Feature activation-1.08e+00

Pos logit contributions
by+1.047
rell+1.021
 awoke+1.014
 destination+0.992
ench+0.987
Neg logit contributions
 we-0.957
 I-0.842
 who-0.811
 you-0.764
 let-0.727
Token the
Token ID262
Feature activation-2.86e+00

Pos logit contributions
 we+0.045
 I+0.040
 who+0.034
 you+0.034
 let+0.030
Neg logit contributions
by-0.071
rell-0.067
 destination-0.066
 awoke-0.065
ench-0.064
Token fin
Token ID957
Feature activation+1.64e+00

Pos logit contributions
 we+0.139
 I+0.126
 who+0.116
 you+0.112
 let+0.103
Neg logit contributions
by-0.163
 awoke-0.155
rell-0.151
 destination-0.150
ench-0.147
Token.
Token ID13
Feature activation+6.03e-01

Pos logit contributions
by+0.094
rell+0.091
 awoke+0.091
 destination+0.088
lying+0.088
Neg logit contributions
 we-0.079
 I-0.071
 who-0.064
 you-0.063
 let-0.058
Token\n
Token ID198
Feature activation+1.44e+00

Pos logit contributions
by+0.034
rell+0.032
 awoke+0.032
 destination+0.031
ench+0.031
Neg logit contributions
 we-0.030
 I-0.027
 who-0.025
 you-0.025
 let-0.022
Token try
Token ID1949
Feature activation-2.58e-01

Pos logit contributions
 we+0.209
 I+0.189
 you+0.171
 who+0.167
 yourselves+0.160
Neg logit contributions
by-0.217
 awoke-0.209
rell-0.204
 destination-0.202
ench-0.199
Token to
Token ID284
Feature activation+4.39e+00

Pos logit contributions
 we+0.014
 I+0.013
 who+0.012
 you+0.012
 let+0.011
Neg logit contributions
by-0.013
rell-0.013
 awoke-0.013
ench-0.012
lying-0.012
Token catch
Token ID4929
Feature activation-6.07e+00

Pos logit contributions
 destination+0.206
rell+0.204
by+0.203
ench+0.201
 ease+0.199
Neg logit contributions
 we-0.225
 I-0.217
 yourselves-0.187
 cooperative-0.186
 you-0.185
Token their
Token ID511
Feature activation+6.06e+00

Pos logit contributions
 we+0.225
 I+0.198
 who+0.171
 cooperative+0.152
 you+0.149
Neg logit contributions
by-0.408
rell-0.384
 destination-0.383
 home-0.375
 awoke-0.374
Token umb
Token ID20810
Feature activation+6.91e+00

Pos logit contributions
by+0.231
rell+0.219
 awoke+0.216
 destination+0.207
ench+0.204
Neg logit contributions
 we-0.410
 I-0.381
 who-0.356
 you-0.354
 let-0.332
Tokenrell
Token ID11252
Feature activation+1.89e+01

Pos logit contributions
by+0.105
 awoke+0.092
 doorway+0.053
ench+0.047
lying+0.047
Neg logit contributions
 we-0.644
 I-0.608
 you-0.588
 who-0.569
 let-0.559
Tokenas
Token ID292
Feature activation-2.27e+00

Pos logit contributions
by+1.121
rell+1.084
ench+1.038
rol+1.001
lying+0.997
Neg logit contributions
 we-0.875
 I-0.804
 who-0.710
 you-0.696
 let-0.644
Token.
Token ID13
Feature activation+4.11e-01

Pos logit contributions
 we+0.113
 I+0.102
 who+0.092
 you+0.092
 let+0.084
Neg logit contributions
by-0.127
rell-0.123
 awoke-0.122
 destination-0.118
ench-0.118
Token But
Token ID887
Feature activation-4.36e+00

Pos logit contributions
 Eventually+0.017
lying+0.016
by+0.016
 destination+0.016
ench+0.016
Neg logit contributions
 we-0.023
 you-0.020
 I-0.020
 wasting-0.020
 who-0.020
Token they
Token ID484
Feature activation-6.30e+00

Pos logit contributions
ms+0.118
 we+0.117
 pige+0.110
 who+0.110
 cooperative+0.107
Neg logit contributions
 destination-0.275
by-0.273
 awoke-0.272
 home-0.266
zie-0.265
Token are
Token ID389
Feature activation+3.57e+00

Pos logit contributions
 we+0.303
 I+0.274
 yourselves+0.245
 you+0.245
 who+0.239
Neg logit contributions
by-0.333
 destination-0.319
 awoke-0.311
ench-0.310
rell-0.306
TokenExc
Token ID40127
Feature activation+1.65e+00

Pos logit contributions
 we+0.147
 I+0.128
 who+0.116
 you+0.112
 spies+0.109
Neg logit contributions
by-0.170
 awoke-0.169
lying-0.160
rell-0.159
zie-0.152
Tokenuse
Token ID1904
Feature activation+4.81e+00

Pos logit contributions
by+0.055
rell+0.053
 awoke+0.052
ench+0.049
 destination+0.049
Neg logit contributions
 we-0.119
 I-0.110
 you-0.105
 who-0.104
 let-0.098
Token me
Token ID502
Feature activation+8.83e+00

Pos logit contributions
rell+0.272
 awoke+0.264
by+0.263
lying+0.257
ench+0.253
Neg logit contributions
 we-0.239
 I-0.216
 you-0.207
 who-0.196
 let-0.182
Token squ
Token ID2809
Feature activation-9.29e-01

Pos logit contributions
by+0.377
 awoke+0.352
rell+0.348
 destination+0.343
ench+0.337
Neg logit contributions
 we-0.547
 I-0.499
 who-0.463
 you-0.462
 yourselves-0.448
Tokenir
Token ID343
Feature activation+3.18e+00

Pos logit contributions
 we+0.059
 I+0.056
 you+0.054
 who+0.053
 yourselves+0.050
Neg logit contributions
 awoke-0.037
rell-0.035
 doorway-0.034
by-0.034
 destination-0.033
Tokenrell
Token ID11252
Feature activation+1.89e+01

Pos logit contributions
by+0.127
 awoke+0.119
rell+0.118
 destination+0.115
lying+0.113
Neg logit contributions
 we-0.209
 I-0.196
 who-0.182
 you-0.182
 let-0.170
Tokeny
Token ID88
Feature activation+9.48e+00

Pos logit contributions
by+1.129
rell+1.075
ench+1.033
 awoke+1.029
lying+1.016
Neg logit contributions
 we-0.872
 I-0.775
 who-0.693
 you-0.691
 let-0.641
Token,
Token ID11
Feature activation+5.65e+00

Pos logit contributions
by+0.515
rell+0.491
 awoke+0.488
 destination+0.471
ench+0.469
Neg logit contributions
 we-0.486
 I-0.431
 you-0.394
 who-0.392
 yourselves-0.367
Token I
Token ID314
Feature activation-1.21e+01

Pos logit contributions
 awoke+0.323
rell+0.305
 destination+0.298
zie+0.296
by+0.286
Neg logit contributions
ms-0.196
 yourselves-0.185
 we-0.185
 cooperative-0.166
We-0.163
Token was
Token ID373
Feature activation+3.22e+00

Pos logit contributions
 we+0.638
 I+0.584
 yourselves+0.531
 who+0.529
 you+0.528
Neg logit contributions
by-0.630
 awoke-0.602
ench-0.573
rell-0.570
lying-0.568
Token wondering
Token ID11263
Feature activation-4.80e-01

Pos logit contributions
by+0.160
 awoke+0.157
rell+0.157
ench+0.155
 destination+0.153
Neg logit contributions
 we-0.159
 I-0.137
ms-0.128
Bad-0.122
 yourselves-0.122
Token's
Token ID338
Feature activation+1.07e+01

Pos logit contributions
by+0.728
 awoke+0.690
rell+0.689
 destination+0.670
lying+0.663
Neg logit contributions
 we-0.635
 I-0.576
 you-0.515
 who-0.504
 yourselves-0.477
Token broken
Token ID5445
Feature activation-1.48e+00

Pos logit contributions
rell+0.642
by+0.634
 awoke+0.630
lying+0.607
zie+0.606
Neg logit contributions
 we-0.494
 I-0.434
 who-0.402
 you-0.395
 let-0.358
Token fin
Token ID957
Feature activation+1.54e+00

Pos logit contributions
 we+0.072
 I+0.065
 who+0.060
 you+0.058
 let+0.055
Neg logit contributions
rell-0.084
 awoke-0.080
by-0.080
lying-0.080
rol-0.079
Token.
Token ID13
Feature activation+5.15e-01

Pos logit contributions
by+0.094
rell+0.090
 awoke+0.089
 destination+0.086
ench+0.086
Neg logit contributions
 we-0.069
 I-0.061
 you-0.054
 who-0.053
 yourselves-0.049
Token Crab
Token ID32379
Feature activation+7.04e-01

Pos logit contributions
by+0.029
 awoke+0.028
rell+0.028
 destination+0.027
zie+0.027
Neg logit contributions
 we-0.026
 I-0.024
 you-0.021
 who-0.021
 let-0.019
Tokenby
Token ID1525
Feature activation+1.89e+01

Pos logit contributions
rell+0.014
 doorway+0.014
 destination+0.012
 awoke+0.012
zie+0.011
Neg logit contributions
 we-0.060
 I-0.056
 you-0.055
 who-0.055
 let-0.054
Token said
Token ID531
Feature activation-8.05e-02

Pos logit contributions
by+0.949
 awoke+0.858
rell+0.806
rolled+0.790
lying+0.766
Neg logit contributions
 we-1.027
 I-0.982
 yourselves-0.955
 you-0.905
You-0.823
Token,
Token ID11
Feature activation+5.58e+00

Pos logit contributions
 we+0.005
 I+0.004
 you+0.004
 who+0.004
 let+0.004
Neg logit contributions
by-0.004
rell-0.004
 awoke-0.004
 destination-0.004
ench-0.003
Token "
Token ID366
Feature activation-2.93e+00

Pos logit contributions
 awoke+0.261
rell+0.254
 Eventually+0.246
by+0.244
lying+0.241
Neg logit contributions
 we-0.279
 yourselves-0.263
 I-0.242
 you-0.240
ms-0.236
TokenI
Token ID40
Feature activation-8.85e+00

Pos logit contributions
 we+0.122
 I+0.106
 who+0.092
 you+0.087
 spies+0.086
Neg logit contributions
by-0.193
rell-0.178
lying-0.178
 awoke-0.177
ench-0.174
Token can
Token ID460
Feature activation-4.26e+00

Pos logit contributions
 we+0.379
 I+0.334
 who+0.300
 you+0.295
 let+0.269
Neg logit contributions
by-0.556
rell-0.549
 awoke-0.537
 destination-0.525
zie-0.523
Token.
Token ID13
Feature activation+4.96e-01

Pos logit contributions
 we+0.020
 I+0.018
 who+0.016
 you+0.016
 yourselves+0.015
Neg logit contributions
by-0.022
rell-0.021
 awoke-0.021
 destination-0.021
lying-0.020
Token D
Token ID360
Feature activation-2.10e+00

Pos logit contributions
by+0.024
rell+0.023
 awoke+0.023
 destination+0.022
ench+0.022
Neg logit contributions
 we-0.029
 I-0.026
 who-0.024
 you-0.024
 let-0.022
Tokenolly
Token ID5098
Feature activation+4.26e+00

Pos logit contributions
 we+0.135
 I+0.126
 you+0.117
 who+0.116
 let+0.108
Neg logit contributions
by-0.085
rell-0.084
 awoke-0.082
 destination-0.080
lying-0.077
Token told
Token ID1297
Feature activation-3.99e+00

Pos logit contributions
by+0.236
 awoke+0.226
rell+0.225
 destination+0.216
lying+0.215
Neg logit contributions
 we-0.214
 I-0.195
 you-0.177
 who-0.172
 yourselves-0.166
Token Crab
Token ID32379
Feature activation+6.67e-01

Pos logit contributions
 we+0.196
 I+0.178
 you+0.165
 who+0.158
 let+0.148
Neg logit contributions
by-0.226
 awoke-0.224
rell-0.219
 doorway-0.216
 destination-0.214
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.009
 doorway+0.009
 awoke+0.008
 destination+0.007
rolled+0.007
Neg logit contributions
 we-0.062
 I-0.058
 who-0.057
 you-0.057
 let-0.055
Token about
Token ID546
Feature activation+1.86e+00

Pos logit contributions
by+1.021
rell+1.001
 awoke+0.992
 destination+0.962
lying+0.958
Neg logit contributions
 we-0.963
 I-0.871
 who-0.801
 you-0.789
 let-0.719
Token Fin
Token ID4463
Feature activation+9.45e-01

Pos logit contributions
rell+0.095
by+0.095
 ease+0.090
 awoke+0.089
zie+0.089
Neg logit contributions
 we-0.091
 I-0.081
We-0.073
ms-0.072
 you-0.071
Tokenny
Token ID3281
Feature activation+1.29e+01

Pos logit contributions
 awoke+0.036
by+0.035
rell+0.035
 destination+0.034
ench+0.033
Neg logit contributions
 we-0.063
 I-0.058
 who-0.056
 you-0.055
 let-0.052
Token's
Token ID338
Feature activation+1.07e+01

Pos logit contributions
by+0.728
 awoke+0.690
rell+0.689
 destination+0.670
lying+0.663
Neg logit contributions
 we-0.635
 I-0.576
 you-0.515
 who-0.504
 yourselves-0.477
Token broken
Token ID5445
Feature activation-1.48e+00

Pos logit contributions
rell+0.642
by+0.634
 awoke+0.630
lying+0.607
zie+0.606
Neg logit contributions
 we-0.494
 I-0.434
 who-0.402
 you-0.395
 let-0.358
Token.
Token ID13
Feature activation+7.07e-01

Pos logit contributions
by+0.070
rell+0.068
 awoke+0.067
 destination+0.065
ench+0.065
Neg logit contributions
 we-0.055
 I-0.049
 who-0.045
 you-0.044
 let-0.040
Token They
Token ID1119
Feature activation-3.27e-01

Pos logit contributions
by+0.038
 awoke+0.036
zie+0.035
rell+0.035
ench+0.034
Neg logit contributions
 we-0.038
 I-0.035
 who-0.031
 you-0.031
 let-0.029
Token said
Token ID531
Feature activation+2.56e-02

Pos logit contributions
 we+0.017
 I+0.015
 who+0.015
 let+0.014
 you+0.014
Neg logit contributions
by-0.018
rell-0.017
ench-0.017
lying-0.017
 awoke-0.017
Token their
Token ID511
Feature activation+6.84e+00

Pos logit contributions
by+0.001
 awoke+0.001
rell+0.001
 destination+0.001
ench+0.001
Neg logit contributions
 we-0.002
 I-0.002
 who-0.002
 you-0.002
 yourselves-0.001
Token good
Token ID922
Feature activation-1.31e+00

Pos logit contributions
by+0.367
rell+0.357
 awoke+0.355
lying+0.342
ench+0.340
Neg logit contributions
 we-0.357
 I-0.321
 you-0.293
 who-0.292
 let-0.267
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
 we+0.112
 I+0.104
 you+0.100
 who+0.098
 let+0.094
Neg logit contributions
rell-0.029
 awoke-0.026
rol-0.024
lying-0.024
leys-0.023
Tokenes
Token ID274
Feature activation-2.23e+00

Pos logit contributions
by+1.018
rell+0.971
 awoke+0.967
 destination+0.928
ench+0.922
Neg logit contributions
 we-0.973
 I-0.882
 you-0.798
 who-0.794
 yourselves-0.732
Token and
Token ID290
Feature activation-3.81e+00

Pos logit contributions
 we+0.095
 I+0.082
 who+0.080
 you+0.074
 let+0.071
Neg logit contributions
by-0.141
rell-0.139
ench-0.134
zie-0.133
 destination-0.133
Token then
Token ID788
Feature activation-7.55e+00

Pos logit contributions
 we+0.206
 I+0.193
 you+0.173
 who+0.173
 yourselves+0.166
Neg logit contributions
by-0.191
 awoke-0.181
rell-0.178
 destination-0.174
lying-0.168
Token each
Token ID1123
Feature activation-2.08e+00

Pos logit contributions
 we+0.376
 I+0.336
 who+0.306
 you+0.302
 let+0.281
Neg logit contributions
by-0.426
rell-0.418
 awoke-0.408
ench-0.400
lying-0.397
Token went
Token ID1816
Feature activation+6.39e+00

Pos logit contributions
 we+0.111
 I+0.101
 who+0.097
 you+0.093
 let+0.090
Neg logit contributions
by-0.108
rell-0.107
 destination-0.104
lying-0.103
ench-0.102
Token.
Token ID13
Feature activation+3.78e-01

Pos logit contributions
 we+0.397
 I+0.365
 you+0.331
 who+0.326
 yourselves+0.322
Neg logit contributions
by-0.362
rell-0.332
 awoke-0.325
lying-0.319
zie-0.318
Token We
Token ID775
Feature activation-1.06e+01

Pos logit contributions
 Eventually+0.022
zie+0.021
rell+0.020
 Today+0.020
 awoke+0.020
Neg logit contributions
 pige-0.014
ms-0.013
 we-0.013
 who-0.013
 wasting-0.013
Token have
Token ID423
Feature activation-6.68e+00

Pos logit contributions
 we+0.478
 I+0.426
 who+0.381
 you+0.378
 let+0.340
Neg logit contributions
by-0.647
rell-0.626
 awoke-0.621
 destination-0.604
ench-0.601
Token our
Token ID674
Feature activation-1.88e-01

Pos logit contributions
 we+0.382
 I+0.353
 who+0.326
 you+0.320
 let+0.299
Neg logit contributions
by-0.319
rell-0.302
 awoke-0.301
 destination-0.292
ench-0.288
Token umb
Token ID20810
Feature activation+6.88e+00

Pos logit contributions
 we+0.010
 I+0.010
 who+0.009
 you+0.009
 yourselves+0.008
Neg logit contributions
by-0.009
rell-0.009
 destination-0.009
 awoke-0.008
ench-0.008
Tokenrell
Token ID11252
Feature activation+1.88e+01

Pos logit contributions
by+0.102
 awoke+0.096
 doorway+0.057
 destination+0.056
lying+0.051
Neg logit contributions
 we-0.634
 I-0.594
 you-0.577
 who-0.559
 let-0.547
Tokenas
Token ID292
Feature activation-2.56e+00

Pos logit contributions
by+1.045
rell+0.973
ench+0.953
rol+0.942
rolled+0.892
Neg logit contributions
 we-0.948
 I-0.893
 who-0.776
 you-0.770
Bad-0.759
Token to
Token ID284
Feature activation+4.45e+00

Pos logit contributions
 we+0.116
 I+0.107
 you+0.092
 who+0.091
Bad+0.087
Neg logit contributions
by-0.147
rell-0.144
 awoke-0.140
 destination-0.137
ench-0.134
Token keep
Token ID1394
Feature activation-2.29e+00

Pos logit contributions
rell+0.215
by+0.212
 destination+0.211
ench+0.206
 awoke+0.202
Neg logit contributions
 we-0.237
 I-0.217
 who-0.194
 you-0.188
 spies-0.187
Token us
Token ID514
Feature activation+7.17e+00

Pos logit contributions
 we+0.105
 I+0.101
 who+0.088
Bad+0.086
 spies+0.083
Neg logit contributions
rell-0.125
by-0.119
 home-0.114
 destination-0.114
zie-0.112
Token dry
Token ID5894
Feature activation-1.83e+00

Pos logit contributions
by+0.362
rell+0.356
ench+0.344
 destination+0.341
 awoke+0.339
Neg logit contributions
 we-0.370
 I-0.350
 who-0.309
 you-0.303
Bad-0.298
Token and
Token ID290
Feature activation-3.95e+00

Pos logit contributions
 we+0.256
 I+0.228
 who+0.209
 you+0.204
 let+0.185
Neg logit contributions
by-0.344
rell-0.338
 awoke-0.332
 destination-0.322
ench-0.322
Token decided
Token ID3066
Feature activation+2.76e+00

Pos logit contributions
 we+0.203
 I+0.186
 you+0.168
 who+0.164
 yourselves+0.162
Neg logit contributions
by-0.213
 awoke-0.203
rell-0.193
 destination-0.193
zie-0.190
Token to
Token ID284
Feature activation+4.66e+00

Pos logit contributions
by+0.131
 awoke+0.121
 destination+0.119
rell+0.115
lying+0.115
Neg logit contributions
 we-0.150
 I-0.139
 who-0.130
 yourselves-0.126
 you-0.126
Token take
Token ID1011
Feature activation-4.71e+00

Pos logit contributions
by+0.266
rell+0.256
 awoke+0.256
 destination+0.248
ench+0.247
Neg logit contributions
 we-0.225
 I-0.201
 who-0.183
 you-0.180
 yourselves-0.164
Token Crab
Token ID32379
Feature activation+7.19e-01

Pos logit contributions
 we+0.186
 I+0.165
 you+0.152
 who+0.141
 let+0.128
Neg logit contributions
by-0.305
 awoke-0.304
rell-0.302
zie-0.296
 destination-0.296
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.012
 doorway+0.012
 awoke+0.011
rol+0.010
rolled+0.009
Neg logit contributions
 we-0.067
 you-0.062
 who-0.061
 I-0.061
 let-0.059
Token's
Token ID338
Feature activation+1.06e+01

Pos logit contributions
rell+1.052
 destination+1.045
 awoke+1.009
lying+1.008
zie+1.007
Neg logit contributions
 we-0.956
 who-0.868
 you-0.826
 I-0.823
 let-0.737
Token advice
Token ID5608
Feature activation+3.70e+00

Pos logit contributions
by+0.531
rell+0.516
 awoke+0.507
lying+0.487
ench+0.487
Neg logit contributions
 we-0.599
 I-0.547
 who-0.502
 you-0.502
 let-0.462
Token.
Token ID13
Feature activation+7.49e-01

Pos logit contributions
by+0.224
rell+0.221
 awoke+0.219
 destination+0.210
zie+0.210
Neg logit contributions
 we-0.167
 I-0.149
 who-0.139
 you-0.135
 let-0.122
Token T
Token ID309
Feature activation-4.04e+00

Pos logit contributions
by+0.035
rell+0.033
 awoke+0.033
 destination+0.032
lying+0.032
Neg logit contributions
 we-0.044
 I-0.040
 who-0.037
 you-0.037
 let-0.034
Tokenuna
Token ID9613
Feature activation-1.01e+00

Pos logit contributions
 we+0.173
 I+0.154
 you+0.138
 who+0.137
 let+0.122
Neg logit contributions
by-0.250
rell-0.248
 awoke-0.244
 destination-0.239
ench-0.237
Token broke
Token ID6265
Feature activation+2.32e+00

Pos logit contributions
 we+0.290
 I+0.262
 who+0.240
 you+0.237
 let+0.223
Neg logit contributions
by-0.317
rell-0.304
 awoke-0.300
 destination-0.295
ench-0.295
Token.
Token ID13
Feature activation+5.41e-01

Pos logit contributions
by+0.124
 awoke+0.123
rell+0.121
ench+0.118
lying+0.117
Neg logit contributions
 we-0.121
 I-0.109
 who-0.102
 you-0.100
 let-0.092
Token\n
Token ID198
Feature activation+1.21e+00

Pos logit contributions
by+0.031
rell+0.028
 awoke+0.028
 destination+0.028
ench+0.028
Neg logit contributions
 we-0.028
 I-0.026
 who-0.023
 you-0.023
 let-0.021
Token\n
Token ID198
Feature activation+1.21e+00

Pos logit contributions
by+0.075
rell+0.073
 awoke+0.072
 destination+0.070
zie+0.070
Neg logit contributions
 we-0.055
 I-0.049
 who-0.044
 you-0.044
 let-0.039
TokenAb
Token ID4826
Feature activation+6.98e+00

Pos logit contributions
by+0.076
rell+0.074
 awoke+0.073
 destination+0.071
ench+0.071
Neg logit contributions
 we-0.052
 I-0.046
 who-0.042
 you-0.041
 let-0.037
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.142
 awoke+0.116
 doorway+0.103
rol+0.103
ench+0.100
Neg logit contributions
 we-0.599
 who-0.556
 I-0.540
 you-0.539
 let-0.523
Token was
Token ID373
Feature activation+3.07e+00

Pos logit contributions
by+0.906
 awoke+0.839
rell+0.788
 destination+0.762
rolled+0.762
Neg logit contributions
 we-1.069
 I-0.996
 you-0.905
 yourselves-0.895
 who-0.856
Token very
Token ID845
Feature activation+8.81e+00

Pos logit contributions
by+0.175
rell+0.169
 awoke+0.168
 destination+0.163
lying+0.162
Neg logit contributions
 we-0.150
 I-0.135
 who-0.123
 you-0.121
 let-0.111
Token sad
Token ID6507
Feature activation+2.51e-01

Pos logit contributions
by+0.505
rell+0.474
 awoke+0.471
ench+0.463
lying+0.459
Neg logit contributions
 we-0.429
 I-0.385
 who-0.347
 you-0.342
 let-0.312
Token.
Token ID13
Feature activation+6.43e-01

Pos logit contributions
by+0.015
rell+0.014
 awoke+0.014
 destination+0.014
ench+0.013
Neg logit contributions
 we-0.012
 I-0.011
 you-0.010
 who-0.010
 let-0.009
Token She
Token ID1375
Feature activation+5.15e+00

Pos logit contributions
by+0.035
rell+0.032
 awoke+0.032
ench+0.032
at+0.031
Neg logit contributions
 we-0.034
 I-0.034
 who-0.030
 you-0.029
 yourselves-0.027
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.013
 doorway+0.012
 awoke+0.012
rolled+0.011
rol+0.011
Neg logit contributions
 we-0.054
 who-0.050
 I-0.050
 you-0.049
 let-0.047
Token the
Token ID262
Feature activation-3.05e+00

Pos logit contributions
by+0.884
 awoke+0.805
rell+0.781
lying+0.760
 destination+0.752
Neg logit contributions
 we-1.077
 I-0.996
 yourselves-0.919
 you-0.914
 who-0.870
Token crab
Token ID32202
Feature activation-2.41e+00

Pos logit contributions
 we+0.133
 I+0.118
 who+0.107
 you+0.103
 yourselves+0.094
Neg logit contributions
by-0.190
 awoke-0.178
rell-0.177
 destination-0.176
ench-0.171
Token.
Token ID13
Feature activation+4.60e-01

Pos logit contributions
 we+0.119
 I+0.107
 who+0.101
 you+0.097
 let+0.089
Neg logit contributions
by-0.136
rell-0.134
 awoke-0.131
ench-0.129
 destination-0.129
Token Crab
Token ID32379
Feature activation+6.33e-01

Pos logit contributions
by+0.024
rell+0.023
 awoke+0.023
 destination+0.022
ench+0.022
Neg logit contributions
 we-0.024
 I-0.022
 who-0.020
 you-0.020
 let-0.018
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.014
 doorway+0.014
 awoke+0.012
 destination+0.012
rol+0.011
Neg logit contributions
 we-0.054
 I-0.050
 you-0.049
 who-0.049
 let-0.048
Token was
Token ID373
Feature activation+3.07e+00

Pos logit contributions
by+0.956
 awoke+0.862
rell+0.811
rolled+0.807
lying+0.779
Neg logit contributions
 we-0.985
 yourselves-0.948
 I-0.936
 you-0.871
You-0.831
Token good
Token ID922
Feature activation-1.44e+00

Pos logit contributions
by+0.190
rell+0.187
 awoke+0.185
ench+0.181
 destination+0.179
Neg logit contributions
 we-0.131
 I-0.113
 you-0.100
 who-0.096
 yourselves-0.090
Token at
Token ID379
Feature activation+5.91e-02

Pos logit contributions
 we+0.082
 I+0.076
 yourselves+0.069
 you+0.067
Bad+0.066
Neg logit contributions
by-0.071
 home-0.061
ogged-0.061
 destination-0.061
lying-0.060
Token fixing
Token ID18682
Feature activation-5.90e+00

Pos logit contributions
by+0.003
rell+0.003
 awoke+0.003
 destination+0.003
ench+0.003
Neg logit contributions
 we-0.003
 I-0.003
 who-0.002
 you-0.002
 yourselves-0.002
Token things
Token ID1243
Feature activation-4.03e-01

Pos logit contributions
 we+0.323
 I+0.294
 who+0.266
 you+0.263
 let+0.246
Neg logit contributions
by-0.304
rell-0.291
 awoke-0.284
 destination-0.278
lying-0.275
Token orange
Token ID10912
Feature activation-2.24e+00

Pos logit contributions
 we+0.007
 I+0.006
 who+0.006
 you+0.005
 let+0.005
Neg logit contributions
by-0.006
rell-0.005
 awoke-0.005
 destination-0.005
lying-0.005
Token and
Token ID290
Feature activation-3.84e+00

Pos logit contributions
 we+0.105
 I+0.095
 you+0.086
 who+0.086
 let+0.075
Neg logit contributions
by-0.133
 awoke-0.129
rell-0.126
ench-0.126
 destination-0.124
Token pick
Token ID2298
Feature activation-2.90e+00

Pos logit contributions
 we+0.180
 I+0.160
 you+0.145
 who+0.143
 yourselves+0.138
Neg logit contributions
by-0.219
 awoke-0.204
 destination-0.202
rell-0.200
 doorway-0.197
Token.
Token ID13
Feature activation+7.65e-01

Pos logit contributions
 we+0.137
 I+0.123
 who+0.114
 you+0.113
 let+0.101
Neg logit contributions
by-0.166
 awoke-0.163
rell-0.163
 destination-0.157
ench-0.157
Token Lib
Token ID7980
Feature activation+7.93e+00

Pos logit contributions
by+0.040
 awoke+0.038
rell+0.038
ench+0.037
 destination+0.037
Neg logit contributions
 we-0.042
 I-0.041
 who-0.035
 you-0.034
 let-0.032
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.175
rol+0.117
lying+0.113
bank+0.110
 doorway+0.105
Neg logit contributions
 we-0.696
 who-0.663
 you-0.636
 I-0.633
 let-0.633
Token shouted
Token ID17293
Feature activation+1.64e+00

Pos logit contributions
by+0.940
 awoke+0.886
rell+0.845
 destination+0.825
rolled+0.820
Neg logit contributions
 we-1.045
 I-0.961
 you-0.888
 yourselves-0.861
 who-0.844
Token out
Token ID503
Feature activation+1.06e+01

Pos logit contributions
by+0.091
rell+0.089
 awoke+0.087
 destination+0.085
ench+0.085
Neg logit contributions
 we-0.083
 I-0.076
 who-0.069
 you-0.068
 let-0.062
Token the
Token ID262
Feature activation-2.81e+00

Pos logit contributions
rell+0.630
by+0.630
 awoke+0.611
zie+0.606
ench+0.601
Neg logit contributions
 we-0.494
 I-0.440
 who-0.414
 you-0.387
 let-0.368
Token words
Token ID2456
Feature activation-3.90e+00

Pos logit contributions
 we+0.137
 I+0.126
 you+0.114
 who+0.113
 let+0.103
Neg logit contributions
by-0.160
 awoke-0.157
rell-0.156
ench-0.151
lying-0.148
Token excited
Token ID6568
Feature activation+1.21e-01

Pos logit contributions
 we+0.154
 I+0.135
 who+0.126
 you+0.116
 let+0.110
Neg logit contributions
by-0.261
rell-0.254
ench-0.252
 awoke-0.245
zie-0.245
Token a
Token ID257
Feature activation-2.91e-01

Pos logit contributions
by+0.211
 awoke+0.206
rell+0.205
ench+0.198
zie+0.195
Neg logit contributions
 we-0.126
 I-0.109
 you-0.093
 who-0.091
 yourselves-0.085
Token little
Token ID1310
Feature activation-4.43e+00

Pos logit contributions
 yourselves+0.015
ms+0.015
 we+0.014
 I+0.013
 you+0.013
Neg logit contributions
by-0.014
 awoke-0.012
 home-0.012
 doorway-0.012
 destination-0.011
Token girl
Token ID2576
Feature activation-2.78e+00

Pos logit contributions
 we+0.214
 yourselves+0.196
 I+0.192
 letting+0.187
 not+0.184
Neg logit contributions
by-0.242
 awoke-0.205
 home-0.202
rell-0.202
 doorway-0.200
Token named
Token ID3706
Feature activation-8.72e+00

Pos logit contributions
 we+0.143
 I+0.128
 yourselves+0.125
Bad+0.118
 you+0.116
Neg logit contributions
by-0.143
 awoke-0.127
jo-0.124
zie-0.121
 doorway-0.120
Token Gab
Token ID12300
Feature activation+6.94e+00

Pos logit contributions
 we+0.259
 I+0.207
 who+0.186
 you+0.174
 yourselves+0.164
Neg logit contributions
by-0.645
rell-0.630
 awoke-0.622
zie-0.618
ench-0.604
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.084
ench+0.068
 awoke+0.063
 destination+0.055
rol+0.055
Neg logit contributions
 we-0.655
 who-0.616
 I-0.613
 you-0.598
 let-0.578
Token who
Token ID508
Feature activation-1.20e+01

Pos logit contributions
by+1.042
 awoke+0.970
rell+0.958
ench+0.917
 destination+0.916
Neg logit contributions
 we-0.965
 I-0.874
 you-0.792
 who-0.743
 yourselves-0.735
Token liked
Token ID8288
Feature activation-5.69e+00

Pos logit contributions
 we+0.494
 I+0.451
 who+0.422
 you+0.396
 let+0.374
Neg logit contributions
rell-0.774
 destination-0.770
by-0.754
ench-0.737
lying-0.737
Token to
Token ID284
Feature activation+4.68e+00

Pos logit contributions
 we+0.334
 I+0.307
 who+0.282
 you+0.281
 yourselves+0.262
Neg logit contributions
by-0.264
rell-0.251
 awoke-0.248
 destination-0.237
ench-0.235
Token roll
Token ID4836
Feature activation-2.83e+00

Pos logit contributions
by+0.264
rell+0.254
 awoke+0.252
 destination+0.246
lying+0.244
Neg logit contributions
 we-0.233
 I-0.211
 who-0.190
 you-0.190
 let-0.174
Token around
Token ID1088
Feature activation+3.75e+00

Pos logit contributions
 we+0.139
 I+0.127
 you+0.121
 who+0.116
 yourselves+0.102
Neg logit contributions
 awoke-0.162
by-0.158
 destination-0.157
 doorway-0.154
rell-0.154
Token sh
Token ID427
Feature activation+3.34e+00

Pos logit contributions
by+0.058
rell+0.055
 awoke+0.055
 destination+0.054
lying+0.053
Neg logit contributions
 we-0.051
 I-0.046
 who-0.041
 you-0.041
 let-0.038
Tokenru
Token ID622
Feature activation+1.65e-01

Pos logit contributions
by+0.132
 awoke+0.126
rell+0.125
 destination+0.121
 doorway+0.116
Neg logit contributions
 we-0.219
 I-0.203
 who-0.194
 you-0.191
 let-0.178
Tokengging
Token ID18792
Feature activation-1.70e+00

Pos logit contributions
by+0.009
rell+0.008
lying+0.008
 awoke+0.008
ench+0.008
Neg logit contributions
 we-0.009
 I-0.008
 who-0.007
 you-0.007
 let-0.007
Token.
Token ID13
Feature activation+4.73e-01

Pos logit contributions
 we+0.086
 I+0.078
 who+0.071
 you+0.071
 let+0.065
Neg logit contributions
by-0.093
rell-0.091
 awoke-0.090
 destination-0.087
ench-0.087
Token Rob
Token ID3851
Feature activation+3.45e+00

Pos logit contributions
by+0.022
rell+0.020
 awoke+0.020
 destination+0.019
ench+0.019
Neg logit contributions
 we-0.029
 I-0.026
 who-0.024
 you-0.024
 let-0.022
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.089
ench+0.076
 doorway+0.076
 awoke+0.073
zie+0.069
Neg logit contributions
 we-0.274
 I-0.256
 who-0.254
 you-0.246
 let-0.242
Token thought
Token ID1807
Feature activation-2.86e+00

Pos logit contributions
by+0.969
 awoke+0.922
rell+0.879
 destination+0.858
lying+0.856
Neg logit contributions
 we-1.009
 I-0.927
 you-0.839
 yourselves-0.821
 who-0.805
Token this
Token ID428
Feature activation-3.95e+00

Pos logit contributions
 we+0.120
 I+0.105
 who+0.094
 you+0.091
 yourselves+0.088
Neg logit contributions
by-0.176
rell-0.171
 awoke-0.171
 destination-0.166
zie-0.163
Token was
Token ID373
Feature activation+3.01e+00

Pos logit contributions
 we+0.225
 I+0.205
 you+0.188
 who+0.187
 yourselves+0.178
Neg logit contributions
by-0.191
 awoke-0.183
rell-0.181
 destination-0.173
ench-0.171
Token funny
Token ID8258
Feature activation-4.79e-01

Pos logit contributions
by+0.180
 awoke+0.174
rell+0.174
 destination+0.169
zie+0.167
Neg logit contributions
 we-0.128
 I-0.110
 who-0.100
 you-0.096
 yourselves-0.093
Token and
Token ID290
Feature activation-4.06e+00

Pos logit contributions
 we+0.024
 I+0.022
 who+0.020
 you+0.020
 let+0.018
Neg logit contributions
by-0.026
rell-0.025
 awoke-0.025
 destination-0.024
ench-0.024
Token the
Token ID262
Feature activation-3.37e+00

Pos logit contributions
by+0.229
 awoke+0.217
 destination+0.215
rell+0.212
zie+0.209
Neg logit contributions
 we-0.182
 I-0.167
 who-0.150
 you-0.147
 yourselves-0.141
Token kids
Token ID3988
Feature activation-5.05e+00

Pos logit contributions
 we+0.129
 I+0.113
 who+0.100
 you+0.097
 let+0.086
Neg logit contributions
by-0.226
rell-0.221
 awoke-0.219
 destination-0.214
zie-0.212
Token collected
Token ID7723
Feature activation+4.13e+00

Pos logit contributions
 we+0.257
 who+0.233
 I+0.224
 let+0.215
 you+0.204
Neg logit contributions
rell-0.280
by-0.274
ench-0.266
lying-0.266
 destination-0.262
Token their
Token ID511
Feature activation+6.38e+00

Pos logit contributions
by+0.271
 awoke+0.263
rell+0.263
lying+0.257
 destination+0.253
Neg logit contributions
 we-0.167
 I-0.145
 who-0.132
 you-0.128
 let-0.114
Token umb
Token ID20810
Feature activation+6.82e+00

Pos logit contributions
rell+0.376
 awoke+0.372
by+0.369
zie+0.366
jo+0.365
Neg logit contributions
 we-0.302
 I-0.267
 you-0.248
 who-0.244
 let-0.217
Tokenrell
Token ID11252
Feature activation+1.88e+01

Pos logit contributions
by+0.153
 awoke+0.138
 destination+0.097
ench+0.094
lying+0.089
Neg logit contributions
 we-0.569
 I-0.536
 you-0.514
 who-0.489
 that-0.489
Tokenas
Token ID292
Feature activation-2.48e+00

Pos logit contributions
by+1.187
 awoke+1.183
rell+1.155
 destination+1.138
lying+1.125
Neg logit contributions
 we-0.787
 I-0.689
 who-0.619
 you-0.614
 let-0.539
Token and
Token ID290
Feature activation-4.08e+00

Pos logit contributions
 we+0.097
 I+0.081
 who+0.078
 you+0.072
 let+0.064
Neg logit contributions
by-0.168
rell-0.164
 awoke-0.164
ench-0.160
lying-0.159
Token went
Token ID1816
Feature activation+6.02e+00

Pos logit contributions
 we+0.220
 I+0.204
 you+0.186
 who+0.184
 yourselves+0.175
Neg logit contributions
by-0.208
rell-0.197
 awoke-0.196
 destination-0.189
zie-0.187
Token out
Token ID503
Feature activation+1.04e+01

Pos logit contributions
by+0.332
 awoke+0.331
rell+0.329
lying+0.329
ench+0.317
Neg logit contributions
 we-0.312
 I-0.273
 who-0.260
 you-0.251
 that-0.238
Token to
Token ID284
Feature activation+4.45e+00

Pos logit contributions
by+0.628
rell+0.628
 awoke+0.610
lying+0.608
ench+0.600
Neg logit contributions
 we-0.492
 I-0.421
 who-0.418
 you-0.385
 let-0.351
Token her
Token ID607
Feature activation+8.57e+00

Pos logit contributions
by+0.693
rell+0.681
 awoke+0.680
ench+0.678
lying+0.675
Neg logit contributions
 we-0.418
 I-0.365
 who-0.350
 you-0.326
 let-0.290
Token tools
Token ID4899
Feature activation+1.74e-01

Pos logit contributions
by+0.467
rell+0.450
 awoke+0.446
 destination+0.432
ench+0.429
Neg logit contributions
 we-0.444
 I-0.402
 who-0.366
 you-0.363
 let-0.332
Token and
Token ID290
Feature activation-3.87e+00

Pos logit contributions
by+0.014
 awoke+0.014
rell+0.013
ench+0.013
zie+0.013
Neg logit contributions
 we-0.005
 who-0.004
 I-0.004
 you-0.003
 that-0.003
Token fixed
Token ID5969
Feature activation-1.05e+00

Pos logit contributions
 we+0.203
 I+0.186
 who+0.169
 you+0.168
 yourselves+0.154
Neg logit contributions
by-0.206
rell-0.197
 awoke-0.196
 destination-0.189
lying-0.187
Token Rob
Token ID3851
Feature activation+3.47e+00

Pos logit contributions
 we+0.054
 I+0.049
 who+0.045
 you+0.044
 let+0.040
Neg logit contributions
by-0.057
 awoke-0.055
rell-0.055
ench-0.054
lying-0.053
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.058
 awoke+0.055
 doorway+0.049
ench+0.046
rol+0.042
Neg logit contributions
 we-0.314
 who-0.301
 you-0.292
 I-0.286
 let-0.269
Token's
Token ID338
Feature activation+1.07e+01

Pos logit contributions
by+0.901
rell+0.874
 awoke+0.869
 destination+0.841
ench+0.838
Neg logit contributions
 we-1.082
 I-0.981
 who-0.931
 you-0.911
 let-0.846
Token arms
Token ID5101
Feature activation+4.24e+00

Pos logit contributions
by+0.566
 awoke+0.552
rell+0.549
lying+0.526
ench+0.526
Neg logit contributions
 we-0.568
 I-0.518
 who-0.473
 you-0.470
 let-0.433
Token and
Token ID290
Feature activation-3.85e+00

Pos logit contributions
by+0.292
 awoke+0.284
rell+0.274
 destination+0.274
zie+0.273
Neg logit contributions
 we-0.155
 I-0.139
 who-0.135
 you-0.118
 that-0.115
Token legs
Token ID7405
Feature activation+2.06e-01

Pos logit contributions
 we+0.191
 I+0.172
 who+0.157
 you+0.155
 let+0.142
Neg logit contributions
by-0.216
rell-0.210
 awoke-0.208
ench-0.202
 destination-0.201
Token.
Token ID13
Feature activation+7.21e-01

Pos logit contributions
by+0.013
 awoke+0.013
zie+0.012
rell+0.012
ench+0.012
Neg logit contributions
 we-0.009
 I-0.008
 who-0.007
 you-0.007
 let-0.007
Token away
Token ID1497
Feature activation+1.49e+01

Pos logit contributions
rell+0.097
by+0.096
 awoke+0.094
zie+0.092
lying+0.091
Neg logit contributions
 we-0.118
 I-0.108
 you-0.102
 who-0.101
 that-0.096
Token.
Token ID13
Feature activation+6.30e-01

Pos logit contributions
by+0.860
rell+0.839
 awoke+0.827
ench+0.800
 destination+0.800
Neg logit contributions
 we-0.720
 I-0.646
 who-0.587
 you-0.579
 let-0.527
Token\n
Token ID198
Feature activation+1.19e+00

Pos logit contributions
by+0.029
rell+0.028
 awoke+0.028
 destination+0.027
lying+0.027
Neg logit contributions
 we-0.037
 I-0.034
 who-0.031
 you-0.031
 let-0.029
Token\n
Token ID198
Feature activation+1.20e+00

Pos logit contributions
by+0.066
rell+0.064
 awoke+0.063
 destination+0.062
lying+0.061
Neg logit contributions
 we-0.061
 I-0.055
 who-0.051
 you-0.050
 let-0.045
TokenShell
Token ID23248
Feature activation+6.12e-01

Pos logit contributions
by+0.074
rell+0.071
 awoke+0.070
ench+0.069
rolled+0.069
Neg logit contributions
 we-0.052
 I-0.047
 you-0.040
 who-0.039
 yourselves-0.037
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.020
 awoke+0.019
 destination+0.019
ench+0.019
 doorway+0.018
Neg logit contributions
 we-0.044
 I-0.040
 who-0.039
 you-0.038
 let-0.037
Token carried
Token ID5281
Feature activation-2.80e+00

Pos logit contributions
by+1.006
 awoke+0.965
rell+0.929
 destination+0.894
zie+0.884
Neg logit contributions
 we-0.973
 I-0.887
 you-0.807
 who-0.784
 yourselves-0.778
Token with
Token ID351
Feature activation-9.47e+00

Pos logit contributions
 we+0.104
 I+0.091
 you+0.084
 who+0.081
 let+0.071
Neg logit contributions
by-0.188
 awoke-0.187
rell-0.185
ench-0.182
 destination-0.180
Token her
Token ID607
Feature activation+8.57e+00

Pos logit contributions
 we+0.495
 I+0.453
 you+0.434
 who+0.426
 let+0.390
Neg logit contributions
 awoke-0.490
by-0.482
rell-0.479
ench-0.478
lying-0.473
Token the
Token ID262
Feature activation-2.90e+00

Pos logit contributions
by+0.467
rell+0.442
 awoke+0.439
 destination+0.434
ench+0.424
Neg logit contributions
 we-0.439
 I-0.400
 who-0.358
 you-0.358
 yourselves-0.336
Token warmth
Token ID23125
Feature activation+3.06e+00

Pos logit contributions
 we+0.152
 I+0.137
 who+0.125
 you+0.124
 let+0.114
Neg logit contributions
by-0.157
rell-0.151
 awoke-0.150
 destination-0.145
ench-0.144
Token go
Token ID467
Feature activation+6.09e+00

Pos logit contributions
 we+0.213
 I+0.188
 who+0.169
 you+0.165
 yourselves+0.156
Neg logit contributions
by-0.305
rell-0.288
 awoke-0.287
 destination-0.287
lying-0.277
Token very
Token ID845
Feature activation+8.86e+00

Pos logit contributions
by+0.337
 awoke+0.331
rell+0.324
zie+0.318
ench+0.314
Neg logit contributions
 we-0.317
 I-0.284
 you-0.262
 who-0.260
 let-0.229
Token far
Token ID1290
Feature activation-1.48e+00

Pos logit contributions
by+0.547
 destination+0.510
rell+0.509
 awoke+0.507
ench+0.499
Neg logit contributions
 we-0.386
 I-0.347
 who-0.311
 you-0.302
 yourselves-0.279
Token,
Token ID11
Feature activation+5.63e+00

Pos logit contributions
 we+0.084
 I+0.076
 who+0.070
 you+0.070
 let+0.064
Neg logit contributions
by-0.073
rell-0.071
 awoke-0.070
 destination-0.067
ench-0.067
Token Rob
Token ID3851
Feature activation+3.56e+00

Pos logit contributions
by+0.334
 destination+0.327
 awoke+0.317
rell+0.316
 Eventually+0.301
Neg logit contributions
 we-0.222
 I-0.206
 yourselves-0.180
 you-0.174
 who-0.173
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.077
 doorway+0.061
ench+0.057
 awoke+0.051
rol+0.051
Neg logit contributions
 we-0.305
 who-0.292
 I-0.283
 you-0.275
 let-0.269
Token was
Token ID373
Feature activation+3.15e+00

Pos logit contributions
by+1.010
 awoke+0.961
rell+0.932
 destination+0.911
lying+0.893
Neg logit contributions
 we-0.962
 I-0.880
 you-0.797
 who-0.760
 yourselves-0.757
Token happy
Token ID3772
Feature activation-4.35e-01

Pos logit contributions
by+0.192
rell+0.185
 awoke+0.185
 destination+0.180
ench+0.179
Neg logit contributions
 we-0.141
 I-0.125
 you-0.111
 who-0.109
 yourselves-0.100
Token to
Token ID284
Feature activation+4.70e+00

Pos logit contributions
 we+0.022
 I+0.020
 yourselves+0.019
 you+0.019
Bad+0.018
Neg logit contributions
by-0.022
 awoke-0.021
 destination-0.020
rell-0.020
ench-0.019
Token lead
Token ID1085
Feature activation+9.65e-02

Pos logit contributions
by+0.313
rell+0.304
 awoke+0.302
 destination+0.295
lying+0.293
Neg logit contributions
 we-0.183
 I-0.159
 who-0.141
 you-0.138
 yourselves-0.121
Token the
Token ID262
Feature activation-2.87e+00

Pos logit contributions
rell+0.005
 awoke+0.005
by+0.005
ench+0.005
zie+0.005
Neg logit contributions
 we-0.005
 I-0.005
 you-0.005
 who-0.005
 let-0.004
Token find
Token ID1064
Feature activation-5.41e-02

Pos logit contributions
by+0.284
rell+0.275
 awoke+0.273
 destination+0.266
ench+0.265
Neg logit contributions
 we-0.197
 I-0.175
 who-0.156
 you-0.154
 let-0.137
Token her
Token ID607
Feature activation+8.38e+00

Pos logit contributions
 we+0.002
 I+0.002
 who+0.002
 you+0.002
 let+0.002
Neg logit contributions
by-0.003
 awoke-0.003
rell-0.003
 destination-0.003
lying-0.003
Token friend
Token ID1545
Feature activation+9.24e-01

Pos logit contributions
by+0.440
 destination+0.409
 awoke+0.402
rell+0.391
ench+0.388
Neg logit contributions
 we-0.446
 I-0.413
 you-0.372
 who-0.360
 yourselves-0.355
Token,
Token ID11
Feature activation+5.45e+00

Pos logit contributions
by+0.049
rell+0.048
 awoke+0.047
ench+0.046
 destination+0.046
Neg logit contributions
 we-0.049
 I-0.044
 who-0.041
 you-0.040
 let-0.037
Token Crab
Token ID32379
Feature activation+6.15e-01

Pos logit contributions
by+0.322
 awoke+0.319
rell+0.315
zie+0.305
ench+0.301
Neg logit contributions
 we-0.223
 I-0.190
 yourselves-0.175
 you-0.171
ms-0.166
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.013
 doorway+0.012
 awoke+0.012
rolled+0.011
rol+0.011
Neg logit contributions
 we-0.054
 who-0.050
 I-0.050
 you-0.049
 let-0.047
Token the
Token ID262
Feature activation-3.05e+00

Pos logit contributions
by+0.884
 awoke+0.805
rell+0.781
lying+0.760
 destination+0.752
Neg logit contributions
 we-1.077
 I-0.996
 yourselves-0.919
 you-0.914
 who-0.870
Token crab
Token ID32202
Feature activation-2.41e+00

Pos logit contributions
 we+0.133
 I+0.118
 who+0.107
 you+0.103
 yourselves+0.094
Neg logit contributions
by-0.190
 awoke-0.178
rell-0.177
 destination-0.176
ench-0.171
Token.
Token ID13
Feature activation+4.60e-01

Pos logit contributions
 we+0.119
 I+0.107
 who+0.101
 you+0.097
 let+0.089
Neg logit contributions
by-0.136
rell-0.134
 awoke-0.131
ench-0.129
 destination-0.129
Token Crab
Token ID32379
Feature activation+6.33e-01

Pos logit contributions
by+0.024
rell+0.023
 awoke+0.023
 destination+0.022
ench+0.022
Neg logit contributions
 we-0.024
 I-0.022
 who-0.020
 you-0.020
 let-0.018
Tokenby
Token ID1525
Feature activation+1.88e+01

Pos logit contributions
rell+0.014
 doorway+0.014
 awoke+0.012
 destination+0.012
rol+0.011
Neg logit contributions
 we-0.054
 I-0.050
 you-0.049
 who-0.049
 let-0.048
TokenThank
Token ID10449
Feature activation-1.14e+01

Pos logit contributions
 we+0.148
 I+0.124
 who+0.104
 you+0.104
 spies+0.100
Neg logit contributions
by-0.248
 awoke-0.240
lying-0.239
rell-0.235
day-0.231
Token you
Token ID345
Feature activation-7.53e+00

Pos logit contributions
 we+0.198
 exc+0.142
 I+0.124
 who+0.120
 wasting+0.118
Neg logit contributions
by-0.902
day-0.879
rell-0.858
zie-0.839
ench-0.837
Token Mom
Token ID11254
Feature activation-4.53e+00

Pos logit contributions
 we+0.395
 I+0.358
 who+0.327
 you+0.324
 let+0.298
Neg logit contributions
by-0.401
rell-0.385
 awoke-0.382
 destination-0.370
ench-0.368
Tokenmy
Token ID1820
Feature activation+7.25e+00

Pos logit contributions
 we+0.254
 I+0.232
 who+0.214
 you+0.213
 let+0.196
Neg logit contributions
by-0.223
rell-0.216
 awoke-0.214
 destination-0.207
ench-0.205
Token for
Token ID329
Feature activation-1.28e+00

Pos logit contributions
by+0.459
rell+0.445
 awoke+0.442
 destination+0.430
ench+0.428
Neg logit contributions
 we-0.305
 I-0.270
 who-0.240
 you-0.238
 let-0.212
Token letting
Token ID9616
Feature activation-1.66e+01

Pos logit contributions
 we+0.059
 I+0.051
 who+0.046
ms+0.045
We+0.042
Neg logit contributions
by-0.068
rell-0.067
 awoke-0.067
day-0.066
ench-0.065
Token me
Token ID502
Feature activation+7.81e+00

Pos logit contributions
 we+0.851
 I+0.770
 you+0.702
 who+0.699
 let+0.636
Neg logit contributions
by-0.910
rell-0.879
 awoke-0.872
 destination-0.844
lying-0.840
Token help
Token ID1037
Feature activation-4.33e+00

Pos logit contributions
by+0.432
ench+0.413
rell+0.411
 awoke+0.405
 destination+0.397
Neg logit contributions
 we-0.370
 I-0.332
 who-0.304
Bad-0.297
 yourselves-0.293
Token,"
Token ID553
Feature activation-2.24e+00

Pos logit contributions
 we+0.170
 I+0.145
 who+0.132
Bad+0.132
 spies+0.125
Neg logit contributions
by-0.274
ench-0.255
rell-0.254
zie-0.252
 destination-0.251
Token said
Token ID531
Feature activation-1.46e+00

Pos logit contributions
 we+0.097
 I+0.087
 yourselves+0.077
 who+0.076
 you+0.075
Neg logit contributions
by-0.132
 awoke-0.122
rell-0.122
lying-0.120
zie-0.120
Token Lily
Token ID20037
Feature activation-8.11e+00

Pos logit contributions
 we+0.060
 I+0.052
 who+0.047
 you+0.046
 let+0.041
Neg logit contributions
by-0.095
rell-0.094
 awoke-0.093
 destination-0.091
ench-0.090
Token for
Token ID329
Feature activation-1.48e+00

Pos logit contributions
 we+0.400
 I+0.364
 who+0.333
 you+0.327
 yourselves+0.306
Neg logit contributions
by-0.411
rell-0.389
 awoke-0.389
 destination-0.377
ench-0.373
Token soaking
Token ID41714
Feature activation-1.81e+00

Pos logit contributions
 we+0.067
 I+0.060
 who+0.048
We+0.048
ms+0.047
Neg logit contributions
by-0.080
rell-0.077
 awoke-0.077
 home-0.077
zie-0.076
Token the
Token ID262
Feature activation-4.25e+00

Pos logit contributions
 we+0.073
 I+0.065
 who+0.058
 you+0.054
 let+0.051
Neg logit contributions
by-0.115
rell-0.109
 awoke-0.108
zie-0.108
ench-0.107
Token crack
Token ID8469
Feature activation-3.01e+00

Pos logit contributions
 we+0.224
 I+0.205
 who+0.187
 you+0.182
 let+0.171
Neg logit contributions
by-0.224
rell-0.213
 awoke-0.213
 destination-0.207
ench-0.204
Token and
Token ID290
Feature activation-5.63e+00

Pos logit contributions
 we+0.144
 I+0.130
 who+0.117
 you+0.115
 let+0.105
Neg logit contributions
by-0.174
rell-0.168
 awoke-0.166
 destination-0.161
ench-0.161
Token letting
Token ID9616
Feature activation-1.66e+01

Pos logit contributions
 we+0.241
 I+0.215
 who+0.206
 you+0.187
ms+0.187
Neg logit contributions
by-0.314
 awoke-0.309
rell-0.299
 destination-0.293
zie-0.289
Token me
Token ID502
Feature activation+7.74e+00

Pos logit contributions
 we+0.771
 I+0.690
 you+0.624
 who+0.617
 let+0.559
Neg logit contributions
by-0.984
rell-0.960
 awoke-0.950
lying-0.923
 destination-0.919
Token out
Token ID503
Feature activation+9.43e+00

Pos logit contributions
by+0.426
rell+0.408
 awoke+0.404
ench+0.399
 destination+0.391
Neg logit contributions
 we-0.381
 I-0.344
 who-0.310
 you-0.304
Bad-0.291
Token.
Token ID13
Feature activation-7.16e-01

Pos logit contributions
by+0.541
rell+0.525
 awoke+0.524
ench+0.502
 destination+0.502
Neg logit contributions
 we-0.436
 I-0.391
 who-0.352
 you-0.346
 yourselves-0.319
Token You
Token ID921
Feature activation-1.04e+01

Pos logit contributions
 we+0.030
ms+0.025
 wasting+0.025
 who+0.025
 pige+0.025
Neg logit contributions
 awoke-0.039
 Eventually-0.039
rell-0.038
lying-0.037
zie-0.037
Token are
Token ID389
Feature activation+2.75e+00

Pos logit contributions
 we+0.549
 I+0.499
 who+0.450
 yourselves+0.445
 you+0.443
Neg logit contributions
 awoke-0.534
by-0.531
rell-0.521
ench-0.507
 destination-0.488
Token tree
Token ID5509
Feature activation+1.37e+00

Pos logit contributions
 we+0.164
 I+0.147
 who+0.136
 you+0.135
 let+0.126
Neg logit contributions
by-0.151
 awoke-0.146
rell-0.145
 destination-0.142
lying-0.138
Token and
Token ID290
Feature activation-4.94e+00

Pos logit contributions
by+0.076
rell+0.074
 awoke+0.073
 destination+0.071
ench+0.070
Neg logit contributions
 we-0.067
 I-0.060
 you-0.054
 who-0.054
 let-0.049
Token thanked
Token ID26280
Feature activation-3.32e+00

Pos logit contributions
 we+0.219
 I+0.186
 who+0.172
 you+0.170
 yourselves+0.163
Neg logit contributions
by-0.281
 awoke-0.279
rell-0.270
 destination-0.262
 doorway-0.261
Token it
Token ID340
Feature activation-6.65e+00

Pos logit contributions
 we+0.161
 I+0.142
 who+0.129
 you+0.123
Bad+0.118
Neg logit contributions
by-0.184
rell-0.173
 awoke-0.173
 destination-0.172
zie-0.171
Token for
Token ID329
Feature activation-1.19e+00

Pos logit contributions
 we+0.317
 I+0.284
 who+0.262
 you+0.257
 let+0.233
Neg logit contributions
by-0.387
rell-0.378
 awoke-0.374
ench-0.360
lying-0.360
Token letting
Token ID9616
Feature activation-1.66e+01

Pos logit contributions
 we+0.062
 I+0.055
 who+0.049
 you+0.048
We+0.045
Neg logit contributions
by-0.060
 awoke-0.059
rell-0.059
 destination-0.058
 doorway-0.057
Token them
Token ID606
Feature activation+4.25e+00

Pos logit contributions
 we+0.715
 I+0.639
 you+0.594
 who+0.557
 let+0.511
Neg logit contributions
by-1.046
rell-1.029
 awoke-1.014
lying-0.991
ench-0.989
Token enjoy
Token ID2883
Feature activation-3.75e+00

Pos logit contributions
by+0.243
rell+0.235
 awoke+0.234
lying+0.232
zie+0.227
Neg logit contributions
 we-0.211
 I-0.189
 you-0.174
 who-0.173
 let-0.158
Token its
Token ID663
Feature activation+2.75e+00

Pos logit contributions
 we+0.159
 I+0.143
 who+0.126
Bad+0.121
 let+0.119
Neg logit contributions
by-0.217
 destination-0.211
 awoke-0.208
rell-0.203
 home-0.200
Token beauty
Token ID8737
Feature activation+2.15e+00

Pos logit contributions
by+0.128
rell+0.127
 awoke+0.124
lying+0.124
ench+0.118
Neg logit contributions
 we-0.162
 I-0.148
 you-0.139
 who-0.137
 let-0.125
Token.
Token ID13
Feature activation-7.31e-01

Pos logit contributions
by+0.124
rell+0.120
 awoke+0.118
 destination+0.115
lying+0.115
Neg logit contributions
 we-0.104
 I-0.093
 who-0.084
 you-0.084
 let-0.076
Token thanked
Token ID26280
Feature activation-3.97e+00

Pos logit contributions
 we+0.470
 I+0.422
 who+0.381
 you+0.379
 let+0.343
Neg logit contributions
by-0.564
rell-0.545
 awoke-0.541
 destination-0.525
ench-0.522
Token the
Token ID262
Feature activation-3.88e+00

Pos logit contributions
 we+0.215
 I+0.196
 who+0.180
 you+0.179
 let+0.164
Neg logit contributions
by-0.205
rell-0.198
 awoke-0.196
ench-0.189
 destination-0.189
Token star
Token ID3491
Feature activation-4.87e+00

Pos logit contributions
 we+0.182
 I+0.162
 you+0.142
 who+0.142
 yourselves+0.136
Neg logit contributions
by-0.226
 awoke-0.213
 destination-0.210
rell-0.206
 doorway-0.206
Tokenfish
Token ID11084
Feature activation-3.26e+00

Pos logit contributions
 we+0.214
 I+0.189
 who+0.174
 you+0.168
 let+0.151
Neg logit contributions
by-0.298
rell-0.295
 awoke-0.288
ench-0.282
lying-0.282
Token for
Token ID329
Feature activation-1.20e+00

Pos logit contributions
 we+0.144
 I+0.127
 who+0.121
 you+0.113
 let+0.104
Neg logit contributions
rell-0.201
by-0.200
 awoke-0.191
lying-0.190
zie-0.190
Token letting
Token ID9616
Feature activation-1.66e+01

Pos logit contributions
 we+0.056
 I+0.050
 who+0.044
 you+0.042
 yourselves+0.040
Neg logit contributions
by-0.067
 awoke-0.065
rell-0.064
 destination-0.062
zie-0.061
Token them
Token ID606
Feature activation+4.34e+00

Pos logit contributions
 we+0.790
 I+0.708
 you+0.645
 who+0.636
 let+0.577
Neg logit contributions
by-0.963
rell-0.942
 awoke-0.928
 destination-0.901
ench-0.900
Token touch
Token ID3638
Feature activation-1.71e+00

Pos logit contributions
by+0.244
rell+0.237
 awoke+0.235
lying+0.229
 destination+0.228
Neg logit contributions
 we-0.215
 I-0.193
 who-0.176
 you-0.176
 let-0.160
Token it
Token ID340
Feature activation-6.54e+00

Pos logit contributions
 we+0.075
 I+0.067
 who+0.059
 you+0.057
 let+0.054
Neg logit contributions
by-0.103
 awoke-0.098
rell-0.097
 destination-0.096
ench-0.094
Token and
Token ID290
Feature activation-5.37e+00

Pos logit contributions
 we+0.306
 I+0.271
 you+0.247
 who+0.246
 let+0.222
Neg logit contributions
by-0.394
rell-0.382
 awoke-0.377
lying-0.369
zie-0.365
Token left
Token ID1364
Feature activation-6.00e-01

Pos logit contributions
 we+0.242
 I+0.221
 who+0.197
 you+0.196
 yourselves+0.188
Neg logit contributions
by-0.315
 awoke-0.302
 destination-0.292
rell-0.290
ench-0.282
Token up
Token ID510
Feature activation+5.63e+00

Pos logit contributions
by+0.018
rell+0.017
 awoke+0.017
 destination+0.017
ench+0.017
Neg logit contributions
 we-0.016
 I-0.014
 who-0.013
 you-0.013
 let-0.012
Token.
Token ID13
Feature activation+5.66e-02

Pos logit contributions
by+0.361
rell+0.357
 awoke+0.350
ench+0.343
lying+0.343
Neg logit contributions
 we-0.232
 I-0.206
 who-0.184
 you-0.183
 let-0.158
Token Thank
Token ID6952
Feature activation-7.57e+00

Pos logit contributions
by+0.003
rell+0.003
 awoke+0.003
 destination+0.002
ench+0.002
Neg logit contributions
 we-0.003
 I-0.003
 who-0.003
 you-0.003
 let-0.003
Token you
Token ID345
Feature activation-7.45e+00

Pos logit contributions
 we+0.119
 wasting+0.075
 I+0.074
 exc+0.065
 who+0.060
Neg logit contributions
by-0.620
day-0.592
 awoke-0.578
lying-0.573
rell-0.567
Token for
Token ID329
Feature activation-1.19e+00

Pos logit contributions
 we+0.383
 I+0.346
 who+0.314
 you+0.314
 let+0.285
Neg logit contributions
by-0.407
rell-0.397
 awoke-0.391
 destination-0.379
ench-0.377
Token letting
Token ID9616
Feature activation-1.64e+01

Pos logit contributions
 we+0.053
 I+0.046
 who+0.038
We+0.038
ms+0.036
Neg logit contributions
by-0.067
rell-0.065
 awoke-0.064
zie-0.063
 destination-0.063
Token me
Token ID502
Feature activation+7.62e+00

Pos logit contributions
 we+0.830
 I+0.752
 you+0.687
 who+0.679
 let+0.620
Neg logit contributions
by-0.909
rell-0.881
 awoke-0.874
ench-0.844
 destination-0.842
Token play
Token ID711
Feature activation-9.84e-02

Pos logit contributions
by+0.425
rell+0.407
 awoke+0.401
ench+0.397
 destination+0.391
Neg logit contributions
 we-0.368
 I-0.332
 who-0.300
 you-0.293
 yourselves-0.281
Token with
Token ID351
Feature activation-1.09e+01

Pos logit contributions
 we+0.004
 I+0.003
 who+0.003
 wasting+0.003
Bad+0.003
Neg logit contributions
by-0.006
 awoke-0.006
 destination-0.006
rell-0.006
ench-0.006
Token you
Token ID345
Feature activation-7.45e+00

Pos logit contributions
 we+0.439
 I+0.377
 who+0.341
 let+0.310
 you+0.306
Neg logit contributions
by-0.690
rell-0.655
 awoke-0.645
 destination-0.642
zie-0.640
Token."
Token ID526
Feature activation-8.92e+00

Pos logit contributions
 we+0.364
 I+0.327
 who+0.298
 you+0.298
 let+0.267
Neg logit contributions
by-0.426
rell-0.418
 awoke-0.410
lying-0.398
ench-0.397
Token,
Token ID11
Feature activation+4.37e+00

Pos logit contributions
by+0.517
rell+0.500
 awoke+0.497
 destination+0.483
lying+0.479
Neg logit contributions
 we-0.383
 I-0.342
 who-0.306
 you-0.304
 let-0.273
Token mighty
Token ID18680
Feature activation-4.87e+00

Pos logit contributions
by+0.263
 awoke+0.261
rell+0.253
zie+0.248
 destination+0.246
Neg logit contributions
 we-0.161
 I-0.136
ms-0.129
 yourselves-0.125
 who-0.122
Token bird
Token ID6512
Feature activation-4.55e+00

Pos logit contributions
 we+0.246
 I+0.219
 who+0.202
 you+0.198
You+0.191
Neg logit contributions
by-0.263
 awoke-0.245
 destination-0.243
rell-0.237
 doorway-0.235
Token,
Token ID11
Feature activation+4.70e+00

Pos logit contributions
 we+0.246
 I+0.224
 you+0.202
 who+0.201
 yourselves+0.190
Neg logit contributions
by-0.233
 awoke-0.223
rell-0.220
ench-0.209
zie-0.208
Token for
Token ID329
Feature activation-1.14e+00

Pos logit contributions
by+0.277
 awoke+0.277
rell+0.263
 destination+0.257
zie+0.255
Neg logit contributions
 we-0.190
 I-0.163
 yourselves-0.146
 you-0.144
 who-0.141
Token letting
Token ID9616
Feature activation-1.64e+01

Pos logit contributions
 we+0.049
 I+0.043
 who+0.039
 you+0.035
We+0.035
Neg logit contributions
by-0.067
rell-0.065
 awoke-0.064
 destination-0.063
zie-0.062
Token me
Token ID502
Feature activation+7.85e+00

Pos logit contributions
 we+0.810
 I+0.737
 you+0.702
 who+0.651
 let+0.607
Neg logit contributions
by-0.938
 awoke-0.910
rell-0.909
lying-0.892
ench-0.886
Token see
Token ID766
Feature activation-5.55e+00

Pos logit contributions
by+0.449
rell+0.432
 awoke+0.429
ench+0.419
 destination+0.415
Neg logit contributions
 we-0.372
 I-0.334
 who-0.300
 you-0.295
 yourselves-0.277
Token the
Token ID262
Feature activation-3.98e+00

Pos logit contributions
 we+0.214
 I+0.182
 who+0.153
Bad+0.152
We+0.150
Neg logit contributions
by-0.352
rell-0.338
 awoke-0.336
 destination-0.325
zie-0.325
Token world
Token ID995
Feature activation+2.00e+00

Pos logit contributions
 we+0.199
 I+0.179
 who+0.162
 you+0.156
 yourselves+0.150
Neg logit contributions
by-0.219
 destination-0.207
 awoke-0.205
rell-0.204
ench-0.197
Token from
Token ID422
Feature activation-1.36e+00

Pos logit contributions
by+0.117
rell+0.114
 awoke+0.113
 destination+0.109
ench+0.109
Neg logit contributions
 we-0.092
 I-0.083
 who-0.075
 you-0.074
 let-0.067
Token.
Token ID13
Feature activation-5.45e-01

Pos logit contributions
 we+0.199
 I+0.179
 who+0.161
 you+0.160
 let+0.145
Neg logit contributions
by-0.250
rell-0.242
 awoke-0.240
 destination-0.233
ench-0.232
Token They
Token ID1119
Feature activation-1.97e+00

Pos logit contributions
 we+0.031
 I+0.027
 who+0.026
 you+0.025
 yourselves+0.024
Neg logit contributions
by-0.025
rell-0.025
 awoke-0.025
lying-0.024
zie-0.023
Token thanked
Token ID26280
Feature activation-3.15e+00

Pos logit contributions
 we+0.096
 I+0.086
 you+0.077
 who+0.077
 yourselves+0.071
Neg logit contributions
by-0.111
 awoke-0.107
rell-0.106
 destination-0.102
ench-0.102
Token mom
Token ID1995
Feature activation-5.87e+00

Pos logit contributions
 we+0.135
 I+0.124
 who+0.104
ms+0.103
 yourselves+0.102
Neg logit contributions
by-0.189
 awoke-0.177
rell-0.174
 destination-0.173
zie-0.173
Token for
Token ID329
Feature activation-1.24e+00

Pos logit contributions
 we+0.254
 I+0.223
 who+0.207
 you+0.200
 let+0.180
Neg logit contributions
by-0.364
rell-0.358
 awoke-0.355
ench-0.346
 destination-0.345
Token letting
Token ID9616
Feature activation-1.64e+01

Pos logit contributions
 we+0.057
 I+0.052
 who+0.042
 you+0.042
ms+0.040
Neg logit contributions
by-0.069
 awoke-0.067
rell-0.067
 doorway-0.065
 ease-0.065
Token them
Token ID606
Feature activation+4.20e+00

Pos logit contributions
 we+0.787
 I+0.698
 you+0.668
 who+0.643
 let+0.605
Neg logit contributions
by-0.937
rell-0.925
 awoke-0.916
ench-0.887
lying-0.883
Token use
Token ID779
Feature activation-8.91e+00

Pos logit contributions
by+0.244
rell+0.240
 awoke+0.236
lying+0.233
 destination+0.231
Neg logit contributions
 we-0.202
 I-0.180
 who-0.165
 you-0.165
 let-0.152
Token the
Token ID262
Feature activation-4.04e+00

Pos logit contributions
 we+0.327
 I+0.284
 who+0.247
 you+0.244
 let+0.212
Neg logit contributions
by-0.613
rell-0.596
 awoke-0.592
 destination-0.577
ench-0.575
Token mixer
Token ID33938
Feature activation-3.92e-01

Pos logit contributions
 we+0.210
 I+0.190
 who+0.174
 you+0.173
 let+0.158
Neg logit contributions
by-0.216
rell-0.209
 awoke-0.207
 destination-0.200
ench-0.200
Token and
Token ID290
Feature activation-5.22e+00

Pos logit contributions
 we+0.019
 I+0.017
 who+0.016
 you+0.015
 let+0.014
Neg logit contributions
by-0.022
 awoke-0.022
rell-0.022
lying-0.021
ench-0.021
Token
Token ID198
Feature activation+2.88e-01

Pos logit contributions
 we+0.005
 I+0.005
 who+0.004
 you+0.004
 let+0.004
Neg logit contributions
by-0.006
rell-0.006
 awoke-0.006
lying-0.006
ench-0.006
Token"
Token ID1
Feature activation-1.77e+00

Pos logit contributions
by+0.018
 awoke+0.017
rell+0.017
lying+0.017
ench+0.016
Neg logit contributions
 we-0.012
 I-0.011
 who-0.010
 you-0.009
 let-0.009
TokenThank
Token ID10449
Feature activation-1.12e+01

Pos logit contributions
 we+0.081
 I+0.069
 wasting+0.062
 yourselves+0.062
 you+0.061
Neg logit contributions
 awoke-0.096
day-0.094
by-0.094
lying-0.093
date-0.089
Token you
Token ID345
Feature activation-7.44e+00

Pos logit contributions
 we+0.185
 wasting+0.143
 exc+0.143
 who+0.136
Bad+0.125
Neg logit contributions
by-0.881
day-0.824
rell-0.814
lying-0.806
zie-0.800
Token for
Token ID329
Feature activation-1.11e+00

Pos logit contributions
 we+0.415
 I+0.378
 who+0.348
 you+0.347
 let+0.319
Neg logit contributions
by-0.374
rell-0.361
 awoke-0.357
 destination-0.343
ench-0.342
Token letting
Token ID9616
Feature activation-1.64e+01

Pos logit contributions
 we+0.051
 I+0.044
We+0.040
ms+0.040
 who+0.036
Neg logit contributions
zie-0.056
rell-0.056
 home-0.055
by-0.054
 doorway-0.053
Token us
Token ID514
Feature activation+6.28e+00

Pos logit contributions
 we+0.854
 I+0.775
 who+0.706
 you+0.704
 let+0.643
Neg logit contributions
by-0.877
rell-0.846
 awoke-0.839
 destination-0.810
ench-0.808
Token pet
Token ID4273
Feature activation-3.51e+00

Pos logit contributions
by+0.349
rell+0.334
 awoke+0.329
ench+0.325
 destination+0.322
Neg logit contributions
 we-0.306
 I-0.277
 who-0.254
 you-0.243
 yourselves-0.232
Token him
Token ID683
Feature activation+1.10e+01

Pos logit contributions
 we+0.149
 I+0.131
 who+0.119
 exc+0.114
 let+0.111
Neg logit contributions
by-0.211
 awoke-0.200
rell-0.198
 destination-0.193
zie-0.187
Token,"
Token ID553
Feature activation-2.32e+00

Pos logit contributions
by+0.735
rell+0.720
 awoke+0.714
lying+0.694
 destination+0.692
Neg logit contributions
 we-0.431
 I-0.376
 you-0.335
 who-0.332
 let-0.287
Token Ben
Token ID3932
Feature activation-8.25e+00

Pos logit contributions
 we+0.096
 I+0.085
 who+0.075
 you+0.073
 yourselves+0.072
Neg logit contributions
by-0.143
 awoke-0.136
rell-0.134
lying-0.132
 destination-0.130
Token thank
Token ID5875
Feature activation-7.02e+00

Pos logit contributions
 we+0.098
 I+0.088
 you+0.078
 who+0.076
 yourselves+0.075
Neg logit contributions
by-0.112
 awoke-0.109
rell-0.106
 destination-0.104
lying-0.102
Token Bob
Token ID5811
Feature activation+4.11e+00

Pos logit contributions
 we+0.299
 I+0.279
 who+0.242
ms+0.234
Bad+0.233
Neg logit contributions
by-0.409
rell-0.369
 destination-0.358
rolled-0.354
zie-0.354
Token and
Token ID290
Feature activation-5.22e+00

Pos logit contributions
by+0.249
rell+0.245
 awoke+0.242
 destination+0.237
ench+0.235
Neg logit contributions
 we-0.182
 I-0.162
 who-0.148
 you-0.144
 let-0.130
Token Rex
Token ID17853
Feature activation-4.73e+00

Pos logit contributions
 we+0.215
 I+0.190
 who+0.168
 you+0.167
 yourselves+0.157
Neg logit contributions
by-0.332
 awoke-0.317
rell-0.313
 destination-0.307
zie-0.303
Token for
Token ID329
Feature activation-1.38e+00

Pos logit contributions
 we+0.233
 I+0.208
 who+0.192
 you+0.188
 let+0.173
Neg logit contributions
by-0.266
rell-0.261
 awoke-0.258
ench-0.251
 destination-0.251
Token letting
Token ID9616
Feature activation-1.64e+01

Pos logit contributions
 we+0.064
 I+0.058
 you+0.047
 who+0.046
ms+0.043
Neg logit contributions
by-0.078
rell-0.075
 awoke-0.073
 ease-0.072
 destination-0.072
Token them
Token ID606
Feature activation+4.39e+00

Pos logit contributions
 we+0.800
 I+0.714
 you+0.695
 who+0.673
 let+0.630
Neg logit contributions
rell-0.908
by-0.906
 awoke-0.904
lying-0.875
ench-0.874
Token play
Token ID711
Feature activation-9.84e-02

Pos logit contributions
by+0.250
rell+0.243
 awoke+0.243
lying+0.236
zie+0.234
Neg logit contributions
 we-0.215
 I-0.192
 who-0.176
 you-0.176
 let-0.162
Token.
Token ID13
Feature activation-7.06e-01

Pos logit contributions
 we+0.005
 I+0.004
 you+0.004
 who+0.004
 yourselves+0.004
Neg logit contributions
by-0.005
 awoke-0.005
ench-0.005
rell-0.005
 destination-0.005
Token They
Token ID1119
Feature activation-1.76e+00

Pos logit contributions
 we+0.037
 I+0.032
 you+0.032
 who+0.031
 wasting+0.030
Neg logit contributions
rell-0.033
 awoke-0.033
 Eventually-0.032
by-0.032
lying-0.031
Token say
Token ID910
Feature activation-3.34e+00

Pos logit contributions
 we+0.084
 I+0.076
 you+0.067
 yourselves+0.067
 who+0.066
Neg logit contributions
by-0.100
rell-0.096
 awoke-0.095
 destination-0.092
zie-0.090
Token soon
Token ID2582
Feature activation-3.03e+00

Pos logit contributions
by+0.912
 destination+0.886
 awoke+0.854
 home+0.850
ench+0.840
Neg logit contributions
 we-0.684
 I-0.650
Bad-0.579
You-0.567
 you-0.546
Token.
Token ID13
Feature activation-6.07e-01

Pos logit contributions
 we+0.120
 I+0.107
 who+0.096
Bad+0.095
 you+0.091
Neg logit contributions
by-0.186
 awoke-0.182
 destination-0.181
rell-0.180
ench-0.177
Token Thank
Token ID6952
Feature activation-7.48e+00

Pos logit contributions
ms+0.022
 we+0.022
ts+0.021
 annoying+0.021
 wasting+0.020
Neg logit contributions
 Eventually-0.035
 Today-0.034
 awoke-0.033
 Along-0.033
 Late-0.032
Token you
Token ID345
Feature activation-7.36e+00

Pos logit contributions
 annoying+0.146
 wasting+0.132
 we+0.128
 exc+0.125
 dumb+0.113
Neg logit contributions
by-0.536
day-0.524
 home-0.495
 awoke-0.487
rell-0.479
Token for
Token ID329
Feature activation-1.06e+00

Pos logit contributions
 we+0.380
 I+0.347
 who+0.319
 you+0.310
 yourselves+0.298
Neg logit contributions
by-0.389
rell-0.367
 awoke-0.366
 destination-0.358
ench-0.351
Token letting
Token ID9616
Feature activation-1.63e+01

Pos logit contributions
 we+0.047
 I+0.043
ms+0.035
We+0.035
 who+0.034
Neg logit contributions
by-0.057
 home-0.057
 awoke-0.056
rell-0.055
 destination-0.055
Token us
Token ID514
Feature activation+6.49e+00

Pos logit contributions
 we+0.823
 I+0.743
 you+0.680
 who+0.673
 let+0.614
Neg logit contributions
by-0.908
rell-0.880
 awoke-0.872
lying-0.842
ench-0.841
Token play
Token ID711
Feature activation+1.05e-01

Pos logit contributions
by+0.342
rell+0.327
 awoke+0.323
 destination+0.315
ench+0.314
Neg logit contributions
 we-0.338
 I-0.309
 who-0.281
 you-0.276
 yourselves-0.258
Token with
Token ID351
Feature activation-1.07e+01

Pos logit contributions
 home+0.006
 destination+0.006
 awoke+0.006
date+0.006
ench+0.006
Neg logit contributions
 we-0.004
 I-0.004
 wasting-0.004
Bad-0.003
 pige-0.003
Token it
Token ID340
Feature activation-6.54e+00

Pos logit contributions
 we+0.445
 I+0.408
ms+0.366
You+0.365
 who+0.358
Neg logit contributions
by-0.621
 destination-0.583
 awoke-0.557
 home-0.556
rell-0.552
Token."
Token ID526
Feature activation-8.90e+00

Pos logit contributions
 we+0.265
 I+0.233
 who+0.206
 you+0.203
 let+0.181
Neg logit contributions
by-0.424
rell-0.411
 awoke-0.409
 destination-0.398
ench-0.396
Token says
Token ID1139
Feature activation-4.17e+00

Pos logit contributions
 we+0.505
 I+0.459
 yourselves+0.457
 who+0.427
 cooperative+0.421
Neg logit contributions
 awoke-0.539
by-0.517
rell-0.509
 destination-0.505
ench-0.496
Token,
Token ID11
Feature activation+2.92e+00

Pos logit contributions
 we+0.209
 I+0.188
 who+0.171
 you+0.169
 yourselves+0.160
Neg logit contributions
by-0.226
 awoke-0.216
rell-0.212
lying-0.205
 destination-0.205
Token "
Token ID366
Feature activation-4.53e+00

Pos logit contributions
by+0.158
 awoke+0.156
lying+0.150
rell+0.149
 destination+0.147
Neg logit contributions
 we-0.124
 I-0.108
 yourselves-0.106
Bad-0.101
 you-0.100
TokenOK
Token ID11380
Feature activation-7.40e+00

Pos logit contributions
 we+0.171
 I+0.140
 who+0.134
 wasting+0.130
 spies+0.126
Neg logit contributions
by-0.289
lying-0.282
 awoke-0.280
rell-0.276
day-0.271
Token,
Token ID11
Feature activation+3.44e+00

Pos logit contributions
 we+0.456
 I+0.418
 who+0.408
 yourselves+0.399
 wasting+0.396
Neg logit contributions
by-0.287
 destination-0.271
 awoke-0.265
rell-0.262
lying-0.257
Token let
Token ID1309
Feature activation-1.63e+01

Pos logit contributions
by+0.201
 destination+0.197
zie+0.196
 awoke+0.196
 home+0.190
Neg logit contributions
 yourselves-0.105
 wasting-0.099
 we-0.096
ms-0.096
Bad-0.094
Token's
Token ID338
Feature activation+9.18e+00

Pos logit contributions
 we+0.991
 I+0.908
 you+0.861
 who+0.831
 let+0.796
Neg logit contributions
by-0.743
rell-0.731
 awoke-0.708
lying-0.693
ench-0.684
Token play
Token ID711
Feature activation-8.75e-02

Pos logit contributions
by+0.455
 destination+0.435
ench+0.422
 awoke+0.421
rell+0.406
Neg logit contributions
 we-0.482
 I-0.436
 who-0.401
Bad-0.391
 yourselves-0.387
Token soccer
Token ID11783
Feature activation-3.42e+00

Pos logit contributions
 we+0.004
 I+0.003
 wasting+0.003
 exc+0.003
 pige+0.003
Neg logit contributions
by-0.005
 destination-0.005
 awoke-0.005
 home-0.005
ench-0.004
Token!"
Token ID2474
Feature activation-6.19e+00

Pos logit contributions
 we+0.140
 I+0.125
 who+0.114
 wasting+0.113
 yourselves+0.109
Neg logit contributions
by-0.206
 destination-0.193
rell-0.189
 awoke-0.185
ench-0.184
Token
Token ID198
Feature activation-1.66e-01

Pos logit contributions
 we+0.328
 I+0.297
 who+0.271
 you+0.270
 yourselves+0.259
Neg logit contributions
by-0.310
rell-0.303
 awoke-0.294
lying-0.291
zie-0.283
Token you
Token ID345
Feature activation-8.03e+00

Pos logit contributions
 we+0.264
 I+0.240
Bad+0.231
 who+0.226
ms+0.223
Neg logit contributions
by-0.262
 destination-0.240
rell-0.238
 home-0.237
 awoke-0.233
Token,
Token ID11
Feature activation+4.32e+00

Pos logit contributions
 we+0.346
 I+0.305
 who+0.274
 you+0.269
 yourselves+0.242
Neg logit contributions
by-0.498
rell-0.480
 awoke-0.479
 destination-0.465
ench-0.463
Token Tom
Token ID4186
Feature activation-7.66e+00

Pos logit contributions
by+0.265
 awoke+0.256
 destination+0.250
zie+0.239
ench+0.238
Neg logit contributions
 we-0.136
 yourselves-0.123
ms-0.119
Bad-0.118
 wasting-0.109
Token,
Token ID11
Feature activation+3.96e+00

Pos logit contributions
 we+0.320
 I+0.280
 who+0.247
 you+0.245
 yourselves+0.221
Neg logit contributions
by-0.489
 awoke-0.470
rell-0.465
lying-0.451
ench-0.451
Token for
Token ID329
Feature activation-1.08e+00

Pos logit contributions
by+0.238
 awoke+0.238
 destination+0.224
zie+0.222
rell+0.218
Neg logit contributions
 we-0.124
Bad-0.106
 yourselves-0.106
ms-0.102
 I-0.100
Token letting
Token ID9616
Feature activation-1.63e+01

Pos logit contributions
 we+0.049
 I+0.043
 who+0.040
We+0.035
ms+0.034
Neg logit contributions
by-0.060
 awoke-0.057
rell-0.057
 destination-0.056
ench-0.055
Token me
Token ID502
Feature activation+7.87e+00

Pos logit contributions
 we+0.840
 I+0.764
 you+0.709
 who+0.690
 let+0.636
Neg logit contributions
by-0.885
rell-0.859
 awoke-0.854
lying-0.827
 destination-0.822
Token try
Token ID1949
Feature activation-1.30e+00

Pos logit contributions
by+0.441
ench+0.420
rell+0.419
 awoke+0.418
 destination+0.407
Neg logit contributions
 we-0.369
 I-0.330
 who-0.300
Bad-0.288
 you-0.286
Token your
Token ID534
Feature activation-1.14e+00

Pos logit contributions
 we+0.055
 I+0.049
 who+0.045
Bad+0.042
 you+0.041
Neg logit contributions
by-0.079
rell-0.075
 destination-0.074
 awoke-0.073
ench-0.073
Token fl
Token ID781
Feature activation-3.84e-01

Pos logit contributions
 we+0.061
 I+0.055
 who+0.051
 you+0.050
 let+0.046
Neg logit contributions
by-0.059
rell-0.057
 awoke-0.056
 destination-0.055
ench-0.054
Tokenute
Token ID1133
Feature activation-2.99e-01

Pos logit contributions
 we+0.024
 I+0.022
 who+0.020
 you+0.020
 let+0.019
Neg logit contributions
by-0.017
 destination-0.015
ench-0.015
rell-0.015
 awoke-0.015
Token.
Token ID13
Feature activation-8.05e-01

Pos logit contributions
 we+0.082
 I+0.072
 you+0.064
 who+0.063
 yourselves+0.059
Neg logit contributions
by-0.110
 awoke-0.109
rell-0.107
ench-0.103
 destination-0.103
Token "
Token ID366
Feature activation-3.87e+00

Pos logit contributions
 we+0.040
 I+0.033
ms+0.033
 who+0.033
 wasting+0.032
Neg logit contributions
rell-0.039
 awoke-0.039
 Eventually-0.038
lying-0.037
zie-0.037
TokenThank
Token ID10449
Feature activation-1.10e+01

Pos logit contributions
 we+0.174
 I+0.142
 you+0.130
 wasting+0.123
 spies+0.121
Neg logit contributions
 awoke-0.231
by-0.228
day-0.223
lying-0.221
rell-0.216
Token you
Token ID345
Feature activation-7.51e+00

Pos logit contributions
 we+0.177
 who+0.120
 I+0.111
 exc+0.106
 let+0.100
Neg logit contributions
by-0.886
rell-0.851
day-0.846
 awoke-0.839
zie-0.830
Token for
Token ID329
Feature activation-1.08e+00

Pos logit contributions
 we+0.380
 I+0.344
 who+0.311
 you+0.311
 let+0.282
Neg logit contributions
by-0.417
rell-0.405
 awoke-0.400
 destination-0.386
lying-0.386
Token letting
Token ID9616
Feature activation-1.63e+01

Pos logit contributions
 we+0.051
 I+0.043
We+0.040
ms+0.039
 pige+0.036
Neg logit contributions
 home-0.055
zie-0.054
rell-0.054
 awoke-0.054
 doorway-0.052
Token me
Token ID502
Feature activation+7.69e+00

Pos logit contributions
 we+0.874
 I+0.794
 who+0.727
 you+0.719
 let+0.662
Neg logit contributions
by-0.846
rell-0.814
 awoke-0.807
 destination-0.780
ench-0.776
Token touch
Token ID3638
Feature activation-1.79e+00

Pos logit contributions
by+0.422
rell+0.405
ench+0.403
 awoke+0.399
 destination+0.388
Neg logit contributions
 we-0.370
 I-0.331
 who-0.301
Bad-0.295
 yourselves-0.290
Token your
Token ID534
Feature activation-1.19e+00

Pos logit contributions
 we+0.064
 I+0.053
 who+0.047
 exc+0.046
 spies+0.046
Neg logit contributions
by-0.116
rell-0.114
 awoke-0.113
zie-0.109
 destination-0.109
Token feather
Token ID17972
Feature activation+6.38e-02

Pos logit contributions
 we+0.063
 I+0.058
 who+0.053
 you+0.052
 let+0.048
Neg logit contributions
by-0.062
rell-0.060
 awoke-0.059
 destination-0.057
ench-0.057
Token,"
Token ID553
Feature activation-2.06e+00

Pos logit contributions
by+0.004
rell+0.004
 awoke+0.004
 destination+0.004
lying+0.004
Neg logit contributions
 we-0.003
 I-0.002
 who-0.002
 you-0.002
 let-0.002
Token for
Token ID329
Feature activation-9.88e-01

Pos logit contributions
rell+0.075
by+0.074
 awoke+0.073
ench+0.072
 destination+0.071
Neg logit contributions
 we-0.045
 I-0.039
 who-0.037
 you-0.034
 let-0.031
Token introducing
Token ID16118
Feature activation-6.06e+00

Pos logit contributions
 we+0.049
 I+0.044
 who+0.037
 you+0.036
We+0.035
Neg logit contributions
by-0.052
rell-0.052
 awoke-0.051
 ease-0.050
 destination-0.050
Token herself
Token ID5223
Feature activation+2.15e+00

Pos logit contributions
 we+0.339
 I+0.309
 who+0.284
 you+0.281
 let+0.260
Neg logit contributions
by-0.302
rell-0.289
 awoke-0.286
 destination-0.277
lying-0.274
Token and
Token ID290
Feature activation-5.19e+00

Pos logit contributions
by+0.127
rell+0.127
 awoke+0.125
lying+0.122
ench+0.122
Neg logit contributions
 we-0.100
 I-0.088
 who-0.084
 you-0.081
 let-0.072
Token for
Token ID329
Feature activation-9.76e-01

Pos logit contributions
 we+0.243
 I+0.218
 who+0.195
 you+0.194
 yourselves+0.180
Neg logit contributions
by-0.302
 awoke-0.289
rell-0.287
 destination-0.280
lying-0.275
Token letting
Token ID9616
Feature activation-1.63e+01

Pos logit contributions
 we+0.045
 I+0.040
 who+0.033
 you+0.033
We+0.031
Neg logit contributions
by-0.056
 awoke-0.054
rell-0.053
 destination-0.052
 doorway-0.052
Token her
Token ID607
Feature activation+7.70e+00

Pos logit contributions
 we+0.785
 I+0.708
 you+0.679
 who+0.644
 let+0.604
Neg logit contributions
rell-0.915
by-0.913
 awoke-0.901
ench-0.876
lying-0.873
Token borrow
Token ID8804
Feature activation+4.02e-01

Pos logit contributions
by+0.438
rell+0.418
 awoke+0.416
 destination+0.406
ench+0.401
Neg logit contributions
 we-0.375
 I-0.338
 who-0.305
 you-0.302
 yourselves-0.277
Token the
Token ID262
Feature activation-3.58e+00

Pos logit contributions
by+0.028
rell+0.027
 awoke+0.026
 destination+0.026
lying+0.026
Neg logit contributions
 we-0.015
 I-0.013
 who-0.011
 you-0.011
 let-0.010
Token gray
Token ID12768
Feature activation-2.97e+00

Pos logit contributions
 we+0.223
 I+0.204
 who+0.189
 yourselves+0.185
 you+0.181
Neg logit contributions
by-0.153
 awoke-0.141
 destination-0.139
rell-0.139
 doorway-0.133
Token book
Token ID1492
Feature activation+3.47e+00

Pos logit contributions
 we+0.174
 I+0.159
 who+0.147
 you+0.145
 let+0.136
Neg logit contributions
by-0.140
rell-0.133
 awoke-0.132
 destination-0.127
lying-0.125
Token thank
Token ID5875
Feature activation-6.68e+00

Pos logit contributions
 we+0.125
 I+0.114
 who+0.108
 you+0.105
 yourselves+0.101
Neg logit contributions
by-0.105
 awoke-0.101
rell-0.099
 destination-0.094
lying-0.093
Token you
Token ID345
Feature activation-7.22e+00

Pos logit contributions
 we+0.116
 I+0.088
 who+0.074
 wasting+0.074
 spies+0.070
Neg logit contributions
by-0.524
rell-0.487
lying-0.478
zie-0.478
day-0.477
Token to
Token ID284
Feature activation+3.84e+00

Pos logit contributions
 we+0.338
 I+0.300
 who+0.276
 you+0.274
 let+0.242
Neg logit contributions
rell-0.430
by-0.425
 awoke-0.416
ench-0.413
lying-0.408
Token them
Token ID606
Feature activation+4.43e+00

Pos logit contributions
by+0.195
zie+0.185
rell+0.185
 destination+0.179
 ease+0.176
Neg logit contributions
 we-0.197
 I-0.175
We-0.163
 who-0.157
ms-0.157
Token for
Token ID329
Feature activation-8.41e-01

Pos logit contributions
by+0.279
rell+0.270
 awoke+0.268
 destination+0.261
ench+0.259
Neg logit contributions
 we-0.189
 I-0.168
 who-0.149
 you-0.148
 let-0.132
Token letting
Token ID9616
Feature activation-1.62e+01

Pos logit contributions
 we+0.040
 I+0.036
We+0.030
ms+0.030
 who+0.030
Neg logit contributions
 home-0.042
 awoke-0.042
rell-0.042
 ease-0.042
by-0.042
Token him
Token ID683
Feature activation+1.13e+01

Pos logit contributions
 we+0.916
 I+0.834
 you+0.795
 who+0.767
 let+0.717
Neg logit contributions
by-0.801
rell-0.789
 awoke-0.779
ench-0.759
lying-0.754
Token see
Token ID766
Feature activation-4.98e+00

Pos logit contributions
by+0.632
rell+0.612
 awoke+0.608
 destination+0.591
lying+0.588
Neg logit contributions
 we-0.560
 I-0.504
 who-0.459
 you-0.457
 let-0.417
Token them
Token ID606
Feature activation+4.46e+00

Pos logit contributions
 we+0.261
 I+0.235
 who+0.206
 wasting+0.202
Bad+0.202
Neg logit contributions
by-0.248
 awoke-0.230
rell-0.229
 destination-0.223
zie-0.217
Token.
Token ID13
Feature activation-4.01e-01

Pos logit contributions
by+0.268
rell+0.260
 awoke+0.256
lying+0.252
zie+0.249
Neg logit contributions
 we-0.206
 I-0.182
 who-0.167
 you-0.165
 let-0.148
Token He
Token ID679
Feature activation+6.24e+00

Pos logit contributions
 we+0.026
 I+0.023
 you+0.023
 who+0.023
 wasting+0.023
Neg logit contributions
by-0.013
rell-0.013
 Eventually-0.013
 awoke-0.013
lying-0.012
Token fixing
Token ID18682
Feature activation-6.80e+00

Pos logit contributions
 we+0.056
 I+0.049
 pige+0.041
We+0.041
 you+0.040
Neg logit contributions
 ease-0.056
by-0.056
zie-0.056
 destination-0.055
rell-0.055
Token the
Token ID262
Feature activation-3.39e+00

Pos logit contributions
 we+0.317
 I+0.284
 who+0.252
 you+0.247
 let+0.230
Neg logit contributions
by-0.401
rell-0.385
 awoke-0.375
 destination-0.373
zie-0.368
Token dress
Token ID6576
Feature activation-1.95e+00

Pos logit contributions
 we+0.180
 I+0.163
 who+0.148
 you+0.147
 let+0.137
Neg logit contributions
by-0.177
rell-0.168
 awoke-0.166
 destination-0.166
ench-0.160
Token and
Token ID290
Feature activation-4.61e+00

Pos logit contributions
 we+0.089
 I+0.080
 who+0.072
 you+0.071
 let+0.064
Neg logit contributions
by-0.118
 awoke-0.114
rell-0.114
ench-0.110
lying-0.109
Token not
Token ID407
Feature activation-8.80e+00

Pos logit contributions
 we+0.218
 I+0.198
 who+0.178
 you+0.176
Bad+0.169
Neg logit contributions
by-0.254
rell-0.239
 awoke-0.237
 destination-0.235
 doorway-0.233
Token letting
Token ID9616
Feature activation-1.62e+01

Pos logit contributions
 we+0.409
 I+0.362
 you+0.325
 who+0.325
 yourselves+0.303
Neg logit contributions
by-0.514
rell-0.491
 awoke-0.483
 destination-0.476
lying-0.471
Token the
Token ID262
Feature activation-3.59e+00

Pos logit contributions
 we+0.818
 I+0.743
 you+0.682
 who+0.673
 let+0.619
Neg logit contributions
by-0.891
rell-0.868
 awoke-0.867
ench-0.832
lying-0.830
Token delay
Token ID5711
Feature activation+1.53e+00

Pos logit contributions
 we+0.171
 I+0.156
 who+0.143
 yourselves+0.131
 you+0.131
Neg logit contributions
by-0.202
 destination-0.190
rell-0.185
 doorway-0.182
 awoke-0.182
Token stop
Token ID2245
Feature activation+1.32e+00

Pos logit contributions
by+0.087
rell+0.084
 awoke+0.083
 destination+0.080
ench+0.080
Neg logit contributions
 we-0.075
 I-0.067
 who-0.061
 you-0.060
 let-0.055
Token her
Token ID607
Feature activation+7.69e+00

Pos logit contributions
by+0.072
rell+0.070
 destination+0.068
 awoke+0.068
lying+0.066
Neg logit contributions
 we-0.064
 I-0.058
 who-0.052
 you-0.051
Bad-0.048
Token from
Token ID422
Feature activation-1.05e+00

Pos logit contributions
by+0.416
rell+0.393
 destination+0.389
 awoke+0.385
lying+0.375
Neg logit contributions
 we-0.391
 I-0.359
 who-0.322
 you-0.319
 yourselves-0.303
Token,
Token ID11
Feature activation+4.80e+00

Pos logit contributions
 we+0.063
 I+0.059
 who+0.055
 you+0.055
 let+0.052
Neg logit contributions
by-0.033
rell-0.032
 awoke-0.031
 destination-0.030
ench-0.029
Token "
Token ID366
Feature activation-3.52e+00

Pos logit contributions
 awoke+0.262
by+0.261
rell+0.254
lying+0.249
 destination+0.244
Neg logit contributions
 we-0.219
 I-0.193
 yourselves-0.182
 you-0.181
 who-0.173
TokenThank
Token ID10449
Feature activation-1.07e+01

Pos logit contributions
 we+0.147
 I+0.126
 who+0.113
 you+0.107
 wasting+0.106
Neg logit contributions
by-0.223
 awoke-0.220
lying-0.208
rell-0.208
day-0.204
Token you
Token ID345
Feature activation-7.24e+00

Pos logit contributions
 we+0.178
 who+0.134
 wasting+0.129
 exc+0.123
 I+0.110
Neg logit contributions
by-0.842
day-0.813
rell-0.805
 awoke-0.804
zie-0.788
Token for
Token ID329
Feature activation-8.54e-01

Pos logit contributions
 we+0.384
 I+0.348
 who+0.318
 you+0.315
 let+0.290
Neg logit contributions
by-0.382
rell-0.366
 awoke-0.364
 destination-0.352
ench-0.350
Token letting
Token ID9616
Feature activation-1.62e+01

Pos logit contributions
 we+0.039
 I+0.033
ms+0.029
 who+0.028
We+0.028
Neg logit contributions
 awoke-0.047
rell-0.046
 home-0.045
 destination-0.045
by-0.045
Token me
Token ID502
Feature activation+8.14e+00

Pos logit contributions
 we+0.871
 I+0.791
 who+0.725
 you+0.714
 let+0.659
Neg logit contributions
by-0.840
rell-0.808
 awoke-0.801
 destination-0.776
ench-0.771
Token fly
Token ID6129
Feature activation-4.45e+00

Pos logit contributions
by+0.428
rell+0.410
 awoke+0.405
ench+0.401
 destination+0.397
Neg logit contributions
 we-0.409
 I-0.369
 who-0.339
Bad-0.327
 you-0.326
Token your
Token ID534
Feature activation-7.69e-01

Pos logit contributions
 we+0.217
 I+0.195
 who+0.177
 you+0.173
 let+0.160
Neg logit contributions
by-0.251
rell-0.241
 awoke-0.239
 destination-0.234
ench-0.232
Token k
Token ID479
Feature activation-6.09e-01

Pos logit contributions
 we+0.044
 I+0.041
 who+0.037
 you+0.037
 let+0.034
Neg logit contributions
by-0.037
rell-0.036
 awoke-0.035
lying-0.034
ench-0.034
Tokenite
Token ID578
Feature activation-5.37e-01

Pos logit contributions
 we+0.035
 I+0.032
 who+0.029
 you+0.029
 let+0.027
Neg logit contributions
by-0.030
rell-0.028
 destination-0.027
 awoke-0.027
ench-0.027
Token Mom
Token ID11254
Feature activation-4.67e+00

Pos logit contributions
 we+0.153
 I+0.138
 you+0.114
 who+0.110
 yourselves+0.106
Neg logit contributions
by-0.256
 awoke-0.241
rell-0.237
 destination-0.237
ench-0.231
Token and
Token ID290
Feature activation-5.18e+00

Pos logit contributions
 we+0.192
 I+0.167
 who+0.160
 you+0.149
 let+0.137
Neg logit contributions
by-0.297
rell-0.293
 awoke-0.291
ench-0.285
 destination-0.284
Token thanked
Token ID26280
Feature activation-2.90e+00

Pos logit contributions
 we+0.234
 I+0.212
 you+0.195
 yourselves+0.191
 who+0.189
Neg logit contributions
by-0.300
 awoke-0.290
rell-0.272
zie-0.268
lying-0.266
Token her
Token ID607
Feature activation+7.41e+00

Pos logit contributions
 we+0.159
 I+0.148
 who+0.132
Bad+0.129
 yourselves+0.128
Neg logit contributions
by-0.140
 awoke-0.128
 destination-0.128
zie-0.126
rell-0.124
Token for
Token ID329
Feature activation-9.56e-01

Pos logit contributions
by+0.408
 awoke+0.383
rell+0.383
 destination+0.378
ench+0.371
Neg logit contributions
 we-0.373
 I-0.341
 you-0.306
 who-0.303
 yourselves-0.286
Token letting
Token ID9616
Feature activation-1.62e+01

Pos logit contributions
 we+0.044
 I+0.041
 you+0.033
 who+0.032
ms+0.031
Neg logit contributions
by-0.052
rell-0.051
 destination-0.051
 awoke-0.051
 doorway-0.050
Token them
Token ID606
Feature activation+4.41e+00

Pos logit contributions
 we+0.744
 I+0.657
 you+0.641
 who+0.611
 let+0.587
Neg logit contributions
rell-0.961
by-0.942
 awoke-0.928
lying-0.910
ench-0.901
Token go
Token ID467
Feature activation+5.34e+00

Pos logit contributions
by+0.244
rell+0.237
 awoke+0.235
lying+0.232
 destination+0.228
Neg logit contributions
 we-0.226
 I-0.202
 who-0.187
 you-0.186
 let-0.172
Token out
Token ID503
Feature activation+9.86e+00

Pos logit contributions
by+0.252
 awoke+0.246
rell+0.241
lying+0.235
ench+0.231
Neg logit contributions
 we-0.328
 I-0.294
 you-0.277
 who-0.275
 let-0.254
Token.
Token ID13
Feature activation-8.16e-01

Pos logit contributions
by+0.589
rell+0.570
 awoke+0.564
lying+0.547
ench+0.547
Neg logit contributions
 we-0.459
 I-0.408
 who-0.370
 you-0.366
 let-0.330
Token They
Token ID1119
Feature activation-1.81e+00

Pos logit contributions
 we+0.043
 I+0.038
 you+0.036
 who+0.036
 let+0.033
Neg logit contributions
by-0.041
rell-0.040
 awoke-0.040
lying-0.038
zie-0.038
Token said
Token ID531
Feature activation-1.94e+00

Pos logit contributions
by+0.395
 awoke+0.376
rell+0.375
lying+0.360
 destination+0.359
Neg logit contributions
 we-0.370
 I-0.335
 you-0.303
 who-0.302
 yourselves-0.279
Token,
Token ID11
Feature activation+3.99e+00

Pos logit contributions
 we+0.096
 I+0.086
 who+0.078
 you+0.078
 yourselves+0.075
Neg logit contributions
by-0.106
 awoke-0.102
rell-0.099
lying-0.096
 destination-0.095
Token "
Token ID366
Feature activation-4.12e+00

Pos logit contributions
 awoke+0.206
by+0.204
lying+0.193
rell+0.190
 destination+0.188
Neg logit contributions
 we-0.173
 yourselves-0.156
 I-0.152
ms-0.150
Bad-0.148
TokenOkay
Token ID16454
Feature activation-5.54e+00

Pos logit contributions
 we+0.167
 I+0.140
 spies+0.140
 wasting+0.133
 who+0.129
Neg logit contributions
by-0.243
day-0.229
lying-0.226
side-0.225
 awoke-0.225
Token,
Token ID11
Feature activation+3.96e+00

Pos logit contributions
 we+0.349
 I+0.319
 who+0.319
 pige+0.311
 wasting+0.311
Neg logit contributions
by-0.205
 destination-0.193
zie-0.187
rell-0.184
 awoke-0.182
Token let
Token ID1309
Feature activation-1.62e+01

Pos logit contributions
zie+0.247
 destination+0.244
by+0.244
 awoke+0.228
 home+0.224
Neg logit contributions
 pige-0.108
ms-0.107
Bad-0.106
 wasting-0.106
 yourselves-0.105
Token's
Token ID338
Feature activation+9.35e+00

Pos logit contributions
 we+0.995
 I+0.911
 you+0.861
 who+0.837
 let+0.799
Neg logit contributions
by-0.724
rell-0.704
 awoke-0.688
lying-0.672
 destination-0.666
Token pack
Token ID2353
Feature activation+8.15e-01

Pos logit contributions
by+0.437
 destination+0.435
ench+0.417
 awoke+0.403
rolled+0.390
Neg logit contributions
 we-0.491
 I-0.447
ms-0.423
Bad-0.423
 who-0.423
Token up
Token ID510
Feature activation+5.65e+00

Pos logit contributions
by+0.055
rell+0.051
 destination+0.051
 awoke+0.051
ench+0.049
Neg logit contributions
 we-0.030
 I-0.026
 who-0.024
 let-0.021
Bad-0.021
Token our
Token ID674
Feature activation-9.84e-01

Pos logit contributions
by+0.346
 awoke+0.337
rell+0.336
 destination+0.334
 home+0.323
Neg logit contributions
 we-0.202
 I-0.181
 who-0.159
Bad-0.157
 let-0.152
Token cones
Token ID47314
Feature activation+1.12e+00

Pos logit contributions
 we+0.053
 I+0.048
 you+0.044
 who+0.044
 let+0.040
Neg logit contributions
by-0.051
rell-0.050
 awoke-0.049
ench-0.047
lying-0.047
Token ran
Token ID4966
Feature activation-9.12e-01

Pos logit contributions
by+0.345
 awoke+0.335
rell+0.319
 destination+0.315
lying+0.309
Neg logit contributions
 we-0.316
 I-0.284
 you-0.258
 who-0.251
 yourselves-0.244
Token around
Token ID1088
Feature activation+2.94e+00

Pos logit contributions
 we+0.052
 I+0.048
 who+0.045
 you+0.045
 let+0.042
Neg logit contributions
by-0.044
rell-0.043
 awoke-0.043
ench-0.042
lying-0.040
Token the
Token ID262
Feature activation-3.80e+00

Pos logit contributions
by+0.191
 awoke+0.184
rell+0.182
 destination+0.180
zie+0.176
Neg logit contributions
 we-0.116
 I-0.102
 you-0.086
 who-0.085
 yourselves-0.080
Token park
Token ID3952
Feature activation+3.71e+00

Pos logit contributions
 we+0.240
 I+0.220
 you+0.203
 who+0.203
 let+0.190
Neg logit contributions
by-0.163
rell-0.152
 awoke-0.152
 destination-0.148
ench-0.145
Token,
Token ID11
Feature activation+4.10e+00

Pos logit contributions
by+0.231
rell+0.225
 awoke+0.224
ench+0.218
lying+0.217
Neg logit contributions
 we-0.164
 I-0.144
 who-0.135
 you-0.127
 let-0.117
Token letting
Token ID9616
Feature activation-1.61e+01

Pos logit contributions
by+0.219
rell+0.211
 awoke+0.211
 destination+0.204
zie+0.202
Neg logit contributions
 we-0.207
 I-0.188
 you-0.171
 who-0.164
 yourselves-0.160
Token his
Token ID465
Feature activation+1.07e+01

Pos logit contributions
 we+0.796
 I+0.732
 you+0.697
 who+0.669
 let+0.653
Neg logit contributions
by-0.858
rell-0.857
lying-0.855
 awoke-0.854
ench-0.844
Token new
Token ID649
Feature activation+1.23e+00

Pos logit contributions
by+0.640
rell+0.616
 awoke+0.612
 destination+0.597
ench+0.591
Neg logit contributions
 we-0.493
 I-0.441
 who-0.397
 you-0.393
 let-0.354
Token k
Token ID479
Feature activation-6.52e-01

Pos logit contributions
by+0.069
rell+0.069
 awoke+0.068
lying+0.066
ench+0.066
Neg logit contributions
 we-0.060
 I-0.054
 who-0.049
 you-0.049
 let-0.044
Tokenite
Token ID578
Feature activation-6.49e-01

Pos logit contributions
 we+0.036
 I+0.033
 who+0.030
 let+0.030
 you+0.029
Neg logit contributions
by-0.033
rell-0.031
ench-0.029
 destination-0.029
rolled-0.029
Token fly
Token ID6129
Feature activation-4.55e+00

Pos logit contributions
 we+0.033
 I+0.029
 you+0.027
 who+0.026
 let+0.024
Neg logit contributions
by-0.037
 awoke-0.035
rell-0.035
lying-0.034
ench-0.034
Token
Token ID198
Feature activation+5.15e-01

Pos logit contributions
 we+0.004
 I+0.004
 who+0.004
 you+0.004
 let+0.003
Neg logit contributions
by-0.003
rell-0.003
 awoke-0.003
lying-0.003
 destination-0.003
Token
Token ID198
Feature activation+4.23e-01

Pos logit contributions
by+0.030
rell+0.029
 awoke+0.028
 destination+0.028
ench+0.027
Neg logit contributions
 we-0.025
 I-0.023
 who-0.021
 you-0.020
 let-0.018
TokenAs
Token ID1722
Feature activation+3.43e+00

Pos logit contributions
by+0.026
 awoke+0.025
rell+0.024
lying+0.024
ench+0.024
Neg logit contributions
 we-0.019
 I-0.016
 who-0.014
 you-0.014
 yourselves-0.013
Token Tim
Token ID5045
Feature activation-2.09e+00

Pos logit contributions
by+0.230
 awoke+0.217
 destination+0.216
rell+0.211
zie+0.210
Neg logit contributions
 we-0.124
 I-0.110
 who-0.100
 yourselves-0.093
 you-0.091
Tokenmy
Token ID1820
Feature activation+8.40e+00

Pos logit contributions
 we+0.130
 I+0.125
 who+0.124
 let+0.121
 you+0.116
Neg logit contributions
ench-0.078
rell-0.078
by-0.075
 destination-0.074
leys-0.072
Token's
Token ID338
Feature activation+1.02e+01

Pos logit contributions
rell+0.454
by+0.454
ench+0.436
 destination+0.435
lying+0.432
Neg logit contributions
 we-0.420
 I-0.379
 who-0.362
 you-0.343
 let-0.334
Token mom
Token ID1995
Feature activation-4.63e+00

Pos logit contributions
by+0.662
rell+0.661
 awoke+0.655
lying+0.634
ench+0.630
Neg logit contributions
 we-0.422
 I-0.370
 you-0.328
 who-0.323
 let-0.284
Tokenmy
Token ID1820
Feature activation+8.31e+00

Pos logit contributions
 we+0.323
 I+0.301
 who+0.292
 you+0.282
 let+0.274
Neg logit contributions
by-0.161
rell-0.153
ench-0.147
 awoke-0.146
 destination-0.143
Token put
Token ID1234
Feature activation-8.25e+00

Pos logit contributions
by+0.470
rell+0.456
 awoke+0.446
 destination+0.437
ench+0.437
Neg logit contributions
 we-0.406
 I-0.364
 who-0.337
 you-0.328
 let-0.305
Token on
Token ID319
Feature activation-5.88e+00

Pos logit contributions
 we+0.347
 I+0.313
 you+0.285
 who+0.284
 let+0.247
Neg logit contributions
by-0.524
 awoke-0.507
rell-0.502
ench-0.498
 destination-0.488
Token her
Token ID607
Feature activation+8.13e+00

Pos logit contributions
 we+0.211
 I+0.184
 you+0.166
 who+0.164
 let+0.137
Neg logit contributions
by-0.412
 awoke-0.404
rell-0.401
ench-0.393
lying-0.386
Token sister
Token ID6621
Feature activation-2.45e+00

Pos logit contributions
by+0.695
rell+0.661
 awoke+0.658
 destination+0.650
ench+0.639
Neg logit contributions
 we-0.525
 I-0.468
 who-0.423
 you-0.418
 let-0.377
Token kept
Token ID4030
Feature activation+9.55e-01

Pos logit contributions
 we+0.126
 I+0.114
 who+0.105
 you+0.103
 let+0.096
Neg logit contributions
by-0.133
rell-0.129
 awoke-0.127
 destination-0.123
ench-0.123
Token trying
Token ID2111
Feature activation-3.93e+00

Pos logit contributions
by+0.054
rell+0.052
 awoke+0.052
ench+0.050
 destination+0.050
Neg logit contributions
 we-0.047
 I-0.042
 who-0.039
 you-0.038
 let-0.035
Token to
Token ID284
Feature activation+4.44e+00

Pos logit contributions
 we+0.162
 who+0.146
 I+0.139
 not+0.131
 you+0.129
Neg logit contributions
rell-0.255
 awoke-0.253
by-0.242
zie-0.242
rol-0.238
Token take
Token ID1011
Feature activation-4.78e+00

Pos logit contributions
by+0.219
 destination+0.215
ench+0.208
 ease+0.207
rell+0.206
Neg logit contributions
 we-0.234
 I-0.212
 pige-0.190
 who-0.182
We-0.177
Token away
Token ID1497
Feature activation+1.46e+01

Pos logit contributions
 we+0.227
 I+0.204
 you+0.193
 who+0.189
 let+0.173
Neg logit contributions
 awoke-0.275
rell-0.268
by-0.268
ench-0.260
zie-0.254
Token the
Token ID262
Feature activation-3.17e+00

Pos logit contributions
by+0.821
rell+0.789
 awoke+0.781
 destination+0.763
lying+0.755
Neg logit contributions
 we-0.723
 I-0.652
 you-0.584
 who-0.583
 let-0.529
Token plastic
Token ID7309
Feature activation+1.69e-02

Pos logit contributions
 we+0.163
 I+0.152
 who+0.139
 you+0.136
 yourselves+0.133
Neg logit contributions
by-0.160
 destination-0.155
rell-0.153
 doorway-0.150
 awoke-0.146
Token pieces
Token ID5207
Feature activation-1.11e+00

Pos logit contributions
by+0.001
rell+0.001
 awoke+0.001
ench+0.001
lying+0.001
Neg logit contributions
 we-0.001
 I-0.001
 who-0.001
 you-0.001
 let-0.001
Token.
Token ID13
Feature activation+5.04e-01

Pos logit contributions
 we+0.050
 I+0.044
 you+0.041
 who+0.041
 let+0.036
Neg logit contributions
by-0.070
 awoke-0.068
rell-0.068
ench-0.066
lying-0.065
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
by+0.032
 awoke+0.032
rell+0.031
 destination+0.030
ench+0.030
Neg logit contributions
 we-0.021
 I-0.020
 you-0.017
 who-0.017
You-0.015
Token and
Token ID290
Feature activation-3.92e+00

Pos logit contributions
by+0.642
rell+0.622
 awoke+0.614
ench+0.599
 destination+0.597
Neg logit contributions
 we-0.487
 I-0.435
 who-0.393
 you-0.388
 let-0.351
Token started
Token ID2067
Feature activation+9.91e+00

Pos logit contributions
 we+0.206
 I+0.188
 who+0.173
 you+0.172
 yourselves+0.164
Neg logit contributions
by-0.201
 awoke-0.193
rell-0.187
 destination-0.185
zie-0.179
Token playing
Token ID2712
Feature activation-7.91e-01

Pos logit contributions
by+0.550
rell+0.534
 awoke+0.532
ench+0.522
zie+0.517
Neg logit contributions
 we-0.505
 I-0.455
 who-0.417
 you-0.412
 let-0.381
Token it
Token ID340
Feature activation-5.60e+00

Pos logit contributions
 we+0.038
 I+0.034
 you+0.029
 who+0.029
 yourselves+0.028
Neg logit contributions
by-0.043
 awoke-0.042
 destination-0.042
rell-0.041
ench-0.040
Token.
Token ID13
Feature activation+6.72e-01

Pos logit contributions
 we+0.250
 I+0.224
 you+0.203
 who+0.201
 let+0.184
Neg logit contributions
by-0.352
 awoke-0.340
rell-0.335
ench-0.332
lying-0.330
Token
Token ID220
Feature activation+1.21e+01

Pos logit contributions
by+0.035
ench+0.033
rell+0.032
 awoke+0.031
 destination+0.031
Neg logit contributions
 we-0.039
 I-0.038
 who-0.033
 you-0.032
 let-0.030
Token
Token ID198
Feature activation+1.36e+00

Pos logit contributions
by+0.757
 awoke+0.730
rell+0.718
 destination+0.703
lying+0.697
Neg logit contributions
 we-0.542
 I-0.496
 who-0.443
 you-0.436
 let-0.386
Token
Token ID198
Feature activation+1.36e+00

Pos logit contributions
by+0.082
rell+0.080
 awoke+0.078
 destination+0.077
lying+0.076
Neg logit contributions
 we-0.063
 I-0.057
 who-0.052
 you-0.051
 let-0.046
TokenSuddenly
Token ID38582
Feature activation-2.21e+00

Pos logit contributions
by+0.085
rell+0.082
 awoke+0.082
 destination+0.080
ench+0.079
Neg logit contributions
 we-0.059
 I-0.052
 who-0.046
 you-0.046
 let-0.041
Token,
Token ID11
Feature activation+5.67e+00

Pos logit contributions
 we+0.117
 I+0.105
 you+0.094
 who+0.093
 let+0.085
Neg logit contributions
rell-0.122
by-0.121
ench-0.119
lying-0.119
rol-0.117
Token a
Token ID257
Feature activation-1.65e-01

Pos logit contributions
by+0.323
 awoke+0.314
rell+0.306
 destination+0.301
ench+0.293
Neg logit contributions
 we-0.268
 I-0.243
 who-0.218
 you-0.216
 yourselves-0.203
Token
Token ID220
Feature activation+1.15e+01

Pos logit contributions
by+0.012
rell+0.012
 destination+0.011
 awoke+0.011
ench+0.011
Neg logit contributions
 we-0.014
 I-0.014
 who-0.012
 you-0.012
 let-0.012
Token
Token ID198
Feature activation+8.50e-01

Pos logit contributions
by+0.691
 awoke+0.664
rell+0.658
 destination+0.645
lying+0.638
Neg logit contributions
 we-0.541
 I-0.494
 who-0.441
 you-0.438
 let-0.389
Token
Token ID198
Feature activation+8.38e-01

Pos logit contributions
by+0.048
rell+0.047
 awoke+0.046
 destination+0.045
lying+0.044
Neg logit contributions
 we-0.042
 I-0.038
 who-0.034
 you-0.034
 let-0.031
TokenIn
Token ID818
Feature activation+4.52e-01

Pos logit contributions
by+0.052
 awoke+0.051
rell+0.049
ench+0.048
 destination+0.048
Neg logit contributions
 we-0.037
 I-0.032
 who-0.028
 you-0.028
 yourselves-0.026
Token the
Token ID262
Feature activation-3.31e+00

Pos logit contributions
by+0.034
rell+0.034
ench+0.033
lying+0.033
 awoke+0.033
Neg logit contributions
 we-0.014
 I-0.012
 you-0.010
 who-0.010
 let-0.008
Token morning
Token ID3329
Feature activation+1.18e+01

Pos logit contributions
 we+0.197
 I+0.187
 you+0.181
 who+0.173
 let+0.163
Neg logit contributions
rell-0.154
rolled-0.152
lying-0.144
rol-0.142
by-0.139
Token when
Token ID618
Feature activation-4.32e+00

Pos logit contributions
by+0.625
rell+0.612
lying+0.584
ench+0.584
zie+0.581
Neg logit contributions
 we-0.650
 I-0.596
 you-0.546
 who-0.529
 let-0.494
Token he
Token ID339
Feature activation+2.70e+00

Pos logit contributions
 we+0.207
 I+0.184
 you+0.166
 who+0.155
 let+0.143
Neg logit contributions
by-0.259
rell-0.258
lying-0.250
ench-0.248
 awoke-0.247
Token woke
Token ID19092
Feature activation+7.00e+00

Pos logit contributions
rell+0.122
lying+0.110
ench+0.109
by+0.108
zie+0.106
Neg logit contributions
 we-0.172
 I-0.160
 who-0.153
 let-0.153
 you-0.147
Token up
Token ID510
Feature activation+6.45e+00

Pos logit contributions
rell+0.313
by+0.299
lying+0.298
 awoke+0.287
rol+0.285
Neg logit contributions
 we-0.441
 I-0.398
 you-0.383
 who-0.373
 that-0.350
Token,
Token ID11
Feature activation+5.25e+00

Pos logit contributions
rell+0.405
zie+0.394
rol+0.388
by+0.387
leys+0.376
Neg logit contributions
 we-0.302
 I-0.265
 you-0.239
 who-0.236
 that-0.232
Token hears
Token ID23180
Feature activation-8.02e-01

Pos logit contributions
 we+0.130
 I+0.117
 who+0.107
 you+0.105
 let+0.098
Neg logit contributions
by-0.150
rell-0.146
 awoke-0.144
 destination-0.142
ench-0.141
Token him
Token ID683
Feature activation+1.16e+01

Pos logit contributions
 we+0.042
 I+0.038
 you+0.035
 who+0.035
 let+0.032
Neg logit contributions
by-0.042
rell-0.041
 awoke-0.041
ench-0.040
 destination-0.040
Token and
Token ID290
Feature activation-4.18e+00

Pos logit contributions
by+0.873
 destination+0.853
 awoke+0.848
rell+0.842
 doorway+0.839
Neg logit contributions
 we-0.356
 who-0.311
 I-0.293
 you-0.260
 let-0.250
Token runs
Token ID4539
Feature activation-2.12e+00

Pos logit contributions
 we+0.221
 I+0.200
 who+0.183
 you+0.183
 yourselves+0.170
Neg logit contributions
by-0.217
 awoke-0.208
rell-0.207
 destination-0.196
zie-0.196
Token to
Token ID284
Feature activation+4.17e+00

Pos logit contributions
 we+0.115
 I+0.104
 you+0.098
 who+0.097
 let+0.091
Neg logit contributions
 awoke-0.110
rell-0.110
by-0.109
ench-0.106
zie-0.105
Token him
Token ID683
Feature activation+1.16e+01

Pos logit contributions
by+0.224
rell+0.214
 awoke+0.213
 destination+0.208
ench+0.205
Neg logit contributions
 we-0.216
 I-0.195
 who-0.178
 you-0.175
 yourselves-0.161
Token.
Token ID13
Feature activation+5.93e-01

Pos logit contributions
by+0.756
 awoke+0.744
rell+0.738
ench+0.715
 destination+0.706
Neg logit contributions
 we-0.492
 I-0.434
 who-0.432
 you-0.386
 let-0.366
Token She
Token ID1375
Feature activation+4.61e+00

Pos logit contributions
by+0.032
rell+0.031
 awoke+0.030
 destination+0.030
ench+0.029
Neg logit contributions
 we-0.032
 I-0.031
 who-0.027
 you-0.027
 let-0.024
Token sees
Token ID7224
Feature activation-2.92e+00

Pos logit contributions
by+0.248
 awoke+0.239
rell+0.234
 destination+0.229
zie+0.224
Neg logit contributions
 we-0.233
 I-0.208
 you-0.188
 who-0.188
 yourselves-0.178
Token his
Token ID465
Feature activation+1.12e+01

Pos logit contributions
 we+0.145
 I+0.129
 you+0.115
 who+0.113
 let+0.107
Neg logit contributions
by-0.162
rell-0.157
 awoke-0.155
zie-0.147
 destination-0.146
Token knee
Token ID10329
Feature activation-1.08e+00

Pos logit contributions
by+0.643
 destination+0.602
rell+0.598
 awoke+0.587
ench+0.581
Neg logit contributions
 we-0.539
 I-0.489
 you-0.436
 who-0.433
 yourselves-0.395
Token it
Token ID340
Feature activation-6.34e+00

Pos logit contributions
 we+0.263
 I+0.221
 who+0.196
 you+0.186
 yourselves+0.176
Neg logit contributions
by-0.588
 awoke-0.573
 destination-0.564
rell-0.555
zie-0.551
Token was
Token ID373
Feature activation+2.16e+00

Pos logit contributions
 we+0.353
 I+0.323
 who+0.296
 you+0.295
 yourselves+0.271
Neg logit contributions
by-0.316
rell-0.303
 awoke-0.302
 destination-0.291
ench-0.288
Token too
Token ID1165
Feature activation+2.09e+00

Pos logit contributions
by+0.132
rell+0.130
 awoke+0.128
 destination+0.127
zie+0.124
Neg logit contributions
 we-0.092
 I-0.081
 who-0.073
 you-0.072
 yourselves-0.065
Token hard
Token ID1327
Feature activation+8.42e-01

Pos logit contributions
by+0.112
rell+0.106
 awoke+0.104
 destination+0.101
ench+0.100
Neg logit contributions
 we-0.106
 I-0.097
 who-0.090
 you-0.088
 yourselves-0.084
Token for
Token ID329
Feature activation-7.74e-01

Pos logit contributions
by+0.054
 awoke+0.053
ench+0.052
rell+0.051
zie+0.051
Neg logit contributions
 we-0.037
 I-0.032
 you-0.029
 who-0.029
 let-0.026
Token her
Token ID607
Feature activation+7.86e+00

Pos logit contributions
 we+0.042
 I+0.038
 you+0.035
 who+0.035
 let+0.032
Neg logit contributions
by-0.040
rell-0.038
 awoke-0.038
ench-0.037
 destination-0.037
Token to
Token ID284
Feature activation+4.05e+00

Pos logit contributions
by+0.414
rell+0.385
 awoke+0.385
 destination+0.383
lying+0.372
Neg logit contributions
 we-0.410
 I-0.375
 who-0.338
 you-0.337
 yourselves-0.324
Token manage
Token ID6687
Feature activation-3.56e-01

Pos logit contributions
by+0.219
rell+0.215
 awoke+0.215
 destination+0.212
lying+0.206
Neg logit contributions
 we-0.200
 I-0.179
 who-0.160
 you-0.158
 yourselves-0.153
Token.
Token ID13
Feature activation-3.96e-01

Pos logit contributions
 we+0.016
 I+0.014
 you+0.013
 who+0.013
 let+0.011
Neg logit contributions
by-0.022
rell-0.022
 awoke-0.022
ench-0.021
lying-0.021
Token She
Token ID1375
Feature activation+4.24e+00

Pos logit contributions
 we+0.023
 I+0.021
 who+0.019
 you+0.019
 let+0.018
Neg logit contributions
by-0.020
rell-0.019
 awoke-0.019
 destination-0.018
ench-0.018
Token asked
Token ID1965
Feature activation-2.07e+00

Pos logit contributions
by+0.217
 awoke+0.206
rell+0.197
 destination+0.195
lying+0.191
Neg logit contributions
 we-0.235
 I-0.215
 you-0.196
 who-0.188
 yourselves-0.186
Token friends
Token ID2460
Feature activation-2.12e+00

Pos logit contributions
rell+0.872
lying+0.862
 awoke+0.852
rol+0.820
ench+0.818
Neg logit contributions
 we-0.348
 I-0.287
 who-0.256
 you-0.253
 spies-0.231
Token arrived
Token ID5284
Feature activation+1.57e+01

Pos logit contributions
 we+0.116
 who+0.108
 let+0.106
 I+0.102
 you+0.095
Neg logit contributions
rell-0.110
ench-0.105
by-0.102
lying-0.101
zie-0.096
Token at
Token ID379
Feature activation-3.77e-01

Pos logit contributions
rell+1.109
 awoke+1.026
zie+1.018
lying+0.997
jo+0.990
Neg logit contributions
 we-0.691
 who-0.564
 I-0.561
 you-0.528
 that-0.500
Token the
Token ID262
Feature activation-3.41e+00

Pos logit contributions
 we+0.012
 I+0.010
 who+0.009
 you+0.009
 let+0.007
Neg logit contributions
by-0.028
rell-0.027
 awoke-0.027
lying-0.026
ench-0.026
Token cave
Token ID11527
Feature activation+9.71e+00

Pos logit contributions
 we+0.218
 I+0.206
 you+0.191
 who+0.188
 that+0.177
Neg logit contributions
rell-0.157
lying-0.152
by-0.141
ogged-0.138
rolled-0.137
Token,
Token ID11
Feature activation+4.97e+00

Pos logit contributions
by+0.511
rell+0.506
 awoke+0.485
lying+0.477
ench+0.476
Neg logit contributions
 we-0.539
 I-0.480
 who-0.444
 you-0.437
 let-0.399
Token they
Token ID484
Feature activation-6.20e+00

Pos logit contributions
by+0.344
 awoke+0.333
rell+0.324
 destination+0.318
rolled+0.316
Neg logit contributions
 we-0.164
 I-0.146
 you-0.124
 who-0.117
 yourselves-0.116
Token saw
Token ID2497
Feature activation+4.57e-02

Pos logit contributions
 we+0.325
 I+0.292
 who+0.278
 you+0.263
 let+0.257
Neg logit contributions
rell-0.333
by-0.333
ench-0.315
lying-0.314
zie-0.308
Token that
Token ID326
Feature activation-1.09e+01

Pos logit contributions
by+0.003
rell+0.003
 awoke+0.003
 destination+0.003
lying+0.003
Neg logit contributions
 we-0.002
 I-0.001
 who-0.001
 you-0.001
 let-0.001
Token it
Token ID340
Feature activation-6.07e+00

Pos logit contributions
 we+0.372
 I+0.326
 you+0.278
 who+0.270
 yourselves+0.251
Neg logit contributions
by-0.758
 awoke-0.739
 destination-0.715
rell-0.713
ench-0.700
Token was
Token ID373
Feature activation+2.58e+00

Pos logit contributions
 we+0.303
 I+0.271
 who+0.248
 you+0.241
 let+0.228
Neg logit contributions
by-0.349
rell-0.339
ench-0.328
lying-0.328
 awoke-0.320
Token goodbye
Token ID24829
Feature activation+9.67e+00

Pos logit contributions
 we+0.017
 I+0.016
 you+0.014
 who+0.014
 let+0.013
Neg logit contributions
by-0.015
rell-0.015
 awoke-0.014
zie-0.014
 destination-0.014
Token to
Token ID284
Feature activation+4.47e+00

Pos logit contributions
rell+0.566
by+0.565
 awoke+0.557
ench+0.538
zie+0.533
Neg logit contributions
 we-0.455
 I-0.405
 who-0.373
 you-0.359
 let-0.334
Token the
Token ID262
Feature activation-3.28e+00

Pos logit contributions
by+0.281
rell+0.270
 awoke+0.267
 destination+0.264
zie+0.263
Neg logit contributions
 we-0.190
 I-0.166
 you-0.144
 who-0.144
 yourselves-0.131
Token gard
Token ID16985
Feature activation-2.08e+00

Pos logit contributions
 we+0.129
 I+0.120
 you+0.104
 who+0.101
 let+0.093
Neg logit contributions
by-0.220
rell-0.216
 awoke-0.214
lying-0.210
rol-0.208
Tokenener
Token ID877
Feature activation+1.26e+00

Pos logit contributions
 we+0.118
 I+0.111
 who+0.105
 you+0.098
 let+0.094
Neg logit contributions
by-0.098
rell-0.097
ench-0.095
 awoke-0.095
 destination-0.093
Token,
Token ID11
Feature activation+5.47e+00

Pos logit contributions
rell+0.081
by+0.080
ench+0.079
 awoke+0.078
lying+0.076
Neg logit contributions
 we-0.052
 I-0.045
 who-0.045
 you-0.040
 let-0.038
Token thanked
Token ID26280
Feature activation-2.19e+00

Pos logit contributions
 awoke+0.318
by+0.308
rell+0.303
 doorway+0.300
lying+0.297
Neg logit contributions
 we-0.238
 I-0.208
 you-0.192
 yourselves-0.188
 spies-0.174
Token her
Token ID607
Feature activation+8.25e+00

Pos logit contributions
 we+0.152
 I+0.141
 who+0.131
 you+0.128
 yourselves+0.124
Neg logit contributions
by-0.078
rell-0.072
 awoke-0.072
 destination-0.070
zie-0.068
Token and
Token ID290
Feature activation-4.00e+00

Pos logit contributions
by+0.462
rell+0.448
 awoke+0.444
ench+0.429
lying+0.428
Neg logit contributions
 we-0.415
 I-0.374
 who-0.342
 you-0.337
 let-0.309
Token came
Token ID1625
Feature activation+5.65e+00

Pos logit contributions
 we+0.220
 I+0.201
 who+0.184
 you+0.184
 yourselves+0.173
Neg logit contributions
by-0.200
 awoke-0.192
rell-0.188
 destination-0.182
lying-0.180
Token back
Token ID736
Feature activation+1.50e+01

Pos logit contributions
rell+0.244
lying+0.218
 awoke+0.216
ench+0.214
by+0.213
Neg logit contributions
 we-0.374
 I-0.352
 who-0.347
 you-0.330
 let-0.329
Token was
Token ID373
Feature activation+2.79e+00

Pos logit contributions
 we+0.040
 I+0.036
 you+0.033
 who+0.032
 yourselves+0.031
Neg logit contributions
by-0.042
 awoke-0.041
rell-0.040
 destination-0.039
zie-0.038
Token very
Token ID845
Feature activation+8.62e+00

Pos logit contributions
by+0.162
 awoke+0.157
rell+0.151
ench+0.149
 destination+0.148
Neg logit contributions
 we-0.131
 I-0.115
 you-0.102
 who-0.098
 yourselves-0.095
Token happy
Token ID3772
Feature activation-5.56e-01

Pos logit contributions
by+0.497
 home+0.420
ench+0.417
day+0.411
 awoke+0.402
Neg logit contributions
 we-0.393
 I-0.350
 yourselves-0.329
 pige-0.306
 you-0.302
Token playing
Token ID2712
Feature activation-9.13e-01

Pos logit contributions
 we+0.027
 I+0.025
Bad+0.024
 yourselves+0.023
 exc+0.022
Neg logit contributions
by-0.029
 home-0.026
 awoke-0.025
 destination-0.025
 doorway-0.024
Token with
Token ID351
Feature activation-9.78e+00

Pos logit contributions
 we+0.033
 I+0.029
 you+0.023
 who+0.023
 let+0.021
Neg logit contributions
by-0.061
 awoke-0.059
ench-0.058
rell-0.058
 destination-0.057
Token her
Token ID607
Feature activation+8.29e+00

Pos logit contributions
 we+0.625
 I+0.573
 you+0.501
We+0.497
 who+0.493
Neg logit contributions
by-0.395
rell-0.358
 ease-0.355
zie-0.351
 awoke-0.345
Token new
Token ID649
Feature activation+1.98e+00

Pos logit contributions
by+0.446
 destination+0.421
rell+0.400
 doorway+0.399
 awoke+0.398
Neg logit contributions
 we-0.420
 I-0.389
 you-0.340
 yourselves-0.329
 let-0.328
Token baby
Token ID5156
Feature activation+1.33e+00

Pos logit contributions
by+0.113
rell+0.111
 awoke+0.110
lying+0.107
ench+0.106
Neg logit contributions
 we-0.096
 I-0.086
 who-0.079
 you-0.078
 let-0.070
Token brother
Token ID3956
Feature activation-3.51e+00

Pos logit contributions
by+0.078
rell+0.073
 awoke+0.073
 destination+0.071
ench+0.070
Neg logit contributions
 we-0.064
 I-0.057
 you-0.051
 who-0.050
 yourselves-0.047
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
 we+0.156
 I+0.138
 you+0.121
 yourselves+0.117
 who+0.116
Neg logit contributions
by-0.212
 awoke-0.204
rell-0.202
 destination-0.196
lying-0.195
Token<|endoftext|>
Token ID50256
Feature activation+7.62e+00

Pos logit contributions
by+0.016
rell+0.016
 awoke+0.016
 destination+0.015
ench+0.015
Neg logit contributions
 we-0.015
 I-0.013
 who-0.012
 you-0.012
 let-0.011
Token lot
Token ID1256
Feature activation+1.92e+00

Pos logit contributions
 we+0.040
 I+0.037
 you+0.034
 who+0.032
 yourselves+0.032
Neg logit contributions
by-0.043
 awoke-0.041
 doorway-0.041
 destination-0.041
rell-0.037
Token of
Token ID286
Feature activation-3.67e+00

Pos logit contributions
by+0.105
rell+0.103
 awoke+0.101
 doorway+0.097
 destination+0.097
Neg logit contributions
 we-0.096
 I-0.086
 you-0.077
 who-0.076
 let-0.071
Token clothes
Token ID8242
Feature activation+3.90e+00

Pos logit contributions
 we+0.182
 I+0.164
 you+0.140
 who+0.139
 let+0.133
Neg logit contributions
by-0.208
rell-0.199
 awoke-0.198
 doorway-0.189
 destination-0.189
Token,
Token ID11
Feature activation+4.96e+00

Pos logit contributions
by+0.205
rell+0.204
 awoke+0.203
 destination+0.202
lying+0.193
Neg logit contributions
 we-0.183
 I-0.169
Bad-0.155
 you-0.152
 yourselves-0.147
Token shoes
Token ID10012
Feature activation+3.82e+00

Pos logit contributions
 awoke+0.246
rell+0.244
by+0.234
zie+0.229
 trek+0.228
Neg logit contributions
 we-0.234
 I-0.214
 you-0.189
Bad-0.180
We-0.179
Token,
Token ID11
Feature activation+4.98e+00

Pos logit contributions
rell+0.148
by+0.147
 awoke+0.139
 destination+0.138
lying+0.134
Neg logit contributions
 we-0.239
 I-0.224
 you-0.208
 who-0.207
Bad-0.205
Token hats
Token ID23910
Feature activation+1.28e+00

Pos logit contributions
rell+0.248
 awoke+0.245
by+0.241
zie+0.224
 trek+0.220
Neg logit contributions
 we-0.251
 I-0.236
 you-0.210
Bad-0.203
 yourselves-0.200
Token and
Token ID290
Feature activation-4.54e+00

Pos logit contributions
by+0.062
rell+0.060
 awoke+0.059
 destination+0.057
lying+0.056
Neg logit contributions
 we-0.073
 I-0.067
 who-0.061
 you-0.061
 yourselves-0.056
Token scar
Token ID10153
Feature activation+1.69e+00

Pos logit contributions
 we+0.242
 I+0.221
 you+0.203
 who+0.198
 yourselves+0.194
Neg logit contributions
by-0.222
rell-0.221
 awoke-0.217
lying-0.195
 doorway-0.192
Tokenves
Token ID1158
Feature activation-3.64e+00

Pos logit contributions
rell+0.092
by+0.090
 awoke+0.085
lying+0.084
ench+0.083
Neg logit contributions
 we-0.084
 I-0.080
 who-0.069
 you-0.067
Bad-0.066
Token.
Token ID13
Feature activation+1.85e-01

Pos logit contributions
 we+0.177
 I+0.158
 who+0.145
 you+0.142
 let+0.131
Neg logit contributions
by-0.209
rell-0.200
 awoke-0.200
ench-0.195
 destination-0.194
Token
Token ID198
Feature activation+1.41e-01

Pos logit contributions
 we+0.010
 I+0.009
 you+0.008
 who+0.008
 yourselves+0.008
Neg logit contributions
by-0.010
rell-0.010
 awoke-0.010
lying-0.010
ench-0.009
Token
Token ID198
Feature activation+1.58e-01

Pos logit contributions
by+0.007
rell+0.007
 awoke+0.007
ench+0.007
lying+0.007
Neg logit contributions
 we-0.008
 I-0.007
 who-0.006
 you-0.006
 let-0.006
Token"
Token ID1
Feature activation-1.76e+00

Pos logit contributions
by+0.009
rell+0.009
zie+0.009
rolled+0.008
lying+0.008
Neg logit contributions
 we-0.007
 I-0.006
 pige-0.005
 wasting-0.005
 who-0.005
TokenMe
Token ID5308
Feature activation-2.69e-01

Pos logit contributions
 we+0.078
 ours+0.068
 I+0.067
 wasting+0.066
 you+0.062
Neg logit contributions
side-0.083
day-0.081
 awoke-0.080
by-0.079
 distance-0.077
Token neither
Token ID6159
Feature activation-7.38e+00

Pos logit contributions
 we+0.013
 I+0.011
 who+0.011
 let+0.010
 you+0.010
Neg logit contributions
by-0.014
rell-0.014
 destination-0.014
ench-0.014
lying-0.013
Token,"
Token ID553
Feature activation-1.57e+00

Pos logit contributions
 we+0.318
 I+0.283
 you+0.253
 who+0.246
 let+0.223
Neg logit contributions
by-0.471
rell-0.458
 awoke-0.457
 destination-0.455
ench-0.452
Token Tim
Token ID5045
Feature activation-2.88e+00

Pos logit contributions
 we+0.060
ms+0.052
 I+0.051
 cooperative+0.049
 yourselves+0.048
Neg logit contributions
zie-0.089
by-0.089
rell-0.087
 awoke-0.083
 ease-0.083
Token says
Token ID1139
Feature activation-2.10e+00

Pos logit contributions
 we+0.088
 I+0.077
 who+0.067
 you+0.067
 let+0.059
Neg logit contributions
by-0.212
 awoke-0.209
rolled-0.207
ench-0.206
rell-0.205
Token.
Token ID13
Feature activation-8.95e-02

Pos logit contributions
 we+0.104
 I+0.094
 who+0.085
 you+0.085
 let+0.077
Neg logit contributions
by-0.118
rell-0.114
 awoke-0.113
 destination-0.110
ench-0.109
Token "
Token ID366
Feature activation-3.96e+00

Pos logit contributions
 we+0.005
 I+0.004
 you+0.004
 who+0.004
 yourselves+0.004
Neg logit contributions
rell-0.004
by-0.004
 awoke-0.004
lying-0.004
 doorway-0.004
TokenBut
Token ID1537
Feature activation-1.03e+01

Pos logit contributions
 we+0.158
 I+0.142
 dear+0.121
 you+0.116
 artists+0.112
Neg logit contributions
side-0.219
 awoke-0.218
day-0.214
by-0.209
age-0.201
Token also
Token ID635
Feature activation-6.64e+00

Pos logit contributions
by+0.253
rell+0.245
 awoke+0.242
 destination+0.235
ench+0.234
Neg logit contributions
 we-0.218
 I-0.196
 who-0.178
 you-0.176
 let-0.161
Token learned
Token ID4499
Feature activation+8.32e-02

Pos logit contributions
 we+0.350
 I+0.318
 who+0.291
 you+0.288
 let+0.265
Neg logit contributions
by-0.353
rell-0.341
 awoke-0.337
 destination-0.326
ench-0.325
Token that
Token ID326
Feature activation-1.10e+01

Pos logit contributions
by+0.006
rell+0.006
 awoke+0.006
 destination+0.006
ench+0.006
Neg logit contributions
 we-0.002
 I-0.002
 who-0.001
 you-0.001
 let-0.001
Token solving
Token ID18120
Feature activation+4.38e-02

Pos logit contributions
 we+0.381
 I+0.353
 who+0.315
 yourselves+0.309
Bad+0.300
Neg logit contributions
by-0.686
 awoke-0.651
 destination-0.638
day-0.636
 home-0.633
Token problems
Token ID2761
Feature activation+4.87e+00

Pos logit contributions
by+0.002
 destination+0.002
rell+0.002
 awoke+0.002
ench+0.002
Neg logit contributions
 we-0.002
 I-0.002
 you-0.002
 who-0.002
 let-0.002
Token is
Token ID318
Feature activation-1.11e+00

Pos logit contributions
by+0.280
rell+0.277
 awoke+0.264
 destination+0.263
lying+0.260
Neg logit contributions
 we-0.212
 I-0.190
 who-0.167
 you-0.163
Bad-0.161
Token more
Token ID517
Feature activation+3.78e+00

Pos logit contributions
 we+0.055
 I+0.051
 yourselves+0.044
 pige+0.044
 you+0.043
Neg logit contributions
by-0.055
 destination-0.053
rell-0.051
ench-0.050
 home-0.049
Token important
Token ID1593
Feature activation-1.38e+00

Pos logit contributions
by+0.184
rell+0.171
 destination+0.169
ench+0.167
 awoke+0.167
Neg logit contributions
 we-0.206
 I-0.188
 you-0.167
 who-0.164
Bad-0.162
Token than
Token ID621
Feature activation-8.91e+00

Pos logit contributions
 we+0.052
 I+0.046
 you+0.039
 who+0.038
Bad+0.037
Neg logit contributions
by-0.091
rell-0.085
 destination-0.084
lying-0.084
 awoke-0.083
Token playing
Token ID2712
Feature activation-1.33e+00

Pos logit contributions
 we+0.392
 I+0.347
 who+0.315
We+0.296
Bad+0.294
Neg logit contributions
by-0.516
rell-0.486
 destination-0.460
 ease-0.459
zie-0.448
Token with
Token ID351
Feature activation-1.02e+01

Pos logit contributions
 we+0.059
 I+0.053
 who+0.044
 pige+0.044
 you+0.044
Neg logit contributions
rell-0.073
by-0.073
 destination-0.071
ench-0.071
 awoke-0.070
Token wolf
Token ID17481
Feature activation+3.65e+00

Pos logit contributions
 we+0.124
 I+0.110
 who+0.098
 you+0.097
 let+0.086
Neg logit contributions
by-0.186
rell-0.180
 awoke-0.179
 destination-0.174
ench-0.173
Token was
Token ID373
Feature activation+3.17e+00

Pos logit contributions
by+0.192
 awoke+0.183
rell+0.181
 destination+0.175
lying+0.174
Neg logit contributions
 we-0.192
 I-0.174
 you-0.158
 who-0.158
 yourselves-0.149
Token very
Token ID845
Feature activation+8.90e+00

Pos logit contributions
by+0.183
rell+0.175
 awoke+0.175
ench+0.172
 destination+0.172
Neg logit contributions
 we-0.139
 I-0.119
 you-0.104
 yourselves-0.104
We-0.100
Token obedient
Token ID41652
Feature activation-3.26e+00

Pos logit contributions
by+0.577
lying+0.520
ench+0.519
 destination+0.516
rell+0.515
Neg logit contributions
 we-0.353
 I-0.305
 who-0.278
 yourselves-0.262
 you-0.262
Token he
Token ID339
Feature activation+3.03e+00

Pos logit contributions
 we+0.154
 I+0.138
 who+0.125
 you+0.124
 let+0.112
Neg logit contributions
by-0.190
rell-0.183
 awoke-0.182
 destination-0.177
ench-0.176
Token could
Token ID714
Feature activation-1.68e+00

Pos logit contributions
by+0.153
 awoke+0.145
rell+0.143
 destination+0.138
lying+0.138
Neg logit contributions
 we-0.163
 I-0.148
 who-0.135
 you-0.134
 yourselves-0.133
Token teach
Token ID4545
Feature activation-8.87e+00

Pos logit contributions
 we+0.076
 I+0.068
 who+0.061
 you+0.060
 yourselves+0.055
Neg logit contributions
by-0.100
rell-0.097
 awoke-0.096
 destination-0.094
ench-0.093
Token him
Token ID683
Feature activation+1.21e+01

Pos logit contributions
 we+0.498
 I+0.453
 who+0.416
 you+0.408
 let+0.383
Neg logit contributions
by-0.437
rell-0.420
 awoke-0.413
 destination-0.401
lying-0.395
Token to
Token ID284
Feature activation+4.71e+00

Pos logit contributions
by+0.779
rell+0.750
 awoke+0.743
ench+0.728
 destination+0.725
Neg logit contributions
 we-0.489
 I-0.428
 you-0.371
 who-0.370
 let-0.332
Token be
Token ID307
Feature activation+4.26e+00

Pos logit contributions
by+0.277
rell+0.262
 awoke+0.258
 destination+0.257
ench+0.253
Neg logit contributions
 we-0.214
 I-0.188
 who-0.168
 you-0.162
Bad-0.152
Token clever
Token ID14169
Feature activation-6.56e+00

Pos logit contributions
by+0.264
ench+0.244
 destination+0.240
rell+0.240
 awoke+0.240
Neg logit contributions
 we-0.186
 I-0.162
 you-0.138
 who-0.129
 yourselves-0.125
TokenSee
Token ID6214
Feature activation-2.99e+00

Pos logit contributions
 we+0.156
 I+0.127
 spies+0.117
 who+0.114
 yourselves+0.109
Neg logit contributions
by-0.242
 awoke-0.232
rell-0.230
lying-0.228
day-0.227
Token,
Token ID11
Feature activation+4.54e+00

Pos logit contributions
 we+0.178
 I+0.161
 who+0.156
 you+0.146
ms+0.146
Neg logit contributions
by-0.133
 destination-0.119
 awoke-0.118
rell-0.117
zie-0.115
Token I
Token ID314
Feature activation-1.27e+01

Pos logit contributions
by+0.282
zie+0.273
 destination+0.273
 awoke+0.271
 home+0.266
Neg logit contributions
ms-0.131
 spies-0.118
 we-0.110
Bad-0.108
 pige-0.104
Token told
Token ID1297
Feature activation-4.76e+00

Pos logit contributions
 we+0.645
 I+0.580
 you+0.524
 who+0.521
 yourselves+0.496
Neg logit contributions
by-0.706
 awoke-0.683
rell-0.656
ench-0.646
 destination-0.643
Token you
Token ID345
Feature activation-7.14e+00

Pos logit contributions
 we+0.083
 I+0.056
 who+0.054
 spies+0.044
ms+0.042
Neg logit contributions
by-0.393
rell-0.378
 destination-0.375
zie-0.374
 awoke-0.366
Token it
Token ID340
Feature activation-6.30e+00

Pos logit contributions
 we+0.252
 I+0.217
Bad+0.199
 who+0.199
 yourselves+0.189
Neg logit contributions
by-0.469
 awoke-0.449
 destination-0.435
ench-0.434
rell-0.428
Token would
Token ID561
Feature activation+3.73e+00

Pos logit contributions
 we+0.350
 I+0.319
 who+0.294
 you+0.291
 let+0.269
Neg logit contributions
by-0.313
rell-0.300
 awoke-0.299
 destination-0.288
ench-0.286
Token make
Token ID787
Feature activation-9.89e+00

Pos logit contributions
by+0.224
rell+0.216
 awoke+0.213
 destination+0.213
ench+0.210
Neg logit contributions
 we-0.166
 I-0.146
 who-0.135
 you-0.126
 yourselves-0.120
Token you
Token ID345
Feature activation-7.13e+00

Pos logit contributions
 we+0.325
 I+0.278
 who+0.243
 you+0.209
 let+0.202
Neg logit contributions
by-0.712
rell-0.678
 awoke-0.671
ench-0.668
 destination-0.666
Token happy
Token ID3772
Feature activation-1.01e+00

Pos logit contributions
 we+0.370
 I+0.336
 who+0.306
 you+0.303
 yourselves+0.279
Neg logit contributions
by-0.380
rell-0.364
 awoke-0.362
ench-0.351
 destination-0.350
Token!"
Token ID2474
Feature activation-5.90e+00

Pos logit contributions
 we+0.048
 I+0.043
 who+0.041
Bad+0.040
 you+0.038
Neg logit contributions
by-0.053
 awoke-0.051
rell-0.051
 destination-0.050
ench-0.049
Token enjoyed
Token ID8359
Feature activation+1.76e+00

Pos logit contributions
 we+0.223
 I+0.196
 who+0.182
 you+0.177
 yourselves+0.173
Neg logit contributions
by-0.291
 awoke-0.276
rell-0.270
 destination-0.269
lying-0.261
Token playing
Token ID2712
Feature activation-1.72e+00

Pos logit contributions
by+0.108
 destination+0.101
 ease+0.100
rell+0.098
 awoke+0.098
Neg logit contributions
 we-0.073
 I-0.065
 who-0.053
 you-0.052
ms-0.051
Token in
Token ID287
Feature activation-3.06e+00

Pos logit contributions
 we+0.070
 I+0.062
 wasting+0.049
 you+0.049
 pige+0.048
Neg logit contributions
by-0.104
 awoke-0.100
rell-0.098
ench-0.098
 destination-0.098
Token the
Token ID262
Feature activation-3.29e+00

Pos logit contributions
 we+0.125
 I+0.107
 who+0.092
 you+0.091
 let+0.085
Neg logit contributions
by-0.192
 awoke-0.185
 destination-0.185
rell-0.183
 doorway-0.182
Token garden
Token ID11376
Feature activation+3.97e+00

Pos logit contributions
 we+0.189
 I+0.175
 you+0.162
 who+0.161
 yourselves+0.147
Neg logit contributions
by-0.158
rell-0.154
 awoke-0.150
lying-0.150
ench-0.145
Token.
Token ID13
Feature activation-3.46e-01

Pos logit contributions
by+0.222
rell+0.213
 awoke+0.211
 destination+0.207
ench+0.205
Neg logit contributions
 we-0.193
 I-0.174
 you-0.155
 who-0.155
 yourselves-0.142
Token<|endoftext|>
Token ID50256
Feature activation+6.84e+00

Pos logit contributions
 we+0.021
 I+0.019
 who+0.018
 you+0.017
 let+0.016
Neg logit contributions
by-0.016
rell-0.015
 awoke-0.015
 destination-0.015
ench-0.015
TokenOnce
Token ID7454
Feature activation+5.46e+00

Pos logit contributions
by+0.418
rell+0.404
 destination+0.396
 awoke+0.393
lying+0.392
Neg logit contributions
 we-0.308
 I-0.285
 who-0.254
 you-0.253
 let-0.224
Token upon
Token ID2402
Feature activation+1.09e+01

Pos logit contributions
rell+0.243
lying+0.239
 awoke+0.235
ench+0.221
wood+0.220
Neg logit contributions
 we-0.357
 I-0.326
 who-0.304
 you-0.303
 not-0.277
Token a
Token ID257
Feature activation-5.92e-01

Pos logit contributions
by+0.585
 awoke+0.577
rell+0.537
ench+0.531
 ease+0.528
Neg logit contributions
 we-0.525
 I-0.441
ms-0.406
 you-0.380
 yourselves-0.380
Token time
Token ID640
Feature activation+6.22e+00

Pos logit contributions
 we+0.038
 I+0.035
 who+0.034
 you+0.033
 let+0.032
Neg logit contributions
rell-0.024
by-0.023
 awoke-0.023
 destination-0.022
lying-0.022
Token wearing
Token ID5762
Feature activation-8.00e+00

Pos logit contributions
by+0.076
rell+0.073
 awoke+0.073
 destination+0.071
lying+0.071
Neg logit contributions
 we-0.058
 I-0.051
 who-0.047
 you-0.046
 let-0.041
Token pretty
Token ID2495
Feature activation-4.46e+00

Pos logit contributions
 we+0.362
 I+0.322
 who+0.289
 you+0.287
 let+0.258
Neg logit contributions
by-0.484
rell-0.469
 awoke-0.465
 destination-0.452
ench-0.450
Token dresses
Token ID27309
Feature activation+1.19e-01

Pos logit contributions
 we+0.214
 I+0.188
 who+0.176
 you+0.174
 let+0.155
Neg logit contributions
by-0.254
rell-0.246
 awoke-0.245
lying-0.242
ench-0.242
Token and
Token ID290
Feature activation-4.19e+00

Pos logit contributions
by+0.006
rell+0.006
 awoke+0.006
 destination+0.006
lying+0.006
Neg logit contributions
 we-0.006
 I-0.006
 you-0.005
 who-0.005
 yourselves-0.005
Token suits
Token ID14803
Feature activation+1.76e+00

Pos logit contributions
 we+0.206
 I+0.189
 who+0.174
 you+0.170
 yourselves+0.163
Neg logit contributions
by-0.228
rell-0.219
 awoke-0.213
 destination-0.204
zie-0.201
Token that
Token ID326
Feature activation-1.07e+01

Pos logit contributions
by+0.100
rell+0.097
 awoke+0.096
ench+0.093
 destination+0.093
Neg logit contributions
 we-0.086
 I-0.077
 who-0.070
 you-0.069
 let-0.063
Token fit
Token ID4197
Feature activation-1.93e+00

Pos logit contributions
 we+0.484
 I+0.434
 who+0.392
 you+0.388
 yourselves+0.356
Neg logit contributions
by-0.633
rell-0.613
 awoke-0.610
 destination-0.589
lying-0.581
Token them
Token ID606
Feature activation+5.04e+00

Pos logit contributions
 we+0.092
 I+0.083
 who+0.074
 you+0.073
 let+0.067
Neg logit contributions
by-0.112
rell-0.107
 awoke-0.106
 destination-0.103
lying-0.102
Token nicely
Token ID16576
Feature activation+2.49e+00

Pos logit contributions
by+0.304
rell+0.294
 awoke+0.292
 destination+0.284
ench+0.283
Neg logit contributions
 we-0.227
 I-0.203
 who-0.182
 you-0.180
 let-0.162
Token.
Token ID13
Feature activation+3.81e-01

Pos logit contributions
by+0.137
rell+0.129
 awoke+0.129
 destination+0.125
lying+0.124
Neg logit contributions
 we-0.126
 I-0.114
 you-0.103
 who-0.102
 yourselves-0.094
Token They
Token ID1119
Feature activation-7.22e-01

Pos logit contributions
by+0.021
rell+0.020
 awoke+0.020
 destination+0.019
ench+0.019
Neg logit contributions
 we-0.020
 I-0.018
 who-0.016
 you-0.016
 let-0.015
Token.
Token ID13
Feature activation-1.59e+00

Pos logit contributions
 we+0.122
 I+0.104
 who+0.096
Bad+0.086
ms+0.084
Neg logit contributions
by-0.300
 destination-0.292
ench-0.287
rell-0.285
 awoke-0.285
Token She
Token ID1375
Feature activation+3.24e+00

Pos logit contributions
 we+0.057
 wasting+0.053
 pige+0.049
ms+0.048
 spies+0.048
Neg logit contributions
 Eventually-0.092
 Today-0.089
 awoke-0.089
lying-0.087
rell-0.086
Token also
Token ID635
Feature activation-7.57e+00

Pos logit contributions
 awoke+0.163
by+0.160
rell+0.136
lying+0.135
 arrived+0.134
Neg logit contributions
 we-0.170
 I-0.155
 yourselves-0.152
 you-0.145
 spies-0.138
Token told
Token ID1297
Feature activation-5.45e+00

Pos logit contributions
 we+0.397
 I+0.356
 you+0.325
 yourselves+0.312
 spies+0.297
Neg logit contributions
by-0.395
 awoke-0.386
rell-0.355
lying-0.348
 doorway-0.334
Token him
Token ID683
Feature activation+1.07e+01

Pos logit contributions
 we+0.248
 I+0.221
 who+0.206
We+0.193
Bad+0.189
Neg logit contributions
by-0.311
rell-0.293
zie-0.286
 awoke-0.280
 destination-0.279
Token that
Token ID326
Feature activation-1.18e+01

Pos logit contributions
by+0.664
 awoke+0.618
rell+0.603
 destination+0.589
ench+0.583
Neg logit contributions
 we-0.451
 I-0.402
 who-0.359
 you-0.355
 yourselves-0.344
Token it
Token ID340
Feature activation-6.85e+00

Pos logit contributions
 we+0.424
 I+0.365
 who+0.353
Bad+0.346
 yourselves+0.343
Neg logit contributions
by-0.697
 awoke-0.674
rell-0.653
zie-0.635
 destination-0.623
Token's
Token ID338
Feature activation+9.49e+00

Pos logit contributions
 we+0.397
 I+0.368
 who+0.339
 you+0.336
 yourselves+0.329
Neg logit contributions
by-0.312
 awoke-0.302
rell-0.297
 destination-0.281
zie-0.274
Token important
Token ID1593
Feature activation-1.82e+00

Pos logit contributions
by+0.591
 awoke+0.548
 destination+0.548
rell+0.541
ench+0.537
Neg logit contributions
 we-0.392
 I-0.340
 you-0.282
 who-0.280
ms-0.271
Token to
Token ID284
Feature activation+3.64e+00

Pos logit contributions
 we+0.104
 I+0.095
 who+0.088
 you+0.087
 let+0.081
Neg logit contributions
by-0.088
rell-0.083
 awoke-0.083
 destination-0.080
lying-0.079
Token cl
Token ID537
Feature activation-4.12e+00

Pos logit contributions
by+0.207
rell+0.192
zie+0.189
 destination+0.188
ench+0.188
Neg logit contributions
 we-0.171
 I-0.149
 who-0.137
We-0.129
Bad-0.128
Token and
Token ID290
Feature activation-4.07e+00

Pos logit contributions
by+0.219
rell+0.213
 destination+0.206
 awoke+0.203
lying+0.203
Neg logit contributions
 we-0.180
 I-0.168
Bad-0.153
 yourselves-0.143
 you-0.143
Token sing
Token ID1702
Feature activation-5.82e+00

Pos logit contributions
 we+0.188
 I+0.175
Bad+0.174
 who+0.169
 wasting+0.161
Neg logit contributions
by-0.201
 awoke-0.190
rell-0.189
 destination-0.185
 stroll-0.172
Token very
Token ID845
Feature activation+8.47e+00

Pos logit contributions
 we+0.258
 I+0.236
 who+0.200
 you+0.197
 wasting+0.195
Neg logit contributions
by-0.347
 awoke-0.328
rell-0.326
 destination-0.318
lying-0.310
Token loud
Token ID7812
Feature activation-4.84e+00

Pos logit contributions
by+0.452
 destination+0.390
 awoke+0.363
leys+0.363
 farewell+0.360
Neg logit contributions
 we-0.395
 I-0.394
Bad-0.385
 letting-0.366
You-0.354
Token.
Token ID13
Feature activation+7.16e-01

Pos logit contributions
 we+0.227
 I+0.204
 who+0.180
 you+0.178
 let+0.165
Neg logit contributions
by-0.279
rell-0.271
 awoke-0.269
lying-0.259
 destination-0.258
Token We
Token ID775
Feature activation-1.06e+01

Pos logit contributions
 Eventually+0.036
 Today+0.036
rell+0.034
 Late+0.033
 awoke+0.033
Neg logit contributions
 we-0.030
 wasting-0.030
ms-0.029
ts-0.028
 greedy-0.028
Token make
Token ID787
Feature activation-9.43e+00

Pos logit contributions
 we+0.471
 I+0.425
 yourselves+0.380
 who+0.370
 you+0.367
Neg logit contributions
by-0.640
 awoke-0.618
rell-0.609
 destination-0.600
ench-0.589
Token up
Token ID510
Feature activation+6.43e+00

Pos logit contributions
 we+0.414
 I+0.387
 who+0.327
Bad+0.317
ms+0.309
Neg logit contributions
by-0.567
 destination-0.534
rell-0.524
 awoke-0.523
ench-0.500
Token a
Token ID257
Feature activation-5.79e-01

Pos logit contributions
by+0.345
 destination+0.337
 awoke+0.334
 home+0.323
rell+0.322
Neg logit contributions
 we-0.281
 I-0.270
Bad-0.237
You-0.226
 who-0.225
Token story
Token ID1621
Feature activation+5.76e+00

Pos logit contributions
 we+0.028
 I+0.027
 yourselves+0.024
 who+0.024
 you+0.023
Neg logit contributions
by-0.032
 destination-0.031
 awoke-0.029
 doorway-0.029
ench-0.028
Token with
Token ID351
Feature activation-9.86e+00

Pos logit contributions
by+0.313
 awoke+0.303
rell+0.300
 destination+0.298
lying+0.286
Neg logit contributions
 we-0.269
 I-0.253
Bad-0.229
 yourselves-0.220
 you-0.220
Token wanted
Token ID2227
Feature activation-3.89e+00

Pos logit contributions
by+0.380
rell+0.380
 awoke+0.362
ench+0.360
 destination+0.355
Neg logit contributions
 we-0.347
 I-0.314
 who-0.292
 you-0.283
 let-0.268
Token to
Token ID284
Feature activation+4.23e+00

Pos logit contributions
 we+0.204
 I+0.185
 who+0.170
 you+0.168
 let+0.155
Neg logit contributions
by-0.207
rell-0.202
 awoke-0.199
ench-0.193
lying-0.193
Token keep
Token ID1394
Feature activation-2.38e+00

Pos logit contributions
by+0.236
 destination+0.225
rell+0.225
 awoke+0.224
ench+0.219
Neg logit contributions
 we-0.203
 I-0.177
 who-0.165
 you-0.157
Bad-0.153
Token reading
Token ID3555
Feature activation-7.92e+00

Pos logit contributions
 we+0.120
 I+0.110
 who+0.097
 you+0.096
 let+0.090
Neg logit contributions
by-0.130
rell-0.124
 destination-0.123
 awoke-0.123
lying-0.118
Token,
Token ID11
Feature activation+5.18e+00

Pos logit contributions
 we+0.376
 I+0.337
 who+0.307
 you+0.304
 let+0.275
Neg logit contributions
by-0.460
rell-0.445
 awoke-0.440
ench-0.428
lying-0.426
Token but
Token ID475
Feature activation-8.02e+00

Pos logit contributions
by+0.322
rell+0.317
 awoke+0.317
 destination+0.310
lying+0.306
Neg logit contributions
 we-0.206
 I-0.180
 you-0.157
 who-0.153
 yourselves-0.147
Token he
Token ID339
Feature activation+2.45e+00

Pos logit contributions
 we+0.386
 I+0.347
 who+0.321
 you+0.312
 let+0.290
Neg logit contributions
rell-0.482
by-0.466
ench-0.455
lying-0.449
 awoke-0.448
Token didn
Token ID1422
Feature activation+2.37e+00

Pos logit contributions
rell+0.140
by+0.138
 awoke+0.133
ench+0.131
lying+0.131
Neg logit contributions
 we-0.119
 I-0.108
 who-0.100
 you-0.097
 let-0.091
Token't
Token ID470
Feature activation-4.50e+00

Pos logit contributions
by+0.154
rell+0.151
 awoke+0.149
 destination+0.146
ench+0.145
Neg logit contributions
 we-0.094
 I-0.082
 who-0.074
 you-0.072
 let-0.064
Token want
Token ID765
Feature activation-5.47e+00

Pos logit contributions
 we+0.203
 I+0.177
 who+0.160
 you+0.159
 yourselves+0.153
Neg logit contributions
by-0.269
rell-0.258
 awoke-0.256
lying-0.251
ench-0.250
Token to
Token ID284
Feature activation+4.27e+00

Pos logit contributions
 we+0.278
 I+0.257
 you+0.236
 who+0.235
 let+0.213
Neg logit contributions
rell-0.300
by-0.300
 awoke-0.295
ench-0.291
lying-0.288
Token not
Token ID407
Feature activation-8.57e+00

Pos logit contributions
by+0.076
rell+0.072
 awoke+0.071
 destination+0.070
ench+0.070
Neg logit contributions
 we-0.063
 I-0.056
 who-0.053
 you-0.050
 yourselves-0.046
Token to
Token ID284
Feature activation+4.06e+00

Pos logit contributions
 we+0.475
 I+0.432
 who+0.398
 you+0.394
 let+0.362
Neg logit contributions
by-0.431
rell-0.412
 awoke-0.407
 destination-0.394
ench-0.393
Token eat
Token ID4483
Feature activation-2.93e+00

Pos logit contributions
by+0.249
 destination+0.229
rell+0.229
ench+0.228
lying+0.223
Neg logit contributions
 we-0.174
 I-0.150
 who-0.134
 spies-0.130
 yourselves-0.128
Token me
Token ID502
Feature activation+8.21e+00

Pos logit contributions
 we+0.123
 I+0.107
 who+0.098
We+0.090
 let+0.088
Neg logit contributions
by-0.174
rell-0.168
ench-0.166
 destination-0.166
 awoke-0.163
Token,
Token ID11
Feature activation+4.96e+00

Pos logit contributions
by+0.413
rell+0.395
ench+0.388
 awoke+0.388
 destination+0.385
Neg logit contributions
 we-0.438
 I-0.395
 who-0.361
 you-0.352
Bad-0.336
Token I
Token ID314
Feature activation-1.26e+01

Pos logit contributions
 awoke+0.363
by+0.363
ench+0.361
 destination+0.361
rell+0.353
Neg logit contributions
ms-0.077
 we-0.063
 spies-0.063
 who-0.060
 pige-0.059
Token can
Token ID460
Feature activation-4.85e+00

Pos logit contributions
 we+0.631
 I+0.570
 who+0.518
 you+0.514
 let+0.469
Neg logit contributions
by-0.695
rell-0.669
 awoke-0.664
 destination-0.643
ench-0.640
Token show
Token ID905
Feature activation-8.57e+00

Pos logit contributions
 we+0.206
 I+0.180
 who+0.166
 you+0.156
 yourselves+0.154
Neg logit contributions
by-0.296
rell-0.286
ench-0.281
 destination-0.281
 awoke-0.280
Token you
Token ID345
Feature activation-6.99e+00

Pos logit contributions
 we+0.149
 I+0.091
 who+0.081
 exc+0.063
 spies+0.060
Neg logit contributions
by-0.721
rell-0.684
 destination-0.677
 awoke-0.673
zie-0.662
Token where
Token ID810
Feature activation-7.46e+00

Pos logit contributions
 we+0.266
 I+0.231
 who+0.197
 you+0.194
Bad+0.181
Neg logit contributions
by-0.464
 awoke-0.440
rell-0.437
ench-0.434
 destination-0.430
Token the
Token ID262
Feature activation-3.51e+00

Pos logit contributions
 we+0.102
 who+0.066
 I+0.060
Bad+0.036
ms+0.035
Neg logit contributions
by-0.641
 awoke-0.618
rell-0.613
 destination-0.595
zie-0.592