TOP ACTIVATIONS
MAX = 4.363

't care . She went to bed feeling brilliant , thinking
so much . She went to her mom my and said
his lesson . They went to their room and drew some
Tim my got to Billy 's house , they played with
goodbye . They ran back to the park . They looked
closed the drawer and went to play with her toys .
idea . They both went to Lily 's mom and requested
the coal away and went to play with something else .
't work . Lily went to her mom my with tears
favorite fruit . She went to bed that night with a
he heard a noise outside his room and his mother entered
Twe ety flew to Jerry 's house and knocked on the
As Tim went to the kitchen , he saw
herself safe . She went to ask her mum for a
to do . She went to her mom my and told
the table . She went to play with her toys .
finished her sandwich and went to play outside . She saw
help him . Tim went to bed , hoping that maybe
with Max . She went to Max and said sorry .
They all go to the car and drive to

MIN ACTIVATIONS
MIN = -4.739

sad and lonely . " Hello , elephant !"
what is inside . " Let 's lift the
like their uniform . One day , Lily and
make a fort . " Let 's use this
at the ground . " Look , Lily ,
one is there . " Let 's enter the
colors and shapes . " Let 's measure how
in the box . " Let 's lift the
are too high . " Let 's climb the
make a fort . " Let 's lock the
wet and muddy . " Can we go to
on the fence . " Hello , little ones
to the top . " Ready , set ,
to climb it . " Let 's go up
is a flood . " Oh no !" Lily
had a flag . " Wow , look at
a big box . " What can we do
from the sky . " Mom , can we
was behind it . " Let 's go to
what is inside . " Let 's grab the

INTERVAL 2.088 - 4.363
CONTAINS 1.871%

did not want to go to his car . They refused
not go away . They want to join the line .
told him that they needed to replace the truck .
. The cater pillar wanted to go on an adventure .
. Tim went to the park and saw his

INTERVAL -0.188 - 2.088
CONTAINS 57.883%

and they both became friends . The girl was so proud
They cut paper , fold paper , and draw on paper
?" Sam brave ly nodded his head and
's dad stopped in his tracks . He stared and point
your weight . You have to be careful ." Tim my

INTERVAL -2.464 - -0.188
CONTAINS 38.082%

to hold the bird , but he dropped his gun .
learn how to talk like him too ?" Mom laughed ,
he was tired he was proud of himself for trying .
“ That is the license , ”
has many branches and leaves . They want to chop it

INTERVAL -4.739 - -2.464
CONTAINS 2.164%

friend 's birthday . The helpful lady took the
't permit that ." Ab by was disappointed and
my smiled and said , " That 's wonderful , Lily
animals and people . " Look , Tom ,
and Sam cheered . They said , " We did it
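
NOTE ON INTERVALS
The four INTERVAL ranges above look like equal-width bins between MIN and MAX, and the CONTAINS percentages sum to 100. The sketch below checks that reading. The helper name `equal_width_bins` and the bin count of 4 are assumptions made for illustration, not something the dashboard states.

```python
def equal_width_bins(lo, hi, n_bins):
    """Split [lo, hi] into n_bins equal-width (low, high) intervals."""
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins)] + [hi]
    return list(zip(edges[:-1], edges[1:]))


# Values copied from the MIN / MAX lines above.
MIN_ACT, MAX_ACT = -4.739, 4.363

# Assumption: the dashboard uses 4 equal-width bins.
bins = equal_width_bins(MIN_ACT, MAX_ACT, 4)
for low, high in reversed(bins):  # highest interval first, as listed above
    print(f"INTERVAL {low:.3f} - {high:.3f}")

# Percentages copied from the CONTAINS lines, highest interval first.
shares = [1.871, 57.883, 38.082, 2.164]
print(f"total = {sum(shares):.3f}%")
```

The computed edges agree with the listed bounds to within rounding at the third decimal. Nearly 96% of tokens fall in the two middle bins, so the TOP and MIN example lists are drawn from thin tails of the distribution.
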
Token't
Token ID470
Feature activation+4.11e-01

Pos logit contributions
ww+0.890
dd+0.706
www+0.666
lish+0.662
P+0.632
Neg logit contributions
 search-0.745
 bed-0.739
 lap-0.718
 call-0.717
 bank-0.709
Token care
Token ID1337
Feature activation-3.40e-01

Pos logit contributions
 bed+0.498
 search+0.494
 lap+0.488
 bank+0.472
 call+0.459
Neg logit contributions
ww-1.196
dd-1.003
www-0.967
lish-0.960
ha-0.925
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
ww+0.977
dd+0.816
www+0.783
lish+0.775
P+0.754
Neg logit contributions
 search-0.433
 bed-0.428
 lap-0.411
 call-0.410
 bank-0.403
Token She
Token ID1375
Feature activation+1.39e+00

Pos logit contributions
 search+2.041
 bank+1.998
 call+1.971
 bed+1.913
 lap+1.875
Neg logit contributions
ww-4.246
dd-3.445
www-3.380
lish-3.377
P-3.297
Token went
Token ID1816
Feature activation+1.56e+00

Pos logit contributions
 bed+2.457
 lap+2.418
 search+2.414
 driveway+2.306
 call+2.296
Neg logit contributions
ww-3.321
dd-2.802
lish-2.584
www-2.569
P-2.496
Token to
Token ID284
Feature activation+4.36e+00

Pos logit contributions
 backs+1.967
 search+1.958
 eye+1.919
 lap+1.913
 call+1.896
Neg logit contributions
ww-4.227
dd-3.550
lish-3.437
P-3.381
www-3.368
Token bed
Token ID3996
Feature activation+1.26e+00

Pos logit contributions
 backs+8.914
 bank+8.872
 driveway+8.657
 hood+8.354
anks+8.306
Neg logit contributions
ww-8.800
 oh-7.773
 you-7.771
"-6.768
dd-6.673
Token feeling
Token ID4203
Feature activation+7.90e-01

Pos logit contributions
 search+2.038
 eye+1.945
 call+1.932
 backs+1.918
 bank+1.901
Neg logit contributions
ww-3.092
dd-2.598
lish-2.467
www-2.386
bly-2.343
Token brilliant
Token ID10457
Feature activation+1.33e-01

Pos logit contributions
 search+1.425
 lap+1.395
 bed+1.381
 bank+1.372
 call+1.368
Neg logit contributions
ww-1.798
dd-1.474
lish-1.403
www-1.337
bly-1.329
Token,
Token ID11
Feature activation+9.28e-01

Pos logit contributions
 search+0.161
 bed+0.156
 lap+0.152
 call+0.152
 bank+0.149
Neg logit contributions
ww-0.389
dd-0.330
lish-0.316
www-0.313
iest-0.303
Token thinking
Token ID3612
Feature activation-3.84e-01

Pos logit contributions
 search+1.428
 bank+1.375
 bed+1.371
 call+1.366
 lap+1.359
Neg logit contributions
ww-2.340
dd-1.939
lish-1.846
www-1.809
bly-1.742
Token so
Token ID523
Feature activation-1.02e-01

Pos logit contributions
 search+0.682
 bed+0.656
 lap+0.643
 call+0.638
 bank+0.638
Neg logit contributions
ww-1.617
dd-1.345
lish-1.314
www-1.298
iest-1.266
Token much
Token ID881
Feature activation+6.24e-01

Pos logit contributions
ww+0.167
dd+0.119
www+0.109
lish+0.107
P+0.100
Neg logit contributions
 search-0.257
 bed-0.256
 lap-0.251
 call-0.251
 bank-0.250
Token.
Token ID13
Feature activation+1.73e+00

Pos logit contributions
 search+0.727
 lap+0.689
 bank+0.687
 bed+0.679
 call+0.668
Neg logit contributions
ww-1.866
dd-1.566
lish-1.514
www-1.512
ha-1.472
Token She
Token ID1375
Feature activation+1.43e+00

Pos logit contributions
 search+2.235
 eye+2.146
 bed+2.134
 bank+2.126
 call+2.125
Neg logit contributions
ww-4.876
dd-3.962
www-3.931
lish-3.916
P-3.809
Token went
Token ID1816
Feature activation+2.62e+00

Pos logit contributions
 lap+2.653
 bed+2.608
 search+2.547
 eye+2.420
 bank+2.414
Neg logit contributions
ww-3.274
dd-2.698
www-2.496
lish-2.483
P-2.469
Token to
Token ID284
Feature activation+4.35e+00

Pos logit contributions
 backs+4.009
 eyes+3.849
 eye+3.847
 lap+3.835
 bank+3.817
Neg logit contributions
ww-6.437
dd-5.254
P-5.108
ha-5.023
www-5.000
Token her
Token ID607
Feature activation+2.39e+00

Pos logit contributions
 bank+10.607
 backs+10.073
yard+9.758
 banks+9.741
anks+9.736
Neg logit contributions
ww-7.564
 you-6.583
 oh-6.515
"-6.076
www-5.565
Token mom
Token ID1995
Feature activation+1.93e+00

Pos logit contributions
 bank+3.360
 backs+3.181
 search+3.149
 lap+3.128
 call+3.082
Neg logit contributions
ww-6.211
P-4.985
lish-4.915
a-4.866
dd-4.858
Tokenmy
Token ID1820
Feature activation+2.45e+00

Pos logit contributions
 bank+3.255
 backs+3.226
ses+3.167
 tears+3.161
 bed+3.145
Neg logit contributions
ww-4.399
lish-3.533
P-3.455
dd-3.441
"-3.346
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
ses+3.551
 backs+3.520
 bank+3.440
 tears+3.364
 lap+3.348
Neg logit contributions
ww-6.171
dd-5.075
lish-4.947
"-4.932
a-4.844
Token said
Token ID531
Feature activation-2.05e+00

Pos logit contributions
 lap+1.416
 search+1.394
 bed+1.393
 bank+1.380
 eye+1.375
Neg logit contributions
ww-2.838
dd-2.386
www-2.266
P-2.230
lish-2.227
Token his
Token ID465
Feature activation+1.62e+00

Pos logit contributions
 search+1.821
 lap+1.782
 bed+1.749
 bank+1.734
 driveway+1.726
Neg logit contributions
ww-2.383
dd-1.871
lish-1.815
www-1.765
iest-1.746
Token lesson
Token ID11483
Feature activation-8.72e-02

Pos logit contributions
 search+2.493
 lap+2.488
 bed+2.486
 bank+2.453
 call+2.426
Neg logit contributions
ww-4.052
iest-3.107
dd-3.100
lish-3.094
www-3.068
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
ww+0.271
dd+0.231
www+0.222
lish+0.222
ha+0.215
Neg logit contributions
 search-0.089
 bed-0.087
 lap-0.084
 call-0.083
 bank-0.082
Token They
Token ID1119
Feature activation+1.99e+00

Pos logit contributions
 bank+1.446
 search+1.434
 lap+1.388
 bed+1.379
 call+1.346
Neg logit contributions
ww-3.888
dd-3.206
lish-3.113
www-3.092
P-3.026
Token went
Token ID1816
Feature activation+1.50e+00

Pos logit contributions
 bed+4.022
 lap+3.938
 search+3.827
 eyes+3.691
 driveway+3.626
Neg logit contributions
ww-4.293
dd-3.516
ha-3.247
P-3.239
www-3.238
Token to
Token ID284
Feature activation+4.29e+00

Pos logit contributions
 search+1.883
 eye+1.830
 backs+1.822
 bed+1.821
 lap+1.820
Neg logit contributions
ww-4.107
dd-3.385
lish-3.290
P-3.290
www-3.229
Token their
Token ID511
Feature activation+2.76e+00

Pos logit contributions
 eye+9.769
 backs+9.365
 bank+9.353
anks+9.223
 lap+9.173
Neg logit contributions
ww-7.559
 you-6.065
 oh-5.962
"-5.901
Anna-5.298
Token room
Token ID2119
Feature activation+2.48e+00

Pos logit contributions
 eye+3.124
 call+3.109
 lap+3.060
 search+3.026
 bank+2.889
Neg logit contributions
ww-7.723
lish-6.121
P-6.059
www-6.045
a-5.918
Token and
Token ID290
Feature activation+2.74e+00

Pos logit contributions
 search+3.515
 call+3.450
 backs+3.436
 bank+3.410
 eye+3.341
Neg logit contributions
ww-6.133
dd-5.125
lish-5.037
a-4.947
"-4.890
Token drew
Token ID9859
Feature activation+9.90e-01

Pos logit contributions
anks+4.145
 backs+4.138
 lap+4.121
 eye+4.083
 bed+4.006
Neg logit contributions
ww-6.576
dd-5.320
P-5.113
www-5.055
a-5.031
Token some
Token ID617
Feature activation+7.50e-01

Pos logit contributions
 search+1.650
 lap+1.605
 bed+1.588
 bank+1.581
 call+1.568
Neg logit contributions
ww-2.380
dd-1.948
lish-1.900
iest-1.805
P-1.804
Token Tim
Token ID5045
Feature activation+1.95e+00

Pos logit contributions
ww+2.281
dd+1.905
lish+1.828
www+1.814
iest+1.752
Neg logit contributions
 search-1.176
 bed-1.171
 call-1.104
 bank-1.084
 lap-1.076
Tokenmy
Token ID1820
Feature activation+1.35e+00

Pos logit contributions
 bank+3.204
 backs+3.167
 lap+3.163
 bed+3.162
 tears+3.152
Neg logit contributions
ww-4.957
ha-3.809
dd-3.797
www-3.786
P-3.706
Token got
Token ID1392
Feature activation-7.12e-02

Pos logit contributions
 bed+2.223
 lap+2.209
 search+2.200
 backs+2.181
 bank+2.155
Neg logit contributions
ww-3.298
dd-2.660
ha-2.553
www-2.515
P-2.440
Token to
Token ID284
Feature activation+2.63e+00

Pos logit contributions
ww+0.198
dd+0.165
www+0.158
lish+0.156
P+0.152
Neg logit contributions
 search-0.097
 bed-0.095
 lap-0.093
 call-0.092
 bank-0.091
Token Billy
Token ID15890
Feature activation+2.70e+00

Pos logit contributions
 bank+4.913
 lap+4.779
 backs+4.691
 eye+4.688
 doorstep+4.568
Neg logit contributions
ww-6.000
dd-4.422
www-4.420
ha-4.224
 oh-4.160
Token's
Token ID338
Feature activation+4.25e+00

Pos logit contributions
 tears+5.334
ses+5.304
 backs+5.248
 lap+5.127
 eyes+5.025
Neg logit contributions
ww-5.552
dd-4.260
lish-4.168
"-4.120
ha-4.104
Token house
Token ID2156
Feature activation+8.03e-01

Pos logit contributions
 bank+5.689
growth+5.685
 ease+5.463
 search+5.311
 backs+5.294
Neg logit contributions
ww-11.734
www-8.936
 oh-8.663
P-8.585
"-8.564
Token,
Token ID11
Feature activation-8.36e-01

Pos logit contributions
 search+1.390
 lap+1.380
 bed+1.356
 backs+1.355
 bank+1.351
Neg logit contributions
ww-1.842
dd-1.467
lish-1.404
iest-1.393
www-1.393
Token they
Token ID484
Feature activation+1.15e+00

Pos logit contributions
ww+2.247
dd+1.846
www+1.769
lish+1.762
P+1.692
Neg logit contributions
 search-1.271
 bed-1.259
 call-1.210
 bank-1.187
 lap-1.174
Token played
Token ID2826
Feature activation+2.59e-02

Pos logit contributions
 lap+1.797
 bed+1.764
 search+1.722
 eye+1.679
 backs+1.667
Neg logit contributions
ww-2.883
dd-2.445
ha-2.290
www-2.251
P-2.216
Token with
Token ID351
Feature activation+1.33e+00

Pos logit contributions
 search+0.050
 bed+0.049
 lap+0.048
 call+0.048
 bank+0.048
Neg logit contributions
ww-0.057
dd-0.045
www-0.042
lish-0.042
P-0.040
Token goodbye
Token ID24829
Feature activation+4.00e-01

Pos logit contributions
ww+3.546
dd+2.836
www+2.756
lish+2.717
P+2.638
Neg logit contributions
 bed-2.534
 search-2.447
 call-2.420
 eyes-2.398
 lap-2.381
Token.
Token ID13
Feature activation+5.13e-01

Pos logit contributions
 search+0.434
 bed+0.428
 call+0.407
 lap+0.406
 bank+0.404
Neg logit contributions
ww-1.200
dd-1.019
lish-0.981
www-0.971
ha-0.940
Token They
Token ID1119
Feature activation+2.07e+00

Pos logit contributions
 search+0.498
 bed+0.484
 bank+0.470
 lap+0.465
 call+0.459
Neg logit contributions
ww-1.634
dd-1.388
www-1.333
lish-1.322
P-1.309
Token ran
Token ID4966
Feature activation+6.09e-01

Pos logit contributions
 lap+3.459
 bed+3.388
 search+3.357
 eyes+3.248
 backs+3.197
Neg logit contributions
ww-5.145
dd-4.285
P-4.062
ha-4.022
www-3.962
Token back
Token ID736
Feature activation+2.15e+00

Pos logit contributions
 search+0.742
 bed+0.724
 lap+0.711
 call+0.706
 eye+0.694
Neg logit contributions
ww-1.758
dd-1.482
www-1.406
lish-1.401
ha-1.382
Token to
Token ID284
Feature activation+4.21e+00

Pos logit contributions
 eye+3.213
 lap+3.183
 tears+3.175
 call+3.145
 bank+3.113
Neg logit contributions
ww-5.460
dd-4.443
ha-4.230
P-4.178
www-4.175
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
 eye+10.304
 bank+10.210
 doorstep+9.895
anks+9.790
 lap+9.685
Neg logit contributions
ww-6.935
 oh-5.423
dd-5.402
"-5.141
 you-4.854
Token park
Token ID3952
Feature activation+2.64e+00

Pos logit contributions
 search+1.226
 lap+1.111
 bed+1.092
 backs+1.073
 call+1.065
Neg logit contributions
ww-3.712
dd-3.037
www-3.034
lish-2.963
P-2.955
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
 search+4.088
 backs+4.085
 bed+4.027
 eye+4.012
 eyes+3.951
Neg logit contributions
ww-6.158
dd-4.983
"-4.960
lish-4.907
a-4.755
Token They
Token ID1119
Feature activation+2.37e+00

Pos logit contributions
 bank+1.157
 search+1.148
 bed+1.120
 lap+1.096
 eye+1.092
Neg logit contributions
ww-3.370
dd-2.840
www-2.713
P-2.674
lish-2.664
Token looked
Token ID3114
Feature activation-3.66e-01

Pos logit contributions
 lap+4.154
 bed+4.093
 eyes+4.085
 search+3.995
 backs+3.935
Neg logit contributions
ww-5.501
dd-4.624
P-4.384
ha-4.332
a-4.313
Token closed
Token ID4838
Feature activation+1.79e+00

Pos logit contributions
 search+1.597
 lap+1.565
 bed+1.562
 call+1.501
 bank+1.490
Neg logit contributions
ww-2.521
dd-2.098
lish-1.965
www-1.952
P-1.950
Token the
Token ID262
Feature activation+1.26e+00

Pos logit contributions
 search+2.905
 call+2.903
 bank+2.859
 lap+2.781
 eye+2.776
Neg logit contributions
ww-4.284
dd-3.717
lish-3.505
bly-3.365
P-3.337
Token drawer
Token ID33451
Feature activation+1.92e+00

Pos logit contributions
 search+1.307
 bank+1.119
 call+1.119
 lap+1.113
 bed+1.107
Neg logit contributions
ww-3.901
lish-3.247
www-3.185
dd-3.173
P-3.076
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
 search+2.585
 call+2.515
 bed+2.482
 bank+2.459
 backs+2.416
Neg logit contributions
ww-5.007
lish-4.159
dd-4.152
iest-4.084
a-4.024
Token went
Token ID1816
Feature activation+2.08e+00

Pos logit contributions
 search+1.532
 lap+1.523
 bank+1.471
 call+1.465
 bed+1.455
Neg logit contributions
ww-2.652
dd-2.224
www-2.071
lish-2.058
ha-2.008
Token to
Token ID284
Feature activation+4.21e+00

Pos logit contributions
 backs+3.002
 bank+2.913
 search+2.881
 cries+2.821
 eye+2.813
Neg logit contributions
ww-5.276
dd-4.373
P-4.241
lish-4.189
ha-4.152
Token play
Token ID711
Feature activation+9.76e-01

Pos logit contributions
 bank+8.215
 backs+7.709
anks+7.389
ses+7.334
 hood+7.249
Neg logit contributions
ww-9.422
 you-8.163
 oh-7.903
"-7.733
www-7.463
Token with
Token ID351
Feature activation+1.48e+00

Pos logit contributions
 search+1.608
 backs+1.596
 bank+1.584
 call+1.569
 bed+1.561
Neg logit contributions
ww-2.279
dd-1.889
lish-1.854
P-1.785
iest-1.740
Token her
Token ID607
Feature activation+1.33e+00

Pos logit contributions
 bank+2.305
 lap+2.295
 search+2.263
 driveway+2.253
 eye+2.252
Neg logit contributions
ww-3.650
dd-3.110
lish-2.991
P-2.901
bly-2.853
Token toys
Token ID14958
Feature activation+1.95e+00

Pos logit contributions
 search+1.931
 bank+1.883
 lap+1.883
 call+1.853
 eye+1.839
Neg logit contributions
ww-3.427
dd-2.812
lish-2.773
www-2.663
iest-2.646
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
 search+2.685
 backs+2.625
 lap+2.623
 call+2.570
 bed+2.527
Neg logit contributions
ww-5.008
dd-4.294
lish-4.192
iest-4.124
ha-4.003
Token idea
Token ID2126
Feature activation+7.40e-01

Pos logit contributions
 backs+2.761
 lap+2.725
 bank+2.699
 search+2.658
 slam+2.580
Neg logit contributions
ww-5.617
iest-4.557
www-4.437
ha-4.321
dd-4.307
Token.
Token ID13
Feature activation+2.12e+00

Pos logit contributions
 search+0.962
 bed+0.961
 bank+0.952
 lap+0.934
 call+0.910
Neg logit contributions
ww-2.040
dd-1.703
lish-1.646
www-1.626
iest-1.600
Token They
Token ID1119
Feature activation+1.84e+00

Pos logit contributions
 search+3.198
 bed+3.052
 call+3.023
 bank+2.993
 eye+2.951
Neg logit contributions
ww-5.609
lish-4.453
www-4.438
dd-4.320
ha-4.192
Token both
Token ID1111
Feature activation+1.01e+00

Pos logit contributions
 bed+3.125
 lap+3.118
 search+2.886
 cries+2.808
 tears+2.805
Neg logit contributions
ww-4.496
dd-3.849
ha-3.606
P-3.577
www-3.512
Token went
Token ID1816
Feature activation+9.29e-01

Pos logit contributions
 bed+1.719
 search+1.647
 lap+1.646
 call+1.575
 bank+1.567
Neg logit contributions
ww-2.514
dd-2.094
www-1.955
ha-1.937
lish-1.927
Token to
Token ID284
Feature activation+4.18e+00

Pos logit contributions
 search+1.189
 eye+1.188
 bed+1.180
 lap+1.178
 tears+1.148
Neg logit contributions
ww-2.549
dd-2.121
www-2.026
lish-2.007
P-1.989
Token Lily
Token ID20037
Feature activation+2.61e+00

Pos logit contributions
anks+9.647
 eye+9.526
 backs+9.440
 lap+9.363
 bank+9.263
Neg logit contributions
ww-6.858
 oh-6.011
"-5.478
 you-5.376
dd-4.983
Token's
Token ID338
Feature activation+3.48e+00

Pos logit contributions
 backs+4.942
 tears+4.880
 eyes+4.836
 lap+4.834
 bed+4.773
Neg logit contributions
ww-5.298
dd-4.133
a-4.131
lish-4.004
"-3.984
Token mom
Token ID1995
Feature activation+1.87e+00

Pos logit contributions
 choke+3.653
anks+3.615
 call+3.529
 bank+3.499
 carve+3.455
Neg logit contributions
ww-9.720
"-7.918
lish-7.796
 oh-7.740
P-7.710
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
 bed+3.115
 tears+3.059
 lap+3.045
 backs+3.038
 bank+3.009
Neg logit contributions
ww-4.267
dd-3.465
lish-3.405
a-3.312
www-3.244
Token requested
Token ID9167
Feature activation-7.05e-01

Pos logit contributions
 lap+2.524
 search+2.455
 bed+2.455
 eye+2.415
 backs+2.405
Neg logit contributions
ww-4.178
dd-3.485
www-3.285
P-3.260
ha-3.224
Token the
Token ID262
Feature activation+2.61e-01

Pos logit contributions
 lap+2.490
 doorstep+2.486
 search+2.485
 bed+2.467
 call+2.415
Neg logit contributions
ww-3.691
dd-3.020
lish-2.903
w-2.776
www-2.736
Token coal
Token ID5655
Feature activation+1.13e+00

Pos logit contributions
 search+0.364
 bed+0.357
 lap+0.353
 bank+0.342
 call+0.342
Neg logit contributions
ww-0.715
dd-0.592
www-0.563
lish-0.561
ha-0.547
Token away
Token ID1497
Feature activation+6.79e-01

Pos logit contributions
 search+1.756
 bed+1.740
 lap+1.692
 call+1.675
 doorstep+1.618
Neg logit contributions
ww-2.874
dd-2.356
lish-2.260
www-2.179
iest-2.167
Token and
Token ID290
Feature activation+1.85e+00

Pos logit contributions
 search+0.895
 bed+0.876
 lap+0.864
 call+0.854
 bank+0.829
Neg logit contributions
ww-1.877
dd-1.564
lish-1.523
iest-1.483
www-1.474
Token went
Token ID1816
Feature activation+1.38e+00

Pos logit contributions
 search+3.063
 lap+3.030
 doorstep+2.973
 bed+2.958
 call+2.900
Neg logit contributions
ww-4.386
dd-3.626
lish-3.401
www-3.272
ha-3.213
Token to
Token ID284
Feature activation+4.17e+00

Pos logit contributions
 search+1.788
 lap+1.712
 eye+1.707
 bed+1.694
 call+1.671
Neg logit contributions
ww-3.768
dd-3.141
lish-3.042
P-2.986
www-2.945
Token play
Token ID711
Feature activation+1.15e+00

Pos logit contributions
 lap+8.704
 backs+8.436
 bank+8.398
 doorstep+8.319
 trunk+8.291
Neg logit contributions
ww-8.327
 oh-6.667
 you-6.402
dd-6.070
"-5.866
Token with
Token ID351
Feature activation+1.04e+00

Pos logit contributions
 search+2.112
 bed+2.065
 lap+2.062
 backs+2.035
 call+2.018
Neg logit contributions
ww-2.489
dd-2.044
lish-2.009
P-1.919
bly-1.900
Token something
Token ID1223
Feature activation-8.66e-01

Pos logit contributions
 lap+1.916
 search+1.902
 bed+1.854
 call+1.852
 bank+1.835
Neg logit contributions
ww-2.325
dd-1.913
lish-1.793
www-1.721
P-1.704
Token else
Token ID2073
Feature activation+1.65e-01

Pos logit contributions
ww+1.650
dd+1.229
www+1.160
lish+1.106
P+1.066
Neg logit contributions
 search-1.981
 bed-1.966
 call-1.931
 lap-1.912
 bank-1.900
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
 search+0.171
 bed+0.169
 lap+0.162
 call+0.161
 bank+0.157
Neg logit contributions
ww-0.505
dd-0.431
lish-0.414
www-0.411
P-0.399
Token't
Token ID470
Feature activation+6.51e-01

Pos logit contributions
ww+0.605
dd+0.479
www+0.451
lish+0.447
P+0.428
Neg logit contributions
 search-0.517
 bed-0.513
 lap-0.499
 call-0.498
 bank-0.494
Token work
Token ID670
Feature activation-2.85e-01

Pos logit contributions
 lap+0.462
 bed+0.445
 bank+0.441
 search+0.441
 backs+0.406
Neg logit contributions
ww-2.221
dd-1.899
www-1.851
lish-1.835
ha-1.813
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
ww+0.872
dd+0.739
www+0.710
lish+0.705
P+0.686
Neg logit contributions
 search-0.307
 bed-0.302
 lap-0.288
 call-0.287
 bank-0.283
Token Lily
Token ID20037
Feature activation+1.17e+00

Pos logit contributions
 search+1.880
 call+1.814
 eye+1.803
 bank+1.794
 bed+1.794
Neg logit contributions
ww-3.980
dd-3.262
lish-3.196
www-3.177
P-3.121
Token went
Token ID1816
Feature activation+1.57e+00

Pos logit contributions
 bed+1.942
 lap+1.910
 search+1.908
 call+1.811
 eye+1.784
Neg logit contributions
ww-2.873
dd-2.404
lish-2.274
www-2.237
ha-2.202
Token to
Token ID284
Feature activation+4.17e+00

Pos logit contributions
 search+2.007
 lap+1.991
 backs+1.990
 eye+1.989
 bed+1.919
Neg logit contributions
ww-4.259
dd-3.539
lish-3.424
www-3.403
P-3.368
Token her
Token ID607
Feature activation+2.96e+00

Pos logit contributions
 bank+8.992
 backs+8.524
 driveway+8.266
anks+8.149
 lap+8.116
Neg logit contributions
ww-7.978
 oh-7.204
 you-6.988
"-6.465
dd-6.178
Token mom
Token ID1995
Feature activation+1.72e+00

Pos logit contributions
 bank+3.757
 backs+3.712
anks+3.535
 search+3.531
 lap+3.530
Neg logit contributions
ww-7.772
lish-6.370
a-6.275
 oh-6.157
P-6.134
Tokenmy
Token ID1820
Feature activation+2.27e+00

Pos logit contributions
 bank+2.777
 backs+2.758
 bed+2.740
 tears+2.711
 lap+2.696
Neg logit contributions
ww-3.996
lish-3.220
dd-3.148
P-3.092
a-3.037
Token with
Token ID351
Feature activation+1.77e+00

Pos logit contributions
 backs+3.162
ses+3.119
 lap+3.073
 bank+3.034
 eye+3.021
Neg logit contributions
ww-5.769
dd-4.756
lish-4.672
"-4.518
a-4.512
Token tears
Token ID10953
Feature activation+7.12e-01

Pos logit contributions
 bank+3.069
 lap+3.020
 driveway+2.928
 doorstep+2.915
 backs+2.882
Neg logit contributions
ww-4.127
dd-3.402
P-3.231
lish-3.196
bly-3.133
Token favorite
Token ID4004
Feature activation+1.12e+00

Pos logit contributions
 search+2.248
 lap+2.241
 call+2.234
 bed+2.218
 backs+2.177
Neg logit contributions
ww-3.087
dd-2.440
www-2.338
lish-2.317
iest-2.289
Token fruit
Token ID8234
Feature activation+1.08e-01

Pos logit contributions
 call+1.678
 search+1.671
 lap+1.668
 bed+1.667
 backs+1.645
Neg logit contributions
ww-2.877
dd-2.305
www-2.243
P-2.204
lish-2.198
Token.
Token ID13
Feature activation+1.47e+00

Pos logit contributions
 search+0.130
 bed+0.127
 call+0.123
 lap+0.123
 bank+0.120
Neg logit contributions
ww-0.318
dd-0.266
www-0.256
lish-0.255
P-0.248
Token She
Token ID1375
Feature activation+1.30e+00

Pos logit contributions
 search+2.009
 bank+1.945
 call+1.887
 eye+1.825
 bed+1.817
Neg logit contributions
ww-4.223
dd-3.428
www-3.342
ha-3.221
P-3.195
Token went
Token ID1816
Feature activation+1.69e+00

Pos logit contributions
 search+2.199
 lap+2.146
 bed+2.142
 call+2.080
 bank+2.065
Neg logit contributions
ww-3.227
dd-2.658
www-2.477
P-2.414
ha-2.392
Token to
Token ID284
Feature activation+4.16e+00

Pos logit contributions
 backs+2.252
 search+2.232
 eye+2.196
 bank+2.188
 lap+2.152
Neg logit contributions
ww-4.528
dd-3.786
www-3.586
P-3.544
lish-3.534
Token bed
Token ID3996
Feature activation+1.12e+00

Pos logit contributions
 bank+8.731
 backs+8.440
 eye+8.136
 hood+7.919
anks+7.897
Neg logit contributions
ww-8.928
 you-7.213
 oh-6.899
dd-6.817
www-6.771
Token that
Token ID326
Feature activation+3.23e-01

Pos logit contributions
 search+1.765
 eye+1.693
 call+1.666
 backs+1.665
 bank+1.664
Neg logit contributions
ww-2.808
dd-2.350
lish-2.212
www-2.175
bly-2.119
Token night
Token ID1755
Feature activation+2.70e-01

Pos logit contributions
 search+0.537
 bed+0.528
 bank+0.527
 lap+0.525
 call+0.522
Neg logit contributions
ww-0.793
dd-0.639
www-0.607
lish-0.600
iest-0.583
Token with
Token ID351
Feature activation+1.33e+00

Pos logit contributions
 search+0.370
 bed+0.359
 lap+0.354
 bank+0.351
 call+0.350
Neg logit contributions
ww-0.741
dd-0.617
lish-0.586
www-0.586
ha-0.564
Token a
Token ID257
Feature activation+7.51e-01

Pos logit contributions
 bank+2.281
 lap+2.235
 search+2.225
 backs+2.150
 driveway+2.150
Neg logit contributions
ww-3.193
dd-2.598
lish-2.428
www-2.386
P-2.362
Token he
Token ID339
Feature activation-8.47e-02

Pos logit contributions
ww+2.039
dd+1.672
www+1.616
lish+1.594
P+1.538
Neg logit contributions
 bed-1.110
 search-1.104
 call-1.049
 bank-1.033
 lap-1.028
Token heard
Token ID2982
Feature activation+6.29e-01

Pos logit contributions
ww+0.223
dd+0.184
www+0.175
lish+0.174
P+0.169
Neg logit contributions
 search-0.128
 bed-0.126
 lap-0.123
 call-0.122
 bank-0.121
Token a
Token ID257
Feature activation+3.78e-01

Pos logit contributions
 search+1.049
 lap+1.021
 bed+1.009
 bank+1.007
 call+0.985
Neg logit contributions
ww-1.540
dd-1.263
lish-1.189
www-1.172
ha-1.151
Token noise
Token ID7838
Feature activation+8.40e-01

Pos logit contributions
 search+0.529
 bed+0.523
 lap+0.511
 bank+0.506
 eye+0.499
Neg logit contributions
ww-1.032
dd-0.838
www-0.815
lish-0.813
ha-0.781
Token outside
Token ID2354
Feature activation+1.47e+00

Pos logit contributions
 bed+1.234
 lap+1.231
 search+1.228
 bank+1.222
 driveway+1.165
Neg logit contributions
ww-2.179
dd-1.841
lish-1.756
www-1.701
iest-1.676
Token his
Token ID465
Feature activation+4.10e+00

Pos logit contributions
 lap+1.991
 search+1.983
 bank+1.952
 eye+1.903
 bed+1.878
Neg logit contributions
ww-3.950
dd-3.316
lish-3.202
iest-3.151
www-3.101
Token room
Token ID2119
Feature activation+1.71e+00

Pos logit contributions
 bank+5.076
 search+4.956
 ease+4.461
 carve+4.395
 lap+4.130
Neg logit contributions
ww-12.642
a-10.142
lish-9.927
www-9.680
P-9.571
Token and
Token ID290
Feature activation-2.91e-01

Pos logit contributions
 search+2.204
 lap+2.122
 bed+2.087
 bank+2.065
 backs+2.056
Neg logit contributions
ww-4.712
dd-3.953
lish-3.868
www-3.775
iest-3.702
Token his
Token ID465
Feature activation+2.33e+00

Pos logit contributions
ww+0.787
dd+0.651
www+0.621
lish+0.617
P+0.597
Neg logit contributions
 search-0.421
 bed-0.416
 call-0.402
 lap-0.401
 bank-0.396
Token mother
Token ID2802
Feature activation+6.58e-01

Pos logit contributions
 search+3.330
 bank+3.253
 lap+3.120
 call+3.007
 find+2.938
Neg logit contributions
ww-6.323
dd-5.078
lish-5.008
www-4.879
a-4.746
Token entered
Token ID5982
Feature activation+1.20e+00

Pos logit contributions
 bed+1.023
 search+1.014
 lap+1.009
 bank+0.992
 eye+0.967
Neg logit contributions
ww-1.647
dd-1.367
lish-1.304
www-1.284
ha-1.267
TokenTwe
Token ID32665
Feature activation+4.54e-01

Pos logit contributions
ww+4.181
dd+3.333
bly+3.093
lish+3.090
iest+3.069
Neg logit contributions
 bank-4.446
 search-4.351
 call-4.199
 find-4.081
 lap-4.079
Tokenety
Token ID2963
Feature activation+1.79e-01

Pos logit contributions
 bed+0.801
 search+0.789
 call+0.762
 lap+0.761
 bank+0.744
Neg logit contributions
ww-1.104
dd-0.889
www-0.844
lish-0.829
P-0.820
Token flew
Token ID13112
Feature activation-2.10e-01

Pos logit contributions
 bed+0.246
 search+0.244
 lap+0.238
 call+0.232
 eye+0.229
Neg logit contributions
ww-0.490
dd-0.409
www-0.390
lish-0.386
ha-0.378
Token to
Token ID284
Feature activation+2.35e+00

Pos logit contributions
ww+0.594
dd+0.496
www+0.474
lish+0.471
P+0.456
Neg logit contributions
 search-0.278
 bed-0.275
 lap-0.265
 call-0.264
 bank-0.261
Token Jerry
Token ID13075
Feature activation+2.29e+00

Pos logit contributions
 lap+4.364
 eye+4.246
 backs+4.177
 driveway+4.150
 doorstep+4.065
Neg logit contributions
ww-5.226
dd-4.195
P-4.036
www-3.955
ha-3.944
Token's
Token ID338
Feature activation+4.10e+00

Pos logit contributions
 tears+3.713
 backs+3.675
 bed+3.648
 lap+3.617
 driveway+3.614
Neg logit contributions
ww-5.303
dd-4.424
"-4.148
a-4.116
P-4.080
Token house
Token ID2156
Feature activation+2.48e+00

Pos logit contributions
 call+5.168
 yell+4.889
 backs+4.838
 calls+4.745
 choke+4.697
Neg logit contributions
ww-11.252
"-8.987
a-8.902
P-8.805
www-8.408
Token and
Token ID290
Feature activation+8.72e-01

Pos logit contributions
 search+3.799
 backs+3.773
 bed+3.753
 call+3.718
 eye+3.652
Neg logit contributions
ww-5.878
dd-4.801
lish-4.674
"-4.640
www-4.594
Token knocked
Token ID13642
Feature activation+5.04e-01

Pos logit contributions
 search+1.193
 lap+1.185
 bed+1.174
 eye+1.170
 call+1.120
Neg logit contributions
ww-2.390
dd-1.990
www-1.902
lish-1.842
ha-1.842
Token on
Token ID319
Feature activation+1.39e+00

Pos logit contributions
 search+0.906
 bed+0.891
 lap+0.884
 call+0.878
 bank+0.872
Neg logit contributions
ww-1.182
dd-0.957
www-0.898
lish-0.880
P-0.872
Token the
Token ID262
Feature activation+2.53e+00

Pos logit contributions
 lap+2.261
 eye+2.242
 search+2.215
 bed+2.189
 call+2.181
Neg logit contributions
ww-3.397
dd-2.799
www-2.638
lish-2.605
ha-2.599
Token\n
Token ID198
Feature activation+9.74e-01

Pos logit contributions
 eye+3.267
 backs+3.257
 bed+3.210
 bank+3.158
 search+3.112
Neg logit contributions
ww-5.699
www-4.519
dd-4.479
P-4.322
ha-4.302
Token\n
Token ID198
Feature activation-1.55e+00

Pos logit contributions
 bank+1.275
 bed+1.263
 backs+1.258
 eye+1.239
 search+1.239
Neg logit contributions
ww-2.709
dd-2.274
www-2.160
ha-2.127
P-2.122
TokenAs
Token ID1722
Feature activation-6.74e-01

Pos logit contributions
ww+2.936
dd+2.258
lish+2.110
www+2.036
bly+1.996
Neg logit contributions
 search-3.609
 bed-3.461
 call-3.460
 lap-3.456
 bank-3.414
Token Tim
Token ID5045
Feature activation+2.16e+00

Pos logit contributions
ww+1.681
dd+1.362
lish+1.303
www+1.300
bly+1.242
Neg logit contributions
 search-1.140
 bed-1.135
 call-1.077
 lap-1.069
 bank-1.067
Token went
Token ID1816
Feature activation-3.36e-01

Pos logit contributions
 bed+3.830
 lap+3.800
 eyes+3.598
 tears+3.588
 backs+3.549
Neg logit contributions
ww-5.200
dd-4.273
www-3.976
ha-3.969
P-3.952
Token to
Token ID284
Feature activation+4.09e+00

Pos logit contributions
ww+0.969
dd+0.813
www+0.778
lish+0.773
P+0.748
Neg logit contributions
 search-0.427
 bed-0.422
 lap-0.404
 call-0.403
 bank-0.397
Token the
Token ID262
Feature activation+1.81e+00

Pos logit contributions
anks+8.136
 lap+7.807
 backs+7.748
 hood+7.703
 eyes+7.616
Neg logit contributions
ww-8.757
 you-6.805
 oh-6.764
"-6.598
www-6.597
Token kitchen
Token ID9592
Feature activation+8.94e-01

Pos logit contributions
 lap+2.190
 search+2.109
 backs+2.102
 eye+2.033
 tears+2.032
Neg logit contributions
ww-5.223
P-4.223
www-4.185
dd-4.118
lish-4.117
Token,
Token ID11
Feature activation-6.06e-01

Pos logit contributions
 search+1.544
 lap+1.535
 backs+1.502
 call+1.500
 bed+1.498
Neg logit contributions
ww-2.072
dd-1.640
www-1.564
lish-1.558
iest-1.513
Token he
Token ID339
Feature activation+8.43e-01

Pos logit contributions
ww+1.319
dd+1.027
www+0.975
lish+0.962
P+0.916
Neg logit contributions
 search-1.214
 bed-1.208
 call-1.165
 lap-1.159
 bank-1.156
Token saw
Token ID2497
Feature activation+5.63e-01

Pos logit contributions
 lap+1.320
 bed+1.315
 search+1.289
 backs+1.248
 call+1.247
Neg logit contributions
ww-2.091
dd-1.755
lish-1.630
P-1.629
www-1.624
Token herself
Token ID5223
Feature activation+1.59e-01

Pos logit contributions
 bank+2.029
 lap+2.014
 search+1.997
 eye+1.972
 call+1.962
Neg logit contributions
ww-2.738
dd-2.298
lish-2.130
www-2.102
P-2.063
Token safe
Token ID3338
Feature activation+4.98e-01

Pos logit contributions
 search+0.248
 bed+0.243
 call+0.239
 lap+0.239
 bank+0.235
Neg logit contributions
ww-0.400
dd-0.328
lish-0.310
www-0.310
bly-0.298
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
 search+0.557
 bed+0.535
 lap+0.532
 call+0.532
 bank+0.516
Neg logit contributions
ww-1.468
dd-1.252
lish-1.209
www-1.188
iest-1.169
Token She
Token ID1375
Feature activation+8.56e-01

Pos logit contributions
 bank+2.660
 search+2.597
 eye+2.553
 call+2.548
 bed+2.529
Neg logit contributions
ww-5.109
www-4.100
dd-4.033
ha-3.923
P-3.905
Token went
Token ID1816
Feature activation+1.21e+00

Pos logit contributions
 bed+1.308
 search+1.282
 lap+1.271
 call+1.211
 bank+1.200
Neg logit contributions
ww-2.231
dd-1.878
www-1.762
lish-1.735
ha-1.724
Token to
Token ID284
Feature activation+4.08e+00

Pos logit contributions
 search+1.603
 backs+1.588
 eye+1.585
 bed+1.581
 eyes+1.581
Neg logit contributions
ww-3.249
dd-2.696
www-2.594
lish-2.565
P-2.557
Token ask
Token ID1265
Feature activation+1.36e+00

Pos logit contributions
anks+8.615
 bank+8.546
ses+8.314
 backs+8.249
stick+7.998
Neg logit contributions
ww-8.062
 oh-6.917
 you-6.679
"-6.389
www-6.245
Token her
Token ID607
Feature activation+8.10e-01

Pos logit contributions
 bank+2.156
 search+2.100
 driveway+2.065
 lap+2.046
 backs+2.006
Neg logit contributions
ww-3.322
dd-2.787
lish-2.694
www-2.589
bly-2.559
Token mum
Token ID25682
Feature activation+5.34e-01

Pos logit contributions
 search+0.968
 bank+0.943
 lap+0.938
 bed+0.921
 backs+0.893
Neg logit contributions
ww-2.341
dd-1.947
lish-1.896
www-1.873
P-1.831
Token for
Token ID329
Feature activation+3.36e-01

Pos logit contributions
 search+0.833
 bed+0.823
 lap+0.812
 bank+0.798
 call+0.787
Neg logit contributions
ww-1.328
dd-1.081
lish-1.043
www-1.028
ha-0.996
Token a
Token ID257
Feature activation+2.60e-01

Pos logit contributions
 search+0.344
 bank+0.334
 lap+0.332
 bed+0.332
 eye+0.322
Neg logit contributions
ww-1.031
dd-0.883
www-0.841
lish-0.840
P-0.823
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
ww+0.892
dd+0.756
www+0.726
lish+0.722
P+0.701
Neg logit contributions
 search-0.318
 bed-0.313
 lap-0.298
 call-0.298
 bank-0.292
Token do
Token ID466
Feature activation-8.42e-01

Pos logit contributions
 bank+2.449
 lap+2.446
 backs+2.343
 driveway+2.335
 bed+2.327
Neg logit contributions
ww-3.757
dd-3.056
www-2.923
iest-2.905
w-2.856
Token.
Token ID13
Feature activation+2.18e+00

Pos logit contributions
ww+2.783
dd+2.375
www+2.294
lish+2.257
ha+2.209
Neg logit contributions
 search-0.754
 bed-0.731
 call-0.679
 lap-0.671
 bank-0.644
Token She
Token ID1375
Feature activation+2.00e+00

Pos logit contributions
 search+3.143
 bed+3.063
 bank+3.046
 lap+2.967
 call+2.962
Neg logit contributions
ww-5.837
lish-4.719
www-4.694
dd-4.530
ha-4.361
Token went
Token ID1816
Feature activation+2.19e+00

Pos logit contributions
 lap+3.804
 bed+3.742
 search+3.512
 driveway+3.441
anks+3.384
Neg logit contributions
ww-4.641
dd-3.911
lish-3.583
www-3.571
ha-3.571
Token to
Token ID284
Feature activation+4.08e+00

Pos logit contributions
 backs+3.288
 lap+3.219
 eyes+3.171
 bank+3.168
 eye+3.114
Neg logit contributions
ww-5.434
dd-4.423
P-4.315
lish-4.251
www-4.226
Token her
Token ID607
Feature activation+2.42e+00

Pos logit contributions
 bank+9.553
 backs+8.881
 driveway+8.754
 lap+8.677
 banks+8.564
Neg logit contributions
ww-7.715
 oh-6.480
 you-6.232
www-5.629
"-5.626
Token mom
Token ID1995
Feature activation+2.16e+00

Pos logit contributions
 bank+3.681
 backs+3.473
 lap+3.418
 search+3.405
 call+3.351
Neg logit contributions
ww-6.123
P-4.834
a-4.756
lish-4.732
dd-4.688
Tokenmy
Token ID1820
Feature activation+2.31e+00

Pos logit contributions
 bank+3.182
 backs+3.181
ses+3.137
 tears+3.078
 lap+3.016
Neg logit contributions
ww-5.327
lish-4.330
dd-4.298
P-4.294
"-4.275
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
 backs+3.248
ses+3.214
 bank+3.193
 lap+3.126
 tears+3.116
Neg logit contributions
ww-5.859
dd-4.887
lish-4.763
"-4.727
P-4.636
Token told
Token ID1297
Feature activation+1.72e+00

Pos logit contributions
 lap+1.633
 bank+1.629
 search+1.617
 bed+1.599
 eye+1.578
Neg logit contributions
ww-3.118
dd-2.603
www-2.473
P-2.438
lish-2.412
Token the
Token ID262
Feature activation+1.44e+00

Pos logit contributions
 search+2.948
 call+2.868
 lap+2.862
 driveway+2.808
 eye+2.798
Neg logit contributions
ww-4.366
dd-3.531
www-3.367
P-3.363
lish-3.360
Token table
Token ID3084
Feature activation+1.45e+00

Pos logit contributions
 search+1.502
 call+1.375
 lap+1.306
 bank+1.289
 bed+1.260
Neg logit contributions
ww-4.431
dd-3.672
www-3.621
lish-3.548
P-3.500
Token.
Token ID13
Feature activation+1.99e+00

Pos logit contributions
 search+1.810
 call+1.762
 bed+1.712
 backs+1.670
 bank+1.664
Neg logit contributions
ww-4.000
dd-3.334
lish-3.244
iest-3.197
www-3.185
Token She
Token ID1375
Feature activation+1.65e+00

Pos logit contributions
 bank+2.974
 search+2.949
 call+2.801
 bed+2.789
 eye+2.759
Neg logit contributions
ww-5.347
www-4.270
dd-4.224
ha-3.985
"-3.973
Token went
Token ID1816
Feature activation+1.04e+00

Pos logit contributions
 search+2.564
 bed+2.554
 lap+2.539
 call+2.449
 eye+2.436
Neg logit contributions
ww-4.152
dd-3.504
www-3.233
P-3.219
a-3.187
Token to
Token ID284
Feature activation+4.07e+00

Pos logit contributions
 search+1.338
 eye+1.288
 backs+1.285
 bank+1.283
 call+1.274
Neg logit contributions
ww-2.827
dd-2.349
lish-2.260
www-2.257
ha-2.207
Token play
Token ID711
Feature activation+7.83e-02

Pos logit contributions
 bank+9.048
 backs+8.854
 driveway+8.593
 hood+8.557
 eye+8.421
Neg logit contributions
ww-7.598
 you-6.433
 oh-6.347
"-5.965
www-5.691
Token with
Token ID351
Feature activation+1.18e+00

Pos logit contributions
 search+0.137
 bed+0.135
 call+0.132
 lap+0.132
 bank+0.131
Neg logit contributions
ww-0.185
dd-0.149
lish-0.141
www-0.141
P-0.137
Token her
Token ID607
Feature activation+1.68e+00

Pos logit contributions
 search+1.925
 bank+1.910
 lap+1.906
 driveway+1.881
 call+1.872
Neg logit contributions
ww-2.881
dd-2.376
www-2.231
lish-2.223
P-2.192
Token toys
Token ID14958
Feature activation+1.67e+00

Pos logit contributions
 search+2.636
 call+2.615
 bank+2.582
 lap+2.577
 driveway+2.543
Neg logit contributions
ww-4.091
dd-3.279
lish-3.258
P-3.138
www-3.127
Token.
Token ID13
Feature activation+1.71e+00

Pos logit contributions
 search+2.482
 lap+2.409
 call+2.387
 backs+2.365
 bed+2.348
Neg logit contributions
ww-4.212
dd-3.509
lish-3.385
iest-3.297
P-3.275
Token finished
Token ID5201
Feature activation+1.58e+00

Pos logit contributions
 bed+1.961
 search+1.924
 lap+1.886
 call+1.830
 bank+1.800
Neg logit contributions
ww-3.007
dd-2.476
www-2.333
lish-2.322
ha-2.278
Token her
Token ID607
Feature activation+2.01e+00

Pos logit contributions
 eye+2.574
 search+2.569
 lap+2.530
 bank+2.518
 driveway+2.508
Neg logit contributions
ww-3.935
dd-3.320
www-3.042
lish-3.030
P-2.979
Token sandwich
Token ID20433
Feature activation+1.60e+00

Pos logit contributions
 driveway+3.049
 bank+3.042
 backs+2.997
 search+2.936
 lap+2.917
Neg logit contributions
ww-5.281
dd-4.198
lish-4.030
P-4.004
www-3.951
Token and
Token ID290
Feature activation+9.36e-01

Pos logit contributions
 search+2.194
 backs+2.111
 call+2.098
 bank+2.092
 driveway+2.029
Neg logit contributions
ww-4.365
dd-3.631
lish-3.527
www-3.424
iest-3.419
Token went
Token ID1816
Feature activation+2.04e+00

Pos logit contributions
 search+1.418
 lap+1.376
 call+1.336
 bank+1.330
 bed+1.326
Neg logit contributions
ww-2.455
dd-2.036
www-1.911
lish-1.875
P-1.839
Token to
Token ID284
Feature activation+4.07e+00

Pos logit contributions
 backs+2.957
 search+2.829
 eye+2.821
 bank+2.807
 eyes+2.771
Neg logit contributions
ww-5.398
dd-4.401
P-4.268
www-4.191
lish-4.153
Token play
Token ID711
Feature activation+7.72e-01

Pos logit contributions
 backs+7.586
 bank+7.537
 eye+7.123
 driveway+7.032
 hood+7.009
Neg logit contributions
ww-9.677
 you-7.819
 oh-7.447
www-7.315
"-7.268
Token outside
Token ID2354
Feature activation+1.86e-01

Pos logit contributions
 search+1.366
 backs+1.326
 bank+1.325
 call+1.324
 bed+1.320
Neg logit contributions
ww-1.756
dd-1.432
lish-1.381
P-1.347
www-1.317
Token.
Token ID13
Feature activation+1.85e+00

Pos logit contributions
 search+0.230
 bed+0.225
 call+0.220
 lap+0.219
 bank+0.219
Neg logit contributions
ww-0.528
dd-0.444
lish-0.426
www-0.424
P-0.411
Token She
Token ID1375
Feature activation+9.91e-01

Pos logit contributions
 bank+2.642
 search+2.604
 bed+2.414
 call+2.412
 eye+2.411
Neg logit contributions
ww-5.050
www-4.022
dd-3.939
lish-3.847
ha-3.813
Token saw
Token ID2497
Feature activation+5.70e-01

Pos logit contributions
 search+1.575
 lap+1.559
 bed+1.557
 call+1.523
 eye+1.509
Neg logit contributions
ww-2.466
dd-2.054
www-1.903
P-1.885
ha-1.873
Token help
Token ID1037
Feature activation+1.09e+00

Pos logit contributions
 lap+0.995
 bed+0.973
 search+0.951
 bank+0.945
 backs+0.925
Neg logit contributions
ww-2.373
dd-1.994
www-1.908
lish-1.890
iest-1.879
Token him
Token ID683
Feature activation+2.70e+00

Pos logit contributions
 lap+1.479
 search+1.470
 bed+1.468
 bank+1.426
 eye+1.415
Neg logit contributions
ww-2.955
dd-2.449
lish-2.379
www-2.344
iest-2.303
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
 lap+3.823
 driveway+3.748
 bed+3.731
 backs+3.702
 tears+3.680
Neg logit contributions
ww-6.948
a-5.711
dd-5.664
iest-5.615
lish-5.592
Token Tim
Token ID5045
Feature activation+1.55e+00

Pos logit contributions
 search+1.857
 bed+1.818
 eye+1.746
 bank+1.735
 call+1.723
Neg logit contributions
ww-3.902
dd-3.158
www-3.128
lish-3.019
ha-2.982
Token went
Token ID1816
Feature activation+1.63e+00

Pos logit contributions
 bed+2.648
 lap+2.587
 search+2.578
 call+2.402
 backs+2.390
Neg logit contributions
ww-3.723
dd-3.050
lish-2.897
www-2.864
ha-2.863
Token to
Token ID284
Feature activation+4.07e+00

Pos logit contributions
 backs+2.074
 lap+2.040
 eye+2.035
 search+2.032
 bank+1.967
Neg logit contributions
ww-4.382
dd-3.658
lish-3.531
www-3.491
P-3.474
Token bed
Token ID3996
Feature activation+1.11e+00

Pos logit contributions
 backs+8.630
 bank+8.174
 eye+7.988
stick+7.822
 driveway+7.799
Neg logit contributions
ww-7.934
 oh-6.932
 you-6.479
"-6.132
dd-6.026
Token,
Token ID11
Feature activation+2.06e+00

Pos logit contributions
 search+1.867
 eye+1.798
 call+1.774
 backs+1.764
 lap+1.764
Neg logit contributions
ww-2.648
dd-2.196
lish-2.082
www-2.024
bly-1.969
Token hoping
Token ID7725
Feature activation+1.29e+00

Pos logit contributions
 bank+3.312
 eye+3.266
 backs+3.258
 search+3.246
 bed+3.196
Neg logit contributions
ww-4.806
dd-3.967
www-3.672
lish-3.653
 oh-3.596
Token that
Token ID326
Feature activation+2.18e-04

Pos logit contributions
 search+2.306
 bank+2.292
 lap+2.267
 backs+2.254
 call+2.200
Neg logit contributions
ww-2.907
dd-2.328
www-2.163
iest-2.159
lish-2.158
Token maybe
Token ID3863
Feature activation-7.94e-01

Pos logit contributions
 search+0.000
 bed+0.000
 lap+0.000
 call+0.000
 bank+0.000
Neg logit contributions
ww-0.001
dd-0.000
www-0.000
lish-0.000
ha-0.000
Token with
Token ID351
Feature activation+2.02e+00

Pos logit contributions
 search+1.635
 bed+1.607
 lap+1.594
 call+1.569
 bank+1.545
Neg logit contributions
ww-2.548
dd-2.037
lish-2.008
www-1.948
iest-1.946
Token Max
Token ID5436
Feature activation+1.09e+00

Pos logit contributions
 lap+3.357
 search+3.349
 driveway+3.259
 bank+3.249
 doorstep+3.212
Neg logit contributions
ww-4.970
dd-4.077
lish-3.849
www-3.765
bly-3.716
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
 search+1.320
 bed+1.272
 lap+1.260
 driveway+1.238
 call+1.220
Neg logit contributions
ww-3.110
dd-2.609
lish-2.589
www-2.469
a-2.420
Token She
Token ID1375
Feature activation+1.81e+00

Pos logit contributions
 bank+2.146
 search+2.089
 bed+2.001
 eye+1.999
 lap+1.983
Neg logit contributions
ww-4.180
dd-3.363
www-3.270
lish-3.248
P-3.213
Token went
Token ID1816
Feature activation+1.87e+00

Pos logit contributions
 lap+3.361
 search+3.334
 bed+3.306
 driveway+3.078
 call+3.071
Neg logit contributions
ww-4.139
dd-3.406
P-3.206
lish-3.148
ha-3.098
Token to
Token ID284
Feature activation+4.06e+00

Pos logit contributions
 search+2.502
 backs+2.429
 lap+2.426
 eye+2.408
 bed+2.363
Neg logit contributions
ww-4.908
dd-4.040
P-3.940
lish-3.936
www-3.838
Token Max
Token ID5436
Feature activation+2.78e+00

Pos logit contributions
 bank+8.818
 eye+8.746
 backs+8.307
 trunk+8.048
 banks+8.039
Neg logit contributions
ww-7.709
 you-6.769
 oh-6.500
"-6.360
dd-5.702
Token and
Token ID290
Feature activation+1.94e+00

Pos logit contributions
 eyes+4.603
 eye+4.588
 backs+4.553
 driveway+4.539
ses+4.521
Neg logit contributions
ww-6.140
"-5.093
dd-5.051
a-5.016
lish-4.992
Token said
Token ID531
Feature activation-2.03e+00

Pos logit contributions
 bed+2.914
 search+2.892
 lap+2.871
 eye+2.869
 bank+2.804
Neg logit contributions
ww-5.026
dd-4.047
www-3.942
P-3.875
ha-3.859
Token sorry
Token ID7926
Feature activation+2.26e-01

Pos logit contributions
ww+5.238
dd+4.164
www+4.148
lish+3.976
P+3.847
Neg logit contributions
 bed-3.448
 eyes-3.337
 tears-3.325
 call-3.284
 search-3.271
Token.
Token ID13
Feature activation+1.61e+00

Pos logit contributions
 search+0.257
 bed+0.249
 lap+0.242
 bank+0.240
 call+0.237
Neg logit contributions
ww-0.679
dd-0.572
lish-0.552
www-0.548
ha-0.530
Token
Token ID198
Feature activation+5.04e-01

Pos logit contributions
 search+0.866
 bank+0.848
 bed+0.835
 eye+0.815
 call+0.808
Neg logit contributions
ww-1.910
dd-1.576
www-1.524
lish-1.513
P-1.488
Token
Token ID198
Feature activation-2.34e+00

Pos logit contributions
 search+0.629
 bank+0.610
 bed+0.604
 call+0.591
 lap+0.587
Neg logit contributions
ww-1.452
dd-1.216
www-1.159
lish-1.149
P-1.137
TokenThey
Token ID2990
Feature activation+2.11e+00

Pos logit contributions
ww+4.020
dd+3.152
iest+2.773
lish+2.734
www+2.720
Neg logit contributions
 bed-5.870
 lap-5.865
 search-5.805
 call-5.675
 driveway-5.557
Token all
Token ID477
Feature activation+1.96e+00

Pos logit contributions
 lap+2.167
 search+2.104
 eyes+2.065
 bank+2.060
 bed+2.025
Neg logit contributions
ww-6.338
dd-5.422
P-5.123
www-5.118
ha-5.104
Token go
Token ID467
Feature activation+9.32e-01

Pos logit contributions
 bank+1.998
 lap+1.991
 search+1.989
 doorstep+1.980
 eyes+1.970
Neg logit contributions
ww-6.001
dd-5.045
www-4.817
lish-4.777
ha-4.718
Token to
Token ID284
Feature activation+4.05e+00

Pos logit contributions
 search+1.235
 bed+1.224
 tears+1.201
 lap+1.194
 eye+1.193
Neg logit contributions
ww-2.536
dd-2.103
www-1.999
lish-1.987
P-1.968
Token the
Token ID262
Feature activation+4.68e-01

Pos logit contributions
 eye+8.910
anks+8.892
 bank+8.770
 backs+8.763
stick+8.743
Neg logit contributions
ww-7.422
 oh-5.678
dd-5.481
 you-5.344
"-5.317
Token car
Token ID1097
Feature activation+2.89e+00

Pos logit contributions
 search+0.516
 lap+0.497
 bed+0.494
 call+0.479
 eye+0.476
Neg logit contributions
ww-1.409
dd-1.176
www-1.145
lish-1.141
P-1.124
Token and
Token ID290
Feature activation+2.36e+00

Pos logit contributions
 tears+4.748
 backs+4.624
 eyes+4.559
 search+4.432
 cries+4.411
Neg logit contributions
ww-6.745
"-5.444
a-5.443
iest-5.367
dd-5.361
Token drive
Token ID3708
Feature activation+1.17e-01

Pos logit contributions
 eyes+2.510
 eye+2.461
 lap+2.454
 doorstep+2.452
 search+2.439
Neg logit contributions
ww-6.801
dd-5.608
www-5.461
ha-5.346
P-5.265
Token to
Token ID284
Feature activation+2.77e+00

Pos logit contributions
 search+0.162
 bed+0.160
 call+0.155
 lap+0.154
 bank+0.152
Neg logit contributions
ww-0.320
dd-0.266
www-0.253
lish-0.253
ha-0.245
Token sad
Token ID6507
Feature activation+6.05e-01

Pos logit contributions
ww+1.272
dd+1.037
www+0.987
lish+0.976
P+0.945
Neg logit contributions
 search-0.807
 bed-0.797
 call-0.777
 lap-0.770
 bank-0.762
Token and
Token ID290
Feature activation+7.78e-02

Pos logit contributions
 search+0.709
 bed+0.683
 lap+0.678
 bank+0.670
 call+0.659
Neg logit contributions
ww-1.790
dd-1.501
lish-1.457
www-1.440
iest-1.401
Token lonely
Token ID21757
Feature activation+6.64e-01

Pos logit contributions
 search+0.120
 bed+0.119
 lap+0.116
 call+0.114
 bank+0.114
Neg logit contributions
ww-0.200
dd-0.165
www-0.157
lish-0.156
P-0.149
Token.
Token ID13
Feature activation-9.06e-01

Pos logit contributions
 search+0.751
 bed+0.724
 lap+0.720
 call+0.708
 bank+0.703
Neg logit contributions
ww-1.962
dd-1.654
lish-1.614
www-1.583
iest-1.558
Token
Token ID198
Feature activation-1.81e+00

Pos logit contributions
ww+2.890
dd+2.459
www+2.363
lish+2.341
iest+2.307
Neg logit contributions
 search-0.916
 bed-0.903
 lap-0.857
 call-0.835
 bank-0.819
Token
Token ID198
Feature activation-4.74e+00

Pos logit contributions
ww+6.231
dd+5.354
lish+5.182
www+5.154
iest+5.043
Neg logit contributions
 search-1.573
 bed-1.452
 lap-1.379
 call-1.356
 find-1.255
Token"
Token ID1
Feature activation-2.22e+00

Pos logit contributions
ww+9.538
iest+7.198
dd+7.013
reetings+6.401
www+6.392
Neg logit contributions
 search-13.247
 find-12.448
 bank-12.220
 call-12.202
 lap-12.156
TokenHello
Token ID15496
Feature activation-2.22e+00

Pos logit contributions
ww+4.983
dd+3.899
www+3.703
lish+3.561
iest+3.552
Neg logit contributions
 search-4.625
 call-4.449
 lap-4.364
 bed-4.329
 bank-4.270
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
ww+6.212
dd+5.469
bly+5.117
lish+5.080
www+4.968
Neg logit contributions
 search-3.197
 bed-3.089
 call-3.037
 eyes-2.991
 eye-2.888
Token elephant
Token ID20950
Feature activation-1.84e+00

Pos logit contributions
ww+3.734
dd+3.258
lish+3.085
www+3.030
bly+2.963
Neg logit contributions
 call-1.438
 search-1.424
 bed-1.416
 bank-1.308
 eye-1.307
Token!"
Token ID2474
Feature activation-3.93e-01

Pos logit contributions
ww+4.707
dd+3.961
lish+3.679
bly+3.647
www+3.632
Neg logit contributions
 search-2.972
 bed-2.961
 call-2.946
 bank-2.861
 lap-2.824
Token what
Token ID644
Feature activation-3.44e-01

Pos logit contributions
ww+0.837
dd+0.671
www+0.635
lish+0.629
P+0.606
Neg logit contributions
 search-0.634
 bed-0.630
 call-0.611
 lap-0.611
 bank-0.603
Token is
Token ID318
Feature activation-3.73e-01

Pos logit contributions
ww+0.883
dd+0.724
www+0.687
lish+0.685
P+0.659
Neg logit contributions
 search-0.542
 bed-0.535
 lap-0.517
 call-0.517
 bank-0.510
Token inside
Token ID2641
Feature activation-4.82e-01

Pos logit contributions
ww+1.004
dd+0.829
www+0.791
lish+0.787
P+0.760
Neg logit contributions
 search-0.544
 bed-0.537
 call-0.518
 lap-0.517
 bank-0.512
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
ww+1.511
dd+1.285
www+1.235
lish+1.226
P+1.197
Neg logit contributions
 search-0.492
 bed-0.488
 call-0.460
 lap-0.458
 bank-0.447
Token
Token ID198
Feature activation-9.58e-01

Pos logit contributions
 search+0.200
 bed+0.198
 call+0.190
 lap+0.188
 bank+0.187
Neg logit contributions
ww-0.555
dd-0.469
www-0.451
lish-0.449
P-0.439
Token
Token ID198
Feature activation-4.70e+00

Pos logit contributions
ww+3.073
dd+2.627
lish+2.536
www+2.518
bly+2.438
Neg logit contributions
 search-0.939
 bed-0.911
 lap-0.869
 call-0.857
 bank-0.793
Token"
Token ID1
Feature activation-1.30e+00

Pos logit contributions
ww+9.455
dd+7.471
iest+7.221
reetings+6.857
w+6.489
Neg logit contributions
 search-12.302
 lap-11.875
 call-11.702
 play-11.437
 find-11.420
TokenLet
Token ID5756
Feature activation-1.39e+00

Pos logit contributions
ww+2.885
dd+2.299
lish+2.137
www+2.129
iest+2.052
Neg logit contributions
 search-2.571
 bed-2.474
 lap-2.456
 call-2.453
 bank-2.409
Token's
Token ID338
Feature activation+1.83e+00

Pos logit contributions
ww+4.059
dd+3.459
lish+3.315
www+3.263
bly+3.245
Neg logit contributions
 search-1.774
 bed-1.742
 call-1.725
 bank-1.640
 find-1.615
Token lift
Token ID10303
Feature activation-6.22e-01

Pos logit contributions
 lap+2.041
 tears+1.960
 cries+1.910
 bank+1.887
 bed+1.885
Neg logit contributions
ww-5.423
dd-4.502
www-4.464
P-4.313
bly-4.227
Token the
Token ID262
Feature activation+1.97e-01

Pos logit contributions
ww+1.702
dd+1.412
www+1.348
lish+1.340
P+1.297
Neg logit contributions
 search-0.890
 bed-0.886
 call-0.843
 lap-0.838
 bank-0.834
Token like
Token ID588
Feature activation+2.46e-01

Pos logit contributions
 bed+1.273
 lap+1.241
 search+1.234
 bank+1.205
 doorstep+1.165
Neg logit contributions
ww-2.739
dd-2.298
www-2.196
lish-2.152
ha-2.129
Token their
Token ID511
Feature activation+1.39e+00

Pos logit contributions
 search+0.345
 bed+0.338
 lap+0.331
 bank+0.326
 call+0.326
Neg logit contributions
ww-0.665
dd-0.549
www-0.524
lish-0.519
ha-0.507
Token uniform
Token ID8187
Feature activation+1.11e+00

Pos logit contributions
 search+1.914
 bank+1.869
 bed+1.833
 lap+1.821
 call+1.795
Neg logit contributions
ww-3.712
dd-2.960
www-2.898
lish-2.885
iest-2.849
Token.
Token ID13
Feature activation+6.56e-01

Pos logit contributions
 search+1.361
 bed+1.320
 lap+1.298
 tears+1.261
 bank+1.258
Neg logit contributions
ww-3.174
dd-2.572
www-2.524
lish-2.509
iest-2.490
Token
Token ID198
Feature activation-1.03e+00

Pos logit contributions
 search+0.763
 bed+0.736
 bank+0.731
 call+0.719
 eye+0.710
Neg logit contributions
ww-1.936
dd-1.620
lish-1.560
www-1.555
P-1.529
Token
Token ID198
Feature activation-4.70e+00

Pos logit contributions
ww+3.308
dd+2.821
lish+2.739
www+2.719
iest+2.630
Neg logit contributions
 search-0.998
 bed-0.968
 lap-0.915
 call-0.905
 bank-0.845
TokenOne
Token ID3198
Feature activation-1.29e+00

Pos logit contributions
ww+12.342
iest+10.150
dd+9.629
reetings+9.390
www+9.334
Neg logit contributions
 search-9.623
 call-8.866
 find-8.762
 bank-8.652
 play-8.603
Token day
Token ID1110
Feature activation-3.17e+00

Pos logit contributions
ww+3.190
dd+2.683
lish+2.490
www+2.478
bly+2.416
Neg logit contributions
 bed-2.244
 search-2.239
 eye-2.172
 call-2.158
 lap-2.128
Token,
Token ID11
Feature activation-2.24e+00

Pos logit contributions
ww+10.461
www+8.836
dd+8.805
lish+8.580
bly+8.528
Neg logit contributions
 bed-3.798
 search-3.484
 eyes-3.380
 call-3.238
 find-2.993
Token Lily
Token ID20037
Feature activation+5.49e-01

Pos logit contributions
ww+6.156
dd+5.062
www+4.910
lish+4.855
reetings+4.811
Neg logit contributions
 bed-3.722
 search-3.516
 call-3.379
 eyes-3.343
 tears-3.240
Token and
Token ID290
Feature activation+4.26e-02

Pos logit contributions
 search+0.632
 bed+0.629
 lap+0.611
 bank+0.594
 call+0.591
Neg logit contributions
ww-1.612
dd-1.352
www-1.304
lish-1.274
P-1.269
Token make
Token ID787
Feature activation+8.78e-01

Pos logit contributions
 lap+2.118
 cries+2.097
 tears+2.054
 eyes+2.051
 bank+2.035
Neg logit contributions
ww-6.460
www-5.388
dd-5.320
ha-5.223
P-5.209
Token a
Token ID257
Feature activation+5.56e-01

Pos logit contributions
 search+1.525
 lap+1.522
 bed+1.466
 call+1.459
 bank+1.454
Neg logit contributions
ww-2.042
dd-1.650
lish-1.554
www-1.537
bly-1.501
Token fort
Token ID6285
Feature activation+8.35e-01

Pos logit contributions
 search+0.748
 lap+0.726
 bed+0.713
 bank+0.702
 backs+0.699
Neg logit contributions
ww-1.543
dd-1.244
www-1.221
lish-1.192
ha-1.172
Token.
Token ID13
Feature activation-5.22e-02

Pos logit contributions
 search+1.223
 lap+1.214
 bed+1.211
 bank+1.194
 backs+1.183
Neg logit contributions
ww-2.142
dd-1.728
www-1.667
lish-1.647
iest-1.645
Token
Token ID198
Feature activation-1.47e+00

Pos logit contributions
ww+0.159
dd+0.135
www+0.130
lish+0.129
P+0.126
Neg logit contributions
 search-0.057
 bed-0.056
 lap-0.053
 call-0.053
 bank-0.053
Token
Token ID198
Feature activation-4.69e+00

Pos logit contributions
ww+4.898
dd+4.199
lish+4.079
www+4.046
iest+3.911
Neg logit contributions
 search-1.314
 bed-1.237
 lap-1.180
 call-1.168
 find-1.060
Token"
Token ID1
Feature activation-1.81e+00

Pos logit contributions
ww+11.133
iest+8.916
dd+8.831
reetings+8.166
www+8.115
Neg logit contributions
 search-10.890
 call-10.185
 find-10.121
 lap-10.040
 play-9.913
TokenLet
Token ID5756
Feature activation-1.72e+00

Pos logit contributions
ww+4.321
dd+3.468
lish+3.263
www+3.256
iest+3.166
Neg logit contributions
 search-3.358
 lap-3.186
 call-3.169
 bed-3.168
 bank-3.122
Token's
Token ID338
Feature activation+1.36e+00

Pos logit contributions
ww+5.134
dd+4.413
lish+4.238
bly+4.181
www+4.146
Neg logit contributions
 search-2.127
 bed-2.060
 call-2.047
 find-1.946
 bank-1.945
Token use
Token ID779
Feature activation-4.87e-02

Pos logit contributions
 lap+1.321
 tears+1.260
 bed+1.231
 cries+1.229
 bank+1.188
Neg logit contributions
ww-4.182
dd-3.501
www-3.437
P-3.356
bly-3.338
Token this
Token ID428
Feature activation-3.70e-01

Pos logit contributions
ww+0.118
dd+0.095
www+0.090
lish+0.089
ha+0.086
Neg logit contributions
 search-0.082
 bed-0.081
 lap-0.080
 call-0.079
 bank-0.079
Token at
Token ID379
Feature activation+1.36e+00

Pos logit contributions
ww+1.681
dd+1.363
www+1.302
lish+1.286
P+1.250
Neg logit contributions
 search-1.050
 bed-1.037
 call-1.008
 lap-1.003
 bank-0.989
Token the
Token ID262
Feature activation+2.79e-01

Pos logit contributions
 lap+2.505
 bed+2.470
 search+2.466
 call+2.442
 doorstep+2.438
Neg logit contributions
ww-3.036
dd-2.437
www-2.255
bly-2.226
iest-2.218
Token ground
Token ID2323
Feature activation+6.33e-01

Pos logit contributions
 search+0.271
 bed+0.264
 lap+0.257
 call+0.251
 bank+0.244
Neg logit contributions
ww-0.884
dd-0.746
www-0.726
lish-0.719
ha-0.706
Token.
Token ID13
Feature activation+5.97e-01

Pos logit contributions
 search+0.706
 bed+0.699
 lap+0.677
 call+0.672
 bank+0.651
Neg logit contributions
ww-1.855
dd-1.571
lish-1.524
www-1.503
bly-1.462
Token
Token ID198
Feature activation-1.09e+00

Pos logit contributions
 search+0.653
 bed+0.641
 call+0.625
 eye+0.618
 lap+0.610
Neg logit contributions
ww-1.787
dd-1.510
www-1.451
lish-1.446
P-1.420
Token
Token ID198
Feature activation-4.69e+00

Pos logit contributions
ww+3.531
dd+3.010
lish+2.926
www+2.903
iest+2.819
Neg logit contributions
 search-1.036
 bed-1.002
 lap-0.952
 call-0.941
 bank-0.877
Token"
Token ID1
Feature activation-1.87e+00

Pos logit contributions
ww+10.069
iest+8.486
dd+7.545
reetings+7.035
www+7.008
Neg logit contributions
 search-12.042
 find-11.502
 lap-11.222
 call-11.197
 play-11.159
TokenLook
Token ID8567
Feature activation-1.64e+00

Pos logit contributions
ww+4.260
dd+3.483
lish+3.198
www+3.192
iest+3.159
Neg logit contributions
 search-3.618
 bed-3.464
 lap-3.459
 call-3.433
 bank-3.377
Token,
Token ID11
Feature activation-1.79e+00

Pos logit contributions
ww+4.509
dd+3.737
www+3.582
lish+3.546
iest+3.490
Neg logit contributions
 bed-2.421
 search-2.369
 call-2.271
 eyes-2.259
 eye-2.254
Token Lily
Token ID20037
Feature activation-7.62e-01

Pos logit contributions
ww+5.441
dd+4.758
lish+4.577
www+4.467
iest+4.410
Neg logit contributions
 bed-2.291
 search-2.178
 call-2.059
 eyes-2.010
 find-1.959
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
ww+1.998
dd+1.658
lish+1.571
www+1.562
bly+1.517
Neg logit contributions
 search-1.161
 bed-1.157
 call-1.121
 lap-1.107
 eye-1.100
Token one
Token ID530
Feature activation+7.94e-01

Pos logit contributions
 search+0.585
 lap+0.566
 bed+0.564
 bank+0.554
 call+0.547
Neg logit contributions
ww-1.077
dd-0.878
www-0.848
lish-0.840
iest-0.820
Token is
Token ID318
Feature activation+3.03e-01

Pos logit contributions
 bed+1.256
 bank+1.226
 search+1.223
 lap+1.215
 call+1.174
Neg logit contributions
ww-2.012
dd-1.607
www-1.575
iest-1.509
ha-1.501
Token there
Token ID612
Feature activation+2.78e-01

Pos logit contributions
 search+0.570
 bed+0.567
 lap+0.559
 bank+0.554
 call+0.550
Neg logit contributions
ww-0.665
dd-0.531
www-0.498
lish-0.493
ha-0.474
Token.
Token ID13
Feature activation+1.76e-01

Pos logit contributions
 search+0.293
 bed+0.288
 lap+0.280
 call+0.273
 bank+0.273
Neg logit contributions
ww-0.841
dd-0.714
www-0.685
lish-0.680
ha-0.666
Token
Token ID198
Feature activation-6.33e-01

Pos logit contributions
 search+0.203
 bed+0.198
 call+0.193
 lap+0.190
 bank+0.189
Neg logit contributions
ww-0.521
dd-0.439
www-0.421
lish-0.420
P-0.411
Token
Token ID198
Feature activation-4.67e+00

Pos logit contributions
ww+1.985
dd+1.692
lish+1.624
www+1.624
iest+1.567
Neg logit contributions
 search-0.653
 bed-0.637
 lap-0.609
 call-0.602
 bank-0.583
Token"
Token ID1
Feature activation-1.75e+00

Pos logit contributions
ww+9.589
iest+7.680
dd+7.294
reetings+6.980
www+6.611
Neg logit contributions
 search-11.949
 bank-11.521
 find-11.313
 lap-11.180
 call-11.091
TokenLet
Token ID5756
Feature activation-1.65e+00

Pos logit contributions
ww+3.829
dd+3.051
lish+2.794
www+2.793
iest+2.683
Neg logit contributions
 search-3.631
 bed-3.487
 call-3.443
 lap-3.426
 bank-3.385
Token's
Token ID338
Feature activation+1.59e+00

Pos logit contributions
ww+4.914
dd+4.213
lish+4.056
bly+4.010
www+3.970
Neg logit contributions
 search-2.031
 bed-1.968
 call-1.967
 bank-1.883
 find-1.854
Token enter
Token ID3802
Feature activation-7.91e-01

Pos logit contributions
 lap+1.845
 tears+1.732
 bed+1.709
 cries+1.705
 bank+1.685
Neg logit contributions
ww-4.610
dd-3.786
www-3.771
ha-3.594
P-3.590
Token the
Token ID262
Feature activation+4.23e-02

Pos logit contributions
ww+2.152
dd+1.781
lish+1.698
www+1.695
P+1.640
Neg logit contributions
 bed-1.166
 search-1.160
 lap-1.098
 call-1.097
 bank-1.076
Token colors
Token ID7577
Feature activation+1.63e-01

Pos logit contributions
 bed+1.495
 lap+1.477
 search+1.476
 call+1.470
 bank+1.411
Neg logit contributions
ww-2.755
dd-2.262
iest-2.196
lish-2.183
www-2.155
Token and
Token ID290
Feature activation+6.98e-01

Pos logit contributions
 search+0.174
 bed+0.169
 lap+0.163
 call+0.162
 bank+0.160
Neg logit contributions
ww-0.496
dd-0.420
www-0.403
lish-0.403
iest-0.395
Token shapes
Token ID15268
Feature activation+3.78e-02

Pos logit contributions
 search+0.980
 bed+0.970
 lap+0.961
 bank+0.939
 eye+0.935
Neg logit contributions
ww-1.862
dd-1.513
www-1.465
lish-1.461
iest-1.415
Token.
Token ID13
Feature activation-1.31e-02

Pos logit contributions
 search+0.039
 bed+0.038
 lap+0.037
 call+0.036
 bank+0.036
Neg logit contributions
ww-0.117
dd-0.099
www-0.095
lish-0.095
iest-0.092
Token
Token ID198
Feature activation-1.33e+00

Pos logit contributions
ww+0.040
dd+0.034
www+0.032
lish+0.032
P+0.032
Neg logit contributions
 search-0.014
 bed-0.014
 lap-0.013
 call-0.013
 bank-0.013
Token
Token ID198
Feature activation-4.66e+00

Pos logit contributions
ww+4.385
dd+3.746
lish+3.647
www+3.615
iest+3.511
Neg logit contributions
 search-1.226
 bed-1.177
 lap-1.111
 call-1.100
 find-1.005
Token"
Token ID1
Feature activation-1.50e+00

Pos logit contributions
ww+9.709
iest+7.799
dd+7.178
reetings+7.073
lish+6.722
Neg logit contributions
 search-12.004
 call-11.304
 find-11.287
 bed-11.264
 play-11.115
TokenLet
Token ID5756
Feature activation-1.46e+00

Pos logit contributions
ww+3.486
dd+2.803
www+2.616
lish+2.605
iest+2.500
Neg logit contributions
 search-2.845
 bed-2.717
 lap-2.713
 call-2.699
 bank-2.668
Token's
Token ID338
Feature activation+1.44e+00

Pos logit contributions
ww+4.301
dd+3.690
lish+3.519
www+3.473
bly+3.466
Neg logit contributions
 search-1.856
 bed-1.810
 call-1.773
 bank-1.706
 find-1.689
Token measure
Token ID3953
Feature activation-3.91e-01

Pos logit contributions
 lap+1.252
 bed+1.189
 tears+1.150
 bank+1.130
 eye+1.114
Neg logit contributions
ww-4.558
dd-3.811
www-3.778
P-3.708
bly-3.676
Token how
Token ID703
Feature activation-2.00e+00

Pos logit contributions
ww+0.989
dd+0.810
www+0.767
lish+0.762
P+0.735
Neg logit contributions
 search-0.628
 bed-0.622
 lap-0.601
 call-0.601
 bank-0.594
Token in
Token ID287
Feature activation-7.15e-01

Pos logit contributions
ww+2.332
dd+1.927
lish+1.847
www+1.843
P+1.780
Neg logit contributions
 search-1.187
 bed-1.166
 call-1.138
 bank-1.117
 lap-1.099
Token the
Token ID262
Feature activation+3.49e-01

Pos logit contributions
ww+1.917
dd+1.582
lish+1.513
www+1.502
P+1.452
Neg logit contributions
 search-1.084
 bed-1.061
 call-1.018
 lap-1.009
 bank-1.003
Token box
Token ID3091
Feature activation+1.36e-01

Pos logit contributions
 search+0.359
 bed+0.351
 lap+0.343
 call+0.337
 bank+0.331
Neg logit contributions
ww-1.087
dd-0.901
www-0.887
lish-0.871
P-0.865
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
 search+0.144
 bed+0.140
 lap+0.136
 call+0.133
 bank+0.132
Neg logit contributions
ww-0.417
dd-0.353
www-0.340
lish-0.338
iest-0.330
Token\n
Token ID198
Feature activation-1.10e+00

Pos logit contributions
ww+0.346
dd+0.293
www+0.281
lish+0.280
P+0.273
Neg logit contributions
 search-0.121
 bed-0.119
 lap-0.114
 call-0.113
 bank-0.112
Token\n
Token ID198
Feature activation-4.66e+00

Pos logit contributions
ww+3.584
dd+3.063
lish+2.968
www+2.944
iest+2.855
Neg logit contributions
 search-1.056
 bed-1.012
 lap-0.967
 call-0.962
 bank-0.885
Token"
Token ID1
Feature activation-1.53e+00

Pos logit contributions
ww+9.873
iest+7.947
dd+7.665
reetings+7.279
www+6.862
Neg logit contributions
 search-11.866
 call-11.364
 lap-11.122
 find-11.085
 play-10.924
TokenLet
Token ID5756
Feature activation-1.27e+00

Pos logit contributions
ww+3.400
dd+2.674
www+2.498
lish+2.483
iest+2.445
Neg logit contributions
 search-3.079
 call-2.970
 bed-2.926
 bank-2.920
 lap-2.919
Token's
Token ID338
Feature activation+1.72e+00

Pos logit contributions
ww+3.662
dd+3.111
lish+2.976
www+2.932
bly+2.908
Neg logit contributions
 search-1.662
 bed-1.617
 call-1.614
 bank-1.537
 lap-1.508
Token lift
Token ID10303
Feature activation-6.47e-01

Pos logit contributions
 lap+1.988
 bed+1.903
 tears+1.894
 cries+1.825
 eyes+1.783
Neg logit contributions
ww-5.037
dd-4.144
www-4.133
P-3.972
ha-3.916
Token the
Token ID262
Feature activation+7.55e-02

Pos logit contributions
ww+1.786
dd+1.486
www+1.418
lish+1.410
P+1.365
Neg logit contributions
 search-0.909
 bed-0.904
 call-0.862
 lap-0.856
 bank-0.852
Token are
Token ID389
Feature activation+1.55e-01

Pos logit contributions
 lap+3.672
 bed+3.648
 driveway+3.389
 search+3.385
 bank+3.318
Neg logit contributions
ww-4.228
dd-3.538
www-3.294
w-3.174
ha-3.111
Token too
Token ID1165
Feature activation-9.93e-01

Pos logit contributions
 search+0.263
 bed+0.261
 lap+0.255
 bank+0.253
 call+0.252
Neg logit contributions
ww-0.375
dd-0.304
www-0.287
lish-0.287
bly-0.273
Token high
Token ID1029
Feature activation-4.76e-01

Pos logit contributions
ww+2.479
dd+2.005
lish+1.931
www+1.906
iest+1.849
Neg logit contributions
 search-1.653
 bed-1.619
 lap-1.564
 call-1.552
 eye-1.534
Token.
Token ID13
Feature activation-2.30e-01

Pos logit contributions
ww+1.476
dd+1.250
www+1.206
lish+1.192
P+1.168
Neg logit contributions
 search-0.496
 bed-0.489
 call-0.464
 lap-0.463
 bank-0.459
Token\n
Token ID198
Feature activation-1.34e+00

Pos logit contributions
ww+0.708
dd+0.601
www+0.577
lish+0.574
P+0.558
Neg logit contributions
 search-0.243
 bed-0.239
 lap-0.228
 call-0.227
 bank-0.224
Token\n
Token ID198
Feature activation-4.64e+00

Pos logit contributions
ww+4.414
dd+3.792
lish+3.676
www+3.640
iest+3.552
Neg logit contributions
 search-1.224
 bed-1.171
 lap-1.124
 call-1.093
 bank-0.994
Token"
Token ID1
Feature activation-1.95e+00

Pos logit contributions
ww+9.211
iest+7.460
dd+6.938
reetings+6.530
www+6.178
Neg logit contributions
 search-12.392
 lap-11.961
 call-11.508
 bed-11.444
 find-11.364
TokenLet
Token ID5756
Feature activation-1.49e+00

Pos logit contributions
ww+4.352
dd+3.416
www+3.213
iest+3.159
lish+3.152
Neg logit contributions
 search-3.944
 lap-3.734
 bed-3.718
 call-3.710
 bank-3.695
Token's
Token ID338
Feature activation+1.55e+00

Pos logit contributions
ww+4.337
dd+3.713
lish+3.549
bly+3.494
www+3.488
Neg logit contributions
 search-1.898
 bed-1.844
 call-1.842
 bank-1.757
 find-1.717
Token climb
Token ID12080
Feature activation-1.05e+00

Pos logit contributions
 lap+1.723
 bed+1.696
 tears+1.632
 bank+1.625
 search+1.592
Neg logit contributions
ww-4.608
dd-3.790
www-3.783
P-3.592
ha-3.556
Token the
Token ID262
Feature activation+1.28e+00

Pos logit contributions
ww+2.928
dd+2.451
lish+2.343
www+2.334
P+2.241
Neg logit contributions
 search-1.460
 bed-1.440
 bank-1.379
 call-1.362
 lap-1.347
Token make
Token ID787
Feature activation+1.28e+00

Pos logit contributions
 lap+2.530
 cries+2.520
 tears+2.497
 eyes+2.459
 backs+2.407
Neg logit contributions
ww-7.202
www-6.023
dd-5.818
ha-5.780
P-5.738
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
 lap+2.347
 search+2.328
 driveway+2.256
 call+2.212
 bed+2.211
Neg logit contributions
ww-2.887
dd-2.304
lish-2.160
www-2.145
bly-2.092
Token fort
Token ID6285
Feature activation+7.99e-01

Pos logit contributions
 search+1.366
 lap+1.347
 backs+1.323
 bank+1.286
 bed+1.280
Neg logit contributions
ww-2.898
www-2.289
dd-2.279
ha-2.197
lish-2.196
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
 search+1.245
 bed+1.230
 lap+1.229
 bank+1.218
 backs+1.204
Neg logit contributions
ww-1.982
dd-1.581
www-1.527
lish-1.510
iest-1.503
Token\n
Token ID198
Feature activation-1.36e+00

Pos logit contributions
ww+1.544
dd+1.314
www+1.263
lish+1.253
iest+1.220
Neg logit contributions
 search-0.505
 bed-0.495
 lap-0.473
 call-0.468
 bank-0.460
Token\n
Token ID198
Feature activation-4.63e+00

Pos logit contributions
ww+4.500
dd+3.855
lish+3.743
www+3.709
bly+3.589
Neg logit contributions
 search-1.243
 bed-1.179
 lap-1.127
 call-1.116
 find-1.011
Token"
Token ID1
Feature activation-1.72e+00

Pos logit contributions
ww+10.283
iest+8.270
dd+8.037
reetings+7.547
www+7.291
Neg logit contributions
 search-11.192
 call-10.606
 lap-10.453
 find-10.448
 play-10.274
TokenLet
Token ID5756
Feature activation-1.61e+00

Pos logit contributions
ww+3.992
dd+3.217
lish+2.992
www+2.984
iest+2.900
Neg logit contributions
 search-3.243
 lap-3.110
 bed-3.090
 call-3.085
 bank-3.018
Token's
Token ID338
Feature activation+1.33e+00

Pos logit contributions
ww+4.772
dd+4.098
lish+3.922
bly+3.879
www+3.848
Neg logit contributions
 search-2.006
 bed-1.945
 call-1.929
 bank-1.838
 find-1.829
Token lock
Token ID5793
Feature activation-1.08e+00

Pos logit contributions
 lap+1.302
 tears+1.248
 cries+1.217
 bed+1.215
 bank+1.175
Neg logit contributions
ww-4.104
dd-3.430
www-3.369
P-3.292
bly-3.269
Token the
Token ID262
Feature activation+3.17e-02

Pos logit contributions
ww+2.993
dd+2.514
www+2.376
lish+2.371
P+2.320
Neg logit contributions
 bed-1.535
 search-1.500
 call-1.442
 lap-1.432
 bank-1.409
Token wet
Token ID9583
Feature activation-8.84e-01

Pos logit contributions
 search+0.032
 bed+0.031
 lap+0.031
 call+0.031
 bank+0.031
Neg logit contributions
ww-0.052
dd-0.042
www-0.040
lish-0.040
P-0.039
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
ww+2.864
dd+2.452
www+2.364
lish+2.322
P+2.292
Neg logit contributions
 bed-0.832
 search-0.821
 call-0.758
 lap-0.749
 eye-0.742
Token muddy
Token ID39085
Feature activation-5.00e-01

Pos logit contributions
ww+3.448
dd+2.872
www+2.732
lish+2.681
P+2.657
Neg logit contributions
 bed-1.901
 search-1.896
 call-1.759
 lap-1.747
 tears-1.735
Token.
Token ID13
Feature activation-4.19e-01

Pos logit contributions
ww+1.567
dd+1.334
www+1.283
lish+1.268
P+1.241
Neg logit contributions
 search-0.506
 bed-0.503
 lap-0.471
 call-0.469
 bank-0.464
Token\n
Token ID198
Feature activation-1.31e+00

Pos logit contributions
ww+1.291
dd+1.095
www+1.052
lish+1.042
ha+1.013
Neg logit contributions
 search-0.454
 bed-0.449
 lap-0.427
 call-0.424
 bank-0.417
Token\n
Token ID198
Feature activation-4.62e+00

Pos logit contributions
ww+4.340
dd+3.707
www+3.585
lish+3.581
iest+3.476
Neg logit contributions
 search-1.205
 bed-1.159
 lap-1.093
 call-1.082
 bank-0.986
Token"
Token ID1
Feature activation-2.43e+00

Pos logit contributions
ww+9.825
iest+7.350
dd+6.955
www+6.833
w+6.517
Neg logit contributions
 search-12.013
 find-11.369
 call-11.267
 lap-11.204
 bed-11.139
TokenCan
Token ID6090
Feature activation-2.40e+00

Pos logit contributions
ww+5.713
dd+4.424
www+4.274
iest+4.179
lish+4.108
Neg logit contributions
 search-4.841
 bed-4.609
 lap-4.528
 call-4.523
 bank-4.365
Token we
Token ID356
Feature activation+4.38e-01

Pos logit contributions
ww+4.468
dd+3.523
lish+3.105
www+3.104
iest+3.080
Neg logit contributions
 search-6.118
 bed-6.092
 call-5.884
 find-5.823
 lap-5.770
Token go
Token ID467
Feature activation-1.42e-01

Pos logit contributions
 bed+0.406
 search+0.406
 lap+0.402
 bank+0.399
 tears+0.382
Neg logit contributions
ww-1.376
dd-1.162
www-1.130
lish-1.106
ha-1.091
Token to
Token ID284
Feature activation+9.14e-01

Pos logit contributions
ww+0.392
dd+0.326
www+0.312
lish+0.309
ha+0.300
Neg logit contributions
 search-0.193
 bed-0.191
 lap-0.185
 call-0.183
 bank-0.182
Token on
Token ID319
Feature activation+9.81e-01

Pos logit contributions
ww+4.145
dd+3.208
www+3.176
lish+2.969
ha+2.964
Neg logit contributions
 bed-2.894
 search-2.859
 bank-2.791
 lap-2.783
 tears-2.766
Token the
Token ID262
Feature activation+5.59e-01

Pos logit contributions
 lap+1.608
 search+1.607
 eye+1.569
 call+1.567
 bed+1.553
Neg logit contributions
ww-2.370
dd-1.940
lish-1.852
P-1.803
www-1.802
Token fence
Token ID13990
Feature activation+9.59e-01

Pos logit contributions
 search+0.542
 bed+0.497
 call+0.489
 lap+0.483
 bank+0.469
Neg logit contributions
ww-1.768
dd-1.494
www-1.446
lish-1.445
P-1.412
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
 search+1.108
 bed+1.084
 call+1.032
 lap+1.017
 bank+1.016
Neg logit contributions
ww-2.768
dd-2.321
lish-2.261
www-2.241
ha-2.184
Token\n
Token ID198
Feature activation-2.00e+00

Pos logit contributions
 search+0.148
 bed+0.144
 call+0.140
 lap+0.139
 bank+0.138
Neg logit contributions
ww-0.363
dd-0.306
www-0.293
lish-0.291
P-0.285
Token\n
Token ID198
Feature activation-4.62e+00

Pos logit contributions
ww+6.907
dd+5.981
lish+5.829
www+5.735
iest+5.649
Neg logit contributions
 search-1.640
 bed-1.514
 lap-1.443
 call-1.432
 find-1.320
Token"
Token ID1
Feature activation-1.92e+00

Pos logit contributions
ww+8.889
iest+6.961
dd+6.550
reetings+5.955
www+5.938
Neg logit contributions
 search-12.530
 find-12.215
 play-11.814
 bank-11.754
 lap-11.725
TokenHello
Token ID15496
Feature activation-2.43e+00

Pos logit contributions
ww+4.330
dd+3.454
www+3.273
lish+3.210
iest+3.136
Neg logit contributions
 search-3.723
 bed-3.573
 call-3.553
 lap-3.550
 bank-3.484
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
ww+6.981
dd+6.222
bly+5.843
lish+5.743
www+5.681
Neg logit contributions
 search-3.367
 bed-3.284
 eyes-3.200
 call-3.156
 eye-3.150
Token little
Token ID1310
Feature activation-1.55e+00

Pos logit contributions
ww+4.087
dd+3.516
lish+3.286
www+3.232
P+3.162
Neg logit contributions
 call-2.370
 bed-2.362
 search-2.323
 eye-2.281
 eyes-2.219
Token ones
Token ID3392
Feature activation-1.59e+00

Pos logit contributions
ww+4.273
dd+3.679
www+3.431
bly+3.359
lish+3.346
Neg logit contributions
 search-2.304
 bed-2.235
 eye-2.168
 call-2.130
 eyes-2.116
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
ww+1.104
dd+0.922
www+0.883
lish+0.877
P+0.849
Neg logit contributions
 search-0.507
 bed-0.499
 lap-0.479
 call-0.478
 bank-0.471
Token the
Token ID262
Feature activation-1.92e-01

Pos logit contributions
 bed+2.306
 lap+2.289
 eye+2.242
 search+2.219
 eyes+2.212
Neg logit contributions
ww-2.776
dd-2.190
www-2.126
ha-2.095
iest-2.016
Token top
Token ID1353
Feature activation-3.21e-01

Pos logit contributions
ww+0.590
dd+0.500
www+0.481
lish+0.477
P+0.464
Neg logit contributions
 search-0.208
 bed-0.204
 lap-0.196
 call-0.194
 bank-0.191
Token.
Token ID13
Feature activation+2.55e-01

Pos logit contributions
ww+0.903
dd+0.755
www+0.720
lish+0.716
P+0.694
Neg logit contributions
 search-0.424
 bed-0.419
 lap-0.403
 call-0.402
 bank-0.397
Token\n
Token ID198
Feature activation-1.32e+00

Pos logit contributions
 search+0.282
 bed+0.280
 call+0.268
 lap+0.265
 bank+0.264
Neg logit contributions
ww-0.766
dd-0.644
www-0.621
lish-0.618
P-0.605
Token\n
Token ID198
Feature activation-4.62e+00

Pos logit contributions
ww+4.353
dd+3.730
lish+3.638
www+3.577
iest+3.481
Neg logit contributions
 search-1.244
 bed-1.177
 lap-1.133
 call-1.115
 bank-1.020
Token"
Token ID1
Feature activation-1.50e+00

Pos logit contributions
ww+10.106
dd+8.003
iest+7.904
reetings+7.618
w+7.085
Neg logit contributions
 search-11.374
 find-10.644
 call-10.606
 bank-10.455
 lap-10.436
TokenReady
Token ID35474
Feature activation-1.08e+00

Pos logit contributions
ww+3.507
dd+2.837
lish+2.641
www+2.625
bly+2.553
Neg logit contributions
 search-2.824
 lap-2.698
 bed-2.678
 call-2.676
 bank-2.663
Token,
Token ID11
Feature activation-1.40e+00

Pos logit contributions
ww+2.911
dd+2.425
www+2.297
lish+2.296
P+2.222
Neg logit contributions
 search-1.636
 bed-1.609
 lap-1.556
 call-1.556
 eye-1.499
Token set
Token ID900
Feature activation-1.91e+00

Pos logit contributions
ww+4.648
dd+4.041
lish+3.946
www+3.833
iest+3.812
Neg logit contributions
 search-1.296
 bed-1.258
 call-1.190
 eye-1.126
 bank-1.124
Token,
Token ID11
Feature activation-1.43e+00

Pos logit contributions
ww+5.330
dd+4.632
lish+4.442
www+4.243
bly+4.206
Neg logit contributions
 search-2.741
 bed-2.730
 call-2.632
 find-2.534
 lap-2.532
Token to
Token ID284
Feature activation+1.98e+00

Pos logit contributions
 lap+1.273
 bed+1.259
 search+1.233
 call+1.221
 eye+1.217
Neg logit contributions
ww-2.708
dd-2.259
www-2.150
ha-2.124
lish-2.112
Token climb
Token ID12080
Feature activation-9.51e-02

Pos logit contributions
 bed+2.327
 lap+2.282
 tears+2.242
 eyes+2.239
 eye+2.189
Neg logit contributions
ww-5.667
www-4.686
dd-4.584
ha-4.553
iest-4.442
Token it
Token ID340
Feature activation+1.36e+00

Pos logit contributions
ww+0.240
dd+0.196
www+0.186
lish+0.185
ha+0.179
Neg logit contributions
 search-0.152
 bed-0.150
 lap-0.147
 call-0.146
 bank-0.143
Token.
Token ID13
Feature activation+6.58e-02

Pos logit contributions
 lap+1.787
 bed+1.786
 call+1.777
 search+1.767
 eye+1.753
Neg logit contributions
ww-3.718
dd-2.997
www-2.963
ha-2.917
iest-2.906
Token\n
Token ID198
Feature activation-4.00e-01

Pos logit contributions
 search+0.074
 bed+0.073
 call+0.070
 lap+0.069
 bank+0.069
Neg logit contributions
ww-0.198
dd-0.167
www-0.161
lish-0.160
P-0.156
Token\n
Token ID198
Feature activation-4.61e+00

Pos logit contributions
ww+1.233
dd+1.046
www+1.005
lish+1.001
P+0.968
Neg logit contributions
 search-0.426
 bed-0.418
 lap-0.399
 call-0.397
 bank-0.387
Token"
Token ID1
Feature activation-1.57e+00

Pos logit contributions
ww+10.334
iest+8.045
dd+7.937
reetings+7.328
www+7.245
Neg logit contributions
 search-11.397
 find-10.857
 call-10.633
 lap-10.501
 bank-10.495
TokenLet
Token ID5756
Feature activation-1.38e+00

Pos logit contributions
ww+3.533
dd+2.831
lish+2.633
www+2.613
iest+2.553
Neg logit contributions
 search-3.091
 lap-2.951
 bed-2.934
 call-2.916
 bank-2.897
Token's
Token ID338
Feature activation+1.21e+00

Pos logit contributions
ww+4.013
dd+3.420
lish+3.272
www+3.225
bly+3.220
Neg logit contributions
 search-1.770
 bed-1.725
 call-1.697
 bank-1.648
 lap-1.602
Token go
Token ID467
Feature activation-4.02e-01

Pos logit contributions
 lap+1.352
 bed+1.344
 tears+1.303
 bank+1.274
 search+1.268
Neg logit contributions
ww-3.549
dd-2.926
www-2.904
P-2.775
ha-2.760
Token up
Token ID510
Feature activation-1.05e+00

Pos logit contributions
ww+1.058
dd+0.873
www+0.828
lish+0.827
P+0.795
Neg logit contributions
 search-0.609
 bed-0.600
 lap-0.581
 call-0.580
 bank-0.573
Token is
Token ID318
Feature activation+1.46e-01

Pos logit contributions
ww+2.060
dd+1.720
www+1.605
lish+1.596
P+1.558
Neg logit contributions
 search-1.253
 bed-1.234
 lap-1.195
 bank-1.195
 call-1.192
Token a
Token ID257
Feature activation-7.95e-01

Pos logit contributions
 search+0.237
 bed+0.232
 lap+0.229
 call+0.227
 bank+0.226
Neg logit contributions
ww-0.366
dd-0.297
www-0.282
lish-0.281
P-0.269
Token flood
Token ID6947
Feature activation-1.60e-01

Pos logit contributions
ww+2.142
dd+1.818
lish+1.708
www+1.698
bly+1.649
Neg logit contributions
 bed-1.151
 search-1.150
 call-1.099
 lap-1.082
 bank-1.081
Token.
Token ID13
Feature activation+4.40e-01

Pos logit contributions
ww+0.488
dd+0.413
www+0.397
lish+0.395
P+0.383
Neg logit contributions
 search-0.172
 bed-0.169
 lap-0.162
 call-0.161
 bank-0.158
Token\n
Token ID198
Feature activation+4.54e-03

Pos logit contributions
 search+0.544
 bed+0.522
 bank+0.516
 call+0.515
 lap+0.506
Neg logit contributions
ww-1.274
dd-1.065
lish-1.023
www-1.019
P-0.998
Token\n
Token ID198
Feature activation-4.61e+00

Pos logit contributions
 search+0.005
 bed+0.005
 lap+0.005
 call+0.005
 bank+0.005
Neg logit contributions
ww-0.014
dd-0.011
www-0.011
lish-0.011
ha-0.011
Token"
Token ID1
Feature activation-1.93e+00

Pos logit contributions
ww+10.451
iest+8.671
dd+8.090
reetings+7.812
www+7.632
Neg logit contributions
 search-10.886
 find-10.274
 bank-10.268
 bed-9.925
 play-9.830
TokenOh
Token ID5812
Feature activation-2.26e+00

Pos logit contributions
ww+4.411
dd+3.475
www+3.313
lish+3.234
iest+3.166
Neg logit contributions
 search-3.785
 bed-3.664
 call-3.589
 lap-3.579
 bank-3.560
Token no
Token ID645
Feature activation-8.79e-01

Pos logit contributions
ww+5.433
dd+4.782
lish+4.394
www+4.274
bly+4.249
Neg logit contributions
 bed-4.232
 eyes-4.100
 search-4.055
 call-3.938
 lap-3.854
Token!"
Token ID2474
Feature activation-5.07e-01

Pos logit contributions
ww+1.944
dd+1.567
lish+1.454
www+1.445
P+1.417
Neg logit contributions
 bed-1.715
 search-1.707
 call-1.656
 lap-1.633
 bank-1.628
Token Lily
Token ID20037
Feature activation+1.44e+00

Pos logit contributions
ww+1.368
dd+1.129
www+1.079
lish+1.069
iest+1.033
Neg logit contributions
 search-0.742
 bed-0.739
 call-0.708
 lap-0.704
 bank-0.698
Token had
Token ID550
Feature activation+6.34e-02

Pos logit contributions
ww+3.306
dd+2.750
lish+2.596
www+2.577
P+2.512
Neg logit contributions
 search-2.050
 bed-2.022
 call-1.921
 lap-1.903
 tears-1.868
Token a
Token ID257
Feature activation+9.00e-02

Pos logit contributions
 search+0.097
 bed+0.095
 lap+0.093
 call+0.092
 bank+0.091
Neg logit contributions
ww-0.165
dd-0.135
www-0.129
lish-0.127
ha-0.124
Token flag
Token ID6056
Feature activation+3.25e-03

Pos logit contributions
 search+0.120
 bed+0.117
 lap+0.114
 call+0.113
 bank+0.112
Neg logit contributions
ww-0.252
dd-0.209
www-0.201
lish-0.197
ha-0.193
Token.
Token ID13
Feature activation-4.42e-01

Pos logit contributions
 search+0.005
 bed+0.005
 lap+0.005
 call+0.005
 bank+0.005
Neg logit contributions
ww-0.008
dd-0.007
www-0.006
lish-0.006
ha-0.006
Token\n
Token ID198
Feature activation-1.25e+00

Pos logit contributions
ww+1.382
dd+1.175
www+1.129
lish+1.121
iest+1.090
Neg logit contributions
 search-0.458
 bed-0.452
 lap-0.430
 call-0.425
 bank-0.420
Token\n
Token ID198
Feature activation-4.61e+00

Pos logit contributions
ww+4.144
dd+3.548
lish+3.428
www+3.416
iest+3.311
Neg logit contributions
 search-1.157
 bed-1.114
 lap-1.062
 call-1.038
 bank-0.966
Token"
Token ID1
Feature activation-1.78e+00

Pos logit contributions
ww+9.361
iest+7.302
dd+6.801
reetings+6.599
w+6.440
Neg logit contributions
 search-12.244
 find-11.875
 bank-11.731
 lap-11.670
 play-11.527
TokenWow
Token ID22017
Feature activation-2.42e+00

Pos logit contributions
ww+3.903
dd+3.135
lish+2.924
www+2.902
iest+2.780
Neg logit contributions
 search-3.575
 call-3.403
 bed-3.389
 lap-3.384
 bank-3.368
Token,
Token ID11
Feature activation-1.49e+00

Pos logit contributions
ww+6.963
dd+5.886
lish+5.652
www+5.608
P+5.545
Neg logit contributions
 bed-3.524
 search-3.365
 call-3.132
 find-3.129
 eyes-3.095
Token look
Token ID804
Feature activation-2.17e+00

Pos logit contributions
ww+5.019
dd+4.405
lish+4.300
www+4.189
iest+4.165
Neg logit contributions
 bed-1.256
 search-1.253
 call-1.143
 eyes-1.050
 tears-1.046
Token at
Token ID379
Feature activation-3.67e-01

Pos logit contributions
ww+5.447
dd+4.425
www+4.219
iest+4.181
bly+4.161
Neg logit contributions
 bed-3.655
 search-3.646
 eye-3.553
 eyes-3.550
 call-3.540
Token a
Token ID257
Feature activation+2.40e-01

Pos logit contributions
 search+0.291
 bed+0.286
 lap+0.283
 call+0.280
 bank+0.276
Neg logit contributions
ww-0.455
dd-0.370
www-0.351
lish-0.350
ha-0.338
Token big
Token ID1263
Feature activation+9.49e-01

Pos logit contributions
 search+0.331
 bed+0.324
 lap+0.320
 call+0.315
 bank+0.312
Neg logit contributions
ww-0.659
dd-0.538
www-0.520
lish-0.513
ha-0.502
Token box
Token ID3091
Feature activation+7.60e-02

Pos logit contributions
 search+1.125
 lap+1.114
 call+1.091
 backs+1.089
 bank+1.082
Neg logit contributions
ww-2.742
dd-2.215
www-2.182
lish-2.123
ha-2.099
Token.
Token ID13
Feature activation+5.47e-01

Pos logit contributions
 search+0.129
 bed+0.127
 lap+0.124
 call+0.123
 bank+0.123
Neg logit contributions
ww-0.184
dd-0.148
www-0.140
lish-0.140
iest-0.135
Token\n
Token ID198
Feature activation-9.78e-01

Pos logit contributions
 search+0.627
 bed+0.617
 bank+0.602
 call+0.598
 lap+0.591
Neg logit contributions
ww-1.604
dd-1.349
lish-1.307
www-1.294
P-1.274
Token\n
Token ID198
Feature activation-4.59e+00

Pos logit contributions
ww+3.150
dd+2.696
lish+2.597
www+2.590
iest+2.499
Neg logit contributions
 search-0.960
 bed-0.929
 lap-0.883
 call-0.867
 bank-0.806
Token"
Token ID1
Feature activation-1.83e+00

Pos logit contributions
ww+11.422
dd+9.182
iest+9.097
www+8.542
w+8.441
Neg logit contributions
 search-10.480
 lap-9.563
 call-9.453
 play-9.434
 bed-9.353
TokenWhat
Token ID2061
Feature activation-1.47e+00

Pos logit contributions
ww+4.447
dd+3.546
www+3.394
lish+3.297
iest+3.248
Neg logit contributions
 search-3.398
 bed-3.211
 lap-3.194
 call-3.177
 bank-3.133
Token can
Token ID460
Feature activation-7.18e-01

Pos logit contributions
ww+4.072
dd+3.483
lish+3.280
www+3.262
bly+3.162
Neg logit contributions
 search-2.120
 bed-2.078
 call-2.018
 lap-1.948
 eye-1.923
Token we
Token ID356
Feature activation+1.46e+00

Pos logit contributions
ww+1.029
dd+0.695
www+0.617
lish+0.614
P+0.562
Neg logit contributions
 search-1.980
 bed-1.957
 call-1.919
 lap-1.908
 bank-1.890
Token do
Token ID466
Feature activation-1.71e+00

Pos logit contributions
 lap+2.076
 tears+2.005
 bank+1.989
 bed+1.987
 backs+1.962
Neg logit contributions
ww-3.853
dd-3.102
www-3.064
P-2.998
lish-2.928
Token from
Token ID422
Feature activation+7.72e-01

Pos logit contributions
 search+0.351
 bed+0.342
 lap+0.338
 call+0.331
 bank+0.326
Neg logit contributions
ww-0.730
dd-0.606
lish-0.581
www-0.579
bly-0.559
Token the
Token ID262
Feature activation+2.01e-01

Pos logit contributions
 lap+1.152
 search+1.139
 call+1.116
 bed+1.102
 bank+1.089
Neg logit contributions
ww-1.980
dd-1.645
lish-1.548
www-1.537
bly-1.503
Token sky
Token ID6766
Feature activation+1.06e+00

Pos logit contributions
 search+0.202
 bed+0.191
 lap+0.186
 call+0.185
 bank+0.181
Neg logit contributions
ww-0.628
dd-0.528
www-0.513
lish-0.512
P-0.497
Token.
Token ID13
Feature activation-1.63e-01

Pos logit contributions
 search+1.256
 bed+1.208
 lap+1.194
 bank+1.170
 call+1.159
Neg logit contributions
ww-3.071
dd-2.542
lish-2.507
www-2.454
iest-2.432
Token\n
Token ID198
Feature activation-1.41e+00

Pos logit contributions
ww+0.501
dd+0.425
www+0.408
lish+0.406
P+0.395
Neg logit contributions
 search-0.175
 bed-0.172
 lap-0.164
 call-0.164
 bank-0.161
Token\n
Token ID198
Feature activation-4.58e+00

Pos logit contributions
ww+4.673
dd+3.998
lish+3.874
www+3.861
iest+3.773
Neg logit contributions
 search-1.268
 bed-1.220
 lap-1.158
 call-1.140
 find-1.030
Token"
Token ID1
Feature activation-2.43e+00

Pos logit contributions
ww+9.954
iest+8.084
dd+7.237
www+7.071
w+6.845
Neg logit contributions
 search-11.300
 find-10.692
 call-10.600
 lap-10.577
 bed-10.539
TokenMom
Token ID29252
Feature activation-1.65e-01

Pos logit contributions
ww+5.724
dd+4.443
www+4.275
iest+4.161
lish+4.065
Neg logit contributions
 search-4.839
 bed-4.565
 lap-4.532
 call-4.525
 bank-4.411
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
ww+0.464
dd+0.387
www+0.371
lish+0.368
ha+0.356
Neg logit contributions
 search-0.217
 bed-0.214
 lap-0.207
 call-0.205
 bank-0.203
Token can
Token ID460
Feature activation-1.60e+00

Pos logit contributions
ww+4.651
dd+4.007
lish+3.800
www+3.729
P+3.662
Neg logit contributions
 bed-2.417
 search-2.388
 call-2.286
 find-2.209
 eye-2.201
Token we
Token ID356
Feature activation+5.04e-01

Pos logit contributions
ww+2.630
dd+1.922
lish+1.709
www+1.698
P+1.613
Neg logit contributions
 search-4.279
 bed-4.249
 call-4.163
 lap-4.075
 find-4.016
Token was
Token ID373
Feature activation-1.21e+00

Pos logit contributions
ww+2.404
dd+2.002
lish+1.906
www+1.897
bly+1.843
Neg logit contributions
 search-1.298
 bed-1.265
 call-1.219
 lap-1.204
 bank-1.197
Token behind
Token ID2157
Feature activation-2.43e-01

Pos logit contributions
ww+3.433
dd+2.840
lish+2.760
www+2.716
P+2.653
Neg logit contributions
 search-1.650
 bed-1.614
 call-1.581
 bank-1.555
 find-1.496
Token it
Token ID340
Feature activation+9.99e-01

Pos logit contributions
ww+0.558
dd+0.445
www+0.420
lish+0.416
P+0.399
Neg logit contributions
 search-0.449
 bed-0.445
 lap-0.433
 call-0.432
 bank-0.428
Token.
Token ID13
Feature activation+1.77e-01

Pos logit contributions
 lap+1.241
 bed+1.217
 search+1.212
 bank+1.187
 backs+1.170
Neg logit contributions
ww-2.847
dd-2.353
www-2.297
lish-2.268
ha-2.260
Token\n
Token ID198
Feature activation-1.16e+00

Pos logit contributions
 search+0.191
 bed+0.189
 call+0.181
 lap+0.180
 bank+0.177
Neg logit contributions
ww-0.539
dd-0.457
www-0.439
lish-0.436
P-0.427
Token\n
Token ID198
Feature activation-4.57e+00

Pos logit contributions
ww+3.784
dd+3.231
lish+3.130
www+3.113
iest+3.006
Neg logit contributions
 search-1.095
 bed-1.051
 lap-1.008
 call-0.998
 bank-0.922
Token"
Token ID1
Feature activation-1.77e+00

Pos logit contributions
ww+10.529
iest+8.434
dd+8.090
reetings+7.777
www+7.581
Neg logit contributions
 search-10.588
 find-10.065
 bank-9.999
 call-9.985
 lap-9.949
TokenLet
Token ID5756
Feature activation-1.41e+00

Pos logit contributions
ww+3.973
dd+3.161
lish+2.944
www+2.925
iest+2.875
Neg logit contributions
 search-3.533
 lap-3.355
 call-3.354
 bed-3.343
 bank-3.284
Token's
Token ID338
Feature activation+1.47e+00

Pos logit contributions
ww+4.109
dd+3.495
lish+3.348
www+3.299
bly+3.291
Neg logit contributions
 search-1.804
 bed-1.755
 call-1.747
 bank-1.667
 find-1.631
Token go
Token ID467
Feature activation+1.72e-01

Pos logit contributions
 lap+1.689
 tears+1.635
 bed+1.619
 bank+1.574
 cries+1.574
Neg logit contributions
ww-4.308
dd-3.545
www-3.527
ha-3.359
P-3.352
Token to
Token ID284
Feature activation+1.23e+00

Pos logit contributions
 search+0.243
 bed+0.242
 lap+0.236
 bank+0.232
 call+0.232
Neg logit contributions
ww-0.464
dd-0.381
www-0.367
lish-0.359
ha-0.354
Token what
Token ID644
Feature activation-6.86e-01

Pos logit contributions
 lap+1.672
 search+1.670
 bed+1.660
 bank+1.640
 call+1.610
Neg logit contributions
ww-1.713
dd-1.335
www-1.249
lish-1.235
ha-1.225
Token is
Token ID318
Feature activation-6.16e-01

Pos logit contributions
ww+1.773
dd+1.468
lish+1.390
www+1.383
bly+1.334
Neg logit contributions
 search-1.081
 bed-1.063
 call-1.023
 lap-1.019
 bank-1.006
Token inside
Token ID2641
Feature activation-4.94e-01

Pos logit contributions
ww+1.677
dd+1.382
www+1.322
lish+1.319
P+1.272
Neg logit contributions
 search-0.897
 bed-0.882
 call-0.853
 bank-0.841
 lap-0.839
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
ww+1.562
dd+1.330
www+1.278
lish+1.269
P+1.240
Neg logit contributions
 search-0.494
 bed-0.489
 call-0.459
 lap-0.457
 bank-0.448
Token\n
Token ID198
Feature activation-1.36e+00

Pos logit contributions
 search+0.180
 bed+0.179
 call+0.172
 lap+0.170
 bank+0.168
Neg logit contributions
ww-0.502
dd-0.425
www-0.409
lish-0.406
P-0.397
Token\n
Token ID198
Feature activation-4.57e+00

Pos logit contributions
ww+4.492
dd+3.847
lish+3.740
www+3.696
bly+3.583
Neg logit contributions
 search-1.265
 bed-1.195
 lap-1.152
 call-1.130
 bank-1.028
Token"
Token ID1
Feature activation-1.24e+00

Pos logit contributions
ww+10.002
dd+7.652
iest+7.584
reetings+7.568
w+7.014
Neg logit contributions
 search-11.125
 lap-10.482
 call-10.393
 find-10.378
 bank-10.322
TokenLet
Token ID5756
Feature activation-1.35e+00

Pos logit contributions
ww+2.692
dd+2.120
lish+1.971
www+1.964
iest+1.875
Neg logit contributions
 search-2.513
 bed-2.419
 lap-2.395
 call-2.393
 bank-2.360
Token's
Token ID338
Feature activation+1.80e+00

Pos logit contributions
ww+3.952
dd+3.359
lish+3.226
www+3.170
bly+3.157
Neg logit contributions
 search-1.745
 bed-1.697
 call-1.683
 bank-1.602
 find-1.584
Token grab
Token ID5552
Feature activation-5.85e-01

Pos logit contributions
 lap+1.959
 tears+1.853
 cries+1.828
 bed+1.800
 bank+1.774
Neg logit contributions
ww-5.389
dd-4.479
www-4.449
P-4.315
bly-4.241
Token the
Token ID262
Feature activation+5.83e-02

Pos logit contributions
ww+1.583
dd+1.311
www+1.250
lish+1.246
P+1.198
Neg logit contributions
 search-0.853
 bed-0.844
 call-0.805
 lap-0.804
 bank-0.801
Token did
Token ID750
Feature activation+3.92e-02

Pos logit contributions
 lap+3.789
 bed+3.780
 search+3.562
 driveway+3.435
 eye+3.381
Neg logit contributions
ww-3.980
dd-3.334
www-3.039
ha-3.033
lish-2.989
Token not
Token ID407
Feature activation+5.78e-01

Pos logit contributions
 search+0.066
 bed+0.065
 lap+0.064
 call+0.063
 bank+0.063
Neg logit contributions
ww-0.096
dd-0.078
www-0.073
lish-0.073
ha-0.070
Token want
Token ID765
Feature activation+1.44e+00

Pos logit contributions
 bed+0.748
 lap+0.732
 search+0.726
 bank+0.712
 eyes+0.685
Neg logit contributions
ww-1.646
dd-1.372
www-1.320
lish-1.302
ha-1.275
Token to
Token ID284
Feature activation+9.94e-01

Pos logit contributions
 lap+1.971
 search+1.936
 bank+1.885
 bed+1.884
 eye+1.878
Neg logit contributions
ww-3.940
dd-3.224
www-3.108
lish-3.066
ha-3.030
Token go
Token ID467
Feature activation+4.52e-01

Pos logit contributions
 lap+1.033
 bed+1.011
 search+1.001
 eye+0.965
 bank+0.950
Neg logit contributions
ww-3.058
dd-2.558
www-2.484
lish-2.460
P-2.399
Token to
Token ID284
Feature activation+2.63e+00

Pos logit contributions
 search+0.623
 bed+0.622
 lap+0.614
 call+0.591
 eye+0.589
Neg logit contributions
ww-1.220
dd-1.004
lish-0.963
www-0.961
P-0.936
Token his
Token ID465
Feature activation+2.14e+00

Pos logit contributions
 lap+4.800
 eye+4.650
 driveway+4.567
 backs+4.489
 doorstep+4.473
Neg logit contributions
ww-5.981
dd-4.623
www-4.414
ha-4.280
iest-4.226
Token car
Token ID1097
Feature activation+1.34e+00

Pos logit contributions
 lap+2.729
 search+2.699
 call+2.597
 backs+2.588
 bed+2.545
Neg logit contributions
ww-5.808
dd-4.537
www-4.526
lish-4.507
a-4.430
Token.
Token ID13
Feature activation+8.02e-01

Pos logit contributions
 bed+1.755
 search+1.751
 lap+1.742
 backs+1.721
 tears+1.718
Neg logit contributions
ww-3.607
dd-2.949
lish-2.896
iest-2.842
www-2.833
Token They
Token ID1119
Feature activation+1.67e+00

Pos logit contributions
 search+0.823
 bed+0.805
 lap+0.772
 eye+0.768
 bank+0.767
Neg logit contributions
ww-2.496
dd-2.089
www-2.035
lish-2.016
P-1.984
Token refused
Token ID6520
Feature activation+7.09e-01

Pos logit contributions
 bed+3.290
 lap+3.257
 search+3.102
 driveway+2.990
 eye+2.959
Neg logit contributions
ww-3.654
dd-3.039
www-2.806
ha-2.797
lish-2.743
Token not
Token ID407
Feature activation+3.69e-01

Pos logit contributions
ww+1.529
dd+1.250
www+1.192
lish+1.185
ha+1.140
Neg logit contributions
 search-0.932
 bed-0.916
 call-0.894
 lap-0.887
 bank-0.875
Token go
Token ID467
Feature activation-4.51e-02

Pos logit contributions
 bed+0.517
 search+0.503
 lap+0.497
 bank+0.484
 eye+0.474
Neg logit contributions
ww-1.010
dd-0.838
www-0.803
lish-0.793
ha-0.773
Token away
Token ID1497
Feature activation-1.71e-01

Pos logit contributions
ww+0.121
dd+0.100
www+0.095
lish+0.095
P+0.092
Neg logit contributions
 search-0.065
 bed-0.065
 lap-0.063
 call-0.062
 bank-0.061
Token.
Token ID13
Feature activation+1.69e+00

Pos logit contributions
ww+0.534
dd+0.454
www+0.436
lish+0.434
P+0.422
Neg logit contributions
 search-0.175
 bed-0.172
 lap-0.165
 call-0.163
 bank-0.161
Token They
Token ID1119
Feature activation+1.97e+00

Pos logit contributions
 search+2.015
 lap+1.936
 bed+1.928
 bank+1.905
 doorstep+1.886
Neg logit contributions
ww-5.013
dd-4.151
lish-4.059
www-3.992
P-3.964
Token want
Token ID765
Feature activation+2.13e+00

Pos logit contributions
 bed+3.035
 lap+2.974
 search+2.791
 driveway+2.775
 eyes+2.774
Neg logit contributions
ww-5.018
dd-4.151
www-3.993
ha-3.885
lish-3.858
Token to
Token ID284
Feature activation+1.67e+00

Pos logit contributions
 lap+3.289
 driveway+3.166
 bank+3.151
 trunk+3.141
 eye+3.134
Neg logit contributions
ww-5.271
dd-4.387
lish-4.182
P-4.021
www-4.010
Token join
Token ID4654
Feature activation+2.50e-01

Pos logit contributions
 bed+1.931
 lap+1.929
 eyes+1.883
 search+1.882
 eye+1.879
Neg logit contributions
ww-4.845
www-3.928
dd-3.891
ha-3.730
iest-3.698
Token the
Token ID262
Feature activation-6.13e-01

Pos logit contributions
 search+0.353
 bed+0.348
 lap+0.341
 bank+0.339
 call+0.336
Neg logit contributions
ww-0.671
dd-0.559
lish-0.533
www-0.529
P-0.517
Token line
Token ID1627
Feature activation-5.87e-01

Pos logit contributions
ww+1.678
dd+1.409
lish+1.330
www+1.326
bly+1.283
Neg logit contributions
 search-0.881
 bed-0.874
 call-0.843
 lap-0.840
 bank-0.824
Token.
Token ID13
Feature activation+8.57e-01

Pos logit contributions
ww+1.854
dd+1.585
www+1.521
lish+1.510
P+1.467
Neg logit contributions
 search-0.583
 bed-0.574
 lap-0.546
 call-0.544
 bank-0.534
Token told
Token ID1297
Feature activation+5.25e-01

Pos logit contributions
 bed+0.499
 search+0.498
 lap+0.489
 bank+0.479
 eye+0.465
Neg logit contributions
ww-0.942
dd-0.781
www-0.745
lish-0.739
ha-0.722
Token him
Token ID683
Feature activation-1.18e+00

Pos logit contributions
 lap+0.606
 search+0.603
 bed+0.587
 bank+0.584
 call+0.568
Neg logit contributions
ww-1.559
dd-1.326
www-1.260
lish-1.245
ha-1.219
Token that
Token ID326
Feature activation-6.47e-01

Pos logit contributions
ww+3.155
dd+2.593
lish+2.480
www+2.479
P+2.464
Neg logit contributions
 bed-1.814
 search-1.789
 call-1.719
 lap-1.680
 tears-1.675
Token they
Token ID484
Feature activation+6.87e-01

Pos logit contributions
ww+1.517
dd+1.211
www+1.151
lish+1.145
P+1.099
Neg logit contributions
 bed-1.180
 search-1.179
 call-1.133
 lap-1.125
 bank-1.119
Token needed
Token ID2622
Feature activation+1.06e+00

Pos logit contributions
 lap+1.170
 bed+1.170
 search+1.146
 bank+1.111
 call+1.097
Neg logit contributions
ww-1.643
dd-1.349
www-1.257
lish-1.253
ha-1.231
Token to
Token ID284
Feature activation+2.19e+00

Pos logit contributions
 lap+1.357
 search+1.327
 bank+1.318
 eye+1.305
 bed+1.282
Neg logit contributions
ww-2.992
dd-2.499
www-2.379
ha-2.329
lish-2.308
Token replace
Token ID6330
Feature activation+6.81e-01

Pos logit contributions
 lap+2.426
 backs+2.131
 eyes+2.115
 tears+2.092
 bank+2.080
Neg logit contributions
ww-6.537
dd-5.433
www-5.333
bly-5.293
P-5.171
Token the
Token ID262
Feature activation+8.88e-01

Pos logit contributions
 search+1.193
 lap+1.174
 bed+1.163
 call+1.149
 eye+1.134
Neg logit contributions
ww-1.604
dd-1.288
www-1.219
lish-1.209
P-1.165
Token truck
Token ID7779
Feature activation+5.06e-02

Pos logit contributions
 search+1.055
 bed+1.034
 lap+1.012
 call+0.977
 backs+0.953
Neg logit contributions
ww-2.582
dd-2.138
www-2.086
lish-2.043
iest-2.011
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
 search+0.080
 bed+0.079
 lap+0.076
 call+0.076
 bank+0.075
Neg logit contributions
ww-0.129
dd-0.104
www-0.100
lish-0.099
ha-0.095
Token
Token ID198
Feature activation+8.86e-01

Pos logit contributions
 search+1.566
 eye+1.564
 bed+1.539
 call+1.497
 lap+1.488
Neg logit contributions
ww-3.284
dd-2.693
www-2.648
ha-2.531
P-2.521
Token.
Token ID13
Feature activation+9.21e-01

Pos logit contributions
 search+0.080
 bed+0.079
 lap+0.076
 call+0.076
 bank+0.075
Neg logit contributions
ww-0.144
dd-0.119
www-0.113
lish-0.113
P-0.110
Token The
Token ID383
Feature activation+3.46e-01

Pos logit contributions
 search+0.892
 bank+0.879
 bed+0.863
 call+0.854
 lap+0.837
Neg logit contributions
ww-2.923
dd-2.436
www-2.389
lish-2.344
P-2.297
Token cater
Token ID20825
Feature activation-1.09e+00

Pos logit contributions
 search+0.668
 bed+0.656
 lap+0.654
 call+0.647
 bank+0.646
Neg logit contributions
ww-0.767
dd-0.591
www-0.567
lish-0.560
P-0.550
Tokenpillar
Token ID41643
Feature activation-1.85e-01

Pos logit contributions
ww+1.900
dd+1.373
www+1.275
lish+1.265
bly+1.207
Neg logit contributions
 search-2.578
 bed-2.552
 lap-2.502
 bank-2.498
 call-2.493
Token wanted
Token ID2227
Feature activation+8.38e-01

Pos logit contributions
ww+0.459
dd+0.373
www+0.354
lish+0.351
P+0.339
Neg logit contributions
 search-0.303
 bed-0.301
 lap-0.292
 call-0.291
 bank-0.288
Token to
Token ID284
Feature activation+2.80e+00

Pos logit contributions
 search+1.047
 lap+1.033
 bed+1.022
 call+1.018
 bank+1.012
Neg logit contributions
ww-2.396
dd-1.977
www-1.901
P-1.859
lish-1.856
Token go
Token ID467
Feature activation+8.47e-01

Pos logit contributions
 driveway+3.564
 backs+3.443
 eyes+3.421
 workshop+3.362
 lap+3.330
Neg logit contributions
ww-8.079
www-6.506
P-6.434
dd-6.276
"-6.116
Token on
Token ID319
Feature activation+9.01e-01

Pos logit contributions
 backs+1.487
 tears+1.482
 lap+1.481
 bank+1.479
 bed+1.478
Neg logit contributions
ww-1.942
dd-1.514
www-1.451
P-1.425
lish-1.415
Token an
Token ID281
Feature activation+1.46e+00

Pos logit contributions
 call+1.489
 lap+1.484
 eye+1.472
 search+1.452
 bank+1.452
Neg logit contributions
ww-2.214
dd-1.769
www-1.680
P-1.655
lish-1.641
Token adventure
Token ID8855
Feature activation+6.11e-01

Pos logit contributions
 lap+1.798
 search+1.786
 bank+1.780
 call+1.763
 bed+1.751
Neg logit contributions
ww-4.213
dd-3.454
www-3.386
P-3.252
lish-3.236
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
 search+0.820
 bed+0.817
 lap+0.807
 call+0.805
 bank+0.800
Neg logit contributions
ww-1.670
dd-1.380
www-1.329
lish-1.301
P-1.283
Token.
Token ID13
Feature activation+1.80e+00

Pos logit contributions
 search+3.070
 tears+2.962
 slam+2.908
 lap+2.889
 backs+2.843
Neg logit contributions
ww-5.847
dd-4.780
iest-4.695
lish-4.672
www-4.471
Token
Token ID198
Feature activation+8.39e-01

Pos logit contributions
 bank+2.671
 search+2.648
 eye+2.626
 doorstep+2.621
 bed+2.619
Neg logit contributions
ww-4.799
dd-3.788
www-3.739
P-3.636
ha-3.598
Token
Token ID198
Feature activation-1.49e+00

Pos logit contributions
 bank+1.068
 bed+1.058
 search+1.049
 backs+1.045
 eye+1.038
Neg logit contributions
ww-2.357
dd-1.986
www-1.881
ha-1.849
P-1.847
TokenTim
Token ID14967
Feature activation+8.79e-01

Pos logit contributions
ww+2.801
dd+2.149
lish+1.992
www+1.941
bly+1.911
Neg logit contributions
 search-3.494
 bed-3.346
 call-3.345
 bank-3.335
 lap-3.327
Token went
Token ID1816
Feature activation+1.02e+00

Pos logit contributions
 bed+1.338
 lap+1.307
 search+1.288
 call+1.247
 bank+1.231
Neg logit contributions
ww-2.290
dd-1.895
www-1.781
P-1.780
ha-1.752
Token to
Token ID284
Feature activation+3.79e+00

Pos logit contributions
 bed+1.304
 search+1.300
 backs+1.294
 bank+1.287
 lap+1.286
Neg logit contributions
ww-2.796
dd-2.303
www-2.212
P-2.202
lish-2.198
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
 backs+8.318
 bank+8.139
 doorstep+7.891
 driveway+7.626
 lap+7.580
Neg logit contributions
ww-7.470
 oh-5.844
"-5.690
P-5.680
 you-5.631
Token park
Token ID3952
Feature activation+2.03e+00

Pos logit contributions
 lap+1.266
 search+1.261
 backs+1.215
 bed+1.187
 call+1.184
Neg logit contributions
ww-3.532
P-2.871
www-2.825
dd-2.817
lish-2.781
Token and
Token ID290
Feature activation+1.55e+00

Pos logit contributions
 backs+3.766
 tears+3.711
 bed+3.659
 eyes+3.591
 cries+3.572
Neg logit contributions
ww-4.277
dd-3.280
www-3.167
"-3.165
a-3.150
Token saw
Token ID2497
Feature activation+1.32e+00

Pos logit contributions
 lap+2.375
 backs+2.328
 eye+2.325
 bed+2.303
 search+2.275
Neg logit contributions
ww-3.924
dd-3.258
www-3.070
ha-3.057
P-3.013
Token his
Token ID465
Feature activation+2.10e+00

Pos logit contributions
 search+2.334
 lap+2.299
 bank+2.228
 doorstep+2.211
 call+2.210
Neg logit contributions
ww-3.127
dd-2.508
P-2.337
lish-2.333
www-2.316
Token and
Token ID290
Feature activation-4.29e-01

Pos logit contributions
 search+0.471
 bed+0.466
 lap+0.453
 bank+0.444
 call+0.440
Neg logit contributions
ww-1.005
dd-0.843
www-0.804
lish-0.799
iest-0.778
Token they
Token ID484
Feature activation+7.04e-01

Pos logit contributions
ww+1.134
dd+0.934
www+0.889
lish+0.884
P+0.855
Neg logit contributions
 search-0.646
 bed-0.638
 call-0.619
 lap-0.616
 bank-0.609
Token both
Token ID1111
Feature activation+7.68e-01

Pos logit contributions
 search+0.969
 bed+0.967
 lap+0.955
 eye+0.907
 bank+0.904
Neg logit contributions
ww-1.929
dd-1.623
www-1.532
lish-1.506
ha-1.503
Token became
Token ID2627
Feature activation-7.85e-01

Pos logit contributions
 search+1.232
 bed+1.213
 lap+1.190
 bank+1.162
 call+1.159
Neg logit contributions
ww-1.937
dd-1.583
www-1.499
lish-1.470
ha-1.447
Token friends
Token ID2460
Feature activation-6.34e-01

Pos logit contributions
ww+2.048
dd+1.684
www+1.603
lish+1.583
ha+1.526
Neg logit contributions
 search-1.223
 bed-1.208
 call-1.168
 bank-1.154
 lap-1.151
Token.
Token ID13
Feature activation+1.57e+00

Pos logit contributions
ww+1.994
dd+1.689
www+1.632
lish+1.610
P+1.577
Neg logit contributions
 search-0.648
 bed-0.639
 lap-0.609
 call-0.601
 bank-0.590
Token The
Token ID383
Feature activation-7.39e-01

Pos logit contributions
 search+2.036
 call+1.942
 bank+1.933
 bed+1.932
 eye+1.916
Neg logit contributions
ww-4.505
dd-3.686
www-3.593
lish-3.512
P-3.470
Token girl
Token ID2576
Feature activation+3.97e-01

Pos logit contributions
ww+1.656
dd+1.325
www+1.230
lish+1.203
bly+1.158
Neg logit contributions
 search-1.452
 bed-1.440
 call-1.404
 lap-1.390
 eye-1.388
Token was
Token ID373
Feature activation-8.43e-01

Pos logit contributions
 search+0.604
 bed+0.597
 lap+0.577
 call+0.572
 bank+0.569
Neg logit contributions
ww-1.030
dd-0.844
www-0.805
lish-0.793
ha-0.776
Token so
Token ID523
Feature activation-1.63e+00

Pos logit contributions
ww+2.192
dd+1.797
www+1.714
lish+1.703
P+1.643
Neg logit contributions
 search-1.331
 bed-1.323
 call-1.279
 lap-1.273
 eye-1.245
Token proud
Token ID6613
Feature activation-6.63e-01

Pos logit contributions
ww+4.127
dd+3.406
lish+3.246
www+3.223
ha+3.116
Neg logit contributions
 search-2.697
 bed-2.610
 call-2.602
 lap-2.549
 tears-2.521
Token They
Token ID1119
Feature activation+9.81e-01

Pos logit contributions
 search+1.175
 bank+1.168
 bed+1.144
 backs+1.129
 doorstep+1.117
Neg logit contributions
ww-3.473
dd-2.895
lish-2.872
www-2.813
P-2.771
Token cut
Token ID2005
Feature activation+1.12e+00

Pos logit contributions
 lap+1.204
 bed+1.202
 search+1.139
 bank+1.118
 tears+1.113
Neg logit contributions
ww-2.813
dd-2.376
www-2.264
P-2.249
ha-2.224
Token paper
Token ID3348
Feature activation+1.59e+00

Pos logit contributions
 search+1.708
 lap+1.654
 bed+1.647
 doorstep+1.605
 call+1.605
Neg logit contributions
ww-2.931
dd-2.387
lish-2.326
iest-2.263
www-2.258
Token,
Token ID11
Feature activation+7.32e-01

Pos logit contributions
 search+2.268
 lap+2.204
 bed+2.166
 driveway+2.122
 call+2.115
Neg logit contributions
ww-4.209
dd-3.411
lish-3.339
iest-3.274
a-3.261
Token fold
Token ID5591
Feature activation+1.26e+00

Pos logit contributions
 search+0.792
 bed+0.763
 lap+0.750
 bank+0.746
 doorstep+0.719
Neg logit contributions
ww-2.221
dd-1.870
lish-1.800
www-1.792
P-1.772
Token paper
Token ID3348
Feature activation+1.45e+00

Pos logit contributions
 search+1.623
 lap+1.569
 doorstep+1.544
 bed+1.542
 bank+1.541
Neg logit contributions
ww-3.527
dd-2.898
lish-2.822
www-2.779
iest-2.769
Token,
Token ID11
Feature activation+8.04e-01

Pos logit contributions
 search+2.121
 lap+2.026
 bed+2.018
 call+1.984
 bank+1.983
Neg logit contributions
ww-3.817
dd-3.095
lish-3.024
iest-2.948
www-2.937
Token and
Token ID290
Feature activation+1.10e+00

Pos logit contributions
 search+0.872
 bed+0.836
 bank+0.821
 lap+0.819
 doorstep+0.790
Neg logit contributions
ww-2.434
dd-2.053
lish-1.978
www-1.955
P-1.946
Token draw
Token ID3197
Feature activation+5.29e-01

Pos logit contributions
 search+1.140
 lap+1.121
 bank+1.104
 bed+1.102
 doorstep+1.063
Neg logit contributions
ww-3.337
dd-2.807
lish-2.707
www-2.683
P-2.632
Token on
Token ID319
Feature activation+1.28e+00

Pos logit contributions
 search+0.629
 bed+0.602
 lap+0.596
 bank+0.590
 call+0.584
Neg logit contributions
ww-1.543
dd-1.289
lish-1.253
www-1.232
iest-1.211
Token paper
Token ID3348
Feature activation+1.46e+00

Pos logit contributions
 search+1.623
 lap+1.609
 call+1.567
 bed+1.547
 bank+1.537
Neg logit contributions
ww-3.595
dd-2.971
lish-2.874
www-2.845
ha-2.829
Token?"
Token ID1701
Feature activation+2.58e+00

Pos logit contributions
ww+1.277
dd+0.995
www+0.929
lish+0.922
P+0.874
Neg logit contributions
 search-1.276
 bed-1.261
 call-1.227
 lap-1.225
 bank-1.211
Token
Token ID220
Feature activation+8.61e-01

Pos logit contributions
 doorstep+3.322
 bed+3.301
 search+3.241
 backs+3.241
 bank+3.179
Neg logit contributions
ww-7.350
dd-5.917
P-5.914
www-5.879
lish-5.802
Token
Token ID198
Feature activation+6.19e-01

Pos logit contributions
 search+1.103
 bed+1.082
 bank+1.046
 lap+1.041
 call+1.035
Neg logit contributions
ww-2.430
dd-2.035
www-1.937
lish-1.923
P-1.891
Token
Token ID198
Feature activation-1.11e+00

Pos logit contributions
 search+0.768
 bed+0.763
 bank+0.740
 call+0.730
 lap+0.726
Neg logit contributions
ww-1.789
dd-1.496
www-1.433
P-1.417
lish-1.405
TokenSam
Token ID16305
Feature activation+6.48e-01

Pos logit contributions
ww+2.076
dd+1.593
lish+1.459
www+1.452
iest+1.409
Neg logit contributions
 search-2.523
 bed-2.461
 lap-2.444
 call-2.441
 bank-2.417
Token brave
Token ID14802
Feature activation+3.33e-01

Pos logit contributions
 bed+0.954
 search+0.935
 lap+0.908
 call+0.889
 bank+0.882
Neg logit contributions
ww-1.721
dd-1.422
www-1.362
lish-1.346
ha-1.324
Tokenly
Token ID306
Feature activation-5.19e-01

Pos logit contributions
 search+0.756
 bed+0.756
 lap+0.741
 bank+0.735
 call+0.727
Neg logit contributions
ww-0.624
dd-0.470
lish-0.439
www-0.434
ha-0.411
Token nodded
Token ID14464
Feature activation-1.81e-02

Pos logit contributions
ww+1.500
dd+1.253
www+1.203
lish+1.195
P+1.159
Neg logit contributions
 search-0.650
 bed-0.637
 call-0.618
 lap-0.610
 bank-0.608
Token his
Token ID465
Feature activation+1.75e+00

Pos logit contributions
ww+0.055
dd+0.047
lish+0.045
www+0.045
P+0.043
Neg logit contributions
 search-0.020
 bed-0.019
 lap-0.018
 call-0.018
 bank-0.018
Token head
Token ID1182
Feature activation+3.81e-01

Pos logit contributions
 search+1.010
 bed+0.914
 bank+0.908
 call+0.855
 lap+0.738
Neg logit contributions
ww-6.251
dd-5.270
lish-5.235
www-5.220
P-4.961
Token and
Token ID290
Feature activation-1.57e-01

Pos logit contributions
 search+0.437
 bed+0.430
 call+0.408
 bank+0.406
 lap+0.405
Neg logit contributions
ww-1.118
dd-0.953
lish-0.913
www-0.902
bly-0.882
Token's
Token ID338
Feature activation+7.41e-01

Pos logit contributions
 bed+1.080
 search+1.070
 lap+1.060
 call+1.009
 bank+1.006
Neg logit contributions
ww-1.736
dd-1.452
lish-1.354
www-1.353
ha-1.336
Token dad
Token ID9955
Feature activation+6.98e-02

Pos logit contributions
 search+0.957
 bed+0.919
 lap+0.904
 bank+0.891
 call+0.889
Neg logit contributions
ww-2.088
dd-1.739
www-1.666
lish-1.646
P-1.616
Token stopped
Token ID5025
Feature activation+1.02e+00

Pos logit contributions
 search+0.102
 bed+0.101
 lap+0.099
 bank+0.096
 call+0.096
Neg logit contributions
ww-0.185
dd-0.154
www-0.146
lish-0.145
ha-0.141
Token in
Token ID287
Feature activation+1.71e+00

Pos logit contributions
 lap+1.325
 bank+1.303
 search+1.295
 bed+1.283
 eye+1.247
Neg logit contributions
ww-2.801
dd-2.400
lish-2.276
www-2.219
P-2.200
Token his
Token ID465
Feature activation+2.33e+00

Pos logit contributions
 bank+2.104
 lap+1.987
 eye+1.947
 backs+1.931
 search+1.917
Neg logit contributions
ww-4.813
dd-4.050
www-3.893
lish-3.836
P-3.794
Token tracks
Token ID8339
Feature activation+1.51e+00

Pos logit contributions
 search+2.700
 call+2.566
anks+2.527
 bed+2.513
 backs+2.502
Neg logit contributions
ww-6.799
www-5.359
P-5.237
dd-5.216
lish-5.054
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
 search+1.857
 bed+1.725
 backs+1.702
 lap+1.675
 bank+1.663
Neg logit contributions
ww-4.271
dd-3.602
lish-3.484
www-3.437
iest-3.390
Token He
Token ID679
Feature activation-3.26e-01

Pos logit contributions
 search+1.357
 bed+1.301
 lap+1.256
 eye+1.253
 bank+1.237
Neg logit contributions
ww-2.934
dd-2.401
www-2.367
lish-2.293
ha-2.273
Token stared
Token ID18484
Feature activation-1.24e+00

Pos logit contributions
ww+0.920
dd+0.768
www+0.734
lish+0.731
P+0.707
Neg logit contributions
 search-0.430
 bed-0.423
 call-0.408
 lap-0.407
 bank-0.402
Token and
Token ID290
Feature activation-3.47e-01

Pos logit contributions
ww+3.308
dd+2.637
www+2.588
ha+2.457
P+2.447
Neg logit contributions
 bed-2.003
 search-1.979
 eye-1.943
 eyes-1.920
 call-1.917
Token point
Token ID966
Feature activation-1.28e+00

Pos logit contributions
ww+0.995
dd+0.832
www+0.797
lish+0.791
P+0.768
Neg logit contributions
 search-0.447
 bed-0.440
 call-0.423
 lap-0.422
 bank-0.415
Token your
Token ID534
Feature activation+2.05e+00

Pos logit contributions
ww+3.353
dd+2.794
www+2.680
lish+2.618
P+2.574
Neg logit contributions
 bed-1.665
 search-1.649
 call-1.568
 bank-1.566
 lap-1.561
Token weight
Token ID3463
Feature activation-5.20e-01

Pos logit contributions
 search+2.165
 bank+2.068
 bed+2.036
 lap+2.006
 call+1.989
Neg logit contributions
ww-6.140
lish-5.134
dd-4.991
a-4.928
w-4.823
Token.
Token ID13
Feature activation-3.99e-01

Pos logit contributions
ww+1.505
dd+1.265
www+1.211
lish+1.198
P+1.167
Neg logit contributions
 search-0.649
 bed-0.644
 lap-0.617
 call-0.611
 bank-0.610
Token You
Token ID921
Feature activation+7.38e-01

Pos logit contributions
ww+1.015
dd+0.832
www+0.788
lish+0.784
P+0.756
Neg logit contributions
 search-0.638
 bed-0.631
 lap-0.614
 call-0.613
 bank-0.603
Token have
Token ID423
Feature activation+6.61e-01

Pos logit contributions
 bed+1.311
 lap+1.302
 search+1.287
 bank+1.241
 call+1.225
Neg logit contributions
ww-1.737
dd-1.407
www-1.318
ha-1.283
lish-1.272
Token to
Token ID284
Feature activation+1.20e+00

Pos logit contributions
 search+0.855
 bed+0.851
 lap+0.841
 call+0.818
 bank+0.814
Neg logit contributions
ww-1.884
dd-1.562
www-1.499
lish-1.463
P-1.460
Token be
Token ID307
Feature activation-1.08e+00

Pos logit contributions
 lap+1.404
 bed+1.370
 search+1.355
 bank+1.321
 eyes+1.301
Neg logit contributions
ww-3.498
dd-2.910
www-2.833
lish-2.725
P-2.719
Token careful
Token ID8161
Feature activation-1.50e+00

Pos logit contributions
ww+2.045
dd+1.537
www+1.428
lish+1.382
ha+1.349
Neg logit contributions
 search-2.521
 bed-2.493
 call-2.463
 lap-2.440
 eye-2.420
Token."
Token ID526
Feature activation+1.91e+00

Pos logit contributions
ww+4.675
dd+3.921
www+3.791
P+3.756
lish+3.734
Neg logit contributions
 bed-1.714
 search-1.712
 lap-1.671
 call-1.633
 eye-1.586
Token Tim
Token ID5045
Feature activation+1.08e+00

Pos logit contributions
 search+2.888
 eye+2.862
 backs+2.833
 bank+2.808
 bed+2.802
Neg logit contributions
ww-5.134
dd-4.186
www-4.028
P-3.891
lish-3.824
Tokenmy
Token ID1820
Feature activation+4.55e-01

Pos logit contributions
 search+1.645
 bed+1.635
 bank+1.584
 call+1.549
 lap+1.544
Neg logit contributions
ww-2.880
dd-2.313
lish-2.260
www-2.258
P-2.180
Token to
Token ID284
Feature activation+1.75e+00

Pos logit contributions
 bed+0.739
 search+0.738
 lap+0.732
 bank+0.703
 eye+0.702
Neg logit contributions
ww-1.760
dd-1.481
www-1.415
lish-1.391
ha-1.375
Token hold
Token ID1745
Feature activation-1.01e+00

Pos logit contributions
 bed+1.931
 driveway+1.909
 lap+1.897
 eyes+1.862
 backs+1.825
Neg logit contributions
ww-5.229
dd-4.320
www-4.267
iest-4.118
P-4.095
Token the
Token ID262
Feature activation-3.51e-01

Pos logit contributions
ww+2.903
dd+2.401
www+2.325
lish+2.287
ha+2.227
Neg logit contributions
 search-1.357
 bed-1.336
 lap-1.285
 call-1.271
 eye-1.255
Token bird
Token ID6512
Feature activation-2.99e-01

Pos logit contributions
ww+0.916
dd+0.754
www+0.717
lish+0.711
P+0.687
Neg logit contributions
 search-0.539
 bed-0.533
 lap-0.515
 call-0.515
 bank-0.510
Token,
Token ID11
Feature activation+8.77e-01

Pos logit contributions
ww+0.818
dd+0.680
www+0.649
lish+0.644
P+0.624
Neg logit contributions
 search-0.417
 bed-0.412
 lap-0.398
 call-0.396
 bank-0.392
Token but
Token ID475
Feature activation-1.78e+00

Pos logit contributions
 search+1.446
 bed+1.431
 bank+1.395
 lap+1.391
 call+1.376
Neg logit contributions
ww-2.128
dd-1.703
www-1.643
lish-1.596
ha-1.547
Token he
Token ID339
Feature activation-4.39e-01

Pos logit contributions
ww+5.039
dd+4.132
www+3.982
lish+3.925
P+3.870
Neg logit contributions
 search-2.591
 bed-2.553
 bank-2.474
 call-2.421
 tears-2.397
Token dropped
Token ID5710
Feature activation+3.53e-01

Pos logit contributions
ww+1.147
dd+0.938
www+0.895
lish+0.889
P+0.858
Neg logit contributions
 search-0.674
 bed-0.662
 call-0.644
 lap-0.641
 bank-0.638
Token his
Token ID465
Feature activation+1.37e+00

Pos logit contributions
 search+0.559
 bed+0.551
 lap+0.542
 call+0.538
 bank+0.530
Neg logit contributions
ww-0.894
dd-0.738
www-0.694
lish-0.693
P-0.667
Token gun
Token ID2485
Feature activation+4.56e-01

Pos logit contributions
 search+1.535
 bed+1.496
 call+1.434
 bank+1.432
 lap+1.430
Neg logit contributions
ww-4.092
dd-3.381
www-3.257
lish-3.241
iest-3.226
Token.
Token ID13
Feature activation+7.97e-01

Pos logit contributions
 search+0.612
 bed+0.605
 lap+0.583
 call+0.579
 bank+0.564
Neg logit contributions
ww-1.259
dd-1.056
lish-1.009
www-1.000
P-0.968
Token learn
Token ID2193
Feature activation-6.10e-01

Pos logit contributions
 bed+0.714
 search+0.686
 bank+0.686
 lap+0.672
 tears+0.651
Neg logit contributions
ww-2.278
dd-1.903
www-1.870
lish-1.818
P-1.812
Token how
Token ID703
Feature activation-1.85e+00

Pos logit contributions
ww+1.548
dd+1.281
www+1.205
lish+1.199
P+1.154
Neg logit contributions
 search-0.978
 bed-0.974
 call-0.943
 lap-0.937
 bank-0.922
Token to
Token ID284
Feature activation+1.44e+00

Pos logit contributions
ww+6.065
dd+5.360
lish+5.092
www+5.023
P+4.983
Neg logit contributions
 search-1.665
 bed-1.656
 call-1.598
 lap-1.506
 eyes-1.490
Token talk
Token ID1561
Feature activation-8.02e-01

Pos logit contributions
 bed+1.618
 lap+1.576
 bank+1.490
 eye+1.480
 search+1.466
Neg logit contributions
ww-4.270
dd-3.521
www-3.471
ha-3.372
lish-3.370
Token like
Token ID588
Feature activation-5.01e-01

Pos logit contributions
ww+2.117
dd+1.777
www+1.668
lish+1.652
P+1.611
Neg logit contributions
 search-1.205
 bed-1.196
 call-1.163
 lap-1.151
 bank-1.123
Token him
Token ID683
Feature activation-3.53e-01

Pos logit contributions
ww+1.253
dd+1.026
www+0.970
lish+0.965
P+0.928
Neg logit contributions
 search-0.818
 bed-0.813
 call-0.788
 lap-0.783
 bank-0.773
Token too
Token ID1165
Feature activation-9.84e-01

Pos logit contributions
ww+0.704
dd+0.542
www+0.504
lish+0.499
P+0.475
Neg logit contributions
 search-0.753
 bed-0.747
 lap-0.729
 call-0.729
 bank-0.722
Token?"
Token ID1701
Feature activation+2.41e+00

Pos logit contributions
ww+2.023
dd+1.620
lish+1.471
www+1.460
P+1.416
Neg logit contributions
 search-2.061
 bed-2.044
 call-2.006
 lap-1.981
 bank-1.948
Token Mom
Token ID11254
Feature activation+5.04e-01

Pos logit contributions
 bank+2.767
 backs+2.742
 search+2.661
 doorstep+2.635
 bed+2.632
Neg logit contributions
ww-7.044
www-5.741
P-5.714
dd-5.675
lish-5.604
Token laughed
Token ID13818
Feature activation-6.28e-01

Pos logit contributions
 bed+0.722
 search+0.716
 bank+0.697
 lap+0.687
 eye+0.667
Neg logit contributions
ww-1.355
dd-1.126
www-1.079
lish-1.077
ha-1.044
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
ww+2.044
dd+1.740
www+1.688
lish+1.652
ha+1.626
Neg logit contributions
 search-0.573
 bed-0.566
 lap-0.537
 call-0.533
 eye-0.514
Token he
Token ID339
Feature activation-1.00e-02

Pos logit contributions
ww+2.622
dd+2.077
www+1.999
lish+1.982
P+1.929
Neg logit contributions
 bed-1.944
 search-1.938
 call-1.827
 bank-1.815
 lap-1.812
Token was
Token ID373
Feature activation-1.03e+00

Pos logit contributions
ww+0.026
dd+0.021
www+0.020
lish+0.020
ha+0.019
Neg logit contributions
 search-0.016
 bed-0.016
 lap-0.015
 call-0.015
 bank-0.015
Token tired
Token ID10032
Feature activation+6.01e-02

Pos logit contributions
ww+2.849
dd+2.322
www+2.257
lish+2.209
ha+2.166
Neg logit contributions
 search-1.510
 bed-1.497
 call-1.423
 lap-1.407
 bank-1.399
Token he
Token ID339
Feature activation-2.78e-01

Pos logit contributions
 search+0.084
 bed+0.082
 lap+0.081
 call+0.079
 bank+0.079
Neg logit contributions
ww-0.164
dd-0.136
lish-0.130
www-0.130
iest-0.125
Token was
Token ID373
Feature activation-6.32e-01

Pos logit contributions
ww+0.703
dd+0.573
www+0.545
lish+0.541
P+0.522
Neg logit contributions
 search-0.446
 bed-0.441
 lap-0.428
 call-0.427
 bank-0.423
Token proud
Token ID6613
Feature activation-3.51e-01

Pos logit contributions
ww+1.729
dd+1.418
www+1.368
lish+1.354
P+1.311
Neg logit contributions
 search-0.912
 bed-0.905
 call-0.865
 lap-0.863
 bank-0.852
Token of
Token ID286
Feature activation+2.40e-01

Pos logit contributions
ww+0.889
dd+0.725
www+0.689
lish+0.683
P+0.660
Neg logit contributions
 search-0.567
 bed-0.562
 lap-0.544
 call-0.543
 bank-0.535
Token himself
Token ID2241
Feature activation-2.15e-01

Pos logit contributions
 search+0.334
 bed+0.326
 lap+0.322
 bank+0.320
 call+0.320
Neg logit contributions
ww-0.653
dd-0.547
www-0.518
lish-0.516
bly-0.499
Token for
Token ID329
Feature activation+4.41e-01

Pos logit contributions
ww+0.656
dd+0.556
www+0.534
lish+0.531
P+0.516
Neg logit contributions
 search-0.232
 bed-0.228
 lap-0.219
 call-0.217
 bank-0.214
Token trying
Token ID2111
Feature activation+7.60e-01

Pos logit contributions
 search+0.716
 lap+0.701
 bed+0.697
 bank+0.695
 eye+0.694
Neg logit contributions
ww-1.086
dd-0.898
www-0.838
lish-0.835
bly-0.809
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
 search+1.195
 call+1.185
 lap+1.169
 bank+1.165
 eye+1.164
Neg logit contributions
ww-1.892
dd-1.540
www-1.457
lish-1.446
P-1.403
Token
Token ID198
Feature activation-2.86e+00

Pos logit contributions
ww+4.736
dd+4.004
lish+3.915
www+3.845
iest+3.758
Neg logit contributions
 search-1.591
 bed-1.578
 call-1.511
 lap-1.493
 bank-1.368
Tokenâ
Token ID22940
Feature activation+1.02e-01

Pos logit contributions
ww+5.733
dd+4.367
lish+4.173
iest+4.016
www+3.974
Neg logit contributions
 search-6.610
 bed-6.529
 call-6.448
 lap-6.361
 driveway-6.137
Token€
Token ID26391
Feature activation-1.44e-01

Pos logit contributions
 search+0.191
 bed+0.190
 lap+0.183
 call+0.183
 bank+0.183
Neg logit contributions
ww-0.233
dd-0.185
www-0.175
lish-0.174
ha-0.165
Token<0xC5>
Token ID129
Feature activation+1.22e+00

Pos logit contributions
ww+0.375
dd+0.307
www+0.293
lish+0.290
ha+0.280
Neg logit contributions
 search-0.221
 bed-0.219
 lap-0.212
 call-0.212
 bank-0.210
Token<0x93>
Token ID241
Feature activation-1.71e+00

Pos logit contributions
 search+2.429
 bed+2.402
 lap+2.324
 eye+2.321
 call+2.290
Neg logit contributions
ww-2.578
dd-2.126
www-1.957
lish-1.907
bly-1.819
TokenThat
Token ID2504
Feature activation-7.03e-01

Pos logit contributions
ww+4.169
dd+3.324
lish+3.302
www+3.162
bly+3.089
Neg logit contributions
 search-2.975
 bed-2.886
 lap-2.843
 call-2.824
 bank-2.769
Token is
Token ID318
Feature activation-1.40e+00

Pos logit contributions
ww+1.893
dd+1.586
lish+1.506
www+1.494
P+1.448
Neg logit contributions
 search-1.024
 bed-1.010
 call-0.978
 lap-0.967
 bank-0.963
Token the
Token ID262
Feature activation+1.17e-01

Pos logit contributions
ww+3.883
dd+3.310
lish+3.098
www+3.096
P+3.019
Neg logit contributions
 bed-1.987
 search-1.965
 call-1.932
 eye-1.839
 bank-1.818
Token license
Token ID5964
Feature activation-1.32e+00

Pos logit contributions
 search+0.168
 bed+0.165
 lap+0.162
 bank+0.159
 call+0.159
Neg logit contributions
ww-0.316
dd-0.258
www-0.249
lish-0.245
P-0.239
Token,
Token ID11
Feature activation-1.85e+00

Pos logit contributions
ww+3.645
dd+3.097
lish+2.913
www+2.885
P+2.844
Neg logit contributions
 search-1.880
 bed-1.861
 call-1.823
 lap-1.761
 bank-1.742
Tokenâ
Token ID22940
Feature activation-1.09e+00

Pos logit contributions
ww+5.566
dd+4.762
lish+4.548
P+4.483
www+4.476
Neg logit contributions
 bed-2.339
 search-2.248
 call-2.232
 eye-2.106
 eyes-2.094
Token has
Token ID468
Feature activation+7.17e-02

Pos logit contributions
ww+0.357
dd+0.293
www+0.280
lish+0.277
ha+0.268
Neg logit contributions
 search-0.208
 bed-0.206
 lap-0.200
 call-0.198
 bank-0.196
Token many
Token ID867
Feature activation+7.40e-01

Pos logit contributions
 search+0.107
 bed+0.105
 lap+0.103
 call+0.102
 bank+0.101
Neg logit contributions
ww-0.189
dd-0.155
www-0.148
lish-0.146
P-0.142
Token branches
Token ID13737
Feature activation-7.39e-03

Pos logit contributions
 search+0.666
 bed+0.636
 lap+0.626
 call+0.619
 bank+0.591
Neg logit contributions
ww-2.381
dd-2.023
www-1.964
lish-1.941
iest-1.918
Token and
Token ID290
Feature activation-3.32e-01

Pos logit contributions
ww+0.021
dd+0.018
www+0.017
lish+0.017
iest+0.016
Neg logit contributions
 search-0.009
 bed-0.009
 lap-0.009
 call-0.009
 bank-0.009
Token leaves
Token ID5667
Feature activation+7.70e-02

Pos logit contributions
ww+1.030
dd+0.876
www+0.842
lish+0.837
P+0.814
Neg logit contributions
 search-0.344
 bed-0.338
 lap-0.321
 call-0.320
 bank-0.316
Token.
Token ID13
Feature activation-7.35e-01

Pos logit contributions
 search+0.092
 bed+0.090
 lap+0.087
 call+0.086
 bank+0.085
Neg logit contributions
ww-0.225
dd-0.190
www-0.181
lish-0.180
iest-0.175
Token They
Token ID1119
Feature activation+1.58e+00

Pos logit contributions
ww+2.255
dd+1.921
lish+1.838
www+1.835
iest+1.778
Neg logit contributions
 search-0.796
 bed-0.794
 lap-0.757
 call-0.753
 bank-0.720
Token want
Token ID765
Feature activation+1.77e+00

Pos logit contributions
 bed+2.050
 lap+2.046
 search+1.977
 doorstep+1.916
 eye+1.896
Neg logit contributions
ww-4.280
dd-3.582
www-3.424
ha-3.362
lish-3.335
Token to
Token ID284
Feature activation+2.55e+00

Pos logit contributions
 lap+2.549
 eye+2.494
 search+2.472
 doorstep+2.394
 bank+2.390
Neg logit contributions
ww-4.724
dd-3.825
www-3.661
ha-3.579
P-3.534
Token chop
Token ID30506
Feature activation+4.46e-01

Pos logit contributions
 lap+3.326
 eyes+3.259
 tears+3.255
 workshop+3.208
 eye+3.152
Neg logit contributions
ww-7.161
www-5.735
dd-5.517
ha-5.483
"-5.471
Token it
Token ID340
Feature activation+9.63e-01

Pos logit contributions
 search+0.671
 bed+0.659
 lap+0.650
 call+0.644
 eye+0.631
Neg logit contributions
ww-1.169
dd-0.953
www-0.909
lish-0.906
P-0.885
Token friend
Token ID1545
Feature activation+4.84e-01

Pos logit contributions
 search+1.412
 lap+1.361
 bed+1.357
 call+1.325
 bank+1.316
Neg logit contributions
ww-2.734
dd-2.247
www-2.142
lish-2.138
P-2.072
Token's
Token ID338
Feature activation+1.24e+00

Pos logit contributions
 search+0.671
 bed+0.658
 lap+0.647
 bank+0.633
 call+0.625
Neg logit contributions
ww-1.312
dd-1.087
www-1.037
lish-1.035
P-1.003
Token birthday
Token ID10955
Feature activation+3.41e-01

Pos logit contributions
 search+1.746
 lap+1.704
 bed+1.684
 call+1.671
 bank+1.662
Neg logit contributions
ww-3.319
dd-2.688
www-2.591
lish-2.548
P-2.497
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
 search+0.410
 bed+0.397
 lap+0.388
 bank+0.382
 call+0.382
Neg logit contributions
ww-0.999
dd-0.831
www-0.805
lish-0.796
ha-0.776
Token
Token ID198
Feature activation+6.37e-02

Pos logit contributions
 search+1.487
 eye+1.458
 bank+1.413
 bed+1.395
 backs+1.390
Neg logit contributions
ww-3.597
dd-2.945
www-2.903
P-2.808
lish-2.807
Token
Token ID198
Feature activation-2.82e+00

Pos logit contributions
 search+0.073
 bed+0.072
 lap+0.069
 bank+0.069
 call+0.069
Neg logit contributions
ww-0.190
dd-0.161
www-0.154
lish-0.152
P-0.150
TokenThe
Token ID464
Feature activation-8.21e-01

Pos logit contributions
ww+5.782
dd+4.540
lish+4.416
iest+4.412
www+4.037
Neg logit contributions
 search-6.359
 call-6.168
 bed-6.140
 lap-6.050
 find-5.922
Token helpful
Token ID7613
Feature activation-6.78e-01

Pos logit contributions
ww+2.166
dd+1.798
www+1.704
lish+1.686
ha+1.633
Neg logit contributions
 search-1.268
 bed-1.264
 call-1.216
 lap-1.198
 bank-1.193
Token lady
Token ID10846
Feature activation+2.59e-01

Pos logit contributions
ww+1.775
dd+1.460
www+1.386
lish+1.375
P+1.336
Neg logit contributions
 search-1.044
 bed-1.031
 call-1.003
 bank-0.995
 lap-0.993
Token took
Token ID1718
Feature activation+5.37e-01

Pos logit contributions
 bed+0.362
 search+0.359
 lap+0.350
 call+0.338
 bank+0.337
Neg logit contributions
ww-0.706
dd-0.588
www-0.561
lish-0.556
ha-0.543
Token the
Token ID262
Feature activation-1.82e-01

Pos logit contributions
 search+0.786
 lap+0.774
 bed+0.774
 bank+0.754
 call+0.751
Neg logit contributions
ww-1.434
dd-1.188
www-1.126
lish-1.116
ha-1.092
Token't
Token ID470
Feature activation+2.15e-01

Pos logit contributions
 bed+1.316
 lap+1.311
 search+1.300
 eye+1.275
 tears+1.264
Neg logit contributions
ww-2.024
dd-1.660
www-1.575
bly-1.535
lish-1.532
Token permit
Token ID8749
Feature activation-8.99e-01

Pos logit contributions
 bed+0.203
 search+0.202
 lap+0.198
 bank+0.187
 call+0.187
Neg logit contributions
ww-0.679
dd-0.581
www-0.559
lish-0.553
P-0.541
Token that
Token ID326
Feature activation-6.82e-01

Pos logit contributions
ww+1.571
dd+1.155
www+1.055
lish+1.054
iest+1.003
Neg logit contributions
 search-2.176
 bed-2.172
 call-2.113
 lap-2.091
 bank-2.089
Token."
Token ID526
Feature activation+9.68e-01

Pos logit contributions
ww+1.829
dd+1.530
www+1.444
lish+1.443
P+1.401
Neg logit contributions
 search-1.006
 bed-1.005
 call-0.960
 lap-0.957
 bank-0.946
Token
Token ID198
Feature activation-4.43e-01

Pos logit contributions
 search+1.144
 eye+1.090
 bed+1.085
 call+1.068
 bank+1.064
Neg logit contributions
ww-2.837
dd-2.370
www-2.296
lish-2.274
P-2.233
Token
Token ID198
Feature activation-2.76e+00

Pos logit contributions
ww+1.363
dd+1.154
www+1.110
lish+1.106
iest+1.071
Neg logit contributions
 search-0.474
 bed-0.468
 lap-0.447
 call-0.443
 bank-0.434
TokenAb
Token ID4826
Feature activation-3.79e-01

Pos logit contributions
ww+5.534
iest+4.339
dd+4.218
lish+4.064
www+3.879
Neg logit contributions
 search-6.188
 bed-6.041
 lap-5.991
 call-5.924
 bank-5.902
Tokenby
Token ID1525
Feature activation-6.58e-01

Pos logit contributions
ww+0.931
dd+0.753
www+0.715
lish+0.708
P+0.680
Neg logit contributions
 search-0.646
 bed-0.637
 lap-0.620
 call-0.619
 bank-0.613
Token was
Token ID373
Feature activation+8.01e-02

Pos logit contributions
ww+1.789
dd+1.472
www+1.412
lish+1.402
iest+1.360
Neg logit contributions
 search-0.939
 bed-0.918
 call-0.893
 lap-0.890
 bank-0.888
Token disappointed
Token ID11679
Feature activation+1.31e-01

Pos logit contributions
 search+0.131
 bed+0.130
 lap+0.126
 bank+0.125
 call+0.125
Neg logit contributions
ww-0.198
dd-0.162
www-0.153
lish-0.153
P-0.147
Token and
Token ID290
Feature activation-1.62e-01

Pos logit contributions
 search+0.183
 bed+0.180
 lap+0.176
 bank+0.174
 call+0.173
Neg logit contributions
ww-0.356
dd-0.296
lish-0.282
www-0.282
iest-0.270
Tokenmy
Token ID1820
Feature activation+3.27e-01

Pos logit contributions
 search+0.659
 bed+0.659
 lap+0.639
 bank+0.635
 call+0.613
Neg logit contributions
ww-1.251
dd-1.039
www-0.992
lish-0.986
ha-0.969
Token smiled
Token ID13541
Feature activation+2.20e-01

Pos logit contributions
 search+0.461
 bed+0.460
 lap+0.447
 bank+0.437
 call+0.432
Neg logit contributions
ww-0.882
dd-0.739
www-0.700
lish-0.695
ha-0.682
Token and
Token ID290
Feature activation+2.35e-01

Pos logit contributions
 search+0.244
 bed+0.239
 bank+0.231
 lap+0.231
 call+0.228
Neg logit contributions
ww-0.660
dd-0.561
lish-0.542
www-0.533
bly-0.520
Token said
Token ID531
Feature activation-2.67e+00

Pos logit contributions
 search+0.311
 bed+0.305
 lap+0.297
 bank+0.294
 call+0.289
Neg logit contributions
ww-0.664
dd-0.556
www-0.532
lish-0.523
ha-0.513
Token,
Token ID11
Feature activation-6.85e-01

Pos logit contributions
ww+8.412
dd+7.034
www+6.870
P+6.782
w+6.693
Neg logit contributions
 bed-3.174
 eyes-3.112
 search-3.023
 call-3.003
 tears-2.918
Token "
Token ID366
Feature activation-2.93e+00

Pos logit contributions
ww+1.873
dd+1.551
www+1.477
lish+1.467
P+1.428
Neg logit contributions
 search-0.972
 bed-0.966
 call-0.940
 lap-0.936
 bank-0.909
TokenThat
Token ID2504
Feature activation-9.30e-01

Pos logit contributions
ww+6.364
dd+5.249
lish+4.880
www+4.633
iest+4.622
Neg logit contributions
 search-5.904
 lap-5.701
 call-5.592
 bed-5.373
 backs-5.356
Token's
Token ID338
Feature activation-1.91e+00

Pos logit contributions
ww+2.607
dd+2.217
lish+2.101
www+2.089
P+2.042
Neg logit contributions
 search-1.273
 bed-1.265
 call-1.219
 eye-1.190
 lap-1.186
Token wonderful
Token ID7932
Feature activation-1.77e+00

Pos logit contributions
ww+5.255
dd+4.512
www+4.182
lish+4.167
iest+4.094
Neg logit contributions
 bed-2.879
 search-2.877
 call-2.728
 eye-2.620
 sleep-2.598
Token,
Token ID11
Feature activation-2.24e+00

Pos logit contributions
ww+5.114
dd+4.453
P+4.154
lish+4.146
www+4.094
Neg logit contributions
 bed-2.412
 search-2.337
 call-2.222
 eyes-2.187
 lap-2.147
Token Lily
Token ID20037
Feature activation-1.19e+00

Pos logit contributions
ww+6.288
dd+5.518
P+5.196
lish+5.076
w+5.070
Neg logit contributions
 bed-3.394
 search-3.197
 call-3.167
 eyes-3.089
 tears-3.056
Token animals
Token ID4695
Feature activation-1.24e-02

Pos logit contributions
 lap+1.867
 search+1.853
 bank+1.806
 eye+1.801
 bed+1.778
Neg logit contributions
ww-2.980
dd-2.386
www-2.289
lish-2.241
ha-2.218
Token and
Token ID290
Feature activation-4.44e-02

Pos logit contributions
ww+0.035
dd+0.029
www+0.028
lish+0.027
ha+0.027
Neg logit contributions
 search-0.016
 bed-0.016
 lap-0.016
 call-0.015
 bank-0.015
Token people
Token ID661
Feature activation+3.03e-01

Pos logit contributions
ww+0.122
dd+0.101
www+0.097
lish+0.096
ha+0.093
Neg logit contributions
 search-0.060
 bed-0.059
 lap-0.058
 call-0.057
 bank-0.057
Token.
Token ID13
Feature activation-2.88e-01

Pos logit contributions
 search+0.394
 bed+0.390
 lap+0.379
 bank+0.368
 call+0.367
Neg logit contributions
ww-0.848
dd-0.710
www-0.675
lish-0.672
iest-0.654
Token
Token ID198
Feature activation-4.35e-01

Pos logit contributions
ww+0.892
dd+0.758
www+0.728
lish+0.724
P+0.704
Neg logit contributions
 search-0.303
 bed-0.298
 lap-0.284
 call-0.283
 bank-0.278
Token
Token ID198
Feature activation-3.72e+00

Pos logit contributions
ww+1.336
dd+1.131
www+1.088
lish+1.085
P+1.048
Neg logit contributions
 search-0.469
 bed-0.461
 lap-0.439
 call-0.437
 bank-0.428
Token"
Token ID1
Feature activation-1.36e+00

Pos logit contributions
ww+8.405
iest+6.826
lish+6.535
dd+6.459
bly+6.190
Neg logit contributions
 search-8.219
 find-7.688
 call-7.647
 bed-7.634
 yard-7.565
TokenLook
Token ID8567
Feature activation-1.62e+00

Pos logit contributions
ww+2.773
dd+2.183
www+2.005
lish+1.974
iest+1.893
Neg logit contributions
 search-2.917
 bed-2.835
 call-2.833
 bank-2.788
 lap-2.787
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
ww+4.527
dd+3.797
www+3.612
lish+3.595
bly+3.525
Neg logit contributions
 bed-2.326
 search-2.304
 call-2.212
 eyes-2.169
 eye-2.162
Token Tom
Token ID4186
Feature activation-7.67e-01

Pos logit contributions
ww+4.351
dd+3.784
lish+3.596
www+3.508
bly+3.426
Neg logit contributions
 bed-2.199
 search-2.123
 call-2.067
 eyes-1.952
 find-1.944
Token,
Token ID11
Feature activation-1.84e+00

Pos logit contributions
ww+2.026
dd+1.689
www+1.593
lish+1.592
bly+1.546
Neg logit contributions
 search-1.163
 bed-1.143
 call-1.118
 eye-1.095
 lap-1.092
Token and
Token ID290
Feature activation+5.45e-01

Pos logit contributions
 bed+1.260
 search+1.250
 lap+1.190
 bank+1.151
 call+1.144
Neg logit contributions
ww-2.725
dd-2.307
lish-2.199
www-2.182
ha-2.162
Token Sam
Token ID3409
Feature activation+2.01e+00

Pos logit contributions
 bed+0.665
 search+0.661
 lap+0.649
 bank+0.638
 eye+0.623
Neg logit contributions
ww-1.572
dd-1.312
www-1.266
lish-1.261
ha-1.234
Token cheered
Token ID34767
Feature activation-8.51e-01

Pos logit contributions
 bed+3.499
 lap+3.360
 search+3.299
 driveway+3.147
anks+3.137
Neg logit contributions
ww-4.689
dd-4.031
lish-3.734
ha-3.730
www-3.607
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
ww+2.845
dd+2.420
www+2.363
lish+2.290
P+2.276
Neg logit contributions
 search-0.749
 bed-0.746
 call-0.709
 lap-0.680
 eye-0.672
Token They
Token ID1119
Feature activation+1.26e+00

Pos logit contributions
 search+1.982
 bed+1.981
 lap+1.859
 call+1.812
 bank+1.803
Neg logit contributions
ww-4.869
dd-3.980
www-3.920
lish-3.899
ha-3.800
Token said
Token ID531
Feature activation-2.55e+00

Pos logit contributions
 bed+2.138
 lap+2.118
 search+2.105
 eye+1.972
 driveway+1.971
Neg logit contributions
ww-3.080
dd-2.554
www-2.382
ha-2.371
P-2.345
Token,
Token ID11
Feature activation-1.62e-01

Pos logit contributions
ww+7.909
dd+6.696
www+6.557
lish+6.365
P+6.363
Neg logit contributions
 eyes-3.147
 bed-3.121
 search-3.026
 call-2.915
 tears-2.903
Token "
Token ID366
Feature activation-2.36e+00

Pos logit contributions
ww+0.426
dd+0.350
www+0.334
lish+0.331
P+0.320
Neg logit contributions
 search-0.245
 bed-0.241
 lap-0.234
 call-0.233
 bank-0.231
TokenWe
Token ID1135
Feature activation-2.83e-01

Pos logit contributions
ww+4.540
dd+3.362
www+3.211
lish+3.193
iest+3.186
Neg logit contributions
 search-5.406
 bank-5.254
 call-5.167
 backs-5.112
 tears-5.053
Token did
Token ID750
Feature activation-1.48e+00

Pos logit contributions
ww+0.797
dd+0.665
www+0.636
lish+0.632
P+0.612
Neg logit contributions
 search-0.378
 bed-0.372
 call-0.359
 lap-0.359
 bank-0.354
Token it
Token ID340
Feature activation-1.57e+00

Pos logit contributions
ww+3.889
dd+3.186
lish+3.046
www+3.039
iest+2.992
Neg logit contributions
 search-2.400
 bed-2.379
 call-2.289
 find-2.215
 bank-2.206