TOP ACTIVATIONS
MAX = 4.204

carp ets . The palace belonged to a King
, fluffy p anda . The p anda was very amazing
led in amaz ement . The palace garden was filled with
went on a hike . The sky was cloudy and it
smiled and said , " The brilliant palace is lit up
One day , the hipp o saw something that
was a big surprise . The other animals were very excited
happy . So the King and Queen decided to
One day , the p anda went for a
kitchen . The poor dark hipp o never
some food . The hipp o went to the
hall ways and up the grand stair ways . Everywhere she
imagined a big palace . The palace was very pretty .
liked the sweet smell . The hipp o skipped on and
and cheered . They thanked the hipp o for his show
. The elderly rh in oc eros was so happy to
strong . The hipp o found some vegetables
o was so attractive . The hipp o was excited to
the gorilla . The gorilla was big and strong
was a p anda . The p anda was very gloomy

MIN ACTIVATIONS
MIN = -4.187

asked if he could borrow a strip . But , his
branch . They look for another tree . They see one
" How do you snap a knot ?"
he promised to never attach a net to someone again .
dad if he could take a seat under the shady tree
green rope ! He took the rope and ran to the
." She threw the rope into the hole and
tied the other end of the rope to her hand ,
L ily took the rope and started to swing
They tried to unt ie the knot , but it was
his bag and took out a jar . He said it
. They decide to find another game to play . They
't be able to get a good shot .
he replied . He took a long piece of string and
fly . They look at each other . They smile .
was excited ! He grabbed the rope and began to swing
L ulu took a deep breath and decided to
it again . He tried another , and another . He
showed them how to snap the knot by pulling it really
day she got a little too close to a pond and

INTERVAL 2.106 to 4.204
CONTAINS 0.241%

, they looked around in amaz ement . It was a
the turkey to hurt their snow man . They pick up
able to help out Mr . Jenkins . She knew that
from the tree and onto the ground .
remembered how they made the snow man . How they laughed

INTERVAL 0.008 to 2.106
CONTAINS 48.571%

carefully . When the dizz y feeling had
," said grandma . Charles smiled .
that she won the prize . He told her , "
He sat up stra ighter and waited for the cat to
be mad !" Ben was scared too . He

INTERVAL -2.089 to 0.008
CONTAINS 50.983%

. He thanked the pine tree for helping him reach his
always said ' please ' and ' thank you '. One
, bugs and flowers . He even made a new friend
the park they found the biggest ear Tom had ever seen
. " Why did you do that ?" asked Lily .

INTERVAL -4.187 to -2.089
CONTAINS 0.205%

the coal with the gloves and the stick . They feel
back . " Why not , Ben ?" Lily
across a net . In this way , the snail would
. He wanted to take a nap in the grass .
So she ran over to the pitch , shouting " hi
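
NOTE ON INTERVAL EDGES
The four INTERVAL blocks above appear to split the range from MIN (-4.187) to MAX (4.204) into equal-width bins. This is inferred from the printed numbers only; it is not a statement of how the dashboard computes its bins. A minimal sketch that reproduces the edges under that assumption (the helper name `interval_edges` is hypothetical):

```python
def interval_edges(lo: float, hi: float, n_bins: int = 4) -> list[float]:
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as printed in the MIN ACTIVATIONS / TOP ACTIVATIONS headers.
edges = interval_edges(-4.187, 4.204)

# Compare against the printed edges, which are rounded to three decimals.
printed = [-4.187, -2.089, 0.008, 2.106, 4.204]
matches = all(abs(e - p) < 1e-3 for e, p in zip(edges, printed))
print(edges, matches)
```

The comparison uses a tolerance of 1e-3 because the dashboard shows only three decimals, so the exact rounding of each edge cannot be checked from this dump.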
Token carp
Token ID39465
Feature activation+6.97e-01

Pos logit contributions
ator+0.648
man+0.502
umm+0.467
icles+0.443
ators+0.437
Neg logit contributions
bs-0.526
 slips-0.387
 net-0.385
 knot-0.375
ip-0.371
Tokenets
Token ID1039
Feature activation+6.55e-01

Pos logit contributions
ator+1.765
man+1.407
umm+1.327
icles+1.264
ators+1.251
Neg logit contributions
bs-1.093
 net-0.743
 slips-0.742
 knot-0.716
ip-0.706
Token.
Token ID13
Feature activation+4.52e-02

Pos logit contributions
ator+1.637
man+1.301
umm+1.233
icles+1.169
ators+1.157
Neg logit contributions
bs-1.029
 net-0.711
 slips-0.708
 knot-0.686
ata-0.670
Token\n
Token ID198
Feature activation+4.70e-01

Pos logit contributions
ator+0.101
man+0.078
umm+0.073
icles+0.070
ators+0.068
Neg logit contributions
bs-0.083
 net-0.062
 slips-0.062
 knot-0.060
 loses-0.059
Token\n
Token ID198
Feature activation+4.97e-01

Pos logit contributions
ator+1.175
man+0.937
umm+0.880
icles+0.839
ators+0.830
Neg logit contributions
bs-0.736
 net-0.509
 slips-0.508
 knot-0.492
ip-0.481
TokenThe
Token ID464
Feature activation+4.20e+00

Pos logit contributions
ator+1.064
man+0.809
umm+0.751
icles+0.706
ators+0.697
Neg logit contributions
bs-0.973
 net-0.730
 slips-0.729
 knot-0.711
ip-0.701
Token palace
Token ID20562
Feature activation+9.70e-01

Pos logit contributions
ator+10.304
icles+8.250
umm+8.192
man+7.971
esh+7.613
Neg logit contributions
bs-7.770
 net-6.045
 tie-5.926
 knot-5.784
 loop-5.777
Token belonged
Token ID19611
Feature activation+9.62e-01

Pos logit contributions
ator+2.135
man+1.613
umm+1.560
icles+1.463
ators+1.418
Neg logit contributions
bs-1.825
 slips-1.351
 net-1.348
 knot-1.307
 loop-1.298
Token to
Token ID284
Feature activation+3.75e-01

Pos logit contributions
ator+2.300
man+1.792
umm+1.721
icles+1.622
ators+1.590
Neg logit contributions
bs-1.626
 net-1.166
 slips-1.157
ip-1.109
 knot-1.105
Token a
Token ID257
Feature activation+4.37e-01

Pos logit contributions
ator+1.074
man+0.881
umm+0.835
icles+0.804
ators+0.797
Neg logit contributions
bs-0.459
 slips-0.275
 net-0.272
 knot-0.262
ip-0.256
Token King
Token ID2677
Feature activation-6.07e-01

Pos logit contributions
ator+1.046
man+0.825
umm+0.773
icles+0.730
ators+0.723
Neg logit contributions
bs-0.734
 slips-0.521
 net-0.519
 knot-0.504
ip-0.496
Token,
Token ID11
Feature activation-8.02e-01

Pos logit contributions
ator+0.059
man+0.045
umm+0.042
icles+0.039
ators+0.038
Neg logit contributions
bs-0.054
 slips-0.040
ata-0.039
 net-0.039
 loses-0.039
Token fluffy
Token ID39145
Feature activation+4.21e-01

Pos logit contributions
bs+1.456
 slips+1.036
 loses+1.035
ata+1.019
ip+1.000
Neg logit contributions
ator-1.804
man-1.450
umm-1.322
 birth-1.235
omm-1.202
Token p
Token ID279
Feature activation+4.19e-01

Pos logit contributions
ator+0.982
man+0.770
umm+0.719
icles+0.679
ators+0.670
Neg logit contributions
bs-0.736
 slips-0.530
 net-0.527
 knot-0.512
 loses-0.507
Tokenanda
Token ID5282
Feature activation+1.75e-01

Pos logit contributions
ator+0.803
man+0.588
umm+0.540
icles+0.501
ators+0.493
Neg logit contributions
bs-0.903
 net-0.701
 slips-0.699
 knot-0.687
 loses-0.676
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
ator+0.438
man+0.353
umm+0.323
icles+0.311
ators+0.308
Neg logit contributions
bs-0.275
 slips-0.192
 net-0.191
 knot-0.187
 loses-0.182
Token The
Token ID383
Feature activation+4.03e+00

Pos logit contributions
ator+3.501
man+2.629
umm+2.449
ators+2.386
 birth+2.261
Neg logit contributions
bs-2.772
 net-2.040
 slips-2.029
 knot-1.982
-1.969
Token p
Token ID279
Feature activation+2.44e+00

Pos logit contributions
ator+11.467
umm+9.140
man+8.970
ators+8.915
icles+8.399
Neg logit contributions
bs-5.327
 loop-4.906
 net-4.571
 dove-4.497
 lime-4.411
Tokenanda
Token ID5282
Feature activation+1.00e+00

Pos logit contributions
ator+5.102
man+3.882
umm+3.727
icles+3.430
ators+3.387
Neg logit contributions
bs-5.156
ip-3.952
 slips-3.908
op-3.848
 loop-3.723
Token was
Token ID373
Feature activation+1.39e+00

Pos logit contributions
ator+2.112
man+1.560
umm+1.524
icles+1.408
ators+1.370
Neg logit contributions
bs-1.999
 slips-1.503
 net-1.496
ip-1.468
 knot-1.455
Token very
Token ID845
Feature activation+1.85e+00

Pos logit contributions
ator+3.454
umm+2.647
man+2.645
ators+2.449
icles+2.448
Neg logit contributions
bs-2.294
 net-1.652
 slips-1.624
 tie-1.599
 loop-1.580
Token amazing
Token ID4998
Feature activation+2.23e-01

Pos logit contributions
ator+4.998
man+3.929
umm+3.866
ators+3.772
icles+3.713
Neg logit contributions
bs-2.883
 slips-1.859
 net-1.841
 tie-1.796
ip-1.796
Tokenled
Token ID992
Feature activation+8.26e-02

Pos logit contributions
ator+0.181
man+0.122
umm+0.107
icles+0.098
ators+0.096
Neg logit contributions
bs-0.294
 net-0.241
 slips-0.239
 knot-0.235
 loop-0.232
Token in
Token ID287
Feature activation+5.05e-01

Pos logit contributions
ator+0.185
man+0.144
umm+0.134
icles+0.127
ators+0.125
Neg logit contributions
bs-0.151
 slips-0.111
 net-0.109
 knot-0.107
ip-0.107
Token amaz
Token ID40642
Feature activation+2.61e+00

Pos logit contributions
ator+1.251
man+0.992
umm+0.935
icles+0.889
ators+0.879
Neg logit contributions
bs-0.814
 net-0.568
 slips-0.567
 knot-0.549
ip-0.539
Tokenement
Token ID972
Feature activation+4.96e-01

Pos logit contributions
ator+4.662
man+3.699
umm+3.046
 Amen+2.996
icles+2.891
Neg logit contributions
bs-5.857
 knot-4.630
 slips-4.530
ip-4.525
 net-4.465
Token.
Token ID13
Feature activation+9.93e-02

Pos logit contributions
ator+1.236
man+0.984
umm+0.927
icles+0.882
ators+0.872
Neg logit contributions
bs-0.783
 net-0.543
 slips-0.541
 knot-0.525
ip-0.513
Token The
Token ID383
Feature activation+4.03e+00

Pos logit contributions
ator+0.210
man+0.160
umm+0.150
icles+0.140
 birth+0.138
Neg logit contributions
bs-0.194
 slips-0.145
 net-0.144
ata-0.141
 knot-0.140
Token palace
Token ID20562
Feature activation+9.17e-01

Pos logit contributions
ator+8.836
umm+6.972
icles+6.815
man+6.760
ators+6.448
Neg logit contributions
bs-7.989
 loop-6.170
 tie-6.134
 net-6.132
 knot-5.941
Token garden
Token ID11376
Feature activation+3.79e-01

Pos logit contributions
ator+1.995
man+1.516
umm+1.438
icles+1.355
ators+1.322
Neg logit contributions
bs-1.740
 net-1.296
 slips-1.296
 knot-1.268
 loop-1.251
Token was
Token ID373
Feature activation+9.59e-01

Pos logit contributions
ator+0.782
man+0.590
umm+0.545
icles+0.511
ators+0.504
Neg logit contributions
bs-0.762
 net-0.577
 slips-0.576
 knot-0.563
ip-0.557
Token filled
Token ID5901
Feature activation+8.78e-01

Pos logit contributions
ator+2.302
man+1.802
umm+1.713
icles+1.611
ators+1.598
Neg logit contributions
bs-1.612
 net-1.158
 slips-1.146
 knot-1.122
 loop-1.107
Token with
Token ID351
Feature activation+1.02e+00

Pos logit contributions
ator+1.981
man+1.523
umm+1.443
icles+1.359
ators+1.335
Neg logit contributions
bs-1.586
 net-1.172
 slips-1.151
 knot-1.134
 tie-1.119
Token went
Token ID1816
Feature activation+5.27e-01

Pos logit contributions
bs+0.999
 knot+0.752
 net+0.728
 slips+0.723
 loop+0.705
Neg logit contributions
ator-1.179
man-0.946
icles-0.824
umm-0.820
 him-0.817
Token on
Token ID319
Feature activation-7.81e-01

Pos logit contributions
ator+1.169
man+0.900
umm+0.840
icles+0.792
ators+0.781
Neg logit contributions
bs-0.982
 net-0.726
 slips-0.724
 knot-0.707
ip-0.694
Token a
Token ID257
Feature activation+1.09e-01

Pos logit contributions
bs+1.260
 slips+0.864
 knot+0.792
op+0.785
ata+0.783
Neg logit contributions
ator-1.960
man-1.565
 birth-1.448
ators-1.417
umm-1.414
Token hike
Token ID18390
Feature activation+2.50e-01

Pos logit contributions
ator+0.231
man+0.178
umm+0.160
icles+0.152
 birth+0.152
Neg logit contributions
bs-0.211
 slips-0.158
 net-0.155
 knot-0.153
 loses-0.152
Token.
Token ID13
Feature activation+1.33e-01

Pos logit contributions
ator+0.542
man+0.418
umm+0.380
icles+0.362
ators+0.359
Neg logit contributions
bs-0.478
 slips-0.355
 net-0.354
 knot-0.348
ip-0.343
Token The
Token ID383
Feature activation+4.03e+00

Pos logit contributions
ator+0.300
man+0.235
umm+0.218
icles+0.209
ators+0.203
Neg logit contributions
bs-0.239
 net-0.175
 slips-0.175
 knot-0.172
 loses-0.168
Token sky
Token ID6766
Feature activation+1.12e+00

Pos logit contributions
ator+10.254
umm+7.381
ators+7.379
icles+7.300
opher+7.165
Neg logit contributions
bs-6.491
 net-6.169
 loop-6.101
 knot-6.055
 rope-6.030
Token was
Token ID373
Feature activation+1.33e+00

Pos logit contributions
ator+2.471
man+1.851
umm+1.785
icles+1.644
ators+1.637
Neg logit contributions
bs-2.128
 slips-1.590
 net-1.587
 loop-1.547
-1.544
Token cloudy
Token ID40026
Feature activation+7.42e-01

Pos logit contributions
ator+3.365
man+2.616
umm+2.529
ators+2.366
icles+2.349
Neg logit contributions
bs-2.060
 net-1.496
 slips-1.459
-1.432
 loop-1.424
Token and
Token ID290
Feature activation+1.02e+00

Pos logit contributions
ator+2.083
man+1.700
umm+1.618
icles+1.548
ators+1.539
Neg logit contributions
bs-0.937
 net-0.577
 slips-0.577
 knot-0.550
 loop-0.536
Token it
Token ID340
Feature activation+1.32e+00

Pos logit contributions
ator+2.562
man+2.017
umm+1.927
icles+1.794
ators+1.789
Neg logit contributions
bs-1.612
 net-1.130
 slips-1.122
 knot-1.089
 loop-1.085
Token smiled
Token ID13541
Feature activation+3.77e-01

Pos logit contributions
ator+0.872
man+0.704
umm+0.657
icles+0.630
ators+0.624
Neg logit contributions
bs-0.501
 net-0.340
 slips-0.338
 knot-0.327
 loses-0.320
Token and
Token ID290
Feature activation-9.96e-01

Pos logit contributions
ator+1.042
man+0.854
umm+0.806
icles+0.771
ators+0.764
Neg logit contributions
bs-0.498
 slips-0.314
 net-0.314
 knot-0.300
 loses-0.293
Token said
Token ID531
Feature activation-3.58e-01

Pos logit contributions
bs+1.608
 slips+1.214
 net+1.204
 loses+1.198
 knot+1.191
Neg logit contributions
ator-2.325
man-1.900
umm-1.665
 birth-1.647
icles-1.642
Token,
Token ID11
Feature activation+4.61e-01

Pos logit contributions
bs+0.548
 slips+0.392
 net+0.389
ata+0.379
 knot+0.378
Neg logit contributions
ator-0.889
man-0.720
umm-0.663
icles-0.646
 birth-0.630
Token "
Token ID366
Feature activation-1.62e-01

Pos logit contributions
ator+1.056
man+0.822
umm+0.768
icles+0.727
ators+0.717
Neg logit contributions
bs-0.821
 net-0.599
 slips-0.597
 knot-0.582
 loses-0.572
TokenThe
Token ID464
Feature activation+4.01e+00

Pos logit contributions
bs+0.320
 net+0.242
 slips+0.242
 knot+0.236
 loses+0.234
Neg logit contributions
ator-0.341
man-0.263
umm-0.246
icles-0.227
ators-0.223
Token brilliant
Token ID10457
Feature activation+2.62e+00

Pos logit contributions
ator+8.651
man+6.694
icles+6.452
rim+6.420
umm+6.383
Neg logit contributions
bs-7.901
 net-6.575
 knot-6.437
-6.377
 loop-6.364
Token palace
Token ID20562
Feature activation+4.38e-01

Pos logit contributions
ator+4.428
man+3.067
umm+3.019
icles+2.799
rim+2.672
Neg logit contributions
bs-6.193
 net-5.058
 knot-5.024
 tie-4.987
 loop-4.932
Token is
Token ID318
Feature activation+3.09e-01

Pos logit contributions
ator+0.978
man+0.756
umm+0.704
icles+0.664
ators+0.656
Neg logit contributions
bs-0.804
 net-0.592
 slips-0.589
 knot-0.576
ip-0.566
Token lit
Token ID6578
Feature activation+9.84e-01

Pos logit contributions
ator+0.695
man+0.538
umm+0.498
icles+0.473
ators+0.467
Neg logit contributions
bs-0.567
 slips-0.415
 net-0.415
 knot-0.405
ip-0.399
Token up
Token ID510
Feature activation+3.22e-01

Pos logit contributions
ator+2.028
man+1.521
umm+1.439
icles+1.326
ators+1.304
Neg logit contributions
bs-1.950
 net-1.481
 slips-1.472
 knot-1.460
 loop-1.440
Token\n
Token ID198
Feature activation+2.03e-01

Pos logit contributions
ator+0.285
man+0.224
umm+0.209
icles+0.200
ators+0.194
Neg logit contributions
bs-0.218
 net-0.157
 slips-0.157
 knot-0.154
 loses-0.151
Token\n
Token ID198
Feature activation+7.88e-01

Pos logit contributions
ator+0.497
man+0.396
umm+0.364
icles+0.350
ators+0.346
Neg logit contributions
bs-0.329
 net-0.232
 slips-0.232
 knot-0.226
 loses-0.220
TokenOne
Token ID3198
Feature activation-6.62e-01

Pos logit contributions
ator+1.745
man+1.328
umm+1.258
icles+1.175
ators+1.156
Neg logit contributions
bs-1.493
 net-1.112
 slips-1.107
 knot-1.089
 loop-1.074
Token day
Token ID1110
Feature activation-5.09e-01

Pos logit contributions
bs+1.624
 slips+1.339
 loses+1.333
 knot+1.315
 net+1.311
Neg logit contributions
ator-1.034
man-0.764
umm-0.620
icles-0.574
 birth-0.557
Token,
Token ID11
Feature activation+2.61e-01

Pos logit contributions
bs+0.807
 knot+0.570
 net+0.569
 slips+0.562
ata+0.560
Neg logit contributions
ator-1.223
man-0.994
umm-0.908
icles-0.897
 him-0.860
Token the
Token ID262
Feature activation+4.01e+00

Pos logit contributions
ator+0.631
man+0.503
umm+0.469
icles+0.450
ators+0.439
Neg logit contributions
bs-0.431
 slips-0.303
 net-0.301
 knot-0.295
ata-0.290
Token hipp
Token ID18568
Feature activation+9.58e-01

Pos logit contributions
ator+11.425
umm+8.916
ators+8.870
man+8.680
zie+8.470
Neg logit contributions
bs-4.983
 net-4.314
 dove-3.974
 lime-3.804
 tie-3.688
Tokeno
Token ID78
Feature activation-3.89e-01

Pos logit contributions
ator+1.964
man+1.444
umm+1.380
icles+1.275
ators+1.258
Neg logit contributions
bs-1.950
 slips-1.476
 net-1.472
ip-1.439
ata-1.427
Token saw
Token ID2497
Feature activation-3.48e-01

Pos logit contributions
bs+0.744
 knot+0.553
 net+0.544
 slips+0.540
ip+0.523
Neg logit contributions
ator-0.828
man-0.656
umm-0.569
ators-0.548
icles-0.546
Token something
Token ID1223
Feature activation-4.07e-01

Pos logit contributions
bs+0.521
 slips+0.340
 knot+0.340
 net+0.328
ata+0.325
Neg logit contributions
ator-0.911
man-0.749
icles-0.682
umm-0.675
 him-0.659
Token that
Token ID326
Feature activation+1.13e+00

Pos logit contributions
bs+0.785
 slips+0.585
 net+0.568
ata+0.567
ip+0.566
Neg logit contributions
ator-0.856
man-0.673
umm-0.599
icles-0.575
 birth-0.572
Token was
Token ID373
Feature activation+2.29e-01

Pos logit contributions
ator+1.699
man+1.276
umm+1.216
icles+1.133
ators+1.115
Neg logit contributions
bs-1.520
 net-1.138
 slips-1.137
 knot-1.112
-1.095
Token a
Token ID257
Feature activation+1.82e-01

Pos logit contributions
ator+0.601
man+0.486
umm+0.454
icles+0.438
ators+0.433
Neg logit contributions
bs-0.333
 slips-0.221
 net-0.219
 knot-0.213
 loses-0.208
Token big
Token ID1263
Feature activation+2.10e-01

Pos logit contributions
ator+0.404
man+0.316
umm+0.290
icles+0.270
 birth+0.268
Neg logit contributions
bs-0.340
 slips-0.249
 net-0.244
 loses-0.240
 knot-0.239
Token surprise
Token ID5975
Feature activation+1.19e+00

Pos logit contributions
ator+0.473
man+0.370
umm+0.344
icles+0.322
ators+0.317
Neg logit contributions
bs-0.384
 slips-0.278
 net-0.273
ata-0.269
ip-0.268
Token.
Token ID13
Feature activation+2.02e+00

Pos logit contributions
ator+2.868
man+2.215
umm+2.152
icles+2.020
ators+2.005
Neg logit contributions
bs-1.940
 net-1.403
 slips-1.369
 knot-1.348
 tie-1.342
Token The
Token ID383
Feature activation+4.00e+00

Pos logit contributions
ator+3.540
man+2.337
umm+2.189
ators+2.061
icles+1.904
Neg logit contributions
bs-4.864
 net-3.973
 slips-3.869
-3.865
 knot-3.829
Token other
Token ID584
Feature activation+3.18e+00

Pos logit contributions
ator+11.262
umm+8.846
man+8.772
ators+8.655
icles+8.469
Neg logit contributions
bs-4.901
 net-4.392
-4.151
 loop-4.088
 rope-3.970
Token animals
Token ID4695
Feature activation-1.09e-01

Pos logit contributions
ator+6.943
man+4.920
umm+4.918
omm+4.429
 Amen+4.403
Neg logit contributions
bs-6.139
 net-4.966
-4.918
 dove-4.735
 loop-4.709
Token were
Token ID547
Feature activation+1.12e+00

Pos logit contributions
bs+0.209
 slips+0.159
 net+0.157
 knot+0.157
ip+0.154
Neg logit contributions
ator-0.232
man-0.179
umm-0.158
icles-0.152
ators-0.152
Token very
Token ID845
Feature activation+4.88e-01

Pos logit contributions
ator+2.630
man+2.036
umm+1.998
icles+1.867
ators+1.818
Neg logit contributions
bs-1.949
 net-1.428
 slips-1.399
 tie-1.366
 knot-1.354
Token excited
Token ID6568
Feature activation+1.13e+00

Pos logit contributions
ator+1.200
man+0.950
umm+0.893
icles+0.848
ators+0.839
Neg logit contributions
bs-0.792
 net-0.556
 slips-0.554
 knot-0.539
ip-0.527
Token happy
Token ID3772
Feature activation+1.54e+00

Pos logit contributions
ator+0.821
man+0.648
umm+0.604
icles+0.574
ators+0.568
Neg logit contributions
bs-0.577
 slips-0.412
 net-0.410
 knot-0.398
ip-0.393
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
ator+4.115
man+3.295
umm+3.177
icles+3.027
omm+2.998
Neg logit contributions
bs-2.183
 net-1.455
 slips-1.438
 loses-1.415
 knot-1.373
Token\n
Token ID198
Feature activation+5.45e-01

Pos logit contributions
ator+3.387
man+2.616
umm+2.501
ators+2.331
icles+2.324
Neg logit contributions
bs-2.407
 net-1.724
-1.718
 slips-1.696
 tie-1.682
Token\n
Token ID198
Feature activation+7.05e-01

Pos logit contributions
ator+1.372
man+1.094
umm+1.033
icles+0.983
ators+0.972
Neg logit contributions
bs-0.845
 net-0.581
 slips-0.579
 knot-0.561
ip-0.548
TokenSo
Token ID2396
Feature activation+1.23e+00

Pos logit contributions
ator+1.630
man+1.263
umm+1.193
icles+1.125
ators+1.108
Neg logit contributions
bs-1.262
 net-0.919
 slips-0.913
 knot-0.896
 loop-0.880
Token the
Token ID262
Feature activation+4.00e+00

Pos logit contributions
ator+3.494
man+2.801
umm+2.716
icles+2.601
ators+2.584
Neg logit contributions
bs-1.606
 net-1.024
 slips-0.981
 knot-0.969
 tie-0.940
Token King
Token ID2677
Feature activation-3.40e-01

Pos logit contributions
ator+8.540
icles+6.925
ators+6.508
umm+6.507
opher+6.073
Neg logit contributions
bs-7.701
 tie-6.157
 net-6.039
-5.866
 dove-5.749
Token and
Token ID290
Feature activation+1.15e+00

Pos logit contributions
bs+0.607
 slips+0.447
 net+0.445
 knot+0.444
 loses+0.429
Neg logit contributions
ator-0.778
man-0.632
umm-0.544
ators-0.527
icles-0.527
Token Queen
Token ID7542
Feature activation+6.14e-01

Pos logit contributions
ator+2.313
man+1.686
umm+1.628
icles+1.516
ators+1.483
Neg logit contributions
bs-2.382
 net-1.844
 knot-1.803
 slips-1.791
 tie-1.787
Token decided
Token ID3066
Feature activation+2.09e-01

Pos logit contributions
ator+1.352
man+1.036
umm+0.973
icles+0.913
ators+0.900
Neg logit contributions
bs-1.150
 net-0.850
 slips-0.847
 knot-0.826
ip-0.815
Token to
Token ID284
Feature activation+4.72e-01

Pos logit contributions
ator+0.472
man+0.370
umm+0.337
icles+0.324
ators+0.319
Neg logit contributions
bs-0.378
 slips-0.276
 net-0.274
 knot-0.270
ip-0.265
Token\n
Token ID198
Feature activation+8.35e-02

Pos logit contributions
bs+0.232
 net+0.169
 slips+0.165
 knot+0.165
 loses+0.159
Neg logit contributions
ator-0.329
man-0.260
umm-0.236
icles-0.231
ators-0.226
Token\n
Token ID198
Feature activation+3.84e-01

Pos logit contributions
ator+0.202
man+0.161
umm+0.147
icles+0.142
ators+0.140
Neg logit contributions
bs-0.138
 slips-0.098
 net-0.098
 knot-0.095
 loses-0.093
TokenOne
Token ID3198
Feature activation-6.48e-01

Pos logit contributions
ator+0.811
man+0.617
umm+0.567
icles+0.535
ators+0.527
Neg logit contributions
bs-0.765
 slips-0.576
 net-0.575
 knot-0.560
 loses-0.555
Token day
Token ID1110
Feature activation-9.16e-01

Pos logit contributions
bs+1.580
 slips+1.282
 loses+1.274
 net+1.260
 knot+1.257
Neg logit contributions
ator-1.038
man-0.765
umm-0.628
icles-0.577
 birth-0.562
Token,
Token ID11
Feature activation-6.14e-01

Pos logit contributions
bs+1.533
 knot+1.110
ata+1.102
 slips+1.084
 loses+1.082
Neg logit contributions
ator-2.110
man-1.705
umm-1.536
icles-1.531
 him-1.479
Token the
Token ID262
Feature activation+3.98e+00

Pos logit contributions
bs+0.986
 slips+0.686
ata+0.681
 loses+0.669
ip+0.669
Neg logit contributions
ator-1.489
man-1.214
umm-1.106
icles-1.094
 birth-1.047
Token p
Token ID279
Feature activation+2.10e+00

Pos logit contributions
ator+11.385
man+9.069
ators+8.763
umm+8.753
icles+8.415
Neg logit contributions
bs-5.297
 loop-4.778
 dove-4.312
 lime-4.296
 net-4.249
Tokenanda
Token ID5282
Feature activation-6.76e-01

Pos logit contributions
ator+4.280
man+3.211
umm+3.057
icles+2.838
ators+2.826
Neg logit contributions
bs-4.522
ip-3.451
 slips-3.450
op-3.365
 loop-3.257
Token went
Token ID1816
Feature activation+2.37e-01

Pos logit contributions
bs+1.310
 knot+0.981
 net+0.974
 slips+0.952
ata+0.924
Neg logit contributions
ator-1.414
man-1.136
umm-0.953
ators-0.930
icles-0.914
Token for
Token ID329
Feature activation-4.04e-01

Pos logit contributions
ator+0.530
man+0.414
umm+0.377
icles+0.360
ators+0.355
Neg logit contributions
bs-0.437
 slips-0.319
 net-0.317
 knot-0.313
ip-0.306
Token a
Token ID257
Feature activation-8.69e-01

Pos logit contributions
bs+0.551
 slips+0.361
 net+0.336
 knot+0.335
ip+0.334
Neg logit contributions
ator-1.093
man-0.886
umm-0.819
icles-0.817
 birth-0.806
Token kitchen
Token ID9592
Feature activation+5.37e-01

Pos logit contributions
ator+6.091
man+4.581
esh+4.327
icles+4.324
umm+4.312
Neg logit contributions
bs-3.941
 net-3.275
 slips-2.966
 lime-2.964
 knot-2.927
Token.
Token ID13
Feature activation+4.45e-01

Pos logit contributions
ator+1.321
man+1.046
umm+0.986
icles+0.937
ators+0.926
Neg logit contributions
bs-0.866
 net-0.606
 slips-0.604
 knot-0.586
ip-0.574
Token
Token ID220
Feature activation-3.41e-01

Pos logit contributions
ator+1.020
man+0.793
umm+0.743
icles+0.702
ators+0.692
Neg logit contributions
bs-0.795
 net-0.582
 slips-0.579
 knot-0.565
ip-0.554
Token\n
Token ID198
Feature activation-7.89e-02

Pos logit contributions
bs+0.577
 net+0.437
 knot+0.422
 slips+0.417
 loop+0.404
Neg logit contributions
ator-0.798
man-0.623
umm-0.570
icles-0.556
omm-0.544
Token\n
Token ID198
Feature activation+3.21e-01

Pos logit contributions
bs+0.131
 net+0.096
 slips+0.095
 knot+0.093
 loses+0.090
Neg logit contributions
ator-0.187
man-0.148
umm-0.134
icles-0.130
ators-0.128
TokenThe
Token ID464
Feature activation+3.97e+00

Pos logit contributions
ator+0.771
man+0.606
umm+0.565
icles+0.536
ators+0.533
Neg logit contributions
bs-0.539
 net-0.386
 slips-0.385
 knot-0.371
ip-0.367
Token poor
Token ID3595
Feature activation+3.51e+00

Pos logit contributions
ator+10.964
umm+8.435
man+8.352
icles+8.314
ators+8.276
Neg logit contributions
bs-5.496
 net-4.526
-4.205
 lime-4.063
 tie-4.000
Token dark
Token ID3223
Feature activation+2.42e+00

Pos logit contributions
ator+9.218
umm+7.173
ators+6.908
man+6.882
zie+6.543
Neg logit contributions
bs-5.064
 net-3.899
 dove-3.737
 lime-3.664
 tie-3.625
Token hipp
Token ID18568
Feature activation+8.20e-01

Pos logit contributions
ator+5.694
man+4.247
umm+4.076
ators+3.908
 Amen+3.880
Neg logit contributions
bs-4.144
 net-3.052
 slips-3.027
 loop-2.881
 knot-2.864
Tokeno
Token ID78
Feature activation+1.75e-01

Pos logit contributions
ator+1.680
man+1.246
umm+1.172
icles+1.093
ators+1.075
Neg logit contributions
bs-1.677
 slips-1.274
 net-1.268
ip-1.233
ata-1.229
Token never
Token ID1239
Feature activation-6.37e-01

Pos logit contributions
ator+0.394
man+0.308
umm+0.283
icles+0.268
ators+0.266
Neg logit contributions
bs-0.314
 net-0.230
 slips-0.228
 knot-0.224
ip-0.218
Token some
Token ID617
Feature activation-3.89e-01

Pos logit contributions
ator+0.163
man+0.133
umm+0.123
icles+0.120
ators+0.117
Neg logit contributions
bs-0.089
 slips-0.059
 net-0.057
 knot-0.057
 loses-0.056
Token food
Token ID2057
Feature activation+3.25e-01

Pos logit contributions
bs+0.810
 slips+0.631
 loses+0.616
ata+0.611
 knot+0.607
Neg logit contributions
ator-0.761
man-0.572
umm-0.532
icles-0.500
 birth-0.489
Token.
Token ID13
Feature activation-7.83e-01

Pos logit contributions
ator+0.705
man+0.544
umm+0.501
icles+0.474
ators+0.466
Neg logit contributions
bs-0.615
 slips-0.458
 net-0.455
 knot-0.448
 loses-0.440
Token\n
Token ID198
Feature activation-2.12e-01

Pos logit contributions
bs+1.873
 net+1.469
 slips+1.461
 loses+1.459
 knot+1.445
Neg logit contributions
ator-1.299
man-0.969
umm-0.851
icles-0.828
omm-0.756
Token\n
Token ID198
Feature activation-1.15e-01

Pos logit contributions
bs+0.360
 net+0.262
 slips+0.260
 knot+0.255
 loses+0.248
Neg logit contributions
ator-0.497
man-0.397
umm-0.353
icles-0.344
ators-0.337
TokenThe
Token ID464
Feature activation+3.95e+00

Pos logit contributions
bs+0.193
 slips+0.136
 net+0.135
 loses+0.130
 knot+0.129
Neg logit contributions
ator-0.278
man-0.226
umm-0.205
icles-0.197
ators-0.193
Token hipp
Token ID18568
Feature activation+1.47e+00

Pos logit contributions
ator+10.565
umm+8.150
ators+8.093
opher+7.728
zie+7.632
Neg logit contributions
bs-5.462
 net-4.953
 dove-4.511
 tie-4.491
 lime-4.455
Tokeno
Token ID78
Feature activation+3.68e-01

Pos logit contributions
ator+3.135
man+2.292
umm+2.275
icles+2.077
 birth+2.074
Neg logit contributions
bs-2.862
 slips-2.113
ip-2.104
 net-2.104
op-2.101
Token went
Token ID1816
Feature activation+1.17e+00

Pos logit contributions
ator+0.858
man+0.676
umm+0.626
icles+0.594
ators+0.588
Neg logit contributions
bs-0.638
 net-0.460
 slips-0.459
 knot-0.450
 loses-0.440
Token to
Token ID284
Feature activation+1.68e-01

Pos logit contributions
ator+2.877
man+2.206
umm+2.191
icles+2.027
ators+2.020
Neg logit contributions
bs-1.946
 net-1.407
 slips-1.368
 tie-1.328
ip-1.312
Token the
Token ID262
Feature activation+1.55e+00

Pos logit contributions
ator+0.405
man+0.321
umm+0.295
icles+0.285
ators+0.279
Neg logit contributions
bs-0.285
 slips-0.201
 net-0.197
ip-0.195
 knot-0.195
Token hall
Token ID6899
Feature activation+2.23e-01

Pos logit contributions
ator+6.513
man+4.929
icles+4.915
umm+4.785
omm+4.391
Neg logit contributions
bs-6.790
 loop-5.607
 net-5.345
 slips-5.247
 knot-5.236
Tokenways
Token ID1322
Feature activation-1.75e-01

Pos logit contributions
ator+0.559
man+0.446
umm+0.419
icles+0.398
ators+0.394
Neg logit contributions
bs-0.347
 net-0.240
 slips-0.238
 knot-0.234
ip-0.227
Token and
Token ID290
Feature activation+1.52e-01

Pos logit contributions
bs+0.277
 net+0.195
 slips+0.194
 knot+0.192
ip+0.184
Neg logit contributions
ator-0.431
man-0.343
umm-0.318
icles-0.304
ators-0.301
Token up
Token ID510
Feature activation+4.61e-02

Pos logit contributions
ator+0.345
man+0.269
umm+0.247
icles+0.235
ators+0.235
Neg logit contributions
bs-0.270
 net-0.199
 slips-0.197
 knot-0.195
ip-0.190
Token the
Token ID262
Feature activation+3.12e+00

Pos logit contributions
ator+0.110
man+0.086
umm+0.080
icles+0.076
ators+0.076
Neg logit contributions
bs-0.077
 net-0.055
 slips-0.055
 knot-0.054
ip-0.053
Token grand
Token ID4490
Feature activation+3.89e+00

Pos logit contributions
ator+6.790
man+5.193
icles+5.118
umm+5.094
omm+4.780
Neg logit contributions
bs-6.314
 loop-4.825
 net-4.737
 slips-4.709
 tie-4.634
Token stair
Token ID18761
Feature activation+1.61e-01

Pos logit contributions
ator+7.487
man+6.189
umm+5.706
 Amen+5.544
icles+5.540
Neg logit contributions
bs-7.807
 loop-6.067
 knot-6.001
 loses-5.937
 net-5.871
Tokenways
Token ID1322
Feature activation-2.74e-01

Pos logit contributions
ator+0.380
man+0.300
umm+0.278
icles+0.265
ators+0.263
Neg logit contributions
bs-0.273
 net-0.197
 slips-0.194
 knot-0.190
ip-0.187
Token.
Token ID13
Feature activation+1.81e-01

Pos logit contributions
bs+0.450
 net+0.328
 slips+0.323
 knot+0.321
ip+0.308
Neg logit contributions
ator-0.656
man-0.517
umm-0.479
icles-0.457
ators-0.451
Token Everywhere
Token ID48201
Feature activation-9.03e-01

Pos logit contributions
ator+0.380
man+0.287
umm+0.272
icles+0.252
 birth+0.248
Neg logit contributions
bs-0.355
 net-0.269
 slips-0.267
 knot-0.261
ata-0.258
Token she
Token ID673
Feature activation+6.94e-01

Pos logit contributions
bs+1.401
ata+1.010
 net+0.992
 slips+0.987
 loop+0.960
Neg logit contributions
ator-2.192
man-1.749
umm-1.658
 birth-1.607
ators-1.561
Token imagined
Token ID15758
Feature activation+2.32e-01

Pos logit contributions
ator+0.375
man+0.291
umm+0.265
ators+0.250
icles+0.250
Neg logit contributions
bs-0.323
 net-0.239
 slips-0.237
 knot-0.234
ip-0.230
Token a
Token ID257
Feature activation-6.63e-02

Pos logit contributions
ator+0.582
man+0.462
umm+0.431
icles+0.416
ators+0.411
Neg logit contributions
bs-0.365
 net-0.253
 slips-0.251
 knot-0.246
ip-0.239
Token big
Token ID1263
Feature activation+2.51e-02

Pos logit contributions
bs+0.123
 slips+0.089
 net+0.087
 loses+0.086
ip+0.086
Neg logit contributions
ator-0.148
man-0.116
umm-0.106
 birth-0.099
ators-0.098
Token palace
Token ID20562
Feature activation-6.99e-02

Pos logit contributions
ator+0.053
man+0.040
umm+0.037
icles+0.034
ators+0.034
Neg logit contributions
bs-0.050
 slips-0.037
 net-0.037
ata-0.036
 loses-0.036
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
bs+0.144
 net+0.110
 slips+0.109
 knot+0.107
 loses+0.106
Neg logit contributions
ator-0.143
man-0.109
umm-0.098
ators-0.092
icles-0.091
Token The
Token ID383
Feature activation+3.87e+00

Pos logit contributions
bs+2.443
 net+1.976
 loses+1.956
 slips+1.955
 knot+1.899
Neg logit contributions
ator-2.181
man-1.616
umm-1.525
icles-1.485
omm-1.454
Token palace
Token ID20562
Feature activation+6.04e-01

Pos logit contributions
ator+9.276
umm+7.278
icles+6.982
rim+6.894
man+6.837
Neg logit contributions
bs-6.996
 net-5.577
 tie-5.433
 loop-5.390
 knot-5.368
Token was
Token ID373
Feature activation+1.48e+00

Pos logit contributions
ator+1.306
man+0.995
umm+0.931
icles+0.875
ators+0.861
Neg logit contributions
bs-1.161
 net-0.866
 slips-0.865
 knot-0.845
ip-0.830
Token very
Token ID845
Feature activation+1.57e+00

Pos logit contributions
ator+3.826
man+3.007
umm+2.961
icles+2.749
ators+2.729
Neg logit contributions
bs-2.232
 slips-1.523
 net-1.522
 tie-1.491
 loop-1.491
Token pretty
Token ID2495
Feature activation-3.28e-01

Pos logit contributions
ator+4.128
man+3.291
umm+3.154
icles+3.039
ators+3.017
Neg logit contributions
bs-2.443
 slips-1.615
 net-1.591
 knot-1.543
 loop-1.539
Token.
Token ID13
Feature activation-8.50e-01

Pos logit contributions
bs+0.516
 slips+0.352
 net+0.345
 loses+0.330
 knot+0.325
Neg logit contributions
ator-0.838
man-0.674
umm-0.638
icles-0.600
ators-0.593
Token liked
Token ID8288
Feature activation+8.01e-01

Pos logit contributions
bs+0.390
 net+0.292
 slips+0.292
ata+0.283
 knot+0.282
Neg logit contributions
ator-0.429
man-0.331
umm-0.296
icles-0.285
 birth-0.284
Token the
Token ID262
Feature activation+7.74e-01

Pos logit contributions
ator+2.046
man+1.629
umm+1.552
icles+1.461
ators+1.456
Neg logit contributions
bs-1.228
 net-0.854
 slips-0.839
 knot-0.815
 loop-0.799
Token sweet
Token ID6029
Feature activation+1.05e+00

Pos logit contributions
ator+1.930
man+1.527
umm+1.443
icles+1.378
ators+1.368
Neg logit contributions
bs-1.242
 net-0.881
 slips-0.863
 knot-0.854
 tie-0.829
Token smell
Token ID8508
Feature activation-2.61e-01

Pos logit contributions
ator+2.316
man+1.764
umm+1.633
ators+1.564
icles+1.537
Neg logit contributions
bs-1.992
 net-1.488
 slips-1.459
 knot-1.456
 tie-1.427
Token.
Token ID13
Feature activation+2.85e-01

Pos logit contributions
bs+0.387
 slips+0.264
 net+0.259
 knot+0.254
 loop+0.248
Neg logit contributions
ator-0.661
man-0.539
umm-0.507
icles-0.483
 birth-0.473
Token The
Token ID383
Feature activation+3.86e+00

Pos logit contributions
ator+0.622
man+0.479
umm+0.444
icles+0.422
ators+0.411
Neg logit contributions
bs-0.545
 net-0.406
 slips-0.404
 knot-0.395
 loses-0.390
Token hipp
Token ID18568
Feature activation+1.53e+00

Pos logit contributions
ator+11.044
ators+8.734
umm+8.668
man+8.554
icles+8.280
Neg logit contributions
bs-4.848
 net-4.097
 dove-3.889
 lime-3.859
 tie-3.804
Tokeno
Token ID78
Feature activation+2.82e-01

Pos logit contributions
ator+3.312
man+2.431
umm+2.379
 birth+2.204
icles+2.191
Neg logit contributions
bs-3.002
 slips-2.204
op-2.194
ip-2.193
 net-2.186
Token skipped
Token ID26684
Feature activation+9.43e-01

Pos logit contributions
ator+0.625
man+0.485
umm+0.448
icles+0.423
ators+0.419
Neg logit contributions
bs-0.517
 net-0.381
 slips-0.381
 knot-0.371
 loses-0.364
Token on
Token ID319
Feature activation+4.36e-01

Pos logit contributions
ator+2.054
man+1.542
umm+1.467
icles+1.363
ators+1.357
Neg logit contributions
bs-1.833
 net-1.392
 slips-1.365
 knot-1.347
 loop-1.330
Token and
Token ID290
Feature activation-3.09e-01

Pos logit contributions
ator+0.993
man+0.770
umm+0.719
icles+0.681
ators+0.671
Neg logit contributions
bs-0.792
 slips-0.576
 net-0.574
 knot-0.559
ip-0.552
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
ator+1.095
man+0.875
umm+0.821
icles+0.783
ators+0.774
Neg logit contributions
bs-0.683
 slips-0.471
 net-0.471
 knot-0.456
ip-0.448
Token cheered
Token ID34767
Feature activation+1.62e+00

Pos logit contributions
bs+1.937
op+1.474
ip+1.445
 knot+1.442
 slips+1.429
Neg logit contributions
ator-2.383
man-1.946
umm-1.650
icles-1.627
 birth-1.617
Token.
Token ID13
Feature activation+3.63e-01

Pos logit contributions
ator+4.423
man+3.490
umm+3.445
icles+3.243
ators+3.239
Neg logit contributions
bs-2.264
 net-1.446
 knot-1.408
 slips-1.407
-1.406
Token They
Token ID1119
Feature activation-5.38e-01

Pos logit contributions
ator+0.795
man+0.611
umm+0.570
icles+0.537
ators+0.528
Neg logit contributions
bs-0.687
 net-0.510
 slips-0.508
 knot-0.497
ip-0.489
Token thanked
Token ID26280
Feature activation+9.75e-01

Pos logit contributions
bs+1.027
 slips+0.771
 knot+0.768
 net+0.759
ip+0.752
Neg logit contributions
ator-1.125
man-0.872
umm-0.762
icles-0.741
ators-0.736
Token the
Token ID262
Feature activation+3.83e+00

Pos logit contributions
ator+2.557
man+2.047
umm+1.966
icles+1.842
ators+1.836
Neg logit contributions
bs-1.439
 net-0.978
 slips-0.948
 knot-0.922
 tie-0.910
Token hipp
Token ID18568
Feature activation+2.02e+00

Pos logit contributions
ator+8.340
umm+6.285
ators+6.049
man+5.721
icles+5.651
Neg logit contributions
bs-7.315
 net-6.471
 lime-6.223
 dove-6.204
 loop-6.175
Tokeno
Token ID78
Feature activation+2.13e-01

Pos logit contributions
ator+4.507
umm+3.349
man+3.240
 birth+3.068
ators+3.025
Neg logit contributions
bs-3.831
op-2.810
ip-2.799
 slips-2.711
 net-2.702
Token for
Token ID329
Feature activation+1.45e+00

Pos logit contributions
ator+0.518
man+0.415
umm+0.382
icles+0.367
ators+0.361
Neg logit contributions
bs-0.348
 slips-0.245
 net-0.244
 knot-0.238
 loses-0.233
Token his
Token ID465
Feature activation+3.84e-01

Pos logit contributions
ator+3.347
man+2.588
umm+2.546
icles+2.323
ators+2.291
Neg logit contributions
bs-2.528
 net-1.957
 tie-1.862
 loop-1.849
 slips-1.845
Token show
Token ID905
Feature activation+7.45e-02

Pos logit contributions
ator+0.784
man+0.591
umm+0.544
icles+0.509
ators+0.501
Neg logit contributions
bs-0.781
 slips-0.593
 net-0.589
 knot-0.578
ip-0.573
Token.
Token ID13
Feature activation-3.70e-02

Pos logit contributions
ator+1.193
man+0.945
umm+0.891
icles+0.846
ators+0.837
Neg logit contributions
bs-0.782
 net-0.546
 slips-0.546
 knot-0.529
ip-0.518
Token The
Token ID383
Feature activation+1.75e+00

Pos logit contributions
bs+0.072
 net+0.054
 slips+0.054
 knot+0.053
 loses+0.052
Neg logit contributions
ator-0.079
man-0.061
umm-0.056
icles-0.053
ators-0.052
Token elderly
Token ID13830
Feature activation+2.01e+00

Pos logit contributions
ator+3.483
man+2.555
umm+2.408
icles+2.325
ators+2.216
Neg logit contributions
bs-3.667
 net-2.966
 knot-2.889
 loop-2.872
-2.831
Token rh
Token ID9529
Feature activation+6.54e-01

Pos logit contributions
ator+4.232
umm+3.129
man+3.088
icles+3.026
ators+2.917
Neg logit contributions
bs-3.893
 net-3.069
 tie-2.972
-2.940
 loop-2.931
Tokenin
Token ID259
Feature activation+1.53e+00

Pos logit contributions
ator+1.479
man+1.144
umm+1.077
icles+1.017
ators+1.001
Neg logit contributions
bs-1.185
 net-0.869
 slips-0.864
 knot-0.844
ip-0.831
Tokenoc
Token ID420
Feature activation+3.83e+00

Pos logit contributions
ator+3.981
umm+3.096
man+3.082
icles+2.973
ators+2.904
Neg logit contributions
bs-2.252
 net-1.524
-1.489
ip-1.479
 slips-1.477
Tokeneros
Token ID27498
Feature activation+3.86e-01

Pos logit contributions
ator+5.435
 birth+3.389
 Amen+3.244
man+3.217
zie+3.123
Neg logit contributions
bs-9.499
ip-8.308
 net-8.219
op-8.161
 knot-8.035
Token was
Token ID373
Feature activation+8.22e-01

Pos logit contributions
ator+0.846
man+0.652
umm+0.603
icles+0.568
ators+0.562
Neg logit contributions
bs-0.721
 net-0.533
 slips-0.532
 knot-0.520
 loses-0.510
Token so
Token ID523
Feature activation+4.76e-01

Pos logit contributions
ator+1.912
man+1.485
umm+1.410
icles+1.330
ators+1.311
Neg logit contributions
bs-1.439
 net-1.044
 slips-1.038
 knot-1.009
-0.998
Token happy
Token ID3772
Feature activation+7.80e-01

Pos logit contributions
ator+1.171
man+0.927
umm+0.874
icles+0.828
ators+0.819
Neg logit contributions
bs-0.772
 net-0.542
 slips-0.540
 knot-0.525
ip-0.513
Token to
Token ID284
Feature activation+1.51e+00

Pos logit contributions
ator+1.975
man+1.569
umm+1.489
icles+1.417
ators+1.407
Neg logit contributions
bs-1.207
 net-0.837
 slips-0.831
 knot-0.802
 loses-0.786
Token strong
Token ID1913
Feature activation-6.26e-01

Pos logit contributions
ator+1.233
man+0.956
umm+0.895
icles+0.845
ators+0.835
Neg logit contributions
bs-0.973
 net-0.711
 slips-0.709
 knot-0.691
ip-0.678
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
bs+0.999
 slips+0.671
 net+0.653
 knot+0.630
ip+0.627
Neg logit contributions
ator-1.576
man-1.291
umm-1.218
icles-1.118
ators-1.113
Token
Token ID220
Feature activation-1.91e-01

Pos logit contributions
ator+0.330
man+0.256
umm+0.237
icles+0.225
ators+0.218
Neg logit contributions
bs-0.282
 net-0.209
 slips-0.208
 knot-0.204
 loses-0.201
Token
Token ID198
Feature activation-1.90e-01

Pos logit contributions
bs+0.324
 net+0.238
 knot+0.232
 slips+0.232
 loses+0.223
Neg logit contributions
ator-0.454
man-0.359
umm-0.325
icles-0.319
ators-0.311
Token
Token ID198
Feature activation+8.24e-02

Pos logit contributions
bs+0.325
 net+0.235
 slips+0.233
 knot+0.230
 loses+0.223
Neg logit contributions
ator-0.449
man-0.357
umm-0.319
icles-0.312
ators-0.305
TokenThe
Token ID464
Feature activation+3.82e+00

Pos logit contributions
ator+0.205
man+0.164
umm+0.152
icles+0.145
ators+0.144
Neg logit contributions
bs-0.133
 slips-0.092
 net-0.092
 knot-0.088
 loses-0.088
Token hipp
Token ID18568
Feature activation+1.53e+00

Pos logit contributions
ator+8.705
umm+6.545
ators+6.347
opher+6.077
icles+6.075
Neg logit contributions
bs-6.749
 net-6.022
 tie-5.680
 dove-5.650
 lime-5.598
Tokeno
Token ID78
Feature activation+5.07e-01

Pos logit contributions
ator+3.280
man+2.406
umm+2.378
icles+2.183
 birth+2.182
Neg logit contributions
bs-2.983
op-2.199
 slips-2.192
ip-2.188
 net-2.172
Token found
Token ID1043
Feature activation+8.08e-03

Pos logit contributions
ator+1.161
man+0.903
umm+0.845
icles+0.798
ators+0.789
Neg logit contributions
bs-0.903
 net-0.657
 slips-0.655
 knot-0.639
ip-0.626
Token some
Token ID617
Feature activation-6.77e-01

Pos logit contributions
ator+0.022
man+0.018
umm+0.017
icles+0.016
ators+0.016
Neg logit contributions
bs-0.011
 slips-0.007
 knot-0.007
 net-0.007
 loses-0.007
Token vegetables
Token ID13701
Feature activation-7.19e-01

Pos logit contributions
bs+1.299
 slips+0.984
 loses+0.972
ata+0.963
ip+0.946
Neg logit contributions
ator-1.431
man-1.104
umm-1.028
icles-0.976
 birth-0.943
Tokeno
Token ID78
Feature activation+2.52e-01

Pos logit contributions
ator+2.885
man+2.117
umm+2.054
ators+1.885
icles+1.882
Neg logit contributions
bs-2.692
 slips-1.992
 net-1.987
ip-1.973
-1.957
Token was
Token ID373
Feature activation+7.44e-01

Pos logit contributions
ator+0.525
man+0.400
umm+0.366
icles+0.346
ators+0.341
Neg logit contributions
bs-0.496
 net-0.377
 slips-0.375
 knot-0.368
 loop-0.359
Token so
Token ID523
Feature activation+1.11e+00

Pos logit contributions
ator+1.804
man+1.420
umm+1.349
icles+1.275
ators+1.260
Neg logit contributions
bs-1.235
 net-0.881
 slips-0.871
 knot-0.843
 loses-0.830
Token attractive
Token ID10966
Feature activation+2.66e-01

Pos logit contributions
ator+2.742
man+2.168
umm+2.056
icles+1.963
ators+1.955
Neg logit contributions
bs-1.860
 net-1.288
 slips-1.277
-1.220
 knot-1.216
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
ator+0.648
man+0.514
umm+0.483
icles+0.460
ators+0.453
Neg logit contributions
bs-0.435
 slips-0.304
 net-0.304
 knot-0.296
ata-0.289
Token The
Token ID383
Feature activation+3.82e+00

Pos logit contributions
ator+0.274
man+0.216
umm+0.201
icles+0.194
ators+0.188
Neg logit contributions
bs-0.203
 slips-0.144
 net-0.143
 knot-0.142
 loses-0.139
Token hipp
Token ID18568
Feature activation+1.72e+00

Pos logit contributions
ator+8.460
umm+6.137
ators+6.058
man+5.969
esh+5.718
Neg logit contributions
bs-7.226
 net-6.395
 dove-6.060
 lime-6.034
 tie-5.969
Tokeno
Token ID78
Feature activation+5.38e-01

Pos logit contributions
ator+3.768
man+2.770
umm+2.752
icles+2.522
 birth+2.522
Neg logit contributions
bs-3.311
op-2.427
 slips-2.415
ip-2.410
 net-2.385
Token was
Token ID373
Feature activation+8.13e-01

Pos logit contributions
ator+1.230
man+0.956
umm+0.895
icles+0.846
ators+0.835
Neg logit contributions
bs-0.960
 net-0.699
 slips-0.697
 knot-0.680
ip-0.667
Token excited
Token ID6568
Feature activation+6.18e-01

Pos logit contributions
ator+1.954
man+1.528
umm+1.458
icles+1.376
ators+1.360
Neg logit contributions
bs-1.372
 net-0.988
 slips-0.979
 knot-0.943
 tie-0.934
Token to
Token ID284
Feature activation+6.44e-01

Pos logit contributions
ator+1.577
man+1.257
umm+1.189
icles+1.131
ators+1.122
Neg logit contributions
bs-0.953
 net-0.654
 slips-0.650
 knot-0.629
 loses-0.615
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
bs+0.221
 net+0.154
 knot+0.154
 slips+0.153
ip+0.148
Neg logit contributions
ator-0.317
man-0.257
umm-0.230
icles-0.225
 birth-0.222
Token gorilla
Token ID45314
Feature activation-3.42e-01

Pos logit contributions
ator+2.885
man+2.096
umm+2.020
icles+1.888
ators+1.862
Neg logit contributions
bs-2.659
 net-2.121
 slips-2.059
 knot-2.030
 loop-2.017
Token.
Token ID13
Feature activation-7.20e-01

Pos logit contributions
bs+0.578
 slips+0.414
 net+0.405
 knot+0.400
 loses+0.388
Neg logit contributions
ator-0.816
man-0.662
umm-0.587
icles-0.568
ators-0.560
Token
Token ID198
Feature activation-4.25e-01

Pos logit contributions
bs+1.477
 knot+1.148
 net+1.133
 slips+1.122
ata+1.103
Neg logit contributions
ator-1.398
man-1.078
umm-0.980
icles-0.970
omm-0.930
Token
Token ID198
Feature activation-6.95e-01

Pos logit contributions
bs+0.739
 net+0.542
 slips+0.534
 knot+0.531
 loses+0.518
Neg logit contributions
ator-0.984
man-0.789
icles-0.693
umm-0.685
ators-0.670
TokenThe
Token ID464
Feature activation+3.82e+00

Pos logit contributions
bs+1.364
 net+1.041
 slips+1.030
 knot+1.016
 loses+1.016
Neg logit contributions
ator-1.452
man-1.123
icles-0.986
umm-0.978
ators-0.956
Token gorilla
Token ID45314
Feature activation+1.24e+00

Pos logit contributions
ator+8.450
umm+6.146
icles+5.777
ators+5.745
esh+5.654
Neg logit contributions
bs-7.171
 net-6.624
 dove-6.446
 lime-6.131
 rope-6.105
Token was
Token ID373
Feature activation+6.79e-01

Pos logit contributions
ator+2.900
man+2.179
umm+2.168
icles+2.023
ators+1.981
Neg logit contributions
bs-2.191
 slips-1.616
 net-1.600
 loses-1.539
 knot-1.536
Token big
Token ID1263
Feature activation-4.54e-01

Pos logit contributions
ator+1.670
man+1.314
umm+1.251
icles+1.184
ators+1.170
Neg logit contributions
bs-1.104
 net-0.777
 slips-0.775
 knot-0.749
 loses-0.735
Token and
Token ID290
Feature activation+2.57e-01

Pos logit contributions
bs+0.677
 slips+0.424
 net+0.421
ata+0.418
ip+0.409
Neg logit contributions
ator-1.174
man-0.976
umm-0.898
ators-0.851
icles-0.848
Token strong
Token ID1913
Feature activation-6.88e-02

Pos logit contributions
ator+0.598
man+0.475
umm+0.438
icles+0.418
ators+0.412
Neg logit contributions
bs-0.440
 net-0.316
 slips-0.315
 knot-0.307
ata-0.302
Token was
Token ID373
Feature activation-8.55e-01

Pos logit contributions
ator+1.897
man+1.430
umm+1.357
icles+1.256
ators+1.246
Neg logit contributions
bs-1.723
 slips-1.306
 net-1.304
 knot-1.271
 loop-1.259
Token a
Token ID257
Feature activation-8.02e-01

Pos logit contributions
bs+1.269
 slips+0.867
 loses+0.839
 knot+0.802
ip+0.788
Neg logit contributions
ator-2.215
man-1.782
umm-1.668
 birth-1.667
icles-1.637
Token p
Token ID279
Feature activation+5.82e-02

Pos logit contributions
bs+1.440
 slips+1.045
 loses+1.034
ata+1.022
 net+0.975
Neg logit contributions
ator-1.829
man-1.487
umm-1.323
 birth-1.243
omm-1.220
Tokenanda
Token ID5282
Feature activation-5.57e-02

Pos logit contributions
ator+0.107
man+0.077
umm+0.069
icles+0.063
ators+0.063
Neg logit contributions
bs-0.129
 net-0.102
 slips-0.102
 knot-0.101
 loses-0.099
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
bs+0.089
 slips+0.063
 net+0.062
 knot+0.061
 loses+0.060
Neg logit contributions
ator-0.138
man-0.112
umm-0.101
icles-0.097
ators-0.096
Token The
Token ID383
Feature activation+3.80e+00

Pos logit contributions
ator+2.725
man+2.062
umm+1.928
ators+1.849
icles+1.795
Neg logit contributions
bs-2.198
 net-1.627
 slips-1.616
 knot-1.581
-1.558
Token p
Token ID279
Feature activation+2.17e+00

Pos logit contributions
ator+10.483
man+8.187
umm+8.131
ators+8.111
icles+7.598
Neg logit contributions
bs-5.234
 loop-4.849
 net-4.526
 dove-4.475
 lime-4.418
Tokenanda
Token ID5282
Feature activation+8.65e-01

Pos logit contributions
ator+4.418
man+3.337
umm+3.170
icles+2.970
ators+2.914
Neg logit contributions
bs-4.677
ip-3.576
 slips-3.543
op-3.493
 net-3.398
Token was
Token ID373
Feature activation+1.20e+00

Pos logit contributions
ator+1.787
man+1.322
umm+1.271
icles+1.179
ators+1.150
Neg logit contributions
bs-1.756
 slips-1.328
 net-1.326
ip-1.292
 knot-1.290
Token very
Token ID845
Feature activation+1.67e+00

Pos logit contributions
ator+2.891
man+2.217
umm+2.175
ators+2.032
icles+2.028
Neg logit contributions
bs-2.046
 net-1.490
 slips-1.465
 tie-1.428
 knot-1.414
Token gloomy
Token ID46400
Feature activation+1.34e-01

Pos logit contributions
ator+4.491
man+3.559
umm+3.454
ators+3.381
icles+3.332
Neg logit contributions
bs-2.574
 net-1.675
 slips-1.673
 tie-1.606
ip-1.596
Token asked
Token ID1965
Feature activation+7.90e-01

Pos logit contributions
ator+0.402
man+0.319
umm+0.288
icles+0.277
ators+0.274
Neg logit contributions
bs-0.304
 slips-0.221
 net-0.220
 knot-0.217
ip-0.212
Token if
Token ID611
Feature activation+7.22e-01

Pos logit contributions
ator+2.053
man+1.637
umm+1.570
icles+1.477
ators+1.467
Neg logit contributions
bs-1.194
 net-0.802
 slips-0.794
 knot-0.770
ip-0.760
Token he
Token ID339
Feature activation-6.26e-01

Pos logit contributions
ator+1.656
man+1.279
umm+1.211
icles+1.135
ators+1.124
Neg logit contributions
bs-1.296
 net-0.946
 slips-0.940
 knot-0.912
ip-0.900
Token could
Token ID714
Feature activation-6.36e-01

Pos logit contributions
bs+1.089
 net+0.831
 slips+0.800
 knot+0.793
ata+0.780
Neg logit contributions
ator-1.385
man-1.128
umm-0.999
icles-0.951
 birth-0.948
Token borrow
Token ID8804
Feature activation-1.62e+00

Pos logit contributions
bs+1.048
 net+0.753
 slips+0.753
ip+0.723
ata+0.722
Neg logit contributions
ator-1.520
man-1.230
umm-1.077
icles-1.067
 birth-1.036
Token a
Token ID257
Feature activation-4.19e+00

Pos logit contributions
bs+3.602
 slips+2.911
ata+2.878
op+2.763
 net+2.710
Neg logit contributions
ator-2.903
 him-2.186
man-2.132
icles-2.006
 himself-1.960
Token strip
Token ID10283
Feature activation-7.82e-01

Pos logit contributions
bs+8.615
imes+6.870
 loses+6.250
ume+6.162
 slips+6.080
Neg logit contributions
ator-8.503
man-7.993
 me-7.163
 us-6.387
 next-6.067
Token.
Token ID13
Feature activation+6.60e-01

Pos logit contributions
bs+1.232
 slips+0.864
 net+0.823
 loses+0.823
 knot+0.800
Neg logit contributions
ator-1.957
man-1.628
umm-1.449
icles-1.421
ators-1.352
Token But
Token ID887
Feature activation-2.79e-01

Pos logit contributions
ator+1.574
man+1.232
umm+1.163
icles+1.097
ators+1.090
Neg logit contributions
bs-1.124
 net-0.802
 slips-0.800
 knot-0.775
ip-0.763
Token,
Token ID11
Feature activation-3.23e-01

Pos logit contributions
bs+0.555
 slips+0.418
 net+0.412
 knot+0.410
ata+0.408
Neg logit contributions
ator-0.586
man-0.456
umm-0.399
icles-0.394
ators-0.379
Token his
Token ID465
Feature activation-1.54e+00

Pos logit contributions
bs+0.624
 slips+0.470
 net+0.460
 knot+0.457
 loses+0.455
Neg logit contributions
ator-0.695
man-0.543
umm-0.483
icles-0.474
ators-0.457
Token branch
Token ID8478
Feature activation-1.03e+00

Pos logit contributions
bs+3.318
ip+2.313
ata+2.297
 loses+2.259
ver+2.229
Neg logit contributions
ator-4.382
man-3.783
umm-3.115
 birth-3.046
icles-2.942
Token.
Token ID13
Feature activation-2.74e-01

Pos logit contributions
bs+1.779
ip+1.277
 net+1.275
 slips+1.251
 loses+1.248
Neg logit contributions
ator-2.350
man-1.978
icles-1.656
umm-1.628
ators-1.588
Token They
Token ID1119
Feature activation-1.97e+00

Pos logit contributions
bs+0.547
 net+0.413
 slips+0.413
 knot+0.408
ip+0.404
Neg logit contributions
ator-0.564
man-0.423
umm-0.404
icles-0.385
 birth-0.364
Token look
Token ID804
Feature activation-2.54e-01

Pos logit contributions
bs+3.333
ip+2.628
ata+2.476
op+2.474
ching+2.470
Neg logit contributions
ator-4.287
man-3.329
 him-3.027
ators-2.984
 birth-2.908
Token for
Token ID329
Feature activation-1.30e+00

Pos logit contributions
bs+0.470
 slips+0.356
 net+0.354
ata+0.347
 knot+0.346
Neg logit contributions
ator-0.543
man-0.419
umm-0.377
icles-0.364
ators-0.361
Token another
Token ID1194
Feature activation-3.96e+00

Pos logit contributions
bs+2.321
 slips+1.750
ata+1.740
ip+1.727
op+1.670
Neg logit contributions
ator-2.974
man-2.249
 him-2.168
icles-2.156
umm-2.089
Token tree
Token ID5509
Feature activation-1.55e+00

Pos logit contributions
bs+8.769
imes+6.377
ip+6.320
 loses+6.268
azz+6.267
Neg logit contributions
ator-8.187
man-7.194
 next-5.952
 me-5.691
 us-5.522
Token.
Token ID13
Feature activation-6.21e-02

Pos logit contributions
bs+2.948
 slips+2.185
 knot+2.158
ip+2.124
 net+2.105
Neg logit contributions
ator-3.335
man-2.785
umm-2.242
icles-2.215
ators-2.190
Token They
Token ID1119
Feature activation-1.13e+00

Pos logit contributions
bs+0.123
 net+0.093
 slips+0.093
 knot+0.092
ip+0.091
Neg logit contributions
ator-0.129
man-0.097
umm-0.092
icles-0.087
ators-0.083
Token see
Token ID766
Feature activation-1.41e+00

Pos logit contributions
bs+1.855
ip+1.415
ata+1.351
ching+1.340
 net+1.324
Neg logit contributions
ator-2.554
man-2.005
 him-1.774
ators-1.765
umm-1.741
Token one
Token ID530
Feature activation-1.98e+00

Pos logit contributions
bs+2.262
ata+1.615
ip+1.594
op+1.553
 slips+1.499
Neg logit contributions
ator-3.446
man-2.863
 him-2.613
icles-2.609
 birth-2.498
Token "
Token ID366
Feature activation+9.97e-01

Pos logit contributions
ator+1.136
man+0.884
umm+0.826
icles+0.782
ators+0.771
Neg logit contributions
bs-0.887
 net-0.646
 slips-0.645
 knot-0.628
 loses-0.616
TokenHow
Token ID2437
Feature activation-2.43e+00

Pos logit contributions
ator+1.919
man+1.388
umm+1.304
icles+1.209
ators+1.165
Neg logit contributions
bs-2.158
 net-1.685
 slips-1.668
 knot-1.651
 tie-1.642
Token do
Token ID466
Feature activation-1.60e-02

Pos logit contributions
bs+4.751
 net+3.703
ata+3.496
ip+3.482
 knot+3.478
Neg logit contributions
ator-5.088
man-4.010
 birth-3.476
icles-3.461
 him-3.417
Token you
Token ID345
Feature activation-1.26e+00

Pos logit contributions
bs+0.034
 slips+0.026
 net+0.026
ata+0.025
 knot+0.025
Neg logit contributions
ator-0.031
man-0.023
umm-0.021
icles-0.020
 birth-0.020
Token snap
Token ID11495
Feature activation-1.35e+00

Pos logit contributions
bs+2.133
ip+1.561
 net+1.514
 slips+1.471
ata+1.419
Neg logit contributions
ator-2.971
man-2.304
umm-2.102
 him-2.051
icles-2.038
Token a
Token ID257
Feature activation-3.86e+00

Pos logit contributions
bs+2.691
ata+2.037
 slips+1.999
 net+1.994
ip+1.933
Neg logit contributions
ator-2.800
man-2.116
icles-1.903
ators-1.891
 us-1.850
Token knot
Token ID29654
Feature activation+3.48e-02

Pos logit contributions
bs+5.782
imes+3.793
azz+3.616
ata+3.493
ip+3.396
Neg logit contributions
ator-10.225
man-9.002
 me-8.477
umm-8.099
 us-7.922
Token?"
Token ID1701
Feature activation+6.77e-01

Pos logit contributions
ator+0.078
man+0.061
umm+0.055
icles+0.053
ators+0.052
Neg logit contributions
bs-0.064
 net-0.047
 slips-0.046
 knot-0.045
ip-0.045
Token
Token ID220
Feature activation-4.64e-01

Pos logit contributions
ator+1.664
man+1.309
umm+1.242
icles+1.174
ators+1.166
Neg logit contributions
bs-1.117
 net-0.782
 slips-0.777
 knot-0.756
ip-0.745
Token
Token ID198
Feature activation+8.65e-01

Pos logit contributions
bs+0.797
 net+0.590
 slips+0.575
 knot+0.573
 loses+0.555
Neg logit contributions
ator-1.093
man-0.859
umm-0.780
icles-0.758
omm-0.742
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
ator+2.241
man+1.785
umm+1.725
icles+1.630
ators+1.612
Neg logit contributions
bs-1.301
 net-0.861
 slips-0.858
 knot-0.833
ip-0.830
Token he
Token ID339
Feature activation+4.46e-01

Pos logit contributions
ator+3.181
man+2.470
umm+2.405
icles+2.259
ators+2.257
Neg logit contributions
bs-2.058
 slips-1.451
 net-1.450
 loses-1.388
-1.375
Token promised
Token ID8072
Feature activation+1.22e+00

Pos logit contributions
ator+1.074
man+0.849
umm+0.794
icles+0.754
ators+0.746
Neg logit contributions
bs-0.744
 net-0.527
 slips-0.525
 knot-0.510
ip-0.500
Token to
Token ID284
Feature activation+1.38e+00

Pos logit contributions
ator+2.919
man+2.258
umm+2.199
ators+2.052
icles+2.022
Neg logit contributions
bs-2.073
 net-1.491
 slips-1.476
-1.431
 tie-1.429
Token never
Token ID1239
Feature activation-5.09e-01

Pos logit contributions
ator+2.904
man+2.162
umm+2.094
ators+1.918
icles+1.918
Neg logit contributions
bs-2.736
 net-2.123
 tie-2.096
 slips-2.091
 knot-2.072
Token attach
Token ID10199
Feature activation+1.92e-02

Pos logit contributions
bs+0.961
ip+0.699
ata+0.694
 net+0.688
 slips+0.685
Neg logit contributions
ator-1.104
man-0.869
umm-0.795
icles-0.748
 birth-0.737
Token a
Token ID257
Feature activation-3.78e+00

Pos logit contributions
ator+0.043
man+0.034
umm+0.031
icles+0.030
ators+0.029
Neg logit contributions
bs-0.035
 slips-0.025
 net-0.025
 knot-0.025
ata-0.025
Token net
Token ID2010
Feature activation+6.61e-02

Pos logit contributions
bs+5.547
ird+4.156
ip+3.909
ata+3.855
azz+3.626
Neg logit contributions
ator-9.805
man-9.193
 us-7.999
 me-7.690
umm-7.296
Token to
Token ID284
Feature activation-3.68e-01

Pos logit contributions
ator+0.133
man+0.101
umm+0.091
icles+0.086
ators+0.084
Neg logit contributions
bs-0.134
 slips-0.102
 net-0.102
 knot-0.100
ip-0.099
Token someone
Token ID2130
Feature activation-9.04e-01

Pos logit contributions
bs+0.687
 slips+0.503
ip+0.489
ata+0.487
 net+0.486
Neg logit contributions
ator-0.826
man-0.638
umm-0.581
icles-0.571
ators-0.549
Token again
Token ID757
Feature activation-6.86e-01

Pos logit contributions
bs+1.507
 net+1.064
 slips+1.061
 knot+1.051
ata+1.040
Neg logit contributions
ator-2.133
man-1.705
umm-1.530
ators-1.495
 birth-1.484
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
bs+1.234
 net+0.893
 knot+0.892
 slips+0.887
ata+0.879
Neg logit contributions
ator-1.552
man-1.223
umm-1.115
ators-1.041
icles-1.041
Token dad
Token ID9955
Feature activation-3.21e-01

Pos logit contributions
bs+1.179
 slips+0.897
ata+0.895
 loses+0.873
 net+0.864
Neg logit contributions
ator-1.138
man-0.873
umm-0.777
 birth-0.747
ators-0.719
Token if
Token ID611
Feature activation+2.27e-01

Pos logit contributions
bs+0.477
 net+0.332
 slips+0.330
 knot+0.324
 loop+0.317
Neg logit contributions
ator-0.820
man-0.670
umm-0.605
icles-0.596
ators-0.588
Token he
Token ID339
Feature activation-2.52e-01

Pos logit contributions
ator+0.510
man+0.398
umm+0.365
icles+0.348
ators+0.343
Neg logit contributions
bs-0.415
 slips-0.305
 net-0.304
 knot-0.299
ata-0.294
Token could
Token ID714
Feature activation-5.83e-01

Pos logit contributions
bs+0.437
 net+0.327
 slips+0.317
 knot+0.314
 loop+0.307
Neg logit contributions
ator-0.574
man-0.456
umm-0.414
icles-0.394
ators-0.390
Token take
Token ID1011
Feature activation-6.57e-01

Pos logit contributions
bs+1.018
 net+0.743
 slips+0.737
ata+0.721
 knot+0.714
Neg logit contributions
ator-1.340
man-1.052
umm-0.948
icles-0.929
 birth-0.897
Token a
Token ID257
Feature activation-3.73e+00

Pos logit contributions
bs+1.163
 slips+0.880
 net+0.834
ata+0.825
 loses+0.823
Neg logit contributions
ator-1.495
man-1.151
umm-1.073
 him-1.052
icles-1.045
Token seat
Token ID5852
Feature activation-2.88e-01

Pos logit contributions
bs+7.090
 slips+6.108
 loses+5.832
imes+5.712
 pads+5.638
Neg logit contributions
ator-7.161
man-6.026
 me-5.254
 us-5.193
 birth-5.020
Token under
Token ID739
Feature activation+5.79e-02

Pos logit contributions
bs+0.528
 net+0.391
 slips+0.391
 knot+0.378
 loses+0.373
Neg logit contributions
ator-0.643
man-0.507
umm-0.456
icles-0.443
ators-0.436
Token the
Token ID262
Feature activation-3.80e-01

Pos logit contributions
ator+0.131
man+0.101
umm+0.094
icles+0.091
ators+0.089
Neg logit contributions
bs-0.105
 slips-0.077
 net-0.075
 knot-0.074
ip-0.074
Token shady
Token ID36135
Feature activation-1.29e-01

Pos logit contributions
bs+0.695
 slips+0.492
ata+0.481
ip+0.481
 net+0.475
Neg logit contributions
ator-0.871
man-0.681
umm-0.639
icles-0.603
ators-0.588
Token tree
Token ID5509
Feature activation-1.10e+00

Pos logit contributions
bs+0.221
 slips+0.154
 loses+0.149
 net+0.148
ata+0.147
Neg logit contributions
ator-0.312
man-0.247
umm-0.228
icles-0.217
ators-0.213
Token green
Token ID4077
Feature activation-9.67e-01

Pos logit contributions
bs+3.110
 loses+2.194
ip+2.188
ata+2.185
 slips+2.149
Neg logit contributions
ator-3.437
man-2.821
umm-2.421
 birth-2.273
ators-2.204
Token rope
Token ID17182
Feature activation-7.44e-01

Pos logit contributions
bs+1.736
 loses+1.192
ata+1.169
 slips+1.160
ip+1.152
Neg logit contributions
ator-2.304
man-1.875
umm-1.705
icles-1.617
ators-1.557
Token!
Token ID0
Feature activation-1.14e-01

Pos logit contributions
bs+1.409
ip+0.983
 slips+0.973
 loses+0.966
 net+0.963
Neg logit contributions
ator-1.670
man-1.352
umm-1.170
icles-1.145
ators-1.122
Token He
Token ID679
Feature activation-5.83e-01

Pos logit contributions
bs+0.253
 net+0.195
 slips+0.194
 knot+0.191
ip+0.191
Neg logit contributions
ator-0.212
man-0.158
umm-0.144
icles-0.136
ators-0.130
Token took
Token ID1718
Feature activation-3.25e-01

Pos logit contributions
bs+1.112
 net+0.841
 slips+0.835
 loses+0.808
ata+0.805
Neg logit contributions
ator-1.225
man-0.957
umm-0.835
icles-0.809
 birth-0.808
Token the
Token ID262
Feature activation-3.71e+00

Pos logit contributions
bs+0.530
 slips+0.375
 net+0.373
 loses+0.361
 knot+0.356
Neg logit contributions
ator-0.789
man-0.625
umm-0.580
icles-0.569
 him-0.548
Token rope
Token ID17182
Feature activation-9.46e-01

Pos logit contributions
bs+5.515
azz+4.022
 loses+3.680
ip+3.369
ata+3.329
Neg logit contributions
ator-9.488
man-8.135
 us-7.353
 me-7.320
icles-7.248
Token and
Token ID290
Feature activation-2.15e+00

Pos logit contributions
bs+1.798
 slips+1.308
 loses+1.287
ip+1.275
 net+1.257
Neg logit contributions
ator-2.083
man-1.677
icles-1.445
umm-1.436
esh-1.380
Token ran
Token ID4966
Feature activation+5.82e-01

Pos logit contributions
bs+4.265
ume+3.350
 loses+3.328
 slips+3.322
 net+3.309
Neg logit contributions
ator-4.124
man-3.232
 him-2.918
icles-2.904
esh-2.806
Token to
Token ID284
Feature activation-8.28e-01

Pos logit contributions
ator+1.304
man+1.006
umm+0.942
icles+0.886
ators+0.876
Neg logit contributions
bs-1.072
 net-0.790
 slips-0.787
 knot-0.768
ip-0.754
Token the
Token ID262
Feature activation-2.41e+00

Pos logit contributions
bs+1.534
ip+1.092
 slips+1.080
ata+1.071
 net+1.053
Neg logit contributions
ator-1.902
man-1.457
 him-1.336
icles-1.307
umm-1.289
Token."
Token ID526
Feature activation-1.29e+00

Pos logit contributions
bs+1.557
 net+1.169
 slips+1.149
ip+1.121
 knot+1.120
Neg logit contributions
ator-1.850
man-1.407
icles-1.266
umm-1.265
ators-1.237
Token
Token ID198
Feature activation-1.22e+00

Pos logit contributions
bs+2.465
 knot+1.947
ata+1.926
 loop+1.892
 net+1.891
Neg logit contributions
ator-2.552
man-1.961
umm-1.856
 him-1.849
omm-1.754
Token
Token ID198
Feature activation-1.86e+00

Pos logit contributions
bs+2.295
 knot+1.756
 net+1.741
ata+1.712
 slips+1.710
Neg logit contributions
ator-2.569
man-2.003
 him-1.787
icles-1.698
umm-1.673
TokenShe
Token ID3347
Feature activation-1.07e+00

Pos logit contributions
bs+2.865
ata+1.972
ip+1.966
ver+1.929
 loses+1.919
Neg logit contributions
ator-4.545
man-3.709
 him-3.337
 us-3.305
umm-3.257
Token threw
Token ID9617
Feature activation-1.47e+00

Pos logit contributions
bs+1.818
 net+1.299
 slips+1.275
 loses+1.257
 knot+1.250
Neg logit contributions
ator-2.475
man-2.043
icles-1.731
umm-1.718
ators-1.714
Token the
Token ID262
Feature activation-3.70e+00

Pos logit contributions
bs+2.759
op+2.037
 slips+2.030
ata+1.998
ip+1.971
Neg logit contributions
ator-3.049
man-2.426
 him-2.348
icles-2.272
umm-2.115
Token rope
Token ID17182
Feature activation-7.22e-01

Pos logit contributions
bs+5.357
azz+3.505
ird+3.413
 loses+3.259
ata+3.144
Neg logit contributions
ator-9.801
man-8.747
 us-7.482
icles-7.463
esh-7.195
Token into
Token ID656
Feature activation-1.69e+00

Pos logit contributions
bs+1.419
 net+1.053
 slips+1.047
ip+1.035
ata+1.030
Neg logit contributions
ator-1.486
man-1.190
umm-1.014
icles-1.010
ators-0.964
Token the
Token ID262
Feature activation-1.83e+00

Pos logit contributions
bs+3.433
 slips+2.618
ip+2.587
 loses+2.459
 pads+2.457
Neg logit contributions
ator-3.545
man-2.574
 him-2.515
icles-2.418
 himself-2.352
Token hole
Token ID7604
Feature activation-2.39e-01

Pos logit contributions
bs+3.077
ver+2.169
ip+2.083
ata+2.023
azz+1.910
Neg logit contributions
ator-4.574
man-3.827
umm-3.451
ators-3.204
 us-3.178
Token and
Token ID290
Feature activation-2.00e+00

Pos logit contributions
bs+0.394
 net+0.277
 slips+0.273
 knot+0.272
ip+0.268
Neg logit contributions
ator-0.575
man-0.462
umm-0.420
icles-0.408
ators-0.397
Token tied
Token ID8165
Feature activation-5.93e-01

Pos logit contributions
bs+2.319
 net+1.780
 slips+1.756
 loses+1.697
 knot+1.687
Neg logit contributions
ator-2.310
man-1.829
umm-1.493
ators-1.477
 birth-1.444
Token the
Token ID262
Feature activation-3.40e+00

Pos logit contributions
bs+1.032
 slips+0.744
 net+0.731
 loses+0.728
ata+0.720
Neg logit contributions
ator-1.365
man-1.092
icles-0.994
umm-0.972
 him-0.944
Token other
Token ID584
Feature activation-2.75e+00

Pos logit contributions
bs+4.955
azz+3.096
ip+2.902
 loses+2.891
ata+2.875
Neg logit contributions
ator-8.894
man-8.044
 me-6.849
 next-6.809
ators-6.767
Token end
Token ID886
Feature activation-1.40e+00

Pos logit contributions
bs+5.262
ip+3.717
 loses+3.714
ata+3.668
azz+3.650
Neg logit contributions
ator-5.882
man-4.964
ators-4.350
icles-4.314
umm-4.311
Token of
Token ID286
Feature activation-1.83e+00

Pos logit contributions
bs+2.172
 loses+1.583
 net+1.562
 slips+1.516
ata+1.506
Neg logit contributions
ator-3.429
man-2.836
umm-2.522
icles-2.497
lo-2.356
Token the
Token ID262
Feature activation-3.66e+00

Pos logit contributions
bs+3.890
 loses+2.990
 slips+2.966
ata+2.867
ip+2.703
Neg logit contributions
ator-3.720
man-2.880
 him-2.639
icles-2.612
ators-2.417
Token rope
Token ID17182
Feature activation-6.25e-01

Pos logit contributions
bs+5.586
azz+3.764
ver+3.501
ata+3.424
aves+3.367
Neg logit contributions
ator-9.409
man-8.228
 next-7.619
 me-7.339
 us-7.324
Token to
Token ID284
Feature activation-1.11e+00

Pos logit contributions
bs+1.028
 net+0.725
 slips+0.718
 loses+0.704
 knot+0.696
Neg logit contributions
ator-1.491
man-1.212
umm-1.083
icles-1.059
ators-1.029
Token her
Token ID607
Feature activation-1.67e+00

Pos logit contributions
bs+2.050
 slips+1.510
 net+1.467
 loses+1.458
ip+1.453
Neg logit contributions
ator-2.502
man-1.940
 him-1.733
umm-1.732
icles-1.725
Token hand
Token ID1021
Feature activation-8.31e-01

Pos logit contributions
bs+3.458
ata+2.622
 slips+2.554
ching+2.467
ip+2.462
Neg logit contributions
ator-3.404
man-2.673
umm-2.323
 birth-2.229
ators-2.176
Token,
Token ID11
Feature activation-8.02e-01

Pos logit contributions
bs+1.353
 net+0.978
 slips+0.973
 loses+0.950
 knot+0.926
Neg logit contributions
ator-1.966
man-1.643
icles-1.441
umm-1.437
ators-1.373
Token\n
Token ID198
Feature activation-5.50e-01

Pos logit contributions
bs+0.985
 net+0.759
 knot+0.729
 slips+0.725
 loses+0.706
Neg logit contributions
ator-1.317
man-1.027
umm-0.921
icles-0.915
ators-0.889
Token\n
Token ID198
Feature activation-1.17e+00

Pos logit contributions
bs+0.971
 net+0.741
 slips+0.727
 knot+0.705
 loses+0.693
Neg logit contributions
ator-1.246
man-0.979
umm-0.873
icles-0.843
 birth-0.830
TokenL
Token ID43
Feature activation-8.73e-01

Pos logit contributions
bs+2.138
 net+1.681
 slips+1.662
 loses+1.599
ip+1.531
Neg logit contributions
ator-2.538
man-1.973
umm-1.808
 birth-1.703
omm-1.681
Tokenily
Token ID813
Feature activation-7.44e-01

Pos logit contributions
bs+1.876
 net+1.510
 slips+1.470
 knot+1.417
 loses+1.409
Neg logit contributions
ator-1.758
man-1.215
umm-1.114
ators-1.062
icles-1.061
Token took
Token ID1718
Feature activation-1.74e+00

Pos logit contributions
bs+1.414
 net+1.056
 slips+1.049
 loses+1.012
 knot+1.005
Neg logit contributions
ator-1.583
man-1.248
umm-1.074
esh-1.047
ators-1.046
Token the
Token ID262
Feature activation-3.58e+00

Pos logit contributions
bs+3.244
 slips+2.638
 net+2.589
 loses+2.588
ata+2.468
Neg logit contributions
ator-3.623
man-2.642
 him-2.623
icles-2.510
umm-2.459
Token rope
Token ID17182
Feature activation-1.11e+00

Pos logit contributions
bs+5.260
 loses+3.597
azz+3.546
ird+3.084
aves+2.990
Neg logit contributions
ator-9.590
man-8.268
 us-7.149
icles-7.132
 me-7.068
Token and
Token ID290
Feature activation-2.17e+00

Pos logit contributions
bs+2.116
 slips+1.602
 loses+1.570
ip+1.509
 knot+1.507
Neg logit contributions
ator-2.428
man-1.941
icles-1.659
umm-1.651
ators-1.578
Token started
Token ID2067
Feature activation-1.48e+00

Pos logit contributions
bs+4.269
ching+3.495
 loses+3.458
 slips+3.437
ume+3.433
Neg logit contributions
ator-4.098
man-3.175
 him-2.847
icles-2.782
ators-2.708
Token to
Token ID284
Feature activation-1.53e+00

Pos logit contributions
bs+3.051
 slips+2.372
ata+2.354
 loses+2.343
 net+2.293
Neg logit contributions
ator-2.832
man-2.177
icles-1.887
 birth-1.871
 him-1.859
Token swing
Token ID9628
Feature activation-1.29e-02

Pos logit contributions
bs+2.719
 slips+1.956
 loses+1.952
ata+1.952
ip+1.942
Neg logit contributions
ator-3.506
man-2.733
umm-2.484
esh-2.467
icles-2.445
Token They
Token ID1119
Feature activation-8.42e-01

Pos logit contributions
bs+0.470
 slips+0.349
 net+0.348
 knot+0.342
 loses+0.341
Neg logit contributions
ator-0.558
man-0.435
umm-0.403
icles-0.385
 birth-0.371
Token tried
Token ID3088
Feature activation-1.16e+00

Pos logit contributions
bs+1.675
 slips+1.327
 loses+1.283
 net+1.278
 knot+1.266
Neg logit contributions
ator-1.678
man-1.283
icles-1.118
umm-1.108
ators-1.091
Token to
Token ID284
Feature activation-7.43e-01

Pos logit contributions
bs+2.402
 slips+1.814
ata+1.776
 knot+1.763
 net+1.762
Neg logit contributions
ator-2.318
man-1.797
icles-1.540
 him-1.530
 birth-1.494
Token unt
Token ID1418
Feature activation-4.30e-01

Pos logit contributions
bs+1.158
ip+0.794
 slips+0.785
ata+0.736
 net+0.733
Neg logit contributions
ator-1.895
man-1.514
umm-1.382
icles-1.359
ators-1.329
Tokenie
Token ID494
Feature activation-1.23e+00

Pos logit contributions
bs+1.046
 net+0.855
 slips+0.851
ata+0.832
 knot+0.831
Neg logit contributions
ator-0.696
man-0.462
umm-0.396
icles-0.393
ators-0.380
Token the
Token ID262
Feature activation-3.56e+00

Pos logit contributions
bs+2.366
 slips+1.815
 net+1.783
 loses+1.728
ata+1.727
Neg logit contributions
ator-2.601
man-2.008
icles-1.825
 him-1.749
umm-1.690
Token knot
Token ID29654
Feature activation-6.16e-01

Pos logit contributions
bs+5.275
azz+3.477
ird+3.069
aves+2.964
ip+2.957
Neg logit contributions
ator-9.459
man-8.099
 us-7.432
 me-7.411
umm-7.295
Token,
Token ID11
Feature activation-1.75e+00

Pos logit contributions
bs+1.032
 slips+0.732
 net+0.732
ip+0.705
 knot+0.705
Neg logit contributions
ator-1.457
man-1.176
umm-1.039
icles-1.033
ators-0.997
Token but
Token ID475
Feature activation-6.50e-01

Pos logit contributions
bs+3.382
ip+2.462
 slips+2.442
ata+2.413
 net+2.368
Neg logit contributions
ator-3.644
man-2.810
icles-2.645
 him-2.617
 us-2.484
Token it
Token ID340
Feature activation-3.00e-02

Pos logit contributions
bs+1.056
 slips+0.756
ata+0.751
 net+0.745
ip+0.725
Neg logit contributions
ator-1.539
man-1.230
icles-1.100
umm-1.092
ators-1.068
Token was
Token ID373
Feature activation-6.99e-01

Pos logit contributions
bs+0.062
 net+0.047
 slips+0.046
 knot+0.045
ip+0.045
Neg logit contributions
ator-0.060
man-0.046
umm-0.041
icles-0.039
ators-0.038
Token his
Token ID465
Feature activation-2.29e+00

Pos logit contributions
ator+1.649
man+1.307
umm+1.240
icles+1.170
ators+1.159
Neg logit contributions
bs-1.055
 net-0.738
 slips-0.732
 knot-0.713
 loop-0.696
Token bag
Token ID6131
Feature activation-5.98e-01

Pos logit contributions
bs+4.408
ata+3.493
ver+3.260
ching+3.170
ip+3.046
Neg logit contributions
ator-4.907
man-3.992
 us-3.358
ators-3.277
 birth-3.264
Token and
Token ID290
Feature activation-1.67e+00

Pos logit contributions
bs+0.908
 slips+0.615
 net+0.605
 knot+0.601
 loses+0.598
Neg logit contributions
ator-1.538
man-1.285
umm-1.128
icles-1.127
ators-1.087
Token took
Token ID1718
Feature activation+6.22e-02

Pos logit contributions
bs+3.036
 knot+2.376
ching+2.342
op+2.318
 slips+2.278
Neg logit contributions
ator-3.453
man-2.797
ators-2.426
icles-2.400
 him-2.371
Token out
Token ID503
Feature activation-5.99e-01

Pos logit contributions
ator+0.150
man+0.119
umm+0.111
icles+0.107
ators+0.105
Neg logit contributions
bs-0.102
 slips-0.072
 net-0.072
 knot-0.070
 loses-0.069
Token a
Token ID257
Feature activation-3.54e+00

Pos logit contributions
bs+1.053
 slips+0.796
 loses+0.762
 net+0.761
 knot+0.756
Neg logit contributions
ator-1.375
man-1.080
icles-0.982
ators-0.939
umm-0.938
Token jar
Token ID17379
Feature activation-9.48e-01

Pos logit contributions
bs+6.757
 loses+5.635
ver+5.250
 pads+5.212
ip+5.088
Neg logit contributions
ator-7.359
man-6.598
 guest-5.011
 birth-4.918
 us-4.915
Token.
Token ID13
Feature activation-8.20e-01

Pos logit contributions
bs+1.607
 slips+1.202
 net+1.164
 knot+1.162
 loses+1.159
Neg logit contributions
ator-2.224
man-1.834
umm-1.586
icles-1.573
ators-1.477
Token He
Token ID679
Feature activation-4.64e-01

Pos logit contributions
bs+1.947
 net+1.549
 knot+1.536
 slips+1.525
ata+1.506
Neg logit contributions
ator-1.374
man-0.998
umm-0.885
icles-0.853
omm-0.779
Token said
Token ID531
Feature activation-5.57e-01

Pos logit contributions
bs+0.792
 net+0.564
 slips+0.562
 knot+0.554
 loses+0.543
Neg logit contributions
ator-1.075
man-0.861
umm-0.766
icles-0.743
ators-0.739
Token it
Token ID340
Feature activation+7.11e-01

Pos logit contributions
bs+0.884
 knot+0.628
ata+0.623
 slips+0.622
 net+0.620
Neg logit contributions
ator-1.347
man-1.103
icles-0.979
umm-0.975
 birth-0.946
Token.
Token ID13
Feature activation-6.52e-01

Pos logit contributions
ator+3.810
man+3.043
umm+2.942
ators+2.780
icles+2.771
Neg logit contributions
bs-2.066
 net-1.372
 slips-1.342
 loses-1.291
 loop-1.273
Token They
Token ID1119
Feature activation-3.02e-01

Pos logit contributions
bs+1.280
 knot+1.001
 slips+0.985
 net+0.983
ip+0.973
Neg logit contributions
ator-1.330
man-1.020
umm-0.940
icles-0.888
 birth-0.881
Token decide
Token ID5409
Feature activation-8.64e-01

Pos logit contributions
bs+0.510
 net+0.369
 slips+0.368
ip+0.358
ata+0.356
Neg logit contributions
ator-0.693
man-0.543
umm-0.500
icles-0.476
 birth-0.471
Token to
Token ID284
Feature activation+7.73e-01

Pos logit contributions
bs+1.744
 slips+1.385
 net+1.354
ata+1.349
 knot+1.347
Neg logit contributions
ator-1.742
man-1.378
 birth-1.165
icles-1.154
 him-1.128
Token find
Token ID1064
Feature activation-1.56e+00

Pos logit contributions
ator+1.677
man+1.282
umm+1.209
icles+1.131
ators+1.113
Neg logit contributions
bs-1.466
 net-1.098
 slips-1.092
 knot-1.076
 loop-1.062
Token another
Token ID1194
Feature activation-3.54e+00

Pos logit contributions
bs+2.632
 slips+1.972
op+1.965
ip+1.958
ata+1.956
Neg logit contributions
ator-3.661
man-2.883
 him-2.691
icles-2.633
 us-2.579
Token game
Token ID983
Feature activation+1.19e-01

Pos logit contributions
bs+8.636
ume+6.546
azz+6.523
 loses+6.465
ip+6.329
Neg logit contributions
ator-6.707
man-5.617
 next-4.462
umm-4.183
 us-4.165
Token to
Token ID284
Feature activation-1.66e+00

Pos logit contributions
ator+0.285
man+0.226
umm+0.207
icles+0.199
ators+0.197
Neg logit contributions
bs-0.199
 slips-0.141
 net-0.141
 knot-0.139
ip-0.136
Token play
Token ID711
Feature activation-1.17e+00

Pos logit contributions
bs+3.635
ip+2.819
 slips+2.740
ata+2.709
op+2.629
Neg logit contributions
ator-3.290
man-2.355
 him-2.061
 us-2.029
ators-2.004
Token.
Token ID13
Feature activation-9.55e-01

Pos logit contributions
bs+2.407
 slips+1.799
ip+1.772
ata+1.698
 loses+1.689
Neg logit contributions
ator-2.473
man-1.895
ators-1.621
 him-1.576
umm-1.567
Token They
Token ID1119
Feature activation-1.20e-01

Pos logit contributions
bs+1.967
 knot+1.540
 net+1.529
ip+1.523
 slips+1.505
Neg logit contributions
ator-1.848
man-1.421
umm-1.289
icles-1.218
 birth-1.207
Token't
Token ID470
Feature activation-8.28e-01

Pos logit contributions
bs+3.702
 net+3.095
 knot+3.046
 slips+2.988
 loop+2.961
Neg logit contributions
ator-1.451
man-0.825
icles-0.575
umm-0.538
 him-0.524
Token be
Token ID307
Feature activation-1.40e+00

Pos logit contributions
bs+1.379
 net+0.985
ip+0.983
 slips+0.968
 knot+0.935
Neg logit contributions
ator-1.976
man-1.610
icles-1.392
umm-1.389
 him-1.364
Token able
Token ID1498
Feature activation-4.81e-01

Pos logit contributions
bs+2.262
 slips+1.568
 knot+1.550
 net+1.550
ip+1.522
Neg logit contributions
ator-3.455
man-2.783
ators-2.454
 him-2.406
ping-2.401
Token to
Token ID284
Feature activation-1.25e+00

Pos logit contributions
bs+0.930
 slips+0.698
 net+0.687
ip+0.687
 knot+0.685
Neg logit contributions
ator-1.000
man-0.766
umm-0.694
 birth-0.671
icles-0.668
Token get
Token ID651
Feature activation-5.49e-01

Pos logit contributions
bs+2.370
ip+1.729
 slips+1.675
ata+1.668
 net+1.612
Neg logit contributions
ator-2.845
man-2.177
 him-1.960
icles-1.950
umm-1.904
Token a
Token ID257
Feature activation-3.52e+00

Pos logit contributions
bs+0.979
ata+0.738
 slips+0.726
 knot+0.720
 net+0.714
Neg logit contributions
ator-1.206
man-0.937
icles-0.848
umm-0.847
 him-0.832
Token good
Token ID922
Feature activation-2.04e+00

Pos logit contributions
bs+7.847
ata+6.106
imes+6.030
ems+5.826
ip+5.665
Neg logit contributions
ator-6.498
man-5.837
umm-4.624
 Driver-4.352
 us-4.247
Token shot
Token ID2823
Feature activation-1.24e+00

Pos logit contributions
bs+4.399
ata+3.433
ip+3.280
 loses+3.231
 slips+3.213
Neg logit contributions
ator-3.948
man-3.145
umm-2.892
icles-2.517
 birth-2.508
Token.
Token ID13
Feature activation+8.06e-01

Pos logit contributions
bs+2.316
 slips+1.673
ip+1.637
 net+1.634
ata+1.595
Neg logit contributions
ator-2.760
man-2.237
icles-1.954
umm-1.939
ators-1.883
Token\n
Token ID198
Feature activation+4.24e-01

Pos logit contributions
ator+2.003
man+1.591
umm+1.489
icles+1.415
ators+1.412
Neg logit contributions
bs-1.303
 net-0.906
 slips-0.906
 knot-0.869
ip-0.862
Token\n
Token ID198
Feature activation+2.92e-01

Pos logit contributions
ator+1.058
man+0.842
umm+0.791
icles+0.754
ators+0.745
Neg logit contributions
bs-0.668
 net-0.465
 slips-0.463
 knot-0.450
 loses-0.440
Token he
Token ID339
Feature activation-5.76e-01

Pos logit contributions
ator+1.845
man+1.420
umm+1.350
icles+1.269
ators+1.257
Neg logit contributions
bs-1.448
 net-1.047
 slips-1.039
 knot-1.019
 tie-1.001
Token replied
Token ID8712
Feature activation+1.91e-01

Pos logit contributions
bs+0.909
 slips+0.663
 net+0.661
 loses+0.635
 knot+0.614
Neg logit contributions
ator-1.400
man-1.174
umm-1.020
icles-0.984
omm-0.978
Token.
Token ID13
Feature activation+6.87e-01

Pos logit contributions
ator+0.461
man+0.365
umm+0.338
icles+0.323
ators+0.318
Neg logit contributions
bs-0.318
 slips-0.229
 net-0.228
 knot-0.221
 loses-0.217
Token He
Token ID679
Feature activation-9.87e-01

Pos logit contributions
ator+1.651
man+1.295
umm+1.224
icles+1.159
ators+1.149
Neg logit contributions
bs-1.153
 net-0.816
 slips-0.815
 knot-0.791
ip-0.778
Token took
Token ID1718
Feature activation+4.08e-01

Pos logit contributions
bs+1.829
 slips+1.376
 net+1.373
 loses+1.359
 knot+1.286
Neg logit contributions
ator-2.150
man-1.747
umm-1.452
esh-1.414
ators-1.410
Token a
Token ID257
Feature activation-3.51e+00

Pos logit contributions
ator+1.052
man+0.843
umm+0.796
icles+0.760
ators+0.750
Neg logit contributions
bs-0.613
 slips-0.416
 net-0.416
 knot-0.401
 loses-0.393
Token long
Token ID890
Feature activation-2.44e+00

Pos logit contributions
bs+6.655
 loses+5.715
 slips+5.322
imes+5.259
 pads+5.061
Neg logit contributions
ator-7.107
man-6.282
 me-4.876
 us-4.543
 mantle-4.540
Token piece
Token ID3704
Feature activation-8.93e-01

Pos logit contributions
bs+4.390
 loses+3.529
 slips+3.289
ata+3.033
ip+2.910
Neg logit contributions
ator-5.494
man-4.465
esh-3.797
icles-3.639
zie-3.631
Token of
Token ID286
Feature activation-2.69e+00

Pos logit contributions
bs+1.376
 slips+0.965
 loses+0.941
 net+0.914
ip+0.899
Neg logit contributions
ator-2.247
man-1.881
umm-1.678
icles-1.634
 him-1.569
Token string
Token ID4731
Feature activation-9.62e-01

Pos logit contributions
bs+4.314
 loses+3.443
 slips+3.402
azz+3.315
ver+3.277
Neg logit contributions
ator-6.322
man-5.033
 him-4.820
icles-4.733
 us-4.671
Token and
Token ID290
Feature activation-2.53e+00

Pos logit contributions
bs+1.596
 slips+1.134
 loses+1.110
 net+1.070
ata+1.066
Neg logit contributions
ator-2.342
man-1.935
umm-1.689
icles-1.686
 him-1.601
Token fly
Token ID6129
Feature activation-1.44e+00

Pos logit contributions
bs+3.474
ip+2.739
op+2.515
ata+2.515
ver+2.509
Neg logit contributions
ator-3.546
man-2.697
umm-2.242
 birth-2.223
 us-2.192
Token.
Token ID13
Feature activation-7.14e-01

Pos logit contributions
bs+2.696
 loses+1.962
 slips+1.949
ip+1.941
ata+1.854
Neg logit contributions
ator-3.301
man-2.659
ators-2.195
icles-2.191
umm-2.186
Token They
Token ID1119
Feature activation-3.38e-01

Pos logit contributions
bs+1.481
 net+1.138
 knot+1.137
ata+1.127
ip+1.123
Neg logit contributions
ator-1.389
man-1.043
umm-0.987
icles-0.941
 birth-0.925
Token look
Token ID804
Feature activation-7.73e-01

Pos logit contributions
bs+0.536
ip+0.378
 net+0.374
ata+0.363
 slips+0.362
Neg logit contributions
ator-0.828
man-0.648
umm-0.600
ators-0.584
icles-0.578
Token at
Token ID379
Feature activation-1.19e-01

Pos logit contributions
bs+1.361
 net+1.030
 slips+1.029
ata+1.010
 knot+1.006
Neg logit contributions
ator-1.689
man-1.302
umm-1.167
 birth-1.166
 him-1.141
Token each
Token ID1123
Feature activation-3.50e+00

Pos logit contributions
bs+0.220
 slips+0.161
 net+0.159
ata+0.159
ip+0.157
Neg logit contributions
ator-0.270
man-0.207
umm-0.191
icles-0.188
ators-0.184
Token other
Token ID584
Feature activation-1.06e+00

Pos logit contributions
bs+8.508
 slips+7.027
ip+7.004
 loses+6.771
ver+6.671
Neg logit contributions
ator-6.088
man-4.484
 us-4.091
 him-3.690
 them-3.671
Token.
Token ID13
Feature activation-5.10e-01

Pos logit contributions
bs+1.889
 net+1.397
 slips+1.390
ip+1.350
ata+1.335
Neg logit contributions
ator-2.425
man-1.913
umm-1.699
icles-1.690
ators-1.666
Token They
Token ID1119
Feature activation-2.36e-01

Pos logit contributions
bs+1.025
 net+0.786
 knot+0.784
 slips+0.772
ata+0.768
Neg logit contributions
ator-1.027
man-0.778
umm-0.734
icles-0.698
 birth-0.688
Token smile
Token ID8212
Feature activation-3.26e-01

Pos logit contributions
bs+0.371
ip+0.259
 net+0.259
 slips+0.252
ata+0.251
Neg logit contributions
ator-0.580
man-0.456
umm-0.423
ators-0.409
icles-0.407
Token.
Token ID13
Feature activation-2.40e-01

Pos logit contributions
bs+0.529
 net+0.378
 slips+0.368
 knot+0.366
ip+0.358
Neg logit contributions
ator-0.779
man-0.634
umm-0.577
icles-0.551
 birth-0.550
Token was
Token ID373
Feature activation-6.02e-01

Pos logit contributions
bs+1.656
 slips+1.255
 net+1.247
 loses+1.212
 knot+1.199
Neg logit contributions
ator-1.712
man-1.365
umm-1.140
icles-1.111
ators-1.093
Token excited
Token ID6568
Feature activation+9.86e-01

Pos logit contributions
bs+1.205
 slips+0.883
ip+0.875
 net+0.870
 knot+0.859
Neg logit contributions
ator-1.289
man-1.002
umm-0.858
opher-0.831
ators-0.831
Token!
Token ID0
Feature activation-6.50e-01

Pos logit contributions
ator+2.709
man+2.181
umm+2.086
ators+1.989
icles+1.985
Neg logit contributions
bs-1.338
 net-0.866
 slips-0.850
 knot-0.822
 tie-0.806
Token He
Token ID679
Feature activation-7.71e-01

Pos logit contributions
bs+1.728
 net+1.395
ata+1.373
 knot+1.369
 slips+1.366
Neg logit contributions
ator-0.927
man-0.634
umm-0.587
icles-0.493
omm-0.485
Token grabbed
Token ID13176
Feature activation-1.59e+00

Pos logit contributions
bs+1.366
 slips+1.010
 net+0.986
 loses+0.980
ata+0.960
Neg logit contributions
ator-1.702
man-1.373
umm-1.183
 birth-1.142
icles-1.138
Token the
Token ID262
Feature activation-3.48e+00

Pos logit contributions
bs+3.287
 loses+2.608
 slips+2.590
ata+2.534
 knot+2.426
Neg logit contributions
ator-3.119
man-2.342
 him-2.206
icles-2.184
 himself-2.015
Token rope
Token ID17182
Feature activation-1.04e+00

Pos logit contributions
bs+4.946
azz+3.429
 loses+3.259
+3.048
ird+3.040
Neg logit contributions
ator-9.386
man-8.320
 us-7.040
 next-6.876
icles-6.874
Token and
Token ID290
Feature activation-2.11e+00

Pos logit contributions
bs+1.689
 slips+1.177
 loses+1.152
ata+1.109
ip+1.108
Neg logit contributions
ator-2.603
man-2.162
umm-1.896
icles-1.881
esh-1.820
Token began
Token ID2540
Feature activation-1.07e+00

Pos logit contributions
bs+4.156
ume+3.169
 loses+3.168
 slips+3.135
azz+3.134
Neg logit contributions
ator-4.126
man-3.359
esh-2.900
 him-2.855
icles-2.845
Token to
Token ID284
Feature activation-1.49e+00

Pos logit contributions
bs+2.165
ata+1.649
 slips+1.636
 loses+1.609
 knot+1.603
Neg logit contributions
ator-2.158
man-1.684
esh-1.449
umm-1.409
icles-1.404
Token swing
Token ID9628
Feature activation+2.49e-01

Pos logit contributions
bs+2.635
ip+1.923
ata+1.849
 slips+1.811
 loses+1.790
Neg logit contributions
ator-3.459
man-2.802
esh-2.476
umm-2.390
 him-2.369
Token\n
Token ID198
Feature activation+3.45e-01

Pos logit contributions
ator+0.417
man+0.329
umm+0.307
icles+0.292
ators+0.286
Neg logit contributions
bs-0.300
 net-0.216
 slips-0.215
 knot-0.211
 loses-0.206
Token\n
Token ID198
Feature activation+4.28e-01

Pos logit contributions
ator+0.856
man+0.682
umm+0.635
icles+0.609
ators+0.599
Neg logit contributions
bs-0.549
 net-0.383
 slips-0.383
 knot-0.372
 loses-0.363
TokenL
Token ID43
Feature activation+6.86e-01

Pos logit contributions
ator+0.951
man+0.734
umm+0.681
icles+0.642
ators+0.635
Neg logit contributions
bs-0.802
 net-0.593
 slips-0.593
 knot-0.576
 loses-0.568
Tokenulu
Token ID15712
Feature activation-3.31e-01

Pos logit contributions
ator+1.545
man+1.198
umm+1.120
icles+1.052
ators+1.044
Neg logit contributions
bs-1.262
 slips-0.925
 net-0.924
 knot-0.899
ip-0.887
Token took
Token ID1718
Feature activation-4.82e-01

Pos logit contributions
bs+0.584
 net+0.422
 slips+0.421
 knot+0.410
 loses+0.404
Neg logit contributions
ator-0.753
man-0.597
umm-0.529
icles-0.507
ators-0.507
Token a
Token ID257
Feature activation-3.47e+00

Pos logit contributions
bs+0.826
 slips+0.613
 net+0.596
 loses+0.593
 knot+0.584
Neg logit contributions
ator-1.108
man-0.877
umm-0.805
icles-0.784
 birth-0.763
Token deep
Token ID2769
Feature activation-2.80e-01

Pos logit contributions
bs+6.390
 loses+5.529
imes+5.211
 pads+5.090
ip+5.082
Neg logit contributions
ator-6.605
man-6.244
 me-4.699
 us-4.440
opher-4.329
Token breath
Token ID8033
Feature activation-1.58e+00

Pos logit contributions
bs+0.632
 slips+0.494
 loses+0.486
ata+0.486
 net+0.486
Neg logit contributions
ator-0.505
man-0.369
umm-0.321
icles-0.303
esh-0.295
Token and
Token ID290
Feature activation-2.88e+00

Pos logit contributions
bs+2.596
 slips+1.977
 net+1.911
 knot+1.869
 loses+1.853
Neg logit contributions
ator-3.707
man-3.048
 him-2.679
umm-2.640
icles-2.582
Token decided
Token ID3066
Feature activation-6.30e-01

Pos logit contributions
bs+5.411
op+4.573
ume+4.497
imes+4.389
 knot+4.278
Neg logit contributions
ator-5.350
man-4.488
 him-4.010
esh-3.865
 me-3.734
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
bs+1.226
 net+0.929
 slips+0.928
ata+0.918
 knot+0.906
Neg logit contributions
ator-1.302
man-1.017
umm-0.870
icles-0.867
 birth-0.851
Token it
Token ID340
Feature activation-7.91e-02

Pos logit contributions
bs+1.951
 net+1.363
 slips+1.352
ata+1.319
ip+1.287
Neg logit contributions
ator-2.489
man-1.882
 him-1.751
icles-1.717
 himself-1.694
Token again
Token ID757
Feature activation-9.53e-01

Pos logit contributions
bs+0.149
 net+0.109
 slips+0.108
 knot+0.104
ip+0.103
Neg logit contributions
ator-0.176
man-0.135
umm-0.125
icles-0.119
ators-0.117
Token.
Token ID13
Feature activation+3.49e-01

Pos logit contributions
bs+1.819
 slips+1.334
 net+1.314
ata+1.283
 knot+1.280
Neg logit contributions
ator-2.108
man-1.643
umm-1.495
ators-1.401
icles-1.393
Token He
Token ID679
Feature activation-6.00e-01

Pos logit contributions
ator+0.643
man+0.468
umm+0.428
icles+0.396
ators+0.385
Neg logit contributions
bs-0.786
 net-0.616
 slips-0.613
 knot-0.602
 loses-0.594
Token tried
Token ID3088
Feature activation-1.24e+00

Pos logit contributions
bs+1.069
 slips+0.760
 net+0.758
 loses+0.742
 knot+0.733
Neg logit contributions
ator-1.338
man-1.062
umm-0.938
ators-0.917
icles-0.916
Token another
Token ID1194
Feature activation-3.47e+00

Pos logit contributions
bs+2.572
ata+1.830
 net+1.819
 slips+1.814
 knot+1.766
Neg logit contributions
ator-2.535
man-1.944
 him-1.757
icles-1.697
 himself-1.663
Token,
Token ID11
Feature activation-2.19e+00

Pos logit contributions
bs+7.156
ip+5.172
ume+4.946
imes+4.801
ata+4.786
Neg logit contributions
ator-7.221
man-5.871
 me-5.408
umm-5.356
 himself-5.047
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
bs+4.281
op+2.981
ata+2.962
ip+2.853
ume+2.842
Neg logit contributions
ator-4.516
man-3.487
 him-3.341
umm-3.274
icles-3.265
Token another
Token ID1194
Feature activation-2.47e+00

Pos logit contributions
bs+3.741
 net+2.896
 knot+2.861
 slips+2.834
op+2.810
Neg logit contributions
ator-2.880
man-2.151
 him-1.896
icles-1.837
umm-1.769
Token.
Token ID13
Feature activation+3.22e-01

Pos logit contributions
bs+5.125
 slips+3.717
 loses+3.683
ip+3.681
imes+3.592
Neg logit contributions
ator-5.018
man-4.231
umm-3.676
 him-3.239
icles-3.178
Token He
Token ID679
Feature activation-8.87e-01

Pos logit contributions
ator+0.619
man+0.456
umm+0.421
icles+0.392
ators+0.381
Neg logit contributions
bs-0.696
 net-0.540
 slips-0.536
 knot-0.527
 loses-0.520
Token showed
Token ID3751
Feature activation+6.22e-01

Pos logit contributions
ator+0.428
man+0.340
umm+0.311
icles+0.297
ators+0.295
Neg logit contributions
bs-0.310
 net-0.222
 slips-0.221
 knot-0.214
 loses-0.211
Token them
Token ID606
Feature activation-4.24e-01

Pos logit contributions
ator+1.006
man+0.688
umm+0.621
icles+0.560
ators+0.550
Neg logit contributions
bs-1.531
 net-1.229
 slips-1.223
 knot-1.206
ip-1.191
Token how
Token ID703
Feature activation-1.57e+00

Pos logit contributions
bs+0.824
 slips+0.630
 net+0.613
ata+0.609
ip+0.609
Neg logit contributions
ator-0.888
man-0.673
icles-0.601
umm-0.600
 him-0.582
Token to
Token ID284
Feature activation-9.43e-01

Pos logit contributions
bs+3.059
ata+2.618
 net+2.461
 slips+2.453
 knot+2.441
Neg logit contributions
ator-2.910
man-2.211
 him-2.005
umm-1.915
 birth-1.915
Token snap
Token ID11495
Feature activation-1.44e+00

Pos logit contributions
bs+1.551
ip+1.069
 net+1.055
 slips+1.043
ata+1.021
Neg logit contributions
ator-2.363
man-1.863
icles-1.653
ators-1.642
umm-1.641
Token the
Token ID262
Feature activation-3.47e+00

Pos logit contributions
bs+2.647
 slips+2.016
ata+2.009
 net+1.969
 loses+1.938
Neg logit contributions
ator-3.095
man-2.388
icles-2.199
 him-2.186
ators-2.111
Token knot
Token ID29654
Feature activation-1.15e+00

Pos logit contributions
bs+4.695
azz+2.987
ip+2.842
ver+2.700
ata+2.686
Neg logit contributions
ator-9.398
man-7.791
 me-7.576
umm-7.523
 us-7.494
Token by
Token ID416
Feature activation-1.64e+00

Pos logit contributions
bs+2.085
 net+1.495
 slips+1.494
ip+1.487
 loses+1.457
Neg logit contributions
ator-2.611
man-2.100
umm-1.797
icles-1.793
ators-1.739
Token pulling
Token ID10427
Feature activation-2.04e+00

Pos logit contributions
bs+3.652
ip+2.749
 slips+2.714
ata+2.697
 loses+2.667
Neg logit contributions
ator-3.294
man-2.428
icles-2.146
umm-2.043
ping-2.025
Token it
Token ID340
Feature activation-6.66e-01

Pos logit contributions
bs+3.819
 slips+2.846
ata+2.819
 loses+2.775
 net+2.772
Neg logit contributions
ator-4.464
man-3.421
 him-3.316
icles-3.100
 us-3.086
Token really
Token ID1107
Feature activation-1.39e+00

Pos logit contributions
bs+1.271
 net+0.954
ip+0.931
 slips+0.929
ata+0.913
Neg logit contributions
ator-1.406
man-1.103
umm-0.990
icles-0.962
ators-0.916
Token day
Token ID1110
Feature activation-1.14e+00

Pos logit contributions
bs+2.561
 loses+2.126
 slips+2.115
 net+2.077
 knot+2.067
Neg logit contributions
ator-1.429
man-1.066
umm-0.796
icles-0.740
 birth-0.727
Token she
Token ID673
Feature activation-4.08e-01

Pos logit contributions
bs+1.924
ata+1.402
 net+1.397
 knot+1.385
 loses+1.365
Neg logit contributions
ator-2.552
man-2.088
icles-1.903
 him-1.887
umm-1.835
Token got
Token ID1392
Feature activation-6.96e-01

Pos logit contributions
bs+0.815
 net+0.612
 knot+0.605
 slips+0.600
ip+0.596
Neg logit contributions
ator-0.837
man-0.645
umm-0.557
ators-0.552
 birth-0.546
Token a
Token ID257
Feature activation-1.56e+00

Pos logit contributions
bs+1.134
 knot+0.831
 slips+0.813
ata+0.804
 net+0.798
Neg logit contributions
ator-1.616
man-1.296
icles-1.166
 birth-1.161
umm-1.156
Token little
Token ID1310
Feature activation-1.97e+00

Pos logit contributions
bs+2.967
 loses+2.100
ip+2.099
 slips+2.081
ata+2.081
Neg logit contributions
ator-3.351
man-2.777
umm-2.440
 birth-2.320
omm-2.233
Token too
Token ID1165
Feature activation-3.46e+00

Pos logit contributions
bs+3.786
ata+2.829
ip+2.745
ver+2.739
 slips+2.670
Neg logit contributions
ator-4.062
man-3.434
umm-3.224
 birth-2.761
icles-2.747
Token close
Token ID1969
Feature activation-6.09e-01

Pos logit contributions
bs+6.642
ata+6.010
 loses+5.902
azz+5.524
 slips+5.481
Neg logit contributions
ator-6.208
man-5.357
ind-4.584
 him-4.570
 birth-4.214
Token to
Token ID284
Feature activation-8.91e-01

Pos logit contributions
bs+1.116
 net+0.822
ata+0.802
 slips+0.799
 knot+0.795
Neg logit contributions
ator-1.336
man-1.059
umm-0.951
icles-0.936
ators-0.901
Token a
Token ID257
Feature activation-1.21e+00

Pos logit contributions
bs+1.671
ip+1.251
 slips+1.249
 net+1.232
ata+1.198
Neg logit contributions
ator-1.960
man-1.486
 him-1.333
icles-1.330
umm-1.329
Token pond
Token ID16723
Feature activation-4.10e-01

Pos logit contributions
bs+2.393
 slips+1.755
 loses+1.718
ip+1.708
ata+1.702
Neg logit contributions
ator-2.539
man-2.081
umm-1.799
 birth-1.708
icles-1.613
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
bs+0.704
 knot+0.497
ip+0.488
 slips+0.487
 net+0.486
Neg logit contributions
ator-0.957
man-0.777
umm-0.691
icles-0.674
ators-0.655
Token,
Token ID11
Feature activation-7.94e-01

Pos logit contributions
ator+0.233
man+0.187
umm+0.176
icles+0.169
ators+0.166
Neg logit contributions
bs-0.133
 net-0.091
 slips-0.090
 knot-0.087
ip-0.086
Token they
Token ID484
Feature activation+6.13e-02

Pos logit contributions
bs+1.332
 slips+0.955
 knot+0.946
ip+0.940
ata+0.929
Neg logit contributions
ator-1.891
man-1.481
umm-1.362
icles-1.339
 birth-1.315
Token looked
Token ID3114
Feature activation-3.33e-03

Pos logit contributions
ator+0.144
man+0.114
umm+0.104
icles+0.101
ators+0.100
Neg logit contributions
bs-0.103
 slips-0.074
 net-0.074
 knot-0.073
ip-0.072
Token around
Token ID1088
Feature activation+9.48e-01

Pos logit contributions
bs+0.007
 slips+0.005
 net+0.005
 knot+0.005
ip+0.005
Neg logit contributions
ator-0.007
man-0.005
umm-0.005
icles-0.004
ators-0.004
Token in
Token ID287
Feature activation+1.29e-01

Pos logit contributions
ator+2.552
man+2.059
umm+1.967
icles+1.872
ators+1.854
Neg logit contributions
bs-1.336
 net-0.890
 slips-0.868
 knot-0.839
-0.822
Token amaz
Token ID40642
Feature activation+2.57e+00

Pos logit contributions
ator+0.306
man+0.239
umm+0.224
icles+0.212
ators+0.211
Neg logit contributions
bs-0.223
 slips-0.159
ip-0.157
 net-0.156
ata-0.153
Tokenement
Token ID972
Feature activation+2.28e-01

Pos logit contributions
ator+4.501
man+3.641
umm+3.133
 Amen+2.924
icles+2.837
Neg logit contributions
bs-5.767
 slips-4.509
 net-4.506
 knot-4.491
ip-4.364
Token.
Token ID13
Feature activation+4.21e-02

Pos logit contributions
ator+0.559
man+0.443
umm+0.415
icles+0.397
ators+0.391
Neg logit contributions
bs-0.367
 net-0.258
 slips-0.257
 knot-0.252
ip-0.245
Token It
Token ID632
Feature activation+6.70e-01

Pos logit contributions
ator+0.092
man+0.070
umm+0.066
icles+0.062
ators+0.061
Neg logit contributions
bs-0.079
 net-0.058
 knot-0.058
 slips-0.058
ip-0.056
Token was
Token ID373
Feature activation+7.45e-01

Pos logit contributions
ator+1.423
man+1.076
umm+1.009
icles+0.945
ators+0.929
Neg logit contributions
bs-1.311
 net-0.986
 slips-0.986
 knot-0.962
 loop-0.948
Token a
Token ID257
Feature activation-5.23e-01

Pos logit contributions
ator+2.007
man+1.625
umm+1.547
icles+1.472
ators+1.459
Neg logit contributions
bs-1.025
 net-0.673
 slips-0.669
 knot-0.642
 loop-0.627
Token the
Token ID262
Feature activation+4.38e-01

Pos logit contributions
ator+1.602
man+1.253
umm+1.182
icles+1.112
ators+1.105
Neg logit contributions
bs-1.152
 net-0.829
 slips-0.822
 knot-0.797
 loses-0.785
Token turkey
Token ID26876
Feature activation-8.60e-02

Pos logit contributions
ator+1.032
man+0.812
umm+0.757
icles+0.717
ators+0.707
Neg logit contributions
bs-0.763
 slips-0.545
 net-0.543
 knot-0.529
ip-0.523
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
bs+0.151
 slips+0.108
 net+0.107
 knot+0.107
ip+0.106
Neg logit contributions
ator-0.196
man-0.158
umm-0.142
icles-0.137
ators-0.133
Token hurt
Token ID5938
Feature activation+6.02e-01

Pos logit contributions
ator+2.663
man+2.095
umm+2.016
ators+1.883
icles+1.862
Neg logit contributions
bs-1.747
 net-1.234
 slips-1.229
 tie-1.200
 knot-1.196
Token their
Token ID511
Feature activation+1.89e+00

Pos logit contributions
ator+1.351
man+1.042
umm+0.977
icles+0.917
ators+0.907
Neg logit contributions
bs-1.109
 net-0.818
 slips-0.815
 knot-0.795
 loses-0.780
Token snow
Token ID6729
Feature activation+2.27e+00

Pos logit contributions
ator+4.534
man+3.434
umm+3.399
icles+3.185
ators+3.157
Neg logit contributions
bs-3.288
 net-2.577
 knot-2.472
 slips-2.426
 loop-2.415
Tokenman
Token ID805
Feature activation-1.14e+00

Pos logit contributions
ator+2.308
umm+1.009
 birth+0.805
omm+0.687
opher+0.687
Neg logit contributions
bs-6.925
 net-5.960
 slips-5.906
 knot-5.876
 loop-5.826
Token.
Token ID13
Feature activation+4.68e-01

Pos logit contributions
bs+2.162
 slips+1.597
 net+1.539
ip+1.522
ata+1.465
Neg logit contributions
ator-2.516
man-2.042
umm-1.746
icles-1.706
 him-1.668
Token They
Token ID1119
Feature activation-2.13e-01

Pos logit contributions
ator+1.061
man+0.823
umm+0.768
icles+0.726
ators+0.716
Neg logit contributions
bs-0.851
 net-0.624
 slips-0.621
 knot-0.608
ip-0.597
Token pick
Token ID2298
Feature activation+3.15e-02

Pos logit contributions
bs+0.331
 net+0.230
ip+0.229
 slips+0.224
ata+0.222
Neg logit contributions
ator-0.518
man-0.412
umm-0.380
icles-0.369
ators-0.366
Token up
Token ID510
Feature activation-7.84e-01

Pos logit contributions
ator+0.058
man+0.043
umm+0.038
icles+0.037
ators+0.035
Neg logit contributions
bs-0.070
 slips-0.055
 net-0.054
 knot-0.054
 loses-0.053
Token able
Token ID1498
Feature activation+5.70e-01

Pos logit contributions
bs+0.305
 slips+0.223
 knot+0.220
 net+0.219
ip+0.212
Neg logit contributions
ator-0.398
man-0.313
umm-0.282
ators-0.271
 birth-0.268
Token to
Token ID284
Feature activation-1.34e-01

Pos logit contributions
ator+1.337
man+1.043
umm+0.980
icles+0.927
ators+0.917
Neg logit contributions
bs-0.996
 net-0.718
 slips-0.716
 knot-0.697
ip-0.684
Token help
Token ID1037
Feature activation+5.23e-01

Pos logit contributions
bs+0.266
 slips+0.201
 net+0.200
ip+0.198
 knot+0.193
Neg logit contributions
ator-0.283
man-0.215
umm-0.190
icles-0.182
ators-0.181
Token out
Token ID503
Feature activation-3.48e-02

Pos logit contributions
ator+1.191
man+0.923
umm+0.864
icles+0.816
ators+0.805
Neg logit contributions
bs-0.948
 net-0.693
 slips-0.692
 knot-0.674
ip-0.662
Token Mr
Token ID1770
Feature activation+1.21e+00

Pos logit contributions
bs+0.057
 net+0.041
 slips+0.041
 knot+0.040
ip+0.039
Neg logit contributions
ator-0.084
man-0.067
umm-0.061
icles-0.059
ators-0.058
Token.
Token ID13
Feature activation+2.90e+00

Pos logit contributions
ator+3.136
man+2.498
umm+2.381
icles+2.266
ators+2.245
Neg logit contributions
bs-1.787
 slips-1.212
 net-1.205
 knot-1.175
 loses-1.153
Token Jenkins
Token ID21835
Feature activation-2.19e-01

Pos logit contributions
ator+6.629
man+5.064
ators+4.557
umm+4.486
esh+4.320
Neg logit contributions
bs-5.386
-4.306
 slips-4.117
 lime-4.099
 net-4.039
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
bs+0.337
 net+0.235
 slips+0.235
 knot+0.228
 loop+0.220
Neg logit contributions
ator-0.547
man-0.442
umm-0.404
icles-0.388
ators-0.387
Token She
Token ID1375
Feature activation+8.89e-02

Pos logit contributions
ator+2.989
man+2.293
umm+2.108
ators+2.052
icles+2.019
Neg logit contributions
bs-2.327
 slips-1.679
-1.672
 net-1.659
ip-1.649
Token knew
Token ID2993
Feature activation+3.79e-02

Pos logit contributions
ator+0.202
man+0.159
umm+0.144
icles+0.137
ators+0.136
Neg logit contributions
bs-0.159
 net-0.116
 slips-0.116
 knot-0.113
 loses-0.111
Token that
Token ID326
Feature activation+3.46e-01

Pos logit contributions
ator+0.089
man+0.069
umm+0.064
icles+0.062
ators+0.061
Neg logit contributions
bs-0.066
 slips-0.048
 net-0.048
 knot-0.046
 loses-0.045
Token from
Token ID422
Feature activation+1.16e+00

Pos logit contributions
ator+2.248
man+1.711
umm+1.634
ators+1.523
icles+1.511
Neg logit contributions
bs-1.768
 slips-1.305
 net-1.296
 knot-1.264
 loop-1.239
Token the
Token ID262
Feature activation+1.69e+00

Pos logit contributions
ator+2.944
man+2.339
umm+2.257
icles+2.089
ators+2.085
Neg logit contributions
bs-1.765
 net-1.278
 slips-1.239
 knot-1.208
 loop-1.190
Token tree
Token ID5509
Feature activation-3.81e-01

Pos logit contributions
ator+4.264
man+3.342
umm+3.098
ators+3.029
icles+2.998
Neg logit contributions
bs-2.692
 net-2.032
 slips-2.000
 knot-1.977
 loop-1.914
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
bs+0.616
 slips+0.423
 net+0.421
 knot+0.412
ip+0.409
Neg logit contributions
ator-0.927
man-0.759
umm-0.687
icles-0.660
ators-0.641
Token onto
Token ID4291
Feature activation+5.28e-01

Pos logit contributions
ator+2.619
man+2.045
umm+1.970
ators+1.845
icles+1.842
Neg logit contributions
bs-1.656
 slips-1.168
 net-1.163
 knot-1.112
 loses-1.103
Token the
Token ID262
Feature activation+2.28e+00

Pos logit contributions
ator+1.320
man+1.049
umm+0.989
icles+0.941
ators+0.930
Neg logit contributions
bs-0.839
 net-0.582
 slips-0.580
 knot-0.563
ip-0.550
Token ground
Token ID2323
Feature activation+2.73e-01

Pos logit contributions
ator+6.119
man+4.859
umm+4.518
ators+4.451
icles+4.376
Neg logit contributions
bs-3.306
 net-2.492
 knot-2.488
 loop-2.383
 slips-2.349
Token.
Token ID13
Feature activation+5.14e-01

Pos logit contributions
ator+0.665
man+0.529
umm+0.493
icles+0.471
ators+0.463
Neg logit contributions
bs-0.444
 net-0.311
 slips-0.309
 knot-0.303
ip-0.298
Token
Token ID220
Feature activation-2.66e-01

Pos logit contributions
ator+1.198
man+0.935
umm+0.877
icles+0.829
ators+0.818
Neg logit contributions
bs-0.906
 net-0.656
 slips-0.654
 knot-0.637
ip-0.625
Token
Token ID198
Feature activation-1.82e-02

Pos logit contributions
bs+0.454
 net+0.336
 knot+0.328
 slips+0.326
 tie+0.316
Neg logit contributions
ator-0.624
man-0.492
umm-0.444
icles-0.436
ators-0.424
Token
Token ID198
Feature activation+4.31e-01

Pos logit contributions
bs+0.030
 net+0.022
 slips+0.022
 knot+0.021
 loses+0.021
Neg logit contributions
ator-0.044
man-0.035
umm-0.031
icles-0.030
ators-0.030
Token remembered
Token ID12086
Feature activation-4.97e-01

Pos logit contributions
bs+1.304
 slips+0.967
 net+0.964
ip+0.962
 knot+0.955
Neg logit contributions
ator-1.487
man-1.171
umm-1.005
icles-1.004
ators-1.002
Token how
Token ID703
Feature activation-6.34e-01

Pos logit contributions
bs+0.904
 slips+0.662
 knot+0.657
ata+0.657
 net+0.641
Neg logit contributions
ator-1.126
man-0.874
icles-0.793
umm-0.776
 birth-0.769
Token they
Token ID484
Feature activation+8.35e-02

Pos logit contributions
bs+1.153
 net+0.885
ata+0.882
 knot+0.880
 slips+0.878
Neg logit contributions
ator-1.381
man-1.080
icles-0.958
umm-0.952
 birth-0.943
Token made
Token ID925
Feature activation-1.98e-01

Pos logit contributions
ator+0.187
man+0.145
umm+0.132
icles+0.128
ators+0.127
Neg logit contributions
bs-0.150
 net-0.111
 slips-0.111
 knot-0.109
ip-0.106
Token the
Token ID262
Feature activation+1.42e+00

Pos logit contributions
bs+0.357
 slips+0.257
 net+0.253
ip+0.247
 knot+0.247
Neg logit contributions
ator-0.459
man-0.360
umm-0.327
icles-0.324
ators-0.313
Token snow
Token ID6729
Feature activation+2.37e+00

Pos logit contributions
ator+3.321
man+2.487
umm+2.432
ators+2.266
icles+2.264
Neg logit contributions
bs-2.454
 net-1.902
 slips-1.863
 loop-1.862
 knot-1.857
Tokenman
Token ID805
Feature activation-7.04e-02

Pos logit contributions
ator+2.161
umm+0.875
opher+0.569
 birth+0.559
omm+0.521
Neg logit contributions
bs-7.475
 slips-6.486
 net-6.383
 loop-6.358
ata-6.309
Token.
Token ID13
Feature activation-6.53e-01

Pos logit contributions
bs+0.117
 slips+0.083
 net+0.083
 knot+0.081
ip+0.079
Neg logit contributions
ator-0.169
man-0.136
umm-0.123
icles-0.119
ators-0.117
Token How
Token ID1374
Feature activation-1.74e+00

Pos logit contributions
bs+1.350
 net+1.033
 slips+1.027
 knot+1.021
ip+1.014
Neg logit contributions
ator-1.306
man-0.984
umm-0.915
icles-0.869
omm-0.838
Token they
Token ID484
Feature activation+6.17e-01

Pos logit contributions
bs+3.341
 net+2.700
 knot+2.589
 lime+2.519
 tie+2.512
Neg logit contributions
ator-3.619
man-2.847
 him-2.484
icles-2.482
 birth-2.454
Token laughed
Token ID13818
Feature activation-6.38e-01

Pos logit contributions
ator+1.431
man+1.114
umm+1.050
icles+0.989
ators+0.977
Neg logit contributions
bs-1.085
 net-0.784
 slips-0.783
 knot-0.761
 loses-0.748
Token carefully
Token ID7773
Feature activation-4.43e-01

Pos logit contributions
bs+0.461
 net+0.334
ip+0.332
 slips+0.330
ata+0.320
Neg logit contributions
ator-0.566
man-0.449
umm-0.406
icles-0.389
ators-0.378
Token.
Token ID13
Feature activation-4.61e-02

Pos logit contributions
bs+0.776
 net+0.564
 slips+0.556
ip+0.542
 knot+0.539
Neg logit contributions
ator-1.003
man-0.796
umm-0.727
icles-0.708
ators-0.688
Token
Token ID220
Feature activation-2.29e-01

Pos logit contributions
bs+0.090
 net+0.067
 slips+0.067
 knot+0.066
 loses+0.065
Neg logit contributions
ator-0.098
man-0.075
umm-0.070
icles-0.066
 birth-0.064
Token
Token ID198
Feature activation-3.29e-01

Pos logit contributions
bs+0.388
 net+0.287
 knot+0.278
 slips+0.277
 loses+0.268
Neg logit contributions
ator-0.540
man-0.427
umm-0.386
icles-0.380
ators-0.370
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
bs+0.572
 net+0.419
 slips+0.418
 knot+0.412
 loses+0.397
Neg logit contributions
ator-0.765
man-0.602
umm-0.534
icles-0.526
ators-0.514
TokenWhen
Token ID2215
Feature activation+4.05e-01

Pos logit contributions
ator+0.569
man+0.455
umm+0.426
icles+0.405
ators+0.401
Neg logit contributions
bs-0.353
 net-0.243
 slips-0.243
 knot-0.235
ip-0.232
Token the
Token ID262
Feature activation+1.51e+00

Pos logit contributions
ator+0.901
man+0.693
umm+0.647
icles+0.609
ators+0.599
Neg logit contributions
bs-0.761
 slips-0.561
 net-0.561
 knot-0.549
ip-0.539
Token dizz
Token ID29527
Feature activation-4.00e-01

Pos logit contributions
ator+4.211
man+3.397
umm+3.186
ators+3.153
icles+3.129
Neg logit contributions
bs-2.009
 net-1.422
 knot-1.331
 tie-1.316
 loop-1.281
Tokeny
Token ID88
Feature activation+8.09e-01

Pos logit contributions
bs+0.745
 net+0.563
 knot+0.560
 slips+0.549
 tie+0.543
Neg logit contributions
ator-0.884
man-0.686
umm-0.635
icles-0.610
ators-0.582
Token feeling
Token ID4203
Feature activation-4.05e-01

Pos logit contributions
ator+1.952
man+1.525
umm+1.435
icles+1.372
ators+1.362
Neg logit contributions
bs-1.360
 net-0.985
 slips-0.968
 knot-0.953
 tie-0.930
Token had
Token ID550
Feature activation+6.35e-01

Pos logit contributions
bs+0.671
 net+0.493
 knot+0.488
 slips+0.480
 loop+0.470
Neg logit contributions
ator-0.954
man-0.751
umm-0.688
icles-0.665
 birth-0.650
Token,"
Token ID553
Feature activation+9.60e-01

Pos logit contributions
ator+1.351
man+1.065
umm+1.003
icles+0.952
ators+0.941
Neg logit contributions
bs-0.922
 net-0.652
 slips-0.649
 knot-0.632
ip-0.618
Token said
Token ID531
Feature activation+1.07e+00

Pos logit contributions
ator+2.513
man+1.987
umm+1.938
icles+1.830
ators+1.805
Neg logit contributions
bs-1.408
 net-0.939
 slips-0.927
 knot-0.918
-0.899
Token grandma
Token ID49890
Feature activation+5.48e-01

Pos logit contributions
ator+2.105
man+1.510
umm+1.480
icles+1.337
ators+1.317
Neg logit contributions
bs-2.273
 net-1.739
 slips-1.725
 knot-1.711
-1.690
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
ator+1.367
man+1.086
umm+1.026
icles+0.975
ators+0.964
Neg logit contributions
bs-0.864
 net-0.598
 slips-0.596
 knot-0.578
ip-0.566
Token
Token ID198
Feature activation+6.19e-01

Pos logit contributions
ator+0.311
man+0.249
umm+0.228
icles+0.219
ators+0.215
Neg logit contributions
bs-0.214
 net-0.151
 slips-0.151
 knot-0.147
 loses-0.145
Token
Token ID198
Feature activation+7.19e-01

Pos logit contributions
ator+1.561
man+1.243
umm+1.181
icles+1.121
ators+1.108
Neg logit contributions
bs-0.958
 net-0.659
 slips-0.655
 knot-0.637
ip-0.621
TokenCharles
Token ID28711
Feature activation+3.60e-01

Pos logit contributions
ator+1.600
man+1.227
umm+1.162
icles+1.093
ators+1.071
Neg logit contributions
bs-1.328
 net-0.983
 slips-0.976
 knot-0.959
-0.937
Token smiled
Token ID13541
Feature activation+5.50e-01

Pos logit contributions
ator+0.816
man+0.636
umm+0.588
icles+0.557
ators+0.551
Neg logit contributions
bs-0.648
 net-0.475
 slips-0.473
 knot-0.461
 loses-0.453
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
ator+1.463
man+1.181
umm+1.120
icles+1.069
ators+1.058
Neg logit contributions
bs-0.783
 net-0.516
 slips-0.514
 knot-0.496
ip-0.483
Token
Token ID198
Feature activation+1.30e+00

Pos logit contributions
ator+2.840
man+2.153
umm+2.097
icles+1.957
ators+1.939
Neg logit contributions
bs-2.151
 slips-1.577
 net-1.551
-1.539
 knot-1.531
Token
Token ID198
Feature activation+1.21e+00

Pos logit contributions
ator+3.556
umm+2.861
man+2.857
icles+2.685
ators+2.629
Neg logit contributions
bs-1.708
-1.100
 net-1.067
 slips-1.053
 knot-1.049
Token that
Token ID326
Feature activation-8.40e-01

Pos logit contributions
bs+1.443
 slips+1.067
 net+1.025
 knot+0.986
 loses+0.984
Neg logit contributions
ator-2.152
man-1.771
umm-1.567
icles-1.503
 him-1.485
Token she
Token ID673
Feature activation+3.36e-01

Pos logit contributions
bs+1.398
 slips+1.011
 net+0.985
ata+0.962
 loses+0.960
Neg logit contributions
ator-2.012
man-1.616
umm-1.511
 birth-1.434
icles-1.398
Token won
Token ID1839
Feature activation-8.02e-02

Pos logit contributions
ator+0.784
man+0.614
umm+0.571
icles+0.539
ators+0.537
Neg logit contributions
bs-0.585
 net-0.424
 slips-0.420
 knot-0.411
ip-0.402
Token the
Token ID262
Feature activation+3.20e-01

Pos logit contributions
bs+0.142
 net+0.102
 slips+0.101
 knot+0.098
ip+0.097
Neg logit contributions
ator-0.190
man-0.147
umm-0.134
icles-0.130
ators-0.130
Token prize
Token ID11596
Feature activation-2.00e-01

Pos logit contributions
ator+0.680
man+0.517
umm+0.481
icles+0.447
ators+0.444
Neg logit contributions
bs-0.629
 slips-0.469
 net-0.467
 knot-0.454
ip-0.453
Token.
Token ID13
Feature activation+1.73e-01

Pos logit contributions
bs+0.339
 slips+0.241
 net+0.239
 knot+0.232
ip+0.229
Neg logit contributions
ator-0.481
man-0.382
umm-0.349
icles-0.332
ators-0.331
Token He
Token ID679
Feature activation+1.30e-01

Pos logit contributions
ator+0.353
man+0.264
umm+0.248
icles+0.230
ators+0.223
Neg logit contributions
bs-0.353
 net-0.271
 slips-0.269
 knot-0.265
 loses-0.261
Token told
Token ID1297
Feature activation+1.71e+00

Pos logit contributions
ator+0.288
man+0.225
umm+0.205
icles+0.194
ators+0.193
Neg logit contributions
bs-0.239
 net-0.176
 slips-0.175
 knot-0.170
ata-0.168
Token her
Token ID607
Feature activation-9.25e-01

Pos logit contributions
ator+3.667
man+2.840
umm+2.650
ators+2.485
icles+2.385
Neg logit contributions
bs-3.395
 net-2.552
 knot-2.483
 tie-2.478
-2.456
Token,
Token ID11
Feature activation+3.25e-01

Pos logit contributions
bs+1.762
 slips+1.365
ata+1.295
 net+1.295
 loses+1.292
Neg logit contributions
ator-2.000
man-1.569
umm-1.333
icles-1.329
 birth-1.307
Token "
Token ID366
Feature activation+9.38e-01

Pos logit contributions
ator+0.755
man+0.592
umm+0.550
icles+0.523
ators+0.514
Neg logit contributions
bs-0.569
 slips-0.411
 net-0.411
 knot-0.400
 loses-0.394
Token He
Token ID679
Feature activation+3.63e-01

Pos logit contributions
bs+0.185
 net+0.142
 slips+0.142
 knot+0.140
 tie+0.138
Neg logit contributions
ator-0.160
man-0.118
umm-0.110
icles-0.102
omm-0.098
Token sat
Token ID3332
Feature activation+1.30e+00

Pos logit contributions
ator+0.842
man+0.661
umm+0.614
icles+0.583
ators+0.576
Neg logit contributions
bs-0.632
 net-0.456
 slips-0.454
 knot-0.442
ip-0.435
Token up
Token ID510
Feature activation+8.80e-01

Pos logit contributions
ator+3.006
man+2.278
umm+2.249
icles+2.080
 birth+2.068
Neg logit contributions
bs-2.341
 slips-1.733
 net-1.732
ip-1.664
 knot-1.661
Token stra
Token ID3534
Feature activation+2.94e-01

Pos logit contributions
ator+2.268
man+1.797
umm+1.730
icles+1.634
ators+1.622
Neg logit contributions
bs-1.327
 net-0.911
 slips-0.904
 knot-0.866
ip-0.858
Tokenighter
Token ID4799
Feature activation-8.98e-01

Pos logit contributions
ator+0.619
man+0.462
umm+0.431
icles+0.409
ators+0.401
Neg logit contributions
bs-0.586
 net-0.447
 slips-0.443
 knot-0.435
 loop-0.426
Token and
Token ID290
Feature activation+2.78e-01

Pos logit contributions
bs+1.378
 slips+0.922
 net+0.904
 loses+0.899
+0.896
Neg logit contributions
ator-2.297
man-1.871
umm-1.701
icles-1.643
ators-1.638
Token waited
Token ID13488
Feature activation+6.43e-01

Pos logit contributions
ator+0.627
man+0.489
umm+0.455
icles+0.430
ators+0.426
Neg logit contributions
bs-0.501
 net-0.367
 slips-0.364
 knot-0.358
ip-0.351
Token for
Token ID329
Feature activation+1.17e-01

Pos logit contributions
ator+1.459
man+1.130
umm+1.061
icles+0.999
ators+0.988
Neg logit contributions
bs-1.162
 net-0.850
 slips-0.847
 knot-0.825
ip-0.811
Token the
Token ID262
Feature activation-1.18e+00

Pos logit contributions
ator+0.279
man+0.218
umm+0.205
icles+0.197
ators+0.191
Neg logit contributions
bs-0.198
 slips-0.143
 net-0.140
 knot-0.137
ip-0.135
Token cat
Token ID3797
Feature activation-1.47e-01

Pos logit contributions
bs+2.035
 slips+1.483
 loses+1.456
ip+1.435
ata+1.379
Neg logit contributions
ator-2.737
man-2.253
umm-2.136
 birth-1.907
omm-1.885
Token to
Token ID284
Feature activation+3.98e-01

Pos logit contributions
bs+0.267
 slips+0.200
 net+0.198
 knot+0.196
 loop+0.190
Neg logit contributions
ator-0.322
man-0.256
umm-0.227
icles-0.221
ators-0.216
Token be
Token ID307
Feature activation+4.50e-01

Pos logit contributions
ator+4.081
man+3.221
umm+3.113
ators+2.949
icles+2.889
Neg logit contributions
bs-2.734
 net-1.937
 tie-1.928
 slips-1.918
-1.895
Token mad
Token ID8805
Feature activation+7.50e-01

Pos logit contributions
ator+1.076
man+0.846
umm+0.793
icles+0.752
ators+0.744
Neg logit contributions
bs-0.763
 net-0.543
 slips-0.542
 knot-0.527
ip-0.519
Token!"
Token ID2474
Feature activation-4.44e-01

Pos logit contributions
ator+1.821
man+1.430
umm+1.347
icles+1.278
ators+1.267
Neg logit contributions
bs-1.250
 slips-0.887
 net-0.883
 knot-0.858
 loses-0.845
Token
Token ID198
Feature activation-7.49e-01

Pos logit contributions
bs+0.767
 net+0.567
 knot+0.563
ata+0.558
ip+0.547
Neg logit contributions
ator-1.000
man-0.786
umm-0.719
icles-0.699
omm-0.695
Token
Token ID198
Feature activation-1.02e+00

Pos logit contributions
bs+1.334
 net+1.007
 knot+0.987
ata+0.984
 slips+0.979
Neg logit contributions
ator-1.674
man-1.326
icles-1.130
umm-1.128
omm-1.121
TokenBen
Token ID11696
Feature activation+3.19e-01

Pos logit contributions
bs+2.609
ata+2.147
 net+2.119
 slips+2.101
 loses+2.101
Neg logit contributions
ator-1.527
man-1.046
umm-0.784
omm-0.778
esh-0.743
Token was
Token ID373
Feature activation+1.59e-01

Pos logit contributions
ator+0.756
man+0.598
umm+0.554
icles+0.528
ators+0.523
Neg logit contributions
bs-0.542
 net-0.384
 slips-0.380
 knot-0.374
ip-0.368
Token scared
Token ID12008
Feature activation+5.10e-01

Pos logit contributions
ator+0.387
man+0.307
umm+0.282
icles+0.271
ators+0.270
Neg logit contributions
bs-0.263
 net-0.184
 slips-0.182
 knot-0.179
ip-0.178
Token too
Token ID1165
Feature activation-1.58e+00

Pos logit contributions
ator+1.282
man+1.021
umm+0.963
icles+0.916
ators+0.905
Neg logit contributions
bs-0.803
 net-0.556
 slips-0.554
 knot-0.537
ip-0.525
Token.
Token ID13
Feature activation+1.67e+00

Pos logit contributions
bs+3.018
ata+2.301
 net+2.300
 knot+2.282
 slips+2.276
Neg logit contributions
ator-3.346
man-2.623
umm-2.301
 him-2.276
icles-2.185
Token He
Token ID679
Feature activation+6.15e-01

Pos logit contributions
ator+3.269
man+2.323
umm+2.171
ators+2.067
icles+1.974
Neg logit contributions
bs-3.641
 slips-2.932
 net-2.849
 knot-2.806
 loses-2.790
Token.
Token ID13
Feature activation-3.77e-01

Pos logit contributions
bs+1.251
 slips+0.891
 net+0.877
 knot+0.871
 loses+0.852
Neg logit contributions
ator-1.647
man-1.327
umm-1.187
icles-1.141
ators-1.124
Token He
Token ID679
Feature activation-1.41e-02

Pos logit contributions
bs+0.884
 net+0.694
 slips+0.687
 knot+0.687
 loses+0.678
Neg logit contributions
ator-0.660
man-0.480
umm-0.443
icles-0.401
omm-0.388
Token thanked
Token ID26280
Feature activation+1.27e+00

Pos logit contributions
bs+0.027
 net+0.020
 slips+0.020
 knot+0.019
ata+0.019
Neg logit contributions
ator-0.030
man-0.024
umm-0.021
ators-0.020
icles-0.020
Token the
Token ID262
Feature activation+7.15e-01

Pos logit contributions
ator+3.335
man+2.689
umm+2.635
icles+2.430
ators+2.429
Neg logit contributions
bs-1.803
 net-1.245
 tie-1.189
 knot-1.185
 slips-1.173
Token pine
Token ID20161
Feature activation+5.04e-01

Pos logit contributions
ator+1.803
man+1.430
umm+1.352
icles+1.290
ators+1.276
Neg logit contributions
bs-1.129
 net-0.789
 slips-0.781
 knot-0.764
 loop-0.744
Token tree
Token ID5509
Feature activation-4.06e-01

Pos logit contributions
ator+1.224
man+0.968
umm+0.910
icles+0.864
ators+0.853
Neg logit contributions
bs-0.830
 net-0.585
 slips-0.583
 knot-0.566
 loses-0.554
Token for
Token ID329
Feature activation+9.62e-01

Pos logit contributions
bs+0.663
 slips+0.478
 loses+0.457
 net+0.457
 knot+0.448
Neg logit contributions
ator-0.990
man-0.810
umm-0.724
icles-0.699
ators-0.680
Token helping
Token ID5742
Feature activation+9.41e-01

Pos logit contributions
ator+1.995
man+1.500
umm+1.432
icles+1.321
ators+1.300
Neg logit contributions
bs-1.914
 net-1.485
 slips-1.445
 knot-1.433
 tie-1.418
Token him
Token ID683
Feature activation-4.74e-01

Pos logit contributions
ator+1.151
man+0.654
umm+0.583
icles+0.472
ators+0.462
Neg logit contributions
bs-2.708
 net-2.260
 slips-2.237
 knot-2.217
 tie-2.209
Token reach
Token ID3151
Feature activation-1.11e-01

Pos logit contributions
bs+0.817
 net+0.586
 slips+0.586
ip+0.572
 knot+0.564
Neg logit contributions
ator-1.116
man-0.873
umm-0.798
icles-0.772
ators-0.751
Token his
Token ID465
Feature activation+1.65e-04

Pos logit contributions
bs+0.203
 slips+0.146
 net+0.143
ata+0.142
 knot+0.142
Neg logit contributions
ator-0.254
man-0.196
umm-0.181
icles-0.175
ators-0.171
Token always
Token ID1464
Feature activation+1.44e-01

Pos logit contributions
ator+2.413
man+1.818
umm+1.752
icles+1.654
ators+1.600
Neg logit contributions
bs-2.099
 slips-1.562
 net-1.545
-1.536
 knot-1.520
Token said
Token ID531
Feature activation-1.11e-01

Pos logit contributions
ator+0.311
man+0.240
umm+0.221
icles+0.208
ators+0.207
Neg logit contributions
bs-0.272
 net-0.205
 slips-0.202
 knot-0.199
ip-0.196
Token '
Token ID705
Feature activation+1.89e-01

Pos logit contributions
bs+0.198
 slips+0.144
 net+0.143
ata+0.141
 knot+0.140
Neg logit contributions
ator-0.253
man-0.199
umm-0.183
icles-0.176
 birth-0.172
Tokenplease
Token ID29688
Feature activation-9.37e-01

Pos logit contributions
ator+0.357
man+0.263
umm+0.242
icles+0.225
ators+0.221
Neg logit contributions
bs-0.408
 net-0.322
 slips-0.319
 knot-0.314
 loses-0.308
Token'
Token ID6
Feature activation+7.10e-01

Pos logit contributions
bs+1.472
 net+1.055
 slips+1.037
 loses+1.017
 loop+1.012
Neg logit contributions
ator-2.326
man-1.910
umm-1.697
icles-1.684
ators-1.646
Token and
Token ID290
Feature activation-1.54e-02

Pos logit contributions
ator+1.868
man+1.502
umm+1.427
icles+1.360
ators+1.349
Neg logit contributions
bs-1.015
 net-0.673
 slips-0.669
 knot-0.648
ip-0.631
Token '
Token ID705
Feature activation+4.46e-03

Pos logit contributions
bs+0.029
 slips+0.022
 net+0.022
 knot+0.021
ata+0.021
Neg logit contributions
ator-0.034
man-0.026
umm-0.024
icles-0.022
 birth-0.022
Tokenthank
Token ID40716
Feature activation+3.06e-01

Pos logit contributions
ator+0.008
man+0.006
umm+0.006
icles+0.005
ators+0.005
Neg logit contributions
bs-0.010
 net-0.008
 slips-0.008
 knot-0.008
 loses-0.007
Token you
Token ID345
Feature activation-1.73e-01

Pos logit contributions
ator+0.567
man+0.411
umm+0.377
icles+0.349
ators+0.342
Neg logit contributions
bs-0.676
 slips-0.530
 net-0.528
 knot-0.516
 loses-0.511
Token'.
Token ID4458
Feature activation+1.72e-01

Pos logit contributions
bs+0.289
 slips+0.210
 net+0.207
 loses+0.201
 knot+0.199
Neg logit contributions
ator-0.414
man-0.330
umm-0.303
icles-0.290
ators-0.287
Token One
Token ID1881
Feature activation-7.53e-01

Pos logit contributions
ator+0.375
man+0.289
umm+0.269
icles+0.255
ators+0.249
Neg logit contributions
bs-0.327
 net-0.243
 slips-0.243
 knot-0.238
 loses-0.234
Token,
Token ID11
Feature activation-8.03e-01

Pos logit contributions
bs+1.488
 slips+1.002
 knot+0.990
 net+0.980
 loses+0.975
Neg logit contributions
ator-2.274
man-1.867
umm-1.651
icles-1.627
ators-1.591
Token bugs
Token ID11316
Feature activation-8.28e-01

Pos logit contributions
bs+1.496
ip+1.078
 slips+1.076
 knot+1.073
op+1.059
Neg logit contributions
ator-1.763
man-1.363
icles-1.244
ators-1.221
umm-1.219
Token and
Token ID290
Feature activation-1.21e+00

Pos logit contributions
bs+1.295
 slips+0.872
 net+0.861
 knot+0.857
 loses+0.846
Neg logit contributions
ator-2.074
man-1.682
umm-1.501
icles-1.471
ators-1.446
Token flowers
Token ID12734
Feature activation-3.37e-01

Pos logit contributions
bs+2.602
ip+2.046
op+2.010
 slips+1.986
ata+1.967
Neg logit contributions
ator-2.237
man-1.673
 birth-1.528
umm-1.524
icles-1.510
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
bs+0.576
 slips+0.413
 net+0.405
 knot+0.404
 loses+0.398
Neg logit contributions
ator-0.793
man-0.633
umm-0.568
icles-0.554
ators-0.543
Token He
Token ID679
Feature activation-5.80e-02

Pos logit contributions
ator+3.063
man+2.337
umm+2.165
ators+2.049
icles+2.028
Neg logit contributions
bs-2.470
 slips-1.814
 net-1.781
-1.778
 loop-1.740
Token even
Token ID772
Feature activation-1.63e+00

Pos logit contributions
bs+0.106
 net+0.079
 slips+0.078
 knot+0.077
ata+0.075
Neg logit contributions
ator-0.128
man-0.100
umm-0.090
ators-0.087
icles-0.086
Token made
Token ID925
Feature activation+4.11e-01

Pos logit contributions
bs+3.155
 knot+2.627
 net+2.554
 slips+2.530
op+2.496
Neg logit contributions
ator-3.183
man-2.552
 birth-2.139
ators-2.116
py-2.113
Token a
Token ID257
Feature activation-2.43e-01

Pos logit contributions
ator+1.033
man+0.824
umm+0.776
icles+0.741
ators+0.731
Neg logit contributions
bs-0.638
 slips-0.439
 net-0.438
 knot-0.423
ip-0.415
Token new
Token ID649
Feature activation+1.68e-01

Pos logit contributions
bs+0.453
 slips+0.326
ata+0.319
ip+0.318
 loses+0.315
Neg logit contributions
ator-0.542
man-0.424
umm-0.394
 birth-0.365
ators-0.360
Token friend
Token ID1545
Feature activation-7.04e-01

Pos logit contributions
ator+0.356
man+0.272
umm+0.250
icles+0.232
ators+0.231
Neg logit contributions
bs-0.336
 slips-0.251
 net-0.247
ata-0.243
ip-0.242
Token the
Token ID262
Feature activation+9.71e-01

Pos logit contributions
bs+2.980
ip+2.176
op+2.161
 slips+2.146
ata+2.125
Neg logit contributions
ator-3.127
man-2.316
 him-2.181
 birth-2.126
icles-2.101
Token park
Token ID3952
Feature activation+1.71e-01

Pos logit contributions
ator+2.198
man+1.691
umm+1.598
icles+1.514
ators+1.479
Neg logit contributions
bs-1.758
 net-1.330
 slips-1.311
 knot-1.301
 loop-1.282
Token they
Token ID484
Feature activation+2.39e-01

Pos logit contributions
ator+0.446
man+0.362
umm+0.337
icles+0.325
ators+0.320
Neg logit contributions
bs-0.252
 net-0.168
 slips-0.167
 knot-0.164
ata-0.158
Token found
Token ID1043
Feature activation-5.68e-01

Pos logit contributions
ator+0.548
man+0.429
umm+0.395
icles+0.377
ators+0.375
Neg logit contributions
bs-0.422
 net-0.307
 slips-0.306
 knot-0.303
 loses-0.293
Token the
Token ID262
Feature activation+8.66e-01

Pos logit contributions
bs+0.898
 slips+0.607
 knot+0.597
 loses+0.589
 net+0.579
Neg logit contributions
ator-1.437
man-1.169
icles-1.055
umm-1.054
ators-1.026
Token biggest
Token ID4094
Feature activation-6.47e-01

Pos logit contributions
ator+1.872
man+1.418
umm+1.327
icles+1.256
ators+1.232
Neg logit contributions
bs-1.660
 net-1.260
 slips-1.251
 knot-1.239
 loop-1.213
Token ear
Token ID1027
Feature activation-4.70e-01

Pos logit contributions
bs+1.215
ip+0.875
 slips+0.868
ata+0.852
 net+0.837
Neg logit contributions
ator-1.432
man-1.132
umm-1.044
 birth-0.968
icles-0.963
Token Tom
Token ID4186
Feature activation-2.63e-01

Pos logit contributions
bs+0.819
 net+0.614
 slips+0.599
 knot+0.597
 loses+0.576
Neg logit contributions
ator-1.085
man-0.873
umm-0.760
icles-0.747
ators-0.738
Token had
Token ID550
Feature activation+8.89e-01

Pos logit contributions
bs+0.471
 slips+0.347
 net+0.346
 knot+0.343
 loses+0.329
Neg logit contributions
ator-0.590
man-0.471
umm-0.416
icles-0.402
ators-0.400
Token ever
Token ID1683
Feature activation-9.08e-01

Pos logit contributions
ator+1.431
man+0.974
umm+0.896
icles+0.803
ators+0.780
Neg logit contributions
bs-2.193
 net-1.764
 slips-1.760
ata-1.720
 loses-1.718
Token seen
Token ID1775
Feature activation+2.01e-01

Pos logit contributions
bs+1.477
 knot+1.134
 net+1.078
 slips+1.073
 loop+1.045
Neg logit contributions
ator-2.121
man-1.707
umm-1.568
 birth-1.524
icles-1.514
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
ator+2.559
man+2.038
umm+1.943
icles+1.828
ators+1.793
Neg logit contributions
bs-1.634
 net-1.139
 slips-1.121
 knot-1.103
 tie-1.084
Token "
Token ID366
Feature activation+8.01e-01

Pos logit contributions
ator+3.388
man+2.529
umm+2.370
ators+2.294
icles+2.227
Neg logit contributions
bs-2.834
 slips-2.104
 net-2.065
 knot-2.041
 loop-2.022
TokenWhy
Token ID5195
Feature activation-1.43e+00

Pos logit contributions
ator+1.504
man+1.089
umm+0.995
icles+0.925
ators+0.902
Neg logit contributions
bs-1.778
 net-1.381
 slips-1.380
 knot-1.360
ata-1.345
Token did
Token ID750
Feature activation-2.34e-01

Pos logit contributions
bs+2.595
 net+1.948
ip+1.875
ata+1.821
 knot+1.810
Neg logit contributions
ator-3.173
man-2.582
icles-2.272
ators-2.212
 birth-2.178
Token you
Token ID345
Feature activation-4.92e-01

Pos logit contributions
bs+0.521
 slips+0.409
 net+0.407
ata+0.401
ip+0.397
Neg logit contributions
ator-0.420
man-0.305
umm-0.271
icles-0.261
 birth-0.256
Token do
Token ID466
Feature activation-3.80e-01

Pos logit contributions
bs+0.811
 slips+0.588
ip+0.577
 net+0.576
 knot+0.564
Neg logit contributions
ator-1.170
man-0.907
umm-0.837
icles-0.828
ators-0.797
Token that
Token ID326
Feature activation-2.37e-01

Pos logit contributions
bs+0.711
 slips+0.519
 net+0.508
ata+0.496
ip+0.492
Neg logit contributions
ator-0.854
man-0.648
umm-0.591
icles-0.583
 birth-0.567
Token?"
Token ID1701
Feature activation+1.72e-01

Pos logit contributions
bs+0.467
 net+0.342
 slips+0.340
ip+0.333
ata+0.329
Neg logit contributions
ator-0.513
man-0.402
umm-0.360
icles-0.339
ators-0.335
Token asked
Token ID1965
Feature activation+5.34e-01

Pos logit contributions
ator+0.387
man+0.305
umm+0.279
icles+0.264
 birth+0.259
Neg logit contributions
bs-0.305
 net-0.227
 slips-0.226
 knot-0.219
ip-0.215
Token Lily
Token ID20037
Feature activation-5.69e-01

Pos logit contributions
ator+1.212
man+0.938
umm+0.877
icles+0.828
ators+0.818
Neg logit contributions
bs-0.973
 net-0.713
 slips-0.711
 knot-0.693
ip-0.681
Token.
Token ID13
Feature activation+9.43e-01

Pos logit contributions
bs+0.997
 net+0.765
 slips+0.744
 knot+0.720
ata+0.701
Neg logit contributions
ator-1.304
man-1.042
umm-0.906
icles-0.884
ators-0.875
Token the
Token ID262
Feature activation+5.50e-01

Pos logit contributions
bs+2.577
 slips+1.972
ata+1.958
 loses+1.954
 net+1.953
Neg logit contributions
ator-2.950
man-2.316
icles-2.134
 him-2.065
umm-2.014
Token coal
Token ID5655
Feature activation+4.73e-01

Pos logit contributions
ator+1.276
man+0.994
umm+0.932
icles+0.882
ators+0.871
Neg logit contributions
bs-0.970
 net-0.703
 slips-0.701
 knot-0.682
ip-0.669
Token with
Token ID351
Feature activation-1.15e+00

Pos logit contributions
ator+1.241
man+1.002
umm+0.946
icles+0.905
ators+0.894
Neg logit contributions
bs-0.683
 net-0.452
 slips-0.451
 knot-0.435
ip-0.425
Token the
Token ID262
Feature activation-8.66e-02

Pos logit contributions
bs+2.030
ata+1.547
ip+1.540
 slips+1.536
 loses+1.518
Neg logit contributions
ator-2.473
man-1.959
icles-1.794
 him-1.787
umm-1.740
Token gloves
Token ID18051
Feature activation-9.84e-01

Pos logit contributions
bs+0.157
 slips+0.113
 net+0.112
ip+0.111
 knot+0.110
Neg logit contributions
ator-0.195
man-0.155
umm-0.141
icles-0.135
ators-0.130
Token and
Token ID290
Feature activation-2.12e+00

Pos logit contributions
bs+1.608
 slips+1.113
 knot+1.097
 net+1.097
ip+1.075
Neg logit contributions
ator-2.394
man-1.992
icles-1.745
umm-1.719
 him-1.679
Token the
Token ID262
Feature activation-3.10e-01

Pos logit contributions
bs+3.983
ip+3.329
op+3.314
ata+3.283
ver+3.161
Neg logit contributions
ator-4.121
man-3.369
 him-3.116
icles-2.977
 birth-2.931
Token stick
Token ID4859
Feature activation-8.21e-01

Pos logit contributions
bs+0.537
 slips+0.376
ip+0.372
ata+0.370
 net+0.368
Neg logit contributions
ator-0.718
man-0.579
umm-0.528
icles-0.509
ators-0.493
Token.
Token ID13
Feature activation-9.62e-01

Pos logit contributions
bs+1.408
 net+1.021
 slips+1.019
ip+1.016
 knot+1.015
Neg logit contributions
ator-1.851
man-1.516
icles-1.309
umm-1.288
ators-1.243
Token They
Token ID1119
Feature activation-1.04e+00

Pos logit contributions
bs+2.045
 net+1.607
ata+1.571
ip+1.570
 knot+1.568
Neg logit contributions
ator-1.809
man-1.327
umm-1.260
icles-1.211
 birth-1.169
Token feel
Token ID1254
Feature activation-1.52e+00

Pos logit contributions
bs+1.721
ip+1.294
 net+1.229
ata+1.221
op+1.204
Neg logit contributions
ator-2.364
man-1.863
umm-1.661
 him-1.651
ators-1.627
Token back
Token ID736
Feature activation+6.32e-03

Pos logit contributions
ator+1.927
man+1.479
umm+1.413
icles+1.307
ators+1.292
Neg logit contributions
bs-1.584
 net-1.173
 slips-1.169
 loses-1.126
 knot-1.126
Token.
Token ID13
Feature activation-8.36e-01

Pos logit contributions
ator+0.015
man+0.012
umm+0.011
icles+0.010
ators+0.010
Neg logit contributions
bs-0.011
 net-0.008
 slips-0.008
 knot-0.008
ip-0.008
Token\n
Token ID198
Feature activation-3.34e-01

Pos logit contributions
bs+1.707
 net+1.340
ata+1.306
 knot+1.293
 slips+1.273
Neg logit contributions
ator-1.635
man-1.231
umm-1.152
omm-1.096
icles-1.082
Token\n
Token ID198
Feature activation-1.13e+00

Pos logit contributions
bs+0.572
 net+0.423
 slips+0.414
 knot+0.411
 loses+0.398
Neg logit contributions
ator-0.780
man-0.617
umm-0.544
icles-0.539
omm-0.525
Token"
Token ID1
Feature activation+4.73e-01

Pos logit contributions
bs+1.935
 net+1.453
 slips+1.408
 loses+1.393
 knot+1.368
Neg logit contributions
ator-2.595
man-2.088
umm-1.755
esh-1.754
 us-1.748
TokenWhy
Token ID5195
Feature activation-2.21e+00

Pos logit contributions
ator+1.015
man+0.772
umm+0.718
icles+0.674
ators+0.665
Neg logit contributions
bs-0.932
 net-0.700
 slips-0.698
 knot-0.682
ip-0.673
Token not
Token ID407
Feature activation-1.16e+00

Pos logit contributions
bs+4.560
 net+3.649
ata+3.385
ip+3.330
 loop+3.326
Neg logit contributions
ator-4.412
man-3.602
icles-3.133
 him-3.033
ators-2.924
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
bs+2.427
ata+1.845
ip+1.792
 slips+1.751
ching+1.724
Neg logit contributions
ator-2.424
man-1.874
umm-1.621
icles-1.604
esh-1.580
Token Ben
Token ID3932
Feature activation-1.05e+00

Pos logit contributions
bs+1.994
ata+1.329
 slips+1.299
ching+1.280
ip+1.260
Neg logit contributions
ator-3.259
man-2.586
umm-2.457
icles-2.412
omm-2.368
Token?"
Token ID1701
Feature activation-3.67e-01

Pos logit contributions
bs+2.159
 net+1.565
ata+1.537
 knot+1.535
ip+1.519
Neg logit contributions
ator-2.198
man-1.798
umm-1.462
ators-1.405
icles-1.384
Token Lily
Token ID20037
Feature activation-1.42e+00

Pos logit contributions
bs+0.668
 net+0.513
 slips+0.498
ata+0.493
ip+0.492
Neg logit contributions
ator-0.784
man-0.610
umm-0.565
 birth-0.532
icles-0.531
Token across
Token ID1973
Feature activation-1.33e+00

Pos logit contributions
bs+0.425
 slips+0.313
 net+0.312
ip+0.311
 knot+0.308
Neg logit contributions
ator-0.466
man-0.360
umm-0.318
ators-0.312
icles-0.312
Token a
Token ID257
Feature activation-1.65e+00

Pos logit contributions
bs+2.432
ip+1.750
 slips+1.721
op+1.680
 knot+1.668
Neg logit contributions
ator-3.117
man-2.395
 him-2.205
ators-2.173
 us-2.145
Token net
Token ID2010
Feature activation-5.94e-01

Pos logit contributions
bs+3.569
ip+2.552
 slips+2.513
ata+2.485
 loses+2.427
Neg logit contributions
ator-3.475
man-2.726
umm-2.355
 birth-2.208
ators-2.171
Token.
Token ID13
Feature activation-5.28e-01

Pos logit contributions
bs+1.088
ip+0.775
 slips+0.773
 knot+0.767
 net+0.761
Neg logit contributions
ator-1.324
man-1.054
umm-0.925
icles-0.889
ators-0.885
Token In
Token ID554
Feature activation-9.05e-01

Pos logit contributions
bs+1.109
 slips+0.838
 net+0.834
 knot+0.830
 loses+0.817
Neg logit contributions
ator-1.030
man-0.782
umm-0.733
icles-0.687
omm-0.649
Token this
Token ID428
Feature activation-2.46e+00

Pos logit contributions
bs+1.669
ata+1.187
ip+1.141
 slips+1.141
 loses+1.126
Neg logit contributions
ator-2.097
man-1.633
umm-1.446
ators-1.416
icles-1.413
Token way
Token ID835
Feature activation+4.54e-01

Pos logit contributions
bs+5.573
ip+4.187
op+4.107
imes+4.077
ata+4.048
Neg logit contributions
ator-4.911
man-3.926
 us-3.133
 me-3.082
umm-2.993
Token,
Token ID11
Feature activation-8.62e-01

Pos logit contributions
ator+1.139
man+0.908
umm+0.853
icles+0.814
ators+0.804
Neg logit contributions
bs-0.710
 net-0.488
 slips-0.487
 knot-0.473
ip-0.462
Token the
Token ID262
Feature activation-1.37e+00

Pos logit contributions
bs+1.558
ata+1.125
 slips+1.111
 knot+1.105
ip+1.104
Neg logit contributions
ator-1.935
man-1.538
umm-1.385
icles-1.375
ators-1.337
Token snail
Token ID47374
Feature activation-1.09e+00

Pos logit contributions
bs+3.039
ata+2.287
ip+2.240
 slips+2.223
 loses+2.203
Neg logit contributions
ator-2.581
man-1.977
umm-1.835
opher-1.588
py-1.540
Token would
Token ID561
Feature activation-1.11e-01

Pos logit contributions
bs+2.040
 knot+1.459
ip+1.438
 slips+1.437
 net+1.436
Neg logit contributions
ator-2.372
man-1.891
ators-1.579
umm-1.566
 birth-1.519
Token.
Token ID13
Feature activation+3.59e-01

Pos logit contributions
bs+0.150
 net+0.108
 slips+0.107
 knot+0.104
ip+0.103
Neg logit contributions
ator-0.202
man-0.159
umm-0.148
icles-0.140
ators-0.138
Token He
Token ID679
Feature activation-3.05e-01

Pos logit contributions
ator+0.639
man+0.457
umm+0.420
icles+0.384
ators+0.373
Neg logit contributions
bs-0.824
 net-0.648
 slips-0.646
 knot-0.635
 loses-0.627
Token wanted
Token ID2227
Feature activation+1.64e-01

Pos logit contributions
bs+0.552
 net+0.400
 slips+0.397
 knot+0.389
ata+0.384
Neg logit contributions
ator-0.672
man-0.532
umm-0.476
icles-0.458
ators-0.455
Token to
Token ID284
Feature activation-1.68e-01

Pos logit contributions
ator+0.372
man+0.290
umm+0.268
icles+0.258
ators+0.252
Neg logit contributions
bs-0.299
 slips-0.218
 net-0.215
 knot-0.212
ip-0.209
Token take
Token ID1011
Feature activation-5.44e-01

Pos logit contributions
bs+0.304
 slips+0.219
 net+0.217
ip+0.215
ata+0.213
Neg logit contributions
ator-0.383
man-0.297
umm-0.272
icles-0.262
ators-0.256
Token a
Token ID257
Feature activation-2.53e+00

Pos logit contributions
bs+0.912
 slips+0.656
 net+0.628
 loses+0.626
ata+0.624
Neg logit contributions
ator-1.300
man-1.019
umm-0.955
icles-0.927
 birth-0.902
Token nap
Token ID25422
Feature activation-8.72e-02

Pos logit contributions
bs+4.480
 slips+3.401
ata+3.383
 loses+3.375
 pads+3.314
Neg logit contributions
ator-5.224
man-4.476
 birth-3.691
icles-3.680
umm-3.641
Token in
Token ID287
Feature activation+8.73e-02

Pos logit contributions
bs+0.146
 slips+0.103
 net+0.101
 loses+0.098
 knot+0.098
Neg logit contributions
ator-0.211
man-0.168
umm-0.155
icles-0.148
ators-0.146
Token the
Token ID262
Feature activation+4.52e-02

Pos logit contributions
ator+0.197
man+0.152
umm+0.141
icles+0.134
ators+0.132
Neg logit contributions
bs-0.162
 slips-0.118
 net-0.116
ip-0.114
 knot-0.113
Token grass
Token ID8701
Feature activation-4.54e-01

Pos logit contributions
ator+0.097
man+0.075
umm+0.070
icles+0.065
ators+0.064
Neg logit contributions
bs-0.087
 slips-0.063
ata-0.062
 net-0.062
ip-0.062
Token.
Token ID13
Feature activation+3.35e-01

Pos logit contributions
bs+0.757
 slips+0.523
 net+0.514
 knot+0.511
 loses+0.504
Neg logit contributions
ator-1.097
man-0.890
umm-0.816
icles-0.788
ators-0.764
TokenSo
Token ID2396
Feature activation+1.05e+00

Pos logit contributions
ator+1.439
man+1.136
umm+1.071
icles+1.016
ators+1.003
Neg logit contributions
bs-0.984
 net-0.694
 slips-0.693
 knot-0.674
 loop-0.658
Token she
Token ID673
Feature activation-8.17e-01

Pos logit contributions
ator+2.823
man+2.277
umm+2.173
icles+2.074
ators+2.051
Neg logit contributions
bs-1.518
 slips-0.978
 net-0.973
 knot-0.957
-0.937
Token ran
Token ID4966
Feature activation-7.81e-02

Pos logit contributions
bs+1.525
 slips+1.143
 net+1.116
 knot+1.115
 loses+1.110
Neg logit contributions
ator-1.738
man-1.391
umm-1.180
icles-1.175
ators-1.171
Token over
Token ID625
Feature activation-4.45e-01

Pos logit contributions
bs+0.163
 slips+0.127
 net+0.125
 knot+0.124
 loses+0.122
Neg logit contributions
ator-0.153
man-0.114
umm-0.102
icles-0.097
ators-0.095
Token to
Token ID284
Feature activation-2.36e-01

Pos logit contributions
bs+0.827
 slips+0.616
 net+0.600
 knot+0.597
ip+0.591
Neg logit contributions
ator-0.974
man-0.750
umm-0.679
icles-0.664
ators-0.660
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
bs+0.536
 slips+0.429
 net+0.419
ip+0.418
 knot+0.416
Neg logit contributions
ator-0.427
man-0.302
umm-0.260
icles-0.258
ators-0.252
Token pitch
Token ID7078
Feature activation-6.26e-01

Pos logit contributions
bs+4.002
ip+2.911
ata+2.674
azz+2.631
 slips+2.605
Neg logit contributions
ator-5.432
man-4.355
umm-4.000
 us-3.948
 birth-3.793
Token,
Token ID11
Feature activation-1.30e+00

Pos logit contributions
bs+0.982
 slips+0.670
 loses+0.654
ip+0.648
 knot+0.639
Neg logit contributions
ator-1.581
man-1.301
umm-1.163
icles-1.133
ators-1.112
Token shouting
Token ID19642
Feature activation+6.49e-01

Pos logit contributions
bs+2.385
 knot+1.795
ip+1.748
 slips+1.741
 net+1.722
Neg logit contributions
ator-2.786
man-2.176
 birth-1.909
icles-1.904
 him-1.885
Token "
Token ID366
Feature activation+1.03e+00

Pos logit contributions
ator+1.610
man+1.274
umm+1.207
icles+1.143
ators+1.133
Neg logit contributions
bs-1.041
 net-0.724
 slips-0.720
 knot-0.701
ip-0.684
Tokenhi
Token ID5303
Feature activation-3.69e-01

Pos logit contributions
ator+2.130
man+1.579
umm+1.469
icles+1.360
ators+1.336
Neg logit contributions
bs-2.107
 slips-1.593
 net-1.589
-1.560
ata-1.560