TOP ACTIVATIONS
MAX = 4.204

carp ets . The palace belonged to a King
, fluffy p anda . The p anda was very amazing
led in amaz ement . The palace garden was filled with
went on a hike . The sky was cloudy and it
smiled and said , " The brilliant palace is lit up
One day , the hipp o saw something that
was a big surprise . The other animals were very excited
happy . So the King and Queen decided to
One day , the p anda went for a
kitchen . The poor dark hipp o never
some food . The hipp o went to the
hall ways and up the grand stair ways . Everywhere she
imagined a big palace . The palace was very pretty .
liked the sweet smell . The hipp o skipped on and
and cheered . They thanked the hipp o for his show
. The elderly rh in oc eros was so happy to
strong . The hipp o found some vegetables
o was so attractive . The hipp o was excited to
the gorilla . The gorilla was big and strong
was a p anda . The p anda was very gloomy

MIN ACTIVATIONS
MIN = -4.187

asked if he could borrow a strip . But , his
branch . They look for another tree . They see one
" How do you snap a knot ?"
he promised to never attach a net to someone again .
dad if he could take a seat under the shady tree
green rope ! He took the rope and ran to the
." She threw the rope into the hole and
tied the other end of the rope to her hand ,
L ily took the rope and started to swing
They tried to unt ie the knot , but it was
his bag and took out a jar . He said it
. They decide to find another game to play . They
't be able to get a good shot .
he replied . He took a long piece of string and
fly . They look at each other . They smile .
was excited ! He grabbed the rope and began to swing
L ulu took a deep breath and decided to
it again . He tried another , and another . He
showed them how to snap the knot by pulling it really
day she got a little too close to a pond and

INTERVAL 2.106 - 4.204
CONTAINS 0.241%

! ” Her mom told her that she had to
After she finished the pear , Rachel looked at
open this heavy box ?" The bear tried to turn the
grandma . Her mom my hugged her and told her that
. He went home and told his dad all about his

INTERVAL 0.008 - 2.106
CONTAINS 48.571%

can be . M um and Dad were so
a mess . He had glue and cr ay on on
It ’ s lots of
wanted to build a castle so he could look after all
chasing them , so they decided to play a trick on

INTERVAL -2.089 - 0.008
CONTAINS 50.983%

be careful . So , even though he hadn 't found
and made a castle . They had fun and forgot about
What is that ?" Tom asks . " It
chat and sing together . And so , Bun and the
she called a taxi to take her to work .

INTERVAL -4.187 - -2.089
CONTAINS 0.205%

. The bird sees Lily too and sings a pretty song
Tom said , and threw a stick for Max to catch
like a horn . Lily and Ben think the bird is
says . " What is it , Mom ?
Mom my gave Tommy a big box and asked him
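
NOTE ON INTERVALS
The dump does not say how the interval edges are chosen. The four edges listed above are consistent with splitting the range from MIN (-4.187) to MAX (4.204) into four equal-width bins, and the CONTAINS figures sum to 100%. The sketch below is an assumption built on that reading, not the dashboard's own code; the function names `interval_edges` and `interval_shares` are hypothetical.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


def interval_shares(activations, lo, hi, n_bins=4):
    """Return the percentage of activations that falls in each equal-width bin."""
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for a in activations:
        # Clamp so that values at or beyond the ends land in the outer bins.
        idx = max(0, min(int((a - lo) / width), n_bins - 1))
        counts[idx] += 1
    return [100.0 * c / len(activations) for c in counts]


# Edges implied by the MIN and MAX values reported in this dump.
edges = interval_edges(-4.187, 4.204)
print([round(e, 3) for e in edges])
```

Under this assumption the inner edges fall near -2.089, 0.008 and 2.106, matching the INTERVAL lines to within rounding. The percentages themselves cannot be recomputed from this dump, since only five sample contexts per interval are shown.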
Token carp
Token ID39465
Feature activation+6.97e-01

Pos logit contributions
ator+0.648
man+0.502
umm+0.467
icles+0.443
ators+0.437
Neg logit contributions
bs-0.526
 slips-0.387
 net-0.385
 knot-0.375
ip-0.371
Tokenets
Token ID1039
Feature activation+6.55e-01

Pos logit contributions
ator+1.765
man+1.407
umm+1.327
icles+1.264
ators+1.251
Neg logit contributions
bs-1.093
 net-0.743
 slips-0.742
 knot-0.716
ip-0.706
Token.
Token ID13
Feature activation+4.52e-02

Pos logit contributions
ator+1.637
man+1.301
umm+1.233
icles+1.169
ators+1.157
Neg logit contributions
bs-1.029
 net-0.711
 slips-0.708
 knot-0.686
ata-0.670
Token
Token ID198
Feature activation+4.70e-01

Pos logit contributions
ator+0.101
man+0.078
umm+0.073
icles+0.070
ators+0.068
Neg logit contributions
bs-0.083
 net-0.062
 slips-0.062
 knot-0.060
 loses-0.059
Token
Token ID198
Feature activation+4.97e-01

Pos logit contributions
ator+1.175
man+0.937
umm+0.880
icles+0.839
ators+0.830
Neg logit contributions
bs-0.736
 net-0.509
 slips-0.508
 knot-0.492
ip-0.481
TokenThe
Token ID464
Feature activation+4.20e+00

Pos logit contributions
ator+1.064
man+0.809
umm+0.751
icles+0.706
ators+0.697
Neg logit contributions
bs-0.973
 net-0.730
 slips-0.729
 knot-0.711
ip-0.701
Token palace
Token ID20562
Feature activation+9.70e-01

Pos logit contributions
ator+10.304
icles+8.250
umm+8.192
man+7.971
esh+7.613
Neg logit contributions
bs-7.770
 net-6.045
 tie-5.926
 knot-5.784
 loop-5.777
Token belonged
Token ID19611
Feature activation+9.62e-01

Pos logit contributions
ator+2.135
man+1.613
umm+1.560
icles+1.463
ators+1.418
Neg logit contributions
bs-1.825
 slips-1.351
 net-1.348
 knot-1.307
 loop-1.298
Token to
Token ID284
Feature activation+3.75e-01

Pos logit contributions
ator+2.300
man+1.792
umm+1.721
icles+1.622
ators+1.590
Neg logit contributions
bs-1.626
 net-1.166
 slips-1.157
ip-1.109
 knot-1.105
Token a
Token ID257
Feature activation+4.37e-01

Pos logit contributions
ator+1.074
man+0.881
umm+0.835
icles+0.804
ators+0.797
Neg logit contributions
bs-0.459
 slips-0.275
 net-0.272
 knot-0.262
ip-0.256
Token King
Token ID2677
Feature activation-6.07e-01

Pos logit contributions
ator+1.046
man+0.825
umm+0.773
icles+0.730
ators+0.723
Neg logit contributions
bs-0.734
 slips-0.521
 net-0.519
 knot-0.504
ip-0.496
Token,
Token ID11
Feature activation-8.02e-01

Pos logit contributions
ator+0.059
man+0.045
umm+0.042
icles+0.039
ators+0.038
Neg logit contributions
bs-0.054
 slips-0.040
ata-0.039
 net-0.039
 loses-0.039
Token fluffy
Token ID39145
Feature activation+4.21e-01

Pos logit contributions
bs+1.456
 slips+1.036
 loses+1.035
ata+1.019
ip+1.000
Neg logit contributions
ator-1.804
man-1.450
umm-1.322
 birth-1.235
omm-1.202
Token p
Token ID279
Feature activation+4.19e-01

Pos logit contributions
ator+0.982
man+0.770
umm+0.719
icles+0.679
ators+0.670
Neg logit contributions
bs-0.736
 slips-0.530
 net-0.527
 knot-0.512
 loses-0.507
Tokenanda
Token ID5282
Feature activation+1.75e-01

Pos logit contributions
ator+0.803
man+0.588
umm+0.540
icles+0.501
ators+0.493
Neg logit contributions
bs-0.903
 net-0.701
 slips-0.699
 knot-0.687
 loses-0.676
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
ator+0.438
man+0.353
umm+0.323
icles+0.311
ators+0.308
Neg logit contributions
bs-0.275
 slips-0.192
 net-0.191
 knot-0.187
 loses-0.182
Token The
Token ID383
Feature activation+4.03e+00

Pos logit contributions
ator+3.501
man+2.629
umm+2.449
ators+2.386
 birth+2.261
Neg logit contributions
bs-2.772
 net-2.040
 slips-2.029
 knot-1.982
-1.969
Token p
Token ID279
Feature activation+2.44e+00

Pos logit contributions
ator+11.467
umm+9.140
man+8.970
ators+8.915
icles+8.399
Neg logit contributions
bs-5.327
 loop-4.906
 net-4.571
 dove-4.497
 lime-4.411
Tokenanda
Token ID5282
Feature activation+1.00e+00

Pos logit contributions
ator+5.102
man+3.882
umm+3.727
icles+3.430
ators+3.387
Neg logit contributions
bs-5.156
ip-3.952
 slips-3.908
op-3.848
 loop-3.723
Token was
Token ID373
Feature activation+1.39e+00

Pos logit contributions
ator+2.112
man+1.560
umm+1.524
icles+1.408
ators+1.370
Neg logit contributions
bs-1.999
 slips-1.503
 net-1.496
ip-1.468
 knot-1.455
Token very
Token ID845
Feature activation+1.85e+00

Pos logit contributions
ator+3.454
umm+2.647
man+2.645
ators+2.449
icles+2.448
Neg logit contributions
bs-2.294
 net-1.652
 slips-1.624
 tie-1.599
 loop-1.580
Token amazing
Token ID4998
Feature activation+2.23e-01

Pos logit contributions
ator+4.998
man+3.929
umm+3.866
ators+3.772
icles+3.713
Neg logit contributions
bs-2.883
 slips-1.859
 net-1.841
 tie-1.796
ip-1.796
Tokenled
Token ID992
Feature activation+8.26e-02

Pos logit contributions
ator+0.181
man+0.122
umm+0.107
icles+0.098
ators+0.096
Neg logit contributions
bs-0.294
 net-0.241
 slips-0.239
 knot-0.235
 loop-0.232
Token in
Token ID287
Feature activation+5.05e-01

Pos logit contributions
ator+0.185
man+0.144
umm+0.134
icles+0.127
ators+0.125
Neg logit contributions
bs-0.151
 slips-0.111
 net-0.109
 knot-0.107
ip-0.107
Token amaz
Token ID40642
Feature activation+2.61e+00

Pos logit contributions
ator+1.251
man+0.992
umm+0.935
icles+0.889
ators+0.879
Neg logit contributions
bs-0.814
 net-0.568
 slips-0.567
 knot-0.549
ip-0.539
Tokenement
Token ID972
Feature activation+4.96e-01

Pos logit contributions
ator+4.662
man+3.699
umm+3.046
 Amen+2.996
icles+2.891
Neg logit contributions
bs-5.857
 knot-4.630
 slips-4.530
ip-4.525
 net-4.465
Token.
Token ID13
Feature activation+9.93e-02

Pos logit contributions
ator+1.236
man+0.984
umm+0.927
icles+0.882
ators+0.872
Neg logit contributions
bs-0.783
 net-0.543
 slips-0.541
 knot-0.525
ip-0.513
Token The
Token ID383
Feature activation+4.03e+00

Pos logit contributions
ator+0.210
man+0.160
umm+0.150
icles+0.140
 birth+0.138
Neg logit contributions
bs-0.194
 slips-0.145
 net-0.144
ata-0.141
 knot-0.140
Token palace
Token ID20562
Feature activation+9.17e-01

Pos logit contributions
ator+8.836
umm+6.972
icles+6.815
man+6.760
ators+6.448
Neg logit contributions
bs-7.989
 loop-6.170
 tie-6.134
 net-6.132
 knot-5.941
Token garden
Token ID11376
Feature activation+3.79e-01

Pos logit contributions
ator+1.995
man+1.516
umm+1.438
icles+1.355
ators+1.322
Neg logit contributions
bs-1.740
 net-1.296
 slips-1.296
 knot-1.268
 loop-1.251
Token was
Token ID373
Feature activation+9.59e-01

Pos logit contributions
ator+0.782
man+0.590
umm+0.545
icles+0.511
ators+0.504
Neg logit contributions
bs-0.762
 net-0.577
 slips-0.576
 knot-0.563
ip-0.557
Token filled
Token ID5901
Feature activation+8.78e-01

Pos logit contributions
ator+2.302
man+1.802
umm+1.713
icles+1.611
ators+1.598
Neg logit contributions
bs-1.612
 net-1.158
 slips-1.146
 knot-1.122
 loop-1.107
Token with
Token ID351
Feature activation+1.02e+00

Pos logit contributions
ator+1.981
man+1.523
umm+1.443
icles+1.359
ators+1.335
Neg logit contributions
bs-1.586
 net-1.172
 slips-1.151
 knot-1.134
 tie-1.119
Token went
Token ID1816
Feature activation+5.27e-01

Pos logit contributions
bs+0.999
 knot+0.752
 net+0.728
 slips+0.723
 loop+0.705
Neg logit contributions
ator-1.179
man-0.946
icles-0.824
umm-0.820
 him-0.817
Token on
Token ID319
Feature activation-7.81e-01

Pos logit contributions
ator+1.169
man+0.900
umm+0.840
icles+0.792
ators+0.781
Neg logit contributions
bs-0.982
 net-0.726
 slips-0.724
 knot-0.707
ip-0.694
Token a
Token ID257
Feature activation+1.09e-01

Pos logit contributions
bs+1.260
 slips+0.864
 knot+0.792
op+0.785
ata+0.783
Neg logit contributions
ator-1.960
man-1.565
 birth-1.448
ators-1.417
umm-1.414
Token hike
Token ID18390
Feature activation+2.50e-01

Pos logit contributions
ator+0.231
man+0.178
umm+0.160
icles+0.152
 birth+0.152
Neg logit contributions
bs-0.211
 slips-0.158
 net-0.155
 knot-0.153
 loses-0.152
Token.
Token ID13
Feature activation+1.33e-01

Pos logit contributions
ator+0.542
man+0.418
umm+0.380
icles+0.362
ators+0.359
Neg logit contributions
bs-0.478
 slips-0.355
 net-0.354
 knot-0.348
ip-0.343
Token The
Token ID383
Feature activation+4.03e+00

Pos logit contributions
ator+0.300
man+0.235
umm+0.218
icles+0.209
ators+0.203
Neg logit contributions
bs-0.239
 net-0.175
 slips-0.175
 knot-0.172
 loses-0.168
Token sky
Token ID6766
Feature activation+1.12e+00

Pos logit contributions
ator+10.254
umm+7.381
ators+7.379
icles+7.300
opher+7.165
Neg logit contributions
bs-6.491
 net-6.169
 loop-6.101
 knot-6.055
 rope-6.030
Token was
Token ID373
Feature activation+1.33e+00

Pos logit contributions
ator+2.471
man+1.851
umm+1.785
icles+1.644
ators+1.637
Neg logit contributions
bs-2.128
 slips-1.590
 net-1.587
 loop-1.547
-1.544
Token cloudy
Token ID40026
Feature activation+7.42e-01

Pos logit contributions
ator+3.365
man+2.616
umm+2.529
ators+2.366
icles+2.349
Neg logit contributions
bs-2.060
 net-1.496
 slips-1.459
-1.432
 loop-1.424
Token and
Token ID290
Feature activation+1.02e+00

Pos logit contributions
ator+2.083
man+1.700
umm+1.618
icles+1.548
ators+1.539
Neg logit contributions
bs-0.937
 net-0.577
 slips-0.577
 knot-0.550
 loop-0.536
Token it
Token ID340
Feature activation+1.32e+00

Pos logit contributions
ator+2.562
man+2.017
umm+1.927
icles+1.794
ators+1.789
Neg logit contributions
bs-1.612
 net-1.130
 slips-1.122
 knot-1.089
 loop-1.085
Token smiled
Token ID13541
Feature activation+3.77e-01

Pos logit contributions
ator+0.872
man+0.704
umm+0.657
icles+0.630
ators+0.624
Neg logit contributions
bs-0.501
 net-0.340
 slips-0.338
 knot-0.327
 loses-0.320
Token and
Token ID290
Feature activation-9.96e-01

Pos logit contributions
ator+1.042
man+0.854
umm+0.806
icles+0.771
ators+0.764
Neg logit contributions
bs-0.498
 slips-0.314
 net-0.314
 knot-0.300
 loses-0.293
Token said
Token ID531
Feature activation-3.58e-01

Pos logit contributions
bs+1.608
 slips+1.214
 net+1.204
 loses+1.198
 knot+1.191
Neg logit contributions
ator-2.325
man-1.900
umm-1.665
 birth-1.647
icles-1.642
Token,
Token ID11
Feature activation+4.61e-01

Pos logit contributions
bs+0.548
 slips+0.392
 net+0.389
ata+0.379
 knot+0.378
Neg logit contributions
ator-0.889
man-0.720
umm-0.663
icles-0.646
 birth-0.630
Token "
Token ID366
Feature activation-1.62e-01

Pos logit contributions
ator+1.056
man+0.822
umm+0.768
icles+0.727
ators+0.717
Neg logit contributions
bs-0.821
 net-0.599
 slips-0.597
 knot-0.582
 loses-0.572
TokenThe
Token ID464
Feature activation+4.01e+00

Pos logit contributions
bs+0.320
 net+0.242
 slips+0.242
 knot+0.236
 loses+0.234
Neg logit contributions
ator-0.341
man-0.263
umm-0.246
icles-0.227
ators-0.223
Token brilliant
Token ID10457
Feature activation+2.62e+00

Pos logit contributions
ator+8.651
man+6.694
icles+6.452
rim+6.420
umm+6.383
Neg logit contributions
bs-7.901
 net-6.575
 knot-6.437
-6.377
 loop-6.364
Token palace
Token ID20562
Feature activation+4.38e-01

Pos logit contributions
ator+4.428
man+3.067
umm+3.019
icles+2.799
rim+2.672
Neg logit contributions
bs-6.193
 net-5.058
 knot-5.024
 tie-4.987
 loop-4.932
Token is
Token ID318
Feature activation+3.09e-01

Pos logit contributions
ator+0.978
man+0.756
umm+0.704
icles+0.664
ators+0.656
Neg logit contributions
bs-0.804
 net-0.592
 slips-0.589
 knot-0.576
ip-0.566
Token lit
Token ID6578
Feature activation+9.84e-01

Pos logit contributions
ator+0.695
man+0.538
umm+0.498
icles+0.473
ators+0.467
Neg logit contributions
bs-0.567
 slips-0.415
 net-0.415
 knot-0.405
ip-0.399
Token up
Token ID510
Feature activation+3.22e-01

Pos logit contributions
ator+2.028
man+1.521
umm+1.439
icles+1.326
ators+1.304
Neg logit contributions
bs-1.950
 net-1.481
 slips-1.472
 knot-1.460
 loop-1.440
Token
Token ID198
Feature activation+2.03e-01

Pos logit contributions
ator+0.285
man+0.224
umm+0.209
icles+0.200
ators+0.194
Neg logit contributions
bs-0.218
 net-0.157
 slips-0.157
 knot-0.154
 loses-0.151
Token
Token ID198
Feature activation+7.88e-01

Pos logit contributions
ator+0.497
man+0.396
umm+0.364
icles+0.350
ators+0.346
Neg logit contributions
bs-0.329
 net-0.232
 slips-0.232
 knot-0.226
 loses-0.220
TokenOne
Token ID3198
Feature activation-6.62e-01

Pos logit contributions
ator+1.745
man+1.328
umm+1.258
icles+1.175
ators+1.156
Neg logit contributions
bs-1.493
 net-1.112
 slips-1.107
 knot-1.089
 loop-1.074
Token day
Token ID1110
Feature activation-5.09e-01

Pos logit contributions
bs+1.624
 slips+1.339
 loses+1.333
 knot+1.315
 net+1.311
Neg logit contributions
ator-1.034
man-0.764
umm-0.620
icles-0.574
 birth-0.557
Token,
Token ID11
Feature activation+2.61e-01

Pos logit contributions
bs+0.807
 knot+0.570
 net+0.569
 slips+0.562
ata+0.560
Neg logit contributions
ator-1.223
man-0.994
umm-0.908
icles-0.897
 him-0.860
Token the
Token ID262
Feature activation+4.01e+00

Pos logit contributions
ator+0.631
man+0.503
umm+0.469
icles+0.450
ators+0.439
Neg logit contributions
bs-0.431
 slips-0.303
 net-0.301
 knot-0.295
ata-0.290
Token hipp
Token ID18568
Feature activation+9.58e-01

Pos logit contributions
ator+11.425
umm+8.916
ators+8.870
man+8.680
zie+8.470
Neg logit contributions
bs-4.983
 net-4.314
 dove-3.974
 lime-3.804
 tie-3.688
Tokeno
Token ID78
Feature activation-3.89e-01

Pos logit contributions
ator+1.964
man+1.444
umm+1.380
icles+1.275
ators+1.258
Neg logit contributions
bs-1.950
 slips-1.476
 net-1.472
ip-1.439
ata-1.427
Token saw
Token ID2497
Feature activation-3.48e-01

Pos logit contributions
bs+0.744
 knot+0.553
 net+0.544
 slips+0.540
ip+0.523
Neg logit contributions
ator-0.828
man-0.656
umm-0.569
ators-0.548
icles-0.546
Token something
Token ID1223
Feature activation-4.07e-01

Pos logit contributions
bs+0.521
 slips+0.340
 knot+0.340
 net+0.328
ata+0.325
Neg logit contributions
ator-0.911
man-0.749
icles-0.682
umm-0.675
 him-0.659
Token that
Token ID326
Feature activation+1.13e+00

Pos logit contributions
bs+0.785
 slips+0.585
 net+0.568
ata+0.567
ip+0.566
Neg logit contributions
ator-0.856
man-0.673
umm-0.599
icles-0.575
 birth-0.572
Token was
Token ID373
Feature activation+2.29e-01

Pos logit contributions
ator+1.699
man+1.276
umm+1.216
icles+1.133
ators+1.115
Neg logit contributions
bs-1.520
 net-1.138
 slips-1.137
 knot-1.112
-1.095
Token a
Token ID257
Feature activation+1.82e-01

Pos logit contributions
ator+0.601
man+0.486
umm+0.454
icles+0.438
ators+0.433
Neg logit contributions
bs-0.333
 slips-0.221
 net-0.219
 knot-0.213
 loses-0.208
Token big
Token ID1263
Feature activation+2.10e-01

Pos logit contributions
ator+0.404
man+0.316
umm+0.290
icles+0.270
 birth+0.268
Neg logit contributions
bs-0.340
 slips-0.249
 net-0.244
 loses-0.240
 knot-0.239
Token surprise
Token ID5975
Feature activation+1.19e+00

Pos logit contributions
ator+0.473
man+0.370
umm+0.344
icles+0.322
ators+0.317
Neg logit contributions
bs-0.384
 slips-0.278
 net-0.273
ata-0.269
ip-0.268
Token.
Token ID13
Feature activation+2.02e+00

Pos logit contributions
ator+2.868
man+2.215
umm+2.152
icles+2.020
ators+2.005
Neg logit contributions
bs-1.940
 net-1.403
 slips-1.369
 knot-1.348
 tie-1.342
Token The
Token ID383
Feature activation+4.00e+00

Pos logit contributions
ator+3.540
man+2.337
umm+2.189
ators+2.061
icles+1.904
Neg logit contributions
bs-4.864
 net-3.973
 slips-3.869
-3.865
 knot-3.829
Token other
Token ID584
Feature activation+3.18e+00

Pos logit contributions
ator+11.262
umm+8.846
man+8.772
ators+8.655
icles+8.469
Neg logit contributions
bs-4.901
 net-4.392
-4.151
 loop-4.088
 rope-3.970
Token animals
Token ID4695
Feature activation-1.09e-01

Pos logit contributions
ator+6.943
man+4.920
umm+4.918
omm+4.429
 Amen+4.403
Neg logit contributions
bs-6.139
 net-4.966
-4.918
 dove-4.735
 loop-4.709
Token were
Token ID547
Feature activation+1.12e+00

Pos logit contributions
bs+0.209
 slips+0.159
 net+0.157
 knot+0.157
ip+0.154
Neg logit contributions
ator-0.232
man-0.179
umm-0.158
icles-0.152
ators-0.152
Token very
Token ID845
Feature activation+4.88e-01

Pos logit contributions
ator+2.630
man+2.036
umm+1.998
icles+1.867
ators+1.818
Neg logit contributions
bs-1.949
 net-1.428
 slips-1.399
 tie-1.366
 knot-1.354
Token excited
Token ID6568
Feature activation+1.13e+00

Pos logit contributions
ator+1.200
man+0.950
umm+0.893
icles+0.848
ators+0.839
Neg logit contributions
bs-0.792
 net-0.556
 slips-0.554
 knot-0.539
ip-0.527
Token happy
Token ID3772
Feature activation+1.54e+00

Pos logit contributions
ator+0.821
man+0.648
umm+0.604
icles+0.574
ators+0.568
Neg logit contributions
bs-0.577
 slips-0.412
 net-0.410
 knot-0.398
ip-0.393
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
ator+4.115
man+3.295
umm+3.177
icles+3.027
omm+2.998
Neg logit contributions
bs-2.183
 net-1.455
 slips-1.438
 loses-1.415
 knot-1.373
Token
Token ID198
Feature activation+5.45e-01

Pos logit contributions
ator+3.387
man+2.616
umm+2.501
ators+2.331
icles+2.324
Neg logit contributions
bs-2.407
 net-1.724
-1.718
 slips-1.696
 tie-1.682
Token
Token ID198
Feature activation+7.05e-01

Pos logit contributions
ator+1.372
man+1.094
umm+1.033
icles+0.983
ators+0.972
Neg logit contributions
bs-0.845
 net-0.581
 slips-0.579
 knot-0.561
ip-0.548
TokenSo
Token ID2396
Feature activation+1.23e+00

Pos logit contributions
ator+1.630
man+1.263
umm+1.193
icles+1.125
ators+1.108
Neg logit contributions
bs-1.262
 net-0.919
 slips-0.913
 knot-0.896
 loop-0.880
Token the
Token ID262
Feature activation+4.00e+00

Pos logit contributions
ator+3.494
man+2.801
umm+2.716
icles+2.601
ators+2.584
Neg logit contributions
bs-1.606
 net-1.024
 slips-0.981
 knot-0.969
 tie-0.940
Token King
Token ID2677
Feature activation-3.40e-01

Pos logit contributions
ator+8.540
icles+6.925
ators+6.508
umm+6.507
opher+6.073
Neg logit contributions
bs-7.701
 tie-6.157
 net-6.039
-5.866
 dove-5.749
Token and
Token ID290
Feature activation+1.15e+00

Pos logit contributions
bs+0.607
 slips+0.447
 net+0.445
 knot+0.444
 loses+0.429
Neg logit contributions
ator-0.778
man-0.632
umm-0.544
ators-0.527
icles-0.527
Token Queen
Token ID7542
Feature activation+6.14e-01

Pos logit contributions
ator+2.313
man+1.686
umm+1.628
icles+1.516
ators+1.483
Neg logit contributions
bs-2.382
 net-1.844
 knot-1.803
 slips-1.791
 tie-1.787
Token decided
Token ID3066
Feature activation+2.09e-01

Pos logit contributions
ator+1.352
man+1.036
umm+0.973
icles+0.913
ators+0.900
Neg logit contributions
bs-1.150
 net-0.850
 slips-0.847
 knot-0.826
ip-0.815
Token to
Token ID284
Feature activation+4.72e-01

Pos logit contributions
ator+0.472
man+0.370
umm+0.337
icles+0.324
ators+0.319
Neg logit contributions
bs-0.378
 slips-0.276
 net-0.274
 knot-0.270
ip-0.265
Token
Token ID198
Feature activation+8.35e-02

Pos logit contributions
bs+0.232
 net+0.169
 slips+0.165
 knot+0.165
 loses+0.159
Neg logit contributions
ator-0.329
man-0.260
umm-0.236
icles-0.231
ators-0.226
Token
Token ID198
Feature activation+3.84e-01

Pos logit contributions
ator+0.202
man+0.161
umm+0.147
icles+0.142
ators+0.140
Neg logit contributions
bs-0.138
 slips-0.098
 net-0.098
 knot-0.095
 loses-0.093
TokenOne
Token ID3198
Feature activation-6.48e-01

Pos logit contributions
ator+0.811
man+0.617
umm+0.567
icles+0.535
ators+0.527
Neg logit contributions
bs-0.765
 slips-0.576
 net-0.575
 knot-0.560
 loses-0.555
Token day
Token ID1110
Feature activation-9.16e-01

Pos logit contributions
bs+1.580
 slips+1.282
 loses+1.274
 net+1.260
 knot+1.257
Neg logit contributions
ator-1.038
man-0.765
umm-0.628
icles-0.577
 birth-0.562
Token,
Token ID11
Feature activation-6.14e-01

Pos logit contributions
bs+1.533
 knot+1.110
ata+1.102
 slips+1.084
 loses+1.082
Neg logit contributions
ator-2.110
man-1.705
umm-1.536
icles-1.531
 him-1.479
Token the
Token ID262
Feature activation+3.98e+00

Pos logit contributions
bs+0.986
 slips+0.686
ata+0.681
 loses+0.669
ip+0.669
Neg logit contributions
ator-1.489
man-1.214
umm-1.106
icles-1.094
 birth-1.047
Token p
Token ID279
Feature activation+2.10e+00

Pos logit contributions
ator+11.385
man+9.069
ators+8.763
umm+8.753
icles+8.415
Neg logit contributions
bs-5.297
 loop-4.778
 dove-4.312
 lime-4.296
 net-4.249
Tokenanda
Token ID5282
Feature activation-6.76e-01

Pos logit contributions
ator+4.280
man+3.211
umm+3.057
icles+2.838
ators+2.826
Neg logit contributions
bs-4.522
ip-3.451
 slips-3.450
op-3.365
 loop-3.257
Token went
Token ID1816
Feature activation+2.37e-01

Pos logit contributions
bs+1.310
 knot+0.981
 net+0.974
 slips+0.952
ata+0.924
Neg logit contributions
ator-1.414
man-1.136
umm-0.953
ators-0.930
icles-0.914
Token for
Token ID329
Feature activation-4.04e-01

Pos logit contributions
ator+0.530
man+0.414
umm+0.377
icles+0.360
ators+0.355
Neg logit contributions
bs-0.437
 slips-0.319
 net-0.317
 knot-0.313
ip-0.306
Token a
Token ID257
Feature activation-8.69e-01

Pos logit contributions
bs+0.551
 slips+0.361
 net+0.336
 knot+0.335
ip+0.334
Neg logit contributions
ator-1.093
man-0.886
umm-0.819
icles-0.817
 birth-0.806
Token kitchen
Token ID9592
Feature activation+5.37e-01

Pos logit contributions
ator+6.091
man+4.581
esh+4.327
icles+4.324
umm+4.312
Neg logit contributions
bs-3.941
 net-3.275
 slips-2.966
 lime-2.964
 knot-2.927
Token.
Token ID13
Feature activation+4.45e-01

Pos logit contributions
ator+1.321
man+1.046
umm+0.986
icles+0.937
ators+0.926
Neg logit contributions
bs-0.866
 net-0.606
 slips-0.604
 knot-0.586
ip-0.574
Token
Token ID220
Feature activation-3.41e-01

Pos logit contributions
ator+1.020
man+0.793
umm+0.743
icles+0.702
ators+0.692
Neg logit contributions
bs-0.795
 net-0.582
 slips-0.579
 knot-0.565
ip-0.554
Token
Token ID198
Feature activation-7.89e-02

Pos logit contributions
bs+0.577
 net+0.437
 knot+0.422
 slips+0.417
 loop+0.404
Neg logit contributions
ator-0.798
man-0.623
umm-0.570
icles-0.556
omm-0.544
Token
Token ID198
Feature activation+3.21e-01

Pos logit contributions
bs+0.131
 net+0.096
 slips+0.095
 knot+0.093
 loses+0.090
Neg logit contributions
ator-0.187
man-0.148
umm-0.134
icles-0.130
ators-0.128
TokenThe
Token ID464
Feature activation+3.97e+00

Pos logit contributions
ator+0.771
man+0.606
umm+0.565
icles+0.536
ators+0.533
Neg logit contributions
bs-0.539
 net-0.386
 slips-0.385
 knot-0.371
ip-0.367
Token poor
Token ID3595
Feature activation+3.51e+00

Pos logit contributions
ator+10.964
umm+8.435
man+8.352
icles+8.314
ators+8.276
Neg logit contributions
bs-5.496
 net-4.526
-4.205
 lime-4.063
 tie-4.000
Token dark
Token ID3223
Feature activation+2.42e+00

Pos logit contributions
ator+9.218
umm+7.173
ators+6.908
man+6.882
zie+6.543
Neg logit contributions
bs-5.064
 net-3.899
 dove-3.737
 lime-3.664
 tie-3.625
Token hipp
Token ID18568
Feature activation+8.20e-01

Pos logit contributions
ator+5.694
man+4.247
umm+4.076
ators+3.908
 Amen+3.880
Neg logit contributions
bs-4.144
 net-3.052
 slips-3.027
 loop-2.881
 knot-2.864
Tokeno
Token ID78
Feature activation+1.75e-01

Pos logit contributions
ator+1.680
man+1.246
umm+1.172
icles+1.093
ators+1.075
Neg logit contributions
bs-1.677
 slips-1.274
 net-1.268
ip-1.233
ata-1.229
Token never
Token ID1239
Feature activation-6.37e-01

Pos logit contributions
ator+0.394
man+0.308
umm+0.283
icles+0.268
ators+0.266
Neg logit contributions
bs-0.314
 net-0.230
 slips-0.228
 knot-0.224
ip-0.218
Token some
Token ID617
Feature activation-3.89e-01

Pos logit contributions
ator+0.163
man+0.133
umm+0.123
icles+0.120
ators+0.117
Neg logit contributions
bs-0.089
 slips-0.059
 net-0.057
 knot-0.057
 loses-0.056
Token food
Token ID2057
Feature activation+3.25e-01

Pos logit contributions
bs+0.810
 slips+0.631
 loses+0.616
ata+0.611
 knot+0.607
Neg logit contributions
ator-0.761
man-0.572
umm-0.532
icles-0.500
 birth-0.489
Token.
Token ID13
Feature activation-7.83e-01

Pos logit contributions
ator+0.705
man+0.544
umm+0.501
icles+0.474
ators+0.466
Neg logit contributions
bs-0.615
 slips-0.458
 net-0.455
 knot-0.448
 loses-0.440
Token
Token ID198
Feature activation-2.12e-01

Pos logit contributions
bs+1.873
 net+1.469
 slips+1.461
 loses+1.459
 knot+1.445
Neg logit contributions
ator-1.299
man-0.969
umm-0.851
icles-0.828
omm-0.756
Token
Token ID198
Feature activation-1.15e-01

Pos logit contributions
bs+0.360
 net+0.262
 slips+0.260
 knot+0.255
 loses+0.248
Neg logit contributions
ator-0.497
man-0.397
umm-0.353
icles-0.344
ators-0.337
TokenThe
Token ID464
Feature activation+3.95e+00

Pos logit contributions
bs+0.193
 slips+0.136
 net+0.135
 loses+0.130
 knot+0.129
Neg logit contributions
ator-0.278
man-0.226
umm-0.205
icles-0.197
ators-0.193
Token hipp
Token ID18568
Feature activation+1.47e+00

Pos logit contributions
ator+10.565
umm+8.150
ators+8.093
opher+7.728
zie+7.632
Neg logit contributions
bs-5.462
 net-4.953
 dove-4.511
 tie-4.491
 lime-4.455
Tokeno
Token ID78
Feature activation+3.68e-01

Pos logit contributions
ator+3.135
man+2.292
umm+2.275
icles+2.077
 birth+2.074
Neg logit contributions
bs-2.862
 slips-2.113
ip-2.104
 net-2.104
op-2.101
Token went
Token ID1816
Feature activation+1.17e+00

Pos logit contributions
ator+0.858
man+0.676
umm+0.626
icles+0.594
ators+0.588
Neg logit contributions
bs-0.638
 net-0.460
 slips-0.459
 knot-0.450
 loses-0.440
Token to
Token ID284
Feature activation+1.68e-01

Pos logit contributions
ator+2.877
man+2.206
umm+2.191
icles+2.027
ators+2.020
Neg logit contributions
bs-1.946
 net-1.407
 slips-1.368
 tie-1.328
ip-1.312
Token the
Token ID262
Feature activation+1.55e+00

Pos logit contributions
ator+0.405
man+0.321
umm+0.295
icles+0.285
ators+0.279
Neg logit contributions
bs-0.285
 slips-0.201
 net-0.197
ip-0.195
 knot-0.195
Token hall
Token ID6899
Feature activation+2.23e-01

Pos logit contributions
ator+6.513
man+4.929
icles+4.915
umm+4.785
omm+4.391
Neg logit contributions
bs-6.790
 loop-5.607
 net-5.345
 slips-5.247
 knot-5.236
Tokenways
Token ID1322
Feature activation-1.75e-01

Pos logit contributions
ator+0.559
man+0.446
umm+0.419
icles+0.398
ators+0.394
Neg logit contributions
bs-0.347
 net-0.240
 slips-0.238
 knot-0.234
ip-0.227
Token and
Token ID290
Feature activation+1.52e-01

Pos logit contributions
bs+0.277
 net+0.195
 slips+0.194
 knot+0.192
ip+0.184
Neg logit contributions
ator-0.431
man-0.343
umm-0.318
icles-0.304
ators-0.301
Token up
Token ID510
Feature activation+4.61e-02

Pos logit contributions
ator+0.345
man+0.269
umm+0.247
icles+0.235
ators+0.235
Neg logit contributions
bs-0.270
 net-0.199
 slips-0.197
 knot-0.195
ip-0.190
Token the
Token ID262
Feature activation+3.12e+00

Pos logit contributions
ator+0.110
man+0.086
umm+0.080
icles+0.076
ators+0.076
Neg logit contributions
bs-0.077
 net-0.055
 slips-0.055
 knot-0.054
ip-0.053
Token grand
Token ID4490
Feature activation+3.89e+00

Pos logit contributions
ator+6.790
man+5.193
icles+5.118
umm+5.094
omm+4.780
Neg logit contributions
bs-6.314
 loop-4.825
 net-4.737
 slips-4.709
 tie-4.634
Token stair
Token ID18761
Feature activation+1.61e-01

Pos logit contributions
ator+7.487
man+6.189
umm+5.706
 Amen+5.544
icles+5.540
Neg logit contributions
bs-7.807
 loop-6.067
 knot-6.001
 loses-5.937
 net-5.871
Tokenways
Token ID1322
Feature activation-2.74e-01

Pos logit contributions
ator+0.380
man+0.300
umm+0.278
icles+0.265
ators+0.263
Neg logit contributions
bs-0.273
 net-0.197
 slips-0.194
 knot-0.190
ip-0.187
Token.
Token ID13
Feature activation+1.81e-01

Pos logit contributions
bs+0.450
 net+0.328
 slips+0.323
 knot+0.321
ip+0.308
Neg logit contributions
ator-0.656
man-0.517
umm-0.479
icles-0.457
ators-0.451
Token Everywhere
Token ID48201
Feature activation-9.03e-01

Pos logit contributions
ator+0.380
man+0.287
umm+0.272
icles+0.252
 birth+0.248
Neg logit contributions
bs-0.355
 net-0.269
 slips-0.267
 knot-0.261
ata-0.258
Token she
Token ID673
Feature activation+6.94e-01

Pos logit contributions
bs+1.401
ata+1.010
 net+0.992
 slips+0.987
 loop+0.960
Neg logit contributions
ator-2.192
man-1.749
umm-1.658
 birth-1.607
ators-1.561
Token imagined
Token ID15758
Feature activation+2.32e-01

Pos logit contributions
ator+0.375
man+0.291
umm+0.265
ators+0.250
icles+0.250
Neg logit contributions
bs-0.323
 net-0.239
 slips-0.237
 knot-0.234
ip-0.230
Token a
Token ID257
Feature activation-6.63e-02

Pos logit contributions
ator+0.582
man+0.462
umm+0.431
icles+0.416
ators+0.411
Neg logit contributions
bs-0.365
 net-0.253
 slips-0.251
 knot-0.246
ip-0.239
Token big
Token ID1263
Feature activation+2.51e-02

Pos logit contributions
bs+0.123
 slips+0.089
 net+0.087
 loses+0.086
ip+0.086
Neg logit contributions
ator-0.148
man-0.116
umm-0.106
 birth-0.099
ators-0.098
Token palace
Token ID20562
Feature activation-6.99e-02

Pos logit contributions
ator+0.053
man+0.040
umm+0.037
icles+0.034
ators+0.034
Neg logit contributions
bs-0.050
 slips-0.037
 net-0.037
ata-0.036
 loses-0.036
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
bs+0.144
 net+0.110
 slips+0.109
 knot+0.107
 loses+0.106
Neg logit contributions
ator-0.143
man-0.109
umm-0.098
ators-0.092
icles-0.091
Token The
Token ID383
Feature activation+3.87e+00

Pos logit contributions
bs+2.443
 net+1.976
 loses+1.956
 slips+1.955
 knot+1.899
Neg logit contributions
ator-2.181
man-1.616
umm-1.525
icles-1.485
omm-1.454
Token palace
Token ID20562
Feature activation+6.04e-01

Pos logit contributions
ator+9.276
umm+7.278
icles+6.982
rim+6.894
man+6.837
Neg logit contributions
bs-6.996
 net-5.577
 tie-5.433
 loop-5.390
 knot-5.368
Token was
Token ID373
Feature activation+1.48e+00

Pos logit contributions
ator+1.306
man+0.995
umm+0.931
icles+0.875
ators+0.861
Neg logit contributions
bs-1.161
 net-0.866
 slips-0.865
 knot-0.845
ip-0.830
Token very
Token ID845
Feature activation+1.57e+00

Pos logit contributions
ator+3.826
man+3.007
umm+2.961
icles+2.749
ators+2.729
Neg logit contributions
bs-2.232
 slips-1.523
 net-1.522
 tie-1.491
 loop-1.491
Token pretty
Token ID2495
Feature activation-3.28e-01

Pos logit contributions
ator+4.128
man+3.291
umm+3.154
icles+3.039
ators+3.017
Neg logit contributions
bs-2.443
 slips-1.615
 net-1.591
 knot-1.543
 loop-1.539
Token.
Token ID13
Feature activation-8.50e-01

Pos logit contributions
bs+0.516
 slips+0.352
 net+0.345
 loses+0.330
 knot+0.325
Neg logit contributions
ator-0.838
man-0.674
umm-0.638
icles-0.600
ators-0.593
Token liked
Token ID8288
Feature activation+8.01e-01

Pos logit contributions
bs+0.390
 net+0.292
 slips+0.292
ata+0.283
 knot+0.282
Neg logit contributions
ator-0.429
man-0.331
umm-0.296
icles-0.285
 birth-0.284
Token the
Token ID262
Feature activation+7.74e-01

Pos logit contributions
ator+2.046
man+1.629
umm+1.552
icles+1.461
ators+1.456
Neg logit contributions
bs-1.228
 net-0.854
 slips-0.839
 knot-0.815
 loop-0.799
Token sweet
Token ID6029
Feature activation+1.05e+00

Pos logit contributions
ator+1.930
man+1.527
umm+1.443
icles+1.378
ators+1.368
Neg logit contributions
bs-1.242
 net-0.881
 slips-0.863
 knot-0.854
 tie-0.829
Token smell
Token ID8508
Feature activation-2.61e-01

Pos logit contributions
ator+2.316
man+1.764
umm+1.633
ators+1.564
icles+1.537
Neg logit contributions
bs-1.992
 net-1.488
 slips-1.459
 knot-1.456
 tie-1.427
Token.
Token ID13
Feature activation+2.85e-01

Pos logit contributions
bs+0.387
 slips+0.264
 net+0.259
 knot+0.254
 loop+0.248
Neg logit contributions
ator-0.661
man-0.539
umm-0.507
icles-0.483
 birth-0.473
Token The
Token ID383
Feature activation+3.86e+00

Pos logit contributions
ator+0.622
man+0.479
umm+0.444
icles+0.422
ators+0.411
Neg logit contributions
bs-0.545
 net-0.406
 slips-0.404
 knot-0.395
 loses-0.390
Token hipp
Token ID18568
Feature activation+1.53e+00

Pos logit contributions
ator+11.044
ators+8.734
umm+8.668
man+8.554
icles+8.280
Neg logit contributions
bs-4.848
 net-4.097
 dove-3.889
 lime-3.859
 tie-3.804
Tokeno
Token ID78
Feature activation+2.82e-01

Pos logit contributions
ator+3.312
man+2.431
umm+2.379
 birth+2.204
icles+2.191
Neg logit contributions
bs-3.002
 slips-2.204
op-2.194
ip-2.193
 net-2.186
Token skipped
Token ID26684
Feature activation+9.43e-01

Pos logit contributions
ator+0.625
man+0.485
umm+0.448
icles+0.423
ators+0.419
Neg logit contributions
bs-0.517
 net-0.381
 slips-0.381
 knot-0.371
 loses-0.364
Token on
Token ID319
Feature activation+4.36e-01

Pos logit contributions
ator+2.054
man+1.542
umm+1.467
icles+1.363
ators+1.357
Neg logit contributions
bs-1.833
 net-1.392
 slips-1.365
 knot-1.347
 loop-1.330
Token and
Token ID290
Feature activation-3.09e-01

Pos logit contributions
ator+0.993
man+0.770
umm+0.719
icles+0.681
ators+0.671
Neg logit contributions
bs-0.792
 slips-0.576
 net-0.574
 knot-0.559
ip-0.552
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
ator+1.095
man+0.875
umm+0.821
icles+0.783
ators+0.774
Neg logit contributions
bs-0.683
 slips-0.471
 net-0.471
 knot-0.456
ip-0.448
Token cheered
Token ID34767
Feature activation+1.62e+00

Pos logit contributions
bs+1.937
op+1.474
ip+1.445
 knot+1.442
 slips+1.429
Neg logit contributions
ator-2.383
man-1.946
umm-1.650
icles-1.627
 birth-1.617
Token.
Token ID13
Feature activation+3.63e-01

Pos logit contributions
ator+4.423
man+3.490
umm+3.445
icles+3.243
ators+3.239
Neg logit contributions
bs-2.264
 net-1.446
 knot-1.408
 slips-1.407
-1.406
Token They
Token ID1119
Feature activation-5.38e-01

Pos logit contributions
ator+0.795
man+0.611
umm+0.570
icles+0.537
ators+0.528
Neg logit contributions
bs-0.687
 net-0.510
 slips-0.508
 knot-0.497
ip-0.489
Token thanked
Token ID26280
Feature activation+9.75e-01

Pos logit contributions
bs+1.027
 slips+0.771
 knot+0.768
 net+0.759
ip+0.752
Neg logit contributions
ator-1.125
man-0.872
umm-0.762
icles-0.741
ators-0.736
Token the
Token ID262
Feature activation+3.83e+00

Pos logit contributions
ator+2.557
man+2.047
umm+1.966
icles+1.842
ators+1.836
Neg logit contributions
bs-1.439
 net-0.978
 slips-0.948
 knot-0.922
 tie-0.910
Token hipp
Token ID18568
Feature activation+2.02e+00

Pos logit contributions
ator+8.340
umm+6.285
ators+6.049
man+5.721
icles+5.651
Neg logit contributions
bs-7.315
 net-6.471
 lime-6.223
 dove-6.204
 loop-6.175
Tokeno
Token ID78
Feature activation+2.13e-01

Pos logit contributions
ator+4.507
umm+3.349
man+3.240
 birth+3.068
ators+3.025
Neg logit contributions
bs-3.831
op-2.810
ip-2.799
 slips-2.711
 net-2.702
Token for
Token ID329
Feature activation+1.45e+00

Pos logit contributions
ator+0.518
man+0.415
umm+0.382
icles+0.367
ators+0.361
Neg logit contributions
bs-0.348
 slips-0.245
 net-0.244
 knot-0.238
 loses-0.233
Token his
Token ID465
Feature activation+3.84e-01

Pos logit contributions
ator+3.347
man+2.588
umm+2.546
icles+2.323
ators+2.291
Neg logit contributions
bs-2.528
 net-1.957
 tie-1.862
 loop-1.849
 slips-1.845
Token show
Token ID905
Feature activation+7.45e-02

Pos logit contributions
ator+0.784
man+0.591
umm+0.544
icles+0.509
ators+0.501
Neg logit contributions
bs-0.781
 slips-0.593
 net-0.589
 knot-0.578
ip-0.573
Token.
Token ID13
Feature activation-3.70e-02

Pos logit contributions
ator+1.193
man+0.945
umm+0.891
icles+0.846
ators+0.837
Neg logit contributions
bs-0.782
 net-0.546
 slips-0.546
 knot-0.529
ip-0.518
Token The
Token ID383
Feature activation+1.75e+00

Pos logit contributions
bs+0.072
 net+0.054
 slips+0.054
 knot+0.053
 loses+0.052
Neg logit contributions
ator-0.079
man-0.061
umm-0.056
icles-0.053
ators-0.052
Token elderly
Token ID13830
Feature activation+2.01e+00

Pos logit contributions
ator+3.483
man+2.555
umm+2.408
icles+2.325
ators+2.216
Neg logit contributions
bs-3.667
 net-2.966
 knot-2.889
 loop-2.872
-2.831
Token rh
Token ID9529
Feature activation+6.54e-01

Pos logit contributions
ator+4.232
umm+3.129
man+3.088
icles+3.026
ators+2.917
Neg logit contributions
bs-3.893
 net-3.069
 tie-2.972
-2.940
 loop-2.931
Tokenin
Token ID259
Feature activation+1.53e+00

Pos logit contributions
ator+1.479
man+1.144
umm+1.077
icles+1.017
ators+1.001
Neg logit contributions
bs-1.185
 net-0.869
 slips-0.864
 knot-0.844
ip-0.831
Tokenoc
Token ID420
Feature activation+3.83e+00

Pos logit contributions
ator+3.981
umm+3.096
man+3.082
icles+2.973
ators+2.904
Neg logit contributions
bs-2.252
 net-1.524
-1.489
ip-1.479
 slips-1.477
Tokeneros
Token ID27498
Feature activation+3.86e-01

Pos logit contributions
ator+5.435
 birth+3.389
 Amen+3.244
man+3.217
zie+3.123
Neg logit contributions
bs-9.499
ip-8.308
 net-8.219
op-8.161
 knot-8.035
Token was
Token ID373
Feature activation+8.22e-01

Pos logit contributions
ator+0.846
man+0.652
umm+0.603
icles+0.568
ators+0.562
Neg logit contributions
bs-0.721
 net-0.533
 slips-0.532
 knot-0.520
 loses-0.510
Token so
Token ID523
Feature activation+4.76e-01

Pos logit contributions
ator+1.912
man+1.485
umm+1.410
icles+1.330
ators+1.311
Neg logit contributions
bs-1.439
 net-1.044
 slips-1.038
 knot-1.009
-0.998
Token happy
Token ID3772
Feature activation+7.80e-01

Pos logit contributions
ator+1.171
man+0.927
umm+0.874
icles+0.828
ators+0.819
Neg logit contributions
bs-0.772
 net-0.542
 slips-0.540
 knot-0.525
ip-0.513
Token to
Token ID284
Feature activation+1.51e+00

Pos logit contributions
ator+1.975
man+1.569
umm+1.489
icles+1.417
ators+1.407
Neg logit contributions
bs-1.207
 net-0.837
 slips-0.831
 knot-0.802
 loses-0.786
Token strong
Token ID1913
Feature activation-6.26e-01

Pos logit contributions
ator+1.233
man+0.956
umm+0.895
icles+0.845
ators+0.835
Neg logit contributions
bs-0.973
 net-0.711
 slips-0.709
 knot-0.691
ip-0.678
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
bs+0.999
 slips+0.671
 net+0.653
 knot+0.630
ip+0.627
Neg logit contributions
ator-1.576
man-1.291
umm-1.218
icles-1.118
ators-1.113
Token
Token ID220
Feature activation-1.91e-01

Pos logit contributions
ator+0.330
man+0.256
umm+0.237
icles+0.225
ators+0.218
Neg logit contributions
bs-0.282
 net-0.209
 slips-0.208
 knot-0.204
 loses-0.201
Token
Token ID198
Feature activation-1.90e-01

Pos logit contributions
bs+0.324
 net+0.238
 knot+0.232
 slips+0.232
 loses+0.223
Neg logit contributions
ator-0.454
man-0.359
umm-0.325
icles-0.319
ators-0.311
Token
Token ID198
Feature activation+8.24e-02

Pos logit contributions
bs+0.325
 net+0.235
 slips+0.233
 knot+0.230
 loses+0.223
Neg logit contributions
ator-0.449
man-0.357
umm-0.319
icles-0.312
ators-0.305
TokenThe
Token ID464
Feature activation+3.82e+00

Pos logit contributions
ator+0.205
man+0.164
umm+0.152
icles+0.145
ators+0.144
Neg logit contributions
bs-0.133
 slips-0.092
 net-0.092
 knot-0.088
 loses-0.088
Token hipp
Token ID18568
Feature activation+1.53e+00

Pos logit contributions
ator+8.705
umm+6.545
ators+6.347
opher+6.077
icles+6.075
Neg logit contributions
bs-6.749
 net-6.022
 tie-5.680
 dove-5.650
 lime-5.598
Tokeno
Token ID78
Feature activation+5.07e-01

Pos logit contributions
ator+3.280
man+2.406
umm+2.378
icles+2.183
 birth+2.182
Neg logit contributions
bs-2.983
op-2.199
 slips-2.192
ip-2.188
 net-2.172
Token found
Token ID1043
Feature activation+8.08e-03

Pos logit contributions
ator+1.161
man+0.903
umm+0.845
icles+0.798
ators+0.789
Neg logit contributions
bs-0.903
 net-0.657
 slips-0.655
 knot-0.639
ip-0.626
Token some
Token ID617
Feature activation-6.77e-01

Pos logit contributions
ator+0.022
man+0.018
umm+0.017
icles+0.016
ators+0.016
Neg logit contributions
bs-0.011
 slips-0.007
 knot-0.007
 net-0.007
 loses-0.007
Token vegetables
Token ID13701
Feature activation-7.19e-01

Pos logit contributions
bs+1.299
 slips+0.984
 loses+0.972
ata+0.963
ip+0.946
Neg logit contributions
ator-1.431
man-1.104
umm-1.028
icles-0.976
 birth-0.943
Tokeno
Token ID78
Feature activation+2.52e-01

Pos logit contributions
ator+2.885
man+2.117
umm+2.054
ators+1.885
icles+1.882
Neg logit contributions
bs-2.692
 slips-1.992
 net-1.987
ip-1.973
-1.957
Token was
Token ID373
Feature activation+7.44e-01

Pos logit contributions
ator+0.525
man+0.400
umm+0.366
icles+0.346
ators+0.341
Neg logit contributions
bs-0.496
 net-0.377
 slips-0.375
 knot-0.368
 loop-0.359
Token so
Token ID523
Feature activation+1.11e+00

Pos logit contributions
ator+1.804
man+1.420
umm+1.349
icles+1.275
ators+1.260
Neg logit contributions
bs-1.235
 net-0.881
 slips-0.871
 knot-0.843
 loses-0.830
Token attractive
Token ID10966
Feature activation+2.66e-01

Pos logit contributions
ator+2.742
man+2.168
umm+2.056
icles+1.963
ators+1.955
Neg logit contributions
bs-1.860
 net-1.288
 slips-1.277
-1.220
 knot-1.216
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
ator+0.648
man+0.514
umm+0.483
icles+0.460
ators+0.453
Neg logit contributions
bs-0.435
 slips-0.304
 net-0.304
 knot-0.296
ata-0.289
Token The
Token ID383
Feature activation+3.82e+00

Pos logit contributions
ator+0.274
man+0.216
umm+0.201
icles+0.194
ators+0.188
Neg logit contributions
bs-0.203
 slips-0.144
 net-0.143
 knot-0.142
 loses-0.139
Token hipp
Token ID18568
Feature activation+1.72e+00

Pos logit contributions
ator+8.460
umm+6.137
ators+6.058
man+5.969
esh+5.718
Neg logit contributions
bs-7.226
 net-6.395
 dove-6.060
 lime-6.034
 tie-5.969
Tokeno
Token ID78
Feature activation+5.38e-01

Pos logit contributions
ator+3.768
man+2.770
umm+2.752
icles+2.522
 birth+2.522
Neg logit contributions
bs-3.311
op-2.427
 slips-2.415
ip-2.410
 net-2.385
Token was
Token ID373
Feature activation+8.13e-01

Pos logit contributions
ator+1.230
man+0.956
umm+0.895
icles+0.846
ators+0.835
Neg logit contributions
bs-0.960
 net-0.699
 slips-0.697
 knot-0.680
ip-0.667
Token excited
Token ID6568
Feature activation+6.18e-01

Pos logit contributions
ator+1.954
man+1.528
umm+1.458
icles+1.376
ators+1.360
Neg logit contributions
bs-1.372
 net-0.988
 slips-0.979
 knot-0.943
 tie-0.934
Token to
Token ID284
Feature activation+6.44e-01

Pos logit contributions
ator+1.577
man+1.257
umm+1.189
icles+1.131
ators+1.122
Neg logit contributions
bs-0.953
 net-0.654
 slips-0.650
 knot-0.629
 loses-0.615
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
bs+0.221
 net+0.154
 knot+0.154
 slips+0.153
ip+0.148
Neg logit contributions
ator-0.317
man-0.257
umm-0.230
icles-0.225
 birth-0.222
Token gorilla
Token ID45314
Feature activation-3.42e-01

Pos logit contributions
ator+2.885
man+2.096
umm+2.020
icles+1.888
ators+1.862
Neg logit contributions
bs-2.659
 net-2.121
 slips-2.059
 knot-2.030
 loop-2.017
Token.
Token ID13
Feature activation-7.20e-01

Pos logit contributions
bs+0.578
 slips+0.414
 net+0.405
 knot+0.400
 loses+0.388
Neg logit contributions
ator-0.816
man-0.662
umm-0.587
icles-0.568
ators-0.560
Token
Token ID198
Feature activation-4.25e-01

Pos logit contributions
bs+1.477
 knot+1.148
 net+1.133
 slips+1.122
ata+1.103
Neg logit contributions
ator-1.398
man-1.078
umm-0.980
icles-0.970
omm-0.930
Token
Token ID198
Feature activation-6.95e-01

Pos logit contributions
bs+0.739
 net+0.542
 slips+0.534
 knot+0.531
 loses+0.518
Neg logit contributions
ator-0.984
man-0.789
icles-0.693
umm-0.685
ators-0.670
TokenThe
Token ID464
Feature activation+3.82e+00

Pos logit contributions
bs+1.364
 net+1.041
 slips+1.030
 knot+1.016
 loses+1.016
Neg logit contributions
ator-1.452
man-1.123
icles-0.986
umm-0.978
ators-0.956
Token gorilla
Token ID45314
Feature activation+1.24e+00

Pos logit contributions
ator+8.450
umm+6.146
icles+5.777
ators+5.745
esh+5.654
Neg logit contributions
bs-7.171
 net-6.624
 dove-6.446
 lime-6.131
 rope-6.105
Token was
Token ID373
Feature activation+6.79e-01

Pos logit contributions
ator+2.900
man+2.179
umm+2.168
icles+2.023
ators+1.981
Neg logit contributions
bs-2.191
 slips-1.616
 net-1.600
 loses-1.539
 knot-1.536
Token big
Token ID1263
Feature activation-4.54e-01

Pos logit contributions
ator+1.670
man+1.314
umm+1.251
icles+1.184
ators+1.170
Neg logit contributions
bs-1.104
 net-0.777
 slips-0.775
 knot-0.749
 loses-0.735
Token and
Token ID290
Feature activation+2.57e-01

Pos logit contributions
bs+0.677
 slips+0.424
 net+0.421
ata+0.418
ip+0.409
Neg logit contributions
ator-1.174
man-0.976
umm-0.898
ators-0.851
icles-0.848
Token strong
Token ID1913
Feature activation-6.88e-02

Pos logit contributions
ator+0.598
man+0.475
umm+0.438
icles+0.418
ators+0.412
Neg logit contributions
bs-0.440
 net-0.316
 slips-0.315
 knot-0.307
ata-0.302
Token was
Token ID373
Feature activation-8.55e-01

Pos logit contributions
ator+1.897
man+1.430
umm+1.357
icles+1.256
ators+1.246
Neg logit contributions
bs-1.723
 slips-1.306
 net-1.304
 knot-1.271
 loop-1.259
Token a
Token ID257
Feature activation-8.02e-01

Pos logit contributions
bs+1.269
 slips+0.867
 loses+0.839
 knot+0.802
ip+0.788
Neg logit contributions
ator-2.215
man-1.782
umm-1.668
 birth-1.667
icles-1.637
Token p
Token ID279
Feature activation+5.82e-02

Pos logit contributions
bs+1.440
 slips+1.045
 loses+1.034
ata+1.022
 net+0.975
Neg logit contributions
ator-1.829
man-1.487
umm-1.323
 birth-1.243
omm-1.220
Tokenanda
Token ID5282
Feature activation-5.57e-02

Pos logit contributions
ator+0.107
man+0.077
umm+0.069
icles+0.063
ators+0.063
Neg logit contributions
bs-0.129
 net-0.102
 slips-0.102
 knot-0.101
 loses-0.099
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
bs+0.089
 slips+0.063
 net+0.062
 knot+0.061
 loses+0.060
Neg logit contributions
ator-0.138
man-0.112
umm-0.101
icles-0.097
ators-0.096
Token The
Token ID383
Feature activation+3.80e+00

Pos logit contributions
ator+2.725
man+2.062
umm+1.928
ators+1.849
icles+1.795
Neg logit contributions
bs-2.198
 net-1.627
 slips-1.616
 knot-1.581
-1.558
Token p
Token ID279
Feature activation+2.17e+00

Pos logit contributions
ator+10.483
man+8.187
umm+8.131
ators+8.111
icles+7.598
Neg logit contributions
bs-5.234
 loop-4.849
 net-4.526
 dove-4.475
 lime-4.418
Tokenanda
Token ID5282
Feature activation+8.65e-01

Pos logit contributions
ator+4.418
man+3.337
umm+3.170
icles+2.970
ators+2.914
Neg logit contributions
bs-4.677
ip-3.576
 slips-3.543
op-3.493
 net-3.398
Token was
Token ID373
Feature activation+1.20e+00

Pos logit contributions
ator+1.787
man+1.322
umm+1.271
icles+1.179
ators+1.150
Neg logit contributions
bs-1.756
 slips-1.328
 net-1.326
ip-1.292
 knot-1.290
Token very
Token ID845
Feature activation+1.67e+00

Pos logit contributions
ator+2.891
man+2.217
umm+2.175
ators+2.032
icles+2.028
Neg logit contributions
bs-2.046
 net-1.490
 slips-1.465
 tie-1.428
 knot-1.414
Token gloomy
Token ID46400
Feature activation+1.34e-01

Pos logit contributions
ator+4.491
man+3.559
umm+3.454
ators+3.381
icles+3.332
Neg logit contributions
bs-2.574
 net-1.675
 slips-1.673
 tie-1.606
ip-1.596
Token asked
Token ID1965
Feature activation+7.90e-01

Pos logit contributions
ator+0.402
man+0.319
umm+0.288
icles+0.277
ators+0.274
Neg logit contributions
bs-0.304
 slips-0.221
 net-0.220
 knot-0.217
ip-0.212
Token if
Token ID611
Feature activation+7.22e-01

Pos logit contributions
ator+2.053
man+1.637
umm+1.570
icles+1.477
ators+1.467
Neg logit contributions
bs-1.194
 net-0.802
 slips-0.794
 knot-0.770
ip-0.760
Token he
Token ID339
Feature activation-6.26e-01

Pos logit contributions
ator+1.656
man+1.279
umm+1.211
icles+1.135
ators+1.124
Neg logit contributions
bs-1.296
 net-0.946
 slips-0.940
 knot-0.912
ip-0.900
Token could
Token ID714
Feature activation-6.36e-01

Pos logit contributions
bs+1.089
 net+0.831
 slips+0.800
 knot+0.793
ata+0.780
Neg logit contributions
ator-1.385
man-1.128
umm-0.999
icles-0.951
 birth-0.948
Token borrow
Token ID8804
Feature activation-1.62e+00

Pos logit contributions
bs+1.048
 net+0.753
 slips+0.753
ip+0.723
ata+0.722
Neg logit contributions
ator-1.520
man-1.230
umm-1.077
icles-1.067
 birth-1.036
Token a
Token ID257
Feature activation-4.19e+00

Pos logit contributions
bs+3.602
 slips+2.911
ata+2.878
op+2.763
 net+2.710
Neg logit contributions
ator-2.903
 him-2.186
man-2.132
icles-2.006
 himself-1.960
Token strip
Token ID10283
Feature activation-7.82e-01

Pos logit contributions
bs+8.615
imes+6.870
 loses+6.250
ume+6.162
 slips+6.080
Neg logit contributions
ator-8.503
man-7.993
 me-7.163
 us-6.387
 next-6.067
Token.
Token ID13
Feature activation+6.60e-01

Pos logit contributions
bs+1.232
 slips+0.864
 net+0.823
 loses+0.823
 knot+0.800
Neg logit contributions
ator-1.957
man-1.628
umm-1.449
icles-1.421
ators-1.352
Token But
Token ID887
Feature activation-2.79e-01

Pos logit contributions
ator+1.574
man+1.232
umm+1.163
icles+1.097
ators+1.090
Neg logit contributions
bs-1.124
 net-0.802
 slips-0.800
 knot-0.775
ip-0.763
Token,
Token ID11
Feature activation-3.23e-01

Pos logit contributions
bs+0.555
 slips+0.418
 net+0.412
 knot+0.410
ata+0.408
Neg logit contributions
ator-0.586
man-0.456
umm-0.399
icles-0.394
ators-0.379
Token his
Token ID465
Feature activation-1.54e+00

Pos logit contributions
bs+0.624
 slips+0.470
 net+0.460
 knot+0.457
 loses+0.455
Neg logit contributions
ator-0.695
man-0.543
umm-0.483
icles-0.474
ators-0.457
Token branch
Token ID8478
Feature activation-1.03e+00

Pos logit contributions
bs+3.318
ip+2.313
ata+2.297
 loses+2.259
ver+2.229
Neg logit contributions
ator-4.382
man-3.783
umm-3.115
 birth-3.046
icles-2.942
Token.
Token ID13
Feature activation-2.74e-01

Pos logit contributions
bs+1.779
ip+1.277
 net+1.275
 slips+1.251
 loses+1.248
Neg logit contributions
ator-2.350
man-1.978
icles-1.656
umm-1.628
ators-1.588
Token They
Token ID1119
Feature activation-1.97e+00

Pos logit contributions
bs+0.547
 net+0.413
 slips+0.413
 knot+0.408
ip+0.404
Neg logit contributions
ator-0.564
man-0.423
umm-0.404
icles-0.385
 birth-0.364
Token look
Token ID804
Feature activation-2.54e-01

Pos logit contributions
bs+3.333
ip+2.628
ata+2.476
op+2.474
ching+2.470
Neg logit contributions
ator-4.287
man-3.329
 him-3.027
ators-2.984
 birth-2.908
Token for
Token ID329
Feature activation-1.30e+00

Pos logit contributions
bs+0.470
 slips+0.356
 net+0.354
ata+0.347
 knot+0.346
Neg logit contributions
ator-0.543
man-0.419
umm-0.377
icles-0.364
ators-0.361
Token another
Token ID1194
Feature activation-3.96e+00

Pos logit contributions
bs+2.321
 slips+1.750
ata+1.740
ip+1.727
op+1.670
Neg logit contributions
ator-2.974
man-2.249
 him-2.168
icles-2.156
umm-2.089
Token tree
Token ID5509
Feature activation-1.55e+00

Pos logit contributions
bs+8.769
imes+6.377
ip+6.320
 loses+6.268
azz+6.267
Neg logit contributions
ator-8.187
man-7.194
 next-5.952
 me-5.691
 us-5.522
Token.
Token ID13
Feature activation-6.21e-02

Pos logit contributions
bs+2.948
 slips+2.185
 knot+2.158
ip+2.124
 net+2.105
Neg logit contributions
ator-3.335
man-2.785
umm-2.242
icles-2.215
ators-2.190
Token They
Token ID1119
Feature activation-1.13e+00

Pos logit contributions
bs+0.123
 net+0.093
 slips+0.093
 knot+0.092
ip+0.091
Neg logit contributions
ator-0.129
man-0.097
umm-0.092
icles-0.087
ators-0.083
Token see
Token ID766
Feature activation-1.41e+00

Pos logit contributions
bs+1.855
ip+1.415
ata+1.351
ching+1.340
 net+1.324
Neg logit contributions
ator-2.554
man-2.005
 him-1.774
ators-1.765
umm-1.741
Token one
Token ID530
Feature activation-1.98e+00

Pos logit contributions
bs+2.262
ata+1.615
ip+1.594
op+1.553
 slips+1.499
Neg logit contributions
ator-3.446
man-2.863
 him-2.613
icles-2.609
 birth-2.498
Token "
Token ID366
Feature activation+9.97e-01

Pos logit contributions
ator+1.136
man+0.884
umm+0.826
icles+0.782
ators+0.771
Neg logit contributions
bs-0.887
 net-0.646
 slips-0.645
 knot-0.628
 loses-0.616
TokenHow
Token ID2437
Feature activation-2.43e+00

Pos logit contributions
ator+1.919
man+1.388
umm+1.304
icles+1.209
ators+1.165
Neg logit contributions
bs-2.158
 net-1.685
 slips-1.668
 knot-1.651
 tie-1.642
Token do
Token ID466
Feature activation-1.60e-02

Pos logit contributions
bs+4.751
 net+3.703
ata+3.496
ip+3.482
 knot+3.478
Neg logit contributions
ator-5.088
man-4.010
 birth-3.476
icles-3.461
 him-3.417
Token you
Token ID345
Feature activation-1.26e+00

Pos logit contributions
bs+0.034
 slips+0.026
 net+0.026
ata+0.025
 knot+0.025
Neg logit contributions
ator-0.031
man-0.023
umm-0.021
icles-0.020
 birth-0.020
Token snap
Token ID11495
Feature activation-1.35e+00

Pos logit contributions
bs+2.133
ip+1.561
 net+1.514
 slips+1.471
ata+1.419
Neg logit contributions
ator-2.971
man-2.304
umm-2.102
 him-2.051
icles-2.038
Token a
Token ID257
Feature activation-3.86e+00

Pos logit contributions
bs+2.691
ata+2.037
 slips+1.999
 net+1.994
ip+1.933
Neg logit contributions
ator-2.800
man-2.116
icles-1.903
ators-1.891
 us-1.850
Token knot
Token ID29654
Feature activation+3.48e-02

Pos logit contributions
bs+5.782
imes+3.793
azz+3.616
ata+3.493
ip+3.396
Neg logit contributions
ator-10.225
man-9.002
 me-8.477
umm-8.099
 us-7.922
Token?"
Token ID1701
Feature activation+6.77e-01

Pos logit contributions
ator+0.078
man+0.061
umm+0.055
icles+0.053
ators+0.052
Neg logit contributions
bs-0.064
 net-0.047
 slips-0.046
 knot-0.045
ip-0.045
Token
Token ID220
Feature activation-4.64e-01

Pos logit contributions
ator+1.664
man+1.309
umm+1.242
icles+1.174
ators+1.166
Neg logit contributions
bs-1.117
 net-0.782
 slips-0.777
 knot-0.756
ip-0.745
Token
Token ID198
Feature activation+8.65e-01

Pos logit contributions
bs+0.797
 net+0.590
 slips+0.575
 knot+0.573
 loses+0.555
Neg logit contributions
ator-1.093
man-0.859
umm-0.780
icles-0.758
omm-0.742
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
ator+2.241
man+1.785
umm+1.725
icles+1.630
ators+1.612
Neg logit contributions
bs-1.301
 net-0.861
 slips-0.858
 knot-0.833
ip-0.830
Token he
Token ID339
Feature activation+4.46e-01

Pos logit contributions
ator+3.181
man+2.470
umm+2.405
icles+2.259
ators+2.257
Neg logit contributions
bs-2.058
 slips-1.451
 net-1.450
 loses-1.388
-1.375
Token promised
Token ID8072
Feature activation+1.22e+00

Pos logit contributions
ator+1.074
man+0.849
umm+0.794
icles+0.754
ators+0.746
Neg logit contributions
bs-0.744
 net-0.527
 slips-0.525
 knot-0.510
ip-0.500
Token to
Token ID284
Feature activation+1.38e+00

Pos logit contributions
ator+2.919
man+2.258
umm+2.199
ators+2.052
icles+2.022
Neg logit contributions
bs-2.073
 net-1.491
 slips-1.476
-1.431
 tie-1.429
Token never
Token ID1239
Feature activation-5.09e-01

Pos logit contributions
ator+2.904
man+2.162
umm+2.094
ators+1.918
icles+1.918
Neg logit contributions
bs-2.736
 net-2.123
 tie-2.096
 slips-2.091
 knot-2.072
Token attach
Token ID10199
Feature activation+1.92e-02

Pos logit contributions
bs+0.961
ip+0.699
ata+0.694
 net+0.688
 slips+0.685
Neg logit contributions
ator-1.104
man-0.869
umm-0.795
icles-0.748
 birth-0.737
Token a
Token ID257
Feature activation-3.78e+00

Pos logit contributions
ator+0.043
man+0.034
umm+0.031
icles+0.030
ators+0.029
Neg logit contributions
bs-0.035
 slips-0.025
 net-0.025
 knot-0.025
ata-0.025
Token net
Token ID2010
Feature activation+6.61e-02

Pos logit contributions
bs+5.547
ird+4.156
ip+3.909
ata+3.855
azz+3.626
Neg logit contributions
ator-9.805
man-9.193
 us-7.999
 me-7.690
umm-7.296
Token to
Token ID284
Feature activation-3.68e-01

Pos logit contributions
ator+0.133
man+0.101
umm+0.091
icles+0.086
ators+0.084
Neg logit contributions
bs-0.134
 slips-0.102
 net-0.102
 knot-0.100
ip-0.099
Token someone
Token ID2130
Feature activation-9.04e-01

Pos logit contributions
bs+0.687
 slips+0.503
ip+0.489
ata+0.487
 net+0.486
Neg logit contributions
ator-0.826
man-0.638
umm-0.581
icles-0.571
ators-0.549
Token again
Token ID757
Feature activation-6.86e-01

Pos logit contributions
bs+1.507
 net+1.064
 slips+1.061
 knot+1.051
ata+1.040
Neg logit contributions
ator-2.133
man-1.705
umm-1.530
ators-1.495
 birth-1.484
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
bs+1.234
 net+0.893
 knot+0.892
 slips+0.887
ata+0.879
Neg logit contributions
ator-1.552
man-1.223
umm-1.115
ators-1.041
icles-1.041
Token dad
Token ID9955
Feature activation-3.21e-01

Pos logit contributions
bs+1.179
 slips+0.897
ata+0.895
 loses+0.873
 net+0.864
Neg logit contributions
ator-1.138
man-0.873
umm-0.777
 birth-0.747
ators-0.719
Token if
Token ID611
Feature activation+2.27e-01

Pos logit contributions
bs+0.477
 net+0.332
 slips+0.330
 knot+0.324
 loop+0.317
Neg logit contributions
ator-0.820
man-0.670
umm-0.605
icles-0.596
ators-0.588
Token he
Token ID339
Feature activation-2.52e-01

Pos logit contributions
ator+0.510
man+0.398
umm+0.365
icles+0.348
ators+0.343
Neg logit contributions
bs-0.415
 slips-0.305
 net-0.304
 knot-0.299
ata-0.294
Token could
Token ID714
Feature activation-5.83e-01

Pos logit contributions
bs+0.437
 net+0.327
 slips+0.317
 knot+0.314
 loop+0.307
Neg logit contributions
ator-0.574
man-0.456
umm-0.414
icles-0.394
ators-0.390
Token take
Token ID1011
Feature activation-6.57e-01

Pos logit contributions
bs+1.018
 net+0.743
 slips+0.737
ata+0.721
 knot+0.714
Neg logit contributions
ator-1.340
man-1.052
umm-0.948
icles-0.929
 birth-0.897
Token a
Token ID257
Feature activation-3.73e+00

Pos logit contributions
bs+1.163
 slips+0.880
 net+0.834
ata+0.825
 loses+0.823
Neg logit contributions
ator-1.495
man-1.151
umm-1.073
 him-1.052
icles-1.045
Token seat
Token ID5852
Feature activation-2.88e-01

Pos logit contributions
bs+7.090
 slips+6.108
 loses+5.832
imes+5.712
 pads+5.638
Neg logit contributions
ator-7.161
man-6.026
 me-5.254
 us-5.193
 birth-5.020
Token under
Token ID739
Feature activation+5.79e-02

Pos logit contributions
bs+0.528
 net+0.391
 slips+0.391
 knot+0.378
 loses+0.373
Neg logit contributions
ator-0.643
man-0.507
umm-0.456
icles-0.443
ators-0.436
Token the
Token ID262
Feature activation-3.80e-01

Pos logit contributions
ator+0.131
man+0.101
umm+0.094
icles+0.091
ators+0.089
Neg logit contributions
bs-0.105
 slips-0.077
 net-0.075
 knot-0.074
ip-0.074
Token shady
Token ID36135
Feature activation-1.29e-01

Pos logit contributions
bs+0.695
 slips+0.492
ata+0.481
ip+0.481
 net+0.475
Neg logit contributions
ator-0.871
man-0.681
umm-0.639
icles-0.603
ators-0.588
Token tree
Token ID5509
Feature activation-1.10e+00

Pos logit contributions
bs+0.221
 slips+0.154
 loses+0.149
 net+0.148
ata+0.147
Neg logit contributions
ator-0.312
man-0.247
umm-0.228
icles-0.217
ators-0.213
Token green
Token ID4077
Feature activation-9.67e-01

Pos logit contributions
bs+3.110
 loses+2.194
ip+2.188
ata+2.185
 slips+2.149
Neg logit contributions
ator-3.437
man-2.821
umm-2.421
 birth-2.273
ators-2.204
Token rope
Token ID17182
Feature activation-7.44e-01

Pos logit contributions
bs+1.736
 loses+1.192
ata+1.169
 slips+1.160
ip+1.152
Neg logit contributions
ator-2.304
man-1.875
umm-1.705
icles-1.617
ators-1.557
Token!
Token ID0
Feature activation-1.14e-01

Pos logit contributions
bs+1.409
ip+0.983
 slips+0.973
 loses+0.966
 net+0.963
Neg logit contributions
ator-1.670
man-1.352
umm-1.170
icles-1.145
ators-1.122
Token He
Token ID679
Feature activation-5.83e-01

Pos logit contributions
bs+0.253
 net+0.195
 slips+0.194
 knot+0.191
ip+0.191
Neg logit contributions
ator-0.212
man-0.158
umm-0.144
icles-0.136
ators-0.130
Token took
Token ID1718
Feature activation-3.25e-01

Pos logit contributions
bs+1.112
 net+0.841
 slips+0.835
 loses+0.808
ata+0.805
Neg logit contributions
ator-1.225
man-0.957
umm-0.835
icles-0.809
 birth-0.808
Token the
Token ID262
Feature activation-3.71e+00

Pos logit contributions
bs+0.530
 slips+0.375
 net+0.373
 loses+0.361
 knot+0.356
Neg logit contributions
ator-0.789
man-0.625
umm-0.580
icles-0.569
 him-0.548
Token rope
Token ID17182
Feature activation-9.46e-01

Pos logit contributions
bs+5.515
azz+4.022
 loses+3.680
ip+3.369
ata+3.329
Neg logit contributions
ator-9.488
man-8.135
 us-7.353
 me-7.320
icles-7.248
Token and
Token ID290
Feature activation-2.15e+00

Pos logit contributions
bs+1.798
 slips+1.308
 loses+1.287
ip+1.275
 net+1.257
Neg logit contributions
ator-2.083
man-1.677
icles-1.445
umm-1.436
esh-1.380
Token ran
Token ID4966
Feature activation+5.82e-01

Pos logit contributions
bs+4.265
ume+3.350
 loses+3.328
 slips+3.322
 net+3.309
Neg logit contributions
ator-4.124
man-3.232
 him-2.918
icles-2.904
esh-2.806
Token to
Token ID284
Feature activation-8.28e-01

Pos logit contributions
ator+1.304
man+1.006
umm+0.942
icles+0.886
ators+0.876
Neg logit contributions
bs-1.072
 net-0.790
 slips-0.787
 knot-0.768
ip-0.754
Token the
Token ID262
Feature activation-2.41e+00

Pos logit contributions
bs+1.534
ip+1.092
 slips+1.080
ata+1.071
 net+1.053
Neg logit contributions
ator-1.902
man-1.457
 him-1.336
icles-1.307
umm-1.289
Token."
Token ID526
Feature activation-1.29e+00

Pos logit contributions
bs+1.557
 net+1.169
 slips+1.149
ip+1.121
 knot+1.120
Neg logit contributions
ator-1.850
man-1.407
icles-1.266
umm-1.265
ators-1.237
Token
Token ID198
Feature activation-1.22e+00

Pos logit contributions
bs+2.465
 knot+1.947
ata+1.926
 loop+1.892
 net+1.891
Neg logit contributions
ator-2.552
man-1.961
umm-1.856
 him-1.849
omm-1.754
Token
Token ID198
Feature activation-1.86e+00

Pos logit contributions
bs+2.295
 knot+1.756
 net+1.741
ata+1.712
 slips+1.710
Neg logit contributions
ator-2.569
man-2.003
 him-1.787
icles-1.698
umm-1.673
TokenShe
Token ID3347
Feature activation-1.07e+00

Pos logit contributions
bs+2.865
ata+1.972
ip+1.966
ver+1.929
 loses+1.919
Neg logit contributions
ator-4.545
man-3.709
 him-3.337
 us-3.305
umm-3.257
Token threw
Token ID9617
Feature activation-1.47e+00

Pos logit contributions
bs+1.818
 net+1.299
 slips+1.275
 loses+1.257
 knot+1.250
Neg logit contributions
ator-2.475
man-2.043
icles-1.731
umm-1.718
ators-1.714
Token the
Token ID262
Feature activation-3.70e+00

Pos logit contributions
bs+2.759
op+2.037
 slips+2.030
ata+1.998
ip+1.971
Neg logit contributions
ator-3.049
man-2.426
 him-2.348
icles-2.272
umm-2.115
Token rope
Token ID17182
Feature activation-7.22e-01

Pos logit contributions
bs+5.357
azz+3.505
ird+3.413
 loses+3.259
ata+3.144
Neg logit contributions
ator-9.801
man-8.747
 us-7.482
icles-7.463
esh-7.195
Token into
Token ID656
Feature activation-1.69e+00

Pos logit contributions
bs+1.419
 net+1.053
 slips+1.047
ip+1.035
ata+1.030
Neg logit contributions
ator-1.486
man-1.190
umm-1.014
icles-1.010
ators-0.964
Token the
Token ID262
Feature activation-1.83e+00

Pos logit contributions
bs+3.433
 slips+2.618
ip+2.587
 loses+2.459
 pads+2.457
Neg logit contributions
ator-3.545
man-2.574
 him-2.515
icles-2.418
 himself-2.352
Token hole
Token ID7604
Feature activation-2.39e-01

Pos logit contributions
bs+3.077
ver+2.169
ip+2.083
ata+2.023
azz+1.910
Neg logit contributions
ator-4.574
man-3.827
umm-3.451
ators-3.204
 us-3.178
Token and
Token ID290
Feature activation-2.00e+00

Pos logit contributions
bs+0.394
 net+0.277
 slips+0.273
 knot+0.272
ip+0.268
Neg logit contributions
ator-0.575
man-0.462
umm-0.420
icles-0.408
ators-0.397
Token tied
Token ID8165
Feature activation-5.93e-01

Pos logit contributions
bs+2.319
 net+1.780
 slips+1.756
 loses+1.697
 knot+1.687
Neg logit contributions
ator-2.310
man-1.829
umm-1.493
ators-1.477
 birth-1.444
Token the
Token ID262
Feature activation-3.40e+00

Pos logit contributions
bs+1.032
 slips+0.744
 net+0.731
 loses+0.728
ata+0.720
Neg logit contributions
ator-1.365
man-1.092
icles-0.994
umm-0.972
 him-0.944
Token other
Token ID584
Feature activation-2.75e+00

Pos logit contributions
bs+4.955
azz+3.096
ip+2.902
 loses+2.891
ata+2.875
Neg logit contributions
ator-8.894
man-8.044
 me-6.849
 next-6.809
ators-6.767
Token end
Token ID886
Feature activation-1.40e+00

Pos logit contributions
bs+5.262
ip+3.717
 loses+3.714
ata+3.668
azz+3.650
Neg logit contributions
ator-5.882
man-4.964
ators-4.350
icles-4.314
umm-4.311
Token of
Token ID286
Feature activation-1.83e+00

Pos logit contributions
bs+2.172
 loses+1.583
 net+1.562
 slips+1.516
ata+1.506
Neg logit contributions
ator-3.429
man-2.836
umm-2.522
icles-2.497
lo-2.356
Token the
Token ID262
Feature activation-3.66e+00

Pos logit contributions
bs+3.890
 loses+2.990
 slips+2.966
ata+2.867
ip+2.703
Neg logit contributions
ator-3.720
man-2.880
 him-2.639
icles-2.612
ators-2.417
Token rope
Token ID17182
Feature activation-6.25e-01

Pos logit contributions
bs+5.586
azz+3.764
ver+3.501
ata+3.424
aves+3.367
Neg logit contributions
ator-9.409
man-8.228
 next-7.619
 me-7.339
 us-7.324
Token to
Token ID284
Feature activation-1.11e+00

Pos logit contributions
bs+1.028
 net+0.725
 slips+0.718
 loses+0.704
 knot+0.696
Neg logit contributions
ator-1.491
man-1.212
umm-1.083
icles-1.059
ators-1.029
Token her
Token ID607
Feature activation-1.67e+00

Pos logit contributions
bs+2.050
 slips+1.510
 net+1.467
 loses+1.458
ip+1.453
Neg logit contributions
ator-2.502
man-1.940
 him-1.733
umm-1.732
icles-1.725
Token hand
Token ID1021
Feature activation-8.31e-01

Pos logit contributions
bs+3.458
ata+2.622
 slips+2.554
ching+2.467
ip+2.462
Neg logit contributions
ator-3.404
man-2.673
umm-2.323
 birth-2.229
ators-2.176
Token,
Token ID11
Feature activation-8.02e-01

Pos logit contributions
bs+1.353
 net+0.978
 slips+0.973
 loses+0.950
 knot+0.926
Neg logit contributions
ator-1.966
man-1.643
icles-1.441
umm-1.437
ators-1.373
Token
Token ID198
Feature activation-5.50e-01

Pos logit contributions
bs+0.985
 net+0.759
 knot+0.729
 slips+0.725
 loses+0.706
Neg logit contributions
ator-1.317
man-1.027
umm-0.921
icles-0.915
ators-0.889
Token
Token ID198
Feature activation-1.17e+00

Pos logit contributions
bs+0.971
 net+0.741
 slips+0.727
 knot+0.705
 loses+0.693
Neg logit contributions
ator-1.246
man-0.979
umm-0.873
icles-0.843
 birth-0.830
TokenL
Token ID43
Feature activation-8.73e-01

Pos logit contributions
bs+2.138
 net+1.681
 slips+1.662
 loses+1.599
ip+1.531
Neg logit contributions
ator-2.538
man-1.973
umm-1.808
 birth-1.703
omm-1.681
Tokenily
Token ID813
Feature activation-7.44e-01

Pos logit contributions
bs+1.876
 net+1.510
 slips+1.470
 knot+1.417
 loses+1.409
Neg logit contributions
ator-1.758
man-1.215
umm-1.114
ators-1.062
icles-1.061
Token took
Token ID1718
Feature activation-1.74e+00

Pos logit contributions
bs+1.414
 net+1.056
 slips+1.049
 loses+1.012
 knot+1.005
Neg logit contributions
ator-1.583
man-1.248
umm-1.074
esh-1.047
ators-1.046
Token the
Token ID262
Feature activation-3.58e+00

Pos logit contributions
bs+3.244
 slips+2.638
 net+2.589
 loses+2.588
ata+2.468
Neg logit contributions
ator-3.623
man-2.642
 him-2.623
icles-2.510
umm-2.459
Token rope
Token ID17182
Feature activation-1.11e+00

Pos logit contributions
bs+5.260
 loses+3.597
azz+3.546
ird+3.084
aves+2.990
Neg logit contributions
ator-9.590
man-8.268
 us-7.149
icles-7.132
 me-7.068
Token and
Token ID290
Feature activation-2.17e+00

Pos logit contributions
bs+2.116
 slips+1.602
 loses+1.570
ip+1.509
 knot+1.507
Neg logit contributions
ator-2.428
man-1.941
icles-1.659
umm-1.651
ators-1.578
Token started
Token ID2067
Feature activation-1.48e+00

Pos logit contributions
bs+4.269
ching+3.495
 loses+3.458
 slips+3.437
ume+3.433
Neg logit contributions
ator-4.098
man-3.175
 him-2.847
icles-2.782
ators-2.708
Token to
Token ID284
Feature activation-1.53e+00

Pos logit contributions
bs+3.051
 slips+2.372
ata+2.354
 loses+2.343
 net+2.293
Neg logit contributions
ator-2.832
man-2.177
icles-1.887
 birth-1.871
 him-1.859
Token swing
Token ID9628
Feature activation-1.29e-02

Pos logit contributions
bs+2.719
 slips+1.956
 loses+1.952
ata+1.952
ip+1.942
Neg logit contributions
ator-3.506
man-2.733
umm-2.484
esh-2.467
icles-2.445
Token They
Token ID1119
Feature activation-8.42e-01

Pos logit contributions
bs+0.470
 slips+0.349
 net+0.348
 knot+0.342
 loses+0.341
Neg logit contributions
ator-0.558
man-0.435
umm-0.403
icles-0.385
 birth-0.371
Token tried
Token ID3088
Feature activation-1.16e+00

Pos logit contributions
bs+1.675
 slips+1.327
 loses+1.283
 net+1.278
 knot+1.266
Neg logit contributions
ator-1.678
man-1.283
icles-1.118
umm-1.108
ators-1.091
Token to
Token ID284
Feature activation-7.43e-01

Pos logit contributions
bs+2.402
 slips+1.814
ata+1.776
 knot+1.763
 net+1.762
Neg logit contributions
ator-2.318
man-1.797
icles-1.540
 him-1.530
 birth-1.494
Token unt
Token ID1418
Feature activation-4.30e-01

Pos logit contributions
bs+1.158
ip+0.794
 slips+0.785
ata+0.736
 net+0.733
Neg logit contributions
ator-1.895
man-1.514
umm-1.382
icles-1.359
ators-1.329
Tokenie
Token ID494
Feature activation-1.23e+00

Pos logit contributions
bs+1.046
 net+0.855
 slips+0.851
ata+0.832
 knot+0.831
Neg logit contributions
ator-0.696
man-0.462
umm-0.396
icles-0.393
ators-0.380
Token the
Token ID262
Feature activation-3.56e+00

Pos logit contributions
bs+2.366
 slips+1.815
 net+1.783
 loses+1.728
ata+1.727
Neg logit contributions
ator-2.601
man-2.008
icles-1.825
 him-1.749
umm-1.690
Token knot
Token ID29654
Feature activation-6.16e-01

Pos logit contributions
bs+5.275
azz+3.477
ird+3.069
aves+2.964
ip+2.957
Neg logit contributions
ator-9.459
man-8.099
 us-7.432
 me-7.411
umm-7.295
Token,
Token ID11
Feature activation-1.75e+00

Pos logit contributions
bs+1.032
 slips+0.732
 net+0.732
ip+0.705
 knot+0.705
Neg logit contributions
ator-1.457
man-1.176
umm-1.039
icles-1.033
ators-0.997
Token but
Token ID475
Feature activation-6.50e-01

Pos logit contributions
bs+3.382
ip+2.462
 slips+2.442
ata+2.413
 net+2.368
Neg logit contributions
ator-3.644
man-2.810
icles-2.645
 him-2.617
 us-2.484
Token it
Token ID340
Feature activation-3.00e-02

Pos logit contributions
bs+1.056
 slips+0.756
ata+0.751
 net+0.745
ip+0.725
Neg logit contributions
ator-1.539
man-1.230
icles-1.100
umm-1.092
ators-1.068
Token was
Token ID373
Feature activation-6.99e-01

Pos logit contributions
bs+0.062
 net+0.047
 slips+0.046
 knot+0.045
ip+0.045
Neg logit contributions
ator-0.060
man-0.046
umm-0.041
icles-0.039
ators-0.038
Token his
Token ID465
Feature activation-2.29e+00

Pos logit contributions
ator+1.649
man+1.307
umm+1.240
icles+1.170
ators+1.159
Neg logit contributions
bs-1.055
 net-0.738
 slips-0.732
 knot-0.713
 loop-0.696
Token bag
Token ID6131
Feature activation-5.98e-01

Pos logit contributions
bs+4.408
ata+3.493
ver+3.260
ching+3.170
ip+3.046
Neg logit contributions
ator-4.907
man-3.992
 us-3.358
ators-3.277
 birth-3.264
Token and
Token ID290
Feature activation-1.67e+00

Pos logit contributions
bs+0.908
 slips+0.615
 net+0.605
 knot+0.601
 loses+0.598
Neg logit contributions
ator-1.538
man-1.285
umm-1.128
icles-1.127
ators-1.087
Token took
Token ID1718
Feature activation+6.22e-02

Pos logit contributions
bs+3.036
 knot+2.376
ching+2.342
op+2.318
 slips+2.278
Neg logit contributions
ator-3.453
man-2.797
ators-2.426
icles-2.400
 him-2.371
Token out
Token ID503
Feature activation-5.99e-01

Pos logit contributions
ator+0.150
man+0.119
umm+0.111
icles+0.107
ators+0.105
Neg logit contributions
bs-0.102
 slips-0.072
 net-0.072
 knot-0.070
 loses-0.069
Token a
Token ID257
Feature activation-3.54e+00

Pos logit contributions
bs+1.053
 slips+0.796
 loses+0.762
 net+0.761
 knot+0.756
Neg logit contributions
ator-1.375
man-1.080
icles-0.982
ators-0.939
umm-0.938
Token jar
Token ID17379
Feature activation-9.48e-01

Pos logit contributions
bs+6.757
 loses+5.635
ver+5.250
 pads+5.212
ip+5.088
Neg logit contributions
ator-7.359
man-6.598
 guest-5.011
 birth-4.918
 us-4.915
Token.
Token ID13
Feature activation-8.20e-01

Pos logit contributions
bs+1.607
 slips+1.202
 net+1.164
 knot+1.162
 loses+1.159
Neg logit contributions
ator-2.224
man-1.834
umm-1.586
icles-1.573
ators-1.477
Token He
Token ID679
Feature activation-4.64e-01

Pos logit contributions
bs+1.947
 net+1.549
 knot+1.536
 slips+1.525
ata+1.506
Neg logit contributions
ator-1.374
man-0.998
umm-0.885
icles-0.853
omm-0.779
Token said
Token ID531
Feature activation-5.57e-01

Pos logit contributions
bs+0.792
 net+0.564
 slips+0.562
 knot+0.554
 loses+0.543
Neg logit contributions
ator-1.075
man-0.861
umm-0.766
icles-0.743
ators-0.739
Token it
Token ID340
Feature activation+7.11e-01

Pos logit contributions
bs+0.884
 knot+0.628
ata+0.623
 slips+0.622
 net+0.620
Neg logit contributions
ator-1.347
man-1.103
icles-0.979
umm-0.975
 birth-0.946
Token.
Token ID13
Feature activation-6.52e-01

Pos logit contributions
ator+3.810
man+3.043
umm+2.942
ators+2.780
icles+2.771
Neg logit contributions
bs-2.066
 net-1.372
 slips-1.342
 loses-1.291
 loop-1.273
Token They
Token ID1119
Feature activation-3.02e-01

Pos logit contributions
bs+1.280
 knot+1.001
 slips+0.985
 net+0.983
ip+0.973
Neg logit contributions
ator-1.330
man-1.020
umm-0.940
icles-0.888
 birth-0.881
Token decide
Token ID5409
Feature activation-8.64e-01

Pos logit contributions
bs+0.510
 net+0.369
 slips+0.368
ip+0.358
ata+0.356
Neg logit contributions
ator-0.693
man-0.543
umm-0.500
icles-0.476
 birth-0.471
Token to
Token ID284
Feature activation+7.73e-01

Pos logit contributions
bs+1.744
 slips+1.385
 net+1.354
ata+1.349
 knot+1.347
Neg logit contributions
ator-1.742
man-1.378
 birth-1.165
icles-1.154
 him-1.128
Token find
Token ID1064
Feature activation-1.56e+00

Pos logit contributions
ator+1.677
man+1.282
umm+1.209
icles+1.131
ators+1.113
Neg logit contributions
bs-1.466
 net-1.098
 slips-1.092
 knot-1.076
 loop-1.062
Token another
Token ID1194
Feature activation-3.54e+00

Pos logit contributions
bs+2.632
 slips+1.972
op+1.965
ip+1.958
ata+1.956
Neg logit contributions
ator-3.661
man-2.883
 him-2.691
icles-2.633
 us-2.579
Token game
Token ID983
Feature activation+1.19e-01

Pos logit contributions
bs+8.636
ume+6.546
azz+6.523
 loses+6.465
ip+6.329
Neg logit contributions
ator-6.707
man-5.617
 next-4.462
umm-4.183
 us-4.165
Token to
Token ID284
Feature activation-1.66e+00

Pos logit contributions
ator+0.285
man+0.226
umm+0.207
icles+0.199
ators+0.197
Neg logit contributions
bs-0.199
 slips-0.141
 net-0.141
 knot-0.139
ip-0.136
Token play
Token ID711
Feature activation-1.17e+00

Pos logit contributions
bs+3.635
ip+2.819
 slips+2.740
ata+2.709
op+2.629
Neg logit contributions
ator-3.290
man-2.355
 him-2.061
 us-2.029
ators-2.004
Token.
Token ID13
Feature activation-9.55e-01

Pos logit contributions
bs+2.407
 slips+1.799
ip+1.772
ata+1.698
 loses+1.689
Neg logit contributions
ator-2.473
man-1.895
ators-1.621
 him-1.576
umm-1.567
Token They
Token ID1119
Feature activation-1.20e-01

Pos logit contributions
bs+1.967
 knot+1.540
 net+1.529
ip+1.523
 slips+1.505
Neg logit contributions
ator-1.848
man-1.421
umm-1.289
icles-1.218
 birth-1.207
Token't
Token ID470
Feature activation-8.28e-01

Pos logit contributions
bs+3.702
 net+3.095
 knot+3.046
 slips+2.988
 loop+2.961
Neg logit contributions
ator-1.451
man-0.825
icles-0.575
umm-0.538
 him-0.524
Token be
Token ID307
Feature activation-1.40e+00

Pos logit contributions
bs+1.379
 net+0.985
ip+0.983
 slips+0.968
 knot+0.935
Neg logit contributions
ator-1.976
man-1.610
icles-1.392
umm-1.389
 him-1.364
Token able
Token ID1498
Feature activation-4.81e-01

Pos logit contributions
bs+2.262
 slips+1.568
 knot+1.550
 net+1.550
ip+1.522
Neg logit contributions
ator-3.455
man-2.783
ators-2.454
 him-2.406
ping-2.401
Token to
Token ID284
Feature activation-1.25e+00

Pos logit contributions
bs+0.930
 slips+0.698
 net+0.687
ip+0.687
 knot+0.685
Neg logit contributions
ator-1.000
man-0.766
umm-0.694
 birth-0.671
icles-0.668
Token get
Token ID651
Feature activation-5.49e-01

Pos logit contributions
bs+2.370
ip+1.729
 slips+1.675
ata+1.668
 net+1.612
Neg logit contributions
ator-2.845
man-2.177
 him-1.960
icles-1.950
umm-1.904
Token a
Token ID257
Feature activation-3.52e+00

Pos logit contributions
bs+0.979
ata+0.738
 slips+0.726
 knot+0.720
 net+0.714
Neg logit contributions
ator-1.206
man-0.937
icles-0.848
umm-0.847
 him-0.832
Token good
Token ID922
Feature activation-2.04e+00

Pos logit contributions
bs+7.847
ata+6.106
imes+6.030
ems+5.826
ip+5.665
Neg logit contributions
ator-6.498
man-5.837
umm-4.624
 Driver-4.352
 us-4.247
Token shot
Token ID2823
Feature activation-1.24e+00

Pos logit contributions
bs+4.399
ata+3.433
ip+3.280
 loses+3.231
 slips+3.213
Neg logit contributions
ator-3.948
man-3.145
umm-2.892
icles-2.517
 birth-2.508
Token.
Token ID13
Feature activation+8.06e-01

Pos logit contributions
bs+2.316
 slips+1.673
ip+1.637
 net+1.634
ata+1.595
Neg logit contributions
ator-2.760
man-2.237
icles-1.954
umm-1.939
ators-1.883
Token
Token ID198
Feature activation+4.24e-01

Pos logit contributions
ator+2.003
man+1.591
umm+1.489
icles+1.415
ators+1.412
Neg logit contributions
bs-1.303
 net-0.906
 slips-0.906
 knot-0.869
ip-0.862
Token
Token ID198
Feature activation+2.92e-01

Pos logit contributions
ator+1.058
man+0.842
umm+0.791
icles+0.754
ators+0.745
Neg logit contributions
bs-0.668
 net-0.465
 slips-0.463
 knot-0.450
 loses-0.440
Token he
Token ID339
Feature activation-5.76e-01

Pos logit contributions
ator+1.845
man+1.420
umm+1.350
icles+1.269
ators+1.257
Neg logit contributions
bs-1.448
 net-1.047
 slips-1.039
 knot-1.019
 tie-1.001
Token replied
Token ID8712
Feature activation+1.91e-01

Pos logit contributions
bs+0.909
 slips+0.663
 net+0.661
 loses+0.635
 knot+0.614
Neg logit contributions
ator-1.400
man-1.174
umm-1.020
icles-0.984
omm-0.978
Token.
Token ID13
Feature activation+6.87e-01

Pos logit contributions
ator+0.461
man+0.365
umm+0.338
icles+0.323
ators+0.318
Neg logit contributions
bs-0.318
 slips-0.229
 net-0.228
 knot-0.221
 loses-0.217
Token He
Token ID679
Feature activation-9.87e-01

Pos logit contributions
ator+1.651
man+1.295
umm+1.224
icles+1.159
ators+1.149
Neg logit contributions
bs-1.153
 net-0.816
 slips-0.815
 knot-0.791
ip-0.778
Token took
Token ID1718
Feature activation+4.08e-01

Pos logit contributions
bs+1.829
 slips+1.376
 net+1.373
 loses+1.359
 knot+1.286
Neg logit contributions
ator-2.150
man-1.747
umm-1.452
esh-1.414
ators-1.410
Token a
Token ID257
Feature activation-3.51e+00

Pos logit contributions
ator+1.052
man+0.843
umm+0.796
icles+0.760
ators+0.750
Neg logit contributions
bs-0.613
 slips-0.416
 net-0.416
 knot-0.401
 loses-0.393
Token long
Token ID890
Feature activation-2.44e+00

Pos logit contributions
bs+6.655
 loses+5.715
 slips+5.322
imes+5.259
 pads+5.061
Neg logit contributions
ator-7.107
man-6.282
 me-4.876
 us-4.543
 mantle-4.540
Token piece
Token ID3704
Feature activation-8.93e-01

Pos logit contributions
bs+4.390
 loses+3.529
 slips+3.289
ata+3.033
ip+2.910
Neg logit contributions
ator-5.494
man-4.465
esh-3.797
icles-3.639
zie-3.631
Token of
Token ID286
Feature activation-2.69e+00

Pos logit contributions
bs+1.376
 slips+0.965
 loses+0.941
 net+0.914
ip+0.899
Neg logit contributions
ator-2.247
man-1.881
umm-1.678
icles-1.634
 him-1.569
Token string
Token ID4731
Feature activation-9.62e-01

Pos logit contributions
bs+4.314
 loses+3.443
 slips+3.402
azz+3.315
ver+3.277
Neg logit contributions
ator-6.322
man-5.033
 him-4.820
icles-4.733
 us-4.671
Token and
Token ID290
Feature activation-2.53e+00

Pos logit contributions
bs+1.596
 slips+1.134
 loses+1.110
 net+1.070
ata+1.066
Neg logit contributions
ator-2.342
man-1.935
umm-1.689
icles-1.686
 him-1.601
Token fly
Token ID6129
Feature activation-1.44e+00

Pos logit contributions
bs+3.474
ip+2.739
op+2.515
ata+2.515
ver+2.509
Neg logit contributions
ator-3.546
man-2.697
umm-2.242
 birth-2.223
 us-2.192
Token.
Token ID13
Feature activation-7.14e-01

Pos logit contributions
bs+2.696
 loses+1.962
 slips+1.949
ip+1.941
ata+1.854
Neg logit contributions
ator-3.301
man-2.659
ators-2.195
icles-2.191
umm-2.186
Token They
Token ID1119
Feature activation-3.38e-01

Pos logit contributions
bs+1.481
 net+1.138
 knot+1.137
ata+1.127
ip+1.123
Neg logit contributions
ator-1.389
man-1.043
umm-0.987
icles-0.941
 birth-0.925
Token look
Token ID804
Feature activation-7.73e-01

Pos logit contributions
bs+0.536
ip+0.378
 net+0.374
ata+0.363
 slips+0.362
Neg logit contributions
ator-0.828
man-0.648
umm-0.600
ators-0.584
icles-0.578
Token at
Token ID379
Feature activation-1.19e-01

Pos logit contributions
bs+1.361
 net+1.030
 slips+1.029
ata+1.010
 knot+1.006
Neg logit contributions
ator-1.689
man-1.302
umm-1.167
 birth-1.166
 him-1.141
Token each
Token ID1123
Feature activation-3.50e+00

Pos logit contributions
bs+0.220
 slips+0.161
 net+0.159
ata+0.159
ip+0.157
Neg logit contributions
ator-0.270
man-0.207
umm-0.191
icles-0.188
ators-0.184
Token other
Token ID584
Feature activation-1.06e+00

Pos logit contributions
bs+8.508
 slips+7.027
ip+7.004
 loses+6.771
ver+6.671
Neg logit contributions
ator-6.088
man-4.484
 us-4.091
 him-3.690
 them-3.671
Token.
Token ID13
Feature activation-5.10e-01

Pos logit contributions
bs+1.889
 net+1.397
 slips+1.390
ip+1.350
ata+1.335
Neg logit contributions
ator-2.425
man-1.913
umm-1.699
icles-1.690
ators-1.666
Token They
Token ID1119
Feature activation-2.36e-01

Pos logit contributions
bs+1.025
 net+0.786
 knot+0.784
 slips+0.772
ata+0.768
Neg logit contributions
ator-1.027
man-0.778
umm-0.734
icles-0.698
 birth-0.688
Token smile
Token ID8212
Feature activation-3.26e-01

Pos logit contributions
bs+0.371
ip+0.259
 net+0.259
 slips+0.252
ata+0.251
Neg logit contributions
ator-0.580
man-0.456
umm-0.423
ators-0.409
icles-0.407
Token.
Token ID13
Feature activation-2.40e-01

Pos logit contributions
bs+0.529
 net+0.378
 slips+0.368
 knot+0.366
ip+0.358
Neg logit contributions
ator-0.779
man-0.634
umm-0.577
icles-0.551
 birth-0.550
Token was
Token ID373
Feature activation-6.02e-01

Pos logit contributions
bs+1.656
 slips+1.255
 net+1.247
 loses+1.212
 knot+1.199
Neg logit contributions
ator-1.712
man-1.365
umm-1.140
icles-1.111
ators-1.093
Token excited
Token ID6568
Feature activation+9.86e-01

Pos logit contributions
bs+1.205
 slips+0.883
ip+0.875
 net+0.870
 knot+0.859
Neg logit contributions
ator-1.289
man-1.002
umm-0.858
opher-0.831
ators-0.831
Token!
Token ID0
Feature activation-6.50e-01

Pos logit contributions
ator+2.709
man+2.181
umm+2.086
ators+1.989
icles+1.985
Neg logit contributions
bs-1.338
 net-0.866
 slips-0.850
 knot-0.822
 tie-0.806
Token He
Token ID679
Feature activation-7.71e-01

Pos logit contributions
bs+1.728
 net+1.395
ata+1.373
 knot+1.369
 slips+1.366
Neg logit contributions
ator-0.927
man-0.634
umm-0.587
icles-0.493
omm-0.485
Token grabbed
Token ID13176
Feature activation-1.59e+00

Pos logit contributions
bs+1.366
 slips+1.010
 net+0.986
 loses+0.980
ata+0.960
Neg logit contributions
ator-1.702
man-1.373
umm-1.183
 birth-1.142
icles-1.138
Token the
Token ID262
Feature activation-3.48e+00

Pos logit contributions
bs+3.287
 loses+2.608
 slips+2.590
ata+2.534
 knot+2.426
Neg logit contributions
ator-3.119
man-2.342
 him-2.206
icles-2.184
 himself-2.015
Token rope
Token ID17182
Feature activation-1.04e+00

Pos logit contributions
bs+4.946
azz+3.429
 loses+3.259
+3.048
ird+3.040
Neg logit contributions
ator-9.386
man-8.320
 us-7.040
 next-6.876
icles-6.874
Token and
Token ID290
Feature activation-2.11e+00

Pos logit contributions
bs+1.689
 slips+1.177
 loses+1.152
ata+1.109
ip+1.108
Neg logit contributions
ator-2.603
man-2.162
umm-1.896
icles-1.881
esh-1.820
Token began
Token ID2540
Feature activation-1.07e+00

Pos logit contributions
bs+4.156
ume+3.169
 loses+3.168
 slips+3.135
azz+3.134
Neg logit contributions
ator-4.126
man-3.359
esh-2.900
 him-2.855
icles-2.845
Token to
Token ID284
Feature activation-1.49e+00

Pos logit contributions
bs+2.165
ata+1.649
 slips+1.636
 loses+1.609
 knot+1.603
Neg logit contributions
ator-2.158
man-1.684
esh-1.449
umm-1.409
icles-1.404
Token swing
Token ID9628
Feature activation+2.49e-01

Pos logit contributions
bs+2.635
ip+1.923
ata+1.849
 slips+1.811
 loses+1.790
Neg logit contributions
ator-3.459
man-2.802
esh-2.476
umm-2.390
 him-2.369
Token
Token ID198
Feature activation+3.45e-01

Pos logit contributions
ator+0.417
man+0.329
umm+0.307
icles+0.292
ators+0.286
Neg logit contributions
bs-0.300
 net-0.216
 slips-0.215
 knot-0.211
 loses-0.206
Token
Token ID198
Feature activation+4.28e-01

Pos logit contributions
ator+0.856
man+0.682
umm+0.635
icles+0.609
ators+0.599
Neg logit contributions
bs-0.549
 net-0.383
 slips-0.383
 knot-0.372
 loses-0.363
TokenL
Token ID43
Feature activation+6.86e-01

Pos logit contributions
ator+0.951
man+0.734
umm+0.681
icles+0.642
ators+0.635
Neg logit contributions
bs-0.802
 net-0.593
 slips-0.593
 knot-0.576
 loses-0.568
Tokenulu
Token ID15712
Feature activation-3.31e-01

Pos logit contributions
ator+1.545
man+1.198
umm+1.120
icles+1.052
ators+1.044
Neg logit contributions
bs-1.262
 slips-0.925
 net-0.924
 knot-0.899
ip-0.887
Token took
Token ID1718
Feature activation-4.82e-01

Pos logit contributions
bs+0.584
 net+0.422
 slips+0.421
 knot+0.410
 loses+0.404
Neg logit contributions
ator-0.753
man-0.597
umm-0.529
icles-0.507
ators-0.507
Token a
Token ID257
Feature activation-3.47e+00

Pos logit contributions
bs+0.826
 slips+0.613
 net+0.596
 loses+0.593
 knot+0.584
Neg logit contributions
ator-1.108
man-0.877
umm-0.805
icles-0.784
 birth-0.763
Token deep
Token ID2769
Feature activation-2.80e-01

Pos logit contributions
bs+6.390
 loses+5.529
imes+5.211
 pads+5.090
ip+5.082
Neg logit contributions
ator-6.605
man-6.244
 me-4.699
 us-4.440
opher-4.329
Token breath
Token ID8033
Feature activation-1.58e+00

Pos logit contributions
bs+0.632
 slips+0.494
 loses+0.486
ata+0.486
 net+0.486
Neg logit contributions
ator-0.505
man-0.369
umm-0.321
icles-0.303
esh-0.295
Token and
Token ID290
Feature activation-2.88e+00

Pos logit contributions
bs+2.596
 slips+1.977
 net+1.911
 knot+1.869
 loses+1.853
Neg logit contributions
ator-3.707
man-3.048
 him-2.679
umm-2.640
icles-2.582
Token decided
Token ID3066
Feature activation-6.30e-01

Pos logit contributions
bs+5.411
op+4.573
ume+4.497
imes+4.389
 knot+4.278
Neg logit contributions
ator-5.350
man-4.488
 him-4.010
esh-3.865
 me-3.734
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
bs+1.226
 net+0.929
 slips+0.928
ata+0.918
 knot+0.906
Neg logit contributions
ator-1.302
man-1.017
umm-0.870
icles-0.867
 birth-0.851
Token it
Token ID340
Feature activation-7.91e-02

Pos logit contributions
bs+1.951
 net+1.363
 slips+1.352
ata+1.319
ip+1.287
Neg logit contributions
ator-2.489
man-1.882
 him-1.751
icles-1.717
 himself-1.694
Token again
Token ID757
Feature activation-9.53e-01

Pos logit contributions
bs+0.149
 net+0.109
 slips+0.108
 knot+0.104
ip+0.103
Neg logit contributions
ator-0.176
man-0.135
umm-0.125
icles-0.119
ators-0.117
Token.
Token ID13
Feature activation+3.49e-01

Pos logit contributions
bs+1.819
 slips+1.334
 net+1.314
ata+1.283
 knot+1.280
Neg logit contributions
ator-2.108
man-1.643
umm-1.495
ators-1.401
icles-1.393
Token He
Token ID679
Feature activation-6.00e-01

Pos logit contributions
ator+0.643
man+0.468
umm+0.428
icles+0.396
ators+0.385
Neg logit contributions
bs-0.786
 net-0.616
 slips-0.613
 knot-0.602
 loses-0.594
Token tried
Token ID3088
Feature activation-1.24e+00

Pos logit contributions
bs+1.069
 slips+0.760
 net+0.758
 loses+0.742
 knot+0.733
Neg logit contributions
ator-1.338
man-1.062
umm-0.938
ators-0.917
icles-0.916
Token another
Token ID1194
Feature activation-3.47e+00

Pos logit contributions
bs+2.572
ata+1.830
 net+1.819
 slips+1.814
 knot+1.766
Neg logit contributions
ator-2.535
man-1.944
 him-1.757
icles-1.697
 himself-1.663
Token,
Token ID11
Feature activation-2.19e+00

Pos logit contributions
bs+7.156
ip+5.172
ume+4.946
imes+4.801
ata+4.786
Neg logit contributions
ator-7.221
man-5.871
 me-5.408
umm-5.356
 himself-5.047
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
bs+4.281
op+2.981
ata+2.962
ip+2.853
ume+2.842
Neg logit contributions
ator-4.516
man-3.487
 him-3.341
umm-3.274
icles-3.265
Token another
Token ID1194
Feature activation-2.47e+00

Pos logit contributions
bs+3.741
 net+2.896
 knot+2.861
 slips+2.834
op+2.810
Neg logit contributions
ator-2.880
man-2.151
 him-1.896
icles-1.837
umm-1.769
Token.
Token ID13
Feature activation+3.22e-01

Pos logit contributions
bs+5.125
 slips+3.717
 loses+3.683
ip+3.681
imes+3.592
Neg logit contributions
ator-5.018
man-4.231
umm-3.676
 him-3.239
icles-3.178
Token He
Token ID679
Feature activation-8.87e-01

Pos logit contributions
ator+0.619
man+0.456
umm+0.421
icles+0.392
ators+0.381
Neg logit contributions
bs-0.696
 net-0.540
 slips-0.536
 knot-0.527
 loses-0.520
Token showed
Token ID3751
Feature activation+6.22e-01

Pos logit contributions
ator+0.428
man+0.340
umm+0.311
icles+0.297
ators+0.295
Neg logit contributions
bs-0.310
 net-0.222
 slips-0.221
 knot-0.214
 loses-0.211
Token them
Token ID606
Feature activation-4.24e-01

Pos logit contributions
ator+1.006
man+0.688
umm+0.621
icles+0.560
ators+0.550
Neg logit contributions
bs-1.531
 net-1.229
 slips-1.223
 knot-1.206
ip-1.191
Token how
Token ID703
Feature activation-1.57e+00

Pos logit contributions
bs+0.824
 slips+0.630
 net+0.613
ata+0.609
ip+0.609
Neg logit contributions
ator-0.888
man-0.673
icles-0.601
umm-0.600
 him-0.582
Token to
Token ID284
Feature activation-9.43e-01

Pos logit contributions
bs+3.059
ata+2.618
 net+2.461
 slips+2.453
 knot+2.441
Neg logit contributions
ator-2.910
man-2.211
 him-2.005
umm-1.915
 birth-1.915
Token snap
Token ID11495
Feature activation-1.44e+00

Pos logit contributions
bs+1.551
ip+1.069
 net+1.055
 slips+1.043
ata+1.021
Neg logit contributions
ator-2.363
man-1.863
icles-1.653
ators-1.642
umm-1.641
Token the
Token ID262
Feature activation-3.47e+00

Pos logit contributions
bs+2.647
 slips+2.016
ata+2.009
 net+1.969
 loses+1.938
Neg logit contributions
ator-3.095
man-2.388
icles-2.199
 him-2.186
ators-2.111
Token knot
Token ID29654
Feature activation-1.15e+00

Pos logit contributions
bs+4.695
azz+2.987
ip+2.842
ver+2.700
ata+2.686
Neg logit contributions
ator-9.398
man-7.791
 me-7.576
umm-7.523
 us-7.494
Token by
Token ID416
Feature activation-1.64e+00

Pos logit contributions
bs+2.085
 net+1.495
 slips+1.494
ip+1.487
 loses+1.457
Neg logit contributions
ator-2.611
man-2.100
umm-1.797
icles-1.793
ators-1.739
Token pulling
Token ID10427
Feature activation-2.04e+00

Pos logit contributions
bs+3.652
ip+2.749
 slips+2.714
ata+2.697
 loses+2.667
Neg logit contributions
ator-3.294
man-2.428
icles-2.146
umm-2.043
ping-2.025
Token it
Token ID340
Feature activation-6.66e-01

Pos logit contributions
bs+3.819
 slips+2.846
ata+2.819
 loses+2.775
 net+2.772
Neg logit contributions
ator-4.464
man-3.421
 him-3.316
icles-3.100
 us-3.086
Token really
Token ID1107
Feature activation-1.39e+00

Pos logit contributions
bs+1.271
 net+0.954
ip+0.931
 slips+0.929
ata+0.913
Neg logit contributions
ator-1.406
man-1.103
umm-0.990
icles-0.962
ators-0.916
Token day
Token ID1110
Feature activation-1.14e+00

Pos logit contributions
bs+2.561
 loses+2.126
 slips+2.115
 net+2.077
 knot+2.067
Neg logit contributions
ator-1.429
man-1.066
umm-0.796
icles-0.740
 birth-0.727
Token she
Token ID673
Feature activation-4.08e-01

Pos logit contributions
bs+1.924
ata+1.402
 net+1.397
 knot+1.385
 loses+1.365
Neg logit contributions
ator-2.552
man-2.088
icles-1.903
 him-1.887
umm-1.835
Token got
Token ID1392
Feature activation-6.96e-01

Pos logit contributions
bs+0.815
 net+0.612
 knot+0.605
 slips+0.600
ip+0.596
Neg logit contributions
ator-0.837
man-0.645
umm-0.557
ators-0.552
 birth-0.546
Token a
Token ID257
Feature activation-1.56e+00

Pos logit contributions
bs+1.134
 knot+0.831
 slips+0.813
ata+0.804
 net+0.798
Neg logit contributions
ator-1.616
man-1.296
icles-1.166
 birth-1.161
umm-1.156
Token little
Token ID1310
Feature activation-1.97e+00

Pos logit contributions
bs+2.967
 loses+2.100
ip+2.099
 slips+2.081
ata+2.081
Neg logit contributions
ator-3.351
man-2.777
umm-2.440
 birth-2.320
omm-2.233
Token too
Token ID1165
Feature activation-3.46e+00

Pos logit contributions
bs+3.786
ata+2.829
ip+2.745
ver+2.739
 slips+2.670
Neg logit contributions
ator-4.062
man-3.434
umm-3.224
 birth-2.761
icles-2.747
Token close
Token ID1969
Feature activation-6.09e-01

Pos logit contributions
bs+6.642
ata+6.010
 loses+5.902
azz+5.524
 slips+5.481
Neg logit contributions
ator-6.208
man-5.357
ind-4.584
 him-4.570
 birth-4.214
Token to
Token ID284
Feature activation-8.91e-01

Pos logit contributions
bs+1.116
 net+0.822
ata+0.802
 slips+0.799
 knot+0.795
Neg logit contributions
ator-1.336
man-1.059
umm-0.951
icles-0.936
ators-0.901
Token a
Token ID257
Feature activation-1.21e+00

Pos logit contributions
bs+1.671
ip+1.251
 slips+1.249
 net+1.232
ata+1.198
Neg logit contributions
ator-1.960
man-1.486
 him-1.333
icles-1.330
umm-1.329
Token pond
Token ID16723
Feature activation-4.10e-01

Pos logit contributions
bs+2.393
 slips+1.755
 loses+1.718
ip+1.708
ata+1.702
Neg logit contributions
ator-2.539
man-2.081
umm-1.799
 birth-1.708
icles-1.613
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
bs+0.704
 knot+0.497
ip+0.488
 slips+0.487
 net+0.486
Neg logit contributions
ator-0.957
man-0.777
umm-0.691
icles-0.674
ators-0.655
Token!
Token ID0
Feature activation-3.44e-01

Pos logit contributions
ator+0.727
man+0.574
umm+0.534
icles+0.510
ators+0.502
Neg logit contributions
bs-0.521
 slips-0.372
 net-0.372
 knot-0.362
 loses-0.355
Tokenâ
Token ID22940
Feature activation+5.61e-01

Pos logit contributions
bs+0.712
 slips+0.554
 net+0.551
 loses+0.535
 knot+0.531
Neg logit contributions
ator-0.672
man-0.503
umm-0.479
icles-0.446
omm-0.431
Token
Token ID26391
Feature activation-5.47e-01

Pos logit contributions
ator+1.035
man+0.748
umm+0.685
icles+0.634
ators+0.623
Neg logit contributions
bs-1.255
 net-0.981
 slips-0.980
 knot-0.962
ip-0.948
Token Her
Token ID2332
Feature activation+8.61e-01

Pos logit contributions
bs+0.936
 net+0.688
 knot+0.672
 slips+0.671
 tie+0.644
Neg logit contributions
ator-1.268
man-0.999
umm-0.923
icles-0.867
 birth-0.867
Token mom
Token ID1995
Feature activation+3.67e-01

Pos logit contributions
ator+1.959
man+1.510
umm+1.406
icles+1.340
ators+1.331
Neg logit contributions
bs-1.578
 net-1.169
 slips-1.159
 knot-1.136
ip-1.110
Token told
Token ID1297
Feature activation+2.13e+00

Pos logit contributions
ator+0.926
man+0.742
umm+0.694
icles+0.663
ators+0.656
Neg logit contributions
bs-0.567
 net-0.393
 slips-0.388
 knot-0.378
 loses-0.368
Token her
Token ID607
Feature activation-3.67e-01

Pos logit contributions
ator+4.496
man+3.447
umm+3.193
ators+2.899
icles+2.719
Neg logit contributions
bs-4.502
-3.501
 knot-3.387
 net-3.324
 slips-3.312
Token that
Token ID326
Feature activation-1.07e+00

Pos logit contributions
bs+0.626
 slips+0.461
 net+0.454
 knot+0.444
ata+0.439
Neg logit contributions
ator-0.859
man-0.681
umm-0.612
icles-0.601
 birth-0.587
Token she
Token ID673
Feature activation+4.24e-02

Pos logit contributions
bs+1.811
 knot+1.320
 slips+1.315
 net+1.275
ata+1.267
Neg logit contributions
ator-2.504
man-2.003
umm-1.837
icles-1.833
 birth-1.791
Token had
Token ID550
Feature activation-2.88e-01

Pos logit contributions
ator+0.095
man+0.075
umm+0.068
icles+0.065
ators+0.065
Neg logit contributions
bs-0.076
 net-0.057
 slips-0.056
 knot-0.055
 loop-0.054
Token to
Token ID284
Feature activation-3.68e-01

Pos logit contributions
bs+0.519
 slips+0.381
 knot+0.373
 net+0.372
ip+0.364
Neg logit contributions
ator-0.642
man-0.501
umm-0.456
 birth-0.447
icles-0.444
Token
Token ID198
Feature activation-5.01e-01

Pos logit contributions
bs+2.566
 net+2.058
 knot+1.959
 loses+1.897
 slips+1.871
Neg logit contributions
ator-3.142
man-2.380
umm-2.149
omm-2.144
icles-2.126
Token
Token ID198
Feature activation-6.87e-01

Pos logit contributions
bs+0.858
 slips+0.642
 net+0.639
 knot+0.627
 loses+0.627
Neg logit contributions
ator-1.161
man-0.908
umm-0.816
 birth-0.789
icles-0.788
TokenAfter
Token ID3260
Feature activation-3.99e-02

Pos logit contributions
bs+1.257
 slips+0.962
 net+0.943
 loses+0.941
 knot+0.919
Neg logit contributions
ator-1.516
man-1.168
umm-1.051
 birth-1.026
ators-0.993
Token she
Token ID673
Feature activation+5.67e-01

Pos logit contributions
bs+0.066
 slips+0.048
 net+0.047
 knot+0.046
 loses+0.045
Neg logit contributions
ator-0.095
man-0.075
umm-0.070
icles-0.067
 birth-0.067
Token finished
Token ID5201
Feature activation+9.76e-01

Pos logit contributions
ator+1.267
man+0.977
umm+0.913
icles+0.861
ators+0.850
Neg logit contributions
bs-1.050
 net-0.774
 slips-0.772
 knot-0.753
ip-0.741
Token the
Token ID262
Feature activation+2.33e+00

Pos logit contributions
ator+2.303
man+1.802
umm+1.699
ators+1.590
icles+1.588
Neg logit contributions
bs-1.694
 net-1.232
 slips-1.216
ip-1.188
 knot-1.183
Token pear
Token ID25286
Feature activation+1.84e-02

Pos logit contributions
ator+6.491
man+5.297
ators+4.903
umm+4.736
icles+4.707
Neg logit contributions
bs-3.162
 lime-2.272
 knot-2.267
 net-2.253
-2.209
Token,
Token ID11
Feature activation-1.45e+00

Pos logit contributions
ator+0.048
man+0.038
umm+0.036
icles+0.035
ators+0.034
Neg logit contributions
bs-0.027
 net-0.018
 slips-0.018
 knot-0.017
 loses-0.017
Token Rachel
Token ID15984
Feature activation-6.42e-02

Pos logit contributions
bs+2.625
ip+2.005
op+1.959
 loses+1.948
 slips+1.936
Neg logit contributions
ator-3.259
man-2.507
umm-2.315
 birth-2.234
 him-2.199
Token looked
Token ID3114
Feature activation+1.34e-01

Pos logit contributions
bs+0.109
 net+0.078
 slips+0.078
 knot+0.076
 loses+0.074
Neg logit contributions
ator-0.150
man-0.120
umm-0.109
icles-0.104
 birth-0.103
Token at
Token ID379
Feature activation-6.56e-01

Pos logit contributions
ator+0.283
man+0.215
umm+0.201
icles+0.186
 birth+0.184
Neg logit contributions
bs-0.258
 slips-0.194
 net-0.193
 knot-0.190
ata-0.188
Token open
Token ID1280
Feature activation-4.43e-01

Pos logit contributions
bs+0.868
 slips+0.632
ip+0.621
 net+0.619
ata+0.606
Neg logit contributions
ator-1.042
man-0.795
umm-0.708
icles-0.691
ators-0.688
Token this
Token ID428
Feature activation-5.43e-01

Pos logit contributions
bs+0.819
 slips+0.598
 net+0.581
ata+0.580
 knot+0.567
Neg logit contributions
ator-1.006
man-0.765
icles-0.699
umm-0.692
ators-0.689
Token heavy
Token ID4334
Feature activation-1.12e+00

Pos logit contributions
bs+1.176
 slips+0.866
 net+0.857
 loses+0.853
ip+0.851
Neg logit contributions
ator-1.096
man-0.819
umm-0.730
ators-0.690
icles-0.685
Token box
Token ID3091
Feature activation-1.24e-01

Pos logit contributions
bs+2.504
ip+1.855
 slips+1.847
ata+1.829
 loses+1.819
Neg logit contributions
ator-2.210
man-1.663
umm-1.408
ators-1.400
icles-1.395
Token?"
Token ID1701
Feature activation-8.39e-02

Pos logit contributions
bs+0.236
 net+0.173
 slips+0.172
 knot+0.169
ip+0.167
Neg logit contributions
ator-0.273
man-0.212
umm-0.191
icles-0.183
ators-0.182
Token The
Token ID383
Feature activation+2.26e+00

Pos logit contributions
bs+0.156
 net+0.116
 slips+0.115
 knot+0.114
ata+0.110
Neg logit contributions
ator-0.184
man-0.144
umm-0.132
icles-0.126
ators-0.121
Token bear
Token ID6842
Feature activation+1.02e+00

Pos logit contributions
ator+4.219
umm+2.922
man+2.752
icles+2.699
ators+2.673
Neg logit contributions
bs-5.037
 net-4.243
 knot-4.221
 loop-4.189
 tie-4.105
Token tried
Token ID3088
Feature activation-5.90e-01

Pos logit contributions
ator+2.682
man+2.108
umm+2.063
icles+1.945
ators+1.930
Neg logit contributions
bs-1.512
 net-1.009
 slips-1.004
 knot-0.980
 loop-0.966
Token to
Token ID284
Feature activation+4.83e-01

Pos logit contributions
bs+1.164
 slips+0.867
 net+0.844
 knot+0.842
ata+0.829
Neg logit contributions
ator-1.262
man-0.990
icles-0.856
umm-0.831
ators-0.825
Token turn
Token ID1210
Feature activation+5.88e-02

Pos logit contributions
ator+1.177
man+0.930
umm+0.873
icles+0.830
ators+0.821
Neg logit contributions
bs-0.797
 net-0.561
 slips-0.560
 knot-0.542
ip-0.535
Token the
Token ID262
Feature activation+5.45e-01

Pos logit contributions
ator+0.142
man+0.112
umm+0.103
icles+0.101
ators+0.098
Neg logit contributions
bs-0.097
 slips-0.069
 net-0.068
 knot-0.066
ata-0.066
Token grandma
Token ID49890
Feature activation-2.85e-01

Pos logit contributions
ator+0.588
man+0.447
umm+0.413
icles+0.389
ators+0.383
Neg logit contributions
bs-0.544
 slips-0.409
 net-0.406
 knot-0.395
ata-0.394
Token.
Token ID13
Feature activation+4.24e-01

Pos logit contributions
bs+0.490
 slips+0.364
 net+0.358
 knot+0.342
 loop+0.337
Neg logit contributions
ator-0.665
man-0.536
umm-0.479
icles-0.462
ators-0.459
Token Her
Token ID2332
Feature activation+3.71e-01

Pos logit contributions
ator+0.949
man+0.733
umm+0.685
icles+0.647
ators+0.638
Neg logit contributions
bs-0.777
 net-0.574
 slips-0.570
 knot-0.557
ip-0.546
Token mom
Token ID1995
Feature activation+5.14e-01

Pos logit contributions
ator+0.738
man+0.551
umm+0.503
icles+0.470
ators+0.465
Neg logit contributions
bs-0.774
 net-0.596
 slips-0.594
 knot-0.579
ata-0.573
Tokenmy
Token ID1820
Feature activation+3.03e-01

Pos logit contributions
ator+1.199
man+0.936
umm+0.877
icles+0.830
ators+0.820
Neg logit contributions
bs-0.900
 net-0.651
 slips-0.648
 knot-0.632
ip-0.619
Token hugged
Token ID41130
Feature activation+2.52e+00

Pos logit contributions
ator+0.718
man+0.569
umm+0.522
icles+0.499
ators+0.495
Neg logit contributions
bs-0.516
 net-0.371
 slips-0.368
 knot-0.360
ip-0.351
Token her
Token ID607
Feature activation-2.81e-01

Pos logit contributions
ator+6.270
umm+4.756
man+4.676
ators+4.352
icles+4.217
Neg logit contributions
bs-4.662
-3.289
 knot-3.210
 tie-3.136
 slips-3.071
Token and
Token ID290
Feature activation-4.82e-01

Pos logit contributions
bs+0.454
 slips+0.325
 net+0.317
 loses+0.309
ata+0.306
Neg logit contributions
ator-0.694
man-0.562
umm-0.505
icles-0.486
ators-0.484
Token told
Token ID1297
Feature activation+1.84e+00

Pos logit contributions
bs+0.755
 net+0.541
 slips+0.538
ata+0.520
 knot+0.517
Neg logit contributions
ator-1.176
man-0.960
umm-0.857
icles-0.833
 birth-0.830
Token her
Token ID607
Feature activation-5.44e-01

Pos logit contributions
ator+4.173
man+3.224
umm+3.027
ators+2.823
icles+2.693
Neg logit contributions
bs-3.626
-2.636
 knot-2.601
 slips-2.515
 tie-2.507
Token that
Token ID326
Feature activation-4.21e-03

Pos logit contributions
bs+0.990
 net+0.758
 slips+0.756
 knot+0.728
ata+0.726
Neg logit contributions
ator-1.195
man-0.942
umm-0.819
 birth-0.814
icles-0.811
Token.
Token ID13
Feature activation+9.02e-01

Pos logit contributions
ator+2.025
man+1.604
umm+1.514
icles+1.443
ators+1.432
Neg logit contributions
bs-1.280
 net-0.884
 slips-0.880
 knot-0.857
 loses-0.841
Token He
Token ID679
Feature activation+1.87e-01

Pos logit contributions
ator+1.947
man+1.475
umm+1.372
icles+1.288
ators+1.288
Neg logit contributions
bs-1.749
 slips-1.305
 net-1.300
ip-1.269
 knot-1.266
Token went
Token ID1816
Feature activation+9.78e-01

Pos logit contributions
ator+0.411
man+0.318
umm+0.292
icles+0.275
ators+0.272
Neg logit contributions
bs-0.351
 net-0.259
 slips-0.258
 knot-0.252
ata-0.247
Token home
Token ID1363
Feature activation+7.20e-01

Pos logit contributions
ator+2.021
man+1.504
umm+1.440
icles+1.334
ators+1.310
Neg logit contributions
bs-1.988
 net-1.525
 slips-1.507
 tie-1.468
 knot-1.467
Token and
Token ID290
Feature activation+5.39e-01

Pos logit contributions
ator+1.764
man+1.392
umm+1.324
icles+1.258
ators+1.240
Neg logit contributions
bs-1.175
 net-0.822
 slips-0.818
 knot-0.794
ip-0.782
Token told
Token ID1297
Feature activation+2.13e+00

Pos logit contributions
ator+1.267
man+0.991
umm+0.930
icles+0.880
ators+0.870
Neg logit contributions
bs-0.935
 net-0.673
 slips-0.671
 knot-0.653
ip-0.641
Token his
Token ID465
Feature activation+1.44e+00

Pos logit contributions
ator+4.136
umm+3.060
man+3.012
ators+2.681
icles+2.639
Neg logit contributions
bs-4.587
 net-3.552
 tie-3.499
 knot-3.470
 loop-3.446
Token dad
Token ID9955
Feature activation-6.58e-01

Pos logit contributions
ator+3.114
man+2.344
umm+2.251
icles+2.146
ators+2.100
Neg logit contributions
bs-2.829
 net-2.161
 knot-2.079
 slips-2.075
 loop-2.062
Token all
Token ID477
Feature activation-7.81e-01

Pos logit contributions
bs+1.281
 slips+0.987
 net+0.973
 knot+0.955
ata+0.944
Neg logit contributions
ator-1.386
man-1.083
icles-0.938
umm-0.931
 him-0.926
Token about
Token ID546
Feature activation+8.39e-01

Pos logit contributions
bs+1.550
 slips+1.218
 net+1.194
ip+1.168
 loses+1.168
Neg logit contributions
ator-1.619
man-1.232
icles-1.067
umm-1.055
 birth-1.052
Token his
Token ID465
Feature activation+6.86e-01

Pos logit contributions
ator+1.676
man+1.247
umm+1.166
icles+1.073
ators+1.060
Neg logit contributions
bs-1.750
 net-1.356
 slips-1.337
 knot-1.316
 tie-1.302
Token can
Token ID460
Feature activation+3.09e-01

Pos logit contributions
bs+0.863
 net+0.645
 knot+0.636
 slips+0.635
ata+0.625
Neg logit contributions
ator-0.955
man-0.729
umm-0.673
icles-0.652
ators-0.625
Token be
Token ID307
Feature activation-7.90e-01

Pos logit contributions
ator+0.751
man+0.596
umm+0.558
icles+0.531
ators+0.522
Neg logit contributions
bs-0.506
 net-0.358
 slips-0.356
 knot-0.345
ip-0.341
Token.
Token ID13
Feature activation-6.05e-01

Pos logit contributions
bs+1.470
 slips+1.130
 net+1.129
 knot+1.112
ip+1.079
Neg logit contributions
ator-1.730
man-1.329
umm-1.185
icles-1.165
ators-1.145
Token
Token ID220
Feature activation-7.38e-01

Pos logit contributions
bs+1.300
 net+0.993
 knot+0.989
ata+0.975
 loses+0.972
Neg logit contributions
ator-1.133
man-0.837
umm-0.821
icles-0.755
omm-0.742
Token
Token ID198
Feature activation+4.04e-02

Pos logit contributions
bs+1.304
 net+0.991
 knot+0.966
 slips+0.930
 loses+0.922
Neg logit contributions
ator-1.693
man-1.324
umm-1.203
icles-1.198
omm-1.157
TokenM
Token ID44
Feature activation+6.53e-01

Pos logit contributions
ator+0.093
man+0.073
umm+0.068
icles+0.064
ators+0.063
Neg logit contributions
bs-0.072
 net-0.052
 slips-0.052
 knot-0.051
 loses-0.050
Tokenum
Token ID388
Feature activation+4.77e-01

Pos logit contributions
ator+1.066
man+0.730
umm+0.652
icles+0.593
ators+0.586
Neg logit contributions
bs-1.612
 slips-1.293
 net-1.292
 knot-1.268
ip-1.260
Token and
Token ID290
Feature activation-7.74e-02

Pos logit contributions
ator+1.240
man+0.997
umm+0.940
icles+0.898
ators+0.889
Neg logit contributions
bs-0.705
 net-0.474
 slips-0.472
 knot-0.456
 loses-0.445
Token Dad
Token ID17415
Feature activation+1.18e-01

Pos logit contributions
bs+0.167
 slips+0.130
 knot+0.129
ata+0.128
 net+0.128
Neg logit contributions
ator-0.145
man-0.107
umm-0.100
icles-0.094
 birth-0.090
Token were
Token ID547
Feature activation+1.06e+00

Pos logit contributions
ator+0.268
man+0.208
umm+0.192
icles+0.183
ators+0.182
Neg logit contributions
bs-0.209
 net-0.155
 slips-0.153
 knot-0.150
 loses-0.148
Token so
Token ID523
Feature activation+3.90e-01

Pos logit contributions
ator+2.512
man+1.977
umm+1.885
icles+1.774
ators+1.734
Neg logit contributions
bs-1.813
 net-1.302
 slips-1.293
-1.263
 knot-1.258
Token a
Token ID257
Feature activation+5.37e-02

Pos logit contributions
ator+1.759
man+1.304
umm+1.215
ators+1.113
icles+1.111
Neg logit contributions
bs-1.839
 net-1.412
 slips-1.401
 knot-1.386
 loop-1.363
Token mess
Token ID2085
Feature activation+9.26e-01

Pos logit contributions
ator+0.123
man+0.097
umm+0.090
icles+0.084
 birth+0.083
Neg logit contributions
bs-0.096
 slips-0.069
 net-0.068
ip-0.067
ata-0.066
Token.
Token ID13
Feature activation+1.38e-01

Pos logit contributions
ator+2.355
man+1.872
umm+1.785
icles+1.694
ators+1.680
Neg logit contributions
bs-1.424
 slips-0.962
 net-0.959
 knot-0.929
ata-0.924
Token He
Token ID679
Feature activation+1.98e-01

Pos logit contributions
ator+0.283
man+0.213
umm+0.198
icles+0.186
ators+0.180
Neg logit contributions
bs-0.281
 net-0.213
 slips-0.212
 knot-0.209
 loop-0.205
Token had
Token ID550
Feature activation+6.55e-02

Pos logit contributions
ator+0.455
man+0.358
umm+0.328
icles+0.314
ators+0.311
Neg logit contributions
bs-0.349
 net-0.253
 slips-0.250
 knot-0.246
ip-0.242
Token glue
Token ID22749
Feature activation+3.94e-01

Pos logit contributions
ator+0.153
man+0.120
umm+0.111
icles+0.108
ators+0.105
Neg logit contributions
bs-0.113
 slips-0.080
 net-0.080
 knot-0.079
ip-0.077
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
ator+1.039
man+0.841
umm+0.790
icles+0.759
ators+0.749
Neg logit contributions
bs-0.565
 net-0.374
 slips-0.372
 knot-0.360
 loses-0.349
Token cr
Token ID1067
Feature activation+1.15e+00

Pos logit contributions
bs+1.767
 slips+1.265
ip+1.257
op+1.251
 net+1.250
Neg logit contributions
ator-2.315
man-1.858
icles-1.713
umm-1.698
 birth-1.640
Tokenay
Token ID323
Feature activation+1.18e-01

Pos logit contributions
ator+2.245
man+1.667
umm+1.521
ators+1.420
 birth+1.396
Neg logit contributions
bs-2.457
 slips-1.871
 net-1.842
ata-1.834
ip-1.829
Tokenon
Token ID261
Feature activation+5.75e-02

Pos logit contributions
ator+0.286
man+0.226
umm+0.210
icles+0.204
ators+0.199
Neg logit contributions
bs-0.190
 net-0.138
 slips-0.136
 knot-0.133
 loses-0.130
Token on
Token ID319
Feature activation+1.16e-01

Pos logit contributions
ator+0.143
man+0.115
umm+0.106
icles+0.103
ators+0.100
Neg logit contributions
bs-0.090
 net-0.063
 slips-0.063
 knot-0.061
 loses-0.060
Token
Token ID95
Feature activation+1.67e+00

Pos logit contributions
ator+2.448
man+1.866
umm+1.734
icles+1.644
ators+1.630
Neg logit contributions
bs-2.263
 net-1.713
 slips-1.702
 knot-1.672
 tie-1.668
Token
Token ID26391
Feature activation+5.96e-02

Pos logit contributions
ator+3.318
man+2.479
umm+2.331
icles+2.225
 birth+2.187
Neg logit contributions
bs-3.324
-2.636
 net-2.576
 slips-2.567
ata-2.565
Token
Token ID129
Feature activation+1.58e+00

Pos logit contributions
ator+0.139
man+0.109
umm+0.101
icles+0.095
ators+0.095
Neg logit contributions
bs-0.103
 net-0.075
 slips-0.074
 knot-0.074
 loop-0.071
Token
Token ID241
Feature activation-1.20e-01

Pos logit contributions
ator+2.876
man+2.086
umm+1.978
icles+1.794
ators+1.781
Neg logit contributions
bs-3.527
 net-2.795
 slips-2.778
-2.769
 loop-2.759
TokenIt
Token ID1026
Feature activation+3.76e-01

Pos logit contributions
bs+0.238
 net+0.184
 slips+0.181
 knot+0.179
 loses+0.174
Neg logit contributions
ator-0.246
man-0.188
umm-0.172
icles-0.164
ators-0.159
Tokenâ
Token ID22940
Feature activation+4.26e-01

Pos logit contributions
ator+0.904
man+0.716
umm+0.667
icles+0.635
ators+0.628
Neg logit contributions
bs-0.625
 net-0.442
 slips-0.439
 knot-0.428
ip-0.421
Token
Token ID26391
Feature activation-4.89e-01

Pos logit contributions
ator+0.778
man+0.561
umm+0.513
icles+0.473
ators+0.463
Neg logit contributions
bs-0.959
 net-0.753
 slips-0.750
 knot-0.737
 loses-0.727
Token
Token ID8151
Feature activation-4.18e-01

Pos logit contributions
bs+0.803
 net+0.578
 knot+0.575
 slips+0.563
 loop+0.547
Neg logit contributions
ator-1.169
man-0.934
umm-0.857
icles-0.816
ators-0.811
Tokens
Token ID82
Feature activation+1.67e-01

Pos logit contributions
bs+0.757
 net+0.571
 knot+0.564
 slips+0.552
 loop+0.545
Neg logit contributions
ator-0.937
man-0.737
umm-0.667
icles-0.654
omm-0.630
Token lots
Token ID6041
Feature activation-1.09e+00

Pos logit contributions
ator+0.360
man+0.275
umm+0.254
icles+0.242
ators+0.238
Neg logit contributions
bs-0.321
 net-0.237
 slips-0.236
 knot-0.232
ip-0.231
Token of
Token ID286
Feature activation-8.09e-01

Pos logit contributions
bs+1.697
 slips+1.209
 net+1.180
ata+1.165
 knot+1.139
Neg logit contributions
ator-2.657
man-2.091
umm-2.043
icles-1.913
 birth-1.886
Token wanted
Token ID2227
Feature activation+1.89e-01

Pos logit contributions
ator+2.824
umm+2.082
man+2.032
icles+1.909
ators+1.856
Neg logit contributions
bs-2.666
 net-2.015
 slips-2.006
 tie-1.971
 loop-1.958
Token to
Token ID284
Feature activation+3.80e-01

Pos logit contributions
ator+0.426
man+0.333
umm+0.307
icles+0.294
 birth+0.290
Neg logit contributions
bs-0.343
 slips-0.252
 net-0.248
 knot-0.245
ip-0.241
Token build
Token ID1382
Feature activation-3.42e-02

Pos logit contributions
ator+0.846
man+0.653
umm+0.604
icles+0.573
ators+0.564
Neg logit contributions
bs-0.704
 slips-0.519
 net-0.516
 knot-0.502
ip-0.501
Token a
Token ID257
Feature activation-6.94e-03

Pos logit contributions
bs+0.050
 slips+0.033
 net+0.032
ip+0.032
ata+0.031
Neg logit contributions
ator-0.090
man-0.072
umm-0.068
icles-0.065
 birth-0.065
Token castle
Token ID16669
Feature activation+2.21e-01

Pos logit contributions
bs+0.016
 slips+0.012
ata+0.012
 net+0.012
ip+0.012
Neg logit contributions
ator-0.013
man-0.009
umm-0.008
 birth-0.008
icles-0.007
Token so
Token ID523
Feature activation+5.11e-01

Pos logit contributions
ator+0.492
man+0.382
umm+0.351
icles+0.331
ators+0.328
Neg logit contributions
bs-0.412
 slips-0.303
 net-0.302
 knot-0.294
ip-0.293
Token he
Token ID339
Feature activation+1.06e+00

Pos logit contributions
ator+1.181
man+0.920
umm+0.861
icles+0.814
ators+0.804
Neg logit contributions
bs-0.908
 net-0.660
 slips-0.658
 knot-0.641
ip-0.629
Token could
Token ID714
Feature activation+1.35e+00

Pos logit contributions
ator+2.708
man+2.123
umm+2.087
icles+1.952
ators+1.923
Neg logit contributions
bs-1.641
 net-1.115
 slips-1.110
 knot-1.069
 tie-1.068
Token look
Token ID804
Feature activation+4.44e-01

Pos logit contributions
ator+3.247
man+2.528
umm+2.497
icles+2.303
ators+2.296
Neg logit contributions
bs-2.306
 net-1.648
 tie-1.647
 slips-1.633
 loop-1.611
Token after
Token ID706
Feature activation+4.89e-01

Pos logit contributions
ator+0.935
man+0.708
umm+0.657
icles+0.617
ators+0.609
Neg logit contributions
bs-0.872
 net-0.658
 slips-0.658
 knot-0.642
ip-0.632
Token all
Token ID477
Feature activation-2.78e-01

Pos logit contributions
ator+1.132
man+0.881
umm+0.826
icles+0.782
ators+0.772
Neg logit contributions
bs-0.863
 net-0.625
 slips-0.625
 knot-0.607
ip-0.597
Token chasing
Token ID20023
Feature activation+4.71e-02

Pos logit contributions
ator+1.586
man+1.132
umm+1.048
icles+0.945
ators+0.932
Neg logit contributions
bs-2.039
 net-1.606
 slips-1.596
 knot-1.567
 loses-1.552
Token them
Token ID606
Feature activation-1.76e-01

Pos logit contributions
ator+0.091
man+0.067
umm+0.061
icles+0.058
ators+0.056
Neg logit contributions
bs-0.099
 slips-0.078
 net-0.077
 knot-0.075
ip-0.075
Token,
Token ID11
Feature activation+5.70e-01

Pos logit contributions
bs+0.312
 slips+0.228
 net+0.226
 knot+0.220
ip+0.219
Neg logit contributions
ator-0.401
man-0.314
umm-0.288
icles-0.276
ators-0.272
Token so
Token ID523
Feature activation-4.02e-01

Pos logit contributions
ator+1.455
man+1.163
umm+1.100
icles+1.047
ators+1.037
Neg logit contributions
bs-0.866
 net-0.590
 slips-0.588
 knot-0.569
 loses-0.555
Token they
Token ID484
Feature activation+5.84e-01

Pos logit contributions
bs+0.592
 slips+0.411
 net+0.401
 knot+0.396
ata+0.391
Neg logit contributions
ator-1.025
man-0.837
umm-0.770
icles-0.737
 birth-0.728
Token decided
Token ID3066
Feature activation+5.23e-01

Pos logit contributions
ator+1.309
man+1.009
umm+0.946
icles+0.891
ators+0.879
Neg logit contributions
bs-1.075
 net-0.791
 slips-0.788
 knot-0.769
ip-0.755
Token to
Token ID284
Feature activation+9.35e-01

Pos logit contributions
ator+1.218
man+0.951
umm+0.891
icles+0.843
ators+0.833
Neg logit contributions
bs-0.915
 net-0.661
 slips-0.660
 knot-0.643
ip-0.630
Token play
Token ID711
Feature activation-1.19e-01

Pos logit contributions
ator+2.172
man+1.689
umm+1.612
icles+1.507
ators+1.493
Neg logit contributions
bs-1.628
 net-1.197
 slips-1.178
 knot-1.159
 tie-1.146
Token a
Token ID257
Feature activation-2.51e-01

Pos logit contributions
bs+0.210
 slips+0.155
 net+0.149
ip+0.147
 knot+0.147
Neg logit contributions
ator-0.274
man-0.216
umm-0.195
icles-0.188
ators-0.187
Token trick
Token ID6908
Feature activation-5.61e-02

Pos logit contributions
bs+0.500
 slips+0.378
 net+0.360
ip+0.358
 loses+0.354
Neg logit contributions
ator-0.531
man-0.409
umm-0.369
 birth-0.347
icles-0.339
Token on
Token ID319
Feature activation-3.66e-01

Pos logit contributions
bs+0.098
 slips+0.072
 net+0.071
 knot+0.070
 loses+0.068
Neg logit contributions
ator-0.129
man-0.102
umm-0.092
icles-0.089
ators-0.088
Token be
Token ID307
Feature activation+3.53e-02

Pos logit contributions
bs+0.433
 slips+0.331
ip+0.330
 net+0.325
ata+0.322
Neg logit contributions
ator-0.401
man-0.299
umm-0.265
icles-0.254
ators-0.246
Token careful
Token ID8161
Feature activation+1.89e-01

Pos logit contributions
ator+0.086
man+0.068
umm+0.062
ators+0.060
icles+0.060
Neg logit contributions
bs-0.059
 slips-0.041
 net-0.041
 knot-0.040
ip-0.040
Token.
Token ID13
Feature activation+6.31e-01

Pos logit contributions
ator+0.461
man+0.366
umm+0.340
icles+0.326
ators+0.322
Neg logit contributions
bs-0.310
 net-0.218
 slips-0.218
 knot-0.211
ip-0.208
Token So
Token ID1406
Feature activation+8.02e-01

Pos logit contributions
ator+1.278
man+0.954
umm+0.881
icles+0.824
ators+0.814
Neg logit contributions
bs-1.298
 net-0.992
 slips-0.991
 knot-0.968
ip-0.955
Token,
Token ID11
Feature activation+4.48e-01

Pos logit contributions
ator+1.966
man+1.545
umm+1.461
icles+1.385
ators+1.376
Neg logit contributions
bs-1.321
 net-0.936
 slips-0.931
 knot-0.905
-0.885
Token even
Token ID772
Feature activation-1.54e+00

Pos logit contributions
ator+0.988
man+0.760
umm+0.709
icles+0.669
ators+0.659
Neg logit contributions
bs-0.840
 net-0.621
 slips-0.620
 knot-0.606
ip-0.595
Token though
Token ID996
Feature activation-1.40e-02

Pos logit contributions
bs+2.535
 net+1.812
 knot+1.807
op+1.793
 slips+1.788
Neg logit contributions
ator-3.658
man-3.052
umm-2.687
 him-2.652
 birth-2.625
Token he
Token ID339
Feature activation+3.71e-01

Pos logit contributions
bs+0.027
 slips+0.020
 net+0.020
 knot+0.020
ata+0.020
Neg logit contributions
ator-0.030
man-0.023
umm-0.021
icles-0.020
ators-0.019
Token hadn
Token ID8020
Feature activation-6.89e-02

Pos logit contributions
ator+0.803
man+0.616
umm+0.569
icles+0.536
ators+0.531
Neg logit contributions
bs-0.708
 net-0.528
 slips-0.525
 knot-0.515
ip-0.504
Token't
Token ID470
Feature activation-1.64e+00

Pos logit contributions
bs+0.184
 net+0.149
 slips+0.149
 knot+0.148
ip+0.145
Neg logit contributions
ator-0.098
man-0.064
umm-0.053
icles-0.049
ators-0.047
Token found
Token ID1043
Feature activation+9.18e-02

Pos logit contributions
bs+3.436
ip+2.720
 knot+2.686
 slips+2.618
 net+2.605
Neg logit contributions
ator-3.063
man-2.395
 him-2.057
 birth-1.974
umm-1.968
Token and
Token ID290
Feature activation-1.35e+00

Pos logit contributions
ator+0.945
man+0.764
umm+0.719
icles+0.689
ators+0.681
Neg logit contributions
bs-0.520
 slips-0.341
 net-0.341
 knot-0.332
ip-0.324
Token made
Token ID925
Feature activation-7.36e-02

Pos logit contributions
bs+2.398
op+1.771
ip+1.762
 knot+1.749
 slips+1.736
Neg logit contributions
ator-2.987
man-2.394
icles-2.168
 birth-2.107
ators-2.096
Token a
Token ID257
Feature activation-1.04e-01

Pos logit contributions
bs+0.100
 slips+0.064
 net+0.063
 knot+0.062
ip+0.062
Neg logit contributions
ator-0.198
man-0.161
umm-0.150
icles-0.148
ators-0.143
Token castle
Token ID16669
Feature activation+6.20e-01

Pos logit contributions
bs+0.195
 slips+0.140
ip+0.139
 net+0.137
ata+0.137
Neg logit contributions
ator-0.231
man-0.181
umm-0.165
icles-0.155
 birth-0.154
Token.
Token ID13
Feature activation-3.58e-01

Pos logit contributions
ator+1.441
man+1.122
umm+1.057
icles+0.999
ators+0.987
Neg logit contributions
bs-1.080
 net-0.782
 slips-0.779
 knot-0.759
 loop-0.744
Token They
Token ID1119
Feature activation-3.70e-01

Pos logit contributions
bs+0.707
 net+0.536
 knot+0.532
 slips+0.532
 loses+0.521
Neg logit contributions
ator-0.742
man-0.559
umm-0.522
icles-0.503
 birth-0.482
Token had
Token ID550
Feature activation+6.96e-01

Pos logit contributions
bs+0.674
 knot+0.497
 net+0.496
 slips+0.495
ip+0.485
Neg logit contributions
ator-0.815
man-0.640
umm-0.560
icles-0.556
ators-0.554
Token fun
Token ID1257
Feature activation+1.99e-01

Pos logit contributions
ator+1.846
man+1.489
umm+1.414
icles+1.342
ators+1.329
Neg logit contributions
bs-1.004
 net-0.675
 slips-0.667
 knot-0.643
 loop-0.631
Token and
Token ID290
Feature activation-4.61e-01

Pos logit contributions
ator+0.474
man+0.370
umm+0.343
icles+0.328
ators+0.325
Neg logit contributions
bs-0.343
 slips-0.244
 net-0.243
 knot-0.239
ip-0.234
Token forgot
Token ID16453
Feature activation-8.76e-02

Pos logit contributions
bs+0.859
 knot+0.643
 slips+0.641
ip+0.640
 net+0.638
Neg logit contributions
ator-0.999
man-0.773
umm-0.708
 birth-0.684
icles-0.679
Token about
Token ID546
Feature activation+4.95e-01

Pos logit contributions
bs+0.167
 slips+0.125
 net+0.123
 knot+0.122
ata+0.121
Neg logit contributions
ator-0.190
man-0.145
umm-0.131
icles-0.128
 birth-0.127
TokenWhat
Token ID2061
Feature activation-2.64e+00

Pos logit contributions
ator+0.323
man+0.224
umm+0.201
icles+0.183
ators+0.179
Neg logit contributions
bs-0.485
 net-0.388
 slips-0.387
 knot-0.380
ip-0.379
Token is
Token ID318
Feature activation-8.02e-02

Pos logit contributions
bs+6.482
ip+5.142
 net+5.048
 knot+4.797
ounce+4.774
Neg logit contributions
ator-4.305
man-3.357
 birth-3.120
 him-2.828
 us-2.667
Token that
Token ID326
Feature activation-5.34e-01

Pos logit contributions
bs+0.140
 slips+0.100
 net+0.099
 knot+0.097
ata+0.096
Neg logit contributions
ator-0.188
man-0.148
umm-0.135
icles-0.130
 birth-0.128
Token?"
Token ID1701
Feature activation-4.64e-01

Pos logit contributions
bs+1.056
 slips+0.771
ip+0.768
 net+0.764
 loses+0.741
Neg logit contributions
ator-1.147
man-0.925
umm-0.797
icles-0.756
ators-0.731
Token Tom
Token ID4186
Feature activation-7.13e-01

Pos logit contributions
bs+0.768
 net+0.560
ip+0.557
ata+0.546
 knot+0.544
Neg logit contributions
ator-1.075
man-0.848
umm-0.782
icles-0.759
 birth-0.751
Token asks
Token ID7893
Feature activation-3.52e-01

Pos logit contributions
bs+1.303
 net+0.943
ip+0.925
ata+0.883
 knot+0.882
Neg logit contributions
ator-1.583
man-1.257
umm-1.126
ators-1.091
icles-1.090
Token.
Token ID13
Feature activation-3.29e-01

Pos logit contributions
bs+0.610
 net+0.457
 slips+0.444
 knot+0.442
ip+0.433
Neg logit contributions
ator-0.812
man-0.652
umm-0.580
icles-0.576
ators-0.553
Token
Token ID198
Feature activation-2.76e-01

Pos logit contributions
bs+0.620
 net+0.463
 slips+0.456
 knot+0.450
ip+0.448
Neg logit contributions
ator-0.711
man-0.572
umm-0.500
icles-0.482
omm-0.476
Token
Token ID198
Feature activation-2.25e-01

Pos logit contributions
bs+0.468
 net+0.350
 slips+0.338
 knot+0.336
 tie+0.325
Neg logit contributions
ator-0.648
man-0.514
umm-0.455
icles-0.448
omm-0.438
Token"
Token ID1
Feature activation-6.17e-01

Pos logit contributions
bs+0.360
 net+0.260
 slips+0.250
 knot+0.247
ip+0.245
Neg logit contributions
ator-0.553
man-0.448
umm-0.401
icles-0.387
ators-0.384
TokenIt
Token ID1026
Feature activation-1.10e-01

Pos logit contributions
bs+1.317
 net+1.031
 slips+1.008
ip+1.003
 knot+1.002
Neg logit contributions
ator-1.176
man-0.903
umm-0.783
icles-0.747
ators-0.745
Token chat
Token ID8537
Feature activation+5.79e-01

Pos logit contributions
ator+3.998
man+3.076
umm+3.003
icles+2.803
ators+2.790
Neg logit contributions
bs-3.084
 net-2.310
 tie-2.290
 knot-2.256
-2.246
Token and
Token ID290
Feature activation+3.26e-01

Pos logit contributions
ator+1.300
man+1.004
umm+0.940
icles+0.886
ators+0.875
Neg logit contributions
bs-1.061
 net-0.779
 slips-0.776
 knot-0.759
ip-0.744
Token sing
Token ID1702
Feature activation+8.96e-01

Pos logit contributions
ator+0.751
man+0.589
umm+0.548
icles+0.519
ators+0.512
Neg logit contributions
bs-0.572
 slips-0.415
 net-0.413
 knot-0.403
ip-0.400
Token together
Token ID1978
Feature activation+7.64e-01

Pos logit contributions
ator+2.046
man+1.573
umm+1.497
icles+1.409
ators+1.388
Neg logit contributions
bs-1.622
 net-1.179
 slips-1.158
 knot-1.148
 loop-1.122
Token.
Token ID13
Feature activation+7.26e-01

Pos logit contributions
ator+1.652
man+1.256
umm+1.179
icles+1.109
ators+1.092
Neg logit contributions
bs-1.469
 net-1.098
 slips-1.088
 knot-1.068
ip-1.048
Token And
Token ID843
Feature activation-2.46e-01

Pos logit contributions
ator+1.737
man+1.358
umm+1.281
icles+1.214
ators+1.205
Neg logit contributions
bs-1.234
 net-0.877
 slips-0.873
 knot-0.852
ip-0.841
Token so
Token ID523
Feature activation-8.19e-02

Pos logit contributions
bs+0.478
 slips+0.366
 net+0.361
ata+0.350
 knot+0.349
Neg logit contributions
ator-0.520
man-0.402
umm-0.368
icles-0.344
 birth-0.340
Token,
Token ID11
Feature activation+3.59e-02

Pos logit contributions
bs+0.139
 net+0.102
 slips+0.102
 knot+0.098
ata+0.097
Neg logit contributions
ator-0.191
man-0.153
umm-0.140
icles-0.131
ators-0.130
Token Bun
Token ID28515
Feature activation+4.16e-01

Pos logit contributions
ator+0.070
man+0.052
umm+0.047
icles+0.044
ators+0.043
Neg logit contributions
bs-0.076
 slips-0.059
 net-0.059
ata-0.057
 knot-0.057
Token and
Token ID290
Feature activation+2.73e-01

Pos logit contributions
ator+0.939
man+0.729
umm+0.677
icles+0.638
ators+0.631
Neg logit contributions
bs-0.758
 net-0.557
 slips-0.556
 knot-0.541
 loses-0.530
Token the
Token ID262
Feature activation+7.81e-01

Pos logit contributions
ator+0.654
man+0.518
umm+0.481
icles+0.458
ators+0.453
Neg logit contributions
bs-0.461
 slips-0.328
 net-0.325
ata-0.315
ip-0.314
Token she
Token ID673
Feature activation-2.51e-01

Pos logit contributions
bs+0.096
 slips+0.068
 net+0.067
 knot+0.065
 loses+0.065
Neg logit contributions
ator-0.155
man-0.125
umm-0.116
icles-0.112
 birth-0.111
Token called
Token ID1444
Feature activation+2.37e-01

Pos logit contributions
bs+0.462
 net+0.347
 slips+0.346
 knot+0.335
 loses+0.333
Neg logit contributions
ator-0.548
man-0.437
umm-0.383
ators-0.371
icles-0.371
Token a
Token ID257
Feature activation-2.54e-01

Pos logit contributions
ator+0.496
man+0.376
umm+0.343
icles+0.330
ators+0.324
Neg logit contributions
bs-0.467
 slips-0.355
 net-0.353
 knot-0.344
ip-0.342
Token taxi
Token ID17536
Feature activation+9.44e-01

Pos logit contributions
bs+0.487
 slips+0.362
 loses+0.354
 net+0.353
ip+0.353
Neg logit contributions
ator-0.546
man-0.433
umm-0.383
 birth-0.362
ators-0.356
Token to
Token ID284
Feature activation+4.08e-01

Pos logit contributions
ator+1.858
man+1.330
umm+1.288
icles+1.178
ators+1.155
Neg logit contributions
bs-1.995
 slips-1.526
 net-1.524
 tie-1.493
 knot-1.491
Token take
Token ID1011
Feature activation-1.77e-01

Pos logit contributions
ator+0.914
man+0.706
umm+0.655
icles+0.624
ators+0.615
Neg logit contributions
bs-0.748
 net-0.551
 slips-0.551
 knot-0.536
ip-0.528
Token her
Token ID607
Feature activation-4.29e-01

Pos logit contributions
bs+0.330
 slips+0.255
 net+0.254
 knot+0.244
 loses+0.242
Neg logit contributions
ator-0.380
man-0.292
umm-0.268
icles-0.258
 birth-0.255
Token to
Token ID284
Feature activation-6.84e-01

Pos logit contributions
bs+0.803
 slips+0.603
 net+0.600
 knot+0.582
ata+0.579
Neg logit contributions
ator-0.939
man-0.733
umm-0.645
icles-0.640
ators-0.635
Token work
Token ID670
Feature activation-8.91e-02

Pos logit contributions
bs+1.205
 slips+0.884
ip+0.867
 net+0.865
ata+0.834
Neg logit contributions
ator-1.597
man-1.238
icles-1.115
umm-1.109
 birth-1.109
Token.
Token ID13
Feature activation-5.51e-03

Pos logit contributions
bs+0.147
 net+0.106
 slips+0.105
 knot+0.101
 loses+0.099
Neg logit contributions
ator-0.216
man-0.172
umm-0.157
icles-0.152
ators-0.151
Token
Token ID220
Feature activation-4.00e-01

Pos logit contributions
bs+0.010
 net+0.008
 slips+0.007
 knot+0.007
 loses+0.007
Neg logit contributions
ator-0.012
man-0.010
umm-0.009
icles-0.008
 birth-0.008
Token.
Token ID13
Feature activation-6.02e-01

Pos logit contributions
bs+1.211
 slips+0.882
 net+0.877
ip+0.854
 loses+0.850
Neg logit contributions
ator-1.515
man-1.226
umm-1.065
icles-1.011
ators-0.997
Token The
Token ID383
Feature activation-1.84e-01

Pos logit contributions
bs+1.191
 net+0.932
 slips+0.917
 loses+0.889
 knot+0.885
Neg logit contributions
ator-1.222
man-0.933
umm-0.876
icles-0.821
omm-0.814
Token bird
Token ID6512
Feature activation+5.60e-01

Pos logit contributions
bs+0.336
ip+0.241
 slips+0.237
ata+0.236
 loses+0.230
Neg logit contributions
ator-0.419
man-0.336
umm-0.303
icles-0.287
ators-0.282
Token sees
Token ID7224
Feature activation-2.35e-01

Pos logit contributions
ator+1.387
man+1.100
umm+1.038
icles+0.986
ators+0.975
Neg logit contributions
bs-0.896
 net-0.624
 slips-0.623
 knot-0.604
 loses-0.590
Token Lily
Token ID20037
Feature activation-1.22e+00

Pos logit contributions
bs+0.450
 slips+0.342
 net+0.337
 knot+0.329
ip+0.326
Neg logit contributions
ator-0.505
man-0.384
umm-0.353
icles-0.346
 birth-0.333
Token too
Token ID1165
Feature activation-2.23e+00

Pos logit contributions
bs+1.908
 slips+1.293
 net+1.279
ip+1.277
ata+1.214
Neg logit contributions
ator-3.000
man-2.495
 him-2.168
ping-2.156
icles-2.137
Token and
Token ID290
Feature activation-7.83e-01

Pos logit contributions
bs+4.404
 net+3.539
 knot+3.498
 slips+3.373
ip+3.314
Neg logit contributions
ator-4.451
man-3.474
 him-3.171
umm-2.955
 himself-2.947
Token sings
Token ID33041
Feature activation+4.13e-01

Pos logit contributions
bs+1.349
ata+0.987
ip+0.981
 net+0.972
op+0.957
Neg logit contributions
ator-1.752
man-1.403
umm-1.238
 birth-1.229
icles-1.220
Token a
Token ID257
Feature activation-3.91e-01

Pos logit contributions
ator+1.063
man+0.854
umm+0.802
icles+0.766
ators+0.757
Neg logit contributions
bs-0.625
 slips-0.426
 net-0.426
 knot-0.410
ip-0.402
Token pretty
Token ID2495
Feature activation+2.38e-01

Pos logit contributions
bs+0.843
 slips+0.658
ip+0.646
 loses+0.636
 net+0.635
Neg logit contributions
ator-0.744
man-0.572
umm-0.504
 birth-0.454
icles-0.443
Token song
Token ID3496
Feature activation+1.85e-01

Pos logit contributions
ator+0.460
man+0.344
umm+0.310
icles+0.287
ators+0.280
Neg logit contributions
bs-0.514
 slips-0.398
 net-0.393
ip-0.386
 loses-0.384
Token Tom
Token ID4186
Feature activation-5.31e-01

Pos logit contributions
bs+1.043
 slips+0.810
 net+0.808
ip+0.775
 knot+0.769
Neg logit contributions
ator-1.118
man-0.876
umm-0.769
icles-0.763
 birth-0.745
Token said
Token ID531
Feature activation-5.32e-02

Pos logit contributions
bs+0.821
 net+0.593
 slips+0.579
 knot+0.557
 loses+0.538
Neg logit contributions
ator-1.295
man-1.089
icles-0.938
umm-0.938
ators-0.910
Token,
Token ID11
Feature activation-2.74e-01

Pos logit contributions
bs+0.084
 net+0.060
 slips+0.060
 knot+0.058
 loop+0.057
Neg logit contributions
ator-0.130
man-0.105
umm-0.096
icles-0.093
ators-0.090
Token and
Token ID290
Feature activation-1.14e-01

Pos logit contributions
bs+0.515
 slips+0.376
 net+0.371
ip+0.369
 knot+0.367
Neg logit contributions
ator-0.599
man-0.471
umm-0.417
icles-0.406
ators-0.395
Token threw
Token ID9617
Feature activation-8.86e-01

Pos logit contributions
bs+0.196
 slips+0.143
 net+0.143
 knot+0.140
 loses+0.137
Neg logit contributions
ator-0.260
man-0.209
umm-0.187
icles-0.182
ators-0.177
Token a
Token ID257
Feature activation-2.72e+00

Pos logit contributions
bs+1.708
 slips+1.293
op+1.273
ip+1.261
ata+1.251
Neg logit contributions
ator-1.795
man-1.441
 him-1.307
icles-1.265
umm-1.251
Token stick
Token ID4859
Feature activation-9.96e-01

Pos logit contributions
bs+4.766
 loses+3.612
ver+3.572
 slips+3.469
ip+3.461
Neg logit contributions
ator-5.702
man-5.335
umm-4.187
opher-3.961
icles-3.945
Token for
Token ID329
Feature activation+8.28e-01

Pos logit contributions
bs+1.960
 slips+1.463
 loses+1.439
ip+1.431
 net+1.430
Neg logit contributions
ator-2.009
man-1.646
umm-1.358
icles-1.344
esh-1.297
Token Max
Token ID5436
Feature activation-1.34e+00

Pos logit contributions
ator+2.130
man+1.707
umm+1.626
icles+1.528
ators+1.522
Neg logit contributions
bs-1.256
 net-0.865
 slips-0.848
 knot-0.835
 loop-0.816
Token to
Token ID284
Feature activation-1.16e+00

Pos logit contributions
bs+2.466
 slips+1.854
 knot+1.789
 net+1.767
ip+1.739
Neg logit contributions
ator-2.779
man-2.327
 him-1.987
icles-1.937
umm-1.878
Token catch
Token ID4929
Feature activation-6.85e-01

Pos logit contributions
bs+2.016
ip+1.456
 slips+1.412
op+1.387
ata+1.387
Neg logit contributions
ator-2.639
man-2.125
 him-1.946
icles-1.899
 us-1.843
Token like
Token ID588
Feature activation-5.48e-01

Pos logit contributions
bs+0.154
 net+0.110
 slips+0.110
ip+0.108
 knot+0.106
Neg logit contributions
ator-0.220
man-0.174
umm-0.160
icles-0.153
 birth-0.153
Token a
Token ID257
Feature activation-9.02e-01

Pos logit contributions
bs+0.894
 slips+0.629
ip+0.612
ata+0.607
 knot+0.606
Neg logit contributions
ator-1.319
man-1.057
umm-0.965
icles-0.949
 birth-0.945
Token horn
Token ID12718
Feature activation-3.51e-02

Pos logit contributions
bs+1.759
ip+1.282
 slips+1.280
ata+1.251
op+1.230
Neg logit contributions
ator-1.916
man-1.554
umm-1.349
 birth-1.278
 Amen-1.213
Token.
Token ID13
Feature activation-1.58e+00

Pos logit contributions
bs+0.058
 net+0.041
 slips+0.040
 knot+0.039
ip+0.039
Neg logit contributions
ator-0.084
man-0.068
umm-0.062
icles-0.060
ators-0.058
Token Lily
Token ID20037
Feature activation-1.13e+00

Pos logit contributions
bs+3.073
 knot+2.389
 net+2.383
 slips+2.328
 loses+2.310
Neg logit contributions
ator-3.271
man-2.450
umm-2.368
icles-2.282
omm-2.212
Token and
Token ID290
Feature activation-2.36e+00

Pos logit contributions
bs+1.816
 net+1.214
 knot+1.168
ip+1.160
op+1.130
Neg logit contributions
ator-2.784
man-2.293
umm-1.965
ators-1.959
opher-1.923
Token Ben
Token ID3932
Feature activation-5.61e-01

Pos logit contributions
bs+3.939
ata+2.933
op+2.819
ching+2.786
ird+2.707
Neg logit contributions
ator-5.373
 him-4.570
man-4.454
icles-4.153
 us-4.048
Token think
Token ID892
Feature activation-1.15e+00

Pos logit contributions
bs+0.936
ip+0.680
 net+0.664
 slips+0.660
 knot+0.648
Neg logit contributions
ator-1.297
man-1.031
umm-0.915
ators-0.896
icles-0.895
Token the
Token ID262
Feature activation+5.76e-01

Pos logit contributions
bs+1.909
ata+1.424
 net+1.382
 slips+1.380
ip+1.377
Neg logit contributions
ator-2.646
man-2.100
 birth-1.894
 him-1.884
icles-1.856
Token bird
Token ID6512
Feature activation+6.98e-01

Pos logit contributions
ator+1.251
man+0.955
umm+0.892
icles+0.839
ators+0.828
Neg logit contributions
bs-1.098
 net-0.820
 slips-0.816
 knot-0.799
ip-0.784
Token is
Token ID318
Feature activation+2.16e-01

Pos logit contributions
ator+1.617
man+1.252
umm+1.187
icles+1.117
ators+1.103
Neg logit contributions
bs-1.237
 slips-0.897
 net-0.896
 knot-0.871
 loses-0.858
Token says
Token ID1139
Feature activation-7.78e-01

Pos logit contributions
ator+2.454
man+2.020
umm+1.950
icles+1.857
ators+1.842
Neg logit contributions
bs-0.934
 slips-0.533
 net-0.522
 knot-0.498
-0.486
Token.
Token ID13
Feature activation-9.73e-01

Pos logit contributions
bs+1.373
 net+1.062
 slips+1.057
 knot+1.047
 loop+1.025
Neg logit contributions
ator-1.741
man-1.403
umm-1.216
icles-1.198
 him-1.147
Token
Token ID198
Feature activation-6.18e-01

Pos logit contributions
bs+1.779
 net+1.351
 knot+1.305
ata+1.304
 slips+1.288
Neg logit contributions
ator-2.029
man-1.665
umm-1.469
omm-1.420
icles-1.393
Token
Token ID198
Feature activation-5.07e-01

Pos logit contributions
bs+1.087
 net+0.792
 slips+0.769
ata+0.766
 knot+0.763
Neg logit contributions
ator-1.399
man-1.123
umm-0.973
icles-0.957
esh-0.931
Token"
Token ID1
Feature activation+1.40e+00

Pos logit contributions
bs+0.835
 net+0.591
 slips+0.568
ata+0.568
 knot+0.555
Neg logit contributions
ator-1.206
man-0.984
umm-0.883
icles-0.843
ators-0.836
TokenWhat
Token ID2061
Feature activation-2.59e+00

Pos logit contributions
ator+2.838
man+2.074
umm+1.908
icles+1.827
ators+1.781
Neg logit contributions
bs-2.953
 net-2.281
 slips-2.248
 knot-2.246
 tie-2.222
Token is
Token ID318
Feature activation+6.70e-01

Pos logit contributions
bs+5.659
ip+4.245
 net+4.234
 loop+4.159
 slips+4.111
Neg logit contributions
ator-4.928
man-4.006
 birth-3.866
 him-3.473
ators-3.300
Token it
Token ID340
Feature activation+8.17e-01

Pos logit contributions
ator+1.679
man+1.335
umm+1.262
icles+1.198
ators+1.185
Neg logit contributions
bs-1.065
 net-0.739
 slips-0.734
 knot-0.714
ip-0.700
Token,
Token ID11
Feature activation+8.17e-01

Pos logit contributions
ator+1.934
man+1.506
umm+1.431
icles+1.343
ators+1.336
Neg logit contributions
bs-1.400
 slips-1.005
 net-1.003
 knot-0.971
ip-0.961
Token Mom
Token ID11254
Feature activation-5.42e-01

Pos logit contributions
ator+1.789
man+1.362
umm+1.278
icles+1.193
ators+1.187
Neg logit contributions
bs-1.554
 net-1.166
 slips-1.163
 knot-1.147
 loop-1.127
Token?
Token ID30
Feature activation-1.89e-01

Pos logit contributions
bs+1.091
 net+0.848
 knot+0.817
 slips+0.816
 loses+0.798
Neg logit contributions
ator-1.138
man-0.891
umm-0.772
ators-0.750
icles-0.749
Token
Token ID198
Feature activation-6.16e-01

Pos logit contributions
bs+0.619
 net+0.460
 slips+0.443
 knot+0.436
 loses+0.428
Neg logit contributions
ator-0.834
man-0.667
umm-0.599
icles-0.586
ators-0.572
TokenMom
Token ID29252
Feature activation+7.48e-01

Pos logit contributions
bs+1.195
 net+0.920
 slips+0.894
 loses+0.865
 knot+0.855
Neg logit contributions
ator-1.306
man-1.021
umm-0.930
icles-0.891
ators-0.876
Tokenmy
Token ID1820
Feature activation+2.00e-02

Pos logit contributions
ator+1.441
man+1.047
umm+0.985
icles+0.899
ators+0.886
Neg logit contributions
bs-1.612
 slips-1.246
 net-1.239
 knot-1.218
ip-1.209
Token gave
Token ID2921
Feature activation+4.17e-01

Pos logit contributions
ator+0.049
man+0.040
umm+0.036
icles+0.034
ators+0.034
Neg logit contributions
bs-0.032
 net-0.023
 slips-0.023
 knot-0.022
 loses-0.022
Token Tommy
Token ID19919
Feature activation-1.37e+00

Pos logit contributions
ator+0.921
man+0.709
umm+0.660
icles+0.626
ators+0.616
Neg logit contributions
bs-0.785
 net-0.584
 slips-0.582
 knot-0.567
 loses-0.558
Token a
Token ID257
Feature activation-2.39e+00

Pos logit contributions
bs+2.195
 slips+1.717
 loses+1.637
 knot+1.597
 net+1.581
Neg logit contributions
ator-3.192
man-2.644
 him-2.437
icles-2.386
 birth-2.273
Token big
Token ID1263
Feature activation-1.49e+00

Pos logit contributions
bs+4.666
 loses+3.777
 slips+3.574
 pads+3.571
ata+3.510
Neg logit contributions
ator-4.689
man-4.120
 birth-3.193
umm-3.094
omm-3.086
Token box
Token ID3091
Feature activation-7.37e-01

Pos logit contributions
bs+2.963
 loses+2.266
ata+2.248
 slips+2.199
 pads+2.182
Neg logit contributions
ator-3.004
man-2.448
umm-2.132
icles-2.012
omm-1.978
Token and
Token ID290
Feature activation-1.51e+00

Pos logit contributions
bs+1.251
 slips+0.937
 loses+0.909
 net+0.900
 knot+0.885
Neg logit contributions
ator-1.721
man-1.412
umm-1.247
icles-1.212
esh-1.162
Token asked
Token ID1965
Feature activation+1.34e-01

Pos logit contributions
bs+2.650
 slips+2.054
 net+2.015
 loses+1.981
ching+1.945
Neg logit contributions
ator-3.302
man-2.723
 birth-2.378
icles-2.359
 him-2.341
Token him
Token ID683
Feature activation-9.54e-01

Pos logit contributions
ator+0.267
man+0.199
umm+0.182
icles+0.175
ators+0.170
Neg logit contributions
bs-0.276
 net-0.216
 slips-0.213
 knot-0.209
 loop-0.206