TOP ACTIVATIONS
MAX = 5.071

- second lead . When Bolton only semi - cle ared
? No more bad news pl s . Good updates ?
who holds dual British and Pakistani citizenship , said of the
that weren 't explained as well as they could have been
official statement soon . As well as updates regarding the future
It got worse for Bolton and even better for Bent
month . Against Sweden in June , they almost
. After the Blackhawks traded him to the Th
Meanwhile on Wednesday , a Swedish appeals court upheld a court
Width , $ SR . Height ) $ image = [
, Arsenal , Liverpool and Tottenham Hots pur stating the
AL EM Israeli and Turkish officials have reached a preliminary
1 - 0 loss to Slovakia with head coach Tom Moh
telling us .' Scroll down for video
Width ), ($ SR . Height ))) ) } } Catch
, as suggested by the Bulgarian ambassador . People
x $ ($ SR . Height )" - Ver b ose
($ s z 1 . Height ), $ SR . Width
charge at Old Trafford against Swansea on Sunday by logging on
have competed alongside Chelsea and Tottenham Hots pur for the Premier

MIN ACTIVATIONS
MIN = -4.860

community . C ue ll ar also took a
ops hatch lings , six months for the larger Hyp ac
York s G ug gen heim Museum , he
100 . G err ard s 100
the countdown clock . Five minutes remaining . Then four minutes
6 . 5 ), K ug el in Feldman , K
showing the countdown . Fifty minutes . An advisor
Super bowl - Italian St ere otypes - Ret ards -
el in Feldman , K ug el , and Schiff man
Encyclopedia , Church Fathers , Sum ma , Bible and more
Life of Christ . Sac ram ents . Bapt ism .
N ug ent , who has been
protest featured speakers including M ere tz City Council woman Dr
a broad desert ( the poems don t argue
without using the International Ph on etic Alphabet ,
born Farmer on " Fr ances Farmer Will Have Her Revenge
layers to that . St ere otypes aren t
and that he will be impe ached , she
On social media , Michael G . Flynn has shared
of his political career , Bernie s current Millenn

INTERVAL 2.588 - 5.071
CONTAINS 0.731%

A system that ensures no Canadian patient is left unable to
doing an album with each band . So , there are
answer for Trump s many mistakes , ignoring the
by forces from Kenya and Ethiopia , who engaged in heavy
res ize ( image , height = 600 ) gray =

INTERVAL 0.106 - 2.588
CONTAINS 44.300%

is the role played by state supreme court justices in the
used to show the extent of the rubbish The
patrick _ 12 , writes . " He had ' F
1 a 1 . Sw immer means to keep fit
and do not necessarily reflect the views of Comcast . <|endoftext|>

INTERVAL -2.377 - 0.106
CONTAINS 54.149%

become increasingly worried about in the last twelve months :
- 1985 - 2 Timothy Dalton films , 1987 - 1989
my church . I was constantly told it was unacceptable to
, then raised the glass . Thank God for
. So CA must = rough ly

INTERVAL -4.860 - -2.377
CONTAINS 0.820%

se aux f é d é ra ux d ans la
Gen etics [ edit ] Ep ider
and many photos of m oom in characters models .
Mother of the Year in 2010 at
U . S .- H ait i relations . U .
Token-
Token ID12
Feature activation+6.73e-01

Pos logit contributions
apper+1.731
 lid+1.617
 Pixie+1.597
PDATE+1.591
 accents+1.466
Neg logit contributions
-1.464
give-1.297
hon-1.279
VICE-1.246
udo-1.242
Tokensecond
Token ID12227
Feature activation-8.21e-02

Pos logit contributions
apper+1.212
 lid+1.147
 Pixie+1.128
PDATE+1.113
 accents+1.031
Neg logit contributions
-1.100
give-0.985
hon-0.973
VICE-0.941
udo-0.935
Token lead
Token ID1085
Feature activation+2.41e+00

Pos logit contributions
+0.162
give+0.149
hon+0.146
 decree+0.144
 affirm+0.143
Neg logit contributions
apper-0.121
PDATE-0.109
 Pixie-0.108
 lid-0.107
ANY-0.097
Token.
Token ID13
Feature activation+6.40e-01

Pos logit contributions
apper+4.112
 lid+3.892
 Pixie+3.783
PDATE+3.678
 Deluxe+3.482
Neg logit contributions
-4.108
give-3.662
hon-3.640
VICE-3.558
udo-3.537
Token When
Token ID1649
Feature activation+6.72e-01

Pos logit contributions
apper+0.970
 lid+0.907
PDATE+0.879
 Pixie+0.865
 accents+0.786
Neg logit contributions
-1.236
give-1.126
hon-1.109
VICE-1.088
udo-1.086
Token Bolton
Token ID35293
Feature activation+5.07e+00

Pos logit contributions
apper+1.127
 lid+1.050
PDATE+1.029
 Pixie+1.027
 Deluxe+0.942
Neg logit contributions
-1.180
give-1.067
hon-1.049
VICE-1.025
udo-1.021
Token only
Token ID691
Feature activation+3.08e-01

Pos logit contributions
apper+8.372
 lid+8.192
 Pixie+7.714
 Hots+7.297
 accents+7.243
Neg logit contributions
-8.343
VICE-7.315
give-7.264
hon-7.212
ograms-6.981
Token semi
Token ID10663
Feature activation+2.38e+00

Pos logit contributions
apper+0.541
 lid+0.508
 Pixie+0.501
PDATE+0.497
 Deluxe+0.458
Neg logit contributions
-0.514
give-0.462
hon-0.455
udo-0.442
VICE-0.442
Token-
Token ID12
Feature activation+3.54e-01

Pos logit contributions
apper+4.512
 lid+4.267
 Pixie+4.180
PDATE+4.115
 Deluxe+3.867
Neg logit contributions
-3.639
give-3.216
hon-3.170
VICE-3.088
udo-3.059
Tokencle
Token ID2375
Feature activation+7.99e-02

Pos logit contributions
apper+0.700
 lid+0.671
 Pixie+0.660
PDATE+0.654
 accents+0.609
Neg logit contributions
-0.512
give-0.456
hon-0.453
VICE-0.429
udo-0.427
Tokenared
Token ID1144
Feature activation+4.48e-01

Pos logit contributions
apper+0.173
 lid+0.167
PDATE+0.166
 Pixie+0.165
 Deluxe+0.153
Neg logit contributions
-0.099
give-0.087
hon-0.086
udo-0.082
VICE-0.082
Token?
Token ID30
Feature activation+1.73e+00

Pos logit contributions
apper+2.006
 lid+1.921
 Pixie+1.857
 Deluxe+1.720
 accents+1.676
Neg logit contributions
-2.377
hon-2.054
VICE-2.016
thood-2.013
udo-2.009
Token No
Token ID1400
Feature activation+9.80e-01

Pos logit contributions
apper+2.282
 Pixie+2.166
 lid+2.124
 Deluxe+1.946
 Temperature+1.882
Neg logit contributions
-3.613
hon-3.224
give-3.224
VICE-3.188
 decree-3.179
Token more
Token ID517
Feature activation+1.34e+00

Pos logit contributions
apper+1.621
 lid+1.540
 Pixie+1.462
 Deluxe+1.361
 accents+1.355
Neg logit contributions
-1.772
hon-1.558
give-1.542
udo-1.520
VICE-1.519
Token bad
Token ID2089
Feature activation+2.55e+00

Pos logit contributions
apper+2.305
 lid+2.220
 Pixie+2.092
 Deluxe+1.993
 accents+1.955
Neg logit contributions
-2.348
thood-1.994
hon-1.992
give-1.985
VICE-1.966
Token news
Token ID1705
Feature activation+2.20e+00

Pos logit contributions
apper+2.471
 lid+2.412
 Pixie+2.095
 accents+2.023
 Deluxe+1.923
Neg logit contributions
-6.324
thood-5.640
give-5.495
hon-5.476
lyak-5.472
Token pl
Token ID458
Feature activation+4.73e+00

Pos logit contributions
apper+2.962
 lid+2.851
 Pixie+2.668
 accents+2.462
 Deluxe+2.399
Neg logit contributions
-4.568
hon-4.099
VICE-4.091
udo-4.025
give-3.990
Tokens
Token ID82
Feature activation+3.10e+00

Pos logit contributions
apper+6.158
 lid+5.180
 Pixie+5.002
 Deluxe+4.939
 accents+4.839
Neg logit contributions
-9.532
 decree-8.876
VICE-8.406
 affirm-8.328
lyak-8.204
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
apper+4.197
 lid+3.798
 Pixie+3.733
 Deluxe+3.592
ANY+3.454
Neg logit contributions
-6.251
 decree-5.560
thood-5.498
gments-5.438
 affirm-5.412
Token Good
Token ID4599
Feature activation+1.55e+00

Pos logit contributions
apper+2.541
 lid+2.396
 Pixie+2.365
PDATE+2.244
 Deluxe+2.179
Neg logit contributions
-2.629
give-2.326
hon-2.303
VICE-2.256
udo-2.252
Token updates
Token ID5992
Feature activation+1.45e+00

Pos logit contributions
apper+1.870
 lid+1.761
 Pixie+1.616
 accents+1.515
PDATE+1.514
Neg logit contributions
-3.480
give-3.119
hon-3.112
udo-3.068
thood-3.060
Token?
Token ID30
Feature activation+1.80e+00

Pos logit contributions
apper+2.120
 lid+2.011
 Pixie+1.933
 Deluxe+1.843
ANY+1.782
Neg logit contributions
-2.808
 decree-2.471
hon-2.442
thood-2.430
udo-2.424
Token who
Token ID508
Feature activation+1.04e+00

Pos logit contributions
apper+1.978
 lid+1.861
 Pixie+1.823
PDATE+1.792
 Deluxe+1.671
Neg logit contributions
-1.915
give-1.713
hon-1.695
VICE-1.650
udo-1.640
Token holds
Token ID6622
Feature activation+2.17e+00

Pos logit contributions
apper+1.745
 lid+1.633
 Pixie+1.604
PDATE+1.588
ANY+1.461
Neg logit contributions
-1.813
give-1.629
hon-1.613
VICE-1.569
 affirm-1.562
Token dual
Token ID10668
Feature activation+9.45e-01

Pos logit contributions
apper+3.515
 lid+3.302
 Pixie+3.223
PDATE+3.146
 Deluxe+2.933
Neg logit contributions
-3.935
give-3.549
hon-3.513
VICE-3.424
udo-3.412
Token British
Token ID3517
Feature activation+2.77e+00

Pos logit contributions
apper+1.591
 lid+1.486
 Pixie+1.460
PDATE+1.448
 Deluxe+1.325
Neg logit contributions
-1.656
give-1.489
hon-1.483
VICE-1.439
 decree-1.432
Token and
Token ID290
Feature activation+1.31e-01

Pos logit contributions
apper+5.137
 lid+4.885
 Pixie+4.769
PDATE+4.644
 accents+4.465
Neg logit contributions
-4.304
give-3.803
hon-3.727
VICE-3.653
udo-3.624
Token Pakistani
Token ID17807
Feature activation+4.72e+00

Pos logit contributions
apper+0.226
 lid+0.211
 Pixie+0.208
PDATE+0.207
 Deluxe+0.190
Neg logit contributions
-0.221
give-0.199
hon-0.198
VICE-0.192
 decree-0.191
Token citizenship
Token ID14218
Feature activation+2.82e-01

Pos logit contributions
apper+7.492
 lid+7.227
 Pixie+6.851
 accents+6.795
 Deluxe+6.423
Neg logit contributions
-8.103
give-7.184
udo-6.961
VICE-6.939
hon-6.921
Token,
Token ID11
Feature activation+5.48e-01

Pos logit contributions
apper+0.557
 lid+0.522
 Pixie+0.514
PDATE+0.514
 Deluxe+0.479
Neg logit contributions
-0.415
give-0.366
hon-0.363
 decree-0.351
 affirm-0.349
Token said
Token ID531
Feature activation+7.66e-02

Pos logit contributions
apper+1.127
 lid+1.067
 Pixie+1.054
PDATE+1.046
 Deluxe+0.978
Neg logit contributions
-0.754
give-0.656
hon-0.649
 affirm-0.623
VICE-0.623
Token of
Token ID286
Feature activation+9.41e-01

Pos logit contributions
apper+0.137
 lid+0.127
PDATE+0.126
 Pixie+0.126
 Deluxe+0.116
Neg logit contributions
-0.126
give-0.114
hon-0.112
 decree-0.109
 affirm-0.109
Token the
Token ID262
Feature activation+1.93e-01

Pos logit contributions
apper+1.638
 lid+1.528
 Pixie+1.508
PDATE+1.493
 Deluxe+1.380
Neg logit contributions
-1.601
give-1.440
hon-1.420
VICE-1.378
 decree-1.377
Token that
Token ID326
Feature activation+3.87e-01

Pos logit contributions
+2.826
hon+2.552
give+2.527
 affirm+2.456
telling+2.456
Neg logit contributions
apper-4.026
PDATE-3.856
 lid-3.736
 Pixie-3.722
ANY-3.453
Token weren
Token ID6304
Feature activation+1.51e+00

Pos logit contributions
apper+0.717
 lid+0.675
 Pixie+0.662
PDATE+0.658
 Deluxe+0.609
Neg logit contributions
-0.610
give-0.546
hon-0.538
VICE-0.522
udo-0.521
Token't
Token ID470
Feature activation+9.84e-01

Pos logit contributions
apper+2.344
 lid+2.127
 Pixie+2.072
 Deluxe+1.979
ANY+1.944
Neg logit contributions
-2.830
 decree-2.507
hon-2.490
give-2.472
 affirm-2.452
Token explained
Token ID4893
Feature activation+2.62e-01

Pos logit contributions
apper+1.783
 lid+1.677
 Pixie+1.645
PDATE+1.632
 Deluxe+1.511
Neg logit contributions
-1.594
give-1.427
hon-1.409
VICE-1.367
udo-1.359
Token as
Token ID355
Feature activation+2.51e+00

Pos logit contributions
apper+0.469
 lid+0.438
PDATE+0.438
 Pixie+0.433
ANY+0.398
Neg logit contributions
-0.426
give-0.388
hon-0.380
 affirm-0.374
 decree-0.367
Token well
Token ID880
Feature activation+4.72e+00

Pos logit contributions
apper+4.129
 lid+3.873
 Pixie+3.787
PDATE+3.661
 Deluxe+3.460
Neg logit contributions
-4.495
give-3.981
hon-3.973
VICE-3.878
udo-3.871
Token as
Token ID355
Feature activation+1.89e+00

Pos logit contributions
apper+7.677
 lid+7.354
 Pixie+7.121
PDATE+6.705
 accents+6.625
Neg logit contributions
-7.963
hon-6.945
give-6.945
VICE-6.853
udo-6.829
Token they
Token ID484
Feature activation+1.83e+00

Pos logit contributions
apper+3.445
 lid+3.254
 Pixie+3.190
PDATE+3.123
 Deluxe+2.939
Neg logit contributions
-3.041
give-2.697
hon-2.669
VICE-2.597
udo-2.580
Token could
Token ID714
Feature activation+2.36e+00

Pos logit contributions
apper+2.988
 lid+2.792
 Pixie+2.719
PDATE+2.569
 Deluxe+2.500
Neg logit contributions
-3.294
give-2.919
hon-2.905
VICE-2.842
ograms-2.820
Token have
Token ID423
Feature activation+1.59e+00

Pos logit contributions
apper+4.485
 lid+4.255
 Pixie+4.162
PDATE+4.028
 Deluxe+3.865
Neg logit contributions
-3.605
give-3.150
hon-3.132
VICE-3.045
udo-3.023
Token been
Token ID587
Feature activation+1.53e+00

Pos logit contributions
apper+2.668
 lid+2.497
 Pixie+2.446
PDATE+2.434
 Deluxe+2.227
Neg logit contributions
-2.794
give-2.520
hon-2.492
VICE-2.424
udo-2.410
Token official
Token ID1743
Feature activation+5.19e-01

Pos logit contributions
apper+1.253
 lid+1.099
 Pixie+0.921
 Deluxe+0.816
 accents+0.741
Neg logit contributions
-5.270
give-4.829
VICE-4.785
hon-4.783
gments-4.740
Token statement
Token ID2643
Feature activation+1.35e+00

Pos logit contributions
apper+0.764
 lid+0.735
 Pixie+0.689
 Deluxe+0.631
PDATE+0.626
Neg logit contributions
-1.027
give-0.908
hon-0.908
udo-0.898
thood-0.892
Token soon
Token ID2582
Feature activation+1.71e+00

Pos logit contributions
apper+1.813
 lid+1.738
 Pixie+1.629
 Deluxe+1.494
 accents+1.454
Neg logit contributions
-2.831
give-2.525
ograms-2.487
udo-2.474
hon-2.465
Token.
Token ID13
Feature activation+2.46e+00

Pos logit contributions
apper+2.475
 lid+2.270
 Pixie+2.214
 Deluxe+2.107
ANY+2.073
Neg logit contributions
-3.357
 decree-2.947
thood-2.902
 affirm-2.885
hon-2.885
Token As
Token ID1081
Feature activation+3.64e+00

Pos logit contributions
apper+3.287
 Pixie+3.062
 lid+2.969
 Deluxe+2.785
 Temperature+2.692
Neg logit contributions
-4.992
give-4.524
VICE-4.475
 decree-4.473
hon-4.456
Token well
Token ID880
Feature activation+4.72e+00

Pos logit contributions
apper+3.977
 lid+3.901
 Pixie+3.278
 Deluxe+3.232
 accents+3.214
Neg logit contributions
-8.330
ograms-7.341
udo-7.315
thood-7.236
gments-7.197
Token as
Token ID355
Feature activation+3.19e+00

Pos logit contributions
apper+6.165
 lid+5.988
 Deluxe+5.218
 Pixie+5.177
 accents+5.159
Neg logit contributions
-9.460
VICE-8.226
thood-8.127
udo-8.039
ograms-7.928
Token updates
Token ID5992
Feature activation+2.55e+00

Pos logit contributions
apper+3.388
 lid+3.230
 Pixie+2.909
 accents+2.768
 Deluxe+2.661
Neg logit contributions
-7.500
hon-6.702
give-6.646
udo-6.611
thood-6.593
Token regarding
Token ID5115
Feature activation+2.82e+00

Pos logit contributions
apper+3.468
 lid+3.432
 Pixie+3.146
 Deluxe+2.937
 accents+2.933
Neg logit contributions
-5.085
give-4.603
thood-4.525
hon-4.506
VICE-4.505
Token the
Token ID262
Feature activation+2.79e+00

Pos logit contributions
apper+3.853
 lid+3.661
 Pixie+3.496
 Deluxe+3.322
 accents+3.142
Neg logit contributions
-5.710
ograms-4.973
hon-4.970
thood-4.966
udo-4.911
Token future
Token ID2003
Feature activation+2.53e+00

Pos logit contributions
apper+2.563
 lid+2.449
 Pixie+2.162
 Deluxe+2.046
 accents+1.843
Neg logit contributions
-7.127
ograms-6.238
udo-6.144
thood-6.143
hon-6.140
Token
Token ID198
Feature activation+1.76e-01

Pos logit contributions
apper+0.466
 lid+0.424
 Pixie+0.411
 Deluxe+0.378
ANY+0.369
Neg logit contributions
-0.784
give-0.701
hon-0.699
 decree-0.693
udo-0.689
TokenIt
Token ID1026
Feature activation+1.93e+00

Pos logit contributions
apper+0.293
 lid+0.277
 Pixie+0.270
PDATE+0.267
 Deluxe+0.246
Neg logit contributions
-0.311
give-0.281
hon-0.278
VICE-0.272
udo-0.269
Token got
Token ID1392
Feature activation+1.79e+00

Pos logit contributions
apper+2.959
 lid+2.793
 Pixie+2.699
PDATE+2.574
 accents+2.465
Neg logit contributions
-3.632
give-3.268
hon-3.235
VICE-3.186
udo-3.150
Token worse
Token ID4785
Feature activation+7.93e-01

Pos logit contributions
apper+3.270
 lid+3.137
 Pixie+3.051
PDATE+2.922
 accents+2.814
Neg logit contributions
-2.850
hon-2.496
give-2.488
VICE-2.410
udo-2.378
Token for
Token ID329
Feature activation+8.79e-01

Pos logit contributions
apper+1.412
 lid+1.325
 Pixie+1.301
PDATE+1.292
 Deluxe+1.194
Neg logit contributions
-1.309
give-1.172
hon-1.159
VICE-1.127
 affirm-1.119
Token Bolton
Token ID35293
Feature activation+4.69e+00

Pos logit contributions
apper+1.306
 lid+1.206
PDATE+1.176
 Pixie+1.174
 Deluxe+1.060
Neg logit contributions
-1.715
give-1.569
hon-1.543
VICE-1.512
udo-1.505
Token and
Token ID290
Feature activation+5.77e-01

Pos logit contributions
apper+7.491
 lid+7.190
 Pixie+6.968
 accents+6.411
PDATE+6.368
Neg logit contributions
-8.252
give-7.201
hon-7.145
VICE-7.133
udo-6.999
Token even
Token ID772
Feature activation+2.07e+00

Pos logit contributions
apper+0.980
 lid+0.916
PDATE+0.894
 Pixie+0.894
 Deluxe+0.820
Neg logit contributions
-0.997
give-0.902
hon-0.887
VICE-0.865
udo-0.862
Token better
Token ID1365
Feature activation+1.90e+00

Pos logit contributions
apper+3.543
 lid+3.347
 Pixie+3.278
PDATE+3.167
 Deluxe+3.000
Neg logit contributions
-3.558
hon-3.147
give-3.145
VICE-3.056
udo-3.035
Token for
Token ID329
Feature activation+5.55e-01

Pos logit contributions
apper+3.170
 lid+2.987
 Pixie+2.912
PDATE+2.829
 Deluxe+2.663
Neg logit contributions
-3.354
give-3.003
hon-2.974
VICE-2.896
udo-2.892
Token Bent
Token ID20421
Feature activation+2.70e+00

Pos logit contributions
apper+0.847
 lid+0.781
PDATE+0.773
 Pixie+0.758
 Deluxe+0.691
Neg logit contributions
-1.055
give-0.968
hon-0.947
VICE-0.930
udo-0.928
Token month
Token ID1227
Feature activation-5.83e-01

Pos logit contributions
+0.199
give+0.178
hon+0.173
 affirm+0.169
 decree+0.168
Neg logit contributions
apper-0.322
PDATE-0.306
 lid-0.304
 Pixie-0.300
ANY-0.283
Token.
Token ID13
Feature activation+3.62e-01

Pos logit contributions
+0.926
give+0.830
hon+0.823
udo+0.801
 affirm+0.799
Neg logit contributions
apper-1.055
 lid-0.994
 Pixie-0.986
PDATE-0.984
ANY-0.903
Token
Token ID198
Feature activation-1.68e-01

Pos logit contributions
apper+0.557
 lid+0.520
PDATE+0.504
 Pixie+0.499
 accents+0.454
Neg logit contributions
-0.688
give-0.630
hon-0.620
VICE-0.607
udo-0.607
Token
Token ID198
Feature activation-3.19e-01

Pos logit contributions
+0.351
give+0.317
hon+0.315
VICE+0.309
 decree+0.309
Neg logit contributions
apper-0.228
 lid-0.209
 Pixie-0.204
PDATE-0.190
 Deluxe-0.185
TokenAgainst
Token ID39276
Feature activation+1.94e-02

Pos logit contributions
+0.549
give+0.499
hon+0.492
VICE+0.482
udo+0.475
Neg logit contributions
apper-0.546
 lid-0.520
 Pixie-0.508
PDATE-0.504
 accents-0.462
Token Sweden
Token ID10710
Feature activation+4.63e+00

Pos logit contributions
apper+0.033
 lid+0.031
PDATE+0.031
 Pixie+0.030
ANY+0.028
Neg logit contributions
-0.033
give-0.031
hon-0.030
 affirm-0.029
VICE-0.029
Token in
Token ID287
Feature activation+8.97e-01

Pos logit contributions
apper+7.429
 lid+7.072
 Pixie+6.882
PDATE+6.371
 Deluxe+6.360
Neg logit contributions
-8.072
give-7.125
hon-7.095
VICE-6.998
thood-6.905
Token June
Token ID2795
Feature activation-1.21e+00

Pos logit contributions
apper+1.578
 lid+1.478
PDATE+1.447
 Pixie+1.447
ANY+1.330
Neg logit contributions
-1.499
give-1.353
hon-1.333
VICE-1.292
 decree-1.289
Token,
Token ID11
Feature activation-2.15e-01

Pos logit contributions
+1.778
give+1.601
hon+1.587
udo+1.540
 decree+1.523
Neg logit contributions
apper-2.311
PDATE-2.166
 lid-2.155
 Pixie-2.120
ANY-1.990
Token they
Token ID484
Feature activation+3.63e-01

Pos logit contributions
+0.370
give+0.338
hon+0.332
VICE+0.325
udo+0.324
Neg logit contributions
apper-0.362
 lid-0.337
PDATE-0.332
 Pixie-0.330
ANY-0.303
Token almost
Token ID2048
Feature activation-6.26e-01

Pos logit contributions
apper+0.640
 lid+0.602
 Pixie+0.591
PDATE+0.586
 Deluxe+0.541
Neg logit contributions
-0.606
give-0.543
hon-0.537
VICE-0.521
udo-0.520
Token.
Token ID13
Feature activation+5.15e-01

Pos logit contributions
apper+4.033
 lid+3.808
 Pixie+3.718
PDATE+3.610
 Deluxe+3.412
Neg logit contributions
-4.028
give-3.587
hon-3.542
VICE-3.480
udo-3.463
Token
Token ID198
Feature activation+2.22e-01

Pos logit contributions
apper+0.802
 lid+0.755
PDATE+0.731
 Pixie+0.719
 accents+0.658
Neg logit contributions
-0.972
give-0.882
hon-0.877
VICE-0.852
udo-0.849
Token
Token ID198
Feature activation+1.47e-01

Pos logit contributions
apper+0.289
 lid+0.263
 Pixie+0.256
 Deluxe+0.234
ANY+0.229
Neg logit contributions
-0.475
give-0.425
hon-0.424
 decree-0.419
udo-0.417
TokenAfter
Token ID3260
Feature activation+1.92e+00

Pos logit contributions
apper+0.272
 lid+0.261
 Pixie+0.255
PDATE+0.254
 accents+0.234
Neg logit contributions
-0.232
give-0.208
hon-0.206
VICE-0.201
udo-0.196
Token the
Token ID262
Feature activation+1.56e+00

Pos logit contributions
apper+3.289
 lid+3.108
 Pixie+3.030
PDATE+2.927
 Deluxe+2.775
Neg logit contributions
-3.273
give-2.905
hon-2.879
VICE-2.822
udo-2.810
Token Blackhawks
Token ID35663
Feature activation+4.63e+00

Pos logit contributions
apper+2.733
 lid+2.574
 Pixie+2.519
PDATE+2.474
 Deluxe+2.312
Neg logit contributions
-2.614
give-2.336
hon-2.310
VICE-2.249
udo-2.236
Token traded
Token ID14018
Feature activation+1.14e+00

Pos logit contributions
apper+7.875
 lid+7.616
 Pixie+7.264
 Deluxe+6.777
 Hots+6.714
Neg logit contributions
-7.482
VICE-6.397
give-6.382
hon-6.320
ograms-6.278
Token him
Token ID683
Feature activation+4.62e-01

Pos logit contributions
apper+1.899
 lid+1.777
 Pixie+1.737
PDATE+1.721
 Deluxe+1.588
Neg logit contributions
-2.004
give-1.804
hon-1.789
VICE-1.738
udo-1.730
Token to
Token ID284
Feature activation+1.16e-01

Pos logit contributions
apper+0.851
 lid+0.800
 Pixie+0.788
PDATE+0.786
ANY+0.726
Neg logit contributions
-0.732
give-0.653
hon-0.649
VICE-0.625
 affirm-0.624
Token the
Token ID262
Feature activation+5.59e-01

Pos logit contributions
apper+0.201
 lid+0.188
PDATE+0.185
 Pixie+0.182
ANY+0.168
Neg logit contributions
-0.197
give-0.179
hon-0.176
VICE-0.170
 affirm-0.170
Token Th
Token ID536
Feature activation+9.27e-01

Pos logit contributions
apper+0.814
 lid+0.751
PDATE+0.739
 Pixie+0.732
ANY+0.660
Neg logit contributions
-1.099
give-1.015
hon-1.004
VICE-0.971
 affirm-0.968
TokenMeanwhile
Token ID10294
Feature activation+2.26e+00

Pos logit contributions
apper+1.252
 lid+1.194
 Pixie+1.172
PDATE+1.159
 accents+1.072
Neg logit contributions
-1.105
give-0.990
hon-0.980
VICE-0.958
udo-0.942
Token on
Token ID319
Feature activation-1.44e-02

Pos logit contributions
apper+4.140
 lid+3.906
 Pixie+3.830
PDATE+3.769
 Deluxe+3.527
Neg logit contributions
-3.615
give-3.214
hon-3.178
VICE-3.088
udo-3.068
Token Wednesday
Token ID3583
Feature activation-3.82e-01

Pos logit contributions
+0.024
give+0.021
hon+0.021
VICE+0.020
 decree+0.020
Neg logit contributions
apper-0.026
 lid-0.024
PDATE-0.024
 Pixie-0.024
 Deluxe-0.022
Token,
Token ID11
Feature activation+5.23e-01

Pos logit contributions
+0.571
give+0.508
hon+0.504
 decree+0.497
 affirm+0.495
Neg logit contributions
apper-0.725
PDATE-0.685
 lid-0.679
 Pixie-0.676
ANY-0.628
Token a
Token ID257
Feature activation+9.79e-01

Pos logit contributions
apper+0.886
 lid+0.829
 Pixie+0.817
PDATE+0.813
ANY+0.744
Neg logit contributions
-0.904
give-0.817
hon-0.806
VICE-0.788
 decree-0.784
Token Swedish
Token ID14023
Feature activation+4.56e+00

Pos logit contributions
apper+1.441
 lid+1.341
 Pixie+1.307
PDATE+1.278
 Deluxe+1.177
Neg logit contributions
-1.921
give-1.746
hon-1.730
VICE-1.691
udo-1.683
Token appeals
Token ID15192
Feature activation+1.37e-01

Pos logit contributions
apper+7.044
 lid+6.839
 Pixie+6.311
 accents+5.976
 Deluxe+5.889
Neg logit contributions
-8.291
give-7.189
hon-7.172
VICE-7.110
udo-7.049
Token court
Token ID2184
Feature activation+6.53e-01

Pos logit contributions
apper+0.275
 lid+0.260
 Pixie+0.260
PDATE+0.256
 Deluxe+0.240
Neg logit contributions
-0.193
give-0.171
hon-0.168
 decree-0.168
 affirm-0.163
Token upheld
Token ID24816
Feature activation+1.61e-01

Pos logit contributions
apper+1.224
 lid+1.151
 Pixie+1.136
PDATE+1.128
 Deluxe+1.046
Neg logit contributions
-1.011
give-0.906
hon-0.891
 decree-0.870
 affirm-0.864
Token a
Token ID257
Feature activation+1.39e+00

Pos logit contributions
apper+0.269
PDATE+0.249
 lid+0.248
 Pixie+0.247
 Deluxe+0.225
Neg logit contributions
-0.277
give-0.256
hon-0.251
 decree-0.249
 affirm-0.246
Token court
Token ID2184
Feature activation+2.89e-01

Pos logit contributions
apper+2.232
 lid+2.110
 Pixie+2.049
PDATE+1.984
 Deluxe+1.870
Neg logit contributions
-2.537
give-2.270
hon-2.256
VICE-2.200
udo-2.191
TokenWidth
Token ID30916
Feature activation+2.85e+00

Pos logit contributions
apper+1.016
 lid+0.954
 Pixie+0.924
PDATE+0.917
 Deluxe+0.829
Neg logit contributions
-1.419
give-1.294
hon-1.281
VICE-1.257
udo-1.245
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
apper+4.850
 lid+4.552
 Pixie+4.455
PDATE+4.328
 Deluxe+4.075
Neg logit contributions
-4.859
give-4.351
hon-4.320
VICE-4.197
udo-4.173
Token$
Token ID3
Feature activation-1.45e-01

Pos logit contributions
apper+2.182
 lid+2.038
 Pixie+1.982
PDATE+1.954
 accents+1.796
Neg logit contributions
-2.715
give-2.457
hon-2.414
VICE-2.384
udo-2.356
TokenSR
Token ID12562
Feature activation+5.89e-01

Pos logit contributions
+0.275
give+0.248
hon+0.246
VICE+0.243
udo+0.239
Neg logit contributions
apper-0.224
 lid-0.213
 Pixie-0.210
PDATE-0.208
 accents-0.190
Token.
Token ID13
Feature activation+8.62e-01

Pos logit contributions
apper+0.965
 lid+0.885
 Pixie+0.870
 Deluxe+0.814
ANY+0.807
Neg logit contributions
-1.056
give-0.938
hon-0.933
 decree-0.930
 affirm-0.920
TokenHeight
Token ID23106
Feature activation+4.54e+00

Pos logit contributions
apper+1.114
 lid+1.031
 Pixie+0.997
PDATE+0.979
 Deluxe+0.882
Neg logit contributions
-1.848
give-1.695
hon-1.680
VICE-1.649
udo-1.637
Token)
Token ID8
Feature activation+1.88e+00

Pos logit contributions
apper+6.674
 lid+6.189
 Pixie+6.043
PDATE+5.782
 accents+5.458
Neg logit contributions
-8.432
give-7.621
hon-7.586
thood-7.414
lyak-7.401
Token $
Token ID720
Feature activation-6.20e-01

Pos logit contributions
apper+3.028
 lid+2.828
 Pixie+2.764
PDATE+2.729
 Deluxe+2.509
Neg logit contributions
-3.442
give-3.115
hon-3.073
VICE-2.999
udo-2.995
Tokenimage
Token ID9060
Feature activation-1.57e-01

Pos logit contributions
+1.054
give+0.939
hon+0.932
VICE+0.910
udo+0.902
Neg logit contributions
apper-1.064
 lid-1.027
PDATE-1.002
 Pixie-1.000
 accents-0.918
Token =
Token ID796
Feature activation+9.19e-02

Pos logit contributions
+0.241
give+0.215
hon+0.209
VICE+0.208
udo+0.205
Neg logit contributions
apper-0.294
 lid-0.282
 Pixie-0.274
PDATE-0.273
 accents-0.256
Token [
Token ID685
Feature activation-9.60e-01

Pos logit contributions
apper+0.161
 lid+0.149
 Pixie+0.147
PDATE+0.146
 accents+0.135
Neg logit contributions
-0.155
give-0.139
hon-0.133
 affirm-0.133
 decree-0.133
Token,
Token ID11
Feature activation-2.19e-01

Pos logit contributions
apper+2.376
 lid+2.237
 Pixie+2.201
PDATE+2.183
 Deluxe+2.026
Neg logit contributions
-1.992
give-1.766
hon-1.743
VICE-1.685
udo-1.681
Token Arsenal
Token ID13837
Feature activation+1.28e+00

Pos logit contributions
+0.437
give+0.408
hon+0.398
VICE+0.389
 affirm+0.388
Neg logit contributions
apper-0.311
PDATE-0.285
 lid-0.285
 Pixie-0.274
ANY-0.249
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
apper+2.477
 lid+2.331
 Pixie+2.292
PDATE+2.272
 Deluxe+2.117
Neg logit contributions
-1.921
give-1.700
hon-1.679
udo-1.617
VICE-1.605
Token Liverpool
Token ID11761
Feature activation+3.16e+00

Pos logit contributions
apper+0.151
 lid+0.140
PDATE+0.138
 Pixie+0.135
ANY+0.122
Neg logit contributions
-0.198
give-0.183
hon-0.180
udo-0.175
VICE-0.175
Token and
Token ID290
Feature activation-2.26e-01

Pos logit contributions
apper+5.968
 lid+5.672
 Pixie+5.541
PDATE+5.442
 Deluxe+5.133
Neg logit contributions
-4.836
give-4.264
hon-4.219
VICE-4.113
udo-4.069
Token Tottenham
Token ID24272
Feature activation+4.52e+00

Pos logit contributions
+0.432
give+0.405
hon+0.393
udo+0.386
 affirm+0.385
Neg logit contributions
apper-0.338
 lid-0.310
PDATE-0.310
 Pixie-0.300
ANY-0.273
Token Hots
Token ID41239
Feature activation+2.84e+00

Pos logit contributions
apper+7.077
 lid+6.686
 Pixie+6.451
PDATE+6.222
 Hots+6.007
Neg logit contributions
-8.127
give-7.276
hon-7.220
VICE-7.196
udo-7.051
Tokenpur
Token ID14225
Feature activation+1.70e+00

Pos logit contributions
apper+4.160
 lid+3.805
 Pixie+3.727
 Hots+3.462
 Deluxe+3.372
Neg logit contributions
-5.581
VICE-4.923
give-4.913
 decree-4.863
hon-4.829
Token
Token ID851
Feature activation-2.15e-01

Pos logit contributions
apper+2.978
 lid+2.790
 Pixie+2.741
PDATE+2.709
 Deluxe+2.509
Neg logit contributions
-2.871
give-2.573
hon-2.544
VICE-2.459
udo-2.453
Token stating
Token ID12316
Feature activation+4.53e-01

Pos logit contributions
+0.330
give+0.294
hon+0.289
 decree+0.281
 affirm+0.280
Neg logit contributions
apper-0.406
 lid-0.381
 Pixie-0.377
PDATE-0.376
 Deluxe-0.348
Token the
Token ID262
Feature activation+3.78e-01

Pos logit contributions
apper+0.888
 lid+0.829
PDATE+0.825
 Pixie+0.818
 Deluxe+0.754
Neg logit contributions
-0.669
give-0.600
hon-0.587
 affirm-0.574
 decree-0.565
TokenAL
Token ID1847
Feature activation-7.64e-02

Pos logit contributions
+0.421
give+0.380
hon+0.375
VICE+0.369
 decree+0.365
Neg logit contributions
apper-0.346
 lid-0.323
 Pixie-0.318
PDATE-0.311
 Deluxe-0.286
TokenEM
Token ID3620
Feature activation-1.29e+00

Pos logit contributions
+0.137
give+0.124
hon+0.122
VICE+0.120
udo+0.118
Neg logit contributions
apper-0.126
 lid-0.117
 Pixie-0.115
PDATE-0.112
 Deluxe-0.104
Token
Token ID851
Feature activation+4.76e-01

Pos logit contributions
+2.069
give+1.766
hon+1.762
VICE+1.722
 affirm+1.693
Neg logit contributions
apper-2.398
 lid-2.302
PDATE-2.237
 Pixie-2.235
rawler-2.059
Token Israeli
Token ID6085
Feature activation+8.66e-01

Pos logit contributions
apper+0.768
 lid+0.724
PDATE+0.700
 Pixie+0.697
 accents+0.634
Neg logit contributions
-0.872
give-0.790
hon-0.779
VICE-0.765
udo-0.756
Token and
Token ID290
Feature activation-4.71e-01

Pos logit contributions
apper+1.282
 lid+1.190
 Pixie+1.172
PDATE+1.156
 Deluxe+1.049
Neg logit contributions
-1.684
give-1.541
hon-1.523
VICE-1.493
 decree-1.491
Token Turkish
Token ID9663
Feature activation+4.52e+00

Pos logit contributions
+0.863
give+0.802
hon+0.788
VICE+0.779
 decree+0.779
Neg logit contributions
apper-0.729
PDATE-0.674
 lid-0.672
 Pixie-0.669
 Deluxe-0.601
Token officials
Token ID2828
Feature activation-1.67e-01

Pos logit contributions
apper+6.439
 lid+6.076
 Pixie+5.701
 accents+5.433
PDATE+5.355
Neg logit contributions
-8.744
give-7.713
hon-7.680
udo-7.564
thood-7.535
Token have
Token ID423
Feature activation+3.94e-01

Pos logit contributions
+0.262
give+0.234
hon+0.232
 affirm+0.226
 decree+0.225
Neg logit contributions
apper-0.309
 lid-0.293
PDATE-0.290
 Pixie-0.289
ANY-0.265
Token reached
Token ID4251
Feature activation+9.17e-01

Pos logit contributions
apper+0.691
 lid+0.647
 Pixie+0.638
PDATE+0.638
 Deluxe+0.583
Neg logit contributions
-0.658
give-0.593
hon-0.586
 affirm-0.570
 decree-0.569
Token a
Token ID257
Feature activation+6.12e-01

Pos logit contributions
apper+1.485
 lid+1.380
 Pixie+1.368
PDATE+1.367
ANY+1.228
Neg logit contributions
-1.660
give-1.512
hon-1.490
 decree-1.469
 affirm-1.464
Token preliminary
Token ID15223
Feature activation+1.63e+00

Pos logit contributions
apper+1.039
 lid+0.962
PDATE+0.959
 Pixie+0.956
ANY+0.869
Neg logit contributions
-1.060
give-0.961
hon-0.947
 decree-0.938
 affirm-0.920
Token 1
Token ID352
Feature activation+3.53e-01

Pos logit contributions
apper+3.493
 lid+3.290
 Pixie+3.213
PDATE+3.132
 Deluxe+2.934
Neg logit contributions
-3.697
give-3.312
hon-3.281
VICE-3.203
udo-3.184
Token-
Token ID12
Feature activation+1.05e+00

Pos logit contributions
apper+0.715
 lid+0.679
PDATE+0.670
 Pixie+0.666
 accents+0.621
Neg logit contributions
-0.492
give-0.435
hon-0.432
udo-0.416
VICE-0.416
Token0
Token ID15
Feature activation+7.52e-01

Pos logit contributions
apper+1.516
 lid+1.404
 Pixie+1.364
PDATE+1.326
 Deluxe+1.230
Neg logit contributions
-2.091
give-1.898
hon-1.880
VICE-1.844
udo-1.835
Token loss
Token ID2994
Feature activation+8.14e-01

Pos logit contributions
apper+1.231
 lid+1.128
PDATE+1.126
 Pixie+1.124
ANY+1.017
Neg logit contributions
-1.352
give-1.227
hon-1.214
 decree-1.187
 affirm-1.187
Token to
Token ID284
Feature activation+1.14e+00

Pos logit contributions
apper+1.515
 lid+1.421
PDATE+1.405
 Pixie+1.404
ANY+1.295
Neg logit contributions
-1.277
give-1.147
hon-1.130
 affirm-1.091
VICE-1.089
Token Slovakia
Token ID36839
Feature activation+4.48e+00

Pos logit contributions
apper+1.877
 lid+1.754
 Pixie+1.714
PDATE+1.703
 Deluxe+1.557
Neg logit contributions
-2.030
give-1.838
hon-1.817
VICE-1.770
udo-1.758
Token with
Token ID351
Feature activation+1.87e+00

Pos logit contributions
apper+6.654
 lid+6.275
 Pixie+6.082
 Deluxe+5.646
PDATE+5.617
Neg logit contributions
-8.322
give-7.429
hon-7.354
VICE-7.347
thood-7.269
Token head
Token ID1182
Feature activation+7.60e-01

Pos logit contributions
apper+3.122
 lid+2.930
 Pixie+2.865
PDATE+2.808
 Deluxe+2.615
Neg logit contributions
-3.312
give-2.976
hon-2.946
VICE-2.872
udo-2.856
Token coach
Token ID3985
Feature activation+6.62e-01

Pos logit contributions
apper+0.994
 lid+0.905
 Pixie+0.892
PDATE+0.881
 Deluxe+0.785
Neg logit contributions
-1.624
hon-1.489
give-1.488
udo-1.442
 affirm-1.437
Token Tom
Token ID4186
Feature activation-1.63e-01

Pos logit contributions
apper+1.129
 lid+1.062
PDATE+1.035
 Pixie+1.032
 Deluxe+0.945
Neg logit contributions
-1.140
give-1.027
hon-1.019
VICE-0.988
udo-0.987
Token Moh
Token ID9719
Feature activation-5.06e-01

Pos logit contributions
+0.286
give+0.258
hon+0.256
udo+0.248
ograms+0.244
Neg logit contributions
apper-0.271
PDATE-0.257
 lid-0.257
 Pixie-0.250
ANY-0.229
Token telling
Token ID5149
Feature activation-5.20e-01

Pos logit contributions
apper+0.892
 lid+0.841
 Pixie+0.828
PDATE+0.823
 Deluxe+0.768
Neg logit contributions
-0.669
give-0.591
hon-0.581
VICE-0.565
 affirm-0.563
Token us
Token ID514
Feature activation+2.49e-01

Pos logit contributions
+0.742
give+0.663
hon+0.643
 affirm+0.633
VICE+0.628
Neg logit contributions
apper-1.037
 lid-0.974
PDATE-0.968
 Pixie-0.961
 Deluxe-0.893
Token.'
Token ID2637
Feature activation+1.40e+00

Pos logit contributions
apper+0.476
 lid+0.448
PDATE+0.444
 Pixie+0.443
 Deluxe+0.408
Neg logit contributions
-0.377
give-0.337
hon-0.331
 affirm-0.322
VICE-0.320
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
apper+2.023
 lid+1.884
 Pixie+1.833
PDATE+1.795
 Deluxe+1.644
Neg logit contributions
-2.784
give-2.540
hon-2.521
VICE-2.464
udo-2.450
Token
Token ID198
Feature activation+1.22e+00

Pos logit contributions
apper+1.696
 lid+1.501
 Pixie+1.463
 Deluxe+1.384
ANY+1.329
Neg logit contributions
-3.170
 decree-2.834
thood-2.827
give-2.821
hon-2.818
TokenScroll
Token ID29261
Feature activation+4.47e+00

Pos logit contributions
apper+2.098
 lid+1.960
 Pixie+1.920
PDATE+1.879
 Deluxe+1.765
Neg logit contributions
-2.085
give-1.862
hon-1.843
udo-1.790
VICE-1.788
Token down
Token ID866
Feature activation+8.22e-01

Pos logit contributions
apper+5.881
 lid+5.415
 Pixie+5.240
PDATE+4.909
 Deluxe+4.700
Neg logit contributions
-9.213
give-8.426
hon-8.292
udo-8.133
thood-8.126
Token for
Token ID329
Feature activation-1.36e-02

Pos logit contributions
apper+1.519
 lid+1.423
 Pixie+1.398
PDATE+1.385
ANY+1.301
Neg logit contributions
-1.304
give-1.161
hon-1.137
 affirm-1.119
VICE-1.116
Token video
Token ID2008
Feature activation+1.59e+00

Pos logit contributions
+0.025
give+0.024
hon+0.023
 decree+0.023
 affirm+0.023
Neg logit contributions
apper-0.021
 lid-0.020
 Pixie-0.019
PDATE-0.019
ANY-0.017
Token
Token ID198
Feature activation+1.93e+00

Pos logit contributions
apper+2.310
 lid+2.146
 Pixie+2.092
PDATE+2.040
 Deluxe+1.867
Neg logit contributions
-3.176
give-2.894
hon-2.874
VICE-2.809
udo-2.789
Token
Token ID198
Feature activation+1.58e+00

Pos logit contributions
apper+2.201
 lid+1.933
 Pixie+1.916
 Deluxe+1.816
ANY+1.729
Neg logit contributions
-4.331
 decree-3.911
thood-3.898
gments-3.888
 affirm-3.883
TokenWidth
Token ID30916
Feature activation+2.38e+00

Pos logit contributions
apper+0.988
 lid+0.923
 Pixie+0.895
PDATE+0.884
 Deluxe+0.802
Neg logit contributions
-1.401
give-1.279
hon-1.266
VICE-1.242
udo-1.231
Token),
Token ID828
Feature activation+2.30e+00

Pos logit contributions
apper+4.147
 lid+3.902
 Pixie+3.810
PDATE+3.717
 Deluxe+3.497
Neg logit contributions
-3.977
give-3.559
hon-3.523
VICE-3.430
udo-3.410
Token($
Token ID16763
Feature activation-3.31e-01

Pos logit contributions
apper+3.739
 lid+3.503
 Pixie+3.420
PDATE+3.354
 Deluxe+3.119
Neg logit contributions
-4.141
give-3.730
hon-3.697
VICE-3.602
udo-3.587
TokenSR
Token ID12562
Feature activation+4.25e-01

Pos logit contributions
+0.627
give+0.569
hon+0.563
VICE+0.562
udo+0.547
Neg logit contributions
apper-0.506
 lid-0.482
 Pixie-0.474
PDATE-0.469
 accents-0.429
Token.
Token ID13
Feature activation+9.21e-01

Pos logit contributions
apper+0.698
 lid+0.643
 Pixie+0.630
 Deluxe+0.588
ANY+0.583
Neg logit contributions
-0.763
give-0.673
hon-0.671
 decree-0.667
 affirm-0.660
TokenHeight
Token ID23106
Feature activation+4.46e+00

Pos logit contributions
apper+1.233
 lid+1.146
 Pixie+1.107
PDATE+1.093
 Deluxe+0.985
Neg logit contributions
-1.936
give-1.777
hon-1.759
VICE-1.732
udo-1.708
Token)))
Token ID22305
Feature activation+2.85e+00

Pos logit contributions
apper+7.239
 lid+6.774
 Pixie+6.529
PDATE+6.387
 accents+6.007
Neg logit contributions
-7.712
give-6.933
hon-6.891
VICE-6.697
udo-6.671
Token )
Token ID1267
Feature activation+1.32e+00

Pos logit contributions
apper+4.445
 lid+4.160
 Pixie+4.055
PDATE+3.959
 Deluxe+3.684
Neg logit contributions
-5.307
give-4.795
hon-4.756
VICE-4.643
udo-4.611
Token }
Token ID1782
Feature activation-4.99e-01

Pos logit contributions
apper+2.143
 lid+1.994
 Pixie+1.954
PDATE+1.940
 Deluxe+1.770
Neg logit contributions
-2.407
give-2.178
hon-2.152
udo-2.102
VICE-2.098
Token }
Token ID1782
Feature activation-8.74e-01

Pos logit contributions
+0.786
give+0.715
 affirm+0.687
hon+0.685
udo+0.684
Neg logit contributions
apper-0.905
 lid-0.838
PDATE-0.838
 Pixie-0.829
ANY-0.766
Token Catch
Token ID25750
Feature activation+1.09e+00

Pos logit contributions
+1.377
give+1.264
 affirm+1.216
udo+1.215
 decree+1.213
Neg logit contributions
apper-1.569
PDATE-1.455
 lid-1.447
 Pixie-1.433
ANY-1.322
Token,
Token ID11
Feature activation+5.15e-01

Pos logit contributions
apper+4.063
 lid+3.841
 Pixie+3.758
PDATE+3.660
 Deluxe+3.442
Neg logit contributions
-3.824
give-3.391
hon-3.359
VICE-3.290
udo-3.273
Token as
Token ID355
Feature activation+8.13e-01

Pos logit contributions
apper+0.941
 lid+0.884
 Pixie+0.867
PDATE+0.858
 Deluxe+0.801
Neg logit contributions
-0.829
give-0.742
hon-0.732
VICE-0.708
 affirm-0.704
Token suggested
Token ID5220
Feature activation+1.88e+00

Pos logit contributions
apper+1.389
 lid+1.301
 Pixie+1.278
PDATE+1.263
 Deluxe+1.171
Neg logit contributions
-1.399
give-1.263
hon-1.248
VICE-1.214
 affirm-1.207
Token by
Token ID416
Feature activation+7.80e-01

Pos logit contributions
apper+3.506
 lid+3.310
 Pixie+3.248
PDATE+3.198
 Deluxe+2.998
Neg logit contributions
-2.954
give-2.617
hon-2.588
VICE-2.508
udo-2.491
Token the
Token ID262
Feature activation+8.06e-01

Pos logit contributions
apper+1.330
 lid+1.242
 Pixie+1.225
PDATE+1.212
 Deluxe+1.119
Neg logit contributions
-1.350
give-1.217
hon-1.201
VICE-1.168
 decree-1.164
Token Bulgarian
Token ID42119
Feature activation+4.46e+00

Pos logit contributions
apper+1.345
 lid+1.252
 Pixie+1.236
PDATE+1.221
 Deluxe+1.125
Neg logit contributions
-1.426
give-1.284
hon-1.273
VICE-1.236
 decree-1.233
Token ambassador
Token ID14791
Feature activation-4.81e-03

Pos logit contributions
apper+7.249
 lid+6.964
 Pixie+6.611
PDATE+6.361
 accents+6.322
Neg logit contributions
-7.579
give-6.816
hon-6.707
VICE-6.646
udo-6.642
Token.
Token ID13
Feature activation+5.62e-01

Pos logit contributions
+0.008
hon+0.007
give+0.007
 affirm+0.006
 decree+0.006
Neg logit contributions
apper-0.009
 lid-0.008
PDATE-0.008
 Pixie-0.008
 Deluxe-0.008
Token
Token ID198
Feature activation+3.88e-01

Pos logit contributions
apper+0.859
 lid+0.802
 Pixie+0.776
PDATE+0.775
 Deluxe+0.698
Neg logit contributions
-1.077
give-0.980
hon-0.970
VICE-0.947
udo-0.937
Token
Token ID198
Feature activation+2.91e-01

Pos logit contributions
apper+0.498
 lid+0.451
 Pixie+0.439
 Deluxe+0.403
ANY+0.395
Neg logit contributions
-0.835
give-0.749
hon-0.747
 decree-0.741
udo-0.736
TokenPeople
Token ID8061
Feature activation+1.77e-01

Pos logit contributions
apper+0.483
 lid+0.455
 Pixie+0.445
PDATE+0.440
 Deluxe+0.405
Neg logit contributions
-0.515
give-0.466
hon-0.461
VICE-0.450
udo-0.444
Tokenx
Token ID87
Feature activation+1.23e+00

Pos logit contributions
apper+3.445
 lid+3.246
 Pixie+3.178
PDATE+3.114
 Deluxe+2.915
Neg logit contributions
-3.354
give-2.991
hon-2.965
VICE-2.886
udo-2.868
Token$
Token ID3
Feature activation-6.02e-01

Pos logit contributions
apper+2.186
 lid+2.061
 Pixie+2.020
PDATE+1.985
 Deluxe+1.855
Neg logit contributions
-2.028
give-1.816
hon-1.792
VICE-1.743
udo-1.732
Token($
Token ID16763
Feature activation-3.09e-01

Pos logit contributions
+0.920
give+0.817
hon+0.806
VICE+0.781
udo+0.779
Neg logit contributions
apper-1.139
 lid-1.076
 Pixie-1.059
PDATE-1.054
 Deluxe-0.976
TokenSR
Token ID12562
Feature activation+6.28e-01

Pos logit contributions
+0.557
give+0.500
hon+0.495
VICE+0.489
udo+0.482
Neg logit contributions
apper-0.503
 lid-0.481
 Pixie-0.475
PDATE-0.467
 accents-0.432
Token.
Token ID13
Feature activation+7.68e-01

Pos logit contributions
apper+1.142
 lid+1.087
PDATE+1.068
 Pixie+1.064
 accents+0.975
Neg logit contributions
-1.011
give-0.902
hon-0.894
VICE-0.876
udo-0.865
TokenHeight
Token ID23106
Feature activation+4.42e+00

Pos logit contributions
apper+0.999
 lid+0.926
 Pixie+0.895
PDATE+0.880
 Deluxe+0.793
Neg logit contributions
-1.639
give-1.503
hon-1.492
VICE-1.464
udo-1.450
Token)"
Token ID16725
Feature activation+1.97e+00

Pos logit contributions
apper+7.153
 lid+6.673
 Pixie+6.460
PDATE+6.304
 accents+5.956
Neg logit contributions
-7.628
give-6.787
hon-6.780
thood-6.634
lyak-6.590
Token -
Token ID532
Feature activation+2.13e+00

Pos logit contributions
apper+3.347
 lid+3.138
 Pixie+3.072
PDATE+3.023
 Deluxe+2.804
Neg logit contributions
-3.427
give-3.082
hon-3.042
VICE-2.964
udo-2.953
TokenVer
Token ID13414
Feature activation+9.23e-01

Pos logit contributions
apper+3.433
 lid+3.195
 Pixie+3.117
PDATE+3.043
 Deluxe+2.862
Neg logit contributions
-3.840
give-3.421
hon-3.395
VICE-3.314
 decree-3.300
Tokenb
Token ID65
Feature activation+3.43e+00

Pos logit contributions
apper+1.630
 lid+1.549
 Pixie+1.500
PDATE+1.418
 Deluxe+1.404
Neg logit contributions
-1.514
give-1.322
hon-1.314
VICE-1.292
 decree-1.288
Tokenose
Token ID577
Feature activation+1.71e+00

Pos logit contributions
apper+6.724
 lid+6.285
 Pixie+6.144
 Deluxe+5.852
rawler+5.771
Neg logit contributions
-4.845
 decree-4.033
 combatants-3.970
hon-3.925
thood-3.905
Token($
Token ID16763
Feature activation-3.20e-01

Pos logit contributions
apper+1.146
 lid+1.069
 Pixie+1.046
PDATE+1.031
 Deluxe+0.944
Neg logit contributions
-1.389
give-1.263
hon-1.243
VICE-1.217
udo-1.210
Tokens
Token ID82
Feature activation+2.99e+00

Pos logit contributions
+0.512
give+0.455
hon+0.454
VICE+0.444
udo+0.435
Neg logit contributions
apper-0.583
 lid-0.560
 Pixie-0.552
PDATE-0.546
 Deluxe-0.507
Tokenz
Token ID89
Feature activation+1.04e+00

Pos logit contributions
apper+4.944
 lid+4.319
 Pixie+4.202
XT+4.021
ANY+3.945
Neg logit contributions
-5.352
 affirm-4.642
 decree-4.624
give-4.560
 combatants-4.514
Token1
Token ID16
Feature activation-3.60e-01

Pos logit contributions
apper+1.630
 lid+1.537
 Pixie+1.504
PDATE+1.477
 Deluxe+1.361
Neg logit contributions
-1.928
give-1.743
hon-1.732
VICE-1.692
udo-1.678
Token.
Token ID13
Feature activation+7.84e-01

Pos logit contributions
+0.588
give+0.520
hon+0.517
udo+0.505
VICE+0.499
Neg logit contributions
apper-0.646
 lid-0.611
PDATE-0.605
 Pixie-0.598
 Deluxe-0.547
TokenHeight
Token ID23106
Feature activation+4.41e+00

Pos logit contributions
apper+1.096
 lid+1.022
 Pixie+0.994
PDATE+0.982
 Deluxe+0.888
Neg logit contributions
-1.595
give-1.460
hon-1.447
VICE-1.416
udo-1.406
Token),
Token ID828
Feature activation+2.30e+00

Pos logit contributions
apper+7.001
 lid+6.485
 Pixie+6.352
PDATE+6.075
ANY+5.780
Neg logit contributions
-7.727
give-6.950
hon-6.881
 decree-6.735
lyak-6.730
Token$
Token ID3
Feature activation+2.83e-02

Pos logit contributions
apper+3.784
 lid+3.548
 Pixie+3.466
PDATE+3.400
 Deluxe+3.162
Neg logit contributions
-4.117
give-3.705
hon-3.672
VICE-3.577
udo-3.560
TokenSR
Token ID12562
Feature activation+9.80e-01

Pos logit contributions
apper+0.053
 lid+0.050
 Pixie+0.049
PDATE+0.049
 Deluxe+0.046
Neg logit contributions
-0.045
give-0.039
hon-0.039
VICE-0.038
udo-0.038
Token.
Token ID13
Feature activation+7.82e-01

Pos logit contributions
apper+1.753
 lid+1.661
 Pixie+1.634
PDATE+1.603
 Deluxe+1.496
Neg logit contributions
-1.608
give-1.433
hon-1.423
VICE-1.390
udo-1.381
TokenWidth
Token ID30916
Feature activation+2.88e+00

Pos logit contributions
apper+1.112
 lid+1.041
 Pixie+1.011
PDATE+0.996
 Deluxe+0.903
Neg logit contributions
-1.576
give-1.439
hon-1.425
VICE-1.398
udo-1.385
Token charge
Token ID3877
Feature activation-6.19e-01

Pos logit contributions
apper+1.817
 lid+1.660
PDATE+1.636
 Pixie+1.632
ANY+1.478
Neg logit contributions
-2.157
give-1.973
hon-1.953
 decree-1.916
VICE-1.908
Token at
Token ID379
Feature activation+1.36e+00

Pos logit contributions
+1.042
give+0.944
hon+0.938
 decree+0.907
 affirm+0.903
Neg logit contributions
apper-1.074
PDATE-0.992
 lid-0.987
 Pixie-0.977
rawler-0.893
Token Old
Token ID5706
Feature activation+1.64e+00

Pos logit contributions
apper+2.549
 lid+2.399
 Pixie+2.351
PDATE+2.334
 Deluxe+2.165
Neg logit contributions
-2.123
give-1.897
hon-1.871
VICE-1.814
udo-1.802
Token Trafford
Token ID41107
Feature activation+2.15e+00

Pos logit contributions
apper+2.210
 lid+2.089
 Pixie+2.038
 Deluxe+1.872
PDATE+1.849
Neg logit contributions
-3.383
give-3.031
hon-2.990
VICE-2.968
 decree-2.955
Token against
Token ID1028
Feature activation+1.39e-01

Pos logit contributions
apper+3.342
 lid+3.130
 Pixie+3.048
PDATE+2.972
 Deluxe+2.768
Neg logit contributions
-4.037
give-3.649
hon-3.607
VICE-3.530
udo-3.513
Token Swansea
Token ID36019
Feature activation+4.38e+00

Pos logit contributions
apper+0.212
 lid+0.197
PDATE+0.195
 Pixie+0.191
ANY+0.173
Neg logit contributions
-0.263
give-0.242
hon-0.239
VICE-0.233
udo-0.232
Token on
Token ID319
Feature activation+3.35e-01

Pos logit contributions
apper+6.354
 lid+5.948
 Pixie+5.762
PDATE+5.468
 Deluxe+5.246
Neg logit contributions
-8.427
give-7.602
hon-7.513
VICE-7.421
lyak-7.292
Token Sunday
Token ID3502
Feature activation-4.43e-01

Pos logit contributions
apper+0.553
 lid+0.518
 Pixie+0.507
PDATE+0.502
 Deluxe+0.461
Neg logit contributions
-0.597
give-0.538
hon-0.531
VICE-0.519
udo-0.514
Token by
Token ID416
Feature activation-2.68e-01

Pos logit contributions
+0.777
give+0.704
hon+0.692
 affirm+0.677
 decree+0.674
Neg logit contributions
apper-0.733
 lid-0.682
PDATE-0.675
 Pixie-0.674
ANY-0.618
Token logging
Token ID18931
Feature activation+6.15e-01

Pos logit contributions
+0.424
give+0.376
hon+0.371
 decree+0.361
VICE+0.360
Neg logit contributions
apper-0.496
 lid-0.465
PDATE-0.458
 Pixie-0.458
ANY-0.422
Token on
Token ID319
Feature activation-2.97e-01

Pos logit contributions
apper+1.075
 lid+1.002
 Pixie+0.983
PDATE+0.976
ANY+0.899
Neg logit contributions
-1.041
give-0.938
hon-0.926
VICE-0.903
udo-0.891
Token have
Token ID423
Feature activation+8.71e-01

Pos logit contributions
+0.309
give+0.278
hon+0.272
 affirm+0.266
 decree+0.264
Neg logit contributions
apper-0.437
PDATE-0.415
 Pixie-0.412
 lid-0.411
ANY-0.378
Token competed
Token ID32440
Feature activation+1.80e+00

Pos logit contributions
apper+1.549
 lid+1.452
 Pixie+1.428
PDATE+1.416
 Deluxe+1.309
Neg logit contributions
-1.439
give-1.296
hon-1.276
VICE-1.240
udo-1.236
Token alongside
Token ID7848
Feature activation+2.38e+00

Pos logit contributions
apper+3.065
 lid+2.880
 Pixie+2.818
PDATE+2.767
 Deluxe+2.577
Neg logit contributions
-3.122
give-2.801
hon-2.772
VICE-2.700
udo-2.685
Token Chelsea
Token ID12147
Feature activation+3.25e+00

Pos logit contributions
apper+3.446
 lid+3.216
 Pixie+3.134
PDATE+3.025
 Deluxe+2.818
Neg logit contributions
-4.724
give-4.276
hon-4.253
VICE-4.159
udo-4.128
Token and
Token ID290
Feature activation-1.84e-01

Pos logit contributions
apper+5.640
 lid+5.346
 Pixie+5.187
PDATE+5.032
 Deluxe+4.792
Neg logit contributions
-5.490
give-4.853
hon-4.795
VICE-4.724
udo-4.654
Token Tottenham
Token ID24272
Feature activation+4.37e+00

Pos logit contributions
+0.369
give+0.344
hon+0.338
udo+0.332
 affirm+0.331
Neg logit contributions
apper-0.259
PDATE-0.238
 lid-0.238
 Pixie-0.231
rawler-0.207
Token Hots
Token ID41239
Feature activation+3.00e+00

Pos logit contributions
apper+7.172
 lid+6.821
 Pixie+6.578
PDATE+6.258
 Hots+6.155
Neg logit contributions
-7.634
give-6.706
VICE-6.636
hon-6.620
udo-6.468
Tokenpur
Token ID14225
Feature activation+1.73e+00

Pos logit contributions
apper+4.344
 lid+3.921
 Pixie+3.837
 Hots+3.594
 Deluxe+3.502
Neg logit contributions
-5.927
VICE-5.261
 decree-5.219
give-5.213
ograms-5.140
Token for
Token ID329
Feature activation+2.73e-01

Pos logit contributions
apper+3.092
 lid+2.903
 Pixie+2.852
PDATE+2.825
 Deluxe+2.613
Neg logit contributions
-2.853
give-2.562
hon-2.534
VICE-2.450
udo-2.447
Token the
Token ID262
Feature activation+2.48e-01

Pos logit contributions
apper+0.488
PDATE+0.450
 lid+0.444
 Pixie+0.442
ANY+0.405
Neg logit contributions
-0.454
give-0.410
hon-0.403
 decree-0.394
udo-0.392
Token Premier
Token ID9952
Feature activation-1.55e-02

Pos logit contributions
apper+0.354
PDATE+0.318
 Pixie+0.316
 lid+0.314
ANY+0.281
Neg logit contributions
-0.503
give-0.459
hon-0.457
 decree-0.447
VICE-0.443
Token community
Token ID2055
Feature activation+1.91e-01

Pos logit contributions
apper+0.279
 lid+0.257
PDATE+0.253
 Pixie+0.251
 accents+0.226
Neg logit contributions
-0.361
give-0.330
hon-0.327
udo-0.320
VICE-0.319
Token.
Token ID13
Feature activation+5.26e-01

Pos logit contributions
apper+0.359
 lid+0.338
PDATE+0.334
 Pixie+0.330
 accents+0.306
Neg logit contributions
-0.299
give-0.264
hon-0.263
VICE-0.254
 affirm-0.253
Token
Token ID198
Feature activation+4.13e-01

Pos logit contributions
apper+0.811
 lid+0.755
PDATE+0.740
 Pixie+0.726
 accents+0.671
Neg logit contributions
-1.016
hon-0.912
give-0.911
VICE-0.885
udo-0.883
Token
Token ID198
Feature activation+3.47e-01

Pos logit contributions
apper+0.532
 lid+0.483
 Pixie+0.469
 Deluxe+0.431
ANY+0.421
Neg logit contributions
-0.891
give-0.797
hon-0.796
 decree-0.789
thood-0.785
TokenC
Token ID34
Feature activation+1.49e+00

Pos logit contributions
apper+0.613
 lid+0.585
 Pixie+0.574
PDATE+0.571
 accents+0.525
Neg logit contributions
-0.579
give-0.521
hon-0.516
VICE-0.504
udo-0.494
Tokenue
Token ID518
Feature activation-4.86e+00

Pos logit contributions
apper+2.284
 lid+2.164
 Pixie+2.111
PDATE+2.082
 Deluxe+1.908
Neg logit contributions
-2.794
give-2.545
hon-2.527
udo-2.468
VICE-2.461
Tokenll
Token ID297
Feature activation-6.89e-01

Pos logit contributions
+5.366
give+5.134
hon+5.087
udo+4.712
lyak+4.662
Neg logit contributions
PDATE-10.671
apper-10.242
 Pixie-9.968
 lid-9.688
ANY-9.063
Tokenar
Token ID283
Feature activation+1.73e-02

Pos logit contributions
+0.931
give+0.804
hon+0.791
VICE+0.769
udo+0.761
Neg logit contributions
apper-1.440
 lid-1.364
 Pixie-1.339
PDATE-1.305
 Deluxe-1.252
Token also
Token ID635
Feature activation+1.04e+00

Pos logit contributions
apper+0.034
 lid+0.032
 Pixie+0.031
PDATE+0.031
ANY+0.029
Neg logit contributions
-0.025
give-0.023
hon-0.022
 affirm-0.021
VICE-0.021
Token took
Token ID1718
Feature activation+2.80e-01

Pos logit contributions
apper+1.998
 lid+1.902
 Pixie+1.842
PDATE+1.774
 accents+1.721
Neg logit contributions
-1.580
give-1.374
hon-1.360
VICE-1.335
udo-1.321
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
apper+0.467
PDATE+0.438
 lid+0.438
 Pixie+0.434
ANY+0.397
Neg logit contributions
-0.480
give-0.443
hon-0.434
 affirm-0.424
VICE-0.423
Tokenops
Token ID2840
Feature activation+4.30e-01

Pos logit contributions
apper+1.002
 lid+0.876
 Pixie+0.832
PDATE+0.765
 Deluxe+0.720
Neg logit contributions
-2.308
give-2.131
hon-2.114
VICE-2.092
 decree-2.082
Token hatch
Token ID25834
Feature activation+4.72e-01

Pos logit contributions
apper+0.834
 lid+0.780
PDATE+0.775
 Pixie+0.774
ANY+0.718
Neg logit contributions
-0.638
give-0.570
hon-0.561
udo-0.543
 decree-0.539
Tokenlings
Token ID17783
Feature activation+1.37e-01

Pos logit contributions
apper+0.884
 lid+0.833
PDATE+0.821
 Pixie+0.819
 accents+0.760
Neg logit contributions
-0.734
give-0.652
hon-0.645
udo-0.623
VICE-0.617
Token,
Token ID11
Feature activation-4.89e-01

Pos logit contributions
apper+0.272
 lid+0.256
PDATE+0.255
 Pixie+0.253
ANY+0.236
Neg logit contributions
-0.198
give-0.175
hon-0.172
VICE-0.166
udo-0.165
Token six
Token ID2237
Feature activation-1.26e+00

Pos logit contributions
+0.708
give+0.634
hon+0.614
 affirm+0.593
VICE+0.592
Neg logit contributions
apper-0.967
 lid-0.907
 Pixie-0.895
PDATE-0.895
ANY-0.833
Token months
Token ID1933
Feature activation-4.73e+00

Pos logit contributions
+1.706
give+1.537
hon+1.484
 affirm+1.458
ograms+1.449
Neg logit contributions
apper-2.567
PDATE-2.373
 lid-2.372
 Pixie-2.362
ANY-2.186
Token for
Token ID329
Feature activation-1.36e+00

Pos logit contributions
+5.761
give+4.948
thood+4.800
hon+4.772
 affirm+4.730
Neg logit contributions
apper-9.125
PDATE-8.642
 lid-8.588
 Pixie-8.410
ANY-8.185
Token the
Token ID262
Feature activation-4.43e-01

Pos logit contributions
+2.139
give+1.958
hon+1.893
udo+1.844
 affirm+1.842
Neg logit contributions
apper-2.444
PDATE-2.288
 lid-2.209
 Pixie-2.175
ANY-2.059
Token larger
Token ID4025
Feature activation+1.50e+00

Pos logit contributions
+0.915
give+0.852
hon+0.839
 decree+0.820
udo+0.817
Neg logit contributions
apper-0.590
PDATE-0.537
 lid-0.526
 Pixie-0.517
ANY-0.468
Token Hyp
Token ID21209
Feature activation-2.48e-01

Pos logit contributions
apper+2.139
 lid+1.977
 Pixie+1.926
PDATE+1.897
 Deluxe+1.727
Neg logit contributions
-3.026
give-2.763
hon-2.736
VICE-2.671
udo-2.668
Tokenac
Token ID330
Feature activation-9.51e-02

Pos logit contributions
+0.397
give+0.354
hon+0.350
VICE+0.338
udo+0.338
Neg logit contributions
apper-0.454
 lid-0.430
 Pixie-0.422
PDATE-0.418
 Deluxe-0.387
Token York
Token ID1971
Feature activation-1.02e+00

Pos logit contributions
+3.207
give+2.924
hon+2.892
 affirm+2.685
 decree+2.683
Neg logit contributions
apper-4.083
PDATE-3.889
 lid-3.846
 Pixie-3.672
ANY-3.487
Token
Token ID447
Feature activation+9.06e-03

Pos logit contributions
+1.963
give+1.740
hon+1.735
 decree+1.711
 affirm+1.698
Neg logit contributions
apper-1.539
PDATE-1.410
 lid-1.408
 Pixie-1.362
ANY-1.251
Token
Token ID247
Feature activation-1.86e+00

Pos logit contributions
apper+0.018
 lid+0.017
 Pixie+0.016
PDATE+0.016
 Deluxe+0.015
Neg logit contributions
-0.014
give-0.012
hon-0.012
VICE-0.011
udo-0.011
Tokens
Token ID82
Feature activation+2.08e+00

Pos logit contributions
+2.967
give+2.657
hon+2.641
VICE+2.543
udo+2.529
Neg logit contributions
apper-3.388
 lid-3.207
PDATE-3.176
 Pixie-3.149
 accents-2.876
Token G
Token ID402
Feature activation+9.34e-03

Pos logit contributions
apper+3.441
 lid+3.248
 Pixie+3.176
PDATE+3.080
 Deluxe+2.900
Neg logit contributions
-3.670
give-3.295
hon-3.265
VICE-3.195
udo-3.177
Tokenug
Token ID1018
Feature activation-4.64e+00

Pos logit contributions
apper+0.018
 lid+0.017
 Pixie+0.017
PDATE+0.017
 accents+0.015
Neg logit contributions
-0.014
give-0.012
hon-0.012
udo-0.012
 affirm-0.012
Tokengen
Token ID5235
Feature activation-1.36e+00

Pos logit contributions
+4.192
udo+3.774
hon+3.766
give+3.613
lyak+3.579
Neg logit contributions
PDATE-10.401
apper-10.138
 Pixie-9.870
 lid-9.674
XT-9.104
Tokenheim
Token ID9096
Feature activation+4.45e-01

Pos logit contributions
+1.524
give+1.362
hon+1.354
 decree+1.302
udo+1.267
Neg logit contributions
apper-3.020
PDATE-2.921
 lid-2.907
 Pixie-2.852
ANY-2.720
Token Museum
Token ID9594
Feature activation-2.01e-01

Pos logit contributions
apper+0.674
 lid+0.614
 Pixie+0.605
PDATE+0.604
ANY+0.542
Neg logit contributions
-0.862
give-0.780
hon-0.778
 decree-0.754
 affirm-0.750
Token,
Token ID11
Feature activation+8.68e-02

Pos logit contributions
+0.322
give+0.282
hon+0.282
 affirm+0.270
 decree+0.270
Neg logit contributions
apper-0.371
 lid-0.345
PDATE-0.343
 Pixie-0.343
ANY-0.313
Token he
Token ID339
Feature activation-1.56e-01

Pos logit contributions
apper+0.160
 lid+0.150
 Pixie+0.147
PDATE+0.146
 Deluxe+0.136
Neg logit contributions
-0.138
give-0.123
hon-0.122
VICE-0.118
 decree-0.117
Token 100
Token ID1802
Feature activation-1.06e+00

Pos logit contributions
+0.001
give+0.001
hon+0.001
 decree+0.001
 affirm+0.001
Neg logit contributions
apper-0.001
 lid-0.001
 Pixie-0.001
PDATE-0.001
ANY-0.001
Token.
Token ID13
Feature activation-5.06e-01

Pos logit contributions
+1.798
hon+1.632
give+1.615
 affirm+1.565
udo+1.540
Neg logit contributions
apper-1.841
 lid-1.703
PDATE-1.687
 Pixie-1.686
rawler-1.530
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
+0.946
give+0.863
hon+0.851
VICE+0.833
udo+0.827
Neg logit contributions
apper-0.782
 lid-0.746
PDATE-0.722
 Pixie-0.715
 accents-0.651
Token
Token ID198
Feature activation-1.86e-01

Pos logit contributions
+0.228
give+0.206
hon+0.205
 decree+0.201
udo+0.201
Neg logit contributions
apper-0.149
 lid-0.137
 Pixie-0.134
PDATE-0.127
 Deluxe-0.121
TokenG
Token ID38
Feature activation-7.75e-01

Pos logit contributions
+0.327
give+0.298
hon+0.294
VICE+0.290
udo+0.282
Neg logit contributions
apper-0.311
 lid-0.299
 Pixie-0.292
PDATE-0.290
 accents-0.265
Tokenerr
Token ID8056
Feature activation-4.58e+00

Pos logit contributions
+0.878
give+0.769
hon+0.758
VICE+0.731
udo+0.728
Neg logit contributions
apper-1.724
 Pixie-1.691
 lid-1.688
PDATE-1.654
 accents-1.552
Tokenard
Token ID446
Feature activation-1.79e+00

Pos logit contributions
+3.867
give+3.527
udo+3.460
hon+3.410
VICE+3.214
Neg logit contributions
PDATE-10.976
 Pixie-10.621
apper-10.231
 lid-10.041
 Deluxe-9.896
Token
Token ID447
Feature activation+7.18e-01

Pos logit contributions
+2.951
hon+2.640
give+2.628
 affirm+2.556
udo+2.555
Neg logit contributions
apper-3.121
PDATE-2.985
 Pixie-2.870
 lid-2.869
ANY-2.648
Token
Token ID247
Feature activation-7.53e-01

Pos logit contributions
apper+1.307
 lid+1.233
 Pixie+1.211
PDATE+1.191
 Deluxe+1.111
Neg logit contributions
-1.158
give-1.029
hon-1.019
VICE-0.990
udo-0.985
Tokens
Token ID82
Feature activation+2.72e+00

Pos logit contributions
+1.290
give+1.159
hon+1.157
VICE+1.111
udo+1.101
Neg logit contributions
apper-1.294
 lid-1.221
 Pixie-1.209
PDATE-1.192
 accents-1.088
Token 100
Token ID1802
Feature activation-1.19e+00

Pos logit contributions
apper+4.215
 lid+3.976
 Pixie+3.840
PDATE+3.727
 Deluxe+3.513
Neg logit contributions
-5.084
give-4.560
hon-4.506
VICE-4.441
udo-4.406
Token the
Token ID262
Feature activation-3.87e-01

Pos logit contributions
+0.451
give+0.403
hon+0.394
 decree+0.388
 affirm+0.386
Neg logit contributions
apper-0.484
PDATE-0.447
 lid-0.446
 Pixie-0.437
ANY-0.404
Token countdown
Token ID33970
Feature activation-3.09e-01

Pos logit contributions
+0.667
give+0.601
hon+0.588
 decree+0.584
 affirm+0.573
Neg logit contributions
apper-0.660
 Pixie-0.608
PDATE-0.606
 lid-0.605
ANY-0.551
Token clock
Token ID8801
Feature activation+4.69e-01

Pos logit contributions
+0.493
give+0.444
hon+0.437
 affirm+0.428
 decree+0.427
Neg logit contributions
apper-0.558
PDATE-0.527
 Pixie-0.525
 lid-0.517
ANY-0.483
Token.
Token ID13
Feature activation-1.99e-01

Pos logit contributions
apper+0.885
 Pixie+0.832
PDATE+0.829
 lid+0.828
 Deluxe+0.764
Neg logit contributions
-0.727
give-0.639
hon-0.632
 affirm-0.613
 decree-0.609
Token Five
Token ID10579
Feature activation-3.05e-01

Pos logit contributions
+0.377
give+0.340
hon+0.334
udo+0.327
VICE+0.326
Neg logit contributions
apper-0.313
 lid-0.290
PDATE-0.285
 Pixie-0.280
 accents-0.256
Token minutes
Token ID2431
Feature activation-4.54e+00

Pos logit contributions
+0.487
give+0.435
hon+0.428
 affirm+0.417
 decree+0.415
Neg logit contributions
apper-0.561
 lid-0.524
 Pixie-0.519
PDATE-0.518
 Deluxe-0.476
Token remaining
Token ID5637
Feature activation-7.95e-02

Pos logit contributions
+5.711
 decree+5.359
 affirm+5.357
hon+5.316
give+5.277
Neg logit contributions
apper-8.523
PDATE-8.498
 Pixie-8.033
 lid-7.605
ANY-7.457
Token.
Token ID13
Feature activation-5.15e-01

Pos logit contributions
+0.123
give+0.111
hon+0.110
 affirm+0.107
VICE+0.106
Neg logit contributions
apper-0.148
 lid-0.139
 Pixie-0.138
PDATE-0.138
ANY-0.127
Token Then
Token ID3244
Feature activation-6.33e-01

Pos logit contributions
+0.962
give+0.869
hon+0.853
VICE+0.834
udo+0.833
Neg logit contributions
apper-0.817
 lid-0.760
PDATE-0.746
 Pixie-0.730
 accents-0.671
Token four
Token ID1440
Feature activation+3.20e-01

Pos logit contributions
+1.025
give+0.926
hon+0.906
 affirm+0.893
 decree+0.892
Neg logit contributions
apper-1.138
PDATE-1.053
 Pixie-1.052
 lid-1.048
 Deluxe-0.960
Token minutes
Token ID2431
Feature activation-3.20e+00

Pos logit contributions
apper+0.579
 lid+0.542
 Pixie+0.534
PDATE+0.528
 Deluxe+0.490
Neg logit contributions
-0.520
give-0.465
hon-0.458
VICE-0.444
 affirm-0.444
Token6
Token ID21
Feature activation+2.80e-01

Pos logit contributions
+0.086
give+0.078
hon+0.077
VICE+0.075
udo+0.075
Neg logit contributions
apper-0.071
 lid-0.067
 Pixie-0.065
PDATE-0.064
 Deluxe-0.059
Token.
Token ID13
Feature activation-5.99e-01

Pos logit contributions
apper+0.520
 lid+0.488
 Pixie+0.483
PDATE+0.480
 Deluxe+0.442
Neg logit contributions
-0.443
give-0.395
hon-0.389
 decree-0.376
udo-0.376
Token5
Token ID20
Feature activation-1.10e-01

Pos logit contributions
+1.080
give+0.977
hon+0.966
VICE+0.939
udo+0.936
Neg logit contributions
apper-0.976
 lid-0.917
 Pixie-0.895
PDATE-0.891
 accents-0.811
Token),
Token ID828
Feature activation+8.37e-02

Pos logit contributions
+0.166
give+0.149
hon+0.146
 decree+0.141
udo+0.140
Neg logit contributions
apper-0.212
 lid-0.199
 Pixie-0.196
PDATE-0.196
ANY-0.182
Token K
Token ID509
Feature activation+7.71e-01

Pos logit contributions
apper+0.152
 lid+0.143
 Pixie+0.140
PDATE+0.139
 Deluxe+0.130
Neg logit contributions
-0.135
give-0.121
hon-0.119
 decree-0.115
VICE-0.115
Tokenug
Token ID1018
Feature activation-4.46e+00

Pos logit contributions
apper+1.298
 lid+1.235
 Pixie+1.215
PDATE+1.206
 Deluxe+1.106
Neg logit contributions
-1.329
hon-1.205
give-1.204
udo-1.173
VICE-1.159
Tokenel
Token ID417
Feature activation-6.58e-01

Pos logit contributions
+4.531
hon+4.087
give+4.058
lyak+4.018
udo+3.884
Neg logit contributions
apper-9.566
PDATE-9.424
 Pixie-9.301
 lid-8.970
ANY-8.551
Token in
Token ID287
Feature activation-8.02e-01

Pos logit contributions
+0.981
hon+0.858
give+0.852
 affirm+0.827
udo+0.820
Neg logit contributions
apper-1.275
 lid-1.208
 Pixie-1.202
PDATE-1.196
ANY-1.108
Token Feldman
Token ID45498
Feature activation-1.00e+00

Pos logit contributions
+1.372
give+1.237
hon+1.231
 affirm+1.191
 decree+1.186
Neg logit contributions
apper-1.377
 lid-1.284
PDATE-1.261
 Pixie-1.254
ANY-1.161
Token,
Token ID11
Feature activation-2.06e-01

Pos logit contributions
+1.616
hon+1.430
give+1.417
 affirm+1.383
 decree+1.374
Neg logit contributions
apper-1.810
 lid-1.699
 Pixie-1.697
PDATE-1.691
ANY-1.562
Token K
Token ID509
Feature activation+5.05e-01

Pos logit contributions
+0.333
give+0.295
hon+0.294
VICE+0.282
 affirm+0.282
Neg logit contributions
apper-0.376
 lid-0.353
 Pixie-0.347
PDATE-0.344
ANY-0.319
Token showing
Token ID4478
Feature activation-2.58e-01

Pos logit contributions
+1.171
give+1.053
hon+1.042
 decree+1.034
 affirm+1.029
Neg logit contributions
apper-1.238
PDATE-1.176
 Pixie-1.132
 lid-1.131
rawler-1.054
Token the
Token ID262
Feature activation+2.27e-01

Pos logit contributions
+0.416
give+0.378
hon+0.367
 affirm+0.365
 decree+0.364
Neg logit contributions
apper-0.467
PDATE-0.431
 lid-0.431
 Pixie-0.425
 Deluxe-0.392
Token countdown
Token ID33970
Feature activation-2.61e-01

Pos logit contributions
apper+0.384
 lid+0.354
 Pixie+0.351
PDATE+0.350
ANY+0.319
Neg logit contributions
-0.397
give-0.361
hon-0.354
 decree-0.347
 affirm-0.344
Token.
Token ID13
Feature activation-5.29e-01

Pos logit contributions
+0.399
give+0.357
hon+0.351
 affirm+0.347
 decree+0.347
Neg logit contributions
apper-0.494
PDATE-0.466
 Pixie-0.461
 lid-0.456
ANY-0.427
Token Fifty
Token ID35351
Feature activation-9.74e-01

Pos logit contributions
+1.010
give+0.908
hon+0.899
udo+0.877
VICE+0.873
Neg logit contributions
apper-0.822
 lid-0.764
PDATE-0.759
 Pixie-0.726
 accents-0.669
Token minutes
Token ID2431
Feature activation-4.43e+00

Pos logit contributions
+1.458
give+1.291
hon+1.275
 decree+1.237
 affirm+1.229
Neg logit contributions
apper-1.877
 lid-1.754
PDATE-1.753
 Pixie-1.744
 Deluxe-1.600
Token.
Token ID13
Feature activation-9.96e-01

Pos logit contributions
+5.638
 decree+5.420
 affirm+5.379
hon+5.351
give+5.311
Neg logit contributions
PDATE-8.348
apper-8.172
 Pixie-7.806
 lid-7.317
rawler-7.194
Token
Token ID198
Feature activation-1.27e+00

Pos logit contributions
+1.867
give+1.693
hon+1.666
udo+1.628
VICE+1.621
Neg logit contributions
apper-1.562
 lid-1.457
PDATE-1.440
 Pixie-1.382
 accents-1.285
Token
Token ID198
Feature activation-1.36e+00

Pos logit contributions
+2.521
give+2.312
hon+2.286
VICE+2.235
udo+2.218
Neg logit contributions
apper-1.845
 lid-1.721
PDATE-1.685
 Pixie-1.675
 accents-1.491
TokenAn
Token ID2025
Feature activation-8.76e-02

Pos logit contributions
+2.318
give+2.090
hon+2.067
VICE+2.043
udo+1.973
Neg logit contributions
apper-2.341
 lid-2.259
PDATE-2.235
 Pixie-2.200
 accents-2.019
Token advisor
Token ID20685
Feature activation-8.35e-01

Pos logit contributions
+0.145
give+0.129
hon+0.128
 decree+0.123
VICE+0.123
Neg logit contributions
apper-0.156
 lid-0.147
 Pixie-0.145
PDATE-0.143
 Deluxe-0.133
Token Super
Token ID3115
Feature activation+3.91e-01

Pos logit contributions
apper+0.998
 lid+0.947
PDATE+0.920
 Pixie+0.916
ANY+0.848
Neg logit contributions
-0.915
give-0.824
hon-0.818
udo-0.791
VICE-0.779
Tokenbowl
Token ID36859
Feature activation+5.83e-01

Pos logit contributions
apper+0.686
 lid+0.653
PDATE+0.638
 Pixie+0.634
ANY+0.587
Neg logit contributions
-0.648
give-0.592
hon-0.587
udo-0.562
VICE-0.556
Token-
Token ID12
Feature activation+5.39e-01

Pos logit contributions
apper+1.009
 lid+0.948
 Pixie+0.922
PDATE+0.919
ANY+0.847
Neg logit contributions
-0.989
give-0.897
hon-0.886
udo-0.857
 decree-0.851
TokenItalian
Token ID45696
Feature activation+2.44e+00

Pos logit contributions
apper+0.925
 lid+0.875
 Pixie+0.852
PDATE+0.843
 Deluxe+0.778
Neg logit contributions
-0.928
give-0.833
hon-0.825
VICE-0.806
udo-0.796
Token St
Token ID520
Feature activation+1.39e+00

Pos logit contributions
apper+3.911
 lid+3.642
 Pixie+3.590
PDATE+3.459
 Deluxe+3.274
Neg logit contributions
-4.440
give-3.996
hon-3.955
VICE-3.874
 affirm-3.826
Tokenere
Token ID567
Feature activation-4.41e+00

Pos logit contributions
apper+2.328
 lid+2.215
 Pixie+2.153
PDATE+2.151
 Deluxe+1.975
Neg logit contributions
-2.400
give-2.179
hon-2.169
udo-2.112
VICE-2.077
Tokenotypes
Token ID13567
Feature activation-1.58e+00

Pos logit contributions
ograms+4.295
+4.061
hon+3.940
give+3.789
otypes+3.727
Neg logit contributions
PDATE-9.736
 lid-9.440
apper-9.221
 Pixie-8.970
ANY-8.809
Token-
Token ID12
Feature activation+4.17e-01

Pos logit contributions
+2.223
give+1.988
hon+1.985
udo+1.940
telling+1.851
Neg logit contributions
apper-3.077
 lid-2.935
PDATE-2.834
 Pixie-2.826
ANY-2.657
TokenRet
Token ID9781
Feature activation-4.55e-01

Pos logit contributions
apper+0.714
 lid+0.678
 Pixie+0.658
PDATE+0.652
 accents+0.600
Neg logit contributions
-0.720
give-0.647
hon-0.640
VICE-0.627
udo-0.617
Tokenards
Token ID1371
Feature activation-3.48e-01

Pos logit contributions
+0.769
give+0.700
hon+0.698
udo+0.678
VICE+0.672
Neg logit contributions
apper-0.766
 lid-0.735
PDATE-0.728
 Pixie-0.715
 accents-0.661
Token-
Token ID12
Feature activation+4.42e-01

Pos logit contributions
+0.609
give+0.555
hon+0.554
udo+0.536
VICE+0.528
Neg logit contributions
apper-0.578
 lid-0.546
PDATE-0.531
 Pixie-0.521
ANY-0.479
Tokenel
Token ID417
Feature activation-6.58e-01

Pos logit contributions
+4.531
hon+4.087
give+4.058
lyak+4.018
udo+3.884
Neg logit contributions
apper-9.566
PDATE-9.424
 Pixie-9.301
 lid-8.970
ANY-8.551
Token in
Token ID287
Feature activation-8.02e-01

Pos logit contributions
+0.981
hon+0.858
give+0.852
 affirm+0.827
udo+0.820
Neg logit contributions
apper-1.275
 lid-1.208
 Pixie-1.202
PDATE-1.196
ANY-1.108
Token Feldman
Token ID45498
Feature activation-1.00e+00

Pos logit contributions
+1.372
give+1.237
hon+1.231
 affirm+1.191
 decree+1.186
Neg logit contributions
apper-1.377
 lid-1.284
PDATE-1.261
 Pixie-1.254
ANY-1.161
Token,
Token ID11
Feature activation-2.06e-01

Pos logit contributions
+1.616
hon+1.430
give+1.417
 affirm+1.383
 decree+1.374
Neg logit contributions
apper-1.810
 lid-1.699
 Pixie-1.697
PDATE-1.691
ANY-1.562
Token K
Token ID509
Feature activation+5.05e-01

Pos logit contributions
+0.333
give+0.295
hon+0.294
VICE+0.282
 affirm+0.282
Neg logit contributions
apper-0.376
 lid-0.353
 Pixie-0.347
PDATE-0.344
ANY-0.319
Tokenug
Token ID1018
Feature activation-4.38e+00

Pos logit contributions
apper+0.818
 lid+0.779
 Pixie+0.762
PDATE+0.758
 accents+0.693
Neg logit contributions
-0.904
hon-0.820
give-0.819
udo-0.806
VICE-0.790
Tokenel
Token ID417
Feature activation-8.11e-01

Pos logit contributions
+4.302
hon+3.871
give+3.832
lyak+3.803
udo+3.712
Neg logit contributions
apper-9.534
PDATE-9.294
 Pixie-9.216
 lid-8.872
ANY-8.591
Token,
Token ID11
Feature activation-7.41e-03

Pos logit contributions
+1.277
hon+1.133
give+1.117
udo+1.087
 affirm+1.078
Neg logit contributions
apper-1.506
 lid-1.408
PDATE-1.401
 Pixie-1.391
ANY-1.302
Token and
Token ID290
Feature activation-3.24e-01

Pos logit contributions
+0.012
give+0.011
hon+0.010
VICE+0.010
udo+0.010
Neg logit contributions
apper-0.014
 lid-0.013
 Pixie-0.013
PDATE-0.012
 Deluxe-0.012
Token Schiff
Token ID41665
Feature activation+1.17e+00

Pos logit contributions
+0.552
give+0.498
hon+0.494
udo+0.477
VICE+0.475
Neg logit contributions
apper-0.558
 lid-0.521
PDATE-0.512
 Pixie-0.506
ANY-0.468
Tokenman
Token ID805
Feature activation+8.06e-01

Pos logit contributions
apper+1.743
 lid+1.580
 Pixie+1.496
 Deluxe+1.413
rawler+1.380
Neg logit contributions
-2.300
give-2.028
VICE-2.009
 decree-2.006
ograms-1.987
Token Encyclopedia
Token ID36732
Feature activation-2.35e-02

Pos logit contributions
apper+0.980
 lid+0.915
 Pixie+0.904
PDATE+0.899
ANY+0.845
Neg logit contributions
-0.800
give-0.735
hon-0.721
 affirm-0.718
 decree-0.707
Token,
Token ID11
Feature activation+9.75e-02

Pos logit contributions
+0.038
give+0.034
hon+0.033
VICE+0.033
udo+0.033
Neg logit contributions
apper-0.042
 lid-0.040
 Pixie-0.039
PDATE-0.038
 Deluxe-0.036
Token Church
Token ID4564
Feature activation-2.16e-01

Pos logit contributions
apper+0.197
 lid+0.187
 Pixie+0.184
PDATE+0.181
 Deluxe+0.171
Neg logit contributions
-0.137
give-0.120
hon-0.118
VICE-0.114
udo-0.114
Token Fathers
Token ID30969
Feature activation-2.49e+00

Pos logit contributions
+0.326
give+0.288
hon+0.285
VICE+0.276
udo+0.274
Neg logit contributions
apper-0.415
 lid-0.393
 Pixie-0.385
PDATE-0.381
 Deluxe-0.356
Token,
Token ID11
Feature activation-7.15e-01

Pos logit contributions
+3.082
hon+2.735
 affirm+2.661
give+2.621
VICE+2.603
Neg logit contributions
PDATE-5.340
apper-5.209
 lid-4.955
 Pixie-4.804
rawler-4.631
Token Sum
Token ID5060
Feature activation-4.37e+00

Pos logit contributions
+1.223
hon+1.094
give+1.091
 affirm+1.078
 decree+1.064
Neg logit contributions
apper-1.229
 lid-1.152
PDATE-1.127
 Pixie-1.124
rawler-1.033
Tokenma
Token ID2611
Feature activation-3.56e+00

Pos logit contributions
+2.861
hon+2.529
give+2.430
udo+2.372
gments+2.330
Neg logit contributions
PDATE-11.927
 Pixie-11.107
apper-11.068
 lid-10.968
ANY-10.361
Token,
Token ID11
Feature activation-1.99e-01

Pos logit contributions
+4.750
give+4.316
hon+4.279
 decree+4.234
 affirm+4.234
Neg logit contributions
apper-7.163
PDATE-6.934
 Pixie-6.590
 lid-6.571
ANY-6.222
Token Bible
Token ID9111
Feature activation+1.56e-01

Pos logit contributions
+0.330
give+0.296
hon+0.295
 affirm+0.288
 decree+0.286
Neg logit contributions
apper-0.355
 lid-0.330
PDATE-0.323
 Pixie-0.323
ANY-0.299
Token and
Token ID290
Feature activation-4.53e-01

Pos logit contributions
apper+0.299
 lid+0.285
 Pixie+0.278
PDATE+0.271
 Deluxe+0.259
Neg logit contributions
-0.238
give-0.209
hon-0.206
VICE-0.201
udo-0.201
Token more
Token ID517
Feature activation+2.92e-01

Pos logit contributions
+0.752
give+0.671
hon+0.663
VICE+0.646
udo+0.643
Neg logit contributions
apper-0.803
 lid-0.758
 Pixie-0.743
PDATE-0.729
 Deluxe-0.682
Token Life
Token ID5155
Feature activation-1.62e+00

Pos logit contributions
+0.295
give+0.266
hon+0.265
 decree+0.257
udo+0.256
Neg logit contributions
apper-0.293
 lid-0.276
PDATE-0.270
 Pixie-0.268
ANY-0.247
Token of
Token ID286
Feature activation-1.05e+00

Pos logit contributions
+2.168
hon+1.939
give+1.924
udo+1.840
 decree+1.804
Neg logit contributions
apper-3.335
 lid-3.203
PDATE-3.199
 Pixie-3.077
ANY-2.942
Token Christ
Token ID1951
Feature activation-1.67e+00

Pos logit contributions
+1.409
give+1.274
hon+1.252
udo+1.215
 affirm+1.198
Neg logit contributions
apper-2.145
PDATE-2.039
 lid-2.031
 Pixie-1.971
ANY-1.863
Token.
Token ID13
Feature activation-4.80e-01

Pos logit contributions
+2.447
hon+2.168
give+2.132
udo+2.053
 affirm+2.047
Neg logit contributions
apper-3.230
PDATE-3.060
 lid-3.057
 Pixie-3.018
ANY-2.808
Token Sac
Token ID9295
Feature activation-1.84e+00

Pos logit contributions
+0.865
give+0.789
hon+0.787
udo+0.758
 decree+0.756
Neg logit contributions
apper-0.780
 lid-0.732
PDATE-0.719
 Pixie-0.694
ANY-0.640
Tokenram
Token ID859
Feature activation-4.32e+00

Pos logit contributions
+3.301
give+3.177
hon+3.110
ograms+3.038
udo+2.997
Neg logit contributions
apper-2.767
PDATE-2.699
 lid-2.594
 Pixie-2.557
 accents-2.355
Tokenents
Token ID658
Feature activation-2.98e+00

Pos logit contributions
+6.666
thood+6.208
hon+6.205
give+6.141
ograms+6.074
Neg logit contributions
PDATE-6.940
apper-6.773
 lid-6.767
 Pixie-6.482
XT-5.967
Token.
Token ID13
Feature activation-3.13e-01

Pos logit contributions
+3.982
hon+3.537
give+3.444
 decree+3.439
udo+3.397
Neg logit contributions
apper-5.942
PDATE-5.658
 lid-5.575
 Pixie-5.564
rawler-5.200
Token Bapt
Token ID18226
Feature activation-6.60e-01

Pos logit contributions
+0.559
give+0.512
hon+0.512
udo+0.492
 affirm+0.491
Neg logit contributions
apper-0.515
 lid-0.482
PDATE-0.474
 Pixie-0.458
ANY-0.423
Tokenism
Token ID1042
Feature activation-1.05e+00

Pos logit contributions
+0.864
hon+0.813
give+0.790
gments+0.769
ograms+0.765
Neg logit contributions
apper-1.339
PDATE-1.335
 lid-1.299
 Pixie-1.290
ANY-1.178
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
+1.542
hon+1.395
give+1.381
udo+1.338
 decree+1.314
Neg logit contributions
apper-2.028
 lid-1.932
PDATE-1.923
 Pixie-1.905
ANY-1.746
Token
Token ID447
Feature activation+7.06e-01

Pos logit contributions
apper+0.062
 lid+0.058
PDATE+0.056
 Pixie+0.055
 accents+0.051
Neg logit contributions
-0.075
give-0.068
hon-0.067
VICE-0.067
udo-0.066
Token
Token ID251
Feature activation+2.36e+00

Pos logit contributions
apper+1.324
 lid+1.252
 Pixie+1.227
PDATE+1.207
 Deluxe+1.133
Neg logit contributions
-1.098
give-0.972
hon-0.961
VICE-0.933
udo-0.927
Token
Token ID198
Feature activation+2.41e-01

Pos logit contributions
apper+3.482
 lid+3.232
 Pixie+3.161
PDATE+3.068
 Deluxe+2.854
Neg logit contributions
-4.555
give-4.130
hon-4.096
VICE-3.998
udo-3.991
Token
Token ID198
Feature activation+9.09e-02

Pos logit contributions
apper+0.314
 lid+0.284
 Pixie+0.277
 Deluxe+0.253
ANY+0.251
Neg logit contributions
-0.516
give-0.464
hon-0.462
 decree-0.458
 affirm-0.455
TokenN
Token ID45
Feature activation-1.07e-01

Pos logit contributions
apper+0.155
 lid+0.149
 Pixie+0.145
PDATE+0.144
 accents+0.132
Neg logit contributions
-0.157
give-0.142
hon-0.140
VICE-0.138
udo-0.135
Tokenug
Token ID1018
Feature activation-4.28e+00

Pos logit contributions
+0.209
give+0.192
hon+0.190
udo+0.186
VICE+0.185
Neg logit contributions
apper-0.158
 lid-0.148
 Pixie-0.145
PDATE-0.142
 accents-0.130
Tokenent
Token ID298
Feature activation-1.44e+00

Pos logit contributions
+6.019
give+5.870
hon+5.667
udo+5.499
lyak+5.331
Neg logit contributions
PDATE-8.168
apper-7.817
 Pixie-7.733
 lid-7.475
 accents-6.833
Token,
Token ID11
Feature activation+4.69e-01

Pos logit contributions
+2.207
give+1.959
hon+1.940
 affirm+1.925
 decree+1.885
Neg logit contributions
apper-2.655
PDATE-2.585
 lid-2.497
 Pixie-2.483
ANY-2.289
Token who
Token ID508
Feature activation+5.39e-01

Pos logit contributions
apper+0.830
 lid+0.782
 Pixie+0.769
PDATE+0.766
ANY+0.705
Neg logit contributions
-0.773
give-0.694
hon-0.685
VICE-0.668
 affirm-0.661
Token has
Token ID468
Feature activation+1.05e+00

Pos logit contributions
apper+0.926
 lid+0.868
 Pixie+0.852
PDATE+0.849
ANY+0.778
Neg logit contributions
-0.921
give-0.828
hon-0.819
VICE-0.797
 affirm-0.794
Token been
Token ID587
Feature activation+1.59e+00

Pos logit contributions
apper+1.865
 lid+1.759
 Pixie+1.721
PDATE+1.685
 Deluxe+1.581
Neg logit contributions
-1.755
give-1.565
hon-1.549
VICE-1.508
udo-1.499
Token protest
Token ID5402
Feature activation-2.34e+00

Pos logit contributions
+0.773
give+0.699
hon+0.684
 decree+0.673
 affirm+0.669
Neg logit contributions
apper-0.834
PDATE-0.775
 lid-0.769
 Pixie-0.764
 Deluxe-0.697
Token featured
Token ID8096
Feature activation+8.63e-01

Pos logit contributions
+3.826
give+3.410
hon+3.325
 affirm+3.319
 decree+3.279
Neg logit contributions
apper-4.072
PDATE-3.928
 Pixie-3.823
 lid-3.780
ANY-3.465
Token speakers
Token ID11636
Feature activation-5.43e-01

Pos logit contributions
apper+1.495
 lid+1.400
 Pixie+1.373
PDATE+1.361
 Deluxe+1.258
Neg logit contributions
-1.470
give-1.321
hon-1.304
VICE-1.273
udo-1.263
Token including
Token ID1390
Feature activation-3.67e-03

Pos logit contributions
+0.817
give+0.718
hon+0.714
 affirm+0.693
 decree+0.685
Neg logit contributions
apper-1.047
 lid-0.989
PDATE-0.975
 Pixie-0.969
 Deluxe-0.900
Token M
Token ID337
Feature activation+3.95e-01

Pos logit contributions
+0.006
give+0.006
hon+0.006
VICE+0.006
 decree+0.006
Neg logit contributions
apper-0.006
 lid-0.006
PDATE-0.006
 Pixie-0.006
 Deluxe-0.005
Tokenere
Token ID567
Feature activation-4.28e+00

Pos logit contributions
apper+0.693
 lid+0.664
PDATE+0.652
 Pixie+0.652
 accents+0.597
Neg logit contributions
-0.649
hon-0.593
give-0.585
udo-0.570
VICE-0.568
Tokentz
Token ID22877
Feature activation-5.79e-02

Pos logit contributions
+3.361
hon+3.244
give+2.891
lyak+2.872
udo+2.764
Neg logit contributions
apper-10.120
PDATE-10.068
 lid-9.912
 Pixie-9.491
ANY-9.286
Token City
Token ID2254
Feature activation+1.70e-01

Pos logit contributions
+0.120
hon+0.110
give+0.110
 decree+0.107
VICE+0.107
Neg logit contributions
apper-0.077
 lid-0.071
PDATE-0.071
 Pixie-0.070
 accents-0.062
Token Council
Token ID4281
Feature activation-2.59e-01

Pos logit contributions
apper+0.243
 lid+0.225
PDATE+0.221
 Pixie+0.219
 accents+0.197
Neg logit contributions
-0.341
hon-0.312
give-0.311
VICE-0.301
 decree-0.300
Tokenwoman
Token ID8580
Feature activation-2.83e-01

Pos logit contributions
+0.462
hon+0.418
give+0.415
 decree+0.401
 affirm+0.398
Neg logit contributions
apper-0.420
 lid-0.402
PDATE-0.397
 Pixie-0.395
 accents-0.357
Token Dr
Token ID1583
Feature activation-1.12e+00

Pos logit contributions
+0.487
give+0.442
hon+0.439
 decree+0.423
 affirm+0.423
Neg logit contributions
apper-0.484
 lid-0.452
PDATE-0.449
 Pixie-0.435
 accents-0.402
Token a
Token ID257
Feature activation-7.58e-01

Pos logit contributions
+0.504
give+0.456
hon+0.450
 decree+0.438
 affirm+0.437
Neg logit contributions
apper-0.492
 lid-0.459
PDATE-0.453
 Pixie-0.447
 Deluxe-0.412
Token broad
Token ID3154
Feature activation+8.61e-01

Pos logit contributions
+1.250
give+1.137
hon+1.111
 decree+1.097
 affirm+1.082
Neg logit contributions
apper-1.327
PDATE-1.241
 lid-1.219
 Pixie-1.209
ANY-1.120
Token desert
Token ID10326
Feature activation+7.48e-01

Pos logit contributions
apper+1.526
 lid+1.433
 Pixie+1.406
PDATE+1.393
 Deluxe+1.294
Neg logit contributions
-1.431
give-1.279
hon-1.265
VICE-1.225
udo-1.222
Token (
Token ID357
Feature activation-1.66e+00

Pos logit contributions
apper+1.354
 lid+1.270
 Pixie+1.252
PDATE+1.239
 Deluxe+1.152
Neg logit contributions
-1.216
give-1.086
hon-1.071
VICE-1.040
udo-1.037
Tokenthe
Token ID1169
Feature activation-8.92e-01

Pos logit contributions
+2.305
give+2.116
hon+2.067
VICE+1.954
ograms+1.895
Neg logit contributions
apper-3.299
 lid-3.188
PDATE-3.146
 Pixie-3.112
 Deluxe-2.891
Token poems
Token ID31888
Feature activation-4.26e+00

Pos logit contributions
+1.524
give+1.389
hon+1.368
 decree+1.334
 affirm+1.323
Neg logit contributions
apper-1.514
PDATE-1.398
 lid-1.394
 Pixie-1.377
ANY-1.270
Token don
Token ID836
Feature activation-2.76e+00

Pos logit contributions
+6.263
give+5.899
 affirm+5.767
hon+5.667
 decree+5.619
Neg logit contributions
apper-7.543
PDATE-7.289
 Pixie-6.829
 lid-6.780
ANY-6.480
Token
Token ID447
Feature activation-7.25e-01

Pos logit contributions
+4.504
give+4.073
hon+3.968
VICE+3.835
udo+3.806
Neg logit contributions
apper-4.834
PDATE-4.727
 Pixie-4.568
 lid-4.564
 Deluxe-4.066
Token
Token ID247
Feature activation-1.91e+00

Pos logit contributions
+1.044
give+0.913
hon+0.902
VICE+0.874
udo+0.868
Neg logit contributions
apper-1.445
 lid-1.372
 Pixie-1.344
PDATE-1.321
 Deluxe-1.249
Tokent
Token ID83
Feature activation+1.90e+00

Pos logit contributions
+2.313
give+2.067
hon+1.977
udo+1.871
VICE+1.869
Neg logit contributions
apper-4.207
PDATE-4.033
 lid-3.980
 Pixie-3.973
 Deluxe-3.671
Token argue
Token ID7267
Feature activation-1.06e-01

Pos logit contributions
apper+3.624
 lid+3.475
 Pixie+3.319
 accents+3.150
PDATE+3.121
Neg logit contributions
-2.869
hon-2.469
give-2.434
VICE-2.414
udo-2.409
Token without
Token ID1231
Feature activation+8.41e-01

Pos logit contributions
+0.646
give+0.589
hon+0.584
 affirm+0.569
 decree+0.563
Neg logit contributions
apper-0.666
 lid-0.624
PDATE-0.624
 Pixie-0.614
 Deluxe-0.564
Token using
Token ID1262
Feature activation+7.25e-01

Pos logit contributions
apper+1.520
 lid+1.432
 Pixie+1.404
PDATE+1.383
 Deluxe+1.292
Neg logit contributions
-1.368
give-1.220
hon-1.205
VICE-1.171
udo-1.163
Token the
Token ID262
Feature activation+3.10e-01

Pos logit contributions
apper+1.270
 lid+1.191
 Pixie+1.168
PDATE+1.155
 Deluxe+1.072
Neg logit contributions
-1.221
give-1.094
hon-1.081
VICE-1.049
udo-1.044
Token International
Token ID4037
Feature activation+8.20e-01

Pos logit contributions
apper+0.538
 lid+0.502
PDATE+0.495
 Pixie+0.494
 Deluxe+0.453
Neg logit contributions
-0.523
give-0.472
hon-0.468
 affirm-0.452
VICE-0.450
Token Ph
Token ID1380
Feature activation-3.54e-01

Pos logit contributions
apper+1.267
 lid+1.179
PDATE+1.150
 Pixie+1.144
ANY+1.036
Neg logit contributions
-1.553
give-1.409
hon-1.400
VICE-1.352
 decree-1.352
Tokenon
Token ID261
Feature activation-4.25e+00

Pos logit contributions
+0.413
give+0.360
hon+0.359
udo+0.336
 affirm+0.332
Neg logit contributions
apper-0.784
PDATE-0.759
 lid-0.757
 Pixie-0.748
 Deluxe-0.696
Tokenetic
Token ID5139
Feature activation-4.36e-01

Pos logit contributions
+3.974
ograms+3.857
hon+3.558
give+3.371
udo+3.290
Neg logit contributions
PDATE-9.741
 lid-9.364
apper-9.358
 Pixie-9.097
ANY-8.614
Token Alphabet
Token ID45695
Feature activation-6.30e-01

Pos logit contributions
+0.701
hon+0.635
give+0.632
 decree+0.612
udo+0.608
Neg logit contributions
apper-0.782
PDATE-0.753
 lid-0.739
 Pixie-0.711
ANY-0.672
Token,
Token ID11
Feature activation-5.04e-01

Pos logit contributions
+1.014
give+0.899
hon+0.894
 decree+0.860
 affirm+0.858
Neg logit contributions
apper-1.139
PDATE-1.073
 lid-1.068
 Pixie-1.060
rawler-0.965
Token
Token ID447
Feature activation-7.25e-01

Pos logit contributions
+0.855
give+0.774
hon+0.763
VICE+0.738
 affirm+0.733
Neg logit contributions
apper-0.870
 lid-0.811
 Pixie-0.796
PDATE-0.792
 Deluxe-0.727
Token
Token ID251
Feature activation+1.09e+00

Pos logit contributions
+1.117
give+1.004
hon+0.987
udo+0.951
VICE+0.950
Neg logit contributions
apper-1.349
 lid-1.276
 Pixie-1.263
PDATE-1.227
 Deluxe-1.152
Tokenborn
Token ID6286
Feature activation+1.68e-01

Pos logit contributions
+1.237
give+1.160
hon+1.151
udo+1.111
VICE+1.107
Neg logit contributions
apper-0.865
PDATE-0.822
 lid-0.819
 Pixie-0.788
 Deluxe-0.702
Token Farmer
Token ID37755
Feature activation-4.36e-01

Pos logit contributions
apper+0.265
 lid+0.245
PDATE+0.244
 Pixie+0.238
 Deluxe+0.218
Neg logit contributions
-0.309
give-0.282
hon-0.279
VICE-0.273
 affirm-0.271
Token on
Token ID319
Feature activation-1.07e+00

Pos logit contributions
+0.758
give+0.679
hon+0.677
udo+0.657
VICE+0.653
Neg logit contributions
apper-0.737
 lid-0.691
PDATE-0.683
 Pixie-0.661
ANY-0.612
Token "
Token ID366
Feature activation-1.77e+00

Pos logit contributions
+1.694
give+1.528
hon+1.492
VICE+1.452
 affirm+1.449
Neg logit contributions
apper-1.958
 lid-1.817
PDATE-1.816
 Pixie-1.765
ANY-1.646
TokenFr
Token ID6732
Feature activation-2.13e+00

Pos logit contributions
+3.046
give+2.807
hon+2.778
VICE+2.738
udo+2.591
Neg logit contributions
apper-2.947
 lid-2.862
PDATE-2.813
 Pixie-2.709
 accents-2.544
Tokenances
Token ID1817
Feature activation-4.25e+00

Pos logit contributions
+3.088
give+2.945
hon+2.943
udo+2.870
thood+2.733
Neg logit contributions
PDATE-3.916
apper-3.805
 lid-3.751
 Pixie-3.730
 accents-3.397
Token Farmer
Token ID37755
Feature activation-6.72e-02

Pos logit contributions
+6.792
hon+6.252
give+6.244
thood+6.068
udo+5.958
Neg logit contributions
PDATE-6.961
apper-6.773
 lid-6.482
 Pixie-5.810
ANY-5.775
Token Will
Token ID2561
Feature activation-1.33e+00

Pos logit contributions
+0.103
give+0.092
hon+0.091
udo+0.088
VICE+0.088
Neg logit contributions
apper-0.127
 lid-0.120
PDATE-0.118
 Pixie-0.116
ANY-0.108
Token Have
Token ID8192
Feature activation+7.78e-02

Pos logit contributions
+2.290
give+2.184
hon+2.130
udo+2.042
 affirm+2.027
Neg logit contributions
apper-2.201
PDATE-2.134
 lid-2.063
 Pixie-1.976
ANY-1.857
Token Her
Token ID2332
Feature activation+6.20e-01

Pos logit contributions
apper+0.127
 lid+0.118
PDATE+0.116
 Pixie+0.114
 accents+0.104
Neg logit contributions
-0.139
give-0.128
hon-0.126
udo-0.122
VICE-0.121
Token Revenge
Token ID24366
Feature activation+2.31e-02

Pos logit contributions
apper+1.004
 lid+0.935
PDATE+0.917
 Pixie+0.910
 accents+0.831
Neg logit contributions
-1.120
give-1.020
hon-1.010
udo-0.979
VICE-0.979
Token layers
Token ID11685
Feature activation-1.23e+00

Pos logit contributions
+1.608
give+1.468
hon+1.451
 affirm+1.436
 decree+1.401
Neg logit contributions
apper-1.776
PDATE-1.654
 lid-1.650
 Pixie-1.633
 Deluxe-1.503
Token to
Token ID284
Feature activation-1.42e+00

Pos logit contributions
+1.737
hon+1.554
give+1.534
 affirm+1.497
 decree+1.474
Neg logit contributions
apper-2.418
PDATE-2.294
 Pixie-2.266
 lid-2.260
ANY-2.099
Token that
Token ID326
Feature activation-6.51e-01

Pos logit contributions
+2.219
give+1.995
hon+1.977
 affirm+1.950
VICE+1.892
Neg logit contributions
apper-2.580
PDATE-2.411
 lid-2.391
 Pixie-2.373
 Deluxe-2.199
Token.
Token ID13
Feature activation-7.42e-01

Pos logit contributions
+0.989
hon+0.890
give+0.885
 affirm+0.866
 decree+0.860
Neg logit contributions
apper-1.215
PDATE-1.137
 lid-1.129
 Pixie-1.125
ANY-1.038
Token St
Token ID520
Feature activation+6.73e-01

Pos logit contributions
+1.334
give+1.208
hon+1.192
VICE+1.162
udo+1.157
Neg logit contributions
apper-1.216
 lid-1.139
PDATE-1.115
 Pixie-1.090
 accents-0.992
Tokenere
Token ID567
Feature activation-4.24e+00

Pos logit contributions
apper+1.133
 lid+1.086
PDATE+1.059
 Pixie+1.059
 Deluxe+0.975
Neg logit contributions
-1.154
hon-1.049
give-1.047
udo-1.011
 affirm-1.001
Tokenotypes
Token ID13567
Feature activation-2.42e+00

Pos logit contributions
+1.318
ograms+1.163
otypes+0.986
hon+0.941
give+0.795
Neg logit contributions
PDATE-12.121
 lid-11.983
apper-11.933
 Pixie-11.601
 Deluxe-11.377
Token aren
Token ID3588
Feature activation-1.63e+00

Pos logit contributions
+3.485
give+3.185
hon+3.118
 affirm+3.101
 decree+3.050
Neg logit contributions
apper-4.618
PDATE-4.482
 Pixie-4.326
 lid-4.257
ANY-3.996
Token
Token ID447
Feature activation-5.34e-01

Pos logit contributions
+2.721
give+2.442
hon+2.404
VICE+2.333
udo+2.322
Neg logit contributions
apper-2.877
 lid-2.732
PDATE-2.716
 Pixie-2.681
 accents-2.432
Token
Token ID247
Feature activation-1.64e+00

Pos logit contributions
+0.828
give+0.733
hon+0.724
VICE+0.703
udo+0.699
Neg logit contributions
apper-1.003
 lid-0.949
 Pixie-0.930
PDATE-0.915
 Deluxe-0.859
Tokent
Token ID83
Feature activation+2.01e+00

Pos logit contributions
+2.085
give+1.835
hon+1.789
 affirm+1.698
udo+1.696
Neg logit contributions
apper-3.527
 lid-3.343
PDATE-3.320
 Pixie-3.302
 Deluxe-3.069
Token and
Token ID290
Feature activation-7.12e-01

Pos logit contributions
apper+1.106
 lid+1.034
 Pixie+1.025
PDATE+1.011
 Deluxe+0.935
Neg logit contributions
-0.998
give-0.890
hon-0.878
VICE-0.846
 decree-0.846
Token that
Token ID326
Feature activation-5.11e-02

Pos logit contributions
+1.121
give+1.009
hon+0.982
 affirm+0.955
 decree+0.955
Neg logit contributions
apper-1.316
 lid-1.228
PDATE-1.217
 Pixie-1.212
 Deluxe-1.117
Token he
Token ID339
Feature activation+3.74e-01

Pos logit contributions
+0.087
give+0.079
hon+0.078
 decree+0.075
VICE+0.075
Neg logit contributions
apper-0.088
 lid-0.082
PDATE-0.081
 Pixie-0.081
 Deluxe-0.074
Token will
Token ID481
Feature activation-1.56e+00

Pos logit contributions
apper+0.617
 lid+0.576
 Pixie+0.565
PDATE+0.556
 Deluxe+0.514
Neg logit contributions
-0.669
give-0.603
hon-0.597
VICE-0.580
udo-0.577
Token be
Token ID307
Feature activation-3.31e-01

Pos logit contributions
+2.171
give+1.951
hon+1.891
 decree+1.878
 affirm+1.869
Neg logit contributions
apper-3.143
PDATE-2.992
 Pixie-2.924
 lid-2.904
ANY-2.696
Token impe
Token ID18516
Feature activation-4.23e+00

Pos logit contributions
+0.457
give+0.401
hon+0.397
 decree+0.381
 affirm+0.381
Neg logit contributions
apper-0.677
 lid-0.634
 Pixie-0.631
PDATE-0.628
 Deluxe-0.585
Tokenached
Token ID2317
Feature activation-1.59e+00

Pos logit contributions
+7.257
give+6.932
hon+6.897
udo+6.708
gments+6.622
Neg logit contributions
PDATE-6.599
 Pixie-6.418
apper-6.347
 lid-6.091
 accents-5.611
Token,
Token ID11
Feature activation-6.30e-01

Pos logit contributions
+2.483
give+2.201
hon+2.161
 affirm+2.114
 decree+2.106
Neg logit contributions
apper-2.967
PDATE-2.750
 lid-2.747
 Pixie-2.726
 accents-2.533
Token
Token ID447
Feature activation+1.27e-02

Pos logit contributions
+1.088
give+0.984
hon+0.968
 affirm+0.932
 decree+0.932
Neg logit contributions
apper-1.079
 lid-0.999
 Pixie-0.983
PDATE-0.974
 Deluxe-0.900
Token
Token ID251
Feature activation+1.75e+00

Pos logit contributions
apper+0.021
 lid+0.020
 Pixie+0.020
PDATE+0.019
 Deluxe+0.018
Neg logit contributions
-0.022
give-0.020
hon-0.020
VICE-0.019
udo-0.019
Token she
Token ID673
Feature activation-5.17e-02

Pos logit contributions
apper+3.262
 lid+3.094
 Pixie+3.028
PDATE+2.947
 Deluxe+2.793
Neg logit contributions
-2.699
give-2.381
hon-2.358
VICE-2.303
udo-2.301
Token
Token ID198
Feature activation-2.50e-01

Pos logit contributions
+0.287
give+0.259
hon+0.258
 decree+0.253
VICE+0.253
Neg logit contributions
apper-0.183
 lid-0.167
 Pixie-0.163
PDATE-0.149
 Deluxe-0.148
TokenOn
Token ID2202
Feature activation+1.78e-01

Pos logit contributions
+0.439
give+0.396
hon+0.392
VICE+0.385
udo+0.378
Neg logit contributions
apper-0.419
 lid-0.399
 Pixie-0.389
PDATE-0.387
 accents-0.354
Token social
Token ID1919
Feature activation+9.63e-01

Pos logit contributions
apper+0.313
 lid+0.295
 Pixie+0.288
PDATE+0.286
 Deluxe+0.265
Neg logit contributions
-0.296
give-0.266
hon-0.262
VICE-0.255
udo-0.253
Token media
Token ID2056
Feature activation+5.24e-01

Pos logit contributions
apper+1.202
 lid+1.119
 Pixie+1.088
PDATE+1.060
 Deluxe+0.961
Neg logit contributions
-2.095
give-1.930
hon-1.913
VICE-1.873
 decree-1.870
Token,
Token ID11
Feature activation-7.38e-03

Pos logit contributions
apper+0.979
 lid+0.923
 Pixie+0.905
PDATE+0.901
 Deluxe+0.834
Neg logit contributions
-0.823
give-0.731
hon-0.724
VICE-0.704
 affirm-0.698
Token Michael
Token ID3899
Feature activation-4.23e+00

Pos logit contributions
+0.013
give+0.012
hon+0.011
VICE+0.011
 decree+0.011
Neg logit contributions
apper-0.013
 lid-0.012
 Pixie-0.012
PDATE-0.011
 Deluxe-0.011
Token G
Token ID402
Feature activation+1.07e-01

Pos logit contributions
+6.194
hon+5.520
give+5.468
 affirm+5.407
VICE+5.339
Neg logit contributions
apper-7.540
PDATE-7.105
 lid-7.080
 Pixie-6.937
rawler-6.375
Token.
Token ID13
Feature activation-8.69e-01

Pos logit contributions
apper+0.196
 lid+0.187
 Pixie+0.185
PDATE+0.181
 Deluxe+0.168
Neg logit contributions
-0.168
give-0.150
hon-0.149
VICE-0.144
 affirm-0.143
Token Flynn
Token ID14654
Feature activation-3.37e-01

Pos logit contributions
+1.533
give+1.381
hon+1.367
VICE+1.334
udo+1.325
Neg logit contributions
apper-1.443
 lid-1.354
 Pixie-1.326
PDATE-1.314
 Deluxe-1.204
Token has
Token ID468
Feature activation+6.72e-02

Pos logit contributions
+0.568
give+0.505
hon+0.504
 affirm+0.488
VICE+0.486
Neg logit contributions
apper-0.589
 lid-0.551
PDATE-0.543
 Pixie-0.543
ANY-0.494
Token shared
Token ID4888
Feature activation+1.09e+00

Pos logit contributions
apper+0.121
 lid+0.114
 Pixie+0.111
PDATE+0.111
 Deluxe+0.102
Neg logit contributions
-0.110
give-0.098
hon-0.097
VICE-0.094
 affirm-0.093
Token of
Token ID286
Feature activation-3.21e-01

Pos logit contributions
+1.649
give+1.413
hon+1.407
 affirm+1.346
 decree+1.343
Neg logit contributions
apper-2.469
 lid-2.302
PDATE-2.280
 Pixie-2.264
rawler-2.099
Token his
Token ID465
Feature activation-8.16e-01

Pos logit contributions
+0.553
hon+0.501
give+0.498
VICE+0.484
 decree+0.483
Neg logit contributions
apper-0.541
 lid-0.505
 Pixie-0.493
PDATE-0.491
 accents-0.453
Token political
Token ID1964
Feature activation-1.82e+00

Pos logit contributions
+1.248
hon+1.116
give+1.114
 decree+1.085
VICE+1.076
Neg logit contributions
apper-1.535
 lid-1.426
PDATE-1.422
 Pixie-1.411
 accents-1.316
Token career
Token ID3451
Feature activation-2.54e-01

Pos logit contributions
+3.144
hon+2.857
give+2.851
VICE+2.787
 decree+2.777
Neg logit contributions
apper-3.013
PDATE-2.753
 lid-2.744
 Pixie-2.729
 accents-2.543
Token,
Token ID11
Feature activation-4.33e-01

Pos logit contributions
+0.378
hon+0.331
give+0.330
VICE+0.318
 affirm+0.316
Neg logit contributions
apper-0.500
 lid-0.468
 Pixie-0.463
PDATE-0.461
 Deluxe-0.426
Token Bernie
Token ID10477
Feature activation-4.22e+00

Pos logit contributions
+0.685
give+0.614
hon+0.607
VICE+0.590
 affirm+0.586
Neg logit contributions
apper-0.798
 lid-0.745
PDATE-0.738
 Pixie-0.736
 Deluxe-0.673
Token
Token ID447
Feature activation-2.89e-01

Pos logit contributions
+6.005
hon+5.342
 unquestion+5.206
 affirm+5.187
give+5.078
Neg logit contributions
apper-7.831
PDATE-7.440
 Pixie-7.314
 lid-7.296
ANE-6.633
Token
Token ID247
Feature activation-1.43e+00

Pos logit contributions
+0.394
give+0.347
hon+0.342
VICE+0.330
udo+0.327
Neg logit contributions
apper-0.591
 lid-0.560
 Pixie-0.558
PDATE-0.549
 accents-0.513
Tokens
Token ID82
Feature activation+2.06e+00

Pos logit contributions
+2.161
hon+1.934
give+1.911
VICE+1.860
 affirm+1.825
Neg logit contributions
apper-2.735
PDATE-2.562
 lid-2.561
 Pixie-2.556
ANY-2.324
Token current
Token ID1459
Feature activation-4.20e-01

Pos logit contributions
apper+3.426
 lid+3.237
 Pixie+3.147
PDATE+3.055
 Deluxe+2.881
Neg logit contributions
-3.632
give-3.244
hon-3.216
VICE-3.135
udo-3.129
Token Millenn
Token ID19007
Feature activation-1.39e+00

Pos logit contributions
+0.755
give+0.687
hon+0.672
 decree+0.664
 affirm+0.663
Neg logit contributions
apper-0.683
PDATE-0.634
 lid-0.627
 Pixie-0.622
ANY-0.569
TokenA
Token ID32
Feature activation+5.25e-01

Pos logit contributions
+2.019
give+1.857
hon+1.829
VICE+1.823
udo+1.740
Neg logit contributions
apper-1.968
 lid-1.892
PDATE-1.863
 Pixie-1.850
 accents-1.685
Token system
Token ID1080
Feature activation-6.50e-01

Pos logit contributions
apper+0.903
 lid+0.844
 Pixie+0.829
PDATE+0.818
 accents+0.758
Neg logit contributions
-0.900
give-0.810
hon-0.800
VICE-0.779
udo-0.773
Token that
Token ID326
Feature activation-5.27e-01

Pos logit contributions
+0.996
give+0.902
hon+0.891
 affirm+0.862
VICE+0.857
Neg logit contributions
apper-1.212
 lid-1.130
 Pixie-1.124
PDATE-1.122
ANY-1.031
Token ensures
Token ID19047
Feature activation+4.23e-01

Pos logit contributions
+0.876
give+0.796
hon+0.786
 affirm+0.770
VICE+0.761
Neg logit contributions
apper-0.923
 lid-0.856
PDATE-0.853
 Pixie-0.846
ANY-0.776
Token no
Token ID645
Feature activation+3.91e-01

Pos logit contributions
apper+0.782
 lid+0.731
 Pixie+0.720
PDATE+0.717
 Deluxe+0.664
Neg logit contributions
-0.669
give-0.601
hon-0.592
VICE-0.575
 affirm-0.573
Token Canadian
Token ID5398
Feature activation+2.76e+00

Pos logit contributions
apper+0.693
 lid+0.646
 Pixie+0.637
PDATE+0.632
 Deluxe+0.585
Neg logit contributions
-0.649
give-0.583
hon-0.575
VICE-0.563
 affirm-0.556
Token patient
Token ID5827
Feature activation+3.91e-01

Pos logit contributions
apper+4.896
 lid+4.649
 Pixie+4.540
PDATE+4.392
 Deluxe+4.174
Neg logit contributions
-4.554
give-4.014
hon-3.996
udo-3.871
VICE-3.869
Token is
Token ID318
Feature activation+2.79e-01

Pos logit contributions
apper+0.665
 lid+0.616
 Pixie+0.610
PDATE+0.607
 Deluxe+0.554
Neg logit contributions
-0.673
give-0.610
hon-0.598
VICE-0.595
 affirm-0.585
Token left
Token ID1364
Feature activation-8.17e-01

Pos logit contributions
apper+0.533
 lid+0.502
 Pixie+0.494
PDATE+0.489
 Deluxe+0.456
Neg logit contributions
-0.426
give-0.378
hon-0.372
VICE-0.362
udo-0.359
Token unable
Token ID5906
Feature activation-8.49e-02

Pos logit contributions
+1.404
give+1.299
hon+1.271
 affirm+1.267
VICE+1.252
Neg logit contributions
apper-1.353
PDATE-1.251
 Pixie-1.237
 lid-1.230
 Temperature-1.134
Token to
Token ID284
Feature activation+8.28e-01

Pos logit contributions
+0.125
give+0.112
hon+0.109
VICE+0.107
 affirm+0.107
Neg logit contributions
apper-0.165
 lid-0.154
 Pixie-0.154
PDATE-0.151
 Deluxe-0.142
Token doing
Token ID1804
Feature activation-3.98e-01

Pos logit contributions
+0.834
give+0.755
hon+0.735
 affirm+0.718
VICE+0.718
Neg logit contributions
apper-0.972
 lid-0.911
PDATE-0.900
 Pixie-0.885
 Deluxe-0.812
Token an
Token ID281
Feature activation-5.64e-01

Pos logit contributions
+0.650
give+0.589
hon+0.576
VICE+0.561
 affirm+0.560
Neg logit contributions
apper-0.713
 lid-0.668
PDATE-0.654
 Pixie-0.650
 Deluxe-0.597
Token album
Token ID5062
Feature activation-4.95e-01

Pos logit contributions
+0.992
hon+0.897
give+0.897
 affirm+0.866
VICE+0.864
Neg logit contributions
apper-0.928
 lid-0.871
PDATE-0.850
 Pixie-0.847
 Deluxe-0.770
Token with
Token ID351
Feature activation+1.28e-01

Pos logit contributions
+0.828
give+0.739
hon+0.729
 decree+0.711
 affirm+0.708
Neg logit contributions
apper-0.868
 lid-0.808
PDATE-0.800
 Pixie-0.797
 accents-0.727
Token each
Token ID1123
Feature activation-1.70e+00

Pos logit contributions
apper+0.222
 lid+0.206
PDATE+0.203
 Pixie+0.201
 Deluxe+0.184
Neg logit contributions
-0.218
give-0.197
hon-0.194
VICE-0.188
 affirm-0.187
Token band
Token ID4097
Feature activation+2.65e+00

Pos logit contributions
+2.749
give+2.474
hon+2.462
 decree+2.421
 affirm+2.385
Neg logit contributions
apper-2.927
PDATE-2.836
 lid-2.724
 Pixie-2.702
rawler-2.479
Token.
Token ID13
Feature activation-7.04e-01

Pos logit contributions
apper+4.592
 lid+4.316
 Pixie+4.225
PDATE+4.053
 Deluxe+3.882
Neg logit contributions
-4.461
give-3.985
hon-3.923
VICE-3.852
udo-3.813
Token So
Token ID1406
Feature activation-3.00e-01

Pos logit contributions
+1.276
give+1.151
hon+1.132
VICE+1.115
udo+1.103
Neg logit contributions
apper-1.147
 lid-1.076
PDATE-1.048
 Pixie-1.020
 accents-0.941
Token,
Token ID11
Feature activation-2.88e-01

Pos logit contributions
+0.457
give+0.408
hon+0.400
VICE+0.389
 decree+0.384
Neg logit contributions
apper-0.567
 lid-0.538
 Pixie-0.526
PDATE-0.524
 Deluxe-0.485
Token there
Token ID612
Feature activation+8.60e-01

Pos logit contributions
+0.455
give+0.407
hon+0.401
VICE+0.389
 decree+0.386
Neg logit contributions
apper-0.531
 lid-0.499
 Pixie-0.489
PDATE-0.487
 Deluxe-0.449
Token are
Token ID389
Feature activation-1.23e+00

Pos logit contributions
apper+1.385
 lid+1.296
 Pixie+1.269
PDATE+1.261
 Deluxe+1.147
Neg logit contributions
-1.558
give-1.414
hon-1.404
VICE-1.367
udo-1.358
Token answer
Token ID3280
Feature activation+6.40e-01

Pos logit contributions
+0.277
give+0.248
hon+0.242
 affirm+0.236
 decree+0.235
Neg logit contributions
apper-0.384
 lid-0.361
PDATE-0.361
 Pixie-0.359
ANY-0.331
Token for
Token ID329
Feature activation-1.18e-01

Pos logit contributions
apper+1.177
 lid+1.108
 Pixie+1.089
PDATE+1.079
 Deluxe+1.003
Neg logit contributions
-1.015
give-0.909
hon-0.895
VICE-0.875
 affirm-0.869
Token Trump
Token ID1301
Feature activation-1.38e+00

Pos logit contributions
+0.192
give+0.175
hon+0.170
VICE+0.167
 affirm+0.166
Neg logit contributions
apper-0.211
 lid-0.197
PDATE-0.196
 Pixie-0.192
 Deluxe-0.178
Token
Token ID447
Feature activation+1.16e+00

Pos logit contributions
+2.269
hon+2.084
give+2.069
 affirm+2.009
 decree+2.004
Neg logit contributions
apper-2.353
 lid-2.248
PDATE-2.236
 Pixie-2.185
rawler-1.985
Token
Token ID247
Feature activation-1.08e+00

Pos logit contributions
apper+2.275
 lid+2.162
 Pixie+2.123
PDATE+2.093
 Deluxe+1.963
Neg logit contributions
-1.699
give-1.493
hon-1.477
VICE-1.430
udo-1.421
Tokens
Token ID82
Feature activation+2.68e+00

Pos logit contributions
+1.776
give+1.612
hon+1.583
VICE+1.540
 affirm+1.525
Neg logit contributions
apper-1.908
 lid-1.813
 Pixie-1.773
PDATE-1.769
 Deluxe-1.622
Token many
Token ID867
Feature activation-5.35e-01

Pos logit contributions
apper+4.801
 lid+4.559
 Pixie+4.427
PDATE+4.279
 accents+4.103
Neg logit contributions
-4.346
give-3.817
hon-3.783
VICE-3.687
udo-3.680
Token mistakes
Token ID10135
Feature activation-9.01e-01

Pos logit contributions
+0.918
give+0.830
hon+0.822
 affirm+0.803
 decree+0.797
Neg logit contributions
apper-0.912
 lid-0.851
PDATE-0.842
 Pixie-0.839
 Deluxe-0.762
Token,
Token ID11
Feature activation-3.21e-01

Pos logit contributions
+1.413
give+1.252
hon+1.240
 affirm+1.202
 decree+1.189
Neg logit contributions
apper-1.673
 lid-1.575
PDATE-1.575
 Pixie-1.561
rawler-1.440
Token ignoring
Token ID15482
Feature activation-5.47e-01

Pos logit contributions
+0.491
give+0.443
hon+0.435
 affirm+0.422
VICE+0.421
Neg logit contributions
apper-0.606
 lid-0.570
PDATE-0.560
 Pixie-0.558
 Deluxe-0.517
Token the
Token ID262
Feature activation+1.36e-01

Pos logit contributions
+0.882
give+0.803
hon+0.784
 affirm+0.775
 decree+0.769
Neg logit contributions
apper-0.984
PDATE-0.920
 lid-0.915
 Pixie-0.897
rawler-0.831
Token by
Token ID416
Feature activation-1.08e-01

Pos logit contributions
apper+0.530
 lid+0.498
 Pixie+0.490
PDATE+0.486
 Deluxe+0.450
Neg logit contributions
-0.484
give-0.432
hon-0.429
 decree-0.413
VICE-0.413
Token forces
Token ID3386
Feature activation-1.77e-01

Pos logit contributions
+0.181
give+0.165
hon+0.162
 decree+0.160
VICE+0.158
Neg logit contributions
apper-0.189
PDATE-0.174
 lid-0.174
 Pixie-0.172
ANY-0.159
Token from
Token ID422
Feature activation-4.67e-01

Pos logit contributions
+0.273
give+0.245
hon+0.242
 decree+0.238
 affirm+0.235
Neg logit contributions
apper-0.332
 lid-0.310
PDATE-0.306
 Pixie-0.305
 Deluxe-0.282
Token Kenya
Token ID21506
Feature activation+1.09e+00

Pos logit contributions
+0.765
give+0.695
hon+0.685
 decree+0.670
VICE+0.667
Neg logit contributions
apper-0.830
 lid-0.767
PDATE-0.766
 Pixie-0.757
ANY-0.704
Token and
Token ID290
Feature activation-3.57e-01

Pos logit contributions
apper+2.102
 lid+1.990
 Pixie+1.959
PDATE+1.938
 Deluxe+1.809
Neg logit contributions
-1.648
give-1.450
hon-1.437
udo-1.382
VICE-1.381
Token Ethiopia
Token ID31592
Feature activation+3.13e+00

Pos logit contributions
+0.583
give+0.529
hon+0.521
VICE+0.506
 decree+0.504
Neg logit contributions
apper-0.638
 lid-0.595
PDATE-0.587
 Pixie-0.585
 Deluxe-0.540
Token,
Token ID11
Feature activation-1.08e-01

Pos logit contributions
apper+5.283
 lid+4.975
 Pixie+4.847
PDATE+4.687
 accents+4.468
Neg logit contributions
-5.324
give-4.760
hon-4.707
VICE-4.632
udo-4.581
Token who
Token ID508
Feature activation-1.05e-01

Pos logit contributions
+0.168
give+0.151
hon+0.148
VICE+0.144
 decree+0.144
Neg logit contributions
apper-0.202
 lid-0.189
 Pixie-0.186
PDATE-0.185
 Deluxe-0.172
Token engaged
Token ID7953
Feature activation+6.85e-01

Pos logit contributions
+0.156
give+0.139
hon+0.137
 decree+0.132
VICE+0.132
Neg logit contributions
apper-0.205
 lid-0.194
 Pixie-0.190
PDATE-0.190
 Deluxe-0.176
Token in
Token ID287
Feature activation-4.84e-01

Pos logit contributions
apper+1.181
 lid+1.094
PDATE+1.080
 Pixie+1.077
 Deluxe+0.984
Neg logit contributions
-1.174
give-1.065
hon-1.046
VICE-1.025
 combatants-1.017
Token heavy
Token ID4334
Feature activation+4.77e-01

Pos logit contributions
+0.805
give+0.738
hon+0.722
 affirm+0.712
 decree+0.705
Neg logit contributions
apper-0.845
PDATE-0.785
 lid-0.774
 Pixie-0.770
 Deluxe-0.715
Tokenres
Token ID411
Feature activation+6.78e-01

Pos logit contributions
apper+2.795
 lid+2.591
 Pixie+2.478
 Deluxe+2.376
PDATE+2.358
Neg logit contributions
-1.940
give-1.606
hon-1.583
 decree-1.579
thood-1.571
Tokenize
Token ID1096
Feature activation+1.36e+00

Pos logit contributions
apper+1.234
 lid+1.138
 Pixie+1.096
 Deluxe+1.033
 accents+1.015
Neg logit contributions
-1.111
hon-0.955
give-0.950
 decree-0.944
VICE-0.942
Token(
Token ID7
Feature activation+1.39e+00

Pos logit contributions
apper+2.265
 lid+2.055
 Pixie+2.015
ANY+1.948
 Deluxe+1.927
Neg logit contributions
-2.374
 decree-2.108
hon-2.094
 affirm-2.093
thood-2.065
Tokenimage
Token ID9060
Feature activation-1.47e-02

Pos logit contributions
apper+2.226
 lid+2.007
 Pixie+1.886
ANY+1.812
rawler+1.791
Neg logit contributions
-2.583
 decree-2.274
hon-2.254
 affirm-2.249
 combatants-2.234
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
+0.025
hon+0.022
give+0.022
 decree+0.021
 affirm+0.021
Neg logit contributions
apper-0.026
 lid-0.024
 Pixie-0.024
PDATE-0.022
 Deluxe-0.022
Token height
Token ID6001
Feature activation+3.60e+00

Pos logit contributions
apper+1.779
 lid+1.700
 Pixie+1.558
 accents+1.424
PDATE+1.410
Neg logit contributions
-3.272
give-2.937
hon-2.936
VICE-2.888
ograms-2.879
Token=
Token ID28
Feature activation+1.17e+00

Pos logit contributions
apper+4.696
 Pixie+4.127
ANY+4.060
 lid+4.031
 Deluxe+3.879
Neg logit contributions
-7.034
 decree-6.600
 affirm-6.434
gments-6.408
hon-6.378
Token600
Token ID8054
Feature activation+7.46e-01

Pos logit contributions
apper+2.061
 lid+1.921
 Pixie+1.882
PDATE+1.828
ANY+1.740
Neg logit contributions
-1.946
give-1.733
hon-1.726
 decree-1.673
 affirm-1.668
Token)
Token ID8
Feature activation+1.58e+00

Pos logit contributions
apper+1.060
 lid+0.982
 Pixie+0.938
 Deluxe+0.876
ANY+0.858
Neg logit contributions
-1.493
hon-1.338
give-1.335
 decree-1.323
 affirm-1.315
Token gray
Token ID12768
Feature activation+5.06e-01

Pos logit contributions
apper+2.927
 lid+2.743
 Pixie+2.721
PDATE+2.665
 Deluxe+2.499
Neg logit contributions
-2.498
give-2.229
hon-2.196
VICE-2.143
 decree-2.136
Token =
Token ID796
Feature activation-2.12e-01

Pos logit contributions
apper+0.859
 lid+0.797
 Pixie+0.781
 Deluxe+0.725
PDATE+0.718
Neg logit contributions
-0.873
give-0.778
hon-0.776
 decree-0.763
 affirm-0.759
Token is
Token ID318
Feature activation-1.23e+00

Pos logit contributions
+0.971
give+0.878
hon+0.863
 decree+0.845
 affirm+0.843
Neg logit contributions
apper-1.022
 Pixie-0.943
PDATE-0.940
 lid-0.939
ANY-0.862
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
+1.784
give+1.597
hon+1.568
 affirm+1.528
 decree+1.517
Neg logit contributions
apper-2.401
PDATE-2.239
 Pixie-2.238
 lid-2.231
 Deluxe-2.061
Token role
Token ID2597
Feature activation-1.23e+00

Pos logit contributions
+1.889
give+1.751
hon+1.725
 decree+1.709
 affirm+1.689
Neg logit contributions
apper-1.830
PDATE-1.722
 Pixie-1.685
 lid-1.666
ANY-1.546
Token played
Token ID2826
Feature activation-4.04e-01

Pos logit contributions
+1.706
give+1.519
hon+1.499
 affirm+1.462
 decree+1.456
Neg logit contributions
apper-2.443
PDATE-2.305
 lid-2.286
 Pixie-2.281
ANY-2.111
Token by
Token ID416
Feature activation-6.08e-01

Pos logit contributions
+0.563
give+0.493
hon+0.489
 affirm+0.475
VICE+0.468
Neg logit contributions
apper-0.822
 lid-0.776
 Pixie-0.765
PDATE-0.752
 Deluxe-0.710
Token state
Token ID1181
Feature activation+6.58e-01

Pos logit contributions
+1.034
give+0.951
hon+0.925
 decree+0.910
 affirm+0.907
Neg logit contributions
apper-1.034
PDATE-0.969
 lid-0.949
 Pixie-0.944
ANY-0.862
Token supreme
Token ID17700
Feature activation-2.30e-01

Pos logit contributions
apper+1.134
 lid+1.061
 Pixie+1.042
PDATE+1.039
 Deluxe+0.951
Neg logit contributions
-1.121
give-1.015
hon-1.001
 affirm-0.969
 decree-0.968
Token court
Token ID2184
Feature activation-1.63e+00

Pos logit contributions
+0.326
give+0.297
hon+0.291
 affirm+0.287
 decree+0.284
Neg logit contributions
apper-0.455
PDATE-0.431
 lid-0.428
 Pixie-0.423
ANY-0.395
Token justices
Token ID27818
Feature activation-1.83e+00

Pos logit contributions
+2.152
give+1.957
 affirm+1.901
hon+1.889
 decree+1.888
Neg logit contributions
apper-3.287
PDATE-3.170
 lid-3.090
 Pixie-3.081
 Deluxe-2.918
Token in
Token ID287
Feature activation-6.04e-01

Pos logit contributions
+2.765
hon+2.455
give+2.446
 affirm+2.372
 decree+2.370
Neg logit contributions
apper-3.401
 lid-3.228
PDATE-3.194
 Pixie-3.135
rawler-2.956
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
+0.907
give+0.823
hon+0.804
 decree+0.795
 affirm+0.792
Neg logit contributions
apper-1.143
PDATE-1.069
 lid-1.063
 Pixie-1.057
 Deluxe-0.977
Token used
Token ID973
Feature activation+2.79e-01

Pos logit contributions
apper+1.277
 lid+1.194
 Pixie+1.176
PDATE+1.173
 Deluxe+1.075
Neg logit contributions
-1.277
give-1.146
hon-1.138
VICE-1.107
udo-1.099
Token to
Token ID284
Feature activation-3.23e-01

Pos logit contributions
apper+0.530
PDATE+0.500
 Pixie+0.498
 lid+0.496
ANY+0.463
Neg logit contributions
-0.421
give-0.375
hon-0.370
 affirm-0.358
VICE-0.357
Token show
Token ID905
Feature activation+1.34e+00

Pos logit contributions
+0.452
give+0.402
hon+0.394
 affirm+0.385
 decree+0.381
Neg logit contributions
apper-0.647
PDATE-0.616
 lid-0.611
 Pixie-0.610
ANY-0.565
Token the
Token ID262
Feature activation+1.59e+00

Pos logit contributions
apper+2.384
 lid+2.234
 Pixie+2.193
PDATE+2.186
 Deluxe+2.013
Neg logit contributions
-2.206
give-1.988
hon-1.951
VICE-1.901
 affirm-1.894
Token extent
Token ID6287
Feature activation+1.53e+00

Pos logit contributions
apper+2.690
 lid+2.516
 Pixie+2.470
PDATE+2.440
 Deluxe+2.254
Neg logit contributions
-2.760
give-2.486
hon-2.454
VICE-2.389
udo-2.377
Token of
Token ID286
Feature activation+1.16e+00

Pos logit contributions
apper+3.069
 lid+2.908
 Pixie+2.859
PDATE+2.822
 Deluxe+2.653
Neg logit contributions
-2.204
give-1.932
hon-1.906
VICE-1.842
udo-1.835
Token the
Token ID262
Feature activation+7.36e-01

Pos logit contributions
apper+2.089
 lid+1.951
PDATE+1.932
 Pixie+1.916
ANY+1.771
Neg logit contributions
-1.871
give-1.685
hon-1.660
VICE-1.613
 decree-1.609
Token rubbish
Token ID35022
Feature activation+1.49e+00

Pos logit contributions
apper+1.352
PDATE+1.258
 lid+1.257
 Pixie+1.245
ANY+1.154
Neg logit contributions
-1.164
give-1.048
hon-1.028
 decree-1.007
 affirm-0.998
Token
Token ID198
Feature activation+6.74e-01

Pos logit contributions
apper+2.555
 lid+2.398
 Pixie+2.350
PDATE+2.314
 Deluxe+2.150
Neg logit contributions
-2.568
give-2.302
hon-2.278
VICE-2.217
udo-2.205
Token
Token ID198
Feature activation+3.17e-01

Pos logit contributions
apper+0.858
 lid+0.777
 Pixie+0.760
 Deluxe+0.699
ANY+0.679
Neg logit contributions
-1.455
hon-1.308
give-1.305
 decree-1.294
 affirm-1.288
TokenThe
Token ID464
Feature activation+1.25e+00

Pos logit contributions
apper+0.512
 lid+0.488
PDATE+0.476
 Pixie+0.476
 accents+0.433
Neg logit contributions
-0.575
give-0.521
hon-0.515
VICE-0.510
udo-0.497
Tokenpatrick
Token ID29615
Feature activation-1.05e-01

Pos logit contributions
+0.965
give+0.867
hon+0.863
VICE+0.820
udo+0.820
Neg logit contributions
apper-1.221
 lid-1.179
 Pixie-1.153
PDATE-1.152
 accents-1.065
Token_
Token ID62
Feature activation+4.36e-01

Pos logit contributions
+0.164
give+0.146
hon+0.144
udo+0.139
VICE+0.139
Neg logit contributions
apper-0.198
 lid-0.187
 Pixie-0.185
PDATE-0.185
 Deluxe-0.170
Token12
Token ID1065
Feature activation+1.40e-01

Pos logit contributions
apper+0.858
 lid+0.814
 Pixie+0.799
PDATE+0.788
 Deluxe+0.741
Neg logit contributions
-0.637
give-0.561
hon-0.554
VICE-0.535
udo-0.532
Token,
Token ID11
Feature activation+1.78e-01

Pos logit contributions
apper+0.256
 lid+0.243
 Pixie+0.239
PDATE+0.238
 accents+0.218
Neg logit contributions
-0.225
give-0.200
hon-0.197
VICE-0.193
udo-0.190
Token writes
Token ID6797
Feature activation-6.71e-01

Pos logit contributions
apper+0.406
 lid+0.387
 Pixie+0.381
PDATE+0.380
 Deluxe+0.357
Neg logit contributions
-0.204
give-0.173
hon-0.169
VICE-0.164
 affirm-0.161
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
+1.108
give+0.992
VICE+0.964
hon+0.962
 affirm+0.956
Neg logit contributions
apper-1.189
 lid-1.111
PDATE-1.109
 Pixie-1.091
ANY-1.019
Token "
Token ID366
Feature activation-4.77e-01

Pos logit contributions
apper+0.298
 lid+0.284
PDATE+0.273
 Pixie+0.271
 accents+0.252
Neg logit contributions
-0.281
give-0.252
hon-0.252
VICE-0.246
udo-0.243
TokenHe
Token ID1544
Feature activation-2.25e-01

Pos logit contributions
+0.780
give+0.697
hon+0.689
VICE+0.675
udo+0.664
Neg logit contributions
apper-0.860
 lid-0.818
 Pixie-0.793
PDATE-0.792
 accents-0.730
Token had
Token ID550
Feature activation+1.36e-01

Pos logit contributions
+0.376
give+0.335
hon+0.332
VICE+0.322
udo+0.322
Neg logit contributions
apper-0.396
 lid-0.372
 Pixie-0.365
PDATE-0.364
 Deluxe-0.335
Token '
Token ID705
Feature activation-1.41e+00

Pos logit contributions
apper+0.229
 lid+0.212
PDATE+0.211
 Pixie+0.208
 Deluxe+0.192
Neg logit contributions
-0.237
give-0.214
hon-0.211
 affirm-0.206
udo-0.206
TokenF
Token ID37
Feature activation+4.52e-01

Pos logit contributions
+2.284
give+2.095
hon+2.076
VICE+1.994
udo+1.938
Neg logit contributions
apper-2.508
 lid-2.416
PDATE-2.389
 Pixie-2.353
 Deluxe-2.150
Token
Token ID198
Feature activation-4.57e-01

Pos logit contributions
+0.468
give+0.422
hon+0.421
 decree+0.413
VICE+0.413
Neg logit contributions
apper-0.299
 lid-0.275
 Pixie-0.267
PDATE-0.246
 Deluxe-0.242
Token1
Token ID16
Feature activation-8.43e-01

Pos logit contributions
+0.782
give+0.708
hon+0.701
VICE+0.680
udo+0.672
Neg logit contributions
apper-0.786
 lid-0.745
 Pixie-0.732
PDATE-0.727
 accents-0.665
Tokena
Token ID64
Feature activation-1.51e-01

Pos logit contributions
+0.990
give+0.865
hon+0.856
udo+0.804
 decree+0.789
Neg logit contributions
apper-1.869
 lid-1.799
PDATE-1.788
 Pixie-1.762
 accents-1.645
Token 1
Token ID352
Feature activation-1.16e+00

Pos logit contributions
+0.257
give+0.232
hon+0.232
udo+0.225
 affirm+0.223
Neg logit contributions
apper-0.260
 lid-0.241
PDATE-0.237
 Pixie-0.234
 accents-0.215
Token.
Token ID13
Feature activation-4.61e-01

Pos logit contributions
+1.898
give+1.700
hon+1.676
VICE+1.627
udo+1.619
Neg logit contributions
apper-2.080
 lid-1.972
PDATE-1.937
 Pixie-1.924
 accents-1.763
Token Sw
Token ID2451
Feature activation+2.15e+00

Pos logit contributions
+0.887
give+0.806
hon+0.796
udo+0.776
VICE+0.775
Neg logit contributions
apper-0.693
 lid-0.646
PDATE-0.640
 Pixie-0.614
 accents-0.565
Tokenimmer
Token ID10957
Feature activation-4.43e-01

Pos logit contributions
apper+1.833
 lid+1.641
 Pixie+1.573
PDATE+1.532
 Deluxe+1.277
Neg logit contributions
-5.533
give-5.170
hon-5.141
VICE-5.032
udo-5.032
Token means
Token ID1724
Feature activation-1.46e-01

Pos logit contributions
+0.722
give+0.646
hon+0.641
 affirm+0.619
 decree+0.618
Neg logit contributions
apper-0.789
PDATE-0.737
 lid-0.733
 Pixie-0.732
 Deluxe-0.666
Token to
Token ID284
Feature activation-2.98e-01

Pos logit contributions
+0.219
give+0.194
hon+0.190
 affirm+0.184
VICE+0.183
Neg logit contributions
apper-0.282
 lid-0.265
 Pixie-0.264
PDATE-0.261
 Deluxe-0.244
Token keep
Token ID1394
Feature activation+1.16e+00

Pos logit contributions
+0.447
give+0.407
hon+0.397
 affirm+0.390
 decree+0.384
Neg logit contributions
apper-0.568
PDATE-0.537
 Pixie-0.530
 lid-0.529
 Deluxe-0.485
Token fit
Token ID4197
Feature activation+2.04e+00

Pos logit contributions
apper+1.921
 lid+1.780
 Pixie+1.774
PDATE+1.757
 Deluxe+1.612
Neg logit contributions
-2.059
give-1.880
hon-1.856
VICE-1.809
 affirm-1.805
Token and
Token ID290
Feature activation-3.50e-01

Pos logit contributions
+1.230
give+1.039
hon+1.021
VICE+0.986
 decree+0.984
Neg logit contributions
apper-1.871
 lid-1.811
 Pixie-1.779
PDATE-1.761
 Deluxe-1.658
Token do
Token ID466
Feature activation+4.83e-02

Pos logit contributions
+0.432
give+0.375
hon+0.369
VICE+0.361
 affirm+0.355
Neg logit contributions
apper-0.759
 lid-0.726
PDATE-0.713
 Pixie-0.708
 accents-0.664
Token not
Token ID407
Feature activation-6.55e-01

Pos logit contributions
apper+0.101
 lid+0.096
 Pixie+0.095
PDATE+0.092
 Deluxe+0.089
Neg logit contributions
-0.064
give-0.055
hon-0.055
udo-0.053
VICE-0.053
Token necessarily
Token ID6646
Feature activation+1.23e+00

Pos logit contributions
+0.697
give+0.595
 affirm+0.593
hon+0.587
 decree+0.565
Neg logit contributions
apper-1.521
PDATE-1.498
 lid-1.475
 Pixie-1.455
ANE-1.358
Token reflect
Token ID4079
Feature activation+1.93e-01

Pos logit contributions
apper+2.345
 lid+2.227
 Pixie+2.193
PDATE+2.178
 Deluxe+2.020
Neg logit contributions
-1.869
give-1.654
hon-1.634
 affirm-1.581
VICE-1.575
Token the
Token ID262
Feature activation+5.14e-01

Pos logit contributions
apper+0.359
 lid+0.348
PDATE+0.340
 Pixie+0.338
 Deluxe+0.309
Neg logit contributions
-0.303
give-0.266
hon-0.262
VICE-0.258
 affirm-0.256
Token views
Token ID5009
Feature activation-5.40e-01

Pos logit contributions
apper+0.843
 lid+0.820
PDATE+0.803
 Pixie+0.801
 Deluxe+0.725
Neg logit contributions
-0.900
hon-0.807
give-0.807
 decree-0.786
VICE-0.786
Token of
Token ID286
Feature activation+1.12e-01

Pos logit contributions
+0.731
hon+0.621
give+0.615
VICE+0.594
 decree+0.594
Neg logit contributions
apper-1.121
 lid-1.080
 Pixie-1.066
PDATE-1.052
ANY-0.980
Token Comcast
Token ID21830
Feature activation+4.67e-01

Pos logit contributions
apper+0.187
 lid+0.177
PDATE+0.173
 Pixie+0.172
 accents+0.155
Neg logit contributions
-0.199
give-0.178
hon-0.176
VICE-0.173
 decree-0.171
Token.
Token ID13
Feature activation-2.04e-01

Pos logit contributions
apper+0.830
 lid+0.796
 Pixie+0.775
PDATE+0.769
 accents+0.709
Neg logit contributions
-0.768
give-0.684
hon-0.679
VICE-0.664
udo-0.653
Token<|endoftext|>
Token ID50256
Feature activation+1.63e+00

Pos logit contributions
+0.409
give+0.370
hon+0.366
VICE+0.361
udo+0.354
Neg logit contributions
apper-0.292
 lid-0.280
PDATE-0.269
 Pixie-0.263
 accents-0.239
Token become
Token ID1716
Feature activation+6.56e-02

Pos logit contributions
apper+4.760
 lid+4.602
 Pixie+4.449
PDATE+4.241
 Deluxe+4.117
Neg logit contributions
-3.872
give-3.408
hon-3.330
VICE-3.293
udo-3.229
Token increasingly
Token ID6481
Feature activation-3.60e-01

Pos logit contributions
apper+0.121
 lid+0.114
 Pixie+0.113
PDATE+0.112
 Deluxe+0.104
Neg logit contributions
-0.104
give-0.092
hon-0.091
VICE-0.089
 affirm-0.088
Token worried
Token ID7960
Feature activation-1.34e-01

Pos logit contributions
+0.528
give+0.468
hon+0.465
 affirm+0.453
VICE+0.450
Neg logit contributions
apper-0.699
 lid-0.654
PDATE-0.652
 Pixie-0.650
 Deluxe-0.601
Token about
Token ID546
Feature activation+4.32e-01

Pos logit contributions
+0.203
give+0.180
hon+0.176
 affirm+0.172
VICE+0.171
Neg logit contributions
apper-0.256
 lid-0.239
 Pixie-0.237
PDATE-0.236
 Deluxe-0.221
Token in
Token ID287
Feature activation+3.05e-01

Pos logit contributions
apper+0.777
 lid+0.726
 Pixie+0.717
PDATE+0.711
 Deluxe+0.661
Neg logit contributions
-0.709
give-0.633
hon-0.624
VICE-0.604
 affirm-0.603
Token the
Token ID262
Feature activation-2.96e-01

Pos logit contributions
apper+0.537
 lid+0.497
 Pixie+0.491
PDATE+0.485
 Deluxe+0.449
Neg logit contributions
-0.514
give-0.462
hon-0.455
 decree-0.441
 affirm-0.439
Token last
Token ID938
Feature activation-1.74e+00

Pos logit contributions
+0.470
give+0.424
hon+0.411
 decree+0.401
 affirm+0.398
Neg logit contributions
apper-0.550
PDATE-0.506
 lid-0.505
 Pixie-0.502
 Deluxe-0.463
Token twelve
Token ID14104
Feature activation-6.04e-01

Pos logit contributions
+2.835
give+2.610
 decree+2.532
hon+2.524
 affirm+2.478
Neg logit contributions
apper-3.076
PDATE-2.851
 lid-2.791
 Pixie-2.783
ANY-2.556
Token months
Token ID1933
Feature activation-2.66e+00

Pos logit contributions
+0.904
give+0.791
hon+0.781
 affirm+0.756
 decree+0.756
Neg logit contributions
apper-1.184
 lid-1.111
 Pixie-1.089
PDATE-1.088
 Deluxe-1.009
Token:
Token ID25
Feature activation-3.51e-01

Pos logit contributions
+3.907
give+3.295
hon+3.262
 affirm+3.138
 decree+3.097
Neg logit contributions
apper-5.181
PDATE-4.803
 Pixie-4.784
 lid-4.758
rawler-4.458
Token
Token ID198
Feature activation-6.56e-01

Pos logit contributions
+0.634
give+0.577
hon+0.564
VICE+0.553
 decree+0.553
Neg logit contributions
apper-0.570
 Pixie-0.519
 lid-0.518
PDATE-0.518
 Deluxe-0.470
Token-
Token ID12
Feature activation+2.34e-01

Pos logit contributions
apper+0.783
 lid+0.745
 Pixie+0.733
PDATE+0.721
 accents+0.674
Neg logit contributions
-0.568
give-0.500
hon-0.496
udo-0.473
VICE-0.472
Token1985
Token ID29110
Feature activation+2.90e+00

Pos logit contributions
apper+0.358
 lid+0.334
 Pixie+0.325
PDATE+0.320
 Deluxe+0.293
Neg logit contributions
-0.447
give-0.405
hon-0.401
VICE-0.392
udo-0.390
Token-
Token ID12
Feature activation+5.26e-01

Pos logit contributions
apper+5.307
 lid+5.005
 Pixie+4.912
PDATE+4.813
 Deluxe+4.563
Neg logit contributions
-4.547
give-4.029
hon-3.973
VICE-3.890
udo-3.870
Token 2
Token ID362
Feature activation+4.54e-01

Pos logit contributions
apper+0.815
 lid+0.761
 Pixie+0.742
PDATE+0.728
 Deluxe+0.672
Neg logit contributions
-0.990
give-0.897
hon-0.888
VICE-0.867
udo-0.863
Token Timothy
Token ID22283
Feature activation-1.83e+00

Pos logit contributions
apper+0.787
 lid+0.742
 Pixie+0.720
PDATE+0.716
 Deluxe+0.659
Neg logit contributions
-0.774
give-0.693
hon-0.686
VICE-0.666
udo-0.662
Token Dalton
Token ID33261
Feature activation+7.94e-03

Pos logit contributions
+3.015
hon+2.870
give+2.809
 affirm+2.693
udo+2.660
Neg logit contributions
apper-3.020
PDATE-2.994
 lid-2.898
 Pixie-2.710
ANY-2.675
Token films
Token ID7328
Feature activation-1.10e+00

Pos logit contributions
apper+0.017
 lid+0.016
PDATE+0.016
 Pixie+0.016
ANY+0.015
Neg logit contributions
-0.010
give-0.009
hon-0.009
 decree-0.008
udo-0.008
Token,
Token ID11
Feature activation-3.17e-01

Pos logit contributions
+1.581
give+1.382
hon+1.370
udo+1.314
VICE+1.308
Neg logit contributions
apper-2.168
 lid-2.041
 Pixie-2.021
PDATE-1.985
 accents-1.862
Token 1987
Token ID12923
Feature activation+1.86e+00

Pos logit contributions
+0.393
give+0.338
hon+0.333
VICE+0.318
 decree+0.316
Neg logit contributions
apper-0.694
 lid-0.660
 Pixie-0.649
PDATE-0.649
ANY-0.604
Token-
Token ID12
Feature activation+3.56e-01

Pos logit contributions
apper+3.569
 lid+3.378
 Pixie+3.313
PDATE+3.260
 Deluxe+3.061
Neg logit contributions
-2.830
give-2.498
hon-2.469
VICE-2.391
udo-2.377
Token1989
Token ID25475
Feature activation+1.90e+00

Pos logit contributions
apper+0.636
 lid+0.600
 Pixie+0.587
PDATE+0.577
 Deluxe+0.540
Neg logit contributions
-0.585
give-0.522
hon-0.516
VICE-0.502
udo-0.499
Token my
Token ID616
Feature activation-1.20e+00

Pos logit contributions
+1.697
give+1.542
hon+1.502
 affirm+1.495
VICE+1.493
Neg logit contributions
apper-2.144
PDATE-1.999
 lid-1.972
 Pixie-1.943
ANY-1.825
Token church
Token ID4928
Feature activation+5.07e-01

Pos logit contributions
+2.076
give+1.930
hon+1.883
 affirm+1.877
 decree+1.853
Neg logit contributions
apper-1.993
PDATE-1.852
 lid-1.797
 Pixie-1.771
ANY-1.650
Token.
Token ID13
Feature activation-5.85e-01

Pos logit contributions
apper+0.915
 lid+0.854
 Pixie+0.840
PDATE+0.840
 Deluxe+0.772
Neg logit contributions
-0.823
give-0.742
hon-0.732
VICE-0.713
 affirm-0.708
Token I
Token ID314
Feature activation-9.21e-01

Pos logit contributions
+1.039
give+0.942
hon+0.927
VICE+0.911
udo+0.906
Neg logit contributions
apper-0.971
 lid-0.914
PDATE-0.894
 Pixie-0.868
 accents-0.805
Token was
Token ID373
Feature activation-1.35e+00

Pos logit contributions
+1.446
give+1.289
hon+1.272
VICE+1.236
 affirm+1.228
Neg logit contributions
apper-1.710
 lid-1.608
 Pixie-1.586
PDATE-1.578
 Deluxe-1.456
Token constantly
Token ID7558
Feature activation-1.84e+00

Pos logit contributions
+1.905
give+1.710
hon+1.684
 affirm+1.648
VICE+1.641
Neg logit contributions
apper-2.699
PDATE-2.518
 lid-2.512
 Pixie-2.483
ANY-2.318
Token told
Token ID1297
Feature activation-2.57e+00

Pos logit contributions
+2.529
give+2.260
hon+2.230
 affirm+2.219
 decree+2.172
Neg logit contributions
apper-3.704
PDATE-3.480
 lid-3.431
 Pixie-3.403
ANY-3.207
Token it
Token ID340
Feature activation+1.30e+00

Pos logit contributions
+3.323
give+3.084
 affirm+3.024
hon+2.919
 unquestion+2.833
Neg logit contributions
apper-5.253
PDATE-5.001
 lid-4.897
 Pixie-4.813
ANY-4.542
Token was
Token ID373
Feature activation+7.67e-01

Pos logit contributions
apper+1.988
 lid+1.864
 Pixie+1.811
PDATE+1.751
 Deluxe+1.646
Neg logit contributions
-2.471
give-2.229
hon-2.209
VICE-2.159
udo-2.152
Token unacceptable
Token ID18010
Feature activation+1.09e-03

Pos logit contributions
apper+1.374
 lid+1.299
 Pixie+1.272
PDATE+1.245
 Deluxe+1.170
Neg logit contributions
-1.258
give-1.118
hon-1.107
VICE-1.076
udo-1.070
Token to
Token ID284
Feature activation+7.34e-01

Pos logit contributions
apper+0.002
 lid+0.002
PDATE+0.002
 Pixie+0.002
 Deluxe+0.002
Neg logit contributions
-0.002
give-0.001
hon-0.001
VICE-0.001
 affirm-0.001
Token,
Token ID11
Feature activation+4.79e-01

Pos logit contributions
+0.317
give+0.279
hon+0.276
 decree+0.268
udo+0.268
Neg logit contributions
apper-0.368
PDATE-0.345
 Pixie-0.342
 lid-0.340
ANY-0.315
Token then
Token ID788
Feature activation-7.80e-01

Pos logit contributions
apper+0.901
 lid+0.845
 Pixie+0.835
PDATE+0.827
 Deluxe+0.769
Neg logit contributions
-0.743
give-0.661
hon-0.651
VICE-0.631
 decree-0.628
Token raised
Token ID4376
Feature activation+7.64e-01

Pos logit contributions
+1.194
give+1.068
hon+1.049
 affirm+1.020
 decree+1.016
Neg logit contributions
apper-1.468
PDATE-1.376
 lid-1.372
 Pixie-1.361
ANY-1.253
Token the
Token ID262
Feature activation-2.94e-01

Pos logit contributions
apper+1.251
 lid+1.154
PDATE+1.139
 Pixie+1.139
ANY+1.041
Neg logit contributions
-1.365
give-1.244
hon-1.232
 affirm-1.197
VICE-1.192
Token glass
Token ID5405
Feature activation-4.07e-01

Pos logit contributions
+0.510
give+0.471
hon+0.460
 decree+0.449
 affirm+0.446
Neg logit contributions
apper-0.490
PDATE-0.454
 Pixie-0.447
 lid-0.443
ANY-0.411
Token.
Token ID13
Feature activation-1.10e-01

Pos logit contributions
+0.638
give+0.563
hon+0.559
 affirm+0.543
 decree+0.536
Neg logit contributions
apper-0.758
PDATE-0.708
 Pixie-0.706
 lid-0.694
ANY-0.650
Token
Token ID564
Feature activation-8.19e-01

Pos logit contributions
+0.194
give+0.174
hon+0.172
VICE+0.167
udo+0.166
Neg logit contributions
apper-0.186
 lid-0.171
PDATE-0.169
 Pixie-0.168
 accents-0.152
Token
Token ID250
Feature activation-3.49e-01

Pos logit contributions
+1.350
give+1.205
hon+1.190
VICE+1.158
udo+1.152
Neg logit contributions
apper-1.462
 lid-1.378
 Pixie-1.350
PDATE-1.327
 Deluxe-1.239
TokenThank
Token ID10449
Feature activation-1.04e+00

Pos logit contributions
+0.594
give+0.534
hon+0.527
VICE+0.515
udo+0.509
Neg logit contributions
apper-0.604
 lid-0.570
 Pixie-0.557
PDATE-0.550
 Deluxe-0.509
Token God
Token ID1793
Feature activation-1.52e+00

Pos logit contributions
+1.366
give+1.215
hon+1.173
 decree+1.120
 affirm+1.112
Neg logit contributions
apper-2.222
 lid-2.068
 Pixie-2.063
PDATE-2.038
ANY-1.916
Token for
Token ID329
Feature activation-5.25e-01

Pos logit contributions
+2.278
give+2.056
hon+2.023
 decree+1.951
 affirm+1.944
Neg logit contributions
apper-2.881
PDATE-2.692
 lid-2.662
 Pixie-2.652
ANY-2.468
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
+0.594
give+0.548
hon+0.537
VICE+0.532
udo+0.527
Neg logit contributions
apper-0.605
 lid-0.573
 Pixie-0.571
PDATE-0.550
 Deluxe-0.510
Token So
Token ID1406
Feature activation+3.31e-02

Pos logit contributions
apper+0.471
 lid+0.436
PDATE+0.426
 Pixie+0.424
 accents+0.382
Neg logit contributions
-0.594
give-0.539
hon-0.532
VICE-0.520
 affirm-0.516
Token CA
Token ID7257
Feature activation+1.17e-01

Pos logit contributions
apper+0.061
 lid+0.057
 Pixie+0.056
PDATE+0.056
 Deluxe+0.052
Neg logit contributions
-0.053
give-0.047
hon-0.046
VICE-0.045
 affirm-0.045
Token must
Token ID1276
Feature activation-1.82e+00

Pos logit contributions
apper+0.216
 lid+0.201
 Pixie+0.198
PDATE+0.196
 Deluxe+0.182
Neg logit contributions
-0.188
give-0.168
hon-0.165
VICE-0.163
udo-0.160
Token =
Token ID796
Feature activation-4.60e-01

Pos logit contributions
+2.532
give+2.221
hon+2.163
 affirm+2.108
 decree+2.097
Neg logit contributions
apper-3.629
PDATE-3.538
 Pixie-3.443
 lid-3.411
 Deluxe-3.146
Token
Token ID564
Feature activation-2.90e-01

Pos logit contributions
+0.829
give+0.750
hon+0.736
VICE+0.720
 affirm+0.715
Neg logit contributions
apper-0.748
 lid-0.687
PDATE-0.681
 Pixie-0.678
 Deluxe-0.616
Token
Token ID246
Feature activation+2.67e-01

Pos logit contributions
+0.461
give+0.409
hon+0.398
VICE+0.393
udo+0.386
Neg logit contributions
apper-0.535
 lid-0.509
 Pixie-0.506
PDATE-0.490
 accents-0.463
Tokenrough
Token ID740
Feature activation+8.83e-01

Pos logit contributions
apper+0.502
 lid+0.477
 Pixie+0.469
PDATE+0.466
 accents+0.433
Neg logit contributions
-0.415
give-0.371
hon-0.366
VICE-0.354
udo-0.347
Tokenly
Token ID306
Feature activation+2.41e-01

Pos logit contributions
apper+1.582
 lid+1.476
 Pixie+1.460
PDATE+1.442
 Deluxe+1.341
Neg logit contributions
-1.462
give-1.307
hon-1.291
VICE-1.248
udo-1.238
Token
Token ID447
Feature activation+7.16e-01

Pos logit contributions
apper+0.426
 lid+0.395
 Pixie+0.395
PDATE+0.391
 Deluxe+0.359
Neg logit contributions
-0.406
give-0.363
hon-0.355
VICE-0.346
 decree-0.341
Token
Token ID247
Feature activation-1.10e+00

Pos logit contributions
apper+1.371
 lid+1.301
 Pixie+1.260
PDATE+1.225
 Deluxe+1.182
Neg logit contributions
-1.084
give-0.951
hon-0.941
VICE-0.917
 affirm-0.910
Tokense
Token ID325
Feature activation-8.22e-01

Pos logit contributions
+2.581
give+2.337
hon+2.297
 affirm+2.250
 decree+2.248
Neg logit contributions
apper-2.995
PDATE-2.796
 lid-2.788
 Pixie-2.765
 Deluxe-2.575
Tokenaux
Token ID14644
Feature activation-1.25e+00

Pos logit contributions
+1.030
give+0.878
hon+0.876
udo+0.839
VICE+0.837
Neg logit contributions
apper-1.790
 lid-1.699
 Pixie-1.669
PDATE-1.665
 Deluxe-1.571
Token f
Token ID277
Feature activation+1.33e-02

Pos logit contributions
+1.792
give+1.588
hon+1.573
 decree+1.533
 affirm+1.522
Neg logit contributions
apper-2.477
PDATE-2.313
 lid-2.308
 Pixie-2.293
rawler-2.135
Tokené
Token ID2634
Feature activation-3.33e+00

Pos logit contributions
apper+0.026
 lid+0.025
 Pixie+0.025
PDATE+0.024
 Deluxe+0.023
Neg logit contributions
-0.019
give-0.017
hon-0.017
udo-0.017
VICE-0.016
Tokend
Token ID67
Feature activation+7.17e-02

Pos logit contributions
+4.791
hon+4.552
give+4.523
udo+4.320
gments+4.193
Neg logit contributions
apper-6.123
PDATE-5.971
 Pixie-5.906
 lid-5.772
ANY-5.382
Tokené
Token ID2634
Feature activation-3.07e+00

Pos logit contributions
apper+0.130
 lid+0.123
PDATE+0.123
 Pixie+0.122
 Deluxe+0.111
Neg logit contributions
-0.114
give-0.103
hon-0.103
udo-0.099
VICE-0.098
Tokenra
Token ID430
Feature activation-2.02e+00

Pos logit contributions
+4.801
hon+4.385
give+4.362
ograms+4.211
udo+4.186
Neg logit contributions
apper-5.379
PDATE-5.149
 Pixie-5.078
 lid-5.040
 Deluxe-4.705
Tokenux
Token ID2821
Feature activation-1.12e+00

Pos logit contributions
+2.928
give+2.609
udo+2.606
hon+2.588
VICE+2.485
Neg logit contributions
apper-3.834
PDATE-3.652
 Pixie-3.651
 lid-3.609
ANY-3.399
Token d
Token ID288
Feature activation-8.58e-02

Pos logit contributions
+1.595
give+1.411
hon+1.397
 decree+1.352
 affirm+1.346
Neg logit contributions
apper-2.234
 lid-2.088
 Pixie-2.075
PDATE-2.074
 Deluxe-1.919
Tokenans
Token ID504
Feature activation-2.08e+00

Pos logit contributions
+0.141
hon+0.129
give+0.128
udo+0.124
VICE+0.123
Neg logit contributions
apper-0.149
 lid-0.142
PDATE-0.141
 Pixie-0.140
 Deluxe-0.127
Token la
Token ID8591
Feature activation-1.51e+00

Pos logit contributions
+2.982
hon+2.720
give+2.685
 decree+2.634
 affirm+2.625
Neg logit contributions
apper-3.980
PDATE-3.705
 Pixie-3.684
 lid-3.629
ANY-3.457
Token
Token ID198
Feature activation-7.03e-01

Pos logit contributions
+1.400
give+1.275
hon+1.265
VICE+1.237
udo+1.232
Neg logit contributions
apper-0.966
 lid-0.895
 Pixie-0.871
PDATE-0.848
 Deluxe-0.781
Token
Token ID198
Feature activation-8.14e-01

Pos logit contributions
+1.427
give+1.301
hon+1.290
VICE+1.262
udo+1.256
Neg logit contributions
apper-0.988
 lid-0.915
 Pixie-0.891
PDATE-0.868
 Deluxe-0.798
TokenGen
Token ID13746
Feature activation-1.63e+00

Pos logit contributions
+1.419
give+1.298
hon+1.288
VICE+1.258
udo+1.245
Neg logit contributions
apper-1.342
 lid-1.291
PDATE-1.285
 Pixie-1.263
 accents-1.141
Tokenetics
Token ID14596
Feature activation-1.84e-01

Pos logit contributions
+1.856
hon+1.624
give+1.591
udo+1.567
ograms+1.552
Neg logit contributions
apper-3.571
 lid-3.485
PDATE-3.472
 Pixie-3.404
ANY-3.204
Token [
Token ID685
Feature activation-2.53e+00

Pos logit contributions
+0.302
hon+0.273
give+0.272
 affirm+0.265
udo+0.264
Neg logit contributions
apper-0.326
PDATE-0.303
 lid-0.302
 Pixie-0.297
ANY-0.275
Token edit
Token ID4370
Feature activation-2.38e+00

Pos logit contributions
+2.163
give+1.852
hon+1.790
udo+1.642
VICE+1.627
Neg logit contributions
apper-6.397
PDATE-6.268
 lid-6.134
 Pixie-6.107
ANY-5.703
Token ]
Token ID2361
Feature activation-6.23e-01

Pos logit contributions
+4.103
give+3.777
hon+3.694
udo+3.575
VICE+3.557
Neg logit contributions
apper-4.011
PDATE-3.894
 lid-3.757
 Pixie-3.683
 accents-3.325
Token
Token ID198
Feature activation-6.72e-01

Pos logit contributions
+1.253
give+1.144
hon+1.133
VICE+1.109
udo+1.104
Neg logit contributions
apper-0.883
 lid-0.819
 Pixie-0.797
PDATE-0.785
 Deluxe-0.713
Token
Token ID198
Feature activation-8.39e-01

Pos logit contributions
+1.360
give+1.240
hon+1.230
VICE+1.203
udo+1.197
Neg logit contributions
apper-0.946
 lid-0.877
 Pixie-0.853
PDATE-0.834
 Deluxe-0.764
TokenEp
Token ID13807
Feature activation-1.44e+00

Pos logit contributions
+1.463
give+1.336
hon+1.330
VICE+1.297
udo+1.281
Neg logit contributions
apper-1.390
 lid-1.332
PDATE-1.322
 Pixie-1.306
 accents-1.180
Tokenider
Token ID1304
Feature activation-1.48e+00

Pos logit contributions
+1.738
hon+1.598
give+1.557
udo+1.479
gments+1.439
Neg logit contributions
apper-3.037
PDATE-3.008
 Pixie-2.979
 lid-2.964
 accents-2.753
Token and
Token ID290
Feature activation-6.46e-01

Pos logit contributions
+0.998
give+0.880
hon+0.866
 affirm+0.851
 decree+0.848
Neg logit contributions
apper-1.232
 lid-1.162
PDATE-1.160
 Pixie-1.137
ANY-1.051
Token many
Token ID867
Feature activation-4.25e-02

Pos logit contributions
+1.077
give+0.987
hon+0.960
 affirm+0.953
 decree+0.945
Neg logit contributions
apper-1.116
PDATE-1.042
 lid-1.029
 Pixie-1.014
ANY-0.946
Token photos
Token ID5205
Feature activation-1.75e+00

Pos logit contributions
+0.072
give+0.065
hon+0.064
 affirm+0.062
 decree+0.062
Neg logit contributions
apper-0.074
 lid-0.069
PDATE-0.068
 Pixie-0.067
ANY-0.062
Token of
Token ID286
Feature activation-6.65e-01

Pos logit contributions
+2.542
give+2.223
hon+2.195
udo+2.160
 affirm+2.129
Neg logit contributions
apper-3.402
PDATE-3.216
 lid-3.214
 Pixie-3.158
rawler-2.933
Token m
Token ID285
Feature activation-3.44e-01

Pos logit contributions
+1.104
give+1.003
hon+0.987
 affirm+0.972
 decree+0.972
Neg logit contributions
apper-1.152
PDATE-1.080
 lid-1.058
 Pixie-1.036
ANY-0.974
Tokenoom
Token ID4207
Feature activation-2.51e+00

Pos logit contributions
+0.605
hon+0.561
give+0.560
udo+0.548
ograms+0.543
Neg logit contributions
apper-0.540
PDATE-0.531
 lid-0.527
 Pixie-0.514
 accents-0.465
Tokenin
Token ID259
Feature activation-4.17e-01

Pos logit contributions
+2.347
hon+2.075
give+2.034
ograms+1.896
udo+1.884
Neg logit contributions
apper-5.934
PDATE-5.829
 Pixie-5.698
 lid-5.657
 Deluxe-5.335
Token characters
Token ID3435
Feature activation+5.85e-01

Pos logit contributions
+0.682
give+0.614
hon+0.610
udo+0.591
 affirm+0.586
Neg logit contributions
apper-0.740
 lid-0.696
PDATE-0.689
 Pixie-0.679
ANY-0.628
Token models
Token ID4981
Feature activation-2.04e-01

Pos logit contributions
apper+1.086
 lid+1.024
PDATE+1.001
 Pixie+1.000
 Deluxe+0.922
Neg logit contributions
-0.926
give-0.820
hon-0.810
VICE-0.785
udo-0.785
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
+0.317
give+0.280
hon+0.278
udo+0.271
 affirm+0.268
Neg logit contributions
apper-0.381
 lid-0.359
PDATE-0.355
 Pixie-0.351
ANY-0.325
Token
Token ID198
Feature activation-5.25e-01

Pos logit contributions
+0.411
hon+0.370
give+0.370
VICE+0.357
udo+0.357
Neg logit contributions
apper-0.334
 lid-0.309
PDATE-0.308
 Pixie-0.293
ANY-0.272
Token
Token ID564
Feature activation-5.06e-01

Pos logit contributions
+0.694
hon+0.625
give+0.622
udo+0.602
VICE+0.600
Neg logit contributions
apper-0.744
 lid-0.697
PDATE-0.689
 Pixie-0.676
ANY-0.627
Token
Token ID250
Feature activation+1.87e-01

Pos logit contributions
+0.825
give+0.734
hon+0.726
VICE+0.706
udo+0.702
Neg logit contributions
apper-0.912
 lid-0.861
 Pixie-0.844
PDATE-0.831
 Deluxe-0.776
TokenMother
Token ID31398
Feature activation+5.32e-02

Pos logit contributions
apper+0.334
 lid+0.317
 Pixie+0.309
PDATE+0.307
 accents+0.284
Neg logit contributions
-0.308
give-0.276
hon-0.273
VICE-0.266
udo-0.261
Token of
Token ID286
Feature activation-9.08e-02

Pos logit contributions
apper+0.100
 lid+0.094
PDATE+0.091
 Pixie+0.091
 accents+0.085
Neg logit contributions
-0.083
give-0.074
hon-0.073
VICE-0.070
udo-0.070
Token the
Token ID262
Feature activation-8.28e-01

Pos logit contributions
+0.152
give+0.137
hon+0.136
VICE+0.132
 decree+0.130
Neg logit contributions
apper-0.159
 lid-0.151
PDATE-0.146
 Pixie-0.144
 accents-0.136
Token Year
Token ID6280
Feature activation-2.46e+00

Pos logit contributions
+1.100
give+0.954
hon+0.941
VICE+0.899
 decree+0.891
Neg logit contributions
apper-1.717
 lid-1.661
PDATE-1.626
 Pixie-1.599
 accents-1.514
Token
Token ID447
Feature activation+6.98e-01

Pos logit contributions
+3.759
hon+3.348
give+3.297
 affirm+3.193
 decree+3.139
Neg logit contributions
apper-4.467
PDATE-4.245
 lid-4.224
 Pixie-4.138
ANY-3.892
Token
Token ID251
Feature activation+2.22e+00

Pos logit contributions
apper+1.404
 lid+1.340
 Pixie+1.319
PDATE+1.295
 accents+1.217
Neg logit contributions
-0.989
give-0.864
hon-0.859
VICE-0.826
udo-0.824
Token in
Token ID287
Feature activation-2.24e-02

Pos logit contributions
apper+3.909
 lid+3.676
 Pixie+3.614
PDATE+3.507
 Deluxe+3.320
Neg logit contributions
-3.668
give-3.268
hon-3.221
VICE-3.150
udo-3.138
Token 2010
Token ID3050
Feature activation-8.33e-01

Pos logit contributions
+0.035
give+0.031
hon+0.031
 decree+0.030
VICE+0.030
Neg logit contributions
apper-0.042
 lid-0.039
PDATE-0.038
 Pixie-0.038
ANY-0.035
Token at
Token ID379
Feature activation+5.54e-01

Pos logit contributions
+1.295
hon+1.159
give+1.141
 affirm+1.107
 decree+1.101
Neg logit contributions
apper-1.560
 lid-1.459
PDATE-1.440
 Pixie-1.431
ANY-1.327
Token U
Token ID471
Feature activation-2.21e-01

Pos logit contributions
apper+3.091
 lid+2.904
 Pixie+2.837
PDATE+2.771
 Deluxe+2.581
Neg logit contributions
-3.427
give-3.084
hon-3.051
VICE-2.973
udo-2.962
Token.
Token ID13
Feature activation-7.55e-01

Pos logit contributions
+0.389
give+0.344
hon+0.341
 decree+0.335
 affirm+0.333
Neg logit contributions
apper-0.372
 lid-0.344
 Pixie-0.339
PDATE-0.320
 Deluxe-0.313
TokenS
Token ID50
Feature activation+1.17e+00

Pos logit contributions
+1.444
give+1.306
hon+1.295
VICE+1.262
udo+1.260
Neg logit contributions
apper-1.149
 lid-1.068
 Pixie-1.047
PDATE-1.014
 Deluxe-0.948
Token.-
Token ID7874
Feature activation+3.80e-01

Pos logit contributions
apper+2.091
 lid+1.966
 Pixie+1.943
PDATE+1.905
 Deluxe+1.782
Neg logit contributions
-1.927
give-1.730
hon-1.712
VICE-1.673
udo-1.662
TokenH
Token ID39
Feature activation+5.05e-01

Pos logit contributions
apper+0.683
 lid+0.642
 Pixie+0.632
PDATE+0.628
 Deluxe+0.578
Neg logit contributions
-0.619
give-0.565
hon-0.560
VICE-0.547
udo-0.536
Tokenait
Token ID4548
Feature activation-2.45e+00

Pos logit contributions
apper+0.841
 lid+0.773
 Pixie+0.747
PDATE+0.716
 Deluxe+0.692
Neg logit contributions
-0.917
give-0.814
hon-0.801
VICE-0.781
 affirm-0.780
Tokeni
Token ID72
Feature activation+9.62e-02

Pos logit contributions
+2.632
give+2.280
hon+2.274
udo+2.165
VICE+2.137
Neg logit contributions
apper-5.646
PDATE-5.450
 lid-5.402
 Pixie-5.340
 Deluxe-4.971
Token relations
Token ID2316
Feature activation+7.66e-01

Pos logit contributions
apper+0.144
 lid+0.131
PDATE+0.131
 Pixie+0.129
 Deluxe+0.116
Neg logit contributions
-0.186
give-0.171
hon-0.169
VICE-0.166
 decree-0.166
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
apper+1.397
 lid+1.310
 Pixie+1.290
PDATE+1.284
 Deluxe+1.183
Neg logit contributions
-1.241
give-1.104
hon-1.095
 affirm-1.060
VICE-1.059
Token U
Token ID471
Feature activation-1.12e+00

Pos logit contributions
apper+0.508
 lid+0.475
 Pixie+0.457
PDATE+0.456
 accents+0.416
Neg logit contributions
-0.615
give-0.557
hon-0.551
VICE-0.541
udo-0.536
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
+1.859
give+1.664
hon+1.647
VICE+1.603
udo+1.592
Neg logit contributions
apper-1.973
 lid-1.864
 Pixie-1.822
PDATE-1.802
 Deluxe-1.670