TOP ACTIVATIONS
MAX = 3.598

o , who lives in Lomb ard , was by chance
said that they did . Lomb ardi explains that people in
Part icle X - Ray Spect rom eter ( AP X
compared precision Acceler ator Mass Spect rom etry ( AMS )
planetary system revis ited by HAR PS - N : A
a possible component of dark matter . If
in the Christmas season and fl aunting the decorations .
s ability to be dark , s ard onic ,
s flight path towards approaching lunar surface with the high gain
pped the ante by warning dark ly about t
ed edges *) graph = Fl atten [ Table [ Table
planet from the forces of Dark se id .
, another way in which dark energy creates distortions in the
mission , high quality infrared spect ra of Mars were also
member of LS ST 's dark energy collaboration , says that
New Genesis , to face Dark se id 's army ,
- 1 ). He ikes fl es hes out this commitment
- m arijuana dispensaries . Dark - haired , bes pect
. I 'm glad nan ocl oud wasn 't the one
and muff ins . Ground fl ax seed also qualifies here

MIN ACTIVATIONS
MIN = -3.002

She has been covering the McCain campaign reaction from the rather
45 HD Bs and Patrick Bert ole tti finished third with
us can listen to the clar inet quint et without fearing
for the information , this clar ifies and adds to what
Man Singh Dal al ( Ret d ) to stir up
. and Bou lt bee Ave . area , north of
to investigate charges that a teacher has inserted her religious and
a great way to kick start a child s
As the father of a little girl , Coach can understand
means an earlier start for elementary schools , and sets off
phone calls . After a teacher ( known in the media
's an awesome way to start your job really , really
potential beneficiary of the Nancy McCain 2013 Family Trust .
the grenade attacks at a Pesh awar cinema house on 13
ng ke ( meaning " et ernal " in the Mongol
https :// kill ert of ure c ords .
dividends . Psycho was a cert ifiable smash , and he
one lamb ste aks from Scot mid .' The
Get our daily newsletter Upgrade your inbox and get our
Get our daily newsletter Upgrade your inbox and get our

INTERVAL 1.948 - 3.598
CONTAINS 0.183%

surprised me . We are Luther ans , and Luther
case and Tan j ian Fl ock 1 C - 5
). After the lights came back up ,
om Hart man Institute . Amid plac ards including
budgetary needs . This book falls into the co

INTERVAL 0.298 - 1.948
CONTAINS 28.074%

ignore the new rules from the central government and will continue
a release date for either KH 3 of FF 14 .
from the closing defender or a lack of concentration , Wallace
. Graphics ] :: From Image ($ b mp ) $
into missed opportunities . The four plays above , all potential

INTERVAL -1.352 - 0.298
CONTAINS 70.413%

judged , said the Prophet , with
Fang orn Forest , where they are befriend ed by Tree
time in the fast ed state is actually a form of
up the most g en eral
's also attracted interest from other NFL teams and in the

INTERVAL -3.002 - -1.352
CONTAINS 1.329%

The video showed Martinez , who was handcuffed ,
addition , the age of children depicted through content on the
her 6 - month - old , Conan , and dog
. During the cart oon crisis
TO THE W ATER VILLE TA VER N , as it
Tokeno
Token ID78
Feature activation-2.14e-01

Pos logit contributions
rils+0.073
atography+0.073
+0.073
+0.072
estyles+0.072
Neg logit contributions
nas-0.054
yang-0.046
worthy-0.045
deal-0.045
uable-0.043
Token,
Token ID11
Feature activation-1.08e-01

Pos logit contributions
nas+0.937
yang+0.784
worthy+0.776
deal+0.771
uable+0.745
Neg logit contributions
rils-1.238
atography-1.223
estyles-1.215
-1.211
-1.206
Token who
Token ID508
Feature activation+3.68e-01

Pos logit contributions
nas+0.501
yang+0.426
worthy+0.420
deal+0.420
uable+0.407
Neg logit contributions
rils-0.594
atography-0.590
estyles-0.586
-0.584
-0.581
Token lives
Token ID3160
Feature activation+6.62e-01

Pos logit contributions
rils+1.978
atography+1.970
+1.942
+1.934
estyles+1.930
Neg logit contributions
nas-1.740
yang-1.498
deal-1.473
worthy-1.470
uable-1.438
Token in
Token ID287
Feature activation+6.33e-01

Pos logit contributions
rils+3.742
atography+3.731
+3.687
+3.657
estyles+3.651
Neg logit contributions
nas-2.888
yang-2.482
deal-2.442
worthy-2.419
uable-2.386
Token Lomb
Token ID28503
Feature activation+3.60e+00

Pos logit contributions
atography+3.345
rils+3.345
+3.290
+3.260
estyles+3.259
Neg logit contributions
nas-2.993
yang-2.594
deal-2.558
worthy-2.540
uable-2.495
Tokenard
Token ID446
Feature activation+3.94e-01

Pos logit contributions
rils+12.809
atography+12.268
+11.721
axies+11.664
estyles+10.781
Neg logit contributions
nas-17.292
yang-16.046
 insulting-15.897
xus-15.591
 exch-15.568
Token,
Token ID11
Feature activation+5.36e-01

Pos logit contributions
rils+2.155
atography+2.147
+2.116
+2.105
estyles+2.101
Neg logit contributions
nas-1.828
yang-1.571
deal-1.544
worthy-1.540
uable-1.509
Token was
Token ID373
Feature activation+7.50e-01

Pos logit contributions
rils+2.887
atography+2.876
+2.835
+2.823
estyles+2.818
Neg logit contributions
nas-2.528
yang-2.177
deal-2.140
worthy-2.135
uable-2.090
Token by
Token ID416
Feature activation+1.22e-01

Pos logit contributions
atography+4.260
rils+4.232
+4.204
+4.160
estyles+4.115
Neg logit contributions
nas-3.244
yang-2.797
deal-2.720
worthy-2.704
uable-2.679
Token chance
Token ID2863
Feature activation+4.71e-01

Pos logit contributions
rils+0.731
atography+0.727
+0.717
estyles+0.717
+0.717
Neg logit contributions
nas-0.507
yang-0.426
deal-0.419
worthy-0.419
uable-0.406
Token said
Token ID531
Feature activation-1.94e-01

Pos logit contributions
rils+4.423
atography+4.403
+4.389
+4.367
estyles+4.357
Neg logit contributions
nas-3.334
yang-2.853
deal-2.788
worthy-2.787
uable-2.737
Token that
Token ID326
Feature activation+1.04e-01

Pos logit contributions
nas+0.871
yang+0.741
deal+0.725
worthy+0.722
uable+0.708
Neg logit contributions
rils-1.094
atography-1.091
-1.070
estyles-1.066
-1.061
Token they
Token ID484
Feature activation-1.09e-01

Pos logit contributions
rils+0.591
atography+0.589
+0.580
+0.577
estyles+0.576
Neg logit contributions
nas-0.466
yang-0.397
worthy-0.389
deal-0.388
uable-0.379
Token did
Token ID750
Feature activation+2.16e-01

Pos logit contributions
nas+0.509
yang+0.435
deal+0.429
worthy+0.422
uable+0.413
Neg logit contributions
rils-0.602
atography-0.600
-0.591
-0.590
estyles-0.589
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
rils+1.148
atography+1.143
+1.126
+1.123
estyles+1.120
Neg logit contributions
nas-1.037
yang-0.895
deal-0.880
worthy-0.877
uable-0.857
Token Lomb
Token ID28503
Feature activation+3.59e+00

Pos logit contributions
nas+0.980
yang+0.850
worthy+0.834
deal+0.832
uable+0.809
Neg logit contributions
rils-0.974
atography-0.964
estyles-0.946
-0.943
-0.938
Tokenardi
Token ID22490
Feature activation+8.87e-01

Pos logit contributions
rils+14.718
atography+14.389
axies+13.951
+13.669
estyles+13.304
Neg logit contributions
nas-15.230
yang-14.166
 insulting-13.337
xus-13.304
deal-13.212
Token explains
Token ID6688
Feature activation-8.74e-02

Pos logit contributions
atography+4.790
rils+4.780
+4.725
+4.688
estyles+4.660
Neg logit contributions
nas-4.070
yang-3.511
deal-3.453
worthy-3.439
uable-3.381
Token that
Token ID326
Feature activation+3.03e-01

Pos logit contributions
nas+0.369
yang+0.310
deal+0.305
worthy+0.304
uable+0.295
Neg logit contributions
rils-0.515
atography-0.514
-0.505
-0.505
estyles-0.503
Token people
Token ID661
Feature activation+3.44e-01

Pos logit contributions
rils+1.698
atography+1.692
+1.668
+1.661
estyles+1.658
Neg logit contributions
nas-1.366
yang-1.167
deal-1.146
worthy-1.144
uable-1.118
Token in
Token ID287
Feature activation+1.46e-02

Pos logit contributions
rils+1.925
atography+1.920
+1.894
+1.884
estyles+1.882
Neg logit contributions
nas-1.543
yang-1.316
deal-1.293
worthy-1.292
uable-1.264
Token Part
Token ID2142
Feature activation+6.48e-01

Pos logit contributions
nas+4.390
yang+3.972
worthy+3.923
deal+3.790
 redeem+3.750
Neg logit contributions
rils-3.457
estyles-3.423
atography-3.331
-3.300
axies-3.268
Tokenicle
Token ID1548
Feature activation+1.13e+00

Pos logit contributions
rils+2.177
atography+2.169
+2.118
estyles+2.091
+2.090
Neg logit contributions
nas-4.363
yang-3.941
deal-3.895
worthy-3.894
uable-3.836
Token X
Token ID1395
Feature activation+1.56e-01

Pos logit contributions
atography+4.407
rils+4.318
+4.285
+4.200
axies+4.105
Neg logit contributions
nas-6.949
yang-6.241
deal-6.142
worthy-6.093
uable-6.003
Token-
Token ID12
Feature activation-9.07e-01

Pos logit contributions
rils+0.958
atography+0.950
estyles+0.941
+0.939
+0.939
Neg logit contributions
nas-0.618
yang-0.516
worthy-0.504
deal-0.504
uable-0.491
TokenRay
Token ID19591
Feature activation+6.89e-01

Pos logit contributions
nas+3.113
yang+2.510
deal+2.478
worthy+2.477
uable+2.343
Neg logit contributions
estyles-6.014
rils-5.977
-5.917
 cues-5.841
atography-5.806
Token Spect
Token ID13058
Feature activation+3.38e+00

Pos logit contributions
atography+2.609
rils+2.591
+2.541
+2.512
estyles+2.482
Neg logit contributions
nas-4.326
yang-3.880
deal-3.849
worthy-3.843
uable-3.783
Tokenrom
Token ID398
Feature activation+2.16e+00

Pos logit contributions
atography+8.842
rils+8.710
+8.141
ography+7.784
axies+7.300
Neg logit contributions
nas-21.540
deal-20.262
xus-20.104
 redeem-19.783
worthy-19.660
Tokeneter
Token ID2357
Feature activation+4.91e-01

Pos logit contributions
atography+3.546
+2.970
rils+2.824
axies+2.589
 cues+2.401
Neg logit contributions
nas-17.301
yang-16.067
worthy-16.053
deal-15.967
 insulting-15.807
Token (
Token ID357
Feature activation+5.71e-01

Pos logit contributions
rils+2.622
atography+2.622
+2.580
+2.562
estyles+2.547
Neg logit contributions
nas-2.314
yang-1.996
deal-1.973
worthy-1.962
uable-1.922
TokenAP
Token ID2969
Feature activation+1.16e+00

Pos logit contributions
rils+2.471
atography+2.460
+2.415
+2.404
estyles+2.391
Neg logit contributions
nas-3.293
yang-2.920
deal-2.885
worthy-2.881
uable-2.839
TokenX
Token ID55
Feature activation-3.60e-01

Pos logit contributions
rils+6.355
atography+6.335
+6.275
+6.203
estyles+6.078
Neg logit contributions
nas-5.020
deal-4.344
worthy-4.306
yang-4.288
uable-4.224
Token compared
Token ID3688
Feature activation+4.77e-01

Pos logit contributions
rils+1.611
atography+1.603
+1.583
+1.579
estyles+1.575
Neg logit contributions
nas-1.300
yang-1.112
deal-1.092
worthy-1.088
uable-1.064
Token precision
Token ID15440
Feature activation-8.96e-02

Pos logit contributions
rils+2.530
atography+2.526
+2.490
estyles+2.475
+2.474
Neg logit contributions
nas-2.283
yang-1.968
deal-1.947
worthy-1.931
uable-1.892
Token Acceler
Token ID29805
Feature activation-5.22e-01

Pos logit contributions
nas+0.467
yang+0.410
worthy+0.403
deal+0.399
uable+0.392
Neg logit contributions
rils-0.445
atography-0.434
-0.431
-0.429
estyles-0.427
Tokenator
Token ID1352
Feature activation-3.36e-01

Pos logit contributions
nas+3.054
yang+2.718
worthy+2.670
uable+2.640
deal+2.618
Neg logit contributions
rils-2.166
-2.162
estyles-2.128
-2.125
atography-2.100
Token Mass
Token ID5674
Feature activation+2.05e+00

Pos logit contributions
nas+1.751
yang+1.541
worthy+1.513
deal+1.484
uable+1.469
Neg logit contributions
rils-1.641
estyles-1.600
atography-1.599
-1.597
-1.588
Token Spect
Token ID13058
Feature activation+3.36e+00

Pos logit contributions
atography+7.265
rils+6.938
axies+6.796
 cues+6.761
+6.752
Neg logit contributions
nas-12.582
deal-11.288
worthy-11.278
yang-11.230
xus-11.197
Tokenrom
Token ID398
Feature activation+2.26e+00

Pos logit contributions
atography+9.519
rils+9.421
+8.730
estyles+8.167
axies+7.986
Neg logit contributions
nas-20.950
deal-19.069
xus-18.789
 redeem-18.771
worthy-18.567
Tokenetry
Token ID11973
Feature activation+4.75e-01

Pos logit contributions
atography+6.949
rils+6.277
+6.109
axies+6.080
estyles+6.036
Neg logit contributions
nas-15.052
worthy-13.641
deal-13.581
yang-13.542
uable-13.311
Token (
Token ID357
Feature activation+6.06e-01

Pos logit contributions
rils+2.371
atography+2.369
+2.330
+2.316
estyles+2.308
Neg logit contributions
nas-2.413
yang-2.099
deal-2.083
worthy-2.066
uable-2.028
TokenAMS
Token ID40834
Feature activation+5.45e-01

Pos logit contributions
rils+2.543
atography+2.531
+2.484
+2.473
estyles+2.461
Neg logit contributions
nas-3.571
yang-3.177
deal-3.133
worthy-3.130
uable-3.083
Token)
Token ID8
Feature activation+5.39e-01

Pos logit contributions
rils+2.922
atography+2.911
+2.869
+2.857
estyles+2.852
Neg logit contributions
nas-2.593
yang-2.235
deal-2.198
worthy-2.193
uable-2.146
Token planetary
Token ID27047
Feature activation+6.46e-01

Pos logit contributions
rils+2.235
atography+2.227
+2.196
+2.182
estyles+2.177
Neg logit contributions
nas-1.840
yang-1.578
deal-1.553
worthy-1.547
uable-1.513
Token system
Token ID1080
Feature activation+8.84e-01

Pos logit contributions
rils+3.510
atography+3.506
+3.458
+3.418
axies+3.400
Neg logit contributions
nas-2.992
yang-2.583
deal-2.546
worthy-2.529
uable-2.474
Token revis
Token ID22124
Feature activation+1.18e+00

Pos logit contributions
rils+4.645
atography+4.639
+4.566
estyles+4.521
+4.513
Neg logit contributions
nas-4.192
yang-3.646
deal-3.589
worthy-3.560
uable-3.503
Tokenited
Token ID863
Feature activation+3.69e-01

Pos logit contributions
rils+5.794
atography+5.663
estyles+5.660
axies+5.610
+5.609
Neg logit contributions
nas-5.795
yang-5.147
deal-5.076
worthy-5.062
xus-4.890
Token by
Token ID416
Feature activation+4.73e-01

Pos logit contributions
rils+2.114
atography+2.106
+2.077
+2.070
estyles+2.066
Neg logit contributions
nas-1.620
yang-1.377
deal-1.352
worthy-1.349
uable-1.317
Token HAR
Token ID43638
Feature activation+3.29e+00

Pos logit contributions
rils+2.550
atography+2.542
+2.504
+2.491
estyles+2.485
Neg logit contributions
nas-2.219
yang-1.911
deal-1.886
worthy-1.879
uable-1.836
TokenPS
Token ID3705
Feature activation+6.76e-01

Pos logit contributions
+10.388
 cues+9.840
atography+9.738
axies+9.559
LLOW+9.268
Neg logit contributions
nas-19.012
deal-17.634
uable-17.343
ufact-17.239
worthy-17.109
Token-
Token ID12
Feature activation+4.73e-01

Pos logit contributions
rils+3.711
atography+3.704
+3.661
+3.622
estyles+3.613
Neg logit contributions
nas-3.098
yang-2.656
deal-2.625
worthy-2.601
uable-2.550
TokenN
Token ID45
Feature activation+6.92e-01

Pos logit contributions
rils+2.709
atography+2.700
+2.667
+2.648
estyles+2.643
Neg logit contributions
nas-2.061
yang-1.753
deal-1.721
worthy-1.714
uable-1.679
Token:
Token ID25
Feature activation+9.30e-02

Pos logit contributions
rils+3.713
atography+3.695
+3.654
+3.592
axies+3.588
Neg logit contributions
nas-3.237
yang-2.791
deal-2.765
worthy-2.742
uable-2.689
Token A
Token ID317
Feature activation+3.19e-01

Pos logit contributions
rils+0.482
atography+0.479
estyles+0.472
+0.471
+0.471
Neg logit contributions
nas-0.459
yang-0.397
worthy-0.391
deal-0.389
uable-0.383
Token
Token ID851
Feature activation+5.76e-01

Pos logit contributions
nas+2.397
yang+2.058
deal+2.008
worthy+1.964
uable+1.913
Neg logit contributions
atography-3.269
estyles-3.240
rils-3.239
-3.181
-3.168
Token a
Token ID257
Feature activation+3.12e-01

Pos logit contributions
rils+3.303
atography+3.285
+3.246
+3.228
estyles+3.215
Neg logit contributions
nas-2.504
yang-2.122
deal-2.090
worthy-2.079
uable-2.037
Token possible
Token ID1744
Feature activation+8.19e-02

Pos logit contributions
rils+1.713
atography+1.706
+1.680
estyles+1.674
+1.674
Neg logit contributions
nas-1.443
yang-1.240
deal-1.218
worthy-1.215
uable-1.188
Token component
Token ID7515
Feature activation-6.03e-01

Pos logit contributions
rils+0.450
atography+0.449
estyles+0.442
+0.440
+0.439
Neg logit contributions
nas-0.380
yang-0.326
deal-0.322
worthy-0.321
uable-0.313
Token of
Token ID286
Feature activation+3.08e-01

Pos logit contributions
nas+2.415
yang+1.995
worthy+1.950
deal+1.946
uable+1.911
Neg logit contributions
rils-3.671
atography-3.662
estyles-3.602
-3.581
-3.572
Token dark
Token ID3223
Feature activation+3.27e+00

Pos logit contributions
rils+1.714
atography+1.708
+1.684
+1.677
estyles+1.671
Neg logit contributions
nas-1.397
yang-1.196
deal-1.176
worthy-1.173
uable-1.147
Token matter
Token ID2300
Feature activation+1.55e+00

Pos logit contributions
+12.750
axies+12.728
 cues+12.661
+12.612
rils+12.611
Neg logit contributions
nas-15.030
xus-14.154
ufact-13.828
worthy-13.318
yang-13.257
Token.
Token ID13
Feature activation+4.77e-02

Pos logit contributions
rils+7.876
atography+7.794
+7.678
+7.647
 cues+7.628
Neg logit contributions
nas-7.097
yang-6.221
deal-6.202
uable-6.154
worthy-6.121
Token
Token ID198
Feature activation+4.87e-01

Pos logit contributions
rils+0.241
atography+0.240
estyles+0.237
+0.236
+0.235
Neg logit contributions
nas-0.242
yang-0.211
deal-0.208
worthy-0.206
uable-0.201
Token
Token ID198
Feature activation+4.13e-01

Pos logit contributions
rils+2.256
atography+2.246
+2.210
+2.169
estyles+2.163
Neg logit contributions
nas-2.610
yang-2.311
worthy-2.256
deal-2.255
uable-2.251
TokenIf
Token ID1532
Feature activation-2.05e-01

Pos logit contributions
rils+2.002
atography+1.991
+1.963
+1.952
estyles+1.945
Neg logit contributions
nas-2.166
yang-1.895
worthy-1.865
deal-1.865
uable-1.834
Token in
Token ID287
Feature activation-2.08e-01

Pos logit contributions
nas+0.371
yang+0.309
worthy+0.306
deal+0.302
uable+0.291
Neg logit contributions
rils-0.556
atography-0.552
-0.547
estyles-0.546
-0.545
Token the
Token ID262
Feature activation+1.58e-01

Pos logit contributions
nas+0.923
yang+0.782
worthy+0.770
deal+0.754
uable+0.738
Neg logit contributions
rils-1.194
atography-1.191
estyles-1.176
-1.175
-1.175
Token Christmas
Token ID6786
Feature activation+2.58e-01

Pos logit contributions
rils+0.864
atography+0.862
+0.851
estyles+0.850
+0.850
Neg logit contributions
nas-0.741
yang-0.634
worthy-0.624
deal-0.623
uable-0.606
Token season
Token ID1622
Feature activation+8.80e-01

Pos logit contributions
rils+1.256
atography+1.256
+1.236
estyles+1.230
+1.229
Neg logit contributions
nas-1.354
yang-1.185
deal-1.169
worthy-1.165
uable-1.138
Token and
Token ID290
Feature activation-1.06e-01

Pos logit contributions
rils+4.900
atography+4.849
+4.791
+4.757
estyles+4.735
Neg logit contributions
nas-3.869
yang-3.310
deal-3.242
uable-3.235
worthy-3.215
Token fl
Token ID781
Feature activation+3.22e+00

Pos logit contributions
nas+0.470
yang+0.400
worthy+0.392
deal+0.391
uable+0.379
Neg logit contributions
rils-0.604
atography-0.600
estyles-0.595
-0.594
-0.591
Tokenaunting
Token ID20706
Feature activation+5.91e-01

Pos logit contributions
atography+11.793
estyles+11.094
axies+10.861
rils+10.758
————————+10.512
Neg logit contributions
nas-15.531
deal-14.505
xus-14.419
yang-13.835
 exch-13.566
Token the
Token ID262
Feature activation-1.06e-01

Pos logit contributions
rils+3.511
atography+3.490
+3.459
+3.423
estyles+3.419
Neg logit contributions
nas-2.430
yang-2.055
deal-2.019
worthy-1.991
uable-1.963
Token decorations
Token ID39011
Feature activation-1.41e-01

Pos logit contributions
nas+0.485
yang+0.410
worthy+0.409
deal+0.406
uable+0.394
Neg logit contributions
rils-0.591
atography-0.590
estyles-0.583
-0.583
-0.579
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
nas+0.595
yang+0.498
worthy+0.494
deal+0.488
uable+0.472
Neg logit contributions
rils-0.836
atography-0.831
-0.823
estyles-0.820
-0.819
Token
Token ID198
Feature activation+9.79e-02

Pos logit contributions
nas+0.932
yang+0.811
worthy+0.800
deal+0.799
uable+0.771
Neg logit contributions
rils-0.941
atography-0.931
estyles-0.924
-0.912
-0.910
Token
Token ID247
Feature activation-5.17e-01

Pos logit contributions
nas+5.231
yang+4.736
deal+4.661
worthy+4.611
uable+4.579
Neg logit contributions
atography-3.930
-3.854
estyles-3.847
rils-3.664
axies-3.612
Tokens
Token ID82
Feature activation+1.69e-01

Pos logit contributions
nas+2.217
yang+1.851
worthy+1.840
deal+1.816
uable+1.785
Neg logit contributions
atography-3.010
rils-2.992
-2.962
estyles-2.951
-2.944
Token ability
Token ID2694
Feature activation-6.78e-01

Pos logit contributions
rils+0.934
atography+0.929
+0.916
+0.915
estyles+0.914
Neg logit contributions
nas-0.782
yang-0.667
worthy-0.660
deal-0.657
uable-0.642
Token to
Token ID284
Feature activation+1.63e-01

Pos logit contributions
nas+2.528
yang+2.035
worthy+2.032
deal+2.018
uable+1.944
Neg logit contributions
rils-4.293
atography-4.246
-4.232
-4.224
estyles-4.195
Token be
Token ID307
Feature activation+2.74e-01

Pos logit contributions
rils+0.913
atography+0.909
+0.898
+0.897
estyles+0.894
Neg logit contributions
nas-0.737
yang-0.630
deal-0.620
worthy-0.616
uable-0.603
Token dark
Token ID3223
Feature activation+3.20e+00

Pos logit contributions
rils+1.613
atography+1.607
+1.587
+1.581
estyles+1.579
Neg logit contributions
nas-1.157
yang-0.976
deal-0.958
worthy-0.956
uable-0.932
Token,
Token ID11
Feature activation+3.89e-01

Pos logit contributions
+13.029
+13.013
 cues+12.997
rils+12.973
atography+12.967
Neg logit contributions
nas-13.896
xus-12.591
yang-12.527
uable-12.187
worthy-11.615
Token s
Token ID264
Feature activation+8.56e-01

Pos logit contributions
rils+2.255
atography+2.247
+2.217
+2.209
estyles+2.205
Neg logit contributions
nas-1.680
yang-1.424
deal-1.398
worthy-1.394
uable-1.361
Tokenard
Token ID446
Feature activation-4.59e-01

Pos logit contributions
rils+4.204
atography+4.179
+4.092
estyles+4.069
+4.066
Neg logit contributions
nas-4.400
yang-3.867
deal-3.796
worthy-3.784
uable-3.697
Tokenonic
Token ID9229
Feature activation+8.44e-01

Pos logit contributions
nas+1.846
worthy+1.566
yang+1.555
uable+1.545
deal+1.519
Neg logit contributions
-2.773
atography-2.728
-2.722
rils-2.677
estyles-2.650
Token,
Token ID11
Feature activation+4.44e-03

Pos logit contributions
rils+4.603
atography+4.575
+4.524
+4.504
estyles+4.459
Neg logit contributions
nas-3.823
yang-3.319
deal-3.246
worthy-3.214
uable-3.169
Tokens
Token ID82
Feature activation+6.45e-01

Pos logit contributions
rils+1.867
atography+1.860
+1.828
estyles+1.828
+1.826
Neg logit contributions
nas-1.649
yang-1.421
worthy-1.393
deal-1.391
uable-1.361
Token flight
Token ID5474
Feature activation+6.09e-01

Pos logit contributions
rils+3.410
atography+3.400
+3.351
+3.331
estyles+3.324
Neg logit contributions
nas-3.104
yang-2.680
deal-2.643
worthy-2.631
uable-2.578
Token path
Token ID3108
Feature activation+1.48e+00

Pos logit contributions
rils+2.763
atography+2.751
+2.704
+2.689
estyles+2.683
Neg logit contributions
nas-3.392
yang-2.992
deal-2.953
worthy-2.942
uable-2.894
Token towards
Token ID3371
Feature activation+4.49e-01

Pos logit contributions
rils+8.040
atography+8.023
+7.853
 cues+7.760
estyles+7.759
Neg logit contributions
nas-6.343
deal-5.531
yang-5.513
worthy-5.360
uable-5.359
Token approaching
Token ID13885
Feature activation+6.87e-01

Pos logit contributions
rils+2.416
atography+2.407
+2.374
+2.362
estyles+2.356
Neg logit contributions
nas-2.117
yang-1.823
deal-1.794
worthy-1.788
uable-1.750
Token lunar
Token ID25572
Feature activation+3.19e+00

Pos logit contributions
rils+3.459
atography+3.448
+3.401
+3.373
estyles+3.348
Neg logit contributions
nas-3.441
yang-2.999
deal-2.976
worthy-2.945
uable-2.889
Token surface
Token ID4417
Feature activation+4.60e-01

Pos logit contributions
 cues+14.974
+14.619
atography+14.498
rils+14.293
axies+14.034
Neg logit contributions
nas-12.528
deal-11.682
xus-11.428
yang-11.085
ufact-10.804
Token with
Token ID351
Feature activation+1.20e-01

Pos logit contributions
rils+2.655
atography+2.646
+2.608
estyles+2.603
+2.601
Neg logit contributions
nas-2.015
yang-1.701
worthy-1.671
deal-1.662
uable-1.621
Token the
Token ID262
Feature activation+2.32e-01

Pos logit contributions
rils+0.679
atography+0.674
estyles+0.667
+0.666
+0.666
Neg logit contributions
nas-0.541
yang-0.460
worthy-0.452
deal-0.446
uable-0.439
Token high
Token ID1029
Feature activation+5.49e-01

Pos logit contributions
rils+1.220
atography+1.210
estyles+1.198
+1.193
+1.193
Neg logit contributions
nas-1.136
yang-0.979
worthy-0.964
deal-0.958
uable-0.937
Token gain
Token ID4461
Feature activation+8.64e-01

Pos logit contributions
rils+3.135
atography+3.126
+3.088
+3.075
estyles+3.057
Neg logit contributions
nas-2.388
yang-2.030
deal-2.003
worthy-1.985
uable-1.947
Tokenpped
Token ID1496
Feature activation-1.25e+00

Pos logit contributions
rils+2.980
atography+2.969
+2.919
estyles+2.895
+2.889
Neg logit contributions
nas-3.319
yang-2.910
deal-2.860
worthy-2.859
uable-2.809
Token the
Token ID262
Feature activation-9.86e-01

Pos logit contributions
nas+4.811
deal+4.003
yang+3.997
worthy+3.980
 insulting+3.809
Neg logit contributions
rils-7.332
atography-7.332
-7.308
estyles-7.291
axies-7.239
Token ante
Token ID29692
Feature activation-1.01e+00

Pos logit contributions
nas+3.236
worthy+2.633
deal+2.617
yang+2.597
 insulting+2.595
Neg logit contributions
rils-6.480
atography-6.432
estyles-6.416
axies-6.402
-6.379
Token by
Token ID416
Feature activation+5.19e-01

Pos logit contributions
nas+4.044
worthy+3.346
deal+3.253
yang+3.244
 insulting+3.217
Neg logit contributions
rils-6.007
atography-5.999
-5.977
estyles-5.942
axies-5.857
Token warning
Token ID6509
Feature activation-3.33e-01

Pos logit contributions
rils+3.010
atography+2.999
+2.961
+2.948
estyles+2.939
Neg logit contributions
nas-2.236
yang-1.895
deal-1.860
worthy-1.854
uable-1.813
Token dark
Token ID3223
Feature activation+3.17e+00

Pos logit contributions
nas+1.462
yang+1.241
deal+1.233
worthy+1.223
uable+1.175
Neg logit contributions
atography-1.896
rils-1.888
estyles-1.871
-1.866
-1.852
Tokenly
Token ID306
Feature activation+4.71e-01

Pos logit contributions
rils+13.766
+13.585
 cues+13.467
+13.334
————————+12.779
Neg logit contributions
nas-14.682
yang-12.864
uable-12.541
xus-12.510
ufact-12.145
Token about
Token ID546
Feature activation+3.70e-01

Pos logit contributions
rils+2.764
atography+2.755
+2.723
+2.709
estyles+2.701
Neg logit contributions
nas-1.991
yang-1.685
deal-1.650
worthy-1.644
uable-1.612
Token
Token ID564
Feature activation+9.20e-01

Pos logit contributions
rils+2.069
atography+2.061
+2.033
+2.024
estyles+2.021
Neg logit contributions
nas-1.670
yang-1.427
deal-1.402
worthy-1.398
uable-1.367
Token
Token ID250
Feature activation-4.72e-01

Pos logit contributions
rils+2.773
+2.755
atography+2.738
+2.655
estyles+2.637
Neg logit contributions
nas-6.510
yang-5.906
deal-5.836
worthy-5.834
uable-5.753
Tokent
Token ID83
Feature activation-5.64e-01

Pos logit contributions
nas+2.232
deal+1.942
yang+1.929
worthy+1.921
uable+1.833
Neg logit contributions
atography-2.523
rils-2.501
estyles-2.497
-2.455
axies-2.443
Tokened
Token ID276
Feature activation+4.52e-01

Pos logit contributions
rils+4.198
atography+4.174
+4.136
+4.094
estyles+4.074
Neg logit contributions
nas-3.960
yang-3.433
deal-3.388
worthy-3.352
uable-3.292
Token edges
Token ID13015
Feature activation+1.02e-01

Pos logit contributions
rils+2.409
atography+2.405
+2.361
estyles+2.360
+2.358
Neg logit contributions
nas-2.163
yang-1.859
deal-1.836
worthy-1.829
uable-1.791
Token *)
Token ID31936
Feature activation+6.13e-01

Pos logit contributions
atography+0.570
rils+0.568
+0.559
estyles+0.558
+0.556
Neg logit contributions
nas-0.462
yang-0.391
worthy-0.388
deal-0.388
uable-0.376
Token graph
Token ID4823
Feature activation-3.67e-01

Pos logit contributions
rils+3.252
atography+3.240
+3.203
+3.180
estyles+3.163
Neg logit contributions
nas-2.926
yang-2.531
worthy-2.483
deal-2.483
uable-2.431
Token =
Token ID796
Feature activation+4.59e-01

Pos logit contributions
nas+1.745
yang+1.466
worthy+1.462
deal+1.453
uable+1.432
Neg logit contributions
atography-1.967
rils-1.957
estyles-1.936
-1.922
-1.908
Token Fl
Token ID1610
Feature activation+3.15e+00

Pos logit contributions
rils+2.251
atography+2.242
+2.208
+2.197
estyles+2.191
Neg logit contributions
nas-2.386
yang-2.085
deal-2.053
worthy-2.049
uable-2.010
Tokenatten
Token ID41769
Feature activation-1.83e-01

Pos logit contributions
atography+10.278
axies+9.475
+9.409
estyles+9.162
rils+8.856
Neg logit contributions
nas-17.285
yang-16.313
 redeem-16.307
xus-16.015
deal-15.968
Token[
Token ID58
Feature activation-2.81e-01

Pos logit contributions
nas+0.917
yang+0.787
worthy+0.779
deal+0.778
uable+0.762
Neg logit contributions
rils-0.937
atography-0.933
estyles-0.914
-0.910
-0.904
Token Table
Token ID8655
Feature activation-1.27e+00

Pos logit contributions
nas+1.392
yang+1.173
deal+1.167
worthy+1.166
uable+1.124
Neg logit contributions
atography-1.471
rils-1.458
estyles-1.444
-1.431
-1.412
Token[
Token ID58
Feature activation-4.86e-01

Pos logit contributions
nas+5.754
worthy+4.860
yang+4.848
deal+4.801
uable+4.727
Neg logit contributions
rils-6.739
atography-6.687
estyles-6.610
-6.558
axies-6.539
TokenTable
Token ID10962
Feature activation-1.94e+00

Pos logit contributions
nas+2.439
yang+2.075
deal+2.063
worthy+2.059
uable+1.990
Neg logit contributions
atography-2.481
rils-2.472
estyles-2.433
-2.421
axies-2.375
Token planet
Token ID5440
Feature activation-2.11e-01

Pos logit contributions
nas+1.613
yang+1.351
worthy+1.324
deal+1.309
uable+1.264
Neg logit contributions
rils-2.615
atography-2.596
estyles-2.573
-2.546
-2.542
Token from
Token ID422
Feature activation+6.17e-01

Pos logit contributions
nas+0.876
yang+0.726
worthy+0.715
deal+0.711
uable+0.694
Neg logit contributions
rils-1.272
atography-1.260
estyles-1.249
-1.241
-1.239
Token the
Token ID262
Feature activation+1.25e-01

Pos logit contributions
rils+3.376
atography+3.360
+3.319
+3.302
estyles+3.285
Neg logit contributions
nas-2.849
yang-2.445
deal-2.399
worthy-2.392
uable-2.341
Token forces
Token ID3386
Feature activation+1.66e-01

Pos logit contributions
rils+0.670
atography+0.669
estyles+0.663
+0.661
+0.657
Neg logit contributions
nas-0.596
yang-0.511
deal-0.506
worthy-0.504
uable-0.492
Token of
Token ID286
Feature activation+3.50e-01

Pos logit contributions
rils+0.939
atography+0.939
+0.926
+0.923
estyles+0.920
Neg logit contributions
nas-0.737
yang-0.631
deal-0.619
worthy-0.617
uable-0.602
Token Dark
Token ID3801
Feature activation+3.13e+00

Pos logit contributions
rils+1.871
atography+1.863
+1.835
estyles+1.833
+1.825
Neg logit contributions
nas-1.672
yang-1.443
deal-1.418
worthy-1.417
uable-1.385
Tokense
Token ID325
Feature activation+1.87e+00

Pos logit contributions
+10.793
rils+10.758
+10.695
————————+10.184
atography+10.129
Neg logit contributions
nas-16.113
uable-14.664
xus-14.429
yang-14.352
worthy-14.203
Tokenid
Token ID312
Feature activation+5.00e-01

Pos logit contributions
rils+7.943
atography+7.848
+7.692
estyles+7.606
axies+7.520
Neg logit contributions
nas-10.123
worthy-8.985
yang-8.936
uable-8.767
xus-8.697
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
rils+2.858
atography+2.850
+2.811
+2.800
estyles+2.799
Neg logit contributions
nas-2.201
yang-1.871
deal-1.837
worthy-1.830
uable-1.788
Token
Token ID198
Feature activation+5.77e-01

Pos logit contributions
nas+1.131
yang+0.979
deal+0.975
worthy+0.969
uable+0.941
Neg logit contributions
rils-1.205
atography-1.189
estyles-1.186
-1.176
-1.163
Token
Token ID198
Feature activation+2.65e-01

Pos logit contributions
rils+2.654
atography+2.651
+2.596
estyles+2.545
+2.544
Neg logit contributions
nas-3.101
yang-2.751
deal-2.684
worthy-2.683
uable-2.678
Token,
Token ID11
Feature activation+4.73e-02

Pos logit contributions
rils+1.422
atography+1.422
+1.397
estyles+1.393
+1.389
Neg logit contributions
nas-1.108
yang-0.944
worthy-0.926
deal-0.919
uable-0.894
Token another
Token ID1194
Feature activation-6.17e-01

Pos logit contributions
rils+0.279
atography+0.278
+0.274
estyles+0.273
+0.273
Neg logit contributions
nas-0.200
yang-0.170
worthy-0.165
deal-0.165
uable-0.160
Token way
Token ID835
Feature activation-3.88e-01

Pos logit contributions
nas+2.593
yang+2.260
worthy+2.247
deal+2.165
 redeem+2.117
Neg logit contributions
rils-3.604
atography-3.565
estyles-3.536
-3.518
-3.509
Token in
Token ID287
Feature activation-6.78e-01

Pos logit contributions
nas+1.580
yang+1.324
worthy+1.293
deal+1.273
uable+1.229
Neg logit contributions
atography-2.343
rils-2.338
estyles-2.306
-2.305
-2.297
Token which
Token ID543
Feature activation+5.69e-02

Pos logit contributions
nas+2.944
yang+2.585
worthy+2.447
deal+2.431
xus+2.400
Neg logit contributions
rils-3.872
-3.797
atography-3.788
-3.787
estyles-3.760
Token dark
Token ID3223
Feature activation+3.12e+00

Pos logit contributions
rils+0.294
atography+0.294
+0.288
estyles+0.288
+0.287
Neg logit contributions
nas-0.282
yang-0.245
worthy-0.241
deal-0.239
uable-0.233
Token energy
Token ID2568
Feature activation+1.53e+00

Pos logit contributions
 cues+13.594
axies+13.223
+12.875
rils+12.730
+12.485
Neg logit contributions
nas-14.094
ufact-12.991
xus-12.989
uable-12.427
yang-12.400
Token creates
Token ID8075
Feature activation-5.65e-01

Pos logit contributions
rils+6.977
 cues+6.968
atography+6.917
+6.817
axies+6.797
Neg logit contributions
nas-7.890
yang-7.023
worthy-6.917
uable-6.909
deal-6.908
Token distortions
Token ID47876
Feature activation-2.52e-01

Pos logit contributions
nas+2.767
yang+2.419
worthy+2.355
deal+2.322
uable+2.290
Neg logit contributions
atography-2.898
rils-2.852
-2.846
estyles-2.833
-2.807
Token in
Token ID287
Feature activation-5.70e-02

Pos logit contributions
nas+0.987
yang+0.820
worthy+0.810
deal+0.792
uable+0.769
Neg logit contributions
rils-1.566
atography-1.564
-1.540
estyles-1.534
-1.533
Token the
Token ID262
Feature activation+5.03e-01

Pos logit contributions
nas+0.270
yang+0.234
worthy+0.230
deal+0.227
uable+0.223
Neg logit contributions
rils-0.306
atography-0.303
estyles-0.301
-0.300
-0.299
Token mission
Token ID4365
Feature activation+7.11e-02

Pos logit contributions
rils+1.170
atography+1.163
+1.146
+1.144
estyles+1.142
Neg logit contributions
nas-1.064
yang-0.916
deal-0.902
worthy-0.901
uable-0.881
Token,
Token ID11
Feature activation+2.50e-01

Pos logit contributions
rils+0.413
atography+0.410
+0.405
+0.403
estyles+0.402
Neg logit contributions
nas-0.309
yang-0.262
worthy-0.257
deal-0.256
uable-0.250
Token high
Token ID1029
Feature activation+6.72e-01

Pos logit contributions
rils+1.332
atography+1.321
+1.306
+1.302
estyles+1.300
Neg logit contributions
nas-1.206
yang-1.035
worthy-1.018
deal-1.018
uable-0.996
Token quality
Token ID3081
Feature activation+3.96e-01

Pos logit contributions
rils+3.909
atography+3.908
+3.858
+3.849
estyles+3.815
Neg logit contributions
nas-2.814
yang-2.384
deal-2.363
worthy-2.322
uable-2.280
Token infrared
Token ID30624
Feature activation+1.46e+00

Pos logit contributions
rils+2.000
atography+1.994
+1.962
+1.952
estyles+1.950
Neg logit contributions
nas-1.998
yang-1.740
deal-1.714
worthy-1.708
uable-1.675
Token spect
Token ID5444
Feature activation+3.10e+00

Pos logit contributions
atography+6.725
 cues+6.662
rils+6.562
+6.516
+6.400
Neg logit contributions
nas-7.391
deal-6.637
yang-6.566
xus-6.460
worthy-6.414
Tokenra
Token ID430
Feature activation+5.65e-01

Pos logit contributions
atography+12.249
rils+12.071
estyles+11.185
axies+11.016
+10.967
Neg logit contributions
nas-15.385
deal-14.480
xus-14.358
 redeem-14.187
worthy-14.145
Token of
Token ID286
Feature activation+5.93e-01

Pos logit contributions
rils+3.005
atography+2.996
+2.955
+2.936
estyles+2.930
Neg logit contributions
nas-2.692
yang-2.325
deal-2.288
worthy-2.279
uable-2.232
Token Mars
Token ID8706
Feature activation+4.99e-01

Pos logit contributions
rils+3.056
atography+3.051
+3.000
+2.988
estyles+2.973
Neg logit contributions
nas-2.913
yang-2.533
deal-2.497
worthy-2.481
uable-2.434
Token were
Token ID547
Feature activation+3.64e-01

Pos logit contributions
rils+2.752
atography+2.742
+2.706
+2.691
estyles+2.684
Neg logit contributions
nas-2.282
yang-1.956
deal-1.927
worthy-1.916
uable-1.874
Token also
Token ID635
Feature activation-1.77e-01

Pos logit contributions
rils+2.269
atography+2.257
+2.230
+2.223
estyles+2.222
Neg logit contributions
nas-1.424
yang-1.183
deal-1.158
worthy-1.154
uable-1.125
Token member
Token ID2888
Feature activation-1.62e-01

Pos logit contributions
rils+3.326
atography+3.317
+3.278
+3.249
estyles+3.234
Neg logit contributions
nas-3.085
yang-2.664
deal-2.628
worthy-2.620
uable-2.567
Token of
Token ID286
Feature activation+4.54e-01

Pos logit contributions
nas+0.659
yang+0.542
deal+0.540
worthy+0.538
uable+0.514
Neg logit contributions
rils-0.993
atography-0.992
-0.987
estyles-0.981
-0.972
Token LS
Token ID30948
Feature activation+1.97e-01

Pos logit contributions
rils+2.415
atography+2.408
+2.371
+2.360
estyles+2.356
Neg logit contributions
nas-2.170
yang-1.872
deal-1.841
worthy-1.837
uable-1.798
TokenST
Token ID2257
Feature activation-4.82e-02

Pos logit contributions
rils+0.842
atography+0.839
+0.828
estyles+0.821
+0.813
Neg logit contributions
nas-1.157
yang-1.026
worthy-1.009
deal-1.006
uable-0.993
Token's
Token ID338
Feature activation+3.89e-01

Pos logit contributions
nas+0.207
yang+0.175
worthy+0.171
deal+0.170
uable+0.166
Neg logit contributions
rils-0.282
atography-0.280
-0.278
estyles-0.277
-0.275
Token dark
Token ID3223
Feature activation+3.08e+00

Pos logit contributions
rils+2.088
atography+2.077
+2.048
+2.043
estyles+2.039
Neg logit contributions
nas-1.853
yang-1.596
deal-1.569
worthy-1.566
uable-1.531
Token energy
Token ID2568
Feature activation+1.23e+00

Pos logit contributions
+12.977
+12.630
 cues+12.584
atography+12.564
rils+12.505
Neg logit contributions
nas-14.171
xus-12.624
yang-12.491
uable-12.416
deal-12.113
Token collaboration
Token ID12438
Feature activation+1.08e+00

Pos logit contributions
atography+6.217
rils+6.147
+6.122
 cues+6.017
axies+5.984
Neg logit contributions
nas-5.847
deal-5.076
yang-5.058
uable-5.016
worthy-5.004
Token,
Token ID11
Feature activation+5.28e-01

Pos logit contributions
atography+5.800
rils+5.771
+5.716
+5.622
estyles+5.613
Neg logit contributions
nas-4.896
yang-4.227
deal-4.178
worthy-4.165
uable-4.135
Token says
Token ID1139
Feature activation-2.88e-01

Pos logit contributions
rils+2.613
atography+2.602
+2.562
+2.551
estyles+2.548
Neg logit contributions
nas-2.732
yang-2.387
deal-2.347
worthy-2.343
uable-2.295
Token that
Token ID326
Feature activation+1.99e-01

Pos logit contributions
nas+1.213
yang+1.033
worthy+1.014
deal+1.010
xus+0.967
Neg logit contributions
rils-1.679
atography-1.678
estyles-1.658
-1.657
-1.654
Token New
Token ID968
Feature activation+1.11e-01

Pos logit contributions
rils+0.923
atography+0.918
estyles+0.904
+0.904
+0.901
Neg logit contributions
nas-0.780
yang-0.669
worthy-0.658
deal-0.657
uable-0.639
Token Genesis
Token ID18993
Feature activation-4.23e-03

Pos logit contributions
rils+0.577
atography+0.574
+0.565
+0.564
estyles+0.563
Neg logit contributions
nas-0.545
yang-0.472
deal-0.465
worthy-0.464
uable-0.453
Token,
Token ID11
Feature activation+1.46e-01

Pos logit contributions
nas+0.018
yang+0.016
deal+0.015
worthy+0.015
uable+0.015
Neg logit contributions
rils-0.025
atography-0.024
-0.024
estyles-0.024
-0.024
Token to
Token ID284
Feature activation+1.94e-01

Pos logit contributions
rils+0.818
atography+0.816
+0.804
estyles+0.803
+0.801
Neg logit contributions
nas-0.665
yang-0.565
worthy-0.556
deal-0.555
uable-0.539
Token face
Token ID1986
Feature activation+3.64e-01

Pos logit contributions
rils+1.100
atography+1.096
+1.082
+1.078
estyles+1.075
Neg logit contributions
nas-0.863
yang-0.736
deal-0.723
worthy-0.720
uable-0.704
Token Dark
Token ID3801
Feature activation+3.07e+00

Pos logit contributions
rils+2.126
atography+2.124
+2.090
estyles+2.086
+2.084
Neg logit contributions
nas-1.558
yang-1.318
deal-1.294
worthy-1.288
uable-1.257
Tokense
Token ID325
Feature activation+2.15e+00

Pos logit contributions
rils+10.423
+10.188
+9.975
atography+9.554
axies+9.291
Neg logit contributions
nas-16.197
yang-14.696
uable-14.507
worthy-14.497
xus-14.393
Tokenid
Token ID312
Feature activation+6.02e-01

Pos logit contributions
rils+8.646
atography+8.319
axies+8.299
estyles+8.156
+7.779
Neg logit contributions
nas-11.538
yang-10.257
worthy-10.174
 insulting-9.904
 lifes-9.902
Token's
Token ID338
Feature activation+7.79e-01

Pos logit contributions
rils+3.454
atography+3.444
+3.397
+3.384
estyles+3.380
Neg logit contributions
nas-2.633
yang-2.237
deal-2.196
worthy-2.190
uable-2.138
Token army
Token ID5428
Feature activation+3.90e-01

Pos logit contributions
rils+3.984
atography+3.947
+3.899
+3.877
estyles+3.846
Neg logit contributions
nas-3.870
yang-3.372
deal-3.314
worthy-3.301
uable-3.244
Token,
Token ID11
Feature activation+3.91e-01

Pos logit contributions
rils+2.274
atography+2.266
+2.233
+2.231
estyles+2.225
Neg logit contributions
nas-1.682
yang-1.418
deal-1.391
worthy-1.391
uable-1.349
Token-
Token ID12
Feature activation-5.70e-01

Pos logit contributions
nas+1.040
yang+0.855
worthy+0.839
deal+0.835
uable+0.806
Neg logit contributions
rils-1.725
atography-1.718
-1.698
estyles-1.690
-1.683
Token1
Token ID16
Feature activation+3.91e-01

Pos logit contributions
nas+2.082
worthy+1.689
yang+1.686
deal+1.670
uable+1.612
Neg logit contributions
rils-3.691
atography-3.647
estyles-3.620
-3.609
-3.584
Token).
Token ID737
Feature activation-3.74e-01

Pos logit contributions
rils+2.146
atography+2.136
+2.116
estyles+2.096
+2.095
Neg logit contributions
nas-1.802
yang-1.550
deal-1.514
worthy-1.509
uable-1.484
Token He
Token ID679
Feature activation+4.29e-01

Pos logit contributions
nas+1.895
yang+1.639
worthy+1.630
deal+1.622
uable+1.563
Neg logit contributions
rils-1.897
atography-1.879
estyles-1.877
-1.854
axies-1.845
Tokenikes
Token ID7938
Feature activation+5.55e-01

Pos logit contributions
rils+1.880
atography+1.874
+1.843
+1.843
estyles+1.823
Neg logit contributions
nas-2.462
yang-2.177
deal-2.153
worthy-2.150
uable-2.114
Token fl
Token ID781
Feature activation+3.06e+00

Pos logit contributions
rils+2.707
atography+2.699
+2.656
+2.634
estyles+2.631
Neg logit contributions
nas-2.885
yang-2.520
deal-2.481
worthy-2.472
uable-2.432
Tokenes
Token ID274
Feature activation+1.10e-01

Pos logit contributions
atography+11.246
estyles+11.157
rils+11.101
axies+10.740
 cues+10.199
Neg logit contributions
nas-15.277
yang-14.057
xus-13.967
worthy-13.900
 insulting-13.853
Tokenhes
Token ID956
Feature activation+3.00e-01

Pos logit contributions
rils+0.736
atography+0.735
+0.726
+0.726
estyles+0.723
Neg logit contributions
nas-0.376
yang-0.302
worthy-0.295
deal-0.295
uable-0.286
Token out
Token ID503
Feature activation+3.57e-01

Pos logit contributions
rils+1.791
atography+1.785
+1.762
+1.757
estyles+1.754
Neg logit contributions
nas-1.243
yang-1.045
deal-1.025
worthy-1.024
uable-0.997
Token this
Token ID428
Feature activation-3.83e-01

Pos logit contributions
rils+1.893
atography+1.884
+1.861
+1.848
estyles+1.842
Neg logit contributions
nas-1.709
yang-1.479
deal-1.453
worthy-1.446
uable-1.421
Token commitment
Token ID7901
Feature activation-1.31e-01

Pos logit contributions
nas+1.638
yang+1.387
worthy+1.370
deal+1.367
uable+1.317
Neg logit contributions
rils-2.254
atography-2.219
estyles-2.215
-2.190
-2.189
Token-
Token ID12
Feature activation-4.33e-01

Pos logit contributions
nas+1.904
deal+1.632
worthy+1.620
yang+1.598
xus+1.560
Neg logit contributions
-1.812
rils-1.809
atography-1.803
estyles-1.793
-1.790
Tokenm
Token ID76
Feature activation+2.75e-02

Pos logit contributions
nas+2.025
deal+1.715
worthy+1.700
yang+1.690
xus+1.592
Neg logit contributions
-2.402
rils-2.394
atography-2.360
estyles-2.356
-2.342
Tokenarijuana
Token ID42834
Feature activation+7.43e-01

Pos logit contributions
rils+0.117
+0.116
+0.114
atography+0.114
estyles+0.112
Neg logit contributions
nas-0.164
yang-0.143
worthy-0.143
deal-0.142
uable-0.139
Token dispensaries
Token ID40698
Feature activation-2.49e-01

Pos logit contributions
rils+3.695
atography+3.658
+3.594
+3.581
estyles+3.560
Neg logit contributions
nas-3.778
yang-3.331
worthy-3.249
deal-3.243
uable-3.220
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
nas+1.049
yang+0.855
worthy+0.854
deal+0.853
uable+0.810
Neg logit contributions
atography-1.487
rils-1.484
-1.469
estyles-1.469
-1.465
Token Dark
Token ID3801
Feature activation+3.05e+00

Pos logit contributions
nas+0.575
yang+0.493
deal+0.490
worthy+0.488
uable+0.470
Neg logit contributions
rils-0.555
atography-0.551
estyles-0.548
-0.542
axies-0.538
Token-
Token ID12
Feature activation+7.24e-02

Pos logit contributions
rils+15.459
+15.107
+14.893
atography+14.695
 cues+14.593
Neg logit contributions
nas-11.740
yang-10.307
uable-10.292
worthy-10.107
xus-9.700
Tokenhaired
Token ID29972
Feature activation+6.26e-01

Pos logit contributions
rils+0.402
atography+0.402
estyles+0.398
+0.397
+0.396
Neg logit contributions
nas-0.334
yang-0.282
deal-0.282
worthy-0.277
uable-0.268
Token,
Token ID11
Feature activation+4.02e-02

Pos logit contributions
rils+3.527
atography+3.503
+3.464
+3.441
estyles+3.415
Neg logit contributions
nas-2.742
yang-2.379
deal-2.316
worthy-2.304
uable-2.276
Token bes
Token ID7284
Feature activation-2.24e-01

Pos logit contributions
rils+0.224
atography+0.224
estyles+0.221
+0.220
+0.219
Neg logit contributions
nas-0.184
yang-0.156
deal-0.154
worthy-0.153
uable-0.149
Tokenpect
Token ID806
Feature activation+4.88e-01

Pos logit contributions
nas+1.437
yang+1.314
uable+1.286
worthy+1.279
deal+1.277
Neg logit contributions
rils-0.808
-0.800
atography-0.798
-0.787
axies-0.786
Token.
Token ID13
Feature activation-2.05e-01

Pos logit contributions
nas+0.522
yang+0.442
worthy+0.434
deal+0.433
uable+0.417
Neg logit contributions
atography-0.687
rils-0.687
estyles-0.675
-0.674
-0.674
Token I
Token ID314
Feature activation-1.73e-01

Pos logit contributions
nas+1.052
yang+0.934
deal+0.913
worthy+0.909
uable+0.890
Neg logit contributions
atography-1.007
rils-1.006
estyles-0.990
-0.983
-0.978
Token'm
Token ID1101
Feature activation-2.63e-01

Pos logit contributions
nas+0.829
yang+0.716
deal+0.707
worthy+0.697
uable+0.681
Neg logit contributions
atography-0.922
rils-0.919
-0.911
estyles-0.904
-0.902
Token glad
Token ID9675
Feature activation-7.75e-01

Pos logit contributions
nas+1.233
yang+1.056
worthy+1.031
deal+1.026
uable+0.997
Neg logit contributions
rils-1.436
atography-1.435
estyles-1.426
-1.414
-1.400
Token nan
Token ID15709
Feature activation-4.28e-01

Pos logit contributions
nas+3.330
yang+2.839
worthy+2.776
deal+2.746
 redeem+2.648
Neg logit contributions
atography-4.445
estyles-4.408
-4.394
rils-4.390
-4.341
Tokenocl
Token ID38679
Feature activation+3.03e+00

Pos logit contributions
nas+2.193
yang+1.928
deal+1.862
worthy+1.854
xus+1.833
Neg logit contributions
atography-2.141
-2.120
-2.099
estyles-2.093
rils-2.085
Tokenoud
Token ID2778
Feature activation+2.52e-01

Pos logit contributions
rils+13.998
atography+13.827
axies+13.255
estyles+13.094
+12.976
Neg logit contributions
nas-12.805
yang-11.360
worthy-11.050
deal-10.869
 insulting-10.589
Token wasn
Token ID2492
Feature activation-9.86e-01

Pos logit contributions
rils+1.340
atography+1.337
+1.318
+1.315
estyles+1.311
Neg logit contributions
nas-1.208
yang-1.043
deal-1.026
worthy-1.024
uable-1.000
Token't
Token ID470
Feature activation-3.12e-01

Pos logit contributions
nas+1.687
deal+1.015
yang+0.931
worthy+0.905
uable+0.683
Neg logit contributions
-8.376
estyles-8.308
rils-8.288
atography-8.284
-8.223
Token the
Token ID262
Feature activation+4.16e-02

Pos logit contributions
nas+1.300
yang+1.093
worthy+1.076
deal+1.071
uable+1.037
Neg logit contributions
rils-1.854
atography-1.849
estyles-1.830
-1.823
-1.820
Token one
Token ID530
Feature activation+5.88e-01

Pos logit contributions
rils+0.220
atography+0.219
estyles+0.216
+0.215
+0.215
Neg logit contributions
nas-0.202
yang-0.174
deal-0.172
worthy-0.172
uable-0.168
Token and
Token ID290
Feature activation-2.73e-01

Pos logit contributions
nas+1.606
yang+1.330
worthy+1.315
deal+1.284
uable+1.243
Neg logit contributions
rils-2.595
-2.563
atography-2.556
estyles-2.541
-2.536
Token muff
Token ID27563
Feature activation-4.00e-01

Pos logit contributions
nas+1.342
yang+1.163
worthy+1.149
deal+1.143
uable+1.107
Neg logit contributions
atography-1.420
rils-1.415
estyles-1.387
-1.380
-1.376
Tokenins
Token ID1040
Feature activation-1.65e-01

Pos logit contributions
nas+1.493
yang+1.216
worthy+1.178
deal+1.173
uable+1.124
Neg logit contributions
-2.592
atography-2.526
rils-2.520
-2.513
axies-2.496
Token.
Token ID13
Feature activation-2.72e-01

Pos logit contributions
nas+0.694
yang+0.580
worthy+0.572
deal+0.567
uable+0.550
Neg logit contributions
rils-0.983
atography-0.975
-0.966
-0.962
estyles-0.961
Token Ground
Token ID13706
Feature activation-8.59e-01

Pos logit contributions
nas+1.330
yang+1.162
worthy+1.133
deal+1.132
uable+1.105
Neg logit contributions
rils-1.425
estyles-1.408
atography-1.405
-1.377
-1.377
Token fl
Token ID781
Feature activation+3.02e+00

Pos logit contributions
nas+4.580
worthy+3.993
yang+3.989
deal+3.861
uable+3.833
Neg logit contributions
atography-4.045
estyles-3.987
rils-3.922
-3.919
-3.885
Tokenax
Token ID897
Feature activation-6.77e-01

Pos logit contributions
atography+8.204
rils+8.140
axies+7.911
estyles+7.871
————————+7.084
Neg logit contributions
nas-17.558
yang-17.151
deal-17.078
xus-16.933
 redeem-16.627
Token seed
Token ID9403
Feature activation+2.55e-01

Pos logit contributions
nas+3.345
deal+2.803
worthy+2.793
yang+2.791
uable+2.731
Neg logit contributions
atography-3.519
-3.514
estyles-3.495
-3.474
rils-3.468
Token also
Token ID635
Feature activation-8.77e-01

Pos logit contributions
rils+1.424
atography+1.420
+1.404
+1.401
estyles+1.398
Neg logit contributions
nas-1.152
yang-0.984
deal-0.967
worthy-0.966
uable-0.943
Token qualifies
Token ID37782
Feature activation-1.44e+00

Pos logit contributions
nas+4.265
yang+3.738
worthy+3.652
deal+3.635
 redeem+3.607
Neg logit contributions
rils-4.439
-4.422
estyles-4.404
-4.373
atography-4.330
Token here
Token ID994
Feature activation-4.95e-01

Pos logit contributions
nas+4.897
yang+4.165
worthy+4.153
deal+3.873
 redeem+3.816
Neg logit contributions
-9.067
estyles-8.918
atography-8.907
rils-8.866
-8.780
Token She
Token ID1375
Feature activation+2.99e-01

Pos logit contributions
rils+2.599
atography+2.587
+2.546
+2.536
estyles+2.531
Neg logit contributions
nas-2.773
yang-2.423
deal-2.388
worthy-2.383
uable-2.336
Token has
Token ID468
Feature activation-4.93e-02

Pos logit contributions
rils+1.603
atography+1.591
+1.575
+1.570
estyles+1.565
Neg logit contributions
nas-1.430
yang-1.236
worthy-1.212
deal-1.210
uable-1.181
Token been
Token ID587
Feature activation-7.14e-01

Pos logit contributions
nas+0.213
yang+0.178
worthy+0.176
deal+0.175
uable+0.171
Neg logit contributions
rils-0.287
atography-0.286
-0.283
-0.283
estyles-0.282
Token covering
Token ID9505
Feature activation-1.61e-01

Pos logit contributions
nas+2.940
worthy+2.439
yang+2.431
deal+2.399
 insulting+2.345
Neg logit contributions
rils-4.265
estyles-4.215
atography-4.203
-4.168
-4.163
Token the
Token ID262
Feature activation+3.72e-01

Pos logit contributions
nas+0.740
yang+0.627
worthy+0.621
deal+0.621
uable+0.601
Neg logit contributions
rils-0.890
atography-0.884
-0.873
estyles-0.867
-0.865
Token McCain
Token ID14264
Feature activation-3.00e+00

Pos logit contributions
rils+2.090
atography+2.082
+2.054
+2.045
estyles+2.042
Neg logit contributions
nas-1.669
yang-1.425
deal-1.400
worthy-1.397
uable-1.365
Token campaign
Token ID1923
Feature activation-2.87e-01

Pos logit contributions
nas+11.804
worthy+9.979
deal+9.911
yang+9.649
 insulting+9.602
Neg logit contributions
estyles-14.747
-14.356
atography-14.348
rils-14.026
axies-13.907
Token reaction
Token ID6317
Feature activation+2.96e-01

Pos logit contributions
nas+1.094
yang+0.891
worthy+0.888
deal+0.869
uable+0.820
Neg logit contributions
rils-1.823
atography-1.816
-1.799
estyles-1.799
-1.796
Token from
Token ID422
Feature activation+5.03e-01

Pos logit contributions
rils+1.835
atography+1.829
+1.808
+1.803
estyles+1.797
Neg logit contributions
nas-1.163
yang-0.965
worthy-0.949
deal-0.945
uable-0.915
Token the
Token ID262
Feature activation+5.90e-01

Pos logit contributions
rils+2.719
atography+2.713
+2.677
+2.660
estyles+2.655
Neg logit contributions
nas-2.352
yang-2.023
deal-1.988
worthy-1.980
uable-1.945
Token rather
Token ID2138
Feature activation-1.35e+00

Pos logit contributions
rils+3.252
atography+3.249
+3.201
+3.177
estyles+3.170
Neg logit contributions
nas-2.670
yang-2.291
deal-2.250
worthy-2.243
uable-2.210
Token 45
Token ID4153
Feature activation-7.11e-01

Pos logit contributions
nas+0.170
yang+0.146
worthy+0.145
deal+0.145
uable+0.142
Neg logit contributions
rils-0.183
atography-0.181
-0.179
-0.178
estyles-0.177
Token HD
Token ID5572
Feature activation+1.41e+00

Pos logit contributions
nas+3.193
worthy+2.726
deal+2.683
uable+2.672
yang+2.667
Neg logit contributions
rils-3.901
atography-3.859
-3.848
axies-3.765
-3.749
TokenBs
Token ID37000
Feature activation-1.69e-01

Pos logit contributions
atography+7.281
estyles+7.246
rils+7.245
+7.239
 cues+7.095
Neg logit contributions
nas-6.394
yang-5.760
deal-5.567
worthy-5.470
xus-5.437
Token and
Token ID290
Feature activation-5.07e-02

Pos logit contributions
nas+0.724
yang+0.599
worthy+0.598
deal+0.593
uable+0.579
Neg logit contributions
rils-0.990
atography-0.983
-0.970
-0.964
estyles-0.959
Token Patrick
Token ID9925
Feature activation-1.90e+00

Pos logit contributions
nas+0.252
yang+0.217
worthy+0.213
deal+0.213
uable+0.209
Neg logit contributions
rils-0.264
atography-0.262
-0.257
-0.256
estyles-0.256
Token Bert
Token ID22108
Feature activation-2.98e+00

Pos logit contributions
nas+9.197
yang+7.878
worthy+7.786
uable+7.514
deal+7.495
Neg logit contributions
atography-8.872
rils-8.846
estyles-8.756
axies-8.679
-8.669
Tokenole
Token ID2305
Feature activation+2.67e-01

Pos logit contributions
nas+14.689
worthy+12.774
yang+12.363
deal+12.030
uable+11.577
Neg logit contributions
-12.092
atography-11.939
estyles-11.859
-11.554
axies-11.539
Tokentti
Token ID35671
Feature activation-7.88e-01

Pos logit contributions
rils+1.549
atography+1.544
+1.524
+1.518
estyles+1.515
Neg logit contributions
nas-1.151
yang-0.974
deal-0.955
worthy-0.954
uable-0.930
Token finished
Token ID5201
Feature activation-5.83e-01

Pos logit contributions
nas+3.933
yang+3.351
worthy+3.315
deal+3.294
uable+3.248
Neg logit contributions
rils-4.023
atography-3.967
axies-3.911
estyles-3.891
-3.875
Token third
Token ID2368
Feature activation+1.10e+00

Pos logit contributions
nas+2.893
worthy+2.443
yang+2.442
deal+2.424
uable+2.384
Neg logit contributions
rils-3.005
atography-2.962
estyles-2.945
-2.939
axies-2.927
Token with
Token ID351
Feature activation+3.92e-01

Pos logit contributions
atography+6.260
+6.226
rils+6.213
+6.126
estyles+6.117
Neg logit contributions
nas-4.514
yang-3.882
deal-3.803
worthy-3.754
uable-3.746
Token us
Token ID514
Feature activation+1.07e-01

Pos logit contributions
nas+1.397
yang+1.159
worthy+1.158
deal+1.145
uable+1.122
Neg logit contributions
rils-2.032
atography-2.013
estyles-1.994
-1.988
-1.981
Token can
Token ID460
Feature activation-1.52e-02

Pos logit contributions
rils+0.576
atography+0.573
+0.565
+0.564
estyles+0.563
Neg logit contributions
nas-0.514
yang-0.442
deal-0.435
worthy-0.435
uable-0.425
Token listen
Token ID6004
Feature activation+1.40e+00

Pos logit contributions
nas+0.067
yang+0.057
deal+0.056
worthy+0.056
uable+0.055
Neg logit contributions
rils-0.086
atography-0.086
-0.085
-0.085
estyles-0.085
Token to
Token ID284
Feature activation+4.88e-01

Pos logit contributions
rils+7.860
atography+7.775
+7.756
estyles+7.612
+7.597
Neg logit contributions
nas-5.800
yang-5.096
deal-5.019
uable-4.889
worthy-4.868
Token the
Token ID262
Feature activation+1.41e-01

Pos logit contributions
rils+2.689
atography+2.677
+2.643
+2.626
estyles+2.625
Neg logit contributions
nas-2.238
yang-1.922
deal-1.891
worthy-1.879
uable-1.843
Token clar
Token ID10212
Feature activation-2.97e+00

Pos logit contributions
rils+0.778
atography+0.774
+0.764
estyles+0.764
+0.762
Neg logit contributions
nas-0.648
yang-0.553
worthy-0.547
deal-0.545
uable-0.533
Tokeninet
Token ID42504
Feature activation-1.69e+00

Pos logit contributions
nas+10.356
uable+8.644
worthy+8.396
deal+8.334
yang+8.147
Neg logit contributions
-16.417
-15.417
atography-14.979
estyles-14.914
————————-14.792
Token quint
Token ID28533
Feature activation-2.19e-01

Pos logit contributions
nas+8.093
worthy+6.816
yang+6.638
deal+6.552
uable+6.536
Neg logit contributions
-8.503
estyles-8.379
atography-8.376
axies-8.256
rils-8.190
Tokenet
Token ID316
Feature activation-2.71e-01

Pos logit contributions
nas+1.120
yang+0.977
deal+0.952
worthy+0.951
uable+0.949
Neg logit contributions
-1.094
atography-1.091
rils-1.088
estyles-1.073
-1.068
Token without
Token ID1231
Feature activation+2.11e-01

Pos logit contributions
nas+1.168
yang+0.974
worthy+0.961
deal+0.957
uable+0.934
Neg logit contributions
rils-1.576
atography-1.573
-1.559
estyles-1.551
-1.550
Token fearing
Token ID33188
Feature activation+1.27e+00

Pos logit contributions
rils+1.265
atography+1.264
+1.249
estyles+1.248
+1.243
Neg logit contributions
nas-0.872
yang-0.730
worthy-0.717
deal-0.717
uable-0.695
Token for
Token ID329
Feature activation-6.27e-01

Pos logit contributions
nas+1.111
yang+0.936
worthy+0.924
deal+0.905
uable+0.887
Neg logit contributions
atography-1.480
rils-1.473
estyles-1.468
-1.466
-1.458
Token the
Token ID262
Feature activation-4.31e-01

Pos logit contributions
nas+2.412
yang+1.993
worthy+1.986
deal+1.954
 redeem+1.921
Neg logit contributions
estyles-3.907
atography-3.873
rils-3.871
-3.829
-3.798
Token information
Token ID1321
Feature activation-3.05e-01

Pos logit contributions
nas+1.900
yang+1.637
deal+1.615
worthy+1.613
uable+1.571
Neg logit contributions
rils-2.439
estyles-2.429
-2.407
atography-2.403
-2.383
Token,
Token ID11
Feature activation+1.25e-01

Pos logit contributions
nas+1.295
yang+1.088
worthy+1.076
deal+1.063
uable+1.042
Neg logit contributions
rils-1.798
atography-1.782
-1.777
estyles-1.766
-1.764
Token this
Token ID428
Feature activation-2.17e-01

Pos logit contributions
rils+0.673
atography+0.671
+0.662
+0.662
estyles+0.660
Neg logit contributions
nas-0.588
yang-0.505
deal-0.499
worthy-0.496
uable-0.485
Token clar
Token ID10212
Feature activation-2.94e+00

Pos logit contributions
nas+1.115
yang+0.974
worthy+0.960
deal+0.954
uable+0.935
Neg logit contributions
atography-1.084
rils-1.081
estyles-1.073
-1.067
-1.060
Tokenifies
Token ID6945
Feature activation-1.44e+00

Pos logit contributions
nas+9.841
uable+8.282
deal+8.193
worthy+7.883
yang+7.740
Neg logit contributions
-16.416
-15.524
estyles-15.407
atography-15.289
rils-14.997
Token and
Token ID290
Feature activation-6.74e-01

Pos logit contributions
nas+5.797
yang+4.876
worthy+4.722
deal+4.655
 insulting+4.526
Neg logit contributions
estyles-8.409
atography-8.306
-8.241
rils-8.059
axies-7.989
Token adds
Token ID6673
Feature activation-9.87e-01

Pos logit contributions
nas+2.819
yang+2.380
worthy+2.359
deal+2.329
 redeem+2.289
Neg logit contributions
rils-3.939
atography-3.934
estyles-3.924
-3.900
-3.876
Token to
Token ID284
Feature activation-8.16e-01

Pos logit contributions
nas+3.894
yang+3.237
worthy+3.194
deal+3.116
uable+3.111
Neg logit contributions
estyles-5.899
atography-5.868
-5.798
rils-5.786
-5.747
Token what
Token ID644
Feature activation+4.46e-01

Pos logit contributions
nas+3.311
yang+2.750
worthy+2.712
deal+2.679
uable+2.667
Neg logit contributions
atography-4.866
estyles-4.831
-4.799
rils-4.792
-4.711
Token Man
Token ID1869
Feature activation+6.36e-01

Pos logit contributions
nas+6.579
yang+5.631
worthy+5.516
deal+5.474
uable+5.209
Neg logit contributions
rils-7.617
-7.480
atography-7.467
estyles-7.396
-7.316
Token Singh
Token ID14403
Feature activation-1.19e+00

Pos logit contributions
rils+3.575
atography+3.550
+3.498
estyles+3.470
+3.453
Neg logit contributions
nas-2.815
yang-2.401
deal-2.367
worthy-2.357
uable-2.314
Token Dal
Token ID12348
Feature activation+3.05e-01

Pos logit contributions
nas+5.183
yang+4.362
deal+4.241
worthy+4.238
 insulting+4.041
Neg logit contributions
rils-6.658
atography-6.646
estyles-6.634
-6.590
-6.503
Tokenal
Token ID282
Feature activation-1.47e-02

Pos logit contributions
rils+1.736
atography+1.730
+1.706
+1.703
estyles+1.698
Neg logit contributions
nas-1.353
yang-1.153
deal-1.132
worthy-1.129
uable-1.101
Token (
Token ID357
Feature activation-2.55e-02

Pos logit contributions
nas+0.063
yang+0.053
deal+0.052
worthy+0.052
uable+0.050
Neg logit contributions
rils-0.086
atography-0.086
-0.085
-0.085
estyles-0.085
TokenRet
Token ID9781
Feature activation-2.82e+00

Pos logit contributions
nas+0.121
yang+0.104
deal+0.103
worthy+0.103
uable+0.099
Neg logit contributions
rils-0.137
atography-0.137
-0.135
estyles-0.135
-0.134
Tokend
Token ID67
Feature activation+2.58e-01

Pos logit contributions
nas+9.894
yang+8.230
deal+8.099
worthy+7.802
uable+7.291
Neg logit contributions
-16.267
rils-15.830
atography-15.514
estyles-15.350
-15.317
Token)
Token ID8
Feature activation-7.55e-01

Pos logit contributions
rils+1.348
atography+1.344
+1.325
+1.317
estyles+1.316
Neg logit contributions
nas-1.260
yang-1.091
deal-1.073
worthy-1.071
uable-1.049
Token to
Token ID284
Feature activation+1.55e-01

Pos logit contributions
nas+3.285
yang+2.755
worthy+2.727
deal+2.706
uable+2.573
Neg logit contributions
rils-4.292
atography-4.275
estyles-4.256
-4.232
-4.197
Token stir
Token ID11240
Feature activation-6.85e-01

Pos logit contributions
rils+0.871
atography+0.867
+0.856
+0.853
estyles+0.852
Neg logit contributions
nas-0.701
yang-0.599
deal-0.589
worthy-0.587
uable-0.573
Token up
Token ID510
Feature activation-8.45e-01

Pos logit contributions
nas+2.600
yang+2.107
deal+2.085
worthy+2.057
uable+1.965
Neg logit contributions
atography-4.326
rils-4.249
estyles-4.245
-4.229
-4.216
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
nas+7.528
yang+6.283
worthy+6.108
uable+5.940
deal+5.843
Neg logit contributions
atography-10.644
rils-10.638
-10.562
estyles-10.317
-10.192
Token and
Token ID290
Feature activation-4.30e-01

Pos logit contributions
nas+4.579
yang+3.807
worthy+3.784
deal+3.696
uable+3.626
Neg logit contributions
rils-6.050
atography-6.031
-5.948
estyles-5.920
axies-5.856
Token Bou
Token ID14551
Feature activation+1.60e-01

Pos logit contributions
nas+2.110
yang+1.809
worthy+1.782
deal+1.767
uable+1.741
Neg logit contributions
rils-2.235
atography-2.232
-2.210
estyles-2.199
-2.183
Tokenlt
Token ID2528
Feature activation-7.88e-01

Pos logit contributions
rils+1.075
atography+1.070
+1.060
+1.057
estyles+1.053
Neg logit contributions
nas-0.552
yang-0.447
deal-0.435
worthy-0.435
uable-0.417
Tokenbee
Token ID20963
Feature activation-5.33e-01

Pos logit contributions
nas+3.624
worthy+3.075
yang+3.059
deal+3.001
uable+2.897
Neg logit contributions
-4.331
atography-4.330
estyles-4.260
rils-4.223
-4.208
Token Ave
Token ID12761
Feature activation-2.78e+00

Pos logit contributions
nas+2.462
yang+2.102
worthy+2.078
deal+2.060
uable+1.992
Neg logit contributions
atography-2.932
rils-2.898
-2.870
estyles-2.865
-2.833
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
nas+9.711
yang+7.791
worthy+7.788
uable+7.355
deal+7.268
Neg logit contributions
atography-15.642
-15.278
rils-15.061
estyles-14.509
axies-14.356
Token area
Token ID1989
Feature activation-1.09e-01

Pos logit contributions
nas+5.001
yang+4.112
worthy+4.103
deal+4.019
uable+3.893
Neg logit contributions
atography-7.017
rils-6.955
-6.896
estyles-6.845
axies-6.758
Token,
Token ID11
Feature activation+8.73e-02

Pos logit contributions
nas+0.460
yang+0.380
deal+0.376
worthy+0.375
uable+0.360
Neg logit contributions
rils-0.645
atography-0.643
-0.634
estyles-0.634
-0.633
Token north
Token ID5093
Feature activation-6.18e-02

Pos logit contributions
rils+0.492
atography+0.490
+0.484
+0.484
estyles+0.483
Neg logit contributions
nas-0.394
yang-0.334
worthy-0.329
deal-0.328
uable-0.319
Token of
Token ID286
Feature activation+4.95e-01

Pos logit contributions
nas+0.266
yang+0.226
deal+0.222
worthy+0.221
uable+0.215
Neg logit contributions
rils-0.359
atography-0.358
-0.352
-0.351
estyles-0.351
Token to
Token ID284
Feature activation+5.95e-01

Pos logit contributions
nas+2.251
worthy+1.833
yang+1.832
deal+1.828
uable+1.746
Neg logit contributions
atography-3.519
rils-3.507
-3.478
estyles-3.460
-3.434
Token investigate
Token ID9161
Feature activation+1.00e+00

Pos logit contributions
rils+3.198
atography+3.185
+3.133
+3.109
estyles+3.102
Neg logit contributions
nas-2.803
yang-2.407
deal-2.361
worthy-2.360
uable-2.312
Token charges
Token ID4530
Feature activation-7.34e-01

Pos logit contributions
rils+5.194
atography+5.081
+5.061
estyles+4.988
+4.979
Neg logit contributions
nas-4.813
yang-4.150
deal-4.095
worthy-4.062
uable-4.041
Token that
Token ID326
Feature activation-1.03e-01

Pos logit contributions
nas+2.705
yang+2.296
worthy+2.274
deal+2.251
 insulting+2.128
Neg logit contributions
atography-4.613
rils-4.587
-4.548
estyles-4.532
axies-4.509
Token a
Token ID257
Feature activation+3.22e-02

Pos logit contributions
nas+0.462
yang+0.394
deal+0.387
worthy+0.387
uable+0.377
Neg logit contributions
atography-0.584
rils-0.583
-0.574
-0.572
estyles-0.572
Token teacher
Token ID4701
Feature activation-2.78e+00

Pos logit contributions
rils+0.189
atography+0.188
+0.185
estyles+0.185
+0.185
Neg logit contributions
nas-0.138
yang-0.116
deal-0.114
worthy-0.114
uable-0.111
Token has
Token ID468
Feature activation-3.45e-01

Pos logit contributions
nas+10.238
 insulting+8.916
worthy+8.443
yang+8.398
 redeem+8.275
Neg logit contributions
rils-15.672
estyles-15.118
axies-14.726
-14.578
atography-14.512
Token inserted
Token ID18846
Feature activation-1.40e+00

Pos logit contributions
nas+1.377
yang+1.135
deal+1.112
worthy+1.109
uable+1.070
Neg logit contributions
rils-2.122
atography-2.107
estyles-2.099
-2.085
-2.084
Token her
Token ID607
Feature activation-4.00e-01

Pos logit contributions
nas+5.996
 insulting+5.159
yang+5.104
worthy+5.035
 redeem+4.944
Neg logit contributions
axies-7.684
rils-7.670
-7.590
atography-7.522
estyles-7.499
Token religious
Token ID4158
Feature activation+4.51e-01

Pos logit contributions
nas+1.661
yang+1.382
worthy+1.372
deal+1.352
uable+1.332
Neg logit contributions
rils-2.390
atography-2.365
-2.352
estyles-2.334
-2.330
Token and
Token ID290
Feature activation+6.23e-02

Pos logit contributions
rils+1.965
atography+1.959
+1.926
+1.915
estyles+1.904
Neg logit contributions
nas-2.588
yang-2.293
deal-2.262
worthy-2.255
uable-2.222
Token a
Token ID257
Feature activation-6.20e-01

Pos logit contributions
nas+1.534
yang+1.284
deal+1.263
worthy+1.255
uable+1.212
Neg logit contributions
rils-2.212
atography-2.211
estyles-2.189
-2.188
-2.180
Token great
Token ID1049
Feature activation-1.11e+00

Pos logit contributions
nas+2.574
yang+2.157
deal+2.141
worthy+2.140
uable+2.090
Neg logit contributions
rils-3.700
atography-3.665
estyles-3.639
-3.594
-3.594
Token way
Token ID835
Feature activation-6.58e-01

Pos logit contributions
nas+4.744
deal+4.033
worthy+3.973
yang+3.919
 redeem+3.882
Neg logit contributions
rils-6.447
estyles-6.313
atography-6.280
-6.242
-6.123
Token to
Token ID284
Feature activation-1.01e+00

Pos logit contributions
nas+2.452
yang+2.014
deal+1.965
worthy+1.965
uable+1.900
Neg logit contributions
rils-4.200
atography-4.186
-4.135
-4.126
estyles-4.120
Token kick
Token ID4829
Feature activation-8.75e-01

Pos logit contributions
nas+3.997
yang+3.393
 redeem+3.370
worthy+3.289
deal+3.269
Neg logit contributions
-6.060
rils-6.037
atography-6.021
estyles-6.015
-5.961
Token start
Token ID923
Feature activation-2.78e+00

Pos logit contributions
nas+3.424
worthy+2.822
deal+2.768
yang+2.753
uable+2.626
Neg logit contributions
atography-5.383
rils-5.369
estyles-5.307
-5.261
-5.218
Token a
Token ID257
Feature activation-1.55e+00

Pos logit contributions
nas+10.410
worthy+8.359
yang+8.351
 redeem+8.178
deal+7.862
Neg logit contributions
atography-15.269
rils-14.854
estyles-14.686
-14.627
-14.258
Token child
Token ID1200
Feature activation-1.80e+00

Pos logit contributions
nas+5.920
worthy+4.858
yang+4.854
deal+4.745
 redeem+4.679
Neg logit contributions
rils-9.443
atography-9.332
estyles-9.259
-8.993
-8.949
Token
Token ID447
Feature activation-4.92e-01

Pos logit contributions
nas+6.117
yang+5.184
worthy+5.025
deal+4.987
uable+4.643
Neg logit contributions
rils-11.272
atography-11.247
estyles-11.245
-11.128
-11.074
Token
Token ID247
Feature activation-4.64e-02

Pos logit contributions
nas+2.990
yang+2.703
deal+2.684
worthy+2.651
uable+2.636
Neg logit contributions
atography-1.943
-1.896
estyles-1.893
rils-1.892
axies-1.800
Tokens
Token ID82
Feature activation-5.94e-01

Pos logit contributions
nas+0.208
yang+0.177
worthy+0.174
deal+0.174
uable+0.170
Neg logit contributions
rils-0.262
atography-0.262
-0.257
estyles-0.257
-0.257
Token As
Token ID1081
Feature activation-3.86e-01

Pos logit contributions
rils+0.048
atography+0.047
estyles+0.046
+0.046
+0.046
Neg logit contributions
nas-0.049
yang-0.042
deal-0.042
worthy-0.042
uable-0.041
Token the
Token ID262
Feature activation+2.22e-01

Pos logit contributions
nas+1.741
yang+1.471
deal+1.467
worthy+1.457
uable+1.420
Neg logit contributions
atography-2.146
rils-2.142
-2.119
estyles-2.101
axies-2.088
Token father
Token ID2988
Feature activation+1.07e-02

Pos logit contributions
rils+1.176
atography+1.172
+1.156
+1.155
estyles+1.154
Neg logit contributions
nas-1.068
yang-0.916
deal-0.908
worthy-0.902
uable-0.884
Token of
Token ID286
Feature activation-2.89e-01

Pos logit contributions
rils+0.063
atography+0.062
+0.061
+0.061
estyles+0.061
Neg logit contributions
nas-0.045
yang-0.038
deal-0.038
worthy-0.038
uable-0.036
Token a
Token ID257
Feature activation-6.47e-01

Pos logit contributions
nas+1.417
yang+1.231
worthy+1.210
deal+1.204
uable+1.176
Neg logit contributions
rils-1.497
atography-1.494
-1.467
estyles-1.457
-1.453
Token little
Token ID1310
Feature activation-2.74e+00

Pos logit contributions
nas+2.831
yang+2.388
worthy+2.368
deal+2.368
 redeem+2.291
Neg logit contributions
rils-3.734
atography-3.674
estyles-3.596
-3.590
axies-3.543
Token girl
Token ID2576
Feature activation-1.57e+00

Pos logit contributions
nas+7.426
 redeem+5.574
deal+5.548
 americ+5.529
yang+5.330
Neg logit contributions
rils-17.678
atography-17.491
estyles-16.661
axies-16.624
-16.227
Token,
Token ID11
Feature activation+1.65e-02

Pos logit contributions
nas+5.837
yang+4.928
worthy+4.723
deal+4.704
 redeem+4.606
Neg logit contributions
rils-9.456
atography-9.418
estyles-9.167
-9.153
axies-8.972
Token Coach
Token ID16393
Feature activation-2.31e-01

Pos logit contributions
rils+0.085
atography+0.085
+0.084
estyles+0.083
+0.083
Neg logit contributions
nas-0.082
yang-0.071
deal-0.070
worthy-0.070
uable-0.069
Token can
Token ID460
Feature activation-2.23e-01

Pos logit contributions
nas+1.198
yang+1.057
deal+1.031
worthy+1.030
uable+1.000
Neg logit contributions
rils-1.146
atography-1.131
estyles-1.115
-1.113
-1.113
Token understand
Token ID1833
Feature activation-7.71e-01

Pos logit contributions
nas+0.831
yang+0.671
deal+0.667
worthy+0.656
uable+0.627
Neg logit contributions
rils-1.427
atography-1.421
-1.410
-1.407
estyles-1.396
Token means
Token ID1724
Feature activation+3.77e-02

Pos logit contributions
rils+0.913
atography+0.907
+0.893
+0.893
estyles+0.888
Neg logit contributions
nas-1.146
yang-1.009
deal-0.994
worthy-0.994
uable-0.973
Token an
Token ID281
Feature activation+1.53e-01

Pos logit contributions
rils+0.221
atography+0.220
estyles+0.216
+0.216
+0.216
Neg logit contributions
nas-0.162
yang-0.137
deal-0.134
worthy-0.134
uable-0.130
Token earlier
Token ID2961
Feature activation-5.87e-01

Pos logit contributions
rils+0.902
atography+0.899
+0.888
+0.884
estyles+0.882
Neg logit contributions
nas-0.642
yang-0.544
deal-0.532
worthy-0.530
uable-0.517
Token start
Token ID923
Feature activation-1.16e+00

Pos logit contributions
nas+2.512
worthy+2.064
deal+2.055
yang+2.022
uable+1.963
Neg logit contributions
rils-3.511
atography-3.437
estyles-3.426
-3.423
-3.341
Token for
Token ID329
Feature activation-3.23e-02

Pos logit contributions
nas+4.539
worthy+3.706
deal+3.606
yang+3.582
xus+3.388
Neg logit contributions
rils-7.201
atography-7.092
axies-7.079
-7.077
estyles-6.910
Token elementary
Token ID19823
Feature activation-2.72e+00

Pos logit contributions
nas+0.136
yang+0.113
worthy+0.111
deal+0.111
uable+0.107
Neg logit contributions
rils-0.193
atography-0.192
-0.190
-0.189
estyles-0.189
Token schools
Token ID4266
Feature activation-7.09e-01

Pos logit contributions
nas+10.026
 diplom+7.779
worthy+7.622
 lifes+7.333
yang+7.272
Neg logit contributions
rils-15.097
-15.047
estyles-14.672
atography-14.644
————————-14.237
Token,
Token ID11
Feature activation+5.18e-01

Pos logit contributions
nas+2.921
worthy+2.352
yang+2.318
deal+2.277
uable+2.184
Neg logit contributions
rils-4.318
-4.271
atography-4.247
estyles-4.208
-4.164
Token and
Token ID290
Feature activation+1.88e-01

Pos logit contributions
rils+2.773
atography+2.766
+2.729
estyles+2.704
+2.702
Neg logit contributions
nas-2.444
yang-2.108
deal-2.080
worthy-2.069
uable-2.030
Token sets
Token ID5621
Feature activation-3.73e-01

Pos logit contributions
rils+1.056
atography+1.052
+1.037
+1.035
estyles+1.033
Neg logit contributions
nas-0.848
yang-0.723
deal-0.710
worthy-0.709
uable-0.692
Token off
Token ID572
Feature activation-2.88e-01

Pos logit contributions
nas+1.554
yang+1.292
worthy+1.267
deal+1.261
uable+1.232
Neg logit contributions
rils-2.237
atography-2.229
-2.201
-2.188
estyles-2.179
Token phone
Token ID3072
Feature activation+2.55e-01

Pos logit contributions
rils+1.081
atography+1.078
+1.063
+1.060
estyles+1.058
Neg logit contributions
nas-0.700
yang-0.583
worthy-0.572
deal-0.572
uable-0.556
Token calls
Token ID3848
Feature activation-8.17e-01

Pos logit contributions
rils+1.141
atography+1.135
+1.115
+1.111
estyles+1.108
Neg logit contributions
nas-1.442
yang-1.274
worthy-1.261
deal-1.261
uable-1.236
Token.
Token ID13
Feature activation-1.62e-01

Pos logit contributions
nas+3.174
worthy+2.628
yang+2.596
deal+2.546
 insulting+2.506
Neg logit contributions
rils-5.054
atography-5.006
estyles-4.948
-4.929
-4.900
Token After
Token ID2293
Feature activation-1.45e-01

Pos logit contributions
nas+0.854
yang+0.743
deal+0.738
worthy+0.733
uable+0.709
Neg logit contributions
rils-0.795
atography-0.788
estyles-0.778
-0.770
-0.767
Token a
Token ID257
Feature activation-1.33e-01

Pos logit contributions
nas+0.601
yang+0.507
worthy+0.496
deal+0.495
uable+0.479
Neg logit contributions
atography-0.871
rils-0.865
estyles-0.851
-0.850
-0.848
Token teacher
Token ID4701
Feature activation-2.71e+00

Pos logit contributions
nas+0.582
yang+0.491
deal+0.486
worthy+0.484
uable+0.470
Neg logit contributions
rils-0.770
atography-0.764
estyles-0.748
-0.744
-0.744
Token (
Token ID357
Feature activation-1.68e-01

Pos logit contributions
nas+10.487
 insulting+8.893
worthy+8.828
yang+8.687
 redeem+8.537
Neg logit contributions
rils-14.406
estyles-14.112
atography-13.767
-13.688
axies-13.451
Tokenknown
Token ID4002
Feature activation-1.25e-01

Pos logit contributions
nas+0.799
yang+0.678
deal+0.677
worthy+0.673
uable+0.646
Neg logit contributions
rils-0.912
atography-0.911
estyles-0.901
-0.900
-0.894
Token in
Token ID287
Feature activation-5.61e-01

Pos logit contributions
nas+0.509
yang+0.426
deal+0.418
worthy+0.418
uable+0.401
Neg logit contributions
rils-0.758
atography-0.755
estyles-0.742
-0.741
-0.741
Token the
Token ID262
Feature activation-2.67e-01

Pos logit contributions
nas+2.408
yang+1.976
worthy+1.971
deal+1.968
xus+1.894
Neg logit contributions
rils-3.280
atography-3.279
estyles-3.214
axies-3.202
-3.198
Token media
Token ID2056
Feature activation+3.60e-01

Pos logit contributions
nas+1.086
yang+0.895
deal+0.881
worthy+0.876
uable+0.846
Neg logit contributions
rils-1.619
atography-1.618
estyles-1.592
-1.591
-1.579
Token's
Token ID338
Feature activation-1.18e+00

Pos logit contributions
nas+4.589
worthy+3.892
yang+3.866
 redeem+3.724
deal+3.718
Neg logit contributions
-6.233
rils-6.225
atography-6.197
-6.166
estyles-6.144
Token an
Token ID281
Feature activation-1.62e+00

Pos logit contributions
nas+4.755
worthy+3.936
yang+3.849
 redeem+3.842
 insulting+3.789
Neg logit contributions
rils-7.094
estyles-6.999
atography-6.974
-6.940
-6.915
Token awesome
Token ID7427
Feature activation-9.79e-01

Pos logit contributions
nas+4.888
worthy+4.004
 insulting+3.824
 redeem+3.818
uable+3.814
Neg logit contributions
rils-10.868
estyles-10.738
-10.675
-10.650
atography-10.612
Token way
Token ID835
Feature activation-7.89e-02

Pos logit contributions
nas+3.663
worthy+3.009
deal+2.916
yang+2.898
uable+2.856
Neg logit contributions
rils-6.244
atography-6.136
estyles-6.103
-6.051
-5.997
Token to
Token ID284
Feature activation-8.17e-01

Pos logit contributions
nas+0.304
yang+0.252
deal+0.247
worthy+0.246
uable+0.239
Neg logit contributions
rils-0.494
atography-0.493
-0.487
-0.485
estyles-0.484
Token start
Token ID923
Feature activation-2.70e+00

Pos logit contributions
nas+3.319
yang+2.774
worthy+2.710
deal+2.707
 redeem+2.700
Neg logit contributions
-4.885
rils-4.879
atography-4.878
estyles-4.847
-4.817
Token your
Token ID534
Feature activation-1.73e+00

Pos logit contributions
nas+10.172
worthy+8.085
 redeem+8.081
yang+7.979
 worrying+7.935
Neg logit contributions
atography-15.171
estyles-14.679
rils-14.664
-14.402
-14.365
Token job
Token ID1693
Feature activation-8.84e-01

Pos logit contributions
nas+6.604
worthy+5.524
yang+5.420
deal+5.389
 redeem+5.383
Neg logit contributions
rils-10.022
estyles-9.969
atography-9.938
-9.848
axies-9.740
Token really
Token ID1107
Feature activation+1.38e-01

Pos logit contributions
nas+3.656
worthy+3.030
yang+3.012
deal+2.943
uable+2.799
Neg logit contributions
rils-5.308
estyles-5.220
atography-5.203
-5.194
-5.190
Token,
Token ID11
Feature activation-3.02e-01

Pos logit contributions
rils+0.833
atography+0.830
+0.820
+0.817
estyles+0.815
Neg logit contributions
nas-0.557
yang-0.468
deal-0.459
worthy-0.457
uable-0.446
Token really
Token ID1107
Feature activation+5.50e-01

Pos logit contributions
nas+1.292
yang+1.090
worthy+1.068
deal+1.061
uable+1.037
Neg logit contributions
rils-1.766
atography-1.761
-1.739
-1.739
estyles-1.735
Token potential
Token ID2785
Feature activation+4.78e-01

Pos logit contributions
atography+2.093
rils+2.091
+2.069
+2.061
estyles+2.045
Neg logit contributions
nas-1.470
yang-1.244
deal-1.216
worthy-1.210
uable-1.180
Token beneficiary
Token ID33867
Feature activation-6.57e-01

Pos logit contributions
rils+2.749
atography+2.748
+2.723
+2.706
estyles+2.685
Neg logit contributions
nas-2.040
yang-1.734
deal-1.685
worthy-1.685
uable-1.648
Token of
Token ID286
Feature activation+1.10e-01

Pos logit contributions
nas+2.728
yang+2.282
worthy+2.258
deal+2.257
uable+2.194
Neg logit contributions
rils-3.902
atography-3.881
-3.819
-3.812
estyles-3.807
Token the
Token ID262
Feature activation-1.72e-01

Pos logit contributions
rils+0.633
atography+0.630
+0.622
+0.620
estyles+0.619
Neg logit contributions
nas-0.482
yang-0.408
deal-0.401
worthy-0.400
uable-0.391
Token Nancy
Token ID18496
Feature activation-4.40e-02

Pos logit contributions
nas+0.752
yang+0.631
deal+0.627
worthy+0.621
uable+0.608
Neg logit contributions
rils-0.993
atography-0.987
-0.973
estyles-0.972
-0.970
Token McCain
Token ID14264
Feature activation-2.68e+00

Pos logit contributions
nas+0.208
yang+0.178
worthy+0.176
deal+0.176
uable+0.171
Neg logit contributions
rils-0.238
atography-0.238
-0.234
-0.234
estyles-0.233
Token 2013
Token ID2211
Feature activation-1.15e+00

Pos logit contributions
nas+11.502
xus+9.570
worthy+9.545
deal+9.488
yang+9.480
Neg logit contributions
estyles-13.067
atography-13.043
-12.842
rils-12.634
-12.458
Token Family
Token ID7884
Feature activation+4.12e-01

Pos logit contributions
nas+5.113
deal+4.268
yang+4.265
worthy+4.205
xus+4.156
Neg logit contributions
rils-6.360
atography-6.322
estyles-6.242
-6.198
-6.147
Token Trust
Token ID9870
Feature activation-5.29e-01

Pos logit contributions
rils+2.307
atography+2.300
+2.274
+2.253
estyles+2.250
Neg logit contributions
nas-1.848
yang-1.580
deal-1.549
worthy-1.544
uable-1.513
Token.
Token ID13
Feature activation-8.96e-02

Pos logit contributions
nas+2.233
worthy+1.870
yang+1.866
deal+1.841
uable+1.786
Neg logit contributions
rils-3.116
atography-3.101
-3.070
estyles-3.057
-3.036
Token
Token ID198
Feature activation+6.58e-02

Pos logit contributions
nas+0.461
yang+0.399
deal+0.393
worthy+0.393
uable+0.382
Neg logit contributions
rils-0.449
atography-0.445
estyles-0.440
-0.439
-0.439
Token the
Token ID262
Feature activation-2.43e-01

Pos logit contributions
nas+2.233
yang+1.915
deal+1.876
worthy+1.842
 insulting+1.817
Neg logit contributions
rils-2.873
atography-2.823
-2.816
estyles-2.815
-2.782
Token grenade
Token ID26168
Feature activation+3.44e-01

Pos logit contributions
nas+1.076
yang+0.910
deal+0.897
worthy+0.885
uable+0.864
Neg logit contributions
rils-1.396
atography-1.372
-1.368
estyles-1.366
-1.355
Token attacks
Token ID3434
Feature activation-1.05e+00

Pos logit contributions
rils+1.891
atography+1.883
+1.857
+1.849
estyles+1.845
Neg logit contributions
nas-1.592
yang-1.365
deal-1.343
worthy-1.339
uable-1.309
Token at
Token ID379
Feature activation-1.15e+00

Pos logit contributions
nas+3.970
yang+3.227
deal+3.111
worthy+3.077
 insulting+3.056
Neg logit contributions
-6.617
rils-6.596
atography-6.594
estyles-6.549
-6.432
Token a
Token ID257
Feature activation-9.04e-01

Pos logit contributions
nas+4.856
yang+4.046
worthy+3.901
deal+3.855
uable+3.815
Neg logit contributions
rils-6.706
-6.702
estyles-6.568
atography-6.484
axies-6.393
Token Pesh
Token ID39654
Feature activation-2.67e+00

Pos logit contributions
nas+3.722
yang+3.045
deal+2.970
worthy+2.920
uable+2.853
Neg logit contributions
rils-5.554
-5.447
atography-5.390
estyles-5.389
axies-5.311
Tokenawar
Token ID48841
Feature activation-1.25e+00

Pos logit contributions
nas+7.458
deal+5.561
yang+5.303
worthy+4.968
uable+4.713
Neg logit contributions
rils-18.135
-17.923
estyles-17.157
atography-17.043
axies-16.817
Token cinema
Token ID22041
Feature activation-5.49e-01

Pos logit contributions
nas+4.861
yang+4.022
deal+3.911
worthy+3.745
uable+3.742
Neg logit contributions
rils-7.777
-7.676
atography-7.562
estyles-7.554
axies-7.353
Token house
Token ID2156
Feature activation-2.16e-01

Pos logit contributions
nas+2.240
yang+1.827
deal+1.797
worthy+1.771
uable+1.704
Neg logit contributions
rils-3.340
atography-3.340
-3.327
estyles-3.308
-3.270
Token on
Token ID319
Feature activation-1.21e+00

Pos logit contributions
nas+0.895
yang+0.742
deal+0.731
worthy+0.725
uable+0.703
Neg logit contributions
rils-1.303
atography-1.301
-1.288
estyles-1.280
-1.275
Token 13
Token ID1511
Feature activation-4.18e-01

Pos logit contributions
nas+5.586
yang+4.670
deal+4.634
worthy+4.613
 insulting+4.534
Neg logit contributions
estyles-6.546
-6.544
rils-6.501
atography-6.404
axies-6.282
Tokenng
Token ID782
Feature activation-9.07e-01

Pos logit contributions
nas+1.356
yang+1.159
worthy+1.127
deal+1.116
uable+1.077
Neg logit contributions
atography-1.825
-1.813
rils-1.809
estyles-1.791
-1.785
Tokenke
Token ID365
Feature activation-2.89e-01

Pos logit contributions
nas+4.182
yang+3.636
worthy+3.498
deal+3.423
uable+3.321
Neg logit contributions
-4.934
atography-4.912
estyles-4.909
rils-4.890
axies-4.794
Token (
Token ID357
Feature activation-3.57e-01

Pos logit contributions
nas+1.272
yang+1.099
worthy+1.065
deal+1.051
uable+1.018
Neg logit contributions
atography-1.652
rils-1.645
estyles-1.629
-1.628
-1.613
Tokenmeaning
Token ID24815
Feature activation-5.95e-01

Pos logit contributions
nas+1.715
yang+1.496
worthy+1.471
deal+1.466
uable+1.390
Neg logit contributions
atography-1.900
rils-1.888
estyles-1.883
-1.850
-1.841
Token "
Token ID366
Feature activation-4.52e-01

Pos logit contributions
nas+2.704
yang+2.355
worthy+2.324
deal+2.304
uable+2.224
Neg logit contributions
estyles-3.269
atography-3.266
rils-3.263
-3.179
-3.157
Tokenet
Token ID316
Feature activation-2.66e+00

Pos logit contributions
nas+2.199
yang+1.911
deal+1.906
worthy+1.904
uable+1.797
Neg logit contributions
atography-2.392
estyles-2.376
rils-2.364
-2.341
-2.318
Tokenernal
Token ID35220
Feature activation-5.33e-01

Pos logit contributions
nas+8.950
worthy+7.051
yang+6.829
uable+6.573
deal+6.395
Neg logit contributions
-15.546
estyles-15.193
 cues-15.044
atography-14.932
axies-14.777
Token"
Token ID1
Feature activation-1.78e-01

Pos logit contributions
nas+2.350
yang+2.023
worthy+2.013
deal+2.001
uable+1.932
Neg logit contributions
atography-2.991
estyles-2.989
rils-2.963
-2.944
-2.899
Token in
Token ID287
Feature activation+6.71e-03

Pos logit contributions
nas+0.656
yang+0.543
worthy+0.533
deal+0.530
uable+0.504
Neg logit contributions
atography-1.142
rils-1.141
estyles-1.132
-1.127
-1.123
Token the
Token ID262
Feature activation+2.86e-01

Pos logit contributions
rils+0.042
atography+0.041
estyles+0.041
+0.041
+0.040
Neg logit contributions
nas-0.027
yang-0.022
worthy-0.022
deal-0.022
uable-0.021
Token Mongol
Token ID42188
Feature activation+5.97e-01

Pos logit contributions
rils+1.682
atography+1.675
+1.649
estyles+1.647
+1.647
Neg logit contributions
nas-1.214
yang-1.027
deal-1.007
worthy-1.003
uable-0.976
Token
Token ID198
Feature activation-4.09e-02

Pos logit contributions
atography+4.358
rils+4.355
+4.283
+4.265
estyles+4.249
Neg logit contributions
nas-4.332
yang-3.788
worthy-3.711
deal-3.695
uable-3.644
Token
Token ID198
Feature activation-8.28e-02

Pos logit contributions
nas+0.206
yang+0.179
deal+0.177
worthy+0.176
uable+0.171
Neg logit contributions
rils-0.209
atography-0.208
-0.205
-0.204
estyles-0.204
Tokenhttps
Token ID5450
Feature activation+2.52e-01

Pos logit contributions
nas+0.401
deal+0.345
yang+0.344
worthy+0.339
uable+0.330
Neg logit contributions
rils-0.439
atography-0.439
-0.430
estyles-0.429
-0.427
Token://
Token ID1378
Feature activation-8.17e-01

Pos logit contributions
rils+1.482
atography+1.476
+1.457
+1.449
estyles+1.447
Neg logit contributions
nas-1.061
yang-0.897
deal-0.876
worthy-0.876
uable-0.858
Tokenkill
Token ID12728
Feature activation+2.48e-02

Pos logit contributions
nas+3.450
yang+2.906
deal+2.882
worthy+2.846
uable+2.684
Neg logit contributions
atography-4.782
-4.733
-4.728
rils-4.690
estyles-4.645
Tokenert
Token ID861
Feature activation-2.66e+00

Pos logit contributions
rils+0.135
atography+0.135
+0.134
+0.134
estyles+0.132
Neg logit contributions
nas-0.115
yang-0.099
deal-0.098
worthy-0.098
uable-0.095
Tokenof
Token ID1659
Feature activation-4.55e-01

Pos logit contributions
nas+12.597
worthy+11.390
yang+11.242
deal+11.181
oken+11.122
Neg logit contributions
-12.365
-11.506
 cues-10.890
————————-10.718
atography-10.672
Tokenure
Token ID495
Feature activation+1.33e-01

Pos logit contributions
nas+2.371
yang+2.071
deal+2.036
worthy+2.036
uable+1.957
Neg logit contributions
-2.256
atography-2.243
rils-2.219
-2.203
estyles-2.178
Tokenc
Token ID66
Feature activation+5.83e-01

Pos logit contributions
rils+0.769
atography+0.766
+0.764
+0.760
estyles+0.751
Neg logit contributions
nas-0.580
yang-0.493
worthy-0.482
deal-0.481
uable-0.468
Tokenords
Token ID3669
Feature activation-6.32e-01

Pos logit contributions
rils+2.884
atography+2.870
+2.823
estyles+2.808
+2.804
Neg logit contributions
nas-3.011
yang-2.627
deal-2.589
worthy-2.582
uable-2.531
Token.
Token ID13
Feature activation-6.16e-01

Pos logit contributions
nas+2.712
yang+2.285
deal+2.275
worthy+2.260
uable+2.206
Neg logit contributions
rils-3.634
atography-3.632
-3.631
-3.599
estyles-3.545
Token dividends
Token ID26360
Feature activation-1.05e+00

Pos logit contributions
rils+2.792
atography+2.780
+2.754
+2.722
estyles+2.720
Neg logit contributions
nas-2.493
yang-2.164
deal-2.117
worthy-2.116
uable-2.079
Token.
Token ID13
Feature activation-3.45e-01

Pos logit contributions
nas+4.115
worthy+3.363
deal+3.328
yang+3.222
uable+3.124
Neg logit contributions
-6.449
rils-6.421
atography-6.379
estyles-6.292
-6.206
Token Psycho
Token ID38955
Feature activation+1.08e+00

Pos logit contributions
nas+1.736
deal+1.508
yang+1.496
worthy+1.490
uable+1.434
Neg logit contributions
rils-1.754
estyles-1.734
atography-1.718
-1.693
-1.692
Token was
Token ID373
Feature activation-1.98e-02

Pos logit contributions
atography+5.918
rils+5.889
+5.828
+5.779
estyles+5.752
Neg logit contributions
nas-4.766
yang-4.102
worthy-3.994
deal-3.966
uable-3.934
Token a
Token ID257
Feature activation-5.55e-01

Pos logit contributions
nas+0.089
yang+0.075
deal+0.075
worthy+0.075
uable+0.073
Neg logit contributions
rils-0.112
atography-0.111
-0.110
-0.110
estyles-0.110
Token cert
Token ID5051
Feature activation-2.66e+00

Pos logit contributions
nas+2.387
worthy+2.024
deal+2.015
yang+2.004
uable+1.977
Neg logit contributions
rils-3.243
atography-3.180
-3.157
estyles-3.148
-3.118
Tokenifiable
Token ID16823
Feature activation-1.77e+00

Pos logit contributions
nas+9.990
worthy+9.295
uable+9.174
deal+8.891
 redeem+8.422
Neg logit contributions
-14.132
atography-13.493
estyles-13.347
rils-13.299
axies-13.240
Token smash
Token ID24273
Feature activation+1.42e-01

Pos logit contributions
nas+7.165
worthy+6.414
deal+6.328
uable+6.217
yang+5.927
Neg logit contributions
rils-9.994
-9.829
atography-9.570
estyles-9.561
axies-9.093
Token,
Token ID11
Feature activation-2.07e-01

Pos logit contributions
rils+0.834
atography+0.828
+0.817
+0.816
estyles+0.814
Neg logit contributions
nas-0.602
yang-0.507
deal-0.501
worthy-0.500
uable-0.485
Tokenand
Token ID392
Feature activation-7.83e-02

Pos logit contributions
nas+0.879
worthy+0.736
yang+0.731
deal+0.727
uable+0.709
Neg logit contributions
rils-1.220
atography-1.211
-1.199
estyles-1.195
-1.189
Token he
Token ID339
Feature activation+1.73e-01

Pos logit contributions
nas+0.365
yang+0.310
deal+0.310
worthy+0.309
uable+0.302
Neg logit contributions
rils-0.428
atography-0.427
-0.422
estyles-0.421
-0.419
Tokenone
Token ID505
Feature activation+6.06e-01

Pos logit contributions
nas+0.576
yang+0.507
deal+0.504
worthy+0.501
uable+0.490
Neg logit contributions
rils-0.462
atography-0.461
-0.454
-0.454
estyles-0.451
Token lamb
Token ID19343
Feature activation+2.65e-01

Pos logit contributions
rils+3.290
atography+3.284
+3.235
+3.222
estyles+3.219
Neg logit contributions
nas-2.796
yang-2.408
deal-2.363
worthy-2.357
uable-2.310
Token ste
Token ID2876
Feature activation-2.92e-01

Pos logit contributions
rils+1.498
atography+1.491
+1.474
+1.473
estyles+1.461
Neg logit contributions
nas-1.190
yang-1.008
deal-0.992
worthy-0.991
uable-0.966
Tokenaks
Token ID4730
Feature activation-5.02e-01

Pos logit contributions
nas+1.362
yang+1.185
worthy+1.175
deal+1.161
uable+1.147
Neg logit contributions
-1.610
atography-1.569
rils-1.567
-1.565
estyles-1.505
Token from
Token ID422
Feature activation-5.87e-01

Pos logit contributions
nas+2.079
worthy+1.698
yang+1.693
deal+1.666
 redeem+1.597
Neg logit contributions
rils-3.053
atography-3.015
-3.015
estyles-2.988
-2.976
Token Scot
Token ID7130
Feature activation-2.64e+00

Pos logit contributions
nas+2.705
yang+2.284
worthy+2.281
deal+2.263
uable+2.195
Neg logit contributions
rils-3.232
atography-3.220
estyles-3.167
-3.142
-3.126
Tokenmid
Token ID13602
Feature activation-2.37e-01

Pos logit contributions
nas+12.451
worthy+10.812
deal+10.713
yang+10.173
tsy+10.072
Neg logit contributions
atography-11.673
-11.623
rils-11.546
estyles-11.355
-11.318
Token.'
Token ID2637
Feature activation-8.75e-02

Pos logit contributions
nas+1.078
yang+0.919
deal+0.909
worthy+0.904
uable+0.869
Neg logit contributions
atography-1.328
rils-1.324
-1.309
-1.308
estyles-1.301
Token
Token ID198
Feature activation-2.30e-01

Pos logit contributions
nas+0.453
yang+0.396
deal+0.390
worthy+0.389
uable+0.382
Neg logit contributions
rils-0.432
atography-0.430
-0.423
-0.421
estyles-0.421
Token
Token ID198
Feature activation-1.62e-01

Pos logit contributions
nas+1.132
yang+0.972
deal+0.968
worthy+0.964
uable+0.927
Neg logit contributions
rils-1.204
atography-1.199
-1.191
estyles-1.190
-1.182
TokenThe
Token ID464
Feature activation-1.45e-01

Pos logit contributions
nas+0.803
deal+0.688
yang+0.685
worthy+0.676
uable+0.654
Neg logit contributions
atography-0.852
rils-0.847
estyles-0.833
-0.828
-0.826
Token
Token ID198
Feature activation-3.58e-02

Pos logit contributions
rils+0.555
atography+0.553
+0.544
+0.542
estyles+0.541
Neg logit contributions
nas-0.574
yang-0.500
deal-0.493
worthy-0.492
uable-0.482
TokenGet
Token ID3855
Feature activation-6.80e-01

Pos logit contributions
nas+0.181
yang+0.156
deal+0.155
worthy+0.154
uable+0.150
Neg logit contributions
rils-0.182
atography-0.182
estyles-0.178
-0.178
-0.177
Token our
Token ID674
Feature activation-8.74e-01

Pos logit contributions
nas+2.381
yang+1.885
deal+1.880
uable+1.855
worthy+1.846
Neg logit contributions
rils-4.408
-4.377
estyles-4.359
axies-4.283
atography-4.272
Token daily
Token ID4445
Feature activation-1.25e+00

Pos logit contributions
nas+2.069
deal+1.565
yang+1.542
worthy+1.519
uable+1.431
Neg logit contributions
rils-6.723
-6.638
atography-6.589
estyles-6.558
-6.550
Token newsletter
Token ID13129
Feature activation-1.91e+00

Pos logit contributions
nas+3.742
deal+3.073
worthy+3.002
 redeem+2.904
yang+2.811
Neg logit contributions
-8.650
atography-8.506
rils-8.451
estyles-8.368
axies-8.218
Token Upgrade
Token ID24236
Feature activation-2.64e+00

Pos logit contributions
nas+5.241
yang+3.580
deal+3.523
worthy+3.491
 redeem+3.480
Neg logit contributions
-13.252
axies-13.182
atography-13.151
estyles-13.103
rils-12.858
Token your
Token ID534
Feature activation-6.66e-01

Pos logit contributions
nas+5.452
worthy+3.899
deal+3.504
 insulting+3.408
yang+3.250
Neg logit contributions
estyles-19.319
-19.318
atography-18.852
rils-18.624
axies-18.608
Token inbox
Token ID13734
Feature activation-1.06e+00

Pos logit contributions
nas+2.281
yang+1.841
deal+1.791
worthy+1.780
uable+1.708
Neg logit contributions
atography-4.459
rils-4.458
estyles-4.434
-4.390
-4.343
Token and
Token ID290
Feature activation-4.73e-01

Pos logit contributions
nas+3.920
deal+3.067
worthy+3.038
yang+2.971
 redeem+2.845
Neg logit contributions
-6.812
atography-6.727
estyles-6.700
rils-6.576
-6.571
Token get
Token ID651
Feature activation-4.78e-01

Pos logit contributions
nas+1.750
yang+1.428
deal+1.401
worthy+1.381
uable+1.349
Neg logit contributions
atography-3.044
rils-3.038
-3.008
estyles-3.000
-2.983
Token our
Token ID674
Feature activation-5.60e-01

Pos logit contributions
nas+1.821
deal+1.483
worthy+1.473
yang+1.466
uable+1.436
Neg logit contributions
-3.008
atography-3.003
rils-2.984
-2.948
estyles-2.945
Token
Token ID198
Feature activation+2.14e-01

Pos logit contributions
rils+0.014
atography+0.014
estyles+0.013
+0.013
+0.013
Neg logit contributions
nas-0.013
yang-0.012
deal-0.012
worthy-0.011
uable-0.011
TokenGet
Token ID3855
Feature activation-5.02e-01

Pos logit contributions
rils+1.037
atography+1.034
+1.015
+1.011
estyles+1.010
Neg logit contributions
nas-1.124
yang-0.982
deal-0.970
worthy-0.967
uable-0.947
Token our
Token ID674
Feature activation-9.01e-01

Pos logit contributions
nas+1.881
deal+1.525
worthy+1.517
yang+1.488
uable+1.478
Neg logit contributions
rils-3.193
-3.155
atography-3.128
estyles-3.128
-3.105
Token daily
Token ID4445
Feature activation-1.35e+00

Pos logit contributions
nas+2.041
deal+1.538
yang+1.506
worthy+1.491
uable+1.421
Neg logit contributions
rils-6.978
-6.958
atography-6.907
estyles-6.842
-6.825
Token newsletter
Token ID13129
Feature activation-1.92e+00

Pos logit contributions
nas+3.917
deal+3.289
worthy+3.225
 redeem+3.067
yang+2.908
Neg logit contributions
-9.477
rils-9.134
atography-9.102
estyles-9.043
axies-8.879
Token Upgrade
Token ID24236
Feature activation-2.64e+00

Pos logit contributions
nas+5.311
deal+3.792
worthy+3.780
yang+3.739
 redeem+3.640
Neg logit contributions
-13.176
axies-12.957
estyles-12.951
atography-12.882
rils-12.750
Token your
Token ID534
Feature activation-7.46e-01

Pos logit contributions
nas+5.550
worthy+3.986
deal+3.669
 insulting+3.559
 worrying+3.467
Neg logit contributions
estyles-19.261
-19.098
atography-18.791
rils-18.623
axies-18.462
Token inbox
Token ID13734
Feature activation-1.09e+00

Pos logit contributions
nas+2.538
yang+2.023
deal+1.983
worthy+1.973
uable+1.885
Neg logit contributions
atography-5.023
rils-5.016
estyles-4.992
-4.938
-4.895
Token and
Token ID290
Feature activation-4.04e-01

Pos logit contributions
nas+4.038
deal+3.165
worthy+3.113
yang+3.038
 redeem+2.930
Neg logit contributions
-7.062
estyles-6.969
atography-6.954
rils-6.827
-6.823
Token get
Token ID651
Feature activation-4.90e-01

Pos logit contributions
nas+1.510
yang+1.235
deal+1.211
worthy+1.195
uable+1.168
Neg logit contributions
atography-2.589
rils-2.586
-2.560
estyles-2.551
-2.542
Token our
Token ID674
Feature activation-5.46e-01

Pos logit contributions
nas+1.876
deal+1.533
yang+1.512
worthy+1.509
uable+1.482
Neg logit contributions
-3.069
atography-3.066
rils-3.054
-3.021
estyles-3.006
Token surprised
Token ID6655
Feature activation-3.20e-01

Pos logit contributions
rils+4.014
atography+4.002
+3.926
+3.910
estyles+3.892
Neg logit contributions
nas-3.035
yang-2.614
deal-2.541
worthy-2.531
uable-2.507
Token me
Token ID502
Feature activation-1.75e-01

Pos logit contributions
nas+0.966
worthy+0.752
yang+0.748
deal+0.740
uable+0.698
Neg logit contributions
rils-2.263
atography-2.252
estyles-2.232
-2.223
-2.221
Token.
Token ID13
Feature activation-1.63e-03

Pos logit contributions
nas+0.737
yang+0.615
worthy+0.612
deal+0.603
uable+0.582
Neg logit contributions
rils-1.041
atography-1.030
estyles-1.022
-1.015
-1.014
Token We
Token ID775
Feature activation+4.64e-01

Pos logit contributions
nas+0.008
yang+0.007
worthy+0.007
deal+0.007
uable+0.007
Neg logit contributions
rils-0.008
atography-0.008
estyles-0.008
-0.008
-0.008
Token are
Token ID389
Feature activation+4.89e-01

Pos logit contributions
rils+2.509
atography+2.500
+2.465
+2.454
estyles+2.449
Neg logit contributions
nas-2.185
yang-1.881
deal-1.849
worthy-1.845
uable-1.806
Token Luther
Token ID17461
Feature activation+1.98e+00

Pos logit contributions
rils+2.816
atography+2.806
+2.774
+2.757
estyles+2.738
Neg logit contributions
nas-2.105
yang-1.796
deal-1.756
worthy-1.751
uable-1.716
Tokenans
Token ID504
Feature activation+9.79e-01

Pos logit contributions
rils+7.324
atography+7.204
+6.991
+6.926
axies+6.828
Neg logit contributions
nas-11.134
yang-10.521
deal-10.317
xus-10.263
worthy-10.178
Token,
Token ID11
Feature activation+6.71e-01

Pos logit contributions
atography+5.307
rils+5.289
+5.249
+5.197
estyles+5.141
Neg logit contributions
nas-4.381
yang-3.806
deal-3.717
worthy-3.662
uable-3.643
Token and
Token ID290
Feature activation+4.61e-01

Pos logit contributions
rils+3.670
atography+3.653
+3.620
+3.588
estyles+3.562
Neg logit contributions
nas-3.074
yang-2.646
deal-2.599
worthy-2.582
uable-2.541
Token Luther
Token ID17461
Feature activation+2.08e+00

Pos logit contributions
rils+2.521
atography+2.511
+2.481
+2.467
estyles+2.453
Neg logit contributions
nas-2.129
yang-1.829
deal-1.797
worthy-1.790
uable-1.759
Token
Token ID447
Feature activation-1.82e-01

Pos logit contributions
atography+9.005
rils+8.810
+8.479
+8.449
axies+8.353
Neg logit contributions
nas-10.378
yang-9.721
worthy-9.379
deal-9.360
xus-9.305
Token case
Token ID1339
Feature activation-3.78e-01

Pos logit contributions
rils+1.921
atography+1.912
+1.887
+1.883
estyles+1.878
Neg logit contributions
nas-1.684
yang-1.451
deal-1.421
worthy-1.420
uable-1.384
Token and
Token ID290
Feature activation-3.24e-01

Pos logit contributions
nas+1.598
yang+1.351
worthy+1.316
deal+1.315
uable+1.279
Neg logit contributions
rils-2.225
atography-2.214
-2.185
-2.184
estyles-2.174
Token Tan
Token ID11818
Feature activation-5.14e-01

Pos logit contributions
nas+1.440
yang+1.239
worthy+1.199
deal+1.194
uable+1.157
Neg logit contributions
atography-1.840
rils-1.834
-1.827
estyles-1.816
-1.808
Tokenj
Token ID73
Feature activation-7.49e-01

Pos logit contributions
nas+2.196
yang+1.873
deal+1.824
worthy+1.819
uable+1.756
Neg logit contributions
rils-3.004
atography-2.992
-2.955
-2.952
estyles-2.929
Tokenian
Token ID666
Feature activation-5.04e-01

Pos logit contributions
nas+2.800
yang+2.415
deal+2.262
worthy+2.255
uable+2.164
Neg logit contributions
-4.897
rils-4.755
-4.749
atography-4.736
estyles-4.686
Token Fl
Token ID1610
Feature activation+2.35e+00

Pos logit contributions
nas+2.241
yang+1.955
worthy+1.854
deal+1.850
uable+1.785
Neg logit contributions
rils-2.854
-2.850
atography-2.838
estyles-2.803
-2.784
Tokenock
Token ID735
Feature activation+4.12e-01

Pos logit contributions
atography+10.952
+10.941
axies+10.794
rils+10.568
estyles+10.436
Neg logit contributions
nas-10.998
deal-9.488
worthy-9.474
yang-9.264
uable-9.250
Token 1
Token ID352
Feature activation+2.79e-01

Pos logit contributions
rils+2.185
atography+2.178
+2.142
+2.140
estyles+2.131
Neg logit contributions
nas-1.996
yang-1.727
worthy-1.695
deal-1.695
uable-1.655
TokenC
Token ID34
Feature activation-4.73e-01

Pos logit contributions
rils+1.551
atography+1.546
+1.522
+1.522
estyles+1.516
Neg logit contributions
nas-1.280
yang-1.097
deal-1.077
worthy-1.075
uable-1.050
Token-
Token ID12
Feature activation-4.90e-01

Pos logit contributions
nas+2.185
yang+1.899
worthy+1.840
deal+1.825
uable+1.794
Neg logit contributions
atography-2.580
rils-2.577
-2.561
estyles-2.542
-2.506
Token5
Token ID20
Feature activation+5.55e-02

Pos logit contributions
nas+1.987
yang+1.670
worthy+1.659
deal+1.632
uable+1.570
Neg logit contributions
atography-2.954
rils-2.945
-2.938
estyles-2.927
-2.888
Token).
Token ID737
Feature activation+1.37e-01

Pos logit contributions
rils+0.361
atography+0.358
+0.356
+0.353
estyles+0.352
Neg logit contributions
nas-0.247
yang-0.204
worthy-0.203
deal-0.201
uable-0.196
Token
Token ID198
Feature activation+4.80e-01

Pos logit contributions
rils+0.681
atography+0.676
estyles+0.665
+0.664
+0.661
Neg logit contributions
nas-0.712
yang-0.621
deal-0.611
worthy-0.611
uable-0.595
Token
Token ID198
Feature activation+1.69e-01

Pos logit contributions
rils+2.334
atography+2.323
+2.292
+2.275
estyles+2.262
Neg logit contributions
nas-2.506
yang-2.192
deal-2.155
worthy-2.154
uable-2.123
TokenAfter
Token ID3260
Feature activation-2.40e-02

Pos logit contributions
rils+0.859
atography+0.857
+0.841
+0.840
estyles+0.838
Neg logit contributions
nas-0.853
yang-0.740
deal-0.731
worthy-0.728
uable-0.711
Token the
Token ID262
Feature activation+5.84e-02

Pos logit contributions
nas+0.109
yang+0.093
deal+0.091
worthy+0.091
uable+0.088
Neg logit contributions
rils-0.134
atography-0.134
estyles-0.132
-0.132
-0.132
Token lights
Token ID7588
Feature activation+2.01e+00

Pos logit contributions
rils+0.318
atography+0.317
+0.313
estyles+0.313
+0.312
Neg logit contributions
nas-0.274
yang-0.234
deal-0.231
worthy-0.230
uable-0.224
Token came
Token ID1625
Feature activation+9.02e-01

Pos logit contributions
atography+9.243
rils+9.215
+9.173
 cues+9.056
estyles+8.725
Neg logit contributions
nas-9.681
worthy-8.500
xus-8.354
yang-8.330
deal-8.248
Token back
Token ID736
Feature activation+1.13e+00

Pos logit contributions
rils+4.928
atography+4.915
+4.883
+4.874
estyles+4.781
Neg logit contributions
nas-4.046
yang-3.476
worthy-3.437
deal-3.421
uable-3.334
Token up
Token ID510
Feature activation+2.85e-01

Pos logit contributions
rils+6.016
atography+5.971
+5.970
+5.940
 cues+5.814
Neg logit contributions
nas-5.072
yang-4.374
deal-4.330
worthy-4.316
uable-4.232
Token,
Token ID11
Feature activation+2.63e-01

Pos logit contributions
rils+1.663
atography+1.659
+1.634
+1.629
estyles+1.629
Neg logit contributions
nas-1.231
yang-1.041
worthy-1.018
deal-1.017
uable-0.989
Token
Token ID564
Feature activation+1.04e+00

Pos logit contributions
rils+1.427
atography+1.422
+1.402
+1.399
estyles+1.396
Neg logit contributions
nas-1.236
yang-1.064
deal-1.043
worthy-1.042
uable-1.017
Tokenom
Token ID296
Feature activation+8.78e-02

Pos logit contributions
rils+0.251
atography+0.249
+0.246
+0.245
estyles+0.241
Neg logit contributions
nas-0.444
yang-0.398
worthy-0.394
deal-0.392
uable-0.388
Token Hart
Token ID11345
Feature activation-1.77e+00

Pos logit contributions
rils+0.466
atography+0.465
+0.457
+0.457
estyles+0.456
Neg logit contributions
nas-0.424
yang-0.365
deal-0.359
worthy-0.358
uable-0.350
Tokenman
Token ID805
Feature activation-5.01e-01

Pos logit contributions
nas+8.497
yang+7.144
worthy+6.987
deal+6.943
uable+6.716
Neg logit contributions
-8.727
estyles-8.629
-8.565
atography-8.546
rils-8.487
Token Institute
Token ID5136
Feature activation-9.66e-01

Pos logit contributions
nas+2.487
yang+2.123
worthy+2.085
deal+2.083
uable+1.999
Neg logit contributions
rils-2.597
atography-2.593
estyles-2.557
-2.550
-2.547
Token.
Token ID13
Feature activation+2.62e-01

Pos logit contributions
nas+3.898
yang+3.106
deal+3.106
worthy+3.047
uable+2.830
Neg logit contributions
rils-5.862
atography-5.797
estyles-5.787
-5.733
-5.715
TokenAmid
Token ID43541
Feature activation+2.26e+00

Pos logit contributions
rils+1.318
atography+1.310
+1.288
estyles+1.287
+1.278
Neg logit contributions
nas-1.345
yang-1.165
deal-1.153
worthy-1.149
uable-1.121
Token plac
Token ID21957
Feature activation+5.34e-01

Pos logit contributions
rils+11.321
+11.176
+10.849
atography+10.724
 cues+10.695
Neg logit contributions
nas-9.794
yang-8.577
uable-8.398
deal-8.258
worthy-8.218
Tokenards
Token ID1371
Feature activation-7.05e-01

Pos logit contributions
rils+3.220
atography+3.204
estyles+3.157
+3.140
axies+3.101
Neg logit contributions
nas-2.161
yang-1.833
deal-1.798
worthy-1.764
uable-1.712
Token including
Token ID1390
Feature activation+4.21e-01

Pos logit contributions
nas+2.724
yang+2.255
worthy+2.249
 insulting+2.184
deal+2.167
Neg logit contributions
rils-4.345
atography-4.317
estyles-4.309
axies-4.243
-4.238
Token
Token ID564
Feature activation+6.39e-01

Pos logit contributions
rils+2.188
atography+2.180
+2.153
+2.138
estyles+2.135
Neg logit contributions
nas-2.064
yang-1.790
deal-1.760
worthy-1.752
uable-1.724
Token
Token ID250
Feature activation-6.14e-01

Pos logit contributions
+1.921
rils+1.918
atography+1.894
+1.844
estyles+1.828
Neg logit contributions
nas-4.525
yang-4.113
deal-4.062
worthy-4.056
uable-3.998
Token budgetary
Token ID47560
Feature activation+4.74e-01

Pos logit contributions
rils+2.429
atography+2.418
+2.386
+2.375
estyles+2.366
Neg logit contributions
nas-1.965
yang-1.682
deal-1.650
worthy-1.646
uable-1.611
Token needs
Token ID2476
Feature activation+2.00e-01

Pos logit contributions
rils+1.827
atography+1.814
+1.778
+1.776
estyles+1.769
Neg logit contributions
nas-2.978
yang-2.659
deal-2.634
worthy-2.629
uable-2.579
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
rils+1.163
atography+1.156
+1.142
+1.139
estyles+1.136
Neg logit contributions
nas-0.867
yang-0.734
worthy-0.724
deal-0.721
uable-0.698
Token This
Token ID770
Feature activation-5.38e-02

Pos logit contributions
rils+0.683
atography+0.675
estyles+0.667
+0.663
+0.659
Neg logit contributions
nas-0.695
yang-0.605
deal-0.599
worthy-0.595
uable-0.579
Token book
Token ID1492
Feature activation+5.79e-01

Pos logit contributions
nas+0.260
yang+0.224
deal+0.222
worthy+0.221
uable+0.215
Neg logit contributions
rils-0.286
atography-0.284
-0.280
estyles-0.280
-0.279
Token falls
Token ID8953
Feature activation+1.96e+00

Pos logit contributions
rils+2.854
atography+2.847
+2.799
+2.785
estyles+2.780
Neg logit contributions
nas-2.985
yang-2.606
deal-2.560
worthy-2.558
uable-2.511
Token into
Token ID656
Feature activation+5.45e-01

Pos logit contributions
atography+9.672
+9.453
rils+9.447
 cues+9.416
axies+9.248
Neg logit contributions
nas-9.047
yang-7.763
deal-7.683
xus-7.462
uable-7.418
Token the
Token ID262
Feature activation+6.52e-01

Pos logit contributions
rils+3.030
atography+3.019
+2.978
+2.961
estyles+2.956
Neg logit contributions
nas-2.478
yang-2.118
deal-2.078
worthy-2.074
uable-2.030
Token
Token ID564
Feature activation+7.28e-01

Pos logit contributions
rils+3.340
atography+3.334
+3.283
+3.259
estyles+3.250
Neg logit contributions
nas-3.239
yang-2.814
deal-2.757
worthy-2.750
uable-2.702
Token
Token ID250
Feature activation-4.34e-01

Pos logit contributions
rils+2.209
atography+2.190
+2.158
+2.122
estyles+2.110
Neg logit contributions
nas-5.152
yang-4.678
deal-4.623
worthy-4.619
uable-4.554
Tokenco
Token ID1073
Feature activation+1.48e-01

Pos logit contributions
nas+2.143
deal+1.819
worthy+1.817
yang+1.813
uable+1.779
Neg logit contributions
rils-2.252
atography-2.233
estyles-2.195
-2.176
axies-2.167
Token ignore
Token ID8856
Feature activation+3.58e-01

Pos logit contributions
atography+5.434
rils+5.407
+5.367
 cues+5.265
+5.236
Neg logit contributions
nas-5.403
yang-4.734
worthy-4.674
xus-4.631
deal-4.628
Token the
Token ID262
Feature activation-7.34e-02

Pos logit contributions
rils+1.927
atography+1.918
+1.896
+1.889
estyles+1.880
Neg logit contributions
nas-1.678
yang-1.446
deal-1.416
worthy-1.415
uable-1.388
Token new
Token ID649
Feature activation+9.55e-02

Pos logit contributions
nas+0.364
deal+0.314
yang+0.313
worthy+0.310
uable+0.302
Neg logit contributions
rils-0.382
atography-0.381
-0.373
estyles-0.372
-0.371
Token rules
Token ID3173
Feature activation+2.43e-01

Pos logit contributions
rils+0.412
atography+0.411
+0.401
+0.399
estyles+0.399
Neg logit contributions
nas-0.557
yang-0.493
deal-0.490
worthy-0.487
uable-0.476
Token from
Token ID422
Feature activation+2.40e-01

Pos logit contributions
rils+1.414
atography+1.409
+1.389
+1.385
estyles+1.382
Neg logit contributions
nas-1.050
yang-0.888
deal-0.872
worthy-0.871
uable-0.849
Token the
Token ID262
Feature activation+3.15e-01

Pos logit contributions
rils+1.317
atography+1.311
+1.292
+1.286
estyles+1.286
Neg logit contributions
nas-1.113
yang-0.954
deal-0.939
worthy-0.936
uable-0.915
Token central
Token ID4318
Feature activation-4.02e-01

Pos logit contributions
rils+1.892
atography+1.884
+1.860
+1.853
estyles+1.851
Neg logit contributions
nas-1.295
yang-1.086
deal-1.067
worthy-1.062
uable-1.035
Token government
Token ID1230
Feature activation+2.19e-01

Pos logit contributions
nas+1.496
yang+1.193
deal+1.188
worthy+1.166
 redeem+1.111
Neg logit contributions
rils-2.596
atography-2.574
estyles-2.557
-2.540
-2.536
Token and
Token ID290
Feature activation+3.69e-02

Pos logit contributions
rils+1.274
atography+1.270
+1.252
+1.249
estyles+1.247
Neg logit contributions
nas-0.945
yang-0.797
deal-0.785
worthy-0.782
uable-0.759
Token will
Token ID481
Feature activation+5.39e-01

Pos logit contributions
rils+0.196
atography+0.196
estyles+0.193
+0.192
+0.192
Neg logit contributions
nas-0.178
yang-0.153
deal-0.151
worthy-0.150
uable-0.147
Token continue
Token ID2555
Feature activation-1.18e+00

Pos logit contributions
rils+2.794
atography+2.791
+2.748
+2.720
estyles+2.713
Neg logit contributions
nas-2.608
yang-2.262
worthy-2.225
deal-2.215
uable-2.177
Token a
Token ID257
Feature activation+5.59e-01

Pos logit contributions
nas+0.243
yang+0.208
deal+0.204
worthy+0.203
uable+0.199
Neg logit contributions
rils-0.303
atography-0.302
-0.298
-0.297
estyles-0.297
Token release
Token ID2650
Feature activation-7.16e-02

Pos logit contributions
rils+2.986
atography+2.970
+2.954
+2.925
estyles+2.896
Neg logit contributions
nas-2.610
yang-2.263
worthy-2.204
deal-2.190
uable-2.162
Token date
Token ID3128
Feature activation+5.46e-03

Pos logit contributions
nas+0.283
yang+0.236
worthy+0.233
deal+0.232
uable+0.222
Neg logit contributions
atography-0.444
rils-0.443
-0.436
estyles-0.436
-0.435
Token for
Token ID329
Feature activation+2.68e-01

Pos logit contributions
rils+0.033
atography+0.033
+0.033
+0.033
estyles+0.033
Neg logit contributions
nas-0.022
yang-0.019
worthy-0.018
deal-0.018
uable-0.017
Token either
Token ID2035
Feature activation-1.91e-01

Pos logit contributions
rils+1.384
atography+1.379
+1.355
+1.355
estyles+1.351
Neg logit contributions
nas-1.332
yang-1.155
deal-1.138
worthy-1.135
uable-1.113
Token KH
Token ID40249
Feature activation+1.15e+00

Pos logit contributions
nas+0.806
yang+0.678
deal+0.674
worthy+0.670
uable+0.644
Neg logit contributions
rils-1.136
atography-1.131
-1.116
estyles-1.110
-1.105
Token3
Token ID18
Feature activation+9.40e-02

Pos logit contributions
rils+6.562
+6.533
atography+6.497
+6.378
estyles+6.325
Neg logit contributions
nas-4.918
yang-4.154
deal-4.133
worthy-4.112
uable-4.046
Token of
Token ID286
Feature activation+7.70e-03

Pos logit contributions
rils+0.595
atography+0.594
+0.586
+0.585
estyles+0.584
Neg logit contributions
nas-0.357
yang-0.294
deal-0.288
worthy-0.287
uable-0.279
Token FF
Token ID18402
Feature activation-7.09e-01

Pos logit contributions
rils+0.042
atography+0.041
+0.041
estyles+0.041
+0.040
Neg logit contributions
nas-0.036
yang-0.031
worthy-0.031
deal-0.031
uable-0.030
Token14
Token ID1415
Feature activation-1.68e-01

Pos logit contributions
nas+2.934
yang+2.500
worthy+2.448
deal+2.437
uable+2.381
Neg logit contributions
rils-4.182
atography-4.130
estyles-4.094
-4.066
axies-4.003
Token.
Token ID13
Feature activation+2.35e-01

Pos logit contributions
nas+0.658
yang+0.544
deal+0.536
worthy+0.534
uable+0.519
Neg logit contributions
rils-1.040
atography-1.040
-1.027
estyles-1.022
-1.021
Token from
Token ID422
Feature activation+2.18e-01

Pos logit contributions
rils+5.104
atography+5.098
+4.989
estyles+4.979
+4.965
Neg logit contributions
nas-3.539
yang-2.995
deal-2.944
worthy-2.904
uable-2.862
Token the
Token ID262
Feature activation+1.95e-01

Pos logit contributions
rils+1.198
atography+1.193
+1.175
+1.172
estyles+1.172
Neg logit contributions
nas-1.014
yang-0.871
deal-0.855
worthy-0.855
uable-0.835
Token closing
Token ID9605
Feature activation+9.84e-02

Pos logit contributions
rils+1.087
atography+1.082
+1.065
estyles+1.064
+1.064
Neg logit contributions
nas-0.893
yang-0.764
deal-0.750
worthy-0.749
uable-0.731
Token defender
Token ID13191
Feature activation-2.09e-01

Pos logit contributions
atography+0.544
rils+0.543
+0.533
+0.533
estyles+0.531
Neg logit contributions
nas-0.453
yang-0.387
deal-0.380
worthy-0.380
uable-0.372
Token or
Token ID393
Feature activation-1.39e-01

Pos logit contributions
nas+0.821
yang+0.677
deal+0.663
worthy+0.660
uable+0.641
Neg logit contributions
rils-1.297
atography-1.291
-1.278
-1.276
estyles-1.275
Token a
Token ID257
Feature activation+3.46e-01

Pos logit contributions
nas+0.602
yang+0.515
worthy+0.502
deal+0.500
uable+0.490
Neg logit contributions
rils-0.801
atography-0.795
estyles-0.786
-0.783
-0.783
Token lack
Token ID3092
Feature activation+3.84e-01

Pos logit contributions
rils+1.961
atography+1.955
+1.930
+1.922
estyles+1.917
Neg logit contributions
nas-1.528
yang-1.301
deal-1.279
worthy-1.275
uable-1.245
Token of
Token ID286
Feature activation-1.54e-01

Pos logit contributions
rils+2.082
+2.071
atography+2.070
+2.061
estyles+2.042
Neg logit contributions
nas-1.785
yang-1.534
deal-1.505
worthy-1.500
xus-1.449
Token concentration
Token ID10368
Feature activation-3.35e-01

Pos logit contributions
nas+0.704
yang+0.603
worthy+0.593
deal+0.590
uable+0.577
Neg logit contributions
rils-0.855
atography-0.845
-0.838
-0.838
estyles-0.836
Token,
Token ID11
Feature activation-5.44e-01

Pos logit contributions
nas+1.328
yang+1.112
worthy+1.103
deal+1.080
uable+1.062
Neg logit contributions
rils-2.056
atography-2.034
estyles-2.013
-2.011
-2.009
Token Wallace
Token ID17161
Feature activation-8.05e-02

Pos logit contributions
nas+2.518
yang+2.179
worthy+2.138
deal+2.125
uable+2.070
Neg logit contributions
rils-2.931
atography-2.928
estyles-2.896
-2.893
axies-2.863
Token.
Token ID13
Feature activation+5.37e-01

Pos logit contributions
nas+1.043
yang+0.865
deal+0.860
worthy+0.848
uable+0.809
Neg logit contributions
atography-1.605
rils-1.602
estyles-1.591
-1.586
-1.561
TokenGraphics
Token ID18172
Feature activation-5.42e-01

Pos logit contributions
rils+1.462
atography+1.450
+1.422
estyles+1.394
+1.381
Neg logit contributions
nas-3.924
yang-3.592
deal-3.553
worthy-3.541
uable-3.503
Token]
Token ID60
Feature activation-6.65e-01

Pos logit contributions
nas+2.384
yang+2.020
deal+1.981
worthy+1.979
uable+1.937
Neg logit contributions
rils-3.081
atography-3.069
-3.022
estyles-3.008
-3.005
Token::
Token ID3712
Feature activation+3.67e-01

Pos logit contributions
nas+2.485
deal+2.047
worthy+1.991
yang+1.987
uable+1.922
Neg logit contributions
rils-4.269
estyles-4.179
atography-4.175
-4.156
axies-4.102
TokenFrom
Token ID4863
Feature activation+2.20e-01

Pos logit contributions
rils+1.684
atography+1.683
+1.673
+1.649
estyles+1.640
Neg logit contributions
nas-2.003
yang-1.776
worthy-1.745
deal-1.742
uable-1.715
TokenImage
Token ID5159
Feature activation+9.84e-01

Pos logit contributions
rils+1.311
atography+1.307
+1.287
+1.285
estyles+1.283
Neg logit contributions
nas-0.915
yang-0.770
deal-0.755
worthy-0.753
uable-0.734
Token($
Token ID16763
Feature activation-3.55e-01

Pos logit contributions
+5.643
atography+5.619
rils+5.513
+5.489
estyles+5.483
Neg logit contributions
nas-4.080
yang-3.604
xus-3.447
deal-3.439
worthy-3.426
Tokenb
Token ID65
Feature activation+8.48e-01

Pos logit contributions
nas+1.372
yang+1.150
deal+1.141
worthy+1.137
uable+1.079
Neg logit contributions
-2.217
atography-2.194
-2.188
rils-2.186
estyles-2.184
Tokenmp
Token ID3149
Feature activation+4.97e-01

Pos logit contributions
rils+4.093
atography+4.074
estyles+3.972
+3.962
axies+3.953
Neg logit contributions
nas-4.334
yang-3.839
worthy-3.730
deal-3.703
uable-3.691
Token)
Token ID8
Feature activation-3.19e-01

Pos logit contributions
atography+2.310
rils+2.302
+2.287
estyles+2.284
axies+2.225
Neg logit contributions
nas-2.668
yang-2.350
deal-2.303
worthy-2.289
xus-2.277
Token $
Token ID720
Feature activation-3.63e-01

Pos logit contributions
nas+1.594
yang+1.385
deal+1.368
worthy+1.363
uable+1.342
Neg logit contributions
rils-1.627
atography-1.605
-1.580
estyles-1.572
-1.572
Token into
Token ID656
Feature activation-4.33e-01

Pos logit contributions
nas+0.027
yang+0.022
deal+0.022
worthy+0.022
uable+0.021
Neg logit contributions
rils-0.036
atography-0.036
-0.035
-0.035
estyles-0.035
Token missed
Token ID6825
Feature activation-1.65e-01

Pos logit contributions
nas+1.886
yang+1.613
worthy+1.589
deal+1.573
uable+1.546
Neg logit contributions
rils-2.480
atography-2.478
-2.436
-2.429
estyles-2.412
Token opportunities
Token ID6443
Feature activation-4.55e-01

Pos logit contributions
nas+0.905
yang+0.799
worthy+0.791
deal+0.784
uable+0.775
Neg logit contributions
rils-0.774
atography-0.764
-0.755
-0.752
estyles-0.740
Token.
Token ID13
Feature activation-3.26e-01

Pos logit contributions
nas+1.822
yang+1.498
worthy+1.497
deal+1.462
uable+1.419
Neg logit contributions
rils-2.786
atography-2.777
-2.745
-2.735
estyles-2.729
Token The
Token ID383
Feature activation-1.55e-02

Pos logit contributions
nas+1.619
yang+1.409
deal+1.385
worthy+1.378
uable+1.330
Neg logit contributions
rils-1.685
atography-1.665
estyles-1.661
-1.633
-1.629
Token four
Token ID1440
Feature activation+3.24e-01

Pos logit contributions
nas+0.077
yang+0.066
deal+0.066
worthy+0.066
uable+0.064
Neg logit contributions
rils-0.080
atography-0.080
estyles-0.078
-0.078
-0.078
Token plays
Token ID5341
Feature activation-5.34e-01

Pos logit contributions
rils+1.849
atography+1.844
+1.815
+1.810
estyles+1.806
Neg logit contributions
nas-1.433
yang-1.219
worthy-1.198
deal-1.197
uable-1.169
Token above
Token ID2029
Feature activation+7.36e-02

Pos logit contributions
nas+2.114
yang+1.781
worthy+1.757
deal+1.743
uable+1.700
Neg logit contributions
rils-3.247
atography-3.219
-3.218
estyles-3.177
-3.156
Token,
Token ID11
Feature activation+4.64e-02

Pos logit contributions
rils+0.424
atography+0.421
+0.416
+0.415
estyles+0.413
Neg logit contributions
nas-0.322
yang-0.273
deal-0.269
worthy-0.269
uable-0.261
Token all
Token ID477
Feature activation+5.22e-01

Pos logit contributions
rils+0.264
atography+0.263
+0.260
+0.259
estyles+0.258
Neg logit contributions
nas-0.205
yang-0.175
worthy-0.172
deal-0.171
uable-0.168
Token potential
Token ID2785
Feature activation+9.60e-02

Pos logit contributions
rils+3.018
atography+3.008
+2.974
estyles+2.957
+2.956
Neg logit contributions
nas-2.249
yang-1.899
deal-1.871
worthy-1.858
uable-1.816
Token judged
Token ID19589
Feature activation+4.77e-02

Pos logit contributions
nas+1.790
yang+1.482
worthy+1.469
deal+1.465
 redeem+1.411
Neg logit contributions
rils-2.507
atography-2.492
estyles-2.487
-2.478
-2.455
Token,
Token ID11
Feature activation-4.73e-01

Pos logit contributions
rils+0.283
atography+0.281
+0.277
estyles+0.276
+0.276
Neg logit contributions
nas-0.201
yang-0.169
worthy-0.167
deal-0.167
uable-0.161
Token
Token ID447
Feature activation+2.11e-01

Pos logit contributions
nas+2.088
yang+1.761
worthy+1.760
deal+1.733
 redeem+1.666
Neg logit contributions
rils-2.707
atography-2.703
estyles-2.677
-2.657
-2.628
Token
Token ID251
Feature activation+4.12e-01

Pos logit contributions
atography+0.668
rils+0.667
+0.647
estyles+0.647
axies+0.626
Neg logit contributions
nas-1.459
yang-1.327
worthy-1.313
deal-1.311
uable-1.291
Token said
Token ID531
Feature activation-1.13e-01

Pos logit contributions
rils+2.265
atography+2.256
+2.226
+2.216
estyles+2.211
Neg logit contributions
nas-1.897
yang-1.627
deal-1.599
worthy-1.594
uable-1.561
Token the
Token ID262
Feature activation+2.19e-01

Pos logit contributions
nas+0.514
yang+0.437
worthy+0.429
deal+0.427
uable+0.410
Neg logit contributions
rils-0.639
atography-0.637
estyles-0.627
-0.624
-0.623
Token Prophet
Token ID13583
Feature activation-1.02e+00

Pos logit contributions
rils+1.304
atography+1.299
+1.281
+1.278
estyles+1.277
Neg logit contributions
nas-0.918
yang-0.772
deal-0.758
worthy-0.757
uable-0.736
Token,
Token ID11
Feature activation+5.55e-01

Pos logit contributions
nas+4.178
yang+3.382
worthy+3.350
deal+3.280
 redeem+3.163
Neg logit contributions
rils-6.027
atography-5.982
estyles-5.960
-5.874
axies-5.804
Token
Token ID564
Feature activation+1.25e+00

Pos logit contributions
rils+3.277
atography+3.265
+3.226
+3.212
estyles+3.204
Neg logit contributions
nas-2.331
yang-1.968
deal-1.928
worthy-1.922
uable-1.879
Token
Token ID250
Feature activation+2.09e-01

Pos logit contributions
+3.644
rils+3.509
atography+3.390
+3.272
estyles+3.243
Neg logit contributions
nas-9.047
yang-8.226
deal-8.099
worthy-8.071
uable-8.041
Tokenwith
Token ID4480
Feature activation+3.25e-01

Pos logit contributions
rils+1.156
atography+1.153
estyles+1.132
+1.130
+1.130
Neg logit contributions
nas-0.967
yang-0.827
deal-0.819
worthy-0.818
uable-0.791
Token Fang
Token ID24468
Feature activation+6.46e-01

Pos logit contributions
rils+1.740
atography+1.734
+1.707
+1.704
estyles+1.700
Neg logit contributions
nas-1.501
yang-1.290
deal-1.268
worthy-1.268
uable-1.239
Tokenorn
Token ID1211
Feature activation+5.43e-01

Pos logit contributions
rils+3.540
atography+3.514
+3.473
estyles+3.446
+3.431
Neg logit contributions
nas-2.961
yang-2.541
deal-2.502
worthy-2.492
uable-2.445
Token Forest
Token ID9115
Feature activation+2.09e-01

Pos logit contributions
rils+2.931
atography+2.915
+2.879
+2.858
estyles+2.855
Neg logit contributions
nas-2.556
yang-2.199
deal-2.164
worthy-2.157
uable-2.115
Token,
Token ID11
Feature activation+1.27e-03

Pos logit contributions
rils+1.216
atography+1.213
+1.195
+1.194
estyles+1.191
Neg logit contributions
nas-0.899
yang-0.759
worthy-0.744
deal-0.744
uable-0.723
Token where
Token ID810
Feature activation+5.64e-01

Pos logit contributions
rils+0.007
atography+0.007
+0.007
estyles+0.007
+0.007
Neg logit contributions
nas-0.006
yang-0.005
worthy-0.005
deal-0.005
uable-0.005
Token they
Token ID484
Feature activation-2.77e-01

Pos logit contributions
rils+3.111
atography+3.095
+3.065
+3.032
estyles+3.024
Neg logit contributions
nas-2.575
yang-2.202
deal-2.170
worthy-2.163
uable-2.118
Token are
Token ID389
Feature activation+3.60e-01

Pos logit contributions
nas+1.251
yang+1.079
deal+1.059
worthy+1.040
uable+1.017
Neg logit contributions
rils-1.552
atography-1.544
-1.540
-1.522
estyles-1.517
Token befriend
Token ID43548
Feature activation+3.85e-01

Pos logit contributions
rils+2.111
atography+2.104
+2.077
+2.068
estyles+2.064
Neg logit contributions
nas-1.531
yang-1.294
deal-1.270
worthy-1.267
uable-1.236
Tokened
Token ID276
Feature activation-3.43e-01

Pos logit contributions
rils+1.911
atography+1.903
+1.875
+1.863
estyles+1.860
Neg logit contributions
nas-1.982
yang-1.729
deal-1.704
worthy-1.698
uable-1.667
Token by
Token ID416
Feature activation-4.05e-02

Pos logit contributions
nas+1.256
yang+1.029
worthy+1.007
deal+1.004
uable+0.973
Neg logit contributions
rils-2.213
atography-2.204
-2.174
-2.171
estyles-2.168
Token Tree
Token ID12200
Feature activation-1.29e+00

Pos logit contributions
nas+0.189
yang+0.162
worthy+0.159
deal+0.159
uable+0.155
Neg logit contributions
atography-0.221
rils-0.221
estyles-0.218
-0.217
-0.216
Token time
Token ID640
Feature activation+4.95e-03

Pos logit contributions
nas+2.040
yang+1.721
worthy+1.721
deal+1.719
uable+1.672
Neg logit contributions
rils-2.471
atography-2.448
estyles-2.427
-2.411
axies-2.401
Token in
Token ID287
Feature activation+7.11e-02

Pos logit contributions
rils+0.031
atography+0.031
+0.030
+0.030
estyles+0.030
Neg logit contributions
nas-0.020
yang-0.016
worthy-0.016
deal-0.016
uable-0.015
Token the
Token ID262
Feature activation+4.08e-01

Pos logit contributions
rils+0.388
atography+0.386
+0.380
+0.378
estyles+0.378
Neg logit contributions
nas-0.332
yang-0.284
deal-0.280
worthy-0.280
uable-0.273
Token fast
Token ID3049
Feature activation+2.88e-01

Pos logit contributions
rils+2.038
atography+2.030
+1.999
+1.990
estyles+1.985
Neg logit contributions
nas-2.089
yang-1.822
deal-1.793
worthy-1.789
uable-1.755
Tokened
Token ID276
Feature activation-1.09e+00

Pos logit contributions
rils+1.454
atography+1.447
+1.423
+1.420
estyles+1.416
Neg logit contributions
nas-1.466
yang-1.274
deal-1.256
worthy-1.255
uable-1.227
Token state
Token ID1181
Feature activation-2.70e-02

Pos logit contributions
nas+4.277
deal+3.497
worthy+3.494
yang+3.482
uable+3.353
Neg logit contributions
rils-6.564
atography-6.488
-6.382
axies-6.381
estyles-6.358
Token is
Token ID318
Feature activation-1.95e-01

Pos logit contributions
nas+0.135
yang+0.116
deal+0.115
worthy+0.115
uable+0.112
Neg logit contributions
rils-0.141
atography-0.139
-0.137
-0.137
estyles-0.137
Token actually
Token ID1682
Feature activation-6.87e-01

Pos logit contributions
nas+0.914
yang+0.778
deal+0.767
worthy+0.763
uable+0.751
Neg logit contributions
rils-1.066
atography-1.056
estyles-1.044
-1.043
-1.039
Token a
Token ID257
Feature activation-1.91e-01

Pos logit contributions
nas+2.919
yang+2.454
deal+2.386
worthy+2.374
uable+2.351
Neg logit contributions
rils-3.992
atography-3.938
estyles-3.920
-3.899
-3.864
Token form
Token ID1296
Feature activation-3.05e-01

Pos logit contributions
nas+0.866
yang+0.742
deal+0.736
worthy+0.726
uable+0.708
Neg logit contributions
rils-1.077
atography-1.058
estyles-1.048
-1.042
-1.038
Token of
Token ID286
Feature activation-1.18e-01

Pos logit contributions
nas+1.221
deal+1.012
yang+1.004
worthy+0.998
uable+0.961
Neg logit contributions
rils-1.869
atography-1.868
-1.857
estyles-1.845
-1.833
Token up
Token ID510
Feature activation-6.53e-01

Pos logit contributions
rils+0.585
atography+0.582
+0.574
+0.573
estyles+0.571
Neg logit contributions
nas-0.463
yang-0.394
deal-0.389
worthy-0.387
uable-0.378
Token the
Token ID262
Feature activation-1.91e-01

Pos logit contributions
nas+2.830
worthy+2.371
yang+2.355
deal+2.353
uable+2.287
Neg logit contributions
rils-3.718
atography-3.709
-3.677
-3.645
estyles-3.638
Token most
Token ID749
Feature activation+1.41e-01

Pos logit contributions
nas+0.878
yang+0.747
deal+0.739
worthy+0.735
uable+0.722
Neg logit contributions
rils-1.053
atography-1.046
-1.035
estyles-1.033
-1.033
Token g
Token ID308
Feature activation+1.09e+00

Pos logit contributions
rils+0.863
atography+0.860
+0.849
+0.846
estyles+0.845
Neg logit contributions
nas-0.563
yang-0.471
deal-0.461
worthy-0.460
uable-0.448
Token en
Token ID551
Feature activation-1.88e+00

Pos logit contributions
atography+6.034
rils+6.017
+5.959
estyles+5.877
+5.858
Neg logit contributions
nas-4.764
yang-4.107
deal-4.061
worthy-3.955
xus-3.890
Tokeneral
Token ID1691
Feature activation-6.00e-01

Pos logit contributions
nas+9.011
worthy+7.859
uable+7.823
yang+7.666
deal+7.640
Neg logit contributions
rils-8.830
atography-8.774
axies-8.563
-8.492
-8.490
Token
Token ID198
Feature activation-8.36e-01

Pos logit contributions
nas+2.733
worthy+2.314
deal+2.302
yang+2.300
uable+2.266
Neg logit contributions
rils-3.295
atography-3.259
-3.241
-3.238
estyles-3.221
Token
Token ID198
Feature activation-4.10e-01

Pos logit contributions
nas+3.753
deal+3.149
yang+3.134
worthy+3.129
uable+2.898
Neg logit contributions
-4.761
estyles-4.750
rils-4.731
atography-4.711
-4.647
Token
Token ID1906
Feature activation+4.26e-01

Pos logit contributions
nas+1.914
yang+1.622
deal+1.616
worthy+1.604
uable+1.545
Neg logit contributions
rils-2.250
atography-2.243
-2.208
estyles-2.202
-2.201
Token
Token ID198
Feature activation-1.66e-01

Pos logit contributions
rils+2.255
atography+2.247
+2.214
+2.204
estyles+2.200
Neg logit contributions
nas-2.053
yang-1.773
deal-1.744
worthy-1.740
uable-1.704
Token
Token ID198
Feature activation+1.69e-01

Pos logit contributions
nas+0.819
yang+0.706
deal+0.700
worthy+0.697
uable+0.670
Neg logit contributions
rils-0.871
atography-0.867
estyles-0.860
-0.859
-0.854
Token's
Token ID338
Feature activation+1.54e-01

Pos logit contributions
nas+0.422
yang+0.366
worthy+0.357
deal+0.356
uable+0.352
Neg logit contributions
rils-0.448
atography-0.445
-0.440
-0.440
estyles-0.439
Token also
Token ID635
Feature activation-3.42e-01

Pos logit contributions
rils+0.869
atography+0.864
estyles+0.852
+0.850
+0.849
Neg logit contributions
nas-0.699
yang-0.594
deal-0.585
worthy-0.584
uable-0.571
Token attracted
Token ID12725
Feature activation+9.01e-01

Pos logit contributions
nas+1.458
yang+1.213
worthy+1.203
deal+1.193
uable+1.170
Neg logit contributions
rils-2.009
atography-1.992
estyles-1.987
-1.969
-1.958
Token interest
Token ID1393
Feature activation+7.62e-01

Pos logit contributions
rils+4.923
atography+4.891
+4.819
+4.804
estyles+4.751
Neg logit contributions
nas-4.121
yang-3.541
worthy-3.463
deal-3.458
xus-3.362
Token from
Token ID422
Feature activation+1.69e-01

Pos logit contributions
rils+4.202
atography+4.173
+4.169
+4.126
estyles+4.079
Neg logit contributions
nas-3.395
yang-2.921
deal-2.833
worthy-2.783
xus-2.770
Token other
Token ID584
Feature activation-1.42e-01

Pos logit contributions
rils+0.922
atography+0.918
estyles+0.903
+0.900
+0.897
Neg logit contributions
nas-0.798
yang-0.685
worthy-0.679
deal-0.675
uable-0.660
Token NFL
Token ID5134
Feature activation+3.34e-01

Pos logit contributions
nas+0.764
worthy+0.667
deal+0.662
yang+0.659
uable+0.643
Neg logit contributions
atography-0.676
rils-0.673
estyles-0.661
-0.658
-0.653
Token teams
Token ID3466
Feature activation+4.36e-01

Pos logit contributions
atography+1.283
rils+1.280
estyles+1.248
+1.246
+1.241
Neg logit contributions
nas-2.110
yang-1.885
worthy-1.877
deal-1.873
uable-1.839
Token and
Token ID290
Feature activation-1.51e-01

Pos logit contributions
rils+2.506
atography+2.497
+2.463
+2.454
estyles+2.450
Neg logit contributions
nas-1.904
yang-1.617
deal-1.588
worthy-1.584
uable-1.547
Token in
Token ID287
Feature activation-2.87e-01

Pos logit contributions
nas+0.722
yang+0.622
worthy+0.619
deal+0.614
uable+0.602
Neg logit contributions
atography-0.805
rils-0.804
estyles-0.790
-0.784
-0.781
Token the
Token ID262
Feature activation+5.35e-01

Pos logit contributions
nas+1.260
yang+1.069
worthy+1.059
deal+1.054
uable+1.036
Neg logit contributions
rils-1.637
atography-1.635
estyles-1.613
-1.601
-1.589
Token
Token ID198
Feature activation-7.45e-03

Pos logit contributions
nas+0.240
yang+0.209
deal+0.206
worthy+0.204
uable+0.199
Neg logit contributions
rils-0.241
estyles-0.237
atography-0.237
-0.236
-0.235
Token
Token ID198
Feature activation+1.82e-01

Pos logit contributions
nas+0.038
yang+0.033
deal+0.032
worthy+0.032
uable+0.031
Neg logit contributions
rils-0.038
atography-0.038
-0.037
-0.037
estyles-0.037
TokenThe
Token ID464
Feature activation+2.15e-02

Pos logit contributions
rils+0.907
atography+0.903
+0.888
+0.887
estyles+0.887
Neg logit contributions
nas-0.942
yang-0.821
deal-0.812
worthy-0.807
uable-0.787
Token video
Token ID2008
Feature activation+2.55e-01

Pos logit contributions
rils+0.122
atography+0.121
+0.120
estyles+0.119
+0.119
Neg logit contributions
nas-0.096
yang-0.082
worthy-0.081
deal-0.081
uable-0.079
Token showed
Token ID3751
Feature activation+3.86e-01

Pos logit contributions
rils+1.324
atography+1.317
+1.298
+1.294
estyles+1.292
Neg logit contributions
nas-1.259
yang-1.090
worthy-1.073
deal-1.073
uable-1.048
Token Martinez
Token ID20741
Feature activation-1.41e+00

Pos logit contributions
rils+2.103
atography+2.099
+2.065
+2.050
estyles+2.048
Neg logit contributions
nas-1.799
yang-1.546
deal-1.520
worthy-1.517
uable-1.486
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
nas+5.929
 insulting+5.122
deal+4.903
yang+4.894
 redeem+4.788
Neg logit contributions
estyles-8.022
rils-7.908
-7.900
atography-7.813
-7.780
Token who
Token ID508
Feature activation+4.46e-01

Pos logit contributions
rils+0.073
atography+0.072
estyles+0.072
+0.072
+0.072
Neg logit contributions
nas-0.061
yang-0.052
worthy-0.051
deal-0.051
uable-0.050
Token was
Token ID373
Feature activation+3.29e-01

Pos logit contributions
rils+2.192
atography+2.180
+2.147
+2.136
estyles+2.134
Neg logit contributions
nas-2.320
yang-2.027
deal-1.994
worthy-1.991
uable-1.952
Token handcuffed
Token ID41529
Feature activation-3.83e-01

Pos logit contributions
rils+1.880
atography+1.872
+1.846
+1.840
estyles+1.837
Neg logit contributions
nas-1.445
yang-1.228
deal-1.206
worthy-1.203
uable-1.174
Token,
Token ID11
Feature activation-4.18e-02

Pos logit contributions
nas+1.557
yang+1.278
worthy+1.277
deal+1.244
uable+1.193
Neg logit contributions
atography-2.325
rils-2.321
estyles-2.300
-2.289
-2.274
Token addition
Token ID3090
Feature activation-6.53e-01

Pos logit contributions
nas+2.657
deal+2.228
yang+2.225
worthy+2.220
xus+2.129
Neg logit contributions
rils-3.411
atography-3.402
estyles-3.381
-3.353
-3.341
Token,
Token ID11
Feature activation-4.61e-01

Pos logit contributions
nas+2.708
yang+2.276
worthy+2.245
deal+2.242
uable+2.174
Neg logit contributions
rils-3.860
atography-3.850
-3.793
estyles-3.777
-3.776
Token the
Token ID262
Feature activation-2.71e-01

Pos logit contributions
nas+2.060
worthy+1.759
deal+1.753
yang+1.747
uable+1.672
Neg logit contributions
rils-2.578
atography-2.566
estyles-2.550
-2.523
-2.522
Token age
Token ID2479
Feature activation-7.46e-01

Pos logit contributions
nas+1.150
worthy+0.970
deal+0.966
yang+0.953
uable+0.929
Neg logit contributions
rils-1.581
atography-1.571
estyles-1.562
-1.552
-1.548
Token of
Token ID286
Feature activation-1.66e-01

Pos logit contributions
nas+3.096
worthy+2.650
yang+2.636
deal+2.559
uable+2.460
Neg logit contributions
rils-4.354
atography-4.309
estyles-4.271
-4.257
axies-4.239
Token children
Token ID1751
Feature activation-1.73e+00

Pos logit contributions
nas+0.728
worthy+0.620
deal+0.614
yang+0.613
uable+0.593
Neg logit contributions
atography-0.939
rils-0.936
estyles-0.934
-0.925
-0.921
Token depicted
Token ID18904
Feature activation-2.79e-01

Pos logit contributions
nas+6.919
worthy+5.843
deal+5.801
 insulting+5.665
yang+5.658
Neg logit contributions
estyles-9.764
atography-9.627
-9.566
rils-9.420
-9.390
Token through
Token ID832
Feature activation+1.14e-01

Pos logit contributions
nas+1.126
yang+0.927
worthy+0.927
deal+0.919
uable+0.892
Neg logit contributions
rils-1.701
atography-1.698
-1.675
-1.675
estyles-1.670
Token content
Token ID2695
Feature activation+2.32e-01

Pos logit contributions
rils+0.621
atography+0.619
+0.609
estyles+0.608
+0.608
Neg logit contributions
nas-0.535
yang-0.459
deal-0.453
worthy-0.453
uable-0.441
Token on
Token ID319
Feature activation-2.48e-01

Pos logit contributions
atography+1.235
rils+1.235
+1.213
estyles+1.212
+1.209
Neg logit contributions
nas-1.117
yang-0.957
worthy-0.952
deal-0.949
uable-0.925
Token the
Token ID262
Feature activation-9.02e-02

Pos logit contributions
nas+1.136
worthy+0.964
deal+0.960
yang+0.960
uable+0.928
Neg logit contributions
rils-1.363
atography-1.361
estyles-1.339
-1.334
-1.333
Token her
Token ID607
Feature activation-2.62e-01

Pos logit contributions
nas+0.617
yang+0.512
worthy+0.507
deal+0.502
uable+0.485
Neg logit contributions
rils-0.932
atography-0.932
-0.921
estyles-0.917
-0.915
Token 6
Token ID718
Feature activation-7.13e-01

Pos logit contributions
nas+1.126
yang+0.941
worthy+0.935
deal+0.924
uable+0.892
Neg logit contributions
rils-1.527
atography-1.520
estyles-1.511
-1.507
-1.493
Token-
Token ID12
Feature activation-1.07e+00

Pos logit contributions
nas+2.531
yang+2.058
worthy+2.018
deal+2.006
uable+1.958
Neg logit contributions
rils-4.654
atography-4.644
estyles-4.569
-4.568
-4.550
Tokenmonth
Token ID8424
Feature activation-6.43e-01

Pos logit contributions
nas+3.978
yang+3.283
deal+3.255
worthy+3.201
uable+3.054
Neg logit contributions
atography-6.772
rils-6.720
estyles-6.647
-6.635
-6.600
Token-
Token ID12
Feature activation-1.48e+00

Pos logit contributions
nas+2.002
deal+1.566
yang+1.565
worthy+1.565
uable+1.473
Neg logit contributions
rils-4.493
atography-4.468
-4.438
estyles-4.428
-4.355
Tokenold
Token ID727
Feature activation-2.05e+00

Pos logit contributions
nas+4.732
deal+3.890
worthy+3.663
yang+3.621
xus+3.100
Neg logit contributions
-10.474
atography-10.151
estyles-10.134
rils-9.919
 cues-9.843
Token,
Token ID11
Feature activation-4.07e-01

Pos logit contributions
nas+7.669
yang+6.238
 redeem+6.225
worthy+6.171
deal+6.063
Neg logit contributions
atography-11.927
rils-11.837
-11.812
estyles-11.793
axies-11.507
Token Conan
Token ID31634
Feature activation-8.50e-01

Pos logit contributions
nas+1.966
yang+1.700
worthy+1.661
deal+1.649
uable+1.617
Neg logit contributions
rils-2.161
atography-2.151
estyles-2.119
-2.108
-2.106
Token,
Token ID11
Feature activation-2.94e-01

Pos logit contributions
nas+3.639
yang+3.049
worthy+3.026
deal+2.955
uable+2.899
Neg logit contributions
atography-4.907
rils-4.891
-4.852
estyles-4.828
axies-4.777
Token and
Token ID290
Feature activation-7.45e-01

Pos logit contributions
nas+1.203
yang+0.999
worthy+0.978
deal+0.976
xus+0.929
Neg logit contributions
atography-1.780
rils-1.777
estyles-1.768
-1.760
-1.746
Token dog
Token ID3290
Feature activation-4.85e-01

Pos logit contributions
nas+3.114
yang+2.650
deal+2.583
worthy+2.581
 redeem+2.505
Neg logit contributions
atography-4.354
rils-4.336
-4.316
estyles-4.314
axies-4.246
Token.
Token ID13
Feature activation-6.32e-01

Pos logit contributions
rils+6.804
atography+6.745
+6.742
estyles+6.677
+6.637
Neg logit contributions
nas-5.797
yang-5.120
deal-4.939
uable-4.911
worthy-4.835
Token During
Token ID5856
Feature activation+2.40e-01

Pos logit contributions
nas+3.104
deal+2.679
yang+2.676
worthy+2.640
uable+2.511
Neg logit contributions
rils-3.291
estyles-3.269
atography-3.264
axies-3.190
-3.158
Token the
Token ID262
Feature activation-1.01e-01

Pos logit contributions
rils+1.367
atography+1.362
+1.343
+1.338
estyles+1.336
Neg logit contributions
nas-1.058
yang-0.900
deal-0.884
worthy-0.882
uable-0.861
Token
Token ID564
Feature activation+6.32e-01

Pos logit contributions
nas+0.470
yang+0.398
deal+0.396
worthy+0.389
uable+0.382
Neg logit contributions
rils-0.563
atography-0.559
estyles-0.554
-0.551
-0.548
Token
Token ID246
Feature activation-7.17e-01

Pos logit contributions
rils+1.608
atography+1.594
+1.549
+1.533
estyles+1.526
Neg logit contributions
nas-4.778
yang-4.363
deal-4.320
worthy-4.315
uable-4.260
Tokencart
Token ID26674
Feature activation-2.38e+00

Pos logit contributions
nas+3.437
deal+2.945
yang+2.942
worthy+2.888
uable+2.765
Neg logit contributions
atography-3.908
rils-3.813
estyles-3.799
-3.768
axies-3.750
Tokenoon
Token ID2049
Feature activation-1.02e+00

Pos logit contributions
nas+11.554
deal+10.304
yang+10.280
uable+9.834
xus+9.755
Neg logit contributions
axies-10.123
-10.017
estyles-9.851
atography-9.807
-9.744
Token crisis
Token ID4902
Feature activation-8.67e-01

Pos logit contributions
nas+4.882
deal+4.187
worthy+4.158
yang+4.112
uable+3.961
Neg logit contributions
atography-5.303
rils-5.283
axies-5.175
estyles-5.126
-5.061
Token
Token ID447
Feature activation-4.66e-01

Pos logit contributions
nas+3.738
deal+3.109
worthy+3.100
yang+3.028
uable+2.909
Neg logit contributions
atography-5.014
rils-4.971
-4.840
estyles-4.838
axies-4.795
Token
Token ID247
Feature activation-4.65e-01

Pos logit contributions
nas+2.700
yang+2.424
deal+2.391
uable+2.372
worthy+2.360
Neg logit contributions
atography-1.997
estyles-1.940
-1.934
rils-1.898
axies-1.847
Token
Token ID851
Feature activation+3.36e-01

Pos logit contributions
nas+1.938
yang+1.608
worthy+1.601
deal+1.595
uable+1.508
Neg logit contributions
atography-2.800
rils-2.776
-2.739
estyles-2.718
axies-2.700
Token TO
Token ID5390
Feature activation+6.70e-01

Pos logit contributions
atography+5.478
rils+5.452
+5.397
+5.375
estyles+5.310
Neg logit contributions
nas-5.111
yang-4.460
deal-4.374
worthy-4.355
uable-4.326
Token THE
Token ID3336
Feature activation+4.05e-02

Pos logit contributions
rils+3.369
atography+3.358
+3.321
+3.303
estyles+3.277
Neg logit contributions
nas-3.375
yang-2.934
worthy-2.886
deal-2.885
uable-2.844
Token W
Token ID370
Feature activation+2.96e-01

Pos logit contributions
rils+0.219
atography+0.218
estyles+0.215
+0.213
+0.212
Neg logit contributions
nas-0.192
yang-0.165
deal-0.163
worthy-0.162
uable-0.156
TokenATER
Token ID23261
Feature activation-2.65e-02

Pos logit contributions
rils+1.136
atography+1.127
+1.105
+1.101
estyles+1.096
Neg logit contributions
nas-1.857
yang-1.663
deal-1.643
worthy-1.640
uable-1.614
TokenVILLE
Token ID38526
Feature activation-5.28e-01

Pos logit contributions
nas+0.125
yang+0.108
deal+0.107
worthy+0.107
uable+0.101
Neg logit contributions
rils-0.143
atography-0.143
estyles-0.140
-0.139
-0.139
Token TA
Token ID21664
Feature activation-1.81e+00

Pos logit contributions
nas+2.401
deal+2.031
yang+2.006
worthy+1.991
uable+1.891
Neg logit contributions
rils-2.935
atography-2.930
estyles-2.899
axies-2.856
-2.840
TokenVER
Token ID5959
Feature activation-5.59e-01

Pos logit contributions
nas+5.534
deal+4.555
yang+4.353
worthy+4.262
 redeem+4.232
Neg logit contributions
rils-11.757
estyles-11.685
atography-11.646
axies-11.451
-11.083
TokenN
Token ID45
Feature activation-6.08e-01

Pos logit contributions
nas+2.001
yang+1.627
deal+1.614
worthy+1.562
uable+1.479
Neg logit contributions
atography-3.625
rils-3.594
estyles-3.578
axies-3.529
-3.525
Token,
Token ID11
Feature activation-1.50e-01

Pos logit contributions
nas+2.836
deal+2.445
yang+2.390
worthy+2.383
uable+2.250
Neg logit contributions
atography-3.283
estyles-3.230
rils-3.219
axies-3.161
-3.111
Token as
Token ID355
Feature activation+5.41e-01

Pos logit contributions
nas+0.699
yang+0.601
deal+0.595
worthy+0.588
uable+0.572
Neg logit contributions
rils-0.816
atography-0.814
estyles-0.803
-0.802
-0.801
Token it
Token ID340
Feature activation+2.63e-02

Pos logit contributions
rils+2.969
atography+2.959
+2.920
+2.897
estyles+2.887
Neg logit contributions
nas-2.486
yang-2.135
deal-2.095
worthy-2.092
uable-2.050