TOP ACTIVATIONS
MAX = 14.824

, but she seems to go in and out pretty well
was never the type to go out to parties , would
MD T has become more widely available . The organization emphasizes
Award , named after the widely admired 10 th Director of
of the most recogn isable names in betting with a particular
uk ov was not known widely , but the victory of
to be perceived as such socially ( including the use of
the site but has since gone viral on the internet .
they are not too obvious ( as opposed to back in
" because of his dimin utive height . "
Throughout all of this , Twitter has been indispensable . When
, it 's the nearly universally ad ored Burger Guys .
employment , and ostr ac ism from family and social networks
of girls . Generally , the Mong ols (
king , and then they go crazy ." Mus
dis appointed to only be trending No 7 in USA
identified as being the worst globally by international policing body Inter
with artwork from frequent Dinosaur Jr . collaborator Maur a Jasper
still comes with extreme social stigma and risk for violence .
nude photos and video from publicly accessible sites on its popular

MIN ACTIVATIONS
MIN = -17.205

undo _ move ( & mut self ) { let prev
the presidency . They are blind to the fact their uncon
n ou vel le organisation ind é pend ante , le
presidential nominee Ted Cruz . Imagine Lindsey Graham mixing mint j
isn t the end of their quest for gains
also clearly feel swapping one fossil fuel for another is bad
and meetings to hear " exper ts " speak about the
galaxy ( 100 , 000 Light Years ) to
р а д е н ы с
: the costs of investments un made , of businesses not
2 . im read ( image Path ) image = im
2 , 000 , 000 Light Years ) to
е д е л ь
е к с а н д р а
р а д е л е н и
apply _ move ( & mut self ) { let prev
asked MIT s Dar on Ace m oglu .
Enough (# а д о е л ) protest
de d és acc ord ent re mem b res ,
а д а к ,

INTERVAL 6.817 - 14.824
CONTAINS 1.621%

to be more action from politicians globally , working together ,
the thought of having to call a department store to see
science fiction and fantasy programming anywhere on television . Part of
8 . Josh Strauss ( 103 ) ( 22
morph ologically distinct and should be regarded as separate species ,

INTERVAL -1.190 - 6.817
CONTAINS 63.256%

find an answer to this question . D ani
two super - we ird black - border cards : Block
who had recently recovered from a heart problem and received treatment
carefully omitted government subsidies ( wise ly , given the imminent
The Jersey City Museum was founded in 1901 as part

INTERVAL -9.197 - -1.190
CONTAINS 34.809%

. And for Israel to put itself in his
the plot may seem somewhat shallow at times , but they
protests in Wednesday night 's episode , and they 're using
, and we have outlined a responsible approach for the future
Press the button ! ( 9 )

INTERVAL -17.205 - -9.197
CONTAINS 0.314%

< T > { type Target = T ; ... }
July 4 th ended in arrest and ign om
Pitch fork , music critic Adam Ward reminis ces about digital
ography , Edwards Air Force Base , http ://
Wednesday steps they ve taken to stop the spread
Token,
Token ID11
Feature activation+5.56e+00

Pos logit contributions
 notor+3.217
KY+3.133
[/+3.077
 tattoo+2.895
 awa+2.848
Neg logit contributions
selves-3.899
rique-3.577
cedes-2.972
actly-2.900
productive-2.818
Token but
Token ID475
Feature activation+4.23e+00

Pos logit contributions
 notor+4.007
KY+3.949
[/+3.922
 tattoo+3.723
etic+3.599
Neg logit contributions
selves-4.881
rique-4.497
cedes-3.682
actly-3.578
productive-3.509
Token she
Token ID673
Feature activation+4.59e+00

Pos logit contributions
 notor+3.122
KY+3.058
[/+3.024
 tattoo+2.844
etic+2.789
Neg logit contributions
selves-3.670
rique-3.368
cedes-2.786
actly-2.713
productive-2.643
Token seems
Token ID2331
Feature activation+6.81e+00

Pos logit contributions
 notor+3.081
[/+3.034
KY+3.029
 tattoo+2.810
etic+2.753
Neg logit contributions
selves-4.212
rique-3.914
cedes-3.256
actly-3.181
productive-3.126
Token to
Token ID284
Feature activation+7.02e+00

Pos logit contributions
 notor+4.709
[/+4.629
KY+4.574
 tattoo+4.361
 awa+4.210
Neg logit contributions
selves-6.025
rique-5.623
cedes-4.692
actly-4.540
productive-4.425
Token go
Token ID467
Feature activation+1.48e+01

Pos logit contributions
[/+4.685
 notor+4.660
KY+4.609
 tattoo+4.377
 popularity+4.186
Neg logit contributions
selves-6.330
rique-5.959
cedes-5.005
actly-4.825
productive-4.780
Token in
Token ID287
Feature activation+6.36e+00

Pos logit contributions
 notor+8.604
[/+8.586
KY+8.433
 tattoo+8.313
 amongst+7.969
Neg logit contributions
selves-13.179
rique-12.580
cedes-10.678
actly-10.187
productive-10.164
Token and
Token ID290
Feature activation+2.44e+00

Pos logit contributions
 notor+4.422
KY+4.305
[/+4.284
 tattoo+4.039
 awa+3.935
Neg logit contributions
selves-5.744
rique-5.279
cedes-4.397
actly-4.283
productive-4.180
Token out
Token ID503
Feature activation-8.12e-02

Pos logit contributions
 notor+1.585
KY+1.539
[/+1.497
 tattoo+1.387
 awa+1.377
Neg logit contributions
selves-2.360
rique-2.167
cedes-1.835
actly-1.800
productive-1.749
Token pretty
Token ID2495
Feature activation+3.47e+00

Pos logit contributions
selves+0.069
rique+0.062
cedes+0.051
actly+0.050
productive+0.049
Neg logit contributions
 notor-0.063
KY-0.061
[/-0.059
 tattoo-0.056
 awa-0.056
Token well
Token ID880
Feature activation+4.43e+00

Pos logit contributions
 notor+2.673
KY+2.595
[/+2.579
 tattoo+2.431
 awa+2.388
Neg logit contributions
selves-2.904
rique-2.664
cedes-2.191
actly-2.130
productive-2.064
Token was
Token ID373
Feature activation+8.74e+00

Pos logit contributions
[/+5.485
 notor+5.442
KY+5.424
 tattoo+5.287
etic+4.973
Neg logit contributions
selves-6.685
rique-6.176
cedes-5.130
actly-4.980
productive-4.873
Token never
Token ID1239
Feature activation+9.18e+00

Pos logit contributions
 notor+5.950
[/+5.854
KY+5.739
 tattoo+5.610
 awa+5.337
Neg logit contributions
selves-7.757
rique-7.163
cedes-6.007
actly-5.826
productive-5.645
Token the
Token ID262
Feature activation+2.90e+00

Pos logit contributions
 notor+6.114
[/+6.038
KY+5.971
 tattoo+5.835
 amongst+5.573
Neg logit contributions
selves-8.095
rique-7.454
cedes-6.313
actly-6.197
productive-5.973
Token type
Token ID2099
Feature activation+4.13e+00

Pos logit contributions
 notor+2.175
KY+2.123
[/+2.085
 tattoo+1.971
etic+1.936
Neg logit contributions
selves-2.502
rique-2.289
cedes-1.893
actly-1.845
productive-1.790
Token to
Token ID284
Feature activation+6.66e+00

Pos logit contributions
[/+2.972
 notor+2.910
KY+2.897
 tattoo+2.726
 amongst+2.638
Neg logit contributions
selves-3.626
rique-3.349
cedes-2.832
actly-2.698
productive-2.590
Token go
Token ID467
Feature activation+1.45e+01

Pos logit contributions
 notor+4.217
[/+4.203
KY+4.189
 tattoo+3.986
 awa+3.800
Neg logit contributions
selves-6.235
rique-5.872
cedes-4.966
actly-4.831
productive-4.740
Token out
Token ID503
Feature activation+2.64e+00

Pos logit contributions
[/+8.328
 notor+8.277
 tattoo+8.156
KY+8.031
 awa+7.870
Neg logit contributions
selves-12.840
rique-12.512
cedes-10.594
actly-10.219
productive-9.997
Token to
Token ID284
Feature activation+3.48e+00

Pos logit contributions
 notor+1.801
KY+1.754
[/+1.718
 tattoo+1.607
 awa+1.581
Neg logit contributions
selves-2.462
rique-2.266
cedes-1.908
actly-1.865
productive-1.814
Token parties
Token ID4671
Feature activation+5.91e+00

Pos logit contributions
 notor+2.432
KY+2.382
[/+2.334
 tattoo+2.206
 awa+2.143
Neg logit contributions
selves-3.161
rique-2.916
cedes-2.454
actly-2.382
productive-2.322
Token,
Token ID11
Feature activation+2.55e+00

Pos logit contributions
 notor+3.694
[/+3.679
KY+3.630
 tattoo+3.396
 awa+3.278
Neg logit contributions
selves-5.622
rique-5.258
cedes-4.472
actly-4.357
productive-4.234
Token would
Token ID561
Feature activation+6.88e+00

Pos logit contributions
 notor+1.873
KY+1.829
[/+1.796
 tattoo+1.692
 awa+1.661
Neg logit contributions
selves-2.239
rique-2.052
cedes-1.705
actly-1.662
productive-1.615
Token MD
Token ID10670
Feature activation+3.11e+00

Pos logit contributions
 notor+2.812
KY+2.743
[/+2.691
 tattoo+2.526
etic+2.488
Neg logit contributions
selves-3.473
rique-3.182
cedes-2.654
actly-2.591
productive-2.516
TokenT
Token ID51
Feature activation+5.87e+00

Pos logit contributions
 notor+2.140
KY+2.049
[/+2.009
 awa+1.885
 tattoo+1.871
Neg logit contributions
selves-2.897
rique-2.667
cedes-2.240
actly-2.197
productive-2.143
Token has
Token ID468
Feature activation+3.59e+00

Pos logit contributions
 notor+4.064
KY+3.960
[/+3.935
 tattoo+3.668
etic+3.610
Neg logit contributions
selves-5.358
rique-4.893
cedes-4.126
actly-4.045
productive-3.926
Token become
Token ID1716
Feature activation+7.02e+00

Pos logit contributions
 notor+2.725
KY+2.636
[/+2.617
 tattoo+2.477
 popularity+2.426
Neg logit contributions
selves-3.055
rique-2.786
cedes-2.306
actly-2.256
productive-2.179
Token more
Token ID517
Feature activation+4.61e+00

Pos logit contributions
 notor+4.293
[/+4.057
KY+4.027
 tattoo+3.825
 popularity+3.728
Neg logit contributions
selves-6.936
rique-6.444
cedes-5.505
actly-5.433
productive-5.211
Token widely
Token ID6768
Feature activation+1.44e+01

Pos logit contributions
 notor+2.257
KY+2.161
[/+2.108
 tattoo+1.922
etic+1.874
Neg logit contributions
selves-5.187
rique-4.844
cedes-4.212
actly-4.143
productive-4.041
Token available
Token ID1695
Feature activation+7.07e+00

Pos logit contributions
 notor+6.470
[/+6.231
 tattoo+6.123
 popularity+5.995
KY+5.809
Neg logit contributions
selves-15.210
rique-14.405
actly-12.725
cedes-12.716
productive-11.701
Token.
Token ID13
Feature activation+1.70e+00

Pos logit contributions
 notor+4.112
[/+4.084
KY+4.007
 tattoo+3.684
etic+3.662
Neg logit contributions
selves-7.021
rique-6.592
cedes-5.627
actly-5.569
productive-5.367
Token The
Token ID383
Feature activation+2.05e+00

Pos logit contributions
 notor+1.333
KY+1.299
[/+1.273
 tattoo+1.202
 awa+1.187
Neg logit contributions
selves-1.420
rique-1.292
cedes-1.060
actly-1.033
productive-1.001
Token organization
Token ID4009
Feature activation+2.85e+00

Pos logit contributions
 notor+1.420
KY+1.383
[/+1.351
 tattoo+1.262
etic+1.247
Neg logit contributions
selves-1.885
rique-1.732
cedes-1.450
actly-1.418
productive-1.381
Token emphasizes
Token ID31648
Feature activation-4.67e-01

Pos logit contributions
 notor+2.032
KY+1.981
[/+1.934
 tattoo+1.813
 awa+1.790
Neg logit contributions
selves-2.575
rique-2.360
cedes-1.973
actly-1.924
productive-1.874
Token Award
Token ID11289
Feature activation+2.62e+00

Pos logit contributions
 notor+1.689
KY+1.634
[/+1.625
 tattoo+1.510
 popularity+1.470
Neg logit contributions
selves-3.233
rique-2.964
cedes-2.576
actly-2.529
productive-2.454
Token,
Token ID11
Feature activation+1.26e+00

Pos logit contributions
 notor+1.788
KY+1.748
[/+1.730
 tattoo+1.608
etic+1.589
Neg logit contributions
selves-2.440
rique-2.214
cedes-1.859
actly-1.836
productive-1.779
Token named
Token ID3706
Feature activation+5.12e+00

Pos logit contributions
 notor+0.895
KY+0.872
[/+0.855
 tattoo+0.802
 awa+0.790
Neg logit contributions
selves-1.133
rique-1.039
cedes-0.868
actly-0.849
productive-0.824
Token after
Token ID706
Feature activation-2.40e+00

Pos logit contributions
 notor+3.485
KY+3.361
[/+3.355
 tattoo+3.126
 amongst+3.087
Neg logit contributions
selves-4.684
rique-4.308
cedes-3.603
actly-3.594
productive-3.471
Token the
Token ID262
Feature activation+4.46e-01

Pos logit contributions
selves+1.966
rique+1.801
cedes+1.449
actly+1.409
productive+1.376
Neg logit contributions
 notor-1.888
KY-1.855
[/-1.814
 awa-1.698
etic-1.694
Token widely
Token ID6768
Feature activation+1.44e+01

Pos logit contributions
 notor+0.334
KY+0.327
[/+0.319
 tattoo+0.300
 awa+0.298
Neg logit contributions
selves-0.385
rique-0.352
cedes-0.291
actly-0.284
productive-0.275
Token admired
Token ID29382
Feature activation+7.38e+00

Pos logit contributions
 notor+8.171
 tattoo+7.973
[/+7.708
 popularity+7.537
KY+7.477
Neg logit contributions
selves-13.277
rique-12.740
actly-11.029
cedes-10.976
productive-9.935
Token 10
Token ID838
Feature activation+2.98e+00

Pos logit contributions
 notor+5.392
KY+5.189
[/+5.180
 tattoo+5.052
 popularity+4.889
Neg logit contributions
selves-6.248
rique-5.719
cedes-4.815
actly-4.770
productive-4.534
Tokenth
Token ID400
Feature activation+4.50e+00

Pos logit contributions
 notor+2.280
KY+2.231
[/+2.193
 tattoo+2.078
etic+2.037
Neg logit contributions
selves-2.511
rique-2.294
cedes-1.900
actly-1.856
productive-1.790
Token Director
Token ID5890
Feature activation+1.89e+00

Pos logit contributions
 notor+3.381
KY+3.291
[/+3.258
 tattoo+3.089
etic+3.014
Neg logit contributions
selves-3.846
rique-3.499
cedes-2.909
actly-2.845
productive-2.737
Token of
Token ID286
Feature activation-3.72e+00

Pos logit contributions
 notor+1.421
KY+1.387
[/+1.367
 tattoo+1.286
etic+1.270
Neg logit contributions
selves-1.625
rique-1.473
cedes-1.226
actly-1.198
productive-1.155
Token of
Token ID286
Feature activation+4.33e-01

Pos logit contributions
 notor+1.562
KY+1.528
[/+1.506
 tattoo+1.420
etic+1.394
Neg logit contributions
selves-1.761
rique-1.606
cedes-1.322
actly-1.299
productive-1.256
Token the
Token ID262
Feature activation+1.08e+00

Pos logit contributions
 notor+0.320
KY+0.312
[/+0.306
 tattoo+0.288
etic+0.284
Neg logit contributions
selves-0.378
rique-0.346
cedes-0.287
actly-0.280
productive-0.272
Token most
Token ID749
Feature activation+7.72e+00

Pos logit contributions
 notor+0.820
KY+0.803
[/+0.784
 tattoo+0.737
 awa+0.731
Neg logit contributions
selves-0.928
rique-0.846
cedes-0.698
actly-0.681
productive-0.660
Token recogn
Token ID3018
Feature activation+2.45e+00

Pos logit contributions
 notor+5.115
KY+4.845
[/+4.782
 tattoo+4.609
 popularity+4.531
Neg logit contributions
selves-7.282
rique-6.736
cedes-5.685
actly-5.660
productive-5.311
Tokenisable
Token ID43942
Feature activation+6.80e+00

Pos logit contributions
 notor+1.997
KY+1.953
[/+1.917
 tattoo+1.813
 awa+1.791
Neg logit contributions
selves-1.963
rique-1.780
cedes-1.446
actly-1.407
productive-1.360
Token names
Token ID3891
Feature activation+1.43e+01

Pos logit contributions
 notor+4.329
KY+4.184
[/+4.170
 tattoo+4.003
 popularity+3.835
Neg logit contributions
selves-6.476
rique-6.023
cedes-5.113
actly-5.050
productive-4.920
Token in
Token ID287
Feature activation+8.97e+00

Pos logit contributions
[/+7.520
 amongst+7.435
 notor+7.378
 tattoo+7.285
KY+7.202
Neg logit contributions
selves-13.817
rique-12.871
actly-11.170
cedes-11.032
productive-10.844
Token betting
Token ID22908
Feature activation+2.03e-01

Pos logit contributions
 notor+5.774
KY+5.739
[/+5.660
 tattoo+5.572
 popularity+5.485
Neg logit contributions
selves-8.149
rique-7.494
cedes-6.312
actly-6.264
productive-6.106
Token with
Token ID351
Feature activation+6.29e+00

Pos logit contributions
 notor+0.138
KY+0.135
[/+0.130
 awa+0.122
 tattoo+0.121
Neg logit contributions
selves-0.187
rique-0.173
cedes-0.147
actly-0.141
productive-0.138
Token a
Token ID257
Feature activation+4.70e+00

Pos logit contributions
 notor+4.299
KY+4.204
[/+4.156
 tattoo+3.919
 popularity+3.877
Neg logit contributions
selves-5.702
rique-5.273
cedes-4.436
actly-4.329
productive-4.213
Token particular
Token ID1948
Feature activation+5.44e+00

Pos logit contributions
 notor+3.368
KY+3.272
[/+3.219
 tattoo+3.051
 popularity+2.986
Neg logit contributions
selves-4.201
rique-3.844
cedes-3.209
actly-3.146
productive-3.046
Tokenuk
Token ID2724
Feature activation+2.76e+00

Pos logit contributions
selves+0.888
rique+0.834
cedes+0.722
actly+0.709
productive+0.691
Neg logit contributions
 notor-0.442
KY-0.405
[/-0.391
 tattoo-0.370
 awa-0.367
Tokenov
Token ID709
Feature activation-3.02e+00

Pos logit contributions
 notor+1.435
KY+1.374
[/+1.329
 tattoo+1.219
 awa+1.202
Neg logit contributions
selves-3.034
rique-2.831
cedes-2.453
actly-2.411
productive-2.357
Token was
Token ID373
Feature activation-1.37e-01

Pos logit contributions
selves+2.588
rique+2.354
cedes+1.914
actly+1.882
productive+1.847
Neg logit contributions
 notor-2.283
KY-2.197
[/-2.115
 awa-2.036
 tattoo-2.008
Token not
Token ID407
Feature activation-1.67e+00

Pos logit contributions
selves+0.118
rique+0.108
cedes+0.089
actly+0.087
productive+0.085
Neg logit contributions
 notor-0.103
KY-0.101
[/-0.098
 awa-0.092
 tattoo-0.092
Token known
Token ID1900
Feature activation+7.88e-01

Pos logit contributions
selves+1.423
rique+1.298
cedes+1.069
actly+1.050
productive+1.025
Neg logit contributions
 notor-1.255
KY-1.233
[/-1.191
 awa-1.131
 tattoo-1.122
Token widely
Token ID6768
Feature activation+1.41e+01

Pos logit contributions
 notor+0.562
KY+0.548
[/+0.529
 tattoo+0.498
 awa+0.496
Neg logit contributions
selves-0.715
rique-0.653
cedes-0.542
actly-0.534
productive-0.520
Token,
Token ID11
Feature activation+4.08e+00

Pos logit contributions
[/+8.082
 amongst+7.796
 notor+7.719
KY+7.462
 tattoo+7.201
Neg logit contributions
selves-12.815
rique-12.218
cedes-10.761
actly-10.282
aughed-9.882
Token but
Token ID475
Feature activation+4.48e+00

Pos logit contributions
 notor+2.902
KY+2.829
[/+2.780
 tattoo+2.607
etic+2.564
Neg logit contributions
selves-3.670
rique-3.376
cedes-2.822
actly-2.756
productive-2.671
Token the
Token ID262
Feature activation+2.13e+00

Pos logit contributions
 notor+3.276
KY+3.189
[/+3.147
 tattoo+2.957
etic+2.904
Neg logit contributions
selves-3.917
rique-3.583
cedes-2.997
actly-2.918
productive-2.830
Token victory
Token ID5373
Feature activation-3.68e+00

Pos logit contributions
 notor+1.612
KY+1.573
[/+1.541
 tattoo+1.451
 awa+1.434
Neg logit contributions
selves-1.821
rique-1.662
cedes-1.371
actly-1.338
productive-1.298
Token of
Token ID286
Feature activation-4.15e+00

Pos logit contributions
selves+2.993
rique+2.734
cedes+2.238
actly+2.147
productive+2.102
Neg logit contributions
 notor-2.920
KY-2.815
[/-2.722
 awa-2.622
etic-2.568
Token to
Token ID284
Feature activation+7.34e+00

Pos logit contributions
 notor+2.260
KY+2.180
[/+2.177
 tattoo+2.065
etic+2.018
Neg logit contributions
selves-2.425
rique-2.269
cedes-1.849
actly-1.825
productive-1.760
Token be
Token ID307
Feature activation+5.27e+00

Pos logit contributions
 notor+4.351
KY+4.257
[/+4.180
 tattoo+4.009
etic+3.780
Neg logit contributions
selves-7.164
rique-6.851
cedes-5.824
actly-5.726
productive-5.585
Token perceived
Token ID11067
Feature activation+2.71e+00

Pos logit contributions
 notor+3.352
KY+3.243
[/+3.169
 tattoo+3.008
etic+2.907
Neg logit contributions
selves-5.095
rique-4.761
cedes-4.022
actly-3.947
productive-3.831
Token as
Token ID355
Feature activation+1.51e+00

Pos logit contributions
 notor+1.883
KY+1.832
[/+1.796
 tattoo+1.686
etic+1.656
Neg logit contributions
selves-2.483
rique-2.293
cedes-1.922
actly-1.880
productive-1.825
Token such
Token ID884
Feature activation+2.71e+00

Pos logit contributions
 notor+0.897
KY+0.872
[/+0.847
 tattoo+0.780
 awa+0.771
Neg logit contributions
selves-1.551
rique-1.432
cedes-1.228
actly-1.203
productive-1.176
Token socially
Token ID18118
Feature activation+1.40e+01

Pos logit contributions
 notor+1.843
KY+1.795
[/+1.757
 tattoo+1.643
etic+1.617
Neg logit contributions
selves-2.527
rique-2.329
cedes-1.959
actly-1.916
productive-1.864
Token (
Token ID357
Feature activation+4.89e+00

Pos logit contributions
[/+7.397
 notor+7.168
KY+6.916
 amongst+6.854
 tattoo+6.668
Neg logit contributions
selves-12.698
rique-12.558
cedes-10.630
actly-10.535
productive-10.070
Tokenincluding
Token ID8201
Feature activation+6.80e-01

Pos logit contributions
 notor+3.415
KY+3.367
[/+3.280
 tattoo+3.073
etic+3.035
Neg logit contributions
selves-4.387
rique-4.087
cedes-3.399
actly-3.325
productive-3.202
Token the
Token ID262
Feature activation+1.89e+00

Pos logit contributions
 notor+0.477
KY+0.466
[/+0.455
 tattoo+0.422
 awa+0.420
Neg logit contributions
selves-0.624
rique-0.568
cedes-0.476
actly-0.464
productive-0.453
Token use
Token ID779
Feature activation+3.71e+00

Pos logit contributions
 notor+1.275
KY+1.242
[/+1.214
 tattoo+1.129
 awa+1.117
Neg logit contributions
selves-1.778
rique-1.635
cedes-1.375
actly-1.346
productive-1.311
Token of
Token ID286
Feature activation+3.08e+00

Pos logit contributions
 notor+2.728
KY+2.694
[/+2.653
 tattoo+2.520
etic+2.469
Neg logit contributions
selves-3.203
rique-2.939
cedes-2.426
actly-2.395
productive-2.293
Token the
Token ID262
Feature activation+2.70e+00

Pos logit contributions
 notor+2.574
KY+2.509
[/+2.481
 tattoo+2.343
 popularity+2.290
Neg logit contributions
selves-3.109
rique-2.849
cedes-2.372
actly-2.308
productive-2.249
Token site
Token ID2524
Feature activation+1.00e+00

Pos logit contributions
 notor+1.791
KY+1.740
[/+1.703
 tattoo+1.592
 awa+1.562
Neg logit contributions
selves-2.567
rique-2.367
cedes-2.001
actly-1.957
productive-1.905
Token but
Token ID475
Feature activation+3.69e+00

Pos logit contributions
 notor+0.771
KY+0.754
[/+0.734
 tattoo+0.693
 awa+0.689
Neg logit contributions
selves-0.846
rique-0.767
cedes-0.633
actly-0.615
productive-0.598
Token has
Token ID468
Feature activation+7.30e+00

Pos logit contributions
 notor+2.638
KY+2.569
[/+2.519
 tattoo+2.366
etic+2.328
Neg logit contributions
selves-3.310
rique-3.035
cedes-2.533
actly-2.476
productive-2.406
Token since
Token ID1201
Feature activation+6.12e+00

Pos logit contributions
 notor+4.923
[/+4.688
KY+4.674
 tattoo+4.508
 popularity+4.406
Neg logit contributions
selves-6.651
rique-6.189
cedes-5.222
actly-5.141
productive-4.942
Token gone
Token ID3750
Feature activation+1.39e+01

Pos logit contributions
 notor+4.381
KY+4.200
[/+4.192
 tattoo+4.013
 popularity+3.926
Neg logit contributions
selves-5.399
rique-4.942
cedes-4.149
actly-4.093
productive-3.918
Token viral
Token ID14416
Feature activation+9.86e+00

Pos logit contributions
 notor+7.035
[/+6.177
KY+6.128
 tattoo+6.008
 popularity+5.858
Neg logit contributions
selves-14.442
rique-14.006
cedes-12.069
actly-11.785
aughed-11.401
Token on
Token ID319
Feature activation+9.87e+00

Pos logit contributions
 notor+5.654
[/+5.573
KY+5.492
 amongst+5.200
 tattoo+5.139
Neg logit contributions
selves-9.601
rique-8.981
cedes-7.740
actly-7.581
productive-7.394
Token the
Token ID262
Feature activation+6.27e+00

Pos logit contributions
 notor+5.454
KY+5.412
[/+5.252
 tattoo+5.045
 popularity+4.967
Neg logit contributions
selves-9.909
rique-9.145
cedes-7.929
actly-7.752
productive-7.570
Token internet
Token ID5230
Feature activation+4.66e+00

Pos logit contributions
 notor+3.405
KY+3.259
[/+3.190
 tattoo+3.008
 popularity+2.952
Neg logit contributions
selves-6.610
rique-6.199
cedes-5.360
actly-5.235
productive-5.119
Token.
Token ID13
Feature activation+2.88e+00

Pos logit contributions
 notor+2.783
KY+2.681
[/+2.663
 tattoo+2.449
etic+2.406
Neg logit contributions
selves-4.682
rique-4.343
cedes-3.712
actly-3.657
productive-3.560
Token they
Token ID484
Feature activation+6.94e+00

Pos logit contributions
[/+4.352
KY+4.172
 tattoo+3.939
 notor+3.922
etic+3.819
Neg logit contributions
selves-6.185
rique-5.939
cedes-4.882
actly-4.783
productive-4.684
Token are
Token ID389
Feature activation+9.87e+00

Pos logit contributions
[/+4.967
KY+4.842
 notor+4.628
 tattoo+4.576
cale+4.512
Neg logit contributions
selves-5.945
rique-5.636
cedes-4.577
actly-4.449
productive-4.314
Token not
Token ID407
Feature activation+8.17e+00

Pos logit contributions
[/+5.766
KY+5.384
 notor+5.352
 tattoo+5.161
etic+5.016
Neg logit contributions
selves-9.711
rique-9.252
cedes-7.749
actly-7.531
aughed-7.463
Token too
Token ID1165
Feature activation+8.00e+00

Pos logit contributions
[/+6.203
 notor+6.000
KY+5.948
 tattoo+5.760
etic+5.571
Neg logit contributions
selves-6.524
rique-6.277
cedes-4.929
actly-4.787
productive-4.710
Token obvious
Token ID3489
Feature activation+1.07e+01

Pos logit contributions
 notor+5.854
[/+5.809
KY+5.570
 tattoo+5.403
etic+5.312
Neg logit contributions
selves-6.797
rique-6.386
cedes-5.188
actly-4.913
productive-4.884
Token (
Token ID357
Feature activation+1.39e+01

Pos logit contributions
[/+6.550
KY+6.379
 tattoo+5.843
 notor+5.739
cale+5.655
Neg logit contributions
selves-10.050
rique-9.664
cedes-7.974
actly-7.756
productive-7.670
Tokenas
Token ID292
Feature activation+3.60e+00

Pos logit contributions
[/+9.269
KY+8.914
cale+8.391
etic+8.202
 tattoo+8.175
Neg logit contributions
selves-11.410
rique-11.384
cedes-8.824
aughed-8.804
 Dying-8.718
Token opposed
Token ID6886
Feature activation+3.73e+00

Pos logit contributions
 notor+3.026
KY+2.974
[/+2.936
 tattoo+2.789
etic+2.747
Neg logit contributions
selves-2.759
rique-2.506
cedes-2.004
actly-1.941
productive-1.876
Token to
Token ID284
Feature activation+6.83e+00

Pos logit contributions
KY+2.860
[/+2.854
 notor+2.843
 tattoo+2.683
etic+2.638
Neg logit contributions
selves-3.029
rique-2.802
cedes-2.269
actly-2.218
productive-2.132
Token back
Token ID736
Feature activation+5.13e+00

Pos logit contributions
[/+2.385
KY+2.343
 notor+2.307
 tattoo+2.114
etic+1.988
Neg logit contributions
selves-8.337
rique-7.941
cedes-6.939
actly-6.859
productive-6.704
Token in
Token ID287
Feature activation+7.76e+00

Pos logit contributions
[/+3.267
KY+3.255
 notor+3.210
 tattoo+3.028
cale+2.934
Neg logit contributions
selves-4.838
rique-4.521
cedes-3.795
actly-3.706
productive-3.586
Token"
Token ID1
Feature activation+2.81e+00

Pos logit contributions
KY+2.392
 notor+2.388
[/+2.371
 tattoo+2.097
etic+2.036
Neg logit contributions
selves-6.004
rique-5.587
cedes-4.923
actly-4.802
productive-4.742
Token because
Token ID780
Feature activation+2.49e+00

Pos logit contributions
 notor+2.025
KY+1.988
[/+1.955
 tattoo+1.850
etic+1.802
Neg logit contributions
selves-2.476
rique-2.262
cedes-1.892
actly-1.847
productive-1.802
Token of
Token ID286
Feature activation+2.46e+00

Pos logit contributions
 notor+1.890
KY+1.850
[/+1.824
 tattoo+1.741
etic+1.694
Neg logit contributions
selves-2.106
rique-1.912
cedes-1.587
actly-1.546
productive-1.505
Token his
Token ID465
Feature activation+6.36e+00

Pos logit contributions
 notor+1.700
KY+1.654
[/+1.624
 tattoo+1.540
etic+1.498
Neg logit contributions
selves-2.267
rique-2.079
cedes-1.753
actly-1.703
productive-1.665
Token dimin
Token ID12110
Feature activation+8.24e+00

Pos logit contributions
 notor+4.603
KY+4.479
[/+4.407
 tattoo+4.320
 popularity+4.189
Neg logit contributions
selves-5.561
rique-5.109
cedes-4.261
actly-4.161
productive-4.047
Tokenutive
Token ID8827
Feature activation+1.38e+01

Pos logit contributions
KY+4.117
[/+4.053
 notor+4.027
etic+3.801
cale+3.743
Neg logit contributions
selves-8.805
rique-8.184
cedes-7.029
actly-6.818
 Sending-6.668
Token height
Token ID6001
Feature activation+1.02e+01

Pos logit contributions
 tattoo+6.740
 notor+6.642
[/+6.501
KY+6.483
 popularity+6.305
Neg logit contributions
selves-14.007
rique-13.244
cedes-11.522
actly-11.306
productive-11.088
Token.
Token ID13
Feature activation+3.81e+00

Pos logit contributions
 notor+5.161
[/+5.081
KY+5.051
 tattoo+4.982
cale+4.610
Neg logit contributions
selves-10.402
rique-9.765
cedes-8.579
actly-8.287
productive-8.054
Token
Token ID198
Feature activation-2.16e+00

Pos logit contributions
KY+2.493
[/+2.451
 notor+2.397
 tattoo+2.280
etic+2.218
Neg logit contributions
selves-3.501
rique-3.280
cedes-2.808
actly-2.716
productive-2.665
Token
Token ID198
Feature activation-1.98e+00

Pos logit contributions
selves+1.817
rique+1.604
cedes+1.336
actly+1.301
productive+1.272
Neg logit contributions
 notor-1.875
 awa-1.656
KY-1.600
 challeng-1.534
[/-1.521
Token"
Token ID1
Feature activation+3.10e+00

Pos logit contributions
selves+1.675
rique+1.503
cedes+1.248
actly+1.220
productive+1.188
Neg logit contributions
 notor-1.626
 awa-1.425
KY-1.422
[/-1.403
 tattoo-1.385
TokenThroughout
Token ID26797
Feature activation+3.65e-01

Pos logit contributions
 notor+0.085
KY+0.079
[/+0.076
 awa+0.075
 tattoo+0.075
Neg logit contributions
selves-0.096
rique-0.086
cedes-0.072
actly-0.071
productive-0.069
Token all
Token ID477
Feature activation-1.02e+00

Pos logit contributions
 notor+0.257
KY+0.251
[/+0.245
 tattoo+0.230
etic+0.227
Neg logit contributions
selves-0.332
rique-0.304
cedes-0.255
actly-0.249
productive-0.242
Token of
Token ID286
Feature activation-2.68e+00

Pos logit contributions
selves+0.851
rique+0.778
cedes+0.640
actly+0.622
productive+0.608
Neg logit contributions
 notor-0.798
KY-0.776
[/-0.755
 tattoo-0.712
 awa-0.709
Token this
Token ID428
Feature activation-9.22e-02

Pos logit contributions
selves+2.247
rique+2.043
cedes+1.677
actly+1.645
productive+1.613
Neg logit contributions
 notor-2.073
KY-2.010
[/-1.931
 awa-1.834
etic-1.830
Token,
Token ID11
Feature activation+4.29e+00

Pos logit contributions
selves+0.083
rique+0.076
cedes+0.064
actly+0.062
productive+0.061
Neg logit contributions
 notor-0.066
KY-0.064
[/-0.063
 tattoo-0.059
etic-0.058
Token Twitter
Token ID3009
Feature activation+1.37e+01

Pos logit contributions
 notor+3.178
KY+3.132
[/+3.112
 tattoo+2.928
 awa+2.858
Neg logit contributions
selves-3.670
rique-3.372
cedes-2.794
actly-2.717
productive-2.622
Token has
Token ID468
Feature activation+1.78e+00

Pos logit contributions
[/+8.745
KY+8.629
 notor+8.512
 popularity+7.982
 tattoo+7.982
Neg logit contributions
selves-11.797
rique-10.780
cedes-9.047
actly-8.887
productive-8.291
Token been
Token ID587
Feature activation+1.43e+00

Pos logit contributions
 notor+1.353
KY+1.321
[/+1.295
 tattoo+1.220
etic+1.204
Neg logit contributions
selves-1.519
rique-1.386
cedes-1.144
actly-1.116
productive-1.081
Token indispensable
Token ID35669
Feature activation-1.03e+01

Pos logit contributions
 notor+1.071
KY+1.050
[/+1.024
 tattoo+0.961
 awa+0.955
Neg logit contributions
selves-1.239
rique-1.132
cedes-0.935
actly-0.914
productive-0.894
Token.
Token ID13
Feature activation-1.45e+00

Pos logit contributions
selves+8.225
rique+7.290
cedes+6.212
productive+6.018
actly+5.956
Neg logit contributions
 notor-7.532
KY-7.344
[/-6.916
 awa-6.869
anski-6.725
Token When
Token ID1649
Feature activation-2.09e+00

Pos logit contributions
selves+1.134
rique+1.001
cedes+0.806
actly+0.784
productive+0.769
Neg logit contributions
 notor-1.261
KY-1.174
[/-1.140
 awa-1.110
 tattoo-1.098
Token,
Token ID11
Feature activation+4.71e-01

Pos logit contributions
selves+2.735
rique+2.491
cedes+2.047
actly+1.976
productive+1.939
Neg logit contributions
 notor-2.443
KY-2.362
[/-2.300
 awa-2.171
 tattoo-2.135
Token it
Token ID340
Feature activation-6.42e-01

Pos logit contributions
 notor+0.350
KY+0.341
[/+0.334
 tattoo+0.311
 awa+0.311
Neg logit contributions
selves-0.411
rique-0.374
cedes-0.310
actly-0.302
productive-0.294
Token's
Token ID338
Feature activation-8.31e-03

Pos logit contributions
selves+0.626
rique+0.583
cedes+0.493
actly+0.484
productive+0.471
Neg logit contributions
 notor-0.408
KY-0.395
[/-0.389
 tattoo-0.365
etic-0.355
Token the
Token ID262
Feature activation-1.16e+00

Pos logit contributions
selves+0.007
rique+0.006
cedes+0.005
actly+0.005
productive+0.005
Neg logit contributions
 notor-0.006
KY-0.006
[/-0.006
 tattoo-0.006
etic-0.006
Token nearly
Token ID3016
Feature activation+5.86e+00

Pos logit contributions
selves+0.997
rique+0.908
cedes+0.744
actly+0.726
productive+0.703
Neg logit contributions
 notor-0.876
KY-0.857
[/-0.838
etic-0.780
 awa-0.780
Token universally
Token ID26208
Feature activation+1.36e+01

Pos logit contributions
 notor+4.262
KY+4.139
[/+4.105
 tattoo+3.949
 popularity+3.803
Neg logit contributions
selves-5.107
rique-4.685
cedes-3.860
actly-3.774
productive-3.637
Token ad
Token ID512
Feature activation-2.03e+00

Pos logit contributions
 notor+9.119
 tattoo+8.782
KY+8.672
[/+8.654
 popularity+8.325
Neg logit contributions
selves-11.348
rique-10.658
cedes-9.042
actly-8.867
productive-8.267
Tokenored
Token ID1850
Feature activation+3.90e+00

Pos logit contributions
selves+1.398
rique+1.234
cedes+0.971
actly+0.943
productive+0.909
Neg logit contributions
 notor-1.877
KY-1.829
[/-1.807
 tattoo-1.703
 awa-1.702
Token Burger
Token ID28983
Feature activation-1.22e+00

Pos logit contributions
 notor+2.923
KY+2.876
[/+2.817
 tattoo+2.697
 popularity+2.617
Neg logit contributions
selves-3.313
rique-3.017
cedes-2.513
actly-2.462
productive-2.383
Token Guys
Token ID29899
Feature activation+5.32e+00

Pos logit contributions
selves+0.976
rique+0.888
cedes+0.718
actly+0.699
productive+0.672
Neg logit contributions
 notor-0.994
KY-0.970
[/-0.947
 tattoo-0.896
 awa-0.889
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 notor+3.404
KY+3.343
[/+3.281
 tattoo+3.130
etic+3.011
Neg logit contributions
selves-5.055
rique-4.674
cedes-4.005
actly-3.905
productive-3.810
Token employment
Token ID7184
Feature activation+1.84e+00

Pos logit contributions
 notor+1.765
KY+1.719
[/+1.681
 tattoo+1.572
etic+1.549
Neg logit contributions
selves-2.391
rique-2.199
cedes-1.848
actly-1.807
productive-1.759
Token,
Token ID11
Feature activation+4.03e+00

Pos logit contributions
 notor+1.222
KY+1.188
[/+1.165
 tattoo+1.085
etic+1.068
Neg logit contributions
selves-1.739
rique-1.603
cedes-1.353
actly-1.326
productive-1.288
Token and
Token ID290
Feature activation+6.09e+00

Pos logit contributions
 notor+2.567
KY+2.480
[/+2.446
 tattoo+2.283
etic+2.237
Neg logit contributions
selves-3.935
rique-3.640
cedes-3.098
actly-3.041
productive-2.941
Token ostr
Token ID42387
Feature activation+6.75e+00

Pos logit contributions
 notor+4.175
[/+4.031
KY+4.015
 tattoo+3.782
etic+3.694
Neg logit contributions
selves-5.575
rique-5.139
cedes-4.366
actly-4.278
productive-4.077
Tokenac
Token ID330
Feature activation+2.06e+00

Pos logit contributions
[/+3.863
KY+3.860
 notor+3.842
etic+3.560
 tattoo+3.557
Neg logit contributions
selves-6.652
rique-6.220
cedes-5.316
actly-5.163
productive-5.080
Tokenism
Token ID1042
Feature activation+1.36e+01

Pos logit contributions
 notor+1.654
KY+1.625
[/+1.605
 tattoo+1.511
etic+1.504
Neg logit contributions
selves-1.657
rique-1.502
cedes-1.227
actly-1.195
productive-1.155
Token from
Token ID422
Feature activation+7.94e+00

Pos logit contributions
[/+6.925
 notor+6.321
 amongst+6.133
KY+6.007
 tattoo+5.843
Neg logit contributions
selves-13.520
rique-13.011
cedes-11.492
actly-11.143
aughed-10.758
Token family
Token ID1641
Feature activation+7.65e+00

Pos logit contributions
 notor+4.936
KY+4.771
[/+4.771
 tattoo+4.492
 popularity+4.334
Neg logit contributions
selves-7.611
rique-7.109
cedes-6.087
actly-6.003
productive-5.702
Token and
Token ID290
Feature activation+2.72e+00

Pos logit contributions
 notor+4.178
[/+4.145
KY+3.993
 tattoo+3.665
etic+3.642
Neg logit contributions
selves-7.925
rique-7.407
cedes-6.355
actly-6.319
productive-6.081
Token social
Token ID1919
Feature activation+5.77e+00

Pos logit contributions
 notor+1.950
KY+1.899
[/+1.859
 tattoo+1.742
 awa+1.720
Neg logit contributions
selves-2.453
rique-2.242
cedes-1.870
actly-1.828
productive-1.781
Token networks
Token ID7686
Feature activation+1.09e+01

Pos logit contributions
 notor+3.503
KY+3.384
[/+3.346
 tattoo+3.082
etic+3.048
Neg logit contributions
selves-5.716
rique-5.312
cedes-4.523
actly-4.483
productive-4.336
Token of
Token ID286
Feature activation+3.80e-01

Pos logit contributions
selves+1.606
rique+1.473
cedes+1.213
actly+1.187
productive+1.145
Neg logit contributions
 notor-1.411
KY-1.373
[/-1.353
 tattoo-1.280
etic-1.258
Token girls
Token ID4813
Feature activation-3.75e-01

Pos logit contributions
 notor+0.258
KY+0.252
[/+0.246
 tattoo+0.227
etic+0.226
Neg logit contributions
selves-0.356
rique-0.326
cedes-0.274
actly-0.268
productive-0.261
Token.
Token ID13
Feature activation+2.75e+00

Pos logit contributions
selves+0.390
rique+0.361
cedes+0.310
actly+0.303
productive+0.297
Neg logit contributions
 notor-0.217
KY-0.210
[/-0.203
 tattoo-0.188
 awa-0.186
Token
Token ID198
Feature activation+3.01e+00

Pos logit contributions
 notor+2.206
KY+2.157
[/+2.116
 tattoo+2.001
etic+1.975
Neg logit contributions
selves-2.228
rique-2.023
cedes-1.649
actly-1.605
productive-1.553
Token
Token ID198
Feature activation+2.83e+00

Pos logit contributions
 notor+2.043
KY+1.992
[/+1.948
 tattoo+1.821
etic+1.793
Neg logit contributions
selves-2.815
rique-2.592
cedes-2.181
actly-2.133
productive-2.075
TokenGenerally
Token ID37058
Feature activation+1.35e+01

Pos logit contributions
 notor+2.127
KY+2.044
[/+2.000
 tattoo+1.900
 awa+1.885
Neg logit contributions
selves-2.462
rique-2.240
cedes-1.863
actly-1.816
productive-1.768
Token,
Token ID11
Feature activation+7.40e+00

Pos logit contributions
 notor+7.964
[/+7.686
KY+7.340
 tattoo+7.312
 amongst+7.183
Neg logit contributions
selves-12.252
rique-11.715
cedes-10.181
actly-9.868
productive-9.369
Token the
Token ID262
Feature activation+4.39e+00

Pos logit contributions
 notor+5.138
KY+4.913
[/+4.896
 tattoo+4.687
 popularity+4.584
Neg logit contributions
selves-6.614
rique-6.159
cedes-5.190
actly-5.067
productive-4.890
Token Mong
Token ID18428
Feature activation+6.23e+00

Pos logit contributions
 notor+2.893
KY+2.798
[/+2.743
 tattoo+2.596
 popularity+2.533
Neg logit contributions
selves-4.155
rique-3.849
cedes-3.272
actly-3.197
productive-3.101
Tokenols
Token ID10220
Feature activation+6.48e+00

Pos logit contributions
KY+3.681
[/+3.640
 notor+3.594
etic+3.470
 tattoo+3.330
Neg logit contributions
selves-6.164
rique-5.620
cedes-4.870
actly-4.768
productive-4.649
Token (
Token ID357
Feature activation+5.58e+00

Pos logit contributions
 notor+4.649
KY+4.551
[/+4.485
 tattoo+4.316
etic+4.185
Neg logit contributions
selves-5.620
rique-5.234
cedes-4.383
actly-4.239
productive-4.112
Token king
Token ID5822
Feature activation+6.77e+00

Pos logit contributions
 notor+2.093
KY+2.039
[/+2.002
 tattoo+1.891
 awa+1.861
Neg logit contributions
selves-2.399
rique-2.200
cedes-1.818
actly-1.773
productive-1.717
Token,
Token ID11
Feature activation+8.80e-01

Pos logit contributions
 notor+4.481
KY+4.425
[/+4.390
 tattoo+4.124
etic+4.078
Neg logit contributions
selves-6.087
rique-5.686
cedes-4.730
actly-4.634
productive-4.500
Token and
Token ID290
Feature activation+1.66e+00

Pos logit contributions
 notor+0.622
KY+0.607
[/+0.592
 tattoo+0.553
etic+0.549
Neg logit contributions
selves-0.800
rique-0.730
cedes-0.611
actly-0.599
productive-0.581
Token then
Token ID788
Feature activation+5.88e+00

Pos logit contributions
 notor+1.268
KY+1.239
[/+1.211
 tattoo+1.141
etic+1.129
Neg logit contributions
selves-1.414
rique-1.286
cedes-1.061
actly-1.035
productive-1.003
Token they
Token ID484
Feature activation+3.38e+00

Pos logit contributions
 notor+4.064
[/+3.935
KY+3.929
 tattoo+3.689
 popularity+3.611
Neg logit contributions
selves-5.276
rique-4.926
cedes-4.128
actly-4.020
productive-3.914
Token go
Token ID467
Feature activation+1.35e+01

Pos logit contributions
 notor+2.409
KY+2.351
[/+2.310
 tattoo+2.175
 awa+2.135
Neg logit contributions
selves-3.025
rique-2.792
cedes-2.329
actly-2.274
productive-2.210
Token crazy
Token ID7165
Feature activation+5.81e+00

Pos logit contributions
 notor+8.002
[/+7.669
KY+7.428
 tattoo+7.391
 awa+7.231
Neg logit contributions
selves-12.085
rique-11.918
cedes-9.921
actly-9.695
productive-9.570
Token."
Token ID526
Feature activation+4.28e+00

Pos logit contributions
 notor+3.339
KY+3.245
[/+3.214
 tattoo+2.989
 awa+2.901
Neg logit contributions
selves-5.900
rique-5.537
cedes-4.744
actly-4.631
productive-4.532
Token
Token ID198
Feature activation+7.51e-01

Pos logit contributions
[/+2.915
KY+2.889
 notor+2.759
 tattoo+2.669
etic+2.623
Neg logit contributions
selves-3.823
rique-3.571
cedes-3.002
actly-2.912
productive-2.871
Token
Token ID198
Feature activation-8.65e-01

Pos logit contributions
 notor+0.567
KY+0.523
[/+0.503
 awa+0.499
 tattoo+0.481
Neg logit contributions
selves-0.678
rique-0.614
cedes-0.515
actly-0.504
productive-0.490
TokenMus
Token ID10694
Feature activation+1.12e+00

Pos logit contributions
selves+0.725
rique+0.656
cedes+0.546
actly+0.534
productive+0.516
Neg logit contributions
 notor-0.700
KY-0.641
[/-0.625
 awa-0.619
 tattoo-0.608
Tokendis
Token ID6381
Feature activation-7.39e+00

Pos logit contributions
selves+2.814
rique+2.581
cedes+2.153
actly+2.133
productive+2.087
Neg logit contributions
 notor-2.156
KY-2.070
[/-1.983
 tattoo-1.904
etic-1.871
Tokenappointed
Token ID32924
Feature activation-3.49e+00

Pos logit contributions
selves+5.438
rique+4.894
cedes+3.883
actly+3.833
productive+3.762
Neg logit contributions
 notor-6.370
KY-6.071
[/-5.878
 tattoo-5.710
 awa-5.694
Token to
Token ID284
Feature activation+4.47e+00

Pos logit contributions
selves+3.252
rique+2.944
cedes+2.483
actly+2.467
productive+2.381
Neg logit contributions
 notor-2.390
KY-2.286
[/-2.155
 tattoo-2.089
 awa-2.089
Token only
Token ID691
Feature activation+6.57e+00

Pos logit contributions
 notor+2.973
KY+2.909
[/+2.897
 tattoo+2.686
 awa+2.622
Neg logit contributions
selves-4.189
rique-3.884
cedes-3.277
actly-3.192
productive-3.115
Token be
Token ID307
Feature activation+5.05e+00

Pos logit contributions
 notor+4.213
KY+4.141
[/+4.121
 tattoo+3.856
 awa+3.721
Neg logit contributions
selves-6.241
rique-5.776
cedes-4.927
actly-4.798
productive-4.661
Token trending
Token ID29056
Feature activation+1.34e+01

Pos logit contributions
 notor+3.502
KY+3.408
[/+3.373
 tattoo+3.165
 awa+3.105
Neg logit contributions
selves-4.607
rique-4.236
cedes-3.562
actly-3.492
productive-3.384
Token No
Token ID1400
Feature activation+8.27e+00

Pos logit contributions
 notor+8.145
[/+8.063
KY+8.012
 amongst+7.695
 popularity+7.645
Neg logit contributions
selves-11.708
rique-11.429
cedes-9.261
actly-9.125
productive-8.877
Token7
Token ID22
Feature activation+7.59e+00

Pos logit contributions
 notor+5.710
[/+5.649
KY+5.622
 tattoo+5.242
etic+5.207
Neg logit contributions
selves-7.284
rique-6.817
cedes-5.603
actly-5.470
productive-5.384
Token in
Token ID287
Feature activation+6.16e+00

Pos logit contributions
 notor+5.071
KY+5.043
[/+4.981
 tattoo+4.650
 popularity+4.549
Neg logit contributions
selves-6.850
rique-6.370
cedes-5.310
actly-5.187
productive-5.076
Token USA
Token ID4916
Feature activation+2.32e+00

Pos logit contributions
 notor+4.341
KY+4.257
[/+4.205
 tattoo+3.987
 popularity+3.968
Neg logit contributions
selves-5.464
rique-5.004
cedes-4.169
actly-4.066
productive-3.951
Token
Token ID447
Feature activation-2.67e+00

Pos logit contributions
 notor+1.916
KY+1.880
[/+1.848
 tattoo+1.747
etic+1.721
Neg logit contributions
selves-1.832
rique-1.659
cedes-1.340
actly-1.304
productive-1.260
Token identified
Token ID5174
Feature activation-4.13e-01

Pos logit contributions
 notor+3.584
[/+3.533
KY+3.509
 tattoo+3.227
etic+3.142
Neg logit contributions
selves-5.086
rique-4.732
cedes-3.982
actly-3.888
productive-3.786
Token as
Token ID355
Feature activation-4.74e-01

Pos logit contributions
selves+0.362
rique+0.329
cedes+0.273
actly+0.267
productive+0.260
Neg logit contributions
 notor-0.306
KY-0.297
[/-0.288
 tattoo-0.273
 awa-0.271
Token being
Token ID852
Feature activation+4.71e+00

Pos logit contributions
selves+0.425
rique+0.389
cedes+0.324
actly+0.317
productive+0.310
Neg logit contributions
 notor-0.338
KY-0.332
[/-0.321
 tattoo-0.301
etic-0.299
Token the
Token ID262
Feature activation-1.88e+00

Pos logit contributions
 notor+3.191
KY+3.082
[/+3.059
 tattoo+2.856
 awa+2.786
Neg logit contributions
selves-4.366
rique-4.049
cedes-3.408
actly-3.327
productive-3.221
Token worst
Token ID5290
Feature activation+1.73e+00

Pos logit contributions
selves+1.685
rique+1.531
cedes+1.282
actly+1.261
productive+1.223
Neg logit contributions
 notor-1.332
KY-1.316
[/-1.255
etic-1.189
 tattoo-1.181
Token globally
Token ID18309
Feature activation+1.34e+01

Pos logit contributions
 notor+1.052
KY+1.027
[/+0.993
 tattoo+0.920
etic+0.910
Neg logit contributions
selves-1.732
rique-1.602
cedes-1.371
actly-1.341
productive-1.310
Token by
Token ID416
Feature activation+6.77e+00

Pos logit contributions
[/+7.994
 notor+7.634
KY+7.204
 amongst+7.200
 tattoo+6.973
Neg logit contributions
selves-12.255
rique-11.792
cedes-9.893
actly-9.775
productive-9.264
Token international
Token ID3230
Feature activation+3.91e+00

Pos logit contributions
 notor+4.686
KY+4.588
[/+4.585
 tattoo+4.258
 popularity+4.228
Neg logit contributions
selves-6.086
rique-5.580
cedes-4.710
actly-4.603
productive-4.427
Token policing
Token ID21922
Feature activation+8.55e+00

Pos logit contributions
 notor+2.534
KY+2.464
[/+2.405
 tattoo+2.241
 awa+2.207
Neg logit contributions
selves-3.776
rique-3.484
cedes-2.951
actly-2.890
productive-2.815
Token body
Token ID1767
Feature activation+8.19e+00

Pos logit contributions
 notor+5.388
[/+5.374
KY+5.280
 tattoo+4.807
 popularity+4.779
Neg logit contributions
selves-8.129
rique-7.546
cedes-6.341
actly-6.251
productive-6.039
Token Inter
Token ID4225
Feature activation-6.64e+00

Pos logit contributions
 notor+5.618
[/+5.600
KY+5.535
 tattoo+5.083
 awa+5.054
Neg logit contributions
selves-7.260
rique-6.718
cedes-5.683
actly-5.527
productive-5.323
Token with
Token ID351
Feature activation-3.80e+00

Pos logit contributions
 notor+1.193
KY+1.157
[/+1.146
 tattoo+1.085
etic+1.060
Neg logit contributions
selves-1.350
rique-1.235
cedes-1.021
actly-0.993
productive-0.974
Token artwork
Token ID16257
Feature activation-9.79e-01

Pos logit contributions
selves+3.146
rique+2.821
cedes+2.312
actly+2.305
productive+2.174
Neg logit contributions
 notor-2.920
KY-2.894
[/-2.790
etic-2.645
cale-2.621
Token from
Token ID422
Feature activation+2.07e+00

Pos logit contributions
selves+0.829
rique+0.756
cedes+0.621
actly+0.610
productive+0.586
Neg logit contributions
 notor-0.747
KY-0.733
[/-0.709
etic-0.669
 awa-0.667
Token frequent
Token ID10792
Feature activation+2.72e+00

Pos logit contributions
 notor+1.599
KY+1.561
[/+1.533
 tattoo+1.451
 awa+1.426
Neg logit contributions
selves-1.728
rique-1.580
cedes-1.301
actly-1.266
productive-1.226
Token Dinosaur
Token ID47808
Feature activation-6.48e+00

Pos logit contributions
 notor+2.001
KY+1.952
[/+1.912
 tattoo+1.801
etic+1.773
Neg logit contributions
selves-2.379
rique-2.178
cedes-1.809
actly-1.766
productive-1.713
Token Jr
Token ID7504
Feature activation+1.33e+01

Pos logit contributions
selves+5.566
rique+5.012
cedes+4.102
actly+4.052
productive+3.965
Neg logit contributions
 notor-4.641
KY-4.530
[/-4.415
 awa-4.159
cale-4.118
Token.
Token ID13
Feature activation+2.86e+00

Pos logit contributions
 notor+7.391
[/+7.091
KY+7.025
 tattoo+6.757
 popularity+6.404
Neg logit contributions
selves-12.450
rique-12.004
cedes-10.337
actly-10.114
productive-9.792
Token collaborator
Token ID50160
Feature activation+1.70e-01

Pos logit contributions
 notor+2.155
KY+2.106
[/+2.062
 tattoo+1.934
 awa+1.916
Neg logit contributions
selves-2.467
rique-2.248
cedes-1.857
actly-1.811
productive-1.758
Token Maur
Token ID18867
Feature activation-3.91e+00

Pos logit contributions
 notor+0.141
KY+0.137
[/+0.134
 tattoo+0.127
 awa+0.126
Neg logit contributions
selves-0.135
rique-0.122
cedes-0.098
actly-0.096
productive-0.093
Tokena
Token ID64
Feature activation-1.66e+00

Pos logit contributions
selves+3.077
rique+2.891
cedes+2.338
actly+2.201
acons+2.157
Neg logit contributions
 notor-3.196
KY-3.046
[/-2.981
 awa-2.863
 tattoo-2.841
Token Jasper
Token ID42992
Feature activation+9.12e-01

Pos logit contributions
selves+1.323
rique+1.202
cedes+0.970
actly+0.941
productive+0.909
Neg logit contributions
 notor-1.365
KY-1.325
[/-1.293
 tattoo-1.227
 awa-1.219
Token still
Token ID991
Feature activation+1.17e+00

Pos logit contributions
selves+0.117
rique+0.107
cedes+0.089
actly+0.087
productive+0.085
Neg logit contributions
 notor-0.097
KY-0.094
[/-0.091
 tattoo-0.086
 awa-0.086
Token comes
Token ID2058
Feature activation+6.76e+00

Pos logit contributions
 notor+0.875
KY+0.855
[/+0.833
 tattoo+0.785
etic+0.776
Neg logit contributions
selves-1.019
rique-0.931
cedes-0.772
actly-0.753
productive-0.730
Token with
Token ID351
Feature activation+7.06e-01

Pos logit contributions
 notor+4.607
[/+4.533
KY+4.462
 tattoo+4.271
etic+4.178
Neg logit contributions
selves-6.014
rique-5.596
cedes-4.681
actly-4.521
productive-4.462
Token extreme
Token ID3257
Feature activation+1.14e+00

Pos logit contributions
 notor+0.522
KY+0.511
[/+0.498
 tattoo+0.467
 awa+0.463
Neg logit contributions
selves-0.617
rique-0.564
cedes-0.468
actly-0.457
productive-0.442
Token social
Token ID1919
Feature activation+9.85e+00

Pos logit contributions
 notor+0.727
KY+0.709
[/+0.689
 tattoo+0.638
 awa+0.633
Neg logit contributions
selves-1.119
rique-1.033
cedes-0.877
actly-0.859
productive-0.837
Token stigma
Token ID26695
Feature activation+1.32e+01

Pos logit contributions
 notor+5.632
[/+5.351
KY+5.336
 tattoo+5.125
 popularity+5.086
Neg logit contributions
selves-9.638
rique-9.109
cedes-7.752
actly-7.568
productive-7.285
Token and
Token ID290
Feature activation+4.71e+00

Pos logit contributions
[/+7.251
 notor+7.085
KY+6.759
 tattoo+6.571
 amongst+6.552
Neg logit contributions
selves-12.546
rique-11.920
cedes-10.216
actly-9.763
aughed-9.750
Token risk
Token ID2526
Feature activation+3.31e+00

Pos logit contributions
 notor+3.281
KY+3.142
[/+3.140
 tattoo+2.947
 popularity+2.875
Neg logit contributions
selves-4.322
rique-3.980
cedes-3.326
actly-3.251
productive-3.162
Token for
Token ID329
Feature activation+5.91e+00

Pos logit contributions
 notor+2.408
KY+2.334
[/+2.319
 tattoo+2.178
etic+2.136
Neg logit contributions
selves-2.941
rique-2.683
cedes-2.231
actly-2.182
productive-2.103
Token violence
Token ID3685
Feature activation+1.10e+01

Pos logit contributions
 notor+3.730
[/+3.570
KY+3.545
 tattoo+3.358
 popularity+3.254
Neg logit contributions
selves-5.757
rique-5.298
cedes-4.508
actly-4.442
productive-4.279
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
[/+6.295
 notor+6.133
KY+5.903
 tattoo+5.664
 amongst+5.626
Neg logit contributions
selves-10.476
rique-9.961
cedes-8.567
actly-8.245
acons-7.997
Token nude
Token ID26349
Feature activation-5.86e-01

Pos logit contributions
 notor+0.107
KY+0.105
[/+0.101
 awa+0.094
etic+0.093
Neg logit contributions
selves-0.151
rique-0.138
cedes-0.117
actly-0.114
productive-0.112
Token photos
Token ID5205
Feature activation+2.14e+00

Pos logit contributions
selves+0.545
rique+0.499
cedes+0.420
actly+0.410
productive+0.400
Neg logit contributions
 notor-0.404
KY-0.388
[/-0.376
 awa-0.356
cale-0.349
Token and
Token ID290
Feature activation-2.67e+00

Pos logit contributions
 notor+1.444
KY+1.405
[/+1.371
 tattoo+1.280
 awa+1.265
Neg logit contributions
selves-2.010
rique-1.850
cedes-1.559
actly-1.525
productive-1.485
Token video
Token ID2008
Feature activation-6.16e+00

Pos logit contributions
selves+2.505
rique+2.272
cedes+1.936
actly+1.908
productive+1.853
Neg logit contributions
 notor-1.781
KY-1.713
[/-1.656
cale-1.558
 awa-1.541
Token from
Token ID422
Feature activation+3.22e+00

Pos logit contributions
selves+5.158
rique+4.581
cedes+3.838
actly+3.714
acons+3.697
Neg logit contributions
 notor-4.602
KY-4.517
[/-4.226
cale-4.137
 awa-4.119
Token publicly
Token ID7271
Feature activation+1.32e+01

Pos logit contributions
 notor+2.362
KY+2.302
[/+2.275
 tattoo+2.133
etic+2.095
Neg logit contributions
selves-2.815
rique-2.587
cedes-2.145
actly-2.094
productive-2.029
Token accessible
Token ID9857
Feature activation+4.51e+00

Pos logit contributions
 notor+6.982
[/+6.954
 tattoo+6.555
KY+6.439
 popularity+6.332
Neg logit contributions
selves-12.830
rique-12.371
actly-10.713
cedes-10.583
productive-9.950
Token sites
Token ID5043
Feature activation+4.15e+00

Pos logit contributions
 notor+3.026
KY+2.940
[/+2.905
 tattoo+2.718
 popularity+2.656
Neg logit contributions
selves-4.211
rique-3.914
cedes-3.288
actly-3.220
productive-3.121
Token on
Token ID319
Feature activation+4.22e+00

Pos logit contributions
 notor+2.774
KY+2.698
[/+2.660
 tattoo+2.491
etic+2.435
Neg logit contributions
selves-3.883
rique-3.592
cedes-3.018
actly-2.969
productive-2.876
Token its
Token ID663
Feature activation+1.80e+00

Pos logit contributions
 notor+3.421
KY+3.349
[/+3.334
 tattoo+3.142
 popularity+3.094
Neg logit contributions
selves-3.339
rique-3.047
cedes-2.453
actly-2.398
productive-2.314
Token popular
Token ID2968
Feature activation+2.76e+00

Pos logit contributions
 notor+1.264
KY+1.235
[/+1.202
 tattoo+1.130
 awa+1.118
Neg logit contributions
selves-1.646
rique-1.503
cedes-1.259
actly-1.232
productive-1.198
Token undo
Token ID23981
Feature activation-8.28e+00

Pos logit contributions
 notor+1.553
KY+1.520
[/+1.494
 tattoo+1.420
 awa+1.403
Neg logit contributions
selves-1.325
rique-1.192
cedes-0.950
actly-0.921
productive-0.887
Token_
Token ID62
Feature activation-1.09e+00

Pos logit contributions
selves+4.737
rique+3.878
cedes+3.126
actly+2.989
productive+2.776
Neg logit contributions
 notor-9.264
 awa-8.190
 challeng-8.108
KY-7.711
 unlaw-7.509
Tokenmove
Token ID21084
Feature activation-2.64e+00

Pos logit contributions
selves+0.895
rique+0.786
cedes+0.661
actly+0.646
productive+0.640
Neg logit contributions
 notor-0.961
KY-0.849
 awa-0.843
[/-0.812
 tattoo-0.804
Token (
Token ID357
Feature activation-9.62e-01

Pos logit contributions
selves+2.362
rique+2.168
cedes+1.808
actly+1.763
productive+1.716
Neg logit contributions
 notor-1.902
KY-1.860
[/-1.819
 tattoo-1.713
 awa-1.690
Token &
Token ID1222
Feature activation-9.15e+00

Pos logit contributions
selves+1.007
rique+0.944
cedes+0.809
actly+0.792
productive+0.781
Neg logit contributions
 notor-0.539
KY-0.521
[/-0.514
 tattoo-0.475
 awa-0.467
Token mut
Token ID4517
Feature activation-1.72e+01

Pos logit contributions
selves+7.647
rique+6.665
cedes+5.577
actly+5.425
acons+5.160
Neg logit contributions
 notor-6.795
KY-6.781
[/-6.417
etic-6.064
 tattoo-6.010
Token self
Token ID2116
Feature activation-1.12e+01

Pos logit contributions
selves+8.778
rique+6.080
cedes+4.475
actly+3.927
 Sending+3.893
Neg logit contributions
 notor-17.398
KY-16.672
 challeng-15.482
 awa-15.259
acco-15.087
Token )
Token ID1267
Feature activation-8.73e+00

Pos logit contributions
selves+7.800
rique+6.824
cedes+5.495
actly+5.324
productive+4.995
Neg logit contributions
 notor-9.683
KY-9.634
[/-9.448
etic-8.808
 tattoo-8.781
Token {
Token ID1391
Feature activation-4.80e+00

Pos logit contributions
selves+5.826
rique+4.743
cedes+3.898
actly+3.797
 Sending+3.764
Neg logit contributions
 notor-8.899
 awa-7.731
 challeng-7.711
KY-7.508
 therap-7.239
Token let
Token ID1309
Feature activation-7.19e+00

Pos logit contributions
selves+4.339
rique+3.874
actly+3.197
cedes+3.192
acons+3.075
Neg logit contributions
 notor-3.381
KY-3.337
[/-3.132
etic-2.966
anski-2.945
Token prev
Token ID8654
Feature activation-3.20e+00

Pos logit contributions
selves+6.441
rique+5.664
cedes+4.793
actly+4.666
productive+4.456
Neg logit contributions
KY-5.006
 notor-4.939
[/-4.615
cale-4.422
etic-4.394
Token the
Token ID262
Feature activation+6.56e-02

Pos logit contributions
 notor+1.272
KY+1.240
[/+1.216
 tattoo+1.145
 awa+1.128
Neg logit contributions
selves-1.477
rique-1.350
cedes-1.117
actly-1.092
productive-1.059
Token presidency
Token ID12112
Feature activation+9.28e-01

Pos logit contributions
 notor+0.046
KY+0.045
[/+0.044
etic+0.041
 tattoo+0.041
Neg logit contributions
selves-0.059
rique-0.054
cedes-0.045
actly-0.044
productive-0.043
Token.
Token ID13
Feature activation-5.31e+00

Pos logit contributions
 notor+0.644
KY+0.628
[/+0.614
 tattoo+0.575
etic+0.567
Neg logit contributions
selves-0.854
rique-0.785
cedes-0.658
actly-0.643
productive-0.626
Token They
Token ID1119
Feature activation-8.49e+00

Pos logit contributions
selves+3.843
rique+3.278
 Sending+2.581
cedes+2.579
 Dying+2.526
Neg logit contributions
 notor-5.011
KY-4.519
 awa-4.388
[/-4.347
 tattoo-4.277
Token are
Token ID389
Feature activation-5.16e+00

Pos logit contributions
selves+6.467
rique+5.520
cedes+4.389
actly+4.299
acons+4.181
Neg logit contributions
 notor-7.003
KY-6.703
[/-6.511
etic-6.236
 awa-6.117
Token blind
Token ID7770
Feature activation-1.57e+01

Pos logit contributions
selves+4.219
rique+3.689
cedes+3.041
actly+2.951
acons+2.895
Neg logit contributions
KY-3.977
 notor-3.974
[/-3.784
etic-3.618
 awa-3.531
Token to
Token ID284
Feature activation-3.81e+00

Pos logit contributions
selves+11.106
rique+9.446
cedes+7.979
actly+7.744
 fools+7.666
Neg logit contributions
 notor-12.380
KY-11.896
[/-11.083
anski-10.921
 awa-10.740
Token the
Token ID262
Feature activation-3.76e+00

Pos logit contributions
selves+3.055
rique+2.729
cedes+2.224
actly+2.215
productive+2.119
Neg logit contributions
 notor-3.063
KY-2.980
[/-2.902
 awa-2.751
etic-2.734
Token fact
Token ID1109
Feature activation-4.59e+00

Pos logit contributions
selves+3.236
rique+2.910
cedes+2.412
actly+2.407
productive+2.305
Neg logit contributions
 notor-2.772
KY-2.738
[/-2.645
 awa-2.491
etic-2.476
Token their
Token ID511
Feature activation-4.20e+00

Pos logit contributions
selves+3.625
rique+3.269
cedes+2.643
actly+2.581
productive+2.498
Neg logit contributions
 notor-3.761
KY-3.670
[/-3.585
 awa-3.389
 tattoo-3.381
Token uncon
Token ID21254
Feature activation-6.25e+00

Pos logit contributions
selves+3.665
rique+3.228
cedes+2.659
actly+2.595
productive+2.515
Neg logit contributions
 notor-3.149
KY-3.066
[/-2.980
etic-2.809
 awa-2.795
Token n
Token ID299
Feature activation+2.71e+00

Pos logit contributions
selves+10.496
rique+9.664
cedes+8.036
actly+7.805
productive+7.592
Neg logit contributions
KY-7.790
 notor-7.459
[/-7.316
ety-6.820
etic-6.799
Tokenou
Token ID280
Feature activation-4.81e+00

Pos logit contributions
 notor+1.789
KY+1.753
[/+1.716
 tattoo+1.598
etic+1.583
Neg logit contributions
selves-2.570
rique-2.359
cedes-1.993
actly-1.953
productive-1.902
Tokenvel
Token ID626
Feature activation-6.60e+00

Pos logit contributions
selves+3.700
rique+3.370
cedes+2.705
actly+2.689
productive+2.542
Neg logit contributions
 notor-4.057
KY-3.950
[/-3.807
 tattoo-3.622
 awa-3.572
Tokenle
Token ID293
Feature activation-2.53e+00

Pos logit contributions
selves+4.120
rique+3.830
cedes+2.731
actly+2.704
productive+2.590
Neg logit contributions
KY-6.217
 notor-6.141
[/-5.983
etic-5.737
 tattoo-5.727
Token organisation
Token ID12684
Feature activation-8.41e-01

Pos logit contributions
selves+2.136
rique+1.972
cedes+1.613
actly+1.562
productive+1.531
Neg logit contributions
 notor-1.896
KY-1.883
[/-1.820
 tattoo-1.707
etic-1.705
Token ind
Token ID773
Feature activation-1.55e+01

Pos logit contributions
selves+0.675
rique+0.617
cedes+0.499
actly+0.483
productive+0.469
Neg logit contributions
 notor-0.676
KY-0.667
[/-0.649
 tattoo-0.613
etic-0.608
Tokené
Token ID2634
Feature activation-9.77e+00

Pos logit contributions
rique+11.865
selves+11.724
cedes+9.285
actly+9.255
roit+8.956
Neg logit contributions
 notor-10.813
KY-10.499
 awa-9.852
[/-9.557
 tattoo-9.481
Tokenpend
Token ID37038
Feature activation-9.10e+00

Pos logit contributions
selves+6.937
rique+6.672
cedes+5.290
actly+5.014
acons+4.785
Neg logit contributions
 notor-8.023
KY-8.016
[/-7.639
 awa-7.461
 tattoo-7.429
Tokenante
Token ID12427
Feature activation-6.27e+00

Pos logit contributions
rique+7.763
selves+7.571
actly+6.003
cedes+5.996
acons+5.599
Neg logit contributions
 notor-6.617
KY-6.208
[/-5.815
 awa-5.741
 tattoo-5.655
Token,
Token ID11
Feature activation-4.27e+00

Pos logit contributions
selves+5.180
rique+4.918
cedes+3.986
actly+3.816
productive+3.722
Neg logit contributions
KY-4.576
 notor-4.520
[/-4.336
etic-4.139
 tattoo-4.094
Token le
Token ID443
Feature activation-9.45e+00

Pos logit contributions
selves+3.427
rique+3.181
cedes+2.556
actly+2.475
productive+2.389
Neg logit contributions
 notor-3.290
KY-3.265
[/-3.155
etic-3.019
cale-2.981
Token presidential
Token ID4787
Feature activation-1.59e+00

Pos logit contributions
selves+3.044
rique+2.762
cedes+2.229
actly+2.226
productive+2.130
Neg logit contributions
 notor-2.792
KY-2.693
[/-2.638
 awa-2.481
etic-2.479
Token nominee
Token ID10429
Feature activation-6.46e-02

Pos logit contributions
selves+1.121
rique+1.010
cedes+0.780
actly+0.759
productive+0.723
Neg logit contributions
 notor-1.434
KY-1.410
[/-1.373
etic-1.297
 awa-1.295
Token Ted
Token ID11396
Feature activation-8.39e+00

Pos logit contributions
selves+0.047
rique+0.042
cedes+0.033
actly+0.032
productive+0.031
Neg logit contributions
 notor-0.057
KY-0.056
[/-0.054
 awa-0.052
 tattoo-0.052
Token Cruz
Token ID8742
Feature activation-2.69e+00

Pos logit contributions
selves+2.043
rique+1.486
actly+0.319
 Cruz+0.312
cedes+0.280
Neg logit contributions
 notor-11.480
 awa-10.943
KY-10.684
xual-10.196
 challeng-10.183
Token.
Token ID13
Feature activation-9.27e+00

Pos logit contributions
selves+2.499
rique+2.281
cedes+1.909
actly+1.892
productive+1.826
Neg logit contributions
 notor-1.843
KY-1.784
[/-1.713
 awa-1.611
 tattoo-1.605
Token Imagine
Token ID18450
Feature activation-1.55e+01

Pos logit contributions
selves+6.647
rique+5.633
 Sending+4.642
 Suppose+4.559
 Dying+4.557
Neg logit contributions
 notor-8.530
KY-7.570
 awa-7.393
aucas-7.149
[/-7.140
Token Lindsey
Token ID30334
Feature activation-4.82e+00

Pos logit contributions
selves+10.884
rique+9.692
 Cheney+7.799
actly+7.727
 Gingrich+7.654
Neg logit contributions
 notor-11.291
KY-11.246
[/-10.633
anski-10.359
etic-10.358
Token Graham
Token ID11520
Feature activation-1.11e-02

Pos logit contributions
selves+2.541
rique+2.169
actly+1.515
cedes+1.469
productive+1.458
Neg logit contributions
 notor-5.174
KY-5.004
[/-4.877
 awa-4.774
 tattoo-4.712
Token mixing
Token ID17090
Feature activation-5.18e+00

Pos logit contributions
selves+0.010
rique+0.009
cedes+0.007
actly+0.007
productive+0.007
Neg logit contributions
 notor-0.008
KY-0.008
[/-0.008
 tattoo-0.007
etic-0.007
Token mint
Token ID21043
Feature activation-2.80e+00

Pos logit contributions
selves+4.489
rique+4.125
cedes+3.468
actly+3.348
productive+3.241
Neg logit contributions
 notor-3.651
KY-3.593
[/-3.503
cale-3.228
etic-3.203
Token j
Token ID474
Feature activation+5.92e+00

Pos logit contributions
selves+2.656
rique+2.495
cedes+2.097
actly+2.044
productive+1.958
Neg logit contributions
 notor-1.824
KY-1.740
[/-1.700
 tattoo-1.583
cale-1.574
Token isn
Token ID2125
Feature activation-2.10e+00

Pos logit contributions
selves+1.567
rique+1.421
cedes+1.169
actly+1.136
productive+1.102
Neg logit contributions
 notor-1.473
KY-1.435
[/-1.398
 tattoo-1.320
 awa-1.316
Token
Token ID447
Feature activation-2.05e+00

Pos logit contributions
selves+1.937
rique+1.736
cedes+1.463
actly+1.452
productive+1.415
Neg logit contributions
 notor-1.591
 awa-1.389
KY-1.388
[/-1.313
 tattoo-1.285
Token
Token ID247
Feature activation+4.07e-01

Pos logit contributions
selves+0.962
rique+0.817
cedes+0.523
actly+0.505
productive+0.474
Neg logit contributions
 notor-2.381
KY-2.302
[/-2.252
 awa-2.202
etic-2.180
Tokent
Token ID83
Feature activation-7.98e+00

Pos logit contributions
 notor+0.308
KY+0.288
[/+0.276
 awa+0.268
 tattoo+0.263
Neg logit contributions
selves-0.365
rique-0.335
cedes-0.278
actly-0.273
productive-0.265
Token the
Token ID262
Feature activation-8.90e+00

Pos logit contributions
selves+6.344
rique+5.712
actly+4.652
cedes+4.544
productive+4.405
Neg logit contributions
 notor-6.263
KY-6.160
[/-5.856
 awa-5.669
anski-5.585
Token end
Token ID886
Feature activation-1.55e+01

Pos logit contributions
selves+7.359
rique+6.619
actly+5.404
cedes+5.270
 Gingrich+5.193
Neg logit contributions
 notor-6.467
KY-6.203
[/-6.005
 awa-5.846
anski-5.653
Token of
Token ID286
Feature activation-1.05e+01

Pos logit contributions
selves+10.375
rique+9.429
actly+7.369
cedes+7.304
productive+7.073
Neg logit contributions
 notor-12.695
KY-12.148
[/-11.344
 awa-11.137
anski-11.027
Token their
Token ID511
Feature activation-8.61e+00

Pos logit contributions
selves+7.839
rique+7.014
cedes+5.620
actly+5.510
productive+5.430
Neg logit contributions
 notor-8.263
KY-8.117
[/-7.708
 awa-7.513
etic-7.361
Token quest
Token ID1235
Feature activation-5.70e+00

Pos logit contributions
selves+7.411
rique+6.592
cedes+5.352
productive+5.270
actly+5.198
Neg logit contributions
 notor-6.189
KY-5.922
[/-5.678
 awa-5.452
cale-5.340
Token for
Token ID329
Feature activation-3.97e+00

Pos logit contributions
selves+4.576
rique+4.063
cedes+3.322
actly+3.216
productive+3.149
Neg logit contributions
 notor-4.628
KY-4.456
[/-4.299
 awa-4.084
 tattoo-4.072
Token gains
Token ID8810
Feature activation-3.17e+00

Pos logit contributions
selves+3.489
rique+3.169
cedes+2.638
actly+2.583
acons+2.489
Neg logit contributions
 notor-2.838
KY-2.791
[/-2.743
 awa-2.553
etic-2.529
Token also
Token ID635
Feature activation-1.64e+00

Pos logit contributions
 notor+2.569
KY+2.515
[/+2.491
 tattoo+2.330
 awa+2.286
Neg logit contributions
selves-3.052
rique-2.807
cedes-2.345
actly-2.283
productive-2.201
Token clearly
Token ID4084
Feature activation-1.16e+00

Pos logit contributions
selves+1.381
rique+1.248
cedes+1.017
actly+0.991
productive+0.977
Neg logit contributions
 notor-1.261
KY-1.228
[/-1.188
etic-1.126
 tattoo-1.121
Token feel
Token ID1254
Feature activation-2.65e+00

Pos logit contributions
selves+0.958
rique+0.860
cedes+0.698
actly+0.684
productive+0.670
Neg logit contributions
 notor-0.913
KY-0.889
[/-0.865
etic-0.813
 tattoo-0.812
Token swapping
Token ID38869
Feature activation+4.44e-01

Pos logit contributions
selves+2.149
rique+1.935
cedes+1.586
actly+1.512
productive+1.490
Neg logit contributions
 notor-2.107
KY-2.070
[/-1.996
etic-1.894
 tattoo-1.887
Token one
Token ID530
Feature activation-4.36e+00

Pos logit contributions
 notor+0.315
KY+0.307
[/+0.300
 tattoo+0.281
etic+0.277
Neg logit contributions
selves-0.402
rique-0.368
cedes-0.309
actly-0.301
productive-0.293
Token fossil
Token ID12584
Feature activation-1.54e+01

Pos logit contributions
selves+3.655
rique+3.213
cedes+2.742
productive+2.598
actly+2.597
Neg logit contributions
 notor-3.301
KY-3.249
[/-3.091
 awa-2.958
anski-2.926
Token fuel
Token ID5252
Feature activation-9.58e+00

Pos logit contributions
selves+8.691
rique+7.550
cedes+6.272
productive+5.800
actly+5.572
Neg logit contributions
 notor-13.348
KY-13.125
 awa-12.325
anski-12.218
cale-12.209
Token for
Token ID329
Feature activation-4.58e+00

Pos logit contributions
selves+6.056
rique+5.160
cedes+4.287
actly+3.890
productive+3.871
Neg logit contributions
 notor-8.500
KY-8.321
[/-7.778
 awa-7.758
anski-7.726
Token another
Token ID1194
Feature activation-4.94e+00

Pos logit contributions
selves+3.755
rique+3.383
cedes+2.753
actly+2.716
productive+2.611
Neg logit contributions
 notor-3.581
KY-3.511
[/-3.417
 tattoo-3.223
cale-3.206
Token is
Token ID318
Feature activation-2.57e+00

Pos logit contributions
selves+3.983
rique+3.511
cedes+2.904
actly+2.773
productive+2.757
Neg logit contributions
 notor-3.899
KY-3.836
[/-3.645
 awa-3.481
cale-3.458
Token bad
Token ID2089
Feature activation-6.81e+00

Pos logit contributions
selves+2.173
rique+1.961
cedes+1.610
actly+1.589
productive+1.569
Neg logit contributions
 notor-1.969
KY-1.919
[/-1.865
 tattoo-1.760
 awa-1.753
Token and
Token ID290
Feature activation-6.20e+00

Pos logit contributions
selves+0.613
rique+0.554
cedes+0.456
actly+0.443
productive+0.436
Neg logit contributions
 notor-0.535
KY-0.525
[/-0.507
 awa-0.476
etic-0.473
Token meetings
Token ID8292
Feature activation-5.12e+00

Pos logit contributions
selves+5.398
rique+4.873
cedes+3.999
actly+3.961
productive+3.954
Neg logit contributions
KY-4.197
 notor-4.185
[/-4.015
cale-3.777
anski-3.746
Token to
Token ID284
Feature activation-4.34e+00

Pos logit contributions
selves+4.114
rique+3.621
cedes+2.926
productive+2.851
actly+2.822
Neg logit contributions
 notor-4.075
KY-4.031
[/-3.805
 awa-3.656
anski-3.589
Token hear
Token ID3285
Feature activation-4.04e+00

Pos logit contributions
selves+3.669
rique+3.269
cedes+2.686
actly+2.623
productive+2.583
Neg logit contributions
 notor-3.256
KY-3.135
[/-3.072
 awa-2.881
cale-2.874
Token "
Token ID366
Feature activation+2.16e+00

Pos logit contributions
selves+3.481
rique+3.050
cedes+2.529
actly+2.522
acons+2.452
Neg logit contributions
 notor-3.040
KY-2.988
[/-2.871
etic-2.675
anski-2.670
Tokenexper
Token ID23100
Feature activation-1.53e+01

Pos logit contributions
 notor+1.565
KY+1.529
[/+1.496
 tattoo+1.405
etic+1.387
Neg logit contributions
selves-1.917
rique-1.760
cedes-1.464
actly-1.430
productive-1.386
Tokents
Token ID912
Feature activation+1.78e+00

Pos logit contributions
selves+8.133
rique+7.256
cedes+5.638
productive+5.299
actly+5.285
Neg logit contributions
 notor-14.881
KY-13.923
[/-13.565
 awa-13.487
cale-13.155
Token"
Token ID1
Feature activation-3.47e+00

Pos logit contributions
 notor+1.041
KY+1.011
[/+0.990
 tattoo+0.913
 awa+0.894
Neg logit contributions
selves-1.820
rique-1.695
cedes-1.455
actly-1.422
productive-1.388
Token speak
Token ID2740
Feature activation+8.19e-01

Pos logit contributions
selves+3.046
rique+2.690
actly+2.207
cedes+2.201
productive+2.151
Neg logit contributions
 notor-2.579
KY-2.485
[/-2.403
 awa-2.256
etic-2.251
Token about
Token ID546
Feature activation-2.24e+00

Pos logit contributions
 notor+0.587
KY+0.572
[/+0.559
 tattoo+0.524
 awa+0.517
Neg logit contributions
selves-0.736
rique-0.674
cedes-0.562
actly-0.549
productive-0.534
Token the
Token ID262
Feature activation-1.65e+00

Pos logit contributions
selves+1.977
rique+1.786
cedes+1.480
actly+1.444
productive+1.409
Neg logit contributions
 notor-1.643
KY-1.626
[/-1.561
 awa-1.452
etic-1.439
Token galaxy
Token ID16161
Feature activation-9.37e-01

Pos logit contributions
 notor+1.829
KY+1.793
[/+1.768
 tattoo+1.659
etic+1.635
Neg logit contributions
selves-2.057
rique-1.885
cedes-1.554
actly-1.512
productive-1.467
Token (
Token ID357
Feature activation+1.14e+00

Pos logit contributions
selves+0.824
rique+0.748
cedes+0.622
actly+0.610
productive+0.592
Neg logit contributions
 notor-0.695
KY-0.674
[/-0.649
 tattoo-0.616
 awa-0.609
Token100
Token ID3064
Feature activation-5.02e+00

Pos logit contributions
 notor+0.840
KY+0.810
[/+0.794
 tattoo+0.751
 awa+0.742
Neg logit contributions
selves-1.004
rique-0.916
cedes-0.762
actly-0.748
productive-0.727
Token,
Token ID11
Feature activation+5.15e-01

Pos logit contributions
selves+4.146
rique+3.736
cedes+3.055
actly+3.013
productive+2.914
Neg logit contributions
 notor-3.923
KY-3.720
[/-3.634
 awa-3.456
 tattoo-3.435
Token000
Token ID830
Feature activation-3.64e+00

Pos logit contributions
 notor+0.338
KY+0.330
[/+0.322
 tattoo+0.301
etic+0.295
Neg logit contributions
selves-0.492
rique-0.454
cedes-0.384
actly-0.375
productive-0.366
Token Light
Token ID4401
Feature activation-1.51e+01

Pos logit contributions
selves+3.102
rique+2.805
cedes+2.325
actly+2.265
productive+2.196
Neg logit contributions
 notor-2.784
KY-2.673
[/-2.607
 tattoo-2.447
 awa-2.439
Token Years
Token ID13212
Feature activation-3.79e+00

Pos logit contributions
selves+9.512
rique+7.684
cedes+6.304
 Dying+6.206
productive+5.877
Neg logit contributions
 notor-13.474
KY-12.853
[/-12.049
 tattoo-11.882
anski-11.871
Token)
Token ID8
Feature activation-5.69e-01

Pos logit contributions
selves+2.612
rique+2.318
cedes+1.802
actly+1.737
productive+1.685
Neg logit contributions
 notor-3.487
KY-3.378
[/-3.304
 tattoo-3.148
 awa-3.125
Token
Token ID198
Feature activation+3.38e-01

Pos logit contributions
selves+0.531
rique+0.489
cedes+0.411
actly+0.402
productive+0.391
Neg logit contributions
 notor-0.387
KY-0.377
[/-0.369
 tattoo-0.345
etic-0.339
Token
Token ID198
Feature activation-7.03e-01

Pos logit contributions
 notor+0.264
KY+0.239
 awa+0.233
[/+0.229
 tattoo+0.221
Neg logit contributions
selves-0.301
rique-0.269
cedes-0.227
actly-0.222
productive-0.216
Tokento
Token ID1462
Feature activation-3.98e-01

Pos logit contributions
selves+0.572
rique+0.511
cedes+0.424
actly+0.415
productive+0.404
Neg logit contributions
 notor-0.571
KY-0.544
[/-0.529
 tattoo-0.515
 awa-0.512
Token
Token ID111
Feature activation-9.13e+00

Pos logit contributions
selves+5.825
rique+5.549
+3.969
cedes+3.671
+3.526
Neg logit contributions
 notor-13.562
KY-13.419
[/-12.866
 awa-12.522
 popularity-12.405
Tokenр
Token ID21169
Feature activation-9.62e+00

Pos logit contributions
selves+5.353
rique+4.697
actly+3.513
cedes+3.384
productive+3.216
Neg logit contributions
 notor-8.810
KY-8.463
[/-8.199
 awa-8.065
cale-7.985
Tokenа
Token ID16142
Feature activation-5.23e+00

Pos logit contributions
selves+4.860
rique+4.200
cedes+2.908
actly+2.902
productive+2.662
Neg logit contributions
 notor-10.069
KY-9.657
[/-9.390
 awa-9.187
 tattoo-9.119
Token
Token ID140
Feature activation-1.17e+01

Pos logit contributions
selves+3.816
rique+3.455
cedes+2.696
actly+2.635
productive+2.533
Neg logit contributions
 notor-4.571
KY-4.387
[/-4.230
 awa-4.131
 tattoo-4.083
Token
Token ID114
Feature activation-7.38e+00

Pos logit contributions
selves+4.961
rique+4.678
+3.020
cedes+2.825
productive+2.621
Neg logit contributions
 notor-12.327
KY-12.203
[/-11.785
 awa-11.514
 popularity-11.347
Tokenд
Token ID43666
Feature activation-1.51e+01

Pos logit contributions
selves+4.292
rique+3.783
actly+2.673
cedes+2.667
productive+2.539
Neg logit contributions
 notor-7.358
KY-7.132
[/-6.887
 awa-6.806
 tattoo-6.693
Tokenе
Token ID16843
Feature activation-1.12e+01

Pos logit contributions
selves+8.300
rique+7.498
у+5.631
actly+5.104
cedes+5.061
Neg logit contributions
 notor-13.914
KY-12.953
 awa-12.494
[/-12.416
 tattoo-12.196
Tokenн
Token ID22177
Feature activation-1.29e+01

Pos logit contributions
selves+6.785
rique+6.179
cedes+4.490
actly+4.220
productive+4.191
Neg logit contributions
 notor-10.489
KY-9.896
[/-9.428
 awa-9.409
cale-9.213
Tokenы
Token ID45035
Feature activation-5.71e+00

Pos logit contributions
selves+7.552
rique+6.887
у+4.865
cedes+4.812
actly+4.748
Neg logit contributions
 notor-11.812
KY-11.241
[/-10.675
 awa-10.614
cale-10.389
Token
Token ID220
Feature activation-8.70e+00

Pos logit contributions
selves+4.122
rique+3.666
cedes+2.902
actly+2.848
productive+2.727
Neg logit contributions
 notor-5.018
KY-4.796
[/-4.589
cale-4.487
 awa-4.473
Tokenс
Token ID21727
Feature activation-7.28e+00

Pos logit contributions
selves+5.867
rique+5.126
cedes+4.006
actly+3.996
у+3.771
Neg logit contributions
 notor-7.960
KY-7.537
[/-7.283
 tattoo-7.132
cale-7.067
Token:
Token ID25
Feature activation+1.14e+00

Pos logit contributions
 notor+2.096
KY+2.023
[/+2.016
 tattoo+1.853
cale+1.820
Neg logit contributions
selves-3.702
rique-3.470
cedes-2.973
actly-2.907
productive-2.815
Token the
Token ID262
Feature activation-2.19e+00

Pos logit contributions
 notor+0.849
KY+0.827
[/+0.813
 tattoo+0.766
etic+0.754
Neg logit contributions
selves-0.981
rique-0.902
cedes-0.746
actly-0.727
productive-0.701
Token costs
Token ID3484
Feature activation-4.82e+00

Pos logit contributions
selves+1.982
rique+1.810
cedes+1.498
actly+1.471
productive+1.447
Neg logit contributions
 notor-1.551
KY-1.511
[/-1.472
 awa-1.372
etic-1.368
Token of
Token ID286
Feature activation-6.07e+00

Pos logit contributions
selves+3.933
rique+3.480
cedes+2.841
actly+2.786
productive+2.762
Neg logit contributions
 notor-3.773
KY-3.699
[/-3.576
 awa-3.393
etic-3.315
Token investments
Token ID11115
Feature activation-5.57e+00

Pos logit contributions
selves+5.259
rique+4.741
cedes+3.916
actly+3.858
productive+3.843
Neg logit contributions
 notor-4.304
KY-4.236
[/-4.116
etic-3.857
 awa-3.840
Token un
Token ID555
Feature activation-1.51e+01

Pos logit contributions
selves+4.725
rique+4.182
cedes+3.496
actly+3.420
productive+3.418
Neg logit contributions
 notor-4.125
KY-4.002
[/-3.796
 awa-3.633
etic-3.615
Tokenmade
Token ID9727
Feature activation-6.50e+00

Pos logit contributions
selves+12.073
rique+10.242
productive+9.994
actly+9.131
cedes+8.947
Neg logit contributions
 notor-11.121
KY-9.895
 awa-9.641
 tattoo-9.579
[/-9.223
Token,
Token ID11
Feature activation-2.31e+00

Pos logit contributions
selves+5.628
rique+4.912
cedes+4.163
productive+4.046
actly+4.016
Neg logit contributions
 notor-4.735
KY-4.627
[/-4.354
etic-4.149
 awa-4.148
Token of
Token ID286
Feature activation-5.91e+00

Pos logit contributions
selves+2.040
rique+1.848
cedes+1.541
actly+1.499
productive+1.481
Neg logit contributions
 notor-1.688
KY-1.653
[/-1.612
 awa-1.500
 tattoo-1.498
Token businesses
Token ID5692
Feature activation-2.87e+00

Pos logit contributions
selves+5.210
rique+4.657
cedes+3.837
productive+3.804
actly+3.791
Neg logit contributions
 notor-4.183
KY-4.107
[/-3.987
 awa-3.740
etic-3.697
Token not
Token ID407
Feature activation-4.52e+00

Pos logit contributions
selves+2.426
rique+2.153
cedes+1.772
actly+1.708
productive+1.705
Neg logit contributions
 notor-2.211
KY-2.178
[/-2.092
etic-1.972
 awa-1.957
Token2
Token ID17
Feature activation-8.76e-02

Pos logit contributions
selves+2.768
rique+2.401
cedes+1.927
actly+1.916
productive+1.837
Neg logit contributions
 notor-3.788
 awa-3.421
KY-3.261
 tattoo-3.169
[/-3.150
Token.
Token ID13
Feature activation+7.99e-01

Pos logit contributions
selves+0.086
rique+0.079
cedes+0.067
actly+0.066
productive+0.064
Neg logit contributions
 notor-0.059
KY-0.053
 awa-0.051
[/-0.051
 tattoo-0.050
Tokenim
Token ID320
Feature activation-5.12e+00

Pos logit contributions
 notor+0.616
KY+0.600
[/+0.587
 tattoo+0.556
 awa+0.550
Neg logit contributions
selves-0.675
rique-0.614
cedes-0.504
actly-0.494
productive-0.479
Tokenread
Token ID961
Feature activation-3.51e+00

Pos logit contributions
selves+4.373
rique+3.910
cedes+3.212
actly+3.201
productive+3.081
Neg logit contributions
 notor-3.903
KY-3.765
[/-3.643
 awa-3.586
 tattoo-3.556
Token(
Token ID7
Feature activation-1.43e+00

Pos logit contributions
selves+3.028
rique+2.760
actly+2.261
cedes+2.232
productive+2.153
Neg logit contributions
 notor-2.638
KY-2.521
[/-2.452
 awa-2.358
 tattoo-2.349
Tokenimage
Token ID9060
Feature activation-1.49e+01

Pos logit contributions
selves+1.229
rique+1.114
cedes+0.917
actly+0.901
productive+0.872
Neg logit contributions
 notor-1.083
KY-1.041
[/-1.018
 awa-0.972
 tattoo-0.970
TokenPath
Token ID15235
Feature activation-5.54e+00

Pos logit contributions
selves+9.269
rique+7.474
Args+5.712
cedes+5.644
actly+5.607
Neg logit contributions
 notor-13.582
 awa-12.565
KY-12.284
[/-11.968
 tattoo-11.921
Token)
Token ID8
Feature activation-4.47e+00

Pos logit contributions
selves+4.335
rique+3.865
cedes+3.092
actly+3.058
productive+2.932
Neg logit contributions
 notor-4.627
KY-4.429
[/-4.310
 awa-4.144
 tattoo-4.101
Token image
Token ID2939
Feature activation-1.08e+01

Pos logit contributions
selves+3.753
rique+3.393
cedes+2.785
actly+2.751
acons+2.629
Neg logit contributions
 notor-3.368
KY-3.310
[/-3.120
 awa-2.994
etic-2.984
Token =
Token ID796
Feature activation-7.51e-01

Pos logit contributions
selves+9.118
rique+8.223
cedes+6.769
actly+6.764
Args+6.595
Neg logit contributions
 notor-7.665
 awa-6.989
KY-6.790
 tattoo-6.559
[/-6.558
Token im
Token ID545
Feature activation-7.99e+00

Pos logit contributions
selves+0.667
rique+0.606
cedes+0.503
actly+0.495
productive+0.476
Neg logit contributions
 notor-0.544
KY-0.533
[/-0.514
etic-0.483
 tattoo-0.483
Token2
Token ID17
Feature activation-3.61e+00

Pos logit contributions
selves+0.382
rique+0.343
cedes+0.284
actly+0.283
productive+0.274
Neg logit contributions
 notor-0.343
KY-0.322
[/-0.314
 awa-0.303
 tattoo-0.302
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
selves+3.416
rique+3.125
cedes+2.638
actly+2.595
productive+2.519
Neg logit contributions
 notor-2.407
KY-2.316
[/-2.251
 tattoo-2.116
 awa-2.104
Token000
Token ID830
Feature activation-7.13e+00

Pos logit contributions
selves+1.663
rique+1.517
cedes+1.254
actly+1.244
productive+1.200
Neg logit contributions
 notor-1.385
KY-1.317
[/-1.285
 awa-1.216
 tattoo-1.213
Token,
Token ID11
Feature activation-1.33e-01

Pos logit contributions
selves+5.605
rique+5.028
cedes+4.044
actly+3.975
productive+3.843
Neg logit contributions
 notor-5.812
KY-5.585
[/-5.423
 awa-5.183
 tattoo-5.168
Token000
Token ID830
Feature activation-4.51e+00

Pos logit contributions
selves+0.118
rique+0.107
cedes+0.090
actly+0.088
productive+0.085
Neg logit contributions
 notor-0.102
KY-0.093
[/-0.090
 awa-0.089
 tattoo-0.085
Token Light
Token ID4401
Feature activation-1.48e+01

Pos logit contributions
selves+3.654
rique+3.297
cedes+2.677
actly+2.620
productive+2.534
Neg logit contributions
 notor-3.632
KY-3.525
[/-3.441
 tattoo-3.265
 awa-3.243
Token Years
Token ID13212
Feature activation-3.17e+00

Pos logit contributions
selves+8.420
rique+6.139
 Dying+5.066
cedes+4.731
 Sending+4.591
Neg logit contributions
 notor-15.687
 challeng-14.221
KY-14.018
 awa-13.857
 therap-13.409
Token)
Token ID8
Feature activation-4.66e-01

Pos logit contributions
selves+2.250
rique+2.015
cedes+1.583
actly+1.533
productive+1.472
Neg logit contributions
 notor-2.862
KY-2.806
[/-2.760
 tattoo-2.627
etic-2.598
Token
Token ID198
Feature activation+4.62e-01

Pos logit contributions
selves+0.447
rique+0.414
cedes+0.350
actly+0.341
productive+0.333
Neg logit contributions
 notor-0.304
KY-0.297
[/-0.295
 tattoo-0.271
etic-0.267
Token
Token ID198
Feature activation-3.41e-01

Pos logit contributions
 notor+0.358
KY+0.326
 awa+0.316
[/+0.312
 tattoo+0.301
Neg logit contributions
selves-0.412
rique-0.370
cedes-0.311
actly-0.305
productive-0.297
Tokento
Token ID1462
Feature activation+1.01e+00

Pos logit contributions
selves+0.275
rique+0.246
cedes+0.203
actly+0.199
productive+0.193
Neg logit contributions
 notor-0.279
KY-0.267
[/-0.260
 tattoo-0.252
 awa-0.251
Token
Token ID141
Feature activation-6.47e+00

Pos logit contributions
selves+4.490
rique+4.034
cedes+3.188
actly+3.107
productive+3.015
Neg logit contributions
 notor-5.186
KY-4.909
[/-4.770
 awa-4.612
etic-4.574
Token
Token ID227
Feature activation-3.26e+00

Pos logit contributions
selves+3.854
rique+3.542
cedes+2.505
productive+2.409
+2.399
Neg logit contributions
 notor-6.390
KY-6.204
[/-5.959
 tattoo-5.801
 awa-5.782
Token
Token ID12466
Feature activation-8.91e+00

Pos logit contributions
selves+2.479
rique+2.251
cedes+1.788
actly+1.730
productive+1.674
Neg logit contributions
 notor-2.780
KY-2.667
[/-2.609
 tattoo-2.478
cale-2.470
Token
Token ID121
Feature activation-9.31e+00

Pos logit contributions
selves+5.313
rique+4.766
cedes+3.590
+3.480
actly+3.303
Neg logit contributions
 notor-8.441
KY-8.242
[/-8.075
 awa-7.725
 tattoo-7.693
Tokenе
Token ID16843
Feature activation-9.74e+00

Pos logit contributions
selves+4.904
rique+4.363
cedes+3.042
actly+3.012
+2.754
Neg logit contributions
 notor-9.603
KY-9.148
[/-8.882
 awa-8.636
 tattoo-8.628
Tokenд
Token ID43666
Feature activation-1.48e+01

Pos logit contributions
selves+6.904
rique+6.082
cedes+4.865
actly+4.754
productive+4.514
Neg logit contributions
 notor-8.227
KY-7.903
[/-7.570
cale-7.330
 awa-7.270
Tokenе
Token ID16843
Feature activation-1.14e+01

Pos logit contributions
selves+9.199
rique+8.227
у+6.244
actly+6.097
cedes+6.083
Neg logit contributions
 notor-12.791
KY-11.803
[/-11.486
 awa-11.389
 popularity-11.052
Tokenл
Token ID30143
Feature activation-8.36e+00

Pos logit contributions
selves+7.562
rique+6.756
cedes+5.139
actly+4.860
productive+4.756
Neg logit contributions
 notor-10.093
KY-9.448
[/-9.108
 awa-8.805
cale-8.753
Tokenь
Token ID45367
Feature activation-9.85e+00

Pos logit contributions
selves+5.033
rique+4.464
cedes+3.151
actly+3.124
productive+3.031
Neg logit contributions
 notor-8.131
KY-7.762
[/-7.563
 awa-7.294
 tattoo-7.260
Token
Token ID12466
Feature activation-9.81e+00

Pos logit contributions
selves+6.688
rique+5.873
cedes+4.489
actly+4.398
productive+4.367
Neg logit contributions
 notor-8.718
KY-8.231
[/-8.012
 awa-7.725
cale-7.722
Token
Token ID116
Feature activation-2.90e+00

Pos logit contributions
selves+5.833
rique+5.225
cedes+3.948
+3.874
+3.673
Neg logit contributions
 notor-9.227
KY-8.952
[/-8.759
 awa-8.397
 tattoo-8.385
Tokenе
Token ID16843
Feature activation-8.01e+00

Pos logit contributions
selves+3.596
rique+3.238
cedes+2.302
actly+2.300
productive+2.166
Neg logit contributions
 notor-6.173
KY-5.981
[/-5.818
 awa-5.654
 tattoo-5.624
Tokenк
Token ID31583
Feature activation-8.48e+00

Pos logit contributions
selves+5.723
rique+5.178
cedes+4.088
actly+3.943
productive+3.817
Neg logit contributions
 notor-6.897
KY-6.591
[/-6.285
 awa-6.122
 tattoo-6.105
Tokenс
Token ID21727
Feature activation-7.15e+00

Pos logit contributions
selves+5.560
rique+5.022
cedes+3.804
actly+3.788
productive+3.592
Neg logit contributions
 notor-7.643
KY-7.392
[/-7.033
 awa-6.887
cale-6.834
Tokenа
Token ID16142
Feature activation-6.09e+00

Pos logit contributions
selves+4.539
rique+4.039
cedes+3.096
actly+2.879
productive+2.829
Neg logit contributions
 notor-6.825
KY-6.472
[/-6.291
 awa-6.154
cale-6.108
Tokenн
Token ID22177
Feature activation-1.21e+01

Pos logit contributions
selves+4.430
rique+4.020
cedes+3.158
actly+3.032
productive+2.938
Neg logit contributions
 notor-5.295
KY-5.068
[/-4.822
 awa-4.737
cale-4.693
Tokenд
Token ID43666
Feature activation-1.43e+01

Pos logit contributions
selves+7.274
rique+6.828
cedes+4.777
productive+4.739
actly+4.706
Neg logit contributions
 notor-10.833
KY-10.569
 awa-9.771
[/-9.736
cale-9.684
Tokenр
Token ID21169
Feature activation-9.75e+00

Pos logit contributions
selves+8.847
rique+8.120
у+6.041
actly+5.857
cedes+5.835
Neg logit contributions
 notor-12.268
KY-11.440
 awa-11.113
[/-10.902
cale-10.897
Token
Token ID12466
Feature activation-8.18e+00

Pos logit contributions
selves+5.270
rique+4.651
cedes+3.294
actly+3.186
у+3.053
Neg logit contributions
 notor-9.870
KY-9.433
[/-9.051
 awa-8.945
cale-8.879
Token
Token ID94
Feature activation-6.80e+00

Pos logit contributions
selves+4.961
rique+4.469
cedes+3.335
+3.306
actly+3.129
Neg logit contributions
 notor-7.755
KY-7.554
[/-7.398
 tattoo-7.107
 awa-7.089
Tokenа
Token ID16142
Feature activation-6.41e+00

Pos logit contributions
selves+4.583
rique+4.091
cedes+3.166
actly+3.126
productive+2.954
Neg logit contributions
 notor-6.191
KY-5.919
[/-5.757
 awa-5.543
 tattoo-5.542
Token
Token ID140
Feature activation-1.19e+01

Pos logit contributions
selves+4.817
rique+4.432
cedes+3.530
actly+3.435
productive+3.349
Neg logit contributions
 notor-5.357
KY-5.127
[/-4.907
 awa-4.823
 tattoo-4.756
Token
Token ID112
Feature activation-3.58e+00

Pos logit contributions
selves+3.716
rique+3.208
cedes+2.108
+2.016
actly+1.923
Neg logit contributions
 notor-8.590
KY-8.362
[/-8.141
 tattoo-7.936
 awa-7.903
Tokenр
Token ID21169
Feature activation-6.43e+00

Pos logit contributions
selves+2.219
rique+1.937
cedes+1.448
actly+1.396
productive+1.343
Neg logit contributions
 notor-3.564
KY-3.444
[/-3.334
 awa-3.225
 tattoo-3.219
Tokenа
Token ID16142
Feature activation-3.62e+00

Pos logit contributions
selves+3.389
rique+2.912
cedes+2.040
actly+1.935
productive+1.855
Neg logit contributions
 notor-6.884
KY-6.631
[/-6.411
 awa-6.279
 tattoo-6.251
Token
Token ID140
Feature activation-9.09e+00

Pos logit contributions
selves+2.579
rique+2.320
cedes+1.806
actly+1.744
productive+1.685
Neg logit contributions
 notor-3.283
KY-3.161
[/-3.039
 awa-2.958
 tattoo-2.947
Token
Token ID115
Feature activation-6.07e+00

Pos logit contributions
selves+3.986
rique+3.737
cedes+2.304
+2.281
productive+2.078
Neg logit contributions
 notor-9.995
KY-9.819
[/-9.452
 awa-9.240
 tattoo-9.193
Tokenд
Token ID43666
Feature activation-1.42e+01

Pos logit contributions
selves+3.791
rique+3.373
cedes+2.482
actly+2.427
productive+2.340
Neg logit contributions
 notor-5.881
KY-5.678
[/-5.449
 awa-5.299
 tattoo-5.294
Tokenе
Token ID16843
Feature activation-8.17e+00

Pos logit contributions
selves+8.044
rique+7.549
у+5.336
actly+5.040
cedes+5.039
Neg logit contributions
 notor-12.903
KY-12.072
 awa-11.525
cale-11.515
[/-11.488
Tokenл
Token ID30143
Feature activation-4.85e+00

Pos logit contributions
selves+5.301
rique+4.881
cedes+3.578
actly+3.385
productive+3.334
Neg logit contributions
 notor-7.646
KY-7.311
[/-6.967
 awa-6.807
 tattoo-6.798
Tokenе
Token ID16843
Feature activation-6.20e+00

Pos logit contributions
selves+2.516
rique+2.200
cedes+1.454
actly+1.412
productive+1.341
Neg logit contributions
 notor-5.246
KY-5.086
[/-4.933
 tattoo-4.819
 awa-4.790
Tokenн
Token ID22177
Feature activation-8.93e+00

Pos logit contributions
selves+4.671
rique+4.235
cedes+3.357
actly+3.236
productive+3.176
Neg logit contributions
 notor-5.197
KY-4.943
[/-4.743
 tattoo-4.611
 awa-4.606
Tokenи
Token ID18849
Feature activation-2.51e+00

Pos logit contributions
selves+5.457
rique+4.931
cedes+3.502
actly+3.437
productive+3.363
Neg logit contributions
 notor-8.503
KY-8.170
[/-7.772
 awa-7.660
 tattoo-7.597
Token apply
Token ID4174
Feature activation-1.84e+00

Pos logit contributions
 notor+1.038
KY+1.017
[/+0.994
 tattoo+0.939
etic+0.929
Neg logit contributions
selves-1.044
rique-0.945
cedes-0.770
actly-0.751
productive-0.725
Token_
Token ID62
Feature activation-1.01e+00

Pos logit contributions
selves+1.272
rique+1.130
cedes+0.896
actly+0.864
productive+0.815
Neg logit contributions
 notor-1.754
KY-1.642
[/-1.598
 awa-1.571
 tattoo-1.546
Tokenmove
Token ID21084
Feature activation-4.10e+00

Pos logit contributions
selves+0.895
rique+0.818
cedes+0.682
actly+0.668
productive+0.649
Neg logit contributions
 notor-0.739
KY-0.716
[/-0.699
 tattoo-0.659
 awa-0.650
Token (
Token ID357
Feature activation-3.04e+00

Pos logit contributions
selves+3.467
rique+3.151
cedes+2.597
actly+2.544
productive+2.457
Neg logit contributions
 notor-3.139
KY-3.056
[/-2.995
 tattoo-2.809
etic-2.771
Token &
Token ID1222
Feature activation-1.06e+01

Pos logit contributions
selves+3.002
rique+2.749
cedes+2.344
actly+2.300
productive+2.231
Neg logit contributions
 notor-1.907
KY-1.851
[/-1.790
 tattoo-1.658
etic-1.642
Token mut
Token ID4517
Feature activation-1.41e+01

Pos logit contributions
selves+7.607
rique+6.402
cedes+5.125
actly+4.876
acons+4.719
Neg logit contributions
KY-9.048
 notor-8.971
[/-8.528
etic-8.105
cale-8.046
Token self
Token ID2116
Feature activation-1.14e+01

Pos logit contributions
selves+9.238
rique+7.518
acons+5.588
 Dying+5.584
actly+5.577
Neg logit contributions
KY-12.307
 notor-11.936
[/-11.545
etic-10.880
anski-10.863
Token )
Token ID1267
Feature activation-8.78e+00

Pos logit contributions
selves+8.125
rique+7.138
cedes+5.744
actly+5.550
productive+5.287
Neg logit contributions
 notor-9.612
KY-9.465
[/-9.374
etic-8.708
 tattoo-8.686
Token {
Token ID1391
Feature activation-3.24e+00

Pos logit contributions
selves+7.008
rique+6.191
cedes+5.140
actly+5.029
productive+4.847
Neg logit contributions
 notor-6.868
KY-6.661
[/-6.394
etic-6.113
 tattoo-6.094
Token let
Token ID1309
Feature activation-6.02e+00

Pos logit contributions
selves+2.902
rique+2.604
cedes+2.146
actly+2.126
productive+2.049
Neg logit contributions
 notor-2.334
KY-2.299
[/-2.188
etic-2.061
 tattoo-2.049
Token prev
Token ID8654
Feature activation-1.91e+00

Pos logit contributions
selves+5.266
rique+4.620
cedes+3.881
actly+3.795
productive+3.630
Neg logit contributions
 notor-4.383
KY-4.335
[/-4.058
etic-3.863
cale-3.855
Token asked
Token ID1965
Feature activation+2.69e+00

Pos logit contributions
selves+0.638
rique+0.579
cedes+0.473
actly+0.462
productive+0.448
Neg logit contributions
 notor-0.610
KY-0.592
[/-0.578
 tattoo-0.549
 awa-0.543
Token MIT
Token ID17168
Feature activation-5.12e+00

Pos logit contributions
 notor+2.220
KY+2.175
[/+2.162
 tattoo+2.033
 awa+2.003
Neg logit contributions
selves-2.094
rique-1.901
cedes-1.553
actly-1.505
productive-1.451
Token
Token ID447
Feature activation-5.38e+00

Pos logit contributions
selves+4.058
rique+3.617
cedes+2.917
actly+2.790
productive+2.784
Neg logit contributions
 notor-4.128
KY-4.032
[/-3.814
 awa-3.731
 tattoo-3.711
Token
Token ID247
Feature activation+7.80e-01

Pos logit contributions
selves+2.070
rique+1.686
+0.987
cedes+0.934
actly+0.838
Neg logit contributions
 notor-6.626
KY-6.344
 awa-6.210
[/-6.167
 tattoo-6.059
Tokens
Token ID82
Feature activation-6.96e+00

Pos logit contributions
 notor+0.415
KY+0.402
[/+0.391
 tattoo+0.357
etic+0.351
Neg logit contributions
selves-0.841
rique-0.784
cedes-0.678
actly-0.667
productive-0.651
Token Dar
Token ID7491
Feature activation-1.40e+01

Pos logit contributions
selves+5.191
rique+4.633
cedes+3.623
actly+3.445
 Dying+3.434
Neg logit contributions
 notor-5.858
KY-5.665
[/-5.411
 awa-5.235
etic-5.167
Tokenon
Token ID261
Feature activation-2.57e+00

Pos logit contributions
selves+10.945
rique+10.602
cedes+8.825
stein+8.083
actly+7.949
Neg logit contributions
 notor-10.216
KY-9.744
[/-8.947
 awa-8.908
 tattoo-8.788
Token Ace
Token ID17102
Feature activation-5.49e+00

Pos logit contributions
selves+2.053
rique+1.875
cedes+1.512
actly+1.471
productive+1.428
Neg logit contributions
 notor-2.090
KY-2.044
[/-1.988
 tattoo-1.889
 awa-1.874
Tokenm
Token ID76
Feature activation-4.68e+00

Pos logit contributions
selves+3.989
rique+3.826
cedes+2.904
actly+2.886
productive+2.729
Neg logit contributions
 notor-4.734
KY-4.526
[/-4.382
 tattoo-4.282
 awa-4.250
Tokenoglu
Token ID49006
Feature activation-1.45e+00

Pos logit contributions
selves+3.046
rique+2.687
cedes+2.044
actly+2.016
productive+1.896
Neg logit contributions
 notor-4.538
KY-4.414
[/-4.281
 tattoo-4.112
 awa-4.095
Token.
Token ID13
Feature activation-4.83e+00

Pos logit contributions
selves+1.419
rique+1.310
cedes+1.112
actly+1.090
productive+1.062
Neg logit contributions
 notor-0.926
KY-0.898
[/-0.874
 tattoo-0.815
 awa-0.803
TokenEnough
Token ID47323
Feature activation-2.92e+00

Pos logit contributions
selves+4.972
rique+4.539
cedes+3.753
actly+3.719
productive+3.595
Neg logit contributions
 notor-3.944
KY-3.732
[/-3.588
 tattoo-3.494
 awa-3.486
Token (#
Token ID17426
Feature activation+5.86e+00

Pos logit contributions
selves+2.688
rique+2.457
cedes+2.045
actly+2.018
productive+1.974
Neg logit contributions
 notor-2.016
KY-1.911
[/-1.860
 awa-1.775
 tattoo-1.753
Token
Token ID140
Feature activation-8.51e+00

Pos logit contributions
KY+4.136
 notor+4.048
[/+3.972
etic+3.657
cale+3.643
Neg logit contributions
selves-5.094
rique-4.772
cedes-3.957
actly-3.858
productive-3.685
Token
Token ID251
Feature activation-9.93e-01

Pos logit contributions
selves+4.766
rique+4.308
+3.189
cedes+3.031
actly+2.842
Neg logit contributions
 notor-8.435
KY-8.163
[/-8.106
 tattoo-7.805
 awa-7.776
Tokenа
Token ID16142
Feature activation-7.75e+00

Pos logit contributions
selves+0.760
rique+0.684
cedes+0.551
actly+0.539
productive+0.515
Neg logit contributions
 notor-0.847
KY-0.817
[/-0.805
 tattoo-0.766
 awa-0.759
Tokenд
Token ID43666
Feature activation-1.40e+01

Pos logit contributions
selves+5.785
rique+5.188
cedes+4.156
actly+4.031
productive+3.796
Neg logit contributions
 notor-6.513
KY-6.180
[/-5.921
 awa-5.820
 tattoo-5.772
Tokenо
Token ID15166
Feature activation-7.65e+00

Pos logit contributions
selves+9.674
rique+8.545
cedes+6.610
у+6.550
actly+6.545
Neg logit contributions
 notor-11.387
KY-10.441
 awa-10.214
cale-9.997
[/-9.942
Tokenе
Token ID16843
Feature activation-8.89e+00

Pos logit contributions
selves+5.517
rique+4.868
cedes+3.804
actly+3.677
productive+3.499
Neg logit contributions
 notor-6.719
KY-6.427
[/-6.128
 awa-5.958
cale-5.939
Tokenл
Token ID30143
Feature activation-7.88e+00

Pos logit contributions
selves+6.371
rique+5.630
cedes+4.395
actly+4.199
productive+4.103
Neg logit contributions
 notor-7.767
KY-7.235
[/-6.956
 awa-6.851
cale-6.816
Token)
Token ID8
Feature activation+6.49e-01

Pos logit contributions
selves+5.365
rique+4.733
cedes+3.570
actly+3.538
productive+3.423
Neg logit contributions
 notor-7.173
KY-6.791
[/-6.554
 awa-6.435
cale-6.384
Token protest
Token ID5402
Feature activation-1.67e+00

Pos logit contributions
 notor+0.454
KY+0.442
[/+0.431
 tattoo+0.404
 awa+0.399
Neg logit contributions
selves-0.595
rique-0.546
cedes-0.458
actly-0.447
productive-0.435
Token de
Token ID390
Feature activation-9.86e+00

Pos logit contributions
selves+2.297
rique+2.123
cedes+1.737
actly+1.683
productive+1.638
Neg logit contributions
 notor-2.080
KY-2.059
[/-1.985
 tattoo-1.866
etic-1.857
Token d
Token ID288
Feature activation-8.03e+00

Pos logit contributions
selves+8.282
rique+7.767
cedes+6.482
actly+6.191
acons+5.977
Neg logit contributions
KY-6.787
 notor-6.633
[/-6.399
etic-6.122
 awa-6.039
Tokenés
Token ID20954
Feature activation-7.12e+00

Pos logit contributions
selves+7.187
rique+6.755
actly+5.588
cedes+5.576
acons+5.441
Neg logit contributions
 notor-5.467
KY-5.232
[/-4.991
 tattoo-4.796
 awa-4.769
Tokenacc
Token ID4134
Feature activation-4.15e+00

Pos logit contributions
selves+6.541
rique+5.991
cedes+5.074
actly+5.029
productive+4.821
Neg logit contributions
 notor-4.650
KY-4.578
[/-4.406
 awa-4.250
 tattoo-4.228
Tokenord
Token ID585
Feature activation-6.02e+00

Pos logit contributions
selves+3.565
rique+3.383
cedes+2.742
actly+2.733
acons+2.610
Neg logit contributions
 notor-3.145
KY-3.019
[/-2.856
 awa-2.778
 tattoo-2.777
Token ent
Token ID920
Feature activation-1.40e+01

Pos logit contributions
selves+4.898
rique+4.628
cedes+3.636
actly+3.566
productive+3.444
Neg logit contributions
KY-4.607
 notor-4.576
[/-4.334
etic-4.106
 tattoo-4.098
Tokenre
Token ID260
Feature activation-7.83e+00

Pos logit contributions
rique+10.152
selves+10.110
cedes+7.973
actly+7.686
acons+7.412
Neg logit contributions
 notor-10.361
KY-10.255
 awa-9.587
[/-9.578
 tattoo-9.022
Token mem
Token ID1066
Feature activation-4.61e+00

Pos logit contributions
selves+6.438
rique+5.918
cedes+4.862
actly+4.650
productive+4.546
Neg logit contributions
KY-5.769
 notor-5.721
[/-5.538
 awa-5.189
etic-5.177
Tokenb
Token ID65
Feature activation-2.96e+00

Pos logit contributions
selves+3.930
rique+3.624
cedes+2.967
actly+2.864
productive+2.799
Neg logit contributions
 notor-3.553
KY-3.459
[/-3.307
 tattoo-3.139
 awa-3.136
Tokenres
Token ID411
Feature activation-2.95e+00

Pos logit contributions
selves+2.443
rique+2.300
cedes+1.810
actly+1.766
productive+1.708
Neg logit contributions
 notor-2.364
KY-2.285
[/-2.197
 tattoo-2.089
 awa-2.076
Token,
Token ID11
Feature activation-2.88e+00

Pos logit contributions
selves+2.571
rique+2.370
cedes+1.953
actly+1.897
productive+1.847
Neg logit contributions
 notor-2.138
KY-2.120
[/-2.030
 tattoo-1.920
etic-1.915
Token
Token ID140
Feature activation-1.19e+01

Pos logit contributions
selves+4.817
rique+4.432
cedes+3.530
actly+3.435
productive+3.349
Neg logit contributions
 notor-5.357
KY-5.127
[/-4.907
 awa-4.823
 tattoo-4.756
Token
Token ID111
Feature activation-8.55e+00

Pos logit contributions
selves+5.263
rique+4.958
+3.524
cedes+3.261
+3.010
Neg logit contributions
 notor-12.315
KY-12.254
[/-11.697
 awa-11.367
etic-11.276
Tokenа
Token ID16142
Feature activation-8.65e+00

Pos logit contributions
selves+5.068
rique+4.517
actly+3.322
cedes+3.255
productive+3.122
Neg logit contributions
 notor-8.281
KY-7.966
[/-7.650
 awa-7.531
cale-7.460
Token
Token ID140
Feature activation-1.39e+01

Pos logit contributions
selves+6.326
rique+5.711
cedes+4.447
actly+4.335
productive+4.178
Neg logit contributions
 notor-7.280
KY-6.909
[/-6.538
 awa-6.479
cale-6.442
Token
Token ID117
Feature activation-4.41e+00

Pos logit contributions
selves+5.601
rique+5.357
+3.757
+3.221
cedes+3.132
Neg logit contributions
 notor-14.382
KY-14.107
[/-13.629
 awa-13.209
 tattoo-13.144
Tokenд
Token ID43666
Feature activation-1.40e+01

Pos logit contributions
selves+3.318
rique+2.981
cedes+2.355
actly+2.314
productive+2.235
Neg logit contributions
 notor-3.795
KY-3.610
[/-3.484
 tattoo-3.385
 awa-3.375
Tokenа
Token ID16142
Feature activation-9.07e+00

Pos logit contributions
selves+8.890
rique+8.127
у+6.052
actly+5.914
cedes+5.892
Neg logit contributions
 notor-11.833
KY-10.975
 awa-10.543
[/-10.473
cale-10.464
Tokenк
Token ID31583
Feature activation-1.11e+01

Pos logit contributions
selves+6.677
rique+6.046
cedes+4.685
actly+4.563
productive+4.405
Neg logit contributions
 notor-7.621
KY-7.098
[/-6.703
 awa-6.686
cale-6.650
Token,
Token ID11
Feature activation+1.30e+00

Pos logit contributions
selves+7.342
rique+6.801
cedes+5.051
actly+4.971
у+4.897
Neg logit contributions
 notor-9.564
KY-9.186
cale-8.611
[/-8.603
 awa-8.489
Token
Token ID12466
Feature activation-7.47e+00

Pos logit contributions
 notor+0.999
KY+0.976
[/+0.958
 tattoo+0.904
 awa+0.890
Neg logit contributions
selves-1.105
rique-1.007
cedes-0.830
actly-0.810
productive-0.785
Token
Token ID246
Feature activation+5.70e-01

Pos logit contributions
selves+4.610
rique+4.186
cedes+3.126
+3.105
actly+2.910
Neg logit contributions
 notor-7.028
KY-6.878
[/-6.715
 tattoo-6.458
 awa-6.427
Token to
Token ID284
Feature activation+3.03e+00

Pos logit contributions
 notor+1.634
KY+1.563
[/+1.549
 tattoo+1.488
cale+1.465
Neg logit contributions
selves-1.921
rique-1.782
cedes-1.472
actly-1.403
productive-1.382
Token be
Token ID307
Feature activation+2.29e+00

Pos logit contributions
 notor+1.604
KY+1.563
[/+1.478
 tattoo+1.423
etic+1.357
Neg logit contributions
selves-3.275
rique-3.057
cedes-2.653
actly-2.549
productive-2.537
Token more
Token ID517
Feature activation+3.22e+00

Pos logit contributions
 notor+1.724
KY+1.681
[/+1.648
 tattoo+1.555
etic+1.531
Neg logit contributions
selves-1.980
rique-1.811
cedes-1.498
actly-1.458
productive-1.417
Token action
Token ID2223
Feature activation+2.66e+00

Pos logit contributions
 notor+2.236
KY+2.175
[/+2.131
 tattoo+2.003
etic+1.969
Neg logit contributions
selves-2.964
rique-2.729
cedes-2.286
actly-2.233
productive-2.170
Token from
Token ID422
Feature activation+2.17e+00

Pos logit contributions
 notor+1.943
KY+1.901
[/+1.870
 tattoo+1.765
etic+1.737
Neg logit contributions
selves-2.328
rique-2.133
cedes-1.775
actly-1.730
productive-1.676
Token politicians
Token ID7602
Feature activation+8.47e+00

Pos logit contributions
 notor+1.509
KY+1.470
[/+1.438
 tattoo+1.348
etic+1.328
Neg logit contributions
selves-1.985
rique-1.824
cedes-1.529
actly-1.495
productive-1.453
Token globally
Token ID18309
Feature activation+5.11e+00

Pos logit contributions
 notor+5.262
[/+5.215
KY+5.196
 tattoo+4.980
 amongst+4.890
Neg logit contributions
selves-7.748
rique-7.284
cedes-6.278
actly-6.124
productive-5.882
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
 notor+3.537
KY+3.455
[/+3.430
 tattoo+3.278
etic+3.187
Neg logit contributions
selves-4.570
rique-4.229
cedes-3.574
actly-3.483
productive-3.364
Token working
Token ID1762
Feature activation-4.41e+00

Pos logit contributions
 notor+1.215
KY+1.185
[/+1.159
 tattoo+1.086
 awa+1.073
Neg logit contributions
selves-1.477
rique-1.350
cedes-1.123
actly-1.098
productive-1.066
Token together
Token ID1978
Feature activation-4.63e+00

Pos logit contributions
selves+3.686
rique+3.320
cedes+2.740
actly+2.674
acons+2.604
Neg logit contributions
 notor-3.392
KY-3.291
[/-3.230
etic-2.985
 awa-2.976
Token,
Token ID11
Feature activation-2.38e+00

Pos logit contributions
selves+3.701
rique+3.300
cedes+2.669
actly+2.596
acons+2.539
Neg logit contributions
 notor-3.717
KY-3.588
[/-3.493
 awa-3.295
etic-3.242
Token the
Token ID262
Feature activation+7.26e-01

Pos logit contributions
selves+0.316
rique+0.287
cedes+0.236
actly+0.231
productive+0.223
Neg logit contributions
 notor-0.298
KY-0.293
[/-0.285
etic-0.268
 tattoo-0.267
Token thought
Token ID1807
Feature activation-4.65e+00

Pos logit contributions
 notor+0.540
KY+0.527
[/+0.517
 tattoo+0.482
etic+0.480
Neg logit contributions
selves-0.630
rique-0.576
cedes-0.476
actly-0.466
productive-0.451
Token of
Token ID286
Feature activation-2.52e+00

Pos logit contributions
selves+3.775
rique+3.424
cedes+2.783
actly+2.733
productive+2.658
Neg logit contributions
 notor-3.709
KY-3.628
[/-3.536
 tattoo-3.320
 awa-3.302
Token having
Token ID1719
Feature activation-1.79e+00

Pos logit contributions
selves+2.071
rique+1.865
cedes+1.531
actly+1.501
productive+1.449
Neg logit contributions
 notor-1.943
KY-1.930
[/-1.865
etic-1.772
cale-1.740
Token to
Token ID284
Feature activation+2.94e+00

Pos logit contributions
selves+1.425
rique+1.290
cedes+1.055
actly+1.029
productive+0.995
Neg logit contributions
 notor-1.445
KY-1.423
[/-1.390
 awa-1.302
etic-1.301
Token call
Token ID869
Feature activation+7.46e+00

Pos logit contributions
 notor+2.030
KY+1.980
[/+1.937
 tattoo+1.821
 awa+1.787
Neg logit contributions
selves-2.708
rique-2.495
cedes-2.091
actly-2.043
productive-1.988
Token a
Token ID257
Feature activation+2.12e+00

Pos logit contributions
 notor+5.118
KY+5.003
[/+4.912
 tattoo+4.695
 awa+4.583
Neg logit contributions
selves-6.660
rique-6.221
cedes-5.151
actly-5.019
productive-4.949
Token department
Token ID5011
Feature activation+4.13e+00

Pos logit contributions
 notor+1.505
KY+1.466
[/+1.435
 tattoo+1.343
etic+1.327
Neg logit contributions
selves-1.915
rique-1.756
cedes-1.469
actly-1.435
productive-1.394
Token store
Token ID3650
Feature activation+2.34e+00

Pos logit contributions
 notor+2.795
KY+2.733
[/+2.688
 tattoo+2.547
etic+2.489
Neg logit contributions
selves-3.822
rique-3.531
cedes-2.956
actly-2.881
productive-2.808
Token to
Token ID284
Feature activation-1.19e+00

Pos logit contributions
 notor+1.734
KY+1.692
[/+1.658
 tattoo+1.560
etic+1.539
Neg logit contributions
selves-2.034
rique-1.860
cedes-1.542
actly-1.505
productive-1.461
Token see
Token ID766
Feature activation-2.15e-01

Pos logit contributions
selves+1.070
rique+0.958
cedes+0.807
actly+0.803
productive+0.765
Neg logit contributions
 notor-0.857
KY-0.816
[/-0.792
 awa-0.749
etic-0.744
Token science
Token ID3783
Feature activation+1.58e+00

Pos logit contributions
 notor+3.941
KY+3.813
[/+3.771
 tattoo+3.565
 popularity+3.513
Neg logit contributions
selves-5.060
rique-4.645
cedes-3.878
actly-3.820
productive-3.675
Token fiction
Token ID10165
Feature activation+1.89e+00

Pos logit contributions
 notor+1.193
KY+1.170
[/+1.149
 tattoo+1.084
 awa+1.069
Neg logit contributions
selves-1.353
rique-1.228
cedes-1.016
actly-0.989
productive-0.956
Token and
Token ID290
Feature activation+1.74e+00

Pos logit contributions
 notor+1.390
KY+1.356
[/+1.324
 tattoo+1.243
 awa+1.230
Neg logit contributions
selves-1.658
rique-1.515
cedes-1.259
actly-1.229
productive-1.194
Token fantasy
Token ID8842
Feature activation-1.71e+00

Pos logit contributions
 notor+1.373
KY+1.343
[/+1.317
 tattoo+1.247
etic+1.227
Neg logit contributions
selves-1.440
rique-1.308
cedes-1.072
actly-1.043
productive-1.011
Token programming
Token ID8300
Feature activation+5.31e+00

Pos logit contributions
selves+1.366
rique+1.228
cedes+1.001
actly+0.975
productive+0.945
Neg logit contributions
 notor-1.375
KY-1.352
[/-1.293
 awa-1.230
 tattoo-1.221
Token anywhere
Token ID6609
Feature activation+7.51e+00

Pos logit contributions
 notor+3.540
KY+3.424
[/+3.408
 tattoo+3.195
 popularity+3.116
Neg logit contributions
selves-4.957
rique-4.605
cedes-3.863
actly-3.812
productive-3.683
Token on
Token ID319
Feature activation+6.95e+00

Pos logit contributions
 notor+4.381
[/+4.336
KY+4.325
 tattoo+4.021
 amongst+3.978
Neg logit contributions
selves-7.353
rique-6.883
actly-5.814
cedes-5.810
productive-5.637
Token television
Token ID5581
Feature activation+1.25e+00

Pos logit contributions
 notor+4.258
KY+4.221
[/+4.216
 tattoo+3.878
 popularity+3.854
Neg logit contributions
selves-6.677
rique-6.218
cedes-5.198
actly-5.184
productive-5.041
Token.
Token ID13
Feature activation-5.64e-01

Pos logit contributions
 notor+0.813
KY+0.791
[/+0.773
 tattoo+0.720
etic+0.709
Neg logit contributions
selves-1.204
rique-1.111
cedes-0.941
actly-0.922
productive-0.897
Token Part
Token ID2142
Feature activation-9.90e+00

Pos logit contributions
selves+0.459
rique+0.410
cedes+0.335
actly+0.326
productive+0.316
Neg logit contributions
 notor-0.465
KY-0.443
[/-0.432
 tattoo-0.414
 awa-0.411
Token of
Token ID286
Feature activation-6.52e+00

Pos logit contributions
selves+7.442
rique+6.803
cedes+5.692
actly+5.236
productive+5.202
Neg logit contributions
 notor-7.823
KY-7.575
[/-7.306
 awa-7.037
 tattoo-6.941
Token
Token ID198
Feature activation+2.57e+00

Pos logit contributions
 notor+1.362
KY+1.342
[/+1.324
 tattoo+1.223
 awa+1.191
Neg logit contributions
selves-2.197
rique-2.032
cedes-1.734
actly-1.691
productive-1.655
Token
Token ID198
Feature activation+9.69e-02

Pos logit contributions
 notor+1.761
KY+1.706
[/+1.666
 tattoo+1.561
 awa+1.544
Neg logit contributions
selves-2.391
rique-2.197
cedes-1.849
actly-1.809
productive-1.760
Token8
Token ID23
Feature activation-2.45e+00

Pos logit contributions
 notor+0.069
KY+0.068
[/+0.067
 tattoo+0.062
cale+0.061
Neg logit contributions
selves-0.087
rique-0.080
cedes-0.067
actly-0.065
productive-0.063
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
selves+2.305
rique+2.066
cedes+1.768
actly+1.731
productive+1.677
Neg logit contributions
 notor-1.841
 awa-1.592
KY-1.570
[/-1.483
 challeng-1.465
Token Josh
Token ID8518
Feature activation+2.10e+00

Pos logit contributions
selves+0.785
rique+0.702
cedes+0.555
actly+0.538
productive+0.517
Neg logit contributions
 notor-0.970
KY-0.932
[/-0.913
 tattoo-0.875
 awa-0.869
Token Strauss
Token ID42971
Feature activation+8.93e+00

Pos logit contributions
 notor+2.138
KY+2.103
[/+2.075
 tattoo+1.984
etic+1.965
Neg logit contributions
selves-1.253
rique-1.097
cedes-0.811
actly-0.778
productive-0.738
Token (
Token ID357
Feature activation+3.93e+00

Pos logit contributions
KY+5.311
[/+5.111
 notor+5.090
 tattoo+4.727
 popularity+4.499
Neg logit contributions
selves-8.673
rique-8.050
cedes-6.873
actly-6.691
acons-6.482
Token103
Token ID15197
Feature activation+3.91e+00

Pos logit contributions
 notor+2.837
KY+2.822
[/+2.760
 tattoo+2.578
etic+2.547
Neg logit contributions
selves-3.454
rique-3.173
cedes-2.635
actly-2.567
productive-2.487
Token)
Token ID8
Feature activation+2.03e+00

Pos logit contributions
KY+3.369
[/+3.322
 notor+3.280
 tattoo+3.137
etic+3.098
Neg logit contributions
selves-2.845
rique-2.589
cedes-2.037
actly-1.984
productive-1.897
Token (
Token ID357
Feature activation+1.80e+00

Pos logit contributions
 notor+1.534
KY+1.472
[/+1.438
 awa+1.358
 tattoo+1.357
Neg logit contributions
selves-1.761
rique-1.604
cedes-1.330
actly-1.297
productive-1.260
Token22
Token ID1828
Feature activation-2.37e+00

Pos logit contributions
 notor+1.391
KY+1.294
[/+1.264
 awa+1.220
 tattoo+1.209
Neg logit contributions
selves-1.573
rique-1.424
cedes-1.184
actly-1.161
productive-1.132
Token morph
Token ID17488
Feature activation+6.12e+00

Pos logit contributions
 notor+2.489
KY+2.407
[/+2.358
 tattoo+2.208
etic+2.159
Neg logit contributions
selves-4.164
rique-3.888
cedes-3.317
actly-3.241
productive-3.149
Tokenologically
Token ID13437
Feature activation+7.64e+00

Pos logit contributions
 notor+4.670
KY+4.567
[/+4.538
etic+4.316
 tattoo+4.272
Neg logit contributions
selves-5.024
rique-4.629
cedes-3.832
actly-3.683
productive-3.547
Token distinct
Token ID7310
Feature activation-1.33e+00

Pos logit contributions
 notor+4.733
[/+4.580
KY+4.516
 tattoo+4.338
etic+4.229
Neg logit contributions
selves-7.277
rique-6.926
cedes-5.818
actly-5.572
productive-5.477
Token and
Token ID290
Feature activation+2.32e+00

Pos logit contributions
selves+1.255
rique+1.144
cedes+0.964
actly+0.943
productive+0.923
Neg logit contributions
 notor-0.895
KY-0.869
[/-0.814
 tattoo-0.785
 awa-0.778
Token should
Token ID815
Feature activation+7.90e+00

Pos logit contributions
 notor+1.724
KY+1.682
[/+1.648
 tattoo+1.551
etic+1.530
Neg logit contributions
selves-2.015
rique-1.842
cedes-1.526
actly-1.490
productive-1.445
Token be
Token ID307
Feature activation+7.98e+00

Pos logit contributions
 notor+4.678
KY+4.609
[/+4.546
 tattoo+4.224
etic+4.159
Neg logit contributions
selves-7.676
rique-7.314
cedes-6.200
actly-5.989
productive-5.893
Token regarded
Token ID11987
Feature activation+4.28e+00

Pos logit contributions
 notor+5.278
KY+5.037
[/+5.031
 tattoo+4.808
etic+4.682
Neg logit contributions
selves-7.389
rique-6.860
cedes-5.806
actly-5.659
productive-5.450
Token as
Token ID355
Feature activation+1.01e+00

Pos logit contributions
 notor+2.868
KY+2.763
[/+2.760
 tattoo+2.569
etic+2.537
Neg logit contributions
selves-3.972
rique-3.711
cedes-3.122
actly-3.070
productive-2.962
Token separate
Token ID4553
Feature activation-1.09e+00

Pos logit contributions
 notor+0.725
KY+0.713
[/+0.693
 tattoo+0.647
 awa+0.645
Neg logit contributions
selves-0.914
rique-0.832
cedes-0.694
actly-0.681
productive-0.663
Token species
Token ID4693
Feature activation+2.62e+00

Pos logit contributions
selves+1.020
rique+0.925
cedes+0.778
actly+0.760
productive+0.750
Neg logit contributions
 notor-0.742
KY-0.729
[/-0.691
 tattoo-0.651
 awa-0.650
Token,
Token ID11
Feature activation+3.00e+00

Pos logit contributions
 notor+1.723
KY+1.673
[/+1.619
 tattoo+1.521
 awa+1.503
Neg logit contributions
selves-2.521
rique-2.322
cedes-1.965
actly-1.921
productive-1.876
Token find
Token ID1064
Feature activation+3.56e+00

Pos logit contributions
 notor+0.361
KY+0.348
[/+0.336
 tattoo+0.316
etic+0.315
Neg logit contributions
selves-0.432
rique-0.391
cedes-0.325
actly-0.321
productive-0.311
Token an
Token ID281
Feature activation+4.58e+00

Pos logit contributions
 notor+2.448
KY+2.377
[/+2.359
 tattoo+2.201
 awa+2.161
Neg logit contributions
selves-3.272
rique-3.019
cedes-2.539
actly-2.468
productive-2.403
Token answer
Token ID3280
Feature activation+2.01e+00

Pos logit contributions
 notor+3.190
KY+3.109
[/+3.070
 tattoo+2.862
etic+2.823
Neg logit contributions
selves-4.194
rique-3.853
cedes-3.235
actly-3.154
productive-3.057
Token to
Token ID284
Feature activation-2.10e-01

Pos logit contributions
 notor+1.415
KY+1.377
[/+1.342
 tattoo+1.258
 awa+1.243
Neg logit contributions
selves-1.833
rique-1.680
cedes-1.406
actly-1.376
productive-1.338
Token this
Token ID428
Feature activation+2.80e+00

Pos logit contributions
selves+0.180
rique+0.164
cedes+0.135
actly+0.133
productive+0.129
Neg logit contributions
 notor-0.160
KY-0.155
[/-0.151
 tattoo-0.142
etic-0.142
Token question
Token ID1808
Feature activation+1.94e+00

Pos logit contributions
 notor+1.971
KY+1.920
[/+1.887
 tattoo+1.769
 awa+1.738
Neg logit contributions
selves-2.539
rique-2.335
cedes-1.954
actly-1.906
productive-1.854
Token.
Token ID13
Feature activation+7.53e-01

Pos logit contributions
 notor+1.279
KY+1.239
[/+1.204
 tattoo+1.122
 awa+1.110
Neg logit contributions
selves-1.866
rique-1.718
cedes-1.452
actly-1.425
productive-1.388
Token
Token ID198
Feature activation+2.88e+00

Pos logit contributions
 notor+0.582
KY+0.563
[/+0.549
 tattoo+0.521
 awa+0.516
Neg logit contributions
selves-0.640
rique-0.581
cedes-0.478
actly-0.467
productive-0.452
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
 notor+1.931
KY+1.895
[/+1.856
 tattoo+1.730
etic+1.705
Neg logit contributions
selves-2.701
rique-2.490
cedes-2.096
actly-2.050
productive-1.994
TokenD
Token ID35
Feature activation+1.95e-02

Pos logit contributions
 notor+1.066
KY+1.017
[/+0.993
 tattoo+0.950
 awa+0.942
Neg logit contributions
selves-1.236
rique-1.127
cedes-0.938
actly-0.915
productive-0.889
Tokenani
Token ID3216
Feature activation+1.97e+00

Pos logit contributions
 notor+0.014
KY+0.014
[/+0.013
etic+0.013
 tattoo+0.012
Neg logit contributions
selves-0.018
rique-0.016
cedes-0.013
actly-0.013
productive-0.013
Token two
Token ID734
Feature activation-5.20e-01

Pos logit contributions
 notor+4.440
[/+4.344
KY+4.311
 tattoo+4.045
 awa+3.981
Neg logit contributions
selves-5.305
rique-4.910
cedes-4.107
actly-3.932
productive-3.847
Token super
Token ID2208
Feature activation+3.53e-01

Pos logit contributions
selves+0.444
rique+0.402
cedes+0.331
actly+0.326
productive+0.315
Neg logit contributions
 notor-0.396
KY-0.390
[/-0.374
etic-0.353
 tattoo-0.353
Token-
Token ID12
Feature activation+2.22e+00

Pos logit contributions
 notor+0.261
KY+0.255
[/+0.249
 tattoo+0.234
etic+0.231
Neg logit contributions
selves-0.309
rique-0.283
cedes-0.235
actly-0.230
productive-0.223
Tokenwe
Token ID732
Feature activation-1.02e+00

Pos logit contributions
 notor+1.636
KY+1.603
[/+1.573
 tattoo+1.475
etic+1.456
Neg logit contributions
selves-1.937
rique-1.778
cedes-1.465
actly-1.435
productive-1.381
Tokenird
Token ID1447
Feature activation+2.97e+00

Pos logit contributions
selves+0.812
rique+0.730
cedes+0.591
actly+0.576
productive+0.555
Neg logit contributions
 notor-0.850
KY-0.825
[/-0.810
 tattoo-0.767
 awa-0.760
Token black
Token ID2042
Feature activation+2.19e-01

Pos logit contributions
 notor+2.144
KY+2.092
[/+2.054
 tattoo+1.932
etic+1.900
Neg logit contributions
selves-2.634
rique-2.419
cedes-2.014
actly-1.964
productive-1.910
Token-
Token ID12
Feature activation+1.32e+00

Pos logit contributions
 notor+0.156
KY+0.152
[/+0.148
 tattoo+0.138
etic+0.138
Neg logit contributions
selves-0.197
rique-0.180
cedes-0.151
actly-0.147
productive-0.143
Tokenborder
Token ID20192
Feature activation+3.31e+00

Pos logit contributions
 notor+0.865
KY+0.839
[/+0.819
 tattoo+0.764
 awa+0.753
Neg logit contributions
selves-1.264
rique-1.164
cedes-0.986
actly-0.965
productive-0.940
Token cards
Token ID4116
Feature activation+4.58e+00

Pos logit contributions
 notor+2.319
KY+2.263
[/+2.224
 tattoo+2.096
 awa+2.047
Neg logit contributions
selves-3.006
rique-2.765
cedes-2.310
actly-2.259
productive-2.198
Token:
Token ID25
Feature activation+8.30e-01

Pos logit contributions
 notor+3.185
KY+3.126
[/+3.097
 tattoo+2.894
etic+2.830
Neg logit contributions
selves-4.138
rique-3.824
cedes-3.194
actly-3.103
productive-3.037
Token Block
Token ID9726
Feature activation+3.16e-01

Pos logit contributions
 notor+0.602
KY+0.587
[/+0.574
 tattoo+0.539
etic+0.533
Neg logit contributions
selves-0.737
rique-0.675
cedes-0.562
actly-0.549
productive-0.533
Token who
Token ID508
Feature activation-2.74e+00

Pos logit contributions
selves+1.997
rique+1.811
cedes+1.478
actly+1.447
productive+1.400
Neg logit contributions
 notor-1.839
KY-1.798
[/-1.740
etic-1.643
 awa-1.627
Token had
Token ID550
Feature activation-1.63e+00

Pos logit contributions
selves+2.298
rique+2.074
cedes+1.707
actly+1.663
productive+1.604
Neg logit contributions
 notor-2.104
KY-2.062
[/-1.998
etic-1.886
cale-1.878
Token recently
Token ID2904
Feature activation+3.66e+00

Pos logit contributions
selves+1.370
rique+1.237
cedes+1.017
actly+0.993
productive+0.965
Neg logit contributions
 notor-1.246
KY-1.237
[/-1.194
etic-1.121
 awa-1.115
Token recovered
Token ID11911
Feature activation-9.61e-01

Pos logit contributions
 notor+2.702
KY+2.614
[/+2.589
 tattoo+2.466
 awa+2.396
Neg logit contributions
selves-3.187
rique-2.934
cedes-2.434
actly-2.385
productive-2.308
Token from
Token ID422
Feature activation-7.81e-01

Pos logit contributions
selves+0.878
rique+0.803
cedes+0.675
actly+0.661
productive+0.642
Neg logit contributions
 notor-0.671
KY-0.657
[/-0.638
 tattoo-0.592
etic-0.591
Token a
Token ID257
Feature activation-6.95e-01

Pos logit contributions
selves+0.686
rique+0.627
cedes+0.522
actly+0.510
productive+0.495
Neg logit contributions
 notor-0.571
KY-0.559
[/-0.543
 tattoo-0.506
 awa-0.505
Token heart
Token ID2612
Feature activation-4.11e+00

Pos logit contributions
selves+0.619
rique+0.566
cedes+0.474
actly+0.460
productive+0.446
Neg logit contributions
 notor-0.502
KY-0.493
[/-0.477
 awa-0.445
cale-0.444
Token problem
Token ID1917
Feature activation+3.30e+00

Pos logit contributions
selves+3.561
rique+3.322
cedes+2.775
actly+2.689
productive+2.617
Neg logit contributions
 notor-2.986
KY-2.898
[/-2.720
 awa-2.642
cale-2.636
Token and
Token ID290
Feature activation+1.70e+00

Pos logit contributions
 notor+2.239
KY+2.176
[/+2.171
 tattoo+2.035
etic+1.974
Neg logit contributions
selves-3.054
rique-2.816
cedes-2.372
actly-2.315
productive-2.255
Token received
Token ID2722
Feature activation+5.42e+00

Pos logit contributions
 notor+1.323
KY+1.292
[/+1.268
 tattoo+1.197
etic+1.180
Neg logit contributions
selves-1.422
rique-1.295
cedes-1.064
actly-1.037
productive-1.004
Token treatment
Token ID3513
Feature activation+9.55e-01

Pos logit contributions
 notor+3.876
KY+3.736
[/+3.702
 tattoo+3.572
 awa+3.449
Neg logit contributions
selves-4.813
rique-4.447
cedes-3.692
actly-3.565
productive-3.487
Token carefully
Token ID7773
Feature activation+1.75e-01

Pos logit contributions
selves+4.004
rique+3.535
cedes+3.025
actly+2.935
productive+2.802
Neg logit contributions
 notor-3.528
KY-3.445
[/-3.253
 awa-3.158
 tattoo-3.098
Token omitted
Token ID22532
Feature activation-1.93e+00

Pos logit contributions
 notor+0.119
KY+0.115
[/+0.112
 tattoo+0.104
 awa+0.103
Neg logit contributions
selves-0.163
rique-0.150
cedes-0.127
actly-0.124
productive-0.121
Token government
Token ID1230
Feature activation+1.27e+00

Pos logit contributions
selves+1.727
rique+1.556
cedes+1.318
actly+1.282
productive+1.244
Neg logit contributions
 notor-1.387
KY-1.354
[/-1.301
 tattoo-1.229
etic-1.225
Token subsidies
Token ID16230
Feature activation-1.88e+00

Pos logit contributions
 notor+0.914
KY+0.891
[/+0.871
 tattoo+0.816
 awa+0.807
Neg logit contributions
selves-1.142
rique-1.045
cedes-0.873
actly-0.852
productive-0.829
Token (
Token ID357
Feature activation+7.75e-01

Pos logit contributions
selves+1.695
rique+1.537
cedes+1.295
actly+1.262
productive+1.234
Neg logit contributions
 notor-1.339
KY-1.299
[/-1.250
 awa-1.183
 tattoo-1.179
Tokenwise
Token ID3083
Feature activation-6.22e-02

Pos logit contributions
 notor+0.547
KY+0.529
[/+0.519
 tattoo+0.489
 awa+0.483
Neg logit contributions
selves-0.706
rique-0.646
cedes-0.542
actly-0.529
productive-0.517
Tokenly
Token ID306
Feature activation+1.03e+00

Pos logit contributions
selves+0.052
rique+0.047
cedes+0.039
actly+0.038
productive+0.036
Neg logit contributions
 notor-0.049
KY-0.048
[/-0.046
 tattoo-0.044
etic-0.043
Token,
Token ID11
Feature activation-2.91e-01

Pos logit contributions
 notor+0.750
KY+0.732
[/+0.715
 tattoo+0.671
etic+0.663
Neg logit contributions
selves-0.920
rique-0.841
cedes-0.701
actly-0.684
productive-0.665
Token given
Token ID1813
Feature activation-3.87e+00

Pos logit contributions
selves+0.255
rique+0.231
cedes+0.193
actly+0.188
productive+0.183
Neg logit contributions
 notor-0.216
KY-0.210
[/-0.205
 tattoo-0.193
 awa-0.191
Token the
Token ID262
Feature activation-1.06e-01

Pos logit contributions
selves+3.184
rique+2.814
cedes+2.347
actly+2.278
productive+2.220
Neg logit contributions
 notor-3.038
KY-2.966
[/-2.874
 awa-2.735
 tattoo-2.725
Token imminent
Token ID21161
Feature activation+2.66e+00

Pos logit contributions
selves+0.099
rique+0.091
cedes+0.076
actly+0.074
productive+0.073
Neg logit contributions
 notor-0.073
KY-0.071
[/-0.069
 tattoo-0.064
 awa-0.064
Token
Token ID198
Feature activation-3.78e+00

Pos logit contributions
selves+1.818
rique+1.599
cedes+1.336
actly+1.300
productive+1.276
Neg logit contributions
 notor-1.896
 awa-1.678
KY-1.619
 challeng-1.553
[/-1.526
TokenThe
Token ID464
Feature activation-1.28e+00

Pos logit contributions
selves+3.123
rique+2.820
cedes+2.366
actly+2.286
productive+2.267
Neg logit contributions
 notor-3.078
 awa-2.751
KY-2.709
 tattoo-2.663
[/-2.660
Token Jersey
Token ID8221
Feature activation+8.84e-02

Pos logit contributions
selves+1.106
rique+1.015
cedes+0.831
actly+0.815
productive+0.795
Neg logit contributions
 notor-0.956
KY-0.928
[/-0.906
 awa-0.848
etic-0.844
Token City
Token ID2254
Feature activation+2.09e+00

Pos logit contributions
 notor+0.065
KY+0.064
[/+0.063
 tattoo+0.059
etic+0.058
Neg logit contributions
selves-0.077
rique-0.071
cedes-0.059
actly-0.057
productive-0.056
Token Museum
Token ID9594
Feature activation+1.75e+00

Pos logit contributions
 notor+1.686
KY+1.650
[/+1.622
 tattoo+1.541
 awa+1.514
Neg logit contributions
selves-1.687
rique-1.528
cedes-1.248
actly-1.215
productive-1.172
Token was
Token ID373
Feature activation+1.49e+00

Pos logit contributions
 notor+1.296
KY+1.266
[/+1.245
 tattoo+1.171
etic+1.151
Neg logit contributions
selves-1.521
rique-1.391
cedes-1.154
actly-1.127
productive-1.093
Token founded
Token ID9393
Feature activation-3.39e+00

Pos logit contributions
 notor+1.116
KY+1.091
[/+1.065
 tattoo+1.000
etic+0.992
Neg logit contributions
selves-1.291
rique-1.181
cedes-0.976
actly-0.952
productive-0.926
Token in
Token ID287
Feature activation+1.56e-01

Pos logit contributions
selves+2.814
rique+2.529
cedes+2.060
actly+2.024
productive+1.963
Neg logit contributions
 notor-2.640
KY-2.565
[/-2.492
 awa-2.370
 tattoo-2.332
Token 1901
Token ID39923
Feature activation+2.33e+00

Pos logit contributions
 notor+0.119
KY+0.115
[/+0.112
 awa+0.107
etic+0.105
Neg logit contributions
selves-0.134
rique-0.121
cedes-0.099
actly-0.098
productive-0.094
Token as
Token ID355
Feature activation-3.84e+00

Pos logit contributions
 notor+1.598
KY+1.566
[/+1.546
 tattoo+1.449
etic+1.423
Neg logit contributions
selves-2.137
rique-1.961
cedes-1.655
actly-1.618
productive-1.573
Token part
Token ID636
Feature activation-7.44e+00

Pos logit contributions
selves+3.050
rique+2.767
cedes+2.219
actly+2.168
productive+2.113
Neg logit contributions
 notor-3.108
KY-3.039
[/-2.974
 awa-2.805
 tattoo-2.796
Token.
Token ID13
Feature activation-4.22e+00

Pos logit contributions
selves+4.820
rique+4.417
cedes+3.729
actly+3.578
productive+3.515
Neg logit contributions
 notor-3.595
KY-3.461
[/-3.260
 awa-3.232
etic-3.155
Token
Token ID447
Feature activation-1.31e+00

Pos logit contributions
selves+3.426
rique+3.085
cedes+2.482
actly+2.451
productive+2.360
Neg logit contributions
 notor-3.428
KY-3.225
[/-3.129
 tattoo-3.024
 awa-3.019
Token
Token ID250
Feature activation-3.86e+00

Pos logit contributions
selves+0.808
rique+0.711
cedes+0.529
actly+0.510
productive+0.487
Neg logit contributions
 notor-1.304
KY-1.277
[/-1.256
 tattoo-1.205
 awa-1.193
TokenAnd
Token ID1870
Feature activation-1.42e+00

Pos logit contributions
selves+3.496
rique+3.192
cedes+2.641
actly+2.612
productive+2.531
Neg logit contributions
 notor-2.764
KY-2.573
[/-2.476
 tattoo-2.414
 awa-2.386
Token for
Token ID329
Feature activation-1.35e+00

Pos logit contributions
selves+1.174
rique+1.069
cedes+0.868
actly+0.851
productive+0.822
Neg logit contributions
 notor-1.122
KY-1.090
[/-1.059
 tattoo-1.008
etic-0.999
Token Israel
Token ID2692
Feature activation-3.99e+00

Pos logit contributions
selves+1.121
rique+1.018
cedes+0.832
actly+0.815
productive+0.787
Neg logit contributions
 notor-1.061
KY-1.038
[/-1.008
 tattoo-0.955
etic-0.951
Token to
Token ID284
Feature activation+6.36e-01

Pos logit contributions
selves+3.521
rique+3.207
cedes+2.661
actly+2.627
productive+2.543
Neg logit contributions
 notor-2.889
KY-2.802
[/-2.704
 tattoo-2.563
 awa-2.560
Token put
Token ID1234
Feature activation-4.98e-01

Pos logit contributions
 notor+0.438
KY+0.425
[/+0.415
 tattoo+0.389
etic+0.384
Neg logit contributions
selves-0.589
rique-0.540
cedes-0.454
actly-0.445
productive-0.432
Token itself
Token ID2346
Feature activation+5.43e-01

Pos logit contributions
selves+0.451
rique+0.413
cedes+0.345
actly+0.337
productive+0.327
Neg logit contributions
 notor-0.356
KY-0.346
[/-0.335
 tattoo-0.315
etic-0.312
Token in
Token ID287
Feature activation-2.79e+00

Pos logit contributions
 notor+0.385
KY+0.376
[/+0.364
 tattoo+0.341
 awa+0.339
Neg logit contributions
selves-0.495
rique-0.451
cedes-0.376
actly-0.368
productive-0.358
Token his
Token ID465
Feature activation-1.73e+00

Pos logit contributions
selves+2.320
rique+2.097
cedes+1.742
actly+1.721
productive+1.659
Neg logit contributions
 notor-2.146
KY-2.107
[/-2.026
etic-1.935
 awa-1.932
Token the
Token ID262
Feature activation-2.52e-01

Pos logit contributions
selves+0.023
rique+0.021
cedes+0.017
actly+0.017
productive+0.016
Neg logit contributions
 notor-0.019
KY-0.018
[/-0.018
 tattoo-0.017
 awa-0.017
Token plot
Token ID7110
Feature activation-1.01e+00

Pos logit contributions
selves+0.226
rique+0.207
cedes+0.173
actly+0.168
productive+0.164
Neg logit contributions
 notor-0.181
KY-0.176
[/-0.172
 tattoo-0.161
etic-0.160
Token may
Token ID743
Feature activation-1.72e+00

Pos logit contributions
selves+0.887
rique+0.812
cedes+0.677
actly+0.659
productive+0.642
Neg logit contributions
 notor-0.744
KY-0.725
[/-0.707
 tattoo-0.665
 awa-0.658
Token seem
Token ID1283
Feature activation-3.35e+00

Pos logit contributions
selves+1.557
rique+1.404
cedes+1.189
actly+1.167
productive+1.133
Neg logit contributions
 notor-1.223
KY-1.181
[/-1.141
 tattoo-1.064
etic-1.064
Token somewhat
Token ID6454
Feature activation-3.40e+00

Pos logit contributions
selves+2.819
rique+2.546
cedes+2.088
actly+2.059
productive+2.009
Neg logit contributions
 notor-2.550
KY-2.514
[/-2.408
cale-2.270
 awa-2.263
Token shallow
Token ID19337
Feature activation-2.27e+00

Pos logit contributions
selves+2.932
rique+2.680
cedes+2.205
actly+2.163
productive+2.122
Neg logit contributions
 notor-2.512
KY-2.470
[/-2.363
cale-2.227
 tattoo-2.195
Token at
Token ID379
Feature activation+1.87e+00

Pos logit contributions
selves+2.040
rique+1.853
cedes+1.549
actly+1.509
productive+1.476
Neg logit contributions
 notor-1.648
KY-1.586
[/-1.517
 awa-1.431
 tattoo-1.431
Token times
Token ID1661
Feature activation+2.91e-01

Pos logit contributions
 notor+1.593
KY+1.561
[/+1.538
 tattoo+1.460
etic+1.439
Neg logit contributions
selves-1.431
rique-1.282
cedes-1.032
actly-0.997
productive-0.963
Token,
Token ID11
Feature activation-9.90e-01

Pos logit contributions
 notor+0.201
KY+0.196
[/+0.191
 tattoo+0.179
 awa+0.177
Neg logit contributions
selves-0.268
rique-0.246
cedes-0.207
actly-0.202
productive-0.197
Token but
Token ID475
Feature activation+4.57e+00

Pos logit contributions
selves+0.817
rique+0.742
cedes+0.608
actly+0.592
productive+0.575
Neg logit contributions
 notor-0.780
KY-0.761
[/-0.742
 tattoo-0.700
 awa-0.697
Token they
Token ID484
Feature activation+1.93e+00

Pos logit contributions
 notor+3.246
KY+3.167
[/+3.146
 tattoo+2.942
etic+2.890
Neg logit contributions
selves-4.078
rique-3.731
cedes-3.125
actly-3.039
productive-2.951
Token protests
Token ID8536
Feature activation-8.12e-01

Pos logit contributions
 notor+2.786
KY+2.703
[/+2.645
 tattoo+2.487
 popularity+2.418
Neg logit contributions
selves-4.150
rique-3.870
cedes-3.295
actly-3.206
productive-3.127
Token in
Token ID287
Feature activation+8.49e-01

Pos logit contributions
selves+0.714
rique+0.645
cedes+0.534
actly+0.524
productive+0.510
Neg logit contributions
 notor-0.601
KY-0.584
[/-0.570
 awa-0.532
etic-0.529
Token Wednesday
Token ID3583
Feature activation+1.99e+00

Pos logit contributions
 notor+0.640
KY+0.625
[/+0.612
 tattoo+0.574
 awa+0.570
Neg logit contributions
selves-0.731
rique-0.668
cedes-0.550
actly-0.538
productive-0.522
Token night
Token ID1755
Feature activation-1.42e+00

Pos logit contributions
 notor+1.296
KY+1.263
[/+1.242
 tattoo+1.163
etic+1.131
Neg logit contributions
selves-1.902
rique-1.759
cedes-1.498
actly-1.450
productive-1.414
Token's
Token ID338
Feature activation+1.05e+00

Pos logit contributions
selves+1.404
rique+1.298
cedes+1.106
actly+1.082
productive+1.055
Neg logit contributions
 notor-0.891
KY-0.866
[/-0.846
 tattoo-0.786
etic-0.773
Token episode
Token ID4471
Feature activation-3.84e+00

Pos logit contributions
 notor+0.762
KY+0.743
[/+0.728
 tattoo+0.683
etic+0.674
Neg logit contributions
selves-0.939
rique-0.860
cedes-0.717
actly-0.700
productive-0.680
Token,
Token ID11
Feature activation-2.66e+00

Pos logit contributions
selves+3.248
rique+2.942
cedes+2.414
actly+2.357
productive+2.297
Neg logit contributions
 notor-2.926
KY-2.844
[/-2.731
 awa-2.600
cale-2.573
Token and
Token ID290
Feature activation+1.63e+00

Pos logit contributions
selves+2.215
rique+1.994
cedes+1.634
actly+1.595
productive+1.546
Neg logit contributions
 notor-2.067
KY-2.018
[/-1.958
etic-1.851
 awa-1.848
Token they
Token ID484
Feature activation-6.41e-01

Pos logit contributions
 notor+1.210
KY+1.181
[/+1.157
 tattoo+1.088
etic+1.074
Neg logit contributions
selves-1.413
rique-1.291
cedes-1.070
actly-1.044
productive-1.013
Token're
Token ID821
Feature activation+9.61e-01

Pos logit contributions
selves+0.507
rique+0.453
cedes+0.366
actly+0.356
productive+0.344
Neg logit contributions
 notor-0.533
KY-0.518
[/-0.506
etic-0.476
 awa-0.476
Token using
Token ID1262
Feature activation+3.03e+00

Pos logit contributions
 notor+0.767
KY+0.756
[/+0.736
 tattoo+0.691
etic+0.690
Neg logit contributions
selves-0.784
rique-0.706
cedes-0.576
actly-0.564
productive-0.543
Token,
Token ID11
Feature activation-1.68e+00

Pos logit contributions
 notor+2.076
KY+2.021
[/+1.997
 tattoo+1.851
etic+1.813
Neg logit contributions
selves-3.086
rique-2.854
cedes-2.412
actly-2.365
productive-2.299
Token and
Token ID290
Feature activation-1.22e+00

Pos logit contributions
selves+1.500
rique+1.367
cedes+1.146
actly+1.113
productive+1.092
Neg logit contributions
 notor-1.198
KY-1.169
[/-1.123
 tattoo-1.067
etic-1.059
Token we
Token ID356
Feature activation-5.63e+00

Pos logit contributions
selves+1.031
rique+0.936
cedes+0.770
actly+0.753
productive+0.734
Neg logit contributions
 notor-0.930
KY-0.908
[/-0.871
 tattoo-0.829
etic-0.824
Token have
Token ID423
Feature activation-3.11e+00

Pos logit contributions
selves+4.598
rique+4.007
cedes+3.230
actly+3.161
productive+3.128
Neg logit contributions
 notor-4.452
KY-4.269
[/-4.060
etic-3.887
 awa-3.824
Token outlined
Token ID16493
Feature activation-1.86e+00

Pos logit contributions
selves+2.592
rique+2.334
cedes+1.897
actly+1.870
productive+1.853
Neg logit contributions
 notor-2.386
KY-2.366
[/-2.258
 awa-2.128
etic-2.117
Token a
Token ID257
Feature activation-2.19e+00

Pos logit contributions
selves+1.562
rique+1.420
cedes+1.168
actly+1.158
productive+1.112
Neg logit contributions
 notor-1.441
KY-1.403
[/-1.347
 tattoo-1.279
 awa-1.276
Token responsible
Token ID4497
Feature activation-8.43e-01

Pos logit contributions
selves+1.954
rique+1.786
cedes+1.490
actly+1.463
productive+1.424
Neg logit contributions
 notor-1.575
KY-1.540
[/-1.481
 awa-1.390
 tattoo-1.387
Token approach
Token ID3164
Feature activation+1.53e+00

Pos logit contributions
selves+0.745
rique+0.672
cedes+0.559
actly+0.549
productive+0.536
Neg logit contributions
 notor-0.624
KY-0.598
[/-0.573
 awa-0.544
etic-0.538
Token for
Token ID329
Feature activation-1.11e+00

Pos logit contributions
 notor+1.208
KY+1.184
[/+1.168
 tattoo+1.100
etic+1.085
Neg logit contributions
selves-1.261
rique-1.148
cedes-0.939
actly-0.915
productive-0.884
Token the
Token ID262
Feature activation-5.25e-01

Pos logit contributions
selves+0.944
rique+0.852
cedes+0.704
actly+0.689
productive+0.668
Neg logit contributions
 notor-0.856
KY-0.839
[/-0.801
etic-0.761
 tattoo-0.759
Token future
Token ID2003
Feature activation-1.50e+00

Pos logit contributions
selves+0.438
rique+0.395
cedes+0.322
actly+0.315
productive+0.306
Neg logit contributions
 notor-0.413
KY-0.404
[/-0.388
etic-0.369
 tattoo-0.367
Token
Token ID198
Feature activation-5.28e-01

Pos logit contributions
 notor+0.039
KY+0.038
[/+0.038
 tattoo+0.035
etic+0.034
Neg logit contributions
selves-0.059
rique-0.055
cedes-0.046
actly-0.045
productive-0.045
Token
Token ID198
Feature activation-1.04e+00

Pos logit contributions
selves+0.461
rique+0.408
cedes+0.344
actly+0.337
productive+0.327
Neg logit contributions
 notor-0.431
KY-0.381
 awa-0.381
[/-0.364
 tattoo-0.353
TokenPress
Token ID13800
Feature activation+7.70e+00

Pos logit contributions
selves+0.908
rique+0.820
cedes+0.689
actly+0.673
productive+0.654
Neg logit contributions
 notor-0.785
KY-0.725
[/-0.706
 awa-0.695
 tattoo-0.691
Token the
Token ID262
Feature activation+2.17e+00

Pos logit contributions
KY+5.550
 notor+5.549
[/+5.404
 tattoo+5.098
etic+5.064
Neg logit contributions
selves-6.533
rique-5.966
cedes-4.918
actly-4.746
productive-4.706
Token button
Token ID4936
Feature activation-1.06e+00

Pos logit contributions
 notor+1.654
KY+1.614
[/+1.584
 tattoo+1.497
 awa+1.473
Neg logit contributions
selves-1.842
rique-1.684
cedes-1.388
actly-1.351
productive-1.313
Token!
Token ID0
Feature activation-3.19e+00

Pos logit contributions
selves+0.884
rique+0.801
cedes+0.656
actly+0.644
productive+0.619
Neg logit contributions
 notor-0.831
KY-0.810
[/-0.787
etic-0.740
 awa-0.739
Token (
Token ID357
Feature activation+2.53e+00

Pos logit contributions
selves+2.716
rique+2.487
cedes+2.054
actly+2.014
productive+1.931
Neg logit contributions
 notor-2.412
KY-2.328
[/-2.276
 tattoo-2.149
 awa-2.138
Token9
Token ID24
Feature activation+1.61e+00

Pos logit contributions
 notor+1.887
KY+1.805
[/+1.767
 tattoo+1.673
 awa+1.666
Neg logit contributions
selves-2.224
rique-2.028
cedes-1.688
actly-1.651
productive-1.605
Token)
Token ID8
Feature activation+8.24e-01

Pos logit contributions
 notor+1.511
KY+1.459
[/+1.432
 awa+1.372
 tattoo+1.367
Neg logit contributions
selves-1.110
rique-0.984
cedes-0.770
actly-0.742
productive-0.713
Token
Token ID198
Feature activation-3.01e-01

Pos logit contributions
 notor+0.520
[/+0.507
KY+0.506
 tattoo+0.463
etic+0.452
Neg logit contributions
selves-0.810
rique-0.746
cedes-0.633
actly-0.618
productive-0.609
Token
Token ID198
Feature activation-1.45e+00

Pos logit contributions
selves+0.265
rique+0.236
cedes+0.198
actly+0.194
productive+0.189
Neg logit contributions
 notor-0.242
KY-0.215
 awa-0.213
[/-0.206
 tattoo-0.199
Token <
Token ID1279
Feature activation-1.12e+00

Pos logit contributions
selves+3.911
rique+3.610
cedes+3.060
actly+2.994
productive+2.917
Neg logit contributions
 notor-2.614
KY-2.543
[/-2.483
 tattoo-2.314
etic-2.276
Token T
Token ID309
Feature activation-4.27e+00

Pos logit contributions
selves+0.981
rique+0.898
cedes+0.745
actly+0.727
productive+0.706
Neg logit contributions
 notor-0.834
KY-0.814
[/-0.798
 tattoo-0.750
etic-0.740
Token >
Token ID1875
Feature activation-5.41e+00

Pos logit contributions
selves+3.390
rique+3.074
cedes+2.489
actly+2.418
productive+2.340
Neg logit contributions
 notor-3.512
KY-3.440
[/-3.376
 tattoo-3.196
etic-3.156
Token {
Token ID1391
Feature activation+7.38e-01

Pos logit contributions
selves+4.234
rique+3.791
cedes+3.059
actly+3.031
productive+2.895
Neg logit contributions
 notor-4.436
KY-4.255
[/-4.170
 tattoo-3.933
 awa-3.927
Token type
Token ID2099
Feature activation-1.77e+00

Pos logit contributions
 notor+0.463
KY+0.445
[/+0.439
 tattoo+0.416
 awa+0.405
Neg logit contributions
selves-0.727
rique-0.679
cedes-0.576
actly-0.558
productive-0.547
Token Target
Token ID12744
Feature activation-9.71e+00

Pos logit contributions
selves+1.449
rique+1.306
cedes+1.076
actly+1.045
productive+1.014
Neg logit contributions
 notor-1.406
KY-1.373
[/-1.348
 tattoo-1.267
 awa-1.254
Token =
Token ID796
Feature activation+1.13e+00

Pos logit contributions
selves+7.366
rique+6.526
actly+5.211
cedes+5.170
Args+5.021
Neg logit contributions
 notor-7.880
[/-7.475
KY-7.467
 awa-7.040
 tattoo-6.975
Token T
Token ID309
Feature activation-3.94e+00

Pos logit contributions
 notor+0.834
KY+0.811
[/+0.798
 tattoo+0.753
 awa+0.740
Neg logit contributions
selves-0.993
rique-0.914
cedes-0.758
actly-0.737
productive-0.718
Token ;
Token ID2162
Feature activation-3.13e+00

Pos logit contributions
selves+3.333
rique+3.032
cedes+2.497
actly+2.453
productive+2.364
Neg logit contributions
 notor-2.994
KY-2.888
[/-2.865
 tattoo-2.674
etic-2.650
Token...
Token ID2644
Feature activation-2.09e+00

Pos logit contributions
selves+2.890
rique+2.640
cedes+2.221
actly+2.182
productive+2.119
Neg logit contributions
 notor-2.142
KY-2.096
[/-2.032
etic-1.891
cale-1.881
Token }
Token ID1782
Feature activation+1.94e+00

Pos logit contributions
selves+1.953
rique+1.807
cedes+1.516
actly+1.472
productive+1.443
Neg logit contributions
 notor-1.443
KY-1.395
[/-1.372
 tattoo-1.296
 awa-1.265
Token July
Token ID2901
Feature activation-2.46e+00

Pos logit contributions
selves+3.343
rique+3.010
cedes+2.470
actly+2.408
productive+2.333
Neg logit contributions
 notor-3.332
KY-3.230
[/-3.076
etic-3.000
 awa-2.966
Token 4
Token ID604
Feature activation+1.07e+00

Pos logit contributions
selves+2.188
rique+1.987
cedes+1.655
actly+1.636
productive+1.586
Neg logit contributions
 notor-1.776
KY-1.702
[/-1.615
etic-1.558
 awa-1.558
Tokenth
Token ID400
Feature activation-2.31e+00

Pos logit contributions
 notor+0.845
KY+0.831
[/+0.821
 tattoo+0.773
etic+0.757
Neg logit contributions
selves-0.880
rique-0.802
cedes-0.654
actly-0.638
productive-0.617
Token
Token ID447
Feature activation-3.48e+00

Pos logit contributions
selves+2.069
rique+1.884
cedes+1.574
actly+1.541
productive+1.495
Neg logit contributions
 notor-1.657
KY-1.602
[/-1.539
etic-1.456
 tattoo-1.454
Token
Token ID251
Feature activation-2.20e+00

Pos logit contributions
selves+2.203
rique+1.944
cedes+1.466
actly+1.415
productive+1.357
Neg logit contributions
 notor-3.408
KY-3.321
[/-3.258
 tattoo-3.131
 awa-3.111
Token ended
Token ID4444
Feature activation-9.56e+00

Pos logit contributions
selves+1.968
rique+1.803
cedes+1.494
actly+1.468
productive+1.423
Neg logit contributions
 notor-1.599
KY-1.533
[/-1.484
 awa-1.402
 tattoo-1.398
Token in
Token ID287
Feature activation-1.76e+00

Pos logit contributions
selves+7.799
rique+7.056
actly+5.884
cedes+5.842
productive+5.578
Neg logit contributions
 notor-6.831
KY-6.625
[/-6.188
cale-6.151
 awa-6.067
Token arrest
Token ID3251
Feature activation+2.76e+00

Pos logit contributions
selves+1.468
rique+1.345
cedes+1.099
actly+1.086
productive+1.038
Neg logit contributions
 notor-1.353
KY-1.324
[/-1.291
 awa-1.221
 tattoo-1.209
Token and
Token ID290
Feature activation+2.72e+00

Pos logit contributions
 notor+1.862
KY+1.833
[/+1.805
 tattoo+1.678
etic+1.648
Neg logit contributions
selves-2.559
rique-2.355
cedes-1.989
actly-1.921
productive-1.891
Token ign
Token ID3627
Feature activation-3.42e+00

Pos logit contributions
 notor+1.801
KY+1.751
[/+1.718
 tattoo+1.604
 awa+1.574
Neg logit contributions
selves-2.589
rique-2.383
cedes-2.016
actly-1.971
productive-1.921
Tokenom
Token ID296
Feature activation+4.20e+00

Pos logit contributions
selves+3.102
rique+2.879
cedes+2.417
actly+2.394
productive+2.301
Neg logit contributions
 notor-2.424
KY-2.310
[/-2.220
 awa-2.114
 tattoo-2.113
Token Pitch
Token ID33517
Feature activation-6.17e-02

Pos logit contributions
selves+2.240
rique+2.037
cedes+1.662
actly+1.625
productive+1.569
Neg logit contributions
 notor-2.098
KY-2.037
[/-1.995
etic-1.874
 awa-1.868
Tokenfork
Token ID32523
Feature activation-9.80e-01

Pos logit contributions
selves+0.063
rique+0.059
cedes+0.050
actly+0.049
productive+0.048
Neg logit contributions
 notor-0.036
KY-0.035
[/-0.034
 tattoo-0.031
 awa-0.031
Token,
Token ID11
Feature activation-3.56e+00

Pos logit contributions
selves+0.860
rique+0.789
cedes+0.650
actly+0.641
productive+0.622
Neg logit contributions
 notor-0.718
KY-0.697
[/-0.680
 awa-0.642
 tattoo-0.640
Token music
Token ID2647
Feature activation-3.49e-01

Pos logit contributions
selves+2.968
rique+2.698
cedes+2.199
actly+2.128
productive+2.086
Neg logit contributions
 notor-2.742
KY-2.680
[/-2.611
 awa-2.441
etic-2.437
Token critic
Token ID4014
Feature activation+9.83e-01

Pos logit contributions
selves+0.282
rique+0.255
cedes+0.208
actly+0.203
productive+0.198
Neg logit contributions
 notor-0.279
KY-0.273
[/-0.268
cale-0.251
 awa-0.251
Token Adam
Token ID7244
Feature activation-1.22e+01

Pos logit contributions
 notor+0.792
KY+0.774
[/+0.758
 tattoo+0.716
 awa+0.709
Neg logit contributions
selves-0.796
rique-0.723
cedes-0.588
actly-0.572
productive-0.554
Token Ward
Token ID12150
Feature activation-7.29e-01

Pos logit contributions
selves+8.735
rique+7.979
cedes+5.972
stein+5.897
actly+5.882
Neg logit contributions
 notor-9.998
KY-9.608
[/-8.855
 awa-8.680
etic-8.629
Token reminis
Token ID21484
Feature activation-6.64e+00

Pos logit contributions
selves+0.622
rique+0.572
cedes+0.471
actly+0.458
productive+0.446
Neg logit contributions
 notor-0.553
KY-0.536
[/-0.523
 tattoo-0.491
 awa-0.489
Tokences
Token ID728
Feature activation-4.46e+00

Pos logit contributions
selves+2.947
rique+2.498
cedes+2.126
productive+1.632
actly+1.625
Neg logit contributions
 notor-8.084
KY-7.307
 awa-7.196
[/-7.104
 tattoo-7.017
Token about
Token ID546
Feature activation-4.26e+00

Pos logit contributions
selves+3.517
rique+3.169
cedes+2.543
actly+2.534
productive+2.416
Neg logit contributions
 notor-3.631
KY-3.524
[/-3.408
 awa-3.232
cale-3.229
Token digital
Token ID4875
Feature activation-1.68e+00

Pos logit contributions
selves+3.524
rique+3.210
actly+2.575
cedes+2.551
 Dying+2.477
Neg logit contributions
 notor-3.276
KY-3.213
[/-3.098
etic-2.939
 awa-2.930
Tokenography
Token ID4867
Feature activation+5.46e+00

Pos logit contributions
 notor+0.512
KY+0.479
[/+0.455
 tattoo+0.375
etic+0.361
Neg logit contributions
selves-2.466
rique-2.327
cedes-2.078
actly-2.048
productive-2.012
Token,
Token ID11
Feature activation+3.02e+00

Pos logit contributions
[/+3.421
 notor+3.399
KY+3.334
 tattoo+3.111
etic+3.035
Neg logit contributions
selves-5.287
rique-4.839
cedes-4.149
actly-4.056
productive-3.940
Token
Token ID447
Feature activation+1.88e-01

Pos logit contributions
 notor+2.342
KY+2.290
[/+2.256
 tattoo+2.128
etic+2.094
Neg logit contributions
selves-2.527
rique-2.299
cedes-1.889
actly-1.846
productive-1.787
Token
Token ID251
Feature activation+1.81e+00

Pos logit contributions
 notor+0.184
KY+0.180
[/+0.178
 tattoo+0.170
etic+0.168
Neg logit contributions
selves-0.120
rique-0.106
cedes-0.081
actly-0.078
productive-0.074
Token Edwards
Token ID17381
Feature activation-3.85e+00

Pos logit contributions
 notor+1.378
KY+1.346
[/+1.319
 tattoo+1.243
etic+1.226
Neg logit contributions
selves-1.545
rique-1.410
cedes-1.163
actly-1.135
productive-1.100
Token Air
Token ID3701
Feature activation-1.03e+01

Pos logit contributions
selves+2.837
rique+2.549
cedes+2.015
actly+1.949
productive+1.895
Neg logit contributions
 notor-3.346
KY-3.237
[/-3.101
 awa-3.014
 tattoo-2.998
Token Force
Token ID5221
Feature activation-6.52e-01

Pos logit contributions
selves+7.161
rique+6.684
cedes+5.202
 Sending+4.893
productive+4.858
Neg logit contributions
 notor-9.085
KY-8.651
 awa-8.373
 tattoo-7.881
[/-7.844
Token Base
Token ID7308
Feature activation-4.87e-01

Pos logit contributions
selves+0.474
rique+0.427
cedes+0.337
actly+0.326
productive+0.314
Neg logit contributions
 notor-0.578
KY-0.565
[/-0.552
 tattoo-0.526
 awa-0.523
Token,
Token ID11
Feature activation-3.55e+00

Pos logit contributions
selves+0.440
rique+0.405
cedes+0.337
actly+0.329
productive+0.321
Neg logit contributions
 notor-0.346
KY-0.336
[/-0.325
 tattoo-0.307
 awa-0.306
Token http
Token ID2638
Feature activation+4.05e+00

Pos logit contributions
selves+3.022
rique+2.763
cedes+2.309
actly+2.239
productive+2.163
Neg logit contributions
 notor-2.634
KY-2.559
[/-2.457
 tattoo-2.364
 awa-2.360
Token://
Token ID1378
Feature activation+4.46e+00

Pos logit contributions
 notor+2.557
KY+2.556
[/+2.525
 tattoo+2.320
etic+2.301
Neg logit contributions
selves-3.875
rique-3.591
cedes-3.038
actly-2.965
productive-2.892
Token Wednesday
Token ID3583
Feature activation-3.08e+00

Pos logit contributions
selves+0.400
rique+0.358
cedes+0.291
actly+0.282
productive+0.273
Neg logit contributions
 notor-0.424
KY-0.415
[/-0.406
 tattoo-0.384
 awa-0.383
Token steps
Token ID4831
Feature activation+1.51e+00

Pos logit contributions
selves+2.449
rique+2.189
cedes+1.791
actly+1.744
productive+1.680
Neg logit contributions
 notor-2.514
KY-2.443
[/-2.368
 awa-2.262
etic-2.255
Token they
Token ID484
Feature activation-5.42e+00

Pos logit contributions
 notor+1.189
KY+1.164
[/+1.144
 tattoo+1.080
etic+1.066
Neg logit contributions
selves-1.247
rique-1.136
cedes-0.928
actly-0.905
productive-0.875
Token
Token ID447
Feature activation+2.79e+00

Pos logit contributions
selves+3.957
rique+3.333
cedes+2.671
actly+2.643
acons+2.528
Neg logit contributions
 notor-4.802
KY-4.577
[/-4.491
etic-4.292
 awa-4.277
Token
Token ID247
Feature activation+1.94e+00

Pos logit contributions
 notor+2.760
KY+2.747
[/+2.716
 tattoo+2.581
etic+2.552
Neg logit contributions
selves-1.699
rique-1.489
cedes-1.118
actly-1.077
productive-1.019
Tokenve
Token ID303
Feature activation-9.35e+00

Pos logit contributions
 notor+1.423
KY+1.392
[/+1.366
 tattoo+1.283
etic+1.265
Neg logit contributions
selves-1.693
rique-1.555
cedes-1.291
actly-1.261
productive-1.223
Token taken
Token ID2077
Feature activation-3.00e+00

Pos logit contributions
selves+7.018
rique+5.870
cedes+5.097
actly+4.900
productive+4.697
Neg logit contributions
 notor-7.698
KY-7.449
[/-6.894
etic-6.820
 awa-6.775
Token to
Token ID284
Feature activation-2.08e-01

Pos logit contributions
selves+2.269
rique+2.018
cedes+1.624
actly+1.560
productive+1.515
Neg logit contributions
 notor-2.569
KY-2.509
[/-2.419
 awa-2.302
etic-2.298
Token stop
Token ID2245
Feature activation-2.38e+00

Pos logit contributions
selves+0.183
rique+0.166
cedes+0.139
actly+0.135
productive+0.131
Neg logit contributions
 notor-0.153
KY-0.146
[/-0.143
 awa-0.134
cale-0.133
Token the
Token ID262
Feature activation-6.76e-01

Pos logit contributions
selves+2.171
rique+1.963
cedes+1.676
actly+1.622
productive+1.580
Neg logit contributions
 notor-1.663
KY-1.610
[/-1.568
 awa-1.464
etic-1.452
Token spread
Token ID4104
Feature activation+4.75e+00

Pos logit contributions
selves+0.691
rique+0.632
cedes+0.546
actly+0.535
productive+0.526
Neg logit contributions
 notor-0.399
KY-0.387
[/-0.375
 awa-0.346
etic-0.341