TOP ACTIVATIONS
MAX = 1.377

a grid : void V ort on Sim ::
coalition . Added into the mix , the internal debate in
attack to threaten to cut off funding for Planned Parenthood ,
readable zones in passport images # import the necessary packages from
" I apologized to [ Love ]," O ly ny
too troublesome , just get hold of a battery operated portable
reducing fan speeds . We tested the single card in OC
" Why put your factory [ in Britain ] when you
s getting a little too complicated . But
We also want to make sure that we do not have
s already two people [ who have ] had it
" I told [ Green ] almost every play
probably not sensible to get too worked up about the details
perhaps they shouldn 't celebrate too quickly . And don 't
oper . We don 't [ usually ] go into [
tomorrow with Est eb an [ O con , his new
. 8 and Tyr ant opened with 2 . 1 mil
the Beer Festival . Make sure to get your tickets !
are also still in the mix for Ad ew ale Om
# load the image , resize it , and convert it

MIN ACTIVATIONS
MIN = -1.514

No matter where we live we are
. Source : Anonymous <|endoftext|> T AM PA ,
aforementioned movie . Not only that , but apparently ,
to this guy . Not only did he plan a huge
Email your friends . No matter your social media / communication
the animal kingdom , no matter how young or adorable the
unprecedented . Not only is the punishment unusually harsh
the closest thing . Not only is the dummy programmed to
s face it , no matter what I took a photograph
in L NG , no matter how it s
collective ire - " y up , they knocked down (
be too random , or b ) It would have only
script ions No matter where you are in the
bad , dangerous guy no matter how am iable he seemed
policy . US AT OD AY OP IN ION
among gay teens ? Not only did they want to escape
- January 4 , 2010 User reviews are a bad idea
future ahead of it no matter what platform you will be
to life , no matter what the conditions , are
Ex Chief Minister of H ary ana . On 18 .

INTERVAL 0.654 - 1.377
CONTAINS 0.551%

their creation works , they tested their design by exposing it
953 16 27 145 Flat buff ers Reading 106 378 46
highlights Secretary of Defense Chuck Hag el will step down ,
reports to make the connection between Russian hackers and the Democratic
and make a few subtle changes such as not using technology

INTERVAL -0.069 - 0.654
CONTAINS 61.078%

In . Call 440 - 327 - 9 595
University of South Australia , an Emer itus Professor of Economics
At & T variant of the Nexus 6 .
. x . x . 143 10 33 455 x .
wrong with the notion of reason and a different voice approach

INTERVAL -0.792 - -0.069
CONTAINS 38.219%

denied wrongdoing . Pr ather s attorney
something else . B OM B 1 d
"). Male - to - female ( M TF ) transgender
least 19 states or territories , did not immediately respond to
employment paperwork , or evidence of appointments for those who do

INTERVAL -1.514 - -0.792
CONTAINS 0.152%

ing on the pure unf il tered visual emotions , Tak
Les Imp rompt us de Lac an This
will keep being cited further right on the political spectrum to
on Al cat raz . Only the beginning .
. Only ID I got is a
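
NOTE ON INTERVALS
The four INTERVAL edges above look like an equal-width split of the range from MIN to MAX. The dump does not say how the bins were chosen, so this is an inference from the printed numbers, not a documented rule. The sketch below recomputes the edges from the rounded MIN and MAX and compares them with the printed ones. The helper name `equal_width_edges` and the 0.0015 tolerance are illustrative assumptions; the tolerance allows for MIN and MAX being shown to only three decimals.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 edges that split [lo, hi] into equal-width bins."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# MIN and MAX exactly as printed in this dump (already rounded).
MIN_ACT, MAX_ACT = -1.514, 1.377

# Interval edges as printed in the dump, lowest to highest.
printed_edges = [-1.514, -0.792, -0.069, 0.654, 1.377]

# CONTAINS percentages as printed, lowest interval to highest.
printed_shares = [0.152, 38.219, 61.078, 0.551]

edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Largest gap between a recomputed edge and the printed edge.
max_gap = max(abs(e - p) for e, p in zip(edges, printed_edges))

# Hypothetical tolerance covering three-decimal rounding of MIN and MAX.
edges_match = max_gap < 0.0015

# The four shares should account for every sampled token.
total_share = sum(printed_shares)

for low, high in zip(edges, edges[1:]):
    print(f"INTERVAL {low:.3f} - {high:.3f}")
print(f"edges match printed values within tolerance: {edges_match}")
print(f"total share: {total_share:.3f}%")
```

If the equal-width reading holds, the two outer intervals are the distribution tails (0.152% and 0.551% of tokens), and about 99.3% of tokens fall in the two middle intervals.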
Token a
Token ID257
Feature activation-2.84e-01

Pos logit contributions
advertising+0.188
rolet+0.187
Attorney+0.186
VK+0.177
jury+0.173
Neg logit contributions
antha-0.174
iliated-0.162
odo-0.160
 coerc-0.159
urrent-0.155
Token grid
Token ID10706
Feature activation+1.04e-01

Pos logit contributions
antha+0.973
iliated+0.910
 coerc+0.895
odo+0.884
urrent+0.866
Neg logit contributions
advertising-1.166
rolet-1.160
Attorney-1.154
VK-1.095
jury-1.071
Token:
Token ID25
Feature activation-1.10e-01

Pos logit contributions
rolet+0.359
advertising+0.359
Attorney+0.355
VK+0.334
jury+0.326
Neg logit contributions
antha-0.421
iliated-0.396
 coerc-0.390
odo-0.388
urrent-0.380
Token
Token ID198
Feature activation-1.87e-01

Pos logit contributions
antha+0.395
iliated+0.374
 coerc+0.366
odo+0.362
urrent+0.354
Neg logit contributions
advertising-0.431
rolet-0.430
Attorney-0.427
VK-0.405
jury-0.395
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
antha+0.685
 coerc+0.646
iliated+0.645
odo+0.618
urrent+0.612
Neg logit contributions
advertising-0.741
Attorney-0.731
rolet-0.722
VK-0.685
jury-0.674
Tokenvoid
Token ID19382
Feature activation+1.38e+00

Pos logit contributions
antha+0.425
iliated+0.402
 coerc+0.400
odo+0.391
 Fargo+0.386
Neg logit contributions
advertising-0.381
Attorney-0.376
rolet-0.374
VK-0.354
jury-0.344
Token V
Token ID569
Feature activation+4.19e-01

Pos logit contributions
rolet+4.463
Attorney+4.437
advertising+4.430
VK+4.352
 pg+4.115
Neg logit contributions
antha-5.423
iliated-5.284
odo-5.023
 coerc-4.959
 Fargo-4.884
Tokenort
Token ID419
Feature activation+6.90e-01

Pos logit contributions
advertising+1.773
Attorney+1.772
rolet+1.768
VK+1.674
jury+1.627
Neg logit contributions
antha-1.367
 coerc-1.337
iliated-1.285
 Fargo-1.271
 Dortmund-1.209
Tokenon
Token ID261
Feature activation+4.60e-01

Pos logit contributions
rolet+2.066
advertising+2.056
Attorney+2.051
VK+1.900
jury+1.838
Neg logit contributions
antha-3.071
iliated-2.973
 coerc-2.950
 Fargo-2.864
odo-2.836
TokenSim
Token ID8890
Feature activation-1.22e-01

Pos logit contributions
advertising+1.466
Attorney+1.457
rolet+1.448
VK+1.356
jury+1.307
Neg logit contributions
antha-1.989
iliated-1.905
 coerc-1.879
odo-1.849
 Fargo-1.814
Token::
Token ID3712
Feature activation+5.18e-01

Pos logit contributions
antha+0.549
iliated+0.522
 coerc+0.519
odo+0.511
urrent+0.505
Neg logit contributions
advertising-0.375
rolet-0.369
Attorney-0.368
VK-0.340
jury-0.331
Token coalition
Token ID9906
Feature activation+6.58e-02

Pos logit contributions
antha+0.064
iliated+0.060
 coerc+0.059
odo+0.058
urrent+0.058
Neg logit contributions
advertising-0.069
Attorney-0.068
rolet-0.068
VK-0.064
jury-0.064
Token.
Token ID13
Feature activation-8.00e-02

Pos logit contributions
advertising+0.248
Attorney+0.246
rolet+0.242
VK+0.227
jury+0.225
Neg logit contributions
antha-0.251
iliated-0.233
 coerc-0.230
odo-0.228
urrent-0.224
Token Added
Token ID10687
Feature activation-5.99e-01

Pos logit contributions
antha+0.311
iliated+0.298
 coerc+0.297
odo+0.285
urrent+0.280
Neg logit contributions
advertising-0.299
Attorney-0.297
rolet-0.291
VK-0.276
jury-0.270
Token into
Token ID656
Feature activation-1.00e-01

Pos logit contributions
antha+1.921
iliated+1.784
 coerc+1.771
odo+1.718
urrent+1.697
Neg logit contributions
advertising-2.617
Attorney-2.582
rolet-2.576
VK-2.447
jury-2.422
Token the
Token ID262
Feature activation+3.74e-01

Pos logit contributions
antha+0.394
 coerc+0.371
iliated+0.370
odo+0.359
urrent+0.356
Neg logit contributions
advertising-0.364
Attorney-0.360
rolet-0.359
VK-0.337
jury-0.334
Token mix
Token ID5022
Feature activation+1.38e+00

Pos logit contributions
advertising+1.006
rolet+0.985
Attorney+0.971
VK+0.904
jury+0.903
Neg logit contributions
antha-1.786
iliated-1.701
 coerc-1.674
urrent-1.649
odo-1.641
Token,
Token ID11
Feature activation-8.49e-02

Pos logit contributions
Attorney+4.522
advertising+4.464
rolet+4.335
jury+4.092
VK+4.046
Neg logit contributions
antha-5.538
 coerc-5.202
iliated-5.117
odo-5.069
urrent-4.913
Token the
Token ID262
Feature activation-3.86e-02

Pos logit contributions
antha+0.316
iliated+0.297
 coerc+0.296
odo+0.288
urrent+0.282
Neg logit contributions
advertising-0.325
Attorney-0.324
rolet-0.317
VK-0.300
jury-0.299
Token internal
Token ID5387
Feature activation-2.41e-01

Pos logit contributions
antha+0.145
iliated+0.136
 coerc+0.134
odo+0.131
urrent+0.129
Neg logit contributions
advertising-0.146
Attorney-0.145
rolet-0.144
VK-0.136
jury-0.135
Token debate
Token ID4384
Feature activation-3.88e-01

Pos logit contributions
antha+0.928
iliated+0.871
 coerc+0.861
odo+0.846
urrent+0.835
Neg logit contributions
advertising-0.894
rolet-0.888
Attorney-0.883
VK-0.824
jury-0.819
Token in
Token ID287
Feature activation-4.04e-02

Pos logit contributions
antha+1.514
iliated+1.410
 coerc+1.404
odo+1.381
urrent+1.351
Neg logit contributions
advertising-1.437
Attorney-1.424
rolet-1.409
VK-1.322
jury-1.309
Token attack
Token ID1368
Feature activation+1.98e-01

Pos logit contributions
antha+0.143
iliated+0.136
 coerc+0.133
odo+0.132
urrent+0.132
Neg logit contributions
advertising-0.106
Attorney-0.104
rolet-0.101
VK-0.095
jury-0.093
Token to
Token ID284
Feature activation+1.18e-01

Pos logit contributions
advertising+0.770
Attorney+0.766
rolet+0.753
VK+0.703
jury+0.700
Neg logit contributions
antha-0.727
iliated-0.685
 coerc-0.669
odo-0.665
urrent-0.658
Token threaten
Token ID16180
Feature activation+1.24e-01

Pos logit contributions
advertising+0.431
Attorney+0.428
rolet+0.422
VK+0.401
jury+0.391
Neg logit contributions
antha-0.462
iliated-0.433
odo-0.419
urrent-0.417
 coerc-0.414
Token to
Token ID284
Feature activation-3.00e-02

Pos logit contributions
advertising+0.473
Attorney+0.473
rolet+0.466
VK+0.437
jury+0.429
Neg logit contributions
antha-0.463
iliated-0.433
 coerc-0.431
odo-0.425
urrent-0.417
Token cut
Token ID2005
Feature activation+5.96e-02

Pos logit contributions
antha+0.120
iliated+0.112
odo+0.110
 coerc+0.109
urrent+0.108
Neg logit contributions
advertising-0.107
Attorney-0.107
rolet-0.105
VK-0.099
jury-0.097
Token off
Token ID572
Feature activation+1.33e+00

Pos logit contributions
advertising+0.204
Attorney+0.201
rolet+0.200
VK+0.185
jury+0.180
Neg logit contributions
antha-0.248
iliated-0.234
 coerc-0.232
odo-0.229
urrent-0.226
Token funding
Token ID4918
Feature activation+2.17e-01

Pos logit contributions
advertising+4.757
Attorney+4.721
rolet+4.510
VK+4.224
jury+4.205
Neg logit contributions
antha-4.902
iliated-4.597
odo-4.580
 coerc-4.490
urrent-4.443
Token for
Token ID329
Feature activation+1.84e-01

Pos logit contributions
advertising+0.873
Attorney+0.864
rolet+0.849
VK+0.809
cow+0.783
Neg logit contributions
antha-0.767
iliated-0.713
 coerc-0.708
odo-0.703
urrent-0.690
Token Planned
Token ID18451
Feature activation-2.32e-01

Pos logit contributions
advertising+0.741
Attorney+0.735
rolet+0.728
VK+0.687
jury+0.673
Neg logit contributions
antha-0.644
iliated-0.597
 coerc-0.593
odo-0.590
urrent-0.579
Token Parenthood
Token ID18461
Feature activation-5.47e-01

Pos logit contributions
antha+0.844
iliated+0.791
 coerc+0.787
odo+0.769
urrent+0.756
Neg logit contributions
advertising-0.914
Attorney-0.903
rolet-0.901
VK-0.848
jury-0.831
Token,
Token ID11
Feature activation+6.61e-02

Pos logit contributions
antha+1.971
iliated+1.849
 coerc+1.832
odo+1.799
urrent+1.771
Neg logit contributions
advertising-2.159
rolet-2.135
Attorney-2.134
VK-2.006
jury-1.966
Tokenreadable
Token ID46155
Feature activation-3.74e-01

Pos logit contributions
antha+0.479
 coerc+0.453
iliated+0.453
odo+0.439
 Fargo+0.438
Neg logit contributions
advertising-0.376
Attorney-0.364
rolet-0.360
VK-0.335
jury-0.331
Token zones
Token ID14123
Feature activation-5.60e-02

Pos logit contributions
antha+1.186
iliated+1.097
 coerc+1.070
odo+1.060
urrent+1.039
Neg logit contributions
advertising-1.647
rolet-1.631
Attorney-1.626
VK-1.550
jury-1.515
Token in
Token ID287
Feature activation-1.53e-01

Pos logit contributions
antha+0.193
iliated+0.180
 coerc+0.178
odo+0.175
urrent+0.172
Neg logit contributions
advertising-0.230
rolet-0.228
Attorney-0.228
VK-0.215
jury-0.211
Token passport
Token ID17981
Feature activation-4.72e-03

Pos logit contributions
antha+0.671
iliated+0.636
 coerc+0.628
odo+0.621
urrent+0.613
Neg logit contributions
advertising-0.481
Attorney-0.475
rolet-0.475
VK-0.440
jury-0.430
Token images
Token ID4263
Feature activation+5.55e-01

Pos logit contributions
antha+0.016
iliated+0.015
 coerc+0.015
odo+0.014
urrent+0.014
Neg logit contributions
advertising-0.020
rolet-0.020
Attorney-0.019
VK-0.019
jury-0.018
Token #
Token ID1303
Feature activation+1.29e+00

Pos logit contributions
advertising+1.366
rolet+1.354
Attorney+1.334
VK+1.221
jury+1.172
Neg logit contributions
antha-2.780
iliated-2.696
 coerc-2.622
odo-2.618
urrent-2.577
Token import
Token ID1330
Feature activation+5.95e-01

Pos logit contributions
rolet+3.564
advertising+3.485
Attorney+3.476
VK+3.205
cow+3.177
Neg logit contributions
antha-5.759
iliated-5.648
odo-5.411
 coerc-5.382
 Fargo-5.287
Token the
Token ID262
Feature activation-7.69e-02

Pos logit contributions
Attorney+2.085
advertising+2.060
rolet+2.013
VK+1.919
jury+1.899
Neg logit contributions
antha-2.384
iliated-2.235
 coerc-2.210
odo-2.140
urrent-2.130
Token necessary
Token ID3306
Feature activation-2.97e-01

Pos logit contributions
antha+0.302
iliated+0.282
 coerc+0.281
odo+0.275
urrent+0.273
Neg logit contributions
advertising-0.281
Attorney-0.277
rolet-0.276
VK-0.258
jury-0.255
Token packages
Token ID10392
Feature activation+2.33e-01

Pos logit contributions
antha+0.885
iliated+0.818
 coerc+0.801
odo+0.778
urrent+0.768
Neg logit contributions
advertising-1.375
Attorney-1.355
rolet-1.351
VK-1.284
jury-1.259
Token from
Token ID422
Feature activation-8.66e-02

Pos logit contributions
advertising+0.900
Attorney+0.889
rolet+0.876
VK+0.830
jury+0.812
Neg logit contributions
antha-0.864
iliated-0.812
 coerc-0.805
odo-0.779
urrent-0.774
Token
Token ID198
Feature activation-1.43e-01

Pos logit contributions
antha+0.071
 coerc+0.068
iliated+0.067
odo+0.064
urrent+0.064
Neg logit contributions
advertising-0.072
Attorney-0.071
rolet-0.069
VK-0.066
jury-0.065
Token"
Token ID1
Feature activation-4.43e-02

Pos logit contributions
antha+0.399
 coerc+0.367
iliated+0.363
odo+0.348
urrent+0.341
Neg logit contributions
advertising-0.697
Attorney-0.691
rolet-0.678
VK-0.651
jury-0.642
TokenI
Token ID40
Feature activation+1.64e-01

Pos logit contributions
antha+0.156
 coerc+0.147
iliated+0.146
odo+0.141
urrent+0.139
Neg logit contributions
advertising-0.183
Attorney-0.182
rolet-0.177
VK-0.170
jury-0.166
Token apologized
Token ID22893
Feature activation-2.18e-01

Pos logit contributions
rolet+0.623
Attorney+0.623
advertising+0.621
jury+0.584
VK+0.570
Neg logit contributions
antha-0.611
iliated-0.579
 coerc-0.563
urrent-0.550
odo-0.545
Token to
Token ID284
Feature activation-6.55e-02

Pos logit contributions
antha+0.764
iliated+0.705
 coerc+0.687
urrent+0.686
odo+0.682
Neg logit contributions
advertising-0.885
Attorney-0.884
rolet-0.882
jury-0.826
VK-0.821
Token [
Token ID685
Feature activation+1.24e+00

Pos logit contributions
antha+0.251
iliated+0.236
 coerc+0.234
odo+0.229
urrent+0.227
Neg logit contributions
advertising-0.245
Attorney-0.243
rolet-0.242
VK-0.225
jury-0.224
TokenLove
Token ID18565
Feature activation+2.68e-01

Pos logit contributions
Attorney+3.486
advertising+3.414
rolet+3.074
jury+3.004
cow+2.958
Neg logit contributions
antha-6.189
 coerc-5.942
iliated-5.731
 Fargo-5.674
odo-5.542
Token],"
Token ID17241
Feature activation-2.78e-02

Pos logit contributions
advertising+0.818
Attorney+0.807
rolet+0.806
VK+0.736
jury+0.734
Neg logit contributions
antha-1.207
iliated-1.139
 coerc-1.138
odo-1.114
urrent-1.102
Token O
Token ID440
Feature activation+2.33e-01

Pos logit contributions
antha+0.100
 coerc+0.093
iliated+0.093
odo+0.089
urrent+0.088
Neg logit contributions
advertising-0.112
rolet-0.110
Attorney-0.110
jury-0.103
VK-0.102
Tokenly
Token ID306
Feature activation-4.03e-01

Pos logit contributions
advertising+0.930
rolet+0.904
Attorney+0.892
jury+0.849
VK+0.840
Neg logit contributions
antha-0.853
 coerc-0.823
iliated-0.783
 Fargo-0.774
urrent-0.764
Tokenny
Token ID3281
Feature activation-1.50e-01

Pos logit contributions
antha+2.423
 coerc+2.332
iliated+2.332
odo+2.291
urrent+2.276
Neg logit contributions
advertising-0.628
rolet-0.603
Attorney-0.602
VK-0.506
jury-0.486
Token too
Token ID1165
Feature activation+5.93e-01

Pos logit contributions
advertising+0.059
Attorney+0.058
rolet+0.057
VK+0.055
jury+0.053
Neg logit contributions
antha-0.054
iliated-0.050
 coerc-0.049
urrent-0.049
odo-0.048
Token troublesome
Token ID35778
Feature activation-4.08e-01

Pos logit contributions
advertising+2.048
Attorney+2.033
rolet+1.957
VK+1.866
jury+1.866
Neg logit contributions
antha-2.354
iliated-2.243
urrent-2.193
odo-2.155
 coerc-2.148
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
antha+1.458
iliated+1.366
 coerc+1.345
odo+1.327
urrent+1.309
Neg logit contributions
advertising-1.629
Attorney-1.610
rolet-1.609
VK-1.515
jury-1.486
Token just
Token ID655
Feature activation-4.56e-01

Pos logit contributions
antha+0.417
iliated+0.392
odo+0.380
 coerc+0.379
urrent+0.377
Neg logit contributions
advertising-0.435
Attorney-0.432
rolet-0.431
VK-0.407
jury-0.397
Token get
Token ID651
Feature activation+4.18e-02

Pos logit contributions
antha+1.793
iliated+1.692
 coerc+1.665
odo+1.644
urrent+1.628
Neg logit contributions
advertising-1.647
rolet-1.636
Attorney-1.629
VK-1.526
jury-1.490
Token hold
Token ID1745
Feature activation+1.22e+00

Pos logit contributions
advertising+0.166
Attorney+0.164
rolet+0.163
VK+0.153
jury+0.150
Neg logit contributions
antha-0.148
iliated-0.139
 coerc-0.136
urrent-0.135
odo-0.135
Token of
Token ID286
Feature activation+4.05e-01

Pos logit contributions
Attorney+4.207
advertising+4.143
rolet+3.978
VK+3.783
jury+3.752
Neg logit contributions
antha-4.825
 coerc-4.661
iliated-4.600
urrent-4.451
odo-4.319
Token a
Token ID257
Feature activation+3.55e-01

Pos logit contributions
advertising+1.511
Attorney+1.494
rolet+1.489
VK+1.408
jury+1.389
Neg logit contributions
antha-1.491
iliated-1.422
urrent-1.377
odo-1.374
 coerc-1.373
Token battery
Token ID6555
Feature activation+9.06e-02

Pos logit contributions
rolet+1.273
advertising+1.267
Attorney+1.266
VK+1.195
jury+1.170
Neg logit contributions
antha-1.376
iliated-1.301
odo-1.257
urrent-1.246
 coerc-1.243
Token operated
Token ID12228
Feature activation-1.99e-01

Pos logit contributions
rolet+0.329
Attorney+0.328
advertising+0.328
jury+0.304
VK+0.303
Neg logit contributions
antha-0.352
iliated-0.330
 coerc-0.321
odo-0.321
urrent-0.317
Token portable
Token ID17726
Feature activation-2.77e-01

Pos logit contributions
antha+0.804
iliated+0.764
 coerc+0.741
odo+0.739
urrent+0.729
Neg logit contributions
advertising-0.701
rolet-0.699
Attorney-0.694
VK-0.648
jury-0.634
Token reducing
Token ID8868
Feature activation+1.67e-01

Pos logit contributions
antha+1.466
iliated+1.380
 coerc+1.369
odo+1.346
urrent+1.326
Neg logit contributions
advertising-1.428
rolet-1.412
Attorney-1.411
VK-1.321
jury-1.294
Token fan
Token ID4336
Feature activation-3.53e-02

Pos logit contributions
advertising+0.583
rolet+0.570
Attorney+0.568
VK+0.536
jury+0.523
Neg logit contributions
antha-0.679
iliated-0.644
 coerc-0.631
odo-0.626
urrent-0.618
Token speeds
Token ID12055
Feature activation-2.27e-01

Pos logit contributions
antha+0.141
iliated+0.133
 coerc+0.130
odo+0.129
urrent+0.128
Neg logit contributions
advertising-0.127
rolet-0.121
Attorney-0.121
VK-0.116
jury-0.113
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
antha+0.789
iliated+0.738
 coerc+0.727
odo+0.714
urrent+0.707
Neg logit contributions
advertising-0.931
Attorney-0.910
rolet-0.910
VK-0.861
jury-0.847
Token We
Token ID775
Feature activation+2.20e-01

Pos logit contributions
advertising+0.530
rolet+0.521
Attorney+0.520
VK+0.491
jury+0.476
Neg logit contributions
antha-0.525
iliated-0.505
 coerc-0.498
odo-0.481
urrent-0.475
Token tested
Token ID6789
Feature activation+1.22e+00

Pos logit contributions
advertising+0.855
rolet+0.845
Attorney+0.836
VK+0.795
jury+0.765
Neg logit contributions
antha-0.803
iliated-0.754
 coerc-0.753
odo-0.719
urrent-0.717
Token the
Token ID262
Feature activation+1.26e-01

Pos logit contributions
advertising+4.163
rolet+4.058
Attorney+4.009
VK+3.894
jury+3.794
Neg logit contributions
antha-4.717
iliated-4.477
 coerc-4.352
odo-4.320
urrent-4.286
Token single
Token ID2060
Feature activation+2.13e-02

Pos logit contributions
advertising+0.512
rolet+0.506
Attorney+0.503
VK+0.475
jury+0.468
Neg logit contributions
antha-0.441
iliated-0.415
 coerc-0.406
odo-0.399
urrent-0.393
Token card
Token ID2657
Feature activation+1.91e-01

Pos logit contributions
advertising+0.081
rolet+0.080
Attorney+0.080
VK+0.076
jury+0.074
Neg logit contributions
antha-0.079
iliated-0.075
 coerc-0.074
odo-0.072
urrent-0.072
Token in
Token ID287
Feature activation-7.19e-02

Pos logit contributions
advertising+0.737
rolet+0.724
Attorney+0.717
VK+0.683
jury+0.669
Neg logit contributions
antha-0.699
iliated-0.656
 coerc-0.650
odo-0.633
urrent-0.627
Token OC
Token ID24775
Feature activation+1.01e-01

Pos logit contributions
antha+0.262
iliated+0.246
 coerc+0.243
odo+0.239
urrent+0.235
Neg logit contributions
advertising-0.281
rolet-0.277
Attorney-0.276
VK-0.261
jury-0.257
Token "
Token ID366
Feature activation-9.20e-03

Pos logit contributions
advertising+0.467
Attorney+0.457
rolet+0.445
VK+0.420
jury+0.420
Neg logit contributions
antha-0.490
 coerc-0.471
iliated-0.467
odo-0.446
urrent-0.444
TokenWhy
Token ID5195
Feature activation+4.51e-01

Pos logit contributions
antha+0.035
 coerc+0.034
iliated+0.034
odo+0.032
urrent+0.032
Neg logit contributions
advertising-0.036
Attorney-0.036
rolet-0.034
VK-0.033
jury-0.032
Token put
Token ID1234
Feature activation-5.39e-02

Pos logit contributions
advertising+1.659
Attorney+1.646
rolet+1.633
VK+1.565
jury+1.536
Neg logit contributions
antha-1.696
iliated-1.611
urrent-1.554
odo-1.523
 coerc-1.518
Token your
Token ID534
Feature activation-1.58e-01

Pos logit contributions
antha+0.205
iliated+0.191
odo+0.186
 coerc+0.185
urrent+0.184
Neg logit contributions
advertising-0.207
Attorney-0.200
rolet-0.197
VK-0.189
jury-0.188
Token factory
Token ID8860
Feature activation+1.82e-01

Pos logit contributions
antha+0.605
iliated+0.567
odo+0.553
urrent+0.550
 coerc+0.546
Neg logit contributions
advertising-0.600
Attorney-0.589
rolet-0.580
VK-0.552
jury-0.542
Token [
Token ID685
Feature activation+1.20e+00

Pos logit contributions
advertising+0.705
rolet+0.674
Attorney+0.673
VK+0.634
jury+0.632
Neg logit contributions
antha-0.671
iliated-0.634
 coerc-0.623
odo-0.619
urrent-0.617
Tokenin
Token ID259
Feature activation+1.81e-01

Pos logit contributions
advertising+4.364
Attorney+4.179
rolet+3.992
VK+3.864
jury+3.864
Neg logit contributions
antha-4.798
 coerc-4.662
iliated-4.503
odo-4.369
 Fargo-4.341
Token Britain
Token ID5491
Feature activation-2.06e-01

Pos logit contributions
advertising+0.718
rolet+0.699
Attorney+0.697
jury+0.662
VK+0.661
Neg logit contributions
antha-0.653
 coerc-0.606
iliated-0.605
odo-0.592
urrent-0.589
Token]
Token ID60
Feature activation+2.61e-01

Pos logit contributions
antha+0.816
iliated+0.767
 coerc+0.762
odo+0.747
urrent+0.741
Neg logit contributions
advertising-0.748
rolet-0.736
Attorney-0.734
VK-0.685
jury-0.672
Token when
Token ID618
Feature activation-3.35e-01

Pos logit contributions
advertising+0.996
Attorney+0.962
rolet+0.959
VK+0.903
jury+0.900
Neg logit contributions
antha-0.970
iliated-0.915
 coerc-0.898
urrent-0.892
odo-0.891
Token you
Token ID345
Feature activation-1.69e-01

Pos logit contributions
antha+1.206
iliated+1.135
 coerc+1.111
odo+1.098
urrent+1.085
Neg logit contributions
advertising-1.340
Attorney-1.312
rolet-1.295
VK-1.239
jury-1.213
Token
Token ID247
Feature activation-1.30e-01

Pos logit contributions
antha+0.535
 coerc+0.504
iliated+0.500
odo+0.493
 Fargo+0.483
Neg logit contributions
advertising-0.386
Attorney-0.381
rolet-0.376
VK-0.352
jury-0.347
Tokens
Token ID82
Feature activation-1.29e-02

Pos logit contributions
antha+0.440
iliated+0.413
 coerc+0.411
odo+0.400
urrent+0.392
Neg logit contributions
advertising-0.542
Attorney-0.531
rolet-0.527
VK-0.502
jury-0.496
Token getting
Token ID1972
Feature activation-2.87e-01

Pos logit contributions
antha+0.049
iliated+0.046
 coerc+0.046
urrent+0.045
odo+0.045
Neg logit contributions
advertising-0.049
Attorney-0.048
rolet-0.047
VK-0.044
jury-0.044
Token a
Token ID257
Feature activation+6.49e-02

Pos logit contributions
antha+1.005
iliated+0.944
 coerc+0.928
odo+0.913
urrent+0.908
Neg logit contributions
advertising-1.163
Attorney-1.143
rolet-1.140
VK-1.075
jury-1.056
Token little
Token ID1310
Feature activation+3.58e-01

Pos logit contributions
advertising+0.259
rolet+0.256
Attorney+0.255
VK+0.240
jury+0.237
Neg logit contributions
antha-0.231
iliated-0.217
 coerc-0.213
odo-0.210
urrent-0.209
Token too
Token ID1165
Feature activation+1.20e+00

Pos logit contributions
advertising+1.327
Attorney+1.301
rolet+1.272
VK+1.210
jury+1.195
Neg logit contributions
antha-1.360
iliated-1.301
 coerc-1.261
urrent-1.258
odo-1.237
Token complicated
Token ID8253
Feature activation-1.57e-02

Pos logit contributions
advertising+4.586
Attorney+4.421
rolet+4.278
jury+4.080
VK+4.067
Neg logit contributions
antha-4.265
iliated-4.086
urrent-3.969
 coerc-3.912
odo-3.839
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
antha+0.058
iliated+0.055
 coerc+0.054
urrent+0.053
odo+0.053
Neg logit contributions
advertising-0.060
Attorney-0.059
rolet-0.059
VK-0.055
jury-0.055
Token
Token ID198
Feature activation-8.79e-02

Pos logit contributions
antha+0.087
iliated+0.084
 coerc+0.083
urrent+0.079
odo+0.079
Neg logit contributions
advertising-0.098
Attorney-0.097
rolet-0.096
VK-0.091
jury-0.089
Token
Token ID198
Feature activation-1.20e-01

Pos logit contributions
antha+0.330
 coerc+0.316
iliated+0.313
odo+0.297
urrent+0.296
Neg logit contributions
advertising-0.343
Attorney-0.339
rolet-0.332
VK-0.316
jury-0.311
TokenBut
Token ID1537
Feature activation-5.39e-03

Pos logit contributions
antha+0.430
iliated+0.403
 coerc+0.402
odo+0.393
urrent+0.385
Neg logit contributions
advertising-0.483
Attorney-0.476
rolet-0.472
VK-0.447
jury-0.437
TokenWe
Token ID1135
Feature activation+2.92e-01

Pos logit contributions
advertising+0.450
Attorney+0.448
rolet+0.427
VK+0.412
jury+0.395
Neg logit contributions
antha-0.511
 coerc-0.490
iliated-0.488
odo-0.467
urrent-0.464
Token also
Token ID635
Feature activation+4.49e-01

Pos logit contributions
advertising+1.081
rolet+1.077
Attorney+1.073
VK+0.982
jury+0.978
Neg logit contributions
antha-1.122
iliated-1.053
urrent-1.002
 coerc-1.000
odo-0.989
Token want
Token ID765
Feature activation+4.36e-01

Pos logit contributions
rolet+1.687
advertising+1.686
Attorney+1.670
VK+1.545
jury+1.528
Neg logit contributions
antha-1.697
iliated-1.595
urrent-1.514
 coerc-1.489
odo-1.487
Token to
Token ID284
Feature activation+5.48e-02

Pos logit contributions
advertising+1.734
Attorney+1.722
rolet+1.695
VK+1.588
jury+1.565
Neg logit contributions
antha-1.545
iliated-1.425
 coerc-1.403
urrent-1.388
odo-1.376
Token make
Token ID787
Feature activation+4.57e-01

Pos logit contributions
advertising+0.209
rolet+0.209
Attorney+0.207
VK+0.193
jury+0.190
Neg logit contributions
antha-0.207
iliated-0.192
 coerc-0.185
urrent-0.184
odo-0.184
Token sure
Token ID1654
Feature activation+1.18e+00

Pos logit contributions
advertising+1.790
Attorney+1.765
rolet+1.730
VK+1.641
jury+1.618
Neg logit contributions
antha-1.670
iliated-1.534
 coerc-1.492
odo-1.481
urrent-1.479
Token that
Token ID326
Feature activation+2.69e-01

Pos logit contributions
advertising+3.635
Attorney+3.596
rolet+3.433
jury+3.233
VK+3.203
Neg logit contributions
antha-5.056
iliated-4.715
urrent-4.634
odo-4.622
 coerc-4.572
Token we
Token ID356
Feature activation+1.34e-01

Pos logit contributions
advertising+0.963
Attorney+0.950
rolet+0.920
VK+0.867
jury+0.864
Neg logit contributions
antha-1.077
iliated-1.005
odo-0.982
urrent-0.978
 coerc-0.975
Token do
Token ID466
Feature activation+2.19e-01

Pos logit contributions
rolet+0.501
advertising+0.500
Attorney+0.496
VK+0.458
jury+0.454
Neg logit contributions
antha-0.516
iliated-0.485
odo-0.463
urrent-0.462
 coerc-0.462
Token not
Token ID407
Feature activation+9.01e-02

Pos logit contributions
advertising+0.678
rolet+0.676
Attorney+0.670
VK+0.615
jury+0.602
Neg logit contributions
antha-0.981
iliated-0.930
 coerc-0.909
urrent-0.899
odo-0.898
Token have
Token ID423
Feature activation+2.04e-01

Pos logit contributions
rolet+0.345
advertising+0.344
Attorney+0.340
VK+0.320
jury+0.319
Neg logit contributions
antha-0.339
iliated-0.314
urrent-0.302
odo-0.299
 coerc-0.294
Token
Token ID247
Feature activation-3.35e-03

Pos logit contributions
antha+0.463
 coerc+0.436
iliated+0.435
odo+0.427
urrent+0.420
Neg logit contributions
advertising-0.351
Attorney-0.346
rolet-0.346
VK-0.320
jury-0.315
Tokens
Token ID82
Feature activation-2.59e-01

Pos logit contributions
antha+0.011
iliated+0.010
 coerc+0.010
odo+0.010
urrent+0.010
Neg logit contributions
advertising-0.014
Attorney-0.014
rolet-0.014
VK-0.013
jury-0.013
Token already
Token ID1541
Feature activation+3.13e-01

Pos logit contributions
antha+0.929
iliated+0.868
 coerc+0.861
odo+0.842
urrent+0.832
Neg logit contributions
advertising-1.032
Attorney-1.018
rolet-1.015
VK-0.956
jury-0.942
Token two
Token ID734
Feature activation-1.24e-01

Pos logit contributions
advertising+1.226
Attorney+1.208
rolet+1.191
VK+1.137
jury+1.116
Neg logit contributions
antha-1.135
iliated-1.059
 coerc-1.050
odo-1.029
urrent-1.011
Token people
Token ID661
Feature activation+1.60e-02

Pos logit contributions
antha+0.473
iliated+0.440
 coerc+0.440
odo+0.430
urrent+0.425
Neg logit contributions
advertising-0.467
Attorney-0.462
rolet-0.459
VK-0.433
jury-0.424
Token [
Token ID685
Feature activation+1.18e+00

Pos logit contributions
advertising+0.058
Attorney+0.057
rolet+0.057
VK+0.053
jury+0.052
Neg logit contributions
antha-0.063
iliated-0.059
 coerc-0.059
odo-0.058
urrent-0.057
Tokenwho
Token ID8727
Feature activation+3.12e-01

Pos logit contributions
advertising+4.229
Attorney+4.135
rolet+3.955
jury+3.815
VK+3.804
Neg logit contributions
antha-4.768
 coerc-4.528
iliated-4.349
odo-4.308
 Fargo-4.287
Token have
Token ID423
Feature activation-2.89e-01

Pos logit contributions
rolet+1.185
advertising+1.176
Attorney+1.166
VK+1.093
jury+1.082
Neg logit contributions
antha-1.159
 coerc-1.079
iliated-1.076
odo-1.048
urrent-1.027
Token]
Token ID60
Feature activation+3.43e-01

Pos logit contributions
antha+1.037
iliated+0.967
 coerc+0.963
odo+0.942
urrent+0.927
Neg logit contributions
advertising-1.145
rolet-1.135
Attorney-1.134
VK-1.067
jury-1.046
Token had
Token ID550
Feature activation+1.87e-01

Pos logit contributions
advertising+1.331
Attorney+1.312
rolet+1.301
VK+1.231
jury+1.208
Neg logit contributions
antha-1.251
 coerc-1.161
iliated-1.146
odo-1.136
urrent-1.108
Token it
Token ID340
Feature activation-3.67e-01

Pos logit contributions
advertising+0.695
Attorney+0.689
rolet+0.685
VK+0.648
jury+0.635
Neg logit contributions
antha-0.711
iliated-0.666
 coerc-0.663
odo-0.646
urrent-0.639
Token
Token ID198
Feature activation-1.90e-03

Pos logit contributions
advertising+0.601
Attorney+0.597
rolet+0.589
VK+0.561
jury+0.551
Neg logit contributions
antha-0.487
iliated-0.465
 coerc-0.460
urrent-0.438
odo-0.431
Token
Token ID198
Feature activation+2.60e-02

Pos logit contributions
antha+0.007
 coerc+0.007
iliated+0.007
odo+0.007
urrent+0.007
Neg logit contributions
advertising-0.007
Attorney-0.007
rolet-0.007
VK-0.007
jury-0.007
Token"
Token ID1
Feature activation+2.62e-01

Pos logit contributions
advertising+0.136
Attorney+0.134
rolet+0.133
VK+0.128
jury+0.126
Neg logit contributions
antha-0.063
 coerc-0.057
iliated-0.057
odo-0.054
urrent-0.052
TokenI
Token ID40
Feature activation+2.17e-01

Pos logit contributions
advertising+1.037
Attorney+1.033
rolet+0.978
VK+0.960
jury+0.925
Neg logit contributions
antha-1.002
 coerc-0.964
iliated-0.951
urrent-0.902
odo-0.899
Token told
Token ID1297
Feature activation-4.09e-01

Pos logit contributions
rolet+0.824
advertising+0.823
Attorney+0.820
jury+0.775
VK+0.753
Neg logit contributions
antha-0.807
iliated-0.762
urrent-0.730
 coerc-0.728
odo-0.712
Token [
Token ID685
Feature activation+1.18e+00

Pos logit contributions
antha+1.446
iliated+1.352
 coerc+1.336
odo+1.312
urrent+1.297
Neg logit contributions
advertising-1.653
Attorney-1.638
rolet-1.634
VK-1.537
jury-1.516
TokenGreen
Token ID13719
Feature activation+3.61e-01

Pos logit contributions
Attorney+4.371
advertising+4.328
rolet+4.022
jury+3.952
cow+3.941
Neg logit contributions
antha-4.665
 coerc-4.355
iliated-4.246
odo-4.188
 Fargo-4.138
Token]
Token ID60
Feature activation+7.83e-01

Pos logit contributions
advertising+1.321
Attorney+1.295
rolet+1.291
VK+1.241
jury+1.215
Neg logit contributions
antha-1.399
 coerc-1.331
iliated-1.312
odo-1.277
urrent-1.266
Token almost
Token ID2048
Feature activation+2.11e-01

Pos logit contributions
Attorney+2.860
advertising+2.853
rolet+2.826
jury+2.695
cow+2.666
Neg logit contributions
antha-2.944
iliated-2.744
urrent-2.735
 coerc-2.707
odo-2.673
Token every
Token ID790
Feature activation+1.99e-01

Pos logit contributions
advertising+0.716
Attorney+0.712
rolet+0.707
jury+0.659
VK+0.651
Neg logit contributions
antha-0.871
iliated-0.834
urrent-0.806
odo-0.796
 coerc-0.783
Token play
Token ID711
Feature activation+2.27e-01

Pos logit contributions
Attorney+0.636
advertising+0.631
rolet+0.625
jury+0.586
VK+0.573
Neg logit contributions
antha-0.870
iliated-0.809
 coerc-0.799
odo-0.797
urrent-0.792
Token probably
Token ID2192
Feature activation-5.55e-02

Pos logit contributions
antha+0.368
iliated+0.344
 coerc+0.340
odo+0.334
urrent+0.332
Neg logit contributions
advertising-0.371
Attorney-0.368
rolet-0.363
VK-0.341
jury-0.338
Token not
Token ID407
Feature activation-3.38e-01

Pos logit contributions
antha+0.212
iliated+0.199
 coerc+0.196
urrent+0.192
odo+0.192
Neg logit contributions
advertising-0.208
Attorney-0.207
rolet-0.203
VK-0.190
jury-0.190
Token sensible
Token ID20586
Feature activation-3.86e-01

Pos logit contributions
antha+1.189
iliated+1.101
 coerc+1.093
odo+1.073
urrent+1.060
Neg logit contributions
advertising-1.375
Attorney-1.360
rolet-1.355
VK-1.277
jury-1.256
Token to
Token ID284
Feature activation-7.18e-02

Pos logit contributions
antha+1.301
iliated+1.213
 coerc+1.200
odo+1.175
urrent+1.158
Neg logit contributions
advertising-1.616
rolet-1.599
Attorney-1.597
VK-1.507
jury-1.480
Token get
Token ID651
Feature activation-1.90e-02

Pos logit contributions
antha+0.280
iliated+0.261
 coerc+0.253
odo+0.252
urrent+0.250
Neg logit contributions
advertising-0.264
rolet-0.264
Attorney-0.262
VK-0.246
jury-0.241
Token too
Token ID1165
Feature activation+1.18e+00

Pos logit contributions
antha+0.068
iliated+0.063
 coerc+0.062
odo+0.062
urrent+0.061
Neg logit contributions
advertising-0.076
Attorney-0.075
rolet-0.073
VK-0.070
jury-0.069
Token worked
Token ID3111
Feature activation+4.04e-02

Pos logit contributions
advertising+4.422
Attorney+4.276
rolet+4.044
jury+4.009
cow+3.939
Neg logit contributions
antha-4.207
iliated-3.967
odo-3.871
urrent-3.857
 coerc-3.789
Token up
Token ID510
Feature activation-5.03e-01

Pos logit contributions
advertising+0.167
rolet+0.166
Attorney+0.165
VK+0.155
jury+0.153
Neg logit contributions
antha-0.139
iliated-0.129
 coerc-0.128
urrent-0.124
odo-0.124
Token about
Token ID546
Feature activation-2.51e-01

Pos logit contributions
antha+1.956
iliated+1.847
 coerc+1.832
odo+1.801
urrent+1.774
Neg logit contributions
advertising-1.839
rolet-1.820
Attorney-1.817
VK-1.701
jury-1.662
Token the
Token ID262
Feature activation-1.52e-01

Pos logit contributions
antha+0.932
iliated+0.871
 coerc+0.858
odo+0.851
urrent+0.838
Neg logit contributions
advertising-0.973
Attorney-0.960
rolet-0.950
VK-0.899
jury-0.887
Token details
Token ID3307
Feature activation-1.64e-01

Pos logit contributions
antha+0.568
iliated+0.534
 coerc+0.522
odo+0.520
urrent+0.511
Neg logit contributions
advertising-0.577
Attorney-0.573
rolet-0.566
VK-0.536
jury-0.532
Token perhaps
Token ID3737
Feature activation+3.71e-02

Pos logit contributions
advertising+0.131
Attorney+0.128
rolet+0.128
VK+0.121
jury+0.119
Neg logit contributions
antha-0.117
iliated-0.110
 coerc-0.109
urrent-0.106
odo-0.106
Token they
Token ID484
Feature activation-1.16e-02

Pos logit contributions
advertising+0.143
Attorney+0.140
rolet+0.138
VK+0.130
jury+0.129
Neg logit contributions
antha-0.138
iliated-0.131
 coerc-0.128
urrent-0.126
odo-0.126
Token shouldn
Token ID6584
Feature activation+4.88e-01

Pos logit contributions
antha+0.043
iliated+0.041
 coerc+0.040
odo+0.039
urrent+0.039
Neg logit contributions
advertising-0.045
rolet-0.044
Attorney-0.044
VK-0.041
jury-0.041
Token't
Token ID470
Feature activation+6.19e-01

Pos logit contributions
advertising+1.522
Attorney+1.484
rolet+1.434
VK+1.351
jury+1.339
Neg logit contributions
antha-2.241
 coerc-2.174
iliated-2.129
urrent-2.042
odo-2.024
Token celebrate
Token ID10648
Feature activation+4.44e-01

Pos logit contributions
rolet+2.395
advertising+2.380
Attorney+2.331
VK+2.218
jury+2.216
Neg logit contributions
antha-2.285
iliated-2.110
urrent-2.031
 coerc-2.007
odo-1.979
Token too
Token ID1165
Feature activation+1.17e+00

Pos logit contributions
advertising+1.641
Attorney+1.589
rolet+1.559
VK+1.494
jury+1.468
Neg logit contributions
antha-1.707
iliated-1.610
 coerc-1.592
odo-1.566
urrent-1.548
Token quickly
Token ID2952
Feature activation+4.92e-01

Pos logit contributions
advertising+4.258
Attorney+4.138
cow+4.045
rolet+3.976
jury+3.964
Neg logit contributions
antha-4.298
iliated-4.046
odo-3.987
urrent-3.900
odan-3.867
Token.
Token ID13
Feature activation+1.51e-01

Pos logit contributions
advertising+1.822
Attorney+1.765
rolet+1.732
jury+1.635
VK+1.633
Neg logit contributions
antha-1.918
iliated-1.802
 coerc-1.783
urrent-1.743
odo-1.734
Token And
Token ID843
Feature activation+2.05e-01

Pos logit contributions
advertising+0.585
Attorney+0.575
rolet+0.567
VK+0.536
jury+0.523
Neg logit contributions
antha-0.570
iliated-0.544
 coerc-0.542
odo-0.518
urrent-0.518
Token don
Token ID836
Feature activation+3.72e-01

Pos logit contributions
advertising+0.810
Attorney+0.796
rolet+0.794
VK+0.747
jury+0.734
Neg logit contributions
antha-0.746
iliated-0.698
 coerc-0.689
odo-0.675
urrent-0.674
Token't
Token ID470
Feature activation+3.71e-01

Pos logit contributions
advertising+1.169
Attorney+1.138
rolet+1.111
VK+1.049
jury+1.030
Neg logit contributions
antha-1.686
 coerc-1.637
iliated-1.606
urrent-1.537
odo-1.531
Tokenoper
Token ID3575
Feature activation+9.90e-02

Pos logit contributions
antha+0.725
iliated+0.685
 coerc+0.679
odo+0.652
urrent+0.650
Neg logit contributions
advertising-0.634
rolet-0.628
Attorney-0.624
VK-0.583
jury-0.571
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
advertising+0.379
Attorney+0.371
rolet+0.367
VK+0.348
jury+0.342
Neg logit contributions
antha-0.371
iliated-0.350
 coerc-0.343
odo-0.337
urrent-0.335
Token We
Token ID775
Feature activation+2.13e-01

Pos logit contributions
advertising+0.536
Attorney+0.524
rolet+0.519
VK+0.493
jury+0.479
Neg logit contributions
antha-0.539
iliated-0.515
 coerc-0.511
urrent-0.488
odo-0.484
Token don
Token ID836
Feature activation+4.79e-01

Pos logit contributions
advertising+0.838
rolet+0.828
Attorney+0.819
VK+0.763
jury+0.760
Neg logit contributions
antha-0.779
iliated-0.731
 coerc-0.715
urrent-0.696
odo-0.695
Token't
Token ID470
Feature activation+3.07e-01

Pos logit contributions
advertising+1.462
Attorney+1.432
rolet+1.369
VK+1.318
jury+1.286
Neg logit contributions
antha-2.237
 coerc-2.174
iliated-2.133
urrent-2.040
odo-2.019
Token [
Token ID685
Feature activation+1.15e+00

Pos logit contributions
advertising+1.144
rolet+1.133
Attorney+1.118
VK+1.052
jury+1.042
Neg logit contributions
antha-1.186
iliated-1.102
 coerc-1.063
urrent-1.058
odo-1.044
Tokenusually
Token ID23073
Feature activation-9.11e-02

Pos logit contributions
advertising+4.469
Attorney+4.263
rolet+4.128
jury+3.988
VK+3.936
Neg logit contributions
antha-4.395
 coerc-4.103
iliated-4.016
 Fargo-3.917
 Dortmund-3.847
Token]
Token ID60
Feature activation+7.57e-02

Pos logit contributions
antha+0.361
iliated+0.340
 coerc+0.333
odo+0.328
urrent+0.325
Neg logit contributions
advertising-0.331
rolet-0.326
Attorney-0.324
VK-0.303
jury-0.298
Token go
Token ID467
Feature activation+7.49e-02

Pos logit contributions
advertising+0.276
rolet+0.273
Attorney+0.270
VK+0.253
jury+0.250
Neg logit contributions
antha-0.300
iliated-0.280
 coerc-0.272
odo-0.269
urrent-0.268
Token into
Token ID656
Feature activation+2.17e-01

Pos logit contributions
advertising+0.312
rolet+0.303
Attorney+0.303
VK+0.287
jury+0.283
Neg logit contributions
antha-0.259
iliated-0.240
 coerc-0.234
odo-0.231
urrent-0.231
Token [
Token ID685
Feature activation+7.72e-01

Pos logit contributions
advertising+0.829
Attorney+0.792
rolet+0.788
jury+0.742
VK+0.741
Neg logit contributions
antha-0.824
iliated-0.774
 coerc-0.762
odo-0.749
urrent-0.743
Token tomorrow
Token ID9439
Feature activation+1.37e-01

Pos logit contributions
antha+0.356
iliated+0.335
 coerc+0.333
odo+0.325
urrent+0.321
Neg logit contributions
advertising-0.354
rolet-0.351
Attorney-0.346
VK-0.326
jury-0.323
Token with
Token ID351
Feature activation+1.37e-01

Pos logit contributions
advertising+0.527
rolet+0.526
Attorney+0.521
jury+0.488
VK+0.487
Neg logit contributions
antha-0.510
iliated-0.479
 coerc-0.475
odo-0.461
urrent-0.456
Token Est
Token ID10062
Feature activation+1.92e-02

Pos logit contributions
rolet+0.526
advertising+0.523
Attorney+0.517
jury+0.488
VK+0.487
Neg logit contributions
antha-0.508
iliated-0.477
 coerc-0.475
odo-0.459
urrent-0.458
Tokeneb
Token ID1765
Feature activation+3.61e-01

Pos logit contributions
advertising+0.068
rolet+0.067
Attorney+0.067
VK+0.063
jury+0.061
Neg logit contributions
antha-0.077
iliated-0.072
 coerc-0.072
odo-0.070
urrent-0.070
Tokenan
Token ID272
Feature activation+3.54e-01

Pos logit contributions
advertising+1.531
rolet+1.495
Attorney+1.463
VK+1.420
jury+1.385
Neg logit contributions
antha-1.210
 coerc-1.154
iliated-1.143
odo-1.081
urrent-1.070
Token [
Token ID685
Feature activation+1.14e+00

Pos logit contributions
advertising+1.511
rolet+1.498
Attorney+1.483
VK+1.428
jury+1.418
Neg logit contributions
antha-1.143
 coerc-1.067
iliated-1.047
urrent-1.020
 Klux-0.974
TokenO
Token ID46
Feature activation+2.36e-01

Pos logit contributions
advertising+4.152
Attorney+4.059
rolet+3.936
VK+3.814
jury+3.750
Neg logit contributions
antha-4.572
 coerc-4.376
iliated-4.243
 Fargo-4.095
odo-4.092
Tokencon
Token ID1102
Feature activation-4.87e-01

Pos logit contributions
advertising+0.864
rolet+0.863
Attorney+0.833
VK+0.787
jury+0.778
Neg logit contributions
antha-0.924
iliated-0.883
 coerc-0.874
urrent-0.848
odo-0.843
Token,
Token ID11
Feature activation-1.23e-01

Pos logit contributions
antha+1.843
iliated+1.732
 coerc+1.713
odo+1.692
urrent+1.667
Neg logit contributions
advertising-1.828
rolet-1.809
Attorney-1.809
VK-1.696
jury-1.657
Token his
Token ID465
Feature activation+1.95e-01

Pos logit contributions
antha+0.466
iliated+0.438
 coerc+0.434
odo+0.426
urrent+0.421
Neg logit contributions
advertising-0.463
rolet-0.461
Attorney-0.456
VK-0.428
jury-0.422
Token new
Token ID649
Feature activation-2.22e-01

Pos logit contributions
advertising+0.703
rolet+0.700
Attorney+0.691
VK+0.648
jury+0.637
Neg logit contributions
antha-0.778
iliated-0.734
 coerc-0.723
odo-0.710
urrent-0.703
Token.
Token ID13
Feature activation-6.96e-02

Pos logit contributions
advertising+1.577
Attorney+1.535
rolet+1.515
jury+1.415
VK+1.412
Neg logit contributions
antha-1.568
 coerc-1.487
iliated-1.474
odo-1.418
urrent-1.407
Token8
Token ID23
Feature activation+2.58e-01

Pos logit contributions
antha+0.307
iliated+0.292
 coerc+0.290
odo+0.283
urrent+0.280
Neg logit contributions
advertising-0.222
Attorney-0.218
rolet-0.218
VK-0.202
jury-0.196
Token and
Token ID290
Feature activation-1.08e-01

Pos logit contributions
advertising+0.989
rolet+0.978
Attorney+0.976
VK+0.909
jury+0.897
Neg logit contributions
antha-0.952
 coerc-0.893
iliated-0.892
odo-0.874
urrent-0.856
Token Tyr
Token ID14319
Feature activation-7.21e-02

Pos logit contributions
antha+0.408
iliated+0.385
 coerc+0.380
odo+0.377
urrent+0.368
Neg logit contributions
advertising-0.409
Attorney-0.404
rolet-0.400
VK-0.376
jury-0.373
Tokenant
Token ID415
Feature activation+6.52e-01

Pos logit contributions
antha+0.264
 coerc+0.250
iliated+0.250
odo+0.240
urrent+0.239
Neg logit contributions
rolet-0.280
advertising-0.278
Attorney-0.278
VK-0.258
jury-0.255
Token opened
Token ID4721
Feature activation+1.14e+00

Pos logit contributions
Attorney+2.455
advertising+2.439
rolet+2.398
VK+2.267
jury+2.199
Neg logit contributions
antha-2.453
iliated-2.322
 coerc-2.269
odo-2.208
urrent-2.180
Token with
Token ID351
Feature activation+3.64e-01

Pos logit contributions
advertising+4.199
rolet+4.078
Attorney+4.074
jury+3.779
VK+3.748
Neg logit contributions
antha-4.205
iliated-3.918
odo-3.883
 coerc-3.835
urrent-3.727
Token 2
Token ID362
Feature activation-1.58e-01

Pos logit contributions
advertising+1.301
rolet+1.278
Attorney+1.268
VK+1.190
jury+1.188
Neg logit contributions
antha-1.445
iliated-1.373
 coerc-1.340
odo-1.326
urrent-1.322
Token.
Token ID13
Feature activation+2.99e-01

Pos logit contributions
antha+0.542
iliated+0.509
 coerc+0.503
odo+0.494
urrent+0.487
Neg logit contributions
advertising-0.645
rolet-0.640
Attorney-0.638
VK-0.602
jury-0.590
Token1
Token ID16
Feature activation+2.28e-01

Pos logit contributions
advertising+1.003
rolet+0.988
Attorney+0.986
VK+0.922
jury+0.897
Neg logit contributions
antha-1.258
iliated-1.198
 coerc-1.193
odo-1.158
urrent-1.147
Token mil
Token ID1465
Feature activation+4.11e-01

Pos logit contributions
advertising+0.851
rolet+0.838
Attorney+0.838
VK+0.783
jury+0.767
Neg logit contributions
antha-0.878
iliated-0.822
 coerc-0.820
odo-0.805
urrent-0.787
Token the
Token ID262
Feature activation+5.44e-02

Pos logit contributions
advertising+0.772
rolet+0.768
Attorney+0.767
VK+0.727
jury+0.709
Neg logit contributions
antha-0.751
iliated-0.710
 coerc-0.698
odo-0.694
urrent-0.684
Token Beer
Token ID16971
Feature activation+2.68e-01

Pos logit contributions
advertising+0.205
Attorney+0.203
rolet+0.202
VK+0.192
jury+0.187
Neg logit contributions
antha-0.204
iliated-0.191
 coerc-0.190
odo-0.187
urrent-0.183
Token Festival
Token ID11117
Feature activation-4.56e-02

Pos logit contributions
advertising+0.957
Attorney+0.949
rolet+0.929
VK+0.887
jury+0.863
Neg logit contributions
antha-1.057
 coerc-0.999
iliated-0.997
urrent-0.963
odo-0.961
Token.
Token ID13
Feature activation+5.28e-02

Pos logit contributions
antha+0.165
iliated+0.155
 coerc+0.155
odo+0.151
urrent+0.148
Neg logit contributions
advertising-0.179
rolet-0.176
Attorney-0.176
VK-0.165
jury-0.163
Token Make
Token ID6889
Feature activation+2.55e-01

Pos logit contributions
advertising+0.207
Attorney+0.203
rolet+0.202
VK+0.192
jury+0.187
Neg logit contributions
antha-0.194
iliated-0.185
 coerc-0.185
odo-0.178
urrent-0.176
Token sure
Token ID1654
Feature activation+1.13e+00

Pos logit contributions
advertising+1.093
rolet+1.070
Attorney+1.067
VK+1.011
jury+0.994
Neg logit contributions
antha-0.841
iliated-0.788
 coerc-0.782
odo-0.755
urrent-0.744
Token to
Token ID284
Feature activation+5.21e-01

Pos logit contributions
advertising+4.449
rolet+4.367
Attorney+4.322
VK+4.138
cow+3.998
Neg logit contributions
antha-3.873
 coerc-3.755
iliated-3.719
odo-3.555
urrent-3.548
Token get
Token ID651
Feature activation+4.94e-01

Pos logit contributions
advertising+1.611
rolet+1.610
Attorney+1.573
VK+1.494
cow+1.432
Neg logit contributions
antha-2.288
iliated-2.172
 coerc-2.146
odo-2.108
urrent-2.082
Token your
Token ID534
Feature activation+1.82e-01

Pos logit contributions
advertising+1.821
Attorney+1.773
rolet+1.766
VK+1.677
jury+1.622
Neg logit contributions
antha-1.880
iliated-1.793
 coerc-1.783
odo-1.759
urrent-1.716
Token tickets
Token ID8587
Feature activation+2.33e-01

Pos logit contributions
advertising+0.725
rolet+0.716
Attorney+0.713
VK+0.678
jury+0.657
Neg logit contributions
antha-0.646
iliated-0.612
 coerc-0.601
odo-0.595
urrent-0.582
Token!
Token ID0
Feature activation+8.56e-02

Pos logit contributions
advertising+0.927
rolet+0.912
Attorney+0.909
VK+0.861
cow+0.841
Neg logit contributions
antha-0.830
iliated-0.781
 coerc-0.769
odo-0.761
urrent-0.749
Token are
Token ID389
Feature activation-1.27e-01

Pos logit contributions
advertising+0.966
Attorney+0.948
rolet+0.947
VK+0.882
jury+0.875
Neg logit contributions
antha-0.917
iliated-0.851
 coerc-0.835
odo-0.826
urrent-0.814
Token also
Token ID635
Feature activation+3.77e-01

Pos logit contributions
antha+0.449
iliated+0.423
 coerc+0.415
odo+0.411
urrent+0.404
Neg logit contributions
advertising-0.515
Attorney-0.503
rolet-0.499
VK-0.473
jury-0.470
Token still
Token ID991
Feature activation+5.07e-01

Pos logit contributions
advertising+1.482
Attorney+1.423
rolet+1.410
jury+1.345
VK+1.341
Neg logit contributions
antha-1.396
iliated-1.311
 coerc-1.282
odo-1.273
urrent-1.249
Token in
Token ID287
Feature activation+1.63e-01

Pos logit contributions
advertising+1.897
Attorney+1.826
rolet+1.795
jury+1.725
VK+1.703
Neg logit contributions
antha-1.957
iliated-1.845
 coerc-1.796
odo-1.794
urrent-1.765
Token the
Token ID262
Feature activation+2.10e-01

Pos logit contributions
advertising+0.611
rolet+0.601
Attorney+0.600
VK+0.561
jury+0.560
Neg logit contributions
antha-0.623
iliated-0.588
 coerc-0.581
odo-0.571
urrent-0.563
Token mix
Token ID5022
Feature activation+1.12e+00

Pos logit contributions
advertising+0.719
Attorney+0.706
rolet+0.705
VK+0.654
jury+0.644
Neg logit contributions
antha-0.867
iliated-0.824
 coerc-0.810
odo-0.796
urrent-0.789
Token for
Token ID329
Feature activation+7.80e-02

Pos logit contributions
advertising+4.069
rolet+3.950
Attorney+3.949
jury+3.718
VK+3.583
Neg logit contributions
antha-4.315
iliated-4.004
urrent-3.929
 coerc-3.927
odo-3.904
Token Ad
Token ID1215
Feature activation-2.47e-01

Pos logit contributions
advertising+0.310
rolet+0.305
Attorney+0.304
jury+0.282
VK+0.282
Neg logit contributions
antha-0.279
iliated-0.261
 coerc-0.260
odo-0.255
urrent-0.252
Tokenew
Token ID413
Feature activation+1.59e-01

Pos logit contributions
antha+0.863
 coerc+0.816
iliated+0.803
odo+0.781
urrent+0.776
Neg logit contributions
advertising-0.999
rolet-0.992
Attorney-0.985
jury-0.921
VK-0.920
Tokenale
Token ID1000
Feature activation-1.55e-01

Pos logit contributions
rolet+0.601
advertising+0.599
Attorney+0.593
VK+0.562
cow+0.548
Neg logit contributions
antha-0.598
 coerc-0.570
iliated-0.557
 Fargo-0.540
odo-0.538
Token Om
Token ID16543
Feature activation+1.21e-01

Pos logit contributions
antha+0.576
iliated+0.525
 coerc+0.523
urrent+0.515
odo+0.512
Neg logit contributions
Attorney-0.607
advertising-0.599
rolet-0.588
VK-0.561
jury-0.561
Token #
Token ID1303
Feature activation+3.97e-01

Pos logit contributions
advertising+1.175
rolet+1.173
Attorney+1.144
VK+1.059
cow+0.993
Neg logit contributions
antha-2.732
iliated-2.637
 coerc-2.593
odo-2.537
urrent-2.514
Token load
Token ID3440
Feature activation+1.66e-01

Pos logit contributions
advertising+1.532
rolet+1.525
Attorney+1.498
VK+1.454
cow+1.394
Neg logit contributions
antha-1.453
iliated-1.362
 coerc-1.316
odo-1.315
urrent-1.282
Token the
Token ID262
Feature activation-9.26e-02

Pos logit contributions
advertising+0.612
Attorney+0.608
rolet+0.594
VK+0.569
jury+0.554
Neg logit contributions
antha-0.653
 coerc-0.623
iliated-0.617
urrent-0.588
odo-0.588
Token image
Token ID2939
Feature activation+1.94e-01

Pos logit contributions
antha+0.343
 coerc+0.319
iliated+0.316
urrent+0.305
odo+0.305
Neg logit contributions
advertising-0.366
Attorney-0.358
rolet-0.357
VK-0.338
jury-0.332
Token,
Token ID11
Feature activation+1.66e-01

Pos logit contributions
advertising+0.695
Attorney+0.687
rolet+0.679
VK+0.641
jury+0.625
Neg logit contributions
antha-0.777
iliated-0.736
 coerc-0.727
odo-0.712
urrent-0.702
Token resize
Token ID47558
Feature activation+1.12e+00

Pos logit contributions
advertising+0.577
rolet+0.568
Attorney+0.565
VK+0.536
jury+0.518
Neg logit contributions
antha-0.675
iliated-0.638
 coerc-0.621
odo-0.620
urrent-0.611
Token it
Token ID340
Feature activation-1.61e-01

Pos logit contributions
Attorney+3.679
advertising+3.670
rolet+3.546
VK+3.452
jury+3.369
Neg logit contributions
antha-4.693
 coerc-4.455
iliated-4.420
urrent-4.270
odo-4.243
Token,
Token ID11
Feature activation-2.92e-02

Pos logit contributions
antha+0.624
iliated+0.590
 coerc+0.586
odo+0.565
urrent+0.562
Neg logit contributions
advertising-0.603
Attorney-0.596
rolet-0.591
VK-0.559
jury-0.548
Token and
Token ID290
Feature activation-5.54e-02

Pos logit contributions
antha+0.107
iliated+0.101
 coerc+0.100
odo+0.098
urrent+0.097
Neg logit contributions
advertising-0.113
Attorney-0.112
rolet-0.112
VK-0.105
jury-0.103
Token convert
Token ID10385
Feature activation+1.50e-01

Pos logit contributions
antha+0.179
iliated+0.167
 coerc+0.163
odo+0.160
urrent+0.158
Neg logit contributions
advertising-0.239
rolet-0.238
Attorney-0.237
VK-0.225
jury-0.220
Token it
Token ID340
Feature activation-4.13e-02

Pos logit contributions
advertising+0.585
Attorney+0.576
rolet+0.572
VK+0.543
jury+0.534
Neg logit contributions
antha-0.557
 coerc-0.517
iliated-0.517
urrent-0.499
odo-0.496
Token\n
Token ID198
Feature activation-2.85e-02

Pos logit contributions
antha+0.064
 coerc+0.061
iliated+0.061
odo+0.058
urrent+0.058
Neg logit contributions
advertising-0.068
Attorney-0.066
rolet-0.066
VK-0.062
jury-0.061
Token\n
Token ID198
Feature activation-1.36e-01

Pos logit contributions
antha+0.107
 coerc+0.102
iliated+0.101
odo+0.096
urrent+0.096
Neg logit contributions
advertising-0.111
Attorney-0.109
rolet-0.107
VK-0.102
jury-0.101
Token
Token ID447
Feature activation-8.05e-02

Pos logit contributions
antha+0.686
iliated+0.656
 coerc+0.652
odo+0.643
urrent+0.637
Neg logit contributions
advertising-0.342
rolet-0.337
Attorney-0.336
VK-0.304
jury-0.294
Token
Token ID246
Feature activation-3.18e-02

Pos logit contributions
antha+0.376
 coerc+0.358
iliated+0.355
odo+0.347
 Fargo+0.344
Neg logit contributions
advertising-0.238
rolet-0.231
Attorney-0.230
VK-0.211
jury-0.208
TokenNo
Token ID2949
Feature activation-2.05e-01

Pos logit contributions
antha+0.126
 coerc+0.121
iliated+0.120
odo+0.116
urrent+0.115
Neg logit contributions
advertising-0.116
Attorney-0.115
rolet-0.112
VK-0.107
jury-0.104
Token matter
Token ID2300
Feature activation-1.51e+00

Pos logit contributions
antha+0.737
iliated+0.687
 coerc+0.682
odo+0.670
urrent+0.663
Neg logit contributions
advertising-0.821
rolet-0.808
Attorney-0.807
VK-0.757
jury-0.749
Token where
Token ID810
Feature activation-2.15e-01

Pos logit contributions
antha+4.836
 coerc+4.621
iliated+4.598
odo+4.458
 Fargo+4.407
Neg logit contributions
rolet-6.288
advertising-6.276
Attorney-6.266
VK-5.917
jury-5.691
Token we
Token ID356
Feature activation-5.43e-03

Pos logit contributions
antha+0.819
iliated+0.773
 coerc+0.763
odo+0.748
urrent+0.747
Neg logit contributions
advertising-0.803
Attorney-0.790
rolet-0.782
VK-0.739
jury-0.732
Token live
Token ID2107
Feature activation+3.91e-01

Pos logit contributions
antha+0.022
iliated+0.021
 coerc+0.020
urrent+0.020
odo+0.020
Neg logit contributions
advertising-0.019
Attorney-0.019
rolet-0.019
VK-0.018
jury-0.017
Token we
Token ID356
Feature activation+2.70e-01

Pos logit contributions
advertising+1.432
Attorney+1.411
rolet+1.394
VK+1.317
jury+1.310
Neg logit contributions
antha-1.510
iliated-1.414
 coerc-1.396
odo-1.370
urrent-1.368
Token are
Token ID389
Feature activation+1.89e-02

Pos logit contributions
rolet+0.978
advertising+0.975
Attorney+0.964
VK+0.896
jury+0.881
Neg logit contributions
antha-1.045
iliated-1.002
 coerc-0.968
urrent-0.952
odo-0.950
Token.
Token ID13
Feature activation+2.51e-02

Pos logit contributions
advertising+0.584
rolet+0.581
Attorney+0.578
VK+0.540
jury+0.532
Neg logit contributions
antha-0.568
iliated-0.532
 coerc-0.527
odo-0.517
urrent-0.510
Token\n
Token ID198
Feature activation-1.10e-01

Pos logit contributions
advertising+0.097
Attorney+0.095
rolet+0.095
VK+0.089
jury+0.087
Neg logit contributions
antha-0.094
iliated-0.090
 coerc-0.089
odo-0.086
urrent-0.085
Token\n
Token ID198
Feature activation-1.43e-01

Pos logit contributions
antha+0.411
 coerc+0.389
iliated+0.387
odo+0.369
urrent+0.367
Neg logit contributions
advertising-0.434
Attorney-0.428
rolet-0.421
VK-0.400
jury-0.394
TokenSource
Token ID7416
Feature activation-2.21e-01

Pos logit contributions
antha+0.571
iliated+0.540
 coerc+0.540
odo+0.525
urrent+0.518
Neg logit contributions
advertising-0.521
Attorney-0.511
rolet-0.506
VK-0.477
jury-0.463
Token:
Token ID25
Feature activation-4.29e-01

Pos logit contributions
antha+0.891
iliated+0.843
 coerc+0.835
odo+0.820
urrent+0.811
Neg logit contributions
advertising-0.783
rolet-0.773
Attorney-0.771
VK-0.721
jury-0.705
Token Anonymous
Token ID19200
Feature activation-1.43e+00

Pos logit contributions
antha+1.612
iliated+1.521
 coerc+1.506
odo+1.475
urrent+1.458
Neg logit contributions
advertising-1.633
rolet-1.611
Attorney-1.609
VK-1.512
jury-1.477
Token<|endoftext|>
Token ID50256
Feature activation-1.83e-01

Pos logit contributions
antha+4.912
iliated+4.607
odo+4.515
 coerc+4.506
 Fargo+4.412
Neg logit contributions
rolet-5.536
advertising-5.528
Attorney-5.496
VK-5.133
jury-5.116
TokenT
Token ID51
Feature activation-5.84e-04

Pos logit contributions
antha+0.743
 coerc+0.704
iliated+0.703
odo+0.684
urrent+0.675
Neg logit contributions
advertising-0.658
Attorney-0.648
rolet-0.630
VK-0.601
jury-0.584
TokenAM
Token ID2390
Feature activation+7.76e-03

Pos logit contributions
antha+0.002
iliated+0.002
 coerc+0.002
 Fargo+0.002
odo+0.002
Neg logit contributions
advertising-0.002
rolet-0.002
Attorney-0.002
VK-0.002
jury-0.002
TokenPA
Token ID4537
Feature activation+1.32e-01

Pos logit contributions
advertising+0.015
Attorney+0.015
rolet+0.014
VK+0.013
jury+0.012
Neg logit contributions
antha-0.044
iliated-0.042
 coerc-0.042
odo-0.041
urrent-0.041
Token,
Token ID11
Feature activation+1.48e-01

Pos logit contributions
advertising+0.474
Attorney+0.469
rolet+0.464
VK+0.438
jury+0.428
Neg logit contributions
antha-0.531
iliated-0.500
 coerc-0.492
odo-0.483
urrent-0.476
Token
Token ID247
Feature activation+2.22e-02

Pos logit contributions
advertising+0.398
Attorney+0.390
rolet+0.385
VK+0.357
jury+0.355
Neg logit contributions
antha-0.566
 coerc-0.537
iliated-0.529
odo-0.521
 Fargo-0.516
Token aforementioned
Token ID20794
Feature activation+9.92e-02

Pos logit contributions
advertising+0.094
rolet+0.091
Attorney+0.091
VK+0.086
jury+0.086
Neg logit contributions
antha-0.075
 coerc-0.069
iliated-0.069
odo-0.068
urrent-0.066
Token movie
Token ID3807
Feature activation-6.03e-02

Pos logit contributions
advertising+0.381
rolet+0.371
Attorney+0.369
VK+0.344
jury+0.343
Neg logit contributions
antha-0.372
iliated-0.352
 coerc-0.348
odo-0.342
urrent-0.338
Token.
Token ID13
Feature activation+7.87e-02

Pos logit contributions
antha+0.221
iliated+0.208
 coerc+0.206
odo+0.201
urrent+0.199
Neg logit contributions
advertising-0.237
rolet-0.232
Attorney-0.231
VK-0.216
jury-0.214
Token Not
Token ID1892
Feature activation-6.49e-01

Pos logit contributions
advertising+0.304
Attorney+0.296
rolet+0.293
VK+0.275
jury+0.270
Neg logit contributions
antha-0.300
iliated-0.290
 coerc-0.287
odo-0.274
urrent-0.273
Token only
Token ID691
Feature activation-1.41e+00

Pos logit contributions
antha+2.183
iliated+2.037
 coerc+2.018
odo+1.977
urrent+1.947
Neg logit contributions
advertising-2.723
rolet-2.693
Attorney-2.689
VK-2.538
jury-2.492
Token that
Token ID326
Feature activation+9.88e-02

Pos logit contributions
antha+5.055
iliated+4.796
 coerc+4.675
odo+4.657
 Fargo+4.613
Neg logit contributions
rolet-5.319
Attorney-5.306
advertising-5.290
VK-5.018
jury-4.879
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
advertising+0.384
Attorney+0.379
rolet+0.379
VK+0.354
jury+0.349
Neg logit contributions
antha-0.364
iliated-0.342
 coerc-0.339
odo-0.332
urrent-0.327
Token but
Token ID475
Feature activation+1.08e-01

Pos logit contributions
antha+0.444
iliated+0.421
 coerc+0.417
odo+0.407
urrent+0.403
Neg logit contributions
advertising-0.464
Attorney-0.456
rolet-0.455
VK-0.425
jury-0.419
Token apparently
Token ID5729
Feature activation+1.08e-01

Pos logit contributions
advertising+0.413
Attorney+0.403
rolet+0.401
VK+0.372
jury+0.369
Neg logit contributions
antha-0.407
iliated-0.389
 coerc-0.385
odo-0.376
urrent-0.372
Token,
Token ID11
Feature activation+1.21e-02

Pos logit contributions
advertising+0.411
Attorney+0.402
rolet+0.399
VK+0.370
jury+0.369
Neg logit contributions
antha-0.411
iliated-0.391
 coerc-0.385
odo-0.378
urrent-0.374
Token to
Token ID284
Feature activation+1.41e-02

Pos logit contributions
antha+1.495
iliated+1.382
 coerc+1.367
odo+1.338
urrent+1.308
Neg logit contributions
advertising-2.353
Attorney-2.334
rolet-2.331
VK-2.215
jury-2.177
Token this
Token ID428
Feature activation+9.49e-02

Pos logit contributions
advertising+0.051
rolet+0.050
Attorney+0.050
VK+0.047
jury+0.046
Neg logit contributions
antha-0.056
iliated-0.053
 coerc-0.052
odo-0.051
urrent-0.051
Token guy
Token ID3516
Feature activation+3.83e-01

Pos logit contributions
advertising+0.344
Attorney+0.338
rolet+0.332
VK+0.317
cow+0.310
Neg logit contributions
antha-0.374
iliated-0.355
 coerc-0.346
odo-0.345
urrent-0.343
Token.
Token ID13
Feature activation-1.27e-01

Pos logit contributions
advertising+1.395
rolet+1.375
Attorney+1.367
VK+1.280
jury+1.259
Neg logit contributions
antha-1.487
iliated-1.399
 coerc-1.381
odo-1.355
urrent-1.349
Token Not
Token ID1892
Feature activation-3.31e-01

Pos logit contributions
antha+0.468
iliated+0.446
 coerc+0.440
odo+0.425
urrent+0.423
Neg logit contributions
advertising-0.500
Attorney-0.494
rolet-0.491
VK-0.464
jury-0.454
Token only
Token ID691
Feature activation-1.41e+00

Pos logit contributions
antha+1.204
iliated+1.121
 coerc+1.114
odo+1.093
urrent+1.090
Neg logit contributions
advertising-1.303
rolet-1.288
Attorney-1.279
VK-1.205
jury-1.187
Token did
Token ID750
Feature activation+5.15e-01

Pos logit contributions
antha+4.352
iliated+4.112
 coerc+4.035
odo+3.973
 Fargo+3.908
Neg logit contributions
advertising-6.102
rolet-6.043
Attorney-6.014
VK-5.693
jury-5.593
Token he
Token ID339
Feature activation+2.15e-02

Pos logit contributions
advertising+2.337
rolet+2.315
Attorney+2.309
VK+2.191
jury+2.179
Neg logit contributions
antha-1.511
iliated-1.424
 coerc-1.386
urrent-1.342
odo-1.332
Token plan
Token ID1410
Feature activation-5.66e-01

Pos logit contributions
advertising+0.079
rolet+0.078
Attorney+0.078
VK+0.073
jury+0.072
Neg logit contributions
antha-0.083
iliated-0.079
 coerc-0.076
urrent-0.075
odo-0.074
Token a
Token ID257
Feature activation-4.64e-01

Pos logit contributions
antha+1.968
iliated+1.844
 coerc+1.820
odo+1.784
urrent+1.762
Neg logit contributions
advertising-2.311
rolet-2.287
Attorney-2.282
VK-2.152
jury-2.117
Token huge
Token ID3236
Feature activation-3.67e-01

Pos logit contributions
antha+1.746
iliated+1.644
 coerc+1.621
odo+1.595
urrent+1.577
Neg logit contributions
advertising-1.761
rolet-1.742
Attorney-1.738
VK-1.634
jury-1.604
Token Email
Token ID9570
Feature activation-5.58e-01

Pos logit contributions
advertising+0.514
Attorney+0.507
rolet+0.505
VK+0.480
jury+0.461
Neg logit contributions
antha-0.572
iliated-0.547
 coerc-0.539
urrent-0.522
odo-0.519
Token your
Token ID534
Feature activation-3.42e-01

Pos logit contributions
antha+2.030
iliated+1.907
 coerc+1.889
odo+1.855
urrent+1.827
Neg logit contributions
advertising-2.188
rolet-2.164
Attorney-2.161
VK-2.031
jury-1.991
Token friends
Token ID2460
Feature activation+1.42e-01

Pos logit contributions
antha+1.301
iliated+1.224
 coerc+1.210
odo+1.189
urrent+1.176
Neg logit contributions
advertising-1.285
rolet-1.268
Attorney-1.265
VK-1.188
jury-1.163
Token.
Token ID13
Feature activation-7.34e-02

Pos logit contributions
advertising+0.535
rolet+0.526
Attorney+0.521
VK+0.495
jury+0.479
Neg logit contributions
antha-0.541
iliated-0.508
 coerc-0.499
odo-0.494
urrent-0.492
Token No
Token ID1400
Feature activation-3.02e-01

Pos logit contributions
antha+0.291
iliated+0.277
 coerc+0.274
odo+0.265
urrent+0.265
Neg logit contributions
advertising-0.266
Attorney-0.263
rolet-0.262
VK-0.248
jury-0.239
Token matter
Token ID2300
Feature activation-1.41e+00

Pos logit contributions
antha+1.114
iliated+1.045
 coerc+1.032
odo+1.017
urrent+1.004
Neg logit contributions
advertising-1.172
rolet-1.156
Attorney-1.156
VK-1.086
jury-1.063
Token your
Token ID534
Feature activation-1.95e-01

Pos logit contributions
antha+4.713
 coerc+4.476
iliated+4.470
odo+4.293
 Fargo+4.277
Neg logit contributions
rolet-5.684
Attorney-5.663
advertising-5.660
VK-5.314
jury-5.164
Token social
Token ID1919
Feature activation+2.67e-01

Pos logit contributions
antha+0.784
iliated+0.739
 coerc+0.723
odo+0.721
urrent+0.715
Neg logit contributions
advertising-0.689
Attorney-0.678
rolet-0.671
VK-0.634
jury-0.618
Token media
Token ID2056
Feature activation+2.71e-01

Pos logit contributions
advertising+0.652
Attorney+0.638
rolet+0.615
VK+0.571
jury+0.551
Neg logit contributions
antha-1.377
iliated-1.303
 coerc-1.286
odo-1.272
urrent-1.271
Token/
Token ID14
Feature activation+7.32e-01

Pos logit contributions
advertising+0.974
Attorney+0.954
rolet+0.919
VK+0.888
cow+0.866
Neg logit contributions
antha-1.082
iliated-1.013
 coerc-0.992
odo-0.983
urrent-0.977
Tokencommunication
Token ID32560
Feature activation+1.02e-01

Pos logit contributions
advertising+2.626
Attorney+2.475
rolet+2.428
VK+2.336
cow+2.298
Neg logit contributions
antha-3.008
 coerc-2.855
iliated-2.832
 Fargo-2.745
odo-2.719
Token the
Token ID262
Feature activation-1.59e-01

Pos logit contributions
antha+0.561
iliated+0.530
 coerc+0.516
odo+0.510
urrent+0.504
Neg logit contributions
advertising-0.610
Attorney-0.597
rolet-0.595
VK-0.565
jury-0.554
Token animal
Token ID5044
Feature activation+1.80e-01

Pos logit contributions
antha+0.579
iliated+0.548
 coerc+0.536
odo+0.527
urrent+0.523
Neg logit contributions
advertising-0.624
Attorney-0.612
rolet-0.609
VK-0.580
jury-0.567
Token kingdom
Token ID13239
Feature activation-1.96e-01

Pos logit contributions
advertising+0.574
Attorney+0.557
rolet+0.552
VK+0.518
jury+0.503
Neg logit contributions
antha-0.789
iliated-0.748
 coerc-0.735
urrent-0.724
odo-0.723
Token,
Token ID11
Feature activation+1.22e-01

Pos logit contributions
antha+0.723
iliated+0.678
 coerc+0.669
odo+0.659
urrent+0.652
Neg logit contributions
advertising-0.767
Attorney-0.752
rolet-0.749
VK-0.705
jury-0.694
Token no
Token ID645
Feature activation-1.42e-01

Pos logit contributions
advertising+0.468
Attorney+0.454
rolet+0.450
VK+0.422
jury+0.422
Neg logit contributions
antha-0.457
iliated-0.436
urrent-0.419
odo-0.418
 coerc-0.418
Token matter
Token ID2300
Feature activation-1.40e+00

Pos logit contributions
antha+0.461
iliated+0.424
 coerc+0.414
odo+0.410
urrent+0.409
Neg logit contributions
advertising-0.621
Attorney-0.613
rolet-0.611
VK-0.577
jury-0.573
Token how
Token ID703
Feature activation-2.47e-01

Pos logit contributions
antha+4.444
 coerc+4.227
iliated+4.164
odo+4.043
 Fargo+4.006
Neg logit contributions
rolet-5.947
Attorney-5.918
advertising-5.912
VK-5.590
jury-5.401
Token young
Token ID1862
Feature activation-2.69e-01

Pos logit contributions
antha+0.876
iliated+0.828
odo+0.800
 coerc+0.797
urrent+0.794
Neg logit contributions
advertising-0.993
Attorney-0.973
rolet-0.965
VK-0.913
jury-0.907
Token or
Token ID393
Feature activation+2.34e-01

Pos logit contributions
antha+0.955
iliated+0.894
 coerc+0.875
odo+0.864
urrent+0.861
Neg logit contributions
advertising-1.086
Attorney-1.059
rolet-1.057
VK-1.002
jury-0.988
Token adorable
Token ID29012
Feature activation-5.77e-01

Pos logit contributions
advertising+0.890
Attorney+0.869
rolet+0.863
cow+0.818
jury+0.816
Neg logit contributions
antha-0.871
iliated-0.815
odo-0.796
 coerc-0.794
urrent-0.793
Token the
Token ID262
Feature activation+9.00e-02

Pos logit contributions
antha+2.056
iliated+1.926
 coerc+1.895
odo+1.862
urrent+1.848
Neg logit contributions
advertising-2.321
Attorney-2.276
rolet-2.276
VK-2.146
jury-2.112
Token unprecedented
Token ID13029
Feature activation-1.81e-01

Pos logit contributions
advertising+0.261
Attorney+0.260
rolet+0.255
jury+0.242
VK+0.242
Neg logit contributions
antha-0.252
iliated-0.235
odo-0.227
urrent-0.226
 coerc-0.224
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
antha+0.678
iliated+0.633
 coerc+0.616
odo+0.615
urrent+0.609
Neg logit contributions
advertising-0.699
Attorney-0.694
rolet-0.682
VK-0.646
jury-0.638
Token\n
Token ID198
Feature activation-9.99e-02

Pos logit contributions
antha+0.455
iliated+0.436
 coerc+0.432
odo+0.415
urrent+0.413
Neg logit contributions
advertising-0.450
Attorney-0.448
rolet-0.436
VK-0.415
jury-0.409
Token\n
Token ID198
Feature activation-1.28e-01

Pos logit contributions
antha+0.373
 coerc+0.355
iliated+0.353
odo+0.336
urrent+0.334
Neg logit contributions
advertising-0.393
Attorney-0.388
rolet-0.381
VK-0.362
jury-0.356
TokenNot
Token ID3673
Feature activation-4.98e-01

Pos logit contributions
antha+0.504
 coerc+0.478
iliated+0.477
odo+0.462
urrent+0.456
Neg logit contributions
advertising-0.478
Attorney-0.475
rolet-0.458
VK-0.440
jury-0.429
Token only
Token ID691
Feature activation-1.40e+00

Pos logit contributions
antha+1.716
iliated+1.591
 coerc+1.580
odo+1.553
urrent+1.533
Neg logit contributions
advertising-2.062
Attorney-2.036
rolet-2.030
VK-1.913
jury-1.887
Token is
Token ID318
Feature activation+2.00e-01

Pos logit contributions
antha+4.425
iliated+4.203
 coerc+4.180
odo+4.055
 Fargo+4.033
Neg logit contributions
rolet-5.916
advertising-5.907
Attorney-5.778
VK-5.507
jury-5.338
Token the
Token ID262
Feature activation+1.36e-01

Pos logit contributions
advertising+0.762
Attorney+0.755
rolet+0.728
jury+0.704
VK+0.695
Neg logit contributions
antha-0.760
iliated-0.713
 coerc-0.702
urrent-0.690
odo-0.685
Token punishment
Token ID9837
Feature activation-2.54e-01

Pos logit contributions
advertising+0.559
Attorney+0.559
rolet+0.542
jury+0.526
VK+0.515
Neg logit contributions
antha-0.472
iliated-0.440
 coerc-0.427
urrent-0.423
odo-0.422
Token unusually
Token ID23708
Feature activation-1.37e-01

Pos logit contributions
antha+0.904
iliated+0.841
odo+0.815
 coerc+0.815
urrent+0.801
Neg logit contributions
advertising-1.025
Attorney-1.024
rolet-1.006
VK-0.956
jury-0.943
Token harsh
Token ID11859
Feature activation+6.15e-02

Pos logit contributions
antha+0.518
iliated+0.480
odo+0.468
 coerc+0.462
urrent+0.461
Neg logit contributions
Attorney-0.520
advertising-0.519
rolet-0.509
VK-0.481
jury-0.479
Token the
Token ID262
Feature activation-2.87e-02

Pos logit contributions
advertising+0.368
Attorney+0.361
rolet+0.354
jury+0.338
VK+0.335
Neg logit contributions
antha-0.372
iliated-0.347
odo-0.336
urrent-0.336
 coerc-0.335
Token closest
Token ID11706
Feature activation+1.98e-01

Pos logit contributions
antha+0.098
iliated+0.092
 coerc+0.089
urrent+0.088
odo+0.088
Neg logit contributions
advertising-0.118
Attorney-0.117
rolet-0.116
VK-0.110
jury-0.109
Token thing
Token ID1517
Feature activation-8.62e-02

Pos logit contributions
advertising+0.802
Attorney+0.786
rolet+0.778
VK+0.742
jury+0.729
Neg logit contributions
antha-0.698
iliated-0.647
 coerc-0.631
urrent-0.630
odo-0.623
Token.
Token ID13
Feature activation-7.92e-02

Pos logit contributions
antha+0.299
iliated+0.276
 coerc+0.272
odo+0.266
urrent+0.266
Neg logit contributions
advertising-0.355
Attorney-0.352
rolet-0.350
VK-0.332
jury-0.326
Token Not
Token ID1892
Feature activation-3.40e-01

Pos logit contributions
antha+0.297
iliated+0.282
 coerc+0.280
odo+0.270
urrent+0.269
Neg logit contributions
advertising-0.306
Attorney-0.303
rolet-0.300
VK-0.283
jury-0.277
Token only
Token ID691
Feature activation-1.34e+00

Pos logit contributions
antha+1.153
iliated+1.068
 coerc+1.059
odo+1.038
urrent+1.032
Neg logit contributions
advertising-1.424
rolet-1.408
Attorney-1.403
VK-1.325
jury-1.307
Token is
Token ID318
Feature activation+2.22e-01

Pos logit contributions
antha+4.098
iliated+3.887
 coerc+3.828
odo+3.761
 Fargo+3.676
Neg logit contributions
advertising-5.812
rolet-5.768
Attorney-5.722
VK-5.464
jury-5.322
Token the
Token ID262
Feature activation+1.43e-01

Pos logit contributions
advertising+0.883
Attorney+0.875
rolet+0.864
VK+0.815
jury+0.814
Neg logit contributions
antha-0.791
iliated-0.742
 coerc-0.732
odo-0.714
urrent-0.714
Token dummy
Token ID31548
Feature activation+1.59e-01

Pos logit contributions
advertising+0.516
Attorney+0.512
rolet+0.505
jury+0.479
VK+0.476
Neg logit contributions
antha-0.562
iliated-0.529
 coerc-0.512
urrent-0.507
odo-0.501
Token programmed
Token ID27402
Feature activation+3.84e-01

Pos logit contributions
advertising+0.625
Attorney+0.623
rolet+0.614
jury+0.580
VK+0.578
Neg logit contributions
antha-0.574
iliated-0.533
odo-0.516
urrent-0.511
 coerc-0.510
Token to
Token ID284
Feature activation-5.92e-02

Pos logit contributions
advertising+1.632
rolet+1.626
Attorney+1.615
VK+1.522
jury+1.507
Neg logit contributions
antha-1.261
iliated-1.165
 coerc-1.138
odo-1.124
urrent-1.117
Tokens
Token ID82
Feature activation+3.02e-01

Pos logit contributions
advertising+1.248
Attorney+1.234
rolet+1.226
VK+1.164
jury+1.145
Neg logit contributions
antha-1.175
iliated-1.113
 coerc-1.109
odo-1.063
urrent-1.059
Token face
Token ID1986
Feature activation+5.64e-02

Pos logit contributions
advertising+1.078
rolet+1.073
Attorney+1.065
VK+1.011
jury+0.977
Neg logit contributions
antha-1.188
iliated-1.123
 coerc-1.098
urrent-1.071
odo-1.069
Token it
Token ID340
Feature activation+1.02e-01

Pos logit contributions
advertising+0.231
rolet+0.229
Attorney+0.228
VK+0.216
jury+0.211
Neg logit contributions
antha-0.195
iliated-0.182
 coerc-0.181
odo-0.176
urrent-0.175
Token,
Token ID11
Feature activation+2.32e-02

Pos logit contributions
advertising+0.380
rolet+0.376
Attorney+0.375
VK+0.351
jury+0.345
Neg logit contributions
antha-0.390
iliated-0.369
 coerc-0.363
odo-0.356
urrent-0.352
Token no
Token ID645
Feature activation-8.52e-02

Pos logit contributions
advertising+0.089
rolet+0.087
Attorney+0.087
VK+0.083
jury+0.081
Neg logit contributions
antha-0.086
iliated-0.082
 coerc-0.080
odo-0.079
urrent-0.078
Token matter
Token ID2300
Feature activation-1.34e+00

Pos logit contributions
antha+0.329
iliated+0.308
 coerc+0.302
odo+0.298
urrent+0.297
Neg logit contributions
advertising-0.320
Attorney-0.314
rolet-0.312
VK-0.294
jury-0.289
Token what
Token ID644
Feature activation-2.77e-01

Pos logit contributions
antha+4.375
iliated+4.118
 coerc+4.112
odo+4.018
 Fargo+3.951
Neg logit contributions
advertising-5.474
rolet-5.468
Attorney-5.468
VK-5.120
jury-4.956
Token I
Token ID314
Feature activation+5.42e-02

Pos logit contributions
antha+0.986
iliated+0.927
 coerc+0.908
odo+0.894
urrent+0.885
Neg logit contributions
advertising-1.111
Attorney-1.097
rolet-1.092
VK-1.032
jury-1.017
Token took
Token ID1718
Feature activation+3.07e-01

Pos logit contributions
advertising+0.199
rolet+0.197
Attorney+0.196
VK+0.182
jury+0.181
Neg logit contributions
antha-0.212
iliated-0.201
 coerc-0.196
urrent-0.190
odo-0.190
Token a
Token ID257
Feature activation-1.83e-01

Pos logit contributions
Attorney+1.128
advertising+1.125
rolet+1.122
VK+1.050
jury+1.026
Neg logit contributions
antha-1.176
iliated-1.114
 coerc-1.075
urrent-1.062
odo-1.056
Token photograph
Token ID8408
Feature activation+4.26e-01

Pos logit contributions
antha+0.733
iliated+0.695
 coerc+0.683
odo+0.672
urrent+0.667
Neg logit contributions
advertising-0.647
rolet-0.644
Attorney-0.638
VK-0.602
jury-0.586
Token in
Token ID287
Feature activation-2.22e-02

Pos logit contributions
antha+0.279
iliated+0.259
 coerc+0.255
odo+0.251
urrent+0.249
Neg logit contributions
advertising-0.305
Attorney-0.303
rolet-0.301
VK-0.284
jury-0.279
Token L
Token ID406
Feature activation-2.17e-01

Pos logit contributions
antha+0.083
iliated+0.077
 coerc+0.076
odo+0.075
urrent+0.074
Neg logit contributions
advertising-0.085
Attorney-0.084
rolet-0.083
VK-0.079
jury-0.077
TokenNG
Token ID10503
Feature activation-1.72e-01

Pos logit contributions
antha+0.772
 coerc+0.726
iliated+0.717
odo+0.694
urrent+0.689
Neg logit contributions
advertising-0.872
Attorney-0.860
rolet-0.857
VK-0.814
jury-0.796
Token,
Token ID11
Feature activation-1.10e-02

Pos logit contributions
antha+0.606
iliated+0.565
 coerc+0.553
odo+0.548
urrent+0.540
Neg logit contributions
advertising-0.702
Attorney-0.690
rolet-0.690
VK-0.653
jury-0.636
Token no
Token ID645
Feature activation-2.19e-01

Pos logit contributions
antha+0.042
iliated+0.040
 coerc+0.038
odo+0.038
urrent+0.038
Neg logit contributions
advertising-0.041
Attorney-0.041
rolet-0.040
VK-0.038
jury-0.037
Token matter
Token ID2300
Feature activation-1.34e+00

Pos logit contributions
antha+0.740
iliated+0.677
 coerc+0.667
odo+0.663
urrent+0.653
Neg logit contributions
advertising-0.927
Attorney-0.918
rolet-0.915
VK-0.864
jury-0.850
Token how
Token ID703
Feature activation-3.02e-01

Pos logit contributions
antha+4.139
 coerc+3.953
iliated+3.885
odo+3.776
urrent+3.709
Neg logit contributions
advertising-5.786
rolet-5.747
Attorney-5.743
VK-5.442
jury-5.311
Token it
Token ID340
Feature activation-2.36e-01

Pos logit contributions
antha+1.158
iliated+1.088
odo+1.052
 coerc+1.047
urrent+1.045
Neg logit contributions
advertising-1.127
Attorney-1.113
rolet-1.109
VK-1.043
jury-1.021
Token
Token ID447
Feature activation-1.95e-01

Pos logit contributions
antha+0.839
iliated+0.792
 coerc+0.779
odo+0.759
urrent+0.755
Neg logit contributions
advertising-0.944
rolet-0.925
Attorney-0.922
VK-0.876
jury-0.851
Token
Token ID247
Feature activation-7.49e-02

Pos logit contributions
antha+0.841
 coerc+0.792
iliated+0.789
odo+0.775
urrent+0.763
Neg logit contributions
advertising-0.640
Attorney-0.632
rolet-0.628
VK-0.585
jury-0.574
Tokens
Token ID82
Feature activation-1.57e-01

Pos logit contributions
antha+0.253
iliated+0.233
 coerc+0.233
odo+0.226
urrent+0.223
Neg logit contributions
advertising-0.315
Attorney-0.309
rolet-0.307
VK-0.292
jury-0.289
Token collective
Token ID10098
Feature activation-4.01e-02

Pos logit contributions
advertising+0.430
Attorney+0.423
rolet+0.420
VK+0.402
jury+0.395
Neg logit contributions
antha-0.281
iliated-0.257
 coerc-0.251
odo-0.243
urrent-0.242
Token ire
Token ID35918
Feature activation-1.27e-01

Pos logit contributions
antha+0.125
iliated+0.114
 coerc+0.111
odo+0.109
urrent+0.108
Neg logit contributions
advertising-0.181
Attorney-0.178
rolet-0.177
VK-0.169
jury-0.166
Token -
Token ID532
Feature activation-1.36e-01

Pos logit contributions
antha+0.471
iliated+0.440
 coerc+0.435
odo+0.428
urrent+0.424
Neg logit contributions
advertising-0.491
Attorney-0.482
rolet-0.482
VK-0.451
jury-0.445
Token "
Token ID366
Feature activation-7.14e-02

Pos logit contributions
antha+0.493
iliated+0.465
 coerc+0.457
odo+0.447
urrent+0.443
Neg logit contributions
advertising-0.539
Attorney-0.532
rolet-0.528
VK-0.500
jury-0.490
Tokeny
Token ID88
Feature activation+6.86e-02

Pos logit contributions
antha+0.275
iliated+0.260
 coerc+0.259
odo+0.251
urrent+0.249
Neg logit contributions
advertising-0.267
Attorney-0.264
rolet-0.260
VK-0.246
jury-0.242
Tokenup
Token ID929
Feature activation-1.32e+00

Pos logit contributions
advertising+0.278
Attorney+0.274
rolet+0.273
VK+0.259
jury+0.251
Neg logit contributions
antha-0.240
 coerc-0.224
iliated-0.223
 Fargo-0.215
urrent-0.213
Token,
Token ID11
Feature activation-5.13e-01

Pos logit contributions
antha+4.361
iliated+4.035
odo+4.018
 coerc+4.013
 Fargo+3.919
Neg logit contributions
advertising-5.454
rolet-5.385
Attorney-5.373
VK-5.099
jury-4.937
Token they
Token ID484
Feature activation-2.28e-01

Pos logit contributions
antha+1.816
iliated+1.706
 coerc+1.685
odo+1.650
urrent+1.632
Neg logit contributions
advertising-2.059
Attorney-2.036
rolet-2.035
VK-1.915
jury-1.881
Token knocked
Token ID13642
Feature activation+1.84e-01

Pos logit contributions
antha+0.828
iliated+0.778
 coerc+0.767
odo+0.746
urrent+0.744
Neg logit contributions
advertising-0.898
rolet-0.891
Attorney-0.885
VK-0.830
jury-0.821
Token down
Token ID866
Feature activation-1.28e-01

Pos logit contributions
advertising+0.699
Attorney+0.686
rolet+0.685
VK+0.648
jury+0.634
Neg logit contributions
antha-0.684
iliated-0.651
 coerc-0.634
urrent-0.623
odo-0.616
Token (
Token ID357
Feature activation+2.61e-01

Pos logit contributions
antha+0.482
iliated+0.459
 coerc+0.449
odo+0.439
urrent+0.438
Neg logit contributions
advertising-0.483
Attorney-0.477
rolet-0.475
VK-0.448
jury-0.439
Token be
Token ID307
Feature activation-4.07e-02

Pos logit contributions
antha+0.117
iliated+0.110
 coerc+0.108
odo+0.105
urrent+0.104
Neg logit contributions
advertising-0.135
rolet-0.133
Attorney-0.133
VK-0.127
jury-0.124
Token too
Token ID1165
Feature activation+7.59e-01

Pos logit contributions
antha+0.146
iliated+0.138
 coerc+0.134
odo+0.133
urrent+0.131
Neg logit contributions
advertising-0.159
Attorney-0.157
rolet-0.157
VK-0.149
jury-0.147
Token random
Token ID4738
Feature activation-8.32e-01

Pos logit contributions
advertising+2.814
Attorney+2.789
rolet+2.737
VK+2.664
cow+2.624
Neg logit contributions
antha-2.757
iliated-2.634
 coerc-2.491
odo-2.486
urrent-2.457
Token,
Token ID11
Feature activation-3.68e-02

Pos logit contributions
antha+2.821
iliated+2.639
 coerc+2.613
odo+2.563
urrent+2.521
Neg logit contributions
advertising-3.457
rolet-3.424
Attorney-3.419
VK-3.223
jury-3.159
Token or
Token ID393
Feature activation-1.36e-01

Pos logit contributions
antha+0.124
iliated+0.117
 coerc+0.115
odo+0.113
urrent+0.111
Neg logit contributions
advertising-0.153
rolet-0.151
Attorney-0.151
VK-0.142
jury-0.141
Token b
Token ID275
Feature activation-1.32e+00

Pos logit contributions
antha+0.444
iliated+0.414
 coerc+0.409
odo+0.402
urrent+0.395
Neg logit contributions
advertising-0.581
rolet-0.576
Attorney-0.574
VK-0.542
jury-0.534
Token)
Token ID8
Feature activation-5.90e-01

Pos logit contributions
antha+4.366
iliated+4.120
odo+4.086
irc+3.982
urrent+3.962
Neg logit contributions
rolet-5.332
advertising-5.276
Attorney-5.250
VK-4.932
jury-4.793
Token It
Token ID632
Feature activation-5.30e-01

Pos logit contributions
antha+2.125
iliated+1.995
 coerc+1.974
odo+1.940
urrent+1.910
Neg logit contributions
advertising-2.329
rolet-2.304
Attorney-2.301
VK-2.166
jury-2.123
Token would
Token ID561
Feature activation-2.69e-01

Pos logit contributions
antha+1.602
iliated+1.485
 coerc+1.470
odo+1.436
urrent+1.407
Neg logit contributions
advertising-2.401
rolet-2.375
Attorney-2.374
VK-2.252
jury-2.217
Token have
Token ID423
Feature activation-2.57e-01

Pos logit contributions
antha+0.962
iliated+0.903
 coerc+0.881
odo+0.869
urrent+0.858
Neg logit contributions
advertising-1.066
rolet-1.061
Attorney-1.056
VK-1.002
jury-0.983
Token only
Token ID691
Feature activation-6.52e-01

Pos logit contributions
antha+0.899
iliated+0.843
 coerc+0.822
odo+0.814
urrent+0.804
Neg logit contributions
advertising-1.038
Attorney-1.024
rolet-1.023
VK-0.971
jury-0.953
Tokenscript
Token ID12048
Feature activation+8.95e-02

Pos logit contributions
antha+1.190
iliated+1.121
 coerc+1.113
odo+1.094
urrent+1.077
Neg logit contributions
advertising-1.150
Attorney-1.133
rolet-1.131
VK-1.060
jury-1.040
Tokenions
Token ID507
Feature activation+2.29e-01

Pos logit contributions
advertising+0.355
Attorney+0.349
rolet+0.349
VK+0.327
jury+0.320
Neg logit contributions
antha-0.316
iliated-0.299
 coerc-0.299
odo-0.291
urrent-0.288
Token\n
Token ID198
Feature activation+1.55e-01

Pos logit contributions
advertising+0.850
Attorney+0.835
rolet+0.831
VK+0.784
cow+0.761
Neg logit contributions
antha-0.874
iliated-0.831
 coerc-0.820
odo-0.809
urrent-0.798
Token\n
Token ID198
Feature activation+1.71e-01

Pos logit contributions
advertising+0.591
Attorney+0.582
rolet+0.564
VK+0.539
jury+0.533
Neg logit contributions
antha-0.601
 coerc-0.579
iliated-0.569
odo-0.538
urrent-0.538
TokenNo
Token ID2949
Feature activation+3.40e-03

Pos logit contributions
advertising+0.599
Attorney+0.586
rolet+0.569
VK+0.545
jury+0.523
Neg logit contributions
antha-0.710
 coerc-0.679
iliated-0.676
odo-0.657
urrent-0.646
Token matter
Token ID2300
Feature activation-1.31e+00

Pos logit contributions
advertising+0.013
rolet+0.013
Attorney+0.013
VK+0.012
jury+0.012
Neg logit contributions
antha-0.012
iliated-0.011
 coerc-0.011
odo-0.011
urrent-0.011
Token where
Token ID810
Feature activation-6.55e-02

Pos logit contributions
antha+4.568
iliated+4.315
 coerc+4.289
odo+4.174
 Fargo+4.169
Neg logit contributions
Attorney-5.084
rolet-5.081
advertising-5.066
VK-4.757
jury-4.630
Token you
Token ID345
Feature activation+1.07e-01

Pos logit contributions
antha+0.217
iliated+0.204
 coerc+0.201
odo+0.197
urrent+0.194
Neg logit contributions
advertising-0.277
Attorney-0.273
rolet-0.271
VK-0.258
jury-0.253
Token are
Token ID389
Feature activation+2.41e-01

Pos logit contributions
advertising+0.376
rolet+0.375
Attorney+0.373
VK+0.348
cow+0.339
Neg logit contributions
antha-0.427
iliated-0.404
 coerc-0.403
odo-0.393
urrent-0.389
Token in
Token ID287
Feature activation-6.35e-04

Pos logit contributions
advertising+0.932
Attorney+0.914
rolet+0.909
VK+0.860
jury+0.839
Neg logit contributions
antha-0.881
iliated-0.831
 coerc-0.821
odo-0.814
urrent-0.797
Token the
Token ID262
Feature activation+1.30e-01

Pos logit contributions
antha+0.002
iliated+0.002
 coerc+0.002
odo+0.002
urrent+0.002
Neg logit contributions
advertising-0.002
Attorney-0.002
rolet-0.002
VK-0.002
jury-0.002
Token bad
Token ID2089
Feature activation+2.31e-01

Pos logit contributions
antha+0.122
iliated+0.115
 coerc+0.112
odo+0.111
urrent+0.111
Neg logit contributions
advertising-0.120
Attorney-0.119
rolet-0.119
VK-0.112
jury-0.111
Token,
Token ID11
Feature activation+2.82e-01

Pos logit contributions
advertising+0.796
Attorney+0.793
rolet+0.787
VK+0.729
jury+0.723
Neg logit contributions
antha-0.940
iliated-0.889
 coerc-0.872
urrent-0.862
odo-0.852
Token dangerous
Token ID4923
Feature activation+1.42e-01

Pos logit contributions
Attorney+1.081
advertising+1.077
rolet+1.063
jury+1.011
cow+0.997
Neg logit contributions
antha-1.034
iliated-0.966
 coerc-0.938
urrent-0.936
odo-0.923
Token guy
Token ID3516
Feature activation+7.04e-02

Pos logit contributions
Attorney+0.487
advertising+0.487
rolet+0.479
jury+0.448
VK+0.448
Neg logit contributions
antha-0.583
iliated-0.552
 coerc-0.537
urrent-0.534
odo-0.529
Token no
Token ID645
Feature activation-2.43e-01

Pos logit contributions
advertising+0.260
Attorney+0.256
rolet+0.254
VK+0.238
jury+0.238
Neg logit contributions
antha-0.273
iliated-0.255
 coerc-0.251
urrent-0.246
odo-0.246
Token matter
Token ID2300
Feature activation-1.30e+00

Pos logit contributions
antha+0.772
iliated+0.709
 coerc+0.702
odo+0.689
urrent+0.680
Neg logit contributions
advertising-1.066
Attorney-1.056
rolet-1.054
VK-0.995
jury-0.985
Token how
Token ID703
Feature activation-1.31e-01

Pos logit contributions
antha+4.502
iliated+4.293
 coerc+4.263
odo+4.180
 Fargo+4.085
Neg logit contributions
rolet-5.150
advertising-5.131
Attorney-5.092
VK-4.804
jury-4.637
Token am
Token ID716
Feature activation-3.75e-01

Pos logit contributions
antha+0.484
iliated+0.450
urrent+0.439
odo+0.437
 coerc+0.437
Neg logit contributions
advertising-0.506
Attorney-0.501
rolet-0.491
jury-0.469
VK-0.468
Tokeniable
Token ID3379
Feature activation-1.33e-01

Pos logit contributions
antha+1.354
iliated+1.270
 coerc+1.260
odo+1.235
urrent+1.217
Neg logit contributions
advertising-1.480
rolet-1.464
Attorney-1.462
VK-1.375
jury-1.348
Token he
Token ID339
Feature activation-7.62e-02

Pos logit contributions
antha+0.458
iliated+0.423
 coerc+0.420
odo+0.410
urrent+0.409
Neg logit contributions
advertising-0.553
Attorney-0.547
rolet-0.540
VK-0.515
jury-0.510
Token seemed
Token ID3947
Feature activation+9.81e-03

Pos logit contributions
antha+0.272
iliated+0.253
 coerc+0.250
odo+0.246
urrent+0.246
Neg logit contributions
advertising-0.302
Attorney-0.296
rolet-0.293
VK-0.278
jury-0.276
Token policy
Token ID2450
Feature activation-8.06e-02

Pos logit contributions
advertising+0.105
Attorney+0.101
rolet+0.099
VK+0.091
jury+0.090
Neg logit contributions
antha-0.172
iliated-0.164
urrent-0.161
 coerc-0.161
odo-0.160
Token.
Token ID13
Feature activation+1.31e-01

Pos logit contributions
antha+0.306
iliated+0.289
 coerc+0.278
odo+0.278
urrent+0.276
Neg logit contributions
advertising-0.308
Attorney-0.302
rolet-0.299
VK-0.277
jury-0.276
Token\n
Token ID198
Feature activation+1.07e-01

Pos logit contributions
advertising+0.502
Attorney+0.499
rolet+0.488
VK+0.460
jury+0.453
Neg logit contributions
antha-0.496
iliated-0.476
 coerc-0.475
odo-0.454
urrent-0.447
Token\n
Token ID198
Feature activation+6.56e-02

Pos logit contributions
advertising+0.407
Attorney+0.403
rolet+0.390
VK+0.373
jury+0.368
Neg logit contributions
antha-0.419
 coerc-0.403
iliated-0.397
urrent-0.374
odo-0.374
TokenUS
Token ID2937
Feature activation-3.93e-01

Pos logit contributions
advertising+0.244
Attorney+0.243
rolet+0.233
VK+0.224
jury+0.217
Neg logit contributions
antha-0.259
 coerc-0.247
iliated-0.245
odo-0.237
urrent-0.233
TokenAT
Token ID1404
Feature activation-1.28e+00

Pos logit contributions
antha+1.691
iliated+1.603
 coerc+1.588
odo+1.564
urrent+1.546
Neg logit contributions
advertising-1.280
Attorney-1.264
rolet-1.256
VK-1.169
jury-1.140
TokenOD
Token ID3727
Feature activation+3.45e-01

Pos logit contributions
antha+4.087
iliated+3.813
odo+3.763
 coerc+3.728
urrent+3.593
Neg logit contributions
rolet-5.506
advertising-5.502
Attorney-5.445
VK-5.147
jury-5.099
TokenAY
Token ID4792
Feature activation+1.67e-02

Pos logit contributions
advertising+1.073
Attorney+1.059
rolet+1.055
VK+0.985
jury+0.952
Neg logit contributions
antha-1.530
iliated-1.462
 coerc-1.450
odo-1.422
urrent-1.407
Token OP
Token ID13349
Feature activation-5.75e-02

Pos logit contributions
Attorney+0.061
advertising+0.061
rolet+0.060
VK+0.056
jury+0.055
Neg logit contributions
antha-0.064
iliated-0.062
 coerc-0.061
odo-0.059
urrent-0.059
TokenIN
Token ID1268
Feature activation+2.08e-01

Pos logit contributions
antha+0.286
iliated+0.273
 coerc+0.273
odo+0.268
urrent+0.265
Neg logit contributions
advertising-0.148
Attorney-0.146
rolet-0.145
VK-0.133
jury-0.128
TokenION
Token ID2849
Feature activation-2.00e-01

Pos logit contributions
Attorney+0.687
advertising+0.686
rolet+0.668
VK+0.636
jury+0.618
Neg logit contributions
antha-0.882
 coerc-0.835
iliated-0.835
odo-0.808
urrent-0.804
Token among
Token ID1871
Feature activation+4.87e-01

Pos logit contributions
advertising+1.441
Attorney+1.422
rolet+1.407
VK+1.317
jury+1.297
Neg logit contributions
antha-1.458
iliated-1.373
odo-1.326
urrent-1.322
 coerc-1.320
Token gay
Token ID5650
Feature activation-7.07e-02

Pos logit contributions
advertising+1.700
Attorney+1.696
rolet+1.666
VK+1.587
jury+1.551
Neg logit contributions
antha-1.948
iliated-1.799
odo-1.769
 coerc-1.761
urrent-1.753
Token teens
Token ID15508
Feature activation+1.48e-01

Pos logit contributions
antha+0.295
iliated+0.276
 coerc+0.273
urrent+0.267
odo+0.267
Neg logit contributions
advertising-0.239
Attorney-0.237
rolet-0.235
VK-0.221
jury-0.216
Token?
Token ID30
Feature activation+1.17e-01

Pos logit contributions
advertising+0.575
Attorney+0.567
rolet+0.562
VK+0.536
jury+0.527
Neg logit contributions
antha-0.545
iliated-0.501
odo-0.498
urrent-0.495
 coerc-0.495
Token Not
Token ID1892
Feature activation-3.23e-01

Pos logit contributions
advertising+0.449
Attorney+0.443
rolet+0.431
VK+0.414
jury+0.403
Neg logit contributions
antha-0.446
iliated-0.428
 coerc-0.419
odo-0.408
urrent-0.404
Token only
Token ID691
Feature activation-1.26e+00

Pos logit contributions
antha+1.155
iliated+1.071
 coerc+1.059
odo+1.045
urrent+1.039
Neg logit contributions
advertising-1.306
rolet-1.279
Attorney-1.279
VK-1.206
jury-1.184
Token did
Token ID750
Feature activation+6.48e-01

Pos logit contributions
antha+4.400
iliated+4.214
 coerc+4.207
odo+4.074
 Fargo+4.033
Neg logit contributions
advertising-4.955
rolet-4.950
Attorney-4.874
VK-4.593
jury-4.494
Token they
Token ID484
Feature activation+1.99e-01

Pos logit contributions
advertising+2.450
Attorney+2.424
rolet+2.339
VK+2.275
jury+2.268
Neg logit contributions
antha-2.386
iliated-2.263
urrent-2.164
odo-2.148
 coerc-2.148
Token want
Token ID765
Feature activation+1.77e-01

Pos logit contributions
advertising+0.729
rolet+0.723
Attorney+0.720
VK+0.673
jury+0.670
Neg logit contributions
antha-0.764
iliated-0.719
odo-0.693
urrent-0.689
 coerc-0.687
Token to
Token ID284
Feature activation+2.47e-02

Pos logit contributions
advertising+0.731
Attorney+0.717
rolet+0.703
VK+0.674
jury+0.662
Neg logit contributions
antha-0.614
iliated-0.562
 coerc-0.552
urrent-0.548
odo-0.544
Token escape
Token ID6654
Feature activation-1.04e-01

Pos logit contributions
advertising+0.095
Attorney+0.094
rolet+0.094
VK+0.089
jury+0.087
Neg logit contributions
antha-0.091
iliated-0.085
urrent-0.082
odo-0.082
 coerc-0.081
Token -
Token ID532
Feature activation-4.87e-01

Pos logit contributions
antha+0.366
iliated+0.348
 coerc+0.343
odo+0.333
urrent+0.332
Neg logit contributions
advertising-0.391
Attorney-0.385
rolet-0.378
VK-0.361
jury-0.352
Token January
Token ID3269
Feature activation-3.70e-01

Pos logit contributions
antha+2.199
iliated+2.080
 coerc+2.067
odo+2.045
urrent+2.016
Neg logit contributions
advertising-1.476
rolet-1.466
Attorney-1.452
VK-1.340
jury-1.304
Token 4
Token ID604
Feature activation-7.45e-01

Pos logit contributions
antha+1.364
iliated+1.284
 coerc+1.273
odo+1.247
urrent+1.229
Neg logit contributions
advertising-1.429
rolet-1.411
Attorney-1.410
VK-1.323
jury-1.297
Token,
Token ID11
Feature activation-6.31e-01

Pos logit contributions
antha+2.619
iliated+2.453
 coerc+2.415
odo+2.400
urrent+2.353
Neg logit contributions
advertising-2.977
rolet-2.966
Attorney-2.941
VK-2.773
jury-2.718
Token 2010
Token ID3050
Feature activation-2.77e-01

Pos logit contributions
antha+1.999
iliated+1.863
 coerc+1.847
odo+1.797
urrent+1.769
Neg logit contributions
advertising-2.783
Attorney-2.749
rolet-2.745
VK-2.596
jury-2.552
TokenUser
Token ID12982
Feature activation-1.23e+00

Pos logit contributions
antha+1.302
iliated+1.242
 coerc+1.238
odo+1.211
urrent+1.199
Neg logit contributions
advertising-0.796
Attorney-0.782
rolet-0.776
VK-0.718
jury-0.695
Token reviews
Token ID8088
Feature activation-3.94e-01

Pos logit contributions
antha+4.936
 coerc+4.745
iliated+4.708
odo+4.669
 Fargo+4.600
Neg logit contributions
rolet-4.153
Attorney-4.067
advertising-4.026
VK-3.855
jury-3.665
Token are
Token ID389
Feature activation-3.42e-01

Pos logit contributions
antha+1.441
iliated+1.353
 coerc+1.342
odo+1.314
urrent+1.295
Neg logit contributions
advertising-1.542
rolet-1.523
Attorney-1.522
VK-1.428
jury-1.399
Token a
Token ID257
Feature activation+1.98e-01

Pos logit contributions
antha+1.103
iliated+1.029
 coerc+1.018
odo+0.992
urrent+0.980
Neg logit contributions
advertising-1.485
rolet-1.464
Attorney-1.464
VK-1.385
jury-1.364
Token bad
Token ID2089
Feature activation+6.00e-01

Pos logit contributions
advertising+0.720
rolet+0.704
Attorney+0.703
VK+0.651
jury+0.646
Neg logit contributions
antha-0.790
iliated-0.745
 coerc-0.738
urrent-0.717
odo-0.711
Token idea
Token ID2126
Feature activation-4.17e-02

Pos logit contributions
advertising+2.601
Attorney+2.542
rolet+2.509
jury+2.391
VK+2.360
Neg logit contributions
antha-1.972
 coerc-1.841
iliated-1.831
urrent-1.757
irc-1.711
Token future
Token ID2003
Feature activation-1.00e-01

Pos logit contributions
advertising+0.650
rolet+0.642
Attorney+0.641
VK+0.609
jury+0.601
Neg logit contributions
antha-0.439
 coerc-0.410
iliated-0.408
urrent-0.393
odo-0.392
Token ahead
Token ID4058
Feature activation-1.77e-01

Pos logit contributions
antha+0.327
iliated+0.304
 coerc+0.302
odo+0.294
urrent+0.288
Neg logit contributions
advertising-0.437
Attorney-0.428
rolet-0.427
VK-0.405
jury-0.398
Token of
Token ID286
Feature activation+6.75e-02

Pos logit contributions
antha+0.619
iliated+0.579
 coerc+0.573
odo+0.562
urrent+0.553
Neg logit contributions
advertising-0.725
rolet-0.715
Attorney-0.715
VK-0.673
jury-0.660
Token it
Token ID340
Feature activation+3.30e-01

Pos logit contributions
advertising+0.276
Attorney+0.270
rolet+0.270
VK+0.256
jury+0.250
Neg logit contributions
antha-0.237
 coerc-0.221
iliated-0.221
odo-0.217
urrent-0.212
Token no
Token ID645
Feature activation-4.62e-02

Pos logit contributions
advertising+1.276
rolet+1.241
Attorney+1.236
VK+1.157
jury+1.144
Neg logit contributions
antha-1.240
iliated-1.165
 coerc-1.152
odo-1.124
urrent-1.109
Token matter
Token ID2300
Feature activation-1.23e+00

Pos logit contributions
antha+0.146
iliated+0.134
 coerc+0.134
odo+0.130
urrent+0.128
Neg logit contributions
advertising-0.204
Attorney-0.202
rolet-0.202
VK-0.191
jury-0.189
Token what
Token ID644
Feature activation-3.05e-01

Pos logit contributions
antha+4.242
iliated+4.058
 coerc+4.004
odo+3.932
 Fargo+3.919
Neg logit contributions
rolet-4.828
Attorney-4.806
advertising-4.733
VK-4.486
cow-4.367
Token platform
Token ID3859
Feature activation+3.24e-01

Pos logit contributions
antha+1.078
iliated+1.011
 coerc+1.000
odo+0.981
urrent+0.967
Neg logit contributions
advertising-1.230
Attorney-1.213
rolet-1.211
VK-1.142
jury-1.122
Token you
Token ID345
Feature activation+2.23e-01

Pos logit contributions
advertising+1.296
Attorney+1.272
rolet+1.260
VK+1.201
jury+1.172
Neg logit contributions
antha-1.153
iliated-1.090
 coerc-1.080
odo-1.043
urrent-1.041
Token will
Token ID481
Feature activation+2.41e-01

Pos logit contributions
rolet+0.803
advertising+0.795
Attorney+0.791
VK+0.749
jury+0.728
Neg logit contributions
antha-0.872
iliated-0.824
 coerc-0.811
odo-0.795
urrent-0.791
Token be
Token ID307
Feature activation+2.86e-01

Pos logit contributions
rolet+0.927
advertising+0.916
Attorney+0.908
VK+0.866
jury+0.848
Neg logit contributions
antha-0.882
iliated-0.830
 coerc-0.810
odo-0.798
urrent-0.789
Token
Token ID247
Feature activation-8.74e-02

Pos logit contributions
advertising+0.119
rolet+0.116
Attorney+0.116
VK+0.108
jury+0.107
Neg logit contributions
antha-0.145
 coerc-0.136
iliated-0.135
odo-0.133
 Fargo-0.131
Token to
Token ID284
Feature activation+3.91e-01

Pos logit contributions
antha+0.312
iliated+0.290
 coerc+0.286
odo+0.284
urrent+0.279
Neg logit contributions
advertising-0.351
rolet-0.345
Attorney-0.344
VK-0.323
jury-0.318
Token life
Token ID1204
Feature activation+4.47e-01

Pos logit contributions
advertising+1.483
rolet+1.434
Attorney+1.429
VK+1.350
jury+1.322
Neg logit contributions
antha-1.504
iliated-1.393
urrent-1.344
odo-1.327
 coerc-1.327
Token,
Token ID11
Feature activation+3.84e-01

Pos logit contributions
advertising+1.723
Attorney+1.651
rolet+1.643
jury+1.552
VK+1.539
Neg logit contributions
antha-1.690
iliated-1.559
odo-1.507
urrent-1.501
 coerc-1.500
Token no
Token ID645
Feature activation-1.84e-02

Pos logit contributions
advertising+1.461
Attorney+1.421
rolet+1.419
VK+1.331
jury+1.331
Neg logit contributions
antha-1.453
iliated-1.360
urrent-1.309
 coerc-1.304
odo-1.303
Token matter
Token ID2300
Feature activation-1.19e+00

Pos logit contributions
antha+0.062
iliated+0.057
 coerc+0.055
urrent+0.055
odo+0.054
Neg logit contributions
advertising-0.078
rolet-0.077
Attorney-0.077
VK-0.073
jury-0.072
Token what
Token ID644
Feature activation-2.53e-01

Pos logit contributions
antha+3.918
 coerc+3.793
iliated+3.705
odo+3.607
 Fargo+3.542
Neg logit contributions
advertising-4.952
rolet-4.937
Attorney-4.930
VK-4.648
jury-4.496
Token the
Token ID262
Feature activation-7.01e-02

Pos logit contributions
antha+0.930
iliated+0.872
 coerc+0.851
odo+0.845
urrent+0.837
Neg logit contributions
advertising-0.981
Attorney-0.967
rolet-0.963
VK-0.907
jury-0.898
Token conditions
Token ID3403
Feature activation-1.04e-01

Pos logit contributions
antha+0.254
iliated+0.238
odo+0.230
urrent+0.229
 coerc+0.229
Neg logit contributions
advertising-0.276
Attorney-0.272
rolet-0.271
VK-0.256
jury-0.253
Token,
Token ID11
Feature activation+1.91e-01

Pos logit contributions
antha+0.390
iliated+0.362
 coerc+0.352
odo+0.349
urrent+0.345
Neg logit contributions
advertising-0.400
rolet-0.395
Attorney-0.395
VK-0.371
jury-0.363
Token are
Token ID389
Feature activation-1.39e-01

Pos logit contributions
advertising+0.728
rolet+0.705
Attorney+0.702
VK+0.653
jury+0.650
Neg logit contributions
antha-0.720
iliated-0.677
odo-0.657
urrent-0.650
 coerc-0.650
Token Ex
Token ID1475
Feature activation-2.75e-01

Pos logit contributions
antha+0.665
iliated+0.623
 coerc+0.620
odo+0.609
urrent+0.599
Neg logit contributions
advertising-0.699
Attorney-0.696
rolet-0.689
VK-0.656
jury-0.636
Token Chief
Token ID5953
Feature activation-8.08e-02

Pos logit contributions
antha+1.023
iliated+0.964
 coerc+0.964
odo+0.932
urrent+0.926
Neg logit contributions
advertising-1.047
Attorney-1.037
rolet-1.037
VK-0.978
jury-0.958
Token Minister
Token ID4139
Feature activation+1.60e-01

Pos logit contributions
antha+0.331
iliated+0.313
 coerc+0.312
odo+0.305
urrent+0.301
Neg logit contributions
Attorney-0.277
advertising-0.277
rolet-0.273
VK-0.258
jury-0.249
Token of
Token ID286
Feature activation-2.43e-01

Pos logit contributions
advertising+0.617
Attorney+0.613
rolet+0.603
VK+0.585
cow+0.562
Neg logit contributions
antha-0.583
iliated-0.549
 coerc-0.546
odo-0.528
urrent-0.520
Token H
Token ID367
Feature activation-9.45e-02

Pos logit contributions
antha+0.888
iliated+0.834
 coerc+0.829
odo+0.808
urrent+0.795
Neg logit contributions
advertising-0.948
Attorney-0.944
rolet-0.935
VK-0.890
jury-0.867
Tokenary
Token ID560
Feature activation-1.18e+00

Pos logit contributions
antha+0.278
 coerc+0.264
iliated+0.255
 Fargo+0.246
odo+0.244
Neg logit contributions
advertising-0.431
Attorney-0.429
rolet-0.423
VK-0.417
uala-0.399
Tokenana
Token ID2271
Feature activation-7.75e-02

Pos logit contributions
antha+3.619
iliated+3.288
odo+3.253
 coerc+3.191
urrent+3.152
Neg logit contributions
rolet-5.287
advertising-5.234
Attorney-5.219
VK-4.928
jury-4.860
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
antha+0.292
iliated+0.274
 coerc+0.272
odo+0.267
urrent+0.262
Neg logit contributions
advertising-0.292
Attorney-0.291
rolet-0.289
VK-0.274
jury-0.267
Token On
Token ID1550
Feature activation+2.54e-01

Pos logit contributions
antha+0.081
iliated+0.076
 coerc+0.076
odo+0.074
urrent+0.073
Neg logit contributions
advertising-0.076
Attorney-0.076
rolet-0.075
VK-0.071
jury-0.069
Token 18
Token ID1248
Feature activation-5.66e-02

Pos logit contributions
advertising+0.852
Attorney+0.845
rolet+0.831
VK+0.792
jury+0.774
Neg logit contributions
antha-1.064
iliated-1.011
 coerc-0.994
odo-0.977
urrent-0.967
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
antha+0.242
iliated+0.229
 coerc+0.227
odo+0.223
urrent+0.221
Neg logit contributions
advertising-0.186
Attorney-0.184
rolet-0.182
VK-0.172
jury-0.167
Token their
Token ID511
Feature activation+1.16e-01

Pos logit contributions
advertising+3.166
Attorney+3.094
rolet+3.015
jury+2.922
VK+2.892
Neg logit contributions
antha-3.516
iliated-3.353
urrent-3.227
odo-3.197
 coerc-3.193
Token creation
Token ID6282
Feature activation+8.49e-03

Pos logit contributions
advertising+0.435
Attorney+0.427
rolet+0.421
VK+0.398
jury+0.397
Neg logit contributions
antha-0.434
iliated-0.413
 coerc-0.400
urrent-0.395
odo-0.394
Token works
Token ID2499
Feature activation-4.77e-02

Pos logit contributions
advertising+0.034
Attorney+0.034
rolet+0.034
VK+0.032
jury+0.032
Neg logit contributions
antha-0.030
iliated-0.028
 coerc-0.027
urrent-0.027
odo-0.027
Token,
Token ID11
Feature activation+4.01e-01

Pos logit contributions
antha+0.184
iliated+0.173
odo+0.168
 coerc+0.167
urrent+0.166
Neg logit contributions
advertising-0.178
Attorney-0.176
rolet-0.175
VK-0.162
jury-0.161
Token they
Token ID484
Feature activation+5.15e-02

Pos logit contributions
advertising+1.456
Attorney+1.451
rolet+1.446
VK+1.357
jury+1.336
Neg logit contributions
antha-1.513
iliated-1.415
 coerc-1.390
odo-1.388
urrent-1.358
Token tested
Token ID6789
Feature activation+8.95e-01

Pos logit contributions
rolet+0.199
advertising+0.198
Attorney+0.196
VK+0.184
jury+0.183
Neg logit contributions
antha-0.189
iliated-0.176
odo-0.170
 coerc-0.170
urrent-0.167
Token their
Token ID511
Feature activation-2.35e-01

Pos logit contributions
advertising+3.086
Attorney+3.041
rolet+2.955
jury+2.867
VK+2.852
Neg logit contributions
antha-3.475
iliated-3.269
odo-3.155
urrent-3.124
 coerc-3.119
Token design
Token ID1486
Feature activation+1.37e-01

Pos logit contributions
antha+0.869
iliated+0.822
 coerc+0.802
odo+0.791
urrent+0.785
Neg logit contributions
advertising-0.900
Attorney-0.889
rolet-0.883
VK-0.832
jury-0.822
Token by
Token ID416
Feature activation+1.12e-02

Pos logit contributions
advertising+0.516
Attorney+0.509
rolet+0.505
VK+0.473
jury+0.471
Neg logit contributions
antha-0.512
iliated-0.486
 coerc-0.470
odo-0.465
urrent-0.460
Token exposing
Token ID21294
Feature activation+2.25e-01

Pos logit contributions
advertising+0.044
rolet+0.043
Attorney+0.043
jury+0.040
VK+0.040
Neg logit contributions
antha-0.041
iliated-0.039
odo-0.037
urrent-0.037
 coerc-0.036
Token it
Token ID340
Feature activation-1.16e-01

Pos logit contributions
advertising+0.810
Attorney+0.804
rolet+0.798
VK+0.759
jury+0.755
Neg logit contributions
antha-0.861
iliated-0.804
odo-0.788
 coerc-0.783
urrent-0.779
Token953
Token ID49649
Feature activation+8.33e-01

Pos logit contributions
advertising+1.384
Attorney+1.375
rolet+1.355
VK+1.217
jury+1.211
Neg logit contributions
antha-3.231
 coerc-3.102
iliated-3.094
odo-3.052
urrent-2.988
Token 16
Token ID1467
Feature activation+4.36e-01

Pos logit contributions
rolet+3.071
advertising+3.059
Attorney+3.021
VK+2.946
jury+2.817
Neg logit contributions
antha-3.133
iliated-2.894
 coerc-2.891
odo-2.808
 Fargo-2.780
Token27
Token ID1983
Feature activation+2.34e-01

Pos logit contributions
advertising+1.219
Attorney+1.183
rolet+1.151
VK+1.086
jury+1.045
Neg logit contributions
antha-2.124
 coerc-2.044
iliated-2.029
odo-1.977
 Fargo-1.939
Token145
Token ID18781
Feature activation+2.99e-01

Pos logit contributions
advertising+0.736
Attorney+0.713
rolet+0.698
VK+0.661
jury+0.633
Neg logit contributions
antha-1.039
iliated-1.017
 coerc-1.011
odo-0.985
urrent-0.969
Token Flat
Token ID21939
Feature activation+5.22e-01

Pos logit contributions
advertising+0.870
rolet+0.864
Attorney+0.854
VK+0.804
jury+0.771
Neg logit contributions
antha-1.376
iliated-1.321
 coerc-1.305
odo-1.285
urrent-1.262
Tokenbuff
Token ID36873
Feature activation+7.15e-01

Pos logit contributions
advertising+1.644
rolet+1.610
Attorney+1.595
VK+1.479
jury+1.470
Neg logit contributions
antha-2.300
 coerc-2.218
iliated-2.184
 Fargo-2.115
odo-2.107
Tokeners
Token ID364
Feature activation+7.49e-01

Pos logit contributions
advertising+2.658
Attorney+2.579
rolet+2.578
VK+2.382
jury+2.363
Neg logit contributions
antha-2.768
 coerc-2.670
iliated-2.608
 Fargo-2.505
odo-2.485
TokenReading
Token ID36120
Feature activation-1.51e-01

Pos logit contributions
advertising+2.496
Attorney+2.435
rolet+2.322
VK+2.303
jury+2.140
Neg logit contributions
antha-3.279
 coerc-3.196
iliated-3.109
odo-3.007
 Fargo-2.933
Token 106
Token ID15696
Feature activation+6.05e-01

Pos logit contributions
antha+0.566
iliated+0.536
 coerc+0.530
odo+0.518
urrent+0.509
Neg logit contributions
advertising-0.574
Attorney-0.567
rolet-0.563
VK-0.535
jury-0.522
Token378
Token ID30695
Feature activation+1.80e-01

Pos logit contributions
advertising+1.898
Attorney+1.853
rolet+1.813
VK+1.737
jury+1.651
Neg logit contributions
antha-2.688
iliated-2.577
 coerc-2.573
odo-2.477
urrent-2.438
Token46
Token ID3510
Feature activation+2.85e-01

Pos logit contributions
advertising+0.596
Attorney+0.586
rolet+0.582
VK+0.553
jury+0.532
Neg logit contributions
antha-0.770
iliated-0.738
 coerc-0.732
odo-0.710
urrent-0.702
Token highlights
Token ID11330
Feature activation-7.25e-01

Pos logit contributions
antha+0.228
 coerc+0.213
iliated+0.211
odo+0.203
urrent+0.199
Neg logit contributions
advertising-0.402
Attorney-0.401
rolet-0.394
VK-0.377
jury-0.372
Token Secretary
Token ID4986
Feature activation+4.25e-01

Pos logit contributions
antha+2.712
iliated+2.541
 coerc+2.517
odo+2.484
urrent+2.443
Neg logit contributions
advertising-2.750
rolet-2.724
Attorney-2.706
VK-2.544
jury-2.493
Token of
Token ID286
Feature activation+4.45e-01

Pos logit contributions
advertising+1.623
Attorney+1.615
rolet+1.585
VK+1.500
jury+1.487
Neg logit contributions
antha-1.545
iliated-1.486
 coerc-1.438
odo-1.430
urrent-1.407
Token Defense
Token ID5947
Feature activation+3.57e-01

Pos logit contributions
advertising+1.345
Attorney+1.343
rolet+1.310
VK+1.222
jury+1.207
Neg logit contributions
antha-1.989
iliated-1.922
 coerc-1.920
odo-1.864
urrent-1.832
Token Chuck
Token ID14663
Feature activation-2.02e-01

Pos logit contributions
advertising+1.307
rolet+1.288
Attorney+1.283
VK+1.206
jury+1.161
Neg logit contributions
antha-1.377
iliated-1.312
 coerc-1.271
odo-1.247
urrent-1.245
Token Hag
Token ID21375
Feature activation+8.56e-01

Pos logit contributions
antha+0.694
iliated+0.655
 coerc+0.652
odo+0.631
urrent+0.616
Neg logit contributions
advertising-0.829
rolet-0.827
Attorney-0.825
VK-0.785
jury-0.756
Tokenel
Token ID417
Feature activation+1.84e-01

Pos logit contributions
Attorney+3.019
rolet+2.930
advertising+2.854
jury+2.797
bush+2.768
Neg logit contributions
antha-3.368
 coerc-3.179
iliated-3.124
urrent-3.045
odo-2.977
Token will
Token ID481
Feature activation-9.46e-02

Pos logit contributions
advertising+0.686
Attorney+0.682
rolet+0.675
VK+0.632
jury+0.630
Neg logit contributions
antha-0.698
iliated-0.657
 coerc-0.645
odo-0.636
urrent-0.630
Token step
Token ID2239
Feature activation-2.52e-01

Pos logit contributions
antha+0.358
iliated+0.338
 coerc+0.330
odo+0.325
urrent+0.323
Neg logit contributions
rolet-0.355
advertising-0.354
Attorney-0.353
VK-0.328
jury-0.325
Token down
Token ID866
Feature activation+4.76e-01

Pos logit contributions
antha+0.981
iliated+0.928
 coerc+0.914
odo+0.897
urrent+0.892
Neg logit contributions
advertising-0.933
Attorney-0.920
rolet-0.918
VK-0.852
jury-0.846
Token,
Token ID11
Feature activation+2.62e-02

Pos logit contributions
advertising+1.748
Attorney+1.740
rolet+1.713
VK+1.593
jury+1.592
Neg logit contributions
antha-1.829
iliated-1.728
 coerc-1.696
odo-1.673
urrent-1.663
Token reports
Token ID3136
Feature activation+3.50e-01

Pos logit contributions
advertising+0.582
Attorney+0.580
rolet+0.561
VK+0.536
jury+0.525
Neg logit contributions
antha-0.685
iliated-0.637
 coerc-0.630
odo-0.620
urrent-0.617
Token to
Token ID284
Feature activation-1.01e-01

Pos logit contributions
advertising+1.302
Attorney+1.295
rolet+1.279
VK+1.206
jury+1.184
Neg logit contributions
antha-1.346
 coerc-1.242
iliated-1.240
odo-1.203
urrent-1.198
Token make
Token ID787
Feature activation+1.38e-01

Pos logit contributions
antha+0.377
iliated+0.352
 coerc+0.345
odo+0.341
urrent+0.336
Neg logit contributions
Attorney-0.383
advertising-0.383
rolet-0.381
VK-0.358
jury-0.351
Token the
Token ID262
Feature activation+2.60e-01

Pos logit contributions
advertising+0.532
Attorney+0.523
rolet+0.510
VK+0.488
jury+0.482
Neg logit contributions
antha-0.520
iliated-0.482
 coerc-0.476
odo-0.468
urrent-0.464
Token connection
Token ID4637
Feature activation+4.27e-01

Pos logit contributions
advertising+0.961
Attorney+0.949
rolet+0.931
VK+0.886
jury+0.877
Neg logit contributions
antha-1.017
iliated-0.952
 coerc-0.941
urrent-0.921
odo-0.919
Token between
Token ID1022
Feature activation+6.91e-01

Pos logit contributions
advertising+1.589
Attorney+1.581
rolet+1.551
VK+1.469
jury+1.424
Neg logit contributions
antha-1.644
 coerc-1.533
iliated-1.512
odo-1.491
urrent-1.478
Token Russian
Token ID3394
Feature activation+2.67e-01

Pos logit contributions
Attorney+2.434
advertising+2.402
rolet+2.373
VK+2.273
jury+2.205
Neg logit contributions
antha-2.762
 coerc-2.586
iliated-2.536
odo-2.521
urrent-2.513
Token hackers
Token ID17110
Feature activation-6.56e-02

Pos logit contributions
Attorney+0.934
advertising+0.931
rolet+0.919
VK+0.859
jury+0.833
Neg logit contributions
antha-1.098
iliated-1.027
 coerc-1.002
urrent-0.993
odo-0.990
Token and
Token ID290
Feature activation+2.26e-01

Pos logit contributions
antha+0.244
iliated+0.226
 coerc+0.226
odo+0.221
urrent+0.219
Neg logit contributions
advertising-0.253
rolet-0.252
Attorney-0.251
VK-0.236
jury-0.231
Token the
Token ID262
Feature activation+9.52e-02

Pos logit contributions
advertising+0.860
Attorney+0.860
rolet+0.846
VK+0.805
jury+0.784
Neg logit contributions
antha-0.846
 coerc-0.790
iliated-0.780
odo-0.772
urrent-0.769
Token Democratic
Token ID4390
Feature activation+3.81e-01

Pos logit contributions
advertising+0.391
Attorney+0.389
rolet+0.383
VK+0.365
jury+0.363
Neg logit contributions
antha-0.330
 coerc-0.304
iliated-0.302
urrent-0.295
odo-0.294
Token and
Token ID290
Feature activation-1.22e-01

Pos logit contributions
antha+0.036
iliated+0.034
 coerc+0.034
odo+0.033
urrent+0.032
Neg logit contributions
advertising-0.037
rolet-0.037
Attorney-0.037
VK-0.034
jury-0.034
Token make
Token ID787
Feature activation+1.92e-01

Pos logit contributions
antha+0.460
iliated+0.432
 coerc+0.424
odo+0.418
urrent+0.414
Neg logit contributions
advertising-0.463
Attorney-0.456
rolet-0.455
VK-0.428
jury-0.419
Token a
Token ID257
Feature activation+3.25e-01

Pos logit contributions
advertising+0.744
rolet+0.730
Attorney+0.723
VK+0.682
jury+0.678
Neg logit contributions
antha-0.708
iliated-0.656
 coerc-0.651
odo-0.628
urrent-0.626
Token few
Token ID1178
Feature activation+5.76e-01

Pos logit contributions
advertising+1.187
rolet+1.179
Attorney+1.154
VK+1.091
jury+1.082
Neg logit contributions
antha-1.269
iliated-1.189
 coerc-1.165
urrent-1.141
odo-1.134
Token subtle
Token ID11800
Feature activation+3.97e-01

Pos logit contributions
advertising+2.080
rolet+1.998
Attorney+1.976
VK+1.856
jury+1.842
Neg logit contributions
antha-2.292
iliated-2.131
 coerc-2.070
urrent-2.049
odo-2.039
Token changes
Token ID2458
Feature activation+6.65e-01

Pos logit contributions
advertising+1.348
rolet+1.310
Attorney+1.289
VK+1.185
jury+1.181
Neg logit contributions
antha-1.672
iliated-1.567
 coerc-1.527
urrent-1.519
odo-1.502
Token such
Token ID884
Feature activation+1.54e-02

Pos logit contributions
advertising+2.649
rolet+2.609
Attorney+2.570
VK+2.409
jury+2.393
Neg logit contributions
antha-2.378
iliated-2.186
 coerc-2.155
odo-2.094
urrent-2.091
Token as
Token ID355
Feature activation+2.28e-01

Pos logit contributions
advertising+0.060
Attorney+0.059
rolet+0.059
VK+0.055
jury+0.054
Neg logit contributions
antha-0.057
iliated-0.054
 coerc-0.053
odo-0.052
urrent-0.052
Token not
Token ID407
Feature activation-9.08e-02

Pos logit contributions
advertising+0.862
rolet+0.847
Attorney+0.844
VK+0.793
jury+0.781
Neg logit contributions
antha-0.867
iliated-0.814
 coerc-0.796
urrent-0.782
odo-0.778
Token using
Token ID1262
Feature activation-9.05e-02

Pos logit contributions
antha+0.338
iliated+0.317
 coerc+0.311
urrent+0.306
odo+0.306
Neg logit contributions
advertising-0.350
rolet-0.345
Attorney-0.343
VK-0.322
jury-0.319
Token technology
Token ID3037
Feature activation+4.02e-02

Pos logit contributions
antha+0.347
iliated+0.326
 coerc+0.319
odo+0.316
urrent+0.312
Neg logit contributions
advertising-0.338
rolet-0.332
Attorney-0.332
VK-0.313
jury-0.307
Token In
Token ID554
Feature activation-1.40e-01

Pos logit contributions
advertising+1.030
rolet+1.021
Attorney+1.020
VK+0.948
jury+0.935
Neg logit contributions
antha-1.119
iliated-1.054
 coerc-1.043
odo-1.020
urrent-1.014
Token.
Token ID13
Feature activation+5.41e-02

Pos logit contributions
antha+0.517
iliated+0.485
 coerc+0.479
odo+0.467
urrent+0.464
Neg logit contributions
advertising-0.541
rolet-0.534
Attorney-0.533
VK-0.498
jury-0.497
Token\n
Token ID198
Feature activation-1.28e-01

Pos logit contributions
advertising+0.209
rolet+0.207
Attorney+0.207
VK+0.194
jury+0.190
Neg logit contributions
antha-0.201
iliated-0.192
 coerc-0.189
odo-0.184
urrent-0.181
Token\n
Token ID198
Feature activation-1.52e-01

Pos logit contributions
antha+0.466
 coerc+0.440
iliated+0.439
odo+0.421
urrent+0.417
Neg logit contributions
advertising-0.507
Attorney-0.501
rolet-0.496
VK-0.470
jury-0.462
TokenCall
Token ID14134
Feature activation-1.20e-01

Pos logit contributions
antha+0.532
iliated+0.499
 coerc+0.496
odo+0.484
urrent+0.476
Neg logit contributions
advertising-0.618
Attorney-0.611
rolet-0.608
VK-0.575
jury-0.562
Token 440
Token ID33879
Feature activation+5.77e-01

Pos logit contributions
antha+0.423
iliated+0.400
 coerc+0.395
odo+0.385
urrent+0.383
Neg logit contributions
advertising-0.482
Attorney-0.482
rolet-0.478
VK-0.451
jury-0.443
Token-
Token ID12
Feature activation+3.65e-01

Pos logit contributions
advertising+2.003
rolet+2.001
Attorney+1.990
VK+1.879
jury+1.810
Neg logit contributions
antha-2.337
iliated-2.205
 coerc-2.196
urrent-2.126
odo-2.126
Token327
Token ID34159
Feature activation+2.82e-01

Pos logit contributions
advertising+1.171
Attorney+1.154
rolet+1.131
VK+1.088
jury+1.023
Neg logit contributions
antha-1.613
 coerc-1.544
iliated-1.538
odo-1.480
urrent-1.470
Token-
Token ID12
Feature activation+1.43e-01

Pos logit contributions
advertising+0.997
Attorney+0.975
rolet+0.962
VK+0.909
jury+0.888
Neg logit contributions
antha-1.167
 coerc-1.104
iliated-1.101
odo-1.055
urrent-1.052
Token9
Token ID24
Feature activation+3.58e-01

Pos logit contributions
advertising+0.455
Attorney+0.446
rolet+0.434
VK+0.417
jury+0.393
Neg logit contributions
antha-0.644
 coerc-0.621
iliated-0.617
odo-0.592
urrent-0.587
Token595
Token ID35124
Feature activation-6.54e-02

Pos logit contributions
advertising+1.101
Attorney+1.085
rolet+1.071
VK+1.009
jury+0.963
Neg logit contributions
antha-1.623
 coerc-1.545
iliated-1.539
odo-1.491
urrent-1.480
Token University
Token ID2059
Feature activation+1.12e-02

Pos logit contributions
Attorney+0.251
advertising+0.249
rolet+0.248
VK+0.235
jury+0.233
Neg logit contributions
antha-0.209
iliated-0.196
 coerc-0.195
odo-0.190
urrent-0.187
Token of
Token ID286
Feature activation+3.85e-02

Pos logit contributions
advertising+0.044
Attorney+0.044
rolet+0.043
VK+0.041
jury+0.040
Neg logit contributions
antha-0.041
 coerc-0.039
iliated-0.038
odo-0.037
urrent-0.036
Token South
Token ID2520
Feature activation+2.32e-03

Pos logit contributions
advertising+0.140
Attorney+0.139
rolet+0.136
VK+0.129
jury+0.127
Neg logit contributions
antha-0.153
iliated-0.145
 coerc-0.144
odo-0.140
urrent-0.138
Token Australia
Token ID4505
Feature activation-5.45e-02

Pos logit contributions
advertising+0.009
rolet+0.009
Attorney+0.009
VK+0.008
jury+0.008
Neg logit contributions
antha-0.009
 coerc-0.008
iliated-0.008
odo-0.008
urrent-0.008
Token,
Token ID11
Feature activation-1.69e-01

Pos logit contributions
antha+0.203
 coerc+0.191
iliated+0.191
odo+0.187
urrent+0.184
Neg logit contributions
advertising-0.207
Attorney-0.206
rolet-0.204
VK-0.190
jury-0.188
Token an
Token ID281
Feature activation+5.77e-02

Pos logit contributions
antha+0.647
iliated+0.606
 coerc+0.604
odo+0.600
urrent+0.590
Neg logit contributions
Attorney-0.624
advertising-0.617
rolet-0.607
VK-0.575
jury-0.563
Token Emer
Token ID10320
Feature activation+2.36e-01

Pos logit contributions
Attorney+0.200
advertising+0.194
rolet+0.191
jury+0.178
VK+0.177
Neg logit contributions
antha-0.233
 coerc-0.221
iliated-0.218
odo-0.218
irc-0.211
Tokenitus
Token ID17506
Feature activation+5.03e-01

Pos logit contributions
Attorney+0.781
advertising+0.769
rolet+0.769
VK+0.716
jury+0.707
Neg logit contributions
antha-0.987
 coerc-0.950
iliated-0.926
odo-0.911
 Fargo-0.902
Token Professor
Token ID8129
Feature activation+4.08e-01

Pos logit contributions
Attorney+1.386
advertising+1.300
rolet+1.295
VK+1.189
jury+1.177
Neg logit contributions
antha-2.383
 coerc-2.309
iliated-2.251
odo-2.228
ocity-2.174
Token of
Token ID286
Feature activation-3.98e-02

Pos logit contributions
Attorney+1.530
advertising+1.492
rolet+1.487
VK+1.389
jury+1.374
Neg logit contributions
antha-1.521
 coerc-1.430
odo-1.401
iliated-1.400
irc-1.350
Token Economics
Token ID18963
Feature activation+2.63e-01

Pos logit contributions
antha+0.169
iliated+0.161
 coerc+0.161
odo+0.159
urrent+0.154
Neg logit contributions
Attorney-0.130
advertising-0.129
rolet-0.127
VK-0.118
jury-0.117
TokenAt
Token ID2953
Feature activation-2.47e-02

Pos logit contributions
antha+0.268
iliated+0.254
 coerc+0.254
odo+0.247
urrent+0.244
Neg logit contributions
advertising-0.241
Attorney-0.237
rolet-0.236
VK-0.222
jury-0.216
Token&
Token ID5
Feature activation-3.41e-01

Pos logit contributions
antha+0.087
iliated+0.083
 coerc+0.082
urrent+0.079
odo+0.079
Neg logit contributions
advertising-0.098
rolet-0.098
Attorney-0.097
VK-0.091
jury-0.089
TokenT
Token ID51
Feature activation-2.14e-01

Pos logit contributions
antha+1.393
iliated+1.306
 coerc+1.302
odo+1.274
urrent+1.265
Neg logit contributions
advertising-1.200
Attorney-1.183
rolet-1.173
VK-1.107
jury-1.073
Token
Token ID447
Feature activation+2.10e-01

Pos logit contributions
antha+0.847
iliated+0.799
 coerc+0.794
odo+0.778
urrent+0.769
Neg logit contributions
advertising-0.768
rolet-0.760
Attorney-0.758
VK-0.710
jury-0.692
Token
Token ID251
Feature activation-8.16e-03

Pos logit contributions
advertising+0.704
Attorney+0.693
rolet+0.693
VK+0.655
jury+0.634
Neg logit contributions
antha-0.881
 coerc-0.834
iliated-0.825
odo-0.805
urrent-0.800
Token variant
Token ID15304
Feature activation+5.57e-01

Pos logit contributions
antha+0.031
iliated+0.029
 coerc+0.029
odo+0.029
urrent+0.028
Neg logit contributions
advertising-0.030
rolet-0.030
Attorney-0.030
VK-0.028
jury-0.027
Token of
Token ID286
Feature activation+6.33e-02

Pos logit contributions
rolet+2.079
advertising+2.075
Attorney+2.058
VK+1.940
jury+1.878
Neg logit contributions
antha-2.097
iliated-1.973
 coerc-1.961
odo-1.918
urrent-1.890
Token the
Token ID262
Feature activation+9.25e-02

Pos logit contributions
advertising+0.249
rolet+0.248
Attorney+0.245
VK+0.231
jury+0.226
Neg logit contributions
antha-0.229
iliated-0.216
 coerc-0.214
odo-0.209
urrent-0.207
Token Nexus
Token ID16756
Feature activation+8.32e-02

Pos logit contributions
advertising+0.317
rolet+0.316
Attorney+0.311
VK+0.290
jury+0.283
Neg logit contributions
antha-0.382
iliated-0.363
 coerc-0.359
odo-0.351
urrent-0.348
Token 6
Token ID718
Feature activation-4.93e-01

Pos logit contributions
advertising+0.327
rolet+0.324
Attorney+0.321
VK+0.303
jury+0.298
Neg logit contributions
antha-0.304
iliated-0.284
 coerc-0.281
odo-0.276
urrent-0.274
Token.
Token ID13
Feature activation-5.95e-02

Pos logit contributions
antha+1.772
iliated+1.678
 coerc+1.656
odo+1.629
urrent+1.607
Neg logit contributions
advertising-1.928
rolet-1.902
Attorney-1.897
VK-1.780
jury-1.748
Token.
Token ID13
Feature activation-3.13e-01

Pos logit contributions
antha+1.458
 coerc+1.375
iliated+1.372
odo+1.317
urrent+1.307
Neg logit contributions
advertising-1.574
Attorney-1.551
rolet-1.533
VK-1.455
jury-1.430
Tokenx
Token ID87
Feature activation-5.15e-01

Pos logit contributions
antha+1.131
 coerc+1.070
iliated+1.065
odo+1.023
urrent+1.012
Neg logit contributions
advertising-1.252
Attorney-1.235
rolet-1.224
VK-1.159
jury-1.140
Token.
Token ID13
Feature activation-5.29e-01

Pos logit contributions
antha+1.842
iliated+1.729
 coerc+1.723
odo+1.670
urrent+1.651
Neg logit contributions
advertising-2.068
Attorney-2.041
rolet-2.033
VK-1.919
jury-1.885
Tokenx
Token ID87
Feature activation-5.18e-01

Pos logit contributions
antha+1.853
iliated+1.736
 coerc+1.725
odo+1.684
urrent+1.659
Neg logit contributions
advertising-2.149
Attorney-2.123
rolet-2.123
VK-1.999
jury-1.962
Token.
Token ID13
Feature activation-5.91e-01

Pos logit contributions
antha+1.799
iliated+1.683
 coerc+1.663
odo+1.638
urrent+1.610
Neg logit contributions
advertising-2.104
rolet-2.086
Attorney-2.080
VK-1.961
jury-1.922
Token143
Token ID21139
Feature activation-1.51e-02

Pos logit contributions
antha+2.593
iliated+2.449
odo+2.385
 coerc+2.383
irc+2.352
Neg logit contributions
rolet-1.851
advertising-1.834
Attorney-1.808
VK-1.691
jury-1.641
Token 10
Token ID838
Feature activation-5.72e-01

Pos logit contributions
antha+0.053
iliated+0.051
 coerc+0.050
odo+0.048
urrent+0.048
Neg logit contributions
advertising-0.060
rolet-0.060
Attorney-0.060
VK-0.056
jury-0.055
Token33
Token ID2091
Feature activation-3.90e-01

Pos logit contributions
antha+2.457
iliated+2.326
 coerc+2.296
odo+2.292
urrent+2.260
Neg logit contributions
rolet-1.817
advertising-1.817
Attorney-1.796
VK-1.657
jury-1.628
Token455
Token ID30505
Feature activation-4.00e-02

Pos logit contributions
antha+1.724
iliated+1.638
 coerc+1.625
odo+1.603
urrent+1.583
Neg logit contributions
advertising-1.217
rolet-1.204
Attorney-1.199
VK-1.108
jury-1.081
Token x
Token ID2124
Feature activation-3.40e-01

Pos logit contributions
antha+0.125
iliated+0.117
 coerc+0.116
odo+0.110
urrent+0.110
Neg logit contributions
advertising-0.178
Attorney-0.177
rolet-0.175
VK-0.168
jury-0.165
Token.
Token ID13
Feature activation-2.47e-01

Pos logit contributions
antha+1.254
 coerc+1.184
iliated+1.180
odo+1.131
urrent+1.125
Neg logit contributions
advertising-1.343
Attorney-1.321
rolet-1.304
VK-1.238
jury-1.217
Token wrong
Token ID2642
Feature activation-5.94e-03

Pos logit contributions
advertising+0.379
Attorney+0.377
rolet+0.370
VK+0.351
jury+0.344
Neg logit contributions
antha-0.380
iliated-0.348
odo-0.336
urrent-0.335
 coerc-0.329
Token with
Token ID351
Feature activation+6.08e-01

Pos logit contributions
antha+0.023
iliated+0.021
odo+0.020
 coerc+0.020
urrent+0.020
Neg logit contributions
advertising-0.023
Attorney-0.022
rolet-0.022
VK-0.021
jury-0.020
Token the
Token ID262
Feature activation+4.00e-02

Pos logit contributions
advertising+2.227
Attorney+2.197
rolet+2.134
jury+2.056
VK+2.053
Neg logit contributions
antha-2.329
iliated-2.138
urrent-2.111
odo-2.106
irc-2.018
Token notion
Token ID9495
Feature activation-1.90e-03

Pos logit contributions
advertising+0.145
Attorney+0.145
rolet+0.144
VK+0.135
jury+0.134
Neg logit contributions
antha-0.156
iliated-0.145
odo-0.142
urrent-0.141
 coerc-0.139
Token of
Token ID286
Feature activation+3.76e-02

Pos logit contributions
antha+0.007
iliated+0.006
 coerc+0.006
odo+0.006
urrent+0.006
Neg logit contributions
advertising-0.008
rolet-0.007
Attorney-0.007
VK-0.007
jury-0.007
Token reason
Token ID1738
Feature activation+7.50e-02

Pos logit contributions
advertising+0.143
Attorney+0.142
rolet+0.142
jury+0.134
VK+0.133
Neg logit contributions
antha-0.141
iliated-0.131
urrent-0.128
odo-0.128
 coerc-0.124
Token and
Token ID290
Feature activation+1.61e-01

Pos logit contributions
advertising+0.289
Attorney+0.283
rolet+0.282
VK+0.263
jury+0.263
Neg logit contributions
antha-0.282
iliated-0.262
odo-0.254
 coerc-0.254
urrent-0.253
Token a
Token ID257
Feature activation+9.32e-03

Pos logit contributions
Attorney+0.575
advertising+0.570
rolet+0.567
jury+0.533
VK+0.529
Neg logit contributions
antha-0.652
iliated-0.601
odo-0.587
urrent-0.585
 coerc-0.574
Token different
Token ID1180
Feature activation+1.05e-01

Pos logit contributions
rolet+0.034
Attorney+0.034
advertising+0.034
VK+0.032
jury+0.031
Neg logit contributions
antha-0.036
iliated-0.034
odo-0.033
 coerc-0.033
urrent-0.033
Token voice
Token ID3809
Feature activation-4.82e-01

Pos logit contributions
Attorney+0.367
rolet+0.366
advertising+0.363
VK+0.342
jury+0.338
Neg logit contributions
antha-0.418
iliated-0.391
 coerc-0.385
urrent-0.384
odo-0.382
Token approach
Token ID3164
Feature activation+1.82e-03

Pos logit contributions
antha+1.754
iliated+1.644
 coerc+1.626
odo+1.599
urrent+1.577
Neg logit contributions
advertising-1.885
rolet-1.865
Attorney-1.864
VK-1.750
jury-1.719
Token denied
Token ID6699
Feature activation+3.48e-01

Pos logit contributions
advertising+0.617
Attorney+0.611
rolet+0.597
VK+0.574
jury+0.565
Neg logit contributions
antha-0.556
iliated-0.523
 coerc-0.512
odo-0.510
urrent-0.500
Token wrongdoing
Token ID25013
Feature activation-1.32e-01

Pos logit contributions
advertising+1.323
Attorney+1.303
rolet+1.281
VK+1.222
jury+1.211
Neg logit contributions
antha-1.306
iliated-1.226
 coerc-1.215
odo-1.182
urrent-1.177
Token.
Token ID13
Feature activation+4.27e-02

Pos logit contributions
antha+0.482
iliated+0.447
 coerc+0.442
odo+0.437
urrent+0.430
Neg logit contributions
advertising-0.520
Attorney-0.514
rolet-0.509
VK-0.477
jury-0.472
Token
Token ID198
Feature activation-1.50e-02

Pos logit contributions
advertising+0.166
Attorney+0.165
rolet+0.163
VK+0.153
jury+0.150
Neg logit contributions
antha-0.159
iliated-0.151
 coerc-0.149
odo-0.146
urrent-0.143
Token
Token ID198
Feature activation-6.53e-02

Pos logit contributions
antha+0.057
 coerc+0.054
iliated+0.054
odo+0.051
urrent+0.051
Neg logit contributions
advertising-0.058
Attorney-0.058
rolet-0.057
VK-0.054
jury-0.053
TokenPr
Token ID6836
Feature activation-3.26e-01

Pos logit contributions
antha+0.263
 coerc+0.249
iliated+0.249
odo+0.242
urrent+0.238
Neg logit contributions
advertising-0.235
Attorney-0.234
rolet-0.227
VK-0.216
jury-0.210
Tokenather
Token ID1032
Feature activation+9.09e-02

Pos logit contributions
antha+0.923
iliated+0.849
 coerc+0.846
odo+0.819
urrent+0.801
Neg logit contributions
advertising-1.539
rolet-1.522
Attorney-1.522
VK-1.445
jury-1.425
Token
Token ID447
Feature activation+3.50e-01

Pos logit contributions
advertising+0.365
Attorney+0.364
rolet+0.360
VK+0.339
jury+0.335
Neg logit contributions
antha-0.320
iliated-0.298
 coerc-0.296
odo-0.288
urrent-0.284
Token
Token ID247
Feature activation-5.77e-02

Pos logit contributions
advertising+1.538
Attorney+1.531
rolet+1.520
VK+1.446
jury+1.433
Neg logit contributions
antha-1.089
 coerc-1.006
iliated-1.004
odo-0.970
 Fargo-0.954
Tokens
Token ID82
Feature activation-1.49e-01

Pos logit contributions
antha+0.199
iliated+0.185
 coerc+0.184
odo+0.180
urrent+0.176
Neg logit contributions
advertising-0.237
Attorney-0.235
rolet-0.234
VK-0.220
jury-0.218
Token attorney
Token ID6136
Feature activation+2.05e-01

Pos logit contributions
antha+0.640
iliated+0.606
 coerc+0.601
odo+0.590
urrent+0.584
Neg logit contributions
advertising-0.488
Attorney-0.482
rolet-0.476
VK-0.443
jury-0.435
Token something
Token ID1223
Feature activation-3.46e-01

Pos logit contributions
advertising+1.080
rolet+1.073
Attorney+1.068
VK+1.015
jury+0.990
Neg logit contributions
antha-1.028
iliated-0.969
 coerc-0.947
odo-0.937
urrent-0.931
Token else
Token ID2073
Feature activation-1.43e-01

Pos logit contributions
antha+1.165
iliated+1.090
 coerc+1.076
odo+1.050
urrent+1.037
Neg logit contributions
advertising-1.455
rolet-1.436
Attorney-1.435
VK-1.360
jury-1.335
Token.
Token ID13
Feature activation+2.00e-01

Pos logit contributions
antha+0.522
iliated+0.493
 coerc+0.487
odo+0.475
urrent+0.469
Neg logit contributions
advertising-0.559
rolet-0.554
Attorney-0.552
VK-0.521
jury-0.509
Token
Token ID784
Feature activation-9.15e-02

Pos logit contributions
advertising+0.745
Attorney+0.740
rolet+0.735
VK+0.698
jury+0.680
Neg logit contributions
antha-0.757
iliated-0.726
 coerc-0.710
odo-0.693
urrent-0.691
TokenB
Token ID33
Feature activation+1.47e-01

Pos logit contributions
antha+0.389
iliated+0.372
 coerc+0.367
odo+0.361
urrent+0.358
Neg logit contributions
advertising-0.301
Attorney-0.298
rolet-0.296
VK-0.277
jury-0.270
TokenOM
Token ID2662
Feature activation-1.26e-01

Pos logit contributions
advertising+0.467
rolet+0.462
Attorney+0.458
VK+0.427
jury+0.416
Neg logit contributions
antha-0.645
iliated-0.613
 coerc-0.609
odo-0.593
urrent-0.591
TokenB
Token ID33
Feature activation+6.18e-02

Pos logit contributions
antha+0.504
iliated+0.476
 coerc+0.473
odo+0.463
urrent+0.458
Neg logit contributions
advertising-0.451
rolet-0.446
Attorney-0.445
VK-0.416
jury-0.407
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
advertising+0.237
rolet+0.234
Attorney+0.234
VK+0.221
jury+0.216
Neg logit contributions
antha-0.230
iliated-0.216
 coerc-0.214
odo-0.209
urrent-0.207
Token
Token ID198
Feature activation-6.13e-02

Pos logit contributions
advertising+0.045
Attorney+0.045
rolet+0.044
VK+0.042
jury+0.041
Neg logit contributions
antha-0.045
 coerc-0.043
iliated-0.042
odo-0.040
urrent-0.040
Token1
Token ID16
Feature activation+7.42e-02

Pos logit contributions
antha+0.268
iliated+0.255
 coerc+0.254
odo+0.247
urrent+0.244
Neg logit contributions
advertising-0.200
Attorney-0.196
rolet-0.192
VK-0.181
jury-0.177
Tokend
Token ID67
Feature activation-6.38e-02

Pos logit contributions
advertising+0.322
Attorney+0.318
rolet+0.317
VK+0.301
jury+0.297
Neg logit contributions
antha-0.237
iliated-0.223
 coerc-0.221
odo-0.214
urrent-0.211
Token").
Token ID11074
Feature activation+4.38e-02

Pos logit contributions
antha+0.012
iliated+0.011
 coerc+0.011
urrent+0.011
odo+0.011
Neg logit contributions
advertising-0.022
Attorney-0.022
rolet-0.022
VK-0.021
jury-0.021
Token Male
Token ID12674
Feature activation-2.29e-01

Pos logit contributions
advertising+0.162
Attorney+0.159
rolet+0.156
VK+0.150
jury+0.144
Neg logit contributions
antha-0.173
iliated-0.163
 coerc-0.160
odo-0.158
urrent-0.154
Token-
Token ID12
Feature activation-7.87e-02

Pos logit contributions
antha+0.880
iliated+0.820
 coerc+0.806
odo+0.801
urrent+0.791
Neg logit contributions
advertising-0.854
Attorney-0.839
rolet-0.836
VK-0.792
jury-0.774
Tokento
Token ID1462
Feature activation-1.11e-01

Pos logit contributions
antha+0.301
iliated+0.279
 coerc+0.279
odo+0.271
 Fargo+0.269
Neg logit contributions
advertising-0.298
Attorney-0.292
rolet-0.288
VK-0.275
jury-0.269
Token-
Token ID12
Feature activation-4.72e-01

Pos logit contributions
antha+0.437
iliated+0.412
 coerc+0.409
odo+0.399
urrent+0.396
Neg logit contributions
advertising-0.404
Attorney-0.397
rolet-0.396
VK-0.370
jury-0.364
Tokenfemale
Token ID24724
Feature activation-3.59e-01

Pos logit contributions
antha+1.864
iliated+1.759
 coerc+1.744
odo+1.714
urrent+1.691
Neg logit contributions
advertising-1.703
rolet-1.682
Attorney-1.680
VK-1.570
jury-1.537
Token (
Token ID357
Feature activation-4.26e-01

Pos logit contributions
antha+1.231
iliated+1.138
 coerc+1.111
odo+1.107
urrent+1.089
Neg logit contributions
advertising-1.490
Attorney-1.476
rolet-1.471
VK-1.398
jury-1.368
TokenM
Token ID44
Feature activation-2.85e-01

Pos logit contributions
antha+1.420
iliated+1.323
 coerc+1.310
odo+1.285
urrent+1.260
Neg logit contributions
advertising-1.805
Attorney-1.783
rolet-1.781
VK-1.688
jury-1.654
TokenTF
Token ID10234
Feature activation-4.18e-01

Pos logit contributions
antha+1.076
iliated+1.008
 coerc+1.004
odo+0.974
urrent+0.965
Neg logit contributions
advertising-1.086
rolet-1.072
Attorney-1.071
VK-1.017
jury-0.987
Token)
Token ID8
Feature activation-1.48e-01

Pos logit contributions
antha+1.537
iliated+1.461
 coerc+1.450
odo+1.420
urrent+1.398
Neg logit contributions
advertising-1.601
Attorney-1.585
rolet-1.583
VK-1.478
jury-1.449
Token transgender
Token ID10637
Feature activation-1.27e-01

Pos logit contributions
antha+0.482
iliated+0.439
 coerc+0.428
odo+0.427
urrent+0.420
Neg logit contributions
advertising-0.642
Attorney-0.634
rolet-0.632
VK-0.603
jury-0.590
Token least
Token ID1551
Feature activation+2.89e-03

Pos logit contributions
advertising+1.083
rolet+1.071
Attorney+1.071
VK+1.000
jury+0.993
Neg logit contributions
antha-0.776
iliated-0.723
 coerc-0.709
urrent-0.691
odo-0.686
Token 19
Token ID678
Feature activation-1.69e-01

Pos logit contributions
advertising+0.011
Attorney+0.011
rolet+0.011
VK+0.010
jury+0.010
Neg logit contributions
antha-0.011
iliated-0.010
 coerc-0.010
odo-0.010
urrent-0.010
Token states
Token ID2585
Feature activation-3.14e-02

Pos logit contributions
antha+0.749
iliated+0.707
 coerc+0.705
odo+0.695
urrent+0.685
Neg logit contributions
advertising-0.531
Attorney-0.524
rolet-0.519
VK-0.479
jury-0.471
Token or
Token ID393
Feature activation+2.65e-01

Pos logit contributions
antha+0.118
iliated+0.110
 coerc+0.108
odo+0.108
urrent+0.107
Neg logit contributions
advertising-0.120
Attorney-0.119
rolet-0.118
VK-0.110
jury-0.109
Token territories
Token ID16771
Feature activation+4.04e-01

Pos logit contributions
Attorney+0.965
rolet+0.956
advertising+0.955
VK+0.888
jury+0.876
Neg logit contributions
antha-1.025
odo-0.944
 coerc-0.936
iliated-0.936
urrent-0.927
Token,
Token ID11
Feature activation-2.09e-01

Pos logit contributions
Attorney+1.472
advertising+1.466
rolet+1.453
VK+1.337
jury+1.331
Neg logit contributions
antha-1.571
iliated-1.456
odo-1.445
 coerc-1.438
urrent-1.436
Token did
Token ID750
Feature activation+2.82e-01

Pos logit contributions
antha+0.744
iliated+0.700
 coerc+0.681
odo+0.677
urrent+0.670
Neg logit contributions
advertising-0.832
Attorney-0.829
rolet-0.821
VK-0.767
jury-0.762
Token not
Token ID407
Feature activation-9.31e-02

Pos logit contributions
advertising+0.855
Attorney+0.855
rolet+0.854
VK+0.777
jury+0.763
Neg logit contributions
antha-1.268
iliated-1.210
 coerc-1.192
odo-1.173
urrent-1.170
Token immediately
Token ID3393
Feature activation-3.88e-01

Pos logit contributions
antha+0.400
iliated+0.376
 coerc+0.368
odo+0.364
urrent+0.364
Neg logit contributions
advertising-0.304
rolet-0.303
Attorney-0.302
VK-0.278
jury-0.274
Token respond
Token ID3031
Feature activation-1.97e-01

Pos logit contributions
antha+1.539
iliated+1.453
 coerc+1.440
odo+1.416
urrent+1.397
Neg logit contributions
advertising-1.390
rolet-1.375
Attorney-1.373
VK-1.282
jury-1.255
Token to
Token ID284
Feature activation+1.43e-01

Pos logit contributions
antha+0.565
iliated+0.523
 coerc+0.516
odo+0.506
urrent+0.494
Neg logit contributions
advertising-0.919
rolet-0.909
Attorney-0.906
VK-0.860
jury-0.849
Token employment
Token ID7184
Feature activation+3.15e-01

Pos logit contributions
antha+0.586
iliated+0.551
 coerc+0.538
odo+0.536
urrent+0.528
Neg logit contributions
advertising-0.574
Attorney-0.573
rolet-0.567
VK-0.531
jury-0.523
Token paperwork
Token ID23143
Feature activation+2.69e-02

Pos logit contributions
advertising+0.974
Attorney+0.956
rolet+0.925
VK+0.873
jury+0.852
Neg logit contributions
antha-1.415
iliated-1.334
 coerc-1.306
odo-1.305
urrent-1.300
Token,
Token ID11
Feature activation-1.09e-01

Pos logit contributions
advertising+0.100
Attorney+0.100
rolet+0.098
VK+0.093
jury+0.091
Neg logit contributions
antha-0.102
iliated-0.096
 coerc-0.095
odo-0.094
urrent-0.093
Token or
Token ID393
Feature activation-1.77e-03

Pos logit contributions
antha+0.438
iliated+0.413
 coerc+0.404
odo+0.401
urrent+0.396
Neg logit contributions
advertising-0.389
Attorney-0.384
rolet-0.382
VK-0.356
jury-0.353
Token evidence
Token ID2370
Feature activation-3.69e-01

Pos logit contributions
antha+0.007
iliated+0.007
 coerc+0.006
odo+0.006
urrent+0.006
Neg logit contributions
advertising-0.006
Attorney-0.006
rolet-0.006
VK-0.006
jury-0.006
Token of
Token ID286
Feature activation-2.69e-01

Pos logit contributions
antha+1.315
iliated+1.229
 coerc+1.212
odo+1.193
urrent+1.178
Neg logit contributions
advertising-1.484
Attorney-1.467
rolet-1.455
VK-1.378
jury-1.355
Token appointments
Token ID23976
Feature activation+5.16e-01

Pos logit contributions
antha+1.045
iliated+0.973
 coerc+0.960
odo+0.952
urrent+0.938
Neg logit contributions
advertising-0.997
Attorney-0.985
rolet-0.980
VK-0.920
jury-0.910
Token for
Token ID329
Feature activation+2.19e-01

Pos logit contributions
advertising+1.936
Attorney+1.927
rolet+1.862
VK+1.801
jury+1.765
Neg logit contributions
antha-1.929
iliated-1.802
 coerc-1.746
odo-1.731
urrent-1.727
Token those
Token ID883
Feature activation-8.33e-02

Pos logit contributions
Attorney+0.801
advertising+0.795
rolet+0.776
jury+0.737
VK+0.735
Neg logit contributions
antha-0.865
iliated-0.797
 coerc-0.785
odo-0.782
urrent-0.767
Token who
Token ID508
Feature activation-4.60e-01

Pos logit contributions
antha+0.355
iliated+0.333
odo+0.330
 coerc+0.327
urrent+0.322
Neg logit contributions
Attorney-0.275
advertising-0.274
rolet-0.269
VK-0.251
jury-0.247
Token do
Token ID466
Feature activation-1.13e-01

Pos logit contributions
antha+1.742
iliated+1.639
 coerc+1.610
odo+1.585
urrent+1.570
Neg logit contributions
rolet-1.727
advertising-1.725
Attorney-1.721
VK-1.606
jury-1.581
Tokening
Token ID278
Feature activation-4.89e-02

Pos logit contributions
advertising+1.054
Attorney+1.025
rolet+1.022
VK+0.965
jury+0.955
Neg logit contributions
antha-0.963
iliated-0.908
 coerc-0.893
odo-0.868
urrent-0.847
Token on
Token ID319
Feature activation-1.27e-01

Pos logit contributions
antha+0.186
iliated+0.175
 coerc+0.170
odo+0.170
urrent+0.167
Neg logit contributions
advertising-0.184
rolet-0.180
Attorney-0.180
VK-0.170
jury-0.169
Token the
Token ID262
Feature activation-6.13e-02

Pos logit contributions
antha+0.481
iliated+0.453
 coerc+0.440
odo+0.437
urrent+0.433
Neg logit contributions
advertising-0.480
rolet-0.468
Attorney-0.467
VK-0.444
jury-0.439
Token pure
Token ID5899
Feature activation-1.47e-01

Pos logit contributions
antha+0.222
iliated+0.208
odo+0.201
 coerc+0.201
urrent+0.198
Neg logit contributions
advertising-0.241
rolet-0.236
Attorney-0.235
VK-0.224
jury-0.220
Token unf
Token ID3684
Feature activation-4.41e-02

Pos logit contributions
antha+0.522
iliated+0.494
 coerc+0.483
odo+0.476
urrent+0.472
Neg logit contributions
advertising-0.583
rolet-0.574
Attorney-0.571
VK-0.543
jury-0.535
Tokenil
Token ID346
Feature activation-8.36e-01

Pos logit contributions
antha+0.169
 coerc+0.159
iliated+0.156
 Fargo+0.154
odo+0.154
Neg logit contributions
advertising-0.166
rolet-0.162
Attorney-0.160
VK-0.151
jury-0.150
Tokentered
Token ID4400
Feature activation-3.49e-02

Pos logit contributions
antha+2.225
iliated+2.024
 coerc+2.010
odo+1.955
urrent+1.912
Neg logit contributions
advertising-4.116
Attorney-4.058
rolet-4.053
VK-3.861
jury-3.813
Token visual
Token ID5874
Feature activation+2.50e-02

Pos logit contributions
antha+0.128
iliated+0.123
 coerc+0.119
odo+0.118
urrent+0.117
Neg logit contributions
advertising-0.134
Attorney-0.131
rolet-0.130
VK-0.125
jury-0.121
Token emotions
Token ID10825
Feature activation+1.25e-01

Pos logit contributions
advertising+0.098
Attorney+0.094
rolet+0.092
VK+0.089
jury+0.088
Neg logit contributions
antha-0.091
iliated-0.087
 coerc-0.085
odo-0.084
urrent-0.083
Token,
Token ID11
Feature activation-6.16e-02

Pos logit contributions
advertising+0.462
Attorney+0.448
rolet+0.448
VK+0.423
jury+0.414
Neg logit contributions
antha-0.483
iliated-0.456
 coerc-0.445
odo-0.442
urrent-0.436
Token Tak
Token ID15804
Feature activation-1.95e-01

Pos logit contributions
antha+0.214
iliated+0.205
 coerc+0.199
odo+0.195
urrent+0.194
Neg logit contributions
advertising-0.252
Attorney-0.242
rolet-0.241
VK-0.231
jury-0.226
Token
Token ID198
Feature activation-1.38e-01

Pos logit contributions
antha+0.998
 coerc+0.943
iliated+0.940
odo+0.899
urrent+0.891
Neg logit contributions
advertising-1.081
Attorney-1.068
rolet-1.054
VK-1.000
jury-0.982
TokenLes
Token ID35882
Feature activation-3.44e-01

Pos logit contributions
antha+0.550
 coerc+0.519
iliated+0.519
odo+0.506
urrent+0.496
Neg logit contributions
advertising-0.503
Attorney-0.498
rolet-0.485
VK-0.463
jury-0.449
Token Imp
Token ID9855
Feature activation-4.75e-01

Pos logit contributions
antha+1.429
iliated+1.356
 coerc+1.343
odo+1.322
urrent+1.311
Neg logit contributions
advertising-1.173
rolet-1.158
Attorney-1.154
VK-1.072
jury-1.054
Tokenrompt
Token ID45700
Feature activation-9.13e-02

Pos logit contributions
antha+1.700
iliated+1.595
 coerc+1.582
odo+1.550
urrent+1.526
Neg logit contributions
advertising-1.884
rolet-1.865
Attorney-1.860
VK-1.750
jury-1.716
Tokenus
Token ID385
Feature activation+2.78e-01

Pos logit contributions
antha+0.431
iliated+0.413
 coerc+0.408
odo+0.399
 Fargo+0.398
Neg logit contributions
advertising-0.255
rolet-0.255
Attorney-0.251
VK-0.228
jury-0.222
Token de
Token ID390
Feature activation-9.12e-01

Pos logit contributions
advertising+1.062
Attorney+1.061
rolet+1.056
VK+0.985
jury+0.973
Neg logit contributions
antha-1.023
iliated-0.961
 coerc-0.931
odo-0.920
urrent-0.911
Token Lac
Token ID23115
Feature activation-1.93e-01

Pos logit contributions
antha+3.220
 coerc+3.017
iliated+3.009
odo+2.948
urrent+2.890
Neg logit contributions
advertising-3.646
rolet-3.591
Attorney-3.587
VK-3.383
jury-3.307
Tokenan
Token ID272
Feature activation+2.02e-01

Pos logit contributions
antha+0.587
iliated+0.548
 coerc+0.542
odo+0.528
urrent+0.517
Neg logit contributions
advertising-0.862
rolet-0.859
Attorney-0.854
VK-0.806
jury-0.797
Token
Token ID198
Feature activation+9.37e-02

Pos logit contributions
advertising+0.773
Attorney+0.772
rolet+0.771
VK+0.714
jury+0.714
Neg logit contributions
antha-0.738
iliated-0.698
 coerc-0.676
odo-0.663
urrent-0.663
Token
Token ID198
Feature activation+1.45e-01

Pos logit contributions
advertising+0.359
Attorney+0.354
rolet+0.343
VK+0.328
jury+0.323
Neg logit contributions
antha-0.364
 coerc-0.348
iliated-0.345
odo-0.326
urrent-0.325
TokenThis
Token ID1212
Feature activation+3.40e-01

Pos logit contributions
advertising+0.534
Attorney+0.530
rolet+0.508
VK+0.489
jury+0.474
Neg logit contributions
antha-0.578
 coerc-0.545
iliated-0.541
odo-0.529
 Fargo-0.521
Token will
Token ID481
Feature activation-2.60e-01

Pos logit contributions
advertising+0.415
Attorney+0.406
rolet+0.403
VK+0.377
jury+0.374
Neg logit contributions
antha-0.421
 coerc-0.398
iliated-0.396
odo-0.386
urrent-0.381
Token keep
Token ID1394
Feature activation+1.19e-01

Pos logit contributions
antha+0.975
iliated+0.917
 coerc+0.905
odo+0.889
urrent+0.878
Neg logit contributions
advertising-0.994
rolet-0.978
Attorney-0.977
VK-0.911
jury-0.902
Token being
Token ID852
Feature activation+2.11e-01

Pos logit contributions
advertising+0.459
Attorney+0.453
rolet+0.447
VK+0.424
jury+0.414
Neg logit contributions
antha-0.440
iliated-0.413
 coerc-0.405
odo-0.393
urrent-0.392
Token cited
Token ID9181
Feature activation+1.10e-01

Pos logit contributions
advertising+0.808
Attorney+0.796
rolet+0.785
VK+0.744
jury+0.737
Neg logit contributions
antha-0.789
iliated-0.740
 coerc-0.713
odo-0.709
urrent-0.702
Token further
Token ID2252
Feature activation+1.49e-02

Pos logit contributions
advertising+0.422
rolet+0.415
Attorney+0.415
VK+0.387
jury+0.386
Neg logit contributions
antha-0.406
iliated-0.382
 coerc-0.370
odo-0.370
urrent-0.364
Token right
Token ID826
Feature activation-8.17e-01

Pos logit contributions
advertising+0.056
Attorney+0.055
rolet+0.054
VK+0.051
jury+0.051
Neg logit contributions
antha-0.057
iliated-0.054
urrent-0.052
 coerc-0.052
odo-0.052
Token on
Token ID319
Feature activation+8.95e-03

Pos logit contributions
antha+2.818
iliated+2.636
 coerc+2.614
odo+2.564
urrent+2.517
Neg logit contributions
advertising-3.346
rolet-3.313
Attorney-3.304
VK-3.120
jury-3.058
Token the
Token ID262
Feature activation-4.56e-02

Pos logit contributions
advertising+0.033
Attorney+0.032
rolet+0.032
VK+0.030
jury+0.030
Neg logit contributions
antha-0.035
iliated-0.033
 coerc-0.032
odo-0.032
urrent-0.031
Token political
Token ID1964
Feature activation+9.32e-02

Pos logit contributions
antha+0.169
iliated+0.159
 coerc+0.156
odo+0.154
urrent+0.152
Neg logit contributions
advertising-0.175
Attorney-0.173
rolet-0.173
VK-0.162
jury-0.159
Token spectrum
Token ID10958
Feature activation-2.78e-01

Pos logit contributions
advertising+0.339
Attorney+0.333
rolet+0.332
jury+0.310
VK+0.308
Neg logit contributions
antha-0.365
iliated-0.343
 coerc-0.339
odo-0.335
urrent-0.333
Token to
Token ID284
Feature activation-2.26e-01

Pos logit contributions
antha+1.024
iliated+0.961
 coerc+0.946
odo+0.936
urrent+0.925
Neg logit contributions
advertising-1.083
Attorney-1.070
rolet-1.069
VK-0.999
jury-0.990
Token on
Token ID319
Feature activation+1.53e-01

Pos logit contributions
antha+0.677
iliated+0.635
 coerc+0.629
odo+0.617
urrent+0.608
Neg logit contributions
advertising-0.714
Attorney-0.697
rolet-0.696
VK-0.654
jury-0.647
Token Al
Token ID978
Feature activation-1.43e-01

Pos logit contributions
advertising+0.645
Attorney+0.637
rolet+0.633
VK+0.603
jury+0.602
Neg logit contributions
antha-0.515
iliated-0.479
 coerc-0.478
odo-0.463
urrent-0.456
Tokencat
Token ID9246
Feature activation+4.24e-02

Pos logit contributions
antha+0.775
 coerc+0.748
iliated+0.745
odo+0.725
urrent+0.725
Neg logit contributions
advertising-0.312
rolet-0.302
Attorney-0.300
jury-0.264
VK-0.262
Tokenraz
Token ID3247
Feature activation+2.03e-01

Pos logit contributions
advertising+0.100
rolet+0.098
Attorney+0.097
jury+0.086
VK+0.086
Neg logit contributions
antha-0.221
 coerc-0.213
iliated-0.210
odo-0.206
 Fargo-0.205
Token.
Token ID13
Feature activation+7.11e-03

Pos logit contributions
advertising+0.777
Attorney+0.768
rolet+0.766
VK+0.719
jury+0.709
Neg logit contributions
antha-0.753
iliated-0.709
 coerc-0.704
odo-0.688
urrent-0.678
Token Only
Token ID5514
Feature activation-9.37e-01

Pos logit contributions
advertising+0.028
Attorney+0.027
rolet+0.027
VK+0.026
jury+0.025
Neg logit contributions
antha-0.027
iliated-0.025
 coerc-0.025
odo-0.024
urrent-0.024
Token the
Token ID262
Feature activation-1.95e-01

Pos logit contributions
antha+3.361
iliated+3.178
 coerc+3.133
odo+3.086
urrent+3.048
Neg logit contributions
advertising-3.657
rolet-3.621
Attorney-3.610
VK-3.386
jury-3.296
Token beginning
Token ID3726
Feature activation-2.28e-01

Pos logit contributions
antha+0.706
iliated+0.664
 coerc+0.658
odo+0.643
urrent+0.634
Neg logit contributions
advertising-0.767
Attorney-0.760
rolet-0.758
VK-0.714
jury-0.709
Token.
Token ID13
Feature activation+5.08e-02

Pos logit contributions
antha+0.830
iliated+0.776
 coerc+0.770
odo+0.753
urrent+0.743
Neg logit contributions
advertising-0.900
Attorney-0.890
rolet-0.889
VK-0.834
jury-0.821
Token
Token ID198
Feature activation+8.64e-02

Pos logit contributions
advertising+0.197
Attorney+0.196
rolet+0.195
VK+0.183
jury+0.180
Neg logit contributions
antha-0.189
iliated-0.179
 coerc-0.179
odo-0.172
urrent-0.169
Token
Token ID198
Feature activation+4.25e-02

Pos logit contributions
advertising+0.336
Attorney+0.333
rolet+0.326
VK+0.310
jury+0.306
Neg logit contributions
antha-0.325
 coerc-0.311
iliated-0.307
odo-0.292
urrent-0.290
Token.
Token ID13
Feature activation-1.73e-01

Pos logit contributions
antha+0.214
iliated+0.202
 coerc+0.199
odo+0.195
urrent+0.194
Neg logit contributions
rolet-0.232
advertising-0.232
Attorney-0.231
VK-0.217
jury-0.214
Token
Token ID198
Feature activation-2.58e-02

Pos logit contributions
antha+0.647
iliated+0.612
 coerc+0.608
odo+0.589
urrent+0.583
Neg logit contributions
advertising-0.663
Attorney-0.660
rolet-0.655
VK-0.617
jury-0.606
Token
Token ID198
Feature activation-7.22e-02

Pos logit contributions
antha+0.097
 coerc+0.092
iliated+0.091
odo+0.087
urrent+0.086
Neg logit contributions
advertising-0.101
Attorney-0.100
rolet-0.098
VK-0.093
jury-0.091
Token
Token ID447
Feature activation-2.57e-01

Pos logit contributions
antha+0.283
 coerc+0.268
iliated+0.267
odo+0.258
urrent+0.254
Neg logit contributions
advertising-0.268
Attorney-0.268
rolet-0.260
VK-0.249
jury-0.242
Token
Token ID250
Feature activation-2.24e-01

Pos logit contributions
antha+0.975
iliated+0.916
 coerc+0.911
odo+0.893
urrent+0.879
Neg logit contributions
advertising-0.967
Attorney-0.957
rolet-0.956
VK-0.895
jury-0.879
TokenOnly
Token ID10049
Feature activation-9.36e-01

Pos logit contributions
antha+0.871
iliated+0.821
 coerc+0.816
odo+0.796
urrent+0.786
Neg logit contributions
advertising-0.834
Attorney-0.828
rolet-0.818
VK-0.771
jury-0.755
Token ID
Token ID4522
Feature activation-6.84e-01

Pos logit contributions
antha+3.409
iliated+3.205
 coerc+3.177
odo+3.119
urrent+3.069
Neg logit contributions
advertising-3.654
rolet-3.615
Attorney-3.606
VK-3.390
jury-3.321
Token I
Token ID314
Feature activation+3.81e-01

Pos logit contributions
antha+2.372
iliated+2.221
 coerc+2.200
odo+2.157
urrent+2.123
Neg logit contributions
advertising-2.791
rolet-2.763
Attorney-2.758
VK-2.599
jury-2.551
Token got
Token ID1392
Feature activation+8.67e-02

Pos logit contributions
rolet+1.485
Attorney+1.475
advertising+1.474
VK+1.385
jury+1.359
Neg logit contributions
antha-1.398
iliated-1.300
 coerc-1.296
odo-1.229
urrent-1.228
Token is
Token ID318
Feature activation+2.03e-01

Pos logit contributions
advertising+0.340
Attorney+0.339
rolet+0.336
VK+0.318
jury+0.312
Neg logit contributions
antha-0.314
iliated-0.292
urrent-0.286
 coerc-0.286
odo-0.281
Token a
Token ID257
Feature activation+1.92e-01

Pos logit contributions
Attorney+0.773
advertising+0.773
rolet+0.765
VK+0.722
jury+0.711
Neg logit contributions
antha-0.753
iliated-0.702
 coerc-0.694
odo-0.680
urrent-0.678