TOP ACTIVATIONS
MAX = 3.761

party abandoned President Dil ma Rousse ff s governing
tremb led . <|endoftext|> Al ain B ad i ou The
an alternative to The Ad vertis er , a different voice
to funding , requiring F annie Mae and Freddie Mac to
for Don Giovanni and La Boh è me , visits to
2008 and his then agent Bruno Sat in at IM G
government of President Dil ma Rousse ff and no one in
Ukraine . The BBC 's Rupert Wing field - Hay es
intend to give third driver Bruno S enna some time in
Manhattan set to the puls ating tune of George G
by Reddit interim CEO Ellen P ao and other company executives
and his $ 20 - million cap hold . A cap
return of Brian ( Al ain u ' uese ) and
in which she discussed Michel Fou c ault s
ACL sp rain . Brian Bab ine au / NBA E
glowing like a massive , puls ating white orb , a
Brown - b agged lun ches in hand ,
Brazil ian President Dil ma Rousse ff and Cuban President R
you end up in Abu Dhabi , the last race .
FR ANK M ORD ICA , V

MIN ACTIVATIONS
MIN = -3.985

000 students . The Legislature unc apped enrollment as part of
Nigel Farage : " Lord Hes elt ine fears what the
against the chest of whatever unc aring god allowed reality to
which distributions support which " universal " package format . Snap
said He Sh unc ai , a 34 -
, m AU , no ic est , The V ow
Profile Blog Joined December 2010 TL AD T 235 21 Posts
not at all unlikely that Xer xes , who had probably
that it is not as universal of an I PC primitive
State of Oregon , the IRS , and a charitable foundation
, ' The Brotherhood of Eternal Love : From Flower Power
good New Jersey boy , DeV ito pronoun ces it
the Italian Army had been intern ed by German forces in
others who think that Lord Hes elt ine has a point
' While Lord Hes elt ine , a former
of teeth from a tr icer at ops . ( FS
clay , survives from approximately Xer xes era ;
essential Czech hero , a Zel ig who trek ked to
Vegas . Ren aud Lav o ie (@ ren lav
's science focus is no easy task , says Young ah

INTERVAL 1.825 - 3.761
CONTAINS 0.438%

. Cat ast rophic insurance adjusting is not an
40 ] C ob ain immersed himself in artistic
Last week the Prime Minister set out plans for Britain
already has access to living brains . To study both the
treatment of stress - related disorders including major depressive and anxiety

INTERVAL -0.112 - 1.825
CONTAINS 58.103%

of unilateral independence in the Catalan Parliament has several readings .
100 helium - filled balloons , each carrying a voucher entit
F ol lett to enable Academic Search Premier and Master
, but drawing conclusions based on them wouldn t
ars and close party leaders and workers of Sh . Bh

INTERVAL -2.049 - -0.112
CONTAINS 41.248%

it felt like a place I had gotten to know .
up of a bare shoulder blade , and a sequence in
in the basement and out of view .
, cease to care . On the other hand
a conversation piece . He says he

INTERVAL -3.985 - -2.049
CONTAINS 0.211%

around ." Martin Sor ice , who owns several
Despite these shortcomings , the SI unit of electrom otive force
of teeth from a tr icer at ops . ( FS
that the reference to Sh av iv ( 2008 ) should
actually unacceptable , how non chal antly some say that it
Token party
Token ID2151
Feature activation+4.92e-01

Pos logit contributions
gments+1.115
 synd+1.107
sts+1.028
��+1.004
vertisements+0.990
Neg logit contributions
ctor-1.344
imates-1.222
 WARN-1.194
 Mehran-1.152
kel-1.138
Token abandoned
Token ID9958
Feature activation+4.27e-02

Pos logit contributions
gments+0.874
 synd+0.873
sts+0.808
��+0.789
vertisements+0.779
Neg logit contributions
ctor-1.006
imates-0.913
 WARN-0.891
 Mehran-0.858
kel-0.850
Token President
Token ID1992
Feature activation+3.37e-01

Pos logit contributions
gments+0.076
 synd+0.075
sts+0.070
��+0.069
vertisements+0.068
Neg logit contributions
ctor-0.087
imates-0.079
 WARN-0.077
 Mehran-0.075
kel-0.074
Token Dil
Token ID29328
Feature activation+7.15e-01

Pos logit contributions
gments+0.586
 synd+0.585
sts+0.541
��+0.529
vertisements+0.521
Neg logit contributions
ctor-0.702
imates-0.638
 WARN-0.623
 Mehran-0.601
kel-0.595
Tokenma
Token ID2611
Feature activation+1.08e+00

Pos logit contributions
 synd+1.470
gments+1.464
sts+1.368
��+1.355
vertisements+1.332
Neg logit contributions
ctor-1.276
imates-1.131
 WARN-1.085
kel-1.049
 Mehran-1.048
Token Rousse
Token ID42849
Feature activation+3.76e+00

Pos logit contributions
gments+2.237
 synd+2.187
sts+2.063
��+2.035
vertisements+2.018
Neg logit contributions
ctor-1.945
imates-1.726
 WARN-1.689
 Mehran-1.629
kel-1.614
Tokenff
Token ID487
Feature activation+3.62e-02

Pos logit contributions
gments+5.312
 synd+5.301
sts+5.071
gements+4.664
��+4.472
Neg logit contributions
ctor-8.556
imates-8.276
 Mehran-8.019
 WARN-7.992
STD-7.542
Token
Token ID447
Feature activation+1.81e-01

Pos logit contributions
gments+0.065
 synd+0.065
sts+0.060
��+0.059
vertisements+0.058
Neg logit contributions
ctor-0.073
imates-0.066
 WARN-0.065
 Mehran-0.062
kel-0.062
Token
Token ID247
Feature activation-3.11e-01

Pos logit contributions
 synd+0.231
gments+0.231
sts+0.207
��+0.197
vertisements+0.196
Neg logit contributions
ctor-0.460
imates-0.426
 WARN-0.417
 Mehran-0.405
kel-0.403
Tokens
Token ID82
Feature activation+8.20e-02

Pos logit contributions
ctor+0.566
imates+0.508
 WARN+0.493
 Mehran+0.469
kel+0.468
Neg logit contributions
gments-0.623
 synd-0.621
sts-0.577
��-0.572
vertisements-0.564
Token governing
Token ID15030
Feature activation+5.63e-01

Pos logit contributions
gments+0.142
 synd+0.141
sts+0.131
��+0.128
vertisements+0.126
Neg logit contributions
ctor-0.172
imates-0.156
 WARN-0.152
 Mehran-0.147
kel-0.146
Token tremb
Token ID29868
Feature activation+1.25e+00

Pos logit contributions
ctor+0.310
imates+0.281
 WARN+0.275
 Mehran+0.264
kel+0.263
Neg logit contributions
gments-0.262
 synd-0.261
sts-0.241
��-0.237
vertisements-0.233
Tokenled
Token ID992
Feature activation-2.56e-01

Pos logit contributions
gments+1.274
 synd+1.264
sts+1.107
��+1.042
gements+1.024
Neg logit contributions
ctor-3.481
imates-3.251
 WARN-3.192
 Mehran-3.110
kel-3.073
Token.
Token ID13
Feature activation-3.44e-02

Pos logit contributions
ctor+0.522
imates+0.474
 WARN+0.462
 Mehran+0.445
kel+0.442
Neg logit contributions
gments-0.459
 synd-0.456
sts-0.422
��-0.414
vertisements-0.410
Token<|endoftext|>
Token ID50256
Feature activation+3.10e-01

Pos logit contributions
ctor+0.066
imates+0.060
 WARN+0.058
 Mehran+0.056
kel+0.055
Neg logit contributions
gments-0.065
 synd-0.065
sts-0.061
��-0.060
vertisements-0.059
TokenAl
Token ID2348
Feature activation-2.73e-01

Pos logit contributions
gments+0.629
 synd+0.627
sts+0.588
��+0.575
vertisements+0.568
Neg logit contributions
ctor-0.554
imates-0.496
 WARN-0.483
 Mehran-0.462
kel-0.455
Tokenain
Token ID391
Feature activation+3.51e+00

Pos logit contributions
ctor+0.530
imates+0.477
 WARN+0.464
 Mehran+0.445
kel+0.444
Neg logit contributions
 synd-0.517
gments-0.515
sts-0.478
��-0.471
vertisements-0.462
Token B
Token ID347
Feature activation+2.28e+00

Pos logit contributions
 synd+5.670
gments+5.660
sts+5.168
gements+4.981
��+4.900
Neg logit contributions
ctor-7.283
imates-6.823
 WARN-6.553
kel-6.408
 Mehran-6.297
Token ad
Token ID512
Feature activation+3.82e-01

Pos logit contributions
gments+4.541
 synd+4.491
sts+4.264
��+4.098
gements+4.078
Neg logit contributions
ctor-4.072
imates-3.624
 WARN-3.609
 Mehran-3.422
kel-3.357
Tokeni
Token ID72
Feature activation-5.44e-01

Pos logit contributions
gments+0.712
 synd+0.710
sts+0.660
��+0.645
vertisements+0.637
Neg logit contributions
ctor-0.749
imates-0.677
 WARN-0.659
 Mehran-0.635
kel-0.628
Tokenou
Token ID280
Feature activation+8.31e-01

Pos logit contributions
ctor+1.088
imates+0.986
 WARN+0.956
 Mehran+0.931
kel+0.917
Neg logit contributions
gments-0.996
 synd-0.986
sts-0.916
��-0.907
gements-0.884
Token The
Token ID383
Feature activation-7.76e-02

Pos logit contributions
gments+1.521
 synd+1.519
sts+1.413
��+1.371
vertisements+1.360
Neg logit contributions
ctor-1.648
imates-1.494
 WARN-1.456
 Mehran-1.402
kel-1.383
Token an
Token ID281
Feature activation-6.23e-01

Pos logit contributions
ctor+1.697
imates+1.561
 WARN+1.473
kel+1.455
 Mehran+1.436
Neg logit contributions
gments-1.577
 synd-1.548
sts-1.486
��-1.445
gements-1.407
Token alternative
Token ID5559
Feature activation+1.04e+00

Pos logit contributions
ctor+1.300
imates+1.181
 WARN+1.128
 Mehran+1.108
pei+1.071
Neg logit contributions
gments-1.086
 synd-1.066
sts-1.003
��-0.977
vertisements-0.959
Token to
Token ID284
Feature activation+1.52e-01

Pos logit contributions
 synd+1.697
gments+1.691
sts+1.549
��+1.502
vertisements+1.487
Neg logit contributions
ctor-2.283
imates-2.088
 WARN-2.045
 Mehran-1.969
kel-1.953
Token The
Token ID383
Feature activation-7.37e-02

Pos logit contributions
 synd+0.262
gments+0.257
sts+0.239
��+0.232
vertisements+0.230
Neg logit contributions
ctor-0.320
imates-0.295
 WARN-0.290
 Mehran-0.279
kel-0.272
Token Ad
Token ID1215
Feature activation+4.56e-01

Pos logit contributions
ctor+0.171
imates+0.156
 WARN+0.152
 Mehran+0.148
kel+0.147
Neg logit contributions
gments-0.112
 synd-0.111
sts-0.102
��-0.100
vertisements-0.097
Tokenvertis
Token ID3346
Feature activation+3.43e+00

Pos logit contributions
gments+0.207
 synd+0.203
sts+0.145
��+0.129
vertisements+0.115
Neg logit contributions
ctor-1.534
imates-1.451
 WARN-1.427
 Mehran-1.397
kel-1.392
Tokener
Token ID263
Feature activation-4.26e-02

Pos logit contributions
gments+3.431
 synd+3.416
sts+3.039
gements+2.741
��+2.713
Neg logit contributions
ctor-9.493
 WARN-8.916
imates-8.905
 Mehran-8.645
pei-8.422
Token,
Token ID11
Feature activation+2.52e-01

Pos logit contributions
ctor+0.085
imates+0.077
 WARN+0.074
kel+0.072
 Mehran+0.072
Neg logit contributions
gments-0.078
 synd-0.076
sts-0.071
��-0.070
gements-0.069
Token a
Token ID257
Feature activation+5.65e-02

Pos logit contributions
gments+0.459
 synd+0.455
sts+0.431
��+0.418
vertisements+0.414
Neg logit contributions
ctor-0.503
imates-0.452
 WARN-0.441
 Mehran-0.428
kel-0.421
Token different
Token ID1180
Feature activation+8.68e-01

Pos logit contributions
gments+0.068
 synd+0.068
sts+0.061
��+0.059
vertisements+0.057
Neg logit contributions
ctor-0.147
imates-0.137
 WARN-0.134
 Mehran-0.131
kel-0.130
Token voice
Token ID3809
Feature activation-4.00e-02

Pos logit contributions
gments+1.550
 synd+1.540
sts+1.435
��+1.400
vertisements+1.385
Neg logit contributions
ctor-1.769
imates-1.606
 WARN-1.565
 Mehran-1.512
kel-1.496
Token to
Token ID284
Feature activation+4.92e-01

Pos logit contributions
ctor+1.349
imates+1.230
 WARN+1.200
 Mehran+1.157
kel+1.148
Neg logit contributions
gments-1.068
 synd-1.062
sts-0.979
��-0.963
vertisements-0.944
Token funding
Token ID4918
Feature activation+4.24e-01

Pos logit contributions
 synd+0.887
gments+0.880
sts+0.819
��+0.792
vertisements+0.784
Neg logit contributions
ctor-0.995
imates-0.902
 WARN-0.871
 Mehran-0.846
kel-0.836
Token,
Token ID11
Feature activation+6.01e-01

Pos logit contributions
gments+0.725
 synd+0.724
sts+0.669
��+0.649
vertisements+0.642
Neg logit contributions
ctor-0.894
imates-0.814
 WARN-0.795
 Mehran-0.767
kel-0.759
Token requiring
Token ID10616
Feature activation+7.33e-02

Pos logit contributions
gments+1.045
 synd+1.043
sts+0.964
��+0.940
vertisements+0.928
Neg logit contributions
ctor-1.252
imates-1.139
 WARN-1.111
 Mehran-1.072
kel-1.061
Token F
Token ID376
Feature activation+7.85e-01

Pos logit contributions
gments+0.126
 synd+0.126
sts+0.116
��+0.113
vertisements+0.112
Neg logit contributions
ctor-0.154
imates-0.140
 WARN-0.137
 Mehran-0.132
kel-0.131
Tokenannie
Token ID42883
Feature activation+3.43e+00

Pos logit contributions
gments+1.723
 synd+1.718
sts+1.619
��+1.585
vertisements+1.568
Neg logit contributions
ctor-1.275
imates-1.127
 WARN-1.091
 Mehran-1.042
kel-1.025
Token Mae
Token ID34673
Feature activation+1.55e+00

Pos logit contributions
 synd+5.601
gments+5.501
sts+5.057
gements+4.844
vertisements+4.834
Neg logit contributions
ctor-7.378
imates-6.779
 WARN-6.488
 Mehran-6.337
pei-6.228
Token and
Token ID290
Feature activation+4.02e-01

Pos logit contributions
 synd+2.537
gments+2.512
sts+2.315
��+2.234
gements+2.198
Neg logit contributions
ctor-3.416
imates-3.127
 WARN-3.022
 Mehran-2.950
kel-2.908
Token Freddie
Token ID31658
Feature activation+2.52e+00

Pos logit contributions
gments+0.439
 synd+0.427
sts+0.384
��+0.366
vertisements+0.363
Neg logit contributions
ctor-1.102
imates-1.026
 WARN-1.013
 Mehran-0.984
kel-0.977
Token Mac
Token ID4100
Feature activation+1.37e+00

Pos logit contributions
gments+3.859
 synd+3.846
sts+3.520
��+3.422
vertisements+3.369
Neg logit contributions
ctor-5.770
imates-5.295
 WARN-5.180
 Mehran-5.014
kel-4.969
Token to
Token ID284
Feature activation+1.66e-01

Pos logit contributions
 synd+2.226
gments+2.201
sts+2.028
��+1.961
vertisements+1.934
Neg logit contributions
ctor-3.010
imates-2.741
 WARN-2.663
 Mehran-2.596
kel-2.567
Token for
Token ID329
Feature activation-2.19e-01

Pos logit contributions
ctor+0.617
imates+0.560
 WARN+0.546
 Mehran+0.526
kel+0.522
Neg logit contributions
gments-0.522
 synd-0.517
sts-0.481
��-0.474
vertisements-0.464
Token Don
Token ID2094
Feature activation+1.05e+00

Pos logit contributions
ctor+0.447
imates+0.405
 WARN+0.395
 Mehran+0.381
kel+0.377
Neg logit contributions
gments-0.389
 synd-0.387
sts-0.359
��-0.353
vertisements-0.347
Token Giovanni
Token ID50191
Feature activation+1.03e-01

Pos logit contributions
gments+1.909
 synd+1.903
sts+1.774
��+1.723
vertisements+1.705
Neg logit contributions
ctor-2.089
imates-1.901
 WARN-1.856
 Mehran-1.781
kel-1.753
Token and
Token ID290
Feature activation+7.99e-02

Pos logit contributions
gments+0.182
 synd+0.181
sts+0.168
��+0.164
vertisements+0.161
Neg logit contributions
ctor-0.211
imates-0.192
 WARN-0.187
 Mehran-0.180
kel-0.178
Token La
Token ID4689
Feature activation-9.54e-01

Pos logit contributions
gments+0.141
 synd+0.140
sts+0.130
��+0.127
vertisements+0.125
Neg logit contributions
ctor-0.165
imates-0.150
 WARN-0.146
 Mehran-0.141
kel-0.140
Token Boh
Token ID44366
Feature activation+3.40e+00

Pos logit contributions
ctor+1.974
imates+1.779
 WARN+1.728
 Mehran+1.678
kel+1.659
Neg logit contributions
gments-1.679
 synd-1.658
sts-1.546
��-1.506
gements-1.493
Tokenè
Token ID14064
Feature activation+2.48e+00

Pos logit contributions
 synd+5.981
gments+5.918
sts+5.610
��+5.260
gements+5.212
Neg logit contributions
ctor-6.640
 WARN-6.014
imates-5.970
 Mehran-5.851
STD-5.613
Tokenme
Token ID1326
Feature activation+8.10e-01

Pos logit contributions
 synd+4.540
gments+4.448
sts+4.362
gements+4.007
��+3.917
Neg logit contributions
ctor-4.736
imates-4.357
 WARN-4.287
 Mehran-4.262
conservancy-4.021
Token,
Token ID11
Feature activation+8.08e-02

Pos logit contributions
gments+1.413
 synd+1.412
sts+1.305
��+1.272
vertisements+1.256
Neg logit contributions
ctor-1.680
imates-1.527
 WARN-1.491
 Mehran-1.438
kel-1.422
Token visits
Token ID11864
Feature activation-6.58e-01

Pos logit contributions
gments+0.142
 synd+0.141
sts+0.131
��+0.129
vertisements+0.126
Neg logit contributions
ctor-0.167
imates-0.151
 WARN-0.148
 Mehran-0.143
kel-0.141
Token to
Token ID284
Feature activation-4.69e-01

Pos logit contributions
ctor+1.378
imates+1.255
 WARN+1.221
 Mehran+1.181
kel+1.168
Neg logit contributions
gments-1.136
 synd-1.130
sts-1.046
��-1.027
vertisements-1.008
Token 2008
Token ID3648
Feature activation-3.75e-01

Pos logit contributions
gments+0.111
 synd+0.109
sts+0.102
��+0.100
vertisements+0.098
Neg logit contributions
ctor-0.128
imates-0.116
 WARN-0.113
 Mehran-0.109
kel-0.109
Token and
Token ID290
Feature activation-3.48e-04

Pos logit contributions
ctor+0.769
imates+0.695
 WARN+0.679
 Mehran+0.658
kel+0.651
Neg logit contributions
gments-0.667
 synd-0.664
sts-0.613
��-0.603
vertisements-0.592
Token his
Token ID465
Feature activation-7.81e-01

Pos logit contributions
ctor+0.001
imates+0.001
 WARN+0.001
 Mehran+0.001
kel+0.001
Neg logit contributions
gments-0.001
 synd-0.001
sts-0.001
��-0.001
vertisements-0.001
Token then
Token ID788
Feature activation+4.09e-01

Pos logit contributions
ctor+1.627
imates+1.473
 WARN+1.442
 Mehran+1.397
kel+1.383
Neg logit contributions
gments-1.356
 synd-1.334
sts-1.245
��-1.236
vertisements-1.214
Token agent
Token ID5797
Feature activation-1.44e-01

Pos logit contributions
gments+0.736
 synd+0.734
sts+0.681
��+0.665
vertisements+0.656
Neg logit contributions
ctor-0.829
imates-0.752
 WARN-0.733
 Mehran-0.706
kel-0.699
Token Bruno
Token ID31045
Feature activation+3.38e+00

Pos logit contributions
ctor+0.291
imates+0.262
 WARN+0.257
 Mehran+0.249
kel+0.245
Neg logit contributions
gments-0.260
 synd-0.257
sts-0.241
��-0.236
vertisements-0.233
Token Sat
Token ID7031
Feature activation+1.29e+00

Pos logit contributions
 synd+5.843
gments+5.768
sts+5.391
gements+5.093
��+5.089
Neg logit contributions
ctor-6.679
imates-6.225
 WARN-5.981
 Mehran-5.657
kel-5.642
Tokenin
Token ID259
Feature activation-5.90e-01

Pos logit contributions
gments+2.531
 synd+2.516
sts+2.364
��+2.287
vertisements+2.276
Neg logit contributions
ctor-2.374
imates-2.135
 WARN-2.097
 Mehran-2.006
kel-1.956
Token at
Token ID379
Feature activation-4.93e-01

Pos logit contributions
ctor+1.158
imates+1.038
 WARN+1.015
 Mehran+0.975
kel+0.975
Neg logit contributions
gments-1.103
 synd-1.096
sts-1.019
��-1.005
vertisements-0.989
Token IM
Token ID8959
Feature activation-1.22e+00

Pos logit contributions
ctor+0.975
imates+0.884
 WARN+0.862
 Mehran+0.836
kel+0.824
Neg logit contributions
gments-0.906
 synd-0.899
sts-0.838
��-0.828
vertisements-0.810
TokenG
Token ID38
Feature activation+4.98e-01

Pos logit contributions
ctor+2.089
imates+1.843
 WARN+1.802
 Mehran+1.718
kel+1.705
Neg logit contributions
gments-2.562
 synd-2.521
sts-2.397
��-2.350
vertisements-2.334
Token government
Token ID1230
Feature activation-8.54e-01

Pos logit contributions
ctor+0.411
imates+0.373
 WARN+0.366
 Mehran+0.351
kel+0.348
Neg logit contributions
gments-0.348
 synd-0.343
sts-0.320
��-0.316
vertisements-0.311
Token of
Token ID286
Feature activation-3.66e-02

Pos logit contributions
ctor+1.734
imates+1.567
 WARN+1.538
 Mehran+1.472
kel+1.462
Neg logit contributions
gments-1.527
 synd-1.504
sts-1.402
��-1.402
vertisements-1.367
Token President
Token ID1992
Feature activation+5.20e-02

Pos logit contributions
ctor+0.079
imates+0.072
 WARN+0.070
 Mehran+0.068
kel+0.067
Neg logit contributions
gments-0.061
 synd-0.061
sts-0.056
��-0.055
vertisements-0.054
Token Dil
Token ID29328
Feature activation+5.83e-01

Pos logit contributions
gments+0.092
 synd+0.092
sts+0.085
��+0.083
vertisements+0.082
Neg logit contributions
ctor-0.107
imates-0.097
 WARN-0.094
 Mehran-0.091
kel-0.090
Tokenma
Token ID2611
Feature activation+6.77e-01

Pos logit contributions
 synd+1.185
gments+1.183
sts+1.106
��+1.095
vertisements+1.075
Neg logit contributions
ctor-1.049
imates-0.935
 WARN-0.900
 Mehran-0.866
kel-0.864
Token Rousse
Token ID42849
Feature activation+3.36e+00

Pos logit contributions
gments+1.289
 synd+1.280
sts+1.194
��+1.171
vertisements+1.155
Neg logit contributions
ctor-1.305
imates-1.178
 WARN-1.149
 Mehran-1.104
kel-1.091
Tokenff
Token ID487
Feature activation-6.01e-02

Pos logit contributions
gments+4.604
 synd+4.591
sts+4.421
gements+3.995
��+3.780
Neg logit contributions
ctor-7.747
imates-7.408
 WARN-7.276
 Mehran-7.145
STD-6.803
Token and
Token ID290
Feature activation-4.54e-01

Pos logit contributions
ctor+0.124
imates+0.113
 WARN+0.110
 Mehran+0.106
kel+0.105
Neg logit contributions
gments-0.106
 synd-0.105
sts-0.097
��-0.096
vertisements-0.094
Token no
Token ID645
Feature activation-2.53e-01

Pos logit contributions
ctor+0.923
imates+0.837
 WARN+0.817
 Mehran+0.786
kel+0.778
Neg logit contributions
gments-0.814
 synd-0.808
sts-0.752
��-0.739
vertisements-0.728
Token one
Token ID530
Feature activation-1.76e+00

Pos logit contributions
ctor+0.505
imates+0.457
 WARN+0.445
 Mehran+0.428
kel+0.426
Neg logit contributions
gments-0.463
 synd-0.458
sts-0.426
��-0.421
vertisements-0.414
Token in
Token ID287
Feature activation-6.48e-01

Pos logit contributions
ctor+3.321
imates+2.957
 WARN+2.877
 Mehran+2.762
kel+2.753
Neg logit contributions
gments-3.406
 synd-3.341
��-3.147
sts-3.146
vertisements-3.079
Token Ukraine
Token ID7049
Feature activation+4.06e-01

Pos logit contributions
ctor+0.598
imates+0.544
 WARN+0.532
 Mehran+0.513
kel+0.509
Neg logit contributions
gments-0.483
 synd-0.478
sts-0.444
��-0.435
vertisements-0.427
Token.
Token ID13
Feature activation-2.29e-01

Pos logit contributions
 synd+0.718
gments+0.718
sts+0.665
��+0.646
vertisements+0.640
Neg logit contributions
ctor-0.834
imates-0.757
 WARN-0.739
 Mehran-0.712
kel-0.704
Token The
Token ID383
Feature activation-4.35e-01

Pos logit contributions
ctor+0.455
imates+0.412
 WARN+0.402
 Mehran+0.386
kel+0.383
Neg logit contributions
gments-0.423
 synd-0.420
sts-0.390
��-0.385
vertisements-0.378
Token BBC
Token ID7823
Feature activation+4.45e-01

Pos logit contributions
ctor+0.898
imates+0.817
 WARN+0.799
 Mehran+0.768
kel+0.763
Neg logit contributions
gments-0.764
 synd-0.754
sts-0.702
��-0.692
vertisements-0.681
Token's
Token ID338
Feature activation-6.02e-01

Pos logit contributions
gments+0.779
 synd+0.774
sts+0.720
��+0.703
vertisements+0.694
Neg logit contributions
ctor-0.924
imates-0.841
 WARN-0.821
 Mehran-0.791
kel-0.783
Token Rupert
Token ID34315
Feature activation+3.34e+00

Pos logit contributions
ctor+1.250
imates+1.141
 WARN+1.119
 Mehran+1.088
kel+1.069
Neg logit contributions
gments-1.038
 synd-1.023
sts-0.960
��-0.941
vertisements-0.925
Token Wing
Token ID13405
Feature activation-4.20e-01

Pos logit contributions
 synd+5.487
gments+5.476
sts+5.009
gements+4.786
��+4.740
Neg logit contributions
ctor-7.137
imates-6.587
 WARN-6.356
 Mehran-6.178
kel-6.106
Tokenfield
Token ID3245
Feature activation+4.81e-01

Pos logit contributions
ctor+0.656
imates+0.576
 WARN+0.557
 Mehran+0.528
kel+0.522
Neg logit contributions
gments-0.953
 synd-0.949
sts-0.895
��-0.878
vertisements-0.872
Token-
Token ID12
Feature activation+5.73e-02

Pos logit contributions
gments+0.811
 synd+0.810
sts+0.747
��+0.728
vertisements+0.718
Neg logit contributions
ctor-1.028
imates-0.937
 WARN-0.915
 Mehran-0.883
kel-0.875
TokenHay
Token ID31306
Feature activation-3.17e-01

Pos logit contributions
gments+0.135
 synd+0.135
sts+0.128
��+0.125
vertisements+0.124
Neg logit contributions
ctor-0.084
imates-0.073
 WARN-0.070
 Mehran-0.067
kel-0.065
Tokenes
Token ID274
Feature activation-4.46e-01

Pos logit contributions
ctor+0.784
imates+0.721
 WARN+0.707
kel+0.685
 Mehran+0.685
Neg logit contributions
gments-0.428
 synd-0.424
sts-0.384
��-0.378
vertisements-0.368
Token intend
Token ID14765
Feature activation-8.68e-02

Pos logit contributions
ctor+0.014
imates+0.012
 WARN+0.012
 Mehran+0.012
kel+0.012
Neg logit contributions
gments-0.012
 synd-0.012
sts-0.011
��-0.011
vertisements-0.010
Token to
Token ID284
Feature activation+2.55e-01

Pos logit contributions
ctor+0.188
imates+0.171
 WARN+0.168
 Mehran+0.162
kel+0.160
Neg logit contributions
gments-0.144
 synd-0.144
sts-0.132
��-0.129
vertisements-0.127
Token give
Token ID1577
Feature activation+2.88e-01

Pos logit contributions
gments+0.436
 synd+0.433
sts+0.402
��+0.392
vertisements+0.387
Neg logit contributions
ctor-0.539
imates-0.490
 WARN-0.480
 Mehran-0.462
kel-0.458
Token third
Token ID2368
Feature activation-3.49e-01

Pos logit contributions
gments+0.498
 synd+0.496
sts+0.459
��+0.449
vertisements+0.442
Neg logit contributions
ctor-0.604
imates-0.549
 WARN-0.537
 Mehran-0.518
kel-0.513
Token driver
Token ID4639
Feature activation+1.70e-02

Pos logit contributions
ctor+0.628
imates+0.561
 WARN+0.546
 Mehran+0.520
kel+0.514
Neg logit contributions
gments-0.708
 synd-0.697
sts-0.655
��-0.644
vertisements-0.639
Token Bruno
Token ID31045
Feature activation+3.30e+00

Pos logit contributions
gments+0.030
 synd+0.030
sts+0.028
��+0.027
vertisements+0.027
Neg logit contributions
ctor-0.035
imates-0.032
 WARN-0.031
 Mehran-0.030
kel-0.030
Token S
Token ID311
Feature activation+7.28e-01

Pos logit contributions
 synd+5.409
gments+5.356
sts+4.956
��+4.673
vertisements+4.652
Neg logit contributions
ctor-6.889
imates-6.443
 WARN-6.189
 Mehran-5.976
kel-5.901
Tokenenna
Token ID13713
Feature activation+2.62e-01

Pos logit contributions
gments+1.489
 synd+1.482
sts+1.391
��+1.355
vertisements+1.347
Neg logit contributions
ctor-1.294
imates-1.157
 WARN-1.126
 Mehran-1.079
kel-1.061
Token some
Token ID617
Feature activation+2.61e-01

Pos logit contributions
gments+0.458
 synd+0.456
sts+0.422
��+0.412
vertisements+0.407
Neg logit contributions
ctor-0.545
imates-0.496
 WARN-0.484
 Mehran-0.467
kel-0.462
Token time
Token ID640
Feature activation+2.59e-02

Pos logit contributions
gments+0.492
 synd+0.490
sts+0.457
��+0.447
vertisements+0.441
Neg logit contributions
ctor-0.505
imates-0.455
 WARN-0.444
 Mehran-0.427
kel-0.422
Token in
Token ID287
Feature activation-8.12e-03

Pos logit contributions
gments+0.044
 synd+0.043
sts+0.040
��+0.039
vertisements+0.039
Neg logit contributions
ctor-0.055
imates-0.050
 WARN-0.049
 Mehran-0.047
kel-0.047
Token Manhattan
Token ID13458
Feature activation+6.69e-01

Pos logit contributions
 synd+1.255
gments+1.248
sts+1.162
��+1.124
vertisements+1.121
Neg logit contributions
ctor-1.333
imates-1.208
 WARN-1.178
 Mehran-1.127
kel-1.116
Token
Token ID784
Feature activation+1.74e+00

Pos logit contributions
gments+1.209
 synd+1.209
sts+1.121
��+1.092
vertisements+1.081
Neg logit contributions
ctor-1.345
imates-1.220
 WARN-1.188
 Mehran-1.144
kel-1.132
Token set
Token ID900
Feature activation+3.96e-01

Pos logit contributions
 synd+3.061
gments+3.029
sts+2.820
��+2.741
vertisements+2.718
Neg logit contributions
ctor-3.554
imates-3.226
 WARN-3.171
 Mehran-3.044
pei-2.968
Token to
Token ID284
Feature activation-8.37e-02

Pos logit contributions
gments+0.708
 synd+0.703
sts+0.651
��+0.637
vertisements+0.628
Neg logit contributions
ctor-0.807
imates-0.734
 WARN-0.714
 Mehran-0.689
kel-0.682
Token the
Token ID262
Feature activation+2.72e-01

Pos logit contributions
ctor+0.178
imates+0.162
 WARN+0.158
 Mehran+0.153
kel+0.152
Neg logit contributions
gments-0.142
 synd-0.141
sts-0.131
��-0.127
vertisements-0.125
Token puls
Token ID22271
Feature activation+3.24e+00

Pos logit contributions
gments+0.473
 synd+0.469
sts+0.435
��+0.425
vertisements+0.420
Neg logit contributions
ctor-0.568
imates-0.516
 WARN-0.505
 Mehran-0.486
kel-0.482
Tokenating
Token ID803
Feature activation+5.24e-01

Pos logit contributions
gments+4.153
 synd+4.103
sts+3.760
vertisements+3.485
gements+3.462
Neg logit contributions
ctor-7.940
imates-7.357
 WARN-7.322
 Mehran-7.056
Http-6.958
Token tune
Token ID14009
Feature activation+2.83e-01

Pos logit contributions
gments+0.891
 synd+0.887
sts+0.821
��+0.801
vertisements+0.789
Neg logit contributions
ctor-1.111
imates-1.012
 WARN-0.989
 Mehran-0.955
kel-0.945
Token of
Token ID286
Feature activation-3.53e-01

Pos logit contributions
gments+0.520
 synd+0.519
sts+0.482
��+0.471
vertisements+0.465
Neg logit contributions
ctor-0.562
imates-0.509
 WARN-0.495
 Mehran-0.476
kel-0.471
Token George
Token ID4502
Feature activation+6.90e-02

Pos logit contributions
ctor+0.729
imates+0.663
 WARN+0.649
 Mehran+0.631
kel+0.621
Neg logit contributions
gments-0.618
 synd-0.609
sts-0.565
��-0.556
gements-0.547
Token G
Token ID402
Feature activation+2.13e+00

Pos logit contributions
gments+0.117
 synd+0.117
sts+0.108
��+0.105
vertisements+0.104
Neg logit contributions
ctor-0.147
imates-0.134
 WARN-0.131
 Mehran-0.126
kel-0.125
Token by
Token ID416
Feature activation+8.20e-01

Pos logit contributions
gments+0.465
 synd+0.463
sts+0.432
��+0.422
vertisements+0.417
Neg logit contributions
ctor-0.479
imates-0.432
 WARN-0.421
 Mehran-0.404
kel-0.400
Token Reddit
Token ID10750
Feature activation-5.27e-01

Pos logit contributions
 synd+1.531
gments+1.525
sts+1.417
��+1.381
vertisements+1.364
Neg logit contributions
ctor-1.604
imates-1.449
 WARN-1.409
 Mehran-1.354
kel-1.343
Token interim
Token ID19303
Feature activation+2.54e-01

Pos logit contributions
ctor+1.126
imates+1.011
 WARN+0.993
kel+0.956
 Mehran+0.952
Neg logit contributions
gments-0.897
 synd-0.875
sts-0.821
��-0.807
vertisements-0.793
Token CEO
Token ID6123
Feature activation+8.79e-01

Pos logit contributions
gments+0.332
 synd+0.330
sts+0.297
��+0.287
vertisements+0.282
Neg logit contributions
ctor-0.639
imates-0.591
 WARN-0.580
 Mehran-0.563
kel-0.559
Token Ellen
Token ID23811
Feature activation+2.95e-01

Pos logit contributions
gments+1.541
 synd+1.532
sts+1.428
��+1.389
vertisements+1.378
Neg logit contributions
ctor-1.821
imates-1.651
 WARN-1.617
 Mehran-1.567
kel-1.543
Token P
Token ID350
Feature activation+3.23e+00

Pos logit contributions
gments+0.441
 synd+0.428
sts+0.393
vertisements+0.391
��+0.388
Neg logit contributions
ctor-0.691
imates-0.632
 WARN-0.626
kel-0.602
 Mehran-0.601
Tokenao
Token ID5488
Feature activation+5.13e-01

Pos logit contributions
 synd+3.619
gments+3.569
sts+3.152
vertisements+2.982
��+2.947
Neg logit contributions
ctor-8.548
 Mehran-7.908
imates-7.794
 WARN-7.734
Http-7.394
Token and
Token ID290
Feature activation+2.28e-01

Pos logit contributions
 synd+0.891
gments+0.890
sts+0.823
��+0.801
vertisements+0.791
Neg logit contributions
ctor-1.069
imates-0.973
 WARN-0.948
 Mehran-0.916
kel-0.906
Token other
Token ID584
Feature activation+1.22e+00

Pos logit contributions
gments+0.415
 synd+0.412
sts+0.384
��+0.376
vertisements+0.371
Neg logit contributions
ctor-0.457
imates-0.413
 WARN-0.404
 Mehran-0.389
kel-0.384
Token company
Token ID1664
Feature activation+1.13e+00

Pos logit contributions
 synd+2.091
gments+2.078
sts+1.924
��+1.862
vertisements+1.844
Neg logit contributions
ctor-2.576
imates-2.339
 WARN-2.272
 Mehran-2.203
kel-2.181
Token executives
Token ID12353
Feature activation+1.56e+00

Pos logit contributions
 synd+1.793
gments+1.787
sts+1.643
��+1.591
vertisements+1.572
Neg logit contributions
ctor-2.520
imates-2.314
 WARN-2.254
 Mehran-2.183
kel-2.161
Token and
Token ID290
Feature activation+5.32e-01

Pos logit contributions
ctor+1.014
imates+0.916
 WARN+0.887
kel+0.864
 Mehran+0.849
Neg logit contributions
gments-0.909
 synd-0.901
sts-0.836
��-0.827
vertisements-0.815
Token his
Token ID465
Feature activation-5.23e-01

Pos logit contributions
gments+0.910
 synd+0.906
sts+0.840
��+0.819
vertisements+0.808
Neg logit contributions
ctor-1.125
imates-1.024
 WARN-1.001
 Mehran-0.967
kel-0.957
Token $
Token ID720
Feature activation+3.59e-01

Pos logit contributions
ctor+1.097
imates+0.998
 WARN+0.979
 Mehran+0.939
kel+0.938
Neg logit contributions
gments-0.892
 synd-0.871
sts-0.824
��-0.814
vertisements-0.803
Token20
Token ID1238
Feature activation+6.71e-01

Pos logit contributions
 synd+0.716
gments+0.716
sts+0.673
��+0.645
gements+0.642
Neg logit contributions
ctor-0.644
imates-0.585
 WARN-0.566
 Mehran-0.544
kel-0.531
Token-
Token ID12
Feature activation+8.16e-01

Pos logit contributions
 synd+1.014
gments+1.014
sts+0.925
��+0.896
vertisements+0.881
Neg logit contributions
ctor-1.546
imates-1.420
 WARN-1.387
 Mehran-1.349
kel-1.328
Tokenmillion
Token ID14100
Feature activation+3.20e+00

Pos logit contributions
gments+1.700
 synd+1.696
sts+1.595
��+1.541
gements+1.532
Neg logit contributions
ctor-1.396
imates-1.264
 WARN-1.223
 Mehran-1.180
kel-1.136
Token cap
Token ID1451
Feature activation+1.11e+00

Pos logit contributions
 synd+5.690
gments+5.561
sts+5.071
��+4.907
gements+4.834
Neg logit contributions
ctor-6.493
imates-5.855
 WARN-5.643
 Mehran-5.589
kel-5.290
Token hold
Token ID1745
Feature activation+1.23e+00

Pos logit contributions
gments+1.951
 synd+1.941
sts+1.801
��+1.759
vertisements+1.736
Neg logit contributions
ctor-2.310
imates-2.099
 WARN-2.048
 Mehran-1.973
kel-1.958
Token.
Token ID13
Feature activation+3.40e-01

Pos logit contributions
gments+2.147
 synd+2.146
sts+1.980
��+1.925
vertisements+1.904
Neg logit contributions
ctor-2.571
imates-2.337
 WARN-2.283
 Mehran-2.204
kel-2.176
Token A
Token ID317
Feature activation+2.92e-01

Pos logit contributions
gments+0.622
 synd+0.621
sts+0.577
��+0.562
vertisements+0.556
Neg logit contributions
ctor-0.678
imates-0.614
 WARN-0.598
 Mehran-0.576
kel-0.570
Token cap
Token ID1451
Feature activation+1.15e+00

Pos logit contributions
gments+0.508
 synd+0.507
sts+0.469
��+0.458
vertisements+0.452
Neg logit contributions
ctor-0.609
imates-0.554
 WARN-0.540
 Mehran-0.521
kel-0.516
Token return
Token ID1441
Feature activation+8.30e-01

Pos logit contributions
gments+0.762
 synd+0.758
sts+0.703
��+0.687
vertisements+0.678
Neg logit contributions
ctor-0.890
imates-0.809
 WARN-0.790
 Mehran-0.761
kel-0.753
Token of
Token ID286
Feature activation-5.22e-02

Pos logit contributions
gments+1.517
 synd+1.512
sts+1.408
��+1.368
gements+1.353
Neg logit contributions
ctor-1.651
imates-1.492
 WARN-1.464
 Mehran-1.397
kel-1.390
Token Brian
Token ID8403
Feature activation+1.77e+00

Pos logit contributions
ctor+0.110
imates+0.100
 WARN+0.098
 Mehran+0.095
kel+0.094
Neg logit contributions
gments-0.090
 synd-0.089
sts-0.083
��-0.081
vertisements-0.080
Token (
Token ID357
Feature activation+2.83e-01

Pos logit contributions
gments+3.143
 synd+3.143
sts+2.924
��+2.820
gements+2.805
Neg logit contributions
ctor-3.558
imates-3.248
 WARN-3.158
 Mehran-3.009
kel-2.984
TokenAl
Token ID2348
Feature activation-2.24e-01

Pos logit contributions
gments+0.562
 synd+0.561
sts+0.524
��+0.513
vertisements+0.507
Neg logit contributions
ctor-0.519
imates-0.465
 WARN-0.452
 Mehran-0.434
kel-0.429
Tokenain
Token ID391
Feature activation+3.19e+00

Pos logit contributions
ctor+0.483
imates+0.440
 WARN+0.429
 Mehran+0.416
kel+0.413
Neg logit contributions
 synd-0.373
gments-0.373
sts-0.342
��-0.335
vertisements-0.329
Tokenu
Token ID84
Feature activation+9.04e-01

Pos logit contributions
gments+6.623
 synd+6.535
sts+6.351
gements+6.033
rices+5.893
Neg logit contributions
ctor-5.133
imates-4.660
 WARN-4.613
 Mehran-4.183
conservancy-4.156
Token'
Token ID6
Feature activation+9.67e-03

Pos logit contributions
gments+1.600
 synd+1.592
sts+1.484
��+1.441
vertisements+1.428
Neg logit contributions
ctor-1.834
imates-1.677
 WARN-1.635
 Mehran-1.563
kel-1.550
Tokenuese
Token ID20506
Feature activation+1.90e+00

Pos logit contributions
gments+0.007
 synd+0.007
sts+0.006
��+0.006
vertisements+0.006
Neg logit contributions
ctor-0.030
imates-0.028
 WARN-0.027
 Mehran-0.027
kel-0.027
Token)
Token ID8
Feature activation+3.08e-01

Pos logit contributions
 synd+3.402
gments+3.388
sts+3.173
��+3.064
gements+3.027
Neg logit contributions
ctor-3.778
imates-3.472
 WARN-3.391
 Mehran-3.226
kel-3.176
Token and
Token ID290
Feature activation+3.84e-01

Pos logit contributions
gments+0.534
 synd+0.530
sts+0.490
��+0.481
vertisements+0.474
Neg logit contributions
ctor-0.647
imates-0.588
 WARN-0.574
 Mehran-0.556
kel-0.551
Token in
Token ID287
Feature activation-1.92e-01

Pos logit contributions
 synd+0.216
gments+0.216
sts+0.200
��+0.195
vertisements+0.193
Neg logit contributions
ctor-0.233
imates-0.211
 WARN-0.206
 Mehran-0.198
kel-0.195
Token which
Token ID543
Feature activation+2.76e-01

Pos logit contributions
ctor+0.377
imates+0.341
 WARN+0.332
 Mehran+0.320
kel+0.317
Neg logit contributions
gments-0.356
 synd-0.354
sts-0.330
��-0.323
vertisements-0.319
Token she
Token ID673
Feature activation-6.11e-01

Pos logit contributions
gments+0.531
 synd+0.530
sts+0.494
��+0.483
vertisements+0.477
Neg logit contributions
ctor-0.522
imates-0.470
 WARN-0.458
 Mehran-0.439
kel-0.435
Token discussed
Token ID6693
Feature activation-1.40e-01

Pos logit contributions
ctor+1.228
imates+1.108
 WARN+1.078
 Mehran+1.038
kel+1.032
Neg logit contributions
gments-1.113
 synd-1.104
sts-1.028
��-1.011
vertisements-0.997
Token Michel
Token ID12386
Feature activation+1.38e+00

Pos logit contributions
ctor+0.277
imates+0.251
 WARN+0.244
 Mehran+0.236
kel+0.233
Neg logit contributions
gments-0.257
 synd-0.255
sts-0.238
��-0.232
vertisements-0.230
Token Fou
Token ID34296
Feature activation+3.16e+00

Pos logit contributions
gments+2.038
 synd+2.033
sts+1.854
��+1.797
vertisements+1.769
Neg logit contributions
ctor-3.229
imates-2.970
 WARN-2.908
 Mehran-2.815
kel-2.791
Tokenc
Token ID66
Feature activation-9.14e-01

Pos logit contributions
gments+6.319
 synd+6.238
sts+5.976
vertisements+5.609
gements+5.607
Neg logit contributions
ctor-5.428
imates-5.001
 WARN-4.992
 Mehran-4.787
pei-4.540
Tokenault
Token ID1721
Feature activation+5.01e-01

Pos logit contributions
ctor+1.777
imates+1.631
kel+1.548
 WARN+1.519
 Mehran+1.471
Neg logit contributions
gments-1.712
 synd-1.710
��-1.622
sts-1.574
vertisements-1.551
Token
Token ID447
Feature activation-1.30e-01

Pos logit contributions
gments+0.884
 synd+0.882
sts+0.817
��+0.796
vertisements+0.786
Neg logit contributions
ctor-1.031
imates-0.936
 WARN-0.913
 Mehran-0.880
kel-0.871
Token
Token ID247
Feature activation-4.55e-01

Pos logit contributions
ctor+0.303
imates+0.278
 WARN+0.269
 Mehran+0.262
kel+0.262
Neg logit contributions
 synd-0.194
gments-0.193
sts-0.177
vertisements-0.171
gements-0.168
Tokens
Token ID82
Feature activation+1.64e-01

Pos logit contributions
ctor+0.928
imates+0.841
 WARN+0.815
 Mehran+0.787
kel+0.785
Neg logit contributions
gments-0.814
 synd-0.806
sts-0.750
��-0.738
vertisements-0.728
Token ACL
Token ID17382
Feature activation-7.28e-01

Pos logit contributions
ctor+1.155
imates+1.049
 WARN+1.018
 Mehran+0.976
kel+0.963
Neg logit contributions
gments-1.124
 synd-1.112
sts-1.044
��-1.023
vertisements-1.011
Token sp
Token ID599
Feature activation+7.88e-01

Pos logit contributions
ctor+1.589
imates+1.450
 WARN+1.419
kel+1.367
 Mehran+1.350
Neg logit contributions
gments-1.179
 synd-1.154
sts-1.082
vertisements-1.051
��-1.050
Tokenrain
Token ID3201
Feature activation-2.02e-01

Pos logit contributions
 synd+1.355
gments+1.352
sts+1.251
��+1.230
vertisements+1.205
Neg logit contributions
ctor-1.656
imates-1.511
 WARN-1.469
 Mehran-1.415
kel-1.414
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
ctor+0.407
imates+0.370
 WARN+0.362
 Mehran+0.345
kel+0.344
Neg logit contributions
gments-0.362
 synd-0.357
sts-0.333
��-0.329
gements-0.323
Token Brian
Token ID8403
Feature activation+1.50e+00

Pos logit contributions
ctor+0.289
imates+0.264
 WARN+0.258
 Mehran+0.248
kel+0.245
Neg logit contributions
gments-0.255
 synd-0.252
sts-0.234
��-0.230
gements-0.226
Token Bab
Token ID12400
Feature activation+3.16e+00

Pos logit contributions
 synd+2.347
gments+2.335
sts+2.143
��+2.058
gements+2.032
Neg logit contributions
ctor-3.379
imates-3.095
 WARN-3.004
 Mehran-2.912
kel-2.896
Tokenine
Token ID500
Feature activation+1.52e+00

Pos logit contributions
 synd+5.754
gments+5.743
sts+5.507
vertisements+5.216
gements+5.180
Neg logit contributions
ctor-5.901
imates-5.408
 WARN-5.340
 Mehran-5.142
DEM-5.039
Tokenau
Token ID559
Feature activation+9.09e-01

Pos logit contributions
gments+2.339
 synd+2.338
sts+2.136
vertisements+2.057
gements+2.054
Neg logit contributions
ctor-3.413
imates-3.139
 WARN-3.040
 Mehran-2.977
kel-2.896
Token/
Token ID14
Feature activation-3.05e-01

Pos logit contributions
 synd+2.079
gments+2.063
sts+1.953
vertisements+1.898
��+1.894
Neg logit contributions
ctor-1.402
imates-1.213
 WARN-1.174
 Mehran-1.123
kel-1.100
TokenNBA
Token ID32470
Feature activation-1.22e+00

Pos logit contributions
ctor+0.485
imates+0.428
 WARN+0.415
 Mehran+0.393
kel+0.390
Neg logit contributions
gments-0.683
 synd-0.678
sts-0.640
��-0.630
vertisements-0.623
TokenE
Token ID36
Feature activation-1.38e-01

Pos logit contributions
ctor+2.242
imates+2.022
 WARN+1.958
kel+1.818
pei+1.810
Neg logit contributions
gments-2.434
 synd-2.387
��-2.216
gements-2.212
sts-2.211
Token glowing
Token ID21377
Feature activation+4.81e-01

Pos logit contributions
gments+0.431
 synd+0.429
sts+0.398
��+0.389
vertisements+0.384
Neg logit contributions
ctor-0.481
imates-0.436
 WARN-0.425
 Mehran-0.409
kel-0.406
Token like
Token ID588
Feature activation-6.44e-01

Pos logit contributions
gments+0.850
 synd+0.847
sts+0.786
��+0.767
vertisements+0.757
Neg logit contributions
ctor-0.987
imates-0.896
 WARN-0.875
 Mehran-0.843
kel-0.835
Token a
Token ID257
Feature activation-3.33e-01

Pos logit contributions
ctor+1.294
imates+1.177
 WARN+1.146
 Mehran+1.111
kel+1.100
Neg logit contributions
gments-1.166
 synd-1.145
sts-1.070
��-1.047
gements-1.035
Token massive
Token ID4858
Feature activation+1.33e+00

Pos logit contributions
ctor+0.693
imates+0.628
 WARN+0.619
 Mehran+0.596
kel+0.591
Neg logit contributions
gments-0.581
 synd-0.571
sts-0.534
��-0.522
vertisements-0.515
Token,
Token ID11
Feature activation+9.58e-02

Pos logit contributions
 synd+2.342
gments+2.315
sts+2.137
��+2.083
vertisements+2.059
Neg logit contributions
ctor-2.746
imates-2.504
 WARN-2.424
 Mehran-2.337
kel-2.316
Token puls
Token ID22271
Feature activation+3.15e+00

Pos logit contributions
gments+0.169
 synd+0.168
sts+0.156
��+0.152
vertisements+0.150
Neg logit contributions
ctor-0.198
imates-0.180
 WARN-0.175
 Mehran-0.169
kel-0.167
Tokenating
Token ID803
Feature activation+4.31e-01

Pos logit contributions
gments+3.782
 synd+3.757
sts+3.398
vertisements+3.148
��+3.119
Neg logit contributions
ctor-7.872
imates-7.375
 WARN-7.313
 Mehran-7.062
Http-6.959
Token white
Token ID2330
Feature activation-5.47e-01

Pos logit contributions
gments+0.751
 synd+0.746
sts+0.692
��+0.675
vertisements+0.667
Neg logit contributions
ctor-0.898
imates-0.816
 WARN-0.798
 Mehran-0.769
kel-0.762
Token orb
Token ID15769
Feature activation+8.43e-01

Pos logit contributions
ctor+1.157
imates+1.043
 WARN+1.027
 Mehran+0.991
kel+0.987
Neg logit contributions
gments-0.938
 synd-0.918
sts-0.869
��-0.840
vertisements-0.833
Token,
Token ID11
Feature activation+1.46e-01

Pos logit contributions
gments+1.488
 synd+1.487
sts+1.377
��+1.339
vertisements+1.323
Neg logit contributions
ctor-1.724
imates-1.569
 WARN-1.531
 Mehran-1.476
kel-1.454
Token a
Token ID257
Feature activation+6.79e-02

Pos logit contributions
gments+0.258
 synd+0.257
sts+0.238
��+0.233
vertisements+0.230
Neg logit contributions
ctor-0.300
imates-0.273
 WARN-0.266
 Mehran-0.257
kel-0.254
Token
Token ID198
Feature activation+5.09e-01

Pos logit contributions
 synd+3.238
gments+3.227
sts+2.997
��+2.909
vertisements+2.887
Neg logit contributions
ctor-3.540
imates-3.235
 WARN-3.119
 Mehran-3.009
kel-2.973
Token
Token ID198
Feature activation+7.33e-01

Pos logit contributions
 synd+0.906
gments+0.898
sts+0.843
��+0.795
gements+0.794
Neg logit contributions
ctor-1.029
imates-0.943
 WARN-0.917
 Mehran-0.881
kel-0.866
TokenBrown
Token ID20644
Feature activation+1.06e+00

Pos logit contributions
gments+1.459
 synd+1.451
sts+1.372
��+1.326
gements+1.307
Neg logit contributions
ctor-1.325
imates-1.200
 WARN-1.167
 Mehran-1.120
kel-1.089
Token-
Token ID12
Feature activation+3.16e-01

Pos logit contributions
 synd+2.058
gments+2.043
sts+1.918
��+1.860
vertisements+1.845
Neg logit contributions
ctor-1.982
imates-1.779
 WARN-1.734
 Mehran-1.670
kel-1.633
Tokenb
Token ID65
Feature activation+1.86e+00

Pos logit contributions
gments+0.605
 synd+0.603
sts+0.563
��+0.546
vertisements+0.543
Neg logit contributions
ctor-0.603
imates-0.544
 WARN-0.530
 Mehran-0.512
kel-0.501
Tokenagged
Token ID14655
Feature activation+3.15e+00

Pos logit contributions
gments+3.037
 synd+3.028
sts+2.797
vertisements+2.695
gements+2.659
Neg logit contributions
ctor-4.009
imates-3.681
 WARN-3.604
 Mehran-3.523
kel-3.424
Token lun
Token ID14678
Feature activation-1.08e+00

Pos logit contributions
 synd+5.600
gments+5.404
sts+5.052
vertisements+4.837
��+4.800
Neg logit contributions
ctor-6.260
imates-5.711
 WARN-5.517
 Mehran-5.275
pei-5.241
Tokenches
Token ID2052
Feature activation-1.21e-01

Pos logit contributions
ctor+1.774
imates+1.550
 WARN+1.487
kel+1.423
 Mehran+1.412
Neg logit contributions
 synd-2.357
gments-2.338
��-2.209
sts-2.184
vertisements-2.138
Token in
Token ID287
Feature activation+1.23e-01

Pos logit contributions
ctor+0.251
imates+0.229
 WARN+0.223
 Mehran+0.215
kel+0.213
Neg logit contributions
gments-0.212
 synd-0.209
sts-0.195
��-0.191
vertisements-0.188
Token hand
Token ID1021
Feature activation+3.97e-01

Pos logit contributions
gments+0.214
 synd+0.211
sts+0.197
��+0.193
vertisements+0.189
Neg logit contributions
ctor-0.258
imates-0.235
 WARN-0.229
 Mehran-0.222
kel-0.219
Token,
Token ID11
Feature activation+3.49e-01

Pos logit contributions
gments+0.696
 synd+0.692
sts+0.641
��+0.629
vertisements+0.618
Neg logit contributions
ctor-0.825
imates-0.749
 WARN-0.731
 Mehran-0.704
kel-0.698
TokenBrazil
Token ID39190
Feature activation+2.08e+00

Pos logit contributions
ctor+0.652
imates+0.581
 WARN+0.565
 Mehran+0.541
kel+0.540
Neg logit contributions
 synd-0.689
gments-0.689
sts-0.639
��-0.633
vertisements-0.624
Tokenian
Token ID666
Feature activation+9.79e-01

Pos logit contributions
 synd+3.628
gments+3.603
sts+3.375
vertisements+3.209
��+3.204
Neg logit contributions
ctor-4.248
imates-3.888
 WARN-3.795
 Mehran-3.666
kel-3.585
Token President
Token ID1992
Feature activation+5.93e-01

Pos logit contributions
 synd+1.560
gments+1.550
sts+1.427
��+1.374
vertisements+1.362
Neg logit contributions
ctor-2.186
imates-2.003
 WARN-1.952
 Mehran-1.885
kel-1.873
Token Dil
Token ID29328
Feature activation+4.67e-01

Pos logit contributions
gments+1.022
 synd+1.020
sts+0.943
��+0.918
vertisements+0.907
Neg logit contributions
ctor-1.245
imates-1.134
 WARN-1.106
 Mehran-1.065
kel-1.056
Tokenma
Token ID2611
Feature activation+1.02e+00

Pos logit contributions
 synd+0.965
gments+0.959
sts+0.898
��+0.893
vertisements+0.874
Neg logit contributions
ctor-0.830
imates-0.736
 WARN-0.705
kel-0.683
 Mehran-0.680
Token Rousse
Token ID42849
Feature activation+3.14e+00

Pos logit contributions
gments+2.107
 synd+2.071
sts+1.947
��+1.930
vertisements+1.892
Neg logit contributions
ctor-1.827
imates-1.618
 WARN-1.584
 Mehran-1.522
kel-1.516
Tokenff
Token ID487
Feature activation-2.84e-01

Pos logit contributions
gments+4.689
 synd+4.661
sts+4.401
gements+4.099
��+4.072
Neg logit contributions
ctor-7.088
imates-6.702
 WARN-6.498
 Mehran-6.424
STD-6.088
Token and
Token ID290
Feature activation-7.31e-01

Pos logit contributions
ctor+0.579
imates+0.525
 WARN+0.512
 Mehran+0.492
kel+0.488
Neg logit contributions
gments-0.509
 synd-0.505
sts-0.469
��-0.460
vertisements-0.454
Token Cuban
Token ID18147
Feature activation-1.16e+00

Pos logit contributions
ctor+1.545
imates+1.396
 WARN+1.374
 Mehran+1.336
kel+1.315
Neg logit contributions
gments-1.253
 synd-1.214
sts-1.139
��-1.134
vertisements-1.109
Token President
Token ID1992
Feature activation+4.95e-01

Pos logit contributions
ctor+2.503
imates+2.254
 WARN+2.218
 Mehran+2.163
kel+2.137
Neg logit contributions
gments-1.934
 synd-1.853
��-1.767
sts-1.737
gements-1.708
Token R
Token ID371
Feature activation+3.65e-01

Pos logit contributions
gments+0.940
 synd+0.937
sts+0.873
��+0.853
vertisements+0.843
Neg logit contributions
ctor-0.951
imates-0.858
 WARN-0.836
 Mehran-0.802
kel-0.794
Token you
Token ID345
Feature activation+8.28e-01

Pos logit contributions
gments+0.819
 synd+0.819
sts+0.765
��+0.746
vertisements+0.742
Neg logit contributions
ctor-0.747
imates-0.670
 WARN-0.653
 Mehran-0.623
kel-0.618
Token end
Token ID886
Feature activation-7.79e-01

Pos logit contributions
gments+1.345
 synd+1.341
sts+1.234
��+1.202
vertisements+1.184
Neg logit contributions
ctor-1.823
imates-1.665
 WARN-1.628
 Mehran-1.573
kel-1.559
Token up
Token ID510
Feature activation+6.68e-01

Pos logit contributions
ctor+1.395
imates+1.241
 WARN+1.207
 Mehran+1.163
kel+1.152
Neg logit contributions
gments-1.574
 synd-1.567
sts-1.465
��-1.443
vertisements-1.424
Token in
Token ID287
Feature activation+3.08e-01

Pos logit contributions
gments+1.170
 synd+1.166
sts+1.079
��+1.053
vertisements+1.039
Neg logit contributions
ctor-1.386
imates-1.260
 WARN-1.229
 Mehran-1.186
kel-1.173
Token Abu
Token ID13098
Feature activation+4.98e-01

Pos logit contributions
gments+0.507
 synd+0.500
sts+0.465
��+0.456
vertisements+0.445
Neg logit contributions
ctor-0.673
imates-0.614
 WARN-0.600
 Mehran-0.582
kel-0.576
Token Dhabi
Token ID43941
Feature activation+3.12e+00

Pos logit contributions
gments+0.806
 synd+0.801
sts+0.737
��+0.720
vertisements+0.712
Neg logit contributions
ctor-1.102
imates-1.002
 WARN-0.981
 Mehran-0.951
kel-0.945
Token,
Token ID11
Feature activation+2.84e-01

Pos logit contributions
 synd+4.916
gments+4.835
sts+4.515
vertisements+4.314
gements+4.270
Neg logit contributions
ctor-6.775
imates-6.296
 WARN-6.125
kel-5.807
 Mehran-5.798
Token the
Token ID262
Feature activation+2.35e-01

Pos logit contributions
gments+0.517
 synd+0.515
sts+0.478
��+0.468
vertisements+0.461
Neg logit contributions
ctor-0.568
imates-0.514
 WARN-0.501
 Mehran-0.484
kel-0.478
Token last
Token ID938
Feature activation+5.96e-01

Pos logit contributions
gments+0.413
 synd+0.412
sts+0.381
��+0.374
vertisements+0.367
Neg logit contributions
ctor-0.486
imates-0.442
 WARN-0.431
 Mehran-0.416
kel-0.412
Token race
Token ID3234
Feature activation+3.35e-03

Pos logit contributions
 synd+1.088
gments+1.087
sts+1.008
��+0.980
vertisements+0.971
Neg logit contributions
ctor-1.187
imates-1.073
 WARN-1.049
 Mehran-1.005
kel-0.996
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
gments+0.006
 synd+0.006
sts+0.006
��+0.006
vertisements+0.005
Neg logit contributions
ctor-0.007
imates-0.006
 WARN-0.006
 Mehran-0.006
kel-0.006
Token
Token ID447
Feature activation-1.21e-01

Pos logit contributions
gments+0.080
 synd+0.080
sts+0.074
��+0.073
vertisements+0.072
Neg logit contributions
ctor-0.085
imates-0.076
 WARN-0.074
 Mehran-0.072
kel-0.071
Token
Token ID251
Feature activation+3.48e-01

Pos logit contributions
ctor+0.275
imates+0.252
 WARN+0.246
 Mehran+0.239
kel+0.237
Neg logit contributions
 synd-0.189
gments-0.188
sts-0.173
vertisements-0.165
��-0.165
Token
Token ID198
Feature activation-3.26e-01

Pos logit contributions
gments+0.629
 synd+0.627
sts+0.582
��+0.569
vertisements+0.561
Neg logit contributions
ctor-0.700
imates-0.634
 WARN-0.619
 Mehran-0.595
kel-0.589
Token
Token ID198
Feature activation+1.59e-01

Pos logit contributions
ctor+0.634
imates+0.572
 WARN+0.557
 Mehran+0.536
kel+0.531
Neg logit contributions
gments-0.612
 synd-0.610
sts-0.567
��-0.557
vertisements-0.550
TokenFR
Token ID10913
Feature activation+1.01e+00

Pos logit contributions
gments+0.303
 synd+0.303
sts+0.281
��+0.276
vertisements+0.273
Neg logit contributions
ctor-0.306
imates-0.275
 WARN-0.268
 Mehran-0.257
kel-0.256
TokenANK
Token ID15154
Feature activation+3.10e+00

Pos logit contributions
gments+2.243
 synd+2.242
sts+2.114
��+2.064
vertisements+2.043
Neg logit contributions
ctor-1.614
imates-1.427
 WARN-1.376
 Mehran-1.315
kel-1.296
Token M
Token ID337
Feature activation+1.29e+00

Pos logit contributions
 synd+6.235
gments+6.176
sts+5.917
��+5.653
gements+5.576
Neg logit contributions
ctor-5.275
imates-4.824
 WARN-4.446
 Mehran-4.361
kel-4.272
TokenORD
Token ID12532
Feature activation+8.53e-01

Pos logit contributions
gments+2.866
 synd+2.863
sts+2.708
��+2.647
vertisements+2.611
Neg logit contributions
ctor-2.028
imates-1.796
 WARN-1.728
 Mehran-1.663
kel-1.616
TokenICA
Token ID25241
Feature activation+9.60e-02

Pos logit contributions
gments+1.957
 synd+1.955
sts+1.844
��+1.811
vertisements+1.790
Neg logit contributions
ctor-1.302
imates-1.143
 WARN-1.101
 Mehran-1.046
kel-1.031
Token,
Token ID11
Feature activation+4.04e-01

Pos logit contributions
gments+0.183
 synd+0.182
sts+0.170
��+0.166
vertisements+0.164
Neg logit contributions
ctor-0.184
imates-0.166
 WARN-0.161
 Mehran-0.155
kel-0.154
Token V
Token ID569
Feature activation+2.55e-01

Pos logit contributions
gments+0.739
 synd+0.735
sts+0.684
��+0.669
vertisements+0.661
Neg logit contributions
ctor-0.807
imates-0.730
 WARN-0.713
 Mehran-0.686
kel-0.679
Token000
Token ID830
Feature activation+7.38e-01

Pos logit contributions
ctor+0.587
imates+0.522
 WARN+0.508
 Mehran+0.486
kel+0.480
Neg logit contributions
gments-0.706
 synd-0.705
sts-0.661
��-0.649
vertisements-0.643
Token students
Token ID2444
Feature activation-1.21e+00

Pos logit contributions
gments+1.281
 synd+1.280
sts+1.185
��+1.150
vertisements+1.138
Neg logit contributions
ctor-1.537
imates-1.396
 WARN-1.362
 Mehran-1.315
kel-1.299
Token.
Token ID13
Feature activation-4.32e-01

Pos logit contributions
ctor+2.416
imates+2.190
 WARN+2.136
 Mehran+2.052
kel+2.042
Neg logit contributions
gments-2.214
 synd-2.187
sts-2.040
��-2.016
vertisements-1.976
Token The
Token ID383
Feature activation-6.25e-01

Pos logit contributions
ctor+0.855
imates+0.771
 WARN+0.752
 Mehran+0.724
kel+0.718
Neg logit contributions
gments-0.799
 synd-0.797
sts-0.740
��-0.728
vertisements-0.714
Token Legislature
Token ID20815
Feature activation-1.57e+00

Pos logit contributions
ctor+1.248
imates+1.135
 WARN+1.108
 Mehran+1.065
kel+1.054
Neg logit contributions
gments-1.133
 synd-1.128
sts-1.051
��-1.030
vertisements-1.015
Token unc
Token ID4591
Feature activation-3.99e+00

Pos logit contributions
ctor+3.093
imates+2.792
 WARN+2.715
kel+2.598
 Mehran+2.596
Neg logit contributions
gments-2.915
 synd-2.885
sts-2.690
��-2.666
vertisements-2.620
Tokenapped
Token ID6320
Feature activation-7.18e-02

Pos logit contributions
ctor+6.748
imates+6.107
kel+5.672
 WARN+5.414
pei+5.401
Neg logit contributions
 synd-8.278
gments-7.873
��-7.707
sts-7.621
vertisements-7.247
Token enrollment
Token ID20753
Feature activation-1.28e+00

Pos logit contributions
ctor+0.133
imates+0.120
 WARN+0.117
 Mehran+0.112
kel+0.110
Neg logit contributions
gments-0.141
 synd-0.141
sts-0.131
��-0.128
vertisements-0.127
Token as
Token ID355
Feature activation+7.83e-02

Pos logit contributions
ctor+2.522
imates+2.304
 WARN+2.237
 Mehran+2.140
kel+2.132
Neg logit contributions
gments-2.333
 synd-2.307
sts-2.154
��-2.146
vertisements-2.086
Token part
Token ID636
Feature activation-1.13e+00

Pos logit contributions
gments+0.134
 synd+0.134
sts+0.124
��+0.120
vertisements+0.119
Neg logit contributions
ctor-0.165
imates-0.150
 WARN-0.147
 Mehran-0.142
kel-0.140
Token of
Token ID286
Feature activation-1.01e-01

Pos logit contributions
ctor+2.098
imates+1.871
 WARN+1.821
 Mehran+1.747
kel+1.747
Neg logit contributions
gments-2.230
 synd-2.200
��-2.062
sts-2.062
vertisements-2.040
Token Nigel
Token ID28772
Feature activation+1.30e+00

Pos logit contributions
ctor+0.579
imates+0.526
 WARN+0.515
 Mehran+0.499
kel+0.490
Neg logit contributions
gments-0.506
 synd-0.500
sts-0.463
��-0.457
vertisements-0.451
Token Farage
Token ID33904
Feature activation+1.10e+00

Pos logit contributions
gments+1.391
 synd+1.385
sts+1.217
��+1.174
vertisements+1.143
Neg logit contributions
ctor-3.594
imates-3.345
 WARN-3.289
 Mehran-3.201
kel-3.182
Token:
Token ID25
Feature activation-2.02e-01

Pos logit contributions
 synd+2.016
gments+1.991
sts+1.861
��+1.791
gements+1.772
Neg logit contributions
ctor-2.188
imates-1.978
 WARN-1.923
 Mehran-1.863
kel-1.837
Token "
Token ID366
Feature activation-1.62e+00

Pos logit contributions
ctor+0.417
imates+0.380
 WARN+0.372
 Mehran+0.358
kel+0.354
Neg logit contributions
gments-0.356
 synd-0.352
sts-0.326
��-0.320
vertisements-0.316
TokenLord
Token ID22438
Feature activation-3.52e-01

Pos logit contributions
ctor+2.693
imates+2.370
 WARN+2.293
kel+2.192
 Mehran+2.141
Neg logit contributions
gments-3.474
 synd-3.456
��-3.207
sts-3.206
vertisements-3.190
Token Hes
Token ID44773
Feature activation-3.71e+00

Pos logit contributions
ctor+0.725
imates+0.655
 WARN+0.640
kel+0.615
 Mehran+0.612
Neg logit contributions
gments-0.620
 synd-0.616
sts-0.580
��-0.563
vertisements-0.561
Tokenelt
Token ID2120
Feature activation-7.88e-02

Pos logit contributions
ctor+6.578
kel+5.830
imates+5.737
 WARN+5.513
itation+5.310
Neg logit contributions
 synd-7.250
gments-7.105
��-6.791
vertisements-6.503
sts-6.440
Tokenine
Token ID500
Feature activation+6.96e-01

Pos logit contributions
ctor+0.129
imates+0.116
 WARN+0.111
kel+0.106
 Mehran+0.104
Neg logit contributions
gments-0.172
 synd-0.171
sts-0.160
��-0.159
vertisements-0.157
Token fears
Token ID10251
Feature activation+5.06e-02

Pos logit contributions
 synd+1.264
gments+1.253
sts+1.168
��+1.133
gements+1.118
Neg logit contributions
ctor-1.394
imates-1.263
 WARN-1.226
 Mehran-1.193
kel-1.170
Token what
Token ID644
Feature activation-1.08e+00

Pos logit contributions
gments+0.089
 synd+0.089
sts+0.082
��+0.081
vertisements+0.080
Neg logit contributions
ctor-0.104
imates-0.095
 WARN-0.093
 Mehran-0.089
kel-0.088
Token the
Token ID262
Feature activation-4.81e-01

Pos logit contributions
ctor+2.130
imates+1.937
 WARN+1.894
 Mehran+1.824
kel+1.803
Neg logit contributions
gments-1.986
 synd-1.940
sts-1.830
��-1.793
vertisements-1.767
Token against
Token ID1028
Feature activation-1.43e-01

Pos logit contributions
gments+0.939
 synd+0.938
sts+0.864
��+0.840
vertisements+0.829
Neg logit contributions
ctor-1.230
imates-1.123
 WARN-1.097
 Mehran-1.059
kel-1.049
Token the
Token ID262
Feature activation-1.68e-01

Pos logit contributions
ctor+0.285
imates+0.258
 WARN+0.251
 Mehran+0.243
kel+0.240
Neg logit contributions
gments-0.261
 synd-0.259
sts-0.241
��-0.238
vertisements-0.234
Token chest
Token ID7721
Feature activation+5.10e-01

Pos logit contributions
ctor+0.313
imates+0.281
 WARN+0.274
 Mehran+0.263
kel+0.260
Neg logit contributions
gments-0.329
 synd-0.326
sts-0.305
��-0.300
vertisements-0.297
Token of
Token ID286
Feature activation+2.14e-02

Pos logit contributions
gments+0.922
 synd+0.920
sts+0.853
��+0.833
vertisements+0.823
Neg logit contributions
ctor-1.028
imates-0.932
 WARN-0.908
 Mehran-0.874
kel-0.866
Token whatever
Token ID4232
Feature activation+1.43e-01

Pos logit contributions
gments+0.038
 synd+0.038
sts+0.035
��+0.034
vertisements+0.034
Neg logit contributions
ctor-0.044
imates-0.040
 WARN-0.039
 Mehran-0.038
kel-0.037
Token unc
Token ID4591
Feature activation-3.68e+00

Pos logit contributions
gments+0.257
 synd+0.257
sts+0.238
��+0.232
vertisements+0.229
Neg logit contributions
ctor-0.289
imates-0.263
 WARN-0.256
 Mehran-0.247
kel-0.244
Tokenaring
Token ID1723
Feature activation+1.02e-01

Pos logit contributions
ctor+6.315
imates+5.641
 WARN+5.190
kel+5.126
pei+4.940
Neg logit contributions
 synd-7.534
gments-7.446
��-7.050
sts-6.937
vertisements-6.867
Token god
Token ID5770
Feature activation-1.34e-01

Pos logit contributions
gments+0.175
 synd+0.174
sts+0.161
��+0.157
vertisements+0.155
Neg logit contributions
ctor-0.214
imates-0.195
 WARN-0.190
 Mehran-0.183
kel-0.182
Token allowed
Token ID3142
Feature activation+1.85e-01

Pos logit contributions
ctor+0.270
imates+0.245
 WARN+0.238
 Mehran+0.229
kel+0.228
Neg logit contributions
gments-0.242
 synd-0.241
sts-0.224
��-0.219
vertisements-0.216
Token reality
Token ID3950
Feature activation+5.90e-01

Pos logit contributions
gments+0.326
 synd+0.325
sts+0.301
��+0.294
vertisements+0.290
Neg logit contributions
ctor-0.380
imates-0.345
 WARN-0.337
 Mehran-0.325
kel-0.322
Token to
Token ID284
Feature activation+3.28e-01

Pos logit contributions
gments+1.007
 synd+0.996
sts+0.918
��+0.908
vertisements+0.893
Neg logit contributions
ctor-1.255
imates-1.142
 WARN-1.113
 Mehran-1.067
kel-1.066
Token which
Token ID543
Feature activation+1.93e-01

Pos logit contributions
ctor+1.891
imates+1.703
 WARN+1.666
 Mehran+1.591
kel+1.575
Neg logit contributions
gments-1.896
 synd-1.863
sts-1.764
��-1.731
vertisements-1.702
Token distributions
Token ID24570
Feature activation-8.07e-01

Pos logit contributions
 synd+0.358
gments+0.358
sts+0.332
��+0.323
vertisements+0.320
Neg logit contributions
ctor-0.380
imates-0.343
 WARN-0.334
 Mehran-0.322
kel-0.318
Token support
Token ID1104
Feature activation-6.79e-01

Pos logit contributions
ctor+1.680
imates+1.531
 WARN+1.500
 Mehran+1.442
kel+1.426
Neg logit contributions
gments-1.402
 synd-1.383
sts-1.292
��-1.271
vertisements-1.248
Token which
Token ID543
Feature activation+2.67e-01

Pos logit contributions
ctor+1.290
imates+1.161
 WARN+1.132
 Mehran+1.088
kel+1.076
Neg logit contributions
gments-1.302
 synd-1.296
sts-1.211
��-1.189
vertisements-1.174
Token "
Token ID366
Feature activation-1.03e+00

Pos logit contributions
gments+0.485
 synd+0.485
sts+0.449
��+0.438
vertisements+0.433
Neg logit contributions
ctor-0.537
imates-0.487
 WARN-0.474
 Mehran-0.457
kel-0.452
Tokenuniversal
Token ID40082
Feature activation-3.65e+00

Pos logit contributions
ctor+1.610
imates+1.403
 WARN+1.346
kel+1.291
 Mehran+1.274
Neg logit contributions
 synd-2.310
gments-2.307
sts-2.158
��-2.146
vertisements-2.119
Token"
Token ID1
Feature activation+4.10e-02

Pos logit contributions
ctor+6.404
imates+5.941
 WARN+5.720
 Mehran+5.505
kel+5.454
Neg logit contributions
gments-6.905
 synd-6.596
��-6.391
sts-6.323
vertisements-6.188
Token package
Token ID5301
Feature activation+2.14e-01

Pos logit contributions
gments+0.075
 synd+0.075
sts+0.070
��+0.068
vertisements+0.067
Neg logit contributions
ctor-0.081
imates-0.074
 WARN-0.072
 Mehran-0.069
kel-0.068
Token format
Token ID5794
Feature activation+6.29e-01

Pos logit contributions
gments+0.345
 synd+0.341
sts+0.315
��+0.309
vertisements+0.303
Neg logit contributions
ctor-0.474
imates-0.434
 WARN-0.423
 Mehran-0.409
kel-0.406
Token.
Token ID13
Feature activation-6.95e-02

Pos logit contributions
gments+1.085
 synd+1.083
sts+1.001
��+0.975
vertisements+0.963
Neg logit contributions
ctor-1.321
imates-1.201
 WARN-1.173
 Mehran-1.131
kel-1.120
Token Snap
Token ID16026
Feature activation-1.32e+00

Pos logit contributions
ctor+0.135
imates+0.121
 WARN+0.118
 Mehran+0.114
kel+0.112
Neg logit contributions
gments-0.131
 synd-0.131
sts-0.122
��-0.119
vertisements-0.118
Token
Token ID447
Feature activation-1.25e-01

Pos logit contributions
ctor+0.598
imates+0.546
 WARN+0.531
 Mehran+0.517
kel+0.509
Neg logit contributions
gments-0.532
 synd-0.523
sts-0.486
��-0.483
vertisements-0.474
Token
Token ID251
Feature activation-4.04e-01

Pos logit contributions
ctor+0.280
imates+0.257
 WARN+0.250
 Mehran+0.243
kel+0.241
Neg logit contributions
 synd-0.196
gments-0.195
sts-0.179
vertisements-0.172
��-0.172
Token said
Token ID531
Feature activation+5.12e-01

Pos logit contributions
ctor+0.805
imates+0.730
 WARN+0.710
 Mehran+0.686
kel+0.682
Neg logit contributions
gments-0.737
 synd-0.731
sts-0.678
��-0.668
vertisements-0.657
Token He
Token ID679
Feature activation-9.89e-01

Pos logit contributions
 synd+0.930
gments+0.927
sts+0.860
��+0.838
vertisements+0.828
Neg logit contributions
ctor-1.027
imates-0.930
 WARN-0.906
 Mehran-0.868
kel-0.863
Token Sh
Token ID911
Feature activation+1.40e-01

Pos logit contributions
ctor+1.961
imates+1.747
 WARN+1.681
kel+1.664
 Mehran+1.637
Neg logit contributions
 synd-1.825
gments-1.804
��-1.696
sts-1.688
vertisements-1.631
Tokenunc
Token ID19524
Feature activation-3.62e+00

Pos logit contributions
 synd+0.241
gments+0.239
��+0.220
sts+0.220
vertisements+0.212
Neg logit contributions
ctor-0.296
imates-0.270
 WARN-0.260
kel-0.255
 Mehran-0.250
Tokenai
Token ID1872
Feature activation-8.33e-01

Pos logit contributions
ctor+6.803
imates+6.226
kel+6.223
pei+6.024
 WARN+5.866
Neg logit contributions
gments-6.347
 synd-6.318
��-5.936
sts-5.787
vertisements-5.741
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
ctor+1.671
imates+1.511
 WARN+1.467
 Mehran+1.425
kel+1.424
Neg logit contributions
gments-1.508
 synd-1.495
sts-1.395
��-1.370
vertisements-1.349
Token a
Token ID257
Feature activation-4.17e-01

Pos logit contributions
ctor+0.530
imates+0.481
 WARN+0.468
 Mehran+0.452
kel+0.448
Neg logit contributions
gments-0.462
 synd-0.459
sts-0.427
��-0.417
vertisements-0.411
Token 34
Token ID4974
Feature activation-1.04e-01

Pos logit contributions
ctor+0.834
imates+0.753
 WARN+0.736
 Mehran+0.713
kel+0.704
Neg logit contributions
gments-0.762
 synd-0.750
sts-0.702
��-0.690
vertisements-0.680
Token-
Token ID12
Feature activation-2.76e-02

Pos logit contributions
ctor+0.183
imates+0.162
 WARN+0.157
 Mehran+0.151
kel+0.150
Neg logit contributions
gments-0.216
 synd-0.213
sts-0.201
��-0.197
vertisements-0.196
Token,
Token ID11
Feature activation+3.09e-01

Pos logit contributions
gments+0.326
 synd+0.325
sts+0.302
��+0.295
vertisements+0.291
Neg logit contributions
ctor-0.363
imates-0.329
 WARN-0.321
 Mehran-0.308
kel-0.306
Token m
Token ID285
Feature activation+9.45e-01

Pos logit contributions
gments+0.571
 synd+0.570
sts+0.530
��+0.518
vertisements+0.511
Neg logit contributions
ctor-0.609
imates-0.551
 WARN-0.537
 Mehran-0.516
kel-0.510
TokenAU
Token ID26830
Feature activation-4.45e-01

Pos logit contributions
gments+1.811
 synd+1.800
sts+1.695
��+1.631
vertisements+1.628
Neg logit contributions
ctor-1.789
imates-1.610
 WARN-1.575
 Mehran-1.515
kel-1.473
Token,
Token ID11
Feature activation+1.51e-01

Pos logit contributions
ctor+0.889
imates+0.801
 WARN+0.781
kel+0.746
 Mehran+0.744
Neg logit contributions
gments-0.816
 synd-0.809
sts-0.754
��-0.740
vertisements-0.729
Token no
Token ID645
Feature activation+2.31e-01

Pos logit contributions
gments+0.275
 synd+0.274
sts+0.255
��+0.249
vertisements+0.246
Neg logit contributions
ctor-0.303
imates-0.274
 WARN-0.267
 Mehran-0.257
kel-0.255
Token ic
Token ID14158
Feature activation-3.49e+00

Pos logit contributions
gments+0.420
 synd+0.419
sts+0.389
��+0.380
vertisements+0.376
Neg logit contributions
ctor-0.461
imates-0.417
 WARN-0.407
 Mehran-0.392
kel-0.388
Tokenest
Token ID395
Feature activation-1.63e+00

Pos logit contributions
ctor+6.891
imates+6.213
kel+5.931
 WARN+5.882
pei+5.641
Neg logit contributions
gments-6.193
 synd-6.077
��-5.620
sts-5.538
gements-5.518
Token,
Token ID11
Feature activation-5.32e-01

Pos logit contributions
ctor+3.340
imates+3.082
 WARN+2.922
kel+2.823
 Mehran+2.793
Neg logit contributions
gments-2.865
 synd-2.834
sts-2.622
��-2.603
vertisements-2.544
Token The
Token ID383
Feature activation-2.33e-01

Pos logit contributions
ctor+1.049
imates+0.947
 WARN+0.924
 Mehran+0.886
kel+0.881
Neg logit contributions
gments-0.984
 synd-0.976
sts-0.909
��-0.895
vertisements-0.880
Token V
Token ID569
Feature activation-1.21e-01

Pos logit contributions
ctor+0.474
imates+0.429
 WARN+0.418
 Mehran+0.402
kel+0.400
Neg logit contributions
gments-0.419
 synd-0.416
sts-0.386
��-0.379
vertisements-0.374
Tokenow
Token ID322
Feature activation-6.48e-03

Pos logit contributions
ctor+0.229
imates+0.207
 WARN+0.200
 Mehran+0.192
kel+0.191
Neg logit contributions
gments-0.232
 synd-0.231
sts-0.215
��-0.211
vertisements-0.208
Token Profile
Token ID13118
Feature activation-5.11e-01

Pos logit contributions
ctor+3.698
imates+3.312
 WARN+3.206
kel+3.159
 Mehran+3.107
Neg logit contributions
gments-3.070
 synd-3.048
sts-2.806
��-2.765
gements-2.727
Token Blog
Token ID14001
Feature activation+3.10e-01

Pos logit contributions
ctor+0.851
imates+0.751
 WARN+0.732
 Mehran+0.702
kel+0.693
Neg logit contributions
gments-1.101
 synd-1.094
sts-1.032
��-1.012
gements-0.997
Token Joined
Token ID16709
Feature activation+6.49e-03

Pos logit contributions
gments+0.667
 synd+0.666
sts+0.625
��+0.612
vertisements+0.607
Neg logit contributions
ctor-0.518
imates-0.460
 WARN-0.446
 Mehran-0.424
kel-0.419
Token December
Token ID3426
Feature activation-9.74e-01

Pos logit contributions
gments+0.009
 synd+0.009
sts+0.008
��+0.008
vertisements+0.008
Neg logit contributions
ctor-0.016
imates-0.014
 WARN-0.014
 Mehran-0.014
kel-0.013
Token 2010
Token ID3050
Feature activation-2.44e-01

Pos logit contributions
ctor+1.555
imates+1.369
 WARN+1.339
 Mehran+1.277
kel+1.258
Neg logit contributions
gments-2.180
 synd-2.146
sts-2.039
��-2.021
vertisements-1.993
Token TL
Token ID24811
Feature activation-3.48e+00

Pos logit contributions
ctor+0.542
imates+0.495
 WARN+0.485
 Mehran+0.469
kel+0.465
Neg logit contributions
gments-0.390
 synd-0.388
sts-0.357
��-0.348
vertisements-0.342
TokenAD
Token ID2885
Feature activation-5.62e-01

Pos logit contributions
ctor+6.657
 WARN+5.949
imates+5.909
 Mehran+5.580
Http+5.536
Neg logit contributions
gments-6.401
 synd-6.044
sts-6.025
��-5.804
vertisements-5.762
TokenT
Token ID51
Feature activation+4.76e-02

Pos logit contributions
ctor+0.905
imates+0.798
 WARN+0.773
 Mehran+0.736
kel+0.726
Neg logit contributions
gments-1.242
 synd-1.241
sts-1.165
��-1.142
vertisements-1.133
Token 235
Token ID28878
Feature activation+6.24e-01

Pos logit contributions
gments+0.104
 synd+0.104
sts+0.097
��+0.095
vertisements+0.094
Neg logit contributions
ctor-0.078
imates-0.069
 WARN-0.067
 Mehran-0.064
kel-0.063
Token21
Token ID2481
Feature activation+2.61e-01

Pos logit contributions
 synd+1.382
gments+1.371
sts+1.291
��+1.259
vertisements+1.257
Neg logit contributions
ctor-0.998
imates-0.892
 WARN-0.851
 Mehran-0.814
kel-0.802
Token Posts
Token ID12043
Feature activation-2.21e-01

Pos logit contributions
 synd+0.530
gments+0.527
sts+0.495
��+0.480
vertisements+0.478
Neg logit contributions
ctor-0.466
imates-0.420
 WARN-0.404
 Mehran-0.388
kel-0.383
Token not
Token ID407
Feature activation+7.62e-01

Pos logit contributions
ctor+0.209
imates+0.190
 WARN+0.185
 Mehran+0.179
kel+0.177
Neg logit contributions
gments-0.186
 synd-0.184
sts-0.172
��-0.169
vertisements-0.166
Token at
Token ID379
Feature activation-9.69e-01

Pos logit contributions
gments+1.367
 synd+1.365
sts+1.265
��+1.235
vertisements+1.219
Neg logit contributions
ctor-1.547
imates-1.403
 WARN-1.368
 Mehran-1.317
kel-1.304
Token all
Token ID477
Feature activation-1.31e+00

Pos logit contributions
ctor+2.025
imates+1.855
 WARN+1.798
 Mehran+1.751
kel+1.732
Neg logit contributions
gments-1.661
 synd-1.645
sts-1.523
��-1.510
vertisements-1.470
Token unlikely
Token ID7485
Feature activation-6.94e-01

Pos logit contributions
ctor+2.481
imates+2.252
 WARN+2.182
 Mehran+2.105
kel+2.087
Neg logit contributions
gments-2.498
 synd-2.380
sts-2.298
��-2.280
vertisements-2.236
Token that
Token ID326
Feature activation-5.77e-01

Pos logit contributions
ctor+1.327
imates+1.196
 WARN+1.155
 Mehran+1.128
kel+1.109
Neg logit contributions
gments-1.326
 synd-1.300
sts-1.222
��-1.215
vertisements-1.188
Token Xer
Token ID44264
Feature activation-3.42e+00

Pos logit contributions
ctor+1.143
imates+1.040
 WARN+1.010
 Mehran+0.981
kel+0.968
Neg logit contributions
gments-1.061
 synd-1.040
sts-0.977
��-0.963
vertisements-0.942
Tokenxes
Token ID48169
Feature activation-7.06e-01

Pos logit contributions
ctor+5.578
imates+4.794
kel+4.530
pei+4.385
 WARN+4.320
Neg logit contributions
 synd-7.347
gments-7.197
��-7.155
vertisements-6.926
gements-6.656
Token,
Token ID11
Feature activation-3.46e-01

Pos logit contributions
ctor+1.403
imates+1.266
 WARN+1.236
 Mehran+1.187
kel+1.177
Neg logit contributions
gments-1.300
 synd-1.287
sts-1.202
��-1.180
vertisements-1.164
Token who
Token ID508
Feature activation-7.46e-01

Pos logit contributions
ctor+0.694
imates+0.629
 WARN+0.614
 Mehran+0.592
kel+0.586
Neg logit contributions
gments-0.630
 synd-0.621
sts-0.581
��-0.569
vertisements-0.562
Token had
Token ID550
Feature activation+2.09e-01

Pos logit contributions
ctor+1.491
imates+1.348
 WARN+1.313
 Mehran+1.264
kel+1.255
Neg logit contributions
gments-1.363
 synd-1.348
sts-1.261
��-1.234
vertisements-1.220
Token probably
Token ID2192
Feature activation+2.31e-01

Pos logit contributions
 synd+0.363
gments+0.363
sts+0.335
��+0.326
vertisements+0.322
Neg logit contributions
ctor-0.437
imates-0.398
 WARN-0.388
 Mehran-0.374
kel-0.370
Token that
Token ID326
Feature activation+1.32e-02

Pos logit contributions
ctor+0.158
imates+0.144
 WARN+0.140
 Mehran+0.135
kel+0.133
Neg logit contributions
gments-0.145
 synd-0.145
sts-0.134
��-0.131
vertisements-0.129
Token it
Token ID340
Feature activation+6.71e-02

Pos logit contributions
 synd+0.027
gments+0.027
sts+0.025
��+0.025
vertisements+0.024
Neg logit contributions
ctor-0.024
imates-0.021
 WARN-0.021
 Mehran-0.020
kel-0.019
Token is
Token ID318
Feature activation-9.97e-02

Pos logit contributions
gments+0.125
 synd+0.124
sts+0.116
��+0.113
vertisements+0.112
Neg logit contributions
ctor-0.132
imates-0.119
 WARN-0.116
 Mehran-0.112
kel-0.110
Token not
Token ID407
Feature activation+8.49e-01

Pos logit contributions
ctor+0.201
imates+0.182
 WARN+0.178
 Mehran+0.171
kel+0.169
Neg logit contributions
gments-0.181
 synd-0.180
sts-0.167
��-0.163
vertisements-0.161
Token as
Token ID355
Feature activation-1.72e-01

Pos logit contributions
 synd+1.568
gments+1.549
sts+1.439
��+1.401
vertisements+1.383
Neg logit contributions
ctor-1.673
imates-1.526
 WARN-1.476
 Mehran-1.433
kel-1.411
Token universal
Token ID10112
Feature activation-3.37e+00

Pos logit contributions
ctor+0.321
imates+0.288
 WARN+0.280
 Mehran+0.268
kel+0.266
Neg logit contributions
gments-0.336
 synd-0.334
sts-0.313
��-0.307
vertisements-0.303
Token of
Token ID286
Feature activation-2.24e-01

Pos logit contributions
ctor+6.014
imates+5.310
 WARN+5.106
kel+4.923
 Mehran+4.852
Neg logit contributions
gments-6.631
 synd-6.359
��-6.168
sts-6.105
vertisements-5.886
Token an
Token ID281
Feature activation-1.02e+00

Pos logit contributions
ctor+0.458
imates+0.415
 WARN+0.405
 Mehran+0.389
kel+0.386
Neg logit contributions
gments-0.398
 synd-0.396
sts-0.368
��-0.360
vertisements-0.355
Token I
Token ID314
Feature activation-4.00e-01

Pos logit contributions
ctor+1.693
imates+1.499
 WARN+1.451
 Mehran+1.369
kel+1.348
Neg logit contributions
gments-2.226
 synd-2.201
sts-2.092
��-2.051
vertisements-2.027
TokenPC
Token ID5662
Feature activation-1.15e-01

Pos logit contributions
ctor+0.801
imates+0.722
 WARN+0.702
 Mehran+0.677
kel+0.673
Neg logit contributions
 synd-0.730
gments-0.729
sts-0.675
��-0.663
vertisements-0.654
Token primitive
Token ID20049
Feature activation+5.29e-01

Pos logit contributions
ctor+0.229
imates+0.206
 WARN+0.201
 Mehran+0.193
kel+0.192
Neg logit contributions
gments-0.212
 synd-0.211
sts-0.196
��-0.193
vertisements-0.190
Token State
Token ID1812
Feature activation+2.31e-01

Pos logit contributions
gments+0.168
 synd+0.166
sts+0.154
��+0.151
vertisements+0.149
Neg logit contributions
ctor-0.206
imates-0.188
 WARN-0.184
 Mehran-0.176
kel-0.175
Token of
Token ID286
Feature activation+7.69e-01

Pos logit contributions
gments+0.405
 synd+0.404
sts+0.374
��+0.365
vertisements+0.360
Neg logit contributions
ctor-0.477
imates-0.434
 WARN-0.423
 Mehran-0.408
kel-0.404
Token Oregon
Token ID8819
Feature activation-8.83e-01

Pos logit contributions
gments+1.324
 synd+1.320
sts+1.220
��+1.190
vertisements+1.175
Neg logit contributions
ctor-1.617
imates-1.472
 WARN-1.438
 Mehran-1.386
kel-1.373
Token,
Token ID11
Feature activation+1.04e-01

Pos logit contributions
ctor+1.752
imates+1.591
 WARN+1.560
 Mehran+1.484
kel+1.471
Neg logit contributions
gments-1.624
 synd-1.597
sts-1.496
��-1.471
vertisements-1.453
Token the
Token ID262
Feature activation+9.62e-02

Pos logit contributions
gments+0.192
 synd+0.189
sts+0.177
��+0.173
vertisements+0.171
Neg logit contributions
ctor-0.206
imates-0.187
 WARN-0.183
 Mehran-0.175
kel-0.174
Token IRS
Token ID14440
Feature activation-3.36e+00

Pos logit contributions
gments+0.170
 synd+0.169
sts+0.157
��+0.154
vertisements+0.152
Neg logit contributions
ctor-0.198
imates-0.179
 WARN-0.176
 Mehran-0.169
kel-0.167
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
ctor+6.484
imates+5.719
 WARN+5.713
kel+5.341
 Mehran+5.324
Neg logit contributions
gments-6.061
 synd-5.834
sts-5.590
��-5.551
gements-5.503
Token and
Token ID290
Feature activation-1.57e-01

Pos logit contributions
ctor+0.233
imates+0.213
 WARN+0.209
 Mehran+0.200
kel+0.198
Neg logit contributions
gments-0.200
 synd-0.197
sts-0.184
��-0.181
vertisements-0.178
Token a
Token ID257
Feature activation-1.41e-01

Pos logit contributions
ctor+0.317
imates+0.289
 WARN+0.284
 Mehran+0.271
kel+0.269
Neg logit contributions
gments-0.284
 synd-0.280
sts-0.262
��-0.256
vertisements-0.253
Token charitable
Token ID21803
Feature activation+3.00e-02

Pos logit contributions
ctor+0.290
imates+0.263
 WARN+0.260
 Mehran+0.248
kel+0.247
Neg logit contributions
gments-0.250
 synd-0.245
sts-0.230
��-0.226
vertisements-0.224
Token foundation
Token ID8489
Feature activation+3.48e-01

Pos logit contributions
gments+0.051
 synd+0.050
sts+0.047
��+0.045
vertisements+0.045
Neg logit contributions
ctor-0.064
imates-0.058
 WARN-0.057
 Mehran-0.055
kel-0.055
Token,
Token ID11
Feature activation-1.72e-01

Pos logit contributions
gments+0.380
 synd+0.379
sts+0.352
��+0.346
vertisements+0.341
Neg logit contributions
ctor-0.417
imates-0.377
 WARN-0.366
 Mehran-0.354
kel-0.352
Token '
Token ID705
Feature activation-3.54e-01

Pos logit contributions
ctor+0.344
imates+0.313
 WARN+0.305
 Mehran+0.295
kel+0.292
Neg logit contributions
gments-0.314
 synd-0.308
sts-0.290
��-0.285
gements-0.280
TokenThe
Token ID464
Feature activation-6.31e-01

Pos logit contributions
ctor+0.613
imates+0.547
 WARN+0.530
 Mehran+0.506
kel+0.503
Neg logit contributions
gments-0.738
 synd-0.735
sts-0.689
��-0.679
vertisements-0.669
Token Brotherhood
Token ID17987
Feature activation-1.56e-01

Pos logit contributions
ctor+1.314
imates+1.191
 WARN+1.174
 Mehran+1.136
kel+1.119
Neg logit contributions
gments-1.092
 synd-1.071
sts-1.001
��-0.987
gements-0.966
Token of
Token ID286
Feature activation-2.72e-02

Pos logit contributions
ctor+0.304
imates+0.275
 WARN+0.268
 Mehran+0.257
kel+0.256
Neg logit contributions
gments-0.292
 synd-0.289
sts-0.270
��-0.267
vertisements-0.261
Token Eternal
Token ID21475
Feature activation-3.36e+00

Pos logit contributions
ctor+0.056
imates+0.051
 WARN+0.049
 Mehran+0.048
kel+0.047
Neg logit contributions
gments-0.049
 synd-0.048
sts-0.045
��-0.044
gements-0.043
Token Love
Token ID5896
Feature activation-4.87e-01

Pos logit contributions
ctor+6.621
imates+6.128
 WARN+6.059
 Mehran+5.878
kel+5.824
Neg logit contributions
gments-5.664
 synd-5.508
sts-5.237
��-5.183
gements-4.966
Token:
Token ID25
Feature activation-8.05e-01

Pos logit contributions
ctor+0.988
imates+0.895
 WARN+0.870
 Mehran+0.839
kel+0.834
Neg logit contributions
gments-0.874
 synd-0.864
sts-0.805
��-0.794
vertisements-0.778
Token From
Token ID3574
Feature activation-4.24e-01

Pos logit contributions
ctor+1.594
imates+1.441
 WARN+1.415
 Mehran+1.380
kel+1.354
Neg logit contributions
gments-1.479
 synd-1.450
sts-1.358
��-1.340
gements-1.315
Token Flower
Token ID20025
Feature activation-1.55e-01

Pos logit contributions
ctor+0.867
imates+0.787
 WARN+0.773
 Mehran+0.747
kel+0.736
Neg logit contributions
gments-0.751
 synd-0.740
sts-0.688
��-0.677
gements-0.664
Token Power
Token ID4333
Feature activation+8.03e-01

Pos logit contributions
ctor+0.324
imates+0.297
 WARN+0.289
 Mehran+0.279
kel+0.278
Neg logit contributions
gments-0.269
 synd-0.265
sts-0.244
��-0.242
gements-0.237
Token good
Token ID922
Feature activation-1.13e+00

Pos logit contributions
ctor+0.793
imates+0.713
 WARN+0.702
 Mehran+0.675
kel+0.671
Neg logit contributions
gments-0.687
 synd-0.679
sts-0.633
��-0.621
vertisements-0.614
Token New
Token ID968
Feature activation+7.58e-01

Pos logit contributions
ctor+2.264
imates+2.022
 WARN+1.985
 Mehran+1.902
kel+1.892
Neg logit contributions
gments-2.064
 synd-2.023
sts-1.903
��-1.866
vertisements-1.847
Token Jersey
Token ID8221
Feature activation+5.24e-01

Pos logit contributions
gments+1.070
 synd+1.066
sts+0.968
��+0.938
vertisements+0.923
Neg logit contributions
ctor-1.825
imates-1.683
 WARN-1.649
 Mehran-1.598
kel-1.584
Token boy
Token ID2933
Feature activation-4.34e-02

Pos logit contributions
gments+0.951
 synd+0.949
sts+0.881
��+0.860
vertisements+0.849
Neg logit contributions
ctor-1.051
imates-0.952
 WARN-0.928
 Mehran-0.893
kel-0.884
Token,
Token ID11
Feature activation-2.46e-01

Pos logit contributions
ctor+0.091
imates+0.083
 WARN+0.081
 Mehran+0.078
kel+0.078
Neg logit contributions
gments-0.075
 synd-0.074
sts-0.069
��-0.067
vertisements-0.066
Token DeV
Token ID29619
Feature activation-3.35e+00

Pos logit contributions
ctor+0.500
imates+0.452
 WARN+0.443
 Mehran+0.426
kel+0.422
Neg logit contributions
gments-0.441
 synd-0.438
sts-0.407
��-0.399
vertisements-0.393
Tokenito
Token ID10094
Feature activation-1.07e+00

Pos logit contributions
ctor+6.302
imates+5.626
kel+5.349
 WARN+5.323
pei+5.167
Neg logit contributions
gments-6.296
 synd-6.235
��-5.863
sts-5.832
gements-5.692
Token pronoun
Token ID28636
Feature activation+4.12e-01

Pos logit contributions
ctor+2.139
imates+1.914
 WARN+1.873
 Mehran+1.796
kel+1.790
Neg logit contributions
gments-1.973
 synd-1.952
sts-1.824
��-1.797
vertisements-1.770
Tokences
Token ID728
Feature activation-4.55e-02

Pos logit contributions
gments+1.146
 synd+1.143
sts+1.089
��+1.077
vertisements+1.067
Neg logit contributions
ctor-0.430
imates-0.351
 WARN-0.328
 Mehran-0.302
kel-0.298
Token it
Token ID340
Feature activation+5.86e-01

Pos logit contributions
ctor+0.095
imates+0.086
 WARN+0.084
 Mehran+0.081
kel+0.080
Neg logit contributions
gments-0.079
 synd-0.079
sts-0.073
��-0.071
vertisements-0.070
Token
Token ID564
Feature activation+1.88e-01

Pos logit contributions
gments+1.009
 synd+1.007
sts+0.930
��+0.907
vertisements+0.895
Neg logit contributions
ctor-1.233
imates-1.122
 WARN-1.095
 Mehran-1.056
kel-1.046
Token the
Token ID262
Feature activation-3.41e-01

Pos logit contributions
ctor+0.569
imates+0.514
 WARN+0.501
 Mehran+0.482
kel+0.477
Neg logit contributions
gments-0.534
 synd-0.533
sts-0.495
��-0.483
vertisements-0.478
Token Italian
Token ID8200
Feature activation-1.42e+00

Pos logit contributions
ctor+0.687
imates+0.624
 WARN+0.609
 Mehran+0.587
kel+0.578
Neg logit contributions
gments-0.615
 synd-0.607
sts-0.567
��-0.559
vertisements-0.549
Token Army
Token ID5407
Feature activation-2.19e+00

Pos logit contributions
ctor+2.787
imates+2.537
 WARN+2.459
 Mehran+2.381
kel+2.344
Neg logit contributions
gments-2.602
 synd-2.508
sts-2.390
��-2.378
vertisements-2.321
Token had
Token ID550
Feature activation-1.30e-03

Pos logit contributions
ctor+4.177
imates+3.780
 WARN+3.664
 Mehran+3.532
kel+3.454
Neg logit contributions
gments-4.111
 synd-3.997
sts-3.793
��-3.741
vertisements-3.709
Token been
Token ID587
Feature activation-2.28e-01

Pos logit contributions
ctor+0.003
imates+0.002
 WARN+0.002
 Mehran+0.002
kel+0.002
Neg logit contributions
gments-0.002
 synd-0.002
sts-0.002
��-0.002
vertisements-0.002
Token intern
Token ID1788
Feature activation-3.34e+00

Pos logit contributions
ctor+0.453
imates+0.409
 WARN+0.401
 Mehran+0.384
kel+0.380
Neg logit contributions
gments-0.418
 synd-0.412
sts-0.386
��-0.379
vertisements-0.374
Tokened
Token ID276
Feature activation-7.70e-01

Pos logit contributions
ctor+6.381
imates+5.899
 WARN+5.501
kel+5.265
oped+5.229
Neg logit contributions
gments-6.096
 synd-5.942
��-5.703
sts-5.592
vertisements-5.449
Token by
Token ID416
Feature activation+5.01e-01

Pos logit contributions
ctor+1.552
imates+1.414
 WARN+1.368
 Mehran+1.318
kel+1.307
Neg logit contributions
gments-1.393
 synd-1.360
sts-1.276
��-1.264
vertisements-1.238
Token German
Token ID2679
Feature activation-4.47e-01

Pos logit contributions
gments+0.888
 synd+0.885
sts+0.820
��+0.801
vertisements+0.791
Neg logit contributions
ctor-1.026
imates-0.932
 WARN-0.909
 Mehran-0.876
kel-0.867
Token forces
Token ID3386
Feature activation-3.02e-01

Pos logit contributions
ctor+0.896
imates+0.816
 WARN+0.801
 Mehran+0.765
kel+0.764
Neg logit contributions
gments-0.812
 synd-0.788
sts-0.741
��-0.732
vertisements-0.721
Token in
Token ID287
Feature activation+5.25e-02

Pos logit contributions
ctor+0.634
imates+0.579
 WARN+0.563
 Mehran+0.544
kel+0.539
Neg logit contributions
gments-0.521
 synd-0.514
sts-0.477
��-0.470
vertisements-0.461
Token others
Token ID1854
Feature activation+4.85e-01

Pos logit contributions
gments+0.163
 synd+0.163
sts+0.151
��+0.147
vertisements+0.145
Neg logit contributions
ctor-0.186
imates-0.169
 WARN-0.165
 Mehran-0.159
kel-0.157
Token who
Token ID508
Feature activation+1.52e-02

Pos logit contributions
 synd+0.930
gments+0.924
sts+0.865
��+0.839
vertisements+0.831
Neg logit contributions
ctor-0.925
imates-0.832
 WARN-0.809
 Mehran-0.781
kel-0.769
Token think
Token ID892
Feature activation+7.69e-01

Pos logit contributions
gments+0.026
 synd+0.026
sts+0.024
��+0.023
vertisements+0.023
Neg logit contributions
ctor-0.032
imates-0.029
 WARN-0.028
 Mehran-0.027
kel-0.027
Token that
Token ID326
Feature activation+3.34e-02

Pos logit contributions
 synd+1.395
gments+1.386
sts+1.293
��+1.254
vertisements+1.241
Neg logit contributions
ctor-1.542
imates-1.397
 WARN-1.362
 Mehran-1.316
kel-1.295
Token Lord
Token ID4453
Feature activation-3.72e-01

Pos logit contributions
gments+0.059
 synd+0.059
sts+0.055
��+0.054
vertisements+0.053
Neg logit contributions
ctor-0.068
imates-0.062
 WARN-0.060
 Mehran-0.058
kel-0.058
Token Hes
Token ID44773
Feature activation-3.29e+00

Pos logit contributions
ctor+0.564
imates+0.493
 WARN+0.477
 Mehran+0.451
kel+0.447
Neg logit contributions
gments-0.858
 synd-0.855
sts-0.809
��-0.796
vertisements-0.788
Tokenelt
Token ID2120
Feature activation-5.72e-02

Pos logit contributions
ctor+5.564
kel+4.879
imates+4.815
 WARN+4.556
pei+4.449
Neg logit contributions
 synd-6.900
gments-6.662
��-6.588
vertisements-6.411
sts-6.135
Tokenine
Token ID500
Feature activation+5.83e-01

Pos logit contributions
ctor+0.099
imates+0.089
 WARN+0.086
 Mehran+0.082
kel+0.081
Neg logit contributions
gments-0.119
 synd-0.119
sts-0.111
��-0.109
vertisements-0.108
Token has
Token ID468
Feature activation-5.75e-02

Pos logit contributions
 synd+1.038
gments+1.029
sts+0.956
��+0.925
gements+0.913
Neg logit contributions
ctor-1.188
imates-1.079
 WARN-1.055
 Mehran-1.023
kel-1.004
Token a
Token ID257
Feature activation+6.56e-02

Pos logit contributions
ctor+0.118
imates+0.108
 WARN+0.105
 Mehran+0.101
kel+0.100
Neg logit contributions
gments-0.102
 synd-0.100
sts-0.094
��-0.091
vertisements-0.091
Token point
Token ID966
Feature activation+3.38e-01

Pos logit contributions
gments+0.120
 synd+0.119
sts+0.111
��+0.108
vertisements+0.107
Neg logit contributions
ctor-0.132
imates-0.119
 WARN-0.116
 Mehran-0.111
kel-0.111
Token'
Token ID6
Feature activation-3.04e-01

Pos logit contributions
gments+1.020
 synd+1.017
sts+0.945
��+0.923
vertisements+0.911
Neg logit contributions
ctor-1.123
imates-1.017
 WARN-0.991
 Mehran-0.954
kel-0.944
Token
Token ID198
Feature activation+1.30e-01

Pos logit contributions
ctor+0.572
imates+0.512
 WARN+0.502
 Mehran+0.480
kel+0.477
Neg logit contributions
gments-0.593
 synd-0.584
sts-0.547
��-0.545
vertisements-0.537
Token
Token ID198
Feature activation+2.53e-01

Pos logit contributions
 synd+0.237
gments+0.236
sts+0.221
��+0.211
gements+0.210
Neg logit contributions
ctor-0.260
imates-0.237
 WARN-0.231
 Mehran-0.222
kel-0.218
TokenWhile
Token ID3633
Feature activation-3.83e-01

Pos logit contributions
gments+0.471
 synd+0.469
sts+0.439
��+0.425
gements+0.419
Neg logit contributions
ctor-0.495
imates-0.449
 WARN-0.438
 Mehran-0.422
kel-0.414
Token Lord
Token ID4453
Feature activation+5.26e-02

Pos logit contributions
ctor+0.784
imates+0.713
 WARN+0.697
 Mehran+0.669
kel+0.663
Neg logit contributions
gments-0.681
 synd-0.671
sts-0.624
��-0.615
vertisements-0.605
Token Hes
Token ID44773
Feature activation-3.28e+00

Pos logit contributions
gments+0.120
 synd+0.120
sts+0.113
��+0.111
vertisements+0.110
Neg logit contributions
ctor-0.081
imates-0.071
 WARN-0.069
 Mehran-0.065
kel-0.064
Tokenelt
Token ID2120
Feature activation+2.21e-01

Pos logit contributions
ctor+5.503
kel+4.842
imates+4.783
 WARN+4.485
pei+4.359
Neg logit contributions
 synd-6.893
gments-6.681
��-6.614
vertisements-6.440
sts-6.149
Tokenine
Token ID500
Feature activation+8.03e-01

Pos logit contributions
gments+0.455
 synd+0.455
sts+0.426
��+0.416
vertisements+0.411
Neg logit contributions
ctor-0.387
imates-0.345
 WARN-0.337
 Mehran-0.322
kel-0.315
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
 synd+1.434
gments+1.423
sts+1.325
��+1.276
gements+1.264
Neg logit contributions
ctor-1.622
imates-1.478
 WARN-1.444
 Mehran-1.404
kel-1.372
Token a
Token ID257
Feature activation-2.75e-01

Pos logit contributions
ctor+0.216
imates+0.196
 WARN+0.191
 Mehran+0.184
kel+0.182
Neg logit contributions
gments-0.194
 synd-0.193
sts-0.179
��-0.175
vertisements-0.173
Token former
Token ID1966
Feature activation+8.14e-02

Pos logit contributions
ctor+0.581
imates+0.525
 WARN+0.516
 Mehran+0.498
kel+0.494
Neg logit contributions
gments-0.470
 synd-0.461
sts-0.433
��-0.423
vertisements-0.419
Token of
Token ID286
Feature activation-1.09e-02

Pos logit contributions
 synd+1.392
gments+1.390
sts+1.290
��+1.259
vertisements+1.246
Neg logit contributions
ctor-1.485
imates-1.345
 WARN-1.310
 Mehran-1.261
kel-1.244
Token teeth
Token ID9941
Feature activation-1.59e+00

Pos logit contributions
ctor+0.023
imates+0.021
 WARN+0.020
 Mehran+0.020
kel+0.019
Neg logit contributions
gments-0.019
 synd-0.019
sts-0.017
��-0.017
vertisements-0.017
Token from
Token ID422
Feature activation+3.33e-01

Pos logit contributions
ctor+3.125
imates+2.807
 WARN+2.739
 Mehran+2.645
kel+2.639
Neg logit contributions
gments-2.916
 synd-2.876
sts-2.693
��-2.690
gements-2.615
Token a
Token ID257
Feature activation-6.01e-02

Pos logit contributions
gments+0.581
 synd+0.580
sts+0.536
��+0.523
vertisements+0.516
Neg logit contributions
ctor-0.691
imates-0.629
 WARN-0.613
 Mehran-0.591
kel-0.585
Token tr
Token ID491
Feature activation-1.28e-01

Pos logit contributions
ctor+0.133
imates+0.122
 WARN+0.120
 Mehran+0.116
kel+0.115
Neg logit contributions
gments-0.096
 synd-0.095
sts-0.088
��-0.086
vertisements-0.085
Tokenicer
Token ID16647
Feature activation-3.27e+00

Pos logit contributions
ctor+0.213
imates+0.190
 WARN+0.182
 Mehran+0.174
kel+0.173
Neg logit contributions
 synd-0.278
gments-0.277
sts-0.260
��-0.258
vertisements-0.253
Tokenat
Token ID265
Feature activation-2.48e-01

Pos logit contributions
ctor+5.253
imates+4.561
kel+4.366
 WARN+4.292
 Mehran+4.069
Neg logit contributions
gments-7.276
 synd-6.997
sts-6.801
gements-6.623
vertisements-6.588
Tokenops
Token ID2840
Feature activation-6.37e-01

Pos logit contributions
ctor+0.360
imates+0.309
 WARN+0.297
 Mehran+0.281
kel+0.281
Neg logit contributions
gments-0.589
 synd-0.588
sts-0.554
��-0.551
vertisements-0.542
Token.
Token ID13
Feature activation-5.76e-02

Pos logit contributions
ctor+1.289
imates+1.166
 WARN+1.139
 Mehran+1.093
kel+1.089
Neg logit contributions
gments-1.149
 synd-1.131
sts-1.061
��-1.044
gements-1.027
Token (
Token ID357
Feature activation+3.83e-01

Pos logit contributions
ctor+0.114
imates+0.103
 WARN+0.100
 Mehran+0.097
kel+0.096
Neg logit contributions
gments-0.106
 synd-0.106
sts-0.098
��-0.097
vertisements-0.095
TokenFS
Token ID10652
Feature activation-5.57e-01

Pos logit contributions
gments+0.784
 synd+0.784
sts+0.735
��+0.717
vertisements+0.710
Neg logit contributions
ctor-0.679
imates-0.607
 WARN-0.589
 Mehran-0.563
kel-0.556
Token clay
Token ID21558
Feature activation-7.59e-01

Pos logit contributions
gments+0.148
 synd+0.148
sts+0.137
��+0.134
vertisements+0.132
Neg logit contributions
ctor-0.163
imates-0.147
 WARN-0.144
 Mehran-0.138
kel-0.137
Token,
Token ID11
Feature activation-1.83e-01

Pos logit contributions
ctor+1.524
imates+1.384
 WARN+1.348
 Mehran+1.299
kel+1.289
Neg logit contributions
gments-1.373
 synd-1.365
sts-1.267
��-1.248
vertisements-1.222
Token survives
Token ID36417
Feature activation-1.09e+00

Pos logit contributions
ctor+0.375
imates+0.341
 WARN+0.332
 Mehran+0.320
kel+0.317
Neg logit contributions
gments-0.324
 synd-0.322
sts-0.299
��-0.292
vertisements-0.288
Token from
Token ID422
Feature activation+3.29e-01

Pos logit contributions
ctor+2.140
imates+1.932
 WARN+1.871
 Mehran+1.826
kel+1.796
Neg logit contributions
gments-2.031
 synd-1.997
sts-1.876
��-1.861
vertisements-1.805
Token approximately
Token ID6702
Feature activation+4.11e-01

Pos logit contributions
gments+0.593
 synd+0.593
sts+0.549
��+0.535
vertisements+0.529
Neg logit contributions
ctor-0.666
imates-0.603
 WARN-0.589
 Mehran-0.565
kel-0.560
Token Xer
Token ID44264
Feature activation-3.20e+00

Pos logit contributions
 synd+0.858
gments+0.857
sts+0.803
��+0.782
vertisements+0.775
Neg logit contributions
ctor-0.712
imates-0.634
 WARN-0.615
 Mehran-0.586
kel-0.579
Tokenxes
Token ID48169
Feature activation-4.37e-01

Pos logit contributions
ctor+5.396
imates+4.693
kel+4.381
 WARN+4.313
pei+4.300
Neg logit contributions
 synd-6.727
gments-6.586
��-6.454
vertisements-6.193
sts-6.127
Token
Token ID447
Feature activation-1.65e-01

Pos logit contributions
ctor+0.860
imates+0.777
 WARN+0.757
 Mehran+0.728
kel+0.721
Neg logit contributions
gments-0.810
 synd-0.807
sts-0.751
��-0.735
vertisements-0.726
Token
Token ID247
Feature activation-5.71e-01

Pos logit contributions
ctor+0.344
imates+0.313
 WARN+0.304
 Mehran+0.294
kel+0.292
Neg logit contributions
 synd-0.285
gments-0.285
sts-0.263
vertisements-0.254
��-0.253
Token era
Token ID6980
Feature activation-6.56e-01

Pos logit contributions
ctor+1.024
imates+0.916
 WARN+0.887
 Mehran+0.851
kel+0.845
Neg logit contributions
gments-1.159
 synd-1.154
sts-1.079
��-1.062
vertisements-1.050
Token;
Token ID26
Feature activation+4.93e-01

Pos logit contributions
ctor+1.334
imates+1.213
 WARN+1.179
 Mehran+1.141
kel+1.131
Neg logit contributions
gments-1.172
 synd-1.161
sts-1.081
��-1.064
vertisements-1.042
Tokenessential
Token ID31195
Feature activation+1.50e-01

Pos logit contributions
ctor+2.882
imates+2.563
 WARN+2.477
kel+2.351
 Mehran+2.346
Neg logit contributions
gments-3.408
 synd-3.349
sts-3.196
��-3.135
vertisements-3.124
Token Czech
Token ID16639
Feature activation-1.12e+00

Pos logit contributions
gments+0.251
 synd+0.250
sts+0.231
��+0.225
vertisements+0.222
Neg logit contributions
ctor-0.323
imates-0.294
 WARN-0.287
 Mehran-0.278
kel-0.275
Token hero
Token ID4293
Feature activation+3.36e-01

Pos logit contributions
ctor+2.350
imates+2.128
 WARN+2.072
kel+2.006
 Mehran+2.000
Neg logit contributions
gments-1.966
 synd-1.887
sts-1.763
��-1.751
gements-1.733
Token,
Token ID11
Feature activation+7.17e-02

Pos logit contributions
 synd+0.583
gments+0.581
sts+0.540
��+0.523
vertisements+0.518
Neg logit contributions
ctor-0.700
imates-0.636
 WARN-0.623
 Mehran-0.600
kel-0.593
Token a
Token ID257
Feature activation-2.09e-01

Pos logit contributions
gments+0.125
 synd+0.125
sts+0.116
��+0.113
vertisements+0.111
Neg logit contributions
ctor-0.149
imates-0.135
 WARN-0.132
 Mehran-0.127
kel-0.126
Token Zel
Token ID40071
Feature activation-3.20e+00

Pos logit contributions
ctor+0.437
imates+0.396
 WARN+0.388
 Mehran+0.375
kel+0.371
Neg logit contributions
gments-0.362
 synd-0.356
sts-0.331
��-0.325
vertisements-0.321
Tokenig
Token ID328
Feature activation+1.25e-01

Pos logit contributions
ctor+5.971
imates+5.331
kel+5.173
 WARN+5.058
pei+4.922
Neg logit contributions
 synd-5.994
gments-5.922
��-5.617
sts-5.457
vertisements-5.355
Token who
Token ID508
Feature activation-4.16e-01

Pos logit contributions
gments+0.226
 synd+0.225
sts+0.209
��+0.205
vertisements+0.202
Neg logit contributions
ctor-0.252
imates-0.228
 WARN-0.223
 Mehran-0.214
kel-0.213
Token trek
Token ID30784
Feature activation-1.13e+00

Pos logit contributions
ctor+0.853
imates+0.774
 WARN+0.755
 Mehran+0.727
kel+0.720
Neg logit contributions
gments-0.738
 synd-0.734
sts-0.682
��-0.665
vertisements-0.657
Tokenked
Token ID9091
Feature activation-8.76e-01

Pos logit contributions
ctor+1.776
imates+1.568
 WARN+1.488
 Mehran+1.422
kel+1.422
Neg logit contributions
gments-2.551
 synd-2.551
sts-2.385
��-2.379
vertisements-2.331
Token to
Token ID284
Feature activation-3.08e-01

Pos logit contributions
ctor+1.751
imates+1.594
 WARN+1.550
 Mehran+1.500
kel+1.478
Neg logit contributions
gments-1.600
 synd-1.566
sts-1.466
��-1.444
vertisements-1.420
Token Vegas
Token ID9621
Feature activation+1.64e+00

Pos logit contributions
gments+0.366
 synd+0.366
sts+0.325
��+0.319
vertisements+0.312
Neg logit contributions
ctor-0.744
imates-0.684
 WARN-0.673
kel-0.658
 Mehran-0.656
Token.
Token ID13
Feature activation-1.28e-01

Pos logit contributions
 synd+2.902
gments+2.880
sts+2.664
��+2.588
vertisements+2.561
Neg logit contributions
ctor-3.398
imates-3.078
 WARN-3.001
 Mehran-2.897
kel-2.851
Token
Token ID851
Feature activation+9.18e-01

Pos logit contributions
ctor+0.258
imates+0.233
 WARN+0.227
kel+0.219
 Mehran+0.217
Neg logit contributions
gments-0.233
 synd-0.231
sts-0.214
��-0.212
gements-0.208
Token Ren
Token ID7152
Feature activation-2.86e-01

Pos logit contributions
 synd+1.273
gments+1.272
sts+1.149
��+1.113
vertisements+1.096
Neg logit contributions
ctor-2.234
imates-2.060
 WARN-2.019
 Mehran-1.955
kel-1.941
Tokenaud
Token ID3885
Feature activation+3.83e-01

Pos logit contributions
ctor+0.625
imates+0.563
 WARN+0.551
kel+0.534
 Mehran+0.532
Neg logit contributions
gments-0.473
 synd-0.472
sts-0.433
��-0.428
vertisements-0.420
Token Lav
Token ID21438
Feature activation-3.19e+00

Pos logit contributions
gments+0.667
 synd+0.666
sts+0.615
��+0.602
vertisements+0.600
Neg logit contributions
ctor-0.797
imates-0.720
 WARN-0.706
 Mehran-0.682
kel-0.675
Tokeno
Token ID78
Feature activation-2.51e-01

Pos logit contributions
ctor+6.536
imates+5.958
 WARN+5.743
pei+5.668
kel+5.631
Neg logit contributions
gments-5.280
 synd-5.236
��-5.110
sts-4.783
vertisements-4.754
Tokenie
Token ID494
Feature activation+6.76e-01

Pos logit contributions
ctor+0.477
imates+0.429
 WARN+0.417
 Mehran+0.398
kel+0.397
Neg logit contributions
gments-0.486
 synd-0.482
sts-0.452
��-0.438
gements-0.435
Token (@
Token ID4275
Feature activation-9.13e-01

Pos logit contributions
 synd+1.411
gments+1.410
sts+1.321
��+1.293
vertisements+1.280
Neg logit contributions
ctor-1.171
imates-1.045
 WARN-1.013
 Mehran-0.969
kel-0.955
Tokenren
Token ID918
Feature activation-3.43e-01

Pos logit contributions
ctor+1.525
imates+1.356
 WARN+1.287
kel+1.250
 Mehran+1.235
Neg logit contributions
 synd-1.963
gments-1.959
sts-1.825
��-1.823
vertisements-1.787
Tokenlav
Token ID18809
Feature activation-1.81e+00

Pos logit contributions
ctor+0.648
imates+0.581
 WARN+0.560
kel+0.539
 Mehran+0.539
Neg logit contributions
 synd-0.669
gments-0.667
sts-0.618
��-0.612
vertisements-0.600
Token's
Token ID338
Feature activation-5.00e-01

Pos logit contributions
ctor+0.552
imates+0.497
 WARN+0.479
kel+0.462
 Mehran+0.461
Neg logit contributions
gments-0.523
 synd-0.518
sts-0.479
��-0.479
vertisements-0.466
Token science
Token ID3783
Feature activation-3.36e-01

Pos logit contributions
ctor+1.016
imates+0.918
 WARN+0.897
 Mehran+0.864
kel+0.856
Neg logit contributions
gments-0.897
 synd-0.886
sts-0.827
��-0.815
vertisements-0.800
Token focus
Token ID2962
Feature activation-5.12e-01

Pos logit contributions
ctor+0.698
imates+0.629
 WARN+0.612
 Mehran+0.591
kel+0.589
Neg logit contributions
gments-0.591
 synd-0.582
sts-0.541
��-0.537
vertisements-0.525
Token is
Token ID318
Feature activation-4.01e-01

Pos logit contributions
ctor+1.037
imates+0.935
 WARN+0.908
 Mehran+0.876
kel+0.869
Neg logit contributions
gments-0.927
 synd-0.917
sts-0.849
��-0.845
vertisements-0.825
Token no
Token ID645
Feature activation-3.88e-03

Pos logit contributions
ctor+0.799
imates+0.719
 WARN+0.701
 Mehran+0.677
kel+0.671
Neg logit contributions
gments-0.737
 synd-0.730
sts-0.681
��-0.668
vertisements-0.660
Token easy
Token ID2562
Feature activation-3.19e+00

Pos logit contributions
ctor+0.007
imates+0.006
 WARN+0.006
 Mehran+0.006
kel+0.005
Neg logit contributions
gments-0.008
 synd-0.008
sts-0.008
��-0.007
vertisements-0.007
Token task
Token ID4876
Feature activation-1.09e+00

Pos logit contributions
ctor+6.139
imates+5.418
 WARN+5.345
kel+5.177
 Mehran+5.151
Neg logit contributions
gments-5.817
��-5.351
 synd-5.284
gements-5.279
vertisements-5.225
Token,
Token ID11
Feature activation-7.38e-01

Pos logit contributions
ctor+2.164
imates+1.948
 WARN+1.868
 Mehran+1.811
kel+1.799
Neg logit contributions
gments-2.031
 synd-1.973
��-1.851
sts-1.831
vertisements-1.809
Token says
Token ID1139
Feature activation-4.01e-01

Pos logit contributions
ctor+1.419
imates+1.276
 WARN+1.240
 Mehran+1.209
kel+1.184
Neg logit contributions
gments-1.399
 synd-1.386
sts-1.286
��-1.286
vertisements-1.260
Token Young
Token ID6960
Feature activation-3.48e-01

Pos logit contributions
ctor+0.808
imates+0.729
 WARN+0.713
 Mehran+0.694
kel+0.680
Neg logit contributions
gments-0.726
 synd-0.717
sts-0.670
��-0.659
vertisements-0.652
Tokenah
Token ID993
Feature activation-8.69e-01

Pos logit contributions
ctor+0.700
imates+0.630
 WARN+0.611
 Mehran+0.593
kel+0.590
Neg logit contributions
gments-0.632
 synd-0.630
sts-0.584
��-0.578
vertisements-0.568
Token.
Token ID13
Feature activation-3.31e-02

Pos logit contributions
ctor+1.189
imates+1.080
 WARN+1.049
 Mehran+1.014
kel+1.004
Neg logit contributions
gments-1.016
 synd-1.006
sts-0.934
��-0.918
vertisements-0.903
Token
Token ID198
Feature activation-3.60e-02

Pos logit contributions
ctor+0.064
imates+0.058
 WARN+0.057
 Mehran+0.055
kel+0.054
Neg logit contributions
gments-0.062
 synd-0.062
sts-0.058
��-0.056
vertisements-0.056
Token
Token ID198
Feature activation+3.15e-01

Pos logit contributions
ctor+0.071
imates+0.065
 WARN+0.063
 Mehran+0.061
kel+0.060
Neg logit contributions
 synd-0.066
gments-0.066
sts-0.062
��-0.060
vertisements-0.059
TokenCat
Token ID21979
Feature activation+1.82e-01

Pos logit contributions
gments+0.638
 synd+0.635
sts+0.598
��+0.582
vertisements+0.575
Neg logit contributions
ctor-0.561
imates-0.505
 WARN-0.488
 Mehran-0.470
kel-0.461
Tokenast
Token ID459
Feature activation+7.05e-01

Pos logit contributions
gments+0.320
 synd+0.319
sts+0.295
��+0.289
vertisements+0.284
Neg logit contributions
ctor-0.378
imates-0.344
 WARN-0.335
 Mehran-0.323
kel-0.318
Tokenrophic
Token ID18191
Feature activation+2.38e+00

Pos logit contributions
gments+1.015
 synd+1.011
sts+0.920
��+0.890
vertisements+0.877
Neg logit contributions
ctor-1.678
imates-1.546
 WARN-1.514
 Mehran-1.470
kel-1.454
Token insurance
Token ID5096
Feature activation+8.93e-01

Pos logit contributions
 synd+3.701
gments+3.628
sts+3.319
��+3.213
vertisements+3.159
Neg logit contributions
ctor-5.326
imates-4.853
 WARN-4.642
kel-4.604
 Mehran-4.589
Token adjusting
Token ID22000
Feature activation+5.11e-01

Pos logit contributions
gments+1.540
 synd+1.535
sts+1.420
��+1.385
vertisements+1.366
Neg logit contributions
ctor-1.873
imates-1.705
 WARN-1.664
 Mehran-1.605
kel-1.589
Token is
Token ID318
Feature activation+4.76e-02

Pos logit contributions
gments+0.882
 synd+0.877
sts+0.812
��+0.793
vertisements+0.782
Neg logit contributions
ctor-1.072
imates-0.976
 WARN-0.954
 Mehran-0.919
kel-0.909
Token not
Token ID407
Feature activation+1.30e+00

Pos logit contributions
gments+0.087
 synd+0.086
sts+0.080
��+0.078
vertisements+0.077
Neg logit contributions
ctor-0.096
imates-0.087
 WARN-0.084
 Mehran-0.081
kel-0.080
Token an
Token ID281
Feature activation-7.77e-01

Pos logit contributions
 synd+2.389
gments+2.363
sts+2.193
��+2.137
vertisements+2.112
Neg logit contributions
ctor-2.600
imates-2.358
 WARN-2.280
 Mehran-2.208
kel-2.182
Token40
Token ID1821
Feature activation+3.64e-01

Pos logit contributions
 synd+1.794
gments+1.787
sts+1.689
gements+1.626
��+1.622
Neg logit contributions
ctor-1.359
imates-1.224
 WARN-1.184
 Mehran-1.134
kel-1.101
Token]
Token ID60
Feature activation+2.74e-01

Pos logit contributions
gments+0.599
 synd+0.598
sts+0.549
��+0.533
vertisements+0.528
Neg logit contributions
ctor-0.790
imates-0.724
 WARN-0.705
 Mehran-0.684
kel-0.675
Token
Token ID198
Feature activation-2.15e-01

Pos logit contributions
gments+0.485
 synd+0.482
sts+0.447
��+0.438
vertisements+0.431
Neg logit contributions
ctor-0.564
imates-0.512
 WARN-0.500
 Mehran-0.483
kel-0.477
Token
Token ID198
Feature activation+2.01e-01

Pos logit contributions
ctor+0.419
imates+0.378
 WARN+0.369
 Mehran+0.355
kel+0.351
Neg logit contributions
gments-0.402
 synd-0.401
sts-0.373
��-0.365
vertisements-0.360
TokenC
Token ID34
Feature activation-6.74e-01

Pos logit contributions
gments+0.409
 synd+0.409
sts+0.382
��+0.375
vertisements+0.371
Neg logit contributions
ctor-0.360
imates-0.322
 WARN-0.312
 Mehran-0.299
kel-0.296
Tokenob
Token ID672
Feature activation+1.83e+00

Pos logit contributions
ctor+1.401
imates+1.280
 WARN+1.223
kel+1.202
 Mehran+1.171
Neg logit contributions
gments-1.178
 synd-1.165
��-1.108
sts-1.062
gements-1.048
Tokenain
Token ID391
Feature activation+2.54e+00

Pos logit contributions
 synd+2.787
gments+2.784
sts+2.584
gements+2.414
vertisements+2.406
Neg logit contributions
ctor-4.134
imates-3.808
 WARN-3.756
 Mehran-3.602
kel-3.516
Token immersed
Token ID37970
Feature activation+1.61e-01

Pos logit contributions
 synd+4.463
gments+4.301
sts+4.005
gements+3.802
vertisements+3.767
Neg logit contributions
ctor-5.163
imates-4.754
 WARN-4.648
 Mehran-4.497
kel-4.364
Token himself
Token ID2241
Feature activation-2.51e-01

Pos logit contributions
gments+0.243
 synd+0.241
sts+0.221
��+0.215
vertisements+0.211
Neg logit contributions
ctor-0.374
imates-0.344
 WARN-0.336
 Mehran-0.326
kel-0.324
Token in
Token ID287
Feature activation+7.90e-01

Pos logit contributions
ctor+0.517
imates+0.469
 WARN+0.458
 Mehran+0.441
kel+0.437
Neg logit contributions
gments-0.444
 synd-0.443
sts-0.410
��-0.400
vertisements-0.395
Token artistic
Token ID17290
Feature activation+1.24e-01

Pos logit contributions
 synd+1.347
gments+1.344
sts+1.240
��+1.207
vertisements+1.193
Neg logit contributions
ctor-1.673
imates-1.523
 WARN-1.488
 Mehran-1.433
kel-1.420
Token
Token ID198
Feature activation+9.56e-02

Pos logit contributions
ctor+0.482
imates+0.435
 WARN+0.424
 Mehran+0.408
kel+0.403
Neg logit contributions
gments-0.460
 synd-0.459
sts-0.427
��-0.418
vertisements-0.413
TokenLast
Token ID5956
Feature activation+6.14e-01

Pos logit contributions
gments+0.185
 synd+0.184
sts+0.172
��+0.168
vertisements+0.166
Neg logit contributions
ctor-0.181
imates-0.163
 WARN-0.158
 Mehran-0.152
kel-0.150
Token week
Token ID1285
Feature activation+3.25e-01

Pos logit contributions
gments+1.284
 synd+1.283
sts+1.203
��+1.179
vertisements+1.166
Neg logit contributions
ctor-1.058
imates-0.943
 WARN-0.916
 Mehran-0.875
kel-0.863
Token the
Token ID262
Feature activation-8.56e-02

Pos logit contributions
gments+0.566
 synd+0.563
sts+0.520
��+0.509
vertisements+0.502
Neg logit contributions
ctor-0.678
imates-0.616
 WARN-0.601
 Mehran-0.581
kel-0.574
Token Prime
Token ID5537
Feature activation+2.28e+00

Pos logit contributions
ctor+0.184
imates+0.168
 WARN+0.164
 Mehran+0.159
kel+0.156
Neg logit contributions
gments-0.143
 synd-0.142
sts-0.132
��-0.129
vertisements-0.127
Token Minister
Token ID4139
Feature activation+1.92e+00

Pos logit contributions
 synd+3.103
gments+3.095
sts+2.820
��+2.627
vertisements+2.611
Neg logit contributions
ctor-5.419
imates-5.079
 WARN-4.982
 Mehran-4.797
kel-4.736
Token set
Token ID900
Feature activation+1.64e-01

Pos logit contributions
 synd+3.195
gments+3.188
sts+2.959
��+2.885
vertisements+2.824
Neg logit contributions
ctor-4.039
imates-3.724
 WARN-3.590
 Mehran-3.472
kel-3.432
Token out
Token ID503
Feature activation-9.60e-01

Pos logit contributions
gments+0.283
 synd+0.282
sts+0.260
��+0.255
vertisements+0.251
Neg logit contributions
ctor-0.342
imates-0.311
 WARN-0.306
 Mehran-0.295
kel-0.292
Token plans
Token ID3352
Feature activation-2.30e-01

Pos logit contributions
ctor+2.038
imates+1.869
 WARN+1.818
 Mehran+1.760
kel+1.745
Neg logit contributions
gments-1.617
 synd-1.590
sts-1.473
��-1.465
vertisements-1.415
Token for
Token ID329
Feature activation-3.35e-01

Pos logit contributions
ctor+0.486
imates+0.442
 WARN+0.432
 Mehran+0.417
kel+0.414
Neg logit contributions
gments-0.392
 synd-0.390
sts-0.359
��-0.353
vertisements-0.348
Token Britain
Token ID5491
Feature activation+9.48e-01

Pos logit contributions
ctor+0.689
imates+0.627
 WARN+0.608
 Mehran+0.590
kel+0.585
Neg logit contributions
gments-0.593
 synd-0.580
sts-0.546
��-0.537
vertisements-0.526
Token already
Token ID1541
Feature activation-9.55e-01

Pos logit contributions
ctor+0.155
imates+0.140
 WARN+0.137
 Mehran+0.132
kel+0.131
Neg logit contributions
gments-0.136
 synd-0.136
sts-0.126
��-0.123
vertisements-0.122
Token has
Token ID468
Feature activation-5.58e-01

Pos logit contributions
ctor+1.754
imates+1.570
 WARN+1.515
 Mehran+1.451
kel+1.447
Neg logit contributions
gments-1.905
 synd-1.875
sts-1.771
��-1.737
vertisements-1.735
Token access
Token ID1895
Feature activation-1.04e+00

Pos logit contributions
ctor+1.158
imates+1.054
 WARN+1.024
 Mehran+0.990
kel+0.982
Neg logit contributions
gments-0.976
 synd-0.955
sts-0.900
��-0.878
vertisements-0.872
Token to
Token ID284
Feature activation+1.15e-01

Pos logit contributions
ctor+2.151
imates+1.944
 WARN+1.880
 Mehran+1.815
kel+1.807
Neg logit contributions
gments-1.825
 synd-1.810
sts-1.680
��-1.678
vertisements-1.643
Token living
Token ID2877
Feature activation-3.06e-01

Pos logit contributions
gments+0.198
 synd+0.198
sts+0.183
��+0.178
vertisements+0.176
Neg logit contributions
ctor-0.243
imates-0.221
 WARN-0.216
 Mehran-0.209
kel-0.206
Token brains
Token ID14290
Feature activation+1.87e+00

Pos logit contributions
ctor+0.666
imates+0.608
 WARN+0.592
 Mehran+0.571
kel+0.568
Neg logit contributions
gments-0.508
 synd-0.490
sts-0.463
��-0.455
gements-0.447
Token.
Token ID13
Feature activation-9.51e-02

Pos logit contributions
 synd+3.153
gments+3.116
sts+2.889
��+2.778
vertisements+2.755
Neg logit contributions
ctor-3.958
imates-3.616
 WARN-3.546
 Mehran-3.431
kel-3.361
Token To
Token ID1675
Feature activation-4.20e-01

Pos logit contributions
ctor+0.189
imates+0.170
 WARN+0.167
 Mehran+0.161
kel+0.159
Neg logit contributions
gments-0.176
 synd-0.174
sts-0.162
��-0.159
vertisements-0.157
Token study
Token ID2050
Feature activation-6.40e-01

Pos logit contributions
ctor+0.864
imates+0.782
 WARN+0.765
 Mehran+0.735
kel+0.732
Neg logit contributions
gments-0.740
 synd-0.735
sts-0.685
��-0.671
vertisements-0.663
Token both
Token ID1111
Feature activation-1.99e-01

Pos logit contributions
ctor+1.282
imates+1.155
 WARN+1.126
 Mehran+1.086
kel+1.078
Neg logit contributions
gments-1.168
 synd-1.147
sts-1.079
��-1.061
vertisements-1.041
Token the
Token ID262
Feature activation-2.09e-01

Pos logit contributions
ctor+0.423
imates+0.384
 WARN+0.375
 Mehran+0.362
kel+0.359
Neg logit contributions
gments-0.341
 synd-0.335
sts-0.314
��-0.306
vertisements-0.301
Token treatment
Token ID3513
Feature activation+5.96e-01

Pos logit contributions
 synd+0.470
gments+0.468
sts+0.439
��+0.430
vertisements+0.426
Neg logit contributions
ctor-0.364
imates-0.323
 WARN-0.312
 Mehran-0.299
kel-0.294
Token of
Token ID286
Feature activation-4.46e-02

Pos logit contributions
 synd+1.087
gments+1.067
sts+0.992
��+0.959
vertisements+0.951
Neg logit contributions
ctor-1.200
imates-1.094
 WARN-1.066
 Mehran-1.034
kel-1.008
Token stress
Token ID5503
Feature activation+1.08e+00

Pos logit contributions
ctor+0.095
imates+0.086
 WARN+0.085
 Mehran+0.081
kel+0.081
Neg logit contributions
gments-0.076
 synd-0.074
sts-0.069
��-0.068
vertisements-0.067
Token-
Token ID12
Feature activation+8.68e-03

Pos logit contributions
gments+1.966
 synd+1.963
sts+1.821
��+1.778
vertisements+1.756
Neg logit contributions
ctor-2.168
imates-1.965
 WARN-1.916
 Mehran-1.845
kel-1.824
Tokenrelated
Token ID5363
Feature activation+1.20e+00

Pos logit contributions
gments+0.016
 synd+0.016
sts+0.015
��+0.014
vertisements+0.014
Neg logit contributions
ctor-0.017
imates-0.016
 WARN-0.015
kel-0.015
 Mehran-0.015
Token disorders
Token ID11916
Feature activation+1.86e+00

Pos logit contributions
 synd+1.942
gments+1.903
sts+1.739
��+1.689
vertisements+1.663
Neg logit contributions
ctor-2.693
imates-2.475
 WARN-2.398
 Mehran-2.322
kel-2.293
Token including
Token ID1390
Feature activation+1.70e-01

Pos logit contributions
 synd+3.010
gments+2.897
sts+2.677
��+2.577
vertisements+2.535
Neg logit contributions
ctor-4.121
imates-3.758
 WARN-3.696
 Mehran-3.572
kel-3.495
Token major
Token ID1688
Feature activation+1.50e+00

Pos logit contributions
gments+0.287
 synd+0.284
sts+0.263
��+0.257
vertisements+0.254
Neg logit contributions
ctor-0.364
imates-0.332
 WARN-0.325
 Mehran-0.313
kel-0.310
Token depressive
Token ID36568
Feature activation+2.49e+00

Pos logit contributions
 synd+2.158
gments+2.122
sts+1.923
��+1.864
vertisements+1.829
Neg logit contributions
ctor-3.617
imates-3.328
 WARN-3.250
 Mehran-3.147
kel-3.133
Token and
Token ID290
Feature activation+2.14e-01

Pos logit contributions
 synd+2.960
gments+2.808
sts+2.444
��+2.340
gements+2.293
Neg logit contributions
ctor-6.719
imates-6.229
 WARN-6.120
 Mehran-6.000
kel-5.878
Token anxiety
Token ID9751
Feature activation+1.50e+00

Pos logit contributions
 synd+0.380
gments+0.377
sts+0.349
��+0.342
vertisements+0.335
Neg logit contributions
ctor-0.441
imates-0.402
 WARN-0.391
 Mehran-0.376
kel-0.373
Token of
Token ID286
Feature activation-3.87e-01

Pos logit contributions
gments+0.278
 synd+0.277
sts+0.258
��+0.252
vertisements+0.249
Neg logit contributions
ctor-0.289
imates-0.261
 WARN-0.254
 Mehran-0.244
kel-0.241
Token unilateral
Token ID37095
Feature activation-5.35e-01

Pos logit contributions
ctor+0.880
imates+0.807
 WARN+0.795
 Mehran+0.763
kel+0.758
Neg logit contributions
gments-0.597
 synd-0.583
sts-0.542
��-0.533
vertisements-0.528
Token independence
Token ID10404
Feature activation+9.62e-01

Pos logit contributions
ctor+1.196
imates+1.093
 WARN+1.076
kel+1.030
 Mehran+1.029
Neg logit contributions
gments-0.846
 synd-0.817
��-0.762
sts-0.761
vertisements-0.754
Token in
Token ID287
Feature activation-9.11e-03

Pos logit contributions
 synd+1.713
gments+1.686
sts+1.574
��+1.506
vertisements+1.499
Neg logit contributions
ctor-1.970
imates-1.785
 WARN-1.748
 Mehran-1.682
kel-1.669
Token the
Token ID262
Feature activation+1.91e-02

Pos logit contributions
ctor+0.019
imates+0.017
 WARN+0.017
 Mehran+0.016
kel+0.016
Neg logit contributions
gments-0.016
 synd-0.016
sts-0.015
��-0.014
vertisements-0.014
Token Catalan
Token ID31066
Feature activation+3.87e-01

Pos logit contributions
gments+0.033
 synd+0.033
sts+0.031
��+0.030
vertisements+0.030
Neg logit contributions
ctor-0.040
imates-0.036
 WARN-0.035
 Mehran-0.034
kel-0.034
Token Parliament
Token ID8411
Feature activation+1.66e-01

Pos logit contributions
gments+0.674
 synd+0.672
sts+0.622
��+0.606
vertisements+0.599
Neg logit contributions
ctor-0.805
imates-0.732
 WARN-0.715
 Mehran-0.689
kel-0.682
Token has
Token ID468
Feature activation+1.42e-01

Pos logit contributions
 synd+0.297
gments+0.296
sts+0.275
��+0.267
vertisements+0.264
Neg logit contributions
ctor-0.336
imates-0.305
 WARN-0.298
 Mehran-0.287
kel-0.284
Token several
Token ID1811
Feature activation+6.17e-01

Pos logit contributions
gments+0.244
 synd+0.243
sts+0.225
��+0.220
vertisements+0.217
Neg logit contributions
ctor-0.299
imates-0.272
 WARN-0.266
 Mehran-0.256
kel-0.254
Token readings
Token ID24654
Feature activation-5.08e-01

Pos logit contributions
 synd+1.044
gments+1.041
sts+0.959
��+0.932
vertisements+0.919
Neg logit contributions
ctor-1.317
imates-1.202
 WARN-1.172
 Mehran-1.132
kel-1.120
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
ctor+1.006
imates+0.913
 WARN+0.891
 Mehran+0.853
kel+0.847
Neg logit contributions
gments-0.930
 synd-0.921
sts-0.860
��-0.850
vertisements-0.836
Token 100
Token ID1802
Feature activation+3.50e-01

Pos logit contributions
ctor+0.855
imates+0.776
 WARN+0.758
 Mehran+0.732
kel+0.723
Neg logit contributions
gments-0.743
 synd-0.736
sts-0.686
��-0.672
gements-0.657
Token helium
Token ID43142
Feature activation+8.10e-02

Pos logit contributions
gments+0.634
 synd+0.631
sts+0.585
��+0.576
vertisements+0.565
Neg logit contributions
ctor-0.703
imates-0.639
 WARN-0.620
 Mehran-0.598
kel-0.596
Token-
Token ID12
Feature activation+1.13e-01

Pos logit contributions
gments+0.134
 synd+0.133
sts+0.123
��+0.120
vertisements+0.118
Neg logit contributions
ctor-0.176
imates-0.161
 WARN-0.157
 Mehran-0.152
kel-0.151
Tokenfilled
Token ID20286
Feature activation+8.07e-01

Pos logit contributions
gments+0.135
 synd+0.135
sts+0.121
��+0.116
vertisements+0.114
Neg logit contributions
ctor-0.294
imates-0.273
 WARN-0.268
 Mehran-0.261
kel-0.258
Token balloons
Token ID35485
Feature activation+1.42e+00

Pos logit contributions
 synd+1.350
gments+1.350
sts+1.242
��+1.206
vertisements+1.194
Neg logit contributions
ctor-1.733
imates-1.579
 WARN-1.544
 Mehran-1.486
kel-1.473
Token,
Token ID11
Feature activation+1.59e-01

Pos logit contributions
 synd+2.406
gments+2.368
sts+2.215
vertisements+2.114
��+2.103
Neg logit contributions
ctor-3.017
imates-2.736
 WARN-2.681
 Mehran-2.562
kel-2.556
Token each
Token ID1123
Feature activation+9.43e-03

Pos logit contributions
gments+0.284
 synd+0.283
sts+0.262
��+0.257
vertisements+0.253
Neg logit contributions
ctor-0.323
imates-0.293
 WARN-0.286
 Mehran-0.276
kel-0.273
Token carrying
Token ID6872
Feature activation+6.46e-01

Pos logit contributions
 synd+0.017
gments+0.017
sts+0.016
��+0.016
vertisements+0.015
Neg logit contributions
ctor-0.019
imates-0.017
 WARN-0.017
 Mehran-0.016
kel-0.016
Token a
Token ID257
Feature activation+3.37e-02

Pos logit contributions
 synd+1.262
gments+1.252
sts+1.172
��+1.132
vertisements+1.129
Neg logit contributions
ctor-1.213
imates-1.088
 WARN-1.059
 Mehran-1.010
kel-1.002
Token voucher
Token ID40621
Feature activation+9.31e-04

Pos logit contributions
gments+0.062
 synd+0.061
sts+0.057
��+0.056
vertisements+0.055
Neg logit contributions
ctor-0.067
imates-0.060
 WARN-0.059
 Mehran-0.057
kel-0.056
Token entit
Token ID26189
Feature activation+6.54e-01

Pos logit contributions
gments+0.002
 synd+0.002
sts+0.002
��+0.001
vertisements+0.001
Neg logit contributions
ctor-0.002
imates-0.002
 WARN-0.002
 Mehran-0.002
kel-0.002
Token
Token ID198
Feature activation+2.12e-01

Pos logit contributions
ctor+0.433
imates+0.391
 WARN+0.380
 Mehran+0.366
kel+0.362
Neg logit contributions
gments-0.416
 synd-0.415
sts-0.386
��-0.379
vertisements-0.374
TokenF
Token ID37
Feature activation+4.76e-01

Pos logit contributions
gments+0.423
 synd+0.422
sts+0.395
��+0.385
vertisements+0.381
Neg logit contributions
ctor-0.386
imates-0.347
 WARN-0.337
 Mehran-0.323
kel-0.319
Tokenol
Token ID349
Feature activation+4.89e-01

Pos logit contributions
 synd+0.939
gments+0.937
sts+0.871
��+0.857
vertisements+0.847
Neg logit contributions
ctor-0.886
imates-0.795
 WARN-0.772
kel-0.736
 Mehran-0.734
Tokenlett
Token ID15503
Feature activation+9.08e-01

Pos logit contributions
gments+0.997
 synd+0.993
sts+0.933
��+0.908
vertisements+0.900
Neg logit contributions
ctor-0.872
imates-0.781
 WARN-0.760
 Mehran-0.728
kel-0.715
Token to
Token ID284
Feature activation-1.98e-01

Pos logit contributions
gments+1.600
 synd+1.600
sts+1.480
��+1.437
vertisements+1.419
Neg logit contributions
ctor-1.856
imates-1.692
 WARN-1.639
 Mehran-1.589
kel-1.575
Token enable
Token ID7139
Feature activation+5.51e-01

Pos logit contributions
ctor+0.443
imates+0.405
 WARN+0.396
 Mehran+0.383
kel+0.380
Neg logit contributions
gments-0.315
 synd-0.311
sts-0.289
��-0.281
vertisements-0.277
Token Academic
Token ID31421
Feature activation-1.24e+00

Pos logit contributions
 synd+1.003
gments+0.996
sts+0.926
��+0.899
vertisements+0.888
Neg logit contributions
ctor-1.107
imates-1.005
 WARN-0.975
 Mehran-0.939
kel-0.933
Token Search
Token ID11140
Feature activation+9.89e-01

Pos logit contributions
ctor+2.439
imates+2.190
 WARN+2.156
kel+2.046
 Mehran+2.039
Neg logit contributions
gments-2.301
 synd-2.278
sts-2.098
��-2.077
vertisements-2.064
Token Premier
Token ID9952
Feature activation+7.66e-01

Pos logit contributions
gments+1.767
 synd+1.762
sts+1.634
��+1.595
vertisements+1.575
Neg logit contributions
ctor-2.012
imates-1.826
 WARN-1.780
 Mehran-1.715
kel-1.697
Token and
Token ID290
Feature activation-6.31e-02

Pos logit contributions
gments+1.343
 synd+1.342
sts+1.242
��+1.211
vertisements+1.193
Neg logit contributions
ctor-1.582
imates-1.439
 WARN-1.405
 Mehran-1.354
kel-1.340
Token Master
Token ID5599
Feature activation-7.51e-01

Pos logit contributions
ctor+0.127
imates+0.114
 WARN+0.112
 Mehran+0.108
kel+0.106
Neg logit contributions
gments-0.115
 synd-0.114
sts-0.106
��-0.104
vertisements-0.103
Token,
Token ID11
Feature activation-3.87e-01

Pos logit contributions
ctor+0.534
imates+0.485
 WARN+0.473
 Mehran+0.456
kel+0.452
Neg logit contributions
gments-0.444
 synd-0.442
sts-0.409
��-0.400
vertisements-0.394
Token but
Token ID475
Feature activation-1.48e+00

Pos logit contributions
ctor+0.730
imates+0.655
 WARN+0.638
 Mehran+0.613
kel+0.605
Neg logit contributions
gments-0.750
 synd-0.744
sts-0.696
��-0.686
gements-0.671
Token drawing
Token ID8263
Feature activation-2.53e-01

Pos logit contributions
ctor+2.890
imates+2.624
 WARN+2.557
 Mehran+2.439
kel+2.428
Neg logit contributions
gments-2.724
 synd-2.673
sts-2.512
��-2.468
gements-2.425
Token conclusions
Token ID13242
Feature activation+2.47e-01

Pos logit contributions
ctor+0.513
imates+0.465
 WARN+0.454
 Mehran+0.437
kel+0.433
Neg logit contributions
gments-0.453
 synd-0.452
sts-0.419
��-0.409
vertisements-0.404
Token based
Token ID1912
Feature activation+2.18e-01

Pos logit contributions
 synd+0.437
gments+0.435
sts+0.403
��+0.391
vertisements+0.388
Neg logit contributions
ctor-0.509
imates-0.462
 WARN-0.452
 Mehran-0.435
kel-0.431
Token on
Token ID319
Feature activation+1.57e-02

Pos logit contributions
 synd+0.373
gments+0.371
sts+0.344
��+0.334
vertisements+0.330
Neg logit contributions
ctor-0.461
imates-0.421
 WARN-0.411
 Mehran-0.399
kel-0.392
Token them
Token ID606
Feature activation+2.93e-01

Pos logit contributions
gments+0.029
 synd+0.028
sts+0.026
��+0.026
vertisements+0.025
Neg logit contributions
ctor-0.031
imates-0.028
 WARN-0.028
 Mehran-0.027
kel-0.026
Token wouldn
Token ID3636
Feature activation-6.74e-01

Pos logit contributions
 synd+0.529
gments+0.528
sts+0.490
��+0.476
vertisements+0.472
Neg logit contributions
ctor-0.592
imates-0.536
 WARN-0.525
 Mehran-0.505
kel-0.499
Token
Token ID447
Feature activation-2.94e-01

Pos logit contributions
ctor+1.100
imates+0.965
 WARN+0.934
 Mehran+0.890
kel+0.887
Neg logit contributions
gments-1.486
 synd-1.479
sts-1.386
��-1.379
vertisements-1.363
Token
Token ID247
Feature activation-6.36e-01

Pos logit contributions
ctor+0.664
imates+0.608
 WARN+0.589
 Mehran+0.573
kel+0.571
Neg logit contributions
 synd-0.464
gments-0.459
sts-0.423
vertisements-0.409
gements-0.399
Tokent
Token ID83
Feature activation+8.34e-02

Pos logit contributions
ctor+1.204
imates+1.083
 WARN+1.047
 Mehran+1.003
kel+1.001
Neg logit contributions
gments-1.230
 synd-1.228
sts-1.139
��-1.125
vertisements-1.112
Tokenars
Token ID945
Feature activation-7.94e-01

Pos logit contributions
ctor+0.068
imates+0.062
 WARN+0.061
kel+0.058
 Mehran+0.058
Neg logit contributions
gments-0.046
 synd-0.046
sts-0.041
��-0.041
vertisements-0.040
Token and
Token ID290
Feature activation-3.18e-01

Pos logit contributions
ctor+1.619
imates+1.462
 WARN+1.422
 Mehran+1.379
kel+1.367
Neg logit contributions
gments-1.421
 synd-1.404
sts-1.303
��-1.288
vertisements-1.262
Token close
Token ID1969
Feature activation-2.65e-01

Pos logit contributions
ctor+0.662
imates+0.602
 WARN+0.587
 Mehran+0.566
kel+0.561
Neg logit contributions
gments-0.552
 synd-0.549
sts-0.509
��-0.497
vertisements-0.490
Token party
Token ID2151
Feature activation-2.98e-01

Pos logit contributions
ctor+0.542
imates+0.492
 WARN+0.478
 Mehran+0.461
kel+0.457
Neg logit contributions
gments-0.473
 synd-0.468
sts-0.436
��-0.429
vertisements-0.422
Token leaders
Token ID2766
Feature activation-3.37e-01

Pos logit contributions
ctor+0.641
imates+0.587
 WARN+0.567
 Mehran+0.551
kel+0.546
Neg logit contributions
gments-0.495
 synd-0.486
sts-0.454
��-0.450
vertisements-0.440
Token and
Token ID290
Feature activation+7.38e-02

Pos logit contributions
ctor+0.687
imates+0.623
 WARN+0.607
 Mehran+0.588
kel+0.579
Neg logit contributions
gments-0.600
 synd-0.593
sts-0.551
��-0.545
vertisements-0.534
Token workers
Token ID3259
Feature activation-1.40e+00

Pos logit contributions
gments+0.125
 synd+0.124
sts+0.115
��+0.112
vertisements+0.111
Neg logit contributions
ctor-0.157
imates-0.143
 WARN-0.140
 Mehran-0.135
kel-0.133
Token of
Token ID286
Feature activation-3.84e-01

Pos logit contributions
ctor+2.765
imates+2.507
 WARN+2.440
 Mehran+2.339
kel+2.315
Neg logit contributions
gments-2.568
 synd-2.481
��-2.349
sts-2.323
gements-2.289
Token Sh
Token ID911
Feature activation-6.22e-02

Pos logit contributions
ctor+0.814
imates+0.744
 WARN+0.728
 Mehran+0.704
kel+0.692
Neg logit contributions
gments-0.648
 synd-0.639
sts-0.595
��-0.584
vertisements-0.572
Token.
Token ID13
Feature activation-7.63e-01

Pos logit contributions
ctor+0.120
imates+0.108
 WARN+0.104
kel+0.101
 Mehran+0.101
Neg logit contributions
 synd-0.119
gments-0.118
��-0.109
sts-0.109
gements-0.106
Token Bh
Token ID16581
Feature activation+8.01e-01

Pos logit contributions
ctor+1.523
imates+1.370
 WARN+1.339
 Mehran+1.298
kel+1.281
Neg logit contributions
gments-1.399
 synd-1.385
sts-1.286
��-1.266
gements-1.244
Token it
Token ID340
Feature activation+4.28e-01

Pos logit contributions
gments+0.086
 synd+0.086
sts+0.080
��+0.078
vertisements+0.078
Neg logit contributions
ctor-0.086
imates-0.078
 WARN-0.076
 Mehran-0.073
kel-0.072
Token felt
Token ID2936
Feature activation+1.94e-01

Pos logit contributions
gments+0.761
 synd+0.759
sts+0.704
��+0.686
vertisements+0.678
Neg logit contributions
ctor-0.874
imates-0.793
 WARN-0.775
 Mehran-0.747
kel-0.738
Token like
Token ID588
Feature activation-5.21e-01

Pos logit contributions
gments+0.320
 synd+0.319
sts+0.294
��+0.286
vertisements+0.282
Neg logit contributions
ctor-0.422
imates-0.386
 WARN-0.377
 Mehran-0.364
kel-0.361
Token a
Token ID257
Feature activation-2.36e-01

Pos logit contributions
ctor+1.049
imates+0.950
 WARN+0.926
 Mehran+0.892
kel+0.885
Neg logit contributions
gments-0.942
 synd-0.935
sts-0.869
��-0.848
vertisements-0.839
Token place
Token ID1295
Feature activation+5.19e-01

Pos logit contributions
ctor+0.446
imates+0.400
 WARN+0.392
 Mehran+0.374
kel+0.372
Neg logit contributions
gments-0.457
 synd-0.453
sts-0.425
��-0.416
vertisements-0.412
Token I
Token ID314
Feature activation-3.17e-01

Pos logit contributions
gments+0.946
 synd+0.945
sts+0.877
��+0.856
vertisements+0.845
Neg logit contributions
ctor-1.037
imates-0.939
 WARN-0.915
 Mehran-0.881
kel-0.872
Token had
Token ID550
Feature activation-5.44e-03

Pos logit contributions
ctor+0.615
imates+0.554
 WARN+0.539
 Mehran+0.518
kel+0.515
Neg logit contributions
gments-0.598
 synd-0.595
sts-0.553
��-0.543
vertisements-0.536
Token gotten
Token ID7891
Feature activation-8.24e-01

Pos logit contributions
ctor+0.011
imates+0.010
 WARN+0.010
 Mehran+0.010
kel+0.010
Neg logit contributions
gments-0.009
 synd-0.009
sts-0.009
��-0.008
vertisements-0.008
Token to
Token ID284
Feature activation-3.63e-02

Pos logit contributions
ctor+1.618
imates+1.466
 WARN+1.432
 Mehran+1.377
kel+1.362
Neg logit contributions
gments-1.528
 synd-1.515
sts-1.414
��-1.393
vertisements-1.371
Token know
Token ID760
Feature activation-1.02e+00

Pos logit contributions
ctor+0.080
imates+0.073
 WARN+0.071
 Mehran+0.069
kel+0.068
Neg logit contributions
gments-0.059
 synd-0.059
sts-0.054
��-0.053
vertisements-0.052
Token.
Token ID13
Feature activation-2.48e-01

Pos logit contributions
ctor+2.079
imates+1.882
 WARN+1.823
 Mehran+1.760
kel+1.759
Neg logit contributions
gments-1.837
 synd-1.814
sts-1.689
��-1.667
vertisements-1.640
Tokenup
Token ID929
Feature activation-3.75e-02

Pos logit contributions
gments+0.223
 synd+0.223
sts+0.209
��+0.205
vertisements+0.203
Neg logit contributions
ctor-0.177
imates-0.157
 WARN-0.153
 Mehran-0.146
kel-0.144
Token of
Token ID286
Feature activation+2.39e-01

Pos logit contributions
ctor+0.073
imates+0.066
 WARN+0.065
 Mehran+0.062
kel+0.062
Neg logit contributions
gments-0.070
 synd-0.070
sts-0.065
��-0.063
vertisements-0.063
Token a
Token ID257
Feature activation+1.60e-01

Pos logit contributions
gments+0.424
 synd+0.422
sts+0.391
��+0.383
vertisements+0.377
Neg logit contributions
ctor-0.489
imates-0.444
 WARN-0.433
 Mehran-0.418
kel-0.414
Token bare
Token ID6247
Feature activation+9.54e-01

Pos logit contributions
gments+0.278
 synd+0.273
sts+0.254
��+0.250
vertisements+0.247
Neg logit contributions
ctor-0.336
imates-0.304
 WARN-0.299
kel-0.287
 Mehran-0.286
Token shoulder
Token ID8163
Feature activation-5.87e-01

Pos logit contributions
gments+1.638
 synd+1.629
sts+1.505
��+1.477
vertisements+1.455
Neg logit contributions
ctor-2.015
imates-1.829
 WARN-1.785
 Mehran-1.721
kel-1.712
Token blade
Token ID11865
Feature activation-7.44e-01

Pos logit contributions
ctor+1.197
imates+1.086
 WARN+1.057
kel+1.017
 Mehran+1.011
Neg logit contributions
gments-1.047
 synd-1.033
sts-0.962
��-0.947
vertisements-0.932
Token,
Token ID11
Feature activation+1.75e-01

Pos logit contributions
ctor+1.501
imates+1.362
 WARN+1.324
 Mehran+1.272
kel+1.268
Neg logit contributions
gments-1.343
 synd-1.320
sts-1.232
��-1.223
vertisements-1.198
Token and
Token ID290
Feature activation+2.59e-01

Pos logit contributions
gments+0.305
 synd+0.303
sts+0.280
��+0.275
vertisements+0.271
Neg logit contributions
ctor-0.364
imates-0.332
 WARN-0.323
 Mehran-0.312
kel-0.310
Token a
Token ID257
Feature activation+5.07e-01

Pos logit contributions
gments+0.450
 synd+0.449
sts+0.415
��+0.405
vertisements+0.400
Neg logit contributions
ctor-0.539
imates-0.490
 WARN-0.478
 Mehran-0.461
kel-0.457
Token sequence
Token ID8379
Feature activation+6.09e-01

Pos logit contributions
gments+0.907
 synd+0.904
sts+0.838
��+0.819
vertisements+0.809
Neg logit contributions
ctor-1.030
imates-0.935
 WARN-0.912
 Mehran-0.878
kel-0.870
Token in
Token ID287
Feature activation+4.93e-01

Pos logit contributions
 synd+1.091
gments+1.078
sts+1.003
��+0.971
vertisements+0.959
Neg logit contributions
ctor-1.241
imates-1.126
 WARN-1.102
 Mehran-1.060
kel-1.042
Token in
Token ID287
Feature activation+5.82e-02

Pos logit contributions
ctor+0.105
imates+0.095
 WARN+0.093
 Mehran+0.090
kel+0.089
Neg logit contributions
gments-0.094
 synd-0.093
sts-0.087
��-0.085
vertisements-0.084
Token the
Token ID262
Feature activation+1.08e-01

Pos logit contributions
gments+0.103
 synd+0.102
sts+0.095
��+0.093
vertisements+0.092
Neg logit contributions
ctor-0.119
imates-0.108
 WARN-0.106
 Mehran-0.102
kel-0.101
Token basement
Token ID17012
Feature activation+5.90e-01

Pos logit contributions
gments+0.190
 synd+0.188
sts+0.175
��+0.171
vertisements+0.169
Neg logit contributions
ctor-0.223
imates-0.202
 WARN-0.198
 Mehran-0.191
kel-0.188
Token and
Token ID290
Feature activation-3.55e-01

Pos logit contributions
 synd+1.055
gments+1.055
sts+0.978
��+0.950
vertisements+0.939
Neg logit contributions
ctor-1.198
imates-1.089
 WARN-1.060
 Mehran-1.021
kel-1.010
Token out
Token ID503
Feature activation-1.45e+00

Pos logit contributions
ctor+0.705
imates+0.638
 WARN+0.622
 Mehran+0.599
kel+0.592
Neg logit contributions
gments-0.652
 synd-0.647
sts-0.603
��-0.589
vertisements-0.582
Token of
Token ID286
Feature activation-8.78e-01

Pos logit contributions
ctor+2.780
imates+2.512
 WARN+2.406
 Mehran+2.346
kel+2.297
Neg logit contributions
gments-2.759
 synd-2.707
��-2.520
sts-2.508
gements-2.463
Token view
Token ID1570
Feature activation-8.18e-01

Pos logit contributions
ctor+1.703
imates+1.536
 WARN+1.497
 Mehran+1.443
kel+1.421
Neg logit contributions
gments-1.654
 synd-1.638
sts-1.535
��-1.503
vertisements-1.483
Token.
Token ID13
Feature activation-2.00e-01

Pos logit contributions
ctor+1.613
imates+1.462
 WARN+1.414
 Mehran+1.361
kel+1.349
Neg logit contributions
gments-1.517
 synd-1.507
sts-1.397
��-1.378
vertisements-1.356
Token
Token ID447
Feature activation+4.46e-02

Pos logit contributions
ctor+0.406
imates+0.366
 WARN+0.359
 Mehran+0.346
kel+0.342
Neg logit contributions
gments-0.362
 synd-0.359
sts-0.334
��-0.327
vertisements-0.322
Token
Token ID251
Feature activation-3.92e-02

Pos logit contributions
gments+0.066
 synd+0.066
sts+0.060
��+0.058
vertisements+0.057
Neg logit contributions
ctor-0.105
imates-0.096
 WARN-0.094
 Mehran-0.091
kel-0.091
Token
Token ID198
Feature activation-3.25e-01

Pos logit contributions
ctor+0.078
imates+0.070
 WARN+0.068
 Mehran+0.066
kel+0.065
Neg logit contributions
gments-0.072
 synd-0.072
sts-0.067
��-0.066
vertisements-0.065
Token,
Token ID11
Feature activation-3.36e-01

Pos logit contributions
gments+0.307
 synd+0.305
sts+0.282
��+0.278
vertisements+0.274
Neg logit contributions
ctor-0.354
imates-0.322
 WARN-0.313
 Mehran-0.302
kel-0.299
Token cease
Token ID13468
Feature activation+7.40e-01

Pos logit contributions
ctor+0.733
imates+0.669
 WARN+0.654
 Mehran+0.633
kel+0.628
Neg logit contributions
gments-0.550
 synd-0.543
sts-0.502
��-0.498
vertisements-0.485
Token to
Token ID284
Feature activation-4.57e-01

Pos logit contributions
gments+1.232
 synd+1.229
sts+1.133
��+1.103
vertisements+1.087
Neg logit contributions
ctor-1.596
imates-1.458
 WARN-1.424
 Mehran-1.376
kel-1.362
Token care
Token ID1337
Feature activation-8.93e-01

Pos logit contributions
ctor+0.886
imates+0.798
 WARN+0.776
 Mehran+0.746
kel+0.743
Neg logit contributions
gments-0.860
 synd-0.854
sts-0.799
��-0.784
vertisements-0.774
Token.
Token ID13
Feature activation-9.01e-02

Pos logit contributions
ctor+1.659
imates+1.488
 WARN+1.431
 Mehran+1.376
kel+1.375
Neg logit contributions
gments-1.753
 synd-1.749
sts-1.627
��-1.614
vertisements-1.585
Token
Token ID198
Feature activation-2.75e-01

Pos logit contributions
ctor+0.175
imates+0.157
 WARN+0.154
 Mehran+0.148
kel+0.146
Neg logit contributions
gments-0.171
 synd-0.170
sts-0.158
��-0.156
vertisements-0.153
Token
Token ID198
Feature activation+5.96e-02

Pos logit contributions
ctor+0.538
imates+0.486
 WARN+0.474
 Mehran+0.455
kel+0.451
Neg logit contributions
gments-0.514
 synd-0.513
sts-0.477
��-0.466
vertisements-0.461
TokenOn
Token ID2202
Feature activation+4.67e-02

Pos logit contributions
gments+0.119
 synd+0.119
sts+0.111
��+0.109
vertisements+0.108
Neg logit contributions
ctor-0.109
imates-0.097
 WARN-0.094
 Mehran-0.090
kel-0.090
Token the
Token ID262
Feature activation+9.30e-02

Pos logit contributions
gments+0.084
 synd+0.083
sts+0.077
��+0.076
vertisements+0.075
Neg logit contributions
ctor-0.095
imates-0.086
 WARN-0.084
 Mehran-0.081
kel-0.081
Token other
Token ID584
Feature activation+1.51e+00

Pos logit contributions
gments+0.167
 synd+0.166
sts+0.154
��+0.151
vertisements+0.149
Neg logit contributions
ctor-0.189
imates-0.171
 WARN-0.167
 Mehran-0.160
kel-0.159
Token hand
Token ID1021
Feature activation+4.42e-01

Pos logit contributions
 synd+2.367
gments+2.334
sts+2.154
��+2.076
gements+2.052
Neg logit contributions
ctor-3.374
imates-3.078
 WARN-3.045
 Mehran-2.918
kel-2.897
Token a
Token ID257
Feature activation-1.39e-01

Pos logit contributions
ctor+0.391
imates+0.355
 WARN+0.345
 Mehran+0.333
kel+0.329
Neg logit contributions
gments-0.358
 synd-0.353
sts-0.329
��-0.322
vertisements-0.319
Token conversation
Token ID5273
Feature activation+2.30e-01

Pos logit contributions
ctor+0.283
imates+0.257
 WARN+0.253
 Mehran+0.243
kel+0.241
Neg logit contributions
gments-0.248
 synd-0.243
sts-0.227
��-0.222
vertisements-0.220
Token piece
Token ID3704
Feature activation-6.09e-01

Pos logit contributions
gments+0.425
 synd+0.422
sts+0.393
��+0.385
vertisements+0.380
Neg logit contributions
ctor-0.456
imates-0.412
 WARN-0.401
 Mehran-0.385
kel-0.383
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
ctor+1.235
imates+1.123
 WARN+1.084
 Mehran+1.051
kel+1.043
Neg logit contributions
gments-1.094
 synd-1.078
sts-1.004
��-0.993
gements-0.970
Token
Token ID447
Feature activation-2.82e-01

Pos logit contributions
ctor+0.605
imates+0.546
 WARN+0.533
 Mehran+0.515
kel+0.509
Neg logit contributions
gments-0.563
 synd-0.559
sts-0.520
��-0.512
vertisements-0.502
Token
Token ID251
Feature activation-2.98e-01

Pos logit contributions
ctor+0.631
imates+0.578
 WARN+0.564
 Mehran+0.547
kel+0.543
Neg logit contributions
 synd-0.444
gments-0.444
sts-0.407
��-0.391
vertisements-0.390
Token He
Token ID679
Feature activation-8.22e-01

Pos logit contributions
ctor+0.578
imates+0.522
 WARN+0.508
 Mehran+0.490
kel+0.484
Neg logit contributions
gments-0.563
 synd-0.559
sts-0.521
��-0.511
vertisements-0.504
Token says
Token ID1139
Feature activation+1.97e-01

Pos logit contributions
ctor+1.656
imates+1.497
 WARN+1.456
kel+1.395
 Mehran+1.393
Neg logit contributions
 synd-1.485
gments-1.483
sts-1.372
��-1.361
vertisements-1.335
Token he
Token ID339
Feature activation-4.91e-01

Pos logit contributions
gments+0.356
 synd+0.355
sts+0.330
��+0.322
vertisements+0.318
Neg logit contributions
ctor-0.395
imates-0.359
 WARN-0.350
 Mehran-0.337
kel-0.333
Token
Token ID447
Feature activation-5.36e-01

Pos logit contributions
ctor+0.985
imates+0.892
 WARN+0.868
 Mehran+0.835
kel+0.828
Neg logit contributions
gments-0.894
 synd-0.892
sts-0.827
��-0.812
vertisements-0.800
Token
Token ID247
Feature activation-8.04e-01

Pos logit contributions
ctor+1.086
imates+0.989
 WARN+0.944
kel+0.919
 Mehran+0.919
Neg logit contributions
 synd-0.969
gments-0.956
sts-0.896
vertisements-0.872
gements-0.850
Token around
Token ID1088
Feature activation-3.26e-01

Pos logit contributions
ctor+0.101
imates+0.091
 WARN+0.089
 Mehran+0.086
kel+0.085
Neg logit contributions
gments-0.099
 synd-0.099
sts-0.092
��-0.090
vertisements-0.089
Token."
Token ID526
Feature activation+4.27e-02

Pos logit contributions
ctor+0.673
imates+0.612
 WARN+0.595
 Mehran+0.575
kel+0.570
Neg logit contributions
gments-0.572
 synd-0.566
sts-0.525
��-0.516
vertisements-0.507
Token
Token ID198
Feature activation-4.25e-01

Pos logit contributions
gments+0.081
 synd+0.080
sts+0.075
��+0.073
vertisements+0.072
Neg logit contributions
ctor-0.082
imates-0.074
 WARN-0.072
 Mehran-0.070
kel-0.069
Token
Token ID198
Feature activation-1.28e-01

Pos logit contributions
ctor+0.824
imates+0.743
 WARN+0.723
 Mehran+0.695
kel+0.690
Neg logit contributions
gments-0.802
 synd-0.798
sts-0.743
��-0.732
vertisements-0.722
TokenMartin
Token ID24778
Feature activation+7.35e-01

Pos logit contributions
ctor+0.239
imates+0.215
 WARN+0.209
 Mehran+0.200
kel+0.199
Neg logit contributions
 synd-0.249
gments-0.249
sts-0.232
��-0.228
vertisements-0.225
Token Sor
Token ID15423
Feature activation-2.12e+00

Pos logit contributions
gments+1.282
 synd+1.279
sts+1.184
��+1.154
vertisements+1.139
Neg logit contributions
ctor-1.527
imates-1.389
 WARN-1.355
 Mehran-1.307
kel-1.293
Tokenice
Token ID501
Feature activation-4.32e-01

Pos logit contributions
ctor+4.036
imates+3.620
kel+3.447
 WARN+3.405
pei+3.295
Neg logit contributions
 synd-4.093
gments-4.043
��-3.807
sts-3.720
vertisements-3.651
Token,
Token ID11
Feature activation-1.50e-01

Pos logit contributions
ctor+0.890
imates+0.809
 WARN+0.784
 Mehran+0.756
kel+0.755
Neg logit contributions
gments-0.763
 synd-0.758
sts-0.703
��-0.690
vertisements-0.678
Token who
Token ID508
Feature activation-5.39e-01

Pos logit contributions
ctor+0.315
imates+0.286
 WARN+0.280
 Mehran+0.270
kel+0.267
Neg logit contributions
gments-0.260
 synd-0.259
sts-0.240
��-0.234
vertisements-0.231
Token owns
Token ID12216
Feature activation-1.01e-01

Pos logit contributions
ctor+1.148
imates+1.046
 WARN+1.021
 Mehran+0.985
kel+0.976
Neg logit contributions
gments-0.912
 synd-0.909
sts-0.840
��-0.820
vertisements-0.808
Token several
Token ID1811
Feature activation+1.71e-01

Pos logit contributions
ctor+0.207
imates+0.188
 WARN+0.184
 Mehran+0.177
kel+0.175
Neg logit contributions
gments-0.179
 synd-0.177
sts-0.165
��-0.161
vertisements-0.159
Token Despite
Token ID7945
Feature activation-2.32e-01

Pos logit contributions
ctor+0.573
imates+0.518
 WARN+0.504
 Mehran+0.485
kel+0.480
Neg logit contributions
gments-0.552
 synd-0.550
sts-0.512
��-0.502
vertisements-0.495
Token these
Token ID777
Feature activation-2.13e-01

Pos logit contributions
ctor+0.442
imates+0.398
 WARN+0.388
 Mehran+0.372
kel+0.368
Neg logit contributions
gments-0.445
 synd-0.444
sts-0.414
��-0.405
vertisements-0.400
Token shortcomings
Token ID31731
Feature activation-9.72e-01

Pos logit contributions
ctor+0.417
imates+0.377
 WARN+0.369
 Mehran+0.353
kel+0.351
Neg logit contributions
gments-0.398
 synd-0.391
sts-0.368
��-0.359
vertisements-0.355
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
ctor+2.013
imates+1.834
 WARN+1.784
 Mehran+1.725
kel+1.707
Neg logit contributions
gments-1.699
 synd-1.688
sts-1.565
��-1.530
vertisements-1.508
Token the
Token ID262
Feature activation+2.77e-02

Pos logit contributions
ctor+0.262
imates+0.235
 WARN+0.229
 Mehran+0.219
kel+0.217
Neg logit contributions
gments-0.279
 synd-0.279
sts-0.260
��-0.255
vertisements-0.252
Token SI
Token ID25861
Feature activation-2.07e+00

Pos logit contributions
gments+0.052
 synd+0.052
sts+0.048
��+0.047
vertisements+0.047
Neg logit contributions
ctor-0.054
imates-0.049
 WARN-0.048
 Mehran-0.046
kel-0.045
Token unit
Token ID4326
Feature activation-9.97e-01

Pos logit contributions
ctor+3.907
imates+3.485
 WARN+3.375
kel+3.266
 Mehran+3.203
Neg logit contributions
gments-4.004
 synd-3.943
sts-3.659
��-3.623
vertisements-3.615
Token of
Token ID286
Feature activation-2.39e-01

Pos logit contributions
ctor+1.962
imates+1.788
 WARN+1.725
 Mehran+1.669
kel+1.659
Neg logit contributions
gments-1.847
 synd-1.825
sts-1.701
��-1.666
vertisements-1.654
Token electrom
Token ID26629
Feature activation-5.86e-01

Pos logit contributions
ctor+0.478
imates+0.433
 WARN+0.422
 Mehran+0.405
kel+0.402
Neg logit contributions
 synd-0.434
gments-0.433
sts-0.402
��-0.392
vertisements-0.387
Tokenotive
Token ID19138
Feature activation+7.96e-01

Pos logit contributions
ctor+1.266
imates+1.142
 WARN+1.104
kel+1.083
 Mehran+1.067
Neg logit contributions
 synd-0.980
gments-0.971
sts-0.902
��-0.899
vertisements-0.853
Token force
Token ID2700
Feature activation+6.10e-02

Pos logit contributions
 synd+1.348
gments+1.337
sts+1.233
��+1.194
vertisements+1.182
Neg logit contributions
ctor-1.697
imates-1.560
 WARN-1.520
 Mehran-1.470
kel-1.447
Token of
Token ID286
Feature activation-1.09e-02

Pos logit contributions
 synd+1.392
gments+1.390
sts+1.290
��+1.259
vertisements+1.246
Neg logit contributions
ctor-1.485
imates-1.345
 WARN-1.310
 Mehran-1.261
kel-1.244
Token teeth
Token ID9941
Feature activation-1.59e+00

Pos logit contributions
ctor+0.023
imates+0.021
 WARN+0.020
 Mehran+0.020
kel+0.019
Neg logit contributions
gments-0.019
 synd-0.019
sts-0.017
��-0.017
vertisements-0.017
Token from
Token ID422
Feature activation+3.33e-01

Pos logit contributions
ctor+3.125
imates+2.807
 WARN+2.739
 Mehran+2.645
kel+2.639
Neg logit contributions
gments-2.916
 synd-2.876
sts-2.693
��-2.690
gements-2.615
Token a
Token ID257
Feature activation-6.01e-02

Pos logit contributions
gments+0.581
 synd+0.580
sts+0.536
��+0.523
vertisements+0.516
Neg logit contributions
ctor-0.691
imates-0.629
 WARN-0.613
 Mehran-0.591
kel-0.585
Token tr
Token ID491
Feature activation-1.28e-01

Pos logit contributions
ctor+0.133
imates+0.122
 WARN+0.120
 Mehran+0.116
kel+0.115
Neg logit contributions
gments-0.096
 synd-0.095
sts-0.088
��-0.086
vertisements-0.085
Tokenicer
Token ID16647
Feature activation-3.27e+00

Pos logit contributions
ctor+0.213
imates+0.190
 WARN+0.182
 Mehran+0.174
kel+0.173
Neg logit contributions
 synd-0.278
gments-0.277
sts-0.260
��-0.258
vertisements-0.253
Tokenat
Token ID265
Feature activation-2.48e-01

Pos logit contributions
ctor+5.253
imates+4.561
kel+4.366
 WARN+4.292
 Mehran+4.069
Neg logit contributions
gments-7.276
 synd-6.997
sts-6.801
gements-6.623
vertisements-6.588
Tokenops
Token ID2840
Feature activation-6.37e-01

Pos logit contributions
ctor+0.360
imates+0.309
 WARN+0.297
 Mehran+0.281
kel+0.281
Neg logit contributions
gments-0.589
 synd-0.588
sts-0.554
��-0.551
vertisements-0.542
Token.
Token ID13
Feature activation-5.76e-02

Pos logit contributions
ctor+1.289
imates+1.166
 WARN+1.139
 Mehran+1.093
kel+1.089
Neg logit contributions
gments-1.149
 synd-1.131
sts-1.061
��-1.044
gements-1.027
Token (
Token ID357
Feature activation+3.83e-01

Pos logit contributions
ctor+0.114
imates+0.103
 WARN+0.100
 Mehran+0.097
kel+0.096
Neg logit contributions
gments-0.106
 synd-0.106
sts-0.098
��-0.097
vertisements-0.095
TokenFS
Token ID10652
Feature activation-5.57e-01

Pos logit contributions
gments+0.784
 synd+0.784
sts+0.735
��+0.717
vertisements+0.710
Neg logit contributions
ctor-0.679
imates-0.607
 WARN-0.589
 Mehran-0.563
kel-0.556
Token that
Token ID326
Feature activation+1.14e-01

Pos logit contributions
ctor+0.772
imates+0.702
 WARN+0.684
 Mehran+0.658
kel+0.650
Neg logit contributions
gments-0.703
 synd-0.696
sts-0.648
��-0.638
vertisements-0.629
Token the
Token ID262
Feature activation+6.22e-02

Pos logit contributions
gments+0.204
 synd+0.204
sts+0.189
��+0.184
vertisements+0.182
Neg logit contributions
ctor-0.230
imates-0.209
 WARN-0.204
 Mehran-0.196
kel-0.194
Token reference
Token ID4941
Feature activation-5.29e-01

Pos logit contributions
gments+0.115
 synd+0.114
sts+0.106
��+0.104
vertisements+0.103
Neg logit contributions
ctor-0.123
imates-0.112
 WARN-0.109
 Mehran-0.105
kel-0.104
Token to
Token ID284
Feature activation-5.02e-01

Pos logit contributions
ctor+1.059
imates+0.963
 WARN+0.938
 Mehran+0.897
kel+0.889
Neg logit contributions
gments-0.963
 synd-0.955
sts-0.890
��-0.877
vertisements-0.863
Token Sh
Token ID911
Feature activation+1.00e-01

Pos logit contributions
ctor+0.998
imates+0.905
 WARN+0.884
 Mehran+0.848
kel+0.841
Neg logit contributions
gments-0.920
 synd-0.914
sts-0.850
��-0.834
vertisements-0.824
Tokenav
Token ID615
Feature activation-2.31e+00

Pos logit contributions
 synd+0.191
gments+0.191
sts+0.177
��+0.175
vertisements+0.172
Neg logit contributions
ctor-0.193
imates-0.174
 WARN-0.168
kel-0.162
 Mehran-0.161
Tokeniv
Token ID452
Feature activation-1.24e+00

Pos logit contributions
ctor+4.005
imates+3.577
kel+3.413
 WARN+3.355
 Mehran+3.265
Neg logit contributions
 synd-4.736
gments-4.733
��-4.406
sts-4.332
gements-4.305
Token (
Token ID357
Feature activation-2.40e-01

Pos logit contributions
ctor+2.461
imates+2.225
 WARN+2.164
 Mehran+2.075
kel+2.073
Neg logit contributions
gments-2.275
 synd-2.271
sts-2.104
��-2.087
vertisements-2.054
Token2008
Token ID11528
Feature activation-1.33e+00

Pos logit contributions
ctor+0.416
imates+0.370
 WARN+0.359
 Mehran+0.342
kel+0.340
Neg logit contributions
gments-0.503
 synd-0.502
sts-0.470
��-0.462
vertisements-0.457
Token)
Token ID8
Feature activation+2.62e-01

Pos logit contributions
ctor+2.463
imates+2.204
 WARN+2.160
kel+2.058
 Mehran+2.048
Neg logit contributions
gments-2.591
 synd-2.576
sts-2.398
��-2.382
vertisements-2.345
Token should
Token ID815
Feature activation-7.33e-01

Pos logit contributions
 synd+0.473
gments+0.473
sts+0.438
��+0.425
vertisements+0.421
Neg logit contributions
ctor-0.526
imates-0.476
 WARN-0.464
 Mehran-0.448
kel-0.442
Token actually
Token ID1682
Feature activation+3.29e-03

Pos logit contributions
 synd+0.039
gments+0.039
sts+0.036
��+0.035
vertisements+0.035
Neg logit contributions
ctor-0.042
imates-0.038
 WARN-0.036
 Mehran-0.035
kel-0.035
Token unacceptable
Token ID18010
Feature activation-2.79e-02

Pos logit contributions
 synd+0.006
gments+0.006
sts+0.005
��+0.005
vertisements+0.005
Neg logit contributions
ctor-0.007
imates-0.006
 WARN-0.006
 Mehran-0.006
kel-0.006
Token,
Token ID11
Feature activation-2.28e-01

Pos logit contributions
ctor+0.058
imates+0.053
 WARN+0.052
 Mehran+0.050
kel+0.049
Neg logit contributions
gments-0.049
 synd-0.048
sts-0.045
��-0.044
vertisements-0.043
Token how
Token ID703
Feature activation-4.81e-01

Pos logit contributions
ctor+0.476
imates+0.433
 WARN+0.423
 Mehran+0.407
kel+0.405
Neg logit contributions
gments-0.395
 synd-0.392
sts-0.363
��-0.357
vertisements-0.351
Token non
Token ID1729
Feature activation-2.52e-01

Pos logit contributions
ctor+0.964
imates+0.872
 WARN+0.850
 Mehran+0.817
kel+0.812
Neg logit contributions
gments-0.877
 synd-0.870
sts-0.809
��-0.793
vertisements-0.782
Tokenchal
Token ID38009
Feature activation-2.31e+00

Pos logit contributions
ctor+0.441
imates+0.392
 WARN+0.381
 Mehran+0.362
kel+0.361
Neg logit contributions
gments-0.523
 synd-0.521
sts-0.489
��-0.480
vertisements-0.474
Tokenantly
Token ID3875
Feature activation+3.03e-01

Pos logit contributions
ctor+4.355
imates+3.826
 WARN+3.710
kel+3.557
 Mehran+3.456
Neg logit contributions
gments-4.437
 synd-4.402
sts-4.156
vertisements-4.062
��-4.040
Token some
Token ID617
Feature activation-9.04e-02

Pos logit contributions
 synd+0.556
gments+0.551
sts+0.514
��+0.497
vertisements+0.492
Neg logit contributions
ctor-0.604
imates-0.548
 WARN-0.531
 Mehran-0.513
kel-0.505
Token say
Token ID910
Feature activation+2.49e-01

Pos logit contributions
ctor+0.188
imates+0.171
 WARN+0.167
 Mehran+0.161
kel+0.160
Neg logit contributions
gments-0.158
 synd-0.156
sts-0.145
��-0.142
vertisements-0.140
Token that
Token ID326
Feature activation-2.93e-02

Pos logit contributions
gments+0.462
 synd+0.461
sts+0.429
��+0.419
vertisements+0.414
Neg logit contributions
ctor-0.489
imates-0.442
 WARN-0.430
 Mehran-0.414
kel-0.409
Token it
Token ID340
Feature activation+3.09e-01

Pos logit contributions
ctor+0.058
imates+0.053
 WARN+0.051
 Mehran+0.049
kel+0.049
Neg logit contributions
gments-0.054
 synd-0.054
sts-0.050
��-0.049
vertisements-0.048